BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015682
(402 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359479713|ref|XP_003632343.1| PREDICTED: putative nuclease HARBI1-like [Vitis vinifera]
Length = 400
Score = 644 bits (1661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/372 (80%), Positives = 330/372 (88%)
Query: 27 AAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLV 86
P +P +++SDWW SFW KNS PG+ IP+DE EGFK+FFRVSKKTFDYICSLVR+DLV
Sbjct: 25 VVPIEPRATESDWWDSFWQKNSPIPGSAIPTDEAEGFKFFFRVSKKTFDYICSLVRQDLV 84
Query: 87 SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALE 146
SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG AFGVGQSTVSQVTWRFIEA+E
Sbjct: 85 SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFIEAVE 144
Query: 147 ERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY 206
ERAKHHL+WPD NRMEEIKSKFE S+GL NCCGAIDATHIIMTLPAVQTSDDWCDQENNY
Sbjct: 145 ERAKHHLRWPDFNRMEEIKSKFETSYGLSNCCGAIDATHIIMTLPAVQTSDDWCDQENNY 204
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELR 266
SM +QG+VD EMRF+DIVTGWPGGM VSRLLK +GF++ CEAG+RLNGNVRI SE E+R
Sbjct: 205 SMFLQGVVDDEMRFLDIVTGWPGGMTVSRLLKCTGFYRNCEAGERLNGNVRILSEGAEIR 264
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
EY+VGGVGYPLL WLITP E N + +SM F + HEA RSLAV+AF QLKG WRILSKVM
Sbjct: 265 EYVVGGVGYPLLPWLITPCEYNDIPSSMSGFKTNHEAARSLAVRAFSQLKGTWRILSKVM 324
Query: 327 WRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRE 386
WRPDKRKLPSIILVCCLLHNIIIDS D+L PDVALS HHD GYG+QCCKQVDP+G+ RE
Sbjct: 325 WRPDKRKLPSIILVCCLLHNIIIDSKDKLQPDVALSSHHDPGYGQQCCKQVDPLGKAMRE 384
Query: 387 NLEKHLQHNQEK 398
N+ KHLQH +EK
Sbjct: 385 NIAKHLQHGKEK 396
>gi|224062361|ref|XP_002300822.1| predicted protein [Populus trichocarpa]
gi|222842548|gb|EEE80095.1| predicted protein [Populus trichocarpa]
Length = 397
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/371 (82%), Positives = 331/371 (89%), Gaps = 2/371 (0%)
Query: 26 SAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDL 85
++AP DP++ DSDWW +FW KNSS+PG IPSDEEEGFKYFFR+SKKTF+YICSLVREDL
Sbjct: 29 TSAPIDPSAIDSDWWDTFWLKNSSSPG--IPSDEEEGFKYFFRISKKTFEYICSLVREDL 86
Query: 86 VSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL 145
+SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG +FGVGQSTVSQVTWRFIEAL
Sbjct: 87 ISRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGASFGVGQSTVSQVTWRFIEAL 146
Query: 146 EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENN 205
EERAK HLKWPDS+RMEEIK KFE FGL NCCGAIDATHIIMTLPAV+TSDDWCD+E N
Sbjct: 147 EERAKRHLKWPDSDRMEEIKCKFETLFGLPNCCGAIDATHIIMTLPAVETSDDWCDREKN 206
Query: 206 YSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVEL 265
YSM +QGIVDHEMRF+DIVTGWPGGM V RLLK SGFFKLCE G+RLNGN R + E+
Sbjct: 207 YSMFLQGIVDHEMRFLDIVTGWPGGMTVPRLLKCSGFFKLCENGERLNGNSRNLFQGAEI 266
Query: 266 REYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
REYIVGGVGYPLL WLITP + G+S SM TFN+ HEA R LAV++F+ LKG WRILSKV
Sbjct: 267 REYIVGGVGYPLLPWLITPEVSEGISGSMSTFNARHEAARLLAVRSFMHLKGSWRILSKV 326
Query: 326 MWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTR 385
MWRPDKRKLPSIILVCCLLHNIIID GDQLHPDVALS HHDSGYGEQCCKQVDP G+ TR
Sbjct: 327 MWRPDKRKLPSIILVCCLLHNIIIDCGDQLHPDVALSGHHDSGYGEQCCKQVDPSGKITR 386
Query: 386 ENLEKHLQHNQ 396
ENL KHLQHN+
Sbjct: 387 ENLAKHLQHNK 397
>gi|356572216|ref|XP_003554266.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 392
Score = 602 bits (1552), Expect = e-169, Method: Compositional matrix adjust.
Identities = 289/371 (77%), Positives = 314/371 (84%), Gaps = 4/371 (1%)
Query: 25 TSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVRED 84
S P +P +S++DWW SFW KNS+ PG T+P DE EGFKYFFRVSK TF+YICSLVRED
Sbjct: 23 VSMVPIEPRTSETDWWDSFWHKNSTVPGYTVPGDEAEGFKYFFRVSKTTFEYICSLVRED 82
Query: 85 LVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA 144
L+SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG AFGVGQSTVSQVTWRFIEA
Sbjct: 83 LISRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFIEA 142
Query: 145 LEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQEN 204
LEERAKHHL WPD NRM+EIK FE S+GL NCCGAIDATHI+MTLPAVQTSDDWCDQE
Sbjct: 143 LEERAKHHLNWPDFNRMQEIKLGFEVSYGLPNCCGAIDATHIMMTLPAVQTSDDWCDQEK 202
Query: 205 NYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVE 264
NYSML+QGIVDHEMRFIDI+TG PGGM+VSRLLK S FFKL E G+RLNGNV+ +V
Sbjct: 203 NYSMLLQGIVDHEMRFIDIMTGLPGGMSVSRLLKCSAFFKLSENGERLNGNVKAFGGDV- 261
Query: 265 LREYIVGGVGYPLLSWLITPYET---NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRI 321
+REY+VGG YPLL WL+TPYET NG+S S FN H A + LAV+AF LKG WRI
Sbjct: 262 IREYVVGGCSYPLLPWLMTPYETNGANGISVSQSIFNHKHGAAKLLAVRAFSLLKGSWRI 321
Query: 322 LSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMG 381
LSKVMWRPDKRKLPSIIL CCLLHNIIID GD L DVALS HHDSGY EQCCKQVDP+G
Sbjct: 322 LSKVMWRPDKRKLPSIILTCCLLHNIIIDCGDTLQQDVALSGHHDSGYQEQCCKQVDPLG 381
Query: 382 RTTRENLEKHL 392
RT R+NL KHL
Sbjct: 382 RTMRDNLAKHL 392
>gi|356504953|ref|XP_003521257.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 394
Score = 600 bits (1547), Expect = e-169, Method: Compositional matrix adjust.
Identities = 287/373 (76%), Positives = 314/373 (84%), Gaps = 4/373 (1%)
Query: 25 TSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVRED 84
S P +P +S++DWW SFW KNS+ PG T+P DE EGFKYFFRVSK TF+YICSLVRED
Sbjct: 23 VSLVPVEPRTSETDWWDSFWHKNSTVPGYTVPGDEAEGFKYFFRVSKTTFEYICSLVRED 82
Query: 85 LVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA 144
L+SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG AFGVGQSTVSQVTWRFIEA
Sbjct: 83 LISRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFIEA 142
Query: 145 LEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQEN 204
LEERAKHHL WPD N+M+EIK FE S+GL NCCGAIDATHI+MTLPAVQTSDDWCDQE
Sbjct: 143 LEERAKHHLNWPDFNQMQEIKLGFEASYGLPNCCGAIDATHIMMTLPAVQTSDDWCDQEK 202
Query: 205 NYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVE 264
NYSML+QGIVDHEMRF+DIVTG PGGM+VSRLLK S FFKL E G+RLNGNV+ +V
Sbjct: 203 NYSMLLQGIVDHEMRFVDIVTGLPGGMSVSRLLKCSAFFKLSENGERLNGNVKAFGGDV- 261
Query: 265 LREYIVGGVGYPLLSWLITPYET---NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRI 321
+REY+VGG YPLL WL+TPYET NG+S TFN H A + LAV+AF LKG WRI
Sbjct: 262 IREYVVGGCSYPLLPWLMTPYETNGANGVSVPQSTFNHKHGAAKLLAVRAFSLLKGSWRI 321
Query: 322 LSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMG 381
LSKVMWRPDKRKLPSIIL CCLLHNIIID GD L DVALS HHDSGY EQCCKQVDP+G
Sbjct: 322 LSKVMWRPDKRKLPSIILTCCLLHNIIIDCGDTLQQDVALSGHHDSGYQEQCCKQVDPLG 381
Query: 382 RTTRENLEKHLQH 394
RT R+NL HL+
Sbjct: 382 RTMRDNLANHLRQ 394
>gi|255567510|ref|XP_002524734.1| conserved hypothetical protein [Ricinus communis]
gi|223535918|gb|EEF37577.1| conserved hypothetical protein [Ricinus communis]
Length = 363
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 284/350 (81%), Positives = 312/350 (89%)
Query: 53 ATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALR 112
A+IP DEEE FKYFFRVSKKTF+YICSLVREDL+SRPPSGLINIEGRLLSVEKQVAIALR
Sbjct: 14 ASIPCDEEEAFKYFFRVSKKTFEYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALR 73
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF 172
RLASGESQVSVG AFGVGQSTVSQVTWRFIEALEERA+HHLKWPDSNRME+IKS+FE F
Sbjct: 74 RLASGESQVSVGAAFGVGQSTVSQVTWRFIEALEERARHHLKWPDSNRMEDIKSRFEALF 133
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
GL NCCGAIDATHIIMTLPAV+TSD WCDQENNYSM +QG+VDHEMRF+DIVTGWPGGM
Sbjct: 134 GLPNCCGAIDATHIIMTLPAVETSDHWCDQENNYSMFLQGVVDHEMRFLDIVTGWPGGMT 193
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
VSRLLK SGFFK CE+G+ LNG +R SE +E+REYIVGGV YPLL WLITP ++ S
Sbjct: 194 VSRLLKCSGFFKRCESGECLNGKLRKLSEGMEIREYIVGGVCYPLLPWLITPDGSDEHSD 253
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ FN++H R LAVK+FLQLKG WRIL+KVMWRPDKRKLPSIILVCCLLHNIIID+G
Sbjct: 254 PLSNFNTMHAVARMLAVKSFLQLKGSWRILNKVMWRPDKRKLPSIILVCCLLHNIIIDNG 313
Query: 353 DQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQEKDLVK 402
DQLHPDVALS HHDSGYGEQ CKQ+DP+GRT RENL K+LQH +E+ L K
Sbjct: 314 DQLHPDVALSGHHDSGYGEQFCKQIDPLGRTLRENLAKYLQHMKERSLSK 363
>gi|449443949|ref|XP_004139738.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449528120|ref|XP_004171054.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 396
Score = 595 bits (1535), Expect = e-167, Method: Compositional matrix adjust.
Identities = 296/398 (74%), Positives = 328/398 (82%), Gaps = 6/398 (1%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEE 60
MAP KKSK K K +KKLKK K+ S P +P +SD DWW FW KN S PG +DE
Sbjct: 1 MAPTKKSK--KRNKDSKKLKKRKNLSVVPMEPRASDPDWWEIFWHKNCS-PGR---NDEA 54
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
GFKYFFR SKKTFDYICSLVREDL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASGESQ
Sbjct: 55 AGFKYFFRTSKKTFDYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGESQ 114
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
VSVG AFGVGQSTVSQVTWRF+EALE+RAKHHL+WP S+R+EEIKS+FE FGL NCCGA
Sbjct: 115 VSVGAAFGVGQSTVSQVTWRFVEALEQRAKHHLQWPSSSRLEEIKSQFEAFFGLPNCCGA 174
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
IDATHIIMTLPAVQTSDDWCD NNYSM +QGIVDH+MRFIDIVTGWPG M SRLLK S
Sbjct: 175 IDATHIIMTLPAVQTSDDWCDTNNNYSMFLQGIVDHQMRFIDIVTGWPGAMTTSRLLKCS 234
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
FKLC+AG+RLNGNV+ S E+REY+VGGVGYPLL WLITPYE + LS FN++
Sbjct: 235 RIFKLCDAGERLNGNVKKFSGGSEIREYLVGGVGYPLLPWLITPYENDNLSPLNFNFNAV 294
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVA 360
A + LAV+AF QLKG WRIL+KVMWRPDKRKLPSIILVCCLL NIIID+GD+L PDVA
Sbjct: 295 QGAAKLLAVRAFSQLKGSWRILNKVMWRPDKRKLPSIILVCCLLQNIIIDNGDELQPDVA 354
Query: 361 LSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQEK 398
LS HHD GY E CCKQ+DP+G RENL KHL N+E+
Sbjct: 355 LSGHHDLGYQEHCCKQLDPLGNNLRENLAKHLHQNKER 392
>gi|18412695|ref|NP_567144.1| uncharacterized protein [Arabidopsis thaliana]
gi|13878075|gb|AAK44115.1|AF370300_1 unknown protein [Arabidopsis thaliana]
gi|17104745|gb|AAL34261.1| unknown protein [Arabidopsis thaliana]
gi|332646938|gb|AEE80459.1| uncharacterized protein [Arabidopsis thaliana]
Length = 396
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 266/368 (72%), Positives = 304/368 (82%), Gaps = 3/368 (0%)
Query: 25 TSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVRED 84
+A P DP + D DWW +FW +NSS ++PSDE+ FK+FFR SK TF YICSLVRED
Sbjct: 30 VNAVPLDPEAIDCDWWDTFWLRNSS---PSVPSDEDYAFKHFFRASKTTFSYICSLVRED 86
Query: 85 LVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA 144
L+SRPPSGLINIEGRLLSVEKQVAIALRRLASG+SQVSVG AFGVGQSTVSQVTWRFIEA
Sbjct: 87 LISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEA 146
Query: 145 LEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQEN 204
LEERAKHHL+WPDS+R+EEIKSKFEE +GL NCCGAID THIIMTLPAVQ SDDWCDQE
Sbjct: 147 LEERAKHHLRWPDSDRIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDWCDQEK 206
Query: 205 NYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVE 264
NYSM +QG+ DHEMRF+++VTGWPGGM VS+LLKFSGFFKLCE Q L+GN + S+ +
Sbjct: 207 NYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQ 266
Query: 265 LREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
+REY+VGG+ YPLL WLITP++++ S SM FN HE RS+A AF QLKG WRILSK
Sbjct: 267 IREYVVGGISYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSK 326
Query: 325 VMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTT 384
VMWRPD+RKLPSIILVCCLLHNIIID GD L DV LS HHDSGY ++ CKQ +P+G
Sbjct: 327 VMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQEDVPLSGHHDSGYADRYCKQTEPLGSEL 386
Query: 385 RENLEKHL 392
R L +HL
Sbjct: 387 RGCLTEHL 394
>gi|7523409|emb|CAB86428.1| putative protein [Arabidopsis thaliana]
Length = 463
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 266/397 (67%), Positives = 304/397 (76%), Gaps = 32/397 (8%)
Query: 25 TSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVRED 84
+A P DP + D DWW +FW +NSS ++PSDE+ FK+FFR SK TF YICSLVRED
Sbjct: 30 VNAVPLDPEAIDCDWWDTFWLRNSS---PSVPSDEDYAFKHFFRASKTTFSYICSLVRED 86
Query: 85 LVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQV------- 137
L+SRPPSGLINIEGRLLSVEKQVAIALRRLASG+SQVSVG AFGVGQSTVSQV
Sbjct: 87 LISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVPINLYSA 146
Query: 138 ----------------------TWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
TWRFIEALEERAKHHL+WPDS+R+EEIKSKFEE +GL
Sbjct: 147 PYFPDFNIYMLYPDTIHLFPQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEMYGLP 206
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NCCGAID THIIMTLPAVQ SDDWCDQE NYSM +QG+ DHEMRF+++VTGWPGGM VS+
Sbjct: 207 NCCGAIDTTHIIMTLPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSK 266
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
LLKFSGFFKLCE Q L+GN + S+ ++REY+VGG+ YPLL WLITP++++ S SM
Sbjct: 267 LLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSMV 326
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
FN HE RS+A AF QLKG WRILSKVMWRPD+RKLPSIILVCCLLHNIIID GD L
Sbjct: 327 AFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYL 386
Query: 356 HPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
DV LS HHDSGY ++ CKQ +P+G R L +HL
Sbjct: 387 QEDVPLSGHHDSGYADRYCKQTEPLGSELRGCLTEHL 423
>gi|115470825|ref|NP_001059011.1| Os07g0175100 [Oryza sativa Japonica Group]
gi|113610547|dbj|BAF20925.1| Os07g0175100, partial [Oryza sativa Japonica Group]
Length = 436
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 256/379 (67%), Positives = 299/379 (78%), Gaps = 4/379 (1%)
Query: 18 KLKKSKSTSAAPKDP---NSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTF 74
KLK K AP P D++WW +F K+S G + PSDE E F+YFFR S++TF
Sbjct: 58 KLKIVKYGGGAPPLPPELRGLDTEWWYTFLHKHSEL-GLSAPSDEGEAFRYFFRTSRRTF 116
Query: 75 DYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTV 134
DYICS+VREDL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQVSVG AFGVGQSTV
Sbjct: 117 DYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQVSVGAAFGVGQSTV 176
Query: 135 SQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
SQVTWRFIE++EERA+HHL WP RME+IK++FE GL NCCGAIDATHIIMTLPAV+
Sbjct: 177 SQVTWRFIESMEERARHHLVWPGQERMEQIKARFEAESGLPNCCGAIDATHIIMTLPAVE 236
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG 254
+S+DWCD NYSM +QGIVD EMRFIDIVTGWPG M SRLLK SGFFK C+AG RL+G
Sbjct: 237 SSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMMFSRLLKCSGFFKHCDAGTRLDG 296
Query: 255 NVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQ 314
V +S+E E+REYIVG YPLL WL+TPYE LSA M +FN+ +A R+L +A +
Sbjct: 297 PVMVSAENGEIREYIVGNNCYPLLPWLMTPYEGESLSAPMASFNARQKAARTLGPRALSR 356
Query: 315 LKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCC 374
LKG WRIL+KVMWRPDK KLPSIILVCCLLHNIIID D+L PDV L DHHD+GY E+ C
Sbjct: 357 LKGSWRILNKVMWRPDKNKLPSIILVCCLLHNIIIDCEDELLPDVQLPDHHDTGYSEEKC 416
Query: 375 KQVDPMGRTTRENLEKHLQ 393
+QVDP G+ R+ + +LQ
Sbjct: 417 EQVDPNGKIMRDVITGYLQ 435
>gi|25553597|dbj|BAC24862.1| unknown protein [Oryza sativa Japonica Group]
Length = 441
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 256/379 (67%), Positives = 299/379 (78%), Gaps = 4/379 (1%)
Query: 18 KLKKSKSTSAAPKDP---NSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTF 74
KLK K AP P D++WW +F K+S G + PSDE E F+YFFR S++TF
Sbjct: 63 KLKIVKYGGGAPPLPPELRGLDTEWWYTFLHKHSEL-GLSAPSDEGEAFRYFFRTSRRTF 121
Query: 75 DYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTV 134
DYICS+VREDL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQVSVG AFGVGQSTV
Sbjct: 122 DYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQVSVGAAFGVGQSTV 181
Query: 135 SQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
SQVTWRFIE++EERA+HHL WP RME+IK++FE GL NCCGAIDATHIIMTLPAV+
Sbjct: 182 SQVTWRFIESMEERARHHLVWPGQERMEQIKARFEAESGLPNCCGAIDATHIIMTLPAVE 241
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG 254
+S+DWCD NYSM +QGIVD EMRFIDIVTGWPG M SRLLK SGFFK C+AG RL+G
Sbjct: 242 SSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMMFSRLLKCSGFFKHCDAGTRLDG 301
Query: 255 NVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQ 314
V +S+E E+REYIVG YPLL WL+TPYE LSA M +FN+ +A R+L +A +
Sbjct: 302 PVMVSAENGEIREYIVGNNCYPLLPWLMTPYEGESLSAPMASFNARQKAARTLGPRALSR 361
Query: 315 LKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCC 374
LKG WRIL+KVMWRPDK KLPSIILVCCLLHNIIID D+L PDV L DHHD+GY E+ C
Sbjct: 362 LKGSWRILNKVMWRPDKNKLPSIILVCCLLHNIIIDCEDELLPDVQLPDHHDTGYSEEKC 421
Query: 375 KQVDPMGRTTRENLEKHLQ 393
+QVDP G+ R+ + +LQ
Sbjct: 422 EQVDPNGKIMRDVITGYLQ 440
>gi|222636526|gb|EEE66658.1| hypothetical protein OsJ_23285 [Oryza sativa Japonica Group]
Length = 397
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 256/379 (67%), Positives = 299/379 (78%), Gaps = 4/379 (1%)
Query: 18 KLKKSKSTSAAPKDP---NSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTF 74
KLK K AP P D++WW +F K+S G + PSDE E F+YFFR S++TF
Sbjct: 19 KLKIVKYGGGAPPFPPELRGLDTEWWYTFLHKHSEL-GLSAPSDEGEAFRYFFRTSRRTF 77
Query: 75 DYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTV 134
DYICS+VREDL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQVSVG AFGVGQSTV
Sbjct: 78 DYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQVSVGAAFGVGQSTV 137
Query: 135 SQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
SQVTWRFIE++EERA+HHL WP RME+IK++FE GL NCCGAIDATHIIMTLPAV+
Sbjct: 138 SQVTWRFIESMEERARHHLVWPGQERMEQIKARFEAESGLPNCCGAIDATHIIMTLPAVE 197
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG 254
+S+DWCD NYSM +QGIVD EMRFIDIVTGWPG M SRLLK SGFFK C+AG RL+G
Sbjct: 198 SSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMMFSRLLKCSGFFKHCDAGTRLDG 257
Query: 255 NVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQ 314
V +S+E E+REYIVG YPLL WL+TPYE LSA M +FN+ +A R+L +A +
Sbjct: 258 PVMVSAENGEIREYIVGNNCYPLLPWLMTPYEGESLSAPMASFNARQKAARTLGPRALSR 317
Query: 315 LKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCC 374
LKG WRIL+KVMWRPDK KLPSIILVCCLLHNIIID D+L PDV L DHHD+GY E+ C
Sbjct: 318 LKGSWRILNKVMWRPDKNKLPSIILVCCLLHNIIIDCEDELLPDVQLPDHHDTGYSEEKC 377
Query: 375 KQVDPMGRTTRENLEKHLQ 393
+QVDP G+ R+ + +LQ
Sbjct: 378 EQVDPNGKIMRDVITGYLQ 396
>gi|218199172|gb|EEC81599.1| hypothetical protein OsI_25082 [Oryza sativa Indica Group]
Length = 397
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 256/379 (67%), Positives = 299/379 (78%), Gaps = 4/379 (1%)
Query: 18 KLKKSKSTSAAPKDP---NSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTF 74
KLK K AP P D++WW +F K+S G + PSDE E F+YFFR S++TF
Sbjct: 19 KLKIVKYGGGAPPLPPELRGLDTEWWYTFLHKHSEL-GLSAPSDEGEAFRYFFRTSRRTF 77
Query: 75 DYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTV 134
DYICS+VREDL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQVSVG AFGVGQSTV
Sbjct: 78 DYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQVSVGAAFGVGQSTV 137
Query: 135 SQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
SQVTWRFIE++EERA+HHL WP RME+IK++FE GL NCCGAIDATHIIMTLPAV+
Sbjct: 138 SQVTWRFIESMEERARHHLVWPGQERMEQIKARFEAESGLPNCCGAIDATHIIMTLPAVE 197
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG 254
+S+DWCD NYSM +QGIVD EMRFIDIVTGWPG M SRLLK SGFFK C+AG RL+G
Sbjct: 198 SSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMMFSRLLKCSGFFKHCDAGTRLDG 257
Query: 255 NVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQ 314
V +S+E E+REYIVG YPLL WL+TPYE LSA M +FN+ +A R+L +A +
Sbjct: 258 PVMVSAENGEIREYIVGNNCYPLLPWLMTPYEGESLSAPMASFNARQKAARTLGPRALSR 317
Query: 315 LKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCC 374
LKG WRIL+KVMWRPDK KLPSIILVCCLLHNIIID D+L PDV L DHHD+GY E+ C
Sbjct: 318 LKGSWRILNKVMWRPDKNKLPSIILVCCLLHNIIIDCEDELLPDVQLPDHHDTGYSEEKC 377
Query: 375 KQVDPMGRTTRENLEKHLQ 393
+QVDP G+ R+ + +LQ
Sbjct: 378 EQVDPNGKIMRDVITGYLQ 396
>gi|326489887|dbj|BAJ94017.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 253/399 (63%), Positives = 303/399 (75%), Gaps = 8/399 (2%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNS-------STPGA 53
M ++K+K KV K+ K S P D D++WW F +K++ S A
Sbjct: 43 MNDSSEAKTKDASKV-KRAKSGGSLVVLPADLRGPDTEWWYIFLSKDAELHKDAESGGRA 101
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
+PSD+EE F+YFFR S++TFDYICS+VREDL+SRPPSGLINIEGRLLSVEKQVAIA+RR
Sbjct: 102 PVPSDDEEAFRYFFRTSRRTFDYICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRR 161
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFG 173
LASG+SQVSVG AFGVGQSTVSQVTWRFIE++EERA+HHL WP+ RMEEIK++ E G
Sbjct: 162 LASGDSQVSVGSAFGVGQSTVSQVTWRFIESMEERARHHLVWPNQQRMEEIKARLEAVSG 221
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
L NCCGAIDATHIIMTLPAV++S+DWCD NYSM +QGIVD EMRFIDIVTGWPG M
Sbjct: 222 LPNCCGAIDATHIIMTLPAVESSEDWCDPAKNYSMFLQGIVDDEMRFIDIVTGWPGSMTF 281
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
SRLLK SGF+KLCEAG+RLNG VR S E+ E+RE+IVG + YPLL WL+TPY+ LS
Sbjct: 282 SRLLKCSGFYKLCEAGKRLNGPVRTSGEDSEIREFIVGDMCYPLLPWLMTPYQGESLSTP 341
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
M FN+ + R L +A +LKG W+IL KVMWRPDK KLPSIILVCCLLHNII+D D
Sbjct: 342 MVNFNTQQKTARMLGTRALARLKGSWKILHKVMWRPDKNKLPSIILVCCLLHNIILDCND 401
Query: 354 QLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+L PD+ L DHHD+GY + C+Q +P G+ RE + HL
Sbjct: 402 ELLPDIQLPDHHDNGYNGENCQQENPNGKGIREIITGHL 440
>gi|242047560|ref|XP_002461526.1| hypothetical protein SORBIDRAFT_02g004140 [Sorghum bicolor]
gi|241924903|gb|EER98047.1| hypothetical protein SORBIDRAFT_02g004140 [Sorghum bicolor]
Length = 436
Score = 529 bits (1363), Expect = e-148, Method: Compositional matrix adjust.
Identities = 238/368 (64%), Positives = 292/368 (79%)
Query: 27 AAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLV 86
A P + + D++WW +F K + +P+DE E F++FFR S+ TFDYICS+VREDL+
Sbjct: 60 ALPPELRALDTEWWYTFLNKQQTELEHVVPTDEGEAFRHFFRTSRSTFDYICSIVREDLI 119
Query: 87 SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALE 146
SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQVSVG+AFGVGQSTVSQVTWRFIE++E
Sbjct: 120 SRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQVSVGIAFGVGQSTVSQVTWRFIESME 179
Query: 147 ERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY 206
+RA+HHL WPD +R+EEIK+ EE +GL NCCG +D TH+ MTLPAV++S+DWCD NY
Sbjct: 180 DRARHHLSWPDQDRLEEIKAVLEEVYGLPNCCGGVDGTHVTMTLPAVESSEDWCDHAKNY 239
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELR 266
SM +QG+VD +MRFIDIVTGWPG M SRL+K+SGFFKLCEAG+RLNG+V++S+E E+R
Sbjct: 240 SMFLQGVVDDQMRFIDIVTGWPGSMTFSRLMKYSGFFKLCEAGERLNGSVKVSAEGAEIR 299
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
EY+ G YPLL WL+TPYE LSA + FN +A R L A +LKG WRIL KVM
Sbjct: 300 EYVAGDSCYPLLPWLMTPYEGKNLSAPLQNFNVRQKAARLLGTNALARLKGSWRILHKVM 359
Query: 327 WRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRE 386
WRPDK KLPSIILVCCLLHNI+ID D+L P V L +HHD+GY + C+QVDP G+ RE
Sbjct: 360 WRPDKNKLPSIILVCCLLHNILIDCHDELLPSVELPEHHDTGYTRENCEQVDPNGKVMRE 419
Query: 387 NLEKHLQH 394
++ HLQH
Sbjct: 420 SITAHLQH 427
>gi|357111421|ref|XP_003557512.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 406
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 248/393 (63%), Positives = 303/393 (77%), Gaps = 6/393 (1%)
Query: 5 KKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNS-----STPGATIPSDE 59
+ SK K + KV K+ K + +A D D++WW +F +K+ S A +PSDE
Sbjct: 2 ESSKPKDSGKV-KRAKTGGAVAALESDLRGPDTEWWYAFLSKHKERQAKSGVTAPVPSDE 60
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E F+YFFR S+ TFDYICS+VR+DL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+S
Sbjct: 61 DEAFRYFFRTSRSTFDYICSIVRDDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDS 120
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCG 179
QVSVG AFGVGQSTVSQVTWRFIE++EERA++HL WPD RM++IK+ FE GL NCCG
Sbjct: 121 QVSVGAAFGVGQSTVSQVTWRFIESMEERARYHLAWPDQERMDDIKANFEVVSGLPNCCG 180
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
AIDATHI+MTLPAV +S+DWCD NYSM +QG+ DHEMRFIDIVTGWPG M RLLKF
Sbjct: 181 AIDATHIVMTLPAVDSSEDWCDHAKNYSMFLQGVFDHEMRFIDIVTGWPGSMTFPRLLKF 240
Query: 240 SGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNS 299
SGF+KLCEAG+RL+G ++S ++ ++RE+IVG V YPLL WL+TPYE LSA M FN+
Sbjct: 241 SGFYKLCEAGKRLDGPAQVSRDDAQIREFIVGDVCYPLLPWLMTPYEGENLSAQMVEFNA 300
Query: 300 LHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDV 359
+A R L +A +LKG WRIL KVMWRPDK KLPSIILVCCLLHNI+ID DQL P +
Sbjct: 301 RQKAARMLGTRALARLKGSWRILHKVMWRPDKNKLPSIILVCCLLHNILIDRQDQLLPSL 360
Query: 360 ALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
L DHHD+GY E C++ +P G+ RE + +HL
Sbjct: 361 ELPDHHDTGYTEVKCQEENPNGKVMREVITEHL 393
>gi|223943391|gb|ACN25779.1| unknown [Zea mays]
Length = 405
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 240/378 (63%), Positives = 294/378 (77%), Gaps = 1/378 (0%)
Query: 17 KKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDY 76
K ++ + A P + + D++WW +F K A +P DE E F++FFR S+KTFDY
Sbjct: 20 KIVRARGAPPALPPELRALDTEWWYAFLNKQVELEHA-VPPDEGEAFRHFFRTSRKTFDY 78
Query: 77 ICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQ 136
ICS+VREDL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQVSVG AFGVGQSTVSQ
Sbjct: 79 ICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQVSVGTAFGVGQSTVSQ 138
Query: 137 VTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTS 196
VTWRFIE++E+RA++HL WP R+E+IK+ E+++GL NCCG +DATH+IMTLPAV++S
Sbjct: 139 VTWRFIESMEDRARYHLAWPSQERLEQIKAVLEDAYGLPNCCGGVDATHVIMTLPAVESS 198
Query: 197 DDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV 256
DWCD NYSM +QGIVD +MRFIDIVTGWPG M SRL+K SGFFKLCEAG+RLNG+V
Sbjct: 199 ADWCDHAKNYSMFLQGIVDDQMRFIDIVTGWPGSMTFSRLMKCSGFFKLCEAGERLNGSV 258
Query: 257 RISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLK 316
++S+E E+REYI G YPLL WL+TP+E L A + FN+ +A R L A +LK
Sbjct: 259 KVSAEGAEIREYIAGDSCYPLLPWLMTPFEGKHLPAPLQNFNARQKAARLLGTNALARLK 318
Query: 317 GGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQ 376
G WRIL KVMWRPDK KLPSIILVCCLLHNI+ID D+L P V L +HHD+GY + C+Q
Sbjct: 319 GSWRILHKVMWRPDKNKLPSIILVCCLLHNILIDCHDELLPSVQLPEHHDTGYNRENCEQ 378
Query: 377 VDPMGRTTRENLEKHLQH 394
VDP G+ REN+ HLQH
Sbjct: 379 VDPNGKVMRENITAHLQH 396
>gi|414883714|tpg|DAA59728.1| TPA: hypothetical protein ZEAMMB73_550258 [Zea mays]
Length = 446
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 240/378 (63%), Positives = 294/378 (77%), Gaps = 1/378 (0%)
Query: 17 KKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDY 76
K ++ + A P + + D++WW +F K A +P DE E F++FFR S+KTFDY
Sbjct: 61 KIVRARGAPPALPPELRALDTEWWYAFLNKQVELEHA-VPPDEGEAFRHFFRTSRKTFDY 119
Query: 77 ICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQ 136
ICS+VREDL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQVSVG AFGVGQSTVSQ
Sbjct: 120 ICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQVSVGTAFGVGQSTVSQ 179
Query: 137 VTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTS 196
VTWRFIE++E+RA++HL WP R+E+IK+ E+++GL NCCG +DATH+IMTLPAV++S
Sbjct: 180 VTWRFIESMEDRARYHLAWPSQERLEQIKAVLEDAYGLPNCCGGVDATHVIMTLPAVESS 239
Query: 197 DDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV 256
DWCD NYSM +QGIVD +MRFIDIVTGWPG M SRL+K SGFFKLCEAG+RLNG+V
Sbjct: 240 ADWCDHAKNYSMFLQGIVDDQMRFIDIVTGWPGSMTFSRLMKCSGFFKLCEAGERLNGSV 299
Query: 257 RISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLK 316
++S+E E+REYI G YPLL WL+TP+E L A + FN+ +A R L A +LK
Sbjct: 300 KVSAEGAEIREYIAGDSCYPLLPWLMTPFEGKHLPAPLQNFNARQKAARLLGTNALARLK 359
Query: 317 GGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQ 376
G WRIL KVMWRPDK KLPSIILVCCLLHNI+ID D+L P V L +HHD+GY + C+Q
Sbjct: 360 GSWRILHKVMWRPDKNKLPSIILVCCLLHNILIDCHDELLPSVQLPEHHDTGYNRENCEQ 419
Query: 377 VDPMGRTTRENLEKHLQH 394
VDP G+ REN+ HLQH
Sbjct: 420 VDPNGKVMRENITAHLQH 437
>gi|226501860|ref|NP_001141503.1| uncharacterized protein LOC100273615 [Zea mays]
gi|194704850|gb|ACF86509.1| unknown [Zea mays]
Length = 405
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 239/378 (63%), Positives = 294/378 (77%), Gaps = 1/378 (0%)
Query: 17 KKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDY 76
K ++ + A P + + D++WW +F K A +P DE E F++FFR S++TFDY
Sbjct: 20 KIVRARGAPPALPPELRALDTEWWYAFLNKQVELEHA-VPPDEGEAFRHFFRTSRRTFDY 78
Query: 77 ICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQ 136
ICS+VREDL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQVSVG AFGVGQSTVSQ
Sbjct: 79 ICSIVREDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQVSVGTAFGVGQSTVSQ 138
Query: 137 VTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTS 196
VTWRFIE++E+RA++HL WP R+E+IK+ E+++GL NCCG +DATH+IMTLPAV++S
Sbjct: 139 VTWRFIESMEDRARYHLAWPSQERLEQIKAVLEDAYGLPNCCGGVDATHVIMTLPAVESS 198
Query: 197 DDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV 256
DWCD NYSM +QGIVD +MRFIDIVTGWPG M SRL+K SGFFKLCEAG+RLNG+V
Sbjct: 199 ADWCDHAKNYSMFLQGIVDDQMRFIDIVTGWPGSMTFSRLMKCSGFFKLCEAGERLNGSV 258
Query: 257 RISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLK 316
++S+E E+REYI G YPLL WL+TP+E L A + FN+ +A R L A +LK
Sbjct: 259 KVSAEGAEIREYIAGDSCYPLLPWLMTPFEGKHLPAPLQNFNARQKAARLLGTNALARLK 318
Query: 317 GGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQ 376
G WRIL KVMWRPDK KLPSIILVCCLLHNI+ID D+L P V L +HHD+GY + C+Q
Sbjct: 319 GSWRILHKVMWRPDKNKLPSIILVCCLLHNILIDCHDELLPSVQLPEHHDTGYNRENCEQ 378
Query: 377 VDPMGRTTRENLEKHLQH 394
VDP G+ REN+ HLQH
Sbjct: 379 VDPNGKVMRENITAHLQH 396
>gi|357119271|ref|XP_003561367.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 471
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 243/391 (62%), Positives = 301/391 (76%), Gaps = 6/391 (1%)
Query: 7 SKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNS-----STPGATIPSDEEE 61
SK + + KV K+ K + +A D D++WW +F +K+ S A++PSDEEE
Sbjct: 69 SKPEDSGKV-KRAKPGGAVAALEPDLRGPDTEWWYAFLSKHKERQAKSGLTASVPSDEEE 127
Query: 62 GFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
F+YFFR S+ TFDY+CS+VR+DL+SRPPSGLINIEGRLLSVEKQVAIA+RRLASG+SQV
Sbjct: 128 AFRYFFRTSRSTFDYVCSIVRDDLISRPPSGLINIEGRLLSVEKQVAIAMRRLASGDSQV 187
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
SVG AFGVGQSTVSQVTWRFIE++EERA+HHL WPD RM++IK+ FE GL NCCGAI
Sbjct: 188 SVGAAFGVGQSTVSQVTWRFIESMEERARHHLVWPDQERMDDIKANFEVVSGLPNCCGAI 247
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
DATHI+M LPAV++S+ W D NNYSM +QGIVDHEMRFID+VTGWPG S+LLK S
Sbjct: 248 DATHIVMMLPAVESSEYWYDHANNYSMFLQGIVDHEMRFIDVVTGWPGSTTFSQLLKLSE 307
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLH 301
F+KLCEAG+RL+G ++S E++E+RE+IVG V YPLL WL+TPY+ LSA M FN+
Sbjct: 308 FYKLCEAGKRLDGPAQVSREDLEIREFIVGDVSYPLLPWLMTPYQGESLSALMVDFNARQ 367
Query: 302 EATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVAL 361
+A R L +A +LKG WRIL KVMWRPDK KLPSIILVCCLLHNI+ID DQL P + L
Sbjct: 368 KAARMLGTRALARLKGSWRILQKVMWRPDKNKLPSIILVCCLLHNIMIDRQDQLLPSLEL 427
Query: 362 SDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+HHD+GY E C++ + G+ RE + +HL
Sbjct: 428 PEHHDTGYTEVNCQKKNRNGKVMREVITEHL 458
>gi|357119278|ref|XP_003561370.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 385
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 230/375 (61%), Positives = 287/375 (76%), Gaps = 7/375 (1%)
Query: 18 KLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYI 77
K+K++K P D +W +F +K++ A +PSDEEEGF+YFFR S+ TFDYI
Sbjct: 15 KVKRAKPDLRGP------DMEWRYAFLSKHAERQAA-VPSDEEEGFRYFFRTSRSTFDYI 67
Query: 78 CSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQV 137
CS+VR+DL R PSGLI I+GRLLSVEKQVAIA+RRLASG+SQVSVG AFGVG STVSQV
Sbjct: 68 CSIVRDDLTWRLPSGLIRIDGRLLSVEKQVAIAMRRLASGDSQVSVGAAFGVGPSTVSQV 127
Query: 138 TWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSD 197
TWRFIE++E RA+HHL WPD M++IK+ FE GL NCCGAIDATHI+MTLPAV++S+
Sbjct: 128 TWRFIESMEGRAQHHLAWPDQETMDDIKANFEVVSGLPNCCGAIDATHILMTLPAVESSE 187
Query: 198 DWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR 257
+W D NYSM++QGI DHEMRFIDI TGWPGGM VS+LL+FSGF+K CEAG+RL+G +
Sbjct: 188 NWRDHAGNYSMILQGIFDHEMRFIDICTGWPGGMTVSQLLEFSGFYKFCEAGERLDGPSQ 247
Query: 258 ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKG 317
+S E ++RE+IVG + YPLL WL+TPYE L+ M FN+ +A R L +A +LKG
Sbjct: 248 VSREGAQIREFIVGDMCYPLLPWLMTPYEGESLAVPMVDFNARQKAARMLGTRALARLKG 307
Query: 318 GWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV 377
WRIL+KVMWRPDK KLPSIILVCCLLHNI+ID DQL P + L HHD+GY E ++
Sbjct: 308 SWRILNKVMWRPDKHKLPSIILVCCLLHNIMIDRQDQLLPSLELPAHHDTGYNEVIFQKE 367
Query: 378 DPMGRTTRENLEKHL 392
+P+G+ RE ++L
Sbjct: 368 NPIGKVMREVTTEYL 382
>gi|357119276|ref|XP_003561369.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 433
Score = 470 bits (1209), Expect = e-130, Method: Compositional matrix adjust.
Identities = 226/358 (63%), Positives = 275/358 (76%), Gaps = 6/358 (1%)
Query: 28 APKDPNSSDSDWWPSFWAKNS-----STPGATIPSDEEEGFKYFFRVSKKTFDYICSLVR 82
A D D +WW +F +K++ S A +PSDEEE F+YFFR S+ TF YICS+VR
Sbjct: 74 AKPDLRGPDMEWWYAFLSKHAERQADSGVTAPVPSDEEEAFRYFFRTSRSTFHYICSIVR 133
Query: 83 EDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFI 142
+DL SRPPSGLI IEGRLLS+EKQVAIA+RRLASG+S+VSVG AFGVGQ TVSQVTWR I
Sbjct: 134 DDLTSRPPSGLIRIEGRLLSMEKQVAIAMRRLASGDSRVSVGAAFGVGQPTVSQVTWRLI 193
Query: 143 EALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQ 202
E++EERA+HHL WPD RM++IK+ FE GL NCCGAIDATHI++TLPAV++S +W D
Sbjct: 194 ESMEERARHHLAWPDQERMDDIKANFEVVSGLPNCCGAIDATHIVITLPAVESSRNWRDH 253
Query: 203 ENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEE 262
NYSM++QGI DHEMRFIDI TGWPGGM VS+LL+FS F++LCEAG+ L+G ++S E
Sbjct: 254 AWNYSMILQGIFDHEMRFIDICTGWPGGMTVSQLLEFSRFYELCEAGECLDGPAQVSREG 313
Query: 263 VELREYIVGGVGYPLLSWLITPYE-TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRI 321
++REYIVG + YPLLSWL+TPYE L+ M FN+ +A R L +A +LKG WRI
Sbjct: 314 AQIREYIVGDMCYPLLSWLMTPYEGERSLADPMVDFNARQKAARMLGTRALARLKGSWRI 373
Query: 322 LSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDP 379
L+KVMWRPDK KLPSIILVCCLLHNI+ID DQL + L DHHD+GY E +Q P
Sbjct: 374 LNKVMWRPDKHKLPSIILVCCLLHNIMIDRQDQLLQSLELRDHHDTGYNEVNWQQEKP 431
>gi|294461638|gb|ADE76379.1| unknown [Picea sitchensis]
Length = 404
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 240/398 (60%), Positives = 289/398 (72%), Gaps = 9/398 (2%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSS--DSDWWPSFWAKNSSTP----GAT 54
MAPP++ + K+ K KKS +D +SDWW SFW +NSS+ G
Sbjct: 1 MAPPRRVLKGRNKE---KQKKSDGHMEKSRDGEEKVMESDWWHSFWQRNSSSFSSNSGLA 57
Query: 55 IPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
+E + FK F VS+KTFDYIC LVR+DL S+PPSGLIN EGRLL+VEKQV IALRRL
Sbjct: 58 YTGNETQRFKSTFGVSRKTFDYICFLVRQDLESKPPSGLINSEGRLLTVEKQVGIALRRL 117
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
ASGESQVSVG +FGVGQSTVSQVTWRFIE++EERA+HHLKWPD+N+MEEIK KFE GL
Sbjct: 118 ASGESQVSVGESFGVGQSTVSQVTWRFIESMEERARHHLKWPDANQMEEIKVKFETIQGL 177
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
NCCGAIDATHIIMTLP++ +S +W D E+NYSM++QGIVD +MRF DIVTG PG MN S
Sbjct: 178 PNCCGAIDATHIIMTLPSIDSSMEWHDCESNYSMVLQGIVDADMRFRDIVTGLPGSMNDS 237
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
RLL+ SGFF+LCE G+RLNG V+ + ++REYIVG GYPLLSWLITPY+ L++S
Sbjct: 238 RLLRSSGFFRLCEHGERLNGPVKELPQGSQIREYIVGDTGYPLLSWLITPYQGKDLASSR 297
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
FNS H AT +A AF +LKG WR L+ V+WRPDK KLP IILVCCLLHNIIID +
Sbjct: 298 VEFNSKHHATSQIAECAFSRLKGTWRFLNNVIWRPDKHKLPIIILVCCLLHNIIIDRDEN 357
Query: 355 LHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
L + L HD+GY Q + D G+ R+ L +L
Sbjct: 358 LSEHIPLPGPHDAGYKRQLNQFADLQGQALRDTLSDYL 395
>gi|357120067|ref|XP_003561752.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 500
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 222/402 (55%), Positives = 278/402 (69%), Gaps = 10/402 (2%)
Query: 1 MAPPKKSKSK------KTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNS-STPGA 53
M P KK + K + K + KLK +KS + D +WW +F + + S GA
Sbjct: 93 MPPVKKGERKIMNSGTRKTKDSGKLKSAKSDGTMAAELRGPDWEWWCTFLDRQAKSGEGA 152
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
+EE F++FFR+SK TF YICS+VR+DL+S PP GLI IEGRLL VEKQVAIA+ R
Sbjct: 153 PHSDQQEEAFRHFFRMSKSTFAYICSIVRKDLISLPPPGLITIEGRLLRVEKQVAIAIWR 212
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFG 173
LASG+SQ+SVG AFGVGQSTVSQVTWRFI+++E A+HHL WP+ RMEEIKS E G
Sbjct: 213 LASGDSQMSVGAAFGVGQSTVSQVTWRFIQSMERNARHHLVWPNRARMEEIKSNLEALSG 272
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
L NCCGAIDATHI+MTLP ++S+DW D YSML+QGI D EMRFI+IVTG+PG +
Sbjct: 273 LPNCCGAIDATHIVMTLPNAKSSEDWRDPAQKYSMLLQGIFDDEMRFINIVTGYPGSLPF 332
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISS---EEVELREYIVGGVGYPLLSWLITPYETNGL 290
++L+FS F++LC+AG+ L+G VR SS + E+RE+IVG YPLL WL+TPYE
Sbjct: 333 EKMLEFSQFYELCQAGKCLDGPVRGSSCKEDAAEIREFIVGDKCYPLLPWLMTPYEEEDP 392
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
SA M FN +A R L +A +LKG WRIL KV+WRPDK KLPS+ILVCCLLHNI+ID
Sbjct: 393 SAPMVNFNDHQKAARMLGTRALAKLKGCWRILDKVIWRPDKNKLPSLILVCCLLHNILID 452
Query: 351 SGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
D D + D H++GY C+ VDP G RE L ++L
Sbjct: 453 CKDDFLSDDEIPDRHETGYNRVNCEGVDPDGEEMREILTRYL 494
>gi|225435066|ref|XP_002281425.1| PREDICTED: putative nuclease HARBI1 [Vitis vinifera]
gi|297746131|emb|CBI16187.3| unnamed protein product [Vitis vinifera]
Length = 384
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 190/385 (49%), Positives = 254/385 (65%), Gaps = 16/385 (4%)
Query: 10 KKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRV 69
KK +K+ K+ + S+ S DWW F + G S + F+ F++
Sbjct: 8 KKRRKIEKREESSEEGSV----------DWWDEF---SKRIAGLLSSSKGLDKFESVFKI 54
Query: 70 SKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGV 129
S+KTF+YIC+LV+ED++++P I GR + + QVA+ALRRL+SG+S +++G AFG+
Sbjct: 55 SRKTFNYICALVKEDMMAKP-GNFIFTNGRPMCLNDQVAVALRRLSSGDSLLTIGDAFGL 113
Query: 130 GQSTVSQVTWRFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAIDATHIIM 188
STVSQVTWRF+E +EERA HHL+WP + + EI SKFE+ GL NCCGAID THI+M
Sbjct: 114 NHSTVSQVTWRFVEIMEERALHHLQWPSTEPEITEITSKFEKIRGLPNCCGAIDTTHIMM 173
Query: 189 TLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCE 247
LP+ +++ W D EN++SM++Q IVD EMRF DIVTGWPG M S +L+ S FFKLCE
Sbjct: 174 CLPSADSANSVWLDSENHHSMILQAIVDPEMRFRDIVTGWPGKMKDSSVLQSSNFFKLCE 233
Query: 248 AGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSL 307
GQRLNG +E E+ EYIVG GYPLL WL+TPY+ LS S FN H ATR +
Sbjct: 234 KGQRLNGKKIELAEGSEISEYIVGDSGYPLLPWLVTPYQGKELSESKAEFNRRHFATRMV 293
Query: 308 AVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
A +A +LK W+++ VMWRPDK +LP IILVCCLLHNI+ID D++ ++ LS HHD
Sbjct: 294 AQRALARLKEMWKVIQGVMWRPDKNRLPRIILVCCLLHNIVIDLEDEVQDEMPLSHHHDL 353
Query: 368 GYGEQCCKQVDPMGRTTRENLEKHL 392
GY +Q C+ D R+ L +L
Sbjct: 354 GYRQQICESADNNASIVRDKLSLYL 378
>gi|357155711|ref|XP_003577212.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 399
Score = 363 bits (932), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 180/366 (49%), Positives = 236/366 (64%), Gaps = 12/366 (3%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW F + + T + D+ F+ F++ +KTFDY+CSLV++D++ R S ++
Sbjct: 38 DWWDGFSVRMAGTLASM---DDAHRFESVFKMPRKTFDYVCSLVKDDMMVRSSSYTF-LD 93
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
GR++S+E +VA+ALRRL SG S V+VG + GV STVS +TWRF+EA+E RA HHL+WPD
Sbjct: 94 GRIMSLEDRVAVALRRLNSGGSLVTVGGSVGVNHSTVSLITWRFVEAVEARAGHHLRWPD 153
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVDH 216
S+ M IKSKFEE GL NCCG +D THI+M L + + W D + NYSM++Q ++D
Sbjct: 154 SDEMAMIKSKFEEIHGLPNCCGVVDTTHIVMCLSSAEPNCKVWQDHQKNYSMVLQAVIDP 213
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+MRF DIVTGWPG M S +L SG FKLCE G RLNG+ S+ E+ EY++G GYP
Sbjct: 214 DMRFTDIVTGWPGSMKESGILHSSGLFKLCENGYRLNGDKLTVSDGSEIGEYLLGDAGYP 273
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
L+ WL+TPY+ N LS S FN H A R++A+KA + K W+ L MWRPDK KLP
Sbjct: 274 LIPWLLTPYQENDLSDSKAEFNKRHLAARTVAMKALAKFKDTWKFLHGEMWRPDKHKLPR 333
Query: 337 IILVCCLLHNIIIDSGDQ--LHPDVA-----LSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
II VCCLLHNI+ID D D A LSD HD+ Y +Q C D R+ L
Sbjct: 334 IIHVCCLLHNIVIDMEDNAATEEDAAMEEARLSDDHDANYKQQVCLLADEKAVGVRDKLS 393
Query: 390 KHLQHN 395
+HL N
Sbjct: 394 EHLNRN 399
>gi|326528045|dbj|BAJ89074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 360 bits (925), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 174/358 (48%), Positives = 233/358 (65%), Gaps = 6/358 (1%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW F + + G P+ + F++ F++ ++TF+Y+CSLV++D++ R S ++
Sbjct: 34 DWWDGFCMRMA---GTLSPAADAHRFEFLFKMPRRTFNYVCSLVKDDMMVRASSYTF-LD 89
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G +LS+E +VA+ALRRL SG S V+VG + GV STVS +TWRF+EA+E RA HHL+WPD
Sbjct: 90 GTVLSLEDRVAVALRRLNSGGSLVTVGTSVGVNHSTVSLITWRFVEAVEARAGHHLRWPD 149
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVDH 216
S+ M IKSKFE+ GL NCCG +D THIIM L + + W D E NYSM++Q ++D
Sbjct: 150 SDEMAMIKSKFEKIHGLPNCCGVVDTTHIIMCLSSAEPNCKVWLDHEKNYSMVLQAVIDP 209
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+MRF DIVTGWPG M S +L SG F+LCE G RLNG + S+ E+ EY++G GYP
Sbjct: 210 DMRFTDIVTGWPGSMKESSILHSSGLFRLCENGVRLNGGKLMVSDGSEVGEYVIGDAGYP 269
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
LL WL+TPY+ N LS FN H A R++A+KA + K W+ L MWRPDK KLP
Sbjct: 270 LLPWLLTPYQENDLSDLKVEFNKRHSAARTVALKALARFKDTWKFLQGEMWRPDKHKLPR 329
Query: 337 IILVCCLLHNIIID-SGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQ 393
II VCCLLHNI+ID D D +S HD+ Y +Q C+ D R+ L +HL
Sbjct: 330 IIHVCCLLHNIVIDMEEDAAMDDAQISHDHDANYRQQVCQLSDEKAVRMRDRLSEHLN 387
>gi|356536488|ref|XP_003536769.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 382
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 180/375 (48%), Positives = 247/375 (65%), Gaps = 9/375 (2%)
Query: 20 KKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICS 79
K+ K+ + +D +++ DWW F + S+ S E F+ F++S+KTF+YICS
Sbjct: 9 KRKKADNKDGQDASATSFDWWHHFSLRISAR------SKNIEKFESVFKISRKTFNYICS 62
Query: 80 LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTW 139
LV ED+V+R S +++ G LS+ QVA+ALRRL+SGES ++G +F + QSTVSQVTW
Sbjct: 63 LVEEDMVARA-SNFVDLNGNRLSLNDQVAVALRRLSSGESLSTIGDSFRMNQSTVSQVTW 121
Query: 140 RFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD 198
+F+E +EER HHL WP + MEEIKSKFE GL NCCGAID+THI+MTLP+V +
Sbjct: 122 KFVETMEERGLHHLSWPSTEMEMEEIKSKFENIRGLSNCCGAIDSTHIMMTLPSVDALNS 181
Query: 199 -WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR 257
W D+E N SM++Q +VD ++RF DIVTGWPG M+ ++L+ S FKL E G+RLNG +
Sbjct: 182 VWIDREKNCSMVLQAVVDPDLRFHDIVTGWPGSMSDEQVLRSSSLFKLAEEGKRLNGGKK 241
Query: 258 ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKG 317
+ LREYI+G G+PLL WL+TP+E GLS FN T+ +A KA +LK
Sbjct: 242 TLPDGSVLREYIIGDTGFPLLPWLLTPFEGKGLSDIQVEFNKRIVETQMVAKKALARLKD 301
Query: 318 GWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV 377
W+I+ VMW+PDK KLP I+LVCC+LHNI++D D++H D+ HDS Y +Q +
Sbjct: 302 MWKIIQGVMWKPDKHKLPRIVLVCCILHNIVMDMEDEVHNDMLSCHQHDSRYQDQTSEFA 361
Query: 378 DPMGRTTRENLEKHL 392
D RE L +L
Sbjct: 362 DNTANIMREKLSLYL 376
>gi|255638892|gb|ACU19748.1| unknown [Glycine max]
Length = 382
Score = 360 bits (924), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 179/375 (47%), Positives = 247/375 (65%), Gaps = 9/375 (2%)
Query: 20 KKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICS 79
K+ K+ + +D +++ DWW F + S+ S E F+ F++S+KTF+YICS
Sbjct: 9 KRKKADNKDGQDASATSFDWWHHFSLRISAR------SKNIEKFESVFKISRKTFNYICS 62
Query: 80 LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTW 139
LV ED+V+R S +++ G LS+ QVA+ALRRL+SGES ++G +F + QSTVSQVTW
Sbjct: 63 LVEEDMVARA-SNFVDLNGNRLSLNDQVAVALRRLSSGESLSTIGDSFRMNQSTVSQVTW 121
Query: 140 RFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD 198
+F+E +EER HHL WP + MEEIKSKFE GL NCCGAID+THI+MTLP+V +
Sbjct: 122 KFVETMEERGLHHLSWPSTEMEMEEIKSKFENIRGLSNCCGAIDSTHIMMTLPSVDALNS 181
Query: 199 -WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR 257
W D+E N SM++Q +VD ++RF DIVTGWPG M+ ++L+ S FKL E G+RLNG +
Sbjct: 182 VWIDREKNCSMVLQAVVDPDLRFHDIVTGWPGSMSDEQVLRSSSLFKLAEEGKRLNGGKK 241
Query: 258 ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKG 317
+ LREYI+G G+PLL WL+TP+E GLS FN T+ +A KA +LK
Sbjct: 242 TLPDGSVLREYIIGDTGFPLLPWLLTPFEGKGLSDIQVEFNKRIVETQMVAKKALARLKD 301
Query: 318 GWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV 377
W+I+ VMW+PDK KLP I+LVCC+LHN+++D D++H D+ HDS Y +Q +
Sbjct: 302 MWKIIQGVMWKPDKHKLPRIVLVCCILHNVVMDMEDEVHNDMLSCHQHDSRYQDQTSEFA 361
Query: 378 DPMGRTTRENLEKHL 392
D RE L +L
Sbjct: 362 DNTANIMREKLSLYL 376
>gi|224098298|ref|XP_002311148.1| predicted protein [Populus trichocarpa]
gi|222850968|gb|EEE88515.1| predicted protein [Populus trichocarpa]
Length = 407
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 180/353 (50%), Positives = 240/353 (67%), Gaps = 6/353 (1%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW ++K + P + S+ + F+ FF++S+KTF+YICSLV+EDL +R S
Sbjct: 44 DWWDDDFSKRITGPSSE--SNNSKKFESFFKISRKTFNYICSLVKEDLRARQ-SNFTGSN 100
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G+ LSV QVA+ALRRL+SGES ++G G+ QSTVSQ+TWRF+EA+EER HHL WP
Sbjct: 101 GKPLSVTDQVAVALRRLSSGESLSNIGDLLGINQSTVSQITWRFVEAMEERGLHHLCWPS 160
Query: 158 SN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVD 215
+ MEEIKS FE+ GL NCCG+ID THI+MTLP V S+D W D+E N+SML+Q IVD
Sbjct: 161 TEAEMEEIKSNFEKICGLPNCCGSIDTTHIVMTLPTVDRSNDVWIDREKNHSMLLQAIVD 220
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
+MRF D++ G+PG ++ + +L+ S F+KLCE +RLNG E +EL EYI+G G+
Sbjct: 221 PDMRFRDVIVGYPGSLSDALVLQNSSFYKLCEEEKRLNGKKIELQEGMELGEYIIGDSGF 280
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
PLLSWL+TPY+ N LS FN H T+ +A A +LK WRI+ VMW PDK +LP
Sbjct: 281 PLLSWLLTPYQ-NALSDHQAEFNKRHSETQVVAQIALARLKEMWRIIHGVMWLPDKNRLP 339
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENL 388
II VCCLLHNI+ID D+ ++ S HD+ Y +Q C+ V+ G R+NL
Sbjct: 340 RIIFVCCLLHNIVIDMEDKGVEELPSSHRHDTDYRQQICETVNKTGIAVRDNL 392
>gi|449459932|ref|XP_004147700.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449513511|ref|XP_004164345.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 392
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 174/357 (48%), Positives = 242/357 (67%), Gaps = 7/357 (1%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW F S + + F+ F++S+KTF YICSLV+E ++++ S ++
Sbjct: 35 DWWDEF----SQRITGPLSQSKNTKFESVFKISRKTFSYICSLVKEVMMAKT-SSFTDLN 89
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G+ LS+ QVA+ALRRL SGES ++G +FG+ QS+VSQ+TWRF+EA+EE+ HHL WP
Sbjct: 90 GKPLSLNDQVAVALRRLCSGESLSNIGDSFGLNQSSVSQITWRFVEAMEEKGLHHLSWPS 149
Query: 158 SNR-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVD 215
+ M++IKSKF++ GL NCCG ++ THI+MTLP ++++ W D+E N SM++Q IVD
Sbjct: 150 TEEDMDKIKSKFKKIRGLPNCCGVVETTHIMMTLPTSESANGIWLDREKNCSMILQVIVD 209
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
EMRF DI+TGWPG ++ + +L+ SGFFKL + G+RLNG SE EL EYI+G G+
Sbjct: 210 PEMRFCDIITGWPGSLSDALVLQSSGFFKLSQDGERLNGKKMKLSESSELGEYIIGDSGF 269
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
PLL WL+TPY+ GL FN H ATR +A +A +LK W+I+ VMW+PDK +LP
Sbjct: 270 PLLPWLLTPYQGKGLPDYQAEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLP 329
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
IILVCCLLHNI+ID D++ ++ LS HHD Y +Q C+ VD +RE L +L
Sbjct: 330 RIILVCCLLHNIVIDMEDEVQDEMPLSHHHDPSYRQQSCEFVDNTASISREKLSMYL 386
>gi|15233248|ref|NP_191095.1| PIF / Ping-Pong family of plant transposase [Arabidopsis thaliana]
gi|7076778|emb|CAB75893.1| putative protein [Arabidopsis thaliana]
gi|21593300|gb|AAM65249.1| unknown [Arabidopsis thaliana]
gi|26449824|dbj|BAC42035.1| unknown protein [Arabidopsis thaliana]
gi|32362295|gb|AAP80175.1| At3g55350 [Arabidopsis thaliana]
gi|332645850|gb|AEE79371.1| PIF / Ping-Pong family of plant transposase [Arabidopsis thaliana]
Length = 406
Score = 357 bits (917), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 179/356 (50%), Positives = 230/356 (64%), Gaps = 10/356 (2%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW F + G+T P + F+ F++S+KTFDYICSLV+ D ++P + +
Sbjct: 53 DWWDGFSRRIYG--GSTDP----KTFESVFKISRKTFDYICSLVKADFTAKP-ANFSDSN 105
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G LS+ +VA+ALRRL SGES +G FG+ QSTVSQ+TWRF+E++EERA HHL WP
Sbjct: 106 GNPLSLNDRVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWP- 164
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDH 216
++++EIKSKFE+ GL NCCGAID THI+M LPAV+ S+ W D E N+SM +Q +VD
Sbjct: 165 -SKLDEIKSKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMTLQAVVDP 223
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+MRF+D++ GWPG +N +LK SGF+KL E G+RLNG SE ELREYIVG G+P
Sbjct: 224 DMRFLDVIAGWPGSLNDDVVLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFP 283
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
LL WL+TPY+ S FN H A A +LK WRI++ VMW PD+ +LP
Sbjct: 284 LLPWLLTPYQGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPR 343
Query: 337 IILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
II VCCLLHNIIID DQ D LS HD Y ++ CK D R+ L L
Sbjct: 344 IIFVCCLLHNIIIDMEDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQL 399
>gi|225449613|ref|XP_002279396.1| PREDICTED: putative nuclease HARBI1 [Vitis vinifera]
Length = 398
Score = 357 bits (915), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 181/357 (50%), Positives = 237/357 (66%), Gaps = 6/357 (1%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW F + G S F+ F++S+KTF+YICSLV+ED++++P +
Sbjct: 40 DWWDDF---SKRITGPLSQSRLSLTFESVFKISRKTFNYICSLVKEDMMAKP-ANFSASN 95
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G LS+ +VAIALRRL+SGES VG +FGV QSTV+Q+TW+F+E++E R HHL+WP
Sbjct: 96 GNPLSLNDRVAIALRRLSSGESFSIVGDSFGVNQSTVAQITWKFVESMEVRGLHHLQWPS 155
Query: 158 SNR-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVD 215
+ MEE KS FE+ GL NCCGA+D THI M+L + TS+ W D + +SM++Q IVD
Sbjct: 156 TEADMEETKSSFEKIRGLPNCCGAVDTTHISMSLSTMGTSNKVWIDHQKKHSMILQAIVD 215
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
EMRF D++TGWPG ++ S +L+ SGFFKL E G+RLNG + SE LREYIVG G+
Sbjct: 216 PEMRFRDVITGWPGSLSESCVLQNSGFFKLSEEGKRLNGKMIELSEGTYLREYIVGDAGF 275
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
PLL WL+TPY+ GLS FN H ATR +A +A +LK WRI+ VMWRPD KLP
Sbjct: 276 PLLPWLLTPYQGKGLSDFQSEFNKRHFATRMVARRALARLKEMWRIIQGVMWRPDTNKLP 335
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
IILVCCLLHNI+ID D + ++ S HD GY ++ C+ VD RENL +L
Sbjct: 336 RIILVCCLLHNIVIDLEDGVQDEMPFSYKHDPGYRQRSCESVDKTAFIQRENLSLYL 392
>gi|356575367|ref|XP_003555813.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 402
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 180/375 (48%), Positives = 243/375 (64%), Gaps = 6/375 (1%)
Query: 20 KKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICS 79
K+ K+ + D +++ DWW F + S G S E F+ F++S+KTF+YICS
Sbjct: 26 KRKKADNKDGPDASATSFDWWHHFSLRIS---GPLARSKNIEKFESVFKISRKTFNYICS 82
Query: 80 LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTW 139
LV ED+++R S +++ G LS+ QVA+ALRRL+SGES ++G +F + QSTVSQVTW
Sbjct: 83 LVEEDMLARA-SNFVDLNGNRLSLNDQVAVALRRLSSGESLSTIGESFRMNQSTVSQVTW 141
Query: 140 RFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD 198
+F+E +EER HHL W + MEEIKSKFE GL NCCGA+D+THI+MTLP+V +
Sbjct: 142 KFVETMEERGLHHLSWASTEMEMEEIKSKFENIRGLSNCCGAVDSTHIMMTLPSVDALNS 201
Query: 199 -WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR 257
W D+E N SM++Q IVD ++RF DIVTGWPG M+ ++L+ S FFKL E G+RLNG +
Sbjct: 202 VWLDREKNCSMVLQAIVDPDLRFRDIVTGWPGSMSDEQVLRSSSFFKLAEEGKRLNGGKK 261
Query: 258 ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKG 317
+ REYI+G G+PL SWL+TPYE G S FN T+ +A KA +LK
Sbjct: 262 TLPDGTLFREYIIGDTGFPLFSWLLTPYEGKGFSNVQVEFNKRVVETQMVAKKALARLKD 321
Query: 318 GWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV 377
W+I+ VMW+PDK KLP IILVCC+LHNI+ID D++ D+ HDS Y +Q +
Sbjct: 322 MWKIIQGVMWKPDKHKLPRIILVCCILHNIVIDMEDEVLIDMPSCHQHDSRYQDQTSEFA 381
Query: 378 DPMGRTTRENLEKHL 392
D RE L +L
Sbjct: 382 DNTATIMREKLSLYL 396
>gi|255582388|ref|XP_002531983.1| RNA binding protein, putative [Ricinus communis]
gi|223528380|gb|EEF30419.1| RNA binding protein, putative [Ricinus communis]
Length = 591
Score = 353 bits (905), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 185/385 (48%), Positives = 241/385 (62%), Gaps = 7/385 (1%)
Query: 10 KKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRV 69
K+ KK KK+ ++ SA DWW F + G S + F+ F++
Sbjct: 8 KRRKKAEKKVDQNVLASALSSLKQEQTLDWWDDF---SKRITGPLSGSRNTKKFESVFKI 64
Query: 70 SKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGV 129
S+KTF+YICSLV DL +R S L G+ LS QVAIALRRL+SGES S+G +FGV
Sbjct: 65 SRKTFNYICSLVNNDLKARQ-SNLTGTNGKPLSPNDQVAIALRRLSSGESLSSIGDSFGV 123
Query: 130 GQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHIIM 188
QSTVS VTWRF+EA+EER HHL+WP + M IKSKFE+ L NCCG ID TH++M
Sbjct: 124 NQSTVSHVTWRFVEAMEERGLHHLRWPSTETEMAAIKSKFEKIHSLPNCCGVIDTTHVVM 183
Query: 189 TLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCE 247
TLPAV S+D W D+E N+SM++Q IVD +MRF D++ G+PG ++ + +L+ S FFKL E
Sbjct: 184 TLPAVDHSNDVWIDREKNHSMVLQAIVDPDMRFRDVIVGYPGSLSDALVLQNSSFFKLSE 243
Query: 248 AGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSL 307
G+RLNG E EL EYI+G G+PL+ WL TPY+ + FN+ H AT L
Sbjct: 244 EGKRLNGKKMELMEGTELGEYIIGDAGFPLMPWLFTPYQHPHQEHQI-EFNNRHSATMLL 302
Query: 308 AVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
A A +LK WRI+ VMW PDK +LP II VCCLLHNI+ID D++ ++ +SD HD
Sbjct: 303 AQIALTRLKEIWRIIHGVMWLPDKNRLPRIIFVCCLLHNIVIDMEDKVLDEMPVSDSHDK 362
Query: 368 GYGEQCCKQVDPMGRTTRENLEKHL 392
Y +Q C+ G RE L +L
Sbjct: 363 DYRQQICESASNSGTEMREKLSLYL 387
>gi|388490968|gb|AFK33550.1| unknown [Lotus japonicus]
Length = 386
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 183/382 (47%), Positives = 245/382 (64%), Gaps = 9/382 (2%)
Query: 17 KKLKKSKSTSAAPKDP-NSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFD 75
KK KK+++ P S SDWW F + S G S + F+ ++S+KTF+
Sbjct: 8 KKRKKTENKDDHTATPLYSQPSDWWHHFSHRIS---GPLAQSKDIGKFESVLKISRKTFN 64
Query: 76 YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVS 135
YICSLV +D+++R +++ G LS+ QVA+ALRRL+SGES ++G +F + QS VS
Sbjct: 65 YICSLVEKDMLAR---SCVDLNGNHLSLNDQVAVALRRLSSGESLSTIGDSFLMNQSAVS 121
Query: 136 QVTWRFIEALEERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
QVTW F+EA+EER HHL WP + MEEIK KFE GL NCCGA+D+THI+MTLP+
Sbjct: 122 QVTWLFVEAMEERGLHHLSWPSTETAMEEIKFKFENIRGLSNCCGAVDSTHILMTLPSGD 181
Query: 195 TSDD-WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLN 253
T + W D++ N SM++Q IVD ++RF D+V GWPG ++ +L+ S FFKL E G+RLN
Sbjct: 182 TENSVWLDRKKNCSMILQAIVDPDLRFRDVVGGWPGSLSDEYVLRHSEFFKLAEEGKRLN 241
Query: 254 GNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFL 313
G ++ E LREYI+G G+PLL WL+TPYE LS FN AT +A +A
Sbjct: 242 GAEKMLPEGTALREYIIGDTGFPLLPWLLTPYECKDLSDVEVEFNKRVVATHMVAKRALA 301
Query: 314 QLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQC 373
+LK W+I+ VMW+PDK KLP I+LVCC+LHNI+ID D + +V L HDSGY +Q
Sbjct: 302 RLKQMWKIIQGVMWKPDKHKLPRIVLVCCILHNIVIDMEDGVMDEVPLCPQHDSGYQDQT 361
Query: 374 CKQVDPMGRTTRENLEKHLQHN 395
C+ D T RE L HL N
Sbjct: 362 CEFADNTAYTMREKLSLHLSGN 383
>gi|356558280|ref|XP_003547435.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 395
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 179/374 (47%), Positives = 246/374 (65%), Gaps = 5/374 (1%)
Query: 21 KSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSL 80
K+ S +A+ DWW + + + PS + F+ F++S+KTF+YICSL
Sbjct: 19 KNGSAAASGSPEGDGPVDWWDDLSKRMNGLQQS--PSKCLDNFESVFKISRKTFEYICSL 76
Query: 81 VREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWR 140
V++D++ + + + + +S+ +QVA+ALRRL SGES V++G +FG+ STVSQVTWR
Sbjct: 77 VKDDMMMKA-AHFVFTNRKPISLCEQVAVALRRLGSGESLVTIGDSFGLNHSTVSQVTWR 135
Query: 141 FIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTS-DD 198
F+E++EER HHL+WP + M IKSKFE+ GL NCCG IDATHI M LPA + S +
Sbjct: 136 FVESMEERGLHHLRWPSTEVEMNAIKSKFEKIHGLPNCCGVIDATHITMCLPASEPSCNV 195
Query: 199 WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRI 258
W D + N+SM++Q IVD +MRF DIVTGWPG M + + S F KLCE G+RLNG++
Sbjct: 196 WLDHKKNHSMVLQAIVDPDMRFRDIVTGWPGKMEDWMIFESSNFNKLCEKGERLNGSILQ 255
Query: 259 SSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGG 318
SE E+REYI+G GY LL +L+ PYE LS FN+ H ATR +A +A +LK
Sbjct: 256 LSEGSEIREYIIGDSGYRLLPYLVVPYEGKELSEPKAQFNNQHLATRMVAQRALARLKDM 315
Query: 319 WRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVD 378
WRI+ +MWRPDK +LP IILVCC+LHNI+ID D++ +++LS HDSGY + C VD
Sbjct: 316 WRIIHGMMWRPDKHRLPRIILVCCVLHNIVIDLNDEVQDELSLSLDHDSGYHQLVCGAVD 375
Query: 379 PMGRTTRENLEKHL 392
G R+ L +L
Sbjct: 376 EKGVALRDRLSHYL 389
>gi|242069533|ref|XP_002450043.1| hypothetical protein SORBIDRAFT_05g027460 [Sorghum bicolor]
gi|241935886|gb|EES09031.1| hypothetical protein SORBIDRAFT_05g027460 [Sorghum bicolor]
Length = 390
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 174/358 (48%), Positives = 234/358 (65%), Gaps = 6/358 (1%)
Query: 37 SDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI 96
SDWW +F + S T + ++ + F+ F++ ++ FDY+C+LV+++++ R S +
Sbjct: 34 SDWWDAFCMRMSGTLSSV---EDAQRFESVFKMPRRAFDYVCNLVKDEMMVRS-SSYTFL 89
Query: 97 EGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP 156
+G +L +E +VAIALRRL SG S V+VG + GV STVS +TWRFIEA+EERA HHL+WP
Sbjct: 90 DGTMLCLEDRVAIALRRLNSGGSLVTVGSSVGVNHSTVSLITWRFIEAMEERASHHLRWP 149
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVD 215
DS+ ME+IKSKF++ GL NCCG +D THI M L + + W DQE NYSM++Q +VD
Sbjct: 150 DSSEMEKIKSKFQKIHGLPNCCGVVDTTHITMCLSSAEPNCKVWLDQEKNYSMVLQAVVD 209
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
+ RFIDIVTGWPG M S +L SG FKL E G+RLNG+ S+ E+ EY++G GY
Sbjct: 210 LDTRFIDIVTGWPGSMKESSILHSSGLFKLWEKGERLNGSKFKVSDGSEIGEYLIGDSGY 269
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
PLL WL+TPY+ L+ S FNS H A R +A + + K W+ L MWRPDK KLP
Sbjct: 270 PLLPWLLTPYQEKELTESSTEFNSRHSAARIVAPRTLAKFKDTWKFLQGEMWRPDKHKLP 329
Query: 336 SIILVCCLLHNIIIDSGDQLHPDV-ALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
II VCCLLHNIIID + + S HD+ Y +Q C+ D G R+ L +HL
Sbjct: 330 RIIHVCCLLHNIIIDLQEAAVDEARTWSSDHDAKYRQQVCQLADENGVKMRDKLSQHL 387
>gi|242068835|ref|XP_002449694.1| hypothetical protein SORBIDRAFT_05g021710 [Sorghum bicolor]
gi|241935537|gb|EES08682.1| hypothetical protein SORBIDRAFT_05g021710 [Sorghum bicolor]
Length = 381
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 172/359 (47%), Positives = 231/359 (64%), Gaps = 8/359 (2%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW +F + S T ++ + F+ F++ ++ FDY+C+LV+++++ R S ++
Sbjct: 24 DWWDAFCRRMSGTLSCI---EDAQRFESVFKMPRRAFDYVCNLVKDEMMVRSSSYTF-LD 79
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G +L +E +VAIALRRL SG S +VG + GV STVS +TWRFIEA+EERA HHL+WPD
Sbjct: 80 GTMLCLEDRVAIALRRLNSGGSLATVGSSVGVNHSTVSLITWRFIEAMEERASHHLRWPD 139
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVDH 216
S ME+IKSKFE+ GL NCCG +D THI M L + + W DQE NYSM++Q +VD
Sbjct: 140 SGEMEKIKSKFEKIHGLTNCCGVVDTTHITMCLSSAEPNCKVWLDQEKNYSMVLQAVVDL 199
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+ RF DIVTGWPG M S +L SG FKLC G+RLNG+ S+ E+ EY++G GYP
Sbjct: 200 DTRFTDIVTGWPGSMKESSILHSSGLFKLCGEGERLNGSKVKVSDGSEIGEYLIGDSGYP 259
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
LL WL+TPY+ L+ S FNS H A R++A + + K W+ L MWRPDK KLP
Sbjct: 260 LLPWLLTPYQEKDLTESSAEFNSRHSAARAVAPRTLAKFKDTWKFLQGEMWRPDKHKLPR 319
Query: 337 IILVCCLLHNIIIDSGDQLHPDVAL---SDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
II VCCLLHNIIID + + S+ HD+ Y +Q C+ D G R+ L +HL
Sbjct: 320 IIHVCCLLHNIIIDLQETTMDEARAWPKSNDHDANYRQQVCQLADENGIKVRDKLSEHL 378
>gi|212721120|ref|NP_001132644.1| hypothetical protein [Zea mays]
gi|194694982|gb|ACF81575.1| unknown [Zea mays]
gi|413920177|gb|AFW60109.1| hypothetical protein ZEAMMB73_335492 [Zea mays]
Length = 390
Score = 347 bits (890), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 182/394 (46%), Positives = 247/394 (62%), Gaps = 9/394 (2%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEE 60
M P + +K ++ + KK SAA DP DWW +F + S T + ++
Sbjct: 1 MPPLRGAKRRRKAALEKKAAAVAMASAAAGDP---PGDWWEAFCIRMSGTLSSI---EDA 54
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
+ F+ F++ ++ FDY+C+LVR++++ R S ++G +L +E +VAIALRRL SG S
Sbjct: 55 QRFESIFKMPRRAFDYVCNLVRDEMMVRSCSYTF-LDGTVLCLEDRVAIALRRLNSGGSL 113
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
V+VG + GV STVS +TWRFIEA+EERA HHL+WPDS ME+IKSKFE+ GL NCCG
Sbjct: 114 VTVGSSVGVNHSTVSLITWRFIEAMEERASHHLRWPDSIEMEKIKSKFEKIHGLPNCCGV 173
Query: 181 IDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
+D THI M L + + W DQE NYSM++Q +VD + +F DIVTGWPG M S +L
Sbjct: 174 VDTTHITMCLSSAEPNCKVWLDQEKNYSMVLQAVVDLDTKFTDIVTGWPGSMKESSILHS 233
Query: 240 SGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNS 299
SG FKLCE G+RLNG+ S+ E+ EY++G GYPLL WL+TPY+ L+ S FN
Sbjct: 234 SGLFKLCEKGERLNGSKLKVSDGSEIGEYLIGDSGYPLLPWLLTPYQEKDLTESSAEFNI 293
Query: 300 LHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDV 359
H + R++A + + K W+ L MWRPDK KLP II VCCLLHNIIID + +
Sbjct: 294 RHSSARTVAPRTLAKFKDTWKFLQGEMWRPDKHKLPRIIHVCCLLHNIIIDLQETAVDEA 353
Query: 360 -ALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
A HD+ Y +Q C+ D G R+ L +HL
Sbjct: 354 QAWPSDHDAKYRQQVCQLADENGVRVRDKLSQHL 387
>gi|356550247|ref|XP_003543499.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 393
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 179/374 (47%), Positives = 245/374 (65%), Gaps = 7/374 (1%)
Query: 21 KSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSL 80
K+ S S +P+ DWW + + + PS + F+ F++S+KTF+YICSL
Sbjct: 19 KNGSGSGSPE--GEGPVDWWDDLSKRMNGLQQS--PSKCLDNFESVFKISRKTFEYICSL 74
Query: 81 VREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWR 140
V++D++++ + + G+ +S+ +QVA+ALRRL SGES V++G +FG+ STVSQVTWR
Sbjct: 75 VKDDMMTKA-AHFVFTNGKPISLREQVAVALRRLGSGESLVTIGDSFGLNHSTVSQVTWR 133
Query: 141 FIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ-TSDD 198
F+E++EER HHL+WP + M IKSKFE+ GL NCCG IDAT+I M LPA + +
Sbjct: 134 FVESMEERGLHHLRWPSTEVEMNAIKSKFEKIRGLPNCCGVIDATNITMCLPASEPYCNV 193
Query: 199 WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRI 258
W D + N+SM++Q IVD +MRF DIVTGWPG M + + S F KLCE G+RLNG+
Sbjct: 194 WLDHKKNHSMVLQAIVDPDMRFRDIVTGWPGKMEDWLIFESSNFNKLCEKGERLNGSKLQ 253
Query: 259 SSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGG 318
SE E+REYI+G GY LL +L+ PYE LS FN H ATR +A +A +LK
Sbjct: 254 LSEGSEIREYILGDSGYRLLPYLVVPYEGKELSEPKAQFNKQHLATRMVAQRALARLKEM 313
Query: 319 WRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVD 378
WRI+ +MWRPDK +LP IILVCC+LHNI+ID D++ +++LS HDSGY C VD
Sbjct: 314 WRIIHGMMWRPDKHRLPRIILVCCVLHNIVIDLNDEVQDELSLSLDHDSGYHRLVCGAVD 373
Query: 379 PMGRTTRENLEKHL 392
G R+ L +L
Sbjct: 374 EKGVALRDRLSHYL 387
>gi|357454103|ref|XP_003597332.1| hypothetical protein MTR_2g096450 [Medicago truncatula]
gi|355486380|gb|AES67583.1| hypothetical protein MTR_2g096450 [Medicago truncatula]
Length = 388
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 183/389 (47%), Positives = 248/389 (63%), Gaps = 19/389 (4%)
Query: 11 KTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVS 70
K ++ T+K + + SA+ DWW F +K + + P +S
Sbjct: 8 KKRRKTEKSYDNNNDSASGSSEKEGVVDWWDEF-SKKINGLQKSPP-----------MIS 55
Query: 71 KKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVG 130
+KTF+YICSLV++D+ ++ + L G+ +S+ QVA+ALRRL SG+S V+VG +FG+
Sbjct: 56 RKTFEYICSLVKDDMTTKS-AHLTFTNGKPMSLFDQVAVALRRLGSGDSLVTVGDSFGLT 114
Query: 131 QSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATHIIMT 189
STVSQVTWRF+E++E+R HHL+WP + M IKSKFE+ G NCCGA+DATHI M
Sbjct: 115 YSTVSQVTWRFVESMEDRGLHHLQWPSTQEEMNAIKSKFEKIQGFPNCCGAVDATHITMC 174
Query: 190 LPAV-QTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEA 248
LPA +SD W D NN+SM++Q IVD +M+F DIVTGWPG + + + S F KLC+
Sbjct: 175 LPATDHSSDVWLDHRNNHSMVLQAIVDPDMKFRDIVTGWPGKLEDWSIFESSNFNKLCDK 234
Query: 249 GQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL--SASMPTFNSLHEATRS 306
G+RLNGN SE E+REYI+G GYPLL +L+ PY+ L S + FN LH ATR
Sbjct: 235 GERLNGNKLKLSEGSEIREYIIGDSGYPLLPYLVVPYKEKELLESEAKTHFNKLHLATRM 294
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDH-- 364
+A +A +LK WRI+ MWRPDK +LP IILVCCLLHNI+ID D++ ++ H
Sbjct: 295 VAQRALTRLKEMWRIIRGNMWRPDKHRLPRIILVCCLLHNIVIDMQDEVKDELLYLYHNN 354
Query: 365 HDSGYGEQCCKQVDPMGRTTRENLEKHLQ 393
HDSGY + C+ VD MG RE+L +L
Sbjct: 355 HDSGYHQLICEGVDEMGVALRESLSHYLN 383
>gi|218186214|gb|EEC68641.1| hypothetical protein OsI_37051 [Oryza sativa Indica Group]
Length = 391
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 162/335 (48%), Positives = 223/335 (66%), Gaps = 2/335 (0%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+ + F++ F++ ++TF+YIC LVR++++ R S ++G++LS+E +VA+AL RL SG
Sbjct: 55 DAQRFEFIFKMPRRTFNYICGLVRDEMMVRSSSYTF-LDGKVLSLEDRVAVALIRLNSGG 113
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
S V+VG A GV STVS +TWRF+EA+EERA HHL+WPDS+ +E+IKS FE+ GL NCC
Sbjct: 114 SLVTVGSAVGVNHSTVSLITWRFVEAMEERASHHLRWPDSSEVEKIKSMFEKIHGLPNCC 173
Query: 179 GAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G +D THI M L + + W D E NYSM++Q ++ +MRF+DIVTGWPG M S +L
Sbjct: 174 GVVDTTHITMCLSSAEPNCKVWLDHEKNYSMVLQAVISPDMRFMDIVTGWPGSMKESSIL 233
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
SG FK+CE G RLNG+ + S+ E+ EYI+G GYPLL WL+TPY+ LS S F
Sbjct: 234 HSSGLFKMCEKGARLNGSKMVVSDGSEIGEYIIGDAGYPLLPWLLTPYQEKDLSDSKLEF 293
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
N H A ++A + K W+ L MWRPDK +LP II VCC+LHNIII D
Sbjct: 294 NKRHAAAITVAPRTLANFKDTWKFLHGEMWRPDKHRLPRIIHVCCMLHNIIICLQDATID 353
Query: 358 DVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ A+S+ HD+ Y +Q C+ D R+ L +HL
Sbjct: 354 EAAMSNDHDANYKQQVCQLADENAVRVRDKLSEHL 388
>gi|222616440|gb|EEE52572.1| hypothetical protein OsJ_34844 [Oryza sativa Japonica Group]
Length = 552
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 162/335 (48%), Positives = 222/335 (66%), Gaps = 2/335 (0%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+ + F++ F++ ++TF+YIC LVR++++ R S ++G++LS+E +VA+AL RL SG
Sbjct: 55 DAQRFEFIFKMPRRTFNYICGLVRDEMMVRS-SSYTFLDGKVLSLEDRVAVALIRLNSGG 113
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
S V+VG A GV STVS +TWRF+EA+EERA HHL+WPDS+ +E+IKS FE+ GL NCC
Sbjct: 114 SLVTVGSAVGVNHSTVSLITWRFVEAMEERASHHLRWPDSSEVEKIKSMFEKIHGLPNCC 173
Query: 179 GAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G +D THI M L + + W D E NYSM++Q ++ +MRF+DIVTGWPG M S +L
Sbjct: 174 GVVDTTHITMCLSSAEPNCKVWLDHEKNYSMVLQAVISPDMRFMDIVTGWPGSMKESSIL 233
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
SG FK+CE G RLNG+ + S+ E+ EYI+G GYPLL WL+TPY+ LS S F
Sbjct: 234 HSSGLFKMCEKGARLNGSKMVVSDGSEIGEYIIGDAGYPLLPWLLTPYQEKDLSDSKLEF 293
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
N H A ++A K W+ L MWRPDK +LP II VCC+LHNIII D
Sbjct: 294 NKRHAAAITVAPSTLANFKDTWKFLHGEMWRPDKHRLPRIIHVCCMLHNIIICLQDATID 353
Query: 358 DVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ A+S+ HD+ Y +Q C+ D R+ L +HL
Sbjct: 354 EAAMSNDHDANYKQQVCQLADENAVRVRDKLSEHL 388
>gi|115486779|ref|NP_001068533.1| Os11g0702700 [Oryza sativa Japonica Group]
gi|62733225|gb|AAX95342.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108864702|gb|ABG22606.1| expressed protein [Oryza sativa Japonica Group]
gi|113645755|dbj|BAF28896.1| Os11g0702700 [Oryza sativa Japonica Group]
gi|215693890|dbj|BAG89089.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 391
Score = 342 bits (876), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 162/335 (48%), Positives = 222/335 (66%), Gaps = 2/335 (0%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+ + F++ F++ ++TF+YIC LVR++++ R S ++G++LS+E +VA+AL RL SG
Sbjct: 55 DAQRFEFIFKMPRRTFNYICGLVRDEMMVRSSSYTF-LDGKVLSLEDRVAVALIRLNSGG 113
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
S V+VG A GV STVS +TWRF+EA+EERA HHL+WPDS+ +E+IKS FE+ GL NCC
Sbjct: 114 SLVTVGSAVGVNHSTVSLITWRFVEAMEERASHHLRWPDSSEVEKIKSMFEKIHGLPNCC 173
Query: 179 GAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G +D THI M L + + W D E NYSM++Q ++ +MRF+DIVTGWPG M S +L
Sbjct: 174 GVVDTTHITMCLSSAEPNCKVWLDHEKNYSMVLQAVISPDMRFMDIVTGWPGSMKESSIL 233
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
SG FK+CE G RLNG+ + S+ E+ EYI+G GYPLL WL+TPY+ LS S F
Sbjct: 234 HSSGLFKMCEKGARLNGSKMVVSDGSEIGEYIIGDAGYPLLPWLLTPYQEKDLSDSKLEF 293
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
N H A ++A K W+ L MWRPDK +LP II VCC+LHNIII D
Sbjct: 294 NKRHAAAITVAPSTLANFKDTWKFLHGEMWRPDKHRLPRIIHVCCMLHNIIICLQDATID 353
Query: 358 DVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ A+S+ HD+ Y +Q C+ D R+ L +HL
Sbjct: 354 EAAMSNDHDANYKQQVCQLADENAVRVRDKLSEHL 388
>gi|108864703|gb|ABA95488.2| expressed protein [Oryza sativa Japonica Group]
Length = 398
Score = 342 bits (876), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 162/335 (48%), Positives = 222/335 (66%), Gaps = 2/335 (0%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+ + F++ F++ ++TF+YIC LVR++++ R S ++G++LS+E +VA+AL RL SG
Sbjct: 62 DAQRFEFIFKMPRRTFNYICGLVRDEMMVRSSSYTF-LDGKVLSLEDRVAVALIRLNSGG 120
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
S V+VG A GV STVS +TWRF+EA+EERA HHL+WPDS+ +E+IKS FE+ GL NCC
Sbjct: 121 SLVTVGSAVGVNHSTVSLITWRFVEAMEERASHHLRWPDSSEVEKIKSMFEKIHGLPNCC 180
Query: 179 GAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G +D THI M L + + W D E NYSM++Q ++ +MRF+DIVTGWPG M S +L
Sbjct: 181 GVVDTTHITMCLSSAEPNCKVWLDHEKNYSMVLQAVISPDMRFMDIVTGWPGSMKESSIL 240
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
SG FK+CE G RLNG+ + S+ E+ EYI+G GYPLL WL+TPY+ LS S F
Sbjct: 241 HSSGLFKMCEKGARLNGSKMVVSDGSEIGEYIIGDAGYPLLPWLLTPYQEKDLSDSKLEF 300
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
N H A ++A K W+ L MWRPDK +LP II VCC+LHNIII D
Sbjct: 301 NKRHAAAITVAPSTLANFKDTWKFLHGEMWRPDKHRLPRIIHVCCMLHNIIICLQDATID 360
Query: 358 DVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ A+S+ HD+ Y +Q C+ D R+ L +HL
Sbjct: 361 EAAMSNDHDANYKQQVCQLADENAVRVRDKLSEHL 395
>gi|449517792|ref|XP_004165928.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Cucumis sativus]
Length = 388
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 177/354 (50%), Positives = 228/354 (64%), Gaps = 7/354 (1%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW F + + A+ D FK F+VS+KTFDYIC LV++D+ ++ +
Sbjct: 34 DWWDDFSKRTNGLHSASKGLDR---FKSIFKVSRKTFDYICLLVKDDMTAKS-GHFTFLN 89
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
GR LS+ QVA+ALRRL SGES V++G + G+ STVSQVTWRF+E++EER HHL WP
Sbjct: 90 GRPLSLCDQVAVALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPS 149
Query: 158 SN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVD 215
+ M ++KSKFE+ GL NCCG+ID THI M LPA TS W D + N+SM++Q IVD
Sbjct: 150 NEVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDXQKNHSMVLQVIVD 209
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
EMRF DI+TG PG ++ + + S F KLC+ G+RLNG + E+REYI+G GY
Sbjct: 210 AEMRFRDILTGLPGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGY 269
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
PLL +L+TPY+ LS S FN H+ TR + +A LK WRI+ VMWRPDK +LP
Sbjct: 270 PLLPYLVTPYDGKELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLP 329
Query: 336 SIILVCCLLHNIIIDSGDQLHP-DVALSDHHDSGYGEQCCKQVDPMGRTTRENL 388
IILVCCLLHNIIID GD+ +V LS HD Y +Q C D G R+ L
Sbjct: 330 RIILVCCLLHNIIIDIGDETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRL 383
>gi|449462621|ref|XP_004149039.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 388
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 177/354 (50%), Positives = 228/354 (64%), Gaps = 7/354 (1%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW F + + A+ D FK F+VS+KTFDYIC LV++D+ ++ +
Sbjct: 34 DWWDDFSKRTNGLHSASKGLDR---FKSIFKVSRKTFDYICLLVKDDMTAKS-GHFTFLN 89
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
GR LS+ QVA+ALRRL SGES V++G + G+ STVSQVTWRF+E++EER HHL WP
Sbjct: 90 GRPLSLCDQVAVALRRLGSGESLVTIGDSLGLNHSTVSQVTWRFVESMEERGLHHLHWPS 149
Query: 158 SN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ-TSDDWCDQENNYSMLVQGIVD 215
+ M ++KSKFE+ GL NCCG+ID THI M LPA TS W D + N+SM++Q IVD
Sbjct: 150 NEVEMAQVKSKFEKIQGLPNCCGSIDTTHITMCLPASDPTSYVWLDDKKNHSMVLQVIVD 209
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
EMRF DI+TG PG ++ + + S F KLC+ G+RLNG + E+REYI+G GY
Sbjct: 210 AEMRFRDILTGLPGKLSDWLVFQSSNFHKLCDKGERLNGKRFELPDRSEIREYIIGDSGY 269
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
PLL +L+TPY+ LS S FN H+ TR + +A LK WRI+ VMWRPDK +LP
Sbjct: 270 PLLPYLVTPYDGKELSTSKTEFNKRHKETRLVVQRALAMLKERWRIIQGVMWRPDKHRLP 329
Query: 336 SIILVCCLLHNIIIDSGDQLHP-DVALSDHHDSGYGEQCCKQVDPMGRTTRENL 388
IILVCCLLHNIIID GD+ +V LS HD Y +Q C D G R+ L
Sbjct: 330 RIILVCCLLHNIIIDIGDETEEGEVPLSIEHDVDYKQQVCDVFDSKGAYVRDRL 383
>gi|242059175|ref|XP_002458733.1| hypothetical protein SORBIDRAFT_03g039260 [Sorghum bicolor]
gi|241930708|gb|EES03853.1| hypothetical protein SORBIDRAFT_03g039260 [Sorghum bicolor]
Length = 400
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 173/381 (45%), Positives = 228/381 (59%), Gaps = 11/381 (2%)
Query: 18 KLKKSKSTSAAPKD----PNSSDSDWWPSFWAKNSSTPGAT-IPSDEEEGFKYFFRVSKK 72
K +K K A P P D DWW +F + ++ T PS E + F+ ++S+K
Sbjct: 8 KRRKRKPERALPAQERPLPQPLDGDWWDAFSRRLAAVRCFTGQPSKERQNFETALKMSRK 67
Query: 73 TFDYICSLVREDLVSRPPSGLINI---EGRLLSVEKQVAIALRRLASGESQVSVGVAFGV 129
TFDY+C L++ D +R N + +L VE QVAIAL RL +GES + +G FG+
Sbjct: 68 TFDYLCCLIKGD-CTRKTQSYRNFRFGDKVVLGVEDQVAIALLRLTTGESLLGIGTRFGM 126
Query: 130 GQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMT 189
S +S +TWRFIE LEERA HLKWPD M IK+KFE+ GL NCCGAID TH++M
Sbjct: 127 NHSAISNITWRFIECLEERAASHLKWPDPEEMAAIKAKFEKIQGLPNCCGAIDTTHVLMC 186
Query: 190 LPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
A S+ W D EN SM++Q IVD +MRF D+V+GWPG ++ S +L+ SGF++LC+ G
Sbjct: 187 SSAQPNSNVWLDGENRNSMVLQAIVDPDMRFRDVVSGWPGSLDDSCILRTSGFYRLCQKG 246
Query: 250 QRLNGNVRISSEEVE--LREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSL 307
RL+G + + E +REYIVG YPLL WL+TPY+ +GLSA FN H ATR +
Sbjct: 247 SRLSGQMELPGESAGSIVREYIVGDPSYPLLPWLVTPYQEHGLSAEKVEFNKCHTATRMV 306
Query: 308 AVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
A LK WR+L +WRPDK +LP II CCLL NI+ID D + + S HD
Sbjct: 307 VQGALANLKDRWRVLKGELWRPDKHRLPRIIYACCLLTNIMIDLEDAVRDGMPASHSHDD 366
Query: 368 GYGEQCCKQVDPMGRTTRENL 388
GY +Q D T R+ L
Sbjct: 367 GYRQQVSNLADDGAVTQRDLL 387
>gi|357454107|ref|XP_003597334.1| hypothetical protein MTR_2g096480 [Medicago truncatula]
gi|355486382|gb|AES67585.1| hypothetical protein MTR_2g096480 [Medicago truncatula]
Length = 390
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 164/330 (49%), Positives = 224/330 (67%), Gaps = 7/330 (2%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
+S+KTF+YICSLV++D +++ S G+ +S+ QVA+ALRRL SG+S V++G +FG
Sbjct: 56 ISRKTFEYICSLVKDD-IAKKSSHFSFSNGKPVSLTDQVAVALRRLGSGDSFVTIGDSFG 114
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATHII 187
+ STVSQVTWRF+E++EER HHL+WP + M IKSKFE+ G NCCGA+D THI
Sbjct: 115 LSHSTVSQVTWRFVESMEERGLHHLQWPSTQEEMNSIKSKFEKIQGFPNCCGAVDVTHIT 174
Query: 188 MTLPAV-QTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
M LPA Q+SD W D +NN+SM++Q IVD +M+F DIVTGWPG M + + S F KLC
Sbjct: 175 MLLPATEQSSDVWLDHKNNHSMVLQAIVDPDMKFRDIVTGWPGKMEDWSIFESSNFNKLC 234
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP--TFNSLHEAT 304
+ G+RLNG S+ E+REYI+G GYP L +L+ PYE + S P FN LH T
Sbjct: 235 DNGERLNGKKLKLSKGSEIREYIIGDSGYPQLPYLVVPYEEKEILESEPKAKFNKLHLET 294
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVA--LS 362
R +A +A +LK W+I+ MWRPDK +LP IILVCC+LHNI+ID D+++ ++
Sbjct: 295 RMVAQRALTRLKEMWKIIRGKMWRPDKHRLPRIILVCCILHNIVIDMQDEVNDELLCFYR 354
Query: 363 DHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
++HDSGY + + D G RE+L +L
Sbjct: 355 NNHDSGYHQLVYEGFDEKGVALRESLSHYL 384
>gi|242088183|ref|XP_002439924.1| hypothetical protein SORBIDRAFT_09g022640 [Sorghum bicolor]
gi|241945209|gb|EES18354.1| hypothetical protein SORBIDRAFT_09g022640 [Sorghum bicolor]
Length = 390
Score = 325 bits (833), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 167/395 (42%), Positives = 231/395 (58%), Gaps = 12/395 (3%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEE 60
MAP + +K +K + D SDWW F+ + G + E
Sbjct: 1 MAPVRGAKKRKRPEKPAPAPAPGLPLPPLPD----GSDWWSVFYRR---IAGHSPFPREH 53
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI---EGRLLSVEKQVAIALRRLASG 117
+ + ++S+KTFDY+CSLV++DL ++ G N + +L VE QVA+AL RL +G
Sbjct: 54 QNMESVLKMSRKTFDYVCSLVKKDLTTKT-YGFRNFRFGDKTILGVEDQVAVALMRLTTG 112
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
ES ++G+ FG+ S +S +TWRFIE++EERA HLKWP M IK++F++ +GL NC
Sbjct: 113 ESLQNIGMWFGMNHSAISNITWRFIESVEERAICHLKWPSHEEMATIKARFDKIYGLPNC 172
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
CGAID THI+M A S W D EN SM++Q +VD +MRF DIV+GWPG M+ S +L
Sbjct: 173 CGAIDTTHILMCSSAQPNSKVWLDNENKNSMVLQAVVDADMRFRDIVSGWPGSMDDSCIL 232
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
+ SG ++LCE G RL G + + +REYIVG YPLL WL+TPY +GL A+ F
Sbjct: 233 RTSGLYRLCEKGLRLEGQMELPGGSA-VREYIVGDASYPLLPWLMTPYRGHGLPAAKVEF 291
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
N H A ++ A LKG WR++ +WRPDK +LP II VCCL+ NIIID
Sbjct: 292 NKRHTAATAVVQTALATLKGRWRVIQGELWRPDKHRLPRIIFVCCLITNIIIDMDGTPSK 351
Query: 358 DVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
D S +HD GY +Q D R++L +H+
Sbjct: 352 DKLFSGNHDHGYKQQFSNVADDNAVKQRDDLSQHV 386
>gi|124359622|gb|ABN06006.1| Trp repressor/replication initiator [Medicago truncatula]
Length = 342
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 164/330 (49%), Positives = 224/330 (67%), Gaps = 7/330 (2%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
+S+KTF+YICSLV++D +++ S G+ +S+ QVA+ALRRL SG+S V++G +FG
Sbjct: 8 ISRKTFEYICSLVKDD-IAKKSSHFSFSNGKPVSLTDQVAVALRRLGSGDSFVTIGDSFG 66
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATHII 187
+ STVSQVTWRF+E++EER HHL+WP + M IKSKFE+ G NCCGA+D THI
Sbjct: 67 LSHSTVSQVTWRFVESMEERGLHHLQWPSTQEEMNSIKSKFEKIQGFPNCCGAVDVTHIT 126
Query: 188 MTLPAV-QTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
M LPA Q+SD W D +NN+SM++Q IVD +M+F DIVTGWPG M + + S F KLC
Sbjct: 127 MLLPATEQSSDVWLDHKNNHSMVLQAIVDPDMKFRDIVTGWPGKMEDWSIFESSNFNKLC 186
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP--TFNSLHEAT 304
+ G+RLNG S+ E+REYI+G GYP L +L+ PYE + S P FN LH T
Sbjct: 187 DNGERLNGKKLKLSKGSEIREYIIGDSGYPQLPYLVVPYEEKEILESEPKAKFNKLHLET 246
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVA--LS 362
R +A +A +LK W+I+ MWRPDK +LP IILVCC+LHNI+ID D+++ ++
Sbjct: 247 RMVAQRALTRLKEMWKIIRGKMWRPDKHRLPRIILVCCILHNIVIDMQDEVNDELLCFYR 306
Query: 363 DHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
++HDSGY + + D G RE+L +L
Sbjct: 307 NNHDSGYHQLVYEGFDEKGVALRESLSHYL 336
>gi|47026847|gb|AAT08649.1| unknown [Hyacinthus orientalis]
Length = 254
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 152/218 (69%), Positives = 181/218 (83%), Gaps = 2/218 (0%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLIN 95
D +WW SFW KN + G+ IP DEE FKYFFRVS+ TF+YICSLVR+DLVSRPPSGLIN
Sbjct: 35 DDEWWFSFWNKNCDS-GSNIPLDEEAAFKYFFRVSRPTFEYICSLVRDDLVSRPPSGLIN 93
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKW 155
IEGRLLSVEKQVAIA+RRLASGESQVSVG AFGVGQSTVSQVTWRF+E++EERA+HH++W
Sbjct: 94 IEGRLLSVEKQVAIAMRRLASGESQVSVGGAFGVGQSTVSQVTWRFVESMEERARHHMRW 153
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
P+ RME+IK++ E +FGL NCCG+IDATHI+MTLPAV++SDDWCDQE NYSM +QGIVD
Sbjct: 154 PEPERMEQIKARLEAAFGLPNCCGSIDATHIVMTLPAVESSDDWCDQERNYSMFLQGIVD 213
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFK-LCEAGQRL 252
E +VTGWPG M RL++ G + + + G RL
Sbjct: 214 DEDEGSYMVTGWPGSMTFFRLVEVLGLLQTVPQPGSRL 251
>gi|296086277|emb|CBI31718.3| unnamed protein product [Vitis vinifera]
Length = 387
Score = 320 bits (821), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 160/305 (52%), Positives = 205/305 (67%), Gaps = 2/305 (0%)
Query: 90 PSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA 149
P+ G LS+ +VAIALRRL+SGES VG +FGV QSTV+Q+TW+F+E++E R
Sbjct: 5 PANFSASNGNPLSLNDRVAIALRRLSSGESFSIVGDSFGVNQSTVAQITWKFVESMEVRG 64
Query: 150 KHHLKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYS 207
HHL+WP + MEE KS FE+ GL NCCGA+D THI M+L + TS+ W D + +S
Sbjct: 65 LHHLQWPSTEADMEETKSSFEKIRGLPNCCGAVDTTHISMSLSTMGTSNKVWIDHQKKHS 124
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE 267
M++Q IVD EMRF D++TGWPG ++ S +L+ SGFFKL E G+RLNG + SE LRE
Sbjct: 125 MILQAIVDPEMRFRDVITGWPGSLSESCVLQNSGFFKLSEEGKRLNGKMIELSEGTYLRE 184
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
YIVG G+PLL WL+TPY+ GLS FN H ATR +A +A +LK WRI+ VMW
Sbjct: 185 YIVGDAGFPLLPWLLTPYQGKGLSDFQSEFNKRHFATRMVARRALARLKEMWRIIQGVMW 244
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTREN 387
RPD KLP IILVCCLLHNI+ID D + ++ S HD GY ++ C+ VD REN
Sbjct: 245 RPDTNKLPRIILVCCLLHNIVIDLEDGVQDEMPFSYKHDPGYRQRSCESVDKTAFIQREN 304
Query: 388 LEKHL 392
L +
Sbjct: 305 LSLYF 309
>gi|115440939|ref|NP_001044749.1| Os01g0838900 [Oryza sativa Japonica Group]
gi|15623822|dbj|BAB67881.1| unknown protein [Oryza sativa Japonica Group]
gi|113534280|dbj|BAF06663.1| Os01g0838900 [Oryza sativa Japonica Group]
gi|215707022|dbj|BAG93482.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619512|gb|EEE55644.1| hypothetical protein OsJ_04014 [Oryza sativa Japonica Group]
Length = 396
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 170/397 (42%), Positives = 233/397 (58%), Gaps = 15/397 (3%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEE 60
MAP + +K +K ++ K L + AP + DWW F + A S +
Sbjct: 1 MAPLRGAKRRK-RQPEKALPAAGQAMPAP----APGGDWWDGF----ARRLAAGQFSKDC 51
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI---EGRLLSVEKQVAIALRRLASG 117
+ F+ F++S+KTFDY+CSL+ D +R N + +L VE QV +AL RL +G
Sbjct: 52 QNFESVFKMSRKTFDYLCSLISGDF-TRKTQSFRNFRFGDKAILGVEDQVGVALLRLTTG 110
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
ES +S+G FG+ S +S +TW+FIEALEERA +HLKWP M +KSKFE+ GL NC
Sbjct: 111 ESLLSIGNRFGMNHSAISNITWKFIEALEERAANHLKWPTPEEMATVKSKFEKIQGLPNC 170
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
CGAID THI+M A S+ W D EN SM++Q IVD +MRF D+V+GWPG +N S +L
Sbjct: 171 CGAIDTTHILMCSSAQPNSNVWLDGENRNSMVLQAIVDADMRFRDVVSGWPGSLNDSCIL 230
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVE--LREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ SGF++LCE G RL+G + E +R+YI+G YPLL WL+TPY LS +
Sbjct: 231 RTSGFYRLCEKGARLDGQTELPGEPAGSVVRDYILGDASYPLLPWLMTPYREKDLSPAKA 290
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
FN H AT + A +LK W++L +WRPDK +LP II VCCLL NI+ID D
Sbjct: 291 DFNKRHAATIMVVQGALAKLKERWQVLKGELWRPDKHRLPRIIYVCCLLTNIMIDLEDAA 350
Query: 356 HPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ S +HD GY +Q D R+ L +++
Sbjct: 351 RGGMPPSHNHDDGYRQQFSDVADVGAAALRDQLCQYV 387
>gi|218189338|gb|EEC71765.1| hypothetical protein OsI_04365 [Oryza sativa Indica Group]
Length = 396
Score = 317 bits (812), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 169/397 (42%), Positives = 233/397 (58%), Gaps = 15/397 (3%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEE 60
MAP + +K +K ++ K L + AP + DWW F + A S +
Sbjct: 1 MAPLRGAKRRK-RQPEKALPAAGQAMPAP----APGGDWWDGF----ARRLAAGQFSKDC 51
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI---EGRLLSVEKQVAIALRRLASG 117
+ F+ F++S+KTFDY+CSL+ D +R N + +L VE QV +AL RL +G
Sbjct: 52 QNFESVFKMSRKTFDYLCSLISGDF-TRKTQSFRNFRFGDKAILGVEDQVGVALLRLTTG 110
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
ES +S+G FG+ S +S +TW+FIEALEERA +HLKWP M +K+KFE+ GL NC
Sbjct: 111 ESLLSIGNRFGMNHSAISNITWKFIEALEERAANHLKWPTPEEMATVKAKFEKIQGLPNC 170
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
CGAID THI+M A S+ W D EN SM++Q IVD +MRF D+V+GWPG +N S +L
Sbjct: 171 CGAIDTTHILMCSSAQPNSNVWLDGENRNSMVLQAIVDADMRFRDVVSGWPGSLNDSCIL 230
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVE--LREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ SGF++LCE G RL+G + E +R+YI+G YPLL WL+TPY LS +
Sbjct: 231 RTSGFYRLCEKGARLDGQTELPGEPAGSVVRDYILGDASYPLLPWLMTPYREKDLSPAKA 290
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
FN H AT + A +LK W++L +WRPDK +LP II VCCLL NI+ID D
Sbjct: 291 DFNKRHAATIMVVQGALAKLKERWQVLKGELWRPDKHRLPRIIYVCCLLTNIMIDLEDAA 350
Query: 356 HPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ S +HD GY +Q D R+ L +++
Sbjct: 351 RGGMPPSHNHDDGYRQQFSDVADVGAAALRDQLCQYV 387
>gi|212722242|ref|NP_001132690.1| Transposon protein CACTA, En/Spm sub-class [Zea mays]
gi|194695102|gb|ACF81635.1| unknown [Zea mays]
gi|195627614|gb|ACG35637.1| transposon protein CACTA, En/Spm sub-class [Zea mays]
gi|413945623|gb|AFW78272.1| Transposon protein CACTA, En/Spm sub-class [Zea mays]
Length = 404
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 163/397 (41%), Positives = 234/397 (58%), Gaps = 12/397 (3%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEE 60
MAP + +K +K + D SDWW F+ + G + E
Sbjct: 1 MAPVRGAKKRKRPEKPAPAPTPGLPLPPLPD----GSDWWGVFYRR---VAGHSPFPREH 53
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI---EGRLLSVEKQVAIALRRLASG 117
+ + ++S++TFDYICSLV++DL ++ G N + ++L VE QVA+AL RL +G
Sbjct: 54 QNMESVLKMSRRTFDYICSLVKKDLTTKT-YGFRNFRFGDKKVLGVEDQVAVALMRLTTG 112
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
ES ++G+ FG+ S +S +TWRFIE++EERA HLKWP M IK++F++ +GL NC
Sbjct: 113 ESLQNIGMWFGMNHSAISNITWRFIESVEERAICHLKWPSPEEMATIKTRFDKVYGLPNC 172
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
CGAID THI+M A S W D EN SM++Q +VD ++RF D+V+GWPG M+ + +L
Sbjct: 173 CGAIDTTHILMCSSAQPNSKVWLDNENKNSMVLQAVVDADLRFRDVVSGWPGSMDDTCIL 232
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
+ SG ++LCE G RL G + + +REY+VG YPLL WL+TPY +GL A+ F
Sbjct: 233 RTSGLYRLCEKGLRLGGQMELPGGSA-VREYLVGDASYPLLPWLMTPYREHGLQAAKVEF 291
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
N H A ++ A LKG WR++ +WRPDK +LP II VCCL+ NIIID
Sbjct: 292 NKRHTAAAAVVQTALATLKGRWRVIQGELWRPDKHRLPRIIFVCCLITNIIIDMEGTPSR 351
Query: 358 DVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQH 394
D +S +HD GY +Q D R++L +H+ +
Sbjct: 352 DELVSGNHDLGYKQQFSDAADDNAIKQRDDLSRHVDN 388
>gi|147767136|emb|CAN67325.1| hypothetical protein VITISV_012830 [Vitis vinifera]
Length = 942
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 156/318 (49%), Positives = 208/318 (65%), Gaps = 12/318 (3%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F+ F++S+KTF+YIC+LV+ED++++P I GR + + QVA+ALRRL+SG+S ++
Sbjct: 554 FESVFKISRKTFNYICALVKEDMMAKP-GNFIFTNGRPMCLNDQVAVALRRLSSGDSLLT 612
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAI 181
+G AFG+ STVSQVTWRF+E +EERA HHL+WP + + EI SKFE+ GL NCCGAI
Sbjct: 613 IGDAFGLNHSTVSQVTWRFVEIMEERALHHLQWPSTEPEITEITSKFEKIRGLPNCCGAI 672
Query: 182 DATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
D THI+M LP+ +++ W D G D R + G+ + KF
Sbjct: 673 DTTHIMMCLPSADSANSVWLDXXC-------GPRDEVSRHSYWMARQNEGLFSAPEFKFP 725
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
FKLCE GQRLNG +E E+ EYIVG GYPLL WL+TPY+ LS S FN
Sbjct: 726 --FKLCEKGQRLNGKKIELAEGSEIXEYIVGDSGYPLLPWLVTPYQGKELSESKAEFNRR 783
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVA 360
H ATR +A +A +LK W+++ VMWRPDK +LP IILVCCLLHNI+ID D++ ++
Sbjct: 784 HFATRMVAQRALARLKEMWKVIQGVMWRPDKNRLPRIILVCCLLHNIVIDLEDEVQDEMP 843
Query: 361 LSDHHDSGYGEQCCKQVD 378
LS HHD GY +Q C+ D
Sbjct: 844 LSHHHDLGYRQQICESAD 861
>gi|357125791|ref|XP_003564573.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 396
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 161/377 (42%), Positives = 219/377 (58%), Gaps = 15/377 (3%)
Query: 1 MAPPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEE 60
MAP + +K +K ++ K L P D +DWW +F + A S +
Sbjct: 1 MAPLRGAKRRK-RQPEKALPAGVMAPMPPPDA----ADWWDTF----ARRLAAGHKSKDC 51
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI---EGRLLSVEKQVAIALRRLASG 117
F+ F++S+ TF+YICSL+ D +R N + +L VE QVA+AL RL +G
Sbjct: 52 PDFESVFKMSRGTFNYICSLISGDF-TRKTQSFRNFRFGDKTILGVEDQVAVALLRLTTG 110
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
E +S+G FG+ S +S +TW+FIE+LE+ HLKWP M IK+KFE+ GL NC
Sbjct: 111 EPLLSIGNRFGMNHSAISNITWKFIESLEDHGIRHLKWPGPEEMATIKAKFEKLQGLPNC 170
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
CGAID THI+M A S+ W D EN SM++Q IVD +MRF DIV+GWPG ++ S +L
Sbjct: 171 CGAIDTTHILMCSSAQPNSNVWLDGENRNSMVLQAIVDPDMRFRDIVSGWPGSLDDSCIL 230
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVE--LREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ SGF+KLCE G RL + + E +REYI+G YPLL WL+TPY+ + S +
Sbjct: 231 RTSGFYKLCEKGARLEEQMELPGEPAGSVVREYIIGDASYPLLPWLMTPYQEHDPSPAQV 290
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
FN H A R + A +LK W++L +WRPDK +LP II CCLL NI+ID D
Sbjct: 291 EFNKRHTAARMVVQGAMARLKERWQVLKGELWRPDKHRLPRIIYACCLLTNIMIDLEDTQ 350
Query: 356 HPDVALSDHHDSGYGEQ 372
S +HD GY +Q
Sbjct: 351 RDRTPASHNHDVGYMQQ 367
>gi|255641134|gb|ACU20845.1| unknown [Glycine max]
Length = 195
Score = 290 bits (742), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 136/168 (80%), Positives = 148/168 (88%)
Query: 25 TSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVRED 84
S P +P +S++DWW SFW KNS+ PG T+P DE EGFKYFFRVSK TF+YICSLVRED
Sbjct: 23 VSLVPVEPRTSETDWWDSFWHKNSTVPGYTVPGDEAEGFKYFFRVSKTTFEYICSLVRED 82
Query: 85 LVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA 144
L+SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG AFGVGQSTVSQVTWRFIEA
Sbjct: 83 LISRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGAAFGVGQSTVSQVTWRFIEA 142
Query: 145 LEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPA 192
LEERAKHHL WPD N+M+EIK FE S+GL NCCGAIDATHI+MTLP
Sbjct: 143 LEERAKHHLNWPDFNQMQEIKLGFEASYGLPNCCGAIDATHIMMTLPG 190
>gi|194696338|gb|ACF82253.1| unknown [Zea mays]
gi|413945624|gb|AFW78273.1| hypothetical protein ZEAMMB73_884484 [Zea mays]
Length = 285
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 122/268 (45%), Positives = 168/268 (62%), Gaps = 1/268 (0%)
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
FG+ S +S +TWRFIE++EERA HLKWP M IK++F++ +GL NCCGAID THI
Sbjct: 3 FGMNHSAISNITWRFIESVEERAICHLKWPSPEEMATIKTRFDKVYGLPNCCGAIDTTHI 62
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+M A S W D EN SM++Q +VD ++RF D+V+GWPG M+ + +L+ SG ++LC
Sbjct: 63 LMCSSAQPNSKVWLDNENKNSMVLQAVVDADLRFRDVVSGWPGSMDDTCILRTSGLYRLC 122
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
E G RL G + + +REY+VG YPLL WL+TPY +GL A+ FN H A +
Sbjct: 123 EKGLRLGGQMELPGGSA-VREYLVGDASYPLLPWLMTPYREHGLQAAKVEFNKRHTAAAA 181
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHD 366
+ A LKG WR++ +WRPDK +LP II VCCL+ NIIID D +S +HD
Sbjct: 182 VVQTALATLKGRWRVIQGELWRPDKHRLPRIIFVCCLITNIIIDMEGTPSRDELVSGNHD 241
Query: 367 SGYGEQCCKQVDPMGRTTRENLEKHLQH 394
GY +Q D R++L +H+ +
Sbjct: 242 LGYKQQFSDAADDNAIKQRDDLSRHVDN 269
>gi|295829883|gb|ADG38610.1| AT3G55350-like protein [Capsella grandiflora]
gi|295829885|gb|ADG38611.1| AT3G55350-like protein [Capsella grandiflora]
gi|295829887|gb|ADG38612.1| AT3G55350-like protein [Capsella grandiflora]
Length = 210
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 115/208 (55%), Positives = 149/208 (71%), Gaps = 3/208 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
+VA+ALRRL SGES +G FG+ QSTVSQ+TWRF+E++EERA HHL+WP ++++EIK
Sbjct: 4 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHLQWP--SKLDEIK 61
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIV 224
SKFE+ GL NCCGAID THI+M LPAV+ S+ W D E N+SM++Q +VD +MRF+D++
Sbjct: 62 SKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDVI 121
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
GWPG ++ +LK SGF+KL E G+RLNG SE ELREYI+G G+PLL WL+TP
Sbjct: 122 AGWPGSLSDDVVLKNSGFYKLVEKGKRLNGEKLHLSERTELREYIIGDSGFPLLPWLLTP 181
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAF 312
Y+ LS FN H R A A
Sbjct: 182 YQGKPLSLPQTEFNKRHSEARKAAQMAM 209
>gi|295829881|gb|ADG38609.1| AT3G55350-like protein [Capsella grandiflora]
Length = 210
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 115/208 (55%), Positives = 149/208 (71%), Gaps = 3/208 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
+VA+ALRRL SGES +G FG+ QSTVSQ+TWRF+E++EERA HHL+WP ++++EIK
Sbjct: 4 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHLQWP--SKLDEIK 61
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIV 224
SKFE+ GL NCCGAID THI+M LPAV+ S+ W D E N+SM++Q +VD +MRF+D++
Sbjct: 62 SKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDVI 121
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
GWPG ++ +LK SGF+KL E G+RLNG SE ELREYI+G G+PLL WL+TP
Sbjct: 122 AGWPGSLSDDVVLKNSGFYKLVEKGKRLNGXKLHLSERTELREYIIGDSGFPLLPWLLTP 181
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAF 312
Y+ LS FN H R A A
Sbjct: 182 YQGKPLSLPQTEFNKRHSEARKAAQMAM 209
>gi|345291409|gb|AEN82196.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291411|gb|AEN82197.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291415|gb|AEN82199.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291417|gb|AEN82200.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291419|gb|AEN82201.1| AT3G55350-like protein, partial [Capsella rubella]
gi|345291423|gb|AEN82203.1| AT3G55350-like protein, partial [Capsella rubella]
Length = 205
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 114/204 (55%), Positives = 148/204 (72%), Gaps = 3/204 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
+VA+ALRRL SGES +G FG+ QSTVSQ+TWRF+E++EERA HHL+WP ++++EIK
Sbjct: 3 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHLQWP--SKLDEIK 60
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIV 224
SKFE+ GL NCCGAID THI+M LPAV+ S+ W D E N+SM++Q +VD +MRF+D++
Sbjct: 61 SKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDVI 120
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
GWPG ++ +LK SGF+KL E G+RLNG SE ELREYI+G G+PLL WL+TP
Sbjct: 121 AGWPGSLSDDVVLKNSGFYKLVEKGKRLNGEKLHLSERTELREYIIGDSGFPLLPWLLTP 180
Query: 285 YETNGLSASMPTFNSLHEATRSLA 308
Y+ LS FN H R A
Sbjct: 181 YQGKPLSLPQTEFNKRHSEARKAA 204
>gi|345291421|gb|AEN82202.1| AT3G55350-like protein, partial [Capsella rubella]
Length = 205
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 114/204 (55%), Positives = 148/204 (72%), Gaps = 3/204 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
+VA+ALRRL SGES +G FG+ QSTVSQ+TWRF+E++EERA HHL+WP ++++EIK
Sbjct: 3 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHLQWP--SKLDEIK 60
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIV 224
SKFE+ GL NCCGAID THI+M LPAV+ S+ W D E N+SM++Q +VD +MRF+D++
Sbjct: 61 SKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDVI 120
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
GWPG ++ +LK SGF+KL E G+RLNG SE ELREYI+G G+PLL WL+TP
Sbjct: 121 AGWPGSLSDDVVLKNSGFYKLVEKGKRLNGEKFHLSERTELREYIIGDSGFPLLPWLLTP 180
Query: 285 YETNGLSASMPTFNSLHEATRSLA 308
Y+ LS FN H R A
Sbjct: 181 YQGKPLSLPQTEFNKRHSEARKAA 204
>gi|295829889|gb|ADG38613.1| AT3G55350-like protein [Capsella grandiflora]
Length = 210
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 114/208 (54%), Positives = 148/208 (71%), Gaps = 3/208 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
+VA+ALRRL SGES +G FG+ QSTVSQ+TWRF+E++EERA HHL+WP ++++EIK
Sbjct: 4 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHLQWP--SKLDEIK 61
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIV 224
SKFE+ GL NCCGAID THI+M LPAV+ S+ W D E N+SM++Q +VD +MRF+D++
Sbjct: 62 SKFEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDVI 121
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
GWPG ++ +LK SGF+KL E G+RLNG E ELREYI+G G+PLL WL+TP
Sbjct: 122 AGWPGSLSDDVVLKNSGFYKLVEKGKRLNGEKXHLXERTELREYIIGDSGFPLLPWLLTP 181
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAF 312
Y+ LS FN H R A A
Sbjct: 182 YQGKPLSLPQTEFNKRHSEARKAAQMAM 209
>gi|345291413|gb|AEN82198.1| AT3G55350-like protein, partial [Capsella rubella]
Length = 205
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 114/204 (55%), Positives = 148/204 (72%), Gaps = 3/204 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
+VA+ALRRL SGES +G FG+ QSTVSQ+TWRF+E++EERA HHL+WP ++++EIK
Sbjct: 3 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHLQWP--SKLDEIK 60
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIV 224
SKFE+ GL NCCGAID THI+M LPAV+ S+ W D E N+SM++Q +VD +MRF+D++
Sbjct: 61 SKFEKISGLPNCCGAIDITHIVMNLPAVEPSNRVWLDGEKNFSMILQAVVDPDMRFLDVI 120
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
GWPG ++ +LK SGF+KL E G+RLNG SE ELREYI+G G+PLL WL+TP
Sbjct: 121 AGWPGSLSDDVVLKNSGFYKLVEKGKRLNGEKFHLSERTELREYIIGDSGFPLLPWLLTP 180
Query: 285 YETNGLSASMPTFNSLHEATRSLA 308
Y+ LS FN H R A
Sbjct: 181 YQGKPLSLPQTEFNKRHSEARKAA 204
>gi|295829891|gb|ADG38614.1| AT3G55350-like protein [Neslia paniculata]
Length = 210
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 116/208 (55%), Positives = 147/208 (70%), Gaps = 3/208 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
+VA+ALRRL SGES +G FG+ QSTVSQ+TWRF+E++EERA HHL WP +++EIK
Sbjct: 4 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAMHHLSWP--LKLDEIK 61
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIV 224
SKFE GL NCCGAID THI+M LPAV+ S+ W D E N+SM++Q +VD +MRF+D++
Sbjct: 62 SKFERISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGEKNFSMILQAVVDPDMRFLDVI 121
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
GWPG ++ +LK SGF+KL E G+RLNG + SE ELREYIVG G+PLL WL+TP
Sbjct: 122 AGWPGSLSDDVVLKNSGFYKLVEKGKRLNGEKFLLSERGELREYIVGDSGFPLLPWLLTP 181
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAF 312
Y+ LS FN H R A A
Sbjct: 182 YQGKPLSLPQTEFNKRHSEARKAAQMAL 209
>gi|357114568|ref|XP_003559072.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 441
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/356 (37%), Positives = 192/356 (53%), Gaps = 51/356 (14%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPS-GLINIEGRLLSVEKQVAIALRRLASG 117
+ +G + ++ +KTF+YICS V D++ R S + GR+L +E QVA+ALR L SG
Sbjct: 3 DTQGCESVLKMRRKTFNYICSFVNNDMMVRDSSYTFADGTGRVLCLEDQVAVALRMLNSG 62
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
E+ +G + GV +STVS T RF++A+ ERA+HH+KWP S+ ME++KSKF++ GL NC
Sbjct: 63 ETLEILGSSVGVNESTVSLETQRFVDAIRERARHHMKWPGSDEMEKVKSKFDKIHGLSNC 122
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
CG + HI + CD E N +L+Q VD +MRF +I W N LL
Sbjct: 123 CGVVHTIHIPF-------GSENCDHERNLCVLMQAFVDPDMRFRNIWQAWSDRTNQLNLL 175
Query: 238 KFSGFFKLCEAGQRLNGN-VRISSEEVELREYIVGGVGYPLLSWLITPYETNGL---SAS 293
S FK CE G LNG+ +++SSE E+ EY++G GYPLL WL+TP++ G S S
Sbjct: 176 HDSELFKECEKGAWLNGSKLKVSSEGSEVGEYVIGNAGYPLLPWLLTPHQQKGTDDHSDS 235
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILS----KV---MWRPDKRKLPSIILVCCLLHN 346
FN A S+A+KA +L W+ L K+ MW+ I CC+LHN
Sbjct: 236 QVEFNRRQSAAVSVALKALARLTDTWKCLHGGSPKIPCEMWKA--------IQACCMLHN 287
Query: 347 IIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQEKDLVK 402
I+ID + PMGR + N + ++ E+D V+
Sbjct: 288 IVIDMEEA------------------------PMGRDYKVNYSEQVRRIAEEDAVR 319
>gi|357151003|ref|XP_003575650.1| PREDICTED: uncharacterized protein LOC100833301 [Brachypodium
distachyon]
Length = 698
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 184/319 (57%), Gaps = 27/319 (8%)
Query: 43 FWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLS 102
F ++S + G ++ + F+ +++++TF YICSLV+ + + + I+GR+L
Sbjct: 341 FIGRSSQSQGTLYLFEDSQMFESVLKMTRRTFSYICSLVKVPSL-KDMNNYTFIDGRVLC 399
Query: 103 VEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRME 162
+E +VA+AL L +G++ VG + GV +STVS VT RF++A+ ERA+HH+KWP S ME
Sbjct: 400 LEDRVAVALIMLNAGQTLDDVGSSVGVNKSTVSLVTERFVDAMRERARHHMKWPGSGEME 459
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
+KSKF++ GL NCCG + +HI + CD E N+ +L+Q +VD +MRF +
Sbjct: 460 NVKSKFDKILGLPNCCGVVHTSHIPF-------GSENCDHERNFCVLMQAVVDPDMRFRN 512
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN-VRISSEEVELREYIVGGVGYPLLSWL 281
I GW N LL S FK CE G LNG+ + +SSE E+ EY++G GYPLL WL
Sbjct: 513 IWQGWSDRTNQLGLLHNSELFKECENGAWLNGSKLEVSSEGSEVGEYVIGDAGYPLLPWL 572
Query: 282 ITPYE---TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK-------VMWRPDK 331
+TPY+ T+ S FN H S A+KA +L W+ L + MW+
Sbjct: 573 LTPYQQKDTDDQLDSQVEFNRRHSVAVSFALKALARLTDTWKCLHRGSPKIPCEMWKA-- 630
Query: 332 RKLPSIILVCCLLHNIIID 350
I CC+LHNI+ID
Sbjct: 631 ------IQACCMLHNIVID 643
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/304 (41%), Positives = 179/304 (58%), Gaps = 26/304 (8%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVR----EDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
+ + F+ +++++TF YICSLV+ ED+ S + +GR+L +E +VAIAL L
Sbjct: 15 DAQRFESVLKMTRRTFSYICSLVKVQSLEDMNSYTFT-----DGRVLCLEDRVAIALIML 69
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
SGE +V ++ GV +ST+S VT RFI+A E+A HHL WP S+ +E++KS F++ GL
Sbjct: 70 NSGEPLEAVALSVGVNESTISLVTERFIDATWEQADHHLNWPGSSEIEKVKSMFDKIHGL 129
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
NCCG I THII C+ E N ++VQ +VD +MRF +I G MN
Sbjct: 130 PNCCGIICTTHIIF-------GSQNCNNETNDDIIVQVVVDPDMRFNNIWLGGSDAMNQM 182
Query: 235 RLLKFSGFFKLCEAGQRLNGN-VRIS---SEEVELREYIVGGVGYPLLSWLITPYETNGL 290
LL S FFK C+AG +NG+ + +S SEEVE EY++G GYPL WL+TPY+ +
Sbjct: 183 SLLHDSQFFKECDAGALVNGSKLELSSNGSEEVE--EYVIGAAGYPLRPWLLTPYKQQNM 240
Query: 291 SA---SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP-DKRKLPSIILVCCLLHN 346
S FN H A + +K +LK W+ L MW P D ++L +I VC LHN
Sbjct: 241 DELLDSQVEFNRRHSAALTFVLKVLARLKDTWKCLHGGMWHPEDPKELSRVIYVCITLHN 300
Query: 347 IIID 350
I+ID
Sbjct: 301 IVID 304
>gi|357150977|ref|XP_003575641.1| PREDICTED: uncharacterized protein LOC100827806 [Brachypodium
distachyon]
Length = 705
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 129/340 (37%), Positives = 189/340 (55%), Gaps = 20/340 (5%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVR----EDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
E + F++ +++ TFDY+CSLVR ED+++R +GR+L ++ +VA+ALR L
Sbjct: 3 ETQRFEHLYKMKITTFDYVCSLVRVPFLEDMMARDHRF---ADGRVLCLQDRVAVALRML 59
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
SGE +VG + G+ +S V +T F++A+ E+A HHL WP SNR+E IK KF++ GL
Sbjct: 60 NSGEPPETVGSSLGMNKSIVLLITKSFVDAMWEKAMHHLDWPGSNRIENIKYKFDKIHGL 119
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
NCCG + HI Q S D E N ML+Q +VD +MRF +I G MN S
Sbjct: 120 PNCCGVVHTDHITF---ESQNS----DHEVNAGMLMQAVVDTDMRFTNIWLGSSSNMNQS 172
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+L S FK CE LNG+ S+ ++ EYI+G G+PLL WL+T ++ N LS
Sbjct: 173 SILHDSVLFKHCEKDTWLNGSKLNLSDGRQVGEYIIGDAGFPLLPWLLTSFQENDLSDYQ 232
Query: 295 PTFNSLHEATRSLAV-KAFLQLKGGWRILSKVMWRPDKRKLPS-IILVCCLLHNIIIDSG 352
FN H ++ + KA +LK W+ L +WRP+ + P +I CC+LHNI+ID
Sbjct: 233 VEFNRRHSQAMTITLTKALAKLKDTWKFLHGGVWRPENQFEPMWVIYACCMLHNIVIDK- 291
Query: 353 DQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ + Y +Q + VD R+ L +HL
Sbjct: 292 ---ECGTGMGSYQKVNYSQQVHQLVDEDPVMVRDVLSQHL 328
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/309 (40%), Positives = 185/309 (59%), Gaps = 21/309 (6%)
Query: 51 PGATIPSDEE-EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAI 109
PG T+ E+ + + ++ ++TF+Y+C L++E + + + RL S+E++VAI
Sbjct: 335 PGGTLSIFEDAQTLESVLKMPRRTFNYVCGLLKESSL-EIMNDYFFFDMRLFSLEERVAI 393
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFE 169
AL L SG+ +VG GV +STV VT F++A+ ERA+HHL+WP S+ ME++KS F+
Sbjct: 394 ALIMLNSGDPPATVGSFIGVNESTVPLVTKSFVDAMLERAQHHLRWPQSDEMEKMKSMFD 453
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI-VTGWP 228
E GL NCCG + TH+ S W D + S ++QG++ +MRF I V P
Sbjct: 454 EIHGLPNCCGVLHTTHV------TSASRSW-DHFDKDSFVLQGVIAPDMRFTSIWVAPRP 506
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGN-VRISSEEVELREYIVGGVGYPLLSWLITPYE- 286
+ S L S F+ CE G LNG+ ++++SEEV EY++G VGYPLL WL+TPY
Sbjct: 507 ANTSQSSFLHDSNLFEYCEKGAWLNGSKLKVASEEV--GEYVIGDVGYPLLPWLLTPYYQ 564
Query: 287 -TNGLSASMP---TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDK-RKLPSIILVC 341
N LS +P FNS H A +++A+K +L+G W+ + WRPD R++ I C
Sbjct: 565 LQNDLS-DIPYQVEFNSRHSAVKNIALKVLARLEGTWKSMHG-EWRPDTPREMSRAIHAC 622
Query: 342 CLLHNIIID 350
C+LHNI+ID
Sbjct: 623 CILHNIVID 631
>gi|357150994|ref|XP_003575647.1| PREDICTED: putative disease resistance protein RGA4-like
[Brachypodium distachyon]
Length = 1742
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 184/319 (57%), Gaps = 27/319 (8%)
Query: 43 FWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLS 102
F ++S + G ++ + F+ +++++TF YICSLV+ + + + I+GR+L
Sbjct: 1363 FIGRSSQSQGTLYLFEDSQMFESVLKMTRRTFSYICSLVKVPSL-KDMNNYTFIDGRVLC 1421
Query: 103 VEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRME 162
+E ++A+AL L +G++ VG + GV +STVS VT RF++A+ ERA+HH WP S ME
Sbjct: 1422 LEDRIAVALIMLNAGQTLEDVGSSVGVNKSTVSLVTERFVDAMRERARHHRTWPGSGEME 1481
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
++KSKF++ FGL NCCG + THI + CD E N L+Q +VD +MRF +
Sbjct: 1482 KVKSKFDKMFGLPNCCGVVHTTHIPF-------GSEHCDHERNEYELMQTVVDPDMRFRN 1534
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN-VRISSEEVELREYIVGGVGYPLLSWL 281
I G P M+ LL S F +CE G LNG+ +++SSE E+ EY++G GYPLL WL
Sbjct: 1535 IWFGLPDSMSQLSLLHDSELFMVCEKGAWLNGSKLKVSSEGSEVGEYVIGDAGYPLLPWL 1594
Query: 282 ITPYE---TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK-------VMWRPDK 331
+TPY+ T+ S FN H S A+KA +L W+ L + MW+
Sbjct: 1595 LTPYQQKDTDDQLDSQVEFNRRHSVAVSFALKALARLTDTWKCLHRGSLKIPCEMWKA-- 1652
Query: 332 RKLPSIILVCCLLHNIIID 350
I CC+LHNI+ID
Sbjct: 1653 ------IQACCMLHNIVID 1665
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 129/305 (42%), Positives = 183/305 (60%), Gaps = 26/305 (8%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVR----EDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
++ + F++ +++++TF YICSLV+ ED+ S + +GR+L +E +VAIAL
Sbjct: 1036 EDAQRFEFVLKMTRRTFSYICSLVKVQSLEDMNSYTFT-----DGRVLCLEDRVAIALIM 1090
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFG 173
L SGE +V ++ GV +ST+S VT RFI+A E+A HHL WP S+ +E++KS F++ G
Sbjct: 1091 LNSGEPLEAVALSVGVNESTISLVTERFIDATWEQADHHLNWPGSSEIEKVKSMFDKIHG 1150
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
L NCCG I THII CD E N ++VQ +VD +MRF +I G MN
Sbjct: 1151 LPNCCGIICTTHIIF-------GSQNCDNETNDDIIVQVVVDPDMRFNNIWLGGSDAMNQ 1203
Query: 234 SRLLKFSGFFKLCEAGQRLNGN-VRIS---SEEVELREYIVGGVGYPLLSWLITPY---E 286
LL S FFK C+AG +NG+ + +S SEEVE EY++G GYPL WL+TPY +
Sbjct: 1204 MSLLHDSEFFKECDAGALVNGSKLELSSNGSEEVE--EYVIGAEGYPLRPWLLTPYKQQD 1261
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP-DKRKLPSIILVCCLLH 345
+ L S FN H A + +K +LK W+ L MW P D ++L +I VC +LH
Sbjct: 1262 MDELLDSQVEFNRRHSAALTFVLKVLARLKDTWKCLHGGMWHPEDPKELCRVIYVCIMLH 1321
Query: 346 NIIID 350
NI+ID
Sbjct: 1322 NIVID 1326
>gi|357150988|ref|XP_003575645.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 407
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 191/343 (55%), Gaps = 23/343 (6%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
++ + K ++ ++TFDY+CSL++E + + + R S+E++VAIAL L SG
Sbjct: 45 EDAQTLKSVLKMPRRTFDYVCSLLKESSLEIMNNYFL-FYVRFFSLEERVAIALIMLNSG 103
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
+S VG GV +STVS VT F A+ ERAK HL+WP S+ +E+IKSKF+E GL NC
Sbjct: 104 DSPAIVGSFIGVDESTVSMVTKSFAGAMLERAKRHLRWPQSDEIEKIKSKFDEIHGLPNC 163
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP-GGMNVSRL 236
CG + THI S W D ++ S++ QG+V +MRF I+ G G N
Sbjct: 164 CGVLHTTHI------TSVSRSW-DHQDKDSIVFQGVVGPDMRFTSILVGLRVGNTNQLSF 216
Query: 237 LKFSGFFKLCEAGQRLNGN-VRISSEEVELREYIVGGVGYPLLSWLITPY-----ETNGL 290
L S FK CE G LNGN +++SSEEV EY++G VGYPL WL+TPY + +
Sbjct: 217 LHDSMLFKECEKGAWLNGNKLKVSSEEVG--EYVIGDVGYPLRPWLLTPYDELQNDLPNI 274
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDK-RKLPSIILVCCLLHNIII 349
S+ FN H A R++A+KA + + W+I+ WRPD ++ I VCC LHNI+I
Sbjct: 275 SSYQAEFNRRHSAARNIALKAMARFESTWKIMHS-EWRPDNPSEMTRAISVCCRLHNIVI 333
Query: 350 DSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
D + + + Y +Q + D R+ L +HL
Sbjct: 334 D----MEEGAGMLSDREVNYSDQVRQLADEDAVRMRDILSQHL 372
>gi|357150951|ref|XP_003575633.1| PREDICTED: uncharacterized protein LOC100824418 [Brachypodium
distachyon]
Length = 2089
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/349 (37%), Positives = 194/349 (55%), Gaps = 23/349 (6%)
Query: 52 GATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIAL 111
G ++ + + ++ ++TFDYICSL++E + + + R S+E++VAIAL
Sbjct: 1721 GTLTIFEDAQRLESVLKMPRRTFDYICSLLKESSLE-IMNNYFFFDMRFFSLEERVAIAL 1779
Query: 112 RRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES 171
L SG+ +VG GV +STVS VT F++A+ ERAK+H+ WP S+ E++KSKF +
Sbjct: 1780 IMLNSGDPPATVGSFLGVNESTVSLVTKSFVDAMYERAKYHVYWPQSDEREKMKSKFAKI 1839
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGW-PGG 230
GL NCCG + TH+ PA S W D ++ S+L QG+V +MRF I+ G G
Sbjct: 1840 HGLPNCCGVLHTTHVS---PA---SRSW-DHQDKDSILFQGVVGPDMRFTSILVGLQAGN 1892
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGN-VRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
N L S FK CE G LNGN +++SSEEV EY++G GYPLL WL+TPYE
Sbjct: 1893 TNQLSFLHDSMLFKECEKGAWLNGNKLKVSSEEVG--EYVIGDAGYPLLPWLLTPYELQN 1950
Query: 290 ----LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDK-RKLPSIILVCCLL 344
+S+ FN H + R++ +KA + + W+I+ WRPD R++ I VCC L
Sbjct: 1951 DLPDISSYQAEFNRRHSSARNITLKAMARFESTWKIMHS-EWRPDNPREMTRAINVCCRL 2009
Query: 345 HNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV-DPMGRTTRENLEKHL 392
HNI+I + + + Y + +Q+ D R+ L +HL
Sbjct: 2010 HNIVIG----MEEGARMPSEQEVNYSDDQVRQLADEDAVRMRDILSQHL 2054
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 111/293 (37%), Positives = 157/293 (53%), Gaps = 11/293 (3%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
++ + + ++ + TFDY+CSL++E + S + R S+E++VAIAL L SG
Sbjct: 1395 EDAQRLESVLKMPRTTFDYVCSLLKESSLEIMNSYFF-FDMRSFSLEERVAIALIMLNSG 1453
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
+ +VG GV ++TVS VT F + ERA HL WP S +++IK KF++ GL NC
Sbjct: 1454 DPPATVGCYIGVNEATVSLVTRTFAGVMLERALEHLWWPGSTEIDKIKYKFDKIHGLPNC 1513
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
CG + HI + D E N ML+Q +VD +MRFI+ P M S +L
Sbjct: 1514 CGVVHTVHIT-------SGSHNKDHEVNDDMLMQAVVDTDMRFINTWLTSPDSMKQSSIL 1566
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET-NGLSASMPT 296
S F CE G LNG+ + +++ EYI+G GYPLL WL+TPYE N LS S
Sbjct: 1567 HDSELFTECEKGTGLNGSKLNLPDGLQVGEYIIGDAGYPLLPWLLTPYELENELSDSQVE 1626
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
FN H ++ +A +LK W+ L WRP I C +LHNI+I
Sbjct: 1627 FNRRHLEATTIVPRAVTRLKDTWKFLHGGAWRPQNHHWA--IYACFVLHNIVI 1677
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 187/352 (53%), Gaps = 23/352 (6%)
Query: 4 PKKSKSKKTKKV-TKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEE-E 61
P+ +K + V ++L S + P S D++ PSF ++ T+P E+ +
Sbjct: 1011 PEFAKFRHVDHVKARRLYTSDFYVLYTRGPCSFDTNVNPSFLSR------GTLPIFEDAQ 1064
Query: 62 GFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
+ ++ ++ FDY+C+L++E+ + + +GR L +E VA+AL L SGE+
Sbjct: 1065 RLESALKMPRRAFDYVCNLLKEN-SWQHANKYTFPDGRYLCLEDGVAVALIVLNSGETPA 1123
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
+VG + + +STVSQVT F+ A++ WP + ME+IK KF++ GL NCCG +
Sbjct: 1124 TVGSSVALNESTVSQVTESFVVAMDTPG-----WPGTTEMEKIKYKFDKIRGLPNCCGVV 1178
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
A I Q SD ++ N +L+Q +VD +MRFID+ G P M S + S
Sbjct: 1179 HAARIPF---GSQNSDR---EKKNEDLLMQVVVDSDMRFIDVHLGSPDNMKESSISHDSN 1232
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE-TNGLSASMPTFNSL 300
FK CE G RLNG+ S+ ++ EY++G GYPLL WL+TPY N LS FN
Sbjct: 1233 LFKRCEKGTRLNGSKLNLSDGRQVGEYVIGDAGYPLLPWLLTPYHPENDLSDYQVEFNRR 1292
Query: 301 H-EATRSLAVKAFLQLKGGWRILSKVMWRP-DKRKLPSIILVCCLLHNIIID 350
H EA + A +LK W+ L W P D+ L + I CC LH I+I+
Sbjct: 1293 HSEAMAVVPRSALERLKDTWKFLHGRGWHPEDQYVLRNAIQTCCKLHTIVIE 1344
>gi|357150942|ref|XP_003575630.1| PREDICTED: uncharacterized protein LOC100823191 [Brachypodium
distachyon]
Length = 1128
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 126/368 (34%), Positives = 195/368 (52%), Gaps = 30/368 (8%)
Query: 30 KDPNSSDSDWWPSFWAKNSSTPGATIPSDEE-EGFKYFFRVSKKTFDYICSLVREDLVSR 88
+ P S D++ PSF ++ T+P E+ + + ++ ++ FDY+C+L++E +
Sbjct: 743 RGPCSFDTNVNPSFLSR------GTLPIFEDAQRLESALKMPRRAFDYVCNLLKEK-SWQ 795
Query: 89 PPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER 148
+ +GR L +E VA+AL L SGE+ +VG + + +S VSQVT F+ A++
Sbjct: 796 DANKYTFPDGRYLCLEDGVAVALIVLNSGETPATVGSSVALNESIVSQVTKSFVVAMDTP 855
Query: 149 AKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
WP + ME+IK KF++ GL NCCG + A I Q SD ++ N M
Sbjct: 856 G-----WPGTTEMEKIKYKFDKIRGLPNCCGVVHAARIPF---GSQNSDR---EKKNEDM 904
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREY 268
L+Q +D +MRFID+ G P M S + S F+ CE G RLNG+ S+ ++ EY
Sbjct: 905 LMQVTIDSDMRFIDVHLGSPDNMKKSSISHDSSLFEECEKGTRLNGSKLNLSDGRQVGEY 964
Query: 269 IVGGVGYPLLSWLITPYE-TNGLSASMPTFNSLHEATRSLAVKAFLQ-LKGGWRILSKVM 326
++G GYPLL WL+TPY N LS FN H ++ ++ L+ LK W+ L
Sbjct: 965 VIGDAGYPLLPWLLTPYHPENDLSDYQVEFNRRHSEAMAVVTRSALERLKDTWKFLHGGR 1024
Query: 327 WRPDKR-KLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV-DPMGRTT 384
WRP+ R +L I CC LHNI++ ++ VA + + G G + +Q+ D
Sbjct: 1025 WRPEHRFELHQAIQACCKLHNIVL----EMECKVATTSY---GAGSKEVRQLADEDAAMA 1077
Query: 385 RENLEKHL 392
R+ L +HL
Sbjct: 1078 RDILSQHL 1085
>gi|414589198|tpg|DAA39769.1| TPA: hypothetical protein ZEAMMB73_938202 [Zea mays]
Length = 207
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 88/191 (46%), Positives = 130/191 (68%), Gaps = 4/191 (2%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINI---EGRLLSVEKQVAIALRRLASGESQVSV 123
+++++TFDYICSLV++DL ++ G N + ++L VE QVA+AL +L +GES ++
Sbjct: 5 LKMTRRTFDYICSLVKKDLTTKT-YGFRNFRFGDKKVLGVEDQVAVALMKLTTGESLQNI 63
Query: 124 GVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDA 183
G+ FG+ S +S +TW FIE++EE A HLKW M IK+ F++ +GL NCCGAID
Sbjct: 64 GMWFGMNHSAISNITWWFIESVEECAICHLKWSSPEEMATIKTSFDKVYGLPNCCGAIDT 123
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI+M A S W D EN +M++Q +VD ++RF D+V+GWPG M+ + +L+ SG +
Sbjct: 124 THILMCSSAQPNSKVWLDNENKNNMVLQAVVDPDLRFRDVVSGWPGSMDDACILRTSGLY 183
Query: 244 KLCEAGQRLNG 254
+LCE G RL G
Sbjct: 184 RLCEKGLRLGG 194
>gi|357129756|ref|XP_003566527.1| PREDICTED: putative disease resistance protein RGA4-like
[Brachypodium distachyon]
Length = 1346
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 181/330 (54%), Gaps = 24/330 (7%)
Query: 30 KDPNSSDSDWWPSFWAKNSSTPGATIPS-DEEEGFKYFFRVSKKTFDYICSLVREDLVSR 88
++P S D++ SF G T+ + + + ++ +K FDYICSLV++ R
Sbjct: 985 REPYSLDTNINCSFLL------GGTLSIFKDAQRLESMVKMPRKAFDYICSLVKQK-TFR 1037
Query: 89 PPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER 148
++G+LL +E +VA+AL L SG++ ++G + GV ++T S VT F+ A+
Sbjct: 1038 DMYSHTFLDGKLLCLEDRVAVALIVLNSGDTLATIGSSVGVNEATASLVTNSFVHAMPRP 1097
Query: 149 AKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
+WPD+ ME++K ++ +GL NCCG I + I Q S++ E N S+
Sbjct: 1098 -----RWPDTGEMEKVKFNSDKIYGLPNCCGVIHTSCIPF---GSQNSNN----EKNGSL 1145
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN-VRISSE-EVELR 266
L+Q ++D MRFID+ G + +L S CE G LNG+ +++SS+ E+
Sbjct: 1146 LMQVVIDSNMRFIDVKLGLFDDKDKLSVLHGSYLSTNCEKGIWLNGSKLKVSSDGSGEVG 1205
Query: 267 EYIVGGVGYPLLSWLITPYET-NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
EYI+G GYPLL WL+TPY+ N LS + FN H ++A+K + K W+ L
Sbjct: 1206 EYIIGDAGYPLLPWLLTPYQLENDLSETKLEFNKRHSEAMAVALKTSARFKDIWKGLHGG 1265
Query: 326 MWRPDKR-KLPSIILVCCLLHNIIIDSGDQ 354
WRP+ R +L I VCC+LHNI+I D+
Sbjct: 1266 TWRPENRDELWRAIRVCCMLHNIVIGMDDE 1295
>gi|414590858|tpg|DAA41429.1| TPA: hypothetical protein ZEAMMB73_600948 [Zea mays]
Length = 207
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 84/181 (46%), Positives = 114/181 (62%), Gaps = 1/181 (0%)
Query: 202 QENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSE 261
QE NYSM++Q +VD + +F IVTGWPG M S +L SG FKLCE G+RL+G+ S+
Sbjct: 17 QEKNYSMVLQAVVDLDTKFTVIVTGWPGSMKESSILHNSGLFKLCEKGERLSGSKLKVSD 76
Query: 262 EVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRI 321
E+ +Y++G GYPLL WL+TPY+ L+ S FN H + R++A++ + K W+
Sbjct: 77 GSEIEKYLIGDSGYPLLPWLLTPYQEKDLTESSAKFNIRHSSVRTVALRTLAKFKDTWKF 136
Query: 322 LSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDV-ALSDHHDSGYGEQCCKQVDPM 380
L MWRPDK KLP II VC LLHNIIID ++ + A HD+ Y +Q C+ D
Sbjct: 137 LQGEMWRPDKHKLPRIIHVCYLLHNIIIDLQERAVDEAQAWPSDHDAKYRQQVCQLADEN 196
Query: 381 G 381
G
Sbjct: 197 G 197
>gi|357150991|ref|XP_003575646.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 340
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 107/287 (37%), Positives = 153/287 (53%), Gaps = 15/287 (5%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
++ ++ D++CSL +E + +GRLL +E +VAIAL L SGE+ ++ +
Sbjct: 20 LKMPRRASDHVCSLAKEQ-SWEDMNKYTFPDGRLLCLEDRVAIALIVLNSGETPATIRSS 78
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
GV +STVSQVT FI+A+ + +WP + ME+IK KF++ GL N CG + A I
Sbjct: 79 VGVSESTVSQVTESFIQAMVSQ-----RWPSTTEMEKIKYKFDKXHGLPNYCGVVHAAQI 133
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ QE N ML+Q +D +MRF + G M S + S FK C
Sbjct: 134 PFGSQNINL------QEKNEDMLMQVTIDSDMRFTSVYLGSSDNMKRSSISHDSDLFKKC 187
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE-TNGLSASMPTFNSLH-EAT 304
E G LNG+ S+ ++ EY++G GYPLL WL+TPY NGLS FN H EA
Sbjct: 188 EKGIWLNGSKLNLSDGRQVGEYVIGDSGYPLLPWLLTPYNPENGLSDCQVEFNRRHSEAM 247
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKR-KLPSIILVCCLLHNIIID 350
+ A +LK W+ L +W + R L I CC+LHNI+I+
Sbjct: 248 AVMPRNALERLKDTWKFLHGGLWFAENRYMLQGAIQTCCMLHNIVIE 294
>gi|356563940|ref|XP_003550215.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 536
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 171/348 (49%), Gaps = 38/348 (10%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
EE FK FR+ ++TFD IC ++V+ED R + V ++VA+ L RL
Sbjct: 207 EEDFKKAFRMGRETFDMICEELNSAIVKEDTTLR----------NAIPVRQRVAVCLWRL 256
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFG 173
A+G+ V FG+G ST ++ A++ +L WPD + +KS+FE G
Sbjct: 257 ATGDPLRIVSKRFGLGISTCHKLVLEVCTAIKSVLMPKYLNWPDEVALRRVKSEFEGVSG 316
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ N G++ +H+ + P + + + +Q+ +YS+ VQG+VDH F D+ GW
Sbjct: 317 IPNVVGSMYTSHVPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDHRGVFTDVCIGW 376
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG M ++L+ S F QR NG + ++IVG GYPL+ W++ PY
Sbjct: 377 PGSMPDDQVLEKSALF------QRANGGLLKG-------DWIVGSSGYPLMDWVLVPYSQ 423
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + FN + +A AF +LKG W L K + LP ++ CC+LHNI
Sbjct: 424 QNLTWTQHAFNEKIGEVQKVARDAFARLKGRWSCLQKRT-EVKLQDLPVVLGACCVLHNI 482
Query: 348 IIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHN 395
G+++ P++ + D E + + M TR+ + +L H+
Sbjct: 483 CELKGEKIDPELKVDLMDDEMVPEVALRSMSSM--KTRDAIAHNLLHH 528
>gi|350536071|ref|NP_001234228.1| salt responsive protein 2 [Solanum lycopersicum]
gi|195549553|gb|ACG50004.1| salt responsive protein 2 [Solanum lycopersicum]
Length = 499
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 103/347 (29%), Positives = 174/347 (50%), Gaps = 29/347 (8%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P EE FK FRVS+ TFD IC + E +V++ + L + + V ++VA+ + RLA
Sbjct: 165 PDFPEEEFKKAFRVSRATFDMICEEL-ESVVTKKDTML----RQAIPVRQRVAVCIWRLA 219
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSN-RMEEIKSKFEESFG 173
+GE V FG+G ST ++ A++ ++WP+ + +M EIKS+F+ G
Sbjct: 220 TGEPLREVSKRFGLGISTCHKLVLEVCTAIKGVLMPKFVQWPNEDYKMNEIKSEFQMLSG 279
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ N G+I TH+ + P V + + +Q+ +YS+ VQG+VD + F D+ GW
Sbjct: 280 MPNVGGSIYTTHVPIIAPKVSVAAYFNKRHTERNQKTSYSVTVQGVVDPKGVFTDVCIGW 339
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG M+ ++L+ S ++ GQ + ++VG GYPL+ W++ PY
Sbjct: 340 PGSMSDDKVLEKSALYQRANRGQLKDT-------------WVVGNSGYPLMDWVLAPYTR 386
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + FN + +A +AF+++K W L K + LP ++ CC+LHNI
Sbjct: 387 QNLTWTQHAFNEKVGEVQKVAKEAFMRMKARWSCLRKRT-EVKLQDLPVVLGACCVLHNI 445
Query: 348 IIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQH 394
G+QL+P + D E + ++ + R+ + L H
Sbjct: 446 CEMRGEQLNPGLRFDLFDDEMVPENIVRSMNAV--QARDQIAHQLLH 490
>gi|356552466|ref|XP_003544588.1| PREDICTED: uncharacterized protein LOC100804219 [Glycine max]
Length = 534
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 174/368 (47%), Gaps = 41/368 (11%)
Query: 43 FWAKNSSTP---GATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLI 94
W K+ S G EE F+ FR+ ++TFD IC ++V+ED R
Sbjct: 185 LWVKDRSGAWWDGCNKEDFPEEEFRKAFRMGRETFDMICDELNSAIVKEDTTLR------ 238
Query: 95 NIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHL 153
+ V ++VA+ L RLA+G+ V FG+G ST ++ A++ +L
Sbjct: 239 ----NAIPVRQRVAVCLWRLATGDPLRIVSKRFGLGISTCHKLVLEVCTAIKSVLMPKYL 294
Query: 154 KWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYS 207
WPD + +KS+FE G+ N G++ +H+ + P + + + +Q+ +YS
Sbjct: 295 NWPDEGSLRRVKSEFEGVSGIPNVVGSMYTSHVPIIAPKISVAAYFNKRHTERNQKTSYS 354
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE 267
+ VQG+VDH F D+ GWPG M ++L+ S F QR NG +
Sbjct: 355 ITVQGVVDHRGVFTDVCIGWPGSMPDDQVLEKSALF------QRANGGLLKGV------- 401
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
+IVG GYPL+ W++ PY L+ + FN + +A AF +LKG W L K
Sbjct: 402 WIVGSSGYPLMDWVLVPYSQQNLTWTQHAFNEKIGEVQKVARDAFARLKGRWSCLQKRT- 460
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTREN 387
+ LP ++ CC+LHNI G+++ P++ + D E + + M R+
Sbjct: 461 EVKLQDLPVVLGACCVLHNICELKGEKIDPELKVDLVDDEMVPEVALRSMSSM--KARDA 518
Query: 388 LEKHLQHN 395
+ +L H+
Sbjct: 519 IAHNLLHH 526
>gi|224113883|ref|XP_002332479.1| predicted protein [Populus trichocarpa]
gi|222832470|gb|EEE70947.1| predicted protein [Populus trichocarpa]
Length = 465
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 179/368 (48%), Gaps = 40/368 (10%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLIN 95
D +WW S P EE FK FR+SK TFD IC + ++++ + L N
Sbjct: 122 DKEWWDD--CNRSDYP--------EEEFKKAFRMSKSTFDSICEEL-NSVIAKEDTTLRN 170
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLK 154
+ V ++VA+ + RLA+GE V FG+G ST ++ A+ +L+
Sbjct: 171 A----IPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRNVLMPKYLQ 226
Query: 155 WPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSM 208
WPD + + + K++FE G+ N G++ THI + P + + + +Q+ +YS+
Sbjct: 227 WPDEDNLRKTKNEFESISGIPNVVGSMYTTHISIIAPKISVAAYFNKRHTERNQKTSYSI 286
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE- 267
VQG+VD + F D+ GWPG M ++L+ S F QR NG + L++
Sbjct: 287 TVQGVVDPKGVFTDVCIGWPGSMPDDQVLEKSALF------QRANGGL--------LKDV 332
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
++VG GYPL+ W++ PY L+ + FN +S+A AF +LKG W L K
Sbjct: 333 WLVGTSGYPLMDWVLVPYTQQNLTWTQHAFNEKIGEIQSVAKDAFARLKGRWSCLQKRT- 391
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTREN 387
+ LP ++ CC+LHNI +++ P++ + D E + M R+
Sbjct: 392 EVKLQDLPVVLGACCVLHNICEMQNEEMDPELRMELVDDEMVPEVALRSASSM--KARDA 449
Query: 388 LEKHLQHN 395
+ +L H+
Sbjct: 450 IAHNLLHH 457
>gi|357437265|ref|XP_003588908.1| hypothetical protein MTR_1g015070 [Medicago truncatula]
gi|355477956|gb|AES59159.1| hypothetical protein MTR_1g015070 [Medicago truncatula]
Length = 1056
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 171/353 (48%), Gaps = 40/353 (11%)
Query: 55 IPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAI 109
P DE FK FR+ K TFD+IC ++V+ED R + V ++VA+
Sbjct: 724 FPEDE---FKKAFRMGKSTFDFICEKLNSAIVKEDTTLRTA----------IPVRQRVAV 770
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKF 168
L RLA+G+ V FG+G ST ++ A++ +L+WP+ + +IK++F
Sbjct: 771 CLWRLATGDPLSIVSKRFGLGISTCHKLVLEVCTAIKTVLMPKYLQWPNETSLRKIKNEF 830
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFID 222
E G+ N G++ +H+ + P + +D + +Q+ +YS+ VQG+VD F D
Sbjct: 831 EGMSGIPNVVGSMYTSHVPIIAPKISVADYFNRRHTERNQKTSYSITVQGVVDTNGVFTD 890
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLI 282
+ GWPG M ++L+ S F+ G L G +IVG GYPL+ W++
Sbjct: 891 VCIGWPGSMPDDQVLEKSALFQRANNGGLLRG------------VWIVGSSGYPLMDWVL 938
Query: 283 TPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCC 342
PY L+ + FN + +A AF +LKG W L K + LP + CC
Sbjct: 939 VPYTQQNLTWTQHGFNEKIGEIQKVAKDAFARLKGRWSCLQKRT-EVKLQDLPVELCACC 997
Query: 343 LLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHN 395
+LHNI G+++ P++ + D E + V+ + R+ + +L H+
Sbjct: 998 VLHNICEMKGEKMDPELMVDIVDDEMVPEVSLRSVNSL--KARDAIAHNLLHH 1048
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 172/359 (47%), Gaps = 40/359 (11%)
Query: 55 IPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAI 109
P DE FK FR+ K TFD IC ++V+ED R + V ++VA+
Sbjct: 178 FPEDE---FKKAFRMGKSTFDLICEELNSAIVKEDTTLRTA----------IPVRQRVAV 224
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKF 168
L RLA+G+ V FG+G ST ++ A++ +L+WP+ + +IK++F
Sbjct: 225 CLWRLATGDPLRIVSKRFGLGISTCHKLVLEVCTAIKTVLMPKYLQWPNETSLRKIKNEF 284
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFID 222
E G+ N G++ +H+ + P + + + +Q+ +YS+ VQG+VD F D
Sbjct: 285 EGMSGIPNVVGSMYTSHVPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVDTNGVFTD 344
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLI 282
+ GWPG M ++L+ S F+ G L G +IVG GYPL+ W++
Sbjct: 345 VCIGWPGSMPDDQVLEKSALFQRANNGGLLKG------------VWIVGSSGYPLMDWVL 392
Query: 283 TPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCC 342
PY L+ + FN + +A AF +LKG W L K + LP ++ CC
Sbjct: 393 VPYTQQNLTWTQHGFNEKIGEIQKVAKDAFARLKGRWSCLQKRT-EVKLQDLPVVLGACC 451
Query: 343 LLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQEKDLV 401
+LHNI G+++ P++ + D E + V+ + R+ + +L H+ D +
Sbjct: 452 VLHNICEMKGEKMDPELMVDIVDDEMVPEVSLRSVNSL--KARDAIAHNLLHHGLADCL 508
>gi|224092160|ref|XP_002309488.1| predicted protein [Populus trichocarpa]
gi|222855464|gb|EEE93011.1| predicted protein [Populus trichocarpa]
Length = 507
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 178/368 (48%), Gaps = 40/368 (10%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLIN 95
D +WW P EE FK FR+SK TFD IC + ++++ + L N
Sbjct: 164 DKEWWDD----------CNRPDYPEEEFKKAFRMSKSTFDMICEEL-NSVIAKEDTTLRN 212
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLK 154
+ V ++VA+ + RLA+GE V FG+G ST ++ A+ +L+
Sbjct: 213 A----IPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRSVLMPKYLQ 268
Query: 155 WPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSM 208
WPD + + +IK++FE G+ N G++ THI + P + + + +Q+ +YS+
Sbjct: 269 WPDEDGLRKIKNEFESISGIPNAVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSI 328
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE- 267
VQG+VD + F D+ GWPG M ++L+ S F QR NG + L++
Sbjct: 329 TVQGVVDPKGIFTDVCIGWPGSMPDDQVLEKSALF------QRANGGL--------LKDV 374
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
+IVG GYPL+ W++ PY L+ + FN + +A AF +LKG W L K
Sbjct: 375 WIVGTSGYPLMDWVLVPYAQQNLTWTQHAFNEKIGEIQLVAKDAFARLKGRWSCLQKRT- 433
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTREN 387
+ LP ++ CC+LHNI +++ P++ + D E + M R+
Sbjct: 434 EVKLQDLPVVLGACCVLHNICEMQNEEIDPELRVELVDDEMVPEVALRSATSM--KARDA 491
Query: 388 LEKHLQHN 395
+ +L H+
Sbjct: 492 IAHNLLHH 499
>gi|255550093|ref|XP_002516097.1| conserved hypothetical protein [Ricinus communis]
gi|223544583|gb|EEF46099.1| conserved hypothetical protein [Ricinus communis]
Length = 498
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/347 (30%), Positives = 172/347 (49%), Gaps = 51/347 (14%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLIN 95
D +WW P EE FK FR+SK TFD IC + + + + L N
Sbjct: 155 DKEWWDE----------CNRPDYPEEEFKKAFRMSKATFDLICEELH-SCIQKEDTTLRN 203
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLK 154
+ V ++VA+ + RLA+GE V FG+G ST ++ A++ +L+
Sbjct: 204 A----IPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIKNVLMPKYLQ 259
Query: 155 WPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSM 208
WPD + ++++K++FE G+ N G++ THI + P + + + +Q+ +YS+
Sbjct: 260 WPDEDSLKKVKNEFESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSI 319
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE- 267
VQG+VD + F D+ GWPG M ++L+ S + QR NG + L++
Sbjct: 320 TVQGVVDPKGVFTDVCIGWPGSMPDDQVLEKSALY------QRANGGL--------LKDV 365
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
+IVG GYPL+ W++ PY L+ + FN +++A +AF +LKG W L K
Sbjct: 366 WIVGSSGYPLMDWVLVPYTQQHLTWTQHAFNEKIGEVQTVAKEAFTRLKGRWSCLQKRT- 424
Query: 328 RPDKRKLPSIILVCCLLHNII------IDS-------GDQLHPDVAL 361
+ LP ++ CC+LHNI ID D++ P+VAL
Sbjct: 425 EVKLQDLPVVLGACCVLHNICELRKEEIDPKLRVELVDDEMVPEVAL 471
>gi|15239835|ref|NP_196762.1| uncharacterized protein [Arabidopsis thaliana]
gi|7573368|emb|CAB87674.1| putative protein [Arabidopsis thaliana]
gi|16604352|gb|AAL24182.1| AT5g12010/F14F18_180 [Arabidopsis thaliana]
gi|25141199|gb|AAN73294.1| At5g12010/F14F18_180 [Arabidopsis thaliana]
gi|332004367|gb|AED91750.1| uncharacterized protein [Arabidopsis thaliana]
Length = 502
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 171/343 (49%), Gaps = 27/343 (7%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
EE FK FR+SK TF+ IC + V++ + L N + V ++VA+ + RLA+GE
Sbjct: 172 EEDFKKAFRMSKSTFELICDEL-NSAVAKEDTALRNA----IPVRQRVAVCIWRLATGEP 226
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
V FG+G ST ++ +A+++ +L+WPD + I+ +FE G+ N
Sbjct: 227 LRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERFESVSGIPNVV 286
Query: 179 GAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
G++ THI + P + + + +Q+ +YS+ +Q +V+ + F D+ GWPG M
Sbjct: 287 GSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTDLCIGWPGSMP 346
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
++L+ S ++ G L G ++ GG G+PLL W++ PY L+
Sbjct: 347 DDKVLEKSLLYQRANNGGLLKGM------------WVAGGPGHPLLDWVLVPYTQQNLTW 394
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ FN + +A +AF +LKG W L K + LP+++ CC+LHNI
Sbjct: 395 TQHAFNEKMSEVQGVAKEAFGRLKGRWACLQKRT-EVKLQDLPTVLGACCVLHNICEMRE 453
Query: 353 DQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHN 395
+++ P++ + D E + V+ M R+ + +L H+
Sbjct: 454 EKMEPELMVEVIDDEVLPENVLRSVNAM--KARDTISHNLLHH 494
>gi|343887274|dbj|BAK61820.1| hypothetical protein [Citrus unshiu]
Length = 515
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 102/347 (29%), Positives = 174/347 (50%), Gaps = 28/347 (8%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P EE FK +FR+ ++TFD IC + ++++ + L N + V ++VA+ + RLA
Sbjct: 182 PDYPEEEFKKWFRMRRQTFDMICEEL-NSVIAKEDTTLRNA----IPVRQRVAVCIWRLA 236
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGL 174
+GE V FG+G ST ++ A+ +L+WPD + + +IK +FE G+
Sbjct: 237 TGEPLRLVSKRFGLGISTCHKLVLEVCTAIRSVLMPKYLQWPDDDALRKIKDEFEVISGI 296
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGWP 228
N G++ TH+ + P + + + +Q+ +YS+ VQG+V+ F D+ GWP
Sbjct: 297 PNVVGSMYTTHVPIIAPKISVAAYFNKRHTERNQKTSYSITVQGVVNPNGVFTDVCIGWP 356
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G M ++L+ S + QR +G + +IVGG GYPL+ W++ PY
Sbjct: 357 GSMPDDQVLEKSALY------QRASGGLLKGV-------WIVGGSGYPLMDWVLVPYTQQ 403
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
L+ + FN + ++ AF +LKG W L K + LP ++ CC+LHNI
Sbjct: 404 HLTWTQHAFNEKIGEIQRVSKDAFARLKGRWCCLQKRT-EVKLQDLPVVLGACCVLHNIC 462
Query: 349 IDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHN 395
+++ P++A D E + V M TR+++ +L H+
Sbjct: 463 EMMNEEMDPELAFDLVDDEMVPEVALRSVSSM--KTRDSIAHNLLHH 507
>gi|397787611|gb|AFO66517.1| putative kinase [Brassica napus]
Length = 1266
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 170/343 (49%), Gaps = 27/343 (7%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
E+ FK FR+SK TF+ IC + V++ + L N + V ++VA+ + RLA+GE
Sbjct: 175 EDDFKKAFRMSKSTFELICEELNA-AVAKEDTALRNA----IPVRQRVAVCVWRLATGEP 229
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
V FG+G ST ++ +A++E +L+WPD + I+ +E G+ N
Sbjct: 230 LRLVSKKFGLGISTCHKLVLEVCKAIKEVLMPKYLQWPDDESLRNIRETYESISGIPNVV 289
Query: 179 GAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
G++ THI + P + + + +Q+ +YS+ +Q +V+ F D+ GWPG M
Sbjct: 290 GSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVNPNGVFTDLCIGWPGSMP 349
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
++L+ S ++ G L G ++ GG G+PLL W++ PY L+
Sbjct: 350 DDKVLEKSLLYQRANNGGLLKG------------LWVAGGAGHPLLDWVLVPYTQQNLTW 397
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ FN + +A +AF +LKG W L K + LP+++ CC+LHNI G
Sbjct: 398 TQHAFNEKMSEVQRVAKEAFGRLKGRWACLQKRT-EVKLQDLPTVLGACCVLHNICEIRG 456
Query: 353 DQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHN 395
+++ P++ + D E + V+ M R+ + +L H+
Sbjct: 457 ERMEPELMVEVVDDEVLPENGLRSVNAM--KARDTISHNLLHH 497
>gi|86155935|gb|ABC86705.1| R111 [Coffea arabica]
Length = 498
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 190/398 (47%), Gaps = 42/398 (10%)
Query: 12 TKKVTKKLKKSKSTSAAPKDPNSSDSDWW-----PSFWAKNSSTPGATIPSDEEEGFKYF 66
T +K+ ++S AP+ SS W ++W +S P EE F+
Sbjct: 122 TASESKRARRSSPEVEAPEAGTSSQRRLWVKDRSKAWWEHCNS------PDFPEEEFRKA 175
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGESQVSVGV 125
FR+SK TFD IC + E +V++ + L RL + V ++VA+ + RLA+GE V
Sbjct: 176 FRMSKATFDMICDEL-ESVVTKKDTML-----RLAIPVRQRVAVCIWRLATGEPLREVSK 229
Query: 126 AFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
FG+G ST ++ A+ L+WP+ M E+K FE G+ + G+I +
Sbjct: 230 RFGLGISTCHKLVLEVCSAIRNVLMPKFLQWPNEENMREVKRGFEMISGVADVAGSIYTS 289
Query: 185 HIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
H+ + P V + + +Q+ +YS+ VQG+VD + F DI GWPG M ++L+
Sbjct: 290 HVPIIAPKVSVASYFNKRHTERNQKTSYSITVQGVVDPKGVFTDICVGWPGSMTDDKVLE 349
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S ++ A + L +V ++VG G+PL+ W++ PY L+ + N
Sbjct: 350 QSALYQ--RANRGLLKDV-----------WVVGNAGFPLMDWVLVPYTRQNLTWTQHALN 396
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPD 358
+++A +AF++LK WR L K + LP ++ CC+LHNI + + P+
Sbjct: 397 EKVGEVQNIAKEAFMRLKARWRCLQKRT-EVKLQDLPILLGACCVLHNICEIRDEGMSPE 455
Query: 359 VALSDHHDSGYGEQCCKQVDPMGRTTRENL-EKHLQHN 395
+ D E + ++ + R+ + K L HN
Sbjct: 456 LRFELFDDEVVPENPVRSMNAV--QARDQIAHKLLHHN 491
>gi|356507015|ref|XP_003522267.1| PREDICTED: uncharacterized protein LOC100809836 [Glycine max]
Length = 441
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 160/329 (48%), Gaps = 36/329 (10%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P E+ FK FR+ + TF+ IC + +ED R + V+++VA+
Sbjct: 101 PEFPEQEFKKAFRMRRLTFEAICEELNSVIAKEDTTLRSA----------IPVKQRVAVC 150
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
L RLA+G+ V FG+G ST ++ A++ +L+WP+ + +IK +FE
Sbjct: 151 LWRLATGDPLSVVSRRFGLGISTCHKLVLEVCTAIKTVLMPKYLQWPNVVALRDIKGEFE 210
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
G+ N G++ +H+ + P + S + + + +YS+ VQG+VDH F D+
Sbjct: 211 NISGIPNVVGSMLTSHVPIIAPKMNVSAYFNKRQTERNHKTSYSVTVQGVVDHRGVFTDV 270
Query: 224 VTGWPGGMNVSRLLKFSGFF-KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLI 282
GWPG M ++L+ S F + AG L+G ++VGG GYPLL W++
Sbjct: 271 CIGWPGSMGDDQVLEKSALFHRANNAGLNLSG------------VWVVGGCGYPLLDWVL 318
Query: 283 TPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCC 342
PY L+ + FN R +A AF +LKG W L K + LP ++ CC
Sbjct: 319 VPYTQQNLTWTQHAFNEKIGEVRRVAKGAFGRLKGRWGCLQKRT-EVKLQDLPFVLGACC 377
Query: 343 LLHNIIIDSGDQLHPDVALSDHHDSGYGE 371
+LHNI +++ P++ + D Y E
Sbjct: 378 VLHNICELKNEEMDPELLENIDDDGDYAE 406
>gi|340382901|ref|XP_003389956.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 418
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 104/304 (34%), Positives = 155/304 (50%), Gaps = 28/304 (9%)
Query: 67 FRVSKKTFDYICSLVREDL-----VSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
FR+SK TF IC+ +R L R P + VEK+VAI+L +A+G
Sbjct: 96 FRMSKTTFVMICNELRSSLKKSSTTMRQP----------IPVEKRVAISLWFMATGTDYR 145
Query: 122 SVGVAFGVGQSTVS----QVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
++G FGV +++V QV + L ER ++KWP ++ I S F+ FG C
Sbjct: 146 TIGHLFGVSKASVCLAIRQVCRAILTTLLER---YIKWPSGENLKNIISGFKHKFGFPQC 202
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GA+D THI + P D+ +++ +S+L+QG VDH FIDI GWPG ++ +R+
Sbjct: 203 VGAVDGTHIPIISPE-DYPADYYNRKGWHSVLMQGTVDHLGIFIDIYIGWPGRVHDARVF 261
Query: 238 KFSGFFKLCEAGQRL-NGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMP 295
S +K + G L N I +EV L ++G YPLL WL+ PY NG L+
Sbjct: 262 VNSSLYKKGQEGTLLPNWKESIEGQEVPL--VLLGDPAYPLLPWLMKPYSDNGHLTRDQK 319
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
FN R + A+ +LKG WR L K + +P ++ CC+LHNI G+
Sbjct: 320 RFNYRLSKGRVVVEHAYGRLKGRWRCLLKRL-DVSVEFVPDVVAACCVLHNICEIHGETF 378
Query: 356 HPDV 359
+ ++
Sbjct: 379 NNEL 382
>gi|242056125|ref|XP_002457208.1| hypothetical protein SORBIDRAFT_03g003310 [Sorghum bicolor]
gi|241929183|gb|EES02328.1| hypothetical protein SORBIDRAFT_03g003310 [Sorghum bicolor]
Length = 470
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 99/339 (29%), Positives = 164/339 (48%), Gaps = 46/339 (13%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPP 90
D +WW S+P P DE F+ FR+S+ TF+ +C ++ +ED + R
Sbjct: 120 DHEWW-----DRMSSPA--CPEDE---FRRAFRMSRATFEAVCEELGAAVAKEDTMLRAA 169
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RA 149
+ V ++VA+ + RLA+GE V FG+G ST ++ A++
Sbjct: 170 ----------IPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKAVLM 219
Query: 150 KHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQ 202
++WP++ + + + FE + G+ GA+ THI + P + + +Q
Sbjct: 220 PKAVQWPEAPDAAAGVSATFEAASGIPGVVGAMYTTHIPIIAPKANVAAYYNRRHTERNQ 279
Query: 203 ENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEE 262
+ +YS+ VQG+VD F D+ GWPG M+ + +L S + A L G
Sbjct: 280 KTSYSITVQGVVDAGGAFTDVCIGWPGSMSDADVLDRSALYAQRGAAGLLQG-------- 331
Query: 263 VELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
+++VGG GYPL+ WL+ PY + ++ + FN + R++A AF +LK W L
Sbjct: 332 ----QWVVGGAGYPLMDWLLVPYTHHNMTWAQHVFNERVDGVRAVARDAFQRLKSRWGCL 387
Query: 323 SKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVAL 361
K + LP ++ CC+LHNI +GD + PD+A
Sbjct: 388 QKRT-EVKLQDLPVVLGACCVLHNICERAGDAVDPDIAF 425
>gi|449447145|ref|XP_004141329.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 496
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 168/368 (45%), Gaps = 41/368 (11%)
Query: 43 FWAKNSSTP---GATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLI 94
W K+ S P +E FK FR+ + TFD IC ++ +ED R
Sbjct: 147 LWVKDRSKAWWDECNSPDYPDEEFKKQFRMGRATFDMICEELNSAIAKEDTTLRTA---- 202
Query: 95 NIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHL 153
+ V+++VA+ L RLA+G+ V FG+G ST ++ A+ HL
Sbjct: 203 ------IPVQQRVAVCLWRLATGDPLRVVSKKFGLGISTCHKLVLEVCTAIRTVLMPKHL 256
Query: 154 KWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYS 207
+WP+ + IK ++E G+ N G++ THI + P + + + +Q+ +YS
Sbjct: 257 QWPEEETLRRIKEEYESISGIPNVVGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYS 316
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE 267
+ VQG+VD F D+ GWPG M ++L+ S F QR NG +
Sbjct: 317 ITVQGVVDPRGVFTDVCIGWPGSMPDDQVLEKSALF------QRANGGLLKGV------- 363
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
+IVGG YPL+ W++ PY L+ + FN + +A AF +LKG WR L K
Sbjct: 364 WIVGGSSYPLMDWVLVPYTQQHLTWTQHAFNEKIGEIQKVAKDAFARLKGRWRCLQKRT- 422
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTREN 387
+ LP ++ CC+LHNI ++ ++ D E + V M R+
Sbjct: 423 EVKLQDLPVVLGACCVLHNICELGNQEMDTELLTELQDDEMAPEMALRSVPSM--KARDA 480
Query: 388 LEKHLQHN 395
+ +L H+
Sbjct: 481 IAHNLLHH 488
>gi|225429570|ref|XP_002280033.1| PREDICTED: uncharacterized protein LOC100262760 [Vitis vinifera]
gi|296081670|emb|CBI20675.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 170/368 (46%), Gaps = 45/368 (12%)
Query: 43 FWAKNSSTP---GATIPSDEEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLI 94
W K+ S P EE F+ FR+S+ TFD IC + +ED + R
Sbjct: 135 LWVKDRSKAWWDECNKPEFPEEEFRKAFRMSRATFDMICDELNSVVAKEDTMLRAA---- 190
Query: 95 NIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHL 153
+ V ++VA+ + RLA+GE V FG+G ST ++ A+ +L
Sbjct: 191 ------IPVRQRVAVCIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCSAIRTVLMPKYL 244
Query: 154 KWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYS 207
+WPD + +K +FE G+ N G++ THI + P + + + +Q+ +YS
Sbjct: 245 QWPDEETLRRMKDEFESISGIPNVVGSMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYS 304
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE 267
+ VQG+VD + F D+ GWPG M ++L+ S + QR NG +
Sbjct: 305 ITVQGVVDPKGVFTDVCIGWPGSMPDDQVLEKSALY------QRANGGLLKGV------- 351
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
+IVGG GYPL+ W++ PY L+ + FN + ++ +AF +LK W L K
Sbjct: 352 WIVGGSGYPLMDWVMVPYTQQNLTWTQHAFNEKIGEIQRVSREAFARLKARWCCLQKRT- 410
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTREN 387
+ LP ++ CC+LHNI +++ P++ ++ D E P+ TT
Sbjct: 411 EVKLQDLPVVLGACCVLHNICELQNEEMDPELRINLIDDEMIPEI------PLRSTTSMK 464
Query: 388 LEKHLQHN 395
+ HN
Sbjct: 465 ARDAIAHN 472
>gi|222618738|gb|EEE54870.1| hypothetical protein OsJ_02361 [Oryza sativa Japonica Group]
Length = 551
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 98/352 (27%), Positives = 163/352 (46%), Gaps = 38/352 (10%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P EE F+ FR+ ++TFD IC ++ +ED + R + V ++VA+
Sbjct: 218 PDYPEEEFRRAFRMGRETFDMICEALGSAVAKEDTMLRAA----------IPVRQRVAVC 267
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+GE V FG+G ST ++ A++ L+WPD K +F+
Sbjct: 268 IWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEAAAAAFKERFQ 327
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
++G+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+
Sbjct: 328 AAYGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDV 387
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M ++L+ S + AG + +VGG YPL+ W++
Sbjct: 388 CIGWPGSMPDDQVLEKSMLHQRAAAGMMHSAC-------------LVGGASYPLMDWVLV 434
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
PY L+ + FN R +AV AF +LK W L K + LP ++ CC+
Sbjct: 435 PYTHQNLTWTQHAFNEKVGDLRRVAVDAFARLKARWACLQKRT-EVKLQDLPVVLGACCV 493
Query: 344 LHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHN 395
LHNI G++L P++ D E + + R+N+ ++L H
Sbjct: 494 LHNICETRGEELEPELRFELVDDETSPETPVRS--EAAKRARDNIAQNLLHR 543
>gi|255546077|ref|XP_002514098.1| conserved hypothetical protein [Ricinus communis]
gi|223546554|gb|EEF48052.1| conserved hypothetical protein [Ricinus communis]
Length = 541
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 161/327 (49%), Gaps = 36/327 (11%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW + P EE FK FR+SK TF+ IC+ + + V++ + L +
Sbjct: 200 DWWEK----------CSHPDFPEEEFKKAFRMSKATFETICNEL-DSAVTKKNTMLRDA- 247
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+ V ++VA+ + RLA+GE V FG+G ST ++ A+ L+WP
Sbjct: 248 ---IPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRNVLMPRFLQWP 304
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLV 210
D ++ IKS++E G+ N G++ THI + P + + + +Q+ +YSM V
Sbjct: 305 DEEKLRMIKSEYEAVSGIPNVGGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTSYSMTV 364
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
QG+VD F D+ GWPG M ++L+ S F QR N R + ++V +IV
Sbjct: 365 QGVVDPRGVFTDVCIGWPGSMTDDQVLEKSALF------QRAN---RGALKDV----WIV 411
Query: 271 GGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
G GYPL+ W++ PY L+ + FN + + +AF +LK W L K
Sbjct: 412 GNSGYPLMDWVLVPYTHQNLTWTQHAFNEKIGEVQRVGKEAFARLKARWSCLQKRT-EVK 470
Query: 331 KRKLPSIILVCCLLHNIIIDSGDQLHP 357
+ LP ++ CC+LHNI +++ P
Sbjct: 471 LQDLPVVLGACCVLHNICEMRNEEIEP 497
>gi|356514661|ref|XP_003526022.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 440
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 160/338 (47%), Gaps = 54/338 (15%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P E+ FK FR+ + TF+ IC + +ED R + V+++VA+
Sbjct: 98 PEFPEQEFKKAFRMGRLTFEAICQELNSVIAKEDTTLRTA----------IPVKQRVAVC 147
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
L RLA+G+ V FG+G ST ++ A++ +L+WP+ + +IK +FE
Sbjct: 148 LWRLATGDPLSVVSRRFGLGISTCHKLVLEVCTAIKTVLMPKYLQWPNVVTLRDIKGEFE 207
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
G+ N G++ +H+ + P + S + + + +YS+ VQG+VDH F D+
Sbjct: 208 NISGIPNVVGSMFTSHVPIIAPKMSVSAYFNKKHTERNHKTSYSVTVQGVVDHRGVFTDV 267
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M ++L+ S F NG + +S ++VGG GYPLL W++
Sbjct: 268 CIGWPGSMGDDQVLEKSSLFHRAN-----NGGINLSGV------WVVGGCGYPLLDWVLV 316
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
PY L+ + +FN R +A AF +LKG W L K + LP ++ CC+
Sbjct: 317 PYTQQNLTWTQHSFNEKIGEVRRVAKGAFGRLKGRWGCLQKRT-EVKLQDLPLVLGACCV 375
Query: 344 LHNII--------------IDSGDQ------LHPDVAL 361
LHNI ID GD + P+VA+
Sbjct: 376 LHNICELKNEEMDHELLKNIDDGDDDYVGEVMEPEVAM 413
>gi|302802405|ref|XP_002982957.1| hypothetical protein SELMODRAFT_34337 [Selaginella moellendorffii]
gi|300149547|gb|EFJ16202.1| hypothetical protein SELMODRAFT_34337 [Selaginella moellendorffii]
Length = 217
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 84/214 (39%), Positives = 115/214 (53%), Gaps = 2/214 (0%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F+ FRVS F ++C + LV RPP GL I R L + KQVAIAL+RLASG+ +
Sbjct: 5 FREVFRVSVSIFTFLCDELAPKLVRRPPPGLAEIPRRHLPISKQVAIALKRLASGDMWRT 64
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
+G AFGV RF AL E + WPD M+E+ + F+ G NCCGA+D
Sbjct: 65 IGDAFGVASCIAQACLCRFRYALLEHEGLMIHWPDEEGMKEVITGFQRLRGFPNCCGAMD 124
Query: 183 ATHIIMTLPAVQTSDDW-CDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
THI + LP + + DW + YSM+VQ +VD + F+DI G G + R+ SG
Sbjct: 125 CTHIAIELPGSEDATDWYAGAKKYYSMVVQAVVDSKTSFLDITIGIAGSVPDRRVWNSSG 184
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
K +RL G V +E ++ +YI+ GY
Sbjct: 185 LKKAWIEKKRLCGPV-YHTEFGDIPQYIIADGGY 217
>gi|21902022|dbj|BAC05572.1| unknown protein [Oryza sativa Japonica Group]
Length = 551
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 98/352 (27%), Positives = 162/352 (46%), Gaps = 38/352 (10%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P EE F+ FR+ ++TFD IC ++ +ED + R + V ++VA+
Sbjct: 218 PDYPEEEFRRAFRMGRETFDMICEALGSAVAKEDTMLRAA----------IPVRQRVAVC 267
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+GE V FG+G ST ++ A++ L+WPD K +F+
Sbjct: 268 IWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEAAAAAFKERFQ 327
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
++G+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+
Sbjct: 328 AAYGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDV 387
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M ++L+ S + AG + +VGG YPL+ W++
Sbjct: 388 CIGWPGSMPDDQVLEKSMLHQRAAAGMMHSAC-------------LVGGASYPLMDWVLV 434
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
PY L+ + FN R +AV AF +LK W L K + LP ++ CC+
Sbjct: 435 PYTHQNLTWTQHAFNEKVGDLRRVAVDAFARLKARWACLQKRT-EVKLQDLPVVLGACCV 493
Query: 344 LHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHN 395
LHNI G++L P++ D E + + R+N+ +L H
Sbjct: 494 LHNICETRGEELEPELRFELVDDETSPETPVRS--EAAKRARDNIAHNLLHR 543
>gi|255589984|ref|XP_002535147.1| hypothetical protein RCOM_0387610 [Ricinus communis]
gi|223523928|gb|EEF27235.1| hypothetical protein RCOM_0387610 [Ricinus communis]
Length = 128
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 71/125 (56%), Positives = 93/125 (74%), Gaps = 2/125 (1%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
+S+KTF+YICSLV+ED+ ++ + G++L++ VA+ALRRL SG+S V+V FG
Sbjct: 1 MSRKTFNYICSLVKEDMAAKSRE-FTFLNGKVLTLHDLVAVALRRLGSGDSLVTVADFFG 59
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHII 187
V STVSQVTWRF+EA+EER HHLKWP + + M EIKSKFE+ GL NCCG ID THI+
Sbjct: 60 VNHSTVSQVTWRFVEAMEERGLHHLKWPSNESEMTEIKSKFEKIRGLPNCCGVIDTTHIM 119
Query: 188 MTLPA 192
M L +
Sbjct: 120 MLLSS 124
>gi|390366728|ref|XP_003731101.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 426
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 166/322 (51%), Gaps = 21/322 (6%)
Query: 43 FWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-L 101
FW K I + E+E + FR+S +TF+++C +R P ++ + R+ +
Sbjct: 83 FWWKT-----IVIDTFEQEDWLSHFRMSWQTFNFLCGQLR------PIIQKMDTKFRVAI 131
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALE-ERAKHHLKWPDSNR 160
VE +VAI L RLA+ ++ FGVG STV + + A+ A ++ P +
Sbjct: 132 RVEHRVAITLWRLATNVEYRTIAQMFGVGTSTVCCIVHQVCRAIVVTLANDMIRIPRGDA 191
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
++ +FE +G C GAID +HI + P + D+ +++ YSM++QG+VDH RF
Sbjct: 192 AADVVREFEVKWGFPQCFGAIDGSHIPVLSPK-EFRADYYNRKGFYSMVLQGLVDHRYRF 250
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
++I G+PG ++ +R+ S F+L G+ +R E ++ I+G YPLL W
Sbjct: 251 MNINFGYPGSVHDARVFTNSRVFRLGNEGELCPPLLR-EIGETQVPVAILGDSAYPLLPW 309
Query: 281 LITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDK--RKLPSI 337
L+ P+ NG L+ FN R + F +LKG WRIL K R D + L +
Sbjct: 310 LMKPFHDNGQLTREKRHFNYRLSRARMVVENGFGRLKGRWRILLK---RQDTHVKYLGDL 366
Query: 338 ILVCCLLHNIIIDSGDQLHPDV 359
++ CC+LHN+ +G+ D+
Sbjct: 367 VVACCVLHNLCESAGENFDLDL 388
>gi|356532740|ref|XP_003534929.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 527
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 159/317 (50%), Gaps = 36/317 (11%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW K+ P EE F+ +FR+S+ TFD IC + + V++ + L +
Sbjct: 186 DWWEKISRKD-------FP---EEEFRRWFRMSRSTFDMICDEL-DAAVTKKNTMLRDA- 233
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+ V ++VA+ + RLA+G+ V FG+G ST ++ A+ L+WP
Sbjct: 234 ---IPVRQRVAVCIWRLATGDPLRLVSKRFGLGISTCHKLVLEVCSAIRTVLMPKFLQWP 290
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLV 210
+ + ++IK +FE+ G+ N GA+ TH+ + P + S + +Q+ +YS+ V
Sbjct: 291 NEEKTKQIKDEFEKISGIPNVGGAMYTTHVPIIAPKISVSAYFNKRHTERNQKTSYSITV 350
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
QG+VD + F D+ GWPG M R+L+ S F+ G L G +IV
Sbjct: 351 QGVVDSKGVFNDVCIGWPGSMPDDRVLEKSALFQRASRGN-LKG------------VWIV 397
Query: 271 GGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
G G+PL+ W++ PY L+ + FN E + +A +AF +LKG W L K
Sbjct: 398 GNSGHPLMDWVLVPYTHANLTWTQHAFNEKIEEIQGVAKEAFARLKGRWGCLQKRT-EVK 456
Query: 331 KRKLPSIILVCCLLHNI 347
+ LP ++ CC+LHNI
Sbjct: 457 LQDLPVVLGACCVLHNI 473
>gi|357135326|ref|XP_003569261.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 539
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 149/314 (47%), Gaps = 36/314 (11%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P E+ F+ FR+ ++TFD IC ++ +ED + R + V ++VA+
Sbjct: 203 PDFPEQEFRRAFRMGRETFDMICEALGSAVAKEDTMLRA----------AIPVRQRVAVC 252
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+GE V FG+G ST ++ A++ L+WPD K FE
Sbjct: 253 IWRLATGEPLRLVSKRFGLGISTCHKLVLEVCGAIKSVLMPRFLQWPDEAAAGRFKEGFE 312
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
SFG+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+
Sbjct: 313 RSFGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDRAFTDV 372
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M +L+ S + AG + +VGG YPL+ W++
Sbjct: 373 CIGWPGSMADDVVLEKSMLHQRAAAGMMHDA-------------CLVGGASYPLMDWVMV 419
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
PY L+ + FN R++AV AFL+LK W L K + LP ++ CC+
Sbjct: 420 PYTHQNLTWTQHAFNEKVGDIRNVAVDAFLRLKTRWACLQKRT-EVKLQDLPVVLGACCV 478
Query: 344 LHNIIIDSGDQLHP 357
LHNI G++L P
Sbjct: 479 LHNICEMRGEELEP 492
>gi|414875939|tpg|DAA53070.1| TPA: transposon protein CACTA, En/Spm sub-class [Zea mays]
Length = 472
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 174/372 (46%), Gaps = 48/372 (12%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPP 90
D +WW S+P P DE F+ FR+S+ TF+ +C ++ +ED + R
Sbjct: 120 DHEWWDRM-----SSPA--CPDDE---FRRAFRMSRATFEAVCEELGAAVAKEDTMLRAA 169
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RA 149
+ V ++VA+ + RLA+GE V FG+G ST ++ A++
Sbjct: 170 ----------IPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKAVLM 219
Query: 150 KHHLKWPDSNRMEEIKS-KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQ 202
++WP++ +S +FE + G+ GA+ THI + P + + +Q
Sbjct: 220 PKAVQWPEAPDAAAAESARFEAASGIPGVVGAMYTTHIPIIAPKANVAAYYNRRHTERNQ 279
Query: 203 ENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEE 262
+ +YS+ VQG+VD F D+ GWPG M+ + +L S + A L G
Sbjct: 280 KTSYSITVQGVVDAGGAFTDVCIGWPGSMSDADVLDRSALYLQRGAAGLLQG-------- 331
Query: 263 VELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
+ +VGG GYPL+ WL+ PY + ++ + FN + R++A AF +LK W L
Sbjct: 332 ----QRLVGGAGYPLMDWLLVPYTHHNMTWAQHVFNERVDGVRAVARDAFQRLKSRWGCL 387
Query: 323 SKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGR 382
K + LP ++ CC+LHNI +GD + PD+A D E + +
Sbjct: 388 QKRT-EVKLQDLPVVLGACCVLHNICERAGDAVDPDIAFQLFDDDMVAENPVRSTAAVA- 445
Query: 383 TTRENLEKHLQH 394
R+N+ +L H
Sbjct: 446 -ARDNIAHNLLH 456
>gi|226493946|ref|NP_001146875.1| LOC100280483 [Zea mays]
gi|195604564|gb|ACG24112.1| transposon protein CACTA, En/Spm sub-class [Zea mays]
Length = 473
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 174/372 (46%), Gaps = 48/372 (12%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPP 90
D +WW S+P P DE F+ FR+S+ TF+ +C ++ +ED + R
Sbjct: 120 DHEWWDRM-----SSPA--CPEDE---FRRAFRMSRATFEAVCEELGAAVAKEDTMLRAA 169
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RA 149
+ V ++VA+ + RLA+GE V FG+G ST ++ A++
Sbjct: 170 ----------IPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKAVLM 219
Query: 150 KHHLKWPDSNRMEEIKS-KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQ 202
++WP++ +S +FE + G+ GA+ THI + P + + +Q
Sbjct: 220 PKAVQWPEAPDAAAAESARFEAASGIPGVVGAMYTTHIPIIAPKANVAAYYNRRHTERNQ 279
Query: 203 ENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEE 262
+ +YS+ VQG+VD F D+ GWPG M+ + +L S + A L G
Sbjct: 280 KTSYSITVQGVVDAGGAFTDVCIGWPGSMSDADVLDRSALYLQRGAAGLLQG-------- 331
Query: 263 VELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
+ +VGG GYPL+ WL+ PY + ++ + FN + R++A AF +LK W L
Sbjct: 332 ----QRLVGGAGYPLMDWLLVPYTHHNMTWAQHVFNERVDGVRAVARDAFQRLKSRWGCL 387
Query: 323 SKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGR 382
K + LP ++ CC+LHNI +GD + PD+A D E + +
Sbjct: 388 QKRT-EVKLQDLPVVLGACCVLHNICERAGDAVDPDIAFQLFDDDMVAENPVRSTAAVA- 445
Query: 383 TTRENLEKHLQH 394
R+N+ +L H
Sbjct: 446 -ARDNIAHNLLH 456
>gi|224114577|ref|XP_002332342.1| predicted protein [Populus trichocarpa]
gi|222831909|gb|EEE70386.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 172/365 (47%), Gaps = 38/365 (10%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW + P EE F+ FR+SK TFD IC + + +V++ + L +
Sbjct: 115 DWWEK----------CSHPDFPEEEFRKAFRMSKATFDMICVEL-DSVVTKKNTMLRDA- 162
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+ V ++VA+ + RLA+GE V FG+G ST ++ A+ L+WP
Sbjct: 163 ---IPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRNVLMPKFLQWP 219
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLV 210
+ ++++ IK +FE G+ N G++ TH+ + P + + + +Q+ +YSM V
Sbjct: 220 NEDKLKMIKGEFESMSGIPNVGGSMYTTHVPIIAPKINVAAYYNKRHTERNQKTSYSMTV 279
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
QG+V + F D+ G+PG M R+L+ S FK G N IV
Sbjct: 280 QGVVSPKGVFTDVCIGYPGSMPDDRVLEESALFKRANRGALKN-------------VCIV 326
Query: 271 GGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
G G+PL+ W++ PY L+ + FN + ++ +AF +LKG W L K
Sbjct: 327 GNSGHPLMDWVLVPYTHQNLTWTQHAFNEKIGEIQRVSKEAFARLKGRWSCLQKRT-EVK 385
Query: 331 KRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEK 390
+ LP ++ CC+LHNI +++ P++ D E + + R+++
Sbjct: 386 LQDLPVVLGACCVLHNICEMRNEEMGPELKFDIFDDDMIPENSLRSAGAI--QARDHIAH 443
Query: 391 HLQHN 395
+L H+
Sbjct: 444 NLLHH 448
>gi|449452530|ref|XP_004144012.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449500453|ref|XP_004161101.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 483
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 157/323 (48%), Gaps = 26/323 (8%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P +E F+ FR+SK TFD IC + ++ + ++ + + V ++VA+ + RLA
Sbjct: 149 PDFPDEEFRRAFRMSKSTFDMICKELDSTVMKK--DTMLRVA---IPVRQRVAVCIWRLA 203
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGL 174
+GE V FG+G ST ++ A+ + L WPD +++ +IK +FE G+
Sbjct: 204 TGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLNWPDESKLTKIKQEFESISGI 263
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGWP 228
G+I THI + P + + +Q+ +YS+ VQG+VD F D+ GWP
Sbjct: 264 PKVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTSYSITVQGVVDPSGVFTDVCIGWP 323
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G M ++L+ S ++ G S +V +IVG GYPL+ WL+ PY
Sbjct: 324 GSMPDDQVLEKSLLYERASMG---------SLNDV----FIVGNSGYPLMDWLLVPYTVQ 370
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
L+ + FN ++ A AF +LKG W L K ++LP ++ CC+LHNI
Sbjct: 371 NLTWTQHGFNEKVGEIQAAAKAAFGRLKGRWTCLQKRT-EVKLQELPVVLGACCVLHNIC 429
Query: 349 IDSGDQLHPDVALSDHHDSGYGE 371
++ P++ + D E
Sbjct: 430 EMRKEKFDPELKFEVYDDEMMPE 452
>gi|302786758|ref|XP_002975150.1| hypothetical protein SELMODRAFT_102485 [Selaginella moellendorffii]
gi|300157309|gb|EFJ23935.1| hypothetical protein SELMODRAFT_102485 [Selaginella moellendorffii]
Length = 433
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 169/364 (46%), Gaps = 62/364 (17%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
PS E F FR+SK TF+ IC ++ +ED + R + + V ++VA+
Sbjct: 103 PSFPEAEFHRAFRMSKSTFNIICDQLGLAVAKEDTMLR----------QAIPVRQRVAVC 152
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+GE V FG+G ST ++ A+ + +++WP R++ + +FE
Sbjct: 153 IWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAINDVLLSKYVQWPTEERLQAVMEEFE 212
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
G+ N GA+ THI + P + + + +Q+ +YS+ +QG+VD F D+
Sbjct: 213 AISGIPNLVGAMYTTHIPIIAPKMNVAAYFNKRHTDRNQKTSYSITLQGVVDIRGIFTDL 272
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M+ R+L+ S + G LNG ++ GG GYPLLSWL+
Sbjct: 273 SIGWPGSMSDERILEKSNLAQRAADGL-LNGV------------WVSGGPGYPLLSWLLV 319
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
PY + L + FN +A AF +LKG W+ L K ++LP+++ C +
Sbjct: 320 PYTQHNLGWAHHAFNEKLSEVLKIARDAFGRLKGRWKFLQKRT-EVKLQELPAVLGACSV 378
Query: 344 LHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV------------DPMGRTTRENLEKH 391
LHN+ +HH+ G+ E+ + DP R+ + +
Sbjct: 379 LHNV--------------CEHHNEGFDEELAYEAVDDLMFPDSSSSDPAAVQARDAIAHN 424
Query: 392 LQHN 395
L H+
Sbjct: 425 LLHS 428
>gi|225444848|ref|XP_002281001.1| PREDICTED: uncharacterized protein LOC100247440 [Vitis vinifera]
Length = 509
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 163/313 (52%), Gaps = 34/313 (10%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRL 114
P EE F+ FR+S+ TFD IC + + +V++ + L RL + V ++VA+ + RL
Sbjct: 176 PDFPEEDFRRAFRMSRATFDMICDEL-DSVVTKKDTML-----RLAIPVRQRVAVCIWRL 229
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFG 173
A+GE V FG+G ST ++ A++ L+WPD + + +K +FE G
Sbjct: 230 ATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKTVLMPKFLQWPDDEKSKIVKDEFESISG 289
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ N G++ THI + P V + + +Q+ +YS+ VQG+VD + F D+ GW
Sbjct: 290 IPNVGGSMYTTHIPIIAPKVSVAAYFNKRHTERNQKTSYSVTVQGVVDPKGVFTDVSIGW 349
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE-YIVGGVGYPLLSWLITPYE 286
PG M+ ++L+ S + QR N L++ ++VGG GYPLL W++ PY
Sbjct: 350 PGSMSDEKVLEQSSLY------QRANSGF--------LKDVWVVGGSGYPLLDWVMVPYT 395
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR--KLPSIILVCCLL 344
L+ + FN +++A +AF +LK W L K R + + LP ++ CC+L
Sbjct: 396 HQNLTWTQHAFNEKIGDIQNVAKEAFARLKRRWSCLQK---RTEVKLLDLPVVLGACCVL 452
Query: 345 HNIIIDSGDQLHP 357
HNI +++ P
Sbjct: 453 HNICQMRNEEMDP 465
>gi|242053391|ref|XP_002455841.1| hypothetical protein SORBIDRAFT_03g026050 [Sorghum bicolor]
gi|241927816|gb|EES00961.1| hypothetical protein SORBIDRAFT_03g026050 [Sorghum bicolor]
Length = 559
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 160/347 (46%), Gaps = 38/347 (10%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
EE F+ FR+ ++TFD IC S+ +ED + R + V ++VA+ + RL
Sbjct: 230 EEEFRRAFRMGRETFDMICDALGSSVAKEDTMLRA----------AIPVRQRVAVCIWRL 279
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFG 173
A+GE V FG+G ST ++ A++ L+WPD K FE S G
Sbjct: 280 ATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEPAAARFKESFERSSG 339
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+ GW
Sbjct: 340 VPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGW 399
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG M ++L+ S + AG + ++VGG YPL+ W++ PY
Sbjct: 400 PGSMPDDQVLEKSMLHQRAAAG-------------MMHESWLVGGASYPLMDWVLVPYTH 446
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + FN R +AV+AF +LK W L K + LP ++ CC+LHNI
Sbjct: 447 PNLTWTQHAFNEKVGELRRVAVEAFARLKARWACLQKRT-EVKLQDLPVVLGACCVLHNI 505
Query: 348 IIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQH 394
G+++ P + D E + + R+N+ +L H
Sbjct: 506 CELRGEEVDPAIRCELVDDETTPENPVRS--ETAKHARDNIAHNLLH 550
>gi|226499058|ref|NP_001143085.1| hypothetical protein [Zea mays]
gi|195614032|gb|ACG28846.1| hypothetical protein [Zea mays]
gi|413948677|gb|AFW81326.1| hypothetical protein ZEAMMB73_227542 [Zea mays]
Length = 451
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 93/310 (30%), Positives = 147/310 (47%), Gaps = 36/310 (11%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
E F+ FR+ + TF +C ++ +ED R + V ++VA+ + RL
Sbjct: 122 EADFRSAFRMGRATFAMLCDALGAAVAKEDTALRA----------AIPVRQRVAVCVWRL 171
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWPDSNRMEEIKSKFEESFG 173
A+GE V FG+G ST ++ A+ L+WPD E KS+F+ G
Sbjct: 172 ATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRGLLMPRFLRWPDDAAAEAFKSRFQAESG 231
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ + GAI THI + P V + + + + +YS+ +QG+VD + F D+ GW
Sbjct: 232 IPSVVGAIYTTHIPIIAPKVSVTAYFNRRHTERNHKTSYSITLQGVVDPDGAFTDVCIGW 291
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG M ++L+ S + AG + G+ ++VGGV +PL W++ PY
Sbjct: 292 PGSMPDDQVLERSALQQRAAAGM-MAGS------------WVVGGVSFPLTEWVLAPYAQ 338
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + FN R +AV AF +LKG W L K + LP ++ CC+LHNI
Sbjct: 339 ANLTWAQHAFNEKVAEVRRVAVDAFARLKGRWACLQKRT-EVKLQDLPVVLGACCVLHNI 397
Query: 348 IIDSGDQLHP 357
G+ + P
Sbjct: 398 CESRGEGMDP 407
>gi|326494082|dbj|BAJ85503.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 532
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 152/323 (47%), Gaps = 36/323 (11%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P E F+ FR+ ++TFD IC ++ +ED + R + V ++VA+
Sbjct: 199 PDFPEAEFRRAFRMGRETFDMICDALGSAVAKEDTMLRA----------AIPVRQRVAVC 248
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+GE V FG+G ST ++ A++ L+WPD K FE
Sbjct: 249 IWRLATGEPLRLVSKRFGLGISTCHKLVLEVCGAIKSVLMPRFLQWPDEAAAGRFKEGFE 308
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
SFG+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+
Sbjct: 309 RSFGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDRAFTDV 368
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M ++L+ S + AG + +VGG YPL+ W++
Sbjct: 369 CIGWPGSMPDDQVLEKSMLHQRAAAGMMHDA-------------CLVGGASYPLMDWVLV 415
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
PY L+ + FN R++AV+AF +LK W L K + LP ++ CC+
Sbjct: 416 PYTHQNLTWTQHAFNEKVGDIRNVAVEAFARLKTRWACLQKRT-EVKLQDLPVVLGACCV 474
Query: 344 LHNIIIDSGDQLHPDVALSDHHD 366
LHNI ++L P+V + D
Sbjct: 475 LHNICEMRREELEPEVQFALFDD 497
>gi|326526371|dbj|BAJ97202.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 532
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 152/323 (47%), Gaps = 36/323 (11%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P E F+ FR+ ++TFD IC ++ +ED + R + V ++VA+
Sbjct: 199 PDFPEAEFRRAFRMGRETFDMICDALGSAVAKEDTMLRA----------AIPVRQRVAVC 248
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+GE V FG+G ST ++ A++ L+WPD K FE
Sbjct: 249 IWRLATGEPLRLVSKRFGLGISTCHKLVLEVCGAIKSVLMPRFLQWPDEAAAGRFKEGFE 308
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
SFG+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+
Sbjct: 309 RSFGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDRAFTDV 368
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M ++L+ S + AG + +VGG YPL+ W++
Sbjct: 369 CIGWPGSMPDDQVLEKSMLHQRAAAGMMHDA-------------CLVGGASYPLMDWVLV 415
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
PY L+ + FN R++AV+AF +LK W L K + LP ++ CC+
Sbjct: 416 PYTHQNLTWTQHAFNEKVGDIRNVAVEAFARLKTRWACLQKRT-EVKLQDLPVVLGACCV 474
Query: 344 LHNIIIDSGDQLHPDVALSDHHD 366
LHNI ++L P+V + D
Sbjct: 475 LHNICEMRREELEPEVQFALFDD 497
>gi|326528651|dbj|BAJ97347.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 521
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 152/323 (47%), Gaps = 36/323 (11%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P E F+ FR+ ++TFD IC ++ +ED + R + V ++VA+
Sbjct: 188 PDFPEAEFRRAFRMGRETFDMICDALGSAVAKEDTMLRAA----------IPVRQRVAVC 237
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+GE V FG+G ST ++ A++ L+WPD K FE
Sbjct: 238 IWRLATGEPLRLVSKRFGLGISTCHKLVLEVCGAIKSVLMPRFLQWPDEAAAGRFKEGFE 297
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
SFG+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+
Sbjct: 298 RSFGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDRAFTDV 357
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M ++L+ S + AG + +VGG YPL+ W++
Sbjct: 358 CIGWPGSMPDDQVLEKSMLHQRAAAGMMHDA-------------CLVGGASYPLMDWVLV 404
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
PY L+ + FN R++AV+AF +LK W L K + LP ++ CC+
Sbjct: 405 PYTHQNLTWTQHAFNEKVGDIRNVAVEAFARLKTRWACLQKRT-EVKLQDLPVVLGACCV 463
Query: 344 LHNIIIDSGDQLHPDVALSDHHD 366
LHNI ++L P+V + D
Sbjct: 464 LHNICEMRREELEPEVQFALFDD 486
>gi|218197366|gb|EEC79793.1| hypothetical protein OsI_21220 [Oryza sativa Indica Group]
Length = 423
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 148/312 (47%), Gaps = 36/312 (11%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
E F+ FR+S+ TF ++C ++ +ED R + V ++VA+ + RL
Sbjct: 93 EADFRRAFRMSRPTFHFLCDALAAAVAKEDTALRAA----------IPVRQRVAVCVWRL 142
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWPDSNRMEEIKSKFEESFG 173
A+GE V FG+G ST ++ A+ L WPD K++FE + G
Sbjct: 143 ATGEPLRVVSKRFGLGISTCHKLILEVCAAIRNLLMPRFLHWPDHPTSTAYKTRFEATSG 202
Query: 174 LLNCCGAIDATHIIMTLPAVQTS------DDWCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ GA+ THI + P V + + + +YS+ +QG+V + F D+ GW
Sbjct: 203 VSGVVGAMYTTHIPIIAPKVSVAAYLNRRHTERNHKTSYSITLQGVVGPDGTFTDVCIGW 262
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG M+ ++L+ S + A G++ ++VGG YPL W++ PY
Sbjct: 263 PGSMSDEQVLRKSALHQRASAAA---GSM----------SWVVGGASYPLTEWMLVPYAQ 309
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + FN R +A +AF++LKG W L K + LP+++ CC+LHNI
Sbjct: 310 RNLTWTQHAFNEKVGEVRRVATEAFVRLKGRWACLQKRT-EVKLQDLPAVLAACCVLHNI 368
Query: 348 IIDSGDQLHPDV 359
G+ + PD+
Sbjct: 369 CETRGEDMDPDL 380
>gi|224085922|ref|XP_002307739.1| predicted protein [Populus trichocarpa]
gi|222857188|gb|EEE94735.1| predicted protein [Populus trichocarpa]
Length = 453
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 171/365 (46%), Gaps = 38/365 (10%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW P +E F+ FR+SK TFD IC + + V++ + L +
Sbjct: 114 DWWDK----------CNHPDFPDEEFRKAFRMSKATFDLIC-MELDSAVTKKNTMLRDA- 161
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+ V +++A+ + RLA+GE V FG+G ST ++ A+ L+WP
Sbjct: 162 ---IPVRQRIAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRNVLMPKFLQWP 218
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLV 210
+ ++++ IK +FE G+ N G++ TH+ + P + + + +Q+ +YSM V
Sbjct: 219 NEDKLKMIKGEFESMSGIPNVGGSMYTTHVPIIAPKINVAAYYNKRHTERNQKTSYSMTV 278
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
QG+V + F D+ G+PG M R+L+ S FK G N IV
Sbjct: 279 QGVVSPKGVFTDVCIGYPGSMPDDRVLEESALFKRANRGALKNV-------------CIV 325
Query: 271 GGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
G G+PL+ W++ PY L+ + FN + ++ +AF +LKG W L K
Sbjct: 326 GNSGHPLMDWVLVPYTHQNLTWTQHAFNEKIGEIQRVSKEAFARLKGRWSCLQKRT-EVK 384
Query: 331 KRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEK 390
+ LP+++ CC+LHNI +++ P++ D E + + R+++
Sbjct: 385 LQDLPAVLGACCVLHNICEMRNEEMGPELKFDLFDDVMIPENSLRSASAI--QARDHISH 442
Query: 391 HLQHN 395
+L H+
Sbjct: 443 NLLHH 447
>gi|115465799|ref|NP_001056499.1| Os05g0593000 [Oryza sativa Japonica Group]
gi|48475235|gb|AAT44304.1| unknown protein [Oryza sativa Japonica Group]
gi|113580050|dbj|BAF18413.1| Os05g0593000 [Oryza sativa Japonica Group]
gi|222632770|gb|EEE64902.1| hypothetical protein OsJ_19762 [Oryza sativa Japonica Group]
Length = 423
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 148/312 (47%), Gaps = 36/312 (11%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
E F+ FR+S+ TF ++C ++ +ED R + V ++VA+ + RL
Sbjct: 93 EADFRRAFRMSRPTFHFLCDALAAAVAKEDTALRAA----------IPVRQRVAVCVWRL 142
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWPDSNRMEEIKSKFEESFG 173
A+GE V FG+G ST ++ A+ L WPD K++FE + G
Sbjct: 143 ATGEPLRVVSKRFGLGISTCHKLILEVCAAIRNLLMPRFLHWPDHPTSTAYKTRFEATSG 202
Query: 174 LLNCCGAIDATHIIMTLPAVQTS------DDWCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ GA+ THI + P V + + + +YS+ +QG+V + F D+ GW
Sbjct: 203 VPGVVGAMYTTHIPIIAPKVSVAAYLNRRHTERNHKTSYSITLQGVVGPDGTFTDVCIGW 262
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG M+ ++L+ S + A G++ ++VGG YPL W++ PY
Sbjct: 263 PGSMSDEQVLRKSALHQRASAAA---GSM----------SWVVGGASYPLTEWMLVPYAQ 309
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + FN R +A +AF++LKG W L K + LP+++ CC+LHNI
Sbjct: 310 RNLTWTQHAFNEKVGEVRRVATEAFVRLKGRWACLQKRT-EVKLQDLPAVLAACCVLHNI 368
Query: 348 IIDSGDQLHPDV 359
G+ + PD+
Sbjct: 369 CETRGEDMDPDL 380
>gi|62122827|ref|NP_001014341.1| uncharacterized protein LOC541506 [Danio rerio]
gi|61403162|gb|AAH91804.1| Zgc:113227 [Danio rerio]
Length = 415
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 167/316 (52%), Gaps = 15/316 (4%)
Query: 53 ATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALR 112
+P E F FRVS+++F+YIC +R L + + +++ V+K+VAIAL
Sbjct: 71 VIVPEFTPEEFIQNFRVSRESFEYICRRLRHMLERKDTNFRLSV-----PVKKRVAIALC 125
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWPDSNRMEEIKSKFEES 171
+LA+G V FGVG STV F A ++ H+K+P +++E+ FE
Sbjct: 126 KLATGSEYRYVSQLFGVGVSTVFNCVQDFCSAVIKILVPVHMKFPSPEKLKEMADVFENC 185
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
+ + C G+IDA HI + P + + +++ +S+++Q +VD F D+ G+ G +
Sbjct: 186 WNVPQCIGSIDAHHIPIIAPE-KNPRGYLNRKGWHSVVLQAVVDGNGLFWDLCVGFSGNL 244
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGL 290
+ +R+L+ S + L LN N ++ ++ Y++G YPL +WL+ P+ + GL
Sbjct: 245 SDARVLRQSYLWSLLSERDLLNHN-KVDISGCDVGYYLIGDSAYPLQNWLMKPFPDIGGL 303
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK--LPSIILVCCLLHNII 348
+ +FNS + RS++ +F +LK W+ L + R D + + + L CC+LHNI
Sbjct: 304 TPQQESFNSRLSSARSVSDLSFKKLKARWQCLFR---RNDCKVELVKKMALTCCVLHNIC 360
Query: 349 IDSGDQLHPDVALSDH 364
+ G Q D + +DH
Sbjct: 361 EEKGTQFSEDHS-TDH 375
>gi|340377501|ref|XP_003387268.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 414
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/302 (32%), Positives = 150/302 (49%), Gaps = 26/302 (8%)
Query: 67 FRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
FRV+++TF+ IC +VR++ R P +SV+++V + L RLA+
Sbjct: 91 FRVTQQTFEEICREVQPYIVRKNTYLRQP----------ISVDERVVVTLWRLATNVDYR 140
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
++ FG+G STV + E L E H ++ P +++ F+ +G GA
Sbjct: 141 TISALFGIGCSTVCTIVIETCEVLSEHLFHMYVYIPTGEGQQKVVDGFQSRWGFPQAIGA 200
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID +HI + P D+ +++ YS++VQG+VDH +F+D GWPG + +R+ + S
Sbjct: 201 IDGSHIPIIKPC-HCPSDYYNRKGFYSIIVQGLVDHTGKFLDAYIGWPGKCHDARVFQNS 259
Query: 241 GFFKLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFN 298
+K G L R + S +V L I+G YPLLSWL+ PY ET + +N
Sbjct: 260 SLYKKGIKGSLLPCLTRKLGSVDVPL--VILGDAAYPLLSWLMKPYIETPSSPDDIKLYN 317
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR--KLPSIILVCCLLHNIIIDSGDQLH 356
R + +F +LKG WRIL K R D +PSI+ C LHN+ D
Sbjct: 318 YRQSRARMVVENSFGRLKGRWRILLK---RLDCHLDNVPSIVSACITLHNMCEKFNDHWS 374
Query: 357 PD 358
PD
Sbjct: 375 PD 376
>gi|357449945|ref|XP_003595249.1| hypothetical protein MTR_2g040130 [Medicago truncatula]
gi|355484297|gb|AES65500.1| hypothetical protein MTR_2g040130 [Medicago truncatula]
Length = 512
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 153/317 (48%), Gaps = 36/317 (11%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW S P DE F+ FR+SK+TF+ IC+ + + V++ + L +
Sbjct: 171 DWW-------ESCNREDFPDDE---FRRCFRMSKQTFNMICNEL-DSSVTKKNTTLRDA- 218
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+ V ++VA+ + RLA+GE V FG+G ST ++ A++ + L+WP
Sbjct: 219 ---IPVRQRVAVCIYRLATGEPLRLVSKKFGLGISTCHKLVLEVCAAIKSVLMQKFLRWP 275
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLV 210
D +E K FE SFG+ N G + TH+ + P V + +Q+ +YS+ V
Sbjct: 276 DEETTKETKQAFEGSFGIPNIGGVMYTTHVPIIAPKVNVGLYFNKKLTERNQKTSYSVTV 335
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
QG+V+ + F D+ GWPG M + +L+ S ++ + G N +IV
Sbjct: 336 QGVVNSKGVFTDVCIGWPGSMQDNTVLEKSALYERAKLGNLKNI-------------WIV 382
Query: 271 GGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
G GY L+ W++ PY+ L+ + FN + AF +LKG W L K
Sbjct: 383 GNSGYSLMDWVLVPYKHQNLTWTQHGFNERIGEIEKIGKDAFSRLKGRWGCLKKRT-EIK 441
Query: 331 KRKLPSIILVCCLLHNI 347
+ LP ++ CC+LHNI
Sbjct: 442 LQDLPVVLGACCVLHNI 458
>gi|212276182|ref|NP_001130453.1| hypothetical protein [Zea mays]
gi|194689168|gb|ACF78668.1| unknown [Zea mays]
gi|223943395|gb|ACN25781.1| unknown [Zea mays]
gi|224029997|gb|ACN34074.1| unknown [Zea mays]
gi|224030945|gb|ACN34548.1| unknown [Zea mays]
gi|414881571|tpg|DAA58702.1| TPA: hypothetical protein ZEAMMB73_471184 [Zea mays]
Length = 568
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 149/312 (47%), Gaps = 36/312 (11%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
EE F+ FR+ + TFD IC S+ +ED + R + V ++VA+ + RL
Sbjct: 239 EEEFRRAFRMGRGTFDMICDALGSSVAKEDTMLRA----------AIPVRQRVAVCVWRL 288
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFG 173
A+GE V FG+G ST ++ A++ L+WPD++ K FE + G
Sbjct: 289 ATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKSVLMPRFLQWPDADAAARFKEGFERASG 348
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+ GW
Sbjct: 349 VPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDVCIGW 408
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG M ++L+ S + AG + ++V G YPL+ W++ PY
Sbjct: 409 PGSMPDDQVLEKSMLHQRAAAGMMHDS-------------WLVAGASYPLMDWVLVPYTH 455
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + FN R +AV+AF +LK W L K + LP ++ CC+LHNI
Sbjct: 456 PNLTWTQHAFNEKVAELRRVAVEAFARLKARWACLQKRT-EVKLQDLPVVLGACCVLHNI 514
Query: 348 IIDSGDQLHPDV 359
G+++ P +
Sbjct: 515 CEIRGEEVDPAI 526
>gi|270016593|gb|EFA13039.1| hypothetical protein TcasGA2_TC010570 [Tribolium castaneum]
Length = 393
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 178/355 (50%), Gaps = 33/355 (9%)
Query: 53 ATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALR 112
ATI ++ F+ FRV+ + ++ + V L+ + +G ++ VEKQ+ +
Sbjct: 60 ATISGYTKQEFQQHFRVTIEAYEQLLQTVGPYLMRQAATGRSTVQ-----VEKQLLSVIW 114
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF 172
L + +S SVG FG+ +S++SQ R ++ L A +KWP+ +++ ++ +F
Sbjct: 115 ILVTPDSYRSVGERFGLAKSSLSQCFKRVVKILNAVAPTIIKWPEGMQLQNVERRFHAFA 174
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
+ +C GAID T++ + P + + ++ NY+ +Q I ++F D G+PG ++
Sbjct: 175 RMPHCIGAIDGTYVEIKAPK-EDPQSYITRKCNYAFTLQAIAVPSLKFTDAFIGYPGSVS 233
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNV-RISSEEVELREYIVGGVGYPLLSWLITPYETNG-L 290
SR+ K S F+ +N N+ +EE Y++G YP L W I PY G L
Sbjct: 234 DSRIFKNSDFYNA------VNANMGHYFAEE----HYLIGDKAYPNLPWCIAPYINRGNL 283
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
+A+ FN++H TR + ++F L G +R L K + D + +P+ ++ C+LHN+ +D
Sbjct: 284 NAAQVYFNTVHAQTRQVIERSFALLFGRFRRL-KFLDMNDTKLIPATVIAACVLHNVCLD 342
Query: 351 SGDQLHPD---------VALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQ 396
D LH D V ++ D G GE + GR R+ L + + N+
Sbjct: 343 FHD-LHIDDYINNGIDYVVNNNDGDDGIGENVAIE----GRRRRDALCRQIFLNR 392
>gi|388495434|gb|AFK35783.1| unknown [Medicago truncatula]
Length = 512
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 95/317 (29%), Positives = 152/317 (47%), Gaps = 36/317 (11%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW S P DE F+ FR+SK+TF+ IC+ + + V++ + L +
Sbjct: 171 DWW-------ESCNREDFPDDE---FRRCFRMSKQTFNMICNEL-DSSVTKKNTTLRDA- 218
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+ V ++VA+ + RLA+GE V FG+G ST ++ A++ + L+WP
Sbjct: 219 ---IPVRQRVAVCIYRLATGEPLRLVSKKFGLGISTCHKLVLEVCAAIKSVLMQKFLRWP 275
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLV 210
D +E K FE SFG+ N G + TH+ + P V + +Q+ +YS+ V
Sbjct: 276 DEETTKETKQAFEGSFGIPNIGGVMYTTHVPIIAPKVNVGLYFNKKLTERNQKTSYSVTV 335
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
QG+V+ + F D+ GWP M + +L+ S ++ + G N +IV
Sbjct: 336 QGVVNSKGVFTDVCIGWPDSMRDNTVLEKSALYERAKLGNLKNI-------------WIV 382
Query: 271 GGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
G GY L+ W++ PY+ L+ + FN + AF +LKG W L K
Sbjct: 383 GNSGYSLMDWVLVPYKHQNLTWTQHGFNERIGEIEKIGKDAFSRLKGRWGCLKKRT-EIK 441
Query: 331 KRKLPSIILVCCLLHNI 347
+ LP ++ CC+LHNI
Sbjct: 442 LQDLPVVLGACCVLHNI 458
>gi|255594524|ref|XP_002536107.1| hypothetical protein RCOM_1838120 [Ricinus communis]
gi|223520838|gb|EEF26276.1| hypothetical protein RCOM_1838120 [Ricinus communis]
Length = 140
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 69/133 (51%), Positives = 84/133 (63%)
Query: 260 SEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGW 319
SE E+REYI+G G+PLL +LI PYE LS FN H AT+ +A KA +LK W
Sbjct: 2 SEGTEIREYIIGDSGFPLLPYLIVPYEGKELSEPKVEFNKRHSATQIVAQKALARLKEMW 61
Query: 320 RILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDP 379
RI+ VMWRPDK +LP IILVCCLLHNIIID D+ D+ S HD Y +Q C+ VD
Sbjct: 62 RIIQGVMWRPDKHRLPRIILVCCLLHNIIIDMEDEAQDDIPSSHEHDLDYQQQKCESVDI 121
Query: 380 MGRTTRENLEKHL 392
G R+ L +L
Sbjct: 122 KGVCLRDKLSLYL 134
>gi|432951686|ref|XP_004084885.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 360
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 160/325 (49%), Gaps = 31/325 (9%)
Query: 67 FRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
FR+ ++TF+ + L ED R + + V+K+V + L LA+G
Sbjct: 37 FRMRQETFEMLSVRLSPMLSYEDTTFR----------QAIPVQKRVGVGLWWLATGAGYR 86
Query: 122 SVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
++ FG+ ++V + F A+ ER + ++K P+ ++ + F+ +G C GA
Sbjct: 87 TLAHLFGISDASVCLIVREFCHAVRHERMREYIKLPEGEELQTVLLGFKNRWGFPQCAGA 146
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID +HI + P + D+ +++ +S+++Q +VDH+ F +I GWPG ++ SR+L+ S
Sbjct: 147 IDGSHIPVIAPH-ENHADYFNRKGCHSVILQAVVDHKYCFKNINIGWPGSVHDSRVLRNS 205
Query: 241 GFFKLCEAGQRL-NGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFN 298
++ E+G N I +V + ++G YPL SWL+ Y ET L+ FN
Sbjct: 206 EMYEKAESGVLFPNTTEEIQGTQVPI--MLLGDPAYPLRSWLMKGYPETGNLNEQQRQFN 263
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL--H 356
+ R A AF +LKG WR LSK + D +P++I CC LHNI + H
Sbjct: 264 NRLSGARMTAECAFGRLKGRWRCLSKRL-DVDISLVPTVISACCTLHNICEKHNEAYSEH 322
Query: 357 PDVALSDHHDSGYGEQCCKQVDPMG 381
P A G + VDP+G
Sbjct: 323 PSAA-------GLPDVPPGDVDPLG 340
>gi|340381674|ref|XP_003389346.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 368
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 151/295 (51%), Gaps = 12/295 (4%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGESQVSVGV 125
R+S +TF + + +R + R + + R ++VE++VAI + RLA+ ++
Sbjct: 36 LRMSYETFTMLVNQLRPYIFKR------DTQLRTAVTVEERVAITVWRLATNVEYRTLSA 89
Query: 126 AFGVGQSTVSQVTWRFIEALEERA-KHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
FGVG STV + + + +H++K P +++ E F+ +G GAID +
Sbjct: 90 LFGVGISTVCTIVMETCTVISQHLLQHYVKIPTGSKLRETVDGFKTRWGFPQVVGAIDGS 149
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
HI + P+ + + D+ +++ YS+++QG+VD++ +FID GWPG ++ +R+ S F+
Sbjct: 150 HIPVLRPS-ECATDYYNRKCFYSIIIQGVVDYQGQFIDTYIGWPGKLHDARVFYNSSFYN 208
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFNSLHEA 303
G L ++ + E+ ++ I+G YPLL WL+ PY ++ + +N
Sbjct: 209 KGRQGT-LFPSMSVRIEQTDIPLLILGDPAYPLLPWLMKPYPDSPSATTEEKHYNYRQSR 267
Query: 304 TRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPD 358
R + AF +LKG WR L K M +P I+ C LHNI GD D
Sbjct: 268 ARMVVENAFGRLKGRWRCLLKRM-DCHLTNVPVIVAACVTLHNICERFGDNCQED 321
>gi|326514010|dbj|BAJ92155.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 166/373 (44%), Gaps = 50/373 (13%)
Query: 37 SDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPS 91
S+WW + S+PG E F FR+ + TF +C ++ +ED R
Sbjct: 116 SEWW-----ELRSSPGY-----PEAEFLREFRMGRATFGMVCDALGAAVAKEDTALRA-- 163
Query: 92 GLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAK 150
+ V ++VA+ + RLA+GE V FG+G ST ++ A+
Sbjct: 164 --------AIPVRQRVAVCVWRLATGEPLRLVSKRFGIGISTCHKLVLEVCAAIRSILMP 215
Query: 151 HHLKWPD--SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQ 202
L+WPD S +K+ FE G+ + GA+ THI + P V + + +Q
Sbjct: 216 RFLRWPDRGSAAAAAVKASFEARSGVPDVVGAMYTTHIPIIAPKVHVASYFNRRHTERNQ 275
Query: 203 ENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEE 262
+ +YS+ +QG+V + F D+ GWPG ++ ++L S + AG ++G+
Sbjct: 276 KTSYSITLQGVVGPDGAFTDVCIGWPGSLSDDQVLDKSALQQRAAAGM-MDGS------- 327
Query: 263 VELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
++VGG YPL W + PY L+ + FN R +AV AF +LK W L
Sbjct: 328 -----WVVGGASYPLTGWCLVPYTHQNLTWTQHAFNEKVGELRQVAVDAFSRLKSRWACL 382
Query: 323 SKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGR 382
K + LP ++ CC+LHNI G+ + P++ + D + + D
Sbjct: 383 QKRT-EVKLQDLPVVLGACCVLHNICEARGEAMTPELRVEVQDDLPVLDNPVRSAD--AA 439
Query: 383 TTRENLEKHLQHN 395
R+ + +L H
Sbjct: 440 KARDKMAHNLLHR 452
>gi|326528107|dbj|BAJ89105.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 166/373 (44%), Gaps = 50/373 (13%)
Query: 37 SDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPS 91
S+WW + S+PG E F FR+ + TF +C ++ +ED R
Sbjct: 116 SEWW-----ELRSSPGYP-----EAEFLREFRMGRATFGMVCDALGAAVAKEDTALRA-- 163
Query: 92 GLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAK 150
+ V ++VA+ + RLA+GE V FG+G ST ++ A+
Sbjct: 164 --------AIPVRQRVAVCVWRLATGEPLRLVSKRFGIGISTCHKLVLEVCAAIRSILMP 215
Query: 151 HHLKWPD--SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQ 202
L+WPD S +K+ FE G+ + GA+ THI + P V + + +Q
Sbjct: 216 RFLRWPDRGSAAAAAVKASFEARSGVPDVVGAMYTTHIPIIAPKVHVASYFNRRHTERNQ 275
Query: 203 ENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEE 262
+ +YS+ +QG+V + F D+ GWPG ++ ++L S + AG ++G+
Sbjct: 276 KTSYSITLQGVVGPDGAFTDVCIGWPGSLSDDQVLDKSALQQRAAAGM-MDGS------- 327
Query: 263 VELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
++VGG YPL W + PY L+ + FN R +AV AF +LK W L
Sbjct: 328 -----WVVGGASYPLTGWCLVPYTHQNLTWTQHAFNEKVGELRQVAVDAFSRLKSRWACL 382
Query: 323 SKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGR 382
K + LP ++ CC+LHNI G+ + P++ + D + + D
Sbjct: 383 QKRT-EVKLQDLPVVLGACCVLHNICEARGEAMTPELRVEVQDDLPVLDNPVRSAD--AA 439
Query: 383 TTRENLEKHLQHN 395
R+ + +L H
Sbjct: 440 KARDKMAHNLLHR 452
>gi|156375356|ref|XP_001630047.1| predicted protein [Nematostella vectensis]
gi|156217060|gb|EDO37984.1| predicted protein [Nematostella vectensis]
Length = 283
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 144/292 (49%), Gaps = 24/292 (8%)
Query: 67 FRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
F VS+ TF YI + L R+D + R + VEK+VA+ L RLA+G
Sbjct: 4 FHVSRNTFMYILNTFGSQLARKDTLMRES----------VPVEKRVAVGLFRLATGSPYN 53
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSK---FEESFGLLNCC 178
G FG+G+ T + F E L + AK+ +K+P + E+ ++ FE +
Sbjct: 54 KTGEQFGIGRRTSLYIKDEFCEVLVKNAKYFIKFPTTE--EDTRAAIDGFEGKCQIPQLA 111
Query: 179 GAIDATHIIMTLPAVQTSD--DWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
GAI A + P++ T + D +SM++Q + D +F+D+ TG+PGG+ +
Sbjct: 112 GAIKAVRFKIKAPSLVTHKMKYYNDNLKCHSMILQAVTDSSGKFLDVSTGYPGGITNTEA 171
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMP 295
L S +K + G LN + + E ++++ I Y L W I PY + N LS S
Sbjct: 172 LAISDIYKNVKQGVILN-SPSATVEGLQVKPIIATDASYKLRKWCICPYPDNNTLSPSQE 230
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
FNS + +A A +L+G W L++ + K+P +L+CC+LHNI
Sbjct: 231 RFNSELGSAVHIAEDALARLRGRWNTLARSCLDENVEKIPETVLICCVLHNI 282
>gi|326665390|ref|XP_002661050.2| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 426
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/349 (30%), Positives = 166/349 (47%), Gaps = 40/349 (11%)
Query: 20 KKSKSTSAAPKDP-----NSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTF 74
+ S+ + P P N +DWW + PSD E F R++K+TF
Sbjct: 60 QTSRPITELPARPTRVWSNIESTDWWERVVMREFQ------PSDWLEKF----RMTKETF 109
Query: 75 DYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGV 129
+C L R+D RP L +EK+VA+AL RLAS ++ FGV
Sbjct: 110 FLLCGKLKPRLNRQDTRLRPA----------LPLEKRVAVALWRLASNVEYRTISTLFGV 159
Query: 130 GQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIM 188
G+STV + A+ + +L+ P +E+ F +G +C GA+ + H+ +
Sbjct: 160 GRSTVCKCVRDVCHAIVLLLRPLYLRTPSEQELEDAARLFATRWGFPHCVGAVGSLHVPI 219
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF-KLCE 247
P+ T + W + S++ QG V+ +F D+ G+PG S +L+ S + + C+
Sbjct: 220 IAPSSNTDNYW-NSRGWLSVVTQGAVNGLGQFWDVCAGFPGSTEHSAILQNSTLWARGCD 278
Query: 248 AGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFNSLHEATRS 306
G L + L ++G GYPL SWL+ Y E++ L+A FN E RS
Sbjct: 279 GGFLLR-QPPLDFMGHPLGFLMLGDAGYPLKSWLLKGYPESSALTAGQRAFNRRLERARS 337
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRK--LPSIILVCCLLHNIIIDSGD 353
+ +AFL+L+ W+ L K R D R +P++IL CC+LHN+ GD
Sbjct: 338 VVDQAFLRLRARWQCLLK---RNDCRMDVVPTMILACCVLHNVCELHGD 383
>gi|356547899|ref|XP_003542342.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 432
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 91/311 (29%), Positives = 156/311 (50%), Gaps = 30/311 (9%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P EE F+ +FR+SK TFD IC + + V++ + L + + V ++VA+ + RLA
Sbjct: 99 PDFPEEEFRRYFRMSKATFDMICQHL-DSAVTKKNTML----RQAIPVRQRVAVCIWRLA 153
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGL 174
+G+ V FG+G ST ++ A++ L WP+ M+ I KFE G+
Sbjct: 154 TGDPLREVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLHWPNEAEMKPITQKFESLSGI 213
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGWP 228
G++ THI + P + + +Q+ +YS+ VQG+VD + F D+ GWP
Sbjct: 214 PYVGGSMYTTHIPIIAPKSNVNAYFNKHHTERNQKTSYSITVQGVVDSKGVFSDVCIGWP 273
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G ++ ++L+ S ++ G + ++V ++VG G+PL+ ++ PY
Sbjct: 274 GSLSDDQVLEKSALYQRATMG---------TLKDV----WVVGNSGHPLMDGVLVPYTHQ 320
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR--KLPSIILVCCLLHN 346
L+ + FN +S+A AF +LKG W L K R + + LP ++ CC+LHN
Sbjct: 321 NLTWTQHAFNQKVGEIQSIAKDAFARLKGRWSCLQK---RTEVKLEDLPVLLGACCVLHN 377
Query: 347 IIIDSGDQLHP 357
I +++ P
Sbjct: 378 ICEMRDEEMDP 388
>gi|356561879|ref|XP_003549204.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 451
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 156/330 (47%), Gaps = 42/330 (12%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW P EE F+ FR+SK TFD IC + + V++ + L
Sbjct: 109 DWWQR----------CNSPDFPEEEFRLHFRMSKATFDMICQHL-DSAVTKKNTML---- 153
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+ V ++VA+ + RLA+G+ V FG+G ST ++ A++ L WP
Sbjct: 154 REAIPVRQRVAVCIWRLATGDPLREVSKRFGLGISTCHKLVLEVCSAIKSVLMPKFLHWP 213
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLV 210
+ M+ I KFE G+ N G++ THI + P + + +Q+ YS+ V
Sbjct: 214 NEAEMKPITEKFESLSGIPNVGGSMYTTHIPIIAPKSNVNAYFNKRHTERNQKTCYSITV 273
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE-YI 269
QG+VD + F D+ GWPG ++ ++L+ S ++ G L++ ++
Sbjct: 274 QGVVDPKGIFSDVCIGWPGSLSDDQVLEKSALYQRATMGN--------------LKDVWV 319
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
VG G+PL+ ++ PY L+ + FN +S+A AF +LKG W L K R
Sbjct: 320 VGNSGHPLMEGVLVPYTHQNLTWTQHAFNQKVGEIQSIAKDAFARLKGRWSCLQK---RT 376
Query: 330 DKR--KLPSIILVCCLLHNIIIDSGDQLHP 357
+ + LP ++ CC+LHNI +++ P
Sbjct: 377 EVKLEDLPVLLGACCVLHNICEMRDEEMKP 406
>gi|326673584|ref|XP_003199930.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 361
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 162/331 (48%), Gaps = 29/331 (8%)
Query: 34 SSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGL 93
S SDWW + T ++ + FRVS++TF YICS ++ P
Sbjct: 19 SRTSDWWEN-----------TAQKYTDDQWLEDFRVSRETFRYICSTLK------PSLQR 61
Query: 94 INIEGRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH 152
++ RL + + K+VAIAL +LAS +V F V +++V + F +A+ +
Sbjct: 62 LDTSFRLCIPLAKRVAIALYKLASTTEYRTVTNLFAVSRTSVCRCVHDFCKAVIAVLRPK 121
Query: 153 L-KWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQ 211
L PD +M EI FE+ FG+ C GAID +HI + P Q D+ +++ +S+++Q
Sbjct: 122 LINTPDQAKMAEIADYFEDKFGIPQCVGAIDGSHIPILKPP-QYQSDFHNRKGWHSIILQ 180
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR-ISSEEVELREYIV 270
+VD + F D+ G PG + + +LK S + + G V+ I EV +I+
Sbjct: 181 AVVDGKGLFWDLNVGQPGREHDASVLKKSCLWTWATSTTAFPGRVKNICGTEVGY--FIL 238
Query: 271 GGVGYPLLSWLITPY-ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
G YPL WL+ PY +T L+ + +N R + AF +LKG W+ LSK R
Sbjct: 239 GDSAYPLQKWLLKPYPDTGRLTEAQELYNMRTSRARCVVEHAFGRLKGRWKCLSK---RN 295
Query: 330 D--KRKLPSIILVCCLLHNIIIDSGDQLHPD 358
D + ++ CC LHN+ D+ P+
Sbjct: 296 DCNVNVVVDMVETCCTLHNLCELHMDRFVPE 326
>gi|242091581|ref|XP_002441623.1| hypothetical protein SORBIDRAFT_09g030610 [Sorghum bicolor]
gi|241946908|gb|EES20053.1| hypothetical protein SORBIDRAFT_09g030610 [Sorghum bicolor]
Length = 457
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 91/314 (28%), Positives = 147/314 (46%), Gaps = 38/314 (12%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
E F+ FR+ + TF +C ++ +ED R + V ++VA+ + RL
Sbjct: 128 EADFRAAFRMGRATFAMLCDALGSAVAKEDTALRA----------AIPVRQRVAVCVWRL 177
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWPDSNRMEEIKSKFEESFG 173
A+GE V FG+G ST ++ A+ L+WPD+ + K++F+ G
Sbjct: 178 ATGEPLRLVSKRFGLGISTCHKLVLEVCAAIRGLLMPRFLRWPDAAAADAFKARFQAESG 237
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ GAI THI + P + + + + +YS+ +QG+V + F D+ GW
Sbjct: 238 IPGVVGAIYTTHIPIIAPKTSVAAYFNRRHTERNHKTSYSITLQGVVGPDGAFTDVCIGW 297
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG M ++L+ S + AG + G+ ++VGG +PL W++TPY
Sbjct: 298 PGSMPDDQVLERSALQQRAAAGM-MAGS------------WVVGGASFPLTEWVLTPYAQ 344
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + FN R +AV AF +LKG W L K + LP ++ CC+LHNI
Sbjct: 345 ANLTWAQHAFNEKVAEVRRVAVDAFARLKGRWACLQKRT-EVKLQDLPVVLGACCVLHNI 403
Query: 348 IIDSGDQLHPDVAL 361
G+ + D AL
Sbjct: 404 CESRGEGM--DAAL 415
>gi|156375358|ref|XP_001630048.1| predicted protein [Nematostella vectensis]
gi|156217061|gb|EDO37985.1| predicted protein [Nematostella vectensis]
Length = 283
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 142/292 (48%), Gaps = 24/292 (8%)
Query: 67 FRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
F VS+ TF YI + L R+D + R + VEK+VA+AL RLA+G
Sbjct: 4 FHVSRNTFMYILNTFGSQLARKDTLMRES----------VPVEKRVAVALFRLATGSPYN 53
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSK---FEESFGLLNCC 178
G FG+G+ T + F E L + AK +K+P + E+ ++ FE +
Sbjct: 54 KTGEQFGIGRCTSLNIKDEFCEVLVKNAKDFIKFPTTE--EDTRAAIDGFEGKCQIPQLA 111
Query: 179 GAIDATHIIMTLPAVQTS--DDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
GAI A + P++ T + D +SM +Q + D +F+D+ TG+PGG+ +
Sbjct: 112 GAIKAVRFKIKAPSLVTHKMKYYNDNLKCHSMTLQAVTDSSGKFLDVSTGYPGGITNTEA 171
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMP 295
L S +K + G LN + + E ++++ I Y L W I PY + N LS S
Sbjct: 172 LAMSDIYKNVKQGVILN-SPSATVEGLQVKPIIATDASYKLRKWCICPYPDNNMLSPSQE 230
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
FNS + +A A +L+G W L+ + K+P +L+CC+LHNI
Sbjct: 231 RFNSELGSAVHIAEDALARLRGRWNTLAGSCLDENVEKIPETVLICCVLHNI 282
>gi|118120491|ref|XP_424152.2| PREDICTED: putative nuclease HARBI1 [Gallus gallus]
Length = 421
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 178/380 (46%), Gaps = 44/380 (11%)
Query: 9 SKKTKKVTKKL---KKSKSTSAAPKD-PNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFK 64
S++ K++ L ++ S P+ P+ +DWW K P D + F
Sbjct: 41 SQRQKRLMTLLIARRRRTSCYFYPRAWPSLRGADWWERVVLKEFG------PQDWLDKF- 93
Query: 65 YFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG 124
R+S++TF YIC+ +R L P S + L +EK+VA+AL LA+ ++
Sbjct: 94 ---RMSRETFFYICNRLRPGLA--PHSAHFHPT---LPLEKRVAVALWHLATNVEYQTLS 145
Query: 125 VAFGVGQSTVS----QVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
FGVG STV +V++ + L+ PD +E + F +G +C GA
Sbjct: 146 PLFGVGPSTVQTCVREVSYAVVLLLKPLYLRL---PDEKELENMVRIFRTRWGFPHCIGA 202
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
+D+ HI + P ++ S D+C+ + +S+L Q VD +F D+ T +PG M S +L+ S
Sbjct: 203 LDSLHIPIN-PPLRLSADYCNGQGWHSILTQATVDGLGQFWDVSTAFPGSMENSAVLESS 261
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFNS 299
+ L + G RL N + ++G YPL W++ PY E L+ FN
Sbjct: 262 SLWVLAKEG-RLCPNPPKDFMGKAQKYVLLGDATYPLQDWILKPYQEDESLTQRQLQFNY 320
Query: 300 LHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDV 359
+ S+ AFL+LK W+IL K LP+++L CC+LHN+
Sbjct: 321 RLKRAHSVIENAFLRLKARWQILLKCD-DCSLELLPTLVLACCILHNV------------ 367
Query: 360 ALSDHHDSGYGEQCCKQVDP 379
+ HD+ + E+ + +P
Sbjct: 368 --CEAHDNPFNEEWLEGTEP 385
>gi|218188534|gb|EEC70961.1| hypothetical protein OsI_02577 [Oryza sativa Indica Group]
Length = 299
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 140/302 (46%), Gaps = 23/302 (7%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSN 159
+ V ++VA+ + RLA+GE V FG+G ST ++ A++ L+WPD
Sbjct: 6 IPVRQRVAVCIPRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEA 65
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGI 213
K +F+ ++G+ GA+ THI + P + + + +Q+ +YS+ +QG+
Sbjct: 66 AAAAFKERFQAAYGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGV 125
Query: 214 VDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGV 273
V + F D+ GWPG M ++L+ S + AG + +VGG
Sbjct: 126 VGPDGAFTDVCIGWPGSMPDDQVLEKSMLHQRAAAGMMHSAC-------------LVGGA 172
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
YPL+ W++ PY L+ + FN R +AV AF +LK W L K +
Sbjct: 173 SYPLMDWVLVPYTHQNLTWTQHAFNEKVGDLRRVAVDAFARLKARWACLQKRT-EVKLQD 231
Query: 334 LPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQ 393
LP ++ CC+LHNI G++L P++ D E + + R+N+ +L
Sbjct: 232 LPVVLGACCVLHNICETRGEELEPELRFELVDDETSPETPVR--SEAAKRARDNIAHNLL 289
Query: 394 HN 395
H
Sbjct: 290 HR 291
>gi|55771325|dbj|BAD72250.1| unknown protein [Oryza sativa Japonica Group]
gi|55771334|dbj|BAD72259.1| unknown protein [Oryza sativa Japonica Group]
Length = 439
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 154/334 (46%), Gaps = 49/334 (14%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPP 90
DS+WW S+P P E F+ FR+SK TF+ +C ++ +ED + R
Sbjct: 94 DSEWWDMV-----SSP--DYPDSE---FRKAFRMSKATFEVVCDELAAAVAKEDTMLRAA 143
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK 150
+ V K+VA+ + RLA+GE V FG+G ST ++ AL+
Sbjct: 144 ----------IPVRKRVAVCVWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAALKAMVM 193
Query: 151 HHL-KWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQE 203
+ +WP++ I + FE G+ GAI THI + P + + +Q+
Sbjct: 194 PKVVRWPEAGDAAAIAAHFEAISGISGVVGAIYTTHIPIIAPKSNVASYYNRRHTERNQK 253
Query: 204 NNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEV 263
+YSM VQ +VD F D+ GWPG + +L+ S + + G
Sbjct: 254 TSYSMTVQCVVDSTGAFTDVCIGWPGSNSDEEVLEKSALYLHRGVPGLIQG--------- 304
Query: 264 ELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS 323
+++VGG +PL+ W++ PY L+ + N A R +A AF +LK W L
Sbjct: 305 ---QWVVGGGSFPLMDWMLVPYTHQNLTWAQHMLNEKVAAVRGVARDAFERLKRRWGCLQ 361
Query: 324 KVMWRPDKR--KLPSIILVCCLLHNIIIDSGDQL 355
K R + + LP+++ CC+LHNI SGD +
Sbjct: 362 K---RTEVKLLDLPTVLGACCVLHNICERSGDAV 392
>gi|215740538|dbj|BAG97194.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 154/334 (46%), Gaps = 49/334 (14%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPP 90
DS+WW S+P P E F+ FR+SK TF+ +C ++ +ED + R
Sbjct: 108 DSEWWDMV-----SSP--DYPDSE---FRKAFRMSKATFEVVCDELAAAVAKEDTMLRAA 157
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK 150
+ V K+VA+ + RLA+GE V FG+G ST ++ AL+
Sbjct: 158 ----------IPVRKRVAVCVWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAALKAMVM 207
Query: 151 HHL-KWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQE 203
+ +WP++ I + FE G+ GAI THI + P + + +Q+
Sbjct: 208 PKVVRWPEAGDAAAIAAHFEAISGISGVVGAIYTTHIPIIAPKSNVASYYNRRHTERNQK 267
Query: 204 NNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEV 263
+YSM VQ +VD F D+ GWPG + +L+ S + + G
Sbjct: 268 TSYSMTVQCVVDSTGAFTDVCIGWPGSNSDEEVLEKSALYLHRGVPGLIQG--------- 318
Query: 264 ELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS 323
+++VGG +PL+ W++ PY L+ + N A R +A AF +LK W L
Sbjct: 319 ---QWVVGGGSFPLMDWMLVPYTHQNLTWAQHMLNEKVAAVRGVARDAFERLKRRWGCLQ 375
Query: 324 KVMWRPDKR--KLPSIILVCCLLHNIIIDSGDQL 355
K R + + LP+++ CC+LHNI SGD +
Sbjct: 376 K---RTEVKLLDLPTVLGACCVLHNICERSGDAV 406
>gi|21592524|gb|AAM64474.1| unknown [Arabidopsis thaliana]
Length = 515
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 168/365 (46%), Gaps = 37/365 (10%)
Query: 37 SDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI 96
+DWW S P P DE F+ FR+SK TF+ IC + + V++ + L +
Sbjct: 172 TDWWDRV-----SRPD--FPEDE---FRREFRMSKSTFNLICEEL-DTTVTKKNTMLRDA 220
Query: 97 EGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKW 155
+ K+V + + RLA+G V FG+G ST ++ A+ + +L W
Sbjct: 221 ----IPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLW 276
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSML 209
P + + K+KFE + N G+I THI + P V + + +Q+ +YS+
Sbjct: 277 PSDSEINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSIT 336
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
VQG+V+ + F D+ G PG + ++L+ S + A G +R S +I
Sbjct: 337 VQGVVNADGIFTDVCIGNPGSLTDDQILEKSSLSRQRAA----RGMLRDS--------WI 384
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
VG G+PL +L+ PY L+ + FN + +A AF +LKG W L K
Sbjct: 385 VGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQKRT-EV 443
Query: 330 DKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
+ LP ++ CC+LHNI +++ P++ D E + + TR+++
Sbjct: 444 KLQDLPYVLGACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSASAV--NTRDHIS 501
Query: 390 KHLQH 394
+L H
Sbjct: 502 HNLLH 506
>gi|4914402|emb|CAB43653.1| hypothetical protein [Arabidopsis thaliana]
gi|7269877|emb|CAB79736.1| hypothetical protein [Arabidopsis thaliana]
Length = 515
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 168/365 (46%), Gaps = 37/365 (10%)
Query: 37 SDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI 96
+DWW S P P DE F+ FR+SK TF+ IC + + V++ + L +
Sbjct: 172 TDWWDRV-----SRPD--FPEDE---FRREFRMSKSTFNLICEEL-DTTVTKKNTMLRDA 220
Query: 97 EGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKW 155
+ K+V + + RLA+G V FG+G ST ++ A+ + +L W
Sbjct: 221 ----IPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLW 276
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSML 209
P + + K+KFE + N G+I THI + P V + + +Q+ +YS+
Sbjct: 277 PSDSEINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSIT 336
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
VQG+V+ + F D+ G PG + ++L+ S + A G +R S +I
Sbjct: 337 VQGVVNADGIFTDVCIGNPGSLTDDQILEKSSLSRQRAA----RGMLRDS--------WI 384
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
VG G+PL +L+ PY L+ + FN + +A AF +LKG W L K
Sbjct: 385 VGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQKRT-EV 443
Query: 330 DKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
+ LP ++ CC+LHNI +++ P++ D E + + TR+++
Sbjct: 444 KLQDLPYVLGACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSASAVN--TRDHIS 501
Query: 390 KHLQH 394
+L H
Sbjct: 502 HNLLH 506
>gi|30688449|ref|NP_567834.2| uncharacterized protein [Arabidopsis thaliana]
gi|27754582|gb|AAO22738.1| unknown protein [Arabidopsis thaliana]
gi|28973634|gb|AAO64139.1| unknown protein [Arabidopsis thaliana]
gi|332660275|gb|AEE85675.1| uncharacterized protein [Arabidopsis thaliana]
Length = 540
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 168/365 (46%), Gaps = 37/365 (10%)
Query: 37 SDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI 96
+DWW S P P DE F+ FR+SK TF+ IC + + V++ + L +
Sbjct: 197 TDWWDRV-----SRPD--FPEDE---FRREFRMSKSTFNLICEEL-DTTVTKKNTMLRDA 245
Query: 97 EGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKW 155
+ K+V + + RLA+G V FG+G ST ++ A+ + +L W
Sbjct: 246 ----IPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLLW 301
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSML 209
P + + K+KFE + N G+I THI + P V + + +Q+ +YS+
Sbjct: 302 PSDSEINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSIT 361
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
VQG+V+ + F D+ G PG + ++L+ S + A G +R S +I
Sbjct: 362 VQGVVNADGIFTDVCIGNPGSLTDDQILEKSSLSRQRAA----RGMLRDS--------WI 409
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
VG G+PL +L+ PY L+ + FN + +A AF +LKG W L K
Sbjct: 410 VGNSGFPLTDYLLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQKRT-EV 468
Query: 330 DKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
+ LP ++ CC+LHNI +++ P++ D E + + TR+++
Sbjct: 469 KLQDLPYVLGACCVLHNICEMRKEEMLPELKFEVFDDVAVPENNIRSASAVN--TRDHIS 526
Query: 390 KHLQH 394
+L H
Sbjct: 527 HNLLH 531
>gi|357127555|ref|XP_003565445.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 470
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 115/398 (28%), Positives = 177/398 (44%), Gaps = 73/398 (18%)
Query: 5 KKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPG------------ 52
KKS + T K KK S+S A + P+ S ++ P+ SS
Sbjct: 63 KKSSASGTGK-----KKRGSSSKAQRSPSGSRTETTPAPRGGRSSKHAHQRLLANYRAAE 117
Query: 53 ----ATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSV 103
+ PS E FK F VS++TF Y+C S+ +ED + V
Sbjct: 118 WWREVSSPSYPERDFKREFHVSRRTFHYLCDELGASVGKEDTWLH----------AAVPV 167
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRME 162
K+VA L RLA G ++V FGV STV ++ F A+ + WP++N
Sbjct: 168 PKRVAACLWRLAHGHPCLTVAERFGVSVSTVHKLRVDFCAAVNSVLLDRFIAWPETNPQA 227
Query: 163 EI--KSKFEESFGLLNCCGAIDATHIIMTLPA----------VQTSDDWCDQENNYSMLV 210
KFEE G+ GA+ TH+ + P +Q S + ++ Y++ +
Sbjct: 228 AAVAADKFEEVAGIHGVTGAVYTTHVPIVAPRQHVADYHNPRLQASRNDYNRSACYTVSI 287
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
QG VD FI++ G PG ++ +L KL + LN R + R ++
Sbjct: 288 QGTVDAHGAFINVCVGNPGSLSDEEVL-----LKLSD----LNNGAR---RQQPFR--VL 333
Query: 271 GGVGYPLLSWLITPYET-----NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
GG YPL +L+ PY + N L+ + N EA ++A AF++LK W L K
Sbjct: 334 GGASYPLTDYLLVPYSSSSATDNCLTPAQEEMNKAVEAGTAVAKDAFMRLKARWGFLRK- 392
Query: 326 MWRPDKR--KLPSIILVCCLLHNIIIDSGDQLHPDVAL 361
R + + +L ++I CC+LHNI + GD+L P++A
Sbjct: 393 --RSEVKIPELSNVIQACCMLHNICVRFGDELDPELAF 428
>gi|294461349|gb|ADE76236.1| unknown [Picea sitchensis]
Length = 368
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 148/319 (46%), Gaps = 24/319 (7%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
E FK FR+S TF+++C E LV++ + V ++V + L RLA+G
Sbjct: 28 ESASFKANFRMSPTTFEWLCKQF-EPLVTKTND---------VGVFERVGMGLFRLATGS 77
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
+ +V F + T T + K + +P + + +F+ GL NCC
Sbjct: 78 TYQAVSQRFVSTEPTARLCTKELCRVICTNLKFWVAYPSPCDLPNVSDEFQTLSGLPNCC 137
Query: 179 GAIDATHIIMT---LPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
GAID T T P + +D + S++ Q +VD R + I+TG+ G SR
Sbjct: 138 GAIDCTRFKFTNTNYPYIYNEED---DDLQQSVVTQIVVDCSSRILSIITGFKGNKGDSR 194
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+L+ S + E G+ LN + + V +R+Y+VG GYPLL WL+ PY +++
Sbjct: 195 VLRSSTLYADIEEGRLLN-SPPFHLKGVPVRQYLVGDSGYPLLPWLMVPYTDPVVTSCQE 253
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII--DSGD 353
FNS H R A++ L+ W ILSK + D R I C +LHNI++ + G
Sbjct: 254 DFNSKHYVMRQAALRTLSCLR-KWGILSKPI-EEDVRVAVGCIGACAILHNILLAREGGS 311
Query: 354 QL---HPDVALSDHHDSGY 369
+ D AL D Y
Sbjct: 312 TILEGTEDYALHDQSSQYY 330
>gi|326667245|ref|XP_003198536.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 349
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 91/333 (27%), Positives = 156/333 (46%), Gaps = 20/333 (6%)
Query: 67 FRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
R+ + +F +C L R++ R P + VE +VAI + RLA+
Sbjct: 27 LRMERSSFILLCDTLRPWLTRQNTRYRKP----------VPVEIRVAICIWRLATNLEYR 76
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
S+ FGVG ST +T + A+ K ++K P + + I F + +G GA
Sbjct: 77 SISHLFGVGVSTCCIITQEVVTAINVIMKPQYIKKPSAAEFKMIVQGFRDRWGFPQVAGA 136
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID THI + P+ T D+ +++ NYS+++Q +VD++M+F DI G PG ++ +R+ S
Sbjct: 137 IDGTHINIKAPS-NTPADYYNRKGNYSIVLQAVVDNKMKFWDINVGQPGKVHDARVFCLS 195
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFNS 299
F +G L + E +++ +++G YPL WL+ PY E G++ FN
Sbjct: 196 SLFDGGSSGTLLPTWTE-TFEAIDVPLFLLGDSAYPLSHWLMKPYPEGRGVTPEQIKFNH 254
Query: 300 LHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDV 359
R +AF +LKG WR L K + I+ CC+LHN ++ +
Sbjct: 255 RLSQARMTVERAFGRLKGRWRCLLKQC-EAHITLVSRIVSACCVLHNFCEVRNEEFYGRD 313
Query: 360 ALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ + + + +Q + G R+ L ++
Sbjct: 314 DVREEEEQSQDARPQEQENVQGNNVRDALCRYF 346
>gi|297597079|ref|NP_001043414.2| Os01g0582600 [Oryza sativa Japonica Group]
gi|255673396|dbj|BAF05328.2| Os01g0582600 [Oryza sativa Japonica Group]
Length = 500
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 131/279 (46%), Gaps = 35/279 (12%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P EE F+ FR+ ++TFD IC ++ +ED + R + V ++VA+
Sbjct: 218 PDYPEEEFRRAFRMGRETFDMICEALGSAVAKEDTMLRAA----------IPVRQRVAVC 267
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+GE V FG+G ST ++ A++ L+WPD K +F+
Sbjct: 268 IWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAAIKSVLMPRFLQWPDEAAAAAFKERFQ 327
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDI 223
++G+ GA+ THI + P + + + +Q+ +YS+ +QG+V + F D+
Sbjct: 328 AAYGVPGVIGAMYTTHIPIIAPKISVAAYFNRRHTERNQKTSYSITLQGVVGPDGAFTDV 387
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
GWPG M ++L+ S + AG + +VGG YPL+ W++
Sbjct: 388 CIGWPGSMPDDQVLEKSMLHQRAAAGMMHSAC-------------LVGGASYPLMDWVLV 434
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
PY L+ + FN R +AV AF +LK W L
Sbjct: 435 PYTHQNLTWTQHAFNEKVGDLRRVAVDAFARLKARWACL 473
>gi|270004721|gb|EFA01169.1| hypothetical protein TcasGA2_TC010492 [Tribolium castaneum]
Length = 319
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 131/285 (45%), Gaps = 31/285 (10%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWPDSNRME 162
EK + + L LA+ +S S+G FG+ +S+V I L E +KWP +
Sbjct: 35 EKALLVTLWTLATPDSYRSIGERFGIAKSSVYFCLQTVINVILHELTHRFIKWPTEDETV 94
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
+ ++F + +GL G ID HI + P V+ S D+ +++ YSM++QG+ ++ FID
Sbjct: 95 YVANRFAK-YGLPKVIGVIDGCHIPIKKP-VENSIDYFNRKKFYSMVLQGVCKPDLTFID 152
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLI 282
WPG ++ R+L+ S + + E R N Y++G YPL WLI
Sbjct: 153 CDIRWPGSVHDGRVLRTSDIYPIAEQLCRPN-------------YYVIGDSAYPLKKWLI 199
Query: 283 TPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
TPY NG L FN + TR + F LKG +R L + R + ++
Sbjct: 200 TPYRNNGHLLPQQVYFNRMLAKTRVVIENTFALLKGRFRRLKDYLDRDKPEDIIQTVMAA 259
Query: 342 CLLHNIIIDSG--------------DQLHPDVALSDHHDSGYGEQ 372
C LHNI + G D P VAL + +D+ E+
Sbjct: 260 CTLHNICLVEGEDEVEQYILEGRDDDHAIPMVALYNQNDNEAAER 304
>gi|357131373|ref|XP_003567313.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 455
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 148/315 (46%), Gaps = 39/315 (12%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P + F+ FR+ ++TFD IC ++ +ED SRP + V ++VA+
Sbjct: 127 PDYPDTKFRSHFRMCRETFDMICDELGSAIAKEDTPSRPA----------IPVCQRVAVC 176
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+ RLA+G++ + FG+ ST ++ A+E ++WP + +++ F+
Sbjct: 177 IWRLATGDAFHVISNRFGLANSTCQKIVLEVCVAIESILMPRFIRWP--GQATALEASFK 234
Query: 170 ESFGLLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
G+ N G++ THI + P + + C + + YS++ QG+VD + FI + G
Sbjct: 235 AISGIPNVIGSMYTTHISIIAPRFFPLHYCNHICIRGSFYSIIFQGVVDPDGTFIHVSIG 294
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
WPG M+ R++ S L GN ++ ++VGG YPL WL+ PY
Sbjct: 295 WPGSMSDDRVVDQSHL-------SHLYGNDAMAGS------WVVGGTNYPLNDWLLVPYT 341
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL--VCCLL 344
L+ FN +A AF +LKG W L K R D +++ C +L
Sbjct: 342 HQNLTREQQIFNENVTKLNHVAKDAFARLKGRWACLQK---RADVNIKNFVLMAGACSVL 398
Query: 345 HNIIIDSGDQLHPDV 359
HNI + +++ P++
Sbjct: 399 HNICEVNKEEMDPEL 413
>gi|322785647|gb|EFZ12298.1| hypothetical protein SINV_06104 [Solenopsis invicta]
Length = 365
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 146/307 (47%), Gaps = 20/307 (6%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
+++ FK FR+ + T+ YI L+R DL P GR +S KQ+ +AL LA+
Sbjct: 57 DDQEFKSNFRLRRDTYGYILDLIRPDLEKWPERF-----GRYPISASKQLYVALWMLATP 111
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
++ SV F +G++T + R ++AL + + WP + E +K E+ +
Sbjct: 112 DAYRSVCTKFDIGKATAWRSVLRVVKALYKLRNVFISWPTREQAENTWTKLEQQYQFPGI 171
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GA+D T + +T P V C +++ +++ +Q + D E+RFI G P ++ R
Sbjct: 172 IGAVDGTLVKITAPKVHPEAYIC-RKHYHAIQLQVVCDAELRFIHCYAGQPESVHDMRTF 230
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPT 296
+SG + C N E+ +++G Y + ++ PY NG L+A+
Sbjct: 231 MYSGLKQKC--------NEYFFPED----SHLLGDAAYTIQRHIMVPYRDNGHLTAAENL 278
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLH 356
FN + R + ++ LKG WR L + +P I+ CC+LHNI + D++
Sbjct: 279 FNKKLSSARMIVERSIGLLKGRWRCLLDKLPMTRTDLIPRYIIACCVLHNICLLRRDEIE 338
Query: 357 PDVALSD 363
+ + D
Sbjct: 339 IPILIED 345
>gi|224056104|ref|XP_002298728.1| predicted protein [Populus trichocarpa]
gi|222845986|gb|EEE83533.1| predicted protein [Populus trichocarpa]
Length = 433
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 149/316 (47%), Gaps = 20/316 (6%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGL-INIEGRLLSVEKQVAIALRRLASGES 119
+ FK FR+ TF+++ L+ L R P G IN LS E ++ I L RLA+G S
Sbjct: 100 DSFKTTFRMRSSTFEWLSGLLEPLLECRDPIGTPIN-----LSSELRLGIGLFRLATGSS 154
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCG 179
+ + FGV +S + L + + +P S ++ + E GL NCCG
Sbjct: 155 YIEIAGRFGVTESVTRFCAKQLCRVLCTNFRFWIAFPTSTELQLVSKDIEGLTGLPNCCG 214
Query: 180 AIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
ID T ++ + + D D+ + S+ VQ +VD R + I+ G+ G N SR+LK
Sbjct: 215 VIDCTRFNVVKRNDCKLASD--DEVQDDSIAVQIVVDSSSRILSIIAGFRGDKNDSRILK 272
Query: 239 FSGFFKLCE--AGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
+ LC G+RL + V + +Y++G GYPLL WL+ P+ +S
Sbjct: 273 ST---TLCHDIEGRRLLNATPVIVNGVAIDQYLIGDGGYPLLPWLMVPFVDVVPGSSEEK 329
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLH 356
FN+ + A++ LK W +L+K + K + + I C +LHN+++ D
Sbjct: 330 FNAANNLMHVFALRTIASLK-NWGVLNKPVEEEFKTAV-AFIGACSILHNVLLMRED--- 384
Query: 357 PDVALSDHHDSGYGEQ 372
D AL D D +Q
Sbjct: 385 -DSALIDVEDYSLYDQ 399
>gi|255555409|ref|XP_002518741.1| conserved hypothetical protein [Ricinus communis]
gi|223542122|gb|EEF43666.1| conserved hypothetical protein [Ricinus communis]
Length = 445
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 154/322 (47%), Gaps = 23/322 (7%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
+ F+ FF++ TF+++ L+ L R P G LS E ++ + L RLA+G +
Sbjct: 108 DSFRTFFKMKASTFEWLSGLLEPLLDCRDPIG----SPLSLSAELRLGVGLFRLATGSNY 163
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
+ FGV +S + L + + +P ++ + + FE+ GL NCCG
Sbjct: 164 SEIADRFGVTESAARFCAKQLCRVLCTNFRFWVSFPSPVELQSVSNAFEKLIGLPNCCGV 223
Query: 181 IDATHIIMTLPAVQTSDDWC-----DQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
ID+ + V+ +DD DQ++ + Q +VD R + IV G+ G SR
Sbjct: 224 IDSARFNL----VKKADDKLASNGKDQDD--MIAAQIVVDSSSRILSIVAGFRGEKGNSR 277
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+LK + +K E G+ LN + I + V + Y++GG YPLL WL+ P+ +
Sbjct: 278 MLKSTTLYKDIEGGRVLNSSPEIVN-GVAINRYLIGGGRYPLLPWLMVPFLDALPGSCEE 336
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
FN ++ R +++A LK W +LS+ + K + ++I C +LHN ++ D
Sbjct: 337 KFNKANDLMRVSSLRAIASLK-NWGVLSRPIQEEFKTAV-ALIGACSILHNALLMRED-- 392
Query: 356 HPDVALSDHHD-SGYGEQCCKQ 376
D AL D S Y +QC +
Sbjct: 393 --DSALLDMGGYSLYNQQCSQH 412
>gi|357128294|ref|XP_003565809.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
gi|255316766|gb|ACU01765.1| hypothetical protein [Brachypodium distachyon]
Length = 463
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 148/330 (44%), Gaps = 42/330 (12%)
Query: 43 FWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIE 97
+W + SS A+ P D F+ FR+S+ TF +C ++ +ED R
Sbjct: 114 WWEQRSS---ASYPDD---AFRRDFRMSRATFAMVCDALGAAVAKEDTALRA-------- 159
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+ V ++VA+ + RLA+GE V FG+G ST ++ A+ L+W
Sbjct: 160 --AIPVPQRVAVCVWRLATGEPLREVSKRFGIGISTCHKLVLEVSAAIRSILMPRFLQWA 217
Query: 157 DSNRME-EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSML 209
+ + K++FE G+ GA+ H+ + P + + + + + +YS+
Sbjct: 218 STAEAQARQKARFEACSGVPGVVGAMLTAHVPIIAPKLSVAAYFNRRHTERNGKTSYSVT 277
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
+QG+V + F D+ GWPG M ++L S G G+ ++
Sbjct: 278 LQGVVGPDGAFTDVCIGWPGSMPDDQVLARSALHNRAANGGMPAGS------------WV 325
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
V G +PL W + PY + L+ + FN R +AV+AF +LK W L K
Sbjct: 326 VAGASFPLTDWTLVPYTHHNLTWTQHAFNDKVAGIRLVAVRAFRRLKARWVCLQKRT-EV 384
Query: 330 DKRKLPSIILVCCLLHNIIIDSGDQLHPDV 359
+ LP ++ CC+LHNI G+ L PD+
Sbjct: 385 KLQDLPVVLGACCVLHNICEIRGEPLAPDL 414
>gi|291233883|ref|XP_002736880.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 389
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 152/309 (49%), Gaps = 29/309 (9%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR-LLSVEKQVAIALR 112
T+P ++ F+ FRV F+Y+ ++ L+ P I+ GR + +K++AI L+
Sbjct: 50 TVPQYQDRRFRRAFRVPHSVFEYLADILTGPLIENEP---IHCGGRPAILPDKKIAIFLK 106
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWPDSNRMEEIKSKFEE- 170
+ S E+ + + F + +S+V +V + L + + WP ++ I + F +
Sbjct: 107 CVGSMETILDIAQLFNITESSVIKVRRQVTNVILIHLLNNTIHWPLRRELQGISACFNDM 166
Query: 171 -SFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+ L N GA+D +HI ++ P Q D + +++ S+++QG+ +++FID+ G PG
Sbjct: 167 HTSNLPNIIGAVDGSHIPISTPHEQ-PDAYYNRKKFRSVVLQGVCREDLQFIDVSVGCPG 225
Query: 230 GMNVSRLLKFSGFFKL----CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
M+ +R+L+ S + C+ GQ +++G YPL +WL+TPY
Sbjct: 226 RMHDARVLRNSTLWNTGMGNCQRGQ----------------YHVIGDAAYPLTNWLMTPY 269
Query: 286 ETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
NG L FN+ R + +AF LK +R L + D ++ I+L C++
Sbjct: 270 RDNGHLDEHQRLFNTSLSRRRVIIERAFGSLKRRFRRLLNGIDITDVNEINKIVLAACII 329
Query: 345 HNIIIDSGD 353
HNI I D
Sbjct: 330 HNICIKHDD 338
>gi|331215241|ref|XP_003320301.1| hypothetical protein PGTG_01213 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 352
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 146/298 (48%), Gaps = 20/298 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F FR+S + F ++ +R+ L P G LSVE QVA+ L RL G S ++
Sbjct: 54 FLELFRMSVEDFRWLSDSLRDLLQLDPLR-----RGDPLSVEAQVAVGLYRLGHGSSYLT 108
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRME---EIKSKFEESFGLLNCC 178
+G F +G+ T + RF+ A+ R + + +P R + EI + FE G+ N
Sbjct: 109 IGHVFNIGKETADKAAGRFVIAVLHRFRLAAISYPPLGRQDQWDEISASFEAKHGIPNVV 168
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID THI + +PA + ++++ S++ Q +VD + F ++ G PG M+ RL
Sbjct: 169 GAIDGTHIPLAVPAEDRWKGYINRKSWVSIVFQCVVDGDGNFRNVSGGAPGSMHDGRL-- 226
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
F+ E G + + + + + Y++G GYP ++ PY + + FN
Sbjct: 227 ----FRRPELGHSITAGTMVPA-MIPVGTYLIGDAGYPSNVRVLLPYPSIA-APENEHFN 280
Query: 299 SLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
+ +TR + +AF +LK +RI L M RP + + + C +LHN++ G L
Sbjct: 281 FIQSSTRIIVEQAFGRLKNRFRILLHPQMARPVRAR--NNAFACMILHNLMNKRGTLL 336
>gi|340372975|ref|XP_003385019.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 386
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 93/343 (27%), Positives = 159/343 (46%), Gaps = 49/343 (14%)
Query: 37 SDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPS 91
S WW +F + G I + +E F R+S+ +F +C + R+D V RP +
Sbjct: 48 SLWWDNFLS------GVVISEEWKENF----RMSRNSFHQLCDELRTYIERQDTVMRPST 97
Query: 92 GLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AK 150
SVEKQVAI L L+ +FGV +STVS + A+ +
Sbjct: 98 ----------SVEKQVAITLYYLSDEGRMRKTANSFGVSRSTVSIIVRLVTSAITQYLGP 147
Query: 151 HHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSML 209
++ P +N ++E F ++FG C G +D THI + P + D+ ++++ +S+
Sbjct: 148 KYISLPLTANDVKEKTVGFFKAFGFPQCIGTVDGTHIEIKAPR-RNPTDYINRKSKFSLN 206
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR---ISSEEVELR 266
+Q D+ +FID+V WPG ++ +R+ A +LN +R I S +V+L
Sbjct: 207 IQACCDYRYQFIDVVIKWPGSVHDARVF----------ANSKLNNMLRDETIPSCKVQLV 256
Query: 267 E-------YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGW 319
E +I+G YPL+ +++ Y G + F + R++ +F +LK +
Sbjct: 257 ENEDPIPIFIIGDPAYPLMPYVMKEYTGGGATVHEQYFGYKLCSARNVIECSFGRLKARF 316
Query: 320 RILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALS 362
L + M + LPS+I C +LHN + + D+ S
Sbjct: 317 ACLKRAM-DTNLEDLPSVIYACFVLHNFCESRNENVQDDLVRS 358
>gi|390339191|ref|XP_003724948.1| PREDICTED: putative nuclease HARBI1-like isoform 1
[Strongylocentrotus purpuratus]
gi|390339193|ref|XP_795639.3| PREDICTED: putative nuclease HARBI1-like isoform 2
[Strongylocentrotus purpuratus]
Length = 413
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 140/288 (48%), Gaps = 12/288 (4%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR++K TF Y+ ++ +S+ + + ++++AI L RLA+ S ++
Sbjct: 89 FRMTKATFRYLLKELKPH-ISKKDTNF----RKATPPDQRLAITLWRLAANISYSTLADL 143
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
F VG S+V +T A+ + L P R++E +FE+ G AI HI
Sbjct: 144 FEVGISSVCLITLTVTAAINSVFRKRLTLPTGERLQECIQRFEDK-GFPQAVAAIGCCHI 202
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P Q ++ + + SM++Q +VD +FID+ G PG M+ + + + S ++
Sbjct: 203 PVKTPRKQ---EYINIKGFSSMILQAVVDSFGQFIDVSVGCPGSMDEAAVFRNSEMSRML 259
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTFNSLHEATR 305
+ N ++ E + ++GG YPL+ L+ PY NG L+ + FN +
Sbjct: 260 KDDSLFNLPPKVI-EATSIPVLVLGGEAYPLMPTLMKPYSVNGTLTGAQKVFNRRLSSCH 318
Query: 306 SLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
S +AF LKG W IL M K+PSI+L CC+L N++ GD
Sbjct: 319 STVNRAFEHLKGRWNILHN-MQDNQVTKVPSIVLACCILQNLVESRGD 365
>gi|331224673|ref|XP_003325008.1| hypothetical protein PGTG_06545 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 385
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 88/295 (29%), Positives = 144/295 (48%), Gaps = 20/295 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F FR+S + F ++ + +R+ L P G LSVE QVA+ L RL G S V+
Sbjct: 54 FIELFRMSIEDFRWLSNNLRDLLQLDPLR-----RGDPLSVEAQVAVGLYRLGHGCSYVT 108
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRME---EIKSKFEESFGLLNCC 178
+G F +G+ T + RF+ A+ R + + +P +R + EI + FE G+ N
Sbjct: 109 LGHVFNIGKETADKAAGRFVIAVLARFRRVAISYPPLDRADQWSEISASFEAKHGIPNVV 168
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID THI + PA + ++++ S++ Q +VD E F ++ G PG M+ RL
Sbjct: 169 GAIDGTHIPVATPAEDRWKGYINRKSWASIVFQCVVDGEGNFRNVSGGAPGSMHDGRL-- 226
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
F+ E G + R + + Y++G GYP ++ PY + + FN
Sbjct: 227 ----FRRSELGHSITTGTR-ARPMIPDGTYLIGDAGYPSNVRILLPYPSTA-TPENEHFN 280
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ +TR + + F +LK +RIL M RP + + + C +LHN++ G
Sbjct: 281 YIQSSTRIIVEQVFGRLKNRFRILLHAQMARPLRAR--NNAFACMILHNLLNKRG 333
>gi|331219511|ref|XP_003322432.1| hypothetical protein PGTG_03969 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 387
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 148/295 (50%), Gaps = 20/295 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F FFR+S + F ++ +R+ L P G LSVE QVA+ L RLA G S ++
Sbjct: 56 FLEFFRMSVEDFRWLSDSLRDLLQLDPLR-----RGDPLSVEAQVAVGLYRLAHGSSYLT 110
Query: 123 VGVAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWP---DSNRMEEIKSKFEESFGLLNCC 178
+G F +G+ T + RF+ A L++ + + +P + + ++I + FE G+ N
Sbjct: 111 IGHVFNIGKETADKAAGRFVIAVLQQFRRVAISYPALANQEQWDKISASFEAKHGIPNVV 170
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID THI + +PA + ++++ S++ Q +VD + F ++ G PG M+ RL
Sbjct: 171 GAIDGTHIPLAVPAEDRWKGYINRKSWASIVFQCVVDGDGNFRNVSGGAPGSMHDGRL-- 228
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
F+ E G + ++ + + Y++G GYP ++ PY + + FN
Sbjct: 229 ----FRRSELGHSITPG-SMAPAMIPMGTYLIGDAGYPSNVRVLLPYPSIA-TPENKHFN 282
Query: 299 SLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ TR + +AF +LK ++I L M RP + + + C +LHN++ G
Sbjct: 283 FIQSLTRIIVEQAFGRLKNQFQILLHPQMARPLRAR--NNAFACMILHNLLNKRG 335
>gi|302755214|ref|XP_002961031.1| hypothetical protein SELMODRAFT_34203 [Selaginella moellendorffii]
gi|300171970|gb|EFJ38570.1| hypothetical protein SELMODRAFT_34203 [Selaginella moellendorffii]
Length = 170
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 91/174 (52%), Gaps = 7/174 (4%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+ + F+ F V TF ++C + LV RPP L I GR L V KQVAIAL+ LASG+
Sbjct: 1 DPQRFREVFWVLVSTFTFLCDELAPKLVKRPPPSLAEIPGRHLPVSKQVAIALKHLASGD 60
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
++ AFGV T RF AL E + WPD M+ E G NCC
Sbjct: 61 IWQTIDDAFGVASCTTQACLHRFRYALLEHEGLMIHWPDEEGMK------ERLCGFPNCC 114
Query: 179 GAIDATHIIMTLPAVQTSDDW-CDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
GA+D THI + L + + DW + YSM+VQ +VD + F+DI TG G +
Sbjct: 115 GAMDCTHIAIELLGSEDAMDWYARAKKYYSMVVQDVVDSKTSFLDITTGIAGSV 168
>gi|331226603|ref|XP_003325971.1| hypothetical protein PGTG_07801 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 413
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/295 (29%), Positives = 144/295 (48%), Gaps = 20/295 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F FR+S + F ++ + +R+ L P G LSVE QVA+ L RL G S V+
Sbjct: 54 FIELFRMSIEDFRWLSNNLRDLLQLDPLR-----RGDPLSVEAQVAVGLYRLGHGCSYVT 108
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRME---EIKSKFEESFGLLNCC 178
+G F +G+ T + RF+ A+ R + + +P +R + EI + FE G+ N
Sbjct: 109 LGHVFNIGKETADKAAGRFVIAVLARFRRVAISYPPLDRADQWSEISASFEAKHGIPNVV 168
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID THI + PA + ++++ S++ Q +VD E F ++ G PG M+ RL
Sbjct: 169 GAIDGTHIPVATPAEDRWKGYINRKSWASIVFQCVVDGEGNFRNVSGGAPGSMHDGRL-- 226
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
F+ E G + R + + Y++G GYP ++ PY + + FN
Sbjct: 227 ----FRRSELGHSITTGTR-ARPMIPDGTYLIGDAGYPSNVRILLPYPSTA-TPENEHFN 280
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ +TR + + F +LK +RIL M RP + + + C +LHN++ G
Sbjct: 281 YIQSSTRIIVEQVFGRLKNRFRILLHAQMARPLRAR--NNAFACMILHNLLNKRG 333
>gi|331225553|ref|XP_003325447.1| hypothetical protein PGTG_07280 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 392
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 146/298 (48%), Gaps = 26/298 (8%)
Query: 63 FKYFFRVSKKTFDYICSLVR----EDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
F +FR+S+ F ++C +R +D + R G L+VE QVA+ L RL G
Sbjct: 59 FLEYFRMSRADFAWLCEELRGTLDQDHLRR---------GAPLTVEAQVAVGLYRLGHGA 109
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEA----LEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
+ V++ F +G+ T + T RF++A L R + +EIK FE G+
Sbjct: 110 TYVTISHVFNIGKETADKATGRFVKAVLKILRLRTISFPALDAHDEWDEIKESFESRHGI 169
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
+ GAID THI +++P + +++N S++ Q +VD + F D+ G G ++ +
Sbjct: 170 PDVVGAIDGTHIPLSMPPGDRWKGYINRKNWPSLVFQCVVDGDGNFRDVFGGGAGSIHDT 229
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
R+ F+ + G LN + S + +++G GYP ++ PY + L+
Sbjct: 230 RV------FRRSDLGVSLNRALGQPS-MIPTGSHLIGDAGYPSDVNVLVPYPSI-LAPEN 281
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
FN + ATR + +AF +LK +RIL R + + + VC +LHN++ +G
Sbjct: 282 EYFNYIQSATRIVVEQAFGRLKNRFRILLTAQ-RANPIRARNTTFVCMILHNLLNRTG 338
>gi|347970693|ref|XP_559600.5| AGAP003812-PA [Anopheles gambiae str. PEST]
gi|333466789|gb|EAL41345.3| AGAP003812-PA [Anopheles gambiae str. PEST]
Length = 445
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 82/296 (27%), Positives = 140/296 (47%), Gaps = 21/296 (7%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
D E+ +FR+ + +D + +++R+DL P + + S EK+V I L +L +G
Sbjct: 93 DNEDDLLRYFRMGRAMYDTLVAMLRQDLAPHPL-----LAAQSCSAEKKVGIGLYKLTTG 147
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
++G FGV ++TV +F ++L + + PD M EI FE+ +
Sbjct: 148 ADYTTIGNMFGVHKATVKNCVHQFCKSLVKNYMDSEIYLPDQEEMAEISKSFEDKSDIPL 207
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
GA+ H+ +T P++ S ++ + S+++Q +VD+ F I G G S +
Sbjct: 208 VIGAMGRLHVPIT-PSLADSKNYINGRKWPSLILQAVVDNNHCFRHITCGHVGATEDSVV 266
Query: 237 LKFSGFFKLCEAG----QRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
L SG ++ + Q +NGN ++ +IV YPLL WL Y +
Sbjct: 267 LGDSGLYQHFDGADLPSQNINGNT--------VKSFIVSDTVYPLLPWLQHGY-MTPTTT 317
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
TFN R +AF +L+ +RIL + + D +P I+L CC+LHN +
Sbjct: 318 EEETFNEHLNKARVAVDEAFDRLRARFRILQRKI-DIDINFVPQILLTCCILHNFL 372
>gi|340381732|ref|XP_003389375.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 399
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 85/288 (29%), Positives = 140/288 (48%), Gaps = 47/288 (16%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+SK TF ++C +R + + + +SVE++V I L A+ S+G
Sbjct: 102 FRLSKSTFMHLCDQLRPYIQHQDTRF-----RKCISVERRVGITLWCPATCGEYRSIGHL 156
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS---KFEESFGLLNCCGAIDA 183
FGV R++E+K+ F+ FG++ C G+ID
Sbjct: 157 FGVA-----------------------------RVDEVKAVVNGFKIKFGMIQCLGSIDG 187
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
+HI + PA+ + D+ +++ YSM++Q +VDH F DI WPG ++ +R+ S +
Sbjct: 188 SHIPVMPPALNHT-DYYNRKGYYSMILQAVVDHNYVFRDINIRWPGSVHDARVFVNSSLY 246
Query: 244 KLCEAGQRLNGN-VRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTFNSLH 301
+ L GN ++I + V L +++G YPL SWL+ P+ N L+ S +N
Sbjct: 247 HKAINREILTGNELKIDDKVVPL--FLIGDSAYPLSSWLMKPFAHNTELNDSERKYNYYL 304
Query: 302 EATRSLAVKAFLQLKGGWRILSKVMWRPDKR--KLPSIILVCCLLHNI 347
+R + AF +LK WR L K R D +P ++ CC+LHN+
Sbjct: 305 SKSRIVVENAFGRLKARWRRLLK---RNDMMIDNVPYVVSACCILHNV 349
>gi|449680215|ref|XP_004209527.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 382
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 146/291 (50%), Gaps = 11/291 (3%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E +K FR+ K +F + SL+ E +S P S NI R ++ +K+VAI L L +
Sbjct: 61 KEAYKKNFRLDKNSFFELVSLL-EPYIS-PDSYSPNI--RAITPDKKVAITLYYLKDSGT 116
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWP-DSNRMEEIKSKFEESFGLLNC 177
FG+ T S V + A+ + + P + +M E S+FE FG++
Sbjct: 117 LNMTANTFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQA 176
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G +D THI + P + D +C ++ YS+ VQ + D++ F+D+ WPG ++ +++
Sbjct: 177 FGCVDGTHISIVCPTNHSQDYFCYKQ-YYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVF 235
Query: 238 KFSGFFKLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
S + RL G + I+ ++++ Y++G YPLL + Y T + +
Sbjct: 236 SNSSINTNLRSS-RLPGTFQTITKNKIKVPCYLIGDPAYPLLPHCMKEYRTCKKNDEV-I 293
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
FNS+ R+ AF +LK W+IL+K M K+P++I C +LHN
Sbjct: 294 FNSMLRTARNPIECAFGRLKARWKILTKKM-DLKLEKIPTVIYACFILHNF 343
>gi|242087131|ref|XP_002439398.1| hypothetical protein SORBIDRAFT_09g005770 [Sorghum bicolor]
gi|241944683|gb|EES17828.1| hypothetical protein SORBIDRAFT_09g005770 [Sorghum bicolor]
Length = 535
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 144/323 (44%), Gaps = 48/323 (14%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIA 110
PS + F+ FR+S+ TF +C ++ +ED R + V ++VA+
Sbjct: 198 PSCPDAEFRRAFRMSRATFGALCDALVGAVAKEDTALR----------TAIPVRQRVAVC 247
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRMEE-IKSKF 168
L RLA+ E + FG+G ST + + AL + + WP+ + + ++F
Sbjct: 248 LWRLATAEPLREISRRFGLGISTCHSIVLQVYHALATVLRPTAISWPEPDSAAATVAARF 307
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDW------CDQENNYSMLVQGIVDHEMRFID 222
E + GL GA+ T + + P + + Q +YS+ VQ + D + F D
Sbjct: 308 EAASGLPGIIGAVYTTRVPVVTPKANVAAYYDRGLTDRRQRASYSVAVQAVADADGAFTD 367
Query: 223 IVTGW--PGGMNVSRLL---KFSGFFKLCEA-GQRLNGNVRISSEEVELREYIVGGVGYP 276
+ W PG ++ + +L SG L GQ + +VGG YP
Sbjct: 368 V---WIEPGSLSDAAILGRSALSGLLALVGGHGQE---------------QRLVGGTSYP 409
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
L+ W++ PY L+ + FN R +A A +LK WR L + + LPS
Sbjct: 410 LMDWMLVPYAHQNLTWTEHAFNERVARARGVARDAVRRLKARWRCLQR-RSEVKMQDLPS 468
Query: 337 IILVCCLLHNIIIDSGDQLHPDV 359
+I CC+LHN+ +G++L PD+
Sbjct: 469 MIAACCVLHNVCERAGEELDPDL 491
>gi|331211585|ref|XP_003307062.1| hypothetical protein PGTG_00012 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 392
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 84/298 (28%), Positives = 145/298 (48%), Gaps = 26/298 (8%)
Query: 63 FKYFFRVSKKTFDYICSLVR----EDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
F +FR+S+ F ++C +R +D + R G L+VE QVA+ L RL G
Sbjct: 59 FLEYFRMSRADFAWLCEELRGTLDQDHLRR---------GAPLTVEAQVAVGLYRLGHGA 109
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEA----LEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
+ V++ F +G+ T + T RF++A L R + +EIK FE G+
Sbjct: 110 TYVTISHVFNIGKETADKATGRFVKAVLKILRLRTISFPALDAHDEWDEIKESFESRHGI 169
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
+ GAID THI +++P + +++N S++ Q +VD + F D+ G G ++ +
Sbjct: 170 PDVVGAIDGTHIPLSMPPGDRWKGYINRKNWPSLVFQCVVDGDGNFRDVFGGGAGSIHDT 229
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
R+ F+ + G LN + S + +++G GYP ++ PY + L+
Sbjct: 230 RV------FRRSDLGVSLNRALGQPS-MIPTGAHLIGDAGYPSDVNVLVPYPSI-LAPEN 281
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
FN + ATR + +AF +LK +RIL R + + + C +LHN++ +G
Sbjct: 282 EYFNYIQSATRIVVEQAFGRLKNRFRILLTAQ-RANPIRARNTTFACMILHNLLNRTG 338
>gi|331251807|ref|XP_003338490.1| hypothetical protein PGTG_19974 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 387
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 149/295 (50%), Gaps = 20/295 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F FFR+S + F ++ +R+ L + G LSVE QVA+ L +LA G S ++
Sbjct: 56 FLEFFRMSVEDFRWLSDSLRDLLQL-----DLLRRGDPLSVEAQVAVGLYQLAHGSSYLT 110
Query: 123 VGVAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWP---DSNRMEEIKSKFEESFGLLNCC 178
+G F +G+ T + RF+ A L++ + +++P + + +EI + FE G+ N
Sbjct: 111 IGHVFNIGKETADKAAGRFVIAVLQQFRRVAIRYPALANQEQWDEISASFEAKHGIPNVV 170
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G ID THI + +PA D+ ++++ S++ Q +VD F ++ G PG M+ RL
Sbjct: 171 GTIDGTHIPLAVPAEDRWKDYINRKSWASIVFQCVVDGNGNFRNVSGGAPGSMHDGRL-- 228
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
F E G + ++ + + Y++G GYP ++ PY + + FN
Sbjct: 229 ----FWRSELGHSITPG-SMAPAMIPMGTYLIGDAGYPSNVRVLLPYPSIA-TPENEHFN 282
Query: 299 SLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ +TR + +AF +LK +RI L M RP + + + VC +LHN++ G
Sbjct: 283 FIQSSTRIIVEQAFGRLKNRFRILLHPQMARPVRAR--NNAFVCMILHNLLNKRG 335
>gi|449686160|ref|XP_004211088.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 381
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 145/291 (49%), Gaps = 11/291 (3%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E +K FR++K +F + L+ E +S P S NI R ++ +K VAI L L +
Sbjct: 61 KEAYKKNFRLNKNSFFELVYLL-EPYIS-PDSYSPNI--RAITPDKNVAITLYYLKDSGT 116
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWP-DSNRMEEIKSKFEESFGLLNC 177
FG+ T S V + A+ + + P + +M E S+FE FG++
Sbjct: 117 LNMTANTFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQA 176
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G +D THI + P + D +C ++ YS+ VQ + D++ F+D+ WPGG++ +++
Sbjct: 177 FGCVDGTHIPIVCPTNHSQDYFCYKQ-YYSLQVQAVCDYKGSFLDVKCMWPGGVHDAKVF 235
Query: 238 KFSGFFKLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
S + RL G + I+ ++++ Y++G YPLL + Y T + +
Sbjct: 236 SNSSINTNLRSS-RLPGTFQTITKNKIKVPCYLIGDPAYPLLPHCMKEYSTCKKNDEV-I 293
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
FNS+ R+ AF +L W+IL+K M K+P++I C +LHN
Sbjct: 294 FNSMLRTARNPIECAFGRLNARWKILTKKM-DLKLEKIPTVIYACFILHNF 343
>gi|148225801|ref|NP_001089350.1| uncharacterized protein LOC734400 [Xenopus laevis]
gi|62185695|gb|AAH92316.1| MGC115312 protein [Xenopus laevis]
Length = 371
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 131/262 (50%), Gaps = 22/262 (8%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
AL SG Q SV A G+ QS+ S+ ++ I+A+ K ++ +P D + +K F
Sbjct: 100 ALHFFTSGSFQNSVSTAGGITQSSFSRFLFQVIQAIVNLYKEYISFPNDPASLRAVKQSF 159
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
G N GAID TH+ ++ P+ + + ++++ +S+ +Q +V H M+ ID+V +P
Sbjct: 160 MSVAGFPNIIGAIDCTHVALSPPS-ENEYIYRNEKHFHSLNMQVVVSHNMKIIDVVAKFP 218
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G S +L SG + E G+ +G +++G Y L WL+TP N
Sbjct: 219 GSTEDSYILSQSGLHQRFENGEFGSG-------------WLLGDSMYGLKPWLMTPI-NN 264
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLL 344
+ + +N H ATRS+ + F LK + L K +++ P+ K+ I VCC+L
Sbjct: 265 PKTRAEKKYNHAHSATRSVIDRTFGMLKTRFGCLDKSKGVLLYSPE--KVCKIFFVCCIL 322
Query: 345 HNIIIDSGDQLHPDVALSDHHD 366
HNI ++ D + + + H +
Sbjct: 323 HNIAMNENDYMDINEEYAQHEE 344
>gi|166157907|ref|NP_001107363.1| uncharacterized protein LOC100135188 [Xenopus (Silurana)
tropicalis]
gi|163916031|gb|AAI57215.1| LOC100135188 protein [Xenopus (Silurana) tropicalis]
gi|213624369|gb|AAI71002.1| LOC100135188 protein [Xenopus (Silurana) tropicalis]
gi|213625629|gb|AAI71004.1| LOC100135188 protein [Xenopus (Silurana) tropicalis]
Length = 371
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 134/262 (51%), Gaps = 22/262 (8%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
AL ASG Q SV A G+ QS+ S+ ++ I+A+ K ++ +P D ++ +K F
Sbjct: 100 ALHFFASGSFQNSVSTAGGITQSSFSRFLFQVIQAIVNLYKEYISFPNDPASLKAVKQSF 159
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
G N GAID TH+ ++ P+ + + ++++ +S+ +Q +V + M+ ID+V +P
Sbjct: 160 LSIAGFPNVIGAIDCTHVALSPPS-ENEYIYRNEKHFHSLNMQVVVSNNMKIIDVVAKFP 218
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G S +L SG + E G+ +G +++G Y L WL+TP +N
Sbjct: 219 GSTEDSYILSQSGLHQRFENGEFGSG-------------WLLGDSMYGLKPWLMTPV-SN 264
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLL 344
+ + +N H ATRS+ + F LK + L K +++ P+ K+ I VCC+L
Sbjct: 265 PKTRAEKKYNHAHSATRSIIDRTFGMLKTRFGCLDKSKGVLLYSPE--KVCKIFFVCCIL 322
Query: 345 HNIIIDSGDQLHPDVALSDHHD 366
HNI ++ D + + + H +
Sbjct: 323 HNIAMNESDYIEVNEEYAQHEE 344
>gi|297723773|ref|NP_001174250.1| Os05g0184901 [Oryza sativa Japonica Group]
gi|53749284|gb|AAU90143.1| hypothetical protein [Oryza sativa Japonica Group]
gi|255676090|dbj|BAH92978.1| Os05g0184901 [Oryza sativa Japonica Group]
Length = 468
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 89/320 (27%), Positives = 147/320 (45%), Gaps = 34/320 (10%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
E F+ FR+ + FD +C ++ +ED R + V ++VA+ L RL
Sbjct: 128 EAEFRRAFRMPRAVFDKLCDDLAAAVAKEDTTLRA----------AIPVPQRVAVCLWRL 177
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
A+G+ V FG+G ST + + A+ ++WPDS+ S+F+ G+
Sbjct: 178 ATGDPLREVSRRFGLGISTCHSIILQVCAAITAVLTRVVRWPDSH--AAAASRFQALSGI 235
Query: 175 LNCCGAIDATHIIMTLP---AVQTSDDWCDQENN---YSMLVQGIVDHEMRFIDIVTGWP 228
GA+ HI + P A + D NN YS+ +Q +VD + F D+ G P
Sbjct: 236 PGVVGAVHTEHIPIVAPRENAGEYYDRRLTDRNNKATYSVAMQAVVDADGAFTDVCIGHP 295
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G ++ + +L S + CEAG L + +++VGG YPL SW++ PY
Sbjct: 296 GSLSDAAVLTKSALYARCEAGLLLGDDP----------QWLVGGASYPLTSWMLVPYAQP 345
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
L+ + N+ R+ AV AF +L+ + +LP+++ CC+LHN+
Sbjct: 346 NLTWAQERLNARVADARAAAVGAFRRLRAR-WRCLRRRAEVKLPELPNMLGACCVLHNLC 404
Query: 349 IDSGDQLHPDVALSDHHDSG 368
SG +L D+ + D G
Sbjct: 405 ERSGGELDADLLHDELVDDG 424
>gi|357612248|gb|EHJ67878.1| hypothetical protein KGM_13821 [Danaus plexippus]
Length = 357
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/296 (28%), Positives = 139/296 (46%), Gaps = 25/296 (8%)
Query: 63 FKYFFRVSKKT-FDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
FK +S+KT + I L + ++ GL +E +L I L +A E
Sbjct: 75 FKNRLILSRKTAYKLIDDLEKSGFIASHKFGLKPLEPKLC-----FYIFLSFIADTEPLT 129
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGA 180
+ F + S+ +V R + L + +KWP D N +E I ++ G+ N G
Sbjct: 130 PLANRFDISISSTFRVLRRVVAWLLTKLDDAIKWPQDFNDVETICEQYHFKTGISNILGV 189
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID THI + P + + ++C+ + +S+++Q +D +RF +I G PG N SR+L+ S
Sbjct: 190 IDCTHIKIEKP--RNAREYCNPKGYFSIVLQATIDANLRFTNIYCGEPGSSNCSRVLRKS 247
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTFNS 299
++ + ++VG GYP LSWL+ P+ N L++ FNS
Sbjct: 248 PLYQTATQNR---------DTLFPHNTFLVGHSGYPSLSWLVPPFRENKRLTSQQREFNS 298
Query: 300 LHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI---ILVCCLLHNIIIDSG 352
LH + R L+ KAF LK +R ++ R +P I I+ C+LHN +D
Sbjct: 299 LHASARKLSDKAFNLLKTKFR---RIKLFTVYRNIPFITDTIVAACILHNYCLDES 351
>gi|331232917|ref|XP_003329120.1| hypothetical protein PGTG_10860 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 360
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 131/264 (49%), Gaps = 17/264 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F +FR+S+ F ++C +RE LV P G LSVE QV + L R+ G + V+
Sbjct: 59 FVEYFRMSRADFQWLCDELRETLVQDPLR-----RGAPLSVEAQVGVGLYRVGHGATYVT 113
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRMEE---IKSKFEESFGLLNCC 178
+ F +G+ T + RF+ A+ + + + +PD + EE IK+ FE G+ +
Sbjct: 114 ISNVFNIGKETADKAFSRFVNAVLKVLRLRTISFPDLDAAEEWNEIKASFESCHGIPDVV 173
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID THI + +P + +++N S++ Q +VD F D+ G G ++ R+
Sbjct: 174 GAIDGTHIPLAMPPSDEWKGYINRKNWASLVFQCVVDGHGNFRDVFGGGAGSIHDGRV-- 231
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
F+ G LN + + + R +++G GYP ++ PY + + FN
Sbjct: 232 ----FQRSPIGNSLNRALGLPL-MIPPRTHLIGDAGYPSDVNILVPYPSIA-APENDYFN 285
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL 322
+ ATR + +AF +LK +RIL
Sbjct: 286 YIQSATRIVVEQAFGRLKNCFRIL 309
>gi|345496226|ref|XP_003427676.1| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 384
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/307 (27%), Positives = 144/307 (46%), Gaps = 23/307 (7%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
FK FR+S+ TF+Y+ L+R +L GL ++ +KQ+ I L L + +S S
Sbjct: 66 FKETFRMSRTTFEYLLLLIRPNL-----EGLNEFGNMPINPDKQLYITLYVLGTPDSYRS 120
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
V F VG++T + R + ++ + + ++WP + E + L GA+D
Sbjct: 121 VTTKFNVGKATAWRAVRRVVRSICQYRNYFIRWPSAREAAETSMRIARRRRLHGVIGAVD 180
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
THI + V S + +++ +S+ +Q I + ++ FI G PG ++ +R+ ++SG
Sbjct: 181 GTHIRIAALRVD-SQAYINRKGVHSIQLQVICNDKLEFIHCYAGLPGSVHDTRVFRYSGV 239
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTFNSLH 301
+ C + E +++G Y L + +I PY NG L+ FN +
Sbjct: 240 QQRC------------TDEYFPNNTHLLGDSAYTLQNHVIVPYRDNGHLTVEEVHFNHVL 287
Query: 302 EATRSLAVKAFLQLKGGWR-ILSKV-MWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDV 359
TR + ++ LK WR L K+ M R D +P I+ C+LHNI + D V
Sbjct: 288 SGTRMMVERSIGLLKVRWRYFLDKLPMRRTD--LIPYYIVCACVLHNICLKVEDTFEYPV 345
Query: 360 ALSDHHD 366
+ D D
Sbjct: 346 IIPDTID 352
>gi|432936771|ref|XP_004082271.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 439
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 139/298 (46%), Gaps = 21/298 (7%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+S++TF Y+C +R L + S + L VEK+VA+AL RLAS ++
Sbjct: 99 FRMSRETFFYLCEKLRPRLARQDTSFRLA-----LPVEKRVAVALWRLASNIEYRTISSL 153
Query: 127 FGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATH 185
FGVG+STV + A+ +L+ P+ +E+ FE +G +C AI H
Sbjct: 154 FGVGKSTVCRCVRDMCHAIVALLSCTYLQPPNEQELEDSARLFESYWGFPHCVAAIATLH 213
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
+ P+ + D+ + S+L Q V F D+ +PGG + + +L+ S +
Sbjct: 214 TAIITPS-NNALDYANPAGWLSVLSQVAVSGRGHFWDVCASFPGGTDPAEILQNSSLWAT 272
Query: 246 CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN--------GLSASMPTF 297
G L+ S +R ++G YPL SWL+ + L+ F
Sbjct: 273 ASEGG-LSPAQPPSFMGRAMRYVMLGEACYPLQSWLMKAFPEERSRRKNHAALTQRQQVF 331
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR--KLPSIILVCCLLHNIIIDSGD 353
N ++ +A L+L+ W+ LSK R D R +P++IL CC+LHN+ GD
Sbjct: 332 NQQLNRALRVSEEALLRLRARWQCLSK---RNDCRLDVVPTMILACCILHNMCESHGD 386
>gi|388491000|gb|AFK33566.1| unknown [Medicago truncatula]
Length = 245
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 119/248 (47%), Gaps = 22/248 (8%)
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNY 206
+ WPD N+M IK +FE FG+ N G++ THI + P + + +Q+ Y
Sbjct: 5 ITWPDENKMNLIKQEFEGLFGMPNVGGSMYTTHIPIIAPKNNVNSYFNKRQTQRNQKTAY 64
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELR 266
S+ VQG+VD + F D+ G+PG N ++L+ S ++ + + GN++ S
Sbjct: 65 SVTVQGMVDAKGVFTDVFLGYPGSYNDDQVLEKSVMYQ-----RAMTGNLKDS------- 112
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
++VG G+PL+ ++ PY L+ + FN E + L+ AF ++KG W L K +
Sbjct: 113 -WVVGNSGFPLMDGILVPYTHQNLTWTQHAFNEKVEDIQKLSKDAFAKVKGRWSCLQKRI 171
Query: 327 WRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRE 386
+LP ++ CC+LHNI +++ P D E K V R+
Sbjct: 172 -EVKIEELPGVLGACCVLHNICEMRNEKMDPAWNFELFDDEMVAENGVKSV--AAAQARD 228
Query: 387 NLEKHLQH 394
N+ L H
Sbjct: 229 NIAHDLLH 236
>gi|348510655|ref|XP_003442860.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 438
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 143/299 (47%), Gaps = 23/299 (7%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGESQVSVGV 125
FR++++TF Y+C +R L + N RL L VEK+VA+AL RLAS ++
Sbjct: 99 FRMNRETFFYLCDKLRPRLARQ------NTSFRLALPVEKRVAVALWRLASNIEYRTISA 152
Query: 126 AFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
FGVG+STV + A+ + +L+ P +E+ F +G +C AI
Sbjct: 153 LFGVGKSTVCRCVRDMCHAIVALLSSIYLRSPGEQELEDSAQLFLSHWGFPHCVAAIATL 212
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
H + P+ + D+ + S++ Q V + F D+ +PGG + + +L+ S +
Sbjct: 213 HTAIITPS-NNASDYANPAGWLSVMSQVAVSGQGHFWDVCASFPGGTDPAEILQNSSLWA 271
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFNSLHE- 302
G L+ + LR ++G YPL SWL+ Y E G +AS +
Sbjct: 272 TAAEGG-LSPAPPPTFMGKSLRYVLLGEACYPLQSWLMKAYPEEQGRTASRTALTEQQQL 330
Query: 303 ----ATRSLAV--KAFLQLKGGWRILSKVMWRPD--KRKLPSIILVCCLLHNIIIDSGD 353
R+L V +A L+L+ W+ LSK R D +P++IL CC+LHN+ GD
Sbjct: 331 FNGRLARALRVSQEALLRLRARWQCLSK---RNDCGLDVVPTMILACCILHNMCESHGD 386
>gi|328699306|ref|XP_003240898.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 395
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 139/296 (46%), Gaps = 25/296 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+ + KQ+ L LA+ +S SVG F +G+S++S +R I L A + + WP
Sbjct: 102 IPIVKQLLSVLWILATPDSYRSVGEKFDMGKSSLSVSFFRIINLLIMNASNVIIWPHGEE 161
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
ME K F + N GA+D T I + P Q ++ + ++ NY++ +Q I + E++F
Sbjct: 162 MERQKELFFRMASIPNVIGAVDGTFIPIKAPK-QDAEVYVTRKCNYAITLQAITNAELKF 220
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
D+ G+PG ++ +R+ + S F Q+ E+I+G YP L+W
Sbjct: 221 TDVFVGYPGSVSDTRIFRNSDIFINIMRNQKF---------YFPNDEHILGDKAYPPLTW 271
Query: 281 LITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKRKLPSII 338
+ PY G L+ + FN TR ++F G +R + M R D +PS +
Sbjct: 272 CVEPYINRGRLTEAQKNFNFNISRTRQTVERSFALFFGRFRRFKYLDMSRTD--FIPSTV 329
Query: 339 LVCCLLHNIIIDSGDQLHPDVALSDHHDSGYG---EQCCKQVDPMGRTTRENLEKH 391
L C+LHN+ + + D + D+ + G Q + +P TRE+ E H
Sbjct: 330 LAACVLHNLCLKNKS----DANIKDYIEEGMNFLRHQRSECANP----TRESSEDH 377
>gi|326675400|ref|XP_003200346.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 344
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 151/331 (45%), Gaps = 25/331 (7%)
Query: 30 KDPNSSDSDWWPSFWAKNSST-PGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSR 88
+DP ++ F+ + P + S + K +R SK++ Y+ L++ + +
Sbjct: 7 EDPVDVEAQIIRGFFRRERVIRPRIDVFSYPNDYLKERYRFSKESLVYLTRLLKPHIAN- 65
Query: 89 PPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER 148
+ G LS E ++IALR ASG +VG A VG++TV + AL+
Sbjct: 66 -----VTNRGSALSTESILSIALRFFASGHFLYNVGDAEHVGKATVCRAVRTVCLALKHL 120
Query: 149 AKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
+++P + IK++F GL N G +D THI + P++ D+ ++++ +S+
Sbjct: 121 LPTFVQFPGHKPLRVIKNEFHRVAGLPNVIGCVDGTHIPIKAPSINEG-DYVNRKSTHSI 179
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREY 268
VQ I + ++ WPG ++ +R+ + S LC+A Q+ N Y
Sbjct: 180 NVQVICEATQIITNVEAKWPGSVHDARIFRES---SLCQAFQQGQYN-----------GY 225
Query: 269 IVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR 328
++G GYP L +L+TPY N H TR+ LK ++ L +
Sbjct: 226 LLGDRGYPCLPYLMTPYPEPE-PGPQTRLNLAHSRTRAKVEMTIGILKSRFQCLRGLRVS 284
Query: 329 PDKRKLPSIILVCCLLHNIIIDSGDQLHPDV 359
P++ II+ C +LHNI G+ L P +
Sbjct: 285 PERAC--DIIVACVVLHNIATIRGESLPPCI 313
>gi|413949351|gb|AFW82000.1| hypothetical protein ZEAMMB73_374541 [Zea mays]
Length = 477
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 86/324 (26%), Positives = 145/324 (44%), Gaps = 31/324 (9%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
PS + F+ FR+S+ TF +C D + + + + V ++VA+ L RLA
Sbjct: 141 PSCPDAEFRRAFRMSRTTFGALC-----DALGGAVAKMDTALRTAIPVRQRVAVCLWRLA 195
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESFGL 174
+ E V FG+G ST + + AL + + WP + + ++FE +FGL
Sbjct: 196 TAEPLREVSRRFGLGISTSHSIVLQVYHALAAVLRPTVIYWPGPDSAATVVARFEAAFGL 255
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDW-C-----DQENNYSMLVQGIVDHEMRFIDIVTGW- 227
GA+ T + + P + + C +Q+ +YS+ VQ + D + F D+ W
Sbjct: 256 AGIVGALYTTRVPIVAPKANVAAYYDCGLTERNQKASYSVAVQAVADADGAFTDV---WI 312
Query: 228 -PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
PG ++ + +L S L G + +VGG YP + W++ PY
Sbjct: 313 EPGSLSDAAILGRSALSGLLAGGHGQG-------------QRLVGGTSYPPMDWMLVPYA 359
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
L+ + N A R +A A +LK WR L + + LPS++ CC+LHN
Sbjct: 360 HQNLTWTEHASNERVAAARGVARGAVRRLKARWRCLQRRS-EVKMQDLPSMVAACCVLHN 418
Query: 347 IIIDSGDQLHPDVALSDHHDSGYG 370
+ +G++L P++ D D G
Sbjct: 419 VCERAGEELDPELMQYDLDDDDGG 442
>gi|52353755|gb|AAU44321.1| hypothetical protein [Oryza sativa Japonica Group]
gi|125551096|gb|EAY96805.1| hypothetical protein OsI_18732 [Oryza sativa Indica Group]
Length = 483
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 149/309 (48%), Gaps = 38/309 (12%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLS---VEKQVAIALRRLAS 116
E F+ FR+ + FD +C +DL + + ++ L S V ++VA+ L RLA+
Sbjct: 141 EADFRRAFRMPRAVFDKLC----DDLAA----AVAKVDTTLRSAIPVPQRVAVCLWRLAT 192
Query: 117 GESQVSVGVAFGVGQST----VSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF 172
G+ V FG+G ST + QV L R ++WPDS+ S+F+
Sbjct: 193 GDPLREVSRRFGLGISTCHNIIVQVCAAITAVLLTRV---VRWPDSH--AAAASRFQALS 247
Query: 173 GLLNCCGAIDATHIIMTLP---AVQTSDDWCDQENN---YSMLVQGIVDHEMRFIDIVTG 226
+ GA+ H + P A + D NN YS+ +Q +VD + F D+ G
Sbjct: 248 WIPGVVGAVHTEHFRIVAPREHAGKYYDHRLTGRNNKATYSVAMQAVVDADGAFTDVCIG 307
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
PG ++ + +L S + CEAG L G+ ++ ++ ++VGG YPL SW++ PY
Sbjct: 308 HPGSLSDAAILAKSALYARCEAGLLL-GHDKLGWQQPL---WLVGGASYPLTSWMLVPYT 363
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP---SIILVCCL 343
L+ + N+ R+ AV AF +L+ W+ L + + KLP +++ CC+
Sbjct: 364 QPNLTWAQDRLNARVADARAAAVGAFRRLRARWQCLRRA-----EVKLPELANMLGACCV 418
Query: 344 LHNIIIDSG 352
LHN+ SG
Sbjct: 419 LHNLCERSG 427
>gi|449671109|ref|XP_004207429.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 382
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 143/291 (49%), Gaps = 11/291 (3%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E +K FR+ K +F + SL+ E +S P S NI R ++ +K+VAI L L +
Sbjct: 61 KEAYKKNFRLDKNSFFELVSLL-EPYIS-PDSYSPNI--RAITPDKKVAITLYYLKDSGT 116
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWP-DSNRMEEIKSKFEESFGLLNC 177
FG+ T S V + A+ + + P + +M E S+FE FG++
Sbjct: 117 LNMTANTFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQA 176
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G +D THI + P + D +C ++ Y + VQ + D++ +D+ WPG ++ +++
Sbjct: 177 FGCVDGTHIPIVCPTNHSQDYFCYKQ-YYLLQVQAVCDYKGSVLDVECMWPGSVHDAKVF 235
Query: 238 KFSGFFKLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
S + RL G + I+ ++++ Y++G YPLL Y T + +
Sbjct: 236 SNSSINTNLRSS-RLPGTFQTITKNKIKVPCYLIGDPAYPLLPHCTKEYSTCKKNDEV-I 293
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
FNS+ R+ AF +LK W+IL+K M K+P++I C +LHN
Sbjct: 294 FNSMLRTARNPIECAFGRLKARWKILTKKM-DLKLEKIPTVIYACFILHNF 343
>gi|410898682|ref|XP_003962826.1| PREDICTED: putative nuclease HARBI1-like [Takifugu rubripes]
Length = 435
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 142/300 (47%), Gaps = 25/300 (8%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGESQVSVGV 125
FR+S++TF Y+C +R L + + RL L VEK+VA+AL RLAS ++
Sbjct: 99 FRMSRETFFYLCDKLRPRLTRQ------DTTFRLALPVEKRVAVALWRLASNVEYRTISA 152
Query: 126 AFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
FGVG+STV + A+ + +L+ P + + S+G +C AI
Sbjct: 153 LFGVGKSTVCRCVRDMCHAIVALLSSIYLRPPSGQELHDSAQHCLSSWGFPHCVAAIATL 212
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
H + P+ + D+ + S+L Q +V+ +F D+ +PGG + + +L+ S +
Sbjct: 213 HTAIITPS-NNASDYANPAGWLSVLSQVVVNGSGQFWDVCASFPGGTDPADILQNSSLWA 271
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG--------LSASMPT 296
G + + + LR I+G YPL SWL+ Y G L+
Sbjct: 272 TAAEGGLSPSPLPMFTGR-PLRYVILGEACYPLQSWLMKAYPEEGGRRGRKATLTEPQCL 330
Query: 297 FN-SLHEATRSLAVKAFLQLKGGWRILSKVMWRPD--KRKLPSIILVCCLLHNIIIDSGD 353
FN L A R+ + L+L+ W+ LSK R D +P++IL CC+LHN+ GD
Sbjct: 331 FNQQLRRALRA-PEETLLRLRARWQCLSK---RNDCGLDVVPTMILACCILHNMCESHGD 386
>gi|326680520|ref|XP_003201537.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 494
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/295 (27%), Positives = 137/295 (46%), Gaps = 24/295 (8%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
K +R SK++ Y+ L++ + + + G LS E ++IALR ASG +
Sbjct: 41 LKERYRFSKESLVYLTRLLKPHIAN------VTNRGSALSTESILSIALRFFASGHFLYN 94
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
VG A VG++TV + AL+ +++P + IK++F GL N G +D
Sbjct: 95 VGDAEHVGKATVCRAVRTVCLALKHLLPTFVQFPGHKPLRVIKNEFHRVAGLPNVIGCVD 154
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
THI + P++ D+ ++++ +S+ VQ I + ++ WPG ++ +R+ + S
Sbjct: 155 GTHIPIKAPSI-NEGDYVNRKSTHSINVQVICEATQIITNVEAKWPGSVHDARIFRESSL 213
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
+ + GQ NG Y++G GYP L +L+TPY N +
Sbjct: 214 CQAFQQGQ-YNG-------------YLLGDRGYPCLPYLMTPY-PEPEPGPQTRLNLAYS 258
Query: 303 ATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
TR+ LK ++ L + P+ + II+ C +LHNI G+ L P
Sbjct: 259 RTRAKVEMTIGILKSRFQCLRGLRVSPE--RACDIIVACVVLHNIATIRGESLPP 311
>gi|302771816|ref|XP_002969326.1| hypothetical protein SELMODRAFT_91574 [Selaginella moellendorffii]
gi|300162802|gb|EFJ29414.1| hypothetical protein SELMODRAFT_91574 [Selaginella moellendorffii]
Length = 158
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 89/158 (56%), Gaps = 9/158 (5%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGV 125
FFRV ++ F IC LV PS NI R + V KQV IAL RLASG+S S+
Sbjct: 10 FFRVFEEQFMTICKLV-------APSMQKNIPSRYVPVGKQVVIALWRLASGDSFASLAE 62
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATH 185
FGV ++TV + +F + L WP + + +K+ F+ G NCCGAID TH
Sbjct: 63 HFGVSKTTVWKYCQKFSNTTFQHLGQFLAWPAN--LSAVKTGFKSLCGFPNCCGAIDCTH 120
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ LP + D+ +++ +YS+++Q IVD E RF++I
Sbjct: 121 FEVELPGNAFASDYYNKDKDYSIVMQAIVDSEARFLEI 158
>gi|357478393|ref|XP_003609482.1| hypothetical protein MTR_4g116180 [Medicago truncatula]
gi|355510537|gb|AES91679.1| hypothetical protein MTR_4g116180 [Medicago truncatula]
Length = 494
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 153/340 (45%), Gaps = 42/340 (12%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+S +F ++ L+ L S P ++ + +A A+ RLA G S SV
Sbjct: 103 FRMSNPSFFHLLDLLSPTLTSSIPQ---------ITPDCALAAAIFRLAHGASYNSVARR 153
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKW-PDSNRMEEIKSKFEESFGLLNCCGAIDATH 185
FG+ S + + +A+ + + + DS+R+ + F L NC G
Sbjct: 154 FGISPSDACRAFFTVCKAVNDNLGNLFELRTDSDRVV-VGFGFSS---LPNCFG------ 203
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK- 244
I+ L + ++ + N +LVQ +VD E RF+D+ +GWP M +L S +
Sbjct: 204 -ILGLAGFEIDEEILGK--NGFLLVQALVDSEGRFLDVSSGWPNSMKPETILHESKLYHG 260
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY----ETNGLSASMPTFNSL 300
+ E+ + L G S+ + +Y++G +PLL WL+TPY E +G S++ FNS
Sbjct: 261 VVESRELLQGPSYKLSDGSLIPQYVLGDSCFPLLPWLLTPYSRGNEEDGFSSAEIAFNST 320
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK--------LPSIILVCCLLHNIIIDSG 352
H L AF +L+ W++LS D RK LP +++ CLLHN +I
Sbjct: 321 HSRAMGLFGDAFGRLRTRWQLLS------DSRKWKRGCVEYLPFVVVTGCLLHNFLIKCN 374
Query: 353 DQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
D L D +S G VD R+ L HL
Sbjct: 375 DPLLRDKGVSCVEKEGDVLASDGVVDESAVRIRDALALHL 414
>gi|290462281|gb|ADD24188.1| nuclease HARBI1 [Lepeophtheirus salmonis]
Length = 439
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 145/312 (46%), Gaps = 23/312 (7%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW F + G T+P EE K+F R+++ TF + +R L+S + +I
Sbjct: 89 DWWEKFVS------GFTLP---EEWLKHF-RMTQMTFRILTDELRPYLISSNETAIIK-- 136
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALE-ERAKHHLKWP 156
KQ+A+ L L S + G+ FGV Q++V + + +A+ ++LKWP
Sbjct: 137 ----DPAKQMALTLYTLTQNISMMESGLLFGVSQNSVVNIVKKVCKAISLHLGPNYLKWP 192
Query: 157 -DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
D N +EE+ FE G C GAID I +++ + + YS+ +Q I D
Sbjct: 193 SDQNEVEELVRCFEREHGFPQCIGAIDNFFI----QTRRSNSKFLNSLGFYSLNIQAICD 248
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
++ F+D++ PG ++V+ + S K +G + + ++ + ++G Y
Sbjct: 249 YKYCFMDVMADSPGSVHVAEIFFNSSINKSFMSGNVSSFSRKLIPNTRPVPILLIGDSCY 308
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
PL+ L+ Y G + F S+ + + A +LK +R L + P LP
Sbjct: 309 PLMPHLMIEYPLGGSNEKEKHFGSILRKAKEVIRCALYRLKARFRFLGRSADIPIT-DLP 367
Query: 336 SIILVCCLLHNI 347
S+I C +LHN+
Sbjct: 368 SLIRACFILHNV 379
>gi|413949349|gb|AFW81998.1| hypothetical protein ZEAMMB73_083863 [Zea mays]
Length = 475
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 83/318 (26%), Positives = 143/318 (44%), Gaps = 27/318 (8%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
PS + F+ FR+S+ TF +C D + + + + V ++VA+ L RLA
Sbjct: 143 PSCPDAEFRRAFRMSRATFGALC-----DALGGAVAKMDTALRTAIPVRQRVAVCLWRLA 197
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESFGL 174
+ E V FG+G ST + + AL + + WP + + ++FE +FGL
Sbjct: 198 TAEPLREVSRRFGLGISTSHSIVLQVYHALAAVLRPTVIYWPGPDSAATVAARFEAAFGL 257
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDW-C-----DQENNYSMLVQGIVDHEMRFIDIVTGWP 228
GA+ T + + P + + C +Q+ +YS+ VQ + D + F ++ G P
Sbjct: 258 AGIVGALYTTRVPIVAPKANVAAYYDCGLTERNQKASYSVAVQAVADADGVFTNVWIG-P 316
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G ++ + +L S L G + +VGG YP + W++ PY
Sbjct: 317 GSLSDAAILGRSALSGLLAGGHGQG-------------QRLVGGTSYPPMDWMLVPYAHQ 363
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
L+ + N A R +A A +LK WR L + + L S++ CC+LHN+
Sbjct: 364 NLTWTEHASNERVAAARGVARGAVRRLKARWRCLQRRS-EVKMQDLHSMVAACCVLHNVC 422
Query: 349 IDSGDQLHPDVALSDHHD 366
+G++L P++ D D
Sbjct: 423 ERAGEELDPELMQYDLDD 440
>gi|322786153|gb|EFZ12759.1| hypothetical protein SINV_12000 [Solenopsis invicta]
Length = 313
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 135/296 (45%), Gaps = 24/296 (8%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGESQV 121
FK FR+ +TF+ + L+ L + + GR +S EKQ+ IA+ +++ +S
Sbjct: 1 FKEHFRMFPETFEEVLQLIGPGL-----KAINALPGRKPISAEKQLLIAIWFMSTPDSYR 55
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
SV FGVG++T + R AL A ++WP + KFE+ G N G I
Sbjct: 56 SVSTKFGVGKATAFRALRRVTYALHCIAPQFIQWPKGAIATNVMRKFEQVCGFPNVIGCI 115
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK--- 238
D THI + P + + +++ +S+ VQ + + F G G ++ +R+ +
Sbjct: 116 DGTHIKIRAPK-EDPVSYINRKGFHSINVQVVCESRGLFTHCYAGHVGSVHDARVFRNSP 174
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTF 297
+ F +L E N +I+G Y + + P+ NG L+ + F
Sbjct: 175 VADFLQLPETYFPDN-------------SHIIGDAAYSIHPHCMVPFRDNGHLTNAQKNF 221
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
N +TR +A LK +RIL + + RK+P IL CC+LHNI I D
Sbjct: 222 NYCLSSTRMTIERAIGHLKVRFRILLDCLPLTNIRKIPEFILACCVLHNICILKND 277
>gi|290562609|gb|ADD38700.1| nuclease HARBI1 [Lepeophtheirus salmonis]
Length = 439
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 152/338 (44%), Gaps = 43/338 (12%)
Query: 27 AAPKDPNSS----------DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDY 76
++PK P + DWW F + G T+P EE K+F R+++ TF
Sbjct: 68 SSPKKPQAQIVPLIHLSEPQDDWWEKFVS------GFTLP---EEWLKHF-RMTQMTFRI 117
Query: 77 ICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQ 136
+ +R L+S + +I KQ+A+ L L S + G+ FGV Q++V
Sbjct: 118 LTDELRPYLISSDETAIIK------DPAKQMALTLYTLTQNISMMESGLLFGVSQNSVVN 171
Query: 137 VTWRFIEALE-ERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
+ + +A+ +LKWP D N +EE+ FE G C GAID I +
Sbjct: 172 IVKKVCKAISLHLGPKYLKWPSDQNEVEELVRCFEREHGFPQCIGAIDNFFI----QTRR 227
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG 254
++ + + YS+ +Q I D++ F+D++ PG ++V+ + FF ++G
Sbjct: 228 SNSKFLNSLGFYSLNIQAICDYKYCFMDVMADSPGSVHVAEI-----FFNSSINKSFMSG 282
Query: 255 NVRISSEEV-----ELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAV 309
NV S ++ + ++G YPL+ L+ Y G + F S+ + +
Sbjct: 283 NVSSFSRKLIPNTRPVPILLIGDSCYPLMPHLMIEYPLGGSNEKEKHFGSILRKAKEVIR 342
Query: 310 KAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
A +LK +R L + P LPS+I C +LHN+
Sbjct: 343 CALYRLKARFRFLGRSADIPIT-DLPSLIRACFILHNV 379
>gi|388500104|gb|AFK38118.1| unknown [Medicago truncatula]
Length = 162
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 52/107 (48%), Positives = 78/107 (72%), Gaps = 3/107 (2%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
+S+KTF+YICSLV++D +++ S G+ +S+ QVA+ALRRL SG+S V++G +FG
Sbjct: 56 ISRKTFEYICSLVKDD-IAKKSSHFSFSNGKPVSLTDQVAVALRRLGSGDSFVTIGDSFG 114
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ STVSQVTWRF+E++EER HHL+WP + EE+ S +F ++
Sbjct: 115 LSHSTVSQVTWRFVESMEERGLHHLQWPSTQ--EEMNSIKSTNFAIM 159
>gi|241730710|ref|XP_002413838.1| transposase, putative [Ixodes scapularis]
gi|215507654|gb|EEC17146.1| transposase, putative [Ixodes scapularis]
Length = 413
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 145/304 (47%), Gaps = 14/304 (4%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
T+P E F+ FR+ + TF Y+ S+ + + R + + + + +EK+VAIAL R
Sbjct: 88 TLPYFPESTFRENFRLDRSTFRYVVSVC--ECMRRNNTNM----RQAIPLEKRVAIALYR 141
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRMEEIKSKFEESF 172
LA+ +V FGV +S+V+ + F E L +R + +K+P N + E +F
Sbjct: 142 LATSAEDRTVANLFGVSRSSVNIIFREFCEVLVQRIEPRFVKFPRPNDLAEHLRQFAAVA 201
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
G GA+D H+ + P S D+ + + YS ++ + DH RF+ G PG +
Sbjct: 202 GFPQGVGALDGCHLEVCPPKDHDS-DYHNYKGWYSTILLAVPDHAYRFLYTNVGSPGRNH 260
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
S + + S + AG + + E + + ++ +PL ++ P+ G S
Sbjct: 261 DSAVFQRSRLPGVL-AGDLFRSEAK-TFEGISVGPVLLADQAFPLQCHMMKPFPQPG-SV 317
Query: 293 SMPT--FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
PT FN + R + AF +LK +RIL K + D + +I+ CC+LHNI
Sbjct: 318 GSPTRAFNYRLSSARRVVENAFGRLKARFRILHKGL-ELDIESVNTIVRACCVLHNICEQ 376
Query: 351 SGDQ 354
D+
Sbjct: 377 LNDR 380
>gi|294462202|gb|ADE76652.1| unknown [Picea sitchensis]
Length = 427
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 144/304 (47%), Gaps = 13/304 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWPDSN 159
+ + +A+ L RL+ G S +V + V+++T + + ++++ P+
Sbjct: 127 MPLHSALAMVLHRLSQGHSARAVASQYKADPWMVAKITNTVTRVISTKVYPYYIEIPNRQ 186
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ +I F++ GL N CGAID HI + C + N Y++++Q + DH
Sbjct: 187 NLLQIIQGFKDLTGLPNMCGAIDGNHIKLHKKPNNEFMYKC-RHNFYAVVLQAVSDHRKI 245
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
F D+ PGG + S + S F + Q L +V I+ +LR Y+ G YPLLS
Sbjct: 246 FWDVCVRAPGGTDDSNHFRESSLFNKLTSEQVLMDSV-ITIRGNQLRPYLAGDWCYPLLS 304
Query: 280 WLITPYETNGL-SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSII 338
+L+ P+ NG + + F++ RS+ +A LKG W++L + P I
Sbjct: 305 FLLVPFSPNGSGTPAQNMFDAALMKGRSVVEQATGLLKGRWKMLQDL--NVGLNHAPQTI 362
Query: 339 LVCCLLHNIIIDSGDQLHPD-VALSDHHDSGYGEQCC---KQVDPMGRTTRENLEKHLQH 394
+ CC+LHN+ G+ P+ + D ++G + K ++ G T+R+ L + LQ
Sbjct: 363 VACCVLHNLCQLFGE---PEPLMCRDPRENGPAARTIDSDKSINYFGETSRQALIEDLQE 419
Query: 395 NQEK 398
++
Sbjct: 420 RHQR 423
>gi|270012680|gb|EFA09128.1| hypothetical protein TcasGA2_TC015989 [Tribolium castaneum]
Length = 358
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 137/299 (45%), Gaps = 21/299 (7%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+EE FK FR SK T ++ L+ DL P + + +S ++ I +R LA+G
Sbjct: 43 DEEEFKMRFRFSKATILWLHELIGHDL--EPTTR----RRKSISAINKILITMRYLATGS 96
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
Q VG V +STV V I+ + + +K P+ + ++ KF +
Sbjct: 97 FQQLVGDTVAVHKSTVCVVIKSVIQKIAQLKPQFIKMPNREELHNVQLKFYRKRRMPRVI 156
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID +H+ + P ++ + +++ +S+ VQ + D +++ +IV WPG ++ S +
Sbjct: 157 GAIDCSHVRIESPGGPNAEIFRNRKGFFSINVQAVCDADLQIRNIVARWPGSVHDSTIFN 216
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S E G+ NG +++G GY +L+TP N +A+ +N
Sbjct: 217 DSSLCAHLERGEYENG-------------FLLGDSGYACRPFLLTPV-LNPRTAAEEAYN 262
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
H TR+ + F LK + LS + R + I+ C +LHNI I + D P
Sbjct: 263 LSHRTTRNAIERCFGVLKRRFPCLS-LGLRTKMNTTLATIVACAVLHNIAIFTNDNEPP 320
>gi|331226844|ref|XP_003326091.1| hypothetical protein PGTG_07921 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 304
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 128/261 (49%), Gaps = 23/261 (8%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F +FR+S+ F +IC +R+ L G L+VE QVA+ L RL G + V+
Sbjct: 59 FIEYFRMSRADFHWICDELRDTLAQDQLR-----RGAPLTVEAQVAVGLYRLGHGATYVT 113
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRMEE---IKSKFEESFGLLNCC 178
+ F +G+ T + + RF+ A+ + + + +PD + +E I+ F G+
Sbjct: 114 ISHVFNIGKETADKASGRFVNAVLKVLRLRAISFPDLDAHDEWVEIEDSFASRHGIPGVV 173
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID TH+ + +P + + +++N S++ Q +VD E F D+ G G ++ SR+ +
Sbjct: 174 GAIDGTHVPLAMPPLDKWKGYINRKNWPSLVFQCVVDGEGNFRDVFGGGAGSIHDSRVFR 233
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT-- 296
S C Q L+ + I +Y++G GYP ++ PY S + P
Sbjct: 234 RSPLG--CSLNQALDLPLMIPRS-----KYLIGDAGYPSNVNILVPYP----SIAAPENE 282
Query: 297 -FNSLHEATRSLAVKAFLQLK 316
FN + ATR + ++F +LK
Sbjct: 283 WFNYVQSATRIVVEQSFGRLK 303
>gi|291227593|ref|XP_002733771.1| PREDICTED: transposase, putative-like, partial [Saccoglossus
kowalevskii]
Length = 214
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 111/221 (50%), Gaps = 17/221 (7%)
Query: 95 NIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLK 154
+I R LS+EK++ L L + ES V FG+G+S++ V +AL + ++
Sbjct: 9 SIPERSLSIEKKMLSTLWVLGNQESYRGVADRFGIGKSSLHYVVMTVCQALVAKQSDYIC 68
Query: 155 WPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV 214
WP +++I F + G GA+D THI + P+ + + +++ S+ +Q +
Sbjct: 69 WPKGIEVQQICESFRQKTGFPGVIGAVDGTHIYIPGPS-HHRNSYINRKGFPSIQLQAVC 127
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR-ISSEEVELREYIVGGV 273
D +RF D+ TGWPG +N +R+ K N VR + E+ +++G
Sbjct: 128 DSNLRFTDVYTGWPGSVNDARVFK--------------NSPVRNVLHNELPPNLHLLGDS 173
Query: 274 GYPLLSWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFL 313
Y L ++++TPY NG L+A FN H +TR +AF+
Sbjct: 174 AYALSTYVLTPYRDNGHLNAVEKQFNKYHSSTRVDIERAFV 214
>gi|449443271|ref|XP_004139403.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449494034|ref|XP_004159428.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 447
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 128/299 (42%), Gaps = 50/299 (16%)
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEE---SFG 173
G S +VG FG+ + + + +A+ E+ H L E++S + FG
Sbjct: 169 GASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLL---------ELRSDIDRIVVGFG 219
Query: 174 ---LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NCCG + + N S+LVQ +VD E RF+D+ GWP
Sbjct: 220 WISLPNCCGVLGLRRFGFE-----------GELKNGSLLVQALVDAEGRFLDVSAGWPSS 268
Query: 231 MNVSRLLKFSGFFKLCE-AGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY---- 285
M + +L+ S + E + + L G V E + +Y++G +PLL WL+TPY
Sbjct: 269 MKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELN 328
Query: 286 ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK-LPSIILVCCLL 344
E + FNS H +L AF +L+ W++LSK W+ R P IIL CLL
Sbjct: 329 EEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKP-WKEGCRDFFPFIILTGCLL 387
Query: 345 HNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV-----------DPMGRTTRENLEKHL 392
N +I ++ L + D G C + D G+ R+ L HL
Sbjct: 388 QNFLIKCSEK------LDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHL 440
>gi|224144073|ref|XP_002325176.1| predicted protein [Populus trichocarpa]
gi|222866610|gb|EEF03741.1| predicted protein [Populus trichocarpa]
Length = 421
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 163/347 (46%), Gaps = 40/347 (11%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
D ++ + FFR+S+ F + SL+ LV PS + E +A AL RLA G
Sbjct: 96 DFDQRWVQFFRLSRPNFYRLLSLLSPSLVFFLPS--------TIPPETALASALFRLAHG 147
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFI-EALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
+V FG+ S S + + + +A+ ++ ++ +E I F L N
Sbjct: 148 ACYKTVARKFGLDSSEASCLAFYSVCKAVNDKLGDLFEF--RRDLERIVVGFGW-ISLPN 204
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQE---NNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
CCGA+ + D E N S+LVQ +VD E RF+DI GWP M
Sbjct: 205 CCGALGFGRFGV------------DGEVFGRNGSILVQALVDSEGRFLDISAGWPSSMEP 252
Query: 234 SRLLKFSGFFKLCEAGQR---LNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNG 289
+ + + + E + L G + S+ + +Y++G +PLL WL+TPY E +
Sbjct: 253 VAIFRQTRLYLGVEDSRESELLKGPTYMLSDGCSIPQYVMGDSCFPLLPWLLTPYSEQDS 312
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK-LPSIILVCCLLHNII 348
++ FN H L AF ++K W++L++ W+ D + LP +I++ C+LHN +
Sbjct: 313 FGSAEREFNVAHSRAMKLVSTAFGRVKARWQLLAR-RWKEDSLEFLPFVIVMGCVLHNFL 371
Query: 349 IDSGDQLHPDVALSDHHDSGYGEQCC---KQVDPMGRTTRENLEKHL 392
I G+ L P+ ++ GE+ + D G+T R+ L +HL
Sbjct: 372 IKWGEPL-PEESVGGCLK---GEELLVFEGEGDETGQTIRDALAQHL 414
>gi|346468107|gb|AEO33898.1| hypothetical protein [Amblyomma maculatum]
Length = 414
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 146/305 (47%), Gaps = 11/305 (3%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
T+P+ E+ FK FRVS+ TF YI R L+ R + + +S+ K+VA+ L +
Sbjct: 76 TLPNLGEQSFKQSFRVSETTFKYIVDSCRP-LMERQATNM----REAVSIVKRVAVGLYK 130
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESF 172
L S ++ F +G+STV+ + F E + E + +K + + + +F+
Sbjct: 131 LCSSAEDRTIAHLFDLGRSTVNTIYREFCETVVEVLEEQWVKMMSAGDIADHIREFQAVC 190
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
G GA+D H+ ++ P SD + + + YS+++ +VDH+ RF + G G +
Sbjct: 191 GFPQAVGALDGCHLPVSPPKNHASDYY-NYKGWYSVILLAVVDHKYRFRYVNVGSXGRCH 249
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
S +L+ S ++ E + ++ + +V I+ +PL L+ P+ A
Sbjct: 250 DSHVLQSSTLPRIIEGPSFMRPSLTVGGTDVP--PLILCDQAFPLTPNLMKPFPRGNTEA 307
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ TFN +R + AF +LK +R + K M + +P ++ CC+L+NI
Sbjct: 308 EL-TFNYQLSKSRRIVENAFGRLKARFRYVMKRM-ECKVKNVPVLVRACCVLNNICEHFN 365
Query: 353 DQLHP 357
D + P
Sbjct: 366 DPILP 370
>gi|449688839|ref|XP_004211862.1| PREDICTED: uncharacterized protein LOC101235141 [Hydra
magnipapillata]
Length = 310
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/313 (30%), Positives = 147/313 (46%), Gaps = 17/313 (5%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
EE FK FRV++ TF++ LV E P G R +SV K+V +AL LAS
Sbjct: 2 EECEFKEHFRVNRNTFNF---LVNE---LHPHLGKTTTTMREPISVVKRVTVALHYLASC 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEESFGLL 175
E V FG+G+ST + + + FI A+ + ++K+P S + + FE G
Sbjct: 56 EEYRVVTSLFGIGKSTANLIVYEFINAVNDILLPKYVKFPLSVENLNKHSRDFEAILGFP 115
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C GA+D HI ++ P Q + + + + YS+++ +VD RFI G PG N S
Sbjct: 116 QCIGAVDGRHIPISAPKDQ-AISYYNYKGWYSIVLFAVVDCRYRFIYTSVGSPGRNNDSY 174
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASM 294
+L+ S +A LN + + + + ++G +PL L+ PY N LS
Sbjct: 175 ILQNSSL----KATLELNLFNKCCKDSL-VPLCLIGDSVFPLTLHLLKPYPENLVLSEIQ 229
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
FN + R + AF ++K +R++ K M D I+ C LHNI D
Sbjct: 230 KNFNKILCGARRVVENAFGRVKARFRVVCKRM-ECDINFATRIVNACVTLHNICKYYDDI 288
Query: 355 LHPDVALSDHHDS 367
+ + + H+DS
Sbjct: 289 IIIESLMHHHYDS 301
>gi|357129640|ref|XP_003566469.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 486
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 159/367 (43%), Gaps = 45/367 (12%)
Query: 4 PKKSKSKKTKKVTKKLKKSKSTSAAPKDPN--SSDSDWWPSFWAKNSSTP---GATIPSD 58
P + S + V +K S S + + DP ++ W + ST + P
Sbjct: 97 PDAAGSSRPAAVRRKCSPSSSPAHSGGDPAPPAARGGGGRRVWVRERSTEWWDRLSDPLC 156
Query: 59 EEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
+ F+ FR+S+ TFD +C ++ +ED R + V ++VA+ L R
Sbjct: 157 PDAEFRAAFRMSRATFDLLCESLSGAVAKEDTALR----------AAIPVHQRVAVCLWR 206
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESF 172
LA+GE V FG+G ST + + AL ++WP E + F+
Sbjct: 207 LATGEPLREVSRRFGLGISTCHNIVLQVCAALTAVLLPTAVRWP-----SEDNAGFQAVS 261
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWC------DQENNYSMLVQGIVDHEMRFIDIVTG 226
G+ GA+ H+ + P + +++ +YS+ +Q +VD + F D+ G
Sbjct: 262 GIPGVVGAVCTGHVPIGPPKENPGEYLNRRLTEKNKKASYSVALQAVVDADGAFTDVCIG 321
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
PG ++ + +L+ S CEAG L G R +VGG PL W++ PY
Sbjct: 322 LPGSLSDAAVLERSALRARCEAG--LLGGGRC----------LVGGASLPLADWMLVPYA 369
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
L+ + FN +A R +A AF +L WR L + P L +++ CC+LHN
Sbjct: 370 HRNLTWAQHAFNGRVDAARGVARAAFRRLAARWRCLQRRT-EPKLPDLHNMLGACCVLHN 428
Query: 347 IIIDSGD 353
SG+
Sbjct: 429 FCERSGE 435
>gi|340375949|ref|XP_003386496.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 435
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 91/370 (24%), Positives = 169/370 (45%), Gaps = 25/370 (6%)
Query: 5 KKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFK 64
KK + T + +KK +S P + + WW +F G I D+E +K
Sbjct: 55 KKFHTLSTPNSGRIVKKRRSLKKYWIKPGRT-TIWWDNF------VNGVVI--DQE--WK 103
Query: 65 YFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG 124
FR++K F +C +R + + + + + + E+QVA+ L L+
Sbjct: 104 DNFRMNKANFFKLCDELRP-FIEKKSTNM----RQCIETERQVALTLYYLSDEGCLRKTA 158
Query: 125 VAFGVGQSTVSQVTWRFIEALE-ERAKHHLKWPDSNRMEEIKSK---FEESFGLLNCCGA 180
AFG+ +S+ S V R + ++K P + EE+ K F +FG+ C GA
Sbjct: 159 NAFGIARSSASVVIRRVCYIISIHLGPRYIKLPLTE--EEVNEKVTGFYIAFGVPQCIGA 216
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID THI + P +D + +++N YS+ VQ DH+ F+D+V WPG ++ +R+ S
Sbjct: 217 IDGTHIDIKAPKSNPTD-YINRKNRYSLNVQACCDHKYCFLDVVVKWPGSVHDARVFSNS 275
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
L + Q + + + +++G YPL ++L+ Y G + +
Sbjct: 276 TLNNLLKTAQIPPCRRSLVEDRDPIPVFLIGDPAYPLQTYLMKEYANGGSTVQEQYYGYK 335
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPD-V 359
+ R + AF +LK + IL + + + ++ +I C +LHN + + + + V
Sbjct: 336 LCSARMVIECAFGRLKAHFGILKRPL-DININEVAHVIYACFVLHNYCELNHESIAEERV 394
Query: 360 ALSDHHDSGY 369
++ H+D+ +
Sbjct: 395 QVAIHYDNRF 404
>gi|356546142|ref|XP_003541490.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 448
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 120/263 (45%), Gaps = 20/263 (7%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
+A AL RLA +V F + + + +AL + H + + +
Sbjct: 156 LAAALFRLAHAAPYPAVSRRFAISPPDACRAFFAVCKALADNLGHLFELRTDSERVVVGF 215
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
F L NC G + T + DD EN +++VQ +VD E RF+D+ G
Sbjct: 216 GFS---SLPNCFGVLGFTRFKI--------DDSLLGENG-ALIVQALVDSEGRFLDVSAG 263
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQR-LNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
WP M L+ S ++ E + LNG SE + +YI+G +PLL WL+TPY
Sbjct: 264 WPSTMKPEIFLRESKLYREVEQSKELLNGPSYNLSEGCLIPQYILGDSCFPLLPWLLTPY 323
Query: 286 ----ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPD-KRKLPSIIL 339
E + ++ FN +H L AF +L+ WR+L+ W+ + LP +++
Sbjct: 324 NRVNEEDSFGSAERAFNCVHGNAMELVGDAFGRLRARWRLLAASRKWKQECVEHLPFVVV 383
Query: 340 VCCLLHNIIIDSGDQLHPDVALS 362
CCLLHN ++ + + PD +S
Sbjct: 384 ACCLLHNFVVKFNEAM-PDEGVS 405
>gi|449472543|ref|XP_004153626.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
gi|449522815|ref|XP_004168421.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 451
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 82/259 (31%), Positives = 126/259 (48%), Gaps = 8/259 (3%)
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHH 152
I + L + VA+ L RL G S ++ F + VS++T L +
Sbjct: 143 IALSNLSLPSDYAVAMVLSRLCHGFSAKTLASRFSLEPYLVSKITNMVTRLLATKLYAEF 202
Query: 153 LKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATHI-IMTLPAVQT-SDDWCDQENNYSML 209
+K P S R + E FEE L N CGAID + I + LPA Q S ++ + S+L
Sbjct: 203 IKIPVSRRRLIETTQAFEELTSLPNMCGAIDGSPIKLRRLPADQNFSTNYNCRFGYPSVL 262
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
+Q + D++ F D+ PGG + + + S + +G + NV I+ +R YI
Sbjct: 263 LQVVADNKKIFWDVCVKAPGGSDDASHFRDSLTYHRLTSGDVVWDNV-INVRGHHVRPYI 321
Query: 270 VGGVGYPLLSWLITPYETNGL-SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR 328
VG GYPLLS+L+TP+ NG+ + + F+ + RS+ V A LK W+IL +
Sbjct: 322 VGDWGYPLLSFLLTPFSPNGMGTPAQNLFDGMLMKGRSVVVDAIGLLKARWKILQDL--N 379
Query: 329 PDKRKLPSIILVCCLLHNI 347
P I+ CC+LHN+
Sbjct: 380 VGLSHAPQTIVACCVLHNL 398
>gi|449455228|ref|XP_004145355.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 445
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 82/259 (31%), Positives = 126/259 (48%), Gaps = 8/259 (3%)
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHH 152
I + L + VA+ L RL G S ++ F + VS++T L +
Sbjct: 137 IALSNLSLPSDYAVAMVLSRLCHGFSAKTLASRFSLEPYLVSKITNMVTRLLATKLYAEF 196
Query: 153 LKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATHI-IMTLPAVQT-SDDWCDQENNYSML 209
+K P S R + E FEE L N CGAID + I + LPA Q S ++ + S+L
Sbjct: 197 IKIPVSRRRLIETTQAFEELTSLPNMCGAIDGSPIKLRRLPADQNFSTNYNCRFGYPSVL 256
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
+Q + D++ F D+ PGG + + + S + +G + NV I+ +R YI
Sbjct: 257 LQVVADNKKIFWDVCVKAPGGSDDASHFRDSLTYHRLTSGDVVWDNV-INVRGHHVRPYI 315
Query: 270 VGGVGYPLLSWLITPYETNGL-SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR 328
VG GYPLLS+L+TP+ NG+ + + F+ + RS+ V A LK W+IL +
Sbjct: 316 VGDWGYPLLSFLLTPFSPNGMGTPAQNLFDGMLMKGRSVVVDAIGLLKARWKILQDL--N 373
Query: 329 PDKRKLPSIILVCCLLHNI 347
P I+ CC+LHN+
Sbjct: 374 VGLSHAPQTIVACCVLHNL 392
>gi|328698759|ref|XP_003240725.1| PREDICTED: hypothetical protein LOC100568546 [Acyrthosiphon pisum]
Length = 1190
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 74/308 (24%), Positives = 144/308 (46%), Gaps = 21/308 (6%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR-LLSVEKQVAIALRRLAS 116
D E+ F +FR+S+ F+ + ++ ++ + N + R + ++++ + LR LA+
Sbjct: 847 DHEDKFFKYFRMSRFQFNTLYLKIKNEISKQ------NTQFRESIPAKEKLGVCLRFLAT 900
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESFGLL 175
G+S ++ +F +G STV + A+ + K +K P E I ++F E +
Sbjct: 901 GDSYQTIAFSFRLGHSTVQGIVIEVCNAIILKLKEECIKTPQKEDWERIANEFWEIWNFP 960
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GA+D H+++ P T + + + +S+++ +VD + +F + G G +
Sbjct: 961 NCIGALDGKHVVIEAPP-NTGSLYYNYKKTFSIVLLALVDAQYKFTVVDIGAFGKNSDGG 1019
Query: 236 LLKFSGFFKLCEAGQ-RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+L S F K E + + N + +L I+G +PL ++L+ PY + + +
Sbjct: 1020 ILSHSNFGKALEKNKLHIPNNRALPGTNEKLPYVIIGDEAFPLKNYLLRPYPGPQMYSDV 1079
Query: 295 --PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP----SIILVCCLLHNII 348
FN R + AF QL +RI + + + LP I++ C+LHN I
Sbjct: 1080 KKKIFNERLSRARKVVEDAFGQLTAKFRIYCRRL-----KSLPENADKIVMTTCILHNYI 1134
Query: 349 IDSGDQLH 356
++H
Sbjct: 1135 KQDSSEIH 1142
>gi|357475473|ref|XP_003608022.1| hypothetical protein MTR_4g086650 [Medicago truncatula]
gi|355509077|gb|AES90219.1| hypothetical protein MTR_4g086650 [Medicago truncatula]
Length = 517
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 124/257 (48%), Gaps = 6/257 (2%)
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHH 152
I + L + VA+ L RLA G S +V + + V+++T L +
Sbjct: 136 IAVSNLSLPSDYAVAMVLSRLAHGLSAKTVANRYSLEPYLVTKITNMVTRLLATKLYPEF 195
Query: 153 LKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQ 211
+K P R+ EI FEE L N CGAID+T + + + + S+L+Q
Sbjct: 196 IKIPVGRRRLIEITQSFEELTSLPNMCGAIDSTSVKLRSGPSSNPATYLSRYGYPSVLLQ 255
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVG 271
+ DH+ F D+ PGG + + + S ++ +G + V I+ +R Y+VG
Sbjct: 256 VVSDHKKIFWDVCVKAPGGTDDATHFRDSLLYQRLTSGDVVWDKV-INVRGHHVRPYVVG 314
Query: 272 GVGYPLLSWLITPYETNGL-SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
YPLL +L+TP+ +G+ + + F+ + RS+ V+A LKG W+IL ++
Sbjct: 315 DWCYPLLPFLLTPFSPSGMGTPAQNLFDGMLMKGRSVVVEAIALLKGRWKILQEL--NVG 372
Query: 331 KRKLPSIILVCCLLHNI 347
+P I+ CC+LHN+
Sbjct: 373 LHHVPQTIVACCVLHNL 389
>gi|449669223|ref|XP_004206967.1| PREDICTED: uncharacterized protein LOC101235795 [Hydra
magnipapillata]
Length = 305
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 142/312 (45%), Gaps = 15/312 (4%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
EE FK FRV++ F++ LV E P G R +SV K VA+AL LAS
Sbjct: 2 EECEFKEHFRVNRNPFNF---LVNE---LHPHLGKTTTTMREPISVVKCVAVALHYLASC 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEESFGLL 175
E V FG+G+ST + + FI A+ + ++K+P S + + FE G
Sbjct: 56 EEYRVVSSLFGIGKSTAILIVYEFINAVNDILLPKYVKFPLSVENLNKYSRDFEAILGFP 115
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C GA+D HI ++ P Q + + + YS+++ +VD FI G PG N S
Sbjct: 116 QCVGAVDRCHIPISAPKDQAISYY-NYKGWYSIVLFAVVDCRYHFIYTSVGSPGRNNDSY 174
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASM 294
+L+ S + E+ + V L ++G +PL L+ PY N LS
Sbjct: 175 ILQNSSLKAILESNLFDKCCKELGDSLVPL--CLIGDSAFPLTRHLLKPYPENLELSKIQ 232
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
F+ + R + AF +LK +R++ K M D I+ C LHN I ++ D
Sbjct: 233 KNFSKILCGARRVVENAFGRLKARFRVICKRM-ECDINFATRIVNACVTLHN-ICENYDD 290
Query: 355 LHPDVALSDHHD 366
+ L +HHD
Sbjct: 291 IIIIKWLMNHHD 302
>gi|449677574|ref|XP_004208878.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 264
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 114/225 (50%), Gaps = 7/225 (3%)
Query: 126 AFGVGQSTVSQVTWRFIEALEER-AKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDA 183
FG+ T S V + A+ + + P + +M E S+FE FG++ G +D
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDG 64
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI + P + D +C ++ YS+ VQ + D++ F+D+ WPG ++ +++ S
Sbjct: 65 THIPIVCPTNHSQDYFCYKQ-YYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSIN 123
Query: 244 KLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
+ RL G ++ I+ ++++ Y++G YPLL + Y T + + FNS+
Sbjct: 124 TNLRSS-RLPGTIQTITKNKIKVPCYLIGDPAYPLLPHCMKEYSTCKKNDEV-IFNSMLR 181
Query: 303 ATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
R+ AF +LK W+IL+K M K+P++I C +LHN
Sbjct: 182 TARNPIECAFGRLKTRWKILTKKM-DLKLEKIPTVIYACFILHNF 225
>gi|356565147|ref|XP_003550806.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 406
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 72/259 (27%), Positives = 122/259 (47%), Gaps = 20/259 (7%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
+A AL RLA +V F + + + + +AL + H + ++ +
Sbjct: 123 LAAALFRLAHAAPYPAVSRRFAISPADACRAFFAVCKALADNLGHLFELRTDSQRVVVGF 182
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
F L NC G + + DD EN +++VQ +VD E RF+D+ G
Sbjct: 183 GFS---SLPNCFGVLGFARFKI--------DDSLLGENG-ALMVQALVDSEGRFLDVSAG 230
Query: 227 WPGGMNVSRLLKFSGFFK-LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
WP M +L+ + ++ + E+ + LNG SE + +YI+G +PLL WL+TPY
Sbjct: 231 WPSTMKPETILRETKLYREVEESKELLNGPSYNLSEGCLIPQYILGDSCFPLLPWLLTPY 290
Query: 286 ----ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPD-KRKLPSIIL 339
E + ++ FN+ H+ L AF +L+ WR+L+ W+ D LP +++
Sbjct: 291 NRVNEEDSFGSAERAFNTAHDNAMGLLGDAFARLRASWRLLAASRKWKRDCVEHLPFVLV 350
Query: 340 VCCLLHNIIIDSGDQLHPD 358
CLLHN ++ + + PD
Sbjct: 351 AGCLLHNFVLKFNEAM-PD 368
>gi|160773871|gb|AAI55411.1| LOC100127800 protein [Xenopus (Silurana) tropicalis]
Length = 352
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 123/255 (48%), Gaps = 26/255 (10%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
AL AS Q +V G+ Q + S+ + ++A+ + ++ +P D ++ IK F
Sbjct: 84 ALHYFASASFQGTVSALSGISQPSFSRHLTQVLKAINKLTPQYIVFPSDKAALKTIKDGF 143
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
+ N G ID TH+ ++ P T D + + +N +S+ VQ + +MR +++V G+P
Sbjct: 144 FKISSFPNVMGVIDCTHVALSPP---TEDIYRNSKNFHSLNVQMVCASDMRILNLVAGYP 200
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G + S +LK S + +G G +I+G YPL +L+TP + +
Sbjct: 201 GSTHDSYILKHSSLHSILTSGNLPEG-------------WILGDDAYPLTEYLLTPVK-D 246
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLL 344
+ + +N+ H+ S+ F +K +R L + + + P+K IIL CC+L
Sbjct: 247 AKTKAEKQYNAAHKLAHSIIESTFGSIKSRFRCLDRSGGVLQYSPEKGA--QIILACCIL 304
Query: 345 HNIIIDSGDQLHPDV 359
HN+ + LH D+
Sbjct: 305 HNLAVSRN--LHVDI 317
>gi|301614025|ref|XP_002936498.1| PREDICTED: putative nuclease HARBI1 [Xenopus (Silurana) tropicalis]
Length = 357
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 123/255 (48%), Gaps = 26/255 (10%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
AL AS Q +V G+ Q + S+ + ++A+ + ++ +P D ++ IK F
Sbjct: 89 ALHYFASASFQGTVSALSGISQPSFSRHLTQVLKAINKLTPQYIVFPSDKAALKTIKDGF 148
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
+ N G ID TH+ ++ P T D + + +N +S+ VQ + +MR +++V G+P
Sbjct: 149 FKISSFPNVMGVIDCTHVALSPP---TEDIYRNSKNFHSLNVQMVCASDMRILNLVAGYP 205
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G + S +LK S + +G G +I+G YPL +L+TP + +
Sbjct: 206 GSTHDSYILKHSSLHSILTSGNLPEG-------------WILGDDAYPLTEYLLTPVK-D 251
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLL 344
+ + +N+ H+ S+ F +K +R L + + + P+K IIL CC+L
Sbjct: 252 AKTKAEKQYNAAHKLAHSIIESTFGSIKSRFRCLDRSGGVLQYSPEKGA--QIILACCIL 309
Query: 345 HNIIIDSGDQLHPDV 359
HN+ + LH D+
Sbjct: 310 HNLAVSRN--LHVDI 322
>gi|331224048|ref|XP_003324696.1| hypothetical protein PGTG_06233 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 363
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 134/297 (45%), Gaps = 55/297 (18%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F FFR++ + F ++ + +L P G LSVE QVA+ L RLA G V+
Sbjct: 58 FVEFFRMTVEDFRWLSDQLSPELQQDPLR-----RGDPLSVEAQVAVGLYRLAHGVCYVT 112
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNR---MEEIKSKFEESFGLLNCC 178
+G F +G+ T + + RF+ A+ + + +P NR +EIK+ FE+ G+ N
Sbjct: 113 IGHVFNIGKETAEKASGRFVNAVLRILRDRTVGYPPLNRPDLWDEIKASFEQRHGIPNIV 172
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID THI + +P W G M+ +R+ +
Sbjct: 173 GAIDGTHIPVAMPPADN-------------------------------WKGSMHDTRVFR 201
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT-- 296
S +L ++ G RI + Y++G GYP ++ PY L A+ P
Sbjct: 202 RS---RLGQSMTPGTGAARI----IPAGSYLIGDAGYPSNVDILVPYP---LVATEPNEW 251
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
FN + +TR +AF +LK +RIL S RP + + + VC +LHNI+ G
Sbjct: 252 FNFIQSSTRICVEQAFGRLKNQFRILRSAQNARPFRAR--NNTFVCMILHNILNRRG 306
>gi|449690159|ref|XP_004212260.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 416
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 136/294 (46%), Gaps = 10/294 (3%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P E +K FR++K F + +++ D V P S N R L+ K++AI + L
Sbjct: 78 PDVPETCWKRNFRMTKGCFLELAAII--DTVVSPQSNCPNY--RFLTTHKKLAITIYYLK 133
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNR-MEEIKSKFEESFG 173
S FG+ Q TVS+ +A+ +L P + M ++ S+FE FG
Sbjct: 134 DTGSLWMTANVFGIHQCTVSKTVQVVCDAINNIVGPIYLHLPKNKEDMTKLASQFEVKFG 193
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
++ G ID TH+ + P D +C ++ +S+ VQ + D + FID+ WPG ++
Sbjct: 194 MIQAFGCIDGTHVQIKRPIKNGQDYFCYKQ-YFSLNVQAVCDSKGYFIDVECKWPGSVHD 252
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
+++ S K G L + + Y++G YPL ++ I +++ +
Sbjct: 253 AKMFTNSTINKKLIKGT-LPQTLYSLPNYHSIPNYLIGDPAYPLTNFCIKEFQSCSNNEE 311
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
+ FNS+ + R+ AF +LK W L K++ +P +I C +LHN
Sbjct: 312 V-IFNSMLRSARNQIECAFGRLKARWGFLRKII-DIKIETVPIVIYTCFVLHNF 363
>gi|328713765|ref|XP_003245173.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 397
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 138/292 (47%), Gaps = 12/292 (4%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
E F ++R+S K+FD + VR D +S+ + N LS E+++ I LR L++G +
Sbjct: 59 EKFFIYYRMSIKSFDELLQNVR-DHISKRYTPWRN----HLSAEERLTITLRYLSTGMTF 113
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCG 179
++ F +G+STV ++ A+ K + P + EI + F NC G
Sbjct: 114 TALHFEFKIGKSTVGEIVRETCVAIWNTLQKEEMPEPTTEHWLEISNTFYTKTNFPNCLG 173
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
A+D HI + P T + + + +S+++ +VD + FI I G G + S + K
Sbjct: 174 AVDGKHIRIQNPK-NTGSLFFNYKKYFSIILMAVVDANLSFIYIDVGSYGKESDSNIFKE 232
Query: 240 SGFFK-LCEAGQRLNGNVRISSEEVELREYI-VGGVGYPLLSWLITPYETNGLSASMPTF 297
S F K L + G +R+ S + + Y+ VG + L + L+ PY GL+ F
Sbjct: 233 SSFGKKLYSNALNIPGPIRLPSSQGSPQPYVFVGDEAFALHTNLLRPYPQRGLNDERRIF 292
Query: 298 NSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNII 348
N R AF L WR+L + ++ P+ + I+ CC+LHN +
Sbjct: 293 NYRLSRARRTVECAFGVLANKWRVLHTTILVSPE--FVDDIVKSCCVLHNFV 342
>gi|195018230|ref|XP_001984745.1| GH14854 [Drosophila grimshawi]
gi|193898227|gb|EDV97093.1| GH14854 [Drosophila grimshawi]
Length = 394
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 156/329 (47%), Gaps = 25/329 (7%)
Query: 42 SFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLL 101
+FW K+ IP+ E FK FR+ ++ FD + L+ E + + + + +
Sbjct: 58 AFWEKD-------IPASTEAEFKQHFRMEQEYFDILVQLLPE--LRKNDTNF----RKAI 104
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHL--KWPDS 158
+EK++AIAL L S VG FG+ S V ++ F +A++ ++ +L ++
Sbjct: 105 PIEKRIAIALYTLGSTAEYRVVGKLFGISMSMVGKILNDFCQAVKRVLSQEYLPKEFLTQ 164
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
++EE + FE G C GAI H+ ++ A+ S + + ++ YS ++ +VDH
Sbjct: 165 QKLEECVTGFEAK-GFPQCLGAIGICHLEVS-AAISNSSLYYNNQDWYSKILFALVDH-- 220
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
RF+ + PG +++ + S ++ E+ + N R VE+ ++G +
Sbjct: 221 RFLYVNYSCPGSFTPAQVYEESYLKEILESSPQFKSN-RQPIAGVEVPVLVLGDAAFEAT 279
Query: 279 SWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSII 338
+ ++TPY + + FN + R + AF QLK +R ++K +R II
Sbjct: 280 T-VMTPY-SQPHGETEKQFNVALQDARQVVDAAFQQLKSRFRRITKSADCHSRRN--DII 335
Query: 339 LVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
+ CC+LHN +I +G + D L S
Sbjct: 336 ICCCILHNFLIGNGSNIFDDAELQQEQQS 364
>gi|449665294|ref|XP_004206114.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 264
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 113/225 (50%), Gaps = 7/225 (3%)
Query: 126 AFGVGQSTVSQVTWRFIEALEER-AKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDA 183
FG+ T S V + A+ + + P + +M E S+FE FG++ G +D
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDG 64
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI + P + D +C ++ YS+ VQ + D++ F+D+ WPG ++ +++ S
Sbjct: 65 THIPIVCPTNHSQDYFCYKQ-YYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSIN 123
Query: 244 KLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
+ RL G + I+ ++++ Y++G YPLL + Y T + + FNS+
Sbjct: 124 TNLRSS-RLPGTFQTITKNKIKVPCYLIGDPAYPLLPHCMKEYSTCKKNDEV-IFNSMLR 181
Query: 303 ATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
R+ AF +LK W+IL+K M K+P++I C +LHN
Sbjct: 182 TARNPIECAFGRLKARWKILTKKM-DLKLEKIPTVIYACFILHNF 225
>gi|224105513|ref|XP_002313838.1| predicted protein [Populus trichocarpa]
gi|222850246|gb|EEE87793.1| predicted protein [Populus trichocarpa]
Length = 442
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 79/250 (31%), Positives = 121/250 (48%), Gaps = 7/250 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWPDSN 159
L + V++ L RLA G S ++ + + VS++T L + +K P S
Sbjct: 144 LPTDYAVSMVLSRLAHGFSAKTLASRYSLEPYLVSKITNMVTRLLATKLYPEFIKIPVSK 203
Query: 160 R-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
R + E FEE L N CGAID + I + + + C + S+L+Q + DH+
Sbjct: 204 RRLIETTQAFEELTSLPNMCGAIDGSPIKVKRADIGGNMYKC-RYGYSSVLLQVVADHKK 262
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
F D+ PGG + + L+ S + +G + V I+ +R YIVG YPLL
Sbjct: 263 VFWDVCVKAPGGSDDASHLRGSVLYNRLVSGDVVWDKV-INVRGHHVRPYIVGDWCYPLL 321
Query: 279 SWLITPYETNGL-SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
S+L+TP+ NG + S F+ + RS+ V A LKG W+IL + D P
Sbjct: 322 SFLMTPFSPNGSGTPSQNLFDGMLMKGRSVVVDAIALLKGRWKILQDLNVGLDHA--PQT 379
Query: 338 ILVCCLLHNI 347
I+ CC+LHN+
Sbjct: 380 IVACCVLHNL 389
>gi|241640721|ref|XP_002410899.1| transposase, putative [Ixodes scapularis]
gi|215503597|gb|EEC13091.1| transposase, putative [Ixodes scapularis]
Length = 459
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 132/304 (43%), Gaps = 41/304 (13%)
Query: 67 FRVSKKTFDYI--------CSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
FRV K + Y+ C + PP E++V+IAL RL
Sbjct: 133 FRVGKDLYHYLLTELKPFMCVDEDDRDDDYPP------------FEQRVSIALWRLGKVW 180
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
++ AFGV TV+++ F A+ + ++ P +E+I +F + C
Sbjct: 181 DDPALMRAFGVTGDTVARIVREFCAAVVKVLMPRFVQVPYDEDLEDIAEQFRDRTKFPRC 240
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GA+ H+ + P +D+ + E S++VQ +V + RF D+ GWPG V+++L
Sbjct: 241 AGALCFAHVPILPPQDDEGNDYTNPEGWNSIVVQAVVGADRRFWDLNIGWPGSTQVAQVL 300
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+ S +K E G V EE + +++ + G GYPL W++TPY M
Sbjct: 301 RSSSLWKKGEEGTLFAAKV----EEICGIRVKQVLAAGTGYPLRRWIVTPY-------LM 349
Query: 295 PT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
P FN S+ AF +L+ + L + +PSI+ C LHN+
Sbjct: 350 PNECQLRFNRSLSEVLSVGEAAFRRLEARFPFLLRHN-GSGIDLMPSIVAACSTLHNMTE 408
Query: 350 DSGD 353
+ GD
Sbjct: 409 EWGD 412
>gi|432955674|ref|XP_004085595.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 345
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 134/297 (45%), Gaps = 26/297 (8%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S ++ YI +L++ P I+ GR L+ E+ + ALR A+G ++G A
Sbjct: 47 YRFSLQSITYIHNLIQ------PFISNISHHGRALTSEQILCAALRFFANGSFLYNIGDA 100
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ ++TV + + AL+ + +P + IK +F G N G ID THI
Sbjct: 101 EHISKATVCRAVRKVCLALKRFLHIFIVFPGHKPLRPIKEEFHRIAGFPNVVGCIDGTHI 160
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P + D+ ++++ +S+ VQ I D ++ WPG ++ SR+ + S
Sbjct: 161 PIIAPN-ENEADYVNRKSIHSINVQIICD-AAHITNVEAKWPGSVHDSRIFRESTLSNRL 218
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
E G E+ +++G GYP L+TPY FN H TR+
Sbjct: 219 ECG--------------EIDGFLLGDRGYPCQPKLLTPYPEPE-QGPQQRFNWAHSRTRA 263
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
LK ++ L + P++ II+ C +LHNI I G+Q HP + + D
Sbjct: 264 RVEMTIGLLKARFQCLRHLRVTPERAC--DIIVACVVLHNIAILRGEQ-HPALHIQD 317
>gi|301610538|ref|XP_002934801.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 328
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 71/244 (29%), Positives = 119/244 (48%), Gaps = 22/244 (9%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
+L ASG Q GV GV Q T S+ + ++A+ ++ + +P + N +K F
Sbjct: 93 SLHFFASGSFQRVGGVYGGVSQPTFSRCLSQVLDAIRSASRTFISFPQNRNEWGTVKRDF 152
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
G+ N G ID TH+ + P Q + + +++ +S+ VQ + D +M + IV+G+
Sbjct: 153 YRVSGIPNVLGEIDCTHVALNPPPPQDREHVYRNRKGYHSINVQVVCDAKMNILSIVSGF 212
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG + + +L+ SG ++ E GQ +G +++G GYP WLITP
Sbjct: 213 PGSSHNAYILRQSGLYQTFETGQMPHG-------------WLLGDAGYPCGRWLITPIHR 259
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCL 343
A FN H T S+ + F LK +R L + +++ P K+ +II C +
Sbjct: 260 PRSRAEC-DFNQAHVRTLSVIERTFGVLKSRFRCLDRSGGSLLYSPT--KVANIIGACAV 316
Query: 344 LHNI 347
LHN+
Sbjct: 317 LHNL 320
>gi|449684574|ref|XP_004210663.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 335
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 136/294 (46%), Gaps = 10/294 (3%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P E +K FR++K F + +++ D V P S N R L+ K++AI + L
Sbjct: 4 PDVPETCWKRNFRMTKGCFLELAAII--DTVVSPQSNCPNY--RFLTTLKKLAITIYYLK 59
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNR-MEEIKSKFEESFG 173
S FG+ Q TVS+ +A+ +L P + M ++ S+FE FG
Sbjct: 60 DTGSLWMTANVFGIHQCTVSKTVKVVCDAINNIVGPIYLHLPKNKEDMTKLASQFEVKFG 119
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
++ G ID TH+ + P D +C ++ +S+ VQ + D + FID+ WPG ++
Sbjct: 120 MIQAFGCIDGTHVQIKHPIKNGQDYFCFKQ-YFSLNVQAVCDSKGYFIDVECKWPGSVHD 178
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
+++ S K G L + + Y++G YPL ++ I +++ +
Sbjct: 179 AKMFTNSTINKKLIKGT-LPQTLYSLPNYHSIPNYLIGDPAYPLTNFCIKDFQSCSNNEE 237
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
+ FNS+ + R+ AF +LK W L K++ +P +I C +LHN
Sbjct: 238 V-IFNSMLRSARNQIECAFGRLKARWGFLRKII-DIKIETVPIVIYTCFVLHNF 289
>gi|301632320|ref|XP_002945236.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 349
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 122/247 (49%), Gaps = 22/247 (8%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEI 164
++ +L ASG Q GV GV Q T SQ + ++A+ ++ + +P + N +
Sbjct: 89 KLPCSLHFFASGSFQRVGGVYGGVSQPTFSQCLGQVLDAIRSVSRTFISFPQNQNEWGTV 148
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
K F G+ N GAID TH+ + P + +++ +S+ +Q + D +M + IV
Sbjct: 149 KRDFYRVSGIPNVLGAIDCTHVALN-PPQDREHVYRNRKGYHSINIQVVCDAKMNILSIV 207
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
+G+PG + + +L+ SG ++ E GQ +G +++G GYP WLITP
Sbjct: 208 SGFPGSSHDAYILRQSGLYQAFETGQMPHG-------------WLLGDAGYPCGRWLITP 254
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILV 340
S + FN H TRS+ + F LK +R L + +++ P K+ +II
Sbjct: 255 IHRP-RSQAECAFNQSHVRTRSVIERTFGVLKSRFRCLDRSGGSLLYSPT--KVANIIGA 311
Query: 341 CCLLHNI 347
C +L+N+
Sbjct: 312 CAVLNNL 318
>gi|149725130|ref|XP_001489949.1| PREDICTED: putative nuclease HARBI1 [Equus caballus]
Length = 349
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGTSL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA + +P D + M+ +K +F E G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPADESSMQALKDEFYELAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|327259629|ref|XP_003214638.1| PREDICTED: putative nuclease HARBI1-like [Anolis carolinensis]
Length = 348
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 127/280 (45%), Gaps = 24/280 (8%)
Query: 76 YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVS 135
Y+ L+ +L SRP R +S E Q+ AL SG Q +G A G+ Q+++S
Sbjct: 48 YLVDLLGANL-SRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMS 101
Query: 136 QVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
+ EAL ER+ + +P D ++++K F G+ G +D TH+ + P +
Sbjct: 102 RCVANVTEALVERSSQFIHFPEDDASVQQLKDDFYTLAGMPGVLGLVDCTHVAIKAPNAE 161
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG 254
+ +++ +S+ + D + T WPG + +L+ SG EAG +G
Sbjct: 162 DL-SYVNRKGLHSLNCLMVCDARGLLLSAETHWPGSLQDCTVLQQSGLRNQFEAGMHKDG 220
Query: 255 NVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQ 314
+++G + L +WL+TP A +N H AT ++ + F
Sbjct: 221 -------------WLLGDNSFFLRTWLMTPLHIPETPAEY-RYNMAHSATHNIIEQTFWS 266
Query: 315 LKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
++ +R L SK + K IIL CC+LHNI ++ G
Sbjct: 267 IRSRFRCLDGSKGTLQYSPEKASHIILACCVLHNIALEHG 306
>gi|301608541|ref|XP_002933847.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 386
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 119/245 (48%), Gaps = 22/245 (8%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
L LASG Q ++ G Q + S+ + + A LK+ D + ++KS F
Sbjct: 91 TLHFLASGSFQTTISACTGFSQPSFSKHLKFVLAGICSLAAELLKYDMDPANLAKMKSDF 150
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
+ N GAID TH+ + +P + +++ +S+ VQ + D R +D+V+ +P
Sbjct: 151 YSIAQMPNVIGAIDCTHVAL-IPPADKERFYYNRKGFHSINVQVVCDASCRILDVVSKFP 209
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G + S + + S + ++G+ +G +++G GY + WLITP N
Sbjct: 210 GSTHDSFIFRNSHLHERLQSGEAGSG-------------WLLGDSGYSVKPWLITPL-LN 255
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLL 344
+ S FNS H+ATR + + F LK +R L K +M++P+ K+ II CC+L
Sbjct: 256 PQNESEENFNSSHKATRCIIERTFGILKSRFRCLDKTGGALMYKPE--KVCQIIFCCCIL 313
Query: 345 HNIII 349
HN +
Sbjct: 314 HNFAL 318
>gi|449667873|ref|XP_004206663.1| PREDICTED: uncharacterized protein LOC101235010 [Hydra
magnipapillata]
Length = 351
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 124/277 (44%), Gaps = 24/277 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSN 159
+SV K+VA+AL LAS E V FG+G+ST + + FI A+ + ++K+P
Sbjct: 75 ISVVKRVAVALHYLASCEEYCVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFP--- 131
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
FE G C GA+D HI ++ P Q + + + + YS+++ IVD R
Sbjct: 132 ------LDFEAILGFPQCVGAVDGCHIPISAPKDQ-AISYYNYKGWYSIVLFAIVDCRYR 184
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
FI G PG N S +L+ S + E+ + V L ++G +PL
Sbjct: 185 FIYTSVGLPGRNNDSYILQNSSLKAILESNLFDKCCKELGDSLVPLS--LIGDSAFPLTR 242
Query: 280 WLITPYETN-GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSII 338
L+ PY N LS FN + R + AF ++K +R++ K M D I+
Sbjct: 243 HLLKPYPENLELSEIQKHFNKILCGARRVVENAFGRVKARFRVICKRM-ECDINFATRIV 301
Query: 339 LVCCLLHNIIIDSGDQLHPDVA----LSDHHDSGYGE 371
C LHNI + + D+ L HHD E
Sbjct: 302 NACVTLHNIC-----EYYDDIIIIKWLMHHHDDNLAE 333
>gi|292616811|ref|XP_001344486.2| PREDICTED: putative nuclease HARBI1 [Danio rerio]
Length = 457
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 153/340 (45%), Gaps = 36/340 (10%)
Query: 22 SKSTSAAPKD-------PNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTF 74
S TS +P + P SD DWW + E + F V+++ F
Sbjct: 59 STRTSPSPTEEDLPNHVPIPSDPDWWER----------VVMTEFGPEDWLDKFHVTREIF 108
Query: 75 DYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTV 134
+ + ++ L I + +++EK +A+AL RL++ + FGV V
Sbjct: 109 LLLSTKLKPQL----EQVYIQQTQQTITLEKCIAMALMRLSTSTEYCFLSELFGVPIPAV 164
Query: 135 SQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAV 193
Q EA+ K +++ P + +E+ +F +G +C G ID+ HI + LPA
Sbjct: 165 CQCVREVCEAIILLLKPIYMQLPAQHELEDNVEQFRSLWGFPHCIGVIDSLHIPVNLPAA 224
Query: 194 QTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLN 253
++ D W + +S+++QG+V+ F D+ +G+ G + +L+ S + + E G
Sbjct: 225 ESQDCW-NPSGWHSVVLQGVVNAHGNFWDVCSGFTGSTDDMTILQSSELWTMAEKG---- 279
Query: 254 GNVRISSEEVELRE---YIVGGVGYPLLSWLITPY-ETNGLSASMPTFNSLHEATRSLAV 309
G +E+ R ++G VG+ L +WL+ Y ++ L+ TFN +
Sbjct: 280 GFCSQPPKELMGRPLGFLLLGDVGFSLQNWLLKCYPSSSNLTPQQQTFNVKLNLGLKVIE 339
Query: 310 KAFLQLKGGWRILSKVMWRPDKRK--LPSIILVCCLLHNI 347
+AF +LK W+ L R D + + + CC+LHNI
Sbjct: 340 QAFQRLKARWQCLHN---RNDNNVDLVSKMAVACCILHNI 376
>gi|356502730|ref|XP_003520169.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 349
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 76/259 (29%), Positives = 121/259 (46%), Gaps = 8/259 (3%)
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHH 152
I + L + VA+ L RLA G S ++ + + VS++T L +
Sbjct: 41 IALSNLSLPSDYAVAMVLSRLAHGLSAKTLASRYSLDPYLVSKITNMVTRLLATKLYPEF 100
Query: 153 LKWP-DSNRMEEIKSKFEESFGLLNCCGAIDAT--HIIMTLPAVQTSDDWCDQENNYSML 209
+K P R+ E FEE L N CGAID T H+ + + + S+L
Sbjct: 101 IKIPVGRRRLLETTQAFEELTSLPNMCGAIDTTPVHLRNNPNPNTNPNFYRCRYGYPSLL 160
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
+Q + DH+ F D+ PGG + S + S + +G + V IS +R Y+
Sbjct: 161 LQVVSDHKKIFWDVCVKAPGGTDDSTHFRDSLLYHRLTSGDVVWDKV-ISVRGHHVRPYV 219
Query: 270 VGGVGYPLLSWLITPYETNGL-SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR 328
VG +PLL +L+TP+ +G+ + + F+ + RS+ V+A LKG W+IL +
Sbjct: 220 VGDWCFPLLPFLLTPFSPSGMGTPAQNLFDGMLMKGRSVVVEAIALLKGRWKILQDL--N 277
Query: 329 PDKRKLPSIILVCCLLHNI 347
R P I+ CC+LHN+
Sbjct: 278 TGVRHAPQTIVACCVLHNL 296
>gi|449692693|ref|XP_004213135.1| PREDICTED: uncharacterized protein LOC101236914 [Hydra
magnipapillata]
Length = 336
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 153/350 (43%), Gaps = 26/350 (7%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
EE FK FRV++ TF++ LV E P G R +SV K+VA+AL L S
Sbjct: 2 EECEFKEHFRVNRNTFNF---LVNE---LHPHLGKTTTTMREPISVVKRVAVALHYLVSR 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEESFGLL 175
E V FG+G+ST + + +FI A++ ++K+P S + + FE G
Sbjct: 56 EEYRVVSSLFGIGKSTANLIVHKFINAVKYILLPKYVKFPLSVENLNKHSRDFEAILGFP 115
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C GA+D HI + P Q + + + + Y +++ +VD RFI G PG N S
Sbjct: 116 QCVGAVDGCHISILAPKSQ-AISYYNYKGWYFIVLFSVVDCRYRFIYTNVGSPGRNNDSY 174
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASM 294
+L+ S E+ + V L ++G +PL L+ PY N LS
Sbjct: 175 ILQNSSLKAFLESSLFDKCCKELGDSLVPL--CLIGDSAFPLTRHLLKPYPENLELSEIQ 232
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
FN + R + AF ++K +R++ K M D I+ C L NI +
Sbjct: 233 KNFNKILCGARRVVENAFGRVKARFRVICKRM-ECDINFATRIVNACVTLQNIC-----E 286
Query: 355 LHPDVA----LSDHHDSGYGEQCCKQV---DPMGRTTRENLEKHLQHNQE 397
+ D+ L HHD + + G+ R+++ K+L E
Sbjct: 287 YYDDIIIIEWLMHHHDDSLAQPNIVSTTGNNGPGKNVRDSIAKYLYERDE 336
>gi|301607685|ref|XP_002933439.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 425
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 119/245 (48%), Gaps = 22/245 (8%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
L LASG Q ++ G Q + S+ + + A LK+ D + ++KS F
Sbjct: 91 TLHFLASGSFQTTISACTGYSQPSFSKHLKFVLAGICSLAPELLKYDMDPANLAKMKSDF 150
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
+ N GAID TH+ + +P + +++ +S+ VQ + D R +D+V+ +P
Sbjct: 151 YSIAQMPNVIGAIDCTHVAL-IPPADKERFYYNRKGFHSINVQVVCDASCRILDVVSKFP 209
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G + S + + S + ++G+ +G +++G GY + WLITP N
Sbjct: 210 GSTHDSFIFRNSHLHERLQSGEAGSG-------------WLLGDSGYSVKPWLITPL-LN 255
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLL 344
+ S FNS H+ATR + + F LK +R L K +M++P+ K+ II CC+L
Sbjct: 256 PQNESEENFNSSHKATRCIIERTFGILKSRFRCLDKTGGALMYKPE--KVCQIIFCCCIL 313
Query: 345 HNIII 349
HN +
Sbjct: 314 HNFAL 318
>gi|349604650|gb|AEQ00142.1| Putative nuclease HARBI1-like protein, partial [Equus caballus]
Length = 286
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 116/257 (45%), Gaps = 18/257 (7%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
R +S E Q+ AL SG Q +G A G+ Q+++S+ EAL ERA + +P D
Sbjct: 2 RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPAD 61
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ M+ +K +F E G+ G +D H+ + P + + +++ +S+ + D
Sbjct: 62 ESSMQALKDEFYELAGMPGVIGVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIR 120
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+ + T WPG + +L+ S EAG + +++G + L
Sbjct: 121 GALMTVETNWPGSLQDYAVLQQSSLSSQFEAGMHKDS-------------WLLGDSSFFL 167
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLP 335
+WL+TP +N H AT S+ K F L +R L SK + K
Sbjct: 168 RTWLMTPLHIPETPVEY-RYNMAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSS 226
Query: 336 SIILVCCLLHNIIIDSG 352
IIL CC+LHNI ++ G
Sbjct: 227 HIILACCVLHNISLEHG 243
>gi|346466591|gb|AEO33140.1| hypothetical protein [Amblyomma maculatum]
Length = 418
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 162/357 (45%), Gaps = 35/357 (9%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICS----LVREDLVSRPPSGLINIEGRLLSVEKQVAI 109
T+P + F+ FRV++ TF Y+ S + R+D R + + K+VAI
Sbjct: 82 TVPHLPDSEFRTHFRVNRSTFRYLLSACECMRRQDTNMR----------MAVPLHKRVAI 131
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRF----IEALEERAKHHLKWPDSNRMEEIK 165
L RLA+ S+ AFGVG+STV+++ W F + LE + + +P + E
Sbjct: 132 GLYRLATSGEDRSISNAFGVGRSTVNEIFWEFCHVVVRLLEPK---FVGFPTIRDLAEHM 188
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
+F G GA+D HI + P + + D+ + + YS ++ + DH+ + +
Sbjct: 189 RQFAAVTGFPQGVGALDGCHIEVC-PPEENAVDYYNYKGWYSTILLAVADHKYKLLYCCV 247
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
G PG + S + + S KL ++ + + V+I + V + ++ +PL + ++ PY
Sbjct: 248 GAPGRNHDSGVFQASRLPKLLQS-ELFSREVKILN-SVPVGPVLLADQAFPLQTHIMKPY 305
Query: 286 ETNGLSASMPT--FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
G S PT FN + R + F +LK +R+L K + + + + I C+
Sbjct: 306 SQPGAQGS-PTQLFNFRLSSARRVVENVFGRLKARFRMLLKGL-ECNIHNVNTAIRAACV 363
Query: 344 LHNIIIDSGDQLHPDVALSD---HHDSGYGEQCC--KQVDPMGRTTRENLEKHLQHN 395
LHN+ D+ DVA D D + C + D G R+ L H+ N
Sbjct: 364 LHNVCEQMSDRC--DVAWLDAAQSEDQRRTQPVCTSSRHDAPGVAVRDALADHIFSN 418
>gi|72038947|ref|XP_792833.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 346
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 156/331 (47%), Gaps = 31/331 (9%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R+ + +C L+ +D+ + + L V QV +ALR A+G G
Sbjct: 37 YRLPRHLILELCGLLDDDIKRQTRRSMA------LPVALQVPMALRFYATGSFYAVTGDL 90
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATH 185
GV +++VS+V L A +++ +P D + ++K F + G+ N GA+D TH
Sbjct: 91 QGVSKASVSRVVRDISRVLVREASNYIVFPRDRASLLQVKRGFTDMCGIPNTLGAVDCTH 150
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
+ + P++ + +++ +S+ +Q I D +M+F++++ +PG + S + SG +
Sbjct: 151 VRIRSPSID-EHLFVNRKGYHSINIQCICDSQMKFLNVLARFPGSSHDSYIWANSGICR- 208
Query: 246 CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATR 305
E+V ++ +++G GYPL +L+TP N + +N H+ R
Sbjct: 209 -------------RFEDVPVQGHLLGDSGYPLRPFLLTPL-LNPTTRPEERYNQSHKRGR 254
Query: 306 SLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVAL 361
S+ ++F LK +R + + + P+ ++ I + +LHNI + + L D +
Sbjct: 255 SVIERSFGVLKSRFRCIDVSGGGLQFSPE--RVCHIFVAVAVLHNICV--TNNLPYDEPI 310
Query: 362 SDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
H D + ++ GR TR NL + L
Sbjct: 311 VPHLDDNNHDDVHAEIAGDGRRTRNNLIEQL 341
>gi|297688817|ref|XP_002821868.1| PREDICTED: putative nuclease HARBI1 [Pongo abelii]
Length = 349
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E QV AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQVLAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + ++VG + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLVGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|328724322|ref|XP_003248105.1| PREDICTED: hypothetical protein LOC100569919 [Acyrthosiphon pisum]
Length = 404
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/345 (26%), Positives = 152/345 (44%), Gaps = 27/345 (7%)
Query: 39 WWPSFWAKNSSTPGA-TIPSDEEEG--FKYFFRVSKKTFDYICSLVREDLVSRPPSGLIN 95
W FW +N GA ++ D E+G F+ F+R+ KKTFD + LV + + +
Sbjct: 29 WVHPFWRENCRGRGAFSVFQDLEDGDRFQSFYRMEKKTFDMLTELVGSKIQKKDTN---- 84
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTV-------SQVTWRFIEALEER 148
R + +++ I LR + S S+ F G++TV S+ W ++ L
Sbjct: 85 -YRRSVCPRERLLITLRYFSGEVSMRSISYYFLRGETTVRNIIETTSKALWEVLQPLYMP 143
Query: 149 AKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
+ + W + I ++ E + L NC G+ID HI + P V + + + + +S+
Sbjct: 144 IPNQIMW------KTIADRYNELWNLPNCVGSIDGKHIRIKAP-VNSGSSFFNYKGYFSI 196
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC-EAGQRLNGNVRISSEEVE--L 265
++ D + +FI I G G + S++LK S F +L + L N + EE +
Sbjct: 197 VLMATADADGKFITIDVGEYGRNSDSKVLKESAFGQLLFKKKLNLPENACLPHEENDPTF 256
Query: 266 REYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
Y V +PLL ++ PY L+ + FN R AF + +RIL +
Sbjct: 257 PYYFVADEAFPLLDNVMRPYPRRSLTNTKRIFNYRLSRGRKSVECAFGMMASKFRILERP 316
Query: 326 MWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSDHHDSGY 369
+ K+ +I C+LHN+I + G P D+ D Y
Sbjct: 317 I-NFKTEKIEIVIKAICVLHNLIRVHDGTYSIPQGIQQDNVDLLY 360
>gi|340385015|ref|XP_003391006.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 419
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 72/326 (22%), Positives = 147/326 (45%), Gaps = 34/326 (10%)
Query: 40 WPSFWAKNSSTPGAT------IPSDEEEGFKYFFRVSKKTFDYICS----LVREDLVSRP 89
W W +N GA + + +K F R+ +F+++ + + R+D + R
Sbjct: 31 WVRQWIRNRPQYGAYNQLLCELQLTDTSSYKQFLRMDHNSFEFLSNVSPIIARQDTIMR- 89
Query: 90 PSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA 149
R + +++A+ LR LA+GES S+ + + T+ ++ + + E
Sbjct: 90 ---------RAIPAGERLALTLRFLATGESYSSLQYLYRIPSQTIGKIVIETCQGINEVV 140
Query: 150 KHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
+ +LK P + + +I +F+ + +CCGA+D H+ + +P + + + + +S+
Sbjct: 141 QDYLKVPKTQSEWRQIAEQFDSRWNFPHCCGALDGKHVNI-IPRANSGSYYYNYKQRFSI 199
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSE-----EV 263
++ +VD + +FI + G G ++ + + S A Q N E +
Sbjct: 200 VLMALVDADYKFIFVDIGCNGRVSDGGIFRESTL----SAAQESNALDFPPPEPLPGCTL 255
Query: 264 ELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL- 322
+ +V +PL ++ PY NGL+ + +N R + AF L +R+L
Sbjct: 256 PIPYMVVADAAFPLKHYIQKPYSQNGLTHNKRIYNYRLSRARRVVENAFGILGNRFRVLM 315
Query: 323 SKVMWRPDKRKLPSIILVCCLLHNII 348
+ + P+K ++ I+L CC LHN +
Sbjct: 316 TPIALVPEKVEV--IVLACCTLHNFL 339
>gi|225428725|ref|XP_002285007.1| PREDICTED: putative nuclease HARBI1-like [Vitis vinifera]
Length = 449
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 122/257 (47%), Gaps = 9/257 (3%)
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHH 152
I + L + VA+ L RL+ G S ++ + + +S++T L +
Sbjct: 146 IALSNLSLPSDYAVAMVLSRLSHGFSAKTLSSRYSLEPYLISKITNMVTRLLATKLYPEF 205
Query: 153 LKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQ 211
+K P S R + E FEE L N CGAID + + + ++ + + S+L+Q
Sbjct: 206 IKIPVSRRRLHETTQAFEELTSLPNMCGAIDGSPVKIR---NNSASSYRCRYGFQSVLLQ 262
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVG 271
+ DH+ F D+ PGG + + L+ S + +G + V I+ +R YIVG
Sbjct: 263 VVADHKKIFWDVCVKAPGGTDDATHLRDSLLYNRLTSGDIVWDKV-INVRNHHVRPYIVG 321
Query: 272 GVGYPLLSWLITPYETNGLSAS-MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
YPLLS+L+TP+ +G S F++ RS+ V+A LKG WRIL +
Sbjct: 322 DWCYPLLSFLLTPFSPSGSGTSGQNLFDAALMKGRSVVVEAIGLLKGRWRILQDL--NVG 379
Query: 331 KRKLPSIILVCCLLHNI 347
P I+ CC+LHN+
Sbjct: 380 MNHAPQTIVACCVLHNL 396
>gi|426368148|ref|XP_004051074.1| PREDICTED: putative nuclease HARBI1 [Gorilla gorilla gorilla]
Length = 349
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E QV AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQVLAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLHVPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|291384923|ref|XP_002709124.1| PREDICTED: harbinger transposase derived 1 [Oryctolagus cuniculus]
Length = 349
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEHLMSMYGFPRQFIYYLVELLGTSL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D + +K +F G+
Sbjct: 86 QTRLGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASTQALKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + S +L+
Sbjct: 146 GVVDCLHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLRDSAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S + EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSALSRHLEAG-------------LHKGSWLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|410220586|gb|JAA07512.1| harbinger transposase derived 1 [Pan troglodytes]
Length = 349
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E QV AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQVLAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLQTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|114637368|ref|XP_521904.2| PREDICTED: putative nuclease HARBI1 [Pan troglodytes]
gi|410248774|gb|JAA12354.1| harbinger transposase derived 1 [Pan troglodytes]
gi|410306074|gb|JAA31637.1| harbinger transposase derived 1 [Pan troglodytes]
gi|410333521|gb|JAA35707.1| harbinger transposase derived 1 [Pan troglodytes]
Length = 349
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E QV AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQVLAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLQTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|72126314|ref|XP_795950.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 293
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/258 (28%), Positives = 121/258 (46%), Gaps = 17/258 (6%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-N 159
L V QV ALR A G Q G V QS+V ++ EA+ R + +K+P + +
Sbjct: 16 LPVYLQVLTALRFYAVGSFQKMHGDEATVSQSSVCRIIKDVSEAIAGRKRQFMKFPTTRD 75
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+E + +F E G GAID TH+ + P + + +++N YS+ VQ + DH +
Sbjct: 76 EIETTQQQFYEYCGFPGVIGAIDGTHVYIRSPGGDQALYFMNRKNRYSVNVQVVCDHAGK 135
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
IV WPG + SR+ F C ++L S E ++G GYP L
Sbjct: 136 MTSIVARWPGSTHDSRI------FSECTLKEQLEARADGS-------ELLLGDSGYPCLP 182
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
+L+TP ++ +N+ H+ R + + F + K + L+ + + D +II+
Sbjct: 183 YLLTPLLRPQGRPNV-RYNTAHKRGRCVIERTFGRWKRRFPCLNDLRVKVDTTF--TIIV 239
Query: 340 VCCLLHNIIIDSGDQLHP 357
C +L NI +D +Q P
Sbjct: 240 ACSVLWNISLDRREQDIP 257
>gi|345497547|ref|XP_003428014.1| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 381
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/328 (23%), Positives = 148/328 (45%), Gaps = 32/328 (9%)
Query: 38 DWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
DWW +F N S E+ +YF + S+KTF ++ L++ + P G ++ +
Sbjct: 21 DWWDAFVKDNYSN---------EKCLQYF-KCSRKTFTFLVKLLKPHIA--PKIGALDSD 68
Query: 98 GRL-----LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH 152
L +SV+KQVA+ L +L + V + F + ++T+ ++ ++ + A+ + H
Sbjct: 69 SPLYLRKGVSVDKQVAVLLYKLTTCVDYVGISKKFKIHKTTIHKILYKSVIAINKYLLHS 128
Query: 153 -LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQ 211
+ P E I + FE+++ L GA+ HI ++ P + + + + + S ++Q
Sbjct: 129 TISMPKPEEAEIICNDFEQAYKLPQIIGAMTLAHIPISSP-INLNYKFLNSKLYPSFVLQ 187
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF----KLCEAGQRLNGNVRISSEEVELRE 267
++D F D+ G ++ S + K+ +R G + IS +
Sbjct: 188 TVIDSNFLFRDVSVRHAGATEPQLIIADSNIYKYSHKVMPPEKRNIGGIDISYK------ 241
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
I+ PL WL+ Y+ + FN+ E R+ A +L+ + ILS +M
Sbjct: 242 -IIAPAPGPLYRWLLNSYDEET-TKEETRFNNCLEEIRNYADSVISRLRSRFLILSHMM- 298
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQL 355
+ P +I CC++HNI +GD
Sbjct: 299 DLSYKVAPQVIAACCIIHNICERNGDTF 326
>gi|410973651|ref|XP_003993261.1| PREDICTED: putative nuclease HARBI1 [Felis catus]
Length = 349
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 129/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D M+ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERATQFIRFPADEASMQAVKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDNSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|27883880|ref|NP_776172.1| putative nuclease HARBI1 [Homo sapiens]
gi|74732341|sp|Q96MB7.1|HARB1_HUMAN RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|16552848|dbj|BAB71391.1| unnamed protein product [Homo sapiens]
gi|22477670|gb|AAH36925.1| Harbinger transposase derived 1 [Homo sapiens]
gi|119588398|gb|EAW67992.1| hypothetical protein FLJ32675, isoform CRA_a [Homo sapiens]
gi|119588399|gb|EAW67993.1| hypothetical protein FLJ32675, isoform CRA_a [Homo sapiens]
gi|119588400|gb|EAW67994.1| hypothetical protein FLJ32675, isoform CRA_a [Homo sapiens]
Length = 349
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E QV AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQVLAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|224123456|ref|XP_002330319.1| predicted protein [Populus trichocarpa]
gi|222871354|gb|EEF08485.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/257 (30%), Positives = 121/257 (47%), Gaps = 21/257 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWPDSN 159
L + V++ L RLA G S ++ + + +S++T L + +K P S
Sbjct: 142 LPSDYAVSMVLSRLAHGFSAKTLASRYSLEPYLISKITNMVTRLLATKLYPEFIKIPVSR 201
Query: 160 R-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY-------SMLVQ 211
R + E FEE L N CGAID + I V+ D+ + N Y S+L+Q
Sbjct: 202 RRLIETTKAFEELTSLPNMCGAIDGSPI-----KVKCGDN---EGNVYKCRYGYSSVLLQ 253
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVG 271
+ DH+ F D+ PGG + + L+ S + +G + V I +R YIVG
Sbjct: 254 VVADHKKMFWDVCVKAPGGSDDASHLRESLLYNRLVSGDVVWDKV-IDVRGHHVRPYIVG 312
Query: 272 GVGYPLLSWLITPYETNGL-SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
YPLLS+L+TP+ NG + + F+ + RS+ V A LKG W+IL +
Sbjct: 313 DWCYPLLSFLMTPFSPNGSGTPAQNLFDGMLMKGRSVVVDAIALLKGRWKILQDL--NVG 370
Query: 331 KRKLPSIILVCCLLHNI 347
P I+ CC+LHN+
Sbjct: 371 LHHAPQTIVACCVLHNL 387
>gi|397488418|ref|XP_003815263.1| PREDICTED: putative nuclease HARBI1 [Pan paniscus]
Length = 349
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E QV AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQVLAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLEDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLQTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|332259854|ref|XP_003278999.1| PREDICTED: putative nuclease HARBI1 [Nomascus leucogenys]
Length = 349
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E QV AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQVLAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASVQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLQTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|58262144|ref|XP_568482.1| hypothetical protein CNM02140 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230655|gb|AAW46965.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 445
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 27/314 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVRED--LVSRPPSGLINIEGRLLSVEKQVAIALRRLAS- 116
+E F+ RV+ F + L+++ VS P N LL Q+ +AL RL
Sbjct: 72 DEDFRRKLRVNHAEFRKLLCLIKDHPVFVSHGPRKQANP---LL----QLTVALYRLGHC 124
Query: 117 --GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
S +G FGV + T + T R I+A+ ++++ WPD N + I FEE +
Sbjct: 125 GCAASTFEIGEQFGVSEGTSAIWTTRVIKAILSLERNNVYWPDENERKAIDRHFEEEEDI 184
Query: 175 LN-CCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
+ C G ID H+ PA + ++ + Y + GI DH R G+P +
Sbjct: 185 PDGCVGIIDGFHVPFAYKPARHDAVNFFSHKGRYGFNILGICDHLKRIRYFQYGYPASAH 244
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVR-----ISSEEVELREYIVGGVGYPLLSWLITPYE- 286
+R+ K F+ A + N + SE + EY++ +P W + ++
Sbjct: 245 DARIFKNCSLFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFKR 304
Query: 287 ---TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI----IL 339
N L FN + R A+ LK W+ L + + + S+ I
Sbjct: 305 RRGQNDLDRPEAKFNKKCSSARVKIEHAYGILKNRWQSLRSLRVKIRNVRDESVATCWIR 364
Query: 340 VCCLLHNIIIDSGD 353
C +LHN++ID+GD
Sbjct: 365 ACVVLHNLLIDTGD 378
>gi|73983355|ref|XP_540753.2| PREDICTED: putative nuclease HARBI1 [Canis lupus familiaris]
Length = 349
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 129/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D M+ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERATQFIRFPADEASMQALKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLNSHFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|355694124|gb|AER99563.1| harbinger transposase derived 1 [Mustela putorius furo]
Length = 339
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 129/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 23 DEYLMSMYGFPRQFIYYLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 76
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 77 QTRMGDAIGISQASMSRCVANVTEALVERATQFIRFPSDETSVQALKDEFYGLAGMPGVI 136
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 137 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQ 195
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 196 QSSLSSHFEAG-------------IHKDSWLLGDSSFSLRTWLMTPLHIPETPAEY-RYN 241
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 242 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 297
>gi|403254691|ref|XP_003920094.1| PREDICTED: putative nuclease HARBI1 [Saimiri boliviensis
boliviensis]
Length = 349
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSAFFLHTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFQTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|296218013|ref|XP_002755264.1| PREDICTED: putative nuclease HARBI1 [Callithrix jacchus]
Length = 349
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSAFFLHTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|402893678|ref|XP_003910018.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1 [Papio
anubis]
Length = 349
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|311247913|ref|XP_003122875.1| PREDICTED: putative nuclease HARBI1 [Sus scrofa]
Length = 349
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 129/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGSSL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVTNVTEALVERASQFIRFPADEASVQALKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCVVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L SWL+TP A +N
Sbjct: 205 QSSLSSQFEAG-------------MHKESWLLGDSSFFLRSWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKCSHIILACCVLHNISLEHG 306
>gi|348543892|ref|XP_003459416.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 346
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 124/281 (44%), Gaps = 23/281 (8%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S++ YI +L+ + S R L+ + V IALR ASG +VG A
Sbjct: 46 YRFSREGIIYITTLLDPYVKSS------TRRSRALTTAQTVCIALRFFASGTFLYAVGDA 99
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+G+S V + + AL++ + +P R + +K F G N GAID THI
Sbjct: 100 ENIGKSAVCRAIRKVYLALKQFLGVFVVFPSHVRPQVVKQGFFAIAGFPNVIGAIDCTHI 159
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P D+ +++ +S+ VQ + D ++ WPG ++ SR+ + SG L
Sbjct: 160 AIKAPPGPNEGDFVNRKGVHSINVQMVCDSMCHITNVEAKWPGSVHDSRIFRESGLCTLF 219
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
E G +G ++G GY + +TP+ +N+ TR+
Sbjct: 220 ERGA-YDG-------------ILLGDRGYACRRYFLTPFPDPN-PGPQTRYNAALARTRA 264
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
F QLK ++ L + PD + II+ C +LHNI
Sbjct: 265 CIEMTFGQLKERFQCLKGLRVAPD--RACDIIVACAVLHNI 303
>gi|395815624|ref|XP_003781325.1| PREDICTED: putative nuclease HARBI1 [Otolemur garnettii]
Length = 349
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 128/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVDLLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D M+ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASMQALKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + + +G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMYKDS-------------WFLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLYSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|58258447|ref|XP_566636.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|58261124|ref|XP_567972.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|58269606|ref|XP_571959.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134111525|ref|XP_775298.1| hypothetical protein CNBE0170 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|134116925|ref|XP_772689.1| hypothetical protein CNBK0630 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255307|gb|EAL18042.1| hypothetical protein CNBK0630 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257957|gb|EAL20651.1| hypothetical protein CNBE0170 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57222773|gb|AAW40817.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|57228195|gb|AAW44652.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230054|gb|AAW46455.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 445
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 83/314 (26%), Positives = 132/314 (42%), Gaps = 27/314 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVRED--LVSRPPSGLINIEGRLLSVEKQVAIALRRLAS- 116
+E F+ RV+ F + L+++ VS P N LL Q+ +AL RL
Sbjct: 72 DEDFRRKLRVNHTEFRKLLCLIKDHPVFVSHGPRKQANP---LL----QLTVALYRLGHC 124
Query: 117 --GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
S +G FGV + T + T R I+A+ ++++ WPD N + I FEE +
Sbjct: 125 GCAASTFEIGEQFGVSEGTSAIWTTRVIKAILSLERNNVYWPDENERKAIDRHFEEEEDI 184
Query: 175 LN-CCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
+ C G ID H+ PA + D+ + Y + GI DH R G+P +
Sbjct: 185 PDGCVGIIDGFHVPFAYKPARHDAVDFFSYKGRYGFNILGICDHLKRIRYFQYGYPASAH 244
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVR-----ISSEEVELREYIVGGVGYPLLSWLITPYE- 286
+R+ K F+ A + N + SE + EY++ +P W + ++
Sbjct: 245 DARIFKNCSLFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFKR 304
Query: 287 ---TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIIL 339
N L FN + R A+ LK W+ L K+ D+ I
Sbjct: 305 RRGQNDLDRPEAKFNKKCSSARVKIEHAYGILKNRWQSLRSLRVKIRNVRDEGVATCWIR 364
Query: 340 VCCLLHNIIIDSGD 353
C +LHN++ID+GD
Sbjct: 365 ACVVLHNLLIDTGD 378
>gi|134118537|ref|XP_772042.1| hypothetical protein CNBM2000 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50254648|gb|EAL17395.1| hypothetical protein CNBM2000 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 445
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 83/314 (26%), Positives = 132/314 (42%), Gaps = 27/314 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVRED--LVSRPPSGLINIEGRLLSVEKQVAIALRRLAS- 116
+E F+ RV+ F + L+++ VS P N LL Q+ +AL RL
Sbjct: 72 DEDFRRKLRVNHTEFRKLLCLIKDHPVFVSHGPRKQANP---LL----QLTVALYRLGHC 124
Query: 117 --GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
S +G FGV + T + T R I+A+ ++++ WPD N + I FEE +
Sbjct: 125 GCAASTFEIGEQFGVSEGTSAIWTTRVIKAILSLERNNVYWPDENERKAIDRHFEEEEDI 184
Query: 175 LN-CCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
+ C G ID H+ PA + D+ + Y + GI DH R G+P +
Sbjct: 185 PDGCVGIIDGFHVPFAYKPARHDAVDFFSYKGRYGFNILGICDHLKRIRYFQYGYPASAH 244
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVR-----ISSEEVELREYIVGGVGYPLLSWLITPYE- 286
+R+ K F+ A + N + SE + EY++ +P W + ++
Sbjct: 245 DARIFKNCSLFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFKR 304
Query: 287 ---TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIIL 339
N L FN + R A+ LK W+ L K+ D+ I
Sbjct: 305 RRGQNDLDRPEAKFNKKCSSARVKIEHAYGILKNRWQSLRSLRVKIRNVRDEGVATCWIR 364
Query: 340 VCCLLHNIIIDSGD 353
C +LHN++ID+GD
Sbjct: 365 ACVVLHNLLIDTGD 378
>gi|21553718|gb|AAM62811.1| unknown [Arabidopsis thaliana]
Length = 441
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 77/252 (30%), Positives = 121/252 (48%), Gaps = 9/252 (3%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWP-DS 158
L + VA+ L RLA G S ++ + + +S++T L + +K P
Sbjct: 141 LPADYAVAMVLSRLAHGCSAKTLASRYSLDPYLISKITNMVTRLLATKLYPEFIKIPVGK 200
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY-SMLVQGIVDHE 217
R+ E FEE L N CGAID+T + + + + Y ++L+Q + DH+
Sbjct: 201 RRLIETTQGFEELTSLPNICGAIDSTPVKLRRRTKLNPRNIYGCKYGYDAVLLQVVADHK 260
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
F D+ PGG + S + S +K +G + V I+ +R YIVG YPL
Sbjct: 261 KIFWDVCVKAPGGEDDSSHFRDSLLYKRLTSGDIVWEKV-INIRGHHVRPYIVGDWCYPL 319
Query: 278 LSWLITPYETNGLSASMP--TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
LS+L+TP+ NG S + P F+ + RS+ V+A LK W+IL + + P
Sbjct: 320 LSFLMTPFSPNG-SGTPPENLFDGMLMKGRSVVVEAIGLLKARWKILQGL--NVGVNRAP 376
Query: 336 SIILVCCLLHNI 347
I+ CC+LHN+
Sbjct: 377 QTIVACCVLHNL 388
>gi|387849125|ref|NP_001248435.1| putative nuclease HARBI1 [Macaca mulatta]
gi|355566570|gb|EHH22949.1| Putative nuclease HARBI1 [Macaca mulatta]
gi|355752181|gb|EHH56301.1| Putative nuclease HARBI1 [Macaca fascicularis]
gi|380786657|gb|AFE65204.1| putative nuclease HARBI1 [Macaca mulatta]
gi|383410017|gb|AFH28222.1| putative nuclease HARBI1 [Macaca mulatta]
Length = 349
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIRFPADEASIQALKDEFYGLAGMPGVM 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLGSQFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|57972702|ref|XP_565239.1| AGAP006957-PA [Anopheles gambiae str. PEST]
Length = 451
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 120/255 (47%), Gaps = 7/255 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSN 159
++ ++ +AL L +G+ S F + V F A+ + + + +P S
Sbjct: 147 MTARMRMGVALYVLGTGKDFESAATLFHLHSRAVRASVHLFCGAVNKLLRDRQIDFPLSR 206
Query: 160 RMEEIKSK-FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ K FEE G+ GAI HI + + + + + + S+++Q IVD
Sbjct: 207 KHIRCGVKEFEELVGIPQVFGAIGCLHIPIERGTDKDAAKYINSKGWSSIILQAIVDSRG 266
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
RF+D+ PG N + +L S ++ E +L+G + +++ + ++ YPLL
Sbjct: 267 RFLDVFCEHPGNTNAADMLIQSSIYQRME---QLDGPCQ-KIDKISVLPLLLSDGKYPLL 322
Query: 279 SWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
WLITPY T ++ + +FN R+ V+ F +L G W+ L++ +P I
Sbjct: 323 PWLITPYPTTAQMTPAERSFNVYAAKGRACIVRTFERLVGRWKALNRCSTMVAASCVPEI 382
Query: 338 ILVCCLLHNIIIDSG 352
IL CC+LHNI +G
Sbjct: 383 ILTCCILHNIAEQNG 397
>gi|302799607|ref|XP_002981562.1| hypothetical protein SELMODRAFT_15874 [Selaginella moellendorffii]
gi|300150728|gb|EFJ17377.1| hypothetical protein SELMODRAFT_15874 [Selaginella moellendorffii]
Length = 201
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 92/180 (51%), Gaps = 6/180 (3%)
Query: 64 KYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSV 123
+ F +S TF+++CS + LV + + + + K++AIA+ RLA+G S V V
Sbjct: 27 RRLFNMSPGTFEWLCSEL-APLVHKSDTNFRSS----IPPRKRIAIAIHRLAAGSSYVEV 81
Query: 124 GVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAID 182
F VG++TV T AL L +P D+ + + F L NCCG +
Sbjct: 82 SKMFAVGEATVLTCTREVYAALSTAFGSVLAFPTDAAALTRVARGFHNLMALPNCCGLLT 141
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
H+ + P+ ++ D N+S+ Q +VD MR ++I G+PGG++ +++LK SG
Sbjct: 142 VLHLRIRRPSGPCGSNYLDHNTNFSIAAQLVVDASMRILNIAVGFPGGVHHAKILKQSGL 201
>gi|301772494|ref|XP_002921667.1| PREDICTED: putative nuclease HARBI1-like [Ailuropoda melanoleuca]
gi|281340474|gb|EFB16058.1| hypothetical protein PANDA_010579 [Ailuropoda melanoleuca]
Length = 348
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ +L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGANL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA +++P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERATQFIRFPADEASVQALKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDYAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLNSHFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|72679318|gb|AAI00117.1| Harbi1 protein [Danio rerio]
Length = 349
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 126/283 (44%), Gaps = 29/283 (10%)
Query: 76 YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVS 135
Y+ L+++ L+ R R +S + Q+ AL SG Q +G A G+ Q+++S
Sbjct: 48 YLVELLKDSLLRRTQ------RSRAISPDVQILAALGFYTSGSFQSKMGDAIGISQASMS 101
Query: 136 QVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF----GLLNCCGAIDATHIIMTLP 191
+ +AL E+A + + R E K +F++ F G+ N G +D HI + P
Sbjct: 102 RCVSNVTKALIEKAPEFIGF---TRDEATKQQFKDEFYRIAGIPNVTGVVDCAHIAIKAP 158
Query: 192 AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQR 251
S + +++ +S+ Q + D + T WPG + + K S KL E
Sbjct: 159 NADDSS-YVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNVAKLFE---- 213
Query: 252 LNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKA 311
+E + +++G YPL WL+TP ++ A +N H T + +
Sbjct: 214 --------EQENDDEGWLLGDNRYPLKKWLMTPVQSPASPADY-RYNLAHTTTHEIVDRT 264
Query: 312 FLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
F ++ +R L +K + K II CC+LHNI + SG
Sbjct: 265 FRAIQTRFRCLDGAKGYLQYSPEKCSHIIQACCVLHNISLQSG 307
>gi|18402129|ref|NP_566626.1| PIF / Ping-Pong family of plant transposase [Arabidopsis thaliana]
gi|9294613|dbj|BAB02952.1| unnamed protein product [Arabidopsis thaliana]
gi|17979471|gb|AAL50072.1| AT3g19120/MVI11_3 [Arabidopsis thaliana]
gi|23506151|gb|AAN31087.1| At3g19120/MVI11_3 [Arabidopsis thaliana]
gi|332642674|gb|AEE76195.1| PIF / Ping-Pong family of plant transposase [Arabidopsis thaliana]
Length = 446
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 77/252 (30%), Positives = 120/252 (47%), Gaps = 9/252 (3%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWP-DS 158
L + VA+ L RLA G S ++ + + +S++T L + +K P
Sbjct: 146 LPADYAVAMVLSRLAHGCSAKTLASRYSLDPYLISKITNMVTRLLATKLYPEFIKIPVGK 205
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY-SMLVQGIVDHE 217
R+ E FEE L N CGAID+T + + + + Y ++L+Q + DH+
Sbjct: 206 RRLIETTQGFEELTSLPNICGAIDSTPVKLRRRTKLNPRNIYGCKYGYDAVLLQVVADHK 265
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
F D+ PGG + S + S +K +G + V I+ +R YIVG YPL
Sbjct: 266 KIFWDVCVKAPGGEDDSSHFRDSLLYKRLTSGDIVWEKV-INIRGHHVRPYIVGDWCYPL 324
Query: 278 LSWLITPYETNGLSASMP--TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
LS+L+TP+ NG S + P F+ + RS+ V+A LK W+IL + P
Sbjct: 325 LSFLMTPFSPNG-SGTPPENLFDGMLMKGRSVVVEAIGLLKARWKILQSL--NVGVNHAP 381
Query: 336 SIILVCCLLHNI 347
I+ CC+LHN+
Sbjct: 382 QTIVACCVLHNL 393
>gi|432849892|ref|XP_004066664.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 344
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 133/291 (45%), Gaps = 30/291 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
F + Y+ L+RE L R R +S E QV AL SG Q S+G
Sbjct: 39 FSFPRDFILYLVELLREALGRR------TQRSRAISPEVQVLAALGFYTSGSFQTSMGDI 92
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATH 185
G+ Q+++S+ +AL E+A + + + + E+ +F+ G+ G +D
Sbjct: 93 IGISQASMSRCVSNVTKALVEKAPQFITFDREPSSREQSLQEFQRVAGIPGVLGVLDCVQ 152
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
+ + P + S + +++ +S+ Q + D + T WPGG++ + +L+ S +K
Sbjct: 153 VAIKAPNSEDS-SYVNKKGFHSVACQLVCDARGLLLSAETHWPGGLHDTEVLERSALYKQ 211
Query: 246 CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATR 305
+ SEE +++G YPL WL+TP E S + +N H AT
Sbjct: 212 LQ-----------DSEE----GWLLGDGRYPLRKWLMTPVEAPE-SPAECRYNLAHTATH 255
Query: 306 SLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ + F ++ +R L + + P++ SI+L CC+LHN+ + SG
Sbjct: 256 EIVDRTFRAIQTRFRCLDGTKGYLQYSPERSS--SILLACCVLHNVSLQSG 304
>gi|308321290|gb|ADO27797.1| putative nuclease harbi1 [Ictalurus furcatus]
Length = 348
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 130/292 (44%), Gaps = 30/292 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
F ++ Y+ L+++ L+ R R +S + Q+ AL SG Q +G A
Sbjct: 39 FGFPREFIYYLVELLKDGLLRRTQ------RSRAISPDVQILAALGFYTSGSFQSKMGDA 92
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF----GLLNCCGAID 182
GV Q+++S+ +AL E+A + + NR E K + +E F G+ N G +D
Sbjct: 93 IGVSQASMSRCVSNVTKALIEKAPEFIGF---NRDEATKQQTKEEFYRVAGIPNVIGVVD 149
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
HI + P + S + +++ +S+ Q + D + + T WPG + + S
Sbjct: 150 CAHIAIKAPNAEDSS-YVNKKGFHSINCQLVCDARGLLLSVETHWPGSLTDRAVFNQSSV 208
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
KL E E+V +++G YPL WL+TP N + + +N H
Sbjct: 209 CKLFE-------------EQVTHEGWLLGDNRYPLRKWLMTPV-PNPETPAGYRYNLAHM 254
Query: 303 ATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
T + + F L+ +R L +K + K II CC+LHN+ + SG
Sbjct: 255 TTHEIVDRTFRALQTRFRCLDGAKGYLQYSPEKCSHIIQACCVLHNVSLQSG 306
>gi|302760193|ref|XP_002963519.1| hypothetical protein SELMODRAFT_26120 [Selaginella moellendorffii]
gi|300168787|gb|EFJ35390.1| hypothetical protein SELMODRAFT_26120 [Selaginella moellendorffii]
Length = 201
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 91/180 (50%), Gaps = 6/180 (3%)
Query: 64 KYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSV 123
+ F +S TF+++CS + LV + + + + K++AIA+ RLA+G S V V
Sbjct: 27 RRLFNMSPGTFEWLCSEL-APLVHKSDTNFRSS----IPPRKRIAIAIHRLAAGSSYVEV 81
Query: 124 GVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAID 182
F VG++TV T AL L +P D+ + + F L NCCG +
Sbjct: 82 SKMFAVGEATVLTCTREVYAALSTAFGSVLAFPTDAAALTRVARGFHNLMALPNCCGLLT 141
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
H+ + P+ + D N+S+ Q +VD MR ++I G+PGG++ +++LK SG
Sbjct: 142 VLHLRIRRPSGPCGSSYLDHNTNFSIAAQLVVDASMRILNIAVGFPGGVHHAKILKQSGL 201
>gi|432090387|gb|ELK23813.1| hypothetical protein MDA_GLEAN10019053 [Myotis davidii]
Length = 349
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 76/280 (27%), Positives = 125/280 (44%), Gaps = 24/280 (8%)
Query: 76 YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVS 135
Y+ L+ L SRP R +S E Q+ AL SG Q +G A G+ Q+++S
Sbjct: 48 YLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMS 101
Query: 136 QVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
+ EAL ERA +++P D ++ +K +F G+ G +D H+ + P +
Sbjct: 102 RCVANVTEALVERASQIIRFPADEASIQALKDEFYGLAGMPGVIGVVDCIHVAIKAPNAE 161
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG 254
+ +++ +S+ + D + + T WPG + +L+ S EAG +
Sbjct: 162 DL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDYAVLQQSSLSSQFEAGMHKDS 220
Query: 255 NVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQ 314
+++G + L +WL+TP + A +N H AT S+ K F
Sbjct: 221 -------------WLLGDSSFFLRAWLMTPLHIHETPAEY-RYNVAHSATHSVIEKTFRT 266
Query: 315 LKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 267 LCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|241573389|ref|XP_002403173.1| transposase, putative [Ixodes scapularis]
gi|215500182|gb|EEC09676.1| transposase, putative [Ixodes scapularis]
Length = 275
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 117/252 (46%), Gaps = 19/252 (7%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+ VE +V + L L + ES V FGV + + V + IE A ++W +
Sbjct: 25 IPVETKVLMTLWLLGNQESFRGVVDRFGVNKGVLFYVANQMIETWANLAVDFIRWL--MQ 82
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
++ + F + GA+D HI + P + D + +++ +S+++QG + EM F
Sbjct: 83 LQRVSRAFTRKWRFPGVVGAVDGCHIAIKAPE-EEQDAYYNRKEFHSIILQGCCNSEMVF 141
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
+ G PG M+ +R+ SG ++ + + L +++G Y L
Sbjct: 142 THVHVGSPGRMHDARVFSISGLDEIID--------------NLPLSLHVLGDSAYSLRLG 187
Query: 281 LITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
L+ PY NG LS TFN A RS+ +AF QLKG +R L K + + +L
Sbjct: 188 LMRPYRNNGHLSEEQVTFNETLSAARSVIERAFAQLKGKFRRL-KYLDMEATEMMSKYVL 246
Query: 340 VCCLLHNIIIDS 351
C+LHNII+ S
Sbjct: 247 ASCVLHNIILQS 258
>gi|340376510|ref|XP_003386775.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 378
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 146/315 (46%), Gaps = 21/315 (6%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
++ +++F R+ +F+Y+ +LV + R +I S +++A+ LR LA+GE
Sbjct: 19 DQVSYRHFLRMDINSFEYLLTLVAPKIRRRDTHMRKSI-----SPGERLALTLRFLATGE 73
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEIKSKFEESFGLLNC 177
S S+ + V T+ ++ EA+ E ++K P + + +EI +F++ + L+C
Sbjct: 74 SYSSLQYLYRVPCQTIGKIVIETCEAIVEVLADYMKVPKTQSEWKEIAIEFDQKWNFLHC 133
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GA+D HI + P + + + + +S++ +VD +F+ + G G ++ +
Sbjct: 134 LGALDGKHINIR-PPPSSGSYYYNYKQRFSIVFLALVDANYKFLYVDIGCNGRVSDGGVF 192
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEV------ELREYIVGGVGYPLLSWLITPYETNGLS 291
+ S L A Q N + + EV + IV +PL ++ PY GL+
Sbjct: 193 RESS---LSSAFQ--NNTLDVPPLEVLPGCTTPIPYIIVADEAFPLKDYIQKPYRQTGLT 247
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNIIID 350
+N R + AF L +R+L + + P+ K+ ++ L CCLLHN +
Sbjct: 248 TERRIYNYRLSRARRVVENAFGILANHFRVLMTAINLAPE--KVETLTLTCCLLHNYLRC 305
Query: 351 SGDQLHPDVALSDHH 365
+ P +L H
Sbjct: 306 HTEAYTPPGSLDTEH 320
>gi|357475335|ref|XP_003607953.1| hypothetical protein MTR_4g085850 [Medicago truncatula]
gi|355509008|gb|AES90150.1| hypothetical protein MTR_4g085850 [Medicago truncatula]
Length = 399
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 130/293 (44%), Gaps = 20/293 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F F ++ TF+++ +L+ L R P+ L+ + L+ ++ I L RLASG
Sbjct: 80 FPTTFLMTSSTFEWLTNLLEPLLECRDPAYLLPLN---LTAGVRLGIGLFRLASGSDYQQ 136
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
+ F V S + L + + +P++N I FE GL NC G +
Sbjct: 137 IANQFNVTVSVAKFCVKQLCRVLCTNFRFWVSFPNAND-RSILQNFESISGLPNCSGVVF 195
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
++ Q + Q+ + S+ Q +VD R + I G+ G +LK S
Sbjct: 196 SSRF-------QIAPSTSPQQPHSSIAAQIVVDSTCRILSIAAGYFGHKTDYTILKASSL 248
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASM-PTFNSL 300
F E G LN + +Y++G GYPLL WL+ P+ N ++ S+ TFN+
Sbjct: 249 FNDIEEGSLLNA-----PSVNGVNQYLIGDSGYPLLPWLMVPFADNVCVTGSVEETFNAA 303
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
H R A K L+ W +LSK + R + + + I C +LHN ++ D
Sbjct: 304 HGLMRIPAFKTDASLR-KWGVLSKPV-REEIKMAVAYIGACSILHNSLLMRED 354
>gi|58270796|ref|XP_572554.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57228812|gb|AAW45247.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 428
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 131/314 (41%), Gaps = 27/314 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVRED--LVSRPPSGLINIEGRLLSVEKQVAIALRRLAS- 116
+E F+ V+ F + L+++ VS P N LL Q+ +AL RL
Sbjct: 72 DEDFRRKLHVNHTEFRKLLCLIKDHPVFVSHGPRKQANP---LL----QLTVALYRLGHC 124
Query: 117 --GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
S +G FGV + T + T R I+A+ ++++ WPD N + I FEE +
Sbjct: 125 GCAASTFEIGEQFGVSEGTSAIWTTRVIKAILSLERNNVYWPDENERKAIDRHFEEEEDI 184
Query: 175 LN-CCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
+ C G ID H+ PA + D+ + Y + GI DH R G+P +
Sbjct: 185 PDGCVGIIDGFHVPFAYKPARHDAVDFFSYKGRYGFNILGICDHLKRIRYFQYGYPASAH 244
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNV-----RISSEEVELREYIVGGVGYPLLSWLITPYE- 286
+R+ K F+ A + N + SE + EY++ +P W + ++
Sbjct: 245 DARIFKNCSLFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFKR 304
Query: 287 ---TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIIL 339
N L FN + R A+ LK W+ L K+ D+ I
Sbjct: 305 RRGQNDLDRPEAKFNKKCSSARVKIEHAYGILKNRWQSLRSLRVKICNVRDEGVATCWIQ 364
Query: 340 VCCLLHNIIIDSGD 353
C +LHN++ID+GD
Sbjct: 365 ACVVLHNLLIDTGD 378
>gi|449665223|ref|XP_004206095.1| PREDICTED: uncharacterized protein LOC101236818 [Hydra
magnipapillata]
Length = 336
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 139/324 (42%), Gaps = 29/324 (8%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
EE FK FRV++ TF++ LV E P G R +SV K+VA+AL LAS
Sbjct: 2 EECEFKEHFRVNRNTFNF---LVNE---LHPHLGQTTTTMREPISVVKRVAVALHYLASC 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEESFGLL 175
E V FG+G+ST + + FI A+ + ++K+P S + FE G
Sbjct: 56 EEYRVVSSLFGIGKSTANLIAHEFINAVNDILLPKYVKFPLSVENLNNHSRDFEAILGFP 115
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENN---YSMLVQGIVDHEMRFIDIVTGWPGGMN 232
C GA+D +I P + + D N YS+++ IVD RFI G PG N
Sbjct: 116 QCFGAVDGCYI----PILASKDQAISYYNYKGWYSIVLFSIVDCRYRFIYTSVGSPGKNN 171
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLS 291
S +L S + E+ + V L ++ PL L+ PY E + LS
Sbjct: 172 DSYILHNSSLKAILESSLFDKCCKELGDSLVPL--CLISDSASPLTRHLLKPYPENSELS 229
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN + R + AF ++K + ++ K M D I+ C LHNI
Sbjct: 230 EIQKNFNKILYGARRVVENAFGRVKARFGVICKRM-ECDINFATRIVNACVTLHNIC--- 285
Query: 352 GDQLHPDVA----LSDHHDSGYGE 371
+ + D+ L HHD +
Sbjct: 286 --EYYDDIIIIEWLMHHHDDSLAQ 307
>gi|328697775|ref|XP_001946487.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 412
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 140/302 (46%), Gaps = 15/302 (4%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLAS 116
++ E F FFR+S F Y+ +L+++D+ + N + R + E+++A+ LR LA+
Sbjct: 59 NDPEKFHQFFRMSYDQFSYLHNLIKDDIKKQ------NTQFRAPVPSEQRLAVCLRFLAT 112
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFGLL 175
GES S+ +F +G ST ++ E + + + ++ P + + I +++E +
Sbjct: 113 GESYRSLAFSFRLGFSTTREIVEEVCEVIWKTLRPIYMPKPTKDDWQNISREYKEIWNFP 172
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC G++D HI + P + + + + S+++ +VD RFI I G G +
Sbjct: 173 NCIGSLDGKHINIQCP-INGGSAYFNYKGVNSIVLLALVDAHYRFITIDVGSYGRNSDGN 231
Query: 236 LLKFSGFFKLCEAGQ-RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE--TNGLSA 292
+ S K E + + I + IV +PL +L+ PY T G +
Sbjct: 232 VFAKSALGKALENDTLDVPPDTPIEENGDPMPYVIVADEAFPLKPYLMRPYSRVTLGGNE 291
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN R + AF L WR+ + + +P + + +I+L C LHN++ S
Sbjct: 292 GNKIFNYRLSRARRVVENAFGILSNRWRVFRTNIQVQP--KSVDNIVLAACCLHNMLCQS 349
Query: 352 GD 353
D
Sbjct: 350 HD 351
>gi|395543725|ref|XP_003773764.1| PREDICTED: putative nuclease HARBI1 [Sarcophilus harrisii]
Length = 348
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 129/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMAMYGFPRQFIYYLVDLLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G G+ Q+++S+ EAL ERA + +P D ++ +K +F G+
Sbjct: 86 QTRMGDTIGISQASMSRCVANVTEALVERASQFIHFPADETSLQSLKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D +++ T WPG + +L+
Sbjct: 146 GVVDCIHVGIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALLNVETHWPGSLQDWAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG +++G + L +WL+TP G A +N
Sbjct: 205 QSALRSQFEAGMHKGC-------------WLLGDSAFFLRTWLMTPLHIPGTPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT ++ K F ++ +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHNVIEKTFRTIQSRFRCLDGSKGALQYSPEKASHIILACCVLHNISLEHG 306
>gi|51230557|ref|NP_001003734.1| putative nuclease HARBI1 [Danio rerio]
gi|82181968|sp|Q6AZB8.1|HARB1_DANRE RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|50603827|gb|AAH78390.1| Harbinger transposase derived 1 [Danio rerio]
Length = 349
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 129/292 (44%), Gaps = 29/292 (9%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
F ++ Y+ L+++ L+ R R +S + Q+ AL SG Q +G A
Sbjct: 39 FGFPREFIYYLVELLKDSLLRRTQ------RSRAISPDVQILAALGFYTSGSFQSKMGDA 92
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF----GLLNCCGAID 182
G+ Q+++S+ +AL E+A + + R E K +F++ F G+ N G +D
Sbjct: 93 IGISQASMSRCVSNVTKALIEKAPEFIGF---TRDEATKQQFKDEFYRIAGIPNVTGVVD 149
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
HI + P S + +++ +S+ Q + D + T WPG + + K S
Sbjct: 150 CAHIAIKAPNADDSS-YVNKKGFHSINCQLVCDARGLLLSAETHWPGSLTDRAVFKQSNV 208
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
KL E +E + +++G YPL WL+TP ++ A +N H
Sbjct: 209 AKLFE------------EQENDDEGWLLGDNRYPLKKWLMTPVQSPESPADY-RYNLAHT 255
Query: 303 ATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
T + + F ++ +R L +K + K II CC+LHNI + SG
Sbjct: 256 TTHEIVDRTFRAIQTRFRCLDGAKGYLQYSPEKCSHIIQACCVLHNISLQSG 307
>gi|432950594|ref|XP_004084518.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 346
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 133/297 (44%), Gaps = 25/297 (8%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S ++ Y +L++ + + I+ GR L+ E+ + ALR A+G ++G A
Sbjct: 47 YRFSLQSITYTQNLIQPFITN------ISHRGRALTSEQILCAALRFFANGSFLYNIGDA 100
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ ++TV + + AL+ + +P + +IK +F G N ID THI
Sbjct: 101 EHISKATVCRAVRKVCLALKRFLHIFIMFPGHKPLRDIKEEFHRIAGFPNVVECIDGTHI 160
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P + D+ ++++ +S+ VQ I D ++ WPG ++ SR+ + S
Sbjct: 161 PIIAPT-ENEADYVNRKSIHSINVQIICDAAHIITNVEAKWPGYVHDSRIFRESTLSNRL 219
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
E G E+ +++G GYP L+TPY FN H T +
Sbjct: 220 EYG--------------EIDGFLLGDRGYPCQPKLLTPYPEPE-QGPQQRFNWAHSRTGA 264
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
LK ++ L + P++ II+ C +LHNI I G+Q HP + + D
Sbjct: 265 RVEMTIGLLKACFQCLRHLRVTPERAC--DIIVACVVLHNIAILRGEQ-HPALHIQD 318
>gi|346473213|gb|AEO36451.1| hypothetical protein [Amblyomma maculatum]
Length = 423
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 132/297 (44%), Gaps = 18/297 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE----GRLLSVEKQVAIALRRLASGE 118
F F R+S + FD++ SLV+ LI ++ LS +++AI LR LASG
Sbjct: 68 FYNFVRMSPQQFDFLESLVKP---------LIQVQETPLRESLSSAERLAITLRFLASGN 118
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESFGLLNC 177
S S+ +F VG+ST+SQ+ A+ + K L+ PD +++ I +F + NC
Sbjct: 119 SYQSLAYSFRVGKSTISQLVPEVCIAIWKVLKPRVLRQPDHRQLKAIADEFMWKWNFPNC 178
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GAID HI + P + + + + ++S+ + D RF + G G + +
Sbjct: 179 VGAIDGKHIHLDAPP-NSGSMYFNYKRSFSINLMASCDAGYRFTSVYIGAFGSESDGGVF 237
Query: 238 KFSGFFKLCE-AGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE-TNGLSASMP 295
G ++ E A Q V +S+ EL +V +PL L+ PY +
Sbjct: 238 SACGLGEIVESAPQTFPSPVALSNNGPELPFVMVADDAFPLKHNLMKPYPGVQPPNTPKR 297
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+N R AF L WR+L M K + ++ CC LHN ++ G
Sbjct: 298 VYNYRLSRARRCIENAFGILCARWRVLRNRMSLLPKNAV-AVSAACCCLHNYLMQEG 353
>gi|340368843|ref|XP_003382960.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 368
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 122/255 (47%), Gaps = 10/255 (3%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER--AKHHLKWPDSNRMEEI 164
V I L LA +FG+ +++VS + R + + K ++K P + E++
Sbjct: 96 VGIMLYYLADESRLRKTANSFGLSRASVS-IIIRLVTTVISLYLGKKYIKLPVTE--EDV 152
Query: 165 KSK---FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
+ K F +F + C GA+D THI + P +D + ++++ YS+ VQ D+ FI
Sbjct: 153 QDKVKGFYNAFNIPQCIGAVDGTHIEIKQPHSNPTD-YINRKSRYSLNVQACCDYRYCFI 211
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
D+V WPG ++ +R+ S ++ + RI E+ + Y++G YPL+ ++
Sbjct: 212 DVVVKWPGSVHDARMFANSKLNQMLKNEMIPPCKRRIDDGEIPVPVYLLGDPAYPLMPYI 271
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
+ Y + G + F + R++ +F +LK + L + M + +P +I C
Sbjct: 272 MKEYTSGGTTPREQYFGYKLCSARNVIECSFGRLKARFSCLKRAM-DININDIPMVIYAC 330
Query: 342 CLLHNIIIDSGDQLH 356
+LHN + + +H
Sbjct: 331 FVLHNFCELNNEAIH 345
>gi|328696997|ref|XP_003240200.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 356
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 144/309 (46%), Gaps = 26/309 (8%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E FK FR+SK+T I L+ + S ++ +++ +ALR A+G
Sbjct: 40 DEHDFKTRFRLSKETVLMILDLIGPSISSNTD------RNNAVTTTQKLLLALRFYATGS 93
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN-- 176
+S G GV +ST + AL + +K P++N +EIK ++ +G+
Sbjct: 94 FLISAGDVVGVSKSTACVIVRDVSVALAQLRPQFIKMPETN--DEIKELQKQFYGIAKFP 151
Query: 177 -CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
GAID THI + P ++ + +++ +S+ VQ +V +++ +DIV WPG + S
Sbjct: 152 LVIGAIDCTHIKIQSPGGPNAEYFRNRKGWFSLNVQTVVSAKLKIMDIVVRWPGSTHDST 211
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ S ++N ++ + E+ IV GY ++TP+ N +
Sbjct: 212 IFSRS----------KINNDLHV--EQKWGNSLIVADSGYANTKHIVTPF-LNPQAGPEN 258
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
+N TR+ + + LK + +LS M R + ++I+ C +LHNI ID D +
Sbjct: 259 LYNESQIRTRNPVERCYGVLKRRFPVLSLGM-RLQISNIQNVIIACSVLHNIAIDCNDVM 317
Query: 356 -HPDVALSD 363
DV L D
Sbjct: 318 PMDDVQLPD 326
>gi|270015185|gb|EFA11633.1| hypothetical protein TcasGA2_TC000011 [Tribolium castaneum]
Length = 561
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/324 (24%), Positives = 146/324 (45%), Gaps = 28/324 (8%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E FK FR++K T ++ ++ + + + + LS + Q+ IALR A+G
Sbjct: 248 DEHAFKKRFRLNKATVMHLVDIIGDRVAPKTQ------RNKSLSAQTQMLIALRFYATGG 301
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNC 177
+G V +S + +V R + + H++K P + + K KF G
Sbjct: 302 FLELLGDWIHVHKSNICRVIQRVTHEIARLSPHYIKMPRTAEELMATKRKFFRICGFPRV 361
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GAID TH+ + P ++ + +++ +S+ VQ I D +++ + I++ WPG ++ S +
Sbjct: 362 VGAIDCTHVGIQSPGGANAELYRNRKGYFSINVQTICDADLKLLHIISRWPGSVHDSTIF 421
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
S L + R+ R Y++G GYP +L+ P N +AS +
Sbjct: 422 NNSP----------LPVDFRLG------RGYLLGDGGYPCQQYLLIPVR-NPNNASQEAY 464
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
N + TR+ + F LK + +L K R + +I+ C +L+NI + D +
Sbjct: 465 NRAYIKTRNTIERFFGILKRRFPLL-KSGLRLKIDTIVQVIVACGVLYNICKERNDHIEE 523
Query: 358 ---DVALSDHHDSGYGEQCCKQVD 378
+V +D D Y E V+
Sbjct: 524 YFDEVMENDEEDFIYQENNINNVN 547
>gi|340382450|ref|XP_003389732.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 423
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/319 (26%), Positives = 140/319 (43%), Gaps = 20/319 (6%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
D+EE + RV+K+TFD +C LV L R N + S +Q+A+ LR LASG
Sbjct: 52 DDEEAHVRYLRVTKETFDVLCRLVAPYLQKR--RSYTNKRPHI-SPGEQLAMTLRFLASG 108
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDS-NRMEEIKSKFEESFGLL 175
+SQ S+ +F VG+++V + ++ L + K +L +P + + ++I +F +
Sbjct: 109 DSQTSISYSFRVGKASVCHIIYKTCCVLWKVLHKKYLPFPSTEDEWKKISHQFWMLWQFP 168
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GAID H+ + P + + + + +S+++ I D RFI + G GG
Sbjct: 169 NCLGAIDGKHVRIQAPN-NSGSMFFNYKGTFSIVLMAICDAHYRFIMVDIG-EGGKESD- 225
Query: 236 LLKFSGFFKLCEAGQRLNGNV-------RISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G F C GQRL + V VG +PL L+ P+ +
Sbjct: 226 ----GGVFSNCTFGQRLMEQSLQLPLPNALPGTSVVAPYVFVGDEAFPLRPDLLRPFPGS 281
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
LS FN R L +F L +RI + + K+ + + +LHN +
Sbjct: 282 NLSEKQSVFNYRLSRARRLIENSFGILASRFRIYRQPI-HSSPAKVVAYVKATIVLHNYL 340
Query: 349 IDSGDQLHPDVALSDHHDS 367
++ +D D+
Sbjct: 341 RLHESHVYCPPCFTDREDN 359
>gi|328698115|ref|XP_003240546.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 241
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 95/197 (48%), Gaps = 13/197 (6%)
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
+ WP+ ++E+IK+ F N GAID THI + P + + +++ ++S+ +Q
Sbjct: 7 ITWPNEEKVEKIKNGFFSYSTFPNVFGAIDGTHINILAPH-DHQEAYVNRKGHHSIQLQA 65
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+ DH+ RFI G G ++ R+ F+L E L+ + + +++G
Sbjct: 66 VCDHKCRFIHCYAGNVGSVHDQRV------FRLSELKNYLDDATKFPNNT-----HLIGD 114
Query: 273 VGYPLLSWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDK 331
Y L L+ PY NG L+ FN H + R ++F LKG +R L +
Sbjct: 115 SAYTLHEHLMVPYRDNGHLTQKQKNFNFCHSSARMAIERSFGFLKGRFRSLLTTLDMKRV 174
Query: 332 RKLPSIILVCCLLHNII 348
+P I+ CC+LHNII
Sbjct: 175 DLIPKYIIACCILHNII 191
>gi|115496966|ref|NP_001069136.1| putative nuclease HARBI1 [Bos taurus]
gi|119368340|sp|Q17QR8.1|HARB1_BOVIN RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|109659164|gb|AAI18218.1| Harbinger transposase derived 1 [Bos taurus]
gi|296479654|tpg|DAA21769.1| TPA: putative nuclease HARBI1 [Bos taurus]
gi|440903445|gb|ELR54100.1| Putative nuclease HARBI1 [Bos grunniens mutus]
Length = 349
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 128/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA + +P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPADEASVQALKDEFYGLAGIPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCMHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAG-------------MHKESWLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|426245379|ref|XP_004016489.1| PREDICTED: putative nuclease HARBI1 [Ovis aries]
Length = 349
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 128/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA + +P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPGDEASVQALKDEFYGLAGIPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCMHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAG-------------MHKESWLLGDSSFFLRTWLMTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K F L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHSVIEKTFRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|328713516|ref|XP_003245100.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 358
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 137/293 (46%), Gaps = 24/293 (8%)
Query: 55 IPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
I +DEE FKY FR+SK DY+ SL+ L + + +S +V I LR
Sbjct: 34 ILTDEE--FKYRFRLSKSCIDYLLSLLEGKLKTATDRNMT------MSPMNKVLITLRFY 85
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
A G +SV FGV S S+ + E + ++ P N +++ K K E
Sbjct: 86 ALGTMLISVADMFGVSISCASKTIRDVSYVIAELSSIFIQIPVHN-IKDTKMKMYEIARF 144
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
GAID TH+ + P S+ + +++ +S+ VQ +V+ E++F+DIV PG + S
Sbjct: 145 PLVFGAIDCTHVRIQSPGGHDSEMFRNRKGYFSLNVQALVNSELQFMDIVARCPGSAHDS 204
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+ + S F E+G+ + I+G GY L +L+T +N +
Sbjct: 205 HIFRNSRLFARLESGEF-------------QKMAILGDSGYALKPYLLTSI-SNPVGRIQ 250
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
+N TR++ ++F K + +LS + R + + +II+ +LHNI
Sbjct: 251 MLYNESQIRTRNVVERSFGVWKRRFPVLS-LGLRLQLKTVQAIIVATAVLHNI 302
>gi|255583084|ref|XP_002532309.1| conserved hypothetical protein [Ricinus communis]
gi|223527978|gb|EEF30061.1| conserved hypothetical protein [Ricinus communis]
Length = 420
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 166/362 (45%), Gaps = 42/362 (11%)
Query: 41 PSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL 100
PSF+ + S+ SD + + FFR+SK TF + SLV L+S PS +
Sbjct: 84 PSFFHRFVSSAH----SDLDPRWHEFFRMSKPTFLNLLSLVSPSLLSFLPSSI------- 132
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFI-EALEERAKHHLKW-PDS 158
+ VA L RLA G S + FG+ S + + + + + L E + + + D
Sbjct: 133 -PPDFAVAATLFRLAHGASYEASARRFGLDSSAAACLAFYSVCKVLNENLANLVDFGSDV 191
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
R+ I+ SF NCCG + + + N SMLVQ +VD E
Sbjct: 192 GRI--IEGFCWISFP--NCCGVLGFGKFAIDSDVLG---------KNGSMLVQALVDSEG 238
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
RF+DI GWP M + + + + + + L+G ++ + +YI+G +PL
Sbjct: 239 RFLDISAGWPCTMKPNSIFTQTKLYSRIQ--ELLSGPCCQLNDGNSIPQYILGDSCFPLF 296
Query: 279 SWLITPY----ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK- 333
WL+TPY E + ++ FN+ H L AF ++K W++L++ W+ + +
Sbjct: 297 PWLLTPYIRPNEEDSFGSAEREFNAAHNRAMRLVNTAFGRVKARWQLLAR-RWKEECVEF 355
Query: 334 LPSIILVCCLLHNIIIDSGDQLHPD-VALSDHHDS--GYGEQCCKQVDPMGRTTRENLEK 390
P +I++ CLLHN +I + + + V S+ + +G + D G+ R L K
Sbjct: 356 FPFVIVMGCLLHNFLIKCSEPVAEESVGCSEVEEELPVFG----GEADERGQRIRVALAK 411
Query: 391 HL 392
HL
Sbjct: 412 HL 413
>gi|432891478|ref|XP_004075569.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 303
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 113/239 (47%), Gaps = 13/239 (5%)
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHL-KWPDSNRMEEIKSKFEES 171
R S E +V + FGV ++TV + + F + + +HL K P + I +FEE
Sbjct: 13 RRRSCEYKV-IANKFGVHKTTVKKFVYSFCKWMVSSVINHLIKVPTAKDARAIARRFEEK 71
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
F + G ID THI + LP + +++ S ++Q + D RF +I PG
Sbjct: 72 FCIPQVVGCIDRTHIPV-LPPSDGFKHFVNRKRWPSYVLQAVADDMCRFWNINCQMPGST 130
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GL 290
+ +L+ S + +A IS V L +++G YPLL WLI+ Y + L
Sbjct: 131 ADADVLRQSALYN--QAHMLPQEPKEISGTSVNL--FLLGDAAYPLLDWLISDYTPSPHL 186
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK--LPSIILVCCLLHNI 347
+A +FN+ + R+ AF +LK WR+L + + D +P +I CC LHN
Sbjct: 187 TAEQESFNAYLRSARTTVDIAFGKLKSRWRVLLR---KCDFHYTFIPYVIATCCALHNF 242
>gi|126332706|ref|XP_001369586.1| PREDICTED: putative nuclease HARBI1 [Monodelphis domestica]
Length = 348
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 130/296 (43%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMAMYGFPRQFIYYLVDLLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G G+ Q+++S+ EAL ERA + +P D ++ +K +F G+
Sbjct: 86 QTRMGDTIGISQASMSRCVANVTEALVERASQFIHFPTDETSIQNLKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D +++ T WPG + +++
Sbjct: 146 GVVDCIHVGIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGSLLNVETNWPGSLQDWAVVQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP G A +N
Sbjct: 205 QSALRSQFEAGMHKDC-------------WLLGDGSFFLRTWLMTPLHIPGTPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT ++ K F ++ +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 MAHSATHNVIEKTFRTIQSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|357162328|ref|XP_003579375.1| PREDICTED: uncharacterized protein LOC100828950 [Brachypodium
distachyon]
Length = 443
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 82/307 (26%), Positives = 133/307 (43%), Gaps = 49/307 (15%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQ-STVSQVTWRFIEALEERAKHHLKWPDSN 159
L + ++A AL RLA G +V FG+ + ++ + I A+ +R L +
Sbjct: 162 LPADHKLAAALYRLAHGAPLRAVARRFGLATPAMAARAFYEVIRAIADRLATLLDLAAPD 221
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
R+ F + L NCCGA+ + ++ Q +VD E R
Sbjct: 222 RISRAVPGFC-ALSLPNCCGALGYARF----------------GADGAVAAQALVDAECR 264
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFK-----LCEA--GQRLNGNVRISSEEVELREYIVGG 272
F+D+ GW M +L+ S + L A G+ + G+V Y +G
Sbjct: 265 FLDVSAGWDPSMPAPDILRRSKLYASQSRVLANAPHGELIGGSVP---------RYFLGP 315
Query: 273 VGYPLLSWLITPYE--TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
PLL WL+TPY+ +G+S FN +H L +AF ++ W++L + W+ +
Sbjct: 316 ACVPLLPWLVTPYKDAADGMSKES-IFNGVHAHGARLVERAFGHVRARWKLLGE-SWKGE 373
Query: 331 -KRKLPSIILVCCLLHNIIIDSGDQLHPDV----ALSDHHDSGYGEQCCKQVDPMGRTTR 385
+ LP +++ CLLHN +I G+ + PD A +D GE+ D G R
Sbjct: 374 CQEALPYVVVAGCLLHNFLIKCGEPM-PDADGVPAEADVFVDFEGER-----DREGERIR 427
Query: 386 ENLEKHL 392
+ L HL
Sbjct: 428 DVLAAHL 434
>gi|326920414|ref|XP_003206469.1| PREDICTED: putative nuclease HARBI1-like [Meleagris gallopavo]
Length = 348
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 125/286 (43%), Gaps = 33/286 (11%)
Query: 76 YICSLVREDLV----SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQ 131
+IC LV DL+ SRP R +S E QV AL SG Q +G A G+ Q
Sbjct: 45 FICYLV--DLLGASLSRPTQ-----RSRAISPETQVLAALGFYTSGSFQTRMGDAIGISQ 97
Query: 132 STVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTL 190
+++S+ EAL ERA + +P D + +K F G+ G +D TH+ +
Sbjct: 98 ASMSRCVANVTEALVERAPRFIHFPEDEAAVRSLKDDFYALAGMPGVLGVVDCTHVAIKA 157
Query: 191 PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ 250
P + + +++ +S+ + D + T WPG M +L+ +
Sbjct: 158 PNAEDL-SYVNRKGLHSLNCLMVCDARGALLSAETHWPGSMPDCTVLQQAAL-------- 208
Query: 251 RLNGNVRISSEEVELRE--YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLA 308
S E EL + +++G + L +WL+TP A +N H AT ++
Sbjct: 209 -------TSQFETELHKDGWLLGDSSFFLRTWLMTPLHIPETPAEY-RYNMAHSATHNII 260
Query: 309 VKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ F ++ +R L SK + K IIL CC+LHNI + G
Sbjct: 261 ERTFRTIRSRFRCLDGSKGTLQYSPEKSSHIILACCVLHNISLQHG 306
>gi|297841965|ref|XP_002888864.1| hypothetical protein ARALYDRAFT_339448 [Arabidopsis lyrata subsp.
lyrata]
gi|297334705|gb|EFH65123.1| hypothetical protein ARALYDRAFT_339448 [Arabidopsis lyrata subsp.
lyrata]
Length = 409
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 111/254 (43%), Gaps = 29/254 (11%)
Query: 108 AIALRRLASGESQVSVGVAFGV-GQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
A A+ RLA G S + FG S S + + + E+ L P+ + +
Sbjct: 123 AAAIFRLAHGASYECLVHRFGFDSTSQASHSFFTVCKLINEKLSQQLDAPNPDFSPNL-- 180
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
L NCCG + + + S+LVQ +VD + RF+DI G
Sbjct: 181 -------LPNCCGVVGFGRFEVNGKLLGAKG---------SILVQALVDSDGRFVDISAG 224
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
WP M + + + F + E + LN V + YI+G PLL WL+TPY+
Sbjct: 225 WPSTMKPEAIFRQTKLFSIAE--EVLNEAPTKLGNGVLVPRYILGDSCLPLLPWLVTPYD 282
Query: 287 TNG-----LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK-LPSIILV 340
+ N +H S+ + AF +++ WRIL K W+P+ + +P ++
Sbjct: 283 LTSNEEEEETFREEFNNVVHTGLLSVEI-AFAKVRARWRILDK-KWKPETIEFMPFVLTT 340
Query: 341 CCLLHNIIIDSGDQ 354
CLLHN ++DSGD
Sbjct: 341 GCLLHNFLVDSGDD 354
>gi|351697362|gb|EHB00281.1| Putative nuclease HARBI1 [Heterocephalus glaber]
Length = 349
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 127/294 (43%), Gaps = 24/294 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEDLMSMYGFPRQFIYYLVELLGPSL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q+ +G A G+ Q+++S+ EAL ERA +++P D M+ +K +F G+
Sbjct: 86 QMRMGDAIGISQASMSRCVANVTEALMERASQFIRFPADEASMQALKDEFYGLAGMPGVV 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG N +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMLKNN-------------WLLGDSSFFLRTWLMTPLPLPKTPAE-DRYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIID 350
H AT S+ K L +R L SK + K II CC+LHNI ++
Sbjct: 251 LAHSATHSVMEKTLQTLCSRFRCLDGSKGALQYSPEKSSHIITACCVLHNISLE 304
>gi|328711563|ref|XP_003244572.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 335
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/308 (25%), Positives = 138/308 (44%), Gaps = 37/308 (12%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E F+ FR+ K+T I + +D + P + +S Q+ + LR A+G
Sbjct: 16 DEVEFQKRFRLKKRTVKMILEEL-QDQIKYPTNS------NSISPMTQLLLTLRFFATGN 68
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNC 177
++ G GV + Q+ R L ++ ++K P + + E+K KF L
Sbjct: 69 FLITAGDFSGVSVAAAGQIVKRVSYVLATKSDKYIKMPKTTAEIMELKVKFYGLARFLKV 128
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GAID THI + P+ + + + ++ +S+ +Q +V+ + F+D+V WPG + S +
Sbjct: 129 IGAIDCTHIKLQSPSREYGEQYRNRNGYFSLNLQALVNANLEFLDVVARWPGSAHDSNVF 188
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREY----IVGGVGYPLLSWLITPYETNGLSAS 293
A RL +EL E+ I+G GY L +L+TP N + +
Sbjct: 189 ----------ANSRLRA-------RMELHEFKDCVILGDPGYALSHYLLTPI-ANPTTKA 230
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS---IILVCCLLHNIIID 350
+N TR++ + F G W+ V++ + K+ + +I C +LHNI
Sbjct: 231 ERLYNESQIRTRNVVERTF----GVWKRRFPVLFFGLRLKMETSMAVIQSCAILHNIARL 286
Query: 351 SGDQLHPD 358
+ D PD
Sbjct: 287 ANDPQPPD 294
>gi|449691505|ref|XP_004212695.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 223
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/188 (30%), Positives = 98/188 (52%), Gaps = 5/188 (2%)
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
M E +FE FG++ G +D +HI + PA + D +C ++ YSM VQ + D++ F
Sbjct: 1 MREKIFEFESKFGMIQAHGCVDGSHIPIVCPANPSQDYFCYKQY-YSMQVQAVCDYKGSF 59
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLS 279
+D+ WPG ++ +++ S + RL G + I+ +++ Y++G YP+L
Sbjct: 60 LDVECKWPGSVHDAKVFSNSSINTNLRSS-RLPGTFQTITKNKIKAPCYLIGDPAYPILP 118
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
+ Y T + FNS+ R+ AF +LK W+IL+K M K+P++I
Sbjct: 119 HCMKEYSTCKKNDEF-IFNSMLRTARNPIECAFGRLKVRWKILTKKM-DLKLEKIPTVIY 176
Query: 340 VCCLLHNI 347
C +LHN+
Sbjct: 177 ACFILHNL 184
>gi|390338881|ref|XP_003724868.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 345
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 115/251 (45%), Gaps = 19/251 (7%)
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G L +VE A L LA Q +VG G+ Q+TVS+ + I+ + + +K+P
Sbjct: 54 GCLTTVESMEA-TLHYLADLGYQTTVGEVMGISQTTVSRHVAKNIDLIAAQHPTWIKFPT 112
Query: 158 SNR-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
SN + K ++ E G GAID TH+ + P Q D++ +++N S VQ D
Sbjct: 113 SNADVTHAKERWAEKLGFPFTIGAIDCTHVRIDKPRGQFGDEFINRKNYPSFNVQATCDE 172
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
F + GWPG ++ SR+ + S + +R ++G GY
Sbjct: 173 NYIFTSVDVGWPGSVHDSRIFQTSDLRDVVSRN---------------VRGCLLGDSGYG 217
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
+ +++TP+ + + +FN H R + +AF QLK IL + R +P
Sbjct: 218 ISPYMMTPF-ADPDTPVKRSFNRAHTRNRVVIEQAFGQLKRRSPIL-RYGVRLKLENIPK 275
Query: 337 IILVCCLLHNI 347
I+ C +LHN+
Sbjct: 276 CIVACFVLHNV 286
>gi|50748127|ref|XP_421117.1| PREDICTED: putative nuclease HARBI1-like [Gallus gallus]
Length = 348
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 126/286 (44%), Gaps = 33/286 (11%)
Query: 76 YICSLVREDLV----SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQ 131
+IC LV DL+ SRP R +S E QV AL SG Q +G A G+ Q
Sbjct: 45 FICYLV--DLLGASLSRPTQ-----RSRAISPETQVLAALGFYTSGSFQTRMGDAIGISQ 97
Query: 132 STVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTL 190
+++S+ EAL ERA + +P D ++ +K F G+ G +D TH+ +
Sbjct: 98 ASMSRCVANVTEALVERAPQFIHFPEDEAAVQSLKDDFYALAGMPGVLGVVDCTHVAIKA 157
Query: 191 PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ 250
P + + +++ +S+ + D + T WPG M +L+ +
Sbjct: 158 PNAEDL-SYVNRKGLHSLNCLMVCDARGALLSAETHWPGSMPDCNVLQQAAL-------- 208
Query: 251 RLNGNVRISSEEVELRE--YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLA 308
S E EL + +++G + L +WL+TP A +N H AT ++
Sbjct: 209 -------TSQFENELYKDGWLLGDSSFFLRTWLMTPLHIPETPAEY-RYNMAHSATHNVI 260
Query: 309 VKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ F ++ +R L SK + K IIL CC+LHNI + G
Sbjct: 261 ERTFRTIRSRFRCLDGSKGTLQYSPEKSSHIILACCVLHNISLQHG 306
>gi|326675671|ref|XP_003200405.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 344
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 137/325 (42%), Gaps = 28/325 (8%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S++ YI +L+ + S + R L+ + V I LR ASG ++G A
Sbjct: 46 YRFSREGIMYIVNLLDPHIKS------LTRRSRALTSAQTVCIGLRYFASGTFLYTIGDA 99
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ +S V + + AL++ + +P R + +K F G N GAID THI
Sbjct: 100 ENLSKSAVCRAIRKVYLALKQFLGSVVVFPSHLRPQVVKQSFFAIAGFPNVIGAIDCTHI 159
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P D+ +++ +S+ VQ + D ++ WPG ++ SR+ + S +
Sbjct: 160 QIKAPPGPNEGDFVNRKGVHSINVQMVCDSMYHITNVEAKWPGSVHDSRIFRESHLYTSF 219
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
E G + ++G GY + +TP+ +N+ TR+
Sbjct: 220 ERG--------------DYDGILLGDRGYACRQYFMTPFPDPN-PGPQTRYNAALVRTRA 264
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHD 366
F +KG ++ L + PD + II+ C +LHNI +++H V H++
Sbjct: 265 RIEMTFGHVKGRFQCLKHLRVAPD--RACDIIVACTVLHNIATIRRERIH--VVPVQHNN 320
Query: 367 SGYGEQCCKQVDPMGRTTRENLEKH 391
Q P GR R+ + H
Sbjct: 321 D---LQPVHLDQPSGRAARDRIVSH 342
>gi|449676027|ref|XP_002158178.2| PREDICTED: uncharacterized protein LOC100199123 [Hydra
magnipapillata]
Length = 521
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 131/317 (41%), Gaps = 55/317 (17%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDL-----VSRPPSGLINIEGRLLSVEKQVAIALRR 113
EE FK FRV++ TF+++ + + L R P +SV K+VA+AL
Sbjct: 220 EECEFKEHFRVNRNTFNFLVNKLHPHLGKTTKTMREP----------ISVVKRVAVALHY 269
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEES 171
LAS E V FG+G+ST + + FI A+ + ++K+P S + + FE
Sbjct: 270 LASCEEYRVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFPLSVENLNKHSRDFEAI 329
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G C GA+D HI ++ P Q + + + + YS+++ +VD RFI G
Sbjct: 330 LGFPQCVGAVDGCHIPISAPKDQAT-SYYNYKGWYSIVLFAVVDSRYRFIYTKLG----- 383
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GL 290
LC + G +PL L+ PY N L
Sbjct: 384 --------DSLVPLC----------------------LKGDSAFPLTRHLLKPYPENLEL 413
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
S FN + R + AF ++K +R++ K M D I+ C LHNI
Sbjct: 414 SEIQKNFNKILCGARRVVENAFGRVKARFRVICKRM-ECDINFATRIVNACVTLHNICEH 472
Query: 351 SGDQLHPDVALSDHHDS 367
D + + L H DS
Sbjct: 473 YDDIIIIEWLLHHHDDS 489
>gi|334183854|ref|NP_001185379.1| uncharacterized protein [Arabidopsis thaliana]
gi|332197176|gb|AEE35297.1| uncharacterized protein [Arabidopsis thaliana]
Length = 410
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 84/153 (54%), Gaps = 13/153 (8%)
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELR 266
S+LVQ +VD RF+DI GWP M + + + F + E + L+G V +
Sbjct: 206 SILVQALVDSNGRFVDISAGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVP 263
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTF-----NSLHEATRSLAVKAFLQLKGGWRI 321
YI+G PLL WL+TPY+ L++ +F N +H S+ + AF +++ WRI
Sbjct: 264 RYILGDSCLPLLPWLVTPYD---LTSDEESFREEFNNVVHTGLHSVEI-AFAKVRARWRI 319
Query: 322 LSKVMWRPDKRK-LPSIILVCCLLHNIIIDSGD 353
L K W+P+ + +P +I CLLHN +++SGD
Sbjct: 320 LDK-KWKPETIEFMPFVITTGCLLHNFLVNSGD 351
>gi|47210238|emb|CAF92077.1| unnamed protein product [Tetraodon nigroviridis]
Length = 373
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 122/257 (47%), Gaps = 20/257 (7%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
R +S E QV AL SG Q S+G G+ Q+++S+ AL E+A + + D
Sbjct: 93 RAISPEVQVLAALGFYTSGSFQTSMGTTIGISQASMSRCVSDVTRALVEKAPQFITFNLD 152
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
E+ +F+ G G +D + + P ++ + +++ +S+ Q + + +
Sbjct: 153 PLSREQSFQEFQRVAGFPGVLGVLDCVQVTIKAPTIEDM-SYVNKKGFHSVACQLVCNAQ 211
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+ T WPGG+ + +L+ S LN ++ ++E +++G YPL
Sbjct: 212 GLLLSAETNWPGGLRATDILERSS----------LNKQMQEAAE-----GWLLGDRRYPL 256
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLP 335
W++TP ++ SA + +N+ H AT + K F ++ ++ L +K + +
Sbjct: 257 RKWMMTPVDSPESSAEL-QYNAAHAATHEIVDKTFRAIRSRFKCLDGTKGYLQYSPERSA 315
Query: 336 SIILVCCLLHNIIIDSG 352
+I+L CC+LHN + SG
Sbjct: 316 AILLACCVLHNAFLQSG 332
>gi|432848600|ref|XP_004066426.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 340
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 129/291 (44%), Gaps = 28/291 (9%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S+ + Y+ +L++ P + G LS E + IALR A+G SVG A
Sbjct: 45 YRFSRDSLIYLTNLLK------PFISNVTHRGAALSTENILCIALRFFATGSFLYSVGDA 98
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
VG++TV + AL+ + +++P + IK E G N G +D TH+
Sbjct: 99 EHVGKATVCRAVRTACLALKRMLGYLVQFPGHKPLRVIK----EYSGFPNVIGCMDGTHV 154
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P+V D+ ++++ +S+ VQ I D +I WPG ++ +R+ + S
Sbjct: 155 PIKSPSVNEG-DYVNRKSMHSINVQVICDATHLVTNIEAKWPGSVHDARIFRESSLCPKF 213
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
G NG Y++G YP L +L++PY FN H TR+
Sbjct: 214 HQGH-FNG-------------YLLGDRVYPCLPFLMSPY-PEPQPGPQAHFNLAHSRTRA 258
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
LK ++ L + +P++ II C +LHNI G++ P
Sbjct: 259 RVEMTIGILKVRFQCLRGLRVKPERAC--DIIAACAVLHNIATMRGERAPP 307
>gi|51970904|dbj|BAD44144.1| hypothetical protein [Arabidopsis thaliana]
Length = 379
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 84/153 (54%), Gaps = 13/153 (8%)
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELR 266
S+LVQ +VD RF+DI GWP M + + + F + E + L+G V +
Sbjct: 175 SILVQALVDSNGRFVDISAGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVP 232
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTF-----NSLHEATRSLAVKAFLQLKGGWRI 321
YI+G PLL WL+TPY+ L++ +F N +H S+ + AF +++ WRI
Sbjct: 233 RYILGDSCLPLLPWLVTPYD---LTSDEESFREEFNNVVHTGLHSVEI-AFAKVRARWRI 288
Query: 322 LSKVMWRPDKRK-LPSIILVCCLLHNIIIDSGD 353
L K W+P+ + +P +I CLLHN +++SGD
Sbjct: 289 LDK-KWKPETIEFMPFVITTGCLLHNFLVNSGD 320
>gi|403160884|ref|XP_003890524.1| hypothetical protein PGTG_20815 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170436|gb|EHS64115.1| hypothetical protein PGTG_20815 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 421
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 143/323 (44%), Gaps = 43/323 (13%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F +FR+S+ F ++ +RE L P G LSVE QVA+ L RL G + V
Sbjct: 59 FIEYFRMSRADFAWLSDELRETLQQDPLR-----RGAPLSVEAQVAVGLYRLGHGATYVM 113
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPD---SNRMEEIKSKFEESFGLLNCC 178
+ F +G+ T + T F++A+ + + + +P ++ +EI FE G+ +
Sbjct: 114 ISHVFNIGKETADKATGHFVQAVLKVLRLQTISFPGLDAHDKWDEIIELFEWRHGIPDIV 173
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN------ 232
GAID THI + +P + +++N S++ Q +V+ + F D+ P N
Sbjct: 174 GAIDGTHIPLAIPPSDEWKGYINRKNWASLVFQCVVNGDGNFRDVCD--PPLHNSYIQST 231
Query: 233 VSRLLK--------------FSG---------FFKLCEAGQRLNGNVRISSEEVELREYI 269
S+LL F G F+ + G LN N + ++
Sbjct: 232 SSQLLHALMSDERVWTVGQVFGGGAGSIHDTRVFRRSDLGISLN-NALGREMRIPPGTHL 290
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
+G GYP ++ PY + ++ FN + ATR + + F +LK +RIL +
Sbjct: 291 IGDAGYPSDVNVLVPYPS-VVAPENVHFNHIQSATRIIVEQTFGRLKNRFRILLTAQ-KA 348
Query: 330 DKRKLPSIILVCCLLHNIIIDSG 352
+ + + VC +LHN++ G
Sbjct: 349 NPVRARNTAFVCMILHNLLNKRG 371
>gi|348536895|ref|XP_003455931.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 346
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 71/282 (25%), Positives = 126/282 (44%), Gaps = 30/282 (10%)
Query: 76 YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVS 135
Y+ L+RE L R R +S E QV AL SG Q S+G G+ Q+++S
Sbjct: 48 YLVELLREGLCRRTQ------RSRAISPEVQVLAALGFYTSGSFQTSMGDTIGISQASMS 101
Query: 136 QVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
+ AL E+A + + D + +E+ +F+ G G +D + + P +
Sbjct: 102 RCVSNVTRALVEKAPQFITFNRDLSTIEQSFREFQRVAGFPGVLGVLDCVQVAIKAPNSE 161
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG 254
S + +++ +S+ Q + D + T W GG+ + +L+ S +K + +
Sbjct: 162 DS-SYVNKKGFHSVACQLVCDARGLLLSAETHWAGGLQDTVVLERSALYKELQDTE---- 216
Query: 255 NVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQ 314
+ +++G YPL WL+TP + A +N H ATR + + F
Sbjct: 217 -----------QGWLLGDSRYPLRKWLMTPVDCPECPAEF-RYNLAHAATREIVDRTFRA 264
Query: 315 LKGGWRIL----SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
++ +R L + + P++ SI+L CC+LHN + SG
Sbjct: 265 IQTRFRCLDGTKGYLQYSPERSS--SILLACCVLHNASLQSG 304
>gi|403159653|ref|XP_003890645.1| hypothetical protein PGTG_20678 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168189|gb|EHS63584.1| hypothetical protein PGTG_20678 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 310
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 121/260 (46%), Gaps = 31/260 (11%)
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWP 156
G LSVE QVA+ L RL G S V++G F +G+ T + T RF+ A+ R + + +P
Sbjct: 25 GDPLSVEAQVAVGLYRLGHGSSYVTIGHVFSIGKETADKATGRFVNAVLARFRRVAICYP 84
Query: 157 DSNRMEE---IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGI 213
R ++ I + FE G+ N GAID THI + PA + ++++ S++ Q +
Sbjct: 85 PLARGDQWDKISASFEAKHGIPNIVGAIDGTHIPLATPADDRWKGYINRKSWASIVFQCV 144
Query: 214 VDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGV 273
VD + G SR E G + N + + Y++G
Sbjct: 145 VDGD------------GEWGSR----------SELGHSIT-NGTAAEPMIPHGTYLIGDA 181
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKR 332
GYP ++ PY + +A FN + +TR + ++F LK +RIL M RP +
Sbjct: 182 GYPSNVRVLLPYLSTA-TAENEEFNFIQSSTRIIVEQSFGSLKNRFRILLHAQMARPLRA 240
Query: 333 KLPSIILVCCLLHNIIIDSG 352
+ + C +LHN++ G
Sbjct: 241 R--NNAFACMILHNLLNKRG 258
>gi|251823947|ref|NP_001156537.1| uncharacterized protein LOC100302406 [Acyrthosiphon pisum]
gi|239791514|dbj|BAH72212.1| hypothetical protein [Acyrthosiphon pisum]
Length = 385
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 85/340 (25%), Positives = 145/340 (42%), Gaps = 36/340 (10%)
Query: 40 WPSFWAKNSSTPGAT------IPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGL 93
W W K +T GAT + +++ ++ FFR+ Y+ S ++ + +
Sbjct: 9 WVRDWIKKRNTLGATNNICQELSLEDQSSYRNFFRMDPSLHLYLLSKIKVAIQKEDTNMR 68
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHL 153
+ I E ++ + LR LA+G+S S+ F + ++T+S ++A+ L
Sbjct: 69 LAIPA-----ECKLNVTLRFLATGDSFSSLQYLFRIPKNTISTFIPEVLDAIYSALLDFL 123
Query: 154 KWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
K P+S +EI+ F E + C GAID H+ + P S ++ + +N++S+++
Sbjct: 124 KVPNSVEEWKEIEKGFNEKWNFSGCIGAIDGKHVAIRAPMFSGS-EYYNYKNSFSIILMA 182
Query: 213 IVD--HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN-VRISSEEVELREYI 269
+VD + R+IDI P G + G F C + N + I V
Sbjct: 183 VVDANYCFRYIDI---GPQGRHSD-----GGVFDHCSLRHMIEQNKLHIPENFV-----F 229
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
VG +PL S+L+ PY L+ FN R AF L +R+ K P
Sbjct: 230 VGDEAFPLKSYLMRPYPRRELNTDCKIFNYRLSRARRTVENAFGILVSRFRVFEK----P 285
Query: 330 DKRKLPS---IILVCCLLHNIIIDSGDQLHPDVALSDHHD 366
+P+ I+ C LHN + D + + L D D
Sbjct: 286 IATSVPTAVKIVKTACALHNWLQTRRDSNYFSLGLIDKED 325
>gi|432950717|ref|XP_004084578.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 343
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 140/325 (43%), Gaps = 28/325 (8%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S+ + Y+ +L++ P + G LS E + IALR A+G S+G A
Sbjct: 44 YRFSRDSLIYLTNLLK------PFISNVTHWGAALSTENILCIALRFFATGSFLYSMGDA 97
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
VG++TV + AL+ + +++P + IK +F G N G +D TH+
Sbjct: 98 EHVGKATVCRAVRTACLALKRLLGYFVQFPGHKPLRVIKEEFHRIAGFPNVIGCMDGTHV 157
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P+V D+ ++++ +S+ VQ + +I WPG ++ +R+ + S
Sbjct: 158 PIKSPSVNEG-DYVNRKSMHSINVQVTIYGSCLVTNIEAKWPGSVHDARIFRESSLCPKF 216
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
G NG Y++G YP L +L++PY FN H TR+
Sbjct: 217 HQGH-FNG-------------YLLGDRVYPCLPFLMSPYPEPQ-PGPQAHFNLAHSRTRA 261
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHD 366
LK ++ L + +P++ II C +LHNI G++ P + D
Sbjct: 262 RVEMTIGILKVRFQCLRGLRVKPERAC--DIIAACAVLHNIATMRGERAPPPLEEDGPED 319
Query: 367 SGYGEQCCKQVDPMGRTTRENLEKH 391
+ Q D GR R+ + H
Sbjct: 320 APLDHA--DQRD--GRVVRDMISLH 340
>gi|270000989|gb|EEZ97436.1| hypothetical protein TcasGA2_TC011267 [Tribolium castaneum]
Length = 364
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 125/287 (43%), Gaps = 27/287 (9%)
Query: 68 RVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAF 127
R+ + F+Y+ L+ L G + L+ ++ +AL+ A+G Q VG +
Sbjct: 54 RLDRTVFNYVLQLISPALTKNTKKGRYS----HLTPMHKLYVALQFYATGTYQWMVGSSS 109
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKF------EESFGLLNCCGAI 181
+ QS S+ AL E A +++P + +K F G N GAI
Sbjct: 110 RISQSATSKAIHEVTAALVEVAPQFIRFPAAAEYPTVKQAFYALGMTRHGAGFPNVLGAI 169
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D TH+ + P+ + + + ++ YS+ VQ +V ++ ++ +PG + S + + S
Sbjct: 170 DCTHVRIQKPSTEAPEQYLNRHLYYSINVQLVVSSSLKIFNVFAAFPGSNHDSFIWRNSA 229
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLH 301
AG G ++VG GYP WL+TP +N++H
Sbjct: 230 VRDGIVAGNFTEG-------------WLVGDSGYPQEPWLMTPVTVPTTVPEQ-LYNAVH 275
Query: 302 EATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNI 347
+TR+ + LK W +L S + + P K+ +I CC+LHNI
Sbjct: 276 ISTRNPVERTNGVLKSRWHVLDSPLAYSPA--KVCAITTACCVLHNI 320
>gi|432955676|ref|XP_004085596.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 280
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 120/273 (43%), Gaps = 19/273 (6%)
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK 150
+GL + R +S + ALR A+G ++G A + ++TV + + AL+
Sbjct: 10 TGLPVLRLRAISSAVILCAALRFFANGSFLYNIGDAEHISKATVCRAVGKVCLALKRFLH 69
Query: 151 HHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLV 210
+ +P + IK +F G N G ID THI + P + D+ ++++ +S+ V
Sbjct: 70 IFIVFPGHKPLRPIKEEFHRIAGFPNVVGCIDGTHIPIIAPT-ENEADYVNRKSIHSINV 128
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
Q I D ++ W G ++ SR+ + S E G E+ +++
Sbjct: 129 QIICDAAHIITNVEAKWHGSVHDSRIFRESTLSNRLECG--------------EIDGFLL 174
Query: 271 GGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
G GYP L+TPY FN H TR+ LK ++ L + P+
Sbjct: 175 GDRGYPCQPKLLTPYPEPE-QGPQQCFNWAHSRTRARVEMTIGLLKACFQCLRHLRVTPE 233
Query: 331 KRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
+ II+ C +LHNI I G+Q HP + + D
Sbjct: 234 RAC--DIIVACVVLHNIAILRGEQ-HPALHIQD 263
>gi|403173013|ref|XP_003332118.2| hypothetical protein PGTG_13485 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170092|gb|EFP87699.2| hypothetical protein PGTG_13485 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 135/317 (42%), Gaps = 40/317 (12%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
E FK FR+++ F +C+ +R D V + N +E+Q+ +AL+RL
Sbjct: 100 EASRFKQEFRMTRLAFTKLCARIRNDTVFQN-----NSHNPQRPIEEQLMVALKRLGCFG 154
Query: 119 SQVSVGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ SVG+ FGVG+ TV T R I A+ ++WP +EIK+ + E G
Sbjct: 155 NGASVGMLARFFGVGEGTVELYTNRCIMAILRIKTQIIQWPSPEDRKEIKADYAE-VGFD 213
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G ID I + + D+ ++ +Y + D + TGWPG
Sbjct: 214 GCVGLIDGVLIPLAECPSKNGSDFYSRKGSYGITTLIACDSNRNINFLYTGWPG------ 267
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVEL---REYIVGGVGYPLLSWLITPYET---NG 289
C QR+ GN R++ E + EY++ + ++ ++
Sbjct: 268 ----------CSHDQRVMGNSRLALEPKQFFSPGEYLLADSAFVTTPTIVAAFKRPPHGK 317
Query: 290 LSASMPTFNSLHEATRSL------AVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
L+ +FN R L A+KA Q G R+ ++ R D+ ++ + I C +
Sbjct: 318 LTDDQVSFNYYLARHRVLVEHCIGALKARFQSLKGLRL--RIDGRNDQIRVNAWIQACAV 375
Query: 344 LHNIIIDSGDQLHPDVA 360
LHN +I GD+ D A
Sbjct: 376 LHNFLI-QGDEFEFDDA 391
>gi|348558866|ref|XP_003465237.1| PREDICTED: putative nuclease HARBI1-like [Cavia porcellus]
Length = 350
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 127/296 (42%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEDLMSMYGFPRQFIYYLVELLGPSL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA + +P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALMERATQFIHFPADEASVQALKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGALMTVETNWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S EAG + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFEAGMHKDS-------------WLLGDSSFFLRTWLMTPLPLPKTPAE-DRYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K L +R L SK + K II+ CC+LHNI ++ G
Sbjct: 251 LAHSATHSVMQKTLRTLCSRFRCLDGSKGALQYSPEKSSHIIMACCVLHNISLEHG 306
>gi|326668947|ref|XP_003198900.1| PREDICTED: putative nuclease HARBI1-like [Danio rerio]
Length = 344
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/290 (24%), Positives = 125/290 (43%), Gaps = 23/290 (7%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S++ YI +L+ + S + R L+ + V I LR ASG ++G A
Sbjct: 46 YRFSREGIMYIVNLLDPHIKS------LTRRSRALTSAQTVCIGLRYFASGTFLYTIGDA 99
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ +S V + + AL++ + +P R + +K F G N GAID THI
Sbjct: 100 ENLSKSAVCRAIRKVYLALKQFLGSVVVFPSHLRPQVVKQSFFAIAGFPNVIGAIDCTHI 159
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P D+ +++ +S+ VQ + D ++ WPG ++ SR+ + S +
Sbjct: 160 QIKAPPGPNEGDFVNRKGVHSINVQMVCDSMYHITNVEAKWPGSVHDSRIFRESHLYTSF 219
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
E G + ++G GY + +TP+ +N+ TR+
Sbjct: 220 ERG--------------DYDGILLGDRGYACRQYFMTPFPDPNPGPET-RYNAALVRTRA 264
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLH 356
F LKG ++ L + PD + II+ C +LHNI +++H
Sbjct: 265 RIEMTFGHLKGRFQCLKHLRVAPD--RACDIIVACTVLHNIATIRRERIH 312
>gi|166795899|ref|NP_001107265.2| putative nuclease HARBI1 [Rattus norvegicus]
gi|205831681|sp|B0BN95.1|HARB1_RAT RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|165971427|gb|AAI58735.1| Harbi protein [Rattus norvegicus]
Length = 349
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 127/296 (42%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA + +P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GA+D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GAVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S E G + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFETGMPKDS-------------WLLGDSSFFLHTWLLTPLHIPETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 RAHSATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|348543355|ref|XP_003459149.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 295
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/262 (25%), Positives = 117/262 (44%), Gaps = 20/262 (7%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
+ IALR A+G ++G A V ++TV + AL+ + +P IK
Sbjct: 28 ICIALRFFANGSFLYNIGDAEHVSKATVCRAVRNVTVALKRLLYSFVVFPGHRPTRFIKE 87
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
F + G G ID THI + P+V D+ ++++ +S+ VQ I D ++
Sbjct: 88 GFHKIAGFPGVIGCIDGTHIPIIAPSVN-EGDYVNRKSFHSINVQIICDAANIITNVEAK 146
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
WPG ++ SR+ F+ C + + E Y++G GYP L +L+TPY
Sbjct: 147 WPGSVHDSRI------FRECTLSTKFG--------QGEFTGYLLGDRGYPCLPYLLTPYP 192
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
+ +N H TR+ LK ++ L ++ P++ II+ C +LHN
Sbjct: 193 -DPXXXPQQRYNLAHCRTRARVEMTIGMLKARFQCLQRLRVTPERAC--DIIVACVILHN 249
Query: 347 IIIDSGDQL--HPDVALSDHHD 366
I G+ P+++ +H+
Sbjct: 250 IATIRGEHCPSEPNISSDPNHE 271
>gi|301623960|ref|XP_002941281.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 463
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 113/247 (45%), Gaps = 22/247 (8%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEI 164
++ +L SG Q V G+ Q T S+ ++ + + + + +P N +
Sbjct: 157 KLLCSLHFFVSGSFQRVGEVYGGLSQPTFSRYLGHVLDTIRSVSMNLISFPQHRNEWNTV 216
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
K F G+ N AID TH+ + P + +++ +S+ VQ + D M + IV
Sbjct: 217 KRDFYGVSGIPNVLAAIDCTHVALN-PPQDREHIFRNRKGYHSLNVQVVCDGRMNILSIV 275
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
+G+PG + + L+ SG ++ E GQ +G +++G GYP WLITP
Sbjct: 276 SGFPGSSHDAYTLRQSGLYQSFETGQMPHG-------------WLLGDAGYPCCRWLITP 322
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILV 340
S + FN H RS+ + F LK +R L K +M+ P K+ +II
Sbjct: 323 IR-RPCSQAERAFNQAHVRARSVIEQTFGVLKSRFRCLDKSGGSLMYSPT--KVANIIGA 379
Query: 341 CCLLHNI 347
C +LHN+
Sbjct: 380 CAVLHNL 386
>gi|432924657|ref|XP_004080663.1| PREDICTED: uncharacterized protein LOC101167745 [Oryzias latipes]
Length = 707
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 138/317 (43%), Gaps = 27/317 (8%)
Query: 61 EGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
E F+ +FR+S F+ + SL +E R P + E+++A+ LR L+
Sbjct: 60 ERFRGYFRMSVSEFENLLQQLAPSLTKEQTHYRKP----------IDPEQRLAVCLRFLS 109
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRMEEIKSKFEESFGL 174
+G+S ++ +F +G STV+ + +AL + HL P +F E +
Sbjct: 110 TGDSYRTIASSFRLGVSTVASIVRETCDALWHCLRDEHLPVPTEEMWRSTARRFHERWNF 169
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
NC GA+D HI + PA +S + + + +S+++ +VD + RF+ + G G +
Sbjct: 170 PNCLGAMDGKHIFIQAPA-NSSSLYFNYKGTFSVVLLALVDADYRFLMVDVGSYGSNSDG 228
Query: 235 RLLKFSGFFKLCEAGQRLN----GNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
+ SG K G LN + + E +L IV +PL +L+ PY L
Sbjct: 229 GIFANSGLGKALRDGT-LNVPPPSELPGAPELGKLNHVIVADEAFPLKPYLLRPYPGRRL 287
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
FN R ++ F L +R+ + + + + ++ C+L N +
Sbjct: 288 PTEKRIFNYRLSRARRISENVFGILSQRFRVFQRTL-QVQPSVVDKVVKAACVLCNYLRP 346
Query: 351 SG-DQLHPDVALSDHHD 366
G DQ H A D HD
Sbjct: 347 KGEDQNH---ATEDDHD 360
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/299 (23%), Positives = 121/299 (40%), Gaps = 26/299 (8%)
Query: 88 RPPSGLINIEGRLLSVEKQVA--------------IALRRLASGESQVSVGVAFGVGQST 133
+P G + G+ SVE Q +A L++G+S ++ +F +G ST
Sbjct: 363 QPLQGFEHCRGQRASVEAQNVRELYKEYFNSPAGEVAWHFLSTGDSYRTIASSFRLGVST 422
Query: 134 VSQVTWRFIEALEERAK-HHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPA 192
V+ + +AL + HL P +F E + NC GA+D I + A
Sbjct: 423 VASIVRETCDALWHCLRDEHLPVPTEEMWRSTARRFHERWNFPNCLGAMDGKRISIQATA 482
Query: 193 VQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRL 252
+ + + + +S+++ +VD + RF+ + G G + + SG K G L
Sbjct: 483 -NSGSLYFNYKGTFSVVLLALVDADYRFLMVDVGSYGSNSDGGIFANSGLGKALRDGT-L 540
Query: 253 N----GNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLA 308
N + + E ++ IV +PL +L+ PY L FN R ++
Sbjct: 541 NVPPPSELPGAPELGKVNHVIVADEAFPLKPYLLRPYPGRRLPTDKRIFNYRLSRARRIS 600
Query: 309 VKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG-DQLHPDVALSDHHD 366
F L +R+ + + + + ++ C+L N + G DQ H A D HD
Sbjct: 601 ENVFGILSQRFRVFQRTL-QVQPSVVDKVVKAACVLCNYLRPKGEDQNH---ATEDDHD 655
>gi|403182852|gb|EJY57671.1| AAEL017220-PA [Aedes aegypti]
Length = 367
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 123/295 (41%), Gaps = 17/295 (5%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLV----SRPPSGLINIEGRLLSVEKQVAIALRRLA 115
E+ FK F +SK +FD + LV +DL +RP G+ + ++ + + LA
Sbjct: 5 EKFFKENFHMSKNSFDQLFKLVEKDLAPKRNTRPKDGI--------PPKLKLGLVIEYLA 56
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
SG Q + + V + + + + +A+ ++ P E+ + F +
Sbjct: 57 SGGLQRHLASCYRVSKQHMGSIIDQVCDAICRALSAYVADPCQESFLEVANGFNSRWNFP 116
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GAID H+ + P S + + + +S+ + I D RF + G G
Sbjct: 117 NCIGAIDGKHVSIKAPPNAGSIFY-NYKGFHSLALMAICDASYRFTYLDVGAYGSEGDCN 175
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ K S F RL+ + V+L + V +PL +I PY LSA
Sbjct: 176 IFKESKF-GTDVLHDRLDFPENATVNGVKLPFFYVADDAFPLCKRIIKPYSKKNLSAEER 234
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMW-RPDKRKLPSIILVCCLLHNIII 349
FN R AF L W L K ++ PD+ + II CC+LHN +I
Sbjct: 235 IFNYRLSRARRCIENAFGLLCSKWACLKKTLYCSPDRAQ--KIISACCMLHNFLI 287
>gi|449692104|ref|XP_004212903.1| PREDICTED: uncharacterized protein LOC101239628, partial [Hydra
magnipapillata]
Length = 307
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/311 (27%), Positives = 132/311 (42%), Gaps = 33/311 (10%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
+E FK FRV++ F++ LV E +P G R +SV K+VA+AL LAS
Sbjct: 2 KECDFKEHFRVNRNPFNF---LVNE---LQPHLGKTTTTMREPISVVKRVAVALHYLASC 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
E V FG+G+ST + + + FI A +F C
Sbjct: 56 EEYRVVSSLFGIGKSTANLIVYEFINA---------------------QRFRSYSWFSQC 94
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GA+D HI ++ P Q + + + YS++ +VD RFI G PG N +L
Sbjct: 95 VGAVDGCHIPISAPKDQAISYY-NYKGWYSIVRFAVVDCRYRFIFTSVGSPGRNNDCYIL 153
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASMPT 296
+ S + E+ + V L ++G +PL L+ PY N LS
Sbjct: 154 QNSSLKAILESNLFDKCCKELGDSLVPL--CLIGDSAFPLTRHLLKPYPENLVLSEIQKN 211
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLH 356
FN + R + AF ++K +R++ K M D I+ C +LHNI + D +
Sbjct: 212 FNKILCGARRVVENAFGRVKAQFRVICKRM-ECDINFATRIVNACVILHNICENYDDIII 270
Query: 357 PDVALSDHHDS 367
+ H DS
Sbjct: 271 IKWLMYHHDDS 281
>gi|154759331|ref|NP_848839.2| putative nuclease HARBI1 [Mus musculus]
gi|81913518|sp|Q8BR93.1|HARB1_MOUSE RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|26337267|dbj|BAC32318.1| unnamed protein product [Mus musculus]
gi|62871735|gb|AAH94315.1| Harbi1 protein [Mus musculus]
gi|74151001|dbj|BAE27631.1| unnamed protein product [Mus musculus]
gi|148695628|gb|EDL27575.1| RIKEN cDNA D230010M03 [Mus musculus]
Length = 349
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 113/257 (43%), Gaps = 18/257 (7%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
R +S E Q+ AL SG Q +G A G+ Q+++S+ EAL ERA + +P D
Sbjct: 65 RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPVD 124
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
++ +K +F G+ G D H+ + P + + +++ +S+ + D
Sbjct: 125 EAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDL-SYVNRKGLHSLNCLVVCDIR 183
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+ + T WPG + +L+ S E G + +++G + L
Sbjct: 184 GALMTVETSWPGSLQDCAVLQRSSLTSQFETGMPKD-------------SWLLGDSSFFL 230
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLP 335
SWL+TP +A +N H AT S+ + L +R L SK + K
Sbjct: 231 RSWLLTPLPIPETAAEY-RYNRAHSATHSVIERTLQTLCCRFRCLDGSKGALQYSPEKCS 289
Query: 336 SIILVCCLLHNIIIDSG 352
IIL CC+LHNI +D G
Sbjct: 290 HIILACCVLHNISLDHG 306
>gi|328705664|ref|XP_001951832.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 408
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/322 (25%), Positives = 144/322 (44%), Gaps = 35/322 (10%)
Query: 44 WAKNSSTPGAT------IPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE 97
W K + GA+ + ++ + F R+++ F+ + V+ D +S+ + +
Sbjct: 36 WIKRRNNLGASNTLLKELAVEDPRNYFNFLRINESMFNILLGKVK-DQISKQDTAM---- 90
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L+ ++ IALR LA+G+S S+ + V +S +S+ +A+ K ++ P
Sbjct: 91 REALTPRIKLEIALRFLATGDSYTSLQYLYRVSKSAISEFMPDVFDAIFAGLKEFIQVPK 150
Query: 158 S-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ N + I F S+ NC GAID HII+ PA + ++ + + ++S+++ +VDH
Sbjct: 151 TTNDWDSIVHGFNLSWNFPNCFGAIDGKHIIIECPA-NSGSNFYNYKGSFSIVLLALVDH 209
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN-VRISSEEVELREYIVGGVGY 275
F I G G + G F C + + N + + E V L G +
Sbjct: 210 SYNFTCIDVGAYGSASD------GGIFSKCTLKKAIEENQLNLPDEAVML-----GDEAF 258
Query: 276 PLLSWLITPY-ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKL 334
PL +L+ PY N L+ +N H R + +F L +R+ + + R L
Sbjct: 259 PLTKYLMKPYPRRNILTKKQKIYNYRHCRARRIVENSFGILSSRFRVFRRPL-----RLL 313
Query: 335 PSIIL----VCCLLHNIIIDSG 352
PS ++ C LHN I SG
Sbjct: 314 PSTVVKLVKAACSLHNWIRKSG 335
>gi|354469884|ref|XP_003497342.1| PREDICTED: putative nuclease HARBI1-like [Cricetulus griseus]
gi|344247825|gb|EGW03929.1| Putative nuclease HARBI1 [Cricetulus griseus]
Length = 349
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 126/296 (42%), Gaps = 24/296 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E + ++ Y+ L+ L SRP R +S E Q+ AL SG
Sbjct: 32 DEYLMSMYGFPRQFIYYLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSF 85
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCC 178
Q +G A G+ Q+++S+ EAL ERA + +P D ++ +K +F G+
Sbjct: 86 QTRMGDAIGISQASMSRCVAHVTEALVERASQFIHFPADEAAVQSLKDEFYGLAGMPGVI 145
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G +D H+ + P + + +++ +S+ + D + + T WPG + +L+
Sbjct: 146 GVVDCIHVAIKAPNAEDL-SYVNRKGLHSLNCLIVCDIRGALMTVETSWPGSLQDCAVLQ 204
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S E G + +++G + L +WL+TP A +N
Sbjct: 205 QSSLSSQFETGMHKD-------------SWLLGDSSFFLRTWLMTPLHIAETPAEY-RYN 250
Query: 299 SLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H AT S+ K L +R L SK + K IIL CC+LHNI ++ G
Sbjct: 251 RAHSATHSVIEKTLRTLCSRFRCLDGSKGALQYSPEKSSHIILACCVLHNISLEHG 306
>gi|12323666|gb|AAG51799.1|AC067754_15 hypothetical protein; 75067-63678 [Arabidopsis thaliana]
Length = 2777
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 84/153 (54%), Gaps = 13/153 (8%)
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELR 266
S+LVQ +VD RF+DI GWP M + + + F + E + L+G V +
Sbjct: 206 SILVQALVDSNGRFVDISAGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVP 263
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTF-----NSLHEATRSLAVKAFLQLKGGWRI 321
YI+G PLL WL+TPY+ L++ +F N +H S+ + AF +++ WRI
Sbjct: 264 RYILGDSCLPLLPWLVTPYD---LTSDEESFREEFNNVVHTGLHSVEI-AFAKVRARWRI 319
Query: 322 LSKVMWRPDKRK-LPSIILVCCLLHNIIIDSGD 353
L K W+P+ + +P +I CLLHN +++SGD
Sbjct: 320 LDK-KWKPETIEFMPFVITTGCLLHNFLVNSGD 351
>gi|21397269|gb|AAM51833.1|AC105730_7 Unknown protein [Oryza sativa Japonica Group]
Length = 1202
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/300 (27%), Positives = 139/300 (46%), Gaps = 34/300 (11%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+S+ F +C+ +R GL+ ++ +SVE+QVA+ L+++ S VG +
Sbjct: 596 LRMSRAVFYKLCARLRN-------KGLL-VDTFHVSVEEQVAMFLKKVGQHHSVSCVGFS 647
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL-NCCGAIDATH 185
F TVS+ + A+ E A+ + +N +I SK + + +C GA+D TH
Sbjct: 648 FWRSGETVSRYFRIVLRAMCEIARELIYIRSTNTHSKITSKKNKFYPYFKDCIGALDGTH 707
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK------- 238
I ++PA + D + ++ + V VD ++RFI I+ GW G + S +L+
Sbjct: 708 IRASVPAKKV-DRFRGRKPYPTQNVLAAVDFDLRFIYILAGWEGSAHDSLVLQDALSRPN 766
Query: 239 ----FSGFFKLCEAGQRLNGNVRISSEEV--ELREYIVGGVGYPLLSWLITPYETNGLSA 292
G F L +AG + S V L+E+ G G P+ G
Sbjct: 767 GLKILEGKFFLADAGYAARPGILPSYRRVRYHLKEF-RGPQG---------PHGPQGPKC 816
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
FN H + R+ +AF LK ++IL + P K + +++ CC LHN I+++G
Sbjct: 817 PKELFNYRHSSLRTTIERAFGALKNRFKILMNKPFIPLKAQ-SMVVIACCALHNWILENG 875
>gi|224051095|ref|XP_002200142.1| PREDICTED: putative nuclease HARBI1 [Taeniopygia guttata]
Length = 348
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 33/286 (11%)
Query: 76 YICSLVREDLV----SRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQ 131
+IC LV DL+ SRP R ++ E Q+ AL SG Q +G A G+ Q
Sbjct: 45 FICYLV--DLLGATLSRPTQ-----RSRAITPETQILAALGFYTSGSFQTRMGDAIGITQ 97
Query: 132 STVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTL 190
+++S+ EAL ERA + +P D ++ +K F G+ G +D TH+ +
Sbjct: 98 ASMSRCVANVTEALVERASQFIHFPKDEATLQTLKDDFYGLAGMPGVLGVVDCTHVAIKA 157
Query: 191 PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ 250
P + + +++ +S+ + D + T WPG + +L+ +
Sbjct: 158 PNAE-DLSYVNRKGLHSLNCLMVCDSRGVLLSAETHWPGSLPDCTVLEQAAL-------- 208
Query: 251 RLNGNVRISSEEVELRE--YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLA 308
S E EL + +++G + L +WL+TP A +N H AT ++
Sbjct: 209 -------TSQFETELHKDGWLLGDSSFLLRTWLMTPLHIPETPAEY-RYNMAHSATHNII 260
Query: 309 VKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+ F ++ +R L SK + K IIL CC+LHNI + G
Sbjct: 261 ERTFRTIRSRFRCLDGSKGTLQYSPEKSSHIILACCVLHNISLQHG 306
>gi|328697463|ref|XP_003240344.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 279
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 113/224 (50%), Gaps = 22/224 (9%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
FK F++SK+TF + ++ E++ + + LS E Q+ IALR A+G Q
Sbjct: 47 FKIRFKLSKETFMILLHMIGENIEHK------TLRNFSLSAEVQILIALRYYATGTFQAV 100
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAI 181
+G V ++TV ++ R + + ++++P++ ++++I+++F + G GAI
Sbjct: 101 LGDHIHVHKATVCRIVKRVSLQIAQLRPQYIQFPNNTVQLQQIQARFYKLHGFPRVIGAI 160
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D THI + P + + +++ +S+ +Q I D +++ ++IV WP ++ S +
Sbjct: 161 DCTHIRIQSPKNDIGEKFRNRKGYFSLNIQAICDSQLKIMNIVARWPRSVHDSTIFD--- 217
Query: 242 FFKLCEAGQRLNGNVRISSEEVEL-REYIVGGVGYPLLSWLITP 284
N +R E E +++G GYP S+L+TP
Sbjct: 218 -----------NSFIRAKFENNEFGNTFLLGDGGYPCRSYLLTP 250
>gi|346473924|gb|AEO36806.1| hypothetical protein [Amblyomma maculatum]
Length = 330
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 138/299 (46%), Gaps = 25/299 (8%)
Query: 34 SSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLV----REDLVSRP 89
SS+ W + + S T+P + F+ FRV++ F YI +L R+D R
Sbjct: 49 SSERRLWA--YPRGQSWYETTVPHLPDSEFRAHFRVTRSIFRYILNLCECMQRQDTAMR- 105
Query: 90 PSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA 149
R + ++K+VA++L RLAS + ++ AFG+G+STV+ + F + +
Sbjct: 106 ---------RAVPLDKRVAVSLYRLASSAEERTIANAFGLGRSTVNTIFREFCAVVVRQL 156
Query: 150 KHHL-KWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
L + P + + E +F G GA+D HI + P + + D+ + + YS
Sbjct: 157 GTKLVRLPKRSELAEHLRQFTAVTGFPQGVGALDGCHIEVC-PPEEHASDYINNKGWYST 215
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ-RLNGNVRISSEEVELRE 267
++ I DH+ RF+ G PG + + + + S + + R G V ++ VE
Sbjct: 216 ILLAIADHKYRFLYTNVGSPGRNHDAGVFESSSLPGILASHLFRAEGKV-VAGVTVE--P 272
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPT--FNSLHEATRSLAVKAFLQLKGGWRILSK 324
++ +PL + ++ P+ G S PT FN R + AF +LK +RIL K
Sbjct: 273 LLLADQAFPLQNHIMKPFPNPGPPGS-PTGVFNYHLCRARRIVENAFGRLKARFRILLK 330
>gi|449688903|ref|XP_004211882.1| PREDICTED: uncharacterized protein LOC101238937 [Hydra
magnipapillata]
Length = 280
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 129/292 (44%), Gaps = 19/292 (6%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
EE FK FRV++ TF++ LV E P G R +SV K+V +AL LA
Sbjct: 2 EECEFKEHFRVNRNTFNF---LVNE---LHPHLGKTTTTMREPISVVKRVVVALHYLALC 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
E V FG G+ST + + FI A+ + ++K+P S + G +
Sbjct: 56 EEYRVVSSLFGTGESTANLIVHEFINAVNDILLPKYVKFPLS------VENLNKHIGFPH 109
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C GA+D HI + P Q + + + YS+++ VD RFI G PG N S +
Sbjct: 110 CVGAVDGCHIPILAPKDQAISYY-NYKEWYSIVLFAFVDCRYRFIYTSVGSPGRNNDSYI 168
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASMP 295
L+ S + E+ + V L ++G +PL L+ PY N LS +
Sbjct: 169 LQNSSLKAILESNIFDKCCNELGDSLVPL--CLIGDSAFPLSRHLLKPYPENLELSKILK 226
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
FN + R + AF ++ +R++ K M D I+ C LHNI
Sbjct: 227 NFNKILCGARRVVENAFGCVRARFRVICKRM-EFDINFAKRIVNACVALHNI 277
>gi|240254354|ref|NP_565039.4| uncharacterized protein [Arabidopsis thaliana]
gi|332197175|gb|AEE35296.1| uncharacterized protein [Arabidopsis thaliana]
Length = 2845
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 84/153 (54%), Gaps = 13/153 (8%)
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELR 266
S+LVQ +VD RF+DI GWP M + + + F + E + L+G V +
Sbjct: 206 SILVQALVDSNGRFVDISAGWPSTMKPEAIFRQTKLFSIAE--EVLSGAPTKLGNGVLVP 263
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTF-----NSLHEATRSLAVKAFLQLKGGWRI 321
YI+G PLL WL+TPY+ L++ +F N +H S+ + AF +++ WRI
Sbjct: 264 RYILGDSCLPLLPWLVTPYD---LTSDEESFREEFNNVVHTGLHSVEI-AFAKVRARWRI 319
Query: 322 LSKVMWRPDKRK-LPSIILVCCLLHNIIIDSGD 353
L K W+P+ + +P +I CLLHN +++SGD
Sbjct: 320 LDK-KWKPETIEFMPFVITTGCLLHNFLVNSGD 351
>gi|449274628|gb|EMC83706.1| Putative nuclease HARBI1 [Columba livia]
Length = 348
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 117/259 (45%), Gaps = 22/259 (8%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
R +S E Q+ AL SG Q +G A G+ Q+++S+ EAL ERA + +P D
Sbjct: 65 RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVTNVTEALVERASQFIHFPED 124
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
++ +K F G+ G +D TH+ + P + + +++ +S+ + D
Sbjct: 125 EATVQSLKDDFYGLAGMPGVLGVVDCTHVAIKAPNAEDL-SYVNRKGLHSLNCLMVCDAR 183
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE--YIVGGVGY 275
+ T WPG + +L+ + S E EL + +++G +
Sbjct: 184 GVLLSAETHWPGSLPDCTVLQQAAL---------------TSQFETELHKDGWLLGDSSF 228
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRK 333
L +WL+TP A +N H AT ++ + F ++ +R L SK + +K
Sbjct: 229 FLRTWLMTPLHIPETPAEY-RYNMAHSATHNVIERTFRTIRSRFRCLDGSKGTLQYSPKK 287
Query: 334 LPSIILVCCLLHNIIIDSG 352
IIL CC+LHNI ++ G
Sbjct: 288 SSHIILACCVLHNISLEHG 306
>gi|449667893|ref|XP_004206668.1| PREDICTED: uncharacterized protein LOC101236189 [Hydra
magnipapillata]
Length = 254
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 120/267 (44%), Gaps = 22/267 (8%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR-LLSVEKQVAIALRRLASG 117
EE FK FRV++ TF + LV E P G R +SV K+VA+AL LAS
Sbjct: 2 EECEFKEHFRVNRNTFIF---LVNE---LHPHLGKTTTTMREHISVVKRVAVALHYLASC 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWPDS-NRMEEIKSKFEESFGLL 175
E V FG+G+ST + + FI A+ + + ++K+P S + + FE G
Sbjct: 56 EEYRVVSSLFGIGKSTTNLIVHEFINAVNDISLPKYVKFPLSVENLNKHSRDFEAILGFP 115
Query: 176 NCCGAIDATHIIMTLPAVQT----SDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
C GA+D HI ++ P Q W YS ++ +VD RFI + G PG
Sbjct: 116 QCVGAVDGCHIPISAPKDQAISYIYKGW------YSNVLFAVVDCRYRFIYTIVGLPGRN 169
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GL 290
N S +L+ S + E+ + V L ++G +PL L+ PY N L
Sbjct: 170 NDSYILQNSSLKAILESNLFNKCCKELGDSLVPL--CLIGDSAFPLTRHLLKPYPENLEL 227
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKG 317
S FN + R + AF ++K
Sbjct: 228 SEIQKNFNKILCGARKVVENAFGRVKA 254
>gi|449687590|ref|XP_004211494.1| PREDICTED: uncharacterized protein LOC101237280 [Hydra
magnipapillata]
Length = 450
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 92/344 (26%), Positives = 144/344 (41%), Gaps = 52/344 (15%)
Query: 59 EEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
EE FK FRV++ TF+++ + L + R P +SV K+VA+AL
Sbjct: 144 EECEFKEHFRVNRNTFNFLVNELHPHLGKTTTTMREP----------ISVVKRVAVALHY 193
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEES 171
LAS E V FG+G+ST + + FI A+ + ++K+P S + + FE
Sbjct: 194 LASCEEYRVVSSLFGIGKSTPNLIVHEFINAVNDILLPKYVKFPLSVENLNKHSRDFEAI 253
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G C GA+D HI ++ P Q + + + + YS+++ + D RFI G P
Sbjct: 254 LGFPQCVGAVDGCHIPISAPKDQAT-SYYNYKGWYSIVLFAVADSRYRFIYTSVGSP--- 309
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GL 290
E G L V L ++G +PL L+ PY N L
Sbjct: 310 ---------------ELGDSL----------VPL--CLIGDSVFPLTRHLLKPYPENLEL 342
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
S FN + R + AF ++K +R++ K D I+ C LHNI
Sbjct: 343 SEIQKNFNKILCGARRVVENAFGRVKARFRVICK-RTECDINFATRIVNACVTLHNICEY 401
Query: 351 SGDQLHPDVALSDHHDSGYGEQCCKQV--DPMGRTTRENLEKHL 392
D + + L H DS + G+ R+++ K+L
Sbjct: 402 YDDIIIIEWLLHHHDDSLAQPNTVSTTGNNGPGKNVRDSIAKYL 445
>gi|188501616|gb|ACD54739.1| PIF/Harbinger-like protein [Adineta vaga]
Length = 411
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/317 (24%), Positives = 144/317 (45%), Gaps = 18/317 (5%)
Query: 44 WAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL--- 100
+ KN +P + FK FRV + T Y C L+ + L +E
Sbjct: 67 YKKNGQWWSEIVPRMSDHQFKENFRVERTT--YAC------LIDKIGPYLEKLETNYRAT 118
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDS- 158
+ V+K++AIAL L S + FG+G+ST ++ F L E + +K+P S
Sbjct: 119 IPVDKRIAIALYTLGSSSELRKIAHLFGIGRSTTGEILREFCSTLVETLFYQFIKFPKSP 178
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ ++E + F + F C G++D THI + P D+ + + ++S++++ IV+ ++
Sbjct: 179 DEIKETINGFYDKFNYPMCIGSVDGTHIAIK-PPKGYETDYYNYKKHHSIIMRAIVNSDL 237
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
F + G G N S + S ++ E N + I+ ++++R + + + L
Sbjct: 238 LFTYVNIGASGRCNDSSIYNRSSLSQVIEDPIYDNHYMMIN--QIKVRCHFIADSAFSLS 295
Query: 279 SWLITPY-ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
L+ P+ E + TFN R + F LK +R+L K + + + ++
Sbjct: 296 KTLMKPFPERPNMQKEYSTFNYRLSRARCSVERTFGALKNRFRLLHKKI-EYNLSNITNM 354
Query: 338 ILVCCLLHNIIIDSGDQ 354
+ +LHN+ I +GD
Sbjct: 355 VKAATILHNLCILNGDN 371
>gi|326489332|dbj|BAK01649.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 312
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 127/291 (43%), Gaps = 41/291 (14%)
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
S S+ FG+G ST + + A+ + ++WP + + E+ + F+ G+
Sbjct: 45 SGASISCRFGLGISTCHSIVLQVCAAITDILMPKAIRWP-LDSLPEVAAGFQAMSGIPGV 103
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQE----NN---YSMLVQGIVDHEMRFIDIVTGWPGG 230
G + HI + LP + D++ + NN YS+ +Q +VD F D+ G P
Sbjct: 104 VGTVCTDHIPIGLPK-ENVDEYYNHRLSVRNNKASYSVALQVVVDAGGAFTDVCIGIPSA 162
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
++ + +LK S + C G + R+ +GGV YPL W+I PY+ L
Sbjct: 163 LSNAAVLKRSALYIRCVTGLLGDDQFRL-----------LGGVSYPLTDWMIVPYKHLNL 211
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP---SIILVCCLLHNI 347
+ + FN A + + A +LK WR L R + KLP ++I CC+L N
Sbjct: 212 TWAQHVFNERVAAANAASHGALHRLKARWRCLQ----RRTELKLPDLHNMIGACCVLRNF 267
Query: 348 IIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPM----GRTTRENLEKHLQH 394
SG++L D+ + CC + D + R+ + K L H
Sbjct: 268 CERSGEELDIDLE---------SQLCCDEDDVVAVADAEKERDRIAKDLLH 309
>gi|301611155|ref|XP_002935119.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 320
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 99/194 (51%), Gaps = 27/194 (13%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GAID TH+ +T P + + +++ ++S+ VQ + D +R + + +G+PG ++ +
Sbjct: 108 NCLGAIDCTHVPLTPPRAH-QERYLNRKRSHSINVQVVCDSHLRIMSVRSGFPGSVHDAH 166
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+L+ S ++ G+ G ++VG GY +L WL+TP A
Sbjct: 167 ILRQSALYERFTQGEMPRG-------------WLVGDAGYGVLPWLMTPVRFPRTPAQR- 212
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIILVCCLLHNIIIDS 351
+N H TR++ + F LK +R LS +++ P K + II+VC +LHN+ +D
Sbjct: 213 RYNRAHRKTRNVIERLFGVLKSRFRCLSVTGGALLYSPIK--VSEIIVVCAMLHNVAMDH 270
Query: 352 G------DQLHPDV 359
G D L P++
Sbjct: 271 GLGADINDALEPEI 284
>gi|301611277|ref|XP_002935169.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 427
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 65/244 (26%), Positives = 115/244 (47%), Gaps = 22/244 (9%)
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFE 169
L ASG Q GV QS S+ I+++++ K ++ +P D + +K F
Sbjct: 118 LHFFASGSFQTRTSSIGGVSQSAFSRFMGPVIDSIKKHLKAYIYFPKDKAGWQRVKRGFY 177
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
G+ + G +D HI ++ P + + + + + +S+ VQ I D + + + I + +PG
Sbjct: 178 RIAGMPHVMGVLDCMHIALS-PPHEREELYRNSKGFHSVNVQVICDCKGKILSIYSAFPG 236
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
S +LK + ++ E G+ G +++GG GY WL+T N
Sbjct: 237 SSQDSVILKQTSVYEAFENGKLTGG-------------WLIGGPGYTCQPWLLTAV-ANP 282
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLLH 345
+ + +FN H T S+ + F LK ++ L K + + P + + II+VCC+LH
Sbjct: 283 TTTAEDSFNEAHARTHSVIERTFSSLKSQFKCLDKPGGVLQYNPTR--VADIIVVCCILH 340
Query: 346 NIII 349
NI I
Sbjct: 341 NIGI 344
>gi|410905903|ref|XP_003966431.1| PREDICTED: putative nuclease HARBI1-like [Takifugu rubripes]
Length = 346
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 70/291 (24%), Positives = 132/291 (45%), Gaps = 30/291 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
F ++ Y+ L++E L R R +S E QV AL SG Q S+G
Sbjct: 39 FGFPRQFILYLIELLQEALCRRTQ------RSRAISPEVQVLAALGFYTSGSFQTSMGTT 92
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATH 185
G+ Q+++S+ AL E+A + + D E+ +F+ GL G +D
Sbjct: 93 IGISQASMSRCVSDVTRALVEKAPQFITFDLDPLSREQSFQEFQRVAGLPGVLGVLDCVQ 152
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
+ + P ++ + +++ +S+ Q + + + + T WPGG+ + +L+ S
Sbjct: 153 VTIKAPTIEDL-SYVNKKGFHSVGCQLVCNAQGLLLSAETNWPGGLKDTDILQSSNL--- 208
Query: 246 CEAGQRLNGNVRISSEEVELRE--YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEA 303
S++++ E +++G YPL W++TP ++ SA + +N H A
Sbjct: 209 --------------SKQMQYTEEGWLLGDHRYPLRKWMMTPVDSPESSAEL-QYNLAHAA 253
Query: 304 TRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
T + + F ++ ++ L SK + + +I+L CC+LHN + SG
Sbjct: 254 TCEIVDRTFRAIQSRFKCLDGSKGYLQYSPERSSAILLACCVLHNAFLQSG 304
>gi|403158998|ref|XP_003319668.2| hypothetical protein PGTG_01842 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166548|gb|EFP75249.2| hypothetical protein PGTG_01842 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 435
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 117/263 (44%), Gaps = 24/263 (9%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ TV +V+ R IEAL + H++WPD
Sbjct: 138 LPIAHQLALTLERLGSNGNGASVGRFSRNLQVGRGTVIKVSRRVIEALVSIGRTHVQWPD 197
Query: 158 SNRMEEIKSKFE-ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+R EI E FG C G +D T + M + + D++ YS+ +Q + D
Sbjct: 198 RHRRAEISEVMRMEGFG--GCVGFVDGTTLPMFQRPGYDGEVFFDRKKRYSLNLQILCDC 255
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+ +TGWPG + S++ K +LN N +E +Y++ Y
Sbjct: 256 DKFITSFITGWPGTVGDSKVYK----------RMQLNLNPYDFFDE---GQYLIADSAYD 302
Query: 277 LLSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMW----RPDK 331
L S +I Y+ ++ T FN +R LK W L ++ R
Sbjct: 303 LSSTVIPAYKGKAAEITINTDFNYCMAKSRVRNEHTIGILKARWSSLKEMRLHLYNRGHM 362
Query: 332 RKLPSIILVCCLLHNIIIDSGDQ 354
R+ + I C +LHN++ GDQ
Sbjct: 363 RQYTAWIYSCIVLHNLLAKLGDQ 385
>gi|270011972|gb|EFA08420.1| hypothetical protein TcasGA2_TC006067 [Tribolium castaneum]
Length = 384
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 136/295 (46%), Gaps = 11/295 (3%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
D++ FK +FR++K F I S + ED+ + P + + +S +++AI LR LA+G
Sbjct: 55 DDDTKFKEYFRLTKSQFFSILSYISEDITTMPSTFV----KYPISPCQKLAITLRFLATG 110
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWPDSNRMEEIKSKFEESFGLLN 176
ES S+ F + + +S++ ++ L + +L P ++ +++I F + N
Sbjct: 111 ESFRSMAFNFRIFHNHISKLVREVLKELRNKLLPIYLPKPTTSELKQIAEDFNARWNFPN 170
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
CCGAID HI + P + + + ++ +S+++ ++D +F + G G S +
Sbjct: 171 CCGAIDGKHIRIVCPK-GSGSLYFNYKSYFSIVLLALIDARYKFTWVDVGAYGKEGDSGI 229
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM-- 294
+ S ++L L + ++ L IVG + L ++ PY + A
Sbjct: 230 FQRSSIYQLIHNNNFLPEPTLLPRTDIILPHVIVGDEAFKLTENVLRPYPRDQAKADREK 289
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNII 348
FN R + AF L + I + + P + I++V C LHN++
Sbjct: 290 KIFNYRLSRARRTSENAFGILCQTFCIFYTPIATHPT--LVDDIVMVSCCLHNLL 342
>gi|328697693|ref|XP_001952791.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 616
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 155/340 (45%), Gaps = 29/340 (8%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+SK ++ + + +DL P+ + ++ Q+ LR A+G+ +S+
Sbjct: 296 FRLSKDAVLFLLNWIEKDL--EYPTN----RNQSIAPVNQILCFLRFCATGKHLISIADF 349
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAIDATH 185
G STVS++ +R + R K+ + P ++ M + +F G ID TH
Sbjct: 350 MGCHVSTVSRIVYRVARVISIRCKNFILMPKTDIEMADTTKEFYNIAKFPKVIGVIDCTH 409
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
+ + ++ + +++ +SM VQ + + ++V WPG +V L F+
Sbjct: 410 VKVQSLGGGDAEIYRNRKGYFSMNVQCVCRANLTISNVVARWPG--SVHDLTIFN----- 462
Query: 246 CEAGQRLNGNVRISSE-EVEL-REYIV-GGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
N R+ +E E E+ R YI+ G GYPLL++L+TP + + + +N H
Sbjct: 463 ---------NSRLHTEFETEVYRNYIILGDSGYPLLNYLMTPL-LHPATVNEQLYNESHI 512
Query: 303 ATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALS 362
TR+ + F L + IL+ + + + ++I C+LHNI I DQ+ + L
Sbjct: 513 RTRNCIERCFGVLNRRFPILAYGIRLQNIDAIMAVITSTCILHNIAILFNDQVQDSIDLE 572
Query: 363 DHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQEKDLVK 402
D Q++ GR R++ ++ + NQ +L+
Sbjct: 573 DPQHLNLIINNGHQIN--GRDNRQHGDEVVGLNQITNLIN 610
>gi|432865843|ref|XP_004070641.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 266
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 73/244 (29%), Positives = 108/244 (44%), Gaps = 32/244 (13%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
F YI LV L R LS+E+QV I LR ASG +V
Sbjct: 40 FHFGNADIKYIADLVMPKLQRRTR------RSHSLSLEQQVLIGLRFYASGTFYQAVSDN 93
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
GV +STVS V AL + +P+ ++ + K KF + N G ID TH+
Sbjct: 94 IGVNKSTVSDVVKAVSIALASLVNPFVSFPNDVQIAQTKHKFFTLGNMPNTIGVIDCTHV 153
Query: 187 IMTLPAVQTSDDW--CDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
+ P DW +++ S+ +Q + D ++ + V WPG ++ +R+L+ S ++
Sbjct: 154 HIQAPH---ERDWEYINRKGRRSINIQLVGDADLIITNCVVKWPGSVHDARILRESALYR 210
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT---FNSLH 301
+ R +G I+G YPLL WLITP+ L+AS P FN+ H
Sbjct: 211 ELQT-NRPHG-------------IILGDSAYPLLPWLITPF----LAASTPVQAHFNTAH 252
Query: 302 EATR 305
TR
Sbjct: 253 CKTR 256
>gi|26348669|dbj|BAC37974.1| unnamed protein product [Mus musculus]
Length = 349
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 113/257 (43%), Gaps = 18/257 (7%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
R +S E Q+ AL SG Q +G A G+ Q+++S+ EAL ERA + +P D
Sbjct: 65 RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPVD 124
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
++ +K +F G+ G D H+ + P + + +++ +S+ + D
Sbjct: 125 EAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDL-SYVNRKGLHSLNCLVVCDIR 183
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+ + T WPG + +L+ S E G + +++G + L
Sbjct: 184 GALMTVETSWPGSLQDCAVLQRSSLTSQFETGMPKD-------------SWLLGDSSFFL 230
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLP 335
SWL+TP +A +N H AT S+ + L ++ L SK + K
Sbjct: 231 RSWLLTPLPIPETAAEY-RYNRAHSATHSVIERTLQTLCCRFQCLDGSKGALQYSPEKCS 289
Query: 336 SIILVCCLLHNIIIDSG 352
IIL CC+LHNI +D G
Sbjct: 290 HIILACCVLHNISLDHG 306
>gi|241114453|ref|XP_002400196.1| conserved hypothetical protein [Ixodes scapularis]
gi|215493067|gb|EEC02708.1| conserved hypothetical protein [Ixodes scapularis]
Length = 406
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 132/307 (42%), Gaps = 44/307 (14%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVRE-----DLVSRPPSGLINIEGRLLSVEKQVAIA 110
P + + + F S++TF ++C +RE D RP + VEK+VA+
Sbjct: 96 PDFSAQDWLHNFHASQETFQHLCEALREHIQTQDTAWRPAT----------PVEKKVAVF 145
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEE 170
L RLA+ + FG+G+S+ V ++AL + P N M + FEE
Sbjct: 146 LWRLANNCHYKVLSQLFGIGRSSACLVLKEAVQALLLHVAPRIVRPAENTM----AFFEE 201
Query: 171 SFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
+GL C G + +T++ + P ++ E + + +V G G
Sbjct: 202 VYGLPGCLGVLCSTYVAVNPPLYVKAEYAVPGEEQSGL--------HACLLQVVIGRDGQ 253
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
++ R +G ++ E+G +SE Y++G +PL ++TPY+
Sbjct: 254 ISAVR----AGTQQMGESG---------ASESSLF--YLIGDSSFPLRPEVLTPYQQTAT 298
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
TFNS A + L+LKG W+ L++ + D +++ CC+LHNI
Sbjct: 299 KVQR-TFNSAVSGALGYAEQTVLKLKGRWKCLAQ-KYEGDLSNAGDLVVACCVLHNICEA 356
Query: 351 SGDQLHP 357
GD+ P
Sbjct: 357 RGDRFDP 363
>gi|222640165|gb|EEE68297.1| hypothetical protein OsJ_26550 [Oryza sativa Japonica Group]
Length = 428
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 132/314 (42%), Gaps = 62/314 (19%)
Query: 60 EEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
E F+ FR+ + FD +C ++ +ED R + V ++VA+ L RL
Sbjct: 128 EAEFRRAFRMPRAVFDKLCDDLAAAVAKEDTTLRA----------AIPVPQRVAVCLWRL 177
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
A+G+ V FG+G ST + + A+ ++WPDS+ S+F+ G+
Sbjct: 178 ATGDPLREVSRRFGLGISTCHSIILQVCAAITAVLTRVVRWPDSH--AAAASRFQALSGI 235
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
P V +VD + F D+ G PG ++ +
Sbjct: 236 ----------------PGV------------------AVVDADGAFTDVCIGHPGSLSDA 261
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+L S + CEAG L + +++VGG YPL SW++ PY L+ +
Sbjct: 262 AVLTKSALYARCEAGLLLGDDP----------QWLVGGASYPLTSWMLVPYAQPNLTWAQ 311
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
N+ R+ AV AF +L+ + +LP+++ CC+LHN+ SG +
Sbjct: 312 ERLNARVADARAAAVGAFRRLRAR-WRCLRRRAEVKLPELPNMLGACCVLHNLCERSGGE 370
Query: 355 LHPDVALSDHHDSG 368
L D+ + D G
Sbjct: 371 LDADLLHDELVDDG 384
>gi|255561699|ref|XP_002521859.1| conserved hypothetical protein [Ricinus communis]
gi|223538897|gb|EEF40495.1| conserved hypothetical protein [Ricinus communis]
Length = 459
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 75/254 (29%), Positives = 117/254 (46%), Gaps = 16/254 (6%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWPDSN 159
L + VA+ L RLA G S ++ + + VS++T L + +K P S
Sbjct: 162 LPSDYAVAMVLSRLAFGYSAKALASRYSLEPYLVSKITNMVTRLLATKLYPEFIKIPVSR 221
Query: 160 R-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY---SMLVQGIVD 215
R + E FEE L N CGAID + I ++ + + Y S+L+Q + D
Sbjct: 222 RRLIETTQAFEELTSLPNICGAIDGSPI-----KIRRENGANLYQCKYGFPSVLLQVVAD 276
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFF-KLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
H+ F D+ GG + + + S + +L A + + V R Y+VG
Sbjct: 277 HKKVFWDVCVKASGGTDDATHFRDSLLYNRLVSADLVWEKVINVRGHHV--RPYVVGDWC 334
Query: 275 YPLLSWLITPYETNGL-SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
YPLLS+L+TP+ NG + + F+ + RS V+A LKG W+IL +
Sbjct: 335 YPLLSFLMTPFSPNGSGTPAQNLFDGMLMKGRSAVVEAIGLLKGRWKILQDL--DVGLNH 392
Query: 334 LPSIILVCCLLHNI 347
P I+ CC+LHN+
Sbjct: 393 APQTIVACCVLHNL 406
>gi|331234270|ref|XP_003329795.1| hypothetical protein PGTG_11545 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309308785|gb|EFP85376.1| hypothetical protein PGTG_11545 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 227
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 94/195 (48%), Gaps = 11/195 (5%)
Query: 97 EGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKW 155
G SVE QVA+ L RL G S V++G F +G+ T + RF+ A+ R + + +
Sbjct: 24 RGNPPSVEAQVAVGLYRLGHGSSYVTIGHVFSIGKETADKAAGRFVNAVLARFRRVAICY 83
Query: 156 PD---SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
P ++ +EI + FE G GAID THI + PA + ++++ S++ Q
Sbjct: 84 PPLARGDQWDEISASFEAKHGNPYIVGAIDGTHIPLATPADDRWKGYINRKSWASIVFQC 143
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+VD + F ++ G PG M+ RL F E G + N + + Y++G
Sbjct: 144 VVDGDGNFCNVSGGAPGSMHDGRL------FWRSELGHSIT-NGTAAEPMIPHGTYLIGD 196
Query: 273 VGYPLLSWLITPYET 287
GYP ++ PY +
Sbjct: 197 AGYPSNVRVLIPYPS 211
>gi|328699971|ref|XP_001952082.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 415
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 130/302 (43%), Gaps = 36/302 (11%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F ++R+S +FD + +R + + I R + E+++ I +R L+ G + V+
Sbjct: 79 FSEYYRMSISSFDELLEKLRPHITKK-----ITKFRRPVCPEERLTITIRYLSVGTNFVA 133
Query: 123 VGVAFGVGQSTVS-------QVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ F +G+ST+ QV W ++ E + P+ ++ EI +KF
Sbjct: 134 LQYEFLLGRSTIGNIVRETCQVLWNTLQPEE------MPEPNPDQWTEIANKFYLKTNFP 187
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GA+D HI P + + + + + +S+++ +VD E FI I G G S
Sbjct: 188 NCVGAVDGKHIRCIKP-INSGSMFYNYKKYFSIVLMAVVDAEYSFISIDVGAYGKEGDST 246
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELRE--------YIVGGVGYPLLSWLITPYET 287
+ FK C G++L + V L ++G + L L+ PY
Sbjct: 247 I------FKNCPFGKKLYSELLNLPAPVVLPNTDNFPQPFVLIGDEAFGLHKNLLRPYPG 300
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHN 346
GL+ FN R AF L WR+L S ++ PD +I CC+LHN
Sbjct: 301 RGLTQKRKIFNYRLSRARRYVECAFGILANKWRVLHSAILVEPDFAD--DVIKACCILHN 358
Query: 347 II 348
+
Sbjct: 359 YV 360
>gi|328700311|ref|XP_003241214.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 287
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 129/286 (45%), Gaps = 30/286 (10%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEI 164
QV + LR A G +SV FGV S+ S+ A+ + LK P +N + E
Sbjct: 7 NQVLLTLRFYALGTMLISVADMFGVSVSSASRTIKNVSYAIAGFSGLFLKIP-TNDLVET 65
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
K K + GAID TH+ + P + ++ + +++ +S+ VQ +V+ +++F+DIV
Sbjct: 66 KMKMFKIARFPLVFGAIDCTHVRIQSPGGEFAESFRNRKGYFSLNVQALVNSDLKFMDIV 125
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
WPG + S + + S + E+G+ N N I+G GY L +++TP
Sbjct: 126 AIWPGSAHDSNIFRNSRLYARLESGE-FNNNA------------ILGDSGYALKPYMLTP 172
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
N + +N TR++ + F K + +LS M R + + +II+ +L
Sbjct: 173 I-LNPVGRIEMLYNESQIRTRNIIERCFGVWKRRFPVLSLGM-RLQLKTVQAIIVATAIL 230
Query: 345 HNIIID---------SGDQLH-----PDVALSDHHDSGYGEQCCKQ 376
HNI D S D LH D+ +DH GE +
Sbjct: 231 HNICRDMNEDLPEDSSDDVLHQLNEAEDMTETDHRHDDNGEDSITR 276
>gi|222630438|gb|EEE62570.1| hypothetical protein OsJ_17369 [Oryza sativa Japonica Group]
Length = 448
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 135/302 (44%), Gaps = 59/302 (19%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLS---VEKQVAIALRRLAS 116
E F+ FR+ + FD +C +DL + + ++ L S V ++VA+ L RLA+
Sbjct: 141 EADFRRAFRMPRAVFDKLC----DDLAA----AVAKVDTTLRSAIPVPQRVAVCLWRLAT 192
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
G+ V FG+G ST + +++ H + I + E
Sbjct: 193 GDPLREVSRRFGLGISTCHNI---IVQSSAPCTPEHFR---------IVAPREH------ 234
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENN---YSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
A + D NN YS+ +Q +VD + F D+ G PG ++
Sbjct: 235 ---------------AGKYYDHRLTGRNNKATYSVAMQAVVDADGAFTDVCIGHPGSLSD 279
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
+ +L S + CEAG L G+ ++ ++ ++VGG YPL SW++ PY L+ +
Sbjct: 280 AAILAKSALYARCEAGLLL-GHDKLGWQQPL---WLVGGASYPLTSWMLVPYTQPNLTWA 335
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP---SIILVCCLLHNIIID 350
N+ R+ AV AF +L+ W+ L + + KLP +++ CC+LHN+
Sbjct: 336 QDRLNARVADARAAAVGAFRRLRARWQCLRRA-----EVKLPELANMLGACCVLHNLCER 390
Query: 351 SG 352
SG
Sbjct: 391 SG 392
>gi|328705831|ref|XP_003242919.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 326
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 133/299 (44%), Gaps = 34/299 (11%)
Query: 72 KTF----DYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAF 127
KTF DY L + V R IN ++ Q+ I LR A+G ++VG
Sbjct: 7 KTFNPRIDYFNLLNENEFVRRFRLKKINCTNNSIAPMTQLLITLRFYATGNFLITVGDFG 66
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHI 186
G+ ++ ++ R AL + +++ P++ E+K +F GAID THI
Sbjct: 67 GISVASAGKIVKRVSYALAFLSSRYIRLPETPEEKMELKVQFYGLARFPKVIGAIDCTHI 126
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P+ + + + +++ +S+ VQ +V+ + F+D+V WPG + S +
Sbjct: 127 KLQCPSKEYGELYRNRKGYFSLNVQALVNANLEFMDVVVRWPGSAHDSNIF--------- 177
Query: 247 EAGQRLNGNVRISSEEVELREY----IVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
A RL +EL E+ I+G GY L +L+TP +A +N
Sbjct: 178 -ANSRLKA-------RIELPEFSDCIILGDSGYALSHYLLTPLARTTTNAER-LYNESQI 228
Query: 303 ATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS---IILVCCLLHNIIIDSGDQLHPD 358
TR++ ++F G W+ V++ + K+ + +I C +LHN+ D PD
Sbjct: 229 RTRNVVERSF----GVWKRRFPVLFFGLRLKMETTMAVIQACAVLHNMARLQNDPQPPD 283
>gi|47229913|emb|CAG10327.1| unnamed protein product [Tetraodon nigroviridis]
Length = 585
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 134/317 (42%), Gaps = 44/317 (13%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGESQVSVGV 125
FR+S++TF Y+C +R L + + RL L VEK+VA+AL RLAS ++
Sbjct: 226 FRMSRETFFYLCDKLRPRLTRQ------DTTFRLALPVEKRVAVALWRLASNVEYRTISA 279
Query: 126 AFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
FGVG+STV + A+ + +L+ PD +++ S+G +C A+
Sbjct: 280 LFGVGKSTVCRCVRDMCHAIVALLSSVYLRAPDEQELQDSAQCCLSSWGFPHCVAAMTTL 339
Query: 185 HIIMTLPAVQTSD-----DWCDQENNYSMLVQGIVD-----------HEMRFIDIVTGWP 228
H + P+ SD W + + L Q R++ G
Sbjct: 340 HTAIITPSNNASDYANPAGWLSVVSQVAQLQQTCAQMLACRPSLTSRRGFRWLLTAVG-S 398
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY--- 285
GM V S + E G L+ + LR I+G YPL SWL+ Y
Sbjct: 399 SGMCVPVSQNSSLWATAAEGG--LSPSPPPVFMGRPLRYVILGEACYPLQSWLMKVYSEE 456
Query: 286 -----ETNGLSASMPTFNSLHEATRSLAV--KAFLQLKGGWRILSKVMWRPD--KRKLPS 336
++ FN H R+L + L+L+ W+ LSK R D +P+
Sbjct: 457 EGKRGRKTAMTEPQRLFN--HRLRRALRAPEETLLRLRARWQCLSK---RNDCGLDLVPT 511
Query: 337 IILVCCLLHNIIIDSGD 353
+IL CC+LHN+ GD
Sbjct: 512 MILACCILHNLCESHGD 528
>gi|289742715|gb|ADD20105.1| putative transposase [Glossina morsitans morsitans]
Length = 361
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 133/312 (42%), Gaps = 40/312 (12%)
Query: 46 KNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLV-REDLVSR---PPSGLINIEGRLL 101
+++S P T+P EE F FRVSK+ F+YI + D + R PP + + LL
Sbjct: 31 RDASNP-LTLP---EEVFLQKFRVSKEAFEYILGKIGANDGIRRTYTPPVIRLALTLELL 86
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM 161
+A L S V Q+T S V I+ LE+ K +W +
Sbjct: 87 GSRSHGRLAGSDLCST-----------VAQATFSVVVSEMIQELED--KLCTEWIQLDIN 133
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
K F + FG+ G + THI + Q+ + + + S+ + DH+M+ I
Sbjct: 134 ANTKRWFYKEFGIPGVIGCVGGTHIYFRKTS-QSEETFINPSGKASVNGMVVCDHDMKII 192
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
+ G+PG + S + S + N E +++GG GYPL WL
Sbjct: 193 GVNFGFPGSTHDSHVWNHSNERTYLKDHWNRN----------ESNGWLLGGSGYPLEPWL 242
Query: 282 ITPYETNGLSASMPT---FNSLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKRKLPSI 337
+TPY+ SA PT +N +H RS+ + KG WRI + R + K+
Sbjct: 243 MTPYK----SAVDPTKRKYNDVHSQARSIVNRCINLYKGRWRIFMEDRKSRYNALKMGKF 298
Query: 338 ILVCCLLHNIII 349
VC LHNI I
Sbjct: 299 ATVCAALHNICI 310
>gi|328699930|ref|XP_003241093.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 371
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 110/238 (46%), Gaps = 25/238 (10%)
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
V FG G +T ++ R + L + + + + + + + +F++ G+ + G ID
Sbjct: 111 VAERFGNGLTTQFRINDRVMNFLIDISPKVINFDEG--ITNLAREFKKVSGMPHVIGCID 168
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
T I + PA + + ++ + S+ +QG+ D++ +FID+ TG P ++ +R+ S
Sbjct: 169 GTSIPIRTPAHKIKSTYTNRHDMPSITLQGVCDYKKKFIDVFTGIPAKIHDARVFVLSDL 228
Query: 243 FK----LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTF 297
K +CE L +G YP+ WL+ PY+ G L+ S TF
Sbjct: 229 SKDLPSMCEKKYNL-----------------LGDGAYPIREWLLVPYKDYGRLTESQKTF 271
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
N +TR L F LK +R L ++ K+ I+ C+LHN+ ID D +
Sbjct: 272 NKTLSSTRVLIENTFGLLKSRFRQLLQLDIHS-VDKITKFIISSCVLHNLCIDMDDHI 328
>gi|328696952|ref|XP_003240188.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 371
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 110/238 (46%), Gaps = 25/238 (10%)
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
V FG G +T ++ R + L + + + + + + + +F++ G+ + G ID
Sbjct: 111 VAERFGNGLTTQFRINDRVMNFLIDISPKVINFDEG--ITNLAREFKKVSGMPHVIGCID 168
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
T I + PA + + ++ + S+ +QG+ D++ +FID+ TG P ++ +R+ S
Sbjct: 169 GTSIPIRTPAHKIKSTYTNRHDMPSITLQGVCDYKKKFIDVFTGIPAKIHDARVFVLSDL 228
Query: 243 FK----LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTF 297
K +CE L +G YP+ WL+ PY+ G L+ S TF
Sbjct: 229 SKDLPSMCEKKYNL-----------------LGDGAYPIREWLLVPYKDYGRLTESQKTF 271
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
N +TR L F LK +R L ++ K+ I+ C+LHN+ ID D +
Sbjct: 272 NKALSSTRVLIENTFGLLKSRFRQLLQLDIHS-VDKITKFIISSCVLHNLCIDMDDHI 328
>gi|331221924|ref|XP_003323636.1| hypothetical protein PGTG_05538 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309302626|gb|EFP79217.1| hypothetical protein PGTG_05538 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 367
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 70/294 (23%), Positives = 129/294 (43%), Gaps = 43/294 (14%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F +FR+ + F ++ + + L P +G LSVE QV + L RL + V
Sbjct: 59 FIEYFRMLRADFAWLSDELSKKLQQDPLR-----QGAPLSVEAQVVVGLYRLGHDATYVM 113
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPD---SNRMEEIKSKFEESFGLLNCC 178
+ F +G+ T + T RF++A+ + + + +P ++ ++I FE G+ +
Sbjct: 114 ISHVFNIGKETADKATGRFVQAVLKVLRLRTISFPGLDAHDKWDKIIKSFEWRHGIPDSV 173
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAI+ THI+ +P + + +++N S++ Q +VD + F D+ G G ++ +R+ +
Sbjct: 174 GAINGTHILFAIPPLDEWKGYINRKNWVSLVFQCVVDGDGNFRDVFGGGAGSIHDTRVFR 233
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S + L +RI +++G GYPL ++ PY
Sbjct: 234 RSNLW--ISLNNALGRQMRIPPG-----THLIGDAGYPLDVNVLIPYP------------ 274
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
L LK +RIL + + + + VC +LHN+I G
Sbjct: 275 --------------LVLKNRFRILLTTQ-KANPVRARRTVFVCMILHNLIYKRG 313
>gi|328700758|ref|XP_003241371.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 394
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 132/301 (43%), Gaps = 25/301 (8%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGL-INIEGR-LLSVEKQVAIALRRLA 115
D E F +FR+S K+FD +L+S+ SG+ ++ GR +S +++ + LR LA
Sbjct: 52 DNENKFFNYFRMSIKSFD--------ELLSKLESGIKVSNSGRPSISPTERLCVTLRYLA 103
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
SG + + ++ +G ST+S + E + + + P + EI FE++
Sbjct: 104 SGNTFTDLQYSYRMGISTISGIVEDVCEQIWKMKSECIPLPTEEKWREISLDFEKNTNFP 163
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GA+D HI + P +++ + + ++ YS+++ I D F + G G + S
Sbjct: 164 NCIGALDGKHIRVIKP-IKSGSLFYNYKHYYSIVLMAICDANYCFTFVDVGAYGKFSDSS 222
Query: 236 LLKFSGFFKLCEAGQRLNGNVRI--------SSEEVELREYIVGGVGYPLLSWLITPYET 287
+ K FF+ E N + I +E L I+ + + ++ PY
Sbjct: 223 VFKNGKFFEKLE-----NETLSIPQPKPLPGDNENGPLPYVILADEAFGVSKTILRPYAR 277
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
+ L+ FN R +F L WRI + M I+ CC+LHN
Sbjct: 278 SNLNYKKKIFNYRLSRARRYIESSFGILSNKWRIFHRPM-NTSVELSTKIVKACCVLHNF 336
Query: 348 I 348
+
Sbjct: 337 V 337
>gi|328724827|ref|XP_003248262.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 394
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 132/301 (43%), Gaps = 25/301 (8%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGL-INIEGR-LLSVEKQVAIALRRLA 115
D E F +FR+S K+FD +L+S+ SG+ ++ GR +S +++ + LR LA
Sbjct: 52 DNENKFFNYFRMSIKSFD--------ELLSKLESGIKVSNSGRPSISPTERLCVTLRYLA 103
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
SG + + ++ +G ST+S + E + + + P + EI FE++
Sbjct: 104 SGNTFTDLQYSYRMGISTISGIVEYVCEQIWKMKSECIPLPTEEKWREISLDFEKNTNFP 163
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GA+D HI + P +++ + + ++ YS+++ I D F + G G + S
Sbjct: 164 NCIGALDGKHIRVIKP-IKSGSLFYNYKHYYSIVLMAISDANYCFTFVDVGAYGKFSDSS 222
Query: 236 LLKFSGFFKLCEAGQRLNGNVRI--------SSEEVELREYIVGGVGYPLLSWLITPYET 287
+ K FF+ E N + I +E L I+ + + ++ PY
Sbjct: 223 VFKNGKFFEKLE-----NETLSIPQPKPLPGDNENGPLPYVILADEAFGVSKTILRPYAR 277
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
+ L+ FN R +F L WRI + M I+ CC+LHN
Sbjct: 278 SNLNYKKKNFNYRLSRARRYIESSFGILSNKWRIFHRPM-NTSVELSTKIVKACCVLHNF 336
Query: 348 I 348
+
Sbjct: 337 V 337
>gi|241669308|ref|XP_002411397.1| transposase, putative [Ixodes scapularis]
gi|215504029|gb|EEC13523.1| transposase, putative [Ixodes scapularis]
Length = 411
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/303 (22%), Positives = 125/303 (41%), Gaps = 30/303 (9%)
Query: 57 SDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
++E F+ FR+++ +F + + +D + + S E ++ I L L
Sbjct: 68 NNETLDFREHFRITRSSFRNLLKTLWDD---------TDTKSHGWSHETELLIFLFWLGC 118
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFE---ESFG 173
G V F ++T V R +E + + + +P S + EIK+ F
Sbjct: 119 GAVFKVVSACFNTPRTTTFSVVNRVLERIISKLDRMVYFPTSEDLREIKASFASLARDNK 178
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
+C G + + HI + P D +C + +YS+ +Q + DH F+D+ G+PG +
Sbjct: 179 FRSCAGVVGSCHIQVQAPESMHMDYYCRKRASYSIQMQAVCDHRGVFLDVFAGYPGSVPC 238
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS---WLITPYETNGL 290
S +L+ S F + + ++G YP + + TPY
Sbjct: 239 SCVLENSPLF--------------VGALYPPQGSTLLGDSSYPCIDAPVAIATPYGAPR- 283
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
A FN++H + +AF +KG W+ + + +K P ++ C +HN+
Sbjct: 284 DAVQRHFNAVHARACCVVEQAFALMKGRWKSVFTKPLQVSIQKAPQVVAACAAMHNVCTC 343
Query: 351 SGD 353
GD
Sbjct: 344 LGD 346
>gi|449676887|ref|XP_004208727.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 267
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 102/207 (49%), Gaps = 6/207 (2%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWP- 156
R ++ +K+VAI L L + FG+ T S V + A+ + + P
Sbjct: 51 RAITPDKKVAITLYYLKDSGTLNMTANTFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPK 110
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ +M E S+FE FG++ G +D THI + P + D +C ++ +YS+ VQ + D+
Sbjct: 111 NKQQMREKISEFESKFGMIQAFGCVDGTHIPIVCPTNHSQDYFCYKQ-HYSLQVQAVCDY 169
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR-ISSEEVELREYIVGGVGY 275
+ ++D+ WPG ++ +++ S + RL G + I+ ++++ Y++G Y
Sbjct: 170 KGSYLDVECMWPGSVHDAKVFSNSSINTNLRSS-RLPGTFQTITKNKIKVPCYLIGDPAY 228
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHE 302
PLL I Y T + + FNS+ E
Sbjct: 229 PLLPHCIKKYSTCKKNDEV-IFNSMFE 254
>gi|301624428|ref|XP_002941507.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 350
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 129/298 (43%), Gaps = 26/298 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
EE +R+ +C L+ DL +P +G L V +V AL SG
Sbjct: 35 EEVIIQRYRLHPALIRDLCHLLEHDL--QPSTG----RSHALPVYVKVTAALDFYTSGTF 88
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSK-FEESFGLLNCC 178
Q G A G+ Q+++S+ + EA+ RA +++P ++ F G
Sbjct: 89 QTPAGDAAGISQASMSRCVTQVTEAIVRRAHRFIRFPRGCIQRRAAARDFHRLHGFPGAV 148
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GA+ TH+ + P+ S + ++ +SM +Q + D IV +PG ++ +L
Sbjct: 149 GAVGCTHVALKPPSDHESR-YRNRWRYHSMHMQAVCDARGALTHIVAEYPGSVSEEDILV 207
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S G++ + + + +++GG Y WL+TP E A + +N
Sbjct: 208 QSSL-----------GHMFQNRGQDDDDAWLIGGRCYTQKPWLMTPIEDPQSPAEL-QYN 255
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLLHNIIIDSG 352
H A S+ AF LK +R LS+ + + P K+ I + CC+LHN+ + G
Sbjct: 256 DSHTAALSVLSAAFGSLKSRFRCLSRRGGSLQYSP--LKVSQIFVACCVLHNMALSHG 311
>gi|299749160|ref|XP_001838556.2| hypothetical protein CC1G_11885 [Coprinopsis cinerea okayama7#130]
gi|298408310|gb|EAU83249.2| hypothetical protein CC1G_11885 [Coprinopsis cinerea okayama7#130]
Length = 453
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 131/315 (41%), Gaps = 36/315 (11%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGR--LLSVEKQVAIALRRLAS-GESQVSVGV 125
+S+ TFD + +++ R P + GR +V Q+A L R AS G +SV
Sbjct: 111 ISRTTFDKLV-----EVLERNP--IFQSRGRKPQRAVRYQLATFLLRYASRGSDTLSVAK 163
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATH 185
G+G TV R AL E + W D +R E E G +C G ID T
Sbjct: 164 RMGIGVGTVWLYCRRVTRALRELGLEVITWGDEDRHRETADHVCERTGFPDCIGMIDCTL 223
Query: 186 IIMT-LPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
I +T +P++ +C ++ ++ VQG+ DH++RFI GWPG +LK S ++
Sbjct: 224 IRLTDVPSMWGEVYYC-RKKYPAVNVQGVCDHKLRFISFEMGWPGSTPDVTVLKNSDLWR 282
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA-------SMPTF 297
+ G+ +Y+ GY +L+ P+ + A F
Sbjct: 283 --NRTKYFTGD-----------QYLFADRGYQSSPYLLRPFTEPEVDAFAGPERRRRLDF 329
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLH- 356
N TR AF LKG + L + + ++ +I +LHN+ ID D H
Sbjct: 330 NRTLSGTRIYIEHAFGLLKGRFHSLKDLGRHRNVNEIYQVIHALMVLHNLCIDWEDSPHD 389
Query: 357 ---PDVALSDHHDSG 368
PDV D G
Sbjct: 390 FIDPDVLEEMTEDEG 404
>gi|389609307|dbj|BAM18265.1| unknown unsecreted protein [Papilio xuthus]
Length = 384
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 141/322 (43%), Gaps = 28/322 (8%)
Query: 43 FWAKNS-------STPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLIN 95
FW +N S + +EE F+ +RVS+ TF + SL+ E + + + N
Sbjct: 47 FWLRNHIKHRSEHSEYFTFFQTADEETFENSYRVSRCTFYELHSLI-EPYIRKQDTNYRN 105
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEER 148
+S +++A+ L+ LA+G+S ++G F +G +VS++ W ++ L
Sbjct: 106 S----ISSRERLAVCLKYLATGQSFTTMGENFRIGLKSVSRIVEEVCDALWNILQPLV-- 159
Query: 149 AKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
+ P N +EI F+E + NC GA+D H+ + P+ +T + + + +S+
Sbjct: 160 ----MSQPTENDWKEIAKDFDELWQFKNCIGALDGKHVYIKAPS-KTGSSFFNYKKRFSV 214
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK-LCEAGQRLNGNVRISSEEVELRE 267
++ + D + + I G G + + + S F K L E L V ++
Sbjct: 215 VLMCLADAKRKIIMADVGSMGRFSDAGIFDNSIFGKSLKEKRLNLPQPVPFYQGGAKMPF 274
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
+G +PL+ + PY +GL+A FN R + F L W + K +
Sbjct: 275 VFIGDEAFPLMENFMRPYPRDGLNAEKKIFNYRLSRARRIVEATFGVLTRKWYVYHKD-F 333
Query: 328 RPDKRKLPSIILVCCLLHNIII 349
+ +I C+LHN +I
Sbjct: 334 ECKIETVDKVIKATCVLHNYLI 355
>gi|348545800|ref|XP_003460367.1| PREDICTED: putative nuclease HARBI1-like [Oreochromis niloticus]
Length = 342
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 122/298 (40%), Gaps = 36/298 (12%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS 158
R L+ + V IALR ASG +VG A + +S V + + AL+ + +P
Sbjct: 72 RALTTAQTVCIALRFFASGTFLYAVGDAENIDKSAVCRSIRKVYLALKHFLGVFVVFPSH 131
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
R + +K F G N G ID THI + P D+ +++ +S+ VQ + D
Sbjct: 132 LRPQVVKQGFFSIAGFPNVIGTIDCTHIAIKAPPGPNEGDFVNRKGVHSINVQMVCDSMC 191
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
++ WPG ++ SR+ + SG L E G +G ++G GY
Sbjct: 192 HITNVEAKWPGSVHDSRIFRESGLCTLFERGA-YDG-------------ILLGDRGYACR 237
Query: 279 SWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSII 338
P +N+ TR+ F Q+KG ++ L + PD + II
Sbjct: 238 QNFPDPN-----PGPRTRYNAALARTRARIEMTFGQIKGRFQCLKSLRVAPD--RACDII 290
Query: 339 LVCCLLHNIIIDSGD-----QLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKH 391
+VC +LHNI + ++ PD L H P GR R+ + +H
Sbjct: 291 VVCAVLHNIATIRKERTPVVEVQPDDDLQPVHLD----------QPSGRAARDRIVQH 338
>gi|328706661|ref|XP_003243166.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 428
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 135/300 (45%), Gaps = 27/300 (9%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
E ++ F R+S TF+ + LV ++ P +LSV + + LR LASGES
Sbjct: 69 ELYQNFLRMSASTFEELVCLVGPKIMRFPS------RPDILSVGEVLTATLRYLASGESM 122
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESFGLLNCCG 179
+S+ +F +G++TVS++ ++ E L + L PD+ + ++ +FE + + NC G
Sbjct: 123 MSIMYSFRIGKATVSKLIFQCCEVLWDTLNTKVLIVPDTKKWAQLGVEFENKWQVPNCIG 182
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
+ID HI+ A S+++ + + ++S+++ + D F + G G +
Sbjct: 183 SIDGKHIVHQAFANSGSENY-NYKGSHSIILLAMCDASYNFTIVDIGADGRCSD------ 235
Query: 240 SGFFKLCEAGQRLNGN-------VRISSEEVELREYIVGGVGYPLLSWLITPYETNG--- 289
G F E G+ N I S + Y +G +PLL+ L+ PY G
Sbjct: 236 GGVFSNSEMGKGFMANNLNFPTAKDIDSNSGPIPYYALGDEAFPLLTNLMRPYPGRGKRK 295
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK-VMWRPDKRKLPSIILVCCLLHNII 348
L + FN R F + WR+ K ++ R + + +I +LHN I
Sbjct: 296 LPLNESIFNYRLSRGRRTIENTFGIMASKWRVFRKPIIAR--RNTVEAITKAAVVLHNFI 353
>gi|340382769|ref|XP_003389890.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 421
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 143/297 (48%), Gaps = 27/297 (9%)
Query: 63 FKYFFRVSKKTFDYICSLV-----REDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
+K F R+ +F+ + +LV ++D V R + +S +++A+ LR LA+G
Sbjct: 62 YKNFLRMDVSSFESLLNLVSPIIRKQDTVMR----------QSISPAERLAVTLRYLATG 111
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEIKSKFEESFGLLN 176
E+ S+ + + T+ ++ A+ + ++ P + ++ +EI + FE + +
Sbjct: 112 ETFQSLQYLYRIPAQTIGRIVPETCGAIVKCLHSFIEVPTTTSKWKEIATGFETYWQFPH 171
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C GA+D HI++ P + + + ++ +S+++ +VD + +F + G G ++ +
Sbjct: 172 CIGALDGKHIVIR-PPPNSGSYYFNYKHTFSIVLLALVDADYKFTYVNIGCNGRISDGGV 230
Query: 237 LKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
+G LC A + + NV I + + L IV +PL +++ PY GL+
Sbjct: 231 ---NGNSSLCAALETNSLNVPLPFPICEDGIPLPYMIVADEAFPLKTYIQKPYAQIGLTK 287
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKRKLPSIILVCCLLHNII 348
FN R + AF L +++ ++ + PDK + +I+L CC LHN +
Sbjct: 288 EKRIFNYRLSRARRIVENAFGILANRFQVFMTPIRLSPDKAE--TIVLACCSLHNFL 342
>gi|328715942|ref|XP_003245784.1| PREDICTED: hypothetical protein LOC100572478 [Acyrthosiphon pisum]
Length = 732
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 128/281 (45%), Gaps = 25/281 (8%)
Query: 93 LINIEGRLLSVEKQVAIALRRLASGESQVSVGVAF-GVGQSTVSQVTWRFIEALEERAKH 151
L+ E +S + Q+ IALR A+G S + V F GV +ST ++ R +AL H
Sbjct: 440 LLRPENHAVSAQNQLLIALRYYATG-SFLRVSADFTGVERSTCGRIVRRVSKALATLHPH 498
Query: 152 HLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLV 210
+K+P + + +E +K F C GAID THI + P ++ +++ +S+
Sbjct: 499 FIKFPTTADEVETVKQGFYRIAKFPRCIGAIDCTHIKICSPGGDNAELNRNRKQFFSINF 558
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
Q I D ++ +IV WPG + + + K S E G L GN +V
Sbjct: 559 QTICDSNLQIQNIVCRWPGSAHDANIFKNSIIRSKFEHG--LMGN-----------NLLV 605
Query: 271 GGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
G GYPL +L+TP +N + + +N TR+ ++ K + IL+ + D
Sbjct: 606 GDSGYPLKKYLMTPL-SNPQTIAEERYNESQIRTRNPIEHSYSVWKRRFPILA-IGINVD 663
Query: 331 KRKLPSIILVCCLLHNI-------IIDSGDQLHPDVALSDH 364
++I+ +LHNI I QL DV L +H
Sbjct: 664 LETAKTVIVATAVLHNIANTLRDKTIIVNPQLEEDVELINH 704
>gi|332374146|gb|AEE62214.1| unknown [Dendroctonus ponderosae]
Length = 353
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 142/304 (46%), Gaps = 33/304 (10%)
Query: 60 EEGFKYFFRVSKKTF-DYICSLV---REDLVSRP-PSGLINIEGRLLSVEKQVAIALRRL 114
E F+ FR SK+ + + LV RE + +R P LI G LL+ L+
Sbjct: 41 EADFRKRFRFSKEIVQNVLLPLVGHQREQITNRGLPVPLI--IGLLLT--------LQFY 90
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
SG ++ G G QSTVS+V + ++ E++++++K+P++ + ++++F+
Sbjct: 91 GSGSLRIICGELKGFHQSTVSRVITKVTRSIYEQSRNYIKFPEN--LNTVQTQFQTIRNF 148
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
N G I I + P ++ + + + ++S+ VQ + ++ DIV PG
Sbjct: 149 PNVIGCIGCAQIPIMSPGGPNAELFRNGKRSFSLNVQIVAGPDLEIYDIVATNPGSYRND 208
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+ S V+ E+ +L Y++G GYP L++L+TP+ + +
Sbjct: 209 HVFNKSA--------------VKTRFEKKQLPGYLLGDSGYPSLTYLLTPFR-DPCTNEE 253
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
+N H R++ + F L+ + L +++ R + II+ C +LHNI I +
Sbjct: 254 KRYNEAHAQIRNIVDRTFGVLRHRFSCLRRLL-RHKHETICCIIVACAVLHNISIAHREV 312
Query: 355 LHPD 358
+ PD
Sbjct: 313 VEPD 316
>gi|312377951|gb|EFR24655.1| hypothetical protein AND_10612 [Anopheles darlingi]
Length = 472
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/345 (24%), Positives = 142/345 (41%), Gaps = 55/345 (15%)
Query: 34 SSDSDWWPS-FWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSG 92
+ D W+ S F A + +T ATI R+ ++T+ Y+ ++R DL P S
Sbjct: 80 ARDDTWYRSQFVAGDDATLLATI------------RMDRRTYLYLVDVLRGDLT--PQSQ 125
Query: 93 LINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH 152
I +SVEK+ AI L +LASGE+ +G FGV + + +RF + R + +
Sbjct: 126 QILAPRETVSVEKRCAIGLYKLASGENFGQLGKRFGVHKLAAADSFYRFCSTIVTRLRRN 185
Query: 153 -LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQ 211
++ P ++ +FE++ L I H +T ++ + + + SML+Q
Sbjct: 186 VIRLPSEQEALKLAVRFEKNTCLPRVMALIGMMHFPIT---NAPTNSYINNDGWASMLLQ 242
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISS----------- 260
+VDH + G + +L S E N + +S+
Sbjct: 243 AVVDHSGLIRFATSNHAGSTDAYTVLDES------EVNNHFNHSELVSTGRPTRPSHEHQ 296
Query: 261 -----------------EEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEA 303
++ ++ +++GG YPLL LIT Y + FN +A
Sbjct: 297 MIMLYSLQPTEFYIGTDDQAAVKPFLIGGSEYPLLPHLITRYSAPESREEL-LFNKHLDA 355
Query: 304 TRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
+L W IL + M D +P I+ CC+LHNI+
Sbjct: 356 ALGAFDVTMQRLMARWNILHRGM-DFDPVTVPRTIVCCCILHNIL 399
>gi|449684266|ref|XP_004210586.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 247
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 102/209 (48%), Gaps = 18/209 (8%)
Query: 106 QVAIALRRLASGESQVSVG-VAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEE 163
Q+ ALR A+G Q+ VG + Q T+S++ R +L +R ++K+P + + + +
Sbjct: 53 QLLCALRFYATGSFQIVVGNSTAALSQPTISRIIRRVSLSLAKRINEYIKYPTNLHVLNK 112
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ KF E G ID THI + P + + D+ +N+S+ V+ + D++ +FID+
Sbjct: 113 SRVKFYEIAEFPKVTGVIDCTHICILKPH-EHEYAYVDRSSNHSINVEAVCDNKGKFIDV 171
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
V W G + +R L+ S K +I E + ++G GYP WL+T
Sbjct: 172 VAKWSGSTHDARTLRESKLGK------------KIMDE--TFKGLLIGDYGYPCFCWLLT 217
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAF 312
PY N +AS +N TR + + F
Sbjct: 218 PY-LNPTTASQHRYNISLRKTRVIIEQVF 245
>gi|301623992|ref|XP_002941295.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 309
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 93/181 (51%), Gaps = 21/181 (11%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GAID TH+ +T P + + +++ ++S+ VQ + D +R + + +G+PG ++ +
Sbjct: 109 NCLGAIDCTHVPLTPPRAH-QERYLNRKRSHSINVQVVCDSHLRIMSVRSGFPGSVHDAH 167
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+L+ S ++ G+ G ++VG GY +L WL+TP A
Sbjct: 168 ILRQSALYERFTQGEMPRG-------------WLVGDAGYGVLPWLMTPVRFPRTPAQR- 213
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIILVCCLLHNIIIDS 351
+N H TR++ + F LK +R LS +++ P K+ II+VC +LHN+ +
Sbjct: 214 RYNRAHRKTRNVIERLFGVLKSRFRCLSVTGGALLYSPI--KVSEIIVVCAMLHNVALPQ 271
Query: 352 G 352
G
Sbjct: 272 G 272
>gi|168029636|ref|XP_001767331.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681395|gb|EDQ67822.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 298
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 103/215 (47%), Gaps = 23/215 (10%)
Query: 42 SFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLL 101
+ W + + G P +E+ + R+S F Y+ +R ++ P + ++ +
Sbjct: 77 TLWRRLNKRVGVLNPMMQEKQWIKHCRMSYPAFVYLVEELRPYML--PNAQMVR---EPV 131
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTV---SQVTWRFIEALEERAKHHLKWPDS 158
+E VAI L RLASG S +G +GVG +TV +++ + E+ ++ P
Sbjct: 132 ELELAVAIVLNRLASGASPAVIGNLWGVGTATVVKYTKLVTSIFASCEKLYSKYIVAPTG 191
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIM-------TLPAVQTSDDWCDQENN---YSM 208
R+E I KF + G+ N GAID THI++ T PA D+C E YS+
Sbjct: 192 ERLERISKKFFQVTGIPNVAGAIDGTHIVLQKKPDRATCPA-----DFCSGEGKDAFYSV 246
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
L+QG+ D + F D+ PGG N S LK S +
Sbjct: 247 LLQGVCDADKVFWDVCCVAPGGSNDSEHLKSSSLW 281
>gi|331218562|ref|XP_003321958.1| hypothetical protein PGTG_03495 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309300948|gb|EFP77539.1| hypothetical protein PGTG_03495 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 433
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 142/333 (42%), Gaps = 44/333 (13%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
++ FK FFR+++ +F +C+ V ++ + S N R V +Q+ + L RL
Sbjct: 99 QDNRFKQFFRMTRASFLKLCAQVEDNPIFHNNS---NHPQR--PVIEQMMVTLNRLGCYG 153
Query: 119 SQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ V++G+ + +G TV T R I A+ L WP+ EEIKS+FEE G
Sbjct: 154 NGVAIGMLATCYRIGDGTVELYTNRCILAILSLRSQLLTWPEPAAREEIKSEFEE-VGFD 212
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G ID T +I++ + D+ +++ YS++ + +++ + TGWPG
Sbjct: 213 GCVGLIDGTLVILSTCPEKDGPDYYNRKGFYSVVTLLVCNNKKNITYLYTGWPG------ 266
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVEL---REYIVGGVGYPLLSWLITPYE---TNG 289
C RL N ++ + EY++ + + ++
Sbjct: 267 ----------CSHDMRLMSNCALTKHPGDFFSNGEYLLADSAFTPTRTTVPAFKRKKNQN 316
Query: 290 LSASMPTFNSLHEATR-------SLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCC 342
L+ FN R L F LKG + +V + D ++ + I+ C
Sbjct: 317 LTDEQHDFNRHLSGVRVGIENCIGLLKNRFKSLKG---LRLRVSSKEDMVRVTAWIMACA 373
Query: 343 LLHNIIIDSGDQLHPDVALSDHHDSGY--GEQC 373
+LHN + GD + D+ D+ + G GE C
Sbjct: 374 VLHNFLNQGGD-FNFDITEEDNDEDGAERGEAC 405
>gi|449672868|ref|XP_004207811.1| PREDICTED: uncharacterized protein LOC101239908 [Hydra
magnipapillata]
Length = 312
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 125/297 (42%), Gaps = 37/297 (12%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS- 158
+SV K+VA+AL LAS E V FG+G+ST + + FI A+ + ++K+P S
Sbjct: 5 ISVVKRVAVALHYLASCEEYRVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFPLSV 64
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ + FE G C GA+D HI ++ P Q + + + + YS+++ +VD
Sbjct: 65 ENLNKHSRDFEAILGFPQCVGAVDGCHIPISAPKDQAT-SYYNYKGWYSIVLFAVVDSRY 123
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
RFI G P E G L + ++G +PL
Sbjct: 124 RFIYTSVGSP------------------ELGDSL------------VPLCLIGDSAFPLT 153
Query: 279 SWLITPYETN-GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
L+ PY N LS FN + R + F ++K +R++ K M D I
Sbjct: 154 RHLLKPYPENLELSEIQKNFNKILCGARRVVENVFGRVKARFRVICKRM-ECDINFATRI 212
Query: 338 ILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV--DPMGRTTRENLEKHL 392
+ C LHNI D + + L H DS + G+ R+++ K+L
Sbjct: 213 VNACVTLHNICEYYDDIIIIEWLLHHHDDSLAQPNTVSTTGNNGPGKNVRDSIAKYL 269
>gi|294939454|ref|XP_002782478.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239894084|gb|EER14273.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 319
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 115/251 (45%), Gaps = 19/251 (7%)
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G+ V K++ +ALR LA+G Q+ G A Q VS+ ++A+ E + +++P
Sbjct: 39 GKRHHVMKRLLVALRYLANGSFQLISGDALDCSQEYVSRSVTEVVDAIIENRERFIRFPS 98
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIM--TLPAVQTSDDWCDQENNYSMLVQGIVD 215
+ ++ + L C G ID THI++ +P Q +D + D + N S+ VQ ++
Sbjct: 99 DETI--VRGDADVDPRLPRCIGLIDGTHILLKDVVPIDQRAD-FVDSKKNISLNVQAVLG 155
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
++ + + PG ++ SR+ + S F +R+ S + ++ G Y
Sbjct: 156 ANNLWVSVNSSNPGSVHDSRVYQSSRLF------------LRMRS--IPAGYWLCGDSAY 201
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
L L+TPY + +FN H++ RS + F +LK W I + + K
Sbjct: 202 GLSPSLLTPYNDPIPGTAEASFNIAHKSVRSTVERGFRKLKRSWAICNGSLRLTPPGKAA 261
Query: 336 SIILVCCLLHN 346
+L C LHN
Sbjct: 262 KAVLACFCLHN 272
>gi|328699817|ref|XP_003241056.1| PREDICTED: hypothetical protein LOC100574409 [Acyrthosiphon pisum]
Length = 416
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 135/299 (45%), Gaps = 28/299 (9%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR-LLSVEKQVAIALRRLASGESQV 121
F F+R+SK+++ Y+ +L+ P N R ++ E+++ I LR L +G +
Sbjct: 67 FLCFYRMSKESYLYLVNLIS------PAINKQNTNMRECVNAEERILITLRYLGTGGTFA 120
Query: 122 SVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
S+ V FG G+STV + ++ + E K + P + + + I ++FE + L NC GA
Sbjct: 121 SIAVYFGRGESTVGGIVTETVKVIWEVLEKTFMPIPSTEQWKGIANRFETLWNLPNCIGA 180
Query: 181 IDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
I+ H+ I P +S+ + ++ +SM++ D + F I TG+PG + +
Sbjct: 181 INGKHVRIEKFPNSGSSN--FNYKSYHSMVLMACCDADGLFTIIETGFPGRNSDGGIFSA 238
Query: 240 SGF-FKLCEAGQRLNGNVRISSEEVE--LREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
S + + AG + ++ +E + Y V +PLL +++ PY L
Sbjct: 239 SAMKYWIQNAGLNIPPPTPLTYDENDSPFPYYFVADEAFPLLKYVMRPYPKRILDNVKRI 298
Query: 297 FNSLH-------EATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
N E T +A + F L G RI D + +I C+LHN +
Sbjct: 299 CNYRFSRGRKTIECTFGMACEKFAVLNGPIRIR-------DPENVNLVIKAACILHNYV 350
>gi|449692140|ref|XP_004212915.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 319
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 119/253 (47%), Gaps = 9/253 (3%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P E +K FR++K F + +++ D V P S N R L+ K++AI + L
Sbjct: 71 PDVPETCWKRNFRMTKGCFLELAAII--DTVVSPQSNCPNY--RFLTTHKKLAITIYYLK 126
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNR-MEEIKSKFEESFG 173
S FG+ Q TVS+ +A+ +L P + M ++ S+FE FG
Sbjct: 127 DTGSLWMTANVFGIHQCTVSKTVKVVCDAINNIVGPIYLHLPKNKEDMTKLASQFEVKFG 186
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
++ G ID+TH+ + P + D +C ++ +S+ VQ + D + FID+ WPG ++
Sbjct: 187 MVQAFGCIDSTHVQIKRPIKNSQDYFCYKQ-YFSLNVQAVCDCKGYFIDVECKWPGSVHD 245
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
+++ S K G L + + Y++G YPL ++ I +++ +
Sbjct: 246 AKMFTNSTINKKLIKGT-LPQTLYSLPNYHSIPNYLIGDPAYPLTNFCIKEFQSCSNNEE 304
Query: 294 MPTFNSLHEATRS 306
+ FNS+ + R+
Sbjct: 305 V-IFNSMLRSARN 316
>gi|328710597|ref|XP_003244309.1| PREDICTED: hypothetical protein LOC100571044 [Acyrthosiphon pisum]
Length = 405
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/273 (24%), Positives = 123/273 (45%), Gaps = 18/273 (6%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSN 159
L + ++ + LR LA+G+S ++ + VG+S++S+ A+ E K +++ P + +
Sbjct: 93 LPAKLKLEVTLRYLATGDSYKTLQYIYRVGKSSISEFVPEVFNAILEELKEYIEVPREKS 152
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ +I F + NC GAID HI+M PA + ++ + + +S+++ +VDH
Sbjct: 153 KWNKIMDGFNSLWNFPNCIGAIDGKHIVMECPA-NSGSNYFNYKGTFSIVLLALVDHNYN 211
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
F I G G + + S K E + ++ V I+G +PLL
Sbjct: 212 FTCIDIGSYGSNSDGGIFAKSALKKAIEEHV-----LHTPTDSV-----ILGDDAFPLLP 261
Query: 280 WLITPY-ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK-VMWRPDKRKLPSI 337
+L+ PY N L+ +N H R + F L +R+ + + P+ +
Sbjct: 262 YLMKPYARRNHLTEREKVYNYRHCRARRIVENGFGILSSRFRVFRRPITLTPE--NTIQL 319
Query: 338 ILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYG 370
+ C LHN I SG +P+ D D+ G
Sbjct: 320 VKAACALHNWIRKSGK--NPNSISVDIEDTETG 350
>gi|449690755|ref|XP_004212446.1| PREDICTED: uncharacterized protein LOC101235855 [Hydra
magnipapillata]
Length = 315
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 128/297 (43%), Gaps = 51/297 (17%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDL-----VSRPPSGLINIEGRLLSVEKQVAIALRR 113
EE FK +FRV++ TF++ + + L R P +SV K+VA+AL
Sbjct: 10 EECEFKEYFRVNRNTFNFPVNELHPHLGKTTTTMREP----------ISVVKRVAVALHY 59
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKS-KFEES 171
LAS E +V FG+G+ST + + FI A+ + ++K+P S + S FE
Sbjct: 60 LASCEEYRAVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFPLSVEILNKHSIDFEAI 119
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G C GA+D HI ++ P Q + + + + YS+++ +VD RFI G P
Sbjct: 120 LGFPQCVGAVDGCHIPISAPKDQ-AISYYNYKGWYSIVLFAVVDCRYRFIYTSVGLP--- 175
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
E G L + ++G +PL L+ PY N
Sbjct: 176 ---------------ELGDSL------------VPLCLIGDSAFPLTRHLLKPYPENLEP 208
Query: 292 ASM-PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
+ + FN + R + AF +K +R++ K M D I+ C LHNI
Sbjct: 209 SEIQKNFNKILCGARRVE-NAFGCVKARFRVICKRM-ECDINFATKIVNACVTLHNI 263
>gi|449674857|ref|XP_004208274.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 262
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 91/191 (47%), Gaps = 3/191 (1%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPD 157
R LS EK++AI L L S + FG+ +TVS + + + + ++ P
Sbjct: 21 RALSTEKKLAITLYYLKDTGSLLLTANCFGIAVNTVSLIITQVCKNIVYHLGPIYISLPK 80
Query: 158 S-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ M + ++FE FG++ G ID TH+ + P + D +C ++ YSM VQ + D+
Sbjct: 81 TKTEMRQKVAEFESKFGMIQAFGCIDGTHVPIACPVENSQDYFCYKQY-YSMNVQAVCDY 139
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+ F+D+ WPG ++ S++ S + G I ++ YI+G YP
Sbjct: 140 KGMFMDVECVWPGSIHNSKVFANSSINRKLRNGSLPGIFQSILHGYEKIGTYIIGDPAYP 199
Query: 277 LLSWLITPYET 287
L + I Y+T
Sbjct: 200 LTPFCIKEYQT 210
>gi|449691294|ref|XP_002166584.2| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 226
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 99/208 (47%), Gaps = 18/208 (8%)
Query: 147 ERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENN 205
+R ++K+P + + + E + +F ES G ID THI + P + + D +N
Sbjct: 1 KRVNEYIKYPTNQHVLNESRVEFYESAEFPQVTGVIDCTHICIQKPH-EHEYAYVDSSSN 59
Query: 206 YSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVEL 265
+S+ VQ + D++ +FID+V WPG + +R+L+ S K G
Sbjct: 60 HSINVQAVCDNKGKFIDVVAKWPGSTHDARILRESKLGKKFMDGT--------------F 105
Query: 266 REYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
+ ++G GYP WL+TPY N +AS ++ TR + + F + K + +L
Sbjct: 106 KGLLIGDSGYPCFRWLLTPY-LNPTTASQHRYSISLRKTRVIIEQVFGRWKRRFHLLHGE 164
Query: 326 MWRPDKRKLPSIILVCCLLHNIIIDSGD 353
+ R + +++ C +LHN+ I D
Sbjct: 165 I-RMTPERTCTLVAACAVLHNLAIQLND 191
>gi|72015146|ref|XP_783499.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 429
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/303 (22%), Positives = 136/303 (44%), Gaps = 22/303 (7%)
Query: 55 IPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
+ ++E+ F F RV+ + F + + E L + + LS ++AI LR L
Sbjct: 67 LEDEDEDAFTNFLRVTPQMFKELEQRLHERLEKQD-----TFFRKALSPALKLAITLRHL 121
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWPDS-NRMEEIKSKFEESF 172
A+G+S S+ +F V +T+S + A +EE +K P S + + FE+ +
Sbjct: 122 ATGDSYKSLMYSFRVAHNTISLIVREVCTAIIEEYGDELVKCPTSPQEWKAVAHGFEDRW 181
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
+ GA+D H+ + P + + + + YS+++ +VD +F+ + G+ G +
Sbjct: 182 NSPHTIGALDGKHVAIKCPK-DSGSIFYNYKGFYSIVLMALVDANYKFLWVDVGYEGSTS 240
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVR-------ISSEEVELREYIVGGVGYPLLSWLITPY 285
++L F CE QRL + +++++V+ + V + L +WL+ PY
Sbjct: 241 DAQL------FNSCELKQRLKSDTLGVPLPEPMTNDDVDTPYFFVADEAFALRTWLMKPY 294
Query: 286 ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLH 345
++ +N R + AF ++ L + R + + +++ C LH
Sbjct: 295 ARTAMARHERIYNYRLSRARRVVENAFGIHANRFQCLLTTL-RQNPSTVRTMVQACVCLH 353
Query: 346 NII 348
N++
Sbjct: 354 NLM 356
>gi|449684794|ref|XP_004210718.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 277
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 102/221 (46%), Gaps = 23/221 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG-V 125
FR + + + L++ DL + LI LL ALR A+G Q+ V
Sbjct: 71 FRFRRASIFNLTDLLQIDLQPKSNQTLIVAPILLL------LCALRYYATGSFQIVVSDS 124
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDAT 184
+ Q T+S++ R +L +R ++K+P + + + E K KF E G ID T
Sbjct: 125 TAALSQPTISRIIRRVSLSLAKRINKYIKYPTNQHVLNESKVKFYEIAKFPKVTGVIDCT 184
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
HI + P + + D+ +N+S+ VQ + D++ +FI +V WPG + +R+L+ S K
Sbjct: 185 HICIQKPH-EHEYAYVDRSSNHSINVQAVCDNKGKFIGVVAKWPGSTHDARILRESKLGK 243
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
G + ++G GYP WL+TPY
Sbjct: 244 KFMDG--------------TFKGLLIGDSGYPCFRWLLTPY 270
>gi|331231555|ref|XP_003328441.1| hypothetical protein PGTG_09735 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 435
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 118/270 (43%), Gaps = 26/270 (9%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ TV + T R IEAL + ++ WP+
Sbjct: 139 LPIAHQLALTLERLGSNGNAASVGRFSRNLNVGRGTVIKATRRVIEALVSLGRQYVMWPN 198
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S R EI S+ ++ G + C G +D T I + + + D++ YS+ Q I D +
Sbjct: 199 STRRAEI-SEVMKNEGFVGCVGFVDGTTIPLFQRPGYDGEVFYDRKRRYSINAQIICDCD 257
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+TGWPG S + K +L+ N S+ + +Y++ Y +
Sbjct: 258 KYITSFITGWPGTCGDSLVYK----------RMQLHLN---PSQYFDDGQYLLADSAYEV 304
Query: 278 LSWLITPYETNGLSASMPT---FNSLHEATRSLAVKAFLQLKGGWRILSKVMW----RPD 330
++ Y+ SA +P FN R LK W L ++ R D
Sbjct: 305 SKTVVPAYKNP--SAKIPINADFNYCLAKARVRNEHTIGVLKSRWSSLREMRLHLYRRSD 362
Query: 331 KRKLPSIILVCCLLHNIIIDSGDQLHPDVA 360
R+ + + C +LHNI+ GDQ VA
Sbjct: 363 MREHVAWLYSCIILHNILAKLGDQWSEIVA 392
>gi|432850588|ref|XP_004066823.1| PREDICTED: putative nuclease HARBI1-like, partial [Oryzias latipes]
Length = 326
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 135/299 (45%), Gaps = 27/299 (9%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S ++ +I +L++ + + I+ GR L+ E+ + ALR A+G ++G A
Sbjct: 4 YRFSLQSITFIHNLIQPYITN------ISHPGRALTSEQILCAALRFFANGSFLYNIGDA 57
Query: 127 FGV-GQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATH 185
+ ++TV + + AL+ + + + + IK +F N G ID TH
Sbjct: 58 EHIRKKATVCRAVRKVCLALKRFLRIFIVFLGHKPLRAIKEEFHRIVCFPNVVGCIDGTH 117
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
I++ P + ++ ++++ +S+ VQ D ++ WPG ++ S++ + S
Sbjct: 118 ILIIAPT-ENEAEYVNRKSIHSINVQIKCDAAHIITNVEAKWPGSVHDSQIFRESTLSNR 176
Query: 246 CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATR 305
E G E+ +++G GYP L+TP+ FN H TR
Sbjct: 177 LECG--------------EIDGFLLGDRGYPYHPKLLTPHPEPE-QGPQQRFNLAHSRTR 221
Query: 306 SLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDV-ALSD 363
+ LK ++ L + P++ II+ C +LHNI I G+Q HP+ A SD
Sbjct: 222 ARVEMTIGLLKARFQCLRHLRVTPER--ACDIIVACVVLHNIAIFRGEQ-HPETFATSD 277
>gi|328704555|ref|XP_001947392.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 370
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 66/261 (25%), Positives = 120/261 (45%), Gaps = 30/261 (11%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEI 164
Q+ I LR A+G ++VG G+ ++ ++ R AL + +++ P++ E+
Sbjct: 89 QLLITLRFYATGNFLITVGDFGGISVASAGKIVKRVSYALAFLSSRYIRLPETPEEKMEL 148
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
K +F GAID THI + P+ + + + +++ +S+ VQ +V+ + F+D+V
Sbjct: 149 KVQFYGLARFPKVIGAIDCTHIKLQCPSKEYGELYRNRKGYFSLNVQALVNANLEFMDVV 208
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREY----IVGGVGYPLLSW 280
WPG + S + A RL +EL E+ I+G GY L +
Sbjct: 209 VRWPGSAHDSNIF----------ANSRLKA-------RIELPEFSDCIILGDSGYALSHY 251
Query: 281 LITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS---I 337
L+TP +A +N TR++ ++F G W+ V++ + K+ + +
Sbjct: 252 LLTPLAHTTTNAER-LYNESQIRTRNVVERSF----GVWKRRFPVLFFGLRLKMETTMAV 306
Query: 338 ILVCCLLHNIIIDSGDQLHPD 358
I C +LHN+ D PD
Sbjct: 307 IQACAVLHNMARLQNDPQPPD 327
>gi|403168385|ref|XP_003328028.2| hypothetical protein PGTG_09322 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167479|gb|EFP83609.2| hypothetical protein PGTG_09322 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 111/262 (42%), Gaps = 22/262 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ TV +V+ R IEA + ++ WPD
Sbjct: 138 LPIAHQLALTLERLGSNGNGASVGRFSRNLSVGRGTVVKVSRRVIEAFISLGRRYVVWPD 197
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S R EI S+ G C G +D T I M + + D++ YS+ Q I D +
Sbjct: 198 SARRAEI-SEVMSREGFRGCVGFVDGTTIPMFQRPGYDGEVFFDRKRCYSINAQIICDCD 256
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+TGWPG SR+ K +L+ N S E +Y++ Y L
Sbjct: 257 KYITSFITGWPGSCGDSRVYK----------RMQLHLN---PSNYFEEGQYLLADSAYEL 303
Query: 278 LSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKR 332
+I Y+ + ++ + FN R LK W L + + R R
Sbjct: 304 SHTVIPAYKVPAANITINSQFNYCLAKARVRNEHTIGVLKSRWSSLREMRLHLYCRQHMR 363
Query: 333 KLPSIILVCCLLHNIIIDSGDQ 354
+ + C +LHNI+ GDQ
Sbjct: 364 AYVAWLYSCIILHNILAGLGDQ 385
>gi|391335787|ref|XP_003742270.1| PREDICTED: putative nuclease HARBI1-like [Metaseiulus occidentalis]
Length = 447
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/293 (26%), Positives = 133/293 (45%), Gaps = 31/293 (10%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F FR++++TF Y+ D +S P + + E VE +VA+AL L E+
Sbjct: 120 FFDLFRMTRETFSYVL-----DEIS--PFMMADDE----EVEVKVAVALWILGGSEA--- 165
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS--NRMEEIKSKFEESFGLLNCCGA 180
V + + Q +E + K + S +R++ F+ FGL NC GA
Sbjct: 166 --VYPPLLKVPEEQRIQYLLEFCRQVTKSLMPKEISLHDRLDSSMWNFQCKFGLRNCAGA 223
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNY---SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
++ THI + L ++ DD C+ NN S+++Q +V + +F D+ G+PG V +L
Sbjct: 224 LNFTHIPIYLGDMK--DDACEYLNNAGFRSIIMQAVVSSDYKFCDLSIGYPGRTPVDHVL 281
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
S ++ E+G + Y++ G YPL ++TP++ S F
Sbjct: 282 YNSAVWRKAESGVLFPAEGEPGPNQGP-HPYLIAGTCYPLSERIVTPFQAPKFSDMEKRF 340
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKL---PSIILVCCLLHNI 347
N+ S+ +AF L+ + L DK + P+++ CC+LHNI
Sbjct: 341 NNSIYNALSVTQRAFKLLRSRFPFLLDF----DKVTVEDTPAVVAACCVLHNI 389
>gi|328712569|ref|XP_003244845.1| PREDICTED: hypothetical protein LOC100571710 [Acyrthosiphon pisum]
Length = 415
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 134/299 (44%), Gaps = 28/299 (9%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR-LLSVEKQVAIALRRLASGESQV 121
F F+R+SK+++ Y+ +L+ P N R ++ E+++ I LR L +G +
Sbjct: 67 FLCFYRMSKESYLYLVNLIS------PAINKQNTNMRECVNAEERILITLRYLGTGGTFA 120
Query: 122 SVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
S+ V FG G+STV + ++ + E K + P + + + I ++FE + L NC GA
Sbjct: 121 SIAVYFGRGESTVGGIVTETVKVIWEVLEKTFMPIPSTEQWKGIANRFETLWNLPNCIGA 180
Query: 181 IDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
ID H+ I P +S+ + ++ +SM++ D + F I TG+PG + +
Sbjct: 181 IDGKHVRIEKFPNSGSSN--FNYKSYHSMVLMACCDADGLFTIIETGFPGRNSDGGIFSA 238
Query: 240 SGF-FKLCEAGQRLNGNVRISSEEVE--LREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
S + + AG + ++ +E + Y V +PLL +++ Y L
Sbjct: 239 SAMKYWIQNAGLNIPPPTPLTYDENDSPFPYYFVADEAFPLLKYVMRAYPKRILDNVKRI 298
Query: 297 FNSLH-------EATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
N E T +A + F L G RI D + +I C+LHN +
Sbjct: 299 CNYRFSRGRKTIECTFGMACEKFAVLNGPIRIR-------DPENVNLVIKAACILHNYV 350
>gi|294462959|gb|ADE77019.1| unknown [Picea sitchensis]
Length = 192
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 94/187 (50%), Gaps = 8/187 (4%)
Query: 206 YSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG-NVRISSEEVE 264
YS+ Q +VD +MR +++ TG+ GG + S +LK S + + L+G ++ + EV
Sbjct: 2 YSVAAQIVVDRDMRILNVSTGYSGGYSDSHVLKLSRMYTEVVSNGLLDGQSLLLHGTEVP 61
Query: 265 LREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
L Y++G +GYPL +WL+TPY SA FN H S+ LK W +L++
Sbjct: 62 L--YLIGDLGYPLKTWLLTPYYAQNRSAWEEAFNRKHGQVHSIVSMTSHVLK-RWGVLAQ 118
Query: 325 VMWRPDKRKLPSIILVCCLLHNIIID--SGDQLHPDVALSDHHDSGYGEQCCKQVD-PMG 381
M R + + I CC+LHN++ D G +L L EQ ++D +G
Sbjct: 119 KM-RVSLKMATATIGACCVLHNMMFDWNEGLELEDYALLHSEQPLQEPEQGDVEIDYEIG 177
Query: 382 RTTRENL 388
R R+ L
Sbjct: 178 RRIRDAL 184
>gi|58264374|ref|XP_569343.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57225575|gb|AAW42036.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 429
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/309 (24%), Positives = 125/309 (40%), Gaps = 33/309 (10%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E F+ RV+ F + L+++ P S L+ E R ++ LR
Sbjct: 72 DEDFRRKLRVNHTEFRKLLCLIKD----HPVSSLMVQESRRTPF---YSLQLR------- 117
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN-CC 178
+ + + T + T R I+A+ ++++ WPD N + I FEE + + C
Sbjct: 118 ----SIDWDIADGTSAIWTTRVIKAILSLERNNVYWPDENERKAIDRHFEEEEDIPDGCV 173
Query: 179 GAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G ID H+ PA + D + Y + GI DH R G+P + +R+
Sbjct: 174 GIIDGFHVPFAYKPARHDAVDSLSYKGRYGFNILGICDHLKRIRYFQYGYPASAHDARIF 233
Query: 238 KFSGFFKLCEAGQRLNGNVR-----ISSEEVELREYIVGGVGYPLLSWLITPYE----TN 288
K F+ A + N + SE + EY++ +P W + ++ N
Sbjct: 234 KNCSLFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFKRRRGQN 293
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIILVCCLL 344
L A FN + R A+ LK W+ L K+ D+ I C +L
Sbjct: 294 DLDAPEAKFNKKCSSARVKIEHAYGILKNRWQSLRNLRVKIRNVRDEGVATCWIRACVVL 353
Query: 345 HNIIIDSGD 353
HN++ID+GD
Sbjct: 354 HNLLIDTGD 362
>gi|134110141|ref|XP_776281.1| hypothetical protein CNBC6700 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258953|gb|EAL21634.1| hypothetical protein CNBC6700 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 429
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/309 (24%), Positives = 125/309 (40%), Gaps = 33/309 (10%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E F+ RV+ F + L+++ P S L+ E R ++ LR
Sbjct: 72 DEDFRRKLRVNHTEFRKLLCLIKD----HPVSSLMVQESRRTPF---YSLQLR------- 117
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN-CC 178
+ + + T + T R I+A+ ++++ WPD N + I FEE + + C
Sbjct: 118 ----SIDWDIADGTSAIWTTRVIKAILSLERNNVYWPDENERKAIDRHFEEEEDIPDGCV 173
Query: 179 GAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G ID H+ PA + D + Y + GI DH R G+P + +R+
Sbjct: 174 GIIDGFHVPFAYKPARHDAVDSLSYKGRYGFNILGICDHLKRIRYFQYGYPASAHDARIF 233
Query: 238 KFSGFFKLCEAGQRLNGNVR-----ISSEEVELREYIVGGVGYPLLSWLITPYE----TN 288
K F+ A + N + SE + EY++ +P W + ++ N
Sbjct: 234 KNCSLFEEANADAQSNREAMLQGRAVHSEMISQGEYLLADSAFPAGDWCVPLFKRRRGQN 293
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIILVCCLL 344
L A FN + R A+ LK W+ L K+ D+ I C +L
Sbjct: 294 DLDAPEAKFNKKCSSARVKIEHAYGILKNRWQSLRNLRVKIRNVRDEGVATCWIRACVVL 353
Query: 345 HNIIIDSGD 353
HN++ID+GD
Sbjct: 354 HNLLIDTGD 362
>gi|449678529|ref|XP_002155205.2| PREDICTED: uncharacterized protein LOC100210225, partial [Hydra
magnipapillata]
Length = 338
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 109/237 (45%), Gaps = 20/237 (8%)
Query: 59 EEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
EE FK FRV++ TF+++ + L + R P +SV K+VA+AL
Sbjct: 98 EECEFKEHFRVNRNTFNFLVNELHPHLGKSTTTMREP----------ISVVKRVAVALHY 147
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEES 171
LAS E V FG+G+ST + + FI A+ + ++K P S + + FE
Sbjct: 148 LASCEEYRVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKIPLSVENLNKHSRDFEAI 207
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G C GA+D HI++ L + + + + YS+++ +VD RFI G PG
Sbjct: 208 LGFPQCVGAVDGCHILI-LALKDQAISYYNYKGWYSIVLFSVVDCRYRFIYTSVGSPGRN 266
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
N S +L+ S + E+ + V L ++G +PL L+ PY N
Sbjct: 267 NDSYILQNSSLKAILESSLFDKCCKELGDSLVPL--CLIGDSAFPLTRHLLKPYPEN 321
>gi|449688588|ref|XP_004211784.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 231
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 96/195 (49%), Gaps = 6/195 (3%)
Query: 126 AFGVGQSTVSQVTWRFIEALEER-AKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDA 183
FG+ T S V + A+ + + P + +M E S+FE FG++ G +D
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDG 64
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI + P + D +C ++ YS+ VQ + D++ F+D+ WPG ++ +++ S
Sbjct: 65 THIPIVCPTNHSQDYFCYKQ-YYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSIN 123
Query: 244 KLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
+ RL G + I+ ++++ Y++GG YPLL + Y T + + FNS+
Sbjct: 124 TNLRSS-RLPGTFQTITKNKIKVPCYLIGGPAYPLLPHCMKEYSTCKKNDEV-IFNSMLR 181
Query: 303 ATRSLAVKAFLQLKG 317
R+ AF +LK
Sbjct: 182 TARNPIECAFGRLKA 196
>gi|192455694|ref|NP_001122281.1| uncharacterized protein LOC560936 [Danio rerio]
Length = 351
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 139/338 (41%), Gaps = 36/338 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S + Y+C L+ E + P +V + + IALR SG VG A
Sbjct: 45 YRFSAEGMLYLCRLL-EPHIKNPTR-----RSHATTVPQMICIALRFFTSGTFLYEVGDA 98
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ ++TV + R + AL+ + +P + IK F + GL GAID HI
Sbjct: 99 EKLSKNTVCRTIRRVVIALQRYINTFVAFPGHLPTQAIKEGFSQIAGLPGVIGAIDCIHI 158
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
++ P + + ++ +S+ VQ DH+ + WPG M +++ + S KLC
Sbjct: 159 PISTPVKEIEATFLNRNATHSINVQMTCDHQCLITSLDARWPGSMQDNQIFEKS---KLC 215
Query: 247 EAGQR--LNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT 304
+ Q+ +G +VG Y S+L+TPY + FN T
Sbjct: 216 QRFQQGLFDG-------------VLVGDGTYACQSFLLTPYPEPK-TKPQHEFNIALSQT 261
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALS-D 363
R LK + L + P++ I+ C +LHNI +Q+ + S D
Sbjct: 262 RLKIDNTLAILKARFNCLRDLRVSPERAS--QIVGACAVLHNIASIRKEQMPSECQQSND 319
Query: 364 HHDSGYGEQCCKQVDPMGRTTRENL--EKHLQHNQEKD 399
DS + P GR R + E +QH+ +++
Sbjct: 320 DVDSVSRDH------PAGRAVRHAITEEFFMQHSIKQN 351
>gi|242077384|ref|XP_002448628.1| hypothetical protein SORBIDRAFT_06g030470 [Sorghum bicolor]
gi|241939811|gb|EES12956.1| hypothetical protein SORBIDRAFT_06g030470 [Sorghum bicolor]
Length = 441
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 124/302 (41%), Gaps = 44/302 (14%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVS-QVTWRFIEALEERAKHHLKWPDSNRME 162
+ ++ AL RLA +V FG+ V+ + + A+ ER L +R+
Sbjct: 164 DHKLGAALFRLAHAAPARAVARRFGLPSPAVAARAFYEVCRAIAERLALLLDLAAPDRIA 223
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
F + L NCCGA+ + + Q +VD E RF+D
Sbjct: 224 RAVPGFC-ALSLPNCCGALGYARF------------------GEAAVAQALVDAEGRFLD 264
Query: 223 IVTGWPGGMNVSRLLKFSGFFK-----LCEA--GQRLNGNVRISSEEVELREYIVGGVGY 275
+ GW M + +L + + L A G+ + G+V Y +G
Sbjct: 265 VSVGWNPEMAPAEILPRTKLYTSQTMVLANAPQGELIGGSVP---------RYFLGPACC 315
Query: 276 PLLSWLITPYE-----TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD 330
PLL WL+TPY+ T+ LS FN +H + + AF ++ WR+L +
Sbjct: 316 PLLPWLVTPYKPVVDATDDLSKES-IFNHVHAHGQQVVKNAFGHVRARWRLLEECWKGEC 374
Query: 331 KRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEK 390
+ LP +++ CLLHN ++ G+ + +V SD D + + D G R+ L
Sbjct: 375 QEALPYVVVAGCLLHNFLLQCGEPMTAEVQRSDDADVFVDFE--GEKDKEGERIRDVLAA 432
Query: 391 HL 392
HL
Sbjct: 433 HL 434
>gi|221116902|ref|XP_002160102.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 351
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 119/264 (45%), Gaps = 26/264 (9%)
Query: 97 EGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERA 149
+G+++ +++ I +R LA+GE Q + F VG++TV + W+ ++ + RA
Sbjct: 14 DGKIIEGLRELIITIRYLATGELQQTQSFYFRVGRATVCHIIKETCCAIWKVLKKVFLRA 73
Query: 150 KHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSML 209
+ +K + I +F++++ C GAID H+ + PA ++ + + + YSM+
Sbjct: 74 PNDVK-----EWQNIIKEFDQNWNFPQCIGAIDEKHVRIEAPA-KSGSSFYNYKGFYSMV 127
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVEL 265
+ I D + F + G G N + +L S F G+ N +IS + ++
Sbjct: 128 LLAICDAKYCFTMVDIGAYGRDNDAAILNASTF------GRAFNKGYFNLPKISEFDPKV 181
Query: 266 REYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SK 324
VG + L WL+ PY L+ FN R +F L GWRI S
Sbjct: 182 PPVFVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSFEILAAGWRIYRSP 241
Query: 325 VMWRPDKRKLPSIILVCCLLHNII 348
+ +P K+ II LHN +
Sbjct: 242 IKAKP--LKVEHIIKATVCLHNYL 263
>gi|126631710|gb|AAI34162.1| Zgc:162945 protein [Danio rerio]
Length = 351
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 139/338 (41%), Gaps = 36/338 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S + Y+C L+ E + P +V + + IALR SG VG A
Sbjct: 45 YRFSAEGMLYLCRLL-EPHIKNPTR-----RSHATTVPQMICIALRFFTSGTFLYEVGDA 98
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ ++TV + R + AL+ + +P + IK F + GL GAID HI
Sbjct: 99 EKLSKNTVCRTIRRVVIALQRYINTFVAFPGHLPAQAIKEGFSQIAGLPGVIGAIDCIHI 158
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
++ P + + ++ +S+ VQ DH+ + WPG M +++ + S KLC
Sbjct: 159 PISTPVKEIEATFLNRNATHSINVQMTCDHQCLITSLDARWPGSMQDNQIFEKS---KLC 215
Query: 247 EAGQR--LNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT 304
+ Q+ +G +VG Y S+L+TPY + FN T
Sbjct: 216 QRFQQGLFDG-------------VLVGDGTYACQSFLLTPYPEPK-TKPQHEFNIALSQT 261
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALS-D 363
R LK + L + P++ I+ C +LHNI +Q+ + S D
Sbjct: 262 RLKIDNTLAILKARFNCLRDLRVSPERAS--QIVGACAVLHNIASIRKEQMPSECQQSND 319
Query: 364 HHDSGYGEQCCKQVDPMGRTTRENL--EKHLQHNQEKD 399
DS + P GR R + E +QH+ +++
Sbjct: 320 DVDSVSRDH------PAGRAVRHAITEEFFMQHSIKQN 351
>gi|193610737|ref|XP_001943491.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 380
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 126/282 (44%), Gaps = 24/282 (8%)
Query: 80 LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG--VGQSTVSQV 137
LVRE L P + ++ LS+++++ ALR ASG Q +G G V Q +VS+
Sbjct: 57 LVRELLDILTPFMVSSVTNAGLSIQRKLLTALRFFASGSYQQDIGENRGSAVSQPSVSRC 116
Query: 138 TWRFIEALE--ERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLP--- 191
+ A E ++ +P S + +++ F E FG+ G ID THI + P
Sbjct: 117 ITEVVNAFNRPEILNKYIHFPSSLGELNDVRLGFYEKFGIPGVIGVIDGTHIAIVPPKSE 176
Query: 192 -AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ 250
+ + +++ +S+ Q I D M+ +++ +PG + S + + S L +
Sbjct: 177 DIIYPEHVYINRKGYHSINTQLICDSNMKILNVCAKFPGSTHDSHIWRVSPVLGLLKHLH 236
Query: 251 RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVK 310
+ + +++G GY L WL+TP + +N+ RSL +
Sbjct: 237 SIGHS----------SYFLLGDSGYGLRPWLLTPLTEYQPNTPEARYNTWLCRARSLIER 286
Query: 311 AFLQLKGGWRILSK---VMWRPDKRKLPSIILVCCLLHNIII 349
LK +R L K + + P+K SII C +LHNI I
Sbjct: 287 CNGVLKMRFRCLLKHRVLHYAPEKAS--SIINACTVLHNICI 326
>gi|307166817|gb|EFN60761.1| Putative nuclease HARBI1 [Camponotus floridanus]
Length = 163
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 85/171 (49%), Gaps = 18/171 (10%)
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID +HI + + D + +++ +S+ +QG VDH+M+FID+ G+PG ++ +R+ K
Sbjct: 1 GAIDGSHIRID-KLTEDLDSYINRKQYFSLHMQGTVDHKMKFIDVFIGYPGSVHDTRVFK 59
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREY-IVGGVGYPLLSWLITPYETNG-LSASMPT 296
N +R E Y ++G YP L LI PY NG L+ + +
Sbjct: 60 --------------NSPLRNDLHEFCGNNYLLLGDSAYPCLKELIVPYRDNGHLTHAQRS 105
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
FN + R + AF LK + L R D ++ +I CC+LHNI
Sbjct: 106 FNQKLSSCRVIIENAFGYLKQRFPQLYHFKLR-DIVRMVYVIHACCVLHNI 155
>gi|449446093|ref|XP_004140806.1| PREDICTED: uncharacterized protein LOC101203312 [Cucumis sativus]
gi|449501700|ref|XP_004161442.1| PREDICTED: uncharacterized LOC101203312 [Cucumis sativus]
Length = 386
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 73/289 (25%), Positives = 120/289 (41%), Gaps = 54/289 (18%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
+ F+ FR++ TF+++ L+ L R P G LSVE ++ + L RLA+G
Sbjct: 98 DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDF 153
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
++ FGV +S + + L + +++P N +E S FE+ GL NCC
Sbjct: 154 STISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCC-- 211
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
G+ G + S +L S
Sbjct: 212 ---------------------------------------------GFRGNKDDSTVLMSS 226
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
FK E G+ LN + + V + +Y+ G YPLL WLI P+ ++ +FN
Sbjct: 227 TLFKDIEQGRLLN-SPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEA 285
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
H A+KA + L+ W +LS+ + K + + I C +LHN ++
Sbjct: 286 HRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAV-AYIGACSILHNALL 332
>gi|328705061|ref|XP_003242682.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 379
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 137/298 (45%), Gaps = 37/298 (12%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG-- 124
FR++K Y+ L+ + + S I++ ++L IAL +G Q VG
Sbjct: 48 FRLTKDLTRYLIELLSPFVEVKSRSSAIDLSTKVL-------IALNFFGTGSYQSPVGYN 100
Query: 125 VAFGVGQSTVSQVTWRFIEALEERAKHH--LKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
+ V Q +VS+ ++AL + + +K+P + ++E+ +E+ G G ID
Sbjct: 101 IFNAVSQPSVSRCVKEIVDALNQPQVINTWVKFPSN--IQELNQVRDETTGFPGVIGCID 158
Query: 183 ATHIIMTLPAVQTS--------DDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
TH+ + P+ + + +++N +S+ VQ I D +++ +++ +PG + +
Sbjct: 159 CTHVAIVPPSTNLNLVENHHPEYLYINRKNYHSINVQLICDSKLKILNVNALFPGSTHDN 218
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+ S + + N N Y++G GYPL WL+TP N SA+
Sbjct: 219 HIWNNSNVLPVVQELHERNLN----------DYYLLGDSGYPLRQWLLTPI-LNPSSAAE 267
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSK---VMWRPDKRKLPSIILVCCLLHNIII 349
+N+ TRSL + LK +R L K + + P+K +II C +LHN+ I
Sbjct: 268 IHYNTKQMCTRSLIERCNGVLKARFRCLIKDRTLHYNPEKSS--AIINACVVLHNLCI 323
>gi|345496435|ref|XP_001603548.2| PREDICTED: hypothetical protein LOC100119837 [Nasonia vitripennis]
Length = 331
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 32/294 (10%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
++ + FD I LVR L R + R L+ E ++A L LA G S F
Sbjct: 1 MTSEQFDEILELVRPQLTKRS-------KRRALTPEMRLAAVLNFLAHGNSIQKSAWMFL 53
Query: 129 VGQST----VSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
+G+ST V++V LEE+ ++ +P + + I + + + + NC GAID
Sbjct: 54 IGRSTMYRLVTEVCTAICNVLEEK---YVSFPSQDDLSVIANMYWRIWHMPNCFGAIDGK 110
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
HI + P + + + + ++S+++ G+ D RFI + G G N +G F+
Sbjct: 111 HIRVKAPP-NSGSYFFNYKKHFSIVLMGLSDAFCRFIWVNIGDFGSSND------AGIFQ 163
Query: 245 LCEAGQRLNGN-------VRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPT 296
+ Q L+ + +V +I+G +PL ++L+ PY N L+
Sbjct: 164 RSDLRQALDNEEIDIPAPTYLPRTDVLCPYFIIGDGAFPLKNYLMKPYTRANNLTHEEKI 223
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNIII 349
N R +AF L WRIL S V W+ + + ++I+ LHN +I
Sbjct: 224 VNYRLSRARLTIERAFGILTKKWRILESPVDWK--LKNIETVIMALICLHNFLI 275
>gi|432877870|ref|XP_004073237.1| PREDICTED: uncharacterized protein LOC101165567 [Oryzias latipes]
Length = 399
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 74/303 (24%), Positives = 133/303 (43%), Gaps = 21/303 (6%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEG---RLLSVEKQVAIALRRLA 115
+E F+++FR+S FD DL+SR + + + + R +S +++++ LR LA
Sbjct: 55 DEGRFQWYFRLSTAQFD--------DLLSRIGARITHQDTNYRRSISAAERLSVCLRYLA 106
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGL 174
+G+S ++ ++ VG STV+ + + A+ + + + P + I +F + +
Sbjct: 107 TGDSYRTIATSYRVGVSTVAIIIPEVLTAIWDGLVEDFMAVPSAEDWRSIAEEFHQRWNF 166
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
NCCGA+D H+I+ +P + + +S+++ +VD RF I G G +
Sbjct: 167 PNCCGAVDGKHVILKVPP-NLGSQLHNCKGPFSVVLLAVVDALGRFRVIDVGDYGRTSDG 225
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELR----EYIVGGVGYPLLSWLITPYETNGL 290
+L S + G LN E R V +PL ++ P+ L
Sbjct: 226 GILANSTLGQALRCGT-LNLPPDHPLPGAEQRGPQPHVFVADEAFPLRRNMMRPFPGRAL 284
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLLHNIII 349
+ FN R + F L WR+ +++ RP+ L + CLLHN I
Sbjct: 285 TPERRIFNYRLSRARLVVEDTFGLLFAQWRLFRRLVDVRPE--VLEKFVKTTCLLHNFIR 342
Query: 350 DSG 352
S
Sbjct: 343 SSA 345
>gi|301632578|ref|XP_002945359.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 399
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 130/291 (44%), Gaps = 10/291 (3%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
E F FFR+S +FD + ++++ L SR S + + +S E+++ + LR LA+G+S
Sbjct: 61 EKFVTFFRMSITSFDELLTILKPGL-SRARSLMRDP----ISPEERLCLTLRFLATGQSF 115
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCG 179
S+ F +G++T+ ++ + E + + PD N EI F + NC G
Sbjct: 116 SSLYFQFLIGRTTIGRIVRETCLLIWSELQRIVMPSPDENTWVEIAEDFHKKTNFPNCLG 175
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
A+D HI +T+P + + + + +S+++ +VD F I G G + +
Sbjct: 176 ALDGKHIRVTMP-FNSGSKYFNYKKYFSVVLLAVVDANYCFTIIDVGAYGSTGDASAFRN 234
Query: 240 SGFFK-LCEAGQRLNGNVRISSEEVELREYI-VGGVGYPLLSWLITPYETNGLSASMPTF 297
S + L E RL + Y+ VG + L ++ PY + S F
Sbjct: 235 SALGRQLTEGTLRLPLPKPLPGTAAPPMPYVFVGDEAFGLTENIMRPYPGSQRSVQKRLF 294
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
N R + AF L WR+ + + + + II CC+LHN +
Sbjct: 295 NYRLSRARRMVECAFGILANKWRVFHTAL-QLEPEFVDKIIKACCVLHNFV 344
>gi|449671020|ref|XP_004207411.1| PREDICTED: uncharacterized protein LOC101237154 [Hydra
magnipapillata]
Length = 311
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/316 (28%), Positives = 135/316 (42%), Gaps = 45/316 (14%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
EE K FRV++ TF++ LV E P G R +SV K+VA+AL LAS
Sbjct: 2 EECELKEHFRVNRNTFNF---LVNE---LHPHLGKTTTTMREPISVVKRVAVALHYLASC 55
Query: 118 ESQVSVGVA---FGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSK-FEESF 172
E +V V FG+G+ST + + FI A+ + ++K+P S S+ F++
Sbjct: 56 EEYRNVRVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFPLSVEYLNKHSRDFKDIL 115
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
C GA+D HI ++ P Q + + + + YS+++ +VD RFI G P
Sbjct: 116 DFPQCVGAVDGCHIPISAPKDQ-AISYYNYKGWYSIVLFAVVDCRYRFIYTSVGSP---- 170
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLS 291
E G L V L + G +PL L+ PY N LS
Sbjct: 171 --------------ELGDSL----------VPL--CLTGDSAFPLTRHLLKPYPENLELS 204
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN + R + AF ++ +R++ K M D I+ C LHNI
Sbjct: 205 EIQKNFNKILCGARRVVENAFGCVRARFRVICKRM-ECDINFATRIVNACVTLHNICEYY 263
Query: 352 GDQLHPDVALSDHHDS 367
D + + + H DS
Sbjct: 264 DDIIIIEWLMHHHDDS 279
>gi|449683828|ref|XP_004210468.1| PREDICTED: uncharacterized protein LOC101239328 [Hydra
magnipapillata]
Length = 302
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/346 (26%), Positives = 150/346 (43%), Gaps = 62/346 (17%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASG 117
EE FK FRV++ TF++ LV E P G R +SV K+VA+AL LAS
Sbjct: 2 EECEFKEHFRVNRNTFNF---LVNE---LHPHLGKTTTTMREPISVVKRVAVALHYLASC 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEESFGLL 175
E V FG+G+ST + + +FI A+ + ++K+P S + + FE G
Sbjct: 56 EEYHVVSSLFGIGKSTANLIVHKFINAVNDILLPIYVKFPLSVENLNKHSRDFEAILGFP 115
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C GA+D HI ++ P Q + + + YS+++ +VD RFI G PG N S
Sbjct: 116 QCVGAVDGCHIPISAPKDQAISYY-NYKRWYSIVLFAVVDCRYRFIYTSVGSPGRNNDSY 174
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+L+ S + E+ + + + EL G + LL ++
Sbjct: 175 ILQNSSLKAILES-KLFDKCCK------EL------GDSFVLLCFI-------------- 207
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
R +A AF +K +R++ + M D I+ C +LHNI +
Sbjct: 208 --------ARRVAENAFGCVKARFRVICEHM-ECDINFATRIVNACVILHNIC-----EY 253
Query: 356 HPDVA----LSDHHDSGYGEQCCKQVDPM-----GRTTRENLEKHL 392
+ D+ L HHD + V + G+ R+++ K+L
Sbjct: 254 YDDIIIIEWLMHHHDDSLAQP--NTVSTLGNNGPGKNVRDSIAKYL 297
>gi|209863095|ref|NP_001129460.1| uncharacterized protein LOC100191015 [Danio rerio]
gi|190337069|gb|AAI62733.1| Similar to ENSANGP00000010363 [Danio rerio]
gi|190338874|gb|AAI62738.1| Similar to ENSANGP00000010363 [Danio rerio]
Length = 394
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 137/317 (43%), Gaps = 19/317 (5%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F+ +FR+ ++ FD + S V + + + +IE +++AI LR LA+G+S +
Sbjct: 53 FQRYFRLDREQFDSLLSKVGPQIARQDTNYRQSIEP-----AERLAICLRFLATGDSYRT 107
Query: 123 VGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
+ ++ VG STV+ + A+ + A+ + P + I + F + NC G+I
Sbjct: 108 IAFSYRVGVSTVAGIVAAVTRAIWDTLAQEVMPVPTTEDWRNISTDFLHRWNFPNCLGSI 167
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D H+++ P + + + + YS+++ +VD + RF + G G M+ +L S
Sbjct: 168 DGKHVVIKAPD-NSGSLFYNYKGTYSVVLLAVVDSQYRFRVVDVGSYGRMSDGGVLANSI 226
Query: 242 FFKLCEAGQR-LNGNVRISSEE--VELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
F + G L + +S E V +PL L+ P+ + LS FN
Sbjct: 227 FGQALRDGALGLPQDALLSGAEHFGPQPHVFVADEAFPLRRDLMRPFPGHNLSGRQRIFN 286
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC----CLLHNIIIDSGDQ 354
R + F L WR M+R P+ + C C+LHN + +
Sbjct: 287 YRLSRARLIVENTFGILTAQWR-----MYRGAIEISPANVDACVKATCVLHNFLRSTTST 341
Query: 355 LHPDVALSDHHDSGYGE 371
P + +D +G E
Sbjct: 342 RIPLPSAADGDAAGLQE 358
>gi|345494085|ref|XP_003427214.1| PREDICTED: hypothetical protein LOC100678860 [Nasonia vitripennis]
Length = 954
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 99/218 (45%), Gaps = 22/218 (10%)
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
S+ FGV +ST T+R + AL +++ WP +E + E+ FG GA+
Sbjct: 719 SITSKFGVAKSTAWCATYRVVRALCNYRNYYISWPSPAEAQETADRIEQRFGFPGVIGAL 778
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D T I + + ++ + +++ +S+ +Q I + ++ FI+ G PG ++ R+ ++SG
Sbjct: 779 DRTDIYIA-ALRRDANAYINRKGRHSIQLQVICNDKLEFINCYAGMPGSVHDMRVYRYSG 837
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTFNSL 300
C Q N Y++ Y L ++ PY +G L+ FN++
Sbjct: 838 VQNQCN-DQFFPNNT-----------YLLADAAYTLQRHIMVPYRDHGHLTLEEICFNTM 885
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSII 338
+R + +A L K+ WR D+ + P II
Sbjct: 886 LSRSRMMVERAIGLL--------KMRWRNDEFQYPVII 915
>gi|449663129|ref|XP_004205687.1| PREDICTED: uncharacterized protein LOC101241637 [Hydra
magnipapillata]
Length = 231
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/233 (30%), Positives = 108/233 (46%), Gaps = 12/233 (5%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR-LLSVEKQVAIALRRLASG 117
EE FK FRV++ TF++ LV E P G R +SV K+VA+AL LAS
Sbjct: 2 EECEFKEHFRVNRNTFNF---LVYE---LHPLLGKTTTTFREPISVVKRVAVALHYLASC 55
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEESFGLL 175
E V FG+G+S + + FI A+ + ++K+P S + + FE G
Sbjct: 56 EEYRGVSSLFGIGKSIANLIVHEFINAVNDILLPKYVKFPLSVENLIKHSRDFEAILGFQ 115
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C GA+D HI + P Q + + + + YS+++ +VD RFI G PG N S
Sbjct: 116 KCIGAVDGCHIPIFAPKDQ-AISYYNYKGWYSIVLFSVVDCRYRFIYTSVGSPGRNNDSY 174
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
+ + S + E+ + V L ++G +PL L+ PY N
Sbjct: 175 IFQNSSLKAILESSLFDKCCKELGDSLVPLC--LIGDSAFPLTRHLLKPYPKN 225
>gi|221114951|ref|XP_002160191.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 319
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/252 (26%), Positives = 113/252 (44%), Gaps = 18/252 (7%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQV-------TWRFIEALEERAKHHLKWPD 157
+++ I +R LASGESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLASGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++++ C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKEFDQNWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
F + G G N + +L S F + G N IS + ++ VG + L
Sbjct: 146 YCFTMVDIGAYGRDNDAAILNASTFGRAFNKGYFNLPN--ISEFDPKVPPVFVGDDIFAL 203
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPS 336
WL+ PY L+ FN R +F L WRI S + +P K+
Sbjct: 204 KPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSFGILAARWRIYRSPIKAKP--LKVEH 261
Query: 337 IILVCCLLHNII 348
II LHN +
Sbjct: 262 IIKATVCLHNYL 273
>gi|328699180|ref|XP_003240854.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 322
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 113/237 (47%), Gaps = 21/237 (8%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
E ++ F R+S TF+ + LV ++ P +LSV + + LR LASGES
Sbjct: 69 ELYQNFLRMSASTFEELVCLVGPKIMRFP------TRPDILSVGEVLTATLRYLASGESM 122
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESFGLLNCCG 179
+S+ +F +G++TVS++ ++ E L + + L PD+ + ++ +FE + + NC G
Sbjct: 123 MSIMYSFRIGKATVSKLIFQCCEVLWDTLNTNVLIVPDTKKWAQLGVEFENKWQVPNCIG 182
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
+ID HI+ A S+++ + + ++S+++ + D F + G G +
Sbjct: 183 SIDGKHIVHQAFANSGSENY-NYKRSHSIILLAMCDASYNFTIVDIGADGRCSD------ 235
Query: 240 SGFFKLCEAGQRLNGN-------VRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
G F E G+ N I S + Y +G +PLL+ L+ PY G
Sbjct: 236 GGVFSNSEMGKGFMANNLNFPTAKDIDSNSGPIPYYALGDEAFPLLTNLMRPYPGRG 292
>gi|449673827|ref|XP_004208041.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 398
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 105/224 (46%), Gaps = 6/224 (2%)
Query: 126 AFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNR-MEEIKSKFEESFGLLNCCGAIDA 183
FG+ Q TVS+ +A+ +L P + M ++ S+FE F ++ G ID
Sbjct: 134 VFGIHQCTVSKTVKVVCDAINNIVGPIYLHLPKNKEDMTKLASQFEVKFDMIQAFGCIDG 193
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
TH+ + P D +C ++ +S+ VQ + D + FID+ WPG ++ +++ S
Sbjct: 194 THVQIKRPIKNGQDYFCYKQ-YFSLNVQAVCDIKGYFIDVECKWPGSVHDAKMFTNSTIN 252
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEA 303
K G L + + Y++G YPL ++ I +++ + + FNS+ +
Sbjct: 253 KKFIKGT-LPQTLYSLPNYHSIPNYLIGDPAYPLTNFCIKEFQSCSNNEEV-IFNSMLRS 310
Query: 304 TRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
R+ AF +LK W L K++ +P +I C +LHN
Sbjct: 311 ARNQIECAFGRLKARWGFLRKII-DIKIETVPIVIYTCFVLHNF 353
>gi|449678540|ref|XP_002170519.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 114/255 (44%), Gaps = 24/255 (9%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++++ C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKEFDQNWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G N + +L S F G+ N +IS + ++ +VG
Sbjct: 146 YCFTMVDIGAYGRDNDAAILNASTF------GRTFNKGYFNLPKISEFDPKVPPVLVGDD 199
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
+ L WL+ PY L+ FN R +F L WRI + + + K
Sbjct: 200 IFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSFGILAARWRIYRSPI-KANSLK 258
Query: 334 LPSIILVCCLLHNII 348
+ II LHN +
Sbjct: 259 VEHIIKATVCLHNYL 273
>gi|331237376|ref|XP_003331345.1| hypothetical protein PGTG_12667 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309310335|gb|EFP86926.1| hypothetical protein PGTG_12667 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 436
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 127/305 (41%), Gaps = 31/305 (10%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E FK R +K+ F ++ D +S P L + Q+A+ L RL S
Sbjct: 101 RDEDFKQSVRTTKEAFLWLL-----DRISLHPVFHSQSHRPQLPIPHQLALTLERLGSNG 155
Query: 119 SQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKF-EESFGL 174
+ VG+ VG+ T+ +V+ R IEA+ + H+ WPD R EI EE FG
Sbjct: 156 NGALVGIFSRNLTVGRGTIIKVSRRVIEAINSLSSKHVVWPDRYRRAEISDVMKEEGFG- 214
Query: 175 LNCCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
C G +D T I + P ++ W D++ YS+ Q + D + GWPG
Sbjct: 215 -GCVGFVDGTTIPLHQRPGLEGEAYW-DRKKKYSVNCQIVCDCDRYITSFTVGWPGSCGD 272
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
S + + + +AG + +Y++ Y L I Y+ +
Sbjct: 273 SWVFRNTTLHM--QAGDYFDAG-----------QYLIADSAYGLSCTTIPAYKAPASNKR 319
Query: 294 MPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK-LPSII--LVCCL-LHNII 348
T FN +R LKG W L ++ K+ + II + CC+ LHN++
Sbjct: 320 DNTDFNYCLAKSRVRNEHTIGILKGRWASLQQLRLHLHKKSHMKEIIRWVSCCITLHNML 379
Query: 349 IDSGD 353
+ GD
Sbjct: 380 SNLGD 384
>gi|449677010|ref|XP_004208757.1| PREDICTED: uncharacterized protein LOC101236952 [Hydra
magnipapillata]
Length = 180
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/187 (31%), Positives = 92/187 (49%), Gaps = 18/187 (9%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDL-----VSRPPSGLINIEGRLLSVEKQVAIALRR 113
EE FK FRV++ TF+++ + + L R P +SV K+VA+AL
Sbjct: 2 EECVFKEHFRVNRNTFNFLVNELHPHLGKTTTTMREP----------ISVVKRVAVALHY 51
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP-DSNRMEEIKSKFEES 171
LAS E V FG+G+ST + + FI A+ + ++K+P + + FE
Sbjct: 52 LASCEEYRVVSSLFGIGKSTANLIVHEFINAVNDILLSKYVKFPLPVENLNKHSRDFEAI 111
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G C GA+D HI + P Q + + + + YS+++ +VD + RFI G PG
Sbjct: 112 LGFPQCVGAVDGCHIPILAPKDQ-AISYYNYKGWYSIVLFSVVDCQYRFIYTSVGLPGRN 170
Query: 232 NVSRLLK 238
N S +L+
Sbjct: 171 NDSYILQ 177
>gi|221128761|ref|XP_002160578.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 115/256 (44%), Gaps = 26/256 (10%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLATGESQQTQSFCFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++++ C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKEFDQNWNFPQCIGAIDGKHVRIEAPA-KSGSYFYNYKRFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G N + +L S F G+ N +IS + ++ +VG
Sbjct: 146 YCFTMVHIGAYGRDNDAAILNASTF------GRAFNKGYFNLPKISEFDPKVPAVLVGDD 199
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKR 332
+ L WL+ PY L+ FN R +F L WRI S + +P
Sbjct: 200 IFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIKNSFGILAARWRIYRSPIKAKP--L 257
Query: 333 KLPSIILVCCLLHNII 348
K+ II LHN +
Sbjct: 258 KVEHIIKATVCLHNYL 273
>gi|221122677|ref|XP_002156551.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 114/256 (44%), Gaps = 26/256 (10%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++ + C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKEFDQDWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G N + +L S F G+ N +IS + ++ +VG
Sbjct: 146 YCFTMVDIGAYGRDNDAAVLNASTF------GRAFNKGYFNLPKISEFDPKVPRVLVGDD 199
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKR 332
+ L WL+ PY L+ FN R +F L WRI S + +P
Sbjct: 200 IFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSFGILAARWRIYRSPIKAKP--L 257
Query: 333 KLPSIILVCCLLHNII 348
K+ II LHN +
Sbjct: 258 KVEHIIKATVCLHNYL 273
>gi|157423504|gb|AAI53379.1| Zgc:162945 [Danio rerio]
Length = 351
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 138/338 (40%), Gaps = 36/338 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S + Y+C L+ E + P +V + + IALR SG VG A
Sbjct: 45 YRFSAEGMLYLCRLL-EPHIKNPTR-----RSHATTVPQMICIALRFFTSGTFLYEVGDA 98
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ ++TV + R + AL+ + +P + IK F + GL GAID HI
Sbjct: 99 EKLSKNTVCRTIRRVVIALQRYINTFVAFPGHLPTQAIKEGFSQIAGLPGVIGAIDCIHI 158
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
++ P + + ++ +S+ VQ DH+ + WPG M +++ + S KLC
Sbjct: 159 PISTPVKEIEATFLNRNATHSINVQMTCDHQCLITSLDARWPGSMQDNQIFEKS---KLC 215
Query: 247 EAGQR--LNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT 304
+ Q+ +G +VG S+L+TPY + FN T
Sbjct: 216 QRFQQGLFDG-------------VLVGDGTSACQSFLLTPYPEPK-TKPQHEFNIALSQT 261
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALS-D 363
R LK + L + P++ I+ C +LHNI +Q+ + S D
Sbjct: 262 RLKIDNTLAILKARFNCLRDLRVSPERAS--QIVGACAVLHNIASIRKEQMPSECQQSND 319
Query: 364 HHDSGYGEQCCKQVDPMGRTTRENL--EKHLQHNQEKD 399
DS + P GR R + E +QH+ +++
Sbjct: 320 DVDSVSRDH------PAGRAVRHAITEEFFMQHSIKQN 351
>gi|350536511|ref|NP_001233059.1| uncharacterized protein LOC100569528 [Acyrthosiphon pisum]
gi|239789800|dbj|BAH71500.1| hypothetical protein [Acyrthosiphon pisum]
Length = 353
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 146/333 (43%), Gaps = 29/333 (8%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
++ F+ FR+SK + L D ++ S L + + S+ K + +ALR A+G
Sbjct: 39 DDHDFRIRFRISK-----VVVLQVLDHINDQISSLTDRNHAVTSINK-LLLALRFYATGN 92
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIKSKFEESFGLLNC 177
++ G GV ++T S + A+ + ++ P + R + +++ F +
Sbjct: 93 FLITSGDFLGVSKTTASLIVRDVSIAIAKLRPRFIQMPTTEREISKLQRSFYQIARFPRT 152
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GAID TH+ + P ++ + +++ +S+ VQ I ++ +D+V WPG + +
Sbjct: 153 IGAIDCTHVKIQNPGGPNAEYFRNRKGYFSINVQTIACPNLKIMDVVARWPGSCHDQTIF 212
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
K S + G+ N IV GY L+TP+ N + +
Sbjct: 213 KKSQIYYNLINGKWGNS-------------LIVADSGYANSRHLVTPF-LNPRNDIEELY 258
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
N TR+ +++ LK + +LS + R ++I+ CC+LHNI D+ D P
Sbjct: 259 NESIIRTRNPVERSYGVLKRRFPVLS-LGLRLKLETTQAVIVACCVLHNIACDNNDMDPP 317
Query: 358 --DVALSDHHDSGYGEQCCKQVDPMGRTTRENL 388
D+ L ++ + EQ P G R+ L
Sbjct: 318 ALDIVLPENENINIEEQ-----QPEGENARQQL 345
>gi|361069811|gb|AEW09217.1| Pinus taeda anonymous locus UMN_586_01 genomic sequence
Length = 141
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 71/151 (47%), Gaps = 18/151 (11%)
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+VD F DI G PG M +L+ S FKL G NG ++VG
Sbjct: 1 VVDSTGSFTDICIGLPGSMTDDVVLENSDLFKLGMNG-FFNGG------------WVVGS 47
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
YPLL WL+ PY + L+ + FN + ++ AF +LKG WR L K R + +
Sbjct: 48 SAYPLLDWLLVPYVQHNLTWTQHAFNEKISEIQKISKDAFGRLKGRWRCLEK---RTEVK 104
Query: 333 --KLPSIILVCCLLHNIIIDSGDQLHPDVAL 361
LP ++ CC+LHNI + PD+A
Sbjct: 105 LLDLPVVLGACCVLHNICEQHNEDFAPDLAF 135
>gi|301627133|ref|XP_002942730.1| PREDICTED: putative nuclease HARBI1-like, partial [Xenopus
(Silurana) tropicalis]
Length = 226
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 115/252 (45%), Gaps = 48/252 (19%)
Query: 100 LLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN 159
L + K +A+ L L SG Q G+ Q T S++ + + AL P S
Sbjct: 18 LPGISKLLAV-LHFLGSGSFQQVSARLVGMSQPTFSRILKQVLRAL---------LPHSQ 67
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ GAID TH+ +T P + + +++ ++S+ VQ + D +R
Sbjct: 68 SL-----------------GAIDCTHVPLTPPRAH-QERYLNRKRSHSINVQVVCDSHLR 109
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
+ + +G+PG ++ + +L+ S ++ G+ L G ++VG GY +L
Sbjct: 110 IMSVRSGFPGSVHDAHILRQSALYERFTQGEMLRG-------------WLVGDAGYGVLP 156
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLP 335
WL+TP A +N H TR++ + F LK +R LS +++ P K+
Sbjct: 157 WLMTPVRFPRTPAQR-RYNRAHRKTRNVIERLFGVLKLRFRCLSVTGGALLYSPI--KVS 213
Query: 336 SIILVCCLLHNI 347
II+VC +LHN+
Sbjct: 214 EIIVVCAMLHNV 225
>gi|346465349|gb|AEO32519.1| hypothetical protein [Amblyomma maculatum]
Length = 357
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 74/293 (25%), Positives = 128/293 (43%), Gaps = 9/293 (3%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
++ RVS++ F ++ + VR + R ++ E ++ + LR L SGES
Sbjct: 59 YRKLLRVSREQFLHLLARVRARIEK---ENAATGARRPVNAETRLQVTLRYLTSGESYEF 115
Query: 123 VGVAFGVGQSTVSQVTWRF-IEALEERAKHHLKWPDSNR-MEEIKSKFEESFGLLNCCGA 180
+ F V ++ + + I EE K +K P +N +E+ F + NC GA
Sbjct: 116 LSRQFRVARTKCGDLIHKTCIVIFEELKKDFMKAPKTNEDWKEVMRVFGDKCNFPNCLGA 175
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
I H+ + PA + S + + + +S+++ VD +FI I G PGG + L + +
Sbjct: 176 IGGKHVTIKKPA-RASYMYLNHKKTFSVILLAAVDANGKFIYIEAGAPGGQDSGPLWQNA 234
Query: 241 GFFK-LCEAGQRLNGNVRI-SSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
K + E R+ V + SS V L V PL ++ P+ L+ FN
Sbjct: 235 VLQKAIQEKRTRIPELVEVASSPNVLLPPAFVTDSALPLEKHVVKPFAGTSLTEDKGVFN 294
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
+ AF QL+ +R L V+ + + +I+ C LHN + +S
Sbjct: 295 YRLSKAWLVMETAFKQLEDRFRCLRTVIQEQPGKAV-AIVTAACALHNFLGNS 346
>gi|328698183|ref|XP_003240572.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 240
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 179 GAIDATHIIMTLPAVQTSDD-------WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
GAID HI + P + + + +++ ++L+QGIVD +++FID GWPG
Sbjct: 28 GAIDGCHISILAPWEKRTKMPKLNRTMFYNRKQVPTVLLQGIVDSDLKFIDCFAGWPGSS 87
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE--YIVGGVGYPLLSWLITPYETNG 289
+ +R+ F+ G++L +S V L +I+G YPL S L+ P++ NG
Sbjct: 88 HDARV------FRRSIIGEKL-----LSQPCVILPPGCHILGDGAYPLTSTLMVPFKDNG 136
Query: 290 -LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
LS S FN ++R + +AF +L G +R L K M K +I C LHN+
Sbjct: 137 HLSDSQLKFNKCLSSSRVVIEQAFRKLIGRFRKL-KHMDIYHKENCSKVITAACCLHNLC 195
Query: 349 IDSGDQLH-PDVALSDHHDSGYGE 371
ID+ D + ++ S+ D + E
Sbjct: 196 IDNSDDFNSTEIYTSEMEDGNHHE 219
>gi|115694524|ref|XP_797119.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 378
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 65/297 (21%), Positives = 137/297 (46%), Gaps = 13/297 (4%)
Query: 57 SDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
+++ F++F R+ FD + + L + + ++E L +VAI LR LAS
Sbjct: 65 NEDPRAFQHFMRMPPAVFDEVVQRLTPRLTKQDTNYRASLEPGL-----KVAITLRHLAS 119
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWPDSNR-MEEIKSKFEESFGL 174
G + ++ A+ V +T+S V ++A +EE L P + + ++ ++ + +
Sbjct: 120 GNTYRNMQYAWRVPHNTISVVVREVVKAIIEEYTDELLFCPTTEQGWRDLADQWYQRWNF 179
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
+ GAID H+ P + ++ + E YS+++ +VD + +F I G + +
Sbjct: 180 PHTVGAIDGKHVACKAPP-NSGSEFYNYEGFYSVILFAMVDADYKFTYIDVSGNGSSSDA 238
Query: 235 RLLKFSGFFKLCEAGQRLNGNVR---ISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
+ S + + R++ + + ++ ++ +I+G + L ++L+ PY T L+
Sbjct: 239 HIYNESDLHRGLDQN-RIHAFPQPDPLPNDNQDVPYFIIGDDAFSLRTYLMKPYSTRNLT 297
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN R + F L ++IL M + + +I+ CC+LHN++
Sbjct: 298 HEERIFNYRLSRARRVVENTFGILANRFQILLTTM-QHHHETVRTIVEACCILHNLM 353
>gi|340383778|ref|XP_003390393.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 386
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 123/278 (44%), Gaps = 32/278 (11%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
D E F+Y R++++ FD + V L+ R + + + +S +++ + LR L +G
Sbjct: 52 DPESHFRYI-RMTRERFDLLLQEVGPYLLHRTYNSSLRPQ---VSPAERLILTLRYLCTG 107
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH--HLKWPDSNR-MEEIKSKFEESFGL 174
SQ+S+ F +G+STV ++ +A+ A H +LK P + + I FE +
Sbjct: 108 NSQISLSFNFRLGRSTVCEILKETCDAIWN-ALHLAYLKAPSTEQEWIRISRNFETMWNF 166
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
NC GAID HI++ P + + + + +S+++ + D G G
Sbjct: 167 PNCIGAIDGKHIVIQAP-MNAGSTFYNYKGKHSIVLLAVCD---------AGEAGR---- 212
Query: 235 RLLKFSGFFKLCEAGQRLN-GNVRI-------SSEEVELREYIVGGVGYPLLSWLITPYE 286
L SG F C+ G L G + I + E L I+G G+P+ +L+ PY
Sbjct: 213 --LSDSGIFTNCQFGTALEEGLLNIPPDKALTGTSEPSLPYVIIGDEGFPIKKYLLRPYP 270
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
LS FN R + +F L WRI +
Sbjct: 271 GRYLSKEKSIFNYRLSRARRIIENSFGILAARWRIFRR 308
>gi|328709487|ref|XP_003243975.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 332
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 75/309 (24%), Positives = 140/309 (45%), Gaps = 42/309 (13%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E FK FR+SK+T + ++ +++ +ALR A+G
Sbjct: 33 DELDFKNRFRLSKET----------------------VINNAITPAQKLLLALRFYATGS 70
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL---L 175
+S G GV +S + AL + +K P++N +EIK ++ +GL L
Sbjct: 71 FSISAGDFMGVSKSAACVIVHDVSVALAKLRPQIVKMPETN--DEIKELHKQFYGLAKFL 128
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
GAID THI + P ++ + +++ +S+ VQ +V +++ +DIV WP + S
Sbjct: 129 LVIGAIDCTHIKIQSPGGPNAEYFRNRKGWFSLNVQTVVSPKLKIMDIVVRWPDSTHDST 188
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ S ++N ++ ++ + IV GY ++TP+ N +
Sbjct: 189 IFSHS----------KINRDLHVTQKWAN--SLIVADSGYANNMHIVTPF-LNPHAGPEN 235
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
+N TR+ + ++ K + +LS M R + ++I+ C +LHNI ID +++
Sbjct: 236 LYNESQIRTRNPVERCYVVWKRRFPVLSLGM-RLQICNIQTVIVACSVLHNIAIDCNEEM 294
Query: 356 -HPDVALSD 363
D+ L D
Sbjct: 295 PMDDIQLPD 303
>gi|291230836|ref|XP_002735371.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 445
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 139/314 (44%), Gaps = 21/314 (6%)
Query: 44 WAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLV-----REDLVSRPPSGLINIEG 98
+ +S G + ++ F+ F R++ + FD I V ++ R P
Sbjct: 78 YGHYTSLLGTQLRLEDPAAFRNFTRLTPEVFDDILERVAPVIQKQQTNYRHP-------- 129
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS 158
LS ++AI LR LA+G++ S+ F G ST+S++ A+ + K + P +
Sbjct: 130 --LSAGLKLAITLRHLATGDNYRSLAYGFRCGISTISELIPEVCTAIVQAYKDEVFNPPT 187
Query: 159 N--RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ +FE+ + + + GA+D HI + PA T + + + +S+ + VD
Sbjct: 188 TPEAWRNLAQQFEQRWNVPHAVGALDGKHIAIKKPA-NTGSLYHNYKGFFSITMLAFVDA 246
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ-RLNGNVRISSE-EVELREYIVGGVG 274
E +FI I G G M+ S++ S F+ E G L + + E E ++ +I+G
Sbjct: 247 EYKFIWIELGGKGHMSDSQIFTDSELFECLEDGSFGLPPSCPLPGETEPDIPYFILGDDA 306
Query: 275 YPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKL 334
+ L S+++ PY G++ +N R + L +R L + + +
Sbjct: 307 FALKSYMMKPYSRRGMTDEQRIYNYRISRGRRVVENRQTGLANRFRCLLGTLEQ-KVENV 365
Query: 335 PSIILVCCLLHNII 348
+++ LLHN++
Sbjct: 366 RNLVETAVLLHNLL 379
>gi|403182853|gb|EJY57672.1| AAEL017220-PB [Aedes aegypti]
Length = 319
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 63/240 (26%), Positives = 98/240 (40%), Gaps = 5/240 (2%)
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEE 170
LR LASG Q + + V + + + + +A+ ++ P E+ + F
Sbjct: 4 LRYLASGGLQRHLASCYRVSKQHMGSIIDQVCDAICRALSAYVADPCQESFLEVANGFNS 63
Query: 171 SFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
+ NC GAID H+ + P S + + + +S+ + I D RF + G G
Sbjct: 64 RWNFPNCIGAIDGKHVSIKAPPNAGSIFY-NYKGFHSLALMAICDASYRFTYLDVGAYGS 122
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
+ K S F RL+ + V+L + V +PL +I PY L
Sbjct: 123 EGDCNIFKESK-FGTDVLHDRLDFPENATVNGVKLPFFYVADDAFPLCKRIIKPYSKKNL 181
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW-RPDKRKLPSIILVCCLLHNIII 349
SA FN R AF L W L K ++ PD+ + II CC+LHN +I
Sbjct: 182 SAEERIFNYRLSRARRCIENAFGLLCSKWACLKKTLYCSPDRAQ--KIISACCMLHNFLI 239
>gi|331217820|ref|XP_003321588.1| hypothetical protein PGTG_03125 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 418
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 127/306 (41%), Gaps = 35/306 (11%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+ FK R +K F ++ + +R P N L + Q+A+ L RL S +
Sbjct: 85 DNDFKQAVRTTKTGFLWLLNQIRIH-----PIFYNNSTRPQLPIPHQLALTLERLGSNGN 139
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG+ TV +V+ R I+A+ E + ++ WPD +R +EI E+ G
Sbjct: 140 GASVGRFSRNLSVGRGTVVKVSRRVIQAINEISSKYIVWPDKDRRKEISGVMEKE-GFKG 198
Query: 177 CCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G +D T I + P + W D++ YS+ Q I D + +TGWPG S
Sbjct: 199 CIGFVDGTTIPLHQRPGIDGEVYW-DRKKRYSINCQVICDCDKFITSFMTGWPGSCGDS- 256
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPYETNGLSA 292
L F+ N+++ E + +Y++ Y L I Y+
Sbjct: 257 -LVFT--------------NMKVHKEPENYFDSGQYLIADTAYGLSMTTIPAYKAPLSKV 301
Query: 293 SMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL----VCCLLHNI 347
T FN +R LKG W L ++ +++K IL C LHN+
Sbjct: 302 HRNTEFNYCLAKSRVRNEHTIGILKGRWASLQQLQLSLNEQKDMMEILRWINACVALHNM 361
Query: 348 IIDSGD 353
+ GD
Sbjct: 362 LAHLGD 367
>gi|449686912|ref|XP_004211294.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 153
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 79/153 (51%), Gaps = 2/153 (1%)
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
+ FE+ +G GA+D HI T + + S+D+ ++++ +S+++QG+ D + F DI
Sbjct: 2 ASFEKLYGFPQVVGAVDGIHI-RTKASNKNSEDYINRKDYHSIILQGLADSKCLFRDIFV 60
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
W G + SR+ K S +K C A L N+ +E+ I+G + L +WL+ PY
Sbjct: 61 RWTGKFHDSRVFKNSPLYKECLARTFLPNNLNKLIANIEIGPLILGDSAFSLENWLMKPY 120
Query: 286 -ETNGLSASMPTFNSLHEATRSLAVKAFLQLKG 317
+ LS FN+ R + AF +LKG
Sbjct: 121 SDRENLSIEEAKFNTSLSKFRVVIENAFGRLKG 153
>gi|331231347|ref|XP_003328337.1| hypothetical protein PGTG_09631 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309307327|gb|EFP83918.1| hypothetical protein PGTG_09631 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 430
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 77/323 (23%), Positives = 137/323 (42%), Gaps = 33/323 (10%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
++E FK FR+S+ +F +C + D V + N G V +Q+ + L+RL
Sbjct: 100 DDENFKQEFRMSRTSFLKLCEHISNDPVFQN-----NSNGPQRPVREQLMVTLKRLGCSG 154
Query: 119 SQVSVGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
++ S+ V F + TV T R + A+ L WP + + I + + G
Sbjct: 155 NEASIDVLSRFFRLETGTVELYTNRCLMAILRLRSEVLNWPTAKERKAISVE-QAKVGFN 213
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G I+AT I +++ + D+ + +++ + D + + TGWPGG ++SR
Sbjct: 214 GCVGFINATLIPLSIAPSKNPGDFYSTKGFFAISTVIVCDGQQNITYLYTGWPGGPSLSR 273
Query: 236 LLKFSGF----FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
++ SG G+ L N ++ IV G P L+
Sbjct: 274 VMSHSGLTLKPVDFFSPGEYLLANSAFTTTPT-----IVAGYKKP---------SQGQLT 319
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK----LPSIILVCCLLHNI 347
FN+ +++L LKG ++ L+ + R D +K + + I+ C +LHN
Sbjct: 320 EEEDRFNAHLLRSQALIENCIGTLKGRFQSLNGLRLRIDGKKDQIRVNAWIMACAVLHNF 379
Query: 348 II--DSGDQLHPDVALSDHHDSG 368
+ D D H +SD+ +S
Sbjct: 380 LNQEDEFDFGHVVTEVSDNSNSA 402
>gi|331229801|ref|XP_003327566.1| hypothetical protein PGTG_09100 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 517
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 116/264 (43%), Gaps = 28/264 (10%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ TV + + R I+A+ ++ ++ WPD
Sbjct: 209 LPIPHQMALTLERLGSNGNGASVGRFARNLSVGRGTVIKASRRVIQAINSLSEKYVVWPD 268
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S R EI S+ ++ GL C G +D T I + + + D++ YS+ Q + D +
Sbjct: 269 STRRAEI-SQVMKNEGLSGCVGFVDGTTIPLHQRPGLDGEVYFDRKKRYSINAQIVCDCD 327
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEE---VELREYIVGGVG 274
TGWPG S + + N +I SE + +Y++
Sbjct: 328 KFITAFTTGWPGSCGDSWVFQ----------------NSKIESEPDKYFDRGQYLIADSA 371
Query: 275 YPLLSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
Y L I Y++ T FN +R LKG W L ++ DK+K
Sbjct: 372 YGLSLTCIPAYKSPASKKPDNTDFNYCIAKSRVRNEHTIGILKGRWASLHELRLHLDKKK 431
Query: 334 -LPSII--LVCCL-LHNIIIDSGD 353
+ I+ + CC+ LHN++ GD
Sbjct: 432 HMLEIVQWVNCCITLHNMLAQLGD 455
>gi|331245395|ref|XP_003335334.1| hypothetical protein PGTG_17187 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309314324|gb|EFP90915.1| hypothetical protein PGTG_17187 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 337
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 71/263 (26%), Positives = 111/263 (42%), Gaps = 24/263 (9%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG V + TV +VT R IEAL + +++WPD
Sbjct: 39 LPIPHQLALTLERLGSNGNGASVGRFSRNLQVARGTVVKVTRRVIEALISMGRVYVQWPD 98
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+R EI S+ G C G +D T I I P + + D++ YSM Q + D
Sbjct: 99 KDRRAEI-SEVMRMEGFSGCVGFVDGTTIPIFQRPGFD-GETFFDRKKRYSMNAQIVCDC 156
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+ ++GWPG S++ + +L+ N S+ + +Y++ Y
Sbjct: 157 DRFITSFISGWPGSCGDSKVYQ----------RMQLHQN---PSQFFDQGQYLLADSAYD 203
Query: 277 LLSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMW----RPDK 331
L I Y+ + T FN R LK W L ++ R
Sbjct: 204 LTRTTIPAYKAPATEVLINTDFNYCLAKARVRNEHTIGILKSRWASLREMRLHLYNRGHM 263
Query: 332 RKLPSIILVCCLLHNIIIDSGDQ 354
R+ + C +LHN++ GDQ
Sbjct: 264 RQYLAWFYSCIILHNLLATLGDQ 286
>gi|449669963|ref|XP_004207158.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 315
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 115/257 (44%), Gaps = 28/257 (10%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLK-WP 156
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K W
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVKEW- 90
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ I +F++ + C GAID H+ + PA ++ + + + YSM++ I D
Sbjct: 91 -----QNIIKEFDQDWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDA 144
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGG 272
+ F + G G N + +L S F G+ N +IS + ++ +VG
Sbjct: 145 KYCFTMVDIGAYGRDNDAAVLNASTF------GRAFNKGYFNLPKISEFDPKVPPVLVGD 198
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDK 331
+ L WL+ PY L+ FN R +F L WRI S + +P
Sbjct: 199 YIFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSFGILAVRWRIYRSPIKAKP-- 256
Query: 332 RKLPSIILVCCLLHNII 348
K+ II LHN +
Sbjct: 257 LKVEHIIKATVCLHNYL 273
>gi|297603451|ref|NP_001054056.2| Os04g0644200 [Oryza sativa Japonica Group]
gi|38344888|emb|CAD41911.2| OSJNBa0033G05.12 [Oryza sativa Japonica Group]
gi|125591833|gb|EAZ32183.1| hypothetical protein OsJ_16389 [Oryza sativa Japonica Group]
gi|255675827|dbj|BAF15970.2| Os04g0644200 [Oryza sativa Japonica Group]
Length = 435
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 112/263 (42%), Gaps = 43/263 (16%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVS-QVTWRFIEALEERAKHHLKWPDSNRME 162
+ ++ AL RLA +V FG+ V+ + + + +R L +R+
Sbjct: 159 DHKLGAALFRLAHAAPARAVARRFGLPSPAVAARAFYEVCRTIADRLAILLDLGAPDRIA 218
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
F + L NCCGA+ I +++ Q +VD E RF+D
Sbjct: 219 RAVPGFC-ALSLPNCCGALGYARI------------------GDAVIAQALVDAEGRFLD 259
Query: 223 IVTGWPGGMNVSRLLKFSGFFK-----LCEA--GQRLNGNVRISSEEVELREYIVGGVGY 275
+ GW M + +L + + L A G+ + G+V Y +G
Sbjct: 260 VSVGWDTAMAPAEILPRTKLYSSQSLVLANAPHGELIGGSVP---------RYFLGPACC 310
Query: 276 PLLSWLITPYE----TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD- 330
P+L WL+TPY NG+S FN++H L AF ++ WR+L + W+ +
Sbjct: 311 PMLPWLVTPYNDMDAKNGMSKE-SIFNNVHSHGMRLVRNAFGHVRSRWRLLDEC-WKGEC 368
Query: 331 KRKLPSIILVCCLLHNIIIDSGD 353
+ LP +++ CLLHN +I G+
Sbjct: 369 QEALPYVVVAGCLLHNFLIKCGE 391
>gi|328701629|ref|XP_003241664.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 348
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 66/298 (22%), Positives = 139/298 (46%), Gaps = 27/298 (9%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
++ F FR++K T ++ +++ +VS+ ++ +++ + LR A+G
Sbjct: 47 DDEFLNRFRLNKDTVKFLLRIIKPKIVSQTS------RNNAVTHSQKLLMTLRYYATGSF 100
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL---N 176
+ G+ ++T ++ +A+ + +P ++ +EI++ ++ F + +
Sbjct: 101 IATCADFAGIHKTTGGKIIIEVSKAIAALRPDFIHFPTTD--DEIRTVKQDFFNIAKFPS 158
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C GAID THI + +D + +++ +SM VQ I D ++ +IV WPG + + +
Sbjct: 159 CIGAIDCTHIKIRSLGGNDADIFRNRKQFFSMNVQTICDSKLIIQNIVARWPGSSHDANI 218
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELRE-YIVGGVGYPLLSWLITPYETNGLSASMP 295
+ N ++ + E ++ +V GYP+ S++ITP N ++
Sbjct: 219 FR--------------NSAIKQHFDNREFKDCVLVADSGYPMQSYMITPM-LNPITNVEN 263
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
TFN TR+ +++ K + ILS + + + +II+ +LHNI GD
Sbjct: 264 TFNESQIRTRNPIERSYGVWKRRFPILSLEINVRNMDTVQAIIVATAVLHNIARKFGD 321
>gi|301623609|ref|XP_002941106.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 348
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 125/274 (45%), Gaps = 10/274 (3%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPD-S 158
+S E+++ LR LA+G S + G+ + + A+ E K +LK+P+ S
Sbjct: 28 ISAEQRLIATLRFLATGRSFADLKFLTGISAQALGHIIPETCNAIVETLKGEYLKFPETS 87
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ I +F + + NC GAID HI + P + + + + +S+++ IV+
Sbjct: 88 EEWQVIAQQFNDYWNFPNCGGAIDGKHIRIN-PPPNSGSYFFNYKGFFSIVLLAIVNANY 146
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI-VGGVGYPL 277
FI + G G ++ +++ + F + ++ Q N+ ++E E ++ VG + L
Sbjct: 147 EFIMVDIGKNGRLSDGGVIEQTHFNQKLKSKQL---NLPTNAETKEGLNFVFVGDEAFGL 203
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPS 336
L+ P+ L+ FN R + AF L +RI + + P+K +
Sbjct: 204 HENLLKPFPQKVLTPERKIFNYRLSRARRVVENAFGILANRFRIFHTSINLCPEK--IDM 261
Query: 337 IILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYG 370
++L CC+LHN + + V++ DH D G
Sbjct: 262 VVLSCCVLHNFLRKDNHASYAPVSMVDHEDIQAG 295
>gi|301617460|ref|XP_002938161.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 332
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 104/225 (46%), Gaps = 20/225 (8%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
+L LA+G Q GV GV Q T S+ + ++A+ + +++ +P + N IK +F
Sbjct: 117 SLHFLATGSFQKVGGVYGGVSQPTFSRCLGQVLDAIHSVSANYISFPTNRNEWNTIKRQF 176
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
G+ N G ID T++ + P + ++++ +S+ +Q + D M I++G+P
Sbjct: 177 YGVSGIPNVLGDIDCTNMALNSPQ-DREHVYRNRKSYHSLNIQMVCDATMNIRSIMSGFP 235
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G + + +L+ SG + E G+ + +I G GYP WLITP
Sbjct: 236 GSSHDAYILRQSGHYDGFETGKFPS-------------YFISGDAGYPCSRWLITPIHRP 282
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRP 329
A FN H RS+ + F LK + L K +M+ P
Sbjct: 283 RTEAEC-AFNEAHVRARSVIERTFGVLKSRFCCLDKSGGSLMYSP 326
>gi|331240515|ref|XP_003332908.1| hypothetical protein PGTG_14067 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309311898|gb|EFP88489.1| hypothetical protein PGTG_14067 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 271
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 90/206 (43%), Gaps = 25/206 (12%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
E FK FR+++ F +C+ +R + VS+ N +++Q+ +AL RL
Sbjct: 74 EASQFKQEFRMTQLAFTKLCARIRNNTVSQS-----NSHNPQHPIKEQLMVALERLGCFG 128
Query: 119 SQVSVGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ SVG+ FGVG+ T T RFI A+ ++WP +EIK+ + E G
Sbjct: 129 NGASVGMLARFFGVGEGTFELCTNRFIMAILRIKTQIIQWPSPEDRKEIKANYAE-VGFD 187
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G ID I +T + D+ ++ +Y + D + TGWPG
Sbjct: 188 GCVGLIDGVLIPLTECPSKNGSDFYSRKGSYGITTLIACDSNRNINFLYTGWPG------ 241
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSE 261
C QR+ GN R++ E
Sbjct: 242 ----------CSHDQRVMGNSRLALE 257
>gi|328701313|ref|XP_001947463.2| PREDICTED: hypothetical protein LOC100164328 [Acyrthosiphon pisum]
Length = 383
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/316 (23%), Positives = 140/316 (44%), Gaps = 40/316 (12%)
Query: 39 WWPSFWAKNSSTPGATIPS---DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLIN 95
W W K + T+ + D+E+ F +FR+++ TF ++ E+L+++ +
Sbjct: 38 WVHDLWKKRKTEGEFTLYNGLIDDEQKFYEYFRMTQYTF-HVLHEKLENLITKKETHW-- 94
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKW 155
+ + ++++A+ LR LA+G+S S+ FI A
Sbjct: 95 --RKPIPSKERLAVCLRYLATGDSMRSIS----------------FIMAA---------- 126
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
P+ + +EI S+F S+ NC GAID H+ + P S +C ++ +S+++ +VD
Sbjct: 127 PNEEKWKEIASEFWTSWNFPNCLGAIDGKHVTIQAPPNSGSQYFCYKK-TFSVVLLALVD 185
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ-RLNGNVRISSEEVELREYIVGGVG 274
FI + G G + +L S K E + + + E+ I+G
Sbjct: 186 AHYNFIAVDVGSYGKNSDGGILNHSMLGKALEQNKLDIPEKAALPGTTNEVPFVIIGDEA 245
Query: 275 YPLLSWLITPYETNGLSASMP-TFNSLHEATRSLAVKAFLQLKGGWRILS-KVMWRPDKR 332
+PL ++L+ PY L + +N R + AF L +RI + ++ +P+
Sbjct: 246 FPLKTYLLRPYPGKNLDCNEKRIYNYRICRARRVVENAFGILSQKFRIYNRRIQAKPE-- 303
Query: 333 KLPSIILVCCLLHNII 348
+ II+ C+LHN I
Sbjct: 304 NVDVIIITTCILHNFI 319
>gi|449679701|ref|XP_004209397.1| PREDICTED: uncharacterized protein LOC101238054 [Hydra
magnipapillata]
Length = 198
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 91/190 (47%), Gaps = 5/190 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS- 158
+SV K+VA+ L LAS E V FG+G+STV+ + FI A+ + ++K+P S
Sbjct: 5 ISVVKRVAVGLHYLASCEEYRVVSSLFGIGKSTVNLIVHEFINAVNDILLPKYVKFPLSV 64
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ + F+ G C GA+D HI ++ P Q + + + + YS+++ +VD
Sbjct: 65 ENLNKHSRDFKAILGFPQCVGAVDGCHIPISTPKDQ-AISYYNYKRWYSIVLFAVVDCRY 123
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
RFI G PG N S +L+ S + E+ + V L +G +PL
Sbjct: 124 RFIYTSVGSPGRNNDSYILQNSSLKAILESNLFDKCCKELGDSLVPLC--FIGDSAFPLT 181
Query: 279 SWLITPYETN 288
L+ PY N
Sbjct: 182 RHLLKPYPEN 191
>gi|328726675|ref|XP_003248996.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 337
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 110/256 (42%), Gaps = 23/256 (8%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD--SNRME 162
+ +A+ LR L SG Q+ + + + +G +T+ ++ + AL + + +P+ NR
Sbjct: 32 EMLAVTLRYLGSGSDQIDLHLTYRLGHTTIGKILRKVCNALWDCLREE-SFPEFTENRWR 90
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
EI FE+ NC GAID H+ + P + S + + +N +S+++ IVD + FI
Sbjct: 91 EIAEGFEKYCQFPNCLGAIDGKHVRIRKPRISGSLFY-NYKNFFSIVLLAIVDAKYNFIY 149
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCE-------AGQRLNGNVRISSEEVELREYIVGGVGY 275
I G G + S + + S ++ E G+ L G V + VG +
Sbjct: 150 IDVGAFGKESDSTVFERSNLYEQLENNELHIPRGKPLPGTV-----SPNMPYTFVGDEAF 204
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
L ++ PY L FN R AF L W+I K P L
Sbjct: 205 SLSKNIMRPYSGKYLVDKKRIFNYRLSRARRNVESAFGILSNKWKIFHK----PINANLD 260
Query: 336 SIILV---CCLLHNII 348
IL+ CC LHN +
Sbjct: 261 LSILIVKTCCALHNFV 276
>gi|403171294|ref|XP_003330542.2| hypothetical protein PGTG_12079 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169116|gb|EFP86123.2| hypothetical protein PGTG_12079 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 87/334 (26%), Positives = 136/334 (40%), Gaps = 39/334 (11%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E FK R +K F ++ S V + P N L + Q+A+ L RL S +
Sbjct: 102 DEDFKQAVRTTKAGFTWLLSQVNLN-----PIFHSNSFRPQLPIPHQLALTLERLGSNGN 156
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG GVG+ TV + + R I A+ + ++ +L WPD R +EI S + G
Sbjct: 157 GASVGRFSRNLGVGRGTVVKASRRVIRAINDLSEKYLTWPDEVRRKEI-SDVMKCEGFEG 215
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C G +D T I + + + D++ YS+ Q + D + +TGWPG S +
Sbjct: 216 CVGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVVCDCDRFITAFMTGWPGSCGDSMV 275
Query: 237 LKFSGFFK----LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
K K + GQ Y++ Y L I Y+
Sbjct: 276 FKRMMLHKEPTLFFDRGQ-----------------YLIADSAYELGVHCIPAYKAPAAYI 318
Query: 293 SMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRP-DKRKLPSIILVCCLLHNI 347
+ FN +R LKG W L + + +P D ++ + C LHNI
Sbjct: 319 KENSDFNYCLARSRVRNEHTIGILKGRWASLQHLRLAIQKPSDMMEVIRWVNCCVTLHNI 378
Query: 348 IIDSGDQ-LHPDVALSDHHDS---GYGEQCCKQV 377
+ GD DV+++DH S G E+ ++V
Sbjct: 379 LAHLGDAWAELDVSINDHPGSDGPGVNEETAQEV 412
>gi|328707695|ref|XP_001951939.2| PREDICTED: hypothetical protein LOC100163229 [Acyrthosiphon pisum]
Length = 614
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 143/332 (43%), Gaps = 43/332 (12%)
Query: 39 WWPSFWAKNSSTPGATIPSDE----EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLI 94
W F N++T G + E E FK F+R+S+++F+ + +VR L + + +
Sbjct: 123 WVHPFIKNNANTLGTFSVTKELAMYPEKFKNFYRMSQESFNVLSHMVRTKLQKKDTNFRL 182
Query: 95 NIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTV-------SQVTWRFIEALEE 147
+I SVE+++ I LR LA G + + + F G+ST SQV W + +
Sbjct: 183 SI-----SVEERLLITLRFLAMGGNMKAHSMYFLRGESTTRTIISETSQVIWDCLHDI-- 235
Query: 148 RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYS 207
++ P +++ +F + + + NC GAID H + P+ T + + ++ +S
Sbjct: 236 ----YMPIPSKEHWKKVADRFYDLWNIPNCIGAIDGKHFKIKCPS-NTGSAYFNYKHYFS 290
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEV---- 263
+++ VD + F+ I G G + R+ + S E N + I +V
Sbjct: 291 VVLMACVDADGLFLTIDVGDYGRNSDGRVFRRSSLGITLE-----NNALDIPEPKVLPGW 345
Query: 264 ----ELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGW 319
+ Y V +PL + ++ P+ L+ +N R +F L +
Sbjct: 346 ENKDKFPHYFVADEAFPLKTNIMRPFPKRSLNKERRIYNYRCSRARRSVECSFGMLVSKF 405
Query: 320 RILSKVMWRPDKRKLPS---IILVCCLLHNII 348
R+ + P K+ + +I C+LHN I
Sbjct: 406 RLFEQ----PIGCKVETAEALIKAACVLHNFI 433
>gi|301619233|ref|XP_002939009.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 396
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 132/295 (44%), Gaps = 25/295 (8%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F+ FR+S +FD + ++++ L SR + + + +S E+++ + LR LA+G+S S
Sbjct: 63 FQKHFRMSTTSFDELLTILKPGL-SRTQTLMRDP----ISPEERLCLTLRFLATGQSFSS 117
Query: 123 VGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
+ + F +G++T+ ++ L E + + PD N+ +I F E NC GA+
Sbjct: 118 LYIQFHIGRTTIGKIVRETCLLLWSELQRIVMPSPDENKWMQIAEDFHEKVNFPNCAGAL 177
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
HI + +PA + S + + ++ +L V++ ID+ + G N S
Sbjct: 178 GGRHIRVRMPANRGS-KYYKKNSSVVLLAAADVNYCFSVIDVGSYHSTG-NASA------ 229
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS--------WLITPYETNGLSAS 293
F+ E G++L+ L +V + Y L +I PY + S
Sbjct: 230 -FQNSELGRQLSERTLHLPLPKPLPGTVVPNMPYVFLGDEAFGLAENVIQPYPGSQKSVQ 288
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN + + AF L W +L +PD + ++I CC+LHN +
Sbjct: 289 KRVFNYRFSRAQRILECAFGILSNKWHVLHTAKLKPD--FVSTVIKACCVLHNFV 341
>gi|241566933|ref|XP_002402225.1| transposase, putative [Ixodes scapularis]
gi|215500005|gb|EEC09499.1| transposase, putative [Ixodes scapularis]
Length = 283
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 132/293 (45%), Gaps = 22/293 (7%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
+ F+ FR+S TF Y+ + R + + + I +VEK+VAI+L RL S +
Sbjct: 6 QHFRACFRLSSTTFRYLVDVCRSSMQRQDTAIKTTI-----TVEKRVAISLYRLCSTAEE 60
Query: 121 VSVGVAFGVGQSTVSQVTWRF----IEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
++G F VGQS V++ F IE LE ++ D +E +F+ G N
Sbjct: 61 RTIGHLFAVGQSVVNESYREFCDVVIEELEATTVSMIRNED---LEHHMREFQAVLGFPN 117
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
GA D H+ ++ P ++ D+ + + YS+++ +VD F + G PG S +
Sbjct: 118 DIGAPDGCHLPVS-PLKDSAVDYRNYKGCYSVILLALVDRRCLFRYVSVGSPGKCQDSNV 176
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLI--TPYETNGLSASM 294
+L E Q V S ++ ++ +PL LI P+ N
Sbjct: 177 YGRCPLGRLLEDYQ---VAVPGSIGGTKIPPIVLCDQAFPLTRNLIKPLPHSLNHPQDEG 233
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLLHN 346
+L +A R + AF +LK +RI+ K M R D + + + CC+LHN
Sbjct: 234 DYSYALSKARRVVE-NAFGRLKARFRIVLKRMEVRID--NVYTAVRACCILHN 283
>gi|449674741|ref|XP_004208247.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 190
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 91/187 (48%), Gaps = 4/187 (2%)
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
M ++ S+FE FG++ G ID TH+ + P D +C ++ +S+ VQ + D + F
Sbjct: 1 MTKLASQFEVKFGMIQAFGCIDGTHVQIKRPIKNGQDYFCYKQ-YFSLNVQAVCDSKGYF 59
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
ID+ WPG ++ +++ S K G L + + Y++G YPL ++
Sbjct: 60 IDVECKWPGSVHDAKMFINSTINKKLIKGT-LPQTLYSLPNYHSIPNYLIGDPAYPLTNF 118
Query: 281 LITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILV 340
I +++ + + FNS+ + R+ AF +LK W L K++ +P +I
Sbjct: 119 CIKEFQSCSNNEEV-IFNSMLRSARNQIECAFGRLKARWGFLRKII-DIKIETVPIVIYT 176
Query: 341 CCLLHNI 347
C +LHN
Sbjct: 177 CFVLHNF 183
>gi|53749283|gb|AAU90142.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 256
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 87/162 (53%), Gaps = 12/162 (7%)
Query: 201 DQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISS 260
+ + YS+ +Q +VD E F D+ G PG ++ + +L S + CEAG L G+ ++
Sbjct: 57 NNKATYSVAMQAVVDDEGAFTDVCIGHPGSLSDAAVLAKSALYARCEAGLLL-GHDKLGW 115
Query: 261 EEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR 320
++ ++VGG YPL SW++ PY + + N+ R+ AV AF +L+ W+
Sbjct: 116 QQ---PLWLVGGASYPLTSWMLVPYTQPNQTWAQDRLNAHVADARAAAVGAFRRLRARWQ 172
Query: 321 ILSKVMWRPDKRKLP---SIILVCCLLHNIIIDSGDQLHPDV 359
L R + KLP +++ VCC+LHN+ G++L ++
Sbjct: 173 CL-----RRAEVKLPELANMLAVCCVLHNLCERIGEELDANL 209
>gi|383172893|gb|AFG69822.1| Pinus taeda anonymous locus UMN_586_01 genomic sequence
gi|383172894|gb|AFG69823.1| Pinus taeda anonymous locus UMN_586_01 genomic sequence
gi|383172895|gb|AFG69824.1| Pinus taeda anonymous locus UMN_586_01 genomic sequence
gi|383172896|gb|AFG69825.1| Pinus taeda anonymous locus UMN_586_01 genomic sequence
gi|383172897|gb|AFG69826.1| Pinus taeda anonymous locus UMN_586_01 genomic sequence
Length = 141
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 18/151 (11%)
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+VD F DI G PG M +L+ S +KL G NG ++VG
Sbjct: 1 VVDSTGSFTDICIGLPGSMTDDVVLENSDLYKLGMNG-FFNGG------------WVVGS 47
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
YPLL WL+ PY + L+ + FN + +A AF +LKG WR L K R + +
Sbjct: 48 SAYPLLDWLLVPYVQHNLTWTQHAFNEKISEIQKIAKDAFGRLKGRWRCLEK---RTEVK 104
Query: 333 --KLPSIILVCCLLHNIIIDSGDQLHPDVAL 361
LP ++ CC+LHNI + P++A
Sbjct: 105 LLDLPVVLGACCVLHNICEQYKEDFDPELAF 135
>gi|403171723|ref|XP_003889412.1| hypothetical protein PGTG_21889 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169310|gb|EHS63851.1| hypothetical protein PGTG_21889 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/306 (26%), Positives = 124/306 (40%), Gaps = 35/306 (11%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
EE FK R +K F ++ + + P N L V Q+A+ L RL S +
Sbjct: 102 EEDFKQAVRTTKTGFTWLLEQIYFN-----PIFYSNSPRPQLPVPHQLALTLERLGSNGN 156
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG+ TV + + R I A+ + ++ +L WPD++R +EI SK ++ G
Sbjct: 157 GASVGRFSRNLSVGRGTVIKASRRVIRAINDLSQRYLLWPDTDRRDEI-SKVMKAEGFEG 215
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C G +D T I + + + D++ YS+ Q I D + +TGWPG S +
Sbjct: 216 CIGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCDQFITAYMTGWPGSCGDSMV 275
Query: 237 LKFSGFFK----LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
K K + GQ Y++ Y L I Y+
Sbjct: 276 FKRMMVHKEPALFFDPGQ-----------------YLIADSAYELGLHCIPAYKAPAAYI 318
Query: 293 SMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRP-DKRKLPSIILVCCLLHNI 347
T FN +R LKG W L + + +P D ++ + C LHN+
Sbjct: 319 QENTEFNYCLARSRVRNEHTIGILKGRWASLQHLRLAIQKPSDMMEIIRWVNCCVTLHNM 378
Query: 348 IIDSGD 353
+ GD
Sbjct: 379 LAHLGD 384
>gi|301617451|ref|XP_002938158.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 276
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 115/272 (42%), Gaps = 38/272 (13%)
Query: 53 ATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALR 112
AT+ E+ +R++K + + L+ L R R + ++ +L
Sbjct: 10 ATLEGLGEDEVVRRYRLNKTAINTLYELLEPALEPR------TCPSRAVPGMVKLLCSLH 63
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF 172
ASG Q GV GV Q T SQ + ++A+ +++ + +P +R E + S
Sbjct: 64 FFASGSFQRVGGVYGGVSQPTFSQCLGQVLDAIRSASRNFISFPQ-HRNEWVASPM---- 118
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY-SMLVQGIVDHEMRFIDIVTGWPGGM 231
TH+ + P Q + Y S+ VQ + D M + IV+G+PG
Sbjct: 119 ----------CTHVALNPP--QDKEHISRNRKGYHSLNVQVVCDANMNIMSIVSGFPGSS 166
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
+ + +L+ SG ++ E GQ +G +++G GYP WLITP
Sbjct: 167 HDAYILRQSGLYQAFETGQMPHG-------------WLLGDAGYPCGRWLITPIHRPRSR 213
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILS 323
A FN H TRS+ + F LK +R LS
Sbjct: 214 AEC-AFNQAHVRTRSVIERTFGVLKSRFRCLS 244
>gi|302810012|ref|XP_002986698.1| hypothetical protein SELMODRAFT_425553 [Selaginella moellendorffii]
gi|300145586|gb|EFJ12261.1| hypothetical protein SELMODRAFT_425553 [Selaginella moellendorffii]
Length = 445
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 70/145 (48%), Gaps = 26/145 (17%)
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
+ IK F+ G LNCCGAID H + LP + D+ +++ +YS+++Q IVD+E
Sbjct: 106 LSAIKIGFKSLCGFLNCCGAIDYMHFKVELPGNVFASDYYNKDRDYSIVIQAIVDNEA-- 163
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
K ++G+ LNG +S + + REYI+G GY L W
Sbjct: 164 -----------------------KRAQSGEILNGPA-VSRSQYQFREYIIGDSGYYELPW 199
Query: 281 LITPYETNGLSASMPTFNSLHEATR 305
L+ P+ + + TFN ++R
Sbjct: 200 LVIPFPSANGTELRETFNFYLSSSR 224
>gi|403176854|ref|XP_003335463.2| hypothetical protein PGTG_17316 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375172428|gb|EFP91044.2| hypothetical protein PGTG_17316 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 138/339 (40%), Gaps = 52/339 (15%)
Query: 47 NSSTPG--------ATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEG 98
N +PG A + + +E FK R +K F ++ L+ + V S
Sbjct: 81 NDRSPGRTREEFDLAQLFNMRDEDFKQAVRTTKLGFMWLLGLITLNPVFHSASFRPQ--- 137
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKW 155
L V Q+A+ L RL S + SVG GVG+ TV + + R I+A+ + +L W
Sbjct: 138 --LPVPHQLALTLERLGSNGNGASVGRFSRNLGVGRGTVIKASRRVIQAINHLSHTYLLW 195
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
PD++R +EI S ++ G C G +D T I + + + D++ YS+ Q I D
Sbjct: 196 PDADRRKEI-SNVMKAEGFEGCIGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICD 254
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKF-------SGFFKLCEAGQRLNGNVRISSEEVELREY 268
+ +TGWPG S + K GFF + GQ Y
Sbjct: 255 CDRFITGYMTGWPGSCGDSMVFKKMAVHRDPGGFF---DPGQ-----------------Y 294
Query: 269 IVGGVGYPLLSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV-- 325
++ Y L I Y+ T FN +R LKG W L ++
Sbjct: 295 LIADSAYELGLHCIPAYKAPAAYVLENTEFNYCLARSRVRNEHTIGILKGCWASLQQLRL 354
Query: 326 -MWRP-DKRKLPSIILVCCLLHNIII---DSGDQLHPDV 359
+ +P D ++ + C LHN++ D+ D L P +
Sbjct: 355 AIQKPSDMMEIIRWVNCCVTLHNMLAHLGDAWDHLDPSI 393
>gi|221122055|ref|XP_002167152.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/256 (25%), Positives = 114/256 (44%), Gaps = 26/256 (10%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++ + C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKEFDQYWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G + + +L S F G+ N +IS + ++ +VG
Sbjct: 146 YCFTMVDIGAYGRDDDAAVLNASTF------GRAFNKGYFNLPKISEFDPKVPPVLVGDD 199
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKR 332
+ L WL+ PY L+ FN R +F L WRI S + +P
Sbjct: 200 VFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIKNSFGILAARWRIYRSPIKAKP--L 257
Query: 333 KLPSIILVCCLLHNII 348
K+ II LHN +
Sbjct: 258 KVEHIIKATVCLHNYL 273
>gi|449691682|ref|XP_004212760.1| PREDICTED: uncharacterized protein LOC100208553 [Hydra
magnipapillata]
Length = 492
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 88/351 (25%), Positives = 144/351 (41%), Gaps = 64/351 (18%)
Query: 59 EEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
EE FK FRV++ TF+++ + L + R P +SV K+VA+AL
Sbjct: 195 EECEFKEHFRVNRNTFNFLVNELHPHLGKTTTTMREP----------ISVVKRVAVALHY 244
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEES 171
LAS E V FG+G+ST + + F+ A+ + ++K+P S + + FE
Sbjct: 245 LASCEEYHVVSSLFGIGESTANLIVHEFVNAVNDILLPKYVKFPLSVENLNKYSRDFEAI 304
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
C GA+D HI ++ P Q + + + + S+++ +VD RFI G PG
Sbjct: 305 LDFPQCVGAVDGCHIPISAPKDQ-AISYYNYKGLCSIVLFAVVDCRYRFIYTSVGSPGRN 363
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
N S +L+ S + E+ + G L
Sbjct: 364 NDSYILQNSSLKAILESN-------------------LFGKCCKEL-------------- 390
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
+SL +R + AF ++K +R++ K M D I+ C LHN I +
Sbjct: 391 -----GDSLVPLSRRVVENAFGRVKARFRVICKRM-ECDINFATRIVNACVTLHN-ICEY 443
Query: 352 GDQLHPDVALSDHHDSGYGEQCCKQV---DPMGRTTRENLEKHLQHNQEKD 399
D + L HHD + + G+ R+++ K+L EKD
Sbjct: 444 YDDIITIEWLMHHHDDSLAQPNTVSTTGNNGPGKNVRDSIAKYL---YEKD 491
>gi|301118887|ref|XP_002907171.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105683|gb|EEY63735.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 329
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/253 (27%), Positives = 114/253 (45%), Gaps = 23/253 (9%)
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM 161
S K+VA +L L S G A G+ +S V ++T + L+ A + +P SNR
Sbjct: 45 SYNKKVAASLYFLGSSGGYRETGAAMGMSRSYVMEITSEVVRVLKIVAPQVISFP-SNRE 103
Query: 162 E--EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
E +++ F G GAID + I + P D + ++ ++ +Q IV +
Sbjct: 104 EWNAVEAGFASKHGYPGIAGAIDGSLIEIERP--DNFDGFYCRKAYPALNMQAIVTSDGF 161
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
F+ + PG + S+ ++S R GNV + ++ +G GY LL
Sbjct: 162 FLSVDVR-PGSWSDSKCWQYSTI-------GRSVGNV------LPAGKHFIGDAGYALLP 207
Query: 280 WLITPY----ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
WLI PY E LS FN LH +TR F + KG +R+L + + R+
Sbjct: 208 WLIVPYCEREEGGRLSQQQKQFNFLHSSTRMAVECTFGRWKGRFRMLQCALSQETARRSA 267
Query: 336 SIILVCCLLHNII 348
+ ++ +LHN++
Sbjct: 268 NFVVATVVLHNLM 280
>gi|361069809|gb|AEW09216.1| Pinus taeda anonymous locus UMN_586_01 genomic sequence
Length = 141
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 18/151 (11%)
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+VD F DI G PG M +L+ S +KL G NG ++VG
Sbjct: 1 VVDSTGSFTDICIGLPGSMADDVVLENSDLYKLGMNG-FFNGG------------WVVGS 47
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
YPLL WL+ PY + L+ + FN + +A AF +LKG WR L K R + +
Sbjct: 48 SAYPLLDWLLVPYVQHNLTWTQHAFNEKISEIQKIAKDAFGRLKGRWRCLEK---RTEVK 104
Query: 333 --KLPSIILVCCLLHNIIIDSGDQLHPDVAL 361
LP ++ CC+LHNI + P++A
Sbjct: 105 LLDLPVVLGACCVLHNICEQYKEDFDPELAF 135
>gi|403167044|ref|XP_003326870.2| hypothetical protein PGTG_08407 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166834|gb|EFP82451.2| hypothetical protein PGTG_08407 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 418
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 143/354 (40%), Gaps = 45/354 (12%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+ FK R +K F ++ +R P N L + Q+A+ L RL S +
Sbjct: 85 DNDFKQAVRTTKDGFLWLLDRIRHH-----PIFYSNSPRPQLPIPHQLALTLERLGSNGN 139
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG+ TV +V+ R I+A+ E + +++ WP+ R EI E G
Sbjct: 140 GASVGRFSRNLSVGRGTVIKVSRRVIQAINEVSTNYIVWPNKERRAEISEVMTEE-GFEG 198
Query: 177 CCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G +D T I + P + W D++ YS+ Q I D + +TGWPG S
Sbjct: 199 CIGFVDGTTIPLHQRPGLDGEVYW-DRKKQYSINCQVICDCDRFITSFMTGWPGTCGDSL 257
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEE-VELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+ K N V + + + +Y++ Y L I Y++ +
Sbjct: 258 VFK--------------NMKVHLEPDRFFDPGQYLIADSAYALSMTTIPAYKSPLSKINR 303
Query: 295 PT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRPDKRKLPSII--LVCCL-LHNI 347
T FN +R LKG W L ++ ++ P + + II + CC+ LHN+
Sbjct: 304 NTEFNYCLAKSRVRNEHTIGILKGRWASLQQLRLSLYTP--QHMVEIIRWINCCVALHNM 361
Query: 348 IIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQEKDLV 401
+ GD +D+ + G E N E H + QEK L+
Sbjct: 362 LAHLGDSWDELATPNDNDEPGDSEG----------NHENNTEGHREEIQEKCLL 405
>gi|301101403|ref|XP_002899790.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102792|gb|EEY60844.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 328
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 123/295 (41%), Gaps = 34/295 (11%)
Query: 68 RVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAF 127
R +K+TF + ++E + N + R S EK+VA AL L S VG A
Sbjct: 16 RCTKETFLRLGRFLQEHGIR-----FANAKVRQHSYEKKVAAALYFLGSSGGYREVGAAM 70
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
G+ +S V ++T I L+ A + +P D R I+ +F G GAID + +
Sbjct: 71 GMSRSYVKEITDEVIHVLKLVACQVISFPRDRRRWNTIEDQFALRQGYPGVVGAIDGSLV 130
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS----GF 242
+ P D + +++ ++ VQ IV + F+ + PG + + + S
Sbjct: 131 EVERP--DDFDGFYCRKSYPALNVQAIVTMDNYFLSVEVR-PGSWSDRKCWQHSVIARNV 187
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY----ETNGLSASMPTFN 298
F + AG + VG GY L L+ PY E L+ FN
Sbjct: 188 FNIIPAGT-----------------HFVGDAGYALSPGLMVPYSDREEGGALTERQNKFN 230
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
H +TR + F KG ++IL + + R +I+ +LHN+II D
Sbjct: 231 YFHSSTRMVVESTFGHWKGRFKILQCTLNQDTPRDASDVIVATIVLHNLIISFRD 285
>gi|115953196|ref|XP_785740.2| PREDICTED: uncharacterized protein LOC580596 [Strongylocentrotus
purpuratus]
Length = 397
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/297 (24%), Positives = 136/297 (45%), Gaps = 23/297 (7%)
Query: 62 GFKYFFRVSKKTFDYICSLV-----REDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
F+ F RV+ + FD I V +++ R P LS ++AI LR LA+
Sbjct: 76 AFRNFTRVTPEVFDEILERVAPVIQKQETNYRHP----------LSAGLKLAITLRHLAT 125
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHL-KWPDSNRM-EEIKSKFEESFGL 174
G++ S+ F G ST+S++ A+ E K + P + + +FE+ + +
Sbjct: 126 GDNYRSLAYGFRCGISTISEMIPEVYRAIVEGYKDEVFNIPTTPEAWSTLAQQFEQRWNV 185
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
+ GA+D HI++ PA T + + + +S+ + +VD E +FI I G G M+ S
Sbjct: 186 PHAIGALDGKHIVIKKPA-NTGSLYYNYKGFFSIPLLALVDAEYKFIWIELGGKGHMSDS 244
Query: 235 RLLKFSGFFKLCEAGQ-RLNGNVRISSE-EVELREYIVGGVGYPLLSWLITPYETNGLSA 292
++ S F+ E G L + E + ++ +I+G + L S+++ PY G++
Sbjct: 245 QIFTDSELFECLEDGSIGLPPPCHLPGENQPDIPYFILGDDAFALKSYMMKPYSRRGMTD 304
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWR-ILSKVMWRPDKRKLPSIILVCCLLHNII 348
N R + AF L +R +L + + D + ++ +LHN++
Sbjct: 305 EHRICNYRISRGRRVVENAFGILANRFRCLLGTLEQKVDNVR--DLVETAVVLHNLL 359
>gi|331229811|ref|XP_003327571.1| hypothetical protein PGTG_09105 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 446
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/305 (25%), Positives = 129/305 (42%), Gaps = 33/305 (10%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+ FK R +K+ F ++ + V V S L + Q+A+ L RL S +
Sbjct: 102 DSDFKQSIRTTKEGFIWLINQVSHHTVFHGQSFRPQ-----LPIPHQMALTLERLGSNGN 156
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG TV + + R I+A+ ++ ++ WPDS R +I S+ ++ GL
Sbjct: 157 GASVGRFARNLSVGHGTVIKASRRVIQAINSLSEKYVVWPDSTRRAKI-SQVMKNEGLSG 215
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C G +D T I + + + D++ YS+ Q + D + TGWPG S +
Sbjct: 216 CIGFVDGTTIPLHQRPGLDGEVYFDRKKRYSINAQIVCDCDKFITAFTTGWPGSCGDSWV 275
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPYETNGLSAS 293
+ N +I SE + +Y++ Y L I Y++
Sbjct: 276 FQ----------------NSKIESEPDKYFDRGQYLIADSAYGLSLTCIPAYKSPASKKP 319
Query: 294 MPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK-LPSII--LVCCL-LHNII 348
T FN +R LKG W L ++ DK+K + I+ + CC+ LHN++
Sbjct: 320 DNTDFNYCIAKSRVRNEHTIGILKGRWASLHELRLHLDKKKHMLEIVRWVNCCITLHNML 379
Query: 349 IDSGD 353
GD
Sbjct: 380 AQLGD 384
>gi|331226986|ref|XP_003326162.1| hypothetical protein PGTG_07992 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309305152|gb|EFP81743.1| hypothetical protein PGTG_07992 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 419
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 141/349 (40%), Gaps = 41/349 (11%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+ FK R +K F ++ +R P N L + Q+A+ L RL S +
Sbjct: 102 DNDFKQAVRTTKDGFLWLLDQIRHH-----PIFYSNSPRPQLPIPHQLALTLERLGSNGN 156
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG+ TV +V+ R I+A+ E + +++ WP+ R EI E G
Sbjct: 157 GASVGRFSRNLSVGRGTVIKVSRRVIQAINEVSTNYIVWPNKERRAEISEVMTEE-GFKG 215
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI-VTGWPGGMNVSR 235
C G +D T I + + + D++ YS+ Q I D + RFI + +TGWPG S
Sbjct: 216 CIGFVDGTTIPLHQQPGLDGEVYWDRKKQYSINCQVICDCD-RFITLFMTGWPGTCGDSL 274
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEE-VELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+ K N V + + + +Y++ Y L I Y++ +
Sbjct: 275 VFK--------------NMKVHLEPDRFFDPGQYLIADSAYALSMTTIPAYKSPLSKINR 320
Query: 295 PT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRPDKRKLPSIILVCCL-LHNIII 349
T FN +R LKG W L ++ ++ P + CC+ LHN++
Sbjct: 321 NTEFNYCLAKSRVRNEHTIGILKGRWASLQQLRLSLYTPQHMVEIICWINCCVALHNMLA 380
Query: 350 DSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQEK 398
GD +D+ + G E +N E H + QEK
Sbjct: 381 HLGDSWDELATPNDNDEPGDSEG----------NHEDNTEGHREEIQEK 419
>gi|301609160|ref|XP_002934124.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 347
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 119/281 (42%), Gaps = 28/281 (9%)
Query: 74 FDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQST 133
DY+ L+ L SRP R +S E Q+ AL SG Q +G G+ Q++
Sbjct: 46 IDYLVDLLGGSL-SRPTQ-----RSRAISPETQIMAALGFYTSGSFQTRMGDTIGISQAS 99
Query: 134 VSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPA 192
+S+ EAL ERA + +P D ++ +K +F G+ G +D T + + P
Sbjct: 100 MSRCVTNVTEALVERASQFISFPRDERSVQRLKDEFYNLAGVPGVLGVVDCTQVNIKAP- 158
Query: 193 VQTSDD--WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ 250
S+D + ++ +S+ + D + T G M + +L S + L E
Sbjct: 159 --NSEDLSYVNRRGLHSLNCLLVCDARGSLLWAETSRLGSMQDNAVLHQSELYSLFETKM 216
Query: 251 RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVK 310
G +++ + L WL+TP + S S +N H AT S+ +
Sbjct: 217 HKQG-------------WLLADNAFILRPWLMTPVQLPE-SPSDYRYNMAHTATHSVMER 262
Query: 311 AFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIII 349
L+ +R L S+ + K I+L CC+LHNI +
Sbjct: 263 TQRSLRLRFRCLDGSRATLQYSPEKSAQIVLACCILHNIAL 303
>gi|346467053|gb|AEO33371.1| hypothetical protein [Amblyomma maculatum]
Length = 326
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 111/242 (45%), Gaps = 35/242 (14%)
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
V + F + +STV + + ++ L + +P+ ++ + FE+ G G I
Sbjct: 62 VAIRFCISESTVHGIVEKLLDYLCSLLPRKICFPED--LDLLADDFEQLSGFPGVVGCIG 119
Query: 183 ATHIIMTLPA--VQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
T I PA +++S+ Q++ S+ +Q + D++ RF+D+ G PG + + S
Sbjct: 120 GTCINTRSPAHVLKSSN----QQSFASVRLQAVCDNKCRFMDVFVGPPGDLESESVFLAS 175
Query: 241 GFFKLCEAGQRLNGNVRISSEEVEL----REYIVGGVGYPLLSWLITPYETNGLSASMP- 295
+EE+ + +++G GYPL +L+TPY + S +
Sbjct: 176 PL-----------------AEELPYWCPDKNHLLGDEGYPLREYLLTPYSSGQGSKDLAE 218
Query: 296 ----TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN HE TR+ AF L+ ++ L + + K+ I+ CC++HN+ +++
Sbjct: 219 QYELVFNERHEKTRAKIDNAFRLLRQRFKQLHFLEF-VTASKMRRFIMACCVVHNMCVEA 277
Query: 352 GD 353
GD
Sbjct: 278 GD 279
>gi|222635238|gb|EEE65370.1| hypothetical protein OsJ_20664 [Oryza sativa Japonica Group]
Length = 372
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 133/311 (42%), Gaps = 44/311 (14%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+ K F +C+ +R S + E +++E+QVA+ + + S +
Sbjct: 46 LRMRKVVFHKLCAELR--------SRALLEETFHVTIEEQVAMFIHVVGHNWSNRPIAFE 97
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS---KFEESFGLLNCCGAIDA 183
F TVS+ ++AL A+ + + ++I S +F F C GA+D
Sbjct: 98 FMRSGETVSRYFNLVLDALCILARDLVCIKSIDTHQKITSSHGRFHPYFQ--GCIGALDG 155
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI +P + D + ++ S V VD ++RF+ ++ GW G + S +L+
Sbjct: 156 THIPACVP-MHMQDRFRGRKKFQSQNVLAAVDFDLRFLYVLAGWEGSAHDSYVLQ----- 209
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL------SASMP-- 295
+A R NG ++ +Y + GY ++ PY A P
Sbjct: 210 ---DALSRTNG------LKIPEGKYFLADAGYAARPGVLPPYRGTRYHLKEYKGAREPEN 260
Query: 296 ---TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
FN H + R+ +AF LK +RI + P K ++ I++ CC LHN II+ G
Sbjct: 261 PKELFNLRHSSLRTTIERAFGALKNRFRIFKCQPFFPLKTQV-KIVMACCALHNWIIEDG 319
Query: 353 DQLHPDVALSD 363
PDV + D
Sbjct: 320 ----PDVYVYD 326
>gi|328710225|ref|XP_003244198.1| PREDICTED: hypothetical protein LOC100573332 [Acyrthosiphon pisum]
Length = 423
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/347 (23%), Positives = 157/347 (45%), Gaps = 32/347 (9%)
Query: 39 WWPSFWAKNSSTPGATIPSD-EEEG---FKYFFRVSKKTFDYICSLV-----REDLVSRP 89
W S ++ G + +D ++EG FK R+S +TF + +LV +ED R
Sbjct: 37 WMTSLYSNRGVYSGTMLLNDLQKEGGDNFKNCCRMSYETFSQLLTLVDPKIRKEDTKFR- 95
Query: 90 PSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERA 149
+ + +++A+ LR LA+G+S S+ + F + +S++S + A+ E
Sbjct: 96 ---------KAIPSNERLALTLRYLATGDSFSSLSLIFKISKSSISHIIPEVCTAIIEVL 146
Query: 150 KHHLKWPDSNRMEE---IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY 206
+ +++ P S EE + K+E+ + +C GAID H+ + P + + ++ + ++ +
Sbjct: 147 QDYIQVPKSQ--EEWLSVSKKYEDVWNFPHCIGAIDGKHVQLQAP-IGSGSNFYNYKSTF 203
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK-LCEAGQRLNGNVRISSEEVEL 265
S+++ +VD + F+ G G ++ + + + FFK L E + ++ E +
Sbjct: 204 SIVLMAVVDADYNFLYADVGCQGRISDGGVFRNTSFFKQLEEQKLEIPPPEKLIGREKAV 263
Query: 266 REYIVGGVGYPLLSWLITPYE-TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
V +PL ++ PY ++ + FN R + F L +R+ K
Sbjct: 264 PYVFVADAAFPLKENILRPYPGSHEKGSDKRIFNYRLSRARRVVENVFGILSAVFRVFRK 323
Query: 325 VM-WRPDKRKLPSIILVCCLLHNIII--DSGDQLHPDVALSDHHDSG 368
M P K +L +++ C LHN + S ++ L DH G
Sbjct: 324 PMLLEPKKAEL--VVMACVYLHNFLRKEKSSRDIYSPTDLIDHEIEG 368
>gi|307136287|gb|ADN34114.1| retrotransposon protein [Cucumis melo subsp. melo]
Length = 657
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 125/301 (41%), Gaps = 49/301 (16%)
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK 150
+GL + E ++ VE+ VA+ L LA + F T+S+ + A+ +
Sbjct: 59 AGLTSTE--VVDVEEMVAMFLHILAHDVKSRVIKREFMRSGETISRHFNMVLLAVIRLHE 116
Query: 151 HHLKWPDS--NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
LK P N + + ++ E NC GA+D T+I + +PA + + ++ +
Sbjct: 117 ELLKKPQPVPNECTDQRWRWFE-----NCLGALDGTYIKVNVPASDRAR-YRTRKGEVAT 170
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREY 268
V G+ D + F+ ++ GW G SR+L+ + R + +V Y
Sbjct: 171 NVLGVCDTKGDFVYVLAGWEGSAADSRILR--------------DALSRPNRLKVPKGYY 216
Query: 269 IVGGVGYPLLSWLITPYE------------TNGLSASMPTFNSLHEATRSLAVKAFLQLK 316
+ VGYP + PY N S S FN H + R++ +AF LK
Sbjct: 217 YLVDVGYPNAEGFLAPYRGQRYHLQEWRGPENAPSTSKEFFNMKHYSARNVIERAFGVLK 276
Query: 317 GGWRILSKVMWRPDKRKLPSIILVCCLLHNII------------IDSGDQLHPDVALSDH 364
G W IL + P + + + IL CCLLHN+I ID D H A D
Sbjct: 277 GRWAILRGKSYYPVEVQCRT-ILACCLLHNLINREMTNFDIEDNIDEVDSTHATTAADDI 335
Query: 365 H 365
H
Sbjct: 336 H 336
>gi|403173268|ref|XP_003332347.2| hypothetical protein PGTG_13732 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170202|gb|EFP87928.2| hypothetical protein PGTG_13732 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 418
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 138/352 (39%), Gaps = 41/352 (11%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+ FK R +K F ++ +R P N L + Q+A+ L RL S +
Sbjct: 85 DNDFKQAVRTTKDGFLWLLDRIRHH-----PIFYSNSPRPQLPIPHQLALTLERLGSNGN 139
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG+ TV +V+ R I A+ E + +++ WP+ R EI E G
Sbjct: 140 GASVGRFSRNLSVGRGTVIKVSRRVIRAINEVSTNYIVWPNKERRAEISEVMTEE-GFEG 198
Query: 177 CCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G +D T I + P + W D++ YS+ Q I D + +TGWPG S
Sbjct: 199 CIGFVDGTTIPLHQRPGLDGEVYW-DRKKQYSINCQVICDCDRFITSFMTGWPGTCGDSL 257
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEE-VELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+ K N V + + + +Y++ Y L I Y++ +
Sbjct: 258 VFK--------------NMKVHLEPDRFFDPGQYLIADSAYALSMTTIPAYKSPLSKINR 303
Query: 295 PT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRPDKR-KLPSIILVCCLLHNIII 349
T FN +R LKG W L ++ ++ P ++ I C LHN++
Sbjct: 304 NTEFNYCLAKSRVRNEHTIGILKGRWASLQQLWLSLYTPQHMVEIIRWINCCVALHNMLA 363
Query: 350 DSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQEKDLV 401
GD +D+ + G E N E H + QEK L+
Sbjct: 364 HLGDSWDELATPNDNDEPGDSEG----------NHENNTEGHREEIQEKCLL 405
>gi|307135889|gb|ADN33754.1| retrotransposon protein [Cucumis melo subsp. melo]
Length = 623
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 138/318 (43%), Gaps = 46/318 (14%)
Query: 68 RVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAF 127
R+ ++TF +C L+R +GL + E ++ VE+ VA+ L LA + F
Sbjct: 14 RMDRRTFAILCHLLRN------VAGLSSTE--IVDVEEMVAMFLHVLAHDVKNRVIQQEF 65
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWP---DSNRMEEIKSKFEESFGLLNCCGAIDAT 184
TVS+ + A+ + +K P SN ++ FE NC GA+D T
Sbjct: 66 VRSGETVSRHFNIVLLAVLRLYEELIKRPVPVTSNCNDQRWKCFE------NCLGALDGT 119
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
+I + +PA + ++ + V G+ D + F+ ++ GW G SR+L+
Sbjct: 120 YIKVNVPAGDRPT-FRTRKGEIATNVLGVCDMKGDFVYVLAGWEGSAADSRILR------ 172
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL-------SASMPT- 296
+A + NG +V Y + GYP + PY+ +A+ PT
Sbjct: 173 --DAISQENG------LQVPKGYYYLCDAGYPNAEGFLAPYKGQRYHLQEWRGAANAPTN 224
Query: 297 ----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
FN H + R++ +AF LKG W IL + P + + + IL C LLHN+I
Sbjct: 225 AKEYFNMKHSSARNVIERAFGVLKGRWTILRGKSYYPLQVQCRT-ILACTLLHNLINREM 283
Query: 353 DQLHPDVALSDHHDSGYG 370
+ DV D DS Y
Sbjct: 284 TYCN-DVEDEDEGDSTYA 300
>gi|270005023|gb|EFA01471.1| hypothetical protein TcasGA2_TC007020 [Tribolium castaneum]
Length = 270
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 114/250 (45%), Gaps = 30/250 (12%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
+ I LR LA Q +G GV QSTVS+ ++ + E+ + +++P SN+ K+
Sbjct: 1 MKICLRYLADPGFQKGIGEELGVEQSTVSRTVKYVVQKIIEQTANWIQFPASNQKNCGKA 60
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
L++C TH+ + P D++ +++ ++ VQ D F +
Sbjct: 61 NIGFPLQLVDC------THVEIEKPK-NHGDEYINRKGKPTINVQATCDAREIFTSVEVS 113
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE---YIVGGVGYPLLSWLIT 283
WPG ++ SR+ K N +R ++ +R ++G GY + WL+T
Sbjct: 114 WPGSVHDSRIWK--------------NSEIR----QIMMRSTNTVLMGDDGYGIEPWLMT 155
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
P+ N + +N + + R + + F Q+K + IL K + R + +II+ C +
Sbjct: 156 PF-PNPNDNTKKRYNKIFKKERVIIERCFGQVKRRFPIL-KYVCRVKLENISNIIIACFV 213
Query: 344 LHNIIIDSGD 353
LHNI GD
Sbjct: 214 LHNIAKMLGD 223
>gi|90399090|emb|CAJ86150.1| H0413E07.3 [Oryza sativa Indica Group]
gi|90399286|emb|CAH68247.1| H0306F03.14 [Oryza sativa Indica Group]
Length = 435
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 66/259 (25%), Positives = 110/259 (42%), Gaps = 43/259 (16%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVS-QVTWRFIEALEERAKHHLKWPDSNRME 162
+ ++ AL RLA +V FG+ V+ + + + +R L +R+
Sbjct: 159 DHKLGAALFRLAHAAPARAVARRFGLPSPAVAARAFYEVCRTIADRLAILLDLGAPDRIA 218
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
F + L NCCGA+ I +++ Q +VD E RF+D
Sbjct: 219 RAVPGFC-ALSLPNCCGALGYARI------------------GDAVIAQALVDAEGRFLD 259
Query: 223 IVTGWPGGMNVSRLLKFSGFFK-----LCEA--GQRLNGNVRISSEEVELREYIVGGVGY 275
+ GW M + +L + + L A G+ + G+V Y +G
Sbjct: 260 VSVGWDTAMAPAEILPRTKLYSSQSLVLANAPHGELIGGSVP---------RYFLGPACC 310
Query: 276 PLLSWLITPYE----TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD- 330
P+L WL+TPY NG+S FN++H L AF ++ WR+L + W+ +
Sbjct: 311 PMLPWLVTPYNDMDAKNGMSKE-SIFNNVHSHGMRLVRNAFGHVRSRWRLLDEC-WKGEC 368
Query: 331 KRKLPSIILVCCLLHNIII 349
+ LP +++ CLLHN +I
Sbjct: 369 QEALPYVVVAGCLLHNFLI 387
>gi|322780715|gb|EFZ09990.1| hypothetical protein SINV_08352 [Solenopsis invicta]
Length = 308
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 107/245 (43%), Gaps = 15/245 (6%)
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESF 172
LASG+ S+ + + +G+ST++ + EA+ K L P N +EI +F+E +
Sbjct: 2 LASGDGLNSIALNYRIGRSTITLIIKETCEAIWTNLNKEALFIPTQNGWKEIAQEFQEIW 61
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
NC GA+D H+ + P + + + +N +S+++ +V +F+ + G G +
Sbjct: 62 NFPNCVGALDGKHVSIICPP-KAGSQYYNYKNFHSVVLMALVSASYKFLIVDIGAQGRHS 120
Query: 233 VSRLLKFSGFFKLCEAGQRL-NGNVRIS-----SEEVELREYIVGGVGYPLLSWLITPYE 286
G FK GQR N + + SE + IV + L + + PY
Sbjct: 121 D------GGIFKNSAMGQRFYNKTMNLPDSSDISERHTVPYVIVADEAFQLTEFTMRPYP 174
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
+ L+ FN R + F L WRI K + + + +II LHN
Sbjct: 175 SKNLTKQQKIFNYRLSRARHVVENTFGILASRWRIYHKPI-NTSLKTVDAIIKATVCLHN 233
Query: 347 IIIDS 351
++D+
Sbjct: 234 FLMDT 238
>gi|449683473|ref|XP_004210370.1| PREDICTED: uncharacterized protein LOC101240658 [Hydra
magnipapillata]
Length = 223
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 85/184 (46%), Gaps = 19/184 (10%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDL-----VSRPPSGLINIEGRLLSVEKQVAIALRR 113
EE FK FRV++ TF+++ + + L R P +SV K+VA+AL
Sbjct: 2 EECEFKEHFRVNRNTFNFLVNELHPHLGKTTTTMREP----------ISVVKRVAVALHY 51
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEES 171
LAS E V FG+G+ST + FI A+ + ++K+P S + + FE
Sbjct: 52 LASCEEYCVVSSLFGIGKSTANLFVHEFINAVNDILLPKYVKFPLSVENLNKHSRDFEAI 111
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G C GA+D HI + P Q + YS+++ +VD RFI G PG
Sbjct: 112 LGFPQCVGAVDECHIPILAPKDQAISYY--NYKWYSIVLFSVVDCRYRFIYTSVGSPGRN 169
Query: 232 NVSR 235
N R
Sbjct: 170 NDGR 173
>gi|218199708|gb|EEC82135.1| hypothetical protein OsI_26179 [Oryza sativa Indica Group]
Length = 325
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 130/302 (43%), Gaps = 38/302 (12%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+++E+QVA+ + + S + F TVS+ ++AL A+ + +
Sbjct: 25 VTIEEQVAMFIHVVGHNWSNRPIAFEFMRSGETVSRYFNLVLDALCILARDLVCIKSIDT 84
Query: 161 MEEIKS---KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
++I S +F F C GA+D THI +P + D + ++ S V VD +
Sbjct: 85 HQQITSSHGRFHPYFQ--GCIGALDGTHIPACVP-MHMQDRFRGRKKFPSQNVLAAVDFD 141
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+RF+ ++ GW G + S +L+ +A R NG ++ +Y + GY
Sbjct: 142 LRFLYVLAGWEGSAHDSYVLQ--------DALSRTNG------LKIPEGKYFLADAGYAA 187
Query: 278 LSWLITPYE-----------TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
++ PY T FN H + R+ +AF LK +RI
Sbjct: 188 RPGVLPPYRGTRYHLKEYKGTREPENPKELFNLRHSSLRTTIERAFGALKNRFRIFKCQP 247
Query: 327 WRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRE 386
+ P K ++ I++ CC LHN II+ G P+V + D D+ + E + V +E
Sbjct: 248 FFPLKTQV-KIVMACCALHNWIIEDG----PNVYVYD--DAAWYEALPRSVRNHTDMHQE 300
Query: 387 NL 388
NL
Sbjct: 301 NL 302
>gi|328723564|ref|XP_003247874.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 194
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 79/154 (51%), Gaps = 13/154 (8%)
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN-VRISSEEVEL 265
S+L+QGIVD +++FID+ +GWPG + +R+ F+ GQ+L N + I E
Sbjct: 17 SVLLQGIVDSDLKFIDVFSGWPGSSHDARV------FRRSLIGQKLLSNDLSILPENC-- 68
Query: 266 REYIVGGVGYPLLSWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
+I+G YPL ++ PY NG L+ + FN ++R + +AF +L +R L K
Sbjct: 69 --HILGDGAYPLSENVMIPYRDNGNLTLAQKHFNRCLSSSRVVVEQAFGKLYCRFRKL-K 125
Query: 325 VMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPD 358
M K II C LHNI ID D D
Sbjct: 126 HMDVYHKSLCGLIITAACCLHNICIDMQDDFDAD 159
>gi|427798307|gb|JAA64605.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 346
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 137/309 (44%), Gaps = 41/309 (13%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
EE F+ FR+SK+T +C + + + SGL S E++V ALR A+G
Sbjct: 36 EEEFRQCFRLSKRTVRSLCDELEPIIGCQRASGL--------STERKVLCALRFFATGSF 87
Query: 120 QVSVGVA--FGVGQSTVSQVTWRFIEAL------EERAKHHLKWPDSNRMEEIKSKFEES 171
Q SVG G+ QS VS EA+ + L + ++ +
Sbjct: 88 QRSVGREEHIGMAQSAVSNTIHEVTEAIITVSARRKLVDFSLTPAAKEEAKAAFARRGDI 147
Query: 172 FGLLNCCGAIDATHIIMTLPA---VQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
G+L C +D T + + P + + + ++ Y++ V + + ++R + + +P
Sbjct: 148 PGVLAC---VDGTLVAIRKPEGFNLPDTASFMSRKGYYALNVMIVCNAQLRILVVDPRFP 204
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP-YET 287
G + S + + + L G + + +++ EY++G GYPL WL+TP +
Sbjct: 205 GSCHDSWVWEHNP----------LRGRL---AAQLQPGEYVLGDSGYPLEPWLLTPVLGS 251
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS---KVMWRPDKRKLPSIILVCCLL 344
+ +N H + R++ + LK +R L +++ PD+ II C L
Sbjct: 252 PPRNTPEGRYNREHASMRNVVERCIGVLKSKFRCLQHFRTMLYNPDRAA--RIIYACVAL 309
Query: 345 HNIIIDSGD 353
HNI +D+GD
Sbjct: 310 HNIALDAGD 318
>gi|221105453|ref|XP_002166626.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 370
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/253 (25%), Positives = 111/253 (43%), Gaps = 24/253 (9%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + D
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPN-----D 86
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++++ C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 87 VKEWQNIIKEFDKNWTFPQCIGAIDGKHVHIEAPA-KSGFSFYNYKEFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G N + +L S F G+ N +IS + ++ +VG
Sbjct: 146 YCFTMVDIGAYGRDNDAAILNASTF------GRAFNKGYFSLPKISEFDPKVPLVLVGDD 199
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
+ L WL+ PY L+ FN R +F L WRI + + K
Sbjct: 200 IFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSFGILAARWRIYRYPI-KAKPFK 258
Query: 334 LPSIILVCCLLHN 346
+ II LHN
Sbjct: 259 VEHIIKATVCLHN 271
>gi|403177022|ref|XP_003888818.1| hypothetical protein PGTG_22337 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375172694|gb|EHS64701.1| hypothetical protein PGTG_22337 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 437
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 123/303 (40%), Gaps = 29/303 (9%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E FK R +K+ F ++ D +S P L + Q+A+ L RL S +
Sbjct: 102 DEDFKQSVRTTKEAFLWLL-----DQISLHPVFHSQSHRPQLPIPHQLALTLERLGSNGN 156
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG+ TV +V+ R IEA+ + H+ WPD R EI ++ G
Sbjct: 157 GASVGRFSRNLTVGRGTVIKVSRRVIEAINSLSSKHVVWPDRYRRAEISEVMKDE-GFPG 215
Query: 177 CCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G +D T I + P + W D++ YS+ Q + D + GWPG S
Sbjct: 216 CVGFVDGTTIPLHQRPGLDGEVYW-DRKKKYSVNCQIVCDCDRYITSFTVGWPGSCGDSW 274
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA-SM 294
+ + + EA + ++++ Y L I Y+ +
Sbjct: 275 VFRNTKLHTQAEA-------------YFDAGQFLIADSAYGLSCTTIPAYKAPASNKREN 321
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK-LPSII--LVCCL-LHNIIID 350
FN +R LKG W L ++ K+ + II + CC+ LHN++ +
Sbjct: 322 ADFNYCLAKSRVRNEHTIGILKGRWASLQQLRLHLHKKSHMKEIIRWVSCCVTLHNMLSN 381
Query: 351 SGD 353
GD
Sbjct: 382 LGD 384
>gi|72145861|ref|XP_795502.1| PREDICTED: uncharacterized protein LOC590821 [Strongylocentrotus
purpuratus]
Length = 372
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 134/297 (45%), Gaps = 14/297 (4%)
Query: 57 SDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
++++ FK F RV ++ FD + + V + + + + L ++A+ LR LAS
Sbjct: 70 NEDQAAFKNFMRVPQEMFDELLTRVGPRITKQKTNYRDALHPGL-----KLALTLRDLAS 124
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDS-NRMEEIKSKFEESFGL 174
G S+ + V +T+S + +A+ +E +K P + I KF E +
Sbjct: 125 GTKYRSMSYGWRVPHNTISLLIPEVCQAIIKEYRDEMMKCPTTPEEWRAISDKFMEKWNF 184
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
CGA+D H+ P+ + + + + YS+++ +VD + RFI G G + +
Sbjct: 185 PRTCGALDGKHVNCKRPS-NSGSLYYNYKGFYSVVLMALVDADYRFIWADIGGMGSASDA 243
Query: 235 RLLKFSGFFKLCEAGQRLN--GNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
++ S K C G L + ++ ++ + VG + L ++ PY GL+
Sbjct: 244 QIYNASE-LKACVEGGSLGFPDPDPLPNDNQDMSYFFVGDDAFALRPTMMKPYSLRGLTR 302
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKRKLPSIILVCCLLHNII 348
+N R + AF L +++ LS + +P+ KL I+ C +LHN++
Sbjct: 303 PERIYNHPLSRARRVVENAFGILANRFQVLLSTMQQQPETVKL--IVTACMILHNLL 357
>gi|403162731|ref|XP_003322904.2| hypothetical protein PGTG_04441 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375173057|gb|EFP78485.2| hypothetical protein PGTG_04441 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 575
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 70/261 (26%), Positives = 112/261 (42%), Gaps = 22/261 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L V Q+A+ L RL S + SVG GVG+ TV + + R I+A+ E + +L WPD
Sbjct: 282 LPVPHQLALTLERLGSNGNGASVGRFSRNLGVGRGTVIKASRRVIQAINELSHTYLLWPD 341
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+R +EI S+ ++ G C G +D T I + + + D++ YS+ Q I D +
Sbjct: 342 EDRRKEI-SEVMKAEGFEGCIGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCD 400
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+TGWPG S + K + + G+ + +Y++ Y L
Sbjct: 401 RFITGYMTGWPGSCGDSMVFKKMAVHR--DPGRFFDPG-----------QYLIADSAYEL 447
Query: 278 LSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRP-DKR 332
I Y+ T FN +R LKG W L + + +P D
Sbjct: 448 GLHCIPAYKAPAAYILENTEFNYCLARSRVRNEHTIGILKGRWASLQHLRLAIQKPSDMM 507
Query: 333 KLPSIILVCCLLHNIIIDSGD 353
++ + C LHN++ GD
Sbjct: 508 EIIRWVNCCVTLHNMLAHLGD 528
>gi|331243398|ref|XP_003334342.1| hypothetical protein PGTG_16211 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309313332|gb|EFP89923.1| hypothetical protein PGTG_16211 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 135/320 (42%), Gaps = 35/320 (10%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E FK R +K F ++ S + + + SG + L V Q+A+ L RL S +
Sbjct: 102 DEDFKQAVRTTKSGFTWLLSQIAFNPIFH--SGSFRPQ---LPVPHQLALTLERLGSNGN 156
Query: 120 QVSVGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG+ TV + + R I A+ + +L+WPD +R +EI S+ ++ G
Sbjct: 157 GASVGRISRNLCVGRGTVIKASRRVIRAINDMGPTYLRWPDKDRRKEI-SEVMKAEGFEG 215
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C G +D T I + + + D++ YS+ Q I D + +TGWPG S +
Sbjct: 216 CVGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCDRFITGYMTGWPGSCGNSMV 275
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVEL---REYIVGGVGYPLLSWLITPYETNGLSAS 293
K +++ E + +Y++ Y L I Y+
Sbjct: 276 FK----------------RMKVHKEPDDFFDPGQYLIADSAYELGVHCIPAYKAPAAYIR 319
Query: 294 MPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRP-DKRKLPSIILVCCLLHNII 348
T FN ++ LKG W L ++ + +P D ++ + C LHN++
Sbjct: 320 ENTEFNYCLARSQVRNEHTIGILKGRWASLQQLRLAIQKPSDMMEIIRWVNCCVTLHNML 379
Query: 349 IDSGDQLHPDVALSDHHDSG 368
GD DV ++ D+G
Sbjct: 380 AHLGDAW--DVLEANMEDTG 397
>gi|357627760|gb|EHJ77342.1| hypothetical protein KGM_10683 [Danaus plexippus]
Length = 462
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 74/303 (24%), Positives = 128/303 (42%), Gaps = 31/303 (10%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+ F +R+SK +C E+L P + +IE SVE +V AL A+G+
Sbjct: 64 DAAFLDMYRLSKDLARNLC----EELKPVMPDSIKSIE---FSVESKVLAALSFYATGKY 116
Query: 120 QVSVG--VAFGVGQSTVSQVTWRFIEALEERA--KHHLKWPD-SNRMEEIKSKFEESFGL 174
Q S+G + Q V+ + EA+ + + K ++ +P N E IK+ F +G+
Sbjct: 117 QKSIGGRSDPSITQYFVATAVMQVTEAMNDPSIIKKYIHFPHLRNEREVIKNGFYMKYGI 176
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
N G +D H+ + P + ++ +S VQ I D R + + + G +
Sbjct: 177 PNVVGCVDCVHVPIARPDEDQKKHF--NKSYHSKKVQIISDSRQRIMSVCSEGGGSYSHD 234
Query: 235 RLL-KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
LL + + L LN + + +++GG Y +L+ P +S
Sbjct: 235 ALLARHAVTVDLVS----LNNSRDLC--------WLLGGPHYSQKPYLMAPVPKMTKKSS 282
Query: 294 MPT---FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP-DKRKLPSIILVCCLLHNIII 349
M + +LH S + QLK W+ L + D + ++L CC+LHNI
Sbjct: 283 MSPEKYYTNLHAQAHSAVTETIKQLKARWKCLQATSNKQFDPPTVAKMVLACCVLHNICT 342
Query: 350 DSG 352
+ G
Sbjct: 343 EHG 345
>gi|328721185|ref|XP_003247234.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 244
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 109/223 (48%), Gaps = 27/223 (12%)
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFE---- 169
+A+ E+ +G F + ST +V + ++E + ++WP KF
Sbjct: 1 MANCETHRQIGNRFNIADSTSHKVVLNCLNNMKELSGKFIRWPRGQEAIITVQKFNCLRP 60
Query: 170 ESF-GLLNCCGAIDATHIIMTLP-AVQTSDDWCDQENNY------SMLVQGIVDHEMRFI 221
+F G+L GA+D HI + P +T + D+ Y S+L+QGIVD +++FI
Sbjct: 61 NAFPGVL---GAVDGCHISILAPWEKRTVMEKLDRNMFYNRKQVPSVLLQGIVDSDLKFI 117
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQR-LNGNVRISSEEVELREYIVGGVGYPLLSW 280
++ +GWPG + +R+ F+ GQ+ L+ ++ I E +I+G YPL
Sbjct: 118 EVFSGWPGSSHDARV------FRRSLIGQKLLSHDLSILPENC----HILGDGAYPLSEN 167
Query: 281 LITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
++ PY NG L+ + FN ++R + +AF +L +R L
Sbjct: 168 VMIPYRDNGNLTLAQKHFNRCLSSSRVVVEQAFGKLYCRFRKL 210
>gi|221110109|ref|XP_002160688.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 112/255 (43%), Gaps = 24/255 (9%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I ++++++ C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKEYDQNWNFPQCIGAIDGKHVHIEAPA-KSGFSFYNYKEFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G N +L S F G+ N +IS + ++ +VG
Sbjct: 146 YCFTMVDIGAYGRDNDVAILNASTF------GRAFNKGYFSLPKISEFDPKVPLVLVGDD 199
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
+ L WL+ PY L+ FN R +F L WRI + + K
Sbjct: 200 IFALKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSFGILAARWRIYRYPI-KAKPFK 258
Query: 334 LPSIILVCCLLHNII 348
+ II LHN +
Sbjct: 259 VEHIIKATVCLHNYL 273
>gi|147779878|emb|CAN65842.1| hypothetical protein VITISV_027369 [Vitis vinifera]
Length = 579
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 134/317 (42%), Gaps = 50/317 (15%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+ + TF +CS++R G + + + + VE+ VA+ L LA +
Sbjct: 72 LRMDRHTFTTLCSMLR-------TIGKLK-DSKYIDVEEMVALFLHILAHHVKNRVIKFR 123
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPD---SNRMEEIKSKFEESFGLLNCCGAIDA 183
F T+S+ + A+ LK P+ N +E F+ NC GA+D
Sbjct: 124 FLRSGETISRHFNAVLNAVIRLQGVLLKKPEPVSENSTDERWKWFK------NCLGALDG 177
Query: 184 THIIMTLPAVQTSDD--WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
T+I + V+ D + ++N + V G+ +M+FI ++ GW G + SR+L+
Sbjct: 178 TYIKVN---VREGDKPRYRTRKNEIATNVLGVCSQDMQFIYVLPGWEGSTSDSRVLR--- 231
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE------TNGLSASMP 295
+A R NG V Y + VGY + PY + MP
Sbjct: 232 -----DAVSRRNGLT------VPHGYYYLVDVGYTNGKGFLAPYRGQRYHLNDWREGHMP 280
Query: 296 T-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
T FN H A R++ + F LK W IL + P K + IIL CCL+HN+I
Sbjct: 281 TTHEEFFNMKHSAARNVIERCFGLLKLRWAILRSPCFYPIKTQC-KIILACCLIHNLIKR 339
Query: 351 SG--DQLHPDVALSDHH 365
D L ++ + DH
Sbjct: 340 EMXVDPLEQELDVQDHQ 356
>gi|156553248|ref|XP_001599841.1| PREDICTED: hypothetical protein LOC100115003 [Nasonia vitripennis]
Length = 319
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 125/290 (43%), Gaps = 43/290 (14%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
++ + FD I LVR L R + R L+ E ++A L LA G S F
Sbjct: 1 MTSEQFDEILELVRPQLTKRS-------KRRALTPEMRLAAVLNFLAHGNSIQKSAWMFL 53
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIM 188
+G+ST+ ++ A+ N +EE + + + NC GAID HI +
Sbjct: 54 IGRSTMYRLVIEVCTAI------------CNVLEE------KIWHMPNCFGAIDGKHIRV 95
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEA 248
P + + + + ++S+++ G+ D RFI + G G N +G F+ +
Sbjct: 96 KAPP-NSGSYFFNYKKHFSIVLMGLTDAFCRFIWVNIGDFGSSND------AGIFQRSDL 148
Query: 249 GQRLNGN-------VRISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFNSL 300
Q L+ + +V +I+G +PL ++L+ PY N L+ FN
Sbjct: 149 RQALDNEEIDIPAPTYLPRTDVLCPYFIIGDGAFPLKNYLMKPYTRANNLTHEEKIFNYR 208
Query: 301 HEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNIII 349
R +AF L WRIL S V W+ + + ++I+ LHN +I
Sbjct: 209 LSRARLTIERAFGILTKKWRILESPVDWK--LKNIETVIMALICLHNFLI 256
>gi|328724899|ref|XP_001949872.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 325
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/310 (24%), Positives = 140/310 (45%), Gaps = 24/310 (7%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
++E F+ FR+SK + + + + SR E L K++ +ALR A+
Sbjct: 38 DDEDFRTRFRISKDVVIQVLGFINDQISSRS-------EIMPLRPLKKLLLALRFYATEN 90
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
++ G GV ++T S + A+ + +K P + + ++ +F +
Sbjct: 91 FLITAGDFMGVSKTTASLIVRDVSTAIAKLRPIFVKMPIESEIPTMQKRFYKIARFPRTI 150
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID TH+ + P ++ + +++ +S+ VQ + +++ +D+V WPG + +LK
Sbjct: 151 GAIDCTHLKIQNPGGPNAEYYRNRKGYFSLNVQTVSCPDLKIMDVVARWPGSCHDYTVLK 210
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S FF +G+ N IV GY ++TP+ N + +N
Sbjct: 211 KSRFFNRLNSGEWGNS-------------LIVADSGYANSLRIVTPF-INPRNYIEELYN 256
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPD 358
TR+ +++ LK + ILS R + ++I+ CC+LHNI ID+ D P
Sbjct: 257 ESIIRTRNPVKRSYGVLKRRFPILS-FGSRLKLQTTQAVIVACCVLHNIAIDNKDTDPP- 314
Query: 359 VALSDHHDSG 368
AL + + G
Sbjct: 315 -ALLEFYQIG 323
>gi|449680707|ref|XP_004209655.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 199
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 82/165 (49%), Gaps = 5/165 (3%)
Query: 126 AFGVGQSTVSQVTWRFIEALEER-AKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDA 183
FG+ T S V + A+ + + P + +M E S+FE FG++ G +D
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFGMIQAFGCVDG 64
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI + P + D +C ++ YS+ VQ + D++ F+D+ WPG + +++ S
Sbjct: 65 THIPIVCPTNHSQDYFCYKQ-YYSLQVQAVCDYKGSFLDVEWLWPGSVYDAKVFSNSSIN 123
Query: 244 KLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPYET 287
+ RL G + I+ ++++ Y++GG YPLL + Y T
Sbjct: 124 TNLRS-LRLPGTFQTITKNKIKVPCYLIGGPAYPLLPHCMKEYST 167
>gi|328718510|ref|XP_001943626.2| PREDICTED: hypothetical protein LOC100160106 [Acyrthosiphon pisum]
Length = 476
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 127/303 (41%), Gaps = 29/303 (9%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
EE FK F +++ +F C + ++ G + + VEKQ+ + L L +
Sbjct: 165 EESNFKTHFFITRSSFQIACGHLMQEY------GSMAYKRTEFDVEKQLLVTLTYLGTVL 218
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF-----G 173
S +G F + ST + L +R + WP + + E +F E G
Sbjct: 219 SYKEIGSKFNIAISTAHKCVSDVSNTLFKRMGEIIYWPINQQAESEIEEFNEMTGNRFPG 278
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
+L G +D P+ T D S+ +Q + ++ +F ++ T + +V
Sbjct: 279 ILGVIGMVDLRKTCTANPSKHTKD-------GSSIAIQCVCNNRYQFYNVFTSYLLKSSV 331
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
+ S F + A LN R+ + +IVG +PLL L+TP+ + + S
Sbjct: 332 TS----STFLESPLAEMILNDPDRLFP---NIESHIVGQCCFPLLPNLMTPF-SGDTTVS 383
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGW-RILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
T+N E ++ AF +L G + R+L +W DK + +I LHNI + +
Sbjct: 384 QDTYNDAIEVPLNIIKLAFGKLLGRFSRLLCLDLWGQDKFAI--LIFAASCLHNICLSND 441
Query: 353 DQL 355
D +
Sbjct: 442 DDI 444
>gi|326501328|dbj|BAJ98895.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/348 (22%), Positives = 137/348 (39%), Gaps = 43/348 (12%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P+ + ++ FR+S+ F +L+R + P L + ++ AL RLA
Sbjct: 114 PALPDSAWRDAFRMSRPAF---YALLRSLALPDPADPAAASSSLSLPADHKLGAALYRLA 170
Query: 116 SGESQVSVGVAFGV-GQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGL 174
+V FG+ + ++ + A+ +R L +R+ F + L
Sbjct: 171 HAAPARAVARRFGLPSPADAARAFYEVCRAVADRLAALLDLAAPDRISRAVPGFC-ALSL 229
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
NCCGA+ ++ Q +VD E RF+D+ GW
Sbjct: 230 PNCCGALGYARF-----------------GGAAVTAQALVDAEGRFLDVSVGWDPSEAPP 272
Query: 235 RLLKFSGFFK-----LCEA--GQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
+L + + L A G+ + G+V Y + PLL WL+TPY
Sbjct: 273 EVLPRTKLYTSQSLVLANAPHGELIGGSVP---------RYFLAPACCPLLPWLVTPYRD 323
Query: 288 NGLSASMP--TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD-KRKLPSIILVCCLL 344
G + FN +H L AF +++ W++L + W+ + + LP +++ CLL
Sbjct: 324 AGDDGTSKERIFNDVHAQGARLVRSAFGRVRARWQLLEEC-WKGECQEALPYVVVAGCLL 382
Query: 345 HNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
HN +I G+ + PD A + + D G R+ L HL
Sbjct: 383 HNYLIKCGEPM-PDAAAPVDAGADAFADFEGENDKEGERIRDVLAVHL 429
>gi|195379634|ref|XP_002048583.1| GJ14048 [Drosophila virilis]
gi|194155741|gb|EDW70925.1| GJ14048 [Drosophila virilis]
Length = 359
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 134/304 (44%), Gaps = 31/304 (10%)
Query: 55 IPSDEEEGFKYFFRVSKKTFDYICSLVREDL-VSRPPSGLINIEGRLLSVEKQVAIALRR 113
+P+ EE F FR+ K+ F LV+ L + + + + + ++K++AIAL
Sbjct: 20 VPASSEEHFINLFRMEKQYFHV---LVKRLLGIKKSDTSF----RKAIPIDKRIAIALYT 72
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHL--KWPDSNRMEEIKSKFE- 169
L S +VG F V S V ++ F + ++ A +L ++ +++EE FE
Sbjct: 73 LGSTAEYSTVGDLFSVSPSMVGKILNDFCQEVQRVLAPEYLPKEFLTQSQLEECVRGFEA 132
Query: 170 ----ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
+ FG L C I+ ++ + + DW YS ++ +VDH RF+ +
Sbjct: 133 RGLPQCFGALGSC-LIEVNNVPGNIAEYYNTQDW------YSRILFALVDH--RFLYVNY 183
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
PGG + + S + A N ++ + V++ ++G + + L+TPY
Sbjct: 184 KSPGGRQNDEVYEESSLKSIVNASTLFKSNSKLIA-GVKVPVMLLGDSAFQCSTVLLTPY 242
Query: 286 ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP-SIILVCCLL 344
N FN+ + R + KA QLK + ++ D P IIL CC+L
Sbjct: 243 -ANPQGEQQKLFNNQLDECRRVVDKALRQLKARF---CRIFECLDNHSKPFDIILCCCIL 298
Query: 345 HNII 348
HN +
Sbjct: 299 HNYL 302
>gi|345560284|gb|EGX43409.1| hypothetical protein AOL_s00215g145 [Arthrobotrys oligospora ATCC
24927]
Length = 515
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 133/322 (41%), Gaps = 36/322 (11%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
+P+ + + F+ FRV+ F +I + + + P G R S E Q+ IALRR
Sbjct: 135 VVPNLDNDRFRQNFRVNPDAFAFIL----QKISTNPVFG-----DRQGSAELQLKIALRR 185
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKF-EESF 172
+ + FG+G+ TV T R AL E +++WP +K++ ++ F
Sbjct: 186 FGGLDDVSQIAQKFGIGEGTVVLYTQRVAGALMELWSEYVRWPTVEEQAAMKARLRQKDF 245
Query: 173 GLL-NCCGAIDATHI-IMTLPAVQTSDD---WCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ +C G ID T T PA D + +++ Y + D + R + +
Sbjct: 246 AVWEDCVGFIDGTMFPFATRPAFGKEDARNYYNMRKHAYGQHATVVCDDQNRITHFTSLF 305
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
PG ++ R + + F+ E + +Y++G GY L LI P++
Sbjct: 306 PGSVSDQRAFRVTDLFQKPEEFFK------------NQYQYLLGDKGYALNERLIIPFK- 352
Query: 288 NGLSASMPT----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWR-PDKRKLPSI---IL 339
S P FN + R A LK +R L ++ R + KL I
Sbjct: 353 QPRSGKPPKEQRRFNWKLSSLRVKAEHTIGILKLRFRSLQRLPVRLVSQEKLSEALRWIG 412
Query: 340 VCCLLHNIIIDSGDQLHPDVAL 361
C +LHN+++D D+ P +
Sbjct: 413 ACVVLHNMLVDFRDEWEPTTEM 434
>gi|357140827|ref|XP_003571964.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 272
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 77/186 (41%), Gaps = 30/186 (16%)
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
S+F F NC GAID TH+ + +PA T + + + V G+ D +MRFI +V
Sbjct: 63 SRFYPHFN--NCIGAIDGTHVPVIVPAKDTVNH-VGRAGYATHNVMGVCDFDMRFISVVA 119
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
GWPG L SG+ + G L EY G
Sbjct: 120 GWPGKF----YLVDSGYPNQTGYLEPFRGT------RYHLPEYQHGAAA----------- 158
Query: 286 ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLH 345
S FN LH + R++ + F LK WR+L V P RK II+ C LH
Sbjct: 159 -----SGRKEIFNYLHSSLRNVIERCFAVLKQRWRMLKAVPSYP-PRKQARIIIACMALH 212
Query: 346 NIIIDS 351
N I DS
Sbjct: 213 NFIQDS 218
>gi|449671111|ref|XP_004207430.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 192
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 90/190 (47%), Gaps = 6/190 (3%)
Query: 126 AFGVGQSTVSQVTWRFIEALEER-AKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDA 183
FG+ T S V + A+ + + P + +M E S+FE F ++ G +D
Sbjct: 5 TFGIAVCTTSAVIFEVCNAIVKYIGPRFVNLPKNKQQMREKISEFESKFRMIQAFGCVDG 64
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI + P + D +C ++ YS+ VQ + D++ F+D+ WPG ++ +++ S
Sbjct: 65 THIPIVCPTNHSQDYFCYKQ-YYSLQVQAVCDYKGSFLDVECMWPGSVHDAKVFSNSSIN 123
Query: 244 KLCEAGQRLNGNVR-ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
+ RL G + I+ ++++ Y++G YPLL + Y T + FNS+
Sbjct: 124 TNLRSS-RLPGTFQTITKNKIKIPCYLIGDPAYPLLPHCMKEYSTCKKKDEV-IFNSMLR 181
Query: 303 ATRSLAVKAF 312
R+ AF
Sbjct: 182 TARNPIECAF 191
>gi|331228536|ref|XP_003326935.1| hypothetical protein PGTG_08472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309305925|gb|EFP82516.1| hypothetical protein PGTG_08472 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 423
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 110/272 (40%), Gaps = 32/272 (11%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ V +V+ R IEAL + ++ WPD
Sbjct: 138 LPIAHQLALTLERLGSNGNGASVGRFSRNLSVGRGRVVKVSRRVIEALISLGRRYVVWPD 197
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ R EI S+ G C G +D T I M + + D++ YS+ Q I D +
Sbjct: 198 AARRAEI-SEVMSREGFRGCVGFVDGTTIPMFQRPGYDGEVFFDRKRRYSINAQIICDCD 256
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+TGWPG SR+ K +L+ N S + +Y++ Y L
Sbjct: 257 KYITSFITGWPGSCGDSRVYK----------RMQLHLN---PSNYFDEGQYLLADSAYEL 303
Query: 278 LSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMWR-------- 328
+I Y+ + ++ + FN R LK W L ++
Sbjct: 304 SHTVIPAYKVPAANITINSQFNFCLAKARVRNEHTIGVLKSRWSSLREMRLHLYHCQHMR 363
Query: 329 ------PDKRKLPSIILVCCLLHNIIIDSGDQ 354
R S + C +LHN++ GDQ
Sbjct: 364 LHLYHCQHMRAYVSWLYSCIILHNLLAGLGDQ 395
>gi|357138912|ref|XP_003571030.1| PREDICTED: uncharacterized protein LOC100837391 [Brachypodium
distachyon]
Length = 664
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 128/303 (42%), Gaps = 41/303 (13%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGV 125
R+ ++ F C L+R+ + + + + VE+QVA+ L + V
Sbjct: 67 MLRLRREPFFRFCKLLRDRQLLQ--------DTIHMCVEQQVAMFLNTVGHNLRNRLVAT 118
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATH 185
F TVS+ + A+ E ++ P +I+ + +C GAID TH
Sbjct: 119 NFNRSGETVSRYFNLVLHAIGELRNELIRKPSMETPTKIQGNYRRDPYFQDCVGAIDGTH 178
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
I ++ +C ++++ S V VD ++RF ++ GW G + + +L+
Sbjct: 179 IRASVTKAMEP-AFCGRKSHASQNVMAAVDFDLRFTYVLAGWEGAAHDALVLR------- 230
Query: 246 CEAGQRLNGNVRISSEEVELREYIVG----------GVGYPLLSWLITPYETNGLSASMP 295
+A +R NG +R+ + L + G GV Y L W P +
Sbjct: 231 -DALERPNG-LRVPQGKYYLVDAGYGAKPGFLPPFRGVRYHLNEWGNNPVQNEN-----E 283
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS---IILVCCLLHNIIIDSG 352
FN H + R +AF LK ++IL P R P+ I++ CC++HN ++ G
Sbjct: 284 LFNLRHSSLRVTVERAFGSLKRRFKILDDAT--PFFR-FPAQVDIVVACCIIHNWVL--G 338
Query: 353 DQL 355
D++
Sbjct: 339 DEI 341
>gi|389750746|gb|EIM91819.1| hypothetical protein STEHIDRAFT_46304 [Stereum hirsutum FP-91666
SS1]
Length = 353
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 107/238 (44%), Gaps = 33/238 (13%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA---SG 117
E F+ + RV+ FD + + +++D P+ N + VE+Q+AIAL R +
Sbjct: 132 EIFRSYLRVTPDCFDALVNAIKDD-----PAFHNNSNQVQMPVEEQLAIALYRFGHYGNA 186
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH--LKWPDSNRMEEIKSKFEESFGLL 175
S + V + FGVG TV + T R + AL A + ++W + +E K K +E G
Sbjct: 187 ASTMKVALWFGVGYGTVRKATTRVMLALNSEAFRNSTVQWSN----QEAKEKAKEWVGDR 242
Query: 176 NCCG------AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+C +D T + + + W D+++NYSM VQ + + ID TG PG
Sbjct: 243 SCPAWRDGWCMVDGTLVPLYQRPAFFGNTWFDRKHNYSMTVQVVSTPDCLIIDFCTGLPG 302
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
+ + + RL R+ ++ E++ YPL W +PY++
Sbjct: 303 SQHDATAWR----------DTRLYQQHRLLLKD---GEWVWADSAYPLADWTQSPYKS 347
>gi|345492547|ref|XP_003426874.1| PREDICTED: hypothetical protein LOC100678796 [Nasonia vitripennis]
Length = 436
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 145/363 (39%), Gaps = 43/363 (11%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLV-----REDLVSRPPSGLINIEGRLLSVEKQVAIA 110
P D E+ F+Y R+ +T++ + L+ ++ L+ P + ++ I
Sbjct: 87 PVDYEKYFEYI-RMDVETYNELFKLLEPKIKKQFLIKYP-----------IPAHTRLQIC 134
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFE 169
LR LASG+S S+ AF V +TVS++ +A+ + + P + +E FE
Sbjct: 135 LRYLASGDSMTSISSAFKVANNTVSKIVTDTCQAIWDVLSNLVFPTPSVDLWKERAHSFE 194
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+ +C GAID+ +++ P V T ++C + N ++++ I+D F I
Sbjct: 195 TQWDFTHCIGAIDSRQLVLQTP-VNTESNYC--KTNETIVLCAIIDANSCFSLI------ 245
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNV-------RISSEEVELREYIVGGVGYPLLSWLI 282
++V +L+ F E G L N +I+ EL + G +PL L+
Sbjct: 246 NIDVEEVLEDGINFAASETGCHLQDNTYDIPAPSKINETGPELPYVLAGNENFPLTVNLL 305
Query: 283 TPY-ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
PY T+ L FN R + AF L WRI K +
Sbjct: 306 RPYAATSDLDLPKQVFNYRIGRARKCSDCAFRILTNQWRIFHKKFSNSASNTIKITKAAI 365
Query: 342 CLLHNIIIDSGDQLHP----DVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQE 397
C LHN +++ + P +L D Y C V EN E N
Sbjct: 366 C-LHNFLMNKENS-RPFQDRRYSLLSESDREYNILGCTNVSISSEIPSENAET--VRNMF 421
Query: 398 KDL 400
KDL
Sbjct: 422 KDL 424
>gi|449666348|ref|XP_004206332.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 204
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 90/166 (54%), Gaps = 8/166 (4%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
+S + ++++ S+V + +P +++S +++ + LR LA+G+SQ++ F
Sbjct: 1 MSPERYEHLLSMVAPSITKKPCQS-----RQIISPSERLTVTLRYLATGDSQLTQAFYFR 55
Query: 129 VGQSTVSQVTWRFIEALEERAKH-HLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHI 186
+G++T +T +A+ + K +LK P + N EE+ ++FE + NC GAID H+
Sbjct: 56 LGRTTACNITNETTKAIWDVLKPCYLKAPSTLNEWEELANQFENEWNFPNCIGAIDGKHV 115
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
+ P++ S +C+ +N +SM++ I D + F + G G N
Sbjct: 116 CIKAPSLSGS-AYCNYKNFHSMVLLAICDAKYCFTLVDIGSYGRDN 160
>gi|449678194|ref|XP_004209026.1| PREDICTED: uncharacterized protein LOC100199174 [Hydra
magnipapillata]
Length = 449
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 85/340 (25%), Positives = 125/340 (36%), Gaps = 88/340 (25%)
Query: 40 WPSFWAKNSSTPGATIPSDEEEG-FKYFFRVSKKTFDYICSLVREDL-----VSRPPSGL 93
W +FW D+EE FK FRV++ TF+++ + + L R P
Sbjct: 154 WKAFWPY----------FDQEECEFKEHFRVNRNTFNFLVNQLHPHLGKTTTTMREP--- 200
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA-----LEER 148
+SV K++A AL LAS E V FG+G+ST + + FI A L +
Sbjct: 201 -------ISVVKRIAFALHYLASCEEYRVVSCLFGIGKSTENLIVHEFINAVNDILLPKY 253
Query: 149 AKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
AK L +SN+ FE G C GA+D HI ++ P Q + NY
Sbjct: 254 AKFPLSVENSNKHSR---DFEAILGFPQCVGAVDGCHIPISAPKDQAISYY-----NYKG 305
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREY 268
+ + D LC
Sbjct: 306 CCKELGD-------------------------SLVPLCS--------------------- 319
Query: 269 IVGGVGYPLLSWLITPYETN-GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
+G +PL L+ PY N LS FN + R + AF ++K +R++ K M
Sbjct: 320 -IGDSAFPLTRHLLKPYPENLELSEIQKNFNEILCGARRVVENAFGRVKARFRVICKRM- 377
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
+ I+ C LHNI D + + + H DS
Sbjct: 378 ECNINFATRIVNACITLHNICEYYDDIIIIEWLMHHHDDS 417
>gi|301616303|ref|XP_002937601.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 269
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 90/176 (51%), Gaps = 20/176 (11%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GAID TH+ +T P + + +++ ++S+ VQ + D +R + + +G+PG ++ +
Sbjct: 77 NCLGAIDCTHVPLTPPRAH-QEHYLNRKRSHSINVQVVCDSHLRIMSMRSGFPGSVHDAH 135
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+L+ S ++ G+ +G ++VG GY +L W++TP A
Sbjct: 136 ILRQSALYERFTQGEMPHG-------------WLVGDAGYGVLPWMMTPVRFPRTPAQR- 181
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILS---KVMWRPDKRKLPSIILVCCLLHNII 348
+N H TR++ + F+ L +R LS ++ P K+ II+VC +L I+
Sbjct: 182 RYNCAHRKTRNVIERLFVVLMSRFRCLSVTGGALYSPI--KVSGIIVVCAMLQCIM 235
>gi|331222274|ref|XP_003323811.1| hypothetical protein PGTG_05713 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309302801|gb|EFP79392.1| hypothetical protein PGTG_05713 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 110/261 (42%), Gaps = 22/261 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ TV + + R I+A+ + + +L WPD
Sbjct: 138 LPIAHQLALTLERLGSNGNGASVGRFSRNLSVGRGTVVKASRRVIQAINDLSGRYLLWPD 197
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
++R EI +E G C G +D T I + + + D++ YS+ Q I D +
Sbjct: 198 TDRRREISEVMKEE-GFEGCVGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCD 256
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+TGWPG S + K L + ++ G+ +Y++ Y L
Sbjct: 257 QFITAYMTGWPGSCGDSMVFK---RMLLHQEPEKFFGD----------GQYLIADSAYEL 303
Query: 278 LSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRP-DKR 332
I Y+ T FN +R LKG W L + + +P D
Sbjct: 304 GVHCIPAYKAPAAFILENTEFNYCLARSRVRNEHTIGILKGRWASLQDLRLAIQKPTDMI 363
Query: 333 KLPSIILVCCLLHNIIIDSGD 353
++ I C LHN++ GD
Sbjct: 364 EIIRWINCCVTLHNMLAHLGD 384
>gi|328717980|ref|XP_001947852.2| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 362
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 110/242 (45%), Gaps = 8/242 (3%)
Query: 112 RRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEE 170
R A+G+S ++G +F VG STVS + E++ + + P +N EE F E
Sbjct: 29 RYFATGDSFSTIGHSFRVGFSTVSNIVTEVSESIIKVMGPMFMPEPTTNIWEESARGFYE 88
Query: 171 SFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
+ NC G+ID H+ + P S ++C + +S+++ IVD + +FI + G G
Sbjct: 89 KWQFPNCIGSIDGKHVTIKSPNNSGSRNFCYLK-KFSIVLMAIVDPDYKFICVDVGGYGR 147
Query: 231 MNVSRLLKFSGFFKLCEAGQ-RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
+ +L+ S K EAG + N + + + ++G + L +L+ P+
Sbjct: 148 NSDGGILEESTMGKRLEAGTLNVPQNAPLPGQGEDTPMVLIGDEAFALKPYLMKPFPRR- 206
Query: 290 LSAS---MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
LS S + +N R + AF L WR+ K + I+L C+LHN
Sbjct: 207 LSRSDTRLDNYNYRLCRARRVVENAFGILTKKWRVY-KGPIEVKEETTKKIVLATCILHN 265
Query: 347 II 348
+
Sbjct: 266 YL 267
>gi|195390819|ref|XP_002054065.1| GJ24230 [Drosophila virilis]
gi|194152151|gb|EDW67585.1| GJ24230 [Drosophila virilis]
Length = 347
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 73/320 (22%), Positives = 135/320 (42%), Gaps = 39/320 (12%)
Query: 48 SSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQV 107
+S + S + F FRV K+TF +I ++ D L+ Q+
Sbjct: 26 TSRANTCVLSLPDARFIKLFRVDKETFKWILRKIQPDFTLS-----------YLTPALQL 74
Query: 108 AIALRRLASGESQVSV--GVAFGVGQSTVSQVTWRFI---EALEERAKHHLKWPDSNRME 162
A L LA+G ++SV +G+ST S++ I EA+ R L+ +++
Sbjct: 75 AATLHFLATGSYKLSVSRNRPVNIGRSTFSRILHNVIPLMEAVLCREAITLRMS-QQQIK 133
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
+ + F E++ L +D I + P V+ + + ++ YS+ + ++ M +
Sbjct: 134 QSRDYFYENYQLPRVVACVDGMQIKILKP-VRDETIYFNDKDYYSINAMLVCNYNMEILA 192
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLI 282
I PG + + S +A + L+ NV+ Y++ Y L ++++
Sbjct: 193 IDCTHPGSCSDHYIWSNS------QAREYLSRNVKC--------HYVLADTAYALENYVL 238
Query: 283 TPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCC 342
TPY + FN H A RS+ + LK + + + + D R + +++ VCC
Sbjct: 239 TPYRNADQDSVEQRFNERHAAARSVVQRTINLLKSRFGCMQRALM-YDARFVANVVNVCC 297
Query: 343 LLHNII------IDSGDQLH 356
LHNI +D + LH
Sbjct: 298 ALHNICRRRRTPLDEREMLH 317
>gi|326320058|ref|NP_001191896.1| nuclease HARBI1-like [Acyrthosiphon pisum]
gi|239790602|dbj|BAH71852.1| ACYPI004890 [Acyrthosiphon pisum]
Length = 369
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 127/287 (44%), Gaps = 33/287 (11%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVE--KQVAIALRRLASGESQVSVG 124
FR+ K TF YI + L + R+ S+ +++ +ALR A+G ++ G
Sbjct: 52 FRMKKTTFMYILGKIENTLST---------AKRMTSITPMQKLLVALRFYATGNFLITAG 102
Query: 125 VAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS---KFEESFGLLNCCGAI 181
G+ + + ++ + AL E ++K P + +EEI+ F + + GAI
Sbjct: 103 ELVGISEPSACRIVQQVTHALAELRPDYVKMPQN--VEEIRQVSYHFYKIAKMPRVIGAI 160
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D THI + P + ++ + +++ +SM VQ + D ++ +DIV WPG ++ + S
Sbjct: 161 DCTHIRIHSPGGENAEYFRNRKGYFSMNVQTVADANLQILDIVVRWPGSVHDQTIFLNSK 220
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP-YETNGLSASMPTFNSL 300
E G R N+ +V GY +ITP +TN + +N
Sbjct: 221 LKTDLENG-RFGNNL------------LVADSGYANSQHVITPLLQTNNRVEEL--YNES 265
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
TR++ + + K + ILS +R + I+ C +L NI
Sbjct: 266 LIRTRNVVERQYGVWKRRFPILS-FGFRIKLETTMAAIVACAVLFNI 311
>gi|449681091|ref|XP_002162517.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/253 (25%), Positives = 114/253 (45%), Gaps = 20/253 (7%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQV-------TWRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++++ C GAI H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKEFDQNWNFPQCIGAIGGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSE-EVELREYIVGGVGYP 276
F + G N + +L S F + G N+ +SE + ++ +VG +
Sbjct: 146 YCFTMVDIAAYGRDNDAVILNASTFGRAFNKGY---FNLPKTSEFDPKVPPVLVGDDIFA 202
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLP 335
L WL+ PY L+ FN R +F L WRI S + +P K+
Sbjct: 203 LKPWLMKPYPGKNLTVQQRVFNYRLSRARRTIENSFGILAARWRIYRSPIKAKP--LKVE 260
Query: 336 SIILVCCLLHNII 348
II LHN +
Sbjct: 261 HIIKATVCLHNYL 273
>gi|148228010|ref|NP_001088509.1| putative nuclease HARBI1 [Xenopus laevis]
gi|82180108|sp|Q5U538.1|HARB1_XENLA RecName: Full=Putative nuclease HARBI1; AltName: Full=Harbinger
transposase-derived nuclease
gi|54311411|gb|AAH84846.1| LOC495378 protein [Xenopus laevis]
Length = 347
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/312 (25%), Positives = 128/312 (41%), Gaps = 32/312 (10%)
Query: 74 FDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQST 133
DY+ L+ L SRP R +S E Q+ AL SG Q +G G+ Q++
Sbjct: 46 IDYLVDLLGASL-SRPTH-----RSRAISPETQIMAALGFYTSGSFQTRMGDTIGISQAS 99
Query: 134 VSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPA 192
+S+ EAL ERA + +P D ++ +K +F G+ G +D T + + P
Sbjct: 100 MSRCVTNVTEALVERASQFISFPRDERSVQGLKDEFYNLAGVPGVLGVVDCTQVNIKAP- 158
Query: 193 VQTSDD--WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ 250
S+D + + +S+ + D + T G M + +L S L E
Sbjct: 159 --NSEDLSYVNSRGLHSLNCLLVCDARGSLLWAETSRLGSMQDNAVLHQSELSGLFETKM 216
Query: 251 RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVK 310
G +++ + L WL+TP + S S +N H AT S+ +
Sbjct: 217 HKQG-------------WLLADNAFILRPWLMTPVQIPE-SPSDYRYNMAHTATHSVMER 262
Query: 311 AFLQLKGGWRIL--SKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSG 368
L+ +R L S+ + K I+L CC+LHNI + Q D+ S
Sbjct: 263 TQRSLRLRFRCLDGSRATLQYSPEKSAQIVLACCILHNIAL----QHDLDIVSESGATSL 318
Query: 369 YGEQCCKQVDPM 380
E+ C ++P+
Sbjct: 319 EPEEECVHMEPL 330
>gi|403175037|ref|XP_003333918.2| hypothetical protein PGTG_15341 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171427|gb|EFP89499.2| hypothetical protein PGTG_15341 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 332
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 67/132 (50%), Gaps = 4/132 (3%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ TV + T R IEAL + ++ WP+
Sbjct: 126 LPIAHQLALTLERLGSNGNAASVGRFSRNLNVGRGTVIKATRRVIEALVSLGRQYVMWPN 185
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ R EI S+ ++ G + C G +D T I + + + D++ YS+ Q I D +
Sbjct: 186 TTRRAEI-SEVMKNEGFVGCVGFVDGTTIPLFQRPGYDGEVFYDRKRRYSINAQIICDCD 244
Query: 218 MRFIDIVTGWPG 229
+TGWPG
Sbjct: 245 KYITSFITGWPG 256
>gi|108707497|gb|ABF95292.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 311
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 87/207 (42%), Gaps = 43/207 (20%)
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY-SMLVQGIVDHEMRFID 222
+ +FE F NC GAID THI + +PA + N Y + V I D +MRF
Sbjct: 78 LGDRFEPHFN--NCIGAIDGTHIPVVVPASKVVQHV--GRNKYPTQNVLAICDFDMRFTF 133
Query: 223 IVTGWPGGMNVSRLL-----KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
IV GWPG + R+ K++ F G + Y+V GYP
Sbjct: 134 IVAGWPGSAHDMRVFNDALRKYAAIFPHPPPG----------------KFYLVDS-GYPN 176
Query: 278 LSWLITPYETNGLSASMP-------------TFNSLHEATRSLAVKAFLQLKGGWRILSK 324
++PY+ G +P FN LH + R++ ++F LK WRIL
Sbjct: 177 QLGYLSPYK--GTKYHLPEFREGPRPSGKKEVFNHLHSSLRNVIERSFGVLKMKWRILLD 234
Query: 325 VMWRPDKRKLPSIILVCCLLHNIIIDS 351
+ P K II C LHN I DS
Sbjct: 235 LPSYP-MLKQTKIIHACMALHNFIRDS 260
>gi|108707827|gb|ABF95622.1| transposon protein, putative, CACTA, En/Spm sub-class, expressed
[Oryza sativa Japonica Group]
Length = 747
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 86/194 (44%), Gaps = 35/194 (18%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + ++ N V I D +MRF ++ GWPG
Sbjct: 524 LDNCIGAIDGTHIQVVVPNATAVQHRNSHKERSQN----VLCICDFDMRFTFVLAGWPGS 579
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
++ R+ +A R EE+ + GYP + PY+ G+
Sbjct: 580 VDDMRVFS--------DAQNRFGHKFPWPPEEM----FYFMDSGYPNRPGYLAPYK--GI 625
Query: 291 S--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
+ MP FN H + R++ ++F LK WRIL + P ++K I
Sbjct: 626 TYHFQEYNEGIMPQNRKEYFNYCHSSCRNVIERSFGVLKNKWRILFHLPSYP-QQKQSMI 684
Query: 338 ILVCCLLHNIIIDS 351
I C LHN I DS
Sbjct: 685 ICACLALHNFIRDS 698
>gi|346473657|gb|AEO36673.1| hypothetical protein [Amblyomma maculatum]
Length = 424
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 136/332 (40%), Gaps = 27/332 (8%)
Query: 41 PSFWAKNSSTPGATIPSDEEEG----FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINI 96
P+F + + T+ + +G F +R++ + FD + S V +DL + ++
Sbjct: 5 PAFLERKQKSLYFTLMREVRDGKLSNFTKHYRMTPRLFDVLLSFVEDDLTRQ------HV 58
Query: 97 EGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKW 155
L +++AI L LASG+ V + VG T A+ +H +K
Sbjct: 59 VREPLEPGERLAITLSYLASGKDIREVANMYLVGIETARISIHLTCRAIWTNLRHRFMKV 118
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
P +I F E + NC GAI H+ + P+ ++S + + +N S+++ VD
Sbjct: 119 PTGEDWCQIAEAFAEQWQFPNCVGAIGGRHVTIATPS-RSSGGYLNHKNTSSVVLLAAVD 177
Query: 216 HEMRFI--DIVT-GWPGGMNV---SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
R+I D+ T P G N+ S L K +C + + + L Y+
Sbjct: 178 SSCRYILVDVCTESRPLGSNIFEDSELGK-----AICSGALGVPTAASLPNTGGTLAPYV 232
Query: 270 -VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK-VMW 327
VG + L++P+ + FN + A A WR+L + V
Sbjct: 233 FVGDDTFSPRKHLVSPFPGEQVEDENAVFNYRLNRAQRCAENALGLTAARWRVLLRTVHL 292
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGDQLHPDV 359
+P + +I C+LHN +ID Q DV
Sbjct: 293 KPC--NIDYVIKATCMLHNFLIDMNTQQFADV 322
>gi|221110397|ref|XP_002154191.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 320
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 99/202 (49%), Gaps = 12/202 (5%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG-V 125
FR + + Y+ L++ DL +P S + +L Q+ ALR A G Q+ VG
Sbjct: 50 FRFRRASIFYLTDLLQIDL--QPKSNQTLVVAPIL----QLLCALRFYAIGSFQIVVGDS 103
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDAT 184
+ + T+S++ R +L +R ++K+P + + + E + KF E G ID T
Sbjct: 104 TAALSEPTISRIIRRVSLSLAKRINEYIKYPTNQHVLNESRVKFYEIAEFPKVTGVIDCT 163
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
HI + P + + D +N+S+ VQ + D++ +FID+V WP + +R+L+ S K
Sbjct: 164 HICIQKPH-EHKYAYVDSSSNHSINVQAVCDNKGKFIDVVAKWPWSTHDARILRESKLGK 222
Query: 245 LCEAGQ---RLNGNVRISSEEV 263
G L G R+ E+V
Sbjct: 223 KFMDGTFKGLLIGKTRVIIEQV 244
>gi|302756253|ref|XP_002961550.1| hypothetical protein SELMODRAFT_76044 [Selaginella moellendorffii]
gi|300170209|gb|EFJ36810.1| hypothetical protein SELMODRAFT_76044 [Selaginella moellendorffii]
Length = 121
Score = 72.0 bits (175), Expect = 4e-10, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 62/116 (53%), Gaps = 1/116 (0%)
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+K FE NC GAID+ H + LP D+ +++ YS+++Q IVD E F+DI
Sbjct: 7 VKISFESLHNFPNCYGAIDSMHFEIKLPKNAFVSDYYNKDKAYSIVMQAIVDSEATFLDI 66
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
G P N +L+ S F+K ++ + NG V +S + + EYI+G Y LS
Sbjct: 67 QVGNPRSCNNITVLRKSSFYKRAQSIEIFNGQV-VSRGQYQFWEYIIGDSRYYELS 121
>gi|301617821|ref|XP_002938332.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 361
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 109/251 (43%), Gaps = 5/251 (1%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSN 159
+S E+++ + LR LA+G+S S+ F +G++T+ ++ + E + + PD N
Sbjct: 58 ISPEERLCLTLRFLATGQSFSSLYFQFLIGRTTIGRIVRETCLLIWSELQRIVMPSPDEN 117
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+I F + NC GA+D HI +T+P + + + + +S+++ +VD
Sbjct: 118 TWVDIAEDFHKKTNFPNCLGALDGKHIRVTMP-FNSGSKYFNYKKYFSVVLLAVVDANYC 176
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFK-LCEAGQRLNGNVRISSEEVELREYI-VGGVGYPL 277
F I G G + + S + L E RL + Y+ VG + L
Sbjct: 177 FTIIDVGAYGSTGDASAFRNSALGRQLTEGTLRLPLPKPLPGTAAPPMPYVFVGDEAFGL 236
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
++ PY + S FN R + AF L WR+ + + + + I
Sbjct: 237 AENIMRPYPGSQRSVQKRLFNYRLSRARRMVECAFGILANKWRVFHTAL-QLEPEFVDKI 295
Query: 338 ILVCCLLHNII 348
I CC+LHN +
Sbjct: 296 IKACCVLHNFV 306
>gi|328700912|ref|XP_003241424.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 429
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 67/306 (21%), Positives = 132/306 (43%), Gaps = 28/306 (9%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR---LLSVEKQVAIALRRLA 115
++ F+ FR+SK + + L+R++ I++E + ++ ++ + LR A
Sbjct: 140 DDHDFRMRFRLSKVVVEQVLDLIRDN---------ISVENQWNCAIAPIDKLLLTLRFYA 190
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIKSKFEESFGL 174
+G ++ G GV +S+ + A+ + P ++ + ++ F +
Sbjct: 191 TGSFLITAGDFLGVSKSSACVIVRTVSTAIARLCHQFIYMPTTDEEVYTLQRSFYKIARF 250
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
GAID THI + P ++ + +++ +S+ VQ +V +++ +DIV WPG +
Sbjct: 251 PRAIGAIDCTHIRIQSPGGHNAEYFRNRKGYFSLNVQTVVSPDLKIMDIVARWPGSCHDQ 310
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+ + S G+ N IV GY S ++TP+ N
Sbjct: 311 TIFRNSNIHSQLVNGKWGN-------------SLIVADSGYKNTSHIVTPF-INPRGNIE 356
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
+N TR+ + + LK + ILS + R ++I+ C +LHNI ++ D
Sbjct: 357 ELYNESIIRTRNPVERTYGVLKRRFPILS-LGLRLKLTTSQAVIVTCSILHNIACNNNDL 415
Query: 355 LHPDVA 360
PD+
Sbjct: 416 EAPDLV 421
>gi|189442523|gb|AAI67676.1| LOC100170595 protein [Xenopus (Silurana) tropicalis]
Length = 328
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 112/257 (43%), Gaps = 20/257 (7%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSN 159
+S E+++ + LR LA+G+S S+ + F +G++T+ ++ L E + + PD N
Sbjct: 28 ISPEERLCLTLRFLATGQSFSSLYIQFHIGRTTIGKIVRETCLLLWSELQRIVMPSPDEN 87
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ +I F E NC GA+ HI + +PA + S + + ++ +L V++
Sbjct: 88 KWMQIAEDFHEKVNFPNCAGALGGRHIRVRMPANRGS-KYYKKNSSVVLLAAADVNYCFS 146
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
ID+ + G N S F+ E G++L+ L +V + Y L
Sbjct: 147 VIDVGSYHSTG-NASA-------FQNSELGRQLSERTLHLPLPKPLPGTVVPNMPYVFLG 198
Query: 280 --------WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDK 331
+I PY + S FN + + AF L W +L +PD
Sbjct: 199 DEAFGLAENVIQPYPGSQKSVQKRVFNYRFSRAQRILECAFGILSNKWHVLHTAKLKPD- 257
Query: 332 RKLPSIILVCCLLHNII 348
+ ++I CC+LHN +
Sbjct: 258 -FVSTVIKACCVLHNFV 273
>gi|390357388|ref|XP_792559.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 182
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 56/187 (29%), Positives = 90/187 (48%), Gaps = 19/187 (10%)
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
RF DI+ GWPG ++ +R+L S ++ +AG L + E V++ +++G YPLL
Sbjct: 5 RFTDILVGWPGCVHDARVLSNSSLYQKAQAGTLLPLQTK-EIEGVQVPLFVIGDPAYPLL 63
Query: 279 SWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
W++ Y G L+ FN R + +AF LK WR L K+ ++ K+ ++
Sbjct: 64 PWMMKGYADCGRLTPRQQNFNYRLSRARMVVERAFGLLKMRWRSLYKI----NESKVENV 119
Query: 338 I---LVCCLLHNIIIDSGDQLHPDV------ALSDHHDSGYGEQCCKQVDPMGRTTRENL 388
I C+LHNI + D L D+ +HH + E+ + T R L
Sbjct: 120 IHMVTAACVLHNICEIARDTLDLDLYEENIRQQQNHHH--HPERAAD--NNRAETYRTAL 175
Query: 389 EKHLQHN 395
+HLQ+N
Sbjct: 176 TEHLQNN 182
>gi|357117801|ref|XP_003560650.1| PREDICTED: uncharacterized protein LOC100830885 [Brachypodium
distachyon]
Length = 579
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 123/302 (40%), Gaps = 61/302 (20%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS 158
R +S + +A+ L + ES V F + T+ + + L + A +K D
Sbjct: 269 REMSSMEALAMFLWTVGGPESVTQVENRFKRSKETIHRKFEHVLGCLTQLAADIIKPRDP 328
Query: 159 N------RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
R+++ S+F F C GAID THI + +PA +D + Y +Q
Sbjct: 329 QFPTVHERLQD--SRFSPHFN--GCIGAIDGTHIRVVVPA----EDIANHVGRYGYPIQN 380
Query: 213 IV---DHEMRFIDIVTGWPGGMNVSRLLK-----FSGFFKLCEAGQRLNGNVRISSEEVE 264
++ D +MRFI IV GWPG + +R+ K +S F +G
Sbjct: 381 VMAVCDFDMRFISIVAGWPGSAHDTRIFKDTLITYSENFPHPPSG--------------- 425
Query: 265 LREYIVGGVGYPLLSWLITPYETNGLSASMP-----------------TFNSLHEATRSL 307
R Y+V GYP + + PY+ G +P FN H +++
Sbjct: 426 -RYYLVDS-GYPNQTGYLAPYK--GQKYHLPEFRRGIEEGRHPTGKKEIFNHAHSQLQNV 481
Query: 308 AVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
++F LK WRIL V +++ II+ C LHN I +S LH D D
Sbjct: 482 IERSFGVLKMKWRILLHVSSYAIEKQT-RIIVACMALHNWIRESA--LHDDEFYKCDQDE 538
Query: 368 GY 369
Y
Sbjct: 539 NY 540
>gi|322797948|gb|EFZ19802.1| hypothetical protein SINV_05984 [Solenopsis invicta]
Length = 334
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 125/280 (44%), Gaps = 33/280 (11%)
Query: 78 CSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQV 137
C L ++D + + P +++A+ LR LASG+S S+ + VG +T S +
Sbjct: 15 CKLQKQDFIRQSPP-------------ERLALTLRYLASGDSMTSMSYQYLVGVTTASTI 61
Query: 138 TWRFIEALEERAKHHLKWPDSNRMEE-----IKSKFEESFGLLNCCGAIDATHIIMTLPA 192
+ + E L P N+++E I + F E + ++C AID H+I+ P
Sbjct: 62 IHETCKVIWEDL-FPLVLP--NQLQEKDWLDIANDFNEKWNFIHCIEAIDGKHVIIQCPN 118
Query: 193 VQTSDDWCDQENNYSMLVQGIVD--HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ 250
+ + +N++S+++ I D + +RF+DI G G N + K S K + G
Sbjct: 119 -NAGSAYFNYKNSHSIVLMAICDANYIIRFVDI--GAYGRRNDGGIFKDSAIGKAFDEG- 174
Query: 251 RLN--GNVRISSEEVELREYIVGGVGYPLLSWLITPYE-TNGLSASMPTFNSLHEATRSL 307
R+N + I L +VG +PL S+L+ PY GL+ +N R +
Sbjct: 175 RMNIPQSAAIREGGPILPYCLVGDEAFPLKSFLLRPYPGRGGLTPEQNIYNYRLSRARRI 234
Query: 308 AVKAFLQLKGGWRILSK-VMWRPDKRKLPSIILVCCLLHN 346
F + WRI K ++ + KL ++ LHN
Sbjct: 235 IENTFGIITSQWRIYRKPIIASTENAKL--MVQATIYLHN 272
>gi|297608688|ref|NP_001061974.2| Os08g0459400 [Oryza sativa Japonica Group]
gi|255678502|dbj|BAF23888.2| Os08g0459400 [Oryza sativa Japonica Group]
Length = 705
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 122/279 (43%), Gaps = 40/279 (14%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+Q+A+ L + +G TVS+ ++AL AK ++ R
Sbjct: 130 VSVEEQLAMFLHTVGHNLRNRVIGFYVIRSSETVSRYFNEVLKALCCLAKDMIQI----R 185
Query: 161 MEEIKSKFEESFGLL-----NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
E SK + G +C GA+D THI +P + + +++ + V VD
Sbjct: 186 SIETHSKIVSNPGRFYPYFEDCIGALDGTHIPAFVPE-NIVNRFRGRKSYPTQNVLAAVD 244
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
++RF ++ GW G + S +LK A R + ++I + Y + GY
Sbjct: 245 FDLRFTYVLVGWEGSAHDSVVLK---------AALRRSSGIQIPEGK-----YYLADAGY 290
Query: 276 PLLSWLITPYE---------TNGLSASMPT--FNSLHEATRSLAVKAFLQLKGGWRILSK 324
++ P+ G S P FN H + R+ +AF LK +++L+
Sbjct: 291 AARPGILPPFRGVRYHLKEYEGGKSPETPQELFNLRHSSLRTSVERAFGTLKNRFKVLAS 350
Query: 325 VMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
+ P K ++ I++ CC+LHN I+D+G PD + D
Sbjct: 351 KPFFPYKVQV-KIVIACCVLHNWILDNG----PDNIIYD 384
>gi|357161564|ref|XP_003579131.1| PREDICTED: uncharacterized protein LOC100839156 [Brachypodium
distachyon]
Length = 481
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 122/302 (40%), Gaps = 61/302 (20%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS 158
R +S + +A+ L + ES V F + T+ + + L + A +K D
Sbjct: 171 REMSSMEALAMFLWTVGGPESVTQVENRFKRSKETIHRKFEHVLGCLTQLAADIIKPRDP 230
Query: 159 N------RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
R+++ S+F F C GAID THI + +PA +D + Y Q
Sbjct: 231 QFPTVHERLQD--SRFSPHFN--GCIGAIDGTHIRVVVPA----EDIANHVGRYGYPTQN 282
Query: 213 IV---DHEMRFIDIVTGWPGGMNVSRLLK-----FSGFFKLCEAGQRLNGNVRISSEEVE 264
++ D +MRFI IV GWPG + +R+ K +S F +G
Sbjct: 283 VMAVCDFDMRFISIVAGWPGSAHDTRIFKDTLITYSENFPHPPSG--------------- 327
Query: 265 LREYIVGGVGYPLLSWLITPYETNGLSASMP-----------------TFNSLHEATRSL 307
R Y+V GYP + + PY+ G +P FN H R++
Sbjct: 328 -RYYLVDS-GYPNQTGYLAPYK--GQKYHLPEFRRGIEEGRHPTGKKEIFNHAHSQLRNV 383
Query: 308 AVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
++F LK WRIL V +++ II+ C LHN I +S LH D D
Sbjct: 384 IERSFGVLKMKWRILLHVPSYAIEKQT-RIIVACMALHNWIRESA--LHDDEFYKCDQDE 440
Query: 368 GY 369
Y
Sbjct: 441 NY 442
>gi|226492180|ref|NP_001149744.1| retrotransposon protein [Zea mays]
gi|195630667|gb|ACG36640.1| retrotransposon protein [Zea mays]
Length = 401
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 130/301 (43%), Gaps = 36/301 (11%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+S+ +F C R+ GL+ + + VE+QVA+ L + VG
Sbjct: 70 LRLSRSSFFRFCKGFRD-------RGLLE-DTIHMCVEEQVAMFLNTVGHNLRNRLVGTN 121
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+G TVS+ + + A+ E ++ P +I +C GAID THI
Sbjct: 122 YGRSGETVSRYFNKVLRAIGELRAELIRPPSLETPSKIAGNPRWDPYFKDCIGAIDGTHI 181
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
TL ++ + +++ S V VD ++RF ++ GW G + + +L+
Sbjct: 182 -RTLVSIDMEPSFRGRKSYASQNVMAAVDFDLRFTYVLAGWEGSAHDALVLR-------- 232
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE----------TNGLSASMPT 296
+A +R NG +R+ + Y+V GY + P+ N +
Sbjct: 233 DALERENG-LRVPQGKF----YLVDA-GYGAKPGFLPPFRAVRCHLNEWGNNPVQNEKEL 286
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLLHN-IIIDSGDQ 354
FN H + R +AF LK ++IL + P ++ I++ CC++HN +I D GD+
Sbjct: 287 FNLRHSSLRVTVERAFGSLKRRFKILDDATPFFPFSTQV-DIVVACCIIHNWVIQDGGDE 345
Query: 355 L 355
L
Sbjct: 346 L 346
>gi|37991863|gb|AAR06309.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 767
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 53/193 (27%), Positives = 81/193 (41%), Gaps = 35/193 (18%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID TH+ +P+ + C + + V + D +MRF ++ GWPG ++ R
Sbjct: 542 DCIGAIDGTHVPCVVPSNKLVQHLC-CKGMTTQNVMVVCDFDMRFTFVLAGWPGSVHDMR 600
Query: 236 LLK-----FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
+ ++ F G+ Y + GYP + PY+
Sbjct: 601 VFDDAHTTYTHVFPHSPTGK-----------------YYLVDSGYPNRPGYLAPYKGTKY 643
Query: 291 ------SASMP-----TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
A P FN H + R++ ++F LK WR+L K+ D RK II+
Sbjct: 644 HLQEYRDAPEPQGKEENFNYAHSSLRNVIERSFGVLKMKWRMLEKIP-SYDPRKQTQIII 702
Query: 340 VCCLLHNIIIDSG 352
CC LHN I SG
Sbjct: 703 ACCALHNFIRKSG 715
>gi|328702103|ref|XP_003241802.1| PREDICTED: hypothetical protein LOC100570146 [Acyrthosiphon pisum]
Length = 358
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 128/303 (42%), Gaps = 54/303 (17%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEK-QVAIALR 112
TIP+ + F+ FR+S +F+ R L+ P +EGR S E+ Q+ +
Sbjct: 61 TIPNFTHKQFQQHFRLSPTSFE------RCLLICAPLLQKTCVEGRKGSDERLQLLSVIW 114
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF 172
LA+ +S SV F +G+S++ R L + + WP+ M+ I F
Sbjct: 115 LLATPDSFRSVSDRFDIGKSSLHDSFVRVTTVLHRLSPTIINWPEVLLMDLI-------F 167
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
L+ Q+++ + ++ +++ +Q + D ++R ID G+PG +
Sbjct: 168 QYLH-----------------QSANSYLTRKCFHAITLQAVCDDKLRIIDAFAGYPGSVG 210
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LS 291
R+ S ++ RI + + YP+L W I PY G LS
Sbjct: 211 DRRIFTNSLIYE------------RILQNKNRYQ-------AYPILEWCIPPYIDRGNLS 251
Query: 292 ASMPTFN-SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
FN SL A S+ + F LK +R L K + +P I+ CC+LHNI +
Sbjct: 252 EQEKFFNVSLSRARLSIE-RCFALLKSRFRRL-KYLDMKKVELIPQTIIACCVLHNICLQ 309
Query: 351 SGD 353
D
Sbjct: 310 FDD 312
>gi|390360974|ref|XP_794543.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 234
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 93/201 (46%), Gaps = 20/201 (9%)
Query: 153 LKWPDSNRMEEIKSKFEESF---GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSML 209
+K+P++ MEE+++ + F G+ N GA+D TH+ + + + +++ +S+
Sbjct: 11 IKFPET--MEEVRTAQVDFFNLAGMPNVVGAVDGTHVELHGAPLDDEYIYTNRKGKHSIN 68
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
VQ I + + ++ WPG + SR+L+ S G+R + EL +
Sbjct: 69 VQLICNARYKITNVCARWPGSTHDSRVLRNS------RIGERF--------ADGELPGIL 114
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
VG GYPL WLITP +A +N H TR + QLK +R L +
Sbjct: 115 VGDSGYPLQPWLITPLRDPQGNAER-NYNRAHCRTRVTIEQLNGQLKNKFRCLMGQGIQM 173
Query: 330 DKRKLPSIILVCCLLHNIIID 350
+ II+ C +L NI D
Sbjct: 174 SAPRACDIIIACAVLFNIAKD 194
>gi|322800660|gb|EFZ21605.1| hypothetical protein SINV_10073 [Solenopsis invicta]
Length = 136
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 80/150 (53%), Gaps = 20/150 (13%)
Query: 201 DQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISS 260
+++ +S+ +QG+VD M+F +I G PG ++ +R+L+ S ++ + +
Sbjct: 1 NRKKYFSITLQGVVDANMKFTNIYYGEPGSLHDARVLRRSPLYQTA-----------VHN 49
Query: 261 EEVELRE--YIVGGVGYPLLSWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKG 317
+E E +I+G Y LSWL+ P+ NG L+ FN LH +TR + +AF LKG
Sbjct: 50 KETLFPENTFILGDSAYASLSWLVPPFRDNGHLTPQQKEFNFLHSSTRMVIERAFGYLKG 109
Query: 318 GWRILSKVMWRPDKRKLPSI---ILVCCLL 344
+R ++ + + R +P I ++ C+L
Sbjct: 110 RFR---RIKFFNEYRHMPFITNTVVCACIL 136
>gi|331213751|ref|XP_003319557.1| hypothetical protein PGTG_01731 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309298547|gb|EFP75138.1| hypothetical protein PGTG_01731 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 374
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 121/306 (39%), Gaps = 35/306 (11%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
+E FK R +K F ++ + + + N L V Q+A+ L RL S +
Sbjct: 44 DEDFKQAVRTTKTGFTWLLEQIYFNAIFYS-----NSPRPQLPVPHQLALTLERLGSNGN 98
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG VG+ TV + + R I A+ + + +L WPD R +EI SK ++ G
Sbjct: 99 GASVGRFSRNLSVGRGTVIKASRRVIRAINDLSPRYLLWPDKERRDEI-SKVMKAEGFEG 157
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C G +D T I + + + D++ YS+ Q I D + +TGWPG S +
Sbjct: 158 CIGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVICDCDRFITAYMTGWPGSCGDSMV 217
Query: 237 LKFSGFFK----LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
K K + GQ Y++ Y L I Y+
Sbjct: 218 FKRMMVHKEPALFFDPGQ-----------------YLIADSAYELGLHCIPAYKAPAAYI 260
Query: 293 SMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRP-DKRKLPSIILVCCLLHNI 347
T FN +R LKG W L + + +P D ++ + C LHN+
Sbjct: 261 QENTEFNYCLARSRVRNEHTIGILKGRWASLQHLRLAIQKPSDMMEIIRWVNCCVTLHNM 320
Query: 348 IIDSGD 353
+ GD
Sbjct: 321 LAHLGD 326
>gi|357622030|gb|EHJ73650.1| putative transposase [Danaus plexippus]
Length = 298
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 65/269 (24%), Positives = 119/269 (44%), Gaps = 25/269 (9%)
Query: 98 GRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
G ++ ++K+V + L L + S VG FG+ +S+VS + + L E+ + WP
Sbjct: 21 GSIVPLDKKVHVFLWALVNDCSFKEVGQIFGLHKSSVSYIFHEVVMLLAEQRYQFINWPS 80
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ K +G NC G +DA + + S Q +N+ +L+Q + D
Sbjct: 81 LEEQHITRVKVNSKYGFPNCVGFLDACRLKV------GSKRKKRQVSNF-ILLQAVCDET 133
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+ FIDI G G ++ F+ + Q L ++ +I+G Y L
Sbjct: 134 LMFIDIHIGEIGNTVKGKV------FRETQLSQELKNF-------IDFDNHILGDCEYKL 180
Query: 278 LSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKRKLP 335
L+ P+ L ++ FN +H S AF LK ++ L+ + + RPD +
Sbjct: 181 RKNLLIPFSREELVSNEEIKFNEIHWKAHSYIGSAFGLLKERFQKLNHIDINRPD--AVQ 238
Query: 336 SIILVCCLLHN-IIIDSGDQLHPDVALSD 363
++I C+LHN +++ G + + ++D
Sbjct: 239 ALICAACVLHNFVLLHEGTPVKEEAVIND 267
>gi|108708757|gb|ABF96552.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 798
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 53/193 (27%), Positives = 81/193 (41%), Gaps = 35/193 (18%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID TH+ +P+ + C + + V + D +MRF ++ GWPG ++ R
Sbjct: 573 DCIGAIDGTHVPCVVPSNKLVQHLC-CKGMTTQNVMVVCDFDMRFTFVLAGWPGSVHDMR 631
Query: 236 LLK-----FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
+ ++ F G+ Y + GYP + PY+
Sbjct: 632 VFDDAHTTYTHVFPHSPTGK-----------------YYLVDSGYPNRPGYLAPYKGTKY 674
Query: 291 ------SASMP-----TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
A P FN H + R++ ++F LK WR+L K+ D RK II+
Sbjct: 675 HLQEYRDAPEPQGKEENFNYAHSSLRNVIERSFGVLKMKWRMLEKIP-SYDPRKQTQIII 733
Query: 340 VCCLLHNIIIDSG 352
CC LHN I SG
Sbjct: 734 ACCALHNFIRKSG 746
>gi|390364877|ref|XP_794452.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 291
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 94/202 (46%), Gaps = 21/202 (10%)
Query: 153 LKWPDSNRMEEIKSKFEESF---GLLNCCGAIDATHI-IMTLPAVQTSDDWCDQENNYSM 208
+K+P++ MEE+++ + F G+ N GA+D TH+ + P + + +++ +S+
Sbjct: 67 IKFPET--MEEVRTAQVDFFNLAGMPNVVGAVDGTHVELHGAPLLDDEYIYTNRKGKHSI 124
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREY 268
VQ I + + ++ WPG + SR+L+ S G+R + EL
Sbjct: 125 NVQLICNARYKITNVCARWPGSTHDSRVLRNS------RIGERF--------ADGELPGI 170
Query: 269 IVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR 328
+VG GYPL WLITP +A +N H TR + QLK +R L +
Sbjct: 171 LVGDSGYPLQPWLITPLRDPQGNAER-NYNRAHCRTRVTIEQLNGQLKNKFRCLMGQGIQ 229
Query: 329 PDKRKLPSIILVCCLLHNIIID 350
+ II+ C +L NI D
Sbjct: 230 MSAPRACDIIIACAVLFNIAKD 251
>gi|449675115|ref|XP_004208330.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 141
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 74/135 (54%), Gaps = 3/135 (2%)
Query: 106 QVAIALRRLASGESQVSVG-VAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEE 163
Q+ ALR A+G Q+ VG + Q T+S + R +L +R ++K+P + + + E
Sbjct: 6 QLLCALRFYATGSFQIVVGDSTAALSQPTISGIIRRVSLSLAKRINEYIKYPTNQHVLNE 65
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ KF + G I+ THI + P + + D+ +N+S+ VQ + D++ +FID+
Sbjct: 66 SRVKFYKIAEFPKVTGVINCTHICIQKPH-EHEYAYVDRSSNHSINVQAVCDNKGKFIDV 124
Query: 224 VTGWPGGMNVSRLLK 238
V WPG + +R+L+
Sbjct: 125 VAKWPGSTHDARILR 139
>gi|193625027|ref|XP_001951483.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 224
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 95/192 (49%), Gaps = 11/192 (5%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGL-INIEGR-LLSVEKQVAIALRRLA 115
D E F +FR+S K+FD +L+S+ SG+ ++ GR +S +++ + LR LA
Sbjct: 7 DNENKFFNYFRMSIKSFD--------ELLSKLESGIKVSNSGRPSISPTERLCVTLRYLA 58
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
SG + + ++ +G ST+S + E + + + P + EI FE++
Sbjct: 59 SGNTFTDLQYSYRMGISTISGIVEDVCEQIWKMKSECIPLPTEEKWREISLDFEKNTNFP 118
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GA+D HI + P +++ + + ++ YS+++ I D F + G G + S
Sbjct: 119 NCIGALDGKHIRVIKP-IKSGSLFYNYKHYYSIVLMAICDANYCFTFVDVGAYGKFSDSS 177
Query: 236 LLKFSGFFKLCE 247
+ K FF+ E
Sbjct: 178 VFKNGKFFEKLE 189
>gi|116317822|emb|CAH65857.1| OSIGBa0126J24.2 [Oryza sativa Indica Group]
Length = 836
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 66/234 (28%), Positives = 104/234 (44%), Gaps = 49/234 (20%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + ++ N V + D +MRF ++ GWPG
Sbjct: 607 LDNCIGAIDGTHIQVVVPNSAAVQHRNRHQEKSQN----VMCVCDFDMRFTFVLAGWPGS 662
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSE--EVELREYIVGGVGYPLLSWLITPYETN 288
++ R+L + R S++ + L ++ + GYP + PY+
Sbjct: 663 VHDMRVLN--------------DAQTRFSAKFPKPPLGKFYLVDSGYPNRPGYLAPYK-- 706
Query: 289 GLS--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
G++ +++P FN H + R++ ++F LK WRIL + + K
Sbjct: 707 GITYQFQEYNESTLPRGRREHFNYCHSSCRNVIERSFGVLKNKWRILFS-LPSYSQEKQS 765
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
II C LHN I DS Q+ + HD Y DP+G T+ + E
Sbjct: 766 RIIHACIALHNFIRDS--QMADTEFNNCDHDENY--------DPLGGTSSPSSE 809
>gi|432951335|ref|XP_004084780.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 254
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 101/240 (42%), Gaps = 42/240 (17%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S ++ YI +L++ + + I+ GR L+ E+ + ALR A+G ++G A
Sbjct: 48 YRFSLQSITYIHNLIQPYITN------ISHRGRALTSEQILCAALRFFANGSFLYNIGDA 101
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ ++TV + + AL+ + + +P + IK +F GL N G ID THI
Sbjct: 102 EHISKATVCRSVRKVCFALKRFLRIFIVFPGHKPLRAIKEEFHRIAGLPNVVGCIDGTHI 161
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P + D+ ++++ +S+ VQ I D ++ WPG
Sbjct: 162 PIITPT-ENEADYVNRKSIHSINVQIICDAAHFITNVEAKWPG----------------- 203
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
E+ +++G GYP L+TPY FN H TR+
Sbjct: 204 -----------------EIDGFLLGDRGYPCQPKLLTPYPEPE-QGPQQRFNLAHSRTRA 245
>gi|328712060|ref|XP_003244719.1| PREDICTED: hypothetical protein LOC100574275 [Acyrthosiphon pisum]
Length = 435
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 65/269 (24%), Positives = 111/269 (41%), Gaps = 23/269 (8%)
Query: 133 TVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPA 192
T S+ W ++ L ++ P + +++E + L NC G+ID HI + PA
Sbjct: 21 TTSESLWSVLQPL------YMPVPTKESWTNVAERYQELWNLPNCVGSIDGKHIRIKAPA 74
Query: 193 VQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRL 252
+ + + + +S+++ D + +FI I G G + R+L K C GQ+L
Sbjct: 75 -NSGSAFYNYKGFFSIVLMATADADGKFITIDVGEYGRNSDGRVL------KECAFGQQL 127
Query: 253 NGN-------VRISSEEVE--LREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEA 303
N + EE E Y VG +PL++ L+ PY L+ + FN
Sbjct: 128 LKNKLDLPEPSTLPGEENEPPYAYYFVGDEAFPLMNNLLRPYPRRQLTNAKRIFNYRLSR 187
Query: 304 TRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
R AF + +RIL + R + K+ +++ C+LHN I L +
Sbjct: 188 GRKSIECAFGMMTSKFRILESPICRKIE-KVDTLVKAICVLHNFIRTHDGIFSTSTDLQE 246
Query: 364 HHDSGYGEQCCKQVDPMGRTTRENLEKHL 392
+ GE + RT N+ +L
Sbjct: 247 YSTGYSGETQFNSQENRSRTRPSNVAINL 275
>gi|413919714|gb|AFW59646.1| hypothetical protein ZEAMMB73_124288 [Zea mays]
Length = 441
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 110/266 (41%), Gaps = 45/266 (16%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVS-QVTWRFIEALEERAKHHLKWPDSNRME 162
+ ++ AL RLA +V FG+ V+ + + A+ ER L +R+
Sbjct: 161 DHKLGAALFRLAHAAPARAVARRFGLPSPAVAARAFYEVCRAIAERLAVLLDLVAPDRIA 220
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
F + L NCCGA+ + Q +VD + RF+D
Sbjct: 221 RAVPGFC-ALSLPNCCGALGYARF------------------GEAATAQALVDADGRFLD 261
Query: 223 IVTGWPGGMNVSRLLKFSGFFK-----LCEA--GQRLNGNVRISSEEVELREYIVGGVGY 275
+ GW M + +L + + L A G+ + G+V Y +G
Sbjct: 262 VSVGWDPEMAPAEILPRTKLYTSQSLVLANAPQGELIGGSVP---------RYFLGPACC 312
Query: 276 PLLSWLITPYE--TNGLSAS-----MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR 328
PLL WL+TPY + ++A FN +H + + AF ++ WR+L + W+
Sbjct: 313 PLLPWLVTPYRHVVDAIAADDDLSKESIFNHVHAHGQQVVKNAFGHVRARWRLLEEC-WK 371
Query: 329 PD-KRKLPSIILVCCLLHNIIIDSGD 353
+ + LP +++ CLLHN ++ G+
Sbjct: 372 GECQEALPYVVVAGCLLHNFLLQCGE 397
>gi|359484728|ref|XP_003633150.1| PREDICTED: uncharacterized protein LOC100854497 [Vitis vinifera]
Length = 427
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 141/300 (47%), Gaps = 37/300 (12%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
+A AL RLA G S SVG FG+ + + + + + ++ H M E +S
Sbjct: 145 LAAALFRLAHGASYRSVGRRFGLDSAGACRAFYVVCKVVNDKLGH---------MFEFRS 195
Query: 167 ---KFEESFG---LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
+ FG L NCCG ++ + ++ +++VQ +VD E RF
Sbjct: 196 DIGRIVVGFGWISLPNCCG-------VLGFGKFGVDGELLGKDG--ALMVQALVDSEGRF 246
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFK-LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
+D+ GW + +L S F + E+ + LNG ++ + +YI+G PLL
Sbjct: 247 LDVSAGWSSKLKPDTILHQSALFSGVDESRELLNGPPFELTDGNLIPQYILGDSCLPLLP 306
Query: 280 WLITPY----ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKL- 334
WL+TPY E + LS+S FN +H L AF +++ W++L+K +W+ + +
Sbjct: 307 WLLTPYVRSSEEDSLSSSEQAFNFVHSRGMGLVSTAFCRVRARWQLLAK-LWKEECIEFF 365
Query: 335 PSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQ--VDPMGRTTRENLEKHL 392
P +I+ CLLHN +I + L PD + S GE Q ++ G+ R+ L HL
Sbjct: 366 PFVIVTGCLLHNFLIKCSEPL-PD---ENAGRSKEGELPIYQGKMNESGQRIRDALASHL 421
>gi|357139565|ref|XP_003571351.1| PREDICTED: uncharacterized protein LOC100834412 [Brachypodium
distachyon]
Length = 329
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/227 (29%), Positives = 97/227 (42%), Gaps = 34/227 (14%)
Query: 142 IEALEERAKHHLKWPDSNRMEEIKSKFEESF--GLLNCCGAIDATHIIMTLPAVQTSDDW 199
+ L + +KH++K D E EE F GAID THI + +P +T +
Sbjct: 54 LRCLCKLSKHNIKPRDPTFSVEHDKVREERFWPHFRGAIGAIDGTHIPVIVPTDETVNYT 113
Query: 200 CDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRIS 259
C + S V + D +MRFI V GWPG ++ +R+L + N
Sbjct: 114 C-RHGYTSQNVLAMCDFDMRFIFGVDGWPGSVHDTRVLNHA------------LANFPCF 160
Query: 260 SEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP---------------TFNSLHEAT 304
+ + Y+V GYP + + P + G + +P FN LH +
Sbjct: 161 PVPPKDKYYLVDS-GYPNRTGYLAPIK--GSTYHLPEFRLPRGRVLQGRDEIFNFLHSSL 217
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
R++ ++F LK WRIL K M R IIL C LHN I DS
Sbjct: 218 RNVIERSFAVLKQKWRIL-KSMPSFKPRAQTRIILACIALHNFIRDS 263
>gi|357128444|ref|XP_003565883.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 325
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 138/316 (43%), Gaps = 50/316 (15%)
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM 161
+VE+Q+A+ L + + + + F TV++ + + A+ + L+ +R
Sbjct: 26 TVEEQIAMFLTTVGHHKKNIDISFHFTRSDETVNRYFNQVLFAIGQLGPEMLR----HRT 81
Query: 162 EEIKSK------FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
+I SK F+ F +C GAID TH+ +P+ + D + ++ + V +VD
Sbjct: 82 FDIPSKIQGNPRFDPYFK--DCIGAIDGTHVPCNVPS-RMVDRFRGRKPFPTQNVLAVVD 138
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
++ F + GW G + S +L+ S + NG +R+ + Y + GY
Sbjct: 139 FDLIFTYVSAGWEGSAHDSTVLRHSL--------EHPNG-LRVPEGK-----YYLADTGY 184
Query: 276 PLLSWLITPY--------ETNGLSASMPT--FNSLHEATRSLAVKAFLQLKGGWRILSKV 325
+ P+ E G A P FN H + R+ +AF LK +++L+
Sbjct: 185 AARRGFVPPFRQTRYHLREWRGNRARTPNELFNLRHSSLRTTVERAFGTLKNRFKVLTTR 244
Query: 326 MWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDP--MGRT 383
+ P ++ +++ CC+LHN I+ +G PD + Y EQ P GR
Sbjct: 245 PYYPFPSQV-RLVIACCILHNCILLNGG---PDELV-------YSEQNWFDHYPRSAGRV 293
Query: 384 TRENLEKHLQHNQEKD 399
R+ E+ Q QE+D
Sbjct: 294 QRDLREEQRQVLQERD 309
>gi|328700543|ref|XP_003241296.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 377
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 111/258 (43%), Gaps = 23/258 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAF--GVGQSTVSQVTWRFIEALEERA-KHHLKWPD 157
+ VE QV +A+ A G Q + G F V Q +VS+ +A+ + +++P
Sbjct: 73 IPVEIQVLVAVYFYAKGSYQRATGDHFDLNVSQPSVSRCLHAVTDAINNNLLRQWVRFPM 132
Query: 158 SNRMEEIKSKFEESFG---LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV 214
+N +E ++ E GAID T+I + P + + + N+S+ VQ +V
Sbjct: 133 TN-LERNNAREEFCNAPQPFEGAIGAIDCTYINILAPK-DHEEAFVNHHGNHSLNVQAVV 190
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
+++ ++I +PG N S + S + E + E +++G G
Sbjct: 191 SPKLKILNINARYPGARNDSYIWSTSPIRRAME----------FHYNKGERHTWLIGDSG 240
Query: 275 YPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK---VMWRPDK 331
YPL WL+TP + H R++ + F K WR LS +M+ PD
Sbjct: 241 YPLEPWLMTPLPHYREGTRQFKYTMKHCKARNVVERFFGVFKSVWRCLSYQRVLMYAPDM 300
Query: 332 RKLPSIILVCCLLHNIII 349
I+ C +LHN+ I
Sbjct: 301 AG--KIVNACAVLHNMRI 316
>gi|302780717|ref|XP_002972133.1| hypothetical protein SELMODRAFT_15917 [Selaginella moellendorffii]
gi|300160432|gb|EFJ27050.1| hypothetical protein SELMODRAFT_15917 [Selaginella moellendorffii]
Length = 67
Score = 70.5 bits (171), Expect = 1e-09, Method: Composition-based stats.
Identities = 30/66 (45%), Positives = 43/66 (65%)
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+K+ FE G NCCGAID TH + LP + D+ +++ +YS+++Q IVD E RF+DI
Sbjct: 1 VKTDFESLHGFPNCCGAIDCTHFEVQLPGNAFASDYYNKDKDYSIVMQAIVDSEARFLDI 60
Query: 224 VTGWPG 229
G PG
Sbjct: 61 QVGVPG 66
>gi|328702684|ref|XP_003241982.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 246
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 96/221 (43%), Gaps = 23/221 (10%)
Query: 145 LEERAKHHLKWPDSNRMEEIKSKFEE--SFGLLNCCGAIDATHIIMTLP-----AVQTSD 197
+E +KWP N + + F+ A+D HI + P ++ D
Sbjct: 1 MEVHKHKFIKWPRGNAIHQTLQDFKHLRPKAFPGAFMALDGCHIKIPTPWNKRTRMRHVD 60
Query: 198 DWC--DQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN 255
C ++++ S+++QGI D +F +I GWPG + +R+ + S G LNG
Sbjct: 61 QRCYINRKHVASVVLQGICDSNQKFTNIFAGWPGASHDARIFRKSNI------GVALNGP 114
Query: 256 VRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQ 314
++ V YI+ YPL ++ PY NG L+ FN + ++R + F +
Sbjct: 115 LQ---NLVPENCYILADSAYPLSDNVMVPYHDNGALTPEKHNFNRILNSSRVVIKLTFGK 171
Query: 315 LKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
L +R L +K + I C LHNI +D DQ+
Sbjct: 172 LLERFRYLEVY----NKDFCGAFISAACCLHNICMDQNDQI 208
>gi|331227606|ref|XP_003326471.1| hypothetical protein PGTG_07449 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 409
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/317 (22%), Positives = 133/317 (41%), Gaps = 35/317 (11%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
FK FR+S+++F C L+ E + P + V +Q+ +AL+R+ + + S
Sbjct: 84 FKQEFRMSQQSF---CRLILE--IQDHPVFHNHSNVPQQPVPEQLMVALKRMGTHGNGAS 138
Query: 123 VGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCG 179
VG+ F + + TV R IEA+ ++ WP + +EI + G NC G
Sbjct: 139 VGMLARFFRISEGTVILYCSRVIEAILSLESTYVSWPSTEGRQEIAWQISSHTGFRNCVG 198
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
ID T ++ D+ ++ Y + + + E R + VTGWPG + +RL +
Sbjct: 199 FIDGTLFPLSEKPSIDPQDYYSRKGQYGLAALIVCNEEKRIMYYVTGWPGCCHDTRLWE- 257
Query: 240 SGFFKLCEAGQRLNGNVRISSEEV-ELREYIVGGVGYPLLSWLITPYET---NGLSASMP 295
N +++ +++ +Y++ G+P + ++ ++ + +
Sbjct: 258 -------------NTELKLQKDQLFSPGQYLIADSGFPPETNIVPAFKRPPHGAMPRARK 304
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD------KRKLPSIILVCCLLHNIII 349
FN + R LKG ++ L + R D K+ I C +LHN ++
Sbjct: 305 QFNQHLSSLRVCNEHCIGLLKGRFQSLRGL--RKDLNSAGTMEKITHWISACVILHNFLL 362
Query: 350 -DSGDQLHPDVALSDHH 365
D + DV D H
Sbjct: 363 SDQSPDVFTDVDDIDIH 379
>gi|77551559|gb|ABA94356.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 916
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 103/237 (43%), Gaps = 55/237 (23%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + + ++ N V + D +MRF ++ GWPG
Sbjct: 687 LDNCIGAIDGTHIQVVVPNSAAVQHRNRYKEKSQN----VMCVCDFDMRFTFVLAGWPGS 742
Query: 231 MNVSRLL-----KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
++ R+ +FS F G + Y+V GYP + PY
Sbjct: 743 VHDMRVFNDAQTRFSAKFPKPPPG----------------KFYLVDS-GYPNRPGYLAPY 785
Query: 286 ETNGLS--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
+ G++ +++P FN H + R++ ++F LK WRIL + +
Sbjct: 786 K--GITYHFQEYNESTLPRGRREHFNYCHSSCRNVIERSFGVLKNKWRILFSLP-SYSQE 842
Query: 333 KLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
K II C LHN I DS Q+ + HD Y DP+G T+ + E
Sbjct: 843 KQSRIIHACIALHNFIRDS--QMADTEFDNCDHDENY--------DPLGGTSSPSSE 889
>gi|345484037|ref|XP_003424931.1| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 271
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 121/282 (42%), Gaps = 20/282 (7%)
Query: 76 YICSLVREDLVSR---PPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQS 132
Y+ LVRE LV R PP E + ++ L LA G+S + +G S
Sbjct: 2 YVYELVRERLVKRSRRPP----------FPPELRFSLTLNYLAHGDSIRKNEWFYNIGLS 51
Query: 133 TVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
TV QV L E L +P + + I ++F E NC GA+D H + P
Sbjct: 52 TVKQVIPEVCTVLCEVLMPLFLSFPSRQQFQVIANEFMEDLHFPNCIGALDGKHCRIRKP 111
Query: 192 AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ- 250
+ + + +N +S+++ D + RFI G G N + + S F + Q
Sbjct: 112 G-GSGSLFFNFKNFHSIVLMACCDSKKRFIWANIGDYGSCNDASVFAESDFGNVLLNNQI 170
Query: 251 RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASMPTFNSLHEATRSLAV 309
+L + + + ++ ++G G+PL +L+TP+ N ++ FN R +
Sbjct: 171 QLPPSQPLPNTHIQSPYVLIGDGGFPLKDYLMTPFLRNENITIPHRVFNYRLSHARRIIE 230
Query: 310 KAFLQLKGGWRI-LSKVMWRPDKRKLPSIILVCCLLHNIIID 350
AF ++ W + S + W+ + II+ LHN+I D
Sbjct: 231 SAFGEVTERWLVNESSLKWKLATSE--RIIISSLCLHNVIKD 270
>gi|449666953|ref|XP_004206453.1| PREDICTED: uncharacterized protein LOC101234489 [Hydra
magnipapillata]
Length = 296
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/233 (27%), Positives = 104/233 (44%), Gaps = 23/233 (9%)
Query: 63 FKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
FK FRV++ TF +I + L + R P +SV K+VA+AL LAS
Sbjct: 74 FKEHFRVNRNTFSFIVNELHPHLCKTTTTMREP----------ISVVKRVAVALHYLASH 123
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEESFGLL 175
E V FG+G+ST + + + FI A+ + ++K+P S + + F+ G
Sbjct: 124 EEYRVVSSLFGIGKSTTNLIVYEFINAVNDILLPKYVKFPLSEENLNKCSRDFKTILGFT 183
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C D HI ++ P Q + + + + YS+++ +V+ RFI G P N S
Sbjct: 184 QCA---DGCHIPISAPKDQ-AISYYNYKGWYSIVLFAVVECRYRFIYTSVGLPSKNNDSY 239
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
+L+ S + E+ + S V L ++G PL L+ Y N
Sbjct: 240 ILQNSFLKAILESNLFDKCYKELGSSLVPL--CLMGDSAVPLTRHLLKTYPKN 290
>gi|322785758|gb|EFZ12385.1| hypothetical protein SINV_02385 [Solenopsis invicta]
Length = 301
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 104/251 (41%), Gaps = 29/251 (11%)
Query: 114 LASGESQVSVGVAFGVGQSTVSQV-------TWRFIEALEERAKHHLKWPDSNRMEEIKS 166
LA+G+ S+ A +G+ST +V T R + ++ +LK P + I
Sbjct: 2 LATGDQVSSIAFAHRIGESTAYKVIKETCVVTVRILSSI------YLKPPKKEDWKNIAI 55
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
F + NC GAID H ++ P+ + + + + N+S+++ D++ +F + G
Sbjct: 56 GFWNHWNFPNCLGAIDGKHFLIKAPS-NSGTLYFNYKKNFSIVLLAACDYQYKFTIVDCG 114
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNG--------NVRISSEEVELREYIVGGVGYPLL 278
G + G F E G+ LN N ++ +VE+ Y V +PL
Sbjct: 115 AYGSSSDG------GIFAQSEFGKCLNSDNLDIPVENCKLPLTDVEMPYYFVADEAFPLS 168
Query: 279 SWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSII 338
++ PY L+ FN R + F L WR+ + + D R II
Sbjct: 169 KRIMRPYPGQFLTDKKSIFNYRLSRARRIIENTFGILVSRWRLFQRCICL-DPRHADVII 227
Query: 339 LVCCLLHNIII 349
+ LHN ++
Sbjct: 228 MAAINLHNYLM 238
>gi|328724776|ref|XP_003248249.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 275
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 115/249 (46%), Gaps = 23/249 (9%)
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEE-SFGLL-NCCGA 180
+G F + + + +FI + + + WP + I F+ F + + G
Sbjct: 5 IGDRFDLSRGLACRTILKFIRVFSKLLDNFVLWPKGSSDAAIIKDFKSLRFNYMPSTIGC 64
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID HI + P + SD + +++ S+++ + + + F + +GWPG + +R+ K S
Sbjct: 65 IDGCHIRIHAPRDKRSD-YTNRKMFQSIVLLAVCNAKFEFTYMFSGWPGSSHDARVFKNS 123
Query: 241 GFFKLCEAGQRLNGNVRISS--EEVELREYIVGGVGYPLLSWLITPYE-TNGLSASMPTF 297
G+ I+S E + ++I+G +PLL L+ PY+ T+ L+ +F
Sbjct: 124 SL-----------GHTLINSPQEIISKNQHILGDSAFPLLENLMVPYKATHILTEKEKSF 172
Query: 298 NSLHEATRSLAVKAF-LQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLH 356
N +TR + +AF L L WR+ K++ + + C+LHN+ + + D +
Sbjct: 173 NRRLSSTRVVIEQAFGLLLGRFWRL--KILEAKSIELMSLTVTSACILHNLALQNNDFIE 230
Query: 357 PDVALSDHH 365
D +DH+
Sbjct: 231 ID---NDHN 236
>gi|77556298|gb|ABA99094.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 869
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 103/234 (44%), Gaps = 49/234 (20%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + ++ N V + D +MRF ++ GWPG
Sbjct: 646 LDNCIGAIDGTHIQVVVPNSAAVQHRNRHEEKSQN----VMCVCDFDMRFTFVLAGWPGS 701
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSE--EVELREYIVGGVGYPLLSWLITPYETN 288
++ R+ + R S++ + L ++ + GYP + PY+
Sbjct: 702 VHDMRVFN--------------DAQTRFSAKFPKPPLGKFYLVDSGYPNRPGYLAPYK-- 745
Query: 289 GLS--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
G++ +++P FN H + R++ ++F LK WRIL + + K
Sbjct: 746 GITYHFQEYNESTLPRGRREHFNYCHSSCRNVIERSFGVLKNKWRILFSLPSY-SQEKQS 804
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
II C LHN I DS Q+ + HD Y DP+G T+ + E
Sbjct: 805 RIIHACIALHNFIRDS--QMADTEFDNCDHDENY--------DPLGGTSSPSSE 848
>gi|331211775|ref|XP_003307157.1| hypothetical protein PGTG_00107 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309297560|gb|EFP74151.1| hypothetical protein PGTG_00107 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 427
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/313 (23%), Positives = 128/313 (40%), Gaps = 36/313 (11%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
E+ FK FR+S+ +F +C V +D P N +E+Q+ + L+RL
Sbjct: 94 EDRRFKQEFRMSRDSFIMLCKRVADD-----PVFHNNSNNPQRPIEEQMMVTLKRLGCFG 148
Query: 119 SQVSVGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ SVG+ F VG+ TV T R I A+ LKWP + ++ E G
Sbjct: 149 NGSSVGMLARFFRVGEGTVELYTNRCIMAILRIQSQLLKWPSAEERVARAIEYGED-GFD 207
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G ID + I ++ + D+ ++ Y + + D + + TGWPG
Sbjct: 208 GCIGVIDGSLIPLSDCPSRHGSDYYSRKGFYCISTLIVCDSQRNIQYMYTGWPG------ 261
Query: 236 LLKFSGFFKLCEAGQRLNGNVRIS---SEEVELREYIVGGVGYPLLSWLITPYET---NG 289
C R+ GN I+ + + E+++ Y ++ ++
Sbjct: 262 ----------CSHDSRVMGNSPITLNPDKYFKSGEFLLADSAYTTTMNVVAAFKKPSHGS 311
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR----PDKRKLPSIILVCCLLH 345
L+ +FN R + + LKG ++ L + R D+ + + I+ C +LH
Sbjct: 312 LTEDEHSFNYYLAQKRVVIEQCIGGLKGQFQSLKGLRLRIGGKRDRVRANAWIIACGVLH 371
Query: 346 NIIIDSGDQLHPD 358
N ++ GD+ D
Sbjct: 372 N-FLNQGDEYDFD 383
>gi|38569143|emb|CAE05672.3| OSJNBb0033P05.11 [Oryza sativa Japonica Group]
Length = 877
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 103/234 (44%), Gaps = 49/234 (20%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + ++ N V + D +MRF ++ GWPG
Sbjct: 648 LDNCIGAIDGTHIQVVVPNSAAVQHRNRHQEKSQN----VMCVCDFDMRFTFVLAGWPGS 703
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSE--EVELREYIVGGVGYPLLSWLITPYETN 288
++ R+ + R S++ + L ++ + GYP + PY+
Sbjct: 704 VHDMRVFN--------------DAQTRFSAKFPKPPLGKFYLVDSGYPNRPGYLAPYK-- 747
Query: 289 GLS--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
G++ +++P FN H + R++ ++F LK WRIL + + K
Sbjct: 748 GITYHFHEYNESTLPRGRREHFNYCHSSCRNVIERSFGVLKNKWRILFSLP-SYSQEKQS 806
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
II C LHN I DS Q+ + HD Y DP+G T+ + E
Sbjct: 807 RIIHACIALHNFIRDS--QMADTEFDNCDHDENY--------DPLGGTSSPSSE 850
>gi|331247914|ref|XP_003336583.1| hypothetical protein PGTG_17894 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309315573|gb|EFP92164.1| hypothetical protein PGTG_17894 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 415
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 125/329 (37%), Gaps = 62/329 (18%)
Query: 55 IPSDEEEGFKYFFRVSKKTFDYICSLVREDLV-----SRPPSGLINIEGRLLSVEKQVAI 109
IP DE FK R SK+ F + + D + RP L + Q+A+
Sbjct: 101 IPDDE---FKQTVRTSKEGFRLLLDEISSDPIFQSTGPRPQ----------LPIAHQLAL 147
Query: 110 ALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
L RL S S SV VG+ TV +VT R I A+ K +++WP ++R EI S
Sbjct: 148 TLERLGSAGSSASVARFSRDLNVGRGTVIKVTRRVIRAITNLGKKYVQWPSADRRLEI-S 206
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
+ G C G + T T P Q D SM Q I D + I +T
Sbjct: 207 DVMTNEGFEGCVGFVGGT----TFPVYQRKRD--------SMKAQVICDCDKNIIAFITV 254
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRL--NGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
WP S +LK +LC+ + G ++S +L + ++ P
Sbjct: 255 WPDSCADSSILK---RMELCKNPNKFFSKGQYLLASSTYQLSQTVIPAYKAPAAKVT--- 308
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAFLQLK--------GGWRILSKVMWRPDKRKLPS 336
E + + + + +E T + F LK G + V W
Sbjct: 309 -ENSRFNDCVTQARARNEETIGMLKNRFCSLKEIRLQHTNGNRDMTHYVQW--------- 358
Query: 337 IILVCCLLHNIIIDSGDQLHPDVALSDHH 365
+ C +LHN++ +GD D +D H
Sbjct: 359 -VYTCVILHNMLAKTGDSWE-DTMQTDPH 385
>gi|401888633|gb|EJT52586.1| hypothetical protein A1Q1_03388 [Trichosporon asahii var. asahii
CBS 2479]
Length = 419
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/263 (23%), Positives = 109/263 (41%), Gaps = 24/263 (9%)
Query: 103 VEKQVAIALRRLASGESQVSVGVAFG-VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM 161
V Q+A+ L RL G + V G +G+ +V+ R I A+ + +KWPD R
Sbjct: 114 VSFQIAVFLYRLGRGATISDVCRTMGRIGRGSVTTYCMRTIVAILTTFNNVIKWPDPRRR 173
Query: 162 EEIKSKFEESFGLLNCCGAIDATHI-IMTLP--AVQTSDDWCDQENNYSMLVQGIVDHEM 218
E I ++ +G+ C G ID +HI + P +++ + + +++ Y +L+ + D
Sbjct: 174 EAISTRLRRDYGIPGCVGFIDGSHIPLHKCPSFSIEKNASFFSRKHRYGLLILAVCDEAK 233
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
RF + TG + R QR + R E ++++G G+
Sbjct: 234 RFTYLQTGHYASASDFR-------------AQRSSALHRRPRELFSRDQFVLGDSGFYCS 280
Query: 279 SWLITPYETNG---LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV--MWRPDK-- 331
+I Y L+ FN R AF LK W +L+ + + DK
Sbjct: 281 LNVIPMYRRRAAQDLTREQQKFNDRVAKARVKIEDAFGVLKQRWLMLNDINLTMKTDKDI 340
Query: 332 RKLPSIILVCCLLHNIIIDSGDQ 354
+ I +LHN+ I + +Q
Sbjct: 341 NTAFAYIRTAVVLHNLYIHTANQ 363
>gi|115467204|ref|NP_001057201.1| Os06g0226000 [Oryza sativa Japonica Group]
gi|113595241|dbj|BAF19115.1| Os06g0226000, partial [Oryza sativa Japonica Group]
Length = 225
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 88/198 (44%), Gaps = 31/198 (15%)
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C GA+D THI +P + D + ++ S V VD ++RF+ ++ GW G + S +
Sbjct: 2 CIGALDGTHIPACVP-MHMQDRFRGRKKFQSQNVLAAVDFDLRFLYVLAGWEGSAHDSYV 60
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL------ 290
L+ +A R NG ++ +Y + GY ++ PY
Sbjct: 61 LQ--------DALSRTNG------LKIPEGKYFLADAGYAARPGVLPPYRGTRYHLKEYK 106
Query: 291 SASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLH 345
A P FN H + R+ +AF LK +RI + P K ++ I++ CC LH
Sbjct: 107 GAREPENPKELFNLRHSSLRTTIERAFGALKNRFRIFKCQPFFPLKTQV-KIVMACCALH 165
Query: 346 NIIIDSGDQLHPDVALSD 363
N II+ G PDV + D
Sbjct: 166 NWIIEDG----PDVYVYD 179
>gi|331227083|ref|XP_003326210.1| hypothetical protein PGTG_08040 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309305200|gb|EFP81791.1| hypothetical protein PGTG_08040 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 82/181 (45%), Gaps = 9/181 (4%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
E FK FFR++ F +CS V + P + N V +Q+ +AL RL +
Sbjct: 100 ENRFKQFFRMTWALFLKLCSQVASN-----PIFVNNSNHPQRPVIEQMMVALNRLGCFGN 154
Query: 120 QVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
VSVG+ + +G TV T R I A+ L+WP + EE K F + G
Sbjct: 155 GVSVGMIATCYRIGDGTVEVYTNRCIMAILSLKSTLLEWPTAAAREETKMHFGD-VGFEG 213
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C G ID + ++++ + D+ ++ Y + + D I + TGWPG + RL
Sbjct: 214 CVGLIDGSLVVLSTCPEKDGPDYYSRKGFYCIATLLVCDQHKNIIYVFTGWPGCSHDMRL 273
Query: 237 L 237
+
Sbjct: 274 M 274
>gi|357119235|ref|XP_003561351.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 267
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 102/223 (45%), Gaps = 32/223 (14%)
Query: 149 AKHHLKWPDSN---RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENN 205
A H+L+ D N + I+ L +C A+D THI +LPA Q + +
Sbjct: 2 ATHYLRPKDPNFRVGHKRIRGDKRAYPHLKDCIDALDGTHIRASLPADQQVR-YIGRSGT 60
Query: 206 YSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVEL 265
S V I D +MRFI G PG M+ + +L + +R++ + I +
Sbjct: 61 ASQNVLAICDFDMRFIYASIGQPGAMHDTSVLYHA---------KRVDHD--IFPHPPKG 109
Query: 266 REYIVGGVGYPLLSWLITPYE---------TNGLSASMPT--FNSLHEATRSLAVKAFLQ 314
+ Y+V GYP + PY+ G S P FN +H A R++ ++F
Sbjct: 110 KYYLVDA-GYPNRPGYLAPYKGERYHVPDFHRGAEPSTPKEKFNRVHSAVRNVIERSFGV 168
Query: 315 LKGGWRILSKV-MWRPDKRKLPSIILVCCLLHNIIID--SGDQ 354
LK WRIL K+ + DK+KL I+ +LHN + + SGD+
Sbjct: 169 LKMKWRILLKMPNYSMDKQKL--IVAASMVLHNYVREHQSGDR 209
>gi|331246642|ref|XP_003335953.1| terpenoid cylase/protein prenyltransferase alpha-alpha toroid
[Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 429
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 129/295 (43%), Gaps = 17/295 (5%)
Query: 63 FKYFFRVSKKTFDYICSLVRED---LVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGE 118
FK FR+ ++ F I + V V +P + G+L LS ++ A R+LA G
Sbjct: 78 FKRRFRMRRELFLKIVADVEAACPYFVQKP-----DCTGKLGLSSLQKATSAFRQLAYGC 132
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
+ + +G+ST + F E + + ++ +L+ P S+ + I S G C
Sbjct: 133 AMDATDEYCRIGESTARKGLKYFCETINKLYSEKYLQKPTSDNLTRILD-VNASRGFPGC 191
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G++D H + + +E +++++ + D + G G +N +L
Sbjct: 192 IGSLDCMHWAWKNCPRAWAGQYKGKEKGPTVVLEAVADQSLWIWHAFFGTAGALNDINVL 251
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT- 296
S F+ +G + + + Y+V G+ YP S LI + G+S P+
Sbjct: 252 DRSNIFEAQISGTAWGVKFELQGRKYQHAYYLVNGI-YPSWSTLI---KVKGISQDPPSL 307
Query: 297 -FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
F L EA R +AF L+ W IL++ + + I++ C +LHN+I++
Sbjct: 308 HFQKLQEAFRKDIERAFGVLQARWTILTRPARFWTQEDMLGIMMTCVILHNMIVE 362
>gi|328699471|ref|XP_003240943.1| PREDICTED: hypothetical protein LOC100570252 [Acyrthosiphon pisum]
Length = 348
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 81/311 (26%), Positives = 133/311 (42%), Gaps = 31/311 (9%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
+S TF+ + LV VSR P N L V + ++ LR LASG+S +S+ +F
Sbjct: 1 MSSTTFEELVCLV-GPYVSRFP----NFRKDTLVVGEILSCTLRYLASGDSMMSLVYSFR 55
Query: 129 VGQSTVSQ-------VTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
+G ST+S+ V W+ +E W + R +FE+ + L +C AI
Sbjct: 56 MGHSTISKLICECCLVLWQVLEKKVLLTPSIFNWKKTAR------EFEDKWNLPHCVAAI 109
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D HI+ + S + + + +S ++ + D F+ + G G + + + S
Sbjct: 110 DGKHIVHQAFSNDGSTHF-NYKGTHSTVLLAMCDANYNFLLVDIGALGRCSDGGVFRSSN 168
Query: 242 FFK-LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP----T 296
K L V I + +IVG +PL +L+ PY G S +MP
Sbjct: 169 IGKAFANKTIHLPEPVDIDGVNGPIPYFIVGDEAFPLTKYLMRPYPGRGRS-TMPKDEEI 227
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII----IDSG 352
FN R AF L +RI K + +K + +I +LHN I D+G
Sbjct: 228 FNYRLSRARRTIENAFGILASRFRIFRKPIIASEK-TIINITKATIVLHNFIKKLETDAG 286
Query: 353 DQLH-PDVALS 362
++ P+++ S
Sbjct: 287 ISMYNPNISCS 297
>gi|223944065|gb|ACN26116.1| unknown [Zea mays]
Length = 298
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 118/262 (45%), Gaps = 18/262 (6%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+ VE+QVA+ L + VG +G TVS+ + + A+ E ++ P
Sbjct: 1 MCVEEQVAMFLNTVGHNLRNRLVGTNYGRSGETVSRYFNKVLRAIGELRAELIRPPSLET 60
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
+I +C GAID THI ++ ++ + +++ S V VD ++RF
Sbjct: 61 PSKIAGNPRWDPYFKDCIGAIDGTHIRASV-SIDMEPSFRGRKSYASQNVMAAVDFDLRF 119
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVG---GVGYPL 277
++ GW G + + +L+ +A +R NG +R+ + L + G G P
Sbjct: 120 TYVLAGWEGSTHDALVLR--------DALERENG-LRVPQGKFYLVDAGYGAKPGFLPPF 170
Query: 278 --LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKL 334
+ + + + N + FN H + R +AF LK ++IL + P ++
Sbjct: 171 RAVRYHLNEWGNNPVQNEKELFNLRHSSLRVTVERAFGSLKRRFKILDDATPFFPFSTQV 230
Query: 335 PSIILVCCLLHN-IIIDSGDQL 355
I++ CC++HN +I D GD+L
Sbjct: 231 -DIVVACCIIHNWVIQDGGDEL 251
>gi|403178386|ref|XP_003336827.2| hypothetical protein PGTG_18233 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375164133|gb|EFP92408.2| hypothetical protein PGTG_18233 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 432
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/336 (25%), Positives = 135/336 (40%), Gaps = 41/336 (12%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
++E FK +K F ++ S V + P N L + Q+A+ L RL S
Sbjct: 101 QDEDFKQAVCTTKAGFTWLLSQVNLN-----PIFHSNSFRPQLPIPHQLALTLERLGSNG 155
Query: 119 SQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFE-ESFGL 174
+ SVG GVG TV + + R I A+ + ++ +L WPD R +EI + E F
Sbjct: 156 NGASVGRFSRNLGVGHGTVVKASRRVIRAINDLSEKYLTWPDEVRRKEISDVMKCEVFE- 214
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
C G +D T I + + + D++ YS+ Q + + + +TGWPG S
Sbjct: 215 -GCVGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQVVCNCDRFITAFMTGWPGSCGDS 273
Query: 235 RLLKFSGFFK----LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
+ K K + GQ Y++ Y L I Y+
Sbjct: 274 MVFKRMMLHKEPTLFFDRGQ-----------------YLIADSAYELGVHCIPAYKAPAA 316
Query: 291 SASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRP-DKRKLPSIILVCCLLH 345
+ FN +R LKG W L + + +P D ++ + C LH
Sbjct: 317 YIKENSDFNYCLARSRVWNEHTIGILKGQWASLQHLRLAIQKPSDMMEVIRWVNCCVTLH 376
Query: 346 NIIIDSGDQ-LHPDVALSDHHDS---GYGEQCCKQV 377
NI+ GD DV+++DH S G E+ ++V
Sbjct: 377 NILAHLGDAWAELDVSINDHPGSDGPGVNEETAQEV 412
>gi|449685623|ref|XP_002157724.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 368
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 112/256 (43%), Gaps = 26/256 (10%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPD 157
+++ I +R LA GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLARGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++++ C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWKNIIKEFDQNWNFPQCIGAIDGKHVRIEAPA-KSGSFFYNYKGFYSMVLLEICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G N + +L S F G+ N +IS + ++ +V
Sbjct: 146 YCFTMVDIGAYGRDNDAAILNASTF------GRAFNKGYFNLPKISEFDPKVPPVLVEDD 199
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKR 332
+ L W + PY L+ FN R +F L WRI S + +P
Sbjct: 200 IFALKPWSMKPYPGKNLTVQQRVFNYRLSRARRTIENSFGILAARWRIYRSPIKAKP--L 257
Query: 333 KLPSIILVCCLLHNII 348
K+ II LHN +
Sbjct: 258 KVEHIIKATVCLHNYL 273
>gi|406701986|gb|EKD05058.1| hypothetical protein A1Q2_00665 [Trichosporon asahii var. asahii
CBS 8904]
Length = 419
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 63/263 (23%), Positives = 109/263 (41%), Gaps = 24/263 (9%)
Query: 103 VEKQVAIALRRLASGESQVSVGVAFG-VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM 161
V Q+A+ L RL G + V G +G+ +V+ R I A+ + +KWPD R
Sbjct: 114 VSFQIAVFLYRLGRGATISDVCRTMGRIGRGSVTTYCMRTIVAILTTFNNVIKWPDPRRR 173
Query: 162 EEIKSKFEESFGLLNCCGAIDATHI-IMTLP--AVQTSDDWCDQENNYSMLVQGIVDHEM 218
E I ++ +G+ C G ID +HI + P +++ + + +++ Y +L+ + D
Sbjct: 174 EAISTRLRRDYGIPGCVGFIDGSHIPLHKCPSFSIEKNASFFSRKHRYGLLILAVCDEAK 233
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
RF + TG + R QR + R E ++++G G+
Sbjct: 234 RFTYLQTGHYASASDFR-------------AQRSSALHRRPRELFSRDQFVLGDSGFYCS 280
Query: 279 SWLITPYETNG---LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV--MWRPDK-- 331
+I Y L+ FN R AF LK W +L+ + + DK
Sbjct: 281 LNVIPMYRRRAAQDLTREQQKFNDRVAKARVKIEHAFGVLKLRWLMLNDINLTMKTDKDI 340
Query: 332 RKLPSIILVCCLLHNIIIDSGDQ 354
+ I +LHN+ I + +Q
Sbjct: 341 NTAFAYIRTAVVLHNLYIHTANQ 363
>gi|331244786|ref|XP_003335032.1| hypothetical protein PGTG_16639 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309314022|gb|EFP90613.1| hypothetical protein PGTG_16639 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 332
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 67/132 (50%), Gaps = 4/132 (3%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG V +STV + T + IEAL + ++ WP+
Sbjct: 139 LPIAHQLALTLERLGSNGNAASVGRFSRNLNVSRSTVIKATQQVIEALVSLGRQYVMWPN 198
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ R EI S+ ++ G + C G +D T I + + + D++ YS+ Q I D +
Sbjct: 199 TTRWAEI-SEVMKNEGFVGCVGFVDGTTIPLFQRPGYDGEVFYDRKRRYSINAQIICDCD 257
Query: 218 MRFIDIVTGWPG 229
+TGWPG
Sbjct: 258 KYITSFITGWPG 269
>gi|312377954|gb|EFR24658.1| hypothetical protein AND_10615 [Anopheles darlingi]
Length = 252
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 77/151 (50%), Gaps = 7/151 (4%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ + T+D + + D+ P + + S EK+VA+AL +L SG SVG
Sbjct: 105 FRLDRPTYDMLVEALNPDMAPHPL-----LISQSCSTEKKVAVALYKLISGSDYASVGDQ 159
Query: 127 FGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATH 185
FGV ++TV ++F +AL + + P ++ EI S FEE L GA+ H
Sbjct: 160 FGVHKATVKNCLFQFCKALVKNFMDAEIALPLTDEAMEISSAFEEKCDLPMVMGALGLLH 219
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
I +T P+ S ++ + + S+ +Q +VDH
Sbjct: 220 IPIT-PSGAESKNYLNSKKWASITLQAVVDH 249
>gi|340382280|ref|XP_003389648.1| PREDICTED: putative nuclease HARBI1-like [Amphimedon queenslandica]
Length = 241
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 97/205 (47%), Gaps = 20/205 (9%)
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+++ +EI + FE + +C GA+D HI++ P + + + ++ +S+++ +VD +
Sbjct: 9 TSKWKEIATGFETYWQFPHCIGALDGKHIVIR-PPPNSGSYYFNYKHTFSIVLLALVDAD 67
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
+F + G G N S LC A + + NV I + + L IV
Sbjct: 68 YKFTYVNIGC-NGRNSS----------LCAALETNSLNVPLPFPICEDGIPLPYMIVADE 116
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKR 332
+PL +++ PY GL+ FN R + AF L +++ ++ + PDK
Sbjct: 117 AFPLKTYIQKPYAQIGLTKEKRIFNYCLSRARRIVENAFGILANRFQVFMTPIRLSPDKA 176
Query: 333 KLPSIILVCCLLHNIIIDSGDQLHP 357
+ +I+L CC LHN + S + HP
Sbjct: 177 E--TIVLACCSLHNFLRSSSAR-HP 198
>gi|270010647|gb|EFA07095.1| hypothetical protein TcasGA2_TC010085 [Tribolium castaneum]
Length = 1007
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 102/221 (46%), Gaps = 14/221 (6%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E FK FR++K T ++ ++ + + L + LS + Q+ IALR A+G
Sbjct: 38 DEHAFKKRFRLNKATVMHLVDIIGDRV------ALKTQRNKSLSAQTQMLIALRFYATGG 91
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEE----IKSKFEESFGL 174
+G V +S + +V R + + H++K P RM E K KF G
Sbjct: 92 FLELLGDWIHVHKSNICRVIQRVTHDIARLSPHYIKMP---RMTEELMATKRKFFRICGF 148
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
GAID TH+ + P + + +++ +S+ VQ D +++ + I++ WPG ++ S
Sbjct: 149 SRVVGAIDCTHVGIQSPGGANGELYRNRKGYFSINVQATCDADLKLLHIISRWPGLVHDS 208
Query: 235 RLLKFSGFFKLCEAGQRLNGNV-RISSEEVELREYIVGGVG 274
+ S +C + + R S E +RE ++ G G
Sbjct: 209 TKTQSSKCNLVCGLSTAVTEKILRDESIERRIRENVIIGGG 249
>gi|346471011|gb|AEO35350.1| hypothetical protein [Amblyomma maculatum]
Length = 445
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/308 (23%), Positives = 132/308 (42%), Gaps = 23/308 (7%)
Query: 57 SDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
+++E F+ + R+S TFD + V+ L S + +S ++AI +R LA+
Sbjct: 60 AEDEAYFRDYLRMSPSTFDTLLGFVKHSL-----SRQVTPFSDPISAHDRLAITVRFLAN 114
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERA-KHHLKWPDSN-RMEEIKSKFEESFGL 174
G++ S+ F +G+ST S + A+ ++ +P + +I E +
Sbjct: 115 GDTFRSLSYNFLIGRSTASVLIRETTAAIWTNLWDEYIPFPQTEVEWRKIALDMENYWNF 174
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
NC G++D H+ + P S + + + +S+++ D RF + GG S
Sbjct: 175 PNCIGSLDGKHVNIECPNNSGSRN-MNYKKTFSVVLLACCDAHYRFTYVDLCHYGGEGDS 233
Query: 235 RLLKFSGFFK-LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-----ETN 288
+ S K L + + S ++ IVG +PL ++L+ PY +T+
Sbjct: 234 GIFLRSDLLKDLTNNWCGVPAPTTVGSAG-DIPYVIVGDEAFPLKTFLMRPYARRDLQTH 292
Query: 289 GLSAS-------MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
LS S TFN R + +F + WRIL + +R + +I C
Sbjct: 293 RLSPSGREEYQQRATFNYRLSRARRVIENSFGIMAARWRILRRP-FRASEETTENICKAC 351
Query: 342 CLLHNIII 349
+LHN ++
Sbjct: 352 VVLHNFMM 359
>gi|397634896|gb|EJK71629.1| hypothetical protein THAOC_06907 [Thalassiosira oceanica]
Length = 485
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 141/341 (41%), Gaps = 60/341 (17%)
Query: 63 FKYFFRVSKKTFDYICSLVREDL---------------VSRPP-----SGLI----NIEG 98
F+ +FR+S++ F +C + E++ +R P S L+ N G
Sbjct: 91 FRRYFRMSRECFYLLCRRIEENVGEKAFKSEVFLDNLKYTRDPNLVRMSKLMRAHENTTG 150
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA---------LEERA 149
+S E ++A+ LR LA G S + + + F G ST ++ + I +
Sbjct: 151 GFISGEIKLALTLRLLAGG-SYLDLSLLFECGSSTAYEIFHKVIREWICSKDKPLVNING 209
Query: 150 KHHLKWPDSNRMEEIKSKF-EESFGLLN-CCGAIDATHIIMTLPA----VQTSDDWCDQE 203
K + D RM + +F S GL + C GAID + + P+ V + ++
Sbjct: 210 KDFI--DDEERMAAVALEFARSSAGLFSGCIGAIDGWVVKIKKPSQRDNVGNVASFYSRK 267
Query: 204 NNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF-KLCEAGQRLNGNVRISSEE 262
Y + V I D + R + V G + S K S + KL + RL
Sbjct: 268 GFYGLNVVVICDRKKRILYRVINSRGAEHDSTAFKNSSLYRKLMDDCDRL---------- 317
Query: 263 VELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
+E + +G Y + S+L+TP++ FN H ++R F ++ W IL
Sbjct: 318 LEKGFHFIGDSAYAIRSFLLTPFDNAVHGTPEDNFNFFHSSSRICIECTFGEVDLRWGIL 377
Query: 323 SKVMWRPDKRKLPS---IILVCCLLHNIIIDSGDQLHPDVA 360
W P K L + +I C +LHN I+D + L +V+
Sbjct: 378 ----WSPLKFSLRNNIKVIDACLMLHNFIVDHRESLGENVS 414
>gi|403179756|ref|XP_003338057.2| hypothetical protein PGTG_19587 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165340|gb|EFP93638.2| hypothetical protein PGTG_19587 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 431
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/303 (24%), Positives = 122/303 (40%), Gaps = 27/303 (8%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E FK R +K F ++ + + + SG + L V Q+A+ L RL S
Sbjct: 101 RDEDFKQAVRTTKSGFTWLLRQIAFNPIFH--SGSFRPQ---LPVPHQLALTLERLGSNG 155
Query: 119 SQVSVGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ SVG VG+ T+ + + R I A+ + ++ WPD +R +EI S ++ G
Sbjct: 156 NGASVGRISRNLSVGRGTIIKASRRVIRAINDLGSTYVLWPDKDRRKEI-SDVMKAEGFE 214
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G +D T I + + + D++ YS+ Q I D + +TGWPG S
Sbjct: 215 GCIGFVDGTTIPLYQRPSIDGEVFFDRKKRYSINCQIICDCDRFITGYMTGWPGSCGDSM 274
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ K K E + +Y++ Y L I Y+
Sbjct: 275 VFKKMMVHKEPE-------------RFFDPGQYLIANSAYELGVHCIPAYKAPAAYIREN 321
Query: 296 T-FNSLHEATRSLAVKAFLQLKGGWRILSKV---MWRP-DKRKLPSIILVCCLLHNIIID 350
T FN +R LKG W L + + +P D ++ + C LHN++
Sbjct: 322 TEFNYCLARSRVRNEHTIGILKGRWSSLQHLRLSIQKPTDMMEIIRWVNCCITLHNMLAH 381
Query: 351 SGD 353
GD
Sbjct: 382 LGD 384
>gi|77552491|gb|ABA95288.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 916
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 102/237 (43%), Gaps = 55/237 (23%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + ++ N V + D +MRF ++ GWPG
Sbjct: 687 LDNCIGAIDGTHIQVVVPNSAAVQHRNRHKEKSQN----VMCVCDFDMRFTFVLAGWPGS 742
Query: 231 MNVSRLL-----KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
++ R+ +FS F G + Y+V GYP + PY
Sbjct: 743 VHDMRVFNDAHTRFSAKFPKPPPG----------------KFYLVDS-GYPNRPGYLAPY 785
Query: 286 ETNGLS--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
+ G++ +++P FN H + R++ ++F LK WRIL + +
Sbjct: 786 K--GITYHFQEYNESTLPRGKREHFNYCHSSCRNVIERSFGVLKNKWRILFSLP-SYSQE 842
Query: 333 KLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
K II C LHN I DS Q+ + HD Y DP+G T+ + E
Sbjct: 843 KQSRIIHACIALHNFIRDS--QMADTEFDNCDHDENY--------DPLGGTSSPSSE 889
>gi|241594080|ref|XP_002404243.1| transposase, putative [Ixodes scapularis]
gi|215502315|gb|EEC11809.1| transposase, putative [Ixodes scapularis]
Length = 275
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 103/226 (45%), Gaps = 27/226 (11%)
Query: 131 QSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTL 190
Q V + R A+ + + +KWPD + I + FE G GAI +HI +
Sbjct: 22 QIHVFECVGRVGAAVVDASPDFIKWPDPEAAKSIIAGFEARSGFPGVIGAIGRSHIPIRC 81
Query: 191 PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ 250
P V S+ + ++ + +S+++Q + DH M F+D G PG + + + S F + EA +
Sbjct: 82 P-VDDSERFRNRLDFHSVVLQAVCDHRMVFLDCSAGHPGSTSDMLVFRRSLFIQSLEASK 140
Query: 251 RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL------SASMPTFNSLHEAT 304
++VG +P+ L+ P+E +G S + F+
Sbjct: 141 ------------FPFDSHMVGDATFPIGPHLMVPFEDDGELGEEEASFNEKIFDGCETIE 188
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKL-PSIILVCCLLHNIII 349
R+LA+ LKG +R L + +R L +II++ C+LHN I
Sbjct: 189 RALAL-----LKGRFRRLQGL--ETGRRDLVVTIIIMACVLHNACI 227
>gi|221106927|ref|XP_002156477.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 239
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 101/218 (46%), Gaps = 24/218 (11%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQV-------TWRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I + ++ + C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKELDQDWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G N + +L S F G+ N +IS + ++ +VG
Sbjct: 146 YCFTMVDIGAYGRDNDAAVLNASTF------GRAFNKGYFNLPKISEFDPKVPPVLVGDD 199
Query: 274 GYPLLSWLITPYETNGLSASMPTFN-SLHEATRSLAVK 310
+ L WL+ PY L+ FN L A R++ ++
Sbjct: 200 IFALKPWLMKPYPGKNLTVQQRVFNYRLSRAHRTIDIE 237
>gi|347972217|ref|XP_003436859.1| AGAP013326-PA [Anopheles gambiae str. PEST]
gi|333469354|gb|EGK97264.1| AGAP013326-PA [Anopheles gambiae str. PEST]
Length = 329
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 119/298 (39%), Gaps = 31/298 (10%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
+P +E F FR+ +K IC+ + + G E +S + V L
Sbjct: 10 VVPKYSDEEFLRQFRIGRKAVQMICNHLETTSAYKKLRGHGGYEA--ISPQTHVLSFLWF 67
Query: 114 LASGESQV-SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN-RMEEIKSKFEES 171
L ++ V F + S + V R +A+ KH L P S R F +
Sbjct: 68 LGHDKTSYRDVATQFNLSVSCLHSVICRVADAILS-MKHILMIPLSEARKTASDIAFSKK 126
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G + T I + P + + + +YS+ +Q I+D +RF+D+ +P
Sbjct: 127 CNFSGVIGCVGGTQIKIDKPR-ENPERYLLTTGHYSIQLQAIIDENLRFVDVFVEYPDEP 185
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-- 289
+L E +R+ GN R ++G YP ++ L+ PY +
Sbjct: 186 ------------ELLETVKRICGN----------RYCLLGNASYPCMNQLLVPYPVDEVL 223
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L+ + T+N + S + F LK +R L + R R + +I VCC+LHN+
Sbjct: 224 LTQAQKTYNDHLQTVTSQCNQIFAHLKTRFRRLHHLKGRHLSR-MVDLIKVCCILHNL 280
>gi|328699639|ref|XP_001946822.2| PREDICTED: hypothetical protein LOC100159851 [Acyrthosiphon pisum]
Length = 401
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 132/308 (42%), Gaps = 23/308 (7%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+ + FK F+R+SK FD + +V + + + I + VE+++ I LR LA+G
Sbjct: 50 DPDKFKTFYRMSKSNFDNLVHIVGPKIFKKDTNFRIAV-----PVEERILITLRFLATGC 104
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNRMEEIKSKFEESFGLLNC 177
+ ++ F G++TV ++ EA+ E + +L P + I +++ + L +C
Sbjct: 105 NFRALAQHFMRGETTVGKIIAETTEAIWECLQPTYLPVPSLELWKNIAARYNLLWQLPHC 164
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G+ID HI + S ++ + + +S+ + D + FI + G G N
Sbjct: 165 LGSIDGKHIRIKKFNNTGSRNF-NYKGFFSIQLLACADADGCFITVDIG-DLGRNSD--- 219
Query: 238 KFSGFFKLCEAGQRLN-GNVRISSEE--------VELREYIVGGVGYPLLSWLITPYETN 288
G F+L G+ L G + + E Y G +PL S+L+ PY
Sbjct: 220 --GGVFRLSRLGRWLEIGGMNVPQSEPLPHDNEGPNFPYYFCGDEAFPLKSYLLRPYPQK 277
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
L+ FN R AF + +R+ + + + SI+ C LHN I
Sbjct: 278 TLNDQKRIFNYRLSRGRKSVECAFGMMVSKFRVFETPI-ACSESTVISIVKCACALHNYI 336
Query: 349 IDSGDQLH 356
+ +L+
Sbjct: 337 RKTEGKLY 344
>gi|354542525|ref|NP_001121305.2| uncharacterized protein LOC100158389 [Xenopus laevis]
Length = 416
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 149/334 (44%), Gaps = 22/334 (6%)
Query: 44 WAKNSSTPGATIPSDE-----EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEG 98
W N T P +E + FK R+S +F+ + V L+++ + +
Sbjct: 35 WLVNHEQLSHTGPLNELRERNPDDFKNRLRMSDSSFETLLQAV-APLIAKQDTCM----R 89
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPD 157
+ + E+++ L+ LA+G S + + G+ ++ + A+ E K +LK+P
Sbjct: 90 QAIPAEQRLIATLQFLATGRSLEDLKIPIGISAQSLGHIIPETCNAIFEALKTAYLKFPS 149
Query: 158 SNRMEEIKSK-FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ + ++ FEE + NC GAI H+ + P+ ++ + + + S+++ IV+
Sbjct: 150 TEAEWKATARHFEEFWNFPNCGGAIVGKHVRIKPPS-RSGSYFTNYKGYNSIVLLAIVNA 208
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ-RLNGNVRISSEEVELREYI-VGGVG 274
+ F+ + G G ++ ++ + F++ + Q +L N S+ E Y+ V G
Sbjct: 209 KYEFLMVDVGKNGRVSSVESMEQTYFYQRLQNHQLQLPSN----SDTSEGMNYVFVTGEE 264
Query: 275 YPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRK 333
+ L ++TP+ +S FN R +A AF L +RI + + P K
Sbjct: 265 FALHEHILTPFPQKDMSFERRIFNYRLSRARRVADNAFGILSNRFRIFHTAINLSPSKTD 324
Query: 334 LPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
S++L CC+LHN + + + + D D+
Sbjct: 325 --SVVLACCVLHNFLCRTSGASYMPATMLDREDT 356
>gi|291228639|ref|XP_002734285.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 429
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 110/249 (44%), Gaps = 7/249 (2%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWP-DSNRMEE 163
++AI LR LA+G S ++ AF V +T+S A+ E P ++ E
Sbjct: 113 KLAITLRYLATGNSYKTLQYAFRVAHNTISLFIPEVCLAIISEYQDEVFSCPITTDEWSE 172
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ + +++ + CGA+D HI + P + + + + YS+ + +VD F+
Sbjct: 173 VAQTYADTWNFHHVCGALDGKHIAIRNPP-GSGTIYYNYKGFYSLTLLALVDGNYEFLWA 231
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQR-LNGNVRISSEEVELREYIVGGVGYPLLSWLI 282
G PG +++ S E G L + ++ + +++G +PL +W+
Sbjct: 232 DVGNPGSSLDAQVFNHSPLRHGLENGTLGLPDPDPLPDDDRDTPYFLIGDDVFPLRTWMQ 291
Query: 283 TPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR-ILSKVMWRPDKRKLPSIILVC 341
PY + FN R + +F L WR +LS + P+K + ++I+ C
Sbjct: 292 KPYSNREQTDEERIFNYRLSRARRVVENSFGILAHRWRCLLSTLQLDPEKAR--TVIMAC 349
Query: 342 CLLHNIIID 350
LHN++ D
Sbjct: 350 MCLHNLMRD 358
>gi|432962500|ref|XP_004086700.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 387
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 112/291 (38%), Gaps = 62/291 (21%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
LSVE+ V +ALR ASG SVG A + ++T+ + A++ A + +P R
Sbjct: 73 LSVEQMVCVALRFFASGAFLYSVGDAEQLNKATICRTIRSVCLAIKALADVFISFPGHRR 132
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIM-----------------------TLPAVQTSD 197
+ ++K +F G N GA+D THI + V S+
Sbjct: 133 LCDVKEEFYRIAGFPNVIGAVDCTHIRIKRGHTQHPPLTIEGAAVERVSSTKFLGVHFSE 192
Query: 198 DWCDQENNYSMLVQG---------------------IVDHEMRFIDIVTGWPGGMNVSRL 236
D N S+ + + + + ++V WPG ++ SR+
Sbjct: 193 DLSWTTNTASLAKKAQQRSSAPALPDSPTALSIRLMVCNADCVISNVVAKWPGSVHDSRI 252
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
+ S ++ G E ++G GY +L+TP+ T+ A
Sbjct: 253 FRASEIYQCLSQG--------------EFSGVLLGDRGYGCQPFLLTPF-TDPQEAQQ-A 296
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
+N H TR+ F LK + L K+ R + I + C +LHN+
Sbjct: 297 YNHAHARTRTRVEMTFGLLKARFHCLHKL--RVNPVTACDITVACAVLHNV 345
>gi|67083861|gb|AAY66865.1| possible transposase [Ixodes scapularis]
Length = 254
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 94/202 (46%), Gaps = 22/202 (10%)
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
++ + +F++ G G++ TH+ + PA + D+E S+ VQ + +RF
Sbjct: 22 LDLLAGEFQKVAGFPGVVGSMGGTHVNVRCPAHRQRPVSTDRE--LSLAVQAVSGPWLRF 79
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
+D+ G PG LL S G+RL E + R +++ V YP
Sbjct: 80 LDVFVGPPGDKRNMALLSLSPL------GKRL--------ESFDHRYHLLTDVVYPPREC 125
Query: 281 LITPY----ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
L+ PY + + + F++LH T ++ A +K +R LS++ + K+
Sbjct: 126 LVPPYGEKPSDDPETLAKQEFDALHGVTHAVVNDALALVKRRFRQLSRLEFF-TLEKMSD 184
Query: 337 IILVCCLLHNIIIDSGD-QLHP 357
+L CC+LHN ++D+ D QL P
Sbjct: 185 FVLACCVLHNFLVDADDVQLDP 206
>gi|449688564|ref|XP_004211776.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 133
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 62/124 (50%), Gaps = 1/124 (0%)
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
+ FE+ +G A+D I + P S+D+ +++ +S+++Q + D F DI
Sbjct: 2 ASFEKLYGFPQVVRAVDKCQIRIKAP-YNNSEDYINRKEYHSIILQCLADSRYFFRDIFV 60
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
GW SR+LK S +K C A L N+ + +E+ +I+G Y L +WL+ PY
Sbjct: 61 GWTSKSRDSRVLKISPLYKECLAKTFLPNNLNKIIDNIEIGPHILGDSTYSLENWLMKPY 120
Query: 286 ETNG 289
G
Sbjct: 121 SDRG 124
>gi|357143017|ref|XP_003572773.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 396
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 71/282 (25%), Positives = 122/282 (43%), Gaps = 34/282 (12%)
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKW 155
++ +SVE+QVAI L +A + ++ F +T+ + R +EA+ +++
Sbjct: 83 VDTIYVSVEEQVAIFLYAVAKNATNETLQDWFQHSPATIHRHFKRVLEAITNLTPIYIRP 142
Query: 156 PDSNRMEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV 214
P + + K KF F NC GA+D THI+M LP Q + + +++ S
Sbjct: 143 PSLHPHSILRKPKFYPFFK--NCIGAVDGTHILMKLPLDQ-QEPYKNRKQTISQNAMVAC 199
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
D +++F+ I GW G + +R+L+ + L + L+G ++ + VG
Sbjct: 200 DFDLKFVHINPGWEGSASDARVLQDA----LNHGFEVLDG------------KFYLVDVG 243
Query: 275 YPLLSWLITPYE-------TNGLSASMPT-----FNSLHEATRSLAVKAFLQLKGGWRIL 322
Y + PY G + P FN H R+ + K + IL
Sbjct: 244 YANTPQFLAPYRGTRYHLNEQGRARQKPQNHKELFNLRHAQLRNHIERIIGIWKMRFPIL 303
Query: 323 SKVMWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSD 363
V P ++++ I + C +LHN I I +GD P A D
Sbjct: 304 KAVSHFPKEKQI-DISVACAVLHNFIRIHNGDMTWPSDATMD 344
>gi|31430659|gb|AAP52540.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|62733440|gb|AAX95557.1| Transposase family IS4, putative [Oryza sativa Japonica Group]
gi|62733452|gb|AAX95569.1| Transposase family IS4, putative [Oryza sativa Japonica Group]
Length = 916
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 102/237 (43%), Gaps = 55/237 (23%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + ++ N V + D +MRF ++ GWPG
Sbjct: 687 LDNCIGAIDGTHIQVVVPNSAAVQHRNRHKEKSQN----VMCVCDFDMRFSFVLAGWPGS 742
Query: 231 MNVSRLL-----KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
++ R+ +FS F G + Y+V GYP + PY
Sbjct: 743 VHDMRVFNDAHTRFSAKFPKPPPG----------------KFYLVDS-GYPNRPGYLAPY 785
Query: 286 ETNGLS--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
+ G++ +++P FN H + R++ ++F LK WRIL + +
Sbjct: 786 K--GITYHFQEYNESTLPRGKREHFNYCHSSCRNVIERSFGVLKNKWRILFSLP-SYSQE 842
Query: 333 KLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
K II C LHN I DS Q+ + HD Y DP+G T+ + E
Sbjct: 843 KQSRIIHACIALHNFIRDS--QMADTEFDNCDHDENY--------DPLGGTSSPSSE 889
>gi|403160739|ref|XP_003321192.2| hypothetical protein PGTG_02234 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170375|gb|EFP76773.2| hypothetical protein PGTG_02234 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 377
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 66/141 (46%), Gaps = 4/141 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ TV + + R I A+ + + +L WPD
Sbjct: 73 LPIAHQLALTLERLGSNGNGASVGRFSRNLSVGRGTVVKASRRVIRAINDLSGRYLLWPD 132
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+NR EI +E G C G +D T I + + + D + YS+ Q I +
Sbjct: 133 TNRRREISEVMKEE-GFEGCVGFVDGTTIPLYQQPSIDGEVFFDCKKRYSINCQVICNCN 191
Query: 218 MRFIDIVTGWPGGMNVSRLLK 238
+TGWPG S + K
Sbjct: 192 RFITAYMTGWPGSCGDSMVFK 212
>gi|72158323|ref|XP_794409.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 321
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 103/231 (44%), Gaps = 28/231 (12%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
D+E+ FK +R S++ ++ + + + R L QV +ALR A+G
Sbjct: 31 DDEQMFK-LYRFSRQGCLHLINRLSRRIQHR------TQRNHALPPSLQVFVALRFFATG 83
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF---GL 174
G+ ST S+ R AL E +K+P++ +EE+ + + F G+
Sbjct: 84 SVMDCTSCIHGIHISTTSRTVRRVALALGELRDEVIKFPET--LEEVGTAQVDFFNLAGM 141
Query: 175 LNCCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
+ GA+D TH+ + P + + +++ +S+ VQ I + + ++ WPG +
Sbjct: 142 PHVVGAVDGTHVELHGAPLLDDEYIFTNRKGKHSINVQLICNARYKITNVCARWPGSTHD 201
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
SR+L+ + R + EL +VG GYPL WLITP
Sbjct: 202 SRVLR---------------SHWRERFADGELPGILVGDSGYPLQPWLITP 237
>gi|115617904|ref|XP_001201283.1| PREDICTED: uncharacterized protein LOC764781 [Strongylocentrotus
purpuratus]
Length = 391
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 73/323 (22%), Positives = 138/323 (42%), Gaps = 30/323 (9%)
Query: 58 DEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRP--PSGLINIEGRLLSVEKQVAIA 110
++ FK + RVS + FD + L ++D + R P GL ++++
Sbjct: 32 NDRRKFKKYTRVSPEMFDDLLVRLTPHLQKKDTIFRKAIPPGL------------KLSVF 79
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALE-ERAKHHLKWPDSNR-MEEIKSKF 168
LR LA+G + + F VG+ T+ + A+ E A + P +N E+ F
Sbjct: 80 LRHLATGATYAELSYNFRVGKETIQKFVPDVARAIVVEYAAEVISLPTTNEGWLEVAGDF 139
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
E + L +C GA D HI + P ++ + + + +S+++ +VD + +F+ I G
Sbjct: 140 EARWNLPHCLGAYDGKHIRLKKPN-KSGSLYFNYKQFFSVVLMALVDSKYQFLWIDVGRV 198
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRL---NGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
G + +++ S + EAG + EE + + VG + + ++++ PY
Sbjct: 199 GHQSDAQIYNNSELKECIEAGTLGIPDPAPLPHDDEEHPMPYFFVGDDAFAMRTYMMKPY 258
Query: 286 ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR-ILSKVMWRPDKRKLPSIILVCCLL 344
+ FN R + AF L ++ L ++ P +L ++ +L
Sbjct: 259 GRRNMDQQQKVFNYRLSRARRVVENAFGILALWFQCFLGQMRQEPGTVRL--LVEAAVML 316
Query: 345 HNIIIDSGDQLHPDVALSDHHDS 367
HN+I L DV + D D+
Sbjct: 317 HNLIRKRYQAL--DVRMLDQEDA 337
>gi|301624193|ref|XP_002941412.1| PREDICTED: hypothetical protein LOC100487167 [Xenopus (Silurana)
tropicalis]
Length = 470
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 127/298 (42%), Gaps = 24/298 (8%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
E F + R+S TFD + LV L R + + + +S +++ + LR LA+G +
Sbjct: 134 EKFFAYIRMSISTFDELLKLVHPHL-HRMDTNM----RQAISPAERLVVTLRFLATGSTF 188
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESFGLLNCCG 179
++ F +G++T+ + + + K + P++ + +I F E NC G
Sbjct: 189 AALHYQFLIGRATIGMIVRETCKTIWNVTKDLVMPEPNTEKWMKIAEGFYEKTDFPNCIG 248
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
A+D HI +T P T C + + ++ +VD F I G G +
Sbjct: 249 ALDGKHIRVTRPP-NTVSKSCSKV--FFTVLLALVDSSYCFTYIDVGAYGSDGDA----- 300
Query: 240 SGFFKLCEAGQRLN-GNVRISSEE-------VELREYIVGGVGYPLLSWLITPYETNGLS 291
SGFFK G+ +N G + + + L IV + + + ++ PY L+
Sbjct: 301 SGFFK-SNLGKMVNEGKLNFPANKPLPGTKGPALPYVIVADEAFGISTNVMRPYPIRNLT 359
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
+ FN R + AF L WR+ + + + + II C+LHN+++
Sbjct: 360 GTKRAFNYRLTRARRMVECAFGILANKWRVFHSAI-QLNTAFVDDIIKCACVLHNLVL 416
>gi|183985722|gb|AAI66249.1| Unknown (protein for MGC:185250) [Xenopus (Silurana) tropicalis]
Length = 473
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 127/298 (42%), Gaps = 24/298 (8%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
E F + R+S TFD + LV L R + + + +S +++ + LR LA+G +
Sbjct: 137 EKFFAYIRMSISTFDELLKLVHPHL-HRMDTNM----RQAISPAERLVVTLRFLATGSTF 191
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDSNRMEEIKSKFEESFGLLNCCG 179
++ F +G++T+ + + + K + P++ + +I F E NC G
Sbjct: 192 AALHYQFLIGRATIGMIVRETCKTIWNVTKDLVMPEPNTEKWMKIAEGFYEKTDFPNCIG 251
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
A+D HI +T P T C + + ++ +VD F I G G +
Sbjct: 252 ALDGKHIRVTRPP-NTVSKSCSKV--FFTVLLALVDSNYCFTYIDVGAYGSDGDA----- 303
Query: 240 SGFFKLCEAGQRLN-GNVRISSEE-------VELREYIVGGVGYPLLSWLITPYETNGLS 291
SGFFK G+ +N G + + + L IV + + + ++ PY L+
Sbjct: 304 SGFFK-SNLGKMVNEGKLNFPANKPLPGTKGPALPYVIVADEAFGISTNVMRPYPIRNLT 362
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
+ FN R + AF L WR+ + + + + II C+LHN+++
Sbjct: 363 GTKRAFNYRLTRARRMVECAFGILANKWRVFHSAI-QLNTAFVDDIIKCACVLHNLVL 419
>gi|397602758|gb|EJK58257.1| hypothetical protein THAOC_21635 [Thalassiosira oceanica]
Length = 442
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 138/337 (40%), Gaps = 66/337 (19%)
Query: 63 FKYFFRVSKKTFDYICSLVREDL---------------VSRPP-----SGLI----NIEG 98
F+ +FR+S++ F +C + E++ +R P S L+ N G
Sbjct: 91 FRRYFRMSRECFYLLCRRIEENVGEKAFKSEVFLDNLKYTRDPNLVRMSKLMRAHENTTG 150
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA---------LEERA 149
+S E ++A+ LR LA G S + + + F G ST ++ + I +
Sbjct: 151 GFISGEIKLALTLRLLAGG-SYLDLSLLFECGSSTAYEIFHKVIREWICSKDKPLVNING 209
Query: 150 KHHLKWPDSNRMEEIKSKF-EESFGLLN-CCGAIDATHIIMTLPAVQTSDDWCDQENNYS 207
K + D RM + +F S GL + C GAID + + P+ + Y
Sbjct: 210 KDFI--DDEERMAAVALEFARSSAGLFSGCIGAIDGWVVKIKKPS----------QGFYG 257
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF-KLCEAGQRLNGNVRISSEEVELR 266
+ V I D + R + V G + S K S + KL + RL +E
Sbjct: 258 LNVVVICDRKKRILYRVINSRGAEHDSTAFKNSSLYRKLMDDCDRL----------LEKG 307
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
+ +G Y + S+L+TP++ FN H ++R F ++ W IL
Sbjct: 308 FHFIGDSAYAIRSFLLTPFDNAVHGTPEDNFNFFHSSSRICIECTFGEVDLRWGIL---- 363
Query: 327 WRPDKRKLPS---IILVCCLLHNIIIDSGDQLHPDVA 360
W P K L + +I C +LHN I+D + L +V+
Sbjct: 364 WSPLKFSLRNNIKVIDACLMLHNFIVDHRESLGENVS 400
>gi|242038949|ref|XP_002466869.1| hypothetical protein SORBIDRAFT_01g015480 [Sorghum bicolor]
gi|241920723|gb|EER93867.1| hypothetical protein SORBIDRAFT_01g015480 [Sorghum bicolor]
Length = 515
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 80/184 (43%), Gaps = 13/184 (7%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID TH+ +P + C ++ + V D +MRF +++GWPG +
Sbjct: 157 DCIGAIDGTHVPCVVPQEKFVQHLC-RKGMTTQNVMAACDFDMRFTFVMSGWPGSAHDMT 215
Query: 236 LLK-----FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
+ K F F G+ + ++ L Y G Y L + P + G
Sbjct: 216 VFKDAISRFGDLFSHPPTGKYYLVDSGYANRLGYLAPY--KGTKYHLQEYREGP-QPEGK 272
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
TFN H + R++ +AF LK WR+L ++ K II+ CC LHN I
Sbjct: 273 EE---TFNYAHSSLRNVIERAFGVLKMKWRMLREIPSY-STEKQSRIIVACCALHNFIRT 328
Query: 351 SGDQ 354
SG Q
Sbjct: 329 SGIQ 332
>gi|331241039|ref|XP_003333169.1| terpenoid cylase/protein prenyltransferase alpha-alpha toroid
[Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 431
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 125/277 (45%), Gaps = 20/277 (7%)
Query: 98 GRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKW 155
G+L LS +++ ALR+LA G + + +G++T +V F + + + +L+
Sbjct: 113 GKLGLSALQKITSALRQLAYGCAMDATDEYCRIGEATARKVLRAFTDTINRLYSSRYLRK 172
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
P+++ + + G C G++D H + + +E +++++ + D
Sbjct: 173 PNADDLIRLLD-VNNLRGFPGCIGSLDCMHWAWKNCPSAMAGQFKGKEKKPTVVLEAVAD 231
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
+ G G +N +++ S F+ AG + + + + Y+ G+ Y
Sbjct: 232 QRLWIWHAFFGTAGALNDINVIQRSTIFEDQIAGTGWDVRFEVCGRQYDHAYYLTDGI-Y 290
Query: 276 PLLSWLITPYETNGLSASMPT--FNSLHEATRSLAVKAFLQLKGGWRILS--KVMWRPDK 331
P S LI + G+S P+ F L EA R +AF L+ W I++ +W P
Sbjct: 291 PDWSTLI---KAKGVSQDAPSQHFKKLQEAFRKDIERAFGVLQARWLIITTPSRIWNP-- 345
Query: 332 RKLPSIILVCCLLHNIII-----DSGD--QLHPDVAL 361
+ SI+ C +LHN+I+ DS D +L P V+L
Sbjct: 346 ADMSSIMRTCVILHNMIVEDNERDSTDTFELPPSVSL 382
>gi|115528684|gb|AAI24960.1| LOC100158389 protein [Xenopus laevis]
Length = 352
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/272 (22%), Positives = 125/272 (45%), Gaps = 12/272 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSN 159
+ E+++ L+ LA+G S + + G+ ++ + A+ E K +LK+P +
Sbjct: 28 IPAEQRLIATLQFLATGRSLEDLKIPIGISAQSLGHIIPETCNAIFEALKTAYLKFPSTE 87
Query: 160 RMEEIKSK-FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ ++ FEE + NC GAI H+ + P+ ++ + + + S+++ IV+ +
Sbjct: 88 AEWKATARHFEEFWNFPNCGGAIVGKHVRIKPPS-RSGSYFTNYKGYNSIVLLAIVNAKY 146
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ-RLNGNVRISSEEVELREYI-VGGVGYP 276
F+ + G G ++ ++ + F++ + Q +L N S+ E Y+ V G +
Sbjct: 147 EFLMVDVGKNGRVSSVESMEQTYFYQRLQNHQLQLPSN----SDTSEGMNYVFVTGEEFA 202
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLP 335
L ++TP+ +S FN R +A AF L +RI + + P K
Sbjct: 203 LHEHILTPFPQKDMSFERRIFNYRLSRARRVADNAFGILSNRFRIFHTAINLSPSKTD-- 260
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDS 367
S++L CC+LHN + + + + D D+
Sbjct: 261 SVVLACCVLHNFLCRTSGASYMPATMLDREDT 292
>gi|322781647|gb|EFZ10267.1| hypothetical protein SINV_15374 [Solenopsis invicta]
Length = 207
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 79/175 (45%), Gaps = 22/175 (12%)
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIKSKFEE------ 170
S + +G + QSTVS++ +R L ++K P + E K F+E
Sbjct: 30 RSPLVLGDIITISQSTVSRIVFRVSTLLASHINRYIKMPSTQESRSENKRLFKELGYGPG 89
Query: 171 SFGLLNCCGAIDATHIIMTLPAVQT-SDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+ GL GAID HI + Q + + +++ +S+ VQ +V F+DIV WPG
Sbjct: 90 AIGLPCIDGAIDCCHIRLVHSRFQAIGETFRNRKGYFSLNVQAVVGPRTEFLDIVPEWPG 149
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
+ SR+ + N + + + EL +VG GYP L +L+TP
Sbjct: 150 SEHDSRIFQ--------------NSRIYMRYSQHELDGMLVGDAGYPALPFLLTP 190
>gi|331248878|ref|XP_003337060.1| hypothetical protein PGTG_18819 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309316050|gb|EFP92641.1| hypothetical protein PGTG_18819 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 310
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 82/183 (44%), Gaps = 11/183 (6%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
E+ FK R ++ F ++ + +R P N L + Q+A+ L RL S +
Sbjct: 102 EDDFKQAVRTTRSGFIWLLNKIRGH-----PVFYSNSRQEQLPIPHQLALTLERLGSNGN 156
Query: 120 QVSVGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
SVG +G+ T+ +V+ R I+A+ + ++ + WP R EI +E G
Sbjct: 157 GASVGRLSRNLSLGRGTIVKVSQRVIQAINKASEGFICWPTPERRAEISDVMKEE-GFEG 215
Query: 177 CCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G +D T I + P + W D + YS+ Q I D + + GWPG S
Sbjct: 216 CIGFVDGTTIPLHQRPGLDGEVYW-DHKKQYSINCQIICDCDKFITSFMAGWPGTCGDSM 274
Query: 236 LLK 238
+LK
Sbjct: 275 VLK 277
>gi|395327517|gb|EJF59916.1| hypothetical protein DICSQDRAFT_40900, partial [Dichomitus squalens
LYAD-421 SS1]
Length = 188
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 74/178 (41%), Gaps = 14/178 (7%)
Query: 68 RVSKKTFDYICSLVREDLV----SRPPSGLINIEGRLLSVEKQVAIALRRLAS-GESQVS 122
R+S+ TFD + +L+ + + R P V+ Q+ L R G +
Sbjct: 1 RLSRDTFDRLVALLGCNPIFVSTGRRPQ---------RHVKYQLGAFLFRYGRLGTDSLD 51
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
V GVG TV R A+ E +L+WP + E I S E+ G C G+ D
Sbjct: 52 VAQKLGVGHGTVVLYCKRVTRAIRELRSQYLQWPTKAQREAIASAIEDKSGFPKCVGSCD 111
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
+ I + + Q+ +YS +Q VDH RF GWPG + S + + S
Sbjct: 112 GSLIRFCEEPIVDGHVYVSQKKSYSTNIQTTVDHTGRFTSYDLGWPGSVPDSCIFRNS 169
>gi|449673392|ref|XP_004207945.1| PREDICTED: protocadherin Fat 4-like [Hydra magnipapillata]
Length = 2166
Score = 67.0 bits (162), Expect = 2e-08, Method: Composition-based stats.
Identities = 53/196 (27%), Positives = 98/196 (50%), Gaps = 19/196 (9%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLK-WP 156
+++ I +R LA+ ESQ + F VG++TV + W+F++ + RA + +K W
Sbjct: 1971 QRLIITIRYLATRESQQTQSFYFRVGRATVCHIIEETCCAIWKFLKKVFLRAPNDVKEW- 2029
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ I +F++++ L C GAID H+ + PA ++ + + + YSM++ I D
Sbjct: 2030 -----QNIIKEFDQNWNFLQCIGAIDGKHVRIEAPA-KSGPSFYNYKGFYSMVLLAICDA 2083
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV-RISSEEVELREYIVGGVGY 275
+ F + G G N + +L S F ++ G N+ +IS + ++ +VG +
Sbjct: 2084 KYCFTMVDIGAYGRDNDAAILNASTFGRVFNKGYF---NLPKISEFDPKVSPVLVGDDIF 2140
Query: 276 PLLSWLITPYETNGLS 291
L WL+ PY L+
Sbjct: 2141 ALKPWLMKPYPGKNLT 2156
>gi|302672390|ref|XP_003025885.1| hypothetical protein SCHCODRAFT_62806 [Schizophyllum commune H4-8]
gi|300099560|gb|EFI90982.1| hypothetical protein SCHCODRAFT_62806 [Schizophyllum commune H4-8]
Length = 337
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/240 (26%), Positives = 104/240 (43%), Gaps = 39/240 (16%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA---SG 117
E F+ + R+S FD + +++ +D P N + V Q+AIAL R +
Sbjct: 122 EIFRSYLRISPACFDALLAVIEDD-----PVFHNNSNNSQMPVAHQLAIALYRFGHHGNA 176
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEAL--EERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
S + V + FGVG T++ VT R I+A+ E L+WP E+ K E +
Sbjct: 177 ASGLKVALLFGVGYGTINLVTSRVIKAVCGERFRTAALQWPGDAEKEKAKDWVEN----V 232
Query: 176 NCCG------AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+C +D T I + + + D+++NYS+ VQ + + ID G PG
Sbjct: 233 SCPAWRDGWLMVDGTLIPLFRRPGHFGNTFYDRKSNYSLNVQLVSTPDCMIIDYAIGLPG 292
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSE-EVELR--EYIVGGVGYPLLSWLITPYE 286
+ + + RI+++ EV LR E++ YPL W +PY+
Sbjct: 293 SQHDATAWE----------------QTRIANQHEVLLREEEWVWADTAYPLRKWCQSPYK 336
>gi|293332785|ref|NP_001170134.1| uncharacterized protein LOC100384061 [Zea mays]
gi|224033751|gb|ACN35951.1| unknown [Zea mays]
Length = 412
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/264 (21%), Positives = 110/264 (41%), Gaps = 20/264 (7%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEE 163
++V A+R LA G ++ +G+ST Q RF AL + +L+ P+ +
Sbjct: 115 QKVCAAMRTLAYGLPTDAIDEYIQIGESTARQCLIRFCRALIAYFSVWYLRTPNQADIAR 174
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
I + ES G G+ID H ++ + + +M+++ + +++
Sbjct: 175 IM-QHSESRGFPGMLGSIDCMH--------WECRNYRGRNSRPTMILEAVASYDLWIWHA 225
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP----LLS 279
G PG N +L S F +GQ + ++ Y+ + YP +
Sbjct: 226 FFGMPGTNNDINVLHRSPVFDPMTSGQMPPVHYTVNGHAYNFGYYLADDI-YPNWPTFVK 284
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
+ P+E + F + E+ R +AF L+ W +L + +R D+ +L II
Sbjct: 285 AIRHPWEEKKIY-----FTQMQESHRKDIERAFSVLQARWAVLRGLAYRRDRNRLTEIIT 339
Query: 340 VCCLLHNIIIDSGDQLHPDVALSD 363
C ++HN+II+ + + D
Sbjct: 340 ACIIMHNMIIEDEGEFAANTDFGD 363
>gi|115629193|ref|XP_001201331.1| PREDICTED: uncharacterized protein LOC764811 [Strongylocentrotus
purpuratus]
Length = 342
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/268 (22%), Positives = 120/268 (44%), Gaps = 11/268 (4%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNR-MEE 163
++++ LR L++G + + F VG+ T+ + A+ +E A + P +N E
Sbjct: 54 KLSVFLRHLSTGATYAELSYNFRVGKETIQKFVPDVARAIVDEYAAEVISLPTTNEGWLE 113
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ FE + L +C GA D HI + P ++ + + + +S+++ +VD + +F+ I
Sbjct: 114 VAGDFEARWNLPHCLGAYDGKHIRLQKPN-KSGSLYFNYKQFFSVVLMALVDSKYQFLWI 172
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN---VRISSEEVELREYIVGGVGYPLLSW 280
G G + +++ S + EAG + + EE + + VG + + ++
Sbjct: 173 DVGGVGHQSDAQIYNNSELKECIEAGTLGIPDPAPLPHDDEEHPMPYFFVGDDAFAMRTY 232
Query: 281 LITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR-ILSKVMWRPDKRKLPSIIL 339
++ PY + FN R + AF L ++ L ++ PD +L +I
Sbjct: 233 MMKPYGRRNMDQQQKIFNYRLSRARRVVENAFGILALRFQCFLGQMRQEPDTVRL--LIE 290
Query: 340 VCCLLHNIIIDSGDQLHPDVALSDHHDS 367
+LHN+I L DV + D D+
Sbjct: 291 AAVMLHNLIRKRYQAL--DVRMLDQEDA 316
>gi|403177908|ref|XP_003336349.2| hypothetical protein PGTG_17957 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375173257|gb|EFP91930.2| hypothetical protein PGTG_17957 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 340
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 69/142 (48%), Gaps = 6/142 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG V + TV +VT R IEAL + +++WPD
Sbjct: 162 LPIPHQLALTLERLGSNGNGASVGQFSRNLQVARGTVVKVTRRVIEALISMGRVYVQWPD 221
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+R EI S+ G C G +D T I I P + + D++ Y M Q + D
Sbjct: 222 KDRRAEI-SEVMRMEGFSGCVGFVDGTTIPIFQRPGFD-GETFFDRKKRYFMNAQIVCDC 279
Query: 217 EMRFIDIVTGWPGGMNVSRLLK 238
+ ++GWPG S++ +
Sbjct: 280 DRFITSFISGWPGSCGDSKVYQ 301
>gi|356554812|ref|XP_003545736.1| PREDICTED: uncharacterized protein LOC100806549 [Glycine max]
Length = 418
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 76/305 (24%), Positives = 136/305 (44%), Gaps = 40/305 (13%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RVSKK F +C +++E G + ++ + + +++ VA+ L LA V +
Sbjct: 82 LRVSKKAFFKLCRILQE-------KGQL-VKTKNVPIDEAVAMFLHILAHNLKYRVVHFS 133
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ T+S+ + A+ + +K +LK+ + N +++K+ N GA+D HI
Sbjct: 134 YCRSMETISRQFKNVLGAIMKVSKEYLKFHEYNLEGSVENKWR---WFKNSIGALDGIHI 190
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+T+ A + + +++ + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 191 PVTVSA-EDRPRYRNRKGDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLR-------- 241
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYP-----LLSWLITPYETNGLSASMPT----- 296
+A R N + +Y + GY L + T Y N + P
Sbjct: 242 DALHRQN------CLHIPNGKYFLVDAGYTNGPGFLAPYRGTRYHLNEWIRNTPQNYKEL 295
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLH 356
FN H + R++ ++F LK W IL + K ++ II C +LHN I D+ H
Sbjct: 296 FNLRHASARNVIERSFGVLKKRWSILRTPSFFDIKTQI-RIINACFMLHNFI---RDEQH 351
Query: 357 PDVAL 361
D L
Sbjct: 352 SDPIL 356
>gi|308080876|ref|NP_001183817.1| uncharacterized protein LOC100502410 [Zea mays]
gi|238014710|gb|ACR38390.1| unknown [Zea mays]
Length = 341
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/265 (21%), Positives = 106/265 (40%), Gaps = 12/265 (4%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEE 163
++V A+R LA G +V +G+ST + F A+ + +L+ P + +
Sbjct: 36 QKVCAAMRILAYGLPSDAVDEYIQIGESTARECLHHFCRAIIACFSAWYLRTPTQDDITR 95
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
I ES G G+ID H +C + SM+++ + +++
Sbjct: 96 IMHN-SESRGFPGMLGSIDCMHWEWRNCPTAWRGQFCGRNGRASMILEAVASYDLWIWHA 154
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP----LLS 279
G PG N +L S F G+ N ++ Y+ G+ YP +
Sbjct: 155 FFGMPGTNNDVNVLHRSPVFDPITTGRMSPVNYTVNGHAYNFGYYLADGI-YPNWPTFVK 213
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
+ PYE + F + E+ R +AF L+ W +L + D+ +L I+
Sbjct: 214 AIRHPYEEKKV-----YFTQMQESCRKDIERAFGVLQARWAVLRGPAYGWDRNRLTEIMT 268
Query: 340 VCCLLHNIIIDSGDQLHPDVALSDH 364
C ++HN+I++ ++ D+
Sbjct: 269 ACIIMHNMIVEDEGPFAANIDFGDN 293
>gi|223997854|ref|XP_002288600.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975708|gb|EED94036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 499
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 132/311 (42%), Gaps = 42/311 (13%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLV---SRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
FK +R+S++ I +++D V R PS + N +L+ + + A SG +
Sbjct: 115 FKQKYRMSREALLVIAERIKDDDVFKNKRGPSQM-NPTHQLMVLLDYLGTA----GSGAN 169
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
F VG +V+ R +A + K WPD N + I + ++ F L NC
Sbjct: 170 NPKQRAYFHVGNGSVNNSRKRARDAVIHSLGKDFYHWPDENERKNISNCYKMEFNLPNCV 229
Query: 179 GAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G +D T + P + + D+ ++ +S+ + D + R +TG+PG + +R+L
Sbjct: 230 GVMDGTLFPLAFQPETEDAADYHGRKFQWSLTCLVVSDQKRRIRWYITGYPGSAHDNRML 289
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEE-VELREYIVGGVGYPLLSWLITPYETNGLSASM-- 294
+ S +++ EE + +YI+G + ++ Y+ N A
Sbjct: 290 RRSP--------------LKVRKEEYFTVYQYIIGDTAFDPSENVVPAYKANPNKAEPDD 335
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWR--------ILSKVMWRPDKRKLPSI---ILVCCL 343
P L++ V + + G W+ I ++V DK+ L + I +
Sbjct: 336 PDERLLNKVISKPRVSS-EHVNGMWKGRFPWLRLIPNRVR---DKKSLTEVMKYIHCTVI 391
Query: 344 LHNIIIDSGDQ 354
LHN +I+ GDQ
Sbjct: 392 LHNFLIEFGDQ 402
>gi|357622688|gb|EHJ74112.1| hypothetical protein KGM_12729 [Danaus plexippus]
Length = 342
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 129/288 (44%), Gaps = 18/288 (6%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGL-INIEGRLLSVEKQVAIALRRLASGESQVSVGVAF 127
+S FD++ S V E L+++ + L + I ++ ++A+ LR LA+G+S S+ F
Sbjct: 1 MSPNDFDFLLSKV-EPLITKQKTRLRVPIPAKV-----RLALTLRFLATGDSYRSLHHLF 54
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ + ++ + A+ K +K P + I+S F + +C G ID H+
Sbjct: 55 KISSAAITFIIQEVCTAINTVLKDQIKMPRTTTEWLNIESGFSRKYP--HCVGCIDGKHV 112
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
++ P ++ + + YS ++ +VD FI G ++ + + S ++
Sbjct: 113 VIQCPINSGTEKY---KGTYSFVLLALVDSNYCFIFADIGAQDRISDGGIFQNSVLWEKI 169
Query: 247 EAGQ-RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP-TFNSLHEAT 304
G L + +S + + +G + L ++ P+ N + S+ TFN +
Sbjct: 170 STGTINLPPDSPLSDGQCNMPHVFLGDGAFALSKHVMIPFPGNHVMGSLQRTFNMRLSSA 229
Query: 305 RSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLLHNIIIDS 351
R + F L +RI K M + DK KL I + C LLHN + +S
Sbjct: 230 RVVVENVFGLLTTVFRIFKKPMEIKKDKAKL--ITMTCILLHNFLRNS 275
>gi|357168015|ref|XP_003581441.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 286
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 123/293 (41%), Gaps = 49/293 (16%)
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM 161
+VE+QVA+ L + + + + F TVS+ + + A+ E L+ +R
Sbjct: 26 TVEEQVAMFLTTVGHHKKNIDISFHFTRSGETVSRYFNKVLFAIGELGPEMLR----HRS 81
Query: 162 EEIKSK------FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
+I SK F+ F C GAID THI +PA + D + ++ + V VD
Sbjct: 82 LDIPSKIQGNRRFDPYFK--ECIGAIDGTHIPCNVPA-RIVDRFRGRKPFRTQNVLAAVD 138
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
++ F + GW G + S +L+ S + NG +R+ +Y + GY
Sbjct: 139 FDLLFTYVSAGWEGSTHDSTVLRHS--------LEHPNG-LRVPEG-----KYYLADAGY 184
Query: 276 PLLSWLITPYETN---------GLSASMPT--FNSLHEATRSLAVKAFLQLKGGWRILSK 324
+ P+ P FN H + R+ +AF LK +++L+
Sbjct: 185 AARRGFLPPFHQTRYHLREWRGNYKPRTPNELFNLRHSSLRTTVERAFGTLKNWFKVLTT 244
Query: 325 VMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV 377
+ P ++ +++ CC+LHN I+ +G PD + Y EQ C +
Sbjct: 245 RPYYPFPSQV-RVVIACCILHNWILLNGG---PDELV-------YSEQNCNDL 286
>gi|115767324|ref|XP_791763.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 250
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 89/189 (47%), Gaps = 15/189 (7%)
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHII 187
+ QS+VS+V AL ++++P + + ++ +F + GA+D TH+
Sbjct: 1 MSQSSVSRVVLDVSTALATLRPRYIQFPRTPQEVLRVQQRFFDYSQFPGVIGAVDCTHVP 60
Query: 188 MTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCE 247
+ P + + + +++N S+ VQ I D + R +++V WPGG + SR+ + S E
Sbjct: 61 IQNPGGEQAQRFINRKNTSSLNVQMICDCDGRILNVVARWPGGSHDSRIFRESAIKHQLE 120
Query: 248 AGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSL 307
A E + ++++G GY +++TP A +N+ H+ R +
Sbjct: 121 A-------------ERQDAKWLLGDSGYGCQPYVMTPLLQPANPAEQ-RYNTAHKRGRCI 166
Query: 308 AVKAFLQLK 316
+ F Q+K
Sbjct: 167 IERTFGQMK 175
>gi|390593960|gb|EIN03395.1| hypothetical protein PUNSTDRAFT_30660, partial [Punctularia
strigosozonata HHB-11173 SS5]
Length = 290
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 107/241 (44%), Gaps = 34/241 (14%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIAL---RRL 114
D + F+ F +S FD I + D P+ N + V++Q+AIAL R
Sbjct: 71 DFPDIFRSFTGISPACFDNILCCIGTD-----PAFHNNSNNPQMPVDQQLAIALYRFRHF 125
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEAL--EERAKH-HLKWPDSNRMEEIKSKFEES 171
+ S + V + GV TV VT R + A+ + +H + +PD+ E+ KS E
Sbjct: 126 GNAVSVLKVALWAGVSVGTVIGVTKRVMAAVCGDSGLRHGAIAFPDAEAKEKAKSWVET- 184
Query: 172 FGLLNCCG------AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
++C G +D T I + + W D+++NYSM +Q + +++ ID
Sbjct: 185 ---MSCSGWRDGWLMVDGTLIPLYERPAFFGNTWYDRKSNYSMNLQLVSTPDLQIIDYSV 241
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
G PG + S + +K E + LNG+ E++ G YPL +W PY
Sbjct: 242 GLPGSQHDSTAWAETRIYK--EHEKLLNGD-----------EWVWGDSAYPLETWCQAPY 288
Query: 286 E 286
+
Sbjct: 289 K 289
>gi|193592115|ref|XP_001950353.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 375
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 110/260 (42%), Gaps = 27/260 (10%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAF--GVGQSTVSQVTWRFIEALEERA-KHHLKWPD 157
+ VE QV A+ A G Q + G F + Q +VS+ + +R + + +P
Sbjct: 73 IPVEIQVLAAVYVYAKGSYQRATGDHFDLNISQPSVSRCLHSVTNVINDRLLRQWVTFPM 132
Query: 158 SNR-----MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
+N EE K+ + G + GAID T+I + P + + + N+S+ VQ
Sbjct: 133 TNADRNKAREEFKNAPQPFEGAI---GAIDCTYINILAPK-DHEEAYVNHHGNHSLNVQA 188
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
IV +++ ++I +PG N S + S + E + E +++G
Sbjct: 189 IVSPKLKILNINPRYPGARNDSYVWSTSPIRRAME----------YHYNQGERHTWLIGD 238
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK---VMWRP 329
GYPL WL+TP + H R++ + F K WR LS +M+ P
Sbjct: 239 SGYPLEPWLMTPLLHYRERTRQFKYTLKHCKARNVVERFFGVFKSVWRCLSYQRVLMYAP 298
Query: 330 DKRKLPSIILVCCLLHNIII 349
I+ C +LHN+ I
Sbjct: 299 VMAG--KIVNACAVLHNMRI 316
>gi|149537657|ref|XP_001519824.1| PREDICTED: putative nuclease HARBI1-like, partial [Ornithorhynchus
anatinus]
Length = 223
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 86/189 (45%), Gaps = 8/189 (4%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+ S++ Y+ L+ L RP R +S E Q+ AL SG Q +G A
Sbjct: 39 YGFSREFIRYLVDLLGASLC-RPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGDA 92
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATH 185
G+ Q+++S+ EAL ERA +++P D ++ +K F G+ G +D TH
Sbjct: 93 VGISQASMSRCVANVTEALVERASRFIRFPEDEMSVQGLKGDFYGLAGMPGVVGVLDCTH 152
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
+ + P + + +++ +S+ + D + + T WPG + S +L+ S
Sbjct: 153 VTIKAPNAEDL-SYVNRKGLHSLNCLMVCDIRGTLLHVETHWPGSLRDSAVLQRSALAAQ 211
Query: 246 CEAGQRLNG 254
EAG +G
Sbjct: 212 LEAGMPKDG 220
>gi|77549235|gb|ABA92032.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 743
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 101/239 (42%), Gaps = 34/239 (14%)
Query: 131 QSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES-FGLLN-CCGAIDATHIIM 188
STVS R + + LK D N + + ++S FG GAID THI +
Sbjct: 470 NSTVSIYFRRVLSRMVMLGSKILKPIDLNFTDIPRRLLQDSRFGPFQFAVGAIDGTHIPV 529
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEA 248
T+ V ++ + ++ + + V IV + R I GWPG ++ +R+L
Sbjct: 530 TV-GVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWPGSVHDNRVL----------- 577
Query: 249 GQRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPY--------ETNGLSASMP-- 295
N + EE + R+Y++ GYP + PY + G A P
Sbjct: 578 ----NEAIDSYPEEFPRLPFRKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPRAPPPEG 633
Query: 296 ---TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN +H R++ + F +K W+IL + + P K +IIL LHN IDS
Sbjct: 634 REEKFNYIHAKLRNIIERQFGIVKKQWKILKGIPYNPYKNVQSNIILAAFCLHNFRIDS 692
>gi|57900671|gb|AAW57796.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 748
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 120/287 (41%), Gaps = 49/287 (17%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN- 159
+S + +A+ L ES F T+S+ + +L AK +++ D N
Sbjct: 436 MSTHEMLAMFLFTCGGNESNRRAQNRFKHSGETISRKFDEVLNSLMAMAKDYIRPKDPNF 495
Query: 160 -----RMEEIKSKFEESFGLLNCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGI 213
R+ + + F G C GA+D THI ++L P Q + + + V +
Sbjct: 496 RSVHKRIRDDRRAFPHFKG---CIGALDGTHIRVSLSPDEQVR--YIGKTGIPTQNVLAV 550
Query: 214 VDHEMRFIDIVTGWPGGMNVSRLLKFS-----GFFKLCEAGQRLNGNVRISSEEVELREY 268
D +MRF + TG PG M+ + +L + FF G+ Y
Sbjct: 551 CDFDMRFTYVATGQPGAMHDTSVLYNALRVDEKFFPHPPQGK-----------------Y 593
Query: 269 IVGGVGYPLLSWLITPYETN---------GLSASMPT--FNSLHEATRSLAVKAFLQLKG 317
V G+P + PY+ G+ + P FN +H + R++ ++F LK
Sbjct: 594 YVVDAGHPNRPGYLAPYKGERYHLPEWHRGMEPNTPMEKFNRVHSSIRNVIERSFGLLKM 653
Query: 318 GWRILSKVMWRP-DKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
W+IL K+ P K+K+ I++ +LHN I + G + PD A D
Sbjct: 654 KWQILYKMPCYPIYKQKM--IVVAAMVLHNYIREHGGE-DPDFARFD 697
>gi|38345857|emb|CAD41055.2| OSJNBa0084K11.21 [Oryza sativa Japonica Group]
gi|38345926|emb|CAE01919.2| OSJNBb0078D11.1 [Oryza sativa Japonica Group]
Length = 377
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/320 (22%), Positives = 141/320 (44%), Gaps = 35/320 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+ + +F C L R+ GL+ + + +E+QVA+ L + V
Sbjct: 50 LRLRRASFFRFCKLFRD-------RGLLE-DTIHMCIEEQVAMFLHTVGHNLRNRLVRTN 101
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
F + TVS+ + + A+ E ++ P + +I +C GAID THI
Sbjct: 102 FDRSRETVSRYFNKVLHAIGELRDELIRPPSLDTPTKIAGNPRWDPYFKDCIGAIDGTHI 161
Query: 187 IMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
++ V++S + ++++ + V VD ++RF ++ GW G + + +L+
Sbjct: 162 RASVRKNVESS--FRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTAHDAVVLR------- 212
Query: 246 CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE----------TNGLSASMP 295
+A +R NG +R+ +Y + GY + P+ N +
Sbjct: 213 -DALERENG-LRVPQGNRLQGKYYLVDAGYGAKQGFLPPFRAVRYHLNEWGNNPVQNEKE 270
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLLHNIIIDSG-D 353
FN H + R +AF LK +++L + P + ++ I++ CC++HN +++ G D
Sbjct: 271 LFNLRHSSLRVTVERAFGSLKRRFKVLDDATPFFPFQTQV-DIVVACCIIHNWVVNDGID 329
Query: 354 QL--HPDVALSDHHDSGYGE 371
+L PD + D +S G+
Sbjct: 330 ELIAPPDWSSEDIDESLTGQ 349
>gi|297721819|ref|NP_001173273.1| Os03g0157450 [Oryza sativa Japonica Group]
gi|255674219|dbj|BAH92001.1| Os03g0157450 [Oryza sativa Japonica Group]
Length = 611
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 89/173 (51%), Gaps = 10/173 (5%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+S+ F +C+ +R GL+ ++ +SVE+QVA+ L+++ S VG +
Sbjct: 74 LRMSRAVFYKLCARLRN-------KGLL-VDTFHVSVEEQVAMFLKKVGQHHSVSCVGFS 125
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL-NCCGAIDATH 185
F TVS+ + A+ E A+ + +N +I SK + + +C GA+D TH
Sbjct: 126 FWRSGETVSRYFRIVLRAMCEIARELIYIRSTNTHSKITSKKNKFYPYFKDCIGALDGTH 185
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
I ++PA + D + ++ + V VD ++RFI I+ GW G + S +L+
Sbjct: 186 IRASVPAKKV-DRFRGRKPYPTQNVLAAVDFDLRFIYILAGWEGSAHDSLVLQ 237
>gi|449691647|ref|XP_004212750.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 182
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 81/172 (47%), Gaps = 17/172 (9%)
Query: 199 WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRI 258
+ D+ +N+S+ VQ + D++ +FID+V WPG + +R+L+ S K G
Sbjct: 9 YVDRSSNHSINVQAVCDNKGKFIDVVAKWPGSTHDARILRESKLGKKFIDGT-------- 60
Query: 259 SSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGG 318
+ ++G GYP WL+TPY N +AS +N R + + F + K
Sbjct: 61 ------FKGLLIGDSGYPCFRWLLTPY-LNPTTASQHRYNISLRKIRVIIEQVFGRGKRR 113
Query: 319 WRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVAL-SDHHDSGY 369
+L + R K +++ C +LHN+ I D + + +D +D+ Y
Sbjct: 114 LYLLHGEI-RMTPEKTCTLVAACAVLHNLAIQLDDGFMDENPIENDENDNEY 164
>gi|115474393|ref|NP_001060793.1| Os08g0106900 [Oryza sativa Japonica Group]
gi|113622762|dbj|BAF22707.1| Os08g0106900 [Oryza sativa Japonica Group]
Length = 629
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 72/301 (23%), Positives = 127/301 (42%), Gaps = 36/301 (11%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+ K +F C L R+ GL+ + + VE+QVA+ L + V
Sbjct: 50 LRLRKDSFFRFCKLFRD-------RGLLE-DTIHMRVEEQVAMFLNTVGHNLRNRLVRTN 101
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
F TVS+ + + A+ E ++ P + +I +C GAID THI
Sbjct: 102 FDRSGETVSRYFNKVLHAIGELRDELIRPPSLDTPNKIAGNPRWDPYFKDCIGAIDGTHI 161
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
++ S + ++++ + V VD ++RF ++ GW G + + +L+
Sbjct: 162 RASVRKNMESS-FRGRKSHATQNVMAAVDFDLRFTYVLVGWEGTAHDAVVLR-------- 212
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE----------TNGLSASMPT 296
+A +R NG +RI + Y + GY + P+ N +
Sbjct: 213 DALERENG-LRIPQGK-----YYLVDAGYGAKQGFLPPFRAVRYHLNEWGNNPVQNEKEL 266
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLLHNIIIDSG-DQ 354
FN H + R +AF LK ++IL + P ++ I+ CC++HN +I+ G D+
Sbjct: 267 FNLRHSSLRITVERAFGSLKRRFKILDDATPFFPFPTQV-DIVAACCIIHNWVINDGIDE 325
Query: 355 L 355
L
Sbjct: 326 L 326
>gi|432950744|ref|XP_004084590.1| PREDICTED: uncharacterized protein LOC101165370 [Oryzias latipes]
Length = 676
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 121/297 (40%), Gaps = 66/297 (22%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S ++ YI +L++ + + I+ GR L+ E+ + ALR A+G ++G A
Sbjct: 418 YRFSLQSITYIHNLIQPYITN------ISHRGRALTSEQILCAALRFFANGSFLYNIGDA 471
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ ++TV + + AL+ + + +P + IK +F ++ C DA HI
Sbjct: 472 EHISKATVCRAFRKMCLALKRFLRIFIVFPVHKPLRAIKEEFHR---IIIC----DAAHI 524
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
T+D+ WPG ++ SR+ + S
Sbjct: 525 --------TNDE--------------------------AKWPGSVHDSRIFRESALSNRL 550
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
E G+ ++G + G YP L+TPY T FN H TR+
Sbjct: 551 ECGE-IDG--------------FLQGDSYPCQHKLLTPY-TEPEQGPQQRFNLAHSRTRA 594
Query: 307 LAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
LK ++ L + P++ II+ C +LHNI I G+Q HP + + D
Sbjct: 595 RVEMTIGLLKARFQCLRHLKVTPER--ACDIIVACVVLHNIAIFRGEQ-HPALHIQD 648
>gi|449680967|ref|XP_002158505.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 285
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 64/264 (24%), Positives = 113/264 (42%), Gaps = 31/264 (11%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQV-------TWRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV ++ W+ ++ + RA + D
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCRIFKETCCAIWKVLKKVFLRAPN-----D 86
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++++ C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 87 VKEWQNIIKEFDQNWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGV 273
F + G G N + +L S F G+ N + + ++ +V
Sbjct: 146 YCFTMVDIGAYGRDNTA-ILNASTF------GRAFNKGYFNLPKFFEFDPKVSPVLVKDD 198
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
+ L WL+ Y L+ FN R +F L WRI P K K
Sbjct: 199 IFALKPWLMKSYPGKNLTVQQRVFNYHLSRARRTIENSFGILAARWRIYRS----PIKAK 254
Query: 334 ---LPSIILVCCLLHNIIIDSGDQ 354
+ II LHN I+ + ++
Sbjct: 255 PLIVEHIIKATVCLHNYILFAFNR 278
>gi|6979320|gb|AAF34413.1|AF172282_2 hypothetical protein [Oryza sativa]
Length = 809
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 101/238 (42%), Gaps = 34/238 (14%)
Query: 132 STVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES-FGLLN-CCGAIDATHIIMT 189
STVS R + + LK D N + + ++S FG GAID THI +T
Sbjct: 537 STVSIYFRRVLSRMVMLGSKILKPIDLNFTDIPRRLLQDSRFGPFQFAVGAIDGTHIPVT 596
Query: 190 LPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
+ V ++ + ++ + + V IV + R I GWPG ++ +R+L
Sbjct: 597 V-GVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWPGSVHDNRVL------------ 643
Query: 250 QRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPY--------ETNGLSASMP--- 295
N + EE + R+Y++ GYP + PY + G A P
Sbjct: 644 ---NEAIDSYPEEFPRLPFRKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPRAPPPEGR 700
Query: 296 --TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN +H R++ + F +K W+IL + + P K +IIL LHN IDS
Sbjct: 701 EEKFNYIHAKLRNIIERQFGIVKKQWKILKGIPYNPYKNVQSNIILAAFCLHNFRIDS 758
>gi|344280766|ref|XP_003412153.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Loxodonta africana]
Length = 305
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 79/175 (45%), Gaps = 8/175 (4%)
Query: 76 YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVS 135
Y+ L+ L SRP R +S E Q+ AL SG Q +G A G+ Q+++S
Sbjct: 48 YLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMS 101
Query: 136 QVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
+ EAL ERA + +P D M+ +K F G+ G +D H+ + P +
Sbjct: 102 RCVANVTEALVERASQFIHFPADETSMQALKDGFYGLAGMPGVIGVVDCIHVAIKAPNAE 161
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
+ +++ +S+ + D + + T WPG + S +L+ S EAG
Sbjct: 162 DL-SYVNRKGLHSLNCLMVCDLRGALMTVETNWPGSLQDSTVLQQSSLCSQFEAG 215
>gi|301604055|ref|XP_002931683.1| PREDICTED: hypothetical protein LOC100493199 [Xenopus (Silurana)
tropicalis]
Length = 617
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 67/131 (51%), Gaps = 15/131 (11%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GAID TH+ +T P + + +++ ++S+ VQ + D +R I + +G+PG ++ +
Sbjct: 204 NCLGAIDCTHVPLTPPRAH-QERYLNRKRSHSINVQVVCDSHLRIISVRSGFPGSVHDAH 262
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+L+ S ++ G+ G ++VG GY +L WL+TP + +
Sbjct: 263 ILRQSALYERFTQGEMPQG-------------WLVGDAGYGVLPWLMTPV-CFPRTPAQR 308
Query: 296 TFNSLHEATRS 306
+N H TR+
Sbjct: 309 RYNRAHRKTRN 319
>gi|353731061|ref|NP_001090585.2| uncharacterized protein LOC100036828 [Xenopus laevis]
Length = 393
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 124/290 (42%), Gaps = 17/290 (5%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F+ FFR+S +FD + ++++ L R + +++ +S E+++ + LR LA+G+S S
Sbjct: 62 FQRFFRMSISSFDELLTVLKPGLC-RAHTLMMDP----ISPEERLCLTLRFLATGQSFSS 116
Query: 123 VGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
+ F +G++T+ ++ + E + + PD N I F ++ NC GA+
Sbjct: 117 LYFRFPIGRTTIGKIVRETCLLIWSELQRLVMPTPDENAWIHIAEDFYKTTNFPNCLGAM 176
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM-NVSRLLKFS 240
I + +P S+ + S+++ +VD F I G G + S +
Sbjct: 177 GGKRIQIKMPFKSGSEKYS------SVVLLAVVDANYCFSIIDVGAYGSTGDASAFWSSA 230
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYI-VGGVGYPLLSWLITPYETNGLSASMPTFNS 299
+L E L + Y+ VG + L ++ PY + + FN
Sbjct: 231 MGHQLSEGALHLPLPKPLPGTAAPSMPYVFVGDEAFGLTENIMRPYPGSQMDIQKRLFNY 290
Query: 300 LHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNII 348
R + AF WRI + + PD + II CC+LHN +
Sbjct: 291 RLLRARRMVECAFGIFSNKWRIFHNAIQLEPDFVDI--IIKACCVLHNFV 338
>gi|302675983|ref|XP_003027675.1| hypothetical protein SCHCODRAFT_70743 [Schizophyllum commune H4-8]
gi|300101362|gb|EFI92772.1| hypothetical protein SCHCODRAFT_70743 [Schizophyllum commune H4-8]
Length = 495
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 63/240 (26%), Positives = 104/240 (43%), Gaps = 39/240 (16%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA---SG 117
E F+ + R+S FD + +++ +D P N + V Q+AIAL R +
Sbjct: 122 EIFRSYLRISPACFDALLAVIEDD-----PVFHNNSNNSQMPVAHQLAIALYRFGHHGNA 176
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEAL--EERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
S + V + FGVG T++ VT R I+A+ E L+WP E+ K E +
Sbjct: 177 ASGLKVALLFGVGYGTINLVTSRVIKAVCGERFRTAALQWPGDAEKEKAKDWVEN----V 232
Query: 176 NCCG------AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+C +D T I + + + D+++NYS+ VQ + + ID G PG
Sbjct: 233 SCPAWRDGWLMVDGTLIPLFRRPGHFGNTFYDRKSNYSLNVQLVSTPDCMIIDYAIGLPG 292
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSE-EVELR--EYIVGGVGYPLLSWLITPYE 286
+ + + RI+++ EV LR E++ YPL W +PY+
Sbjct: 293 SQHDATAWE----------------QTRIANQHEVLLREEEWVWADTAYPLRKWCQSPYK 336
>gi|449680023|ref|XP_004209474.1| PREDICTED: uncharacterized protein LOC101241378 [Hydra
magnipapillata]
Length = 200
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 88/193 (45%), Gaps = 8/193 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVA---FGVGQSTVSQVTWRFIEALEE-RAKHHLKWP 156
+SV K VA+AL LAS E V V FG+G+ST + + FI A+ + ++K+P
Sbjct: 5 ISVVKCVAVALHYLASCEEYRVVRVVSSLFGIGKSTANLIVHEFINAVNDILLPKYVKFP 64
Query: 157 DSNRMEEIKSK-FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
S S+ FE G C GA+D HI ++ P Q + + + YS+ + +VD
Sbjct: 65 LSVEYLNKHSRDFEAILGFPQCIGAVDGRHIPISAPKDQAISYY-NYKGWYSIALFAVVD 123
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
FI G G N S +L+ S + E+ + V L ++G +
Sbjct: 124 CRYCFIYTSVGSSGRNNDSYILQNSSLKAILESNLFDKCCKELGDSLVPLC--LIGDSAF 181
Query: 276 PLLSWLITPYETN 288
PL L+ PY N
Sbjct: 182 PLTRHLLKPYLEN 194
>gi|291230838|ref|XP_002735372.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 426
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 58/252 (23%), Positives = 105/252 (41%), Gaps = 20/252 (7%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVS----QVTWRFIEALEERAKHHLKWPDSNRM 161
++AI LR LA+G S ++ AF V +T+S +V I ++ PD R
Sbjct: 117 KLAITLRYLATGNSYKTLQYAFRVAHNTISLFIPEVCQTIISEYQDEVFSCPITPDEWR- 175
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
E+ + + + + CGA+D H+ + P + + + + YS+++ +VD +F+
Sbjct: 176 -EVARTYADRWNFHHVCGALDGKHVAIRNPP-GSGTIYYNYKGFYSLILLALVDGNYKFL 233
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE---YIVGGVGYPLL 278
G PG ++ S + E G G + R+ +++G +P
Sbjct: 234 WADVGNPGSSLDVQVFNHSPLRRGLENGTL--GLPDPEPLPDDDRDIPYFLIGDDAFPFR 291
Query: 279 SWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSII 338
+W+ PY + FN + +F L WR L K K ++I
Sbjct: 292 TWMQKPYSNREQTDEERIFNYRLSRACRVVENSFGILAHRWRCLLK--------KARTVI 343
Query: 339 LVCCLLHNIIID 350
+ C LHN++ D
Sbjct: 344 MACMCLHNLMRD 355
>gi|449674891|ref|XP_004208281.1| PREDICTED: uncharacterized protein LOC101238149, partial [Hydra
magnipapillata]
Length = 354
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 72/274 (26%), Positives = 110/274 (40%), Gaps = 28/274 (10%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS- 158
+S K+V +AL LAS E V FG+G++T + + FI A+ ++K+P S
Sbjct: 84 ISEVKRVTVALHYLASWEEYRVVSSLFGIGKTTANIIVHEFINAVNNILLPKYVKFPLSV 143
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ + FE G C A+D HI ++ P Q + NY +G+
Sbjct: 144 ENLNKHSRGFEAILGFPQCVEAVDGCHIPISAPKDQAISYY-----NY----KGV----- 189
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLL 278
G PG N S +L+ S + E + V L ++G +PL
Sbjct: 190 -------GSPGRNNDSYILQNSSLKAILELNLFDKCCKELGDSLVPL--CLIGDSAFPLT 240
Query: 279 SWLITPYETN-GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
L+ PY N LS FN + R + AF ++K + + K M D I
Sbjct: 241 HHLLKPYPENLELSEIQKNFNKILCGARRVVENAFGRVKARFCVTCKRM-ECDINFATRI 299
Query: 338 ILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGE 371
+ C LHN I + D + L HHD +
Sbjct: 300 VNACVTLHN-ICEYYDDIIIIERLMHHHDDNLAQ 332
>gi|301627131|ref|XP_002942732.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 352
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 90/202 (44%), Gaps = 22/202 (10%)
Query: 168 FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
F S C GAID TH+ +T P + + +++ ++S+ VQ + D +R + +
Sbjct: 102 FLGSGSFQQCLGAIDCTHVPLTPPRAH-QERYLNRKRSHSISVQVVCDSHLRIMSVXXXX 160
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET 287
+ C + LNG S +++G GY +L WL+TP
Sbjct: 161 XXXXXXXXXXXXXYW---CNRWELLNG-----SASTHNFSFLLGDAGYGVLPWLMTPVRF 212
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIILVCCL 343
A +N H TR++ + F LK +R LS +++ P K + II+VC +
Sbjct: 213 PRTPAQR-RYNRAHRKTRNVIERLFGVLKLRFRCLSVTGGALLYSPIK--VSEIIVVCAM 269
Query: 344 LHNIIIDSG------DQLHPDV 359
LHN+ +D G D L P++
Sbjct: 270 LHNVAMDRGLGADINDALEPEI 291
>gi|322795514|gb|EFZ18220.1| hypothetical protein SINV_08457 [Solenopsis invicta]
Length = 97
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 52/86 (60%), Gaps = 1/86 (1%)
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
+KWP + + KF G+ N GAID+THI + P + D+C+++ +S+ +Q
Sbjct: 5 IKWPQEHEVIITCEKFSLKRGIQNVLGAIDSTHIQIIKPT-SNAQDYCNRKKFFSINLQA 63
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLK 238
+VD +MRFI+I G PG ++ +R+ +
Sbjct: 64 VVDSDMRFINIYCGEPGSLHDARVFR 89
>gi|62734261|gb|AAX96370.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77549127|gb|ABA91924.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 676
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 101/237 (42%), Gaps = 55/237 (23%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + ++ N V + D +MRF ++ GWPG
Sbjct: 447 LDNCIGAIDGTHIQVVVPNSAAVQHRNRHKEKSQN----VMCVCDFDMRFTFVLAGWPGS 502
Query: 231 MNVSRLL-----KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
++ R+ +FS F G + Y+V GYP + PY
Sbjct: 503 VHDMRVFNDAQTRFSAKFPKPPPG----------------KFYLVDS-GYPNRPGYLAPY 545
Query: 286 ETNGLS--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
+ G++ +++P FN H + R++ ++F LK WRIL + +
Sbjct: 546 K--GITYHFQEYNESTLPRGRREHFNYCHSSCRNVIERSFGVLKNKWRILFSLPSY-SQE 602
Query: 333 KLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
K II C LH I DS Q+ + HD Y DP+G T+ + E
Sbjct: 603 KQSRIIHACIALHYFIRDS--QMADTEFDNCDHDENY--------DPLGGTSSPSSE 649
>gi|125534959|gb|EAY81507.1| hypothetical protein OsI_36676 [Oryza sativa Indica Group]
Length = 370
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 84/195 (43%), Gaps = 41/195 (21%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+ GAID +HI + + +T C + S V I D +MRFI V GWPG + SR
Sbjct: 143 DAIGAIDGSHISVVVLLDETISHTC-RHGYTSQNVLAICDFDMRFIFAVAGWPGSAHDSR 201
Query: 236 LLK--FSGF--FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
+L + F F + G+ Y + GYP + P++ G +
Sbjct: 202 ILSHALANFPSFPMPPTGK-----------------YYLVDSGYPNRIGYLAPFK--GTT 242
Query: 292 ASMP--------------TFNSLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKRKLPS 336
+P FN LH + R++ ++F LK WRIL + + P +K
Sbjct: 243 YHIPEFRHRSGPPQGKYEVFNFLHSSLRNVIERSFGVLKQKWRILKGIPSFSPATQK--H 300
Query: 337 IILVCCLLHNIIIDS 351
II+ C LHN + DS
Sbjct: 301 IIMACLALHNFVRDS 315
>gi|307195136|gb|EFN77141.1| hypothetical protein EAI_00052 [Harpegnathos saltator]
Length = 113
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 64/117 (54%), Gaps = 8/117 (6%)
Query: 103 VEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRM 161
VEKQVA+AL ++AS VG FG+ +STV + +R + A+ + +++L PD
Sbjct: 1 VEKQVAVALYKMASCAEYRVVGNVFGIHKSTVKKCLFRVVNAINDLMMRNYLHMPDLYEA 60
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY---SMLVQGIVD 215
EI FE+ +L G ID+THI P + S+ + D N+ S ++Q +VD
Sbjct: 61 SEIAINFEKISCILQIIGCIDSTHI----PTLAPSEGYRDFVNSKGWPSYILQAVVD 113
>gi|357620279|gb|EHJ72527.1| hypothetical protein KGM_19675 [Danaus plexippus]
Length = 272
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 90/210 (42%), Gaps = 15/210 (7%)
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
+IK++F+ + C GAID HI + PA S+ + + + S+++ VD + F
Sbjct: 25 DIKTRFQLKWNFPGCIGAIDGKHINLRAPACSGSE-FYNNKKTISIVLLACVDDDYSFTY 83
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCEAGQRLN-GNVRISSEEVELREYIVGGVGYPLLSWL 281
I G G + G F C + ++ G++ I +E V IV +PL S +
Sbjct: 84 IDIGAKGRHSDV------GVFSNCSLKRAIDDGSLNIPAESV-----IVADAAFPLQSNI 132
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
+ PY N L+ FN R + AF L +RI K + K + I
Sbjct: 133 MKPYPGNNLTTRQKIFNYRVSRARRIVENAFGILASRFRIFEKPIANNVKDTI-KITRTS 191
Query: 342 CLLHNIIIDSGDQLHPDVALSDHHDSGYGE 371
C LHN + D + + L+D+ D G
Sbjct: 192 CALHN-WLRKNDPNYLEPGLADYEDISTGR 220
>gi|449692247|ref|XP_004212957.1| PREDICTED: uncharacterized protein LOC101240156, partial [Hydra
magnipapillata]
Length = 230
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 82/164 (50%), Gaps = 18/164 (10%)
Query: 59 EEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
EE FK FRV++ TF+++ + L + R P +SV K+VA+AL
Sbjct: 76 EECQFKEHFRVNRNTFNFLVNELYPHLGKTTTTMREP----------ISVVKRVAVALHY 125
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSK-FEES 171
LAS E V FG+G+ST + + FI A+ + ++K+P S +SK FE
Sbjct: 126 LASREEYRVVSSLFGIGKSTANLIVHEFINAVYDILLPKYVKFPLSVENLNKRSKDFEAI 185
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
G C GA+D I ++ P QT + + + YS+++ +VD
Sbjct: 186 LGFPQCIGAVDGRLIPISAPRDQTI-SYYNYKGWYSIVLFAVVD 228
>gi|406697727|gb|EKD00981.1| hypothetical protein A1Q2_04722 [Trichosporon asahii var. asahii
CBS 8904]
Length = 441
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 64/268 (23%), Positives = 116/268 (43%), Gaps = 28/268 (10%)
Query: 106 QVAIALRRLASGESQVSVGVAFG-VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEI 164
Q+A+ L R+ SG + V G + + T+ + R + A+ + +KWPD R EI
Sbjct: 118 QLAVFLYRMGSGNTLADVQRTLGRLPRGTIVKYCMRSVVAISSKLAEVVKWPDPQRRREI 177
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
++ + + + C G ID +HI++ +V+ + + ++ Y +L+ + D + RF
Sbjct: 178 ETFLHDKYMIPGCVGFIDGSHILLHKSPSFSVEKNATFWSRKKRYGLLILAVCDEKKRFT 237
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
+ TG N R Q+ R ++ +YI+G G+ S +
Sbjct: 238 YLQTGHFASANDFR-------------AQQSTPMARDPAQLFSDGQYILGDSGFYCDSNV 284
Query: 282 ITPYETNGLSASMP----TFNSLHEATRSLAVK-AFLQLKGGW-RILSKVMWRPDKRKLP 335
+ Y + +P FN H AT + ++ AF LK W + + D+ L
Sbjct: 285 VPMYRRSSGQVDLPEERLEFNE-HVATARVKIEHAFGILKQRWLILNDLHLLLKDQDNLT 343
Query: 336 ---SIILVCCLLHNIIIDSGDQLHPDVA 360
+ I +LHN+ ID+ H D+A
Sbjct: 344 FAYAAIQAVVVLHNLYIDTSS-YHWDLA 370
>gi|301607557|ref|XP_002933374.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 346
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 120/274 (43%), Gaps = 12/274 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPD-S 158
+S E+++ LR LA+G S + + G+ + + + E K +LK+P+ S
Sbjct: 28 ISAEQRLIATLRFLATGRSFEDLKFSTGISAQALGHIIPETCNTIVETLKGEYLKFPETS 87
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ I +F + + NC G ID HI P + + + + +S+++ IV+
Sbjct: 88 EEWQVIAQQFNDYWNFPNCGGVIDGKHIRRN-PPPNSGSYFFNYKGFFSIVLLAIVNANY 146
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVE-LREYIVGGVGYPL 277
FI + G ++ +++ + F + ++ Q N+ + E E L V + L
Sbjct: 147 EFIMVDKN--GRLSDGGVIEQTHFNQKLKSKQL---NLPTNEETKEGLNFVFVADEAFGL 201
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPS 336
L+ P+ L+ FN R + AF L +RI + + P+K +
Sbjct: 202 HENLLKPFPQKVLTPERKIFNYRLSRARRVVENAFGILANRFRIFHTSINLCPEK--IDM 259
Query: 337 IILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYG 370
++L CC+LHN + + + V++ DH D G
Sbjct: 260 VVLSCCVLHNFLRKNNHVSYAPVSMVDHEDIQAG 293
>gi|449691603|ref|XP_004212734.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 170
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 74/144 (51%), Gaps = 16/144 (11%)
Query: 143 EALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCD 201
+ L +R ++K P D + + E + F G ID TH+ + P + + +
Sbjct: 7 DCLAKRVNEYIKHPTDPHVLNESRVNFYNLAESPRITGLIDGTHVCIQNPR-EHEYVYVN 65
Query: 202 QENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSE 261
+ +N+S+ VQ D+ +FIDIV+ WPG + +R+L+ S KLC+ + ++G
Sbjct: 66 RSSNHSINVQAGCDYNGKFIDIVSKWPGSTHKARILRES---KLCK--KMIHGT------ 114
Query: 262 EVELREYIVGGVGYPLLSWLITPY 285
L+ ++G GYP WL+TPY
Sbjct: 115 ---LKGILLGDSGYPCFCWLLTPY 135
>gi|307205232|gb|EFN83615.1| Putative nuclease HARBI1 [Harpegnathos saltator]
Length = 181
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 84/185 (45%), Gaps = 12/185 (6%)
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
S+ FG+G++T + R I +L + ++WP +I F + GL GAI
Sbjct: 4 SISDRFGIGKATCFRALRRVIFSLVQIVHQFIQWPTGRTAVKIMKDFYKISGLRGVIGAI 63
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D I + P + SDD+ ++ ++ +Q ++ F ++ G+P ++ +R+L+ S
Sbjct: 64 DGILIKINKPT-ENSDDYICRKRYPAIHLQVTCTQQLLFTSVLVGYPRSIHDARVLRNS- 121
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-LSASMPTFNSL 300
R++ + YIVG YP+ S L+ P++ NG L+ FN
Sbjct: 122 ---------RIHEYLAHPDIYFSDNSYIVGDAAYPIHSNLMVPFKNNGYLTRRELNFNCC 172
Query: 301 HEATR 305
+ R
Sbjct: 173 LSSAR 177
>gi|328714341|ref|XP_001946988.2| PREDICTED: hypothetical protein LOC100165998 [Acyrthosiphon pisum]
Length = 407
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 127/281 (45%), Gaps = 26/281 (9%)
Query: 137 VTWRFIE----ALEERAKHHLKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
VT R I+ AL + K+ + P++ + I + F + + NC GAID HI++ P
Sbjct: 114 VTLRCIQDVCKALNQELKNEIMLPNNESGWKNIAADFYDKWNFPNCLGAIDGKHIVIQCP 173
Query: 192 AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQR 251
A + ++ + + +S+++ +V+ + F + G G ++ +L+ + FFK E +
Sbjct: 174 A-NSGSEYYNYKGTFSIVLMAVVNSDYSFTYVDIGCQGRISDGGVLRNTSFFKKLETDEL 232
Query: 252 LNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE---TNGLSASMPTFNSLHEATRSLA 308
+ + + V L V + L ++ PY NG + FN R +
Sbjct: 233 IPQEGILPNTNVSLPYVFVTDEAFALSKNIMKPYSGVYENGNCKRI--FNYRLSRARRVV 290
Query: 309 VKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNIIID--SGDQLH--PDVALSD 363
F + +R+ ++ P+K + I + C LLHN + S + ++ P + ++
Sbjct: 291 ENVFGIMASVFRVFRGPILLEPEKVR--DITMTCVLLHNFLRKSRSSNSIYTPPGIFDTE 348
Query: 364 HHD------SGYGEQ-CCKQVDPMGRTTREN-LEKHLQHNQ 396
++D S EQ C K + P+ R R++ LE + +Q
Sbjct: 349 NNDGIVIPGSWRSEQNCTKSMLPLQRVPRKSGLEAKMVRDQ 389
>gi|449691762|ref|XP_004212788.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 229
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 88/180 (48%), Gaps = 9/180 (5%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG-V 125
R+ + +F Y+ +L++ DL +P S + + QV ALR A+G Q+ +G
Sbjct: 50 LRLLRASFFYLENLLKTDL--QPESN----QSFCVPPIYQVLCALRIYATGSFQIVIGDS 103
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDAT 184
+ Q T+S++ R +L +R ++K+P D + + E + F G I+ T
Sbjct: 104 TAALSQPTISRIIRRVSLSLAKRVNEYIKYPTDLHVLNEKRVNFYNVAEFPRITGLIEGT 163
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
H+ + P ++ +N+S+ VQ + D+ +FIDI W G + +R+L+ S K
Sbjct: 164 HVCIQKPRKHKYVS-VNRLSNHSINVQAVSDYNGKFIDIFANWAGSTHNARILRESKLSK 222
>gi|328704981|ref|XP_003242661.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 242
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 74/171 (43%), Gaps = 13/171 (7%)
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID THI + P V + + N+S+ VQ IVD +++ ++I +PG N S +
Sbjct: 25 GAIDCTHINILAPVVHEGA-YVNHHGNHSLNVQAIVDSDLKILNINARYPGARNDSYIWS 83
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S + E NG E R +++G GYPL WL+TP +
Sbjct: 84 TSAVRRGMEFHYN-NG---------ERRTWLIGDAGYPLEPWLMTPLPHYQEGTREFEYT 133
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
R++ + F K +M+ PDK I+ C LHN+ I
Sbjct: 134 KNLCKARNVVERFFGVFKRCLSYQRVLMYAPDKAG--RIVNACATLHNMRI 182
>gi|194765531|ref|XP_001964880.1| GF22751 [Drosophila ananassae]
gi|190617490|gb|EDV33014.1| GF22751 [Drosophila ananassae]
Length = 412
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 67/303 (22%), Positives = 133/303 (43%), Gaps = 11/303 (3%)
Query: 50 TPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAI 109
+P I + EE F+ F +S ++ + ++R L S + N++ +S E ++ +
Sbjct: 42 SPQKEIKFECEETFRVFHSMSSLSYRKLLHVLRRRL-ELQDSRMGNVQK--VSPEVRLQM 98
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRM-EEIKSK 167
LR LA+GES ++ F + + V + + ++ +R K L+ P S +I S
Sbjct: 99 TLRFLATGESFGTLTGVFHIPTADVRNIVAETLASIVKRLKRTFLQIPRSEEAWLQIASD 158
Query: 168 FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
F++ + +C GA+D H+ + + + + N S+++ +VD + RF+ I
Sbjct: 159 FQDLWNFPHCLGAVDGHHLAFR-GGTEADESYSNYRNFNSIILLALVDAQHRFLHIDATS 217
Query: 228 PGGMNVSRLLKFSGFFKLCEAG-QRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
GG + S F E+ + + + EL I+ G+ L WL+ PYE
Sbjct: 218 KGG--ATDAFNQSSLFDAMESNWLNIPPECCLLGLDEELPHVILADQGFTLQPWLMKPYE 275
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
+ + FN +A+ A + +R L + + ++ ++ +LHN
Sbjct: 276 APA-NLTRKMFNYRLNRAHRVAINALGIMNSKFRALQTEI-NLEVPQMEKLVTATGILHN 333
Query: 347 III 349
+I
Sbjct: 334 FLI 336
>gi|357153511|ref|XP_003576474.1| PREDICTED: uncharacterized protein LOC100832068 [Brachypodium
distachyon]
Length = 528
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 108/255 (42%), Gaps = 41/255 (16%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD--- 157
+S ++ +A+ L L + +S + V F ST+S+ + L+ A L D
Sbjct: 263 MSSKEALALFLWTLGAPQSNIQVANRFEHNPSTISRKFEEVLNCLDRMAGDQLAPIDPTF 322
Query: 158 SNRMEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
++ E++ K +F F + GAID THI + +PA + +++ S V + D
Sbjct: 323 THVHEKLRKPRFWPHFK--DAIGAIDGTHIPVIVPA-ELKVIHTNRKGYTSQNVMAMCDF 379
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+MRFI ++ GWPG ++ +R S QR
Sbjct: 380 DMRFIFVILGWPGSVHDTRSWVLSSI-------QR------------------------- 407
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
++ + +E TFN H + R++ + F LK WRIL ++ P K
Sbjct: 408 -ATYHVPEFEDAPPVGIQETFNHCHSSLRNVIERTFGVLKMKWRILLGILAYP-PLKQKK 465
Query: 337 IILVCCLLHNIIIDS 351
II+ C LH+ I DS
Sbjct: 466 IIIACMCLHDYIRDS 480
>gi|115683787|ref|XP_001179790.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 219
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 86/208 (41%), Gaps = 20/208 (9%)
Query: 161 MEEIKSKFEESF---GLLNCCGAIDATHIIMT-LPAVQTSDDWCDQENNYSMLVQGIVDH 216
MEE+ + F G G +D THI + P + +++ YS+ Q I D
Sbjct: 1 MEEVNRAQVDFFNISGFPQVIGVVDGTHIRLNGAPLGPGEHVYMNRKGYYSINTQIICDT 60
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+ I+I+ WPG + SR+ + N V + E+++ ++G GY
Sbjct: 61 NYKIINILARWPGSTHDSRIFQ--------------NSRVGQTFEDLQQHGLLLGDSGYA 106
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
L +L+TP N + + +N H TR + QLK +R L M +R
Sbjct: 107 LRPYLMTPV-LNPRTPAEQAYNRAHATTRVRIEQVNGQLKHKFRCLLNGMQMAPRRAC-K 164
Query: 337 IILVCCLLHNIIIDSGDQLHPDVALSDH 364
II C +LHN+ D D + H
Sbjct: 165 IITACAVLHNVAKDLKQPDEVDFGIEQH 192
>gi|195625940|gb|ACG34800.1| transposon protein Pong sub-class [Zea mays]
Length = 420
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 57/264 (21%), Positives = 104/264 (39%), Gaps = 12/264 (4%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEE 163
++V ALR LA G +V +G+ST + F + + +L+ P+ +
Sbjct: 115 QKVCAALRVLAYGLPSDAVDEYIQIGESTARECLHHFCRGIIAYFSGWYLRTPNEADITR 174
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
I ES G G+ID H +C + SM+++ + +++
Sbjct: 175 IMHH-SESRGFPGMLGSIDCMHWEWRNCPTAWRGHFCGRNGRASMILEAVATYDLWIWHA 233
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP----LLS 279
G PG N +L S F +G+ + ++ Y+ G+ YP +
Sbjct: 234 FFGMPGTNNDVNVLHRSPVFDPMTSGRMPPVHYTVNGNAYNFGYYLADGI-YPNWPTFVK 292
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
+ PYE + F + E+ R +AF L+ W +L + D+ +L II
Sbjct: 293 AIRHPYEQKKV-----YFTQMQESCRKDIERAFGVLQARWAVLRGPAYGWDRNRLTEIIT 347
Query: 340 VCCLLHNIIIDSGDQLHPDVALSD 363
C ++HN+I++ + D
Sbjct: 348 ACIIMHNMIVEDEGAFAANTDFGD 371
>gi|340383345|ref|XP_003390178.1| PREDICTED: hypothetical protein LOC100633099 [Amphimedon
queenslandica]
Length = 332
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 122/292 (41%), Gaps = 43/292 (14%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP--DS 158
+S E+++++ LR L +G+S ++ ++ +G STVS + +AL P S
Sbjct: 28 VSAEERLSVTLRYLVTGDSMQTISFSYRLGHSTVSYIIEETCQALWRALSVEFLQPPKSS 87
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ ++I F + + +C GA+D HI+M P + + + + +S+++ + +
Sbjct: 88 DEWKKISEGFADIWNFPHCIGAMDGKHILMQAPP-NVASQYFNYKGTHSIVLMAVCYYNY 146
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR--ISSEEVELREYIVGGVGYP 276
F+ + G + G++++G + IS + +L + +G +P
Sbjct: 147 CFLLLDIG--------------------DYGKKIDGGIFNVISGQSAKLPYFFIGDSAFP 186
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK-VMWRPDKRKLP 335
L++ ++ PY L + N R + AF L + I + V+ + K L
Sbjct: 187 LMNSMLKPYPGTYLPENKHISNYRLLRARRVIENAFGILASKFCIFRRPVVAKAHKVTL- 245
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTREN 387
I C L N + +S+ H G C P+G RE+
Sbjct: 246 -ITQAACALQNFL-----------TISEMHCPTSGRFYC----PVGYIDRED 281
>gi|328701170|ref|XP_003241510.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 175
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 83/184 (45%), Gaps = 17/184 (9%)
Query: 142 IEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCD 201
I ALE A + WP + EIK+ F + N GAID THI + P + + + +
Sbjct: 2 IVALEILAPAFITWPSEEKAGEIKNGFFSTCTFPNVLGAIDGTHINIHAP-IDHQEAYVN 60
Query: 202 QENNYSM-----LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV 256
++ ++S+ Q + D++ RFI G G ++ ++ F+L E L+
Sbjct: 61 RKGHHSIQLFTDFSQAVCDNKCRFIHCYAGNVGSVHDQQV------FRLSELKNYLDD-- 112
Query: 257 RISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLK 316
++ + +++G Y L L+ PY NG + FN H + ++F LK
Sbjct: 113 --ATIYFPINTHLIGDAAYTLHEHLLVPYHDNG-HLTQKKFNFYHSSASMAVERSFGFLK 169
Query: 317 GGWR 320
G +R
Sbjct: 170 GQFR 173
>gi|242085952|ref|XP_002443401.1| hypothetical protein SORBIDRAFT_08g018920 [Sorghum bicolor]
gi|241944094|gb|EES17239.1| hypothetical protein SORBIDRAFT_08g018920 [Sorghum bicolor]
Length = 329
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/293 (22%), Positives = 120/293 (40%), Gaps = 14/293 (4%)
Query: 103 VEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRM 161
+EK A+R L G + +G+ST + RF+ A+ E + +L++P+
Sbjct: 19 IEKPATRAIRMLTYGVPADATDEYVRIGESTALESLRRFVTAINEIFGEEYLRYPNEADT 78
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
+ + E+ G G+ID H + +E +++++ + +++
Sbjct: 79 ARLLAMGEQQ-GFPGMLGSIDCMHWAWKNCPYDKQGQYKGKEEKPTIVLEAVASNDLWIW 137
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
G PG N +L S F G+ ++ + + Y+ G+ YP + L
Sbjct: 138 HAFFGMPGSHNDINVLHRSPLFDNLAEGKAPEVKFSVNGHDYTMGYYLADGI-YPTWATL 196
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL--SKVMWRPDKRKLPSIIL 339
+ T + F EA R + +AF L+ + I+ S W D L I+
Sbjct: 197 VKSI-TKPMGNKRQYFAKAQEAARKMVERAFGVLQSRFAIVRGSARFW--DIETLTKIMR 253
Query: 340 VCCLLHNIII-DSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLEKH 391
C ++HN+I+ D G + P+ + D G G+ + D RT E +E H
Sbjct: 254 ACVIMHNMIVEDEGFIVDPN----ERFDYG-GDNVEAEHDKPTRTLEEYIEAH 301
>gi|328699615|ref|XP_003240987.1| PREDICTED: hypothetical protein LOC100571096 [Acyrthosiphon pisum]
Length = 375
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/242 (21%), Positives = 109/242 (45%), Gaps = 8/242 (3%)
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESF 172
LA+GES S+ F + S +S + + A+ R K+ + P + ++++ + F E +
Sbjct: 100 LATGESYRSLSFQFRISHSWISTIIKELLVAICNRLKNITMPEPTEHSLKKVSNDFYEMW 159
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
NCCGAID HI + P + + + ++ +S+++ + D +F+ + G G
Sbjct: 160 NFPNCCGAIDGKHIRIVCPD-SSGSLYFNFKSFFSVVLLALCDANYKFLVVDIGSYGKEG 218
Query: 233 VSRLLKFSGFFKLCEAGQ-RLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN--G 289
+ + S KL G+ + + + ++ + VG G+ L ++ PY N
Sbjct: 219 DAGIFPKSNLGKLISTGKFKFPDPECLPNTDIVVPHVFVGDEGFKLTETMMRPYPRNQSK 278
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKRKLPSIILVCCLLHNII 348
+ FN R F + +R+ + + P+ + ++I+ C++HN++
Sbjct: 279 TDQTKAIFNYRLSRARRTTENTFGIMCQNFRVFFTPINILPE--TVDNLIMASCIIHNLL 336
Query: 349 ID 350
D
Sbjct: 337 RD 338
>gi|393235223|gb|EJD42779.1| hypothetical protein AURDEDRAFT_67179 [Auricularia delicata
TFB-10046 SS5]
Length = 467
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/310 (24%), Positives = 130/310 (41%), Gaps = 45/310 (14%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA---SGES 119
F+ +FRV +TFD + + + ++ + + + I ++KQ+A+ L R+ +G S
Sbjct: 106 FRSYFRVDPETFDDLLTSIEDNNAFQNDTERVQI-----PLDKQLAVTLYRMGHYGNGAS 160
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEAL--EERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
+ + + G+G T+ VT R AL +E K + PD EE K++ E L+C
Sbjct: 161 LLKLMILSGLGIGTLQLVTRRVQYALSTDEFRKACVHLPDDEEKEEAKAEIER----LSC 216
Query: 178 CG------AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
+D T + W D+++NYS VQ I + + D G PG +
Sbjct: 217 AEWRNGHLLVDGTLAVFYSKPGHFGVSWYDRKSNYSTNVQIISTPDCKIRDFGVGLPGSV 276
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
+ + K + ++ S+ + E+I YPL SWL PY+ N
Sbjct: 277 HDATAWKATWTYE-------------NSASIFQPGEFIWADSAYPLQSWLCAPYKKNSSD 323
Query: 292 ASMPT--------FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR----KLPSIIL 339
P FN R + LKG ++ L + R DK+ +L S I+
Sbjct: 324 TPRPERDEPENARFNYHVSRVRVRSEHCIGYLKGRFQSLRGLRLRVDKQADVLRLDSWII 383
Query: 340 VCCLLHNIII 349
C +HN +
Sbjct: 384 ACICVHNFAL 393
>gi|195032221|ref|XP_001988458.1| GH11177 [Drosophila grimshawi]
gi|193904458|gb|EDW03325.1| GH11177 [Drosophila grimshawi]
Length = 383
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/311 (24%), Positives = 136/311 (43%), Gaps = 43/311 (13%)
Query: 53 ATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVS---RPPSGLINIEGRLLSVEKQVAI 109
A+I +++ FK FR+ + +F+ + ++ + P G +++ +LL
Sbjct: 57 ASISRYQDDQFKEQFRMQRSSFETLLQVIGNAIAGGEQHQPIGRVSLPEKLL-------Y 109
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFE 169
L L+ +S VG+ F + +S+ ++ A H++KWP N S
Sbjct: 110 TLTLLSGEKSFREVGINFAISKSSGHEIFRWVTSAFATLLPHYVKWPADNVCS--GSGIS 167
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSML-VQGIVDHEMRFIDIVTGWP 228
+ G++ G ID I + LP + + +YS L +Q + D RF+D+ P
Sbjct: 168 KLPGVV---GVIDECRIQLKLPVREENGHL-----HYSWLALQAVCDERSRFLDVHIDVP 219
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVEL---REYIVGGVGYPLLSWLITPY 285
G N +L S F+ R+ EE L +++VG + YPLL L+TP
Sbjct: 220 G--NQQCVLLKSDLFE------------RLIDEESPLMPPHKHLVGEMMYPLLLNLMTPC 265
Query: 286 ETNG--LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL-VCC 342
N L+ +N + A +AF L +R L+ + ++ S+++ C
Sbjct: 266 ADNNGELTPCHIRYNQAVHLWNAPAERAFAALLSRFRRLASL--DVGTMEVGSMVISATC 323
Query: 343 LLHNIIIDSGD 353
+LHN I+D G+
Sbjct: 324 MLHNFILDCGE 334
>gi|301631363|ref|XP_002944767.1| PREDICTED: putative nuclease HARBI1-like, partial [Xenopus
(Silurana) tropicalis]
Length = 274
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 102/224 (45%), Gaps = 9/224 (4%)
Query: 150 KHHLKWPD-SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSM 208
K L++P+ S + I +F + + NC GAID HI + P + + + + +S+
Sbjct: 4 KFFLQFPETSEEWQVIAQQFNDYWNFPNCGGAIDGKHIRIN-PPPNSGSYFFNYKGFFSI 62
Query: 209 LVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVE-LRE 267
++ IV+ FI + G G ++ +++ + F + ++ Q N+ ++E E L
Sbjct: 63 VLLAIVNANYEFIMVDIGKNGRLSDGGVIEQTHFNQKLKSKQL---NLPTNAETKEGLNF 119
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVM 326
VG + L L+ P+ L+ FN R + AF L +RI + +
Sbjct: 120 VFVGDEAFGLHENLLKPFPQKVLTPERKIFNYRLSRARRVVENAFGILANRFRIFHTSIN 179
Query: 327 WRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYG 370
P+K + ++L CC+LHN + + V++ DH D G
Sbjct: 180 LCPEK--IDMVVLSCCVLHNFLRKDNHASYAPVSMVDHEDIQAG 221
>gi|397614781|gb|EJK63010.1| hypothetical protein THAOC_16359 [Thalassiosira oceanica]
Length = 492
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/310 (23%), Positives = 128/310 (41%), Gaps = 39/310 (12%)
Query: 63 FKYFFRVSKKTFDYICSLVRE---DLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
F+ +R+SK F +C +RE D RP N L E ++AI LR LA G
Sbjct: 95 FRRKYRMSKGQFQLLCQRIREKVGDEDFRPGKNQSNC----LCGEIRLAIGLRLLAGGSY 150
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP--------------DSNRMEEIK 165
+G A+GV Q + + + E + +P R++EI
Sbjct: 151 LDLIGRAYGVESP---QSIYNYFHKVIEWINNTFDFPLVGLLQGLRRKDRRAIARLKEIS 207
Query: 166 SKF--EESFGLLNCCGAIDATHIIMTLPA-VQTSDDWCD--QENNYSMLVQGIVDHEMRF 220
S+F + F C GA+D + + P+ + +D C ++N +++ VQ I D + R
Sbjct: 208 SEFASDSDFCFTGCIGAMDGLAVRIGCPSQAEIADPTCFFCRKNFFALNVQAICDRKKRI 267
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
+ I G G + S + + F + + +E + ++VG YPL ++
Sbjct: 268 LWISPGHAGSTHDSTAWQDTKLF---------DLLEEMEAELEKAGFFLVGDSAYPLSAY 318
Query: 281 LITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILV 340
L PY + FN +R AF ++ + + + + P + L I+
Sbjct: 319 LQVPYPDAKPVTAEDAFNFWLSNSRIHIECAFGEIIMRFGLFWRALRFPLAKCL-DIVKA 377
Query: 341 CCLLHNIIID 350
LLHN ++D
Sbjct: 378 AALLHNFLVD 387
>gi|449664839|ref|XP_004206009.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 233
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 91/184 (49%), Gaps = 13/184 (7%)
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
E F + G +D THI + P +++S D+ + + +SM VQ I D + F+++ WP
Sbjct: 2 ETKFNMPQAFGFVDGTHIPIRRP-LESSQDYFNYKGFHSMSVQAICDSKGIFMNVDCKWP 60
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEV-----ELREYIVGGVGYPLLSWLIT 283
G ++ +++ F L + ++ ++ I+ E+ ++ Y++G YPL ++ +
Sbjct: 61 GSLHDAKV-----FSNLTFNLRMISNSIPITYRELLPGFCKVPCYVIGDPAYPLSTFCLK 115
Query: 284 PYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
Y + + + FN+ R+ AF +LK W IL K + +P++I C +
Sbjct: 116 EY-LHCQNNNQVVFNTSLRTARNQIECAFGRLKARWEILKKKI-DLKLENIPTVIYACFV 173
Query: 344 LHNI 347
LHN
Sbjct: 174 LHNF 177
>gi|38346294|emb|CAE04176.2| OSJNBa0029C04.7 [Oryza sativa Japonica Group]
Length = 758
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/238 (26%), Positives = 100/238 (42%), Gaps = 34/238 (14%)
Query: 132 STVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES-FGLLN-CCGAIDATHIIMT 189
STVS R + + LK D N + + ++S FG GA+D THI +T
Sbjct: 486 STVSIYFRRVLSRMVMLGSKILKPIDPNFTDIPRRLLQDSRFGPFQFAVGAVDGTHIPVT 545
Query: 190 LPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
+ V ++ + ++ + + V IV + R I GWPG ++ +R+L
Sbjct: 546 V-GVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWPGSVHDNRVL------------ 592
Query: 250 QRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPY--------ETNGLSASMP--- 295
N + EE + R+Y++ GYP + PY + G A P
Sbjct: 593 ---NEAIDSYPEEFPRLPFRKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPHAPPPEGR 649
Query: 296 --TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN +H R++ + F +K W+IL + + P K IIL LHN IDS
Sbjct: 650 EEKFNYIHAKLRNIIERQFGIVKKQWKILKGIPYNPYKNVQSDIILAAFCLHNFRIDS 707
>gi|72125597|ref|XP_788419.1| PREDICTED: uncharacterized protein LOC583419 [Strongylocentrotus
purpuratus]
Length = 366
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/308 (21%), Positives = 133/308 (43%), Gaps = 28/308 (9%)
Query: 57 SDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEG---RLLSVEKQVAIALRR 113
+++ +GF+ F R++ F ++V+R L + + L V +VAI +R
Sbjct: 6 NEDVQGFRNFTRITPAMF--------AEMVTRLTPRLQKYDTWYRKALPVGLKVAITMRY 57
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNR-MEEIKSKFEES 171
LASG+S S+ F V +T+S + +A+ E + + P + + I F +
Sbjct: 58 LASGDSYHSLMYLFYVPHNTISLLVLDVCQAIYAEYGEETISNPSTPEGWKGIAQTFSDR 117
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
+ +C GA+D HI + PA + + + S+++ +VD +F + G G
Sbjct: 118 WNFHHCLGALDGKHIRIKAPA-NCGSQFYNYKGYNSIVLLALVDGNYKFRWVEVGAGGAS 176
Query: 232 NVSRLLKFSGFFKLCEAGQRLNG-NVRISSEE------VELREYIVGGVGYPLLSWLITP 284
+ +++ + C + ++ N+ I +E ++ +++ + L + L+ P
Sbjct: 177 SDAQI------WNTCSLKEAIDDENIGIPPDEPLPHDDRDIPYFVIADAAFALRTTLMKP 230
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
+ L+ FN R AF L +R L M + + +I+L C +
Sbjct: 231 FGAKPLTMEERIFNYRLSRARRCVENAFGILANRFRCLLSAMAQVPS-TVETIVLACLCI 289
Query: 345 HNIIIDSG 352
HN++ D
Sbjct: 290 HNLLRDQA 297
>gi|322780068|gb|EFZ09807.1| hypothetical protein SINV_11159 [Solenopsis invicta]
Length = 255
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 4/92 (4%)
Query: 267 EYIVGGVGYPLLSWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
EYI+G YP+L+W I P+ NG L+ FN + R +AF LKG +R L +
Sbjct: 123 EYIIGDKAYPVLTWCIPPFRDNGRLTEEQKRFNKILSQKRQTIERAFALLKGRFRRLKFL 182
Query: 326 -MWRPDKRKLPSIILVCCLLHNIIIDSGDQLH 356
M R D +P IL C+LHNI + S D +
Sbjct: 183 DMSRLD--LIPFFILAACVLHNICLQSNDDVE 212
>gi|241999676|ref|XP_002434481.1| conserved hypothetical protein [Ixodes scapularis]
gi|215497811|gb|EEC07305.1| conserved hypothetical protein [Ixodes scapularis]
Length = 221
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 74/154 (48%), Gaps = 11/154 (7%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
T+P + F+ FRVS TF Y+ + R ++ + + I +VEK+VAI+L +
Sbjct: 75 TLPRMGDLYFRVCFRVSSTTFRYLVDVGRPSMLRQDTTMKTAI-----TVEKRVAISLYK 129
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIE-ALEERAKHHLKWPDSNRMEEIKSKFEESF 172
L S + ++G F VGQS V++ F + +EE H + + ++ +F+
Sbjct: 130 LCSSAEERTIGHLFAVGQSVVNETYREFCDVVIEELEAHTITMIRNKDLDNHMHEFQAVL 189
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY 206
G N GA+D H+ ++ P D NNY
Sbjct: 190 GFPNAIGALDGCHLPVSPPK-----DLAVDYNNY 218
>gi|449687874|ref|XP_004211572.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 273
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 100/222 (45%), Gaps = 26/222 (11%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDAT 184
AF + Q TVS+V EA+ ++ P D M+ S+ E F + T
Sbjct: 15 AFDIHQCTVSKVVLEVCEAITYHLGPNIYLPKDKTEMKNKISEMETKFNM-------PQT 67
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
HI + P +++S D+ + + +S+ VQ I D + F+D+ WPG ++ ++++ +
Sbjct: 68 HIPIRRP-LESSQDYFNYQG-FSISVQAICDSKGIFMDVDCKWPGSLHDAKIIPIT---- 121
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT 304
+ L+G ++ Y++G PL ++ + Y + + + FN+
Sbjct: 122 ---YRELLSGFCKVPC-------YVIGDPANPLSAFCLKEY-LHCQNNNQVVFNTSLRTA 170
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
R+ AF LK W IL K + +P+ I C +LHN
Sbjct: 171 RNQIECAFDHLKARWAILEKKI-DLKVENIPTAIYACFVLHN 211
>gi|218193455|gb|EEC75882.1| hypothetical protein OsI_12917 [Oryza sativa Indica Group]
Length = 380
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/287 (21%), Positives = 121/287 (42%), Gaps = 13/287 (4%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGV 125
F R+ D+ V++ R +GLI S ++V A+R+LA G + ++
Sbjct: 44 FLRIMNAVEDHDDYFVQK----RNAAGLIG-----FSCHQKVTAAMRQLAYGIAADALDE 94
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
G+ +ST + RF++A+ + +H +L+ P+ N + + E G G+ID
Sbjct: 95 YLGIAESTAIESLRRFVKAVVQVFEHEYLRSPNENDTTRL-LELGEDRGFPGMLGSIDCM 153
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
H + + + +++++ + ++ G PG N +L S F
Sbjct: 154 HWKWKNCPTELHGMYQGHVHEPTIILEAVASKDLWIWHAFFGMPGSHNDINVLHRSPLFA 213
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT 304
G+ N I+ + + Y+ G+ +++ T E +G F EA
Sbjct: 214 KLAEGKAPEVNYSINRHDYMMGYYLADGIYPSWATFVKTIPEPHG--NKRKYFAKAQEAV 271
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
R +AF L+ + I+ D++ L I+ C ++HN+II+
Sbjct: 272 RKDVERAFGVLQARFAIVRGPARHWDEKTLGYIMKACVIMHNMIIED 318
>gi|331213957|ref|XP_003319660.1| hypothetical protein PGTG_01834 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309298650|gb|EFP75241.1| hypothetical protein PGTG_01834 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 410
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 103/262 (39%), Gaps = 45/262 (17%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG+ TV +V+ R IEAL + H+ WPD
Sbjct: 138 LPIAHQLALTLERLGSNGNGASVGRFSRNLSVGRGTVVKVSRRVIEALISLGRRHVVWPD 197
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ R EI + P + + D++ YS+ Q I D +
Sbjct: 198 AARRAEISER-----------------------PGYD-GEVFFDRKRRYSINAQIICDCD 233
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+TGWPG SR+ K +L+ N S + +Y++ Y L
Sbjct: 234 KYITSFITGWPGSCGDSRVYK----------RMQLHLN---PSNYFDEGQYLLADSAYEL 280
Query: 278 LSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMW----RPDKR 332
+I Y+ + + + FN R LK W L ++ R R
Sbjct: 281 SHTVIPVYKVPAANIMINSQFNYCLAKARVRNEHTIGVLKSRWSSLREMRLHLYRRQHMR 340
Query: 333 KLPSIILVCCLLHNIIIDSGDQ 354
+ + C +LHN++ GDQ
Sbjct: 341 AYVAWLYSCIILHNLLAGLGDQ 362
>gi|31432223|gb|AAP53885.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 777
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 82/189 (43%), Gaps = 32/189 (16%)
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GA+D THI +T+ V ++ + ++ + + V IV + R I GWPG ++ +R+L
Sbjct: 554 GAVDGTHIPVTV-GVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWPGSVHDNRVL- 611
Query: 239 FSGFFKLCEAGQRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPY--------ET 287
N + EE + R+Y++ GYP + PY +
Sbjct: 612 --------------NEAIDSYPEEFPRLPFRKYLLVDSGYPSRMGFLAPYPRVRYHKDQF 657
Query: 288 NGLSASMP-----TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCC 342
G A P FN +H R++ + F +K W+IL + + P K IIL
Sbjct: 658 KGPRAPPPEGREEKFNYIHAKLRNIIERQFGIVKKQWKILKGIPYNPYKNVQSDIILAAF 717
Query: 343 LLHNIIIDS 351
LHN IDS
Sbjct: 718 CLHNFRIDS 726
>gi|57834109|emb|CAE04774.3| OSJNBa0079C19.15 [Oryza sativa Japonica Group]
Length = 728
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 115/276 (41%), Gaps = 46/276 (16%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGV 125
FR+S+ F+ + +L+ + S + +IE + + L + + +S V V
Sbjct: 467 MFRMSRPLFERLHNLLVTSYGLKSTSKMDSIEA--------LGMFLWTIGAPQSFVQVKN 518
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK-----SKFEESFGLLNCCGA 180
F + T+S ++++ +K +K P I +F+ F NC GA
Sbjct: 519 RFERSKGTISVKFEEVLQSVYLLSKDLVK-PRDPHFTTIHPRLHGDRFQPHFN--NCIGA 575
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNY-SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
ID THI++ +PA + N Y + V + D +MRF IV GWPG + R+
Sbjct: 576 IDGTHILVVVPASKVVQHV--GRNKYPTQNVLAVCDFDMRFTFIVAGWPGSAHDMRVFND 633
Query: 240 SGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP---- 295
+ LC+ I + Y+V GYP + PY+ G +P
Sbjct: 634 A----LCKYAT-------IFPHPPPGKFYLVDS-GYPNRMGFLAPYK--GTKYHLPEFRA 679
Query: 296 ---------TFNSLHEATRSLAVKAFLQLKGGWRIL 322
FN LH + R++ ++F LK WRIL
Sbjct: 680 GPRPSGKKEVFNHLHSSLRNVIERSFGVLKMKWRIL 715
>gi|82570160|gb|ABB83644.1| PIF-like transposase [Daucus carota]
Length = 425
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/307 (24%), Positives = 123/307 (40%), Gaps = 52/307 (16%)
Query: 64 KYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSV 123
K FR+ K+TF + S E++ PS R +SV ++V I + LA G S
Sbjct: 91 KIMFRMEKETF-FQLSRDLENIYELKPS-------RRMSVIEKVGIFVFILAQGASNRHA 142
Query: 124 GVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL----NCCG 179
F T+S+V + ++ AK +K PD ++I S + +C G
Sbjct: 143 QERFQHSGETISRVFHEVLRSVCSFAKELIK-PDDPEFKKIASHILNDQRYMPHFKDCIG 201
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
AID TH+ +P V + ++N + V + +MRF ++ GW G + +R+
Sbjct: 202 AIDGTHVHACVP-VNDQVRFIGRKNLPTQNVMAVCSFDMRFTFVLAGWEGTAHDTRIF-- 258
Query: 240 SGFFKLCEAGQRLNGNVRISSEEVEL-----REYIVGGVGYPLLSWLITPYETNGLSASM 294
N +++ + +Y + GYP + PY+ G+ +
Sbjct: 259 ---------------NAAVNTPHLNFPSPPQNKYYLVDAGYPQTLGYLGPYK--GVRYHL 301
Query: 295 P-------------TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
P FN H + RS + F K W+IL+ M + R I++
Sbjct: 302 PDFRRGQAPEGYQEIFNKAHSSLRSCIERTFGVWKKRWKILA-YMPQYSFRSQRDIVVAS 360
Query: 342 CLLHNII 348
LHN I
Sbjct: 361 MALHNYI 367
>gi|222625529|gb|EEE59661.1| hypothetical protein OsJ_12058 [Oryza sativa Japonica Group]
Length = 704
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 110/265 (41%), Gaps = 42/265 (15%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD---SNRMEE 163
+AI L L G S F T+S+ + A+ E + ++ D SN E
Sbjct: 401 LAIFLYILGDGSSNQRAQNCFKHSGETISRKIEEVLFAVVELGRDIVRPKDPNFSNVHER 460
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD---WCDQENNYSMLVQGIVDHEMRF 220
I+ +C GA+D THI+ +P DD + + + + V I DH+MRF
Sbjct: 461 IRKDRRMWPHFKDCIGAVDGTHILAVVP----DDDKIRYIGRSKSTTQNVMAICDHDMRF 516
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
G PG M+ + +L F+ L ++ I + + Y+V GYP
Sbjct: 517 TYASIGQPGSMHDTTVL-FNA----------LRTDIDIFPHPPQGKYYLVDA-GYPNRPG 564
Query: 281 LITPYETNGL------SASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKV---- 325
+ PY+ S P+ FN LH + R++ + F K WR+L K+
Sbjct: 565 YLAPYKGQRYHVPEFRRGSAPSGVKEKFNFLHCSVRTIIERCFGVWKMKWRVLLKMPSFP 624
Query: 326 MWRPDKRKLPSIILVCCLLHNIIID 350
+W K+K+ ++ LHN I D
Sbjct: 625 LW---KQKM--VVAATMALHNFIRD 644
>gi|357116426|ref|XP_003559982.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 265
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 89/195 (45%), Gaps = 29/195 (14%)
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
L +C GA+D THI +LPA Q + + S V I D +M FI G PG M+
Sbjct: 30 LKDCIGALDGTHIRASLPADQQV-RYIGRSGTASQNVLAICDFDMCFIYASIGQPGAMHD 88
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE------- 286
+ +L R++ + I + + Y+V GYP + PY+
Sbjct: 89 TSVL---------YPAIRVDHD--IFPHPSKGKYYLVDA-GYPNRPGYLAPYKGERYHVP 136
Query: 287 --TNGLSASMP--TFNSLHEATRSLAVKAFLQLKGGWRI-LSKVMWRPDKRKLPSIILVC 341
G S P FN +H A R++ ++F LK WRI L+ + DK+KL I+
Sbjct: 137 DFHRGAEPSTPKKKFNRVHPAVRNVIERSFGVLKMKWRILLTMPNYSMDKQKL--IVAAS 194
Query: 342 CLLHNIIID--SGDQ 354
+LHN + + SGD+
Sbjct: 195 MVLHNYVREHQSGDR 209
>gi|218198426|gb|EEC80853.1| hypothetical protein OsI_23470 [Oryza sativa Indica Group]
Length = 465
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/254 (25%), Positives = 104/254 (40%), Gaps = 50/254 (19%)
Query: 126 AFGVGQSTVSQVTW-------RFIEALEERAKHHLKWPDSNRMEEIKSKFEESF--GLLN 176
+F +S ++ W + L + AK ++ D N E + E+ F +
Sbjct: 179 SFSQAESCFTRSLWMVHTKFHEVLRCLRKLAKDNITPRDPNFSMEHERLREDRFWPYFKD 238
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
GAID +HI + +P +T C + S V I +MRF V WPG + SR+
Sbjct: 239 AIGAIDGSHIPVVVPVDETISHTC-RHGYTSQNVLAICVFDMRFTFAVASWPGFAHDSRI 297
Query: 237 LK--FSGF--FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
L + F F + G+ Y + +GYP + P++ G +
Sbjct: 298 LSHALANFPSFPMPPTGK-----------------YYLVDLGYPNRIGYLAPFK--GATY 338
Query: 293 SMP--------------TFNSLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKRKLPSI 337
+P FN LH + R++ ++F LK WRIL + + P +K I
Sbjct: 339 HIPEFRHHSGSPQGKYEVFNFLHSSLRNVIKRSFGVLKQKWRILKGIPSFPPTTQK--HI 396
Query: 338 ILVCCLLHNIIIDS 351
I+ C LHN + DS
Sbjct: 397 IMACLALHNFVHDS 410
>gi|321249689|ref|XP_003191538.1| hypothetical protein CGB_A5100C [Cryptococcus gattii WM276]
gi|317458005|gb|ADV19751.1| Hypothetical Protein CGB_A5100C [Cryptococcus gattii WM276]
Length = 299
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/251 (25%), Positives = 107/251 (42%), Gaps = 41/251 (16%)
Query: 142 IEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTL-PAVQTSDD-W 199
+ A+ + A++ + WPD I FEE + + +CCG +D TH+ + P++ + W
Sbjct: 1 MTAIVDNARNFIYWPDGAERRRISHWFEERYKIPDCCGIMDDTHVHFKMRPSLHDPESFW 60
Query: 200 CDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL-----------LKFSG-FFKLCE 247
C ++ + L+ IVDH RF I G+PG + SR+ L FS + + +
Sbjct: 61 CHHHSHGTNLMV-IVDHLCRFRHIQVGFPGSASDSRIQGNLRVFQQPELYFSADEYIIAD 119
Query: 248 AGQRLNGNV--------RISSEEVE--------LREYIVGGVGYPLLSWLITPY-ETNGL 290
AG + +V +S E E EYI+ G+ + ++ Y T L
Sbjct: 120 AGFTNDEHVVTMYRRLAELSGEHPEANQEQYFSANEYIMADAGFTNGNRVVAMYRRTAEL 179
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD--------KRKLPSIILVCC 342
S + P + L VK G W+ M R +R+L + I+
Sbjct: 180 SRNHPEAAYFNPKAARLRVKVECTF-GIWKNRFPTMQRSHTCLKDSIAQRRLHARIISSM 238
Query: 343 LLHNIIIDSGD 353
+LHN ++ GD
Sbjct: 239 VLHNPLVTIGD 249
>gi|48717071|dbj|BAD23759.1| ribosomal protein-like [Oryza sativa Japonica Group]
Length = 916
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/293 (21%), Positives = 125/293 (42%), Gaps = 13/293 (4%)
Query: 63 FKYFFRVSKKTFDYICSLVRED----LVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
F+ R+S+ F I + + + + R +GLI S ++V A+R+LA G
Sbjct: 569 FRRRNRMSRPLFLRIMNAIEDHDDYFVQKRNAAGLIG-----FSCHQKVTAAMRQLAYGI 623
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFGLLNC 177
+ ++ G+ +ST + RF++A+ + +H +L+ P+ N + + E G
Sbjct: 624 AADALDEYLGIAESTAIESLRRFVKAVVQVFEHEYLRSPNENDTTRLL-ELGEDRGFPGM 682
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G+ID H + + + +++++ + ++ G PG N +L
Sbjct: 683 LGSIDCMHWKWKNCPTELHGMYQGHVHEPTIILEAVASKDLWIWHAFFGMPGSHNDINVL 742
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
S F G+ N I+ + + Y+ G+ +++ T E +G F
Sbjct: 743 HRSPLFAKLAEGKAPEVNYSINGHDYMMGYYLADGIYPSWATFVKTIPEPHG--NKRKYF 800
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
EA R +AF L+ + I+ D++ L I+ C ++HN+II+
Sbjct: 801 AKAQEAVRKDVERAFGVLQARFAIVRGPARHWDEKTLGYIMKACVIMHNMIIE 853
>gi|356569920|ref|XP_003553142.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 379
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 118/297 (39%), Gaps = 50/297 (16%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ K F +C ++ + + R +S+ +QV + L L+ S +
Sbjct: 70 FRMKKLVFLELCDILETKYNLK--------KTRNVSIYEQVGLFLYMLSQPGSVRNCEER 121
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPD---SNRMEEIKSKFEESFGLLNCCGAIDA 183
F T+S+ +EA+ AK +K D + + EI +C GAID
Sbjct: 122 FQHSGKTISRHFHNVLEAVCMFAKDIIKPVDPSFRDTLHEILKDARYRPYFRDCIGAIDG 181
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI + +P+ + ++ + V + D M F + TGW G + +++
Sbjct: 182 THIRVCVPS-HLQGVYIGRKGYTTTNVMAVCDFSMCFTFVWTGWEGSAHDTKIFM----- 235
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS---------- 293
EA LR+Y + GYP + PY+ S
Sbjct: 236 ---EA----------------LRKYYLVDSGYPTFMGFLGPYKKTRYHLSQFRIGPRIRG 276
Query: 294 -MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL-LHNII 348
+ FN H + RS+ +AF K W+IL + P K+ + I+V C+ +HN I
Sbjct: 277 RVEVFNYYHSSLRSIIERAFGLCKARWKILGNM--SPFALKIQNQIIVACMAIHNFI 331
>gi|51038093|gb|AAT93896.1| unknown protein [Oryza sativa Japonica Group]
Length = 446
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 62/286 (21%), Positives = 121/286 (42%), Gaps = 13/286 (4%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGV 125
F R+ D+ V++ R +GLI S ++V A+R+LA G + ++
Sbjct: 110 FLRIMNAVEDHDDYFVQK----RNAAGLIG-----FSCHQKVTAAMRQLAYGIAADALDE 160
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
G+ +ST + RF++A+ + +H +L+ P+ N + + E G G+ID
Sbjct: 161 YLGIAESTAIESLRRFVKAVVQVFEHEYLRSPNENDTTRL-LELGEDRGFPGMLGSIDCM 219
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
H + + + +++++ + ++ G PG N +L S F
Sbjct: 220 HWKWKNCPTELHGMYQGHVHEPTIILEAVASKDLWIWHAFFGMPGSHNDINVLHRSPLFA 279
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT 304
G+ N I+ + + Y+ G+ +++ T E +G F EA
Sbjct: 280 KLAEGKAPEVNYSINGHDYMMGYYLADGIYPSWATFVKTIPEPHG--NKRKYFAKAQEAI 337
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
R +AF L+ + I+ D++ L I+ C ++HN+II+
Sbjct: 338 RKDVERAFGVLQARFAIVRGPARHWDEKTLGYIMKACVIMHNMIIE 383
>gi|29788843|gb|AAP03389.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 411
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 110/265 (41%), Gaps = 42/265 (15%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD---SNRMEE 163
+AI L L G S F T+S+ + A+ E + ++ D SN E
Sbjct: 108 LAIFLYILGDGSSNQRAQNCFKHSGETISRKIEEVLFAVVELGRDIVRPKDPNFSNVHER 167
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD---WCDQENNYSMLVQGIVDHEMRF 220
I+ +C GA+D THI+ +P DD + + + + V I DH+MRF
Sbjct: 168 IRKDRRMWPHFKDCIGAVDGTHILAVVP----DDDKIRYIGRSKSTTQNVMAICDHDMRF 223
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
G PG M+ + +L F+ L ++ I + + Y+V GYP
Sbjct: 224 TYASIGQPGSMHDTTVL-FNA----------LRTDIDIFPHPPQGKYYLVDA-GYPNRPG 271
Query: 281 LITPYETNGLS------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKV---- 325
+ PY+ S P+ FN LH + R++ + F K WR+L K+
Sbjct: 272 YLAPYKGQRYHVPEFRRGSAPSGVKEKFNFLHCSVRTIIERCFGVWKMKWRVLLKMPSFP 331
Query: 326 MWRPDKRKLPSIILVCCLLHNIIID 350
+W K+K+ ++ LHN I D
Sbjct: 332 LW---KQKM--VVAATMALHNFIRD 351
>gi|115758132|ref|XP_001199795.1| PREDICTED: uncharacterized protein LOC763733 [Strongylocentrotus
purpuratus]
Length = 331
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 111/248 (44%), Gaps = 9/248 (3%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRM-EEI 164
+++ LR LA+G + + F +G+ T+ + A+ +E A + P +N E+
Sbjct: 44 LSVFLRHLANGATYAELSFNFRMGKETIQKFVPDVARAVVDEYAAEVISLPTTNECWLEV 103
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
FE + L +C G D HI + P ++ + + + +S+++ +VD + +F+ I
Sbjct: 104 AGDFEARWNLPHCLGGYDGKHIRLQKPN-KSGSLYFNYKQFFSVVLMALVDSKYQFLWID 162
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN---VRISSEEVELREYIVGGVGYPLLSWL 281
G G + +++ S + EAG + + EE + + VG + + +++
Sbjct: 163 VGGVGHQSDAQIYNNSELKECIEAGTLGIPDPAPLPHDDEEHPMPYFFVGDYAFAMRTYM 222
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR-ILSKVMWRPDKRKLPSIILV 340
+ PY + FN R + AF L ++ L ++ PD +L +I
Sbjct: 223 MKPYGRRNMDQQQKIFNYRLSRARRVVENAFGILVLRFQCFLGQMRQEPDTVRL--LIEA 280
Query: 341 CCLLHNII 348
+LHN+I
Sbjct: 281 AVMLHNLI 288
>gi|357140134|ref|XP_003571625.1| PREDICTED: uncharacterized protein LOC100822756 [Brachypodium
distachyon]
Length = 449
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 79/189 (41%), Gaps = 31/189 (16%)
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C GAID HI + +PA QT + + + S V + D +MRF +V GW G + +R+
Sbjct: 221 CIGAIDGPHIPVIVPASQTVN-YTGRHGYTSQNVLAVCDFDMRFTFVVAGWAGSAHDTRI 279
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL------ 290
+S ++ E Y+V GYP + P++
Sbjct: 280 FNYS-----------MDKYANTYHSPPEGTYYLVDS-GYPNRKGYLAPFKGQTYHLLEFQ 327
Query: 291 -----SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS---IILVCC 342
+ + FN H + R++ ++F LK RIL V + K+ S II C
Sbjct: 328 NRRPPTGKLEVFNHAHSSLRNVIERSFGVLKQKCRILRDV----PQYKIASQTMIINSCM 383
Query: 343 LLHNIIIDS 351
LHN I DS
Sbjct: 384 ALHNFIRDS 392
>gi|72018042|ref|XP_786652.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 304
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 53/227 (23%), Positives = 105/227 (46%), Gaps = 6/227 (2%)
Query: 126 AFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNR-MEEIKSKFEESFGLLNCCGAIDA 183
A+ V +++S+V +EA+ EE L+ P + + ++ + + + + GAID
Sbjct: 4 AWRVPHNSISKVVREVVEAIVEEYVDELLRCPTNEQGWRQLAEDWYQRWNFPHTVGAIDG 63
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
H+ PA + + + + YS+L+ +VD + +FI I G + +++ S
Sbjct: 64 KHVACKAPA-NSGSTYYNYKGFYSILLFAMVDADYKFIYIDASSKGSASDAQIYNASDLK 122
Query: 244 KLCEAGQRLN--GNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLH 301
E + G + ++ ++ +IVG + L ++L+ PY + L+ FN
Sbjct: 123 DGLERNLIMGFPGPDPLPNDTQDVPYFIVGDDAFSLRTYLMKPYSSRYLAREERIFNYRL 182
Query: 302 EATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
R + AF L ++IL M + D + SI+ C+LHN++
Sbjct: 183 SRARRVVENAFGILANRFQILLTTM-QHDPETVKSIVEARCILHNLM 228
>gi|357162176|ref|XP_003579328.1| PREDICTED: uncharacterized protein LOC100834753 [Brachypodium
distachyon]
Length = 611
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 69/266 (25%), Positives = 114/266 (42%), Gaps = 33/266 (12%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD--- 157
+S ++ +A+ L L + +S + F ST+S + ++ A ++ D
Sbjct: 316 MSSKEALALFLWTLGAPQSNIQAANRFEHSPSTISNKFMEVLMCVDRMAGDYIAPIDPTF 375
Query: 158 SNRMEEIKS-KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
++ E++K KF F + GAID THI + +PA + +++ S V + D
Sbjct: 376 THVHEKLKKPKFWPHFK--DAIGAIDGTHIPVIVPA-ELKFIHTNRKGYTSQNVLAMCDF 432
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+MRFI V GWPG ++ +R+ + + + Y+V GYP
Sbjct: 433 DMRFIFAVPGWPGSVHDTRVWSDA------------RAEYDTFPHPPQGKYYLVDS-GYP 479
Query: 277 LLSWLITPYE---------TNGLSASM-PTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
+ PY+ N M FN H + R++ +AF LK W IL +
Sbjct: 480 NRVGYLAPYKGQRYHVLEFENAPPVGMQEMFNHCHSSLRNVIERAFGVLKRKWPILQGIP 539
Query: 327 WRPD-KRKLPSIILVCCLLHNIIIDS 351
P K+K+ I+ C LHN I DS
Sbjct: 540 AYPVLKQKM--IVSACMCLHNYIRDS 563
>gi|393229441|gb|EJD37064.1| hypothetical protein AURDEDRAFT_129708 [Auricularia delicata
TFB-10046 SS5]
Length = 172
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/176 (22%), Positives = 80/176 (45%), Gaps = 7/176 (3%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
+S+K F+++ +++++ V + + Q+A L + G++ + G
Sbjct: 1 MSRKAFEHLLDIIQDNPVFKSKGQKPQTPTCI-----QLATFLAKYG-GDNSLQAATVAG 54
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIM 188
V + + R I A+ + WPD+ E IK+ + +G+ C G +D + I +
Sbjct: 55 VSEGSAYNFCERVITAVRNLRDQFVHWPDAEERESIKTAMAD-YGIPGCTGLVDGSLIRL 113
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
++ + ++ Y + +Q + H RF + TGWPG + + LLK S +K
Sbjct: 114 VDKPIRNPYAYWTRKKFYGIALQTVCKHHGRFTEYETGWPGSVADTTLLKESHMWK 169
>gi|58266772|ref|XP_570542.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134110346|ref|XP_776000.1| hypothetical protein CNBD0500 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258668|gb|EAL21353.1| hypothetical protein CNBD0500 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57226775|gb|AAW43235.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 427
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 116/285 (40%), Gaps = 43/285 (15%)
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKW------ 155
+ + +AIAL L S +G F +S + AL + + +W
Sbjct: 113 TAAEHLAIALAYLRKKTSVNWIGEEFKKSNEFISFCIHDVVNALTSKVIYD-RWVKYPTG 171
Query: 156 ---PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQ 211
P++ ++ +++ G G+ID THI + + P +Q + + +++ S+ V
Sbjct: 172 REPPNAKVADDPLNRYRHFHG---AVGSIDGTHIAVKVSPRMQPA--YRNRKGVVSINVL 226
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVG 271
D +++F I+ GW G N S F +A R N ++ Y +
Sbjct: 227 AACDFDLKFTYIIAGWEGSANDS--------FTFNQAVHRYN------FPQLPSNRYFLA 272
Query: 272 GVGYPLLSWLITPYETN--GLSASMPT----------FNSLHEATRSLAVKAFLQLKGGW 319
G+P+ L+TPY L+ P+ FN H R+ + F K W
Sbjct: 273 DAGFPICDQLLTPYRGTRYHLADFHPSRGAPQSEKEVFNLAHAQLRNSVERIFGITKQRW 332
Query: 320 RILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL-HPDVALSD 363
+LS + + IIL CC+LHN+ I D++ DV +D
Sbjct: 333 GVLSGGLEKFKGGMQSQIILACCVLHNMCISIKDRVPERDVGEAD 377
>gi|449677576|ref|XP_002162156.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 329
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 106/253 (41%), Gaps = 27/253 (10%)
Query: 129 VGQSTVSQVT-------WRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
+G++TV + W+ ++ + RA + +K + I +F++++ C GAI
Sbjct: 24 IGRATVCHIIKETCCAIWKVLKKVFLRAPNDVK-----EWQNIIKEFDQNWNFPQCIGAI 78
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D H+ + PA ++ + + + YSM++ I D + F + G G N + +L S
Sbjct: 79 DGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGRDNDAAVLNAST 137
Query: 242 FFKLCEAGQRLNGNV----RISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
F G+ N +IS + ++ +VG + L WL+ PY L+ F
Sbjct: 138 F------GRAFNKGYFNLPKISEIDPKVPPVLVGDDIFALKPWLMKPYPGKNLTVQQRVF 191
Query: 298 NSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNII-IDSGDQL 355
N R +F L WRI S + +P K+ II LHN + + G
Sbjct: 192 NYRLSRARRTIENSFGILAARWRIYRSPIKAKP--LKVEHIIKATVCLHNYLRLTDGAHY 249
Query: 356 HPDVALSDHHDSG 368
P + SG
Sbjct: 250 IPTSFVDSESHSG 262
>gi|328697485|ref|XP_003240353.1| PREDICTED: hypothetical protein LOC100574226 [Acyrthosiphon pisum]
Length = 681
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 68/303 (22%), Positives = 130/303 (42%), Gaps = 27/303 (8%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
D++E F F R+ K T + + L S+ N +L + K + + LR A G
Sbjct: 39 DDDEFFMRF-RLKKTTVKELLFKIEFHLKSK-----TNRNRAILPITK-LLLTLRFYALG 91
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLN 176
++ G GV +++ + EA+ ++K P+ + ++E + KF
Sbjct: 92 TILLANGDFVGVSKTSACNIVRTVTEAIASLRPLYIKMPEHHCNIQETRLKFYNIARFPR 151
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
GAID TH+ + P ++ + +++ +S+ VQ + + +DIV WPG + +
Sbjct: 152 IIGAIDCTHVKLQSPGGNIAEVYRNRKGYFSLNVQVVGGRSLEILDIVARWPGSTHDQVI 211
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYI-VGGVGYPLLSWLITPYETNGLSASMP 295
N + E E+ + I +G GY +++ P + +A +
Sbjct: 212 FN--------------NSTIHFKFETNEMGDNILLGDGGYECRPYILVPLISPNTNAEL- 256
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILS-KVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
+N TR+ + F K + ILS + PD+ + +II+ +LHN+ G+
Sbjct: 257 LYNESQIRTRNTIERLFGVWKRRFPILSLGIRTTPDRAQ--AIIVATAVLHNLASVLGES 314
Query: 355 LHP 357
+ P
Sbjct: 315 IPP 317
>gi|218187156|gb|EEC69583.1| hypothetical protein OsI_38912 [Oryza sativa Indica Group]
Length = 269
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 111/257 (43%), Gaps = 32/257 (12%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+++E+QVA+ + + S + F TVS+ ++AL A+ + +
Sbjct: 25 VTIEEQVAMFIHVVGHNWSNRPIAFEFMRSGETVSRYFNLVLDALCILARDLVCIKSIDT 84
Query: 161 MEEIKS---KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
++I S +F F C GA+D THI +P + D + ++ S V VD +
Sbjct: 85 HQKITSSHGRFHPYFQ--GCIGALDGTHIPACVP-MHMQDRFRGRKKFPSQNVLAAVDFD 141
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+RF+ ++ GW G + S +L+ +A R NG ++ +Y + GY
Sbjct: 142 LRFLYVLAGWEGSAHDSYVLQ--------DALSRTNG------LKIPEGKYFLADAGYAA 187
Query: 278 LSWLITPYETNGL------SASMP-----TFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
L+ PY+ A P FN H + R+ +AF LK +RI
Sbjct: 188 RPGLLPPYQGTRYHLKEYKGAREPENPKELFNLRHSSLRTTIERAFGALKNRFRIFKCQP 247
Query: 327 WRPDKRKLPSIILVCCL 343
+ P K ++ I++ CC+
Sbjct: 248 FFPLKTQV-KIVMACCV 263
>gi|58263791|ref|XP_569175.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134108270|ref|XP_777086.1| hypothetical protein CNBB3180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259771|gb|EAL22439.1| hypothetical protein CNBB3180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57223825|gb|AAW41868.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 427
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 116/285 (40%), Gaps = 43/285 (15%)
Query: 102 SVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKW------ 155
+ + +AIAL L S +G F +S + AL + + +W
Sbjct: 113 TAAEHLAIALAYLRKKTSVNWIGEEFKKSNEFISFCIHDVVNALTSKVIYD-RWVKYPTG 171
Query: 156 ---PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQ 211
P++ ++ +++ G G+ID THI + + P +Q + + +++ S+ V
Sbjct: 172 REPPNAKVADDPLNRYRHFHG---AVGSIDGTHIAVKVSPRMQPA--YRNRKGVVSINVL 226
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVG 271
D +++F I+ GW G N S F +A R N ++ Y +
Sbjct: 227 AACDFDLKFTYIIAGWEGSANDS--------FTFNQAVHRYN------FPQLPSNRYFLA 272
Query: 272 GVGYPLLSWLITPYETN--GLSASMPT----------FNSLHEATRSLAVKAFLQLKGGW 319
G+P+ L+TPY L+ P+ FN H R+ + F K W
Sbjct: 273 DAGFPICDQLLTPYRGTRYHLADFHPSRGAPQSEKEVFNLAHAQLRNSVERIFGITKQRW 332
Query: 320 RILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL-HPDVALSD 363
+LS + + IIL CC+LHN+ I D++ DV +D
Sbjct: 333 GVLSGGLEKFKGGMQSQIILACCVLHNMCISIKDRVPERDVGEAD 377
>gi|449692739|ref|XP_004213152.1| PREDICTED: uncharacterized protein LOC101239759, partial [Hydra
magnipapillata]
Length = 226
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 83/198 (41%), Gaps = 44/198 (22%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SV K+VA+AL LAS E V FG+G+ST + IE L + ++
Sbjct: 10 ISVVKRVAVALHNLASCEEYRVVSSLFGIGKSTAN-----LIENLNKHSR---------- 54
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
FE G C G +D HI + P Q + + + + YS+++ +VD RF
Sbjct: 55 ------DFEAILGFPQCVGTVDGCHIPILAPKEQATSYY-NYKGWYSIVLFAVVDCRYRF 107
Query: 221 IDIVTGWPGGMN---------VSRLLKFSGFFKLC-EAGQRLNGNVRISSEEVELREYIV 270
I G PG N + +LK + F K C E G L + ++
Sbjct: 108 IYTSVGLPGRNNDIYILPNFPLEGILKSNLFDKCCKELGDSL------------VSLCLI 155
Query: 271 GGVGYPLLSWLITPYETN 288
G +PL L+ P+ N
Sbjct: 156 GDSAFPLTRHLLKPHPEN 173
>gi|347966157|ref|XP_003435876.1| AGAP013250-PB [Anopheles gambiae str. PEST]
gi|333470184|gb|EGK97538.1| AGAP013250-PB [Anopheles gambiae str. PEST]
Length = 358
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 64/311 (20%), Positives = 136/311 (43%), Gaps = 19/311 (6%)
Query: 47 NSSTPGATIPSDE--EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVE 104
NS+ S+E +E + +S++ F+Y+ + + + R + + +
Sbjct: 3 NSTVSEMPAASEEISDETVRELLGMSEEDFNYLLNEISGKISRRD-----TFMRKAFTAK 57
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM-EE 163
+++ + LR LA+GES +++ + + S++ + E L + K ++++P S
Sbjct: 58 ERLIVTLRFLATGESFMALASLYDISASSIRTIIPEVCECLIKALKRYVQFPPSEAGWLR 117
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ F++ + + G IDA H+ + P + T D+ + + YS+++ IVD F+ +
Sbjct: 118 VSEAFQDRWQFPHAIGVIDARHVKIRKP-LHTDKDYLNYKGFYSIVLLAIVDASANFMYV 176
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLN---GNVRISSEEVELREYIVGGVGYPLLSW 280
G G + +L+ + + E LN V V++ ++G + +
Sbjct: 177 CVGGKGSIADGGMLRNASYHSKFE-HHELNVPPPAVLDERHAVKIPYMLLGDKSFLFTEY 235
Query: 281 LITPYETNGLSASMP--TFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSI 337
I P+ + L P TFN R+ A AF L ++I ++ +PDK +
Sbjct: 236 CIRPFGGH-LKPDSPESTFNYRMSQARTPAAVAFDGLCSRFKIFGTIINLQPDKAG--KV 292
Query: 338 ILVCCLLHNII 348
++ L N +
Sbjct: 293 VMAAVYLFNFL 303
>gi|332021416|gb|EGI61784.1| hypothetical protein G5I_09901 [Acromyrmex echinatior]
Length = 419
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 71/330 (21%), Positives = 127/330 (38%), Gaps = 35/330 (10%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
++ E F +R+ + FD + +L+ P NI L E ++A+ L L+ G
Sbjct: 42 NDYEKFFGAYRMWPEQFDLLVNLLH-------PHFKKNILKPSLPTELRLAVTLLYLSQG 94
Query: 118 ESQVSVGVAFGVGQSTVSQVT----WRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFG 173
++ FG+G+STV ++ W L+ LK P I F + +
Sbjct: 95 DNAKLKHAEFGIGKSTVHKIVNETCWAIWTTLQPIV---LKPPSKEDWIAISEDFMKKWQ 151
Query: 174 LLNCCGAIDATHIIMTLP-----AVQTSDDW---CDQENNYSMLVQGIVDHEMRFIDIVT 225
NC GA+D ++ + P A W ++ ++M++ + D +F +
Sbjct: 152 FPNCLGALDGRYMTIQAPPNSGSAFYNYKQWLRSAEKPQFFNMILLAVCDATYKFTWVNI 211
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI-VGGVGYPLLSWLITP 284
G ++ + + AG L + + E ++ +G +PL + ++ P
Sbjct: 212 EQRGSISDDGVWANTKLASSLAAGDLLLPDPTLFPETNTPFPFVFIGDEAFPLSTHMMRP 271
Query: 285 YETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
Y L+ M FN R AF L WRIL K ++ +++ L
Sbjct: 272 YPREKLTDDMRIFNYRLLRVRRTIENAFGILTARWRILHKPLYM-SITNCENVLKALVCL 330
Query: 345 HNIIIDSGDQLHPDVALSDHHDSGYGEQCC 374
HN I+ L + H+S Q C
Sbjct: 331 HNFIM-----------LGEEHESINNRQYC 349
>gi|403160032|ref|XP_003320595.2| hypothetical protein PGTG_02617 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169382|gb|EFP76176.2| hypothetical protein PGTG_02617 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 369
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 76/174 (43%), Gaps = 11/174 (6%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS--- 116
+ FK R SK F +C L RE + P N + + Q+A+ L RL S
Sbjct: 3 DSDFKQATRTSKSGF--LCVL-RE--IFSHPIFQSNSNRPQIPIPHQLALTLERLGSNGN 57
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
G S V + V TV ++T R EAL + ++KW + R +EI S + G
Sbjct: 58 GSSVVCLSRNLSVAHGTVIKITRRVTEALTSLEEQYIKWLNQQRRQEISSVMKNE-GFDG 116
Query: 177 CCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
C G +D T I + P W D + YS+ VQ + D + I + GW G
Sbjct: 117 CVGFVDGTTIPLFQWPGFNGEVFW-DHKKRYSINVQIVCDCDKNIIAFLNGWQG 169
>gi|58261902|ref|XP_568361.1| hypothetical protein CNM00420 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230534|gb|AAW46844.1| expressed protein [Cryptococcus neoformans var. neoformans JEC21]
Length = 492
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 107/262 (40%), Gaps = 30/262 (11%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEE 163
E Q+A + R+A GE +V F + +VS T R + A+ K ++ WP
Sbjct: 115 EVQLATCIYRMAGGERMSTVENHFNLSHGSVSLYTDRSLIAIVSSLKQYVFWPSEAERGV 174
Query: 164 IKSKFEESFGLLNCCGAIDATHIIM-TLPAV--QTSDDWCDQENNYSMLVQGIVDHEMRF 220
+ + +G+ +C G ID T I++ P++ + + + Y + +VDH RF
Sbjct: 175 LARELYAQYGIPSCIGFIDGTDIVLHQAPSIGREKAHTMHSYKERYGYKMIAVVDHLKRF 234
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY----- 275
G+ N + Q L+ R +EY++G G
Sbjct: 235 RYAWFGFSAATND-------------QMAQDLSDLHRNPHRFFSPKEYVLGDAGMKSSDT 281
Query: 276 --PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS--KVMWR--P 329
PL T G A FN R + +AF LK W+IL ++ R
Sbjct: 282 VIPLFKRERRMQVTVGPKA---YFNHKCAKARVMIEQAFGILKNRWQILQDCRLTCRTVT 338
Query: 330 DKRKLPSIILVCCLLHNIIIDS 351
D+ +L +I C +LHN+++++
Sbjct: 339 DEARLYLVIQACMVLHNLLVET 360
>gi|253761844|ref|XP_002489296.1| hypothetical protein SORBIDRAFT_0010s002010 [Sorghum bicolor]
gi|241946944|gb|EES20089.1| hypothetical protein SORBIDRAFT_0010s002010 [Sorghum bicolor]
Length = 370
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 72/308 (23%), Positives = 127/308 (41%), Gaps = 21/308 (6%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVR--EDLV--SRPPSGLINIEGRLLSVEKQVAIALRRLA 115
+EG + FR+SK F I + V+ +D R +G++ LS ++V ALR L
Sbjct: 63 DEGVQITFRMSKDLFLRIMNAVQVHDDYFVQKRNCAGVLG-----LSCFQKVTAALRMLT 117
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEAL----EERAKHHLKWPDSNRMEEIKSKFEES 171
G S ++ +G+ST + RF+ A+ E H D R+ + E
Sbjct: 118 YGVSADAIDEYIRIGESTALESLHRFVAAVVGIFEAEYLRHRNEADIARLFAVN----EK 173
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G G+ID H V++ + N +++++ + ++ + G PG
Sbjct: 174 RGFPGMLGSIDCMHWAWKNCPVESQGQYKGHVNQPTIILEAVASQDLWCWHVFFGMPGSH 233
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW-LITPYETNGL 290
N +L S F + N I+ E ++ ++ G+ YP SW I + +
Sbjct: 234 NDINVLHRSPLFDNLAEVKAPEVNYTINGHEYKMGYFLADGI-YP--SWGTIVKSISIPM 290
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
F + EA R + F L+ + I+ + D L I+ C ++HN+I++
Sbjct: 291 GNKRRYFATAQEAARKDVERFFGVLQSRFAIVRQPGRIWDSETLALIMRACVIMHNMIVE 350
Query: 351 SGDQLHPD 358
L PD
Sbjct: 351 DERNLDPD 358
>gi|449671240|ref|XP_004207455.1| PREDICTED: uncharacterized protein LOC101234440 [Hydra
magnipapillata]
Length = 588
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 74/139 (53%), Gaps = 3/139 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDS- 158
+S +++ + LR A+G+SQ F +G++TV +T +A+ + + +LK P+S
Sbjct: 28 ISPSERLTVTLRYFATGDSQQIQSFYFRLGRTTVCNITNEITKAIWDVLQPSYLKAPESS 87
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ E+I ++FE + NC GAID H+ + P V + + + +N +SM++ I D +
Sbjct: 88 DEWEKIANEFENEWNFPNCIGAIDGKHVCIEAP-VSSGSAYYNYKNYHSMVLLAICDAKY 146
Query: 219 RFIDIVTGWPGGMNVSRLL 237
F + G G N + +
Sbjct: 147 CFTLVDIGSYGRDNDASIF 165
>gi|403173167|ref|XP_003332267.2| hypothetical protein PGTG_14563 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170156|gb|EFP87848.2| hypothetical protein PGTG_14563 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 284
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 66/141 (46%), Gaps = 4/141 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPD 157
L + Q+A+ L RL S + SVG VG TV + T IE L + ++ WP+
Sbjct: 39 LPIAHQLALTLERLGSNGNATSVGRFSRNLNVGCGTVIKATRWVIEVLVSLGRQYVMWPN 98
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S R +I S+ ++ G + C +D T I + + + D++ YS+ Q I D +
Sbjct: 99 STRRAKI-SEVMKNEGFVGCVRFVDGTTIPLFQRPGYDGEVFYDRKRRYSINAQIICDCD 157
Query: 218 MRFIDIVTGWPGGMNVSRLLK 238
+TGWPG S + K
Sbjct: 158 KYITSFITGWPGTCRDSLVYK 178
>gi|345497222|ref|XP_003427938.1| PREDICTED: hypothetical protein LOC100679215 [Nasonia vitripennis]
Length = 287
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 78/168 (46%), Gaps = 15/168 (8%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVS---RPPSGLINIEGRLLSVEKQVAIALRRL 114
++EE F F R++ +T+ Y+ LVRE LV RPP L E + ++ L L
Sbjct: 64 NDEEQFINFTRMNVQTYMYVYELVRERLVKRSRRPP----------LPPELRFSLTLNYL 113
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFG 173
A G+S + +G STV QV L E L +P + + I ++F E
Sbjct: 114 AHGDSIRKNEWFYNIGLSTVKQVIPEVCTVLCEVLMPLFLPFPSRQQFQVIANEFMEDLH 173
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
NC GA+D H + P + + + +N +S+++ D + RFI
Sbjct: 174 FPNCIGALDGKHCRIRKPG-GSGSLFFNFKNFHSIVLMACCDSKKRFI 220
>gi|390343679|ref|XP_787690.2| PREDICTED: uncharacterized protein LOC582653 [Strongylocentrotus
purpuratus]
Length = 341
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 58/263 (22%), Positives = 115/263 (43%), Gaps = 17/263 (6%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPD 157
+ L V +VAI +R LASG+S S+ F V +T+S + +A+ E + + P
Sbjct: 18 KALPVGLKVAITMRYLASGDSYHSLMYLFYVPHNTISLLVLDVCQAIYAEYGEETISNPS 77
Query: 158 SNR-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ ++I F + + +C GA+D HI + PA + + + S+++ +VD
Sbjct: 78 TPEGWKDIAQSFSDRWNFHHCLGALDGKHIRIKAPA-NCGSQFYNYKGYNSIVLLALVDG 136
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNG-NVRISSEEV------ELREYI 269
+F + G G + +++ + C + ++ N+ I +E ++ ++
Sbjct: 137 NYKFRWVEVGAGGASSGAQI------WNTCSLKEAIDDENIGIPPDEPLPHDDRDIPYFV 190
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
+ + L + L+ P+ L+ FN R AF L +R L M +
Sbjct: 191 IADAAFALRTTLMKPFGAKPLTMEERIFNYRLSRARRCVENAFGILANRFRCLLSAMAQV 250
Query: 330 DKRKLPSIILVCCLLHNIIIDSG 352
+ +I+L C +HN++ D
Sbjct: 251 PS-TVETIVLACLCIHNLLRDQA 272
>gi|389751484|gb|EIM92557.1| hypothetical protein STEHIDRAFT_107036 [Stereum hirsutum FP-91666
SS1]
Length = 362
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 107/276 (38%), Gaps = 27/276 (9%)
Query: 55 IPSDEEEGFKYFFRVSKKTFDYICSLVREDLV----SRPPSGLINIEGRLLSVEKQVAIA 110
+P+ G +S+ TF+ I L+ D + R P V Q+A
Sbjct: 74 LPASASTGLNAALSMSRDTFEAIVRLLEADPIFQSKGRKPQ---------RPVRMQLACF 124
Query: 111 LRRLAS-GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFE 169
L R + G G+G TV R +AL + +W +R E + E
Sbjct: 125 LMRYGTRGSDARGAAHRMGIGFGTVFLYCRRVAKALRRLGLDYAQWGGEDRRERTATNIE 184
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
G NC G +D T I +T T + ++ ++ VQ +VD RF++ GWPG
Sbjct: 185 AMSGFPNCIGVVDGTLIRLTGAPQNTGGIFYCRKKFPAITVQAVVDEFCRFLNFELGWPG 244
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
+ S + K S + LN V + E+ +Y++G P I+
Sbjct: 245 SVPDSSMWKQSWVW--------LN-RVDLFGED----QYVMGDKVRPFNENEISHAADGA 291
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
M +FN R+ +AF +LK + L ++
Sbjct: 292 ERKRMRSFNKELSGVRAWVEQAFGRLKARFPSLKEI 327
>gi|195114180|ref|XP_002001645.1| GI15789 [Drosophila mojavensis]
gi|193912220|gb|EDW11087.1| GI15789 [Drosophila mojavensis]
Length = 375
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 126/307 (41%), Gaps = 36/307 (11%)
Query: 55 IPSDEEEGFKYFFRVSKKTFDYICSLVREDLVS---RPPSGLINIEGRLLSVEKQVAIAL 111
IP ++ FK FR+ + +F+ + V + + P +++ +LL L
Sbjct: 52 IPRFLDDQFKEQFRMKRSSFELLLQAVGKAIAGAEHHQPIARVSLPEKLL-------YTL 104
Query: 112 RRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES 171
L+ +S VG +F + +S+ ++ ++KWP N S+
Sbjct: 105 TLLSGNKSFREVGESFAISKSSGHEIFKWVTSGFAALMPCYVKWPKDNACGIKCSQ---- 160
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
L G ID I + LP + S+ +Q + D RF+D+ P
Sbjct: 161 --LPGVVGVIDECRIPLKLPVREEQGHM----QYASLALQAVCDERSRFLDVHIDVPDN- 213
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG-- 289
S LLK F +L + + L + +++VG + YPLL L+TPY N
Sbjct: 214 QCSVLLKSELFERLIDMEEPL----------MPADKHLVGEMTYPLLLNLMTPYADNNGE 263
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL-VCCLLHNII 348
L+ +N + A +AF L +R L + ++ SI++ C+LHN I
Sbjct: 264 LTPCHIRYNQAIHLWNAPAERAFTALMSRFRRLKSL--DVGTMEVASIVVAAACMLHNFI 321
Query: 349 IDSGDQL 355
+D G+ +
Sbjct: 322 LDCGEPI 328
>gi|390602233|gb|EIN11626.1| hypothetical protein PUNSTDRAFT_24969, partial [Punctularia
strigosozonata HHB-11173 SS5]
Length = 290
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 105/241 (43%), Gaps = 34/241 (14%)
Query: 58 DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIAL---RRL 114
D + F+ F +S FD I + D P+ N + V++Q+AIAL R
Sbjct: 71 DFPDIFRSFTGISPACFDNILCCIGTD-----PAFHNNSNNPQMPVDQQLAIALYRFRHF 125
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEAL--EERAKH-HLKWPDSNRMEEIKSKFEES 171
+ S + V + GV TV VT + A+ + +H + +PD+ E+ KS E
Sbjct: 126 GNAVSVLKVALWAGVSVGTVIGVTKHVMAAVCGDSGLRHGAIAFPDAEAKEKAKSWVET- 184
Query: 172 FGLLNCCG------AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
++C G +D T I + + W D+++NYSM +Q + +++ ID
Sbjct: 185 ---MSCSGWRDGWLMVDGTLIPLYERPAFFGNTWYDRKSNYSMNLQLVSTPDLQIIDYSV 241
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
G PG + S + +K E + LN + E++ G YPL +W PY
Sbjct: 242 GLPGSQHDSTAWAETRIYK--EHEKLLNSD-----------EWVWGDSAYPLETWCQAPY 288
Query: 286 E 286
+
Sbjct: 289 K 289
>gi|29837768|gb|AAP05804.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108709497|gb|ABF97292.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 396
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 98/238 (41%), Gaps = 28/238 (11%)
Query: 131 QSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES--FGLLNCCGAIDATHIIM 188
STVS ++A+ + A LK D N ++ +E+ GA+D THI +
Sbjct: 121 NSTVSVHFHHVLQAMNDLAGKILKPVDPNFLDYNPQILQENPFKPFYQSVGAVDGTHIPV 180
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEA 248
LP ++ ++ + + V I DH+ R I GWPG ++ R+L EA
Sbjct: 181 -LPNAASAMQHRNRHHITTRNVLVICDHDGRIIFCDAGWPGSVHDQRILN--------EA 231
Query: 249 GQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-------ETNGLSASMP------ 295
Q + V L Y++ G+P + PY + + + P
Sbjct: 232 VQAYPYDF----PRVPLGRYLLVDSGFPTRMGFLAPYPHVRYHRDQLAVEGAPPPVGREE 287
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
TFN H R + + F+ K W+IL ++ + D+ II LHN +DS D
Sbjct: 288 TFNHRHSTLRGIVERQFVIAKKMWKILKEIPYYRDEDIPARIIHAAFALHNFRLDSKD 345
>gi|389609757|dbj|BAM18490.1| unknown unsecreted protein [Papilio xuthus]
Length = 259
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 84/192 (43%), Gaps = 11/192 (5%)
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
E++ F+ + L +C G +D HI M P + +D+ + N+S+++ G+VD F+
Sbjct: 9 EDVAQNFKLKWNLPHCIGHLDGRHIEMIRP--KNGNDYLNYRKNFSIVLLGLVDANYNFL 66
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREY-IVGGVGYPLLSW 280
+ G PG ++ S + + + + + + Y IVG + L
Sbjct: 67 YVDVGTPGKISDSEVFNSTNLCRSLQNNTLNLPPPKPLPGRTQPSPYVIVGDDAFALSKH 126
Query: 281 LITP---YETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK-VMWRPDKRKLPS 336
L+ P Y+ NG M FN R + AF L +R+L + ++ P K +
Sbjct: 127 LLKPYSYYQINGHPEKM--FNYRLNRGRVVVENAFGILSARFRVLRQSILVSPPKAS--T 182
Query: 337 IILVCCLLHNII 348
I L C LHN +
Sbjct: 183 IALACVYLHNFL 194
>gi|392576917|gb|EIW70047.1| hypothetical protein TREMEDRAFT_61809 [Tremella mesenterica DSM
1558]
Length = 303
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 65/138 (47%), Gaps = 2/138 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
Q+ + + RLA G ++ FG+ + TV+Q T R + A+ +R K ++WP+ IK
Sbjct: 117 QLGLLVFRLAHGHDTATIARTFGISRGTVNQWTSRSLYAMCKRLKEVIRWPEPEERPRIK 176
Query: 166 SKFEESFGLLNCCGAIDATHIIM-TLPAV-QTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ + +C G ID HI + PA+ S + ++ Y V VDH RF +
Sbjct: 177 QHILDVHTIPHCLGFIDGIHINLDRAPAIGPASGSFHSRKERYGFNVLAAVDHRKRFTFL 236
Query: 224 VTGWPGGMNVSRLLKFSG 241
G+ + RL + G
Sbjct: 237 HWGFSARSSDMRLQQSMG 254
>gi|413953029|gb|AFW85678.1| hypothetical protein ZEAMMB73_716392, partial [Zea mays]
Length = 287
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 64/270 (23%), Positives = 104/270 (38%), Gaps = 38/270 (14%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
+AIA+ RLASG ++ GV S + R L + L +P S+ +
Sbjct: 18 LAIAIARLASGLPYPALAELLGVPPSAPRATSRRLRRVLLANFRFWLAFP-SDPTGAYSA 76
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
G L C T + T Q + R + + G
Sbjct: 77 PLPSCRGALCCARFAGPTGPLAT---------------------QLVAGASSRVLSLTAG 115
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVG-GVGYPLLSWLITPY 285
+ G +L+ S ++ E G+ L+ ++Y+VG G GYPLL WL+ P+
Sbjct: 116 FRGDRTDLEVLRLSSLYQEAEHGKLLDS-----------QQYLVGDGGGYPLLPWLMVPF 164
Query: 286 ETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
+ S FN+ H A V+ ++ GW ++++ R + I C +L
Sbjct: 165 PGPLVPGSPEAEFNAAHRAM-CRPVRRVIRSLMGWGAIARLHEEESSRAAVACIGTCAML 223
Query: 345 HNIIIDSGD--QLHPDVALSDHHDSGYGEQ 372
HN+++ D L PDV +D G Q
Sbjct: 224 HNVLLSREDYSALAPDVEEADSDLGGLHSQ 253
>gi|449688772|ref|XP_004211844.1| PREDICTED: uncharacterized protein LOC101238960, partial [Hydra
magnipapillata]
Length = 269
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 56/261 (21%), Positives = 117/261 (44%), Gaps = 12/261 (4%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGESQVSVGVAF 127
+S TF+ + +V D+ N + R+ + ++++AI +R LA+G++ ++ +
Sbjct: 1 MSPSTFEKLLQIVALDITK------ANTKMRIPICAQERLAITVRYLATGDAHTTIAANY 54
Query: 128 GVGQSTVSQVTWRFIEAL-EERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATH 185
+ +TV ++ + A+ ++K P+S E+I +FE + +C GAID H
Sbjct: 55 RMSPTTVGRIVYETCNAIWNNLLGEYVKAPNSETEWEKIAKEFETRWHFSHCVGAIDGKH 114
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF-FK 244
+ M PA ++ + + + +S+++ + D + RFI + G G + + S F
Sbjct: 115 VQMFAPA-RSGSSYFNYKKTHSIVLMAVSDAKYRFILVNIGDSGRQSDGSVYNNSQLGFA 173
Query: 245 LCEAGQRLNGNVRISSEEVELREY-IVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEA 303
+ ++ + +IS+ ++ Y V + L ++ PY L FN
Sbjct: 174 IENNLLKIPKDSKISNNSDKIMPYAFVADDAFGLKRHMMKPYAFKNLLTDKLIFNYRLSR 233
Query: 304 TRSLAVKAFLQLKGGWRILSK 324
R + A +R+L K
Sbjct: 234 ARRVVENACDIASSRFRVLHK 254
>gi|226528653|ref|NP_001145667.1| uncharacterized protein LOC100279167 precursor [Zea mays]
gi|195659415|gb|ACG49175.1| hypothetical protein [Zea mays]
gi|223974183|gb|ACN31279.1| unknown [Zea mays]
Length = 382
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 123/314 (39%), Gaps = 48/314 (15%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
+ FRVS TF ++ L+ E L+S P S L +A+AL RLASG +
Sbjct: 79 YPLAFRVSAPTFHFLSGLL-EPLLSHPSS---------LPSPVLLALALARLASGLPYPA 128
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
+ FGV S + R L + L +P S+ + G L C
Sbjct: 129 LAELFGVPPSAPRAASRRLRRVLLANFRFWLAFP-SDPTGAYSAPLPSCRGALCCARFAG 187
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
T + T Q + R + + G+ G +L+ S
Sbjct: 188 PTGPLAT---------------------QLVAGASSRVLSLTAGFRGDRTDLEVLRLSSL 226
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVG-GVGYPLLSWLITPYETNGLSASMPT-FNSL 300
++ E G+ L+ ++Y+VG G GYPLL WL+ P+ + S FN+
Sbjct: 227 YQEAEHGKLLDS-----------QQYLVGDGGGYPLLPWLMVPFPGPLVPGSPEAEFNAA 275
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD--QLHPD 358
H A V+ ++ GW ++++ R + I C +LHN+++ D L PD
Sbjct: 276 HRAM-CRPVRRVIRSLMGWGAIARLHEEESSRAAVACIGTCAMLHNVLLSREDYSALAPD 334
Query: 359 VALSDHHDSGYGEQ 372
V +D G Q
Sbjct: 335 VEEADSDLGGLHSQ 348
>gi|149022650|gb|EDL79544.1| rCG26755 [Rattus norvegicus]
Length = 302
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 79/175 (45%), Gaps = 8/175 (4%)
Query: 76 YICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVS 135
Y+ L+ L SRP R +S E Q+ AL SG Q +G A G+ Q+++S
Sbjct: 71 YLVELLGASL-SRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMS 124
Query: 136 QVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
+ EAL ERA + +P D ++ +K +F G+ GA+D H+ + P +
Sbjct: 125 RCVANVTEALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCIHVAIKAPNAE 184
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
+ +++ +S+ + D + + T WPG + +L+ S E G
Sbjct: 185 -DLSYVNRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSLSSQFETG 238
>gi|390352427|ref|XP_003727898.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 201
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 88/192 (45%), Gaps = 23/192 (11%)
Query: 179 GAIDATHIIMTLPAVQTSD-DWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GAID TH+ + ++T++ + +++ +S+ VQ + D I++ WPG ++ R+L
Sbjct: 4 GAIDCTHVWLDGSPLKTTEYAYVNRKGWHSINVQFVTDANYNIINVCARWPGSVHDCRVL 63
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
+ N V + EL ++ GYP SWL+TP+ N + + T+
Sbjct: 64 E--------------NSYVGQDFQRGELEGVLLSDSGYPQRSWLMTPFR-NPQTHAERTY 108
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID------S 351
N H R + + Q+K + L + + R +K II+ C +L+ + D
Sbjct: 109 NRAHMRGRVVVEQTNGQIKKKFPCLRRGL-RVKPKKACQIIIACTVLYRLSKDWKEPYLG 167
Query: 352 GDQLHPDVALSD 363
D PDV + D
Sbjct: 168 RDHELPDVGVDD 179
>gi|55296123|dbj|BAD67842.1| ribosomal protein -like [Oryza sativa Japonica Group]
gi|55296301|dbj|BAD68081.1| ribosomal protein -like [Oryza sativa Japonica Group]
Length = 428
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 61/286 (21%), Positives = 121/286 (42%), Gaps = 13/286 (4%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGV 125
F R+ D+ V++ R +GLI S ++V A+R+LA G + ++
Sbjct: 92 FLRIMNAVEDHDDYFVQK----RNAAGLIG-----FSCHQKVTAAMRQLAYGIAADALDE 142
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
G+ +ST + RF++A+ + +H +L+ P+ N + + E G G+ID
Sbjct: 143 YLGIAESTAIESLRRFVKAVVQVFEHEYLRSPNENDTTRL-LELGEDRGFPGMLGSIDCM 201
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
H + + + +++++ + ++ G PG N +L S F
Sbjct: 202 HWKWKNCPTELHGMYQGHVHEPTIILEAVASKDLWIWHAFFGMPGSHNDINVLHRSPLFA 261
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT 304
G+ N I+ + + Y+ G+ +++ T E +G F EA
Sbjct: 262 KLAEGKAPEVNYSINRHDYMMGYYLADGIYPSWATFVKTIPEPHG--NKRKYFAKAQEAV 319
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
R +AF L+ + I+ +++ L I+ C ++HN+II+
Sbjct: 320 RKDVERAFGVLQARFAIVRGPARHWNEKTLGYIMKACVIMHNMIIE 365
>gi|321262627|ref|XP_003196032.1| hypothetical Protein CGB_I0210C [Cryptococcus gattii WM276]
gi|317462507|gb|ADV24245.1| Hypothetical Protein CGB_I0210C [Cryptococcus gattii WM276]
Length = 458
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/256 (23%), Positives = 107/256 (41%), Gaps = 31/256 (12%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEE 163
E Q+A + R+A GE + +V F + +VS T R + A+ + H+ WP
Sbjct: 115 EVQLATCIYRMAGGERESTVENHFNLSHGSVSHYTDRSLIAIASSLREHVSWPTDAERAV 174
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ K +GL +C G ID T I TS ++ ++ + +VDH RF
Sbjct: 175 LARKLYAQYGLPSCVGFIDGTDI------HPTSSIGLEK----ALKLIAVVDHLRRFRYC 224
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLIT 283
+ ++ + Q L+ IS +EY++G + +I
Sbjct: 225 WSILSAPVS-------------DQTAQDLSDFQTISDRLFSPKEYVLGDADMKSSNTVIP 271
Query: 284 PYE--TNGLSASMPT--FNSLHEATRSLAVKAFLQLKGGWRILSKVMWR----PDKRKLP 335
++ G + P FN + +A +AF LK W+IL K D+ +
Sbjct: 272 LFKRGRGGQATVGPKAYFNHKCAEAQVIAEQAFGILKNRWQILKKCRLTCRTVKDEARFF 331
Query: 336 SIILVCCLLHNIIIDS 351
+I + C +LHN++I++
Sbjct: 332 TISVACLILHNLLIET 347
>gi|72112371|ref|XP_790108.1| PREDICTED: uncharacterized protein LOC585178 [Strongylocentrotus
purpuratus]
Length = 429
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 125/301 (41%), Gaps = 44/301 (14%)
Query: 63 FKYFFRVSKKTF----DYICSLVREDLVSRP--PSGLINIEGRLLSVEKQVAIALRRLAS 116
FK + R+ F D + + + RP P+GL ++AI LR LA+
Sbjct: 81 FKAYLRIEPPMFREIIDRLTPRISKHQDCRPGLPAGL------------RLAITLRFLAT 128
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFI-EALEERAKHHLKWPDS-NRMEEIKSKFEESFGL 174
G+S S+G +F V T+S + E + E + L P + + E+ S F +
Sbjct: 129 GDSYHSLGFSFRVACCTISVLVPEVCHEIVAEYKEEVLAIPTTPDGWWEVASAFSRRWNY 188
Query: 175 LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
+C G +D HI + P SD + + +S+++ G+VD + F+ + G G M+
Sbjct: 189 HHCLGVMDGKHIRIKKPRKSGSDYY--NKGFFSIILLGVVDADYTFMWVNVGARGSMSD- 245
Query: 235 RLLKFSGFFKLCEAGQRLNGNVR-------ISSEEVELREYIVGGVGYPLLSWLITPYET 287
+G F C ++++ + +S ++ + + VG + L ++ PY
Sbjct: 246 -----AGGFNGCSMKRKIDAGMLGMPDPDPLSHDDQDTQYLSVGNDAFALRPSMMKPYSH 300
Query: 288 NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
L FN R + AF +L+ + P+ SI C +LH I
Sbjct: 301 RYLKNDERIFNYRTSRARRVVENAF-------GLLTTLAIMPN--NAISITRACVILHQI 351
Query: 348 I 348
+
Sbjct: 352 M 352
>gi|331224990|ref|XP_003325166.1| hypothetical protein PGTG_06703 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 402
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/317 (22%), Positives = 130/317 (41%), Gaps = 42/317 (13%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
FK FR+S+++F C L+ E N+ R V +Q+ + L+R+ + + S
Sbjct: 84 FKQEFRMSQQSF---CRLILEIQDHPVFHNQSNVPQR--PVPEQLMVTLKRMGTHGNGAS 138
Query: 123 VGVA---FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCG 179
VG+ F + + TV R IEA+ L + +EI + G NC G
Sbjct: 139 VGMLARFFRISEGTVILYCSRVIEAI-------LSLESTEERQEIAWQISSHTGFRNCVG 191
Query: 180 AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
ID T ++ D+ ++ +Y + + + R I VTGWPG + +RL +
Sbjct: 192 FIDGTLFPLSEKPSIDPQDYYSRKGHYGLAALIVCNEAKRIIYYVTGWPGCCHDTRLWE- 250
Query: 240 SGFFKLCEAGQRLNGNVRISSEEV-ELREYIVGGVGYPLLSWLITPYET---NGLSASMP 295
N +++ +++ +Y++ G+P + ++ ++ + +
Sbjct: 251 -------------NTELKLQKDQLFSPGQYLIADSGFPPETNIVPAFKRPPHGAMERARK 297
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPD------KRKLPSIILVCCLLHNIII 349
FN + R LKG ++ L + R D K+ I C +LHN ++
Sbjct: 298 RFNQHLSSLRVCNEHCIGLLKGRFQSLRGL--RKDLNSAGTMEKITHWISACVILHNFLL 355
Query: 350 -DSGDQLHPDVALSDHH 365
D + DV +D H
Sbjct: 356 SDQSPDVFTDVDDNDIH 372
>gi|449691348|ref|XP_004212640.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 296
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 69/138 (50%), Gaps = 2/138 (1%)
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ Q T+S++ +F +L +R ++K+P D + + E + F G ID TH+
Sbjct: 80 ALSQPTISKIIRQFSMSLAKRVNEYIKYPTDPHVLNESRVNFYNVAEFPKITGLIDGTHV 139
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P + + ++ +N+S+ VQ + + +F DIV WPG + +R+L+ S
Sbjct: 140 CIQKPR-KREYIYVNRSSNHSINVQAVCAYNGKFYDIVAKWPGSTHDARVLRESNLGSTT 198
Query: 247 EAGQRLNGNVRISSEEVE 264
A +R N +R + +E
Sbjct: 199 AAQRRYNIFLRRTRVLIE 216
>gi|405978759|gb|EKC43123.1| Putative nuclease HARBI1 [Crassostrea gigas]
Length = 289
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 87/191 (45%), Gaps = 6/191 (3%)
Query: 53 ATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR-LLSVEKQVAIAL 111
AT+P+ E F FR+ ++TF+ +C +VS + GR ++ KQ+ I++
Sbjct: 57 ATVPAYAPEEFSSHFRMRRETFEEVC----RQIVSTGHIPQEQMHGREIIEPSKQIMISI 112
Query: 112 RRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES 171
LA+ E + F V S+V + R AL+ + +KWP E+ FE+
Sbjct: 113 WMLANMEGYRQISDRFNVTYSSVYRCFMRTCRALQCLSAEKIKWPTGAWANEVMQGFEKI 172
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSD-DWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
G GA+D HI + P + + +++ +YS+++Q + I G
Sbjct: 173 KGFPRVLGAVDGCHIEIKAPQEKYHPLSYLNRKRDYSVILQAGHIYSDSVYSIAKSDCLG 232
Query: 231 MNVSRLLKFSG 241
+N++R + G
Sbjct: 233 LNLNRYGEIEG 243
>gi|219990729|gb|ACL68738.1| MIP02663p [Drosophila melanogaster]
Length = 344
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 126/293 (43%), Gaps = 33/293 (11%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
++ F F+ + K TF I +V+ ++ LS Q+A R L +G S
Sbjct: 49 DQSFVKFYAIDKITFKKILKVVKP-----------YVQVSALSHPIQLAAVFRYLVTGCS 97
Query: 120 QVSVGVAFGVG--QSTVSQVTWRFIEALEER-AKHHLKWPDSNRMEEIKSK-FEESFGLL 175
+++V + V S ++++ RFI L++ + + S + +I SK F E + L
Sbjct: 98 ELAVAHDYRVRIESSMLAKILNRFIPLLQKLLCQATISIQMSRQQMQISSKNFWEKYKLP 157
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+ THI + PA SD + +++ YS+ V + + M I +PG S
Sbjct: 158 KVVACLVGTHIGIKKPAKDCSD-FLNKKGYYSLNVMLVCNDNMEIIASDATFPGSCRDSV 216
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ S +L LNG+ +I+ Y S+++TPY+ +
Sbjct: 217 IWNRSRARELLSVT--LNGH------------FILANSKYSQESFVLTPYKNAEIGTYQH 262
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSK-VMWRPDKRKLPSIILVCCLLHNI 347
TFN H R++ + LK + L + + + P + I+ VCC LHN+
Sbjct: 263 TFNLRHAQARNMVEQTIEVLKNRFLCLQRGLKYEPSFCCM--IVNVCCALHNL 313
>gi|326430471|gb|EGD76041.1| hypothetical protein PTSG_00750 [Salpingoeca sp. ATCC 50818]
Length = 1299
Score = 60.5 bits (145), Expect = 1e-06, Method: Composition-based stats.
Identities = 47/182 (25%), Positives = 83/182 (45%), Gaps = 18/182 (9%)
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
L +C G + +I + PA + ++ + + +N S+ VQ +V H+ +F + G+PG +
Sbjct: 404 LPSCLGVLHHDYIGIVRPA-RDNEFFRNVDNRPSIGVQAVVGHDDKFYSVAAGFPGSVKH 462
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
L+ + LC G R Y+V G +PLLSWL+TP+ +
Sbjct: 463 DLCLRQT---PLCRQGAR-----------ALYPGYLVAGATFPLLSWLVTPFPAPQTAKQ 508
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
M N + + A F +++ + L + D ++ CCL+HN+ +D GD
Sbjct: 509 M-ALNRIISCRQQRAGSVFARVRNRFPRLEQFDLGLDWAGTAAV--ACCLIHNLCLDMGD 565
Query: 354 QL 355
L
Sbjct: 566 TL 567
>gi|240952160|ref|XP_002399331.1| conserved hypothetical protein [Ixodes scapularis]
gi|215490537|gb|EEC00180.1| conserved hypothetical protein [Ixodes scapularis]
Length = 385
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/252 (19%), Positives = 100/252 (39%), Gaps = 47/252 (18%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN-RMEEI 164
++ I L L G S ++ FG+ +ST+ V +R++ L + + P ++ ++ E+
Sbjct: 63 ELLIFLHWLGMGASYRAIAGTFGIPRSTIHNVVYRYVNNLVDHLTLLIHPPRTDAKLREV 122
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
F D C E ++ + D RF+D
Sbjct: 123 GDGFA----------------------------DLCSSEA-FARAGGALCDSHGRFLDFS 153
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS---WL 281
G+P M S + + S F E G +++GG GYP LS +
Sbjct: 154 VGYPPSMEPSEVFRASPLF---EKGDYPPPGY-----------FLIGGEGYPCLSSPVAV 199
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
+ P+ + +FN++ ++ A Q+ W+++ + +++ ++ +C
Sbjct: 200 MPPFHKPLQDEAEVSFNAVCNRALTVLHDALGQMTSRWKLVFERRLTALRQRTIKVVAIC 259
Query: 342 CLLHNIIIDSGD 353
++HNI +D GD
Sbjct: 260 AMIHNICVDEGD 271
>gi|328711999|ref|XP_003244703.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 418
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 113/254 (44%), Gaps = 21/254 (8%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
Q+ LR A+G Q + G G ST++++ + A+ ++ +PD+ EEI+
Sbjct: 122 QLLCTLRYYATGCFQTTGGDLCGFSSSTMNRIVHKVSCAIALLRSQYIHFPDNP--EEIR 179
Query: 166 S---KFEESFGLLNCCGAIDATHIIM-TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
+F GAID THI + P T++ + +++ YS+ VQ I + + +
Sbjct: 180 RTQLEFYRRAKFPRVVGAIDCTHIKLWQSPGGDTAERFRNRKGYYSLNVQAICNANLEVM 239
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
D+V + G + SR+ + S L E G + G+ +V GY S++
Sbjct: 240 DVVARYDGSTHDSRIFRESKRRALFEQG--VYGDA-----------LLVADSGYACTSYM 286
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
+TP A +N TR+ + F K + I++ + KR P II
Sbjct: 287 MTPLHECHTPAEQ-LYNESQIRTRNPIERFFGVWKRRFPIMALGLRVKLKRVFP-IITAT 344
Query: 342 CLLHNIIIDSGDQL 355
+L+NI +G+++
Sbjct: 345 LVLNNIARRAGEEV 358
>gi|326668615|ref|XP_003198838.1| PREDICTED: hypothetical protein LOC100536291 [Danio rerio]
Length = 252
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 74/152 (48%), Gaps = 7/152 (4%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+S+ TF+++ L+ L + +E RL ++A+ L A+ ++
Sbjct: 107 FRMSRNTFEFVLQLLSPSLKRKTTGWRKPLEPRL-----RLAVVLWWYATPSEYRTISCL 161
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
FG+G STV + + AL+ + + P R+++ F G C GAID HI
Sbjct: 162 FGLGISTVCMLVRQVTNALKTLCERFICLPKGERLQKTIDGFFAR-GYKMCAGAIDGCHI 220
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ P V + +C+++ +S+++Q +VDH
Sbjct: 221 PILKPHVDQA-AYCNRKGWHSIVLQAVVDHNF 251
>gi|301611294|ref|XP_002935178.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 378
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 83/184 (45%), Gaps = 21/184 (11%)
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
G+ N GAID TH+ ++ P + + ++ +S+ +Q + D M IV+G+PG
Sbjct: 206 GIPNVLGAIDRTHVAISHPQ-DREHVFRNCKSYHSLNIQVVCDATMNIRSIVSGFPGSSQ 264
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
+ +L+ SG ++ +AG+ + +++G GYP WLITP A
Sbjct: 265 DAYILRQSGLYEGFQAGKMPHW-------------WLLGDAGYPCSRWLITPIPRPHTRA 311
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLLHNII 348
FN H S+ + LK L K +M+ P K+ I++ +LHN+
Sbjct: 312 EC-AFNEAHITAWSVIERTLGVLKSRSHCLDKSGGSLMYSPT--KVAQIVVAYAVLHNVA 368
Query: 349 IDSG 352
G
Sbjct: 369 NQHG 372
>gi|403163873|ref|XP_003323943.2| hypothetical protein PGTG_05845 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375164650|gb|EFP79524.2| hypothetical protein PGTG_05845 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 383
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 81/180 (45%), Gaps = 23/180 (12%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E FK R +K F ++ S V + P N L + Q+A L+RL S
Sbjct: 85 RDEDFKQAVRTTKAGFTWLLSQVNLN-----PIFHSNSFRPQLPIPHQLAPTLKRLGSNG 139
Query: 119 SQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ SVG GVG+ TV + + R I A+ ++ +L WPD R +EI S + G
Sbjct: 140 NGASVGQFSRNLGVGRGTVVKASRRVIRAINNLSEKYLTWPDKVRRKEI-SDVMKCEGFK 198
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G +D T T+P Q S+ + D + R++++V G+P + V R
Sbjct: 199 GCVGFVDGT----TIPLYQRP----------SIDGEVFFDRKKRYLEMVEGYPLRIPVIR 244
>gi|345484344|ref|XP_003425010.1| PREDICTED: hypothetical protein LOC100679846 [Nasonia vitripennis]
Length = 207
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 99/211 (46%), Gaps = 15/211 (7%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+ + F + R+ TF+Y+ S+V L +R + +S E+++ I LR LASG+
Sbjct: 4 DNDIFFQYVRMDVDTFNYLLSMVEPYLRNRKK------KRDAISAEQRLTITLRYLASGD 57
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
+S+ +++ +G+ TV + R + + + +++ P + I FE+ + L C
Sbjct: 58 DILSIAISYRLGKPTVYSIIKRTCSVIHQVLSPIYVRPPCTPVWRNIIKGFEQDWQLPGC 117
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GA+D HI + P + + + + +S+ + +H F + G G +
Sbjct: 118 IGAVDGKHICIQSPP-DSGALFYNYKKFHSIALLAACNHRYEFTVVEVGAYGSESD---- 172
Query: 238 KFSGFFKLCEAGQRLN-GNVRISSEEVELRE 267
G F E G+ L+ G + I +E V+ R+
Sbjct: 173 --GGIFLRSEFGKSLDEGCLNIPTESVKKRK 201
>gi|405973321|gb|EKC38041.1| Retinol dehydrogenase 12 [Crassostrea gigas]
Length = 313
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 6/125 (4%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R+ + +C DL RP + R L V Q+ +ALR A+G Q +GV
Sbjct: 41 YRLPRHNIIDLCGRFNRDL-RRP-----TLRSRPLPVSLQMMVALRFYATGSFQAILGVV 94
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ + +VS++ F + L + ++K P N +I F G LN GAID THI
Sbjct: 95 HIISRPSVSRIINNFTDCLVRLSPEYVKMPTQNDSVQIMQGFSNIAGFLNVIGAIDETHI 154
Query: 187 IMTLP 191
+ P
Sbjct: 155 RIKSP 159
>gi|270015953|gb|EFA12401.1| hypothetical protein TcasGA2_TC016403 [Tribolium castaneum]
Length = 308
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 258 ISSEEVELREYIVGGVGYPLLSWLITPY-ETNGLSASMPTFNSLHEATRSLAVKAFLQLK 316
I E + +I+G YPLL W I PY E + FN +H R + ++F L
Sbjct: 165 IDEEFLTPNTFIIGDKAYPLLRWCIPPYIERRQHTPLQANFNRVHAKIRQVVERSFALLF 224
Query: 317 GGWRILSKV-MWRPDKRKLPSIILVCCLLHNIIIDSGD 353
G +R L + M R D +PS IL CC+LHNI + D
Sbjct: 225 GRFRRLKYLDMQRLDL--IPSTILACCVLHNICLQHQD 260
>gi|443725999|gb|ELU13341.1| hypothetical protein CAPTEDRAFT_85761, partial [Capitella teleta]
Length = 190
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 63/117 (53%), Gaps = 3/117 (2%)
Query: 112 RRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWP-DSNRMEEIKSKFE 169
R LA+GE+ S+ F +G ST+S + ++++ + +L +P D+ E I ++E
Sbjct: 8 RFLATGETFTSLSFGFRIGVSTISAIVSETVDSIWKNLGPRYLAFPKDATEWESIAERYE 67
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
E + +C GA+D H+ + P + + + ++S+++ +VD + RFI I G
Sbjct: 68 ERWNFPHCVGALDGKHVTIMAPP-NAGSAFFNYKKSHSIVLLALVDADCRFIAIEVG 123
>gi|403173656|ref|XP_003332716.2| hypothetical protein PGTG_14381 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170638|gb|EFP88297.2| hypothetical protein PGTG_14381 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 319
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 65/138 (47%), Gaps = 4/138 (2%)
Query: 103 VEKQVAIALRRLASGESQVSVGV---AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN 159
V +Q+ +AL RL + V+VG+ + +G TV T R I A+ L+WP +
Sbjct: 113 VIEQMMVALNRLGCFGNGVAVGMIATCYRIGHGTVEVYTNRCIMAILSLKTTLLEWPTAA 172
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+E K+ + E G C G ID + ++++ + D+ ++ Y + + D
Sbjct: 173 ARQETKAHYGE-VGFKGCVGLIDGSLVVLSTCPEKDGQDYYSRKGFYCIATLLVCDQHKN 231
Query: 220 FIDIVTGWPGGMNVSRLL 237
+ TGWPG + RL+
Sbjct: 232 ITYVFTGWPGCSHDMRLM 249
>gi|322785023|gb|EFZ11784.1| hypothetical protein SINV_07926 [Solenopsis invicta]
Length = 168
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 51/100 (51%), Gaps = 4/100 (4%)
Query: 268 YIVGGVGYPLLSWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
++VG Y L L+ PY NG L+ +N H + R + +AF LK +R L +
Sbjct: 38 HLVGDAAYKLHENLLVPYRDNGHLTERQRNYNFCHSSARIVIERAFGLLKERFRCLLTTL 97
Query: 327 WRPDKRKLPSIILVCCLLHNIIIDSGDQLH---PDVALSD 363
+P IL CC+LHNI + GD+L+ DV L+D
Sbjct: 98 AMDRTDLIPMHILSCCVLHNICLMRGDELNFETNDVELTD 137
>gi|357119672|ref|XP_003561559.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 332
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 116/284 (40%), Gaps = 39/284 (13%)
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKW 155
++ +SVE+QVAI L +A + ++ F T+ + R +EA+ +++
Sbjct: 20 VDTTYVSVEEQVAIFLYAVAKNATNETLEDWFQHSPDTIHRHFKRVLEAITNLTPIYIRP 79
Query: 156 PDSNRMEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV 214
P + + K KF F NC GA+D THI M LP Q + + +++ S
Sbjct: 80 PSLHPHSILRKPKFYPFFK--NCIGAVDGTHIPMKLPLDQ-QEPYRNRKQTISXNAMVAC 136
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
D +++F+ I GW G + +++L+ LN EV ++ + G
Sbjct: 137 DCDLKFVHINLGWEGSASDAKVLQ-----------DALNHGF-----EVPDGKFYLVDAG 180
Query: 275 YPLLSWLITPY-------ETNGLSASMPT-----FNSLHEATRSLAVKAFLQLKGGW--R 320
Y + PY G + P FN H R+ ++ G W R
Sbjct: 181 YANTPQFLAPYRGTRYHLNEQGRARQKPQNHKELFNLRHAQLRN----HIERIIGIWKMR 236
Query: 321 ILSKVMWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSD 363
+V K K I + C +LHN I I +GD P A+ D
Sbjct: 237 FPIQVASHFPKEKQIDISVACAVLHNFIRIHNGDMTWPSDAIMD 280
>gi|239789603|dbj|BAH71416.1| hypothetical protein [Acyrthosiphon pisum]
Length = 168
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 68/126 (53%), Gaps = 4/126 (3%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-N 159
+ E ++ + LR LA+G+S S+ F + ++T+S ++A+ LK P+S
Sbjct: 28 IPAECKLNVTLRFLATGDSFSSLQYLFRIPKNTISTFIPEVLDAIYSALLDFLKVPNSVE 87
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD--HE 217
+EI+ F E + C GAID H+ + P S ++ + +N++S+++ +VD +
Sbjct: 88 EWKEIEKGFNEKWNFPGCIGAIDGKHVAIRAPMFSGS-EYYNYKNSFSIILMAVVDANYC 146
Query: 218 MRFIDI 223
R+IDI
Sbjct: 147 FRYIDI 152
>gi|386764417|ref|NP_001245670.1| CG43088 [Drosophila melanogaster]
gi|383293392|gb|AFH07384.1| CG43088 [Drosophila melanogaster]
Length = 362
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 126/293 (43%), Gaps = 33/293 (11%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
++ F F+ + K TF I +V+ ++ LS Q+A R L +G S
Sbjct: 67 DQSFVKFYAIDKITFKKILKVVKP-----------YVQVSALSHPIQLAAVFRYLVTGCS 115
Query: 120 QVSVGVAFGVG--QSTVSQVTWRFIEALEER-AKHHLKWPDSNRMEEIKSK-FEESFGLL 175
+++V + V S ++++ RFI L++ + + S + +I SK F E + L
Sbjct: 116 ELAVAHDYRVRIESSMLAKILNRFIPLLQKLLCQATISIQMSRQQMQISSKNFWEKYKLP 175
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+ THI + PA SD + +++ YS+ V + + M I +PG S
Sbjct: 176 KVVACLVGTHIGIKKPAKDCSD-FLNKKGYYSLNVMLVCNDNMEIIASDATFPGSCRDSV 234
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ S +L LNG+ +I+ Y S+++TPY+ +
Sbjct: 235 IWNRSRARELLSVT--LNGH------------FILANSKYSQESFVLTPYKNAEIGTYQH 280
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSK-VMWRPDKRKLPSIILVCCLLHNI 347
TFN H R++ + LK + L + + + P + I+ VCC LHN+
Sbjct: 281 TFNLRHAQARNMVEQTIEVLKNRFLCLQRGLKYEPSFCCM--IVNVCCALHNL 331
>gi|108709891|gb|ABF97686.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 701
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 99/238 (41%), Gaps = 34/238 (14%)
Query: 132 STVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES-FGLLN-CCGAIDATHIIMT 189
STVS R + + LK D N + + ++S FG GA+D THI +T
Sbjct: 429 STVSIYFRRVLSRMVMLGSKILKPIDPNFTDIPRRLLQDSRFGPFQFAVGAVDGTHIPVT 488
Query: 190 LPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
+ V ++ + ++ + + V IV + R I GW G ++ +R+L
Sbjct: 489 V-GVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWLGSVHDNRVL------------ 535
Query: 250 QRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPY--------ETNGLSASMP--- 295
N + EE + R+Y++ GYP + PY + G A P
Sbjct: 536 ---NEAIDSYPEEFPRLPFRKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPRAPPPEGR 592
Query: 296 --TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN +H R++ + F +K W+IL + + P K IIL LHN IDS
Sbjct: 593 EEKFNYIHAKLRNIIERQFGIVKKQWKILKGIPYNPYKNVQSDIILAAFCLHNFRIDS 650
>gi|301108774|ref|XP_002903468.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262097192|gb|EEY55244.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 255
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 69/264 (26%), Positives = 110/264 (41%), Gaps = 26/264 (9%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F+ R SKKTF + + ++ V + + K+VA AL + S
Sbjct: 11 FETNLRCSKKTFLRVANFLQCQGVR-----FAAAKSKQHCYNKKVAAALYFMGSTGGYRE 65
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAI 181
VG A G+ +S VS++T + L A + +P D + I+S+F G GAI
Sbjct: 66 VGGAMGMCRSYVSEITAEVVRVLRAAAPQIVAFPRDQGGWDAIESEFAARHGYPGVVGAI 125
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D + I + P D + ++ ++ VQ IV + F+ PG + + ++S
Sbjct: 126 DGSLIEVERP--YEFDGFYCRKCYPALNVQAIVTSDNVFLSAEVR-PGSWSDRKCWQYS- 181
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY----ETNGLSASMPTF 297
+ G+ + + + + +G GY LL LI PY E LS F
Sbjct: 182 -----KIGRTVYSTIPPGA-------HFIGDAGYALLPGLIVPYSDREEGGELSPRQRQF 229
Query: 298 NSLHEATRSLAVKAFLQLKGGWRI 321
N LH +TR F KG +RI
Sbjct: 230 NFLHSSTRMAVESTFGIWKGRFRI 253
>gi|62733091|gb|AAX95208.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77549101|gb|ABA91898.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77553958|gb|ABA96754.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 396
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 97/238 (40%), Gaps = 28/238 (11%)
Query: 131 QSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES--FGLLNCCGAIDATHIIM 188
STVS ++A+ + A LK D N ++ +E+ GA+D THI +
Sbjct: 121 NSTVSVHFHHVLQAMNDLAGKILKPVDPNFLDYNPQILQENPFKPFYQSVGAVDGTHIPV 180
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEA 248
LP ++ ++ + + V I DH+ R I GWPG ++ R+L EA
Sbjct: 181 -LPNAASAMQHRNRHHITTRNVLVICDHDGRIIFCDAGWPGSVHDQRILN--------EA 231
Query: 249 GQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-------ETNGLSASMP------ 295
Q + V L Y++ G+P + PY + + + P
Sbjct: 232 VQAYPYDF----PRVPLGRYLLVDSGFPTRMGFLAPYPHVRYHRDQLAVEGAPPPVGREE 287
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
TFN H R + + F K W+IL ++ + D+ II LHN +DS D
Sbjct: 288 TFNHRHSTLRGIVERQFGIAKKMWKILKEIPYYRDEDIPARIIHAAFALHNFRLDSKD 345
>gi|20270088|gb|AAM18176.1|AC092172_36 Hypothetical protein [Oryza sativa Japonica Group]
gi|31430919|gb|AAP52771.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 396
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 97/238 (40%), Gaps = 28/238 (11%)
Query: 131 QSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES--FGLLNCCGAIDATHIIM 188
STVS ++A+ + A LK D N ++ +E+ GA+D THI +
Sbjct: 121 NSTVSVHFHHVLQAMNDLAGKILKPVDPNFLDYNPQILQENPFKPFYQSVGAVDGTHIPV 180
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEA 248
LP ++ ++ + + V I DH+ R I GWPG ++ R+L EA
Sbjct: 181 -LPNAASAMQHRNRHHITTRNVLVICDHDGRIIFCDAGWPGSVHDQRILN--------EA 231
Query: 249 GQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-------ETNGLSASMP------ 295
Q + V L Y++ G+P + PY + + + P
Sbjct: 232 VQAYPYDF----PRVPLGRYLLVDSGFPTRMGFLAPYPHVRYHRDQLAVEGAPPPVGREE 287
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
TFN H R + + F K W+IL ++ + D+ II LHN +DS D
Sbjct: 288 TFNHRHSTLRGIVERQFGIAKKMWKILKEIPYYRDEDIPARIIHAAFALHNFRLDSKD 345
>gi|40737037|gb|AAR89050.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 693
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 99/238 (41%), Gaps = 34/238 (14%)
Query: 132 STVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES-FGLLN-CCGAIDATHIIMT 189
STVS R + + LK D N + + ++S FG GA+D THI +T
Sbjct: 421 STVSIYFRRVLSRMVMLGSKILKPIDPNFTDIPRRLLQDSRFGPFQFAVGAVDGTHIPVT 480
Query: 190 LPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
+ V ++ + ++ + + V IV + R I GW G ++ +R+L
Sbjct: 481 V-GVDSAIEHMNRHDETTKNVLTIVGFDGRVIFADAGWLGSVHDNRVL------------ 527
Query: 250 QRLNGNVRISSEE---VELREYIVGGVGYPLLSWLITPY--------ETNGLSASMP--- 295
N + EE + R+Y++ GYP + PY + G A P
Sbjct: 528 ---NEAIDSYPEEFPRLPFRKYLLVDSGYPSRMGFLAPYPRVRYHKDQFKGPRAPPPEGR 584
Query: 296 --TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN +H R++ + F +K W+IL + + P K IIL LHN IDS
Sbjct: 585 EEKFNYIHAKLRNIIERQFGIVKKQWKILKGIPYNPYKNVQSDIILAAFCLHNFRIDS 642
>gi|38344718|emb|CAE05260.2| OSJNBb0115I09.22 [Oryza sativa Japonica Group]
gi|38347026|emb|CAE02554.2| OSJNBb0067G11.1 [Oryza sativa Japonica Group]
Length = 804
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 78/167 (46%), Gaps = 38/167 (22%)
Query: 174 LLNCCGAIDATHIIMTLP---AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
L NC GAID THI + +P AVQ + ++ N V + D +MRF ++ GWPG
Sbjct: 641 LDNCIGAIDGTHIQVVVPNSAAVQHRNRHEEKSQN----VMCVCDFDMRFTFVLAGWPGS 696
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSE--EVELREYIVGGVGYPLLSWLITPYETN 288
++ R+ + R S++ ++ L ++ + GYP + PY+
Sbjct: 697 VHDMRVFN--------------DAQTRFSAKFPKLPLGKFYLVDSGYPNRPGYLAPYK-- 740
Query: 289 GLS--------ASMPT-----FNSLHEATRSLAVKAFLQLKGGWRIL 322
G++ +++P FN H + R++ ++F LK WRIL
Sbjct: 741 GITYHFQEYNESTLPRGRREHFNYCHSSCRNVIERSFGVLKNKWRIL 787
>gi|322789477|gb|EFZ14761.1| hypothetical protein SINV_04990 [Solenopsis invicta]
Length = 185
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 75/189 (39%), Gaps = 46/189 (24%)
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
S G GAID HI P ++ D+ +++ +S+++QG+ +H +FID G P
Sbjct: 4 NRSRGFQGVIGAIDGCHIPCKQP-IRNPHDYYNRKGFHSIILQGVCNHRGKFIDCFIGLP 62
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G YPL+ L+TPY N
Sbjct: 63 GDS------------------------------------------AYPLMVNLMTPYREN 80
Query: 289 G-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
G L +N+ RS+ +AF LK +R L K + D +I C LHN
Sbjct: 81 GHLIGPQIQYNTKLSRIRSIIERAFGLLKTKFRRL-KYLDIADFDLGNKMIAAACTLHNF 139
Query: 348 IIDSGDQLH 356
IID GD L+
Sbjct: 140 IID-GDNLN 147
>gi|357155317|ref|XP_003577080.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 329
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 116/280 (41%), Gaps = 40/280 (14%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+QVAI L +A + ++ F T+ + +EA+ +++ P +
Sbjct: 25 VSVEEQVAIFLYAVAKNATNETLQDWFQHSLDTIHRHFKTVLEAITNLTPIYIRPPSLHP 84
Query: 161 MEEIK-SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
++ +KF F NC GA+D THI M LP Q + + +++ S D +++
Sbjct: 85 HSILRXTKFYPFFK--NCIGAVDGTHIPMKLPLDQ-QEPYRNRKQTISQNAMVACDFDLK 141
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
F+ I GW G + +R+L+ LN + ++ Y+V GY
Sbjct: 142 FVHINPGWEGSASDARVLQ-----------DALNHGFEVPDDKF----YLVDA-GYANTP 185
Query: 280 WLITPY-------ETNGLSASMPT-----FNSLHEATRSLAVKAFLQLKGGWRI---LSK 324
+ PY G + P FN H R+ ++ G W++ + K
Sbjct: 186 QFLAPYRGTRYHLNEQGRARQKPQNHKELFNLRHAQLRN----HIERIIGIWKMRFPILK 241
Query: 325 VMWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSD 363
V K K + + C +LHN I I +GD P A D
Sbjct: 242 VASHFPKEKQIDVSVACAVLHNFIRIHNGDMTWPSDATMD 281
>gi|449686684|ref|XP_004211227.1| PREDICTED: uncharacterized protein LOC101238273 [Hydra
magnipapillata]
Length = 286
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/252 (21%), Positives = 109/252 (43%), Gaps = 35/252 (13%)
Query: 63 FKYFFRVSKKTFDYIC-----SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
FKYF R+S T++ + +V++ R P +S+ +++A+ LR L +G
Sbjct: 33 FKYF-RMSPTTYEELLLFVAPIIVKQRTTMRDP----------VSLSERLAVTLRFLVTG 81
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEA----LEERAKHHLKWPDSNRMEEIKSKFEESFG 173
++Q ++ ++ + ST+S++ A L+ER H+ + + I +FE +
Sbjct: 82 DAQCTIAASYRISASTISRIISETCAAIWTSLKERNFLHVP-SEKQEWKTIAKEFENMWN 140
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
L+ GAID THI+M P ++ + + +S+++ + + + +F + G G +
Sbjct: 141 FLHAIGAIDGTHIVMQAPH-NGGSEYFNYKKTHSIVLLAVCNAKYKFTTVELGDSGRQSD 199
Query: 234 SRLLKFSGFFKLCEAGQRLNGNV-------RISSEEVELREYIVGGVGYPLLSWLITPYE 286
+ F C G + N I + E L +V + L ++ PY
Sbjct: 200 GSV------FNNCSLGYGIENNKLNIPDPEYIGNSEKVLPYVLVADNAFGLKRHMMKPYP 253
Query: 287 TNGLSASMPTFN 298
+ FN
Sbjct: 254 NQNIPLDQKIFN 265
>gi|225463187|ref|XP_002267524.1| PREDICTED: putative nuclease HARBI1-like [Vitis vinifera]
Length = 391
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 123/299 (41%), Gaps = 44/299 (14%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ K F +C +R+ + R +G++ +E+Q+AI L + E +
Sbjct: 57 FRMDKHVFHKLCDTLRQRGMLRDTAGVM--------IEEQLAIFLNIIGHNERNRVIQER 108
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEI--KSKFEESFGLLNCCGAIDAT 184
F T+S+ ++A++ ++ L+ P + EI ++F F +C G ID
Sbjct: 109 FQHSGETISRHFNNVLKAIKSLSREFLQPPPLSTPPEILRNNRFYPYFK--DCIGVIDGM 166
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIV---DHEMRFIDIVTGWPGGMNVSRLLKFSG 241
HI +PA D N +L Q ++ +++FI + GW G + SR+L+
Sbjct: 167 HIPAHVPA----KDQSRFRNRKGILSQNVLAACTFDLQFIFVYPGWEGSVADSRVLRAV- 221
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE-----------TNGL 290
L + Q + +Y + GY + I PY+ N L
Sbjct: 222 ---LDDPDQNF--------PNIPEGKYYLVDSGYSNIEGFIAPYQGVRYHLHEYRGANQL 270
Query: 291 SASM-PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
S FN H R+ ++F LK + IL K+ + I++ C++HN I
Sbjct: 271 PRSAKELFNHRHAFLRNAIQRSFDVLKARFPIL-KLAPQYAFHIQRDIVIAACVIHNHI 328
>gi|222617885|gb|EEE54017.1| hypothetical protein OsJ_00677 [Oryza sativa Japonica Group]
Length = 305
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 26/162 (16%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPP 90
DS+WW S+P P E F+ FR+SK TF+ +C ++ +ED + R
Sbjct: 111 DSEWWDMV-----SSP--DYPDSE---FRKVFRMSKATFEVVCDELAAAVAKEDTMLRAA 160
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK 150
+ V K+VA+ + RLA+GE V FG+G ST ++ AL+
Sbjct: 161 ----------IPVRKRVAVCVWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAALKAMVM 210
Query: 151 HH-LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
++WP++ I + FE G+ GAI THI + P
Sbjct: 211 PKVVRWPEAGDAAAIAAHFEAISGISGVVGAIYTTHIPIIAP 252
>gi|357116794|ref|XP_003560162.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 406
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 128/297 (43%), Gaps = 39/297 (13%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R++K+ F +C+L+RE GL + ++VEK+VA+ L + G + V
Sbjct: 80 LRLTKRNFHDLCTLLRE------KCGLK--DSIYVTVEKKVAMFLLVVGHGLKMRLLCVT 131
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ T+S+ + A+ +K PD + K++ FG N GA+D H+
Sbjct: 132 YKRSLETISRHFSTVLSAILSLTNEFIKLPDPSIPPPNDYKWK-WFG--NALGALDGCHV 188
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS------ 240
+ + V + +++ + + + G+VD M+ + ++ GW G SR+L+ +
Sbjct: 189 DVMVD-VANQGRYRNRKQSITTNMLGVVDWNMKILYVLPGWEGSTLDSRVLRDAMRPNRQ 247
Query: 241 -------GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
G F L +AG NG +S + Y L W+ + + + +
Sbjct: 248 DTFVVPKGKFYLVDAGY-TNGPGFLSPFQ---------STRYHLKEWVSSQQQPH---TA 294
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
+N H R++ + F K W IL + + D + II CC+LHN I D
Sbjct: 295 KELYNLRHSRARNVVERTFGLWKKKWAIL-RTQFFFDIKDQIRIINACCVLHNFIRD 350
>gi|331242269|ref|XP_003333781.1| terpenoid cylase/protein prenyltransferase alpha-alpha toroid
[Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 432
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 125/297 (42%), Gaps = 18/297 (6%)
Query: 98 GRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKW 155
GRL S ++V A R LA G S S+ + +ST + F AL + A+ +L+
Sbjct: 107 GRLGFSPLQKVTAAFRLLAYGCSADSIDEYLRISESTALESLKHFCSALVKIYAEEYLRS 166
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
P++N + +I + E G G++D H V + +E +++++ +
Sbjct: 167 PNANDLAKILA-VSEKRGFPGMMGSLDCMHWGWKNCPVADHGQYSGKEKEPTVILEAVAT 225
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
H++ G PG +N +L S F+ C+ G + ++ L Y+ + Y
Sbjct: 226 HDLWIWHAFFGLPGTLNDINVLDRSPIFQQCQDGVNPSFEYSVNGNRYNLGYYLTDSI-Y 284
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK--VMWR-PDKR 332
P + LI +N + + EA R +AF L+ + I+ +W+ PD
Sbjct: 285 PKYASLIQSI-SNPQGKKNKHYAKMQEAYRKDVERAFGVLQARYAIIRYPGRLWKHPD-- 341
Query: 333 KLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
L I+ +LHN+ ++ D A SD + +Q + +GR N +
Sbjct: 342 -LCIIMKTVIILHNMTVE-------DEAGSDFEEDFDYDQNARTQATIGRDNSANAD 390
>gi|28411822|dbj|BAC57297.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 396
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 96/238 (40%), Gaps = 28/238 (11%)
Query: 131 QSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES--FGLLNCCGAIDATHIIM 188
STVS ++A+ + A LK D N ++ +E+ GA+D THI +
Sbjct: 121 NSTVSVHFHHVLQAMNDLAGKILKPVDPNFLDYNPQILQENPFKPFYQSVGAVDGTHIPV 180
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEA 248
LP ++ ++ + + V I DH+ R I GWPG ++ R+L EA
Sbjct: 181 -LPNAASAMQHRNRHHITTRNVLVICDHDGRIIFCDAGWPGSVHDQRILN--------EA 231
Query: 249 GQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-------ETNGLSASMP------ 295
Q + V L Y++ G+P + PY + + + P
Sbjct: 232 VQAYPYDF----PRVPLGRYLLVDSGFPTRMGFLAPYPHVRYHRDQLAVEGAPPPVGREE 287
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
TFN H R + + F K W+IL ++ + D II LHN +DS D
Sbjct: 288 TFNHRHSTLRGIVERQFGIAKKMWKILKEIPYYRDDDIPARIIHAAFALHNFRLDSKD 345
>gi|331239185|ref|XP_003332246.1| hypothetical protein PGTG_14542 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 447
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 69/306 (22%), Positives = 122/306 (39%), Gaps = 48/306 (15%)
Query: 65 YFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS---GESQV 121
YFF +S+ Y+ L+ + P N Q+A+A+ RL S G +
Sbjct: 37 YFFSMSQLQKRYL-------LIEQNPIFYNNSRNPQRDPPIQIAVAVCRLGSNGNGSAIY 89
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGA 180
+ F VG T+ T R I A+ + WP +S ++E + EE G C G
Sbjct: 90 RLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTESEQIESSQVMREE--GFPGCVGF 147
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
+D T I ++ + + D++ YS+ V I D +FI + G+PG + + +
Sbjct: 148 VDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFISYLAGFPGSSHDAYV---- 203
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
F + Q+ + + +++++ Y +++ ++ L F SL
Sbjct: 204 --FSHMQVAQQ-------PEKYFDRKQFLLADSAYTNDCYVVPAFKGKQL-LKRRRFASL 253
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQ---LHP 357
E Q++ + V W I+ C +LHN++ D DQ L+
Sbjct: 254 REIR--------TQIRNAEEMKGAVKW----------IVTCIVLHNLLADLKDQWNDLYE 295
Query: 358 DVALSD 363
D A D
Sbjct: 296 DEAPED 301
>gi|348690347|gb|EGZ30161.1| hypothetical protein PHYSODRAFT_392215 [Phytophthora sojae]
Length = 258
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 61/273 (22%), Positives = 117/273 (42%), Gaps = 26/273 (9%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL--EERAKHHLKWPDSNR- 160
+++VA+AL L F V S + W+ ++ L + P S+
Sbjct: 1 QERVAVALYYLTHCIELEQAAEKFDVTPSAAERYVWQVVDVLLSDGVKAQCFTSPSSDEG 60
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
+ FE G NCC A+ + + P + + W ++ + VQ +VD + R
Sbjct: 61 WSRLSDGFEAICGYPNCCLAVGGMLVEIERP--RRWEGWYCRKRFPAENVQLVVDAQCR- 117
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
+ + PGG+ L++S F G+ L +E + +++VG GY L
Sbjct: 118 VRAMDVRPGGVTDKETLRYSRF------GRSL-------AEILPEGKHVVGHAGYALSKH 164
Query: 281 LITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR------PDKRK 333
++ PY + ++ + FN+L T+ + + +K ++IL K + + +
Sbjct: 165 VVVPYPVDSSMTEAEELFNTLQSTTQRVVKRTEEMIKKRFQILQKPLRQKTGEGDAATTQ 224
Query: 334 LPSIILVCCLLHNIIIDSGDQLHPDVALSDHHD 366
+ +I +LHNI+++ GD + D +L D D
Sbjct: 225 MAQVISTAIVLHNILVELGDLVEVDSSLEDDTD 257
>gi|115770499|ref|XP_001184795.1| PREDICTED: uncharacterized protein LOC754204 [Strongylocentrotus
purpuratus]
Length = 342
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 60/268 (22%), Positives = 116/268 (43%), Gaps = 11/268 (4%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNR-MEE 163
++++ L LA+G + + F VG+ T+ + A+ +E A + P +N E
Sbjct: 54 KLSVFLHHLATGATYAELSYNFRVGKETIQKFVPDVARAVVDEYAAEVISLPTTNEGCLE 113
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ FE + L +C GA D HI + P ++ + + + +S+++ +VD + +F I
Sbjct: 114 VAGYFEARWNLPHCLGAYDGKHIRLQKPN-KSGSLYFNYKQFFSVVLMALVDSKYQFPWI 172
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRL---NGNVRISSEEVELREYIVGGVGYPLLSW 280
G G + +++ S + EAG + EE + + VG + + ++
Sbjct: 173 DVGGVGHQSDAQIYNNSELKECIEAGTLGIPDPAPLPQDDEEHPMPYFSVGDDAFAMRTY 232
Query: 281 LITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR-ILSKVMWRPDKRKLPSIIL 339
++ PY + FN R + AF L ++ L ++ PD +L +I
Sbjct: 233 MMKPYGLCNMDKQQKIFNYRLSRARRVVENAFGILALRFQCFLGQMRQEPDTVRL--LIE 290
Query: 340 VCCLLHNIIIDSGDQLHPDVALSDHHDS 367
+ HN+I L DV + D D+
Sbjct: 291 AAVMPHNLIRKHYQAL--DVRMLDQEDA 316
>gi|55771324|dbj|BAD72249.1| unknown protein [Oryza sativa Japonica Group]
gi|55771333|dbj|BAD72258.1| unknown protein [Oryza sativa Japonica Group]
Length = 288
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 26/162 (16%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPP 90
DS+WW S+P P E F+ FR+SK TF+ +C ++ +ED + R
Sbjct: 94 DSEWWDMV-----SSP--DYPDSE---FRKAFRMSKATFEVVCDELAAAVAKEDTMLRAA 143
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK 150
+ V K+VA+ + RLA+GE V FG+G ST ++ AL+
Sbjct: 144 ----------IPVRKRVAVCVWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAALKAMVM 193
Query: 151 HH-LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
++WP++ I + FE G+ GAI THI + P
Sbjct: 194 PKVVRWPEAGDAAAIAAHFEAISGISGVVGAIYTTHIPIIAP 235
>gi|331251333|ref|XP_003338265.1| hypothetical protein PGTG_19700 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 443
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 110/258 (42%), Gaps = 24/258 (9%)
Query: 106 QVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRM 161
Q+AIA+ RL S G + + F VG T+ T R I A+ + WP S R+
Sbjct: 161 QIAIAVCRLGSNGNGSAIYRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTKSERI 220
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
E + EE F C G +D T I ++ + + D++ YS+ V I D +FI
Sbjct: 221 ESSQVMREEGFP--GCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFI 278
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
+ G+PG + S + F + Q+ + + +++++ Y ++
Sbjct: 279 SYLAGFPGSSHDSYV------FSHMQVAQQ-------PEKYFDQKQFLLADSAYTNDRYV 325
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVK-AFLQLKGGWRILSKVMWR----PDKRKLPS 336
+ ++ L + H A + ++ A LKG + L ++ R + +
Sbjct: 326 VPAFKGKQLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLQEIGTRIRNAEEMKGAVK 385
Query: 337 IILVCCLLHNIIIDSGDQ 354
I+ C +LHN++ D DQ
Sbjct: 386 WIVTCIVLHNLLADLKDQ 403
>gi|356527700|ref|XP_003532446.1| PREDICTED: uncharacterized protein LOC100787308 [Glycine max]
Length = 380
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 88/172 (51%), Gaps = 12/172 (6%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RVSKK F +C +++E G + ++ + + +++ VA+ L LA V +
Sbjct: 82 LRVSKKAFFKLCRILQE-------KGQL-VKTKNVPIDEAVAMFLHILAHNLKYRVVHFS 133
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ T+S+ + A+ + +K +LK+ + N +++K+ N GA+D HI
Sbjct: 134 YCRSMETISRQFKNVLRAIMKVSKEYLKFYEYNLEGSVENKWR---WFKNSIGALDGIHI 190
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+T+ A + +C+++ + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 191 PVTVSA-EDRPRYCNRKGDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLR 241
>gi|449689832|ref|XP_004212160.1| PREDICTED: uncharacterized protein LOC101241210, partial [Hydra
magnipapillata]
Length = 164
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 79/152 (51%), Gaps = 14/152 (9%)
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G ID THI + P +++S D+ + + +S+ VQ I D + F+D+ WPG ++ +++
Sbjct: 6 GFIDGTHIPIRRP-LESSQDYFNYKGFHSISVQAICDSKGIFMDVDCKWPGSLHDAKVFS 64
Query: 239 FSGF-FKLCEAGQRLNGNVRISSEEV-----ELREYIVGGVGYPLLSWLITPYETNGLSA 292
S F FK+ ++ ++ I+ E+ ++ Y++ YPL ++ + Y + +
Sbjct: 65 NSTFNFKI------ISNSIPITYRELLPGFCKVPCYVIDDPAYPLSTFCLKEY-LHCQNK 117
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
+ FN+L R+ AF +LK W IL K
Sbjct: 118 NQVVFNTLLRTARNQIECAFGRLKARWAILKK 149
>gi|390366216|ref|XP_003730990.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 288
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 88/185 (47%), Gaps = 18/185 (9%)
Query: 53 ATIPSDEEEGFKYFFRVSKKTFDYICS-----LVREDLVSRPPSGLINIEGRLLSVEKQV 107
A + ++++ +K F R++ F+ + L ++ R P L V ++
Sbjct: 68 AELHREDQKSYKNFLRITPDLFEEMVGKLTPHLEKKTTFMREP----------LEVGLKL 117
Query: 108 AIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDS-NRMEEIK 165
A+ LR LA+G S ++ +F V +ST+S+ AL E K L P + + +++
Sbjct: 118 AVTLRFLATGNSYATLQYSFRVEKSTISRFIPEVCNALIEVYKKEVLSCPKTEDGWKDVA 177
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
KF + NC GA+D H+ + P ++ + + + +S+++ + D E +F+ +
Sbjct: 178 KKFSSRWNYHNCLGAVDGKHVAIRKPP-KSGSLYFNYKGFHSIVLMAVADAEYKFLYVDV 236
Query: 226 GWPGG 230
G GG
Sbjct: 237 GAEGG 241
>gi|331241303|ref|XP_003333300.1| hypothetical protein PGTG_14220 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 74/324 (22%), Positives = 136/324 (41%), Gaps = 33/324 (10%)
Query: 45 AKNSST---PGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLL 101
A+NS+T P + +E F ++ + TF +LV+ L+ + P N
Sbjct: 51 ARNSTTRFDPRYDLSELQELPELSFVQMFRMTFPCFLNLVQ--LIEQNPIFYNNSRNPQR 108
Query: 102 SVEKQVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
Q+A+A+ RL S G + + F VG T+ T R I A+ + WP +
Sbjct: 109 DPPIQIAVAVCRLGSNGNGSAIYRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTE 168
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S R+E + EE F C G +D T I ++ + + D++ YS+ V I D
Sbjct: 169 SERIESSQVMREEGFP--GCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDIN 226
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+FI + G+PG + + + F + Q+ + + +++++ Y
Sbjct: 227 KKFISYLAGFPGSSHDAYV------FSHMQVAQQ-------PEKYFDQKQFLLADSAYTN 273
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVK-AFLQLKGGWRILSKVMWRPDKRKLPS 336
+++ ++ L + H A + ++ A LKG + L ++ R R
Sbjct: 274 NRYVVPAFKGKQLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREI--RTQIRNAEE 331
Query: 337 I------ILVCCLLHNIIIDSGDQ 354
+ I+ C +LHN++ D DQ
Sbjct: 332 MKGAVKWIVTCIVLHNLLADLKDQ 355
>gi|115434978|ref|NP_001042247.1| Os01g0186900 [Oryza sativa Japonica Group]
gi|113531778|dbj|BAF04161.1| Os01g0186900 [Oryza sativa Japonica Group]
gi|218187652|gb|EEC70079.1| hypothetical protein OsI_00698 [Oryza sativa Indica Group]
Length = 302
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 26/162 (16%)
Query: 36 DSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYIC-----SLVREDLVSRPP 90
DS+WW S+P P E F+ FR+SK TF+ +C ++ +ED + R
Sbjct: 108 DSEWWDMV-----SSP--DYPDSE---FRKAFRMSKATFEVVCDELAAAVAKEDTMLRAA 157
Query: 91 SGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK 150
+ V K+VA+ + RLA+GE V FG+G ST ++ AL+
Sbjct: 158 ----------IPVRKRVAVCVWRLATGEPLRLVSKRFGLGISTCHKLVLEVCAALKAMVM 207
Query: 151 HH-LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
++WP++ I + FE G+ GAI THI + P
Sbjct: 208 PKVVRWPEAGDAAAIAAHFEAISGISGVVGAIYTTHIPIIAP 249
>gi|390367614|ref|XP_785880.2| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 248
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 88/198 (44%), Gaps = 30/198 (15%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S++ Y+ L+R DL P+G R L Q+ +AL ASG +VG
Sbjct: 38 YRFSRRGCIYLIHLLRIDL--EHPTG----RSRALPASLQIFLALAFFASGALLTTVGTM 91
Query: 127 FGVGQSTVSQVTWRFIEALEER------AKHHL----------------KWPDSNR-MEE 163
G+ ++ S+ R +AL R + HL K+P S R +
Sbjct: 92 HGISIASTSRAIRRVTKALFRRRNQSNYSYRHLVFALFGTAHSFLYQFIKFPKSERDVLH 151
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD-WCDQENNYSMLVQGIVDHEMRFID 222
I+ +F G G ID TH+ + ++ + + +++N +S+ VQ I + +
Sbjct: 152 IQEQFFAVAGFPKVVGVIDCTHVHLHGSKLKPIEHLYVNRKNRHSINVQLICGPDFLISN 211
Query: 223 IVTGWPGGMNVSRLLKFS 240
+V WPG + SR+L+ S
Sbjct: 212 VVARWPGSTHDSRILQMS 229
>gi|338808398|gb|AEJ07903.1| putative PIFa transposase [Zea mays subsp. mexicana]
Length = 558
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 64/269 (23%), Positives = 116/269 (43%), Gaps = 40/269 (14%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R++K++F + +++RE SGL + +SVE+++AI L + +
Sbjct: 49 LRLTKRSFSDLSAILRE------KSGLQDTLN--VSVEEKLAIFLLIVGHNTKMRLIRST 100
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM--EEIKSKFEESFGLLNCCGAIDAT 184
+G +S+ + + + +K P+ E+ K K+ E +C GA+D T
Sbjct: 101 YGWSLEPISRHFNEVLRGILSLSHEFIKLPNPETTLPEDPKWKWFE-----DCLGALDGT 155
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF----- 239
HI + +P + +++ + V G+ D +M+F+ ++ GW G + SR+L+
Sbjct: 156 HIDVNVPLTDQGR-YRNRKQRITTNVLGVCDRQMKFLYVLAGWEGSASDSRILRDAMSRE 214
Query: 240 ------SGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
SG + L +AG NG ++ Y L W I + N S +
Sbjct: 215 DSFVVPSGKYYLVDAGY-TNGPGFLAPYR---------STRYHLNEWAI---QGNNPSTA 261
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
FN H R++ + F LK W IL
Sbjct: 262 RELFNLRHATARNVIERTFGLLKMRWAIL 290
>gi|359476086|ref|XP_003631787.1| PREDICTED: uncharacterized protein LOC100853519 [Vitis vinifera]
Length = 414
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 87/206 (42%), Gaps = 36/206 (17%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV---DHEMRFIDIVTGWPGGMN 232
+C GAID THI +PA D +++ Q ++ + +M F + GW G N
Sbjct: 175 DCIGAIDGTHISAWVPA----DRQTSFRGRKTVITQNVMCACNFDMMFTFVYAGWEGTAN 230
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE------ 286
+R+ +A R N SE +Y V GYP +S + PY
Sbjct: 231 DARVF--------LDALTRPEVNFPWPSEG----KYYVVDSGYPCISGFLPPYRGERYHL 278
Query: 287 ------TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKRKLPSIIL 339
N FN H + R++ + F LK + IL + ++P ++ PSI++
Sbjct: 279 QEYWGRHNQPIRYKELFNYRHSSLRNIIERCFGVLKTRFPILRMMPCYKPSRQ--PSIVV 336
Query: 340 VCCLLHNIIIDS--GDQLHPDVALSD 363
CC LHN I S DQL + + D
Sbjct: 337 ACCTLHNWIRLSTRNDQLFREYEVED 362
>gi|108862658|gb|ABA98143.2| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 581
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 88/192 (45%), Gaps = 23/192 (11%)
Query: 176 NCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
+C GAID THI ++ V++S + ++++ + V VD ++RF ++ GW G + +
Sbjct: 80 DCIGAIDGTHIRASVRKNVESS--FRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTAHDA 137
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE-------- 286
+L+ +A +R NG +R+ +Y + GY + P+
Sbjct: 138 VVLR--------DALERENG-LRVPQGNRLQGKYYLVDAGYGAKQGFLPPFRAVRYHLNE 188
Query: 287 --TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
N + FN H + R +AF LK +++L R I++ CC++
Sbjct: 189 WGNNPVQNEKELFNLRHSSLRVTVERAFGSLKRRFKVLDDATPFFPFRTQVDIVVACCII 248
Query: 345 HNIIIDSG-DQL 355
HN +++ G D+L
Sbjct: 249 HNWVVNDGIDEL 260
>gi|40786599|gb|AAR89874.1| expressed protein [Oryza sativa Japonica Group]
Length = 608
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 89/192 (46%), Gaps = 28/192 (14%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID TH+ ++P S + ++N+ + V VD ++RF ++ GW G +
Sbjct: 139 DCIGAIDGTHVRASVPKNMESS-FRGRKNHATQNVMAAVDFDLRFTYVLAGWEGAAHDVV 197
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET-------- 287
+L+ +A +R NG V ++ + VGY + P+ +
Sbjct: 198 VLR--------DALERENG------LHVPQGKFYLVDVGYGAKPGFLPPFRSTRYHLNEW 243
Query: 288 --NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLL 344
N + FN H + R +AF LK ++IL + P + ++ +I++ CC++
Sbjct: 244 GNNPVQNEKELFNLRHSSLRITVERAFGSLKRRFKILDDATPFFPFQTQV-NIVVACCII 302
Query: 345 HN-IIIDSGDQL 355
HN +I D D+L
Sbjct: 303 HNWVINDDIDEL 314
>gi|26334701|dbj|BAC31051.1| unnamed protein product [Mus musculus]
Length = 244
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/152 (25%), Positives = 69/152 (45%), Gaps = 2/152 (1%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
R +S E Q+ AL SG Q +G A G+ Q+++S+ EAL ERA + +P D
Sbjct: 65 RAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPVD 124
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
++ +K +F G+ G D H+ + P + + +++ +S+ + D
Sbjct: 125 EAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDL-SYVNRKGLHSLNCLVVCDIR 183
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
+ + T WPG + +L+ S E G
Sbjct: 184 GALMTVETSWPGSLQDCAVLQRSSLTSQFETG 215
>gi|293335311|ref|NP_001168109.1| uncharacterized protein LOC100381847 [Zea mays]
gi|223946053|gb|ACN27110.1| unknown [Zea mays]
Length = 435
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 64/294 (21%), Positives = 114/294 (38%), Gaps = 8/294 (2%)
Query: 63 FKYFFRVSKKTFDYICSLV-REDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
F+ FR+ + F I V RED + G L ++V ALR LA G
Sbjct: 78 FRRRFRMHRPLFLRILRAVQREDEYFTIRCDATGLAG--LGPLQKVCAALRILAYGLPTD 135
Query: 122 SVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
+V +GQ+T RF A+ ++ +L+ P+ + + I + G G+
Sbjct: 136 AVDEYIQIGQTTAKDCLIRFCRAIISSFSERYLRIPNHDDIARIL-RVNADRGFPGMLGS 194
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID +C + + +M+++ + +++ G PG N +L S
Sbjct: 195 IDCMQWEWRNCPTAWRGQFCGRNSRPTMILEAVAGYDLWIWHAFFGMPGTNNDLNVLHRS 254
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
F G + I+ Y+ G+ YP + + F +
Sbjct: 255 PVFDPLRNGTMPPVHFTINGTTYNFGYYLADGI-YPNWPTFVKAIR-HAFEEKKVHFTTK 312
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII-DSGD 353
E+ R +AF L+ W +L + D+ L ++ C ++HN+I+ D GD
Sbjct: 313 QESCRKDIERAFGVLQARWAVLRGPAYGWDRNHLAEMMTACIIMHNMIVEDEGD 366
>gi|52077373|dbj|BAD46413.1| ribosomal protein-like [Oryza sativa Japonica Group]
Length = 373
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 114/272 (41%), Gaps = 14/272 (5%)
Query: 95 NIEGRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFI-EALEERAKHH 152
N G+L S + +AL+ LA G + +G+ST + FI ++ K
Sbjct: 106 NAAGQLGFSALHKCTVALKMLAYGGPANELDDHLKMGESTALKTLKEFIMTVIKVFGKEF 165
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
L+ P S +E I S + G G+ID H + + + +++++
Sbjct: 166 LRPPRSEEIEHILS-INLARGFPGMIGSIDCMHWEWSSCLTGWQGMYRGHKGKPTLILEA 224
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+ ++R G PG N +L S F G+ + ++ L Y+ G
Sbjct: 225 VATEDLRIWHAYFGLPGSHNDINVLHRSNVFDDVPNGRAPSVEFHVNDNIYSLGYYLADG 284
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV--MWRPD 330
+ YP + L+ +S F+ E+ R +AF L+ W+IL + +W P
Sbjct: 285 I-YPDWATLVKSIALP-VSNKQKVFSEQQESCRKDVERAFGVLQAKWKILHRPARLWSP- 341
Query: 331 KRKLPSIILVCCLLHNIIIDS--GDQL---HP 357
+ L SI+ C +LHN++++ G QL HP
Sbjct: 342 -KVLNSIMRACVILHNMVVEDERGVQLPTVHP 372
>gi|331225010|ref|XP_003325176.1| hypothetical protein PGTG_06713 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|331246035|ref|XP_003335652.1| hypothetical protein PGTG_16424 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 75/324 (23%), Positives = 136/324 (41%), Gaps = 33/324 (10%)
Query: 45 AKNSST---PGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLL 101
A+NS+T P + +E F ++ + TF +LV+ L+ + P N
Sbjct: 51 ARNSTTRFDPRYDLSELQELPELSFVQMFRMTFPCFLNLVQ--LIEQNPIFYNNSRNPQR 108
Query: 102 SVEKQVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
Q+A+A+ RL S G + + F VG T+ T R I A+ + WP +
Sbjct: 109 DPPIQIAVAVCRLGSNGNGSAIHRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTE 168
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S R+E + EE F C G +D T I ++ + + D++ YS+ V I D
Sbjct: 169 SERIESSQVMREEGFP--GCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDIN 226
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+FI + G+PG + + + F + Q+ + + +++++ Y
Sbjct: 227 KKFISYLAGFPGSSHDAYV------FSHMQVAQQ-------PEKYFDRKQFLLADSAYTN 273
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVK-AFLQLKGGWRILSKVMWRPDKRKLPS 336
++I ++ L + H A + ++ A LKG + L ++ R R
Sbjct: 274 DRYVIPAFKGKHLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREI--RTQIRNAEE 331
Query: 337 I------ILVCCLLHNIIIDSGDQ 354
+ I+ C +LHN++ D DQ
Sbjct: 332 MKGAVKWIVTCIVLHNLLADLKDQ 355
>gi|77551917|gb|ABA94714.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 645
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 75/190 (39%), Gaps = 39/190 (20%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+ GAID +HI + + +T C S V I + +MRFI V GWPG + SR
Sbjct: 426 DAIGAIDGSHISVVVLLDETISHTC-HHGYTSQNVLAIYNFDMRFIFAVAGWPGSAHDSR 484
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+L S + + GYP + P++ G + +P
Sbjct: 485 IL---------------------SHALANFPSFPMPPTGYPNRIGYLAPFK--GTTYHIP 521
Query: 296 --------------TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
FN LH + R++ ++F LK WRIL + + II+ C
Sbjct: 522 EFRHRSGPPQGKYEVFNFLHSSLRNVIERSFGVLKQKWRILKGIPSFSHTTQ-KHIIMAC 580
Query: 342 CLLHNIIIDS 351
LHN + DS
Sbjct: 581 LALHNFVRDS 590
>gi|357139633|ref|XP_003571385.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 361
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 115/279 (41%), Gaps = 38/279 (13%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+QVAI L +A + ++ F T+ + R +EA+ +++ P +
Sbjct: 53 VSVEEQVAIFLYAVAKNATNETLQDWFQHSPDTIHRHFKRVLEAITNLTPIYIRPPSLHP 112
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
++ F NC GA+D THI M L ++ + + +++ S D ++RF
Sbjct: 113 HSILRKPKFYPFSK-NCIGAVDGTHIPMKL-SLDQQEPYRNRKQTISQNAMVASDFDLRF 170
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
+ I GW G + +R+L+ + L + L+G Y+V GY
Sbjct: 171 VHINPGWEGSASDARVLQDA----LNHGFEVLDGKF-----------YLVDA-GYANTPQ 214
Query: 281 LITPYE-------TNGLSASMP-----TFNSLHEATRSLAVKAFLQLKGGWRI---LSKV 325
+ PY G + P FN H R+ ++ G W++ + KV
Sbjct: 215 FLAPYRGTRYHLNEQGRARHKPXNHKEXFNLRHAQPRN----HIERIIGIWKMRFPILKV 270
Query: 326 MWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSD 363
K K I + C +LHN I I +GD P A D
Sbjct: 271 ASHFPKEKQIDISVACAVLHNFIRIHNGDMTWPSDATMD 309
>gi|357129612|ref|XP_003566455.1| PREDICTED: uncharacterized protein LOC100843647 [Brachypodium
distachyon]
Length = 499
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 118/280 (42%), Gaps = 40/280 (14%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+QVAI L +A + ++ F T+ + R +EA+ +++ P +
Sbjct: 191 VSVEEQVAIFLYAVAKNATNETLQDWFQHSPDTIHRHFKRVLEAITNLTPIYIRPPSLHP 250
Query: 161 MEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ K KF F NC GA+D THI M LP Q + + +++ S D +++
Sbjct: 251 HSIMRKPKFYPFFK--NCIGAVDGTHIPMKLPLDQ-QEPYRNRKQTISQNAMVACDFDLK 307
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
F+ I GW G + +R+L+ + L + L+G + Y+V GY
Sbjct: 308 FVHINPGWEGSASDARVLQDA----LNHGFEVLDG-----------KFYLVDA-GYANTP 351
Query: 280 WLITPYETN-------GLSASMPT-----FNSLHEATRSLAVKAFLQLKGGWRI---LSK 324
+ PY + G + P FN H R+ ++ G W++ + K
Sbjct: 352 QFLAPYRSTRYHLNEQGRARQKPQNHKELFNLRHAQLRN----HIERIIGIWKMRFPILK 407
Query: 325 VMWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSD 363
V K K I + C +LHN I I + D P A D
Sbjct: 408 VASHFPKEKQIDISVACAVLHNFIRIHNLDMTWPSDATMD 447
>gi|115466608|ref|NP_001056903.1| Os06g0164500 [Oryza sativa Japonica Group]
gi|55296134|dbj|BAD67852.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113594943|dbj|BAF18817.1| Os06g0164500 [Oryza sativa Japonica Group]
Length = 370
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 66/149 (44%), Gaps = 14/149 (9%)
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE 267
+ Q + R + + G+ G +LK S ++ E G+ L+ +
Sbjct: 185 LAAQLVAGASSRVLSLAAGFRGDRTDLEVLKLSSLYQELEQGKVLDHG-----------Q 233
Query: 268 YIVG-GVGYPLLSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV 325
Y+ G G GYPLL WL+ P+ + S FN+ H+AT A + LK GW ++++
Sbjct: 234 YLAGDGDGYPLLPWLMVPFRGPAVPGSPEAEFNAAHDATCRKARRTVRSLK-GWGAIARL 292
Query: 326 MWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
R + I C +LHN+++ D
Sbjct: 293 RDEESPRAAVACIGTCAMLHNVLLAREDH 321
>gi|195386258|ref|XP_002051821.1| GJ10253 [Drosophila virilis]
gi|194148278|gb|EDW63976.1| GJ10253 [Drosophila virilis]
Length = 439
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 119/298 (39%), Gaps = 32/298 (10%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
++ F+ ++ + FD + SLV E L I E RL + L L++ E
Sbjct: 90 QDEFRLRTGMTLQVFDMLFSLVGEQLNKNSIRRPIPPECRLF-------LTLVYLSNAER 142
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
+ + AF +G STV + E L A L P+ I + ++ ++ + +C
Sbjct: 143 VLDLSKAFKMGSSTVRCIVQETSEVLWNTLAPAFLPQPNVEDWSNIVAGYQATWQVPHCL 202
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GA+DA + ++ + EN+ +L + +DI T V+ L
Sbjct: 203 GAVDAKLLDISYKS----------ENDMILLASCDANLNFTCLDIATVTDA---VNNQLH 249
Query: 239 FSGFFKLCEAGQRLNGNVRI-----SSEEVELREYIVGGVGYPLLSWLITPYETNGLSAS 293
+ F Q+L + + ++ + G +P+L L+TPY T L
Sbjct: 250 WHQDFGAALLSQQLGDLLPAEPLPEAEQKTNFIYTFIAGASFPMLEHLVTPYVTQRLDQD 309
Query: 294 MPTFNSLHEATRSLA---VKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN +R+LA KAF L WR+LS + R II +LHN +
Sbjct: 310 QLHFN--QHLSRALAGSIDKAFSVLMSRWRVLSLPL-RQSPTNAEKIIRAAVVLHNFV 364
>gi|15987055|gb|AAL11884.1|AF412282_1 putative transposase [Zea mays]
Length = 298
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 86/177 (48%), Gaps = 22/177 (12%)
Query: 67 FRVSKKTFDYICSLVRE--DLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG 124
R++K++F +C+++RE D+ +N+ SVE++VAI L + G +
Sbjct: 69 LRLTKRSFSDLCTILRERCDMCD-----TLNV-----SVEEKVAIFLLVVGHGTKMRMIR 118
Query: 125 VAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSK---FEESFGLLNCCGAI 181
++G +S+ + + +K PD ++ SK FE+ C GA+
Sbjct: 119 SSYGWSLEPISRYFNEVLRGVLSLCHEFIKLPDPLAVQPEDSKWRWFED------CLGAL 172
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
D THI + +P + + +++ + V G+ D M+F+ ++ GW G + SR+L+
Sbjct: 173 DGTHIDVFVP-LADQGRYRNRKQQITTNVLGVCDRHMKFVYVLAGWEGSASDSRVLR 228
>gi|357168077|ref|XP_003581471.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 333
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 113/280 (40%), Gaps = 40/280 (14%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+QVAI L +A + ++ F T+ + R +EA+ +++ P +
Sbjct: 25 VSVEEQVAIFLYAVAKNATNETLQDWFQHSPDTIRRHFKRVLEAITNLTPIYIRPPSLHP 84
Query: 161 MEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ K KF F NC G +D TH M LP Q + + +++ S D +++
Sbjct: 85 HSILRKPKFYPFFK--NCIGDVDGTHTPMKLPLDQ-QEPYRNRKQTISQNAMVACDFDLK 141
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
F+ I GW G + +R+L+ LN EV ++ + GY
Sbjct: 142 FVHINPGWEGSASDARVLQ-----------DALNHGF-----EVPDGKFYLVDAGYANTP 185
Query: 280 WLITPY-------ETNGLSASMPT-----FNSLHEATRSLAVKAFLQLKGGWRI---LSK 324
+ PY G + P FN H R+ ++ G W++ + K
Sbjct: 186 QFLAPYRGTRYHLNEQGRALQKPQNHKELFNLRHAQLRN----HIERIIGIWKMRFPILK 241
Query: 325 VMWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSD 363
V K K I + C +LHN I I +GD P A D
Sbjct: 242 VASHFPKEKQIDISVACAVLHNFIRIHNGDMTWPSDATMD 281
>gi|449686978|ref|XP_004211313.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 388
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 98/228 (42%), Gaps = 18/228 (7%)
Query: 129 VGQSTVSQVT-------WRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
VG++TV + W+ ++ + RA + +K + I +F++++ C GAI
Sbjct: 83 VGRATVCHIIEETCCAIWKVLKKVFLRAPNDVK-----EWQNIIKEFDQNWNFPQCIGAI 137
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D H+ + PA ++ + + + SM++ I D + F + G N + +L S
Sbjct: 138 DGKHVRIEAPA-KSGSSFYNYKGFNSMVLLAICDAKYCFTMVDIRAYGRDNDAAILNAST 196
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLH 301
F + G ++IS + ++ +VG + L WL+ PY L+ FN
Sbjct: 197 FGRTFNKGYF--NLLKISEFDPKVPSVLVGDDIFALKPWLMKPYPGKNLTVQQRVFNYRL 254
Query: 302 EATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNII 348
R +F L WRI S + +P K+ II LHN +
Sbjct: 255 SRARKTIENSFGILAARWRIYRSPIKAKP--LKVEHIIKATVCLHNYL 300
>gi|331213343|ref|XP_003319353.1| hypothetical protein PGTG_01527 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 75/324 (23%), Positives = 134/324 (41%), Gaps = 33/324 (10%)
Query: 45 AKNSST---PGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLL 101
A+NS+T P + +E F ++ + TF +LV+ L+ + P N
Sbjct: 51 ARNSTTRFDPCYDLSELQELPELSFVQMFRMTFPCFLNLVQ--LIEQNPIFYNNSRNPQR 108
Query: 102 SVEKQVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
+ Q+A+A+ RL S G + + F VG T+ T R I A+ + WP +
Sbjct: 109 NPPIQIAVAVCRLGSNGNGSAIYRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTE 168
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S R+E + EE G C G +D T I ++ + + D++ YS+ V I D
Sbjct: 169 SERIESSQVMREE--GFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDIN 226
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+FI + G+PG + + + F + Q+ + +++++ Y
Sbjct: 227 KKFISYLAGFPGSSHDAYV------FSHMQVAQQ-------PEKYFNQKQFLLADSAYTN 273
Query: 278 LSWLITPYE-TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
+++ ++ L FN +R A LKG + L ++ R R
Sbjct: 274 DRYVVPAFKGKQFLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREI--RTQIRNAEE 331
Query: 337 I------ILVCCLLHNIIIDSGDQ 354
+ I+ C +LHN++ D DQ
Sbjct: 332 MKGAVKWIVTCIVLHNLLADLKDQ 355
>gi|449676923|ref|XP_002160362.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 189
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 73/145 (50%), Gaps = 13/145 (8%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQV-------TWRFIEALEERAKHHLKWPD 157
+++ I +R LA+GESQ + F VG++TV + W+ ++ + RA + +K
Sbjct: 32 QRLIITIRYLATGESQQTQSFYFRVGRATVCHIIEETCFAIWKVLKKVFLRAPNDVK--- 88
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
+ I +F++++ C GAID H+ + PA ++ + + + YSM++ I D +
Sbjct: 89 --EWQNIIKEFDQNWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDAK 145
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGF 242
F + G G N + +L S F
Sbjct: 146 YCFTMVDIGAYGRDNDAAILNASTF 170
>gi|357118418|ref|XP_003560952.1| PREDICTED: uncharacterized protein LOC100834066 [Brachypodium
distachyon]
Length = 320
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 115/268 (42%), Gaps = 33/268 (12%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD- 157
R +S ++ +A+ L L + +S + F ST+S + ++ A ++ D
Sbjct: 23 RNMSSKEALALFLWTLGAPQSNIQAANRFEHSPSTISNKFMEVLMCVDRMAGDYIAPIDP 82
Query: 158 --SNRMEEIKS-KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV 214
++ E++K +F F + GAID THI + +PA + +++ S V +
Sbjct: 83 TFTHVHEKLKKPRFWPHFK--DAIGAIDGTHIPVIVPA-ELKVIHTNRKGYTSQNVLAMC 139
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
D++MRFI V GW G ++ +R+ + + + Y+V G
Sbjct: 140 DYDMRFIFAVPGWLGSVHDTRVWSDA------------RAEYDTFPHPPQGKYYLVDS-G 186
Query: 275 YPLLSWLITPYE---------TNGLSASM-PTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
YP + PY+ N M FN H + R++ +AF LK W IL
Sbjct: 187 YPNRVGYLAPYKGQRYHVPEFENAPPVGMQEMFNYCHSSLRNVIERAFGVLKRKWPILQG 246
Query: 325 VMWRPD-KRKLPSIILVCCLLHNIIIDS 351
+ P K+K+ I+ C LHN I DS
Sbjct: 247 IPAYPVLKQKM--IVSACMCLHNYIRDS 272
>gi|26351017|dbj|BAC39145.1| unnamed protein product [Mus musculus]
Length = 246
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 73/165 (44%), Gaps = 7/165 (4%)
Query: 86 VSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL 145
+SRP R +S E Q+ AL SG Q +G A G+ Q+++S+ EAL
Sbjct: 57 LSRPTQ-----RSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEAL 111
Query: 146 EERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQEN 204
ERA + +P D ++ +K +F G+ G D H+ + P + + +++
Sbjct: 112 VERASQFIHFPVDEAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDL-SYVNRKG 170
Query: 205 NYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG 249
+S+ + D + + T WPG + +L+ S E G
Sbjct: 171 LHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSLTSQFETG 215
>gi|301607129|ref|XP_002933172.1| PREDICTED: putative nuclease HARBI1-like [Xenopus (Silurana)
tropicalis]
Length = 286
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 74/146 (50%), Gaps = 2/146 (1%)
Query: 110 ALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKF 168
+L A+G Q GV GV Q T S+ + ++ + + +++ +P + N IK +F
Sbjct: 77 SLHFFATGSFQKVGGVYGGVSQPTFSRCLCQVLDTIRSVSANYISFPTNRNEWNTIKRQF 136
Query: 169 EESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
G+ N GAID TH+ + L A + ++++ +S+ +Q + D M IV+ +P
Sbjct: 137 YGVSGIPNVLGAIDCTHVALNL-AQDREHVFRNRKSYHSLNIQVVCDATMNIRSIVSRFP 195
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNG 254
G + + +L+ S + + GQ +G
Sbjct: 196 GSSHDAYILRQSRLYDGFQTGQMSHG 221
>gi|9743357|gb|AAF97981.1|AC000103_31 F21J9.3 [Arabidopsis thaliana]
Length = 457
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 112/256 (43%), Gaps = 7/256 (2%)
Query: 98 GRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKW 155
G+L LS +++ A R LA G S +G+ST + RF A+ E A +L+
Sbjct: 35 GKLGLSGLQKMTAAFRMLAYGVPADSTDEYIKIGESTALESLKRFCRAIVEVFACRYLRS 94
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
PD+N + + ES G G++D H + + + +++++ + D
Sbjct: 95 PDANDVARLL-HIGESRGFPRMLGSLDCMHWKWKNCPTAWGGQYAGRSRSPTIILEAVAD 153
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
+++ G PG N +L+ S F G + I+ + + Y+ G+ Y
Sbjct: 154 YDLWIWHAYFGLPGSNNDINVLEASHLFANLAEGTAPPASYVINEKPYNMSYYLADGI-Y 212
Query: 276 PLLSWLI-TPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKL 334
P S L+ T ++ G + F EA R +AF L+ + I++ +K L
Sbjct: 213 PKWSTLVQTIHDPRGPKKKL--FAMKQEACRKDVERAFGVLQSRFAIVAGPSRLWNKTVL 270
Query: 335 PSIILVCCLLHNIIID 350
I+ C ++HN+II+
Sbjct: 271 HDIMTSCIIMHNMIIE 286
>gi|20514796|gb|AAM23241.1|AC092553_7 Putative transposase [Oryza sativa Japonica Group]
gi|21326484|gb|AAM47612.1|AC122147_1 Putative transposase [Oryza sativa Japonica Group]
gi|110288571|gb|ABB46678.2| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 535
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 102/217 (47%), Gaps = 34/217 (15%)
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHE 217
NR+ +++ + S G +C GAID THI ++ V++S + ++++ + V VD +
Sbjct: 41 NRL--VRTNYGRS-GETDCIGAIDGTHIRASVRKNVESS--FRGRKSHATQNVMAAVDFD 95
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+RF ++ GW G + + +L+ +A +R NG + + + Y+V G GY
Sbjct: 96 LRFTYVLAGWEGTAHDAVVLR--------DALERENG-LHVPQGKF----YLVDG-GYGA 141
Query: 278 LSWLITPYE----------TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
+ P+ N + FN H + R +AF LK +++L
Sbjct: 142 KQGFLPPFRAVRYHLKEWGNNPVQNEKELFNLRHSSLRITVERAFGSLKRRFKVLDDATP 201
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSG-DQLHPDVALSD 363
R I++ CC++HN +I+ G D+L VA SD
Sbjct: 202 FFPFRTQVDIVVACCIIHNWVINDGIDEL---VAPSD 235
>gi|115454089|ref|NP_001050645.1| Os03g0608700 [Oryza sativa Japonica Group]
gi|113549116|dbj|BAF12559.1| Os03g0608700, partial [Oryza sativa Japonica Group]
Length = 249
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 89/192 (46%), Gaps = 28/192 (14%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID TH+ ++P S + ++N+ + V VD ++RF ++ GW G +
Sbjct: 27 DCIGAIDGTHVRASVPKNMESS-FRGRKNHATQNVMAAVDFDLRFTYVLAGWEGAAHDVV 85
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET-------- 287
+L+ +A +R NG V ++ + VGY + P+ +
Sbjct: 86 VLR--------DALERENG------LHVPQGKFYLVDVGYGAKPGFLPPFRSTRYHLNEW 131
Query: 288 --NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLL 344
N + FN H + R +AF LK ++IL + P + ++ +I++ CC++
Sbjct: 132 GNNPVQNEKELFNLRHSSLRITVERAFGSLKRRFKILDDATPFFPFQTQV-NIVVACCII 190
Query: 345 HN-IIIDSGDQL 355
HN +I D D+L
Sbjct: 191 HNWVINDDIDEL 202
>gi|357167736|ref|XP_003581308.1| PREDICTED: uncharacterized protein LOC100844952 [Brachypodium
distachyon]
Length = 550
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 109/273 (39%), Gaps = 47/273 (17%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RV + F +C ++RE L+ R + +SVE+QVA+ L + +G
Sbjct: 2 LRVERAPFFRLCEILRERLLLR--------DSIHVSVEEQVAMFLNTIGHNLRNRVIGAN 53
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS--KFEESFGLLNCCGAIDAT 184
F TVS+ + A+ E ++ P +I +F+ F +C GAID T
Sbjct: 54 FDKSNETVSRYFRLVLHAIGELRAEYISPPSLETPAKIAGNPRFDPYFK--DCIGAIDGT 111
Query: 185 HIIMTL-----PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKF 239
H+ ++ P+ + + Q V VD ++RF ++ GW G + + +
Sbjct: 112 HVRASVSKDMKPSFRGRKSFPTQN------VMAAVDFDLRFTYVLAGWEGIAHDATI--- 162
Query: 240 SGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE----------TNG 289
L +A +R NG V ++ + GY + P+ +N
Sbjct: 163 -----LADAIERENGLY------VPQGKFYLVDAGYGAKPGFLPPFRAVRYQLNEWGSNP 211
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
+ FN H + R +AF LK ++IL
Sbjct: 212 VQNEKELFNLRHSSLRVTVERAFGTLKRRFKIL 244
>gi|449686016|ref|XP_004211043.1| PREDICTED: uncharacterized protein LOC101236526 [Hydra
magnipapillata]
Length = 313
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 118/312 (37%), Gaps = 35/312 (11%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
EE K FRV+ TF+++ + + L +SV K+VA+AL LAS E
Sbjct: 2 EECELKEHFRVNSNTFNFLVNELHPHL-----GKTTTTTQEPISVVKRVAVALHYLASFE 56
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDS-NRMEEIKSKFEESFGLLN 176
V +G+G+ + + FI A+ + K+P S + + FE G
Sbjct: 57 EYRVVSSLYGIGK-LANLIVHEFINAVNDILLPIFFKFPLSVENLNKHNRDFEAILGFRW 115
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C M P+ C FI GWPG N S +
Sbjct: 116 SCSW-------MPHPSFSPLRSNC----------------RYGFIYTSVGWPGRNNDSCI 152
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASMP 295
L+ S + E+ + V L +G +PL L+ PY N LS
Sbjct: 153 LQNSSLKAILESNLFDKCCKELGDSFVSL--CFIGDSAFPLTRHLLKPYPKNLELSEIQK 210
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
FN + + + AF ++K +R++ K M D I+ C LHNI D +
Sbjct: 211 NFNKILCGAKKVVKNAFGRVKARFRVICKRM-ECDINFATRIVNACVTLHNICEYYDDII 269
Query: 356 HPDVALSDHHDS 367
+ + H+DS
Sbjct: 270 IIEWLMHHHNDS 281
>gi|390342959|ref|XP_003725766.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 288
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 85/180 (47%), Gaps = 8/180 (4%)
Query: 53 ATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALR 112
A + ++ + +K F R++ F+ + + L + L V ++A+ LR
Sbjct: 68 AELHREDPKSYKNFLRITPDLFEEMVGKLTPHLEKK-----TTFMREPLEVGLKLAVTLR 122
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDS-NRMEEIKSKFEE 170
LA+G S ++ +F V +ST+S+ AL E K L P + + +++ KF
Sbjct: 123 FLATGNSYTTLQYSFRVEKSTISRFIPEVCNALIEVYKKEVLSCPKTEDGWKDVAKKFSS 182
Query: 171 SFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
+ NC GA+D H+ + P ++ + + + +S+++ + D E +F+ + G GG
Sbjct: 183 RWNYHNCLGAVDGKHVAIRKPP-KSGSLYFNYKGFHSIVLMAVADAEYKFLYVDVGAEGG 241
>gi|331213311|ref|XP_003319337.1| terpenoid cylase/protein prenyltransferase alpha-alpha toroid
[Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 512
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 124/297 (41%), Gaps = 18/297 (6%)
Query: 98 GRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKW 155
GRL S ++V A R LA G S S+ + +ST + F AL + A+ +L+
Sbjct: 187 GRLGFSPLQKVTAAFRLLAYGCSADSIDEYLRISESTALESLKHFCSALVKIYAEEYLRS 246
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
P++N + +I + E G G++D H V + +E +++++ +
Sbjct: 247 PNANDLAKILA-VSEKRGFPGMMGSLDCMHWGWKNCPVADHGQYSGKEKEPTVILEAVAT 305
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGY 275
H++ G PG +N +L S F+ C+ G + ++ L Y+ + Y
Sbjct: 306 HDLWIWHAFFGLPGTLNDINVLDRSPIFQQCQDGVNPSFEYSVNGNRYNLGYYLTDSI-Y 364
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK--VMWR-PDKR 332
P + LI +N + + EA R AF L+ + I+ +W+ PD
Sbjct: 365 PKYASLIQSI-SNPQGKKNKHYAKMQEAYRKDVECAFGVLQARYAIIRYPGRLWKHPD-- 421
Query: 333 KLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRENLE 389
L I+ +LHN+ ++ D A SD + +Q + +GR N +
Sbjct: 422 -LCIIMKTVIILHNMTVE-------DEAGSDFEEDFDYDQNARTQATIGRDNSANAD 470
>gi|119226353|gb|AAI28985.1| LOC100036828 protein [Xenopus laevis]
Length = 326
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 59/252 (23%), Positives = 103/252 (40%), Gaps = 12/252 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSN 159
+S E+++ + LR LA+G+S S+ F +G++T+ ++ + E + + PD N
Sbjct: 28 ISPEERLCLTLRFLATGQSFSSLYFRFPIGRTTIGKIVRETCLLIWSELQRLVMPTPDEN 87
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
I F ++ NC GA+ I + +P S+ + S+++ +VD
Sbjct: 88 AWIHIAEDFYKTTNFPNCLGAMGGKRIQIKMPFKSGSEKYS------SVVLLAVVDANYC 141
Query: 220 FIDIVTGWPGGM-NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI-VGGVGYPL 277
F I G G + S + +L E L + Y+ VG + L
Sbjct: 142 FSIIDVGAYGSTGDASAFWSSAMGHQLSEGALHLPLPKPLPGTAAPSMPYVFVGDEAFGL 201
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPS 336
++ PY + + FN R + AF WRI + + PD +
Sbjct: 202 TENIMRPYPGSQMDIQKRLFNYRLLRARRMVECAFGIFSNKWRIFHNAIQLEPDFVDI-- 259
Query: 337 IILVCCLLHNII 348
II CC+LHN +
Sbjct: 260 IIKACCVLHNFV 271
>gi|242081155|ref|XP_002445346.1| hypothetical protein SORBIDRAFT_07g011660 [Sorghum bicolor]
gi|241941696|gb|EES14841.1| hypothetical protein SORBIDRAFT_07g011660 [Sorghum bicolor]
Length = 267
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 85/220 (38%), Gaps = 23/220 (10%)
Query: 156 PDSNRMEEIKSKFEESF---GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
P + + K E ++ NC GAID THI +T+PA + + + + V
Sbjct: 14 PKDPKFSRVHPKMEGAWFYPAFNNCIGAIDGTHIPVTVPAADAA-KYTGRCGRPTQNVMV 72
Query: 213 IVDHEMRFIDIVTGWPGGMN-----VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE 267
D +MRF +V WPG N L K+ F G+ + + + L
Sbjct: 73 CCDFDMRFTLLVARWPGWANDFKVFTETLQKYGNCFPHPPEGKFYAVDSGYLNRKGYLAP 132
Query: 268 YIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
Y GV YP E FN+ H + R +AF G W+ + W
Sbjct: 133 Y--RGVNYPCED-----SEVPMQKGKKEIFNTAHASIRCCIERAF----GAWKKWRILHW 181
Query: 328 RP--DKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHH 365
P K +I+L C LHN I ++ D A +HH
Sbjct: 182 LPCYSVEKQIAIVLALCALHNFIREN-DLEDKHFAKVEHH 220
>gi|170111802|ref|XP_001887104.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164637878|gb|EDR02159.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 350
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 96/236 (40%), Gaps = 35/236 (14%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA---SGES 119
F+ + R++ + FD + ++ + N + VE Q+AI L R +G S
Sbjct: 114 FRSYLRITPECFDSLLESIQHHEAFQN-----NSNNPQMPVECQLAITLYRFGHFGNGLS 168
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH--LKWPDSNRMEEIKSKFEESFGLLNC 177
+ V + GVG TV + T R + + H ++WP ++E K E N
Sbjct: 169 TLKVALWAGVGYGTVCECTNRVMRVCCDERFHQVVMQWPSDEEIQEAKDWVE-----ANS 223
Query: 178 CGA-------IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
C A +D T I M + D ++NYS VQ I ++ ID G PG
Sbjct: 224 CPAWRDGWLMVDGTLIPMFQRPHHFGSAYFDHKSNYSENVQIINTPDLHIIDYGIGLPGS 283
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
+ + + + L + QRL ++ E++ YP+ +W PY+
Sbjct: 284 QHDATAWQHT---HLPQERQRL----------LKDGEFVWADSAYPVRTWCQAPYK 326
>gi|432951260|ref|XP_004084775.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 273
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 77/176 (43%), Gaps = 13/176 (7%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
+R S Y+C L+ + R LSVE+ V +AL ASG SVG A
Sbjct: 45 YRFSADGIRYLCRLLGPRIKHR------TARSHTLSVEQMVCMALHFFASGAFLYSVGDA 98
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ ++T+ + A++ A + P R+ ++K +F G GA+D THI
Sbjct: 99 EQLNKATICRTIRSVCLAIKALADVFISLPGHRRLCDMKEEFNWIAGFPKVIGAVDCTHI 158
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIV----DHEMRFID---IVTGWPGGMNVSR 235
+ P+ D+ ++++ +S+ VQ + D + + G PGG SR
Sbjct: 159 RIKAPSGAHEADFVNRKSCHSINVQADLSPAEDMALELNKGRPVFEGIPGGKETSR 214
>gi|449690961|ref|XP_002170619.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 209
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 16/140 (11%)
Query: 154 KWP-DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
++P D + + E + F G ID TH+ + P + + ++ +N+S+ VQ
Sbjct: 83 QYPTDPHVLNESRVTFYNVAEFPRITGLIDGTHVSIQKPR-KYEYVYVNRSSNHSINVQV 141
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+ D+ +FIDIV WPG + +R+L+ S K G L+ ++
Sbjct: 142 VCDYNGKFIDIVAKWPGITHDARILRESKLSKKMIDG--------------TLKGLLLSD 187
Query: 273 VGYPLLSWLITPYETNGLSA 292
GYP WL+TPY + +A
Sbjct: 188 FGYPCFRWLLTPYLSPTTAA 207
>gi|116268420|gb|ABJ96377.1| hypothetical protein [Prunus persica]
Length = 458
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 105/257 (40%), Gaps = 5/257 (1%)
Query: 95 NIEGRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHH 152
N G L L E++ +R LA G S V +G+ST+ + RF +A+E + +
Sbjct: 118 NCAGNLGLLPEQKFTAVIRMLAYGSSADQVDEIARMGKSTILESLVRFCDAVETLYTRDY 177
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
L+ P ++ + K ES G G+ID H D+ +++ S++++
Sbjct: 178 LRXPTPKDLQRLLQK-AESRGFPGMIGSIDCMHWQWKNCPTAWQRDYGNRKGQKSIILEA 236
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+ + G G N +L S F G+ N I++ + Y+ G
Sbjct: 237 VAGFDTWVWHAFFGVAGSQNDLNVLGQSPVFNDVLRGEAPNITYEINNTIYQTGYYLADG 296
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
+ YP + + + S F E R + F L+ W I+ D+
Sbjct: 297 I-YPRWTTFVKTI-PHPRSHKEKFFAXYQEGYRKDVERCFGILQARWAIIRGAARLFDEE 354
Query: 333 KLPSIILVCCLLHNIII 349
L SI++ C +LHN+I+
Sbjct: 355 VLRSIMMTCIILHNMIV 371
>gi|331239088|ref|XP_003332198.1| hypothetical protein PGTG_13565 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 393
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/260 (23%), Positives = 111/260 (42%), Gaps = 28/260 (10%)
Query: 106 QVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRM 161
Q+A+A+ RL S G + + F VG T+ T R I A+ + WP +S R+
Sbjct: 113 QIAVAVCRLGSNGNGSAIYRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTESERI 172
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
E + EE G C G +D T I ++ + + D++ YS+ V I D +FI
Sbjct: 173 ESSQVMREE--GFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFI 230
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
+ G+PG + + + F + Q+ + + +++++ Y ++
Sbjct: 231 SYLAGFPGSSHDAYV------FSHMQVAQQ-------PEKYFDRKQFLLADSAYTNDRYV 277
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVK-AFLQLKGGWRILSKVMWRPDKRKLPSI--- 337
+ ++ L + H A + ++ A LKG + L ++ R R +
Sbjct: 278 VPAFKGKQLLKRRNIDFNYHLAQSRVRIEHAIGILKGRFASLQEI--RTQIRNAEEMKGA 335
Query: 338 ---ILVCCLLHNIIIDSGDQ 354
I+ C +LHN++ D DQ
Sbjct: 336 VKWIVTCIVLHNLLADLKDQ 355
>gi|331232448|ref|XP_003328886.1| hypothetical protein PGTG_10187 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 377
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/322 (22%), Positives = 134/322 (41%), Gaps = 29/322 (9%)
Query: 45 AKNSST---PGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLL 101
A+NS+T P + +E F ++ + TF +LV+ L+ + P N
Sbjct: 51 ARNSTTRFDPRYDLSELQELPELSFVQMFRMTFPCFLNLVQ--LIEQNPIFYNNSCNPQR 108
Query: 102 SVEKQVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
Q+A+A+ RL S G S + F VG T+ T R I A+ + WP +
Sbjct: 109 DPPIQIAVAVCRLGSNGNGSSIYRLKNLFQVGFGTIDLYTRRVIHAVYGLRTSLVTWPTE 168
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S ++E + EE F C G +D T I ++ + + D++ YS+ V I D
Sbjct: 169 SEQIESSQVMREEGFP--GCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDIN 226
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+FI + G+PG + + + F + Q+ + + +++++ Y
Sbjct: 227 KKFISYLAGFPGSSHDAYV------FSHMQVAQQ-------PEKYFDQKQFLLADSAYTN 273
Query: 278 LSWLITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGW----RILSKVMWRPDKR 332
+++ ++ L FN +R A LKG + I +++ + +
Sbjct: 274 DCYVVPAFKGKQLLKCCNINFNYHLAQSRVRIEHAIGILKGRFASLREIQTQIRNTEEMK 333
Query: 333 KLPSIILVCCLLHNIIIDSGDQ 354
IL C +LHN++ D DQ
Sbjct: 334 GAVKWILTCIVLHNLLADLKDQ 355
>gi|331221192|ref|XP_003323271.1| hypothetical protein PGTG_04808 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/260 (23%), Positives = 111/260 (42%), Gaps = 28/260 (10%)
Query: 106 QVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRM 161
Q+A+A+ RL S G + + F VG T+ T R I A+ + WP +S R+
Sbjct: 113 QIAVAVCRLGSNGNGSAIYRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTESERI 172
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
E + EE F C G +D T I ++ + + D++ YS+ V I D +FI
Sbjct: 173 ESSQVMREEGFP--GCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFI 230
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
+ G+PG + + + F + Q+ + + +++++ Y ++
Sbjct: 231 SYLAGFPGSSHDAYV------FSHMQVAQQ-------PEKYFDRKQFLLADSAYTNDCYV 277
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVK-AFLQLKGGWRILSKVMWRPDKRKLPSI--- 337
+ ++ L + H A + ++ A LKG + L ++ R R +
Sbjct: 278 VPAFKGKQLLKRRNINFNYHLAQSRVRIEHAIGILKGRFASLREI--RTQIRNAEEMKGA 335
Query: 338 ---ILVCCLLHNIIIDSGDQ 354
I+ C +LHN++ D DQ
Sbjct: 336 VKWIVTCIVLHNLLADLKDQ 355
>gi|391330334|ref|XP_003739618.1| PREDICTED: putative nuclease HARBI1-like [Metaseiulus occidentalis]
Length = 254
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/226 (24%), Positives = 102/226 (45%), Gaps = 23/226 (10%)
Query: 68 RVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAF 127
R+S+ F Y+ +R + R + L E ++ + LR LA+G+S S+ +
Sbjct: 25 RLSESQFSYLLERIRPRIEKRDTTF-----RNALRPESKLHMTLRYLATGDSIGSLSALY 79
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL----NCCGAIDA 183
+ +ST S +A+ + + ++ P S EE K +E F L N CGAID
Sbjct: 80 RIPRSTFSSFFPVVCKAIYDALEEFIQMPSSE--EEWKVVMQE-FALRWQFPNACGAIDG 136
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
H+ + P ++ ++ + + ++S++ +VD F+ I G G +N + + + S F
Sbjct: 137 KHVTIRCPP-KSGSEFFNYKKSFSIISLAVVDARYNFLYIDVGTNGRVNDAGVFEKSSFN 195
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
+ E Q L +V + V +PL S ++ P+ G
Sbjct: 196 RALEE-QVL---------DVPIEGVFVADDAFPLRSNILKPFSRCG 231
>gi|198476040|ref|XP_002132244.1| GA25311 [Drosophila pseudoobscura pseudoobscura]
gi|198137519|gb|EDY69646.1| GA25311 [Drosophila pseudoobscura pseudoobscura]
Length = 412
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/336 (19%), Positives = 137/336 (40%), Gaps = 20/336 (5%)
Query: 21 KSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPS----DEEEGFKYFFRVSKKTFDY 76
K +A +D + ++ W + P TIP ++ + ++ F
Sbjct: 13 KIAQVAADLEDRDKAEDLPWKENARRKGHAPIRTIPKACTFTDDTSLRALHGMTWTAFGK 72
Query: 77 ICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQ 136
+ +R L + P+ LL E ++ +ALR L +G+S ++ + VG+S++
Sbjct: 73 LLKQLRGRLDRKNPT--------LLPAETRLQMALRYLTTGDSFSTLAGIYRVGKSSIKY 124
Query: 137 VTWRFIEALEERAKHH-LKWPDSNRM-EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQ 194
+ I+A+ + K L+ P + ++ +FEE + +C G++D HI V+
Sbjct: 125 IVEEVIKAIVKELKGVCLQTPQTEEEWLDVAKQFEELWNFPHCLGSLDGHHIAFRSKTVK 184
Query: 195 TSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG-QRLN 253
D + + S+++ +VD RF+ + PGG + S + E+ +
Sbjct: 185 -DDSYTNYRQFQSIIMLALVDAHHRFLYVDASSPGG--ATDAFTDSTLYNGLESNLLNIP 241
Query: 254 GNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFL 313
+ EL ++ G+ L WL+ +E + FN + V A
Sbjct: 242 HEKPLLGLNEELPHVVLADKGFELQPWLMKQHEIPS-TIEKKIFNYRLNRAHRVVVNALG 300
Query: 314 QLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
+ +R L + + + + +++ +LHN ++
Sbjct: 301 IMSNSFRALQTEI-KLEASMVEKLVIATSILHNFLV 335
>gi|359486961|ref|XP_003633498.1| PREDICTED: uncharacterized protein LOC100855100 [Vitis vinifera]
Length = 409
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 88/347 (25%), Positives = 138/347 (39%), Gaps = 60/347 (17%)
Query: 46 KNSSTPGATIPSDEEEGFKY----FFRVSKKTFDYICSLVR--EDLVSRPPSGLINIEGR 99
+ S GA D EG FR+ K+TF +C ++ E+L + R
Sbjct: 47 RTSMLSGAQFVRDMIEGHPQTCYELFRMDKETFMNLCDHLKRHENLQ----------DTR 96
Query: 100 LLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN 159
++VE+ VA+ L + V F TV++ AL K L P+ N
Sbjct: 97 FVTVEEAVAMFLLIVGHNVRMRVVADRFQHSTETVARHFKEVRRALCRLGKI-LICPN-N 154
Query: 160 RMEEIKSKFEESFGLL----NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV- 214
E+ S + +C AI+ THI +PA D +++ Q ++
Sbjct: 155 MTNEVSSYVASNPKYFPWFKDCISAINGTHISAWVPA----DRQTSFRGRKTVITQNVMC 210
Query: 215 --DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+ +M F + GW G N +R+ +A R N SE +Y V
Sbjct: 211 ACNFDMMFTFVYAGWEGTTNDARVF--------LDALTRPEVNFPWPSEG----KYYVVD 258
Query: 273 VGYPLLSWLITPYE------------TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR 320
GYP +S + PY N FN H + R++ + F LK +
Sbjct: 259 SGYPCISGFLPPYRGERYHLQEYRGRRNQPIRYKELFNYRHSSLRNIIERCFGVLKTRFP 318
Query: 321 ILSKVM--WRPDKRKLPSIILVCCLLHNIIIDS--GDQLHPDVALSD 363
I S++M ++P ++ PSI++ CC LHN I S DQL + + D
Sbjct: 319 I-SRMMPCYKPSRQ--PSIVVACCTLHNWIRLSTRNDQLFREYEVED 362
>gi|307172839|gb|EFN64056.1| Putative nuclease HARBI1 [Camponotus floridanus]
Length = 143
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 72/155 (46%), Gaps = 17/155 (10%)
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLV----QGIVDHEMRFIDIVTGWP 228
G GAID THI + P + + + +++ ++S+ + Q + +H+ F+ G P
Sbjct: 1 GFSKVIGAIDGTHINIPAPRIYPAS-YVNRKGHHSIQLFSDFQAVCNHKTHFLHCFAGHP 59
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G ++ R+ F+L E + L GN E+ +I+G Y L LI PY N
Sbjct: 60 GSVHDQRV------FRLSELQEWL-GN----PEKFPDDYHILGDAAYKLHQNLIVPYRDN 108
Query: 289 G-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
G L+ FN H + R +AF +K +R L
Sbjct: 109 GHLTPRQKNFNFCHSSARVAIERAFGFMKTSFRSL 143
>gi|449682371|ref|XP_004210059.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 170
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 73/175 (41%), Gaps = 40/175 (22%)
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
G ID THI + + + D+ +N+S+ VQ + D++ +FI++V WPG +
Sbjct: 9 GVIDCTHICIQ-KTHKHEYAYVDRSSNHSINVQAVCDNKGKFINVVAKWPGSL------- 60
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
++G YP WL+TPY N +AS +N
Sbjct: 61 ------------------------------LIGDSSYPCFRWLLTPY-LNPTTASQRRYN 89
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
TR + + F + K + +L + R + ++I C +L+N+ I D
Sbjct: 90 ISLRKTRVIIEQVFGRWKRRFHLLHGEI-RMTPERTCTLIAACAVLYNLAIQLND 143
>gi|77556528|gb|ABA99324.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77556531|gb|ABA99327.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 549
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 81/194 (41%), Gaps = 41/194 (21%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDD----WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
+C GA+D THI+ +P DD + + + + V I DH+MRF G PG M
Sbjct: 318 DCIGAVDGTHILAVVP-----DDEKIRYIGRSKSTTQNVMAICDHDMRFTYAFIGQPGSM 372
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE----- 286
+ +L F+ L ++ I + + Y+V GYP + PY+
Sbjct: 373 HDITVL-FTA----------LQTDLTIFPHPPQGKYYLVDA-GYPNCPGYLAPYKGQRYH 420
Query: 287 ------TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV----MWRPDKRKLPS 336
S FN LH + R++ + F K W IL K+ MW K+K+
Sbjct: 421 VPEFRRGTAPSGEKEKFNFLHSSLRTIIERCFGVWKMKWWILLKIPSFPMW---KQKM-- 475
Query: 337 IILVCCLLHNIIID 350
++ LHN I D
Sbjct: 476 VVAATMALHNFIRD 489
>gi|449690868|ref|XP_004212486.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 262
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 74/170 (43%), Gaps = 13/170 (7%)
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
N +E+ ++FE + NC GAID H+ + P++ S + + +N +SM++ I D +
Sbjct: 6 NEWKELANQFENEWNFPNCIGAIDGKHVCIEAPSLSGS-AYYNYKNFHSMVLLAICDAKY 64
Query: 219 RFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV------RISSEEVELREYIVGG 272
F + G G N + + F + G+ N+ R+ S ++ +VG
Sbjct: 65 CFTLVDIGSYGRDNNASI------FNESKMGKAFKNNLFKLSKNRMLSNGTQVPPVLVGD 118
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
+ L SWL+ P+ L+ FN TR F + WR+
Sbjct: 119 DIFALKSWLMKPFSGKNLTIKERIFNYRLSRTRRTIENTFGIMVAKWRVF 168
>gi|195146504|ref|XP_002014224.1| GL19055 [Drosophila persimilis]
gi|194106177|gb|EDW28220.1| GL19055 [Drosophila persimilis]
Length = 387
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 128/325 (39%), Gaps = 40/325 (12%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRP---PSGLINIEGRLLSVEKQVAIALRRLAS 116
++ FK FR+ + TF+ + +V + P G +++ +LL L L
Sbjct: 65 DDQFKERFRMQRSTFETLLQVVGRAIAGAEQYQPIGSVSLPEKLL-------YTLMLLGG 117
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
G S G + + +S+ ++ A +++WP S + G+L
Sbjct: 118 GISFRDAGEIYSISKSSGHEIFKWVTSAFSALMPCYVQWPKYGAAGVGSSSISKLPGVL- 176
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
G I+ I + LP ++ DD + + ++ +Q + D + RF+D+ P L
Sbjct: 177 --GVIEECRIPLKLP-IRIDDD--NPKQYPALTLQAVCDAQSRFLDVYVDAPNDDQHCIL 231
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
L F KL L R +IVG YPLL L+TPY L+
Sbjct: 232 LASPLFEKLIHTQTPLMAEHR----------HIVGDKTYPLLLNLMTPY-VGELTPCHAQ 280
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKL---PSIILVCCLLHNIIIDSG- 352
+N + A +A L ++ L + D L ++I C+LHN I+DSG
Sbjct: 281 YNLAIRLWNAPAERALATLISRFKRLDSL----DVSSLELGTTVISAACMLHNFILDSGV 336
Query: 353 -----DQLHPDVALSDHHDSGYGEQ 372
D L D+ +H Y E
Sbjct: 337 EGMEDDILAFDMLPENHKKIPYDEH 361
>gi|242085294|ref|XP_002443072.1| hypothetical protein SORBIDRAFT_08g007605 [Sorghum bicolor]
gi|241943765|gb|EES16910.1| hypothetical protein SORBIDRAFT_08g007605 [Sorghum bicolor]
Length = 372
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/279 (20%), Positives = 112/279 (40%), Gaps = 13/279 (4%)
Query: 85 LVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA 144
++ R +G+++ LS ++V ALR L G S + +G+ST + I
Sbjct: 50 ILKRDAAGVLS-----LSCFQKVTAALRMLTYGVSADATDEYIRIGESTALESLLVQIFE 104
Query: 145 LEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQEN 204
+E +L++P + + + E G G+ID H +Q+ + N
Sbjct: 105 VE-----YLRYPTEADVARLLA-VNEKRGFPGMLGSIDCMHWEWKNCPMQSQGQYKGHTN 158
Query: 205 NYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVE 264
+++++ + H++ G PG N +L S F G+ N I+ + +
Sbjct: 159 RPTIILEAVASHDLWCWHAFYGMPGSHNDINVLHRSPLFDNLAEGKAPEVNYTINGHDYK 218
Query: 265 LREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
+R + G+ YP + L+ + F EA R + F + I+
Sbjct: 219 MRYLLADGI-YPTWATLVKTISCP-MGNKQKYFARAQEAARKDVERFFGVFHSRFAIIRH 276
Query: 325 VMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
+ D+ L I+ C ++HN+I++ + PD D
Sbjct: 277 LGRIWDRETLALIMRSCVIMHNMIVEDERVVDPDERFPD 315
>gi|328715516|ref|XP_003245650.1| PREDICTED: hypothetical protein LOC100573078 [Acyrthosiphon pisum]
Length = 244
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 88/191 (46%), Gaps = 35/191 (18%)
Query: 39 WWPSFWAKNSSTPGATIPS---DEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLIN 95
W W K + T+ + D+E+ F +FR+++ TF ++ E+L+++ +
Sbjct: 38 WVHDLWKKRKTEGEFTLYNGLIDDEQKFYEYFRMTQYTF-HVLHEKLENLITKKETHW-- 94
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKW 155
+ + ++++A+ LR LA+G+S S+ FI A
Sbjct: 95 --RKPIPSKERLAVCLRYLATGDSMRSIS----------------FIMAA---------- 126
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
P+ + +EI S+F S+ NC GAID H+ + P S +C ++ +S+++ +VD
Sbjct: 127 PNEEKWKEIASEFWTSWNFPNCLGAIDGKHVTIQAPPNSGSQYFCYKK-TFSVVLLALVD 185
Query: 216 HEMRFIDIVTG 226
FI + G
Sbjct: 186 AHYNFIAVDVG 196
>gi|449693237|ref|XP_004213365.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 287
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/242 (23%), Positives = 104/242 (42%), Gaps = 27/242 (11%)
Query: 129 VGQSTVSQVTWRFIEALEERAKHH--LKWPDSNR-MEEIKSKFEESFGLLNCCGAIDATH 185
+ Q++ +++ +F E+ +H+ L W S + ++K + E+FG+ G ID T
Sbjct: 5 ISQASTNRIIHQFCESF---MQHYITLSWYSSQEEITKVKQCYFEAFGVKGRLGLIDGTM 61
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
+ + V + ++ ++ Q +VD++ +F D+V G +
Sbjct: 62 VPIKGVTVADEPAFICRKGYSAINCQFVVDYDGQFRDVVIKCSGSCH------------- 108
Query: 246 CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATR 305
+A N ++ + E + +++G GY L +ITP+ ++ FN H R
Sbjct: 109 -DAFVYSNSTLKQTMEADPVAGFLIGDSGYGLSLVMITPFSP-AITPEEMFFNKTHSRVR 166
Query: 306 SLAVKAFLQLKGGWRILSK----VMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVAL 361
S + LK WR L K + + P K SII C LL N+ G + +V
Sbjct: 167 SRIERCISSLKNRWRCLHKSGRSLQYSPT--KCCSIIYTCVLLENMCNSLGLEEPGNVVA 224
Query: 362 SD 363
SD
Sbjct: 225 SD 226
>gi|195156327|ref|XP_002019052.1| GL26155 [Drosophila persimilis]
gi|194115205|gb|EDW37248.1| GL26155 [Drosophila persimilis]
Length = 412
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/253 (20%), Positives = 110/253 (43%), Gaps = 8/253 (3%)
Query: 100 LLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHH-LKWPDS 158
LL E ++ +ALR L +G+S ++ + VG+S++ + I+A+ + K L+ P +
Sbjct: 88 LLPAETRLQMALRYLTTGDSFSTLAGIYRVGKSSIKYIVEEVIKAIVKELKGVCLQTPQT 147
Query: 159 NRM-EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
++ +FEE + +C G++D HI V+ D + + S+++ +VD
Sbjct: 148 EEEWLDVAKQFEELWNFPHCLGSLDGHHIAFRSKTVK-DDSYTNYRQFQSIIMLALVDAH 206
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAG-QRLNGNVRISSEEVELREYIVGGVGYP 276
RF+ + PGG + S + E+ + + EL ++ G+
Sbjct: 207 HRFLYVDASSPGG--ATDAFTESTLYNGLESNLLNIPHEKPLLGLNEELPHVVLADKGFE 264
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
L WL+ +E + FN + V A + +R L + + + +
Sbjct: 265 LQPWLMKQHEIPS-TIEKKIFNYRLNRAHRVVVNALGIMSSSFRALQTEI-KLEASMVEK 322
Query: 337 IILVCCLLHNIII 349
+++ +LHN ++
Sbjct: 323 LVIATSILHNFLV 335
>gi|115692227|ref|XP_001195901.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 239
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 68/132 (51%), Gaps = 3/132 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDS- 158
L V ++A+ LR LA+G S ++ +F V +ST+S+ AL E K L P +
Sbjct: 20 LEVGLKLAVTLRFLATGNSYTTLQYSFRVEKSTISRFIPEVCNALIEVYKKEVLSCPKTE 79
Query: 159 NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
+ +++ KF + NC GA+D H+ + P ++ + + + +S+++ + D E
Sbjct: 80 DGWKDVAKKFSSRWNYHNCLGAVDGKHVAIRKPP-KSGSLYFNYKGFHSIVLMAVADAEY 138
Query: 219 RFIDIVTGWPGG 230
+F+ + G GG
Sbjct: 139 KFLYVDVGAEGG 150
>gi|55296429|dbj|BAD68552.1| ribosomal protein-like [Oryza sativa Japonica Group]
Length = 436
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 114/272 (41%), Gaps = 14/272 (5%)
Query: 95 NIEGRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHH 152
N G+L S + +AL+ LA + +G+STV + F+ + + K
Sbjct: 106 NAAGQLGFSALHKCTVALKMLAYVGPADELDDHLKMGESTVLKTLMEFVMTMIKVFGKEF 165
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
L+ P S +E I S + G G+ID H + + + +++++
Sbjct: 166 LRPPRSEEIEHILS-INLARGFPGMIGSIDCMHWEWSSCPTGWQGMYRGHKGKPTLILEV 224
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+ ++R G PG N +L S F G+ + I+ L Y+ G
Sbjct: 225 VATEDLRIWHAYFGLPGSHNDINILHRSNVFDDVAKGRAPSVEFHINDNIYSLGYYLADG 284
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV--MWRPD 330
+ YP + L+ +S F+ E+ R +AF L+ W+IL + +W P
Sbjct: 285 I-YPDWATLVKSIAL-PVSNKQKVFSEQQESCRKDVERAFGVLQAKWKILHRPARLWSP- 341
Query: 331 KRKLPSIILVCCLLHNIIIDS--GDQL---HP 357
+ L SI+ C +LHN++++ G QL HP
Sbjct: 342 -KVLNSIMRACVILHNMVVEDERGVQLPTVHP 372
>gi|449668824|ref|XP_004206878.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 338
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/264 (21%), Positives = 107/264 (40%), Gaps = 26/264 (9%)
Query: 100 LLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA----LEERAKHHLKW 155
L+ +++A+ LR L +G++Q ++ ++ + ST+S++ A L+ER H+
Sbjct: 4 LVCPSERLAVTLRFLVTGDAQCTIAASYRISASTISRIISETCAAIWTSLKERNFLHVP- 62
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
+ + I +FE + + GAID HI+M P ++ + + +S+++ + +
Sbjct: 63 SEKQEWKTIAKEFENMWNFPHAIGAIDGKHIVMQAPH-NGGSEYFNYKKTHSIVLLAVCN 121
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR-------ISSEEVELREY 268
+ F + G G + + F C G + N I + E L
Sbjct: 122 AKYEFTMVDIGDSGRQSDGSV------FNNCSLGYAIENNKLNIPDPEFIGNSEKVLPYV 175
Query: 269 IVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWR 328
+V + L ++ PY +S FN R + F +RI R
Sbjct: 176 LVADDAFGLKRHMMKPYPNQNISLDQKIFNYRLSRARRVIENTFGIATTRFRIFR----R 231
Query: 329 PDKRKLPSIILVC---CLLHNIII 349
P L +IL+ LHN ++
Sbjct: 232 PIIANLEKVILITQAIVALHNFLM 255
>gi|321459802|gb|EFX70851.1| hypothetical protein DAPPUDRAFT_60818 [Daphnia pulex]
Length = 210
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 68/127 (53%), Gaps = 3/127 (2%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSN-RME 162
+++A+ LR LASG + S+ +F VG ST + +A+ + ++K P S E
Sbjct: 2 ERLALKLRFLASGGTLTSLRYSFRVGISTAHSIVRSTSQAIFLALSPSYMKVPSSPVEWE 61
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
I ++F + +C GA++ HI ++ PA+ S + + +N +S+++ G+ D RFI
Sbjct: 62 SIANEFHNIWNFTHCVGALNGKHIRISFPALCGSQFY-NYKNFHSIVLMGLADARYRFII 120
Query: 223 IVTGWPG 229
I G G
Sbjct: 121 IDVGASG 127
>gi|357127579|ref|XP_003565457.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 396
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 110/279 (39%), Gaps = 38/279 (13%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVEKQV I L +A + ++ F T+ + R +EA+ +++ P S
Sbjct: 88 VSVEKQVVIFLYAVAKNATNETLQDWFQHSPDTIHRHFKRVLEAITNLTPIYIR-PPSLH 146
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
I K + NC GA+D THI M LP Q + +++ S D +++F
Sbjct: 147 PHSILRKPKIYPFFKNCIGAVDGTHIPMKLPLDQQK-PYRNRKQTISQNAMVACDFDLKF 205
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
+ I GW G + +R+L+ LN EV ++ + GY
Sbjct: 206 VHINPGWEGSASDARVLQ-----------DALNHGF-----EVPDGKFYLVDAGYANTPQ 249
Query: 281 LITPYET-------NGLSASMPT-----FNSLHEATRSLAVKAFLQLKGGWRI---LSKV 325
+ PY G + P FN H R+ ++ G W++ + KV
Sbjct: 250 FLAPYRDTRYHLNEQGRARQKPQNHKELFNLGHAQLRN----HIERIIGIWKMRFPILKV 305
Query: 326 MWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSD 363
K K + C +LHN I I +GD P A D
Sbjct: 306 ASHFSKEKQIDTSVACAVLHNFIRIHNGDMTWPSDATMD 344
>gi|242096148|ref|XP_002438564.1| hypothetical protein SORBIDRAFT_10g022020 [Sorghum bicolor]
gi|241916787|gb|EER89931.1| hypothetical protein SORBIDRAFT_10g022020 [Sorghum bicolor]
Length = 326
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 109/270 (40%), Gaps = 10/270 (3%)
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFE 169
+R L G + +G+ST + +F+ A++E +L+ P+ + + E
Sbjct: 1 MRMLTYGVPADATDEYVRIGESTAIESLRKFVIAIDEIFGDEYLRHPNEKDTARLLAMGE 60
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+ G G+ID H + + + +E +++++ + ++ G PG
Sbjct: 61 KQ-GFPGMLGSIDCMHWVWKNYPYEKQGQYKGKEEKPTIVLEAVASDDLWIWHTFFGMPG 119
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
N +L S F G+ N ++ + Y+ G+ YP + L+ T
Sbjct: 120 SHNDINVLNRSPLFDNLAEGKAPEVNFSVNGHNYVMGYYLADGI-YPTWATLVKSI-TKP 177
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
+ F EA R + +AF L+ + I+ D L I+ C ++HN+I+
Sbjct: 178 MGNKRQYFAKAQEAARKMVERAFGVLQSRFAIVRGAARFWDVETLTRIMRACVIMHNMIV 237
Query: 350 -DSGDQLHPDVALSDHHDSGYGEQCCKQVD 378
D G + P+ +H D G GE + D
Sbjct: 238 EDEGFVVDPN----EHFDYG-GENVQPEHD 262
>gi|449690872|ref|XP_004212488.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 293
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 83/194 (42%), Gaps = 8/194 (4%)
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
D + +F++ + C GAID H+ + PA ++ + + + YSM++ I D
Sbjct: 18 DVKERQNFIKEFDQDWNFPQCIGAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAICDA 76
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV-RISSEEVELREYIVGGVGY 275
+ F + G G N + +L S F + G N+ +IS + ++ +VG +
Sbjct: 77 KYCFTMVDIGAYGRDNDAAVLNASTFGRAFNKGY---FNLPKISEFDPKVPPVLVGDDIF 133
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKL 334
L WL+ PY L+ FN R +F L WRI S + +P K+
Sbjct: 134 ALRPWLMKPYPGKNLTVQQRVFNYRLSRARRTIKNSFGILAVRWRIYRSPIKAKP--LKV 191
Query: 335 PSIILVCCLLHNII 348
II LHN +
Sbjct: 192 EHIIKATVCLHNYL 205
>gi|322782997|gb|EFZ10714.1| hypothetical protein SINV_05513 [Solenopsis invicta]
Length = 195
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 76/170 (44%), Gaps = 20/170 (11%)
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID THI + + S+ + +++ +S+ VQ I + ++ ID+V WPG + S + S
Sbjct: 1 IDCTHIKVLSYGGEDSELYRNRKGYFSLNVQVITNAKLEIIDVVARWPGSTHDSTIFNHS 60
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT---F 297
L EA + NG ++G GYP +S+L+TP L+ S P +
Sbjct: 61 RIKSLFEANRFNNG-------------LLLGDSGYPNISYLMTPL----LNPSTPAEHLY 103
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
N T S + F K + +LS K+ II +LHNI
Sbjct: 104 NEAQIQTHSTIERCFGIWKRRFAVLSIGSRFQTVEKILPIITATAILHNI 153
>gi|221121048|ref|XP_002156289.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 328
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/225 (23%), Positives = 92/225 (40%), Gaps = 23/225 (10%)
Query: 108 AIALRRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPDSNR 160
+I R + S S F VG++TV + W+ ++ + RA + +K
Sbjct: 11 SIPCYRRITTNSCDSQSFYFRVGRTTVCHIIEETCCAIWKVLKKVFLRAPNDVK-----E 65
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
+ I +F++++ C GAID H+ + P ++ + + + Y M++ I D + F
Sbjct: 66 WQNIIKEFDQNWNFPQCIGAIDGKHVRIEAPT-KSGFSFYNYKGFYCMVLLAICDAKYCF 124
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGVGYP 276
+ G G N + +L S F G+ N +IS + ++ +VG +
Sbjct: 125 TMVDIGAYGRDNDAAILNASTF------GRAFNKGYFNLPKISELDSKVPPVLVGDDIFA 178
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRI 321
L WL+ PY + FN R +F L WRI
Sbjct: 179 LKPWLMKPYPGKNFTVQQRVFNYRLSRARRTIKNSFGILAARWRI 223
>gi|384495665|gb|EIE86156.1| hypothetical protein RO3G_10867 [Rhizopus delemar RA 99-880]
Length = 347
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/251 (21%), Positives = 107/251 (42%), Gaps = 4/251 (1%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSN 159
S +++ ALR+LA G S + + +S+ F EA+ K +L+ P+++
Sbjct: 32 FSPYQKMTAALRQLAYGYSADCIDEYLRMSESSAMMWLKLFCEAVISVYKDEYLRRPNAD 91
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
++ + + EE G G+ID TH + + ++ + +++++ + +++
Sbjct: 92 DIKRLLAVGEER-GFPGMIGSIDCTHWVWKNCPTAWHGQYIGKDGSPTIILEAVASYDLW 150
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
G PG +N +L S F G+ N ++ + Y+ G+ YP +
Sbjct: 151 IWHAFFGSPGSLNDLNVLDRSPVFNKVAQGKSPKANFVVNGHNYDYGYYLADGI-YPKYA 209
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
+ ++ F E R +AF LK + I+ + K L +I+
Sbjct: 210 AFVKTIPCP-ITPKAKIFAQKQEEARKDVERAFGVLKARFAIVREAARLWKKPDLCNIMQ 268
Query: 340 VCCLLHNIIID 350
C +LHN+II+
Sbjct: 269 TCIILHNMIIE 279
>gi|409049425|gb|EKM58902.1| hypothetical protein PHACADRAFT_63878, partial [Phanerochaete
carnosa HHB-10118-sp]
Length = 311
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/238 (23%), Positives = 104/238 (43%), Gaps = 30/238 (12%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
+ F+ + RVS TFD + + DL+ S + L ++ Q+AI L R +
Sbjct: 91 DKFRRYLRVSPYTFDRLVETIGNDLIFTNKS-----QNCQLPIDLQLAITLHRFGHDGNG 145
Query: 121 VSV-GVAF--GVGQSTVSQVTWRFIEALEERA--KHHLKWPDSNRMEEIKSKFEESFG-- 173
VSV GVA G+ TV VT R + AL A + + P E+ K+ E+
Sbjct: 146 VSVQGVAGWGGIAVGTVLLVTRRVMTALLRPAFMQDAVSLPSQEEKEQAKAWVEKHSCRA 205
Query: 174 -----LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
L++ +D T + + S+ + D+++NY + +Q + +R +D G
Sbjct: 206 WRDGWLISTSLMVDGTLVSLYTCPKWYSESYFDRKSNYFLNIQVVSLLNLRIVDFAYGHT 265
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
G + + + + ++ +A +E +E++ YP+ SW++ PY+
Sbjct: 266 GSTHDATAWEQTRIYQEHDA-------------LLEGQEFVWADSAYPIASWVVAPYK 310
>gi|356524593|ref|XP_003530913.1| PREDICTED: uncharacterized protein LOC100806372 [Glycine max]
Length = 343
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 88/172 (51%), Gaps = 12/172 (6%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RVSKKTF +C +++E G + ++ + + +++ VA+ L LA V +
Sbjct: 45 LRVSKKTFFKLCRILQE-------KGQL-VKTKNVPIDEAVAMFLHILAHNLKYRVVHFS 96
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ T+S+ + A+ + +K +LK+ + N +++K+ N GA+D HI
Sbjct: 97 YCRSMETISRQFKNVLRAIMKVSKEYLKFYEYNLEGSVENKWR---WFKNSIGALDGIHI 153
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+T+ A + + +++ + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 154 PVTVSA-EDRPRYRNRKGDISTNVLGVCGSDLRFIYVLPGWEGSAGDSRVLR 204
>gi|356530983|ref|XP_003534058.1| PREDICTED: uncharacterized protein LOC100811756 [Glycine max]
Length = 481
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/299 (22%), Positives = 124/299 (41%), Gaps = 44/299 (14%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+ K F +C ++R+ + R +G++ +E+Q+AI L + E +
Sbjct: 147 LRMDKHVFHKLCDILRQRAMLRDTAGVM--------IEEQLAIFLNVIGHNERNRVIQER 198
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEI--KSKFEESFGLLNCCGAIDAT 184
F T+S+ ++A++ ++ L+ P EI ++F F +C G ID
Sbjct: 199 FQHSGETISRHFNNVLKAIKSLSREVLQPPQLKTPPEILNNARFYPYFK--DCIGVIDG- 255
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIV---DHEMRFIDIVTGWPGGMNVSRLLKFSG 241
M +PA + D N +L Q ++ +++FI + GW G + SR+L+
Sbjct: 256 ---MNIPAHVPAKDQSRFRNKKGILSQNVLAACTFDLQFIFVYPGWEGSVTDSRVLR--- 309
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE-----------TNGL 290
+ N + ++ +Y + GY I PY+ N L
Sbjct: 310 ---------AVLDNPDQNFPQIPQGKYYLVDKGYLNTEGFIAPYQGVRYQHYEFRGANQL 360
Query: 291 -SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
+ FN H R+ +++F LK + IL K+ + + I++ C+LHN I
Sbjct: 361 PRNAKELFNHRHCFLRNAILRSFNVLKTRFPIL-KLAPQYSFQIQRDIVIAGCVLHNFI 418
>gi|195386252|ref|XP_002051818.1| GJ10264 [Drosophila virilis]
gi|194148275|gb|EDW63973.1| GJ10264 [Drosophila virilis]
Length = 364
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 130/308 (42%), Gaps = 36/308 (11%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVS---RPPSGLINIEGRLLSVEKQVAIA 110
++P +++ FK FR+ + +F+ + V + + P G +++ +LL
Sbjct: 39 SMPRYQDDQFKEQFRMQRSSFEKLLQAVGKAIAGAEHHQPIGRVSLPEKLL-------YT 91
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEE 170
L L+ +S VG +F + +S+ ++ A ++KWP N S +
Sbjct: 92 LTLLSGKKSFREVGDSFAISKSSGHEIFKWVTAAFAALMPRYVKWPADNACG--GSAISQ 149
Query: 171 SFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
G++ G ID I + LP + S+ +Q + D RF+D+
Sbjct: 150 LPGVV---GVIDECRIPLKLPVREE----VGHLQYASLALQAVCDERSRFLDVHIDVSDN 202
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG- 289
V LLK F +L + + L + +++VG + YPLL L+TPY N
Sbjct: 203 QCV--LLKSELFERLIDMEEPL----------MPPHKHLVGEMMYPLLLNLMTPYADNNG 250
Query: 290 -LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL-VCCLLHNI 347
L+ +N + A +AF L + L + ++ SI++ C+LHN
Sbjct: 251 ELTPCHIRYNRAVHLWNAPAERAFAALMSRFGRLKSL--DVGTMEVGSIVVSATCMLHNF 308
Query: 348 IIDSGDQL 355
I+D G+ +
Sbjct: 309 ILDCGEPI 316
>gi|115642579|ref|XP_001196714.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 253
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/228 (21%), Positives = 104/228 (45%), Gaps = 8/228 (3%)
Query: 126 AFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNR-MEEIKSKFEESFGLLNCCGAIDA 183
A+ V +T+S V ++A+ EE L P + + ++ ++ + + + GAID
Sbjct: 4 AWRVPHNTISVVVREVVKAIIEEYTDELLFCPTTEQGWRDLADQWYQRWNFPHTVGAIDG 63
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
H+ P + ++ + E YS+++ +VD + +F I G + + + S
Sbjct: 64 KHVACKAPP-NSGSEFYNYEGFYSVILFAMVDADYKFTYIDVSGNGSSSDAHIYNESDLH 122
Query: 244 KLCEAGQRLNGNVR---ISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
+ + R++ + + ++ ++ +I+G + L ++L+ PY T L+ FN
Sbjct: 123 RGLDQN-RIHAFPQPDPLPNDNQDVPYFIIGDDAFSLRTYLMKPYSTRNLTHEERIFNYR 181
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
R + F L ++IL M + + +I+ CC+LHN++
Sbjct: 182 LSRARRVVENTFGILANRFQILLTTM-QHHHETVRTIVEACCILHNLM 228
>gi|395326160|gb|EJF58572.1| hypothetical protein DICSQDRAFT_66655 [Dichomitus squalens LYAD-421
SS1]
Length = 201
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 61/154 (39%), Gaps = 5/154 (3%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
Q+A+ L + + ES V V + V T R AL H+ WP+ + E
Sbjct: 33 QLAVFLCHMGN-ESAVKTASFASVSEGAVFLYTERVCTALRNIRNQHIYWPEDDERERFS 91
Query: 166 SKFEESFGLLNCCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
E +G C G+ D T+I + P+ WC ++ VDH RF
Sbjct: 92 DAMSE-WGFPGCLGSGDGTYIRLEKRPSQNGYAYWCRKKMYAVSYFPRYVDHRGRFTSYD 150
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRI 258
GWPG + SR+ K S + C Q + I
Sbjct: 151 FGWPGSVQDSRVFKSSHLW--CNKAQYFRSHEYI 182
>gi|57900044|dbj|BAD88086.1| ribosomal protein-like [Oryza sativa Japonica Group]
gi|57900542|dbj|BAD88282.1| ribosomal protein-like [Oryza sativa Japonica Group]
Length = 301
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/242 (21%), Positives = 103/242 (42%), Gaps = 4/242 (1%)
Query: 111 LRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFE 169
+R+LA G + ++ G+ +ST + RF++A+ + +H +L+ P+ N + +
Sbjct: 1 MRQLAYGIAADALDEYLGIAESTAIESLRRFVKAVVQVFEHEYLRSPNENDTTRL-LELG 59
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
E G G+ID H + + + +++++ + ++ G PG
Sbjct: 60 EDRGFPGMLGSIDCMHWKWKNCPTELHGMYQGHVHEPTIILEAVASKDLWIWHAFFGMPG 119
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
N +L S F G+ N I+ + + Y+ G+ +++ T E +G
Sbjct: 120 SHNDINVLHRSPLFAKLAEGKAPEVNYSINGHDYMMGYYLADGIYPSWATFVKTIPEPHG 179
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII 349
F EA R +AF L+ + I+ D++ L I+ C ++HN+II
Sbjct: 180 --NKRKYFAKAQEAVRKDVERAFGVLQARFAIVRGPARHWDEKTLGYIMKACVIMHNMII 237
Query: 350 DS 351
+
Sbjct: 238 ED 239
>gi|242092118|ref|XP_002436549.1| hypothetical protein SORBIDRAFT_10g004520 [Sorghum bicolor]
gi|241914772|gb|EER87916.1| hypothetical protein SORBIDRAFT_10g004520 [Sorghum bicolor]
Length = 392
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 69/156 (44%), Gaps = 16/156 (10%)
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE 267
+ Q + R + + G+ G +L+ S ++ E G+ L+ +
Sbjct: 201 LAAQLVAGASSRVLSLAAGFRGDRTDLGVLRLSSLYQEVEQGKLLDSQ-----------Q 249
Query: 268 YIVG-GVGYPLLSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKV 325
Y+VG G GYPLL WL+ P+ + S FN+ H+A +A L GW ++++
Sbjct: 250 YLVGDGGGYPLLPWLMVPFPGPVVPGSTEAEFNAAHKAMCRPVRRAIRSLM-GWGAIARL 308
Query: 326 MWRPDKRKLPSIILVCCLLHNIIIDSGD--QLHPDV 359
R + I C +LHN+++ D L PDV
Sbjct: 309 REEESSRAAVACIGTCAMLHNVLLTREDYSALAPDV 344
>gi|391327964|ref|XP_003738464.1| PREDICTED: putative nuclease HARBI1-like, partial [Metaseiulus
occidentalis]
Length = 229
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 101/222 (45%), Gaps = 23/222 (10%)
Query: 68 RVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAF 127
R+S+ F Y+ +R + R + L E ++ + LR LA+G+S S+ +
Sbjct: 25 RLSESQFSYLLERIRPRIEKRDTTF-----RNALRPESKLHMTLRYLATGDSIGSLSALY 79
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL----NCCGAIDA 183
+ +ST S +A+ + + ++ P S EE K +E F L N CGAID
Sbjct: 80 RIPRSTFSSFFPVVCKAIYDALEEFIQMPSSE--EEWKVVMQE-FALRWQFPNACGAIDG 136
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
H+ + P ++ ++ + + ++S++ +VD F+ I G G +N + + + S F
Sbjct: 137 KHVTIRCPP-KSGSEFFNYKKSFSIISLAVVDARYNFLYIDVGTNGRVNDAGVFEKSSFN 195
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
+ E Q L+ V + V +PL S ++ P+
Sbjct: 196 RALEE-QVLD---------VPIEGVFVADDAFPLRSNILKPF 227
>gi|77556456|gb|ABA99252.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 524
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 87/182 (47%), Gaps = 14/182 (7%)
Query: 176 NCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
+C GAID THI ++ V++S + ++++ + V VD ++RF ++ GW G + +
Sbjct: 80 DCIGAIDGTHIRASVRKNVESS--FRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTAHDA 137
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASM 294
+L+ + E G R+ R+ +++ L + V Y L W P +
Sbjct: 138 VVLRDA---LERENGLRVPQGNRLQAKQGFLPPF--RAVRYHLNEWGNNPVQN-----EK 187
Query: 295 PTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG-D 353
FN H + R +AF LK +++L R I++ CC++HN +++ G D
Sbjct: 188 ELFNLRHSSLRVTVERAFGSLKRRFKVLDDATPFFPFRTQVDIVVACCIIHNWVVNDGID 247
Query: 354 QL 355
+L
Sbjct: 248 EL 249
>gi|116268418|gb|ABJ96375.1| hypothetical protein [Prunus persica]
Length = 458
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 105/257 (40%), Gaps = 5/257 (1%)
Query: 95 NIEGRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHH 152
N G L L E++ +R LA G S V +G+ST+ + RF +A+E + +
Sbjct: 118 NCAGNLGLLPEQKFTAVIRMLAYGSSADQVDEIARMGKSTILESLVRFCDAVETLYTRDY 177
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
L+ P ++ + K ES G G+ID H D+ +++ S++++
Sbjct: 178 LRKPTPRDLQRLLQK-AESRGFPGMIGSIDCMHWQWKNCPTAWQGDYGNRKGQKSIILEA 236
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+ + G G N +L S F G+ N I++ + Y+ G
Sbjct: 237 VACFDTWVWHAFFGVAGSQNDLNVLGQSPVFNDVLRGEAPNITYEINNTIYQTGYYLADG 296
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
+ YP + + + S F E R + F L+ W I+ D+
Sbjct: 297 I-YPRWTTFVKTI-PHPRSHKEKFFACYQEGYRKDVERCFGILQARWAIIRGAARLFDEE 354
Query: 333 KLPSIILVCCLLHNIII 349
L SI++ C +LHN+I+
Sbjct: 355 VLRSIMMTCIILHNMIV 371
>gi|340387209|ref|XP_003392100.1| PREDICTED: hypothetical protein LOC100642039, partial [Amphimedon
queenslandica]
Length = 209
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 71/137 (51%), Gaps = 6/137 (4%)
Query: 57 SDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
SD + F+Y R+S++ FD++ S V L R + I +S +++ + LR LA+
Sbjct: 57 SDTDSHFRYL-RMSREKFDFLLSQVAPSLKRRHYTSSIRPN---ISPAEKLTLTLRFLAA 112
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSNR-MEEIKSKFEESFGL 174
G SQ S+ +F + ++V + + EA+ + ++ P S +I +FE+ +
Sbjct: 113 GVSQASLSFSFRISSASVCNIVYETCEAIWSALQPMYVTAPTSKEDWLKISKEFEQLWNF 172
Query: 175 LNCCGAIDATHIIMTLP 191
+C GAID H+++ P
Sbjct: 173 PHCIGAIDGKHVVIQAP 189
>gi|301106486|ref|XP_002902326.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262098946|gb|EEY56998.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 364
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/199 (23%), Positives = 83/199 (41%), Gaps = 22/199 (11%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLV-----SRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
F FR+ + F + +LV+ D S+P G E +L++ K + +
Sbjct: 86 FHDHFRLDRDVFFRLVALVKHDSAFCSKGSKPFRG--GPELHMLTLLKFLGT----YGND 139
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNC 177
+ V +G+ G+ +V +R + + + WP+ I + ++ +G +NC
Sbjct: 140 NTAVKLGLFLGIASGSVHNYFFRATNVVLKLEPVTMAWPNEEERVMISLRMQQKYGFVNC 199
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV--DHEMRFIDIVTGWPGG----- 230
G D T + +D+ ++ +YS V G+V D R +V GWPG
Sbjct: 200 VGITDGTLFPLAAKPQHQGEDYFSRKASYS--VHGLVTCDDVGRVRSLVVGWPGSTHDNR 257
Query: 231 --MNVSRLLKFSGFFKLCE 247
MN +LK + FK E
Sbjct: 258 AWMNSPLILKRADHFKHNE 276
>gi|326509193|dbj|BAJ86989.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 418
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 111/293 (37%), Gaps = 47/293 (16%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
+ FRVS TF Y+ L+ + L+S P L +A+AL RLASG +
Sbjct: 115 YPLAFRVSAPTFHYLSGLL-DPLLSHP----------SLPCATLLALALARLASGLPYPA 163
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
+ F V S + R L + L +P S S L +C GA+
Sbjct: 164 LAALFRVPPSAPRAASRRLRRVLLANFRFWLAFPS-------DSTAPSSSPLPSCRGALA 216
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
+ Q + R + + G+ G +L+ S
Sbjct: 217 CARF---------------AGPGGPLAAQLVAGASSRILSLTAGFRGDRADLEVLRLSSL 261
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVG-GVGYPLLSWLITPYETNGLSASMPT-FNSL 300
++ E G+ L+ +Y+VG G GYPLL WL+ P+ S FN+
Sbjct: 262 YQELEQGRLLDPA-----------QYLVGDGNGYPLLRWLMVPFHGPVAPGSPEAHFNAA 310
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGD 353
H A A +A L GW ++++ R + I C +LHN+++ D
Sbjct: 311 HRAMCRPARRAVRSLM-GWGAIARLHEEESPRAAVTCIGTCAMLHNVLLARED 362
>gi|449679552|ref|XP_002158813.2| PREDICTED: uncharacterized protein LOC100207042 [Hydra
magnipapillata]
Length = 381
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 82/164 (50%), Gaps = 19/164 (11%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+S KT + + S V +S+ ++ + + EK++ IALR L +G++Q+++ +
Sbjct: 78 FRMSPKTLEVLLSWV-APFISKS-----SLRRAVATAEKRLCIALRYLVTGDAQITIAIT 131
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM-EEIKSKFEESFGLLNCCGAIDATH 185
F + +TV ++ IE K P S + EI S+F E + C GAID H
Sbjct: 132 FRMRPTTVVRI---IIET--------YKTPSSQKEWLEISSEFYERWNFPYCLGAIDGKH 180
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+I+ PA ++ + + + +S+++ + + F + G G
Sbjct: 181 LIIQSPA-RSGSMYFNYKKTFSIVLLATCNAKYEFTLVDVGGSG 223
>gi|403164564|ref|XP_003324654.2| hypothetical protein PGTG_06191 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165217|gb|EFP80235.2| hypothetical protein PGTG_06191 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 404
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 1/110 (0%)
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIM 188
VG+ TV + + R I A+ + ++ +L WPD R +E+ S + G C G +D T I +
Sbjct: 136 VGRGTVVKASRRVIRAINDLSEKYLMWPDEVRRKEV-SDVMKCEGFEGCVGFVDGTTIPL 194
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+ + DQ+ +YS+ Q + D + +TGWPG S + K
Sbjct: 195 YQRPSIDGEVFFDQKKHYSINCQVVCDCDRFITAYMTGWPGSCGNSMVFK 244
>gi|328707269|ref|XP_003243347.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 130
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 64/128 (50%), Gaps = 15/128 (11%)
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFE---- 169
+A+ E+ +G F + ST +V + ++E + ++WP KF
Sbjct: 1 MANCETHRQIGNRFNIADSTSHKVVLNCLNNMKELSGKFIRWPRGQEAIITVQKFNCLRP 60
Query: 170 ESF-GLLNCCGAIDATHIIMTLP-AVQTSDDWCDQENNY------SMLVQGIVDHEMRFI 221
+F G+L GA+D HI + P +T + D+ Y S+L+QGIVD +++FI
Sbjct: 61 NAFPGVL---GAVDGCHISILAPWEKRTVMEKLDRNMFYNRKQVPSVLLQGIVDSDLKFI 117
Query: 222 DIVTGWPG 229
D+ +GWPG
Sbjct: 118 DVFSGWPG 125
>gi|384490010|gb|EIE81232.1| hypothetical protein RO3G_05937 [Rhizopus delemar RA 99-880]
Length = 433
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/251 (21%), Positives = 106/251 (42%), Gaps = 4/251 (1%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSN 159
S +++ ALR+LA G S V + +S+ F EA+ K +L+ P+++
Sbjct: 118 FSPYQKMTAALRQLAYGYSADCVDEYLHMSESSAMMWLKLFCEAVISVYKDEYLRRPNAD 177
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
++ + + EE G G+ID TH + + ++ + +++++ + +++
Sbjct: 178 DIKRLLAVGEER-GFPGMIGSIDCTHWVWKNCPTAWHGQYIGKDGSPTIILEAVASYDLW 236
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
G PG +N +L S F G+ N ++ + Y+ G+ YP +
Sbjct: 237 IWHAFFGSPGSLNDLNVLDRSPVFNEVAQGKSPKANFVVNGHNYDYGYYLADGI-YPKYA 295
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
+ ++ F E R +AF LK + I+ + K +I+
Sbjct: 296 AFVKTIPC-PITPKAKIFAQKQEEARKDVERAFGVLKARFAIVREAARLWKKPDFCNIMQ 354
Query: 340 VCCLLHNIIID 350
C +LHN+II+
Sbjct: 355 TCIILHNMIIE 365
>gi|195034814|ref|XP_001988980.1| GH10283 [Drosophila grimshawi]
gi|193904980|gb|EDW03847.1| GH10283 [Drosophila grimshawi]
Length = 435
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 83/343 (24%), Positives = 131/343 (38%), Gaps = 63/343 (18%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDL----VSRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
+ F+ ++ + FD + SLV E L + RP I E RL + L L++
Sbjct: 89 DDFRSRTGMTPQVFDMLFSLVGEQLNKNSIRRP----ILPECRLF-------LTLMYLSN 137
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
ES + + AF +G STV + +E L A L P + + I + ++ + +
Sbjct: 138 AESILDLSKAFKMGSSTVRAIIQETVEVLWNTLAPAFLPQPAVDDWQRIVAGYQAEWQVP 197
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GA+DAT + ++ + EN+ +L D IDI T ++
Sbjct: 198 HCLGAVDATLLDISYKS----------ENDIVLLASCDADLNFISIDIATLTDP---INN 244
Query: 236 LLKFSGFFKLCEAGQRLNGN---------VRISSEEVELREYIVGGVGYPLLSWLITPYE 286
L + F Q LNG + + +P LITPY
Sbjct: 245 QLDWHQNF----GSQLLNGRWTDLLPAKPLPSGEPHTSCVYSFIANTMFP-CEHLITPYT 299
Query: 287 TNGLSASMPTFNSLHEATRSLA---VKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL 343
L + FN L +R+LA KAF L WR+L++ + R K I+ +
Sbjct: 300 DQQLDENQMNFNRL--LSRALAGSIDKAFSVLMSRWRVLAQPL-RQSPSKAEKIVKAAVV 356
Query: 344 LHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQVDPMGRTTRE 386
LHN + HD Y P+ TTR+
Sbjct: 357 LHNFV--------------KLHDENYAPASYLTELPLISTTRQ 385
>gi|390345323|ref|XP_795221.2| PREDICTED: uncharacterized protein LOC590528 [Strongylocentrotus
purpuratus]
Length = 241
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/183 (22%), Positives = 86/183 (46%), Gaps = 5/183 (2%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIK 165
+++ LR LA+G + + F VG+ T+ + + ++E A + P +N E+
Sbjct: 44 LSVFLRHLATGATYAELSYNFRVGKETIQRFVYVARAIVDEYAAEVISLPTTNEGWLEVA 103
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVT 225
FE + L +C GA D HI + P ++ + + + +S+++ +VD + +F+ I
Sbjct: 104 GDFEARWNLPHCLGAYDGKHIRLQKPN-KSGSLYFNYKRFFSVVLMALVDSKYQFLWIDV 162
Query: 226 GWPGGMNVSRLLKFSGFFKLCEAGQR---LNGNVRISSEEVELREYIVGGVGYPLLSWLI 282
G + +++ K S + EAG + EE + + VG + + ++++
Sbjct: 163 XGVGHQSDAQIYKNSELKECIEAGTLGIPDPAPLPHDDEEHPMPYFFVGDDAFTMRTYMM 222
Query: 283 TPY 285
PY
Sbjct: 223 KPY 225
>gi|357141517|ref|XP_003572253.1| PREDICTED: uncharacterized protein LOC100846826 [Brachypodium
distachyon]
Length = 416
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 65/263 (24%), Positives = 106/263 (40%), Gaps = 30/263 (11%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+S + +AI L L ES F T+ + + + A H+LK D N
Sbjct: 101 ISSYESLAIFLWILGGCESNRRTQNRFKHSGDTIHHKFHEVLLCVIKMAAHYLKPKDPNF 160
Query: 161 ME---EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
I++ L +C GAID THI ++PA + + + + V I D +
Sbjct: 161 HSVHPTIRNDRMAFPHLKDCIGAIDGTHIRASIPA-DKKIRYVGRSDTTTQNVLAICDFD 219
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
M F G PG M+ + +L + L + S + + Y+V YP
Sbjct: 220 MHFTYASIGQPGSMHDTSVLYHA-----------LEKDKDTFSHPPKGKYYLVDA-DYPN 267
Query: 278 LSWLITPYE---------TNGLSASMPT--FNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
+ PY+ G + + P FN +H + R+ +AF LK W+IL K+
Sbjct: 268 RPGYLAPYKGERYHVPDFYRGAAPNTPKEKFNKIHSSKRNAIERAFGVLKNKWQILLKMP 327
Query: 327 WRP-DKRKLPSIILVCCLLHNII 348
P + +K+ I+ LHN +
Sbjct: 328 NYPIEGQKM--IVAAAMTLHNYV 348
>gi|148910870|gb|ABR18486.1| putative ribosomal protein [Triticum turgidum]
Length = 348
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 56/265 (21%), Positives = 113/265 (42%), Gaps = 19/265 (7%)
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA-LEERAKHH 152
+N EG LS ++ A+R+LA G S+ +G++TV F++ ++ +
Sbjct: 27 LNREG--LSPLQKCTAAIRQLAYGTPADSLDEYLKIGETTVVDCLKNFVQGVIDTYVAEY 84
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY-SMLVQ 211
L+ P++ E + E G G++D H + + + S++++
Sbjct: 85 LRGPNTEDTELLLC-IGEGRGFPGMLGSLDCMHWRWKKCPIAWKGQFTRGYKGWPSLILE 143
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVG 271
+ ++R G G N +L S F GQ N ++ +E + Y+V
Sbjct: 144 AVASQDLRIWHAYFGIAGSNNDINVLNQSPLFIKQLKGQAPRANFFVNDKEYQHGYYLVD 203
Query: 272 GVGYPLLSWLITPYETNGLSASMPT------FNSLHEATRSLAVKAFLQLKGGWRILSKV 325
G+ YP + + + SMP F + + R +AF L+G + IL ++
Sbjct: 204 GI-YPEWAAFVK-------TISMPQTEKHKLFAAHQKGARKDVERAFGVLQGRFSILRRL 255
Query: 326 MWRPDKRKLPSIILVCCLLHNIIID 350
D+ + +++ C +LHN++++
Sbjct: 256 ARLYDRGDIQKVMIACIILHNMVVE 280
>gi|242060480|ref|XP_002451529.1| hypothetical protein SORBIDRAFT_04g003338 [Sorghum bicolor]
gi|241931360|gb|EES04505.1| hypothetical protein SORBIDRAFT_04g003338 [Sorghum bicolor]
Length = 222
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 83/191 (43%), Gaps = 26/191 (13%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID TH++ +P + ++ + + V +VD +++F ++ GW G + +
Sbjct: 1 DCIGAIDGTHVLARVPR-NMQQAFRGRKKDPTQNVMVVVDFDLKFTYVLAGWEGSAHDAL 59
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE--------- 286
+L +A +R +G +Y + GY + + PY
Sbjct: 60 ILS--------DAIERQDGFTMPQG------KYYLVDAGYACKNGFLPPYRGVRYHLSEF 105
Query: 287 -TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLH 345
+N + + FN H + R +A LK +RIL + K +L +++ CC++H
Sbjct: 106 GSNRPTNAQELFNLRHSSLRVTVERAIGALKNRFRILDNKPFHKYKTQL-KLVIDCCIIH 164
Query: 346 NIIIDSGDQLH 356
N I+ G H
Sbjct: 165 NWILGFGTDEH 175
>gi|331228853|ref|XP_003327093.1| hypothetical protein PGTG_08870 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/260 (22%), Positives = 110/260 (42%), Gaps = 28/260 (10%)
Query: 106 QVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRM 161
Q+A+A+ RL S G + + F VG T+ T R I A+ + WP +S R+
Sbjct: 113 QIAVAVCRLGSNGNGSAIYRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTESERI 172
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
E + EE F C G +D T I ++ + + D++ Y + V I D +FI
Sbjct: 173 ESSQVMREEGFP--GCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYCISVTVICDINKKFI 230
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
+ G+PG + + + F + Q+ + + +++++ Y ++
Sbjct: 231 SYLAGFPGSSHNAYV------FSHMQVAQQ-------PEKYFDRKQFLLADSAYTNDRYV 277
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVK-AFLQLKGGWRILSKVMWRPDKRKLPSI--- 337
+ ++ L + H A + ++ A LKG + L ++ R R +
Sbjct: 278 VPAFKGKQLLKRRNIDFNYHLAQSRVRIEHAIGILKGRFASLREI--RTQIRNAEEMKGA 335
Query: 338 ---ILVCCLLHNIIIDSGDQ 354
I+ C +LHN++ D DQ
Sbjct: 336 VKWIVTCIVLHNLLADLKDQ 355
>gi|125537378|gb|EAY83866.1| hypothetical protein OsI_39087 [Oryza sativa Indica Group]
Length = 717
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 73/187 (39%), Gaps = 25/187 (13%)
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C GAID T+I +T+ T DD+ +++ S V I D MRF + TG G + +
Sbjct: 494 CIGAIDGTYIPVTV-LDPTHDDFINRKGFTSQNVLAICDMGMRFTFVATGKRGAAHDMAV 552
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-------- 288
R N +Y + GYPL + PY
Sbjct: 553 F-------------REAVNTADHFPHPPPGKYYLVDSGYPLREGYMAPYRKTRYHLKQFN 599
Query: 289 --GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
G FN H R++ ++F LK W+IL + P K II+ C LHN
Sbjct: 600 GKGPENLCEIFNYHHSCLRNVVERSFGVLKNKWKILKGIPLYP-MEKQSKIIVACFALHN 658
Query: 347 IIIDSGD 353
+D+ +
Sbjct: 659 FAMDNNE 665
>gi|348676154|gb|EGZ15972.1| hypothetical protein PHYSODRAFT_316091 [Phytophthora sojae]
Length = 306
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 59/128 (46%), Gaps = 13/128 (10%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK 165
++A+ L RL + + G + G+G T+ T R AL + + ++KWP + +
Sbjct: 70 RLAVTLDRLGT----TANGSSLGLGTGTLDVFTARITIALIDMSDEYIKWPSPDERSQTA 125
Query: 166 SKFEESFGLLNCCGAIDATHIIMTLPAVQT----SDDWCDQENNYSMLVQGIVDHEMRFI 221
+ G C G ID T T P VQ +C++++ YS+ Q + D R I
Sbjct: 126 RRMRRE-GFPGCVGFIDGT----TFPFVQKPGVDGQCFCNRKHRYSLNGQVVCDVHRRII 180
Query: 222 DIVTGWPG 229
+GWPG
Sbjct: 181 AFYSGWPG 188
>gi|357141107|ref|XP_003572087.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 377
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 112/280 (40%), Gaps = 40/280 (14%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+QVAI L +A + + F T+ + R +EA+ +++ P +
Sbjct: 88 VSVEEQVAIFLYAVAKNATNETFQDWFQHSPDTIHRHFKRVLEAITNLTPIYIRPPSLHP 147
Query: 161 MEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ K KF F NC GA+D T I M LP Q + + +++ S D +++
Sbjct: 148 HAIMRKPKFYPFFK--NCIGAVDGTDIPMKLPLDQ-QESYRNRKQTISQNAMVACDFDLK 204
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
F+ I W G + +R+L+ LN EV ++ + GY
Sbjct: 205 FVHINPDWEGSASDARVLQ-----------DALNHGF-----EVPDGKFYLVDAGYANTP 248
Query: 280 WLITPYE-------TNGLSASMPT-----FNSLHEATRSLAVKAFLQLKGGWRI---LSK 324
+ PY G + P FN H R+ ++ G W++ + K
Sbjct: 249 QFLAPYRGTRYHLNEQGRARQKPQNHKELFNLRHAQLRN----HIERIIGIWKMRFPILK 304
Query: 325 VMWRPDKRKLPSIILVCCLLHNII-IDSGDQLHPDVALSD 363
V K K I + C +LHN I I +GD P A D
Sbjct: 305 VASHFPKEKQIDISVACAVLHNFIRIHNGDMTWPSDATMD 344
>gi|357130831|ref|XP_003567049.1| PREDICTED: uncharacterized protein LOC100824798 [Brachypodium
distachyon]
Length = 225
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 73/154 (47%), Gaps = 18/154 (11%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
NC GA+D TH+ + +P V + ++ S V G+ +M++I ++ GW G + R
Sbjct: 77 NCLGALDGTHVKVIVP-VDIKRRYRSRKAEISTNVLGVCGPDMKYIYMLPGWEGSAHDGR 135
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREY-IVGGVGYPLLSWLITPYETNGLSASM 294
+L+ +A R NG +R+ +++ L + G G+ L + Y G +A
Sbjct: 136 VLR--------DAISRPNG-LRVPADQYYLVDARYTNGKGF-LAPYRGQRYHIGGWTAQN 185
Query: 295 PT------FNSLHEATRSLAVKAFLQLKGGWRIL 322
P FN H R++ ++F ++K W IL
Sbjct: 186 PPNSAEEYFNMCHAKARNIVERSFARIKNKWAIL 219
>gi|328714499|ref|XP_003245376.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 180
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 80/162 (49%), Gaps = 16/162 (9%)
Query: 206 YSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVEL 265
+S+ VQ +V+ E++F+DIV WPG + S + + S F E+G+
Sbjct: 15 FSLNVQALVNSELQFMDIVARWPGSAHDSHIFRNSRLFARLESGEF-------------Q 61
Query: 266 REYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
+ I+G GY L +L+TP +N + +N TR++ ++F K + +LS +
Sbjct: 62 KMAILGDSGYALKPYLLTPI-SNPVGRIQMLYNESQIRTRNVVERSFGVWKRRFPVLS-L 119
Query: 326 MWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPD-VALSDHHD 366
R + + +II+ +LHNI + + L D L++ H+
Sbjct: 120 GLRLQLKTVQAIIVATAVLHNICREMKEDLPNDNFELTEPHN 161
>gi|221123821|ref|XP_002154997.1| PREDICTED: uncharacterized protein LOC100209168 [Hydra
magnipapillata]
Length = 338
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 56/263 (21%), Positives = 106/263 (40%), Gaps = 26/263 (9%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA----LEERAKHHLKWP 156
+S +++A+ LR L +G++Q ++ ++ + ST S++ A L+ER H+
Sbjct: 5 VSPSERLAVTLRFLVTGDAQCTIAASYRISASTTSRIISETCAAIWTSLKERNFLHVP-S 63
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ + I +FE + + GAID HI+M P ++ + + +S+++ + +
Sbjct: 64 EKQEWKTIAKEFENMWNFPHAIGAIDGKHIVMQAPH-NGGSEYFNYKKTHSIVLLAVCNA 122
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV-------RISSEEVELREYI 269
+ F + G G + + F C G + N I + E L +
Sbjct: 123 KYEFTMVDIGDSGRQSDGSV------FNNCSLGCAIENNKLNIPDPEYIGNSEKVLPYVL 176
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
V + L ++ PY + FN R + AF +RI RP
Sbjct: 177 VADDAFGLKRHMMKPYPNQNIPLDQKIFNYRLSRARRVIENAFGIATTRFRIFR----RP 232
Query: 330 DKRKLPSIILVC---CLLHNIII 349
L +IL+ LHN ++
Sbjct: 233 IIANLEKVILITQAIVALHNFLM 255
>gi|357144148|ref|XP_003573189.1| PREDICTED: uncharacterized protein LOC100829013 [Brachypodium
distachyon]
Length = 460
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 64/319 (20%), Positives = 128/319 (40%), Gaps = 53/319 (16%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+ F ++C L RE + + + SVE+QVA+ L + ++
Sbjct: 70 LRMKGAPFFHLCKLFRERKLLK--------DSIHTSVEEQVAMFLLVVGHNTRFRALQPI 121
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDAT 184
F +S+ + A+ E ++ P + +I+ ++F F +C GAID T
Sbjct: 122 FRRSTEVISRYFKAVLYAVGELRDEMIRPPSTQIHPKIQENNRFNPYFK--DCIGAIDRT 179
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
H++ +P + S + +++ + V V +++F ++ GW G + + +
Sbjct: 180 HVLARVP-INISAAFRGRKDGTTQNVMAAVGFDLKFTYVLAGWEGSAHDALI-------- 230
Query: 245 LCEAGQRLNG---------NVRISSEEVEL----------REYIVGGVGYPLLSWLITPY 285
L +A +R +G N+ ++ + + Y+V GY + + PY
Sbjct: 231 LADAVERNDGLSLPPDNNLNLLVTYFSIVFAVANSCNFIGKFYLVDA-GYAVRPGFLPPY 289
Query: 286 ET-----------NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKL 334
N FN H + R +AF K W+ + + P K ++
Sbjct: 290 RRTRYHLKEYGGGNHPKNYRELFNLRHSSLRISVERAFAAYKNHWKFVYNKPFHPYKTQV 349
Query: 335 PSIILVCCLLHNIIIDSGD 353
+++ C +LHN I+ G+
Sbjct: 350 -KVVIACAVLHNWILQFGE 367
>gi|449689778|ref|XP_004212142.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 289
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 49/202 (24%), Positives = 89/202 (44%), Gaps = 15/202 (7%)
Query: 129 VGQSTVSQVTWRFIEALEERAKH-HLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHI 186
+G++TV + +A+ K +LK P + N EE+ ++FE + NC GAID H+
Sbjct: 1 LGRTTVCNIINETTKAIWNVLKPCYLKAPSTLNEWEELANQFENEWNFPNCIGAIDGKHV 60
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
+ P++ + + + + +SM++ I D + F + G G N + + F
Sbjct: 61 CIEAPSL-SGLAYYNYKIFHSMVLLAICDAKYCFTLVDIGSYGRDNDASI------FNES 113
Query: 247 EAGQRLNGNV------RISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
+ G+ N+ R+ S ++ +VG + L SWL+ P+ L+ FN
Sbjct: 114 KMGKAFKNNLFKLPKNRMLSNGKQVPPVLVGDDTFALKSWLMKPFSGKNLTIKERIFNYR 173
Query: 301 HEATRSLAVKAFLQLKGGWRIL 322
TR F + WR+
Sbjct: 174 LSRTRRTIENTFGIMVTKWRVF 195
>gi|302787010|ref|XP_002975275.1| hypothetical protein SELMODRAFT_415424 [Selaginella moellendorffii]
gi|300156849|gb|EFJ23476.1| hypothetical protein SELMODRAFT_415424 [Selaginella moellendorffii]
Length = 251
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 63/137 (45%), Gaps = 26/137 (18%)
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYP 276
+ RF+ + GWPG ++ R+ S A + L ++ ++ +++ Y+VG GY
Sbjct: 87 QKRFLSVYAGWPGRIH-DRVFHNSTLRNKIVARELLTSPAMLAYDD-KIKPYLVGDKGYQ 144
Query: 277 LLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS 336
L L+ P+ T + S TFN H L+++ + LP+
Sbjct: 145 LQQHLMIPHTTPNPTPSKATFNLHHH-------------------LAQI-----PKYLPN 180
Query: 337 IILVCCLLHNIIIDSGD 353
II CC+LHN +ID G+
Sbjct: 181 IIKSCCILHNFLIDVGE 197
>gi|449693121|ref|XP_004213309.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 184
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 74/141 (52%), Gaps = 3/141 (2%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPD 157
+ +S +++ + LR LA+G+SQ + F + +TV + +A+ + + +LK P+
Sbjct: 26 QTISPSERLTVTLRYLATGDSQQTQSFYFRLDTTTVCSIINETTKAIWDVLQPSYLKAPE 85
Query: 158 S-NRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
S + E+I ++FE + NC GAI H+ + +V + + + +N +SM++Q I D
Sbjct: 86 SSDEWEKIANEFENEWNFPNCIGAIVGKHVCIE-ASVSSGSAYYNNKNYHSMVLQAICDA 144
Query: 217 EMRFIDIVTGWPGGMNVSRLL 237
+ F + G G N + +
Sbjct: 145 KYCFTLVDIGSYGRDNDASIF 165
>gi|328722179|ref|XP_003247501.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 338
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 84/182 (46%), Gaps = 27/182 (14%)
Query: 179 GAIDATHIIMTLPAVQTSDD----WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
G ID THI + P Q ++ + +++ +S+ VQ I D ++ + + +PG S
Sbjct: 116 GCIDCTHIAIVPPNNQNAEIPEYIYVNRKGYHSLNVQLICDSKLYILSVNAKFPGSTADS 175
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREY----IVGGVGYPLLSWLITPYETNGL 290
+ + +++ EE+ REY ++G GY L WL+TPY +
Sbjct: 176 HIWNSTT-------------DIKNILEELYRREYKGIYLLGDSGYALRPWLLTPY-SETT 221
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK---VMWRPDKRKLPSIILVCCLLHNI 347
+ +N + +TRS+ + LK +R L K + + P +K I+ C +LHN+
Sbjct: 222 TNEEEKYNKMQMSTRSIIERCNGVLKARFRCLLKHRVLHYTP--KKSGKIVNACVILHNM 279
Query: 348 II 349
I
Sbjct: 280 CI 281
>gi|55168239|gb|AAV44105.1| unknown protein [Oryza sativa Japonica Group]
Length = 1220
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 61/263 (23%), Positives = 111/263 (42%), Gaps = 30/263 (11%)
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAID 182
V F TVS+ + + A+ E ++ P + +I +C GAID
Sbjct: 180 VRTNFDRSGETVSRYFNKVLHAIGELRDELIRPPSLDTPTKIAGNPRWDPYFKDCIGAID 239
Query: 183 ATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
THI ++ V++S + ++++ + V VD ++RF ++ GW G + + +L+
Sbjct: 240 GTHIRASVRKNVESS--FRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTTHDAVVLR--- 294
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE----------TNGLS 291
+A +R N S V +Y + GY + P+ N +
Sbjct: 295 -----DALEREN------SLRVPQGKYYLVDAGYGAKQGFLPPFRAVRYHLNEWGNNPVQ 343
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN H + R +AF LK +++L R I++ CC++HN +++
Sbjct: 344 NEKELFNLRHSSLRVTVERAFGSLKRRFKVLDDATPFFPFRTQVDIVVACCIIHNWVVND 403
Query: 352 G-DQL--HPDVALSDHHDSGYGE 371
G D+L PD + D +S G+
Sbjct: 404 GIDELIAPPDWSSEDIDESLTGQ 426
>gi|198475885|ref|XP_002132210.1| GA25327 [Drosophila pseudoobscura pseudoobscura]
gi|198137450|gb|EDY69612.1| GA25327 [Drosophila pseudoobscura pseudoobscura]
Length = 388
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 76/325 (23%), Positives = 127/325 (39%), Gaps = 40/325 (12%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLV---SRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
++ FK FR+ + TF+ + +V + P +++ +LL L L
Sbjct: 66 DDQFKERFRMQRSTFETLLQVVGRAIAGAEQYQPIASVSLPEKLL-------YTLMLLGG 118
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLN 176
G S G + + +S+ ++ A +++WP S + G+L
Sbjct: 119 GISFRDAGEIYSISKSSGHEIFKWVTSAFSALMPCYVQWPKYGAAGVGSSSISKLPGVL- 177
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
G I+ I + LP ++ DD + + ++ +Q + D + RF+D+ P L
Sbjct: 178 --GVIEECRIPLKLP-IRIDDD--NPKQYPALTLQAVCDAQSRFLDVYVDAPNDDQHCIL 232
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
L F KL L R +IVG YPLL L+TPY L+
Sbjct: 233 LASPLFEKLIHTQTPLMAEHR----------HIVGDKTYPLLLNLMTPY-VGELTPCHAQ 281
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKL---PSIILVCCLLHNIIIDSG- 352
+N + A +A L ++ L + D L ++I C+LHN I+DSG
Sbjct: 282 YNLAIRLWNAPAERALATLISRFKRLDSL----DVSSLELGTTVISAACMLHNFILDSGV 337
Query: 353 -----DQLHPDVALSDHHDSGYGEQ 372
D L D+ +H Y E
Sbjct: 338 EGMEDDILAFDMLPENHKKIPYDEH 362
>gi|55167956|gb|AAV43825.1| putative polyprotein [Oryza sativa Japonica Group]
Length = 1067
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 86/192 (44%), Gaps = 28/192 (14%)
Query: 176 NCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVS 234
+C GAID THI ++ V++S + ++++ + V VD ++RF ++ GW G + +
Sbjct: 80 DCIGAIDGTHIRASVRKNVESS--FRGRKSHATQNVMAAVDFDLRFTYVLAGWEGTTHDA 137
Query: 235 RLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE-------- 286
+L+ +A +R N S V +Y + GY + P+
Sbjct: 138 VVLR--------DALEREN------SLRVPQGKYYLVDAGYGAKQGFLPPFRAVRYHLNE 183
Query: 287 --TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
N + FN H + R +AF LK +++L R I++ CC++
Sbjct: 184 WGNNPVQNEKELFNLRHSSLRVTVERAFGSLKRRFKVLDDATPFFPFRTQVDIVVACCII 243
Query: 345 HNIIIDSG-DQL 355
HN +++ G D+L
Sbjct: 244 HNWVVNDGIDEL 255
>gi|322782708|gb|EFZ10552.1| hypothetical protein SINV_02440 [Solenopsis invicta]
Length = 241
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 68/154 (44%), Gaps = 32/154 (20%)
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
++WP + + ++ F G+ N GAID T I + P++ E ++ +
Sbjct: 5 IRWPQKDYIA-VREDFRAKKGINNVIGAIDGTTIRIEKPSIN--------EKDFIATGKR 55
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
+VD ++F +I G PG ++ +R K + F + +I+G
Sbjct: 56 VVDANIKFTNIYCGEPGSLHDARY-KETLFPE---------------------NTFILGD 93
Query: 273 VGYPLLSWLITPYETNG-LSASMPTFNSLHEATR 305
Y LSWL+ P+ NG L+ FN LH +TR
Sbjct: 94 SAYASLSWLVPPFRDNGHLTLQQKEFNFLHSSTR 127
>gi|241651513|ref|XP_002411273.1| hypothetical protein IscW_ISCW020253 [Ixodes scapularis]
gi|215503903|gb|EEC13397.1| hypothetical protein IscW_ISCW020253 [Ixodes scapularis]
Length = 120
Score = 55.1 bits (131), Expect = 5e-05, Method: Composition-based stats.
Identities = 36/96 (37%), Positives = 52/96 (54%), Gaps = 9/96 (9%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRF----IEALEERAKHHLKWP 156
++VEK+VAI+L RL S + ++G F VGQS V++ F IE LE R ++
Sbjct: 5 ITVEKRVAISLYRLCSTAEERTIGHHFAVGQSVVNETYREFCDVVIEELEARTITMIQNE 64
Query: 157 D-SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
D N M E F+ G N GA+DA H+ ++ P
Sbjct: 65 DLDNHMRE----FQAVLGFPNAIGALDACHLPVSPP 96
>gi|390343604|ref|XP_798400.2| PREDICTED: uncharacterized protein LOC593848 [Strongylocentrotus
purpuratus]
Length = 336
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 92/200 (46%), Gaps = 21/200 (10%)
Query: 58 DEEEGFKYFFRVSKKTFD----YICSLVREDLVSRP--PSGLINIEGRLLSVEKQVAIAL 111
++ FK + RVS + FD + +++D R P GL ++++ L
Sbjct: 121 NDRRKFKNYTRVSAEMFDDLLVRLTPHLQKDTHYRKAIPPGL------------KLSVFL 168
Query: 112 RRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNR-MEEIKSKFE 169
R LA+G + + F VG+ T+ + A+ +E A + P +N+ E+ FE
Sbjct: 169 RHLATGATYAELSYNFRVGKETIQKFVPDVARAIVDEYAAEVISLPTTNKGWLEVAGDFE 228
Query: 170 ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+ L +C GA D HI + P ++ + + + +S+++ +VD + F+ I G G
Sbjct: 229 ARWNLPHCLGAYDGKHIRIKKPN-KSGSLYFNYKQFFSVVLMALVDSKYHFLLIDVGGVG 287
Query: 230 GMNVSRLLKFSGFFKLCEAG 249
+ +++ S + EAG
Sbjct: 288 HQSDAQIYNNSELKECIEAG 307
>gi|331247668|ref|XP_003336461.1| hypothetical protein PGTG_18132 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 60/260 (23%), Positives = 110/260 (42%), Gaps = 28/260 (10%)
Query: 106 QVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRM 161
Q+A+A+ RL S G + + F VG T+ T R I A+ + WP +S R+
Sbjct: 113 QIAVAVCRLGSNGNGSAIHRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTESERI 172
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
E + EE F C G +D T I ++ + D++ YS+ + I D +FI
Sbjct: 173 ESSQVMREEGFP--GCVGFVDGTTIPLSQKPPSDGQHYFDRKKRYSISLTVICDINKKFI 230
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
+ G+PG + + + F + Q+ + + +++++ Y ++
Sbjct: 231 SYLAGFPGSSHDAYV------FSHMQVAQQ-------PEKYFDRKQFLLADSAYTNDRYV 277
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVK-AFLQLKGGWRILSKVMWRPDKRKLPSI--- 337
I ++ L + H A + ++ A LKG + L ++ R R +
Sbjct: 278 IPAFKGKHLLKRRNIDFNYHLAQSRVRIEHAIGILKGRFASLREI--RTQIRNAEEMKGA 335
Query: 338 ---ILVCCLLHNIIIDSGDQ 354
I+ C +LHN++ D DQ
Sbjct: 336 VKWIVTCIVLHNLLADLKDQ 355
>gi|115898549|ref|XP_001198475.1| PREDICTED: uncharacterized protein LOC762739 [Strongylocentrotus
purpuratus]
gi|115933623|ref|XP_793588.2| PREDICTED: uncharacterized protein LOC588831 [Strongylocentrotus
purpuratus]
Length = 253
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 91/199 (45%), Gaps = 8/199 (4%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNR-MEE 163
++++ LR LA+G + + F VG+ T+ + A+ E A + P +N E
Sbjct: 54 KLSVFLRHLATGATYAELSYNFRVGKETIQKFVPDVARAVVYEYAAKVISLPTTNEGWLE 113
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
+ FE + L +C GA D HI + P ++ + + + +S+++ +VD + ++ID+
Sbjct: 114 VAGDFEARWNLPHCLGAYDGKHIRLQKPN-KSCSLYFNYKQFFSVVLMALVDSKYQWIDV 172
Query: 224 VTGWPGGMNVSRLLKFSGFFKLCEAGQRL---NGNVRISSEEVELREYIVGGVGYPLLSW 280
G G + +++ S + EAG + EE + + VG + + ++
Sbjct: 173 --GGVGHQSDAQIYNNSELKECIEAGTLGIPDPAPLPHDDEEHPMPYFFVGDDAFAMRTY 230
Query: 281 LITPYETNGLSASMPTFNS 299
++ PY + FNS
Sbjct: 231 MMKPYGRRNMDQQQKIFNS 249
>gi|302759881|ref|XP_002963363.1| hypothetical protein SELMODRAFT_79895 [Selaginella moellendorffii]
gi|300168631|gb|EFJ35234.1| hypothetical protein SELMODRAFT_79895 [Selaginella moellendorffii]
Length = 131
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 57/110 (51%), Gaps = 2/110 (1%)
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFG 173
LAS +S ++ F V STV + +F + L W + +K+ FE G
Sbjct: 1 LASSDSFANLAEHFCVSSSTVWKYCQKFSNTTFQHLGQFLVWLVD--LSIVKTGFENLRG 58
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDI 223
N CGAID H+ + LP ++D+ ++E + +++Q I+D E +F+DI
Sbjct: 59 FSNYCGAIDCMHLEVELPRNAFANDYYNKEKDDLIVMQEIMDSEAKFLDI 108
>gi|449671595|ref|XP_004207527.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 190
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 60/121 (49%), Gaps = 7/121 (5%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ KK I + + D+ I +S E +V I+LR L+SG Q +G
Sbjct: 36 FRLPKKNIRDIVNTLHNDM------SPITSRSNSISSEAKVLISLRLLSSGSFQGVIGDT 89
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATH 185
V Q + S++ +F + ++ +KW +S M EIK K+ +S+G++ G ID T
Sbjct: 90 SHVSQPSCSRILHQFCKYFIHHYRYLVKWYESPEEMMEIKRKYFQSYGVIGLLGVIDGTM 149
Query: 186 I 186
I
Sbjct: 150 I 150
>gi|328719250|ref|XP_003246705.1| PREDICTED: hypothetical protein LOC100574982 [Acyrthosiphon pisum]
Length = 200
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/145 (21%), Positives = 75/145 (51%), Gaps = 6/145 (4%)
Query: 48 SSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQV 107
S+T + +++ + F F R+++ F+ + +V + + + + +L ++
Sbjct: 46 SNTLIKELAAEDTQSFYNFLRMNENMFNTLLEMVNNKIQRQNSVMRMALPAKL-----KL 100
Query: 108 AIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKS 166
IA R LA+G+S ++ + V + ++S+ +A+ E K++++ P + N + I +
Sbjct: 101 QIACRYLATGDSLATLQYMYRVPKCSISKFLPEVFDAIYEALKNYIEVPKNQNEWKNISN 160
Query: 167 KFEESFGLLNCCGAIDATHIIMTLP 191
+F+ + +C GA+D HI++ P
Sbjct: 161 EFDIKWNFPSCIGALDGKHIVIQCP 185
>gi|449545302|gb|EMD36273.1| hypothetical protein CERSUDRAFT_33216, partial [Ceriporiopsis
subvermispora B]
Length = 225
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 96/240 (40%), Gaps = 31/240 (12%)
Query: 57 SDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS 116
+D + F++F RVS FD I + + + S +E V Q+AI LRR+
Sbjct: 7 ADHPDLFRHFVRVSPSVFDAILEKIITNPIFYNESNNAQLE-----VRSQLAIFLRRMGH 61
Query: 117 GESQVSV---GVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFG 173
+ +SV V GV +V+ T R A+ + + P + K + +
Sbjct: 62 YGNAISVVDLAVWAGVSTGSVNNYTHRVAAAITALHREAIHLPTDDERAAAK---QFVYS 118
Query: 174 LLNCCG------AIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGI-VDHEMRFIDIVTG 226
C A+D T I + W ++ ++Y+M +Q + + +R +D G
Sbjct: 119 KTQCTAWTGGIMAVDGTAIKLFEEPALYGKSWYNKNSDYAMSLQVVFMVGSLRVVDYAIG 178
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
G + SR K + ++ + N E+I G YPL +W I P++
Sbjct: 179 NVGSAHDSRAFKDTAIYQHPRLFLKDN-------------EWIFGDSAYPLETWCIVPFK 225
>gi|302787024|ref|XP_002975282.1| hypothetical protein SELMODRAFT_415441 [Selaginella moellendorffii]
gi|300156856|gb|EFJ23483.1| hypothetical protein SELMODRAFT_415441 [Selaginella moellendorffii]
Length = 198
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 36/53 (67%)
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
K+KF+E FG LNC GAID T+ + LP + D+ +++ + S+++Q I+D E
Sbjct: 57 KNKFQEFFGFLNCYGAIDYTYFKIELPGNTFASDYYNKDKDKSIVMQAIIDSE 109
>gi|449689347|ref|XP_004212004.1| PREDICTED: uncharacterized protein LOC101236592, partial [Hydra
magnipapillata]
Length = 206
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 52/254 (20%), Positives = 104/254 (40%), Gaps = 50/254 (19%)
Query: 69 VSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFG 128
+S + ++++ S+V + +P + ++ K++ + LR LA+G+SQ++ F
Sbjct: 1 MSPERYEHLLSMVAPSITKKPCRS-----RQTIAPSKRLTVTLRYLATGDSQLTQAFYFR 55
Query: 129 VGQSTVSQVTWRFIEALEERAKH-HLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDATHI 186
+G++TV + + + K +LK P + N E++ ++FE + NC GAID ++
Sbjct: 56 LGRTTVCNIINETTKTIWNVLKPCYLKAPSTLNEWEKLANQFENEWNFPNCIGAIDGSY- 114
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
++N+ S+ + + + + FKL
Sbjct: 115 --------------GRDNDASIFNESKMGKAFK--------------------NNLFKLP 140
Query: 247 EAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRS 306
+ + NG ++ +VG + L SWL+ P+ L+ FN TR
Sbjct: 141 KNRKLSNG--------TQIPPVLVGDDIFALKSWLMKPFSGKNLAIKERIFNYQLSRTRR 192
Query: 307 LAVKAFLQLKGGWR 320
F + WR
Sbjct: 193 TIENTFGIMVAKWR 206
>gi|390359402|ref|XP_003729473.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 336
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 100/262 (38%), Gaps = 38/262 (14%)
Query: 99 RLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL--EERAKHHLKWP 156
R L ++AI LR L G S S+ F V +++S A+ E + K
Sbjct: 25 RPLEPGMKLAITLRFLVRGNSYRSLAYDFRVAHNSISTFVPEVCTAIYKEYKEKMFNMAY 84
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY----SMLVQG 212
+ + + +F + +CCGAID HI + ++ NY S+++
Sbjct: 85 TQDEWKAVARQFGTRWKFHHCCGAIDGKHIAI-------------KKTNYKRFCSVVLLA 131
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR-------ISSEEVEL 265
IVD F+ G G + + G F L N + ++ +
Sbjct: 132 IVDANYSFLWCKVGANGSSSDA------GVFNKSTLRGALEDNTTGFPVPDPLPGDDRDF 185
Query: 266 REYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR-ILSK 324
+IVG +PL WL+ PY ++ N T + F L WR +L+
Sbjct: 186 PYFIVGDDAFPLRKWLLKPYSRRAMTQKERVMNY---RTHRVVENGFGILAHRWRCLLTT 242
Query: 325 VMWRPDKRKLPSIILVCCLLHN 346
+ P ++ ++L C LHN
Sbjct: 243 LQLEPG--RVIYVVLGCLTLHN 262
>gi|361067113|gb|AEW07868.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157494|gb|AFG61086.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157496|gb|AFG61087.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157498|gb|AFG61088.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157500|gb|AFG61089.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157502|gb|AFG61090.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157504|gb|AFG61091.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157506|gb|AFG61092.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157508|gb|AFG61093.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157510|gb|AFG61094.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157512|gb|AFG61095.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157514|gb|AFG61096.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157516|gb|AFG61097.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157518|gb|AFG61098.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157520|gb|AFG61099.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157522|gb|AFG61100.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157524|gb|AFG61101.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
gi|383157526|gb|AFG61102.1| Pinus taeda anonymous locus 0_13474_01 genomic sequence
Length = 94
Score = 54.3 bits (129), Expect = 9e-05, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 48/92 (52%), Gaps = 7/92 (7%)
Query: 264 ELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT---RSLAVKAFLQLKGGWR 320
+LR Y+ G YPLLS+L+ P+ +NG + P N A +S+ +A LKG WR
Sbjct: 7 QLRPYLAGDWCYPLLSFLLVPFSSNG--SGTPAQNLFDAALIRGKSVVEQAVGLLKGRWR 64
Query: 321 ILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
+L + P I+ CC+LHN+ SG
Sbjct: 65 MLQDL--NVGLSHAPQTIVACCVLHNLCQLSG 94
>gi|2739387|gb|AAC14510.1| En/Spm-like transposon protein [Arabidopsis thaliana]
Length = 292
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 77/194 (39%), Gaps = 25/194 (12%)
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
L C GA+D TH+ + P+ QT+ + ++ +M V I + +M+FI G PG +
Sbjct: 41 FLYCVGALDGTHVPVRPPS-QTAKKYKGRKLEPTMNVLAICNFDMKFIYAYVGVPGRAHD 99
Query: 234 SRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP-----YETN 288
+++L + N S + Y+V GYP + + P Y
Sbjct: 100 TKVLNYCA------------TNEPYFSHPPNGKYYLVDS-GYPTRTGYLGPHRRMRYHLG 146
Query: 289 GLSASMPT------FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCC 342
P FN H RS+ + F K WRI+ + + K I+
Sbjct: 147 QFGRGGPPVTARELFNRKHSGLRSVIERTFGVWKAKWRIVDRKHPKYGLAKWIKIVTATM 206
Query: 343 LLHNIIIDSGDQLH 356
LHN I DS + H
Sbjct: 207 ALHNFIRDSHREDH 220
>gi|356498377|ref|XP_003518029.1| PREDICTED: uncharacterized protein LOC100803756 [Glycine max]
Length = 409
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/297 (22%), Positives = 116/297 (39%), Gaps = 38/297 (12%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ K F +C ++ + + R +S+ +QV + L L+ S +
Sbjct: 77 FRMKKLVFLELCDILETKYNLK--------KTRNVSIYEQVGLFLYMLSQPGSVCNCEER 128
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPD---SNRMEEIKSKFEESFGLLNCCGAIDA 183
F T+S+ +EA+ AK +K D + +EI +C GAID
Sbjct: 129 FQHSGETISRHFHNVLEAVCMFAKDIIKPIDPSFRDAPDEILKDARYRPYFRDCIGAIDG 188
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI + +P+ + ++ + V + D M F + GW G + +++
Sbjct: 189 THIRVCVPS-HLQGVYIGRKGYTTTNVMVVCDFSMCFTFVWAGWEGSAHDTKIFM----- 242
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-----------GLSA 292
+ L S + + Y+V GYP + PY+ +
Sbjct: 243 ------EALRKPALHFSHPPQGKYYLVDS-GYPTFMGFLGPYKKTRYHLPQFRIGPRIRG 295
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCL-LHNII 348
+ FN H + RS +AF K W+IL + P K + I+V C+ +HN I
Sbjct: 296 RVEVFNYYHSSLRSTIERAFGLCKARWKILGNM--SPFALKTQNQIIVACMAIHNFI 350
>gi|356532810|ref|XP_003534963.1| PREDICTED: uncharacterized protein LOC100782190 [Glycine max]
Length = 343
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 87/172 (50%), Gaps = 12/172 (6%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RVSKK F +C +++E G + ++ + + +++ VA+ L LA V +
Sbjct: 45 LRVSKKAFFKLCRILQE-------KGQL-VKTKNVPIDEAVAMFLHILAHNLKYRVVHFS 96
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ T+S+ + A+ + +K +LK+ + N +++K+ N GA+D HI
Sbjct: 97 YCRSMETISRQFKNVLRAIMKVSKEYLKFYEYNLEGSVENKWR---WFKNSIGALDGIHI 153
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+T+ A + + +++ + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 154 PVTVSA-EDRPRYRNRKGDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLR 204
>gi|331252603|ref|XP_003338842.1| hypothetical protein PGTG_20379 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 382
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 69/317 (21%), Positives = 132/317 (41%), Gaps = 32/317 (10%)
Query: 45 AKNSST---PGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLL 101
A+NS+T P + +E F ++ + TF +LV+ L+ + P N
Sbjct: 51 ARNSTTRFDPRYDLSELQELPELSFVQMFRMTFPCFLNLVQ--LIEQNPIFYNNSRNPQR 108
Query: 102 SVEKQVAIALRRLASGESQVSV---GVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
Q+A+A+ RL S + ++ F VG T+ T R I A+ + WP +
Sbjct: 109 DPPIQIAVAVCRLGSNGNSSAIYRLKNLFQVGFGTIDLYTRRVIHAVYGLQSSLVTWPTE 168
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S ++E + +E GL C G +D T I ++ + + D++ YS+ V I D
Sbjct: 169 SEQIESSQVMRDE--GLPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDIN 226
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+FI + G+PG + S + F + Q + + +++++ Y
Sbjct: 227 KKFISYLAGFPGSSHDSYV------FSHMQVAQ-------YPEKYFDQKQFLLADSAYTN 273
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
+++ ++ L + H L + F L+ I +++ + +
Sbjct: 274 NCYVVPAFKGKHLLKRCNINFNYH-----LILSCFASLR---EIRTQICNAEEMKGAVKW 325
Query: 338 ILVCCLLHNIIIDSGDQ 354
I+ C +LHN++ D DQ
Sbjct: 326 IVTCIVLHNLLADLKDQ 342
>gi|321468203|gb|EFX79189.1| hypothetical protein DAPPUDRAFT_319858 [Daphnia pulex]
Length = 191
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 61/134 (45%), Gaps = 17/134 (12%)
Query: 155 WPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV 214
WP R E+ E+F +++ GAIDAT + P Q + ++ ++ +Q +
Sbjct: 48 WPREERYLELS----ENFQIVDSLGAIDATEFEIRQPLNQLPS-YTSRKCKTTIKLQIVS 102
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
H++ ID G+PG + +R+L+ S + G +L V +I+G
Sbjct: 103 THDLEIIDAAVGFPGSIGDARVLRLSPLSRAL--GAKL----------VRSNYHILGDTA 150
Query: 275 YPLLSWLITPYETN 288
YPL L+ P+ N
Sbjct: 151 YPLRQHLLVPFRNN 164
>gi|89257447|gb|ABD64939.1| hypothetical protein 24.t00017 [Brassica oleracea]
Length = 442
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/340 (21%), Positives = 135/340 (39%), Gaps = 23/340 (6%)
Query: 17 KKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDY 76
K+ K+ K + ++ D W +++ IP+ F+ FR++K F +
Sbjct: 45 KEKKRRKQRAYFERNREEGDLRLWNDYFSD--------IPTYPPNLFRRRFRMNKHLFMH 96
Query: 77 ICSLVREDLVSRPPSGLINIEGRL-LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVS 135
I + ++ P + GR+ LS ++ A+R LA G + +V +G +T
Sbjct: 97 IVQRLSNEVQFFQPKK--DALGRVSLSPLQKCTAAIRILAYGNAADAVDEYLRLGATTTR 154
Query: 136 QVTWRFIEAL-EERAKHHLKWP---DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
F++ + + +L+ P D R+ +I E G G+ID H
Sbjct: 155 SCLEHFVDGIISLFGEEYLRRPTPADLQRLLDIG----EYRGFPGMVGSIDCMHWEWKNC 210
Query: 192 AVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQR 251
+ +++++ + +++ G PG +N +L S F GQ
Sbjct: 211 PTAWKGQYTRGLGKPTIVLEAVASYDLWIWHAFFGPPGTLNDINVLDRSPVFDDIINGQA 270
Query: 252 LNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET-NGLSASMPTFNSLHEATRSLAVK 310
++ E L Y+ G+ YP + I G A + F EA R +
Sbjct: 271 PQVTYSVNGREYHLAYYLTDGI-YPKWATFIQSIPLPQGPKAVL--FAQRQEAVRKDVER 327
Query: 311 AFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
AF L+ + I+ DK K+ I+ C +LHN+I++
Sbjct: 328 AFGVLQARFAIVKNPALFWDKVKIGKIMRACIILHNMIVE 367
>gi|428165292|gb|EKX34290.1| hypothetical protein GUITHDRAFT_147326 [Guillardia theta CCMP2712]
Length = 387
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 74/153 (48%), Gaps = 14/153 (9%)
Query: 245 LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEAT 304
L E G L+ + E+++ R YIVG Y S ++TP E + LS +FN
Sbjct: 197 LGELGMELDARL----EKLDGR-YIVGDGEYDRESGVLTPIEGSELSEYELSFNYHLSLA 251
Query: 305 RSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID-SGDQLHPDVALSD 363
R+ +AF + W IL + + D + ++IL C LLHN I+D D+ P+ S
Sbjct: 252 RAPVERAFGMVVRRWPILWRRLKLDDYNLICNMILTCLLLHNFIMDVDKDKAEPEREFSL 311
Query: 364 HHDSGYGEQCCKQVDPMGRTTRENLEKHLQHNQ 396
H + C + P GR T EN ++ L++N
Sbjct: 312 DH------EVCGHIYPAGRLT-EN-DRPLENNN 336
>gi|356577175|ref|XP_003556703.1| PREDICTED: uncharacterized protein LOC100780651 [Glycine max]
Length = 507
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 88/172 (51%), Gaps = 12/172 (6%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RVSKKTF +C +++E G + ++ + + +++ VA+ L LA V +
Sbjct: 82 LRVSKKTFFKLCRILQE-------KGQL-VKTKNVPIDEVVAMFLHILAHNLKYRVVHFS 133
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ T+S+ + A+ + +K +LK+ + N +++K+ N GA+D HI
Sbjct: 134 YCRSMETISRQFKNVLRAIMKVSKEYLKFHEYNLEGSVENKWR---WFKNSIGALDGMHI 190
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+T+ A + + +++ + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 191 PVTVSA-EDRPRYRNKKGDISTNVLGVCGPDLRFIYVLPGWEGLAGDSRVLR 241
>gi|312377953|gb|EFR24657.1| hypothetical protein AND_10614 [Anopheles darlingi]
Length = 164
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 62/142 (43%), Gaps = 25/142 (17%)
Query: 268 YIVGGVGYPLLSWLITPYE--TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV 325
YIV YPLL WLI Y N ++ TFN ++ +AF +L+ + IL +
Sbjct: 10 YIVTEPTYPLLPWLIHDYTPLNNPITTEEQTFNDHLAKAKAAVDQAFTRLRARFGILQRK 69
Query: 326 MWRPDKRKLPSIILVCCLLHNII--------------IDSGDQLHPDVALSDHHDSGYGE 371
+ D +P I+L CC+LHNI+ + + DQ +P D
Sbjct: 70 I-DIDINFVPQILLTCCILHNILEKRKMKFEDEWIEQMTAADQKYP------QPDGSIVT 122
Query: 372 QCCKQVDPMGRTTRENLEKHLQ 393
V+ G T R+ L+ H+Q
Sbjct: 123 NYTTSVE--GETVRDQLKDHVQ 142
>gi|348681151|gb|EGZ20967.1| hypothetical protein PHYSODRAFT_298890 [Phytophthora sojae]
Length = 220
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 84/203 (41%), Gaps = 22/203 (10%)
Query: 212 GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISS-------EEVE 264
+VD + RF+DI WPG + SR+ S +L + + E +
Sbjct: 13 AVVDRDRRFLDIDVRWPGSVGDSRVFSNSPVGRLHASSFSSASGDSGAGLLQTGWFEYRK 72
Query: 265 LREYIVGGVGYPLLSWLITPYET--NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
+ +++ Y ++T YE S + N R AF K WR+L
Sbjct: 73 ITFFLLADSAYANSKHVVTTYEIAETQRSVVISKLNRKLAGMRYCVECAFRAAKMRWRLL 132
Query: 323 SK--VMWRPDKRKLPSIILVCCLLHNIIIDSGDQL-HPDVALSDHHDSGY--------GE 371
SK + R + + +P+++ C+LHN +ID D++ +P+ L H Y E
Sbjct: 133 SKPIEIARTNLKDVPTLVSSVCILHNFVIDEKDEVWNPE--LEQRHREFYFENYLRLTNE 190
Query: 372 QCCKQVDPMGRTTRENLEKHLQH 394
Q C + TR+ ++ +Q+
Sbjct: 191 QRCTENQVETSKTRDAIQHWMQY 213
>gi|397565396|gb|EJK44605.1| hypothetical protein THAOC_36845 [Thalassiosira oceanica]
Length = 518
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 88/188 (46%), Gaps = 23/188 (12%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRL----LSVEKQVAIALRRL 114
EE F+ +R+SK+ F + +V+ + P G + GR ++ ++++ALR
Sbjct: 81 EERPFRRKYRMSKEAFMTLFEIVKPHM---PDDGTLRKRGRTPNGNITKLHRLSMALRYC 137
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIKSKFEESFG 173
A G+ V + GV V W ++A+ + + +K+P++ + + F
Sbjct: 138 AGGDP-VDIADIHGVKDDEVLNSLWDVVDAIHQSTELDIKFPENEADQRRVAAGFRRKSE 196
Query: 174 L-LNCC-GAIDATHIIMTLPAVQTSDD----------WCDQENNYSMLVQGIVDHEMRFI 221
+ ++CC GAID I + P+ T D+ +C ++ + + +QG+ D F+
Sbjct: 197 INIDCCVGAIDGILIWIHKPS--TRDEKVIGIGPKKFFCGRKKKFGLNMQGVCDARGYFL 254
Query: 222 DIVTGWPG 229
D+ +PG
Sbjct: 255 DVEIRFPG 262
>gi|440577398|emb|CCI55421.1| PH01B031C15.4 [Phyllostachys edulis]
Length = 598
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 126/306 (41%), Gaps = 31/306 (10%)
Query: 103 VEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRME 162
VE+QVA+ L V F TVS+ R + A+ E L+ P
Sbjct: 37 VEEQVAMFLNITGHNLRNRLVDTDFTRSGETVSRYFGRVLHAIGELRDKLLRPPSLENPS 96
Query: 163 EIKSKFE-ESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
+I E F +C GAID THI ++ + +C +++ V VD ++RF
Sbjct: 97 KIAGNPEWYPFFKYDCVGAIDGTHIRASVGKSMEAT-FCGRKSFAIQNVTVAVDFDLRFT 155
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
++ GW G + + + L A + NG + + E++ Y V GY
Sbjct: 156 YVLAGWEGTTHDATV--------LAHAIEHENG-LCVPEEKL----YPVDA-GYGAKPGF 201
Query: 282 ITP-----YETNGLSASMP-----TFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPD 330
I P Y N +S P FN H + R +AF LK +++L + P
Sbjct: 202 IPPFRAVRYHLNEWGSSPPLSEKELFNFRHSSLRVSVKRAFGSLKRRFKVLDDATPFFPF 261
Query: 331 KRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCK--QVDPMGRTTRENL 388
++ +++ C+LHN ++ G + L H + G+Q K ++P+ R
Sbjct: 262 STQV-DVVITSCILHNWVLYQGSDISFYQRLPGHL-TPVGQQGSKLLTIEPLLTRGRLLP 319
Query: 389 EKHLQH 394
+KH +H
Sbjct: 320 KKHGKH 325
>gi|432952358|ref|XP_004085075.1| PREDICTED: putative nuclease HARBI1-like [Oryzias latipes]
Length = 249
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 18/163 (11%)
Query: 201 DQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISS 260
++++ +S+ VQ I D ++ WPG ++ SR+ + S E G
Sbjct: 77 NRKSIHSINVQIICDAAHIITNVEAKWPGSVHDSRIFRESTLSNRLECG----------- 125
Query: 261 EEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR 320
E+ +++G GYP L+TPY FN H TR+ L+ ++
Sbjct: 126 ---EIDGFLLGDRGYPCQPKLLTPYPEPE-QGPQQRFNLTHSRTRARVEMTIGLLRARFQ 181
Query: 321 ILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSD 363
L + P++ I+ C +LHNI I G+Q HP + + D
Sbjct: 182 CLLHLKVTPERAC--DFIVACVVLHNIAIFRGEQ-HPALHIQD 221
>gi|328697810|ref|XP_003240445.1| PREDICTED: hypothetical protein LOC100569619 [Acyrthosiphon pisum]
Length = 461
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 84/201 (41%), Gaps = 26/201 (12%)
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
SNR++ + +F E + + NCCGA+D H++ + + + + ++S ++ + D
Sbjct: 49 SNRVK-VAKQFYEQWQIPNCCGALDGKHVVHQVAFANSGSSNFNYKGSHSTVLPALCDAN 107
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
F + G PG C + + I SE ++ YI+ +PL
Sbjct: 108 YYFTIVDIGSPG----------------CSSDGE---PIEIDSENGKIPYYIIADEAFPL 148
Query: 278 LSWLITPYETNGLSASMP----TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
+ ++ PY G A++P FN R + F L I + + +K
Sbjct: 149 MENIMRPYTGRG-KANLPIRESVFNYRLSRVRRVIENTFGILASKCHIYRRSIIAGEK-T 206
Query: 334 LPSIILVCCLLHNIIIDSGDQ 354
+ +II +LHN I S D+
Sbjct: 207 INAIIKATVVLHNFIKISEDE 227
>gi|323448431|gb|EGB04330.1| hypothetical protein AURANDRAFT_32799 [Aureococcus anophagefferens]
Length = 399
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 107/267 (40%), Gaps = 17/267 (6%)
Query: 97 EGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQST--VSQVTWRFIEALEERAKHHLK 154
+ + + VE +V I LR LA+G S +G G++T V+ +TW E R +
Sbjct: 52 DAKTVFVEIKVMIVLRWLATGASFDLIGELSETGETTARVTCITW-VTEFARRRWTDFVY 110
Query: 155 WPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGI- 213
P+ ++++ F+ G+ ++DA H+ V T ++C + Y L +
Sbjct: 111 PPEGEELKKVMKVFDR-LGITGAAFSMDAVHVHWARCPV-TRSNFCTGKEGYPTLAFNVA 168
Query: 214 VDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ-------RLNGNVRISSEEVELR 266
V H R I + G N L+K+ F +G+ RL SEE +
Sbjct: 169 VTHSKRIIAATSAQYGARNDKTLVKYDAFIDDVRSGKSLKTQPFRLRKKDGTCSEET--K 226
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
Y + GY + L E F E+ R F +K R+L
Sbjct: 227 AYGICDGGYHRWACL-QAVEKLASDHDGAGFCKRIESVRKDVENVFGIMKMRSRVLFGKF 285
Query: 327 WRPDKRKLPSIILVCCLLHNII-IDSG 352
D++ + + CC++HN+ ID G
Sbjct: 286 LLLDQKVVSDVFHSCCIIHNMCQIDEG 312
>gi|357115572|ref|XP_003559562.1| PREDICTED: uncharacterized protein LOC100822590 [Brachypodium
distachyon]
Length = 372
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/291 (22%), Positives = 115/291 (39%), Gaps = 6/291 (2%)
Query: 63 FKYFFRVSKKTFDYICSLVRE-DLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
F++ FR+S+K F I RE D + I I G S++K ++LR LA G S
Sbjct: 23 FQHRFRMSRKLFLKIAEYFREYDDYFKLKRDAIGILG-FTSIQK-CTVSLRLLAYGISAD 80
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
+ + +ST +RF A+ + +L+ P + I ++ E G G+
Sbjct: 81 THDDYLRMAESTAIDCMYRFCRAIVAVFGEQYLRTPTAEDTARIMAQNVER-GFPGMLGS 139
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID H + + + S++++ + D ++ G G N +L+ S
Sbjct: 140 IDCMHWSWKNCPFAHQGMYKSHKGSCSVMLEAVADQDLWIWHAFFGMAGSHNDINVLQCS 199
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
F G N I+ Y+ G+ YP S + + F
Sbjct: 200 HVFAKLVEGTAPPVNYEINGHVYNKGYYLADGI-YPRWSTFVKTISNAPAGGARSWFAMQ 258
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
+ R +AF L+ + I+ K ++ +I C ++HN+II+S
Sbjct: 259 QKTCRKDVERAFGVLQARFAIVRYPALTWSKDQMWEVITACVIMHNMIIES 309
>gi|242052433|ref|XP_002455362.1| hypothetical protein SORBIDRAFT_03g009305 [Sorghum bicolor]
gi|241927337|gb|EES00482.1| hypothetical protein SORBIDRAFT_03g009305 [Sorghum bicolor]
Length = 341
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 64/150 (42%), Gaps = 12/150 (8%)
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLK-----FSGFFKLCEAGQRLNGNVRISSEEVE 264
V D +MRF +++GWPG + + K F F G+ + ++
Sbjct: 81 VMAACDFDMRFTFVMSGWPGSAHDMTVFKDAISRFGDLFPHPPTGKYYLVDSGYANRLGY 140
Query: 265 LREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSK 324
L Y G Y L + P + G TFN H + R++ +AF LK WR+L +
Sbjct: 141 LAPY--KGTKYHLQEYREGP-QPEGKEE---TFNYAHSSLRNVIERAFGVLKMKWRMLRE 194
Query: 325 VMWRPDKRKLPSIILVCCLLHNIIIDSGDQ 354
+ K II+ CC LHN I SG Q
Sbjct: 195 IPSY-STEKQSRIIVACCALHNFIRTSGIQ 223
>gi|356561027|ref|XP_003548787.1| PREDICTED: uncharacterized protein LOC100797534 [Glycine max]
Length = 309
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 88/172 (51%), Gaps = 12/172 (6%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RVSKK F +C +++E G + ++ + + +++ VA+ L LA V +
Sbjct: 82 LRVSKKAFFKLCRILQE-------KGQL-VKTKNVPIDEVVAMFLHILAHNLKYRVVHFS 133
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ T+S+ + A+ + +K +LK+ + N +++K+ N GA+D HI
Sbjct: 134 YCRSMETISRQFKNVLRAIMKVSKEYLKFHEYNLEGSVENKWR---WFKNSIGALDGIHI 190
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
++T+ A + + +++ + S V G+ ++RFI ++ GW G S++L+
Sbjct: 191 LVTVSA-EDRPRYRNRKCDISTNVLGVCGPDLRFIYVLPGWEGSAGDSQVLR 241
>gi|154281875|ref|XP_001541750.1| hypothetical protein HCAG_03848 [Ajellomyces capsulatus NAm1]
gi|150411929|gb|EDN07317.1| hypothetical protein HCAG_03848 [Ajellomyces capsulatus NAm1]
Length = 414
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 84/187 (44%), Gaps = 13/187 (6%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P+ K R+S TF + +R + R + I+ +E++V I L +
Sbjct: 63 PTSHPLRIKNMTRMSASTFIALVDWLRSNTDIRESEAIHAIK-----IEQKVLIFLYIMT 117
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEES---- 171
G S + F T+S+V +E+L +H++K P+S ME K +
Sbjct: 118 QGVSYRNTAEIFHHSLDTISKVFHEVLESLCVLYEHYVKMPESP-MEYSKKALGSNPRYW 176
Query: 172 -FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
F C GA+D TH+ +++P Q S W +++ S V DH+M F+ I G G
Sbjct: 177 PF-FKGCIGALDGTHLPISIPLNQQS-VWRNRKGWISQNVLAAYDHDMNFVYIYPGMEGS 234
Query: 231 MNVSRLL 237
+ SR+L
Sbjct: 235 SHDSRVL 241
>gi|449679794|ref|XP_004209423.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 225
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 78/181 (43%), Gaps = 46/181 (25%)
Query: 106 QVAIALRRLASGESQVSVGVAFGV-GQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEI 164
QV ALR A+G Q+ +G + + Q T+S++ R LK+P
Sbjct: 81 QVLCALRFYATGSFQIVIGDSTAILSQPTISRIIRRV----------SLKFP-------- 122
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
G ID TH+ + P + + ++ +N+ + VQ + + FIDIV
Sbjct: 123 -----------RITGLIDGTHVCIQKPR-EHEYVYVNRSSNHLINVQAVCGYNGEFIDIV 170
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
W + +R+L+ S + G+++ +L+ ++G GYP WL+TP
Sbjct: 171 AKWHESTHNARILRES------KLGKKMID---------DLKGLLLGDSGYPCFRWLLTP 215
Query: 285 Y 285
Y
Sbjct: 216 Y 216
>gi|397595562|gb|EJK56513.1| hypothetical protein THAOC_23584, partial [Thalassiosira oceanica]
Length = 242
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 92/234 (39%), Gaps = 22/234 (9%)
Query: 131 QSTVSQVTWRFIEALEERAKHHLKWPDSNRME--EIKSKFEE--SFGLLNCCGAIDATHI 186
Q Q W ++A+ + + +K+P+S R E EI F+ S G+ C GAID I
Sbjct: 4 QRGAQQAFWEVVDAIHQTSSLDIKFPES-RFEQAEIARGFKRKSSIGIDCCVGAIDGILI 62
Query: 187 IMTLPAVQTSDD--------WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+ P+ ++ +C ++ Y + +QG+ D F+D+ +PG +
Sbjct: 63 WIHKPSDSDAESIGFGPIKFFCGRKKKYGLNMQGVCDANGYFLDVEIRFPGAASDFYAFD 122
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFN 298
S K E L IS + G Y +++ P+ G A +N
Sbjct: 123 ESALKKKLEESGFLKS---ISGSSFATTYCLFGDNAYVQSPYMVVPWRKVGAGAK-DAYN 178
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDK---RKLPSIILVCCLLHNIII 349
R AF L W +L K + P+ + ++L C LHN I
Sbjct: 179 FYQSQLRINIECAFGMLVHRWGMLRKPV--PNNITVLRTTRLVLALCKLHNYCI 230
>gi|156546775|ref|XP_001605652.1| PREDICTED: putative nuclease HARBI1-like [Nasonia vitripennis]
Length = 363
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 86/192 (44%), Gaps = 20/192 (10%)
Query: 101 LSVEKQVAIALRRLASGESQVSVG--VAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
+ V QV LR LA G Q V +GQST S+ + A+ A ++ +P +
Sbjct: 74 IPVHLQVLSVLRFLAEGSYQKGVAHDCYHPMGQSTFSKYLHEVVPAIVRLADEYIVFPRN 133
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD----WCDQENNYSMLVQGI 213
+E+ +FEES G GAID TLP + T + + + ++S+ VQ +
Sbjct: 134 MEERQEMSRRFEESLGFPGVIGAIDC-----TLPKIHTPHEHEEAYVNHNQDHSLNVQVV 188
Query: 214 VDHEMRFIDI-VTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
D + ++I +T G N + + S ++ E +L N E+ + Y++G
Sbjct: 189 SDMDYNILNIRITN--GSSNDKFVWRHS---QMRETMYQLRNNP--DPAEINNQYYLIGD 241
Query: 273 VGYPLLSWLITP 284
GY L+TP
Sbjct: 242 GGYTPSPVLLTP 253
>gi|331247932|ref|XP_003336592.1| hypothetical protein PGTG_17903 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 508
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 64/272 (23%), Positives = 114/272 (41%), Gaps = 31/272 (11%)
Query: 106 QVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRM 161
Q+A+A+ L S G + + F VG T+ T R I A+ + WP S R+
Sbjct: 73 QIAVAVCCLGSNGNGSAIYRLKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTKSERI 132
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
E + EE F C G +D T I ++ + + D++ YS+ V I D +FI
Sbjct: 133 ESSQVMREEGFP--GCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFI 190
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
+ G+PG + + + F + Q+ + + +++++ Y ++
Sbjct: 191 SYLAGFPGSSHDAYV------FSHMQVAQQ-------PEKYFDRKQFLLADSAYTNDCYV 237
Query: 282 ITPYETNG-LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI--- 337
+ ++ L+ FN +R A LKG + L ++ R R ++
Sbjct: 238 VPAFKGKQLLNRRNIDFNYHLAQSRVRIEHAIGILKGRFASLREI--RTQIRNAEAMKGA 295
Query: 338 ---ILVCCLLHNIIIDSGDQ---LHPDVALSD 363
I+ C +LHN++ D DQ L+ D A D
Sbjct: 296 VKWIVTCIVLHNLLADLKDQWNDLYEDEAPED 327
>gi|193641226|ref|XP_001951909.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 329
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/305 (21%), Positives = 122/305 (40%), Gaps = 52/305 (17%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
FK FR+SK+TF + ++ + + R S LS Q+ IALR A+G Q+
Sbjct: 43 FKCRFRLSKETFMILLHMIGDSIKHRSSS---------LSPVLQLLIALRYYATGAFQMV 93
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAI 181
+G V +STV +V + H +++P + ++ F GA+
Sbjct: 94 LGDHIQVHKSTVYRVIKTVSTEIARLRPHFIEFPSTVAEQRRVQLGFFRLHQFPRVIGAL 153
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D THI + P + + +++ GWPG ++ S +
Sbjct: 154 DCTHIRIQSPKSDIGEQFRNRK----------------------GWPGSVHDSTIFD--- 188
Query: 242 FFKLCEAGQRLNGNVRISSEEVEL-REYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
N +R E E +++G GYP ++ +TP N + + +
Sbjct: 189 -----------NSLIRAKFENNEFGNTFLLGDGGYPCRNYHLTPL-LNPRTEAERKYQKA 236
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVA 360
+R++ F LK + +L+ V R + I+ +L+N ++ D++ ++
Sbjct: 237 QIGSRNVVEGLFGVLKRRFPVLA-VGIRTKLSTTMATIVATAVLYNFLLKEKDEIPQEI- 294
Query: 361 LSDHH 365
D+H
Sbjct: 295 --DNH 297
>gi|390359099|ref|XP_003729409.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 287
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/226 (23%), Positives = 99/226 (43%), Gaps = 21/226 (9%)
Query: 126 AFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDAT 184
A+ V +++S+V +EA+ EE L+ P + + G G
Sbjct: 4 AWRVPHNSISKVVREVVEAIVEEYVDELLRCPTNEQ------------GWRQLIG----K 47
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
H+ PA + + + + YS+L+ +VD + +FI I G + +++ S
Sbjct: 48 HVACKAPA-NSGSTYYNYKGFYSILIFAMVDADYKFIYIDASSKGSASDAQIYNASDLKD 106
Query: 245 LCEAGQRLN--GNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
E + G + ++ ++ +IVG + L ++L+ PY + L+ FN
Sbjct: 107 GLERNLIMGFPGPDPLPNDTQDVPYFIVGDDAFSLRAYLMKPYSSRYLAREERIFNYRLS 166
Query: 303 ATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
R + AF L ++IL M + D + + SI+ CC+LHN++
Sbjct: 167 RARRVVENAFGILANRFQILLTTM-QHDPQTVKSIVEACCILHNLM 211
>gi|147798741|emb|CAN63216.1| hypothetical protein VITISV_033852 [Vitis vinifera]
Length = 660
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 125/298 (41%), Gaps = 54/298 (18%)
Query: 93 LINI---EGRLLS-----VEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA 144
L+NI GRL + VE+QVA L + ++ F T+S+ + + A
Sbjct: 168 LVNILRGTGRLRNNAHSNVEEQVAKFLHIVDHNLRNRTMKFYFKRSSETISRHFHQVLRA 227
Query: 145 LEERAKHHLKWPDSNRM-EEIK--SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCD 201
+ LK PD + +EIK +KF F +C GAID +H V+ S+D
Sbjct: 228 IISLDDVFLKQPDGLKCPQEIKDNTKFWPYFK--DCIGAIDGSHF-----RVKVSNDVVH 280
Query: 202 Q---ENNY-SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR 257
+ N Y + V +++F ++ GW G + SR+L N VR
Sbjct: 281 RYRGRNYYPTQNVLAACSFDLKFTYVLPGWEGSASDSRILD--------------NALVR 326
Query: 258 -ISSEEVELREYIVGGVGYPLLSWLITPYETNGL-----SASMP-----TFNSLHEATRS 306
V +Y + G+ L +TPY + S P FN H + R+
Sbjct: 327 DFDKLIVPQGKYYLADAGFQLKIGFLTPYRSTRYHLKEYSVHQPENDREVFNLRHSSLRN 386
Query: 307 LAVKAFLQLKGGWRIL---SKVMWRPDKRKLPSIILVCCLLHNII--IDSGDQLHPDV 359
+AF LK + I+ ++ + D + IIL CC+LHN + +D ++L +V
Sbjct: 387 AIERAFGVLKKRFPIIASGTEPHYPVDTQS--DIILACCILHNYLMGVDPDERLIAEV 442
>gi|449681715|ref|XP_004209905.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 242
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 83/185 (44%), Gaps = 23/185 (12%)
Query: 118 ESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPDSNRMEEIKSKFEE 170
+SQ + F V ++TV + W+ ++ + RA + +K + I +F++
Sbjct: 60 KSQQTQSFYFRVCRATVCHIIEETCCAIWKVLKKVFLRAPNDVK-----EWQNIIKEFDQ 114
Query: 171 SFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
++ L C GAID H+ + PA ++ + + + YSM++ I D + F + G G
Sbjct: 115 NWNFLQCIGAIDGKHVRIKAPA-KSGPSFYNYKGFYSMVLLAICDAKYCFTMVDIGAYGR 173
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNV----RISSEEVELREYIVGGVGYPLLSWLITPYE 286
N + +L S F G+ N +IS + ++ VG + L WL+ PY
Sbjct: 174 DNDAAILNASTF------GRAFNKGYFNLPKISEFDPKVPPVFVGDDIFALKPWLMKPYP 227
Query: 287 TNGLS 291
L+
Sbjct: 228 GKNLT 232
>gi|24899402|gb|AAN65002.1| Hypothetical protein [Oryza sativa Japonica Group]
gi|108707938|gb|ABF95733.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 807
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 70/177 (39%), Gaps = 45/177 (25%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID TH+ +P+ + C ++ + V + D +MRF ++ GWPG
Sbjct: 624 DCIGAIDGTHVPCVVPSNKFVQHLC-RKGMTTQNVMAVCDFDMRFTFVLAGWPG------ 676
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+GN+ +S+ +G + + L S +MP
Sbjct: 677 -----------------SGNITLST---------LGTLTGQVALLLTREPSIIFRSIAMP 710
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
L K LK WR+L K+ D RK II+ CC LHN I SG
Sbjct: 711 -----------LNHKVIGVLKMKWRMLEKIP-SYDPRKQAQIIVACCALHNFIRKSG 755
>gi|77554035|gb|ABA96831.1| transposon protein, putative, Pong sub-class [Oryza sativa Japonica
Group]
Length = 511
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/306 (22%), Positives = 128/306 (41%), Gaps = 25/306 (8%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVR---EDLVSRPPSGLINIEGRL-LSVEKQVAIAL 111
P+ + F++ FR+ + F I + + E V + N+ G L LS ++V
Sbjct: 135 PTYLDVQFRHRFRMRRPLFLRIMNAIEAHDEYFVQKR-----NVAGVLGLSCLQKVTAVS 189
Query: 112 RRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWPDSNRMEEIKSKFEE 170
R L G + +V +G STV + RF+ A++E + +L+ P+ N + + ++ EE
Sbjct: 190 RMLCYGIAADAVDEYVRIGGSTVIESFNRFVVAIDEVFGEEYLRTPNENDIARLLAQGEE 249
Query: 171 SFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGG 230
G G++D H + +E +++++ + + F G PG
Sbjct: 250 R-GFPGMLGSLDCMHWQWKNCPTAWQGQYKGKEGVPTIVLEAVASRDRWFWHAFFGSPGS 308
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
N +L+ S F GQ N I+ + + Y+ G+ YP + L+
Sbjct: 309 HNDINVLQRSPLFAKLAEGQAPKVNYSINGHDYTMGYYLADGI-YPSWATLVK------- 360
Query: 291 SASMPT------FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
S +P F+ EA R +AF L+ + I+ L I+ C ++
Sbjct: 361 SIKLPQGNKRKYFSKAQEAARKDVEQAFGVLQARFAIVRGPARFWKALTLNQIMRACIIM 420
Query: 345 HNIIID 350
+N+I++
Sbjct: 421 NNMIVE 426
>gi|15238244|ref|NP_199013.1| uncharacterized protein [Arabidopsis thaliana]
gi|9757943|dbj|BAB08431.1| unnamed protein product [Arabidopsis thaliana]
gi|28416671|gb|AAO42866.1| At5g41980 [Arabidopsis thaliana]
gi|110743283|dbj|BAE99532.1| hypothetical protein [Arabidopsis thaliana]
gi|332007366|gb|AED94749.1| uncharacterized protein [Arabidopsis thaliana]
Length = 374
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/297 (22%), Positives = 117/297 (39%), Gaps = 41/297 (13%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ K F +C L++ GL+ R+ +E Q+AI L + +V
Sbjct: 46 FRMDKPVFYKLCDLLQ-------TRGLLRHTNRI-KIEAQLAIFLFIIGHNLRTRAVQEL 97
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR--MEEIKSKFEESFGLLNCCGAIDAT 184
F T+S+ + A+ +K + P+SN +E F++ C G +D+
Sbjct: 98 FCYSGETISRHFNNVLNAVIAISKDFFQ-PNSNSDTLENDDPYFKD------CVGVVDSF 150
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIV---DHEMRFIDIVTGWPGGMNVSRLLKFSG 241
HI P + D+ N +L Q ++ ++RF ++ GW G + ++L +
Sbjct: 151 HI----PVMVGVDEQGPFRNGNGLLTQNVLAASSFDLRFNYVLAGWEGSASDQQVLNAAL 206
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE---TNGLSASMPTFN 298
R + +V +Y + YP L I PY TN + FN
Sbjct: 207 --------------TRRNKLQVPQGKYYIVDNKYPNLPGFIAPYHGVSTNSREEAKEMFN 252
Query: 299 SLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
H+ + F LK + IL P + ++ +I C L + + ++ D L
Sbjct: 253 ERHKLLHRAIHRTFGALKERFPILLSAPPYPLQTQVKLVIAACALHNYVRLEKPDDL 309
>gi|81051977|gb|ABB55331.1| Zinc transporter, putative [Asparagus officinalis]
Length = 662
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/298 (22%), Positives = 123/298 (41%), Gaps = 23/298 (7%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ + F +C E+L S+ GL R ++V ++V I + +A G S V
Sbjct: 175 FRMESRVFLQLC----EELQSK--YGLT--PSRNMTVVEKVGIFVYTIALGLSNRDVCER 226
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKW------PDSNRMEEIKSKFEESFGLL----N 176
F T+S+ + +EA+ R K ++ P + I + + +
Sbjct: 227 FQRSGETISRTITQVLEAICGRNKGYMGLARDVIQPKDRNFQFIPPQIANDTRYMPYFKD 286
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C G ID TH+ +P + +++ + V + D ++ F + W G + +R+
Sbjct: 287 CIGCIDGTHVAACIPEADQL-RYRGRKDIPTFNVMAVCDFDICFTFLSVAWEGSAHDTRV 345
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPT 296
+ G + + R + V YP + + + ++ + + T
Sbjct: 346 FLHAINTPTMNFPHPRQGQYYLVDKGYPDRLGYL--VPYPKIRYHQSQFQREAPTNAKET 403
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII-IDSGD 353
FN H + RS ++F LK W+IL ++ K ++ II+ LHN I I+S D
Sbjct: 404 FNRAHSSLRSCIERSFGVLKKRWKILHQMPQYSVKTQI-DIIMAAFALHNYIRINSVD 460
>gi|357140604|ref|XP_003571855.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 211
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/222 (23%), Positives = 93/222 (41%), Gaps = 39/222 (17%)
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
+F+ F +C GAID TH+ +PA + + ++ ++ V VD ++ F + G
Sbjct: 18 RFDPYFK--DCIGAIDGTHVPCHVPA-RIVHRFHGRKPFHTQNVLVAVDFDLLFTYVSAG 74
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
W G + S +L+ S + NG +R+ + Y + GY + P+
Sbjct: 75 WEGSAHDSTVLRHSL--------EHPNG-LRVLEGK-----YYLADAGYVARRGFLPPFR 120
Query: 287 -----------TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
N FN H + R+ +AF LK ++ L+ + P ++
Sbjct: 121 QTQYHLREWRGNNKPRTPNELFNLRHSSFRTTVERAFGTLKNRFKFLTTRPYYPFPSQV- 179
Query: 336 SIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCKQV 377
+++ CC+LHN I+ +G PD + Y EQ C +
Sbjct: 180 RVVIACCILHNXILLNGG---PDELV-------YSEQNCNDL 211
>gi|331227576|ref|XP_003326456.1| hypothetical protein PGTG_07434 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 277
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/236 (22%), Positives = 99/236 (41%), Gaps = 25/236 (10%)
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATH 185
F VG T+ T R I A+ + WP +S R+E + EE G C G +D T
Sbjct: 19 FQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTESERIESSQVMREE--GFPGCVGFVDGTT 76
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
I ++ + + D++ YS+ V I D +FI + G+PG + + + F
Sbjct: 77 IPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKFISYLAGFPGSSHDAYV------FSH 130
Query: 246 CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATR 305
+ Q+ + + +++++ Y ++I ++ L + H A
Sbjct: 131 MQVAQQ-------PEKYFDRKQFLLADSAYTNDRYVIPAFKGKHLLKRRNINFNYHLAQS 183
Query: 306 SLAVK-AFLQLKGGWRILSKVMWRPDKRKLPSI------ILVCCLLHNIIIDSGDQ 354
+ ++ A LKG + L ++ R R + I+ C +LHN++ D DQ
Sbjct: 184 RVRIEHAIGILKGRFASLREI--RTQIRNAEEMKGAVKWIVTCIVLHNLLADLKDQ 237
>gi|253761486|ref|XP_002489122.1| hypothetical protein SORBIDRAFT_0020s002020 [Sorghum bicolor]
gi|241947263|gb|EES20408.1| hypothetical protein SORBIDRAFT_0020s002020 [Sorghum bicolor]
Length = 351
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 81/192 (42%), Gaps = 27/192 (14%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID H++ +P + ++ + + V VD +++F ++ GW G + +
Sbjct: 33 DCIGAIDGNHVLARVPR-HMQQAFMGRKKDPTQNVMVAVDFDLKFTYVLAGWEGSAHDAH 91
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE--------- 286
+L +A +R +G V ++ + GY + + PY
Sbjct: 92 ILS--------DAIEREDGFT------VPQGKFYLVDAGYACRNGFLPPYRGVRYHLTKF 137
Query: 287 --TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLL 344
TN + + FN H + R +A LK +RIL + K ++ ++L C +L
Sbjct: 138 GGTNRSTNARELFNLRHSSLRVTVERAIGALKNRFRILYNKPFHRYKTQV-RLVLACAIL 196
Query: 345 HNIIIDSGDQLH 356
HN II G H
Sbjct: 197 HNWIISFGTDEH 208
>gi|242067561|ref|XP_002449057.1| hypothetical protein SORBIDRAFT_05g004203 [Sorghum bicolor]
gi|241934900|gb|EES08045.1| hypothetical protein SORBIDRAFT_05g004203 [Sorghum bicolor]
Length = 440
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 64/298 (21%), Positives = 121/298 (40%), Gaps = 13/298 (4%)
Query: 63 FKYFFRVSKKTFDYICSLV---REDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
F+ FR+SK F I + E R + +N +G LS ++ A+R+LA+G +
Sbjct: 85 FRRRFRMSKPLFQRIVDALGQWSEYFTQRVDA--VNRQG--LSPLQKCTAAIRQLATGSA 140
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCC 178
+ +G++T + F++ + E + +L+ P E + K E G
Sbjct: 141 ADQLDDYLKIGETTAMEALKNFVKGVREIFGERYLRRPTMEDTERL-LKLGERRGFPGMF 199
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNY-SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
G+ID H V + + +++++ + H++ G G N +L
Sbjct: 200 GSIDCMHWQWERCPVAWKGQFTRGDQKVPTLILEAVASHDLWIWHAFFGAAGSNNDINVL 259
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
S F GQ ++ + + ++ G+ YP + + ++ +
Sbjct: 260 NQSTVFINELKGQAPRVQYMVNGNQHNIGYFLADGI-YPEWAVFVKSIRL-PITEKEKLY 317
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII-DSGDQ 354
E R +AF L+ W IL + D+ L ++L C +LHN+I+ D DQ
Sbjct: 318 AQEQEGARKDIERAFGVLQRRWCILKRPARLYDRAVLRDVVLACIILHNMIVEDEKDQ 375
>gi|357129235|ref|XP_003566271.1| PREDICTED: uncharacterized protein LOC100825708 [Brachypodium
distachyon]
Length = 432
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/250 (22%), Positives = 104/250 (41%), Gaps = 10/250 (4%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWP---DSNR 160
++V + R LA G SV + QST+ + +F+ A+ E +L+ P D+ R
Sbjct: 123 QKVVGSFRMLAYGVPVDSVDEIVSLAQSTMIDIFKKFVRAVVEIYGDQYLRAPNVEDTKR 182
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
+ + S + G G+ID H + + +M+++ + +
Sbjct: 183 LIAMNS----ARGFHGMLGSIDCMHWRWKNCPTAWKGQYTGHVDGPTMILEAVASQYLWI 238
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
G PG +N +L+ S F+ +G ++ + + Y+ G+ YP S
Sbjct: 239 WHSFFGLPGSLNGINVLQRSPLFQKLTSGTAPELEYVVNGNKYTMGYYLADGI-YPSWST 297
Query: 281 LITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILV 340
+ +N F+++ EA R +AF L+ + ++ DK + I+
Sbjct: 298 FVKTI-SNPEGNKKKHFSTMQEAVRKDVERAFGVLQARFAMVRGPARWWDKETIWYIMTA 356
Query: 341 CCLLHNIIID 350
C +LHN+IID
Sbjct: 357 CVILHNMIID 366
>gi|356566108|ref|XP_003551277.1| PREDICTED: uncharacterized protein LOC100797937 [Glycine max]
Length = 604
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 86/177 (48%), Gaps = 22/177 (12%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLS-----VEKQVAIALRRLASGESQV 121
RVSKK F +C +++E +G+L+ +++ VA+ L LA
Sbjct: 82 LRVSKKAFFKVCRILQE-------------KGQLVKTKNDPIDEVVAMFLHILAHNLKYR 128
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
V ++ T+S+ + A+ + +K +LK+ + N +++K+ N GA+
Sbjct: 129 VVHFSYCRSMETISRQFKNVLRAIMKVSKEYLKFHEYNIEGSVENKWR---WFKNSIGAL 185
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
D HI +T+ A + + +++ + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 186 DGIHIPVTVSA-EDRPRYRNRKGDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLR 241
>gi|302787246|ref|XP_002975393.1| hypothetical protein SELMODRAFT_103242 [Selaginella moellendorffii]
gi|300156967|gb|EFJ23594.1| hypothetical protein SELMODRAFT_103242 [Selaginella moellendorffii]
Length = 81
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 42/70 (60%), Gaps = 1/70 (1%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGV 125
FF+VSK+ F IC LV + + P L NI R + V KQVAI LRRLA G+S S +
Sbjct: 12 FFKVSKEQFMTICELVALLMQTTLPPNLKNIPSRYVLVGKQVAIVLRRLALGDSFTSSLI 71
Query: 126 A-FGVGQSTV 134
FGV ++ V
Sbjct: 72 EHFGVSKAIV 81
>gi|296084827|emb|CBI27709.3| unnamed protein product [Vitis vinifera]
Length = 217
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 81/177 (45%), Gaps = 19/177 (10%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ K F +C +R+ + R +G++ +E+Q+AI L + E +
Sbjct: 2 FRMDKHVFHKLCDTLRQRGMLRDTAGVM--------IEEQLAIFLNIIGHNERNRVIQER 53
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEI--KSKFEESFGLLNCCGAIDAT 184
F T+S+ ++A++ ++ L+ P + EI ++F F +C G ID
Sbjct: 54 FQHSGETISRHFNNVLKAIKSLSREFLQPPPLSTPPEILRNNRFYPYFK--DCIGVIDGM 111
Query: 185 HIIMTLPAVQTSDDWCDQENNYSMLVQGIV---DHEMRFIDIVTGWPGGMNVSRLLK 238
HI +PA D N +L Q ++ +++FI + GW G + SR+L+
Sbjct: 112 HIPAHVPA----KDQSRFRNRKGILSQNVLAACTFDLQFIFVYPGWEGSVADSRVLR 164
>gi|449680571|ref|XP_004209619.1| PREDICTED: uncharacterized protein LOC101240863 [Hydra
magnipapillata]
Length = 197
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 83/173 (47%), Gaps = 13/173 (7%)
Query: 208 MLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVR-ISSEEVELR 266
M VQ + D+ F+D V WPG ++ S++ S + + L G + + ++
Sbjct: 1 MNVQAVCDYYGMFMDEVCVWPGSVHDSKVFANSSINRKLR-NRSLPGTFQSVLHGSEKIG 59
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILS-KV 325
YI+GG YPL + I Y+T + + FNS+ + R+ AF +LK W IL+ K+
Sbjct: 60 TYIIGGPVYPLTPFCIKEYQTCRNNEQV-VFNSMLRSARNPIECAFWRLKARWAILTRKI 118
Query: 326 MWRPDKRKLPSIILVCCLLHNI------IIDSGDQLHPDVALSDHHDSGYGEQ 372
++ + +P I+ +LHN ++D D ++ V L ++ Y Q
Sbjct: 119 DFKLE--LVPKIVYATFVLHNFCERHKTLVDY-DSVNKQVQLIKDNEETYINQ 168
>gi|357167383|ref|XP_003581136.1| PREDICTED: uncharacterized protein LOC100846779 [Brachypodium
distachyon]
Length = 418
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/250 (22%), Positives = 101/250 (40%), Gaps = 7/250 (2%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEE 163
++ ++LR LA G + + +ST +RF A+ + +L+ P +
Sbjct: 114 QKCTVSLRLLAYGIPADTQDDYLRMAKSTAINCMYRFCRAIVAVFGEQYLRTPTAEDTAR 173
Query: 164 IKSK-FEESF-GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
I ++ E F G+L G+ID H + + + S++++ +VD ++
Sbjct: 174 IMAQNAEREFPGML---GSIDCMHWSWKNCPFAHQSMYKSHKGSCSVVLEAVVDQDLWIW 230
Query: 222 DIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWL 281
G G N +L+ S F G N I+ R Y+ G+ YP S
Sbjct: 231 HAFFGMAGSHNDINVLQCSNVFAKLVEGTAPPVNYEINGHVYNKRYYLADGI-YPRWSTF 289
Query: 282 ITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVC 341
+ + F HEA R +AF L+ + I+ K ++ ++ C
Sbjct: 290 VKTISNAPAGGARSWFAMQHEACRKDVERAFGVLQARFAIVRHPALTWSKDQMWEVMTAC 349
Query: 342 CLLHNIIIDS 351
++HN+II+S
Sbjct: 350 VIMHNMIIES 359
>gi|331248209|ref|XP_003336729.1| hypothetical protein PGTG_17984 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 395
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 85/192 (44%), Gaps = 11/192 (5%)
Query: 45 AKNSST---PGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLL 101
A+NS+T P + +E F ++ + TF +LV+ L+ + P N
Sbjct: 51 ARNSTTRFDPRYDLSELQELPELSFVQMFRMTFPCFLNLVQ--LIKQNPIFYNNSRNPQR 108
Query: 102 SVEKQVAIALRRLAS---GESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-D 157
Q+A+A+ RL S G + + F VG + T + I A+ + WP +
Sbjct: 109 DPPIQIAVAVCRLGSNGNGSAIYRLKNLFQVGFGAIDLYTRQVIHAVYGLQSSLVTWPTE 168
Query: 158 SNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHE 217
S R+E + EE G C G +D T I ++ + + D++ YS+ V I D
Sbjct: 169 SERIESSQVMREE--GFPGCVGFVDGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDIN 226
Query: 218 MRFIDIVTGWPG 229
+FI + G+PG
Sbjct: 227 KKFISYLAGFPG 238
>gi|356545235|ref|XP_003541050.1| PREDICTED: uncharacterized protein LOC100782976 [Glycine max]
Length = 380
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 87/172 (50%), Gaps = 12/172 (6%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RVSKK F +C +++E G + ++ + + +++ VA+ L LA V +
Sbjct: 82 LRVSKKEFFKLCRILQE-------KGQL-VKTKNVPIDEAVAMFLHILAHNLKYRVVHFS 133
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ T+S+ + A+ + +K +LK+ + N +++K+ N GA+D HI
Sbjct: 134 YCRSMETISRQFKNVLRAIMKVSKEYLKFHEYNIEGSVENKWR---WFKNSIGALDGIHI 190
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+T+ A + + +++ + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 191 PVTVSA-EDRPRYRNRKGDISTNVLGVCGPDLRFIYVLPGWEGSAGDSRVLR 241
>gi|397602790|gb|EJK58269.1| hypothetical protein THAOC_21626 [Thalassiosira oceanica]
Length = 545
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 147/378 (38%), Gaps = 58/378 (15%)
Query: 2 APPKKSKSKKTKKVTKKLKKSKSTSAAPKDPNSSDSDWWPSFWAKNSSTPGATIPSDEEE 61
+PPKK ++KK+ + ++ + PKD +WA + A P + +
Sbjct: 69 SPPKKRRAKKSVLMARRTDDGELEPILPKD---------TVWWAVYVNNFIADDPRSQAK 119
Query: 62 GFKYFFRVSKKTFDYICSLV----REDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASG 117
F+ FR+ F C LV + +L SR G I +E + +LR L G
Sbjct: 120 -FRNRFRLPHANF---CELVLDCEQNELFSR-WLGKDAIGRESSPIELLILGSLRYLGRG 174
Query: 118 ESQVSVGVAFGVGQSTVSQVTWRFIE----ALEERAKHHLKWPDSNRMEEIKSKFE-ESF 172
+ + A + + +FIE L ER ++ P +N E ++ E E
Sbjct: 175 WTFDDLEEATAISREVHRVFFHQFIEFGSTVLFER---YVNTP-TNAEEARRNMLEFEMA 230
Query: 173 GLLNCCGAIDATHII-----MTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
GL G+ D TH++ L V + + + V+H R + G
Sbjct: 231 GLNGGLGSADGTHVVTEGCEYNLKQVHKG----GKTDQTTRTYNTTVNHRRRVLHTTKGG 286
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELRE---------------YIVGG 272
PG N +++F F +C G R NG + E EL E Y++
Sbjct: 287 PGRWNDKTIVRFDDF--IC--GIR-NGTT-LEDIEFELLEMGPDGEVITVKYQGAYVIVD 340
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
GY S + P++ + L ++ E+ R F LKG WRIL +
Sbjct: 341 NGYLRWSCTVPPFKVS-LDQREIRWSKWVESMRKDVECTFGILKGRWRILKTGIRVEGVD 399
Query: 333 KLPSIILVCCLLHNIIID 350
+ I L CC LHN ++D
Sbjct: 400 AVDKIWLTCCALHNWLLD 417
>gi|348681159|gb|EGZ20975.1| hypothetical protein PHYSODRAFT_382702 [Phytophthora sojae]
Length = 136
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 67/137 (48%), Gaps = 6/137 (4%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA 115
P E F+ FRV++ +F + L+ E + + + G+ L E ++A+ L +
Sbjct: 5 PQMNEGEFRANFRVTRASFKRLL-LIVEPYIKKETTNF----GKPLPAEVRLAVFLFYVG 59
Query: 116 SGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR-MEEIKSKFEESFGL 174
G S + FG+G+ST S + A+ + +++P + + ++ +FE+ +G+
Sbjct: 60 QGCSLHQLRSQFGIGKSTASGIVHDVSSAIVSQMSGLVRFPTGRQALMKLAMRFEDGYGI 119
Query: 175 LNCCGAIDATHIIMTLP 191
C GA+D HI + P
Sbjct: 120 PGCVGALDGCHIPIVQP 136
>gi|356560691|ref|XP_003548623.1| PREDICTED: uncharacterized protein LOC100801431 [Glycine max]
Length = 469
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 85/172 (49%), Gaps = 12/172 (6%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
RVSKK F +C +++E G + ++ R + + + VA+ LA + +
Sbjct: 82 LRVSKKAFFKLCRILQE-------KGKL-VKTRNVPIAEVVAMFFHILAHNLKYRVMHFS 133
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ +S+ + A+ + +K +LK+ D N +++K+ N GA+D HI
Sbjct: 134 YCRSMEIISRQFKNVLRAIMKISKEYLKFHDYNLEGSVENKWR---WFKNSIGALDGIHI 190
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
++T+ A + + +++ + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 191 LVTV-ATEDRPRYRNKKCDISTNVLGVCGPDLRFIYVLPGWQGSTGDSRVLR 241
>gi|356507176|ref|XP_003522346.1| PREDICTED: uncharacterized protein LOC100809302 [Glycine max]
Length = 531
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 95/237 (40%), Gaps = 32/237 (13%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDA 183
FG Q S+ + ++AL A + P S +I+ ++F F +C GAID
Sbjct: 250 TFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPYFK--DCIGAIDG 307
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI ++ S + D+ N S V + ++ F+ +++GW G + S++
Sbjct: 308 THIPASVKGRDVS-SYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDSKV------- 359
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLS 291
L +A R NG +V +Y + G+P + PY +N
Sbjct: 360 -LSDALARKNG------LKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHSNDPE 412
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 413 NEKELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQ-AELVLACAALHNFL 468
>gi|357116984|ref|XP_003560256.1| PREDICTED: uncharacterized protein LOC100824549 [Brachypodium
distachyon]
Length = 764
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 95/239 (39%), Gaps = 39/239 (16%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+QVA L + +G F VS+ + A+ E +++ P
Sbjct: 289 VSVEEQVATFLNTIGHNLRNRVIGANFDRSNERVSRYFRLVLHAIGELRAEYIRPPSLET 348
Query: 161 MEEI--KSKFEESFGLLNCCGAIDATHIIMTL-----PAVQTSDDWCDQENNYSMLVQGI 213
+I K +F+ F +C GAID TH+ ++ P+ + + Q V
Sbjct: 349 PAKIAGKPRFDPYFK--DCIGAIDGTHVRASVSKDMEPSFRRRKSFPTQN------VMAA 400
Query: 214 VDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGV 273
VD ++RF ++ GW G + + + L +A +R NG V ++ +
Sbjct: 401 VDFDLRFTYVLAGWEGTAHDATI--------LADAIERENGMY------VPQGKFYLVDA 446
Query: 274 GYPLLSWLITPYE----------TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRIL 322
GY + P+ +N + FN H + R +AF LK ++IL
Sbjct: 447 GYGAKPEFLPPFRAMRYHLNEWGSNPVQNENELFNLRHSSLRVTVERAFRTLKRRFKIL 505
>gi|87240807|gb|ABD32665.1| Terpenoid cylases/protein prenyltransferase alpha-alpha toroid;
Bacterial adhesion [Medicago truncatula]
Length = 426
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/360 (22%), Positives = 141/360 (39%), Gaps = 20/360 (5%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYIC-SLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRL 114
P+ ++ F+ +R+ K F I L D N EG +S + A+R L
Sbjct: 63 PTYDDAMFRRRYRMQKHVFLRIVGDLSSSDNYFTQRVDAANKEG--ISPLAKCTTAMRML 120
Query: 115 ASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKH-HLKWPDSNRMEEIKSKFEESFG 173
A G + +V +G +T + RF + + + +L+ P + ++ I E G
Sbjct: 121 AYGVAADAVDEYIKIGGTTALECLRRFCKGIIRLYEQVYLRAPTQDDLQRI-LHVSEMRG 179
Query: 174 LLNCCGAIDATHI-IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMN 232
L G+ID H P + +++++ H++ G PG +N
Sbjct: 180 FLGMIGSIDCMHWEWKNCPKAWEGQFTRGDKGTTTVILEADASHDLWIWHAFFGCPGTLN 239
Query: 233 VSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
+L S F E G+ N ++ + Y+ G+ YP + ++ L
Sbjct: 240 DINVLDRSPVFDDVEQGKTPRVNYFVNQRPYNMTYYLADGI-YPSYPTFV---KSIRLPQ 295
Query: 293 SMPT--FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII- 349
S P F E+ R +AF L+ ++I+ + D L I+ C +LHN+I+
Sbjct: 296 SEPDKLFAKHQESCRKDIERAFGVLQARFKIIREPARLWDIADLGIIMRSCIILHNMIVE 355
Query: 350 ---DSGDQLHPDVALSDHHDSGYGEQCCKQVDPMG----RTTRENLEKHLQHNQEKDLVK 402
D+ Q D S+ S + +V P R E + ++ H + DLVK
Sbjct: 356 DERDTYAQRWTDFEQSEGSGSSTQQPYSTEVLPTFANHVRARSELRDPNVHHELQADLVK 415
>gi|403177916|ref|XP_003336354.2| hypothetical protein PGTG_17962 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375173261|gb|EFP91935.2| hypothetical protein PGTG_17962 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 127
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 57/117 (48%), Gaps = 20/117 (17%)
Query: 75 DYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTV 134
D +C +++DL+ +G LSVE QVA+ L RL G S V++G F VG+ T
Sbjct: 11 DELCGELQQDLLG---------QGVPLSVEAQVAVGLYRLGHGTSYVTIGHVFKVGKETA 61
Query: 135 SQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
+ + R+ L+ +N+ + IK FE G+ AI TH+ + +P
Sbjct: 62 DKASGRY-PPLDR----------TNQWDVIKESFECCCGIPKVVRAIYGTHVPIVIP 107
>gi|356507066|ref|XP_003522292.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 401
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 116/297 (39%), Gaps = 38/297 (12%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ K F +C ++ + + R +S+ +QV + L L+ S +
Sbjct: 77 FRMKKLVFLELCDILETKYNLK--------KTRNVSIYEQVGLFLYMLSQPGSVRNCEER 128
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPD---SNRMEEIKSKFEESFGLLNCCGAIDA 183
F T+S+ +EA+ AK +K D + +EI +C GAID
Sbjct: 129 FQHSGETISRHFHNVLEAVCMFAKDIIKHVDPSFRDTPDEILKDARYCPYFRDCIGAIDG 188
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI + +P + ++ + V + D M F + GW G + +++
Sbjct: 189 THIRVCVP-YHLQGVYIGRKGYTTTNVMVVCDFSMCFTFVWAGWEGSAHDTKIFM----- 242
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-----------GLSA 292
EA ++ + S+ +Y + GYP + PY+ +
Sbjct: 243 ---EALRKPALHFPHPSQG----KYYLVDSGYPTFMGFLGPYKKTRYHLPQFRIRPRIRG 295
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPS-IILVCCLLHNII 348
FN H + RS +AF K W+IL + P K + II+VC +HN I
Sbjct: 296 RAEVFNYYHSSLRSTIERAFGLCKERWKILGNM--SPFALKTQNQIIVVCMAIHNFI 350
>gi|321454021|gb|EFX65210.1| hypothetical protein DAPPUDRAFT_65628 [Daphnia pulex]
Length = 148
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 48/90 (53%), Gaps = 1/90 (1%)
Query: 103 VEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWPDSNRM 161
V +++AI LR LA+G+ S+ F V ST +T A+ E+ A +L P +++
Sbjct: 1 VGERLAITLRYLATGDYLHSLAYGFRVASSTACIITRNTAIAIYEKLAPTYLDVPKADQW 60
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLP 191
+ + ++ + NCCGAID HI + P
Sbjct: 61 KRVTMEYYTKWNYPNCCGAIDGKHIAIISP 90
>gi|297729097|ref|NP_001176912.1| Os12g0299600 [Oryza sativa Japonica Group]
gi|77554714|gb|ABA97510.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|255670242|dbj|BAH95640.1| Os12g0299600 [Oryza sativa Japonica Group]
Length = 754
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 124/313 (39%), Gaps = 56/313 (17%)
Query: 64 KYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSV 123
K FR+ F I + +RE+ + + GL +E+++ I + LA S +
Sbjct: 60 KVAFRMEPHVFKTIANYLREEKLLKDSRGL--------RIEEKLGIFMFMLAHNASFQDL 111
Query: 124 GVAFGVGQSTVS---QVTWRFIEALEERAKHHLKWPDSNRME---EIKSKFEESFGLLNC 177
F ST+ + ++ I AL R LK P +N+ +F F NC
Sbjct: 112 QYEFKHSGSTLHRHIKSIFKIIPALTYR---FLKLPHANQTHWKIRTNPRFFPYFK--NC 166
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GAID THI +T+ + + + +++ S V D ++ F I GW G +R+L
Sbjct: 167 IGAIDGTHIPITIDG-EKAAPYRNRKGTLSQNVMVACDFDLNFTFISCGWEGSATDARVL 225
Query: 238 KF---SGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-------ET 287
+ SGF Q NG ++V G GY I PY +
Sbjct: 226 RSAMNSGF-------QVPNGKF-----------FLVDG-GYANTPQFIAPYRGVRYHLKE 266
Query: 288 NGLSASMPT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCC 342
G P FN H R+ +A LK + IL + K ++ I
Sbjct: 267 FGRGHRRPRDYKELFNHRHAILRNHVERALGVLKKRFPILKVGTFHRIKNQV-RIPAAAA 325
Query: 343 LLHNII-IDSGDQ 354
+ HN+I + +GD+
Sbjct: 326 VFHNMIRLLNGDE 338
>gi|357118623|ref|XP_003561051.1| PREDICTED: uncharacterized protein LOC100820863 [Brachypodium
distachyon]
Length = 429
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 63/146 (43%), Gaps = 14/146 (9%)
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
Q + R + + G+ G +L+ S ++ E G+ L+ +Y+
Sbjct: 240 AQLVAGASSRILSLAAGFRGDRPDLEVLRLSSLYQELEQGRVLDPT-----------QYL 288
Query: 270 VG-GVGYPLLSWLITPYETNGLSASMPT-FNSLHEATRSLAVKAFLQLKGGWRILSKVMW 327
VG G GYPLL WL+ P++ + S FN+ H A +A L GW + ++
Sbjct: 289 VGDGGGYPLLPWLMVPFQGPAVPGSPEAGFNAAHRAMCRPVRRAARSLM-GWGAIVRLHE 347
Query: 328 RPDKRKLPSIILVCCLLHNIIIDSGD 353
R + I C +LHN+++ D
Sbjct: 348 EESPRAAVACIGTCAMLHNVLLARED 373
>gi|331221008|ref|XP_003323179.1| hypothetical protein PGTG_04716 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 471
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 62/302 (20%), Positives = 118/302 (39%), Gaps = 29/302 (9%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLA---S 116
E+ F + R+S+K+F + D + P N VE Q+ +AL L +
Sbjct: 140 EDVFMIYARMSRKSFFSLA-----DTIKNHPVFHNNSNNPQAPVEMQLIVALSNLGLYGN 194
Query: 117 GESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
G S +G + + +V T R + A+ E K H+ WP++ + + +
Sbjct: 195 GGSPHVLGQYYNISPGSVENYTNRCMFAIIETLEKKHVYWPNTQQRANVTATLAGKTVFD 254
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
C G D T + + +D+ ++ Y++ + + + R I V GW G + R
Sbjct: 255 GCIGFADGTIFPLASAPTKHKEDYWMRKMVYAVNSLIVCNWQRRIIYAVHGWCGSAHDQR 314
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN---GLSA 292
+ +K C+ + G EY++ Y +I ++ + L
Sbjct: 315 V------YKNCQLYRHPRGFFSAG-------EYLLADSAYTSTDTIIPAFKRSPGRSLPP 361
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILS----KVMWRPDKRKLPSIILVCCLLHNII 348
FN R + LK W+ L +++ + ++L + I C +LHN +
Sbjct: 362 EKQQFNYELSHNRVVVEHTIGMLKNRWQSLKSLSIEILGKKTAKRLNAWIRTCVVLHNYL 421
Query: 349 ID 350
+D
Sbjct: 422 LD 423
>gi|449673172|ref|XP_004207882.1| PREDICTED: uncharacterized protein LOC101241684 [Hydra
magnipapillata]
Length = 215
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/209 (21%), Positives = 90/209 (43%), Gaps = 19/209 (9%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA----LEERAKHHLKWP 156
+S +++A+ LR +G++Q ++ ++ + ST+S++ A L+ER H+
Sbjct: 5 VSPSERLAVTLRFFVTGDAQCTIAASYRISASTISRIISETCAAIWTSLKERNFLHVPSE 64
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ R + I +FE + + GAID HI+M P + S ++ + + +S+++ + +
Sbjct: 65 NQER-KTIAKEFENMWNFPHVKGAIDGKHIVMQAPHIGGS-EYFNYKKTHSIVLLAVCNA 122
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV-------RISSEEVELREYI 269
+F + G G + + F C G + N I + E L +
Sbjct: 123 TYQFTMVDIGDSGRQSDGSV------FNNCSLGYAIENNKLNIPDPEYIGNSEKVLLYVL 176
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFN 298
V + L ++ PY + FN
Sbjct: 177 VADDAFGLKRHMMKPYSNQNIPLDQKIFN 205
>gi|222616945|gb|EEE53077.1| hypothetical protein OsJ_35831 [Oryza sativa Japonica Group]
Length = 339
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 125/307 (40%), Gaps = 50/307 (16%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ F I + +RE+ + + + R L +E+++ I + LA S +
Sbjct: 7 FRMEPHVFKTIANYLREEKLLK--------DSRGLRIEEKLGIFMFMLAHNASFQDLQYE 58
Query: 127 FGVGQSTVSQ---VTWRFIEALEERAKHHLKWPDSNRME---EIKSKFEESFGLLNCCGA 180
F ST+ + ++ I AL R LK P +N+ +F F NC GA
Sbjct: 59 FKHSGSTLHRHIKSIFKIIPALTYR---FLKLPHANQTHWKIRTNPRFFPYFK--NCIGA 113
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID THI +T+ + + + +++ S V D ++ F I GW G +R+L+ +
Sbjct: 114 IDGTHIPITIDG-EKAAPYRNRKGTLSQNVMVACDFDLNFTFISCGWEGSATDARVLRSA 172
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-------ETNGLSAS 293
+N ++ + + ++V G GY I PY + G
Sbjct: 173 -----------MNSGFQVPNGKF----FLVDG-GYANTPQFIAPYRGVRYHLKEFGRGHR 216
Query: 294 MP-----TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
P FN H R+ +A LK + IL + K ++ I + HN+I
Sbjct: 217 RPRDYKELFNHRHAILRNHVERALGVLKKRFPILKVGTFHRIKNQV-RIPAAAAVFHNMI 275
Query: 349 -IDSGDQ 354
+ +GD+
Sbjct: 276 RLLNGDE 282
>gi|356566437|ref|XP_003551438.1| PREDICTED: uncharacterized protein LOC100812675 [Glycine max]
Length = 531
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/235 (22%), Positives = 90/235 (38%), Gaps = 28/235 (11%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATH 185
FG Q S+ + ++AL A + P S +I+ +C GAID TH
Sbjct: 250 TFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESIRFYPYFKDCIGAIDGTH 309
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL 245
I ++ S + D+ N S V + ++ F+ +++GW G + S++ L
Sbjct: 310 IPASVKGRDVS-SYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDSKV--------L 360
Query: 246 CEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLSAS 293
+A R NG +V +Y + G+P + PY N
Sbjct: 361 SDALARKNG------LKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPENE 414
Query: 294 MPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 415 KELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQ-AELVLACAALHNFL 468
>gi|16519473|gb|AAL25182.1|AC079852_15 Putative transposase [Oryza sativa]
gi|31430128|gb|AAP52086.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 435
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 68/305 (22%), Positives = 125/305 (40%), Gaps = 46/305 (15%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ + F + + +R + + ++ + VE+++A L L+ S +
Sbjct: 68 FRMEPEIFISLANYLRTEKL---------VDDTRIKVEEKLAFFLYMLSHNASFEDLQEK 118
Query: 127 FG-VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRME---EIKSKFEESFGLLNCCGAID 182
FG G S V F + +K LK P+ N + E +F F NC GAID
Sbjct: 119 FGHSGDSFHRHVKHFFNSVVPSLSKRFLKPPNPNEVHWKIEKDPRFYPYFK--NCLGAID 176
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
THI +++ + + + + +++N S V D +++ + TGW G +R+L +
Sbjct: 177 GTHIPISISSDKAA-PFRNRKNTLSQNVMIACDFDLKITFMSTGWEGSATDARVLTSA-- 233
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-------ETNGLSASMP 295
+N ++ + Y+V G GY + + PY + G P
Sbjct: 234 ---------VNKGFQVPPGKF----YLVDG-GYANTNSFLAPYRKVRYHLKEYGAGRRRP 279
Query: 296 T-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
FN H R+ + +K + IL + K ++ I + + HNII+
Sbjct: 280 QNYKELFNHRHAVLRNHVERTLGVVKKRFPILKVATFHKIKNQV-KIPVAAAVFHNIILS 338
Query: 351 -SGDQ 354
+GD+
Sbjct: 339 LNGDE 343
>gi|62701733|gb|AAX92806.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|77550562|gb|ABA93359.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 297
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 68/138 (49%), Gaps = 4/138 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+Q+AI L +A + ++ F T+ + +EA+ +++ P +
Sbjct: 88 VSVEEQIAIFLYTVAKNATNETLQDWFQHSPDTIHRYFKAVLEAITNLTSVYIRAPSLHP 147
Query: 161 MEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ K +F F NC GAID THI M LP + + + +++ + S D +++
Sbjct: 148 HPILRKPQFYPFFQ--NCIGAIDGTHIPMKLP-LDEQEPYRNRKQSISQNCMVACDFDLK 204
Query: 220 FIDIVTGWPGGMNVSRLL 237
F+ I GW G + +R+L
Sbjct: 205 FVHIHPGWEGSASDARVL 222
>gi|405975296|gb|EKC39872.1| Putative nuclease HARBI1 [Crassostrea gigas]
Length = 203
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 76/145 (52%), Gaps = 6/145 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVA---FGVGQSTVSQVTWRFIEALEERAK-HHLKWP 156
+S +++++ LR LA+GE S+ + + +S + A+ R K ++K P
Sbjct: 28 ISPAERLSLTLRYLATGERYKSLSYLSYLYRIPVPNMSLIIPESCSAIFNRLKDEYMKVP 87
Query: 157 DSNR-MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
S E+I ++FE+++ NC GA+D HI + P +++ ++ + + YS+ + +VD
Sbjct: 88 SSQADWEKIANEFEDTWNFPNCIGAMDGKHIAIRCP-LKSGSNYYNYKQFYSIGLLALVD 146
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFS 240
+ +F I G G ++ + KF+
Sbjct: 147 ADYKFKYIDCGCNGRVSDGGVFKFN 171
>gi|22330012|ref|NP_683376.1| uncharacterized protein [Arabidopsis thaliana]
gi|332193870|gb|AEE31991.1| uncharacterized protein [Arabidopsis thaliana]
Length = 324
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 95/227 (41%), Gaps = 26/227 (11%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGV 125
R+S F +C++++ + +P +NI S+E+ VA+ LR E VG+
Sbjct: 69 LLRMSLPCFTTLCNMLQTNYDLQP---TLNI-----SIEESVAMFLRICGHNEVYRDVGL 120
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFG----LLNCCGAI 181
FG Q TV + + A E A +++ P + I + + GA+
Sbjct: 121 RFGRNQETVQRKFREVLTATELLACDYIRTPTRQELYRIPERLQVDQRYWPYFSGFVGAM 180
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSG 241
D TH+ + + W ++ +N S+ + I D +M F I G PG + +L+ +
Sbjct: 181 DGTHVCVKVKPDLQGMYW-NRHDNASLNIMAICDLKMLFTYIWNGAPGSCYDTAVLQIA- 238
Query: 242 FFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
Q+ + + E +Y + GYP L+ PY ++
Sbjct: 239 --------QQSDSEFPLPPSE----KYYLVDSGYPNKQGLLAPYRSS 273
>gi|449688211|ref|XP_004211683.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 260
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 90/211 (42%), Gaps = 16/211 (7%)
Query: 199 WCDQENNYSMLVQ--GIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKL-CEAGQRLNGN 255
+ E+++ M+ + D + RF ++ W G + +R+ S K+ L N
Sbjct: 41 YLKNEDSFRMVANTFAVCDEKGRFTEVEVRWLGSVRDARIYANSNLKKMFVNRCFPLKWN 100
Query: 256 VRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQL 315
++ + +++G YPLL + Y + S FN++ + R+ AF +L
Sbjct: 101 QKLVPGYSPIMPFMLGDSAYPLLPNFLKEY-PHSESDKHALFNNMMRSARNQIKCAFRRL 159
Query: 316 KGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYGEQCCK 375
K WR+L++ + D S++ C +LHN D+ ++H + + S QCC
Sbjct: 160 KARWRVLNRCI-DVDLELTTSLVYTCFVLHNFCEDNEIEIHGKL-IQTKIVSEKRSQCCD 217
Query: 376 QVDPM----------GRTTRENLEKHLQHNQ 396
+ + G TRE + ++L N
Sbjct: 218 HHNKLDKLYTYSSSRGVQTREAVAQYLSGNN 248
>gi|356528952|ref|XP_003533061.1| PREDICTED: uncharacterized protein LOC100796294 [Glycine max]
Length = 394
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 63/270 (23%), Positives = 106/270 (39%), Gaps = 45/270 (16%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVA--------FGVGQSTVSQVTWRFIEALEERAKHH 152
+ +EKQ RR G S +VG+ VG T++ ++AL A
Sbjct: 85 VPIEKQNCHKRRRSEYGGSSSAVGINNQGNILENLSVGIGTIA-----ILKALNSLAPDL 139
Query: 153 LKWPDSNRMEEIK--SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLV 210
+ P S +I+ ++F F +C GAID THI ++ S + D+ N S V
Sbjct: 140 MVRPGSTVPAKIRESTRFYPYFK--DCIGAIDGTHIPASVKGRDVS-SYRDRHGNISQNV 196
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
+ ++ F+ +++GW G + S++L +A R NG +V +Y +
Sbjct: 197 LAACNFDLEFMYVLSGWEGSAHDSKVLS--------DALARKNG------LKVPQGKYYL 242
Query: 271 GGVGYPLLSWLITPY------------ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGG 318
G+P + PY N FN H + R++ + F K
Sbjct: 243 VDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPENEKELFNLRHASLRNVIERIFGIFKSR 302
Query: 319 WRILSKVMWRPDKRKLPSIILVCCLLHNII 348
+ I K + ++L C LHN +
Sbjct: 303 FTIFKSAPPFLFKTQA-ELVLACAALHNFL 331
>gi|222623654|gb|EEE57786.1| hypothetical protein OsJ_08336 [Oryza sativa Japonica Group]
Length = 234
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 68/138 (49%), Gaps = 4/138 (2%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+Q+AI L +A + ++ F T+ + +EA+ +++ P +
Sbjct: 25 VSVEEQIAIFLYTVAKNATNETLQDWFQHSPDTIHRYFKAVLEAITNLTSVYIRAPSLHP 84
Query: 161 MEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ K +F F NC GAID THI M LP + + + +++ + S D +++
Sbjct: 85 HPILRKPQFYPFFQ--NCIGAIDGTHIPMKLP-LDEQEPYRNRKQSISQNCMVACDFDLK 141
Query: 220 FIDIVTGWPGGMNVSRLL 237
F+ I GW G + +R+L
Sbjct: 142 FVHIHPGWEGSASDARVL 159
>gi|321470242|gb|EFX81219.1| hypothetical protein DAPPUDRAFT_317897 [Daphnia pulex]
Length = 206
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 56/144 (38%), Gaps = 49/144 (34%)
Query: 207 SMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELR 266
S+ +QG+ D ++F+DI GWPG + SR+L+ S
Sbjct: 59 SVKLQGVCDSSLKFMDIRCGWPGSKHDSRILEMSS------------------------- 93
Query: 267 EYIVGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM 326
LS + FNS R L ++F LKG R L K +
Sbjct: 94 -----------------------LSRAQRYFNSRDSGQRCLIERSFRLLKGKRRKL-KYL 129
Query: 327 WRPDKRKLPSIILVCCLLHNIIID 350
+ +P II+ CLLHN IID
Sbjct: 130 DITNLDYIPFIIITACLLHNFIID 153
>gi|403168073|ref|XP_003327777.2| hypothetical protein PGTG_08544 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167338|gb|EFP83358.2| hypothetical protein PGTG_08544 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 374
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 55/110 (50%), Gaps = 1/110 (0%)
Query: 129 VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIM 188
VG+ TV + + A+++ ++ +L WPD +R +E+ S + G C G +D T I +
Sbjct: 133 VGRGTVVKASCWVTRAIKDLSEKYLIWPDKDRQKEV-SDVMKGEGFEGCVGFVDGTTIPL 191
Query: 189 TLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+ + D++ +YS+ Q I D + +TGWPG S + K
Sbjct: 192 YQRPSIDGEVFFDRKKHYSINCQVICDCDRFITAYMTGWPGTCGDSMVFK 241
>gi|7523685|gb|AAF63124.1|AC009526_9 Hypothetical protein [Arabidopsis thaliana]
Length = 408
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 56/253 (22%), Positives = 103/253 (40%), Gaps = 37/253 (14%)
Query: 40 WPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGR 99
W + W + A + R+S F +C++++ + +P +NI
Sbjct: 52 WRNIWRRLQQDAAACLQ---------LLRMSLPCFTTLCNMLQTNYDLQPT---LNI--- 96
Query: 100 LLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSN 159
S+E+ VA+ LR E VG+ FG Q TV + + A E A +++ P
Sbjct: 97 --SIEESVAMFLRICGHNEVYRDVGLRFGRNQETVQRKFREVLTATELLACDYIRTPTRQ 154
Query: 160 RMEEIKSKFEESFG----LLNCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIV 214
+ I + + GA+D TH+ + + P +Q + ++ +N S+ + I
Sbjct: 155 ELYRIPERLQVDQRYWPYFSGFVGAMDGTHVCVKVKPDLQGM--YWNRHDNASLNIMAIC 212
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
D +M F I G PG + +L+ + Q+ + + E +Y + G
Sbjct: 213 DLKMLFTYIWNGAPGSCYDTAVLQIA---------QQSDSEFPLPPSE----KYYLVDSG 259
Query: 275 YPLLSWLITPYET 287
YP L+ PY +
Sbjct: 260 YPNKQGLLAPYRS 272
>gi|72106786|ref|XP_796408.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 483
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/293 (20%), Positives = 118/293 (40%), Gaps = 19/293 (6%)
Query: 63 FKYFFRVSKKTF----DYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
FK F RV F + + + RPP L ++AI LR LA+G
Sbjct: 128 FKSFLRVEPDLFLELVQRVGPRIEKSKKGRPP----------LQPGLKLAITLRFLATGN 177
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWPDS-NRMEEIKSKFEESFGLLN 176
S S+ +F V +T+S +A+ +E + K P + + + F++ + +
Sbjct: 178 SFRSLEFSFRVAHNTISIFIPVVCQAIVDEYRQKVFKTPSTPDEWRRVAQVFQDRWNFPH 237
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
CGA+D H+ PA + + + YS+++ + D E + + G G + +
Sbjct: 238 VCGAVDGKHVATRKPA-HSGSTYYTYKGYYSIVILVLADGEYKALWADVGSQGSDSDCGI 296
Query: 237 LKFSGFFK-LCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
SG + L + +++ + +++G +PL +++ P+ L+
Sbjct: 297 FNRSGLLRSLITGTIGFPPPEPLPNDDRDSGFFLLGDNAFPLREFMLKPFSKRYLNREEM 356
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
N R + F + +R L + +R + +I C LHN++
Sbjct: 357 VCNYRLSRARRVVENLFGIMAKRFRCLLTTLDVEPERAM-TISNACITLHNLL 408
>gi|356573553|ref|XP_003554922.1| PREDICTED: uncharacterized protein LOC100809505 [Glycine max]
Length = 531
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 95/237 (40%), Gaps = 32/237 (13%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDA 183
FG Q S+ + ++AL A + P S +I+ ++F F +C GAID+
Sbjct: 250 TFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPYFK--DCIGAIDS 307
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI ++ S + D+ N S V + ++ F+ +++GW G + S++
Sbjct: 308 THIPASVKGRDVS-SYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDSKV------- 359
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLS 291
L +A R NG +V +Y + G+P + PY N
Sbjct: 360 -LSDALARKNG------LKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPE 412
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 413 NEKELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQ-AELVLACAALHNFL 468
>gi|356533420|ref|XP_003535262.1| PREDICTED: uncharacterized protein LOC100791415 [Glycine max]
Length = 488
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 94/237 (39%), Gaps = 32/237 (13%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDA 183
FG Q S+ + ++AL A + P S +I+ ++F F +C GAID
Sbjct: 207 TFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPYFK--DCIGAIDG 264
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI ++ S + D+ N S V + ++ F+ +++GW G + S++
Sbjct: 265 THIPASVKGRDVS-SYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSTHDSKV------- 316
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLS 291
L +A R NG +V +Y + G+P + PY N
Sbjct: 317 -LSDALARKNG------LKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPE 369
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 370 NEKELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQ-AELVLACAALHNFL 425
>gi|449663352|ref|XP_004205731.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 134
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 2/102 (1%)
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAIDATH 185
FG+ S V +A+ +L P + M S+FE FG+ G ID TH
Sbjct: 6 FGITVCIASAVIISVCKAISNLGPKYLFLPRNQTEMRNKVSEFESKFGMTQAFGCIDGTH 65
Query: 186 IIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
I + P + D +C ++ +S+ VQ + D++ F+D+ W
Sbjct: 66 IPIRRPLTNSQDFFCYKQ-YFSLNVQAVCDYKGYFMDVECMW 106
>gi|356546197|ref|XP_003541517.1| PREDICTED: uncharacterized protein LOC100780130 [Glycine max]
Length = 436
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 63/270 (23%), Positives = 107/270 (39%), Gaps = 34/270 (12%)
Query: 94 INIEGRLLSVEKQVAIALRRLASGESQVSV-GVAFGVGQSTVSQVTWRFIEALEERAKHH 152
IN +G +L V I + S ++ V FG Q S+ + ++AL A
Sbjct: 109 INNQGNVLE-NLSVGIGTIAIVSQNTRYCVIRNTFGRSQFATSENFHKILKALNSLAPDL 167
Query: 153 LKWPDSNRMEEIK--SKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLV 210
+ P S +I+ ++F F +C GAID THI ++ S + D+ N S V
Sbjct: 168 MVRPGSTVPAKIRESTRFYPYFK--DCIGAIDGTHIPASVKGRDVS-SYRDRHGNISQNV 224
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
+ ++ F+ +++GW G + S++L +A R NG +V +Y +
Sbjct: 225 LAACNFDLEFMYVLSGWEGSAHDSKVLS--------DALARKNG------LKVPQGKYYL 270
Query: 271 GGVGYPLLSWLITPY------------ETNGLSASMPTFNSLHEATRSLAVKAFLQLKGG 318
G+P + PY N FN H + R++ + F K
Sbjct: 271 VDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPENEKELFNLRHASLRNVIERIFGIFKSR 330
Query: 319 WRILSKVMWRPDKRKLPSIILVCCLLHNII 348
+ I K + ++L C LHN +
Sbjct: 331 FTIFKSAPPFLFKTQ-AELVLACAALHNFL 359
>gi|449691859|ref|XP_004212822.1| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 131
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 58/118 (49%), Gaps = 2/118 (1%)
Query: 128 GVGQSTVSQVTWRFIEALEERAKHHLKWPD-SNRMEEIKSKFEESFGLLNCCGAIDATHI 186
+ Q T+S++ R +L +R ++K+P + + E + F D TH+
Sbjct: 9 ALSQPTISRIIRRVSLSLAKRVNEYIKYPAVPHVLNESRVNFYNVAEFPRITRLTDGTHV 68
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
+ P + + ++ +N+S+ VQ + D +FIDIV WPG + +++L+ S K
Sbjct: 69 CIQKPR-EHEYVYVNRSSNHSINVQAVCDCNGKFIDIVAKWPGSTHDAKILRESKLGK 125
>gi|357153495|ref|XP_003576469.1| PREDICTED: uncharacterized protein LOC100829939 [Brachypodium
distachyon]
Length = 394
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 64/295 (21%), Positives = 116/295 (39%), Gaps = 14/295 (4%)
Query: 63 FKYFFRVSKKTFDYICSLVRE-DLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
F+ FR+S+K F I +R+ D + I G S++K ++LR LA G
Sbjct: 45 FRRRFRMSRKLFLKIAEYLRDCDDYFKLKRDAIGTLG-FTSIQK-CTVSLRLLAYGIPAD 102
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
+ + +ST +RF A+ + +L+ P + I ++ E G + G+
Sbjct: 103 TQDDYLRMAESTTIDCMYRFCRAIVAVFGEQYLRTPTAKDTARIMAQNAER-GFPSMLGS 161
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID H + + + ++++ + D ++ G G N +L+ S
Sbjct: 162 IDCMHWSWKNCPFAHQGMYKSHKESCGVVLEAVADQDLWIWHAFFGMAGSHNDINVLQCS 221
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
F G N I+ Y+ G+ YP S + + F
Sbjct: 222 NVFAKLVEGTAAPVNYDINGHVYNKGYYLADGI-YPRWSTFVKTISNAPAGGARSWFAMQ 280
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRP----DKRKLPSIILVCCLLHNIIIDS 351
EA R +AF L+ + I+ W P K ++ ++ C ++HN+II+S
Sbjct: 281 QEACRKDVERAFGVLQARFAIV----WYPALTWSKDQMWEVMTACVIMHNMIIES 331
>gi|297795315|ref|XP_002865542.1| hypothetical protein ARALYDRAFT_917556 [Arabidopsis lyrata subsp.
lyrata]
gi|297311377|gb|EFH41801.1| hypothetical protein ARALYDRAFT_917556 [Arabidopsis lyrata subsp.
lyrata]
Length = 593
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 65/145 (44%), Gaps = 7/145 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+S+EK VA+ LR E Q VG+ FG Q TV++ + + A E A ++K P
Sbjct: 361 VSIEKSVAMFLRICGHNEVQRDVGLRFGRTQETVNRKFFEVLRATELLACDYIKTPTRQE 420
Query: 161 MEEIKSKFEESFG----LLNCCGAIDATHIIMTL-PAVQTSDDWCDQENNYSMLVQGIVD 215
+ I K + GAID H+ + + P +Q + ++ + S + I D
Sbjct: 421 LRRIPKKLQRDRRYWPYFSGFVGAIDGVHVCVKVKPELQGM--YWNRHDRTSFNIMAICD 478
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFS 240
M F + G PG + + +L +
Sbjct: 479 INMLFTYVWNGAPGSCHDTAVLTMA 503
>gi|357116843|ref|XP_003560186.1| PREDICTED: uncharacterized protein LOC100823720 [Brachypodium
distachyon]
Length = 259
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 76/177 (42%), Gaps = 11/177 (6%)
Query: 63 FKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVS 122
F FRVS + F + L+ R + + +IE +A+ L + +S
Sbjct: 63 FYKMFRVSPEVFMALHDLLVSSYGLRSTTRVSSIES--------LAMFLWIVGGPQSFAQ 114
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESF--GLLNCCGA 180
V F TV + L + +KH++K D E EE F GA
Sbjct: 115 VENRFTWSLWTVHTKFKEVLRCLCKLSKHNIKPRDPTFSVEHDKVREERFWPHFRGAIGA 174
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
ID THI + +P +T + C + S V + D +MRFI V GWPG ++ +R+L
Sbjct: 175 IDGTHIPVIVPTDETVNYTC-RHGYTSQNVLAMCDFDMRFIFAVAGWPGSVHDTRVL 230
>gi|356533383|ref|XP_003535244.1| PREDICTED: uncharacterized protein LOC100817811 [Glycine max]
Length = 422
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 94/237 (39%), Gaps = 32/237 (13%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDA 183
FG Q S+ + ++AL A + P S +I+ ++F F +C GAID
Sbjct: 141 TFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPYFK--DCIGAIDG 198
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI ++ S + D+ N S V + ++ F+ +++GW G + S++L
Sbjct: 199 THIPASVKGRDVS-SYRDRHENISQNVLAACNFDLEFMYVLSGWEGSAHDSKVLS----- 252
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLS 291
+A R NG +V +Y + G+P + PY N
Sbjct: 253 ---DALARKNG------LKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPE 303
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 304 NEKELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQA-ELVLACAALHNFL 359
>gi|115468766|ref|NP_001057982.1| Os06g0595700 [Oryza sativa Japonica Group]
gi|113596022|dbj|BAF19896.1| Os06g0595700 [Oryza sativa Japonica Group]
Length = 817
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 80/358 (22%), Positives = 144/358 (40%), Gaps = 66/358 (18%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
FR+ + F + + +R + + ++ + VE+++A L L+ S +
Sbjct: 152 FRMEPEIFISLANYLRTEKL---------VDDTRIKVEEKLAFFLYMLSHNASFEDLQEK 202
Query: 127 FG-VGQSTVSQVTWRFIEALEERAKHHLKWPDSNRME---EIKSKFEESFGLLNCCGAID 182
FG G S V F + +K LK P+ N + E +F F NC GAID
Sbjct: 203 FGHSGDSFHRHVKHFFNSVVPSLSKRFLKPPNPNEVHWKIEKDPRFYPYFK--NCLGAID 260
Query: 183 ATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGF 242
THI +++ + + + + +++N S V D +++ + TGW G +R+L +
Sbjct: 261 GTHIPISISSDKAA-PFRNRKNTLSQNVMIACDFDLKITFMSTGWEGSATDARVLTSA-- 317
Query: 243 FKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY-------ETNGLSASMP 295
+N ++ + Y+V G GY + + PY + G+ P
Sbjct: 318 ---------VNKGFQVPPGKF----YLVDG-GYANTNSFLAPYRKVRYHLKEYGVGRRRP 363
Query: 296 T-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
FN H R+ + +K + IL + ++ I +V + HNII
Sbjct: 364 QNYKELFNHRHAVLRNHVERTLGVVKKRFPILKVATFHKINNQV-KIPVVAAVFHNIIRS 422
Query: 351 -SGDQ---------LHPD--VALSDHHDSGYG---------EQCCKQVDPMGRTTREN 387
+GD+ +HP V L D + G G + K+ P GR T+++
Sbjct: 423 LNGDEQWLNNQPHNIHPSNYVDLPDVNMLGRGSPRLTMLLAKAAQKKNSPKGRPTKKH 480
>gi|357117845|ref|XP_003560672.1| PREDICTED: uncharacterized protein LOC100840686 [Brachypodium
distachyon]
Length = 221
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 84/180 (46%), Gaps = 15/180 (8%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
+ F FR+ + F + ++ E R S + S ++ +A+ L L + +S
Sbjct: 53 KNFYDMFRMRRSVFHPLHDILVEKYGLRSSSNM--------SSKEALALFLWTLGAPQSN 104
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD---SNRMEEI-KSKFEESFGLLN 176
+ V F ST+S+ + L+ A L D ++ E++ K +F F +
Sbjct: 105 IQVANRFEHSPSTISRKFEEVLNCLDRMAGDQLAPIDPTFTHVHEKLRKPRFWPHFK--D 162
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
GAID THI + +PA + +++ S V + D +MRFI +V GWPG ++ +R+
Sbjct: 163 AIGAIDGTHIPVIVPA-ELKVIHTNRKGYTSQNVMAMCDFDMRFIFVVPGWPGSVHDTRV 221
>gi|449688289|ref|XP_004211707.1| PREDICTED: uncharacterized protein LOC100198190 [Hydra
magnipapillata]
Length = 347
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 61/309 (19%), Positives = 123/309 (39%), Gaps = 52/309 (16%)
Query: 58 DEEEGFKYFFRVS----KKTFDYICSLVRE------DLVSRPPSGLINIEGRLLSVEKQV 107
++ E F +FR+S ++ ++C L+R+ D +S P ++
Sbjct: 42 NKNELFFRYFRMSPTILEELLSWVCPLIRKKSTHMRDCISAP---------------ERF 86
Query: 108 AIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSK 167
+ +R L +G++Q ++ + + + S V ++ A+ W ++ EI K
Sbjct: 87 CVTMRFLVTGDAQKTIAMNYRMSPSVVGRIISETCRAI---------WLLTS---EIDYK 134
Query: 168 FEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGW 227
+ +C GAID H++M PA S + + + +S+++ + + + +F + G
Sbjct: 135 IRNKWNFPHCLGAIDVKHVVMQAPACSGSSFY-NYKKTHSIVLLAVCNAKYQFSLLDIGN 193
Query: 228 PGGMNVSRLLKFSGFFKLCEAGQRLNGNV-------RISSEEVELREYIVGGVGYPLLSW 280
G N + S + G + ++ +++ E L VG + L
Sbjct: 194 SGRQNDGSVYANS------QLGYAIENDLLDIPQACKVNGTETILLHVFVGDNSFGLKPQ 247
Query: 281 LITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILV 340
++ PY LS FN R + AF +RI + + K K+ I
Sbjct: 248 MMKPYPFCNLSYKKRIFNYGLSRVRRIVENAFGIAASRFRIFRRPIIASTK-KVVLITKA 306
Query: 341 CCLLHNIII 349
LHN ++
Sbjct: 307 VVALHNFLM 315
>gi|58378661|ref|XP_308930.2| AGAP006816-PA [Anopheles gambiae str. PEST]
Length = 325
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 119/305 (39%), Gaps = 44/305 (14%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
E+E F F VS+ F + + L+S+ + +V ++A LR LA+G
Sbjct: 45 EDEWFLKCFHVSRDIFKSLYDGIAPTLLSQYEA----------TVYDRIAATLRYLATGT 94
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEAL----EERAKHHLKWPDSNRME--EIKSKFEESF 172
GV G ++ F + L + H+ NR E E S F
Sbjct: 95 CTSDRGVG-GYKAMPRAEFQAMFPQTLNAIRQLICNQHISLKPQNRDEQWEATSYFRRIL 153
Query: 173 GL---LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
G + C THI + P+ Q + + YS+ I DH R + + G
Sbjct: 154 GPTPGVAFCAV--GTHIPIAEPSEQKHLFY-YKYGTYSLNALMIFDHRKRIRYVNASFCG 210
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
M+ S L SG A Q GN + + ++ +P W+I P G
Sbjct: 211 AMHDSHLWNSSGV-DSHFAQQHAKGNEKCN---------LLANSMFPSQPWIIKP--KPG 258
Query: 290 LSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP----SIILVCCLLH 345
+ PTFN+ HE + A A LK ++ L + +P P ++I VCC LH
Sbjct: 259 IVD--PTFNARHERALATADTAVHLLKNRFKCL---LGKPPIPYTPPECVAVIDVCCALH 313
Query: 346 NIIID 350
N+ +D
Sbjct: 314 NMCLD 318
>gi|322781893|gb|EFZ10299.1| hypothetical protein SINV_12353 [Solenopsis invicta]
Length = 129
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 59/117 (50%), Gaps = 2/117 (1%)
Query: 106 QVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRM-EEI 164
++ I LR LASG+S S+ AF VG +T+S++ E + K + D+ + +
Sbjct: 11 RLQITLRYLASGDSMKSLSYAFRVGHNTISKIISETYEVIWNCFKDSVFLKDNQESWKAV 70
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
++FE + NC GAID H+ + P + + + + ++S+ + I D FI
Sbjct: 71 ANEFERLWNFPNCIGAIDGKHVQIQAPP-NSGSTYYNYKGHFSINLLAISDANYCFI 126
>gi|168065833|ref|XP_001784851.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663605|gb|EDQ50361.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 104
Score = 51.2 bits (121), Expect = 8e-04, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 37/67 (55%)
Query: 311 AFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHPDVALSDHHDSGYG 370
+F LKG +IL D +++PS+IL CCLLHN IID D +H + L H + Y
Sbjct: 15 SFRILKGMCKILKNPFTNCDLKRVPSLILACCLLHNNIIDKNDLIHELIILWRHLNLEYR 74
Query: 371 EQCCKQV 377
+ K +
Sbjct: 75 HEVVKNI 81
>gi|357151821|ref|XP_003575915.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 393
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 63/316 (19%), Positives = 128/316 (40%), Gaps = 59/316 (18%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R++K++ +C+ +RE +GL +I +SVE+ + I + G +
Sbjct: 75 LRLTKRSISDLCAFLRE------KAGLRDIFH--VSVEEALTIFFLVVGHGMKYRLIRST 126
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR---MEEIKSKFEESFGLLNCCGAIDA 183
+ T+S+ + A+ ++ +K PD + I F + G L D
Sbjct: 127 YRWTLETISRHFNEVLGAILSLSREFIKLPDPATELPQDNIWKWFPDGLGTL------DG 180
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
TH+ + + A + + + ++ + V G+ D M+F+ ++ G+ G + SR+L+
Sbjct: 181 THVRVRVAA-RKQGRYRNMKHQITTNVLGVCDRSMKFVYVLAGYEGSASDSRVLR----- 234
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYET----------NGLSAS 293
+ +R + +V +Y + GY + + PY + NG +
Sbjct: 235 ---------DAMLRQDAFKVPSGKYYLVDAGYTNVPGFLAPYRSVRYHLKEWAANGNNPQ 285
Query: 294 MP--TFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
P +N H + R++ + F +K W + +I CC+LHN + +
Sbjct: 286 TPRELYNLRHASARNVVERIFAFIKMRWAFIR-------------VINACCILHNFL--A 330
Query: 352 GDQLHPDVALSDHHDS 367
G Q D + D+
Sbjct: 331 GRQREMDYIMLQQVDN 346
>gi|357130999|ref|XP_003567131.1| PREDICTED: uncharacterized protein LOC100828284 [Brachypodium
distachyon]
Length = 378
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 115/291 (39%), Gaps = 6/291 (2%)
Query: 63 FKYFFRVSKKTFDYICSLVRE-DLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQV 121
F+ FR+S+K F I +RE D + + ++ G S++K A +LR A G
Sbjct: 89 FRRRFRMSRKLFLKIAEYLREYDDYFKLKTEAVSTLG-FTSIQKCTA-SLRLFAYGIPAD 146
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
+ + +STV +RF A+ HL+ P + I ++ E G L G+
Sbjct: 147 TQDDYLRMAESTVIDCMYRFYRAIVAVFGDQHLRAPIAEDTARIMARNVER-GFLGMLGS 205
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
ID H D + + + S++++ + D ++ G G N +L+ S
Sbjct: 206 IDCMHWSWKNCPFAHQDMYKSHKGSCSVVLEAVADQDLWIWYAFFGMAGSHNDINVLQCS 265
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
F G N I+ Y+ G+ YP S + F
Sbjct: 266 NVFAKLVEGTAPPVNYEINGHVYNKGYYLADGI-YPKWSTFVKTISNAPAGGVRSWFAMQ 324
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
EA R +AF L+ + I+ K ++ ++ ++HN+II+S
Sbjct: 325 QEACRKDVERAFGVLQVRFAIVRYPALTWSKDRMWEVMTAYVIMHNMIIES 375
>gi|402080063|gb|EJT75208.1| hypothetical protein GGTG_05145 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 413
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 77/204 (37%), Gaps = 34/204 (16%)
Query: 155 WPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV 214
WPD R E +E+F C GA+D T +I A + W ++ + V V
Sbjct: 160 WPDCRRQE-----VQEAFA--GCLGAVDGT-LINAKVAAEEGVVWRSRKGPTAQNVMAAV 211
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
DH RF+ + G G ++ S+ F + G + G Y VG G
Sbjct: 212 DHRGRFVSVTAGIEGLVHDSQ-----AFREALARGFVVPGGY-----------YYVGDAG 255
Query: 275 YPLLSWLITPY--------ETNGLSASMP--TFNSLHEATRSLAVKAFLQLKGGWRILSK 324
+ L +ITPY E P FN H R ++F +LK W+I+
Sbjct: 256 FGLGQGVITPYGATKYHLQEQGFARPETPEELFNLHHAKIRVRVERSFGELKNRWKIVRG 315
Query: 325 VMWRPDKRKLPSIILVCCLLHNII 348
R SI+L L N I
Sbjct: 316 PGPSYSIRTQISIVLAVTALQNFI 339
>gi|356533889|ref|XP_003535490.1| PREDICTED: uncharacterized protein LOC100780221 [Glycine max]
Length = 531
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 95/237 (40%), Gaps = 32/237 (13%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDA 183
FG Q S+ + ++AL A + P S +I+ ++F F +C GAID
Sbjct: 250 TFGRSQFATSENFHKILKALNSLAPDLMVRPGSIVPAKIRESTRFYPYFK--DCIGAIDG 307
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI ++ S + D+ N S V + ++ F+ +++GW G + S++
Sbjct: 308 THIPASVKGRDVS-SYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDSKV------- 359
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLS 291
L +A R NG +V +Y + G+P + PY N
Sbjct: 360 -LSDALARKNG------LKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPE 412
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
+ FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 413 SEKELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQ-AELVLACAALHNFL 468
>gi|328698221|ref|XP_003240585.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 326
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/218 (22%), Positives = 95/218 (43%), Gaps = 20/218 (9%)
Query: 143 EALEERAKHHLKWPDSN-RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCD 201
EA+ ++K P+ + ++E + KF GAID TH+ + P ++ + +
Sbjct: 83 EAIASLRPLYIKMPEHHCNIQETRLKFYNIARFPRIIGAIDCTHVKLQSPGGNIAEVYRN 142
Query: 202 QENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSE 261
++ +S+ VQ + + +DIV WPG + + N + E
Sbjct: 143 RKGYFSLNVQVVGGRSLEILDIVARWPGSTHDQVIFN--------------NSTIHFKFE 188
Query: 262 EVELREYI-VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWR 320
E+ + I +G GY +++ P + +A + +N TR+ + F K +
Sbjct: 189 TNEMGDNILLGDGGYECRPYILVPLISPNTNAEL-LYNESQIRTRNTIERLFGVWKRRFP 247
Query: 321 ILS-KVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLHP 357
ILS + PD+ + +II+ ++HN+ G+ + P
Sbjct: 248 ILSLGIRTTPDRAQ--AIIVATAVVHNLASVLGESIPP 283
>gi|222637325|gb|EEE67457.1| hypothetical protein OsJ_24845 [Oryza sativa Japonica Group]
Length = 452
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/310 (20%), Positives = 126/310 (40%), Gaps = 14/310 (4%)
Query: 56 PSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIE--GRL-LSVEKQVAIALR 112
P ++ F+ FR+ K F +I E L P + + G++ LS ++ A+R
Sbjct: 88 PLYTDDQFRRRFRMRKHLFLHIV----EALGIWSPYFCLRRDAFGKVGLSPLQKCTAAIR 143
Query: 113 RLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEES 171
LA G + FGV +ST + F++ + + +L+ P+ ++ + + E+
Sbjct: 144 MLAYGTPADLMDETFGVAESTAMECMINFVQGVRHIFGQQYLRKPNEQDIQCLLQQ-GEA 202
Query: 172 FGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNY-SMLVQGIVDHEMRFIDIVTGWPGG 230
G G++D H V + + +++++ + ++ F G G
Sbjct: 203 HGFPGMLGSLDCMHWEWQNCPVAWKGQFTRGDYGVPTIMLEAVASADLWFWHAFFGAAGS 262
Query: 231 MNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL 290
N +L S F G+ + ++ E + Y+ + YP + T
Sbjct: 263 NNDINVLDQSPLFTAVLQGRAPSVQFTVNGTEYNMGYYLADNI-YPEWAAFAKSI-TRPQ 320
Query: 291 SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIID 350
S + E+ R +AF L+ W I+ W ++ +L I+ C +LHN+I++
Sbjct: 321 SDKAKLYAQRQESARKDVERAFGVLQKRWAIIRHPAWLWERDELADIMYACIILHNMIVE 380
Query: 351 S--GDQLHPD 358
D + PD
Sbjct: 381 DERDDYVIPD 390
>gi|302774759|ref|XP_002970796.1| hypothetical protein SELMODRAFT_94546 [Selaginella moellendorffii]
gi|300161507|gb|EFJ28122.1| hypothetical protein SELMODRAFT_94546 [Selaginella moellendorffii]
Length = 74
Score = 50.8 bits (120), Expect = 0.001, Method: Composition-based stats.
Identities = 27/73 (36%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDDW-CDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
G LNCCGA+D TH+ + L + + DW + YSM+VQ +VD + F+ I G G +
Sbjct: 1 GFLNCCGAMDCTHMAIELLGSEDAIDWYARAKKYYSMVVQVVVDSKTSFLYITIGIVGSV 60
Query: 232 NVSRLLKFSGFFK 244
++ SG K
Sbjct: 61 PDRQVWNSSGLKK 73
>gi|134108967|ref|XP_776598.1| hypothetical protein CNBC0910 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259278|gb|EAL21951.1| hypothetical protein CNBC0910 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 398
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/306 (21%), Positives = 128/306 (41%), Gaps = 27/306 (8%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEG--RLLSVEKQVAIALRRLASG 117
EE F+ FR+SK+ + +E V + + G + E Q+A+ + RLA
Sbjct: 77 EEEFRRSFRMSKQEME------KEIWVRFAGASVFQSTGSRKQAKAEYQLALLVYRLAQP 130
Query: 118 ESQVSVGVA--FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ + FG+ T ++ T R + A+ E+ + WP S I S S +
Sbjct: 131 GCMTTKDIQQKFGISAGTTNEWTKRALTAVLEQFPLIVSWPSSQERSFICSHSPSSNRIP 190
Query: 176 NCCGAIDATHIIMTLPAVQTS---DDWCDQ-ENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
C G +D H+ ++ + W + Y+ L G+ D +R +V G G
Sbjct: 191 ECIGYLDGIHMHFHRGGLKPGGPMEYWQESCREGYNFL--GVCDDRLRMRMVVAGPTEGK 248
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
+ ++ ++ G+ +G + ++ + + G + PL ET+ +
Sbjct: 249 GQDKTMQDGLPWRSGSTGEYFDGKQFVVAD----KGFECGNMVLPLYDLDPKQQETDDPT 304
Query: 292 ASMPTFNSLHEA---TRSLAVKAFLQLKGGWRILSK---VMWRPDKRKLPSIILVCCLLH 345
F + H+A R++ A+ +K W+ L+ +++ + K ++L +LH
Sbjct: 305 VQRQMFFN-HQAQPLIRNMIQTAWGMIKSRWQYLNAAHILIYADEAEKARDMVLAAVVLH 363
Query: 346 NIIIDS 351
N++IDS
Sbjct: 364 NLLIDS 369
>gi|357129447|ref|XP_003566373.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclease HARBI1-like
[Brachypodium distachyon]
Length = 248
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 88/191 (46%), Gaps = 18/191 (9%)
Query: 179 GAIDATHIIMTLPAVQTSD-DWCDQENNY-SMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
GA+D TH+ PA + + Y S V +VD ++RF ++ GW G + S +
Sbjct: 36 GALDGTHVKACXPAHMVDRFTYVRGQKLYPSQNVLAVVDFDLRFTYVLAGWEGSAHDSLV 95
Query: 237 LKFSGFFKLCEAGQRL-NGNVRISSEEVELREYIV---GGVGYPLLSWLITPYETNGLSA 292
L + AG ++ G ++ V R I+ GV Y L + ++ L+
Sbjct: 96 LNDA---LTRRAGLKIPEGKFFLADAGVAARLGILPPYRGVRYHLNEFA----GSHHLTT 148
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
FN H + R+ +AF L ++IL+ + P K ++ +++ CC LHN I+++G
Sbjct: 149 PRELFNLRHSSLRTTVERAFGTLXSRFKILTHRPFIPLKSQI-KVVVSCCALHNWILENG 207
Query: 353 DQLHPDVALSD 363
PD + D
Sbjct: 208 ----PDEYVVD 214
>gi|390363706|ref|XP_003730429.1| PREDICTED: putative nuclease HARBI1-like [Strongylocentrotus
purpuratus]
Length = 163
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 57/136 (41%), Gaps = 16/136 (11%)
Query: 218 MRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPL 277
+ ++V WPG + SR+L S EAG++ I+G GYPL
Sbjct: 6 FKITNVVARWPGSTHDSRILTHSNIAADFEAGRK--------------EGVIIGDSGYPL 51
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI 337
WLITP + A +N H TR + Q+K + L+ V R +
Sbjct: 52 KPWLITPIQNPQTPAEY-AYNRAHPRTRVFIEQVNGQIKAKFPCLA-VGLRVAPKHACRT 109
Query: 338 ILVCCLLHNIIIDSGD 353
I+ C +L N+ + G+
Sbjct: 110 IVACAVLFNMAKEMGE 125
>gi|289742443|gb|ADD19969.1| hypothetical conserved protein [Glossina morsitans morsitans]
Length = 377
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 78/328 (23%), Positives = 128/328 (39%), Gaps = 23/328 (7%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
EE F+Y +R SK+ + +VR+DL S L + V+KQ+ +A+R E
Sbjct: 58 EENEFRYSYRYSKENMRRLIEMVRDDLEVETHSEL---RKNQVPVDKQIMVAVRYWGMTE 114
Query: 119 SQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD-SNRMEEIKSKFEESFGLLNC 177
+ GV T+ +++ R EAL + +++ P E++ F +
Sbjct: 115 HPDVTARSHGVSLRTLMKISKRVAEALAAKTSRYIRMPCLLAEKEKVTQAFYRLAHMPQV 174
Query: 178 CGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLL 237
GA+ H + +Q D C +++ SM +Q + D ++ DI + + +
Sbjct: 175 IGAV--AHSRVKCKRLQHQDGDCKEDD--SMHIQIVSDASLKIRDIDCRL---VTLQDPV 227
Query: 238 KFSGFFKLCEAGQRLNGNVRISSEEVELR-EYIVGGVGYPLLSWLITPYETNGLSASMPT 296
+ F +R E+ E R ++G S+L TP + +S +
Sbjct: 228 TAADLFSQTRIKERF--------EQTEFRGRILLGDSTLHCTSFLYTPV-SCAISGPEQS 278
Query: 297 FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQLH 356
FN T + F K + ILS R II+ LLHN+ I+ D
Sbjct: 279 FNHSLRLTYEPVRQCFKLWKERFGILS-CELRGSVATARHIIVGSALLHNMAIEWNDPTF 337
Query: 357 PDVALS-DHHDSGYGEQCCKQVDPMGRT 383
D S D E K D GRT
Sbjct: 338 ADFQNSLLGLDKSVIEDLQKSTDVRGRT 365
>gi|356566240|ref|XP_003551342.1| PREDICTED: uncharacterized protein LOC100791416 [Glycine max]
Length = 422
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 93/237 (39%), Gaps = 32/237 (13%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDA 183
FG Q S+ + ++AL A + P S +I+ ++F F +C GAID
Sbjct: 141 TFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPYFK--DCIGAIDG 198
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI ++ S + D+ N S V + ++ F+ +++GW G + S++L
Sbjct: 199 THIPASVKGRDVS-SYRDRHGNISQNVLAACNFDLEFMYVLSGWEGSAHDSKVLSV---- 253
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLS 291
A R NG +V +Y + G+P + PY N
Sbjct: 254 ----ALARKNG------LKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPE 303
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 304 NEKELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQA-ELVLACAALHNFL 359
>gi|242078913|ref|XP_002444225.1| hypothetical protein SORBIDRAFT_07g015460 [Sorghum bicolor]
gi|241940575|gb|EES13720.1| hypothetical protein SORBIDRAFT_07g015460 [Sorghum bicolor]
Length = 694
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 72/315 (22%), Positives = 124/315 (39%), Gaps = 34/315 (10%)
Query: 48 SSTPGATIPSDEEEGF----KYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSV 103
SS PGA + EG K FR+ + F I + +R + + R G+ +
Sbjct: 44 SSLPGAKKVKEILEGHENWCKVEFRMETEIFRTIANFLRAENLLRDTRGM--------KI 95
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD-SNRME 162
E+Q+ + + L+ S + F TV + + + + ++ P+ SN
Sbjct: 96 EEQLGLFMFMLSHNASTDRLKKEFQHSGETVHRKINEVFDIIPALIQRFIRLPNPSNTHV 155
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
+I NC GAID TH+ +T+ + S + +++ S V D +++F
Sbjct: 156 KITCDPRFMPFFQNCIGAIDGTHVPITIGQDKAS-PYRNRKGTLSQNVMFACDFDLKFTL 214
Query: 223 IVTGWPGGMNVSRLLKFS---------GFFKLCEAGQRLNGNVRISSEEVELREYIVGGV 273
I +GW G + + +L+ + G F L + G ++ L Y GV
Sbjct: 215 ISSGWEGSASDAGVLRSALGKGFTVPDGKFYLVDGG--------YANTPSFLAPY--RGV 264
Query: 274 GYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
Y L + N + FN H R+ +AF LK + IL P + +
Sbjct: 265 KYHLSEFRRRGQRGNAYANYKELFNHRHAILRNHIERAFGVLKKRFPILKVGTHYPIESQ 324
Query: 334 LPSIILVCCLLHNII 348
+ I + HNII
Sbjct: 325 V-KIPAAAAVFHNII 338
>gi|222625210|gb|EEE59342.1| hypothetical protein OsJ_11425 [Oryza sativa Japonica Group]
Length = 786
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 74/184 (40%), Gaps = 27/184 (14%)
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C GAID HI P + + + + + V +VD +MRF G PG M+ + +
Sbjct: 554 CIGAIDGIHITSNSPR-EDYVRYIGRSKSPTQNVMAVVDFDMRFTYSSIGQPGSMHDTSV 612
Query: 237 LKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGL------ 290
L L+ + I + + Y+V GYP + PY+
Sbjct: 613 LY-----------HVLDVDKDIFPHPPQGKYYLVDA-GYPNRPGYMAPYKGQRYHIPKWR 660
Query: 291 -----SASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKV-MWRPDKRKLPSIILVCCLL 344
S FN H + R++ + F K WRIL K+ + DK+++ II L
Sbjct: 661 RGPPPSGEQEYFNQCHSSARNVIERTFGVWKMKWRILLKMPTYAMDKQEM--IIAATMCL 718
Query: 345 HNII 348
HN I
Sbjct: 719 HNFI 722
>gi|302855139|ref|XP_002959069.1| hypothetical protein VOLCADRAFT_100456 [Volvox carteri f.
nagariensis]
gi|300255566|gb|EFJ39863.1| hypothetical protein VOLCADRAFT_100456 [Volvox carteri f.
nagariensis]
Length = 225
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 59/120 (49%), Gaps = 6/120 (5%)
Query: 60 EEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGES 119
EE F+ FR+ F+ +C +R L +R + + + + VE+QV +AL LA G +
Sbjct: 87 EETFRANFRMECSNFERLCDALRPKL-ARQDTDM----RQAIPVEQQVCMALYHLAQGSN 141
Query: 120 QVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCG 179
+ F + + +++++ RF +A+ E +PD R EI +F G+ C G
Sbjct: 142 YRVLDNTFAIARESLTRIIRRFADAVLEAFAGECGYPDPQRKLEIMEEFAVD-GMPGCVG 200
>gi|449673717|ref|XP_002156809.2| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 361
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/259 (19%), Positives = 102/259 (39%), Gaps = 20/259 (7%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEA----LEERAKHHLKWP 156
+S +++A+ LR L + ++Q ++ + + ST+S++ A L+ER H+
Sbjct: 28 VSPSERLAVTLRFLVTEDAQCTIAAGYRISASTISRIISETCAAIWTSLKERNFLHVP-S 86
Query: 157 DSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDH 216
+ + + +FE + + GAID HI+M P ++ + + +S+++ +
Sbjct: 87 EKQEWKTVAKEFENMWNFPHAIGAIDGKHIVMQAPH-NGGSEYFNYKKTHSIVLLAVCYA 145
Query: 217 EMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNV-------RISSEEVELREYI 269
+ F + G G + R+ F C G + N I + E L +
Sbjct: 146 KYEFTMVDIGESGKQSDGRV------FNNCSLGYAIENNKLNIPDPEYIGNSEKVLPYVL 199
Query: 270 VGGVGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRP 329
V + L ++ PY + FN R + F +RI +++
Sbjct: 200 VADNAFGLKRHIMKPYPNQNIPLDQKIFNYRLSRARRVIENTFGIATTRFRIFRRLII-A 258
Query: 330 DKRKLPSIILVCCLLHNII 348
+ K+ I LHN
Sbjct: 259 NLEKVILITQAIVALHNFF 277
>gi|449683707|ref|XP_004210434.1| PREDICTED: uncharacterized protein LOC101239557 [Hydra
magnipapillata]
Length = 264
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 77/189 (40%), Gaps = 17/189 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
LS +Q+ + + L + + A GV T+ R +++ +KWPD
Sbjct: 76 LSPREQILLFSQFLGTSAFYHLLRDARGVDTKTIHSTVHRVANSIQSLKNKVIKWPDD-- 133
Query: 161 MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRF 220
+ +F + G GAID T ++ +P + D+ N+++ + + M F
Sbjct: 134 CSRLVDEFFKLGGFSCVAGAIDGT-LVQVMPPKIDEALFVDRYQNHAISILAVAGPNMAF 192
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEA-GQR-LNGNVRISSEEVELREYIVGGVGYPLL 278
+ T P + SR++K S ++ E G R G V I G GY L
Sbjct: 193 YYVNTNNPSRCHDSRIVKESKLWESWETNGLRPFQGAVNI------------GNSGYSLR 240
Query: 279 SWLITPYET 287
WLITPY
Sbjct: 241 DWLITPYRN 249
>gi|449687721|ref|XP_004211522.1| PREDICTED: putative nuclease HARBI1-like [Hydra magnipapillata]
Length = 173
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 80/175 (45%), Gaps = 19/175 (10%)
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
+D THI + P + + D+ + + + V+ + D++ +F D+V WP + +R+L S
Sbjct: 1 MDCTHICIQKPH-EHEYVYVDRSSKHLINVRAVCDNKEKFNDVVAKWPRSTHDARILGES 59
Query: 241 GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTFNSL 300
G++ V L+ ++G GYP L+TPY N +AS +N
Sbjct: 60 KL------GKKFMN--------VTLKGLLIGDSGYPCFRCLLTPY-LNPTTASQHRYNIS 104
Query: 301 HEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIII--DSGD 353
TR + + F + K + +L + R + + + +LHN+ I + GD
Sbjct: 105 LRKTRVIIEQVFGRWKRRFHMLHGEI-RLTPERTCTFVTAYTVLHNLAIQLNHGD 158
>gi|449665569|ref|XP_002165138.2| PREDICTED: putative nuclease HARBI1-like, partial [Hydra
magnipapillata]
Length = 193
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 75/172 (43%), Gaps = 8/172 (4%)
Query: 179 GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
GAID H+ + PA ++ + + + YSM++ I D + F + G G N + +L
Sbjct: 1 GAIDGKHVRIEAPA-KSGSSFYNYKGFYSMVLLAIFDAKHCFTMVDIGAYGRDNDAAILN 59
Query: 239 FSGFFKLCEAGQRLNGNV-RISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMPTF 297
S F + G N+ +IS + ++ +VG + L WL+ PY L+ F
Sbjct: 60 ASTFGRAFNKGYF---NLPKISEFDHKVPPVLVGDDMFALKPWLMKPYPGKNLTVQQRVF 116
Query: 298 NSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNII 348
N R +F L WRI S + +P K+ II LHN +
Sbjct: 117 NYRLSRARRTIENSFGILAARWRIYRSPIKAKP--LKVEHIIKATVCLHNYL 166
>gi|357140002|ref|XP_003571563.1| PREDICTED: uncharacterized protein LOC100841953 [Brachypodium
distachyon]
Length = 234
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 86/224 (38%), Gaps = 65/224 (29%)
Query: 149 AKHHLKWPDSN---RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENN 205
A H+L+ D N + I+ L +C GA+D THI +LPA Q D
Sbjct: 2 ATHYLRPKDPNFRVVHKRIRGDKRAYPHLKDCIGALDGTHIRASLPADQQPGAMHDT--- 58
Query: 206 YSMLVQGI-VDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVE 264
S+L I VDH DI P G
Sbjct: 59 -SVLYHAIRVDH-----DIFPHPPKG---------------------------------- 78
Query: 265 LREYIVGGVGYPLLSWLITPYE---------TNGLSASMPT--FNSLHEATRSLAVKAFL 313
+Y + GYP + PY+ G S P FN +H A R++ ++F
Sbjct: 79 --KYYLVDAGYPNRPGYLAPYKGERYHVPDFHKGAEPSTPKEKFNRVHSAIRNVIERSFG 136
Query: 314 QLKGGWRILSKV-MWRPDKRKLPSIILVCCLLHNIIID--SGDQ 354
LK WRIL K+ + DK+KL I +LHN + + SGD+
Sbjct: 137 VLKMKWRILLKMPNYSMDKQKL--IEAASMVLHNYVREHQSGDR 178
>gi|242055775|ref|XP_002457033.1| hypothetical protein SORBIDRAFT_03g000205 [Sorghum bicolor]
gi|241929008|gb|EES02153.1| hypothetical protein SORBIDRAFT_03g000205 [Sorghum bicolor]
Length = 285
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/263 (20%), Positives = 107/263 (40%), Gaps = 13/263 (4%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAK-HHLKWPDSN 159
LS ++V A+R L G + +G+ST + +F+ A+ E + +L++P+
Sbjct: 23 LSCFQKVTAAMRMLTYGVPADATDEYVRIGESTALESLRKFVAAVVEIFEDEYLRYPNEA 82
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ + + E+ G G+ID H + + +++++ + ++
Sbjct: 83 DIARLLAMGEKK-GFPGMLGSIDCMHWAWKNCPYDKQGQYKGHVDKPTIILEAVASDDLW 141
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
G PG N +L S F G+ ++ + + Y+ G+ YP +
Sbjct: 142 IWHAFFGMPGSHNDINVLHRSPLFDNLAEGRAPGVKFSVNGHDYTMGYYLADGI-YPSWA 200
Query: 280 WLITPYETNGLSASMPT----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
L+ ++A M F EA R + +AF L+ + I+ D L
Sbjct: 201 TLV-----QSIAAPMGNKRQYFAKAQEAARKMVERAFGVLQSRFAIIRGPARPWDIETLA 255
Query: 336 SIILVCCLLHNIII-DSGDQLHP 357
I+ C ++HN+I+ D G + P
Sbjct: 256 LIMRACVIMHNMIVEDEGFVVDP 278
>gi|397624813|gb|EJK67522.1| hypothetical protein THAOC_11427, partial [Thalassiosira oceanica]
Length = 285
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 79/195 (40%), Gaps = 27/195 (13%)
Query: 173 GLLNCCGAIDATHIIMTLPAVQTSDD----------WCDQENNYSMLVQGIVDHEMRFID 222
G+ CCGA+D + + P+ T D+ +C ++ + + +QG+ D F+D
Sbjct: 27 GIDCCCGAVDGMLVWIHKPS--TRDERVIGIGPKKFFCGRKKKFGLNMQGVCDSRGFFLD 84
Query: 223 IVTGWPGGMNVSRLLKFSGFFKLCEAG--QRLNGNVRISSEEVEL---REYIVGGVGYPL 277
+ +PG S FF E+G +++ + S L + + G Y
Sbjct: 85 VEVRFPGAA--------SDFFAFDESGLKKKVEKEGFLGSAPSSLFGGKLCLFGDNAYVQ 136
Query: 278 LSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPS 336
++ TP++ S FN H R AF L W IL + + D K
Sbjct: 137 TPYMCTPWKAVS-GGSKDAFNFFHSQVRINIECAFGMLVHRWGILRRAIPMGIDVNKTTR 195
Query: 337 IILVCCLLHNIIIDS 351
++L C LHN I S
Sbjct: 196 LVLALCKLHNFCIAS 210
>gi|77553131|gb|ABA95927.1| transposon protein, putative, Pong sub-class [Oryza sativa Japonica
Group]
Length = 450
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/257 (22%), Positives = 107/257 (41%), Gaps = 16/257 (6%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEER-AKHHLKWPDSN 159
LS ++V R L G + +V +G STV + RF+ A++E + +L+ P+ N
Sbjct: 118 LSCLQKVTAVSRMLCYGIAADAVDEYVRIGGSTVIESFNRFVVAIDEVFGEEYLRTPNEN 177
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ + ++ EE G G++D H + +E +++++ + +
Sbjct: 178 DIARLLAQGEER-GFPGMLGSLDCMHWQWKNCPTAWQGQYKGKEGVPTIVLEAVASRDRW 236
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
F G PG N +L+ S F GQ N I+ + + Y+ G+ YP +
Sbjct: 237 FWHAFFGSPGSHNDINVLQRSPLFAKLAEGQAPKVNYSINGHDYTMGYYLADGI-YPSWA 295
Query: 280 WLITPYETNGLSASMPT------FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
L+ S +P F+ EA R +AF L+ + I+
Sbjct: 296 TLVK-------SIKLPQGNKRKYFSKAQEAARKDVEQAFGVLQARFAIVRGPARFWKALT 348
Query: 334 LPSIILVCCLLHNIIID 350
L I+ C +++N+I++
Sbjct: 349 LNQIMRACIIMNNMIVE 365
>gi|356497601|ref|XP_003517648.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 379
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/237 (23%), Positives = 93/237 (39%), Gaps = 32/237 (13%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDA 183
FG Q S+ + ++AL A + P S +I+ ++F F +C GAID
Sbjct: 98 TFGRSQFATSENFHKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPYFK--DCIGAIDG 155
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI ++ S + D+ N S V + ++ F+ +++GW G S++L
Sbjct: 156 THIPASVKGRDVS-SYRDRHENISQNVLAACNFDLEFMYVLSGWEGSAYDSKVLS----- 209
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLS 291
+A R NG +V +Y + G+P + PY N
Sbjct: 210 ---DALARKNG------LKVPQGKYYLVDCGFPNRRKFLAPYRGVRYHLQDFAGHGNDPE 260
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 261 NEKELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQ-AELVLACAALHNFL 316
>gi|449483945|ref|XP_004156740.1| PREDICTED: putative nuclease HARBI1-like [Cucumis sativus]
Length = 215
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 87/211 (41%), Gaps = 47/211 (22%)
Query: 174 LLNCCGAIDATHIIMTL-----PAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWP 228
+ NC GA+D T+I + + P +T ++ + V G+ D + F+ ++ GW
Sbjct: 1 MQNCLGALDGTYIKVNVSQTDRPRYRT------RKGEVATNVLGVCDTKGDFVFVLAGWE 54
Query: 229 GGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN 288
G S +L+ +A R N + + E L Y G Y L W N
Sbjct: 55 GSAADSCILR--------DAIARPNW-LHVPQEGF-LAPY--RGQRYHLQEW---QGARN 99
Query: 289 GLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSI----ILVCCLL 344
+A+ FN H A R++ +AF LK W IL R L I ILVCCLL
Sbjct: 100 APAAAKEYFNMKHFAARNVIERAFDLLKDRWTIL-----RGKSYYLVQIQCRTILVCCLL 154
Query: 345 HNII------------IDSGDQLHPDVALSD 363
HN+I +D GD + + D
Sbjct: 155 HNLINREMTNVDFVDDVDEGDSTYATIGGDD 185
>gi|390363457|ref|XP_788048.3| PREDICTED: uncharacterized protein LOC583028, partial
[Strongylocentrotus purpuratus]
Length = 789
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/207 (22%), Positives = 94/207 (45%), Gaps = 8/207 (3%)
Query: 84 DLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIE 143
D+ P G + EG S ++++ LR LA+G + + VG+ T+ +
Sbjct: 355 DVERNPEEGHPHQEGH--SPGLKLSVFLRHLATGATYAELSYNSRVGKETIQKFVPDVAR 412
Query: 144 AL-EERAKHHLKWPDSNRM-EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCD 201
A+ +E A + P +N E+ FE + L +C GA D +I + P ++ + +
Sbjct: 413 AIVDEYAAEVISLPTTNEGWLEVAGDFEARWNLPHCLGAYDGNNIRLQKPN-KSGSLYFN 471
Query: 202 QENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQ---RLNGNVRI 258
+ +S+++ +VD + +F+ I G G + +++ S + EAG +
Sbjct: 472 YKQFFSVVLMALVDSKYQFLWIDVGGVGHQSDAQIYNNSELKECIEAGTLGIPDPAPLPH 531
Query: 259 SSEEVELREYIVGGVGYPLLSWLITPY 285
EE + + VG + + ++++ PY
Sbjct: 532 DDEEHPMPYFFVGDDAFAMRTYMMKPY 558
>gi|356518973|ref|XP_003528149.1| PREDICTED: uncharacterized protein LOC100782123 [Glycine max]
Length = 467
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/237 (22%), Positives = 93/237 (39%), Gaps = 32/237 (13%)
Query: 126 AFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK--SKFEESFGLLNCCGAIDA 183
FG Q S+ ++ ++AL A + P S +I+ ++F F +C GAID
Sbjct: 186 TFGQSQFATSENFYKILKALNSLAPDLMVRPGSTVPAKIRESTRFYPYFK--DCIGAIDG 243
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI ++ S + D N S V + ++ F+ +++GW G + S++
Sbjct: 244 THIPASVKGRDVS-SYRDHHGNISQNVLAACNFDLEFMYVLSGWEGSAHDSKV------- 295
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY------------ETNGLS 291
L +A R NG +V +Y + G+P + Y N
Sbjct: 296 -LSDALARKNG------LKVPQGKYYLVDCGFPNRRKFLASYRGVRYHLQDFAGHGNDPE 348
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNII 348
FN H + R++ + F K + I K + ++L C LHN +
Sbjct: 349 NEKELFNLRHASLRNVIERIFGIFKSRFTIFKSAPPFLFKTQA-ELVLACAALHNFL 404
>gi|440577352|emb|CCI55357.1| PH01B035L11.2 [Phyllostachys edulis]
Length = 330
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 73/163 (44%), Gaps = 27/163 (16%)
Query: 210 VQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYI 269
V +VD ++RF ++ GW G + S +L +A NG ++ R++
Sbjct: 16 VLAVVDFDLRFTYVLAGWEGSAHDSLVLH--------DALSSPNG------LKIPKRKFY 61
Query: 270 VGGVGYPLLSWLITPYE------TNGLSASMPT-----FNSLHEATRSLAVKAFLQLKGG 318
+ GY ++ PY L P FN H + R+ +AF LK
Sbjct: 62 LADAGYATRPGILPPYRGVRYHLQEFLGPQDPKCPKELFNHRHSSLRTTVERAFGTLKNR 121
Query: 319 WRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG-DQLHPDVA 360
++ILS + P K + +++ CC L+N I+++G D+L D A
Sbjct: 122 FKILSNKPFIPLKSQA-KVVVACCALNNWILENGPDELVFDEA 163
>gi|346467333|gb|AEO33511.1| hypothetical protein [Amblyomma maculatum]
Length = 340
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 111/289 (38%), Gaps = 38/289 (13%)
Query: 54 TIPSDEEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRR 113
I S E F+ FR K FDY+ ++ + I + GR + + I LRR
Sbjct: 27 NIDSIEPSLFRQQFRFDKADFDYLFGALQVPETVKSAQN-ITVPGR-----EALCITLRR 80
Query: 114 LASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIK----SKFE 169
LA + FG S +S V + + + K+ L DSN + + +KF
Sbjct: 81 LAYPNRWCDLEAIFGRHSSVMSSVATQVVHHIAHSFKNLLN--DSNNHDWLSPAHLAKFA 138
Query: 170 ESF-----GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
+ L NC G ID T + P + + + + + Q ++ +
Sbjct: 139 SAVQNRGAALPNCWGFIDGTARPICRPTRNQREYFSGHKRMHVVKYQSVMCPNGIVCQLD 198
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITP 284
+PG + + +LK SG + E +L G+ + + G YPL L+ P
Sbjct: 199 GPYPGRRHDAGILKTSGLYTKLE---KLVGHHQFV---------LYGDPAYPLKPLLMKP 246
Query: 285 YETNGLSASMPTFNSLHEATRS---------LAVKAFLQLKGGWRILSK 324
Y + L+A+ FN R +A AFL K ++L +
Sbjct: 247 YGGSSLTATQLAFNYSMSRVRQSVEWGFGKVVAEFAFLDFKKNQKLLRQ 295
>gi|325192432|emb|CCA26869.1| hypothetical protein AaeL_AAEL007906 [Albugo laibachii Nc14]
Length = 416
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/251 (19%), Positives = 106/251 (42%), Gaps = 4/251 (1%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRF-IEALEERAKHHLKWPDSN 159
LS ++ A+R+LA G S+ + ++T ++ RF + ++ +L+ P +
Sbjct: 96 LSAIQKALSAIRQLAYGAPADSMDEHVRMAETTALEILKRFCVAVVDSLGDEYLRAPTTQ 155
Query: 160 RMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ + EE G G++D H + + +E +++++ + D+E
Sbjct: 156 DLSRLLRIGEER-GFPGMLGSLDCMHWRWKNCPKGWAGQFTGKEKASTIVLEAVADYEGW 214
Query: 220 FIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLS 279
+ G PG N +L S F G+ + + +I+ + Y+ G+ YP +
Sbjct: 215 MWHLFFGMPGSHNDINVLDRSHLFTDFRDGRAPSVSYQINGTHYSMGYYLADGI-YPPWA 273
Query: 280 WLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIIL 339
L+ +N S F E+ R +AF L+ + IL++ + + I+
Sbjct: 274 TLVQTI-SNPQSKKKKFFTEAQESARKDVERAFGMLQSQYAILTRPCRLWSEEDMAYIVK 332
Query: 340 VCCLLHNIIID 350
C +L ++ I+
Sbjct: 333 ACVILRHMAIE 343
>gi|356514841|ref|XP_003526111.1| PREDICTED: putative nuclease HARBI1-like [Glycine max]
Length = 329
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 79/180 (43%), Gaps = 28/180 (15%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLS-----VEKQVAIALRRLASGESQV 121
RVSK TF +C +++E +G+L+ + + VAI L LA
Sbjct: 94 LRVSKNTFFKLCRILQE-------------KGQLIKTINVPIAEVVAIFLHILAHNLKYR 140
Query: 122 SVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAI 181
V ++ +S+ + A+ + +K +LK+ D N +++K N GA+
Sbjct: 141 VVHFSYCRSMEIISRQFKNVLRAIMKVSKEYLKFHDYNLEGSVENKRR---WFKNLIGAL 197
Query: 182 DATHIIMTLPAVQTSDDWCDQEN---NYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
D HI P + + D N + S V G+ ++RFI ++ GW G SR+L+
Sbjct: 198 DGIHI----PVIVVAKDRLRYRNRKWDISTNVLGVCGTDLRFIYVLHGWEGSAGDSRVLR 253
>gi|53370693|gb|AAU89188.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
gi|108710091|gb|ABF97886.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 641
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 63/140 (45%), Gaps = 6/140 (4%)
Query: 104 EKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALE--ERAKHHLKWPDSNRM 161
+K +A+ LR+ E+ SV A + ++ V+ F L R H + P
Sbjct: 400 KKIMALWLRKQKMMEAAQSVRQADNCLRRSLDTVSRTFNRVLTCLLRLAHDIIVPKDPTF 459
Query: 162 EEIKSKFEESF---GLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEM 218
E+ E +C GAID TH+ + +P + + ++ N S V + D +M
Sbjct: 460 SEVHPNLENPAFWPHFNDCIGAIDGTHVNVVVPKSKRVP-YLNRHNETSQNVLAVCDFDM 518
Query: 219 RFIDIVTGWPGGMNVSRLLK 238
RF +++GWPG + R+ K
Sbjct: 519 RFTFVLSGWPGSAHDMRVFK 538
>gi|242043358|ref|XP_002459550.1| hypothetical protein SORBIDRAFT_02g006490 [Sorghum bicolor]
gi|241922927|gb|EER96071.1| hypothetical protein SORBIDRAFT_02g006490 [Sorghum bicolor]
Length = 396
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 126/316 (39%), Gaps = 36/316 (11%)
Query: 48 SSTPGATIPSDEEEGF----KYFFRVSKKTFDYICSLVR-EDLVSRPPSGLINIEGRLLS 102
SS PGA + EG K FR+ + F I + +R E+L+ + R +
Sbjct: 44 SSLPGAKKVKEILEGHENWCKVGFRMETEIFRTIANFLRAENLLH---------DTRGMK 94
Query: 103 VEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD-SNRM 161
+E+Q+ + + L+ S + F TV + + + + ++ P+ SN
Sbjct: 95 IEEQLGLFMFMLSHNASTDRLKKEFQHSGETVHRKINEVFDIIPALIQRFIRLPNPSNTH 154
Query: 162 EEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFI 221
+I NC GAID TH+ +T+ + S + +++ S V D +++F
Sbjct: 155 VKITCDPRFMPFFQNCIGAIDGTHVPITIGQDKAS-PYRNRKGTLSQNVMFACDFDLKFT 213
Query: 222 DIVTGWPGGMNVSRLLKFS---------GFFKLCEAGQRLNGNVRISSEEVELREYIVGG 272
I +GW G + + +L+ + G F L + G ++ L Y G
Sbjct: 214 FISSGWEGSTSDAGVLRSALGKGFTVPDGKFYLVDGG--------YANTPSFLAPY--RG 263
Query: 273 VGYPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKR 332
V Y L + N + FN H R+ +AF LK + IL P +
Sbjct: 264 VKYHLSEFRRRGQRGNAYANYKELFNHRHAILRNHIERAFGVLKKRFPILKVGTHYPIES 323
Query: 333 KLPSIILVCCLLHNII 348
++ I + HNII
Sbjct: 324 QV-KIPAAAAVFHNII 338
>gi|390595323|gb|EIN04729.1| hypothetical protein PUNSTDRAFT_24189, partial [Punctularia
strigosozonata HHB-11173 SS5]
Length = 301
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 98/240 (40%), Gaps = 39/240 (16%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
+ F+ RVS FD I + + D + S + VE+Q+AI L R G +
Sbjct: 86 DHFRNDLRVSPAVFDRIVAEIEHDPIFSNHSHHAQ-----MPVEEQLAIVLFRF--GHNG 138
Query: 121 VSVGVA-----FGVGQSTVSQVTWRFIEALEERA--KHHLKWPDSNRMEEIKSKFEESFG 173
S GV+ GVG+ TV T R + A+ R+ ++ P + EE K E
Sbjct: 139 NSAGVSRIAKWAGVGKGTVENATRRVMTAVLRRSFMDQAVRLPTTVEKEEAKRWVER--- 195
Query: 174 LLNCCGA-------IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
C +D T + + + + D++ NYS+ +Q I ++ ID G
Sbjct: 196 --KSCDEWRDGWLFVDGTLVPLYDRPHWYGESYYDRKCNYSLNIQVISLPNLQIIDFGYG 253
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
+ G + S K + ++ E +E +E+I YP+ W+I PY+
Sbjct: 254 FTGSTHDSTAWKETRVYREHEMI-------------LEGQEWIWADSAYPVKHWVIAPYK 300
>gi|242063682|ref|XP_002453130.1| hypothetical protein SORBIDRAFT_04g000610 [Sorghum bicolor]
gi|241932961|gb|EES06106.1| hypothetical protein SORBIDRAFT_04g000610 [Sorghum bicolor]
Length = 317
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/178 (24%), Positives = 81/178 (45%), Gaps = 12/178 (6%)
Query: 65 YFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVG 124
Y RV+ + YI DL + + ++ R+ +VE+++ L L+ S +
Sbjct: 10 YLMRVAFRMEPYIF----RDLATYLRRKRLVVDTRI-TVEEKLGFFLYMLSHNASYEDLQ 64
Query: 125 VAFGVGQSTVSQVTWRFIEA-LEERAKHHLKWPDSNRMEEI---KSKFEESFGLLNCCGA 180
V FG T F + + ++ L+ PD N++ + +F F NC GA
Sbjct: 65 VFFGHSNDTFHHHINHFFKVVIPTLSRRFLQAPDPNQVHQKIQDNPRFYPFFK--NCLGA 122
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
ID THI +++ A + + + +++ S+ V D ++ I +GW G SR+L+
Sbjct: 123 IDGTHIPISI-ASEKASPFRNRKGTLSINVMVACDFDLNITFISSGWEGSATDSRVLR 179
>gi|9294258|dbj|BAB02160.1| unnamed protein product [Arabidopsis thaliana]
gi|15292885|gb|AAK92813.1| unknown protein [Arabidopsis thaliana]
gi|21280913|gb|AAM45005.1| unknown protein [Arabidopsis thaliana]
Length = 415
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/317 (20%), Positives = 125/317 (39%), Gaps = 23/317 (7%)
Query: 40 WPSFWAKNSSTPGATIPSDEEEGFKYFFRVSKKTFDYICSLVREDLV----SRPPSGLIN 95
W +++ N+ P T F+ FR+ K F I + +L+ R +G
Sbjct: 60 WNDYFSDNAIFPLQT--------FRRRFRMKKPLFLRIVDRLSSELMFFQHRRDATGRFG 111
Query: 96 IEGRLLSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLK 154
S ++ A+R LA G + +V +G++T F + + +L+
Sbjct: 112 -----HSPIQKCTAAIRLLAYGYASDAVDEYLRMGETTAMSCLENFTKGIISFFGDEYLR 166
Query: 155 WPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIV 214
P + + + + + G G++D H + +++++ I
Sbjct: 167 APTATNLRRLLN-IGKIRGFPGMIGSLDCMHWEWKNCPTAWKGQYTRGSGKPTIVLEAIA 225
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
++ + G PG +N +L S F G+ N +++ E L Y+ G+
Sbjct: 226 SQDLWIWHVFFGPPGTLNDINILDRSPIFDDILQGRAPNVKYKVNGREYHLAYYLTDGI- 284
Query: 275 YPLLSWLITPYET-NGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRK 333
YP + I A++ F + EA R +AF L+ + I+ DK K
Sbjct: 285 YPKWATFIQSIRLPQNRKATL--FATHQEADRKDVERAFGVLQARFHIIKNPALVWDKEK 342
Query: 334 LPSIILVCCLLHNIIID 350
+ +I+ C +LHN+I++
Sbjct: 343 IGNIMKACIILHNMIVE 359
>gi|397625395|gb|EJK67778.1| hypothetical protein THAOC_11144 [Thalassiosira oceanica]
Length = 347
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 58/256 (22%), Positives = 101/256 (39%), Gaps = 23/256 (8%)
Query: 109 IALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKF 168
+ALR A G+ + GV V W ++A+ + + +K+P++ E + +
Sbjct: 1 MALRYCAGGDP-TDIADIHGVKDDEVLNSLWDVVDAIHQSPELDIKFPET---EADQRRV 56
Query: 169 EESFGL-----LNCC-GAIDATHIIMTLPA--------VQTSDDWCDQENNYSMLVQGIV 214
E F ++CC GA+D I + P+ + +C ++ + + +QG+
Sbjct: 57 AEGFRRKSQIDIDCCVGAVDGILIWIHKPSSEDEKVIRIGPKKFFCGRKKKFGLNMQGVC 116
Query: 215 DHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVG 274
D F+D+ +PG + S K E L V S + + + G
Sbjct: 117 DARGYFLDVEIRFPGSSSDFYAFDESSLKKKLEQEGFLGTAV---SMDFGGKLCLFGDNA 173
Query: 275 YPLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVM-WRPDKRK 333
Y S++ TP++ S FN H R AF L W +L K + +
Sbjct: 174 YVQSSYMCTPWKAV-TSGPKDAFNFYHSQVRINIECAFGILVHRWGMLRKAIPVGISVSR 232
Query: 334 LPSIILVCCLLHNIII 349
++L C LHN I
Sbjct: 233 TTRLVLALCKLHNFCI 248
>gi|357151740|ref|XP_003575888.1| PREDICTED: uncharacterized protein LOC100838622 [Brachypodium
distachyon]
Length = 488
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 177 CCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRL 236
C GAID +HI + +PA QT + + + S V + D +MRF +V GW G + +R+
Sbjct: 408 CIGAIDRSHIPVIVPASQTVN-YTGRHGYTSQNVLAVCDFDMRFTFVVAGWAGSAHATRI 466
Query: 237 LKFS 240
+S
Sbjct: 467 FNYS 470
>gi|328700501|ref|XP_003241280.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 302
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/253 (21%), Positives = 98/253 (38%), Gaps = 30/253 (11%)
Query: 112 RRLASGESQVSVGVAFGVGQSTVSQVT-------WRFIEALEERAKHHLKWPDSNRMEEI 164
+ LA+G + ++ F G ST++++ W+ ++ L+ + P + I
Sbjct: 3 KYLATGTNFTALHFEFLAGVSTIAKIVQETCDVLWKVLQPLK------MAEPTTKDWLGI 56
Query: 165 KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIV 224
+ + E N GA+D HI + P + + + +N +S+++ I D F I
Sbjct: 57 ANGYYEKTQFPNTVGAVDGKHIRLECPK-NSGSLYYNYKNFFSLILMAICDSNYCFRIID 115
Query: 225 TGWPGGMNVSRLLKFSGFFKLCEAGQRLNGN-VRISSEE--------VELREYIVGGVGY 275
G G + + K S F G++L + V S+ V +V +
Sbjct: 116 VGSYGKESDCNVFKTSTF------GKKLYSDKVNFPSDRCLPGDENGVPQPFVLVADEAF 169
Query: 276 PLLSWLITPYETNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLP 335
L L+ P+ L+ + FN R F L WR+ M D
Sbjct: 170 ALHKHLLRPFPGRTLNNNRRIFNYRLSRARQYIECTFGILSKKWRVFQSSML-VDPNVAV 228
Query: 336 SIILVCCLLHNII 348
+I CC+LHN +
Sbjct: 229 TITKACCVLHNFV 241
>gi|328701623|ref|XP_003241661.1| PREDICTED: putative nuclease HARBI1-like [Acyrthosiphon pisum]
Length = 191
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 89/205 (43%), Gaps = 25/205 (12%)
Query: 127 FGVGQSTVSQVTWRFIEALE--ERAKHHLKWPDS-NRMEEIKSKFEESFGLLNCCGAIDA 183
G+ Q +VS+ + AL E +K+P + N + +I++ + G G ID
Sbjct: 1 MGLSQPSVSRCISEVVTALNLPEIFNAWVKFPKNINELNDIRNG---NTGFPGVVGCIDC 57
Query: 184 THIIMTLPAVQTSDD--------WCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
TH+ +T P+ + + + +++ +S+ VQ I D ++ +++ +PG + +
Sbjct: 58 THVAITPPSTNLNLNENQHPEYIYVNRKGYHSINVQLICDSNLKILNVNALFPGSTHDTH 117
Query: 236 LLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSASMP 295
+ S + + R N N Y++G GY L WL+TP A
Sbjct: 118 VWNNSKVLPILQELHRRNYN----------NFYLLGDSGYALRQWLLTPIPDPSTEAE-E 166
Query: 296 TFNSLHEATRSLAVKAFLQLKGGWR 320
+N +TRS+ + LK +R
Sbjct: 167 NYNRRQMSTRSIIERCNGVLKMRFR 191
>gi|326431558|gb|EGD77128.1| hypothetical protein PTSG_07463 [Salpingoeca sp. ATCC 50818]
Length = 498
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 68/302 (22%), Positives = 120/302 (39%), Gaps = 33/302 (10%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
E F+ FRV+K F + ++ L RP + ++ R++ V L A G S
Sbjct: 50 ETFRQNFRVTKDQFWELLDIISPHLPKRPGT---DVPDRVVLVG-----LLWMFAHGCSL 101
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGA 180
+V + G+ +S S F +A+ + ++ +P ++E + ++ + L C A
Sbjct: 102 QAVQFSIGISKSMCSTHFPAFAKAIID-GLDNISFPTGEELKEEQRRWAQDDFLEGCVTA 160
Query: 181 IDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFS 240
D HI T + W +++ S V + + +V G G + + ++ +
Sbjct: 161 ADGVHIRYTPTTNGHEERWRNRKGFKSQNVLVCASFDRQIQHVVVGCEGSCHDASVVATA 220
Query: 241 --------GFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLSA 292
G F L +A RL ++ + GV Y L ++ G +
Sbjct: 221 DLTSKLPEGSFGLFDAAMRLTHKRWLTP---------IRGVTYHLDTF----KSRGGPQS 267
Query: 293 SMPTFNSLHEATRSLAVKAFLQLKGGWRIL-SKVMWRPDKRKLPSIILVCCLLHNIIIDS 351
FN +H RS F LK +RIL + D +L I+ CC LHN ++
Sbjct: 268 DEEIFNYMHARRRSRVEACFGILKSRFRILYTGFSSCEDTARL--YIVACCALHNFLLRQ 325
Query: 352 GD 353
D
Sbjct: 326 KD 327
>gi|322784090|gb|EFZ11197.1| hypothetical protein SINV_02709 [Solenopsis invicta]
Length = 114
Score = 49.3 bits (116), Expect = 0.003, Method: Composition-based stats.
Identities = 29/91 (31%), Positives = 47/91 (51%), Gaps = 1/91 (1%)
Query: 153 LKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQG 212
+K P+ +EE + GAID THI + LP D+ +++ S+++QG
Sbjct: 17 IKMPNEFECTMTAQVYEERTHIPQLIGAIDGTHIPV-LPPADGYKDFINRKGWPSIILQG 75
Query: 213 IVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
+VDH +RF +I PG + + + K S FF
Sbjct: 76 VVDHTLRFKNINCRAPGSAHDAAVFKESNFF 106
>gi|4585884|gb|AAD25557.1|AC005850_14 Hypothetical protein [Arabidopsis thaliana]
Length = 404
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/193 (22%), Positives = 78/193 (40%), Gaps = 33/193 (17%)
Query: 174 LLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNV 233
++C GA+D TH+ + P+ + + +++ +M + + + M+F G PG +
Sbjct: 174 FIDCIGALDGTHVSVRPPSGDV-ERYRGRKSEATMNILAVCNFSMKFNIAYVGVPGRAHD 232
Query: 234 SRLLKF----SGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNG 289
+++L + F AG+ Y + GYP S + P+
Sbjct: 233 TKVLTYCATHEASFPHPPAGK-----------------YYLVDSGYPTRSGYLGPHRRTR 275
Query: 290 LSASM------PT-----FNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSII 338
+ PT FN H + RS+ + F K WRIL + + + +K I+
Sbjct: 276 YHLELFNRGGPPTNSRELFNRRHSSLRSVIERTFGVWKAKWRILDRKHPKYEVKKWIKIV 335
Query: 339 LVCCLLHNIIIDS 351
LHN I DS
Sbjct: 336 TSTMALHNYIRDS 348
>gi|218193903|gb|EEC76330.1| hypothetical protein OsI_13895 [Oryza sativa Indica Group]
Length = 367
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 64/298 (21%), Positives = 119/298 (39%), Gaps = 12/298 (4%)
Query: 63 FKYFFRVSKKTFDYI--CSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
F+ FR+SK F+ I S + D ++ N +G LS ++ A+R+LA+G
Sbjct: 2 FRRRFRMSKPLFERIVNASSDKYDYFTQRVDA-ANRQG--LSPLQKCTAAIRQLATGSVA 58
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEE-RAKHHLKWPDSNRMEEIKSKFEESFGLLNCCG 179
+ +G++T + F++ + E + +L+ P E + K E G G
Sbjct: 59 DQLDEYLKIGETTAMESMKNFVKGVREIFGERYLRRPSVEDTERL-LKIGEKRGFPGMFG 117
Query: 180 AIDATHIIMT-LPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLK 238
+ID H P + +++++ + H++ G G N +L
Sbjct: 118 SIDCMHWQWERCPLAWKGQFTRGDQKGPTLILEAVASHDLWIWHAFFGAAGSNNDINVLN 177
Query: 239 FSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETN-GLSASMPTF 297
S F GQ ++ + + ++ G+ YP W + ++ +
Sbjct: 178 QSTVFINELKGQAPRVQYMVNGNQHNIGYFLADGI-YP--DWAVFAKSIRLPITEKDKLY 234
Query: 298 NSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSGDQL 355
E R +AF LK W IL + D+ L ++L C +LHN+I++ L
Sbjct: 235 AQEQEGARKDIERAFGVLKRRWCILKRPARLYDRGVLRDVVLACIILHNMIVEDEKDL 292
>gi|321468204|gb|EFX79190.1| hypothetical protein DAPPUDRAFT_104684 [Daphnia pulex]
Length = 270
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 63/138 (45%), Gaps = 15/138 (10%)
Query: 153 LKWPDSNR--MEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLV 210
L W +NR + +F + G + C GAIDAT + P Q + ++ ++ +
Sbjct: 119 LIWCLANREAYRVLSRRFGMNQGTIFCLGAIDATEFEIRQPLNQLPS-YTSRKCKTTIKL 177
Query: 211 QGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIV 270
Q + H++ ID G+PG + +R+L+ S L A + ++ V +I+
Sbjct: 178 QIVSTHDLEIIDAAVGFPGSIGDARVLRLS---PLSRA---------LGAKLVRSNYHIL 225
Query: 271 GGVGYPLLSWLITPYETN 288
G YPL L+ P+ N
Sbjct: 226 GDTAYPLRQHLLVPFRNN 243
>gi|357127116|ref|XP_003565231.1| PREDICTED: uncharacterized protein LOC100832632 [Brachypodium
distachyon]
Length = 657
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 64/291 (21%), Positives = 117/291 (40%), Gaps = 23/291 (7%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+ + F +C ++RE L++ + +S+E+QVA+ L + VG
Sbjct: 67 LRLMRAPFFRLCQVLRE-------RSLLH-DTIHISIEEQVAMFLITVGHNLRNREVGAI 118
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLLNCCGAIDATHI 186
F VS+ + A+ E ++ P +I +C GAID+T I
Sbjct: 119 FNRSGEPVSRYFGLVLHAIGELRDELIRPPSLETPTKIAGNPRWDPYFKDCVGAIDSTQI 178
Query: 187 IMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFFKLC 246
++ + +C ++++ V +D ++RF ++ G + +L + C
Sbjct: 179 QASV-SKNMEAAFCGKKSSAGQNVIAAIDFDLRFTYVLAGCDASAHDDAVLMDA---IEC 234
Query: 247 EAGQRL-NGNVRISSEEVELREYIVG---GVGYPLLSWLITPYETNGLSASMPTFNSLHE 302
E G R+ G + + + GV Y L W P + + FN H
Sbjct: 235 ENGLRVPEGKFYLVDAGYGAKPGFLPPFCGVRYYLNEWGNNPAQD-----ARELFNLRHS 289
Query: 303 ATRSLAVKAFLQLKGGWRILSKVM-WRPDKRKLPSIILVCCLLHNIIIDSG 352
+ R +AF LK ++IL + P + ++I C +LHN I+ G
Sbjct: 290 SLRVTVERAFSSLKRRFKILDDAAPFFPPATQTDAVI-ACVILHNWILSQG 339
>gi|357129889|ref|XP_003566592.1| PREDICTED: putative nuclease HARBI1-like [Brachypodium distachyon]
Length = 333
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 73/159 (45%), Gaps = 13/159 (8%)
Query: 101 LSVEKQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNR 160
+SVE+QV I L +A + ++ F T+ + R +E + +++ P +
Sbjct: 108 VSVEEQVTIFLYAVAKNATNETLQDWFQHSPDTIHRHFKRVLEGITNLTPIYIRPPSLHP 167
Query: 161 MEEI-KSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMR 219
+ K KF F NC GA+ THI M LP Q + + +++ S + +++
Sbjct: 168 HSILRKPKFYPFFK--NCIGAVHGTHIPMKLPLDQ-QEPYRNRKQTISQNAMVACNFDLK 224
Query: 220 FIDIVTGWPGGMNVSRLLKFS---------GFFKLCEAG 249
F+ I GW G + +R+L+ + G F L +AG
Sbjct: 225 FVHINPGWEGSASDARVLQDALNHGFEVPDGKFYLVDAG 263
>gi|393231424|gb|EJD39016.1| hypothetical protein AURDEDRAFT_38042, partial [Auricularia
delicata TFB-10046 SS5]
Length = 289
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 93/236 (39%), Gaps = 31/236 (13%)
Query: 59 EEEGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGE 118
+E+ F+ R S TFD IC+++++D V S + R +Q+AIA+ R
Sbjct: 74 DEKRFRRNIRCSPATFDVICAMLKDDPVFHNESQCPQKDIR-----EQLAIAMYRFGHDG 128
Query: 119 SQVSV-GVAFGVGQST--VSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKFEESFGLL 175
+ SV VA G S V T R + AL + WP EE K EE+
Sbjct: 129 NSCSVEDVAQWAGASAGMVLNATRRVMVALLALHDEAIHWPTPEEKEEAKQWVEET---- 184
Query: 176 NCC------GAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
+C +D T + + + ++ Y++ +Q + ID V G G
Sbjct: 185 SCAEWRDGHSMVDGTLAKLFEKPAFYGETYYSRKAIYALSIQVVNLPNRLIIDYVVGHTG 244
Query: 230 GMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
++ S + S +K Q L + E+I Y SW +TPY
Sbjct: 245 SVHDSTAFEDSRVYK--HHAQLLGED-----------EWIWADSAYATRSWCVTPY 287
>gi|222624856|gb|EEE58988.1| hypothetical protein OsJ_10696 [Oryza sativa Japonica Group]
Length = 684
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 44/97 (45%), Gaps = 12/97 (12%)
Query: 267 EYIVGGVGYPLLSWLITPYETNGL------SASMP-----TFNSLHEATRSLAVKAFLQL 315
+Y + GYP + PY+ A P FN H + R++ ++F L
Sbjct: 537 KYYLVDSGYPNRPGCLAPYKGTKYHLQEYRDAPEPQGKEENFNYAHSSLRNVIERSFGVL 596
Query: 316 KGGWRILSKVMWRPDKRKLPSIILVCCLLHNIIIDSG 352
K WR+L K+ D RK II+ CC LHN I SG
Sbjct: 597 KMKWRMLEKIP-SYDPRKQTQIIIACCALHNFIRKSG 632
>gi|218197838|gb|EEC80265.1| hypothetical protein OsI_22233 [Oryza sativa Indica Group]
Length = 263
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 96/222 (43%), Gaps = 28/222 (12%)
Query: 67 FRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQVSVGVA 126
R+ K F +C+ +R S + E +++E+QVA+ + + S +
Sbjct: 27 LRMRKVVFHKLCAELR--------SRALLEETFHVTIEEQVAMFIHVVGHNWSNRPIAFE 78
Query: 127 FGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS---KFEESFGLLNCCGAIDA 183
F TVS+ ++AL A+ + + ++I S +F F C GA+D
Sbjct: 79 FMRSGETVSRYFNLVLDALCILARDLVCIKSIDTHQKITSSHGRFHPYFQ--GCIGALDG 136
Query: 184 THIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSRLLKFSGFF 243
THI +P + D + ++ S V VD ++RF+ ++ GW G + S +L+
Sbjct: 137 THIPACVP-MHMQDRFRGRKKFQSQNVLAAVDFDLRFLYVLAGWEGSAHDSYVLQ----- 190
Query: 244 KLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPY 285
+A R NG ++I +Y + GY ++ PY
Sbjct: 191 ---DALSRTNG-LKIPEG-----KYFLADAGYAARPGVLPPY 223
>gi|198413941|ref|XP_002119367.1| PREDICTED: hypothetical protein [Ciona intestinalis]
Length = 99
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 156 PDSNRMEEIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVD 215
PD +I ++FE+ G ID +HI +T P +Q D+ ++++ S+++Q +VD
Sbjct: 2 PDKEEALKIAARFEKKSHFPRVMGCIDGSHIAVTAP-LQRKKDYTNRKDWDSLVLQAVVD 60
Query: 216 HEMRFIDIVTGWPGGMNVSRLLKFSGFFK 244
F +I PG + + + K SG FK
Sbjct: 61 DLYCFRNISVKMPGSSHDATVFKNSGLFK 89
>gi|449691722|ref|XP_004212775.1| PREDICTED: uncharacterized protein LOC101239569 [Hydra
magnipapillata]
Length = 164
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 62/130 (47%), Gaps = 3/130 (2%)
Query: 105 KQVAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEAL-EERAKHHLKWP-DSNRME 162
+++ + LR LA+GESQ + F VG++TV + A+ E K L+ P D +
Sbjct: 32 QRLIVTLRYLATGESQQTQSFYFRVGRATVCNIIEETCCAIWEVLIKIFLRAPNDIQEWQ 91
Query: 163 EIKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFID 222
I +F ++ C AID H+ + PA ++ + + + Y M++ I D + F
Sbjct: 92 NISKEFNQNLNFPQCIRAIDRKHVRIEAPA-KSGFSFYNYKGFYCMILLAICDAKYCFTM 150
Query: 223 IVTGWPGGMN 232
+ G G N
Sbjct: 151 VDKGGYGRDN 160
>gi|195128417|ref|XP_002008660.1| GI13616 [Drosophila mojavensis]
gi|193920269|gb|EDW19136.1| GI13616 [Drosophila mojavensis]
Length = 415
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 54/247 (21%), Positives = 102/247 (41%), Gaps = 21/247 (8%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKS 166
+++AL +L + E + F S QV F + + + +KWP+S ++
Sbjct: 141 LSLALWKLTTDEHFEEIARKFQFPWSLCQQVVRSFWHIISDNYESFIKWPNSLEAQQSTL 200
Query: 167 KFEESFGLLNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTG 226
+ ++ L+C + I++L ++ + + + ++++Q I + E + ID
Sbjct: 201 QGFQNLPQLSCFQELFG---IISLKRLEIFLE--SEHADVAVVLQLICNGEQKIIDCYVE 255
Query: 227 WPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYE 286
L+ F+ GQ L N R + Y++G +PL S+LI P E
Sbjct: 256 ----------LEQEYTFEESPIGQTLALNPRT----MPANSYLIGSTCFPLKSYLIRPIE 301
Query: 287 TNGLSASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHN 346
FN+L E L+ L + L + R D ++ I+ C +HN
Sbjct: 302 AECFRKD-SVFNALLEPAYQLSNNVMDSLARRFHTLYALEAR-DLNEVRLIVESICAMHN 359
Query: 347 IIIDSGD 353
+ +D GD
Sbjct: 360 LCVDFGD 366
>gi|331227257|ref|XP_003326297.1| hypothetical protein PGTG_08127 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 937
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 73/168 (43%), Gaps = 8/168 (4%)
Query: 66 FFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLAS---GESQVS 122
F ++ + TF +LV+ L+ + P N Q+A+A+ L S G +
Sbjct: 57 FVQMFRMTFPCFLNLVQ--LIEQNPIFYNNSRNPQRDPPIQIAVAVCCLGSNGNGSAIYR 114
Query: 123 VGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWP-DSNRMEEIKSKFEESFGLLNCCGAI 181
+ F VG T+ T R I A+ + WP S ++E + EE G C G +
Sbjct: 115 LKNLFQVGFGTIDLYTRRVIHAVYGLRSSLVTWPTKSEQIESSQVMREE--GFPGCVGFV 172
Query: 182 DATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPG 229
D T I ++ + + D++ YS+ V I D + I + G+PG
Sbjct: 173 DGTTIPLSQKPPKDGQHYFDRKKRYSISVTVICDINKKIISYLAGFPG 220
>gi|108710333|gb|ABF98128.1| transposon protein, putative, CACTA, En/Spm sub-class [Oryza sativa
Japonica Group]
Length = 627
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 77/186 (41%), Gaps = 22/186 (11%)
Query: 107 VAIALRRLASGESQVSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPD---SNRMEE 163
+AI L L G S F T+S+ + A+ E + ++ D SN E
Sbjct: 401 LAIFLYILGDGSSNQRAQNCFKHSGETISRKIEEVLFAVVELGRDIVRPKDPNFSNVHER 460
Query: 164 IKSKFEESFGLLNCCGAIDATHIIMTLPAVQTSDD---WCDQENNYSMLVQGIVDHEMRF 220
I+ +C GA+D THI+ +P DD + + + + V I DH+MRF
Sbjct: 461 IRKDRRMWPHFKDCIGAVDGTHILAVVP----DDDKIRYIGRSKSTTQNVMAICDHDMRF 516
Query: 221 IDIVTGWPGGMNVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSW 280
G PG M+ + +L F+ L ++ I + + Y+V GYP
Sbjct: 517 TYASIGQPGSMHDTTVL-FNA----------LRTDIDIFPHPPQGKYYLVDA-GYPNRPG 564
Query: 281 LITPYE 286
+ PY+
Sbjct: 565 YLAPYK 570
>gi|297720307|ref|NP_001172515.1| Os01g0693600 [Oryza sativa Japonica Group]
gi|255673580|dbj|BAH91245.1| Os01g0693600 [Oryza sativa Japonica Group]
Length = 807
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 176 NCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGMNVSR 235
+C GAID TH+ + +P + + ++ N S V + D +MRF +++GWPG + R
Sbjct: 689 DCIGAIDGTHVKLVVPKTKRIQ-YLNRYNETSQNVLAVCDFDMRFTFVLSGWPGSAHDMR 747
Query: 236 LLK 238
+ K
Sbjct: 748 VFK 750
>gi|341892413|gb|EGT48348.1| hypothetical protein CAEBREN_17494 [Caenorhabditis brenneri]
Length = 385
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 62/296 (20%), Positives = 120/296 (40%), Gaps = 25/296 (8%)
Query: 61 EGFKYFFRVSKKTFDYICSLVREDLVSRPPSGLINIEGRLLSVEKQVAIALRRLASGESQ 120
E ++F+ V+ ++F+ I LV + E + E ++ I L+ + G +Q
Sbjct: 69 EEIEHFYGVTMRSFNEILPLVTHN------------EKCHYTKEVRLCIFLQYVKEGLTQ 116
Query: 121 VSVGVAFGVGQSTVSQVTWRFIEALEERAKHHLKWPDSNRMEEIKSKF------EESFGL 174
+V G+ Q+++S++ I+ L A +++K+P + EEI + E+S G
Sbjct: 117 NAVSRTIGLSQASISRIVNGIIKDLVNVASNYIKFPQTR--EEIYAATKGFYEKEDSKGR 174
Query: 175 ---LNCCGAIDATHIIMTLPAVQTSDDWCDQENNYSMLVQGIVDHEMRFIDIVTGWPGGM 231
+ C G +D H P S ++ + + +S + D E R I + G
Sbjct: 175 KRRMPCFGILDGKHWKTEHPPHSGSLNY-NYKCFFSFNSLFVTDFEHRIIYLQISENGVN 233
Query: 232 NVSRLLKFSGFFKLCEAGQRLNGNVRISSEEVELREYIVGGVGYPLLSWLITPYETNGLS 291
N ++LL ++ + + G + +I+ G+ L ++ PY L
Sbjct: 234 NDAQLLAAGPLKEMLKNAAEVGGCTLMPDSTYAFSPFILADNGFGLDKTVMLPYRQEKLD 293
Query: 292 ASMPTFNSLHEATRSLAVKAFLQLKGGWRILSKVMWRPDKRKLPSIILVCCLLHNI 347
FN ATR F L ++ + + P +I+ ++HNI
Sbjct: 294 KVNIKFNEKVSATRVKIENLFGVLTSKFQCFDRNLKLPPTAA-RGLIVALSVIHNI 348
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.405
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,393,570,115
Number of Sequences: 23463169
Number of extensions: 265085668
Number of successful extensions: 748032
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 395
Number of HSP's successfully gapped in prelim test: 902
Number of HSP's that attempted gapping in prelim test: 745366
Number of HSP's gapped (non-prelim): 1422
length of query: 402
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 257
effective length of database: 8,957,035,862
effective search space: 2301958216534
effective search space used: 2301958216534
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)