BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy14859
(1097 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|357630297|gb|EHJ78517.1| putative transposon Ty3-I Gag-Pol polyprotein [Danaus plexippus]
Length = 500
Score = 272 bits (696), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 148/353 (41%), Positives = 213/353 (60%), Gaps = 18/353 (5%)
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLA 637
+L +TCLPFGL P+ FAS++NW+A LLR+ G+R VVYLDDFL NQ L+ A
Sbjct: 58 LLQITCLPFGLIPVPRTFASVTNWIAELLRNHGIRCVVYLDDFLRANQSKSALQNDIAGA 117
Query: 638 VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT 697
+ ++ +LGW++N QKS L+P L+FLGI WD + L K LTL L L SK
Sbjct: 118 LKMMRTLGWMINFQKSVLAPTQCLEFLGITWDTKRNTKSLSGQKCLTLRKALYLLKQSK- 176
Query: 698 WNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH-LTPINPAVLPKLEWWL 756
W+L +S++G L FASFV GRLH R +Q + L PH I V P+LEWWL
Sbjct: 177 WSLRQYQSIMGRLKFASFVTRRGRLHCRTLQYYSRQLPKTHPHRRVSIPQPVQPELEWWL 236
Query: 757 NALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQA 816
+ S PI Q+ + ++T+AS+ GWG+Q++ +S W++ A
Sbjct: 237 EEIGGSMPIQIPQLTNLLTTNASNTGWGAQLNEISISRTWTKP----------------A 280
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
+ L+ LQ+S +++Q+DN+TVVSY+ ++GGT+SL LL + ++ + +H++AQ+I
Sbjct: 281 IQLDQDGLQNSQILLQTDNRTVVSYINKEGGTQSLKLLEQTRRLLSVLDKVNMHLIAQYI 340
Query: 877 PGAYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNH 929
PG YN D+LSR K+ P+WHL AT +IF WG P ID FAS+ + VV +
Sbjct: 341 PGRYNVEVDALSRQKACPEWHLITEATTKIFQMWGCPEIDFFASKTAHVVRTY 393
>gi|301619133|ref|XP_002938952.1| PREDICTED: transposon Ty3-G Gag-Pol polyprotein-like [Xenopus
(Silurana) tropicalis]
Length = 707
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 176/519 (33%), Positives = 266/519 (51%), Gaps = 15/519 (2%)
Query: 422 VGGRLRRFVDAWIR-LGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHI 480
VGGRL FV W + P L + GY+IPF+ KPP + + P L I
Sbjct: 75 VGGRLHNFVQTWQNSITDPWVLNILEHGYSIPFAKKPPEHRFVTSSIPSDPNKQQALLSI 134
Query: 481 -QEMLETGVLKRLDSTT---GFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHF 536
Q++L V+ R+ GF S +FLV K +GG RPVLNL LN+F+ ++F + +
Sbjct: 135 IQDLLNNKVISRVPQEYRFHGFYSNIFLVAKKDGGFRPVLNLHPLNKFVRYERFKMESLP 194
Query: 537 RIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
I L +M ID+ AY H+PI HQRFL + LPFGL +AP+ F
Sbjct: 195 SIIRSLSPNLFMSKIDIKDAYLHIPINAFHQRFLRFAIGQSHFQFQALPFGLTSAPRVFT 254
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
+ + ++LR +G+ V YLDD ++ Q + + + L GWI+N +KS LS
Sbjct: 255 KVLGALLAVLRLQGVHVTAYLDDLIVTAQSEKEANSHTRECLHTLRQHGWIINRKKSLLS 314
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
P L+FLG+ + +++LP K +TL + + + + LLG ++ +
Sbjct: 315 PTQALEFLGMQINTVDRKVFLPLHKAITLQQMAQNIRWQSQTSAHDILRLLGLMAASIEA 374
Query: 717 IPMGRLHSRRIQRQASLLRLGAPH------LTPINPAVLPKLEWWLNALPLS-SPIFPRQ 769
+P + H R +Q + L+L + ++ V L WW++ L+ +
Sbjct: 375 VPFAKFHLRTLQWE--FLKLWDKNHQDLSQKINLSSKVQLSLSWWIHLPNLTQGKSWDCP 432
Query: 770 VQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV 829
VQ ++TDAS +GWG+ G WSR++ HIN E+ AV AL ++ V
Sbjct: 433 VQEIVTTDASRVGWGATWPPKVCQGTWSRQELKLHINALELKAVFYALLHWQTCMKGKHV 492
Query: 830 MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+QSDN T V+YL RQGGT+S S L EV +I ++ ++ + A FIPG N AD LSR
Sbjct: 493 RIQSDNSTTVAYLNRQGGTRSASALREVSRIMTWAETHQVLLSAVFIPGIQNWEADYLSR 552
Query: 890 SKSLP-DWHLSRSATEQIFLKWGVPCIDLFASRVSAVVP 927
+ P +W L +QI KWG+PC+D+ ASR ++ +P
Sbjct: 553 TTLDPGEWKLKPEIFQQIVKKWGLPCLDIMASRFNSQIP 591
>gi|327286446|ref|XP_003227941.1| PREDICTED: hypothetical protein LOC100566709 [Anolis carolinensis]
Length = 1049
Score = 268 bits (686), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 175/525 (33%), Positives = 281/525 (53%), Gaps = 22/525 (4%)
Query: 424 GRLRRFVDAWIRLGAPAPLVRIVS-GYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQE 482
RL F+ AW ++ + A ++ I+ GYAI F + P L T S + +
Sbjct: 18 NRLGPFIGAWRQITSDAWVLNIIERGYAIEFESLPRTGTL-----RPTRPSQCLRDEVTT 72
Query: 483 MLETGVLKRLD---STTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ E G + + + F SR FL+ K GG RP+L+L+ +N+ + ++F ++ I
Sbjct: 73 LFEKGAISKFSLERAHKCFFSRYFLIKKKGGGLRPILDLRAVNRHIKARRFRMVTLATIL 132
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
FL+KG + ++DL AYFH+ ++ +H+RFL+ + + LPFGLATAP+ F
Sbjct: 133 PFLRKGAWFATVDLRDAYFHISVRRSHRRFLSFLIGDVIYSFNVLPFGLATAPRVFTKCM 192
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
+ VA+ LR RG+ + Y+DD+LLV+ LE +S L LG I+N +KS L P+
Sbjct: 193 SVVAAALRQRGITIFPYIDDWLLVSDSRPQLEFDVSFTLSFLQGLGLIINEEKSHLHPSQ 252
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
+QF+G + D +R +LPE++ ++ + L S + +S+LG+++ + ++ +
Sbjct: 253 TIQFIGALMDSIAERAYLPEERFRSIRASISQLRMSGQASAWHVQSILGHMASTTSLVDL 312
Query: 720 GRLHSRRIQRQASLLRLGAP----HLTPINP--AVLPKLEWWL--NALPLSSPIFPRQVQ 771
RL R + Q L++ P T + P +VL LEWWL + L P P
Sbjct: 313 ARLRMRPL--QFWFLKVFNPLFDSQRTLLRPPASVLESLEWWLKRHNLLKGLPFHPPTPS 370
Query: 772 HFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMV 831
++TDAS GWG+ ++ ++G WS + + HIN E+ AV +AL LL+ + V +
Sbjct: 371 LELTTDASQDGWGAHLNGMTINGRWSAQHRTLHINLLELLAVERALHAFDRLLRGNTVRL 430
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR-S 890
+DN TV YL +QGGT S LL +I+ D RI++ A +PG N++AD+LSR +
Sbjct: 431 VTDNTTVKFYLNKQGGTHSRLLLQTSMRIWDWCVDRRINLQAVHLPGKDNALADALSRTT 490
Query: 891 KSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNHFQVSRH 935
S +W L+ + +WG P IDLFAS + PN +RH
Sbjct: 491 TSNHEWQLNNKEFRLLARRWGWPAIDLFASPENTHCPNF--CARH 533
>gi|301618694|ref|XP_002938748.1| PREDICTED: hypothetical protein LOC100127807 [Xenopus (Silurana)
tropicalis]
Length = 4048
Score = 263 bits (673), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 193/612 (31%), Positives = 290/612 (47%), Gaps = 23/612 (3%)
Query: 328 FSDESVFKNVSDHLLQYVCGKRAECLESRRRLVEPRDPHLASLLLRARRGKKSSSPQNLE 387
F +F D ++ G + + + R PR L RG +SP
Sbjct: 1651 FQGSKLFGEELDKIISQATGGKTKAFATTR-ATRPR------LTSHHSRG---ASPTRES 1700
Query: 388 PPGRVSLKVQTLQKPQRCSSPVNPPADSRI-GAELVGGRLRRFVDAWIRLGAPAPLVRIV 446
PPG + +L + + P P A +GGRLR F D W A +V+ V
Sbjct: 1701 PPGSPA----SLNPSHQSTRPPTPDYTRETPEAGSIGGRLRFFADVWRLHVEDAWVVQTV 1756
Query: 447 -SGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTT---GFLSRL 502
+GY + F PP S P A I+++ +TGV+ + GF S L
Sbjct: 1757 ATGYRLEFHETPPAHFFMSRVPHQAPKQQAFLSIIEKLRKTGVIVPVPQNQRFRGFYSNL 1816
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPI 562
F+VPK +G RP+L+LK LN+++ KF + + I L+ GD++ S+D+ AY HVPI
Sbjct: 1817 FIVPKKDGSFRPILDLKLLNRWIVYHKFKMESVRTIIRALEPGDFLASLDIRDAYLHVPI 1876
Query: 563 KTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
HQ++L ++ LPFGL++AP+ F + +A+ LR RG+ ++ YLDD L+
Sbjct: 1877 FQPHQQYLRFAFRNQHFQFIALPFGLSSAPRIFTKIMASMAAFLRVRGVFIMPYLDDLLI 1936
Query: 623 VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQ 682
+ + E +L + L GW +NL KSSLSP+ + FLG+ + + +++LP DKQ
Sbjct: 1937 KARSKTLAEHNVQLTIQSLRMFGWSINLDKSSLSPSQNMIFLGLQFQTDIQKVFLPRDKQ 1996
Query: 683 LTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLL--RLGAPH 740
L + R L A+ + +LG + + + H R +Q L R
Sbjct: 1997 LKIQRSTRILRATARPTIQMCMRVLGLMVSTMEAVSFAQFHLRPLQTAVLKLWNRTSLHQ 2056
Query: 741 LTPINPAVLPKLEWWLNALPLS-SPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSRE 799
+ L L WWL L+ F V ++TDAS GWG+ GLW++E
Sbjct: 2057 RITLPEDTLRSLSWWLTPERLTQGKTFLEPVWLIVTTDASLTGWGATFQGKAAQGLWTQE 2116
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
+ IN E+ A+ AL L++ V +QSDN T V+Y+ RQGGT+S EV
Sbjct: 2117 EALLPINILELRAILLALQSWERFLRNQAVRIQSDNATAVAYINRQGGTRSNRANQEVSF 2176
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP-DWHLSRSATEQIFLKWGVPCIDLF 918
I ++ + A IPG N AD LSR P +W L + A + +WG+P IDL
Sbjct: 2177 ILEWAERTATLLSAIHIPGVSNVEADFLSRHHLDPGEWQLHQDAFLCLTRRWGMPEIDLM 2236
Query: 919 ASRVSAVVPNHF 930
ASR + VP +
Sbjct: 2237 ASRHNRRVPRFY 2248
Score = 228 bits (582), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 160/506 (31%), Positives = 240/506 (47%), Gaps = 64/506 (12%)
Query: 422 VGGRLRRFVDAWIR-LGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHI 480
VGG+L+ F + W R + P + I SGY + PP S + P+ A L I
Sbjct: 654 VGGKLQHFTETWARNIADPWVVETISSGYKLELRRLPPTRFFMS-RVPKEPIKRAAFLSI 712
Query: 481 -QEMLETGVLKRL---DSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHF 536
+E+L + V+ + TGF S LF+VPK G RPVL+LK LN+++ ++F + +
Sbjct: 713 VEELLHSNVIIPVPPSQQFTGFYSNLFIVPKKKGTFRPVLDLKHLNKWIVYRRFKMESVR 772
Query: 537 RIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
+ ++ G+++ S+D+ AY HVPI HQ +L ++ G L T LPFGL++AP+ F
Sbjct: 773 SVIRAMEPGEFLTSLDMKDAYLHVPIFPPHQAYLRFAFQGQHLQFTALPFGLSSAPRIFT 832
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
+ + +A+ LR +G+ + YLDD L+ + E + L GW +N QKS L
Sbjct: 833 KIMSTMAAHLRVQGVCITPYLDDLLIKARSSHQAERDLTQTMQTLQEFGWTINRQKSFLI 892
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
P+ + FLG ++D H G +L
Sbjct: 893 PSQRMPFLGFIFDTHQ-------------GRVL--------------------------- 912
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIST 776
L ++Q+ SL++ + P + + L P + I+T
Sbjct: 913 -----LPEEKVQKLISLVQ-------ELKTTQRPSIRHCMKGRSLEEPRW-----QVITT 955
Query: 777 DASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQ 836
DAS GWG+ + GLWS + IN E+ A+ +A+ L V +QSDN
Sbjct: 956 DASLSGWGATFKTQIAQGLWSESEGTLPINILEIRAIFRAVVHWEEQLVDQDVRIQSDNA 1015
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP-D 895
T V+YL RQGGTKS++ SE+ KIF ++ I A IPG N AD LSR P +
Sbjct: 1016 TAVAYLNRQGGTKSVAAASEISKIFRWAETRVTQISAVHIPGVVNWEADFLSRHYVDPTE 1075
Query: 896 WHLSRSATEQIFLKWGVPCIDLFASR 921
W L+ + I KWG P +DL ASR
Sbjct: 1076 WELNTEVFDYITTKWGQPDLDLMASR 1101
>gi|301612402|ref|XP_002935710.1| PREDICTED: transposon Ty3-I Gag-Pol polyprotein-like, partial
[Xenopus (Silurana) tropicalis]
Length = 683
Score = 249 bits (636), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 179/531 (33%), Positives = 268/531 (50%), Gaps = 23/531 (4%)
Query: 408 PVNPPADSRIGAEL---VGGRLRRFVDAWIRLGAPAPLVR--IVSGYAIPFSAKPPLVPL 462
P +PP + E VGGRL F D W+RL A P V I SGY + F ++PP
Sbjct: 48 PTSPPRHDYLPQESSTPVGGRLHLFRDEWLRLTA-DPWVHDIISSGYRLEFVSRPPNRFF 106
Query: 463 CSLQHLATPVSSAMSLHIQEMLETGVLKRL---DSTTGFLSRLFLVPKGNGGTRPVLNLK 519
S + +A IQ++L+ V+ + + GF S LF+VPK +G RPVL+LK
Sbjct: 107 MSRLPPDSNKQNAFLSTIQDLLDERVIVPVPSGEKYRGFYSNLFIVPKKDGSFRPVLDLK 166
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN F+ +F + + + S + ++++++D+ AY HVPI H +FL +
Sbjct: 167 HLNAFIRFSRFKMESLRSVISAMNPNEFLVALDIKDAYLHVPIFPPHWKFLRFALKNQHF 226
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
T LPFGL +AP+ F + + A+ LRSRG+ + YLDD LL Q L +
Sbjct: 227 QFTALPFGLTSAPRIFTKIMSAAAASLRSRGVSITPYLDDLLLKAPSLPAATSQLSLVMD 286
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L +LGW +N KS L+P+ + FLG+++D R+ LP +K + +++R LL + +
Sbjct: 287 FLTALGWKINTAKSRLTPSQRMPFLGMVFDTTEQRVLLPPEKITRIQSLVRQLLHNPQPS 346
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQ----RQASLLRLGAPHLTPINPAVLPKLEWW 755
+ A +LG L + +P + H R +Q Q + L P + P + WW
Sbjct: 347 VRLAMQVLGSLVSSIEAVPFAQFHLRALQWNILDQWNRSSLSQP--IKLLPKTRVAMTWW 404
Query: 756 LNALPLSSPIFPRQVQH----FISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMF 811
LN+ L R +Q ++TDAS GWG+ + G W+ + IN E+
Sbjct: 405 LNSTHLEK---GRSLQEPKWLILTTDASLQGWGAVMGHLTAQGTWTAAETRLPINILEIR 461
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI 871
AV AL L + VQSDN T V+YL QGGT+S L EV +I ++ + +
Sbjct: 462 AVRLALCHWQNRLTGCDIKVQSDNATTVAYLNHQGGTRSRQALKEVSRILTWAEAREVRL 521
Query: 872 LAQFIPGAYNSVADSLSRSKSLP-DWHLSRSATEQIFLKWGVPCIDLFASR 921
A +IPG N AD LSR + P +W L+ + I WG+P +DL ASR
Sbjct: 522 SAIYIPGLENWQADYLSRQRLDPGEWALNPGIFQDIVALWGLPEVDLMASR 572
>gi|270017202|gb|EFA13648.1| hypothetical protein TcasGA2_TC015886 [Tribolium castaneum]
Length = 872
Score = 240 bits (613), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 150/505 (29%), Positives = 256/505 (50%), Gaps = 8/505 (1%)
Query: 424 GRLRRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
GRL+ F D W ++ + ++ GYAIPF KP + + S + + + ++
Sbjct: 255 GRLKFFTDQWRQITSDPTILSWTQGYAIPFHQKPYQERPPKERDWSFKEKSILQVQLNKL 314
Query: 484 LETGVLKRLDST-TGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFL 542
L+ G +++ + F+S +FLVPK NG +R +LNLK LN F+ F + +H + L
Sbjct: 315 LDCGAIRQCTAEPKQFVSNVFLVPKKNGASRLILNLKQLNHFVETTHFKIEDHKVVCKLL 374
Query: 543 QKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV 602
+ +M IDL AY +PI+ H+++L ++ G + TC+PFGL+TAP F L +
Sbjct: 375 SRNCFMAVIDLKDAYHLIPIQKCHRKYLRFTFLGRLYEYTCMPFGLSTAPYVFTKLMKPL 434
Query: 603 ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQ 662
+ LRS + V+YLDDFLL++ + +L LG+++N +KS L+P ++
Sbjct: 435 VAYLRSHNLLSVLYLDDFLLMDNSYLQSLHNISMTCKMLEGLGFLINYEKSQLTPNQTVR 494
Query: 663 FLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRL 722
+LG ++D + LP DKQ + + + + ++ ++G L A I G+L
Sbjct: 495 YLGFIYDSSNMTVRLPLDKQQCITKLAKKVKRQSQCSVREFAKMIGTLVAACPAISYGQL 554
Query: 723 HSRRIQRQASLLRLGAPH-----LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTD 777
+++ ++R A L L H + I V L WW + + P + I +D
Sbjct: 555 YTKSLER-AKYLALKNTHGNYSQIMFIPQYVREDLNWWEVHISSRKSLLPPKFVLEIFSD 613
Query: 778 ASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQT 837
AS GWG G W+ ++++ HIN E+ A L L V+++ DN T
Sbjct: 614 ASLSGWGIFCGGESTHGHWNEKERSKHINFLELLAASFGLKCFAKNLSGCCVLLRIDNTT 673
Query: 838 VVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS-LPDW 896
V+Y+ R GG K L + ++I+ ++ ++ I A +I ++N+ AD SR S ++
Sbjct: 674 AVAYINRMGGVKHPHLHALAKEIWQWCEERKLWIFASYIKSSHNTEADWESRRLSPETEF 733
Query: 897 HLSRSATEQIFLKWGVPCIDLFASR 921
L+ A +I + +P +DLFASR
Sbjct: 734 ELAPYAFRKICTFFQIPEVDLFASR 758
>gi|348525970|ref|XP_003450494.1| PREDICTED: hypothetical protein LOC100698500 [Oreochromis
niloticus]
Length = 1418
Score = 229 bits (583), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 164/482 (34%), Positives = 242/482 (50%), Gaps = 17/482 (3%)
Query: 470 TPVSSAMSLHIQEMLETGVLKRL-------DSTTGFLSRLFLVPKGNGGTRPVLNLKGLN 522
T V + ++ ++E ++ + KR ++ G+ SR F+VPK GG RP+L+L+ LN
Sbjct: 207 TTVHTEAAMILREEVKALLQKRAIRVVPASETDKGWYSRYFVVPKRGGGLRPILDLRVLN 266
Query: 523 QFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
+L +F ++ ++ + + GD+ +IDL+ AYFHV I H++FL ++ G
Sbjct: 267 TYLRTYRFKMLTLRQLLNAVGPGDWFATIDLTDAYFHVAIHPKHRQFLRFAFEGVAYEYL 326
Query: 583 CLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILG 642
LPFGL+ AP+ F + + LR RG+R++ YLDD+ LV E Q L +S +
Sbjct: 327 VLPFGLSLAPRTFTKCAEAALAPLRERGIRILAYLDDWALVACSREQAETQLSLVLSHIQ 386
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDS 702
+LG+ VN QKSSL P+ + FLG+ R L E + L + +
Sbjct: 387 TLGFSVNFQKSSLIPSQQISFLGLEICSLSSRARLSEHRVAAFHRCLAQFQLGRRLRFQT 446
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLT---PINPAVLPKLEWWLN- 757
LLG ++ V+P+G L R QR L A HL P+ + + L W
Sbjct: 447 ILRLLGMMASMIAVVPLGLLKMRAFQRWTLSHHLCASRHLRRRLPVTASCMLALRPWREP 506
Query: 758 -ALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQA 816
L S I +STDAS GWG+ + + G+WS Q+ HIN E+ V A
Sbjct: 507 RLLHQGSRIGRVLFCKVVSTDASLRGWGALCKGASVRGIWSTAQRQLHINHLELLTVFLA 566
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
L P+L+ V+V++DN TVVSY+ RQGGT+SL LL + L +H L+ +
Sbjct: 567 LKHFHPVLEGQHVLVRTDNSTVVSYINRQGGTRSLPLLKLSRSLLLWCS---VHFLSTHV 623
Query: 877 PGAYNSVADSLSRSKSLP-DWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNHFQVSRH 935
P N D LSR L +W L QI+ +G P IDLFASRV+ P F + H
Sbjct: 624 PCHLNLGPDLLSRGGPLVREWRLHPLIVAQIWDLFGKPQIDLFASRVNTHCPLFFSIIDH 683
Query: 936 VA 937
A
Sbjct: 684 DA 685
>gi|384497823|gb|EIE88314.1| hypothetical protein RO3G_13025 [Rhizopus delemar RA 99-880]
Length = 1062
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 143/469 (30%), Positives = 235/469 (50%), Gaps = 27/469 (5%)
Query: 480 IQEMLETGVLK-----RLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
IQ++L ++ + +T GF S +F++PK +GG RPV NLK LNQ+L F +
Sbjct: 338 IQDLLSKQAIEPVSDVEVRTTPGFYSSMFVIPKKDGGIRPVFNLKRLNQYLDAPHFKMET 397
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + DY++SIDLS A+ H+ + +RFL L + V FGL+T+P
Sbjct: 398 IREVALMINPNDYLVSIDLSDAFLHIGLHPESRRFLRLKWKDQVYQYCTTAFGLSTSPFV 457
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F+ + + RS+G R+ YLDD++L ++ Q + V++L LGW++N +KS
Sbjct: 458 FSKVCRPILEHFRSQGYRISAYLDDWILAANTKQLAIQQAQTVVNLLQQLGWLINFKKSV 517
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR--------SL 706
L+P L+ LG + + LP K LR L S LD R S+
Sbjct: 518 LTPTQQLKHLGFVLNTKTMTASLPMKK-------LRDLRRSIKQILDHPRRQTPRVIHSV 570
Query: 707 LGYLSFASFVIPMGRLHSRRI---QRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL-- 761
+ +F I RL++RR+ + Q + H ++ +L+WW N L L
Sbjct: 571 TMRIQATTFAIFPARLYTRRLLYHKNQTVHMDKDWDHPVSLDQESQQELQWWYNNLKLWN 630
Query: 762 SSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
P + DAS+ GWG + G W+ E+ IN +E+ A + AL
Sbjct: 631 GRSFLPTTPSETVYVDASNTGWGCSWRNHRTHGYWTPEEAQQSINWRELKAAYLALQ-TF 689
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYN 881
P L+++ V++++DN T ++Y+ +QGGT+SL L++ +++ I + AQ+I G +N
Sbjct: 690 PTLRNTTVLIRTDNTTSMTYINKQGGTRSLPLMTLATQVWTWCLKNNIMLQAQYIQGIHN 749
Query: 882 SVADSLSRSKSLPD-WHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNH 929
VAD SR + + W + + +QI WG +DLFA R + ++P +
Sbjct: 750 KVADFESRRQYFKNLWMIKPAIFQQINRMWGPYSVDLFADRTTRLLPKY 798
>gi|170819724|gb|ACB38666.1| reverse transcriptase [Daphnia pulex]
Length = 757
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 159/519 (30%), Positives = 257/519 (49%), Gaps = 14/519 (2%)
Query: 420 ELVGGRLRRFVDAWIRLGAPAPLVRIVS-GYAIPFSAKPPLVPLCSLQHLATPVSSAMSL 478
E VG RL F D W + ++ +S G + F P + ++ +
Sbjct: 84 EKVGARLLFFADRWKDITDDLWILEGLSEGVKLDFVNCPVQRSVPGPVAMSREMKKVCDT 143
Query: 479 HIQEML-ETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFR 537
++E+L + +++ D + GF+ LF++PK +GG RP++NLK LNQF+ + F + N
Sbjct: 144 EVKELLAKQAIVEVTDGSHGFICSLFVIPKKSGGFRPIVNLKPLNQFIQYEHFKMENLDS 203
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
L+KGD+M+ +DL AY +P+ +HQ+FL + G + CL FGLA AP+ F
Sbjct: 204 ARFLLRKGDWMVKLDLKDAYLTIPVHPSHQKFLRFKWKGRIFQFKCLAFGLAPAPRIFTK 263
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V LR +GMR+++YLDD L++N+ K V +L LG+++N +K+ P
Sbjct: 264 ILKAVMGFLRKQGMRLIIYLDDILILNRSREGAAKDFKQVVDLLLQLGFLINWEKTVADP 323
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
A L++LG+M D LP K L + ++ + LA + +L S++G ++A I
Sbjct: 324 AQKLEYLGLMLDSCRLSFALPSAKALAVKSMCESALAVDSISLREIASIMGNFTWAIPAI 383
Query: 718 PMGRLHSRRIQRQ--ASLLRLGAPHLTP--INPAVLPKLEWWLNALPLSSP--IFPRQVQ 771
P + H R +Q + R G T ++ A L+WW+++L + FP
Sbjct: 384 PFAQAHFRSMQSYYISRARRAGYDLKTKCVLSAAARLDLQWWISSLKVDRDKLFFPDVTD 443
Query: 772 HFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMV 831
I TDAS GWG+ + G W+ HIN+ E+ A+ S + +
Sbjct: 444 LEIYTDASLSGWGACCNGVRTRGSWTAADTKKHINELELVGALFAVQAFAAKSSSISIRI 503
Query: 832 QSDNQTVVSYLRRQGGTKS--LSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
DN T V+Y+ GGT+S L+L+S + ++D I I A + G N +AD SR
Sbjct: 504 YLDNVTAVAYVNHCGGTRSKELTLVSAELTSWCEARD--ISIEAVHVAGRLNVIADEESR 561
Query: 890 SK-SLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVP 927
+ DW L E+I W +D+FAS +A +P
Sbjct: 562 AGPDSGDWKLDPMVFERIQQLWPSD-VDVFASPWNAHLP 599
>gi|17066696|gb|AAL35360.1|AF442732_3 reverse transcriptase/ribonuclease H/putative methyltransferase
[Tetraodon nigroviridis]
Length = 785
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 172/571 (30%), Positives = 278/571 (48%), Gaps = 40/571 (7%)
Query: 396 VQTLQKPQRCSSPVNPPADSRI----GAELVG-----------GRLRRFVDAWIRLGAPA 440
++ QK +RCS + P D++ G L+ G L VD W R A
Sbjct: 119 IKKWQK-KRCSGQLFPAEDAKTSPVSGRSLLREEMSSSEGDAMGPLAARVDQW-RACAVH 176
Query: 441 PLV--RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQE-----MLETGVLKRL- 492
P V + +GY + S KPP S + + V+ S I E +L ++R+
Sbjct: 177 PWVLSTVANGYKLQLSVKPP-----SFNGVLSSVADGPSARILEEEIVTLLNKRAIRRVP 231
Query: 493 --DSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMI 549
+ GF S+ FL+PK G + RP+L+L+ LN+ L F ++ + + S ++ D+ +
Sbjct: 232 DEEVCQGFYSKYFLIPKKGGSSLRPILDLRVLNKHLRKYTFRMLTYKVLCSSIRPNDWFV 291
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
+IDL+ AYFH+ I H++FL +Y G +PFGL+ AP+ F+ LR+
Sbjct: 292 TIDLADAYFHIAIYPAHRKFLRFAYQGAAYEFQRIPFGLSLAPRVFSKCVEAALFPLRNS 351
Query: 610 GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD 669
G+R+ Y+DD+L+ + + + L +LG+ VN KS L P+ +LG+ +
Sbjct: 352 GIRIFSYIDDYLVCSHSREQVITDSVTVLRHLRNLGFTVNETKSRLEPSQYTDYLGLTLN 411
Query: 670 PHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR 729
R+ L +++ T + L K LLG ++ V+ +G L R +QR
Sbjct: 412 SLSYRVRLSTEREQTFNHCLALFQLGKMVTFRLCLRLLGLMASVISVVRLGLLMMRDVQR 471
Query: 730 QASLLRLGA-PHLT---PINPAVLPKLEWWLN--ALPLSSPIFPRQVQHFISTDASDLGW 783
+ LRL + HL+ + + L W + AL P+ + ++TDAS GW
Sbjct: 472 WVASLRLCSRKHLSRRVRVTARCMAALRPWRDPAALTAGVPLGAVSSRVTLTTDASLWGW 531
Query: 784 GSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLR 843
G+ + +G+WS++ +HIN EM AV AL LP L V+V++DN +VV+Y+
Sbjct: 532 GATLSGRTANGVWSQQMAQFHINVLEMQAVFLALRHFLPYLYGRHVLVKTDNSSVVAYIN 591
Query: 844 RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL-PDWHLSRSA 902
RQGGT+S L ++ L S + + A + G N AD LSR L +W L
Sbjct: 592 RQGGTRSQQLHELARRLVLWSSSRLLSLRATHVAGVLNRGADLLSRGNPLYGEWRLHPQV 651
Query: 903 TEQIFLKWGVPCIDLFASRVSAVVPNHFQVS 933
QI+ ++G +DLFAS+ +A P F ++
Sbjct: 652 VAQIWQRYGKAAVDLFASQENAHCPLFFSLA 682
>gi|170819710|gb|ACB38665.1| reverse transcriptase [Daphnia pulex]
Length = 1291
Score = 224 bits (571), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 147/534 (27%), Positives = 263/534 (49%), Gaps = 15/534 (2%)
Query: 406 SSPVNPPAD--SRIGAELVGGRLRRFVDAWIRLGAPAPLVRIV-SGYAIPFSAKPPLVPL 462
S P P A + G+ V RL +F D W + + +++ + G +I F P
Sbjct: 603 SQPEGPRATVVDKDGSITVASRLTKFADRWALVTSDRWVLKTIREGLSIEFENLPVQKSW 662
Query: 463 CSLQHLATPVSSAMSLHIQEML-ETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKG 520
++ ++ ++++L + + + D + GF+ F + K G RP++NLK
Sbjct: 663 PPQIVMSKEMAEVCDKEVKDLLAKRAIAEVTDGSAGFVCSFFCIKKKQAGQFRPIVNLKP 722
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN+F+ + F + N + ++KGD++ +DL AYF V +K H+++L + V
Sbjct: 723 LNKFIRYQHFKMENLESVRFLVRKGDWLAKVDLKDAYFTVAVKKEHRKYLRFRWGKRVFE 782
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
C+ FGLA P+ F + V + LR +G+R+V+YLDD L++N+ L + +
Sbjct: 783 FNCMAFGLA--PRVFTKILKTVMAFLRRKGIRLVIYLDDILVLNESKEGLVADVNTVLEL 840
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L SLG+++N +KS ++P V+++LG++ D + LP K + + T L+ +L
Sbjct: 841 LQSLGFLINWEKSIIAPTQVIEYLGLIVDSNDPSFSLPCAKAAAVRKMCETALSEGKVSL 900
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQR----QASLLRLGAPHLTPINPAVLPKLEWWL 756
+ S+ G ++A IP + H R +QR A + ++P+ L WW+
Sbjct: 901 RTIASIQGNFAWAIPAIPFAQSHYRSLQRFYISNAQRVDFNLEAKVRLSPSAALDLGWWV 960
Query: 757 NALPLSSP--IFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVH 814
+ ++ FPR+ I +DAS GWG+ + G W+ + N HIN+ E+
Sbjct: 961 ANIEKANGKMFFPREPDLEIFSDASLTGWGAVCNGVTTRGPWTVQDMNKHINELELLGAF 1020
Query: 815 QALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQ 874
A+ + + + DN T VSY+ + GGTKS +L + + I ++ I + A
Sbjct: 1021 FAIQTFSAQTSNIAIRIFLDNSTAVSYVNKCGGTKSAALTNTAKAISAWCEEKSISVEAV 1080
Query: 875 FIPGAYNSVADSLSRSKS-LPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVP 927
+ G N +AD SR+++ DW L + +I W + +DLFAS ++ +P
Sbjct: 1081 HLAGELNVIADRESRAEADTSDWRLDATIFSRISEIWEMD-VDLFASSWNSQLP 1133
>gi|345484330|ref|XP_003425006.1| PREDICTED: hypothetical protein LOC100679608 [Nasonia vitripennis]
Length = 1189
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 153/539 (28%), Positives = 257/539 (47%), Gaps = 20/539 (3%)
Query: 392 VSLKVQTLQKPQRCS-SPVNPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVSGYA 450
V+ V+T+Q+ ++ V+ PA GRL+ F+D W ++ + +SGY
Sbjct: 304 VAFTVETIQQSRKEKLDTVSVPA----------GRLKLFLDNWRKITDDKFIFSCISGYK 353
Query: 451 IPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRL-DSTTGFLSRLFLVPKGN 509
IP + S + + + +++ G +K D LS FLV K +
Sbjct: 354 IPVMEGVSKLKSYSYDQSGQKETETVQECVDALIKKGAIKECSDHKDQILSPYFLVKKPD 413
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRF 569
G R +LNLK N+ + F + + + ++ SID+ A++ +PI ++F
Sbjct: 414 GSHRFILNLKNFNKIVINHHFKIEDIKTVVQLTFPNYFLASIDIEDAFYLIPIHVESRKF 473
Query: 570 LALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
L + CLPFGL TAP F + VA +RS G V+YLDD L +
Sbjct: 474 LRFKVKEKIYEFVCLPFGLCTAPLIFTKIMKVVAKYVRSLGFTSVIYLDDILCIESSVNK 533
Query: 630 LEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL 689
+ +S+L LG+ +N +KS+L+P+ QFLG + D + LP +K+ + +L
Sbjct: 534 CKKNINETISVLEWLGFRINYKKSNLTPSTSCQFLGFIIDTQKYAILLPNEKRKAIYKLL 593
Query: 690 RTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRL----GAPHLTPIN 745
L K ++ L+G L A I G ++++ ++++ + + I+
Sbjct: 594 VEFLDLKRCSIRKYAQLIGKLISACPAIEYGWMYTKILEKEKIFQLIINDKCYDKMMNIS 653
Query: 746 PAVLPKLEWWL-NALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWH 804
V +L WW N L I I TDAS GWG+ S + G WS ++Q WH
Sbjct: 654 DRVKQELFWWKENILEKIYHIKDGSFAMTIFTDASTTGWGAWNYSKKIYGFWSPDEQKWH 713
Query: 805 INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
IN E++ + AL +++S ++++ DN T +SY+ + GG + +I+ +
Sbjct: 714 INYLELYTIKLALESLASDVKNSQILLRVDNTTALSYVNKMGGVRFDKYNKLAREIWKWA 773
Query: 865 QDWRIHIL-AQFIPGAYNSVADSLSRSKSLP-DWHLSRSATEQIFLKWGVPCIDLFASR 921
Q +R +IL A +IP N +ADSLSR K++ +W L+ ++ +G P IDLFAS+
Sbjct: 774 Q-FRGNILIASYIPTKQNVIADSLSRIKNIDIEWELNDMYFRKVVDHFGQPDIDLFASK 831
>gi|326678616|ref|XP_689703.4| PREDICTED: enzymatic polyprotein-like [Danio rerio]
Length = 585
Score = 211 bits (538), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 154/494 (31%), Positives = 240/494 (48%), Gaps = 59/494 (11%)
Query: 442 LVRIVS-GYAIPFSAKPPL---VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRL---DS 494
L+R + GYAI F+ +PP V + L+ PV + I +L G ++ + +
Sbjct: 7 LIRTIRLGYAIQFAKRPPKFTGVYFSRVNPLSAPV---LREEIAALLAKGAIEPVPPAEM 63
Query: 495 TTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLS 554
+GF S F+VPK +GG+RP+L+L+ LN+ L F ++ RI ++ D+ +IDL
Sbjct: 64 ESGFYSPYFIVPKKSGGSRPILDLRVLNRCLHKLPFRMLTQRRILQCVRPRDWFAAIDLK 123
Query: 555 QAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV 614
AYFHV I H++FL ++ G LPFGL+ +P+ F L+ + LR G+R++
Sbjct: 124 DAYFHVSILPRHRQFLRFAFEGRAWQYKVLPFGLSLSPRVFTKLAEGALAPLRLAGIRIL 183
Query: 615 VYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDR 674
YLDD+L++ L + + L LG VN +KS L+P + FLG+
Sbjct: 184 SYLDDWLILAHSREQLIMHRDEVLRHLRLLGLQVNREKSKLAPVQRISFLGM-------- 235
Query: 675 MWLPEDKQLTLGNILRTLLASKTWNLDS-ARSLLGYLSFASFVIPMGRLHSRRIQ----- 728
LDS LLG+++ A+ V P+G LH R +Q
Sbjct: 236 ------------------------ELDSITMRLLGHMASAAAVTPLGLLHMRPLQHWLHD 271
Query: 729 -RQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQV 787
+ S+ L L+P N + L P+ +STDAS+ GWG+
Sbjct: 272 RHRVSVTALCRRALSPWNDP---------SFLQAGVPLGQASSHVVVSTDASNTGWGAVC 322
Query: 788 DSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG 847
+GLW Q +WHIN+ E+ AV AL LP+L+ V+V++D+ +Y+ R GG
Sbjct: 323 RGHAAAGLWKGAQLHWHINRLELLAVFLALHRFLPVLERQHVLVRTDSTAAAAYINRMGG 382
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP-DWHLSRSATEQI 906
+S + ++ L S + A +PG N AD+LSR P +W L + + I
Sbjct: 383 MRSRRMSQLARRLLLWSHPRLKSLRAIHVPGTLNRAADALSRQLLRPGEWRLHPESVQLI 442
Query: 907 FLKWGVPCIDLFAS 920
+ ++G IDLFAS
Sbjct: 443 WARFGEAQIDLFAS 456
>gi|326671087|ref|XP_002660956.2| PREDICTED: enzymatic polyprotein-like [Danio rerio]
Length = 714
Score = 209 bits (532), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 153/490 (31%), Positives = 236/490 (48%), Gaps = 51/490 (10%)
Query: 442 LVRIVS-GYAIPFSAKPPL---VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRL---DS 494
L+R + GYAI F+ +PP V + L+ PV + I +L G ++ + +
Sbjct: 15 LIRTIRLGYAIQFAKRPPKFTGVYSSRVNPLSAPV---LREEIAALLAKGAIEPVPPAEM 71
Query: 495 TTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLS 554
+GF S F+VPK +GG+RP+L+L+ LN+ L F ++ RI ++ D+ +IDL
Sbjct: 72 ESGFYSPYFIVPKKSGGSRPILDLRVLNRCLHRLPFRMLTQRRILQCVRPRDWFAAIDLK 131
Query: 555 QAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV 614
AYFHV I H++FL ++ G LPFGL+ +P F L+ + LR G+R++
Sbjct: 132 DAYFHVSILPRHRQFLRFAFEGRAWQYKVLPFGLSLSPWVFTKLAEGALAPLRLAGIRIL 191
Query: 615 VYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDR 674
YLDD+L++ L + + L LG VN +KS L+P + FLG+
Sbjct: 192 NYLDDWLILAHSREQLIMHRDKVLRHLRLLGLQVNREKSKLAPVQRISFLGM-------- 243
Query: 675 MWLPEDKQLTLGNILRTLLASKTWNLDS-ARSLLGYLSFASFVIPMGRLHSRRIQRQASL 733
LDS LLG+++ A+ V P+G LH R +Q
Sbjct: 244 ------------------------ELDSITMRLLGHMASAAAVTPLGLLHMRPLQHW--- 276
Query: 734 LRLGAPHLTPINPAVLPKLEWWLNA--LPLSSPIFPRQVQHFISTDASDLGWGSQVDSSF 791
L H + L W + L P+ +STDAS+ GWG+
Sbjct: 277 --LHDRHRVLVTALCRRALSPWNDPSFLQAGVPLGQASSHVVVSTDASNTGWGAVCRGHA 334
Query: 792 LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSL 851
+GLW Q +WHIN+ E+ AV AL LP+L+ V+V++D+ +Y+ R GG +S
Sbjct: 335 AAGLWKGAQLHWHINRLELLAVFLALHRFLPVLERQHVLVRTDSMAAAAYINRMGGMRSR 394
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP-DWHLSRSATEQIFLKW 910
+ ++ L S + A +PG N AD+LSR P +W L + + I+ ++
Sbjct: 395 RMSQLARRLLLWSHPRLKSLRAIHVPGTINRAADALSRQLLRPGEWRLHPKSVQLIWARF 454
Query: 911 GVPCIDLFAS 920
G IDLFAS
Sbjct: 455 GEAQIDLFAS 464
>gi|301620562|ref|XP_002939645.1| PREDICTED: e3 ubiquitin-protein ligase HUWE1-like [Xenopus (Silurana)
tropicalis]
Length = 5647
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 138/404 (34%), Positives = 206/404 (50%), Gaps = 10/404 (2%)
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
L+ +++ S+D+ AY HVPI HQ+FL +Y+ LPFGL++AP+ F +
Sbjct: 1149 LEPQEFLTSLDMKDAYLHVPIHPLHQKFLRFAYHDHHYQFVALPFGLSSAPRIFTKIMAT 1208
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI--LGSLGWIVNLQKSSLSPAP 659
+A+LLR RG+ + YLDD L+ + P I + L +SI L + GW +N KS L P+
Sbjct: 1209 MAALLRVRGVYITPYLDDLLI--KAPSIHQALEDLNLSIQTLQNFGWTINRPKSCLVPSQ 1266
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
+QFLG++ D ++ LPE+K ++R L + +L +LG + ++ +P
Sbjct: 1267 RIQFLGLLLDTRRGKVLLPEEKIHKTRLLVRQLKSIPKPSLRFCMKVLGVMVASTEAVPF 1326
Query: 720 GRLHSRRIQRQA-SLLRL--GAPHLTPINPAVLPKLEWWLNALPLS-SPIFPRQVQHFIS 775
+ H R +QR S R ++ L L WWL L+ F I+
Sbjct: 1327 AQFHLRALQRNVISEWRRHHSLNQRISLSTQTLDSLNWWLTPPHLTQGKSFADPNWQIIT 1386
Query: 776 TDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDN 835
TDAS GWG+ + GLWS + IN E+ A+ +AL+ L + +QSDN
Sbjct: 1387 TDASLSGWGATFQNLSAQGLWSAAESRLPINILEIRAIFRALTHWETRLTGLAIRIQSDN 1446
Query: 836 QTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP- 894
T VSYL RQGGT+S++ E+ KI ++ I A IPG N AD LSR + P
Sbjct: 1447 ATAVSYLNRQGGTRSVAAAGEISKILRWAEHNVPQISAVHIPGLLNWEADYLSRYQIDPT 1506
Query: 895 DWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNHFQVSR-HVA 937
+W L + I +WG P +DL ASR + P +R H+A
Sbjct: 1507 EWELHPEVFDLIVTQWGEPDLDLMASRHNRKTPLFISKTRDHLA 1550
>gi|301632434|ref|XP_002945290.1| PREDICTED: hypothetical protein LOC100497369, partial [Xenopus
(Silurana) tropicalis]
Length = 1160
Score = 189 bits (479), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 134/416 (32%), Positives = 208/416 (50%), Gaps = 30/416 (7%)
Query: 410 NPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVS-GYAIPFSAKPPLVPLCSLQHL 468
P +R GGRLR+F +AW+RL + + R+VS GY + F A PP S
Sbjct: 482 KPRFQARTKYSWQGGRLRQFREAWLRLTSDPWVHRVVSFGYRLEFLATPPSRFFMSRLSQ 541
Query: 469 ATPVSSAMSLHIQEMLETGVLKRLDSTT---GFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
P SA IQ++L+ V+ ++ S G+ S LFLVPK +G RP+L+LK LN FL
Sbjct: 542 DPPKQSAFLAIIQDLLDERVIMQVPSEERFRGYYSNLFLVPKRDGSFRPILDLKKLNTFL 601
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+F + + + + + +Y++++D+ AY H T LP
Sbjct: 602 RFSRFKMESLRSVIAAMGHNEYLVALDIKDAYLH---------------------FTALP 640
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL +AP+ F + VA+ LR++G+ + YLDD LL Q +L S L SLG
Sbjct: 641 FGLTSAPRIFTKIMAAVAASLRAQGVSITPYLDDLLLKAPSQSAATSQLELVTSTLTSLG 700
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARS 705
W +NL+KS L+P+ + FLG+++D R++LP +K + ++ R L+ S+ ++ A
Sbjct: 701 WKINLEKSRLTPSRRMPFLGMIFDTAQQRVFLPPEKISQIQDLTRRLIQSQGPSIRFAMQ 760
Query: 706 LLGYLSFASFVIPMGRLHSRRIQRQA--SLLRLGAPHLTPINPAVLPKLEWWLNALPLSS 763
+LG + + +P + H R +Q R I P L WWLN P +
Sbjct: 761 VLGSMVSSIEAVPFAQFHLRDLQWNILDQWTRTSLSQRIQILPKTKTSLAWWLNT-PHLA 819
Query: 764 PIFPRQVQHF--ISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQAL 817
P Q H+ ++TDAS GWG+ +D G WS+ + IN E+ AV + +
Sbjct: 820 RGRPLQEPHWRLLTTDASLKGWGAVLDHLSAQGTWSKTEALLPINVLEIRAVIRTM 875
>gi|292630533|ref|XP_002667924.1| PREDICTED: hypothetical protein LOC100333442 [Danio rerio]
Length = 762
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 168/571 (29%), Positives = 267/571 (46%), Gaps = 82/571 (14%)
Query: 386 LEPPGRVSLKVQTLQKPQRCSSPVNPPADSRIGAELVGGRL---RRFVDAWIRLGAPAP- 441
++ P + ++ Q Q P SSP P ++ E RL ++F+ W RL +
Sbjct: 148 VDSPPLLRVQEQLNQGPPVVSSPQCPELATQGNIETSLERLVPLQKFL--WKRLPNVSQW 205
Query: 442 -LVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKR-------LD 493
L+ I GY I F++ P L T V +L +++ +E+ + KR LD
Sbjct: 206 VLLTIEKGYRIQFASCPSRFNGV----LHTLVKPEQALVMEQEVESLLRKRAIEQIPPLD 261
Query: 494 STTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+GF S F+VPK G RP+L+L+ LN+ + KF ++ I S LQ D+ ++IDL
Sbjct: 262 IESGFYSSYFIVPKKGEGLRPILDLRQLNRSVQTLKFKMLTISTIMSQLQSEDWFVTIDL 321
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRV 613
AYFH+ I +H+RFL ++ G LPFGLA +P+ FA + + + LR +G+R+
Sbjct: 322 KDAYFHISIHPSHRRFLRFAFGGKAYQYRVLPFGLALSPRTFAKVVDAALAPLRLQGIRI 381
Query: 614 VVYLDDFLLVNQDPRILEIQGK-LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
+ YLDD+L++ + R+L +Q + + ++ + L +N +KS L PA FL I
Sbjct: 382 LNYLDDWLILARS-RLLLVQHRGVVLTHIEKLVLQLNQKKSVLVPAQTTTFLSIR----- 435
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR--- 729
K++ LG + + + LLG L+ AS +IP+G LH R +QR
Sbjct: 436 -----AAAKRIKLGQAI---------TVKQFQKLLGLLAAASNIIPLGLLHMRPLQRWLK 481
Query: 730 ------QASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGW 783
+ +L R + + K W+L+ P R+ HF
Sbjct: 482 TRGFSLRGNLFRTIKASRCCLQALSIWKKLWFLS----QGPTLGRE-HHF---------- 526
Query: 784 GSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLR 843
+ HIN EM AV QAL P ++ V+V++D +VVSY+
Sbjct: 527 ------------------HMHINCLEMLAVFQALRHFFPQVRGHHVLVKTDKTSVVSYIN 568
Query: 844 RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP-DWHLSRSA 902
QGG S L ++I L +Q + + A +IPG N AD LSR + P W L
Sbjct: 569 HQGGLNSRPLCRLAKRILLWAQGRLLSLKAAYIPGPMNVGADLLSRQRLEPGGWRLHPKV 628
Query: 903 TEQIFLKWGVPCIDLFASRVSAVVPNHFQVS 933
I+ ++ I+LFA + + P F ++
Sbjct: 629 VAAIWQRFSKADINLFACQKTTHCPLWFSLT 659
>gi|301624101|ref|XP_002941349.1| PREDICTED: hypothetical protein LOC100486655 [Xenopus (Silurana)
tropicalis]
Length = 2901
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 128/390 (32%), Positives = 196/390 (50%), Gaps = 14/390 (3%)
Query: 408 PVNPPADSRIGAEL---VGGRLRRFVDAWIRLGAPAPLVRIV-SGYAIPFSAKPPLVPLC 463
P++PP + E V GRLR F D W+RL + I+ SGY + F +PP
Sbjct: 1966 PISPPQHDYVPPEDHTPVRGRLRLFRDEWLRLTTDTWVHDIITSGYRLEFVCRPPNRFFM 2025
Query: 464 SLQHLATPVSSAMSLHIQEMLETGVLKRL---DSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
S A IQ++LE V+ + D GF S LF+VPK +G RPVL+LK
Sbjct: 2026 SRLSPDPHKQDAFLSIIQDLLEEKVIVPVPPGDKFRGFYSNLFIVPKKDGSFRPVLDLKH 2085
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN F+ +F + + + + + ++++++D+ AY HVPI H +FL
Sbjct: 2086 LNAFIRASRFKMESLRSVIAAMNPNEFLVALDIKDAYLHVPIFPPHWKFLRFVVKNKHFQ 2145
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
T LPFGL +AP+ F + + A+ LRSRG+ + YLDD LL Q L +
Sbjct: 2146 FTALPFGLTSAPRIFTKIMSAAAASLRSRGVSITPYLDDLLLKAPSRPAATSQFSLVMDT 2205
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L +LGW +N+ KS L+PA + FLG+++D R++LP +K + +++R L+ + ++
Sbjct: 2206 LTTLGWKINITKSRLTPAQRMPFLGMLFDTARQRVYLPPEKIGRIQSLVRQLIHTPQPSI 2265
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQ----RQASLLRLGAPHLTPINPAVLPKLEWWL 756
A +LG L + +P + H R +Q Q + L P + P L WWL
Sbjct: 2266 QFAMQVLGSLVSSIEAVPFAQFHLRTLQWNILDQWNRSSLSQP--IKLLPRTRVALSWWL 2323
Query: 757 NALPLSSPIFPRQVQHFI-STDASDLGWGS 785
N L ++ Q I +TDAS GWG+
Sbjct: 2324 NPTHLEKGRSLQEPQWIILTTDASLQGWGA 2353
>gi|190702585|gb|ACE75468.1| reverse transcriptase and recombinase [Glyptapanteles indiensis]
Length = 1167
Score = 187 bits (474), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 148/480 (30%), Positives = 227/480 (47%), Gaps = 45/480 (9%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTG-FLSRLFLVPKGNGGTRPVLNL 518
V CSL P A + ++LE G + G FLS FLVPK NG R VLNL
Sbjct: 330 VKNCSLNANDEPYLPAA---VNKLLELGAISVCAPCNGQFLSTYFLVPKSNGDYRFVLNL 386
Query: 519 KGLNQFLSPKKFSLINHFRIPSF------LQKGDYMISIDLSQAYFHVPIKTTHQRFLAL 572
K LN+++ L HF++ F + KG +M +IDL AYF +PI ++++L
Sbjct: 387 KDLNKYI------LTFHFKMEDFRTAIKLMSKGCFMGTIDLKDAYFLIPIANKYRKYLRF 440
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEI 632
+ + C+PFGL P F +S +A+ LRS G VVYLD +L +D I E
Sbjct: 441 MWKQLLFEWACVPFGLNIGPWLFTKISKPIANFLRSLGFLSVVYLDYWLCFGRD--IEEC 498
Query: 633 QGKLAVS--ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
L + L S+G++VN KS+ P QFLG + TL +
Sbjct: 499 LNNLNQTKQCLESIGFVVNKDKSTPLPNMRCQFLGQL--------------IYTLITKFK 544
Query: 691 TLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA----PHLTPINP 746
L KT ++ + ++ A + G ++S+ ++RQ L L + ++
Sbjct: 545 NL---KTCSIREFAQFVRNITAACPAVQYGWVYSKSLERQKYLALLKSSGNYDAKMKLSA 601
Query: 747 AVLPKLEWW-LNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHI 805
++ L WW N L ++ I + + ISTDAS GWG+ + +G W E+ N+ I
Sbjct: 602 CLITDLNWWQKNILVTANQIRQQHYKLEISTDASLTGWGAACNHELYNGAWYGEELNYSI 661
Query: 806 NKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
E+ A + L + ++++ DN T +SY+ R GG + SL + +I+ +
Sbjct: 662 IHLELIAAYFGLQCFAEDKRDCEILLRIDNTTAISYINRMGGIQYPSLNAIAREIWQWCE 721
Query: 866 DWRIHILAQFIPGAYNSVADSLSRSKSLPD--WHLSRSATEQIFLKWGVPCIDLFASRVS 923
+ I A +I N AD SR + PD W L+ A ++I +G P IDLFASR +
Sbjct: 722 QHNLWITASYIASKENIKADYGSRIVN-PDTEWELADWAFQRIVKNFGTPEIDLFASRTN 780
>gi|301622063|ref|XP_002940362.1| PREDICTED: LOW QUALITY PROTEIN: midasin-like [Xenopus (Silurana)
tropicalis]
Length = 6288
Score = 183 bits (465), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 126/388 (32%), Positives = 192/388 (49%), Gaps = 13/388 (3%)
Query: 497 GFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQA 556
GF S LF+VPK +G RPVL+LK LN F+ +F + + + + + ++M ++D+ A
Sbjct: 2736 GFYSNLFIVPKKDGSFRPVLDLKHLNTFIRFARFKMESLRSVIAAMNPQEFMTAVDIKDA 2795
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVY 616
Y H+PI HQ+FL ++ G LPFGL TAP+ F + V + LR + + V Y
Sbjct: 2796 YLHIPIFPPHQKFLRFAFKGHHYQFQALPFGLTTAPRIFTKVMAAVTATLRKQALSVTPY 2855
Query: 617 LDDFLLVNQDPRILEIQ--GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDR 674
LD ++ + P Q + L L W +N KS+L+P+ + FLG+ +D R
Sbjct: 2856 LD---ILIKAPSYAAAQRSRDTVLQTLTELSWTINYSKSTLTPSQRITFLGLTFDTRSQR 2912
Query: 675 MWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQA--S 732
++LP DK + +++R LL + ++ A LG + + +P + H R +Q
Sbjct: 2913 VFLPPDKISKIQSLVRNLLTTPLSSVRFAMRTLGSMVASMEAVPFSQFHLRELQWNILDQ 2972
Query: 733 LLRLGAPHLTPINPAVLPKLEWWLNALPLS---SPIFPRQVQHFISTDASDLGWGSQVDS 789
R + L WWL+ LS S P + I+TDAS GWG+ +
Sbjct: 2973 WTRKSLTQTMVLRHRTRASLRWWLHKTHLSVGKSLGDPHWL--IITTDASLQGWGAVFQA 3030
Query: 790 SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
GLWS + IN E+ AVH+AL L + +QSDN T V+YL QGGT+
Sbjct: 3031 QTAQGLWSHREAQLPINILELRAVHRALLHWQNQLSGLPIRIQSDNATTVAYLNHQGGTR 3090
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIP 877
S S E +L Q R H +++ P
Sbjct: 3091 SRSTQGS-ESHSVLGQSPRRHTISRIHP 3117
>gi|345495977|ref|XP_001604972.2| PREDICTED: hypothetical protein LOC100121360 [Nasonia vitripennis]
Length = 1198
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 165/637 (25%), Positives = 283/637 (44%), Gaps = 90/637 (14%)
Query: 394 LKVQTLQKPQRCSS-------PVNPPADSRIGAELV----GGRLRRFVDAWIRLGAPAPL 442
++ Q L PQ SS P P ++ E V GRL+ F++ W + +
Sbjct: 292 IQFQQLFVPQDVSSKEILHTIPTLPAEETINSVENVKVKTAGRLKFFINKWREITDDKFV 351
Query: 443 VRIVSGYAIPFSAKP-PLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTG-FLS 500
+ ++GY I K L P +L + + + I+++L + ++ G +LS
Sbjct: 352 LEAITGYKIDLEFKVFQLEPNYNLDT-DKKIQRELDIAIKKLLTSNTIETCKDVEGQYLS 410
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHV 560
FLVPK +G R +LNLK F+ + F + + + L KGD+M +DL +AYF +
Sbjct: 411 SFFLVPKPDGSYRFILNLKKFKFFVKKEHFKIEDIRTAINLLNKGDFMCRLDLKEAYFLI 470
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
PI +++FL + + LPFGL++AP F + + + LR G+RVV+YLDDF
Sbjct: 471 PIHDEYKKFLRFKHKNQLYQFNVLPFGLSSAPFVFTKIGKPIVNWLRKNGVRVVIYLDDF 530
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L++ + N+ + ++ + LP++
Sbjct: 531 LILGRSEEECSF----------------NINSADMT------------------LELPQE 556
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRR---IQRQASLLRLG 737
K++ + ++ LL + + +G L A + G L+ + I+R A LR
Sbjct: 557 KRVKIREMIDILLKMERVKVKVIAKCIGVLVAACPAVAYGWLYYKHLELIKRNA--LRSN 614
Query: 738 APHL---TPINPAVLPKLEWWLNALPLS-SPIFPRQVQHFISTDASDLGWGSQVDSSFLS 793
+ ++ +L+WW + + ++ + I I +DAS GWG+ + +
Sbjct: 615 FKRMDKWITLSLEAKEELKWWQSQILIAKNKIRSSNFDLEIFSDASTTGWGAICGNKKAN 674
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSL 853
G W+RE++ HIN E+ A AL + ++++ DN T ++Y+ + GG K L
Sbjct: 675 GFWNREEREMHINFLEIKAAFLALKCFAAHSLNKQILLRIDNITALAYINKMGGIKHKEL 734
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL-PDWHLSRSATEQIFLKWGV 912
+ + I+ + I I A++I N +AD SR ++ +W L+ A ++I ++G
Sbjct: 735 HALTKVIWEWCIEREIWIFAEYIASKEN-IADEGSRITNVDTEWELANFAFQKIVKEFGY 793
Query: 913 PCIDLFASRVSAVVPNHFQVSRHVAILLLLSSGRRVHDLTLLSLDPDHFQELDDFVV--- 969
P IDLFASRV NH + R+ + DPD Q ++ F V
Sbjct: 794 PSIDLFASRV-----NH-KCKRYCS----------------WDRDPDA-QVINAFTVSWK 830
Query: 970 --FWPVFGSKTDSSSHLQSGWKIKENSSDPLFCIPTW 1004
FW F S L+ KI+E S + IP W
Sbjct: 831 EEFWYAFPPFVLISRVLK---KIREEYSTGILVIPLW 864
>gi|903714|gb|AAA70202.1| unknown protein [Dictyostelium discoideum]
Length = 608
Score = 176 bits (445), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 142/495 (28%), Positives = 241/495 (48%), Gaps = 45/495 (9%)
Query: 471 PVSSAMSLHIQEMLETGVLKRL----DSTTGFLSRLFLVPK-GNGGTRPVLNLKGLNQFL 525
P S ++ +Q++L ++++ S F S +F VPK G RPVL+LK LN ++
Sbjct: 13 PKSDCITKEVQDLLLDDAIEQVLPNRYSKRVFYSNVFTVPKPGTNLHRPVLDLKRLNTYI 72
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+ + F + +PS +++G YM+ +D+ +AY HV + ++ + G +P
Sbjct: 73 NNQSFKMEGIKNLPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMP 132
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL+TAP+ F L V +LR + V+ YLDD L+V K + +L LG
Sbjct: 133 FGLSTAPRIFTMLLRPVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMDLLVKLG 192
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD--SA 703
+ +NL+KS L P + FLG+ D ++ +P++K+ ++ +R L LD S
Sbjct: 193 FKLNLEKSVLEPTQSITFLGLQIDSVSMKLLVPKEKKKSVIKEIRNFLK-----LDCCSP 247
Query: 704 RSLLG----YLSFASFVIPMGRLHSRRIQR-QASLLRLGAPHLT---PINPAVLPKLEWW 755
R L G ++ VIP RL++RR + + L + PI V ++ W
Sbjct: 248 RKLAGLKGKLIALKDAVIPF-RLYTRRTNKFHSQCLTIAKGDWDQSFPIPQEVKSEISHW 306
Query: 756 LNALPLSS----PIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHIN 806
L L + +FP + ++TDAS+ G G+ + S WS Q N N
Sbjct: 307 LTVLNQWNGKEISLFP-SYDYVLTTDASESGAGATLKKGNKVIKTWSFQWSTTQSNMSSN 365
Query: 807 KKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG-TKSLSLLSEVEKIFLLSQ 865
++EM A+ A L + + +Q+DN T +SY+ RQGG + LS+L E+++
Sbjct: 366 RREMLALLMAYQALCRKLNNCKLKIQTDNTTTLSYINRQGGQIQDLSVL--FEQLWKQCL 423
Query: 866 DWRIHILAQFIPGAYNSVADSLSRSKSLP-----------DWHLSRSATEQIFLKWGVPC 914
+++++ + IPG +N AD LSR + +W L + +I L++G
Sbjct: 424 KKKVNLIGEHIPGFFNVKADHLSRLSEMNHKSSTRVIKSYNWQLKKEVFNRIQLQFGQIQ 483
Query: 915 IDLFASRVSAVVPNH 929
+DLFAS ++ N+
Sbjct: 484 MDLFASHLNHQTNNY 498
>gi|167739|gb|AAA33195.1| ORF3 [Dictyostelium discoideum]
Length = 608
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 144/495 (29%), Positives = 240/495 (48%), Gaps = 45/495 (9%)
Query: 471 PVSSAMSLHIQEMLETGVLKRL----DSTTGFLSRLFLVPK-GNGGTRPVLNLKGLNQFL 525
P S ++ +Q++L ++++ S F S +F VPK G RPVL+LK LN ++
Sbjct: 13 PKSDCITKEVQDLLLDDAIEQVLPNHYSKRVFYSNVFTVPKPGTNLHRPVLDLKRLNTYI 72
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+ + F + +PS +++G YM+ +D+ +AY HV + ++ + G +P
Sbjct: 73 NNQSFKMEGIKNLPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMP 132
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL+TAP+ F L V +LR + V+ YLDD L+V K + +L LG
Sbjct: 133 FGLSTAPRIFTMLLRPVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMDLLVKLG 192
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD--SA 703
+ +NL+KS L P + FLG+ D ++ +P++K+ ++ +R L LD S
Sbjct: 193 FKLNLEKSVLEPTQSITFLGLQIDSVSMKLLVPKEKKKSVIKEIRNFLK-----LDCCSP 247
Query: 704 RSLLG----YLSFASFVIPMGRLHSRRIQR-QASLLRLGAPHLT---PINPAVLPKLEWW 755
R L G ++ VIP RL++RR + L L PI V ++ W
Sbjct: 248 RKLAGLKGKLIALKDAVIPF-RLYTRRTNNFHSQCLTLANGDWDQSFPIPQDVKSEISHW 306
Query: 756 LNALPLSS----PIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHIN 806
L L + +FP + ++TDAS+ G G+ + S WS Q N N
Sbjct: 307 LIVLNQWNGKEISLFP-SYDYVLTTDASESGAGATLKKGNKVIKTWSFQWSTTQSNMSSN 365
Query: 807 KKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG-TKSLSLLSEVEKIFLLSQ 865
++EM A+ A L S + +Q+DN T +SY+ RQGG + LS+L E+++
Sbjct: 366 RREMLALLMAYQALCRKLNSCKLKIQTDNTTTLSYINRQGGQIQDLSVL--FEQLWKQCL 423
Query: 866 DWRIHILAQFIPGAYNSVADSLSRSKSLP-----------DWHLSRSATEQIFLKWGVPC 914
+++++ + IPG +N AD LSR + +W L + +I L++G
Sbjct: 424 KKKVNLIGEHIPGFFNVKADHLSRLSEMNHKSSTRVIKSYNWQLKKEVFNRIQLQFGQIQ 483
Query: 915 IDLFASRVSAVVPNH 929
+DLFAS ++ N+
Sbjct: 484 MDLFASHLNHQTTNY 498
>gi|270004735|gb|EFA01183.1| hypothetical protein TcasGA2_TC010509 [Tribolium castaneum]
Length = 921
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/377 (29%), Positives = 185/377 (49%), Gaps = 19/377 (5%)
Query: 321 REELGSLFSDES------VFKN--VSDHLLQYVCGKRAECLESRRRLVEPRDPHLASLLL 372
+++LG L SD + VF HLL + K+ + L + V PR+ + +
Sbjct: 215 KKQLGRLLSDSANLISDLVFTTSQTRRHLLFPLLSKQVKELTLK---VPPREFLFGANMA 271
Query: 373 RARRGKKSSS--PQNLEPPGRVSLKVQTLQKPQRCSS-----PVNPPADSRIGAELVGGR 425
R KS+ ++L+ PG S K R N ++ + GR
Sbjct: 272 DELRTAKSAERLAKDLKAPGSTSQNQHRFSKKSRTEVNRALLSENVQKVVQVSSTTQAGR 331
Query: 426 LRRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLE 485
L+ F W + ++ + GY IPF +P Q + S ++ +Q ++
Sbjct: 332 LKDFASKWCEITNNQVILNWIKGYTIPFQHQPRQAKRLVNQKFSNTESKTITECLQALMT 391
Query: 486 TGVLKR-LDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK 544
G +K+ + + F+S FLV K NG R +LNLK LN+FL P F + + + +++
Sbjct: 392 QGAVKKCIPPSNQFISPFFLVKKPNGSERFILNLKHLNRFLKPSHFKMEDSRTVTKLIEE 451
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
+M +IDL AYF +PI+ + ++++ + + TC+PFGL+TAP AF L V S
Sbjct: 452 NIFMATIDLKDAYFLLPIRKSDKKYIRFKFREQLYEFTCMPFGLSTAPYAFTKLMKPVTS 511
Query: 605 LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFL 664
LR + VVYLDDFL+ + + + K +++L +LG+I+N +KS+ P+ +FL
Sbjct: 512 FLRIHNIVCVVYLDDFLIFGKSEQSCQNNVKTVITLLQNLGFIINFEKSNCQPSQRCKFL 571
Query: 665 GIMWDPHLDRMWLPEDK 681
G ++D R+ LP +K
Sbjct: 572 GFVFDSVKMRISLPREK 588
>gi|357609714|gb|EHJ66600.1| putative transposon Ty3-I Gag-Pol polyprotein [Danaus plexippus]
Length = 421
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 102/292 (34%), Positives = 164/292 (56%), Gaps = 4/292 (1%)
Query: 607 RSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFL 664
R RG + + F +++Q D IL Q ++A+ L LGW V +K +P + +L
Sbjct: 34 RQRGASLQDLEERFDILSQQLDRDILTTQVQVAIQFLTDLGWWVYTEKLIQTPTRSIDYL 93
Query: 665 GIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHS 724
G +W+P + +LP DK + IL L + TWNL + LLG L+FA+F+ RLH
Sbjct: 94 GEVWNPSFNTKFLPSDKLQRIRQILHARLVAGTWNLKQPQRLLGDLNFATFITHSRRLHC 153
Query: 725 RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ-HFISTDASDLGW 783
R +Q Q++ LR + V +L WW+ + SPI P+++ + I TDASD+ W
Sbjct: 154 RLLQLQSNKLRKCPQSQIQFSEEVRTELIWWMENIGGQSPIHPKRMSTNHIITDASDIQW 213
Query: 784 GSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLR 843
G+ V++ + G W Q N+H N KEM AV A+S+ L++S V++Q+DN+TVV+Y++
Sbjct: 214 GALVNNELIKGAWEHHQTNYHCNLKEMSAVLTAISVKAMELRNSTVILQNDNKTVVTYMK 273
Query: 844 RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+GGT+S LL ++ L + I + + +PG N+ A +S ++ L D
Sbjct: 274 NEGGTRSHQLLELTRQLLNLLDQFNIVLRSHHLPGLLNTEA-CISETRKLRD 324
>gi|384498610|gb|EIE89101.1| hypothetical protein RO3G_13812 [Rhizopus delemar RA 99-880]
Length = 370
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 122/429 (28%), Positives = 192/429 (44%), Gaps = 66/429 (15%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+F++PK NGG RPV NLK LN++L F + ++ Y++SIDLS A+ H+
Sbjct: 1 MFVIPKKNGGIRPVFNLKKLNEYLKVPHFKMETIREGSQMIRPNAYLVSIDLSDAFLHIA 60
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
+ + + FL L + V PF L ++ F + + S+G R+ YLDD++
Sbjct: 61 LHSDSRWFLRLKWKNQVYQYCTTPFDLVSSLFVFTKVCRPILEHFCSQGFRISAYLDDWI 120
Query: 622 LVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK 681
L ++ + + V++L LGW+VN +KS L+P L+ LG + H LP K
Sbjct: 121 LAASTKQLAIQRVQAVVALLQQLGWMVNFKKSVLTPTQQLERLGFVLITHTMMESLPMKK 180
Query: 682 QLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL 741
L +I R +SK SF I RL++R + L H
Sbjct: 181 ---LRDIRR---SSK--------------QVGSFAISPARLYTRYL--------LYYKH- 211
Query: 742 TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQ 801
T SD GW + G W+ +
Sbjct: 212 ---------------------------------QTVKSDTGWRCSWQNHRAHGYWNPIEA 238
Query: 802 NWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
IN +E+ A H AL + +P + S +++++ N T +SY+ +QGGT+S LL ++
Sbjct: 239 AQSINWRELKAAHLALKTFRVP--KISTILIRTVNATSLSYINKQGGTRSPPLLELATEV 296
Query: 861 FLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD-WHLSRSATEQIFLKWGVPCIDLFA 919
+ I I AQ I G YN++AD SR + W + S E++ WG IDLFA
Sbjct: 297 WNWCLRHNIMIQAQHIYGIYNTIADIESRQTFFKNQWQIKPSVFEELNKIWGPFTIDLFA 356
Query: 920 SRVSAVVPN 928
R + ++P+
Sbjct: 357 DRTTKLLPS 365
>gi|384495518|gb|EIE86009.1| hypothetical protein RO3G_10719 [Rhizopus delemar RA 99-880]
Length = 640
Score = 152 bits (385), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 139/282 (49%), Gaps = 6/282 (2%)
Query: 422 VGGRLRRFVDAWIRLGAPAPLVRIVS-GYAIPFSAKPPL-VPLCSLQH-LATPVSSAMSL 478
VGGRL+ F W RL + +V G+ IPF PPL QH L+ +
Sbjct: 268 VGGRLQLFSLHWDRLFHNNWVNTVVQHGFKIPFHTLPPLSTDFTPHQHQLSKDQQLQLDQ 327
Query: 479 HIQEMLETGVLKRLDS---TTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
IQ++L ++ + + GF+S +F +PK GG RPV NL+ LNQ++ F +
Sbjct: 328 AIQDLLTKQAIEPVPQHQLSPGFISPMFTIPKKTGGCRPVFNLRALNQYIDCPHFKMETI 387
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
++ +Q GDYM SIDLS A+ H+P+ H+R+L + G V PFGL+ P F
Sbjct: 388 QQVSLMVQPGDYMTSIDLSDAFLHLPVHPEHRRYLRFYWKGSVYQFKTTPFGLSIVPYWF 447
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
++ + R +G+R+ YLDD++ + + ++ + L SLGW VNL+KS
Sbjct: 448 TKVTKPILEWARQQGIRLSAYLDDWITLGKTKAEATKHTQMILQCLTSLGWPVNLKKSQT 507
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT 697
P L+ LG D LP K L ++ L+ T
Sbjct: 508 QPTQTLEHLGFELDSQTMTARLPGKKLRDLRKSIQQLIKQPT 549
>gi|345485146|ref|XP_003425202.1| PREDICTED: hypothetical protein LOC100121748 [Nasonia vitripennis]
Length = 678
Score = 151 bits (381), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 86/299 (28%), Positives = 157/299 (52%), Gaps = 6/299 (2%)
Query: 390 GRVSLKVQTLQKPQRCSSPVNPPADSRIGAEL-----VGGRLRRFVDAWIRLGAPAPLVR 444
G + L+ + Q P+ + + ++ ++ EL + GRL+ F++ W ++ + ++
Sbjct: 203 GLIQLQPEIRQPPEEQRTSSSRESEIQLETELNDTVGITGRLKFFLNEWKQITMDSVILS 262
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLD-STTGFLSRLF 503
+ G+ IPF +KP + + + + + I+E+ + G +++ + F+S +F
Sbjct: 263 WIEGFKIPFVSKPTQYAIPRERDWSGKETVDIFDLIKELEQKGAIQKCSFRSNQFVSDMF 322
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIK 563
LVPK N +R +LNLK LN+F+ + L + + + +G +M SIDL AY+ +PI
Sbjct: 323 LVPKSNEKSRLILNLKKLNKFIENRHLKLEDGRTVITLKSRGCFMGSIDLKDAYYLIPIH 382
Query: 564 TTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV 623
H++FL +N + LPFGL+ AP F + V S LR G+ +V+YLDD L++
Sbjct: 383 EEHRKFLRFQFNDQLYEYLVLPFGLSVAPFVFTKIFKPVVSHLRRVGILLVIYLDDILIL 442
Query: 624 NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQ 682
Q + +++L LG ++N KS L P Q+LG +++ + LP DK+
Sbjct: 443 AQSYNECLESIRTVITVLEQLGIVINYTKSQLIPTRTCQYLGFLYNSQNMSVELPIDKR 501
>gi|327282171|ref|XP_003225817.1| PREDICTED: FYVE and coiled-coil domain-containing protein 1-like
[Anolis carolinensis]
Length = 2543
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 164/331 (49%), Gaps = 25/331 (7%)
Query: 592 PQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
P+ F VA+ LR RG+ V Y+DD+L+ + L I + + +L SLG IVN +
Sbjct: 1567 PRVFTKCMAVVAAALRLRGVTVYPYIDDWLITSDSRNQLLIDIDITLFLLQSLGLIVNKE 1626
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS 711
KS L P +QF+G + D + +LPED+ TL + ++ ++ K + +LG+++
Sbjct: 1627 KSQLEPTQSIQFIGAIIDSVSQKAFLPEDRFQTLKDNIQKVILQKHITARQIQIILGHMA 1686
Query: 712 FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP-------------AVLPKLEWWLNA 758
+ V P RLH R +Q A L+ NP V L WW N+
Sbjct: 1687 STTSVTPHARLHMRILQ---------AWFLSTYNPLVHNHNIRLSFPLEVRQSLFWWTNS 1737
Query: 759 LPLSS--PIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQA 816
+ + P P ++TD+S GWG+ + + G+W++E+ HIN E+ AV +A
Sbjct: 1738 NNVCAGLPFKPSDPTITLTTDSSTQGWGAHTGNLTVHGIWTKEEAKEHINYLELLAVEKA 1797
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
L P L VV V +DN T YL +QGGT S LL +++ + + + A +
Sbjct: 1798 LKAFEPALTGHVVQVITDNTTTKYYLNKQGGTHSPQLLQLSVRLWNWCIERNVDLRAIHL 1857
Query: 877 PGAYNSVADSLSRSK-SLPDWHLSRSATEQI 906
PG N +AD LSR+ + +W L +A ++
Sbjct: 1858 PGEQNILADQLSRTPFTDHEWSLHENAVSEL 1888
>gi|391332305|ref|XP_003740576.1| PREDICTED: enzymatic polyprotein-like [Metaseiulus occidentalis]
Length = 390
Score = 147 bits (372), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 175/357 (49%), Gaps = 21/357 (5%)
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLL 606
Y IDL A+ +P+ + Q FLA + + + LPFGL T+P+ F L V + L
Sbjct: 16 YFARIDLQDAFLSIPVHESSQLFLAFHWREQMYCWSRLPFGLKTSPRVFTKLLKPVVARL 75
Query: 607 RSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI 666
R G+ ++VYLDDFLL+ P L + ++L SLG+ +N KS+L PA + +LG
Sbjct: 76 RQEGISLIVYLDDFLLIADSPSRLAVNVLRTTTLLQSLGYTINFAKSALEPARQVTYLGY 135
Query: 667 MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRR 726
+ D R+ +P K++ + +R LL L + +LG L+ + ++ R H
Sbjct: 136 VLDASCMRLSVPLGKRIQIKEDIRHLLLLPRITLRALYRILGKLNALTTIVRSLRYHCAS 195
Query: 727 I-------QRQASLL--RLGAPHLTPINPAVLPKLEWWLNALP--LSSPIFPRQVQHFIS 775
+ RQ+ L +L P T ++ L WW L S PI P V I+
Sbjct: 196 LAKLVFETTRQSHTLDVQLRLPTATRVD------LLWWEANLDNIASGPIRPPLVSLEIT 249
Query: 776 TDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDN 835
TD+S GWG+ D G W+ + + HIN E+ AV A+ + + + +++D+
Sbjct: 250 TDSSLEGWGAWTDGRASGGAWTYDDKRLHINALELKAVFLAVQSLAGQVSDTTIAIRTDS 309
Query: 836 QTVVSYLRRQGGTKS--LSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + G +S L LS + + ++ +H+ A IPG +N +AD+LSR+
Sbjct: 310 TNAMHCINNFGSLRSPKLDRLSRELRAWAFERN--VHLKASHIPGVHNDIADALSRT 364
>gi|357609981|gb|EHJ66772.1| hypothetical protein KGM_00439 [Danaus plexippus]
Length = 264
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 120/210 (57%), Gaps = 1/210 (0%)
Query: 720 GRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ-HFISTDA 778
G H R +Q ++ LR + V +L WW+ + S I P+++ + + TDA
Sbjct: 55 GNWHCRLLQLHSNKLRKCPQSQIQFSEEVRTELIWWMENIDGESSIHPKRMSTNHVITDA 114
Query: 779 SDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTV 838
SD+ WG+ V++ + G W NWH N ++M AV A+S+ L++S V++ +DN+TV
Sbjct: 115 SDIQWGALVNNELMKGAWEHHHTNWHCNLEDMSAVLTAISVKAMELRNSTVILHNDNKTV 174
Query: 839 VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHL 898
V+Y++ +GGT+S LL ++ L + I + + +PG N+ AD LSR++ +WH+
Sbjct: 175 VTYIKNEGGTRSCQLLELTRQLLNLVDHFNIVLYPRHLPGLLNTEADHLSRNRVAVEWHI 234
Query: 899 SRSATEQIFLKWGVPCIDLFASRVSAVVPN 928
T ++F WG P +DLFAS+ + VV N
Sbjct: 235 RDKETLRLFSLWGTPDLDLFASQTAHVVAN 264
>gi|384486631|gb|EIE78811.1| hypothetical protein RO3G_03516 [Rhizopus delemar RA 99-880]
Length = 918
Score = 139 bits (350), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 178/393 (45%), Gaps = 43/393 (10%)
Query: 423 GGRLRRFVDAWIR-LGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQ 481
GGRL+ F W + + P PL I GY I +++ P + + A+S ++
Sbjct: 241 GGRLQAFTPYWKKTIHHPWPLSVIQEGYQIQWNSTPHPWKYHPTKRPSMEDRIAVSEAVK 300
Query: 482 EMLETGVLK-RLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPS 540
+ L G+++ + +LS F V K RP+L+ + LN+F+ F + + S
Sbjct: 301 KFLAAGIIEISPTQSKHYLSHFFTV-KEPTKRRPILDCRPLNKFVQCHHFKMEGIPALRS 359
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
L+K D + IDL AY VP+ +RFL + G V L FGL+ AP+ F+ L
Sbjct: 360 LLEKDDLICKIDLKDAYVVVPLHQQSRRFLTFLHQGTVYQYKSLAFGLSVAPRIFSKLMR 419
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
+ LR +G+++V YLDD LV + + + + ++ L +LG+++N +KSSL P +
Sbjct: 420 YAVEPLRRKGIKLVYYLDDICLVAKSMKEMNANMQETLAHLKNLGFLINYKKSSLQPQKI 479
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
+FLG ++ ++ LP+ K L I+ S+ D A+SL
Sbjct: 480 QEFLGFQFNTSTMQITLPQQK---LKKIV-----SRIRQRDLAKSL-------------- 517
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW------LNALPLSSPIFPRQVQHFI 774
+H + + L R L+WW N LP+ F I
Sbjct: 518 HVHHQNWESPCQLTRKS-----------FEDLQWWENFSGQHNGLPIHKEDFKTPAID-I 565
Query: 775 STDASDLGWGSQVDSSFLSGLWSREQQNWHINK 807
DASD G+G G W++E+Q+ IN+
Sbjct: 566 YVDASDSGYGVSSAELETHGFWTKEEQSTSINQ 598
>gi|322784671|gb|EFZ11526.1| hypothetical protein SINV_09160 [Solenopsis invicta]
Length = 328
Score = 130 bits (326), Expect = 5e-27, Method: Composition-based stats.
Identities = 86/296 (29%), Positives = 154/296 (52%), Gaps = 12/296 (4%)
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASK 696
A ++ LG+I+N QKSSL P+ +LG + + + L + K++ + ++ + K
Sbjct: 8 AKELMEHLGFIINYQKSSLIPSQECTYLGFKINSNTFCLELTDKKKIKIVELINQVYEGK 67
Query: 697 TWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRL-------GAPHLTPINPAVL 749
+ + LLG L+ A + +++ +R++R+ L + G H+T A +
Sbjct: 68 RYRIRDIAKLLGTLTAACPAMAYSKVYVKRLEREKFLALILNNNDFEGKMHITK---AAI 124
Query: 750 PKLEWWLNALPLS-SPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKK 808
L+WW +PL +PI ++ I +D+S GWG+ ++ +SG WS++++ HIN
Sbjct: 125 EDLQWWKRVVPLGYNPIRVQKFNMEIYSDSSTTGWGAYCNNVRISGFWSKKERKCHINYL 184
Query: 809 EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWR 868
E+ A AL L S ++++ DN T ++Y+ + GG K L KI+ + +
Sbjct: 185 ELKAAFLALQSFASELVSCEILLRLDNTTAIAYVNKAGGIKFPHLSELARKIWQWCEKRK 244
Query: 869 IHILAQFIPGAYNSVADSLSRSKSL-PDWHLSRSATEQIFLKWGVPCIDLFASRVS 923
I I A +IP + N AD+ SR ++ +W LS ++I +G IDLFASR++
Sbjct: 245 IWITASYIPSSENIEADAASRITNIDTEWELSDEIFKKIERSFGPFDIDLFASRLN 300
>gi|357613897|gb|EHJ68774.1| putative reverse transcriptase/ribonuclease H/putative
methyltransferase-like protein [Danaus plexippus]
Length = 182
Score = 129 bits (324), Expect = 9e-27, Method: Composition-based stats.
Identities = 62/168 (36%), Positives = 104/168 (61%), Gaps = 1/168 (0%)
Query: 763 SPIFPRQVQ-HFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
SPI P+++ + + TDASD+ WG+ V++ F+ W Q NWH N KEM AV A+S+
Sbjct: 8 SPIHPKRMSTNHVITDASDIQWGALVNNEFIKDAWEHHQTNWHCNPKEMSAVLTAISVKA 67
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYN 881
L++S ++Q+DN+TVV++++ +GGT+S LL ++ L + I + +PG N
Sbjct: 68 MELRNSTEILQNDNKTVVTFIKNEGGTRSRQLLELTRQLLNLVDHFNIVLSPHHLPGLLN 127
Query: 882 SVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNH 929
+ AD LSR+++ +WH+ T ++F WG P +DLFA + + V+ +
Sbjct: 128 TEADRLSRNRAAVEWHIRDKETLRLFNLWGTPDLDLFAFQTAHVIAKY 175
>gi|301605299|ref|XP_002932298.1| PREDICTED: treslin-like [Xenopus (Silurana) tropicalis]
Length = 2063
Score = 128 bits (322), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/262 (34%), Positives = 137/262 (52%), Gaps = 25/262 (9%)
Query: 413 ADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIV-SGYAIPFSAKPPLVPLCSLQHLATP 471
+D+R E VGGRL F+ W ++ I+ +GY I +PP VP +
Sbjct: 806 SDAR-SQEEVGGRLSLFLPMWENTTTDTWVLSIIRNGYFISIH-RPP-VPKFLINR---- 858
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTT-----------GFLSRLFLVPKGNGGTRPVLNLKG 520
+ +L QE LE+ VL L+ G SR+FLVPK +G R +++LK
Sbjct: 859 --PSRNLLKQEALESAVLDVLNKKVLEPVPLSEHQRGIYSRVFLVPKPDGRFRLIIDLKF 916
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS-YNGDV- 578
LNQ++ +KF + + LQ+GD M+++DL AY HVPI ++FL ++ Y G+
Sbjct: 917 LNQYIRKEKFRMETIKSAINILQEGDLMVTLDLKDAYLHVPIHPLSRKFLRIAVYLGNSL 976
Query: 579 --LAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKL 636
L LPFG+ +A + F + +A+ R RG+ VV YLDD+L+ L L
Sbjct: 977 HHLQFRALPFGINSATRVFTKVIVVIAAAFRQRGVFVVPYLDDWLIKASSLTQLSRHQDL 1036
Query: 637 AVSILGSLGWIVNLQKSSLSPA 658
+S+L S GWI+N +KS L P+
Sbjct: 1037 VISMLSSHGWIINEEKSILQPS 1058
>gi|357631473|gb|EHJ78947.1| reverse transcriptase [Danaus plexippus]
Length = 242
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 99/156 (63%)
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
F+Q D+++ I++ QAYFH+ + TH+RFL + Y ++ +T L G+++ P+ F +++N
Sbjct: 61 FIQDHDWLVKIEIHQAYFHLLVAETHRRFLRVVYKEEIFQLTALLLGVSSVPRTFGTVTN 120
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
WVA +LR++G+ +VVYLDDFLL NQ+ L Q ++IL SLG +N++KS
Sbjct: 121 WVAEILRNQGICLVVYLDDFLLANQNRNKLIAQVAETLAILKSLGRYLNVKKSMTELTHK 180
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASK 696
L++LG++WD + LP K L++ N LL +
Sbjct: 181 LEYLGLVWDIQSQIIALPTRKVLSIKNSFSGLLTRE 216
>gi|301603955|ref|XP_002931647.1| PREDICTED: uncharacterized protein KIAA0467-like [Xenopus (Silurana)
tropicalis]
Length = 3874
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 115/374 (30%), Positives = 169/374 (45%), Gaps = 37/374 (9%)
Query: 566 HQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
H RFL + G LPFGLAT P+ F + + +LLRS+G+ + YLDD L+ +
Sbjct: 654 HHRFLRFAVLGKHFQFVALPFGLATTPRVFTKVLAPIMALLRSQGISITPYLDDLLI--K 711
Query: 626 DPRILEIQGKL--AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQL 683
P + L + L S GWI+NL+KSSL+P+ + FLG +++ LP DK
Sbjct: 712 APTFHQNLSALNQVIQTLQSHGWIINLKKSSLTPSQEMTFLGTVFNTQRCLTLLPPDKVQ 771
Query: 684 TLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTP 743
L ++L + L + LG + A +P + H+R +QR A L H
Sbjct: 772 ALLLRAQSLTTAPRVFLRTCMQFLGSMVSAIDTVPFAQFHTRPLQR-AILSHWDPDHPDL 830
Query: 744 INPAVLP-----KLEWWLNALPLSSPIFPRQVQHFIS----TDASDLGWGSQVDSSFLSG 794
+ VLP L WW LS Q + F S T + L G F SG
Sbjct: 831 DSQIVLPLSARKSLSWWTQPTRLS------QGKPFPSGPSWTQPTRLSQG----KPFPSG 880
Query: 795 LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLL 854
S + +F V + N + + +Q+DN T V+Y+ QGGT+S +
Sbjct: 881 PSS--------PRMPVFGVGVRSARN----RGKPIRIQTDNATAVAYVNHQGGTRSKGAM 928
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK-SLPDWHLSRSATEQIFLKWGVP 913
E I +++ + I A IPG N AD LSR +W L + + WG P
Sbjct: 929 QEAAHILAWAEENVLAISAIHIPGVDNWTADFLSRETLDQGEWALHPQVFQNLTSIWGTP 988
Query: 914 CIDLFASRVSAVVP 927
+DL ASR ++ +P
Sbjct: 989 EVDLMASRHNSKLP 1002
>gi|384501405|gb|EIE91896.1| hypothetical protein RO3G_16607 [Rhizopus delemar RA 99-880]
Length = 601
Score = 126 bits (317), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 75/254 (29%), Positives = 130/254 (51%), Gaps = 6/254 (2%)
Query: 480 IQEMLETGVLK-----RLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
IQ++L ++ + +T+GF S +F++PK + G R V NLK LNQ+L F +
Sbjct: 324 IQDLLSKQAIEPVSDAEVRTTSGFYSSMFVIPKKDSGIRSVFNLKRLNQYLDAPHFKMET 383
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + DY++SIDLS A+ HV + +RFL L + V FGL+T+P
Sbjct: 384 IREVALMINPNDYLVSIDLSDAFLHVGLHPELRRFLWLKWKDQVYQYCTAAFGLSTSPFV 443
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F+ + + RS+G R+ YL D++L ++ Q ++ V++L LGW++N +KS+
Sbjct: 444 FSKVYRPILEHFRSQGYRISAYLYDWILAANTKQLAIQQAQIVVNLLQQLGWLINFKKSA 503
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS-KTWNLDSARSLLGYLSFA 713
L+P L+ L + + LP + L ++ +L + L S+ +
Sbjct: 504 LTPTQQLEHLSFVLNTKTMTASLPLKELRDLRRSIKQILDHPRRQTLRVIHSVTMRIQAT 563
Query: 714 SFVIPMGRLHSRRI 727
+F I RL++RR+
Sbjct: 564 TFAIFPPRLYTRRL 577
>gi|66828825|ref|XP_647766.1| hypothetical protein DDB_G0267210 [Dictyostelium discoideum AX4]
gi|60475915|gb|EAL73842.1| hypothetical protein DDB_G0267210 [Dictyostelium discoideum AX4]
Length = 968
Score = 123 bits (308), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 140/275 (50%), Gaps = 13/275 (4%)
Query: 471 PVSSAMSLHIQEMLETGVLKRL----DSTTGFLSRLFLVPK-GNGGTRPVLNLKGLNQFL 525
P S ++ +Q++L ++++ S F S +F VPK G RPVL+LK LN ++
Sbjct: 92 PKSDCITKEVQDLLLDDAIEQVLPNRYSKRVFYSNVFTVPKPGTTLHRPVLDLKRLNTYI 151
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+ + F + +PS +++G YM+ +D+ +AY HV + ++ + G +P
Sbjct: 152 NNQSFKMEGIKNLPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKAMP 211
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL+TAP+ F L V +LR + V+ YLDD L+V K + +L LG
Sbjct: 212 FGLSTAPRIFTMLLRPVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMDLLVKLG 271
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARS 705
+ +NL+K L P ++ FLG+ D ++ +P++K+ + +R L + S R
Sbjct: 272 FKLNLEKIVLEPTQLITFLGLQIDSVSMKLLVPKEKKKGVIKEIRNFLK---LDCCSPRK 328
Query: 706 LLG----YLSFASFVIPMGRLHSRRIQRQASLLRL 736
L G ++ VIP RL++RR + S+ +L
Sbjct: 329 LAGLKGKLIALKDAVIPF-RLYTRRTNKNQSVSKL 362
>gi|66828719|ref|XP_647713.1| hypothetical protein DDB_G0267304 [Dictyostelium discoideum AX4]
gi|60475858|gb|EAL73789.1| hypothetical protein DDB_G0267304 [Dictyostelium discoideum AX4]
Length = 833
Score = 123 bits (308), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 170/361 (47%), Gaps = 20/361 (5%)
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+PS +++G YM+ +D+ +AY +V + ++ + G +PFGL+TAP+ F
Sbjct: 7 LPSMVKQGYYMVKLDIKKAYLYVLVDPQYRDLFRFVWKGSHYRWKTMPFGLSTAPRIFTM 66
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
L V +LR + V+ YLDD L+V K + +L LG+ +NL+KS L
Sbjct: 67 LLRPVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMDLLVKLGFKLNLEKSVLEL 126
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
+ FLG+ ++ +P++K+ ++ +R L + S R L G +
Sbjct: 127 TQSITFLGLQIGSISMKLLVPKEKKKSVIKEIRNFLK---LDCCSPRKLAGLKGKLVALK 183
Query: 718 PMGRLHSRRIQR-QASLLRLGA---PHLTPINPAVLPKLEWWLNALP----LSSPIFPRQ 769
RL +RR + + L L PI V ++ WL L +FP
Sbjct: 184 DAFRLSTRRTNKFHSQCLTLANGDWDQSFPIPQDVKSEISHWLTVLNQWNGKEISLFP-S 242
Query: 770 VQHFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
+ ++TDAS+ G G+ + S WS Q N N++EM A+ A L
Sbjct: 243 YDYVLTTDASESGAGATLKKGNKVIKTWSFQWSTTQSNMSSNRREMLALLMAYQALCQKL 302
Query: 825 QSSVVMVQSDNQTVVSYLRRQGG-TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
+ + +Q+DN T +SY+ RQGG + LS+L E+++ +++++ + IPG +N
Sbjct: 303 NNCKLKIQTDNTTTLSYINRQGGQIQDLSVL--FEQLWKQCLKKKVNLIGEHIPGFFNVK 360
Query: 884 A 884
A
Sbjct: 361 A 361
>gi|159465941|ref|XP_001691170.1| hypothetical protein CHLREDRAFT_180868 [Chlamydomonas reinhardtii]
gi|158270313|gb|EDO96179.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1199
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 119/461 (25%), Positives = 203/461 (44%), Gaps = 54/461 (11%)
Query: 476 MSLHIQEMLETGVLKRLDST--------TGFLSRLFLVPKGNGGTRPVLN--LKGLNQFL 525
+ H+ ML G+++ DS ++ L +VPKG+ RP+++ G+N +
Sbjct: 427 LGAHLAAMLRDGIIEAYDSAEHGPESAFATVVNPLHVVPKGDS-IRPIIDPTASGVNACM 485
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS--YNGDVLAMTC 583
L + + L + Y+ DL+ + H + +RF+A G +
Sbjct: 486 RQLPCKLPDLAELLQHLPQYGYLGKRDLASGFHHCVLAPEARRFMAFRNPATGALQRYVA 545
Query: 584 LPFGLATAPQAFASLSNWVASLLRS----RGM--RVVVYLDDFLLVNQDPRILEIQGKLA 637
LPFG + +P F L+ ++ +S RG+ R+ Y DDF+++ Q ++ G A
Sbjct: 546 LPFGASQSPAIFCELTAAATTIFQSECDRRGLQVRIFTYCDDFMIIGQQH--ADVVGAFA 603
Query: 638 V-SILGS-LGWIVNLQKSSL--SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
V +LG+ LG+ L+K + L+FLG+M+D M + DK+ +R LL
Sbjct: 604 VMDVLGAELGFTWKLEKDQGRDTACQQLEFLGMMFDTVRLEMRITPDKRQRYAAAIRALL 663
Query: 694 ------ASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
A + +L+S + G L+F + G + + L+ AP + + PA
Sbjct: 664 DAAEQGAVQRQDLES---VAGKLTFVARACRWGYTFLQSVYDALFSLQQPAPRVLSLEPA 720
Query: 748 VLPKLEWWLNALPLSSPIFP-------------------RQVQHFISTDASDLGWGSQVD 788
L L+WW L S ++ Q I TDAS G+G+ +
Sbjct: 721 ALEDLQWWWEVLRADSSVWDGARQCTVAELELVRGEFADAQSGAVIFTDASGAGFGAAWE 780
Query: 789 SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGT 848
+ L G++S +Q+ HI E+ AV +AL P L+ V+V+ DN V+ + G T
Sbjct: 781 EAELQGVFSAQQRQSHIAWLELTAVVRALQTWAPRLKGRRVLVRCDNTQAVAAV-NHGST 839
Query: 849 KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ S ++ L+ + A+ I G N+ AD LSR
Sbjct: 840 RVKEGRSLCRQLAELAMQAGFEVRAEHIAGVANTRADRLSR 880
>gi|268571541|ref|XP_002641077.1| Hypothetical protein CBG17454 [Caenorhabditis briggsae]
Length = 1022
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 124/477 (25%), Positives = 214/477 (44%), Gaps = 36/477 (7%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+ ++LE GV+ +++ + V R +L+L LN+ + +F L + +
Sbjct: 264 EVGKLLEAGVV--VETINPIVCSPLQVADNGKKLRLILDLTALNKGIETPRFRLEDWRAV 321
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA----MTCLPFGLATAPQA 594
SFL+K +Y + D Y H+ I LA S + LA T LPFGL++AP
Sbjct: 322 WSFLEKANYAATFDFKSGYHHIKIADPSSDLLAFSLSDPPLAPFYKFTALPFGLSSAPWL 381
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + + R +G+++ +Y+DD L++ + +L L + G V +KS
Sbjct: 382 FTKVFRPLVGRWREKGVKIFLYIDDGLILAKTREEAVEAVRLVREGLAAAGVTVEEEKSF 441
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
+P+ +LGI D + L E ++++LGN L + SK + + L G++S S
Sbjct: 442 WTPSEQFTWLGIKCDLVTKSIRLTESREVSLGNQLDKMKKSKAPTILDRQKLCGFISSMS 501
Query: 715 FVIPMGRLHSRRIQRQ----ASLLRL--GAP---HL-TPINPAVLPKLEWWLNALPLSSP 764
V +QRQ A++ R+ G P HL ++ + L ++E+W L
Sbjct: 502 IVA-----ECESVQRQRHLAATVARVTSGNPKKQHLQVSMSQSELFEVEFWERKLKKGDL 556
Query: 765 IFPRQVQHF-----ISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQ--AL 817
+ + F + TDAS G G+ + +S +W + +KE A+ + A+
Sbjct: 557 VRNINEEQFDPTWYLYTDASAEGMGAVLKNSRKETVWQASEIGDVSFRKESSALRELRAV 616
Query: 818 SLNLPLLQSSV---VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF-LLSQDWRIHILA 873
L+ + + + SD+Q VS L++ G+ L L E+++ + Q L
Sbjct: 617 EFATRTLREQIRGAISIHSDSQAAVSVLKK--GSMKLELHKVAERVWDSVEQIGPARFL- 673
Query: 874 QFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNHF 930
+IP N+ AD+ SR DW + A E +WG DLFAS +A +F
Sbjct: 674 -WIPREENTEADAASREFDTDDWGVQNWAFEWAQKRWGHVKCDLFASERNAKHSVYF 729
>gi|340383595|ref|XP_003390302.1| PREDICTED: hypothetical protein LOC100640973 [Amphimedon
queenslandica]
Length = 878
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 124/435 (28%), Positives = 200/435 (45%), Gaps = 26/435 (5%)
Query: 479 HIQEMLETGVLKRLDSTTGF-LSRLFLVPKGN--GGTRPVLNLKG-----LNQFLSPKKF 530
+I E + G L+++ S + L PK + G R +++L +N +S
Sbjct: 394 YITEEIAAGRLRQVPRRAPVHWSSIGLTPKSHQPGKFRLIIDLSAPSGASVNDGISSSLT 453
Query: 531 SLI-----NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
SLI N + +G +M +DL AY HVP+ Q LA+ + G T LP
Sbjct: 454 SLIYPRVENAVELIRAAGRGAFMAKLDLKAAYRHVPVHPDDQSLLAIRWGGATYLDTALP 513
Query: 586 FGLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLV-NQDPRILEIQGKLAVSILGS 643
FGL++AP+ F ++++ ++ + G+ + YLDDF Q P + ++AV++
Sbjct: 514 FGLSSAPKIFTAMADGLSWCMMCEGVSHFLHYLDDFFFCPPQHPGRCQQSLRVAVNLCEH 573
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA 703
LG+ V +K + PA L FLGI D M LP+ K L L L ++
Sbjct: 574 LGFPVAPEK-VVGPATTLVFLGIELDSIKMEMRLPQGKLERLQGSLGWWLTRESVTKKQL 632
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--L 761
+S++G LS A+ V+ GR R + +AS + HL +N L+WW + +
Sbjct: 633 QSIVGQLSDAAVVVRPGRTFIRSLI-EASKIPRKQDHLVRLNQECRADLQWWNSFIQNWN 691
Query: 762 SSPIFP-RQVQHFISTDASDLGWGS----QVDSSFLSGLWSREQQNWHINKKEMFAVHQA 816
+FP R + +++DAS WG + S++ W ++ +I KE+F V A
Sbjct: 692 GVALFPGRPLLETVTSDASG-SWGCGALLEDGSAWFQFQWPAPWRDANIATKELFPVVLA 750
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
+L + V ++DNQ VVS L K L+ + +F + + I I
Sbjct: 751 AALWGSRWRGRRVRYRTDNQAVVSALANYSA-KDPPLVHLLRSLFFIEAYFDIEHSVVHI 809
Query: 877 PGAYNSVADSLSRSK 891
PG N AD+LSR K
Sbjct: 810 PGEDNGAADALSRDK 824
>gi|308500876|ref|XP_003112623.1| hypothetical protein CRE_30767 [Caenorhabditis remanei]
gi|308267191|gb|EFP11144.1| hypothetical protein CRE_30767 [Caenorhabditis remanei]
Length = 1077
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 133/522 (25%), Positives = 222/522 (42%), Gaps = 40/522 (7%)
Query: 425 RLRRFVDAWIRLGAPAPLVRIV-SGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
RL + W L ++ ++ +GY IP PL ++ +A + + I ++
Sbjct: 439 RLSEHIQFWCELTGERWILEVIKNGYEIPLDKHFPLPAPEGMRKMAKNNMNFVITEINKL 498
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
ETGV+ +S +S L + G R +L+L LN+ LSP +F + + FL
Sbjct: 499 RETGVVSVAESPM-VVSPLHVARNGEK-LRLILDLSKLNKGLSPARFRQEDWKTVWPFLS 556
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYN----GDVLAMTCLPFGLATAPQAFASLS 599
+ Y + D Y HV I LA S + L LPFGL+TAP F +
Sbjct: 557 EACYAATFDFRSGYHHVKISEASSDLLAFSLSDPPSSPFLKFNALPFGLSTAPWLFTKIF 616
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
+ R+ G+ + +YLDD L++ + E + L + G V KS+ P+
Sbjct: 617 RPLVGRWRAAGINIFLYLDDGLILAKTREEAERAVIMVREDLKAAGVCVAEDKSNWEPSA 676
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
+LGI D + L E ++ +L + + L S+ ++ + + GYLS S I
Sbjct: 677 QFTWLGIRGDLTERTVRLTEKRENSLRDQISLLKRSRAPSVLDRQKMCGYLS--SLTIVA 734
Query: 720 GRLHSRRIQRQASLL--------RLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQV- 770
G R ++ AS++ R G+ ++ + +L++W L I +
Sbjct: 735 GHEAIGRQRQMASVIAEETVGLGRAGSIR-RQLSEGEISELDFWEEKLESGGMIRDMEEE 793
Query: 771 ---QHFISTDASDLGWGSQVDSS------FLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
Q F+ TDAS G G+ + + +S L QN +E+ AV A+ +
Sbjct: 794 FEPQWFLFTDASAEGLGAVLKNGSGQTVMKMSELGGTGFQNESSALRELRAVQMAVE-RM 852
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEV--EKIFLLSQDWRIHILAQF--IP 877
+ V++ +D+Q V LR+ ++L +++E E + + Q A+F IP
Sbjct: 853 ASWKRGAVLIHTDSQAAVIILRKGSMRRTLQIVAERVWESLRSIGQ-------AKFIWIP 905
Query: 878 GAYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDLFA 919
N AD SR DW + A E +WG D FA
Sbjct: 906 REQNKEADEASRDFDYDDWAVQNWAFEWAQKRWGEVKCDWFA 947
>gi|301611041|ref|XP_002935061.1| PREDICTED: hypothetical protein LOC100489410 [Xenopus (Silurana)
tropicalis]
Length = 952
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 133/264 (50%), Gaps = 18/264 (6%)
Query: 627 PRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG 686
P Q KL S L SLGW +NL+KS L+P+ + FLG+++D L R++LP +K +
Sbjct: 559 PSTATSQLKLVTSTLTSLGWKINLEKSRLTPSRRMPFLGMIFDTALQRVFLPPEKIFRIQ 618
Query: 687 NILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQA--SLLRLGAPHLTPI 744
++ L+ S + ++ A +LG + + +P + H R +Q R I
Sbjct: 619 DLTCRLIQSPSPSIRFAMHVLGSMVSSIEAVPFAQFHLRDLQWNILDQWTRTSLSQRFRI 678
Query: 745 NPAVLPKLEWWLNALPLSSPIFPRQVQH----FISTDASDLGWGSQVDSSFLSGLWSREQ 800
P LEWWLN+ L+ R +Q ++TDAS GWG+ +D G WS+ +
Sbjct: 679 LPKTQASLEWWLNSSQLAK---GRSLQEPHWRLLTTDASLSGWGAVLDHLSAQGTWSKTE 735
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
IN E+ AV AL L+ LQ+ + VQSDN T V+YL QGGT+S L EV I
Sbjct: 736 ALLPINILEIRAVRLAL-LHWQHLQA--IKVQSDNATTVAYLNHQGGTQSCQALREVSLI 792
Query: 861 FLLSQ------DWRIHILAQFIPG 878
++ D R+H + + G
Sbjct: 793 LTWAETQESRFDSRLHPRTRQLAG 816
>gi|66802990|ref|XP_635338.1| hypothetical protein DDB_G0291223 [Dictyostelium discoideum AX4]
gi|60463654|gb|EAL61838.1| hypothetical protein DDB_G0291223 [Dictyostelium discoideum AX4]
Length = 300
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 109/204 (53%), Gaps = 5/204 (2%)
Query: 471 PVSSAMSLHIQEMLETGVLKRL----DSTTGFLSRLFLVPK-GNGGTRPVLNLKGLNQFL 525
P S ++ +Q++L ++++ S F S +F+VPK G RPVL+LK LN ++
Sbjct: 78 PKSDCITKEVQDLLLDDAIEQVLPNRYSKRVFYSNVFMVPKPGTNLHRPVLDLKRLNSYI 137
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+ + F + +PS +++G YM+ +D+ +AY HV + ++ + G +P
Sbjct: 138 NNQSFKMEGIKNLPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMP 197
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL+TAP+ F L V +LR + V+ YLDD L+V K A+ +L LG
Sbjct: 198 FGLSTAPRIFTMLLRPVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKAMDLLVKLG 257
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWD 669
+ +NL+KS L P + FLG+ D
Sbjct: 258 FKLNLEKSVLEPTQSITFLGLQID 281
>gi|66799767|ref|XP_628809.1| hypothetical protein DDB_G0294180 [Dictyostelium discoideum AX4]
gi|60462163|gb|EAL60396.1| hypothetical protein DDB_G0294180 [Dictyostelium discoideum AX4]
Length = 558
Score = 113 bits (282), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 76/241 (31%), Positives = 120/241 (49%), Gaps = 8/241 (3%)
Query: 422 VGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPF--SAKPPLVPLCSLQHLATPVSSAMSLH 479
VGGRL W LG P V+G I + KP L P+ + P S ++
Sbjct: 319 VGGRLFHHKQVWKELGLPNFCQEFVNGLKIHLLPNFKPMLNPI-PISIPEGPKSDCITKE 377
Query: 480 IQEMLETGVLKRL----DSTTGFLSRLFLVPK-GNGGTRPVLNLKGLNQFLSPKKFSLIN 534
+Q++L ++++ S F S +F VPK G RPVL+LK LN +++ + F +
Sbjct: 378 VQDLLLDDAIEQVLPNRYSKRVFYSNVFTVPKPGTNLHRPVLDLKRLNTYINNQSFKMEG 437
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+PS +++G YM+ +D+ +AY HV + ++ + G +PFGL+TAP+
Sbjct: 438 IKNLPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMPFGLSTAPRI 497
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L V +LR + V+ YLDD L+V K + +L LG+ +NL+KS
Sbjct: 498 FTMLLRPVLRMLRDIKVSVIAYLDDLLIVGSTKEECLSNFKKTMELLVKLGFKLNLEKSV 557
Query: 655 L 655
L
Sbjct: 558 L 558
>gi|10058|emb|CAA43185.1| ORF2 [Panagrellus redivivus]
Length = 588
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/437 (25%), Positives = 206/437 (47%), Gaps = 21/437 (4%)
Query: 499 LSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYF 558
+S L + + R V++L +N +++ K L N S + K +M++ D+ Y
Sbjct: 34 ISALSVSVNADAKCRLVMDLTTVNPYITANKIKLENVAIAKSLIPKSGFMLTFDMKSGYH 93
Query: 559 HVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLD 618
+ + +LA + G M LPFGL++AP+ F L + LR G+ ++YLD
Sbjct: 94 QARMADSELIYLAFRWEGKTFWMKALPFGLSSAPEYFTKLFRHPLATLRGDGVNCLLYLD 153
Query: 619 DFLLVNQDPR-ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWL 677
D L+ ++ E K+ ++ G LG ++N +KSS++P +++LG++++ + +
Sbjct: 154 DLLVWSETYEGACEASAKVR-ALFGKLGVVLNNEKSSVTPQREVKWLGVVFNLTHGTLKI 212
Query: 678 PEDKQLTLGNILRTLLASKTWNLDSARSLLGYL-SFASFVIPMGRLHSRRI-QRQASLL- 734
+++ LL K + G L S + PM + ++ + AS+
Sbjct: 213 SKNRIENALAAAARLLNRKRPSAKDRLKFTGALNSMHDVLGPMAAIRTKSLFCFIASVTP 272
Query: 735 RLGAPHLTPINPAVLPKLEWWLNALPLSSPIF----PRQVQHFISTDASDLGWGS-QVDS 789
RLG ++ +++W L + ++ R ++ +TDAS G G+ +++
Sbjct: 273 RLGVR--LALSEREKADIKYWQRNL-VERNVWRIQDTRPSEYVFATDASATGVGAVKLNP 329
Query: 790 SFLSGLWS--REQQNWHINK----KEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLR 843
L+ L S RE + N +E+ AV AL L +++VV V++DNQ + L
Sbjct: 330 KDLTELSSAYREFDEYGGNDLEHHRELLAVQFALHHYLASKKNTVVTVRTDNQNIPRILA 389
Query: 844 RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSAT 903
+ G + L+ L V ++ + ++ ++ +IP A NS AD SR DW +S+
Sbjct: 390 KGSGVQELNEL--VLQVTEWCEQRKVELMTTWIPRAMNSAADRASRETDPDDWAISKEIF 447
Query: 904 EQIFLKWGVPCIDLFAS 920
E++ K+ D FAS
Sbjct: 448 EKLTAKFQKCQCDRFAS 464
>gi|301614963|ref|XP_002936955.1| PREDICTED: hypothetical protein LOC100494179 [Xenopus (Silurana)
tropicalis]
Length = 571
Score = 111 bits (277), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 156/359 (43%), Gaps = 15/359 (4%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G M +D+ A+ +P+ L + G CLP G + + F + S ++
Sbjct: 187 RGALMAKVDVESAFRLLPVHPESLHLLGCHFKGGYYVDRCLPMGCSISCAYFEAFSTFIE 246
Query: 604 SLLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R R G+ V+ YLDDFL V +L V + L + + P+ L
Sbjct: 247 WVVRKRAGVSAVIHYLDDFLCVGPGHTMLCAVLLRTVQAVADLFGVPLAPDKTEGPSTCL 306
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP+DK L + A+K L +SLLG L+FA +IPMGR
Sbjct: 307 RFLGIEIDTVRQECRLPQDKIQQLKEEVAYAQAAKKVTLRQLQSLLGKLNFACRIIPMGR 366
Query: 722 LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIF----PRQVQ--HFI 774
+ SR + + +R H +N L W L + ++ PR + HF
Sbjct: 367 VFSRNLALATAGIRQ-PQHFIRLNKGHKEDLGVWKTFLQGFNGKLYWHNQPRANEEFHFF 425
Query: 775 STDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVM 830
+ A G+G+ + + W RE + E+F + A+ L P L + V+
Sbjct: 426 TDAAGSGGFGAYFQGKWCAAPWPREWTETKLTSNLTLLELFPIIVAMELWGPQLANQAVV 485
Query: 831 VQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+DN +VV + + S +L ++ + L + A+ +PG N +ADSLSR
Sbjct: 486 FYTDNMSVVMAINNL-TSGSRPVLCLLKHLVLRCLQLNVKFGAKHVPGYTNEIADSLSR 543
>gi|308500992|ref|XP_003112681.1| hypothetical protein CRE_30766 [Caenorhabditis remanei]
gi|308267249|gb|EFP11202.1| hypothetical protein CRE_30766 [Caenorhabditis remanei]
Length = 838
Score = 111 bits (277), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 130/520 (25%), Positives = 220/520 (42%), Gaps = 36/520 (6%)
Query: 425 RLRRFVDAWIRLGAPAPLVRIV-SGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
RL + W L ++ ++ +GY IP PL ++ +A + + I ++
Sbjct: 200 RLSEHIQFWCELTGERWILEVIKNGYEIPLDKHFPLPAPEGMRKMAKNNMNFVITEINKL 259
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
ETGV+ +S + V + R +L+L LN+ LSP +F + + FL
Sbjct: 260 RETGVVSVAESP--MVVSPLQVARNGEKLRLILDLSKLNKGLSPARFRQEDWKTVWPFLS 317
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYN----GDVLAMTCLPFGLATAPQAFASLS 599
+ Y + D Y H+ I LA S + L LPFGL+TAP F +
Sbjct: 318 EACYAATFDFRSGYHHIKISEASSDLLAFSLSDPPSSPFLKFNALPFGLSTAPWLFTKIF 377
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
+ R+ G+ + +YLDD L++ + E ++ L + G V KS+ P+
Sbjct: 378 RPLVGRWRAAGINIFLYLDDGLILAKTREEAERAVRMVREDLKAAGVCVAEDKSNWVPSA 437
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
+LGI D + L ++ +L + + L S+ ++ + + GYLS S I
Sbjct: 438 QFTWLGIRGDLSERTVRLTGKRENSLRDQISLLKRSRAPSVLDRQKMCGYLS--SLTIVA 495
Query: 720 GRLHSRRIQRQASLL--------RLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQV- 770
G R ++ AS++ R G+ ++ + +L++W L I +
Sbjct: 496 GHEAIGRQRQMASVVAEETDGLERAGSIR-RQLSEGEISELDFWKEKLESGGMIRGMEEE 554
Query: 771 ---QHFISTDASDLGWGSQVDSS------FLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
Q F+ TDAS G G+ + + +S L QN +E+ AV A+ +
Sbjct: 555 FEPQWFLFTDASAEGLGAVLKNGSGQTVMRMSELGGTGFQNESSALRELRAVLMAVE-RM 613
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF--LLSQDWRIHILAQFIPGA 879
+ V++ +D+Q V LR+ G+ +L S E+++ L S +I +IP
Sbjct: 614 ASWKRGAVLIHTDSQAAVIILRK--GSMKRALQSVAERVWESLRSIGQAKYI---WIPRE 668
Query: 880 YNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDLFA 919
N AD SR DW + A E +WG D FA
Sbjct: 669 QNKEADEASRDFDYDDWAVQNWAFEWAQKRWGEVKCDWFA 708
>gi|301614961|ref|XP_002936954.1| PREDICTED: hypothetical protein LOC100494000 [Xenopus (Silurana)
tropicalis]
Length = 965
Score = 109 bits (272), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 156/359 (43%), Gaps = 15/359 (4%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G M +D+ A+ +P+ L + G CLP G + + F + S ++
Sbjct: 581 RGALMAKVDVESAFRLLPVHPESLHLLGCHFKGGYYVDRCLPMGCSISCSYFEAFSTFIE 640
Query: 604 SLLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R R G+ V+ YLDDFL V +L V + L + + P+ L
Sbjct: 641 WVVRKRAGVSAVIHYLDDFLCVGPGHTMLCAVLLRTVQAVADLFGVPLAPDKTEGPSTCL 700
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP+DK L + A+K L +SLLG L+FA +IPMGR
Sbjct: 701 RFLGIEIDTVRQECRLPQDKIQQLKEEVAYAQAAKKVTLRQLQSLLGKLNFACRIIPMGR 760
Query: 722 LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIF----PRQVQ--HFI 774
+ SR + + +R H +N L W L + ++ PR + HF
Sbjct: 761 VFSRNLALATAGIRQ-PQHFIRLNKGHKEDLGVWKTFLQGFNGKLYWHNQPRANEEFHFF 819
Query: 775 STDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVM 830
+ A G+G+ + + W RE + E+F + A+ L P L + V+
Sbjct: 820 TDAAGSGGFGAYFQGKWCAAPWPREWTETKLTSNLTLLELFPIIVAMELWGPQLANQAVV 879
Query: 831 VQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+DN +VV + + S +L ++ + L + A+ +PG N +ADSLSR
Sbjct: 880 FYTDNMSVVMAINNL-TSGSRPVLCLLKHLVLRCLQLNVKFGAKHVPGYTNEIADSLSR 937
>gi|327286490|ref|XP_003227963.1| PREDICTED: cadherin-related family member 2-like [Anolis
carolinensis]
Length = 2278
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 109/262 (41%), Gaps = 61/262 (23%)
Query: 505 VPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKT 564
VPK +GG RPVL+L+GLN +++PKKF ++ I L G + SIDL AYFHV I
Sbjct: 1434 VPKKDGGQRPVLDLRGLNNYINPKKFRMVTLSTILPLLPDGAWFASIDLKDAYFHVAIAP 1493
Query: 565 THQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVN 624
H R+LA LPFG++TAP+ F + VA+ LR +G+ V Y+
Sbjct: 1494 QHHRYLAFMVAQKAFCFQVLPFGISTAPRVFTKVMAVVAAHLRLQGITVYPYI------- 1546
Query: 625 QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLT 684
+G D R +LP+D+ +
Sbjct: 1547 --------------------------------------VIGADIDSTTGRAYLPDDRFQS 1568
Query: 685 LGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQ----------RQASLL 734
L L TLL +S LG+L+ + V P L R +Q Q+ +
Sbjct: 1569 LRTALLTLLQGPLPRARDVQSALGHLASTTVVTPYAGLRMRPLQMWFLRVFDPLTQSQNI 1628
Query: 735 RLGAPHLTPINPAVLPKLEWWL 756
RL P+ V L WWL
Sbjct: 1629 RL------PVPAYVSQSLHWWL 1644
>gi|301616980|ref|XP_002937928.1| PREDICTED: hypothetical protein LOC100497221 [Xenopus (Silurana)
tropicalis]
Length = 1037
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 169/400 (42%), Gaps = 20/400 (5%)
Query: 531 SLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLAT 590
S R+ +G M +D+ A+ +P+ L + G CLP G +
Sbjct: 640 SFDEAIRLVKEAGRGALMAKVDVESAFRLLPVHPESIHLLGCHFKGGYYVDRCLPMGCSI 699
Query: 591 APQAFASLSNWVASLLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIV 648
+ F + S ++ ++R R G+ V+ YLDDFL V +L V + L +
Sbjct: 700 SCAYFEAFSTFIEWVVRKRAGVSAVIHYLDDFLCVGPGHTMLCAVLLQTVQEVADLFGVP 759
Query: 649 NLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLG 708
+ P+ L+FLGI D LP DK L + +K L +SLLG
Sbjct: 760 LAPDKTEGPSTCLRFLGIEIDTIRQECRLPLDKVQQLKEEVGYAQTAKKVTLRQLQSLLG 819
Query: 709 YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIF- 766
L+FA +IPMGR+ SR + + +R H +N L W L + ++
Sbjct: 820 KLNFACRIIPMGRVFSRNLALATAGIRQ-PQHFIRLNKGHKEDLGVWQTFLQGFNGKLYW 878
Query: 767 ---PRQVQ--HFISTDASDLGWGSQVDSSFLSG----LWSREQQNWHINKKEMFAVHQAL 817
PR + HF + A G+G+ + + WS ++ ++ E+F + A+
Sbjct: 879 QSQPRANEEFHFFTDAAGSGGFGAYFQGKWCAAPWPSQWSEDKLTSNLTLLELFPIIVAM 938
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
L P L + V+ +DN +VV + + S +L ++ + L + A+ +P
Sbjct: 939 ELWGPQLANQAVVFYTDNMSVVMAINNL-TSGSRPVLCLLKHLVLRCLQQNVRFGAKHLP 997
Query: 878 GAYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDL 917
G N +ADSLSR + L+ A Q G PC DL
Sbjct: 998 GYTNEIADSLSRFQWDRFRRLAPEAAAQ-----GEPCPDL 1032
>gi|357630417|gb|EHJ78554.1| hypothetical protein KGM_22403 [Danaus plexippus]
Length = 137
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/131 (44%), Positives = 83/131 (63%), Gaps = 7/131 (5%)
Query: 930 FQVSRHVAILLLLSSGRRVHDLTLLSLDPDHFQELDDFVVFWPVFGSKTDSSSHLQSGWK 989
+ VSRH A LLLL+S RR+HDLTLL +D +D + FW F SK D+++H SG +
Sbjct: 12 YDVSRHAATLLLLASCRRIHDLTLLRIDNQSLLNEEDNLTFWSAFASKNDNANHRHSGLR 71
Query: 990 IKENSSDPLFCIPT--WIRHLSTLSQRRMGSRPLTSLFITTRGIVQPASRSVIAGWVKTA 1047
+K + P+ + T WI+ + +LS + + L FIT RG+V+PASR++I GWVK+
Sbjct: 72 LK---THPIQNLNTNLWIKRVLSLSSDQ--RKDLNHSFITPRGVVKPASRTMIGGWVKSL 126
Query: 1048 LRGANIIASPG 1058
L+ I ASPG
Sbjct: 127 LKDTGIEASPG 137
>gi|301620539|ref|XP_002939631.1| PREDICTED: hypothetical protein LOC100485640 [Xenopus (Silurana)
tropicalis]
Length = 1037
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 169/400 (42%), Gaps = 20/400 (5%)
Query: 531 SLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLAT 590
S R+ +G M +D+ A+ +P+ L + G CLP G +
Sbjct: 640 SFDEAIRLVKEAGRGALMAKVDVESAFRLLPVHPESIHLLGCHFKGGYYVDRCLPMGCSI 699
Query: 591 APQAFASLSNWVASLLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIV 648
+ F + S ++ ++R R G+ V+ YLDDFL V +L V + L +
Sbjct: 700 SCAYFEAFSTFIEWVVRKRAGVSAVIHYLDDFLCVGPGHTMLCAVLLQTVQEVADLFGVP 759
Query: 649 NLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLG 708
+ P+ L+FLGI D LP DK L + +K L +SLLG
Sbjct: 760 LAPDKTEGPSTCLRFLGIEIDTIRQECRLPLDKVQQLKEEVGYAQTAKKVTLRQLQSLLG 819
Query: 709 YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIF- 766
L+FA +IPMGR+ SR + + +R H +N L W L + ++
Sbjct: 820 KLNFACRIIPMGRVFSRNLALATAGIRQ-PQHFIRLNKGHKEDLGVWQTFLQGFNGKLYW 878
Query: 767 ---PRQVQ--HFISTDASDLGWGSQVDSSFLSG----LWSREQQNWHINKKEMFAVHQAL 817
PR + HF + A G+G+ + + WS ++ ++ E+F + A+
Sbjct: 879 QSQPRANEEFHFFTDAAGSGGFGAYFQGKWCAAPWPSQWSEDKLTSNLTLLELFPIIVAM 938
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
L P L + V+ +DN +VV + + S +L ++ + L + A+ +P
Sbjct: 939 ELWGPQLANQAVVFYTDNMSVVMAINNL-TSGSRPVLCLLKHLVLRCLQQNVRFGAKHLP 997
Query: 878 GAYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDL 917
G N +ADSLSR + L+ A Q G PC DL
Sbjct: 998 GYTNEIADSLSRFQWDRFRRLAPEAAAQ-----GEPCPDL 1032
>gi|301619279|ref|XP_002939023.1| PREDICTED: hypothetical protein LOC100485694 [Xenopus (Silurana)
tropicalis]
Length = 1235
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 116/437 (26%), Positives = 178/437 (40%), Gaps = 66/437 (15%)
Query: 527 PKKFSLINHFRIP-------------------------SFLQK---GDYMISIDLSQAYF 558
P KF LI+H P + +Q+ G M D+ A+
Sbjct: 799 PGKFRLIHHLSFPKGESVNDDIDPELCSVSYTSFDRALTVVQEAGPGALMAKADIESAFR 858
Query: 559 HVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVV-Y 616
+PI L D+ CLP G + + F S ++ +++ R G R +V Y
Sbjct: 859 LLPIHPECHHLLGCEVEDDIYVDLCLPMGCSISCSYFEKFSTFLEWVVKKRTGSRNLVHY 918
Query: 617 LDDFLLVNQDPR-----ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPH 671
LDDFL V Q +LE+ ++ L ++ P L FLGI D
Sbjct: 919 LDDFLCVGQAGTEWCSFLLEVLKEVTTEFGVPL-----APDKTVGPVTCLSFLGIEIDTV 973
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQA 731
LPEDK LG + ++ K + +SLLG +FA VIPMGR+ RR+ +
Sbjct: 974 AGMTRLPEDKLKDLGTRVSEMVGRKKATVRMVQSLLGKFNFACRVIPMGRVFCRRL---S 1030
Query: 732 SLLRLG--APHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQVQHFISTDASD- 780
+LLR H + V LE W + L + ++Q F TDAS
Sbjct: 1031 ALLRGSKEGHHHVRLTTEVRGDLEVWAHFLREFNGKTIFQGKEVSNEEIQLF--TDASGS 1088
Query: 781 LGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVMVQSDNQ 836
+G+G+ + S+ + W + + K E+F + A+ L L + V+ + DN
Sbjct: 1089 VGFGAYLRGSWCAAPWPKGWTEGGLVKNLCFLELFPIVVAVFLWGKQLANRRVVFRCDNL 1148
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDW 896
V L +Q T SL ++ + + L + A +PG N VAD+LSR++ W
Sbjct: 1149 GAVQALNKQSAT-SLEVVRLLRALVLQCLKINLCFRAVHVPGVANVVADALSRAQ----W 1203
Query: 897 HLSRSATEQIFLKWGVP 913
R + WG P
Sbjct: 1204 ERFRQVAPEAD-SWGAP 1219
>gi|301621282|ref|XP_002939985.1| PREDICTED: hypothetical protein LOC100487960 [Xenopus (Silurana)
tropicalis]
Length = 911
Score = 107 bits (266), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 155/359 (43%), Gaps = 15/359 (4%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G M +D+ A+ +P+ L Y G CLP G + + F + S ++
Sbjct: 527 RGALMAKVDVESAFRLLPVHPESLHLLGCHYKGGYYVDRCLPMGCSISCAYFEAFSTFIE 586
Query: 604 SLLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R R G+ V+ YLDDFL V +L V + L + + P+ L
Sbjct: 587 WVVRKRAGVSAVIHYLDDFLCVGPGHTMLCAVLLQTVQAVADLFGVPLAPDKTEGPSTCL 646
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP DK L + +K L +SLLG L+FA +IPMGR
Sbjct: 647 RFLGIEIDTIKQECRLPLDKIQQLREEVGYAQTAKKVTLRQLQSLLGKLNFACRIIPMGR 706
Query: 722 LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIF----PRQVQ--HFI 774
+ SR + + +R H +N L W L + ++ PR + HF
Sbjct: 707 VFSRNLALATAGIRQ-PQHFIRLNKGHKEDLGVWQTFLQGFNGKLYWQSQPRANEEFHFF 765
Query: 775 STDASDLGWGSQVDSSFLSGLWSRE----QQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
+ A G+G+ + SG W E + ++ E+F + A+ L L + V+
Sbjct: 766 TDAAGSGGFGAYFQGKWCSGPWPSEWIEDKLTSNLTLLELFPIIVAMELWGTQLANRAVV 825
Query: 831 VQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+DN +VV + + S +L ++ + L + A+ +PG N +ADSLSR
Sbjct: 826 FYTDNMSVVMAINNL-TSGSRPVLCLLKHLVLRCLQLNVRFGAKHVPGYTNEIADSLSR 883
>gi|66799593|ref|XP_628722.1| hypothetical protein DDB_G0294346 [Dictyostelium discoideum AX4]
gi|60462053|gb|EAL60309.1| hypothetical protein DDB_G0294346 [Dictyostelium discoideum AX4]
Length = 541
Score = 106 bits (265), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 106/211 (50%), Gaps = 12/211 (5%)
Query: 422 VGGRLRRFVDAWIRLGAPAPLVRIVSGYAI----PFSAKPPLVPLCSLQHLATPVSSAMS 477
VGGRL W LG P +V+G I F P +P+ + P S ++
Sbjct: 299 VGGRLFHHKQVWKELGLPNFCQEVVNGLKIHLLPNFKPMPNPIPISIPE---GPKSDCIT 355
Query: 478 LHIQEMLETGVLKRL----DSTTGFLSRLFLVPK-GNGGTRPVLNLKGLNQFLSPKKFSL 532
+Q++L ++++ S F S +F VPK G RPVL+LK LN +++ + F +
Sbjct: 356 KEVQDLLLDDAIEQVLPNRYSKRVFYSNVFTVPKPGTNLHRPVLDLKRLNTYINNQSFKM 415
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
+PS +++G YM+ +D+ +AY HV + ++ + G +PFGL+TAP
Sbjct: 416 EGIKNLPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMPFGLSTAP 475
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLV 623
+ F L V +LR + V+ YLDD L+V
Sbjct: 476 RIFTMLLRHVLRMLRDINVSVIAYLDDLLIV 506
>gi|301603917|ref|XP_002931619.1| PREDICTED: hypothetical protein LOC100497926 [Xenopus (Silurana)
tropicalis]
Length = 598
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 159/359 (44%), Gaps = 23/359 (6%)
Query: 548 MISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLR 607
M D+ A+ +P+ L + G CLP G + + F + S ++ ++R
Sbjct: 214 MGKTDIEAAFRLLPVHPECVHLLGCYFQGGYYVDRCLPMGCSISCAYFEAFSTFLEWVVR 273
Query: 608 S-RGM-RVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFL 664
+ G VV YLDDFL +D + + + G + +K+ P L+FL
Sbjct: 274 TVSGFPSVVHYLDDFLCAGPRDSDTCRVILETMQVMFSKFGVPLAHEKTE-GPTTCLKFL 332
Query: 665 GIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHS 724
GI D LP DK L ++ +LA+K L +SLLG L+FA VIPMGR+ +
Sbjct: 333 GIELDSERQECRLPADKVSDLRVVIGRMLAAKKVTLKQMQSLLGKLNFACRVIPMGRIFA 392
Query: 725 RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTD------- 777
RR+ + + +R H +N + L+ WL+ L + I Q + +TD
Sbjct: 393 RRLAQATAGVR-EEHHFIRLNRSHKEDLQVWLSFLQNFNGIAMWQEKGLTNTDLHLYTDA 451
Query: 778 ASDLGWGSQVDSSFLSGLWSREQQNWHINK-------KEMFAVHQALSLNLPLLQSSVVM 830
A G+G+ +S+ +G W E WH E+F V ++ L + +
Sbjct: 452 AGSKGFGAIFGTSWCTGEWPDE---WHAKGLTRNLVFLELFPVLVSVVLWGDGFRDKAIT 508
Query: 831 VQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
SDN VV + S ++ + ++ LL + I A+ +PG N +AD+LSR
Sbjct: 509 FHSDNMGVVQCVNSL-TADSAPVIGLLRQLVLLCLERNILFRARHVPGVQNEIADALSR 566
>gi|340383235|ref|XP_003390123.1| PREDICTED: hypothetical protein LOC100636300 [Amphimedon
queenslandica]
Length = 857
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 122/466 (26%), Positives = 212/466 (45%), Gaps = 47/466 (10%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETG-VLKRLDSTTGFLSRLF 503
I SG+ I F+ PL+ S + P + ++ + G + R + +S L
Sbjct: 395 IQSGFRIGFNRHSPLLSASSNMACSNP--EVIHKYLAREVSLGRMFTRPPQSDVHISPLG 452
Query: 504 LVPKGN--GGTRPVLNL---------KGLNQFLSPKKFSLINHFRIPSFL---QKGDYMI 549
++PK N G R +++L G++ S ++ ++ + S + +G +++
Sbjct: 453 IIPKKNKPGKWRLIVDLSSPHGQSVNDGIDTSASSLRYPTVDD--LASLIVRNGRGAFIV 510
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
D+ +AY ++PI L + +NG V LPFGL +AP+ F+++++ +L
Sbjct: 511 KADIKEAYRNIPIHPDDYGLLGVQWNGTVFVDKFLPFGLRSAPKIFSAVADAAQWVLTEN 570
Query: 610 GMRVVV-YLDDFLLVNQDPRILEIQGKLAV-SILGSLGWIVNLQKSSL-SPAPVLQFLGI 666
G+R V+ YLDDF LV ++ + ++ K+ + S+ GSLG + L+ S L P L FLGI
Sbjct: 571 GVRQVLHYLDDFALVERN-QATALESKITLCSVFGSLG--LPLEPSKLEGPTTCLTFLGI 627
Query: 667 MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRR 726
D ++ LP DK L N L+ + K + +SL G L A V+ GR
Sbjct: 628 EVDTVSLQLRLPTDKLDRLLNELKEVQGRKVISKRELQSLTGLLQHACKVVRPGRAF--- 684
Query: 727 IQRQASLLRLGAP--HLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG 784
+QR +L ++G+ H +N A + WW +F V H+
Sbjct: 685 LQRLYALEKVGSAPDHHIRLNVAARADVMWW--------QLF---VSHWNGVSMLCDPKH 733
Query: 785 SQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRR 844
S+ D W + + I KE+ V A +L + +++ DNQ VV L
Sbjct: 734 SKADIQ-----WPLDLEPTSIQVKELIPVVIAAALFGRSWRGKLIVFSVDNQAVVHILNN 788
Query: 845 QGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+S L+ + + + + A+ IPG N++AD+LSR+
Sbjct: 789 THSKES-HLMHLIRLLAFYASYYDFWFRAEHIPGRCNTLADALSRN 833
>gi|301603744|ref|XP_002931543.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Xenopus (Silurana) tropicalis]
Length = 749
Score = 104 bits (260), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 117/446 (26%), Positives = 190/446 (42%), Gaps = 56/446 (12%)
Query: 473 SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL 532
++AM +I E L+ G ++ S G + F V K +GG RP ++ +GLN+ ++ L
Sbjct: 50 TAAMKEYISENLQRGFIRPSTSPAG--AGFFFVEKKDGGLRPCIDYRGLNKITVKNRYPL 107
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
+ L+ +DL AY + I+ + A + +PFGL AP
Sbjct: 108 PLISELFDQLKGAKIFSKLDLRGAYNLIRIRGGDEWKTAFNTRDGHYEYLVMPFGLCNAP 167
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
F N + L G VVVYLDD L+ +QD Q K A+S L L+K
Sbjct: 168 AVFQEFVNDIFRDLL--GKSVVVYLDDILIFSQDLETHRSQVKEALSRLRENSLFAKLEK 225
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLS 711
+ P + FLG + R + + +++ A + W L S +++ ++
Sbjct: 226 CTFE-VPKISFLGYIIS---SRGFEMDPAKVS---------AIQKWPLPQSTKAIQRFIG 272
Query: 712 FASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKLEWWLNALPLSSPIFPRQV 770
FA++ + S RI SL+R G P+ P P L + +A +S+ +
Sbjct: 273 FANYYRQFIKGFSSRIAPILSLIRKGGRPNCWP--PVALEAFQSLKDAF-ISASVLRHPE 329
Query: 771 QH---FISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKEMFAVHQA 816
H FI DASD+G G+ + ++ S +S +QN+ I +E+ AV A
Sbjct: 330 PHLPFFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLAVKLA 389
Query: 817 LSLNLPLLQ--SSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW-----RI 869
L LL+ S V + +D+ K+L L +++ W R
Sbjct: 390 LEEWRHLLEGASHPVTIYTDH-------------KNLEFLQSLKRQNPRQARWSLFFSRF 436
Query: 870 HILAQFIPGAYNSVADSLSRSKSLPD 895
+ + + PG N AD+LSRS S D
Sbjct: 437 NFVLTYRPGTKNRKADALSRSFSPED 462
>gi|291237364|ref|XP_002738605.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 556
Score = 103 bits (256), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 110/418 (26%), Positives = 182/418 (43%), Gaps = 31/418 (7%)
Query: 502 LFLVPKGNGGTRPVLNL-----KGLNQFLSPKKFSLINHFRIP---SFLQK---GDYMIS 550
L L PK +GG R +++L +N +S + SL+ + RI +F+ K G +
Sbjct: 100 LGLRPKKSGGFRIIMDLSQPTLDSVNDNISKEDHSLV-YSRIDDAVAFIHKHGHGSLLAK 158
Query: 551 IDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN---WVASLLR 607
ID+ A+ P++ L + G LPFGL +AP F +++ W+ S R
Sbjct: 159 IDVKHAFRLCPVRKEDWHLLGFFWEGCYFFDRVLPFGLRSAPYLFNRIADAIHWIVSH-R 217
Query: 608 SRGMRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI 666
+R + YLDDFL V + + + + LG + +K P V+ FLG+
Sbjct: 218 ARNTDFLHYLDDFLTVGLANTNACQHNMDVMLQSCHHLGVPIATEKVE-GPCSVITFLGV 276
Query: 667 MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRR 726
D + LP+DK L L + L T + SL+G LSFA IP GR+ RR
Sbjct: 277 ELDTVNMVIRLPKDKLADLLIKLPSCLTRHTCSKRELLSLIGCLSFACKCIPAGRIFLRR 336
Query: 727 IQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-------LSSPIFPRQVQHFISTDAS 779
+ S+ + + ++WW + LP L +P + + + TDAS
Sbjct: 337 MI-DISMTATSLSQVITLTDEFWHDVQWWCDFLPSWNGTASLLNPNWIPSPEFELFTDAS 395
Query: 780 -DLGWGSQVDSSFLSGLWSREQQN---WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDN 835
LG+G+ + + W N + I KE+ + + + L + DN
Sbjct: 396 ATLGYGAFYKGHWFTNTWPTFITNDPLYSIACKELLPILLSSLIWGHLWFGLRIRFHCDN 455
Query: 836 QTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
+VV + ++G + ++ V +F + H++ I G N +ADSLSR + L
Sbjct: 456 ISVVQ-IWKKGSSSCPRIMQLVRLLFFTAASNNFHVMISHISGFNNDIADSLSRQQIL 512
>gi|242825158|ref|XP_002488383.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218712201|gb|EED11627.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 748
Score = 102 bits (255), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 117/442 (26%), Positives = 187/442 (42%), Gaps = 50/442 (11%)
Query: 491 RLDSTTGFLSRLFLVPKGNGGTRPVLNLK-----GLNQFLSPKKFSLINHFRIPSFLQK- 544
+L + F+S L LVPK +GG R + +L G+NQ + P +S I + +I +
Sbjct: 282 KLAFKSSFISPLGLVPKHDGGWRRIHDLSWPPGCGVNQGI-PDNWSAIEYMKIDDIYDQI 340
Query: 545 -----GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFA 596
G +I D+ A+ VP+ +Q LA +N CL FGLATAP FA
Sbjct: 341 IKAGSGCTIIKRDIKDAFRIVPVAQDNQYLLAFQWNNSTYVECCLLFGLATAPFLFNLFA 400
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG---KLAVSILGSLGWIVNLQKS 653
+WV L + YLDDF+ V P + G K+ ++ LG N +K
Sbjct: 401 EALHWVLQCLLPT-FYINHYLDDFIAVTHSPSMSNPAGAFDKVYHTVTDYLGIPRNNRKD 459
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
++ LGI D LP++K L + + +L SL G L+F
Sbjct: 460 EQGTCVIV--LGIQIDSIAMEARLPQEKLCRATLDAAAALNATSLSLKQVESLTGLLAFC 517
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF-----PR 768
S V+ +GR + + G+ I V LEWW ++L L + + R
Sbjct: 518 SRVVRLGRTRLQSLYTFQIAFPRGSCTRRRIPYEVRDDLEWWRDSLSLFNGVLLVDPCRR 577
Query: 769 QVQHFISTDASDLGWG-----------------SQVDSSFLSGLWSREQQNWHINKKEMF 811
+ H + TDAS G G Q+ S + L + + HIN E+
Sbjct: 578 NITH-LYTDASTTGQGLFFFSSKSTLDCWRTHCHQLQSCNAASLALAQDAHAHINTTEVD 636
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQ--GGTKSLSLLSEVEKIFLLSQDWRI 869
A+ Q L +++ +D+ T + L + G ++ L S + +L+ I
Sbjct: 637 AILQGFLLFSHHWLHHTLVIHTDSSTAYTGLSKGFLRGPPNVPLKS----LLILAAARDI 692
Query: 870 HILAQFIPGAYNSVADSLSRSK 891
I+ ++P N++AD+LSR+
Sbjct: 693 QIVPHWLPSGENTLADALSRNN 714
>gi|291231400|ref|XP_002735652.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 572
Score = 102 bits (255), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 111/454 (24%), Positives = 204/454 (44%), Gaps = 48/454 (10%)
Query: 477 SLHIQEMLETGVLK--RLDSTTGFLSR----------LFLVPKGNGGTRPVLNL-----K 519
SL ++++ V+K +L T G S L + PK GG R +++L
Sbjct: 96 SLKHKDIVSKAVMKEVKLGHTIGPFSEPPFLNFVTNSLGIRPKKTGGHRLIMDLSQPTNN 155
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQK------GDYMISIDLSQAYFHVPIKTTHQRFLALS 573
+N F+ + ++L+ + RI + G + +D+ A+ P++ L
Sbjct: 156 SVNDFIPKENYTLV-YARIDDAIAMINKHGPGSLLAKVDIQHAFRLCPVRKQDWHLLGFK 214
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLR--SRGMRVVVYLDDFLLVNQ-DPRIL 630
++G LPFGL +AP F ++ + ++R ++ ++ YLDDFL V + +
Sbjct: 215 WDGHYYFDRVLPFGLRSAPFLFNRIATALEWVVRHQAKTSDIIHYLDDFLAVGPPNHQSC 274
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
+ + ++ +LG V K P+ ++ FLG+ D + LP DK L NIL
Sbjct: 275 QRSKDIILNTCSTLGIPVAANKVE-GPSSIITFLGVELDTVDMVIRLPADK---LENILS 330
Query: 691 TL-LASKTWNLDSAR--SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
+ L + ++ SL+G LSFA +P GRL RR+ A+ + ++
Sbjct: 331 AIPLWANRYHCSKRELLSLIGTLSFACKCVPAGRLFLRRMIDLATTAS-SINQIIVLSND 389
Query: 748 VLPKLEWWLNALP-------LSSPIFPRQVQHFISTDASD-LGWGSQVDSSFLSGLWSRE 799
L+WW LP + + + + TDAS + G+ + + + WS +
Sbjct: 390 FRLDLQWWWEFLPNWNGSARILATSWCLTPNMNLYTDASSVIACGAFYNKQWFTLPWSPD 449
Query: 800 QQNWH----INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
+ + + I KE+F + + + L +M DN+ +V+ + ++G ++ ++S
Sbjct: 450 KCSINPPLSIEWKELFPILISCLIWGHLWHGQKIMFHCDNEAIVN-IWKKGTSRCQRIMS 508
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
V IF + + H++ I G NS+ADSLSR
Sbjct: 509 LVRAIFFTAANGNFHVMIAHIRGTNNSIADSLSR 542
>gi|313244116|emb|CBY14967.1| unnamed protein product [Oikopleura dioica]
Length = 725
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 106/444 (23%), Positives = 189/444 (42%), Gaps = 31/444 (6%)
Query: 420 ELVGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLH 479
+L+GG + +++ +RL P Y I + + + + Q + ++
Sbjct: 92 KLMGGNTKTYINGRLRLTKFVP-----RPYEITSQGRTRTLAIPN-QESVRERADDVTQQ 145
Query: 480 IQEMLETGVLKRLDSTTG--FLSRLFLVPKGNGGTRPVLN---LKGLNQFLSPKKFSLIN 534
++E +++G ++ +T + LV + TR LN +K L ++ P K I+
Sbjct: 146 LKEWIKSGSVELWKATKKPWLTAGFILVDRPEKETRVCLNGSIMKPLEKYTFPCKLDSIS 205
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
L+KGD M+ D + + +P+ ++ + G L FGL AP
Sbjct: 206 L--AIQLLKKGDLMVKFDDKRGFHQMPLAEESKKMACFEWGGKKFVNNILCFGLPAAPGI 263
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLV----NQDPRILEIQGK-------LAVSILGS 643
+ S++ + LR G++ +YLDD L+V ++ R+ ++GK + + L +
Sbjct: 264 YQSMNLVGINFLRKNGIKATLYLDDRLVVITPKSEAHRLRLLEGKEVCKEAWVTAATLVA 323
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA 703
LG VN++KS P ++FLG + D + + +P+ + L L L+ L S+ +L
Sbjct: 324 LGGFVNIEKSEFIPKQRMEFLGFILDSETETIEIPQSRWLALKTKLQQALNSERTSLKEL 383
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLN----AL 759
+ G + + V P R+ R+I + + + AV + + WLN L
Sbjct: 384 ERIRGTQASMAEVFPNMRMLIRQITMLICQAEIQGAYEVRLTRAVKAEWKTWLNFENSGL 443
Query: 760 PLSSPIFPRQ-VQHFISTDASDLGWGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQA 816
RQ I TDAS+ V+ +S W E + HI KE AV A
Sbjct: 444 KRCWKQQDRQNAGIIIYTDASNHAGAIVVEEWNISEKFAWDEEYASDHICIKEAVAVKYA 503
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVS 840
L L++ V DN +VV
Sbjct: 504 LEWYAKRLENKKVTFLVDNSSVVE 527
>gi|301631707|ref|XP_002944937.1| PREDICTED: hypothetical protein LOC100495730 [Xenopus (Silurana)
tropicalis]
Length = 761
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 160/369 (43%), Gaps = 33/369 (8%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +PI Q L + ++ CLP G + + F S+++
Sbjct: 371 GALMAKADIESAFRLLPIHPECQHLLGCKLDDEIYVDLCLPMGCSISCSYFEKFSSFLEW 430
Query: 605 LLRSR--GMRVVVYLDDFLLVNQDPR-----ILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
++R R +V YLDDFL V Q +LE+ ++ L ++ P
Sbjct: 431 VVRKRTGSQDLVHYLDDFLCVGQAYTEWCSFLLEVLKEVTAEFGIPLA-----PDKTVGP 485
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
L FLGI D LPEDK L + ++ K + + +SLLG L+FA VI
Sbjct: 486 VTCLSFLGIEIDTVAGMTRLPEDKLTDLSKSVGEMMGRKKVTVKTVQSLLGKLNFACRVI 545
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQ 769
PMGR+ RR+ + G H+ + V L+ W L + +
Sbjct: 546 PMGRVFCRRLSALLKGSQEGHHHVR-LTVEVRGDLQIWDQFLKEFNGKVIFQGKEVTNEE 604
Query: 770 VQHFISTDAS-DLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLL 824
+Q F TDAS +G+G+ + S+ + W E + K E+F + A+ L L
Sbjct: 605 IQLF--TDASGSVGFGAYMGGSWCAAYWPEEWLQVDVIKNLCFLELFPIVVAVFLWGTQL 662
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI--LAQFIPGAYNS 882
+ V+ + DN V L +Q T S L+ Q +I++ +A +PG N
Sbjct: 663 ANKRVVFRCDNLGAVQALNKQSAT---SPEVVRLLRVLVLQCLKINLCFIAVHVPGVANV 719
Query: 883 VADSLSRSK 891
VAD+LSR++
Sbjct: 720 VADALSRAQ 728
>gi|291238548|ref|XP_002739190.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 465
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/413 (24%), Positives = 189/413 (45%), Gaps = 36/413 (8%)
Query: 506 PKGNGGTRPVLNL-----KGLNQFLSPKKFSLINHFRIPSFLQK------GDYMISIDLS 554
PK GG R +++L +N F+ + ++L+ + R+ + G + +D+
Sbjct: 30 PKKTGGHRLIMDLSQPTNNSVNDFIPKENYTLV-YARVDDAIAMINKHGPGSLLAKVDIQ 88
Query: 555 QAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLR--SRGMR 612
A+ P++ L ++G LPFGL +AP F ++ + ++R ++
Sbjct: 89 HAFRLCPVRKQDWHLLGYKWDGHYYFDRVLPFGLRSAPFLFNRIATALEWVVRHQAKTSD 148
Query: 613 VVVYLDDFLLVNQ-DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPH 671
++ YLDDFL V + + + + ++ +LG V K P+ ++ FLG+ D
Sbjct: 149 IIHYLDDFLAVGPPNHQSCQRSKDIILNTCSTLGIPVAANKVE-GPSSIITFLGVELDTV 207
Query: 672 LDRMWLPEDKQLTLGNILRTL-LASKTWNLDSAR--SLLGYLSFASFVIPMGRLHSRRIQ 728
+ LP DK L NIL T+ L + ++ SL+G LSFA +P GRL RR+
Sbjct: 208 DMVIRLPADK---LENILSTIPLWANRYHCSKRELLSLIGTLSFACKCVPAGRLFLRRMI 264
Query: 729 RQASLLRLGAPHLTPINPAVLPKLEWWLNALP-------LSSPIFPRQVQHFISTDASD- 780
A+ + ++ L+WW LP + + + + TDAS
Sbjct: 265 DLATTAS-SINQIIILSNDFRLDLQWWWEFLPNWNGSARILATSWCLTPNTNLYTDASSV 323
Query: 781 LGWGSQVDSSFLSGLWSREQQNWH----INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQ 836
+ G+ + + + WS ++ + + I KE+F + + + L +M DN+
Sbjct: 324 IACGAFYNKQWFTLPWSPDKCSINPPLSIEWKELFPILISCLIWGHLWHGQKIMFHCDNE 383
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+V+ + ++G ++ ++S V IF + + H++ I G NS+ADSLSR
Sbjct: 384 GIVN-IWKKGSSRCQRIMSLVRAIFFTAANGNFHVMIAHIRGTNNSIADSLSR 435
>gi|301609771|ref|XP_002934433.1| PREDICTED: hypothetical protein LOC100494049 [Xenopus (Silurana)
tropicalis]
Length = 572
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 161/366 (43%), Gaps = 31/366 (8%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ L + G LP G + + F + S ++
Sbjct: 189 GALMAKTDIEAAFRLLPVHPESLHLLGCQFGGSFYVDRSLPMGCSISCSYFETFSTFLEW 248
Query: 605 LLRSR-GM-RVVVYLDDFLLV--NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R + GM ++ YLDDFL + N P I + + G + +K+ P+
Sbjct: 249 VIRQQAGMVSIIHYLDDFLCIGPNNSP-ACAILLQTVQRVTSEFGVPLAPEKTE-GPSTC 306
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
++FLGI D LP DK L ++ + SK L +SLLG L+FA +I MG
Sbjct: 307 IKFLGIEIDTVSQECRLPMDKISALKEDIQRAINSKKLTLKQLQSLLGKLTFACRIITMG 366
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL--------SSPIFPRQVQH 772
R+ SRR+ S L+ H + + L W L + + +Q
Sbjct: 367 RVFSRRLAMATSGLK-KPHHFVRLRAELKKDLGIWAKFLQAYNGRSYWQKATDTNKDLQL 425
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPL 823
F TDA+ GS ++ SG W E+ ++W ++ E+F + A+ L L
Sbjct: 426 F--TDAA----GSCGFGAYFSGSWCAEKWPESWAAGGLIRNLTLLELFPILVAIELWGHL 479
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
+ V+ +DN +VV + Q + S +L+ + + L + IH AQ +PG N +
Sbjct: 480 FSNRNVIFNTDNMSVVLAINNQ-TSSSGPVLALLRHLVLRCLQFNIHFRAQHLPGVVNDI 538
Query: 884 ADSLSR 889
ADSLSR
Sbjct: 539 ADSLSR 544
>gi|313214780|emb|CBY41042.1| unnamed protein product [Oikopleura dioica]
Length = 725
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 161/371 (43%), Gaps = 35/371 (9%)
Query: 495 TTGFLSRLFLVPKGNGGTRPVLN---LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISI 551
T GF+ LV + TR LN +K L ++ P K I+ L+KGD MI
Sbjct: 167 TAGFI----LVDRPEKETRVCLNGSIMKPLEKYTFPCKLDSISL--AIQLLKKGDLMIKF 220
Query: 552 DLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGM 611
D + + +P+ ++ + G L FGL AP + S++ + LR G+
Sbjct: 221 DDKRGFHQMPLAEESKKMACFEWGGKKFVNNILCFGLPAAPGIYQSMNLVGINFLRKNGI 280
Query: 612 RVVVYLDDFLLV----NQDPRILEIQGK-------LAVSILGSLGWIVNLQKSSLSPAPV 660
+ +YLDD L++ ++ R I+GK L + L +LG VN++KS P
Sbjct: 281 KATLYLDDRLVIVTPKSKAHRQRLIEGKEVCKEAWLTAATLVALGGFVNIEKSEFIPKQR 340
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
++FLG + D + + +P+++ L L L+ L S+ +L + G + + V P
Sbjct: 341 MEFLGFILDSETETIEIPQNRWLALKAKLQQALRSERTSLKDLERIRGTQASMAEVFPNM 400
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW--WLNALPLSSPIFPRQVQH------ 772
R+ R+I + + + AV K EW WLN +S + R Q
Sbjct: 401 RMLIRQITMLICQAEIQGAYEVRLTRAV--KTEWKTWLNF--ENSGLKRRWKQQNRQNAG 456
Query: 773 -FISTDASDLGWGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV 829
I TDAS+ ++ +S W E + HI KE AV AL L++ V
Sbjct: 457 IIIYTDASNHAGAIVIEEWNISEKFAWDEEYASDHICIKEAVAVKYALEWYAKRLENKKV 516
Query: 830 MVQSDNQTVVS 840
DN +VV
Sbjct: 517 TFLVDNSSVVE 527
>gi|384493499|gb|EIE83990.1| hypothetical protein RO3G_08695 [Rhizopus delemar RA 99-880]
Length = 314
Score = 100 bits (250), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 92/181 (50%), Gaps = 6/181 (3%)
Query: 449 YAIPFSAKPP--LVPLCSLQH-LATPVSSAMSLHIQEMLETGVLKRLDST---TGFLSRL 502
Y +P ++ PP S H A + + IQ++L +K + + TG+ S L
Sbjct: 18 YCLPITSPPPPCFSHRTSCHHPAAFSIQQVIDQEIQKLLSKQAIKMIQTHHQPTGYHSNL 77
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPI 562
F++PK +GG RPVLN K LN+ L K F + +I + LQ GD++ SI L A+ H+ I
Sbjct: 78 FVIPKNDGGLRPVLNRKPLNRHLPIKHFKMETMQKITNLLQPGDFLTSIYLQDAFHHILI 137
Query: 563 KTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
H+ L + LPFGL+ +P F + V R +G+++ YLDD L+
Sbjct: 138 HPRHRHLLRFRWKQQTYQYRVLPFGLSLSPLIFTKVLKPVVKWARRQGIQITAYLDDLLI 197
Query: 623 V 623
+
Sbjct: 198 M 198
>gi|301630389|ref|XP_002944304.1| PREDICTED: hypothetical protein LOC100485517 [Xenopus (Silurana)
tropicalis]
Length = 1028
Score = 100 bits (250), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 162/377 (42%), Gaps = 27/377 (7%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +P+ Q L + G+ LP G + + + F+S W
Sbjct: 642 GALMGKTDIEAAFRLLPVHPDSQHLLGCQFKGNYYVDRSLPMGCSISCSYFECFSSFLEW 701
Query: 602 VASLLRSRGMRVVV-YLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
V + R G+ + YLDDFL V ++ I I + G + + K+ P
Sbjct: 702 V--IKRESGISSLTHYLDDFLFVGPKNSNICAILMAKMEEMANKFGIPLAIDKTE-GPTT 758
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
L+FLGI D + LPEDK + L + + + L ++LLG L+FA +IP
Sbjct: 759 CLKFLGIEIDSVNMQCRLPEDKLIELFRAVNEAITLRKITLKKLQALLGRLNFACKIIPA 818
Query: 720 GRLHSRRIQRQASLLRLGAP---HLTPINPAVLPKLEWW------LNALPLSSPIFPRQV 770
GR+ SRR+ SL +G H +N LE W N + +
Sbjct: 819 GRIFSRRL----SLATVGIKKPYHFISLNGEHKRDLEIWRLFLQSFNGISVWQDKELSNE 874
Query: 771 QHFISTDAS-DLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQ 825
+ + TDA+ G G + + S W E I K E+ + +L + L
Sbjct: 875 EINLYTDAAGGKGMGGYFNGRWFSEPWPSEWYKADITKNMVFLELLPILTSLEVWGEELG 934
Query: 826 SSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
+ V+ DN VV L + + S+ ++ + ++ LL + I + A+ +PG N VAD
Sbjct: 935 NKKVIFYCDNMGVVQVLNKMNAS-SIPVVRLMRRLVLLCMNSNIWLKARHVPGVSNDVAD 993
Query: 886 SLSRSKSLPDWHLSRSA 902
SLSR + W+L SA
Sbjct: 994 SLSRLQLDRFWNLVPSA 1010
>gi|301608772|ref|XP_002933958.1| PREDICTED: hypothetical protein LOC100498577, partial [Xenopus
(Silurana) tropicalis]
Length = 983
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 168/373 (45%), Gaps = 41/373 (10%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +PI L ++G CLP G A + +AF++ W
Sbjct: 626 GALMAKADIESAFRLLPIHPECHHLLGCWFDGAFFVDLCLPMGCAISCAHFEAFSTFLEW 685
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V RSR VV YLDDF V Q +LE ++AV G + +K+
Sbjct: 686 VVKT-RSRCNSVVHYLDDFFCVGQAKSDTCFHLLETLQQVAVG----FGVPLAAEKTE-G 739
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA V++FLG+ D LP K L + + +L K L +S+LG L+FA V
Sbjct: 740 PATVMRFLGLEIDSVAGECRLPTQKVTDLMHEVGSLRRDKKATLKRLQSVLGKLNFACRV 799
Query: 717 IPMGRLHSRRIQRQASLLRLGAPH----LTPINPAVLPKLEWWL-----NALPLSSPIFP 767
IP+GR+ SRR+ + + R APH LT A L E +L L +S
Sbjct: 800 IPVGRVFSRRLAQATAGAR--APHHHVRLTKEVKADLGVWEAFLADFNGRVLFQASETTA 857
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALS 818
++++ + TDA+ GS+ ++ +G W Q Q W ++ E+F + AL
Sbjct: 858 QELELY--TDAA----GSKGFGAYFAGRWCAAQWPQEWAEAGLVRNLVFLELFPIVVALF 911
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
+ L+ ++ SDN VV + + S +L + + L I A+ + G
Sbjct: 912 IWETELRDKRIVFYSDNMGVVQGINNWSAS-SQPVLRLLRALVLRCLRLNISCRARHVEG 970
Query: 879 AYNSVADSLSRSK 891
N++AD+LSRS+
Sbjct: 971 CKNNIADALSRSQ 983
>gi|327278553|ref|XP_003224026.1| PREDICTED: serine/threonine-protein kinase WNK2-like [Anolis
carolinensis]
Length = 2370
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 68/115 (59%)
Query: 531 SLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLAT 590
S ++ R S +GDY +SIDL AYFHV I+ H+RFL L T LPFGL T
Sbjct: 1814 SPTDNIRAFSDQMRGDYFVSIDLRDAYFHVAIRKGHRRFLCLKVLNQTYQFTVLPFGLVT 1873
Query: 591 APQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
AP+ F + VA+ LR +G+ V YLDD+LLV DP +L + +++L SLG
Sbjct: 1874 APRVFTKVVAVVAAHLRLQGITVFPYLDDWLLVESDPVLLRRHVDVTLTLLDSLG 1928
>gi|340383518|ref|XP_003390264.1| PREDICTED: hypothetical protein LOC100635540 [Amphimedon
queenslandica]
Length = 1253
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 121/461 (26%), Positives = 197/461 (42%), Gaps = 43/461 (9%)
Query: 448 GYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK 507
G+ I F L P S T +S +I+E + G L+ ST LS + L+PK
Sbjct: 465 GFTIGFKPGSALEPATSNMSSVTDKPEVVSKYIEEEVAAGRLR--PSTVTQLSPIGLIPK 522
Query: 508 GN--GGTRPVLNL-----KGLNQFLSPKKFSLINHFRIPSFLQK------GDYMISIDLS 554
N G R +++L + +N + P K +++ + Q+ G M IDL
Sbjct: 523 KNKPGCFRMIVDLSSPKGRCVNDGI-PSKLCSLHYASVAEAAQRMVQCGRGALMAKIDLK 581
Query: 555 QAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA-SLLRSRGMRV 613
AY VP++ L + + G A LPFGL +AP F+++++ +A +L RS
Sbjct: 582 SAYRMVPVRPEDSLLLGIQWEGITYADFALPFGLRSAPILFSAVADGLAWALFRSGVEFS 641
Query: 614 VVYLDDFLLVNQDPRILEIQG-KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
+ YLDDF ++ + ++A+ + LG V +K PA L FLGI +
Sbjct: 642 IHYLDDFFFCGPPSSLVCRRAMEIALPLCQKLGLPVAPEKVE-GPATSLTFLGIQLNSDA 700
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQAS 732
+ LP++K +L L + ++ + LLG+L+ A+ V+ GR R I
Sbjct: 701 MSLSLPQEKLASLKLRLSAWVNAQAATKQELQELLGHLNHAAAVVRPGRSFVRAIIEAMK 760
Query: 733 LLRLGAPH-LTPINPAVLPKLEWW---------LNALPLSSPIFPRQVQHFISTDASDLG 782
RL PH T ++ ++WW ++ALP S P+ ++ +DAS
Sbjct: 761 RPRL--PHQKTRLDANCKADIKWWSLFVADWNGISALPPSCPV------TWVISDASG-S 811
Query: 783 WGS----QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTV 838
WG Q S+ W +I KE+ V A ++ V+ SDN V
Sbjct: 812 WGCGAFDQYHGSWFQLPWPASWAEVNIAAKELLPVVIAAAVWGRRWAGQRVLFLSDNTAV 871
Query: 839 VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
V+ L + + L ++ +F + A IPG
Sbjct: 872 VAALSSRSACHPI-LAHLLKCLFFWEAKFDFEHSADHIPGC 911
>gi|301632689|ref|XP_002945414.1| PREDICTED: hypothetical protein LOC100493982 [Xenopus (Silurana)
tropicalis]
Length = 970
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 110/396 (27%), Positives = 167/396 (42%), Gaps = 49/396 (12%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +PI L +G++ CLP G + + F S+++
Sbjct: 581 GALMAKADIESAFRLLPIHPECHHLLGCELDGEIYVDLCLPMGCSISCSYFEKFSSFLEW 640
Query: 605 LLRSR-GMRVVV-YLDDFLLVNQDPR-----ILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+++ R G R +V YLDDFL V Q +LE+ ++ L ++ P
Sbjct: 641 VVKKRTGSRNLVHYLDDFLCVGQASTEWCSFLLEVLKEVTAEFGVPLA-----PDKTVGP 695
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
L FLGI D LPEDK L + +L + + +SL+G L+FA VI
Sbjct: 696 VTCLSFLGIEIDTVAGMTRLPEDKLRDLCEGVGRMLGRQKATVKLIQSLVGKLNFACRVI 755
Query: 718 PMGRLHSRRIQRQASLLR--LGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFP 767
PMGR+ RR+ ++LLR + H + V L W L +
Sbjct: 756 PMGRVFCRRL---SALLRGSMDGHHHVRLTKEVRGDLHIWAQFLKEFNGRIIFQGKEVTN 812
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW------HINKKEMFAVHQALSL 819
++Q + TDAS GS ++L G W W ++ E+F + A+ L
Sbjct: 813 DEIQLY--TDAS----GSVGFGAYLGGRWCAAHWPAGWAEGLLKNLCFLELFPIVVAVFL 866
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI--LAQFIP 877
L + V+ + DN V L +Q T SL L+ Q +I++ A +P
Sbjct: 867 WRKELANRRVIFRCDNLGAVQALNKQSAT---SLEVVRLLRVLVLQCLKINLGFRAIHVP 923
Query: 878 GAYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVP 913
G N VAD+LSRS+ W R + L WG P
Sbjct: 924 GVKNVVADALSRSQ----WERFRQVAPEADL-WGAP 954
>gi|313236395|emb|CBY11713.1| unnamed protein product [Oikopleura dioica]
Length = 1568
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/318 (26%), Positives = 135/318 (42%), Gaps = 18/318 (5%)
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
L+KGD M+ D + Y +P+ ++ + G L FGL AP + S++
Sbjct: 224 LLKKGDLMVKYDDKKGYHQMPLAKDSRKMACFEWGGKTFVNNILCFGLPAAPGIYQSMNQ 283
Query: 601 WVASLLRSRGMRVVVYLDDFLLV----NQDPRILEIQGK-------LAVSILGSLGWIVN 649
+ LR G++ +YLDD L+V ++ R I+GK + + L +LG VN
Sbjct: 284 VGINFLRKNGIKATLYLDDRLVVITPKSEAQRKRLIEGKEVCREAWITAATLVALGGFVN 343
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGY 709
++KS P ++FLG + D + + +P+ + L L+ ++S+ +L + G
Sbjct: 344 IEKSEFIPKQRMEFLGFILDSQTETVEIPQGRWNALKAKLQKAISSQRTSLKELERIRGT 403
Query: 710 LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLN----ALPLSSPI 765
+ + V P R+ R+I + AV + E WL L S
Sbjct: 404 QASMAEVFPNMRMLIRQITMLICQAENQGAYEVRATRAVKAEWETWLKFEEAGLKRSWKQ 463
Query: 766 FPRQ-VQHFISTDASDLGWGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQALSLNLP 822
RQ I TDAS+ V+ +S W E + HI KE AV AL
Sbjct: 464 QSRQDAGIIIYTDASNHAGAIVVEEWSVSEKFAWDEEYADDHICIKEAVAVKYALEWYAK 523
Query: 823 LLQSSVVMVQSDNQTVVS 840
L++ V DN +VV
Sbjct: 524 RLENKRVTFLVDNSSVVD 541
>gi|301632809|ref|XP_002945473.1| PREDICTED: hypothetical protein LOC100491597 [Xenopus (Silurana)
tropicalis]
Length = 525
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 106/375 (28%), Positives = 160/375 (42%), Gaps = 45/375 (12%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +PI L G++ CLP G + + F S+++
Sbjct: 135 GALMAKADIESAFRLLPIHPECHHLLGCELEGEIYVDLCLPMGCSISCSYFEKFSSFLEW 194
Query: 605 LLRSR-GMRVVV-YLDDFLLVNQDPR-----ILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
++R R G R +V YLDDFL V + +LE+ ++A + L ++ P
Sbjct: 195 VVRKRAGARNLVHYLDDFLCVGKASTECCSFLLEVLQEVATELGVPLA-----PDKTVGP 249
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
A L FLGI D LPEDK L + L+ + + +SLLG L+FA VI
Sbjct: 250 ATCLSFLGIEIDTVAGMTRLPEDKLKDLSKGVGELMGRQKATVKIIQSLLGKLNFACRVI 309
Query: 718 PMGRLHSRRIQRQASLLR--LGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFP 767
PMGR+ RR+ +LLR H + V L W L +
Sbjct: 310 PMGRIFCRRL---GALLRGTKEGHHHVRLTAEVKGDLHIWDQFLKEFNGKVIFQGKEVTN 366
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNWH-------INKKEMFAVHQALS 818
++Q F TDAS GS ++L G W W+ + E+F + A+
Sbjct: 367 EEIQLF--TDAS----GSVGFGAYLGGCWCAAHWPAGWYESGLLKNLCFLELFPIVVAVF 420
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI--LAQFI 876
L L + V+ + DN V L +Q + SL L+ Q +I++ A +
Sbjct: 421 LWGTELANRRVVFRCDNLGAVQALNKQSAS---SLEVVRLLRVLVLQCLKINLSFRAVHV 477
Query: 877 PGAYNSVADSLSRSK 891
PG N VAD+LSR++
Sbjct: 478 PGVKNVVADALSRAQ 492
>gi|301624306|ref|XP_002941447.1| PREDICTED: hypothetical protein LOC100497400 [Xenopus (Silurana)
tropicalis]
Length = 908
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 122/438 (27%), Positives = 184/438 (42%), Gaps = 81/438 (18%)
Query: 514 PVLNLK----GLNQFLSPKKFSLINHFRIP-------------------SFLQ------- 543
P+ NL+ G+ P KF LI+H P SF Q
Sbjct: 455 PLSNLRISPLGVVPKKEPGKFRLIHHLSYPKGGSVNDDIDKELSSVSYTSFDQAVEMVRT 514
Query: 544 --KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASL 598
KG M +D+ A+ +PI L + G+ CLP G A + + F+S
Sbjct: 515 AGKGALMAKVDIESAFRLLPIHPDCHHLLGCRFEGNYFVDLCLPMGCAISCAYFEMFSSF 574
Query: 599 SNWVASLLRSRGMRVVV-YLDDFLLV---NQDP--RILEIQGKLAVSILGSLGWIVNLQK 652
WV + +S G VV YLDD+L V N D +LE ++AVS L ++
Sbjct: 575 VEWV--IKKSSGYTSVVHYLDDYLCVGPANSDICFYLLETIQEVAVSFGVPLA-----KE 627
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
+ P +QFLGI D LPE K + L + + + +K L +SLLG L+F
Sbjct: 628 KTEGPTTSIQFLGIQIDSMKGECRLPEGKVVELRHAVEEMGKAKRATLRQVQSLLGKLNF 687
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL------------P 760
A +IP+GR+ SRR+ QA+ A H I+ V L W + L
Sbjct: 688 ACRIIPVGRVFSRRLA-QATAGAQVAHHHVGISKEVRADLAVWGHFLRDFNGKVLFQDRE 746
Query: 761 LSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSRE--QQNW-------HINKKEMF 811
+SSP ++Q + TDA+ GS ++L+G W Q+W +I E+F
Sbjct: 747 ISSP----EMQLY--TDAA----GSSGFGAYLAGQWCAAPWPQDWVESELVRNIAFLELF 796
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI 871
+ A+ + L ++ SDN VV + S +L + + L + +
Sbjct: 797 PIVVAMYVWQQELSDRRIVFFSDNMMVVQAINSWSAA-SPPVLRLLRALVLRCLEMNVKC 855
Query: 872 LAQFIPGAYNSVADSLSR 889
A + G N +AD+LSR
Sbjct: 856 RAVHVEGEKNVIADALSR 873
>gi|301629681|ref|XP_002943965.1| PREDICTED: hypothetical protein LOC100497818, partial [Xenopus
(Silurana) tropicalis]
Length = 912
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 160/365 (43%), Gaps = 29/365 (7%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G + D+ A+ +P+ + L + G CLP G + + + F S ++
Sbjct: 524 RGALLAKSDIESAFRLLPVHSDCYHLLGCQFEGQFYFDMCLPMGCSISCRYFECFSTFLE 583
Query: 604 SLLR--SRGMRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R + V+ YLDDFL V Q + ++ + G ++ +K+ P V
Sbjct: 584 WVVRHETGSNSVIHYLDDFLFVGPQATNVCQLLLSTFQFFMSRFGVPLSKEKTE-GPTTV 642
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L FLGI D LPE+K L + +L +K L S +SLLG L FA ++P+
Sbjct: 643 LSFLGIEIDTVALVFRLPEEKLQKLKGTVAEMLTAKKVTLRSMQSLLGLLVFACRIMPIA 702
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQVQH 772
R+ SRR+ ++ H I + L+ W L + + + ++
Sbjct: 703 RVFSRRLSLATRGIK-HPHHFIRITKQLREDLKVWQTFLEHYNGHTCLMDTEVSNEELSL 761
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPL 823
F TDA+ GS + L+ W EQ NW ++ E+F + A+ +
Sbjct: 762 F--TDAA----GSTGFGAILAQSWCAEQWPDNWALVGLCKNLTLLELFPIVVAVQIWGQR 815
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
+ + +DN +VV + R + SL +L+ + + L D+ I A+ +PG N+
Sbjct: 816 ISGKKICFWTDNMSVVFAINRL-TSSSLPVLALLRHLVLRCLDFNIWFRARHVPGRVNTA 874
Query: 884 ADSLS 888
AD+LS
Sbjct: 875 ADALS 879
>gi|340381946|ref|XP_003389482.1| PREDICTED: hypothetical protein LOC100637556 [Amphimedon
queenslandica]
Length = 550
Score = 96.3 bits (238), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 161/381 (42%), Gaps = 27/381 (7%)
Query: 519 KGLNQFLSPKKF-SLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD 577
G++ L+ ++ S+ N I L +G + DL AY VP+ + L + + G
Sbjct: 132 DGIDPSLASIRYASVDNAVEIIRSLGRGALLTKFDLKDAYRIVPVHPSDHHRLGIMWEGA 191
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKL 636
+ CLPFGL +AP+ F+++++ +A + G+ V YLDDFL + L
Sbjct: 192 IFVDCCLPFGLRSAPKFFSAIADSLAWVFGCYGLVSQVHYLDDFLFLEPPGHTNSSIVPL 251
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASK 696
SI +LG + K+ PA L FLGI+ D + LP+DK + +L+
Sbjct: 252 VSSICCTLGVSLAAHKTE-GPATCLTFLGIVVDFTHWELRLPDDKLELVYALLQVWSRHS 310
Query: 697 TWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH-LTPINPAVLPKLEWW 755
+ S LG+LS A+ VI R RQ PH + + WW
Sbjct: 311 SCRRRELESFLGHLSHAAVVI--------RQARQ--------PHFFVRLTRGAKADISWW 354
Query: 756 LNALPL--SSPIFPRQV--QHFISTDASDLGWGS-QVDSSFLSGLWSREQQNWHINKKEM 810
L L FP H + A G G QV + W E + I E+
Sbjct: 355 LCFLRRWNGRSFFPPSTPSVHVYTDAAGSFGCGGFQVRGPWFQLAWPGELRR-SIAVLEL 413
Query: 811 FAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
V A L + S+V SDN+ VV L + G + SL + + LL+ H
Sbjct: 414 IPVVIAAMLWGSSWRGSMVCFHSDNEAVVKVLNK-GFARDSSLSHLLRCLALLAAFHGFH 472
Query: 871 ILAQFIPGAYNSVADSLSRSK 891
I A +PG N AD+LSR+
Sbjct: 473 ICAIHVPGWLNDAADALSRNN 493
>gi|308493269|ref|XP_003108824.1| hypothetical protein CRE_11685 [Caenorhabditis remanei]
gi|308247381|gb|EFO91333.1| hypothetical protein CRE_11685 [Caenorhabditis remanei]
Length = 1143
Score = 96.3 bits (238), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 151/604 (25%), Positives = 236/604 (39%), Gaps = 64/604 (10%)
Query: 353 LESRRRLVEP---RDPHLASLLLRARRGKKSSSPQNLEPPGRVSLKVQTLQKPQRCSSPV 409
+ES+RR +EP R S A RG S+ + G S V+ K +CS
Sbjct: 153 MESKRRRLEPYPGRQQWFRSE--SAFRGHGSARQNGGQGYGN-SRDVKRSVKCFKCSQFG 209
Query: 410 NPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVS-GYAIPFSAKPPLVPLCSLQHL 468
+ D E RL ++ W + + ++ ++ GY I ++ L L+
Sbjct: 210 HYATDCMSFPE----RLSEAIEFWGNICSSEWVLSVIEDGYIIQLDSRVTLPEPQGLRPS 265
Query: 469 ATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPK 528
+ I+ + E GVL+R D +S L +V +G R +L+L LN+ L P
Sbjct: 266 VLRHKDFLFAEIERLEEEGVLERSDRLPRAVSPLHVVEQGKK-KRMILDLSELNKSLVPP 324
Query: 529 KFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA----MTCL 584
+F L N FL+ ++ + D Y H+ I + L+ S + A L
Sbjct: 325 RFKLENMKTAWPFLENANFAATFDFKSGYHHIKIHRDSRDLLSFSLSNPPAAPYFFFKGL 384
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL 644
PFGLATAP F + + R+ G+++ +YLDD L+V + + + L
Sbjct: 385 PFGLATAPWLFTKIFKVLVRKWRAEGIKMFLYLDDGLIVGETEYEVARASRRVRGDLAEA 444
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
G V +KS P +LG D + E + T ++L L S ++
Sbjct: 445 GVCVAEEKSFWVPDAKFTWLGYECDLVAREVRGTEKRMATWQSVLDELRRSVAPSVLDRM 504
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN------PAVLPKLEWW--- 755
LG L ASF + G + R + + + N P + ++E+W
Sbjct: 505 KFLGCL--ASFELVAGDVGVGRARWLMQTVGESQKKMESKNTRKEKSPGEIREIEFWKVH 562
Query: 756 -----LNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSR--------EQQN 802
+L P F + TDAS G G + LW E+Q+
Sbjct: 563 GEELLKRSLLEIEPCF----DFLLFTDASARGVGGLLKDKKGCVLWKMSEIGDSNFEEQS 618
Query: 803 --WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSE---- 856
W +E+ AV A + + ++ S + V D+Q VS LRR L L+E
Sbjct: 619 SAW----RELTAVDVASARLIGQVRGS-IQVLVDSQAAVSVLRRGSMKPELHALAERVWK 673
Query: 857 -VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCI 915
E I S W IP N AD SR+ DW ++ +Q WG +
Sbjct: 674 NFESIGGCSFLW--------IPREQNVEADEASRNFDFDDWGIADRVFKQAQRLWGEIKV 725
Query: 916 DLFA 919
D FA
Sbjct: 726 DWFA 729
>gi|313228592|emb|CBY07384.1| unnamed protein product [Oikopleura dioica]
Length = 674
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 83/319 (26%), Positives = 138/319 (43%), Gaps = 18/319 (5%)
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L+KGD M+ D + + +P+ ++ + G L FGL AP + S++
Sbjct: 147 KLLKKGDLMVKFDDKRGFHQMPLAEESKKMACFEWGGKKFVNNILCFGLPAAPGIYQSMN 206
Query: 600 NWVASLLRSRGMRVVVYLDDFLLV----NQDPRILEIQGK-------LAVSILGSLGWIV 648
+ LR G++ +YLDD L+V ++ R+ ++GK + + L +LG V
Sbjct: 207 LVGINFLRKNGIKATLYLDDRLVVITPKSEAHRLRLLEGKEVCKEAWVTAATLVALGGFV 266
Query: 649 NLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLG 708
N++KS P ++FLG + D + + +P+ + L L L+ L S+ +L + G
Sbjct: 267 NIEKSEFIPKQRMEFLGFILDSETETIEIPQSRWLALKAKLQQALNSERTSLKELERIRG 326
Query: 709 YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLN----ALPLSSP 764
+ + V P R+ R+I + + + AV + + WLN L
Sbjct: 327 TQASMAEVFPNMRMLIRQITMLIGQAEIQGAYEVRLTRAVKAEWKTWLNFENSGLKRCWK 386
Query: 765 IFPRQ-VQHFISTDASDLGWGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQALSLNL 821
RQ I TDAS+ V+ +S W E + HI KE AV AL
Sbjct: 387 QQDRQNAGIIIYTDASNHAGAIVVEEWNISEKFAWDEEYASDHICIKEAVAVKYALEWYA 446
Query: 822 PLLQSSVVMVQSDNQTVVS 840
L++ V DN +VV
Sbjct: 447 KRLENKKVTFLVDNSSVVE 465
>gi|301609767|ref|XP_002934431.1| PREDICTED: hypothetical protein LOC100493707 [Xenopus (Silurana)
tropicalis]
Length = 805
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 38/372 (10%)
Query: 545 GDYMISIDLSQAY-FHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
G M D+ A+ +PI L +G++ CLP G + + F S+++
Sbjct: 414 GALMAKADIESAFRLLLPIHPECHHLLGCELDGEIYVDLCLPMGCSISCSYFEKFSSFLE 473
Query: 604 SLLRSR--GMRVVVYLDDFLLVNQDPR-----ILEIQGKLAVSILGSLGWIVNLQKSSLS 656
+++ R +V YLDDFL V Q +LE+ ++ L ++
Sbjct: 474 WVVKKRTGSKNLVHYLDDFLCVGQASTEWCSFLLEVLKEVTTEFGVPLA-----PDKTVG 528
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
P L FLGI D LPEDK LG + +L + +SLLG L+FA V
Sbjct: 529 PVTCLSFLGIEIDTVAGMTRLPEDKLTDLGRGVGEMLGRTKATVRMVQSLLGKLNFACRV 588
Query: 717 IPMGRLHSRRIQRQASLLRLG--APHLTPINPAVLPKLEWWLNALP--------LSSPIF 766
IPMGR+ RR+ ++LLR H + V L+ W L +
Sbjct: 589 IPMGRVFCRRL---STLLRGSKEGHHHVRLTREVKGDLQIWAQFLESFNGRIIFQGKEVT 645
Query: 767 PRQVQHFISTDASD-LGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNL 821
++Q F TDAS +G+G+ + S+ + W + K E+F + A+ L
Sbjct: 646 NEEIQLF--TDASGSVGFGAYLGGSWCAACWPAGWAERGLLKNLCFLELFPIVVAVFLWG 703
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL--AQFIPGA 879
L + V+ + DN V L +Q T S L+ Q +I++ A +PG
Sbjct: 704 AKLANRRVVFRCDNLGAVQALNKQSAT---SPEVVRLLRVLVLQCLKINLCFRAIHVPGV 760
Query: 880 YNSVADSLSRSK 891
N VAD+LSR++
Sbjct: 761 ENVVADALSRAQ 772
>gi|340384546|ref|XP_003390772.1| PREDICTED: enzymatic polyprotein-like [Amphimedon queenslandica]
Length = 578
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 114/444 (25%), Positives = 203/444 (45%), Gaps = 34/444 (7%)
Query: 476 MSLHIQEMLETGVLKRLD-STTGFLSRLFLVPKGN--GGTRPVLNL---------KGLNQ 523
+S +IQE G L+ + +S + ++PK + G R +++L G++
Sbjct: 65 VSAYIQEEFGGGKLRLAAPNEVVHVSPIAIIPKTSQPGKYRLIVDLSAPDGSSVNDGISP 124
Query: 524 FLSPKKFSLINHFRIPSFLQKGD--YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
L+ ++ ++ + ++ G +M +D+ AY VP+ Q L + + G
Sbjct: 125 ALATLSYTSVDE-AVAMVIEAGQSAWMAKLDIQSAYRKVPVHPADQPLLGIHWRGVTFCD 183
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVV-YLDDFLLVNQ-DPRILEIQGKLAVS 639
LPFGL +AP F ++++ ++ + G+ ++ YLDDF ++ + E + AV+
Sbjct: 184 RALPFGLRSAPLLFTAVADGLSWAMECCGVHNLIHYLDDFFFCSRAESSECEQALRTAVN 243
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
+ LG K + P + FLGI D + LPEDKQ L +IL+ K +
Sbjct: 244 LCQRLGLPAAPHK-VVGPCTTITFLGIEIDSCRWELRLPEDKQTRLMSILQEWKHDKRQS 302
Query: 700 LDSA--RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLN 757
+ +SL+G L++A+ ++ GR +R + +AS + H +N + WW
Sbjct: 303 VTKKQLQSLVGLLNYAARIVRPGRPFTRSLI-EASKIPQEPDHWVRLNVECRSDISWWQE 361
Query: 758 ALPL--SSPIFP-RQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNWH---INKKE 809
L +P R + +DAS WG G W + Q ++W+ I KE
Sbjct: 362 FLRFWNGRSFYPGRPWAATVYSDASGR-WGCGA-VCLPVGQWFQVQWPESWYSISIAAKE 419
Query: 810 MFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF-LLSQDWR 868
+ + ++++ V+ + DN VV+ LR G+ LL+ + KI LLS +
Sbjct: 420 LLPIVVSVAVWGREWAGLRVLSRCDNDAVVACLR--SGSAKDPLLAHLLKILALLSALHK 477
Query: 869 IHILAQFIPGAYNSVADSLSRSKS 892
I ++A + G N AD+LSR S
Sbjct: 478 IQLVAVHVAGRSNGAADALSRGNS 501
>gi|291238550|ref|XP_002739191.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 943
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/408 (25%), Positives = 184/408 (45%), Gaps = 47/408 (11%)
Query: 513 RPVLNLKGLNQFLSPKKFSLINHFRIPSFL------QKGDYMISIDLSQAYFHVPIKTTH 566
R ++ +N + FSL ++ +I + +KG ++ D+ A+ +PI +
Sbjct: 525 RDNIDTPSINDLIDKDDFSL-SYVKIDDAISAIQRKEKGAWLCKTDIVDAFKLIPINPSL 583
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVY--LDDFLLVN 624
+S+ LPFG ++P+ F +LS+ + + + ++ LDDFL V
Sbjct: 584 WHLYGISWEKHFYFFVRLPFGSRSSPKIFDNLSSAICWIAHNNYHIDTIFHLLDDFLTV- 642
Query: 625 QDPRILEIQGKLAVS--ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQ 682
D + +AV I LG + K+ + P+ VL++LGI D + LPEDK
Sbjct: 643 -DAPTFDADRTMAVLTLIFRRLGVPLAPHKT-VGPSTVLEYLGITLDTVEIQARLPEDKL 700
Query: 683 LTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT 742
+ + +L T L KT SLLG+L+FA VI GR R+ + ++ H+T
Sbjct: 701 VRIRKLLDTFLTRKTCTKRELLSLLGHLNFACRVIIPGRTFISRLIELSKGVKKLQHHVT 760
Query: 743 PINPAVLPKL-------EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGL 795
+ + L EW ++ L + + P + TDAS +G G F GL
Sbjct: 761 ISSESKQDILMWRSFLSEWNGVSMFLEANLTPATGLQ-LYTDASGIGHG-----GFFRGL 814
Query: 796 WSREQQNWH----INKKEMFAVHQALSLNLPLLQSSV----------VMVQSDNQTVVSY 841
W E+ W ++ ++ Q L P++ +S+ ++ DN V +
Sbjct: 815 WFHER--WSPELTLDDPKLSIAFQEL---YPIVVASILWGHYWCRKRILFNCDNMATV-H 868
Query: 842 LRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ +G +KS +++ + ++ + + + A+ IPG N++ADSLSR
Sbjct: 869 IINKGRSKSPAIMKLMRRLVITAASFDFMFHAEHIPGKINTIADSLSR 916
>gi|384499664|gb|EIE90155.1| hypothetical protein RO3G_14866 [Rhizopus delemar RA 99-880]
gi|384499676|gb|EIE90167.1| hypothetical protein RO3G_14878 [Rhizopus delemar RA 99-880]
Length = 190
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 72/132 (54%)
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+ L+K DY++SIDLS A+ H+P+ ++FL + V T PFGLA+ F
Sbjct: 7 VAHLLRKIDYLVSIDLSDAFLHIPVHPNSRKFLRFKWKSQVYQYTTTPFGLASVLYLFTK 66
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ + R++G+RV YLDD++L + ++ + V L LGW+VN +KS LSP
Sbjct: 67 ICQPILEWARTQGIRVSAYLDDWILAAESKKLALQHTNMLVQQLQQLGWVVNTKKSVLSP 126
Query: 658 APVLQFLGIMWD 669
L+ LG D
Sbjct: 127 TRKLEHLGFCLD 138
>gi|301609896|ref|XP_002934494.1| PREDICTED: hypothetical protein LOC100486224 [Xenopus (Silurana)
tropicalis]
Length = 1709
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 154/373 (41%), Gaps = 17/373 (4%)
Query: 531 SLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLAT 590
S R+ +G M D+ A+ +P+ L + G LP G +
Sbjct: 822 SFDEAIRLVGRAGQGALMAKADVESAFRLLPVHPESLHLLGCYFEGHYYVDRSLPMGCSI 881
Query: 591 APQAFASLSNWVASLLRSRG--MRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWI 647
+ F + S ++ ++R + V+ YLDDFL V Q + + + + G
Sbjct: 882 SCAYFEAFSTFIEWVVREKAGVASVIHYLDDFLCVGPQRSNLCAVLLETLQEVAEQFGVP 941
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLL 707
+ +K+ P L+FLGI D LP DK LTL + +K L +SLL
Sbjct: 942 LAREKTE-GPITCLKFLGIEIDTVRQECRLPMDKVLTLKEEVGYARQAKKVTLKQLQSLL 1000
Query: 708 GYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIF 766
G L+FA +IPMGR+ SR + + +R H +N L W L + ++
Sbjct: 1001 GKLNFACRIIPMGRVFSRSLSMATAGIRH-PHHFIRLNSEHKADLAVWGTFLQDFNGKVY 1059
Query: 767 -PRQVQH-----FISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQA 816
P +V + A G+G+ + + W +E ++ E+F + A
Sbjct: 1060 WPEKVIENPDISLFTDAAGATGFGAYFAGKWCAAGWPKEWATGNLTGNLAFLELFPIIVA 1119
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
+ L L + V+ +SDN V + + S +L+ + + L I A+ I
Sbjct: 1120 VELWGKELSNKTVLFRSDNMAAVLAVNNL-TSSSRPVLALLRHLVLRCLQLNITFRAKHI 1178
Query: 877 PGAYNSVADSLSR 889
PG N +AD+LSR
Sbjct: 1179 PGEINDIADALSR 1191
>gi|291223453|ref|XP_002731724.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 945
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 158/664 (23%), Positives = 268/664 (40%), Gaps = 110/664 (16%)
Query: 305 AFSSTMNTLVEKYPEVREELGSLFSDESVFKN------VSDHLLQYVCGKRAECLESRRR 358
AF + E YP REEL SD N D+ +Q+ K A L+ +
Sbjct: 284 AFGRYKAIMCEVYPGRREELDLYESDIIDIANSFGGTLFYDYHVQFTT-KAAAYLQQKNV 342
Query: 359 LVE--PRDPHLASLLLRARRGKKSSSPQNLEPPGRVSLKVQTLQKPQRC-----SSPVNP 411
V+ RD + +R G+K+++ ++ TL C S
Sbjct: 343 KVDWSVRD---NDIFIRVTAGRKAAT---------CAICSSTLHSYDFCPRRQFVSQRGR 390
Query: 412 PADSRIGAELVGGRLRRFVDAWI----------RLGAPAPLVRIVS-----GYAIPFSAK 456
+ + GRLR F + L A PL+ + S G+ A
Sbjct: 391 DMQQQSNTNDIRGRLRLFFNGKEICNNFNTDKGCLRAVCPLLHLCSTCLREGFLTGIDAV 450
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGF----LSRLFL-VPKGNGG 511
P V C A S +S IQ ++ G L S+ F +S L L V K +G
Sbjct: 451 PTSVFECPNNLTARQNPSVISKLIQAEIDKGYLIGPFSSPPFDVYRISPLGLAVGKYSGK 510
Query: 512 TRPVLNLKG---------LNQFLSPKKFSLI-----NHFRIPSFLQKGDYMISIDLSQAY 557
R +++L LN+ ++ K+SL + + + +M D++ A+
Sbjct: 511 KRLIVDLSSPHDKPHVPSLNELINKDKYSLSYSTVDDAIKELKCVGNTAHMCKFDITDAF 570
Query: 558 FHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN---WVASLLRSRGMRVV 614
+P+ + + + + G+ L FG ++P+ F LS W+A G+R +
Sbjct: 571 KIIPLHPSIWKLHGVKWQGNYYFFVRLVFGSRSSPKIFDMLSTAICWIAQF--KYGIRYI 628
Query: 615 VYL-DDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL------QKSSLSPAPVLQFLGIM 667
++L DDFL + ++ + A+ + + I NL + +L PA L++LGI+
Sbjct: 629 LHLLDDFLTI-------DVHAESAMRSMAVMTMIFNLLRIPLAKHKTLGPAQELEYLGIL 681
Query: 668 WDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR-LHSRR 726
+LP DK+ + + + T SLLG+L+FA+ VI GR S
Sbjct: 682 LSSKDMTAFLPADKKARILTKFAEVTSKNTVTKRQLLSLLGHLNFATRVIIPGRSFLSHL 741
Query: 727 IQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL---------SSPIFPRQVQHFISTD 777
+Q AS+ +L H +N +L W + L S+ + F TD
Sbjct: 742 LQLAASVSKLH--HHISLNLDCRRELAMWEHFLAAWNGAHFFLHSTVTLASDIDIF--TD 797
Query: 778 ASD-LGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV-------- 828
AS +G+G S+ W E + H N+ M AL P++ +++
Sbjct: 798 ASSTVGFGGYFKGSWFCDKWPVELDSQHNNELSM-----ALLELYPIVVAAMLWGTQWSQ 852
Query: 829 --VMVQSDNQTVVSYLRRQGGTKSLSLLSE-VEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
+ DN V +R+ + ++++ + ++ LL+ + LA +PGA+NS+AD
Sbjct: 853 HRISFNCDNLATVHIIRKGRASSHCCIINKLMRRLTLLAMQYNFVFLAHHLPGAHNSIAD 912
Query: 886 SLSR 889
SLSR
Sbjct: 913 SLSR 916
>gi|340381288|ref|XP_003389153.1| PREDICTED: hypothetical protein LOC100634784 [Amphimedon
queenslandica]
Length = 1057
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 91/367 (24%), Positives = 150/367 (40%), Gaps = 48/367 (13%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G +M +DL AY VP+ Q L ++++ + LPFGL +AP+ F +++N +A
Sbjct: 427 RGAWMAKLDLRSAYRRVPVHPDDQPLLGMAWDDRIFCDRALPFGLRSAPKVFTAVANALA 486
Query: 604 SLLRSRGM-RVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++ G+ ++ YLDDF + R ++AV + LG+ V K + PA L
Sbjct: 487 WAMQCEGIGDLIHYLDDFFFWSPATSRDCHTALEIAVPLCNKLGFPVAPHK-VVGPATSL 545
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
FLG D + LPEDK L LR + +SL+G+L+ A+ V+ GR
Sbjct: 546 IFLGTEIDSLRQEIRLPEDKLSFLKQALRQWGDKRAATKREVQSLIGHLNHAAKVVRPGR 605
Query: 722 ------LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL----------SSPI 765
+ + +IQR+ H ++ A + WW L S P+
Sbjct: 606 PFLRGLIDTMKIQRRQH-------HRVRLSVACRGDIVWWQRLLEWRVFLSIGSHPSPPV 658
Query: 766 FPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ 825
R+ H W I KE+ + AL++
Sbjct: 659 LRRKWFHLC---------------------WPPSLSGVSIAPKELVPIVAALAVWGNRWS 697
Query: 826 SSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
V+ DN+ VV + R G L + + +L + + + +P N AD
Sbjct: 698 QGSVLCHCDNEAVVHTIAR-GSAIDPHLNHLLRLLAILQAKLNVSVAVEHVPCVLNGAAD 756
Query: 886 SLSRSKS 892
+LSR +S
Sbjct: 757 ALSRDRS 763
>gi|301611102|ref|XP_002935091.1| PREDICTED: hypothetical protein LOC100498516 [Xenopus (Silurana)
tropicalis]
Length = 1329
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 116/454 (25%), Positives = 181/454 (39%), Gaps = 75/454 (16%)
Query: 511 GTRPVLNLK----GLNQFLSPKKFSLINHFRIP-------------------SFLQ---- 543
G+ P++NL+ G+ P KF LI+H P SF Q
Sbjct: 877 GSPPMMNLRVSPLGVVPKKEPGKFRLIHHLSYPKGGSVNDDIDKELCSVSYTSFDQAVAV 936
Query: 544 -----KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
KG M D+ A+ +PI L + G CLP G A + F +
Sbjct: 937 VRKAGKGALMAKADIESAFRLLPIHPDCHHLLGCWFEGAFFVDLCLPMGCAISCAYFEAF 996
Query: 599 SNWVASLLRSRG--MRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------L 650
S ++ ++R + VV YLDDFL V + IL +L +
Sbjct: 997 STFLEWVIRRKAGYSSVVHYLDDFLCVG------PASSDICFHILDTLREVAEEFGVPLA 1050
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
+ PA +QFLG+ D + LP K L + + +K L +SLLG L
Sbjct: 1051 ADKTEGPATTMQFLGLEVDSVKGQCRLPVSKVTDLREEVGRMRQTKKPTLRQVQSLLGKL 1110
Query: 711 SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW------LNALPL-SS 763
+FA VIP GR+ SRR+ R A+ H + V L W N + L +
Sbjct: 1111 NFACRVIPAGRVFSRRLAR-ATAGATAPHHHVRLGREVRADLGIWEVFLRNFNGVVLFQA 1169
Query: 764 PIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVH 814
P Q + A +G+G ++L+G W E+ W ++ E+F +
Sbjct: 1170 PEATAQEMQLFTDAAGSVGFG-----AYLAGQWCAEKWPPEWVESGLVRNLAFLELFPIV 1224
Query: 815 QALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK---SLSLLSEVEKIFLLSQDWRIHI 871
AL + L++ ++ SDN +VV + + L V + F L+ I
Sbjct: 1225 VALFVWEQELRNRSIVFFSDNLSVVQGINNWSASSPPVLRLLRVLVLRCFRLN----IRC 1280
Query: 872 LAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQ 905
A+ + G N +AD+LSRS+ W ++ A ++
Sbjct: 1281 RARHVEGVKNVIADALSRSQWERFWQVAPEAEKE 1314
>gi|301632403|ref|XP_002945275.1| PREDICTED: genome polyprotein-like [Xenopus (Silurana) tropicalis]
Length = 565
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 161/366 (43%), Gaps = 29/366 (7%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
+G + D+ A+ +PI L + G CLP G + + + FA+
Sbjct: 178 QGSLLAKTDIESAFRLLPIHPDSHYLLGFHFQGAYFYDKCLPMGCSISCKYFEMFATFLE 237
Query: 601 WVASLLRSRGMRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
WV S V YLDDFL + ++ I + G + +K+ ++P
Sbjct: 238 WVIKF-ESGANFVTHYLDDFLFLGPRESNTCSILLNTFLLYAKKFGVPIAREKT-VAPTT 295
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
LQFLGI D LPE K L +++ + L +K L +SL+G+L+FA+ +IPM
Sbjct: 296 SLQFLGIEIDTMRMEFRLPEAKITKLKSLIASALVAKKLKLKHIQSLIGHLNFATRIIPM 355
Query: 720 GRLHSRRIQRQASLLRLGAP----HLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIS 775
GR+ +RR+ +L +G H+ I +V L W L + Q + FI
Sbjct: 356 GRVFNRRL----IVLTMGITNPNWHIR-IPTSVKEDLLIWRQFLSFYNGRTCWQ-EDFIE 409
Query: 776 TDASDL---GWGSQVDSSFLSGLWS--------REQQ-NWHINKKEMFAVHQALSLNLPL 823
A L GS ++LSG W REQ+ ++ E+F V AL +
Sbjct: 410 NSAIQLFTDAAGSTGFGAYLSGCWCCAAWPTEWREQELTGNLVLLEIFPVLVALEIWGSW 469
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
L + +++ DN VV + KS ++ + + L+ I + A+ IPG N +
Sbjct: 470 LANRRILLFCDNMGVVQVINNLSA-KSPPVVKVMRHLVFLALKHNIWLKAKHIPGCQNIL 528
Query: 884 ADSLSR 889
ADSLSR
Sbjct: 529 ADSLSR 534
>gi|301617707|ref|XP_002938267.1| PREDICTED: hypothetical protein LOC100493886 [Xenopus (Silurana)
tropicalis]
Length = 1054
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 163/385 (42%), Gaps = 37/385 (9%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +PI L + G CLP G A + +AF++ W
Sbjct: 668 GALMAKADIESAFRLLPIHPECHHLLGCWFEGAYFVDLCLPMGCAISCAHFEAFSTFLEW 727
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V + RS VV YLDDF V Q +LE + S G + K+
Sbjct: 728 VVKV-RSGYRSVVHYLDDFFCVGQAKSDTCFHLLET----LREVTASFGVPLAADKTE-G 781
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA V++FLG+ D LP K L + +L K L +S+LG L+FA V
Sbjct: 782 PATVMRFLGLEIDSLAGECRLPTQKVADLMREVGSLRRDKKATLQRLQSVLGKLNFACRV 841
Query: 717 IPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWW------LNALPLSSPIFPRQ 769
IP+GR+ SRR+ + + R AP H I V L W N L
Sbjct: 842 IPVGRVFSRRLAQATAGAR--APHHHVRITKEVKADLGVWEAFLADFNGRVLFRASETTA 899
Query: 770 VQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLN 820
+ + TDA+ GS+ ++L+G W Q Q W ++ E+F + AL +
Sbjct: 900 QELELYTDAA----GSKGFGAYLAGRWCAAQWPQEWVEAGLVRNLVFLELFPIVVALFIW 955
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAY 880
L+ ++ SDN VV + + S +L + + L + A+ + G
Sbjct: 956 EAELRDRRIIFYSDNMGVVQGINNWSAS-SQPVLRLLRALVLRCLKLNVSCRARHVEGCK 1014
Query: 881 NSVADSLSRSKSLPDWHLSRSATEQ 905
N +AD+LSRS+ W ++ +A ++
Sbjct: 1015 NDIADALSRSQWERFWQVAPTAEKE 1039
>gi|301611629|ref|XP_002935334.1| PREDICTED: hypothetical protein LOC100490884 [Xenopus (Silurana)
tropicalis]
Length = 1013
Score = 94.4 bits (233), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 111/418 (26%), Positives = 169/418 (40%), Gaps = 71/418 (16%)
Query: 527 PKKFSLINHFRIP-------------------SFLQ---------KGDYMISIDLSQAYF 558
P KF LI+H P SF Q KG + D+ A+
Sbjct: 600 PGKFRLIHHLSYPHGESVNDDINPELCSVTYISFDQAVALVRKAGKGALLAKADIESAFR 659
Query: 559 HVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRG--MRVVVY 616
+PI L ++ G + CLP G + + F + S+++ ++R R +V Y
Sbjct: 660 LLPIHPECHHLLGCAFEGSIYVDLCLPMGCSISCSYFETFSSFMEWVVRQRAHTTGIVHY 719
Query: 617 LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSLSPAPVLQFLGIMWDP 670
LDDFL V + IL +L + ++ P L FLG+ D
Sbjct: 720 LDDFLCVG------PAGSEECFHILTTLQEVAEDFGVPLAPDKTVGPVTCLSFLGLEIDS 773
Query: 671 HLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR-------SLLGYLSFASFVIPMGRLH 723
LPEDK L L S W + A+ S+LG L+FA VI MGR+
Sbjct: 774 VRGESRLPEDK-------LHDLRKSVAWAREKAKMTVREIQSMLGKLNFACRVIVMGRVF 826
Query: 724 SRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNALPL--SSPIFP-RQVQ----HFIS 775
RR+ + R APH P V LE W L IFP R+V +
Sbjct: 827 CRRLGGLLAGAR--APHHHIRLPQGVRDDLEVWQRFLESFNGKVIFPEREVSSTELQLFT 884
Query: 776 TDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVMV 831
A G+G+ + S+ + W E + + K E+F + ++ + L++ V+
Sbjct: 885 DAAGSFGFGAYLGGSWCADRWPGEWFSLGLVKNLCFLELFPIVVSVFIWGDKLRNKQVVF 944
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
SDN VV + RQ T S ++ + + L + + A+ +PG N +AD+LSR
Sbjct: 945 VSDNMGVVQVINRQTAT-SAEVVRLLRVLVLRCLNINLGFRARHLPGVKNEIADALSR 1001
>gi|291237354|ref|XP_002738600.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 443
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 171/397 (43%), Gaps = 26/397 (6%)
Query: 518 LKGLNQFLSPKKFSLINHFRIP---SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLA 571
L +N +S + SL+ + RI +F+ K G + ID+ A+ P++ L
Sbjct: 8 LDSVNDNISKEDHSLV-YSRIDDAVAFIHKHGHGSLLAKIDVKHAFRLCPVRKEDWHLLG 66
Query: 572 LSYNGDVLAMTCLPFGLATAPQAFASLSN---WVASLLRSRGMRVVVYLDDFLLVN-QDP 627
+ G LPFGL +AP F +++ W+ S R+R + YLDDFL V +
Sbjct: 67 FFWEGCYFFDRVLPFGLRSAPYLFNRIADAIHWIVSH-RARNTDFLHYLDDFLTVGPANT 125
Query: 628 RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN 687
+ + + LG + +K P V+ FLG+ D + LP+DK L
Sbjct: 126 NACQHNMDVMLQSCHHLGVPIATEKVE-GPCSVITFLGVELDTVNMVIRLPKDKLADLLV 184
Query: 688 ILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
L + L T + SL+G LSFA IP GR+ RR+ S+ + +
Sbjct: 185 KLPSWLTRHTCSKRELLSLIGCLSFACKCIPAGRIFLRRMI-DISMTATSLSQVITLTDE 243
Query: 748 VLPKLEWWLNALP-------LSSPIFPRQVQHFISTDAS-DLGWGSQVDSSFLSGLWSRE 799
++WW + LP L +P + + + TDAS LG+G+ + + W
Sbjct: 244 FWHDVQWWCDFLPSWNGTASLLNPNWIPSPEFELFTDASATLGYGAFYKGHWFANTWPTF 303
Query: 800 QQN---WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
N + I KE+ + + + L + DN +VV + ++G + ++
Sbjct: 304 ITNDPLYSIAWKELLPILLSSLIWGHLWYGLRIRFHCDNISVVQ-IWKKGSSSCPRIMQL 362
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
V +F + H++ I G N +ADSLSR + L
Sbjct: 363 VRLLFFTAASNNFHVMISHISGFNNDIADSLSRQQIL 399
>gi|301631073|ref|XP_002944633.1| PREDICTED: hypothetical protein LOC100486363, partial [Xenopus
(Silurana) tropicalis]
Length = 517
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 150/361 (41%), Gaps = 54/361 (14%)
Query: 526 SPKKFSLINHFRIP-------------------SFLQ---------KGDYMISIDLSQAY 557
P KF LI+H P SF Q KG M +D+ A+
Sbjct: 127 EPGKFRLIHHLSYPKGSSVNDDIDKELSSVSYTSFDQAVDLVKIAGKGALMAKVDIESAF 186
Query: 558 FHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRG--MRVVV 615
+PI L + G CLP G A + F S ++ +++ + VV
Sbjct: 187 RLLPIHPDCHHLLGCWFEGYYFVDLCLPMGCAISCAYFEMFSRFLEWVVKKKAGYTSVVH 246
Query: 616 YLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDR 674
YLDD+L + + I + + S G + +K+ P +QFLGI DP +
Sbjct: 247 YLDDYLCIGPANSDICFYLLETIQDVTASFGVPLAREKTE-GPTTSIQFLGIQIDPMVGE 305
Query: 675 MWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLL 734
LPE K + L + + ++ L +SLLG L+FA +IP+GR+ SRR+ + + +
Sbjct: 306 CRLPESKVVELRQAVEEMGKARRATLRQVQSLLGKLNFACRIIPVGRVFSRRLAQATTGV 365
Query: 735 RLGAPHLTPINPAVLPKLEWW------------LNALPLSSPIFPRQVQHFISTDASDLG 782
++ H++ I+ V L W A +S+P ++Q + S G
Sbjct: 366 QVAHHHVS-ISKEVRADLAVWGHFLRDFNGKVLFQAKEISTP----EMQLYTDAAGSS-G 419
Query: 783 WGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVMVQSDNQTV 838
+G+ + + + W R+ + + + E+F + A+ L ++ SDN TV
Sbjct: 420 FGAYLAGQWCAAPWPRDWIDTELVRNIAFLELFPIVVAMYTWEKELSDRRIVFHSDNMTV 479
Query: 839 V 839
V
Sbjct: 480 V 480
>gi|440790615|gb|ELR11896.1| transposon Ty3-G Gag-Pol polyprotein-like family protein, putative
[Acanthamoeba castellanii str. Neff]
Length = 447
Score = 94.0 bits (232), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 149/322 (46%), Gaps = 29/322 (9%)
Query: 614 VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLD 673
+ Y+D+ ++++Q + + LGW+VN +KS +P+ +FLG+M
Sbjct: 1 MAYMDNVIILSQSYTEARHHTTFTLHLFKKLGWVVNTEKSDTTPSQCKEFLGLM------ 54
Query: 674 RMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGY-LSFASFVIPMGRLHSRRIQRQAS 732
P +TL T+ + ++D ++S++ ++ ++P L +
Sbjct: 55 ----PTTYDMTL-----TVSSDTPRHMDQSQSVVSQGVALTRAILPAKLLLRNAYRDIGR 105
Query: 733 LLRLGAPHLTPINPAVLPKLEWWLNALPLS----SPIFPRQVQHFISTDASDLGWGSQVD 788
+ + ++PA LE W + L + + P + + T+AS GWG+
Sbjct: 106 WMSWNSS--IKLSPATTNDLEEWRHGLSTWNGRITVLRPHNI--ILETNASLSGWGASSS 161
Query: 789 SSFLS--GLWSREQQNWHINKKEMFAVHQA-LSLNLPLLQSSVVMVQSDNQTVVSYLRRQ 845
L+ G W + HIN E+ V L+L L L Q V++++ DN V++L
Sbjct: 162 CWTLTAAGWWLSDDSKSHINVLELAVVRNTILALQLHL-QGKVILMRCDNIATVAHLNHM 220
Query: 846 GGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQ 905
GG +S+++ ++I LL + I + + ++PG NS AD LS WHLSR A +
Sbjct: 221 GG-RSIAMNRVWKEIHLLCERLHIQLSSAYLPGLCNSEADCLSHLHPHHKWHLSREAFKL 279
Query: 906 IFLKWGVPCIDLFASRVSAVVP 927
I KWG I+ A+R + +P
Sbjct: 280 INKKWGPHSINQTATRENRQLP 301
>gi|301607367|ref|XP_002933296.1| PREDICTED: hypothetical protein LOC100495679 [Xenopus (Silurana)
tropicalis]
Length = 609
Score = 93.6 bits (231), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 151/360 (41%), Gaps = 19/360 (5%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +P+ L + G LP G + + +AF+S W
Sbjct: 223 GALMAKTDIEAAFRLLPVHPDSLHLLGCQFGGYFYVDRSLPMGCSISCSYFEAFSSFLEW 282
Query: 602 VASLLRSRGMRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
V + ++ YLDDFL V Q+ I + + + G + K+ P
Sbjct: 283 VVKKMAGVD-SLIHYLDDFLCVGPQNSPICALLLQRVHDVAAEFGVPLAPDKTE-GPTTC 340
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
++FLGI D LP DK L + + SK L +SLLG L+FA +I MG
Sbjct: 341 IKFLGIEIDTIQQECRLPADKVDGLREDILRAMGSKKITLRQLQSLLGKLTFACRIIKMG 400
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSS--PIFPRQVQ-----HF 773
R+ SRR+ + L+ H + + LE W L + + RQ
Sbjct: 401 RVFSRRLAMATAGLK-KPHHFVRLRAELKADLEIWGKFLESYNGRSYWQRQTNTNKDLQL 459
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSRE--QQNW--HINKKEMFAVHQALSLNLPLLQSSVV 829
+ A LG+G+ + + W +E + W ++ E+F + A+ L V
Sbjct: 460 FTDAAGSLGFGAFFGGRWCAEGWPKEWVNEGWIRNLTLLELFPIIVAIELWGRQFTDRKV 519
Query: 830 MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ +DN +VV + Q + S +L+ + + L + I AQ +PG N +ADSLSR
Sbjct: 520 VFNTDNMSVVLAINNQ-TSSSGPVLALLRHLVLRCLQFNICFQAQHLPGITNDIADSLSR 578
>gi|301604212|ref|XP_002931768.1| PREDICTED: hypothetical protein LOC100493860 [Xenopus (Silurana)
tropicalis]
Length = 1108
Score = 93.6 bits (231), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 153/365 (41%), Gaps = 29/365 (7%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ L + G LP G + + F + S+++
Sbjct: 722 GALMAKTDIEAAFRLLPVHPDSLHLLGCQFGGYFYVDRSLPMGCSISCSYFEAFSSFLEW 781
Query: 605 LLRSRGM--RVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R ++ YLDDFL V Q+ I + + + G + K+ P +
Sbjct: 782 VVRKMAGVDSLIHYLDDFLCVGPQNSPICALLLQRVHDVTAEFGVPLAPDKTE-GPTTCI 840
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTL-GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
+FLGI D LP DK +L +ILR A K L +SLLG L+FA +I MG
Sbjct: 841 KFLGIEIDTVQQECRLPADKVDSLREDILRATRAKKV-TLRQLQSLLGKLTFACRIIKMG 899
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL--SSPIFPRQVQ-----HF 773
R+ SRR+ + L+ H + + LE W L + RQ
Sbjct: 900 RVFSRRLAMATAGLKK-PHHFVRLRAELKADLEIWGKFLESYNGRSYWQRQTNTNKDLQL 958
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPLL 824
+ A LG+G +F G W E + W ++ E+F + A+ L
Sbjct: 959 FTDAAGSLGFG-----AFFGGRWCAEGWPEEWVQGGLTRNLTLLELFPIIVAIELWGRRF 1013
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V+ +DN +VV + Q + S +L+ + + L + I AQ +PG N +A
Sbjct: 1014 TDRKVVFNTDNMSVVMAINNQ-TSSSGPVLALLRHLVLRCLQFNICFQAQHLPGITNDIA 1072
Query: 885 DSLSR 889
DSLSR
Sbjct: 1073 DSLSR 1077
>gi|301604558|ref|XP_002931946.1| PREDICTED: hypothetical protein LOC100494686 [Xenopus (Silurana)
tropicalis]
Length = 1006
Score = 92.8 bits (229), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 161/373 (43%), Gaps = 17/373 (4%)
Query: 531 SLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLAT 590
S R+ +G M D+ A+ +P+ + L + G LP G +
Sbjct: 606 SFDEAIRLVGRAGQGALMAKADIESAFRLLPVHSESLHLLGCYFEGHYYVDRSLPMGCSI 665
Query: 591 APQAFASLSNWVASLLRSR-GMRVVV-YLDDFLLVN-QDPRILEIQGKLAVSILGSLGWI 647
+ F + S ++ ++R + G+ V+ YLDDFL V Q + + + + G
Sbjct: 666 SCAYFEAFSTFLEWVVREKAGVESVIHYLDDFLCVGPQRSSLCAVLLETLQEVAEQFGVP 725
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLL 707
+ +K+ P L+FLGI D LP DK LTL + +K L +SLL
Sbjct: 726 LAREKTE-GPITCLKFLGIEIDTVRQECRLPRDKVLTLKEEVGYAKQAKKVTLKQLQSLL 784
Query: 708 GYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIF 766
G L+FA +IPMGR SR + + +R H +N L W L + ++
Sbjct: 785 GKLNFACRIIPMGRAFSRSLSMATAGIRH-PHHFIRLNSEHKADLAVWSTFLQDFNGKVY 843
Query: 767 -PRQV----QHFISTDASD-LGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQA 816
P +V + + TDA+ G+G+ + + + W +E ++ E+F + A
Sbjct: 844 WPEEVVENTEISLFTDAAGATGFGAYLAGKWCAAGWPQEWATRNLTGNLAFLELFPIIVA 903
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
+ L L + V+ +SDN V + + S +L+ + + L I A+ I
Sbjct: 904 VELWGRELSNKSVLFRSDNMATVLAVNNL-TSSSRPVLALLRHLVLRCLQLNISFRAKHI 962
Query: 877 PGAYNSVADSLSR 889
PG N +AD+LSR
Sbjct: 963 PGEINEIADALSR 975
>gi|301604684|ref|XP_002931985.1| PREDICTED: hypothetical protein LOC100491034 [Xenopus (Silurana)
tropicalis]
Length = 1108
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 150/364 (41%), Gaps = 27/364 (7%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ L + G LP G + + F + S+++
Sbjct: 722 GALMAKTDIEAAFRLLPVHPDSLHLLGCQFGGYFYVDRSLPMGCSISCSYFEAFSSFLEW 781
Query: 605 LLRSRGM--RVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R ++ YLDDFL V Q+ I + + + G + K+ P +
Sbjct: 782 VVRKMAGVDSLIHYLDDFLCVGPQNSPICALLLQRVHDVTAEFGVPLAPDKTE-GPTTCI 840
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP DK +L + SK L +SLLG L+FA +I MGR
Sbjct: 841 KFLGIEIDTIQQECRLPADKVDSLREDILRATRSKKVTLRQLQSLLGKLTFACRIIKMGR 900
Query: 722 LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL--SSPIFPRQVQ-----HFI 774
+ SRR+ + L+ H + + LE W L + RQ
Sbjct: 901 VFSRRLAMATAGLKK-PHHFVRLRAELKADLEIWGKFLESYNGRSYWQRQTNTNKDLQLF 959
Query: 775 STDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPLLQ 825
+ A LG+G +F G W E + W ++ E+F + A+ L
Sbjct: 960 TDAAGSLGFG-----AFFGGRWCAEGWPEEWVQEGLTRNLTLLELFPIIVAIELWGRQFT 1014
Query: 826 SSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
V+ +DN +VV + Q + S +L+ + + L + I AQ +PG N +AD
Sbjct: 1015 DRKVVFNTDNMSVVMAINNQ-TSSSGPVLALLRHLVLRCLQFNICFQAQHLPGITNDIAD 1073
Query: 886 SLSR 889
SLSR
Sbjct: 1074 SLSR 1077
>gi|326676637|ref|XP_003200633.1| PREDICTED: hypothetical protein LOC100537191, partial [Danio rerio]
Length = 417
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/154 (34%), Positives = 83/154 (53%)
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
L+L+ LN+ L F ++ RI ++ D+ +IDL AYFHV I H++FL ++
Sbjct: 1 LDLRILNRCLHKLPFRMLTQRRILQCVRPRDWFAAIDLKYAYFHVSILPRHRQFLRFAFE 60
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGK 635
G LPFGL+ +P+ F L+ + LR G+R++ YLDD+L++ L +
Sbjct: 61 GRAWQYKVLPFGLSLSPRVFTKLAEGALAPLRLTGIRILNYLDDWLILAHLREQLIVHRD 120
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD 669
+ L LG VN +KS L+P + FLG+ D
Sbjct: 121 RVLRHLRLLGLQVNREKSKLAPVQRISFLGMELD 154
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 71/147 (48%), Gaps = 21/147 (14%)
Query: 775 STDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSD 834
STDAS+ GWG IN+ E+ AV AL LP+L+ ++V++D
Sbjct: 180 STDASNTGWG--------------------INRLELLAVFLALQRFLPVLEQQHMLVRTD 219
Query: 835 NQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
+ +Y+ R GG S + ++ L S + A +PG N AD+LSR P
Sbjct: 220 STAAAAYINRMGGMSSRRMSQLARRLLLWSHPRLKSLCAIHVPGTLNRAADALSRQLLRP 279
Query: 895 -DWHLSRSATEQIFLKWGVPCIDLFAS 920
+W L + + I+ ++G IDLFAS
Sbjct: 280 GEWRLHPESVQLIWARFGEAQIDLFAS 306
>gi|301631147|ref|XP_002944668.1| PREDICTED: hypothetical protein LOC100496098 [Xenopus (Silurana)
tropicalis]
Length = 525
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/408 (26%), Positives = 169/408 (41%), Gaps = 49/408 (12%)
Query: 526 SPKKFSLINHFRIP-------------------SFLQ---------KGDYMISIDLSQAY 557
P KF LI+H P SF Q KG + D+ A+
Sbjct: 88 EPGKFRLIHHLSYPKGGSVNDDIDPELCSVTYTSFDQAVALVRRAGKGALLAKADIESAF 147
Query: 558 FHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMR--VVV 615
+PI L ++ G + CLP G + + F + S+++ ++R R +V
Sbjct: 148 RLLPIHPECHHLLGCAFEGSIYVDLCLPMGCSISCSYFETFSSFMEWVVRQRAQTTGIVH 207
Query: 616 YLDDFLLVN--QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLD 673
YLDDFL V + I L + G + K ++ P L FLG+ D
Sbjct: 208 YLDDFLCVGPAKSDECFHILATLQ-EVAEDFGVPLAPDK-TVGPVTCLSFLGLEIDSEKG 265
Query: 674 RMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASL 733
LP DK L + + +SLLG L+FA VI MGR+ RR+ +
Sbjct: 266 ESRLPVDKLEDLRRAVAWAREKAKVTIREIQSLLGKLNFACRVIVMGRVFCRRLGGLLAG 325
Query: 734 LRLGAPHLTPINP-AVLPKLEWWLNALPL--SSPIFPRQ----VQHFISTDAS-DLGWGS 785
R APH P V LE W L IFP + V+ + TDA+ G+G+
Sbjct: 326 TR--APHHHMRLPQGVREDLEVWQRFLESFNGKVIFPGKEVTNVEMQLFTDAAGSFGFGA 383
Query: 786 QVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSY 841
+ S+ + W E + K E+F + ++ + L++ V+ SDN VV
Sbjct: 384 YLGGSWCADRWPDEWFKLGLVKNLCFLELFPIVVSVFVWSEKLRNRQVVFVSDNMGVVQV 443
Query: 842 LRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ RQ T S+ ++ + + L + + A+ +PG N +AD+LSR
Sbjct: 444 INRQTAT-SVEVVRLLRVLVLRCLNINLGFRARHLPGVKNEIADALSR 490
>gi|291237360|ref|XP_002738603.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 539
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/412 (25%), Positives = 174/412 (42%), Gaps = 36/412 (8%)
Query: 502 LFLVPKGNGGTRPVLNL-----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQA 556
L L PK +GG R +++L +N +S + SL PSF D + A
Sbjct: 100 LGLRPKKSGGFRIIMDLSQPTLDSVNDNISKEDHSL------PSFTSMAD------VKHA 147
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN---WVASLLRSRGMRV 613
+ P++ L + G LPFGL +AP F +++ W+ S R+R +
Sbjct: 148 FRLCPVRKEDWHLLGFFWEGCYFFDRVLPFGLRSAPYLFNRIADAIHWIISH-RARNKDI 206
Query: 614 VVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
+ YLDDFL V + + + + LG + +K P V+ FLG+ D
Sbjct: 207 LHYLDDFLTVGPANTNACQHNMDVMLQSCHHLGVPIATEKVE-GPCSVITFLGVELDTVN 265
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQAS 732
+ LP+DK L L + L T + SL+G LSFA + GR+ RR+ S
Sbjct: 266 MVIRLPKDKLADLLVKLPSWLTRHTCSKRELLSLIGCLSFACKCMLAGRIFLRRMI-DIS 324
Query: 733 LLRLGAPHLTPINPAVLPKLEWWLNALP-------LSSPIFPRQVQHFISTDAS-DLGWG 784
+ + + ++WW + LP L +P + + + TDAS LG+G
Sbjct: 325 MTATSLSQVITLTDEFWHDVQWWCDFLPSWNGTASLLNPNWIPSPEFELFTDASATLGYG 384
Query: 785 SQVDSSFLSGLWSREQQN---WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSY 841
+ + + W N + I KE+ + + + L + DN +VV
Sbjct: 385 AFYKGHWFANTWPTFITNDPLYSIAWKELLPILLSSLIWGHLWYGLRIRFHCDNISVVQ- 443
Query: 842 LRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
+ ++G + ++ V +F + H++ I G N +ADSLSR + L
Sbjct: 444 IWKKGSSSCPRIMQLVRLLFFTAASNNFHVMISHISGFNNDIADSLSRQQIL 495
>gi|301630036|ref|XP_002944137.1| PREDICTED: hypothetical protein LOC100493088 [Xenopus (Silurana)
tropicalis]
Length = 1352
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 167/373 (44%), Gaps = 41/373 (10%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +PI L + G+ CLP G A + +AF++ W
Sbjct: 966 GALMAKADIESAFRLLPIHPECHHLLGCWFEGEYFVDLCLPMGCAISCAHFEAFSTFLEW 1025
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V + RS VV YLDDF V Q +LE ++A + G + K+
Sbjct: 1026 VVKV-RSGFSSVVHYLDDFFCVGQAKADTCFHLLETLQEVASN----FGVPLAADKTE-G 1079
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA V++FLG+ D LP K L + +L K L +S+LG L+FA V
Sbjct: 1080 PATVMRFLGLEIDSVAGECRLPTQKVEDLMREVGSLRRDKKATLQRLQSVLGKLNFACRV 1139
Query: 717 IPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWW------LNALPL--SSPIFP 767
IP+GR+ SRR+ + + R AP H + V L W N L +S
Sbjct: 1140 IPVGRVFSRRLAQATAGAR--APHHHVRLGKEVRADLRVWETFLTGFNGRVLFRASETTA 1197
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALS 818
++++ + TDA+ GS+ ++L+G W Q Q+W ++ E+F + A+
Sbjct: 1198 QELELY--TDAA----GSKGFGAYLAGQWCAAQWPQDWVEAGLVRNLVFLELFPIVVAMF 1251
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
+ L++ ++ SDN VV + + S +L + + L I A+ + G
Sbjct: 1252 IWERELRNRRIVFHSDNMGVVQGINNWSAS-SPPVLRLLRALVLRCLKLNISCRARHVEG 1310
Query: 879 AYNSVADSLSRSK 891
N++AD+LSRS+
Sbjct: 1311 CKNNIADALSRSQ 1323
>gi|401888493|gb|EJT52449.1| hypothetical protein A1Q1_03965 [Trichosporon asahii var. asahii CBS
2479]
Length = 1858
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 153/367 (41%), Gaps = 39/367 (10%)
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLL 606
++ +DL AY HV + L +G CL FG ++P F S +A +L
Sbjct: 1136 WLWKMDLKDAYRHVVVDAADAALLGFHLDGKDYVDCCLNFGGKSSPFIFNMFSEALAWIL 1195
Query: 607 RSRGMRVVVYLDDFL---LVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
S G+R LDDF + P IL + ++ G LG V KS P ++
Sbjct: 1196 ASFGLRNRHLLDDFFGRCKAARGPAIL----RFLDALCGYLGLSVARHKSLT--GPCVEI 1249
Query: 664 LGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN---LDSARSLLGYLSFASFVIPMG 720
LGIM D WL DK L +R+ L+ ++ + +A SL+G L+ A+ ++ G
Sbjct: 1250 LGIMVDGPTASAWLSPDKLEKLRWSVRSALSRESNDQISFSAAESLVGSLTDATRIVAAG 1309
Query: 721 RLHSR---------RIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI----FP 767
R +R R + + LRL + + L WW N L + P
Sbjct: 1310 RAFTRGFYDWLTDNRHRGHRATLRL--------SRDLKSDLRWWNNLLRKWPGVRLLRRP 1361
Query: 768 RQVQHFISTDASDLGWGSQVD-----SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLP 822
R + A+ G G + ++ S + +I E AVH+AL P
Sbjct: 1362 RGSIEIWTDAATSSGLGGHLGPPEAVTARFSAPVPDHLRGANIMALEAEAVHEALQRWAP 1421
Query: 823 LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNS 882
+ V+ + DNQ VV+ L G + V +IF L + RI + +I N+
Sbjct: 1422 AHKGFRVVCRVDNQAVVNAL-LTGRIRHRDTQRVVRRIFTLLHEHRIFLRVSWIASEDNA 1480
Query: 883 VADSLSR 889
VAD+LSR
Sbjct: 1481 VADALSR 1487
>gi|301604726|ref|XP_002932020.1| PREDICTED: hypothetical protein LOC100489555 [Xenopus (Silurana)
tropicalis]
Length = 721
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/362 (25%), Positives = 153/362 (42%), Gaps = 23/362 (6%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ L + G LP G + + F + S ++
Sbjct: 338 GALMAKTDIEAAFRLLPVHPDSLHLLGCQFGGSFYIDRSLPMGCSISCSYFEAFSTFLEW 397
Query: 605 LLRSR-GMRVVV-YLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R + GM ++ YLDDFL + + I + + G + K+ P+ +
Sbjct: 398 VIRQQSGMDSIIHYLDDFLCIGPANSPACAILLQTIQGVTSEFGVPLAPDKTE-GPSTCI 456
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP DK L ++ + SK L +SLLG L+FA +I MGR
Sbjct: 457 KFLGIEIDTVGQECRLPIDKISALREDIQRAITSKKLTLKQLQSLLGKLTFACRIITMGR 516
Query: 722 LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDL 781
+ SRR+ S L+ H + + L W L + R H + DL
Sbjct: 517 VFSRRLAMATSGLK-KPHHFVRLRAELKADLGIWARFLQAYN---GRSYWHKTTDSNKDL 572
Query: 782 -----GWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPLLQSS 827
GS ++ G W ++ ++W ++ E+F + A+ L +
Sbjct: 573 QLFTDAAGSCGFGAYFRGSWCADRWPESWVAGGLTRNLTLLELFPILVAIELWGHWFSNK 632
Query: 828 VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSL 887
V+ +DN +VV + Q + S +L+ + + L + I AQ +PG N +ADSL
Sbjct: 633 NVIFNTDNMSVVLAINNQ-TSSSGPVLALLRHLVLRCLQFNICFRAQHLPGVANDIADSL 691
Query: 888 SR 889
SR
Sbjct: 692 SR 693
>gi|313232917|emb|CBY09600.1| unnamed protein product [Oikopleura dioica]
Length = 724
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 118/468 (25%), Positives = 191/468 (40%), Gaps = 50/468 (10%)
Query: 456 KPPLVPLCSLQHLATPVSSAMSLHIQEMLE---------TGVLKRLDSTTGFLSRLFLVP 506
KP V L+ L P A+ +E+L+ + L++ D + LV
Sbjct: 114 KPYEVIKGRLKALTVPNQDAVRERSEEILQQLKEWCRMGSVTLRKDDKKPWITAGFILVD 173
Query: 507 KGNGGTRPVLN---LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIK 563
+ TR LN LK L + P K + LQKGD M D + Y +P+
Sbjct: 174 RPEKDTRICLNGSILKPLELYTFPCKMDSVKT--AIQMLQKGDVMAKFDDKKGYHQMPLA 231
Query: 564 TTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV 623
++ + + L FG+ AP + L++ + LR G+++ +YLDD LL+
Sbjct: 232 AESKKMACFKWGNYIFENNILAFGIPAAPGMYQLLNSVGINFLRQNGIKITLYLDDRLLI 291
Query: 624 ------NQDPRIL--EIQGK---LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
N ++L EI K L + L +LG VN++KS P ++FLG + D +
Sbjct: 292 ISPKSENHRKKLLTEEILCKEVWLVAATLVALGGFVNIKKSEFKPTQRIEFLGFILDTNK 351
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQAS 732
+ + +PE + TL +R + K L + G + AS V + R + RQ +
Sbjct: 352 ETVEIPEGRWNTLKKRMRDAESGKMVELKLLERIRG--TQASMVEVFSNM--RMLIRQIT 407
Query: 733 LLRLGAPHLTPINPAVLPK---LEW--WLNALPLS-SPIFPRQVQH----FISTDASDLG 782
+L + L VL K EW W S + R+ + I TDAS
Sbjct: 408 IL-IMQTELEKKTETVLTKEVRREWKLWYEFEKTGLSRSWKREDRSDAGLLIYTDASKHA 466
Query: 783 WGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVS 840
++ LS W + HI KE A+ AL L V DN +VV
Sbjct: 467 GAIVIEKWKLSEKFAWEEDLAAAHIGIKEAAAIRMALEWYGRNLAKKRVTFLCDNDSVV- 525
Query: 841 YLRRQG---GTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
QG G+K + ++ +I++L+Q +I + +++ D
Sbjct: 526 ----QGAINGSKDPEMNKQLVRIWMLAQKRKIDLKIEWVSTKLQKADD 569
>gi|301620353|ref|XP_002939542.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 892
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 112/448 (25%), Positives = 189/448 (42%), Gaps = 51/448 (11%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I++ LE G ++ S G + F V K +GG RP ++ +GLN+
Sbjct: 182 LSLPETKAMEEYIKDNLEQGFIRPSSSPAG--AGFFFVSKKDGGLRPCIDYRGLNKITVK 239
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ +DL AY + IK + A + +PFG
Sbjct: 240 NRYPLPLISELFDRVKGASIFTKLDLRGAYNLIRIKEGDEWKTAFNTRAGHYEYLVMPFG 299
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N + L R VVVYLDD L+ + + + + L
Sbjct: 300 LCNAPAVFQEFVNDIFRDLLGR--HVVVYLDDILIYSSNLEDHHCHVQEVLLRLRQHHLY 357
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD-SARSL 706
L+K P + FLG + + + L E + + IL W S R++
Sbjct: 358 AKLEKCIFE-VPSVHFLGYI----ISELGL-EMEPTKVEGIL-------NWAQPLSLRAI 404
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF 766
+L FA++ + S + +L + G P+ P P L + +A +S+ +
Sbjct: 405 QRFLGFANYYRQFVKGFSSLVAPITALTKKGRPNCWP--PVALEAFQSLKDAF-ISASVL 461
Query: 767 PR---QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKEMFA 812
+ +FI DASD+G G+ + ++ S +S +QN+ I +E+ A
Sbjct: 462 RHPEPHLPYFIEVDASDVGAGAILSQRHSADGKLHPCAYFSKKFSSAEQNYDIGNRELLA 521
Query: 813 VHQALSLNLPLLQ--SSVVMVQSDNQTV--VSYLRRQGGTKS-LSLLSEVEKIFLLSQDW 867
V AL LL+ S V + +D++ + + L+RQ ++ SL
Sbjct: 522 VKLALEEWRHLLEGASHPVTIYTDHKNLEFLQSLKRQNPRQARWSLFFS----------- 570
Query: 868 RIHILAQFIPGAYNSVADSLSRSKSLPD 895
R + + + PG N AD+LSRS S D
Sbjct: 571 RFNFVLTYRPGTKNRKADALSRSFSPED 598
>gi|291232969|ref|XP_002736426.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 981
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/398 (23%), Positives = 177/398 (44%), Gaps = 39/398 (9%)
Query: 521 LNQFLSPKKFSLINHFRIPSFLQK------GDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
LN+ + + FS+ ++ RI + +M D+ A+ VP+ + + + +
Sbjct: 565 LNELIDKEAFSM-SYIRIDDATMELKRVGSSAFMCKFDIMDAFKQVPLHPSVVPWHGVKW 623
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSN---WVASLLRSRGMRVVVYL-DDFLLVNQDPRIL 630
G L FG ++P+ F +LS W+A G++ +++L DDF V+ D
Sbjct: 624 EGKYYFFVRLVFGCRSSPKLFDALSQAICWIAE--HKYGIKFILHLLDDFFTVD-DGEFS 680
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
++ ++++ + I + +L P L++LGIM + +LP DK++ + L
Sbjct: 681 ALRTMSIMTLIFNTLRIPLAKHKTLGPVQELEYLGIMLNSKELMAFLPTDKRIRISQKLS 740
Query: 691 TLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
+LA K+ SLLG+LSFA V+ GR + + AS + H+ IN
Sbjct: 741 EVLAKKSVTKQVLLSLLGHLSFAGRVVLPGRSFVSHLLQLASTVPKLHYHVA-INSDCRM 799
Query: 751 KLEWWLNALP--------LSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQN 802
+L W + L L+S + + +S +G+G + S W E
Sbjct: 800 ELAMWDSFLQSWNGVHLFLNSEVTSAADLQIYTDASSKVGFGGYFRGEWFSDCWPAE--- 856
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSV----------VMVQSDNQTVVSYLRR-QGGTKSL 851
+N + AL P++ +++ + DN+ V + + + +K
Sbjct: 857 --LNMNNRSDLSMALLELYPIVVAAMLWGGKWSQKRIRFNCDNEATVHIINKGRASSKCQ 914
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
++ + ++ LL+ + ILA+F+PG +N +ADSLSR
Sbjct: 915 TINKLMRRLTLLAMRYNFIILARFLPGVHNGIADSLSR 952
>gi|301620407|ref|XP_002939567.1| PREDICTED: hypothetical protein LOC100493347 [Xenopus (Silurana)
tropicalis]
Length = 1177
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 110/418 (26%), Positives = 167/418 (39%), Gaps = 71/418 (16%)
Query: 527 PKKFSLINHFRIP-------------------SFLQ---------KGDYMISIDLSQAYF 558
P KF LI+H P SF Q KG + D+ A+
Sbjct: 741 PGKFRLIHHLSYPQGESVNDDINPELCSVTYISFDQAVALVRKAGKGALLAKADIESAFR 800
Query: 559 HVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR--GMRVVVY 616
+PI L ++ G + CLP G + + F + S+++ ++R R +V Y
Sbjct: 801 LLPIHPECHHLLGCAFEGSIYVDLCLPMGCSISCSYFETFSSFMEWVVRQRAHATGIVHY 860
Query: 617 LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSLSPAPVLQFLGIMWDP 670
LDDFL V + IL +L + ++ P L FLG+ D
Sbjct: 861 LDDFLCVG------PAGSEECFHILTTLQEVAEDFGVPLAPDKTVGPVTCLSFLGLEIDS 914
Query: 671 HLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR-------SLLGYLSFASFVIPMGRLH 723
LP+DK L L S W + A+ S+LG L+FA VI MGR+
Sbjct: 915 VKGESRLPDDK-------LHDLRKSVAWAREKAKMTVREIQSMLGKLNFACRVIVMGRVF 967
Query: 724 SRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNALPL--SSPIFP-RQVQ----HFIS 775
RR+ + R APH P V LE W L IFP R+V +
Sbjct: 968 CRRLGGLLAGAR--APHHHIRLPQGVRDDLEVWQRFLESFNGKVIFPEREVSSTELQLFT 1025
Query: 776 TDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVMV 831
A G+G+ + S+ + W E + K E+F + ++ + L++ V+
Sbjct: 1026 DAAGSFGFGAYLGGSWCADRWPVEWFRLGLVKNLCFLELFPIVVSVFIWGDKLRNRQVVF 1085
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
SDN VV + RQ T S ++ + + L + A+ +PG N +AD+LSR
Sbjct: 1086 VSDNMGVVQVINRQTAT-SAEVVRLLRVLVLRCLSINLGFRARHLPGVKNEIADALSR 1142
>gi|326672963|ref|XP_003199767.1| PREDICTED: hypothetical protein LOC100536193 [Danio rerio]
Length = 2519
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 121/457 (26%), Positives = 189/457 (41%), Gaps = 67/457 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L G+++ S G + +F V K +G RP ++ +G
Sbjct: 1615 PRGRLYSLSKPEREAMEKYIHDSLAAGIIRPSSSPAG--AGVFFVEKKDGSLRPCIDYRG 1672
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 1673 LNDITVKNRYPLPLMSSAFELLQGASIFTKLDLRNAYHLVRIRQGDEWKTAFNTPTGHFE 1732
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
+ FGL+ +P F +L N V + +R V VYLDD L+ +Q+ R + +
Sbjct: 1733 YLVMLFGLSNSPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSQNEREHVQHVRRVLQR 1790
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L V ++K V FLG + P RM + K A W
Sbjct: 1791 LLENRLFVKVEKCDFHTQSV-SFLGFVLSPEGVRMDPAKVK------------AVADWPT 1837
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW----- 754
DS +++ +L FA+F RR R S + L +LT I+ + EW
Sbjct: 1838 PDSRKAVQRFLGFANFY--------RRFIRNFSQVALPLTNLTSIH----KRFEWSPQAQ 1885
Query: 755 ---------WLNALPLSSPIFPRQVQHFISTDASDLGWG---SQVDSS--------FLSG 794
+++A LS+P RQ + DAS++G G SQ SS F S
Sbjct: 1886 TAFSELKRHFISAPILSNPDPSRQF--VVEVDASEVGVGAILSQRSSSDGRIHPCAFFSH 1943
Query: 795 LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLS 852
+ ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 1944 RLTPSERNYDIGNRELLAVRLALGEWRHWLEGSGVPFVVWTDHKN-LEYIR---SAKRLN 1999
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N AD+LSR
Sbjct: 2000 SRQAHWALFFGRFDFHI----SYRPGSKNGKADALSR 2032
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 32/152 (21%), Positives = 67/152 (44%), Gaps = 4/152 (2%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
A+ H+Q++L G+++ +S + F S + +V K + R ++ + LN ++L
Sbjct: 49 DAVRKHLQDLLAAGIIR--ESESPFASPIVVVRKKDNSVRLCIDFRKLNSQTIKDAYALP 106
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
N + S L + +DL Y+ + ++ + A +P G+ AP
Sbjct: 107 NLEEVFSALTGSKWFSVLDLKSGYYQIEMEEADKSKTAFVCPLGFWEFNRMPQGITNAPS 166
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
F L L + V+V++DD ++ ++
Sbjct: 167 TFQRLMERCMGDLNRK--EVLVFIDDLIIFSE 196
>gi|326677789|ref|XP_003200913.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1465
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 120/473 (25%), Positives = 191/473 (40%), Gaps = 75/473 (15%)
Query: 452 PFSAKPPLVPLCS-----LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVP 506
P+ LVP S L L+ P +AM ++ E L++G ++ S G + F V
Sbjct: 527 PYDCSIELVPGASPPRGRLYSLSIPERTAMEKYLNEALDSGFIRPSTSPAG--AGFFFVS 584
Query: 507 KGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTH 566
K +G RP ++ +GLN ++ L LQ +DL AY V IK
Sbjct: 585 KKDGSLRPCIDYRGLNHITIKNRYPLPLMNTAFEILQGATIFTKLDLRNAYHLVRIKEGD 644
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+ AL+ +PFGL AP F + N V + +R V VYLDD L+ +
Sbjct: 645 EWKTALNTPTGHYEYQVMPFGLVNAPAVFQAFINDVLRKMLNR--LVFVYLDDILIFSSS 702
Query: 627 PRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG 686
+ +S L V L+KS + VL FLG + + + L Q+ G
Sbjct: 703 CEEHVQHVRQVLSQLLRHRLFVKLEKSEFHVSKVL-FLGFI----VSKCSL----QMDPG 753
Query: 687 NILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
I L + ++ + LG+ +F RR R S + A LT +
Sbjct: 754 KIKAVLDWPQPRSVKEVQRFLGFANFY-----------RRFIRGFSSI---AEPLTALTK 799
Query: 747 AVLPKLEWW---------LNALPLSSPIFPR---QVQHFISTDASDLGWG------SQVD 788
W L +L S+PIF ++ + DASD+G G S+ D
Sbjct: 800 KTAKSFVWTEMANKAFNRLKSLFTSAPIFALPDPELPFVVEVDASDIGIGAVLSQRSKTD 859
Query: 789 S-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSY 841
+ ++LS + Q+N+ I K+E+ AV AL L+ + ++ +D+
Sbjct: 860 NKLHPCAYLSHRLTPAQRNYDIGKRELLAVKVALEEWRHWLEGAKHPFLIWTDH------ 913
Query: 842 LRRQGGTKSLSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
K+L+ + E +++ W R + PG+ NS D+LSR
Sbjct: 914 -------KNLTYIREAKRLNSRQARWALFFNRFDFTLSYRPGSKNSKPDALSR 959
>gi|313219624|emb|CBY30545.1| unnamed protein product [Oikopleura dioica]
Length = 631
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 118/468 (25%), Positives = 191/468 (40%), Gaps = 50/468 (10%)
Query: 456 KPPLVPLCSLQHLATPVSSAMSLHIQEMLE---------TGVLKRLDSTTGFLSRLFLVP 506
KP V L+ L P A+ +E+L+ + L++ D + LV
Sbjct: 21 KPYEVIKGRLKALTVPNQDAVRERSEEILQQLKEWCRMGSVTLRKDDKKPWITAGFILVD 80
Query: 507 KGNGGTRPVLN---LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIK 563
+ TR LN LK L + P K + LQKGD M D + Y +P+
Sbjct: 81 RPEKDTRICLNGSILKPLELYTFPCKMDSVKT--AIQMLQKGDVMAKFDDKKGYHQMPLA 138
Query: 564 TTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV 623
++ + + L FG+ AP + L++ + LR G+++ +YLDD LL+
Sbjct: 139 AESKKMACFKWGNYIFENNILAFGIPAAPGMYQLLNSVGINFLRQNGIKITLYLDDRLLI 198
Query: 624 ------NQDPRIL--EIQGK---LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
N ++L EI K L + L +LG VN++KS P ++FLG + D +
Sbjct: 199 ISPKSENHRKKLLTEEILCKEVWLVAATLVALGGFVNIKKSEFKPTQRIEFLGFILDTNK 258
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQAS 732
+ + +PE + TL +R + K L + G + AS V + R + RQ +
Sbjct: 259 ETVEIPEGRWNTLKKRMRDAESGKMVELKLLERIRG--TQASMVEVFSNM--RMLIRQIT 314
Query: 733 LLRLGAPHLTPINPAVLPK---LEW--WLNALPLS-SPIFPRQVQH----FISTDASDLG 782
+L + L VL K EW W S + R+ + I TDAS
Sbjct: 315 IL-IMQTELEKKTETVLTKEVRREWKLWYEFEKTGLSRSWKREDRSDAGLLIYTDASKHA 373
Query: 783 WGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVS 840
++ LS W + HI KE A+ AL L V DN +VV
Sbjct: 374 GAIVIEKWKLSEKFAWEEDLAAAHIGIKEAAAIRMALEWYGRNLAKKRVTFLCDNDSVV- 432
Query: 841 YLRRQG---GTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
QG G+K + ++ +I++L+Q +I + +++ D
Sbjct: 433 ----QGAINGSKDPEMNKQLVRIWMLAQKRKIDLKIEWVSTKLQKADD 476
>gi|301611555|ref|XP_002935300.1| PREDICTED: hypothetical protein LOC100493934 [Xenopus (Silurana)
tropicalis]
Length = 505
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 164/373 (43%), Gaps = 41/373 (10%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +PI L + G+ CLP G A + +AF++ W
Sbjct: 100 GALMAKADIEAAFRLLPIHPECHHLLGCWFEGEYFVDLCLPMGCAISCAHFEAFSTFLEW 159
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V + R+ VV YLDDF V Q +LE ++A S L +
Sbjct: 160 VVKV-RAGCSSVVHYLDDFFCVGQANADTCFHLLETLQEVAASFGVPLA-----ADKTEG 213
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA V++FLG+ D LP K L + +L K L +S+LG L+FA V
Sbjct: 214 PATVMRFLGLEIDSVAGECRLPTQKVEDLLREVGSLRRDKKATLQRLQSVLGKLNFACRV 273
Query: 717 IPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWW------LNALPL--SSPIFP 767
IP+GR+ SRR+ + + R AP H +N V L W N L +S
Sbjct: 274 IPVGRVFSRRLAQATAGTR--APHHHVRLNKEVRADLGVWEAFLTGFNGRVLFRASETTA 331
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALS 818
++++ + TDA+ GS+ ++L+G W Q Q W ++ E+F + A+
Sbjct: 332 QELE--LYTDAA----GSKGFGAYLAGRWCAAQWPQEWVEAGLVRNLVFLELFPIVVAMF 385
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
+ L++ ++ SDN VV + + S +L + + L I A+ + G
Sbjct: 386 VWERELRNRRIVFYSDNMGVVQGINNWSAS-SQPVLRLLRALVLRCLKLNISCRARHVEG 444
Query: 879 AYNSVADSLSRSK 891
N +AD+LSRS+
Sbjct: 445 CKNDIADALSRSQ 457
>gi|301615442|ref|XP_002937181.1| PREDICTED: hypothetical protein LOC100495321 [Xenopus (Silurana)
tropicalis]
Length = 564
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 166/369 (44%), Gaps = 35/369 (9%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G + D+ A+ +PI L ++ G++ CLP G + + F + S+++
Sbjct: 173 RGALLAKADIEAAFRLLPIHPECHHLLGCAFEGNIYIDLCLPMGCSISCSYFETFSSFLE 232
Query: 604 SLLRSR--GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSL 655
++R R +V YLDDFL V + + IL +L + ++
Sbjct: 233 WVVRQRAHATGIVHYLDDFLCVG------PTKSDVCSHILATLQEVAEDFGVPLAPDKTV 286
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR---SLLGYLSF 712
P L FLG+ D LPEDK L ++ R++ ++ N + R SLLG L+F
Sbjct: 287 GPVTCLSFLGLEIDSVKGESRLPEDK---LQDLRRSVAWARDRNKVTVREVQSLLGKLNF 343
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPL--SSPIFPRQ 769
A VI MGR+ RR+ S R AP H ++ V L+ W L IFP +
Sbjct: 344 ACRVIMMGRVFCRRLGGLLSGAR--APHHHIRLSQGVRDDLDVWHRFLESFNGKVIFPVK 401
Query: 770 ----VQHFISTDAS-DLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLN 820
V+ + TDA+ G+G+ + S+ + W + + K E+F + ++ +
Sbjct: 402 EVTNVEMQLFTDAAGSFGFGAYLGGSWCADRWPDDWFKLGLVKNLCFLELFPIVVSVFVW 461
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAY 880
L + V+ SDN VV + RQ T S+ ++ + + L + A+ +PG
Sbjct: 462 GDKLANRQVVFVSDNMGVVQVINRQTAT-SVEVVRLLRVLVLRCLKINLGFRARHLPGVK 520
Query: 881 NSVADSLSR 889
N +AD+LSR
Sbjct: 521 NEIADALSR 529
>gi|301612028|ref|XP_002935526.1| PREDICTED: hypothetical protein LOC100494384 [Xenopus (Silurana)
tropicalis]
Length = 787
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 151/370 (40%), Gaps = 37/370 (10%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G M D+ A+ +P+ L + G LP G + + F + S ++
Sbjct: 400 QGALMAKADVESAFRLLPVHPESLHLLGCYFEGKYYVDRSLPMGCSISCAYFEAFSTFIE 459
Query: 604 SLLRSRG--MRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R + V+ YLDDFL V Q+ + + + I G + +K+ P
Sbjct: 460 WVVRKKAGVTSVIHYLDDFLCVGPQNSSLCAVLLETLQEIAEQFGVPLAKEKTE-GPITC 518
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L+FLGI D LP DK LTL + +K L +SLLG L+FA +IPMG
Sbjct: 519 LKFLGIEIDTVRQECRLPTDKVLTLKEEIGYARQAKKVTLKQMQSLLGKLNFACRIIPMG 578
Query: 721 RLHSRR-------IQRQASLLRLGAPH----------LTPINPAVLPKLEWWLNALPLSS 763
R+ +R I+ +RL A H L N V +WL + +
Sbjct: 579 RVFARSLSLAMAGIRHPHHFIRLNAEHKADLAVWSTFLQDFNGRV-----YWLENVVENP 633
Query: 764 PIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSL 819
I + A G+G+ + + W +E + E+F + A+ L
Sbjct: 634 EI------SLFTDAAGATGFGAYFAGKWCAAGWPQEWAACKLTSNLTFLELFPIIVAVEL 687
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
L + V+ +SDN V + + S +L+ + + L I A+ IPG
Sbjct: 688 WGRDLSNKSVLFRSDNMATVLAVNNL-TSSSRPVLALLRHLVLRCLQLNIDFRAKHIPGE 746
Query: 880 YNSVADSLSR 889
N +AD+LSR
Sbjct: 747 TNEIADALSR 756
>gi|301630417|ref|XP_002944318.1| PREDICTED: hypothetical protein LOC100488977 [Xenopus (Silurana)
tropicalis]
Length = 486
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 167/373 (44%), Gaps = 41/373 (10%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +PI L + G+ CLP G A + +AF++ W
Sbjct: 100 GALMAKADIESAFRLLPIHPECHHLLGCWFEGEYFVDLCLPMGCAISCAHFEAFSTFLEW 159
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V + RS VV YLDDF V Q +LE ++A + G + K+
Sbjct: 160 VVKV-RSGFSSVVHYLDDFFCVGQAKADTCFHLLETLQEVA----SNFGVPLAADKTE-G 213
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA V++FLG+ D LP K L + +L K L +S+LG L+FA V
Sbjct: 214 PATVMRFLGLEIDSVAGECRLPTQKVEDLMREVGSLRRDKKATLQRLQSVLGKLNFACRV 273
Query: 717 IPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWW------LNALPL--SSPIFP 767
IP+GR+ SRR+ + + R AP H + V L W N L +S
Sbjct: 274 IPVGRVFSRRLAQATAGAR--APHHHVRLGKEVRADLRVWETFLTGFNGRVLFRASETTA 331
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALS 818
++++ + TDA+ GS+ ++L+G W Q Q+W ++ E+F + A+
Sbjct: 332 QELE--LYTDAA----GSKGFGAYLAGQWCAAQWPQDWVEAGLVRNLVFLELFPIVVAMF 385
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
+ L++ ++ SDN VV + + S +L + + L I A+ + G
Sbjct: 386 IWERELRNRRIVFHSDNMGVVQGINNWSAS-SPPVLRLLRALVLRCLKLNISCRARHVEG 444
Query: 879 AYNSVADSLSRSK 891
N++AD+LSRS+
Sbjct: 445 CKNNIADALSRSQ 457
>gi|340381502|ref|XP_003389260.1| PREDICTED: hypothetical protein LOC100638994 [Amphimedon
queenslandica]
Length = 804
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 150/362 (41%), Gaps = 24/362 (6%)
Query: 520 GLNQFLSPKKF-SLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDV 578
G++ LS ++ S+ N I L + DL AY VP+ L + + G
Sbjct: 126 GISHALSSVRYASVDNAVEIIRSLGPRAILTKFDLQDAYRIVPVHPADHHRLGIVWEGRT 185
Query: 579 LAMTCLPFGLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLA 637
CLPFGL +AP+ F++LS+ +A + S G+ V YLDDFL +
Sbjct: 186 YVDRCLPFGLRSAPKIFSALSDALAWIFASFGLVSQVHYLDDFLFLEPSNSTGVSVVSST 245
Query: 638 VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK-QLTLGNILRTLLASK 696
V+ L S I + P+ + FLGI+ D + LPEDK QLT + +
Sbjct: 246 VTSLCSTLGIPLATHKTEGPSTCVVFLGIVVDSARQELRLPEDKLQLTYAMV-------Q 298
Query: 697 TWNLDSA------RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
W SA S LG+LS A+ VI GR + + + + H ++
Sbjct: 299 AWACRSACRRRELESFLGHLSHAAVVIRQGRPFLHDLFQLLPVAKY-PHHFIRLSSGAKA 357
Query: 751 KLEWWLNALPL--SSPIFPRQV--QHFISTDASDLGWGS-QVDSSFLSGLWSREQQNWHI 805
+ WWL L FP+ H + AS +G G QV+ S+ W Q I
Sbjct: 358 NILWWLCFLKEWNGRSFFPKVTPSVHVYTDAASSVGCGGFQVNGSWFKLAWPANQGQRSI 417
Query: 806 NKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
E+ V + L Q + SDN+TVV L + G S + LS + + S
Sbjct: 418 AVLELIPVVVSAMLWGSHWQGQSICFHSDNETVVQILSK--GYSSDADLSHLVRCLAFSA 475
Query: 866 DW 867
W
Sbjct: 476 AW 477
>gi|301603977|ref|XP_002931655.1| PREDICTED: hypothetical protein LOC100497463 [Xenopus (Silurana)
tropicalis]
Length = 564
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 110/414 (26%), Positives = 170/414 (41%), Gaps = 61/414 (14%)
Query: 526 SPKKFSLINHFRIP-------------------SFLQ---------KGDYMISIDLSQAY 557
P KF LI+H P SF Q +G M +D+ A+
Sbjct: 127 EPGKFRLIHHLSYPKGGSVNDDIDKELSSVSYTSFDQAVEMVRTAGEGALMAKVDIESAF 186
Query: 558 FHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLR--SRGMRVVV 615
+PI L ++G+ CLP G A + F S ++ +++ S VV
Sbjct: 187 RLLPIHPDCHHLLGCRFDGNYFVDLCLPMGCAISCAYFEMFSCFLEWVVKKASGYTSVVH 246
Query: 616 YLDDFLLV---NQDP--RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP 670
YLDD+L + N D +LE ++A+S L ++ + P +QFLGI D
Sbjct: 247 YLDDYLCIGPANSDICFYLLETIQEVALSFGVPLA-----KEKTEGPTTSIQFLGIQIDS 301
Query: 671 HLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQ 730
LPE K + L ++ + +K L +SLLG L+FA +IP+GR+ SRR+ Q
Sbjct: 302 TKGECRLPEGKVVELRQVVGEMGKTKRATLRQVQSLLGKLNFACRIIPVGRVFSRRLA-Q 360
Query: 731 ASLLRLGAPHLTPINPAVLPKLEWW------LNALPLSSPIFPRQVQHFISTDASDLGWG 784
A+ A H I+ V L W N L + + TDA+ G
Sbjct: 361 ATAGVQEAHHHVHISKEVRADLAVWGHFLRDFNGKVLFQARETSTPEMQLYTDAA----G 416
Query: 785 SQVDSSFLSGLWSRE--QQNW-------HINKKEMFAVHQALSLNLPLLQSSVVMVQSDN 835
S ++L+G W Q+W +I E+F + A+ + L ++ SDN
Sbjct: 417 SSGFGAYLAGQWCAAPWPQDWVDSELVRNIAFLELFPIVVAMYVWKQELSDRRIVFYSDN 476
Query: 836 QTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
TVV + S +L + + L + A + G N +AD+LSR
Sbjct: 477 MTVVQAINSWSAA-SPPVLRLLRALVLRCLVMNVKCRAVHVEGEKNVIADALSR 529
>gi|291225352|ref|XP_002732664.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 553
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 107/418 (25%), Positives = 180/418 (43%), Gaps = 34/418 (8%)
Query: 502 LFLVPKGNGGTRPVLNL-----KGLNQFLSPKKFSLINHFRIP---SFLQK---GDYMIS 550
L L PK +GG R +++L ++ +S + SL+ + RI +F+ K G +
Sbjct: 100 LGLRPKKSGGFRIIMDLSQPTLDSVSDNISKEDHSLV-YSRIDDAVAFIHKHGHGSLLAK 158
Query: 551 IDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN---WVASLLR 607
ID+ A+ P++ L + G LPFGL +AP F +++ W+ S R
Sbjct: 159 IDVKHAFRLCPVRKEDWHLLGFFWEGCYFFDRVLPFGLRSAPYLFNRIADAIHWIVSH-R 217
Query: 608 SRGMRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI 666
+R + YLDDFL V + + + + LG + +K P V+ FLG+
Sbjct: 218 ARNKDFLHYLDDFLTVGPANSNACQHNMDVMLQSCHHLGVPIATEKVE-GPCSVITFLGV 276
Query: 667 MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRR 726
D + LP+DK L L + L T + SL+G LSFA IP GR+ R
Sbjct: 277 ELDTVNMVIRLPKDKLADLLVKLPSWLTRHTCSKRELLSLIGCLSFACKCIPAGRIFLRS 336
Query: 727 IQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-------LSSPIFPRQVQHFISTDAS 779
+ S+ + + ++WW + LP L +P + + + TDAS
Sbjct: 337 MI-DISMTATSLSQVITLTDEFWHDVQWWCDFLPSWNGTASLLNPNWIPSPEFELFTDAS 395
Query: 780 -DLGWGSQVDSSFLSGLWSREQQN---WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDN 835
LG+G + + + W N + I KE+ + + + + DN
Sbjct: 396 ATLGYG---EGHWFANTWPTFITNDPLYSIAWKELLPILLSSLIWGHSWYGLRIRFHCDN 452
Query: 836 QTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
+VV + ++G + ++ V +F + H++ I G N +ADSLSR + L
Sbjct: 453 ISVVQ-IWKKGSSSCPRIMQLVRLLFFTAASNNFHVMISHISGFNNDIADSLSRQQIL 509
>gi|301604826|ref|XP_002932068.1| PREDICTED: hypothetical protein LOC100488873 [Xenopus (Silurana)
tropicalis]
Length = 486
Score = 90.9 bits (224), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 162/372 (43%), Gaps = 39/372 (10%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +PI L + G CLP G + + +AF++ W
Sbjct: 100 GALMAKADIESAFRLLPIHPDCHHLLGCWFEGSFFVDLCLPMGCSISCAHFEAFSTFLEW 159
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V + RS VV YLDDF V Q +LE ++A+S L +
Sbjct: 160 VVKI-RSGCGSVVHYLDDFFCVGQANTDTCFHLLETLQEVAISFGVPLA-----ADKTEG 213
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA V++FLG+ D LP K L + +L K L +S+LG L+FA V
Sbjct: 214 PATVMRFLGLEIDSLAGECRLPTQKVEDLMREVGSLRRDKKATLRRLQSVLGKLNFACRV 273
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW------LNALPL--SSPIFPR 768
IP+GR+ SRR+ QA+ H I+ V L W N L +S +
Sbjct: 274 IPVGRVFSRRLA-QATAGAQAPHHHVRISKEVRADLGVWEAFLAGFNGRVLFRASETTAQ 332
Query: 769 QVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSL 819
+++ + TDA+ GS+ ++L+G W Q Q W ++ E+F + A+ +
Sbjct: 333 ELE--LYTDAA----GSKGFGAYLAGRWCAAQWPQEWVEAGLTRNLVFLELFPIVVAMFI 386
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
L+ ++ SDN VV + + S +L + + L I A+ + G
Sbjct: 387 WERELRDKRIVFYSDNMGVVQGINNWSAS-SQPVLRLLRALVLRCLKLNISCRARHVEGC 445
Query: 880 YNSVADSLSRSK 891
N++AD+LSRS+
Sbjct: 446 KNNIADALSRSQ 457
>gi|301624526|ref|XP_002941553.1| PREDICTED: hypothetical protein LOC100487066 [Xenopus (Silurana)
tropicalis]
Length = 1511
Score = 90.9 bits (224), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 91/190 (47%), Gaps = 9/190 (4%)
Query: 464 SLQHLATPVSSAMSLHI-QEMLETGVLKRLDSTT---GFLSRLFLVPKGNGGTRPVLNLK 519
+L LA P L+I Q++ V+ + GF S +FLV K +GG VLNL
Sbjct: 168 ALAFLADPTKQKALLNIIQDLQNNNVISPVPQEYHFHGFYSNIFLVSKKDGG---VLNLH 224
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN+F+ ++F + + I L +M ID+ AY H+PI HQRFL +
Sbjct: 225 PLNKFVRYERFKMESLPSIIRGLTPNVFMSKIDIKDAYIHIPINPFHQRFLRFTLGQSHY 284
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
L FGL +AP+ F + + +++R +G+ V YLDD +L Q + E +
Sbjct: 285 QFQALSFGLTSAPRVFTKVLGALLAVVRLQGIHVTAYLDDLILTAQSEK--EANSHTGMP 342
Query: 640 ILGSLGWIVN 649
+ W+ N
Sbjct: 343 SPAATSWLAN 352
Score = 89.7 bits (221), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 64/105 (60%), Gaps = 1/105 (0%)
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
++ V +QSDN T V+YL RQGGT+S S L EV +I ++ R+ + A FIPG N
Sbjct: 508 MKGKHVRIQSDNSTTVAYLNRQGGTRSASALREVSRIMTWAETHRVLLSAIFIPGIQNWE 567
Query: 884 ADSLSRSKSLP-DWHLSRSATEQIFLKWGVPCIDLFASRVSAVVP 927
AD LSR+ P +W L +Q+ KWG PC+D+ ASR ++ P
Sbjct: 568 ADYLSRTTLDPGEWKLKEEIFQQLVAKWGQPCLDVMASRFNSQTP 612
>gi|313235656|emb|CBY11109.1| unnamed protein product [Oikopleura dioica]
Length = 428
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 138/320 (43%), Gaps = 22/320 (6%)
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
L+KGD M+ D + + +P+ ++ + G L FGL AP + +++
Sbjct: 22 LLKKGDLMVKYDDKRGFHQMPLAEESKKMACFEWGGKKFVNNILCFGLPAAPGIYQNMNL 81
Query: 601 WVASLLRSRGMRVVVYLDDFLLV----NQDPRILEIQGK-------LAVSILGSLGWIVN 649
+ LR G++ +YLDD L++ ++ R ++GK + + L +LG VN
Sbjct: 82 VGINFLRKNGIKATLYLDDRLIIITPKSEAHRQKLLEGKEVCKEAWITAATLVALGGFVN 141
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGY 709
++KS P ++FLG + D + + +P+ + L L L+ L S+ +L + G
Sbjct: 142 IEKSEFIPKQRMEFLGFILDSETETIEIPQSRWLALKAKLQQALHSERTSLKELERIRGT 201
Query: 710 LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW--WLN----ALPLSS 763
+ + V P R+ R+I + + + AV K EW WLN L
Sbjct: 202 QASMAEVFPNMRMLIRQITMLICQAEIQGAYEVRLTRAV--KAEWTTWLNFENSGLKRGW 259
Query: 764 PIFPRQ-VQHFISTDASDLGWGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQALSLN 820
RQ I TDAS+ V+ +S W E + HI KE AV AL
Sbjct: 260 KQQNRQNAGIIIYTDASNHAGAIVVEEWNISEKFAWDEEYASDHICIKEAVAVKYALEWY 319
Query: 821 LPLLQSSVVMVQSDNQTVVS 840
L++ V DN +VV
Sbjct: 320 AKRLENKKVTFLVDNSSVVE 339
>gi|301626497|ref|XP_002942427.1| PREDICTED: hypothetical protein LOC100490112 [Xenopus (Silurana)
tropicalis]
Length = 512
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 96/377 (25%), Positives = 152/377 (40%), Gaps = 51/377 (13%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G M +D+ + +P+ L + G CLP G + F + S ++
Sbjct: 128 QGALMAKVDVESTFRLLPVHQESLHLLGCYFEGGYYVDRCLPMGCSILCAYFEAFSTFIE 187
Query: 604 SLLR--SRGMRVVVYLDDFLLVNQDPRIL-EIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R + V+ YLDDF V +L I + + G G + K+ P
Sbjct: 188 WVVRKWAGANTVIHYLDDFFCVGPGHSMLCAILLQTIQKVAGLFGIPLAPDKTE-GPNTC 246
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L+FLGI D LP+DK L + +K L +SLLG L+FA +IPMG
Sbjct: 247 LRFLGIEIDTVRQESRLPQDKVQQLKEEVGYARTAKKITLRQLQSLLGRLNFACRIIPMG 306
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ-------HF 773
R R + + +R H +N LE W L + Q Q HF
Sbjct: 307 RAFPRNLAMATAGIR-QPHHFIRLNEGHREDLEVWRVFLQDFNGRLYWQSQPRANGELHF 365
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMV-- 831
+ A +G+G+ + +G W + W V Q L+ NL LL+ ++V
Sbjct: 366 HTDAAGSVGFGAYFAGRWCAGTWPNK---W---------VEQKLTSNLTLLELFPIIVSV 413
Query: 832 -----QSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ--------------DWRIHIL 872
Q +NQ+VV Y T ++S++ + + S+ +
Sbjct: 414 ELWGTQLENQSVVFY------TDNMSVVMAINNLTSGSRPVLVLLKHLVLRCLQLNVRFR 467
Query: 873 AQFIPGAYNSVADSLSR 889
A+ +PG N +ADSLSR
Sbjct: 468 AKHVPGYTNEIADSLSR 484
>gi|326669487|ref|XP_003199024.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1359
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 119/465 (25%), Positives = 189/465 (40%), Gaps = 63/465 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L++P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 397 PKGHLYSLSSPEREAMDKYIDESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 454
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 455 LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 514
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V L V VYLDD L+ + ++ + +
Sbjct: 515 YRVLPFGLTNAPAVFQALVNDV--LRDMVNQFVFVYLDDILIFSPSMQVHTQHVRQVLQR 572
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL---RTLLASKT 697
L V +K A + FLG + ++ G I + A
Sbjct: 573 LLENQLFVKAEKCVFH-AKSVSFLGFV---------------ISAGEIKADPSKVRAVAE 616
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F R S Q A L L +P + I + + L
Sbjct: 617 WPTPDSRKALQRFLGFANFYRRFIRNFS---QIAAPLTVLTSPKVPFIWGSKAQEAFDNL 673
Query: 757 NALPLSSPIF----PRQVQHFISTDASDLGWG------SQVDS-----SFLSGLWSREQQ 801
+ +S+P+ P++ Q + DASD+G G SQ D +F S S ++
Sbjct: 674 KSRFISAPVLSIPDPKR-QFIVEVDASDVGVGAVLSQRSQRDEKVHPCAFFSHRLSPTER 732
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
N+ I +E+ AV AL L+ +V +V +D+ K+L +S ++
Sbjct: 733 NYDIGNRELLAVRLALGEWRHWLEGAVQPFVVWTDH-------------KNLEYISTAKR 779
Query: 860 IFLLSQDW-----RIHILAQFIPGAYNSVADSLSRSKSLPDWHLS 899
+ W R + + PG+ N+ DSLSR S P+ +S
Sbjct: 780 LSSRQARWSLYFSRFNFTLSYRPGSKNTKPDSLSRMFSAPEREVS 824
>gi|301614821|ref|XP_002936889.1| PREDICTED: hypothetical protein LOC100488996 [Xenopus (Silurana)
tropicalis]
Length = 1088
Score = 90.1 bits (222), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 165/387 (42%), Gaps = 40/387 (10%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G M D+ A+ +P+ L ++ G CLP G + + F S ++
Sbjct: 652 RGALMAKSDIESAFRLLPVHPDCFHLLGCTFGGYYFVDMCLPMGCSISCYYFELFSTFLE 711
Query: 604 SLL--RSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL------QKSSL 655
++ + ++ YLDDFL V Q S+L S ++ + +
Sbjct: 712 WMVAQETACRSLLHYLDDFLFVG------PAQSDQCASLLNSFRDLMAFIGVPVAEDKTE 765
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
P V+ FLGI D LP+DK L ++ + +K L ++LLG++ FA
Sbjct: 766 GPVTVITFLGIQIDTVRMVFQLPKDKLEVLSRLIDRAVKAKKLTLKQVQTLLGHMVFACK 825
Query: 716 VIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V+PMGR+ RR+ +++ + AP H I+ L W + L + + Q
Sbjct: 826 VMPMGRV-CRRL--SMAMVGVKAPHHYIRISKNHREDLMLWQSFLAEYNGMTCWQAAAVD 882
Query: 775 STD-------ASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALS 818
S D AS +G G +F G W E+ + W ++ E+F + A
Sbjct: 883 SPDIELFTNAASSVGMG-----AFFQGEWCAERWPKTWEGSDLLRNLTFLELFPILVAAF 937
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
L L + ++ DNQ VV + RQ + S +L + + L I A+ + G
Sbjct: 938 LWGESLSNRRIIFWCDNQIVVHVINRQ-TSSSPPVLELLRALVLQCLRLNIWFRARHVLG 996
Query: 879 AYNSVADSLSRSKSLPDWHLSRSATEQ 905
S+ADSLSR + + W L+ +A+++
Sbjct: 997 VKISIADSLSRFQFMEFWRLAPNASQR 1023
>gi|313244785|emb|CBY15491.1| unnamed protein product [Oikopleura dioica]
Length = 724
Score = 90.1 bits (222), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 104/415 (25%), Positives = 169/415 (40%), Gaps = 35/415 (8%)
Query: 495 TTGFLSRLFLVPKGNGGTRPVLN---LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISI 551
T GF+ LV + TR LN LK L + P K + L++GD +
Sbjct: 166 TAGFI----LVDRPEKDTRVCLNGSILKPLELYTFPCKMDSVKT--AIQLLKRGDLLAKF 219
Query: 552 DLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGM 611
D + Y +P+ ++ + G V L FGL AP + L++ + LR G+
Sbjct: 220 DDKKGYHQMPLAAESKKMACFKWGGHVFENNILAFGLPAAPGQYQLLNSVGINFLRRNGI 279
Query: 612 RVVVYLDDFLLV------NQDPRIL--EIQGK---LAVSILGSLGWIVNLQKSSLSPAPV 660
++ +YLDD LLV Q ++L E+ K + + L ++G VN++KS P
Sbjct: 280 KITLYLDDRLLVVTPESEEQRQKLLKEEVICKEVWVVAATLVAMGGFVNIEKSEFKPTQR 339
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
++FLG + D + + +P+ + L +R + T L + G + V
Sbjct: 340 IEFLGFILDTEKETVEIPKGRWQVLKKRIREAQSEPTVELKLLERIRGTQASMVEVFSNM 399
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLS-SPIFPRQVQH----FIS 775
R+ R+I + L T + AV + W S + +Q + I
Sbjct: 400 RMMIRQITILITQTELEKKTHTVLTKAVRREWRIWFKFEESGLSRCWRKQDRSDAGLLIY 459
Query: 776 TDASDLGWGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQS 833
TDAS ++S LS W + N HI KE A+ AL L V
Sbjct: 460 TDASKHAGAIVIESWKLSEKFAWEEDLANAHIGIKEAAAIRMALEWYGKNLAKKRVTFLC 519
Query: 834 DNQTVVSYLRRQG---GTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
DN +VV QG G+K + ++ KI+ L+Q I + +++ D
Sbjct: 520 DNDSVV-----QGAVNGSKDPEMNKQLVKIWSLAQKRSIDMKVEWVSTKLQKADD 569
>gi|301619187|ref|XP_002938982.1| PREDICTED: hypothetical protein LOC100486285 [Xenopus (Silurana)
tropicalis]
Length = 1137
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 159/369 (43%), Gaps = 35/369 (9%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
+G + D+ A+ +PI L + + CLP G + + + F+S
Sbjct: 750 RGALLAKSDIESAFRLLPIHPDCFHLLGIKFANLYFVDMCLPMGCSISCYYFELFSSFLE 809
Query: 601 WVASLLRSRGMRVVVYLDDFLLVN-----QDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
WV + + ++ ++ YLDDFL V + R+L L + ++ + G + K+
Sbjct: 810 WVVTQV-AQSNSMLHYLDDFLFVGPANSPECARLLH----LFMEVMKNFGVPIAKDKTE- 863
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
P V+ FLGI D LP +K +L +L L +K L +SLLG+L+FAS
Sbjct: 864 GPQEVIVFLGIEIDSREMVFRLPLEKLESLSQLLDRALMAKKLTLKQIQSLLGHLTFASR 923
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF-- 773
++PMGR RR+ ++ H + + L W L + Q+
Sbjct: 924 IMPMGRAFCRRLSLSTKGIKY-PNHYIRMTKHIKDDLRIWQKFLAEYNGQSCWQISEKSN 982
Query: 774 ----ISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLN 820
+ TDA+ GS+ ++ G W Q W ++ E+F + A +
Sbjct: 983 LELELFTDAA----GSKGMGAYFQGQWCSAQWPSFWRDTDLIRNLTCLELFPIVVASHIW 1038
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAY 880
LL + V+ DN +VV + Q + S +L+ + + L I A+ +PG
Sbjct: 1039 GELLANQRVIFWCDNSSVVQVINNQTSS-SPPVLNLLRALVLQCLRMNIWFRARHVPGVQ 1097
Query: 881 NSVADSLSR 889
NS+AD+LSR
Sbjct: 1098 NSIADALSR 1106
>gi|406702084|gb|EKD05152.1| hypothetical protein A1Q2_00573 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1043
Score = 89.7 bits (221), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 153/367 (41%), Gaps = 39/367 (10%)
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLL 606
++ +DL AY HV + L +G CL FG ++P F S +A +L
Sbjct: 254 WLWKMDLKDAYRHVVVDAADAALLGFHLDGKDYVDCCLNFGGKSSPFIFNMFSEALAWIL 313
Query: 607 RSRGMRVVVYLDDFL---LVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
S G+R LDDF + P IL + ++ G LG V KS P ++
Sbjct: 314 ASFGLRNRHLLDDFFGRCKAARGPAIL----RFLDALCGYLGLSVARHKSLT--GPCVEI 367
Query: 664 LGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN---LDSARSLLGYLSFASFVIPMG 720
LGIM D WL DK L +R+ L+ ++ + +A SL+G L+ A+ ++ G
Sbjct: 368 LGIMVDGPTASAWLSPDKLEKLRWSVRSALSRESNDQISFSAAESLVGSLTDATRIVAAG 427
Query: 721 RLHSR---------RIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI----FP 767
R +R R + + LRL + + L WW N L + P
Sbjct: 428 RAFTRGFYDWLTDNRHRGHRATLRL--------SRDLKSDLRWWNNLLRKWPGVRLLRRP 479
Query: 768 RQVQHFISTDASDLGWGSQVD-----SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLP 822
R + A+ G G + ++ S + +I E AVH+AL P
Sbjct: 480 RGSIEIWTDAATSSGLGGHLGPPEAVTARFSAPVPDHLRGANIMALEAEAVHEALQRWAP 539
Query: 823 LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNS 882
+ V+ + DNQ VV+ L G + V +IF L + RI + +I N+
Sbjct: 540 AHKGFRVVCRVDNQAVVNAL-LTGRIRHRDTQRVVRRIFTLLHEHRIFLRVSWIASEDNA 598
Query: 883 VADSLSR 889
VAD+LSR
Sbjct: 599 VADALSR 605
>gi|326676605|ref|XP_003200625.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1430
Score = 89.4 bits (220), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 111/454 (24%), Positives = 187/454 (41%), Gaps = 60/454 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P +AM+ +IQE L TG+++ S G + F V K +GG RP ++ +GLN
Sbjct: 533 LSPPEQAAMNAYIQESLATGIIRASTSPAG--AGFFFVGKKDGGLRPCIDYRGLN----- 585
Query: 528 KKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P LQ +DL AY V I+ + A +
Sbjct: 586 -KITIRNRYPLPLMATAFELLQGASIFTKLDLRNAYHLVRIRQGDEWKTAFNTPTGHYEY 644
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F +L N V L + V VYLDD L+ ++ E + L
Sbjct: 645 LVMPFGLTNAPAVFQALINDV--LRDMLNIFVFVYLDDILIFSKSMEEHEGHVSRVLQRL 702
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDP-HLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
V +K + +FLG + P HL+ P + L + T
Sbjct: 703 LENHLFVKPEKCEFH-VSLTKFLGYIVTPGHLEMD--PSKIKAVLNWPIPT--------- 750
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT--PINPAVLPKLEWWLNA 758
+ + + ++ FA+F R S + +L + G + P A L+ +
Sbjct: 751 -TVKEVQRFVGFANFYRKFIRNFSSVVAPLTALTKGGGVKIEWGPKAAAAFQDLKDRFTS 809
Query: 759 LPLSSPIFPRQVQHFISTDASDLGWGSQVD----------SSFLSGLWSREQQNWHINKK 808
P+ S P + + DASD+G G+ + +F+S S ++N+H+ +
Sbjct: 810 APILSIPNP-DIPFMVEVDASDVGVGAILSQRNEDGKLHPCAFMSRRLSNAERNYHVGDR 868
Query: 809 EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW- 867
E+ AV AL L+ + + Q + + K+L L + +++ W
Sbjct: 869 ELLAVKLALEEWRHWLEGA----RHPFQVLTDH-------KNLEYLQQAKQLNPRQARWS 917
Query: 868 ----RIHILAQFIPGAYNSVADSLSRSKSLPDWH 897
R + + PG+ N D+LSR+ S P+ H
Sbjct: 918 LFFNRFQFILTYRPGSKNLKPDALSRAYS-PETH 950
>gi|326670590|ref|XP_003199242.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1430
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 111/454 (24%), Positives = 187/454 (41%), Gaps = 60/454 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P +AM+ +IQE L TG+++ S G + F V K +GG RP ++ +GLN
Sbjct: 533 LSPPEQAAMNAYIQESLATGIIRASTSPAG--AGFFFVGKKDGGLRPCIDYRGLN----- 585
Query: 528 KKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P LQ +DL AY V I+ + A +
Sbjct: 586 -KITIRNRYPLPLMATAFELLQGASIFTKLDLRNAYHLVRIRQGDEWKTAFNTPTGHYEY 644
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F +L N V L + V VYLDD L+ ++ E + L
Sbjct: 645 LVMPFGLTNAPAVFQALINDV--LRDMLNIFVFVYLDDILIFSKSMEEHEGHVSRVLQRL 702
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDP-HLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
V +K + +FLG + P HL+ P + L + T
Sbjct: 703 LENHLFVKPEKCEFH-VSLTKFLGYIVTPGHLEMD--PSKIKAVLNWPIPT--------- 750
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT--PINPAVLPKLEWWLNA 758
+ + + ++ FA+F R S + +L + G + P A L+ +
Sbjct: 751 -TVKEVQRFVGFANFYRKFIRNFSSVVAPLTALTKGGGVKIEWGPKAAAAFQDLKDRFTS 809
Query: 759 LPLSSPIFPRQVQHFISTDASDLGWGSQVD----------SSFLSGLWSREQQNWHINKK 808
P+ S P + + DASD+G G+ + +F+S S ++N+H+ +
Sbjct: 810 APILSIPNP-DIPFMVEVDASDVGVGAILSQRNEDGKLHPCAFMSRRLSNAERNYHVGDR 868
Query: 809 EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW- 867
E+ AV AL L+ + + Q + + K+L L + +++ W
Sbjct: 869 ELLAVKLALEEWRHWLEGA----RHPFQVLTDH-------KNLEYLQQAKQLNPRQARWS 917
Query: 868 ----RIHILAQFIPGAYNSVADSLSRSKSLPDWH 897
R + + PG+ N D+LSR+ S P+ H
Sbjct: 918 LFFNRFQFILTYRPGSKNLKPDALSRAYS-PETH 950
>gi|327262042|ref|XP_003215835.1| PREDICTED: LOW QUALITY PROTEIN: cation-independent
mannose-6-phosphate receptor-like [Anolis carolinensis]
Length = 2641
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 112/235 (47%), Gaps = 11/235 (4%)
Query: 626 DPRILE-IQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLT 684
D R+L + G + S +VN +KS L P ++F+G++ D LPED+
Sbjct: 1993 DLRVLSSLTGSWVFADKDSSSIVVNFKKSHLQPTQQIRFIGMLLDSTTCTAQLPEDRFRA 2052
Query: 685 LGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRL-----GAP 739
L L +L + +++LG+++ V P RL R +Q A ++ +P
Sbjct: 2053 LRTSLLLILHHPFSSAKDIQTILGHMASTMAVTPYARLRMRPLQ--AWFIKTFDPVKDSP 2110
Query: 740 HLTPINPA-VLPKLEWWL--NALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLW 796
H P+ + L+WW N + + P P + I+TDAS GWG+ + G W
Sbjct: 2111 HTRLSLPSHICHSLQWWTHRNNICVGVPFRPSDLSTTITTDASLTGWGTFSGNLATHGHW 2170
Query: 797 SREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSL 851
+ + HIN ++ A+ + L + +L ++ V V +DN TV+ Y+ +QG K L
Sbjct: 2171 TSTEITHHINVLKLLALFKGLRVFQDILSNTTVQVCTDNTTVMWYINKQGDDKVL 2225
>gi|301607309|ref|XP_002933268.1| PREDICTED: hypothetical protein LOC100488715 [Xenopus (Silurana)
tropicalis]
Length = 798
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 159/366 (43%), Gaps = 29/366 (7%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
KG + D+ A+ + I + L + G CLP G + + + F S ++
Sbjct: 410 KGALLAKSDIESAFRLLHIHSDCYHLLGCQFEGKFYYNMCLPMGCSISCRYFECFSTFLE 469
Query: 604 SLLRSR-GMRVVV-YLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R G V+ YLDDFL + Q + ++ + G ++ +K+ P V
Sbjct: 470 WIVRHETGYNSVIHYLDDFLFIGPQKTNVCQLLLSTFQFFMDRFGVPLSKEKTE-GPITV 528
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L FLGI LPEDK L + + A+K L S +SLLG L FA ++P+
Sbjct: 529 LSFLGIKIVTVSLVFRLPEDKLQKLKCTVAEITAAKKITLRSMQSLLGLLVFACRIMPIT 588
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQVQH 772
R+ SRR+ ++ H + L+ W L + + + ++
Sbjct: 589 RVFSRRLSLSTQGIK-PPHHFIRTTKQLREDLKVWQTFLEQYNGHTCLMDTEVSKEELSL 647
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPL 823
F TDA+ GS + L+ W EQ NW ++ E+F + A+ +
Sbjct: 648 F--TDAA----GSTGFGAILAQSWCAEQWPDNWASVGLCKNLTLLELFPIVVAVEIWGHR 701
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
+ + +DN +VV + R + SL +L+ + ++ L ++ I A+ +PG N+
Sbjct: 702 ISGKKICFWTDNMSVVFTVNRL-TSASLPVLALLRQLVLRCLEFNIWFRARHVPGRVNTA 760
Query: 884 ADSLSR 889
AD+LSR
Sbjct: 761 ADALSR 766
>gi|313236762|emb|CBY12015.1| unnamed protein product [Oikopleura dioica]
Length = 486
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 158/358 (44%), Gaps = 32/358 (8%)
Query: 420 ELVGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLH 479
EL+GG ++ +++ +RL P Y I + + + + Q + ++
Sbjct: 92 ELMGGNMKTYINGRLRLTKFVP-----RPYEITSQGRTRTLAIPN-QESVRERADDVTQQ 145
Query: 480 IQEMLETGVLKRLDS------TTGFLSRLFLVPKGNGGTRPVLN---LKGLNQFLSPKKF 530
++E +++G ++ + T GF+ LV + TR LN +K L ++ P K
Sbjct: 146 LKEWIKSGSVELWKASKKPWLTAGFI----LVDRPEKETRVCLNGSIMKPLEKYTFPCKL 201
Query: 531 SLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLAT 590
I+ L+KGD M+ D + + +P+ ++ + G L FGL
Sbjct: 202 DSISL--AIQLLKKGDLMVKYDDKRGFHQMPLAEESKKMACFEWGGKKFVNNILCFGLPA 259
Query: 591 APQAFASLSNWVASLLRSRGMRVVVYLDDFLLV----NQDPRILEIQGK-------LAVS 639
AP + +++ + LR G++ +YLDD L++ ++ R ++GK + +
Sbjct: 260 APGIYQNMNLVGINFLRKNGIKATLYLDDRLIIITPKSEAHRQKLLEGKEVCKEAWITAA 319
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L +LG VN++KS P ++FLG + D + + +P+ + L L L+ L S+ +
Sbjct: 320 TLVALGGFVNIEKSEFIPKQRMEFLGFILDSETETIEIPQSRWLALKAKLQQALHSERTS 379
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLN 757
L + G + + V P R+ R+I + + + AV + + WLN
Sbjct: 380 LKELERIRGTQASMAEVFPNMRMLIRQITMLICQAEIQGAYEVRLTRAVKAEWKTWLN 437
>gi|301608924|ref|XP_002934033.1| PREDICTED: hypothetical protein LOC100492780 [Xenopus (Silurana)
tropicalis]
Length = 699
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 156/381 (40%), Gaps = 63/381 (16%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G M D+ A+ +P+ L ++ G CLP G + + F S ++
Sbjct: 298 RGALMAKSDIESAFRLLPVHPDCFHLLGCTFGGYNFVDMCLPMGCSISCYYFELFSTFLE 357
Query: 604 SLL--RSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++ + ++ YLDDFL V +D + P V+
Sbjct: 358 WMVAQETACKSLLHYLDDFLFVAED--------------------------KTEGPVTVI 391
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
FLGI D DK L ++ + ++ L ++LLG++ FA V+PMGR
Sbjct: 392 TFLGIQID----------DKLEALSRLIDRAVTARKLTLKQVQTLLGHMVFACKVMPMGR 441
Query: 722 LHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTD--- 777
RR+ + + + AP H I+ L W + L + + Q S D
Sbjct: 442 AFCRRL--SMATVGVKAPHHYIRISKNHREDLMLWQSFLAEYNGMTCWQAAAVDSPDIEL 499
Query: 778 ----ASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPLL 824
AS +G G +F G W E+ + W ++ E+F + A L L
Sbjct: 500 FTDAASSVGLG-----AFFQGEWCAERWPKTWEGSDLLRNLTFLELFPILVASFLWGESL 554
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
+ V DNQ+VV + RQ + S +L + + L I AQ +PG NS+A
Sbjct: 555 SNRRVTFWCDNQSVVHVINRQ-TSSSPPVLELLRALVLQCLRLNIWFKAQHVPGVKNSIA 613
Query: 885 DSLSRSKSLPDWHLSRSATEQ 905
DSLSR + + W L+ +A+++
Sbjct: 614 DSLSRFQFMEFWRLAPNASQR 634
>gi|326673825|ref|XP_003200006.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1421
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 119/465 (25%), Positives = 189/465 (40%), Gaps = 63/465 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L++P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 397 PKGHLYSLSSPEREAMDKYIDESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 454
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 455 LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 514
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V L V VYLDD L+ + ++ + +
Sbjct: 515 YRVLPFGLTNAPAVFQALVNDV--LRDMVNQFVFVYLDDILIFSPSMQVHTQHVRQVLQR 572
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL---RTLLASKT 697
L V +K A + FLG + ++ G I + A
Sbjct: 573 LLENQLFVKAEKCVFH-AKSVSFLGFV---------------ISAGEIKADPSKVRAVAE 616
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F R S Q A L L +P + I + + L
Sbjct: 617 WPTPDSRKALQRFLGFANFYRRFIRNFS---QIAAPLTVLTSPKVPFIWGSKAQEAFDNL 673
Query: 757 NALPLSSPIF----PRQVQHFISTDASDLGWG------SQVDS-----SFLSGLWSREQQ 801
+ +S+P+ P++ Q + DASD+G G SQ D +F S S ++
Sbjct: 674 KSRFISAPVLSIPDPKR-QFIVEVDASDVGVGAVLSQRSQRDEKVHPCAFFSHRLSPTER 732
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
N+ I +E+ AV AL L+ +V +V +D+ K+L +S ++
Sbjct: 733 NYDIGNRELLAVRLALGEWRHWLEGAVQPFVVWTDH-------------KNLEYISTAKR 779
Query: 860 IFLLSQDW-----RIHILAQFIPGAYNSVADSLSRSKSLPDWHLS 899
+ W R + + PG+ N+ DSLSR S P+ +S
Sbjct: 780 LSSRQARWSLYFSRFNFTLLYRPGSKNTKPDSLSRMFSAPEREVS 824
>gi|301628142|ref|XP_002943218.1| PREDICTED: hypothetical protein LOC100485753 [Xenopus (Silurana)
tropicalis]
Length = 891
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 111/418 (26%), Positives = 169/418 (40%), Gaps = 69/418 (16%)
Query: 526 SPKKFSLINHFRIP-------------------SFLQ---------KGDYMISIDLSQAY 557
P KF LI+H P SF Q KG + D+ A+
Sbjct: 280 EPGKFRLIHHLSYPQGESVNDDINPELCSVTYISFDQAVALVRKAGKGALLAKADIESAF 339
Query: 558 FHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRG--MRVVV 615
+PI L ++ G + CLP G + + F + S+++ ++R R +V
Sbjct: 340 RLLPIHPECHHLLGCTFEGSIYVDLCLPMGCSISCSYFETFSSFMEWVVRQRAHTTGIVH 399
Query: 616 YLDDFLLVN-----QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP 670
YLDDFL V + IL ++A L ++ P L FLG+ D
Sbjct: 400 YLDDFLCVGPAGSEECFHILTTLQEVAEDFGVPLA-----PDKTVGPVTCLSFLGLEIDS 454
Query: 671 HLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR-------SLLGYLSFASFVIPMGRLH 723
LPEDK L L S W + A+ S+LG L+FA V+ MGR+
Sbjct: 455 VRGESRLPEDK-------LHDLRKSVAWAREKAKMTVREIQSMLGKLNFACRVVVMGRVF 507
Query: 724 SRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNALPL--SSPIFP-RQVQ----HFIS 775
RR+ + R APH P V LE W L IFP R+V +
Sbjct: 508 CRRLGGLLAGAR--APHHHIRLPQGVRDDLEVWQRFLESFNGKVIFPEREVSSTELQLFT 565
Query: 776 TDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVMV 831
A G+G+ + S+ + W E + K E+F + ++ + L++ V+
Sbjct: 566 DAAGSFGFGAYLGGSWCADRWPDEWFRLGLVKNVCFLELFPIVVSVFIWGDKLRNRQVVF 625
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
SDN VV + RQ T S ++ + + L + + A+ +PG N +AD+LSR
Sbjct: 626 VSDNMGVVQVINRQTAT-SAEVVRLLRVLVLRCLNINLGFRARHLPGVKNEIADALSR 682
>gi|301607935|ref|XP_002933561.1| PREDICTED: hypothetical protein LOC100491153 [Xenopus (Silurana)
tropicalis]
Length = 521
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 114/451 (25%), Positives = 180/451 (39%), Gaps = 75/451 (16%)
Query: 514 PVLNLK----GLNQFLSPKKFSLINHFRIP-------------------SFLQ------- 543
P+ NL+ G+ P KF LI+H P SF Q
Sbjct: 72 PMTNLRVSPLGVVPKKEPGKFRLIHHLSYPKGGSVNDDIDKELCSVSYTSFDQAVAVVRN 131
Query: 544 --KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
+G M D+ A+ +PI Q L + G CLP G + + F + S +
Sbjct: 132 AGRGALMAKADIESAFRLLPIHPDCQHLLGCWFEGAFFVDLCLPMGCSISCAYFEAFSTF 191
Query: 602 VASLLRSRG--MRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKS 653
+ ++R + VV YLDDFL + + IL +L +
Sbjct: 192 LEWVVRKKAGYSSVVHYLDDFLCMGP------ASSDICFHILDTLREVAEEFGVPLAADK 245
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ PA +QFLG+ D + LP K L + + +K L +SLLG L+FA
Sbjct: 246 TEGPATTMQFLGLEVDSVRGQCRLPASKVTDLREEVGRMRQTKKPTLRQVQSLLGKLNFA 305
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW------LNALPL-SSPIF 766
VIP+GR+ SRR+ QA H + V L W N + L +P
Sbjct: 306 CRVIPVGRVFSRRLA-QAMAGATAPHHHVRLGREVRADLGVWELFLRNFNGVVLFQAPEA 364
Query: 767 PRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQAL 817
Q + A +G+G ++L+G W E+ W ++ E+F + AL
Sbjct: 365 TTQEMQLFTDAAGSVGFG-----AYLAGQWCAEKWPTEWVGSGLVRNLAFLELFPIVVAL 419
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK---SLSLLSEVEKIFLLSQDWRIHILAQ 874
+ L++S ++ SDN +VV + + L V + F L+ I A+
Sbjct: 420 FVWEQELKNSSIVFFSDNLSVVQGINNWSASSPPVLRLLRVLVLRCFNLN----IRCRAR 475
Query: 875 FIPGAYNSVADSLSRSKSLPDWHLSRSATEQ 905
+ G N +AD+LSRS+ W ++ A ++
Sbjct: 476 HVEGVKNVIADALSRSQWERFWQVAPEAEKE 506
>gi|301624976|ref|XP_002941774.1| PREDICTED: hypothetical protein LOC100488583 [Xenopus (Silurana)
tropicalis]
Length = 825
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/390 (24%), Positives = 162/390 (41%), Gaps = 32/390 (8%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
KG M D+ A+ +PI L ++G CLP G + + F + S +V
Sbjct: 438 KGALMSKCDIQSAFRLLPINPQCFHLLGFHFDGLFYFDRCLPMGCSLSCFYFEAFSTFVE 497
Query: 604 -SLLRSRGMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSL 655
SL G V+ YLDDFL + P + ++ L + W+ + ++
Sbjct: 498 WSLKWETGSEYVIHYLDDFLFLG--PHGTDTCRRM----LNTFVWLAQEFGIPLAPEKTV 551
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
P + FLGI D LPEDK L ++ L L +SL+G+L+FA+
Sbjct: 552 YPTTSIVFLGIEIDSIRMEFRLPEDKVNKLKLLVAATLTCSKLKLKQLQSLIGHLNFATR 611
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-------LSSPIFPR 768
++P+GR+ ++R+ + ++ H+ I V L W L
Sbjct: 612 IMPIGRVFTKRLCTLTAGIKNPNWHIR-IPLEVKSDLLIWQQFLEHFNGKTCWQEDYVDN 670
Query: 769 QVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLL 824
+ + AS +G+G+ + + G W + + K E F + ++ + L
Sbjct: 671 ETLQLFTDAASTVGFGAFFQNQWSVGTWPTKWIEAGLTKNMVLLEFFPILVSIEIWGLEL 730
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
+ ++V DN VV + + S +L+ + + L + I + + IPG YN A
Sbjct: 731 SNKKIIVNCDNLGVVQVINNMSSS-SPPVLNLLRQFVLRALSRNIMVKERHIPGIYNKTA 789
Query: 885 DSLSRSKSLPDWHLSRSATEQIFLKWGVPC 914
D+LSR + W L R+A E GV C
Sbjct: 790 DALSRLQFQIFWELQRNACEV-----GVTC 814
>gi|301606719|ref|XP_002932975.1| PREDICTED: hypothetical protein LOC100496211 [Xenopus (Silurana)
tropicalis]
Length = 522
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 115/454 (25%), Positives = 180/454 (39%), Gaps = 81/454 (17%)
Query: 514 PVLNLK----GLNQFLSPKKFSLINHFRIP-------------------SFLQ------- 543
P++NL+ G+ P KF LI+H P SF Q
Sbjct: 73 PMMNLRVSPLGVVPKKEPGKFRLIHHLSYPKGGSVNDDIDKELCSVSYTSFDQAVAVVRK 132
Query: 544 --KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
KG M D+ A+ +PI L + G CLP G + + F + S +
Sbjct: 133 AGKGALMAKADIESAFRLLPIHPDCHHLLGCWFEGAFFVDLCLPMGCSISCAYFEAFSTF 192
Query: 602 VASLLRSRG--MRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKS 653
+ ++R + VV YLDDFL V + IL +L +
Sbjct: 193 LEWVIRRKAGYSSVVHYLDDFLCVGP------ASSDICFHILDTLREVAEEFGVPLAADK 246
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ PA +QFLG+ D + LP K L + + +K L +SLLG L+FA
Sbjct: 247 TEGPATTMQFLGLEVDSEKGQCRLPVSKVTDLREEVGRMRQTKKPTLRQVQSLLGKLNFA 306
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAP---HLTPINPAVLPKLEWW------LNALPL-SS 763
VIP GR+ SRR+ R + GA H + V L W N + L +
Sbjct: 307 CRVIPAGRVFSRRLARAMA----GATALHHHVRLGREVRADLGVWEVFLHNFNGVVLFQA 362
Query: 764 PIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQ--NW-------HINKKEMFAVH 814
P Q + A +G+G ++L+G W E+ W ++ E+F +
Sbjct: 363 PEATAQEMQLFTDAAGSVGFG-----AYLAGQWCAEKWPLEWVESGLVRNLAFLELFPIV 417
Query: 815 QALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK---SLSLLSEVEKIFLLSQDWRIHI 871
AL + L++ ++ SDN +VV + + L V + F L+ I
Sbjct: 418 VALFVWEQELRNRSIVFFSDNLSVVQGINNWSASSPPVLRLLRVLVLRCFRLN----IRC 473
Query: 872 LAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQ 905
A+ + G N +AD+LSRS+ W ++ A ++
Sbjct: 474 RARHVEGVKNVIADALSRSQWERFWQVAPEAEKE 507
>gi|326677060|ref|XP_003200744.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1430
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 111/454 (24%), Positives = 187/454 (41%), Gaps = 60/454 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P +AM+ +IQE L TG+++ S G + F V K +GG RP ++ +GLN
Sbjct: 533 LSPPEQAAMNAYIQESLATGIIRASTSPAG--AGFFFVGKKDGGLRPCIDYRGLN----- 585
Query: 528 KKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P LQ +DL AY V I+ + A +
Sbjct: 586 -KITIRNRYPLPLMATAFELLQGASIFTKLDLRNAYHLVRIRQGDEWKTAFNTPTGHYEY 644
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F +L N V L + V VYLDD L+ ++ E + L
Sbjct: 645 LVMPFGLTNAPAVFQALINDV--LRDMLNIFVFVYLDDILIFSKSMEEHEGHVSRVLQRL 702
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDP-HLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
V +K + +FLG + P HL+ P + L + T
Sbjct: 703 LENHLFVKPEKCEFH-VSLTKFLGYIVTPGHLEMD--PSKIKAVLNWPIPT--------- 750
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT--PINPAVLPKLEWWLNA 758
+ + + ++ FA+F R S + +L + G + P A L+ +
Sbjct: 751 -TVKEVQRFVGFANFYRKFIRNFSSVVAPLTALTKGGGVKIEWGPKAAAAFQDLKDRFTS 809
Query: 759 LPLSSPIFPRQVQHFISTDASDLGWGSQVD----------SSFLSGLWSREQQNWHINKK 808
P+ S P + + DASD+G G+ + +F+S S ++N+H+ +
Sbjct: 810 APILSIPNP-DMPFMVEVDASDVGVGAILSQRNEDGKLHPCAFMSRRLSNAERNYHVGDR 868
Query: 809 EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW- 867
E+ AV AL L+ + + Q + + K+L L + +++ W
Sbjct: 869 ELLAVKLALEEWRHWLEGA----RHPFQVLTDH-------KNLEYLQQAKQLNPRQARWS 917
Query: 868 ----RIHILAQFIPGAYNSVADSLSRSKSLPDWH 897
R + + PG+ N D+LSR+ S P+ H
Sbjct: 918 LFFNRFQFILTYRPGSKNLKPDALSRAYS-PETH 950
>gi|66828689|ref|XP_647698.1| hypothetical protein DDB_G0267338 [Dictyostelium discoideum AX4]
gi|60475843|gb|EAL73775.1| hypothetical protein DDB_G0267338 [Dictyostelium discoideum AX4]
Length = 161
Score = 88.6 bits (218), Expect = 2e-14, Method: Composition-based stats.
Identities = 45/145 (31%), Positives = 77/145 (53%)
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+PS +++G YM+ +D+ +AY HV + ++ + G +PFGL+TAP+ F
Sbjct: 7 LPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMPFGLSTAPRIFTM 66
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
L V +LR + V+ YLDD L+V K + +L LG+ +NL+K L P
Sbjct: 67 LLRHVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMDLLVKLGFKLNLEKIVLEP 126
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQ 682
+ FLG+ D ++ +P++K+
Sbjct: 127 TQSITFLGLQIDSKSMKLLVPKEKK 151
>gi|301632390|ref|XP_002945269.1| PREDICTED: hypothetical protein LOC100492160 [Xenopus (Silurana)
tropicalis]
Length = 521
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 157/370 (42%), Gaps = 33/370 (8%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G M D+ A+ +PI L L + G CLP G + + F + S ++
Sbjct: 134 QGALMAKADIESAFRLLPIHPDCHHLLGLWFEGAFFVDLCLPMGCSISCAYFEAFSTFLE 193
Query: 604 SLLRSRG--MRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSL 655
++R + VV YLDDFL V P +I IL +L + + +
Sbjct: 194 WVVRRKAGYSSVVHYLDDFLCVG--PAASDI----CFHILDTLREVADDFGVPLAADKTE 247
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
PA +QFLG+ D + LP +K L +R + +K L +SLLG L+FA
Sbjct: 248 GPATTIQFLGLEVDSVKGQCRLPVNKVTDLREEVRRMKQTKKPTLRQVQSLLGKLNFACR 307
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW------LNALPL-SSPIFPR 768
VIP GR+ SRR+ + H+ + V L W N + L +P
Sbjct: 308 VIPAGRVFSRRLALATAGATAPHHHVR-LGHEVRADLRVWEVFLRDFNGVVLFQAPEATA 366
Query: 769 QVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLL 824
Q + A +G+G+ + + + W E + + K E+F + A+ + L
Sbjct: 367 QEIQLFTDAAGSVGFGAYLAGQWCAEKWPPEWEKSGLVKNLAFLELFPIVVAMFVWEKEL 426
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGT---KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYN 881
Q+ ++ SDN +VV + + L V + F L+ I A+ + G N
Sbjct: 427 QNRSIVFISDNMSVVQGINNWSASSPPVLRLLRVLVLRCFRLN----IRCRARHVEGVKN 482
Query: 882 SVADSLSRSK 891
+AD+LSRS+
Sbjct: 483 VIADALSRSQ 492
>gi|326665097|ref|XP_003197966.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1413
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 158/369 (42%), Gaps = 43/369 (11%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P +AM +I E L+ G+++ S G + F V K +GG RP ++ + LN
Sbjct: 516 LSPPEQTAMETYIMEGLKAGIIRSSTSPAG--AGFFFVGKKDGGLRPCIDYRALN----- 568
Query: 528 KKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P LQK +DL AY V IK + A +
Sbjct: 569 -KVTIRNRYPLPLMATAFELLQKATIFSKLDLRNAYHLVRIKQGDEWKTAFNTPTGHYEY 627
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F +L N V + ++ V VYLDD L+ + + E + + L
Sbjct: 628 LVMPFGLTNAPAVFQALINDVLREMLNKF--VFVYLDDILIFSSSLQEHESHVRKVLRRL 685
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL- 700
V +K VL FLG + P +M P+ Q L W
Sbjct: 686 QENHLFVKPEKCEFHTTEVL-FLGFIIKPGQVQM-DPKKVQAVLD-----------WPAP 732
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT--PINPAVLPKLEWWLNA 758
S + + ++ FA+F + S + +L ++G+ ++ P A +L+ +
Sbjct: 733 TSVKEVQRFIGFANFYRKFVQNFSSVVAPLTALTKVGSARISWNPEAEAAFRELKRRFTS 792
Query: 759 LPLSSPIFPRQVQHFISTDASDLGWGSQVD----------SSFLSGLWSREQQNWHINKK 808
P+ + P ++ + DASD+G G+ + +FLS S ++N+H+ +
Sbjct: 793 APILTIPNP-ELPFVVEVDASDVGVGAVLSQRGKDNCLHPCAFLSHRLSSCERNYHVGDR 851
Query: 809 EMFAVHQAL 817
E+ AV AL
Sbjct: 852 ELLAVKLAL 860
>gi|326673727|ref|XP_003199969.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1411
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 157/368 (42%), Gaps = 41/368 (11%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P +AM +I E L+ G+++ S G + F V K +GG RP ++ + LN
Sbjct: 514 LSPPEQTAMETYIMEGLKAGIIRSSTSPAG--AGFFFVGKKDGGLRPCIDYRALN----- 566
Query: 528 KKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P LQK +DL AY V IK + A +
Sbjct: 567 -KVTIRNRYPLPLMATAFELLQKATIFSKLDLRNAYHLVRIKQGDEWKTAFNTPTGHYEY 625
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F +L N V + ++ V VYLDD L+ + + E + + L
Sbjct: 626 LVMPFGLTNAPAVFQALINDVLREMLNKF--VFVYLDDILIFSSSLQEHESHVRKVLKRL 683
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD 701
V +K VL FLG + P +M P+ Q L T
Sbjct: 684 QENHLFVKHEKCEFHTTEVL-FLGFIIKPGQVQM-DPKKVQAVLDWPAPT---------- 731
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT--PINPAVLPKLEWWLNAL 759
S + + ++ FA+F + S + +L ++G+ ++ P A +L+ +
Sbjct: 732 SVKEVQRFIGFANFYRKFVQNFSSVVAPLTALTKVGSARISWNPEAEAAFRELKRRFTSA 791
Query: 760 PLSSPIFPRQVQHFISTDASDLGWGSQVD----------SSFLSGLWSREQQNWHINKKE 809
P+ + P ++ + DASD+G G+ + +FLS S + N+H+ +E
Sbjct: 792 PILTIPNP-ELPFVVEVDASDVGVGAVLSQRGKDNCLHPCAFLSHRLSSCECNYHVGDRE 850
Query: 810 MFAVHQAL 817
+ AV AL
Sbjct: 851 LLAVKLAL 858
>gi|326668984|ref|XP_003198907.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1690
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 119/466 (25%), Positives = 190/466 (40%), Gaps = 65/466 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L++P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 329 PKGHLYSLSSPEREAMDKYIDESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 386
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 387 LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 446
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL AP F +L N +LR V VYLDD L+ + ++ + +
Sbjct: 447 YRVLPFGLTNAPAVFQALVN---DVLRDMVNQFVFVYLDDILIFSPSMQVHTQHVRQVLQ 503
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL---RTLLASK 696
L V +K A + FLG + ++ G I + A
Sbjct: 504 RLLENQLFVKAEKCVFH-AKSVSFLGFV---------------ISAGEIKADPSKVRAVA 547
Query: 697 TW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW 755
W DS ++L +L FA+F R S Q A L L +P + I + +
Sbjct: 548 EWPTPDSRKALQRFLGFANFYRRFIRNFS---QIAAPLTVLTSPKVPFIWGSKAQEAFDN 604
Query: 756 LNALPLSSPIF----PRQVQHFISTDASDLGWG------SQVDS-----SFLSGLWSREQ 800
L + +S+P+ P++ Q + DASD+G G SQ D +F S S +
Sbjct: 605 LKSRFISAPVLSIPDPKR-QFIVEVDASDVGVGAVLSQRSQRDEKVHPCAFFSHRLSPTE 663
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
+N+ I +E+ AV AL L+ +V +V +D+ K+L +S +
Sbjct: 664 RNYDIGNRELLAVRLALGEWRHWLEGAVQPFVVWTDH-------------KNLEYISTAK 710
Query: 859 KIFLLSQDW-----RIHILAQFIPGAYNSVADSLSRSKSLPDWHLS 899
++ W R + + PG+ N+ DSLSR S P+ +S
Sbjct: 711 RLSSRQARWSLYFSRFNFTLSYRPGSKNTKPDSLSRMFSAPEREVS 756
>gi|301607750|ref|XP_002933462.1| PREDICTED: hypothetical protein LOC100492542 [Xenopus (Silurana)
tropicalis]
Length = 983
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 155/366 (42%), Gaps = 29/366 (7%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+G + D+ A +PI L + G CLP G + + + F S ++
Sbjct: 333 RGALLAKSDIESASRLLPIHRDCYHLLGCQFEGQFYYDLCLPMGCSISCRYFECFSTFLE 392
Query: 604 SLLRSRGMR--VVVYLDDFLLVNQ-DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R V+ YLDDFL + + + ++ + G ++ +K+ P PV
Sbjct: 393 WVVRHETGHNSVIHYLDDFLFIGPPNTNVCQLLLSTFQFFMAKFGVPLSREKTE-GPVPV 451
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L FLGI D LP+DK L + + + +K L S +SLLG L FA ++P+
Sbjct: 452 LSFLGIEIDTVELVFRLPDDKLQRLKSTVAEITVAKKVTLRSMQSLLGLLVFACRIMPIA 511
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQVQH 772
R+ SRR+ ++ H I + L W L + + + ++
Sbjct: 512 RVFSRRLSLSTCGIK-QPHHFIRITRQLREDLTVWQTFLEQYNGHTCLMDTEVSNEELSL 570
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPL 823
F TDA+ GS + L+ W EQ NW ++ E+F + + +
Sbjct: 571 F--TDAA----GSTGFGAILAQSWCAEQWPDNWAPVGLCKNMTLLELFPIVVTVEIWGHR 624
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
+ + +DN +VV + + + SL +L+ + + L + I A+ +PG N
Sbjct: 625 ISGKKICFWTDNMSVVFAINKL-TSSSLPVLALLRHLVLRCLELNIWFRARHVPGRENFA 683
Query: 884 ADSLSR 889
AD+LSR
Sbjct: 684 ADALSR 689
>gi|301613474|ref|XP_002936224.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1086
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 111/446 (24%), Positives = 188/446 (42%), Gaps = 42/446 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P + HL+ P + AM +I+E LE G ++ S G + F V K +GG RP ++ +G
Sbjct: 178 PKGRIYHLSLPETQAMEEYIKENLERGFIRPSCSPAG--AGFFFVEKKDGGLRPCIDYRG 235
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN+ ++ L + ++ +DL AY + I+ + A +
Sbjct: 236 LNKITVKNRYPLPLISELFDRVKSATIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYE 295
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
+PFGL AP F L N + L R VVVYLDD L+ + + +S
Sbjct: 296 YLVMPFGLCNAPAVFQELVNDIFRDLLGRS--VVVYLDDILIYSNSLSDHRAHVQEVLSR 353
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L ++K + V FLG ++ K L L + + + L
Sbjct: 354 LRQHHLYAKIEKCIFEVSSV-HFLG----------YIISHKGLELDPVKVQAIVNWVQPL 402
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKL--EWWLN 757
S R++ +L FA++ + S + +L + GA P + PI KL E +++
Sbjct: 403 -SLRAIQRFLGFANYYRQFIKNFSTLVAPITALTKKGADPSIWPIEALTAFKLLKEAFVS 461
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
A L P + + DAS++G G+ + F S +S + N+ I
Sbjct: 462 AHVLLHP--DSALPFLLEVDASEIGAGAVLSQRHPVTNKIHPCGFFSKKFSPTEINYDIG 519
Query: 807 KKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ A+ A + LL+ + VV V +D++ ++ Y+ K L+ +F
Sbjct: 520 NRELLAIKLAFTEWRHLLEGAKHVVTVITDHKNLL-YIE---SAKRLNPRQARWALFFS- 574
Query: 865 QDWRIHILAQFIPGAYNSVADSLSRS 890
R + + + PG N AD+LSRS
Sbjct: 575 ---RFNFIITYRPGEKNVKADALSRS 597
>gi|301632042|ref|XP_002945100.1| PREDICTED: hypothetical protein LOC100486317 [Xenopus (Silurana)
tropicalis]
Length = 1166
Score = 87.0 bits (214), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 156/366 (42%), Gaps = 29/366 (7%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
KG + D+ A+ +PI L + G CLP G + + +AFA+
Sbjct: 562 KGALLAKSDIESAFRLLPIHPDCWHLLGFHFEGQFYYDCCLPMGCSLSCFYFEAFATFLE 621
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL------QKSS 654
WV S V+ YLDDFL + Q + +L + W+ + +
Sbjct: 622 WVVQF-ESGSNLVLHYLDDFLFIG------PAQSNHCLLLLKTFMWVAKKFGVPLSAEKT 674
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
+ P LQFLGI D LP DK L +++ LA+K L +SL+G+L+F +
Sbjct: 675 VFPCTSLQFLGIEIDTVKQEFRLPVDKLNRLKSLIEAALAAKKLKLKHIQSLVGHLNFTT 734
Query: 715 FVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW------LNALPLSSPIFP- 767
VIP+GR+ +RR+ + + H+ I V L W N F
Sbjct: 735 RVIPIGRVFNRRLISLTAGIANPNWHIR-IPQEVKDDLIIWQHFLRDFNGRAFWQDEFTG 793
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLW--SREQQNWHINKK--EMFAVHQALSLNLPL 823
V H S A+ +G+G+ S + W S +QN N E F + L +
Sbjct: 794 NSVLHLYSDAAASVGFGAIFLSHWCVDTWPISWHEQNLTSNLVLLEFFPILVVLEIWGEQ 853
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
L + ++ +DN VV ++ + S +L + ++ + I I A+ IPG N+V
Sbjct: 854 LANKRILWHTDNMGVV-FVLNNLSSNSPPVLRLLRQVVFRALRHNIWIKAKHIPGYKNNV 912
Query: 884 ADSLSR 889
AD+LSR
Sbjct: 913 ADALSR 918
>gi|301620195|ref|XP_002939468.1| PREDICTED: hypothetical protein LOC100492902 [Xenopus (Silurana)
tropicalis]
Length = 474
Score = 87.0 bits (214), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 154/365 (42%), Gaps = 29/365 (7%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ T Q L + G CLP G + + F + S ++
Sbjct: 91 GALMAKADVESAFRLLPVHTESQHLLGCYFKGSYYVDRCLPMGCSISCAYFEAFSTFLEW 150
Query: 605 LLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSLS 656
+ R R G+ ++ YLDDFL V L +L +L +V+ +
Sbjct: 151 VARKRAGVNTIIHYLDDFLCVGPG------NSGLCAVLLQTLQEVVDQFGVPLAGDKTEG 204
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA L+FLGI D LP DK L + L +K L +SL+G L+FA +
Sbjct: 205 PATCLKFLGIEIDTERQECRLPPDKVQLLKGEVEYALGAKKVTLKQLQSLIGRLNFACRI 264
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ-VQHFI 774
IPMGR+ +R + +L R H ++ + L W L + + RQ V+
Sbjct: 265 IPMGRVFARALAMATALAR-RPHHFIRLSQELKEDLMVWRVFLQDFNGRSYWRQEVRDNR 323
Query: 775 STDASDLGWGSQVDSSFLSGLWSRE--QQNWHINK-------KEMFAVHQALSLNLPLLQ 825
D G+ ++ SG W Q W +K E+F + A+ L L
Sbjct: 324 EIDLFTDAAGAGGFGAYYSGRWCAAPWPQEWAESKLISNFTFLELFPIVVAIELWGHRLA 383
Query: 826 SSVVMVQSDNQ-TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
+ V+ +DN T ++ G+K + L + L + + A+ +PGA N +A
Sbjct: 384 NKAVLFHTDNMATALAINNLTSGSKPVLRLLRHLVLRCLQIN--VSFRAKHLPGATNEIA 441
Query: 885 DSLSR 889
D+LSR
Sbjct: 442 DALSR 446
>gi|301627285|ref|XP_002942808.1| PREDICTED: hypothetical protein LOC100488967, partial [Xenopus
(Silurana) tropicalis]
Length = 731
Score = 86.7 bits (213), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 152/357 (42%), Gaps = 43/357 (12%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ + + L + G LP G + + F + S ++
Sbjct: 135 GALMAKTDIEAAFRLLAVHPDSLHLLGCQFGGSFYVDRSLPMGCSISCSYFETFSTFLEW 194
Query: 605 LLRSR-GM-RVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R + GM ++ YLDDFL + + I + + G + +K+ P+ +
Sbjct: 195 VIRQQSGMISIIHYLDDFLCIGPANSPACAILLQTVQRVTSEFGVPLAPEKTE-GPSTYM 253
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP DK L ++ ++SK L +SLLG L+FA +I MGR
Sbjct: 254 KFLGIEIDTVRQECRLPIDKVSALKENIQRAISSKKLTLKQLQSLLGKLTFACRIITMGR 313
Query: 722 LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDL 781
+ SRR+ S L G T N + +Q F TDA+
Sbjct: 314 VFSRRLAMATSGLT-GIVWTTDTN----------------------KDLQLF--TDAA-- 346
Query: 782 GWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPLLQSSVVMVQ 832
GS ++ SG W E+ ++W ++ E+F + A+ L L + V
Sbjct: 347 --GSCGFGAYFSGSWCAEKWPESWVAGGLTRNLTLLELFPILVAIELWGHLFSNRNVTFN 404
Query: 833 SDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+DN +VV + Q + S +L+ + + L I AQ +PG N +ADSLSR
Sbjct: 405 TDNMSVVLAINNQ-TSSSGPVLALLRHLVLRCLQSNICFRAQHLPGVVNDIADSLSR 460
>gi|301609174|ref|XP_002934156.1| PREDICTED: hypothetical protein LOC100488992 [Xenopus (Silurana)
tropicalis]
Length = 1158
Score = 86.7 bits (213), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 146/366 (39%), Gaps = 31/366 (8%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M +D+ A+ +P+ L + CLP G + + F + S ++
Sbjct: 775 GALMAKVDVESAFRLLPVHQESLHLLGCHFGEGYYVDRCLPMGCSISCAYFEAFSTFIEW 834
Query: 605 LLRS-RGMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQ 662
+++ G+ V+ YLDDFL V L V + L + + P L+
Sbjct: 835 VVKKWAGVNSVIHYLDDFLCVGPGNSTLCAVLLQTVQKVADLFGVPLAPDKTEGPTTCLR 894
Query: 663 FLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRL 722
FLGI D LP+DK L + +K L +SLLG L+FA +IPMGR+
Sbjct: 895 FLGIEIDTIRQECRLPQDKIQQLKVEVGYARTAKKVTLKQLQSLLGKLNFACRIIPMGRV 954
Query: 723 HSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW------LNA-LPLSSPIFPRQVQHFIS 775
SR + + +R H +N LE W N L S P + F +
Sbjct: 955 FSRNLAMATAGIRQ-PHHFIRLNAGHREDLEVWRVFLQDFNGRLYWQSQPRPNEELQFFT 1013
Query: 776 TDASDLGWGSQVDSSFLSGLWSREQQNWHINK-------KEMFAVHQALSLNLPLLQSSV 828
A G+G+ + + W W NK E+F + A+ L L++
Sbjct: 1014 DAAGSAGFGAYYAGRWCAAPWP---NAWGENKLTSNLAFLELFPIVVAVELWGAQLRNQS 1070
Query: 829 VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWR-----IHILAQFIPGAYNSV 883
V+ +DN +VV + +L+ S L R + A+ +PG N +
Sbjct: 1071 VVFFTDNMSVVMAI------TNLTSASRPVLKLLKHLVLRCLQLNVRFGAKHVPGHTNEI 1124
Query: 884 ADSLSR 889
ADSLSR
Sbjct: 1125 ADSLSR 1130
>gi|10946132|gb|AAG24792.1|AF264028_2 pol protein [Colletotrichum gloeosporioides]
Length = 1241
Score = 86.7 bits (213), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 109/454 (24%), Positives = 196/454 (43%), Gaps = 51/454 (11%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
A+ +++E L+ G ++ S G S + VPK NG R ++ + LN+ ++ L
Sbjct: 312 KALDKYLEENLKKGYIRESTSPAG--SPILFVPKKNGKLRLCVDYRMLNEMTIKNRYPLP 369
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ L ++ ++DL AY + +K + A +PFGL AP
Sbjct: 370 LIDELQRLLHGANWFTALDLKGAYNLIRMKEGEEWKTAFRTRKGHFEYLVMPFGLTNAPA 429
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F ++ N V L + + VVVYLDD L+ + + + L + L + +V +KS
Sbjct: 430 TFQNMINQV--LRKFVDIFVVVYLDDILIFSPTLKQHKEHVHLVLQALQNAKLLVEPEKS 487
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGY 709
A +++LG P + + +DK +R++ + W NL +S LG
Sbjct: 488 KFH-AQEVEYLGFTITP--GHIHMSKDK-------VRSI---QEWPTPTNLKEVQSFLGL 534
Query: 710 LSFA-SFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPLSSPIFP 767
++F F+ G++++ L R P K++ + + P+ P
Sbjct: 535 VNFYRKFIKYYGKINT----PLTDLSRKDQPFEWKEAQEIAFKKIKDRITSEPVLMIPNP 590
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSG------LWSRE----QQNWHINKKEMFAVHQAL 817
Q Q + DASD G+Q+ G +SR+ + N+ I+ KE+ A+ +A
Sbjct: 591 -QNQFEVEADASDFALGAQLSQRDSEGRLHPCAFFSRKLHGPELNYQIHDKELMAIIEAF 649
Query: 818 SLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQF 875
P L ++ V+V +D++ + + + K SE FL ++RI +
Sbjct: 650 KEWRPELSGTIHEVLVYTDHKNLAHFTTSKVLNKRQIRWSE----FLSEFNFRI----IY 701
Query: 876 IPGAYNSVADSLSRSKSLPDWHLSRSATEQIFLK 909
G+ N AD+LSR PD+++ Q+ LK
Sbjct: 702 RKGSENGRADALSRR---PDYNVPVPEETQVILK 732
>gi|301605558|ref|XP_002932423.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1406
Score = 86.3 bits (212), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 114/445 (25%), Positives = 190/445 (42%), Gaps = 54/445 (12%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I+E LE G ++ S G + F V K +GG RP ++ +GLN
Sbjct: 185 LSLPETQAMEEYIKENLERGFIRSSCSPAG--AGFFFVEKKDGGLRPCIDYRGLN----- 237
Query: 528 KKFSLINHFRIPSFLQ-----KGDYMIS-IDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
KF++ N + +P + KG + S +DL AY + I+ + A +
Sbjct: 238 -KFTVKNRYPLPLISELFDRVKGATIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEY 296
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F L N + L R VVVYLDD L+ + + +S L
Sbjct: 297 LVMPFGLCNAPAVFQELVNDIFRDLLGRS--VVVYLDDILIYSNSLSDHRAHVQEVLSRL 354
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD 701
++K + V FLG ++ K L + + + L
Sbjct: 355 RQHHLYAKIEKCIFEVSSV-HFLG----------YIISHKGLEMDPVKVQAIVYWVQPL- 402
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKL--EWWLNA 758
S R++ +L FA++ + S + +L + GA P + PI KL E +++A
Sbjct: 403 SLRAIQRFLGFANYYRQFIKNFSTLVAPITALTKKGADPSIWPIEALTAFKLLKEAFVSA 462
Query: 759 LPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINK 807
L P + + DAS++G G+ + F S +S + N+ I
Sbjct: 463 HVLLHP--DSALPFLLEVDASEIGAGAVLSQRHPVTNKIHPCGFFSKKFSPTEINYDIGN 520
Query: 808 KEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
+E+ A+ A + LL+ + VV V +D++ ++ Y+ K L+ +F
Sbjct: 521 RELLAIKLAFTEWRHLLEGAKHVVTVITDHKNLL-YIE---SAKRLNPRQARWALFFS-- 574
Query: 866 DWRIHILAQFIPGAYNSVADSLSRS 890
R + + + PG N AD+LSRS
Sbjct: 575 --RFNFIITYRPGEKNVKADALSRS 597
>gi|326669295|ref|XP_003198979.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1420
Score = 86.3 bits (212), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 135/535 (25%), Positives = 208/535 (38%), Gaps = 76/535 (14%)
Query: 387 EPPGR---VSLKVQTLQKPQRCSSPVN-PPADSRIGAEL-----VGGRLRRFVDAWIRLG 437
EPP + L + T+ KPQ +N PP SR+ E V + R
Sbjct: 444 EPPRHTKAIPLDIMTIPKPQIVPKSLNTPPEISRVPPEYSDLAEVFSKTR---------A 494
Query: 438 APAPLVRIVSGYAIPFSAKPPLVP-LCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTT 496
A P R Y P P P L L+ P +AM ++ E L++G ++ S
Sbjct: 495 ASLPPHR---PYDCPIDLLPGTCPPRGKLYSLSGPERAAMEKYVHESLDSGFIRPSTSPA 551
Query: 497 GFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQA 556
G + F V K +G RP ++ +GLN ++ L LQ +DL A
Sbjct: 552 G--AGFFFVGKKDGSLRPCIDYRGLNSITVKNRYPLPLMTTAFEILQGATIFTKLDLRSA 609
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVY 616
Y V I+ + A + +PFGLA AP F S N V L + V VY
Sbjct: 610 YHLVRIRQGDEWKTAFNTPTGHYEYQVMPFGLANAPAVFQSFINDV--LREMLNIFVFVY 667
Query: 617 LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIM-------WD 669
LDD L+ + +P I + + L G V L+KS + V FLG + D
Sbjct: 668 LDDILIFSHNPEEHVIHVRKVLIELLKHGLFVKLEKSEFHVSSV-SFLGFIVSKGSLQMD 726
Query: 670 PHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR 729
P R L + ++ + R +L FA+F R S +
Sbjct: 727 PSKTRAVLDWPQPTSIKEVQR------------------FLGFANFYRRFIRNFSSIAEP 768
Query: 730 QASLLRLGAPHLTPINPA--VLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG--- 784
SL + T + A L+ + P+ + P ++ + DASD+G G
Sbjct: 769 LTSLTKKANTPFTWNDKASTAFNTLKHRFTSAPILTLPDP-ELPFILEVDASDIGVGAVL 827
Query: 785 ---SQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ--SSVVMVQSD 834
S+ D+ +F S + Q N+ I +E+ A+ AL L+ S ++ +D
Sbjct: 828 SQRSKADNKLHPCAFYSHRLTPTQANYDIGNRELLAIKLALEEWRHWLEGASHPFLIWTD 887
Query: 835 NQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+Q ++Y++ K L+ +F R F PG+ N D+LSR
Sbjct: 888 HQN-LTYIQ---NAKRLNARQARWSLFFN----RFKFTLSFRPGSKNIKPDALSR 934
>gi|307180652|gb|EFN68577.1| Transposon Ty3-G Gag-Pol polyprotein [Camponotus floridanus]
Length = 152
Score = 85.9 bits (211), Expect = 1e-13, Method: Composition-based stats.
Identities = 47/152 (30%), Positives = 83/152 (54%), Gaps = 1/152 (0%)
Query: 424 GRLRRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
GRL+ F + W+++ + + + GY IPF ++P + + + + I ++
Sbjct: 1 GRLKAFSNVWLKVTKDPVIRQWIQGYKIPFMSRPTQMSCPVERTWSDKEKLLLDKQINKL 60
Query: 484 LETGVLKRLDSTTG-FLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFL 542
++ G + R + G FLS +FLVPK +G R +LNLK L++F++ + F + + +
Sbjct: 61 IDKGAIVRCFPSQGQFLSNIFLVPKPDGTHRLILNLKKLSEFVAAEHFKIEDWKVAKRLI 120
Query: 543 QKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
DYM ++DL AY+ VPIK ++FL SY
Sbjct: 121 GPHDYMATLDLKDAYYLVPIKKMDRKFLRFSY 152
>gi|301603809|ref|XP_002931558.1| PREDICTED: hypothetical protein LOC100494839 [Xenopus (Silurana)
tropicalis]
Length = 902
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 150/354 (42%), Gaps = 37/354 (10%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
+G + D+ A+ +PI L +++ CLP G + + + F+S
Sbjct: 547 RGALLAKSDIESAFRLLPIHPDCFHLLGITFANLYFVDMCLPMGCSISCYYFELFSSFLE 606
Query: 601 WVASLLRSRGMRVVVYLDDFLLVN-----QDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
WV + + ++ ++ YLDDFL V + R+L L ++ + G + K+
Sbjct: 607 WVVTQV-AQSNSMLHYLDDFLFVGPAISPECARLLH----LFKEVMKNFGVPIAKDKTE- 660
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
P V+ FLGI D LP +K +L L L +K L +SLLG+L+F S
Sbjct: 661 GPQEVIVFLGIEIDSREMVFRLPLEKLESLSQSLDRALMAKKLTLKQIQSLLGHLTFVSR 720
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIS 775
++PMGR RR+ ++ H + + L W N + SP +
Sbjct: 721 IMPMGRAFCRRLSLSTKGIKY-PNHYIRMTKHIKDDLRIWRNI--MVSP---------VG 768
Query: 776 TDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDN 835
A +L W + L+ + W + K A H + LL + V+ DN
Sbjct: 769 RSAKNLIWNWNCLLTLLAA------KGWELIFKGNVASH----IWGELLANQRVIFWCDN 818
Query: 836 QTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+VV + Q + S +L+ + + L I A+ +PG NS+AD+LSR
Sbjct: 819 SSVVQVINNQTSS-SPPVLNLLRALVLQCLRMNIWFRARHVPGVQNSIADALSR 871
>gi|294901157|ref|XP_002777263.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239884794|gb|EER09079.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 2220
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 109/457 (23%), Positives = 195/457 (42%), Gaps = 55/457 (12%)
Query: 461 PLCS-LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
P+C ++ + +S ++EM E GV++R ST+ + VPK NG R ++ +
Sbjct: 742 PICERIRPIPHKYRDEISALLKEMEELGVIRR--STSAWRFPCVFVPKKNGKVRMCIDYR 799
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD-- 577
LN+ + + + + L ++DL Y+ +P++ QR A
Sbjct: 800 NLNKACHTEAYPVPRPDDVQEHLAGARVFSTLDLRSGYWQIPVRKEDQRKTAFCPGPGFP 859
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ-DPRILEIQGKL 636
+ +PFGLA+AP F L + + L V VYLDD L+ ++ D LE ++
Sbjct: 860 LYEWVMMPFGLASAPATFQRLMDAILGHLPF----VRVYLDDVLIFSRSDEEHLE-HLRI 914
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD-----PHLDRMWLPEDKQLTLGNILRT 691
+L + G V +K V +LG M++ P L + + ILR
Sbjct: 915 VFELLRATGMTVAAEKCEFMQDRVT-YLGHMFNSTGMSPDLGKAEV----------ILRW 963
Query: 692 LLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR------QASLLRLGAPHLTPIN 745
L L RS LG + +P SR + + ++ LG
Sbjct: 964 PLPRTAPAL---RSFLGLAGYYRNFVPHFADKSRCLYEIVNYCTKNKVVELGN-QWGKEE 1019
Query: 746 PAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSRE 799
L+ W+ +LP L+ P F R Q + TDASD+ G+ V+ +F S +
Sbjct: 1020 ELAFNDLKQWIASLPLLAYPDFSRPFQ--LMTDASDVAIGAVVEQDGRPLAFFSQSLTPT 1077
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQS-SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
Q+ W + ++E +A+ +AL PLL + +V + + +++ + ++V+
Sbjct: 1078 QKVWPVYEREAYAIFKALERFRPLLWGYHLELVVFSDHKPLEWIQ-------TATTAKVQ 1130
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ + ++ + + G +N VAD+LSR + D
Sbjct: 1131 RWLISMSQFKFKVF--YKKGKHNVVADALSRITTSDD 1165
>gi|294893282|ref|XP_002774394.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239879787|gb|EER06210.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 778
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 109/457 (23%), Positives = 195/457 (42%), Gaps = 55/457 (12%)
Query: 461 PLCS-LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
P+C ++ + +S ++EM E GV++R ST+ + VPK NG R ++ +
Sbjct: 268 PICERIRPIPHKYRDEISALLKEMEELGVIRR--STSAWRFPCVFVPKKNGKVRMCIDYR 325
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD-- 577
LN+ + + + + L ++DL Y+ +P++ QR A
Sbjct: 326 NLNKACHTEAYPVPRPDDVQEHLAGARVFSTLDLRSGYWQIPVRKEDQRKTAFCPGPGFP 385
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ-DPRILEIQGKL 636
+ +PFGLA+AP F L + + L V VYLDD L+ ++ D LE ++
Sbjct: 386 LYEWVMMPFGLASAPATFQRLMDAILGHLPF----VRVYLDDVLIFSRSDEEHLE-HLRI 440
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD-----PHLDRMWLPEDKQLTLGNILRT 691
+L + G V +K V +LG M++ P L + + ILR
Sbjct: 441 VFELLRAAGMTVAAEKCEFMQDRV-TYLGHMFNSTGMSPDLGKAEV----------ILRW 489
Query: 692 LLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR------QASLLRLGAPHLTPIN 745
L L RS LG + +P SR + + ++ LG
Sbjct: 490 PLPRTAPAL---RSFLGLAGYYRNFVPHFADRSRCLYEIVNYCTKNKVVELGN-QWGKEE 545
Query: 746 PAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSRE 799
L+ W+ +LP L+ P F R Q + TDASD+ G+ V+ +F S +
Sbjct: 546 ELAFNDLKQWIASLPLLAYPDFSRPFQ--LMTDASDVAIGAVVEQDGRPLAFFSQSLTPT 603
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQS-SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
Q+ W + ++E +A+ +AL PLL + +V + + +++ + ++V+
Sbjct: 604 QKVWPVYEREAYAIFKALERFRPLLWGYHLELVVFSDHKPLEWIQ-------TATTAKVQ 656
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ + ++ + + G +N VAD+LSR + D
Sbjct: 657 RWLISMSQFKFKVF--YKKGKHNVVADALSRITTSDD 691
>gi|345484016|ref|XP_003424926.1| PREDICTED: hypothetical protein LOC100677975 [Nasonia vitripennis]
Length = 791
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 133/272 (48%), Gaps = 11/272 (4%)
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
V++ LG + + LP++K++ + ++ LL + + +G L A +
Sbjct: 341 VVKILGFNINSADMTLELPQEKRVKIREMIDILLKMERVKVKVIAKCIGVLVAACPAVAY 400
Query: 720 GRLHSRR---IQRQASLLRLGAPHLTP---INPAVLPKLEWWLNALPLS-SPIFPRQVQH 772
G L+ + I+R A LR + ++ +L+WW + + ++ + I
Sbjct: 401 GWLYYKHLELIKRNA--LRSNFKRMDKWITLSLEAKEELKWWQSQILIAKNKIRSSNFDL 458
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQ 832
I ++AS GWG+ + +G W+RE++ HIN E+ A AL + ++++
Sbjct: 459 EIFSNASTTGWGAICGNKKANGFWNREEREIHINFLEIKAAFLALKCFAAHSLNKQILLR 518
Query: 833 SDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
DN T ++Y+ + GG K L + + I+ + I I A++I N +AD SR +
Sbjct: 519 IDNITALAYINKMGGIKHKELHALTKVIWEWCIEREIWIFAEYIASKEN-IADEGSRITN 577
Query: 893 L-PDWHLSRSATEQIFLKWGVPCIDLFASRVS 923
+ +W L+ A ++I ++G P IDLFASRV+
Sbjct: 578 VDTEWELANFAFQKIVKEFGYPSIDLFASRVN 609
>gi|301617191|ref|XP_002938030.1| PREDICTED: hypothetical protein LOC100496391 [Xenopus (Silurana)
tropicalis]
Length = 629
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 147/365 (40%), Gaps = 51/365 (13%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ L + G LP G + + F + S ++
Sbjct: 233 GALMAKTDIEAAFRLLPVHPESLHLLGCQFGGSFYIDRSLPMGCSISCSYFETFSTFLEW 292
Query: 605 LLRSR-GMRVVV-YLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R + GM ++ YLDDFL + + I + + G + K+ P+ +
Sbjct: 293 VIRQQSGMDSIIHYLDDFLCIGPANSPACAILLQTVQGVTTEFGVPLAPDKTE-GPSTCI 351
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP DK L ++ L SK L +SLLG L+ A +I MGR
Sbjct: 352 KFLGIEIDTVRQECRLPIDKIGALREDIQRALTSKKLTLKQLQSLLGKLTLACRIISMGR 411
Query: 722 LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL--------SSPIFPRQVQHF 773
+ SRR+ S L+ H + + L W L + +Q F
Sbjct: 412 VFSRRLAMATSGLK-KPHHFVRLRAELKADLGIWAKFLEAYNGRSYWQKTADTNNDLQLF 470
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPLL 824
TDA+ GS +FLSG W E+ + W ++ E+F + A+ L L
Sbjct: 471 --TDAA----GSCGFGAFLSGNWCVEKWPEGWVEGGLTRNVTLLELFPILVAIELWGQWL 524
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
+ V+ SDN +VV + Q + AQ +PG N VA
Sbjct: 525 SNRKVIFNSDNMSVVLAINNQTSSS-----------------------AQHLPGVVNDVA 561
Query: 885 DSLSR 889
DSLSR
Sbjct: 562 DSLSR 566
>gi|326669453|ref|XP_003199018.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1375
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 116/473 (24%), Positives = 187/473 (39%), Gaps = 75/473 (15%)
Query: 452 PFSAKPPLVPLCS-----LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVP 506
P+ LVP S L L+ P +AM ++ E L++G ++ S G + F V
Sbjct: 449 PYDCSIELVPGASPPRGRLYSLSIPERTAMEKYLNEALDSGFIRPSTSPAG--AGFFFVS 506
Query: 507 KGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTH 566
K +G RP ++ +GLN ++ L LQ +DL AY V IK
Sbjct: 507 KKDGSLRPCIDYRGLNHITIKNRYPLPLMNTAFEILQGATIFTKLDLRNAYHLVRIKEGD 566
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+ A + +PFGL AP F + N V + +R V VYLDD L+ +
Sbjct: 567 EWKTAFNTPTGHYEYQVMPFGLVNAPAVFQAFINDVLREMLNRF--VFVYLDDILIFSSS 624
Query: 627 PRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG 686
+ +S L V L+KS + V FLG + + + L Q+ G
Sbjct: 625 YEEHVQHVRQVLSQLLRHRLFVKLEKSEFHVSKV-SFLGFI----VSKCSL----QMDPG 675
Query: 687 NILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
I L + ++ + LG+ +F RR R S + A LT +
Sbjct: 676 KIKAVLDWPQPCSVKEVQRFLGFANFY-----------RRFIRGFSSI---AEPLTALTK 721
Query: 747 AVLPKLEWW---------LNALPLSSPIFPR---QVQHFISTDASDLGWG------SQVD 788
W L +L S+PI ++ + DASD+G G S+ D
Sbjct: 722 KTAKSFVWTEMANKAFNRLKSLFTSAPILALPDPELPFVVEVDASDIGIGAVLSQRSKTD 781
Query: 789 S-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSY 841
+ ++LS + Q+N+ I +E+ AV AL L+ + ++ +D+
Sbjct: 782 NKLHPCAYLSHRLTPAQRNYDIGNRELLAVKVALEEWRHWLEGAKHPFLIWTDH------ 835
Query: 842 LRRQGGTKSLSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
K+L+ + E +++ W R + PG+ NS D+LSR
Sbjct: 836 -------KNLTYIREAKRLNSRQARWALFFNRFDFTLSYRPGSKNSKPDALSR 881
>gi|294936925|ref|XP_002781915.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239893039|gb|EER13710.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 1814
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 109/436 (25%), Positives = 184/436 (42%), Gaps = 53/436 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++LE G+++ ST+ + S VPK NG R ++ + LN ++L +
Sbjct: 728 VDKLLEEGMIR--PSTSPYRSPCVYVPKKNGSVRMCIDFRALNALTEVDSYTLPRPDDVQ 785
Query: 540 SFLQKGDYMISIDLSQAYFHVPIK--TTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
L ++DL Y+ ++ H+ + +PFGL +A F
Sbjct: 786 EHLAGSKVFSTLDLQSGYWQCLLRPQDIHKTAFCPGPGFPLYEWVRMPFGLCSAGATFQR 845
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
L + V + L V VYLDD L+ + D E + + L + G ++ +K
Sbjct: 846 LMDQVLNGLPF----VRVYLDDILVFSPDAETHEDHLRQVFARLRAWGLTLSAEKCEFG- 900
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
P + +LG ++D + R P+ ++ ILR + N+ RS LG + +
Sbjct: 901 CPSVPYLGHIFDGNGMR---PDPTKVE--AILRW---PRPGNVAEIRSFLGLAGYYRNFV 952
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLT------PINPAVLPKLEWWLNALP-LSSPIFPRQV 770
P +R IQR S +G+ L L+ L ALP L+ P F +
Sbjct: 953 PNFSDVARPIQRLVS--EVGSETLALDTYWGQEQEESFRALKLRLAALPFLAYPDF--GI 1008
Query: 771 QHFISTDASDLGWGSQVDSS-----FLSGLWSREQQNWHINKKEMFAVHQAL------SL 819
+ TDASD G+ + F S + Q NWH +KE + + QAL +
Sbjct: 1009 PFELYTDASDYAIGAVLMQEGRPLGFFSRTLTGSQLNWHTYEKEAYGILQALIYFQHYHI 1068
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
PL V V +D++ +++L + G K +E+ L Q + + +++PG
Sbjct: 1069 GYPL----TVTVYTDHEP-LTWLAKAGSKK-------LERWLLAMQAY--SFIVKYVPGK 1114
Query: 880 YNSVADSLSRSKSLPD 895
N AD+LSR + L D
Sbjct: 1115 KNVCADALSRIRQLDD 1130
>gi|9627997|ref|NP_056848.1| aspartic protease/reverse transcriptase [Cassava vein mosaic virus]
gi|81945490|sp|Q89703.1|POL_CSVMV RecName: Full=Putative enzymatic polyprotein; Includes: RecName:
Full=Protease; Short=PR; Includes: RecName: Full=Reverse
transcriptase; Short=RT; Includes: RecName:
Full=Ribonuclease H
gi|665934|gb|AAA79873.1| ORF III [Cassava vein mosaic virus]
gi|1399884|gb|AAB03327.1| ORF 3 [Cassava vein mosaic virus]
Length = 652
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 102/467 (21%), Positives = 191/467 (40%), Gaps = 62/467 (13%)
Query: 456 KPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR-----LFLVPK--- 507
KPP++ Q P +HI+EM++ G + + T F + F+V K
Sbjct: 206 KPPML----YQETDLP---EFKMHIEEMIKEGFI---EEKTNFEDKKYSSPAFIVNKHSE 255
Query: 508 -GNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTH 566
G TR V++ K LN+ K+ + N + + Y D ++H+ ++
Sbjct: 256 QKRGKTRMVIDYKDLNKKAKVVKYPIPNKDTLIHRSIQARYYSKFDCKSGFYHIKLEEDS 315
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+++ A + LPFG +P F ++ + R ++VY+DD L+ ++
Sbjct: 316 KKYTAFTVPQGYYQWKVLPFGYHNSPSIFQQ---FMDRIFRPYYDFIIVYIDDILVFSKT 372
Query: 627 PRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD-------PHLDRMWLPE 679
+I I + G I++ +K+ L + FLG+ + PH+ L +
Sbjct: 373 IEEHKIHIAKFRDITLANGLIISKKKTELCKEKI-DFLGVQIEQGGIELQPHIINKILEK 431
Query: 680 DKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP 739
++ L+++L L+ R + +L A ++P IQ++ +
Sbjct: 432 HTKIKNKTELQSILGL----LNQIRHFIPHL--AQILLP--------IQKKLKIKDEEIW 477
Query: 740 HLTPINPAVLPKLEWWLNAL--PLSSPIFPRQVQHFISTDASDLGWGSQVDSS------- 790
T + + ++ + L + PI + I DAS+ +GS +
Sbjct: 478 TWTKEDEEKIKLIQDYSKNLVIKMKYPINKEDMNWIIEVDASNNAYGSCLKYKPKNSKIE 537
Query: 791 ----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG 846
+ SG + +Q + IN+KE+ AV+Q L +V++DN V +++
Sbjct: 538 YLCRYNSGTFKENEQKYDINRKELIAVYQGLQSYSLFTCEGNKLVRTDNSQVYYWIKNDT 597
Query: 847 GTKSLSLLSEVEKI-FLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
KS+ E I +LL++ + Q I G N +AD LSR S
Sbjct: 598 NKKSI----EFRNIKYLLAKIAVYNFEIQLIDGKTNIIADYLSRYNS 640
>gi|294954182|ref|XP_002788040.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239903255|gb|EER19836.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 1233
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 109/436 (25%), Positives = 184/436 (42%), Gaps = 53/436 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++LE G+++ ST+ + S VPK NG R ++ + LN ++L +
Sbjct: 275 VDKLLEEGMIR--PSTSPYRSPCVYVPKKNGSVRMCIDFRALNALTEVDSYTLPRPDDVQ 332
Query: 540 SFLQKGDYMISIDLSQAYFHVPIK--TTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
L ++DL Y+ ++ H+ + +PFGL +A F
Sbjct: 333 EHLAGSKVFSTLDLQSGYWQCLLRPQDIHKTAFCPGPGFPLYEWVRMPFGLCSAGATFQR 392
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
L + V + L V VYLDD L+ + D E + + L + G ++ +K
Sbjct: 393 LMDQVLNGLPF----VRVYLDDILVFSPDAETHEDHLRQVFARLRAWGLTLSAEKCEFG- 447
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
P + +LG ++D + R P+ ++ ILR + N+ RS LG + +
Sbjct: 448 CPSVPYLGHIFDGNGMR---PDPTKVE--AILR---WPRPGNVAEVRSFLGLAGYYRNFV 499
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLT------PINPAVLPKLEWWLNALP-LSSPIFPRQV 770
P +R IQR S +G+ L L+ L ALP L+ P F
Sbjct: 500 PNFSDVARPIQRLVS--EVGSETLALDTYWGQEQEESFRALKLRLAALPFLAYPDFGTPF 557
Query: 771 QHFISTDASDLGWGSQVDSS-----FLSGLWSREQQNWHINKKEMFAVHQAL------SL 819
+ + TDASD G+ + F S + Q NWH +KE + + QAL +
Sbjct: 558 ELY--TDASDYAIGAVLMQEGRPLGFFSRTLTGSQLNWHTYEKEAYGILQALIYFQHYHI 615
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
PL V V +D++ +++L + G K +E+ L Q + + +++PG
Sbjct: 616 GYPL----TVTVYTDHEP-LTWLAKAGSKK-------LERWLLAMQAY--SFIVKYVPGK 661
Query: 880 YNSVADSLSRSKSLPD 895
N AD+LSR + L D
Sbjct: 662 KNVCADALSRIRQLDD 677
>gi|391325581|ref|XP_003737311.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1605
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 147/338 (43%), Gaps = 42/338 (12%)
Query: 389 PGRVSLKVQTLQKPQRCSSPVNPPADSRIG-AELVGGRLRRFVDAWIRLGAPAPLVRIVS 447
P +S ++ T+Q+ Q S DS G + + R VD +LG +
Sbjct: 407 PKLISRRITTVQENQSNSE------DSDTGLRDKIQKRFPSIVDG--KLGK-------CT 451
Query: 448 GYAIPFSAKPPLVPLCSLQH-LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVP 506
Y + K P+ +A + A++ I+ + TGV+++++S+ F + + +V
Sbjct: 452 KYEAKITLKKSSTPIFKKARPVAYAILPAIAAEIERLESTGVIEKVNSS-AFAAPVVVVR 510
Query: 507 KGNGGTRPVLNLK-GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTT 565
K +GG R + GLN + + L I S L G + IDLS+AY +P+
Sbjct: 511 KSSGGIRLCADYSTGLNTAIEDDNYPLPTAEDIFSTLNGGTWFSKIDLSEAYLQIPVDAE 570
Query: 566 HQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
Q+ L ++ + M LPFG+ TAP F L + + + L YLDD ++ ++
Sbjct: 571 SQKLLTINTPKGLYRMKRLPFGIKTAPSIFQRLMDTLVADLEG----TTAYLDDIIVTSK 626
Query: 626 -----DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
+ RIL++ G+LA G L K S + + Q+LG + R PE
Sbjct: 627 TKAEHENRILKLFGRLA-----EFGLKAQLNKCSFMKSQI-QYLGFILSKE-GRKPDPE- 678
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
I + K N+ R+ LG ++F + +P
Sbjct: 679 ------RIQPIVALQKPTNISQLRAFLGMITFYNNFVP 710
>gi|294892413|ref|XP_002774051.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239879255|gb|EER05867.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 1024
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 111/457 (24%), Positives = 193/457 (42%), Gaps = 55/457 (12%)
Query: 461 PLCS-LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
P+C ++ + +S ++EM E GV++R ST+ + VPK NG R ++ +
Sbjct: 257 PICERIRPIPHKYRDEISALLKEMEELGVIRR--STSAWRFPCVFVPKKNGKVRMCIDYR 314
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD-- 577
LN+ + + + + L ++DL Y+ +P++ QR A
Sbjct: 315 NLNKACHTEAYPVPRPDDVQEHLAGARVFSTLDLRSGYWQIPVRKEDQRKTAFCPGPGFP 374
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ-DPRILEIQGKL 636
+ +PFGLA+AP F L + + L V VYLDD L+ ++ D LE ++
Sbjct: 375 LYEWVMMPFGLASAPATFQRLMDAILEHLPF----VRVYLDDVLIFSRSDEEHLE-HLRI 429
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD-----PHLDRMWLPEDKQLTLGNILRT 691
+L + G V +K V +LG M++ P L + + ILR
Sbjct: 430 VFELLRAAGMTVAAEKCEFMQDRV-TYLGHMFNSTGMSPDLGKAEV----------ILRW 478
Query: 692 LLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR------QASLLRLGAPHLTPIN 745
L L RS LG + +P SR + + ++ LG
Sbjct: 479 PLPRTAPAL---RSFLGLAGYYRNFVPHFADRSRCLYEIVNYCTKNKVVELGN-QWGKEE 534
Query: 746 PAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSRE 799
L+ W+ +LP L+ P F R Q + TDASD+ G+ V+ +F S +
Sbjct: 535 ELAFNDLKQWIASLPLLAYPDFSRPFQ--LMTDASDVAIGAVVEQDGRPLAFFSQSLTPT 592
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQS-SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
Q+ W + ++E +AV +AL PLL + +V + + +++ K V+
Sbjct: 593 QRVWPVYEREAYAVFKALERFRPLLWGYHLELVVFSDHKPLEWIQTATTAK-------VQ 645
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ + ++ + + G +N VAD+LSR + D
Sbjct: 646 RWLISMSQFKFKVF--YKKGKHNVVADALSRITTSDD 680
>gi|301609203|ref|XP_002934174.1| PREDICTED: hypothetical protein LOC100494971 [Xenopus (Silurana)
tropicalis]
Length = 1899
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 108/465 (23%), Positives = 188/465 (40%), Gaps = 94/465 (20%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I+E LE G ++ S G + F V K +GG RP ++ +GLN+
Sbjct: 367 LSLPETQAMEEYIKENLERGFIRPSSSPAG--AGFFFVEKKDGGLRPCIDYRGLNKITVK 424
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L ++ ++ +DL AY + I+ + A + +PFG
Sbjct: 425 NRYPLPLISKLFDRVKGATVFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFG 484
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD-----PRILEIQGKL------ 636
L AP F L N++ L R VVVYLDD L+ + + + E+ +L
Sbjct: 485 LCNAPAVFQELVNYIFRDLLGRT--VVVYLDDILIYSNNLSDHRAHVQEVLFRLRQNHLY 542
Query: 637 --------AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI 688
VS + LG+I++ + + PA V L W+ + L+L I
Sbjct: 543 AKIEKCVFEVSSVHFLGYIISKRGLEMDPAKVQAILD----------WV---QPLSLRAI 589
Query: 689 LRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAV 748
R +L FA++ + +S + +L + GA A+
Sbjct: 590 QR------------------FLGFANYYRQFIKNYSSLMAPITALTKRGADPTMWAEEAL 631
Query: 749 LPKLEWWLNALPLSSPIFPRQVQH-------FISTDASDLGWGSQVD-----------SS 790
L + L +S+P+ +QH + DAS++G G+ +
Sbjct: 632 LAFKK--LKEAFISAPV----LQHPDTTLPFLVEVDASEIGAGAVLSQRHPVTNKVHPCG 685
Query: 791 FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKS 850
+ S +S + N+ I+ +E+ A+ A LL+ + MV TV + K+
Sbjct: 686 YFSKKFSPTETNYDIDNRELLAIKLAFEEWRHLLEGAKHMV-----TVFT------DHKN 734
Query: 851 LSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSRS 890
L + +++ W R + + + PG N AD+LSRS
Sbjct: 735 LLYIESAKRLNPRQARWALFFSRFNFIITYRPGDKNVKADALSRS 779
>gi|301618391|ref|XP_002938610.1| PREDICTED: hypothetical protein LOC100494344 [Xenopus (Silurana)
tropicalis]
Length = 795
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 160/371 (43%), Gaps = 39/371 (10%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
+G + D+ A+ +PI L + + CLP G + + + F+S
Sbjct: 408 RGALLAKSDIESAFRLLPIHPDCFHLLGIKFANLYFVDMCLPMGCSISCYYFELFSSFLE 467
Query: 601 WVASLLRSRGMRVVVYLDDFLLVN-----QDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
WV + + ++ ++ YLDDFL V + R+L L + ++ + G + K+
Sbjct: 468 WVVTQV-AQSNSMLHYLDDFLFVGPANSPECARLLH----LFMEVMENFGVPIAKDKTE- 521
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
P V+ FLGI D LP +K +L +L L +K L +SLLG+L+FAS
Sbjct: 522 GPQEVIVFLGIEIDSREMVFRLPLEKLESLSQLLDRALMAKKLTLKQIQSLLGHLTFASR 581
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF-- 773
++PMGR+ RR+ ++ H + + L W L + Q+
Sbjct: 582 IMPMGRVFCRRLSLSTKGIKY-PNHYIRMTKHIKDDLRIWQKFLAEYNGQSCWQISEKSN 640
Query: 774 ----ISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLN 820
+ TDA+ GS+ ++ G W Q W ++ E+F + A +
Sbjct: 641 LELELFTDAA----GSKGMGAYFQGQWCSAQWPSFWRDTDLIRNLTCLELFPIVVASHIW 696
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL--AQFIPG 878
LL + V+ DN +VV + Q S S L+ Q R++I A+ +PG
Sbjct: 697 GELLANQRVIFWCDNSSVVQVINNQ---TSSSPPVLNLLRVLVLQCLRMNIWFRARHVPG 753
Query: 879 AYNSVADSLSR 889
NS+AD+LSR
Sbjct: 754 VQNSIADALSR 764
>gi|291236769|ref|XP_002738310.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 981
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 91/401 (22%), Positives = 175/401 (43%), Gaps = 47/401 (11%)
Query: 521 LNQFLSPKKFSLINHFRIPSF------LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
LN+ + + FS+ ++ RI L +M D++ A+ +P+ + + +
Sbjct: 565 LNELIDKETFSM-SYIRIDDAFAEIRRLNGNTHMCKFDITDAFKQIPLHPSIWHLHGVKW 623
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSN---WVASLLRSRGMRVVVYL-DDFLLVNQDPRIL 630
+ L FG ++P F LS W+ ++ G++ +++L DDFL ++Q+
Sbjct: 624 DDKYYFFIRLVFGSRSSPNIFDLLSQAICWI--VIHKFGVKFILHLLDDFLTIDQEEETA 681
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
+ I +L + K+ L P+ L++LG+M + +LP +K + +
Sbjct: 682 MRSMAVMTMIFNTLNIPLAAHKT-LGPSQELEYLGVMINSKDMLAFLPANKISRIKEKIS 740
Query: 691 TLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
+ K+ SLLG+L+FA+ V+ GR + R AS + H++ +N A
Sbjct: 741 EFTSKKSITKRQLLSLLGHLNFAARVVLPGRSFIAHLLRLASSVSKLTHHVS-LNQACRL 799
Query: 751 KLEWW------LNALPL--SSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQN 802
+L W N + L +S + H + +S G+G S+ W E
Sbjct: 800 ELSMWKLFMDSWNGIHLFMNSGVISASDLHIYTDASSTKGFGGFFQGSWFCDKWPEEL-- 857
Query: 803 WHINKKEMFAVHQALSLNL----PLLQSSV----------VMVQSDNQTVVSYLRR-QGG 847
+F LS+ L P++ +++ + DNQ V+ +R+ +
Sbjct: 858 -------IFEQRDELSMALLELYPIVIAAMLWGQHWSKQRICFNCDNQATVNIIRKGRAA 910
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ ++ + + ++ L + ILAQF+PG NS+AD+LS
Sbjct: 911 SHCYAINTLMRRLTLTAMQHNFIILAQFLPGKQNSIADALS 951
>gi|301614953|ref|XP_002936952.1| PREDICTED: hypothetical protein LOC100493659 [Xenopus (Silurana)
tropicalis]
Length = 1084
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 89/352 (25%), Positives = 145/352 (41%), Gaps = 56/352 (15%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
KG + D+ A+ +PI + L + G CLP G + + + F S ++
Sbjct: 751 KGALLAKSDIESAFHLLPIHSDCYHLLGCQFEGQFYYDMCLPMGCSISCRYFECFSTFLE 810
Query: 604 SLLRSR-GMRVVV-YLDDFLLVNQ-DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R G + YLDDFL + + + ++ + G ++ +K+ P V
Sbjct: 811 WVVRQETGYNSAIHYLDDFLFIGPPNTNVCQLLLSTFQFFMARFGVPLSKEKTE-GPITV 869
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L FLGI D LP DK L + + + A+K L S +SLLG L FA ++P+
Sbjct: 870 LSFLGIEIDTVALVFRLPVDKLQKLKSTVAEITAAKKVTLCSMQSLLGLLVFACRIMPIA 929
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASD 780
R+ SRR+ L+ + PH HFI
Sbjct: 930 RVFSRRLSLLGKLVGIKQPH-------------------------------HFIRI-TKQ 957
Query: 781 LGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS---VVMVQSDNQT 837
L +V +FL EQ N H + ++ LSL S+ ++ Q
Sbjct: 958 LREDLKVWQTFL------EQYNGHTCLMDTEVSNEELSLFTDAAGSTGFGAILAQ----- 1006
Query: 838 VVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S+L + SL +L+ + + L ++ I + AQ +PG N+ AD+LSR
Sbjct: 1007 --SWLT----SSSLPVLALLRHLVLRCLEFNIWLSAQHVPGRVNTSADALSR 1052
>gi|326680633|ref|XP_003201579.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1165
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 122/456 (26%), Positives = 184/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 359 PKGKLYSLSIPEREAMEKYISDSLAAKIIQPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 416
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V +K H+ A
Sbjct: 417 LNSITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRMKQGHEWKTAFLTPRGHFE 476
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ ++ + + +
Sbjct: 477 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSRSLQEHVQHIRRVLQ 533
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 534 RLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKVQ-----------AVVNWP 580
Query: 700 LDSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
+ AR L +L FA+F RR R S +L AP LT + A P W
Sbjct: 581 ISEARKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSAKTP-FRWSSVA 628
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGL 795
L +S+PI P + F + DAS++G G+ + +F S
Sbjct: 629 QAAFTKLKGCFVSAPILVTPDPARQFVVEVDASEVGVGAILSQRAVSDDRIHPCAFFSHR 688
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 689 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 744
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N+ D+LSR
Sbjct: 745 RQARWALFFGRFDFTI----SYRPGSKNTRPDALSR 776
>gi|326676761|ref|XP_003200671.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1165
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 121/456 (26%), Positives = 186/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 359 PKGKLYSLSIPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 416
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V +K H+ A
Sbjct: 417 LNSITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRMKQGHEWKTAFLTPRGHFE 476
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ ++ + + +
Sbjct: 477 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSRSLQEHVQHVRRVLQ 533
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 534 RLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKVQ-----------AVVNWP 580
Query: 700 L-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
+ +S ++L +L FA+F RR R S +L AP LT + A P W
Sbjct: 581 IPESRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSAKTP-FRWSSVA 628
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGL 795
L +S+PI P + F + DAS++G G+ + +F S
Sbjct: 629 QAAFTKLKGCFVSAPILVTPDPARQFVVEVDASEVGVGAILSQRAASDDRIHPCAFFSHR 688
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 689 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 744
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N+ D+LSR
Sbjct: 745 RQARWALFFGRFDFTI----SYRPGSKNTKPDALSR 776
>gi|327267666|ref|XP_003218620.1| PREDICTED: hypothetical protein LOC100555260 [Anolis carolinensis]
Length = 658
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 101/211 (47%), Gaps = 21/211 (9%)
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
QK+ LSP + F+G + D + +LPE++ L ++ L + + + +S LG++
Sbjct: 391 QKNPLSPTQCIAFIGTLIDTQSQKAFLPEERFRNLRAMVSKLRHHERVSTWTIQSTLGHM 450
Query: 711 SFASFVIPMGRLHSRRIQ----RQASLLR-----LGAPHLTPINPAVLPKLEWWLNALPL 761
+ + V RL R +Q R+ +R L PH P +L A
Sbjct: 451 ASTTVVTLYARLRFRTVQIWFVRKFIPIRDHQNVLSLPHGGQGGPMCARDFRSFLLA--- 507
Query: 762 SSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
H ++TDAS LGWG+ + + G WS ++Q HIN E+ A+ +A+
Sbjct: 508 ---------PHSLTTDASTLGWGAHLQNLTAHGRWSTQEQKLHINALELLAMEKAMKSFT 558
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLS 852
L ++ V+ + +DN TV +Y+ R+G S+S
Sbjct: 559 RLTENQVIQLVTDNTTVKAYINREGPAHSIS 589
>gi|301620344|ref|XP_002939539.1| PREDICTED: hypothetical protein LOC100497597 [Xenopus (Silurana)
tropicalis]
Length = 1152
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 107/405 (26%), Positives = 171/405 (42%), Gaps = 58/405 (14%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ L ++G CLP G + + F + S ++
Sbjct: 769 GALMAKADVESAFRLLPVHRESLHLLGCFFHGKYYVDRCLPMGCSISCAYFEAFSTFIEW 828
Query: 605 LLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSLS 656
++R R G+ ++ YLDDFL V L +L +L + + ++ +
Sbjct: 829 VVRRRAGVNTIIHYLDDFLCVAPG------NSGLCAVLLQTLQEVADQFGVPLAREKTEG 882
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDK-QLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
PA L+FLGI + L LP DK Q+ G + TL A K L +SLLG L+FA
Sbjct: 883 PATCLKFLGIEINTVLQECRLPLDKVQVLKGEVEYTLKAKKV-TLKQLQSLLGKLNFACR 941
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFP 767
+IPMGR+ SR + + +R H + A L W L L
Sbjct: 942 IIPMGRVFSRALAMATAGVRR-PHHFIRLTQAHKEDLAVWKTFLQDFNGRSYWLQKTCEN 1000
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLWSRE--QQNW-------HINKKEMFAVHQALS 818
+++ F TDA+ + ++ SG W Q W ++ E+F + A+
Sbjct: 1001 KEINLF--TDAAG----AGGFGAYFSGRWCAAPWPQEWVELRLTANLTFLELFPIIVAVE 1054
Query: 819 LNLPLLQSSVVMVQSDNQ-TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
L LL + V +DN T ++ G++ + L + L + ++ A+ +P
Sbjct: 1055 LWGHLLANKSVRFHTDNMATALAINNLTSGSRPVLRLLRHLVLRCLQLN--VNFRAKHLP 1112
Query: 878 GAYNSVADSLSRSKSLPDWHLSR-----SATEQIFLKWGVPCIDL 917
G N +AD+LSR + W R +ATE G PC DL
Sbjct: 1113 GITNEIADALSRFQ----WDRFRRLAPGAATE------GDPCPDL 1147
>gi|301605640|ref|XP_002932454.1| PREDICTED: hypothetical protein LOC100489243 [Xenopus (Silurana)
tropicalis]
Length = 1152
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 107/405 (26%), Positives = 171/405 (42%), Gaps = 58/405 (14%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ L ++G CLP G + + F + S ++
Sbjct: 769 GALMAKADVESAFRLLPVHRESLHLLGCFFHGKYYVDRCLPMGCSISCAYFEAFSTFIEW 828
Query: 605 LLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSLS 656
++R R G+ ++ YLDDFL V L +L +L + + ++ +
Sbjct: 829 VVRRRAGVNTIIHYLDDFLCVAPG------NSGLCAVLLQTLQEVADQFGVPLAREKTEG 882
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDK-QLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
PA L+FLGI + L LP DK Q+ G + TL A K L +SLLG L+FA
Sbjct: 883 PATCLKFLGIEINTVLQECRLPLDKVQVLKGEVEYTLKAKKV-TLKQLQSLLGKLNFACR 941
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFP 767
+IPMGR+ SR + + +R H + A L W L L
Sbjct: 942 IIPMGRVFSRALAMATAGVRR-PHHFIRLTQAHKEDLAVWKTFLQDFNGRSYWLQKTCEN 1000
Query: 768 RQVQHFISTDASDLGWGSQVDSSFLSGLWSRE--QQNW-------HINKKEMFAVHQALS 818
+++ F TDA+ + ++ SG W Q W ++ E+F + A+
Sbjct: 1001 KEINLF--TDAAG----AGGFGAYFSGRWCAAPWPQEWVELRLTANLTFLELFPIIVAVE 1054
Query: 819 LNLPLLQSSVVMVQSDNQ-TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
L LL + V +DN T ++ G++ + L + L + ++ A+ +P
Sbjct: 1055 LWGHLLANKSVRFHTDNMATALAINNLTSGSRPVLRLLRHLVLRCLQLN--VNFRAKHLP 1112
Query: 878 GAYNSVADSLSRSKSLPDWHLSR-----SATEQIFLKWGVPCIDL 917
G N +AD+LSR + W R +ATE G PC DL
Sbjct: 1113 GITNEIADALSRFQ----WDRFRRLAPGAATE------GDPCPDL 1147
>gi|326665569|ref|XP_003198070.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1328
Score = 84.0 bits (206), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 113/458 (24%), Positives = 184/458 (40%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ +++ S G + F V K +G RP ++ +G
Sbjct: 413 PKGRLFSLSGPEREAMDRYINESLKAELIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 470
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 471 LNDITIKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 530
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + P++ + +
Sbjct: 531 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSPQVHTQHVRQVLQR 588
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLASKT 697
L V +K A + FLG + ++ G I + A
Sbjct: 589 LLENQLYVKAEKCVFH-AQSVPFLGFI---------------ISAGEIQADPCKVRAVAE 632
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP LT + +P +W
Sbjct: 633 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAP-LTALTSPKVP-FKWKA 680
Query: 757 NALP---------LSSPIF---PRQVQHFISTDASDLGWGSQVD-----------SSFLS 793
+A +S+P+ + Q + DASD+G G+ + +F S
Sbjct: 681 DAQEAFDKLKSRFISAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSCLDGKVHPCAFFS 740
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
S ++N+ I +E+ AV AL L+ + +V +D++ + Y+R K L
Sbjct: 741 HRLSPSERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SAKRL 796
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 797 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 830
>gi|326669631|ref|XP_003199052.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1379
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 134/535 (25%), Positives = 207/535 (38%), Gaps = 76/535 (14%)
Query: 387 EPPGR---VSLKVQTLQKPQRCSSPVN-PPADSRIGAEL-----VGGRLRRFVDAWIRLG 437
EPP + L + T+ KPQ +N PP SR+ E V + R
Sbjct: 403 EPPRHTKAIPLDIMTIPKPQIVPKSLNTPPEISRVPPEYSDLAEVFSKTR---------A 453
Query: 438 APAPLVRIVSGYAIPFSAKPPLVP-LCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTT 496
A P R Y P P P L L+ P +AM ++ E L++G ++ S
Sbjct: 454 ASLPPHR---PYDCPIDLLPGTCPPRGKLYSLSGPERAAMEKYVHESLDSGFIRPSTSPA 510
Query: 497 GFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQA 556
G + F V K +G RP ++ +GLN ++ L LQ +DL A
Sbjct: 511 G--AGFFFVGKKDGSLRPCIDYRGLNSVTVKNRYPLPLMTTAFEILQGATIFTKLDLRSA 568
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVY 616
Y V I+ + A + +PFGLA AP F S N V L + V VY
Sbjct: 569 YHLVRIRQGDEWKTAFNTPTGHYEYQVMPFGLANAPAVFQSFINDV--LREMLNIFVFVY 626
Query: 617 LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIM-------WD 669
LDD L+ + +P I + + L G V L+KS + V FLG + D
Sbjct: 627 LDDILIFSHNPEEHVIHVRKVLIELLKHGLFVKLEKSEFHVSSV-SFLGFIVSKGSLQMD 685
Query: 670 PHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR 729
P R L + + + R +L FA+F R S +
Sbjct: 686 PSKTRAVLDWPQPTSFKEVQR------------------FLGFANFYRRFIRNFSSIAEP 727
Query: 730 QASLLRLGAPHLTPINPA--VLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG--- 784
SL + T + A L+ + P+ + P ++ + DASD+G G
Sbjct: 728 LTSLTKKANTPFTWNDKASTAFNTLKHRFTSAPILTLPDP-ELPFILEVDASDIGVGAVL 786
Query: 785 ---SQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ--SSVVMVQSD 834
S+ D+ +F S + Q N+ I +++ A+ AL L+ S ++ +D
Sbjct: 787 SQRSKADNKLHPCAFYSHRLTPTQANYDIGNRKLLAIKLALEEWRHWLEGASHPFLIWTD 846
Query: 835 NQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+Q ++Y++ K L+ +F R F PG+ N D+LSR
Sbjct: 847 HQN-LTYIQ---NAKRLNARQARWSLFFN----RFKFTLSFRPGSKNIKPDALSR 893
>gi|326674960|ref|XP_003200241.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1392
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 109/455 (23%), Positives = 187/455 (41%), Gaps = 61/455 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P S+ L+ P +AM +I+E L G++++ S G + F V K +GG RP ++ +G
Sbjct: 497 PRGSIFSLSLPERTAMESYIEESLAAGIIRQSTSPAG--AGFFFVGKKDGGLRPCIDYRG 554
Query: 521 LNQFLSPKKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
LN K ++ N + +P LQ+ +DL AY V IK + A +
Sbjct: 555 LN------KITIRNRYPLPLMSTAFEILQEASIFTKLDLRNAYHLVRIKQGDEWKTAFNT 608
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
+PFGL AP F +L N V + ++ V VYLDD L+ + +
Sbjct: 609 PTGHYEYLVMPFGLTNAPAVFQALINDVLRDMLNKF--VFVYLDDILIFSSSLQEHIFHV 666
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ + L + V +K V +FLG + P +M D Q + A
Sbjct: 667 RKVLQRLLNNHLYVKPEKCQFHVTQV-KFLGFIIKPGQIQM----DPQ--------KIQA 713
Query: 695 SKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP--HLTPINPAVLPK 751
W + S + + +L FA+F S ++L + H P K
Sbjct: 714 MVDWPSPSSVKEVQRFLGFANFYRKFILNFSTVAAPLSALTKENGAGFHWGPEAEEAFIK 773
Query: 752 LEWWLNALPLSSPIFPRQVQHF-ISTDASDLGWGSQVD----------SSFLSGLWSREQ 800
L+ + P+ + P + F + DASD+G G+ + +FLS + +
Sbjct: 774 LKKRFTSAPIL--LIPNPDKPFMVEVDASDVGIGAVLSQRGEDNKLHPCAFLSHRLTPTE 831
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
+N+H+ +E+ AV AL L+ + + Q + + K+L + + +++
Sbjct: 832 RNYHVGDRELLAVKLALEEWRHWLEGA----KHPFQVLTDH-------KNLEYVQQAKRL 880
Query: 861 FLLSQDW-----RIHILAQFIPGAYNSVADSLSRS 890
W R H + PG+ N D+LSR+
Sbjct: 881 NPRQARWSLFFNRFHFTLTYRPGSKNLKPDALSRA 915
>gi|308480981|ref|XP_003102696.1| hypothetical protein CRE_30022 [Caenorhabditis remanei]
gi|308260782|gb|EFP04735.1| hypothetical protein CRE_30022 [Caenorhabditis remanei]
Length = 1083
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 110/457 (24%), Positives = 184/457 (40%), Gaps = 33/457 (7%)
Query: 425 RLRRFVDAWIRLGAPAPLVRIVS-GYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
RL ++ W + + ++ ++ GY I ++ L L+ + I+ +
Sbjct: 205 RLSEAIEFWGNICSSEWVLSVIEDGYIIQLDSRVTLPEPQGLRPSVLRHKDFLFAEIERL 264
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
E GVL+R D +S L +V +G R +L+L LN+ L P +F L N FL+
Sbjct: 265 EEEGVLERSDRLPRVVSPLHVVEQGKK-KRMILDLSELNKSLVPPRFKLENMKTAWPFLE 323
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYN----GDVLAMTCLPFGLATAPQAFASLS 599
++ + D Y H+ I + L+ S + + LPFGLATAP F +
Sbjct: 324 NANFAATFDFKSGYHHIKIHRDSRDLLSFSLSNPPAAPYFSFRGLPFGLATAPWLFTKIF 383
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
+ R+ G+++ +YLDD L+V + + + L G V +KS P
Sbjct: 384 KVLVRKWRAEGVKIFLYLDDGLIVGETEYEVARASRRVRGDLAEAGVCVAEEKSFWVPDA 443
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
+LG D + E + T ++L L S ++ LG L ASF +
Sbjct: 444 KFTWLGYECDLVAREVRGTEKRMATWQSVLDELRRSVAPSVLDRMKFLGCL--ASFELVA 501
Query: 720 GRLHSRRIQRQASLLRLGAPHLTPIN------PAVLPKLEWW--LNALPLSSPIFPRQ-- 769
G + R + + + N P + ++E+W A L + +
Sbjct: 502 GDVGVGRARWLMQTVGESQKKMESKNTRKEKSPGEIREIEFWKAYGAELLKRSLLEIEPF 561
Query: 770 VQHFISTDASDLGWGSQVDSSFLSGLWSR--------EQQN--WHINKKEMFAVHQALSL 819
+ TDAS G G + LW E+Q+ W +E+ AV A +
Sbjct: 562 FDFLLFTDASARGVGGLLKDKEGCVLWKMSELGDSNFEEQSSAW----RELTAVEVASAR 617
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
+ ++ S + V D+Q VS LRR L L+E
Sbjct: 618 LIGQVRGS-IQVLVDSQAAVSVLRRGSMKPELHALAE 653
>gi|326680487|ref|XP_003201530.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1336
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 109/455 (23%), Positives = 187/455 (41%), Gaps = 61/455 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P S+ L+ P +AM +I+E L G++++ S G + F V K +GG RP ++ +G
Sbjct: 441 PRGSIFSLSLPERTAMESYIEESLAAGIIRQSTSPAG--AGFFFVGKKDGGLRPCIDYRG 498
Query: 521 LNQFLSPKKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
LN K ++ N + +P LQ+ +DL AY V IK + A +
Sbjct: 499 LN------KITIRNRYPLPLMSTAFEILQEASIFTKLDLRNAYHLVRIKQGDEWKTAFNT 552
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
+PFGL AP F +L N V + ++ V VYLDD L+ + +
Sbjct: 553 PTGHYEYLVMPFGLTNAPAVFQALINDVLRDMLNKF--VFVYLDDILIFSSSLQEHIFHV 610
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ + L + V +K V +FLG + P +M D Q + A
Sbjct: 611 RKVLQRLLNNHLYVKPEKCQFHVTQV-KFLGFIIKPGQIQM----DPQ--------KIQA 657
Query: 695 SKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP--HLTPINPAVLPK 751
W + S + + +L FA+F S ++L + H P K
Sbjct: 658 MVDWPSPSSVKEVQRFLGFANFYRKFILNFSTVAAPLSALTKENGAGFHWGPEAEEAFIK 717
Query: 752 LEWWLNALPLSSPIFPRQVQHF-ISTDASDLGWGSQVD----------SSFLSGLWSREQ 800
L+ + P+ + P + F + DASD+G G+ + +FLS + +
Sbjct: 718 LKKRFTSAPIL--LIPNPDKPFMVEVDASDVGIGAVLSQRGEDNKLHPCAFLSHRLTPTE 775
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
+N+H+ +E+ AV AL L+ + + Q + + K+L + + +++
Sbjct: 776 RNYHVGDRELLAVKLALEEWRHWLEGA----KHPFQVLTDH-------KNLEYVQQAKRL 824
Query: 861 FLLSQDW-----RIHILAQFIPGAYNSVADSLSRS 890
W R H + PG+ N D+LSR+
Sbjct: 825 NPRQARWSLFFNRFHFTLTYRPGSKNLKPDALSRA 859
>gi|294887269|ref|XP_002772025.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239875963|gb|EER03841.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 1174
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 117/464 (25%), Positives = 192/464 (41%), Gaps = 61/464 (13%)
Query: 452 PFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGG 511
PFS KP +P V S + ++LE G+++ ST+ + S VPK NG
Sbjct: 177 PFSQKPRPIP----HKWRDEVKSL----VDKLLEEGMIR--PSTSPYRSPCVYVPKKNGS 226
Query: 512 TRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIK--TTHQRF 569
R ++ + LN ++L + L ++DL Y+ ++ H+
Sbjct: 227 VRMCIDFRALNALTEVDSYTLPRPDDVQEHLAGSKVFSTLDLQSGYWQFLLRPQDIHKTA 286
Query: 570 LALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
+ +PFGL +A F L + V + L V VYLDD L+ + D
Sbjct: 287 FCPGPGFPLYEWVRMPFGLCSAGATFQRLMDQVLNGLPF----VRVYLDDILVFSPDAET 342
Query: 630 LEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL 689
E + + L + G ++ +K P + +LG +D + R P+ ++ IL
Sbjct: 343 HEDHLRQVFARLRAWGLTLSAEKCEFG-CPSVPYLGHFFDGNGMR---PDPTKVE--AIL 396
Query: 690 RTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT------P 743
R + N+ RS LG + +P +R IQR S +G+ L
Sbjct: 397 RW---PRPGNVAEIRSFLGLAGYYRNFVPNFSDVARPIQRLVS--EVGSETLDLDTYWGQ 451
Query: 744 INPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQVDSS-----FLSGLWS 797
L+ L ALP L+ P F + + TDASD G+ + F S +
Sbjct: 452 EQEESFRALKLRLAALPFLAYPDF--GIPFELYTDASDYAIGAVLMQEGRPLGFFSRTLT 509
Query: 798 REQQNWHINKKEMFAVHQAL------SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSL 851
Q NWH +KE + + QAL + PL V V +D++ +++L + G K
Sbjct: 510 GSQLNWHTYEKEAYGILQALIYFQHYHIGYPL----TVTVYTDHEP-LTWLAKAGSKK-- 562
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+E+ L Q + + +++PG N AD+LSR + L D
Sbjct: 563 -----LERWLLAMQAY--SFIVKYVPGKKNVCADALSRIRQLDD 599
>gi|326673827|ref|XP_003200007.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1327
Score = 83.2 bits (204), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 120/466 (25%), Positives = 188/466 (40%), Gaps = 75/466 (16%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+TG+++ S G + F V K +G RP ++ +G
Sbjct: 514 PKGHLFSLSGPEREAMDRYINESLKTGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 571
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 572 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 631
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + L+I + +
Sbjct: 632 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSP---CLQIHIQHVRQV 686
Query: 641 LGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLA 694
L L V +K A + FLG + ++ G I + A
Sbjct: 687 LQRLLENQLYVKAEKCVFH-AQSIPFLGFI---------------ISAGEIQADPCKIRA 730
Query: 695 SKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
W DS ++L +L FA+F RR R + ++ AP +P V +
Sbjct: 731 VAEWPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKVW--FK 778
Query: 754 W---------WLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVD-----------SS 790
W L + +S+P+ P Q FI DASD+G G+ + +
Sbjct: 779 WNSDAQEAFDELKSRFVSAPVLSIPDPEQQFIVEVDASDVGVGAVLSQRSCLDGKVHPCA 838
Query: 791 FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGT 848
F S + ++N+ + +E+ AV AL L+ + +V +D++ + Y+R
Sbjct: 839 FFSHRLNPSERNYDVGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR----- 892
Query: 849 KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
S L+ + + L D R F PG N D+LSR +P
Sbjct: 893 -SARRLTPRQARWALFFD-RFKFTLSFRPGTKNVKPDALSRLFEVP 936
>gi|301604388|ref|XP_002931870.1| PREDICTED: hypothetical protein LOC100494154 [Xenopus (Silurana)
tropicalis]
Length = 505
Score = 83.2 bits (204), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 24/283 (8%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +PI Q L + ++ CLP G + + F S+++
Sbjct: 135 GALMAKADIESAFRLLPIHPECQHLLGCKLDDEIYVDLCLPMGCSISCSYFEKFSSFLEW 194
Query: 605 LLRSR--GMRVVVYLDDFLLVNQDPR-----ILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
++R R +V YLDDFL V + +LE+ ++ L Q ++ P
Sbjct: 195 VVRKRTGSQSLVHYLDDFLCVGRASTEWCSFLLEVLKEVTAEFGVPLA-----QDKTVGP 249
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
L FLGI D LP DK L + ++ K + + +SLLG L+FA VI
Sbjct: 250 VTCLSFLGIEIDTVAGMTRLPGDKLTDLSKGVGEMIGRKKVTVRAVQSLLGKLNFACRVI 309
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQ 769
PMGR+ RR+ + G H+ + V L+ W L + +
Sbjct: 310 PMGRVFCRRLGTLLKGSKEGHHHVR-LTAEVRGDLQIWDQFLKEFNGKVIFRGKEVTNEE 368
Query: 770 VQHFISTDAS-DLGWGSQVDSSFLSGLWSREQQNWHINKKEMF 811
+Q F TDAS +G+G+ ++ + + W E ++ K F
Sbjct: 369 IQLF--TDASGSVGFGAYLNGGWCAAHWPEEWLQGNLLKNLCF 409
>gi|440797650|gb|ELR18732.1| reverse transcriptase, partial [Acanthamoeba castellanii str. Neff]
Length = 406
Score = 83.2 bits (204), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 82/157 (52%), Gaps = 3/157 (1%)
Query: 773 FISTDASDLGWGSQVDSSFLS--GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
+ T+AS GWG+ L+ G WS + HIN ++ V + P LQ ++
Sbjct: 150 ILETNASLSGWGASSSCQTLTAAGWWSSDDSKSHINILKLATVRNTILALQPHLQGKAIL 209
Query: 831 VQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
++ +N V++L GG +S+++ ++I LL + I + + ++PG NS AD LSR
Sbjct: 210 MRCNNIATVAHLNHMGG-QSVAMNRVQKEIHLLCKRLHIQLSSAYLPGLCNSEADRLSRL 268
Query: 891 KSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVP 927
+WHLSR A E I KWG ID A+R + +P
Sbjct: 269 HPHHEWHLSREAFESINKKWGPHSIDQTATRENRQLP 305
>gi|326671184|ref|XP_003199379.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1327
Score = 83.2 bits (204), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 120/466 (25%), Positives = 188/466 (40%), Gaps = 75/466 (16%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+TG+++ S G + F V K +G RP ++ +G
Sbjct: 514 PKGHLFSLSGPEREAMDRYINESLKTGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 571
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 572 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 631
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + L+I + +
Sbjct: 632 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSP---CLQIHIQHVRQV 686
Query: 641 LGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLA 694
L L V +K A + FLG + ++ G I + A
Sbjct: 687 LQRLLENQLYVKAEKCVFH-AQSIPFLGFI---------------ISAGEIQADPCKIRA 730
Query: 695 SKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
W DS ++L +L FA+F RR R + ++ AP +P V +
Sbjct: 731 VAEWPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKVW--FK 778
Query: 754 W---------WLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVD-----------SS 790
W L + +S+P+ P Q FI DASD+G G+ + +
Sbjct: 779 WNSDAQEAFDELKSRFVSAPVLSIPDPEQQFIVEVDASDVGVGAVLSQRSCLDGKVHPCA 838
Query: 791 FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGT 848
F S + ++N+ + +E+ AV AL L+ + +V +D++ + Y+R
Sbjct: 839 FFSHRLNPSERNYDVGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR----- 892
Query: 849 KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
S L+ + + L D R F PG N D+LSR +P
Sbjct: 893 -SARRLTPRQARWALFFD-RFKFTLSFRPGTKNVKPDALSRLFEVP 936
>gi|301612278|ref|XP_002935646.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1243
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 111/448 (24%), Positives = 189/448 (42%), Gaps = 61/448 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G +++ +S G + F V K +GG RP ++ +GLN+
Sbjct: 389 LSIPETQAMKDYIQENLSKGFIRKSNSPAG--AGFFFVQKKDGGLRPCIDYRGLNKITIK 446
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + L +DL AY + I+ + A + +PFG
Sbjct: 447 NRYPLPLIPELFDRLNGAKVFTKLDLRGAYNLIRIRHGDEWKTAFNTRDGHYEYLVMPFG 506
Query: 588 LATAPQAFASLSNWV-ASLLRSRGMRVVVYLDDFL-----LVNQDPRILEIQGKLAVSIL 641
L AP F N + +L S VVVYLDD L L + ++ +L V+ L
Sbjct: 507 LCNAPAVFQDFINDIFRDILFS---YVVVYLDDILVFPSSLPEHIDHVKQVLHRLRVNHL 563
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD 701
+ + KS +S FLG + RM + L +L W
Sbjct: 564 YAKIEKCDFHKSEVS------FLGYVISSSGFRM-----DPVKLSAVLE-------WPPP 605
Query: 702 SA-RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNAL 759
+ +++ +L FA+F + S+ + +L + G + + A KL+ +
Sbjct: 606 AGLKAIQQFLGFANFYRRFIKGFSQIVAPITALTKKGVKDVWSSEAQAAFEKLKAAFCSA 665
Query: 760 PLSSPIFPRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINK 807
P+ I P FI DASD+G G+ + ++ S +S ++N+ +
Sbjct: 666 PVL--IHPVPTCPFILEVDASDVGVGAILSQRPSFQDSLHPCAYFSRKFSAAERNYDVGN 723
Query: 808 KEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
+E+ A+ AL LL+ S S+ T+++ K+L +S+ +++ W
Sbjct: 724 RELLAIKLALQEWRHLLEGS-----SEPVTILT------DHKNLEYISDAKRLNPRQARW 772
Query: 868 -----RIHILAQFIPGAYNSVADSLSRS 890
R + L F PG+ N AD+LSRS
Sbjct: 773 ALFFSRFNFLISFRPGSKNIKADALSRS 800
>gi|301617705|ref|XP_002938285.1| PREDICTED: hypothetical protein LOC100496793, partial [Xenopus
(Silurana) tropicalis]
Length = 1057
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 115/276 (41%), Gaps = 29/276 (10%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D+ A+ +PI L + G CLP G A + +AF++ W
Sbjct: 762 GALMAKADIESAFRLLPIHPECHHLLGCWFEGAYFVDLCLPMGCAISCAHFEAFSTFLEW 821
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V + RS VV YLDDF V Q +LE + S G + K+
Sbjct: 822 VVKV-RSGYRSVVHYLDDFFCVGQAKSDTCFHLLET----LREVTASFGVPLAADKTE-G 875
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA V++FLG+ D LP K L + +L K L +S+LG L+FA V
Sbjct: 876 PATVMRFLGLEIDSLAGECRLPTQKVADLMREVGSLRRDKKATLQRLQSVLGKLNFACRV 935
Query: 717 IPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWW------LNALPLSSPIFPRQ 769
IP+GR+ SRR+ + + R AP H I V L W N L
Sbjct: 936 IPVGRVFSRRLAQATAGAR--APHHHVRITKEVKADLGVWEAFLADFNGRVLFRASETTA 993
Query: 770 VQHFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW 803
+ + TDA+ GS+ ++L+G W Q Q W
Sbjct: 994 QELELYTDAA----GSKGFGAYLAGRWCAAQWPQEW 1025
>gi|301609602|ref|XP_002934358.1| PREDICTED: hypothetical protein LOC100487718 [Xenopus (Silurana)
tropicalis]
Length = 913
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 139/351 (39%), Gaps = 49/351 (13%)
Query: 548 MISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLR 607
M +D+ A+ +P+ L + G CLP G + + F + S ++ ++R
Sbjct: 595 MAKVDVESAFRLLPVHQESLHLLGCYFEGGYYVDCCLPMGCSISCAYFEAFSTFIEWVVR 654
Query: 608 SRG--MRVVVYLDDFLLVNQDPRIL-EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFL 664
R V+ YLDDFL V +L + ++ + S G + K+ P L+FL
Sbjct: 655 KRAGANSVIHYLDDFLCVGPGHSMLCAVLLQMFQRVADSFGVPLAPDKTE-GPTTCLRFL 713
Query: 665 GIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHS 724
GI D LP+DK L + +K L +SLLG L+FA +IPM L +
Sbjct: 714 GIEIDTIRQECRLPQDKIQQLKEEVGYAREAKKITLRQLQSLLGKLNFACRIIPMRSLMA 773
Query: 725 RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG 784
++W S P + H + A G+G
Sbjct: 774 ----------------------------DYW------QSQPRPNRELHLFTDAAGSAGFG 799
Query: 785 SQVDSSFLSGLWSREQQNWHINK-------KEMFAVHQALSLNLPLLQSSVVMVQSDNQT 837
+ + + W W NK E F + A+ L L++ V+ +DN +
Sbjct: 800 AYFAGKWCAASWP---NTWVENKLTGNLTLLEFFPIIVAIELWGTQLKNQSVVFFTDNMS 856
Query: 838 VVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
VV + + S +L+ ++ + L + A+ +PG N +ADSLS
Sbjct: 857 VVMAITNL-TSGSRPVLNLLKHLVLRCLQLNVRFEAKHVPGHTNEIADSLS 906
>gi|294894738|ref|XP_002774931.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239880706|gb|EER06747.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 1653
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 109/457 (23%), Positives = 192/457 (42%), Gaps = 55/457 (12%)
Query: 461 PLCS-LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
P+C ++ + +S ++EM E GV++R ST+ + VPK NG R ++ +
Sbjct: 220 PICERIRPIPHKYRDEISTLLKEMEELGVIRR--STSAWRFPCVFVPKKNGKVRMCIDYR 277
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD-- 577
LN+ + + + + L ++DL Y+ +P++ QR A
Sbjct: 278 NLNKACHTEAYPVPRPDDVQEHLAGARVFSTLDLRSGYWQIPVRKEDQRKTAFCPGPGFP 337
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ-DPRILEIQGKL 636
+ +PFGLA+AP F L + + L V VYLDD L+ ++ D LE ++
Sbjct: 338 LYEWVMMPFGLASAPATFRRLMDAILGHLPF----VRVYLDDVLIFSRSDEEHLE-HLRI 392
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD-----PHLDRMWLPEDKQLTLGNILRT 691
+L + G V +K V +LG M++ P L + + ILR
Sbjct: 393 VFELLRAAGMTVAAKKCEFMQDRV-TYLGHMFNSTGMSPDLGKAEV----------ILRW 441
Query: 692 LLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR------QASLLRLGAPHLTPIN 745
L L RS LG + +P SR + + ++ LG
Sbjct: 442 PLPRTAPAL---RSFLGLAGYYRNFVPHFADKSRCLYEIVNYCTKNKVVELGN-QWGKEE 497
Query: 746 PAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSRE 799
L+ W+ +LP L+ P F R Q + TDASD+ G+ V+ +F S +
Sbjct: 498 ELAFNDLKQWIASLPLLAYPDFSRPFQ--LMTDASDVAIGAVVEQDGRPLAFFSQSLTPT 555
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQS-SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
Q+ W + ++E +A+ +AL LL + +V + + +++ K V+
Sbjct: 556 QKVWPVYEREAYAIFKALERFRSLLWGYHLELVVFSDHKPLEWIQTATTAK-------VQ 608
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ + ++ + + G +N VAD+LSR + D
Sbjct: 609 RWLISMSQFKFKVF--YKKGKHNVVADALSRITTSDD 643
>gi|326668065|ref|XP_003198726.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1332
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 121/467 (25%), Positives = 188/467 (40%), Gaps = 77/467 (16%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+TG+++ S G + F V K +G RP ++ +G
Sbjct: 514 PKGHLFSLSGPEREAMDRYINESLKTGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 571
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 572 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 631
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + L+I + +
Sbjct: 632 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSP---CLQIHIQHVRQV 686
Query: 641 LGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLA 694
L L V +K A + FLG + ++ G I + A
Sbjct: 687 LQRLLENQLYVKAEKCVFH-AQSIPFLGFI---------------ISAGEIQADPCKIRA 730
Query: 695 SKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
W DS ++L +L FA+F RR R + ++ AP +P V K
Sbjct: 731 VAEWPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKVWFK-- 778
Query: 754 WW----------LNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVD-----------S 789
W L + +S+P+ P Q FI DASD+G G+ +
Sbjct: 779 -WNSDAQEAFDELKSRFVSAPVLSIPDPEQQFIVEVDASDVGVGAVLSQRSCLDGKVHPC 837
Query: 790 SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGG 847
+F S + ++N+ + +E+ AV AL L+ + +V +D++ + Y+R
Sbjct: 838 AFFSHRLNPSERNYDVGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---- 892
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
S L+ + + L D R F PG N D+LSR +P
Sbjct: 893 --SARRLTPRQARWALFFD-RFKFTLSFRPGTKNVKPDALSRLFEVP 936
>gi|326673534|ref|XP_003199911.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1327
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 120/466 (25%), Positives = 188/466 (40%), Gaps = 75/466 (16%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+TG+++ S G + F V K +G RP ++ +G
Sbjct: 514 PKGHLFSLSGPEREAMDRYINESLKTGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 571
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 572 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 631
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + L+I + +
Sbjct: 632 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSP---CLQIHIQHVRQV 686
Query: 641 LGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLA 694
L L V +K A + FLG + ++ G I + A
Sbjct: 687 LQRLLENQLYVKAEKCVFH-AQSIPFLGFI---------------ISAGEIQADPCKIRA 730
Query: 695 SKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
W DS ++L +L FA+F RR R + ++ AP +P V +
Sbjct: 731 VAEWPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKVW--FK 778
Query: 754 W---------WLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVD-----------SS 790
W L + +S+P+ P Q FI DASD+G G+ + +
Sbjct: 779 WNSDAQEAFDELKSRFVSAPVLSIPDPEQQFIVEVDASDVGVGAVLSQRSCLDGKVHPCA 838
Query: 791 FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGT 848
F S + ++N+ + +E+ AV AL L+ + +V +D++ + Y+R
Sbjct: 839 FFSHRLNPSERNYDVGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR----- 892
Query: 849 KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
S L+ + + L D R F PG N D+LSR +P
Sbjct: 893 -SARRLTPRQARWALFFD-RFKFTLSFRPGTKNVKPDALSRLFEVP 936
>gi|291239414|ref|XP_002739618.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 793
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 104/415 (25%), Positives = 176/415 (42%), Gaps = 54/415 (13%)
Query: 499 LSRLFLVPKGNGGTRPVLNL-----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
++ L L PK NGG R +++L +N + +FSL H+ S + MI+ L
Sbjct: 380 VNSLGLRPKKNGGHRIIMDLSQPRTDSVNSNICKDEFSL--HY---STVDDAVAMIN-KL 433
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRV 613
+A I H L C F + T P F ++ + ++ +R +
Sbjct: 434 GRATLLAKINIQHAFRL------------CPSFAIWT-PFLFNRIAEAINWIVCNRTNNI 480
Query: 614 VVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
+ YLDD+L+ + ++ ++ +LG V + ++ L P L FLGI D
Sbjct: 481 LHYLDDYLITGPANSKVCASSLDTMLTTCQALG--VPIAQNKLE-GPTLTFLGIEMDTVN 537
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQAS 732
+ LP+DK + + + T SL+G LSFA IP GR+ +R+ ++
Sbjct: 538 RVLRLPQDKLNDITSSSEKWSTTTTCMKQELLSLIGTLSFACKCIPAGRIFLQRMIDLST 597
Query: 733 LLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQVQHFISTDASDLGWG 784
+T ++ + LEWW + LP L S P + +S + G
Sbjct: 598 TASALHQRIT-LSTSFQLDLEWWKDFLPTWNGTASFLDSTWTPVPEMELYTDASSTIRCG 656
Query: 785 SQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV----------VMVQSD 834
+ + S WS N N + V + L LP+L S + VM D
Sbjct: 657 GYFNGEWFSLQWSAILSN---NDHQHSIVWKEL---LPILLSCLIWGHLWHGRRVMFHCD 710
Query: 835 NQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
N+++V + R+G T+ ++ + IFL + H++ I G N++AD+LSR
Sbjct: 711 NESIV-HTWRKGSTRCPYIMQLIRAIFLTAASSNFHVMITHIRGTDNNIADALSR 764
>gi|328545953|ref|YP_004347415.1| replicase [Sweet potato caulimo-like virus]
gi|327415369|gb|ADR03142.2| replicase [Sweet potato caulimo-like virus]
Length = 641
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/449 (20%), Positives = 196/449 (43%), Gaps = 61/449 (13%)
Query: 474 SAMSLHIQEMLETGVLKRLDS--TTGFLSRLFLVPKGN----GGTRPVLNLKGLNQFLSP 527
+HI EM++ G ++ + + S F+V K + G +R V++ K LN
Sbjct: 213 KEFKMHIDEMVKEGFIEECKNLENKKYSSPAFIVNKHSEIKRGKSRMVIDYKDLN----- 267
Query: 528 KKFSLINHFRIPS---FLQKG---DYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
KK +I H IP+ + +G +Y D ++H+ ++ +++ A +
Sbjct: 268 KKAKVIKH-PIPNKDILINRGIKANYFSKFDCKSGFYHIKLEEDSKKYTAFTVPQGYYVW 326
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
LPFG +P + ++ + R ++VY+DD L+ ++ +I ++ +I+
Sbjct: 327 IVLPFGYHNSPSIYQQ---FMDGIFRPYYDFILVYIDDILIFSKTYEEHKIHLEIFRNII 383
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWD-------PHLDRMWLPEDKQLTLGNILRTLLA 694
G +++ +K+ + + +FLG+ + PH+ L + ++ L+++L
Sbjct: 384 IKHGIVLSKKKAEIGKQKI-EFLGVKIEQGGIELQPHIIDKILEKHIKIKSKKELQSILG 442
Query: 695 SKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW 754
++ R+ L LS ++P IQ++ + T + + KL+
Sbjct: 443 L----VNQIRNFLPNLS--KILLP--------IQKKLKIKNEEVWEWTKEDEQNIIKLKD 488
Query: 755 WL--NALPLSSPIFPRQVQHFISTDASDLGWGS---------QVDSSFLSGLWSREQQNW 803
+ N + ++ P+ + + I DAS +G+ + + SG + ++N+
Sbjct: 489 YCKDNVIKMTYPVEEKDMNWIIEVDASKEYYGNCLKYKKDKIEYICRYNSGTFKEHEKNY 548
Query: 804 HINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF-- 861
IN+KE+ A+++ L +V++DN +++ S+ + V+ I
Sbjct: 549 DINRKELIAIYKGLEHYAIFTTQGKKLVRTDNSQAYYWIKNSKIKNSID-MKNVKGILAK 607
Query: 862 LLSQDWRIHILAQFIPGAYNSVADSLSRS 890
++ D+ I I I G N VAD LSR+
Sbjct: 608 IIMYDFDIEI----IDGKTNIVADFLSRN 632
>gi|326673580|ref|XP_003199928.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1323
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 121/456 (26%), Positives = 186/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 341 PKGKLYSLSIPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 398
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V +K H+ A
Sbjct: 399 LNSITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRMKQGHEWKTAFLTPRGHFE 458
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ ++ + + +
Sbjct: 459 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSRSLQEHVQHVRRVLQ 515
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 516 RLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKVQ-----------AVVNWP 562
Query: 700 L-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
+ +S ++L +L FA+F RR R S +L AP LT + A P W
Sbjct: 563 IPESRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSAKTP-FRWSSVA 610
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGL 795
L +S+PI P + F + DAS++G G+ + +F S
Sbjct: 611 QAAFTKLKGCFVSAPILVTPDPARQFVVEVDASEVGVGAILSQRAASDDRIHPCAFFSHR 670
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 671 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 726
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N+ D+LSR
Sbjct: 727 RQARWALFFGRFDFTI----SYRPGSKNTKPDALSR 758
>gi|326666222|ref|XP_003198219.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1296
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 115/464 (24%), Positives = 184/464 (39%), Gaps = 71/464 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +IQE LE +++ S G + F V K +G RP ++ +G
Sbjct: 384 PKGRLYSLSAPEREAMDRYIQESLEADLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 441
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 442 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 501
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ ++ ++ + +
Sbjct: 502 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSESEQVHTQHVRQVLQR 559
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFL-------GIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
L V +K V FL GI DP R
Sbjct: 560 LLENQLYVKAEKCVFHSKSV-SFLGHIVSTEGIKADPAKVR------------------- 599
Query: 694 ASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQ-----ASLLRLGAPHLTPINPA 747
A W + DS ++L +L FA+F RR R A L L +P + I +
Sbjct: 600 AVAKWPVPDSRKALQRFLGFANFY--------RRFIRNFSSVAAPLTALTSPKVPFIWHS 651
Query: 748 VLPKLEWWLNALPLSSPIF----PRQVQHFISTDASDLGWGSQVD-----------SSFL 792
+ L + +++P+ P++ Q + DAS++G G+ + +F
Sbjct: 652 QAQEAFDVLKSRFITAPVLCLPDPKR-QFIVEVDASEVGIGAVLSQRSSRDGKVHPCAFF 710
Query: 793 SGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKS 850
S S ++N+ I +E+ AV AL L+ + +V +D++ + Y+R K
Sbjct: 711 SHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SAKR 766
Query: 851 LSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
LS +F R + + PG+ N D+LSR +P
Sbjct: 767 LSSRQARWALFF----GRFNFSLSYRPGSKNIKPDALSRLFDVP 806
>gi|301632157|ref|XP_002945157.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Xenopus (Silurana) tropicalis]
Length = 873
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 114/441 (25%), Positives = 186/441 (42%), Gaps = 46/441 (10%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ +S G + F V K +GG RP ++ +GLN+
Sbjct: 183 LSLPEAQAMREYISENLERGFIRPSNSPAG--AGFFFVGKKDGGLRPCIDYRGLNKITIK 240
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ + +DL AY + IK + A + +PFG
Sbjct: 241 NRYPLPLISELFDRVKGANIYTKLDLRGAYNLIRIKEGDEWKTAFNTRDGHYEYLVMPFG 300
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N + L G+ VVVYLDD L+ + + R K+ + L
Sbjct: 301 LCNAPAVFQEFVNDIFRDL--LGVFVVVYLDDILIFSSNLRDHRSHVKVVLQRLRENNLY 358
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD-SARSL 706
L+K + V QFLG H+ L D + +R +L W S R+
Sbjct: 359 AKLEKCTFEVNSV-QFLGF----HISSKGLEMDPEK-----VRAVL---DWMQPLSLRAT 405
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKLEWWLNALPLSSPI 765
+L FA++ + S + L + GA P L K +L +S+PI
Sbjct: 406 QRFLGFANYYRQFIKNFSLIVAPITDLTKKGADPSLWSSEAVQAFK---FLKKEFVSAPI 462
Query: 766 F--PRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKEMF 811
P FI DAS++G G+ + +F S +S + N+ I +E+
Sbjct: 463 LRHPDTALPFIVEVDASEVGAGAILSQRHPLTNKLHPCAFFSKKFSPSEANYDIGNRELL 522
Query: 812 AVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRI 869
A+ A LL+ + V V +D++ ++ Y+ K L+ +F R
Sbjct: 523 AIKWAFEEWRHLLEGAKHAVSVFTDHKNLL-YIE---SAKRLNPRQARWALFFS----RF 574
Query: 870 HILAQFIPGAYNSVADSLSRS 890
+ + PG+ N+ AD+LSRS
Sbjct: 575 NFSITYRPGSKNTKADALSRS 595
>gi|326671938|ref|XP_003199556.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1236
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 116/459 (25%), Positives = 182/459 (39%), Gaps = 61/459 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +IQE L TG+++ S G + F V K +G RP ++ +G
Sbjct: 336 PKGRLYSLSGPEREAMDRYIQESLSTGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 393
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 394 LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 453
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + ++ V VYLDD L+ + + + +
Sbjct: 454 YLVLPFGLTNAPAVFQALVNDVLRDMVNKF--VFVYLDDILIFSSSLQAHTHHVRQVLQR 511
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L V +K V FLG + + +I + +K
Sbjct: 512 LLENQLFVKAEKCEFHTKSV-TFLGYV-----------ISAEGIKPDIAKVRGVAKWPVP 559
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW------ 754
+ + L +L FA+F RR R S ++ AP + V+ W
Sbjct: 560 HTRKGLQRFLGFANFY--------RRFIRNFS--QIAAPLTALTSTKVM--FRWNTQAQE 607
Query: 755 ---WLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWS 797
L + +S+P+ P Q FI DASD+G G+ + +F S S
Sbjct: 608 AFDVLKSRFISAPVLSIPDPEQQFIVEVDASDVGVGAVLSQRSPKDGKVHPCAFFSHRLS 667
Query: 798 REQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLS 855
++N+ I KE+ AV AL L+ +V +V +D++ + Y+R K LS
Sbjct: 668 PAERNYDIGNKELLAVKLALGEWRHWLEGAVHPFLVWTDHKN-LEYVR---SAKRLSARQ 723
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
+F R + + PG+ N+ D+LSR +P
Sbjct: 724 ARWALFF----GRFNFSLSYRPGSKNTKPDALSRLFEVP 758
>gi|66828855|ref|XP_647781.1| hypothetical protein DDB_G0267240 [Dictyostelium discoideum AX4]
gi|60475930|gb|EAL73857.1| hypothetical protein DDB_G0267240 [Dictyostelium discoideum AX4]
Length = 818
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 67/127 (52%)
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+PS +++G YM+ +D+ +AY HV + ++ + G +PFGL+TAP+ F
Sbjct: 7 LPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMPFGLSTAPRIFTM 66
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
L V +LR + V+ YLDD L+V K + +L LG+ +NL+KS L P
Sbjct: 67 LLRPVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMDLLVKLGFKLNLEKSVLEP 126
Query: 658 APVLQFL 664
+ FL
Sbjct: 127 TQSITFL 133
>gi|308475765|ref|XP_003100100.1| hypothetical protein CRE_21296 [Caenorhabditis remanei]
gi|308265905|gb|EFP09858.1| hypothetical protein CRE_21296 [Caenorhabditis remanei]
Length = 1034
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 127/522 (24%), Positives = 210/522 (40%), Gaps = 46/522 (8%)
Query: 432 AWIRLGAPAPLVRIVS-GYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLK 490
AW + ++ +V GY I P L+ A ++ + +++E+G +
Sbjct: 115 AWREIVKDEWIMSVVEKGYVIQLGPDPIFPEPEGLRKSAKRHIDFITSEVAKLVESGAVT 174
Query: 491 RLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMIS 550
+S +S L +V +G R +L+L N+ LSP KF+L L++ + +
Sbjct: 175 VTESPK-VISPLHVVEQGEK-KRLILDLSEFNKNLSPPKFTLETWKHAAPELRRMSFAAT 232
Query: 551 IDLSQAYFHVPIKTTHQRFLALSYN----GDVLAMTCLPFGLATAPQAFASLSNWVASLL 606
D Y HV I+ FLA S LPFGL+TAP F + +
Sbjct: 233 FDFKSGYHHVKIEENSSDFLAFSLTDPPTAPFYKYRALPFGLSTAPWLFTKIFRPIVGKW 292
Query: 607 RSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI 666
R G++V +Y+DD L+V + L + L LG + +K + P+ V +LG
Sbjct: 293 RREGIKVWLYIDDGLVVAETKEELTRAVSIVKRDLERLGVALADEKCNWEPSSVFTWLGF 352
Query: 667 MWDPHLDRMWLPEDKQ---LTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI-PMGRL 722
+ D + L E + L +++ LA LD R LG LS FV +
Sbjct: 353 VGDMRRKTVTLSEKRYKAVLHRLEVIKGRLAPTV--LDRER-FLGSLSSMLFVAGNEAQA 409
Query: 723 HSRRIQRQ-ASLLRLGAPHLTPIN--PAVLPKLEWW-LNALPLSSPIFPRQVQHF--IST 776
SR +Q A+ R P I L ++ +W N LSS + + T
Sbjct: 410 RSRHMQSAVATARREDWPETRQIEKTKGELAEIRFWSENIRRLSSTTLEENFRPVWRVYT 469
Query: 777 DASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ-----SSVVMV 831
DAS G G+ + + + + K E A+ + ++ + + V+
Sbjct: 470 DASADGMGALLKNLEGEVVCRISEVGADTFKSESSAMRELKAMRMLARRIAGWIRGAVVC 529
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRI-----HILAQFIPGAYNSVADS 886
D+Q V+ L++ S+ SE ++I Q W ++ +IP N AD
Sbjct: 530 YLDSQAAVAILKKG------SMNSEWQEI--AEQVWDALQTVGNVRFLWIPRELNKEADF 581
Query: 887 LSRSKSLPDWHLSRSATEQIFL----KWGVPCIDLFASRVSA 924
SR DW + +++FL +WG D FA +A
Sbjct: 582 ASRDFDFDDWGVD----QKVFLWAQTRWGKFKCDWFADEANA 619
>gi|301632006|ref|XP_002945082.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Xenopus (Silurana) tropicalis]
Length = 888
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 108/445 (24%), Positives = 188/445 (42%), Gaps = 45/445 (10%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E L+ G ++ S G + F V K +G RP ++ +GLN+
Sbjct: 113 LSLPEAQAMKEYINENLQRGFIRPSSSPAG--AGFFFVGKKDGSLRPCIDYRGLNKITVK 170
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ + +DL AY + I+ + A + +PFG
Sbjct: 171 NRYPLPLISELFDQVRNAKFFTKLDLRGAYNLIRIRVGDEWKTAFNTRDGHYEYLVMPFG 230
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N++ L G+ VVVYLDD L+ + + + + L
Sbjct: 231 LCNAPAVFQEFVNYIFRDL--LGLFVVVYLDDILIFSSNQSDHRNHVREVLLRLRRNNLY 288
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLL 707
L+K P +QFLG ++ D+ L + ++ + L S R+L
Sbjct: 289 AKLEKCIFE-VPSVQFLG----------FVISDEGLAMDSVKVKAILEWAQPL-SLRALQ 336
Query: 708 GYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKLEWWLNALPLSSPIF 766
+L FA++ + S + L + GA P L + + E+ N +S+PI
Sbjct: 337 RFLGFANYYRQFIKNFSLIVAPLTDLTKKGADPSLW--SSKAVHAFEFLKNEF-VSAPIL 393
Query: 767 PR---QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKEMFA 812
+ + DAS++G G+ + +F S +S + N+ I +E+ A
Sbjct: 394 RHPDTSLPFIVEVDASEVGAGAVLSQRHPTTNKMHPCAFFSKKFSPAEVNYDIGNRELLA 453
Query: 813 VHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
V A LL+ + VMV +D++ ++ Y+ K + +F R +
Sbjct: 454 VKWAFEEWRHLLEGAKYPVMVFTDHKNLL-YIE---SAKRFNPRQARWALFFS----RFN 505
Query: 871 ILAQFIPGAYNSVADSLSRS-KSLP 894
F PG+ N AD+LSRS +S+P
Sbjct: 506 FSLTFRPGSKNIKADALSRSFESIP 530
>gi|326668118|ref|XP_003198743.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1268
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 114/458 (24%), Positives = 185/458 (40%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 334 PKGRLFSLSGPEREAMDRYINESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 391
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 392 LNDITIKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 451
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 452 YRVLPFGLTNAPAVFQALVNNVLRDMVNRF--VFVYLDDILIFSPSLKVHTQHVRQVLQR 509
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLASKT 697
L V +K V FLG + ++ G I + A
Sbjct: 510 LLENQLYVKAEKCVFHVQSV-SFLGFI---------------ISAGEIQADPCKVKAVAE 553
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP +P V +W +
Sbjct: 554 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKV--PFKWEV 601
Query: 757 NALP---------LSSPIF--PRQVQHFI-STDASDLGWG------SQVDS-----SFLS 793
+A +S+P+ P + FI DASD+G G S++D +F S
Sbjct: 602 DAQEAFDKLKSRFVSAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSRLDGKVHPCAFFS 661
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
+ ++N+ I +E+ AV AL L+ + +V +D++ + Y+R + L
Sbjct: 662 HRLNPSERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SARRL 717
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 718 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 751
>gi|384489900|gb|EIE81122.1| hypothetical protein RO3G_05827 [Rhizopus delemar RA 99-880]
Length = 595
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 112/228 (49%), Gaps = 25/228 (10%)
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
K+ V L SLG+++N +KS+L+P+ +FLG ++ R+ +P+ K + I R+ A
Sbjct: 9 KVVVHHLTSLGFLINWEKSALNPSKTQEFLGFNFNTETMRIKVPQGKMNKI--IQRSRQA 66
Query: 695 SKTWNLDSAR---SLLGYLSFASFVIPMGRLHSRRIQRQASL-LRLGAPHLTPINPAVLP 750
KT + S R SL+G ++ I LH R +QR + LR+ + P VL
Sbjct: 67 MKTTTIRSCRWIASLIGKMTSVIPAIGEALLHVRHLQRDLTKSLRMNGYKNWEV-PCVLS 125
Query: 751 -----KLEWW------LNALPLS----SPIFPRQVQHFISTDASDLGWGSQVDSSFLSGL 795
L+WW N LP+ + P+ H DAS+ GW + + SG
Sbjct: 126 THSLQDLQWWEKWSTVKNGLPIHVTPPEILMPKLTIHV---DASNTGWRVKSNVMETSGF 182
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLR 843
W+ E++ IN +E+ ++ AL L + S + + SDN+T + Y++
Sbjct: 183 WTEEEKETSINVRELQTIYFALKLQARNAKDSTIHIFSDNKTALKYVQ 230
>gi|1402848|gb|AAC28743.1| pol-like protein [Ceratitis capitata]
Length = 1060
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 111/445 (24%), Positives = 186/445 (41%), Gaps = 55/445 (12%)
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK-----GNGGTRPVLNLKGLNQFLS 526
+ S + I ++LE G+++ S + + S +++V K GN R V++ + LN
Sbjct: 207 LKSEVERQINKLLEDGIIR--PSRSPYNSPVWIVDKKPDSLGNKQYRLVIDYRKLNSVTI 264
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
++ + + S L + IDL + +P+K + A S N + T LPF
Sbjct: 265 ADRYPIPEINEVLSHLGSNTFFSVIDLKSGFHQIPLKNSDIEKTAFSINNEKYEFTRLPF 324
Query: 587 GLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
GL AP F + +LR G VY+DD ++ +++ + K + L
Sbjct: 325 GLKNAPSIFQR---TLDDILRDYIGQCCYVYIDDIIIFSRNEKEHSTHLKNIFTTLEKAN 381
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARS 705
V L K V +FLG + P + + + + I R NL RS
Sbjct: 382 MKVQLDKCKFFEKEV-EFLGFIVTPEGIKTNPSKIEAIQNFPIPR--------NLKELRS 432
Query: 706 LLG-----------YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW 754
LG Y A + + R R+ + S H+T + L LE
Sbjct: 433 FLGLSGYYRRFVKDYAKLAKPLTALLRGEEGRVSKSQS----ARAHIT-LGDEALAALEK 487
Query: 755 WLNALPLSSPI---FPRQVQHF-ISTDASDLGWG---SQVDS--SFLSGLWSREQQNWHI 805
N L +S + +P + F ++TDAS+ G SQ D +F+S ++ ++N+
Sbjct: 488 IKNVL-ISRDVMLTYPNLNKDFELTTDASNYAIGAVLSQEDRPITFISRTLTKTEENYAA 546
Query: 806 NKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
N+KEM A+ AL SL L S+ V + +D+Q + L + + K L
Sbjct: 547 NEKEMLAIIWALKSLRNYLYGSAKVKIFTDHQPLTYALSNKNNNSKMKRW----KAILEE 602
Query: 865 QDWRIHILAQFIPGAYNSVADSLSR 889
++ + ++ PG N VAD LSR
Sbjct: 603 YNYEL----KYKPGKTNVVADGLSR 623
>gi|322699332|gb|EFY91094.1| pol protein [Metarhizium acridum CQMa 102]
Length = 874
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 113/500 (22%), Positives = 203/500 (40%), Gaps = 86/500 (17%)
Query: 473 SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL 532
A+ ++++ML+ G ++ +S G+ + VPK NG RP ++ + LN+ ++ L
Sbjct: 204 DKALKEYLEDMLQKGYIRPSESPAGY--PILWVPKKNGKLRPCIDYRLLNKITIKNRYPL 261
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I + K + ++DL AY + +K H+ A + +PFGL AP
Sbjct: 262 PLMTEIRDKVGKAKWFTTLDLKGAYNLIRMKEGHEWMTAFRTSRGHYEYLVMPFGLTNAP 321
Query: 593 QAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL---GWIV 648
F + + ++LR + G+ VVVYLDD L+ + LE + +L +L +V
Sbjct: 322 ATFQRM---IDTILRKQLGVFVVVYLDDILIYSD---TLEEHKRHVHEVLQTLQDNKLLV 375
Query: 649 NLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT---LLASKTW----NLD 701
K V FLG + LT G I + + K W NL
Sbjct: 376 EASKCQFHQNTV-HFLGYV---------------LTHGEIRMSPDKIKTIKEWPTPKNLK 419
Query: 702 SARSLLGYLSFA-SFVIPMGRLHSR---RIQRQASLLRLGAPHLTPINPAVLPKLEWWLN 757
R +++F F+ G + SR + ++ + P T +
Sbjct: 420 EVRGFTAFVNFYRKFLSGYGDI-SRPLTNLTKKEVGFQWNEPEATAFQK---------MK 469
Query: 758 ALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVDS----------SFLSGLWSREQQNWH 804
L S P+ P Q + + + TDASD G Q+ +F S + N+
Sbjct: 470 DLVTSEPVLKAPDQDKPYELETDASDFALGGQLGQRDDQGRLHPVAFFSKKLHGPELNYG 529
Query: 805 INKKEMFAVHQALS--LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
I+ KE+ A+ + + + + V +D++ + S+L TK L+ + +
Sbjct: 530 IHDKELMAIIECFKEWRHYLIGAKHQIKVYTDHKNLTSFL----TTKDLN--KRQIRWYE 583
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSRSKSLP-DWHLSRS-------------ATEQIFL 908
D+ I+ + G+ N AD+LSR + L D +S + T +I +
Sbjct: 584 TLTDYDFEII--YRKGSENGRADALSRREDLKSDEQVSNAPLLRATKDGNLVLGTREINM 641
Query: 909 KWGVPCIDLFASRVSAVVPN 928
W V + + R+++ + N
Sbjct: 642 TWQVKPDETWMHRIASCINN 661
>gi|326672260|ref|XP_003199625.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1291
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 183/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 349 PKGKLYSLSIPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 406
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V +K H+ A
Sbjct: 407 LNSITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRMKQGHEWKTAFLTPRGHFE 466
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ ++ + + +
Sbjct: 467 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSRSLQEHVQHVRRVLQ 523
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 524 RLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKVQ-----------AVVNWP 570
Query: 700 L-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNA 758
+ +S ++L +L FA+F RR R S +L AP LT + + P W + A
Sbjct: 571 IPESRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSSKTP-FRWSIAA 618
Query: 759 LPLSSPIFPRQV------------QHFISTDASDLGWGSQVD-----------SSFLSGL 795
S + R V Q + DAS++G G+ + +F S
Sbjct: 619 QAAFSNLKSRFVSAPILVTPDPSRQFVVEVDASEVGVGAILSQRAASDDRIHPCAFFSHR 678
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K LS
Sbjct: 679 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLSS 734
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N D+LSR
Sbjct: 735 RQARWALFFGRFDFTI----SYRPGSKNIKPDALSR 766
>gi|326680323|ref|XP_003201497.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1221
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 112/453 (24%), Positives = 189/453 (41%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P S AM+ +IQE LE G+++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 284 LSQPESEAMNSYIQEELEKGLIR--PSTSPAAAGFFFVKKKDGNLRPCIDYRGLNEITVK 341
Query: 528 KKFSLINHFRIPSFLQ---KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L+ + +DL AY + IK + S T +
Sbjct: 342 YRYPLP---LVPAALEQLRQAKIYTKLDLRSAYNLIQIKQGDEWKAGFSTTRGHYEYTVM 398
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N V + + V+VY+DD L+ + + I ++ L I
Sbjct: 399 PFGLANSPSVFQAFMNDVFRDMLDQW--VIVYIDDILIYSNTVEEHIQHVRAVLQRLIHH 456
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-D 701
L +K V FLG ++ + +T+ R + A + W L
Sbjct: 457 HL--YAKFEKCEFHLTSV-SFLG----------YIISAEGVTMDE--RKVTAVQEWPLPQ 501
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL 761
+ + L +L FA+F RR R S L AP LT + KL W A+
Sbjct: 502 TLKQLQRFLGFANFY--------RRFIRNFST--LAAP-LTSMTKRSHSKLIWQPEAIQA 550
Query: 762 SSPIFPR------------QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSR 798
S + R ++ + DAS+ G G+ + +F S +
Sbjct: 551 FSVLKERFTSAPVLRHPNPELPFVVEVDASNTGVGAVLSQRQGFPEKMYPCAFFSRKLNS 610
Query: 799 EQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
++N+ + +E+ A+ AL L+ + V +D++ + YLR K L+
Sbjct: 611 AERNYDVGNRELLAIKLALEEWRHWLEGATFPFTVLTDHKN-LEYLR---TAKRLNPRQA 666
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N+ AD+LSR
Sbjct: 667 RWALFFT----RFNFTVTYRPGSKNTKADALSR 695
>gi|301614205|ref|XP_002936571.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1232
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 115/459 (25%), Positives = 189/459 (41%), Gaps = 44/459 (9%)
Query: 449 YAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG 508
Y P P P L+ P + AM +I E LE G ++ +S G + F V K
Sbjct: 212 YDCPIDLIPGSTPRGRTYPLSLPEAQAMREYISENLEGGFIRPSNSPAG--AGFFFVGKK 269
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
+GG RP ++ +GLN+ ++ L + ++ + +DL AY + I+ +
Sbjct: 270 DGGLRPCIDYRGLNKITIKNRYPLPLISELFDRVKGANIYTKLDLRGAYNLIRIREGDEW 329
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
A + +PFGL AP F N + L G+ VVVYLDD L+ + +
Sbjct: 330 KTAFNTRDGHYEYLVMPFGLCNAPAVFQEFVNDIFRDL--LGVFVVVYLDDILIFSSNLS 387
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI 688
K + L L+K + V QFLG H+ L D +
Sbjct: 388 DHRSHVKEVLRRLRENNLYAKLEKCTFEVNSV-QFLGF----HISSKGLEMDPEK----- 437
Query: 689 LRTLLASKTWNLD-SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
+R +L W S R+ +L FA++ + S + L + GA + A
Sbjct: 438 VRAVL---DWTQPLSLRATQRFLGFANYYRQFIKNFSLIVAPITDLTKKGADPSLWSSEA 494
Query: 748 VLPKLEWWLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVD-----------SSFLS 793
V + +L +S+PI P FI DAS++G G+ + +F S
Sbjct: 495 V--QAFNFLKKEFVSAPILRHPDTALPFIVEVDASEVGAGAVLSQRHPLTNKLHPCAFFS 552
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSL 851
+S + N+ I +E+ A+ A LL+ + V V +D++ ++ Y+ K L
Sbjct: 553 KKFSPSEANYDIGNRELLAIKWAFEEWRHLLEGAKHAVSVFTDHKNLL-YIE---SAKRL 608
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ +F R + + PG+ N+ AD+LSRS
Sbjct: 609 NPRQARWALFFS----RFNFSITYRPGSKNTKADALSRS 643
>gi|301607492|ref|XP_002933346.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1007
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 109/448 (24%), Positives = 189/448 (42%), Gaps = 61/448 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G +++ +S G + F V K +GG RP ++ +GLN+
Sbjct: 140 LSIPETQAMKDYIQENLSKGFIRKSNSPAG--AGFFFVQKKDGGLRPCIDYRGLNKITIE 197
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + L +DL AY + I+ + A + +PFG
Sbjct: 198 NRYPLPLIPELFDRLNGAKIFTKLDLRGAYNLIRIRHGDEWKTAFNTRDGHYEYLVMPFG 257
Query: 588 LATAPQAFASLSNWV-ASLLRSRGMRVVVYLDDFLLVNQD-----PRILEIQGKLAVSIL 641
L AP F N + +L S VVVYLDD L + + ++ L V+ L
Sbjct: 258 LCNAPAVFQDFINDIFRDILFS---YVVVYLDDILFFSSSLPEHIDHVKQVLHHLRVNHL 314
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD 701
+ + KS +S FLG + RM + + +L W
Sbjct: 315 FAKIEKCDFHKSEVS------FLGYVISSSGFRM-----DPVKVSAVLE-------WPPP 356
Query: 702 SA-RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNAL 759
+ +++ +L FA+F + S+ + +L + G + + A KL+ +
Sbjct: 357 AGLKAIQRFLGFANFYRRFIKGFSQIVAPITALTKKGVKDVWSSEAQAAFEKLKAAFCSA 416
Query: 760 PLSSPIFPRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINK 807
P+ I P + FI DASD+G G+ + ++ S +S ++N+ +
Sbjct: 417 PVL--IHPVPTRPFILEVDASDVGVGAILSQRPSFQDSLHPCAYFSRKFSAAERNYDVGN 474
Query: 808 KEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
+E+ A+ AL LL+ S S+ T+++ K+L +S+ +++ W
Sbjct: 475 RELLAIKLALEEWRHLLEGS-----SEPVTILT------DHKNLEYISDAKRLNPRQARW 523
Query: 868 -----RIHILAQFIPGAYNSVADSLSRS 890
R + L F PG+ N AD+LSRS
Sbjct: 524 TLFFSRFNFLISFRPGSKNIKADALSRS 551
>gi|301632883|ref|XP_002945509.1| PREDICTED: hypothetical protein LOC100485523, partial [Xenopus
(Silurana) tropicalis]
Length = 1202
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/267 (29%), Positives = 118/267 (44%), Gaps = 25/267 (9%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
+G M D+ A+ +PI L + G CLP G A + +AF++
Sbjct: 905 RGALMAKADIESAFRLLPIHPECHHLLGCWFEGAYFVDLCLPMGCAISCAHFEAFSTFLE 964
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
WV + RS VV YLDDF V Q + L + S G + K+ PA
Sbjct: 965 WVVKV-RSGCRSVVHYLDDFFCVGQAKSDTCFHLLDTLR-EVTASFGVPLAADKTE-GPA 1021
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
V++FLG+ D LP K L + + +L K L +S+LG L+FA VIP
Sbjct: 1022 TVMRFLGLEIDSVAGECRLPTQKVADLMHEVGSLRRDKKATLQRLQSVLGKLNFACRVIP 1081
Query: 719 MGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWW------LNALPL--SSPIFPRQ 769
+GR+ SRR+ + + R AP H I V L W N L +S ++
Sbjct: 1082 VGRVFSRRLAQATAGAR--APHHHVRITKEVKADLGVWEAFLADFNGRVLFRASETTAQE 1139
Query: 770 VQHFISTDASDLGWGSQVDSSFLSGLW 796
++ + TDA+ GS+ ++L+G W
Sbjct: 1140 LELY--TDAA----GSKGFGAYLAGRW 1160
>gi|326674090|ref|XP_003200067.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1326
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 123/456 (26%), Positives = 186/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 426 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 483
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 484 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 543
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 544 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 600
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 601 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVDWP 647
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + +P W
Sbjct: 648 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSSKMP-FRWSSAA 695
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG---SQVDSS--------FLSGL 795
L +S+PI P + F + DAS++G G SQ SS + S
Sbjct: 696 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYFSHR 755
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 756 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 811
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 812 RQARWALFF----GRFNFTISYRPGSKNIKPDALSR 843
>gi|46194168|tpg|DAA01994.1| TPA_exp: polyprotein [Danio rerio]
Length = 1119
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/452 (20%), Positives = 201/452 (44%), Gaps = 50/452 (11%)
Query: 482 EMLETGVLKRLDSTTGFLSRLFLVP---------------KGNGGTRPVLNLKG------ 520
E+++ + K +D+ F+ FL P K +G R +++L
Sbjct: 645 EVVDQLIKKEIDN--NFMIGPFLAPPFRVYRISPIGIATRKFSGKKRLIIDLSSPHNSCF 702
Query: 521 --LNQFLSPKKFSLINH-----FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
+N + P++++L H + + + ++ +D+S A+ +P+ + ++
Sbjct: 703 SSINSIIPPEEYALNYHDIDQAISLIKLVGRNAWLAKVDISSAFKVMPLHPDYWHLFGIN 762
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYL-DDFLLVNQDPRILE 631
+ L FG ++P+ F LS + +L + G+ +++L DDFL+++
Sbjct: 763 WRSKFYFAVRLTFGCRSSPKIFDMLSEAICWILSNNYGIAHILHLLDDFLIISPPSNPAT 822
Query: 632 IQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT 691
+ ++ +LG + +K+S P L+FLGI D + + LP++K + +
Sbjct: 823 EHLTITKTVFDNLGIPLAEEKTS-GPGTSLEFLGIKLDSNKFQASLPKEKIDRIIALSSI 881
Query: 692 LLASKTWNLDSARSLLGYLSFASFVIPMGR-LHSRRIQRQASLLRLGAPHLTPINPAVLP 750
L ++ + S+LG+L+FA +IP GR + +Q AS+ G ++
Sbjct: 882 FLENQNCSKRELLSILGHLNFAMRIIPQGRPFVTHLLQLAASVP--GLDDSLSLSDQCRH 939
Query: 751 KLEWWLNALP-------LSSPIFPRQVQHFISTDAS-DLGWGSQVDSSFLSGLWSREQQN 802
+L W++ L S + + + TDA+ +G+G + + W +
Sbjct: 940 ELSLWISFLKCWNGCSFFYSDLIESPIDIQLYTDAAPSIGFGGYYQGRWFASSWPHQMIE 999
Query: 803 W--HINKKEMFAVHQALSLNL---PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEV 857
H +F ++ ++ ++ +S ++V DN+ VV + ++ + S +L+ +
Sbjct: 1000 IPPHHQSSALFELYPLVAASILWGDEWSASSILVHCDNEAVVQCINKRR-SHSPALMPLL 1058
Query: 858 EKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
++ S + I A+ +PG +N +ADSLSR
Sbjct: 1059 RRLIWTSAKKQFIITAKHVPGFHNQIADSLSR 1090
>gi|326665532|ref|XP_003198065.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1240
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 113/458 (24%), Positives = 186/458 (40%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 334 PKGRLFSLSGPEREAMDRYINESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 391
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 392 LNDITIKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 451
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 452 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSLKVHTQHVRQVLQR 509
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLASKT 697
L V +K V FLG + ++ G I + A
Sbjct: 510 LLENQLYVKAEKCVFHVQSV-SFLGFI---------------ISAGEIQADPCKVKAVAE 553
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP LT + +P +W +
Sbjct: 554 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAP-LTALTSPKVP-FKWEV 601
Query: 757 NALP---------LSSPIFPR---QVQHFISTDASDLGWG------SQVDS-----SFLS 793
+A +S+P+ + Q + DASD+G G S++D +F S
Sbjct: 602 DAQEAFDKLKSRFVSAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSRLDGKVHPCAFFS 661
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
+ ++N+ I +E+ AV AL L+ + +V +D++ + Y+R + L
Sbjct: 662 HRLNPSERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SARRL 717
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 718 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 751
>gi|326674202|ref|XP_003200091.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1162
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 121/456 (26%), Positives = 186/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 345 PKGKLYSLSIPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 402
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V +K H+ A
Sbjct: 403 LNSITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRMKQGHEWKTAFLTPRGHFE 462
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ ++ + + +
Sbjct: 463 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSRSLQEHVQHVRRVLQ 519
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 520 RLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKVQ-----------AVVNWP 566
Query: 700 L-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
+ +S ++L +L FA+F RR R S +L AP LT + A P W
Sbjct: 567 IPESRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSAKTP-FRWSSAA 614
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGL 795
L + +S+PI P + F + DAS++G G+ + +F S
Sbjct: 615 QVAFTKLKSRFVSAPILVTPDPARQFVVEVDASEVGVGAILSQRAASDDRIHPCAFFSHR 674
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 675 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 730
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N D+LSR
Sbjct: 731 RQARWALFFGRFDFTI----SYRPGSKNIKPDALSR 762
>gi|326671387|ref|XP_003199424.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
gi|326671391|ref|XP_003199426.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1402
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 113/468 (24%), Positives = 190/468 (40%), Gaps = 89/468 (19%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P S+ L+ P +AM +I+E L G+++ S G + F V K +GG RP ++ +G
Sbjct: 499 PRGSIFSLSLPERTAMDDYIEESLAAGIIRPSTSPAG--AGFFFVGKKDGGLRPCIDYRG 556
Query: 521 LNQFLSPKKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
LN K ++ N + +P LQ+ +DL AY V IK + A +
Sbjct: 557 LN------KITIRNRYPLPLMSTAFEILQEASIFTKLDLRNAYHLVRIKRGDEWKTAFNT 610
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
+PFGL AP F +L N V + ++ V VYLDD L+ + +
Sbjct: 611 PTGHYEYLVMPFGLTNAPAVFQALINDVLRDMLNKF--VFVYLDDILIFSSSLQEHVHHV 668
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ + L V +K + V +FLG + P +M D Q + A
Sbjct: 669 RKVLHRLLENHLYVKPEKCQFHVSQV-KFLGFVIQPGQIQM----DPQ--------KVQA 715
Query: 695 SKTW----NLDSARSLLGY--------LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT 742
W ++ + LG+ L+F++ V P+ L ++ + R G
Sbjct: 716 MADWPSPSSVKEVQRFLGFANFYRKFILNFSTVVAPLSALTKEKV----AGFRWG----- 766
Query: 743 PINPAVLPKLEWWLNALP---LSSPIF--PRQVQHF-ISTDASDLGWGSQVD-------- 788
P+ E N L S+PI P + F + DAS++G G+ +
Sbjct: 767 -------PEAEKAFNELKKRFTSAPILLIPNPDKPFTVEVDASEVGIGAVLSQRGEDNKL 819
Query: 789 --SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG 846
+FLS + ++N+H+ +E+ AV AL L+ S + Q + +
Sbjct: 820 HPCAFLSHRLTPTERNYHVGDRELLAVKLALEEWRHWLEGS----KHQFQVLTDH----- 870
Query: 847 GTKSLSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
K+L + + +++ W R H + PG+ N D+LSR
Sbjct: 871 --KNLEYVQQAKRLNPRQARWSLFFNRFHFTLTYRPGSKNLKPDALSR 916
>gi|326676554|ref|XP_003200607.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1249
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 111/458 (24%), Positives = 182/458 (39%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P A+ +I E L+ +++ S G + F V K +G RP ++ +G
Sbjct: 334 PKGRLFSLSGPEREAIDRYINESLKAELIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 391
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 392 LNDITIKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIRKGDEWKTAFNTPTGHFE 451
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + P++ + +
Sbjct: 452 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSPQVHTQHVRQVLQR 509
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLASKT 697
L V +K A + FLG + ++ G I + A
Sbjct: 510 LLENQLYVKAEKCVFH-AQSVPFLGFI---------------ISAGEIQADPCKVRAVAE 553
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP +P V +W
Sbjct: 554 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKV--PFKWKA 601
Query: 757 NALP---------LSSPIF---PRQVQHFISTDASDLGWGSQVD-----------SSFLS 793
+A +S+P+ + Q + DASD+G G+ + +F S
Sbjct: 602 DAQEAFDKLKSRFISAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSCLDGKVHPCAFFS 661
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
S ++N+ I +E+ AV AL L+ + +V +D++ + Y+R K L
Sbjct: 662 HRLSPSERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SAKRL 717
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 718 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 751
>gi|326667605|ref|XP_003198633.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1388
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 113/458 (24%), Positives = 186/458 (40%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 412 PKGRLFSLSGPEREAMDRYINESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 469
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 470 LNDITIKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 529
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 530 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSLKVHTQHVRQVLQR 587
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLASKT 697
L V +K V FLG + ++ G I + A
Sbjct: 588 LLENQLYVKAEKCVFHVQSV-SFLGFI---------------ISAGEIQADPCKVKAVAE 631
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP LT + +P +W +
Sbjct: 632 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAP-LTALTSPKVP-FKWEV 679
Query: 757 NALP---------LSSPIF---PRQVQHFISTDASDLGWG------SQVDS-----SFLS 793
+A +S+P+ + Q + DASD+G G S++D +F S
Sbjct: 680 DAQEAFDKLKSRFVSAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSRLDGKVHPCAFFS 739
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
+ ++N+ I +E+ AV AL L+ + +V +D++ + Y+R + L
Sbjct: 740 HRLNPSERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SARRL 795
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 796 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 829
>gi|326670517|ref|XP_003199232.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1302
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 112/458 (24%), Positives = 184/458 (40%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 334 PKGRLFSLSGPEREAMDRYINESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 391
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 392 LNDITIKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 451
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 452 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSLKVHTQHVRQVLQR 509
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLASKT 697
L V +K V FLG + ++ G I + A
Sbjct: 510 LLENQLYVKAEKCVFHVQSV-SFLGFI---------------ISAGEIQADPCKVKAVAE 553
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP +P V +W +
Sbjct: 554 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKV--PFKWEV 601
Query: 757 NALP---------LSSPIF---PRQVQHFISTDASDLGWG------SQVD-----SSFLS 793
+A +S+P+ + Q + DASD+G G S++D +F S
Sbjct: 602 DAQEAFDKLKSRFVSAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSRLDGKVHPCAFFS 661
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
+ ++N+ I +E+ AV AL L+ + +V +D++ + Y+R + L
Sbjct: 662 HRLNPSERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SARRL 717
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 718 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 751
>gi|326677098|ref|XP_003200757.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1274
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 117/457 (25%), Positives = 182/457 (39%), Gaps = 49/457 (10%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 369 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 426
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 427 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 486
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 487 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQ 543
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q + W
Sbjct: 544 RLLENGLFVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQAVV-----------DWP 590
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLN 757
DS ++L +L FA+F R S+ SL P T A KL+
Sbjct: 591 TPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTSLTSSKTPFRWTSAAEAAFSKLKSCFV 650
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
+ P+ P + Q + DAS++G G+ + ++ S S ++N+ I
Sbjct: 651 SAPILIAPDPSR-QFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSPAERNYDIG 709
Query: 807 KKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ S V +V +D++ + Y+R K L+ +F
Sbjct: 710 NRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNSRQARWALFF-- 763
Query: 865 QDWRIHILAQFIPGAYNSVADSLSR------SKSLPD 895
R + + PG+ N D+LSR KS PD
Sbjct: 764 --GRFNFTISYRPGSKNIKPDALSRLFDPSDHKSSPD 798
>gi|326671771|ref|XP_003199522.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1297
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 112/458 (24%), Positives = 184/458 (40%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 334 PKGRLFSLSGPEREAMDRYINESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 391
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 392 LNDITIKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 451
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 452 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSLKVHTQHVRQVLQR 509
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT--- 697
L V +K V FLG + ++ G I K+
Sbjct: 510 LLENQLYVKAEKCVFHVQSV-SFLGFI---------------ISAGEIQADPCKVKSVAE 553
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP +P V +W +
Sbjct: 554 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKV--PFKWEV 601
Query: 757 NALP---------LSSPIF---PRQVQHFISTDASDLGWG------SQVDS-----SFLS 793
+A +S+P+ + Q + DASD+G G S++D +F S
Sbjct: 602 DAQEAFDKLKSRFVSAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSRLDGKVHPCAFFS 661
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
+ ++N+ I +E+ AV AL L+ + +V +D++ + Y+R + L
Sbjct: 662 HRLNPSERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SARRL 717
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 718 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 751
>gi|326665002|ref|XP_003197931.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1424
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 113/468 (24%), Positives = 190/468 (40%), Gaps = 89/468 (19%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P S+ L+ P +AM +I+E L G+++ S G + F V K +GG RP ++ +G
Sbjct: 499 PRGSIFSLSLPERTAMDDYIEESLAAGIIRPSTSPAG--AGFFFVGKKDGGLRPCIDYRG 556
Query: 521 LNQFLSPKKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
LN K ++ N + +P LQ+ +DL AY V IK + A +
Sbjct: 557 LN------KITIRNRYPLPLMSTAFEILQEASIFTKLDLRNAYHLVRIKRGDEWKTAFNT 610
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
+PFGL AP F +L N V + ++ V VYLDD L+ + +
Sbjct: 611 PTGHYEYLVMPFGLTNAPAVFQALINDVLRDMLNKF--VFVYLDDILIFSSSLQEHVHHV 668
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ + L V +K + V +FLG + P +M D Q + A
Sbjct: 669 RKVLHRLLENHLYVKPEKCQFHVSQV-KFLGFVIQPGQIQM----DPQ--------KVQA 715
Query: 695 SKTW----NLDSARSLLGY--------LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT 742
W ++ + LG+ L+F++ V P+ L ++ + R G
Sbjct: 716 MADWPSPSSVKEVQRFLGFANFYRKFILNFSTVVAPLSALTKEKV----AGFRWG----- 766
Query: 743 PINPAVLPKLEWWLNALP---LSSPIF--PRQVQHF-ISTDASDLGWGSQVD-------- 788
P+ E N L S+PI P + F + DAS++G G+ +
Sbjct: 767 -------PEAEKAFNELKKRFTSAPILLIPNPDKPFTVEVDASEVGIGAVLSQRGEDNKL 819
Query: 789 --SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG 846
+FLS + ++N+H+ +E+ AV AL L+ S + Q + +
Sbjct: 820 HPCAFLSHRLTPTERNYHVGDRELLAVKLALEEWRHWLEGS----KHQFQVLTDH----- 870
Query: 847 GTKSLSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
K+L + + +++ W R H + PG+ N D+LSR
Sbjct: 871 --KNLEYVQQAKRLNPRQARWSLFFNRFHFTLTYRPGSKNLKPDALSR 916
>gi|326680728|ref|XP_003201603.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1445
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 120/467 (25%), Positives = 187/467 (40%), Gaps = 77/467 (16%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+TG+++ S G + F V K +G RP ++ +G
Sbjct: 514 PKGHLFSLSGPEREAMDRYINESLKTGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 571
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 572 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 631
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + L+I + +
Sbjct: 632 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSP---CLQIHIQHVRQV 686
Query: 641 LGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLA 694
L L V +K A + FLG + ++ G I + A
Sbjct: 687 LQRLLENQLYVKAEKCVFH-AQSIPFLGFI---------------ISAGEIQADPCKIRA 730
Query: 695 SKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
W DS ++L +L FA+F RR R + ++ AP +P V K
Sbjct: 731 VAEWPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKVWFK-- 778
Query: 754 WW----------LNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVD-----------S 789
W L + +S+P+ P Q FI DASD+G G+ +
Sbjct: 779 -WNSDAQEAFDELKSRFVSAPVLSIPDPEQQFIVEVDASDVGVGAVLSQRSCLDGKVHPC 837
Query: 790 SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGG 847
+F S + ++N+ + +E+ AV AL L+ + +V +D++ + Y+
Sbjct: 838 AFFSHRLNPSERNYDVGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYI----- 891
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
S L+ + + L D R F PG N D+LSR +P
Sbjct: 892 -HSARRLTPRQARWALFFD-RFKFTLSFRPGTKNVKPDALSRLFEVP 936
>gi|326675746|ref|XP_003200420.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1391
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 113/458 (24%), Positives = 186/458 (40%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 412 PKGRLFSLSGPEREAMDRYINESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 469
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 470 LNDITIKDRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 529
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 530 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSLKVHTQHVRQVLQR 587
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLASKT 697
L V +K V FLG + ++ G I + A
Sbjct: 588 LLENQLYVKAEKCVFHVKSV-SFLGFI---------------ISAGEIQADPCKVKAVAE 631
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP LT + +P +W +
Sbjct: 632 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAP-LTALTSPKVP-FKWEV 679
Query: 757 NALP---------LSSPIF---PRQVQHFISTDASDLGWG------SQVDS-----SFLS 793
+A +S+P+ + Q + DASD+G G S++D +F S
Sbjct: 680 DAQEAFDKLKSRFVSAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSRLDGKVHPCAFFS 739
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
+ ++N+ I +E+ AV AL L+ + +V +D++ + Y+R + L
Sbjct: 740 HRLNPSERNYDIGNRELLAVRLALGEWHHWLEGAAQPFLVWTDHKN-LEYIR---SARRL 795
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 796 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 829
>gi|326668075|ref|XP_003198730.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1175
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 120/460 (26%), Positives = 184/460 (40%), Gaps = 55/460 (11%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 369 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 426
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 427 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 486
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + LE +
Sbjct: 487 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHS---LEEHIQHVRR 540
Query: 640 ILGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASK 696
+L L G V +K A +QFLG + RM PE Q +
Sbjct: 541 VLQRLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQAVVD---------- 588
Query: 697 TW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEW 754
W DS ++L +L FA+F R S+ SL P T A KL+
Sbjct: 589 -WPTPDSRKALQRFLGFANFYRRFIRNFSQLAVPLTSLTSSKTPFRWTSAAEAAFSKLKS 647
Query: 755 WLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNW 803
+ P+ P + Q + DAS++G G+ + ++ S S ++N+
Sbjct: 648 CFVSAPILIAPDPSR-QFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSLAERNY 706
Query: 804 HINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
I +E+ AV AL L+ S V +V +D++ + Y+R K L+ +F
Sbjct: 707 DIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNSRQARWALF 762
Query: 862 LLSQDWRIHILAQFIPGAYNSVADSLSR------SKSLPD 895
R + + PG+ N D+LSR KS PD
Sbjct: 763 F----GRFNFTISYRPGSKNIKPDALSRLFDPSDHKSSPD 798
>gi|301630801|ref|XP_002944505.1| PREDICTED: hypothetical protein LOC100494531 [Xenopus (Silurana)
tropicalis]
Length = 1075
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 101/394 (25%), Positives = 163/394 (41%), Gaps = 36/394 (9%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ Q L + G CLP G + + F + S ++
Sbjct: 692 GALMAKADIESAFRLLPVHKESQHLLGCFFKGSYYVDRCLPMGCSISCSYFEAFSTFLEW 751
Query: 605 LLRSR-GMRVVV-YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN------LQKSSLS 656
++R + G+ V+ YLDDFL V L +L +L W+ + +
Sbjct: 752 VVRKQAGVNTVIHYLDDFLCVGPG------NSGLCAVLLQTLQWVADQFGVPLAGDKTEG 805
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDK-QLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
P L+FLGI D LP DK QL G + L A K L +SL+G L+FA
Sbjct: 806 PTTCLKFLGIEIDTVSRECRLPPDKVQLLKGEVEYALRAKKV-TLKQLQSLIGRLNFACR 864
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ-VQHF 773
+IPMGR+ +R + + R H ++ + L W L + + RQ V+
Sbjct: 865 IIPMGRVFARALAMATAGARR-PHHYIRLSQELKEDLAVWRVFLQDFNGRSYWRQEVRDN 923
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSRE--QQNW-------HINKKEMFAVHQALSLNLPLL 824
+ G+ ++ G W Q W ++ E+F + A+ L LL
Sbjct: 924 REINLFTDAAGAGGFGAYYEGRWCAAPWPQEWVELKLTNNLTFLELFPIVVAIELWGHLL 983
Query: 825 QSSVVMVQSDNQ-TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
+ V+ +DN T ++ G+K + L + L + + A+ +PG N +
Sbjct: 984 ANKTVLFHTDNMATALAINNLTSGSKPVLRLLRHLVLRCLQIN--VSFRAKHLPGTTNEI 1041
Query: 884 ADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDL 917
AD+LSR + L+ A Q G PC +L
Sbjct: 1042 ADALSRFQWERFRRLAPGAVAQ-----GDPCPEL 1070
>gi|326678302|ref|XP_003201038.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1217
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 303 PKGKLYSLSIPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 360
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V +K H+ A
Sbjct: 361 LNSITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRMKQGHEWKTAFLTPRGHFE 420
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ ++ + + +
Sbjct: 421 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSRSLQEHVQHVRRVLQ 477
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 478 RLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKVQ-----------AVVNWP 524
Query: 700 L-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
+ +S ++L +L FA+F RR R S +L AP LT + A P W
Sbjct: 525 IPESRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSAKTP-FRWSSAA 572
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGL 795
L + +S+PI P + F + D S++G G+ + +F S
Sbjct: 573 QVAFTKLKSRFVSAPILVTPDPARQFVVEVDTSEVGMGAILSQRAASDDRIHPCAFFSHR 632
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 633 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 688
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N D+LSR
Sbjct: 689 RQARWALFFGRFDFTI----SYRPGSKNIKPDALSR 720
>gi|326677013|ref|XP_003200730.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1147
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 121/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 348 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 405
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 406 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 465
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 466 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 522
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q + W
Sbjct: 523 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQAVVD-----------WP 569
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + +P W
Sbjct: 570 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSSKMP-FRWSSAA 617
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG---SQVDSS--------FLSGL 795
L +S+PI P + F + D S++G G SQ SS + S
Sbjct: 618 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDVSEVGVGAILSQRSSSDGKIHPCAYFSHR 677
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 678 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 733
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 734 RQARWALFF----GRFNFTISYRPGSKNIKPDALSR 765
>gi|432912293|ref|XP_004078859.1| PREDICTED: uncharacterized protein LOC101161832 [Oryzias latipes]
Length = 1807
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 117/457 (25%), Positives = 184/457 (40%), Gaps = 65/457 (14%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L +L+ P AM +IQE L +G ++ S G + F V K + RP ++ +
Sbjct: 302 LPKGRLFNLSGPEKVAMEKYIQEALSSGHIRPSSSPAG--AGFFFVEKKDKSLRPCIDYR 359
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LNQ K+SL + +Q +DL AY V IK + A
Sbjct: 360 ELNQITIKDKYSLPLLSSVFDSIQGARIFSKLDLRNAYHLVRIKEGDEWKTAFKTPLGHY 419
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAV 638
+PFGL AP F L N +LR V VYLDD L+ ++D E + +
Sbjct: 420 EYLVMPFGLTNAPAVFQRLVN---DVLRDFLNFFVFVYLDDILVYSKDISQHESHVRSVL 476
Query: 639 SILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW 698
L V +K + V FLG +++ R P+ ++ A W
Sbjct: 477 QRLAENHLFVKAEKCAFHTTSV-PFLGYIFEAGSIR---PDPAKIE---------AVSQW 523
Query: 699 NLDSARSLL-GYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW--- 754
+ R L +L FA+F RR R S + AP LT + P W
Sbjct: 524 EPPTNRKKLQQFLGFANFY--------RRFIRNYS--SIAAP-LTQLTSVAKP-FSWNAT 571
Query: 755 ------WLNALPLSSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSFLSG 794
L L +S+PI + Q + DASD G G + +F S
Sbjct: 572 AQSAFDHLKKLFVSAPILIQLDPDRQFIVEVDASDSGVGGVLSQREVGTNKLKPCAFFSK 631
Query: 795 LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLS 852
S ++N+ + +E+ A+ AL L+ + +V +D++ ++YLR +
Sbjct: 632 KLSPAERNYDVGNRELLAIKLALEEWRHWLEGAAHPFIVWTDHKN-LAYLR------TAK 684
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L+ + + L D R + + PG+ N+ D+LSR
Sbjct: 685 RLNSRQARWCLFFD-RFNFTITYRPGSRNTKPDALSR 720
>gi|326667016|ref|XP_003198453.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1296
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 114/464 (24%), Positives = 184/464 (39%), Gaps = 71/464 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I+E LE +++ S G + F V K +G RP ++ +G
Sbjct: 384 PKGRLYSLSAPEREAMDRYIRESLEADLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 441
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY + I+ + A +
Sbjct: 442 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFE 501
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ ++ ++ + +
Sbjct: 502 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILVFSESEQVHTQHVRQVLQR 559
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFL-------GIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
L V +K V FL GI DP R
Sbjct: 560 LLENQLYVKAEKCVFHSKSV-SFLGHIVSTEGIKADPAKVR------------------- 599
Query: 694 ASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQ-----ASLLRLGAPHLTPINPA 747
A W + DS ++L +L FA+F RR R A L L +P + I +
Sbjct: 600 AVAKWPVPDSRKALQRFLGFANFY--------RRFIRNFSSVAAPLTALTSPKVPFIWHS 651
Query: 748 VLPKLEWWLNALPLSSPIF----PRQVQHFISTDASDLGWGSQVD-----------SSFL 792
+ L + +++P+ P++ Q + DAS++G G+ + +F
Sbjct: 652 QAQEAFDVLKSRFITAPVLCLPDPKR-QFIVEVDASEVGIGAVLSQRSSRDGKVHPCAFF 710
Query: 793 SGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKS 850
S S ++N+ I +E+ AV AL L+ + +V +D++ + Y+R K
Sbjct: 711 SHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SAKR 766
Query: 851 LSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
LS +F R + + PG+ N D+LSR +P
Sbjct: 767 LSSRQARWALFF----GRFNFSLSYRPGSKNIKPDALSRLFDVP 806
>gi|301622867|ref|XP_002940748.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1646
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 112/441 (25%), Positives = 183/441 (41%), Gaps = 46/441 (10%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ +S G + F V K +GG RP ++ +GLN+
Sbjct: 289 LSLPEAQAMREYISENLERGFIRPSNSPAG--AGFFFVGKKDGGLRPCIDYRGLNKITVK 346
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ + +DL AY + I+ + A + +PFG
Sbjct: 347 NRYPLPLISELFDRVKGANIYTKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFG 406
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N + L G+ VVVYLDD L+ + + K + L
Sbjct: 407 LCNAPAMFQEFVNDIFRDL--LGVFVVVYLDDILIFSSNLGDHRSHVKEVLRRLRENNLY 464
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD-SARSL 706
L+K + V QFLG H+ L D + +R +L W S R+
Sbjct: 465 AKLEKCTFEVKSV-QFLGF----HISSKGLEMDPEK-----VRAVL---DWTQPLSLRAT 511
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKLEWWLNALPLSSPI 765
+L FA++ + S + L + GA P L P L +S+PI
Sbjct: 512 QRFLGFANYYRQFIKNFSLIVAPITDLTKKGADPSLWPSEAVQAFNF---LKKEFVSAPI 568
Query: 766 F--PRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKEMF 811
P FI DAS++G G+ + +F S +S + N+ I +E+
Sbjct: 569 LRHPDSALPFIVEVDASEVGAGAVLSQRHPLTNKLHPCAFFSKKFSPSEVNYDIGNRELL 628
Query: 812 AVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRI 869
A+ A LL+ + V V +D++ ++ Y+ K L+ +F R
Sbjct: 629 AIKWAFEEWRHLLEGAKHAVSVFTDHKNLL-YIE---SAKRLNPRQARWALFFT----RF 680
Query: 870 HILAQFIPGAYNSVADSLSRS 890
+ + PG+ N+ AD+LSRS
Sbjct: 681 NFSITYRPGSKNTKADALSRS 701
>gi|4775496|emb|CAB42622.1| putative polyprotein (aspartic proteinase, reverse transcriptase,
ribonuclease H) [Nicotiana tabacum]
Length = 636
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 97/446 (21%), Positives = 187/446 (41%), Gaps = 60/446 (13%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK----GNGGTRPVLNLKGLNQFLSPKK 529
+ +HI+E+L+ ++ +S + S F+V K G +R V++ + LN
Sbjct: 219 TEFKMHIKELLDNNYIQ--ESNSKHTSPAFIVNKHSEQKRGKSRMVIDYRNLNAKTKTYN 276
Query: 530 FSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
+ + N +Q +Y D ++H+ ++ ++ A + LPFG
Sbjct: 277 YPIPNKILKIRQIQGYNYFSKFDCKSGFYHLKLEDESKKLTAFTVPQGFYEWNVLPFGYK 336
Query: 590 TAPQAFAS-LSNWVASLLRSRGMRVVVYLDDFLLV----NQDPRILEIQGKLAVSILGSL 644
AP + + N+ L ++Y+DD LL N+ ++LE + I+
Sbjct: 337 NAPGRYQHFMDNYFNQL-----ENCIIYIDDILLYSRTENEHIKLLE----KFIHIVEIS 387
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA- 703
G ++ +K+ + + +FLGI D + G ++T + K NL+
Sbjct: 388 GISLSKKKAEVMKNQI-EFLGIQIDKN--------------GIKMQTHVVQKIINLNETL 432
Query: 704 ------RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLN 757
+S LG ++ IP +L Q L + H + + K++
Sbjct: 433 DTKKKLQSFLGLVNQVREYIP--KLAENLKPLQKKLKKDIEYHFDEKDKIHIQKIKNMCK 490
Query: 758 ALP-LSSPIFPRQVQHFISTDASDLGWGS----QVDSS-------FLSGLWSREQQNWHI 805
LP L P +Q + + TD+SD +G + D+ + SG ++ Q W I
Sbjct: 491 KLPKLYFPDEKKQFTYIVETDSSDHSYGGVLKYKYDNEKIEHHCRYYSGSYTEPQLKWEI 550
Query: 806 NKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
N+KE+F +++ L P + + +V++DN V ++ R+ + E+ ++ L Q
Sbjct: 551 NRKELFGLYKCLLAFEPYIVYNKFIVRTDNTQVKWWITRK--VQDSVTTKEIRRLVLNIQ 608
Query: 866 DWRIHILAQFIPGAYNSVADSLSRSK 891
++ I + I N +AD LSR +
Sbjct: 609 NFTFTI--EVIRTDKNVIADYLSRQR 632
>gi|301610293|ref|XP_002934698.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1244
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 109/441 (24%), Positives = 184/441 (41%), Gaps = 46/441 (10%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AMS +I E LE G ++ +S G + F V K +GG RP ++ + LN+
Sbjct: 361 LSLPEAQAMSEYISENLERGFIRPSNSPAG--AGFFFVGKKDGGLRPCIDYRWLNKITVK 418
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ + +DL AY + I+ + A + +PFG
Sbjct: 419 NRYPLPLISELFDRVKGANIYTKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFG 478
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N + L G+ VVVYLDD L+ + + K + L
Sbjct: 479 LCNAPAVFQEFVNDIFRDL--LGVFVVVYLDDILIFSSNLSDHRCHVKEVLRRLRENNLY 536
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN-LDSARSL 706
L+K + V QFLG H+ L D + + A W L S R+
Sbjct: 537 AKLEKCTFEVDSV-QFLGF----HISSKGLEMDPE--------KVSAVLDWTQLLSLRAT 583
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTP---INPAVLPKLEWWLNALPLS 762
+L FA++ + S + L + GA P L P + K E +++A L
Sbjct: 584 QRFLGFANYYRQFIKNFSLIVGPITDLTKKGADPTLWPSEAVQAFNFLKKE-FVSASILR 642
Query: 763 SPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKEMF 811
P + + DAS++G G+ + +F S +S + N+ I +E+
Sbjct: 643 HP--DTALPFVVEVDASEVGAGAVLSQRHPLTNKLHPCAFFSKKFSPSEANYDIGNRELL 700
Query: 812 AVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRI 869
A+ A LL+ + V V +D++ ++ Y+ K L+ +F +R
Sbjct: 701 AIKWAFEEWRNLLEGAKHAVSVFTDHKNLL-YIE---SAKRLNPRQARWALFF----YRF 752
Query: 870 HILAQFIPGAYNSVADSLSRS 890
+ + PG+ N+ AD+ SRS
Sbjct: 753 NFSITYRPGSKNTKADAFSRS 773
>gi|301624145|ref|XP_002941367.1| PREDICTED: hypothetical protein LOC100491301 [Xenopus (Silurana)
tropicalis]
Length = 784
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 158/371 (42%), Gaps = 39/371 (10%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
+G +M D+ A+ +P+ L + LP G + + + F+S
Sbjct: 397 RGAWMAKSDIQSAFRLLPVHPECFHLLGCHFMCFYFVDMSLPMGCSISCDYFEVFSSFLE 456
Query: 601 WVASLLRSRGMRVVV-YLDDFLLV---NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
WV + G+ + YLDDFL + + D L + AV+ G + +K+
Sbjct: 457 WVTC--QQAGLHSTLHYLDDFLFLGPAHTDTCNLLLNTFRAVT--SEFGVPLAEEKTE-G 511
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL-RTLLASKTWNLDSARSLLGYLSFASF 715
P + FLGI D LP +K +TLGN++ RT +A K L ++LLG+ FA
Sbjct: 512 PVRKITFLGIEIDSQEMVFRLPPEKLVTLGNLIDRTKMAKKV-TLKHIQTLLGHFVFACK 570
Query: 716 VIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ--- 771
VIPMGR RR+ ++ AP H I+ + L W L + Q
Sbjct: 571 VIPMGRPFCRRLSLATKGIK--APHHYIRISKPIKEDLGVWQQFLLEYNGRTCWQENDRS 628
Query: 772 ----HFISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALS 818
+ A+ +G G ++ G W E+ Q W ++ E+F + A+
Sbjct: 629 NSELQLFTDAAASIGMG-----AYFRGQWCAEKWPQAWRAMDLIRNLKFLELFPILVAIH 683
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
L L + V+ DN +VV + Q + SL +L+ + + L I A+ +PG
Sbjct: 684 LWGDHLANHRVIFWCDNLSVVHVINHQTSS-SLPVLALLRDLILCCLKQNIWFRAKHVPG 742
Query: 879 AYNSVADSLSR 889
N +AD+LSR
Sbjct: 743 VDNCLADALSR 753
>gi|326665990|ref|XP_003198169.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1224
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 115/455 (25%), Positives = 196/455 (43%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 421 LSQPETEAMKNYISEELEKGFIR--PSTSPASAGFFFVKKNDGSLRPCIDYRGLNEITVK 478
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 479 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 535
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 536 PFGLANSPSVFQAFVNEIFRDMLNK--LVIVYIDDILVYSNSLSEHIQHVRAVLKRLIQN 593
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W +
Sbjct: 594 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPH 636
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 637 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 685
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + FL +SR+
Sbjct: 686 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRFLVNKKLHPCAFYSRKL 745
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 746 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 801
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 802 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 832
>gi|326680555|ref|XP_003201549.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1445
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 116/478 (24%), Positives = 187/478 (39%), Gaps = 100/478 (20%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P +AM ++ E L G+++ S G + F V K +G RP ++ +G
Sbjct: 496 PRGRLFSLSAPERAAMDKYLTESLAAGIIRHSSSPAG--AGFFFVKKKDGSLRPCIDYRG 553
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ + +DL AY V ++ + A +
Sbjct: 554 LNDITIKNRYPLPLMSSAFDLLQGARFFTKLDLRNAYHLVRMREGDEWKTAFNTPTGHFE 613
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQD------------P 627
LPFGL AP F +L N +LR V VYLDD L+ +
Sbjct: 614 YLVLPFGLTNAPAVFQALVN---DVLRDMINQFVFVYLDDILIFSSTMQEHVQHVRRVLQ 670
Query: 628 RILEIQ-------GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
R+LE Q K V + LG I++++ + PA V W P
Sbjct: 671 RLLENQLYVKAEKCKFHVQSVSFLGHIISVEGLRMDPAKVRAVSD--WPPP--------- 719
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
DS ++L +L FA+F RR R + R+ AP
Sbjct: 720 --------------------DSRKALQRFLGFANFY--------RRFIR--NFGRVAAP- 748
Query: 741 LTPINPAVLPKLEW---------WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD 788
LT + + + W L +L S+PI P + F + DAS++G G+ +
Sbjct: 749 LTALTSTRI-RFGWSVAAQTAFDHLKSLFTSAPILITPDPARQFVVEVDASEVGVGAVLS 807
Query: 789 ----------SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQ 836
++ S S ++N+ + +E+ AV AL L+ + V +V +D++
Sbjct: 808 QTAQDNKLHPCAYFSHCLSPTERNYDVGNRELLAVRLALGEWRHWLEGAAVPFLVWTDHR 867
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
+ Y++ K L+ +F R + + PG+ NS D+LSR P
Sbjct: 868 N-LQYIQ---TAKRLNARQARWALFF----GRFNFTLSYRPGSKNSKPDALSRCFGSP 917
>gi|326671149|ref|XP_003199371.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1319
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 135/539 (25%), Positives = 211/539 (39%), Gaps = 84/539 (15%)
Query: 387 EPPGR---VSLKVQTLQKPQRCSSPVN-PPADSRIGAEL-----VGGRLRRFVDAWIRLG 437
EPP + L + T+ KPQ +N PP SR+ E V + R
Sbjct: 343 EPPRHTKAIPLDIMTIPKPQIVPKSLNTPPEISRVPPEYSDLAEVFSKTR---------A 393
Query: 438 APAPLVRIVSGYAIPFSAKPPLVP-LCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTT 496
A P R Y P P P L L+ P +AM ++ E L++G ++ S
Sbjct: 394 ASLPPHR---PYDCPIDLLPGTCPPRGKLYSLSGPERAAMEKYVHESLDSGFIRPSTSPA 450
Query: 497 GFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL---INHFRIPSFLQKGDYMISIDL 553
G + F V K +G RP ++ +GLN ++ L F+I LQ +DL
Sbjct: 451 G--AGFFFVGKKDGSLRPCIDYRGLNSITVKNRYPLPLMTTAFKI---LQGATIFTKLDL 505
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
AY V I+ + A + +PFGLA AP F S + +LR +
Sbjct: 506 RSAYHLVRIRQGDEWKTAFNTPTGHYEYQVMPFGLANAPAVFQSF---IYDVLREMLNIF 562
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIM----- 667
V VYLDD L+ + +P I + + L G V L+KS + V FLG +
Sbjct: 563 VFVYLDDILIFSHNPEEHVIHVRKVLIELLKHGLFVKLEKSEFHVSSV-SFLGFIVSKGS 621
Query: 668 --WDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSR 725
DP R L + ++ + R +L FA+F R S
Sbjct: 622 LQIDPSKTRAVLDWPQPTSIKEVQR------------------FLGFANFYRRFIRNFSS 663
Query: 726 RIQRQASLLRLGAPHLTPINPA--VLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGW 783
+ SL + T + A L+ + P+ + P ++ + ASD+G
Sbjct: 664 IAEPLTSLTKKANTPFTWNDKASTAFNTLKHRFTSAPILTLPDP-ELPFILEVYASDIGV 722
Query: 784 G------SQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ--SSVVM 830
G S+ D+ +F S + Q N+ I +E+ A+ AL L+ S +
Sbjct: 723 GAVLSQRSKADNKLHPCAFYSHRLTPTQANYDIGNRELLAIKLALEEWRHWLEGASHHFL 782
Query: 831 VQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ +D+Q ++Y++ K L+ +F R F PG+ N D+LSR
Sbjct: 783 IWTDHQN-LTYIQ---NAKRLNARQARWSLFFN----RFKFTLSFRPGSKNIKPDALSR 833
>gi|425779964|gb|EKV17987.1| Retrotransposon polyprotein, putative [Penicillium digitatum PHI26]
Length = 1791
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 105/469 (22%), Positives = 186/469 (39%), Gaps = 63/469 (13%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQF 524
+ L+ S + +I E L+ G ++ S+ G+ + VPK +G R ++ + LN
Sbjct: 959 IYQLSQKESETLKEYISENLKKGYIRASKSSAGYP--IIFVPKKDGSLRLCVDYRHLNSI 1016
Query: 525 LSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
+ L + L + Y D++ AY + IKT H+ A +
Sbjct: 1017 TIKDRHPLPLIHEMQDRLGRAKYYSKYDITNAYHRIRIKTGHEWKTAFRTKYGHFEYLVM 1076
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL 644
PFGL AP F + ++ + V+VYLDD L+ ++ KL + +
Sbjct: 1077 PFGLTNAPATFQRF--IIKAIEEYLDLFVIVYLDDILVFSETLEEHIEHNKLVLQKMREA 1134
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
+ L+K FLG P + + + E+K +++++ T S +
Sbjct: 1135 EVTLKLKKCEFHVQETT-FLGYRISP--NGLGMEEEK-------VKSIMEWPT--PKSMK 1182
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---------W 755
+ +L A++ R+I + + G LT + K EW
Sbjct: 1183 EVQRFLGLANYY--------RKIIDGYAGVATGLYRLTKKD----QKFEWDEAAEDSFRK 1230
Query: 756 LNALPLSSPI---FPRQVQHFISTDASDLGWGSQVDSSFLSG------LWSRE----QQN 802
L L I F + TDASD G+++ G WSR+ + N
Sbjct: 1231 LKTLFSKGTIVATFDYDKPIIMETDASDYALGARLTQPGQDGKYRPVAFWSRKIIPAELN 1290
Query: 803 WHINKKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
+ ++ KE+ A+ A + L+ + V V+SD++ + + + T+ + +E
Sbjct: 1291 YDVHDKELLAIVSAFQVWREYLEGAKHTVTVKSDHKNLTFFTTTKVLTRRQARWAET--- 1347
Query: 861 FLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFLK 909
L D+RI + G NS AD+LSR PD+ L + E L+
Sbjct: 1348 -LAQYDFRI----EHCKGTENSQADALSRR---PDYELGTKSAEPAVLR 1388
>gi|19114259|ref|NP_593347.1| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|19115068|ref|NP_594156.1| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|63054539|ref|NP_593385.2| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|162312209|ref|NP_001018800.2| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|1710054|sp|Q05654.1|RTF21_SCHPO RecName: Full=Retrotransposable element Tf2 155 kDa protein type 1
gi|1177360|emb|CAA93236.1| retrotransposable element [Schizosaccharomyces pombe]
gi|4760340|emb|CAB42363.1| retrotransposable element [Schizosaccharomyces pombe]
gi|6014423|emb|CAB57422.1| retrotransposable element [Schizosaccharomyces pombe]
gi|159883930|emb|CAB58169.2| retrotransposable element [Schizosaccharomyces pombe]
Length = 1333
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 98/430 (22%), Positives = 190/430 (44%), Gaps = 40/430 (9%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
AM+ I + L++G+++ + + VPK G R V++ K LN+++ P + L
Sbjct: 427 AMNDEINQGLKSGIIRESKAINA--CPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 484
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
++ + +Q +DL AY + ++ + LA V +P+G++TAP
Sbjct: 485 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAH 544
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F N + L ++ VV Y+DD L+ ++ K + L + I+N K
Sbjct: 545 FQYFINTI--LGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 655 LSPAPVLQFLGIMWDPHL-DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ V +F+G H+ ++ + P NI + L + N R LG +++
Sbjct: 603 FHQSQV-KFIGY----HISEKGFTP-----CQENIDKVLQWKQPKNRKELRQFLGSVNYL 652
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQ 771
IP S+ +LL+ TP + ++ L + P L F +++
Sbjct: 653 RKFIPKT---SQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKI- 708
Query: 772 HFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
+ TDASD+ G+ + + S S+ Q N+ ++ KEM A+ ++L
Sbjct: 709 -LLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWR 767
Query: 822 PLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
L+S++ + +D++ ++ + + ++ L ++FL QD+ I + PG+
Sbjct: 768 HYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR--WQLFL--QDFNFEI--NYRPGS 821
Query: 880 YNSVADSLSR 889
N +AD+LSR
Sbjct: 822 ANHIADALSR 831
>gi|425779965|gb|EKV17988.1| Retrotransposon polyprotein, putative [Penicillium digitatum PHI26]
Length = 1822
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 105/469 (22%), Positives = 186/469 (39%), Gaps = 63/469 (13%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQF 524
+ L+ S + +I E L+ G ++ S+ G+ + VPK +G R ++ + LN
Sbjct: 986 IYQLSQKESETLKEYISENLKKGYIRASKSSAGYP--IIFVPKKDGSLRLCVDYRHLNSI 1043
Query: 525 LSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
+ L + L + Y D++ AY + IKT H+ A +
Sbjct: 1044 TIKDRHPLPLIHEMQDRLGRAKYYSKYDITNAYHRIRIKTGHEWKTAFRTKYGHFEYLVM 1103
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL 644
PFGL AP F + ++ + V+VYLDD L+ ++ KL + +
Sbjct: 1104 PFGLTNAPATFQRF--IIKAIEEYLDLFVIVYLDDILVFSETLEEHIEHNKLVLQKMREA 1161
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
+ L+K FLG P + + + E+K +++++ T S +
Sbjct: 1162 EVTLKLKKCEFHVQETT-FLGYRISP--NGLGMEEEK-------VKSIMEWPT--PKSMK 1209
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---------W 755
+ +L A++ R+I + + G LT + K EW
Sbjct: 1210 EVQRFLGLANYY--------RKIIDGYAGVATGLYRLTKKD----QKFEWDEAAEDSFRK 1257
Query: 756 LNALPLSSPI---FPRQVQHFISTDASDLGWGSQVDSSFLSG------LWSRE----QQN 802
L L I F + TDASD G+++ G WSR+ + N
Sbjct: 1258 LKTLFSKGTIVATFDYDKPIIMETDASDYALGARLTQPGQDGKYRPVAFWSRKIIPAELN 1317
Query: 803 WHINKKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
+ ++ KE+ A+ A + L+ + V V+SD++ + + + T+ + +E
Sbjct: 1318 YDVHDKELLAIVSAFQVWREYLEGAKHTVTVKSDHKNLTFFTTTKVLTRRQARWAET--- 1374
Query: 861 FLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFLK 909
L D+RI + G NS AD+LSR PD+ L + E L+
Sbjct: 1375 -LAQYDFRI----EHCKGTENSQADALSRR---PDYELGTKSAEPAVLR 1415
>gi|19075455|ref|NP_587955.1| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|19114896|ref|NP_593984.1| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|19115321|ref|NP_594409.1| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|68000596|ref|NP_001018276.1| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|173439|gb|AAA91215.1| protease, reverse transcriptase, RNAse H, integrase protein
[Schizosaccharomyces pombe]
gi|2388948|emb|CAB11682.1| retrotransposable element [Schizosaccharomyces pombe]
gi|6318248|emb|CAB60245.1| retrotransposable element [Schizosaccharomyces pombe]
gi|7340821|emb|CAB83007.1| retrotransposable element [Schizosaccharomyces pombe]
gi|19571555|emb|CAD27466.1| retrotransposable element [Schizosaccharomyces pombe]
Length = 1333
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 98/430 (22%), Positives = 190/430 (44%), Gaps = 40/430 (9%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
AM+ I + L++G+++ + + VPK G R V++ K LN+++ P + L
Sbjct: 427 AMNDEINQGLKSGIIRESKAINA--CPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 484
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
++ + +Q +DL AY + ++ + LA V +P+G++TAP
Sbjct: 485 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAH 544
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F N + L ++ VV Y+DD L+ ++ K + L + I+N K
Sbjct: 545 FQYFINTI--LGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 655 LSPAPVLQFLGIMWDPHL-DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ V +F+G H+ ++ + P NI + L + N R LG +++
Sbjct: 603 FHQSQV-KFIGY----HISEKGFTP-----CQENIDKVLQWKQPKNRKELRQFLGSVNYL 652
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQ 771
IP S+ +LL+ TP + ++ L + P L F +++
Sbjct: 653 RKFIPKT---SQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKI- 708
Query: 772 HFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
+ TDASD+ G+ + + S S+ Q N+ ++ KEM A+ ++L
Sbjct: 709 -LLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWR 767
Query: 822 PLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
L+S++ + +D++ ++ + + ++ L ++FL QD+ I + PG+
Sbjct: 768 HYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR--WQLFL--QDFNFEI--NYRPGS 821
Query: 880 YNSVADSLSR 889
N +AD+LSR
Sbjct: 822 ANHIADALSR 831
>gi|326670216|ref|XP_003199163.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1325
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 119/483 (24%), Positives = 205/483 (42%), Gaps = 81/483 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 308 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 365
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 366 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTVDGHYEYLVM 422
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + +R V+VY+DD L+ + I ++ L I
Sbjct: 423 PFGLANSPSVFQAFVNEIFRDMLNRW--VIVYIDDILIYSNSLSEHIQHVRAVLKRLIEN 480
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 481 QL-----FAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 523
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 524 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 572
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 573 RAFTQLKTHFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 632
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 633 NSAERNYDVGNRELLAIKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 688
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR-----SKSLPDWHLSRSATEQIFLK 909
+F D+++ +IPG+ N AD+LSR + +PD + +S ++
Sbjct: 689 QARWALFFTRFDFQV----TYIPGSKNIKADALSRLSDDETSEIPDEPIIKSPLIVAPIQ 744
Query: 910 WGV 912
W +
Sbjct: 745 WDI 747
>gi|326668550|ref|XP_003198820.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1290
Score = 79.7 bits (195), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 116/478 (24%), Positives = 187/478 (39%), Gaps = 100/478 (20%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P +AM ++ E L G+++ S G + F V K +G RP +N +G
Sbjct: 374 PRGRLFSLSAPERAAMDKYLTESLAAGIIRHSSSPAG--AGFFFVKKKDGSLRPCINYRG 431
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ + +DL AY V ++ + A +
Sbjct: 432 LNDITIKNRYPLPLMSSAFDLLQGARFFTKLDLRNAYHLVRMREGDEWKTAFNTPTGHFE 491
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQD------------P 627
LPFGL AP F +L N +LR V VYLDD L+ +
Sbjct: 492 YLVLPFGLTNAPAVFQALVN---DVLRDMINQFVFVYLDDILIFSSAMQEHVQHVRRVLQ 548
Query: 628 RILEIQ-------GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
R+LE Q K V + LG I++++ + PA V W P D
Sbjct: 549 RLLENQLYVKAEKCKFHVQSVSFLGHIISVEGLRMDPAKVRAVSD----------WPPPD 598
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
+ ++L +L FA+F RR R + R+ AP
Sbjct: 599 FR---------------------KALQRFLGFANFY--------RRFIR--NFGRVAAP- 626
Query: 741 LTPINPAVLPKLEW---------WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD 788
LT + + + W L +L S+PI P + F + DAS++G G+ +
Sbjct: 627 LTALTSTRI-RFGWSVAAQTAFDHLKSLFTSAPILITPDPARQFVVEVDASEVGVGAVLS 685
Query: 789 ----------SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQ 836
++ S S ++N+ + +E+ AV AL L+ + V +V +D++
Sbjct: 686 QTAQDNKLHPCAYFSHCLSPTERNYDVGNRELLAVRLALGEWRHWLEGAAVPFLVWTDHR 745
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
+ Y++ K L+ +F R + + PG+ NS D+LSR P
Sbjct: 746 N-LQYIQ---TAKRLNARQARWALFF----GRFNFTLSYRPGSKNSKPDALSRCFGSP 795
>gi|326677009|ref|XP_003200729.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1281
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 112/445 (25%), Positives = 178/445 (40%), Gaps = 43/445 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 368 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 425
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 426 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 485
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 486 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQ 542
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q + W
Sbjct: 543 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQAVVD-----------WP 589
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLN 757
DS ++L +L FA+F R S+ SL P + A KL+
Sbjct: 590 TPDSRKALQRFLGFANFYRRFIRNFSQLATPLTSLTSSKTPFRWSSAAEAAFSKLKGCFV 649
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
+ P+ P + Q + DAS++G G+ + ++ S S ++N+ I
Sbjct: 650 SAPILIAPDPSR-QFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSPAERNYDIG 708
Query: 807 KKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ S V +V +D++ + Y+R K L+ +F
Sbjct: 709 NRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNSRQARWALFF-- 762
Query: 865 QDWRIHILAQFIPGAYNSVADSLSR 889
R + + PG+ N D+LSR
Sbjct: 763 --GRFNFTISYRPGSKNIKPDALSR 785
>gi|326671564|ref|XP_003199463.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1402
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 113/468 (24%), Positives = 189/468 (40%), Gaps = 89/468 (19%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P S+ L+ P +AM +I+E L G+++ S G + F V K +GG RP ++ +G
Sbjct: 499 PRGSIFSLSLPERTAMDDYIEESLAAGIIRPSTSPAG--AGFFFVGKKDGGLRPCIDYRG 556
Query: 521 LNQFLSPKKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
LN K ++ N + +P LQ+ DL AY V IK + A +
Sbjct: 557 LN------KITIRNRYPLPLMSTAFEILQEASIFTKPDLRNAYHLVRIKRGDEWKTAFNT 610
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
+PFGL AP F +L N V + ++ V VYLDD L+ + +
Sbjct: 611 PTGHYEYLVMPFGLTNAPAVFQALINDVLRDMLNKF--VFVYLDDILIFSSSLQEHVHHV 668
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ + L V +K + V +FLG + P +M D Q + A
Sbjct: 669 RKVLHRLLENHLYVKPEKCQFHVSQV-KFLGFVIQPGQIQM----DPQ--------KVQA 715
Query: 695 SKTW----NLDSARSLLGY--------LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT 742
W ++ + LG+ L+F++ V P+ L ++ + R G
Sbjct: 716 MADWPSPSSVKEVQRFLGFANFYRKFILNFSTVVAPLSALTKEKV----AGFRWG----- 766
Query: 743 PINPAVLPKLEWWLNALP---LSSPIF--PRQVQHF-ISTDASDLGWGSQVD-------- 788
P+ E N L S+PI P + F + DAS++G G+ +
Sbjct: 767 -------PEAEKAFNELKKRFTSAPILLIPNPDKPFTVEVDASEVGIGAVLSQRGEDNKL 819
Query: 789 --SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG 846
+FLS + ++N+H+ +E+ AV AL L+ S + Q + +
Sbjct: 820 HPCAFLSHRLTPTERNYHVGDRELLAVKLALEEWRHWLEGS----KHQFQVLTDH----- 870
Query: 847 GTKSLSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
K+L + + +++ W R H + PG+ N D+LSR
Sbjct: 871 --KNLEYVQQAKRLNPRQARWSLFFNRFHFTLTYRPGSKNLKPDALSR 916
>gi|326672413|ref|XP_003199660.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
gi|326675801|ref|XP_002661138.2| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1226
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 113/445 (25%), Positives = 178/445 (40%), Gaps = 43/445 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 368 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 425
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 426 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 485
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 486 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQ 542
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 543 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVDWP 589
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLN 757
DS ++L +L FA+F R S+ SL P + A KL+
Sbjct: 590 TPDSRKALQRFLGFANFYRRFIRNFSQLATPLTSLTSSKTPFRWSSAAEAAFSKLKGCFV 649
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
+ P+ P + Q + DAS++G G+ + ++ S S ++N+ I
Sbjct: 650 SAPILIAPDPSR-QFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSPAERNYDIG 708
Query: 807 KKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ S V +V +D++ + Y+R K L+ +F
Sbjct: 709 NRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNSRQARWALFF-- 762
Query: 865 QDWRIHILAQFIPGAYNSVADSLSR 889
R + + PG+ N D+LSR
Sbjct: 763 --GRFNFTISYRPGSKNIKPDALSR 785
>gi|292617566|ref|XP_001919411.2| PREDICTED: hypothetical protein LOC100002220 [Danio rerio]
Length = 1755
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 120/472 (25%), Positives = 177/472 (37%), Gaps = 86/472 (18%)
Query: 387 EPPGR---VSLKVQTLQKPQRCSSPVN-PPADSRIGAEL-----VGGRLRRFVDAWIRLG 437
EPP + L + T+ KPQ +N PP SR+ E V + R
Sbjct: 913 EPPRHTKAIPLDIMTIPKPQIVPKSLNTPPEISRVPPEYSDLAEVFSKTR---------A 963
Query: 438 APAPLVRIVSGYAIPFSAKPPLVP-LCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTT 496
A P R Y P P P L L+ P +AM ++ E L++G ++ S
Sbjct: 964 ASLPPHR---PYDCPIDLLPGTCPPRGKLYSLSGPERAAMEKYVHESLDSGFIRPSTSPA 1020
Query: 497 GFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQA 556
G + F V K +G RP ++ +GLN ++ L LQ +DL A
Sbjct: 1021 G--AGFFFVGKKDGSLRPCIDYRGLNSITVKNRYPLPLMTTAFEILQGATIFTKLDLRSA 1078
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVY 616
Y V I+ + + +PFGLA AP F S N V L + V VY
Sbjct: 1079 YHLVRIRQGDEWKTGFNTPTGHYEYQVMPFGLANAPAVFQSFINDV--LREMLNIFVFVY 1136
Query: 617 LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIM-------WD 669
LDD L+ + +P I + + L G V L+KS + V FLG + D
Sbjct: 1137 LDDILIFSHNPEEHVIHVRKVLIELLKHGLFVKLEKSEFHVSSV-SFLGFIVSKGSLQMD 1195
Query: 670 PHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQR 729
P R L + ++ + R +L FA+F RR R
Sbjct: 1196 PSKTRAVLDWPQPTSIKEVQR------------------FLGFANFY--------RRFIR 1229
Query: 730 QASLLRLGAPHLTPINPAVLPKLEW---------WLNALPLSSPIFPR---QVQHFISTD 777
S + A LT + W L + S+PI ++ + D
Sbjct: 1230 NFSSI---AEPLTSLTKKANTPFTWNDKASTAFNTLKHIFTSAPILTLPDPELPFILEVD 1286
Query: 778 ASDLGWG------SQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALS 818
ASD+G G S+ D+ +F S + Q N+ I +E+ A+ AL
Sbjct: 1287 ASDIGVGAVLSQRSKADNKLHPCAFYSHRLTPTQANYDIGNRELLAIKLALE 1338
>gi|326665283|ref|XP_002660991.2| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1573
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 115/445 (25%), Positives = 179/445 (40%), Gaps = 43/445 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 368 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 425
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 426 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 485
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 486 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQ 542
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q + W
Sbjct: 543 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQAVVD-----------WP 589
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLN 757
DS ++L +L FA+F R S+ SL P + A KL+
Sbjct: 590 TPDSRKALQRFLGFANFYRRFIRNFSQLATPLTSLTSSKTPFRWSSAAEAAFSKLKGCFV 649
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWG---SQVDSS--------FLSGLWSREQQNWHIN 806
+ P+ P + Q + DAS++G G SQ +S + S S ++N+ I
Sbjct: 650 SAPILIAPDPSR-QFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSPAERNYDIG 708
Query: 807 KKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ S V +V +D++ + Y+R K L+ +F
Sbjct: 709 NRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNSRQARWALFF-- 762
Query: 865 QDWRIHILAQFIPGAYNSVADSLSR 889
R + + PG+ N D+LSR
Sbjct: 763 --GRFNFTISYRPGSKNIKPDALSR 785
>gi|173477|gb|AAA35339.1| Tf1 protein [Schizosaccharomyces pombe]
Length = 1330
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 98/430 (22%), Positives = 189/430 (43%), Gaps = 40/430 (9%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
AM+ I + L++G+++ + + VPK G R V++ K LN+++ P + L
Sbjct: 424 AMNDEINQGLKSGIIRESKAINA--CPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 481
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
++ + +Q +DL AY + ++ + LA V +P+G++TAP
Sbjct: 482 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAH 541
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F N + L ++ VV Y+DD L+ ++ K + L + I+N K
Sbjct: 542 FQYFINTI--LGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 599
Query: 655 LSPAPVLQFLGIMWDPHL-DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ V +F+G H+ ++ + P NI + L + N R LG +++
Sbjct: 600 FHQSQV-KFIGY----HISEKGFTP-----CQENIDKVLQWKQPKNRKELRQFLGSVNYL 649
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQ 771
IP S+ LL+ TP + ++ L + P L F +++
Sbjct: 650 RKFIPKT---SQLTHPLNKLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKI- 705
Query: 772 HFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
+ TDASD+ G+ + + S S+ Q N+ ++ KEM A+ ++L
Sbjct: 706 -LLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWR 764
Query: 822 PLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
L+S++ + +D++ ++ + + ++ L ++FL QD+ I + PG+
Sbjct: 765 HYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR--WQLFL--QDFNFEI--NYRPGS 818
Query: 880 YNSVADSLSR 889
N +AD+LSR
Sbjct: 819 ANHIADALSR 828
>gi|301614289|ref|XP_002936633.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1108
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 112/440 (25%), Positives = 186/440 (42%), Gaps = 44/440 (10%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ +S G + F V K +GG RP ++ +GLN+
Sbjct: 212 LSLPEAQAMREYISENLERGFIRPSNSPAG--AGFFFVGKKDGGLRPCIDYRGLNKITIK 269
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ + +DL AY + IK + A + +PFG
Sbjct: 270 NRYPLPLISELFDRVKGANIYTKLDLRGAYNLIRIKEGDEWKTAFNTRDGHYEYLVMPFG 329
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N + L G+ VVVYLDD L+ + + R K+ + L
Sbjct: 330 LCNAPAVFQEFVNDIFRDL--LGVFVVVYLDDILIFSSNLRDHRSHVKVVLQRLRENNLY 387
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD-SARSL 706
L+K + V QFLG H+ L D + +R +L W S R+
Sbjct: 388 AKLEKCTFEVNSV-QFLGF----HISSKGLEMDPEK-----VRAVL---DWMQPLSLRAT 434
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF 766
+L FA++ + S + L + GA + AV + +L +S+PI
Sbjct: 435 QRFLGFANYYRQFIKNFSLIVAPITDLTKKGADPSLWSSEAV--QAFNFLKKEFVSAPIL 492
Query: 767 --PRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKEMFA 812
P FI DAS++G G+ + +F S +S + N+ I +E+ A
Sbjct: 493 RHPDTALPFIVEVDASEVGAGAVLSQRHPLTNKLHPCAFFSKKFSPSEANYDIGNRELLA 552
Query: 813 VHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
+ A LL+ + V V +D++ ++ Y+ K L+ +F R +
Sbjct: 553 IKWAFEEWRHLLEGAKHAVSVFTDHKNLL-YIE---SAKRLNPRQARWALFFS----RFN 604
Query: 871 ILAQFIPGAYNSVADSLSRS 890
+ G+ N+ AD+LSRS
Sbjct: 605 FSITYRAGSKNTKADALSRS 624
>gi|425777956|gb|EKV16106.1| Retrotransposon polyprotein, putative [Penicillium digitatum Pd1]
Length = 1695
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 105/469 (22%), Positives = 186/469 (39%), Gaps = 63/469 (13%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQF 524
+ L+ S + +I E L+ G ++ S+ G+ + VPK +G R ++ + LN
Sbjct: 908 IYQLSQKESETLKEYISENLKKGYIRASKSSAGYP--IIFVPKKDGSLRLCVDYRHLNSI 965
Query: 525 LSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
+ L + L + Y D++ AY + IKT H+ A +
Sbjct: 966 TIKDRHPLPLIHEMQDRLGRAKYYSKYDITNAYHRIRIKTGHEWKTAFRTKYGHFEYLVM 1025
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL 644
PFGL AP F + ++ + V+VYLDD L+ ++ KL + +
Sbjct: 1026 PFGLTNAPATFQRFI--IKAIEEYLDLFVIVYLDDILVFSETLEEHIEHNKLVLQKMREA 1083
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
+ L+K FLG P + + + E+K +++++ T S +
Sbjct: 1084 EVTLKLKKCEFHVQETT-FLGYRISP--NGLGMEEEK-------VKSIMEWPT--PKSMK 1131
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---------W 755
+ +L A++ R+I + + G LT + K EW
Sbjct: 1132 EVQRFLGLANYY--------RKIIDGYAGVATGLYRLTKKD----QKFEWDEAAEDSFRK 1179
Query: 756 LNALPLSSPI---FPRQVQHFISTDASDLGWGSQVDSSFLSG------LWSRE----QQN 802
L L I F + TDASD G+++ G WSR+ + N
Sbjct: 1180 LKTLFSKGTIVATFDYDKPIIMETDASDYALGARLTQPGQDGKYRPVAFWSRKIIPAELN 1239
Query: 803 WHINKKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
+ ++ KE+ A+ A + L+ + V V+SD++ + + + T+ + +E
Sbjct: 1240 YDVHDKELLAIVSAFQVWREYLEGAKHTVTVKSDHKNLTFFTTTKVLTRRQARWAET--- 1296
Query: 861 FLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFLK 909
L D+RI + G NS AD+LSR PD+ L + E L+
Sbjct: 1297 -LAQYDFRI----EHCKGTENSQADALSRR---PDYELGTKSAEPAVLR 1337
>gi|326670628|ref|XP_003199256.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1174
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 373 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 430
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 431 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 490
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 491 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 547
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 548 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVNWP 594
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 595 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTNLTSSKTP-FRWSNAA 642
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG------SQVD-----SSFLSGL 795
L +S+PI P + F + DAS++G G S +D ++ S
Sbjct: 643 EVAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSALDGKIHPCAYFSHR 702
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 703 LSAAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIK---AAKRLNS 758
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 759 RQARWALFF----GRFNFSISYRPGSKNIKPDALSR 790
>gi|326668476|ref|XP_003198810.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1174
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 373 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 430
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 431 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 490
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 491 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 547
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 548 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVNWP 594
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 595 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTNLTSSKTP-FRWSNAA 642
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG------SQVD-----SSFLSGL 795
L +S+PI P + F + DAS++G G S +D ++ S
Sbjct: 643 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSALDGKIHPCAYFSHR 702
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 703 LSAAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIK---AAKRLNS 758
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 759 RQARWALFF----GRFNFSISYRPGSKNIKPDALSR 790
>gi|326678715|ref|XP_003201148.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1361
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 116/478 (24%), Positives = 187/478 (39%), Gaps = 100/478 (20%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P +AM ++ E L G+++ S G + F V K +G RP ++ +G
Sbjct: 393 PRGRLFSLSAPERAAMDKYLTESLAAGIIRHSSSPAG--AGFFFVKKKDGSLRPCIDYRG 450
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ + +DL AY V ++ + A +
Sbjct: 451 LNDITIKNRYPLPLMSSAFDLLQGARFFTKLDLRNAYHLVRMREGDEWKTAFNTPTGHFE 510
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQD------------P 627
LPFGL AP F +L N +LR V VYLDD L+ +
Sbjct: 511 YLVLPFGLTNAPAVFQALVN---DVLRDMINQFVFVYLDDILIFSSTMQEHVQHVRRVLQ 567
Query: 628 RILEIQ-------GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
R+LE Q K V + LG I++++ + PA V W P
Sbjct: 568 RLLENQLYVKAEKCKFHVQSVSFLGHIISVEGLRMDPAKVRAVSD--WPPP--------- 616
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
DS ++L +L FA+F RR R + R+ AP
Sbjct: 617 --------------------DSRKALQRFLGFANFY--------RRFIR--NFGRVAAP- 645
Query: 741 LTPINPAVLPKLEW---------WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD 788
LT + + + W L +L S+PI P + F + DAS++G G+ +
Sbjct: 646 LTALTSTRI-RFGWSVAAQTAFDHLKSLFTSAPILITPDPARQFVVEVDASEVGVGAVLS 704
Query: 789 ----------SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQ 836
++ S S ++N+ + +E+ AV AL L+ + V +V +D++
Sbjct: 705 QTAQDNKLHPCAYFSHCLSPTERNYDVGNRELLAVRLALGEWRHWLEGAAVPFLVWTDHR 764
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
+ Y++ K L+ +F R + + PG+ NS D+LSR P
Sbjct: 765 N-LQYIQ---TAKRLNARQARWALFF----GRFNFTLSYRPGSKNSKPDALSRCFGSP 814
>gi|326663868|ref|XP_003197683.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1187
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 119/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 373 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 430
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 431 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 490
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 491 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 547
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q + W
Sbjct: 548 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQAVV-----------NWP 594
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 595 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTNLTSSKTP-FRWSNAA 642
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG------SQVD-----SSFLSGL 795
L +S+PI P + F + DAS++G G S +D ++ S
Sbjct: 643 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSALDGKIHPCAYFSHR 702
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 703 LSAAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIK---SAKRLNS 758
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 759 RQARWALFF----GRFNFSISYRPGSKNIKPDALSR 790
>gi|357601798|gb|EHJ63156.1| hypothetical protein KGM_09197 [Danaus plexippus]
Length = 129
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 65/106 (61%)
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
++S V++Q+D +TV++Y++ +GGT+S LL ++ L + I + Q +PG N+
Sbjct: 3 FRNSTVILQNDKKTVLTYIKNEGGTRSRHLLKLTGQLLNLVDHFNIVLRLQHLPGLLNTE 62
Query: 884 ADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNH 929
AD LSR+ + DW++ T ++F WG P +DLFA++ + V H
Sbjct: 63 ADRLSRNHAAVDWYIRDEETSRLFSLWGTPDLDLFATQTAHVKSVH 108
>gi|326671251|ref|XP_003199400.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1353
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 115/454 (25%), Positives = 179/454 (39%), Gaps = 61/454 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +IQE L G+++ S G + F V K +G RP ++ +G
Sbjct: 412 PKGRLYSLSGPEREAMDRYIQESLNAGLIRPSSSPAG--AGFFFVKKRDGSLRPCIDYRG 469
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 470 LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 529
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + ++ V VYLDD L+ + + + +
Sbjct: 530 YLVLPFGLTNAPAVFQALVNDVLRDMVNKF--VFVYLDDILIFSSSLQEHTQHVRQVLQR 587
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L V +K A + FLG + R DK + A W +
Sbjct: 588 LLENQLFVKAEKCEFH-ARSVAFLGYVISAEGIRA--DPDK----------VRAVAKWPV 634
Query: 701 DSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK-------- 751
+ R L +L FA+F RR R S ++ AP + VL K
Sbjct: 635 PNTRKALQRFLGFANFY--------RRFIRNFS--QIAAPLTALTSTKVLFKWNTQAQEA 684
Query: 752 ---LEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWS 797
L+ + P+ S P Q Q + DAS++G G+ + +F S S
Sbjct: 685 FGALKSRFTSAPVLSIPDPEQ-QFIVEVDASEVGVGAVLSQRSSKDGKVHPCAFFSHRLS 743
Query: 798 REQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLS 855
++N+ I +E+ AV AL L+ + +V +D++ + Y+R K LS
Sbjct: 744 PAERNYDIGNRELLAVRLALGEWRHWLEGAAHPFLVWTDHKN-LEYVR---SAKRLSARQ 799
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + + PG+ N D+LSR
Sbjct: 800 ARWALFF----GRFNFVLSYRPGSKNIKPDALSR 829
>gi|301605918|ref|XP_002932608.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1173
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 108/445 (24%), Positives = 187/445 (42%), Gaps = 45/445 (10%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E L+ G ++ S G + F V K +G RP ++ +GLN+
Sbjct: 314 LSLPEAQAMKEYINENLQRGFIRPSSSPAG--AGFFFVGKKDGSLRPCIDYRGLNKITVK 371
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ + +DL AY + I+ + A + +PFG
Sbjct: 372 NRYPLPLISELFDQVRNAKFFTKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFG 431
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N + L G+ VVVYLDD L+ + + + + L
Sbjct: 432 LCNAPAVFQEFVNDIFRDL--LGLFVVVYLDDILIFSSNLSDHRNHVREVLLRLRGNNLY 489
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLL 707
L+K P +QFLG ++ D+ L + ++ + L S R+L
Sbjct: 490 AKLEKCIFE-VPSVQFLG----------FVISDEGLAMDSVKVKAILEWAQPL-SLRALQ 537
Query: 708 GYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKLEWWLNALPLSSPIF 766
+L FA++ + S + L + GA P L + + E +L +S+PI
Sbjct: 538 RFLGFANYYRQFIKNFSLILAPLTDLTKKGADPSLW--SSKAVHAFE-FLKKEFVSAPIL 594
Query: 767 PR---QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKEMFA 812
+ + DAS++G G+ + +F S +S + N+ I +E+ A
Sbjct: 595 RHPDTSLPFIVEVDASEVGAGAVLSQRHPTTNKMHPCAFFSKKFSPAEVNYDIGNRELLA 654
Query: 813 VHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
V A LL+ + VMV +D++ +++ K L+ +F R +
Sbjct: 655 VKWAFEEWRHLLEGAKYPVMVFTDHKNLLNI----ESAKRLNPRQARWALFFS----RFN 706
Query: 871 ILAQFIPGAYNSVADSLSRS-KSLP 894
F PG+ N AD+LSRS +S+P
Sbjct: 707 FSLTFRPGSKNIKADALSRSFESIP 731
>gi|301604319|ref|XP_002931816.1| PREDICTED: hypothetical protein LOC100494422 [Xenopus (Silurana)
tropicalis]
Length = 810
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/355 (23%), Positives = 152/355 (42%), Gaps = 29/355 (8%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
KG + D+ A+ +PI + L + G CLP G + + + F S ++
Sbjct: 134 KGALLAKSDIESAFRLLPIHSDCYHLLGCQFEGQFYYDLCLPMGCSISCRYFECFSTFLE 193
Query: 604 SLLRSR-GMRVVV-YLDDFLLVNQ-DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R G+ V+ YLDDFL + + + ++ + G ++ +K+ P +
Sbjct: 194 WVVRHETGLNSVIHYLDDFLFIGPPNTNVCQLLLSTFQFFMEKFGVPLSREKTE-GPVTI 252
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L FLGI D LP+DK L + + + +K L S +SLLG L FA ++P+
Sbjct: 253 LSFLGIEIDTVELVFRLPDDKLQRLKSTVAEITVAKKVTLRSMQSLLGLLVFACRIMPIA 312
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQVQH 772
R+ SRR+ ++ H I + L W L + + + ++
Sbjct: 313 RVFSRRLSLSTCGIK-QPHHFIRITKQLREDLRVWQTFLEQYNGHTCLMDTEVSNEELSL 371
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLNLPL 823
F TDA+ GS + L+ W EQ NW ++ E+F + A+ +
Sbjct: 372 F--TDAA----GSTGFGAILAQSWCAEQWPDNWAPVGLCKNLTLLELFPIVVAVEIWGHR 425
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
+ + +DN +VV + + + SL +L+ + + L + I A+ +PG
Sbjct: 426 MSGKKICFWTDNMSVVFAINKL-TSSSLPVLALLRHLVLRCLELNIWFRARHVPG 479
>gi|301610638|ref|XP_002934857.1| PREDICTED: hypothetical protein LOC100497321 [Xenopus (Silurana)
tropicalis]
Length = 1175
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 85/184 (46%), Gaps = 4/184 (2%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ Q L + G CLP G + + F + S ++
Sbjct: 792 GALMAKADIESAFRLLPVHKESQHLLGCFFKGSYYVDRCLPMGCSISCAYFEAFSTFLEW 851
Query: 605 LLRSR-GMRVVV-YLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R R G+ ++ YLDDFL V D + + + + G + K+ PA L
Sbjct: 852 VVRKRAGVNTLIHYLDDFLCVGPGDSGLCAVLLQTLQEVADQFGVPLAGDKTE-GPATCL 910
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP DK L + L ++ L +SL+G L+FA +IPMGR
Sbjct: 911 KFLGIEIDTVRQECRLPPDKVQLLKGEVEYALGARKVTLKQLQSLIGRLNFACRIIPMGR 970
Query: 722 LHSR 725
+ +R
Sbjct: 971 VFAR 974
>gi|326664009|ref|XP_003197710.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1214
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 314 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 371
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 372 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 431
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 432 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 488
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 489 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVNWP 535
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 536 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTNLTSSKTP-FRWSNAA 583
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG------SQVD-----SSFLSGL 795
L +S+PI P + F + DAS++G G S +D ++ S
Sbjct: 584 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSALDGKIHPCAYFSHR 643
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 644 LSAAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIK---AAKRLNS 699
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 700 RQARWALFF----GRFNFSISYRPGSKNIKPDALSR 731
>gi|326676679|ref|XP_003200646.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1353
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 115/454 (25%), Positives = 179/454 (39%), Gaps = 61/454 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +IQE L G+++ S G + F V K +G RP ++ +G
Sbjct: 412 PKGRLYSLSGPEREAMDRYIQESLNAGLIRPSSSPAG--AGFFFVKKRDGSLRPCIDYRG 469
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 470 LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 529
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + ++ V VYLDD L+ + + + +
Sbjct: 530 YLVLPFGLTNAPAVFQALVNDVLRDMVNKF--VFVYLDDILIFSSSLQEHTQHVRQVLQR 587
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L V +K A + FLG + R DK + A W +
Sbjct: 588 LLENQLFVKAEKCEFH-ARSVAFLGYVISAEGIRA--DPDK----------VRAVAKWPV 634
Query: 701 DSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK-------- 751
+ R L +L FA+F RR R S ++ AP + VL K
Sbjct: 635 PNTRKALQRFLGFANFY--------RRFIRNFS--QIAAPLTALTSTKVLFKWNTQAQEA 684
Query: 752 ---LEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWS 797
L+ + P+ S P Q Q + DAS++G G+ + +F S S
Sbjct: 685 FGALKSRFTSAPVLSIPDPEQ-QFIVEVDASEVGVGAVLSQHSSKDGKVHPCAFFSYRLS 743
Query: 798 REQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLS 855
++N+ I +E+ AV AL L+ + +V +D++ + Y+R K LS
Sbjct: 744 PAERNYDIGNRELLAVRLALGEWRHWLEGAAHPFLVWTDHKN-LEYVR---SAKRLSARQ 799
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + + PG+ N D+LSR
Sbjct: 800 ARWALFF----GRFNFVLSYRPGSKNIKPDALSR 829
>gi|326680703|ref|XP_003201595.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1353
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 115/454 (25%), Positives = 179/454 (39%), Gaps = 61/454 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +IQE L G+++ S G + F V K +G RP ++ +G
Sbjct: 412 PKGRLYSLSGPEREAMDRYIQESLNAGLIRPSSSPAG--AGFFFVKKRDGSLRPCIDYRG 469
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 470 LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 529
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + ++ V VYLDD L+ + + + +
Sbjct: 530 YLVLPFGLTNAPAVFQALVNDVLRDMVNKF--VFVYLDDILIFSSSLQEHTQHVRQVLQR 587
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L V +K A + FLG + R DK + A W +
Sbjct: 588 LLENQLFVKAEKCEFH-ARSVAFLGYVISAEGIRA--DPDK----------VRAVAKWPV 634
Query: 701 DSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK-------- 751
+ R L +L FA+F RR R S ++ AP + VL K
Sbjct: 635 PNTRKALQRFLGFANFY--------RRFIRNFS--QIAAPLTALTSTKVLFKWNTQAQEA 684
Query: 752 ---LEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWS 797
L+ + P+ S P Q Q + DAS++G G+ + +F S S
Sbjct: 685 FGALKSRFTSAPVLSIPDPEQ-QFIVEVDASEVGVGAVLSQRSSKDGKVHPCAFFSHRLS 743
Query: 798 REQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLS 855
++N+ I +E+ AV AL L+ + +V +D++ + Y+R K LS
Sbjct: 744 PAERNYDIGNRELLAVRLALGEWRHWLEGAAHPFLVWTDHKN-LEYVR---SAKRLSARQ 799
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + + PG+ N D+LSR
Sbjct: 800 ARWALFF----GRFNFVLSYRPGSKNIKPDALSR 829
>gi|66828741|ref|XP_647724.1| hypothetical protein DDB_G0267326 [Dictyostelium discoideum AX4]
gi|60475869|gb|EAL73800.1| hypothetical protein DDB_G0267326 [Dictyostelium discoideum AX4]
Length = 818
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 66/127 (51%)
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+PS +++G YM+ +D+ +AY HV + ++ + G +PFGL+TAP+ F
Sbjct: 7 LPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMPFGLSTAPRIFTM 66
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
L +LR + V+ YLDD L+V K +++L LG+ +NL+KS L P
Sbjct: 67 LLRPALRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMNLLVKLGFKLNLEKSVLEP 126
Query: 658 APVLQFL 664
+ L
Sbjct: 127 TQSITLL 133
>gi|326673018|ref|XP_003199777.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1185
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 373 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 430
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 431 LNSITVKNMYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 490
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 491 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 547
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 548 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVNWP 594
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 595 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTNLTSSKTP-FRWSNAA 642
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG------SQVD-----SSFLSGL 795
L +S+PI P + F + DAS++G G S +D ++ S
Sbjct: 643 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSALDGKIHPCAYFSHR 702
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 703 LSAAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIK---SAKRLNS 758
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 759 RQARWALFF----GRFNFSISYRPGSKNIKPDALSR 790
>gi|326680732|ref|XP_003201604.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1149
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 335 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 392
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 393 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 452
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 453 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 509
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 510 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVNWP 556
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 557 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTNLTSSKTP-FRWSNAA 604
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG------SQVD-----SSFLSGL 795
L +S+PI P + F + DAS++G G S +D ++ S
Sbjct: 605 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSALDGKIHPCAYFSHR 664
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 665 LSAAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIK---SAKRLNS 720
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 721 RQARWALFF----GRFNFSISYRPGSKNIKPDALSR 752
>gi|326675438|ref|XP_003200354.1| PREDICTED: hypothetical protein LOC100536324 [Danio rerio]
Length = 2414
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 119/483 (24%), Positives = 191/483 (39%), Gaps = 86/483 (17%)
Query: 452 PFSAKPPLVPLCS-----LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVP 506
P+ L+P S L L+ P AM +I+E LE +++ S G + F V
Sbjct: 370 PYDCAIELLPGTSPPKGRLYSLSAPEREAMDRYIRESLEADLIRPSSSPAG--AGFFFVK 427
Query: 507 KGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTH 566
K +G RP ++ +GLN ++ L LQ +DL AY + I+
Sbjct: 428 KKDGSLRPCIDYRGLNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGD 487
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+ A + LPFGL AP F +L N V + +R V VYLDD L+ ++
Sbjct: 488 EWKTAFNTPTGHFEYRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSES 545
Query: 627 PRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFL-------GIMWDPHLDRMWLPE 679
++ + + L V +K V FL GI DP R
Sbjct: 546 EQVHTQHVRQVLQRLLENQLYVKAEKCVFHSKSV-SFLGHIVSTEGIKADPAKVR----- 599
Query: 680 DKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA 738
A W + DS ++L +L FA+F RR R S + A
Sbjct: 600 --------------AVAKWPVPDSRKALQRFLGFANFY--------RRFIRNFS--SVAA 635
Query: 739 PHLTPINPAVLPKLEWW----------LNALPLSSPIF----PRQVQHFISTDASDLGWG 784
P LT + +P + W L + +++P+ P++ Q + DAS++G G
Sbjct: 636 P-LTALTSPKVPFI--WHSQAQEAFDVLKSRFITAPVLCLPDPKR-QFIVEVDASEVGIG 691
Query: 785 SQVD-----------SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMV 831
+ + +F S S ++N+ I +E+ AV AL L+ + +V
Sbjct: 692 AVLSQRSSRDGKVHPCAFFSHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLV 751
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
+D++ + Y+R K LS +F R + + PG+ N D+LSR
Sbjct: 752 WTDHKN-LEYIR---SAKRLSSRQARWALFF----GRFNFSLSYRPGSKNIKPDALSRLF 803
Query: 892 SLP 894
+P
Sbjct: 804 DVP 806
>gi|326664005|ref|XP_003197708.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1295
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 115/453 (25%), Positives = 191/453 (42%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 342 LSQPETEAMKNYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 399
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 400 YRYPLP---LVPATLEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 456
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N V + ++ V+VY+DD L+ + I ++ L I
Sbjct: 457 PFGLANSPSVFQAFVNEVFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLERLIQN 514
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
L + K + FLG + P M D+Q + + W +
Sbjct: 515 QLN--AKISKCEFHQT-CISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQPE 559
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL 761
+ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 560 TIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAVRA 608
Query: 762 ---------SSPIF--PRQVQHFI-STDASDLGWGSQVDSSFL-------SGLWSRE--- 799
SSPI P Q FI DAS G G+ + L +SR+
Sbjct: 609 FTQLKTRFSSSPILRHPDPEQPFIVEIDASSTGIGAILSQRSLITKKLHPCAFYSRKLNS 668
Query: 800 -QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
++N+ + +E+ A+ AL L+ + V +D++ + Y+R K L+
Sbjct: 669 AERNYDVGNRELLAMKAALEEWRHWLEGAKHPFTVITDHKN-LEYIR---SCKRLNPRQA 724
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 725 RWALFFTRFDFQV----TYIPGSKNIKADALSR 753
>gi|20143429|ref|NP_619548.1| Enzymatic polyprotein [Contains: Aspartic protease; Endonuclease;
Reverse transcriptase] [Figwort mosaic virus]
gi|130600|sp|P09523.1|POL_FMVD RecName: Full=Enzymatic polyprotein; Includes: RecName:
Full=Aspartic protease; Includes: RecName:
Full=Endonuclease; Includes: RecName: Full=Reverse
transcriptase
gi|58813|emb|CAA29527.1| unnamed protein product [Figwort mosaic virus]
Length = 666
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 100/431 (23%), Positives = 178/431 (41%), Gaps = 44/431 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVP----KGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ G++ + S + +S FLV + G R V+N K +NQ +L N
Sbjct: 258 QIKELLDLGLI--IPSKSQHMSPAFLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPN 315
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + L+ S D ++ V + Q+ A + +PFGL AP
Sbjct: 316 MQELLTLLRGKSIFSSFDCKSGFWQVVLDEESQKLTAFTCPQGHFQWKVVPFGLKQAPSI 375
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + + L +VY+DD ++ + + I+ G I++ +K++
Sbjct: 376 F---QRHMQTALNGADKFCMVYVDDIIVFSNSELDHYNHVYAVLKIVEKYGIILSKKKAN 432
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
L + FLG+ D P++ L NI + + + + LG L++A
Sbjct: 433 LFKEKI-NFLGLEIDKGTH---CPQNH--ILENIHK--FPDRLEDKKHLQRFLGVLTYAE 484
Query: 715 FVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
IP +L R Q L + + T + + K++ L + P P+ H I
Sbjct: 485 TYIP--KLAEIRKPLQVKLKKDVTWNWTQSDSDYVKKIKKNLGSFP--KLYLPKPEDHLI 540
Query: 775 -STDASDLGWGSQVDSSFLSGL----------WSREQQNWHINKKEMFAVHQALSLNLPL 823
TDASD WG + + L G+ + + ++N+H N KE+ AV Q ++
Sbjct: 541 IETDASDSFWGGVLKARALDGVELICRYSSGSFKQAEKNYHSNDKELLAVKQVITKFSAY 600
Query: 824 LQSSVVMVQSDNQTVVSYLR--RQGGTKSLSLLSEVEKIFLLSQDW--RIHILAQFIPGA 879
L V++DN+ +LR +G +K L+ Q+W + + + G
Sbjct: 601 LTPVRFTVRTDNKNFTYFLRINLKGDSKQGRLVR--------WQNWFSKYQFDVEHLEGV 652
Query: 880 YNSVADSLSRS 890
N +AD L+R
Sbjct: 653 KNVLADCLTRD 663
>gi|326673157|ref|XP_003199804.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1188
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 112/445 (25%), Positives = 177/445 (39%), Gaps = 43/445 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 368 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 425
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 426 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYDLVRIRPGDEWKTAFNTPRGHFE 485
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 486 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQ 542
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q + W
Sbjct: 543 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQAVVD-----------WP 589
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLN 757
DS ++L +L FA+F R S+ SL P + A KL+
Sbjct: 590 TPDSRKALQRFLGFANFYRRFIRNFSQLATPLTSLTSSKTPFRWSSAAEAAFSKLKGCFV 649
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
+ P+ P + Q + DAS +G G+ + ++ S S ++N+ I
Sbjct: 650 SAPILIAPDPSR-QFVVEVDASKVGVGAILSQRSASDGKVHPCAYFSHRLSPAERNYDIG 708
Query: 807 KKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ S V +V +D++ + Y+R K L+ +F
Sbjct: 709 NRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNSRQARWALFF-- 762
Query: 865 QDWRIHILAQFIPGAYNSVADSLSR 889
R + + PG+ N D+LSR
Sbjct: 763 --GRFNFTISYRPGSKNIKPDALSR 785
>gi|440792310|gb|ELR13538.1| RNAdirected DNA polymerase subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 774
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 88/356 (24%), Positives = 147/356 (41%), Gaps = 41/356 (11%)
Query: 331 ESVFKNVSDHLLQYVCGKRAECLESRRRLVEPRDPHLASLLLRARRGKKSSSPQNLEPPG 390
ESV + + LLQ G+ L +RRR D H S G +SP++L PG
Sbjct: 304 ESVEVDGEEGLLQ---GEGPVPLSTRRRREPGWDEHYES------DGLGETSPRHLGTPG 354
Query: 391 RVSLKVQTLQKPQRCSSPVNPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVSGYA 450
+ PV PA+ V GRL+ AW ++GA + VR +
Sbjct: 355 AGVV-------------PVTDPAELS-----VVGRLQARAAAWAQIGA-SDQVRSWIAHG 395
Query: 451 IPFSAKPPLVPLCSLQHLATPVSSAMSL-HIQEMLETGVLKRLD------STTGFLSRLF 503
+ F + + + ATP L ++ ++ TG + + + ++S +F
Sbjct: 396 VAFELRAEVSEDRRI-FPATPAQQTWLLAEVERLVATGAAELMGIGPSKPAGIRYVSPVF 454
Query: 504 LVPK-GNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPI 562
VPK G R V++++ +N ++ ++ + +G +MI+ DL+Q Y H+ +
Sbjct: 455 CVPKKGPKKWRLVIDMRRINLGIAERRVRFEGLSSVARVAGRGWWMITFDLAQGYHHLLV 514
Query: 563 KTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
+ + L LPFGL +P F + + R +G+ V YLDDF++
Sbjct: 515 EEESCQLLGFRVGDRWFRYRVLPFGLRISPWVFTKVVRAMVRDWRRQGIVVTSYLDDFVV 574
Query: 623 VNQDPRIL-EIQGKLAVSILGSLGWIVNLQKSSLSP---APVLQFLGIMWDPHLDR 674
+ D L I+ + L L W+ K P A VL + + W L R
Sbjct: 575 MAPDCEALRRIRDTVITPTLDQLRWLREPTKGEWEPTQCAEVLALVVVRWSHTLRR 630
Score = 44.3 bits (103), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 61/131 (46%), Gaps = 10/131 (7%)
Query: 789 SSFLS----GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRR 844
S+F+S G ++ ++ W I+ KEM V A+ + V +SDN VV+YLR
Sbjct: 637 STFVSLQARGAFAADKLAWPIHHKEMKPVELAVDTLGHYVAGRWVEFESDNVMVVAYLRD 696
Query: 845 QGGTKSLSLLSEVEKIFLLSQDWRIHIL-AQFIPGAY-NSVADSLSRSKSLPDWHLSRSA 902
GG + V +++L + + A++I G+ N AD LSR DW LS
Sbjct: 697 GGGPDPW-MTDVVRRVWLRAAAEGCGVYNARWIRGSTDNREADWLSRYSDTDDWELSWDT 755
Query: 903 T---EQIFLKW 910
EQ F W
Sbjct: 756 VAELEQQFGGW 766
>gi|326670195|ref|XP_003199158.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1208
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 114/453 (25%), Positives = 191/453 (42%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 420 LSQPETEAMKNYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 477
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 478 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 534
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 535 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIQN 592
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
L + K + FLG + P M D+Q + + W +
Sbjct: 593 QL--YAKISKCEFHQT-CISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQPE 637
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL 761
+ R L +L FA+F RR R S + AP LT + A +L+W +A
Sbjct: 638 TIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDATRA 686
Query: 762 ---------SSPIF--PRQVQHFI-STDASDLGWGSQVDSSFL-------SGLWSRE--- 799
S+PI P Q FI DAS+ G G+ + L +SR+
Sbjct: 687 FTQLKTRFSSAPILRHPDPEQPFIVEIDASNTGIGAILSQKSLVTKKLHPCAFYSRKLNS 746
Query: 800 -QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
+QN+ + +E+ A+ AL L+ + V +D++ + Y+R K L+
Sbjct: 747 AEQNYDVGNRELLAMKAALEEWRHWLEGAKHPFTVITDHKN-LEYIR---SCKRLNPRQA 802
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 803 RWALFFTRFDFQV----TYIPGSKNIKADALSR 831
>gi|22415757|gb|AAM94957.1| reverse transcriptase [Volvox carteri f. nagariensis]
Length = 829
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 111/503 (22%), Positives = 200/503 (39%), Gaps = 57/503 (11%)
Query: 469 ATPVSSAMSLHIQEMLETGVLKRL--DSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
AT ++ +++ L+ GV++ D+ + + V + +G R +N +N FL
Sbjct: 223 ATQHHEFVTAELRKALDRGVIREWPADAPSPTVVNGLRVVEKDGKLRLCINPMYINCFLR 282
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ +PS+L D++ + D Y+ + + +LA+ + G L LPF
Sbjct: 283 YRPVKYERLAEVPSYLLPEDWLYTTDDKSGYWQLSLHEREHTYLAMRWRGQTLFWPHLPF 342
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGW 646
GLA A + S+ V LR G+R+ +DD + + Q V +L +LG+
Sbjct: 343 GLAPACHLYTSMKLEVFRPLRQLGVRMSFLIDDQMGAAGSKAAAQFQCGAVVRLLAALGF 402
Query: 647 IVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSL 706
++L K L P ++FLG+ D +P+DK L L T L + +
Sbjct: 403 TLSLSKCQLIPRRRVRFLGMEVDAEAQAFRVPDDK-LARFRALLTQLEGERLTARQVAQV 461
Query: 707 LGYLSFASFVIPMGRLHSRRIQR-------------QASLLR--------LGAPHLTPI- 744
G + + + L++R + R A +LR LG + T
Sbjct: 462 AGKIIAMTPAVTTAPLYARMVWRVARDVAWDEEVWDSAEVLRQAGLFMELLGRRNGTATW 521
Query: 745 --NPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQN 802
PA+ E L R F+ +LG S++ F + R Q+N
Sbjct: 522 RKGPALRLTTE-------LVGDASDRAFAAFLP--GEELGANSRMLVPFKAQETQRLQRN 572
Query: 803 -WHINKKE----MFAVHQALSLNLPLLQSSVVMVQSDNQ-TVVSYLRRQGGTKSLSLLSE 856
+ ++E ++++H LL V Q+D+Q + +G L +++E
Sbjct: 573 DFSSTERELRALLYSLHWLREQAPNLLYGRTVQYQTDSQPAEFCMVGMKGNAACLPIVAE 632
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYN--SVADSLSRSKSLPDWHLSRSATEQIFLKWGVPC 914
+ + L D I + P + AD+LS+ + W L+ + ++ W PC
Sbjct: 633 IHR---LCADTDTDISVVWYPRSREQQQQADALSKYEDGSQWMLNPTVYAKL---WEHPC 686
Query: 915 I-------DLFASRVSAVVPNHF 930
+ D+FA + VP F
Sbjct: 687 VHGRSPSLDVFADAHTTKVPGSF 709
>gi|313243963|emb|CBY14844.1| unnamed protein product [Oikopleura dioica]
Length = 1285
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 104/421 (24%), Positives = 167/421 (39%), Gaps = 76/421 (18%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+ +M G+++ + + + + + ++PK +GG R V++L+GLN + +K+ L N +
Sbjct: 410 ELAKMTRAGIIREVQVGSPYNAPMSVIPKRSGGLRIVVDLRGLNNVIKGQKWPLPNLTEL 469
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA-S 597
+ S D + A+FH+ I Q A S LP G +P FA +
Sbjct: 470 LEGFKDAKLFTSYDFTSAFFHIVIDEKSQPLTAFSAMNRQYMYQRLPMGCKVSPHIFAHA 529
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLL--VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
++ + ++ RVV YLDD + N++ + I+ GW + L K
Sbjct: 530 IAKTIPEEMQG---RVVSYLDDLVSFDTNEEAHLQNIERMFRA--FREYGWKLKLAKCGF 584
Query: 656 SPAPVLQFLG--IMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+F+G I D + KQ + I R L + T + R G +F
Sbjct: 585 CMRET-EFVGHEISADGY-------RPKQDNVDRIER--LPTPT-SKKEVRGFTGACAFY 633
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPH-LTPINPAVLPKLEWWLNALPLSSPIFPR---- 768
S IP A L L H LT N K EW P F +
Sbjct: 634 SNAIP------------ALQLTLSPLHDLTAKN----KKFEWG----PECETAFQKAKIA 673
Query: 769 --------------QVQHFISTDASDLGWGSQVDS----------SFLSGLWSREQQNWH 804
+ + +TDASD+G+G V F+SG + Q W
Sbjct: 674 IGKRNRLAFLSDDARTKIICTTDASDVGFGCMVSQLISTGTEQPIKFMSGKFKGASQRWP 733
Query: 805 INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSL-----SLLSEVEK 859
I +KE+ A +AL P+L + ++DN+ +S L Q TKS L+ +EK
Sbjct: 734 IFEKELAAFIRALETFRPILLGRPFLWRTDNK-ALSTLLVQATTKSAKEPSPKLMRWIEK 792
Query: 860 I 860
I
Sbjct: 793 I 793
>gi|397559412|gb|AFO54491.1| reverse transcriptase [Rose yellow vein virus]
Length = 819
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 110/449 (24%), Positives = 190/449 (42%), Gaps = 71/449 (15%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKG----NGGTRPVLNLKGLNQFLSPKKFSLINH 535
I E+L ++++ +S F + F+V G R V+N K LN L +K N
Sbjct: 405 INELLRLKLIRKTNSPWSF--QAFMVRNHAEIVRGKARMVINYKPLN--LRIRK----NA 456
Query: 536 FRIPS----FL--QKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
+RIP+ FL ++ + D ++ VP++ + A S +PFGLA
Sbjct: 457 YRIPNKDSLFLAIRESQFYSKFDCKSGFYQVPMEQDSIQLTAFSTPIGSYEWLVMPFGLA 516
Query: 590 TAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL---G 645
TAP F A + N +L +VY+DD ++ + R LE ++I +L G
Sbjct: 517 TAPSIFQAKMDN----VLEDHHDYCLVYIDDIIVFS---RTLEEHKIHVITIAKTLKKNG 569
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN-ILRTL--LASKTWNLDS 702
+++ +K L + FLG E+ ++ L N +L L S+ +
Sbjct: 570 IVISKKKMELGLTKI-NFLGCE----------IENGRIILQNHVLENLSKFPSEIKDKKE 618
Query: 703 ARSLLGYLSFAS--FVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP 760
+S LG +++A+ + I + +L R+ Q L + T + ++ +++ LP
Sbjct: 619 LQSFLGIINYAASHYSIEVTKL---RVPLQKKLKKNYIWSWTEQDKQIVEQIKTICQNLP 675
Query: 761 LSSPIFPRQVQHFI-STDASDLGWGSQVD----------------SSFLSGLWSREQQNW 803
P+ + +TDASD W + S + SG W++ +QNW
Sbjct: 676 ALE--LPKNGDKLVLTTDASDKHWAGVLQFYRKIEQEVFEKDLRVSRYCSGTWNQTEQNW 733
Query: 804 HINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLL 863
KE+ A+ AL L + SDN V+++L++ K + L
Sbjct: 734 STFGKELRAIKLALQ-KFKLFLFEPFTLYSDNLAVINFLKKDLNEKRSQREIRDKLDILQ 792
Query: 864 SQDWRIHILAQFIPGAYNSVADSLSRSKS 892
Q W + + IPG N +AD+L+R S
Sbjct: 793 YQGW---MTLKHIPGTKNVLADALTRGLS 818
>gi|326666824|ref|XP_003198387.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1239
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 114/453 (25%), Positives = 192/453 (42%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 421 LSQPETEAMKNYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 478
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 479 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 535
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N V + ++ V+VY+DD L+ + I ++ L I
Sbjct: 536 PFGLANSPSVFQAFVNEVFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIQN 593
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
L + K + FLG + P M D+Q + + W +
Sbjct: 594 QL--YAKISKCEFHQT-CISFLGYIISPEGVAM----DQQ--------KVDSMTQWPQPE 638
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL 761
+ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 639 TIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAIRA 687
Query: 762 ---------SSPIF--PRQVQHFI-STDASDLGWGSQVDSSFL-------SGLWSRE--- 799
S+PI P Q FI DAS+ G G+ + L +SR+
Sbjct: 688 FTQLKTRFSSAPILRHPDPEQPFIVEIDASNTGIGAILSQRSLVTKKLHPCAFYSRKLNS 747
Query: 800 -QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
++N+ + +E+ A+ AL L+ + V +D++ + Y+R K L+
Sbjct: 748 AERNYDVGNRELLAMKAALEEWRHWLEGAKDPFTVITDHKN-LEYIR---SCKGLNPRQA 803
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 804 RWALFFTRFDFQV----TYIPGSKNIKADALSR 832
>gi|326668656|ref|XP_003198848.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1153
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 185/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 338 PKGKLYSLSIPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 395
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL A V +K H+ A
Sbjct: 396 LNSITVKNTYPLPLMSSAFERLQGASFFTKLDLRNACHLVRMKQGHEWKTAFLTPRGHFE 455
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ ++ + + +
Sbjct: 456 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSRSLQEHVQHVRRVLQ 512
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 513 RLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKVQ-----------AVVNWP 559
Query: 700 L-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
+ +S ++L +L FA+F RR R S +L AP LT + A P W
Sbjct: 560 IPESRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSAKTP-FRWSSAA 607
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGL 795
L + +S+PI P + F + DAS++G G+ + +F S
Sbjct: 608 QVAFTKLKSRFVSAPILVTPDPSRQFVVEVDASEVGVGAILSQRAASDDRIHPCAFFSHR 667
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 668 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 723
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N D+LSR
Sbjct: 724 RQARWALFFGRFDFTI----SYRPGSKNIKPDALSR 755
>gi|317419138|emb|CBN81175.1| Pol polyprotein [Dicentrarchus labrax]
Length = 1738
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 114/450 (25%), Positives = 183/450 (40%), Gaps = 51/450 (11%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L +L+ P AM +I + L G+++ S G + F V K + RP ++ +
Sbjct: 347 LPTSRLYNLSRPEREAMEKYIGDSLAAGLIRPSSSPVG--AGFFFVTKKDQTLRPCIDYR 404
Query: 520 GLNQFLSPKKFSL--INHFRIPSF--LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
GLN K+ L I+ P+F L K +DL AY V I+ + A +
Sbjct: 405 GLNDITIKNKYPLPLID----PAFEPLHKAQVFSKLDLRNAYHLVRIREGDEWKTAFNTP 460
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP F +L N V + +R + VYLDD L+ +Q +
Sbjct: 461 LGHFEYLVMPFGLTNAPAVFQALVNDVLRDMLNRFL--FVYLDDILIFSQSQEEHVQHVR 518
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
+ L V +K V FLG + ++R + D + A
Sbjct: 519 QVLQRLLENKLFVKAEKCEFHVTSV-SFLGFI----IERGQVKADPA--------KIQAV 565
Query: 696 KTWNLDSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTP--INPAVLPKL 752
W + R L +L FA+F R +S+ L + P P A L
Sbjct: 566 ADWPSPTTRKQLQRFLGFANFYRRFIRNYSKVAAPLTKLTSVKVPFAWPPEAETAFLALK 625
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
E + +A L P +Q + DASD G G+ + +FLS S ++
Sbjct: 626 ELFTSAPVLRHP--DPSLQFVVEVDASDTGVGAVLSQRSPKDQKLHPCAFLSRRLSPAER 683
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
N+ + +E+ AV AL L+ + + +V +D++ ++YLR K L+
Sbjct: 684 NYDVGNRELLAVVVALQEWRHWLEGAALPFIVWTDHKN-LAYLR---SAKRLNSRQARWA 739
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+FL R + PG+ N+ D+LSR
Sbjct: 740 LFLD----RFVFTLTYRPGSRNAKPDALSR 765
>gi|6466937|gb|AAF13073.1|AC011621_1 putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1661
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 95/418 (22%), Positives = 182/418 (43%), Gaps = 36/418 (8%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
++EML +++ S + + S + LV K +GG R ++ + LN+ P K+ + +
Sbjct: 735 VREMLNAQIIR--PSVSPYSSPVLLVKKKDGGWRFCVDYRALNEATIPDKYPIPVIEELL 792
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L+ +DL YF + +K + A + +PFGL AP F S+
Sbjct: 793 DELKGATVFSKLDLKSGYFQIRMKLSDVEKTAFKTHEGHYEFLVMPFGLTNAPSTFQSVM 852
Query: 600 NWVASLLRSRGMR-VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
N L R + V+V+ DD L+ + D + + + +L + N +K +
Sbjct: 853 N---DLFRPYLRKFVLVFFDDILVYSPDMKTHLKHLETVLQLLHLHQFYANFKKCTFGST 909
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVI 717
+ +LG H+ + E T + +L W L S L G+L F +
Sbjct: 910 RI-SYLG-----HI----ISEQGVATDPEKVEAMLQ---WPLPKSVTELRGFLGFTGYYR 956
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF-IST 776
+ + + + L+ + + L+ ++ALP+ + P Q F + T
Sbjct: 957 RFVKNYGQIARPLRDQLKKNSFDWNEAATSAFQALKAAVSALPVL--VLPDFQQEFTVET 1014
Query: 777 DASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMV 831
DAS +G G+ + +FLS +S + + + ++E+ A+ +A++ L S ++
Sbjct: 1015 DASGMGIGAVLSQNKRLIAFLSQAFSSQGRIRSVYERELLAIVKAVTKWKHYLSSKEFII 1074
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
++D +++ L + KS+S + + L +RI ++ PG N VAD+LSR
Sbjct: 1075 KTDQRSLRHLLEQ----KSVSTIQQRWASKLSGLKYRI----EYKPGVDNKVADALSR 1124
>gi|301627834|ref|XP_002943073.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1474
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 112/445 (25%), Positives = 186/445 (41%), Gaps = 54/445 (12%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I+E LE G ++ S G + F V K +GG RP ++ +GLN
Sbjct: 185 LSLPETQAMEEYIKENLERGFIRPSTSPAG--AGFFFVEKKDGGLRPCIDYRGLN----- 237
Query: 528 KKFSLINHFRIPSFLQ-----KGDYMIS-IDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P + KG + S +DL AY + I+ + A +
Sbjct: 238 -KITVKNRYPLPLISELFDRVKGATIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEY 296
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F L N + L R VVVYLDD L+ + + + + L
Sbjct: 297 LVMPFGLCNAPAVFQELVNDIFRDLLGRS--VVVYLDDILIYSNSLSDHRVHVQEVLLRL 354
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD 701
L+K +S + FLG + H P Q L K
Sbjct: 355 RQNHLYAKLEK-CISEVSSVHFLGFIIS-HKGLEMDPAKVQAIL----------KWVQPL 402
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINP-AVLPKL-EWWLNA 758
S R++ +L FA++ +S + +L + G P + + KL E +++A
Sbjct: 403 SLRAIQRFLGFANYYRQFINGYSTLVAPITALTKKGVDPSIWSVEALTAFKKLKEAFISA 462
Query: 759 LPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINK 807
L P + + DAS++G G+ + F S +S + N+ I
Sbjct: 463 SVLLHP--DSALPFLVEVDASEVGAGAILSQCHPVTNKVHPCGFFSKKFSSTEMNYDIGN 520
Query: 808 KEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
+E+ A+ A + LL+ + VV V +D++ ++ Y+ K L+ +F
Sbjct: 521 RELLAIKLAFTEWRHLLEGAKHVVTVITDHKNLL-YIE---SAKRLNPRQARWALFFS-- 574
Query: 866 DWRIHILAQFIPGAYNSVADSLSRS 890
R + + + PG N AD+LSRS
Sbjct: 575 --RFNFIITYRPGEKNVKADALSRS 597
>gi|427780157|gb|JAA55530.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 1206
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 116/471 (24%), Positives = 186/471 (39%), Gaps = 73/471 (15%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFL 504
I +G A+P P V L Q + + +ML G+++R S++ + S + L
Sbjct: 207 IETGDALPLKCNPRPVSLAKRQ--------IIDGLLDDMLSAGIIRR--SSSSWASPIVL 256
Query: 505 VPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKT 564
VPK +G R ++ + LN + L I L Y ++D S+ Y V +
Sbjct: 257 VPKKDGSHRLCVDYRRLNGVTRKDAYPLPTISSIVGNLGDARYFTTLDASKGYLQVRMGE 316
Query: 565 THQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVN 624
Q A + + + T +PFGL AP F L + V L ++ + YLDD ++ +
Sbjct: 317 RDQFKTAFTSHRGLFEFTRMPFGLCNAPATFQRLMDRV--LGEAKWSYCMCYLDDIVIYS 374
Query: 625 QDPRILEIQGKLAVSILGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK 681
R E +L + G +N K+ L+ + HL L E
Sbjct: 375 ---RTFEEHLAHVADVLERVRAAGMTLNPAKAQLAQTRI----------HLLGFTLGEGS 421
Query: 682 QLTLGNILRTLLASKTWNLDSA-RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
LR +L S R LG ++F IP S RL AP
Sbjct: 422 IEPDREKLRAILDFPVPKDGSGLRRFLGMVNFYRSFIP-------------SCARLQAPL 468
Query: 741 LTPINPAVLPKLEWWLNA----LPLSSPI-------FPRQVQHF-ISTDASDLGWGS--- 785
+ + K +W LSS I P + F + TDASDLG G+
Sbjct: 469 TKLLGKSA--KWQWGPEQQEAFCRLSSAIAETAQLRLPDLTRPFVVQTDASDLGLGAVLL 526
Query: 786 -QVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVV 839
+ D +F S ++N+ + +KE A+ AL L + +VQ+D+ +
Sbjct: 527 QEYDGVLQPLAFASRSLVPAEKNYSVTEKECLAIVFALRKFDVYLDGTKFVVQTDHSALS 586
Query: 840 SYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+R + L+ + LL Q + + Q+ G+ N VAD+LSR+
Sbjct: 587 WLMRLREPAGRLA------RWALLIQHYDFAV--QYRKGSTNVVADALSRA 629
>gi|317419716|emb|CBN81752.1| Pol polyprotein [Dicentrarchus labrax]
Length = 1450
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 117/460 (25%), Positives = 187/460 (40%), Gaps = 71/460 (15%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L +L+ P AM +I + L G+++ S G + F V K + RP ++ +
Sbjct: 431 LPTSRLYNLSRPEREAMEKYIGDSLAAGLIRPSSSPVG--AGFFFVTKKDQTLRPCIDYR 488
Query: 520 GLNQFLSPKKFSL--INHFRIPSF--LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
GLN K+ L I+ P+F L K +DL AY V I+ + A +
Sbjct: 489 GLNDITIKNKYPLPLID----PAFEPLHKAQVFSKLDLRNAYHLVRIREGDEWKTAFNTP 544
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP F +L N V + +R + VYLDD L+ +Q +
Sbjct: 545 LGHFEYLVMPFGLTNAPAVFQALVNDVLRDMLNRFL--FVYLDDILIFSQSQEEHVQHVR 602
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
+ L V +K V FLG + ++R + D + A
Sbjct: 603 QVLQRLLENKLFVKAEKCEFHVTSV-SFLGFI----IERGQVKADPA--------KIQAV 649
Query: 696 KTWNLDSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW 754
W + R L +L FA+F RR R S ++ AP LT + +P W
Sbjct: 650 ADWPSPTTRKQLQRFLGFANFY--------RRFIRNYS--KVAAP-LTKLTSVKVP-FAW 697
Query: 755 ---------WLNALPLSSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSF 791
L L S+P+ +Q + DASD G G+ + +F
Sbjct: 698 SPEAENAFLALKELFTSAPVLHHPDPSLQFVVEVDASDTGVGAVLSQRSPKDQKLHPCAF 757
Query: 792 LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTK 849
LS S ++N+ + +E+ AV AL L+ + + +V +D++ ++YLR K
Sbjct: 758 LSRRLSPAERNYDVGNRELLAVVVALQEWRHWLEGAALPFIVWTDHKN-LAYLR---SAK 813
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L+ +FL R + PG+ N+ D+LSR
Sbjct: 814 RLNSRQARWALFLD----RFVFTLTYRPGSRNAKPDALSR 849
>gi|326669639|ref|XP_003199054.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1279
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 110/453 (24%), Positives = 193/453 (42%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM+ +I E LE G ++ ST+ + F V K +G RP ++ +GLN+ S
Sbjct: 516 LSQPETEAMNSYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITSK 573
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY V I+ + S N +
Sbjct: 574 YRYPLP---LVPAALEQLRSAQYFTKLDLRNAYNLVRIRQGDEWKTGFSTNNGHYEYLVM 630
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD--PRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 631 PFGLANSPSVFQAFINEIFRDMLNKW--VIVYIDDILIYSNSLPEHIQHVRAVLQRLIQN 688
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
L + K + FLG + + + D+Q + A W +
Sbjct: 689 QL--YAKVSKCEFHQT-CISFLGYI----ISHEGVAMDQQ--------KVDAVTQWPQPE 733
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW------- 754
+ R L +L FA+F + R I+ +S + AP LT + A +L+W
Sbjct: 734 AIRQLQRFLGFANF-------YGRFIRNFSS---VAAP-LTAMVKANNARLKWNSEAIRA 782
Query: 755 --WLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWSR 798
L A +PI P + FI +AS+ G G+ + +F S +
Sbjct: 783 FNQLKARFTEAPILCHPDSTRPFIVEIEASNSGIGAILSQRSPTTNKLHPCAFYSRKLNS 842
Query: 799 EQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
++N+ + +E+ A+ AL L+ + V +D++ + Y+R K L+
Sbjct: 843 AERNYDVGNRELLAMKAALEEWRHWLEGAKHPFTVITDHKN-LEYIR---SCKRLNPGQA 898
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N D+LSR
Sbjct: 899 RWALFFTRCDFQV----TYIPGSKNIKGDALSR 927
>gi|326673646|ref|XP_003199949.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1320
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 122/455 (26%), Positives = 184/455 (40%), Gaps = 63/455 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + +F V K +G RP ++ +G
Sbjct: 417 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGVFFVKKKDGSLRPCIDYRG 474
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V I+ + A +
Sbjct: 475 LNAITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRIREGDEWKTAFNTPRGHFE 534
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL+ AP F +L N V L + VYLDD L+ + + + +
Sbjct: 535 YCVLPFGLSNAPAVFQALVNDV--LRDMLDQFIYVYLDDILIFSHSLQEHVQHVRRVLQR 592
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 593 LLENGLYVKAEKCVFH-AQSVPFLGHIVSVEGMRM-DPEKVQ-----------AVVDWPT 639
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW----- 754
DS ++L +L FA+F RR R S +L AP LT + P W
Sbjct: 640 PDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTALTSLKTP-FRWSNAAQ 687
Query: 755 ----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG---SQVDSS--------FLSGLW 796
L + +S+PI P + F + DAS++G G SQ SS + S
Sbjct: 688 VAFDRLKSCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKMHPCAYFSHRL 747
Query: 797 SREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLL 854
+ +QN+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 748 NNAEQNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIQ---SAKRLNSR 803
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N D+LSR
Sbjct: 804 QARWALFFGRFDFSI----SYRPGSKNVKPDALSR 834
>gi|326671833|ref|XP_003199533.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1159
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 114/455 (25%), Positives = 180/455 (39%), Gaps = 63/455 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L G+++ S G F V K +G RP ++ +G
Sbjct: 410 PRGRLYSLSGPERVAMDKYISDSLAAGLIRSSSSPAGV--GFFFVEKKDGSLRPCIDYRG 467
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ + +DL AY V I+ + A +
Sbjct: 468 LNDITIKNRYPLPLMSSAFELLQGSGFFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 527
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAV 638
+PFGL AP F +L N V + ++ V VYLDD L+ + I ++ L
Sbjct: 528 YRVMPFGLTNAPAVFQTLVNDVLRDMVNKF--VFVYLDDILIFSHSLQEHIQHVRQVLQR 585
Query: 639 SILGSLGWIVNLQKSSLSPAPVLQFL----GIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ L P L F+ GI DP R A
Sbjct: 586 LLENQLYIKAEKCLFHTRSVPFLGFIVSAEGIRVDPAKVR-------------------A 626
Query: 695 SKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN-----PAV 748
W DS ++L +L F++F R +S Q ASL L + TP A
Sbjct: 627 VSNWPTPDSRKALQRFLGFSNFYCRFVRNYS---QIAASLTALTSTK-TPFQWSSQAQAA 682
Query: 749 LPKLEWWLNALPLSSPIFPRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLW 796
+L+ ++ P+ S FP + FI DAS++G G+ + ++ S
Sbjct: 683 FERLKTCFSSAPVLS--FPDPERQFIVEVDASEVGVGAVLSQRSLADGKVHPCAYFSHRL 740
Query: 797 SREQQNWHINKKEMFAVHQALSLNLPLLQ--SSVVMVQSDNQTVVSYLRRQGGTKSLSLL 854
S ++N+ I +E+ AV AL L+ S +V +D++ + Y+R K L+
Sbjct: 741 SPAERNYDIGNRELLAVKLALDEWRHWLEGTSEPFLVWTDHKN-LEYVR---SAKRLNSR 796
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + + PG+ N D+LSR
Sbjct: 797 QARWALFF----GRFNFILSYRPGSKNVKPDALSR 827
>gi|326675771|ref|XP_003200428.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1491
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 118/483 (24%), Positives = 205/483 (42%), Gaps = 81/483 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 334 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNKITVK 391
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 392 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 448
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 449 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIKN 506
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 507 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 549
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 550 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 598
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 599 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 658
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 659 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 714
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR-----SKSLPDWHLSRSATEQIFLK 909
+F D+++ +IPG+ N AD+LSR + +PD + +S ++
Sbjct: 715 QARWALFFTRFDFQV----TYIPGSKNIKADALSRLSDDETSEIPDEPIIKSPQIVAPIQ 770
Query: 910 WGV 912
W +
Sbjct: 771 WDI 773
>gi|326670560|ref|XP_003199240.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1320
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 122/455 (26%), Positives = 183/455 (40%), Gaps = 63/455 (13%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 417 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 474
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V I+ + A +
Sbjct: 475 LNAITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRIREGDEWKTAFNTPRGHFE 534
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL+ AP F +L N V L + VYLDD L+ + + + +
Sbjct: 535 YCVLPFGLSNAPAVFQALVNDV--LRDMLDQFIYVYLDDILIFSHSLQEHVQHVRRVLQR 592
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 593 LLENGLYVKAEKCVFH-AQSVPFLGHIVSVEGMRM-DPEKVQ-----------AVVDWPT 639
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW----- 754
DS ++L +L FA+F RR R S +L AP LT + P W
Sbjct: 640 PDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTALTSLKTP-FRWSNAAQ 687
Query: 755 ----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG---SQVDSS--------FLSGLW 796
L + +S+PI P + F + DAS++G G SQ SS + S
Sbjct: 688 VAFDRLKSCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSSDGKMHPCAYFSHRL 747
Query: 797 SREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLL 854
+ +QN+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 748 NNAEQNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIQ---SAKRLNSR 803
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N D+LSR
Sbjct: 804 QARWALFFGRFDFSI----SYRPGSKNVKPDALSR 834
>gi|326680172|ref|XP_003201468.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1222
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 111/457 (24%), Positives = 189/457 (41%), Gaps = 81/457 (17%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P +AM L+I+E L G+++ S G + F V K +GG RP ++ +GLN
Sbjct: 326 LSLPERTAMDLYIEESLAAGIIRPSTSPAG--AGFFFVGKKDGGLRPCIDYRGLN----- 378
Query: 528 KKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P LQ +DL AY V I+ + A +
Sbjct: 379 -KITIRNRYPLPLMSTAFEMLQGASIFTKLDLRNAYHLVRIRQGDEWKTAFNTPTGHYEY 437
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV-NQDPRILEIQGKLAVSI 640
+PFGL AP F +L N V + ++ V VYLDD L+ N ++ K+ +
Sbjct: 438 LVMPFGLTNAPAVFQALINDVLRDMLNKF--VFVYLDDILIFSNSFQEHVQHVHKVLRHL 495
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L + +I +K + V +FLG + +P +M D Q + A W +
Sbjct: 496 LDNHLYI-KPEKCQFHVSQV-KFLGFVIEPGQIQM----DPQ--------KIQAVVDWPS 541
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW---- 755
S + + +L FA+F R+ S + AP L + + + W
Sbjct: 542 PSSVKEVQRFLGFANFY--------RKFILNFST--VAAP-LFALTKGNMIRFRWGPEAE 590
Query: 756 -----LNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD----------SSFLSGLWS 797
L S+PI P + F + DAS++G G+ + +FLS +
Sbjct: 591 EAFKILKKRFTSAPILLIPNADEPFTVEVDASEVGVGAVLSQRGEDKRLHPCAFLSHRLT 650
Query: 798 REQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEV 857
++N+H+ +E+ AV AL L+ + + Q + + K+L + +
Sbjct: 651 PTERNYHVGDRELLAVKLALEEWRHWLEGA----KHPFQVLTDH-------KNLEYIQQA 699
Query: 858 EKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
+++ W R H + PG+ N D+LSR
Sbjct: 700 KRLNPRQARWSLFFNRFHFTLTYRPGSKNLKPDALSR 736
>gi|348521920|ref|XP_003448474.1| PREDICTED: hypothetical protein LOC100710857 [Oreochromis
niloticus]
Length = 369
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 72/143 (50%), Gaps = 5/143 (3%)
Query: 430 VDAWIRLGAPAPLVRIVS-GYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGV 488
+ +W A ++R + GY + F+ PP + + I + + G
Sbjct: 71 IKSWATCTQSAWVLRTLEKGYRLQFNVAPPHFKEIIYSRALGESAGFLLEEISTLRDKGA 130
Query: 489 LKRL---DSTTGFLSRLFLVPKGNG-GTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK 544
++ + + +GF SR FLVPK G G R +L+L+ LN+ L KF ++ H + F+Q
Sbjct: 131 IRVVPPEEMHSGFYSRYFLVPKKGGRGMRAILDLRALNRHLRTYKFRMLTHASLLWFVQA 190
Query: 545 GDYMISIDLSQAYFHVPIKTTHQ 567
GD+ SIDL AYFH+PI H+
Sbjct: 191 GDWFTSIDLRDAYFHIPIYPPHR 213
>gi|391331905|ref|XP_003740380.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1476
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 101/420 (24%), Positives = 184/420 (43%), Gaps = 37/420 (8%)
Query: 482 EMLETG-VLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPS 540
E LET V++R++S+ ++S L +V + +G R ++L+ +N + F L + + +
Sbjct: 486 ERLETSDVIERIESSE-WISALVVVSRKDGRIRLCVDLRAVNAAIVADVFPLPHFEDLLT 544
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
L +D AY V + + A + +PFGLA+AP AF L
Sbjct: 545 RLGSARVFSKLDARSAYHQVELAENSRDLTAFITPWGLFRFKRVPFGLASAPAAFQRLME 604
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
+ + V++YLDD L+ ++ R + + + + + G ++N + +
Sbjct: 605 QILWGIEG----VIIYLDDVLIFGENEREHDDRLQAVLLAIRKAGMVLNAK--CVIRVTE 658
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
++F+G R T GN+ + N +S+LG SF +P
Sbjct: 659 IEFVGYSIGAGGIRP--------TAGNMRAIEDLPEPRNASQIKSVLGTTSFYMRCVPNF 710
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPA-VLPKLE-WWLNALPLSSPIFPRQVQHFISTDA 778
+ ++R LL+ +P + KL+ +NA PL+ +F + ++TDA
Sbjct: 711 STIAEPLRR---LLKADSPFVWGKEQGDAFKKLKNEIVNAKPLA--VFDHTKETIVATDA 765
Query: 779 SDLGWGS---QVDS------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV 829
S++G G+ QV + SF S S Q + +KE AV A+ L
Sbjct: 766 SNVGCGACLLQVHADGERPVSFASCALSDAQMKYSAGEKEALAVVFAVERWRIFLYGRRF 825
Query: 830 MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
V++D+Q +V+ L G T S+ + + +++ + ++ PGA N+V D LSR
Sbjct: 826 RVRTDHQALVALL---GSTTSVRASMRIARWAERLREYNFSV--EYKPGANNNVPDMLSR 880
>gi|301617641|ref|XP_002938255.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1213
Score = 77.0 bits (188), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 124/500 (24%), Positives = 210/500 (42%), Gaps = 65/500 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I++ LE G ++ S G + F V K +G RP ++ +GLN+
Sbjct: 435 LSLPETKAMEEYIKDNLERGFIRPSSSPAG--AGFFFVSKKDGWLRPCIDYRGLNKITVK 492
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ +DL +AY + IK + A + +PFG
Sbjct: 493 NRYPLPLISELFDRVKGASIFTKLDLRRAYNLIRIKEGDEWKTAFNTRDGHYEYLVMPFG 552
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVN---QDPR--ILEIQGKLAVSILG 642
L AP F N + L R VVVYLDD L+ + +D R +LE+ +L L
Sbjct: 553 LCNAPAVFQEFVNDIFRDLLGR--HVVVYLDDILIYSSNLEDHRCHVLEVLLRLRQHHL- 609
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDS 702
L+K + P + FLG + H P + L N + L S
Sbjct: 610 ----YAKLEK-CIFEVPSVHFLGYIIS-HQGLEMEPTKVEGIL-NWAQPL---------S 653
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP----KLEWWLNA 758
R++ +L FA++ + S + +L + G ++P++ K L
Sbjct: 654 LRAIQRFLGFANYYRQFVKGFSSLVAPITALTKKG------VDPSIWSSEAIKAFKLLKE 707
Query: 759 LPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWH 804
+S+P+ P F + DAS++G G+ + +F S +S + N+
Sbjct: 708 AFISAPVLLHPDSTLPFLVEVDASEVGAGAVLSQRHPVTCKVHPCAFFSRKFSPTEANYD 767
Query: 805 INKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
I KE+ AV A LL+ + V V +D++ ++ Y+ K L+ +F
Sbjct: 768 IGNKELLAVKWAFEEWKHLLEGAKHPVTVFTDHKNLL-YIE---SAKRLNPRQARWALFF 823
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSRS-KSLPDWHLSRSAT--EQIFLKWGVPCIDLFA 919
R + + + PG+ N AD+LSRS S P HL +++ + P DL A
Sbjct: 824 T----RFNFILTYRPGSKNIKADALSRSFVSCPKEHLEPDTILPKEVIVAALSP--DLLA 877
Query: 920 SRVSAVVPNHFQVSRHVAIL 939
S A + H S+ ++L
Sbjct: 878 SLSEAQIAGHPGQSKTCSLL 897
>gi|384501230|gb|EIE91721.1| hypothetical protein RO3G_16432 [Rhizopus delemar RA 99-880]
Length = 359
Score = 77.0 bits (188), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 95/212 (44%), Gaps = 9/212 (4%)
Query: 511 GTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFL 570
GT+PV L L P+ +IN Q G + A+ H+ + + +++L
Sbjct: 149 GTQPVRRLSSLQDGNHPRH--IIND-------QTGRSLSRYRSFDAFLHLHLHPSSRKYL 199
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
+ + PFGL P F + + R +G+R+ YLDD++++ + I
Sbjct: 200 RFVWQNRLFQFRTTPFGLNIVPFWFTKTTRLIVQWARQQGIRLSAYLDDWIIMGETKEIF 259
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
+ + +S L +LGW+VN +KS+ P+P ++ LG + + + LP K + L+
Sbjct: 260 QRHLQKVLSCLRNLGWLVNDEKSNFEPSPTIEHLGFVLNTITMQASLPGKKLRDIQRSLK 319
Query: 691 TLLASKTWNLDSARSLLGYLSFASFVIPMGRL 722
L + + ++ L A+F + RL
Sbjct: 320 QLFQNPVQTPRRVQGVIMRLQAATFAVFPARL 351
>gi|326674161|ref|XP_003200083.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1377
Score = 77.0 bits (188), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 186/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 526 PKGKLYSLSIPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 583
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V +K ++ A
Sbjct: 584 LNSITVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRMKQGNEWKTAFLTPRGHFE 643
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ ++ + + +
Sbjct: 644 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSRSLQEHVQHVRRVLQ 700
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L G V +K A + FLG + RM PE Q A W
Sbjct: 701 RLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKVQ-----------AVVNWP 747
Query: 700 L-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW--- 755
+ +S ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 748 IPESRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSSKTP-FRWSSAA 795
Query: 756 ------LNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGL 795
L + +S+PI P + F + DAS++G G+ + +F S
Sbjct: 796 QAAFSNLKSRFVSAPILVTPDPSRQFVVEVDASEVGVGAILSQRAASDDRIHPCAFFSHR 855
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K LS
Sbjct: 856 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLSS 911
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+ I + PG+ N D+LSR
Sbjct: 912 RQARWALFFGRFDFTI----SYRPGSKNIKPDALSR 943
>gi|301610510|ref|XP_002934795.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1004
Score = 76.6 bits (187), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 112/444 (25%), Positives = 185/444 (41%), Gaps = 52/444 (11%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ +S G + F V K +GG RP ++ +GLN+
Sbjct: 173 LSLPEAQAMREYISENLERGFIRPSNSPAG--AGFFFVGKKDGGLRPCIDYRGLNKITIK 230
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ +DL AY + I+ + A + +PFG
Sbjct: 231 NRYPLPLISELFDRVKGASIYTKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEYLVMPFG 290
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N + L G+ VVVYLDD L+ + + K + L
Sbjct: 291 LCNAPAVFQEFVNDIFRDL--LGVFVVVYLDDILIFSSNLSDHRSHVKEVLRRLRENNLY 348
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD-SARSL 706
L+K + V QFLG H+ L D + +R +L W S R+
Sbjct: 349 AKLEKCTFEVNSV-QFLGF----HISSKGLEMDPEK-----VRAVL---DWTQPLSLRAT 395
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHL---TPINPAVLPKLEWWLNALPLS 762
+L FA++ + S + L + GA P L + L K E+ +S
Sbjct: 396 QRFLGFANYYRQFIKNFSLIVAPITDLTKKGADPSLWSSEAVQAFNLLKKEF------VS 449
Query: 763 SPIF--PRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKK 808
+PI P FI DAS++G G+ + +F S +S + N+ I +
Sbjct: 450 APILRHPDTALPFIVEVDASEVGAGAVLSQRHPLTNKLHPCAFFSRKFSPSEANYDIGNR 509
Query: 809 EMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQD 866
E+ A+ A LL+ + V V +D++ ++ Y+ + L+ +F
Sbjct: 510 ELLAIKWAFEEWRHLLEGAKHAVSVFTDHKNLL-YIE---SARRLNPRQARWALFFS--- 562
Query: 867 WRIHILAQFIPGAYNSVADSLSRS 890
R + + PG+ N+ AD+LSRS
Sbjct: 563 -RFNFSITYRPGSKNTKADALSRS 585
>gi|326670252|ref|XP_003199174.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1094
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 119/483 (24%), Positives = 204/483 (42%), Gaps = 81/483 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 393 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 450
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 451 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTVDGHYEYLVM 507
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 508 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILIYSNSLSEHIQHVRAVLKRLIEN 565
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 566 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 608
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
+ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 609 PKTIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 657
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 658 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 717
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
+QN+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 718 NSAEQNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 773
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR-----SKSLPDWHLSRSATEQIFLK 909
+F D+++ +IPG+ N AD+LSR + +PD + +S ++
Sbjct: 774 QARWALFFTRFDFQV----TYIPGSKNIKADALSRLSDDETSEIPDEPIIKSPLIVAPIQ 829
Query: 910 WGV 912
W +
Sbjct: 830 WDI 832
>gi|326675725|ref|XP_003200413.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1159
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 113/448 (25%), Positives = 176/448 (39%), Gaps = 49/448 (10%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 347 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 404
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 405 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 464
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 465 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 521
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG RM PE Q A W
Sbjct: 522 RLLENGLYVKAEKCVFH-AQSVQFLGHTVSVEGMRM-DPEKIQ-----------AVVNWP 568
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN-----PAVLPKLE 753
DS ++L +L FA+F R R ++ A+ L TP A KL+
Sbjct: 569 TPDSRKALQRFLGFANFY----RRFIRNFRQLAAPLTNLTSSKTPFRWSNAAEAAFSKLK 624
Query: 754 WWLNALPLSSPIFPRQVQHFISTDASDLGWG------SQVD-----SSFLSGLWSREQQN 802
+ P+ P + Q + D S++G G S +D ++ S S ++N
Sbjct: 625 GCFVSAPILIAPDPSR-QFVVEVDVSEVGVGAILSQRSALDGKIHPCAYFSHRLSAAERN 683
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSVV-MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
+ I +E+ AV AL L+ S V + S + + Y++ K L+ +F
Sbjct: 684 YDIGNRELLAVKLALEEWRHWLEGSGVPFIVSTDHKNLEYIK---SAKRLNSRQARWALF 740
Query: 862 LLSQDWRIHILAQFIPGAYNSVADSLSR 889
R + + PG+ N D+LSR
Sbjct: 741 F----GRFNFSISYRPGSKNIKPDALSR 764
>gi|326673530|ref|XP_003199909.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1475
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 115/453 (25%), Positives = 191/453 (42%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 316 LSQPETEAMKNYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 373
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 374 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQRDEWKTGFSTIDGHYEYLVM 430
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N V + ++ V+VY+DD L+ + I ++ L I
Sbjct: 431 PFGLANSPSVFQAFVNEVFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLERLIQN 488
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
L + K + FLG + P M D+Q + + W
Sbjct: 489 QL--YAKISKCEFHQT-CISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQPV 533
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL 761
+ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 534 TIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAVRA 582
Query: 762 ---------SSPIF--PRQVQHFI-STDASDLGWGSQVDSSFL-------SGLWSRE--- 799
S+PI P Q FI DAS+ G G+ + L +SR+
Sbjct: 583 FTQLKTRFSSAPILRHPDPEQPFIVEIDASNTGIGAILSQRSLVTKKLHPCAFYSRKLNS 642
Query: 800 -QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
++N+ + +E+ A+ AL LQ + V +D++ + Y+R K L+
Sbjct: 643 AERNYDVGNRELLAMKAALEEWRHWLQGAKHPFTVITDHKN-LEYIR---SCKRLNPRQA 698
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 699 RWALFFTRFDFQV----TYIPGSKNIKADALSR 727
>gi|308484436|ref|XP_003104418.1| hypothetical protein CRE_22890 [Caenorhabditis remanei]
gi|308258066|gb|EFP02019.1| hypothetical protein CRE_22890 [Caenorhabditis remanei]
Length = 1871
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 110/475 (23%), Positives = 192/475 (40%), Gaps = 73/475 (15%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V I + IP +P VP+ + + HI +L + + +S T + S +
Sbjct: 911 VHIYTSTEIPVKGRPYRVPV--------KYQAELEKHINGLLRSARIT--ESNTPWTSPI 960
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFH 559
LV K NG R L+ + LN+ P + L RI + ++K Y S+D++ Y
Sbjct: 961 VLVKKKNGSLRVCLDFRKLNEVTIPDNYPLP---RIDTIIEKVGMARYFSSLDMANGYLQ 1017
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA-SLSNWVASLLRSRGMRVVVYLD 618
+ + V A T LPFGL +A F +L +A L V+VY+D
Sbjct: 1018 LRLDAESSYKCGFITENKVYAYTHLPFGLKSAASYFQRALKTVLAGLEED----VMVYID 1073
Query: 619 DFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL----QFLGIMWDPHLDR 674
D L+ ++ + +L ++ + +K ++ + G+ + P+
Sbjct: 1074 DVLIFSKTFEEHLVSLRLVLARFREFNLKASPKKCEFVKQSIVFLGHEISGVSYSPN--- 1130
Query: 675 MWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL-----SFASFVIPMGRLHSRRIQR 729
Q + I + + L + G+ +FAS P+ RL +R+ Q+
Sbjct: 1131 -------QANIDAIAKLPTPTNVMELKRFVGMAGFFRKFIENFASIAEPLTRL-TRKEQK 1182
Query: 730 QASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF-ISTDASDLGWG---- 784
+ L KL+ L + P+ S FP + F I TDAS + G
Sbjct: 1183 FV---------WSQEQQEALTKLKTALTSKPILS--FPNYEKPFHIFTDASAVAQGAALM 1231
Query: 785 --SQVD------SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQ 836
++ D ++ S S E+ W + E+ A+ AL P + S V++ SD++
Sbjct: 1232 QAAEADPKNFHVMAYASRTLSDEETRWAAIQIELGAIIFALRQFKPYICLSKVILHSDHR 1291
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
+ L + +L+ + + Q + I I+ I G N+VAD LSR+K
Sbjct: 1292 PLTFLLAKNKVNDNLA------RWLVELQQYDIEIV--HIEGKKNTVADCLSRAK 1338
>gi|432875314|ref|XP_004072780.1| PREDICTED: uncharacterized protein LOC101168790 [Oryzias latipes]
Length = 1839
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 110/462 (23%), Positives = 182/462 (39%), Gaps = 79/462 (17%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L ++ A++ +I L G+++ S G + F V K +G RP ++
Sbjct: 619 IPKGRLYPVSVAERQALNDYIDNSLAAGLIRPSSSAAG--AGFFFVGKKDGSLRPCIDYS 676
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN+ ++ L + LQ+ +DL AY V I+ + +
Sbjct: 677 ALNE----NRYPLPLMSSVFDQLQQAKVFTKLDLRNAYHLVRIREGDEWKTGFNTPRGHY 732
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP------------ 627
+PFGL AP F ++ N V L V VYLDD L+ + DP
Sbjct: 733 EYLVMPFGLTNAPAVFQAMINDV--LRDFLDHFVYVYLDDILIYSPDPDTHVTHVSAGLK 790
Query: 628 RILE-------IQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
R+L+ + + VS + LG+IV+ + PA V+ W R L
Sbjct: 791 RLLDNHLYVKAEKSEFHVSTVAFLGFIVSAGTVEMDPAKVIAVTD--WPSPDSRKKL--Q 846
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
+ L N R + +F+SF P+ L S R+Q +
Sbjct: 847 QFLGFANFYRRFIR----------------AFSSFAAPLHALTSPRVQFR---------- 880
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------S 789
TP A L+ + PL + PR+ Q + DAS+ G G+ +
Sbjct: 881 WTPAAEAAFRTLKRRFTSAPLLTLPDPRR-QFVVEVDASNEGIGAVLSQRSEQDGKLHPC 939
Query: 790 SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGG 847
+FLS S ++N+ I +E+ AV AL L+ + +V +D++ + Y+R
Sbjct: 940 AFLSRRLSASERNYDIGNRELLAVKVALEEWRHWLEGAEHPFVVWTDHKN-LEYIR---N 995
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
K L+ ++F R + + PG+ N D+LSR
Sbjct: 996 AKRLNSRQARWRLFFD----RFSFVLSYRPGSKNIKPDALSR 1033
>gi|326668112|ref|XP_003198741.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1136
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 111/456 (24%), Positives = 195/456 (42%), Gaps = 78/456 (17%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 219 LLQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKRDGSLRPCIDYRGLNEITVK 276
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ +R S +
Sbjct: 277 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDERKTGFSTIDGHYEYLVM 333
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD-----PRILEIQGKLAVS 639
PFGLA +P F + N + + ++ V+VY+DD L+ + + + +L +
Sbjct: 334 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAMPKRLIKN 391
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L + ++ +S FLG + P M D+Q + + W
Sbjct: 392 QLYAKSSKCEFHQTCIS------FLGYIISPEGMAM----DQQ--------KVDSVTQWP 433
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNA 758
++ R L +L FA+F RR R S + AP LT + A +L+W +A
Sbjct: 434 QPETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDA 482
Query: 759 LPL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE 799
+ S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 483 VRAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRK 542
Query: 800 ----QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSL 853
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 543 LNSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNP 598
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 599 RQARWALFFTRFDFQV----TYIPGSKNIKADALSR 630
>gi|390363533|ref|XP_003730394.1| PREDICTED: uncharacterized protein K02A2.6-like [Strongylocentrotus
purpuratus]
Length = 1027
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 101/447 (22%), Positives = 193/447 (43%), Gaps = 53/447 (11%)
Query: 464 SLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQ 523
SL+ L V +S + + G+++R+DS+ ++S L + + NG R ++L+ +N+
Sbjct: 444 SLRRLPLAVRDEVSKELHRLESDGIIERIDSSP-WVSNLVIARRKNGDLRLCVDLRAVNK 502
Query: 524 FLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTC 583
+ P K+ L + + +D+ ++Y VP+ + A + + +
Sbjct: 503 AIIPDKYPLPTMNELSASFHGAKVFSKLDMRRSYLQVPLAEQSRHLTAFNTHIGMFQYRR 562
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAV 638
+P+GL +AP AF + + V + + + LDD ++ +D R+ E+ +LA
Sbjct: 563 MPYGLNSAPSAFQKIVSSVLAGIEG----TLNLLDDVVVFGEDKAQHDQRLAEVMARLAK 618
Query: 639 SILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI--LRTLLASK 696
L +N K + + A + FLG H+ L TL N+ +RTL A
Sbjct: 619 HNL-----TLNEAKCTFA-ASDIDFLGY----HVTADGLTP----TLDNVAAIRTLPAPT 664
Query: 697 TWNLDSARSLLGYLSFASFVIPMGRLHSRRIQ---RQASLLRLGAPHLTPINPAVLPKLE 753
N+ S LG +F +P + +Q R+ +L T N L+
Sbjct: 665 --NVKELASFLGTTNFYRKFVPQYAEIAEPLQKLLRKDALWEWHNAQETAFN-----TLK 717
Query: 754 WWLNALPLSSPIFPRQVQHFISTDASDLGWGS----QVDSS-----FLSGLWSREQQNWH 804
+ P+ + P + +++TDAS G+ +DSS F S S ++ +
Sbjct: 718 GRIAEPPVLAHFTP-SAETYVTTDASAFAIGAVLSQTIDSSVRPVAFASRALSDTERKYS 776
Query: 805 INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG-GTKSLSLLSEVEKIFLL 863
++E A A L +++D+Q + + L G G + L + ++ L
Sbjct: 777 TGEREALACIYACEHWHMYLYGRKFTLRTDHQALTTLLSTSGSGHRPLRIYRWSDR--LH 834
Query: 864 SQDWRIHILAQFIPGAYNSVADSLSRS 890
D+++ +++ G+ N VAD LSR+
Sbjct: 835 QYDFKV----EYLAGSRNRVADMLSRT 857
>gi|326667880|ref|XP_003198689.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1258
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 115/455 (25%), Positives = 194/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 415 LSQPETEAMKSYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDCRGLNEITVK 472
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 473 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 529
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N V + ++ V+VY+DD L+ + I ++ L I
Sbjct: 530 PFGLANSPSVFQAFVNEVFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIQN 587
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 588 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 630
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 631 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 679
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 680 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVTKKLHPCAFYSRKL 739
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 740 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 795
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 796 QARWALFFTRFDFQV----TYIPGSKNIKADTLSR 826
>gi|326663957|ref|XP_003197697.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1142
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 116/455 (25%), Positives = 191/455 (41%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDS--TTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
L+ P + AM +I E LE G ++ S +TGF F + K +G RP ++ +GLN+
Sbjct: 350 LSQPETEAMKNYISEELEKGFIRPSTSPASTGF----FFIKKKDGSLRPCIDYRGLNEIT 405
Query: 526 SPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
++ L +P+ L++ Y +DL AY + I+ + S
Sbjct: 406 VKYRYPLP---LVPAALEQLRSAQYFTKLDLRGAYNLIRIRQGDEWKTGFSTIDGHYEYL 462
Query: 583 CLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSI 640
+PFGLA +P F + N V + ++ V+VY+DD L+ + I ++ L I
Sbjct: 463 VMPFGLANSPSVFQAFVNEVFRDMLNKW--VIVYIDDILVYSNSLSEHIQYVRAVLERLI 520
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L + K + FLG + P M D+Q + + W
Sbjct: 521 QNQL--YAKISKCEFHQT-CISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 565
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 566 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 614
Query: 760 PL---------SSPIF--PRQVQHFI-STDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q FI DAS+ G G+ + L +SR+
Sbjct: 615 RAFTQLKTRFSSAPILRHPDPEQPFIVEIDASNTGIGAILSQRSLVTKKLHPCAFYSRKL 674
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + V +D++ + Y+R K L+
Sbjct: 675 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFTVITDHKN-LEYIR---SCKRLNPR 730
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R L +IPG+ N AD+LSR
Sbjct: 731 QARWALFFT----RFDFLVTYIPGSKNIKADALSR 761
>gi|326680534|ref|XP_003201542.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1673
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 103/451 (22%), Positives = 181/451 (40%), Gaps = 68/451 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ + M+ +I+E L+ G ++ ST+ + F V K +GG RP ++ +GLN
Sbjct: 411 LSQTETETMNAYIEEELKKGFIRH--STSPASAGFFFVEKKDGGLRPCIDYRGLNAITVK 468
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L L+ Y +DL AY + IK H+ A S + +PFG
Sbjct: 469 YRYPLPLVPAALELLRTAKYFTKLDLRSAYNLIRIKKNHEWKTAFSTSSGHYEYLVMPFG 528
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLG 645
LA +P F + N V + +R V+VY+DD L+ ++ + I ++ L I L
Sbjct: 529 LANSPSVFQAFINDVFRDMLNRW--VIVYIDDILIYSESLEEHISHVRAVLQRLIEHRL- 585
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDSAR 704
L+K + FLG + + + D+Q + A W + +
Sbjct: 586 -YAKLEKCEFHQTSI-SFLGYI----IGTEGVAMDEQ--------KVQAVLKWPKPRTIK 631
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP----------KLEW 754
L +L FA+F RR R SL+ LT +++ KL+
Sbjct: 632 ELQRFLGFANFY--------RRFIRNFSLVASPLTSLTRGKGSIIKGNDTAERAFAKLKH 683
Query: 755 WLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNW 803
P+ P ++ + DAS+ G G+ + +F S + +QN+
Sbjct: 684 RFATAPILHHPNP-ELPFIVEIDASNTGIGAILSQKQGSPSKSHPCAFFSRKLNSAEQNY 742
Query: 804 HINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLL 863
+ +E+ A+ A+ L+ + TV++ K+L + +++
Sbjct: 743 DVGNRELLAMKAAMEEWRHWLEGA-----KHKFTVIT------DHKNLEYIHSAKRLNPR 791
Query: 864 SQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R +IPG+ N AD+LSR
Sbjct: 792 QARWALFFTRFDFTVTYIPGSKNVKADALSR 822
>gi|254587273|emb|CAX83693.1| Gap-Pol polyprotein [Schistosoma japonicum]
Length = 1293
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 112/439 (25%), Positives = 180/439 (41%), Gaps = 42/439 (9%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
HL T V+ ++EML G+++ DS + S + LV K NG R ++ + LN
Sbjct: 425 HLETEVNR----QVEEMLRDGIIEEADSR--YNSPVLLVKKSNGKYRFCVDFRELNSITE 478
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
K + LQ+ +DL Y+ +PIK + A + +PF
Sbjct: 479 LKPCPMPTVAETLDRLQQAKLFTVLDLRSGYWQLPIKADDRYKTAFTVGHKQYQFRRMPF 538
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGW 646
GLA AP F L N + L + V VY DD ++ ++ R + + + G
Sbjct: 539 GLAGAPFTFRRLMNLLLRNLEN----VEVYGDDLVVYSKTERDHARHLEAVLKRIEEFGL 594
Query: 647 IVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSL 706
+N +KS ++ V + D + LPE K LT+ NI T SK R L
Sbjct: 595 RINKKKSQIAKCNVTLLGYKVGDGEMKP--LPE-KILTIQNI--TAPTSK-------RKL 642
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF 766
++ A+F + + LL G T +++ LNA ++ +
Sbjct: 643 RQFIGRAAFYSRFIKNFNEIAAPLYKLLSSGKFIWTEEAQRTFDRIKQLLNAKQMTLRLP 702
Query: 767 PRQVQHFISTDASDLGWGS---QVDS--SFLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
R Q ++TDASD G G+ Q D + S + + +Q + +KE A+ A+
Sbjct: 703 IRGKQFTVATDASDFGIGAVLRQDDGVVEYASRVLTPTEQKYSTIEKECLAIVWAVDKWR 762
Query: 822 PLLQSSVVMVQSDNQTV--VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
P L +++D++ + + R G S L E F + IPG+
Sbjct: 763 PYLLGQPFHIETDHKPLQWLKTARDPRGKLSRWTLRLQEYDFTIGH----------IPGS 812
Query: 880 YNSVADSLSR---SKSLPD 895
N +AD LSR SLP+
Sbjct: 813 RNVIADYLSRPCEDASLPE 831
>gi|391325818|ref|XP_003737424.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1192
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 101/433 (23%), Positives = 178/433 (41%), Gaps = 60/433 (13%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I + + GV++R+ ++ ++S + + K NG R ++L+ +N+ + F L + +
Sbjct: 456 IDRLEQEGVIERIQASE-WISPIVVAEKKNGDVRLCVDLREVNKAVVQDAFPLPHIEDLM 514
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L KG IDL AY +P+ + + A + T + FGLA+AP AF +
Sbjct: 515 QRLAKGRVFSKIDLRSAYHQIPLHESSRDLTAFVSPWGLFRYTRVCFGLASAPAAFQAFM 574
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
L V+ YLDD L+V + ++ + + + + L G VN +
Sbjct: 575 EETLKDLEG----VICYLDDVLVVGETRQVHDERVRGLLRTLSERGLKVN--NKCVFGVE 628
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLG----YLS---- 711
+FLG + + LP+ N+ + N+ RS LG YL
Sbjct: 629 ETEFLGHVVSSKGVKP-LPD-------NVKAIENVPEPKNVSQLRSFLGMAGFYLKCVPR 680
Query: 712 FASFVIPMGRLHSRRI-----QRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF 766
+A V P+ L + + +RQ R + P L + ALPL
Sbjct: 681 YAELVEPLKELLRKEVKFDWRERQRLAFRAVKGAIAEAAP-----LRVFDPALPL----- 730
Query: 767 PRQVQHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQAL 817
++TDASD G G+ + ++ S S Q+ + + KE A A+
Sbjct: 731 ------VLTTDASDYGLGAVLQQRVNGKLEPLAYASCSLSETQRRYSTSDKEALACVWAI 784
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGT-KSLSLLSEVEKIFLLSQDWRIHILAQFI 876
L +++D++ +VS +G +S+ L E++ + D ++
Sbjct: 785 EKWHVYLWGRRFTLKTDHRALVSLFGTKGADRRSIRLARWAERLGAYAFD------VEYK 838
Query: 877 PGAYNSVADSLSR 889
PG N +AD+LSR
Sbjct: 839 PGVENVIADALSR 851
>gi|322695492|gb|EFY87299.1| pol protein [Metarhizium acridum CQMa 102]
Length = 868
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 93/397 (23%), Positives = 163/397 (41%), Gaps = 58/397 (14%)
Query: 473 SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL 532
A+ ++++ML+ G ++ +S G+ + VPK NG RP ++ + LN+ ++ L
Sbjct: 486 DKALKEYLEDMLQKGYIRPSESPAGYP--ILWVPKKNGKLRPCIDYRHLNKITIKNRYPL 543
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I + K + ++DL AY + +K H+ A + +PFGL AP
Sbjct: 544 PLMTEIRDKVGKAKWFTTLDLKGAYNLIRMKEGHEWMTAFRTSRGHYEYLVMPFGLTNAP 603
Query: 593 QAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL---GWIV 648
F + + ++LR + G+ VVVYLDD L+ + LE + +L +L +V
Sbjct: 604 ATFQRM---IDTILRKQLGVFVVVYLDDILIYSD---TLEEHKRHVHEVLQTLQDNKLLV 657
Query: 649 NLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSAR 704
K V FLG + RM DK T+ K W NL R
Sbjct: 658 EASKCQFHQNTV-HFLGYVLTHGEIRM--SPDKIKTI----------KEWPTPKNLKEVR 704
Query: 705 SLLGYLSFA-SFVIPMGRLHSR---RIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP 760
+++F F+ G + SR + ++ + P T + L
Sbjct: 705 GFTAFVNFYRKFLSGYGDI-SRPLTNLTKKEVGFQWNEPEATAFQK---------MKDLV 754
Query: 761 LSSPIF--PRQVQHF-ISTDASDLGWGSQVDS----------SFLSGLWSREQQNWHINK 807
S P+ P Q + + + TDASD G Q+ +F S + N+ I+
Sbjct: 755 TSEPVLKAPDQDKPYELETDASDFALGGQLGQRDDQGRLHPVAFFSKKLHGPELNYGIHD 814
Query: 808 KEMFAVHQALS--LNLPLLQSSVVMVQSDNQTVVSYL 842
KE+ A+ + + + + V +D++ + S+L
Sbjct: 815 KELMAIIECFKEWRHYLIGAKHQIKVYTDHKNLTSFL 851
>gi|313215817|emb|CBY16360.1| unnamed protein product [Oikopleura dioica]
Length = 813
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 94/409 (22%), Positives = 163/409 (39%), Gaps = 27/409 (6%)
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
FL+KG + ID ++H+ + + Y G FG+ AP A+ ++++
Sbjct: 282 FLKKGMLLTKIDDKSGFYHMKLDNFSRNMACCEYGGQTFRYKGAVFGIPKAPGAYQTMNS 341
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLA------VSILGSL-----GWIVN 649
LLR G +YLDD L + E Q + LG L G +N
Sbjct: 342 VPMCLLRQNGFHCFLYLDDRLFLTMPESKAEEQALIRGDRVPLAPFLGLLSITANGTYIN 401
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGY 709
KS L P ++FLG + + +P +K + K L G
Sbjct: 402 RPKSVLKPTQKMEFLGFGLNTIKGTIRIPTEKFERFKIEASAIRKRKLCEYKRLEKLRGV 461
Query: 710 LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL-------S 762
+ V RLH RR+ L + I V +L W+++ +
Sbjct: 462 MCSFVLVSENMRLHIRRVTWALKLADREKSAMIKICEEVKEELGNWIHSRHILKERSWVK 521
Query: 763 SPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNW----HINKKEMFAVHQALS 818
+ + +V+ F TDAS+ G ++S ++ E+ + +I KE +AV AL
Sbjct: 522 NGVVIVEVKVF--TDASNYAGGVTIESEDINVSIPWEEGSAIARDNIFLKEAYAVLHALQ 579
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
L ++ +V +DN VV+ G S +L + KI +++ I + ++
Sbjct: 580 KYGHLFKNKLVHFLNDNMVVVNCF-HVGSKSSPALNRIIRKIHEAAEEHLIALKVSWV-S 637
Query: 879 AYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVP-CIDLFASRVSAVV 926
+ AD+ SR + ++ E I +K G+ +DLF++ +A V
Sbjct: 638 TLDQKADAASRETDCKEAIFRKTVFEAIQIKLGLTFSLDLFSTAENAKV 686
>gi|326676564|ref|XP_003200611.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Danio rerio]
Length = 1283
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 109/458 (23%), Positives = 183/458 (39%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 334 PKGRLFSLSGPEREAMDRYINESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 391
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ ++L AY + I+ + A +
Sbjct: 392 LNDITIKNRYPLPLMSSAFELLQGAKVFTKLELRNAYHLIRIREVDEWKTAFNTPTGHFE 451
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 452 YRVLPFGLTNAPAVLQALVNDVLRDMVNRF--VFVYLDDILIFSPSLKVHTQHVRQVLQR 509
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR---TLLASKT 697
L V +K V FLG + ++ G + + A
Sbjct: 510 LLENQLYVKAEKCVFHVQSV-SFLGFI---------------ISAGELQADPCKVKAVAE 553
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W DS ++L +L FA+F RR R + ++ AP +P V +W +
Sbjct: 554 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSPKV--PFKWEV 601
Query: 757 NALP---------LSSPIF---PRQVQHFISTDASDLGWG------SQVDS-----SFLS 793
+A +S+P+ + Q + DASD+G G S++D +F S
Sbjct: 602 DAQEAFDKLKSRFVSAPVLSIPDPERQFIVEVDASDVGVGAVLSQRSRLDGKVHPCAFFS 661
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSL 851
+ ++N+ I +E+ AV AL L+ + +V +D++ + Y+R + L
Sbjct: 662 HRLNPSERNYDIGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKN-LEYIR---SARRL 717
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
S +F R + + PG+ N D+LSR
Sbjct: 718 SSRQARWALFF----GRFNFTLSYRPGSKNIKPDALSR 751
>gi|427781365|gb|JAA56134.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 1057
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 119/502 (23%), Positives = 188/502 (37%), Gaps = 82/502 (16%)
Query: 482 EMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRIPS 540
M+E +++ S++ + S L +VPK G RP + + LN+ P ++ + + + S
Sbjct: 216 HMMELRIVR--PSSSPYASPLHMVPKSTDGDWRPCGDYRALNRVTVPDRYPVPHIHDMTS 273
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
FL IDL +AY +P+ A++ + +PFGL A Q F +
Sbjct: 274 FLHGATIFSKIDLVRAYHQIPVAPEDIPKTAITTPFGMFEFLRMPFGLRNAGQTFQRFMD 333
Query: 601 WVASLLRSRGMRVV-VYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
V RG+ VYLDD L+ + P E + + L G ++N K P
Sbjct: 334 GVV-----RGLDFCKVYLDDLLIASSTPEEHERHLRYVLQRLTENGIVINTAKCVFG-VP 387
Query: 660 VLQFLG-----IMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
L FLG P D++ +R K NL R LG ++F
Sbjct: 388 TLSFLGHSISSAGVQPQKDKV-----------EAVRQFPQPK--NLRQLREFLGLVNFYR 434
Query: 715 FVIPMGRLHSRRIQRQASLLRLGAPHLTPI------NPAVLPKLEWWLNALPLSSPIFPR 768
IP + + SLL T I A E NA L+ P
Sbjct: 435 RFIPKC---ANILHPLHSLLAASGSKATAIQWNDQSTQAFRMIKEALANATMLTYPQL-- 489
Query: 769 QVQHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSL 819
V + DASD + + SF S S ++ + +E+ A++ A+
Sbjct: 490 GVPQCVMVDASDAAIRAVLQQRVSGVWRPISFFSTKLSPSERRYSTFGRELLAIYAAIRH 549
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLR--RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
++ V +D++ + LR G L ++ I + D R +
Sbjct: 550 FRHYVEGQEFFVLTDHKPLTYALRANSDSGAHVARELRQMSYIAEFTTDIR------HVS 603
Query: 878 GAYNSVADSLSR----SKSLP--------------DWHLSR---SATEQIFLKW-----G 911
G N+ AD+LSR + SLP D L R S T + L+W G
Sbjct: 604 GTDNAAADALSRGPVNAISLPSGVDFTTLTTAQRSDGELKRLLTSPTSALKLQWLAEPTG 663
Query: 912 VPCIDLFASRVSAVVPNHFQVS 933
C D+ R VP++ + S
Sbjct: 664 SVCCDMSTGRARPFVPSNLRRS 685
>gi|301622869|ref|XP_002940749.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 735
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 108/448 (24%), Positives = 183/448 (40%), Gaps = 51/448 (11%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I+E LE G ++ S G + F V K +GG RP +N +GLN
Sbjct: 153 LSLPETQAMEEYIKENLERGFIRPSCSPAG--AGFFFVEKKDGGLRPCINYRGLN----- 205
Query: 528 KKFSLINHFRIPSFLQ-----KGDYMIS-IDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P + KG + S +DL AY + I+ + A +
Sbjct: 206 -KITVKNRYPLPLISELFDRVKGATIFSKLDLRGAYNLIRIREGDEWKTAFNTRDGHYEY 264
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F L N + L R VVVYLDD L+ + + + L
Sbjct: 265 LVIPFGLCNAPAVFQELVNDIFRDLLGRS--VVVYLDDILIYSNSLSDHRAHVQEVLLRL 322
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD 701
++K + V FLG ++ K L + + + + L
Sbjct: 323 RQHHLYAKIEKCIFEVSSV-HFLG----------YIISHKGLEMDPVKVQAILNWVQPL- 370
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNALP 760
S R++ +L FA++ + S + +L + P + PI +L+ P
Sbjct: 371 SLRAIQRFLGFANYYRQFIKNFSTLVAPITALTKGADPSIWPIEAQQAFTELKRSFTTAP 430
Query: 761 LSSPIFPRQVQHFISTDASDLGWGSQVDS-----------SFLSGLWSREQQNWHINKKE 809
+ P + Q + DAS+ G+ + +F S S+ ++N+ + +E
Sbjct: 431 ILRHPDPAR-QFILEVDASEHALGAVLSQRSDFKSQLHPIAFFSRKLSQSERNYDVGDRE 489
Query: 810 MFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
+ A+ A LL+ + ++V SD++ + YLR K L +F
Sbjct: 490 LLAIKSAFQEWRHLLEGANHPILVFSDHKN-LEYLR---SAKRLRPRQARWALFFS---- 541
Query: 868 RIHILAQFIPGAYNSVADSLSRSKSLPD 895
R + F P + N AD+LSR P+
Sbjct: 542 RFNFHVTFRPDSKNGKADALSRMFPAPE 569
>gi|66828695|ref|XP_647701.1| hypothetical protein DDB_G0267342 [Dictyostelium discoideum AX4]
gi|60475845|gb|EAL73777.1| hypothetical protein DDB_G0267342 [Dictyostelium discoideum AX4]
Length = 925
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 65/127 (51%)
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+PS +++G YM+ +D+ +AY HV + ++ + G +PFGL+TA + F
Sbjct: 114 LPSMVKQGYYMVKLDIKKAYLHVLVDPQYRDLFRFVWKGSHYRWKTMPFGLSTAHRIFTM 173
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
L V +LR + V+ YLDD L+V K + +L LG+ +NL+KS L P
Sbjct: 174 LLRPVLRMLRDINVSVIAYLDDLLIVGSTKEECLSNLKKTMDLLIKLGFKLNLEKSVLEP 233
Query: 658 APVLQFL 664
+ L
Sbjct: 234 TQSITLL 240
>gi|308478699|ref|XP_003101560.1| hypothetical protein CRE_10350 [Caenorhabditis remanei]
gi|308263014|gb|EFP06967.1| hypothetical protein CRE_10350 [Caenorhabditis remanei]
Length = 1069
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 88/195 (45%), Gaps = 5/195 (2%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
I+ + E GVL+R D +S L +V +G R +L+L LN+ L P +F L N
Sbjct: 480 EIERLEEEGVLERSDRLPRAVSPLHVVEQGKK-KRMILDLSELNKSLVPPRFKLENMKTA 538
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA----MTCLPFGLATAPQA 594
FL+ ++ + D Y H+ I + L+ S + A LPFGLATAP
Sbjct: 539 WPFLENANFAATFDFKSGYHHIKIHRDSRDLLSFSLSNPPAAPYFFFKGLPFGLATAPWL 598
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + + R+ G+++ +YLDD L+V + + + L G V +KS
Sbjct: 599 FTKIFKVLVRKWRAEGIKMFLYLDDGLIVGETEYEVARASRRVRGDLAEAGVCVAEEKSF 658
Query: 655 LSPAPVLQFLGIMWD 669
P +LG D
Sbjct: 659 WVPDAKFTWLGYECD 673
>gi|403163067|ref|XP_003890259.1| hypothetical protein PGTG_21094 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163896|gb|EHS62546.1| hypothetical protein PGTG_21094 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 984
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 90/389 (23%), Positives = 167/389 (42%), Gaps = 52/389 (13%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDS---------- 494
++ G+ F P + +H TP + SL +++ +E + K L++
Sbjct: 32 VIDGFTNGFDQGIPQ-HIIEGKHWFTPENHKSSLLVKDKIEESISKELEAKRMLGPFSHQ 90
Query: 495 ----TTGFLSR--LFLVPKGNGGTRPVLNL---------KGLNQFLSPKKFSLI-NHFRI 538
T GF L V G+G RP+ +L + +N ++ F + F+I
Sbjct: 91 QLKETFGFFRSNPLGAVVNGDGQIRPINDLSYPRNDPDIRSVNSYVDKSDFETTWDDFKI 150
Query: 539 PS-FLQKGDYMISI---DLSQAYFHVPIKTTHQRFLAL-SYNGDVLAMTCLPFGLATAPQ 593
S F + D + D +AY +P + ++L + ++G++L T + FG
Sbjct: 151 VSKFFAENDQKFDLALFDWEKAYRQIPTRQDQWKYLLVHDFDGNLLIDTRITFGGVAGCG 210
Query: 594 AFASLSNWVASLLRSRGMRVVVY--LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
+F ++ ++++ V V+ +DD L V + L + + VS LG + N++
Sbjct: 211 SFGRPADAWKLVMKNHFNLVNVFRWVDDNLFVKEVDENLSM--REIVSKSTELGVMTNIK 268
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDK-QLTLGNILRTLLASKTWNLDSARSLLGYL 710
K S A +F+G +W+ HL + LPE K + L I + ++ + A L+G L
Sbjct: 269 KFSDFTAE-QKFIGFVWNGHLKTVKLPEGKIEQRLAQIHPFQVKKAMFDYEEAEVLVGRL 327
Query: 711 SFASFVIPMGRLHSRRIQRQ-ASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQ 769
+ S+++P R H + + S + A TP++ VL L+ W+N L
Sbjct: 328 NHVSYILPHMRCHLCSLYKWLKSWIWRKAKRATPVD--VLEDLDVWVNTL--------NN 377
Query: 770 VQHFISTDAS---DLGWGSQVDSSFLSGL 795
+H + D+GW +SF G+
Sbjct: 378 FEHTRLINWGPPLDVGWVGDASTSFGIGI 406
>gi|326669542|ref|XP_003199037.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1487
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 104/453 (22%), Positives = 182/453 (40%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ + M+ +I+E L+ G ++ ST+ + F V K +GG RP ++ +GLN
Sbjct: 514 LSQTETETMNAYIEEELKKGFIRH--STSPASAGFFFVEKKDGGLRPCIDYRGLNAITVK 571
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L L+ Y +DL AY + IK H+ A S + +PFG
Sbjct: 572 YRYPLPLVPAALELLRTAKYFTKLDLRSAYNLIRIKKNHEWKTAFSTSSGHYEYLVMPFG 631
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLG 645
LA +P F + N V + +R V+VY+DD L+ ++ + I ++ L I L
Sbjct: 632 LANSPSVFQAFINDVFRDMLNRW--VIVYIDDILIYSESLEEHISHVRAVLQRLIEHRL- 688
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDSAR 704
L+K + FLG + + + D+Q + A W + +
Sbjct: 689 -YAKLEKCEFHQTSI-SFLGYI----IGTEGVAMDEQ--------KVQAVLKWPKPRTIK 734
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---------W 755
L +L FA+F RR R SL+ A LT + ++W
Sbjct: 735 ELQRFLGFANFY--------RRFIRNFSLV---ASPLTSLTRGKGSIIKWNDTAERAFAK 783
Query: 756 LNALPLSSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
L ++PI ++ + DAS+ G G+ + +F S + +Q
Sbjct: 784 LKHRFATAPILHHPNPELPFIVEIDASNTGIGAILSQKQGSPSKSHPCAFFSRKLNSAEQ 843
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ + +E+ A+ A+ L+ + TV++ K+L + +++
Sbjct: 844 NYDVGNRELLAMKAAMEEWRHWLEGA-----KHKFTVIT------DHKNLEYIHSAKRLN 892
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R +IPG+ N AD+LSR
Sbjct: 893 PRQARWALFFTRFDFTVTYIPGSKNVKADALSR 925
>gi|326664810|ref|XP_003197890.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1486
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 117/456 (25%), Positives = 183/456 (40%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 501 PKGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 558
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 559 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVCIRPGDEWKTAFNTPRGHFE 618
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 619 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQ 675
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 676 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVNWP 722
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR S +L AP LT + + P W
Sbjct: 723 TPDSRKALQRFLGFANFY--------RRFIHNFS--QLAAP-LTSLTSSKTP-FRWSSAA 770
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGL 795
L +S+PI P + F + DAS++G G+ + ++ S
Sbjct: 771 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHR 830
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 831 LSSAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIK---SAKRLNS 886
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 887 RQARWALFF----GRFNFTISYRPGSKNIKPDALSR 918
>gi|326677050|ref|XP_003200740.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1402
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 104/453 (22%), Positives = 182/453 (40%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ + M+ +I+E L+ G ++ ST+ + F V K +GG RP ++ +GLN
Sbjct: 462 LSQTETETMNAYIEEELKKGFIRH--STSPASAGFFFVEKKDGGLRPCIDYRGLNAITVK 519
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L L+ Y +DL AY + IK H+ A S + +PFG
Sbjct: 520 YRYPLPLVPAALELLRTAKYFTKLDLRSAYNLIRIKKNHEWKTAFSTSSGHYEYLVMPFG 579
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLG 645
LA +P F + N V + +R V+VY+DD L+ ++ + I ++ L I L
Sbjct: 580 LANSPFVFQAFINDVFRDMLNRW--VIVYIDDILIYSESLEEHISHVRAVLQRLIEHRL- 636
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDSAR 704
L+K + FLG + + + D+Q + A W + +
Sbjct: 637 -YAKLEKCEFHQTSIY-FLGYI----IGTEGVAMDEQ--------KVQAVLKWPKPRTIK 682
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---------W 755
L +L FA+F RR R SL+ A LT + ++W
Sbjct: 683 ELQRFLGFANFY--------RRFIRNFSLV---ASPLTSLTRGKGSIIKWNDTAERAFAK 731
Query: 756 LNALPLSSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
L ++PI ++ + DAS+ G G+ + +F S + +Q
Sbjct: 732 LKHRFATAPILHHPNPELPFIVEIDASNTGIGAILSQKQGSPSKSHPCAFFSRKLNSAEQ 791
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ + +E+ A+ A+ L+ + TV++ K+L + +++
Sbjct: 792 NYDVGNRELLAMKAAMEEWRHWLEGA-----KHKFTVIT------DHKNLEYIHSAKRLN 840
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R +IPG+ N AD+LSR
Sbjct: 841 PRQARWALFFTRFDFTVTYIPGSKNVKADALSR 873
>gi|409038381|gb|EKM48439.1| hypothetical protein PHACADRAFT_58702, partial [Phanerochaete
carnosa HHB-10118-sp]
Length = 889
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 100/449 (22%), Positives = 184/449 (40%), Gaps = 53/449 (11%)
Query: 463 CSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLN 522
C + ++ + I+E L TG ++ S + + S F V K +G RPV + + LN
Sbjct: 102 CKVYPISVNEQKELDEFIEENLRTGRIR--PSKSPWASPFFFVKKKDGRLRPVQDYRKLN 159
Query: 523 QFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
+ ++ L + L++ Y +D+ Y ++ IK + A N +
Sbjct: 160 ELTIKNRYPLPLIQELVDKLKQARYFTKLDVRWGYNNIRIKEGDEEKAAFLTNRGLFEPL 219
Query: 583 CLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILG 642
+ FGL +P F ++ N + L S+G VVVYLDD ++ QD + + + IL
Sbjct: 220 VMFFGLTNSPATFQTMMNDIFRDLISQG-HVVVYLDDIMIFTQDLKEHRWITRQVLQILR 278
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
+ +K A + ++LG++ + RM D + G + W +
Sbjct: 279 EHKLYLKPEKCEFEKAEI-EYLGMIVGKGVVRM----DPVMIEGVV--------NWPRPE 325
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP- 760
S R + +L F +F R + + + +SL T L+ L P
Sbjct: 326 SKRDIQAFLGFTNFYRRFIRDYGKIAKPLSSLTGNATFESTVEQEVAFTSLKDALCTAPV 385
Query: 761 --LSSPIFPRQVQHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKE 809
+ + P QV+ DASD G+++ ++LS + ++N+ I KE
Sbjct: 386 LAIPNDNDPYQVE----CDASDFAIGAELAQKQDGKWKPVAYLSKAMTAAERNYEIYDKE 441
Query: 810 MFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRR-----QGGTKSLSLLSEVEKIFL 862
+ A+ +L L ++ + +D++ + Y R+ QG + ++ L+E +
Sbjct: 442 LLAIMTSLDEWRQYLMGAIHPFEIWTDHKN-LEYFRKPQKLNQGQARWVTELAEYQ---- 496
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSRSK 891
+ +H P N AD LSR K
Sbjct: 497 ----YSLH----HKPSKSNGKADGLSRQK 517
>gi|12958098|gb|AAK07486.1| gag-pol polyprotein [Clonorchis sinensis]
Length = 1311
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 98/430 (22%), Positives = 178/430 (41%), Gaps = 51/430 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRI 538
+ MLE G+++ S++ + S L +VPK + G RP + + LN P ++ + +
Sbjct: 471 FEHMLELGIIR--TSSSHWSSPLHMVPKKSKGDWRPCGDYRSLNNATIPDRYPIPHIHDF 528
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L + +DL +AY+H+P+ A++ D+ T +PFGL A Q F
Sbjct: 529 ASTLCHTNIFSKLDLVRAYYHIPVAPDDIPKTAITTPFDLFEFTRMPFGLRNAAQTFQRF 588
Query: 599 SNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V RG+ V YLDD L+ + P + L + +N+ K L
Sbjct: 589 MDEVL-----RGLPFVYAYLDDVLIASTSPTEHAAHLRAVFERLSTYSIRLNIDK-CLFG 642
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGYLSFA 713
L FLG H+D + + +LA +++ L R +G +++
Sbjct: 643 VTSLDFLG----HHID--------STGISPLPNRILALESFPIPTTLTQLRRFIGIINYY 690
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAV-LPKLEWWLNALPLSSPI----FPR 768
IP H I + + L LG + P+V + E A+ ++ +
Sbjct: 691 RRFIP----HCADILQPLTDL-LGCKEKSVTLPSVAIAAFERAKQAIAHATKLSFLDTHE 745
Query: 769 QVQHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSL 819
+ ++TDAS+ G+ + +F S Q + +E+ A++ A+
Sbjct: 746 STKLILTTDASNAAVGAVLHQVVNNASQPLAFFSQKMQAAQTRYSTFGRELLAIYLAIRH 805
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
LL+ +Q+D++ + + S + ++ I + D R + PG+
Sbjct: 806 FRHLLEGRSFTIQTDHKPLTYAFNAKPDRYSPREIRHLDYISQFTTDIR------YTPGS 859
Query: 880 YNSVADSLSR 889
N VAD+LSR
Sbjct: 860 DNVVADALSR 869
>gi|326672934|ref|XP_003199760.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1427
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 105/445 (23%), Positives = 182/445 (40%), Gaps = 56/445 (12%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G ++ ST+ S F V K +GG RP ++ + LNQ
Sbjct: 490 LSLPETKAMEEYIQEALHQGYIR--PSTSPAASSFFFVTKKDGGLRPCIDYRILNQGTVK 547
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ +DL AY V I+ + A +
Sbjct: 548 FRYPLP---LVPAALEQLRSAKIFTKLDLRSAYNLVRIRRGDEWKTAFVTPTGHYEYRVM 604
Query: 585 PFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
P+GL AP F N++ +LR + V+VY+DD L+ +++ + + L
Sbjct: 605 PYGLVNAPSVF---QNFIHEVLREFLHLFVIVYIDDILIYSRNEVEHRHHVEKVLQTLRK 661
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDS 702
+ +K S P +QFLG + D RM ++ ++T A +W +
Sbjct: 662 HQLYLKAEKCSFH-LPSVQFLGYVIDKRGVRM---DEGKVT---------AVVSWPEPTT 708
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT--PINPAVLPKLEWWLNALP 760
+ L +L FA+F +S +LL+ L+ P A L+ P
Sbjct: 709 VKELQRFLGFANFYRRFIHNYSLVTAPLTNLLKNKPKKLSWPPEAAAAFRNLKEAFTRAP 768
Query: 761 LSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKKE 809
L + P + + DAS G G+ + ++ S S ++N+ I +E
Sbjct: 769 LLTHPDP-DLPFIVEVDASTTGVGAILSQFHGTPKLLHPCAYFSRKLSPVEKNYDIGNRE 827
Query: 810 MFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW-- 867
+ A+ AL L+ + + Q + + K+L L + +++ W
Sbjct: 828 LLAIKLALEEWRHWLEGA----KHPFQVITDH-------KNLQYLKDAKRLCPRQARWSL 876
Query: 868 ---RIHILAQFIPGAYNSVADSLSR 889
R H + PG+ N AD+LSR
Sbjct: 877 FFSRFHFSITYRPGSKNIRADALSR 901
>gi|422418|pir||S34639 pol protein - fruit fly (Drosophila ananassae) transposon Tom
(fragment)
gi|394705|emb|CAA80824.1| pol protein [Drosophila ananassae]
Length = 1040
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 116/468 (24%), Positives = 192/468 (41%), Gaps = 65/468 (13%)
Query: 461 PLCSLQH-LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK-----GNGGTRP 514
P+ S Q+ LA + + +QEMLE G+++ +S + + S ++VPK G R
Sbjct: 167 PIYSKQYPLAQTHENEVENQVQEMLEQGLIR--ESNSPYNSPTWVVPKKPDASGKAKYRV 224
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
V++ + LN+ P +F + N I L K Y +IDL++ + + + + + A S
Sbjct: 225 VIDYRKLNEITIPDRFPIPNMDEILGKLGKCQYFTTIDLARGFHQIEMDSESIQKTAFST 284
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEI 632
+PFGL AP F N + L ++ +VYLDD ++ + D + +
Sbjct: 285 KRGHYEYVRMPFGLRNAPATFQRCMNNILRPLINK--HCLVYLDDMIIFSTSLDEHLNSL 342
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL 692
Q L L + L K FLG + P D + + L + I
Sbjct: 343 Q--LVFEKLSESNLKLQLDKCEFLKKEA-TFLGHIVTP--DGI---KPNPLKVEAIASYP 394
Query: 693 LASKTWNLDSARSLLGYL-----SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
+ +K + + + GY S+A PM R L+ GA IN
Sbjct: 395 IPTKVKEIRAFLGMTGYYRKFIPSYADIAKPMTR-----------YLKKGAK--IDINNH 441
Query: 748 VLPKLEWWLNALPLSSPI--FPRQVQHFI-STDASDLGWGSQVDS-----SFLSGLWSRE 799
+ L L S PI P + F+ +TDAS+L G+ + SF+S +
Sbjct: 442 EYVEAFEKLKTLITSEPILQLPNFEKKFVLTTDASNLALGAVLSQDNHPISFISRTLNDH 501
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
+ N+ +KE+ A+ A L + SD+Q L L +++
Sbjct: 502 ELNYSTIEKELLAIVWATKTFRHYLLGRHFQIASDHQ-------------PLRWLHNLKE 548
Query: 860 IFLLSQDWRIHILA-----QFIPGAYNSVADSLSRSKSLPDWHLSRSA 902
Q WRI + ++I G NS+AD+LSR K + + H S +
Sbjct: 549 PNAKLQRWRIRLAEFDFHIEYIKGKQNSIADALSRIK-VEENHFSEAT 595
>gi|301628205|ref|XP_002943248.1| PREDICTED: hypothetical protein LOC100493969 [Xenopus (Silurana)
tropicalis]
Length = 471
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 91/204 (44%), Gaps = 16/204 (7%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNW 601
G M D A+ +PI FL + G CLP G A + +AF++ W
Sbjct: 135 GALMAKADKEAAFRLLPIHPECHHFLGCWFEGAYFVDLCLPMGCAISCADFEAFSTFLEW 194
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V + R+ +V YLDDF V Q +LE ++A S L +
Sbjct: 195 VVKV-RAGCSLMVHYLDDFFCVRQANADTCFHLLETLQEVAASFGVPLA-----ADKTEG 248
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA V++FLG+ D LP K L + +L K L +S+LG L+FA V
Sbjct: 249 PATVMRFLGLEIDSVAGECRLPTQKVEDLTREVGSLRRDKKATLQRLQSMLGKLNFACRV 308
Query: 717 IPMGRLHSRRIQRQASLLRLGAPH 740
IP+GR+ SRR+ + + + APH
Sbjct: 309 IPVGRVFSRRLAQATAGTQ--APH 330
>gi|156058386|ref|XP_001595116.1| hypothetical protein SS1G_03204 [Sclerotinia sclerotiorum 1980]
gi|154700992|gb|EDO00731.1| hypothetical protein SS1G_03204 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 1321
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 104/452 (23%), Positives = 178/452 (39%), Gaps = 39/452 (8%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
+P + L L T A+ I E L G + S F + + V K NG R
Sbjct: 485 TQPNNLTLSPLYRQTTQELQALKKFIDENLNRGWIA--PSNASFAAPILFVKKANGDLRL 542
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ + LN+ + + L I S + K +DL A+ + + + A
Sbjct: 543 CVDYRKLNEISAKDGYPLPRIDEILSQMSKAKIFTKLDLRAAFNAIRMHPDSEELTAFQT 602
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
LPFGL+ P + N + L+ + G YLDD ++ + DP Q
Sbjct: 603 CFGQFKSLVLPFGLSGGPGTYQRFINNL--LMENLGNFCTAYLDDIIIYSTDPSEHTAQV 660
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ ++ L G V+++K S + + ++LG ++ L D + I L
Sbjct: 661 RWVLTKLKEAGLSVDIKKCDFSVSRI-KYLGF----YVSTKGLEVDPE----KIKDILTW 711
Query: 695 SKTWNLDSARSLLGYLSFA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
+ + R LG+ F F+ GR+ +R + + R+ P L
Sbjct: 712 KRPTTVKGVRGFLGFCGFYRKFIKNYGRI-ARPLDKLTQKGRIF--DWDPDCQKAFETLR 768
Query: 754 WWLNALPLSSPIFPRQVQHFISTDASDLGWG---SQVDS--------SFLSGLWSREQQN 802
+ P+ P ++ + TD+SD G SQ+D +F S + + N
Sbjct: 769 QAVTEAPVLHYFHPDRLTK-VETDSSDGVTGGILSQLDPATKEWHPLAFFSKTMNPAECN 827
Query: 803 WHINKKEMFAVHQALSLNLPLLQS--SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
+ I+ KEM A+ QA LQS + V V SD+Q++ ++R + T + +E
Sbjct: 828 YEIHDKEMLAILQAFQQWRVELQSVENPVQVYSDHQSLEIFMRTKKLTARQARWAEYLSQ 887
Query: 861 FLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
F ++R G N AD+L+R S
Sbjct: 888 FNFQLEYRT--------GKANGQADALTRRDS 911
>gi|427798471|gb|JAA64687.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 970
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 112/460 (24%), Positives = 191/460 (41%), Gaps = 51/460 (11%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFL 504
I +G A+P P V L Q + + EML G+++R S++ + S + L
Sbjct: 140 IETGDALPLKCNPRPVSLAKRQ--------VIDGLLDEMLSAGIVRR--SSSAWASPIVL 189
Query: 505 VPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKT 564
VPK +G R ++ + LN + L I L Y ++D S+ Y V +
Sbjct: 190 VPKKDGSHRLCVDYRRLNGVTRKDAYPLPTISSIVGNLGTARYFTTLDASKGYLQVRMDE 249
Query: 565 THQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVN 624
+ A + + + T +PFGL AP F L + V L ++ + YLDD ++ +
Sbjct: 250 RDRCKTAFTSHRGLFEFTRMPFGLCNAPATFQRLMDRV--LGEAKWSYCMCYLDDIVIYS 307
Query: 625 QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLT 684
Q + + + G +N K+ L+ V Q LG L + D++
Sbjct: 308 QTFEEHLAHVADVLERVRAAGMTLNPAKAQLAQTRV-QLLGFT----LGEGSIEPDRE-- 360
Query: 685 LGNILRTLL---ASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL 741
LR +L A K ++ R LG +F IP R+Q + L LG
Sbjct: 361 ---KLRAILDFPAPK--DVRGLRRFLGMANFYRSFIP----SCARVQAPLTKL-LGKSAE 410
Query: 742 TPINPAVLPKLEWWLNALPLSSPI-FPRQVQHF-ISTDASDLGWGS----QVDS-----S 790
P +A+ ++ + P + F + TDASDLG G+ + D +
Sbjct: 411 WRWGPEQQEAFCRLSSAIAETAQLKLPDLTRPFVVQTDASDLGLGAVLLQEYDGVLQPLA 470
Query: 791 FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKS 850
F S ++N+ + ++E A+ AL L + +VQ+D+ + +R +
Sbjct: 471 FASRSLIPAEKNYSVTERECLAIVFALRKFDVYLDGTKFVVQTDHNALSWLMRLREPAGR 530
Query: 851 LSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
L+ + LL Q + + Q+ G+ N VAD+LSR+
Sbjct: 531 LA------RWALLIQHYDFSV--QYRKGSTNVVADALSRA 562
>gi|46241321|tpg|DAA01767.2| TPA_exp: polyprotein [Lytechinus variegatus]
Length = 640
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/371 (23%), Positives = 148/371 (39%), Gaps = 21/371 (5%)
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
L +G M D+ A+ +P+ L + +G CLP G A + F S +
Sbjct: 248 LGRGALMGKTDIKSAFRLLPVHPKDFELLGMYIDGRYYFDRCLPMGCAVSCSTFECFSTF 307
Query: 602 VASLLR--SRGMRVVVYLDDFLLVNQDPRILEIQGKLAV--SILGSLGWIVNLQKSSLSP 657
+ R ++ +V YLDDF P + + + +I G + +K+ P
Sbjct: 308 LEFCARKVAKSQNIVHYLDDFFFAG-GPASEDCRRAMHCFEAICERFGVPIAREKTE-GP 365
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
L +LG++ D ++ +PEDK L L + + +L +S++G L+F I
Sbjct: 366 TTQLSYLGLVIDSVSQQVRVPEDKIEKLVGKLNWAVQKQKISLRDIQSIVGSLNFVCKAI 425
Query: 718 PMGRLHSRR-IQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFIS 775
GR RR I S+ R H+ + L WL L + +F R F S
Sbjct: 426 APGRAFMRRLIDLTKSVKR--PFHMVRLTRGAKADLRVWLEFLAHFNGQVFFRAPGWFGS 483
Query: 776 TDASDLGWGSQVDSSFLSGLWSREQQNW---------HINKKEMFAVHQALSLNLPLLQS 826
+ + + Q W I E+F + L + L+
Sbjct: 484 EEIQFFTDAAAGIGFGIFFGGRWAQSRWPADFQADRRSIAFLELFPIWVGLEIWGMELKD 543
Query: 827 SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADS 886
++ DNQ VV+ L +Q ++ V K+ ++ I + A+ + G N +AD+
Sbjct: 544 KNILFNCDNQAVVAVLNKQSAL-CPDIMVLVRKVVIICLSNNIVLRARHVHGCDNGIADA 602
Query: 887 LSRSKSLPDWH 897
LSR + +P +H
Sbjct: 603 LSRFQ-MPRFH 612
>gi|326674036|ref|XP_003200053.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1165
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 114/455 (25%), Positives = 194/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 282 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 339
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 340 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 396
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 397 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIEN 454
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 455 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 497
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 498 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 546
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 547 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 606
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 607 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 662
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 663 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 693
>gi|326664007|ref|XP_003197709.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1230
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 121/459 (26%), Positives = 187/459 (40%), Gaps = 74/459 (16%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + + +++ S G + F V K +G RP ++ +G
Sbjct: 361 PKGKLYSLSIPEREAMEKYISD---SKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 415
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ + +DL AY V +K H+ A
Sbjct: 416 LNSITVKNTYPLPLMSSEFERLQGASFFTKLDLRNAYHLVRMKQGHEWKTAFLTPRGHFE 475
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + R L+ +
Sbjct: 476 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFS---RSLQEHVQHVRR 529
Query: 640 ILGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASK 696
+L L G V +K A + FLG + RM PE Q +
Sbjct: 530 VLQRLLENGLFVKAEKCVFH-AQSVPFLGHIVSVEGVRM-DPEKIQAVVN---------- 577
Query: 697 TWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW- 754
W + +S ++L +L FA+F RR R S +L AP LT + A W
Sbjct: 578 -WPIPESRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTSLTSAKT-AFRWS 624
Query: 755 --------WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFL 792
L +S+PI P + F + DAS++G G+ + +F
Sbjct: 625 SVAQAAFTKLKGCFVSAPILVTPDPARQFVVEVDASEVGVGAILSQRAASDDRIHPCAFF 684
Query: 793 SGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKS 850
S S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K
Sbjct: 685 SHRLSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKR 740
Query: 851 LSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L+ +F D+ I + PG+ N+ D+LSR
Sbjct: 741 LNSRQARWALFFGRFDFTI----SYRPGSKNTKPDALSR 775
>gi|326678742|ref|XP_003201157.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1334
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 106/453 (23%), Positives = 181/453 (39%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G ++ ST+ S F V K +GG RP ++ + LNQ
Sbjct: 418 LSLPETKAMEEYIQEALHQGYIR--PSTSPAASSFFFVTKKDGGLRPCIDYRILNQGTVK 475
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ +DL AY V I+ + A +
Sbjct: 476 FRYPLP---LVPAALEQLRSAKIFTKLDLRSAYNLVRIRRGDEWKTAFVTPTGHYEYRVM 532
Query: 585 PFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
P+GL AP F N++ +LR + V+VY+DD L+ +++ + + L
Sbjct: 533 PYGLVNAPSVF---QNFIHEVLREFLHLFVIVYIDDILIYSRNEVEHRHHVEKVLQTLRK 589
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDS 702
+ +K S P +QFLG + D RM ++ ++T A +W +
Sbjct: 590 HQLYLKAEKCSFH-LPSVQFLGYVIDKRGVRM---DEGKVT---------AVVSWPEPTT 636
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP----------AVLPKL 752
+ L +L FA+F RR SL+ +L P A L
Sbjct: 637 VKELQRFLGFANFY--------RRFIHNYSLITAPLTNLLKNKPKKLSWPSEAAAAFRNL 688
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
+ PL + P + + DAS G G+ + ++ S S ++
Sbjct: 689 KEAFTRAPLLTHPDP-DLPFIVEVDASTTGVGAILSQFHGTPKLLHPCAYFSRKLSPAEK 747
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ I +E+ A+ AL L+ + + Q + + K+L L + +++
Sbjct: 748 NYDIGNRELLAIKLALEEWRHWLEGA----KHPFQVITDH-------KNLQYLKDAKRLC 796
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R H + PG+ N AD+LSR
Sbjct: 797 PRQARWSLFFSRFHFSITYRPGSKNIRADALSR 829
>gi|301605883|ref|XP_002932571.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1049
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 111/449 (24%), Positives = 181/449 (40%), Gaps = 62/449 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ +S G + F V K +GG RP ++ +GLN+
Sbjct: 133 LSLPEAQAMREYISENLERGFIRPSNSPAG--AGFFFVGKKDGGLRPCIDYRGLNKITIK 190
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ +D AY + I+ + A + +PFG
Sbjct: 191 NRYPLPLISELFDRVKGASIYTKLDFRGAYNLILIREGDEWKTAFNTRDGHYEYLVMPFG 250
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F N + L G+ VVVYLDD L+ + + K + L
Sbjct: 251 LCNAPAVFQEFVNDIFRDL--LGVFVVVYLDDILIFSSNLSDHRSHVKEVLRRLRENNLY 308
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLD-SARSL 706
L+K + V QFLG H+ L D + +R +L W S R+
Sbjct: 309 AKLEKCTFEVNSV-QFLGF----HISSKGLEMDPEK-----VRAVL---DWTQPLSLRAT 355
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHL---TPINPAVLPKLEWWLNALPLS 762
+L FA++ + S + L + GA P L + L K E+ +S
Sbjct: 356 QRFLGFANYYRQFIKNFSLIVAPITDLTKKGADPSLWSSEAVQAFNLLKKEF------VS 409
Query: 763 SPIF--PRQVQHFI-STDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINKK 808
+PI P FI DAS++G G+ + +F S +S + N+ I +
Sbjct: 410 APILRHPDTALPFIVEVDASEVGAGAVLSQRHPLTNKLHPCAFFSRKFSPSEANYDIGNR 469
Query: 809 EMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQD 866
E+ A+ A LL+ + V V +D+ K+L + ++
Sbjct: 470 ELLAIKWAFEEWRHLLEGAKHAVSVFTDH-------------KNLLYIESARRLNPRQAR 516
Query: 867 W-----RIHILAQFIPGAYNSVADSLSRS 890
W R + + PG+ N+ AD+LSRS
Sbjct: 517 WALFFSRFNFSITYRPGSKNTKADALSRS 545
>gi|317419141|emb|CBN81178.1| Pol polyprotein [Dicentrarchus labrax]
Length = 1618
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 112/451 (24%), Positives = 185/451 (41%), Gaps = 55/451 (12%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L +L+ P AM +I + L G+++ S G + F V K + RP ++ +
Sbjct: 688 LPSSRLYNLSRPECEAMEKYIGDSLAAGLIRPSSSPVG--AGFFFVTKKDQSLRPCIDYR 745
Query: 520 GLNQFLSPKKFSLINHFRIPSF--LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD 577
GLN K+ L P+F L K +DL AY V I+ + A +
Sbjct: 746 GLNDITIKNKYPL--PLIDPAFEPLHKAKVFSKLDLRNAYHLVRIREGDEWKTAFNIPLG 803
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLA 637
+PFGL AP F ++ N V + +R + VYLDD L+ +Q +
Sbjct: 804 HFEYLVMPFGLTNAPAVFQAMVNDVLRDMLNRFL--FVYLDDILIFSQSQEEHVQHVRQV 861
Query: 638 VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT 697
+ L V +K + V FLG + ++R + D + A
Sbjct: 862 LQRLLENKLYVKAEKCEFNVTSV-SFLGFI----IERGQVKADPA--------KIQAVAD 908
Query: 698 WNLDSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE--- 753
W + R L +L FA+F R +S+ L + AP A P+ E
Sbjct: 909 WPRPTTRKQLQRFLGFANFYRRFIRNYSKVAAPLTKLTSVKAPF------AWSPEAETAF 962
Query: 754 WWLNALPLSSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSRE 799
L L S+P+ +Q + DASD G G+ + +FLS S
Sbjct: 963 LALKELFTSAPVLRHPDPSLQFVVEVDASDTGIGAVLSQRSPKDQKLHPCAFLSRRLSPA 1022
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEV 857
++N+ + +E+ AV AL L+ + + +V +D++ ++YLR S L+
Sbjct: 1023 ERNYDVGNRELLAVVVALQEWRHWLEGAALPFIVWTDHKN-LAYLR------SAKRLNSR 1075
Query: 858 EKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ + L D + L + PG+ N+ D+LS
Sbjct: 1076 QARWALFLDCFVFTLT-YRPGSRNAKPDALS 1105
>gi|326673505|ref|XP_003199902.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1311
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 112/456 (24%), Positives = 188/456 (41%), Gaps = 78/456 (17%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM+ +I E LE G ++ ST+ + F + K NG RP ++ +GLN+
Sbjct: 471 LSQPETEAMNSYISEELEKGFIR--PSTSPASAGFFFLKKKNGSLRPCIDYRGLNE---- 524
Query: 528 KKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
++ H+ +P L Y +DL AY V I+ + S
Sbjct: 525 --ITIKYHYPLPLVPAALEQLHSAQYFTKLDLRSAYNLVRIRQGDEWKTGFSTINGHYEY 582
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD--PRILEIQGKLAVS 639
+PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L
Sbjct: 583 LVMPFGLANSPSVFQAFINEIFRDILNQW--VIVYIDDILIYSNSLPEHIQHVRAVLQRL 640
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
I L + K + FLG + P M D+Q + A W
Sbjct: 641 IQNQL--YAKVSKCEFHQT-CISFLGYIISPEGVAM----DQQ--------KVDAVTHWP 685
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNA 758
++ R L +L FA+F RR R S + AP LT + A +L+W L A
Sbjct: 686 QPETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNLEA 734
Query: 759 LP---------LSSPIF--PRQVQHFI-STDASDLGWGSQV-----------DSSFLSGL 795
+ +PI P + FI DAS+ G G+ + +F S
Sbjct: 735 INAFNQLKARFTDAPILCHPDPTRPFIVEIDASNSGIGAILCQRSPTTNKLHPCAFYSHK 794
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSL 853
+ ++ + + +E+ A+ AL L+ + V +D++ + Y+R K L+
Sbjct: 795 LNSAERKYDVGNRELLAMKAALEEWRHWLEGAKHPFTVITDHKN-LEYIR---SCKRLNP 850
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 851 RQARWALFFTRFDFQV----TYIPGSKNIKADALSR 882
>gi|341880515|gb|EGT36450.1| hypothetical protein CAEBREN_28622 [Caenorhabditis brenneri]
Length = 2197
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 118/509 (23%), Positives = 210/509 (41%), Gaps = 74/509 (14%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V I + +P A+P VP+ + + HI +L++G + +S T + S +
Sbjct: 28 VHIYTNTEVPVKARPYRVPI--------KYQAELEKHINSLLKSGRIT--ESNTPWTSPI 77
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFH 559
LV K NG R L+ + LN+ P F L RI + L+K +Y S+D++ Y
Sbjct: 78 VLVKKKNGSLRVCLDFRKLNEATIPDNFPLP---RIDAILEKVGGSNYFSSLDMANGYLQ 134
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDD 619
+ + V A T LPFGL +A F V L V+VY+DD
Sbjct: 135 LRLDPASSYKCGFITESKVYAYTHLPFGLKSAASYFQRALRTVLGGLED---EVLVYIDD 191
Query: 620 FLLVNQD-PRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLP 678
L+ ++ + LE K+ + + +K + ++ FLG + +
Sbjct: 192 ILIYSKTFDQHLETLRKV-LHRFRDFNLKASPKKCEFAKKSIV-FLG----HEISKNTYS 245
Query: 679 EDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI--------PMGRLHSRRIQRQ 730
DK N+ + N++ R +G F I P+ RL + + +
Sbjct: 246 PDK----ANVAKITEFPTPTNINEIRRFVGMAGFFRKFIPNFSEISEPLTRLTRKERKFE 301
Query: 731 ASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG----- 784
+L + GA KL L++ P L P + R F DAS + G
Sbjct: 302 WNLDQQGA----------FEKLRTSLSSEPVLGFPDYDRPFHIFC--DASAVAQGAALMQ 349
Query: 785 SQVDS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQT 837
+++D+ ++ S + + W + EM A+ AL P + S +++ SD++
Sbjct: 350 TRLDNEKDFFAIAYASRTLADTETRWPAIQVEMGAIIFALRQFRPYVCMSKIILHSDHKP 409
Query: 838 VVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWH 897
+ L++ +L+ + + Q + I I+ I G N+VAD LSR++ D
Sbjct: 410 LTFLLQKSKTHDNLA------RWLVELQCYDISII--HIDGKKNTVADCLSRARENDD-- 459
Query: 898 LSRSATEQIFLKWGVPCIDLFASRVSAVV 926
+S + + +++ V C+ + A +A V
Sbjct: 460 ISEAVELKDIIEFPV-CMKIDARANAATV 487
>gi|326667704|ref|XP_003198659.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1222
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 105/445 (23%), Positives = 187/445 (42%), Gaps = 56/445 (12%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 332 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 389
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 390 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 446
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 447 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIKN 504
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 505 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 547
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL--TPINPAVLPKLEWWLN 757
++ R L +L FA+F R S ++++ HL P +L+ +
Sbjct: 548 PETIRQLQRFLGFANFYRRFIRNFSSVAAPLTAMVKANNAHLKWNPDAVRAFTQLKTRFS 607
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFL-------SGLWSRE----QQNWHIN 806
+ P+ P Q + DAS+ G G+ + L +SR+ ++N+ +
Sbjct: 608 SAPILRHPDPEQ-PFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKLNSAERNYDVG 666
Query: 807 KKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ A+ AL L+ + +V +D++ + Y+R K L+ +F
Sbjct: 667 NRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPRQARWALFFTR 722
Query: 865 QDWRIHILAQFIPGAYNSVADSLSR 889
D+++ +IPG+ N AD+LSR
Sbjct: 723 FDFQV----TYIPGSKNIKADALSR 743
>gi|326666338|ref|XP_003198245.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1335
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 106/444 (23%), Positives = 186/444 (41%), Gaps = 54/444 (12%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I+E L +G ++ ST+ + F V K +GG RP ++ +GLN
Sbjct: 403 LSLPETQAMENYIEEALASGYIR--PSTSPAAAGFFFVEKKDGGLRPCIDYRGLNSVTVK 460
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +PS L++ +DL AY + I+ + A +
Sbjct: 461 YRYPLP---LVPSALEQLREAHIYSKLDLRSAYNLIRIRAGDEWKTAFLTTRGHYEYLVM 517
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV--NQDPRILEIQGKLAVSILG 642
P+GLA +P F S N + L ++ V+ Y+DD L+ N + I +++ L L
Sbjct: 518 PYGLANSPAVFQSFINEIFHDLLNKC--VITYIDDILIYSPNLEQHIKDVKTVLTRLQLH 575
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-D 701
L L+K FLG + + M D+ + A W L
Sbjct: 576 QL--YAKLEKCEFHVHKT-SFLGYIVSHNGVEM----DQS--------KIQAVTEWPLPK 620
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL 761
+ + L +L FA+F R +S SLL+ G P NP+ + E L
Sbjct: 621 TVKELQRFLGFANFYRRFIRSYSSIAAPLTSLLK-GKPGKLVWNPSAVRAFE-NLKTSFT 678
Query: 762 SSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHINK 807
++PI ++ + DASD G G+ + +F S + ++N+ +
Sbjct: 679 TAPILKHPDPELPFVVEVDASDCGIGAILSQRHGSPGKLHPCAFFSRKLTAAEKNYDVGN 738
Query: 808 KEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
KE+ ++ AL L+ +V + +D++ + Y++ G K ++ +F
Sbjct: 739 KELLSMKAALEEWRHWLEGAVHPFQIITDHKN-LEYIK---GAKRINPRQARWSLFFT-- 792
Query: 866 DWRIHILAQFIPGAYNSVADSLSR 889
R + + PG+ N AD+LSR
Sbjct: 793 --RFNFSVTYRPGSKNLKADALSR 814
>gi|19113631|ref|NP_596839.1| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|28380144|sp|Q9UR07.1|RTF23_SCHPO RecName: Full=Retrotransposable element Tf2 155 kDa protein type 3
gi|6634555|emb|CAB64236.1| retrotransposable element [Schizosaccharomyces pombe]
Length = 1333
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 96/430 (22%), Positives = 188/430 (43%), Gaps = 40/430 (9%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
AM+ I + L++G+++ + + VPK G R V++ K LN+++ P + L
Sbjct: 427 AMNDEINQGLKSGIIRESKAINA--CPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 484
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
++ + +Q +DL AY + ++ + LA V +P+G++ AP
Sbjct: 485 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAH 544
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F N + L + VV Y+D+ L+ ++ K + L + I+N K
Sbjct: 545 FQYFINTI--LGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 655 LSPAPVLQFLGIMWDPHL-DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ V +F+G H+ ++ + P NI + L + N R LG +++
Sbjct: 603 FHQSQV-KFIGY----HISEKGFTP-----CQENIDKVLQWKQPKNRKELRQFLGSVNYL 652
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQ 771
IP S+ +LL+ TP + ++ L + P L F +++
Sbjct: 653 RKFIPKT---SQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKI- 708
Query: 772 HFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
+ TDASD+ G+ + + S S+ Q N+ ++ KEM A+ ++L
Sbjct: 709 -LLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWR 767
Query: 822 PLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
L+S++ + +D++ ++ + + ++ L ++FL QD+ I + PG+
Sbjct: 768 HYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR--WQLFL--QDFNFEI--NYRPGS 821
Query: 880 YNSVADSLSR 889
N +AD+LSR
Sbjct: 822 ANHIADALSR 831
>gi|6425168|gb|AAC33526.2| pol polyprotein [Takifugu rubripes]
Length = 1187
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 110/450 (24%), Positives = 185/450 (41%), Gaps = 54/450 (12%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L +L+ P M +I E L +G+++ S++ + F V K +GG RP ++ +
Sbjct: 278 PTGRLYNLSIPEKEVMRNYITESLASGIIR--PSSSPLAAGFFFVAKEDGGLRPCIDFRK 335
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN K+ L L +DL AY V I+ + A + +
Sbjct: 336 LNNITVKNKYPLPLMSSTFEPLTHARVFTKLDLRNAYHLVQIRKGDEWKTAFNTHLGHFE 395
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
+PFGL+ AP F L N +LR + VVVYLDD L+ ++ +L +
Sbjct: 396 YLVMPFGLSNAPAVFQELVN---DVLRDMINVFVVVYLDDILIFSRTMEEHHQHVRLVLQ 452
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L + +K A V +LG ++ E+ ++ + A W
Sbjct: 453 RLLENRLFIKAEKCIFHSASV-GYLG----------YIVEEGRVRADPA--KIQAVVEWP 499
Query: 700 LDSARS-LLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNA 758
+ R+ L +L FA F+ + S+ ++L P A P+ E +A
Sbjct: 500 RPTDRTQLRRFLGFAGFIRRFIKGFSQVAAPLSALTSTSRPF------AWTPEAETAFSA 553
Query: 759 LP---LSSPIF--PRQVQHFI-STDASDLGWG------SQVD-----SSFLSGLWSREQQ 801
L ++P+ P + FI DASD G G S+ D ++ S + ++
Sbjct: 554 LKDRFTTAPVLAHPDPARQFIVEVDASDAGIGAVLSQRSEADQKIHPCAYFSRRFDPAER 613
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
N+ + +E+ AV+ AL L+ + +V SD++ ++Y+R K L+
Sbjct: 614 NYDVGNRELLAVYGALVEWKHWLEGAKHPFLVWSDHKN-LTYVR---TAKRLNPRQGRWA 669
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R F PG+ N AD+LSR
Sbjct: 670 LFS-----RFDFTLTFRPGSKNIRADALSR 694
>gi|372220276|ref|YP_004956864.1| unnamed protein product [Parrot hepatitis B virus]
gi|364506331|gb|AEW50169.1| polymerase [Parrot hepatitis B virus]
Length = 795
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 110/237 (46%), Gaps = 14/237 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFS-LINHFRIPSFLQKGDYMISID 552
R+FLV K + T R V++ KG N PK +S ++ R L G + IS+D
Sbjct: 396 RIFLVDKNSRNTEEARLVVDFSQFSKGQNALRFPKYWSPNLSTLRRIRILPVGMHRISLD 455
Query: 553 LSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GM 611
LSQA++H+P+ LA+S V P G+ +P S + S + R +
Sbjct: 456 LSQAFYHIPLNPASGSRLAISDGESVYYFRKTPMGVGISPFFLHLFSAAIGSEISRRFNI 515
Query: 612 RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPH 671
Y+DDFLL + + R L + L G +N K + SP ++FLG ++ +
Sbjct: 516 WTFTYMDDFLLCHPNARHLNAVSHAVCTFLQEFGIRINFDKMTPSPVTTIRFLG--YEIN 573
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQ 728
+ + + + L +++ + K ++ + L+G++ FV+P + +S ++
Sbjct: 574 EQYIQIEDSRWTELRTVIKKISVGKWYDWKCIQRLIGHI---QFVLPFTKGNSEMLK 627
>gi|432939100|ref|XP_004082581.1| PREDICTED: uncharacterized protein LOC101155559 [Oryzias latipes]
Length = 1196
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 98/404 (24%), Positives = 170/404 (42%), Gaps = 44/404 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
IQ ML+ GV++ S + + S + +VPK +G RP ++ + +N F RI
Sbjct: 598 EIQRMLDLGVIE--PSRSEWSSPMVMVPKKDGSQRPCIDFRKVNAVSC---FDAYPMPRI 652
Query: 539 PSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+++ Y+ ++DL + Y+ +P+ T +++ A + T +PFGL AP F
Sbjct: 653 DDLVERVGTAKYITTLDLCKGYWQIPLDETSKQYTAFCAPTGLYHFTVMPFGLHGAPATF 712
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
L + ++LR YLDD ++ +Q + + +S + G +N QK
Sbjct: 713 QRLMD---AVLRGFEAFSAAYLDDVVIFSQSWEDHLMHLRAVLSAIEGAGLTLNAQKCEW 769
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
+ V Q+LG R + DK + N A + RS +G + + S
Sbjct: 770 AKGEV-QYLGFQLGGGRIRPLV--DKVDAIRN------AQRPRTKKQVRSFIGLVGWYSN 820
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP----LSSPIFPRQVQ 771
IP + + +L + GAP+ NA+ L SP F Q +
Sbjct: 821 FIPHNSTLATPL---TNLTKKGAPNTVKWTADCEQSFIALKNAMCSSPVLCSPDF--QKR 875
Query: 772 HFISTDASDLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMFAVHQAL-SLNL 821
+ DASD+G G+ + S +LS + + +KE A++ AL SL
Sbjct: 876 FTVQADASDVGIGAVLTQSDQGEERPVLYLSRKLEPREVRFSTIEKEALAINWALESLRY 935
Query: 822 PLLQSSVVMVQSDN----QTVVSYLRRQ-GGTKSLSLLSEVEKI 860
LL + + + + + YL Q GG + L+ +V+ I
Sbjct: 936 YLLGRAGLTLNAQKCEWAKGEFQYLGFQLGGGRIRPLVDKVDAI 979
>gi|294932253|ref|XP_002780180.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239890102|gb|EER11975.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 1222
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 107/437 (24%), Positives = 179/437 (40%), Gaps = 63/437 (14%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ M + G+++R ST+ + VPK NG R ++ + LNQ S+I+ + IP
Sbjct: 132 LDTMEKDGIIQR--STSAYKFPCVYVPKKNGAVRMCIDYRRLNQ------VSVIDAYPIP 183
Query: 540 ------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD--VLAMTCLPFGLATA 591
L ++DL Y+ +P++ + A + T +PFGL A
Sbjct: 184 RPDDVQEHLAGARVFSTLDLRSGYWQMPVRAGDRYKTAFCPGPGFPLYEWTRMPFGLCNA 243
Query: 592 PQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
P F L +++ L V VYLDD L+ + + L + G ++ +
Sbjct: 244 PAGFQRLMDFILGHLPF----VRVYLDDILVFSDSMEQHLDHLRQVFDALRAAGLTLSGE 299
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT-WNLDSARSLLGYL 710
K SL + V +LG ++ +D M +K +R ++ T R LG
Sbjct: 300 KCSLGMSSV-HYLGHIFG--VDGMRPDPEK-------VRAIVEWPTPTTCTELRGFLGLA 349
Query: 711 SFASFVIPM----GRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPI 765
+ IP R + ++ A L ++ H T L+ L LP L P
Sbjct: 350 GYYRHFIPHFSDRARPLHQLVKETAKLEKMVGDHWTQDQEQAFNDLKQALTGLPSLDYPD 409
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALS-- 818
F R VQ I DASD G ++ F S + Q NW +KE +A+ + L
Sbjct: 410 FTRAVQ--IVCDASDFAIGGVIEQDGRPLMFFSQTLTGSQLNWPAYEKEAYALMKCLDRF 467
Query: 819 ----LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQ 874
L PL V + SD++ + +++ K +++ L Q + + +
Sbjct: 468 RHFHLGYPL----EVTIYSDHRP-LQWIQTATSAK-------LQRWCLALQQYNFTV--K 513
Query: 875 FIPGAYNSVADSLSRSK 891
+IPG+ N AD+LSR +
Sbjct: 514 YIPGSTNVRADALSRIR 530
>gi|270016119|gb|EFA12567.1| hypothetical protein TcasGA2_TC004196 [Tribolium castaneum]
Length = 1635
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 116/490 (23%), Positives = 198/490 (40%), Gaps = 84/490 (17%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQ---HLATP-----------VSSAMSLHIQEMLETG 487
L+ ++ Y FS++P L + + H TP + A+ IQEML+ G
Sbjct: 750 LIHLLQEYRCIFSSRPGLTHKYTHEIKLHDKTPFLKRPYPVPFALRPAVDATIQEMLDLG 809
Query: 488 VLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL------SPKKFSLINHFRIPSF 541
V+KR + + S + +V K +G R L+ + +N + P L+ F
Sbjct: 810 VIKR--EASPYASPMTVVKKKDGTVRICLDARMINSKMIADCESPPAADELLRRF----- 862
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
YM +IDL +Y+ +P+ +++ A YNG LPFGL TA +F+ +
Sbjct: 863 -HGIRYMSTIDLRSSYWQIPLSPESRQYTAFLYNGRSYTYQVLPFGLKTAVGSFSRAMDV 921
Query: 602 V-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
V + +R VV Y+DD L+ ++ + L +NL+KS+ V
Sbjct: 922 VLGTEVRE---FVVNYIDDLLVASETLNEHLEHLRQVFEKLKQARMTINLEKSNFIQKEV 978
Query: 661 ------LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF-- 712
L GI DP + I + KT ++ R+ LG +F
Sbjct: 979 KFLGHILTINGIKADPE------------KVSAIRNFPVPQKTKHV---RAFLGLFNFYR 1023
Query: 713 ---ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQ 769
A + L+ ++ R+ R G + LE L P + IF
Sbjct: 1024 KFCARYSAATQDLN--KLLRKGEKWRWGRNEQEAFDRVKDLFLEAVLLHYPDPNKIF--- 1078
Query: 770 VQHFISTDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSL 819
++ TD+S G G+ Q D S F S + N+ +KE+ V L
Sbjct: 1079 ---YVQTDSSGYGLGAELYQIQEDGSRGVFAFASRSLKGPELNYTTTEKELLGVIFVLHK 1135
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
+Q++ +++++D+Q + R + ++ L+ + + L D+ I + + G
Sbjct: 1136 FRIYIQATKIIIRTDHQALKFLSRCRLFSERLTRWT----LILGQYDYEI----ELVKGK 1187
Query: 880 YNSVADSLSR 889
N VAD LSR
Sbjct: 1188 DNVVADILSR 1197
>gi|326673582|ref|XP_003199929.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1231
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 114/455 (25%), Positives = 194/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 436 LSQPETEAMKKYISEELEKGFIQ--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 493
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 494 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 550
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 551 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILIYSNSLSEHIQHVRAVLKRLIEN 608
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 609 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 651
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 652 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 700
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 701 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 760
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 761 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 816
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 817 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 847
>gi|326673785|ref|XP_003199991.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1427
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 106/453 (23%), Positives = 181/453 (39%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G ++ ST+ S F V K +GG RP ++ + LNQ
Sbjct: 490 LSLPETKAMEEYIQEALHQGYIR--PSTSPAASSFFFVTKKDGGLRPCIDYRILNQGTVK 547
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ +DL AY V I+ + A +
Sbjct: 548 FRYPLP---LVPAALEQLRSAKIFTKLDLRSAYNLVRIRRGDEWKTAFVTPTGHYEYRVM 604
Query: 585 PFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
P+GL AP F N++ +LR + V+VY+DD L+ +++ + + L
Sbjct: 605 PYGLVNAPSVF---QNFIHEVLREFLHLFVIVYIDDILIYSRNEVEHRHHVEKVLQTLRK 661
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDS 702
+ +K S P +QFLG + D RM ++ ++T A +W +
Sbjct: 662 HQLYLKAEKCSFH-LPSVQFLGYVIDKRGVRM---DEGKVT---------AVVSWPEPTT 708
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP----------AVLPKL 752
+ L +L FA+F RR SL+ +L P A L
Sbjct: 709 VKELQRFLGFANFY--------RRFIHNYSLVTAPLTNLLKNKPKKLSWPSDAAAAFRNL 760
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
+ PL + P + + DAS G G+ + ++ S S ++
Sbjct: 761 KEAFTRAPLLTHPDP-DLPFIVEVDASTTGVGAILSQFHGTPKLLHPCAYFSRKLSPAEK 819
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ I +E+ A+ AL L+ + + Q + + K+L L + +++
Sbjct: 820 NYDIGNRELLAIKLALEEWRHWLEGA----KHPFQVITDH-------KNLQYLKDAKRLC 868
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R H + PG+ N AD+LSR
Sbjct: 869 PRQARWSLFFSRFHFSITYRPGSKNIRADALSR 901
>gi|326664766|ref|XP_003197878.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1515
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 114/455 (25%), Positives = 192/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDS--TTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
L+ P + AM +I E LE G ++ S + GF F V K +G P ++ +GLN+
Sbjct: 338 LSQPETEAMKSYISEELEKGFIRPATSPASAGF----FFVKKKDGSLCPCIDYRGLNEIT 393
Query: 526 SPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
++ L +P+ L++ Y +DL AY + I+ + S
Sbjct: 394 VKYRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYL 450
Query: 583 CLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSI 640
+PFGLA +P F + N V + ++ V+VY+DD L+ + I ++ L I
Sbjct: 451 VMPFGLANSPSVFQAFVNEVFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLERLI 508
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L + K + FLG + P + M D+Q + + W
Sbjct: 509 QNQL--YAKISKCEFHQT-CISFLGYIISPEVVAM----DQQ--------KVDSVTQWPQ 553
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 554 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 602
Query: 760 PL---------SSPIF--PRQVQHFI-STDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q FI DAS+ G G+ + L +SR+
Sbjct: 603 RAFTQLKTRFSSAPILRHPDPEQPFIVEIDASNTGIGAILSQRSLVTKKLHPCAFYSRKL 662
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + V +D++ + Y+R K L+
Sbjct: 663 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFTVITDHKN-LEYIR---SCKRLNPR 718
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 719 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 749
>gi|326674339|ref|XP_003200115.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1427
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 106/453 (23%), Positives = 181/453 (39%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G ++ ST+ S F V K +GG RP ++ + LNQ
Sbjct: 490 LSLPETKAMEEYIQEALHQGYIR--PSTSPAASSFFFVTKKDGGLRPCIDYRILNQGTVK 547
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ +DL AY V I+ + A +
Sbjct: 548 FRYPLP---LVPAALEQLRSAKIFTKLDLRSAYNLVRIRRGDEWKTAFVTPTGHYEYRVM 604
Query: 585 PFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
P+GL AP F N++ +LR + V+VY+DD L+ +++ + + L
Sbjct: 605 PYGLVNAPSVF---QNFIHEVLREFLHLFVIVYIDDILIYSRNEVEHRHHVEKVLQTLRK 661
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDS 702
+ +K S P +QFLG + D RM ++ ++T A +W +
Sbjct: 662 HQLYLKAEKCSFH-LPSVQFLGYVIDKRGVRM---DEGKVT---------AVVSWPEPTT 708
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP----------AVLPKL 752
+ L +L FA+F RR SL+ +L P A L
Sbjct: 709 VKELQRFLGFANFY--------RRFIHNYSLVTAPLTNLLKNKPKKLSWPSEAAAAFRNL 760
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
+ PL + P + + DAS G G+ + ++ S S ++
Sbjct: 761 KEAFTRAPLLTHPDP-DLPFIVEVDASTTGVGAILSQFHGTPKLLHPCAYFSRKLSPAEK 819
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ I +E+ A+ AL L+ + + Q + + K+L L + +++
Sbjct: 820 NYDIGNRELLAIKLALEEWRHWLEGA----KHPFQVITDH-------KNLQYLKDAKRLC 868
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R H + PG+ N AD+LSR
Sbjct: 869 PRQARWSLFFSRFHFSITYRPGSKNIRADALSR 901
>gi|156055084|ref|XP_001593466.1| hypothetical protein SS1G_04893 [Sclerotinia sclerotiorum 1980]
gi|154702678|gb|EDO02417.1| hypothetical protein SS1G_04893 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 1093
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 104/452 (23%), Positives = 178/452 (39%), Gaps = 39/452 (8%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
+P + L L T A+ I E L G + S F + + V K NG R
Sbjct: 257 TQPNNLTLSPLYRQTTQELQALKKFIDENLNRGWIA--PSNASFAAPILFVKKANGDLRL 314
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ + LN+ + + L I S + K +DL A+ + + + A
Sbjct: 315 CVDYRKLNEISAKDGYPLPRIDEILSQMSKAKIFTKLDLRAAFNAIRMHPDSEELTAFQT 374
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
LPFGL+ P + N + L+ + G YLDD ++ + DP Q
Sbjct: 375 CFGQFKSLVLPFGLSGGPGTYQRFINNL--LMENLGNFCTAYLDDIIIYSTDPSEHTAQV 432
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ ++ L G V+++K S + + ++LG ++ L D + I L
Sbjct: 433 RWVLTKLKEAGLSVDIKKCDFSVSRI-KYLGF----YVSTKGLEVDPE----KIKDILTW 483
Query: 695 SKTWNLDSARSLLGYLSFA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
+ + R LG+ F F+ GR+ +R + + R+ P L
Sbjct: 484 KRPTTVKGVRGFLGFCGFYRKFIKNYGRI-ARPLDKLTQKGRIF--DWDPDCQKAFETLR 540
Query: 754 WWLNALPLSSPIFPRQVQHFISTDASDLGWG---SQVDS--------SFLSGLWSREQQN 802
+ P+ P ++ + TD+SD G SQ+D +F S + + N
Sbjct: 541 QAVTEAPVLHYFHPDRLTK-VETDSSDGVTGGILSQLDPATKEWHPLAFFSKTMNPAECN 599
Query: 803 WHINKKEMFAVHQALSLNLPLLQS--SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
+ I+ KEM A+ QA LQS + V V SD+Q++ ++R + T + +E
Sbjct: 600 YEIHDKEMLAILQAFQQWRVELQSVENPVQVYSDHQSLEIFMRTKKLTARQARWAEYLSQ 659
Query: 861 FLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
F ++R G N AD+L+R S
Sbjct: 660 FNFQLEYRT--------GKANGQADALTRRDS 683
>gi|19115809|ref|NP_594897.1| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|63054667|ref|NP_594898.2| retrotransposable element [Schizosaccharomyces pombe 972h-]
gi|28380135|sp|Q9C0R2.1|RTF22_SCHPO RecName: Full=Retrotransposable element Tf2 155 kDa protein type 2
gi|13810242|emb|CAC37430.1| retrotransposable element [Schizosaccharomyces pombe]
gi|159884042|emb|CAC37431.2| retrotransposable element [Schizosaccharomyces pombe]
Length = 1333
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 96/430 (22%), Positives = 188/430 (43%), Gaps = 40/430 (9%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
AM+ I + L++G+++ + + VPK G R V++ K LN+++ P + L
Sbjct: 427 AMNDEINQGLKSGIIRESKAINA--CPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPL 484
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
++ + +Q +DL AY + ++ + LA V +P+G++ AP
Sbjct: 485 IEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISIAPAH 544
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F N + L + VV Y+D+ L+ ++ K + L + I+N K
Sbjct: 545 FQYFINTI--LGEVKESHVVCYMDNILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCE 602
Query: 655 LSPAPVLQFLGIMWDPHL-DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ V +F+G H+ ++ + P NI + L + N R LG +++
Sbjct: 603 FHQSQV-KFIGY----HISEKGFTP-----CQENIDKVLQWKQPKNRKELRQFLGSVNYL 652
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQ 771
IP S+ +LL+ TP + ++ L + P L F +++
Sbjct: 653 RKFIPKT---SQLTHPLNNLLKKDVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKI- 708
Query: 772 HFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
+ TDASD+ G+ + + S S+ Q N+ ++ KEM A+ ++L
Sbjct: 709 -LLETDASDVAVGAVLSQKHDDDKYYPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWR 767
Query: 822 PLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
L+S++ + +D++ ++ + + ++ L ++FL QD+ I + PG+
Sbjct: 768 HYLESTIEPFKILTDHRNLIGRITNESEPENKRLAR--WQLFL--QDFNFEI--NYRPGS 821
Query: 880 YNSVADSLSR 889
N +AD+LSR
Sbjct: 822 ANHIADALSR 831
>gi|326673518|ref|XP_003199906.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1239
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 111/469 (23%), Positives = 179/469 (38%), Gaps = 81/469 (17%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +IQE L+ +++ S G + F V K +G RP ++ +G
Sbjct: 336 PRGRLYSLSRPEREAMDRYIQESLKADLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 393
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 394 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 453
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 454 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSKQVHTQHVRQVLQR 511
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFL-------GIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
L V +K V FL GI DP R
Sbjct: 512 LLENQLYVKAEKCVFHTKSV-SFLGHIVSTEGIKADPAKVR------------------- 551
Query: 694 ASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKL 752
A W + +S ++L +L FA+F RR R S + P+ PK+
Sbjct: 552 AVAEWPIPNSRKALQRFLGFANFY--------RRFIRNFSSV------AAPLTALTSPKV 597
Query: 753 EWWLNALP-----------LSSPIFPR---QVQHFISTDASDLGWGSQVD---------- 788
+ N+ +++P+ + Q + DAS++G G+ +
Sbjct: 598 PFIWNSRAQEAFDVIKSRFITAPVLSLPDPERQFIVEVDASEVGVGAVLSQRSLRDGKVH 657
Query: 789 -SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQ 845
+F S S ++N+ I +E+ AV AL L+ + +V +D++ + Y+R
Sbjct: 658 PCAFFSHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAHPFLVWTDHKN-LEYIR-- 714
Query: 846 GGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
K L+ +F R + PG+ N D+LSR +P
Sbjct: 715 -SAKRLNSRQARWALFF----GRFTFSLSYRPGSKNIKPDALSRLFDVP 758
>gi|326681142|ref|XP_003201727.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Danio rerio]
Length = 1317
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 106/453 (23%), Positives = 181/453 (39%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G ++ ST+ S F V K +GG RP ++ + LNQ
Sbjct: 421 LSLPETKAMEEYIQEALHQGYIR--PSTSPAASSFFFVTKKDGGLRPCIDYRILNQGTVK 478
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ +DL AY V I+ + A +
Sbjct: 479 FRYPLP---LVPAALEQLRSAKIFTKLDLRSAYNLVRIRRGDEWKTAFVTPTGHYEYRVM 535
Query: 585 PFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
P+GL AP F N++ +LR + V+VY+DD L+ +++ + + L
Sbjct: 536 PYGLVNAPSVF---QNFIHEVLREFLHLFVIVYIDDILIYSRNEVEHRHHVEKVLQTLRK 592
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDS 702
+ +K S P +QFLG + D RM ++ ++T A +W +
Sbjct: 593 HQLYLKAEKCSFH-LPSVQFLGYVIDKRGVRM---DEGKVT---------AVVSWPEPTT 639
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP----------AVLPKL 752
+ L +L FA+F RR SL+ +L P A L
Sbjct: 640 VKELQRFLGFANFY--------RRFIHNYSLVTAPLTNLLKNKPKKLSWPSEAAAAFRNL 691
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
+ PL + P + + DAS G G+ + ++ S S ++
Sbjct: 692 KEAFTRAPLLTHPDP-DLPFIVEVDASTTGVGAILSQFHGTPKLLHPCAYFSRKLSPAEK 750
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ I +E+ A+ AL L+ + + Q + + K+L L + +++
Sbjct: 751 NYDIGNRELLAIKLALEEWRHWLEGA----KHPFQVITDH-------KNLQYLKDAKRLC 799
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R H + PG+ N AD+LSR
Sbjct: 800 PRQARWSLFFSRFHFSITYRPGSKNIRADALSR 832
>gi|326673507|ref|XP_003199903.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1334
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 112/456 (24%), Positives = 188/456 (41%), Gaps = 78/456 (17%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM+ +I E LE G ++ ST+ + F + K NG RP ++ +GLN+
Sbjct: 471 LSQPETEAMNSYISEELEKGFIR--PSTSPASAGFFFLKKKNGSLRPCIDYRGLNE---- 524
Query: 528 KKFSLINHFRIP------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
++ H+ +P L Y +DL AY V I+ + S
Sbjct: 525 --ITIKYHYPLPLVPAALEQLHSAQYFTKLDLRSAYNLVRIRQGDEWKTGFSTINGHYEY 582
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD--PRILEIQGKLAVS 639
+PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L
Sbjct: 583 LVMPFGLANSPSVFQAFINEIFRDILNQW--VIVYIDDILIYSNSLPEHIQHVRAVLQRL 640
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
I L + K + FLG + P M D+Q + A W
Sbjct: 641 IQNQL--YAKVSKCEFHQT-CISFLGYIISPEGVAM----DQQ--------KVDAVTHWP 685
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNA 758
++ R L +L FA+F RR R S + AP LT + A +L+W L A
Sbjct: 686 QPETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNLEA 734
Query: 759 LP---------LSSPIF--PRQVQHFI-STDASDLGWGSQV-----------DSSFLSGL 795
+ +PI P + FI DAS+ G G+ + +F S
Sbjct: 735 INAFNQLKARFTDAPILCHPDPTRPFIVEIDASNSGIGAILCQRSPTTNKLHPCAFYSHK 794
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSL 853
+ ++ + + +E+ A+ AL L+ + V +D++ + Y+R K L+
Sbjct: 795 LNSAERKYDVGNRELLAMKAALEEWRHWLEGAKHPFTVITDHKN-LEYIR---SCKRLNP 850
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 851 RQARWALFFTRFDFQV----TYIPGSKNIKADALSR 882
>gi|326666973|ref|XP_003198437.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1107
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 114/455 (25%), Positives = 194/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 343 LSQPETEAMKNYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 400
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 401 CRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 457
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 458 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIQN 515
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 516 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 558
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 559 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 607
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 608 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 667
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 668 NSAERNYDLGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 723
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 724 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 754
>gi|326669078|ref|XP_003198929.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1371
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 112/475 (23%), Positives = 183/475 (38%), Gaps = 81/475 (17%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I E L+ G+++ S G + F V K +G RP ++ +G
Sbjct: 331 PKGRLYSLSRPEREAMDKYINESLKAGLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 388
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 389 LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLVRIREGDEWKSAFNTPTGHFE 448
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V L V VYLDD L+ + + + +
Sbjct: 449 YRVLPFGLTNAPAVFQALVNDV--LRDMVNQFVFVYLDDILIFSPSLQAHTQHVRQVLQR 506
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL---RTLLASKT 697
L V +K V FLG + ++ G I + A
Sbjct: 507 LLENQLFVKAEKCVFHTQSV-SFLGFL---------------ISAGEISADPAKVRAVAE 550
Query: 698 W-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW- 755
W DS ++L +L FA+F RR R + ++ AP + VL W
Sbjct: 551 WPTPDSRKALQRFLGFANFY--------RRFIR--NFGQIAAPLTALTSSKVL--FRWGD 598
Query: 756 --------LNALPLSSPIF----PRQVQHFISTDASDLGWGSQVD-----------SSFL 792
L + +S+P+ P+Q Q + DAS++G G+ + +F
Sbjct: 599 KAQEAFDKLKSRFISAPVLSIPDPKQ-QFIVEVDASEVGVGAVLSQRSLQDGKVHPCAFF 657
Query: 793 SGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKS 850
S + ++N+ I +E+ AV AL L+ + +V +D+ ++
Sbjct: 658 SHRLTPTERNYDIGNRELLAVRLALGEWRHWLEGAEQPFVVWTDH-------------RN 704
Query: 851 LSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSRSKSLPDWHLSR 900
L ++ +++ W R + + PG+ N DSLSR P+ +S+
Sbjct: 705 LEYINSAKRLNARQARWSLFFSRFNFTLSYRPGSKNVKPDSLSRLFEAPERVVSK 759
>gi|326664035|ref|XP_003197715.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1310
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 111/469 (23%), Positives = 179/469 (38%), Gaps = 81/469 (17%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +IQE L+ +++ S G + F V K +G RP ++ +G
Sbjct: 336 PRGRLYSLSRPEREAMDRYIQESLKADLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 393
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 394 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 453
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 454 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSKQVHTQHVRQVLQR 511
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFL-------GIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
L V +K V FL GI DP R
Sbjct: 512 LLENQLYVKAEKCVFHTKSV-SFLGHIVSTEGIKADPAKVR------------------- 551
Query: 694 ASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKL 752
A W + +S ++L +L FA+F RR R S + P+ PK+
Sbjct: 552 AVAEWPIPNSRKALQRFLGFANFY--------RRFIRNFSSV------AAPLTALTSPKV 597
Query: 753 EWWLNALP-----------LSSPIFPR---QVQHFISTDASDLGWGSQVD---------- 788
+ N+ +++P+ + Q + DAS++G G+ +
Sbjct: 598 PFIWNSRAQEAFDVIKSRFITAPVLSLPDPERQFIVEVDASEVGVGAVLSQRSLRDGKVH 657
Query: 789 -SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQ 845
+F S S ++N+ I +E+ AV AL L+ + +V +D++ + Y+R
Sbjct: 658 PCAFFSHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAHPFLVWTDHKN-LEYIR-- 714
Query: 846 GGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
K L+ +F R + PG+ N D+LSR +P
Sbjct: 715 -SAKRLNSRQARWALFF----GRFTFSLSYRPGSKNIKPDALSRLFDVP 758
>gi|326664909|ref|XP_003197912.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1225
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 115/457 (25%), Positives = 180/457 (39%), Gaps = 49/457 (10%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G P ++ +G
Sbjct: 368 PKGKLYSLSVPEREAMEKYISDSLAAKIIRLSSSPAG--AGFFFVKKKDGSLHPCIDYRG 425
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 426 LNSITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 485
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYLDD L+ + + + +
Sbjct: 486 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQ 542
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q A W
Sbjct: 543 RLLENGLYVKAEKYVFH-AQSVQFLGHIVSVEGMRM-DPEKIQ-----------AVVDWP 589
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLN 757
DS ++L +L FA+F R S+ SL P + A KL+
Sbjct: 590 TPDSRKALQRFLGFANFYRRFIRNFSQLATPLTSLTSSKTPFRWSSAAEAAFSKLKGCFV 649
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
+ P+ P + Q + DAS++G G+ + ++ S S ++N+ I
Sbjct: 650 SAPILFAPDPSR-QFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSPAERNYDIG 708
Query: 807 KKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ V +V +D++ + Y+R K L+ +F
Sbjct: 709 NRELLAVKLALEEWRHWLEGLGVPFIVWTDHKN-LEYIR---SAKRLNSRQARWALFF-- 762
Query: 865 QDWRIHILAQFIPGAYNSVADSLSR------SKSLPD 895
R + + PG+ N D+LSR KS PD
Sbjct: 763 --GRFNFTISYRPGSKNIKPDALSRLFDPSDRKSSPD 797
>gi|326679559|ref|XP_003201328.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1315
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 116/458 (25%), Positives = 185/458 (40%), Gaps = 69/458 (15%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P A+ ++ E L G + S G + F V K +G RP ++ +G
Sbjct: 411 PRGRLFALSAPEREALDKYLSESLAAGTIVPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 468
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN+ K+ L LQ +DL AY V IK + A +
Sbjct: 469 LNEITVKNKYPLPLISTAFDILQGARIFTKLDLRNAYHLVRIKAGDEWKSAFNTPFGHFE 528
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD--PRILEIQGKLAV 638
LPFGL AP F +L N V L + V VYLDD L+ + D I ++ L
Sbjct: 529 YRVLPFGLVNAPAVFQALINDV--LCDMLNIFVFVYLDDILIFSPDLPTHIQHVRRVLQR 586
Query: 639 SILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG-NILRTLLASKT 697
+ L V +K V FLG ++ DK +++ LR ++
Sbjct: 587 LLENRL--FVKSEKCDFHTCSV-PFLG----------YIISDKGVSMDPAKLRGVI---D 630
Query: 698 WNLDSAR-SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW-- 754
W + +R +L +L F++F RR R S ++ AP LT + A + EW
Sbjct: 631 WPIPESRVALQRFLGFSNFY--------RRFIRNFS--QIAAP-LTALTSAKT-RFEWSD 678
Query: 755 -------WLNALPLSSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSFLS 793
L + S+PI + Q + DASD+G G+ + +F S
Sbjct: 679 SAQQAFDRLKRMFASAPILITPDPERQFIVEVDASDVGVGAVLSQRSAEDNKVHPCAFFS 738
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSL 851
+ ++N+ + +E+ AV AL L+ + + +V +D++ + Y+R K L
Sbjct: 739 HRLTPAERNYDVGNRELRAVRLALGEWRHWLEGASIPFVVWTDHRN-LEYIR---SVKRL 794
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ +F R + + PG N D+LSR
Sbjct: 795 NARQARWALFFN----RFNFTISYRPGTKNIKPDALSR 828
>gi|12958103|gb|AAK07487.1| gag-pol polyprotein [Clonorchis sinensis]
Length = 1304
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 100/428 (23%), Positives = 176/428 (41%), Gaps = 47/428 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRI 538
+ MLE G+++ S++ + S L +VPK + G RP + + LN P ++ + +
Sbjct: 464 FEHMLELGIIR--TSSSHWSSPLHMVPKKSKGDWRPCGDYRSLNYATIPDRYPIPHIHDF 521
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L + +DL +AY+H+P+ A++ + T +PFGL A Q F
Sbjct: 522 ASTLCHTNIFSKLDLVRAYYHIPVAPDDIPKTAITTPFGLFEFTRIPFGLRNAAQTFQRF 581
Query: 599 SNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V RG+ V YLDD L+ + P + L + +N+ K L
Sbjct: 582 MDEVL-----RGLPFVYAYLDDVLIASTSPTEHAAHLRAVFERLSTYSIRLNIDK-CLFG 635
Query: 658 APVLQFLGIMWDPHLDRMWLP--EDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
L FLG H+D + D+ L L + L R +G +++
Sbjct: 636 VTSLDFLG----HHIDSTGISPLPDRILALESF------PIPTTLTQLRRFIGIINYYRR 685
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPH----LTPINPAVLPKLEWWL-NALPLSSPIFPRQV 770
IP H I + + L LG+ L P+ A + + + +A LS
Sbjct: 686 FIP----HCADILQPLTDL-LGSKEKSVTLPPVAIAAFERAKQAIAHATKLSFLDTHEST 740
Query: 771 QHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
+ ++TDAS+ G+ + +F S Q + +E+ A++ A+
Sbjct: 741 KLILTTDASNAAVGAVLHQVVNNASQPLTFFSQKMQAAQTRYSTFGRELLAIYLAIRHFR 800
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYN 881
LL+ +Q+D++ + + S + ++ I + D R + PG+ N
Sbjct: 801 HLLEGKSFTIQTDHKPLTYAFNAKPDRYSPREIRHLDYISQFTTDIR------YTPGSDN 854
Query: 882 SVADSLSR 889
VAD+LSR
Sbjct: 855 VVADALSR 862
>gi|326672366|ref|XP_003199652.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1310
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 111/469 (23%), Positives = 179/469 (38%), Gaps = 81/469 (17%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +IQE L+ +++ S G + F V K +G RP ++ +G
Sbjct: 336 PRGRLYSLSRPEREAMDRYIQESLKADLIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 393
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 394 LNDITVKNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 453
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L N V + +R V VYLDD L+ + ++ + +
Sbjct: 454 YRVLPFGLTNAPAVFQALVNDVLRDMVNRF--VFVYLDDILIFSPSKQVHTQHVRQVLQR 511
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFL-------GIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
L V +K V FL GI DP R
Sbjct: 512 LLENQLYVKAEKCVFHTKSV-SFLGHIVSTEGIKADPAKVR------------------- 551
Query: 694 ASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKL 752
A W + +S ++L +L FA+F RR R S + P+ PK+
Sbjct: 552 AVAEWPIPNSRKALQRFLGFANFY--------RRFIRNFSSV------AAPLTALTSPKV 597
Query: 753 EWWLNALP-----------LSSPIFPR---QVQHFISTDASDLGWGSQVD---------- 788
+ N+ +++P+ + Q + DAS++G G+ +
Sbjct: 598 PFIWNSRAQEAFDVIKSRFITAPVLSLPDPERQFIVEVDASEVGVGAVLSQRSLRDGKVH 657
Query: 789 -SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQ 845
+F S S ++N+ I +E+ AV AL L+ + +V +D++ + Y+R
Sbjct: 658 PCAFFSHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAHPFLVWTDHKN-LEYIR-- 714
Query: 846 GGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
K L+ +F R + PG+ N D+LSR +P
Sbjct: 715 -SAKRLNSRQARWALFF----GRFTFSLSYRPGSKNIKPDALSRLFDVP 758
>gi|315360731|gb|ADU05359.1| polymerase [Duck hepatitis B virus]
Length = 786
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 389 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 447
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 448 SQAFYHLPLNPASSSRLAVSDGQWVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 507
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D H
Sbjct: 508 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTEIRFLGYQIDEHF 567
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 568 --MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|427798111|gb|JAA64507.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 1010
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/380 (22%), Positives = 156/380 (41%), Gaps = 37/380 (9%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
RI + A P KP LV ++ ++ +QEML+ GV++ +S + + + +
Sbjct: 148 RIDTSTAHPIRQKPYLV--------SSSERKVIADQVQEMLQKGVIE--ESCSPWAAPVI 197
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIK 563
LV K +G R ++ + LN + L + L Y S+DL Y+ +P+
Sbjct: 198 LVKKKDGSWRFCVDYRRLNALTKKDVYPLPRIDDVIDCLHSASYFSSVDLRSGYWQIPMH 257
Query: 564 TTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRV-VVYLDDFLL 622
+ A + +PFGL AP F ++ S+LR V + YLDD ++
Sbjct: 258 PADKEKTAFVTPDGLYQFNVMPFGLCNAPAIF---ERFMDSILRGLKWEVCLCYLDDVVI 314
Query: 623 VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQ 682
+ + L + G G ++N +K L LG + D R P+ ++
Sbjct: 315 FGRSFEEHNARLSLVLDCFGKAGLVLNSKKCRFGERQTL-VLGHLVDKDGVR---PDPRK 370
Query: 683 LTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HL 741
+ + + K RS LG S+ +P + + SLLR +P
Sbjct: 371 IEAVSTFKPPQTQK-----ELRSFLGLCSYFRRFVPR---FADVVHPLTSLLRKDSPFEW 422
Query: 742 TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG---------SQVDSSFL 792
TP A +L++ L + P+ P + TDAS +G G ++ ++
Sbjct: 423 TPECDAAFSELKFLLTSEPILRHFDP-SAHTEVHTDASGIGIGGVLVQRHNNAEHVVAYT 481
Query: 793 SGLWSREQQNWHINKKEMFA 812
S S+ ++N+ + ++E A
Sbjct: 482 SRSLSKAERNYTVTEQECLA 501
>gi|40786831|gb|AAR89924.1| polymerase protein [Ross's goose hepatitis B virus]
Length = 783
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 99/220 (45%), Gaps = 11/220 (5%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPK------KFSLINHFRIPSFLQKGDYMISIDLS 554
R+FLV K + T + +QF K K+ N + L G IS+DLS
Sbjct: 386 RIFLVDKNSRNTEEARLVVDFSQFSKGKHAMRFPKYWSPNLSTLRRILPMGMPRISLDLS 445
Query: 555 QAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRV 613
QA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 446 QAFYHLPLNPASSSRLAISDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVWT 505
Query: 614 VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLD 673
Y+DDFLL + + R L S L LG +N K++ SP ++FLG + D D
Sbjct: 506 FTYMDDFLLCHPNARHLNAISHAVCSFLQELGVRINFDKTTPSPVTEIKFLGYLID---D 562
Query: 674 RMWLPEDKQLT-LGNILRTLLASKTWNLDSARSLLGYLSF 712
+ ED++ L +++ + K ++ + +G+L+F
Sbjct: 563 KYMKIEDQRWNELRQVIKKIQVGKWYDWKCIQRFIGHLNF 602
>gi|242093834|ref|XP_002437407.1| hypothetical protein SORBIDRAFT_10g026363 [Sorghum bicolor]
gi|241915630|gb|EER88774.1| hypothetical protein SORBIDRAFT_10g026363 [Sorghum bicolor]
Length = 1609
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 114/479 (23%), Positives = 193/479 (40%), Gaps = 59/479 (12%)
Query: 439 PAPLVRIVSGYAIPFSAKPPLVPLCSLQH------LATPVS-----------SAMSLHIQ 481
PA L R++ YA F+ L P H PV+ +
Sbjct: 657 PALLERLLDAYADVFAEPDGLPPARDCDHRIHLKPATEPVAVRPYRYPQLQKDELERQCD 716
Query: 482 EMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF 541
ML+ G ++ ST+ F + + LV K +G R ++ + LN KF + +
Sbjct: 717 AMLQQGTIRA--STSPFSAPVLLVKKQDGSWRFCVDYRALNSATVKDKFPIPVVEELLDE 774
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
L+ + +DL Y + + A + +PFGL+ AP F +L N
Sbjct: 775 LRGARFFTKLDLRSGYHQIRVHPDDVAKTAFRTHHGHFEFLVMPFGLSNAPSTFQALMNT 834
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLS-PA 658
V R V+V+ DD L+ + +L+++ L V SL +L++S S A
Sbjct: 835 VLKPFLRRC--VLVFFDDILIYSATWTEHLLQLRAVLDVLRTHSL----HLKRSKCSFAA 888
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDSARSLLGYLSFASFVI 717
+ +LG + + + + A ++W SAR L G+L A +
Sbjct: 889 TSVHYLGHVI------------SHAGVSMDVSKVAAVQSWPQPRSARGLRGFLGLAGYYR 936
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFIST 776
+ + SLLR A T L+ L+A P L P F ++ F+
Sbjct: 937 RFIKDYGAIAAPLTSLLRKNAFLWTAEAEDAFSALKQALSAAPVLHLPDF--NLEFFVDC 994
Query: 777 DASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMV 831
DAS G+G+ + +F S ++ ++E+ + QA+ P L +V
Sbjct: 995 DASGSGFGAVLHQGEGPLAFFSRPFAVRHLKVAAYERELIGLVQAVRHWRPYLWGRSFIV 1054
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLSEVEKIF-LLSQDWRIHILAQFIPGAYNSVADSLSR 889
++D+ + L ++ LS + + I L+ D+RI +F PG +N VAD+LSR
Sbjct: 1055 RTDHYALKFLLDQR-----LSTIPQNHWISKLMGYDFRI----EFRPGRFNVVADALSR 1104
>gi|326664020|ref|XP_003197713.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1459
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 101/454 (22%), Positives = 182/454 (40%), Gaps = 71/454 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ + M+ +I+E L+ G ++ ST+ + F V K +GG RP ++ +GLN
Sbjct: 508 LSQTETETMNAYIEEELKKGFIRH--STSPASAGFFFVEKKDGGLRPCIDYRGLNAITVK 565
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L L+ Y +DL AY + IK H+ A S + +PFG
Sbjct: 566 YRYPLPLVPAALELLRTAKYFTKLDLRSAYNLIRIKKNHEWKTAFSTSSGHYEYLVMPFG 625
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLG 645
LA +P F + N V + +R V+VY+DD L+ ++ + I ++ L I L
Sbjct: 626 LANSPSVFQAFINDVFRDMLNRW--VIVYIDDILIYSESLEEHISHVRAVLQRLIEHRL- 682
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDSAR 704
L+K + FLG + + + D+Q + A W + +
Sbjct: 683 -YAKLEKCEFHQTSI-SFLGYI----IGTEGVAMDEQ--------KVQAVLKWPKPRTIK 728
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP-------------K 751
L +L FA+F RR + SL+ LT +++ +
Sbjct: 729 ELQRFLGFANFY--------RRFIQNFSLVASPLTSLTRGKGSIIKWNDTAERAFAKKYR 780
Query: 752 LEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQ 800
L++ P+ P ++ + DAS+ G G+ + +F S + +
Sbjct: 781 LKYRFATAPILHHPNP-ELPFIVEIDASNTGIGAILSQKQGSPSKSHPCAFFSRKLNSAE 839
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
QN+ + +E+ A+ A+ L+ + TV++ K+L + +++
Sbjct: 840 QNYDVGNRELLAMKAAMEEWRHWLEGA-----KHKFTVIT------DHKNLEYIHSAKRL 888
Query: 861 FLLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R +IPG+ N AD+LSR
Sbjct: 889 NPRQARWALFFTRFDFTVTYIPGSKNVKADALSR 922
>gi|340377297|ref|XP_003387166.1| PREDICTED: hypothetical protein LOC100634483 [Amphimedon
queenslandica]
Length = 1302
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 109/266 (40%), Gaps = 42/266 (15%)
Query: 415 SRIGAELVGGRLRRFVDAWIRLGAPAPLV--RIVSGYAIPFSAKPPLVPLCSLQHLATPV 472
S G E V G L++++ W P + I +GY +P PP P
Sbjct: 526 SEHGVENVKGHLKQYIVFWRDALTATPYIIDVIENGYRLPLICSPP--------SYGAPN 577
Query: 473 SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL 532
S+ + + E V+K +D+ G + ++ K F L
Sbjct: 578 HSSTKDKVDFVTE-AVVKLVDN-------------------------GCAKIVAQKPFYL 611
Query: 533 INH--FRIP-SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
+ F+ + ++ Y+ + DL Y HV I + HQ +L + T LPFGL+
Sbjct: 612 FKYEDFKTALDYFEEDAYLFTFDLKSGYHHVDIHSEHQTYLGFQWEHKFYVFTVLPFGLS 671
Query: 590 TAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG-KLAVSILGSLGWIV 648
TA F L V +++ G+R+V+YLDD ++ + I KL L G ++
Sbjct: 672 TACSIFTKLLWHVVKYIQACGIRLVLYLDDGIVSVKASESQAIAASKLVEDTLVKAGLVI 731
Query: 649 NLQKSSLSPAPVLQFLGIMWDPHLDR 674
+KS+ P +LG +D LD+
Sbjct: 732 KKEKSNFVPPKHASWLG--FDIDLDQ 755
>gi|301608127|ref|XP_002933648.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1391
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 114/486 (23%), Positives = 192/486 (39%), Gaps = 67/486 (13%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
+ ++SG AIPF PL + P + +I+E L G ++ S G + +
Sbjct: 502 IDLLSGAAIPFGRIYPL---------SEPELVILKNYIEENLRKGFIRPSTSPAG--AGI 550
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFH 559
F V K + RP ++ + LN+ ++ L IP Q+ +DL AY
Sbjct: 551 FFVEKKDHSLRPCIDYRDLNKITVKNRYPLP---LIPELFQRLRSAKVFSKLDLRGAYNL 607
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLD 618
+ I+ + A +PFGL AP A+ ++V + R V+VYLD
Sbjct: 608 IRIRKGDEWKTAFRTRYGHFEYLVMPFGLCNAP---ATFQHFVNDIFRDFLDHFVIVYLD 664
Query: 619 DFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLP 678
D L+ + + K + L + L+K + + +FLG++ P D M +
Sbjct: 665 DILVFSPSLEEHRVHVKKVFARLRAHKLFAKLEKCEFERSSI-EFLGLVISP--DGMSMD 721
Query: 679 EDKQLTLGNILRTLLASKTWNLDSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLR-L 736
R + A W R + ++ FA+F + S+ I SL +
Sbjct: 722 S----------RKVSAVLDWPTPGDRKAVQRFVGFANFYRKFIKDFSKIIAPITSLTSSI 771
Query: 737 GAPHLTPINPAVLPKLEWWLNALPL---SSPIFPRQVQHFISTDASDLGWGSQVDS---- 789
H +P L+ P+ S P +P ++ DAS+ G+ +
Sbjct: 772 KKFHWSPEAQQAFIDLKKRFTTAPILRHSDPAYPFTLE----VDASEYAIGAVLSQRTDF 827
Query: 790 -------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVS 840
+F S S+ +QN+ + +E+ A+ A LL+ + ++V SD++ +
Sbjct: 828 NCQLHPVAFFSRKLSQSEQNYDVGDRELLAIKSAFQEWRHLLEGANHPILVFSDHKNL-E 886
Query: 841 YLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSR 900
YLR K L +F R + F PG+ N AD+LSR P+ R
Sbjct: 887 YLR---SAKRLRPRQARWALFFS----RFNFHVTFRPGSKNGKADALSRMFPAPE---DR 936
Query: 901 SATEQI 906
AT I
Sbjct: 937 PATGNI 942
>gi|172044504|sp|P0C691.1|DPOL_DHBV3 RecName: Full=Protein P; Includes: RecName: Full=DNA-directed DNA
polymerase; Includes: RecName: Full=RNA-directed DNA
polymerase; Includes: RecName: Full=Ribonuclease H
Length = 786
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 389 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 447
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 448 SQAFYHLPLNPASSSRLAVSDGQWVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 507
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D H
Sbjct: 508 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTEIRFLGYQIDEHF 567
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 568 --MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|307183886|gb|EFN70496.1| hypothetical protein EAG_00458 [Camponotus floridanus]
Length = 240
Score = 74.3 bits (181), Expect = 4e-10, Method: Composition-based stats.
Identities = 52/177 (29%), Positives = 83/177 (46%), Gaps = 10/177 (5%)
Query: 754 WWLNAL-----PLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKK 808
WW + P+ P++ + I +DAS GWG +S G W E HIN
Sbjct: 13 WWKKNILFARAPMIEPVYNLE----IFSDASRTGWGVFCESQRSHGYWKAEDLELHINLL 68
Query: 809 EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWR 868
E+ A L + ++++ DN T ++Y+ R ++ L + +I+ +
Sbjct: 69 ELMAAFFGLKCFASNKRHCNILLRLDNTTAIAYINRMRDSRYEGLSTLAREIWQWCEQRE 128
Query: 869 IHILAQFIPGAYNSVADSLSRS-KSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSA 924
I I A +IP N+ AD SR + ++ L SA ++I +G P IDLFASR +A
Sbjct: 129 IWITASYIPSKENAEADYESRKLQPETEFELDNSAFQKIVKVFGQPEIDLFASRANA 185
>gi|301620378|ref|XP_002939554.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 34-like [Xenopus
(Silurana) tropicalis]
Length = 4555
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 67/128 (52%), Gaps = 1/128 (0%)
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQ 832
++TDAS GWG+ + G W+ + IN E+ AV AL L + VQ
Sbjct: 2358 ILTTDASLQGWGAVMGHLTAQGTWAAAETRLPINILEIRAVRLALCHWQNRLTGCDIKVQ 2417
Query: 833 SDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
SDN T V+YL QGGT+S L EV +I ++ + + A +IPG N AD LSR +
Sbjct: 2418 SDNATTVAYLNHQGGTRSQQALKEVSRILTWAEAREVRLSAIYIPGLENWQADYLSRQRL 2477
Query: 893 LP-DWHLS 899
P +W L+
Sbjct: 2478 DPGEWALN 2485
>gi|208609060|dbj|BAG72152.1| hypothetical protein [Lotus japonicus]
Length = 1369
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 194/482 (40%), Gaps = 74/482 (15%)
Query: 427 RRFVDAWIRL--GAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEML 484
RR D I+L GA P +R Y PF K + L ++EML
Sbjct: 421 RRTTDHAIQLQEGASIPNIR---PYRYPFYQKNEIEKL-----------------VKEML 460
Query: 485 ETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK 544
+G+++ ST+ F S LV K +GG R ++ + LN+ P KF + + +
Sbjct: 461 NSGIIRH--STSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGA 518
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
+DL Y + +K A + LPFGL AP F +L N V
Sbjct: 519 AVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLR 578
Query: 605 -LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
LR V+V+ DD L+ +++ + + ++ + +L + N +K S ++
Sbjct: 579 PYLRK---FVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYL 635
Query: 664 ------LGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFV 716
G+ DP + ++ +L W + + L G+L +
Sbjct: 636 GHVISQAGVAADP----------------SKIKDML---DWPIPKEVKGLRGFLGLTGYY 676
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-S 775
+ +S+ Q LL+ + T KL+ + +P+ P P + FI
Sbjct: 677 RRFVKNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEVMTTVPVLVP--PNFDKPFILE 734
Query: 776 TDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDAS G G+ + +++S S Q + ++E+ AV A+ L S +
Sbjct: 735 TDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFV 794
Query: 831 VQSDNQTVVSYL--RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ +D Q + +L +R G + +S+ L+ D+ I ++ PG N AD+LS
Sbjct: 795 IHTD-QRSLRFLADQRIMGEEQQKWMSK-----LMGYDFEI----KYKPGIENKAADALS 844
Query: 889 RS 890
R
Sbjct: 845 RK 846
>gi|189242365|ref|XP_001809905.1| PREDICTED: similar to protease, reverse transcriptase and RNase H
[Tribolium castaneum]
Length = 1394
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 116/490 (23%), Positives = 198/490 (40%), Gaps = 84/490 (17%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQ---HLATP-----------VSSAMSLHIQEMLETG 487
L+ ++ Y FS++P L + + H TP + A+ IQEML+ G
Sbjct: 509 LIHLLQEYRCIFSSRPGLTHKYTHEIKLHDKTPFLKRPYPVPFALRPAVDATIQEMLDLG 568
Query: 488 VLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL------SPKKFSLINHFRIPSF 541
V+KR + + S + +V K +G R L+ + +N + P L+ F
Sbjct: 569 VIKR--EASPYASPMTVVKKKDGTVRICLDARMINSKMIADCESPPAADELLRRF----- 621
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
YM +IDL +Y+ +P+ +++ A YNG LPFGL TA +F+ +
Sbjct: 622 -HGIRYMSTIDLRSSYWQIPLSPESRQYTAFLYNGRSYTYQVLPFGLKTAVGSFSRAMDV 680
Query: 602 V-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
V + +R VV Y+DD L+ ++ + L +NL+KS+ V
Sbjct: 681 VLGTEVRE---FVVNYIDDLLVASETLNEHLEHLRQVFEKLKQARMTINLEKSNFIQKEV 737
Query: 661 ------LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF-- 712
L GI DP + I + KT ++ R+ LG +F
Sbjct: 738 KFLGHILTINGIKADPE------------KVSAIRNFPVPQKTKHV---RAFLGLFNFYR 782
Query: 713 ---ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQ 769
A + L+ ++ R+ R G + LE L P + IF
Sbjct: 783 KFCARYSAATQDLN--KLLRKGEKWRWGRNEQEAFDRVKDLFLEAVLLHYPDPNKIF--- 837
Query: 770 VQHFISTDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSL 819
++ TD+S G G+ Q D S F S + N+ +KE+ V L
Sbjct: 838 ---YVQTDSSGYGLGAELYQIQEDGSRGVFAFASRSLKGPELNYTTTEKELLGVIFVLHK 894
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
+Q++ +++++D+Q + R + ++ L+ + + L D+ I + + G
Sbjct: 895 FRIYIQATKIIIRTDHQALKFLSRCRLFSERLTRWT----LILGQYDYEI----ELVKGK 946
Query: 880 YNSVADSLSR 889
N VAD LSR
Sbjct: 947 DNVVADILSR 956
>gi|326672393|ref|XP_003199655.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1146
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 113/454 (24%), Positives = 193/454 (42%), Gaps = 76/454 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 221 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 278
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 279 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 335
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 336 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILIYSNSLSEHIQHVRAVLKRLIEN 393
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 394 QL-----YAKSSKCKFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 436
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 437 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 485
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 486 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPGAFYSRKL 545
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 546 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 601
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+F D+++ +IPG+ N AD+LS
Sbjct: 602 QARWALFFTRFDFQV----TYIPGSKNIKADALS 631
>gi|48696606|ref|YP_024968.1| polymerase [Ross's goose hepatitis B virus]
gi|325441|gb|AAA45748.1| polymerase [Ross's goose hepatitis B virus]
Length = 785
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 104/221 (47%), Gaps = 13/221 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
R+FLV K + T R V++ KG + PK +S N + L G IS+DL
Sbjct: 388 RIFLVDKNSRNTAEARLVVDFSQFSKGKHAMRFPKYWS-PNLSTLRRILPVGMPRISLDL 446
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 447 SQAFYHLPLNPACSSRLAISDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 506
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + S L LG +N K++ SP ++FLG + D
Sbjct: 507 TFTYMDDFLLCHPNARHLNSRSHAVCSFLQELGVRINFDKTTPSPVTEIKFLGYLID--- 563
Query: 673 DRMWLPEDKQLT-LGNILRTLLASKTWNLDSARSLLGYLSF 712
D+ ED++ L +++ + K ++ + +G+L+F
Sbjct: 564 DKFMKIEDQRWNELRQVIKKIQIGKWYDWKCIQRFIGHLNF 604
>gi|169116568|gb|ACA42587.1| polymerase [Duck hepatitis B virus]
Length = 841
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 100/227 (44%), Gaps = 21/227 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 442 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 496
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 497 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 556
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV--LQFLGI 666
+ Y+DDFLL + + R L S L LG +N K++ SP+PV ++FLG
Sbjct: 557 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPSPVNEIRFLGY 616
Query: 667 MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 617 QIDENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 661
>gi|326673520|ref|XP_003199907.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1400
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 117/483 (24%), Positives = 204/483 (42%), Gaps = 81/483 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 354 LSQTETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 411
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 412 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 468
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 469 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIKN 526
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 527 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 569
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 570 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 618
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 619 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGVGAILSQRSLVNKKLHPCAFYSRKL 678
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 679 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 734
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR-----SKSLPDWHLSRSATEQIFLK 909
+F D+++ +IPG+ N AD+LSR + +PD + +S ++
Sbjct: 735 QARWALFFTRFDFQV----TYIPGSKNIKADALSRLSDDETSEIPDEPIIKSPLIVAPIQ 790
Query: 910 WGV 912
W +
Sbjct: 791 WDI 793
>gi|326668601|ref|XP_003198832.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1259
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 114/455 (25%), Positives = 193/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 421 LSQPETEAMKSYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 478
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 479 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 535
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 536 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIQH 593
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 594 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 636
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 637 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 685
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 686 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 745
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
+ N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 746 NSAEWNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 801
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 802 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 832
>gi|327278749|ref|XP_003224123.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Anolis carolinensis]
Length = 995
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 112/488 (22%), Positives = 189/488 (38%), Gaps = 100/488 (20%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L ++ P A+ I + +E G ++ S + + +F PK +G R V++ +
Sbjct: 100 LPKPKLYAMSEPEKRALREFIDKNIERGFIE--PSQSPMAAPVFFRPKADG-LRLVVDYR 156
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
GLN + ++ L + + L + +DL +A++ + IK A + +
Sbjct: 157 GLNAISTTNQYPLPLMSEMLAQLGEARIFTKLDLREAFYRIRIKDEDCWKTAFNCHLGQY 216
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD------------P 627
LPFGL P+ F N + +G V++YLDD LL ++
Sbjct: 217 HFKVLPFGLCGGPKVFMQFINETFRDMLYKG--VIIYLDDILLYSKSLSEHIRLTREVLR 274
Query: 628 RILEIQ--GKLA-----VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
R+ E Q KL+ + L LG+ ++ + ++ PA V L W+P R
Sbjct: 275 RLKENQLYAKLSKCEFHKTELDYLGFRISTKGIAMDPAKVQDVLA--WEPPRTR------ 326
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
R L G+L FA+F + ++ LL+
Sbjct: 327 -----------------------RQLQGFLGFANFYRQFIKDFAKLSLPLTELLKTKGVG 363
Query: 741 LTPINPAVLPKLEWW---------LNALPLSSPIF---PRQVQHFISTDASDLGWGS--- 785
T KL W L S PI R + DAS+ +G+
Sbjct: 364 ETRKTKTPGAKLNWTPECQEAFEELKRRFTSQPILVHAQRDKPFVVHCDASEAAYGAILL 423
Query: 786 QVDS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM--VQSDNQ 836
Q D ++LS ++ ++NW + +KE FAV ALS L+ + V +D+
Sbjct: 424 QADDEGALRPCAYLSRKFTETERNWRVWEKEAFAVKAALSHWRHFLEGTEAQFEVWTDH- 482
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSRSK 891
K+L L +++ W R + +F PG N +AD+LSR
Sbjct: 483 ------------KNLQALRTPQRLNAKQLRWAQFFNRFNFKLKFFPGTQNRMADALSR-- 528
Query: 892 SLPDWHLS 899
+PD + S
Sbjct: 529 -MPDANYS 535
>gi|326670751|ref|XP_003199285.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1427
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 105/453 (23%), Positives = 180/453 (39%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G ++ ST+ S F V K +GG RP ++ + LNQ
Sbjct: 490 LSLPETKAMEEYIQEALHQGYIR--PSTSPAASSFFFVTKKDGGLRPCIDYRVLNQGTVK 547
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ +DL AY V I+ + +
Sbjct: 548 FRYPLP---LVPAALEQLRSAKIFTKLDLRSAYNLVRIRRGDEWKTGFVTPTGHYEYRVM 604
Query: 585 PFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
P+GL AP F N++ +LR + V+VY+DD L+ +++ + + L
Sbjct: 605 PYGLVNAPSVF---QNFIHEVLREFLHLFVIVYIDDILIYSRNEVEHRHHVEKVLQTLRK 661
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDS 702
+ +K S P +QFLG + D RM ++ ++T A +W +
Sbjct: 662 HQLYLKAEKCSFH-LPSVQFLGYVIDKRGVRM---DEGKVT---------AVVSWPEPTT 708
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP----------AVLPKL 752
+ L +L FA+F RR SL+ +L P A L
Sbjct: 709 VKELQRFLGFANFY--------RRFIHNYSLVTAPLTNLLKNKPKKLSWPSEAAAAFRNL 760
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
+ PL + P + + DAS G G+ + ++ S S ++
Sbjct: 761 KEAFTRAPLLTHPDP-DLPFIVEVDASTTGVGAILSQFHGTPKLLHPCAYFSRKLSPAEK 819
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ I +E+ A+ AL L+ + + Q + + K+L L + +++
Sbjct: 820 NYDIGNRELLAIKLALEEWRHWLEGA----KHPFQVITDH-------KNLQYLKDAKRLC 868
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R H + PG+ N AD+LSR
Sbjct: 869 PRQARWSLFFSRFHFSITYRPGSKNIQADALSR 901
>gi|12958095|gb|AAK07485.1| gag-pol polyprotein [Clonorchis sinensis]
Length = 1304
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 98/428 (22%), Positives = 175/428 (40%), Gaps = 47/428 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRI 538
+ MLE G+++ S++ + S L +VPK + G RP + + LN P ++ + +
Sbjct: 464 FEHMLELGIIR--TSSSHWSSPLHMVPKKSKGDWRPCGDYRSLNNATIPDRYPIPHIHDF 521
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L + +DL +AY+H+P+ A++ + T +PFGL A Q F
Sbjct: 522 ASTLCHTNIFSKLDLVRAYYHIPVAPDDIPKTAITTPFGLFEFTRMPFGLRNAAQTFQRF 581
Query: 599 SNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V RG+ V YLDD L+ + P + L + +N+ K L
Sbjct: 582 MDEVL-----RGLPFVYAYLDDVLIASTSPTEHAAHLRAVFERLSTYSIRLNIDK-CLFG 635
Query: 658 APVLQFLGIMWDPHLDRMWLP--EDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
L FLG H+D + D+ L L + L R +G +++
Sbjct: 636 VTSLDFLG----HHIDSTGISPLPDRILALESF------PIPTTLTQLRRFIGIINYYRR 685
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAV-LPKLEWWLNALPLSSPI----FPRQV 770
IP H I + + L LG + P+V + E A+ ++ +
Sbjct: 686 FIP----HCADILQPLTDL-LGCKEKSVTLPSVAIAAFERAKQAIAHATKLSFLDTHEST 740
Query: 771 QHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
+ ++TDAS+ G+ + +F S Q + +E+ A++ A+
Sbjct: 741 KLILTTDASNAAVGAVLHQVVNNASQPLAFFSQKMQAAQTRYSTFGRELLAIYLAIRHFR 800
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYN 881
LL+ +Q+D++ + + S + ++ I + D R + PG+ N
Sbjct: 801 HLLEGRSFTIQTDHKPLTHAFNAKPDRYSPREIRHLDYISQFTTDIR------YTPGSDN 854
Query: 882 SVADSLSR 889
VAD+LSR
Sbjct: 855 VVADALSR 862
>gi|397344|emb|CAA52700.1| polymerase [Duck hepatitis B virus]
Length = 836
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 98/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 439 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 493
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 494 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISPR 553
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 554 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 613
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+FA
Sbjct: 614 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFA 656
>gi|326663872|ref|XP_003197684.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1110
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 111/457 (24%), Positives = 190/457 (41%), Gaps = 80/457 (17%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 205 LSQPETEAMKSYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 262
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL Y + I+ + S +
Sbjct: 263 YRYPLP---LVPAALEQLRSAQYFTKLDLRSPYNLIRIRQGDEWKTGFSTIDGHYEYLVM 319
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 320 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIQN 377
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 378 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 420
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 421 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 469
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 470 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVTKKLHSCAFYSRKL 529
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSS----VVMVQSDNQTVVSYLRRQGGTKSLS 852
++N+ + +E+ A+ AL L+ + +V+ N + + +R ++
Sbjct: 530 NSTERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKNLEYIRFCKRLNPRQARW 589
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L F D+++ +IPG+ N AD+LSR
Sbjct: 590 AL------FFTRFDFQV----TYIPGSKNIKADALSR 616
>gi|391335687|ref|XP_003742221.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 540
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 68/261 (26%), Positives = 115/261 (44%), Gaps = 36/261 (13%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSLIN 534
++ I+ + + GV++R++ T+ + + + +V K NG R + GLN+ + + L
Sbjct: 269 VTKEIERLGKAGVIERVE-TSLYAAPVVVVRKSNGSIRLCADYSIGLNEIIEDDNYPLPT 327
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I S L G + IDLS+AY VP++ Q L ++ + M LPFG+ TAP
Sbjct: 328 AEDIFSGLSNGRFFSKIDLSEAYLQVPVEAGSQSILTINTPKGLFKMKRLPFGIKTAPSI 387
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLL-----VNQDPRILEIQGKLAVSILGSLGWIVN 649
F L + + S L V YLDD ++ V + R++++ + L +
Sbjct: 388 FQRLMDSLVSDLPG----TVAYLDDIMVSSGTKVEHEQRVIQLFKR-----LNDFNLTIR 438
Query: 650 LQKSSLSPAPVLQFLGIMWD-----PHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
L K S + +FLG + + P DR I L + N+ R
Sbjct: 439 LDKCSFLKDEI-RFLGFILNSKGRKPDPDR-------------IQPILDMKRPENVAQTR 484
Query: 705 SLLGYLSF-ASFVIPMGRLHS 724
+ LG L+F +F+ M L
Sbjct: 485 AFLGMLTFYNNFIADMATLRE 505
>gi|326670757|ref|XP_003199287.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1404
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 104/453 (22%), Positives = 180/453 (39%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G ++ ST+ S F V K +GG RP ++ + LNQ
Sbjct: 438 LSLPETKAMEEYIQEALHQGYIR--PSTSPAASSFFFVTKKDGGLRPCIDYRILNQGTVK 495
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ +DL Y V I+ + A +
Sbjct: 496 FRYPLP---LVPAALEQLRSAKIFTKLDLRSTYNLVRIRRGDEWKTAFVTPTGHYEYRVM 552
Query: 585 PFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
P+GL AP F N++ +LR + ++VY+DD L+ +++ + + L
Sbjct: 553 PYGLVNAPSVF---QNFIHEVLREFLHLFIIVYIDDILIYSRNEVEHRHHVEKVLQTLRK 609
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDS 702
+ +K S P +QFLG + D RM ++ ++T A +W +
Sbjct: 610 HQLYLKTEKCSFH-LPSVQFLGYVIDKRGVRM---DEGKVT---------AVVSWPEPTT 656
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP----------AVLPKL 752
+ L +L FA+F RR SL+ +L P A L
Sbjct: 657 VKELQRFLGFANFY--------RRFIHNYSLVTAPLTNLLKNKPKKLSWPSEAAAAFRNL 708
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
+ PL + P + + DAS G G+ + ++ S S ++
Sbjct: 709 KEAFTRAPLLTHPDP-DLPFIVEVDASTTGVGAILSQFHGTPKLLHPCAYFSRKLSPAEK 767
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ I +E+ A+ AL L+ + + Q + + K+L L + +++
Sbjct: 768 NYDIGNRELLAIKLALEEWRHWLEGA----KHPFQVITDH-------KNLQYLKDAKRLC 816
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R H + PG+ N AD+LSR
Sbjct: 817 PRQARWSLFFSRFHFSITYRPGSKNIRADALSR 849
>gi|326670755|ref|XP_003199286.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1375
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 104/453 (22%), Positives = 180/453 (39%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +IQE L G ++ ST+ S F V K +GG RP ++ + LNQ
Sbjct: 438 LSLPETKAMEEYIQEALHQGYIR--PSTSPAASSFFFVTKKDGGLRPCIDYRILNQGTVK 495
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ +DL Y V I+ + A +
Sbjct: 496 FRYPLP---LVPAALEQLRSAKIFTKLDLRSTYNLVRIRRGDEWKTAFVTPTGHYEYRVM 552
Query: 585 PFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
P+GL AP F N++ +LR + ++VY+DD L+ +++ + + L
Sbjct: 553 PYGLVNAPSVF---QNFIHEVLREFLHLFIIVYIDDILIYSRNEVEHRHHVEKVLQTLRK 609
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDS 702
+ +K S P +QFLG + D RM ++ ++T A +W +
Sbjct: 610 HQLYLKTEKCSFH-LPSVQFLGYVIDKRGVRM---DEGKVT---------AVVSWPEPTT 656
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP----------AVLPKL 752
+ L +L FA+F RR SL+ +L P A L
Sbjct: 657 VKELQRFLGFANFY--------RRFIHNYSLVTAPLTNLLKNKPKKLSWPSEAAAAFRNL 708
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQ 801
+ PL + P + + DAS G G+ + ++ S S ++
Sbjct: 709 KEAFTRAPLLTHPDP-DLPFIVEVDASTTGVGAILSQFHGTPKLLHPCAYFSRKLSPAEK 767
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
N+ I +E+ A+ AL L+ + + Q + + K+L L + +++
Sbjct: 768 NYDIGNRELLAIKLALEEWRHWLEGA----KHPFQVITDH-------KNLQYLKDAKRLC 816
Query: 862 LLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
W R H + PG+ N AD+LSR
Sbjct: 817 PRQARWSLFFSRFHFSITYRPGSKNIRADALSR 849
>gi|208609051|dbj|BAG72148.1| hypothetical protein [Lotus japonicus]
gi|208609062|dbj|BAG72153.1| hypothetical protein [Lotus japonicus]
Length = 1558
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 194/482 (40%), Gaps = 74/482 (15%)
Query: 427 RRFVDAWIRL--GAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEML 484
RR D I+L GA P +R Y PF K + L ++EML
Sbjct: 610 RRTTDHAIQLQEGASIPNIR---PYRYPFYQKNEIEKL-----------------VKEML 649
Query: 485 ETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK 544
+G+++ ST+ F S LV K +GG R ++ + LN+ P KF + + +
Sbjct: 650 NSGIIRH--STSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGA 707
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
+DL Y + +K A + LPFGL AP F +L N V
Sbjct: 708 AVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLR 767
Query: 605 -LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
LR V+V+ DD L+ +++ + + ++ + +L + N +K S ++
Sbjct: 768 PYLRK---FVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYL 824
Query: 664 ------LGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFV 716
G+ DP + ++ +L W + + L G+L +
Sbjct: 825 GHVISQAGVAADP----------------SKIKDML---DWPIPKEVKGLRGFLGLTGYY 865
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-S 775
+ +S+ Q LL+ + T KL+ + +P+ P P + FI
Sbjct: 866 RRFVKNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEVMTTVPVLVP--PNFDKPFILE 923
Query: 776 TDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDAS G G+ + +++S S Q + ++E+ AV A+ L S +
Sbjct: 924 TDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFV 983
Query: 831 VQSDNQTVVSYL--RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ +D Q + +L +R G + +S+ L+ D+ I ++ PG N AD+LS
Sbjct: 984 IHTD-QRSLRFLADQRIMGEEQQKWMSK-----LMGYDFEI----KYKPGIENKAADALS 1033
Query: 889 RS 890
R
Sbjct: 1034 RK 1035
>gi|406693857|gb|EKC97200.1| retrotransposon nucleocapsid protein [Trichosporon asahii var. asahii
CBS 8904]
Length = 1790
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 115/485 (23%), Positives = 189/485 (38%), Gaps = 51/485 (10%)
Query: 429 FVDAWIRLGAPAPLVRIVSGYAIPF--SAKPPLVPLCSLQHLATPVSSAMSLHIQEMLET 486
FVD +++ A + +AI F P PL + L + A+ ++ +ML
Sbjct: 803 FVDVFLKSSAESLPAFSKFDHAIDFIPGRSPKFGPLYATSPLK---ARAIKAYLDDMLAK 859
Query: 487 GVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGD 546
G+++ DS T S + VPK NG +R ++ + N ++ + + L
Sbjct: 860 GLIRVSDSPTS--SPVLFVPKKNGESRFCVDYRATNAITVKNRYPIPLIQDLLDRLSSAK 917
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASL 605
IDL AY V I+ + A + +PFGL AP F L N V L
Sbjct: 918 VFTKIDLRGAYHLVRIRAGDEWKTAFRTQFGLYEYLVMPFGLCNAPATFQRLVNHVFHDL 977
Query: 606 LRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG 665
L S VVVYLDD L+ ++D E+ + + L +K S FLG
Sbjct: 978 LES---CVVVYLDDILIFSEDNASHELHVREVLQRLRDNALFAKAEKCEFSTTST-SFLG 1033
Query: 666 IMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW--NLDSARSLLGYLSFASFVIPMGRLH 723
++ K +T+ +AS + N + LG +F IP L
Sbjct: 1034 ----------YVISSKGVTMDPSKTNTIASWPYPRNAKDVQRFLGLANFYRHFIP---LF 1080
Query: 724 SRRIQRQASLLRLGAPH-LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLG 782
+ +LL+ G LTP + L+ + + + P Q Q + TDASD
Sbjct: 1081 AETCVPLYALLKKGTRFALTPDVKSAWDDLKKKIAGDAVLAHFDP-QSQCVVETDASDYA 1139
Query: 783 WGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQ- 832
G+ + +F S S + N+ + KE A+ A L+ + V VQ
Sbjct: 1140 VGAVLSQEWEGFLRPLAFASRKMSPAELNYPTHDKEFLAIVYAFEQWRHYLECASVDVQV 1199
Query: 833 -SDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
+D++++ + R + + + ++ F H + PG D+LSR
Sbjct: 1200 YTDHRSLEYFARDKMLNRRQARWADFMTDF--------HFTITYRPGRLAIKPDALSRR- 1250
Query: 892 SLPDW 896
PD+
Sbjct: 1251 --PDY 1253
>gi|156048462|ref|XP_001590198.1| hypothetical protein SS1G_08962 [Sclerotinia sclerotiorum 1980]
gi|154693359|gb|EDN93097.1| hypothetical protein SS1G_08962 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 1194
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 103/450 (22%), Positives = 178/450 (39%), Gaps = 41/450 (9%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
+P + L L T A+ I E L G + S F + + V K NG R
Sbjct: 511 TQPNNLTLSPLYRQTTQELQALKKFIDENLNRGWIA--PSNASFAAPILFVKKANGDLRL 568
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ + LN+ + + L I S + K +DL A+ + + + A
Sbjct: 569 CVDYRKLNEISAKDGYPLPRIDEILSQMSKAKIFTKLDLRAAFNAIRMHPDSEELTAFQT 628
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
LPFGL+ P + N + L+ + G YLDD ++ + DP Q
Sbjct: 629 CFGQFKSLVLPFGLSGGPGTYQRFINNL--LMENLGNFCTAYLDDIIIYSTDPSEHTAQV 686
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA 694
+ ++ L G V+++K S + + ++LG ++ L D + + +IL
Sbjct: 687 RWVLTKLKEAGLSVDIKKCDFSVSRI-KYLGF----YVSTKGLEVDPE-KIKDIL----- 735
Query: 695 SKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKL 752
TW + + + G+L F F + + R + L + G P L
Sbjct: 736 --TWKRPTTVKGVRGFLGFCGFYRKFIKNYGRIARPLDKLTQKGRTFDWDPDCQKAFETL 793
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWG---SQVDS--------SFLSGLWSREQQ 801
+ P+ P ++ + TD+SD G SQ+D +F S + +
Sbjct: 794 RQAVTEAPVLHYFHPDRLTK-VETDSSDGVTGGILSQLDPATKEWHPLAFFSKTMNPAEC 852
Query: 802 NWHINKKEMFAVHQALSLNLPLLQS--SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
N+ I+ KEM A+ QA LQS + V V SD+Q++ ++R + T + +E
Sbjct: 853 NYEIHDKEMLAILQAFQQWRVELQSVENPVQVYSDHQSLEIFMRTKKLTARQARWAEYLS 912
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
F ++R G N AD+L+R
Sbjct: 913 QFNFQLEYRT--------GKANGQADALTR 934
>gi|388856200|emb|CCF50191.1| uncharacterized protein [Ustilago hordei]
Length = 1324
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 159/371 (42%), Gaps = 35/371 (9%)
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK--GDYMISIDLSQAYFHVPIKTTHQ 567
G P +N F + + +L + I SF++ G + DL+ A+ HV
Sbjct: 621 AGQLPAVNDGIATHFTAIRYATLAS---ILSFVRDNPGCRLWKSDLTNAFRHVVTALDDA 677
Query: 568 RFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNWVASLLRSRGMRVVVYLDDFLLVN 624
R L L+++G T L FG +AP FA +WV L S ++ YLDDF
Sbjct: 678 RLLGLTFDGLFYMETGLTFGGRSAPWLFNLFAEALHWVVQLTTSHPVKH--YLDDFFGAT 735
Query: 625 QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLT 684
++ + G + K+S + L+ LGI D + + +++
Sbjct: 736 PSTATADLPLHALALACHAFGLQLAPSKTSWNQT-RLEILGIEVDTIRQTVGITVERRQR 794
Query: 685 LGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPI 744
+ + + LL+ ++ +L + + G L F V+P G+ RR+ + L I
Sbjct: 795 ILDAIDHLLSRRSAHLLDWQRIAGLLQFVMQVVPHGKAFLRRLYDASKAAHRHPLTLRRI 854
Query: 745 NPAVLPKLEWWLNALPLSSP------IFPRQVQHFISTDASDLGWGSQ---VDSSFLSGL 795
+ + +L WW + L L+ P + P V+H I TDAS G+G+ +DS S +
Sbjct: 855 SRPAVAELRWWRSTL-LAWPGHSLLQLSPLVVEH-IWTDASKRGYGAHWGLMDSP--SAV 910
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSS-------VVMVQSDNQTVVSYLR--RQG 846
W +E WH K F H+AL++ L S +V++ DN V LR R
Sbjct: 911 WCKEVSKWHRQKDIRF--HEALAVLDTLRVFSAHWDGPRMVVLHVDNTNVEHGLRSGRSR 968
Query: 847 GTKSLSLLSEV 857
+ +LL E+
Sbjct: 969 DPLTQTLLCEI 979
>gi|325428|gb|AAA62819.1| polymerase [Duck hepatitis B virus]
Length = 786
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 389 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 443
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 444 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 503
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 504 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 563
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 564 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|28629158|gb|AAO49486.1|AF505512_1 polymerase [Duck hepatitis B virus]
Length = 786
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 389 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 443
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 444 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 503
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 504 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 563
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 564 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|326676574|ref|XP_003200615.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1192
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 112/453 (24%), Positives = 192/453 (42%), Gaps = 72/453 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E L G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 429 LSQPETEAMKKYISEELGKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 486
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 487 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 543
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 544 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILIYSNSLSEHIQHVRAVLKRLIEN 601
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
L K +L FLG + P M D+Q + + W +
Sbjct: 602 QL--YAKSSKCEFHQTCIL-FLGYIISPEGVAM----DQQ--------KVDSVTQWPQPE 646
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL 761
+ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 647 TIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTTMVKANNARLKWNPDAVRA 695
Query: 762 ---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE--- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 696 FTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKLNS 755
Query: 800 -QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 756 AERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPRQA 811
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 812 RWALFFTRFDFQV----TYIPGSKNIKADALSR 840
>gi|208609065|dbj|BAG72154.1| hypothetical protein [Lotus japonicus]
Length = 1558
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 194/482 (40%), Gaps = 74/482 (15%)
Query: 427 RRFVDAWIRL--GAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEML 484
RR D I+L GA P +R Y PF K + L ++EML
Sbjct: 610 RRTTDHAIQLQEGASIPNIR---PYRYPFYQKNEIEKL-----------------VKEML 649
Query: 485 ETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK 544
+G+++ ST+ F S LV K +GG R ++ + LN+ P KF + + +
Sbjct: 650 NSGIIRH--STSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGA 707
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
+DL Y + +K A + LPFGL AP F +L N V
Sbjct: 708 AVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLR 767
Query: 605 -LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
LR V+V+ DD L+ +++ + + ++ + +L + N +K S ++
Sbjct: 768 PYLRK---FVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYL 824
Query: 664 ------LGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFV 716
G+ DP + ++ +L W + + L G+L +
Sbjct: 825 GHVISQAGVAADP----------------SKIKDML---DWPIPKEVKGLRGFLGLTGYY 865
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-S 775
+ +S+ Q LL+ + T KL+ + +P+ P P + FI
Sbjct: 866 RRFVKNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEVMTTVPVLVP--PNFDKPFILE 923
Query: 776 TDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDAS G G+ + +++S S Q + ++E+ AV A+ L S +
Sbjct: 924 TDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFV 983
Query: 831 VQSDNQTVVSYL--RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ +D Q + +L +R G + +S+ L+ D+ I ++ PG N AD+LS
Sbjct: 984 IHTD-QRSLRFLADQRIMGEEQQKWMSK-----LMGYDFEI----KYKPGIENKAADALS 1033
Query: 889 RS 890
R
Sbjct: 1034 RK 1035
>gi|4530348|gb|AAD21995.1| DNA-directed DNA polymerase [Snow goose hepatitis B virus]
Length = 787
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 102/221 (46%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++L KG N P+ +S N + L G IS+DL
Sbjct: 390 KLFLVDKYSRNTTEARLVVDLSQFSKGKNAMRFPRYWS-PNLTTLRRILPVGMPRISLDL 448
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 449 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 508
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D H
Sbjct: 509 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVNEIRFLGYQIDHHY 568
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 569 --MKIEESRWKELRTVIKKIKPGEWYDWKCIQRFVGHLNFV 607
>gi|31506005|gb|AAP47822.1| polymerase [Duck hepatitis B virus]
Length = 786
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 389 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 443
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 444 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 503
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 504 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 563
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 564 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|311036249|gb|ADP55738.1| polymerase [Duck hepatitis B virus]
Length = 786
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 389 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 443
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 444 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 503
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 504 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 563
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 564 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|20136727|gb|AAM11780.1|AF493986_1 polymerase [Duck hepatitis B virus]
Length = 786
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 389 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 443
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 444 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 503
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 504 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 563
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 564 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|4530343|gb|AAD21990.1| DNA-directed DNA polymerase [Snow goose hepatitis B virus]
Length = 787
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 390 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLTTLRRILPVGMPRISLDL 448
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 449 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 508
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D H
Sbjct: 509 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVNEIRFLGYQIDHHY 568
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 569 --MKIEESRWKELRTVIKKIKPGEWYDWKCIQRFVGHLNFV 607
>gi|169116552|gb|ACA42573.1| polymerase [Duck hepatitis B virus]
Length = 841
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 99/227 (43%), Gaps = 21/227 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 442 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 496
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 497 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 556
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV--LQFLGI 666
+ Y+DDFLL + + R L S L LG +N K++ SP+PV ++FLG
Sbjct: 557 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPSPVNEIRFLGY 616
Query: 667 MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D M + E + L +++ + + ++ + +G+L+F
Sbjct: 617 QIDETF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 661
>gi|406698503|gb|EKD01739.1| retrotransposon nucleocapsid protein [Trichosporon asahii var. asahii
CBS 8904]
Length = 1789
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 115/485 (23%), Positives = 191/485 (39%), Gaps = 53/485 (10%)
Query: 429 FVDAWIRLGAPAPLVRIVSGYAIPF--SAKPPLVPLCSLQHLATPVSSAMSLHIQEMLET 486
FVD +++ A + +AI F P PL + L + A+ ++ +ML
Sbjct: 804 FVDVFLKSSAESLPAFSKFDHAIDFIPGRSPKFGPLYATSPLK---ARAIKAYLDDMLAK 860
Query: 487 GVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGD 546
G+++ DS T S + VPK NG +R ++ + N ++ + + L
Sbjct: 861 GLIRVSDSPTS--SPVLFVPKKNGESRFCVDYRATNAITVKNRYPIPLIQDLLDRLSSAK 918
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASL 605
IDL AY V I+ + ++ + +PFGL AP F L N V L
Sbjct: 919 VFTKIDLRGAYHLVRIRAGDE--WKTAFRTHLYEYLVMPFGLCNAPATFQRLVNHVFHDL 976
Query: 606 LRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG 665
L S VVVYLDD L+ ++D E+ K + L +K S P + FLG
Sbjct: 977 LESC---VVVYLDDILIFSEDKASHELHVKEVLQRLRDNALFAKAEKCEFS-TPSMSFLG 1032
Query: 666 IMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW--NLDSARSLLGYLSFASFVIPMGRLH 723
++ K +T+ + +AS + N + LG +F IP L
Sbjct: 1033 ----------YVISSKGVTMDPSKTSTIASWPYPRNAKDVQRFLGLANFYRHFIP---LF 1079
Query: 724 SRRIQRQASLLRLGAPH-LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLG 782
+ +LL+ G LT L+ + + + P Q Q + TDASD
Sbjct: 1080 AETCVPLYALLKKGTRFALTSEVKHAWDDLKKKIAGDAVLAHFDP-QSQCVVETDASDYA 1138
Query: 783 WGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQ- 832
G+ + +F S S + N+ + KE A+ A L+ + V VQ
Sbjct: 1139 VGAVLSQEWEGYLRPLAFASRKMSPAELNYPTHDKEFLAIVYAFEQWRHYLECASVDVQV 1198
Query: 833 -SDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
+D++++ + R + + + ++ F H + PG D+LSR
Sbjct: 1199 YTDHRSLEYFARDKMLNRRQARWADFMTDF--------HFTITYRPGRLAIKPDALSRR- 1249
Query: 892 SLPDW 896
PD+
Sbjct: 1250 --PDY 1252
>gi|208609055|dbj|BAG72150.1| hypothetical protein [Lotus japonicus]
Length = 1558
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 194/482 (40%), Gaps = 74/482 (15%)
Query: 427 RRFVDAWIRL--GAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEML 484
RR D I+L GA P +R Y PF K + L ++EML
Sbjct: 610 RRTTDHAIQLQEGASIPNIR---PYRYPFYQKNEIEKL-----------------VKEML 649
Query: 485 ETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK 544
+G+++ ST+ F S LV K +GG R ++ + LN+ P KF + + +
Sbjct: 650 NSGIIRH--STSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGA 707
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
+DL Y + +K A + LPFGL AP F +L N V
Sbjct: 708 AVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLR 767
Query: 605 -LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
LR V+V+ DD L+ +++ + + ++ + +L + N +K S ++
Sbjct: 768 PYLRK---FVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYL 824
Query: 664 ------LGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFV 716
G+ DP + ++ +L W + + L G+L +
Sbjct: 825 GHVISQAGVAADP----------------SKIKDML---DWPIPKEVKGLRGFLGLTGYY 865
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-S 775
+ +S+ Q LL+ + T KL+ + +P+ P P + FI
Sbjct: 866 RRFVKNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEVMTTVPVLVP--PNFDKPFILE 923
Query: 776 TDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDAS G G+ + +++S S Q + ++E+ AV A+ L S +
Sbjct: 924 TDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFV 983
Query: 831 VQSDNQTVVSYL--RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ +D Q + +L +R G + +S+ L+ D+ I ++ PG N AD+LS
Sbjct: 984 IHTD-QRSLRFLADQRIMGEEQQKWMSK-----LMGYDFEI----KYKPGIENKAADALS 1033
Query: 889 RS 890
R
Sbjct: 1034 RK 1035
>gi|208609057|dbj|BAG72151.1| hypothetical protein [Lotus japonicus]
Length = 1558
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 194/482 (40%), Gaps = 74/482 (15%)
Query: 427 RRFVDAWIRL--GAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEML 484
RR D I+L GA P +R Y PF K + L ++EML
Sbjct: 610 RRTTDHAIQLQEGASIPNIR---PYRYPFYQKNEIEKL-----------------VKEML 649
Query: 485 ETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK 544
+G+++ ST+ F S LV K +GG R ++ + LN+ P KF + + +
Sbjct: 650 NSGIIRH--STSPFSSPAILVKKKDGGWRFCVDYRALNKATIPDKFPIPIIDELLDEIGA 707
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
+DL Y + +K A + LPFGL AP F +L N V
Sbjct: 708 AVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLR 767
Query: 605 -LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
LR V+V+ DD L+ +++ + + ++ + +L + N +K S ++
Sbjct: 768 PYLRK---FVLVFFDDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYL 824
Query: 664 ------LGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFV 716
G+ DP + ++ +L W + + L G+L +
Sbjct: 825 GHVISQAGVAADP----------------SKIKDML---DWPIPKEVKGLRGFLGLTGYY 865
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-S 775
+ +S+ Q LL+ + T KL+ + +P+ P P + FI
Sbjct: 866 RRFVKNYSKLAQPLNQLLKKNSFQWTEGATQAFVKLKEVMTTVPVLVP--PNFDKPFILE 923
Query: 776 TDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDAS G G+ + +++S S Q + ++E+ AV A+ L S +
Sbjct: 924 TDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSKFV 983
Query: 831 VQSDNQTVVSYL--RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ +D Q + +L +R G + +S+ L+ D+ I ++ PG N AD+LS
Sbjct: 984 IHTD-QRSLRFLADQRIMGEEQQKWMSK-----LMGYDFEI----KYKPGIENKAADALS 1033
Query: 889 RS 890
R
Sbjct: 1034 RK 1035
>gi|4530338|gb|AAD21985.1| DNA-directed DNA polymerase [Snow goose hepatitis B virus]
Length = 787
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 390 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLTTLRRILPVGMPRISLDL 448
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 449 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 508
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D H
Sbjct: 509 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVNEIRFLGYQIDHHY 568
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 569 --MKIEESRWKELRTVIKKIKPGEWYDWKCIQRFVGHLNFV 607
>gi|326678732|ref|XP_003201155.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1347
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 119/456 (26%), Positives = 182/456 (39%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 527 PKGKLYSLSAPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 584
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 585 LNNITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRAGDEWKSAFNTPRGHFE 644
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F + N +LR + VYLDD L+ + + + +
Sbjct: 645 YCVLPFGLSNAPAVFQAFVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHVRRVLQ 701
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A + FLG + RM PE + A W
Sbjct: 702 RLLENGLYVKAEKCVFH-AQSVPFLGHIVSVEGLRM-DPEK-----------IKAVVNWP 748
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 749 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTALTSSKTP-FRWSSAA 796
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG---SQVDSS--------FLSGL 795
L +S+PI P + F + DAS++G G SQ SS + S
Sbjct: 797 EAAFSKLKGCFVSAPILITPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYYSHR 856
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S + N+ I +E+ AV AL L+ S V +V +D++ + Y++ K L+
Sbjct: 857 LSAAESNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIK---SAKRLNS 912
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 913 RQARWALFF----GRFNFTISYRPGSKNIKPDALSR 944
>gi|301615219|ref|XP_002937077.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 917
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 113/452 (25%), Positives = 188/452 (41%), Gaps = 68/452 (15%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P S AM +I+E LE G ++ S G + F V K +GG RP ++ +GLN
Sbjct: 176 LSLPESHAMEEYIKENLERGFIRPSSSPAG--AGFFFVEKKDGGLRPCIDYRGLN----- 228
Query: 528 KKFSLINHFRIPSFLQ-----KGDYMIS-IDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P + KG + S +DL AY + I+ + A + +
Sbjct: 229 -KITVKNRYPLPLISELFDRVKGATIFSKLDLRGAYNLIRIREGDEWKTAFNTHDGHYEY 287
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD-----PRILEIQGKL 636
+PFGL P F L N + L R VVVYLD L+ + + + E+ +L
Sbjct: 288 LVMPFGLCNTPAVFQELVNDIFRNLLGRC--VVVYLDAILIYSNNLSDHRAHVQEVLLRL 345
Query: 637 AVS-ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
+ + + + S P ++ G+ DP + L + L+L I R
Sbjct: 346 RQNQLYAKIEKCIFEVPSVYFPGYIISHKGLEMDPVKVQAILTWVQPLSLRAIQR----- 400
Query: 696 KTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKLEW 754
+L FA++ + S + +L + GA P + KL
Sbjct: 401 -------------FLGFANYYRQFIKNFSTLMAPITALTKKGADPSMWSEEALTAFKL-- 445
Query: 755 WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGLWSREQ 800
L +S+P+ P F + DAS+LG G+ + SF S +S +
Sbjct: 446 -LKEAFISAPVLLHPDSALPFLVEVDASELGAGAVLSQCHPITNKVHPCSFFSKKFSPTE 504
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
N+ I +E+ A+ A LL+ + VV V +D++ ++ Y+ K L+
Sbjct: 505 MNYDIGNRELLAIKLAFEEWRHLLEGAKHVVTVFTDHKNLL-YIE---SAKRLNPRQARW 560
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+F ++ I + PG N AD+LSRS
Sbjct: 561 ALFFSRFNFNI----TYRPGEKNVKADALSRS 588
>gi|4530353|gb|AAD22000.1| DNA-directed DNA polymerase [Snow goose hepatitis B virus]
Length = 787
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 390 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLTTLRRILPVGMPRISLDL 448
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 449 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 508
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D H
Sbjct: 509 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVNEIRFLGYQIDHHY 568
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 569 --MKIEESRWKELRTVIKKIKPGEWYDWKCIQRFVGHLNFV 607
>gi|33088067|gb|AAP82857.1| polymerase [Duck hepatitis B virus]
gi|33088071|gb|AAP82860.1| polymerase [Duck hepatitis B virus]
Length = 786
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 389 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 447
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 448 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 507
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D +
Sbjct: 508 TFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQIDENF 567
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 568 --MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|49247994|ref|YP_031695.1| DNA-directed DNA polymerase [Snow goose hepatitis B virus]
gi|4530333|gb|AAD21980.1| DNA-directed DNA polymerase [Snow goose hepatitis B virus]
Length = 787
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 390 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLTTLRRILPVGMPRISLDL 448
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 449 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 508
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D H
Sbjct: 509 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVNEIRFLGYQIDHHY 568
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 569 --MRIEESRWKELRTVIKKIKPGEWYDWKCIQRFVGHLNFV 607
>gi|33088059|gb|AAP82851.1| polymerase [Duck hepatitis B virus]
gi|33088063|gb|AAP82854.1| polymerase [Duck hepatitis B virus]
Length = 786
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 389 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 447
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 448 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 507
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D +
Sbjct: 508 TFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQIDENF 567
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 568 --MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 606
>gi|31415506|gb|AAP44980.1| DNA polymerase [Duck hepatitis B virus]
Length = 836
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 439 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 493
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 494 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 553
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 554 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 613
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 614 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 656
>gi|339254664|ref|XP_003372555.1| putative integrase core domain protein [Trichinella spiralis]
gi|316966995|gb|EFV51499.1| putative integrase core domain protein [Trichinella spiralis]
Length = 1271
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 101/428 (23%), Positives = 178/428 (41%), Gaps = 44/428 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFR 537
H ++L G+++ S + + S L +VPK RP + + LN+ +P ++ L +
Sbjct: 447 HFNDLLRRGIIR--PSNSCWASPLHMVPKQQTAQWRPCGDYRALNRCTTPDRYPLPHLAD 504
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
L +DL+ AY+H+P++ A++ + +PFGL A Q+F
Sbjct: 505 FAHNLHGKHIFSKLDLAHAYYHIPMRPQDIAKTAITTPFGLFEFLKMPFGLRNAAQSFQR 564
Query: 598 LSNWVASLLRSRGMR-VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
+ V +RG+ VY+DD LL + + + K + L + G VN K L+
Sbjct: 565 FIDTV-----TRGIEDCFVYVDDILLASASEKEHFVLLKKVLQRLKAHGIQVNKDKCILA 619
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
P L FLG D + R LP+ Q ++ A KT R LG ++F
Sbjct: 620 -VPSLPFLGHTVDANGIRP-LPDKVQ-----AVKAFPAPKTGR--ELRRFLGMVNFYRRF 670
Query: 717 IP-----MGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ 771
+P + L + ++ + L L N A + NA L P P + +
Sbjct: 671 LPHIATTLAPLDAIASAAASTKITLTNDQLQAFNAAK----DALANATMLHHP-HPTE-E 724
Query: 772 HFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLP 822
+ + DASD G+ + +F S + Q+ + +E+ A + A
Sbjct: 725 YALMVDASDHAIGAVLQQPAENSWRPLAFFSKRLTATQKRYSAFGRELLAAYLAAKHFRH 784
Query: 823 LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNS 882
+++ +++ +D++ + R + ++ I L+ D R I G N
Sbjct: 785 VVEGRRLVIYTDHKPLAHAFLRPSNNLNDRETRHLDLITSLADDVR------HIGGDSNV 838
Query: 883 VADSLSRS 890
VAD+LSRS
Sbjct: 839 VADALSRS 846
>gi|198424591|ref|XP_002120847.1| PREDICTED: similar to gag-pol polyprotein [Ciona intestinalis]
Length = 1302
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 114/490 (23%), Positives = 195/490 (39%), Gaps = 89/490 (18%)
Query: 482 EMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF 541
++ E G++ + S + + S L +V K + R V + + LNQ + L +
Sbjct: 436 KLQELGIV--VPSDSPWCSPLHMVKKSDNSYRCVGDYRRLNQMTVSDSYPLPFLRDFANI 493
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
L +DL +AY +P+ + ++ A + FGL A Q F S +
Sbjct: 494 LHGRTIFSKLDLERAYHQIPMDKSSVAKTTITTPFGAFAYKRMSFGLCNAAQTF---SRF 550
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++ ++R ++DD L+ + P E KL L G ++N +KS L A L
Sbjct: 551 ISQVVRGLEEFCFAFVDDLLVASYSPEEHEKHLKLVFQRLSEYGLLINTKKSILG-ASEL 609
Query: 662 QFLGIMWDPHL--DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
QFLG H+ D M D+ + +R KT + R LG ++F IP
Sbjct: 610 QFLG----HHITKDGMTAIADRVVA----IRAFPLPKT--VSELRRFLGMVNFYRRFIPH 659
Query: 720 GRLHSRRIQRQASLLRLGAPHLTPI---NPAVLPKLEWWLNALPLSSPI----------- 765
A LR HL + NP+ L W ++ I
Sbjct: 660 A----------ADTLR----HLNSMLCKNPSNRRPLVWSTESMSAFQKIKTMLSEETLLC 705
Query: 766 FPRQVQHF-ISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQ 815
+P+ F + TD S G+ ++ +F S Q+ + + +E+ A++
Sbjct: 706 YPKLNGRFSLVTDCSRTAMGAVLNQLVEGEWKPLAFFSRALKPSQKKYSVFDQELLAIYD 765
Query: 816 ALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG------TKSLSLLSEVEKIFLLSQDWRI 869
A+ LL+ + +D++ +V + T+ SL++E K
Sbjct: 766 AVRHFKYLLEGRKFNIVTDHKPIVRAFHSKRDRFSPRQTRQFSLIAEYTKSI-------- 817
Query: 870 HILAQFIPGAYNSVADSLSRSK----SLPD-----WHLSRSATEQIFLKWGVPCIDLFAS 920
++I G NSVAD LSR++ SLPD +SR F+ I+L++S
Sbjct: 818 ----EYISGFQNSVADCLSRAEVNSLSLPDEAITLHEISRHQNNPAFVNE----IELYSS 869
Query: 921 --RVSAVVPN 928
V+ V+P+
Sbjct: 870 LNLVNRVIPD 879
>gi|326670230|ref|XP_003199169.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Danio rerio]
Length = 1153
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 113/455 (24%), Positives = 193/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 270 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 327
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 328 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 384
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 385 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILIYSNSLSEHIQHVRAVLERLIEN 442
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 443 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 485
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 486 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 534
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 535 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 594
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 595 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 650
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +I G+ N AD+LSR
Sbjct: 651 QARWALFFTRFDFQV----TYISGSKNIKADALSR 681
>gi|20152590|emb|CAD29542.1| pol [Kazachstania exigua]
Length = 1181
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 105/437 (24%), Positives = 177/437 (40%), Gaps = 66/437 (15%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ E++ +G + + S + + + + V K +G R ++ +GLN KF L +
Sbjct: 256 LSELIASGNV--IPSESPYAAPVIFVQKKDGTKRLCVDYRGLNDITIKSKFPLPLIEDVL 313
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L +DL Y V I Q A + + + +PFGL AP F L
Sbjct: 314 DQLSGATIFSKLDLISGYHQVAIADEDQYKTAFTTHRGQYSWRVMPFGLTNAPATFQRLM 373
Query: 600 NWVASLLRSRGMRV-VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
N+V LR ++ VVYLDD L+ +++ + +++L K
Sbjct: 374 NYV---LRDYINKICVVYLDDILIYSKNEKEHSEHVSTIINVLRKHQLYAKKSKCEFY-V 429
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-------NLDSARSLLGYL- 710
P +QFLG L + DK+ +LA K W + S L GY
Sbjct: 430 PKIQFLG----HELSAKGITPDKE--------KILAIKDWPTPKTYKDAQSFIGLAGYYR 477
Query: 711 ----SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF 766
F+ P+ +L +++I+ T L KL+ L+ P+ P F
Sbjct: 478 RFIKDFSYIAKPLHQLAAQKIK------------WTDECKESLDKLKRQLSTAPIIIP-F 524
Query: 767 PRQVQHFISTDASDLGWGSQVD--------------SSFLSGLWSREQQNWHINKKEMFA 812
R Q ++TDAS G+ ++ ++LS L + NW I KE++A
Sbjct: 525 DRTKQIVLTTDASSTAIGAVLELYGKGTLKSELVGVVAYLSHLLRDNELNWPIRDKELYA 584
Query: 813 VHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL 872
V A L + +++++D+ + + Y + +L L + L D+ I
Sbjct: 585 VIFAFKKWRHYLAGTHIIIKTDHHS-LQYFKTSVLDSNLRLARWRD--ILEEFDYEI--- 638
Query: 873 AQFIPGAYNSVADSLSR 889
Q+I G+ N AD+LSR
Sbjct: 639 -QYIKGSTNH-ADALSR 653
>gi|281205563|gb|EFA79753.1| hypothetical protein PPL_07444 [Polysphondylium pallidum PN500]
Length = 1918
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 103/437 (23%), Positives = 179/437 (40%), Gaps = 59/437 (13%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I LE G + S + F + L + K +GG R ++ + LN + L N +
Sbjct: 859 INTYLENGQI--TPSQSAFAAPLLFIKKKDGGWRLCVDYRSLNGITIKDTYPLPNITEVL 916
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + G IDL Q Y + + Q A + V T LPFGL AP F L
Sbjct: 917 NNTRDGVLFSKIDLLQGYHQIRVHENDQSKTAFRTSFGVFQYTVLPFGLTNAPACFQRL- 975
Query: 600 NWVASLLRSR--GMRVVVYLDDFLL-VNQDPRILEIQGKLA-VSILGSLGWIVNLQKSSL 655
+ S+ + +++VYLDD L+ N D IQ L V +L + L K
Sbjct: 976 --MDSIFQRHVIAKKLLVYLDDLLIKTNIDDEDKHIQDVLEIVDLLNQNKLKIKLTKCIF 1033
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLS--- 711
L++LG H+ + +K + + + +LA K W + R L G+L
Sbjct: 1034 GQYS-LEYLG-----HI----IGHNKLIPIND---KILAIKNWKQPITKRELRGFLGLTN 1080
Query: 712 -FASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF-- 766
+ F+ + + + I R+ L + H N L K N + SS +F
Sbjct: 1081 YYRKFIPKLSEIEAPLIDITRKNKLFKWEDIHTETFN---LIK-----NQISDSSFLFIP 1132
Query: 767 PRQVQHFISTDASDLGWG----------SQVDSS----FLSGLWSREQQNWHINKKEMFA 812
++ I DAS+ G G Q D+ + S ++ ++++H+ ++E+ A
Sbjct: 1133 DYKLTFHIDCDASNDGIGHVIYQYKDNIEQEDNKQIVLYGSKKFNTTERDYHVFEQEVMA 1192
Query: 813 VHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL 872
+ AL N +L +++ +D+Q ++ + L+ ++ IF +
Sbjct: 1193 IKHALESNYHMLLGYKIVIHTDHQNILFINNKLNDNTKPKLIRWLQYIFSFNP------T 1246
Query: 873 AQFIPGAYNSVADSLSR 889
+ G+ N +AD LSR
Sbjct: 1247 LIYKKGSDNVIADGLSR 1263
>gi|301612460|ref|XP_002935737.1| PREDICTED: hypothetical protein LOC100487670 [Xenopus (Silurana)
tropicalis]
Length = 1434
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 135/325 (41%), Gaps = 34/325 (10%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
+G + D+ A+ +PI L + + CLP G + + + F+S
Sbjct: 586 RGALLAKSDIESAFRLLPIHPDCFHLLGIKFANLYFVGMCLPMGCSISCYYFELFSSFIE 645
Query: 601 WVASLLRSRGMRVVVYLDDFLLVN-----QDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
WV + + ++ ++ YLDDFL V + R+L L + ++ + G + K+
Sbjct: 646 WVVTQV-AQSNSMLHYLDDFLFVGPANSPECARLLH----LFMEVMKNFGVPIAKDKTE- 699
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
P V+ FLGI D LP +K +L +L L +K L +SLLG+L+FAS
Sbjct: 700 GPQEVIVFLGIEIDSQEMVFRLPLEKLESLSQLLDRALMAKKLTLKQIQSLLGHLTFASR 759
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF-- 773
++P GR RR+ ++ H + + L W L + Q+
Sbjct: 760 IMPTGRAFCRRLSLSTKGIKY-PNHYIRMTKHIKDDLRIWQKILAEYNGQSCWQISEKSN 818
Query: 774 ----ISTDASDLGWGSQVDSSFLSGLWSREQ--QNW-------HINKKEMFAVHQALSLN 820
+ TDA+ GS+ ++ G W Q W ++ E+F + A +
Sbjct: 819 LELELFTDAA----GSKGMGAYFQGQWCSAQWPSFWRDTDLIRNLTCLELFPIVVASHIW 874
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQ 845
LL + V+ DN +VV + Q
Sbjct: 875 GELLANQRVIFWCDNSSVVQVINNQ 899
>gi|281211438|gb|EFA85602.1| hypothetical protein PPL_01385 [Polysphondylium pallidum PN500]
Length = 1905
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 103/437 (23%), Positives = 179/437 (40%), Gaps = 59/437 (13%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I LE G + S + F + L + K +GG R ++ + LN + L N +
Sbjct: 846 INTYLENGQI--TPSQSAFAAPLLFIKKKDGGWRLCVDYRSLNGITIKDTYPLPNITEVL 903
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + G IDL Q Y + + Q A + V T LPFGL AP F L
Sbjct: 904 NNTRDGVLFSKIDLLQGYHQIRVHENDQSKTAFRTSFGVFQYTVLPFGLTNAPACFQRL- 962
Query: 600 NWVASLLRSR--GMRVVVYLDDFLL-VNQDPRILEIQGKLA-VSILGSLGWIVNLQKSSL 655
+ S+ + +++VYLDD L+ N D IQ L V +L + L K
Sbjct: 963 --MDSIFQRHVIAKKLLVYLDDLLIKTNIDDEDKHIQDVLEIVDLLNQNKLKIKLTKCIF 1020
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLS--- 711
L++LG H+ + +K + + + +LA K W + R L G+L
Sbjct: 1021 GQYS-LEYLG-----HI----IGHNKLIPIND---KILAIKNWKQPITKRELRGFLGLTN 1067
Query: 712 -FASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF-- 766
+ F+ + + + I R+ L + H N L K N + SS +F
Sbjct: 1068 YYRKFIPKLSEIEAPLIDITRKNKLFKWEDIHTETFN---LIK-----NQISDSSFLFIP 1119
Query: 767 PRQVQHFISTDASDLGWG----------SQVDSS----FLSGLWSREQQNWHINKKEMFA 812
++ I DAS+ G G Q D+ + S ++ ++++H+ ++E+ A
Sbjct: 1120 DYKLTFHIDCDASNDGIGHVIYQYKDNIEQEDNKQIVLYGSKKFNTTERDYHVFEQEVMA 1179
Query: 813 VHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL 872
+ AL N +L +++ +D+Q ++ + L+ ++ IF +
Sbjct: 1180 IKHALESNYHMLLGYKIVIHTDHQNILFINNKLNDNTKPKLIRWLQYIFSFNP------T 1233
Query: 873 AQFIPGAYNSVADSLSR 889
+ G+ N +AD LSR
Sbjct: 1234 LIYKKGSDNVIADGLSR 1250
>gi|2982231|gb|AAC06354.1| DNA polymerase [Duck hepatitis B virus]
Length = 836
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 439 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 493
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 494 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 553
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 554 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 613
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 614 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 656
>gi|1706510|sp|P03162.2|DPOL_DHBV1 RecName: Full=Protein P; Includes: RecName: Full=DNA-directed DNA
polymerase; Includes: RecName: Full=RNA-directed DNA
polymerase; Includes: RecName: Full=Ribonuclease H
gi|325433|gb|AAA45742.1| DNA polymerase (putative; gene 6); putative [Duck hepatitis B
virus]
Length = 836
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 439 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 493
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 494 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 553
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 554 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 613
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 614 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 656
>gi|40786826|gb|AAR89920.1| polymerase protein [Ross's goose hepatitis B virus]
Length = 786
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 103/221 (46%), Gaps = 13/221 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
R+FLV K + T R V++ KG + PK +S N + L G IS+DL
Sbjct: 389 RIFLVDKNSRNTAEARLVVDFSQFSKGKHAMRFPKYWS-PNLSTLRRILPVGMPRISLDL 447
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 448 SQAFYHLPLNPASSSRLAISDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 507
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG + D
Sbjct: 508 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGVRINFDKTTPSPVTEIKFLGYLID--- 564
Query: 673 DRMWLPEDKQLT-LGNILRTLLASKTWNLDSARSLLGYLSF 712
D+ ED++ L +++ + K ++ + +G+L+F
Sbjct: 565 DKFMKIEDQRWNELRQVIKKIQIGKWYDWKCIQRFIGHLNF 605
>gi|31415504|gb|AAP44979.1| DNA polymerase [Duck hepatitis B virus]
Length = 836
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 439 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 493
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 494 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 553
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 554 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 613
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 614 DENF--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 656
>gi|326670241|ref|XP_003199171.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1401
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 104/451 (23%), Positives = 179/451 (39%), Gaps = 54/451 (11%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P + L+ P + AM +I+E L +G ++ ST+ + F + K +G RP ++ +G
Sbjct: 461 PKSKIYPLSHPETQAMETYIEEALSSGYIR--PSTSLAAAGFFFIEKKDGSLRPCIDYRG 518
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L L++ +DL AY + I+ + A
Sbjct: 519 LNNITVKYRYPLPLVPPALEQLREARIYTKLDLRSAYNLIRIREGDEWKTAFLTTRGHYE 578
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
+P+GLA +P F S N + L ++ V+ Y+DD L+ + + LE K ++
Sbjct: 579 YLVMPYGLANSPAVFQSFINEIFRDLLNKC--VIAYIDDILIYSPN---LEQHIKDVRTV 633
Query: 641 LGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT 697
L L L+K + FLG + H M D ++ A
Sbjct: 634 LTRLQENQLYAKLEKCEFHMSKT-SFLGYIISHHGVEM---NDTKVQ---------AVTG 680
Query: 698 WNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL 756
W L + + L +L FA+F R +S SLL+ G P NP + LE L
Sbjct: 681 WPLPKTVKKLQRFLGFANFYRRFIRNYSLISAPLTSLLK-GKPSKLKWNPETVKSLE-KL 738
Query: 757 NALPLSSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSFLSGLWSREQQN 802
++PI ++ + DASD G G+ + ++ S + ++N
Sbjct: 739 KTSFTTAPILKHPNPELPFVVEVDASDYGIGAVLSQRHGNPGKLHPCAYFSRKLTAAERN 798
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSV----VMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
+ + KE+ ++ AL L+ +V ++ N + RR L+
Sbjct: 799 YDVGNKELLSMKAALEEWRHWLEGAVHPFQIITDHKNLEYIKSARR------LNPRQARW 852
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG N AD+LSR
Sbjct: 853 SLFFT----RFNFTVTYRPGTKNHKADALSR 879
>gi|77557165|gb|ABA99961.1| retrotransposon protein, putative, unclassified [Oryza sativa
Japonica Group]
Length = 1619
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 112/457 (24%), Positives = 186/457 (40%), Gaps = 63/457 (13%)
Query: 452 PFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGG 511
PF PL+P A P++ + + EML+ G+++ +T+ F S + LV K +G
Sbjct: 658 PFDHSIPLLPG------AQPINDEIEAQVTEMLQNGIIQH--NTSPFASPVLLVKKKDGS 709
Query: 512 TRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLA 571
++ + LN K L + L + +DL Y + +K + A
Sbjct: 710 WHFCVDYRHLNAITVKNKCPLPIIDELLDELSGAQWFTKLDLRAVYHQIRMKVEDEHKTA 769
Query: 572 LSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILE 631
+ LPFGL +AP F + N + S L R V+V++DD L+ + R LE
Sbjct: 770 FRTHHGHFEFRVLPFGLTSAPATFQGIMNSILSTLLRRC--VLVFVDDILIYS---RTLE 824
Query: 632 --IQGKLAV-SILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI 688
I AV IL G V K S + L +LG P+ +
Sbjct: 825 DHIHHLRAVFQILNKHGLKVKQSKCSFAQQK-LSYLGHSIGPN------------GVATE 871
Query: 689 LRTLLASKTW----NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA--PHLT 742
+ A + W ++ RS LG + + + S+ + +LLR G +
Sbjct: 872 TDKIAAVRDWPTPQSVKELRSFLGLAGYYRKFVKNFGIISKPL---TNLLRKGQLFAWTS 928
Query: 743 PINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWS 797
N A L ++A L+ P F + + TDASD G G+ + +FLS
Sbjct: 929 MTNEAFLTLKHTLVSAPVLALPDF--SIPFVVETDASDKGIGAVLMQRNHPVAFLSKALG 986
Query: 798 REQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGT-----KSLS 852
+KE A+ A+ P LQ + +++D+++ +++L Q T K+L+
Sbjct: 987 PRHLGLSTYEKESLAIMLAIDHWRPYLQHAEFSIRTDHRS-LAFLDEQRLTTPWQHKALT 1045
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L ++ L + G+ NS AD+LSR
Sbjct: 1046 KLLGLQYKIL------------YKKGSENSAADALSR 1070
>gi|391325812|ref|XP_003737421.1| PREDICTED: uncharacterized protein K02A2.6-like, partial
[Metaseiulus occidentalis]
Length = 1209
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 101/424 (23%), Positives = 176/424 (41%), Gaps = 56/424 (13%)
Query: 487 GVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGD 546
GV++R+ ++ ++S + + K NG R ++L+ +N+ + F L + + L KG
Sbjct: 425 GVIERIQASE-WVSPIVVAEKKNGDVRLCVDLREVNKAVVQDAFPLPHIEDLMQRLAKGR 483
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLL 606
IDL AY +P+ + + A + T + FGLA+AP AF + L
Sbjct: 484 VFSKIDLRSAYHQIPLHESSRDLTAFVSPWGLFRYTRVCFGLASAPAAFQAFMEETLKDL 543
Query: 607 RSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI 666
V+ YLDD L+V + ++ + + + + L G VN + +FLG
Sbjct: 544 EG----VICYLDDVLVVGETRQVHDERVRGLLRTLSERGLKVN--NKCVFGVEETEFLGH 597
Query: 667 MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLG----YLS----FASFVIP 718
+ + LP D + N+ + N+ RS LG YL +A V P
Sbjct: 598 VVSSKGVKP-LP-DNVKAIENV------PEPKNVSQLRSFLGMAGFYLKCVPRYAELVEP 649
Query: 719 MGRLHSRRIQ---RQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIS 775
+ L + ++ R+ L A A L + ALPL ++
Sbjct: 650 LRELLRKEVKFDWREKQRLAFRAVKGAIAEAAPLRVFD---PALPL-----------VLT 695
Query: 776 TDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQS 826
TDASD G G+ + ++ S S Q+ + + KE A A+ L
Sbjct: 696 TDASDYGLGAVLQQRVNGKLEPLAYASCSLSETQRRYSTSDKEALACVWAIEKWHVYLWG 755
Query: 827 SVVMVQSDNQTVVSYLRRQGGT-KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
+++D++ +VS +G +S+ L E++ + D ++ PG N +AD
Sbjct: 756 RRFTLKTDHRALVSLFGTKGADRRSIRLARWAERLGAYAFD------VEYKPGVENVIAD 809
Query: 886 SLSR 889
+LSR
Sbjct: 810 ALSR 813
>gi|232018|sp|P30028.1|DPOL_HPBDC RecName: Full=Protein P; Includes: RecName: Full=DNA-directed DNA
polymerase; Includes: RecName: Full=RNA-directed DNA
polymerase; Includes: RecName: Full=Ribonuclease H
gi|325437|gb|AAA45745.1| polymerase [Duck hepatitis B virus]
Length = 787
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 100/220 (45%), Gaps = 11/220 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 390 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 448
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 449 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 508
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D
Sbjct: 509 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVTEIRFLGYQIDQKF 568
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
M + ED+ L +++ + ++ + +G+L+F
Sbjct: 569 --MKIEEDRWKELRTVIKKIKVGAWYDWKCIQRFVGHLNF 606
>gi|301631627|ref|XP_002944899.1| PREDICTED: hypothetical protein LOC100488033, partial [Xenopus
(Silurana) tropicalis]
Length = 633
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/308 (24%), Positives = 127/308 (41%), Gaps = 18/308 (5%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G M D+ A+ +P+ L + G LP G + + F + S ++
Sbjct: 329 GALMAKTDIEAAFRLLPVHPDSLHLLGCQFGGSFYVDRSLPMGCSISCSYFETFSTFLEW 388
Query: 605 LLRSR-GMRVVV-YLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R + GM ++ YLDDFL + + I + S+ G + K+ P+ +
Sbjct: 389 VIRQQAGMDSIIHYLDDFLCIGPTNSPACAILLQTVQSVTAEFGVPLAPDKTE-GPSTCI 447
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D LP DK L ++ + SK L +SLLG L+FA +I MGR
Sbjct: 448 KFLGIEIDTVRQECRLPMDKIRALKEDIQWAINSKKLTLKQLQSLLGRLTFACRIITMGR 507
Query: 722 LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL--------SSPIFPRQVQHF 773
+ SRR+ +S + H + + L W L + + +Q F
Sbjct: 508 VFSRRLAMASSGVN-KPHHFVRLRAELKADLGVWAKFLQTYNGKSYWQKTTDSNKDLQLF 566
Query: 774 ISTDASDLGWGSQVDSSFLSGL----WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV 829
A G+G+ + S+ +G W E ++ E+F + A+ L + V
Sbjct: 567 TDA-AGSCGFGAYFNGSWCAGKWPESWGEEGLTRNLTLLELFPILVAIELWGHSFSNRNV 625
Query: 830 MVQSDNQT 837
+ +DN +
Sbjct: 626 IFNTDNMS 633
>gi|326663810|ref|XP_003197666.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type 1-like
[Danio rerio]
Length = 1416
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 118/461 (25%), Positives = 180/461 (39%), Gaps = 75/461 (16%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G RP ++ +G
Sbjct: 603 PKGKLYSLSAPEREAMGKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRPCIDYRG 660
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 661 LNNITVKNTYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRAGDEWKSAFNTPRGHFE 720
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F + N +LR + VYLDD L+ + + + +
Sbjct: 721 YCVLPFGLSNAPAVFQAFVN---DVLRDMIDQFIYVYLDDILIFSHSLQEHVQHIRRVLQ 777
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A + FLG + RM PE + A W
Sbjct: 778 RLLENGLYVKAEKCVFH-AQSVPFLGHIVSVEGLRM-DPEK-----------IKAVVNWP 824
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L FA+F RR R S +L AP LT + + P W
Sbjct: 825 TPDSRKALQRFLGFANFY--------RRFIRNFS--QLAAP-LTALTSSKTP-FRWSSAA 872
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGWG---SQVDSS--------FLSGL 795
L +S+PI P + F + DAS++G G SQ SS + S
Sbjct: 873 EAAFSKLKGCFVSAPILITPDPSRQFVVEVDASEVGVGAILSQRSSSDGKIHPCAYYSHR 932
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S + N+ I +E+ AV AL L+ S V +V +D+ K+L
Sbjct: 933 LSAAESNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDH-------------KNLEY 979
Query: 854 LSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSR 889
+ +++ W R + + PG+ N D+LSR
Sbjct: 980 IKSAKRLNSRQARWALFFGRFNFTISYRPGSKNIKPDALSR 1020
>gi|308475019|ref|XP_003099729.1| hypothetical protein CRE_23634 [Caenorhabditis remanei]
gi|308266384|gb|EFP10337.1| hypothetical protein CRE_23634 [Caenorhabditis remanei]
Length = 1538
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 104/467 (22%), Positives = 192/467 (41%), Gaps = 88/467 (18%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRL--DSTTGFLSRLFLVPKGNGGTRPVLNLKGLN 522
++ A P+ A+ I++M++ + +R+ +S + + S + LV K +G R ++ + +N
Sbjct: 1007 VRQKARPIPLAIRGEIRKMIQKMLSQRVIRESKSPWASPVVLVKKKDGSVRMCIDYRKVN 1066
Query: 523 QFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
+ L N L + DL Y+ +P+K + A + ++
Sbjct: 1067 LLIKYNAHPLPNIETTLLSLAGKKVFTTFDLLAGYWQLPLKEESKEITAFAIGSELFEWN 1126
Query: 583 CLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR--------ILEIQ 633
LPFGLAT+P F A++ V LL G V VY+DD L+ +++ + ILE
Sbjct: 1127 VLPFGLATSPAIFQAAMECVVGDLL---GTCVFVYVDDLLIASENMKEHAIHVQTILERI 1183
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
K + + S WI + + +LG M P + ++ + + +
Sbjct: 1184 EKSGMKLKASKCWIAREE---------VDYLGHMITPEGVKT-----EEAKVDKMKKFAR 1229
Query: 694 ASKTWNLDSARSLLGY-----LSFASFVIPMGRLHSRR------IQRQASLLRL-----G 737
L S L+GY +S++ P+ L S++ +++ + ++L
Sbjct: 1230 PEDVKQLQSFLGLVGYYRNFIMSYSKIAYPLNFLTSKKNAWVWGTEQENAFVQLKSSVCS 1289
Query: 738 APHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----------QV 787
AP L +P A+ + P + I TDAS G G+ Q
Sbjct: 1290 APVLRQPDPET---------AISGARP-------YLIYTDASRQGVGAVLAQEANDGEQH 1333
Query: 788 DSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG 847
+F S + + +HI E A+ AL ++ S V+V +D++ ++S +R G
Sbjct: 1334 PIAFASKSLTSAETRYHITDLEALAMMFALRRFRTIIYGSQVIVFTDHKPLISLMR---G 1390
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILA-----QFIPGAYNSVADSLSR 889
++ L W I ++ ++ G N VAD+LSR
Sbjct: 1391 SRLADRLMR----------WSIELIEFNPKIVYVKGKANVVADALSR 1427
>gi|26800779|emb|CAD29586.1| polymerase [Crane hepatitis B virus]
Length = 786
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
RLFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 389 RLFLVDKNSRNTEEARLVVDFSQFSKGKNAMHFPRYWS-PNLSTLRRILPVGMPRISLDL 447
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + + + R +
Sbjct: 448 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGTEISRRFNVW 507
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K + SP ++FLG + D
Sbjct: 508 TFTYMDDFLLCHPNARHLNAISHAVCTFLQELGIRINFDKMTPSPVSEIRFLGYIIDEQF 567
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + + L +++ + + ++ + +G+L+F
Sbjct: 568 --MKIEESRWIELRQVIKKIQIGQWYDWKCIQRFVGHLNFV 606
>gi|291236398|ref|XP_002738126.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 954
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 102/430 (23%), Positives = 171/430 (39%), Gaps = 63/430 (14%)
Query: 487 GVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-----KGLNQFLSPKKFSLINHFRIP-- 539
G KR + L L PK +GG R +++L +N +S + SL+ + RI
Sbjct: 521 GPFKRPPLKNFVCNSLGLRPKKSGGFRIIMDLSQPTLDSVNDNISKEDHSLV-YSRIDDA 579
Query: 540 -SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+F+ K G + ID+ A+ P++
Sbjct: 580 VAFIHKHGHGSLLAKIDVKHAFRLCPVRK------------------------------- 608
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
+W L R+R + YLDDFL V + + + + LG + +K
Sbjct: 609 ---EDW--HLHRARNKDFLHYLDDFLTVGPANTNACQHNMDVMLQSCHHLGVPIATEKVE 663
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
P V+ FLG+ D + LP+DK L L + L T + SL+G LSFA
Sbjct: 664 -GPCSVITFLGVELDTVNMVIRLPKDKLADLLFKLPSWLTRHTCSKRELLSLIGCLSFAC 722
Query: 715 FVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-------LSSPIFP 767
IP GR+ RR+ S+ + + ++WW + LP L +P +
Sbjct: 723 KCIPAGRILLRRMI-DISMTATSLSQVITLTDEFWHDVQWWCDFLPSWNGTASLLNPNWI 781
Query: 768 RQVQHFISTDAS-DLGWGSQVDSSFLSGLWSREQQN---WHINKKEMFAVHQALSLNLPL 823
+ + + TDAS LG+ + + + W N + I KE+ + + + L
Sbjct: 782 QSPEFELFTDASATLGYRAFYKGHWFANTWPTFITNDPLYSIAWKELLPILLSSLIWGHL 841
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
+ DN +VV + ++G + ++ V +F + H++ I G N +
Sbjct: 842 WYGLRIRFHCDNISVVQ-IWKKGSSSCPRIMQLVRLLFFTAASNNFHVMISHISGFNNDI 900
Query: 884 ADSLSRSKSL 893
ADSLSR + L
Sbjct: 901 ADSLSRQQIL 910
>gi|26800782|emb|CAD29588.1| polymerase [Crane hepatitis B virus]
Length = 785
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
RLFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 388 RLFLVDKNSRNTEEARLVVDFSQFSKGKNAMHFPRYWS-PNLSTLRRILPAGMPRISLDL 446
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + + + R +
Sbjct: 447 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGTEISRRFNVW 506
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K + SP ++FLG + D
Sbjct: 507 TFTYMDDFLLCHPNARHLNAISHAVCTFLQELGIRINFDKMTPSPVNEIRFLGYVIDEQF 566
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + + L +++ + + ++ + +G+L+F
Sbjct: 567 --MKIEESRWIELRQVIKKIQIGQWYDWKCIQRFVGHLNFV 605
>gi|322784669|gb|EFZ11524.1| hypothetical protein SINV_02587 [Solenopsis invicta]
Length = 82
Score = 72.4 bits (176), Expect = 1e-09, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 46/77 (59%)
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLL 606
+M S+DL AY VP+ H++FL + + TCLPFGL+T+P F L V + L
Sbjct: 6 FMGSLDLKDAYHVVPVHKDHRKFLRFKFLDKLYQFTCLPFGLSTSPYVFTKLMKPVMNHL 65
Query: 607 RSRGMRVVVYLDDFLLV 623
R RG+ V+YLDD L +
Sbjct: 66 RLRGIVTVIYLDDILFI 82
>gi|313213696|emb|CBY40594.1| unnamed protein product [Oikopleura dioica]
Length = 512
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 150/358 (41%), Gaps = 36/358 (10%)
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRV 613
+ Y +P+ ++ + + L FG+ AP + L++ + LR G+++
Sbjct: 10 ASGYHQMPLAAESKKMACFKWGNYIFENNILAFGIPAAPGMYQLLNSVGINFLRQNGIKI 69
Query: 614 VVYLDDFLLV------NQDPRIL--EIQGK---LAVSILGSLGWIVNLQKSSLSPAPVLQ 662
+YLDD LL+ N ++L EI K L + L +LG VN++KS P ++
Sbjct: 70 TLYLDDRLLIISPKSENHRKKLLTEEILCKEVWLVAATLVALGGFVNIKKSEFKPTQRIE 129
Query: 663 FLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRL 722
FLG + D + + + +PE + TL +R + K L + G + AS V +
Sbjct: 130 FLGFILDTNKETVEIPEGRWNTLKKRMRDAESGKMVELKLLERIRG--TQASMVEVFSNM 187
Query: 723 HSRRIQRQASLLRLGAPHLTPINPAVLPK---LEW--WLNALPLS-SPIFPRQVQH---- 772
R + RQ ++L + L VL K EW W S + R+ +
Sbjct: 188 --RMLIRQITIL-IMQTELEKKTETVLTKEVRREWKLWYEFEKTGLSRSWKREDRSDAGL 244
Query: 773 FISTDASDLGWGSQVDSSFLSG--LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
I TDAS ++ LS W + HI KE A+ AL L V
Sbjct: 245 LIYTDASKHAGAIVIEKWKLSEKFAWEEDLAAAHIGIKEAAAIRMALEWYGRNLAKKRVT 304
Query: 831 VQSDNQTVVSYLRRQG---GTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
DN +VV QG G+K + ++ +I++L+Q +I + +++ D
Sbjct: 305 FLCDNDSVV-----QGAINGSKDPEMNKQLVRIWMLAQKRKIDLKIEWVSTKLQKADD 357
>gi|82592700|gb|ABB84517.1| polyprotein [Duck hepatitis B virus]
Length = 787
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 97/225 (43%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 390 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSALRRILPLGMPRI 444
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 445 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARR 504
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 505 FNVWTFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 564
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + M + E + L +++ + + ++ + +G+L+F
Sbjct: 565 DDNF--MKIEESRWSELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 607
>gi|189009874|ref|YP_001931967.1| replicase [Eupatorium vein clearing virus]
gi|172041764|gb|ACB69773.1| replicase [Eupatorium vein clearing virus]
Length = 674
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 110/449 (24%), Positives = 186/449 (41%), Gaps = 62/449 (13%)
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG----NGGTRPVLNLKGLNQFLSP 527
+ + I+E+L ++ + S + +S F+V KG G R V+N K LN
Sbjct: 254 IRQEFDIQIKELLAMNLI--VPSKSPHMSPAFMVNKGAEQRRGKMRMVVNYKALNDATIG 311
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ N + + + S D ++ V + Q A + +PFG
Sbjct: 312 DAHNIPNRDSLMALISGKRIFSSFDCKSGFWQVLLDKPSQELTAFTCPQGHYQWLVMPFG 371
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F + L VY+DD L+ +++ + EI ++ +LG I
Sbjct: 372 LKQAPAIF---QRHMQIALNEHSAYSCVYIDDILVFSENEKDHEIHVSKVLNRCINLGII 428
Query: 648 VNLQKSSLSPAPVLQFLGIMWD-------PHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
++ +KS L + FLGI D PH+ L NI + ++
Sbjct: 429 LSKKKSQLFKETI-DFLGISIDKGTHSPKPHI------------LENIHN--FPERFKDV 473
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP 760
+ R LG +++A IP L +R+ Q L + T + +L KL+ L P
Sbjct: 474 NQCRKFLGIITYAMRYIP--ELSRKRMFLQDKLKKNVPWTWTSEDTRLLQKLKLSLKEFP 531
Query: 761 LSSPIFPRQVQHFISTDASDLGWGSQVDSS--------------FLSGLWSREQQNWHIN 806
P Q Q + TDAS WG + + + SG + + + N+H N
Sbjct: 532 KLHIPQPGQ-QLILETDASQKYWGGILKAEVIHSNNEITEEICCYASGTFKQAELNYHSN 590
Query: 807 KKEMFAVHQALSLNLPLLQSSV-VMVQSDNQTVVSYL--RRQGGTKSLSLLSEVEKIFLL 863
+KE+ AV +++ + P+ + V +V++DN+T+ +L + + GTKS L+
Sbjct: 591 EKEILAVIRSIQ-SFPVYLTPVEFIVRTDNKTMEHFLTSKFELGTKSGRLVR-------- 641
Query: 864 SQDWRIH--ILAQFIPGAYNSVADSLSRS 890
Q W H + I G N +AD LSR
Sbjct: 642 WQMWFKHYNFKVEHIKGTSNFLADYLSRE 670
>gi|449524808|ref|XP_004169413.1| PREDICTED: uncharacterized protein LOC101228880 [Cucumis sativus]
Length = 1099
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 103/431 (23%), Positives = 173/431 (40%), Gaps = 54/431 (12%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
M + EML +G+++ ST+ +LS + LV K +G R ++ + LN P KF +
Sbjct: 648 MEKLVDEMLSSGIIR--PSTSPYLSSVLLVKKKDGSWRLCVDYRALNNVTIPDKFPIPVI 705
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ L + IDL Y + + + A + +PFGL AP F
Sbjct: 706 EELFDELNGANLFSKIDLKAGYHQIRMCSQDIEKTAFRTHEGHYEFLVIPFGLMNAPATF 765
Query: 596 ASLSNWVASLLRSRGMR-VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
SL N S+ R+ + V+V+ DD L+ ++ + KL + +L N +K S
Sbjct: 766 QSLMN---SIFRAYLWKFVLVFFDDILIYSRGWKEHCQHIKLVLEVLRIHRLFANKKKCS 822
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT----LLASKTWNL-DSARSLLGY 709
+ L++LG + GN + + + K W + + R + G+
Sbjct: 823 FATTK-LEYLG----------------HVLSGNEVEVDPEKISSIKQWPIPTNVREVRGF 865
Query: 710 LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPR 768
L + + + LL+LG T +L+ + LP L+ P F
Sbjct: 866 LGLTGYYRRFEQHYGSIAAPLTQLLKLGPFKWTQEAQVAFERLQQAMITLPTLALPDFNA 925
Query: 769 QVQHFISTDASDLGWG-----SQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPL 823
++ + TDAS G G ++ +F S + + I ++E+ AV A+ P
Sbjct: 926 PLE--LETDASGYGVGVVLMQNKRPIAFYSHTLAMRDRARPIYERELMAVVLAVQRWRPY 983
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILA-----QFIPG 878
L +V++D +SL L E I Q W +L + PG
Sbjct: 984 LLGRTFIVKTDQ-------------RSLKFLLEQRVIQPQYQKWIAKLLGYSFEVMYKPG 1030
Query: 879 AYNSVADSLSR 889
N VAD+LSR
Sbjct: 1031 LENKVADALSR 1041
>gi|158830699|gb|ABW81763.1| reverse transcriptase [Dahlia mosaic virus-Holland]
Length = 673
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 121/516 (23%), Positives = 209/516 (40%), Gaps = 75/516 (14%)
Query: 395 KVQTLQKPQRCSSPVNPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPFS 454
++Q L K +P++P + + ++ A I+L P +VR+
Sbjct: 206 EIQNLLKKVCSENPIDP------------AKSKAWMKASIKLADPKSVVRV--------- 244
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK----GNG 510
P+V + + I+E+L+ V++ S + +S FLV K G G
Sbjct: 245 --KPMV-------YSPEDRKEFEIQIKELLDLKVIE--PSKSQHMSPAFLVEKEAEKGRG 293
Query: 511 GTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFL 570
R V+N K LN+ +L N + + L+ S D ++ V + Q+
Sbjct: 294 KKRMVVNYKKLNEVTIGDSHNLPNMQELITLLRGKTIFSSFDCKSGFWQVFLDQESQKLT 353
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
A + +PFGL AP F + + LR +VY+DD ++ +
Sbjct: 354 AFTCPQGHFQWRVVPFGLKQAPSIF---QRHMQNALRGLEEFCLVYVDDIIVFSDKEEEH 410
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDR-MWLPEDKQLT-LGNI 688
+ + SLG I++ +K++L + FLG+ +DR P++ L L N
Sbjct: 411 YTHVLKVLKRIESLGIILSEKKANLFKEKI-NFLGL----EIDRGTHTPQNHILEHLHNF 465
Query: 689 LRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAV 748
L K + LG L++A IP +L +R Q L + +
Sbjct: 466 PDRLEDKK-----QLQRFLGVLTYADSYIP--KLAEKRKPLQVKLKKDQVWIWNQSDTDY 518
Query: 749 LPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGW----------GSQVDSSFLSGLWS 797
+ K++ L P L P ++ I TDASD W G+++ + SG +
Sbjct: 519 VKKIKKGLVNFPKLYLP--KKEDSLIIETDASDHFWGGVLKAQTTEGNELICRYSSGTFK 576
Query: 798 REQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEV 857
+ N+H N+KE+ AV Q ++ L V++DN V+ L+R TK + +
Sbjct: 577 PAELNYHSNEKELLAVKQVITKFSIYLTPVTFTVRTDN---VNLLKRFMNTK---ITGDS 630
Query: 858 EKIFLLS-QDWRIHIL--AQFIPGAYNSVADSLSRS 890
++ L+ Q W H + G N +AD L+R
Sbjct: 631 KQGRLIRWQMWLSHYTFNVNHLKGEKNVLADYLTRE 666
>gi|48696570|ref|YP_024974.1| polymerase protein [Sheldgoose hepatitis B virus]
gi|40786855|gb|AAR89944.1| polymerase protein [Sheldgoose hepatitis B virus]
Length = 796
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 98/226 (43%), Gaps = 21/226 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 399 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 453
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P LA+S V P G+ +P + + S + R
Sbjct: 454 SLDLSQAFYHLPFNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARR 513
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L + L LG +N K++ SP ++FLG
Sbjct: 514 FNVWTFTYMDDFLLCHPNARHLHAISNSVCNFLQELGIRINFDKTTPSPVTEIRFLGYQI 573
Query: 669 DPHLDRMWLPEDKQLT-LGNILRTLLASKTWNLDSARSLLGYLSFA 713
D R+ ED + T + N+++ + + ++ + +G+L+F
Sbjct: 574 DSKFMRI---EDMRWTEIRNVIKKIKVGEWYDWKCIQRFVGHLNFV 616
>gi|119657151|gb|ABL86705.1| putative pol protein [Adineta vaga]
Length = 1302
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 108/461 (23%), Positives = 200/461 (43%), Gaps = 69/461 (14%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I ++L+ GV++ +ST+ + S + LV K +G R ++ + LN + F + I
Sbjct: 399 INKLLKQGVIE--ESTSPWSSPIVLVRKKDGSVRFCIDYRKLNAITTKDAFPIPRIDDIF 456
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L + Y +ID YF V + + A S T LP G+ P AF +
Sbjct: 457 DHLSQAGYYTTIDFKSGYFQVGLDARDRPKTAFSTRDQHYQFTVLPQGVTNGPPAFQRIV 516
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ + L +R + YLDD ++ + D ++ + L + L + +N+ K ++
Sbjct: 517 SQI--LGPTRWKYALAYLDDVIIYSPTFDQHLVHLDDIL--NRLHEANFRLNVGKCHIAQ 572
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNI------LRTLLASKTWNLDSARSLLGYLS 711
+ +LG + GNI +R LL +T +A+ ++
Sbjct: 573 TSI-DYLG---------------HHIEHGNIKPNADNIRALL--ETPQPATAKEAFRFVK 614
Query: 712 FASFV---IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-----LSS 763
A + IP + ++ + + A + + P P L L+ L+A+ L++
Sbjct: 615 AAEYYRKFIPKFSMIAQPLYKYAPTTKEQRSNKMPAVPIQL--LDDELHAIHELKQILTN 672
Query: 764 PIFPR----QVQHFISTDASDLGWGS---QVDSS------FLSGLWSREQQNWHINKKEM 810
+ R + I TDAS +G G+ Q S+ +LS ++ Q NW ++E
Sbjct: 673 DLILRIPDENLPFKIQTDASKIGIGAVLMQTHSNGDLPVAYLSKKFTTTQMNWPATEQEC 732
Query: 811 FAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
+A+ A+ L ++++D++ ++ + +Q L S+ E+ L Q ++
Sbjct: 733 YAIIHAIEKWHKYLDGREFIIETDHKPLLPFNLKQ------QLNSKCERWRLKLQQYKFT 786
Query: 871 ILAQFIPGAYNSVADSLSRSKS------LPDWHLSRSATEQ 905
I ++I G +N+VAD LSRS S L D+ +RS T Q
Sbjct: 787 I--RYIKGKHNTVADYLSRSPSDNASDDLDDYVPTRSQTTQ 825
>gi|327291161|ref|XP_003230290.1| PREDICTED: hypothetical protein LOC100557797, partial [Anolis
carolinensis]
Length = 1042
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 79/166 (47%), Gaps = 12/166 (7%)
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
++F+G D R +LP+D+ +L L TLL +S LG+L + V P
Sbjct: 773 IRFIGADIDSTTGRAYLPDDRFRSLRTALLTLLQGPLPRAKDVQSALGHLGSTTVVTPYA 832
Query: 721 RLHSRRIQRQASLLRLGAP-------HLTPINPAVLPKLEWWL--NALPLSSPIFPRQVQ 771
RL R +Q LR+ P HL P+ V L WWL + + + P Q
Sbjct: 833 RLRMRPLQMW--FLRVFDPLTQSQNIHL-PVPAYVSQSLHWWLSRDNVCVGVPFQQPQAT 889
Query: 772 HFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQAL 817
++TDAS GWG+ S + G W++++ HIN E+ AV ++L
Sbjct: 890 VTLTTDASLYGWGAHSGSLMVKGKWTQQEAQHHINLLELMAVQRSL 935
>gi|302319038|gb|ADL14709.1| P protein [Duck hepatitis B virus]
Length = 787
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 96/225 (42%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 390 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPLGMPRI 444
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 445 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARR 504
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 505 FNVWTFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 564
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D M + E + L +++ + + ++ + +G+L+F
Sbjct: 565 DDKF--MKIEESRWSELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 607
>gi|301632044|ref|XP_002945101.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Xenopus (Silurana) tropicalis]
Length = 1429
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 113/497 (22%), Positives = 197/497 (39%), Gaps = 66/497 (13%)
Query: 420 ELVGGRLRRFVDAWIRLGAPA-PLVRIVSGYAIPFSAKP-PLVPLCSLQHLATPVSSAMS 477
+L+ G F+D + GA P RI Y P P +P + L+ P + +
Sbjct: 472 KLIPGSYHEFLDVFDERGADVLPPHRI---YDCPVDLLPGAAIPFGRIYPLSEPELTVLK 528
Query: 478 LHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFR 537
+I+E L+ G ++ S G + +F V K + RP ++ + LN K ++ N +
Sbjct: 529 DYIEENLKKGFIRPSTSPAG--AGIFFVEKKDHSLRPCIDYRDLN------KITIKNRYP 580
Query: 538 IPSF------LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATA 591
+P L+ +DL AY V I+ + A +PFGL A
Sbjct: 581 LPLIPELFLRLRSARVFTKLDLRGAYNLVRIRQGDEWKTAFRTRYGHFEYLVMPFGLCNA 640
Query: 592 PQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL 650
P A+ ++V + R + V+VYLDD L+ + K S L + L
Sbjct: 641 P---ATFQHFVNDIFRDFLDLFVIVYLDDILIFSSSLEEHRRHVKQVFSRLRAHKLFAKL 697
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL--RTLLASKTW-NLDSARSLL 707
+K + ++FLG + P G ++ R + A W +S +++
Sbjct: 698 EKCEFERS-TIEFLGFIISPE--------------GMLMDSRKVSAVLDWPTPNSRKAVQ 742
Query: 708 GYLSFASFVIPMGRLHSRRIQRQASLL-RLGAPHLTPINPAVLPKLEWWLNALPLSSPIF 766
++ FA+F + S+ I +L L TP L+ + P+
Sbjct: 743 RFVGFANFYRKFIKNFSKIISPITALTSSLKKFCWTPEAQQAFSDLKSRFTSAPILK--H 800
Query: 767 PRQVQHFI-STDASDLGWGSQVDS-----------SFLSGLWSREQQNWHINKKEMFAVH 814
P + F+ DAS+ G+ + +F S S +QN+ + +E+ +
Sbjct: 801 PDPTRPFVLEVDASEYAIGAVLSQRNDVQSLLHPIAFFSKKLSASEQNYDVGDRELLTIK 860
Query: 815 QALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL 872
A LL+ + ++V SD++ + YLR + L + L + H+
Sbjct: 861 SAFQEWRHLLEGAAHPILVFSDHKN-LEYLR-----SAKRLRPRQARWALFFSRFNFHV- 913
Query: 873 AQFIPGAYNSVADSLSR 889
F PG+ N AD+LSR
Sbjct: 914 -TFRPGSKNGKADALSR 929
>gi|326673841|ref|XP_003200011.1| PREDICTED: hypothetical protein LOC100536704, partial [Danio rerio]
Length = 2339
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 114/455 (25%), Positives = 194/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 424 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 481
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 482 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTVDGHYEYLVM 538
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 539 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILIYSNSLSEHIQHVRAVLKRLIEN 596
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 597 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 639
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L ++ FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 640 PETIRQLQRFMGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 688
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWG------SQVDS-----SFLSGLW 796
S+PI P Q F + DAS+ G G S V+ +F S
Sbjct: 689 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAIRSQRSLVNKKLHPCAFYSRKL 748
Query: 797 SREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
+ ++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 749 NSAERNYDVGNRELLAMKAALEEWRHWLEDAKHPFIVITDHKN-LEYIR---SCKRLNPR 804
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 805 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 835
>gi|427780885|gb|JAA55894.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 1358
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 91/410 (22%), Positives = 166/410 (40%), Gaps = 48/410 (11%)
Query: 498 FLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAY 557
F + K +G R ++ + LN P F + N + + K Y+ +D+++ Y
Sbjct: 974 FAHPIVCAAKKDGSIRVCVDYRNLNAITEPDSFPMGNVTELLYTIAKAKYISVLDMTRGY 1033
Query: 558 FHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYL 617
+ +P+ Q A + + A +P+GL + F + N LL + Y+
Sbjct: 1034 WQIPLSGESQGLAAFATPSGLYAWKVMPYGLRNSAATFQRIVN---ELLANHRQYACAYI 1090
Query: 618 DDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI-----MWDPHL 672
DD + ++ + + + + S G VN K + + P +++LG + P
Sbjct: 1091 DDVAVFSETWQDHMCHLRAVLQAIQSAGLTVNPAKCNFA-QPRVKYLGHEVGSGIHAPDS 1149
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQAS 732
DR+ ++ L KT RS+LG L++ +P +SR +
Sbjct: 1150 DRV-----------RAIQQLRPPKT--KKELRSVLGLLNYYRDYVPE---YSRLVLPLTG 1193
Query: 733 LLRLGAPHLTPINPAVLPKLEWWLNALP----LSSPIFPRQVQHFISTDASDLGWGS--- 785
L P+ P + AL L++PI ++ +++TDAS+ G+
Sbjct: 1194 LTNKRVPNTLPWTAEAQHAFDAVKEALASVPGLTAPIPGKEF--YLATDASERAVGACLS 1251
Query: 786 -QVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVV 839
+ D +FLS + QQ W ++E FA+ AL L + V V++D+ +
Sbjct: 1252 QEADGEERPVAFLSKKLTPAQQKWSTIEREAFAIVWALESLDTWLFGTKVRVRTDHDPLT 1311
Query: 840 SYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
R + L+ + L Q + I ++ I G N AD+LSR
Sbjct: 1312 FLARSSPSSARLT------RWALALQKYDIEMV--HIKGTLNKAADALSR 1353
>gi|308465921|ref|XP_003095217.1| hypothetical protein CRE_22626 [Caenorhabditis remanei]
gi|308245611|gb|EFO89563.1| hypothetical protein CRE_22626 [Caenorhabditis remanei]
Length = 2243
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 109/470 (23%), Positives = 185/470 (39%), Gaps = 63/470 (13%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRL-DSTTGFLSR 501
V I + +P +P VP+ + + HI +L + +R+ +S T + S
Sbjct: 1232 VHIYTNTEVPIRGRPSRVPV--------KYQAELEKHINSLLRS---RRITESNTPWTSP 1280
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYF 558
+ +V K NG R L+ + LN+ P F L RI + L+K ++ S+D++ Y
Sbjct: 1281 IVIVTKKNGSLRVCLDFRKLNEATIPDNFPLP---RIDAILEKVGGSNFFSSLDMANGYL 1337
Query: 559 HVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLD 618
+ + + V A T LPFGL +A F V S L V+VY+D
Sbjct: 1338 QLRLDASSSYKCGFITENKVYAYTHLPFGLKSAASYFQRALRTVLSGLEE---EVLVYID 1394
Query: 619 DFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLP 678
D L+ ++ I + + + +K + + FLG + +
Sbjct: 1395 DILVFSKTFEQHVISLRKVLQRFRDFNLKASPKKCEFA-KKAITFLG----HEIGKDSYS 1449
Query: 679 EDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA 738
DK N+ + N++ R +G F IP + + L RL
Sbjct: 1450 PDK----ANVAKITEFPVPSNVNEVRRFVGMAGFFRKFIP------KFSEIAEPLTRLTR 1499
Query: 739 PHLT----PINPAVLPKLEWWLNALPLSSPIFPRQVQHF-ISTDASDLGWGSQVDSS--- 790
L A KL L + P+ FP + F I DAS + G+ + +
Sbjct: 1500 KELKFTWDSAQQAAFEKLRTALASEPILG--FPDYDKPFHIFCDASAVAQGAALMQTRPE 1557
Query: 791 ---------FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSY 841
+ S S + W + EM A+ AL P + S +++ SD++ +
Sbjct: 1558 SEKDFYGIAYASRTLSDPETRWPAIQVEMGAIIFALRQFKPYICMSKIILHSDHKPLTFL 1617
Query: 842 LRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
L++ +LS + + Q + IHI+ I G N+VAD LSR++
Sbjct: 1618 LQKAKAHDNLS------RWLIELQCYDIHIV--HIDGKKNTVADCLSRAR 1659
>gi|321453393|gb|EFX64634.1| hypothetical protein DAPPUDRAFT_266050 [Daphnia pulex]
Length = 384
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 74/142 (52%), Gaps = 5/142 (3%)
Query: 422 VGGRLRRFVDAWIRLGAPAPLVRIV-SGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHI 480
+GGR++ F AW + A ++ +V G + F A P +V H++ + +S+
Sbjct: 240 IGGRIKYFSKAWELISADPWILNVVRHGLKLDFEAPPTMVNFPCNAHMS---ADQLSIGN 296
Query: 481 QEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPS 540
+E+ R S GF+S +F++ K +GG RP++NLK LN L + F + +
Sbjct: 297 EEVASERAAVRA-SKIGFVSSMFIIKKASGGFRPIINLKKLNDLLVYRHFKMEGLPTLKH 355
Query: 541 FLQKGDYMISIDLSQAYFHVPI 562
+ + D+M+ IDL AY VP+
Sbjct: 356 LIGEEDWMVKIDLKDAYLTVPV 377
>gi|26800785|emb|CAD29590.1| polymerase [Crane hepatitis B virus]
Length = 785
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
RLFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 388 RLFLVDKNSRNTEEARLVVDFSQFSKGKNAMHFPRYWS-PNLSTLRRILPVGMPRISLDL 446
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + + + R +
Sbjct: 447 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGTEISRRFNVW 506
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K + SP ++FLG + D
Sbjct: 507 TFTYMDDFLLCHPNARHLNAISHAVCTFLQELGIRINFDKMTPSPVNEIRFLGYVIDEQF 566
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + + L +++ + + ++ + +G+L+F
Sbjct: 567 --MKIEESRWIELRQVIKKIQIGQWYDWKCIQRFVGHLNFV 605
>gi|326677849|ref|XP_003200929.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1198
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 113/455 (24%), Positives = 193/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 207 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 264
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 265 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 321
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 322 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIKN 379
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 380 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 422
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S + AP LT + A +L+W +A+
Sbjct: 423 PETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKWNPDAV 471
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + AS+ G G+ + L +SR+
Sbjct: 472 RAFTQLKTRFSSAPILRHPDPEQPFVVEIYASNTGIGAILSQRSLVNKKLHPCAFYSRKL 531
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 532 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 587
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 588 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 618
>gi|326666718|ref|XP_003198350.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1174
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 108/449 (24%), Positives = 184/449 (40%), Gaps = 66/449 (14%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P S AM+ +IQE L G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 283 LSQPESEAMNSYIQEELGKGFIR--PSTSPAAAGFFFVKKKDGNLRPCIDYRGLNEITVK 340
Query: 528 KKFSLINHFRIP-SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
++ L +P + L++ +DL AY + IK + S T +PF
Sbjct: 341 YRYPLP---LVPAALLRQAKIYTKLDLRSAYNLIRIKQGDEWKTGFSTTRGDYEYTVMPF 397
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGW 646
GLA +P F + N V + + + + + + I ++ L I L
Sbjct: 398 GLANSPSVFQAFMNDVFRDMLDQWVIIYIDDILIYSNTVEEHIQHVRAVLQRLIHHHL-- 455
Query: 647 IVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARS 705
+K V FLG ++ + +T+ R + A + W L + +
Sbjct: 456 YAKFEKCEFHLTSV-SFLG----------YIISAEGVTMDE--RKVTAVQEWPLPQTLKQ 502
Query: 706 LLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP----- 760
L +L FA+F RR R S L AP LT + KL W A+
Sbjct: 503 LQRFLGFANFY--------RRFIRNFST--LAAP-LTSMTKRSHAKLIWQPEAIQAFSVL 551
Query: 761 ----LSSPIF--PRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGLWSREQQN 802
S+P+ P V F + DAS+ G G+ + +F S + ++N
Sbjct: 552 KEKFTSAPVLRHPNPVLPFVVEVDASNTGVGAVLSQRQGIPEKMYPCAFFSRKLNSAERN 611
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKI 860
+++ +E+ A+ AL L+ ++ V +D++ + YLR K L+ +
Sbjct: 612 YYVGNRELLAIKLALEEWRHWLEGAIFPFTVLTDHKN-LEYLR---TAKRLNPRQARWAL 667
Query: 861 FLLSQDWRIHILAQFIPGAYNSVADSLSR 889
F R + + PG+ N+ AD+LSR
Sbjct: 668 FFT----RFNFTVTYCPGSKNTKADALSR 692
>gi|212547038|ref|XP_002153672.1| retrovirus polyprotein, putative [Talaromyces marneffei ATCC 18224]
gi|210064432|gb|EEA18528.1| retrovirus polyprotein, putative [Talaromyces marneffei ATCC 18224]
Length = 558
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 101/447 (22%), Positives = 180/447 (40%), Gaps = 48/447 (10%)
Query: 463 CSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLN 522
C L + AM +I E L G + + S + F S + +V K +GG R ++ + LN
Sbjct: 117 CPLYRMTEAELEAMKDYILENLHKGFI--IPSNSPFASPILVVKKADGGLRFCVDYRKLN 174
Query: 523 QFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
++ L + + K +D+ Q + + +
Sbjct: 175 ALTRKDRYPLPLIDEVFERIHKAKIFTKLDIRQGFHRIRMSADSSDLTTFRCRYGTFKYE 234
Query: 583 CLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILG 642
+PFGL P F L N + L ++ ++DD L+ + D E K + L
Sbjct: 235 VMPFGLTNGPATFQRLINDI--FLDCLDKFLIAFVDDLLIYSNDELEHETHVKFVLERLR 292
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDS 702
+ G +++K ++LG + P D + + K + T+L+ KT N
Sbjct: 293 AAGLQASIKKCEFH-VTTTKYLGFIITP--DGVKVDTAK-------VETVLSWKTPN--- 339
Query: 703 ARSLLG---YLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNA 758
++LG +L F +F + +SR + L ++ P T L+ L +
Sbjct: 340 --TMLGIQSFLDFCNFYRKFIKEYSRIARPLYRLTKIDVPFKWTEDCQRAFDTLKERLGS 397
Query: 759 LPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFL-SGLW----------SREQQNWHINK 807
P+ + P++ Q + TDASD G + V S F G W + + N+HI+
Sbjct: 398 APVLTHYDPKR-QTRVETDASD-GVVAGVLSQFCKDGEWHPVGYYSATMAPAEHNYHIHD 455
Query: 808 KEMFAVHQALSLNLP----LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLL 863
KE+ A+ +A P L V SD++ + ++ TK+LS +V L
Sbjct: 456 KELLAIIKAFHEWKPELLGLRSEERFEVLSDHRALEYFM----TTKALS-ARQVRWYEFL 510
Query: 864 SQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + ++ PG N +AD+L+ S
Sbjct: 511 QE---FPFILKYRPGKSNVLADTLTSS 534
>gi|9625572|ref|NP_039822.1| P-protein [Duck hepatitis B virus]
gi|81936185|sp|Q66403.1|DPOL_DHBVQ RecName: Full=Protein P; Includes: RecName: Full=DNA-directed DNA
polymerase; Includes: RecName: Full=RNA-directed DNA
polymerase; Includes: RecName: Full=Ribonuclease H
gi|59062|emb|CAA42769.1| P-protein [Duck hepatitis B virus]
Length = 788
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 100/220 (45%), Gaps = 11/220 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVNDIRFLGYQIDQKF 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
M + E + + L +++ + ++ + +G+L+F
Sbjct: 570 --MKIEESRWIELRTVIKKIKIGAWYDWKCIQRFVGHLNF 607
>gi|308458622|ref|XP_003091647.1| hypothetical protein CRE_18271 [Caenorhabditis remanei]
gi|308255420|gb|EFO99372.1| hypothetical protein CRE_18271 [Caenorhabditis remanei]
Length = 727
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 104/467 (22%), Positives = 192/467 (41%), Gaps = 88/467 (18%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRL--DSTTGFLSRLFLVPKGNGGTRPVLNLKGLN 522
++ A P+ A+ I++M++ + +R+ +S + + S + LV K +G R ++ + +N
Sbjct: 125 VRQKARPIPLAIRGEIRKMIQKMLSQRVIRESKSPWASPVVLVKKKDGSVRMCIDYRKVN 184
Query: 523 QFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
+ L N L + DL Y+ +P+K + A + ++
Sbjct: 185 LLIKYNAHPLPNIETTLLSLAGKKVFTTFDLLAGYWQLPLKEESKEITAFAIGSELFEWN 244
Query: 583 CLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR--------ILEIQ 633
LPFGLAT+P F A++ V LL G V VY+DD L+ +++ + ILE
Sbjct: 245 VLPFGLATSPAIFQAAMECVVGDLL---GTCVFVYVDDLLIASENMKEHAIHVQTILERI 301
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
K + + S WI + + +LG M P + ++ + + +
Sbjct: 302 EKSGMKLKASKCWIAREE---------VDYLGHMITPEGVKT-----EEAKVDKMKKFAR 347
Query: 694 ASKTWNLDSARSLLGY-----LSFASFVIPMGRLHSRR------IQRQASLLRL-----G 737
L S L+GY +S++ P+ L S++ +++ + ++L
Sbjct: 348 PEDVKQLQSFLGLVGYYRNFIMSYSKIAYPLNFLTSKKNAWVWGTEQENAFVQLKSSVCS 407
Query: 738 APHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----------QV 787
AP L +P A+ + P + I TDAS G G+ Q
Sbjct: 408 APVLKQPDPE---------TAISGARP-------YLIYTDASRQGVGAVLAQEANDGEQH 451
Query: 788 DSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG 847
+F S + + +HI E A+ AL ++ S V+V +D++ ++S +R G
Sbjct: 452 PIAFASKSLTSAETRYHITDLEALAMMFALRRFRTIIYGSQVIVFTDHKPLISLMR---G 508
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILA-----QFIPGAYNSVADSLSR 889
++ L W I ++ ++ G N VAD+LSR
Sbjct: 509 SRLADRLMR----------WSIELIEFNPKIVYVKGKANVVADALSR 545
>gi|301610932|ref|XP_002934999.1| PREDICTED: hypothetical protein LOC100497482 [Xenopus (Silurana)
tropicalis]
Length = 660
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 116/273 (42%), Gaps = 21/273 (7%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
KG + D+ A+ +PI + L + G CLP + + + F S ++
Sbjct: 134 KGALLAKSDIESAFRLLPIHSDCYHLLGCQFEGQFYYDLCLPMDCSISCRYFECFSTFLE 193
Query: 604 SLLRSRGMR--VVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
++R V+ YLD+FL + Q+ + ++ + G ++ +K+ P V
Sbjct: 194 WVVRHETGHNSVINYLDNFLFIGPQNTNVCQLLLSTFQFFMAKFGVPLSKEKTE-GPVTV 252
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L FLGI D LP+DK L + + + +K L S +SLLG L FA ++P+
Sbjct: 253 LSFLGIEIDTVELVFRLPDDKLQKLKSTVAKVTVAKKVTLRSMQSLLGLLVFACRIMPIA 312
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP--------LSSPIFPRQVQH 772
R+ S+R+ ++ H I + L+ W L + + + ++
Sbjct: 313 RVFSQRLSLSTCGIK-QPHHFIRITKQLREDLKVWQTFLEQYNGHTCLMDTEVSNEELGL 371
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQ--QNW 803
F TDA+ GS + L+ W EQ NW
Sbjct: 372 F--TDAA----GSTGFGAILARTWCAEQWPDNW 398
>gi|119657139|gb|ABL86696.1| putative pol protein [Adineta vaga]
gi|119657143|gb|ABL86699.1| putative pol protein [Adineta vaga]
Length = 1302
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 108/461 (23%), Positives = 194/461 (42%), Gaps = 69/461 (14%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I ++L+ GV++ +ST+ + S + LV K +G R ++ + LN + F + I
Sbjct: 399 INKLLKQGVIE--ESTSPWSSPIVLVRKKDGSVRFCIDYRKLNAITTKDAFPIPRIDDIF 456
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L + Y +ID YF V + + A S T LP G+ P AF +
Sbjct: 457 DHLSQAGYYTTIDFKSGYFQVGLDARDRPKTAFSTRDQHYQFTVLPQGVTNGPPAFQRIV 516
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ + L +R + YLDD ++ + D ++ + L + L + +N+ K ++
Sbjct: 517 SQI--LGPTRWKYALAYLDDVIIYSPTFDQHLVHLDDIL--NRLHEANFRLNVGKCHIAQ 572
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNI------LRTLLASKTWNLDSARSLLGYLS 711
+ +LG + GNI +R LL +T +A+ ++
Sbjct: 573 TSI-DYLG---------------HHIEHGNIKPNADNIRALL--ETPQPATAKEAFRFVK 614
Query: 712 FASFV---IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---------WLNAL 759
A + IP + ++ + + A + + P P L E N L
Sbjct: 615 AAEYYRKFIPKFSMIAQPLYKYAPTTKEQRSNKMPAVPIQLLDDELHAFHELKQILTNDL 674
Query: 760 PLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------FLSGLWSREQQNWHINKKEM 810
L P + I TDAS +G G+ Q S+ +LS ++ Q NW ++E
Sbjct: 675 ILRIP--DENLPFKIQTDASKIGIGAVLMQTHSNGDLPVAYLSKKFTTTQMNWPATEQEC 732
Query: 811 FAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
+A+ A+ L ++++D++ ++ + +Q L S+ E+ L Q ++
Sbjct: 733 YAIIHAIEKWHKYLDGREFIIETDHKPLLPFNLKQ------QLNSKCERWRLKLQQYKFT 786
Query: 871 ILAQFIPGAYNSVADSLSRSKS------LPDWHLSRSATEQ 905
I ++I G +N+VAD LSRS S L D+ +RS T Q
Sbjct: 787 I--RYIKGKHNTVADYLSRSPSDNASDDLDDYVPTRSQTTQ 825
>gi|308460222|ref|XP_003092417.1| hypothetical protein CRE_03461 [Caenorhabditis remanei]
gi|308253211|gb|EFO97163.1| hypothetical protein CRE_03461 [Caenorhabditis remanei]
Length = 1398
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 106/476 (22%), Positives = 198/476 (41%), Gaps = 53/476 (11%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
A P K VP SL+ A+S I + TGVLK LD + + + + V K N
Sbjct: 491 AKPVFRKSRPVPYASLE--------ALSNEIDRLEATGVLKSLDHS-DWAAPVVAVTKKN 541
Query: 510 GGTRPVLNL-KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
G R + GLN + + L I + L G + IDL+ AY + + ++
Sbjct: 542 GSIRLCSDFSTGLNDAIEAHQHPLPTADDIFAKLNGGKFFSQIDLADAYLQIEVDDDSKK 601
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
L ++ + + LPFG+ AP F + + + + L V YLDD ++
Sbjct: 602 LLVINTHKGLFHYNKLPFGVKAAPGIFQQVMDTMLAGLDG----VACYLDDIIVTGCSIE 657
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI 688
+ K + + S G+ + L+K S P +QFLG + + + P+ +++
Sbjct: 658 EHNQRVKKVIERIASFGFRMRLEKCSFL-MPEIQFLGFVINEQGRK---PDPQKIA---D 710
Query: 689 LRTLLASKTWNLDSARSLLGYLSF-ASFVIPMGRLHSRRIQRQASLLRLGAPHL-----T 742
++ + A K N RS LG + F +FV + RL + L +L + T
Sbjct: 711 IKAMPAPK--NAIEVRSFLGLIQFYGTFVRDLHRL-------RPPLDKLTNKDVEFKWDT 761
Query: 743 PINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSF----------L 792
A E + L L+ + +V ++ DAS G G+ + F +
Sbjct: 762 ECQHAFDQVKEMLQSDLLLTH--YNPKVPIIVAADASQYGIGATISHRFPDGKEKAIYHV 819
Query: 793 SGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLS 852
S ++ Q+N+ +KE F + A++ + +++D++ ++S + G +
Sbjct: 820 SKALNKAQRNYSQIEKEAFGLVTAVTKFHKFVHGRRFTLRTDHKPLLSIFGEKKGV-PIY 878
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFL 908
+ +++ + ++ I ++I D+LSR + D R TE++ +
Sbjct: 879 TANRLQRWATILMNYNFSI--EYINTKDFGQVDALSR--LISDQMQQREETEEVVI 930
>gi|169116561|gb|ACA42581.1| polymerase [Duck hepatitis B virus]
Length = 841
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 98/227 (43%), Gaps = 21/227 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 442 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPLGMPRI 496
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 497 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARR 556
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV--LQFLGI 666
+ Y+DDFLL + + R L S L LG +N K++ SP PV ++FLG
Sbjct: 557 FNVWTFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVPVNEIRFLGY 616
Query: 667 MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D M + E + L +++ + + ++ + +G+L+F
Sbjct: 617 QIDQKY--MKIEESRWSELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 661
>gi|301608966|ref|XP_002934054.1| PREDICTED: hypothetical protein LOC100485881 [Xenopus (Silurana)
tropicalis]
Length = 616
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 85/186 (45%), Gaps = 4/186 (2%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
G + D+ A+ P+ + L + G CLP G + + + F S ++
Sbjct: 97 GALLSKSDIESAFCLFPVHSDCYHLLDCQFEGQFYYDMCLPMGCSISCRYFECFSTFLEW 156
Query: 605 LLRSRGMR--VVVYLDDFLLV-NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
++R V+ YLDDFL V ++ + ++ + G+ ++ +K+ P VL
Sbjct: 157 VVRQETGHNSVIHYLDDFLFVGSRSTNVCQLLLSTFQFFMQKFGFPLSKEKTE-GPTTVL 215
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
FLGI D LPEDK L + + A+K L S +SLLG L F+ ++P+
Sbjct: 216 SFLGIEIDTAALVFRLPEDKLQKLKVTISEIQAAKKVTLRSMQSLLGLLVFSRRIMPIAH 275
Query: 722 LHSRRI 727
+ S R+
Sbjct: 276 VFSLRL 281
>gi|211925530|dbj|BAG81988.1| gag-pol polyprotein [Clonorchis sinensis]
Length = 530
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 97/430 (22%), Positives = 178/430 (41%), Gaps = 51/430 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRI 538
+ MLE G+++ S++ + S L +VPK + G RP + + LN P ++ + +
Sbjct: 83 FEHMLELGIIRT--SSSHWSSPLHMVPKKSKGDWRPCGDYRSLNNATIPDRYPIPHIHDF 140
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L + +DL +AY+H+P+ A++ + T +PFGL A Q F
Sbjct: 141 ASTLCHTNIFSKLDLVRAYYHIPVAPDDIPKTAITTPFGLFEFTRMPFGLRNAAQTFQRF 200
Query: 599 SNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V RG+ V YLDD L+ + P + L + +N+ K L
Sbjct: 201 MDEVL-----RGLPFVYAYLDDVLIASTSPTEHAAHLRAVFERLSTYSIRLNIDK-CLFG 254
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGYLSFA 713
L FLG H+D + + +LA +++ L R +G +++
Sbjct: 255 VTSLDFLG----HHIDSTG--------ISPLPNRILALESFPIPTTLTQLRRFIGIINYY 302
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAV-LPKLEWWLNALPLSSPI----FPR 768
+IP H I + + L LG + P+V + E A+ ++ +
Sbjct: 303 RRLIP----HCADILQPLTDL-LGCKEKSVTLPSVAIAAFERAKQAIAHATKLSFLDTHE 357
Query: 769 QVQHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSL 819
+ ++TDAS+ G+ + +F S Q + +E+ A++ A+
Sbjct: 358 STKLILTTDASNAAVGAVLHQVVNNASQPLAFFSQKMQAAQTRYSTFGRELLAIYLAIRH 417
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
LL+ +Q+D++ + + S + ++ I + D R + PG+
Sbjct: 418 FRHLLEGRSFTIQTDHKPLTYAFNAKPDRYSPREIRHLDYISQFTTDIR------YTPGS 471
Query: 880 YNSVADSLSR 889
N VAD+LSR
Sbjct: 472 DNVVADALSR 481
>gi|40786837|gb|AAR89929.1| polymerase protein [Duck hepatitis B virus]
Length = 787
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 96/225 (42%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 390 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 444
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P LA+S V P G+ +P + + S + R
Sbjct: 445 SLDLSQAFYHLPFNPASSSRLAISDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 504
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 505 FNVWTFTYMDDFLLCHPNARHLNAISHSVCSFLQELGVRINFDKTTPSPVNEIRFLGYQI 564
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D + + +D+ L +++ + + ++ + +G+L+F
Sbjct: 565 DQKY--LKIEDDRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 607
>gi|326681316|ref|XP_003201782.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Danio rerio]
Length = 1161
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 108/446 (24%), Positives = 188/446 (42%), Gaps = 58/446 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 405 LSQPETEAMKNYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 462
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 463 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 519
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 520 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYVDDILVYSNSLSEHIQHVRAVLKRLIKN 577
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 578 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 620
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F R S ++++ L NPA + L
Sbjct: 621 PETIRQLQRFLGFANFYWRFIRNFSSVAAPLTAMVKTSNARLK-WNPAAVRAFT-QLKTR 678
Query: 760 PLSSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE----QQNWHI 805
S+PI P Q F + DAS+ G G+ + L +SR+ ++N+ +
Sbjct: 679 FSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKLNSAERNYDV 738
Query: 806 NKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLL 863
+E+ A+ AL L+ + ++ +D++ + Y+R K L+ +F
Sbjct: 739 GNRELLAMKAALEEWRHWLEGAKHPFILITDHKN-LEYIR---SCKRLNPRQARWALFFT 794
Query: 864 SQDWRIHILAQFIPGAYNSVADSLSR 889
D+++ +IPG+ N AD+LSR
Sbjct: 795 RFDFQV----TYIPGSKNIKADALSR 816
>gi|38017495|gb|AAR08050.1| polyprotein [Duck hepatitis B virus]
Length = 788
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 99/220 (45%), Gaps = 11/220 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTDIRFLGYQIDEKY 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
M + E + L +++ + ++ + +G+L+F
Sbjct: 570 --MKIEESRWKELRTVIKKIKVGTWYDWKCIQRFVGHLNF 607
>gi|403172773|ref|XP_003889291.1| hypothetical protein PGTG_22037 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169991|gb|EHS64020.1| hypothetical protein PGTG_22037 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 507
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 147/329 (44%), Gaps = 40/329 (12%)
Query: 466 QHLATPVSSAMSLHIQEMLETGVLKRLDS--------------TTGFLSR--LFLVPKGN 509
+H TP + SL +++ +E + K L + T GF L V G+
Sbjct: 17 KHWFTPENHKSSLLVKDKIEESISKELKAKRMLGLFSHQQMKETFGFFRSNPLGAVVNGD 76
Query: 510 GGTRPVLNL---------KGLNQFLSPKKFSLI-NHFRIPS-FLQKGDYMISI---DLSQ 555
G RP+ +L + +N ++ F + F+I S F + D + D +
Sbjct: 77 GQIRPINDLSYPRNDPDIRSVNSYVDKSDFETTWDDFKIVSKFFAENDKKFDLALFDWEK 136
Query: 556 AYFHVPIKTTHQRFLAL-SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV 614
AY +P + ++L + ++G++L T + FG +F ++ ++++ V
Sbjct: 137 AYRQIPTRQDQWKYLLVHDFDGNLLIDTRITFGGVAGCGSFGRPADAWKLVMKNHFNLVN 196
Query: 615 VY--LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
V+ +DD L V + L + + VS LG + N++K S A +F+G +W+ HL
Sbjct: 197 VFRWVDDNLFVKEVDENLSM--REIVSKSTELGVMTNIKKFSDFTAE-QKFIGFVWNGHL 253
Query: 673 DRMWLPEDK-QLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQ- 730
+ LPE K + L I + ++ + A L+G L+ S+++P R H + +
Sbjct: 254 KTVKLPEGKIEQRLAQIHPFQVKKAMFDYEEAEVLVGRLNHVSYILPHMRCHLCSLYKWL 313
Query: 731 ASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
S + A TP++ VL L+ W+N L
Sbjct: 314 KSWIWRKAKRATPVD--VLEDLDVWVNTL 340
>gi|308458391|ref|XP_003091538.1| hypothetical protein CRE_19532 [Caenorhabditis remanei]
gi|308256589|gb|EFP00542.1| hypothetical protein CRE_19532 [Caenorhabditis remanei]
Length = 1398
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 106/480 (22%), Positives = 201/480 (41%), Gaps = 61/480 (12%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
A P K VP SL+ A+S I + TGVLK LD + + + + V K N
Sbjct: 491 AQPVFRKSRPVPYASLE--------ALSNEIDRLEATGVLKSLDHS-DWAAPVVAVTKKN 541
Query: 510 GGTRPVLNL-KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
G R + GLN + + L I + L G + IDL+ AY + + ++
Sbjct: 542 GSIRLCSDFSTGLNDAIEAHQHPLPTADDIFAKLNGGKFFSQIDLADAYLQIEVDDDSKK 601
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
L ++ + +L LPFG+ AP F + + + + L V YLDD ++
Sbjct: 602 LLVINTHKGLLHYNRLPFGVKAAPGIFQQVMDTMLAGLDG----VSCYLDDIIVTGCSIE 657
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI 688
+ K + + S G+ + L+K S P +QFLG + + + P+ +++
Sbjct: 658 EHNQRVKKVIERIASFGFRMRLEKCSFL-MPEIQFLGFVINEQGRK---PDPQKIA---D 710
Query: 689 LRTLLASKTWNLDSARSLLGYLSF-ASFVI-------PMGRLHSRRIQRQASLLRLGAPH 740
++ + A K N RS LG + F +FV P+ +L ++ ++ + +
Sbjct: 711 IKAMPAPK--NAIEVRSFLGLIQFYGTFVRDLHRLRPPLDKLTNKDVEFKWN-------- 760
Query: 741 LTPINPAVLPKLEWWLNALPLS--SPIFPRQVQHFISTDASDLGWGSQVDSSF------- 791
T A E + L L+ +P P ++ DAS G G+ + F
Sbjct: 761 -TECQHAFDQVKEMLQSDLLLTHYNPKLPI----IVAADASQYGIGATISHRFPDGKEKA 815
Query: 792 ---LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGT 848
+S ++ Q+N+ +KE F + A++ + +++D++ ++S + G
Sbjct: 816 IYHVSKALNKAQRNYSQIEKEAFGLVTAVTKFHKFVHGRRFTLRTDHKPLLSIFGEKKGV 875
Query: 849 KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFL 908
+ + +++ + ++ I ++I D+LSR + D R TE++ +
Sbjct: 876 -PIYTANRLQRWATILMNYNFSI--EYINTKNFGQVDALSR--LISDQMQQREETEEVVI 930
>gi|326674098|ref|XP_003200069.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Danio rerio]
Length = 1210
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 108/446 (24%), Positives = 187/446 (41%), Gaps = 58/446 (13%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 399 LSQPETEAMKSYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 456
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 457 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 513
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 514 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIQH 571
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 572 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 614
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F R S ++++ L NP + L
Sbjct: 615 PETIRQLQRFLGFANFYRRFIRNFSSVAAPLTAMVKANNARLK-CNPDAVRAFT-QLKTR 672
Query: 760 PLSSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE----QQNWHI 805
S+PI P Q F + DAS+ G G+ + L +SR+ ++N+ +
Sbjct: 673 FSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKLNSAERNYDV 732
Query: 806 NKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLL 863
+E+ A+ AL L+ + +V +D++ + Y+R K L+ +F
Sbjct: 733 GNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPRQARWALFFT 788
Query: 864 SQDWRIHILAQFIPGAYNSVADSLSR 889
D+++ +IPG+ N AD+LSR
Sbjct: 789 RFDFQV----TYIPGSKNIKADALSR 810
>gi|311036256|gb|ADP55744.1| polymerase [Duck hepatitis B virus]
Length = 787
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 100/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 390 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPLGMPRISLDL 448
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 449 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 508
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 509 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQIDQKY 568
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 569 --MKIEESRWSELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 607
>gi|40786843|gb|AAR89934.1| polymerase protein [Duck hepatitis B virus]
Length = 787
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 95/225 (42%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 390 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLTTLRRILPVGMPRI 444
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P LA+S V P G+ +P + + S + R
Sbjct: 445 SLDLSQAFYHLPFNPASSSRLAISDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 504
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 505 FNVWTFTYMDDFLLCHPNARHLNAISHSVCSFLQELGVRINFDKTTPSPVNEIRFLGYQI 564
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D M + E + L +++ + + ++ + +G+L+F
Sbjct: 565 DQRY--MKIEESRWKELRTVIKKIKIGEWYDWKCIQRFVGHLNFV 607
>gi|410026811|gb|AFV52546.1| DNA polymerase [Duck hepatitis B virus]
Length = 788
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 99/220 (45%), Gaps = 11/220 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASLLRSRGMR 612
SQA++H+P+ LA+S V P G+ +P + + A + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGAEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTDIRFLGYQIDEKY 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
M + E + L +++ + ++ + +G+L+F
Sbjct: 570 --MKIEESRWKELRTVIKKIKVGTWYDWKCIQRFVGHLNF 607
>gi|406700077|gb|EKD03262.1| retrotransposon nucleocapsid protein [Trichosporon asahii var.
asahii CBS 8904]
Length = 1175
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 98/453 (21%), Positives = 183/453 (40%), Gaps = 60/453 (13%)
Query: 458 PLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLN 517
PL P+ + ++ + ++ EM E G + S G S + V K +G R ++
Sbjct: 241 PLYPISEKE------AAELRAYLAEMQEKGFIVPSSSPAG--SPILFVKKKDGSLRLCVD 292
Query: 518 LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD 577
+ LN ++ L + L++ IDL AY + I + A
Sbjct: 293 YRKLNAVTVKNRYPLPLIGDLLDQLRQAKVYSKIDLRGAYHLLRIAEGDEWKTAFRTKYG 352
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLA 637
+PFGL AP +F L N + + + V+VYLDD L+ +Q+ E + +
Sbjct: 353 AFEYKVMPFGLTNAPASFQHLMNHIFRDMLD--ISVIVYLDDILIFSQN----ETEHRGH 406
Query: 638 VSILGSLGWIVNLQKSSLSPAPV--------LQFLGIMWDPHLDRMWLPEDKQLTLGNIL 689
V + + L++++LS P ++FLG + P+ ++ G +
Sbjct: 407 VREV-----LRRLRENNLSAKPEKCEFHTDRVEFLGF--------IVTPDGIEMDPGKVA 453
Query: 690 RTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAV 748
+ NL +S LG+++F I + +R + L R P TP A
Sbjct: 454 GVVSWPTPTNLKELQSFLGFINFYRRFIHQFSMVARPLH---ELTRKEVPFEWTPARAAA 510
Query: 749 LPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDS----------SFLSGLWSR 798
+L+ + + P+ P + + TDASD G+ + +FLS +
Sbjct: 511 FDELKTRITSAPILRHFNPDH-ETMVETDASDYALGAVLSQRGPGEEWRPVAFLSRGMTP 569
Query: 799 EQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQ--SDNQTVVSYLRRQGGTKSLSLLSE 856
+ N+ ++ KE A+ + + L+ V V+ +D++++ +L + + + +E
Sbjct: 570 PELNYPVHDKEFLAIFWSFNEWRHYLEGCNVRVEVLTDHRSLEYFLTTKQLNRRQARWAE 629
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
F H + PGA + D+LSR
Sbjct: 630 FMADF--------HFQISYRPGAKATKPDALSR 654
>gi|341901036|gb|EGT56971.1| hypothetical protein CAEBREN_28621 [Caenorhabditis brenneri]
Length = 2667
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 100/432 (23%), Positives = 185/432 (42%), Gaps = 52/432 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I MLE G+++ +ST+ + S L +VPK NG TR V++ + LN + + + N +
Sbjct: 1754 IGHMLENGLIE--ESTSPYTSPLLMVPKANGDTRIVIDYRRLNLITRSRTYIMPNTLDVT 1811
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+G D++Q + V + H+ A + V +P GL AP F
Sbjct: 1812 EEASRGKIFSVFDIAQGFHTVRMHEAHKERTAFCCHMGVYQYRYMPMGLKGAPDTF---Q 1868
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVN--QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+A + + +++Y+DD ++V+ +D I +++ + I S+G + +KS +
Sbjct: 1869 RAMAEVEKKFSGTMILYVDDLIVVSKTEDQHIRDLEEFFKLMI--SMGLKLKAEKSQIGR 1926
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL-----SF 712
++ FLG + + + + + + + RT+ K++ + GY ++
Sbjct: 1927 TRIV-FLGFIIENNTIQPNGEKTEAIRKFPTPRTVTEVKSF-----LGMAGYFRRFIKNY 1980
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF--PRQV 770
A V P+ L + ++ + + A I A+ +S PI PR
Sbjct: 1981 AITVKPLTTLTQKDVEFKWTEEEEKA--FEEIKTAL------------ISPPILTTPRMD 2026
Query: 771 QHF-ISTDASDLGWGS-----QVDSSFLSGLWSR----EQQNWHINKKEMFAVHQALSLN 820
F + TDAS +G + Q + SR +Q + + E A+ L
Sbjct: 2027 GDFEMHTDASKVGLAAVLLQEQEKELKVVAYASRPTTPVEQRYVAIESEALAITWGLQHF 2086
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAY 880
P + V V +D+Q + S L R+ S LL I Q + + I+ + PG
Sbjct: 2087 RPYIFGKKVKVVTDHQPLKSLLHRKDKEMSGRLLRHQAII----QMYDVEIV--YRPGKL 2140
Query: 881 NSVADSLSRSKS 892
N +AD+LSR ++
Sbjct: 2141 NPLADALSRQRT 2152
>gi|390367672|ref|XP_003731307.1| PREDICTED: uncharacterized protein K02A2.6-like [Strongylocentrotus
purpuratus]
Length = 1077
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 101/447 (22%), Positives = 190/447 (42%), Gaps = 53/447 (11%)
Query: 464 SLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQ 523
SL+ L V +S + + G+++R+DS+ ++S L + + NG R ++L+ +N+
Sbjct: 195 SLRRLPLAVRDEVSKELHRLESDGIIERIDSSP-WVSNLVIARRKNGDLRLCVDLRAVNK 253
Query: 524 FLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTC 583
+ P K+ L + + +D+ ++Y VP+ + A + + +
Sbjct: 254 AIIPDKYPLPTMNELSASFHGAKVFSKLDMRRSYLQVPLAEQSRHLTAFNTHIGMFQYRR 313
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAV 638
+ +GL +AP AF + + V + + + LDD ++ +D R+ E+ +LA
Sbjct: 314 MTYGLNSAPSAFQKIVSSVLAGIEG----TLNLLDDVVVFGEDKAQHDQRLAEVMTRLAK 369
Query: 639 SILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI--LRTLLASK 696
L +N K + + A + FLG H+ L TL N+ +RTL A
Sbjct: 370 HNL-----TLNEAKCTFA-ASDIDFLGY----HVTADGLTP----TLDNVAAIRTLPAPT 415
Query: 697 TWNLDSARSLLGYLSFASFVIPMGRLHSRRIQ---RQASLLRLGAPHLTPINPAVLPKLE 753
N+ S LG +F +P + +Q R+ +L T N L+
Sbjct: 416 --NVKELASFLGTTNFYRKFVPQYAEIAEPLQKLLRKDALWEWHNAQETAFN-----TLK 468
Query: 754 WWLNALPLSSPIFPRQVQHFISTDASDLGWGS----QVDSS-----FLSGLWSREQQNWH 804
+ P+ + P + +++TDAS G+ +D S F S S ++ +
Sbjct: 469 GRIAEPPVLAHFTP-SAETYVTTDASAFAIGAVLSQTIDGSVRPVAFASRALSDTERKYS 527
Query: 805 INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG-GTKSLSLLSEVEKIFLL 863
++E A A L +++D+Q + + L G G + L + ++ L
Sbjct: 528 TGEREALACIYACEHWHMYLYGRKFTLRTDHQALTTLLSTSGSGHRPLRIYRWSDR--LH 585
Query: 864 SQDWRIHILAQFIPGAYNSVADSLSRS 890
D+++ LA G+ N VAD LSR+
Sbjct: 586 QYDFKVEYLA----GSRNRVADMLSRT 608
>gi|341891780|gb|EGT47715.1| hypothetical protein CAEBREN_29963 [Caenorhabditis brenneri]
Length = 1297
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 120/495 (24%), Positives = 200/495 (40%), Gaps = 66/495 (13%)
Query: 426 LRRFVDAWIR----LGAPAPL-VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHI 480
L F DA+ R LG+ V I + +P A+P VP+ + + HI
Sbjct: 294 LNEFPDAFSRNSYDLGSSKTEPVHIYTNTEVPVKARPYRVPV--------KYQAELEKHI 345
Query: 481 QEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPS 540
+L +G + +S T +LS + LV K NG R L+ + LN+ P F L RI +
Sbjct: 346 NSLLRSGRIT--ESNTPWLSPIVLVKKKNGSLRVCLDFRKLNEATIPDNFPLP---RIDA 400
Query: 541 FLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
L++ +Y S+D++ Y + + V A T LPFGL +A F
Sbjct: 401 ILERVGGSNYFSSLDMANGYLQLRLDPASSYKCGFITESKVYAYTHLPFGLKSAASYFQR 460
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQD-PRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
V L V+VY+DD L+ ++ P + K+ ++ + +K +
Sbjct: 461 ALRTVLGGLED---EVLVYIDDILIFSKTFPEHISSIRKV-LTRFRDFNLKASPKKCEFA 516
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ FLG ++R DK N+ + + N++ R +G F F
Sbjct: 517 -KDFITFLG----HEINRDNYSPDK----ANVAKIVEFPVPSNINEIRRFVGMAGFFRKF 567
Query: 716 VIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF 773
+ + R+ R+ A A KL L + P+ FP + F
Sbjct: 568 IQNFSEMAEPLTRLTRKEQKFVWNAEQ-----QAAFEKLRDSLASEPILG--FPDYDKPF 620
Query: 774 -ISTDASDLGWG-----SQVDS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLN 820
I DAS + G S+ DS ++ S S + W + EM A+ AL
Sbjct: 621 HIFCDASAVAQGAALMQSRPDSEKDFYAIAYASRTLSDPETRWPAIQVEMGAIIFALRQF 680
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAY 880
P + S +++ SD++ + L++ +L+ + + Q + I I+ I G
Sbjct: 681 RPYICLSKIILHSDHKPLTFLLQKSKTHDNLA------RWLIELQCYDITIV--HIDGKK 732
Query: 881 NSVADSLSRSKSLPD 895
N+VAD LSR++ D
Sbjct: 733 NTVADCLSRARENED 747
>gi|9625571|ref|NP_039821.1| hypothetical protein DHBVgp3 [Duck hepatitis B virus]
gi|59061|emb|CAA42768.1| unnamed protein product [Duck hepatitis B virus]
Length = 838
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 100/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 441 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 499
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 500 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 559
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 560 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVNDIRFLGYQIDQKF 619
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + + L +++ + ++ + +G+L+F
Sbjct: 620 --MKIEESRWIELRTVIKKIKIGAWYDWKCIQRFVGHLNFV 658
>gi|326677826|ref|XP_003200922.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1228
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 191/460 (41%), Gaps = 77/460 (16%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
PL + L+ P + AM +I E LE ST+ + F V K +G RP ++ +G
Sbjct: 413 PLGRIFPLSQPETEAMKNYISEELEP-------STSPASACFFFVKKKDGSLRPCIDYRG 465
Query: 521 LNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD 577
LN+ ++ L +P+ L++ Y +DL AY + I+ + S
Sbjct: 466 LNEITVKYRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIQQGDEWKTGFSTIDG 522
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGK 635
+PFGLA P F + N V + ++ V+VY+DD L+ + I ++
Sbjct: 523 HYEYLVMPFGLANNPSVFQAFVNEVFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAV 580
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
L I L + K + FLG + P M D+Q + +
Sbjct: 581 LKRLIQNQL--YAKISKCEFHQT-CISFLGYIISPEGVAM----DQQ--------KVDSV 625
Query: 696 KTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW 754
W ++ R L +L FA+F RR R S + AP LT + A +L+W
Sbjct: 626 TQWPQPETIRQLQRFLGFANFY--------RRFIRNFS--SVAAP-LTAMVKANNARLKW 674
Query: 755 WLNALPL---------SSPIF--PRQVQHFI-STDASDLGWGSQVDSSFL-------SGL 795
+A+ S+PI P Q FI DAS+ G G+ + L
Sbjct: 675 NPDAIRAFTQLKTRFSSAPILRHPDPKQPFIVEIDASNTGIGAILSQRSLVTKKLHPCAF 734
Query: 796 WSRE----QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTK 849
+SR+ ++N+ + +E+ A+ AL L+ + V +D++ + Y+R K
Sbjct: 735 YSRKLNSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFTVITDHKN-LEYIR---SCK 790
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L+ +F D+++ +IPG+ N AD+LSR
Sbjct: 791 RLNPRQARWALFFTRFDFQV----TYIPGSKNIKADALSR 826
>gi|341900902|gb|EGT56837.1| hypothetical protein CAEBREN_06252 [Caenorhabditis brenneri]
Length = 1417
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 106/490 (21%), Positives = 198/490 (40%), Gaps = 72/490 (14%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNL 518
+P + + + I+EML+ +++R DS F + + LV K + + R ++
Sbjct: 552 IPQGKIYRIPLEKRKEVEKQIEEMLKQRIIQRSDSP--FCAPIVLVRKADQKSWRFTVDF 609
Query: 519 KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDV 578
+ LN +P + + N I + + S+D Q + +P++ H A + +
Sbjct: 610 RALNAVTTPVQSVIPNIHEILDLCAEQAFYTSLDFQQGFHQIPVEPAHCARTAFACHLGT 669
Query: 579 LAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD-----PRILEIQ 633
+P GL +P F + N L++ RV VY+DD ++ + D I E+
Sbjct: 670 FEYLRMPMGLKGSPGTFQRVMN---DLIKDMKARVFVYIDDMIITSPDATQHLKDIHEVL 726
Query: 634 GKLAVSILGSLGWIVNLQKS--SLSPAPVLQFL----GIMWDPHLDRMWLPEDKQLTLGN 687
K+ + +G + +KS +LS L F+ GI+ DP + + T+
Sbjct: 727 TKIEI-----IGMKLKAEKSQFALSQLRFLGFIVSDAGILTDPEKTKAVDDYPQPRTVKE 781
Query: 688 ILRTLLASKTWNLDSARSLLGYLSF-ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
+ R+ +G SF F+ ++ A +L L I
Sbjct: 782 V---------------RAFIGLASFYRRFIENFSKI-------AAPILELTKKDKEFIWS 819
Query: 747 AVLPKLEWWLNALPLSSPIF--PRQVQHF-ISTDASDLGWGS------QVDS------SF 791
+ L + +PI P+ + F I D+S G G+ VD +F
Sbjct: 820 DECEQAFKQLKSAITKNPILVAPKLGRPFTIEVDSSGKGVGAVLLQAQDVDGKDRRVIAF 879
Query: 792 LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSL 851
S +++ ++N+ + E + A+ P + + + +D+ + + L R+
Sbjct: 880 ASRVYTGAEKNYPAIELEALGLTYAVQQFRPYIDGAKTEIITDHAPLKALLHRK------ 933
Query: 852 SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR---SKSLPDWHLSRSATEQ-IF 907
L+ + K ++ Q++ I I + PG N V D+LSR SK + + + EQ IF
Sbjct: 934 DLVGRLAKYQIILQEYDITI--SYRPGKQNVVCDTLSRYHPSKKMTEEKMDTKPEEQSIF 991
Query: 908 LKWGVPCIDL 917
P IDL
Sbjct: 992 ALSPSPLIDL 1001
>gi|3218461|emb|CAA06987.1| polymerase [Duck hepatitis B virus]
Length = 788
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 99/221 (44%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
RLFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 RLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASNSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGVRINFDKTTPSPVNDIRFLGYQIDQKF 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + ++ + +G+L+F
Sbjct: 570 --MKIEESRWKELRTVIKKIKIGAWYDWKCIQRFVGHLNFV 608
>gi|228423|prf||1803562C pol protein
Length = 837
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 100/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 440 KLFLVDKNSRNTSEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 498
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS-LLRSRGMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 499 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 558
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 559 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQIDQRF 618
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + + ++ + +G+L+F
Sbjct: 619 --MKIEESRWKELRTVIKKIKIGEWYDWKCIQRFVGHLNFV 657
>gi|44829149|gb|AAS47828.1| polyprotein [Duck hepatitis B virus]
Length = 788
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 101/221 (45%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLVDKNSRNTEEARLVVDFSQFSKGKNAMRFPRDWS-PNLSTLRRILPLGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P L+ + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLLTTALGSEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVNEIRFLGYQIDQKY 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ + E + L +++ + + ++ + +G+L+F
Sbjct: 570 TK--IEESRWSELRTVIKKIKIGEWYDWKCIQRFVGHLNFV 608
>gi|382948090|gb|AFG33160.1| DNA polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 86/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTTEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGXXVYYFRKAPMGVGLSPFXLHXFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D H M + E + L +++
Sbjct: 272 DEHF--MKIEESRWKELKTVIK 291
>gi|343475375|emb|CCD13217.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 507
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 101/201 (50%), Gaps = 12/201 (5%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ ++ I+++ + GV+ + +T+ F S ++ V K +G R ++ + LNQ ++P ++
Sbjct: 31 AEITATIKDLKDAGVV--VPTTSPFNSPIWPVQKTDGSWRMTVDYRKLNQVVTPIAAAVP 88
Query: 534 NHFRIPSFLQK-----GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGL 588
+ + S L++ G + +IDL+ A+F VP+ HQ+ A S+ G T LP G
Sbjct: 89 D---VVSLLEQINTSPGTWYAAIDLANAFFSVPVHKDHQKQFAFSWQGQQYTFTVLPQGY 145
Query: 589 ATAPQAFASLSNW-VASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
+P +L + L + + +V Y+DD +L+ + + V+ + GW
Sbjct: 146 INSPALCHNLVRRDLDRLDLPQNITLVHYIDDIMLIGPSEQEVATTLDSLVTHMRIRGWE 205
Query: 648 VNLQKSSLSPAPVLQFLGIMW 668
+N K P+ ++FLG+ W
Sbjct: 206 INPTKIQ-GPSTSVKFLGVQW 225
>gi|410492647|ref|YP_006907834.1| polyprotein [Horseradish latent virus]
gi|57790326|gb|AAW56089.1| polyprotein [Horseradish latent virus]
Length = 679
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 103/436 (23%), Positives = 174/436 (39%), Gaps = 49/436 (11%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLV----PKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
IQE+L+ V++ S + ++ FLV K G R V+N K +N ++L N
Sbjct: 265 QIQELLDLKVIR--PSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNDATVGDAYNLPN 322
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + ++ S D ++ V + + A + +PFG+ AP
Sbjct: 323 KDELLTLIRGKKIFSSFDCKSGFWQVRLDEESKSLTAFTCPQGHYEWNVVPFGMKQAPSI 382
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + + R +Y+DD L+ + + + ++ + + G I++ +K+
Sbjct: 383 FQRHMDEAFKVFRK---FCCIYVDDILVFSDNEQNHQLHVAMILQKCYQHGIILSKKKAQ 439
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
L + FLG+ D R P+ L ++ SK + LG L++AS
Sbjct: 440 LFKERI-NFLGLEIDQGTHR---PQSHILEHIQKFPDIIESKL----QLQRFLGVLTYAS 491
Query: 715 FVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQVQHF 773
IP +L R QA L TP + + K++ LN PL P+ ++
Sbjct: 492 DYIP--KLAQIRKPLQAKLKENVQWRWTPEDTLYMKKVKKNLNGFPPLHHPLPEEKL--I 547
Query: 774 ISTDASDLGWGS-----QVDSS----------FLSGLWSREQQNWHINKKEMFAVHQALS 818
I TDASD WG +D S + SG + +QN+H N KE AV + +
Sbjct: 548 IETDASDNYWGGILKAIHIDLSTNESIELVCRYASGSFKPAEQNYHSNDKETLAVIRTIQ 607
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYL--RRQGGTKSLSLLSEVEKIFLLSQDWRIHIL--AQ 874
L +V++DN +L +G +K + Q W + +
Sbjct: 608 KFSIYLTPVRFLVRTDNTHFKYFLNINYKGDSKMGRNIR--------WQGWLQNYVFDVD 659
Query: 875 FIPGAYNSVADSLSRS 890
I G N +AD LSR
Sbjct: 660 HIKGTNNCLADFLSRE 675
>gi|45775514|gb|AAS77351.1| polyprotein [Duck hepatitis B virus]
Length = 788
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 98/221 (44%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVNDIRFLGYQIDEKY 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + ++ + +G+L F
Sbjct: 570 --MKIEESRWKELRTVIKKIKVGTWYDWKCIQRFVGHLDFV 608
>gi|301622164|ref|XP_002940410.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 977
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 88/205 (42%), Gaps = 5/205 (2%)
Query: 462 LCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGL 521
L S L P + AM +I E LE G ++ +S G + F V K +GG RP ++ +GL
Sbjct: 5 LTSFLALLPPEAQAMREYISENLERGFIRPSNSPAG--AGFFFVGKKDGGLRPCIDYRGL 62
Query: 522 NQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
N+ ++ L + ++ + +DL AY + IK + A +
Sbjct: 63 NKITIKNRYPLPLISELFDRVKGANIYTKLDLRGAYNLIRIKEGDEWKTAFNTRDGHYEY 122
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL 641
+PFGL AP F N + L G+ VVVYLDD L+ + + K + L
Sbjct: 123 LVMPFGLCNAPAVFQEFVNDIFRDL--LGVFVVVYLDDILIFSSNLSDHRSHVKEVLRRL 180
Query: 642 GSLGWIVNLQKSSLSPAPVLQFLGI 666
L+K + V QFLG
Sbjct: 181 RENNLYAKLEKCTFEVNSV-QFLGF 204
>gi|301609828|ref|XP_002934465.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1160
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 107/470 (22%), Positives = 190/470 (40%), Gaps = 66/470 (14%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V ++ G AIPF PL + P + + +I+E L+ G ++ S G + +
Sbjct: 233 VDLLPGAAIPFGRIYPL---------SEPELTVLKDYIEENLKKGFIRPSTSPAG--AGI 281
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF------LQKGDYMISIDLSQA 556
F V K + RP ++ + LN K ++ N + +P L+ +DL A
Sbjct: 282 FFVEKKDHSLRPCIDYRDLN------KITIKNRYPLPLIPELFLRLRSARVFTKLDLRGA 335
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVV 615
Y V I+ + +A +PFGL AP A+ ++V + R + V+V
Sbjct: 336 YNLVRIRQGDEWKMAFRTRYGHFEYLVMPFGLCNAP---ATFQHFVNDIFRDFLDLFVIV 392
Query: 616 YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRM 675
YLDD L+ + K S L + L+K ++FLG + P
Sbjct: 393 YLDDILIFSSSLEEHRRHVKQVFSRLRAHKLFAKLEKCEFERL-TIEFLGFIISP----- 446
Query: 676 WLPEDKQLTLGNILRTLLASKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLL 734
+ +++ + R + A W +S +++ ++ FA+F + S+ I +L
Sbjct: 447 -----EGMSMDS--RKVSAVLDWPTPNSRKAVQRFVGFANFYRKFIKNFSKIISPITALT 499
Query: 735 -RLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-STDASDLGWGSQVDS--- 789
L TP L+ + P+ P + F+ DAS+ G+ +
Sbjct: 500 SSLKKFCWTPEAQQAFSDLKSRFTSAPILK--HPDPTRPFVLEVDASEYAIGAVLSQRND 557
Query: 790 --------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVV 839
+F S S +QN+ + +E+ A+ A LL+ + ++V SD++ +
Sbjct: 558 VQSLLHPIAFFSKKLSSSEQNYDVGDRELLAIKSAFQEWRHLLEGAAHPILVFSDHKN-L 616
Query: 840 SYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
YLR + L + L + H+ F PG+ N AD+LSR
Sbjct: 617 EYLR-----SAKRLRPRQARWALFFSRFNFHV--TFRPGSKNGKADALSR 659
>gi|49035813|gb|AAT48677.1| pol protein [Oikopleura dioica]
Length = 1316
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 83/368 (22%), Positives = 147/368 (39%), Gaps = 57/368 (15%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+Q+M + G++++ S + F + L LV K +GG R ++++ LN L+ K+ L +
Sbjct: 453 ELQKMEDAGIIEKA-SGSSFNAPLQLVRKSSGGYRICVDMRSLNNRLAESKWPLPSLAET 511
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
L + +D+ QA+FH+ + + A S LP GL +P +
Sbjct: 512 LESLAGTAFFSCVDIRQAFFHMALTDESKHLTAFSALNCQYQFRRLPMGLKISPSVYQMA 571
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN-LQKSSLSP 657
+L G + VVYLDD L + G+ L +L +++ L+K+
Sbjct: 572 MK--ETLGNDLGNKAVVYLDDVL----------VTGRTEDEHLEALDVVLDRLRKAGFLL 619
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
P LG+ L E N+ K N + R +G +F S ++
Sbjct: 620 NPDKCILGVKKTTFLGHEVTTEGYYPKTDNLAAIREFPKPTNKKALRRFIGMTAFYSTLV 679
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW-WLNA------------LPLSSP 764
P ++Q + L P++ K ++ W + L +
Sbjct: 680 P-------KLQYK----------LAPLHAISGSKADFDWTDEQEKAFDEVKTALLAKTGL 722
Query: 765 IFPRQV---QHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMF 811
FP ++ + ++TDASD G+G + F SG + W IN+KE+F
Sbjct: 723 AFPSRLPTAKLIVTTDASDTGYGGMLSQKIGDDPEQPLGFTSGFFRGPSTRWAINEKELF 782
Query: 812 AVHQALSL 819
A + L +
Sbjct: 783 AFIKTLEV 790
>gi|37549281|gb|AAQ93079.1| P protein [Duck hepatitis B virus]
Length = 788
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 96/225 (42%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 391 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPLGMPRI 445
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 446 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARR 505
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L + L LG +N K++ SP ++FLG
Sbjct: 506 FNVWTFTYMDDFLLCHPNSRHLNSISHAVCTFLQELGIRINFDKTTPSPVNEIRFLGYQI 565
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D M + E + L +++ + + ++ + +G+L+F
Sbjct: 566 DQKY--MKIEESRWSELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 608
>gi|449465222|ref|XP_004150327.1| PREDICTED: uncharacterized protein LOC101216833 [Cucumis sativus]
Length = 2712
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 105/439 (23%), Positives = 171/439 (38%), Gaps = 66/439 (15%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ M ++EML +GV++ S + + S + LV K +G R ++ + LN P KF +
Sbjct: 1552 AEMERLVEEMLSSGVIR--PSNSPYSSPVLLVRKKDGSWRFCVDYRVLNSVTIPDKFPIP 1609
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ L + IDL Y + + + A + +PFGL AP
Sbjct: 1610 VIEELFDELNGARWFSKIDLKAGYHQIRMASGDIEKTAFRTHEGHYEFLVMPFGLTNAPS 1669
Query: 594 AFASLSNWVAS-LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
F SL N V LR ++V+ DD L+ +++ + LA+ IL N +K
Sbjct: 1670 TFQSLMNTVFKPYLRK---FILVFFDDILIYSKNLEVHLTHLGLALEILRRNELYANRKK 1726
Query: 653 SSLSP------APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDS 702
S + ++ G+ DP R A K W N+
Sbjct: 1727 CSFAQERVDYLGHIISAQGVEVDPEKIR-------------------AIKEWPTPTNIRE 1767
Query: 703 ARSLLGYLSFA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP- 760
R LG + FV G + + Q L++ G + T + +L+ + LP
Sbjct: 1768 VRGFLGLTGYYRKFVQHYGSMAAPLTQ----LVKKGGFNWTDDSEEAFQRLQQAMMTLPV 1823
Query: 761 LSSPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQ 815
L+ P F + + TDAS G G+ + ++ S + + + ++E+ AV
Sbjct: 1824 LALPDFSSTFE--LETDASGYGIGAVLMQAKKPIAYFSHTLAVRDRVKPVYERELMAVVM 1881
Query: 816 ALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILA-- 873
A+ P L +V++D KSL L E I Q W +L
Sbjct: 1882 AVQRWRPYLLGKPFIVRTDQ-------------KSLKFLLEQRVIQPQYQKWVAKLLGYS 1928
Query: 874 ---QFIPGAYNSVADSLSR 889
Q+ PG N AD+LSR
Sbjct: 1929 FEVQYKPGLENKAADALSR 1947
>gi|124359710|gb|ABN06064.1| RNA-directed DNA polymerase (Reverse transcriptase); Chromo; Zinc
finger, CCHC-type; Peptidase aspartic, active site;
Polynucleotidyl transferase, Ribonuclease H fold
[Medicago truncatula]
Length = 1297
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 86/186 (46%), Gaps = 5/186 (2%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
++EML+ G+++ ST+ F S + LV + + R ++ + LN+ P KF + +
Sbjct: 402 VREMLQAGIIRH--STSSFSSPVILVKEKDNSWRMCIDYRALNKATVPDKFPIPVIEELL 459
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L + +DL Y V +K A + D +PFGL AP F SL
Sbjct: 460 DELHGARFYSKLDLKSGYHQVRVKEEDIHKTAFRTHEDHYEYLVMPFGLMNAPSTFQSLM 519
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N V LL + V+V+ DD L+ +QD + + + I+ + G + N +K
Sbjct: 520 NDVFRLLLRK--FVLVFFDDILVYSQDWKTHMEHVEEVLRIMQTHGLVANKKKCYFGQET 577
Query: 660 VLQFLG 665
V ++LG
Sbjct: 578 V-EYLG 582
>gi|21450048|ref|NP_659397.1| hypothetical protein [Mirabilis mosaic virus]
gi|21427196|gb|AAM53128.1| ORFV [Mirabilis mosaic virus]
Length = 674
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 102/438 (23%), Positives = 180/438 (41%), Gaps = 47/438 (10%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG----NGGTRPVLNLKGLNQFLSPKK 529
+ I+E+L+ V+ + S + +S FLV K G R V+N K LN+
Sbjct: 253 KEFEIQIKELLDLKVI--IPSKSQHMSPAFLVEKEAEKRRGKKRMVVNYKKLNEVTIGDS 310
Query: 530 FSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
+L N + + L+ + S D ++ V + Q+ A + +PFGL
Sbjct: 311 HNLPNMQELITLLRGKNIFSSFDCKSGFWQVLLDDESQKLTAFTCPQGHYQWRVVPFGLK 370
Query: 590 TAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN 649
AP F + LR +VY+DD ++ + + + + + SLG I++
Sbjct: 371 QAPSIF---QRHMQDALRGLEEFSLVYVDDIIVFSDNKNDHQDHVMKVLRRIESLGIILS 427
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDR-MWLPEDKQLTLGNILRTLLASKTWNLDSARSLLG 708
+K++L + FLG+ +DR P++ L + + K + LG
Sbjct: 428 KKKANLFKEKI-NFLGL----EIDRGTHTPQNHILDHIHTFPDRIEDKK----QLQRFLG 478
Query: 709 YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFP 767
L++A IP +L +R Q L + T + + K++ L P L P
Sbjct: 479 VLTYADSYIP--KLAEKRKPLQVKLKKDQVWIWTQSDTDYVKKIKKGLINFPKLYLP--K 534
Query: 768 RQVQHFISTDASDLGW----------GSQVDSSFLSGLWSREQQNWHINKKEMFAVHQAL 817
++ I TDASD W G ++ + SG + + N+H N+KE+ AV Q +
Sbjct: 535 KEDSLIIETDASDHFWGGVLKAQTTEGEELICRYSSGTFKPAELNYHSNEKELLAVKQVI 594
Query: 818 SLNLPLLQSSVVMVQSDNQTVVS-YLRRQ--GGTKSLSLLSEVEKIFLLSQDWRIHIL-- 872
+ L V++DN ++ ++ ++ G +K L+ Q W H
Sbjct: 595 TKFSIYLTPVCFTVRTDNVNLLKGFMNKKITGDSKQGRLIR--------WQMWFSHYTFK 646
Query: 873 AQFIPGAYNSVADSLSRS 890
+ G N +AD L+R
Sbjct: 647 VDHLKGEQNVLADYLTRE 664
>gi|291235377|ref|XP_002737621.1| PREDICTED: polyprotein-like, partial [Saccoglossus kowalevskii]
Length = 306
Score = 70.5 bits (171), Expect = 6e-09, Method: Composition-based stats.
Identities = 69/266 (25%), Positives = 116/266 (43%), Gaps = 17/266 (6%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN--- 600
KG + D++ A+ +PI + + +N L FG + P+ F LS
Sbjct: 13 KGAKLCKTDIADAFKLMPIHASLWHLYGIHWNEQFYFFVRLAFGCRSRPKFFDQLSTAVC 72
Query: 601 WVASLLRSRGMRVVVYL-DDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
W+A + G++ + +L DDFL VN E + + LG + QK+ + P
Sbjct: 73 WIAE--HNYGVQTIFHLLDDFLTVNSPSYDAERTMAILTLLFHRLGIPLAKQKT-VGPCC 129
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
L++LGI D + + LPEDK + +L+T L +T SLLG+L+FA VI
Sbjct: 130 KLEYLGIELDTNHMQARLPEDKLARIRELLQTFLNRRTCTKREILSLLGHLNFACRVIVP 189
Query: 720 GRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW------W--LNALPLSSPIFPRQVQ 771
GR R+ + ++ H+T N + W W ++ ++ ++
Sbjct: 190 GRTFISRLITLSKGVKKLHHHVTITNESRQDLHMWNILLSDWNGISMFLYTNTTSTSMLE 249
Query: 772 HFISTDASDLGWGSQVDSSFLSGLWS 797
F TDAS +G+G + W+
Sbjct: 250 LF--TDASGIGFGGYFQGHWFQDRWT 273
>gi|301618701|ref|XP_002938751.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1439
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 105/470 (22%), Positives = 189/470 (40%), Gaps = 66/470 (14%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
+ + G AIPF PL + P + + +I+E L+ G + S G + +
Sbjct: 503 IDLFPGAAIPFGRIYPL---------SEPELTVLKDYIEENLQKGFIHPSTSPAG--AGI 551
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF------LQKGDYMISIDLSQA 556
F V K + RP ++ + LN K ++ N + +P L+ +DL A
Sbjct: 552 FFVEKKDQSLRPCIDYRELN------KVTIKNRYPLPLISELFLRLRSARVFTKLDLRGA 605
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVV 615
Y V I+ + A +PFGL AP A+ ++V + R + V+V
Sbjct: 606 YNLVRIRQGDEWKTAFRTRYGHFEYLVMPFGLCNAP---ATFQHFVNDIFRDFLDLFVIV 662
Query: 616 YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRM 675
YLDD L+ + + K S L + L+K + ++FLG + P
Sbjct: 663 YLDDILIFSSSLEEHRLHVKQVFSRLRTHKLFAKLEKCEFEKS-TIEFLGFVISP----- 716
Query: 676 WLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLL 734
+ +++ + R + A W + ++ +++ ++ FA+F + S+ I +L
Sbjct: 717 -----EGMSMDS--RKVSAVLDWPVPNNRKAVQRFVGFANFYRKFIKDFSKTIAPITALT 769
Query: 735 -RLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-STDASDLGWGSQVDS--- 789
L TP L+ + P+ P + F+ DAS+ G+ +
Sbjct: 770 SSLKKFCWTPEAQQAFSDLKNRFTSAPILK--HPDPTRPFVLEVDASEYAIGAVLSQRNE 827
Query: 790 --------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVV 839
+F S S +QN+ + +E+ A+ A LL+ + ++V SD++ +
Sbjct: 828 VQGLLHPVAFFSKKLSSSEQNYDVGDRELLAIKSAFQEWRHLLEGAAHPILVFSDHKN-L 886
Query: 840 SYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
YLR + L +F R + F PG+ N AD+LSR
Sbjct: 887 EYLR---SARRLRPCQARWALFF----SRFNFHVTFRPGSKNGKADALSR 929
>gi|40786849|gb|AAR89939.1| polymerase protein [Sheldgoose hepatitis B virus]
Length = 785
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 95/225 (42%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 388 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 442
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P LA+S V P G+ +P + + S + R
Sbjct: 443 SLDLSQAFYHLPFNPASSSRLAISDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 502
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L + L LG +N K++ SP ++FLG
Sbjct: 503 FNVWTFTYMDDFLLCHPNARHLHAISNAVCTFLQELGVRINFDKTTPSPVHEIRFLGYEI 562
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
D M + E + L +++ + + ++ + +G+L+F
Sbjct: 563 DSTY--MKIEESRWKELRTVIKKIKVGEWYDWKCIQRFVGHLNFV 605
>gi|45775510|gb|AAS77348.1| polyprotein [Duck hepatitis B virus]
Length = 788
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 99/221 (44%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTDIRFLGNQIDEKY 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + ++ + +G+L+F
Sbjct: 570 --MKIEESRWKELRTVIKKIKVGTWYDWKCIQRFVGHLNFV 608
>gi|358056390|dbj|GAA97679.1| hypothetical protein E5Q_04357 [Mixia osmundae IAM 14324]
Length = 668
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 82/313 (26%), Positives = 134/313 (42%), Gaps = 29/313 (9%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
KG ++I +DL AY VP++ + L + G+ T LPF L +AP AF L+ +
Sbjct: 64 KGCWLIKLDLEDAYHQVPVRLADRHLLGFEWRGEQYMSTRLPFELRSAPYAFNLLAEGLH 123
Query: 604 SLLRS----RGMRVVVYLDDFLLVNQDPRIL---EIQG--KLAVSILGSLGWIVNLQKSS 654
+L G ++ YLDDFL+V PR + E +G A+ I LG ++ K
Sbjct: 124 WILERCALPAGRKIRHYLDDFLIVL--PRTVSEEEARGVAHRALQIGEQLGLMLKAAKLE 181
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
P L FLG+ + +P+DK L + T + L + L SF
Sbjct: 182 -GPTHDLTFLGLRINTITGVALVPDDKLAKLRRLTSTWQRRQAATLKEIQELSAERSFVV 240
Query: 715 FVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL---SSPIFPRQVQ 771
+ I ++R+++ + L + L +W + P + I +
Sbjct: 241 WTI---------MRRRSNQVMLEGEYRARYGEI----LSFWWHLAPTWNGHTMIADDRAP 287
Query: 772 HFISTDASDLG-WGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
++TD S +G G+ D LS L + I E++AV +A L + VM
Sbjct: 288 IAVATDDSGVGDIGAVCDELTLSELAPKSIIKEGIMALEIYAVVRAFRLWGVRWRGQRVM 347
Query: 831 VQSDNQTVVSYLR 843
V DNQ V++ ++
Sbjct: 348 VYCDNQAVIAAIK 360
>gi|118863|sp|P17193.1|DPOL_HPBDW RecName: Full=Protein P; Includes: RecName: Full=DNA-directed DNA
polymerase; Includes: RecName: Full=RNA-directed DNA
polymerase; Includes: RecName: Full=Ribonuclease H
gi|325446|gb|AAA45751.1| P protein [Duck hepatitis B virus]
Length = 788
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 99/221 (44%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
RLFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 RLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVNDIRFLGYQIDQKF 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
R + E + L +++ + ++ + +G+L+F
Sbjct: 570 MR--IEESRWKELRTVIKKIKIGAWYDWKCIQRFVGHLNFV 608
>gi|326678681|ref|XP_003201137.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1145
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 112/455 (24%), Positives = 194/455 (42%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 284 LSQPETEAMKKYISEELEKGFIR--PSTSPASAGFFFVRKKDGSLRPCIDYRGLNEITVK 341
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 342 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTIDGHYEYLVM 398
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
P GLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 399 PLGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIKN 456
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P M D+Q + + W
Sbjct: 457 QL-----YAKSSKCEFHQTCISFLGYIISPEGVAM----DQQ--------KVDSVTQWPQ 499
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F + R I +S + AP LT + A +L+W +A+
Sbjct: 500 PETIRQLQRFLGFANF-------YRRFILNFSS---VAAP-LTAMVKANNARLKWNPDAV 548
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 549 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 608
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 609 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 664
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 665 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 695
>gi|270017238|gb|EFA13684.1| hypothetical protein TcasGA2_TC016048 [Tribolium castaneum]
Length = 1075
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 123/531 (23%), Positives = 207/531 (38%), Gaps = 91/531 (17%)
Query: 381 SSPQNLEPPGRVSLKVQTLQKPQ-RCSSPVNPPADSRIGAELVGGRLRRFVDAWIRLGAP 439
+ Q+LE P + KV L+ P+ ++ + R L G ++ I+L
Sbjct: 176 NKKQHLESPSALQAKVDPLKIPEAEKRKLIHLLQEYRCIFSLRPGLTHKYTHE-IKLHDK 234
Query: 440 APLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFL 499
P ++ Y +PF+ +P A+ IQEML+ GV+KR + +
Sbjct: 235 TPFLK--RPYPVPFALRP-----------------AVDATIQEMLDLGVIKR--EASPYA 273
Query: 500 SRLFLVPKGNGGTRPVLNLKGLNQFL------SPKKFSLINHFRIPSFLQKGDYMISIDL 553
S + +V K +G R L+ + +N + P L+ F YM +IDL
Sbjct: 274 SPMTVVKKKDGTVRICLDARMINSKMIADCESPPAADELLRRF------HGIRYMSTIDL 327
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASLLRSRGMR 612
+Y+ +P+ +++ A YNG LPFGL TA +F+ + V + +R
Sbjct: 328 RSSYWQIPLSPESRQYTAFLYNGRSYTYQVLPFGLKTAVGSFSRAMDVVLGTEVRE---F 384
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS-LSPAPVLQFLGIMWDPH 671
VV Y+DD L+ ++ + L +NL+KS+ + P + + + +
Sbjct: 385 VVNYIDDLLVASETLNEHLEHLRQVFEKLKQARMTINLEKSNFIQKEPTRKKISAIRNFP 444
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQA 731
+ + L L N R A + LL R+
Sbjct: 445 VPQKTKHVRAFLGLCNFYRKFCARYSAATQDVNKLL---------------------RKG 483
Query: 732 SLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS-----Q 786
R G + LE L P + IF ++ TD+S G G+ Q
Sbjct: 484 EKWRWGRNEQEAFDRVKDLFLEAVLLRYPDLNKIF------YVQTDSSGYGLGAELYQIQ 537
Query: 787 VDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSY 841
D S F S + N+ +KE+ V AL +Q + +++++D+Q + +
Sbjct: 538 EDGSRGVIAFASRSLKGPELNYTTTEKELLGVIFALHKFRIYIQVTKIIIRTDHQA-LKF 596
Query: 842 LRRQGGTKSLSLLSE--VEKIFLLSQ-DWRIHILAQFIPGAYNSVADSLSR 889
L R LLSE +L Q D+ I + + G N +AD LSR
Sbjct: 597 LSR------CRLLSERLTRWTLILGQYDYEI----ELVKGKDNVIADILSR 637
>gi|270016158|gb|EFA12606.1| hypothetical protein TcasGA2_TC001846 [Tribolium castaneum]
Length = 1134
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 108/470 (22%), Positives = 202/470 (42%), Gaps = 65/470 (13%)
Query: 443 VRIVSGYAIPF-SAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR 501
+ I +G A+PF + P+ P + ++ + EML GV++ S + + S
Sbjct: 472 LNIETGDAVPFRQYQYPMSPY---------MQKILNSEVDEMLRLGVIE--PSQSPWCSP 520
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LV K +G R + + LN+ + L RI S L+ ++ S+DL +A++ +P
Sbjct: 521 VLLVKKSSGEYRFCFDGRKLNEITKHDSYPLPRIDRILSLLRDAKFISSLDLRRAFWQIP 580
Query: 562 IKTTHQRFLALSYNG-DVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
+ + A + G + T +PFGL + Q L + A ++ VYLDD
Sbjct: 581 LSEPSKEKTAFAVQGRGLFQFTVMPFGLRNSAQTQQRLMD--AIFGPQFEPKIFVYLDDL 638
Query: 621 LLV----NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMW 676
++V + +LEI ++ L + +NL KS L++LG + D R
Sbjct: 639 IIVTATFEEHVELLEI----VLNHLKAANLTINLDKSKFFRTS-LKYLGYIIDAEGLRT- 692
Query: 677 LPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS--------FASFVIPMGRLHSRRIQ 728
PE I + + N + +G S F+S V P+ L + +
Sbjct: 693 DPE-------KISSMVEYPRPKNATEIKRFIGLCSWYRRFIKNFSSLVAPINDLLKGKKK 745
Query: 729 RQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD 788
+Q ++ T + A+ L ++A L++P F + ++ DASD+G G +
Sbjct: 746 KQE--VKWDEKTETAFH-AIKNAL---VSAPVLTTPDFSKPF--YVQCDASDVGLGGVLT 797
Query: 789 S---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVV 839
F S S+ ++N+ + ++E AV ++ P ++ + V +D+ +++
Sbjct: 798 QGEEGFEKVICFASRGLSKSERNYSVTERECLAVIFSIEKFRPYIEGTNFTVITDHHSLL 857
Query: 840 SYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
R + T L+ + L Q + +I+ + G N V D+LSR
Sbjct: 858 YLFRMKNPTGRLA------RWILRLQQFSFNIIHR--KGNINVVPDALSR 899
>gi|342871285|gb|EGU73979.1| hypothetical protein FOXB_15510 [Fusarium oxysporum Fo5176]
Length = 1306
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 103/430 (23%), Positives = 176/430 (40%), Gaps = 41/430 (9%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QE L+ G ++ ST+ S + K GG R ++ + LN ++ L
Sbjct: 314 LQENLDKGFIR--TSTSPAASPVLFAKKPGGGLRFCVDYRALNAITIKNRYPLPLIQETL 371
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S L + Y +D+ A+ + IK + A + + +PFGL+ AP F +
Sbjct: 372 SQLSQAKYFTKLDVVAAFNRIRIKEGQEWMTAFNTRYGLFESLVMPFGLSNAPATFQARI 431
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N V R Y+DD L+ + D + K + L + G ++++K
Sbjct: 432 NEVLRPFLDR--YCTAYIDDILIYSNDLASHRLHVKSVLQALEAAGLQLDVKKCEFETTE 489
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLDSARSLLGYLSFASFVIP 718
V ++LG++ RM PE + W N + + + +L F++F
Sbjct: 490 V-KYLGMIISTTGVRM-DPEKVDCLVN-----------WENPVNVKDVQAFLGFSNFYRR 536
Query: 719 MGRLHSRRIQRQASLLR-LGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTD 777
+ SR ++ +L R L + T L+ A P+ P + + F+ D
Sbjct: 537 FIKGFSRIVRPLVALTRKLVKWNWTRSCQEAFDMLKNSFTAAPILRHFDPTK-EVFVECD 595
Query: 778 ASDL---GWGSQVDS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS 827
ASD G SQ D +F+S ++ + N+ I KE+ A+ + P LQ +
Sbjct: 596 ASDFVSSGILSQEDDQGVLHPVAFMSKKYNPAECNYEIYDKELLAIVRCFECWRPELQGA 655
Query: 828 V--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
+ V +D+ + ++ TK LS +V LSQ + IPG N D
Sbjct: 656 HHPITVITDHANLRYFM----TTKQLS-RRQVRWSEFLSQ---FQFAIKSIPGKENGKPD 707
Query: 886 SLS-RSKSLP 894
SL+ RS+ LP
Sbjct: 708 SLTRRSQDLP 717
>gi|118861|sp|P17192.1|DPOL_HPBDB RecName: Full=Protein P; Includes: RecName: Full=DNA-directed DNA
polymerase; Includes: RecName: Full=RNA-directed DNA
polymerase; Includes: RecName: Full=Ribonuclease H
gi|325450|gb|AAA45754.1| P protein [Duck hepatitis B virus]
Length = 788
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 99/220 (45%), Gaps = 11/220 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNIW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVNDIRFLGYQIDQKF 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
M + E + L +++ + ++ + +G+L+F
Sbjct: 570 --MKIEESRWKELRTVIKKIKIGAWYDWKCIQRFVGHLNF 607
>gi|4432807|gb|AAD20658.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1611
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 114/468 (24%), Positives = 191/468 (40%), Gaps = 70/468 (14%)
Query: 452 PFSAK--PPLVPLCSLQHLATPVSSA-MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG 508
PF+ + P P+ + P A + ++E+L+ G ++ S G + + V K
Sbjct: 648 PFTIELEPGTTPISKAPYRMAPAEMAKLKKQLEELLDKGFIRPSSSPWG--APVLFVKKK 705
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
+G R ++ +GLN+ K+ L + L + IDL+ Y +PI+ T R
Sbjct: 706 DGSFRLCIDYRGLNKVTVKNKYPLPRIDELMDQLGGAQWFSKIDLASGYHQIPIEPTDVR 765
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
A D +PFGL AP AF + N V V+++++D L+ ++
Sbjct: 766 KTAFRTRYDHFEFVVMPFGLTNAPAAFMKMMNGVFRDFLDEF--VIIFINDILVYSKSWE 823
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG-N 687
+ + + L L K S V FLG H + D+ +++
Sbjct: 824 AHQEHLRAVLERLREHELFAKLSKCSFWQRSV-GFLG-----H-----VISDQGVSVDPE 872
Query: 688 ILRTLLASKTW----NLDSARSLLG--------YLSFASFVIPMGRLHSRRI------QR 729
+R++ K W N RS LG +SFAS P+ RL + +
Sbjct: 873 KIRSI---KEWPRPRNATEIRSFLGLAGYYRRFVMSFASMAQPLTRLTGKDTAFNWSDEC 929
Query: 730 QASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---Q 786
+ S L L A LT VLP+ P + + TDAS +G G Q
Sbjct: 930 EKSFLELKA-MLTNAPVLVLPE-----EGEPYT-----------VYTDASIVGLGCVLMQ 972
Query: 787 VDS--SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRR 844
S ++ S + ++N+ + EM AV L + L + V + +D+++ + Y+
Sbjct: 973 KGSVIAYASRQLRKHEKNYPTHDLEMAAVVFFLKIWRSYLYGAKVQIYTDHKS-LKYIFT 1031
Query: 845 QGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
Q L+L + L D+ + I + PG N VAD+LSR +S
Sbjct: 1032 Q---PELNLRQ--RRWMELVADYNLDI--AYHPGKANQVADALSRRRS 1072
>gi|326664045|ref|XP_003197719.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1109
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 113/455 (24%), Positives = 193/455 (42%), Gaps = 78/455 (17%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 329 LSQPETEAMKSYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 386
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY I+ + S +
Sbjct: 387 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAY--NLIRQGDEWKTGFSTIDGHYEYLVM 441
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGLA +P F + N + + ++ V+VY+DD L+ + I ++ L I
Sbjct: 442 PFGLANSPSVFQAFVNEIFRDMLNKW--VIVYIDDILVYSNSLSEHIQHVRAVLKRLIQN 499
Query: 643 SLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-N 699
L KSS + FLG + P + + D+Q + + W
Sbjct: 500 QL-----YAKSSKCKFHQTCISFLGYIISP----VGVAMDQQ--------KVDSVTQWPQ 542
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
++ R L +L FA+F RR R S AP LT + A +L+W +A+
Sbjct: 543 PETIRQLQRFLGFANFY--------RRFIRNFS--SFAAP-LTAMVKANNARLKWNPDAV 591
Query: 760 PL---------SSPIF--PRQVQHF-ISTDASDLGWGSQVDSSFL-------SGLWSRE- 799
S+PI P Q F + DAS+ G G+ + L +SR+
Sbjct: 592 RAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSLVNKKLHPCAFYSRKL 651
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +V +D++ + Y+R K L+
Sbjct: 652 NSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFIVITDHKN-LEYIR---SCKRLNPR 707
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F D+++ +IPG+ N AD+LSR
Sbjct: 708 QARWALFFTRFDFQV----TYIPGSKNIKADALSR 738
>gi|15150397|gb|AAK85436.1|AF404406_1 polyprotein [Duck hepatitis B virus]
Length = 787
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 99/220 (45%), Gaps = 11/220 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 390 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 448
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 449 SQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVW 508
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D
Sbjct: 509 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVTEIRFLGYQIDQKF 568
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
M + E + L +++ + ++ + +G+L+F
Sbjct: 569 --MKIEESRWKELRTVIKKIKIGAWYDWKCIQRFVGHLNF 606
>gi|9626721|ref|NP_040998.1| polymerase [Heron hepatitis B virus]
gi|118865|sp|P13846.1|DPOL_HHBV RecName: Full=Protein P; Includes: RecName: Full=DNA-directed DNA
polymerase; Includes: RecName: Full=RNA-directed DNA
polymerase; Includes: RecName: Full=Ribonuclease H
gi|325454|gb|AAA45738.1| polymerase [Heron hepatitis B virus]
Length = 788
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 95/225 (42%), Gaps = 19/225 (8%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
R+FLV K + T + +QF K N R P + L G I
Sbjct: 391 RIFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPKYWCPNLTTLRRILPVGMPRI 445
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + + + SR
Sbjct: 446 SLDLSQAFYHLPLAPASSSRLAVSDGKQVYYFRKAPMGVGLSPFLLHLFTTAIGAEIASR 505
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + R L + L G +N K + SP ++FLG +
Sbjct: 506 FNVWTFSYMDDFLLCHPSARHLNTISHAVCTFLQEFGIRINFDKMTPSPVTTIRFLG--Y 563
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+ M + E + L +++ + + ++ + +G+L+F
Sbjct: 564 EISKQHMKIEESRWNELRTVIKKIKVGQWYDWKCIQRFIGHLNFV 608
>gi|25229104|gb|AAN71721.1| reverse transcriptase/ribonuclease H [Danio rerio]
Length = 459
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 79/362 (21%), Positives = 158/362 (43%), Gaps = 20/362 (5%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
+ ++ +D+S A+ +P+ +++ L F ++P+ F LS +
Sbjct: 73 RNAWLAKVDISSAFKIMPLHPDFWHLFGINWRSKFYFAVRLTFRCKSSPKIFDMLSEAIC 132
Query: 604 SLL-RSRGMRVVVYL-DDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL 661
+L + G+ +++L DDFL+++ Q + + +LG + +K++ P+ L
Sbjct: 133 WILTNNYGVSHLIHLLDDFLIISLPSEPPARQLAITQKVFANLGIPLAEEKTA-GPSTSL 191
Query: 662 QFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR 721
+FLGI D + LP++K + + L + S+LG+L+FA +IP GR
Sbjct: 192 EFLGIKLDSKNFQASLPKEKIDRIIFLSSIFLEKQICTKRELLSILGHLNFAMRIIPQGR 251
Query: 722 -LHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL--------SSPIFPRQVQH 772
S +Q AS+ G P++ A +L W++ L S + H
Sbjct: 252 PFVSHLLQTAASIN--GLEETIPLSEACRRELSLWISFLKCWNGCSFFYSDLVLAPIDIH 309
Query: 773 FISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKK-----EMFAVHQALSLNLPLLQSS 827
+ A +G+G + + W + + + E++ + A + +S
Sbjct: 310 LFTDAAPSVGFGGYYQGRWFASPWPSQMLEIPLPSQSSALFELYPLVAATIIWGDEWSAS 369
Query: 828 VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSL 887
+++ SDN+ V + + G + L+ + ++ S + I A+ +PG N +ADSL
Sbjct: 370 SILIHSDNEAAVQCVNK-GRSHFPILMPFIHRLVWTSAKKQFIITAKHVPGFKNQIADSL 428
Query: 888 SR 889
SR
Sbjct: 429 SR 430
>gi|308492221|ref|XP_003108301.1| hypothetical protein CRE_10000 [Caenorhabditis remanei]
gi|308249149|gb|EFO93101.1| hypothetical protein CRE_10000 [Caenorhabditis remanei]
Length = 1478
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 107/469 (22%), Positives = 190/469 (40%), Gaps = 61/469 (13%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V I + +P +P VP+ + + HI+ +L++ + +S T + S +
Sbjct: 501 VHIYTNTEVPIKGRPYRVPV--------KYQAELEKHIESLLKSRRIT--ESNTPWTSPI 550
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFH 559
+V K NG R L+ + LN+ P F L RI S L+K Y S+D++ Y
Sbjct: 551 VIVKKKNGSLRVCLDFRKLNEATIPDNFPLP---RIDSILEKVGGSSYFSSLDMANGYLQ 607
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDD 619
+ + + V A T LPFGL +A F + ++L V+VY+DD
Sbjct: 608 LRLDPASSYKCGFITDQKVYAYTHLPFGLRSAASYFQRA---LRTVLGGLEEEVLVYIDD 664
Query: 620 FLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPE 679
L+ ++ + + S + +K + + + FLG +++
Sbjct: 665 ILVFSKTFEKHVESLRKVLHRFRSFNLKASPKKCEFAKS-AITFLG----HEINKDNYAP 719
Query: 680 DKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP----MGRLHSRRIQRQASLLR 735
DK N+ + L + N++ R +G F IP + +R +++ + +
Sbjct: 720 DK----ANVAKILEFPEPTNVNEIRRFVGMAGFFRKFIPNFSEIAEPLTRLTRKEKNFVW 775
Query: 736 LGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF-ISTDASDLGWGS-------QV 787
+ KL L A P+ FP + F I DAS + G+ +
Sbjct: 776 DRDQQES------FEKLRTALIAEPILG--FPDYDKPFHIFCDASSVAQGAALMQSRDEK 827
Query: 788 DSSFL-----SGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYL 842
D F S S + W + EM A+ AL P + S +++ SD++ + L
Sbjct: 828 DKDFCVIAYASRTLSDPETRWPAIQVEMGAIIYALRQFRPYVCMSKIILHSDHKPLTFLL 887
Query: 843 RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
++ +LS + + Q + I I+ I G N+VAD LSR++
Sbjct: 888 QKAKTHDNLS------RWLIELQCYDISII--HIDGKKNTVADCLSRAR 928
>gi|211925532|dbj|BAG81989.1| gag-pol polyprotein [Clonorchis sinensis]
Length = 569
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 97/430 (22%), Positives = 177/430 (41%), Gaps = 51/430 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRI 538
+ MLE G+++ S++ + S L +VPK + G RP + + LN P ++ + +
Sbjct: 122 FEHMLELGIIR--TSSSHWSSPLHMVPKKSKGDWRPCGDYRSLNNATIPDRYPIPHIHDF 179
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L + +DL +AY+H+P+ A++ + T +PFGL A Q F
Sbjct: 180 ASTLCHTNIFSKLDLVRAYYHIPVAPDDIPKTAITTPFGLFEFTRMPFGLRNAAQTFQRF 239
Query: 599 SNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V RG+ V YLDD L+ + P + L + +N+ K L
Sbjct: 240 MDEVL-----RGLPFVYAYLDDVLIASTSPTEHAAHLRAVFERLSTYSIRLNIDK-CLFG 293
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGYLSFA 713
L FLG H+D + + +LA +++ L R +G +++
Sbjct: 294 VTSLDFLG----HHIDSTG--------ISPLPNRILALESFPIPTTLTQLRRFIGIINYY 341
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAV-LPKLEWWLNALPLSSPI----FPR 768
IP H I + + L LG + P+V + E A+ ++ +
Sbjct: 342 RRFIP----HCADILQPLTDL-LGCKEKSVTLPSVAIAAFERAKQAIAHATKLSFLDTHE 396
Query: 769 QVQHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSL 819
+ ++TDAS+ G+ + +F S Q + +E+ A++ A+
Sbjct: 397 STKLILTTDASNAAVGAVLHQVVNNASQPLAFFSQKMQAAQTRYSTFGRELLAIYLAIRH 456
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
LL+ +Q+D++ + + S + ++ I + D R + PG+
Sbjct: 457 FRHLLEGRSFTIQTDHKPLTYAFNAKPDRYSPREIRHLDYISQFTTDIR------YTPGS 510
Query: 880 YNSVADSLSR 889
N VAD+LSR
Sbjct: 511 DNVVADALSR 520
>gi|327267811|ref|XP_003218692.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Anolis carolinensis]
Length = 989
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 112/463 (24%), Positives = 190/463 (41%), Gaps = 58/463 (12%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L L P A+ ++ E L+ G ++ +S T +R+F V K G R V + +
Sbjct: 339 LPAGRLYALTVPERQALREYLDENLQKGFIRPSNSPTA--ARVFFVAKKTGDLRLVCDYR 396
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN++ ++ L + S +Q +DL AY V IK + A +
Sbjct: 397 VLNKYTVRDRYPLPLIPELLSRVQGASIFTKLDLHGAYNLVRIKEGDEWKTAFNTCFGTY 456
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
+PFGL AP F N V L + VV+YLDD L+ ++D R E + +S
Sbjct: 457 ENLVMPFGLCNAPVVFQRFINDVFRDLLDKI--VVIYLDDILIFSKDAREHEEHVRQVLS 514
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLG--IMWDPHLDRMWLPEDKQLTLGNILRTLLASKT 697
L + G K + V +FLG I W ++L + R + A
Sbjct: 515 RLRANGLFAKASKCVFHVSEV-EFLGHVISW------------RELKMDP--RKVSAVLE 559
Query: 698 W----NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKL 752
W N + LG+ ++ IP S I S +R P T +L
Sbjct: 560 WRAPTNKKEVQRFLGFANYYRKFIPDFARWSDPI---TSCIRGKQPFRWTDQAEKGFQQL 616
Query: 753 EWWLNALPL---SSPIFPRQVQHFISTDASDLGWGSQV-----DSSFLSGLWSRE----Q 800
+ + P+ +P P VQ DASD+ G+ + D +SR+ +
Sbjct: 617 KKLFTSQPILQHPNPGTPFVVQ----ADASDVAIGAVLLQPVGDHLHPCAFYSRQLTTPE 672
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
+N+ I +KE+ A+ A L+ + + V +D++ + +LR + L+ +
Sbjct: 673 RNYTIWEKELLAIKAAFETWRHWLEGAKFPIEVHTDHRN-LEHLR---TARKLNQRQQRW 728
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRS 901
+F +++IH ++ A AD+LSR P++ + R
Sbjct: 729 ALFFERFNFQIH----YVTPAQTKQADALSRK---PEYAIRRE 764
>gi|308486555|ref|XP_003105474.1| hypothetical protein CRE_22308 [Caenorhabditis remanei]
gi|308255440|gb|EFO99392.1| hypothetical protein CRE_22308 [Caenorhabditis remanei]
Length = 1279
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 101/441 (22%), Positives = 189/441 (42%), Gaps = 59/441 (13%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSL 532
+ +S ++ + + GV+ +D + + + + LV K NG R + GLN+ + + L
Sbjct: 395 TTVSDELERLQQAGVISPVDHSE-WAAPIVLVKKKNGSLRMCADFSTGLNEAIQQHQHPL 453
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I S L G Y IDL++AY + I ++ L ++ + + LPFG+ +AP
Sbjct: 454 PTADDIFSTLNGGKYFSQIDLAEAYLQIEIDEQAKQMLCINTHRGLYRYNRLPFGVKSAP 513
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
+F + + + S L V YLDD ++ + K +S + G V ++K
Sbjct: 514 GSFQQIMDSMTSGLDG----VAAYLDDIIITGSSVAEHNQRLKTVMSRIQDFGLRVRIEK 569
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
+ +P + FLG + D R P+ ++++ +R + + N RS LG + F
Sbjct: 570 CTFL-SPKITFLGFIIDKDGRR---PDPEKVSA---IRHMPVPQ--NESQVRSFLGLIQF 620
Query: 713 -ASFVI-------PMGRLHSRRIQ-RQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSS 763
SFV P+ L + ++ + S + H+ I + L L + LP+
Sbjct: 621 YGSFVKELFKLRPPLDALTKKDVEFKWTSECQNAFDHIKQILHSDLL-LTHYDPKLPI-- 677
Query: 764 PIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINK-------------KEM 810
++ DAS G G+ + F G E+ +HI+K KE
Sbjct: 678 ---------IVAADASQYGIGAVISHRFPDG---SEKAIYHISKALTAPQRNDSQIEKEA 725
Query: 811 FAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWR 868
F + A++ + +++D++ ++S + G S + +++ I LL+ D+
Sbjct: 726 FGLITAVTKFHRFIHGRHFTLRTDHKPLLSIFGEKKGIPVYS-ANRLQRWAIILLNYDFN 784
Query: 869 IHILAQFIPGAYNSVADSLSR 889
I + G AD+LSR
Sbjct: 785 IEYINTHDFGQ----ADALSR 801
>gi|119657135|gb|ABL86693.1| putative pol protein [Adineta vaga]
Length = 1302
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 105/454 (23%), Positives = 191/454 (42%), Gaps = 55/454 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I ++L+ GV++ +ST+ + S + LV K +G R ++ + LN + F + I
Sbjct: 399 INKLLKQGVIE--ESTSPWSSPIVLVRKKDGSVRFCIDYRKLNAITTKDAFPIPRIDDIF 456
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L + Y +ID YF V + + A S T LP G+ P AF +
Sbjct: 457 DHLSQAGYYTTIDFKSGYFQVGLDARDRPKTAFSTRDQHYQFTVLPQGVTNGPPAFQRIV 516
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ + L +R + YLDD ++ + D ++ + L + L + +N+ K ++
Sbjct: 517 SQI--LGPTRWKYALAYLDDVIIYSPTFDQHLVHLDDIL--NRLHEANFRLNVGKCHIAQ 572
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
+ +LG H++ + + NI L + A + + I
Sbjct: 573 TSI-DYLG----HHIEHGNIKPNA----DNIHALLETPQPATAKEAFRFVKAAEYYRKFI 623
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---------WLNALPLSSPI--F 766
P + ++ + + A + + P P L E N L L P
Sbjct: 624 PKFSMIAQPLYKYAPTTKEQRSNKMPAVPIQLLDDELHAFHELKQILTNDLILRIPDENL 683
Query: 767 PRQVQHFISTDASDLGWGS---QVDSS------FLSGLWSREQQNWHINKKEMFAVHQAL 817
P ++Q TDAS +G G+ Q S+ +LS ++ Q NW ++E +A+ A+
Sbjct: 684 PFKIQ----TDASKIGIGAVLMQTHSNGDLPVAYLSKKFTTTQMNWPATEQECYAIIHAI 739
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
L ++++D++ ++ + +Q L S+ E+ L Q ++ I ++I
Sbjct: 740 EKWHKYLDGREFIIETDHKPLLPFNLKQ------QLNSKCERWRLKLQQYKFTI--RYIK 791
Query: 878 GAYNSVADSLSRSKS------LPDWHLSRSATEQ 905
G +N+VAD LSRS S L D+ +RS T Q
Sbjct: 792 GKHNTVADYLSRSPSDNASDDLDDYVPTRSQTTQ 825
>gi|224074125|ref|XP_002304262.1| predicted protein [Populus trichocarpa]
gi|222841694|gb|EEE79241.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 105/434 (24%), Positives = 182/434 (41%), Gaps = 52/434 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGN----GGTRPVLNLKGLNQ------FLSPKK 529
IQE+L T +++ S + S FLV K + G R V+N K LN +L P+K
Sbjct: 42 IQELLGTNLIE--PSESPHFSPAFLVNKHSEQKRGKRRMVINYKKLNDHTIGDGYLLPRK 99
Query: 530 FSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
L++ R S D ++ V + + Q+ A + +PFGL
Sbjct: 100 DELLDQIRGKKIFS------SFDCKSGFWQVLLDNSSQKLTAFTCPQGHFQWKVMPFGLK 153
Query: 590 TAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV-NQDPRILEIQGKLAVSILGSLGWIV 648
AP F + + + VY+DD ++ + D +E K+ + +G I+
Sbjct: 154 QAPSIF---QRHMDQTFKGFELFCRVYVDDIIIFSDNDKEHIEHVTKV-LDRCKEIGVIL 209
Query: 649 NLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLG 708
+L K+ L + FLG++ D ++ L E +G L + SK + + LG
Sbjct: 210 SLPKAQLFKESI-NFLGLVIDK--GQIMLQEH----IGEHL-SAFNSKIADKKQLQRFLG 261
Query: 709 YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPR 768
L++ S P ++ R QA L + + + + K++ + LP P+
Sbjct: 262 ILNYISHFCP--KVAQIRQPLQAKLKKDSIWQWSKEDTDYIEKIKKAVKHLPPVHHPGPK 319
Query: 769 QVQHFISTDASDLGWGS----------QVDSSFLSGLWSREQQNWHINKKEMFAVHQALS 818
+ I TDASD WG ++ + SG + +QN+H N+KE+ A+ +
Sbjct: 320 EPL-IIETDASDKYWGGILKAQPEEGPELICGYASGTFKPAEQNYHSNEKELLALINTIK 378
Query: 819 LNLPLLQSSVVMVQSDNQTVVSYLRR--QGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
L + ++DN+ V +L G K L+ +++L D I+ + +
Sbjct: 379 RFQVYLIPVQFVARTDNKNVFFFLNTNIHGSYKQGRLVR--WQLWLSYYD----IVFEHV 432
Query: 877 PGAYNSVADSLSRS 890
G N AD+LSR
Sbjct: 433 EGINNIFADTLSRE 446
>gi|326668018|ref|XP_003198707.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1157
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 116/456 (25%), Positives = 182/456 (39%), Gaps = 65/456 (14%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P AM +I + L +++ S G + F V K +G R ++ +G
Sbjct: 331 PNGKLYSLSVPEREAMEKYISDSLAAKIIRPSSSPAG--AGFFFVKKKDGSLRLCIDYRG 388
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN + L LQ ++ +DL AY V I+ + A +
Sbjct: 389 LNSITVKITYPLPLMSSAFERLQGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFE 448
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
LPFGL+ AP F +L N +LR + VYL D L+ + + + +
Sbjct: 449 YCVLPFGLSNAPAVFQALVN---DVLRDMIDQFIYVYLYDILIFSHSLQEHIQHVRRVLQ 505
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW- 698
L G V +K A +QFLG + RM PE Q + W
Sbjct: 506 RLLENGLYVKAEKCVFH-AQSVQFLGHIVSVEGMRM-DPEKIQAVVD-----------WP 552
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---- 754
DS ++L +L+FA+F RR R S +L AP LT + + P +W
Sbjct: 553 TPDSRKALQRFLAFANFY--------RRFIRNFS--QLAAP-LTSLTSSKTP-FKWSSAA 600
Query: 755 -----WLNALPLSSPIF--PRQVQHF-ISTDASDLGW-----------GSQVDSSFLSGL 795
L +S+PI P + F + DAS++G G ++ S
Sbjct: 601 EAAFSKLKGCFVSAPILIAPDPSRQFVVEVDASEVGVEAILSQRSASDGKVHPCAYFSHR 660
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV--MVQSDNQTVVSYLRRQGGTKSLSL 853
S ++N+ I +E+ AV AL L+ S V +V +D++ + Y+R K L+
Sbjct: 661 LSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKN-LEYIR---SAKRLNS 716
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ N D+LSR
Sbjct: 717 RQARWALFF----GRFNFTISYRPGSKNIKPDALSR 748
>gi|301617456|ref|XP_002938159.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1420
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 107/463 (23%), Positives = 183/463 (39%), Gaps = 71/463 (15%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P + LA P A+ +++E L G ++ S G + +F V K + RP ++ +
Sbjct: 471 IPFGRIYPLAEPELEALRSYLEENLAKGFIRPTTSPAG--AGIFFVEKKDHTLRPCIDYR 528
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
LN ++ L IP Q+ +DL AY V I+ + A
Sbjct: 529 NLNSITIKNRYLLP---LIPELFQRLREAKIFSKLDLRGAYNLVRIREGDEWKTAFRTRY 585
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP A+ ++V + R + V++YLDD L+ ++ + +
Sbjct: 586 GHFEYLVMPFGLCNAP---ATFQHFVNDVFRDYLDVFVIIYLDDILVFSKSVAEHRVHME 642
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
S L S +K + +FLG + +M Q + IL
Sbjct: 643 KIFSRLRSHQLFAKFEKCEFDKTSI-EFLGFIISAEGIQM-----DQKKISAIL------ 690
Query: 696 KTWNLD-SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW 754
W + S + + ++ FA+F RR + S + L LT ++ K W
Sbjct: 691 -DWPVPLSRKDVQRFIGFANFY--------RRFIKGFSQIMLPITRLTSLST----KFHW 737
Query: 755 ---------WLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVDSSF----------- 791
L L S+PI P + FI DAS+ G+ + F
Sbjct: 738 TPQAQSAFELLKTLFTSAPILQHPDPARPFILEVDASESAVGAVLSQRFPASGALHPVAY 797
Query: 792 LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTK 849
S ++ +QN+ + +E+ A+ AL LL+ +++ +D++ + YLR K
Sbjct: 798 FSRKMNKSEQNYDVADRELLAIKLALEEWRYLLEGGPHPILIYTDHKNL-EYLR---VAK 853
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
L +F + R + + PG+ NS AD+LSR S
Sbjct: 854 RLRPRQARWALFFM----RFNFHLTYRPGSKNSKADALSRIHS 892
>gi|410026807|gb|AFV52543.1| DNA polymerase [Duck hepatitis B virus]
Length = 788
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 99/221 (44%), Gaps = 11/221 (4%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFLV K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLTTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L + L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCTFLQELGIRINFDKTTPSPVNDIRFLGYQIDQKF 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
M + E + L +++ + ++ + +G+L+F
Sbjct: 570 --MKIEESRWKELRTVIKKIKVGAWYDWKCIQRFVGHLNFV 608
>gi|301611270|ref|XP_002935168.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1225
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 111/479 (23%), Positives = 195/479 (40%), Gaps = 76/479 (15%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V ++ G AIPF PL + P + +I+E L G ++ S G + +
Sbjct: 271 VDLLPGAAIPFGRIYPL---------SEPELKVLKTYIEENLRKGFIRPSTSPAG--AGI 319
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPI 562
F V K + RP ++ + LN+ ++ L + L+ +DL AY V I
Sbjct: 320 FFVEKKDHSLRPCIDYRDLNKVTIKNRYPLPLISELFIRLRSARVFTKLDLRGAYNLVRI 379
Query: 563 KTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
+ + A +PFGL AP F +N + L V++YLDD L+
Sbjct: 380 RQGDEWKTAFRTRYGHYEYLVMPFGLCNAPATFQHFANDILDLF------VIIYLDDILI 433
Query: 623 VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQ 682
+ + S L + L+K + + +FLG ++
Sbjct: 434 FSSSLEEHRHHVRQVFSRLRAYKLFAKLEKCEFEKSSI-EFLG----------FIISSDG 482
Query: 683 LTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL 741
+++ + R + A W + +S +++ ++ FA+F R+ + S ++ AP +
Sbjct: 483 ISMDS--RKVSAVLDWPVPNSRKAVQRFVGFANFY--------RKFIKNFS--KIIAP-I 529
Query: 742 TPINPAVLPKLEWWLNA----LPL-----SSPIF--PRQVQHFI-STDASDLGWGSQVDS 789
T + +V +L W A L L S+PI P + F+ DAS+ G+ +
Sbjct: 530 TALTSSV-KRLCWTSEAQRAFLDLKNRFTSAPILKHPDPTRPFVLEVDASEYAIGAVLSQ 588
Query: 790 -----------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ--SSVVMVQSDNQ 836
+F S S +QN+ + +E+ A+ A LL+ S ++V SD++
Sbjct: 589 RNDVQSLLHPIAFFSKKLSSSEQNYDVGDRELLAIKSAFQEWRHLLEWASHPILVFSDHK 648
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ YLR + L + L + H+ F PG+ N AD+LSR S P+
Sbjct: 649 N-LEYLR-----SAKRLRPRQARWALFFSRFNFHV--TFRPGSKNGKADALSRLFSAPE 699
>gi|326537262|ref|YP_004300274.1| replicase [Sweet potato vein clearing virus]
gi|325698381|gb|ADZ45064.1| replicase [Sweet potato vein clearing virus]
Length = 619
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 94/439 (21%), Positives = 175/439 (39%), Gaps = 48/439 (10%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK----GNGGTRPVLNLKGLNQFLSPKK 529
LHI E+L+ G ++ S + S F+V K G +R V++ + LN
Sbjct: 204 EEFKLHIDELLKGGFIR--PSNSKHSSPAFIVNKHSEQKRGKSRMVIDYRNLNAKTKTYN 261
Query: 530 FSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
+ L N +Q ++ D ++H+ + + A + L FG
Sbjct: 262 YPLPNKILRVRQVQGYNWFSKFDCKSGFYHLKLTEESKHLSAFNVPQGFYEFNVLMFGYK 321
Query: 590 TAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN 649
AP + + S L + +VY+DD LL ++ E K I+ G V+
Sbjct: 322 NAPGRYQCYMDSYFSKLEN----CIVYIDDILLYSKTKDEHETLLKKFYHIVKEAG--VS 375
Query: 650 L-QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLG 708
L +K ++ ++FLGI D + + N + T + LD+ + L
Sbjct: 376 LSEKKAIIGVNQIEFLGIEIDK----------SGVKMQNHIVTKIVQCEEVLDTKKKLQS 425
Query: 709 YLSFASFV---IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
+L + V +P + + Q L + H + L K++ + LP +
Sbjct: 426 FLGLINQVREYVP--NIAKELLFLQKKLKKDVEYHFDSQDQEKLKKIKEKCSNLP--KLL 481
Query: 766 FPRQVQHF---ISTDASDLGWGS-----------QVDSSFLSGLWSREQQNWHINKKEMF 811
FP + + F + TDAS++ +G + + SG + ++NW IN+KE+
Sbjct: 482 FPDETKQFDWIVETDASEISYGGVLKYKYHQDKIEYHCRYYSGTFKDNEKNWEINRKELL 541
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI 871
+V + L P + + ++++DN V +L G + E+ ++ + + +
Sbjct: 542 SVFKCLYAFEPYIVYNKFILRTDNTQVKYWL--TGKLDNSVTTKEIRRLVVKINCYNFDV 599
Query: 872 LAQFIPGAYNSVADSLSRS 890
+ I N AD LSR
Sbjct: 600 VV--IKSKDNCFADYLSRE 616
>gi|296142307|gb|ADG96108.1| polymerase [Duck hepatitis B virus]
Length = 788
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 98/220 (44%), Gaps = 11/220 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+LFL K + T R V++ KG N P+ +S N + L G IS+DL
Sbjct: 391 KLFLADKNSRNTTEARLVVDFSQFSKGKNAMRFPRYWS-PNLSTLRRILPVGMPRISLDL 449
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMR 612
SQA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 450 SQAFYHLPLNPASSSRLAVSDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW 509
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D
Sbjct: 510 TFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTDIRFLGYQIDEKY 569
Query: 673 DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
M + E + L +++ + ++ + +G+L+F
Sbjct: 570 --MKIEESRWKELRTVIKKIKVGTWYDWKCIQRFVGHLNF 607
>gi|331250430|ref|XP_003337824.1| reverse transcriptase/ribonuclease H [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
Length = 517
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 117/501 (23%), Positives = 214/501 (42%), Gaps = 73/501 (14%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQ------HLATPVSS-------AMSLHIQEMLETGVLKR 491
++ G+ F+ P L +++ HL++ ++S +H + M +
Sbjct: 7 VLHGFKYGFNQGIPDHKLGNIRWFTPDNHLSSALASEKIKESMGKEVHAKRMYGPFTHEE 66
Query: 492 LDSTTGFL--SRLFLVPKGNGGTRPVLNLK---------GLNQFLSPKKFSLI-NHFR-I 538
S GF S L V G+G RP+ +L +N F++ FS + F +
Sbjct: 67 AYSRLGFFRSSPLGAVVNGDGSVRPINDLSFPHGDPDFPSVNSFVNKDDFSTTWDDFNMV 126
Query: 539 PSFLQKGDYMISI---DLSQAYFHVPIKTTHQRFLALS-YNGDVLAMTCLPFGLATAPQA 594
SF +K D+ +S+ D +AY +P + R+L + +G + T + FG +
Sbjct: 127 ASFFRKLDHPVSLALFDWEKAYRQIPTHPSQWRYLMVKGLDGKLYLDTRITFGGVAGCGS 186
Query: 595 FASLSNWVASLLRSRG--MRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
F ++ ++ + ++V ++DD L + Q E++ + S+ LG + N +K
Sbjct: 187 FGRPADAWKKIMENEFDLIKVFRWVDDNLFIKQSSSNTEMKHIIERSM--KLGVLTNEKK 244
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG-NILRTLLASKTWNLDSARSLLGYLS 711
S S + +F+G +W+ + LPE+K T I SK+++ + L G L+
Sbjct: 245 CS-SFSDEQKFIGFIWNGKDKTVRLPEEKLETRKLQIQEFQNESKSFSFNEVEVLTGRLN 303
Query: 712 FASFVIPMGRLHSRRIQR-QASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQV 770
++++P + + R + R + + A TP V L W++ L P
Sbjct: 304 HVTYILPQLKCYLRSLYRWMLNWFDVEAKRKTP--EDVQKDLVRWMSVLQSFQP-----T 356
Query: 771 QHFISTDASDLGWGSQVDSSF--------------LSGLWSREQQNWHINKKEMFAVHQA 816
+ +S++ ++GW SSF L G W E + I E AV
Sbjct: 357 RLIMSSEPKEIGWVGDASSSFGIGVLIGKYWSQLKLQGGWRDEDNHKTIAWLETAAVRVG 416
Query: 817 LSLNLPL---LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILA 873
+ + L L + S V+V +DN T S + ++ S ++V + + QD I
Sbjct: 417 ILMLLHLGRHRKGSNVIVWTDNTTTESAIEKRK-----SEDADVNEEWKWIQD--ELIKN 469
Query: 874 QF-IPG----AYNSVADSLSR 889
+F I G + N+VAD+LSR
Sbjct: 470 EFDITGRRVKSKNNVADALSR 490
>gi|341890014|gb|EGT45949.1| hypothetical protein CAEBREN_01577 [Caenorhabditis brenneri]
Length = 2362
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 113/487 (23%), Positives = 185/487 (37%), Gaps = 89/487 (18%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V I + +P A+P VP+ + + HI ++++G + +S T + S +
Sbjct: 1375 VHIYTNTEVPVRARPYRVPV--------KYQAELEKHINSLIKSGRIT--ESNTPWTSPI 1424
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFH 559
LV K NG R L+ + LN P F L RI + L++ Y S+D++ Y
Sbjct: 1425 VLVKKKNGSLRVCLDFRRLNDVTIPDNFPLP---RIDAILERVGGSKYFTSMDMANGYLQ 1481
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDD 619
+ + + V A T LPFGL +A F V + L + VY+DD
Sbjct: 1482 LRLDPSSSYKCGFITETKVYAYTHLPFGLKSAASYFQRALKTVLAGLEDDAL---VYIDD 1538
Query: 620 FLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPE 679
L+ ++ + + + S + +K + + FLG
Sbjct: 1539 ILVFSKTFEQHLLSLRKVLDRFRSFNLKASPKKCEFAKTSI-TFLG-------------- 1583
Query: 680 DKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP 739
I + A N+D S A+F +P RR A R P
Sbjct: 1584 ------HEISQNSYAPNKANVD---------SIAAFPVPSNINEVRRYVGMAGFFRKFIP 1628
Query: 740 HLTPI----------------NPAVLPKLEWWLNALPLSSPI--FPRQVQHF-ISTDASD 780
+ + I N A E AL +S PI FP + F I DAS
Sbjct: 1629 NFSEIAEPLTRLTRKNTSFEWNSAQQEAFETLRTAL-ISEPILGFPDYDKPFHIFCDASA 1687
Query: 781 LGWGSQV-----DS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV 828
+ G+ + DS ++ S S + W + EM A+ AL P S
Sbjct: 1688 VAQGAALMQTRPDSEKEFTAIAYASRTLSDPETRWPAIQVEMGAIIFALRQFKPYTCMSK 1747
Query: 829 VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+++ SD++ + L++ +L+ + + Q + I I+ I G N+VAD LS
Sbjct: 1748 IVLHSDHKPLTFLLQKSKTHDNLA------RWLIELQCYDITIV--HIDGKKNTVADCLS 1799
Query: 889 RSKSLPD 895
R++ D
Sbjct: 1800 RARDNED 1806
>gi|384495516|gb|EIE86007.1| hypothetical protein RO3G_10717 [Rhizopus delemar RA 99-880]
Length = 264
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 80/154 (51%), Gaps = 7/154 (4%)
Query: 777 DASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQ 836
DAS+ GWG + G W+ ++ IN +E+ A AL P L ++ V++++DN
Sbjct: 38 DASNSGWGCAYLTQRAHGYWTNQEAQMSINWRELKAAFLALQA-FPHLTNTTVLIRTDNT 96
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR---SKSL 893
T ++Y+ +QGGTKS SL++ ++ + +++ + +N AD SR +K+L
Sbjct: 97 TSLAYINKQGGTKSFSLMTLATTLWKWCLQRGLMLVSSHVSDIHNCKADYESRRSFTKNL 156
Query: 894 PDWHLSRSATEQIFL-KWGVPCIDLFASRVSAVV 926
W + + +WG +D+FA R S ++
Sbjct: 157 --WQVKPEVFNSLLQSQWGPHGVDMFADRTSNLL 188
>gi|301622913|ref|XP_002940769.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1233
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 107/463 (23%), Positives = 183/463 (39%), Gaps = 71/463 (15%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P + LA P A+ +++E L G ++ S G + +F V K + RP ++ +
Sbjct: 407 IPFGRIYPLAEPELKALRSYLEENLAKGFIRPSTSPAG--AGIFFVEKKDHTLRPCIDYR 464
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
LN ++ L IP Q+ +DL AY V I+ + A
Sbjct: 465 NLNSITIKNRYPLP---LIPELFQRLREAKIFSKLDLRGAYNLVRIREGDEWKTAFRTRY 521
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP A+ ++V + R + V++YLDD L+ ++ + +
Sbjct: 522 GHFEYLVMPFGLCNAP---ATFQHFVNDVFRDYLDVFVIIYLDDILVFSKSVAEHRVHME 578
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
S L S +K + +FLG + +M Q + IL
Sbjct: 579 KIFSRLRSHQLFAKFEKCEFDKTSI-EFLGFIISAEGIQM-----DQKKISAIL------ 626
Query: 696 KTWNLD-SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW 754
W + S + + ++ FA+F RR + S + L LT ++ K W
Sbjct: 627 -DWPVPLSRKDVQRFIGFANFY--------RRFIKGFSQIMLPITRLTSLST----KFHW 673
Query: 755 ---------WLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVDSSF----------- 791
L L S+PI P + FI DAS+ G+ + F
Sbjct: 674 TPQAQSAFELLKTLFTSAPILQHPDPARPFILEVDASESAVGAVLSQRFPASGALHPVAY 733
Query: 792 LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTK 849
S ++ +QN+ + +E+ A+ AL LL+ +++ +D++ + YLR K
Sbjct: 734 FSRKMNKSEQNYDVADRELLAIKLALEEWRYLLEGGPHPILIYTDHKNL-EYLR---VAK 789
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
L +F + R + + PG+ NS AD+LSR S
Sbjct: 790 RLRPRQARWALFFM----RFNFHLTYRPGSKNSKADALSRIHS 828
>gi|308456407|ref|XP_003090646.1| hypothetical protein CRE_09882 [Caenorhabditis remanei]
gi|308262093|gb|EFP06046.1| hypothetical protein CRE_09882 [Caenorhabditis remanei]
Length = 1305
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 120/275 (43%), Gaps = 24/275 (8%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
A P K VP SL+ A+S I + TGVLK LD + + + + V K N
Sbjct: 398 AQPVFRKSRPVPYASLE--------ALSNEIDRLEATGVLKSLDHS-DWAAPVVAVTKKN 448
Query: 510 GGTRPVLNL-KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
G R + GLN + + L I + L G + IDL+ AY + + ++
Sbjct: 449 GSIRLCSDFSTGLNDAIEAHQHPLPTADDIFAKLNGGKFFSQIDLADAYLQIEVDDDSKK 508
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
L ++ + + LPFG+ AP F + + + + L V YLDD ++
Sbjct: 509 LLVINTHKGLFHYNRLPFGVKAAPGIFQQVMDTMLAGLDG----VSCYLDDIIVTGCSIE 564
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI 688
+ K + + S G+ + L+K S P +QFLG + + + P+ +++
Sbjct: 565 EHNQRVKKVIERIASFGFRMRLEKCSFL-MPEIQFLGFVINEQGRK---PDPQKIA---D 617
Query: 689 LRTLLASKTWNLDSARSLLGYLSF-ASFVIPMGRL 722
++ + A K N RS LG + F +FV + RL
Sbjct: 618 IKAMPAPK--NAIEVRSFLGLIQFYGTFVRDLHRL 650
>gi|301615822|ref|XP_002937370.1| PREDICTED: retrotransposon-like protein 1-like [Xenopus (Silurana)
tropicalis]
Length = 1217
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 107/463 (23%), Positives = 183/463 (39%), Gaps = 71/463 (15%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P + LA P A+ +++E L G ++ S G + +F V K + RP ++ +
Sbjct: 515 IPFGRIYPLAEPELKALRSYLEENLAKGFIRPSTSPAG--AGIFFVEKKDHTLRPCIDYR 572
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
LN ++ L IP Q+ +DL AY V I+ + A
Sbjct: 573 NLNSITIKNRYPLP---LIPELFQRLREAKIFSKLDLRGAYNLVRIRKGDEWKTAFRTRY 629
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP A+ ++V + R + V++YLDD L+ ++ + +
Sbjct: 630 GHFEYLVMPFGLCNAP---ATFQHFVNDVFRDYLDVFVIIYLDDILVFSKSVAEHRVHME 686
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
S L S +K + +FLG + +M Q + IL
Sbjct: 687 KIFSRLRSHQLFAKFEKCEFDKTSI-EFLGFIISAEGIQM-----DQKKISAIL------ 734
Query: 696 KTWNLD-SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW 754
W + S + + ++ FA+F RR + S + L LT ++ K W
Sbjct: 735 -DWPVPLSRKDVQRFIGFANFY--------RRFIKGFSQIMLPITRLTSLST----KFHW 781
Query: 755 ---------WLNALPLSSPIF--PRQVQHFI-STDASDLGWGSQVDSSF----------- 791
L L S+PI P + FI DAS+ G+ + F
Sbjct: 782 TPQAQSAFELLKTLFTSAPILQHPDPARPFILEVDASESAVGAVLSQRFPASGALHPVAY 841
Query: 792 LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTK 849
S ++ +QN+ + +E+ A+ AL LL+ +++ +D++ + YLR K
Sbjct: 842 FSRKMNKSEQNYDVADRELLAIKLALEEWRYLLEGGPHPILIYTDHKNL-EYLR---VAK 897
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
L +F + R + + PG+ NS AD+LSR S
Sbjct: 898 RLRPRQARWALFFM----RFNFHLTYRPGSKNSKADALSRIHS 936
>gi|190360813|gb|ACE76858.1| polyprotein [Citrus yellow mosaic virus]
Length = 1967
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 107/456 (23%), Positives = 184/456 (40%), Gaps = 56/456 (12%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
L+H+ + + H++ +L+ G ++ S + + +V G G R
Sbjct: 1385 LKHVTPQMEESFRKHVEALLKIGAIR--PSKSRHRTTAIIVNSGTSIDPITGKEVKGKER 1442
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYFHVPIKTTHQRFL 570
V N K LN + ++SL I + LQ KG + S DL + V + +
Sbjct: 1443 MVFNYKRLNDLTNKDQYSLPG---IQTILQRLKGSTIFSKFDLKSGFHQVAMHPDSIEWT 1499
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
A + +PFGL AP F + + + VY+DD L+ ++ R
Sbjct: 1500 AFWVPSGLYEWLVMPFGLKNAPAVFQRKMD---HCFKGTEAFIAVYIDDILVFSKSEREH 1556
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
E K+ +SI G I++ K ++ + +FLG + L ++ Q + L
Sbjct: 1557 EEHLKIMLSICQKNGLILSPTKMKIAQVEI-EFLGAIIHNGLIKL------QPHIVQKLL 1609
Query: 691 TLLASKTWNLDSARSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAVL 749
T + + RS LG L++A IP MGRL S A + G + + A++
Sbjct: 1610 TFTNKQLEEVKGLRSWLGLLNYARSYIPHMGRLLSPLY---AKVSPTGERRMNRQDWALI 1666
Query: 750 PKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG-------SQVD-------SSFLSG 794
K+ + LP L P P I TD GWG +Q D ++ SG
Sbjct: 1667 DKIRAQVQNLPALELP--PADCFIIIETDGCMDGWGGVCKWKLAQYDPRSSERVCAYASG 1724
Query: 795 LWSREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSL 853
++ + E+ AV +L + + L S + +++D Q ++S+ + K +
Sbjct: 1725 KFNPPKSTI---DAEIHAVMNSLNNFKIYYLDKSSLCLRTDCQAIISFFNKSNVNKPSRV 1781
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
FL ++I + I G N +AD+LSR
Sbjct: 1782 RWIAFTDFLTGLGIPVNI--EHIDGKNNHLADALSR 1815
>gi|375281631|gb|AFA44809.1| pol protein [Macaque simian foamy virus]
Length = 1149
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 120/244 (49%), Gaps = 15/244 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
++ + I ++L+ GVL + +ST + ++ VPK +G R VL+ + +N+ + +
Sbjct: 177 SIQIVIDDLLKQGVLIQQNSTMN--TPVYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQH 234
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I S + +G Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 235 SAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPAL 294
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + V LL++ V Y+DD + + DP+ Q + SIL + G++V+L+KS
Sbjct: 295 FTAD---VVDLLKTIP-NVQAYVDDIYISHDDPQEHLEQLEKVFSILLNAGYVVSLKKSE 350
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
++ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 351 IAQREV-EFLGF----NITK----EGRGLTDTFKQKLLNITPPKDLKQLQSVLGLLNFAR 401
Query: 715 FVIP 718
IP
Sbjct: 402 NFIP 405
>gi|308482634|ref|XP_003103520.1| hypothetical protein CRE_28691 [Caenorhabditis remanei]
gi|308259941|gb|EFP03894.1| hypothetical protein CRE_28691 [Caenorhabditis remanei]
Length = 1059
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 105/480 (21%), Positives = 199/480 (41%), Gaps = 61/480 (12%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
A P K VP SL+ A+S I + TGVLK LD + + + + V K N
Sbjct: 310 AQPVFRKSRPVPYASLE--------ALSNEIDRLEATGVLKSLDHS-DWAAPVVAVTKKN 360
Query: 510 GGTRPVLNLK-GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
G R + GLN + + L I + L G + IDL+ AY + + ++
Sbjct: 361 GSIRLCSDFSTGLNDAIEAHQHPLPTADDIFAKLNGGKFFSQIDLADAYLQIEVDDDSKK 420
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
L ++ + + LPFG+ AP F + + + + L V YLDD ++
Sbjct: 421 LLVINTHKGLFHYNRLPFGVKAAPGIFQQVMDTMLAGLDG----VSCYLDDIIVTGCSIE 476
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNI 688
+ K + + S G+ + L+K S P +QFLG + + + P+ +++
Sbjct: 477 EHNQRVKKVIERIASFGFRMRLEKCSFL-MPEIQFLGFVINEQGRK---PDPQKIA---D 529
Query: 689 LRTLLASKTWNLDSARSLLGYLSF-ASFVI-------PMGRLHSRRIQRQASLLRLGAPH 740
++ + A K N RS LG + F +FV P+ +L ++ ++ +
Sbjct: 530 IKAMPAPK--NAIEVRSFLGLIQFYGTFVRDLHRLRPPLDKLTNKDVEFKWD-------- 579
Query: 741 LTPINPAVLPKLEWWLNALPLS--SPIFPRQVQHFISTDASDLGWGSQVDSSF------- 791
T A E + L L+ +P P ++ DAS G G+ + F
Sbjct: 580 -TECQHAFDQVKEMLQSDLLLTHYNPKLPI----IVAADASQYGIGATISHRFPDGKEKA 634
Query: 792 ---LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGT 848
+S ++ Q+N+ +KE F + A++ + +++D++ ++S + G
Sbjct: 635 IYHVSKALNKAQRNYSQIEKEAFGLVTAVTKFHKFVHGRRFTLRTDHKPLLSIFGEKKGV 694
Query: 849 KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFL 908
+ + +++ + ++ I ++I D+LSR + D R TE++ +
Sbjct: 695 -PIYTANRLQRWATILMNYNFSI--EYINTKNFGQVDALSR--LISDQMQQREETEEVVI 749
>gi|242775227|ref|XP_002478601.1| retrovirus polyprotein, putative [Talaromyces stipitatus ATCC 10500]
gi|218722220|gb|EED21638.1| retrovirus polyprotein, putative [Talaromyces stipitatus ATCC 10500]
Length = 1764
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 97/448 (21%), Positives = 169/448 (37%), Gaps = 46/448 (10%)
Query: 463 CSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLN 522
C L + A +I E L+ G + S F S + + K GG R ++ + LN
Sbjct: 775 CPLYRMTADELEAAKEYILENLDKGFIA--PSNVPFASPILMAKKPGGGLRFCVDYRKLN 832
Query: 523 QFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
++ L + + K +D+ Q + + +
Sbjct: 833 ALTRKDRYPLPLIDEVFERISKAKIFTKLDIRQGFHRIRMHKDSSDLTTFRCRYGTFKYE 892
Query: 583 CLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILG 642
+PFGL P F L N + L ++ ++DD L+ + + EI + + L
Sbjct: 893 VMPFGLTNGPATFQRLINDI--FLDCLDKFLIAFIDDLLIYSDNAAEHEIHVRTVLQRLR 950
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDS 702
G +++K ++LG + PE ++ + L TW + +
Sbjct: 951 DAGLQASIKKCEFH-VTTTKYLGFII--------TPEGIKVDSEKVESVL----TWKVPT 997
Query: 703 ARSLLG---YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNA 758
++LG +L F +F +SR Q L R P + T A KL+ L +
Sbjct: 998 --TVLGIQSFLGFCNFYRKFIAEYSRIAQPLHRLTRSNVPFVWTDKCQAAFDKLKVALVS 1055
Query: 759 LPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLW----------SREQQNWHINKK 808
P+ P ++ + TDASD + + G W S + N+ I+ K
Sbjct: 1056 APVLVHYDPTRLTR-VETDASDGVVAAVLSQLCDDGEWHPVAYYSSSMSSAEHNYDIHDK 1114
Query: 809 EMFAVHQALSLNLP----LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
EM A+ +A P L Q V SD++ + ++ TK+LS FL
Sbjct: 1115 EMLAIIKAFREWRPELLGLRQQERFEVLSDHRALEYFM----TTKALSARQVRWYEFLQE 1170
Query: 865 QDWRIHILAQFIPGAYNSVADSLSRSKS 892
+ + ++ PG N +AD+L+R K
Sbjct: 1171 ----FYFILKYRPGRANVLADTLTRRKD 1194
>gi|387965727|gb|AFK13856.1| Ty3/gypsy retrotransposon protein [Beta vulgaris subsp. vulgaris]
Length = 1631
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 99/428 (23%), Positives = 173/428 (40%), Gaps = 54/428 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I+EML G+++ ST+ F S + LV K +G R ++ + LN+ P K+ + +
Sbjct: 711 IKEMLAAGIIQ--PSTSPFSSPVILVKKKDGSWRFCVDYRALNKETVPDKYPIPVIDELL 768
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L +DL Y + ++ A + +PFGL AP F SL
Sbjct: 769 DELHGATVFSKLDLRAGYHQILVRPEDTHKTAFRTHEGHYEFLVMPFGLTNAPATFQSLM 828
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N V R V+V+LDD L+ ++ ++ + +L VN +K
Sbjct: 829 NEVFRPFLRR--FVLVFLDDILIYSRSDEEHVGHLEMVLGMLAQHALFVNKKKCEFGKRE 886
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNL-DSARSLLGYLSFASF 715
V +LG H+ ++ G + + A W + + R L G+L +
Sbjct: 887 V-AYLG-----HV----------ISEGGVAMDTEKVKAVLEWEVPKNLRELRGFLGLTGY 930
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPIN---PAVLPKLEWWLNALPLSSPIFPR---Q 769
+ + + A + R L N A + L + +S+P+ Q
Sbjct: 931 -------YRKFVANYAHIARPLTEQLKKDNFKWSATATEAFKQLKSAMVSAPVLAMPNFQ 983
Query: 770 VQHFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
+ + TDAS G G+ + ++ S L Q + +KE+ A+ A+ L
Sbjct: 984 LTFVVETDASGYGMGAVLMQDNRPIAYYSKLLGTRAQLKSVYEKELMAICFAVQKWKYYL 1043
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF--LLSQDWRIHILAQFIPGAYNS 882
+V++D Q+ + Y+ T+ + +E +K L+ D+ IH + PG N
Sbjct: 1044 LGRHFVVRTDQQS-LRYI-----TQQREIGAEFQKWVSKLMGYDFEIH----YKPGLSNR 1093
Query: 883 VADSLSRS 890
VAD+LSR
Sbjct: 1094 VADALSRK 1101
>gi|342365298|gb|AEL30041.1| polymerase polyprotein [Dahlia common mosaic virus]
Length = 673
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 107/437 (24%), Positives = 181/437 (41%), Gaps = 45/437 (10%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG----NGGTRPVLNLKGLNQFLSPKK 529
+ I+E+L+ V++ S + +S FLV K G R V+N K LN+
Sbjct: 255 KEFEIQIKELLDLKVIE--PSKSQHMSPAFLVEKEAEKRRGKKRMVVNYKKLNEVTIGDS 312
Query: 530 FSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
+L N + + L+ S D ++ V + Q+ A + +PFGL
Sbjct: 313 HNLPNMQELITLLRGKTIFSSFDCKSGFWQVFLDQESQKLTAFTCPQGHFQWRVVPFGLK 372
Query: 590 TAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN 649
AP F + + LR +VY+DD ++ + + + SLG I++
Sbjct: 373 QAPSIF---QRHMQNALRGLEEFCLVYVDDIIVFSDKEEEHYTHVLKVLKRIESLGIILS 429
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDR-MWLPEDKQLT-LGNILRTLLASKTWNLDSARSLL 707
+K++L + FLG+ +DR P++ L L N L K + L
Sbjct: 430 KKKTNLFKEKI-NFLGL----EIDRGTHTPQNHILEHLHNFPDRLEDKK-----QLQRFL 479
Query: 708 GYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIF 766
G L++A IP +L +R Q L + + + K++ L P L P
Sbjct: 480 GVLTYADSYIP--KLAEKRKPLQVKLKKDQVWIWNQSDTDYVKKIKKGLVNFPKLYLP-- 535
Query: 767 PRQVQHFISTDASDLGW----------GSQVDSSFLSGLWSREQQNWHINKKEMFAVHQA 816
++ I TDASD W G+++ + SG + + N+H N+KE+ AV Q
Sbjct: 536 KKEDSLIIETDASDHFWGGVLKAQTTEGNELICRYSSGTFKPAELNYHSNEKELLAVKQV 595
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS-QDWRIHIL--A 873
++ L V++DN V+ L+R TK + + ++ L+ Q W H
Sbjct: 596 ITKFSIYLTPVTFTVRTDN---VNLLKRFMNTK---ITGDSKQGRLIRWQMWLSHYTFNV 649
Query: 874 QFIPGAYNSVADSLSRS 890
+ G N +AD L+R
Sbjct: 650 NHLKGEKNVLADYLTRE 666
>gi|307197138|gb|EFN78492.1| hypothetical protein EAI_11211 [Harpegnathos saltator]
Length = 134
Score = 68.6 bits (166), Expect = 2e-08, Method: Composition-based stats.
Identities = 37/130 (28%), Positives = 68/130 (52%), Gaps = 2/130 (1%)
Query: 748 VLPKLEWWLNALPL--SSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHI 805
+L L+WW +L SS I + I +DAS+ GWG+ + W++EQ++WHI
Sbjct: 5 ILENLDWWKVSLTSGSSSTIKRDKFNLVIYSDASNTGWGATDGRRKIYKFWNKEQKSWHI 64
Query: 806 NKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
N KE+ AV A+ ++ ++++ DN T ++Y+ + G K KI+ ++
Sbjct: 65 NYKELLAVKYAVENLASERRNCRILLRVDNTTAIAYINKMGSVKFQKFNELARKIWQWAE 124
Query: 866 DWRIHILAQF 875
+I ++A +
Sbjct: 125 KRKIILMASY 134
>gi|208609049|dbj|BAG72147.1| hypothetical protein [Lotus japonicus]
gi|208609053|dbj|BAG72149.1| hypothetical protein [Lotus japonicus]
Length = 1520
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 110/482 (22%), Positives = 193/482 (40%), Gaps = 74/482 (15%)
Query: 427 RRFVDAWIRL--GAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEML 484
RR D I+L GA P +R Y PF K + L ++EML
Sbjct: 572 RRTTDHAIQLQEGASIPNIR---PYRYPFYQKNEIEKL-----------------VKEML 611
Query: 485 ETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK 544
+G+++ ST+ F S LV K +GG R ++ + +N+ P KF + + +
Sbjct: 612 NSGIIRH--STSPFSSPAILVKKKDGGWRFCVDYRAINKATIPDKFPIPIIDELLDEIGA 669
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
+DL Y + +K A + LPFGL AP F +L N V
Sbjct: 670 AVVFSKLDLKSGYHQIRMKEEDIPKTAFRTHEGHYEYLVLPFGLTNAPSTFQALMNQVLR 729
Query: 605 -LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
LR V+V+ D L+ +++ + + ++ + +L + N +K S ++
Sbjct: 730 PYLRK---FVLVFFYDILIYSKNEELHKDHLRIVLQVLKENNLVANQKKCSFGQPEIIYL 786
Query: 664 ------LGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFV 716
G+ DP + ++ +L W + + L G+L +
Sbjct: 787 GHVISQAGVAADP----------------SKIKDML---DWPIPKEVKGLRGFLGLTGYY 827
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-S 775
+ +S+ Q LL+ + T KL+ + +P+ P P + FI
Sbjct: 828 RRFVKNYSKLAQPLNQLLKKNSFQWTEEATQAFVKLKEVMTTVPVLVP--PNFDKPFILE 885
Query: 776 TDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDAS G G+ + +++S S Q + ++E+ AV A+ L S +
Sbjct: 886 TDASGKGLGAVLMQEGRPVAYMSKTLSDRAQAKSVYERELMAVVLAVQKWRHYLLGSQFV 945
Query: 831 VQSDNQTVVSYL--RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ +D Q + +L +R G + +S+ L+ D+ I ++ PG N AD+LS
Sbjct: 946 IHTD-QRSLRFLADQRIMGEEQQKWMSK-----LMGYDFEI----KYKPGIENKAADALS 995
Query: 889 RS 890
R
Sbjct: 996 RK 997
>gi|130398|sp|P20825.1|POL2_DROME RecName: Full=Retrovirus-related Pol polyprotein from transposon
297; Includes: RecName: Full=Protease; Includes:
RecName: Full=Reverse transcriptase; Includes: RecName:
Full=Endonuclease
gi|6015506|emb|CAB57796.1| unnamed protein product [Drosophila melanogaster]
Length = 1059
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 112/460 (24%), Positives = 188/460 (40%), Gaps = 58/460 (12%)
Query: 461 PLCSLQH-LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK-----GNGGTRP 514
P+ S Q+ LA + +QEML G+++ +S + + S ++VPK G R
Sbjct: 206 PIYSKQYPLAQTHEIEVENQVQEMLNQGLIR--ESNSPYNSPTWVVPKKPDASGANKYRV 263
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
V++ + LN+ P ++ + N I L K Y +IDL++ + + + A S
Sbjct: 264 VIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFST 323
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD--PRILEI 632
+PFGL AP F N + L ++ +VYLDD ++ + + I
Sbjct: 324 KSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNK--HCLVYLDDIIIFSTSLTEHLNSI 381
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL 692
Q L + L + L K FLG + P D + + + + I+
Sbjct: 382 Q--LVFTKLADANLKLQLDKCEFLKKEA-NFLGHIVTP--DGI---KPNPIKVKAIVSYP 433
Query: 693 LASKTWNLDSARSLLGYL-----SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
+ +K + + L GY ++A PM +R T I+
Sbjct: 434 IPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKR---------------TKIDTQ 478
Query: 748 VLPKLEWW--LNALPLSSPI--FPRQVQHFI-STDASDLGWGSQVDS-----SFLSGLWS 797
L +E + L AL + PI P + F+ +TDAS+L G+ + SF+S +
Sbjct: 479 KLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLN 538
Query: 798 REQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEV 857
+ N+ +KE+ A+ A L ++ SD+Q LR K E
Sbjct: 539 DHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDHQP----LRWLHNLKEPGAKLER 594
Query: 858 EKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWH 897
++ L ++I +I G NSVAD+LSR K + H
Sbjct: 595 WRVRLSEYQFKI----DYIKGKENSVADALSRIKIEENHH 630
>gi|294954394|ref|XP_002788146.1| gag/pol/env polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239903361|gb|EER19942.1| gag/pol/env polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 1718
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 105/440 (23%), Positives = 177/440 (40%), Gaps = 83/440 (18%)
Query: 480 IQEMLETGVLKRLDS--TTGFLSRLFLVPKGNGGTRPVLNLKGLN----QFLSPKKFSLI 533
++EM + G +K +D T+ + FL KGNG R + +L+ +N F S F +
Sbjct: 785 VREMEDKGWIKIIDDKDTSQWFCPTFLKLKGNGKVRVLNDLREVNARIRSFASQSSFGAV 844
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ + I++D+S AY VP+ QRFL G P GL+ +P
Sbjct: 845 ----LGGIPRHAKSFITLDISNAYHSVPVDVESQRFLGGVLGGIRFKWLVCPQGLSISPY 900
Query: 594 AFA-SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
+ LS+ ++ + + V+ Y+DD ++ D K +S L S +V +K
Sbjct: 901 FWELYLSSMLSGIEFPPQVTVLWYVDDIIICAPDDVSALAAKKAIISALVSENVMVAEEK 960
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
PA + FLG++ D H W P+++ L + LR L K N S LG +++
Sbjct: 961 -CCGPARSVNFLGLVIDEH---GWKPQEEPL---DQLRRL--PKPRNRGELHSFLGVVNY 1011
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP--KLEW---------WL----- 756
V L HL P+ ++ + +W WL
Sbjct: 1012 LRGVYDPSELQK---------------HLAPLQDLLVKGRRFQWSEAHDLAFEWLQTSIK 1056
Query: 757 NALPLSSPI-FPRQVQH----FISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMF 811
N L P+ F Q+Q + TDASDLG LW +H +K E
Sbjct: 1057 NQLYAHQPVSFGTQLQQGEGWVLQTDASDLG--------IACVLW-----RFHFDKVEDG 1103
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS--QDWRI 869
Q + ++ V ++ R+ G++ + ++E + L+ + R
Sbjct: 1104 T------------QVTPEILGQFGDVVSTWSRKLRGSEKRWAMFDLEGLALVEGLRRLRA 1151
Query: 870 HILAQFIPGAYNSVADSLSR 889
++ + I G N++AD SR
Sbjct: 1152 YLRMEHIRGDSNNLADIFSR 1171
>gi|301607174|ref|XP_002933191.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1456
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 110/468 (23%), Positives = 186/468 (39%), Gaps = 62/468 (13%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
+ ++ G AIPF PL + P + +I E LE G ++ S G + +
Sbjct: 511 IELLPGAAIPFGRIYPL---------SEPELDVLKKYIDENLEKGFIRPSTSPAG--AGI 559
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYFH 559
F V K + RP ++ + LNQ ++ L +P Q +G + S +DL AY
Sbjct: 560 FFVEKKDHSLRPCIDYRHLNQITIKNRYPLP---LVPELFQNLRGAKIFSKLDLRGAYNL 616
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDD 619
V I+ + A +PFGL AP F N + + + +VYLDD
Sbjct: 617 VRIREGDEWKTAFRSRYGHFEYLVMPFGLCNAPATFQHFINDIFRDFLDQFL--IVYLDD 674
Query: 620 FLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPE 679
L+ + ++ + L L+K + + +FLG + +M
Sbjct: 675 ILIFSTSEAEHQVHMQKVFKRLRLHHLFAKLEKCEFHKSSI-EFLGFIISTEGVQM---- 729
Query: 680 DKQLTLGNILRTLLASKTWNLDSAR-SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA 738
D+ R + A W + S+R ++ ++ FA+F + S+ I L
Sbjct: 730 DQ--------RKVSAIIDWPIPSSRKAVQSFIGFANFYRKFIQGFSKVISPITDLTCTSR 781
Query: 739 PH--LTPINPAV--LPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG---SQVD--- 788
P T A L KL L +P+ P + DAS++ G SQ D
Sbjct: 782 PFSWTTQAQTAFDHLKKLFVSAPILKHVNPVLP----FVLEVDASEIAVGAILSQRDIGK 837
Query: 789 -----SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSY 841
+F S + +QN+ ++ +E+ A+ A LL+ + +++ SD++ + Y
Sbjct: 838 DFLHPVAFFSKKLTSSEQNYDVSDRELLAIKAAFEEWRHLLEGAAHPIIIFSDHRN-LEY 896
Query: 842 LRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
LR + L + L + HI + PG N AD+LSR
Sbjct: 897 LR-----TAKRLKPRQARWALFFSRFNFHI--TYRPGCQNKKADALSR 937
>gi|1334942|emb|CAA41394.1| pol [Simian foamy virus]
Length = 970
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 119/244 (48%), Gaps = 15/244 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
++ + I ++L+ GVL + +ST + ++ VPK +G R VL+ + +N+ + +
Sbjct: 189 SIQIVIDDLLKQGVLIQQNSTMN--TPVYPVPKPDGKWRMVLDYREVNKIIPLIAAQNQH 246
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I S + +G Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 247 SAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPAL 306
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + V LL+ V Y+DD + + DP+ Q + SIL + G++V+L+KS
Sbjct: 307 FTAD---VVDLLKEIP-NVQAYVDDIYISHDDPQEHLEQLEKIFSILLNAGYVVSLKKSE 362
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
++ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 363 IAQREV-EFLGF----NITK----EGRGLTDTFKQKLLNITPPKDLKQLQSILGLLNFAR 413
Query: 715 FVIP 718
IP
Sbjct: 414 NFIP 417
>gi|301623719|ref|XP_002941161.1| PREDICTED: hypothetical protein LOC100497966 [Xenopus (Silurana)
tropicalis]
Length = 855
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 89/185 (48%), Gaps = 19/185 (10%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFL 504
+V+ + + KP +P Q A+S ++++LE GV++ +S + + S + L
Sbjct: 471 VVTEPQVKVNVKPYRIPKARRQ--------AVSEEVRKILELGVIE--ESHSDWSSPIVL 520
Query: 505 VPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGD---YMISIDLSQAYFHVP 561
+PK +G R + + LN+ KF R+ +++ D Y+ ++DL++ Y VP
Sbjct: 521 IPKPDGSLRFCNDFRKLNEV---SKFDAFPMPRVDELIERLDPARYLTTLDLTKGYRQVP 577
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
+ ++ A S D+ LPFGL AP F L +WV L+ YLDD +
Sbjct: 578 LNEQAKQKTAFSTPEDLFQYNVLPFGLHGAPATFQRLMDWV---LKPHRPYASAYLDDVV 634
Query: 622 LVNQD 626
+ + D
Sbjct: 635 IFSTD 639
>gi|358341466|dbj|GAA49140.1| transposon Ty3-G gap-Pol polyprotein [Clonorchis sinensis]
Length = 1324
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 101/430 (23%), Positives = 167/430 (38%), Gaps = 53/430 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRI 538
Q ML+ G+++ S + + S L +VPK RP + + LN +P ++ + +
Sbjct: 477 FQHMLQLGIIR--PSKSVWASPLHMVPKKTTADWRPCGDYRALNNITTPDRYPIPHIHDF 534
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L +D+ +AY H+PI A++ + +PFGL A Q F
Sbjct: 535 TSNLAGCTIFSHVDILRAYHHIPIHPEDIHKTAITTPFGLFEFLRMPFGLRNAAQTFQRF 594
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
+ V S L V YLDD L+ + +L + L G ++N QK
Sbjct: 595 IDQVLSGLSF----VFAYLDDILVASSSTEQHLEHLRLLFTRLRDHGVVINAQKCIFG-V 649
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF--V 716
L FLG + D + +DK LD+ RS SF
Sbjct: 650 STLNFLGHTVNQ--DGISPTDDK------------------LDAIRSFPLPTSFKQLKRF 689
Query: 717 IPMGRLHSRRIQRQASLLR------LGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQV 770
+ M + R I + ASLL G P + + + N+L S +F Q
Sbjct: 690 LGMINFYRRFIPKAASLLAPLTNLLSGNPKTFHLTDSAISAFGQVKNSLMNSFKLFYLQP 749
Query: 771 QHFIS--TDASDLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMFAVHQALSL 819
+S DAS+ G+ + + F S S + + +E+ A++ ++
Sbjct: 750 NSVLSLNVDASNDAVGAVLQQTINNIHQPLAFFSHKLSPTESRYSTFGRELLAIYLSIRH 809
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
LL+ V +D++ + S L+ S + ++ I + D R + G
Sbjct: 810 FRHLLEGREFHVYTDHKPLTSALKATSDKYSPREIRHLDYISQFTNDIR------HVSGH 863
Query: 880 YNSVADSLSR 889
N VAD+LSR
Sbjct: 864 ENIVADTLSR 873
>gi|427791841|gb|JAA61372.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 1116
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/374 (22%), Positives = 154/374 (41%), Gaps = 35/374 (9%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
++EMLE GV++ +S + + + + LV K +G R ++ + LN F + L
Sbjct: 289 VREMLERGVIQ--ESCSPWAAPVILVKKKDGTWRFCVDYRHLNAFTKKDVYPLPRIDDAI 346
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L Y S+DL Y+ +P+ + A + +PFGL AP A+
Sbjct: 347 DCLHSASYFSSVDLRSGYWQIPMDPVDKEKTAFVTPDGLYEFNVMPFGLCNAP---ATFE 403
Query: 600 NWVASLLRSRGMRV-VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
++ ++LRS + + YLDD ++ + + L + + + G ++N +K
Sbjct: 404 RFMDTILRSLKWEICMCYLDDVVIFGRTFSEHNQRLDLVLDCIRNAGLVLNSKKCHFGER 463
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
L LG + +D+ + D Q +T+ + S + L +L S+
Sbjct: 464 QAL-VLGHL----VDKDGIRPDPQ-------KTMAVKEFQPPRSVKELRSFLGLCSYFRR 511
Query: 719 MGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPL---SSPIFPRQVQHFI 774
S LLR AP H T A +L+ L + P+ P P +V
Sbjct: 512 FINRFSDVAHPLTCLLRKDAPFHWTDECDASFRQLKCLLTSQPILRHFDPSAPTEVH--- 568
Query: 775 STDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ 825
TDAS +G G+ + ++ S S+ ++N+ + ++E AV A+ L
Sbjct: 569 -TDASGVGVGAVLVQRIGDKEHVIAYASRSLSKPERNYTVTEQECLAVIFAVQRFRSYLY 627
Query: 826 SSVVMVQSDNQTVV 839
V +D+ ++
Sbjct: 628 GRHFTVVTDHHSLC 641
>gi|270015991|gb|EFA12439.1| hypothetical protein TcasGA2_TC016174 [Tribolium castaneum]
Length = 467
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 89/190 (46%), Gaps = 13/190 (6%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ E+L G+++ +S + + S + LV K NG R V++ + LN+ K+ L
Sbjct: 257 VDELLANGIVR--ESQSPYASPVLLVKKKNGQLRLVVDYRALNKITVQDKYPLPLIEEQL 314
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L G + S+DL Y+H+P+ + A T +PFGL AP F +
Sbjct: 315 RRLAGGKFFTSLDLFSGYYHIPVSEDSIHYTAFITQDGHYEFTRVPFGLTNAPAVFQRMI 374
Query: 600 NWVASLLRSRGMRVVVYLDDFLL----VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
N LR +V++YLDD L+ +++ +L ++ + +L G +NL+K
Sbjct: 375 NTALGQLRFS--KVLIYLDDILIPAPTISESLHLL----RIVLKVLQDNGLTLNLKKCYF 428
Query: 656 SPAPVLQFLG 665
+ ++LG
Sbjct: 429 LKKQI-EYLG 437
>gi|94208|pir||S18738 pol protein - simian foamy virus (fragment)
Length = 1161
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 119/244 (48%), Gaps = 15/244 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
++ + I ++L+ GVL + +ST + ++ VPK +G R VL+ + +N+ + +
Sbjct: 189 SIQIVIDDLLKQGVLIQQNSTMN--TPVYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQH 246
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I S + +G Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 247 SAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPAL 306
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + V LL+ V Y+DD + + DP+ Q + SIL + G++V+L+KS
Sbjct: 307 FTAD---VVDLLKEIP-NVQAYVDDIYISHDDPQEHLEQLEKIFSILLNAGYVVSLKKSE 362
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
++ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 363 IAQREV-EFLGF----NITK----EGRGLTDTFKQKLLNITPPKDLKQLQSILGLLNFAR 413
Query: 715 FVIP 718
IP
Sbjct: 414 NFIP 417
>gi|341882120|gb|EGT38055.1| hypothetical protein CAEBREN_28397 [Caenorhabditis brenneri]
Length = 2174
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 114/476 (23%), Positives = 195/476 (40%), Gaps = 75/476 (15%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
+ +V G A P KP VPL A P M I +ML+ V++ +S + + S +
Sbjct: 905 IELVEG-AQPVRQKPRPVPLA-----ARPEIRKM---IDKMLDQKVIR--ESKSSWASPV 953
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK--GDYMIS-IDLSQAYFH 559
LV K + R ++ + +N+ + L N I + LQ G + S +DL Y+
Sbjct: 954 VLVKKKDNSIRMCIDYRKVNKVVKYNAHPLPN---IEATLQSLAGKAVFSTLDLVSGYWQ 1010
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLD 618
+P+K + + A + LPFGL T+P F A++ V L+ G VY+D
Sbjct: 1011 LPLKESSKEITAFVVGTEFYEWEVLPFGLVTSPALFQATMETVVGDLI---GKCAFVYVD 1067
Query: 619 DFLLV--NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMW 676
D L+ N + +L++Q L + G K L+ V ++LG
Sbjct: 1068 DLLVASENMEQHVLDLQRVL--ERVERSGLKFRASKCHLAKREV-EYLG--------HKI 1116
Query: 677 LPEDKQLTLGNILRTLLASKTWNLDSARSLLG--------YLSFASFVIPMGRLHSRRIQ 728
PE + + + S+ NL +S LG ++F+ P+ L S+
Sbjct: 1117 TPEGVKTEEKKVEKMRKFSRPTNLKELQSFLGLVGYYRKFIMTFSKIAAPLTPLTSKNSA 1176
Query: 729 -----RQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGW 783
Q + +L + + P +E +N S P I TDAS G
Sbjct: 1177 WIWGVEQETAFQLLIEKVCSAPVLMQPNVEAAING---SRPF-------LIYTDASRQGI 1226
Query: 784 GS----QVDS------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQS 833
G+ + D +F S + + +H+ E A+ AL ++ + V V +
Sbjct: 1227 GAVLAQEADDGEQHPIAFSSRSLTSAETRYHVTDLEALAMMSALKRFKTIIYGTQVTVFT 1286
Query: 834 DNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
D++ +V L K L + + + Q++ + ++ F+ G N VAD+LSR
Sbjct: 1287 DHKPLVYLL------KGSPLADRLLRWSIQIQEYNVRLV--FVNGKANVVADALSR 1334
>gi|326663789|ref|XP_003197661.1| PREDICTED: hypothetical protein LOC100333208 [Danio rerio]
Length = 1481
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 101/431 (23%), Positives = 178/431 (41%), Gaps = 48/431 (11%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+ +MLE GV++ +S + + S + LVPK +G R ++ + LN + KF RI
Sbjct: 1013 ELGKMLEMGVVE--ESHSDWASPIVLVPKTDGTVRFCVDYRKLN---AVSKFDAYPMPRI 1067
Query: 539 PSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
L + + ++DL++ Y+ +P+ + A + + LPFGL AP F
Sbjct: 1068 DELLDRLGAARFYSTLDLTKGYWQIPLSPISREKTAFTTPFGLHQFVTLPFGLFGAPATF 1127
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
L + +L YLDD ++ + D + + +S L G N +K ++
Sbjct: 1128 QRLMD---KILARHSAYAAAYLDDIIIFSNDWQRHMQHLRAVLSALRRAGLTANPRKCAI 1184
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
V ++LG HL + Q+ + T KT R LG +
Sbjct: 1185 GRVEV-RYLGF----HLGHGQV--RPQIDKTAAIATCPRPKTKK--EVRQFLGLAGYYRR 1235
Query: 716 VIPMGRLHSRRIQRQASLLRLGAP---HLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQ 771
+P +S + L + G P T K++ L P L +P F +
Sbjct: 1236 FVPE---YSALVSPLTDLTKKGEPDTVQWTEQCQQAFTKVKAALCGGPLLHAPNF--ALP 1290
Query: 772 HFISTDASDLGWGS----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQA-LSLNL 821
+ TDASD G G+ +V+ ++S S + + +KE A+ A L+L
Sbjct: 1291 FILQTDASDRGLGAVLAQEVEGEERPVLYISRKLSNREAKYSTIEKECLAIRWAVLTLRY 1350
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYN 881
LL ++ + + +L R T + + + +L Q ++ ++ + PGA
Sbjct: 1351 YLLGKEFILC--SDHAPLQWLHRMKDTN-----ARITRWYLALQPFKFKVIHR--PGAQM 1401
Query: 882 SVADSLSRSKS 892
VAD LSR++
Sbjct: 1402 VVADFLSRARG 1412
>gi|427791163|gb|JAA61033.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 1065
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 96/411 (23%), Positives = 169/411 (41%), Gaps = 45/411 (10%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
RI +G A P KP V + +A V+ EML+ GV++ +S + + + +
Sbjct: 210 RINTGDAPPIRQKPYRVSPSERKVIAEQVN--------EMLQKGVIQ--ESCSPWAAPVI 259
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIK 563
LV K + R ++ + LN + L L Y S+DL Y+ +P+
Sbjct: 260 LVKKKDNSWRFCVDYRRLNAVTKKDVYPLPRIDDAVDCLHSAAYFSSVDLRSGYWQIPMH 319
Query: 564 TTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLL 622
T + A + +PFGL AP A+ ++ S+LR + + YLDD ++
Sbjct: 320 PTDREKTAFVTPDGLFEFNVMPFGLCNAP---ATFERFMDSILRGLKWETCMCYLDDVII 376
Query: 623 VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP--APVLQFLGIMWDPHLDRMWL-PE 679
+ + + ++ + G I+N +K A VL +L +DR + P+
Sbjct: 377 FGRTFHEHNQRLSVVLNCIKQAGLILNSKKCHFGERQAVVLGYL-------VDRNGIRPD 429
Query: 680 DKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASFVIPMGRLHSRRIQRQASLLRLGA 738
++T +R KT + RS LG S F F+ +L LLR
Sbjct: 430 PNKIT---AVRNFKPPKT--VKDLRSFLGLCSYFRRFIKDFAQL----AHPLTDLLRKDT 480
Query: 739 PH-LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---------QVD 788
P+ T A +L++ L + PL P + + TDAS +G G+ Q
Sbjct: 481 PYRWTTECEAAFEQLKFLLTSGPLLHHFDPEALTE-LHTDASGVGVGAVLVQFHDGRQHV 539
Query: 789 SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVV 839
++ S ++ + N+ + + E AV A+ P L + +D+ ++
Sbjct: 540 VAYASRALTKAESNYTVTELECLAVVYAIHKFRPYLYGRHFKIVTDHHSLC 590
>gi|9634979|ref|NP_054716.1| Pol [equine foamy virus]
gi|7595332|gb|AAF64414.1|AF201902_2 Pol [equine foamy virus]
Length = 1153
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 160/369 (43%), Gaps = 28/369 (7%)
Query: 459 LVPLCSLQHLATPVSSA-MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLN 517
L P Q+ P + A + + I ++L+ GVLK+ T+ + ++ VPK +G R VL+
Sbjct: 157 LNPKPQKQYRINPKAKADIQIVIDDLLKQGVLKQ--QTSPMNTPVYPVPKPDGRWRMVLD 214
Query: 518 LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD 577
+ +N+ + + + L +G Y ++DL+ ++ PI+ + Q + ++NG
Sbjct: 215 YRAVNKVTPAIATQNCHSASLLNTLYRGQYKTTLDLANGFWAHPIQESDQWITSFTWNGK 274
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLA 637
T LP G +P F + V LL+ V VY+DD N L
Sbjct: 275 SYVWTTLPQGFLNSPALFTAD---VVDLLKDIP-NVEVYVDDVYFSNDTEEEHLKTMDLL 330
Query: 638 VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT 697
L + G+IV+L+KS L V FLG + LT + L +
Sbjct: 331 FQKLQTAGYIVSLKKSKLGQHTV-DFLGF--------QITQTGRGLTDSYKSKLLDITPP 381
Query: 698 WNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTP---INPAVLPKLEW 754
L +S+LG L+FA IP +S I L+ L P + A+L K+
Sbjct: 382 NTLKQLQSILGLLNFARNFIPN---YSELITPLYQLIPLAKGIYIPWETKHTAILQKIIK 438
Query: 755 WLNA---LPLSSPIFPRQVQHFISTDASDLGW---GSQVDSSFLSGLWSREQQNWHINKK 808
LNA L P V+ +S A + + GS ++ + ++S+ + + I +K
Sbjct: 439 ELNASENLEQRKPDVELIVKVHVSPTAGYIKFANKGSIKPIAYHNVVFSKTELKFTITEK 498
Query: 809 EMFAVHQAL 817
M +H+AL
Sbjct: 499 VMTTIHKAL 507
>gi|341897659|gb|EGT53594.1| hypothetical protein CAEBREN_03434 [Caenorhabditis brenneri]
Length = 2039
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 120/508 (23%), Positives = 205/508 (40%), Gaps = 80/508 (15%)
Query: 420 ELVGGRLRRFVDAWIR----LGAPAPL-VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSS 474
+++ L F DA+ R LG+ V I + +P A+P VP+ +
Sbjct: 922 QILTDLLNEFPDAFSRNSYDLGSSKTEPVHIYTNTEVPVKARPYRVPV--------KYQA 973
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
+ HI +L +G + +S T +LS + LV K NG R L+ + LN+ P F L
Sbjct: 974 ELEKHINSLLRSGRIT--ESNTPWLSPIVLVKKKNGSLRVCLDFRKLNEATIPDNFPLP- 1030
Query: 535 HFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATA 591
RI + L++ +Y S+D++ Y + + V A T LPFGL +A
Sbjct: 1031 --RIDAILERVGGSNYFSSLDMANGYLQLRLDPASSYKCGFITESKVYAYTHLPFGLKSA 1088
Query: 592 PQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD-PRILEIQGKLAVSILGSLGWIVNL 650
F V L V+VY+DD L+ ++ P + K+ + NL
Sbjct: 1089 ASYFQRALRTVLGGLED---EVLVYIDDILIFSKTFPEHINSIRKVLLRFRD-----FNL 1140
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
+ S F+ + ++R DK N+ + + N++ R +G
Sbjct: 1141 KASPKKCEFAKDFITFLGH-EINRDNYAPDK----ANVAKIVEFPIPSNINEIRRFVGMA 1195
Query: 711 --------SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLS 762
+F+ P+ RL +R+ Q+ N E +AL S
Sbjct: 1196 GFFRKFIQNFSEIAEPLTRL-TRKEQKFV------------WNTEQQTAFERLRDAL-AS 1241
Query: 763 SPI--FPRQVQHF-ISTDASDLGWGSQV-----DS-------SFLSGLWSREQQNWHINK 807
PI +P + F I DAS + G+ + D+ ++ S S + W +
Sbjct: 1242 EPILGYPDYDKPFHIFCDASAVAQGAALMQARPDNEKDFYAIAYASRTLSDPETRWPAIQ 1301
Query: 808 KEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
EM A+ AL P + S +++ SD++ + L++ +L+ + + Q +
Sbjct: 1302 VEMGAIIFALRQFRPYICLSKIILHSDHKPLTFLLQKSKTHDNLA------RWLIELQCY 1355
Query: 868 RIHILAQFIPGAYNSVADSLSRSKSLPD 895
I I+ I G N+VAD LSR++ D
Sbjct: 1356 DITIV--HIDGKKNTVADCLSRARENED 1381
>gi|189909153|ref|YP_001961122.1| Pol [Macaque simian foamy virus]
gi|110282985|sp|P23074.3|POL_SFV1 RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;
Contains: RecName: Full=Protease/Reverse
transcriptase/ribonuclease H; AltName:
Full=p87Pro-RT-RNaseH; Contains: RecName:
Full=Protease/Reverse transcriptase; AltName:
Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H;
Short=RNase H; Contains: RecName: Full=Integrase;
Short=IN; AltName: Full=p42In
Length = 1149
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 119/244 (48%), Gaps = 15/244 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
++ + I ++L+ GVL + +ST + ++ VPK +G R VL+ + +N+ + +
Sbjct: 177 SIQIVIDDLLKQGVLIQQNSTMN--TPVYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQH 234
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I S + +G Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 235 SAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPAL 294
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + V LL+ V Y+DD + + DP+ Q + SIL + G++V+L+KS
Sbjct: 295 FTAD---VVDLLKEIP-NVQAYVDDIYISHDDPQEHLEQLEKIFSILLNAGYVVSLKKSE 350
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
++ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 351 IAQREV-EFLGF----NITK----EGRGLTDTFKQKLLNITPPKDLKQLQSILGLLNFAR 401
Query: 715 FVIP 718
IP
Sbjct: 402 NFIP 405
>gi|327268405|ref|XP_003218988.1| PREDICTED: lysine-specific demethylase 6A-like [Anolis
carolinensis]
Length = 1580
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 58/109 (53%), Gaps = 1/109 (0%)
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYN 881
P+ V VQ+DN + Y+ +Q GT S L+S + +L + ++A IP N
Sbjct: 62 PVNPHKTVQVQTDNMVAMYYINKQDGTGSRKLMSLSTRFWLWCIAHDVFLVALHIPVLQN 121
Query: 882 SVADSLSR-SKSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNH 929
+ADSLSR + S DW L +F WG P +DLFASR ++ +P +
Sbjct: 122 GLADSLSRMTSSSHDWQLDPETLHSVFDDWGWPTLDLFASRHNSQLPRY 170
>gi|291236625|ref|XP_002738239.1| PREDICTED: polyprotein-like, partial [Saccoglossus kowalevskii]
Length = 861
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 117/249 (46%), Gaps = 23/249 (9%)
Query: 521 LNQFLSPKKFSLINHFRIPSFLQK------GDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
LN+ + + FS+ ++ RI ++ G +M D++ A+ +P+ + + +
Sbjct: 604 LNELIDKETFSM-SYIRIDDAFEELKRLGAGAHMNKFDITDAFKQIPLHPSIWHLHGIKW 662
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSN---WVASLLRSRGMRVVVYL-DDFLLVNQDPRIL 630
+G L FG ++P+ F LS W+A ++ +++L DDFL ++Q
Sbjct: 663 DGRYYFFARLVFGSRSSPKIFDKLSQAICWIAEY--KFDVKFILHLLDDFLTIDQ----F 716
Query: 631 EIQGKLAVSILGSLGWIVNL---QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN 687
E ++S++ L +N+ +L P L++LGIM + + +LP +K+ +
Sbjct: 717 EETALRSMSVMTMLFKSLNIPLAAHKTLGPTQELEYLGIMLNSRDLQAFLPTNKKERITT 776
Query: 688 ILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR-LHSRRIQRQASLLRLGAPHLTPINP 746
I+ K+ SLLG+L+FA+ V+ GR S ++ S+ +L H +N
Sbjct: 777 IISEFTVKKSITKQELLSLLGHLNFAARVVVPGRSFVSHLLKLSTSVSKLH--HHVSLNH 834
Query: 747 AVLPKLEWW 755
A +L W
Sbjct: 835 ACRLELSMW 843
>gi|360045497|emb|CCD83045.1| hypothetical protein Smp_185440 [Schistosoma mansoni]
Length = 989
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 98/425 (23%), Positives = 172/425 (40%), Gaps = 35/425 (8%)
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFS 531
+ + ++ +QEML+ G+++ DS + S + LV K NG R ++ + LN K +
Sbjct: 131 LEAEVNRQVQEMLKEGIIEEADSP--YSSPVLLVKKPNGKYRFCVDFRELNNITKLKPCA 188
Query: 532 LINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATA 591
+ LQ +DL Y+ +PIK + + A + +PFGLA A
Sbjct: 189 MPTVVETLDRLQNATVFTVLDLRSGYWQLPIKESDRSKTAFTIRDKQYQFRRMPFGLAGA 248
Query: 592 PQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
P F L SLL V VY DD ++ +Q + + + G +N
Sbjct: 249 PFTFRRL----MSLLLRDLDNVEVYGDDVVVYSQTETDHVKHVEAVLKRIEEFGLRINKD 304
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS 711
KS ++ + + + + + LPE K LT+ N+ +S R L +L
Sbjct: 305 KSQMAKSSITLLGHKVGNGEIKP--LPE-KILTIKNVAVP---------NSKRKLRQFLG 352
Query: 712 FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ 771
A+F R + LL T I +++ L+ ++ + +
Sbjct: 353 RAAFYSRFIRNFNEIAAPLYKLLSNTKFSWTEIAQQTFNQIKNMLDDRQMTLRLPELEKP 412
Query: 772 HFISTDASDLGWG---SQVDS--SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQS 826
++TDASD G G SQ D + S + + +Q + +KE A+ A+ P L
Sbjct: 413 FTVTTDASDHGIGAVLSQSDRVVEYASRVLTPAEQKYSTIEKECLAIVWAVDKWRPYLLG 472
Query: 827 SVVMVQSDNQTV--VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
+++D++ + + R G + ++ E F + +PG N +A
Sbjct: 473 RRFHIETDHKPLQWLQTARDPRGKSARWMIRLQEYDFSIGH----------VPGKENVMA 522
Query: 885 DSLSR 889
D LSR
Sbjct: 523 DYLSR 527
>gi|190360820|gb|ACE76864.1| polyprotein [Citrus yellow mosaic virus]
Length = 1968
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 106/456 (23%), Positives = 185/456 (40%), Gaps = 56/456 (12%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
L+H+ + + H++ +L+ G ++ S + + +V G G R
Sbjct: 1386 LKHVTPQMEESFRKHVEALLKIGAIR--PSKSRHRTTAIIVNSGTSIDPITGKEVKGKER 1443
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYFHVPIKTTHQRFL 570
V N K LN + ++SL I + LQ KG + S DL + V + +
Sbjct: 1444 MVFNYKRLNDLTNKDQYSLPG---IQTILQRLKGSTIFSKFDLKSGFHQVAMHPDSVEWT 1500
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
A + +PFGL AP F + + + VY+DD L+ ++ +
Sbjct: 1501 AFWVPSGLYEWLVMPFGLKNAPAVFQRKMD---HCFKGTEAFIAVYIDDILVFSKTEKEH 1557
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
E ++ +SI G I++ K ++ A + +FLG + L ++ Q + L
Sbjct: 1558 EEHLQIMLSICQRNGLILSPTKMKIAQAEI-EFLGAIIHNGLIKL------QPHIVQKLL 1610
Query: 691 TLLASKTWNLDSARSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAVL 749
T + + RS LG L++A IP MGRL S A + G + + A++
Sbjct: 1611 TFTNKQLEEVKGLRSWLGLLNYARSYIPHMGRLLSPLY---AKVSPTGERRMNRQDWALI 1667
Query: 750 PKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG-------SQVD-------SSFLSG 794
K+ + LP L P P I TD GWG +Q D ++ SG
Sbjct: 1668 DKIRAQVQNLPALELP--PADCFIIIETDGCMDGWGGVCKWKIAQYDPRSSERVCAYASG 1725
Query: 795 LWSREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSL 853
++ + E+ AV +L + + L S + +++D Q ++S+ + K +
Sbjct: 1726 KFNPPKSTI---DAEIHAVMNSLNNFKIYYLDKSSLCLRTDCQAIISFFNKSNVNKPSRV 1782
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
FL ++I + I G N +AD+LSR
Sbjct: 1783 RWIAFTDFLTGLGIPVNI--EHIDGKNNHLADALSR 1816
>gi|301604023|ref|XP_002931674.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1435
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 115/517 (22%), Positives = 198/517 (38%), Gaps = 80/517 (15%)
Query: 406 SSPVNPPADSRIGAELVGGRLRRFVDAWIRLGAPA-PLVRIVSGYAIPFSAKP-PLVPLC 463
S V P D++ +V L F D + GA P R+ Y P P +P
Sbjct: 464 SMAVAPATDTQFA--VVPNYLHEFKDVFDEKGADTLPPHRV---YDCPIDLLPGAAIPFG 518
Query: 464 SLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQ 523
+ L+ P + +I E LE G ++ S G + +F V K + RP ++ + LN
Sbjct: 519 RIYPLSEPELIVLKKYIDENLEKGFIRPSTSPAG--AGIFFVEKKDHSLRPCIDYRQLNL 576
Query: 524 FLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTC 583
++ L + L++ +DL AY V I+ + A
Sbjct: 577 ITVKNRYPLPLIPELFQNLREAKIFSKLDLRGAYNLVRIRKGDEWKTAFRSRYGHFEYLV 636
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
+PFGL AP F L N + + V+VYLDD L+ + + EI + S L
Sbjct: 637 MPFGLCNAPATFQHLVNDIFRDFLDQF--VIVYLDDILVFSSSIKEHEIHMRKVFSRLRE 694
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL---RTLLASKTWNL 700
L+K + +FLG + ++ IL + + A W +
Sbjct: 695 HSLFAKLEKCEFHKTSI-EFLGFV---------------ISTDGILMDPKKVSAVLNWPV 738
Query: 701 DSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN--PAVLPKLEW--- 754
++R ++ F++F RR R S + ++PI + + +W
Sbjct: 739 PTSRKATQRFIGFSNFY--------RRFIRNFSKI------ISPITDLTSTTKRFQWSSQ 784
Query: 755 ------WLNALPLSSPIFPR---QVQHFISTDASDLGWGSQVDS-----------SFLSG 794
L L S+PI + + DAS+ G+ + +F S
Sbjct: 785 AQSAFDKLKELFTSAPILKHPDPSLPFVVEVDASETAVGAVLSQRSGLQNFLHPVAFFSK 844
Query: 795 LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLS 852
S ++N+ ++ +E+ A+ A L+ S +++ SD++ + YLR +
Sbjct: 845 KLSPSEKNYDVSDRELLAIKVAFEEWRQYLEGSSHPILIFSDHRN-LEYLR-----TAKR 898
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L + L + HI + PG+ N AD+LSR
Sbjct: 899 LRPRQARWALFFSRFNFHI--TYRPGSQNHKADALSR 933
>gi|348531493|ref|XP_003453243.1| PREDICTED: receptor tyrosine-protein kinase erbB-4-like
[Oreochromis niloticus]
Length = 1523
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 44/74 (59%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS 604
GD+ +IDL+ AYFHV I H++FL ++ G LPF L+ AP+ F + +
Sbjct: 12 GDWFATIDLTDAYFHVAIHPKHRQFLRFAFEGVAYEYLVLPFELSLAPRTFTKCAEAALA 71
Query: 605 LLRSRGMRVVVYLD 618
LR RG+R++ YLD
Sbjct: 72 PLRERGIRILAYLD 85
>gi|308447755|ref|XP_003087511.1| hypothetical protein CRE_10791 [Caenorhabditis remanei]
gi|308255031|gb|EFO98983.1| hypothetical protein CRE_10791 [Caenorhabditis remanei]
Length = 630
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 87/195 (44%), Gaps = 6/195 (3%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+ +++E+G + +S +S L +V +G R +L+L N+ LSP KF+L
Sbjct: 216 EVFKLVESGAVAVTESPI-VISPLHVVEQGEK-KRLILDLSEFNKNLSPPKFTLETWKHA 273
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN----GDVLAMTCLPFGLATAPQA 594
L + + + D Y HV I+ LA S LPFGL+TAP
Sbjct: 274 RPELVRMRFAATFDFKSGYHHVKIEENSSELLAFSLTDPPTAPYFKFRALPFGLSTAPWL 333
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + + R G+++ +Y+DD L+V + L + S L LG + +K
Sbjct: 334 FTKIFRPIVGKWRRNGIKIWLYIDDGLIVAETEEDLIRAVSIVKSDLERLGVALADEKCK 393
Query: 655 LSPAPVLQFLGIMWD 669
P+ V +LG + D
Sbjct: 394 WEPSSVFTWLGFVGD 408
>gi|341891627|gb|EGT47562.1| hypothetical protein CAEBREN_01908 [Caenorhabditis brenneri]
Length = 2052
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 117/503 (23%), Positives = 203/503 (40%), Gaps = 76/503 (15%)
Query: 420 ELVGGRLRRFVDAWI-------RLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPV 472
E V G +R+F + R A + +V G A P KP +PL +
Sbjct: 1079 EEVWGIVRKFQHIFAVDDNELGRTNAVECEIELVEG-AEPVRQKPRPIPLA--------I 1129
Query: 473 SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL 532
+ IQ+ML GV++ +S + + S + LV K +G R ++ + +N+ + L
Sbjct: 1130 RPEIRKMIQKMLAQGVIR--ESHSPWASPVVLVKKKDGSVRMCIDYRKVNKVVRYNAHPL 1187
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
N L ++DL Y+ +P+K + A + ++ LPFGL T+P
Sbjct: 1188 PNIEATLQSLSGKKVFTTLDLLAGYWQIPLKEQSKEITAFAIGSELFEWNVLPFGLVTSP 1247
Query: 593 QAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
F A++ + + +L G+ VY+DD L+ ++ LE K +L ++
Sbjct: 1248 AIFQATMESVIGDML---GICAFVYVDDLLIASES---LEQHAKDLERVLE------RVE 1295
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS 711
KS + + +L PE + I + + N +S LG +
Sbjct: 1296 KSGMRFRASKCHIAQEQVAYLGHKITPEGVRTEEAKIDKMKKFPRPTNPKEVQSFLGLVG 1355
Query: 712 -FASFVIPMGRLHS------------RRIQRQASLLR--LGAPHLTPINPAVLPKLEWWL 756
+ FVI ++ S R + Q + + + A TP+ + P E
Sbjct: 1356 YYRKFVINFAQMASALTPLTAKQAVWRWEEEQEAAFQSLIQAICSTPV--LMQPNTE--- 1410
Query: 757 NALPLSSPIFPRQVQHFISTDASDLGWGS----QVDS------SFLSGLWSREQQNWHIN 806
A+ S P I TDAS G G+ Q D +F S + + +HI
Sbjct: 1411 AAIDGSKPF-------LIYTDASRKGVGAVLAQQGDDGEQHPIAFASKALTPAETRYHIT 1463
Query: 807 KKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQD 866
E + AL ++ + ++V +D++ ++ LR L S I LL +
Sbjct: 1464 DLEALGMIFALRRFKTIVYGTQILVYTDHKPLIYLLRGTPLADRLLRWS----IELL--E 1517
Query: 867 WRIHILAQFIPGAYNSVADSLSR 889
+ + I+ F+ G N+VAD+LSR
Sbjct: 1518 YNVKII--FVNGKANNVADALSR 1538
>gi|308446277|ref|XP_003087141.1| hypothetical protein CRE_30354 [Caenorhabditis remanei]
gi|308260969|gb|EFP04922.1| hypothetical protein CRE_30354 [Caenorhabditis remanei]
Length = 739
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 105/452 (23%), Positives = 184/452 (40%), Gaps = 90/452 (19%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
IQ+ML V++ +S + + S + LV K +G R ++ + +N + L N
Sbjct: 2 IQKMLSQRVIR--ESKSPWASPVVLVKKKDGSVRMCIDYRKVNLLIKYNAHPLPNIETTL 59
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF-ASL 598
L + DL Y+ +P+K + A + ++ LPFGLAT+P F A++
Sbjct: 60 LSLAGKKVFTTFDLLAGYWQLPLKEESKEITAFAIGSELFEWNVLPFGLATSPAIFQAAM 119
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPR--------ILEIQGKLAVSILGSLGWIVNL 650
V LL G V VY+DD L+ +++ + ILE K + + S WI
Sbjct: 120 ECVVGDLL---GTCVFVYVDDLLIASENMKEHAIHVQTILERIEKSGMKLKASKCWIARE 176
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGY- 709
+ + +LG M P + ++ + + + L S L+GY
Sbjct: 177 E---------VDYLGHMITPEGVKT-----EEAKVDKMKKFARPEDVKQLQSFLGLVGYY 222
Query: 710 ----LSFASFVIPMGRLHSRR------IQRQASLLRL-----GAPHLTPINPAVLPKLEW 754
+S++ P+ L S++ +++ + ++L AP L +P
Sbjct: 223 RNFIMSYSKIAYPLNFLTSKKNAWVWGTEQENAFVQLKSSVCSAPVLRQPDPE------- 275
Query: 755 WLNALPLSSPIFPRQVQHFISTDASDLGWGS----------QVDSSFLSGLWSREQQNWH 804
A+ + P + I TDAS G G+ Q +F S + + +H
Sbjct: 276 --TAISGARP-------YLIYTDASRQGVGAVLAQEANDGEQHPIAFASKSLTSAETRYH 326
Query: 805 INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
I E A+ AL ++ S V+V +D++ ++S +R G++ L
Sbjct: 327 ITDLEALAMMFALRRFRTIIYGSQVIVFTDHKPLISLMR---GSRLADRLMR-------- 375
Query: 865 QDWRIHILAQFIP------GAYNSVADSLSRS 890
W I ++ +F P G N VAD+LSR
Sbjct: 376 --WSIELI-EFNPKIVSVKGKANVVADALSRG 404
>gi|391334995|ref|XP_003741883.1| PREDICTED: uncharacterized protein K02A2.6-like, partial
[Metaseiulus occidentalis]
Length = 890
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 96/398 (24%), Positives = 170/398 (42%), Gaps = 56/398 (14%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSLI 533
A+S ++ +++ GVLK + + F + + +V K G R + GLN L K+ L
Sbjct: 493 ALSKELERLVKLGVLKPT-TNSEFAAPVVVVRKKGGEIRLCADFSTGLNNALQDDKYPLP 551
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
I S L G + IDL++AY + + + + ++ + MT LPFGL TAP
Sbjct: 552 TAQDIFSRLAGGKFFSKIDLAEAYLQIEVHPDDRNLITINTPKGLFEMTRLPFGLKTAPS 611
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQGKLAVSILGSLGWIV 648
F + + +L+ G +YLDD L+ + R+L++ K+ G+ +
Sbjct: 612 LFQRIMD--ETLVGIPG--TAIYLDDILVTGRTAKEHRDRVLKVMAKIQKG-----GFRI 662
Query: 649 NLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT-WNLDSARSLL 707
++K S + ++LG + D H R P+ Q +R ++ T + ++ L
Sbjct: 663 RMEKCSFLQKQI-KYLGFIIDEHGRR---PDPAQ------IRPIVELPTPKDAKDVQAFL 712
Query: 708 GYLSFASFVIPMGRLHSRRIQRQAS-LLRLGAPHL-TPINPAVLPKLEWWLN---ALPLS 762
G ++F S IP +RI+ + LLR + T + + L AL
Sbjct: 713 GLVTFYSNFIP----DMKRIKEPLTPLLRKNVKFVWTERCEEAFEQAKKILQSDLALTHY 768
Query: 763 SPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINK-------------KE 809
P P +V S DAS G G + +F G ++ H+NK KE
Sbjct: 769 DPQLPLEV----SADASQSGVGGVLLHTFPDG---SKKAIMHVNKVLTETEKRYGQIEKE 821
Query: 810 MFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG 847
A+ A+ + ++ +D++ ++S + GG
Sbjct: 822 ALALVFAVKKFHKYIYGRHFILNTDHKPLLSVFKVDGG 859
>gi|392578578|gb|EIW71706.1| hypothetical protein TREMEDRAFT_26982 [Tremella mesenterica DSM
1558]
Length = 1387
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 107/449 (23%), Positives = 181/449 (40%), Gaps = 64/449 (14%)
Query: 494 STTGFLSRLFLVPK-GNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISID 552
S + + S +F++PK G R V++ + LN+ P + L +I L K Y +D
Sbjct: 473 SNSPYGSPMFMIPKKAEGQWRMVIDYRKLNEATIPDAYPLPLIGQITEELGKARYFSKLD 532
Query: 553 LSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMR 612
L AY + + H+ A + + GL AP F N V L G
Sbjct: 533 LIGAYQLLRVTEGHEHLTAFRTQYGMFESLVVRDGLRNAPAVFQHFLNEVFRELLGNG-- 590
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLA--VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP 670
VVVY+DD L+ + E++G A +L V K V+ FLG
Sbjct: 591 VVVYIDDILIYGNT--LEELRGTTAKVFEVLRKASLYVKASKCEFERDSVV-FLG----- 642
Query: 671 HLDRMWLPEDKQLTLGNILRTLLAS--KTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQ 728
++ K +++ + S + NL +R +G +S+ +P +R I
Sbjct: 643 -----FVVSSKGVSVNPEYIDAITSFPRPKNLRESRGFIGVVSYYRRFVPNFSKIARPIN 697
Query: 729 RQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI---FPRQVQHFISTDASDLGWG- 784
L R P + + K L A ++P+ F ++ + TDAS GWG
Sbjct: 698 ---DLTRKEVPFVWGVEQESAFKE---LKARMCTAPVLAHFDPTLKTILQTDASFFGWGF 751
Query: 785 --SQVDS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDN 835
SQ+++ + SG ++ Q N+ + +KE AV + LL V +D+
Sbjct: 752 IISQINTAGQEHPVAIESGAFNTAQLNYTVGEKEFLAVVEGFRRRRHLLLQVETTVLTDH 811
Query: 836 QTVVSYLR------RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ ++ RQG VE++ ++R ++ + PG S+ D LSR
Sbjct: 812 LNLTYWMEPKQLSPRQG--------RWVEEL----ANFRFKMV--YRPGTQASLPDGLSR 857
Query: 890 SKSLPDWHLSRSAT--EQIFLKWGVPCID 916
D+H + +T ++ L G+P D
Sbjct: 858 R---ADYHSGKGSTMVQESNLIQGLPKFD 883
>gi|291239314|ref|XP_002739568.1| PREDICTED: polyprotein-like [Saccoglossus kowalevskii]
Length = 827
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 98/417 (23%), Positives = 175/417 (41%), Gaps = 50/417 (11%)
Query: 506 PKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ------KGDYMISIDLSQAYFH 559
P+GN + +N + ++SL ++ R+ + G + D+ A+
Sbjct: 407 PRGNSLS------TSINDLIDKDEYSL-SYVRVDDAITATQQAGHGALLCKTDIVDAFKL 459
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN---WVASLLRSRGMRVVVY 616
+PI ++ +++ + L FG + P+ F LS W+A + G+ +++
Sbjct: 460 LPIHSSLWHLYGINWQDNFYFFVRLAFGSRSNPKIFDQLSTAICWIAH--HNYGINKMLH 517
Query: 617 L-DDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRM 675
L DDFL ++ E L I G LG + K+ + P L+FLGI D
Sbjct: 518 LLDDFLTIDAPSYDAERTMALLTLIFGRLGIPLAPHKT-VGPTTTLEFLGIKLDTIKMEA 576
Query: 676 WLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGR-LHSRRIQRQASLL 734
LP K L ++L L ++ SLLG+L+FA V+P GR SR I+
Sbjct: 577 RLPLVKLNRLLDLLDEFLLRRSCTKHQLLSLLGHLNFACRVVPPGRTFMSRLIELSKGTQ 636
Query: 735 RLGAPHLTPINPAVLPKLEWWLN--------ALPLSSPIFPR-QVQHFISTDASDLGWGS 785
+L H I+ + W +L L + P +Q F TDAS +G G
Sbjct: 637 KLH--HHVGISSKSKQDIRMWKEFLSGWNGISLFLDRYLTPAPDMQLF--TDASGIGHG- 691
Query: 786 QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV----------VMVQSDN 835
+ G W E+ ++ ++ A P++ +++ ++ DN
Sbjct: 692 ----GYFRGYWFHEKWETNLRLDHDKSLSIAFQQLYPIVVAALLWGHQWTRKHILFHCDN 747
Query: 836 QTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
V ++ +G +KS S++ + ++ + + A IPG N +AD+LSR ++
Sbjct: 748 MATV-HIVNKGRSKSPSIMKLMRRLVITAASHSFMFSAVHIPGKSNIIADALSRFQT 803
>gi|270016330|gb|EFA12776.1| hypothetical protein TcasGA2_TC005019 [Tribolium castaneum]
Length = 1226
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/245 (26%), Positives = 109/245 (44%), Gaps = 33/245 (13%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQ---HLATP-----------VSSAMSLHIQEMLETG 487
L+ ++ Y FS++P L + + H TP + A+ + IQEML+ G
Sbjct: 574 LIHLLQEYRCIFSSRPGLTHKYTHEIKLHDKTPFLKRPYPVPFALRPAVDVTIQEMLDLG 633
Query: 488 VLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL------SPKKFSLINHFRIPSF 541
V+KR + + S + +V K +G R L+ + +N + P L+ F
Sbjct: 634 VIKR--EASPYASPMTVVKKKDGTVRICLDARMINSKMIADCESPPAADELLRRF----- 686
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
YM +IDL +Y+ +P+ ++ A YNG LPFGL TA +F+ +
Sbjct: 687 -HGIRYMSTIDLRSSYWQIPLSPESRQCTAFLYNGRSYTYQVLPFGLKTAVGSFSRAMDV 745
Query: 602 V-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
V + +R VV Y+DD L+V++ + L +NL+KS+ V
Sbjct: 746 VLGTEVRE---FVVNYIDDLLVVSETLNEHLEHLRQVFEKLKQARMTINLEKSNFIQKEV 802
Query: 661 LQFLG 665
+FLG
Sbjct: 803 -KFLG 806
>gi|388856403|emb|CCF49952.1| uncharacterized protein [Ustilago hordei]
Length = 658
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 111/471 (23%), Positives = 186/471 (39%), Gaps = 75/471 (15%)
Query: 454 SAKPPLVPLCSLQHLATPVS-SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT 512
KPP PL +L P S + ++ E LE G ++ L S S + +PK +GG
Sbjct: 138 GGKPPQGPL----YLKGPKEMSKLRRYLDENLEKGFIRPLKSLAR--SPVLFIPKKDGGL 191
Query: 513 RPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL 572
R ++ +GLN+ + L L+K +DL AY + I + A
Sbjct: 192 RLCVDYQGLNEITVKNRAPLPLIEEQLFLLRKARIYTQLDLRAAYNLIQIAKGDEWKTAF 251
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEI 632
+ +PFGLA A F S N + + G+ VVVYLDDFL+ + +
Sbjct: 252 GTQLGLYEYLVMPFGLANASAHFQSFINDIFQDII--GVYVVVYLDDFLIFSDTEEVHVK 309
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL 692
++ L S L K +++FLG + P M + D + +
Sbjct: 310 HVTTVLTHLRSNRLFAKLSKCEFH-TKIVEFLGYIIKP----MGIEMDPE--------KV 356
Query: 693 LASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
K W + +S + +L FA+F + R I A + + P+ V P
Sbjct: 357 CTVKEWPMPESIHDIQRFLGFANF-------YRRFIAYFACIAK-------PLTSLVKP- 401
Query: 752 LEWWLN-ALPLSS-PIFPRQVQHFIS----------------TDASDLGWGSQVDS---- 789
+EW+ LP + F + +Q F S TDASD +
Sbjct: 402 IEWFKKFELPEEAQQAFHKLIQAFTSAGVLQHFNYHLPTRLETDASDFAIAGVLKQEHEG 461
Query: 790 -----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQS--SVVMVQSDNQTVVSYL 842
+F S S ++N+ I+ KE+ AV L+ +L S +++ +D++ + Y
Sbjct: 462 RWHPVAFYSRKMSSAKKNYEIHDKELLAVVACLTQWQHMLAGLLSQLVILTDHEA-LKYF 520
Query: 843 RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
+ Q + ++ I L D+ + Q+ PG D+L+R +
Sbjct: 521 KSQ---RCITGRQARWAILLADFDF----ILQYRPGDKGGEPDALTRRTDM 564
>gi|326669544|ref|XP_003199038.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1249
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 105/450 (23%), Positives = 180/450 (40%), Gaps = 43/450 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P + M ++ + L G+++ S G + F V K +G RP ++ +G
Sbjct: 332 PRGRLFSLSAPERATMEKYLSDSLAAGIIRSSSSPAG--AGFFFVKKKDGSLRPCIDYRG 389
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 390 LNDITIKNRYPLPLMSTAFEILQGARVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 449
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL+ AP F +L N V + ++ V VYLDD L+ + ++ + +
Sbjct: 450 YLVLPFGLSNAPAVFQALVNDVLRDMINKF--VFVYLDDILIFSPSLQVHIQHVRRVLQR 507
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L V +K L A + FLG + RM PE + A W +
Sbjct: 508 LLENQLFVKAEK-CLFHAQSVPFLGSIISVEGIRM-DPEK-----------VRAVSDWPV 554
Query: 701 DSAR-SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNA 758
+R +L +L FA+F R +S+ +L + I A +L+
Sbjct: 555 PGSRKALQQFLGFANFYRRFIRNYSQVAAPLTALTSTKSHFCWSIAAQAAFRELKSRFTT 614
Query: 759 LPLSSPIFPRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
P+ + P + F + DAS++G G+ + ++ S S ++N+ I
Sbjct: 615 APIL--VLPDPARQFVVEVDASEVGVGAVLSQICPKDNKLHPCAYYSHRLSPAERNYDIG 672
Query: 807 KKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ + +V +D++ + Y++ K L+ +F
Sbjct: 673 NRELLAVRLALGEWRHWLEGAAEPFVVWTDHRN-LEYIQ---TAKRLNSRQARWALFF-- 726
Query: 865 QDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
R + + PG+ N D+LSR P
Sbjct: 727 --GRFNFTLSYRPGSKNGKPDALSRCFGTP 754
>gi|341902401|gb|EGT58336.1| hypothetical protein CAEBREN_08140 [Caenorhabditis brenneri]
Length = 2301
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 110/478 (23%), Positives = 196/478 (41%), Gaps = 66/478 (13%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ + HI +L++G + +S T + S + LV K NG R L+ + LN+ P F L
Sbjct: 1338 AELEKHINSLLKSGRIT--ESNTPWTSPIVLVKKKNGSLRVCLDFRKLNEATIPDNFPLP 1395
Query: 534 NHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLAT 590
RI + L+K +Y S+D++ Y + + V A T LPFGL +
Sbjct: 1396 ---RIDAILEKVGGSNYFSSLDMANGYLQLRLDPASSYKCGFITESKVYAYTHLPFGLKS 1452
Query: 591 APQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD-PRILEIQGKLAVSILGSLGWIVN 649
A F V L V+VY+DD L+ ++ + LE K+ + +
Sbjct: 1453 AASYFQRALRTVLGGLED---EVLVYIDDILIYSKTFEQHLETLRKV-LHRFRDFNLKAS 1508
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGY 709
+K + ++ FLG + + DK N+ + N++ R +G
Sbjct: 1509 PKKCEFAKKSIV-FLG----HEISKNTYSPDK----ANVAKITEFPTPTNINEIRRFVGM 1559
Query: 710 LSFASFVI--------PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP- 760
F I P+ RL + + + +L + GA KL L++ P
Sbjct: 1560 AGFFRKFIPNFSEISEPLTRLTRKERKFEWNLDQQGA----------FEKLRTSLSSEPV 1609
Query: 761 LSSPIFPRQVQHFISTDASDLGWGSQVDSS------------FLSGLWSREQQNWHINKK 808
L P + R F DAS + G+ + + + S + + W +
Sbjct: 1610 LGFPDYDRPFHIFC--DASAVAQGAALMQTRLHNEKDFFAIAYASRTLADTETRWPAIQV 1667
Query: 809 EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWR 868
EM A+ AL P + S +++ SD++ + L++ +L+ + + Q +
Sbjct: 1668 EMGAIIFALRQFRPYVCMSKIILHSDHKPLTFLLQKSKTHDNLA------RWLVELQCYD 1721
Query: 869 IHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVV 926
I I+ I G N+VAD LSR++ D +S + + +++ V C+ + A +A V
Sbjct: 1722 ISII--HIDGKKNTVADCLSRARENDD--ISEAVELKDIIEFPV-CMKIDARANAATV 1774
>gi|432955950|ref|XP_004085643.1| PREDICTED: uncharacterized protein LOC101166850, partial [Oryzias
latipes]
Length = 1060
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 106/468 (22%), Positives = 190/468 (40%), Gaps = 69/468 (14%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
A P +P +PL Q A I++M G+++ DS + + + +V K
Sbjct: 103 ANPIRLRPHRLPLAKRQ--------AAEELIKDMAANGIIEPSDSP--WAAPVVMVRKKG 152
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRF 569
GG RP ++ + LN + L ++ + S+DL Y+ V + +
Sbjct: 153 GGWRPCVDYRRLNAVTRKDSYPLPRIDDALDYVTGSCWFSSLDLRSGYWQVELAPEARPK 212
Query: 570 LALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
A + + +PFGL AP F L V + R VVYLDD L+ P +
Sbjct: 213 TAFTIGQGLWQFKVMPFGLCNAPATFERLMERVLKDIPR--TRCVVYLDDLLVHGSFPEV 270
Query: 630 LEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL 689
++ ++ ++I + G +N K +L A QFLG + + +
Sbjct: 271 HNVR-EVFLAIRQA-GLRLNPAKCTLL-ARKTQFLGHVI------------SESGVATDP 315
Query: 690 RTLLASKTW----NLDSARSLLGYLS--------FASFVIPMGRLHSRRIQRQASLLRLG 737
++A + W N RS LG S FA+ P+ RL + + + S
Sbjct: 316 AKVVAVRDWPTPSNTSELRSFLGLASYYRRFVKDFATIANPLHRLTDKGKRFEWS----- 370
Query: 738 APHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF-ISTDASDLGWGSQVDS------- 789
A +L+ L P+ + +P Q F + TDAS++G G+ +
Sbjct: 371 -----EGCAAAFQRLKSALADAPVLA--YPDPGQPFTLDTDASNVGVGAVLSQQHETGER 423
Query: 790 --SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG 847
++ S SR ++N+ + ++E+ A+ A+ P L + +++D+ ++ L +
Sbjct: 424 VVAYYSCSLSRPERNYCVTRRELLAIILAVRHFRPYLLGTKFTLRTDHASLTWMLNFKQP 483
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
++ E+ L D+ + Q PG +S AD+LSR D
Sbjct: 484 EGQVARWLEI----LQEYDFEV----QHRPGRQHSNADALSRRPCFTD 523
>gi|440484150|gb|ELQ64282.1| hypothetical protein OOW_P131scaffold00671g2 [Magnaporthe oryzae
P131]
Length = 841
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 104/447 (23%), Positives = 172/447 (38%), Gaps = 74/447 (16%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ ++ E L+ G ++ S G+ + VPK NG R ++ + LN + L
Sbjct: 240 ETLDKYLDENLKKGYIRPSTSPAGY--PILFVPKKNGKLRLCVDYRQLNDITVKNCYPLP 297
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ L + + ++DL AY + IK + A +PFGL AP
Sbjct: 298 LIGELRDMLYQAQWFTTLDLKGAYNLIRIKKGEEWKTAFRTRRGHFEYLVMPFGLTNAPA 357
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F ++ N V L + + VVVYLDD L+ + + LE + ++L LQ +
Sbjct: 358 TFQTMINHV--LRKCLDIFVVVYLDDILVFS---KTLEEHKQHVHTVLQK------LQDA 406
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGY 709
L P L+ P + ++ I A K W N+ R+ LG+
Sbjct: 407 KLLIEPEKCIFHSKKVDFLEYTIAPGEIRMEASKI----QAIKEWPQPKNVKDVRAFLGF 462
Query: 710 LSFA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW-W----------LN 757
++F F+ G++ TP+ LE+ W L
Sbjct: 463 VNFYRRFIKGYGKI------------------ATPLTNLTKKDLEFKWDKTENQTFEQLR 504
Query: 758 ALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVDS----------SFLSGLWSREQQNWH 804
+ P+ P + F + TDASD G Q+ +F S + N+
Sbjct: 505 DTVATEPVLRIPDPEKLFEVETDASDYAVGGQLGQKDEKGRLHPCAFFSQKLHGPELNYQ 564
Query: 805 INKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
I+ KE+ A+ +A P L + V+V +D++ + + + K SE FL
Sbjct: 565 IHNKELMAIIRAFEEWKPQLSGTKHEVLVYTDHKNLTHFTTSKVLNKRQIKWSE----FL 620
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSR 889
L +RI + G N AD+LSR
Sbjct: 621 LEFHFRI----IYRKGTENGRADALSR 643
>gi|378788723|gb|AFC40211.1| polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 86/197 (43%), Gaps = 9/197 (4%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF------LQKGDYMISIDLS 554
+LFLV K + T + +QF KK + P+ L G IS+DLS
Sbjct: 97 KLFLVDKNSRNTTEARLVVDFSQFSKGKKAMRFPRYWSPNLSTLRRILPVGMPRISLDLS 156
Query: 555 QAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRV 613
QA++H+P+ LA+S V P G+ +P + + S + R +
Sbjct: 157 QAFYHLPLNPASSSRLAVSDGQWVYYFRKAPMGVGLSPFLLHLFTTALGSEISRRFNVWT 216
Query: 614 VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLD 673
Y+DDFLL + + R L S L LG +N K++ SP ++FLG D H
Sbjct: 217 FTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTEIRFLGYQIDEHF- 275
Query: 674 RMWLPEDKQLTLGNILR 690
M + E + L +++
Sbjct: 276 -MKIEESRWKELRTVIK 291
>gi|341877544|gb|EGT33479.1| hypothetical protein CAEBREN_32143 [Caenorhabditis brenneri]
Length = 2212
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/191 (26%), Positives = 89/191 (46%), Gaps = 10/191 (5%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+ EML +++ ST+ F S + LV K +G R + +GLN + + L I
Sbjct: 1452 QVNEMLSMDIIE--PSTSTFTSPIVLVKKKDGTFRFTTDFRGLNAVTVKQIYLLPLISDI 1509
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
G + ++DL Q +F +P++ + + S +P GL AP F S
Sbjct: 1510 VDLASHGKFFTNLDLIQGFFQIPLRKQDRPLTSFSTPNGTFQYKRMPMGLCGAPHTFQSA 1569
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLV--NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
+ + R+ R+ YLDD L+V +++ + +I+ L I ++G+ + +QK S
Sbjct: 1570 VQQLQKMTRA---RLFCYLDDLLIVSDSREQHLTDIEEVLQNII--TIGFKIKIQKCKFS 1624
Query: 657 PAPVLQFLGIM 667
V FLG++
Sbjct: 1625 QREVT-FLGLL 1634
>gi|388856675|emb|CCF49792.1| related to pol protein [Ustilago hordei]
Length = 1607
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 116/471 (24%), Positives = 185/471 (39%), Gaps = 77/471 (16%)
Query: 455 AKPPLVPLCSLQHLATPVS-SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTR 513
KPP PL +L P S + ++ E LE G ++ S + S + VPK +GG R
Sbjct: 646 GKPPQGPL----YLKGPKEMSELRRYLDENLEKGFIR--PSKSPAQSPVLFVPKKDGGLR 699
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
++ +GLN+ + L L+K +DL AY + I + A
Sbjct: 700 LCVDYRGLNEITVKNRAPLPLIKEQLFLLRKARIYTKLDLRAAYNLIRIAKGDEWKTAFG 759
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEI 632
+ +PFGLA A F S N + R G+ VVVYLDDFL+ +
Sbjct: 760 TQLGLYEYLVMPFGLANALAHFQSFIN---DIFRDIIGIYVVVYLDDFLIFSDTEEAHVK 816
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL 692
++ L S L K V +FLG + P M PE +RT+
Sbjct: 817 HVTEVLTRLRSNRLFAKLSKCEFHTKTV-EFLGYIIKPTGIEM-DPEK--------VRTV 866
Query: 693 LASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
K W + +S + +L FA+F + R I A R+ P + P
Sbjct: 867 ---KEWPMPESIHDIQRFLGFANF-------YRRFI---AHFARIAKPLTALVKP----- 908
Query: 752 LEWWLN-ALPLSS-PIFPRQVQHFIS----------------TDASDLGWGSQVDS---- 789
+EW+ LP + F + +Q F S TDASD +
Sbjct: 909 IEWFKKFELPEEAQQAFHKLIQAFTSAGVLQHFDYHLPTRLETDASDFAIAGVLKQEHEG 968
Query: 790 -----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQS--SVVMVQSDNQTVVSYL 842
+F S S ++N+ I+ KE+ AV L+ +L S +++ +D++ + Y
Sbjct: 969 RWHPVTFYSRKMSSAEKNYEIHDKELLAVVACLTQWRHMLAGLPSQLVILTDHE-ALKYF 1027
Query: 843 RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
+ Q + ++ I L D+ + Q+ PG D+L+R +
Sbjct: 1028 KSQ---RRITGRQARWAILLADFDF----ILQYRPGDKGGEPDALTRRTDM 1071
>gi|302309734|ref|XP_002999545.1| hypothetical protein [Candida glabrata CBS 138]
gi|196049071|emb|CAR58026.1| unnamed protein product [Candida glabrata]
Length = 1504
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 115/469 (24%), Positives = 186/469 (39%), Gaps = 73/469 (15%)
Query: 457 PPLVPLCSLQHLATPVSSA-MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPV 515
P P+ + +PV A + I+E++ +G + DS F + + V K +G +R
Sbjct: 513 PGTAPIAKRAYRLSPVKRAELEQQIKELISSGRISPSDS--PFAAPVLFVKKKDGSSRLC 570
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
++ +GLN KF L + L +DL Y V + + + +
Sbjct: 571 VDYRGLNNATVKSKFPLPLIEDVLDSLHGAKIFSKLDLISGYHQVSVNEPDRYKTSFITH 630
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP F L N V S+ VVYLDD L I K
Sbjct: 631 EGQYQWNVMPFGLTNAPATFQRLMNAVLRPYISKF--CVVYLDDIL----------IYSK 678
Query: 636 LAVSILGSLGWIVN-LQKSSLSPAPV--------LQFLGIMWDPHLDRMWLPEDKQLTLG 686
L + +++ L+K SL P +QFLG + ++ + D +
Sbjct: 679 TREEHLHHISQVLDKLRKHSLYPKKSKCHFMLTQVQFLGHV----INANGISTDPE---- 730
Query: 687 NILRTLLASKTW----NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQAS-LLRLGAPHL 741
+ A K W N A+ LG ++ I + + AS L A
Sbjct: 731 ----KINAIKQWPILRNYKDAQRFLGLANYYRRFI-------KNFSKMASPLYEFAAKKN 779
Query: 742 TPINPAVLPKLEWWLNALPLSSPIF----PRQ-VQHFISTDASDLGWGSQVDS------- 789
T +AL +S+PI P+ Q ++ DASD G+ ++
Sbjct: 780 TKWTTECHNAFISLKDAL-ISAPILIAFDPKSPYQLTMTVDASDNCIGATLEYKDGRKPK 838
Query: 790 ---SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG 846
++LS + WHI KE++A+ AL +Q S V++ +D++T V+ R
Sbjct: 839 GVIAYLSHKLHSYETRWHIRDKELYAIVFALKKWTHYVQGSHVIIYTDHKTNVNLNRLAL 898
Query: 847 GTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ L+ +EV L + D+ I ++IPG N AD LSR P+
Sbjct: 899 LSPRLARWAEV----LANYDFEI----KYIPGPRNH-ADILSRPPGEPE 938
>gi|50402587|gb|AAT76628.1| polyprotein [Candida glabrata]
Length = 1504
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 115/469 (24%), Positives = 186/469 (39%), Gaps = 73/469 (15%)
Query: 457 PPLVPLCSLQHLATPVSSA-MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPV 515
P P+ + +PV A + I+E++ +G + DS F + + V K +G +R
Sbjct: 513 PGTAPIAKRAYRLSPVKRAELEQQIKELISSGRISPSDS--PFAAPVLFVKKKDGSSRLC 570
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
++ +GLN KF L + L +DL Y V + + + +
Sbjct: 571 VDYRGLNNATVKSKFPLPLIEDVLDSLHGAKIFSKLDLISGYHQVSVNEPDRYKTSFITH 630
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP F L N V S+ VVYLDD L I K
Sbjct: 631 EGQYQWNVMPFGLTNAPATFQRLMNAVLRPYISKF--CVVYLDDIL----------IYSK 678
Query: 636 LAVSILGSLGWIVN-LQKSSLSPAPV--------LQFLGIMWDPHLDRMWLPEDKQLTLG 686
L + +++ L+K SL P +QFLG + ++ + D +
Sbjct: 679 TREEHLHHISQVLDKLRKHSLYPKKSKCHFMLTQVQFLGHV----INANGISTDPE---- 730
Query: 687 NILRTLLASKTW----NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQAS-LLRLGAPHL 741
+ A K W N A+ LG ++ I + + AS L A
Sbjct: 731 ----KINAIKQWPILRNYKDAQRFLGLANYYRRFI-------KNFSKMASPLYEFAAKKN 779
Query: 742 TPINPAVLPKLEWWLNALPLSSPIF----PRQ-VQHFISTDASDLGWGSQVDS------- 789
T +AL +S+PI P+ Q ++ DASD G+ ++
Sbjct: 780 TKWTTECHNAFISLKDAL-ISAPILIAFDPKSPYQLTMTVDASDNCIGATLEYKDGRKPK 838
Query: 790 ---SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG 846
++LS + WHI KE++A+ AL +Q S V++ +D++T V+ R
Sbjct: 839 GVIAYLSHKLHSYETRWHIRDKELYAIVFALKKWTHYVQGSHVIIYTDHKTNVNLNRLAL 898
Query: 847 GTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ L+ +EV L + D+ I ++IPG N AD LSR P+
Sbjct: 899 LSPRLARWAEV----LANYDFEI----KYIPGPRNH-ADILSRPPGEPE 938
>gi|91176523|gb|ABE26651.1| pol polyprotein [Nosema bombycis]
Length = 1022
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 95/409 (23%), Positives = 175/409 (42%), Gaps = 64/409 (15%)
Query: 503 FLVPKGNGGTRPVLNLKGLNQFL------SPKKFSLINHFRIPSFLQKGDYMISIDLSQA 556
F++ K N R V++ + +N F+ PK + N +R+ + IDL
Sbjct: 297 FMIEKKNKELRLVVDFRKINNFIFDDVAAIPKIYD--NLYRVG----RSRVFSKIDLKNG 350
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVY 616
+ + + T + + + G +PFG+ + P+ F + + + + VY
Sbjct: 351 FNQIELATESRDVTSFTMFGLQYRYKRVPFGIKSGPKLFQKTISQILDGINN----CSVY 406
Query: 617 LDDFLLVNQDPRILEIQGKLAVSILGSLGWI---VNLQKSSLSPAPVLQFLGIMWDPHLD 673
+DD L+ + +E + +L L +N KS A ++ LG H++
Sbjct: 407 IDDILIYGE---TVEEHNETLNRVLDKLEQYHVKINFNKSEFG-ANKIEILG----NHIE 458
Query: 674 RMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASFV-------IPMGRLHSR 725
L D +L N+L + +KT + + +LG ++ + +F+ I + RL S+
Sbjct: 459 DGKLKID-TTSLKNMLE--IRNKTPSKKEIQGVLGVITWYRNFIPDVSRRLISLTRLLSK 515
Query: 726 RIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS 785
+ + ++ L + +L K LP + IF Q DASDLG G+
Sbjct: 516 ETTEEWGMEQIVV--LNSLKHDILTKAHL---TLPDVNKIFKLQC------DASDLGMGA 564
Query: 786 QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVS 840
+ + S +S ++N+ I +KEMFA+ + L L+Q + V++D++ +
Sbjct: 565 VLFQEHGVIGYFSKKFSDCEKNYSIVEKEMFAIVRTLEHFRYLIQGFPIQVETDSRNCIF 624
Query: 841 YLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
K +S +E K+ L D +I +PG N+VAD LSR
Sbjct: 625 ------ENKEISKRTERWKLILNEFDIKI----TNMPGKENNVADGLSR 663
>gi|301614366|ref|XP_002936672.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 995
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 106/470 (22%), Positives = 188/470 (40%), Gaps = 66/470 (14%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V ++ G AIPF PL + P + + +I+E L+ G ++ S G + +
Sbjct: 245 VDLLPGAAIPFGRIYPL---------SEPELTVLKDYIEENLKKGFIRPSTSPAG--AGI 293
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF------LQKGDYMISIDLSQA 556
F V K + RP ++ + LN K ++ N + +P L+ +DL A
Sbjct: 294 FFVEKKDHSLRPCIDYRDLN------KITIKNRYPLPLIPELFLRLRSARVFTKLDLRGA 347
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVV 615
Y V I+ + A +PFGL P A+ ++V + R + V+V
Sbjct: 348 YNLVRIRQGDEWKTAFRTRYGHFEYLVMPFGLCNTP---ATFQHFVNDIFRDFLDLFVIV 404
Query: 616 YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRM 675
YLDD L+ + K S L + L+K ++FLG + P
Sbjct: 405 YLDDILIFSSSLEEHRRHVKQVFSRLRAHKLFAKLEKCEFERL-TIEFLGFIISP----- 458
Query: 676 WLPEDKQLTLGNILRTLLASKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLL 734
+ +++ + R + A W +S +++ ++ FA+F + S+ I +L
Sbjct: 459 -----EGMSMDS--RKVSAVLDWPTPNSRKAVQRFVGFANFYRKFIKNFSKIISPITALT 511
Query: 735 -RLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-STDASDLGWGSQVDS--- 789
L TP L+ + P+ P + F+ DAS+ G+ +
Sbjct: 512 SSLKKFCWTPEAQQAFSDLKSRFTSAPILK--HPDPTRPFVLEVDASEYAIGAVLSQRND 569
Query: 790 --------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVV 839
+F S S +QN+ + +E+ A+ A LL+ + ++V SD++ +
Sbjct: 570 VQSLLHPIAFFSKKLSSSEQNYDVGDRELLAIKSAFQEWRHLLEGAAHPILVFSDHKN-L 628
Query: 840 SYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
YLR + L + L + H+ F PG+ N AD+LSR
Sbjct: 629 EYLR-----SAKRLRPRQARWALFFSRFNFHV--TFRPGSKNGKADALSR 671
>gi|388856364|emb|CCF49913.1| uncharacterized protein [Ustilago hordei]
Length = 999
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 99/392 (25%), Positives = 154/392 (39%), Gaps = 67/392 (17%)
Query: 454 SAKPPLVPLCSLQHLATPVS-SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT 512
KPP PL +L P S + ++ E L+ G ++ S + S + VPK +GG
Sbjct: 172 GGKPPQGPL----YLKGPKEMSELRRYLDENLKKGFIR--PSKSPAQSPVLFVPKKDGGL 225
Query: 513 RPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL 572
R ++ +GLN+ + L L+K +DL AY + I + A
Sbjct: 226 RLCVDYRGLNEITVKNRAPLPLIEEQLFLLRKARIYTKLDLRAAYNLIWIAKGDEWKTAF 285
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILE 631
+ +PFGLA AP F S N + R G+ VVVYLDDFL+ +
Sbjct: 286 GTQLGLYEYLVMPFGLANAPAHFQSFIN---DIFRDIIGIYVVVYLDDFLIFSDTEEAHV 342
Query: 632 IQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT 691
++ L S L K V +FLG + L + + +K T+
Sbjct: 343 KHVTEVLTRLRSNRLFAKLSKCEFHTKTV-EFLGYII--KLTGIEMDPEKVCTV------ 393
Query: 692 LLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
K W + +S + +L FA+F + R I A R+ P + + P
Sbjct: 394 ----KEWPMPESIHDIQRFLGFANF-------YRRFI---AHFARIAKPLTSLVKP---- 435
Query: 751 KLEWWLN-ALPLSS-PIFPRQVQHFIS----------------TDASDLGWGSQVDS--- 789
+EW+ LP + F + +Q F S TDASD +
Sbjct: 436 -IEWFKKFELPEEAQQAFHKLIQAFTSAGVLQHFDYHLPTRLETDASDFAIAGVLKQEHE 494
Query: 790 ------SFLSGLWSREQQNWHINKKEMFAVHQ 815
+F S S ++N+ I+ KE+ AV+Q
Sbjct: 495 GRWHPVAFYSRKMSSAEKNYEIHDKELLAVYQ 526
>gi|301621284|ref|XP_002939986.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1502
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 114/497 (22%), Positives = 199/497 (40%), Gaps = 84/497 (16%)
Query: 428 RFVDAWIRLGAPA-PLVRI-------VSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLH 479
RF+D + GA P RI +SG IPF PL + P + + +
Sbjct: 476 RFLDVFDEKGADELPPHRIYDCPIDLLSGATIPFGRIYPL---------SEPELTVLKGY 526
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I+E L+ G ++ S G + +F V K + RP ++ + LN+ ++ L +
Sbjct: 527 IEENLDKGFIRPFTSPAG--AGIFFVEKKDHSLRPCIDYRDLNKITVKNRYPLPLISELF 584
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L+ +DL AY V I+ + A +PFGL AP A+
Sbjct: 585 VRLRSAQVFTKLDLRGAYNLVRIRQGDEWKTAFRTRYGHFEYLVMPFGLCNAP---ATFQ 641
Query: 600 NWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
++V + R + V+VYLDD L+ + + S L + L+K +
Sbjct: 642 HFVNDIFRDFLDLFVIVYLDDILIFSSSLEEHRVHVTKVFSRLRAHKLFAKLEKCEFEKS 701
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVI 717
+ +FLG++ P +++ + R + A W + + +++ ++ FA+F
Sbjct: 702 SI-EFLGLVISP----------DGISMDS--RKVSAVLDWPIPNDRKAVQRFVGFANFYR 748
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW----------LNALPLSSPIF- 766
+ SR I AP +T + +V K +W L ++PI
Sbjct: 749 KFIKDFSRVI----------AP-ITALTSSV--KKNFWSSEAQQAFTELKRSFTTAPILR 795
Query: 767 -PRQVQHFI-STDASDLGWGSQVDS-----------SFLSGLWSREQQNWHINKKEMFAV 813
P FI DAS+ G+ + +F S S+ ++N+ + +E+ A+
Sbjct: 796 HPDPACPFILEVDASEHAVGAVLSQRADFKNQLHPVAFFSRKLSQSERNYDVGDRELLAI 855
Query: 814 HQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI 871
A LL+ + ++V SD++ + YLR K L +F R +
Sbjct: 856 KSAFQEWRHLLEGANHPILVFSDHKN-LEYLR---SAKRLHPRQARWALFFS----RFNF 907
Query: 872 LAQFIPGAYNSVADSLS 888
F PG+ N AD+LS
Sbjct: 908 HVTFRPGSKNGKADALS 924
>gi|326676080|ref|XP_003200502.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1280
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 105/450 (23%), Positives = 180/450 (40%), Gaps = 43/450 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P + M ++ + L G+++ S G + F V K +G RP ++ +G
Sbjct: 332 PRGRLFSLSAPERATMEKYLSDSLAAGIIRSSSSPAG--AGFFFVKKKDGSLRPCIDYRG 389
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 390 LNDITIKNRYPLPLMSTAFEILQGARVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 449
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL+ AP F +L N V + ++ V VYLDD L+ + ++ + +
Sbjct: 450 YLVLPFGLSNAPAVFQALVNDVLRDMINKF--VFVYLDDILIFSPSLQVHIQHVRRVLQR 507
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L V +K L A + FLG + RM PE + A W +
Sbjct: 508 LLENQLFVKAEK-CLFHAQSVPFLGSIISVEGIRM-DPEKVR-----------AVSDWPV 554
Query: 701 DSAR-SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNA 758
+R +L +L FA+F R +S+ +L + I A +L+
Sbjct: 555 PGSRKALQQFLGFANFYRRFIRNYSQVAAPLTALTSTKSHFCWSIAAQAAFRELKSRFTT 614
Query: 759 LPLSSPIFPRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
P+ + P + F + DAS++G G+ + ++ S S ++N+ I
Sbjct: 615 APIL--VLPDPARQFVVEVDASEVGVGAVLSQICPKDNKLHPCAYYSHRLSPAERNYDIG 672
Query: 807 KKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ + +V +D++ + Y++ K L+ +F
Sbjct: 673 NRELLAVRLALGEWRHWLEGAAEPFVVWTDHRN-LEYIQ---TAKRLNSRQARWALFF-- 726
Query: 865 QDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
R + + PG+ N D+LSR P
Sbjct: 727 --GRFNFTLSYRPGSKNGKPDALSRCFGTP 754
>gi|427791983|gb|JAA61443.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 650
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 105/425 (24%), Positives = 167/425 (39%), Gaps = 66/425 (15%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFL 504
I +G A+P P V L Q + + +ML G+++R S++ + S + L
Sbjct: 239 IETGDALPLKCNPRPVSLAKRQ--------IIDGLLDDMLSAGIIRR--SSSSWASPIVL 288
Query: 505 VPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKT 564
VPK +G R ++ + LN + L I L Y ++D S+ Y V +
Sbjct: 289 VPKKDGSHRLCVDYRRLNGVTRKDAYPLPTISSIVGNLGDARYFTTLDASKGYLQVRMGE 348
Query: 565 THQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVN 624
Q A + + + T +PFGL AP F L + V L ++ + YLDD ++ +
Sbjct: 349 RDQFKTAFTSHRGLFEFTRMPFGLCNAPATFQRLMDRV--LGEAKWSYCMCYLDDIVIYS 406
Query: 625 QDPRILEIQGKLAVSILGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK 681
R E +L + G +N K+ L+ + HL L E
Sbjct: 407 ---RTFEEHLAHVADVLERVRAAGMTLNPAKAQLAQTRI----------HLLGFTLGEGS 453
Query: 682 QLTLGNILRTLLASKTWNLDSA-RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
LR +L S R LG ++F IP S RL AP
Sbjct: 454 IEPDREKLRAILDFPVPKDGSGLRRFLGMVNFYRSFIP-------------SCARLQAPL 500
Query: 741 LTPINPAVLPKLEWWLNA----LPLSSPI-------FPRQVQHF-ISTDASDLGWGS--- 785
+ + K +W LSS I P + F + TDASDLG G+
Sbjct: 501 TKLLGKSA--KWQWGPEQQEAFCRLSSAIAETAQLRLPDLTRPFVVQTDASDLGLGAVLL 558
Query: 786 -QVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVV 839
+ D +F S ++N+ + +KE A+ AL L + +VQ+D+ + +
Sbjct: 559 QEYDGVLQPLAFASRSLVPAEKNYSVTEKECLAIVFALRKFDVYLDGTKFVVQTDH-SAL 617
Query: 840 SYLRR 844
S+L R
Sbjct: 618 SWLMR 622
>gi|18450266|ref|NP_569141.1| polyprotein [Tobacco vein clearing virus]
gi|6425075|gb|AAF08289.1|AF190123_3 polyprotein [Tobacco vein clearing virus]
Length = 635
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 99/448 (22%), Positives = 189/448 (42%), Gaps = 57/448 (12%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK----GNGGTRPVLNLKGLNQFLSPKK 529
+ +HI+++L+ ++ +S + S F+V K G +R V++ + LN
Sbjct: 219 TEFKIHIKDLLDNKYIQ--ESNSKHTSPAFIVNKHSEQKRGKSRMVIDYRNLNAKTKTYN 276
Query: 530 FSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
+ + N +Q +Y D ++H+ ++ ++ A + LPFG
Sbjct: 277 YPIPNKILKIRQIQGYNYFSKFDCKSGFYHLKLEDESKKLTAFTVPQGFYEWNVLPFGYK 336
Query: 590 TAPQAFAS-LSNWVASLLRSRGMRVVVYLDDFLLVN--QDPRILEIQGKLAVSILGSLGW 646
AP + + N+ L +VY+DD LL + QD I ++ K A I+ + G
Sbjct: 337 NAPGRYQHFMDNYFNQL-----ENCIVYIDDILLYSRTQDEHI-KLLEKFA-HIIENSGI 389
Query: 647 IVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA--- 703
++ K+ + + +FLGI D + G ++T + K NLD
Sbjct: 390 SLSKTKAEIMKNQI-EFLGIQIDKN--------------GIKMQTHIVQKIINLDENIDT 434
Query: 704 ----RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
+S LG ++ IP +L Q L + + + K++ L
Sbjct: 435 KKKLQSFLGIVNQVREYIP--KLAENLKPLQKKLKKDVEYSFDEKDKEQIKKIKILCKKL 492
Query: 760 P-LSSPIFPRQVQHFISTDASDLGWGS-----------QVDSSFLSGLWSREQQNWHINK 807
P L P ++ + + TD+S+ +G + + SG ++ Q+ W IN+
Sbjct: 493 PKLYFPDENKKFTYIVETDSSNYSYGGVLKYRYNKEKIEHHCRYYSGSYTEPQEKWEINR 552
Query: 808 KEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
KE+FA+++ L P + + +V++DN V ++ R+ + E+ ++ L ++
Sbjct: 553 KELFALYKCLLAFEPYIVYTRFIVRTDNTQVKWWITRK--VQDSVTTKEIRRLVLNILNF 610
Query: 868 RIHILAQFIPGAYNSVADSLSRSKSLPD 895
I + I N VAD LSR +S P+
Sbjct: 611 TFTI--EIINTNKNVVADYLSR-QSYPN 635
>gi|17932882|emb|CAC80811.1| polymerase [Stork hepatitis B virus]
Length = 790
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 100/222 (45%), Gaps = 13/222 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
R+FLV K + T R V++ KG N PK +S N + + G IS+DL
Sbjct: 393 RIFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPKYWS-PNLTALRRIVPLGMPRISLDL 451
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASLLRSRGMR 612
SQA++H+P+ LA+S V P G+ +P + + A + R +
Sbjct: 452 SQAFYHLPLNPASSSRLAVSDGKQVYYFRKAPMGVGLSPFLLHLFTTAIGAEISRRFNVW 511
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI-MWDPH 671
Y+DDFLL + R L + L G +N K + SP ++FLG + + H
Sbjct: 512 TFSYMDDFLLCHPSARHLNSISHAVCTFLQEFGIRINFDKMTPSPVTTIRFLGYEISNQH 571
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
L + E + L +++ + + ++ + L+G+L+F
Sbjct: 572 LK---IEESRWNELRQVIKKIKVGQWYDWKCIQRLIGHLNFV 610
>gi|308489628|ref|XP_003107007.1| hypothetical protein CRE_17270 [Caenorhabditis remanei]
gi|308252895|gb|EFO96847.1| hypothetical protein CRE_17270 [Caenorhabditis remanei]
Length = 1385
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 98/438 (22%), Positives = 188/438 (42%), Gaps = 53/438 (12%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSL 532
+ +S ++ + + GV+ +D + + + + LV K NG R + GLN + + L
Sbjct: 500 TTVSDELERLQQAGVISPVDHSE-WAAPIVLVKKKNGSLRMCADFSTGLNDAIQQHQHPL 558
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I S L G Y IDL++AY + I ++ L ++ + + LPFG+ +AP
Sbjct: 559 PTADDIFSTLNGGKYFSQIDLAEAYLQIEIDEQAKQMLCINTHRGLYRYNRLPFGVKSAP 618
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
+F + + + S L V YLDD ++ + + +S + G V ++K
Sbjct: 619 GSFQQIMDSMTSGLDG----VAAYLDDIIITGSSVAEHNQRLETVMSRIQDFGLRVRIEK 674
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
+ +P + FLG + D R P+ ++++ +R + + N RS LG + F
Sbjct: 675 CTFL-SPKITFLGFIIDKDGRR---PDPEKVS---AIRHMPVPQ--NESQVRSFLGLIQF 725
Query: 713 -ASFVI-------PMGRLHSRRIQ-RQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSS 763
SFV P+ L + ++ + S + H+ I + L L + LP+
Sbjct: 726 YGSFVKELFKLRPPLDALTKKDVEFKWTSECQNAFDHIKQILHSDLL-LTHYDPKLPI-- 782
Query: 764 PIFPRQVQHFISTDASDLGWGSQVDSSF----------LSGLWSREQQNWHINKKEMFAV 813
++ DAS G G+ + F +S + Q+N+ +KE F +
Sbjct: 783 ---------IVAADASQYGIGAVISHRFPDGSEKAIYHISKALTAPQRNYSQIEKEAFGL 833
Query: 814 HQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
A++ + +++D++ ++S + G S + +++ I LL+ D+ I
Sbjct: 834 ITAVTKFHRFIHGRHFTLRTDHKPLLSIFGEKKGIPVYS-ANRLQRWAIILLNYDFNIEY 892
Query: 872 LAQFIPGAYNSVADSLSR 889
+ G AD+LSR
Sbjct: 893 INTHDFGQ----ADALSR 906
>gi|326673480|ref|XP_003199896.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1280
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 105/450 (23%), Positives = 180/450 (40%), Gaps = 43/450 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P + M ++ + L G+++ S G + F V K +G RP ++ +G
Sbjct: 332 PRGRLFSLSAPERATMEKYLSDSLAAGIIRSSSSPAG--AGFFFVKKKDGSLRPCIDYRG 389
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 390 LNDITIKNRYPLPLMSTAFEILQGARVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 449
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL+ AP F +L N V + ++ V VYLDD L+ + ++ + +
Sbjct: 450 YLVLPFGLSNAPAVFQALVNDVLRDMINKF--VFVYLDDILIFSPSLQVHIQHVRRVLQR 507
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L V +K L A + FLG + RM PE + A W +
Sbjct: 508 LLENQLFVKAEK-CLFHAQSVPFLGSIISVEGIRM-DPEKVR-----------AVSDWPV 554
Query: 701 DSAR-SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNA 758
+R +L +L FA+F R +S+ +L + I A +L+
Sbjct: 555 PGSRKALQQFLGFANFYRRFIRNYSQVAAPLTALTSTKSHFCWSIAAQAAFRELKSRFTT 614
Query: 759 LPLSSPIFPRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
P+ + P + F + DAS++G G+ + ++ S S ++N+ I
Sbjct: 615 APIL--VLPDPARQFVVEVDASEVGVGAVLSQICPKDNKLHPCAYYSHRLSPAERNYDIG 672
Query: 807 KKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ + +V +D++ + Y++ K L+ +F
Sbjct: 673 NRELLAVRLALGEWRHWLEGAAEPFVVWTDHRN-LEYIQ---TAKRLNSRQARWALFF-- 726
Query: 865 QDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
R + + PG+ N D+LSR P
Sbjct: 727 --GRFNFTLSYRPGSKNGKPDALSRCFGTP 754
>gi|313227295|emb|CBY22441.1| unnamed protein product [Oikopleura dioica]
Length = 804
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 78/374 (20%), Positives = 147/374 (39%), Gaps = 25/374 (6%)
Query: 537 RIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
+I FL+KG M +D + HV + + Y G FG+ P +
Sbjct: 272 KILPFLKKGMLMAKVDDKSGFHHVQLDPFSRNMACCQYGGIQFRYKAAAFGIPAVPGVYQ 331
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG-----------KLAVSILGSLG 645
+++ ++LR G +YLDD + + + E Q L + ++ + G
Sbjct: 332 LVNSVPVNVLRKAGHHCFLYLDDRIFLIEPKSKSEEQALRRGELVPEGPYLGLLLMTAAG 391
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARS 705
+N KS L P +++LG D + +P +K + T + +
Sbjct: 392 TYINRAKSVLLPTSKMEYLGFFLDTDRCTIKIPTEKLEKFKKEASDIRKKSTCDYKALEK 451
Query: 706 LLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
L G + S V+ RL+ RR+ +L+ + + + +L W+++ ++
Sbjct: 452 LRGKMCSFSLVVLNMRLYIRRV--TYALVLAEESGIVKVTSDLKEELSLWIDSKTIAKET 509
Query: 766 --FPRQVQHF---ISTDASDLGWGSQVDSSFLSGL--WSREQ--QNWHINKKEMFAVHQA 816
+ V F + TDAS G +DS + W +I KE +AV
Sbjct: 510 SWLKKGVMSFQTSVHTDASSFAAGIFIDSLGIEVYVPWGELDAVARDNIFVKEAWAVLYC 569
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
+ L++ + +DN+ V Y G+++ +L + KI + + +L ++
Sbjct: 570 IETYGHLMRDKTIHFFNDNKVV--YHAFHIGSRNQALNRIIRKIHEKADELNTELLITWV 627
Query: 877 PGAYNSVADSLSRS 890
P +AD SRS
Sbjct: 628 PTD-KQLADEASRS 640
>gi|340367723|ref|XP_003382403.1| PREDICTED: hypothetical protein LOC100639764 [Amphimedon
queenslandica]
Length = 342
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 87/327 (26%), Positives = 145/327 (44%), Gaps = 34/327 (10%)
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVV-YLDDFLLVNQDPRILEI 632
++G V+ LPFGL +AP+ F+++++ +L G+++ + YLDDF++V D +
Sbjct: 7 WDGVVVIDKFLPFGLRSAPKIFSAVADAAQWVLLHNGVKLSLHYLDDFIMVEGDLVAAQE 66
Query: 633 QGKLAVSILGSLGWIVNLQKSSLS-PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT 691
+L S LG + L+ S L P+ L FLGI D ++ LP +K L ++L
Sbjct: 67 AKRLLCSTFEKLG--LPLEPSKLEGPSTCLTFLGIEVDKFNLQLRLPAEKLARLMDLLEE 124
Query: 692 LLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
K SL G L +A+ V G +I+ +N
Sbjct: 125 THGRKHILKKELESLTGLLQYAAKV---GSAPDHKIR---------------LNKPARAD 166
Query: 752 LEWW------LNALP-LSSPIF-PRQVQHFISTDAS-DLGWGSQVDSSFLSGLWSREQQN 802
+ WW N + LS P+ P V+ F +DAS G G+ + + W ++
Sbjct: 167 VMWWQMFVSSWNGISMLSGPMNSPADVEVF--SDASGSWGGGAFCFPQWFAFKWPLALES 224
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
I KE+ V A +L + +V V S N V+Y+ + +K L+ + +
Sbjct: 225 TSIQVKELIPVVMAAALFGSSWKGKLV-VSSVNNEAVAYILNKTHSKESHLMHLIRLLVF 283
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSR 889
+ + A+ IP NS+AD+LSR
Sbjct: 284 YAAHFDFWFRAEHIPEKRNSLADALSR 310
>gi|388858586|emb|CCF47928.1| uncharacterized protein [Ustilago hordei]
Length = 1157
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 102/407 (25%), Positives = 170/407 (41%), Gaps = 35/407 (8%)
Query: 513 RPVLNLKGLNQFLSPKKFSLINHFRIPSFL-----QKGDYMISIDLSQAYFHVPIKTTHQ 567
+P + L +N + P F I + + + + +G + DL A+ H+ I
Sbjct: 542 QPGMRLPSVNDGIHPS-FVSIRYETLDTIIDFVRDHQGASLWKADLEDAFRHIIIAENDA 600
Query: 568 RFLALSYNGDVLAMTCLPFGLATAP---QAFASLSNWVASL-LRSRG------MRVVVYL 617
R L ++G L FG ++P FA +WV S L+S V YL
Sbjct: 601 RLLGFHFDGRYYQECALAFGGRSSPFLFNLFAKFLHWVTSFALQSASPSPTSHSEVSHYL 660
Query: 618 DDFLLVNQDPRILEIQGKLAVSILGS-LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMW 676
D+F + DP A+SI+ + LG+ ++ +K+ S L+ LGI D
Sbjct: 661 DNFFGAS-DPTSNASTPIQALSIVAAVLGFRLSHKKTVWSTT-HLEILGIELDSVAQMAS 718
Query: 677 LPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRL 736
+ + + + + ++ +L + + G+L F + V P GR RRI + R
Sbjct: 719 ITDQHHQHILGLCQRIIDQGWASLLELQQITGHLQFVTRVAPHGRAFLRRIYDAVTSHRR 778
Query: 737 GAPHLTPINPAVLPKLEWWL------NALPLSSPIFPRQVQHFISTDASDLGWGSQVDS- 789
AP I+ A +L WW + + L P P ++H + TDAS GS
Sbjct: 779 -APFGRRISRATRDELIWWTSMLSAWDGMSLLQP-SPLIIEH-VWTDASKCSIGSHCGHM 835
Query: 790 SFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQS-SVVMVQSDNQTVVSYLRR 844
+ ++S E H K E AV +AL L P V++ DN+ V L++
Sbjct: 836 EHPTAVFSWELSRCHCQKDIRFLEALAVLEALRLFSPAWPGPRRVILYVDNENVEHGLQK 895
Query: 845 QGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
G ++ +IF L I++ + A N++AD+LSR +
Sbjct: 896 -GSIRNPMTQVLFREIFALCLQRHINLQVTSVRSAANTLADALSRRR 941
>gi|392577009|gb|EIW70139.1| hypothetical protein TREMEDRAFT_30259, partial [Tremella
mesenterica DSM 1558]
Length = 1125
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 105/447 (23%), Positives = 182/447 (40%), Gaps = 60/447 (13%)
Query: 494 STTGFLSRLFLVPK-GNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISID 552
S + + S +F++PK G R V++ + LN+ P + L +I L K Y +D
Sbjct: 473 SNSPYGSPMFMIPKKAEGQWRMVIDYRKLNEATIPDAYPLPLIGQITEELGKARYFSKLD 532
Query: 553 LSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMR 612
L AY + + H+ A + + GL AP F N V L G
Sbjct: 533 LIGAYQLLRVTEGHEHLTAFRTQYGMFESLVVRDGLRNAPAVFQHFLNEVFRELLGNG-- 590
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLA--VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP 670
VVVY+DD L+ + E++G A +L V K V+ FLG
Sbjct: 591 VVVYIDDILIYGNT--LEELRGTTAKVFEVLRKASLYVKASKCEFERDSVV-FLG----- 642
Query: 671 HLDRMWLPEDKQLTLGNILRTLLAS--KTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQ 728
++ K +++ + S + NL +R +G +S+ +P +R I
Sbjct: 643 -----FVVSSKGVSVNPEYIDAITSFPRPKNLRESRGFIGVVSYYRRFVPNFSKIARPIN 697
Query: 729 RQASLLRLGAPHLTPIN-PAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG--- 784
L R P + + + +L+ + P+ + P ++ + TDAS GWG
Sbjct: 698 ---DLTRKEVPFVWGVEQESAFKELKARMCTAPVLAHFDP-TLKTILQTDASFFGWGFII 753
Query: 785 SQVDS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQT 837
SQ+++ + SG ++ Q N+ + +KE AV + LL V +D+
Sbjct: 754 SQINTAGQEHPVAIESGAFNTAQLNYTVGEKEFLAVVEGFRRRRHLLLQVETTVLTDHLN 813
Query: 838 VVSYLR------RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
+ ++ RQG VE++ ++R ++ + PG S+ D LSR
Sbjct: 814 LTYWMEPKQLSPRQG--------RWVEEL----ANFRFKMV--YRPGTQASLPDGLSRR- 858
Query: 892 SLPDWHLSRSAT--EQIFLKWGVPCID 916
D+H + +T ++ L G+P D
Sbjct: 859 --ADYHSGKGSTMVQESNLIQGLPKFD 883
>gi|17932890|emb|CAC80817.1| polymerase [Stork hepatitis B virus]
Length = 790
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 100/222 (45%), Gaps = 13/222 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
R+FLV K + T R V++ KG N PK +S N + + G IS+DL
Sbjct: 393 RIFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPKYWS-PNLTALRRIVPLGMPRISLDL 451
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASLLRSRGMR 612
SQA++H+P+ LA+S V P G+ +P + + A + R +
Sbjct: 452 SQAFYHLPLNPASSSRLAVSDGKQVYYFRKAPMGVGLSPFLLHLFTTAIGAEISRRFNVW 511
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI-MWDPH 671
Y+DDFLL + R L + L G +N K + SP ++FLG + + H
Sbjct: 512 TFSYMDDFLLCHPSARHLNSISHAVCTFLQEFGIRINFDKMTPSPVTTIRFLGYEISNQH 571
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
L + E + L +++ + + ++ + L+G+L+F
Sbjct: 572 LK---IEESRWNELRQVIKKIKVGQWYDWKCIQRLIGHLNFV 610
>gi|2133581|pir||S68306 pol polyprotein, truncated - red flour beetle retrotransposon Woot
gi|805077|gb|AAC47271.1| protease, reverse transcriptase and RNase H [Tribolium castaneum]
Length = 712
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 111/482 (23%), Positives = 195/482 (40%), Gaps = 73/482 (15%)
Query: 381 SSPQNLEPPGRVSLKVQTLQKPQ-RCSSPVNPPADSRIGAELVGGRLRRFVDAWIRLGAP 439
+ Q+LE P + KV L+ P+ ++ + R L G ++ I+L
Sbjct: 245 NKKQHLESPSALQAKVDPLKIPEAEKRKLIHLLQEYRCIFSLRPGLTHKYTHE-IKLHDK 303
Query: 440 APLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFL 499
P ++ Y +PF+ +P A+ IQEML+ GV+KR + +
Sbjct: 304 TPFLK--RPYPVPFALRP-----------------AVDATIQEMLDLGVIKR--EASPYA 342
Query: 500 SRLFLVPKGNGGTRPVLNLKGLNQFLS------PKKFSLINHFRIPSFLQKGDYMISIDL 553
S + + K +G R L+ + +N + P L+ F + YM +IDL
Sbjct: 343 SPMTVGKKKDGTVRICLDARMINSKMIADCESPPAADELLRRF------HEIRYMSTIDL 396
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASLLRSRGMR 612
+Y+ +P+ +++ A YNG LPFGL TA +F+ + V + +R
Sbjct: 397 RSSYWQIPLSPESRQYTAFLYNGRSYTYQVLPFGLKTAVGSFSRAMDVVLGTEVRE---F 453
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHL 672
VV Y+DD L+ ++ + L +NL+KS+ V +FLG H+
Sbjct: 454 VVNYIDDLLVASETLNEHLEHLRQVFEKLKQARMTINLEKSNFIQKEV-KFLG-----HI 507
Query: 673 DRMWLPEDKQLTLGNIL---RTLLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQ 728
LT+ I + A + + + + + +L +F +S Q
Sbjct: 508 ----------LTINGIKADPEKISAIRNFPVPQKTKHVRAFLGLCNFYRKFCARYSAATQ 557
Query: 729 RQASLLRLGAPHLTPIN--PAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS- 785
LLR G N A + +L A+ L P + ++ TD+S G G+
Sbjct: 558 DLNKLLRKGEKWRWGRNEQEAFDRVKDLFLEAVLLRYPDLNKIF--YVQTDSSGYGLGAE 615
Query: 786 ----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQ 836
Q D S F S + N+ +KE+ V AL +Q + +++++D+Q
Sbjct: 616 LYQIQEDGSRGVIAFASRSLRGPELNYTTTEKELLGVIFALHKFRIYIQVTKIIIRTDHQ 675
Query: 837 TV 838
+
Sbjct: 676 AL 677
>gi|308463529|ref|XP_003094038.1| hypothetical protein CRE_16387 [Caenorhabditis remanei]
gi|308248701|gb|EFO92653.1| hypothetical protein CRE_16387 [Caenorhabditis remanei]
Length = 2379
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 107/506 (21%), Positives = 207/506 (40%), Gaps = 75/506 (14%)
Query: 405 CSSPVNPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCS 464
C P D+ +G++ GR+ +F G + +V + IP A+P
Sbjct: 1373 CKYP-----DAFVGSD---GRIGKFK------GVTTHHIELVDDHTIP-QARP------- 1410
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQF 524
L + +++M + +++ +S++ + S L ++PK NG R V++ + LN
Sbjct: 1411 -YRLNPEQKDKLEKELRKMRDNDLIE--ESSSPYTSPLLMIPKSNGEIRIVIDYRKLNLI 1467
Query: 525 LSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
P+ + + N I KG D+ Q + HV + H+ A + V +
Sbjct: 1468 TRPRTYIMPNTLDITEEASKGRIFSVFDICQGFHHVKMHQAHKERTAFCCHLGVFQYKYM 1527
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL 644
P GL +P F + V +++Y+DD +LV++ + + ++ +
Sbjct: 1528 PMGLRGSPDTFQRAMSEVQQKFSG---SMIIYVDDIVLVSETEQQHLEDLEEFFKLMIQM 1584
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
G + +KS + + + FLG +D + + +K + +R KT + R
Sbjct: 1585 GLKLKAEKSQIGRSKIT-FLG--FDIENNTIQPNGEKTKS----IREFPVPKT--VTEIR 1635
Query: 705 SLLGYLS--------FASFVIPMGRLHSR------RIQRQASLLRLGA-----PHLTPIN 745
LG S FA+ V P+ L + + ++Q + + P LT N
Sbjct: 1636 QFLGMASYFRRFIPGFATIVSPLNNLLRKETEFVWKKEQQDAFENVKEKLISPPILTTPN 1695
Query: 746 PAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHI 805
+ +L + + +++ + RQ + + +GS+ + S + E
Sbjct: 1696 NTGIFELHTDASKVGIAAVLMQRQ-----DGELKVIAYGSRPTTPVESRYPAIEL----- 1745
Query: 806 NKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
E A+ L++ P + V+V +D+ + S L R+ T S L+ E I Q
Sbjct: 1746 ---ESLAISWGLTVYKPYIFGKKVIVITDHLPLKSLLHRKEKTMSGRLMRH-EAII---Q 1798
Query: 866 DWRIHILAQFIPGAYNSVADSLSRSK 891
+ + I ++ PG N VAD+LSR +
Sbjct: 1799 QFDVEI--RYRPGKENHVADTLSRQR 1822
>gi|17932894|emb|CAC80820.1| polymerase [Stork hepatitis B virus]
Length = 790
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 100/222 (45%), Gaps = 13/222 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
R+FLV K + T R V++ KG N PK +S N + + G IS+DL
Sbjct: 393 RIFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPKYWS-PNLTALRRIVPLGMPRISLDL 451
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASLLRSRGMR 612
SQA++H+P+ LA+S V P G+ +P + + A + R +
Sbjct: 452 SQAFYHLPLNPASSSRLAVSDGKQVYYFRKAPMGVGLSPFLLHLFTTAIGAEISRRFNVW 511
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI-MWDPH 671
Y+DDFLL + R L + L G +N K + SP ++FLG + + H
Sbjct: 512 TFSYMDDFLLCHPSARHLNSISHAVCTFLQEFGIRINFDKMTPSPVTTIRFLGYEISNQH 571
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
L + E + L +++ + + ++ + L+G+L+F
Sbjct: 572 LK---IEESRWNELRQVIKKIKVGQWYDWKCIQRLIGHLNFV 610
>gi|308467446|ref|XP_003095971.1| hypothetical protein CRE_06930 [Caenorhabditis remanei]
gi|308244240|gb|EFO88192.1| hypothetical protein CRE_06930 [Caenorhabditis remanei]
Length = 1869
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 107/475 (22%), Positives = 190/475 (40%), Gaps = 73/475 (15%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V I + +P +P VP+ + + HI +L + + +S T + S +
Sbjct: 909 VHIYTNTEVPVRGRPYRVPV--------KYQAELEKHINGLLLSNRI--TESNTPWTSPI 958
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFH 559
LV K NG R L+ + LN+ P + L RI + ++K Y S+D++ Y
Sbjct: 959 VLVKKKNGSLRVCLDFRKLNEVTIPDNYPLP---RIDTIIEKVGNARYFSSLDMANGYLQ 1015
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA-SLSNWVASLLRSRGMRVVVYLD 618
+ + V A T LPFGL +A F +L +A L V+VY+D
Sbjct: 1016 LRLDAESSYKCGFITENKVYAYTHLPFGLKSAASYFQRALKTVLAGLEED----VMVYID 1071
Query: 619 DFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVL----QFLGIMWDPHLDR 674
D L+ ++ + + +S + +K ++ + G + P+
Sbjct: 1072 DVLIYSKTFEEHLVTLRHVLSRFRQFSLKASPKKCEFVKQSIVFLGHEISGTSYSPN--- 1128
Query: 675 MWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL-----SFASFVIPMGRLHSRRIQR 729
Q + +I R + L + G+ +FA P+ RL +R+ Q+
Sbjct: 1129 -------QANVDSIERLPTPNNVPELKRFIGMAGFFRKFIENFAGIAEPLTRL-TRKEQK 1180
Query: 730 QASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHF-ISTDASDLGWGS--- 785
+ KL+ L + P+ S FP + F I TDAS + G+
Sbjct: 1181 FV---------WSEEQQEAWMKLKTALTSKPILS--FPNYEKPFHIFTDASSVAQGAVLM 1229
Query: 786 -QVDS--------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQ 836
D+ +++S S E+ W + E+ A+ AL P + S +++ SD++
Sbjct: 1230 QATDTDPRNFHVIAYVSRTLSDEETRWTAIQIELGAIIFALRQFKPYVCLSKIVLHSDHR 1289
Query: 837 TVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
+ L + +L+ + + Q + I I+ I G N+VAD LSR+K
Sbjct: 1290 PLTFLLAKNKVNDNLA------RWLVELQQYDIEIV--HIEGKKNTVADCLSRAK 1336
>gi|301608169|ref|XP_002933672.1| PREDICTED: hypothetical protein LOC100488716 [Xenopus (Silurana)
tropicalis]
Length = 1359
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 98/455 (21%), Positives = 196/455 (43%), Gaps = 86/455 (18%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
A++ +Q ML+ GV+++ S + + S + LVPK +G R + + +N+ KF
Sbjct: 878 AVTEEVQRMLDLGVIEK--SKSEWSSPIVLVPKPDGSLRFCNDFRKVNEV---SKFDAYP 932
Query: 535 HFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATA 591
R+ +++ Y+ ++DL++ Y+ VP+ + + A S + +PFGL A
Sbjct: 933 MPRVDELIERLGPARYITTLDLTRGYWQVPLTESAKEKTAFSTPQGLFQYVRMPFGLQGA 992
Query: 592 PQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD-----PRILEIQGKLAVSILGSLGW 646
P F + + +L + YLDD ++ ++D PR+ + ++ + G
Sbjct: 993 PATFQRMMD---HILSPHQLYASAYLDDVVIFSRDWQSHLPRV-----QAVLNSIRDAGL 1044
Query: 647 IVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN-----ILRTLLASKTW--- 698
N +K ++ ++LG T+G + + A + W
Sbjct: 1045 TANPKKCAIGLEEA-RYLG-----------------YTIGRGVIKPQVNKVEAIRNWPQP 1086
Query: 699 -NLDSARSLLGYL--------SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVL 749
N R+ LG + +FA+ P+ L +++++++ GA
Sbjct: 1087 VNKKQVRTFLGMVGYYRRFIPNFATMAAPLTDLTK---GKESTMVKWGAE-----TEKAF 1138
Query: 750 PKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQVDS---------SFLSGLWSRE 799
+L+ L P L +P F +Q + TDAS +G G+ + +LS +
Sbjct: 1139 QELKTALCQQPVLVAPDFTKQF--MVQTDASGVGVGAVLSQLVRGEEHPVVYLSRKLNPA 1196
Query: 800 QQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVE 858
++N+ I ++E A+ AL +L LL V++ + + ++++ Q K+ + V
Sbjct: 1197 EKNYSIVERECLAIKWALEALRYYLLGRQFVLI--TDHSPLTWM-SQAKEKN----ARVT 1249
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
+ FL Q++ + + G AD+LSRS +
Sbjct: 1250 RWFLSLQNFNFKV--EHRAGRLQGNADALSRSYCM 1282
>gi|270006313|gb|EFA02761.1| hypothetical protein TcasGA2_TC008494 [Tribolium castaneum]
Length = 1453
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 80/187 (42%), Gaps = 6/187 (3%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QE+L+ +++ +S + + S + LV K NG R ++ + LN L
Sbjct: 604 VQELLDNNIVR--ESESNYCSPVLLVKKKNGEQRLCIDYRKLNAQTVKDNHPLPRVDDQI 661
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
LQ G Y S+DL Y +P+ +++ + +PFGL AP+ F
Sbjct: 662 DRLQGGVYFTSLDLRSGYHQIPLSEESKKYTSFVTPFGQYEYNRVPFGLTNAPRTFQRFM 721
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N +L+ VYLDD LL +D + IL S G +NL+K S
Sbjct: 722 N---KILKPARENAAVYLDDVLLHAKDVNEALQNLQKVFEILRSEGLTLNLKKCSFLMTS 778
Query: 660 VLQFLGI 666
V FLG
Sbjct: 779 VT-FLGF 784
>gi|440551080|gb|AGC11913.1| Pol [Feline foamy virus]
Length = 1156
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 116/255 (45%), Gaps = 21/255 (8%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
H+ + + I ++L+ GVL + +ST + ++ VPK NG R VL+ + +N+
Sbjct: 166 HINPKAKPDIQIVINDLLKQGVLIQKESTMN--TPVYPVPKPNGRWRMVLDYRAVNKVTP 223
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ + I L KG Y +IDLS ++ PI A ++ G T LP
Sbjct: 224 LIAVQNQHSYGILGSLFKGRYKTTIDLSNGFWAHPIVPEDYWITAFTWQGKQYCWTVLPQ 283
Query: 587 GLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
G +P F V LL +G+ V VY+DD + + + ++ + L G
Sbjct: 284 GFLNSPGLFTGD---VVDLL--QGIPNVEVYVDDVYISHDSEKEHLEYLEILFNRLNEAG 338
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQL--TLGNILRTLLASKTWNLDSA 703
+IV+L+KS+++ + ++ FLG E + L T L + A T L
Sbjct: 339 YIVSLKKSNIANS-IVDFLGF--------QITNEGRGLTDTFKEKLENVTAPTT--LKQL 387
Query: 704 RSLLGYLSFASFVIP 718
+S+LG L+FA IP
Sbjct: 388 QSILGLLNFARNFIP 402
>gi|7522108|pir||T29097 pro-pol-dUTPase polyprotein - murine endogenous retrovirus ERV-L
(fragment)
gi|2065210|emb|CAA73251.1| Pro-Pol-dUTPase polyprotein [Mus musculus]
Length = 1182
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 101/201 (50%), Gaps = 12/201 (5%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ ++ I+++ + GV+ + +T+ F S ++ V K +G R ++ + LNQ ++P ++
Sbjct: 169 AEITATIKDLKDAGVV--VPTTSPFNSPIWPVQKTDGSWRMTVDYRKLNQVVTPIAAAVP 226
Query: 534 NHFRIPSFLQK-----GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGL 588
+ + S L++ G + +IDL+ A+F VP+ HQ+ +A S+ G T LP
Sbjct: 227 D---VVSLLEQINTSPGTWYAAIDLANAFFSVPVHKDHQKQIAFSWQGQQYTFTVLPQVY 283
Query: 589 ATAPQAFASLSNW-VASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
+P +L + L + + +V Y+DD +LV + + V+ + GW
Sbjct: 284 INSPALCHNLVRRDLDRLDLPQSITLVHYIDDIMLVGPSEQEVATTLDSLVTHMRIRGWE 343
Query: 648 VNLQKSSLSPAPVLQFLGIMW 668
+N K P+ ++FLG+ W
Sbjct: 344 INPTKIQ-GPSTSVKFLGVQW 363
>gi|270016118|gb|EFA12566.1| hypothetical protein TcasGA2_TC004195 [Tribolium castaneum]
Length = 988
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 99/222 (44%), Gaps = 27/222 (12%)
Query: 451 IPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNG 510
IPF +P VP + A+ IQEML+ GV+KR + + S + +V K +G
Sbjct: 235 IPFLKRPYPVPFA--------LRPAVDATIQEMLDLGVIKR--EASPYASPMTVVKKKDG 284
Query: 511 GTRPVLNLKGLNQFL------SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKT 564
R L+ + +N + P L+ F YM +IDL +Y+ +P+
Sbjct: 285 TVRICLDARMINSKMIADCESPPAADELLRRF------HGVRYMSTIDLRSSYWQIPLSP 338
Query: 565 THQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASLLRSRGMRVVVYLDDFLLV 623
+++ A YNG LPFGL TA +F+ + V + +R VV Y+DD L+
Sbjct: 339 ESRQYTAFLYNGRSYTYQVLPFGLKTAVGSFSRAMDVVLGTEVRE---FVVNYIDDLLVA 395
Query: 624 NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG 665
++ + L +NL+KS+ V +FLG
Sbjct: 396 SETLNEHLEHLRQVFEKLKQARMTINLEKSNFIQKEV-KFLG 436
>gi|301623889|ref|XP_002941244.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1593
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 115/517 (22%), Positives = 197/517 (38%), Gaps = 80/517 (15%)
Query: 406 SSPVNPPADSRIGAELVGGRLRRFVDAWIRLGAPA-PLVRIVSGYAIPFSAKP-PLVPLC 463
S V P D++ +V L F D + GA P R+ Y P P +P
Sbjct: 464 SMAVAPATDTQFA--VVPNYLHEFKDVFDEKGADTLPPHRV---YDCPIDLLPGAAIPFG 518
Query: 464 SLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQ 523
+ L+ P + +I E LE G + S G + +F V K + RP ++ + LN
Sbjct: 519 RIYPLSEPELIVLKKYIDENLEKGFICPSTSPAG--AGIFFVEKKDHSLRPCIDYRQLNL 576
Query: 524 FLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTC 583
++ L + L++ +DL AY V I+ + A
Sbjct: 577 ITVKNRYPLPLIPELFQNLREAKIFSKLDLRGAYNLVRIRKGDEWKTAFRSRYGHFEYLV 636
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
+PFGL AP F L N + + V+VYLDD L+ + + EI + S L
Sbjct: 637 MPFGLCNAPATFQHLVNDIFRDFLDQF--VIVYLDDILVFSSSIKEHEIHMRKVFSRLRE 694
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL---RTLLASKTWNL 700
L+K + +FLG + ++ IL + + A W +
Sbjct: 695 HSLFAKLEKCEFHKTSI-EFLGFV---------------ISTDGILMDPKKVSAVLNWPV 738
Query: 701 DSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN--PAVLPKLEW--- 754
++R ++ F++F RR R S + ++PI + + +W
Sbjct: 739 PTSRKATQRFIGFSNFY--------RRFIRNFSKI------ISPITDLTSTTKRFQWSSQ 784
Query: 755 ------WLNALPLSSPIFPR---QVQHFISTDASDLGWGSQVDS-----------SFLSG 794
L L S+PI + + DAS+ G+ + +F S
Sbjct: 785 AQSAFDKLKELFTSAPILKHPDPSLPFVVEVDASETAVGAVLSQRSGLQNFLHPVAFFSK 844
Query: 795 LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLS 852
S ++N+ ++ +E+ A+ A L+ S +++ SD++ + YLR +
Sbjct: 845 KLSPSEKNYDVSDRELLAIKVAFEEWRQYLEGSSHPILIFSDHRN-LEYLR-----TAKR 898
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L + L + HI + PG+ N AD+LSR
Sbjct: 899 LRPRQARWALFFSRFNFHI--TYRPGSQNHKADALSR 933
>gi|119657147|gb|ABL86702.1| putative pol protein [Adineta vaga]
Length = 1302
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 107/461 (23%), Positives = 193/461 (41%), Gaps = 69/461 (14%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I ++L+ GV++ +ST+ + S + LV K +G R ++ + LN + F + I
Sbjct: 399 INKLLKQGVIE--ESTSPWSSPIVLVRKKDGSVRFCIDYRKLNAITTKDAFPIPRIDDIF 456
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L + Y +ID YF V + + A S T LP G+ P AF +
Sbjct: 457 DHLSQTGYYTTIDFKSGYFQVGLDARDRPKTAFSTRDQHYQFTVLPQGVTNGPPAFQRIV 516
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ + L +R + YLDD ++ + D ++ + L + L + +N+ K ++
Sbjct: 517 SQI--LGPTRWKYALAYLDDVIIYSPTFDQHLVHLDDIL--NRLHEANFRLNVGKCHIAQ 572
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNI------LRTLLASKTWNLDSARSLLGYLS 711
+ +LG + GNI +R LL +T +A+ ++
Sbjct: 573 TSI-DYLG---------------HHIEHGNIKPNADNIRALL--ETPQPATAKEAFRFVK 614
Query: 712 FASFV---IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---------WLNAL 759
A + IP + ++ + + A + + P P L E N L
Sbjct: 615 AAEYYRKFIPKFSMIAQPLYKYAPTTKEQRSNKMPAVPIQLLDDELHAFHELKQILTNDL 674
Query: 760 PLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------FLSGLWSREQQNWHINKKEM 810
L P + I TDAS +G G+ Q S+ +LS ++ Q NW ++E
Sbjct: 675 ILRIP--DENLPFKIQTDASKIGIGAVLMQTHSNGDLPVAYLSKKFTTTQLNWPATEQEC 732
Query: 811 FAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
+A+ A+ L ++++D++ ++ + +Q L S+ E+ L Q ++
Sbjct: 733 YAIIHAIEKWHKYLDGREFIIETDHKPLLPFNLKQ------QLNSKCERWRLKLQQYKFT 786
Query: 871 ILAQFIPGAYNSVADSLSRSKS------LPDWHLSRSATEQ 905
I ++I G +N+VAD LS S S L D+ +RS T Q
Sbjct: 787 I--RYIKGKHNTVADYLSPSPSDNASGDLDDYVPTRSQTTQ 825
>gi|365991279|ref|XP_003672468.1| hypothetical protein NDAI_0K00353 [Naumovozyma dairenensis CBS 421]
gi|343771244|emb|CCD27225.1| hypothetical protein NDAI_0K00353 [Naumovozyma dairenensis CBS 421]
Length = 1249
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 100/444 (22%), Positives = 180/444 (40%), Gaps = 82/444 (18%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I+++L+ G + + S + + S + LV K +G R ++ + LN+ F L + +
Sbjct: 320 IKDLLDKGFI--VPSKSSYSSPIVLVTKHDGSYRLCVDYRELNKVTVKDPFPLPHVDELL 377
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ ++DL Y +P+ T A T +PFGL AP FA
Sbjct: 378 GKVGSASVFTTLDLHSGYHQIPMNPTDMDKTAFVTPTGKYEYTVMPFGLVNAPSTFAR-- 435
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
++A L R V VYLDD L+ + D + +S L I +K + +
Sbjct: 436 -YMADLFRDLEF-VNVYLDDILIFSNDLESHWKHIDVVLSRLDQEKLIAKKKKCHFAQSE 493
Query: 660 VLQFLGIMWDPH-----------LDRMWLP---EDKQLTLG--NILRTLLASKTWNLDSA 703
V QFLG + + ++R +P ++ Q +G N R + D +
Sbjct: 494 V-QFLGYIIGRNKIKPVQEKCEAINRFPVPKTIKEAQRFVGMINYYRKFIK------DCS 546
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQASLLR--LGAPHLTPINPAVLPKLEWWLNALPL 761
R + + F S +P G L A+L R + P L P
Sbjct: 547 RKVRPLVDFISRNVPWGDLQDDAF---ATLKRDLMSEPLLVP------------------ 585
Query: 762 SSPIFPRQVQHFISTDASDLGWGSQVDS----------SFLSGLWSREQQNWHINKKEMF 811
F R ++ ++TDAS G G+ ++ S+ S + Q+ + + E+
Sbjct: 586 ----FKRDAEYRLTTDASMDGLGAVLEEVADNKVLGVVSYYSKSLNETQRRYPPGELELM 641
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKS----LSLLSEVEKIFLLSQDW 867
A+ + L +L ++++D+ +++S ++ + L LSE + F L+
Sbjct: 642 AIIEGLEHFKYMLHGKHFVLRTDHISLLSIQNQKEPARRVQRWLDTLSEFD--FSLA--- 696
Query: 868 RIHILAQFIPGAYNSVADSLSRSK 891
++PG N VAD++SR+K
Sbjct: 697 -------YLPGPKNVVADAISRAK 713
>gi|301607281|ref|XP_002933262.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1065
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 73/159 (45%), Gaps = 4/159 (2%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM +I E L+ G ++ S G + F V K +G RP ++ +GLN+
Sbjct: 215 LSLPEAQAMKEYINENLQRGFIRPSSSPAG--AGFFFVGKKDGSLRPCIDYRGLNKITVK 272
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + ++ + +DL AY + I+ + A + +PFG
Sbjct: 273 NRYPLPLISELFDQVRNAKFFTKLDLRGAYNLIRIRVGDEWKTAFNTRDGHYEYLVMPFG 332
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
L AP F N + L G+ VVVYLDD L+ + +
Sbjct: 333 LCNAPAVFQEFVNDIFRDL--LGLFVVVYLDDILIFSSN 369
>gi|378788725|gb|AFC40212.1| polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 86/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTTEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQWVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D H M + E + L +++
Sbjct: 272 DEHF--MKIEESRWKELRTVIK 291
>gi|308468863|ref|XP_003096672.1| hypothetical protein CRE_29108 [Caenorhabditis remanei]
gi|308241619|gb|EFO85571.1| hypothetical protein CRE_29108 [Caenorhabditis remanei]
Length = 1384
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 97/430 (22%), Positives = 186/430 (43%), Gaps = 37/430 (8%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSL 532
+ +S + + + GV+ +D + + + + LV K NG R + GLN + + L
Sbjct: 500 TTVSDELDRLQQAGVISPVDHSE-WAAPIVLVKKKNGSLRMCADFSTGLNDAIEQHQHPL 558
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I S L G Y IDL++AY + I ++ L ++ + + LPFG+ +AP
Sbjct: 559 PTADDIFSTLNGGKYFSQIDLAEAYLQIEIDEQAKQMLCINTHRGLYRYNRLPFGVKSAP 618
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
+F + + + S L V YLDD ++ + K +S + G V ++K
Sbjct: 619 GSFQQIMDSMTSGLDG----VAAYLDDIIITGSSVAEHNQRLKTVMSRIQDFGLRVRIEK 674
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
+ +P + FLG + D R P+ ++++ +R + + N RS LG + F
Sbjct: 675 CTF-LSPKITFLGFIIDKDGRR---PDPEKVS---AIRHMPVPQ--NESQVRSFLGLIQF 725
Query: 713 -ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ 771
SFV + +L R A + T +++ L++ L + P+ +
Sbjct: 726 YGSFVKELFKL---RPPLDALTKKDVEFKWTSECQNAFDRIKQILHSDLLLTHYDPK-LP 781
Query: 772 HFISTDASDLGWGSQVDSSF----------LSGLWSREQQNWHINKKEMFAVHQALSLNL 821
++ DAS G G+ + F +S + Q+N+ +KE F + A++
Sbjct: 782 IIVAADASQYGIGAVISHQFPDGSEKAIYHISKALTAPQRNYSQIEKEAFGLITAVTKFH 841
Query: 822 PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHILAQFIPGA 879
+ +++D++ ++S + G S + +++ I LL+ D+ I + G
Sbjct: 842 RFIHGRHFTLRTDHKPLLSIFGEKKGIPVYS-ANRLQRWAIILLNYDFNIEYINTHDFGQ 900
Query: 880 YNSVADSLSR 889
AD+LSR
Sbjct: 901 ----ADALSR 906
>gi|440551074|gb|AGC11908.1| Pol [Feline foamy virus]
Length = 1156
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 116/255 (45%), Gaps = 21/255 (8%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
H+ + + I ++L+ GVL + +ST + ++ VPK NG R VL+ + +N+
Sbjct: 166 HINPKAKPDIQIVINDLLKQGVLIQKESTMN--TPVYPVPKPNGRWRMVLDYRAVNKVTP 223
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ + I L KG Y +IDLS ++ PI A ++ G T LP
Sbjct: 224 LIAVQNQHSYGILGSLFKGRYKTTIDLSNGFWAHPIVPEDYWITAFTWQGKQYCWTVLPQ 283
Query: 587 GLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
G +P F V LL +G+ V VY+DD + + + ++ + L G
Sbjct: 284 GFLNSPGLFTGD---VVDLL--QGIPNVEVYVDDVYISHDSEKEHLEYLEILFNRLNEAG 338
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQL--TLGNILRTLLASKTWNLDSA 703
+IV+L+KS+++ + ++ FLG E + L T L + A T L
Sbjct: 339 YIVSLKKSNIANS-IVDFLGF--------QITNEGRGLTDTFKEKLENVTAPTT--LKQL 387
Query: 704 RSLLGYLSFASFVIP 718
+S+LG L+FA IP
Sbjct: 388 QSILGLLNFARNFIP 402
>gi|382948100|gb|AFG33165.1| DNA polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 86/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D + M + E + L +++
Sbjct: 272 DENF--MKIEESRWKELKTVIK 291
>gi|341881053|gb|EGT36988.1| hypothetical protein CAEBREN_09040, partial [Caenorhabditis brenneri]
Length = 2341
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 91/192 (47%), Gaps = 12/192 (6%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHF-R 537
+ EML +++ ST+ F S + LV K +G R + +GLN ++ K+ LI
Sbjct: 1415 QVNEMLSMDIIE--PSTSTFTSPIVLVKKKDGTFRFTTDFRGLNA-VTVKQIYLIPLISD 1471
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
I G + ++DL Q +F +P++ + + S +P GL AP F S
Sbjct: 1472 IVDLASHGKFYTNLDLVQGFFQIPLRKQDRPLTSFSTPNGTFQYKRMPMGLCGAPHTFQS 1531
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
+ + R+ R+ YLDD L+V+ + + +I+ L I ++G+ + +QK
Sbjct: 1532 AVQQLQKMTRA---RLFCYLDDLLIVSDSIEQHLTDIEEVLENII--TIGFKIKIQKCKF 1586
Query: 656 SPAPVLQFLGIM 667
S V FLG++
Sbjct: 1587 SQREVT-FLGLL 1597
>gi|1711029|gb|AAB38321.1| FeSFV polymerase [Feline foamy virus]
gi|1805314|gb|AAC58531.1| reverse transcriptase [Feline foamy virus]
Length = 1156
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 115/255 (45%), Gaps = 21/255 (8%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
H+ + + I ++L+ GVL + +ST + ++ VPK NG R VL+ + +N+
Sbjct: 166 HINPKAKPDIQIVINDLLKQGVLIQKESTMN--TPVYPVPKPNGRWRMVLDYRAVNKVTP 223
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ + I L KG Y +IDLS ++ PI A ++ G T LP
Sbjct: 224 LIAVQNQHSYGIIGSLFKGKYKTTIDLSNGFWAHPIVPEDYWITAFTWQGKQYCWTVLPQ 283
Query: 587 GLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
G +P F V LL +G+ V VY+DD + + + + + L G
Sbjct: 284 GFLNSPGLFTGD---VVDLL--QGIPNVEVYVDDVYISHDSEKEHLEYLDILFNRLKEAG 338
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQL--TLGNILRTLLASKTWNLDSA 703
+IV+L+KS+++ + ++ FLG E + L T L + A T L
Sbjct: 339 YIVSLKKSNIANS-IVDFLGF--------QITNEGRGLTDTFKEKLENITAPTT--LKQL 387
Query: 704 RSLLGYLSFASFVIP 718
+S+LG L+FA IP
Sbjct: 388 QSILGLLNFARNFIP 402
>gi|391331501|ref|XP_003740183.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1388
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 93/421 (22%), Positives = 178/421 (42%), Gaps = 39/421 (9%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
IQEM + GV+ R T ++ + +V K +G R L+ + +N + + F L +
Sbjct: 474 IQEMEKRGVIARTTRPTDYVLPMVVVSKSDGSFRICLDPRYINPHIKRRTFPLPIAQELL 533
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L Y +D A++H+ + + LPFG+ A + F+ +
Sbjct: 534 MQLAGAQYYSVVDGDAAFWHLKLDQESSDLCTFATPWGNYQFKRLPFGIVDASERFSEVI 593
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
+ + + L+ V +DDF + + + + +S +G +N +K P
Sbjct: 594 HALFADLKG----VANCVDDFPIHGRTREEHDKNLEAFLSRCREVGLKLNEKKFQYC-QP 648
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
++FLG + D + +P+ + I L S + R LG +++ + IP
Sbjct: 649 SVKFLGHVVGK--DGIAIPDSR------IDAILKMSPPKDQKEVRQFLGMINYVAKFIPN 700
Query: 720 GRLHS---RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIST 776
+ R++ R + P +L+ L P+ + P + +S
Sbjct: 701 AANITAPLRQLTRNDTDFTWN-----PGAEDAFSRLKHALTKAPVLAHFDP-TCETTLSV 754
Query: 777 DASDLGWGS-----QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMV 831
DAS G G+ Q +F S + Q + +KE+ A+ A +Q V +
Sbjct: 755 DASSYGIGAVLIQNQRPVAFSSTSLTETQSRYAQIEKELLAIVYACEHFKFFIQGQQVTI 814
Query: 832 QSDNQTVVSYLRRQGGTKSLSLLS-EVEKIF--LLSQDWRIHILAQFIPGAYNSVADSLS 888
++D+ +++ ++ K L+LLS ++K+ LL D+++ Q+IPG Y +AD+LS
Sbjct: 815 ETDHHPLIAIVK-----KELALLSPRLQKMMLRLLRFDFKL----QYIPGKYMFIADALS 865
Query: 889 R 889
R
Sbjct: 866 R 866
>gi|34392238|emb|CAD92800.1| pol protein [Feline foamy virus]
Length = 1156
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 115/255 (45%), Gaps = 21/255 (8%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
H+ + + I ++L+ GVL + +ST + ++ VPK NG R VL+ + +N+
Sbjct: 166 HINPKAKPDIQIVINDLLKQGVLIQKESTMN--TPVYPVPKPNGRWRMVLDYRAVNKVTP 223
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ + I L KG Y +IDLS ++ PI A ++ G T LP
Sbjct: 224 LIAVQNQHSYGIIGSLFKGKYKTTIDLSNGFWAHPIVPEDYWITAFTWQGKQYCWTVLPQ 283
Query: 587 GLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
G +P F V LL +G+ V VY+DD + + + + + L G
Sbjct: 284 GFLNSPGLFTGD---VVDLL--QGIPNVEVYVDDVYISHDSEKEHLEYLDILFNRLKEAG 338
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQL--TLGNILRTLLASKTWNLDSA 703
+IV+L+KS+++ + ++ FLG E + L T L + A T L
Sbjct: 339 YIVSLKKSNIANS-IVDFLGF--------QITNEGRGLTDTFKEKLENITAPTT--LKQL 387
Query: 704 RSLLGYLSFASFVIP 718
+S+LG L+FA IP
Sbjct: 388 QSILGLLNFARNFIP 402
>gi|365984847|ref|XP_003669256.1| hypothetical protein NDAI_0C03530 [Naumovozyma dairenensis CBS 421]
gi|343768024|emb|CCD24013.1| hypothetical protein NDAI_0C03530 [Naumovozyma dairenensis CBS 421]
Length = 1263
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 100/444 (22%), Positives = 180/444 (40%), Gaps = 82/444 (18%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I+++L+ G + + S + + S + LV K +G R ++ + LN+ F L + +
Sbjct: 320 IKDLLDKGFI--VPSKSSYSSPIVLVTKHDGSYRLCVDYRELNKVTVKDPFPLPHVDELL 377
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ ++DL Y +P+ T A T +PFGL AP FA
Sbjct: 378 GKVGSASVFTTLDLHSGYHQIPMNPTDMDKTAFVTPTGKYEYTVMPFGLVNAPSTFAR-- 435
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
++A L R V VYLDD L+ + D + +S L I +K + +
Sbjct: 436 -YMADLFRDLEF-VNVYLDDILIFSNDLESHWKHIDVVLSRLDQEKLIAKKKKCHFAQSE 493
Query: 660 VLQFLGIMWDPH-----------LDRMWLP---EDKQLTLG--NILRTLLASKTWNLDSA 703
V QFLG + + ++R +P ++ Q +G N R + D +
Sbjct: 494 V-QFLGYIIGRNKIKPVQEKCEAINRFPVPKTIKEAQRFVGMINYYRKFIK------DCS 546
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQASLLR--LGAPHLTPINPAVLPKLEWWLNALPL 761
R + + F S +P G L A+L R + P L P
Sbjct: 547 RKVRPLVDFISRNVPWGDLQDDAF---ATLKRDLMSEPLLVP------------------ 585
Query: 762 SSPIFPRQVQHFISTDASDLGWGSQVDS----------SFLSGLWSREQQNWHINKKEMF 811
F R ++ ++TDAS G G+ ++ S+ S + Q+ + + E+
Sbjct: 586 ----FKRDAEYRLTTDASMDGLGAVLEEVADNKVLGVVSYYSKSLNETQRRYPPGELELM 641
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKS----LSLLSEVEKIFLLSQDW 867
A+ + L +L ++++D+ +++S ++ + L LSE + F L+
Sbjct: 642 AIIEGLEHFKYMLHGKHFVLRTDHISLLSIQNQKEPARRVQRWLDTLSEFD--FSLA--- 696
Query: 868 RIHILAQFIPGAYNSVADSLSRSK 891
++PG N VAD++SR+K
Sbjct: 697 -------YLPGPKNVVADAISRAK 713
>gi|382948094|gb|AFG33162.1| DNA polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 86/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D + M + E + L +++
Sbjct: 272 DENF--MKIEESRWKELRTVIK 291
>gi|18450271|ref|NP_569153.1| poplyprotein [Citrus yellow mosaic virus]
gi|16416939|gb|AAL18495.1|AF347695_3 unknown [citrus yellow mosaic virus]
Length = 1983
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 105/456 (23%), Positives = 185/456 (40%), Gaps = 56/456 (12%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
L+H+ + + H++ +L+ G ++ S + + +V G G R
Sbjct: 1401 LKHVTPQMEESFRKHVEALLKIGAIR--PSKSRHRTTAIIVNSGTSIDPITGKEVKGKER 1458
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYFHVPIKTTHQRFL 570
V N K LN + ++SL I + LQ KG + S DL + V + +
Sbjct: 1459 MVFNYKRLNDLTNKDQYSLPG---IQTILQRLKGSTIFSKFDLKSGFHQVAMHPDSIEWT 1515
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
A + +PFGL AP F + + + VY+DD L+ ++ +
Sbjct: 1516 AFWVPSGLYEWLVMPFGLKNAPAIFQRKMD---HCFKGTEAFIAVYIDDILVFSKTEQDH 1572
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
E ++ ++I G I++ K ++ A + +FLG + L ++ Q + L
Sbjct: 1573 EKHLQIMLAICQKNGLILSPTKMKIAQAEI-EFLGAIIHKGLIKL------QPHIVQKLL 1625
Query: 691 TLLASKTWNLDSARSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAVL 749
T + + RS LG L++A IP MGRL S A + G + + A++
Sbjct: 1626 TFTNKQLEEVKGLRSWLGLLNYARSYIPHMGRLLSPLY---AKVSPTGERRMNRQDWALI 1682
Query: 750 PKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG-------SQVD-------SSFLSG 794
K+ + LP L P P I TD GWG +Q D ++ SG
Sbjct: 1683 DKIRAQVQNLPALELP--PADCFIIIETDGCMDGWGGVCKWKVAQYDPRSSERVCAYASG 1740
Query: 795 LWSREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSL 853
++ + E+ AV +L + + L S + +++D Q ++S+ + K +
Sbjct: 1741 KFNPPKSTI---DAEIHAVMNSLNNFKIYYLDKSSLCLRTDCQAIISFFNKSNVNKPSRV 1797
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
FL ++I + I G N +AD+LSR
Sbjct: 1798 RWIAFTDFLTGLGIPVNI--EHIDGKNNHLADALSR 1831
>gi|34392233|emb|CAD92796.1| pol protein [Feline foamy virus]
Length = 1156
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 115/255 (45%), Gaps = 21/255 (8%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
H+ + + I ++L+ GVL + +ST + ++ VPK NG R VL+ + +N+
Sbjct: 166 HINPKAKPDIQIVINDLLKQGVLIQKESTMN--TPVYPVPKPNGRWRMVLDYRAVNKVTP 223
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ + I L KG Y +IDLS ++ PI A ++ G T LP
Sbjct: 224 LIAVQNQHSYGIIGSLFKGKYKTTIDLSNGFWAHPIVPEDYWITAFTWQGKQYCWTVLPQ 283
Query: 587 GLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
G +P F V LL +G+ V VY+DD + + + + + L G
Sbjct: 284 GFLNSPGLFTGD---VVDLL--QGIPNVEVYVDDVYISHDSEKEHLEYLDILFNRLKEAG 338
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQL--TLGNILRTLLASKTWNLDSA 703
+IV+L+KS+++ + ++ FLG E + L T L + A T L
Sbjct: 339 YIVSLKKSNIANS-IVDFLGF--------QITNEGRGLTDTFKEKLENITAPTT--LKQL 387
Query: 704 RSLLGYLSFASFVIP 718
+S+LG L+FA IP
Sbjct: 388 QSILGLLNFARNFIP 402
>gi|382948096|gb|AFG33163.1| DNA polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 86/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D + M + E + L +++
Sbjct: 272 DENF--MKIEESRWKELRTVIK 291
>gi|32812829|emb|CAD67562.1| polymerase [Simian foamy virus-orangutan]
Length = 1145
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 83/369 (22%), Positives = 172/369 (46%), Gaps = 27/369 (7%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
++ + I ++L+ GVL + +S + ++ VPK +G R VL+ + +N+ +
Sbjct: 176 ESIQIVINDLLKQGVLIQQNSIMN--TPVYPVPKPDGRWRMVLDYREVNKTIPLIAAQNQ 233
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ I + + +G Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 234 HSAGILASIYRGTYKTTLDLANGFWAHPITPNSYWLTAFTWQGKQHCWTRLPQGFLNSPA 293
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F + V L++ V VY+DD L + DP+ + + IL G++V+L+KS
Sbjct: 294 LFTAD---VVDLMKHIP-NVQVYVDDLYLSHDDPQEHLQVLQQVLHILHDAGYVVSLKKS 349
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
+++ V++FLG ++ + + LT + L S NL +S+LG ++FA
Sbjct: 350 AIA-QKVVEFLGF----NITKT----GRGLTDAFKEKLLNISPPQNLKQLQSILGLMNFA 400
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQ--VQ 771
IP ++ R++ SL+ + N + +L+ + L + + R+ +
Sbjct: 401 RNFIPN---YAERVKPFYSLISTAKSNNILWNDELTSQLQELITLLNQADNLEERKPTTR 457
Query: 772 HFISTDASD-------LGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
I ++S GS+ +++ ++S+ ++ + + +K + +H+AL + L
Sbjct: 458 LIIKVNSSSHAGYIRYYNEGSKKPILYINYVFSKAEEKFSMLEKLLTTLHKALIKAVDLA 517
Query: 825 QSSVVMVQS 833
+ +MV S
Sbjct: 518 MGTEIMVYS 526
>gi|365984457|ref|XP_003669061.1| hypothetical protein NDAI_0C01570 [Naumovozyma dairenensis CBS 421]
gi|343767829|emb|CCD23818.1| hypothetical protein NDAI_0C01570 [Naumovozyma dairenensis CBS 421]
Length = 1247
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 100/444 (22%), Positives = 180/444 (40%), Gaps = 82/444 (18%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I+++L+ G + + S + + S + LV K +G R ++ + LN+ F L + +
Sbjct: 320 IKDLLDKGFI--VPSKSSYSSPIVLVTKHDGSYRLCVDYRELNKVTVKDPFPLPHVDELL 377
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ ++DL Y +P+ T A T +PFGL AP FA
Sbjct: 378 GKVGSASVFTTLDLHSGYHQIPMNPTDMDKTAFVTPTGKYEYTVMPFGLVNAPSTFAR-- 435
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
++A L R V VYLDD L+ + D + +S L I +K + +
Sbjct: 436 -YMADLFRDLEF-VNVYLDDILIFSNDLESHWKHIDVVLSRLDQEKLIAKKKKCHFAQSE 493
Query: 660 VLQFLGIMWDPH-----------LDRMWLP---EDKQLTLG--NILRTLLASKTWNLDSA 703
V QFLG + + ++R +P ++ Q +G N R + D +
Sbjct: 494 V-QFLGYIIGRNKIKPVQEKCEAINRFPVPKTIKEAQRFVGMINYYRKFIK------DCS 546
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQASLLR--LGAPHLTPINPAVLPKLEWWLNALPL 761
R + + F S +P G L A+L R + P L P
Sbjct: 547 RKVRPLVDFISRNVPWGDLQDDAF---ATLKRDLMSEPLLVP------------------ 585
Query: 762 SSPIFPRQVQHFISTDASDLGWGSQVDS----------SFLSGLWSREQQNWHINKKEMF 811
F R ++ ++TDAS G G+ ++ S+ S + Q+ + + E+
Sbjct: 586 ----FKRDAEYRLTTDASMDGLGAVLEEVADNKVLGVVSYYSKSLNETQRRYPPGELELM 641
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKS----LSLLSEVEKIFLLSQDW 867
A+ + L +L ++++D+ +++S ++ + L LSE + F L+
Sbjct: 642 AIIEGLEHFKYMLHGKHFVLRTDHISLLSIQNQKEPARRVQRWLDTLSEFD--FSLA--- 696
Query: 868 RIHILAQFIPGAYNSVADSLSRSK 891
++PG N VAD++SR+K
Sbjct: 697 -------YLPGPKNVVADAISRAK 713
>gi|403168500|ref|XP_003889733.1| hypothetical protein PGTG_21580 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167529|gb|EHS63448.1| hypothetical protein PGTG_21580 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1022
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 102/475 (21%), Positives = 208/475 (43%), Gaps = 70/475 (14%)
Query: 470 TPVSSAMSLHIQEMLETGVLKRLDS-------TTGFLSRLF---------LVPKGNGGTR 513
TP + +L QE +E+ + K +++ T L + + V G+G R
Sbjct: 534 TPPNHQSALLAQEKIESSISKEIEAGRMYGPYTHAQLMKKYSFFRSNPLGAVVNGDGTVR 593
Query: 514 PV---------LNLKGLNQFLSPKKF--SLINHFRIPSFLQKGDYMISI---DLSQAYFH 559
P+ L + +N F+ + + + R+ FL+ + I + D +AY
Sbjct: 594 PINDLSFPHDNLQVPSVNSFVDKLDYVTTWDDFERVSRFLRNQEEPILLALFDWEKAYRQ 653
Query: 560 VPIKTTHQRFLAL-SYNGDVLAMTCLPFGLATAPQAFASLSN-WVASLLRSRGMRVVV-Y 616
+P + +L + +NG +L T + FG +F ++ W +L + V +
Sbjct: 654 IPTAKSQWAYLMVRDFNGGILIDTRIAFGGVAGCGSFGRPADAWKDLMLHEFDLITVFRW 713
Query: 617 LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQ-FLGIMWDPHLDRM 675
+DD L + + ++++ +A S LG V + SP Q ++G +W+ +
Sbjct: 714 VDDNLFIKRQDSTVDMEQIVARS--EELG--VKTNSTKYSPFKEEQKYIGFIWNAAKKTV 769
Query: 676 WLPEDKQLTLGNILRTLLASKT-WNLDSARSLLGYLSFASFVIPMGRLHSRRIQR--QAS 732
LP+DK+ ++ LA +T ++ + G L+ S+++P R + + R A
Sbjct: 770 RLPDDKKFQRIQQIKQFLAPETVFSFKQVEIMAGRLNHVSYLLPQLRCYLNSLYRWMNAW 829
Query: 733 LLRLGAPHLTPINPA-VLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSF 791
+ R H+ PA LE WL L + ++ + + D +++GW +S+
Sbjct: 830 VFR----HIELALPADARQDLEEWLTTL-----LCFKETRMIRNPDPTEIGWMGDASTSY 880
Query: 792 LSGL-----WSREQQNWHINK----KEMFAVHQALSLNLPLLQ--------SSVVMVQSD 834
G+ W++ Q H ++ K A + +++ L +L ++V +D
Sbjct: 881 GIGITIGQHWAQFQLTKHWDQGPEPKRDIAWLETVAIRLGILALKQLKIRPGKTLIVWTD 940
Query: 835 NQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
N T S + ++ +K+ ++ +E + I L + I I+++ + + N+VAD+LSR
Sbjct: 941 NTTTESVISKR-KSKNEAVNNEWKVIQKLLVEEEIDIVSRRVT-SQNNVADALSR 993
>gi|382948092|gb|AFG33161.1| DNA polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 86/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D + M + E + L +++
Sbjct: 272 DENF--MKIEESRWKELRTVIK 291
>gi|382948098|gb|AFG33164.1| DNA polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 86/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTEEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNAISHAVCSFLQELGIRINFDKTTPSPVNEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D + M + E + L +++
Sbjct: 272 DENF--MKIEESRWKELRTVIK 291
>gi|182239960|gb|ACB87150.1| polyprotein [Citrus yellow mosaic virus]
Length = 1976
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 108/457 (23%), Positives = 186/457 (40%), Gaps = 57/457 (12%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
L+H+ + + H++ +L+ G ++ S + + +V G G R
Sbjct: 1393 LKHVTPQMEGSFRKHVEALLKIGAIR--PSKSRHRTTAIIVNSGTSIDPLTGKEVKGKER 1450
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYFHVPIKTTHQRFL 570
V N K LN + ++SL I + LQ KG + S DL + V + +
Sbjct: 1451 MVFNYKRLNDLTNKDQYSLPG---IQTILQRLKGSTIFSKFDLKSGFHQVAMHPDSIEWT 1507
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
A + +PFGL AP F + + + VY+DD L+ ++ R
Sbjct: 1508 AFWVPSGLYEWLVMPFGLKNAPAVFQRKMD---HCFKGTEAFIAVYIDDILVFSKTEREH 1564
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
E ++ +SI G I++ K ++ A + +FLG + L ++ Q + L
Sbjct: 1565 EEHLQIMLSICQKNGLILSPTKMKIAQAEI-EFLGAIIHKGLIKL------QPHIVQKLL 1617
Query: 691 TLLASKTWNLDSARSLLGYL-SFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAV 748
T + + RS LG L ++A IP MGRL S A + G + + A+
Sbjct: 1618 TFTNKQLEEVKGLRSWLGLLINYARSYIPHMGRLLSPLY---AKVSPTGERRMNRQDWAL 1674
Query: 749 LPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG-------SQVDS-------SFLS 793
+ K+ + LP L P P I TD GWG +Q DS ++ S
Sbjct: 1675 IDKIRAQVQNLPALELP--PADCFIIIETDGCMDGWGGVCKWKLAQYDSRSSEKVCAYAS 1732
Query: 794 GLWSREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLS 852
G ++ + E+ AV +L + + L S + +++D Q ++S+ + K
Sbjct: 1733 GKFNPPKSTI---DAEIRAVMNSLNNFKIYYLDKSSLCLRTDCQAIISFFNKSNVNKPSR 1789
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ FL ++I + I G N +AD+LSR
Sbjct: 1790 VRWIAFTDFLTGLGIPVNI--EHIDGKNNHLADALSR 1824
>gi|224087774|ref|XP_002335127.1| predicted protein [Populus trichocarpa]
gi|222832905|gb|EEE71382.1| predicted protein [Populus trichocarpa]
Length = 909
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 104/451 (23%), Positives = 176/451 (39%), Gaps = 74/451 (16%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN----GGTRPVLNLKGLNQ------ 523
S I+E+L+ +++ S + S FLV K + G R V+N K LN
Sbjct: 496 DEFSKQIKELLDAKLIQ--PSKSPHFSPAFLVNKHSEQKRGKRRMVINYKKLNDHTIGDG 553
Query: 524 FLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTC 583
+L P+K L++ R S D ++ V + T Q A +
Sbjct: 554 YLLPRKDELLDQIRGKKIFS------SFDCKSGFWQVLLDETSQSLTAFTCPQGHFEWKV 607
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
+PFGL AP F N L + +Y+DD ++ +++ + +
Sbjct: 608 MPFGLKQAPSIFQRHMNETFMGLENF---CRIYVDDIIVFSENDKEHIEHVSQVLDRCKE 664
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA 703
+G I++ K+ L + +FLG++ +D+ L K + NI T SK +
Sbjct: 665 VGVILSKPKAQLFREKI-EFLGLI----IDKGKLQLQKHIG-ENI--TAFNSKITDRKQL 716
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSS 763
+ LG L++ S P ++ R QA L + + + A + K++ + LP
Sbjct: 717 QRFLGILNYISQFCP--KVAQIRQPLQAKLKKDAFWQWSDSDTAYVDKIKKAIKNLP--- 771
Query: 764 PIFPRQVQH-------FISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHIN 806
V H I TDASD WG + S + SG + ++N+H N
Sbjct: 772 -----PVHHPGPDEPLIIETDASDNYWGGILKSKQSDGLELICGYASGTFKPAEKNYHSN 826
Query: 807 KKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRR--QGGTKSLSLLSEVEKIFLLS 864
+KE+ A+ + L ++DN+ V +L G K L+
Sbjct: 827 EKEILALINTIKRFQVFLIPVQFTARTDNKNVFYFLHTNIHGSYKQGRLVR--------- 877
Query: 865 QDWRI-----HILAQFIPGAYNSVADSLSRS 890
W++ I+ + + G N AD LSR
Sbjct: 878 --WQLWLSYFDIIFEHVAGTNNVFADFLSRE 906
>gi|198385725|gb|ACH86211.1| unknown [Citrus yellow mosaic virus]
Length = 1490
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 108/467 (23%), Positives = 185/467 (39%), Gaps = 78/467 (16%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
L+H+ + + H++ +L+ G ++ S + + +V G G R
Sbjct: 908 LKHVTPQMEESFRRHVEALLKIGAIR--PSKSRHRTTAIIVNSGTSIDPITGKEVKGKER 965
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYFHVPIKTTHQRFL 570
V N K LN + ++SL I + LQ KG + S DL + V + +
Sbjct: 966 MVFNYKRLNDLTNKDQYSLPG---IQTILQRLKGSTIFSKFDLKSGFQQVAMHPDSVEWT 1022
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
A + +PFGL AP F + + + VY+DD L+ ++ +
Sbjct: 1023 AFWVPSGLYEWLVMPFGLKNAPAVFQRKMD---HCFKGTEAFIAVYIDDILVFSKTEKEH 1079
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR 690
E ++ +SI G I++ K ++ A + +FLG + L ++ Q + L
Sbjct: 1080 EEHLQIMLSICQRNGLILSPTKMKIAQAEI-EFLGAIIHNGLIKL------QPHIVQKLL 1132
Query: 691 TLLASKTWNLDSARSLLGYLSFASFVIP-MGRLHS-----------RRIQRQASLLRLGA 738
T + + RS LG L++A IP MGRL S RR+ RQ
Sbjct: 1133 TFTNKQLEEVKGLRSWLGLLNYARSYIPHMGRLLSPLYAKVSPTGERRVNRQ-------- 1184
Query: 739 PHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG-------SQVD-- 788
+ A++ K+ + LP L P P I TD GWG +Q D
Sbjct: 1185 ------DWALIDKIRAQVQNLPALELP--PADCFIIIETDGCMDGWGGVCKWKIAQYDPR 1236
Query: 789 -----SSFLSGLWSREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYL 842
++ SG ++ + E+ AV +L S + L + +++D Q ++S+
Sbjct: 1237 SSERVCAYASGKFNPPKSTI---DAEIHAVMNSLNSFKIYYLDKPSLCLRTDCQAIISFF 1293
Query: 843 RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ K + FL ++I + I G N +AD+LSR
Sbjct: 1294 NKSNVNKPSRVRWIAFTDFLTGLGIPVNI--EHIDGKNNHLADALSR 1338
>gi|342319725|gb|EGU11672.1| Hypothetical Protein RTG_02458 [Rhodotorula glutinis ATCC 204091]
Length = 2138
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 156/382 (40%), Gaps = 53/382 (13%)
Query: 552 DLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGM 611
D A+ VPI +++L + ++G +CLP GL+ A + + + SR
Sbjct: 562 DAKDAFRQVPIHPDDRKWLVMQFDGKFFIDSCLPMGLSPATDIWGRTVDLLRLGAESRLK 621
Query: 612 RVVV-----------YLDDFLLVNQDPRILEIQGK-LAVSILGSLGWIVNLQKSSLSPAP 659
+ ++DD L+ R+ + LA + G ++ K +
Sbjct: 622 KTFCDNGGDLHALRNWVDDLALLASLGRLSPAEATALADEYYDTFGLVLAKAKKRKMWSS 681
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
+ F G+ +D + LP K+L + L + L G L+ +FVI
Sbjct: 682 IAVFQGLEFDLQNKILSLPNAKRLKALRKIDAFLQLSWITATAISELCGTLTHLAFVITE 741
Query: 720 GR--LHSRRIQRQASLLRLGAPHLTPI-NPAVLPKLEWWLNALPLSS------------- 763
GR L I L R P+L + + +L + WW AL LS
Sbjct: 742 GRFFLSPLYIFETPFLAR---PYLKRVPDDDLLKAVRWWREALTLSDDEETTLRDSDLLP 798
Query: 764 -----PIFPR--QVQHFISTDASDLGWGSQVDSSFLSGLW----SREQQNWHINKKEMFA 812
P+ P ++ + TDASD G G V+ S W S N +I+ E FA
Sbjct: 799 SRFSRPLRPNALSIELSLYTDASDSGVGVVVNGS--QAFWPLKPSWRDGNVNIDVPEAFA 856
Query: 813 VHQALSLNLP---LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF--LLSQDW 867
V + + + L + ++ + DN+TVV R+ ++L + + + +I+ L DW
Sbjct: 857 VELLVRMIVDANGALSNRLLQIYCDNETVVRSWRKN-RCRNLHINACLLRIYRLLAMHDW 915
Query: 868 RIHILAQFIPGAYNSVADSLSR 889
R+ + +++P N AD++SR
Sbjct: 916 RLEL--EYVPSEMNE-ADAVSR 934
>gi|432860371|ref|XP_004069523.1| PREDICTED: uncharacterized protein LOC101162523 [Oryzias latipes]
Length = 813
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 78/165 (47%), Gaps = 16/165 (9%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P AM +++E L+ G+++ S G + F V K +GG RP ++ +GLN
Sbjct: 377 LSPPEREAMDSYLRESLQAGLIRPSSSPAG--AGFFFVGKRDGGLRPCIDYRGLN----- 429
Query: 528 KKFSLINHFRIPSF-----LQKGDYMIS-IDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
K ++ N + +P L G M S +DL AY V ++ + A +
Sbjct: 430 -KITVRNTYPLPLLHTAFDLLSGARMFSRLDLRNAYHLVRVREGDEWKTAFNTPSGHFEY 488
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+PFGL AP F SL N V + ++ V VYLDD L + D
Sbjct: 489 LVMPFGLTNAPAVFQSLINDVLKDMLNKF--VFVYLDDILFFSPD 531
>gi|222574|dbj|BAA01607.1| 194K polypeptide [Rice tungro bacilliform virus]
gi|229052|prf||1817177A capsid protein
Length = 1675
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/393 (20%), Positives = 169/393 (43%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++K+ D T + F+V + R V N K LN + F++ +
Sbjct: 1202 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1261
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + +QK + DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1262 KISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1321
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + K+ + + +G +++ +KS
Sbjct: 1322 FQRFMQESFGDLKF----ALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGCVLSKKKSK 1377
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + N ++ +K L ++ LG L+
Sbjct: 1378 MFLKEV-EYLGVE---------IKEGKISLQPHIVNKIKKFDKNKLNTLKGLQAYLGLLN 1427
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PL P
Sbjct: 1428 YARGYIKDLSKLVGPLYKKTG---KNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDY 1484
Query: 770 VQHFISTDASDLGWG----------SQVDSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG S D+ ++G S E++ W E+ A+++A
Sbjct: 1485 I--IIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1542
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1543 LN-KFQIYLDKDFTIRTDCEAIVKGIKTEDYKK 1574
>gi|326673500|ref|XP_003199901.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1394
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 111/455 (24%), Positives = 187/455 (41%), Gaps = 76/455 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P S AM +I+E L G ++ ST+ + F V K +GG RP ++ + LN
Sbjct: 442 LSQPESEAMKQYIKEELSKGFIR--PSTSPASAGFFFVEKKDGGLRPCIDYRNLNAITCK 499
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
++ L +P+ L++ Y +DL AY + I+ + S +
Sbjct: 500 FRYPLP---LVPAALEQLRTAQYFTKLDLRSAYNLIRIRPGDEWKTGFSTCTGHYEYLVM 556
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILG 642
PFGL +P F S N + + + V+VY+DD L+ + + I ++ L I
Sbjct: 557 PFGLVNSPSVFQSFVNDIFRDMLHKW--VIVYIDDILIYSDSLEDHITHVRAVLKRLIDN 614
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
L ++K + LG + + + + D+Q + +L+ W
Sbjct: 615 KL--YAEVEKCEFHQTSI-SLLGYV----ISQEGVAMDEQ-KVNAVLK-------WPKPS 659
Query: 702 SARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW------- 754
+ + L +L FA+F RR R S + AP LT + +L+W
Sbjct: 660 TVKELQRFLGFANFY--------RRFIRNFST--IAAP-LTSLTKRSGKQLKWNTTAELA 708
Query: 755 --WLNALPLSSPIF--PRQVQHFI-STDASDLGWGS-----QVDSSFL--SGLWSRE--- 799
L ++PI P + FI DAS+ G G+ Q SS L +SR+
Sbjct: 709 FIHLKDRFTTAPILSHPNPDKPFIVEVDASNTGIGAILSQRQDASSVLHPCAYFSRKLNP 768
Query: 800 -QQNWHINKKEMFAVHQALSLNLPLLQSS----VVMVQSDNQTVVSYLRRQGGTKSLSLL 854
++N+ + +E+ A+ AL L+ + +VM N + YLR K L+
Sbjct: 769 AERNYDVGNRELLAIKAALEEWRHWLEGAKHEFIVMTDHKN---LEYLR---TAKRLNPR 822
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + PG+ N+ AD+LSR
Sbjct: 823 QARWALFFS----RFQFSVTYRPGSKNTKADALSR 853
>gi|308471722|ref|XP_003098091.1| hypothetical protein CRE_11332 [Caenorhabditis remanei]
gi|308269432|gb|EFP13385.1| hypothetical protein CRE_11332 [Caenorhabditis remanei]
Length = 1387
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/451 (21%), Positives = 186/451 (41%), Gaps = 83/451 (18%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSLIN 534
+S I+ + +TGV+ +D + + + + V K NG R + GLN + L
Sbjct: 506 VSTEIERLNQTGVISPVDHSE-WAAPVVAVKKKNGSIRLCADFSTGLNDAIESNNHPLPT 564
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + L G++ IDL++AY V + Q+ L ++ + + LPFG+ +AP
Sbjct: 565 ADDIFAKLNGGNFFTQIDLAEAYLQVEMDPDSQKLLVINTHLGLFTYNRLPFGVKSAPGI 624
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLV-----NQDPRILEIQGKLAVSILGSLGWIVN 649
F + + + + L V YLDD ++ + R+L++ G++ G+ +
Sbjct: 625 FQQIMDTMLNGLEG----VSTYLDDIIICGSTIEEHNERVLKVFGRIQ-----EYGFRIK 675
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPE-DKQLTLGNILRTLLASKTWNLDSARSLLG 708
++K S + +FLG + + R P+ +K L + N+ + N+ +S LG
Sbjct: 676 MEKCSFLMEEI-KFLGFIINKQGRR---PDPEKVLHIKNM------PEPTNVSQVKSFLG 725
Query: 709 YLSF-ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP------- 760
+ F FV +Q LR +LT A +W L
Sbjct: 726 LIQFYGQFV------------KQLFRLRQPLDNLT----AKDTDFKWTLECQKSFDTIKE 769
Query: 761 -LSSPIFPRQVQHF-------ISTDASDLGWGSQVDSSF----------LSGLWSREQQN 802
L S + + H+ ++ DAS G G+ + F +S S+ Q+N
Sbjct: 770 ILQSDLL---LTHYNPNLPIIVAADASQYGIGATISHRFPDGTEKTIYHISKTLSKTQRN 826
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVE---- 858
+ +KE F + A++ + +++D++ +++ GG K + + +
Sbjct: 827 YSQIEKEGFGLITAVTKFHKFIHGRKFTLRTDHKPLLTIF---GGKKGVPVYTANRLQRW 883
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
LL+ D+ I ++I D+LSR
Sbjct: 884 ATILLNYDFDI----EYINTKDFGQVDALSR 910
>gi|308483543|ref|XP_003103973.1| hypothetical protein CRE_02412 [Caenorhabditis remanei]
gi|308258630|gb|EFP02583.1| hypothetical protein CRE_02412 [Caenorhabditis remanei]
Length = 1473
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/451 (21%), Positives = 185/451 (41%), Gaps = 83/451 (18%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSLIN 534
+S I+ + +TGV+ +D + + + + V K NG R + GLN + L
Sbjct: 591 VSTEIERLNQTGVISPVDHSE-WAAPVVAVKKKNGSIRLCADFSTGLNDAIESNNHPLPT 649
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + L G++ IDL++AY V + Q+ L ++ + + LPFG+ +AP
Sbjct: 650 SDDIFAKLNGGNFFTQIDLAEAYLQVEMDPDSQKLLVINTHLGLFTYNRLPFGVKSAPGI 709
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLV-----NQDPRILEIQGKLAVSILGSLGWIVN 649
F + + + + L V YLDD ++ + R+L++ G++ G+ +
Sbjct: 710 FQQIMDTMLNGLEG----VFTYLDDIIICGSTIEEHNERVLKVFGRIQ-----EYGFRIK 760
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPE-DKQLTLGNILRTLLASKTWNLDSARSLLG 708
++K S + +FLG + + R P+ +K L + N+ + N+ +S LG
Sbjct: 761 MEKCSFLMEEI-KFLGFIINKQGRR---PDPEKVLHIKNM------PEPTNVSQVKSFLG 810
Query: 709 YLSF-ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP------- 760
+ F FV +Q LR +LT A +W L
Sbjct: 811 LIQFYGQFV------------KQLFRLRQPLDNLT----AKDTDFKWTLECQKSFDTIKE 854
Query: 761 -LSSPIFPRQVQHF-------ISTDASDLGWGSQVDSSFLSG----------LWSREQQN 802
L S + + H+ ++ DAS G G+ + F G S+ Q+N
Sbjct: 855 ILQSDLL---LTHYNPNLPIIVAADASQYGIGATISHRFPDGTEKTIYHIGKTLSKTQRN 911
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVE---- 858
+ +KE F + A++ + +++D++ +++ GG K + + +
Sbjct: 912 YSQIEKEGFGLITAVTKFHKFIHGRKFTLRTDHKPLLTIF---GGKKGVPVYTANRLQRW 968
Query: 859 KIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
LL+ D+ I ++I D+LSR
Sbjct: 969 ATILLNYDFDI----EYINTKDFGQVDALSR 995
>gi|189236881|ref|XP_001807130.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 2021
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 80/187 (42%), Gaps = 6/187 (3%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QE+L+ +++ +S + + S + LV K NG R ++ + LN L
Sbjct: 1172 VQELLDNNIVR--ESESNYCSPVLLVKKKNGEQRLCIDYRKLNAQTVKDNHPLPRVDDQI 1229
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
LQ G Y S+DL Y +P+ +++ + +PFGL AP+ F
Sbjct: 1230 DRLQGGVYFTSLDLRSGYHQIPLSEESKKYTSFVTPFGQYEYNRVPFGLTNAPRTFQRFM 1289
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N +L+ VYLDD LL +D + IL S G +NL+K S
Sbjct: 1290 N---KILKPARENAAVYLDDVLLHAKDVNEALQNLQKVFEILRSEGLTLNLKKCSFLMTS 1346
Query: 660 VLQFLGI 666
V FLG
Sbjct: 1347 V-TFLGF 1352
>gi|301605103|ref|XP_002932203.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like, partial [Xenopus (Silurana) tropicalis]
Length = 832
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 110/481 (22%), Positives = 193/481 (40%), Gaps = 76/481 (15%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
+ + SG IPF PL + P + + +I+E L+ G ++ S G + +
Sbjct: 233 IDLFSGATIPFGRIYPL---------SEPELTVLKGYIEENLDKGFIRPSTSPAG--AGI 281
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPI 562
F V K + RP ++ + LN+ ++ L + L+ +DL AY V I
Sbjct: 282 FFVEKKDHSLRPCIDYRDLNKITVKNRYPLPLISELFVRLRSAQVFTKLDLRGAYNLVRI 341
Query: 563 KTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFL 621
+ + A +PFGL AP F L N + R + V+VYLDD L
Sbjct: 342 RQGDEWKTAFRTRYGHFEYLVMPFGLCNAPATFQHLVN---DIFRDFLDLFVIVYLDDIL 398
Query: 622 LVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK 681
+ + + S L + L+K + + +FLG++
Sbjct: 399 IFSSSLEEHRVHVTKVFSRLRAHKLFAKLEKCEFEKSSI-EFLGLVI----------SSD 447
Query: 682 QLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
+++ + R + A W + + R++ ++ FA+F R+ + S R+ AP
Sbjct: 448 GISMDS--RKVSAVLDWPIPNDRRAVQRFVGFANFY--------RKFIKDFS--RVIAP- 494
Query: 741 LTPINPAVLPKLEWW----------LNALPLSSPIF--PRQVQHFI-STDASDLGWGSQV 787
+T + +V K +W L S+PI P + FI DAS+ G+ +
Sbjct: 495 ITALTSSV--KKFFWSSEAQQAFTELKRSFTSAPILRHPDPARPFILEVDASEHAVGAVL 552
Query: 788 DS-----------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSD 834
+F S S+ ++N+ + +E+ A+ A LL+ + ++V SD
Sbjct: 553 SQRADFKNQLHPVAFFSRKLSQSERNYDVGDRELLAIKSAFQEWRHLLEGANHPILVFSD 612
Query: 835 NQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
++ + YLR K L +F R + F PG+ N AD+LSR P
Sbjct: 613 HKN-LEYLR---SAKRLRPRQARWALFFS----RFNFHVTFRPGSKNGKADALSRMFPAP 664
Query: 895 D 895
+
Sbjct: 665 E 665
>gi|281201718|gb|EFA75926.1| Polyprotein [Polysphondylium pallidum PN500]
Length = 904
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 97/422 (22%), Positives = 173/422 (40%), Gaps = 61/422 (14%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ +ML+ G++ +S + + S + L+PK +G R +N K LN P + + I
Sbjct: 524 VDDMLKKGIIS--ESNSSYASPVVLIPKPDGEVRFCVNYKKLNNLTRPLVYPFPHVDDIY 581
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS-- 597
S LQ +D + Y +P++ + A + + V +PFGL AP F
Sbjct: 582 SALQHASVFSILDCAAGYHQIPVREEDRWLTAFTTHRGVYEFNVMPFGLKNAPALFQKDD 641
Query: 598 ---LSNWVASLLRSRGMRVVVYLDDFL--LVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
SN V S + +LD L L + ++ + K + + LG I+N
Sbjct: 642 IIIFSNDVESH--------ITHLDKILQILFKNNVQLNRKKSKFFRTSVKFLGHIINNGS 693
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
S+ P + + LP Q T N L+ LL +N R + ++F
Sbjct: 694 ISIDPDRINSIIN-----------LP---QPTNVNQLQKLLGILNYN----RKFI--INF 733
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQH 772
AS P+ RL ++ + + S AV ++ ++ L P ++
Sbjct: 734 ASIAAPLHRLLNKDTRWEWS---------EECKNAVKTIVDRMKDSGILKIPDLNKEF-- 782
Query: 773 FISTDASDLGWGSQV-DSSFLSGLWSRE----QQNWHINKKEMFAVHQALSLNLPLLQSS 827
+ TDAS +G G + + L +SR+ ++N+H + E ++ ++ +L S+
Sbjct: 783 VLETDASGVGIGGALFQNGMLVSAYSRKLTAAEKNYHSGELECLSLVDSVKHFRHILGSA 842
Query: 828 VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSL 887
+DNQ + + K+ +L +FL ++ I A G +N AD L
Sbjct: 843 FFTAVTDNQAL--KVLNDKSPKNARILR--WSMFLQGFNYVIRYRA----GKHNDFADGL 894
Query: 888 SR 889
SR
Sbjct: 895 SR 896
>gi|328905463|gb|AEB54984.1| polyprotein [Dahlia mosaic virus D10]
Length = 810
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 118/544 (21%), Positives = 210/544 (38%), Gaps = 83/544 (15%)
Query: 373 RARRGKKSSSPQNLEPPGRVSLKVQTLQKPQR-----CS-SPVNPPADSRIGAELVGGRL 426
+A++ ++ Q + P R L++Q QK + CS +P++P +
Sbjct: 321 KAKQIPGTNITQEVIKPERFFLEIQKYQKIEELLEKVCSENPIDPEKSNY---------- 370
Query: 427 RRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLET 486
+++A I L P ++R P P S I+E+L+
Sbjct: 371 --WMNASIELIDPKTVIR-----EKPMKYSPQ-------------DREEFSKQIKELLDL 410
Query: 487 GVLKRLDSTTGFLSRLFLVP----KGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFL 542
++ + S + +S FLV K G R V+N K +N +L + + L
Sbjct: 411 KII--IPSKSPHMSPAFLVENEAEKRRGKKRMVVNYKAINAATKGDSHNLPCMQELLTLL 468
Query: 543 QKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV 602
+ Y S D ++ V + Q A + +PFGL AP F +
Sbjct: 469 RGKIYFSSFDCKNGFWQVLLDEESQLLTAFTCPDGHYQWKVVPFGLKQAPSIF---QRHM 525
Query: 603 ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQ 662
+ LR VY+DD ++ + + + G I++ +K++L +
Sbjct: 526 QNALRGLENYCTVYVDDIIVFSDSEEKHYFHVLSVLKTIEKYGIILSKKKTNLFKTKI-N 584
Query: 663 FLGIMWD--PHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
FLG D H + + E+ L TL K + LG L++A IP
Sbjct: 585 FLGFEIDQGTHCPQKHILEN----LHKFPDTLEDKK-----HLQRFLGILTYAESYIP-- 633
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-STDAS 779
+L R Q L + + + + K++ L + P P++ + I TDAS
Sbjct: 634 KLAELRRPLQVKLKKDYVWEWKQSDTSYIKKIKKNLTSFP--KLYLPKEKEFLIIETDAS 691
Query: 780 DLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
+ WG + + + SG + ++N+H N+KE+ AV A+S L +
Sbjct: 692 NDYWGGVLKAKTAEKEEVCRYTSGSFKTAEKNYHSNEKELLAVKNAISKFSIYLTPVKFL 751
Query: 831 VQSDNQTVVSYLRRQ--GGTKSLSLLSEVEKIFLLSQDW--RIHILAQFIPGAYNSVADS 886
V++D++ +L+ + G K L+ Q W R + + G N +AD
Sbjct: 752 VRTDSKNFTYFLKTKISGDNKQGRLVR--------WQMWFSRYTFDIEHLEGLKNVLADC 803
Query: 887 LSRS 890
L+R
Sbjct: 804 LTRD 807
>gi|210076138|ref|XP_002143073.1| YALI0E14388p2 [Yarrowia lipolytica]
gi|199426919|emb|CAR64332.1| YALI0E14388p2 [Yarrowia lipolytica CLIB122]
Length = 1488
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/444 (22%), Positives = 172/444 (38%), Gaps = 79/444 (17%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++LE G+++ S + + + L +V K G R + + LN+ + +F L I
Sbjct: 573 VNDLLERGIIR--PSKSPYSAPLVIVKKKGGELRICTDYRALNELTTKDRFPLPRIDDIL 630
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L D DL Y+ V +K + A S +PFGL AP F L
Sbjct: 631 DCLDGADTFSKFDLLSGYWQVLVKESDVHKTAFSTRSGHYEYLVMPFGLCNAPATFQRLM 690
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N +L V VYLDD ++ +++ + + + L + + K L
Sbjct: 691 N--DALRPFLNKTVCVYLDDIIVFSRNREDHKRHVREVLDALRAQKFYAKKSKCELFRKK 748
Query: 660 ------VLQFLGIMWDPHLDRM---WLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
V+ G+ DP ++ W+P ++ + LL +L
Sbjct: 749 MGFLGHVVSAAGVEPDPEKVKVVEEWVPP---------------------NTPKGLLSFL 787
Query: 711 SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW-WLNALPLSSP----- 764
+ + R I+ A ++ AP LT + A L ++ W A ++
Sbjct: 788 GLTGY-------YRRFIEDYA---KIAAP-LT--DAATLSPTDFKWTEACQVAFEQMKAK 834
Query: 765 -------IFPRQVQHF-ISTDASDLGWGSQVDS-----------SFLSGLWSREQQNWHI 805
I P F +STDA D+ G + ++ S + + + N+
Sbjct: 835 LVSNEVMIIPTMEDTFKVSTDACDIAMGGVLQQWSPKDQEFRPVAYESTKFKKHEMNYPT 894
Query: 806 NKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
+KE +A+ AL L ++++D+Q+ +SY Q S L ++ FL
Sbjct: 895 REKEFYAIIHALRKWRHYLLGRPFLIETDHQS-LSYFTSQTHPPSGRLSRWLD--FLAEY 951
Query: 866 DWRIHILAQFIPGAYNSVADSLSR 889
D+ I +++PG N AD LSR
Sbjct: 952 DFEI----KYVPGKDNDAADGLSR 971
>gi|321459492|gb|EFX70545.1| hypothetical protein DAPPUDRAFT_257014 [Daphnia pulex]
Length = 424
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/178 (28%), Positives = 82/178 (46%), Gaps = 21/178 (11%)
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDS 702
SLG+++NL+KS +P+ V+++LG++ D + + D L G + L
Sbjct: 252 SLGFLINLEKSVTTPSRVMEYLGMVIDSVQEVKKMCTDA-LNTGQV----------PLRD 300
Query: 703 ARSLLGYLSFASFVIPMGRLHSRRIQR------QASLLRLGAPHLTPINPAVLPKLEWWL 756
S+LG ++A IP + H R +QR Q +L L + + LEWW+
Sbjct: 301 VASILGNFTWAIPTIPFAQSHYRSMQRFYINESQKALGDLSVKCVLSV--GARSDLEWWV 358
Query: 757 NALPLSS--PIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFA 812
L ++ FP+ I +DAS GWG+ D G W+ +Q HIN E+
Sbjct: 359 ANLEEANGKEFFPKVADMEIFSDASRSGWGAVCDGITTRGPWTMDQSTLHINCLELLG 416
>gi|49256739|emb|CAG34127.1| polyprotein [Yarrowia lipolytica]
Length = 1240
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/444 (22%), Positives = 172/444 (38%), Gaps = 79/444 (17%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++LE G+++ S + + + L +V K G R + + LN+ + +F L I
Sbjct: 325 VNDLLERGIIR--PSKSPYSAPLVIVKKKGGELRICTDYRALNELTTKDRFPLPRIDDIL 382
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L D DL Y+ V +K + A S +PFGL AP F L
Sbjct: 383 DCLDGADTFSKFDLLSGYWQVLVKESDVHKTAFSTRSGHYEYLVMPFGLCNAPATFQRLM 442
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N +L V VYLDD ++ +++ + + + L + + K L
Sbjct: 443 N--DALRPFLNKTVCVYLDDIIVFSRNREDHKRHVREVLDALRAQKFYAKKSKCELFRKK 500
Query: 660 ------VLQFLGIMWDPHLDRM---WLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
V+ G+ DP ++ W+P ++ + LL +L
Sbjct: 501 MGFLGHVVSAAGVEPDPEKVKVVEEWVPP---------------------NTPKGLLSFL 539
Query: 711 SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW-WLNALPLSSP----- 764
+ + R I+ A ++ AP LT + A L ++ W A ++
Sbjct: 540 GLTGY-------YRRFIEDYA---KIAAP-LT--DAATLSPTDFKWTEACQVAFEQMKAK 586
Query: 765 -------IFPRQVQHF-ISTDASDLGWGSQVDS-----------SFLSGLWSREQQNWHI 805
I P F +STDA D+ G + ++ S + + + N+
Sbjct: 587 LVSNEVMIIPTMEDTFKVSTDACDIAMGGVLQQWSPKDQEFRPVAYESTKFKKHEMNYPT 646
Query: 806 NKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
+KE +A+ AL L ++++D+Q+ +SY Q S L ++ FL
Sbjct: 647 REKEFYAIIHALRKWRHYLLGRPFLIETDHQS-LSYFTSQTHPPSGRLSRWLD--FLAEY 703
Query: 866 DWRIHILAQFIPGAYNSVADSLSR 889
D+ I +++PG N AD LSR
Sbjct: 704 DFEI----KYVPGKDNDAADGLSR 723
>gi|391335912|ref|XP_003742330.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 745
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 94/397 (23%), Positives = 172/397 (43%), Gaps = 33/397 (8%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLS 526
+A + + + +++ G L ++D + + + + +V K NG R + GLN+ L
Sbjct: 300 VAIALQEQIDKELDRLIQNGTLTKVDFSE-WATPIVVVKKANGSIRVCADYSTGLNEALV 358
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ L N I + +DL+ AY +P+ QR ++ + + T L F
Sbjct: 359 DIEHPLPNMEEIMTKFSGNRVFSQLDLADAYLQLPLDENSQRVTTITTHRGLFQYTRLVF 418
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGW 646
GL TAP F V L+ G V+VYLDD L++ D + + + L G+
Sbjct: 419 GLKTAPSIFQKTIEQV--LMGMEG--VLVYLDDILVMAPDTERHDQRLNRVLQRLQDSGF 474
Query: 647 IVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSL 706
+ L+K P +++LG++ R + ++++ LR N RSL
Sbjct: 475 HLKLEKCYFH-VPKVKYLGMVVSS---RGIEADPSRISVIKELRP-----PRNQKEVRSL 525
Query: 707 LGYLS-FASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSP 764
LG ++ + FV M R H ++ +LL+ + TP + L K++ L L +
Sbjct: 526 LGMVNYYGKFVDNMHR-HKPLLE---ALLKKDVRFVWTPEHEKALAKIKEILTGPLLLTH 581
Query: 765 IFPRQVQHFISTDASDLGWGSQVDSSFLSG----------LWSREQQNWHINKKEMFAVH 814
PRQ ++ DAS G G + + G ++ Q+N+ +KE A+
Sbjct: 582 YDPRQTL-LVAADASPSGIGGVLLQRYADGNEKAVFHMSKSLTKAQRNYSQIEKEALALV 640
Query: 815 QALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSL 851
A+ + ++Q+D++ +++ L R TK L
Sbjct: 641 TAVERFRKFIWGRHFILQTDHRPLLA-LFRTSNTKGL 676
>gi|149236333|ref|XP_001524044.1| hypothetical protein LELG_04857 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452420|gb|EDK46676.1| hypothetical protein LELG_04857 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 944
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 191/441 (43%), Gaps = 52/441 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 77 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 134
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F+S+
Sbjct: 135 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFSSI- 193
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DD +V P++ E+ L L + + L ++ ++
Sbjct: 194 --LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 249
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L+ L + L+S+ L+
Sbjct: 250 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLKYPLPNTVKQLESSLGLV 301
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 302 NY--YRQLIVGHAELTAPFYNLVNQARTEPKHQIHWDPTTKRFFRQIITVLTNQPILQPL 359
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 360 NFKDLIT-VHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 418
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
PLL + V+ + DN+ +V + + + ++ V K F+ + +++IH
Sbjct: 419 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNKP--LDNSHFVNRVYKWLNFIRTFNYQIH- 475
Query: 872 LAQFIPGAYNSVADSLSRSKS 892
I G N +AD+LSR +
Sbjct: 476 ---HIDGLKNIIADALSRCHT 493
>gi|154282825|ref|XP_001542208.1| hypothetical protein HCAG_02379 [Ajellomyces capsulatus NAm1]
gi|150410388|gb|EDN05776.1| hypothetical protein HCAG_02379 [Ajellomyces capsulatus NAm1]
Length = 1263
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 97/224 (43%), Gaps = 12/224 (5%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+PP +P+ +L +A+ +++ L+ G ++ S G + + VPK +GG R
Sbjct: 601 ARPPFMPIYNLSETEL---AALREYLKNALDKGWIQPSSSPAG--APILFVPKKDGGLRL 655
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ +GLN+ ++ L + L K +DL AY + I + A
Sbjct: 656 CVDYRGLNRITIKNRYPLPLISELLDRLSKAKVFTKLDLRDAYPRILIAAKDRWKTAFRT 715
Query: 575 NGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+PFGLA AP F A ++N ++ LL + VVYLDD L+ + + ++
Sbjct: 716 RYGHFEYVVMPFGLANAPATFQAYINNALSDLL---DICCVVYLDDILIFSNSKQEHKVH 772
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI--MWDPHLDRM 675
+ L L K V FLG+ + PH ++
Sbjct: 773 VTKVLERLERANLFAKLSKCEFEVDKV-SFLGLSPLLQPHYSKL 815
>gi|427780775|gb|JAA55839.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 1152
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/414 (22%), Positives = 167/414 (40%), Gaps = 51/414 (12%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
RI +G A P KP V + +A V+ +ML+ GV++ +S + + + +
Sbjct: 288 RINTGDATPIRQKPYRVSPSERKVIADQVN--------DMLKKGVIQ--ESCSPWAAPVI 337
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIK 563
LV K + R ++ + LN + L L Y S+DL Y+ +P+
Sbjct: 338 LVKKKDNSWRFCVDYRRLNAVTKKDVYPLPRIDDALDCLHSAAYFSSVDLRSGYWQIPMH 397
Query: 564 TTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLL 622
+ + A + +PFGL AP A+ ++ S+LR + + YLDD ++
Sbjct: 398 PSDREKTAFVTPDGLFEFNVMPFGLCNAP---ATFERFMDSILRGLKWETCMCYLDDVII 454
Query: 623 VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP--APVLQFL----GIMWDPHLDRMW 676
+ + + + + G I+N +K A VL +L GI DPH
Sbjct: 455 FGRTFHEHNQRLSVVLDCVKQAGLILNAKKCHFGERQALVLGYLVDKDGIRPDPH----- 509
Query: 677 LPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASFVIPMGRLHSRRIQRQASLLR 735
++T +R T + RS LG S F F+ +L LLR
Sbjct: 510 -----KIT---AVRNFKPPTT--VKDLRSFLGLCSYFRRFIKGFAQL----AHPLTDLLR 555
Query: 736 LGAPHLTPIN-PAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLG---------WGS 785
P+ + A +L++ L + PL P + + TDAS +G GS
Sbjct: 556 KDTPYQWTVQCEAAFEQLKFLLTSGPLLRHFDPEALTE-LHTDASGVGVGAVLVQFHGGS 614
Query: 786 QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVV 839
Q ++ S ++ ++N+ + + E AV A+ P L + +D+ ++
Sbjct: 615 QHVIAYASRTLTKAERNYTVTELECLAVVFAVQKFRPYLYGRRFKIVTDHHSLC 668
>gi|326677758|ref|XP_003200905.1| PREDICTED: hypothetical protein LOC100331485 [Danio rerio]
Length = 1474
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 19/182 (10%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
+I A P +P VP + L P+ + + MLE +++ ST+ + S +
Sbjct: 1038 KICLTEATPIRQRPYRVP----ESLIKPLKEELKM----MLEMDIIE--PSTSAWSSPIV 1087
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHV 560
+VPK +G R L+ + LN + KF RI +++ Y+ ++DL + Y+ V
Sbjct: 1088 IVPKKDGTLRVCLDFRKLN---AVSKFDAYPMPRIDELVERIGRAKYITTLDLCKGYWQV 1144
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
P++ T + + A + +PFGL AP F L N V LR+ YLDD
Sbjct: 1145 PLEKTSREYTAFRTPVGLYHFKTMPFGLHGAPATFQRLMNQV---LRNCEEYSAAYLDDV 1201
Query: 621 LL 622
++
Sbjct: 1202 VI 1203
>gi|308460254|ref|XP_003092433.1| hypothetical protein CRE_03477 [Caenorhabditis remanei]
gi|308253227|gb|EFO97179.1| hypothetical protein CRE_03477 [Caenorhabditis remanei]
Length = 1753
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 101/456 (22%), Positives = 188/456 (41%), Gaps = 62/456 (13%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P C ++ + ++ M + G+++ +ST+ + S L +PK NG R V++ +
Sbjct: 814 IPQCRPYRVSPQQREKLEKELKFMKDNGLIE--ESTSPYTSPLLSIPKANGEIRIVIDYR 871
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN + + + N + +G D++Q + +P+ H+ A + V
Sbjct: 872 RLNLITRSRTYIMPNTIDVTEEASRGKLFSVFDIAQGFHTIPMHEAHKERTAFCCHMGVF 931
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP----RILEIQGK 635
+P GL AP F +A + + +++Y+DD ++V++D R LE +
Sbjct: 932 QYRYMPMGLKGAPDTF---QRAMAEVEKQFTGTMILYVDDLIVVSRDEEEHLRNLEEFFQ 988
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
L + ++G + +KS + + FLG + + + P ++ +R
Sbjct: 989 LMI----NMGLKLKAEKSQIGRTKI-SFLGFVIE---NNTIQPSGEKT---EAIRKFPTP 1037
Query: 696 KTWNLDSARSLL---GYL-----SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
T L +S L GY +A V P+ L + ++ + G
Sbjct: 1038 TT--LSEVKSFLGMSGYFRRFIKDYAIIVKPLTTLTQKDVE-----FKWGEEQ-----EK 1085
Query: 748 VLPKLEWWLNALPLSSPIF--PRQVQHF-ISTDASDLGWGS-----QVDSSFLSGLWSR- 798
+++ L +S PI PR F + TDAS +G + Q D + SR
Sbjct: 1086 AFEEVKQRL----ISPPILTTPRMDGDFEMHTDASKIGIAAVLLQKQDDELKVIAYASRP 1141
Query: 799 ---EQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
+Q + + E A+ L+ P + V V +D+Q + S L R+ S LL
Sbjct: 1142 TTPVEQRYAAIESEALAITWGLTHYRPYIFGKKVKVVTDHQPLKSLLHRKEKEMSGRLLR 1201
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
I Q + + I+ + PG N +AD+LSR +
Sbjct: 1202 HQAII----QMYDVEIV--YRPGKENPLADALSRQR 1231
>gi|270016329|gb|EFA12775.1| hypothetical protein TcasGA2_TC005018 [Tribolium castaneum]
Length = 997
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/245 (26%), Positives = 108/245 (44%), Gaps = 33/245 (13%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQ---HLATP-----------VSSAMSLHIQEMLETG 487
L+ ++ Y FS++P L + + H TP + A+ IQEML+ G
Sbjct: 298 LIHLLQEYRCIFSSRPGLTHKYTHEIKLHDKTPFLKRPYPVPFALRPAVDATIQEMLDLG 357
Query: 488 VLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL------SPKKFSLINHFRIPSF 541
V+KR + + S + +V K +G R L+ + +N + P L+ F
Sbjct: 358 VIKR--EASPYASPMTVVKKKDGTVRICLDARMINSKMIADCESPPAADELLRRF----- 410
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
YM +IDL +Y+ +P+ +++ A YNG LPFGL TA +F+ +
Sbjct: 411 -HGIRYMSTIDLRSSYWQIPLSPESRQYTAFLYNGRSYTYQVLPFGLKTAVGSFSRAMDV 469
Query: 602 V-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
V + +R VV Y+DD L+ ++ + L +NL+KS+ V
Sbjct: 470 VLGTEVRE---FVVNYIDDLLVASETLNEHLEHLRQVFEKLKQARMKINLEKSNFIQKEV 526
Query: 661 LQFLG 665
+FLG
Sbjct: 527 -KFLG 530
>gi|326677858|ref|XP_003200930.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1106
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 102/443 (23%), Positives = 180/443 (40%), Gaps = 74/443 (16%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P + AM+ +I E LE G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 393 LSQPETEAMNTYISEELEKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNEITVK 450
Query: 528 KKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQ---RFLALSYNGDVLAM 581
++ L +P+ L++ Y +DL AY + I+ + F ++ + + L M
Sbjct: 451 YRYPLP---LVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFFTINGHYEYLVM 507
Query: 582 TCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVS 639
PFGLA +P F + N V + ++ V+VY+DD L+ + I ++ +
Sbjct: 508 ---PFGLANSPSVFQAFINEVFRDMLNQW--VIVYIDDILIYSNSLSEHIQHVRAVIKRL 562
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
I L + K + FLG P + ++A N
Sbjct: 563 IQNQL--YAKISKCEFHQT-CISFLGYNISP------------------MDIIVAMDQQN 601
Query: 700 LDSA---------RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
+DS R L L FA+F +R R S + AP LT + A
Sbjct: 602 VDSVTQWPQPETIRQLQRVLGFANFY--------QRFIRNFS--TIAAP-LTAMVKANNA 650
Query: 751 KLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEM 810
+L+W +A+ + + R I + S L +F S + ++N+ + +E+
Sbjct: 651 RLKWNSDAIKAFNQLKARFSSAPILSQRS-LTMNKLHPCAFYSRKLNPAERNYDVGNREL 709
Query: 811 FAVHQALSLNLPLLQSS----VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQD 866
A+ AL L+ + V+ N + +R ++ L F D
Sbjct: 710 LAMKAALEEWRHWLEGTKHPFTVITDHKNLEYICSCKRLNPRQARWAL------FFTCFD 763
Query: 867 WRIHILAQFIPGAYNSVADSLSR 889
+++ + PG+ N AD+LSR
Sbjct: 764 FQV----TYTPGSKNVKADALSR 782
>gi|149248580|ref|XP_001528677.1| hypothetical protein LELG_01197 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146448631|gb|EDK43019.1| hypothetical protein LELG_01197 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1527
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 189/441 (42%), Gaps = 52/441 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 660 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 717
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F+S+
Sbjct: 718 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFSSI- 776
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DD +V P++ E+ L L + + L ++ ++
Sbjct: 777 --LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 832
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L L + L+S L+
Sbjct: 833 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPNTVKQLESLLGLV 884
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 885 NY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPL 942
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 943 NFKDLI-TVHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 1001
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
PLL + V+ + DN+ +V + + + ++ V K F+ + +++IH
Sbjct: 1002 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNKP--LDNSHFVNRVYKWLNFIRTFNYQIH- 1058
Query: 872 LAQFIPGAYNSVADSLSRSKS 892
I G N +AD+LSR +
Sbjct: 1059 ---HIDGLKNIIADALSRCHT 1076
>gi|110282984|sp|P14350.2|POL_FOAMV RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;
Contains: RecName: Full=Protease/Reverse
transcriptase/ribonuclease H; AltName:
Full=p87Pro-RT-RNaseH; Contains: RecName:
Full=Protease/Reverse transcriptase; AltName:
Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H;
Short=RNase H; Contains: RecName: Full=Integrase;
Short=IN; AltName: Full=p42In
gi|1617063|emb|CAA69003.1| pol [Human foamy virus]
gi|1617068|emb|CAA68997.1| pol [Human foamy virus]
gi|1617073|emb|CAA68999.1| pol [Human foamy virus]
gi|1850918|gb|AAB48112.1| pro-pol protein [Human spumaretrovirus]
Length = 1143
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/244 (26%), Positives = 115/244 (47%), Gaps = 15/244 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
++ + I ++L+ GVL +ST + ++ VPK +G R VL+ + +N+ + +
Sbjct: 177 SIQIVIDDLLKQGVLTPQNSTMN--TPVYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQH 234
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + + + Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 235 SAGILATIVRQKYKTTLDLANGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPAL 294
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + V LL+ V VY+DD L + DP+ Q + IL G++V+L+KS
Sbjct: 295 FTAD---VVDLLKEIP-NVQVYVDDIYLSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSE 350
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
+ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 351 IGQKTV-EFLGF----NITK----EGRGLTDTFKTKLLNITPPKDLKQLQSILGLLNFAR 401
Query: 715 FVIP 718
IP
Sbjct: 402 NFIP 405
>gi|189533692|ref|XP_001921559.1| PREDICTED: hypothetical protein LOC566211 [Danio rerio]
Length = 1496
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 19/182 (10%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
+I A P +P VP + L P+ + + MLE +++ ST+ + S +
Sbjct: 1038 KICLTEATPIRQRPYRVP----ESLIKPLKEELKM----MLEMDIIE--PSTSAWSSPIV 1087
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHV 560
+VPK +G R L+ + LN + KF RI +++ Y+ ++DL + Y+ V
Sbjct: 1088 IVPKKDGTLRVCLDFRKLN---AVSKFDAYPMPRIDELVERIGRAKYITTLDLCKGYWQV 1144
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
P++ T + + A + +PFGL AP F L N V LR+ YLDD
Sbjct: 1145 PLEKTSREYTAFRTPVGLYHFKTMPFGLHGAPATFQRLMNQV---LRNCEEYSAAYLDDV 1201
Query: 621 LL 622
++
Sbjct: 1202 VI 1203
>gi|149234601|ref|XP_001523180.1| hypothetical protein LELG_05726 [Lodderomyces elongisporus NRRL
YB-4239]
gi|149240724|ref|XP_001526209.1| hypothetical protein LELG_02767 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450332|gb|EDK44588.1| hypothetical protein LELG_02767 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146453289|gb|EDK47545.1| hypothetical protein LELG_05726 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1527
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 189/441 (42%), Gaps = 52/441 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 660 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 717
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F+S+
Sbjct: 718 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFSSI- 776
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DD +V P++ E+ L L + + L ++ ++
Sbjct: 777 --LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 832
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L L + L+S L+
Sbjct: 833 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPNTVKQLESFLGLV 884
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 885 NY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPL 942
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 943 NFKDLI-TVHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 1001
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
PLL + V+ + DN+ +V + + + ++ V K F+ + +++IH
Sbjct: 1002 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNKP--LDNSHFVNRVYKWLNFIRTFNYQIH- 1058
Query: 872 LAQFIPGAYNSVADSLSRSKS 892
I G N +AD+LSR +
Sbjct: 1059 ---HIDGLKNIIADALSRCHT 1076
>gi|391337337|ref|XP_003743026.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1165
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 96/438 (21%), Positives = 186/438 (42%), Gaps = 39/438 (8%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLS 526
+A + ++ + +++ G L +DS+ + + + +V K NG R + GLN L
Sbjct: 428 VAFALQESIDKELDRLVQNGTLIPVDSS-DWATPIVVVKKPNGAVRVCADYSTGLNDALV 486
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ L N + + +DL+ AY + + T Q+ ++ + + T L F
Sbjct: 487 DIEHPLPNMEEVMTKFSGNKIFAHLDLADAYLQLRLDTPSQQLTTITTHRGLFRYTRLVF 546
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGW 646
GL TAP F + + L V+VYLDD L++ + ++ E + + L G+
Sbjct: 547 GLKTAPAIFQKTIDQALAGLDG----VLVYLDDILIMAPNYKLYEQRLVDVLRRLEDWGF 602
Query: 647 IVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSL 706
+ ++K + P +++LG++ D+ + ++ + LR K N R+L
Sbjct: 603 RLRIEKCFFN-VPTVKYLGMVIS---DKGIEADPARIAVIKNLR-----KPANQKEVRAL 653
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIF 766
LG +++ + LH + +A L + + A L ++ L L S
Sbjct: 654 LGLVNYYGKFVK--NLHRFKTPLEALLAKDARFEWGVQHDAALTGIKDMLCGPLLLSHYD 711
Query: 767 PRQVQHFISTDASDLGWGSQVDSSFLSG----------LWSREQQNWHINKKEMFAVHQA 816
PRQ ++ DAS G G + + G S+ Q+N+ +KE FA+ A
Sbjct: 712 PRQTL-VVAADASQTGIGGVLLQRYADGNERAVFHMSKSLSKSQRNYSQVEKEAFALVTA 770
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILA--- 873
+ + ++Q+D++ +++ L R TK L E+ + W + ++
Sbjct: 771 VERFKKFVWGRRFILQTDHRPLLA-LFRTSNTKGLQ-----ERTAARLKRWALRLVGFDF 824
Query: 874 --QFIPGAYNSVADSLSR 889
++I AD+LSR
Sbjct: 825 EIEYIRTEEFGQADALSR 842
>gi|341876938|gb|EGT32873.1| hypothetical protein CAEBREN_06262 [Caenorhabditis brenneri]
Length = 2238
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 105/437 (24%), Positives = 194/437 (44%), Gaps = 61/437 (13%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
++ ++E+L+ G+++ +ST F S + LV K + R ++ + LN + + + N
Sbjct: 1378 INRQVEELLKQGIIEVSNST--FTSPIVLVKKKDSTFRFTVDYRLLNAVSEKRNYQIPNI 1435
Query: 536 FRIPSFLQKGDYMIS-IDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ L G ++ S D Q +F + +K + A + + + +P G++ AP
Sbjct: 1436 TELLD-LATGSFIYSSFDFVQGFFQIDLKKEDRYLTAFATDEETYQFQRMPMGVSGAPFT 1494
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLV-NQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F +S + L ++ +R+ YLDD LLV + + LE KL +I+ + G + L K
Sbjct: 1495 FQQVSRF---LQKTTKVRMFAYLDDLLLVSSSEEEHLEDIKKLLENIIRN-GLKLKLSKC 1550
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPED--KQLTLGN--ILRTLLASKTWNLDSARSLLGY 709
+ L+FLG + + L + K + N I ++ A +++ L+GY
Sbjct: 1551 VFARKE-LKFLGYL----IGETGLKPNPSKTFVIQNFPIPESVTAVRSF-----IGLVGY 1600
Query: 710 L-----SFASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALP-LS 762
+FA P+ L + + P L T I+ +L+ L P LS
Sbjct: 1601 FRRFIRNFAGIAAPLHNLTEKDV-----------PFLWTDIHQKAFDELKTALINPPILS 1649
Query: 763 SPIFPRQVQHFISTDASDLGWGS---QVDS-------SFLSGLWSREQQNWHINKKEMFA 812
P + + + TDAS + + Q ++ SF S S+ + + + E A
Sbjct: 1650 GPDLTK--PYILETDASTIAIAAVLLQKNNEGLLNVISFASRKLSKAESRYPPIEGEALA 1707
Query: 813 VHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL 872
V L L + +V +D+Q + S L+R+ +L + K ++ Q++ I I
Sbjct: 1708 VLFGLQHYRQYLLGNHTLVVTDHQPLTSLLKRK------NLEGRLLKYQIMIQEFDIEI- 1760
Query: 873 AQFIPGAYNSVADSLSR 889
++ PG N VAD+LSR
Sbjct: 1761 -RYRPGRRNVVADALSR 1776
>gi|427792993|gb|JAA61948.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 1099
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 147/358 (41%), Gaps = 47/358 (13%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
+S + EM+ GV++ +S + + + + LV K +G R ++ + LN + L
Sbjct: 265 ISEQVHEMMTKGVIQ--ESASPWAAPVILVKKKDGSWRFCVDYRRLNAVTKKDVYPLPRI 322
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
L Y S+DL Y+ +P+ + A + +PFGL AP F
Sbjct: 323 DDAIDCLHSASYFSSVDLRSGYWQIPMHPEDKEKTAFVTPDGLFEFNVMPFGLCNAPATF 382
Query: 596 ASLSNWVASLLRSRGMR---VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
+ + RG++ + YLDD ++ + + ++ + + G ++N +K
Sbjct: 383 ERFMDTIL-----RGLKWNICMCYLDDVVIFGRTFSEHNSRLDTVLNCIRNAGLVLNSKK 437
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
L LG + D R P+ +++ L S ++ RS LG S+
Sbjct: 438 CHFGDRQTL-VLGHLVDKDGIR---PDPEKIAAVAAL-----SAPRSVKELRSFLGLCSY 488
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ 771
IP + SLLR P TP + +L++ L + P+ +Q
Sbjct: 489 FRRFIPK---FADVAYPLTSLLRKDVPFEWTPECASSFRQLKFLLTSRPV--------LQ 537
Query: 772 HF-------ISTDASDLGWGSQV-----DS----SFLSGLWSREQQNWHINKKEMFAV 813
HF + TDAS +G G+ + DS ++ S SR +QN+ + ++E AV
Sbjct: 538 HFSPSAPTELHTDASGIGIGAVLIQRYGDSEHVIAYASRSLSRPEQNYTVTEQECLAV 595
>gi|17932886|emb|CAC80814.1| polymerase [Stork hepatitis B virus]
Length = 790
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 99/222 (44%), Gaps = 13/222 (5%)
Query: 501 RLFLVPKGNGGT---RPVLNL----KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
R+FLV K + T R V++ KG N PK +S N + + G IS+DL
Sbjct: 393 RIFLVDKNSRNTTEARLVVDFSQFSKGKNAMRFPKYWS-PNLTALRRIVPLGMPRISLDL 451
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWV-ASLLRSRGMR 612
SQA++H+P+ LA+S P G+ +P + + A + R +
Sbjct: 452 SQAFYHLPLNPASSSRLAVSDGKQAYYFRKAPMGVGLSPFLLHLFTTAIGAEISRRFNVW 511
Query: 613 VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI-MWDPH 671
Y+DDFLL + R L + L G +N K + SP ++FLG + + H
Sbjct: 512 TFSYMDDFLLCHPSARHLNSISHAVCTFLQEFGIRINFDKMTPSPVTTIRFLGYEISNQH 571
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
L + E + L +++ + + ++ + L+G+L+F
Sbjct: 572 LK---IEESRWNELRQVIKKIKVGQWYDWKCIQRLIGHLNFV 610
>gi|154284878|ref|XP_001543234.1| hypothetical protein HCAG_00280 [Ajellomyces capsulatus NAm1]
gi|150406875|gb|EDN02416.1| hypothetical protein HCAG_00280 [Ajellomyces capsulatus NAm1]
Length = 1584
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 89/202 (44%), Gaps = 9/202 (4%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+PP +P+ +L +A+ +++ L+ G ++ S G + + VPK +GG R
Sbjct: 615 ARPPFMPIYNLSETEL---AALREYLKNALDKGWIQPSSSPAG--APILFVPKKDGGLRL 669
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ +GLN+ ++ L + L K +DL AY + I + A
Sbjct: 670 CVDYRGLNRITIKNRYPLPLISELLDRLSKAKVFTKLDLRDAYHRILIAAKDRWKTAFRT 729
Query: 575 NGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+PFGLA AP F A ++N ++ LL + VVYLDD L+ + + ++
Sbjct: 730 RYGHFEYVVMPFGLANAPATFQAYINNALSDLL---DICCVVYLDDILIFSNSKQEHKVH 786
Query: 634 GKLAVSILGSLGWIVNLQKSSL 655
+ L L KS L
Sbjct: 787 VTKVLERLERANLFAKLSKSPL 808
>gi|67625725|tpe|CAJ00251.1| TPA: gag-pol polyprotein [Schistosoma mansoni]
Length = 1154
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 101/438 (23%), Positives = 176/438 (40%), Gaps = 61/438 (13%)
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFS 531
+ + ++ +QEML+ G+++ DS + S + LV K NG R ++ + LN K +
Sbjct: 426 LEAEVNRQVQEMLKEGIIEEADSP--YSSPVLLVKKPNGKYRFCVDFRELNNITELKPCA 483
Query: 532 LINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATA 591
+ LQ +DL Y+ +PIK + + A + +PFGLA A
Sbjct: 484 MPTVVETLDRLQNATVFTVLDLRSGYWQLPIKESDRSKTAFTIRDKQYQFRRMPFGLAGA 543
Query: 592 PQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
P F L SLL V VY DD ++ +Q + + + G +N
Sbjct: 544 PFTFRRL----MSLLLRDLDNVEVYGDDVVVYSQTETDHAKHVEAVLKRIEEFGLRINKD 599
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS 711
KS ++ + + + + + LPE K LT+ N+ +S R L +L
Sbjct: 600 KSQMAKSSITLLGHKVGNGEIKP--LPE-KILTIKNVAVP---------NSRRKLRQFLG 647
Query: 712 FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI------ 765
A+F +SR I+ + + AP ++ K W A + I
Sbjct: 648 RAAF-------YSRFIK---NFNEIAAPLYKLLSNT---KFSWTETAQQTFNQIKNVLDD 694
Query: 766 ------FPRQVQHF-ISTDASDLGWGSQVDSS-----FLSGLWSREQQNWHINKKEMFAV 813
P + F ++TDASD G G+ + S + S + + +Q + +KE A+
Sbjct: 695 RQMTLRLPELEKPFTVTTDASDHGIGAVLSQSNRVVEYASRVLTPAEQKYSTIEKECLAI 754
Query: 814 HQALSLNLPLLQSSVVMVQSDNQTV--VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI 871
A+ P L +++D++ + + R G + ++ E F +
Sbjct: 755 VWAVDKWRPYLLGRRFHIETDHKPLQWLQTARDPRGKLARWMIRLQEYDFSIGH------ 808
Query: 872 LAQFIPGAYNSVADSLSR 889
+PG N +AD LSR
Sbjct: 809 ----VPGKENVMADYLSR 822
>gi|189242428|ref|XP_001808069.1| PREDICTED: similar to protease, reverse transcriptase and RNase H
[Tribolium castaneum]
Length = 553
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/245 (26%), Positives = 108/245 (44%), Gaps = 33/245 (13%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQ---HLATP-----------VSSAMSLHIQEMLETG 487
L+ ++ Y FS++P L + + H TP + A+ IQEML+ G
Sbjct: 285 LIHLLQEYRCIFSSRPGLTHKYTHEIKLHDKTPFLKRPYPVPFALRPAVDATIQEMLDLG 344
Query: 488 VLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL------SPKKFSLINHFRIPSF 541
V+KR + + S + +V K +G R L+ + +N + P L+ F
Sbjct: 345 VIKR--EASPYASPMTVVKKKDGTVRICLDARMINSKMIADCESPPAADELLRRF----- 397
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
YM +IDL +Y+ +P+ +++ A YNG LPFGL TA +F+ +
Sbjct: 398 -HGIRYMSTIDLRSSYWQIPLSPESRQYTAFLYNGRSYTYQVLPFGLKTAVGSFSRAMDV 456
Query: 602 V-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
V + +R VV Y+DD L+ ++ + L +NL+KS+ V
Sbjct: 457 VLGTEVRE---FVVNYIDDLLVASETLNEHLEHLRQVFEKLKQARMKINLEKSNFIQKEV 513
Query: 661 LQFLG 665
+FLG
Sbjct: 514 -KFLG 517
>gi|292615051|ref|XP_002662530.1| PREDICTED: hypothetical protein LOC100333686 [Danio rerio]
Length = 1470
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 19/182 (10%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
+I A P +P VP + L P+ + + MLE +++ ST+ + S +
Sbjct: 1012 KICLTEATPIRQRPYRVP----ESLIKPLKEELKM----MLEMDIIE--PSTSAWSSPIV 1061
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHV 560
+VPK +G R L+ + LN + KF RI +++ Y+ ++DL + Y+ V
Sbjct: 1062 IVPKKDGTLRVCLDFRKLN---AVSKFDAYPMPRIDELVERIGRAKYITTLDLCKGYWQV 1118
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
P++ T + + A + +PFGL AP F L N V LR+ YLDD
Sbjct: 1119 PLEKTSREYTAFRTPVGLYHFKTMPFGLHGAPATFQRLMNQV---LRNCEEYSAAYLDDV 1175
Query: 621 LL 622
++
Sbjct: 1176 VI 1177
>gi|154271288|ref|XP_001536497.1| hypothetical protein HCAG_08279 [Ajellomyces capsulatus NAm1]
gi|150409167|gb|EDN04617.1| hypothetical protein HCAG_08279 [Ajellomyces capsulatus NAm1]
Length = 1587
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 89/202 (44%), Gaps = 9/202 (4%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+PP +P+ +L +A+ +++ L+ G ++ S G + + VPK +GG R
Sbjct: 618 ARPPFMPIYNLSETEL---AALREYLKNALDKGWIQPSSSPAG--APILFVPKKDGGLRL 672
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ +GLN+ ++ L + L K +DL AY + I + A
Sbjct: 673 CVDYRGLNRITIKNRYPLPLISELLDRLSKAKVFTKLDLRDAYHRILIAAKDRWKTAFRT 732
Query: 575 NGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+PFGLA AP F A ++N ++ LL + VVYLDD L+ + + ++
Sbjct: 733 RYGHFEYVVMPFGLANAPATFQAYINNALSDLL---DICCVVYLDDILIFSNSKQEHKVH 789
Query: 634 GKLAVSILGSLGWIVNLQKSSL 655
+ L L KS L
Sbjct: 790 VTKVLERLERANLFAKLSKSPL 811
>gi|149248682|ref|XP_001528728.1| hypothetical protein LELG_01248 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146448682|gb|EDK43070.1| hypothetical protein LELG_01248 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1527
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 189/441 (42%), Gaps = 52/441 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 660 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 717
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F+S+
Sbjct: 718 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFSSI- 776
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DD +V P++ E+ L L + + L ++ ++
Sbjct: 777 --LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 832
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L L + L+S L+
Sbjct: 833 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPNTVKQLESFLGLV 884
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 885 NY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPL 942
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 943 NFKDLI-TVHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 1001
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
PLL + V+ + DN+ +V + + + ++ V K F+ + +++IH
Sbjct: 1002 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNKP--LDNSHFVNRVYKWLNFIRTFNYQIH- 1058
Query: 872 LAQFIPGAYNSVADSLSRSKS 892
I G N +AD+LSR +
Sbjct: 1059 ---HIDGLKNIIADALSRCHT 1076
>gi|9629926|ref|NP_056914.1| hypothetical protein FFV_gp1 [Feline foamy virus]
gi|82281205|sp|O93209.1|POL_FFV RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;
Contains: RecName: Full=Protease/Reverse
transcriptase/ribonuclease H; AltName:
Full=p87Pro-RT-RNaseH; Contains: RecName:
Full=Protease/Reverse transcriptase; AltName:
Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H;
Short=RNase H; Contains: RecName: Full=Integrase;
Short=IN; AltName: Full=p42In
gi|2842430|emb|CAA70075.1| pol [Feline foamy virus]
gi|3123539|emb|CAA11581.1| pol [Feline foamy virus]
Length = 1156
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 115/255 (45%), Gaps = 21/255 (8%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
H+ + + I ++L+ GVL + +ST + ++ VPK NG R VL+ + +N+
Sbjct: 166 HINPKAKPDIQIVINDLLKQGVLIQKESTMN--TPVYPVPKPNGRWRMVLDYRAVNKVTP 223
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ + I L KG Y +IDLS ++ PI A ++ G T LP
Sbjct: 224 LIAVQNQHSYGILGSLFKGRYKTTIDLSNGFWAHPIVPEDYWITAFTWQGKQYCWTVLPQ 283
Query: 587 GLATAPQAFASLSNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
G +P F V LL +G+ V VY+DD + + + + + L G
Sbjct: 284 GFLNSPGLFTGD---VVDLL--QGIPNVEVYVDDVYISHDSEKEHLEYLDILFNRLKEAG 338
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQL--TLGNILRTLLASKTWNLDSA 703
+I++L+KS+++ + ++ FLG E + L T L + A T L
Sbjct: 339 YIISLKKSNIANS-IVDFLGF--------QITNEGRGLTDTFKEKLENITAPTT--LKQL 387
Query: 704 RSLLGYLSFASFVIP 718
+S+LG L+FA IP
Sbjct: 388 QSILGLLNFARNFIP 402
>gi|149248865|ref|XP_001528809.1| hypothetical protein LELG_05797 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146453360|gb|EDK47616.1| hypothetical protein LELG_05797 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1527
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 189/441 (42%), Gaps = 52/441 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 660 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 717
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F+S+
Sbjct: 718 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFSSI- 776
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DD +V P++ E+ L L + + L ++ ++
Sbjct: 777 --LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 832
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L L + L+S L+
Sbjct: 833 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPNTVKQLESFLGLV 884
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 885 NY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPL 942
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 943 NFKDLI-TVHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 1001
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
PLL + V+ + DN+ +V + + + ++ V K F+ + +++IH
Sbjct: 1002 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNK--PLDNSHFVNRVYKWLNFIRTFNYQIH- 1058
Query: 872 LAQFIPGAYNSVADSLSRSKS 892
I G N +AD+LSR +
Sbjct: 1059 ---HIDGLKNIIADALSRCHT 1076
>gi|425856935|gb|AFX98084.1| pol protein [Simian foamy virus]
Length = 1143
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/244 (26%), Positives = 115/244 (47%), Gaps = 15/244 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
++ + I ++L+ GVL +ST + ++ VPK +G R VL+ + +N+ + +
Sbjct: 177 SIQIVIDDLLKQGVLTPQNSTMN--TPVYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQH 234
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + + + Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 235 SAGILATIVRQKYKTTLDLANGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPAL 294
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + V LL+ V VY+DD L + DP+ Q + IL G++V+L+KS
Sbjct: 295 FTAD---VVDLLKEIP-NVQVYVDDIYLSHDDPQEHIQQLEKVFQILLQAGYVVSLKKSE 350
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
+ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 351 IGQKTV-EFLGF----NITK----EGRGLTDTFKTKLLNVTPPKDLKQLQSILGLLNFAR 401
Query: 715 FVIP 718
IP
Sbjct: 402 NFIP 405
>gi|255726280|ref|XP_002548066.1| hypothetical protein CTRG_02363 [Candida tropicalis MYA-3404]
gi|240133990|gb|EER33545.1| hypothetical protein CTRG_02363 [Candida tropicalis MYA-3404]
Length = 1299
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 99/442 (22%), Positives = 174/442 (39%), Gaps = 69/442 (15%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
+ + I E+LE G + + + + + + V K +G R ++ + LN ++F +
Sbjct: 320 LEVQISELLEKGFI--VPQASPYGAPVIFVKKKDGSKRLCVDYRALNDITVKERFPIPLI 377
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ L ++DL Y V I Q A A +PFGL AP F
Sbjct: 378 DELFDSLSGATIFSTLDLHSGYHQVAIAKEDQEKTAFVTRFGQYAWKVMPFGLCNAPATF 437
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
L N + + + + +YLDD ++ ++D E + ++ L G I +K
Sbjct: 438 QRLMNDI--FMDTFDKYLNIYLDDLIIYSRDRESHEKHVREVLTRLRKNGLIAKKKKCHF 495
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGYLS 711
+ V I+ D ++ P+++++ A K W ++ A+S LG +
Sbjct: 496 FQSSVKYLGHIITDKGIE----PDEEKIA---------AIKNWPPIKSVKQAQSFLGLVG 542
Query: 712 FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW---------LNALPLS 762
+ IP Q S L PI+ + K W L
Sbjct: 543 YYRKFIP-----------QMSAL------CGPIHDFIAGKSNWGTDQQEGFDKLRESLTK 585
Query: 763 SP--IFPRQVQHFIS-TDASDLGWGS---QVDS--------SFLSGLWSREQQNWHINKK 808
SP I P Q F+ TDAS+ G+ QVDS ++ S + + + + +K
Sbjct: 586 SPILILPSQEDTFVVFTDASNSCSGAVLHQVDSAGKFKGVVAYDSYKFGVHELKYTVREK 645
Query: 809 EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWR 868
E AV +AL L ++ +D++++ S + ++ FL D
Sbjct: 646 ECLAVVKALRKWKHYLAGRRFILYTDHESLRSLHYNKDAFGRINRWIG----FLAEYDME 701
Query: 869 IHILAQFIPGAYNSVADSLSRS 890
I + I G+ NSVAD++SR+
Sbjct: 702 I----RHIKGSRNSVADAISRA 719
>gi|149241778|ref|XP_001526353.1| hypothetical protein LELG_02911 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450476|gb|EDK44732.1| hypothetical protein LELG_02911 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1527
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 189/441 (42%), Gaps = 52/441 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 660 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 717
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F+S+
Sbjct: 718 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFSSI- 776
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DD +V P++ E+ L L + + L ++ ++
Sbjct: 777 --LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 832
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L L + L+S L+
Sbjct: 833 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPNTVKQLESFLGLV 884
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 885 NY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPL 942
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 943 NFKDLI-TVHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 1001
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
PLL + V+ + DN+ +V + + + ++ V K F+ + +++IH
Sbjct: 1002 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNKP--LDNSHFVNRVYKWLNFIRTFNYQIH- 1058
Query: 872 LAQFIPGAYNSVADSLSRSKS 892
I G N +AD+LSR +
Sbjct: 1059 ---HIDGLKNIIADALSRCHT 1076
>gi|427798207|gb|JAA64555.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 1199
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 114/524 (21%), Positives = 209/524 (39%), Gaps = 57/524 (10%)
Query: 396 VQTLQKPQRCSSPVNPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPFSA 455
V+ + QR P +D+ +G ++ G + IRL A PF+
Sbjct: 354 VKFIDAAQRPHQPARSASDATLGQDIFEGLGTVGDEYTIRLKPNAK----------PFAL 403
Query: 456 KPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPV 515
P + + P+ + + +M + GV++R+ T + + L VPK +GG R
Sbjct: 404 SVP-------RRIPIPLYDKVKQELDQMEQQGVIRRITKPTLWCAGLVAVPKASGGIRIC 456
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
++L LN+ + ++ L + L +D + + V + Q +
Sbjct: 457 VDLTKLNKEVLRERHVLPTVEWVLGQLGDAKVFSKLDATAGFHQVRLSQECQEYTTFITP 516
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGK 635
LPFGL +AP+ F +A +L + VV +DD L+ +D E K
Sbjct: 517 FGRYCYCRLPFGLTSAPEYF---QREMARILEGQP-NVVNMIDDILVFGKD---REEHDK 569
Query: 636 LAVSILGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL 692
V +L L G +N K V +FLG++ D + P +L LR L
Sbjct: 570 RLVEVLERLRRAGVKLNKSKCCFGQDRV-EFLGVVIDAN---GISPSPSKL---EALRNL 622
Query: 693 LASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKL 752
A K ++ R LG + +P L +A L A P+ ++
Sbjct: 623 GAPK--DVAGVRRFLGMANHIGRFLP--NLSQVTAPIRALLGEQNAWEWGPLQETAFNRI 678
Query: 753 EWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----------QVDSSFLSGLWSREQQN 802
+ L + + P + +S DAS G G+ + +F S S ++
Sbjct: 679 KAMLTSDLCVAKYHPGR-NTTVSCDASSFGLGTVLLQEQPSGERRAVAFASRSLSDAEKR 737
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
+ +KE AV A+ ++ V++D+Q +V+ L G ++ +++ +
Sbjct: 738 YSQTEKEALAVAWAVHRFDQYVRGLNFTVETDHQPLVTLL---GNADLDTMPPRIQRFRI 794
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQI 906
++ +++ ++PG + AD+LSR+ PD S SA + +
Sbjct: 795 KLMRYQFNVV--YVPGKQLATADTLSRA---PDEKPSISAVDVV 833
>gi|4165194|emb|CAA08807.1| Pol protein [Drosophila melanogaster]
Length = 1150
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 109/448 (24%), Positives = 191/448 (42%), Gaps = 69/448 (15%)
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK-----GNGGTRPVLNLKGLNQFLS 526
V ++ I+EM+E G++++ S + + S +++VPK G R V++ + LN+
Sbjct: 292 VDQEVNKQIKEMIEQGIVRK--SKSPYCSPIWVVPKKADASGKQKFRLVVDYRNLNEITV 349
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
KF + I L + Y +IDL++ + + + A S T +PF
Sbjct: 350 NDKFPIPRMDEILDKLGRCQYFTTIDLAKGFHQIQMDENSIAKTAFSTKHGHYEYTRMPF 409
Query: 587 GLATAPQAFA-SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
GL AP F ++N + L+ + VYLDD ++V P L IL
Sbjct: 410 GLKNAPATFQRCMNNLLEDLIYKDCL---VYLDD-IIVYSTP--------LEEHILSLKK 457
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA-- 703
L+ ++L LQ LD+ + + LG+I+ T N A
Sbjct: 458 VFEKLRDANLK----LQ---------LDKCEFMKKETEFLGHIVTTNGIKPNPNKTKAIT 504
Query: 704 -----------RSLLGYLSFASFVIPMGRLHSRRIQRQASL-LRLGAPHLTPINPAV--L 749
+S LG F IP + +I + +L L+ GA T +
Sbjct: 505 NFPLPKTPKQIKSFLGLCGFYRKFIP----NFAKIVKPMTLKLKKGAIIDTKCKEYIESF 560
Query: 750 PKLEWWLNALPLSSPIFPRQVQHF-ISTDASDLGWGSQVDSS-----FLSGLWSREQQNW 803
KL+ + + P+ I+P + F ++TDAS++ G+ + + + S + + N+
Sbjct: 561 EKLKVLITSDPIL--IYPDFSKPFSLTTDASNVAIGAVLSQNHKPVCYASRTLNEHEINY 618
Query: 804 HINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLL 863
+KE+ A+ A L V SD++ +V +L K ++ + KI L
Sbjct: 619 ATIEKELLAIVWATKYFRSYLFGRPFEVLSDHKPLV-WL---NNIKEPNMKLQRWKIKLN 674
Query: 864 SQDWRIHILAQFIPGAYNSVADSLSRSK 891
D++I +++PG N VAD+LSR+K
Sbjct: 675 EFDYKI----KYLPGKENHVADALSRTK 698
>gi|154271534|ref|XP_001536620.1| hypothetical protein HCAG_08402 [Ajellomyces capsulatus NAm1]
gi|150409290|gb|EDN04740.1| hypothetical protein HCAG_08402 [Ajellomyces capsulatus NAm1]
Length = 1584
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 89/202 (44%), Gaps = 9/202 (4%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+PP +P+ +L +A+ +++ L+ G ++ S G + + VPK +GG R
Sbjct: 615 ARPPFMPIYNLSETEL---AALREYLKNALDKGWIQPSSSPAG--APILFVPKKDGGLRL 669
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ +GLN+ ++ L + L K +DL AY + I + A
Sbjct: 670 CVDYRGLNRITIKNRYPLPLISELLDRLSKAKVFTKLDLRDAYHRILIAAKDRWKTAFRT 729
Query: 575 NGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+PFGLA AP F A ++N ++ LL + VVYLDD L+ + + ++
Sbjct: 730 RYGHFEYVVMPFGLANAPATFQAYINNALSDLL---DICCVVYLDDILIFSNSKQEHKVH 786
Query: 634 GKLAVSILGSLGWIVNLQKSSL 655
+ L L KS L
Sbjct: 787 VTKVLERLERANLFAKLSKSPL 808
>gi|308465965|ref|XP_003095239.1| hypothetical protein CRE_22625 [Caenorhabditis remanei]
gi|308245633|gb|EFO89585.1| hypothetical protein CRE_22625 [Caenorhabditis remanei]
Length = 2555
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 115/281 (40%), Gaps = 17/281 (6%)
Query: 394 LKVQTLQKPQRCSSPVNPPADSRI----GAELVGGRLRRFVDAWIRLGAPAPLVRIVSGY 449
++++ L KP P + RI G ++ ++ A+ L + +
Sbjct: 1513 VQLKNLSKPGLPDEPAEANWEERILETNGTKVAEEDFKKCRHAFFNEDGDIGLFKGGIEH 1572
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
+I P P +A + +QEM+ +++ +ST+ F+S + LV K +
Sbjct: 1573 SIVIRKDMPF-PKSRTYRVALGTQDEVEKQVQEMILLDIIE--ESTSTFISPIVLVRKKD 1629
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRF 569
G R + + LN + + + I G + ++DL Q +F +P++ +
Sbjct: 1630 GTYRFTTDFRLLNAVTVKQNYQIPLISDIVDLASDGTFFTNLDLIQGFFQIPLRKEDRPL 1689
Query: 570 LALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
A + +P GL AP F + V L + ++ YLDD L+V+
Sbjct: 1690 TAFATPTGTYQYKRMPMGLCGAPHTFQTA---VRQLQKKTKAKLFCYLDDLLIVSN---T 1743
Query: 630 LEIQGKLAVSIL---GSLGWIVNLQKSSLSPAPVLQFLGIM 667
LE K +L +G+ V ++K + P + FLG++
Sbjct: 1744 LEQHMKDIEEVLQNIAEIGFKVKIEKCKFA-QPEVTFLGLL 1783
>gi|149248883|ref|XP_001528810.1| hypothetical protein LELG_05795 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146453358|gb|EDK47614.1| hypothetical protein LELG_05795 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1529
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 189/441 (42%), Gaps = 52/441 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 662 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 719
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F+S+
Sbjct: 720 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFSSI- 778
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DD +V P++ E+ L L + + L ++ ++
Sbjct: 779 --LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 834
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L L + L+S L+
Sbjct: 835 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPNTVKQLESFLGLV 886
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 887 NY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPL 944
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 945 NFKDLI-TVHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 1003
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
PLL + V+ + DN+ +V + + + ++ V K F+ + +++IH
Sbjct: 1004 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNKP--LDNSHFVNRVYKWLNFIRTFNYQIH- 1060
Query: 872 LAQFIPGAYNSVADSLSRSKS 892
I G N +AD+LSR +
Sbjct: 1061 ---HIDGLKNIIADALSRCHT 1078
>gi|149235147|ref|XP_001523452.1| hypothetical protein LELG_05298 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452861|gb|EDK47117.1| hypothetical protein LELG_05298 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1527
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/441 (21%), Positives = 189/441 (42%), Gaps = 52/441 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 660 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 717
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F+S+
Sbjct: 718 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFSSI- 776
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DD +V P++ E+ L L + + L ++ ++
Sbjct: 777 --LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 832
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L L + L+S L+
Sbjct: 833 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPNTVKQLESLLGLV 884
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 885 NY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPL 942
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 943 NFKDLI-TVHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 1001
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFLLSQDWRIHI 871
PLL + V+ + DN+ +V + + + ++ V K F+ + +++IH
Sbjct: 1002 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNK--PLDNSHFVNRVYKWLNFIRTFNYQIH- 1058
Query: 872 LAQFIPGAYNSVADSLSRSKS 892
I G N +AD+LSR +
Sbjct: 1059 ---HIDGLKNIIADALSRCHT 1076
>gi|221132915|ref|XP_002160449.1| PREDICTED: uncharacterized protein LOC100211417 [Hydra
magnipapillata]
Length = 384
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/275 (24%), Positives = 120/275 (43%), Gaps = 23/275 (8%)
Query: 499 LSRLFLVPKGNGGTRPVLNL-----KGLNQFLSPKKFSLINHFRIPSFLQK------GDY 547
++R ++PK +G R + +L + +N +S S +++ + + ++K G
Sbjct: 92 INRFGVIPKSSGKWRLITDLSYPPGRSVNDGISAAD-STVSYTGLTAAIKKILLLGEGCL 150
Query: 548 MISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLR 607
+ D+ +AY + IK +R L L++ G LPFG +APQ F SN + +L
Sbjct: 151 LAKFDIQRAYRLIAIKEDERRLLVLNWKGCYYVDLALPFGARSAPQTFTRFSNVLEWILA 210
Query: 608 SRG-MRVVVY-LDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFL 664
G ++ + + LDDFL+ + ++ A I LG ++ K+ PA + FL
Sbjct: 211 YHGEIKYIQHSLDDFLICGPPNSKVCGESLDKAFEICKELGILIEHTKTE-GPATCITFL 269
Query: 665 GIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHS 724
G + D + +PE K + + +L + SL+G L + I GR
Sbjct: 270 GFIIDTTKLELRIPELKLVKIRALLDEWCIKRHGTKRDLLSLIGVLFYCCQAIIPGRPFL 329
Query: 725 RRIQRQASLLRLGAPHL---TPINPAVLPKLEWWL 756
+R+ +A HL + L L+WW
Sbjct: 330 KRLLLKAH----SVDHLWSQVRLTENELQDLKWWF 360
>gi|154272636|ref|XP_001537170.1| hypothetical protein HCAG_07479 [Ajellomyces capsulatus NAm1]
gi|150415682|gb|EDN11026.1| hypothetical protein HCAG_07479 [Ajellomyces capsulatus NAm1]
Length = 1554
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 89/202 (44%), Gaps = 9/202 (4%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+PP +P+ +L +A+ +++ L+ G ++ S G + + VPK +GG R
Sbjct: 585 ARPPFMPIYNLSETEL---AALREYLKNALDKGWIQPSSSPAG--APILFVPKKDGGLRL 639
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ +GLN+ ++ L + L K +DL AY + I + A
Sbjct: 640 CVDYRGLNRITIKNRYPLPLISELLDRLSKAKVFTKLDLRDAYHRILIAAKDRWKTAFRT 699
Query: 575 NGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+PFGLA AP F A ++N ++ LL + VVYLDD L+ + + ++
Sbjct: 700 RYGHFEYVVMPFGLANAPATFQAYINNALSDLL---DICCVVYLDDILIFSNSKQEHKVH 756
Query: 634 GKLAVSILGSLGWIVNLQKSSL 655
+ L L KS L
Sbjct: 757 VTKVLERLERANLFAKLSKSPL 778
>gi|4826604|gb|AAD30198.1|AF113832_3 polyprotein P194 [Rice tungro bacilliform virus]
Length = 1677
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 169/393 (43%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++K+ D T + F+V + R V N K LN + F++ +
Sbjct: 1204 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1263
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + +QK + DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1264 KISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1323
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + K+ + + +G +++ +KS
Sbjct: 1324 FQRFMQESFGDLKF----TLLYIDDILIASNNEKEHIEHLKIFFNRVKEIGCVLSKKKSK 1379
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + + ++ +K L ++ LG L+
Sbjct: 1380 MFLKEV-EYLGVE---------IKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLN 1429
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PL P
Sbjct: 1430 YARGYIKDLSKLVGPLYKKTG---KNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDY 1486
Query: 770 VQHFISTDASDLGWG----------SQVDSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG S D+ ++G S E++ W E+ A+++A
Sbjct: 1487 I--IIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1544
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1545 LN-KFQIYLDKDFTIRTDCEAIVKGIKTEDYKK 1576
>gi|46194172|tpg|DAA01997.1| TPA_exp: reverse transcriptase/ribonuclease H [Coprinopsis cinerea]
Length = 741
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/380 (21%), Positives = 162/380 (42%), Gaps = 48/380 (12%)
Query: 545 GDYMISIDLSQAYFHVPIKTTHQRFLALSYN-GDVLAMTCLPFGLATAPQAFASLSNWVA 603
G ++D+ + + +P+ H+ +L + + G+ +PFG A+A ++N
Sbjct: 348 GTQACTLDIEKFHRTIPVVPPHKCWLVVQGDPGEFWIEHNVPFGCASASSNSGMVANAGV 407
Query: 604 SLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVS-------------ILGSLGWIVNL 650
++++ G + +D L ++ R+ +G++ S IL LG+ +N
Sbjct: 408 DIIQASGAGPTMKYEDDL---KNLRVPVAEGRIEDSGYTYDVPSGRVADILTYLGFPINR 464
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS---KTWNLDSARSLL 707
+K PV++F+G +WD + LPE K+ +R L + K + +
Sbjct: 465 EKGDGVYRPVVEFIGFLWDIPRKVVSLPERKRSKFLRRVRDFLDAFDGKRCSRRDVERIH 524
Query: 708 GYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNALPLSSPIF 766
G + SFV GR + A+ P P +V+ L+WW L ++
Sbjct: 525 GSMCHVSFVHIDGRSRLPSLSNFAATFDGRDPKTAHYPPTSVVSDLKWWAGCLEMA---- 580
Query: 767 PRQ---------VQHFISTDASDLGWGSQVDSSFLSGLWS--REQQNWHINKKEMFAVHQ 815
PR+ + H I DAS WG + + W+ R +++W + +++ +
Sbjct: 581 PRERSIRNRGPPMDHRIFVDAS-TSWGIGI---VIGERWAALRLREDWKVKGRDICWLET 636
Query: 816 -ALSLNLPLLQS-----SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRI 869
A+ + + LL S +++ SDN+ V + + G +++ + V +++ L +
Sbjct: 637 VAVEILVYLLDSLGYRDQHILIHSDNKGTVGSITK-GRSRNYHINHSVRRLYDLVLAVGL 695
Query: 870 HILAQFIPGAYNSVADSLSR 889
++I N AD LSR
Sbjct: 696 TPTLEYIESEKNP-ADPLSR 714
>gi|1542877|emb|CAA65152.1| orf [Drosophila melanogaster]
Length = 1494
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 100/427 (23%), Positives = 162/427 (37%), Gaps = 93/427 (21%)
Query: 414 DSRIGAELVGGRL-----RRFVDAWIRLGAP-APLVRIVSGYAIPFSAKPPLVPLCSLQH 467
+ +IG E V RL R F + +I P P VR I K P CS +
Sbjct: 669 EYKIG-ENVSNRLQLEFDRLFRNFYINAKRPNEPTVR----SEIQLCLKNPKPFSCSPRR 723
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ + + E LE G ++ DS + S + LV K G R ++ + LN+
Sbjct: 724 LSYTEKDRLQKLLDEYLENGFIRPSDSE--YASPIVLVKKKTGDLRMCVDFRKLNKMTMK 781
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
+ L + + + +DL +FHV +K ++ + +PFG
Sbjct: 782 DNYPLPLIDDLLDRMNEKTVFTKLDLKNGFFHVHVKKESIKYTSFVTPLGQYEWLRMPFG 841
Query: 588 LATAPQAFASLSNWV-ASLLRSRGMRVVVYLDDFLL----VNQDPRILE------IQGKL 636
L AP F N + A ++R +VVVY+DD LL +N+ L+ ++ KL
Sbjct: 842 LKNAPSVFQRFVNKIFADMIREN--KVVVYMDDILLATENINEHLETLKEIFKRLVENKL 899
Query: 637 AVSI---------LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN 687
+ I + LG+I+N GIM P DK +
Sbjct: 900 ELRIDKCEFMQSSIKYLGFIINKD-------------GIM----------PNDKGIE--- 933
Query: 688 ILRTLLASKTW----NLDSARSLLGYLS-FASFVIPMGRLHS--RRIQRQASLLRLGAPH 740
A K + N+ + +S LG S F F+ RL I ++ + G+
Sbjct: 934 ------AIKNFPIPNNVHTVQSFLGLCSYFRRFIKDFSRLAKPLHDILKKDKPFKFGSEE 987
Query: 741 LTPINPAVLPKLEWWLNALPLSSP---IFPRQVQHFISTDASDLGWGSQVDSSFLSGLWS 797
+ N L + SP I+ + + + DAS G+G+ + +
Sbjct: 988 MICFN---------MLKDKLIQSPVLAIYNHKHETELHCDASSSGFGAVL-------MQK 1031
Query: 798 REQQNWH 804
+E Q WH
Sbjct: 1032 KEDQKWH 1038
>gi|18026842|gb|AAL55651.1|AF220561_3 P194 [Rice tungro bacilliform virus]
Length = 1677
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 169/393 (43%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++K+ D T + F+V + R V N K LN + F++ +
Sbjct: 1204 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1263
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + +QK + DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1264 KISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1323
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + K+ + + +G +++ +KS
Sbjct: 1324 FQRFMQESFGDLKF----ALLYIDDILIASNNEKEHIEHLKIFFNRVKEIGCVLSKKKSK 1379
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + + ++ +K L ++ LG L+
Sbjct: 1380 MFLKEV-EYLGVE---------IKEGKINLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLN 1429
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PL P
Sbjct: 1430 YARGYIKDLSKLVGPLYKKTG---KNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDY 1486
Query: 770 VQHFISTDASDLGWG----------SQVDSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG S D+ ++G S E++ W E+ A+++A
Sbjct: 1487 I--IIETDASEEGWGAVLICKPDKYSGKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1544
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1545 LN-KFQIYLDKDFTIRTDCEAIVKGIKTEDYKK 1576
>gi|393235324|gb|EJD42880.1| hypothetical protein AURDEDRAFT_67136, partial [Auricularia
delicata TFB-10046 SS5]
Length = 487
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/395 (22%), Positives = 157/395 (39%), Gaps = 28/395 (7%)
Query: 521 LNQFLSPKKFS-----LINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
LN + PK F+ + + + +G + D+ A+ +PI Q + + +N
Sbjct: 77 LNAGIDPKDFTCDWGTFAQCYMLAAKAPEGAEVAVFDVKSAHRRIPIVPWQQPYCVIYWN 136
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR--GMRVVVYLDDFLLVNQDPRI--LE 631
G C FG +A + +++ + R R + + DDF +
Sbjct: 137 GRTALDYCCQFGQVSASGLWGKVADGFRGIFRFRYPDDDCINWADDFTFWRYPLPCGGFD 196
Query: 632 IQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT 691
I + ++ LGW + +K + A ++LG W+ + + E+K+ +R
Sbjct: 197 IDEQDIYALGDELGWPWS-EKKTTPFASQFKYLGFNWNLDTRMVSVTEEKKDKYLRTMRD 255
Query: 692 LLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
++ A+SLLG L S VIP R + R A P N +V
Sbjct: 256 WTRGSRFSQKEAQSLLGKLVHCSMVIPDARSRLPALSRFAGSFDSAFARHVP-NTSVFND 314
Query: 752 LEWWLNALP-------LSSPIFPRQVQHFISTDASDLGWGSQVDSSF----LSGLWSREQ 800
L+WW + L + S P + ++ S G G +D ++ L W E+
Sbjct: 315 LDWWRDKLSGSFCGMRIKSVPAPSAISLYVDASTS-FGIGIVLDGAWDYWRLKEGWRDER 373
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSV-VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
++ I EM A L + SS+ ++++SDN VV L G +K+ + +++
Sbjct: 374 RD--IGWAEMVAAEFGLRAAVERGASSLHLVIKSDNAGVVGAL-DAGKSKNPAANKVLQR 430
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
I + I + ++P N +AD SR P
Sbjct: 431 IVSTMMEHEIWLSTSWVPSEEN-IADPPSRGLPAP 464
>gi|154285338|ref|XP_001543464.1| hypothetical protein HCAG_00510 [Ajellomyces capsulatus NAm1]
gi|150407105|gb|EDN02646.1| hypothetical protein HCAG_00510 [Ajellomyces capsulatus NAm1]
Length = 1487
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 89/202 (44%), Gaps = 9/202 (4%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+PP +P+ +L +A+ +++ L+ G ++ S G + + VPK +GG R
Sbjct: 518 ARPPFMPIYNLSETEL---AALREYLKNALDKGWIQPSSSPAG--APILFVPKKDGGLRL 572
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ +GLN+ ++ L + L K +DL AY + I + A
Sbjct: 573 CVDYRGLNRITIKNRYPLPLISELLDRLSKAKVFTKLDLRDAYHRILIAAKDRWKTAFRT 632
Query: 575 NGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+PFGLA AP F A ++N ++ LL + VVYLDD L+ + + ++
Sbjct: 633 RYGHFEYVVMPFGLANAPATFQAYINNALSDLL---DICCVVYLDDILIFSNSKQEHKVH 689
Query: 634 GKLAVSILGSLGWIVNLQKSSL 655
+ L L KS L
Sbjct: 690 VTKVLERLERANLFAKLSKSPL 711
>gi|326680285|ref|XP_002666897.2| PREDICTED: hypothetical protein LOC100333989 [Danio rerio]
Length = 1194
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 84/174 (48%), Gaps = 19/174 (10%)
Query: 452 PFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGG 511
P +P VP + L TP+ + I+ M++ GV++ ST+ + S + LVPK +G
Sbjct: 769 PIRQRPYRVP----ESLITPLRA----EIKMMMDMGVIE--SSTSAWSSPIVLVPKKDGT 818
Query: 512 TRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQR 568
R L+ + LN + KF RI +++ Y+ ++DL + Y+ VP++ T +
Sbjct: 819 LRLCLDFRKLN---AVSKFDAYPMPRIDELVERIGRAKYITTLDLCKGYWQVPLEKTSRE 875
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
+ A + T +PFGL AP F L + + L+ YLDD ++
Sbjct: 876 YTAFRTPVGLYQFTTMPFGLHGAPATFQRLMDLI---LQDCEDCSAAYLDDVVI 926
>gi|326678717|ref|XP_003201149.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1290
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 104/450 (23%), Positives = 179/450 (39%), Gaps = 43/450 (9%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P + M ++ + L G+++ S G + F V K + RP ++ +G
Sbjct: 332 PRGRLFSLSAPERATMEKYLSDSLAAGIIRSSSSPAG--AGFFFVKKKDSSLRPCIDYRG 389
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ +DL AY V I+ + A +
Sbjct: 390 LNDITIKNRYPLPLMSTAFEILQGARVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFE 449
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL+ AP F +L N V + ++ V VYLDD L+ + ++ + +
Sbjct: 450 YLVLPFGLSNAPAVFQALVNDVLRDMINKF--VFVYLDDILIFSPSLQVHIQHVRRVLQR 507
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
L V +K L A + FLG + RM PE + A W +
Sbjct: 508 LLENQLFVKAEK-CLFHAQSVPFLGSIISVEGIRM-DPEKVR-----------AVSDWPV 554
Query: 701 DSAR-SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNA 758
+R +L +L FA+F R +S+ +L + I A +L+
Sbjct: 555 PGSRKALQQFLGFANFYRRFIRNYSQVAAPLTALTSTKSHFCWSIAAQAAFRELKSRFTT 614
Query: 759 LPLSSPIFPRQVQHF-ISTDASDLGWGSQVD-----------SSFLSGLWSREQQNWHIN 806
P+ + P + F + DAS++G G+ + ++ S S ++N+ I
Sbjct: 615 APIL--VLPDPARQFVVEVDASEVGVGAVLSQICPKDNKLHPCAYYSHRLSPAERNYDIG 672
Query: 807 KKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLS 864
+E+ AV AL L+ + +V +D++ + Y++ K L+ +F
Sbjct: 673 NRELLAVRLALGEWRHWLEGAAEPFVVWTDHRN-LEYIQ---TAKRLNSRQARWALFF-- 726
Query: 865 QDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
R + + PG+ N D+LSR P
Sbjct: 727 --GRFNFTLSYRPGSKNGKPDALSRCFGTP 754
>gi|308468618|ref|XP_003096551.1| hypothetical protein CRE_09733 [Caenorhabditis remanei]
gi|308243001|gb|EFO86953.1| hypothetical protein CRE_09733 [Caenorhabditis remanei]
Length = 2404
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 99/436 (22%), Positives = 177/436 (40%), Gaps = 53/436 (12%)
Query: 476 MSLHIQEMLETGVLKRL-DSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
+ HI +L + +R+ +S T + S + +V K NG L+ + LN+ P F L
Sbjct: 1422 LEKHINSLLRS---RRITESNTPWTSPIVIVTKKNGSLTVCLDFRKLNEATIPDNFPLP- 1477
Query: 535 HFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATA 591
RI + L+K ++ S+D++ Y + + + V A T LPFGL +A
Sbjct: 1478 --RIDAILEKVGGSNFFSSLDMANGYLQLRLDASSSYKCGFITENKVYAYTHLPFGLKSA 1535
Query: 592 PQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
F + ++L V+VY+DD L+ ++ I + + + +
Sbjct: 1536 ASYF---QRALRTVLNGLEEEVLVYIDDILVFSKTFEQHVISLRKVLQRFRDFNLKASPK 1592
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS 711
K + + FLG + + DK N+ + + N++ R +G
Sbjct: 1593 KCEFA-KKAITFLG----HEIGKDSYSPDK----ANVAKIVEFPVPSNVNEVRRFVGMAG 1643
Query: 712 -FASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPR 768
F F+ + R+ R+ + A KL L + P+ FP
Sbjct: 1644 FFRKFISKFSEIAEPLTRLTRKEQKFTWDSAQ-----QAAFEKLRTALASEPILG--FPD 1696
Query: 769 QVQHF-ISTDASDLGWGSQV-----DS-------SFLSGLWSREQQNWHINKKEMFAVHQ 815
+ F I DAS + G+ + DS ++ S S + W + EM A+
Sbjct: 1697 YDKPFHIFCDASAVAQGAALMQTRPDSEKDFYGIAYASRTLSDPETRWPAIQVEMGAIIF 1756
Query: 816 ALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQF 875
AL P + S +++ SD++ + L++ +LS + + Q + IHI+
Sbjct: 1757 ALRQFKPYICMSKIILHSDHKPLTFLLQKAKAHDNLS------RWLIELQCYDIHIV--H 1808
Query: 876 IPGAYNSVADSLSRSK 891
I G N+VAD LSR++
Sbjct: 1809 IDGKKNTVADCLSRAR 1824
>gi|388854961|emb|CCF51464.1| uncharacterized protein [Ustilago hordei]
Length = 1516
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 109/503 (21%), Positives = 197/503 (39%), Gaps = 68/503 (13%)
Query: 420 ELVGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPFS--AKPPLVPLCSLQHLATPVSSAMS 477
+++ +++D + R+ A + IP PP P+ SL +
Sbjct: 528 DIIPQEYHQYLDVFSRVKADKLSPHRTYDHQIPLEEGKSPPFGPIYSLSEHEL---KTLR 584
Query: 478 LHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFR 537
+++E L G + DS S + V K +G R ++ +GLNQ ++ L
Sbjct: 585 EYLEENLAKGFISPSDSLAA--SPILFVKKKDGSLRLCVDYRGLNQITIRNRYPLPLIDE 642
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+ L K + IDL AY + I + A + +PFGL AP +F
Sbjct: 643 LLDRLCKARFFTRIDLRGAYNLLRIAKGDEWKTAFCTRYGLFQYNVMPFGLTNAPASFQH 702
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI--LGSLGWIVNLQKSSL 655
L N + R + ++YLDD L+ + + + QG ++ + L G +K
Sbjct: 703 LMNDTFKDMLDRSL--IIYLDDLLIYSS--TLEQHQGHVSAVLARLRQAGLYAKAEKCQF 758
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGYLS 711
S + +FLG ++ D+ +++ ++ + W ++ + LG+ +
Sbjct: 759 STSQT-EFLG----------FVVSDQGVSMDPSKTEVITN--WPVPTSVHDVQVFLGFCN 805
Query: 712 FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPI--NPAVLPKLEWWLNALPLSSPIFPRQ 769
F IP +SR LLR A TP N A E ++ + +
Sbjct: 806 FYRKFIPQ---YSRMAYPLTQLLRKEA-QSTPFAWNQAAQHAFEQLRSSFSTDTIL---- 857
Query: 770 VQHF-------ISTDASDLGWGSQVDSSFLSG------LWSRE----QQNWHINKKEMFA 812
HF + TDASD + + SF G +S++ Q N+ I KEMFA
Sbjct: 858 -HHFDPAQPIIVETDASDFAVAAVLSQSFDQGTRHPIAFFSKKLDPAQLNYPIFDKEMFA 916
Query: 813 VHQALSLNLPLLQSSVVMVQ--SDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
+ A L+ + VQ +D++++ + + + + SE+ F
Sbjct: 917 IVAAFKHWRQYLEGAKFPVQVLTDHRSLEYFTTTKQLNRWQARWSELLSDF--------D 968
Query: 871 ILAQFIPGAYNSVADSLSRSKSL 893
+ Q+ PG + D+L+R +
Sbjct: 969 FVIQYRPGVQAGLPDALTRRSDM 991
>gi|308464032|ref|XP_003094286.1| hypothetical protein CRE_11440 [Caenorhabditis remanei]
gi|308248024|gb|EFO91976.1| hypothetical protein CRE_11440 [Caenorhabditis remanei]
Length = 1388
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 102/473 (21%), Positives = 194/473 (41%), Gaps = 86/473 (18%)
Query: 454 SAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTR 513
+A+P + + A P+ SA I+ + +TGV+ +D + + + + V K NG R
Sbjct: 487 NARPIFRKARPVTYSARPMVSA---EIERLNQTGVISPVDHSE-WAAPVVAVKKKNGSIR 542
Query: 514 PVLNL-KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL 572
+ GLN + L I + L G++ IDL++AY V + Q+ L +
Sbjct: 543 LCADFSTGLNDAIESNNHPLPTSDDIFAKLNGGNFFTQIDLAEAYLQVEMDPDSQKLLVI 602
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV-----NQDP 627
+ + + LPFG+ +AP F + + + + L V YLDD ++ +
Sbjct: 603 NTHLGLFTYNRLPFGVKSAPGIFQQIMDTMLNGLEG----VSTYLDDIIICGSTIEEHNE 658
Query: 628 RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPE-DKQLTLG 686
R+ ++ G++ G+ + ++K S + +FLG + + R P+ +K L +
Sbjct: 659 RVFKVFGRIQ-----EYGFRIKMEKCSFLMEEI-KFLGFIINKQGRR---PDPEKVLHIK 709
Query: 687 NILRTLLASKTWNLDSARSLLGYLSF-ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN 745
N+ + N+ +S LG + F FV +Q LR +LT
Sbjct: 710 NM------PEPTNVSQVKSFLGLIQFYGQFV------------KQLFRLRQPLDNLT--- 748
Query: 746 PAVLPKLEWWLNALP--------LSSPIFPRQVQHF-------ISTDASDLGWGSQVDSS 790
A +W L L S + + H+ ++ DAS G G+ +
Sbjct: 749 -AKDTDFKWTLECQKSFDTIKEILQSDLL---LTHYNPNLPIIVAADASQYGIGATISHR 804
Query: 791 F----------LSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVS 840
F +S S+ Q+N+ +KE F + A++ + +++D++ +++
Sbjct: 805 FPDGTEKTIYHISKTLSKTQRNYSQIEKEGFGLITAVTKFHKFIHGRKFTLRTDHKPLLT 864
Query: 841 YLRRQGGTKSLSLLSEVE----KIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
GG K + + + LL+ D+ I ++I D+LSR
Sbjct: 865 IF---GGKKGVPVYTANRLQRWATILLNYDFDI----EYINTKDFGQVDALSR 910
>gi|427780599|gb|JAA55751.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 1243
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 116/525 (22%), Positives = 211/525 (40%), Gaps = 59/525 (11%)
Query: 396 VQTLQKPQRCSSPVNPPADSRIGAELVGGRLRRFVDAW-IRLGAPAPLVRIVSGYAIPFS 454
V+ + QR P +D+ +G ++ G L D + IRL A PF+
Sbjct: 324 VKFIDAAQRPHQPARSASDATLGQDIFEG-LGTVGDEYTIRLKPNAK----------PFA 372
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
P + + P+ + + +M + GV++R+ T + + L VPK +GG R
Sbjct: 373 LSVP-------RRIPIPLYDKVKQELDQMEQQGVIRRITKPTLWCAGLVAVPKASGGIRI 425
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++L LN+ + ++ L + L +D + + V + Q +
Sbjct: 426 CVDLTKLNKEVLRERHVLPTVEWVLGQLGDAKVFSKLDATAGFHQVRLSQECQEYTTFIT 485
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
LPFGL +AP+ F +A +L + VV +DD L+ +D E
Sbjct: 486 PFGRYCYCRLPFGLTSAPEYF---QREMARILEGQP-NVVNMIDDILVFGKD---REEHD 538
Query: 635 KLAVSILGSL---GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT 691
K V +L L G +N K V +FLG++ D + P +L LR
Sbjct: 539 KRLVEVLERLRRAGVKLNKSKCCFGQDRV-EFLGVVIDAN---GISPSPSKL---EALRN 591
Query: 692 LLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
L A K ++ R LG + +P L +A L A P+ +
Sbjct: 592 LGAPK--DVAGVRRFLGMANHIGRFLP--NLSQVTAPIRALLGEQNAWEWGPLQETAFNR 647
Query: 752 LEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----------QVDSSFLSGLWSREQQ 801
++ L + + P + +S DAS G G+ + +F S S ++
Sbjct: 648 IKAMLTSDLCVAKYHPGR-NTTVSCDASSFGLGTVLLQEQPSGERRAVAFASRSLSDAEK 706
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
+ +KE AV A+ ++ V++D+Q +V+ L G ++ +++
Sbjct: 707 RYSQTEKEALAVAWAVHRFDQYVRGLNFTVETDHQPLVTLL---GNADLDTMPPRIQRFR 763
Query: 862 LLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQI 906
+ ++ +++ ++PG + AD+LSR+ PD S SA + +
Sbjct: 764 IKLMRYQFNVV--YVPGKQLATADTLSRA---PDEKPSISAVDVV 803
>gi|4826599|gb|AAD30194.1|AF113831_3 polyprotein P194 [Rice tungro bacilliform virus]
Length = 1675
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 169/393 (43%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++K+ D T + F+V + R V N K LN + F++ +
Sbjct: 1202 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1261
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + +QK + DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1262 KISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1321
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + K+ + + +G +++ +KS
Sbjct: 1322 FQRFMQESFGDLKF----ALLYIDDILIASNNEKEHIEHLKIFFNRVKEIGCVLSKKKSK 1377
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + + ++ +K L ++ LG L+
Sbjct: 1378 MFLKEV-EYLGVE---------IKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLN 1427
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PL P
Sbjct: 1428 YARGYIKDLSKLVGPLYKKTG---KNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDY 1484
Query: 770 VQHFISTDASDLGWG----------SQVDSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG S D+ ++G S E++ W E+ A+++A
Sbjct: 1485 I--IIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1542
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1543 LN-KFQIYLDKDFTIRTDCEAIVKGIKTEDYKK 1574
>gi|308481861|ref|XP_003103135.1| hypothetical protein CRE_25636 [Caenorhabditis remanei]
gi|308260511|gb|EFP04464.1| hypothetical protein CRE_25636 [Caenorhabditis remanei]
Length = 1775
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 101/456 (22%), Positives = 187/456 (41%), Gaps = 62/456 (13%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P C ++ + ++ M + G+++ +ST+ + S L +PK NG R V++ +
Sbjct: 836 IPQCRPYRVSPQQREKLEKELKFMKDNGLIE--ESTSPYTSPLLSIPKANGEIRIVIDYR 893
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN + + + N + +G D++Q + +P+ H+ A + V
Sbjct: 894 RLNLITRSRTYIMPNTIDVTEEASRGKLFSVFDIAQGFHTIPMHEAHKERTAFCCHMGVF 953
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP----RILEIQGK 635
+P GL AP F +A + + +++Y+DD ++V++D R LE +
Sbjct: 954 QYRYMPMGLKGAPDTF---QRAMAEVEKQFTGTMILYVDDLIVVSRDEEEHLRNLEEFFQ 1010
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
L + ++G + +KS + + FLG + + + P ++ +R
Sbjct: 1011 LMI----NMGLKLKAEKSQIGRTKI-SFLGFVIE---NNTIQPSGEKT---EAIRKFPTP 1059
Query: 696 KTWNLDSARSLL---GYL-----SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
T L +S L GY +A V P+ L + ++ G
Sbjct: 1060 TT--LSEVKSFLGMSGYFRRFIKDYAIIVKPLTTLTQKDVE-----FNWGEEQ-----EK 1107
Query: 748 VLPKLEWWLNALPLSSPIF--PRQVQHF-ISTDASDLGWGS-----QVDSSFLSGLWSR- 798
+++ L +S PI PR F + TDAS +G + Q D + SR
Sbjct: 1108 AFEEVKQRL----ISPPILTTPRMDGDFEMHTDASKIGIAAVLLQKQDDKLKVIAYASRP 1163
Query: 799 ---EQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
+Q + + E A+ L+ P + V V +D+Q + S L R+ S LL
Sbjct: 1164 TTPVEQRYAAIESEALAITWGLTHYRPYIFGKKVKVVTDHQPLKSLLHRKEKEMSGRLLR 1223
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
I Q + + I+ + PG N +AD+LSR +
Sbjct: 1224 HQAII----QMYDVEIV--YRPGKENPLADALSRQR 1253
>gi|427798385|gb|JAA64644.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 1319
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 103/457 (22%), Positives = 185/457 (40%), Gaps = 45/457 (9%)
Query: 449 YAIPFSAKPPL-VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPK 507
+ IP PP+ VPL + A ++ + EML G+++ S++ + + + LV K
Sbjct: 463 HRIPTGDHPPICVPL---RRYAEREREVIAEQVDEMLAAGIIR--PSSSPWAAPVVLVRK 517
Query: 508 GNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQ 567
NG R ++ + LN+ +P + L ++ + S+DL Y+ + + +
Sbjct: 518 KNGSLRFCVDYRQLNKCTTPDSYPLPRIDDAVDTVRHCKFFSSLDLRAGYWQINVAEEDK 577
Query: 568 RFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV--NQ 625
A + +PFGL TAP F N V L+++ VVYLDD L+V +
Sbjct: 578 CKTAFRTPSGLYEFNRMPFGLRTAPSTFQRAMNSVLGPLKNQA--CVVYLDDILVVGKTE 635
Query: 626 DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTL 685
D + + L L G+ +N +K + + +LG P + LPE
Sbjct: 636 DEHLRNLDEVL--HRLYEAGFRLNREKCQFGLSKI-SYLGHYISP-VGIQPLPERIAAVS 691
Query: 686 GNILRTLLASKTWNLDSARSLLGYL-SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPI 744
T L + A L ++ FAS P+ L + + + +L+ A
Sbjct: 692 EYPTPTCLKQVQSFMGMASYLRRFIPHFASIAAPLSGLLKKDARFEWGVLQENA------ 745
Query: 745 NPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVD-------SSFLSG 794
++ L P+ S F + TDAS +G G+ Q D ++ S
Sbjct: 746 ----FQTIKQKLTCCPILSH-FNEDWTTEVHTDASQVGLGAVLVQRDPDGLEHVVAYASR 800
Query: 795 LWSREQQNWHINKKEMFAVHQALSLNL-PLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSL 853
S ++++H N+ E AV ++ L V +DN + +Q L
Sbjct: 801 KLSDTERHYHSNELECLAVVWSVDDKFRHYLFGRKFTVVTDNTAIAWMFSKQ------QL 854
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + + Q++ + + G N++AD+LSR+
Sbjct: 855 KHKFARWIITLQEYDFDVRHR--AGGLNNIADALSRN 889
>gi|425856947|gb|AFX98094.1| pol protein [Simian foamy virus]
Length = 1141
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/250 (26%), Positives = 120/250 (48%), Gaps = 16/250 (6%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
S++ + I ++L+ GVL + +ST + ++ VPK +G R VL+ + +N+ +
Sbjct: 176 SSIQVVIDDLLKQGVLVQQNSTMN--TPVYPVPKPDGRWRMVLDYREVNKTIPLIAAQNQ 233
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ I + + + Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 234 HSAGILATIVRKKYKTTLDLANGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPA 293
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F + V LL+ V Y+DD L + DP+ Q + IL G++V+L+KS
Sbjct: 294 LFTAD---VVDLLKEIS-NVQAYVDDIYLSHDDPQEHLNQLEKVFQILLQAGYVVSLKKS 349
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
++ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 350 EIAQKTV-EFLGF----NITK----EGRGLTEAFKAKLLDITPPKDLKQLQSILGLLNFA 400
Query: 714 -SFVIPMGRL 722
+F++ L
Sbjct: 401 RNFILNFAEL 410
>gi|333967|gb|AAB03094.1| ORF3 [Rice tungro bacilliform virus]
Length = 1675
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 169/393 (43%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++K+ D T + F+V + R V N K LN + F++ +
Sbjct: 1202 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1261
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + +QK + DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1262 KISMINLIQKANIFSKFDLKAGFHHMKLKDEFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1321
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + K+ + + +G +++ +KS
Sbjct: 1322 FQRFMQESFGDLKF----ALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGCVLSKKKSK 1377
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + + ++ +K L ++ LG L+
Sbjct: 1378 MFLKEV-EYLGVE---------IKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLN 1427
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PL P
Sbjct: 1428 YARGYIKDLSKLVGPLYKKTG---KNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDY 1484
Query: 770 VQHFISTDASDLGWG----------SQVDSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG S D+ ++G S E++ W E+ A+++A
Sbjct: 1485 I--IIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1542
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1543 LN-KFQIYLDKDFTIRTDCEAIVKGIKTEDYKK 1574
>gi|62733754|gb|AAX95863.1| retrotransposon protein, putative, unclassified [Oryza sativa
Japonica Group]
Length = 1126
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 99/448 (22%), Positives = 180/448 (40%), Gaps = 41/448 (9%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
A P +AKP A + ++ MLE G++K DS++ F S + LV K +
Sbjct: 642 AKPVNAKP--------YRYAPKQKDEIERQVKVMLEQGIIK--DSSSPFASPVLLVKKKD 691
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRF 569
G R ++ +GLN KF + + L + +DL Y + + +
Sbjct: 692 GSWRFCVDYRGLNDITIKNKFPMPVVDELLDELAGAKWFTKLDLRSGYHKIRLLPQDEHK 751
Query: 570 LALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
A + + +PFGL AP +F L N + +L+ + V+V++DD + ++
Sbjct: 752 TAFRTHQGLYEFRVMPFGLTNAPASFQGLMNKIFALMIRKN--VLVFVDDIPVYSKSLAE 809
Query: 630 LEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL 689
+ IL V K S + L++LG + H +
Sbjct: 810 HVQHLRQVFQILQHHQLFVKASKCSFAKQQ-LEYLGHIIGEH------------GVATDP 856
Query: 690 RTLLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH-LTPINPA 747
+ A + W + + + L G+L + R + + LL+ + T
Sbjct: 857 AKVQAVQEWPVPKNLKQLRGFLGLTGYYRKFIRHYGVITRPLTELLKKDKSYNWTDAQQK 916
Query: 748 VLPKLEWWLNALPLSSPIFPRQVQHFI-STDASDLGWGSQVDS-----SFLSGLWSREQQ 801
+++ + P+S Q F+ TDA D G G+ + ++LS + Q
Sbjct: 917 AFCQVKMAMVQAPVSVLAMLDFSQEFVLETDAYDRGIGAVLTQNGHPIAYLSKALGVKAQ 976
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
+KE A+ A+ LQ S ++ +D+++ +++L Q T S+ + V
Sbjct: 977 ALSTYEKECLAILMAVQKWRAYLQHSEFVILTDHRS-LTHLGEQKLTTSMQHKAFVR--- 1032
Query: 862 LLSQDWRIHILAQFIPGAYNSVADSLSR 889
L+ +RI Q+ G+ N VA +LSR
Sbjct: 1033 LMGLQYRI----QYKQGSENKVAYALSR 1056
>gi|301610718|ref|XP_002934905.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 839
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 113/499 (22%), Positives = 192/499 (38%), Gaps = 88/499 (17%)
Query: 429 FVDAWIRLGAPA-PLVRIVSGYAIPFSAKP-PLVPLCSLQHLATPVSSAMSLHIQEMLET 486
F+D + + GA + P RI Y P P VP + LA P + +I+E L
Sbjct: 21 FLDIFDKKGADSLPPHRI---YDCPIDLLPGSQVPFGRIYPLAEPELKVLREYIEENLAK 77
Query: 487 GVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKF------SLINHFRIPS 540
++ S G + +F V K + RP ++ + LN+ ++ L FR +
Sbjct: 78 KFIRPSTSPAG--AGIFFVEKKDHSLRPCIDYRELNKITIKNRYPLPLIPELFQRFRTAT 135
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
K +DL AY V I+ + A +PFGL AP F N
Sbjct: 136 VFSK------LDLRGAYNLVRIRKGDEWKTAFRTRYGHFEYLVMPFGLCNAPATFQHFLN 189
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV 660
V + V++YLDD L+ + + K S L S V +K + +
Sbjct: 190 DVFRDFLD--IFVIIYLDDILIFSSSLLEHRVHMKKVFSCLRSHQLYVKFEKCEFHKSSI 247
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR-SLLGYLSFASFVIPM 719
+FLG + P +M Q + +L W ++R + ++ FA+F
Sbjct: 248 -EFLGFVISPGGIQM-----DQKKIAALL-------NWPAPTSRKEVQRFIGFANFY--- 291
Query: 720 GRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIS---- 775
R+ + S + L LT A+ K +W A F HF S
Sbjct: 292 -----RKFIKNFSHIILPITKLT----ALTSKFQWTFQA----QEAFDTLKSHFTSAPVL 338
Query: 776 ------------TDASDLGWGSQVDS-----------SFLSGLWSREQQNWHINKKEMFA 812
DAS+ G+ + +F S S+ ++N+ ++ +E+ +
Sbjct: 339 CHPNPSLPFVLEVDASENAVGAILSQRLNPSGSLHPVAFFSRKLSKSERNYDVSDRELLS 398
Query: 813 VHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
+ AL LL+ S +++ +D++ + YLR + L +F + R +
Sbjct: 399 IKLALEEWRYLLEGSPHPILIFTDHRN-LEYLR---TARRLRPRQARWALFFM----RFN 450
Query: 871 ILAQFIPGAYNSVADSLSR 889
+ PG+ N+ AD+LSR
Sbjct: 451 FHLTYRPGSKNTKADALSR 469
>gi|326672972|ref|XP_003199768.1| PREDICTED: hypothetical protein LOC100331420 [Danio rerio]
Length = 1442
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 83/182 (45%), Gaps = 19/182 (10%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
+I + P +P VP + L P+ + + MLE +++ ST+ + S +
Sbjct: 984 KICLTESTPIRQRPYRVP----ESLIKPLKEELKM----MLEMDIIE--PSTSAWSSPIV 1033
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHV 560
+VPK +G R L+ + LN KF RI +++ Y+ ++DL + Y+ V
Sbjct: 1034 IVPKKDGTLRVCLDFRKLNAL---SKFDAYPMPRIDELVERIGRAKYITTLDLCKGYWQV 1090
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
P++ T + + A + +PFGL AP F L N V LR+ YLDD
Sbjct: 1091 PLEKTSREYTAFRTPVGLYHFKTMPFGLHGAPATFQRLMNQV---LRNCEEYSAAYLDDV 1147
Query: 621 LL 622
++
Sbjct: 1148 VI 1149
>gi|410930532|ref|XP_003978652.1| PREDICTED: protein ECT2-like [Takifugu rubripes]
Length = 1492
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 74/165 (44%), Gaps = 4/165 (2%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L +L+ P M +I E L +G+++ S++ + F V K +GG RP ++ +
Sbjct: 1106 PTGRLYNLSIPEKETMRNYITESLASGIIR--PSSSPLAAGFFFVAKKDGGLRPCIDFRK 1163
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN K+ L L +DL AY V I+ + A + +
Sbjct: 1164 LNDITVKNKYPLPLMSSTFEPLTHARVFTKLDLRNAYHLVRIRKGDEWKTAFNTHLGHFE 1223
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
+PFGL+ AP F L N V L + VVVYLDD L+ ++
Sbjct: 1224 YLVMPFGLSNAPAVFQELVNDV--LPDMINVFVVVYLDDILVFSR 1266
>gi|4826594|gb|AAD30190.1|AF113830_3 polyprotein P194 [Rice tungro bacilliform virus]
Length = 1677
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 83/397 (20%), Positives = 167/397 (42%), Gaps = 50/397 (12%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++K+ D T + F+V + R V N K LN + F++ +
Sbjct: 1204 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1263
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + +QK + DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1264 KISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1323
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + K+ + + +G +++ +KS
Sbjct: 1324 FQRFMQESFGDLKF----TLLYIDDILIASNNEKEHIEHLKIFFNRVKEIGCVLSKKKSK 1379
Query: 655 LSPAPVLQFLG-------IMWDPHL-DRMWLPEDKQLTLGNILRTLLASKTWNLDSARSL 706
+ V ++LG I PH+ D++ + +L L+ L L+ AR
Sbjct: 1380 MFLKEV-EYLGVEIKEGKISLQPHIVDKIKKFDKNKLNTFKGLQAYLGL----LNYARGY 1434
Query: 707 LGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPI 765
+ LS V P+ + + QR + ++ K+E ++ + PL P
Sbjct: 1435 IKNLS--KLVGPLYKKTGKNGQRI----------FNKEDWNIIFKIEREVSKIKPLERPK 1482
Query: 766 FPRQVQHFISTDASDLGWG----------SQVDSSFLSGLWS---REQQNWHINKKEMFA 812
+ I TDAS+ GWG S D+ ++G S E++ W E+ A
Sbjct: 1483 ETDYI--IIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSLDYEIEA 1540
Query: 813 VHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+++AL+ + +++D + +V ++ + K
Sbjct: 1541 INEALN-KFQIYLDKDFTIRTDCEAIVKGIKTEDYKK 1576
>gi|9630633|ref|NP_056762.1| hypothetical protein [Rice tungro bacilliform virus]
gi|130653|sp|P27502.1|POL_RTBVP RecName: Full=Polyprotein P3; AltName: Full=P194 protein; Contains:
RecName: Full=Putative movement protein; Short=MP;
Contains: RecName: Full=Capsid protein; AltName:
Full=Coat protein; Short=CP; Contains: RecName:
Full=Protease; Short=PR; Contains: RecName: Full=Reverse
transcriptase/Ribonuclease H; Short=RT; AltName:
Full=p55; Flags: Precursor
gi|61913|emb|CAA40997.1| ORF P194 [Rice tungro bacilliform virus]
Length = 1675
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 169/393 (43%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++K+ D T + F+V + R V N K LN + F++ +
Sbjct: 1202 QIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1261
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + +QK + DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1262 KISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1321
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + K+ + + +G +++ +KS
Sbjct: 1322 FQRFMQESFGDLKF----ALLYIDDILIASNNEKEHIEHLKIFFNRVKEVGCVLSKKKSK 1377
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + + ++ +K L ++ LG L+
Sbjct: 1378 MFLKEV-EYLGVE---------IKEGKISLQPHIVDKIKKFDKNKLNTLKGLQAYLGLLN 1427
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PL P
Sbjct: 1428 YARGYIKDLSKLVGPLYKKTG---KNGQRIFNKEDWNIIFKIEREVSKIKPLERPKETDY 1484
Query: 770 VQHFISTDASDLGWG----------SQVDSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG S D+ ++G S E++ W E+ A+++A
Sbjct: 1485 I--IIETDASEEGWGAVLVCKPDKYSGKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1542
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1543 LN-KFQIYLDKDFTIRTDCEAIVKGIKTEDYKK 1574
>gi|74641|pir||GNLJLK pol polyprotein - simian foamy virus (type 3, strain LK3)
gi|334872|gb|AAA47796.1| pol polyprotein, partial [Simian foamy virus 3]
Length = 1157
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/239 (27%), Positives = 115/239 (48%), Gaps = 15/239 (6%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I ++L+ GVL + +S + ++ VPK +G R VL+ + +N+ + + I
Sbjct: 196 INDLLKQGVLIQQNSIMN--TPVYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQHSAGIL 253
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S + +G Y ++DLS ++ I A ++ G T LP G +P F +
Sbjct: 254 SSIFRGKYKTTLDLSNGFWAHSITPESYWLTAFTWLGQQYCWTRLPQGFLNSPALFTAD- 312
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
V LL+ V VY+DD + + DPR Q + S+L + G++V+L+KS ++
Sbjct: 313 --VVDLLKEVP-NVQVYVDDIYISHDDPREHLEQLEKVFSLLLNAGYVVSLKKSEIAQHE 369
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
V +FLG ++ + E + LT + L + +L +S+LG L+FA IP
Sbjct: 370 V-EFLGF----NITK----EGRGLTETFKQKLLNITPPRDLKQLQSILGLLNFARNFIP 419
>gi|307196129|gb|EFN77817.1| hypothetical protein EAI_17025 [Harpegnathos saltator]
Length = 251
Score = 64.3 bits (155), Expect = 3e-07, Method: Composition-based stats.
Identities = 50/176 (28%), Positives = 86/176 (48%), Gaps = 2/176 (1%)
Query: 751 KLEWWLNALPLSSPIFPRQV-QHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKE 809
+L WW + ++ F V I DAS G +++ + G W+ ++ HIN E
Sbjct: 4 ELIWWRTHIITTNSFFRLAVIDTTIFIDASISGSRVVLENKQIHGFWTEIEKRQHINWLE 63
Query: 810 MFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRI 869
+ A+ AL L+ ++++ DN T V + + G K +I+ +++
Sbjct: 64 LKAIWFALQSFEAELKDRHILLRVDNTTAVVCINKMEGIKYPKFNKLATQIWSWAENNNN 123
Query: 870 HILAQFIPGAYNSVADSLSRSKSL-PDWHLSRSATEQIFLKWGVPCIDLFASRVSA 924
+ A+ IP N VAD LSR K+L +W L+ A +I +G P +DLFA+ ++A
Sbjct: 124 WLHAEHIPSTSNVVADRLSRLKNLDTEWELATYAFNKITTSFGFPELDLFATSLNA 179
>gi|294900961|ref|XP_002777195.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239884666|gb|EER09011.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 463
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 90/199 (45%), Gaps = 11/199 (5%)
Query: 461 PLCS-LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
P+C ++ + +S ++EM E GV++R ST+ + VPK NG R ++ +
Sbjct: 268 PICERIRPIPHKYRDEISALLKEMEELGVIRR--STSAWRFPCVFVPKKNGKVRMCIDYR 325
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD-- 577
LN+ + + + + L + ++DL Y+ +P++ QR A
Sbjct: 326 NLNKACHTEAYPVPRPDDVQEHLAGARVLSTLDLRSGYWQIPVRKEDQRKTAFCPGPGFP 385
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ-DPRILEIQGKL 636
+ +PFGLA+AP F L + + L V VYLDD L+ ++ D LE ++
Sbjct: 386 LYEWVMMPFGLASAPATFQRLMDAILGHLPF----VRVYLDDVLIFSRSDEEHLE-HLRI 440
Query: 637 AVSILGSLGWIVNLQKSSL 655
+L + G V +K
Sbjct: 441 VFELLRAAGMTVAAEKCEF 459
>gi|425856929|gb|AFX98079.1| pol protein [Simian foamy virus]
Length = 1143
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/244 (26%), Positives = 114/244 (46%), Gaps = 15/244 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
++ + I ++L+ GVL +ST + ++ VPK +G R VL+ + +N+ + +
Sbjct: 177 SIQIVIDDLLKQGVLTPQNSTMN--TPVYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQH 234
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + + + Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 235 SAGILATIVRQKYKTTLDLANGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPAL 294
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + V LL+ V VY+DD L + DP+ Q IL G++V+L+KS
Sbjct: 295 FTAD---VVDLLKEIP-NVQVYVDDIYLSHDDPQEHIQQLGKVFQILLQAGYVVSLKKSE 350
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
+ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 351 IGQKTV-EFLGF----NITK----EGRGLTDTFKTKLLNVTPPKDLKQLQSILGLLNFAR 401
Query: 715 FVIP 718
IP
Sbjct: 402 NFIP 405
>gi|308495826|ref|XP_003110101.1| hypothetical protein CRE_06372 [Caenorhabditis remanei]
gi|308244938|gb|EFO88890.1| hypothetical protein CRE_06372 [Caenorhabditis remanei]
Length = 2108
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 101/456 (22%), Positives = 187/456 (41%), Gaps = 62/456 (13%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P C ++ + ++ M + G+++ +ST+ + S L +PK NG R V++ +
Sbjct: 1169 IPQCRPYRVSPQQREKLEKELKFMKDNGLIE--ESTSPYTSPLLSIPKANGEIRIVIDYR 1226
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN + + + N + +G D++Q + +P+ H+ A + V
Sbjct: 1227 RLNLITRSRTYIMPNTIDVTEEASRGKLFSVFDIAQGFHTIPMHEAHKERTAFCCHMGVF 1286
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP----RILEIQGK 635
+P GL AP F +A + + +++Y+DD ++V++D R LE +
Sbjct: 1287 QYRYMPMGLKGAPDTF---QRAMAEVEKQFTGTMILYVDDLIVVSRDEEEHLRNLEEFFQ 1343
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
L + ++G + +KS + + FLG + + + P ++ +R
Sbjct: 1344 LMI----NMGLKLKAEKSQIGRTKI-SFLGFVIE---NNTIQPSGEKT---EAIRKFPTP 1392
Query: 696 KTWNLDSARSLL---GYL-----SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
T L +S L GY +A V P+ L + ++ G
Sbjct: 1393 TT--LSEVKSFLGMSGYFRRFIKDYAIIVKPLTTLTQKDVE-----FNWGEEQ-----EK 1440
Query: 748 VLPKLEWWLNALPLSSPIF--PRQVQHF-ISTDASDLGWGS-----QVDSSFLSGLWSR- 798
+++ L +S PI PR F + TDAS +G + Q D + SR
Sbjct: 1441 AFEEVKQRL----ISPPILTTPRMDGDFEMHTDASKIGIAAVLLQKQDDELKVIAYASRP 1496
Query: 799 ---EQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
+Q + + E A+ L+ P + V V +D+Q + S L R+ S LL
Sbjct: 1497 TTPVEQRYAAIESEALAITWGLTHYRPYIFGKKVKVVTDHQPLKSLLHRKEKEMSGRLLR 1556
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
I Q + + I+ + PG N +AD+LSR +
Sbjct: 1557 HQAII----QMYDVEIV--YRPGKENPLADALSRQR 1586
>gi|403159489|ref|XP_003890634.1| hypothetical protein PGTG_20668 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168116|gb|EHS63573.1| hypothetical protein PGTG_20668 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 773
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 119/508 (23%), Positives = 215/508 (42%), Gaps = 65/508 (12%)
Query: 437 GAPAPLVRIVSGYAIPFSAKPPLVPLCSL------QHL-ATPVSSAMSLHIQEMLETG-- 487
G A ++ G+ F P+ + L HL AT + + + LE G
Sbjct: 255 GILANYQDVLVGFKHGFHQGIPVHKIRGLSWFTPDNHLSATLAEGKIKESMSKELEAGRM 314
Query: 488 ----VLKRLDSTTGFL--SRLFLVPKGNGGTRPVLNLK---------GLNQFLSPKKFSL 532
+ ++ S F S L V G+G RP+ +L +N F+ K+F
Sbjct: 315 FGPFTMDQVKSKFKFFRTSPLGAVVNGDGSVRPINDLSFPHGNVSIPSVNSFVDKKEFET 374
Query: 533 I--NHFRIPSFLQKGD---YMISIDLSQAYFHVPIKTTHQRFLAL-SYNGDVLAMTCLPF 586
++ + SF ++ + + D +AY +P + +L + +G + T + F
Sbjct: 375 TWDDYNVVSSFFRESEGPLLLALFDWEKAYRQIPTHPSQWPYLMVKGLDGLLYLDTRITF 434
Query: 587 GLATAPQAFASLSN-WVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL 644
G +F ++ W + + +V ++DD L V + E++ + S+ L
Sbjct: 435 GGVAGCGSFGRPADAWKDIMFAEFDLVKVFWWVDDNLFVKRPNSKTEMKDIVKRSV--KL 492
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK-QLTLGNILRTLLASKTWNLDSA 703
G + NL K S+ + +F+G +W+ + LPE+K + IL L S+ ++ +
Sbjct: 493 GVLTNLNKCSVF-SEEQKFIGFLWNGVAKTVRLPEEKLEQRKRQILEFLDTSRKFSFNEV 551
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQAS-LLRLGAPHLTPINPAVLPKLEWWLNALPLS 762
L G L+ +++P R + R + R S L A +P + VL L+ W++ L+
Sbjct: 552 EVLTGRLNHVLYMLPQLRCYLRSLYRWMSDWFDLNAKRYSPND--VLLDLDRWIHT--LN 607
Query: 763 SPIFPRQVQHFISTDASDLGWGSQVDSSFLSGL-----WSREQ---------QNWHINKK 808
+ R V S D D+GW +SF G+ W++ + Q +I
Sbjct: 608 DFVHSRLVS---SPDPVDIGWVGDASTSFGVGVLIGKYWAQLRILKDRVKGIQKRNIAWL 664
Query: 809 EMFAVHQALSLNLP---LLQSSVVMVQSDNQTVVSY-LRRQGGTKSLSLLSEVEKIFLLS 864
E AV L + L + S ++V +DN T S LRR+ ++ +V + FL+
Sbjct: 665 ETVAVRVGLIMLDTLGRLRRGSNLLVWTDNTTTESVILRRKSRDTEVNEEWKVIQDFLIK 724
Query: 865 QDWRIHILAQFIPGAYNSVADSLSRSKS 892
Q+ + + A+ + N +AD LSR S
Sbjct: 725 QE--VDLTARRVKSKDN-IADELSRGLS 749
>gi|189677275|ref|YP_001956722.2| Pol precursor [African green monkey simian foamy virus]
gi|110282986|sp|P27401.2|POL_SFV3L RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;
Contains: RecName: Full=Protease/Reverse
transcriptase/ribonuclease H; AltName:
Full=p87Pro-RT-RNaseH; Contains: RecName:
Full=Protease/Reverse transcriptase; AltName:
Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H;
Short=RNase H; Contains: RecName: Full=Integrase;
Short=IN; AltName: Full=p42In
Length = 1143
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 66/239 (27%), Positives = 115/239 (48%), Gaps = 15/239 (6%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I ++L+ GVL + +S + ++ VPK +G R VL+ + +N+ + + I
Sbjct: 182 INDLLKQGVLIQQNSIMN--TPVYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQHSAGIL 239
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S + +G Y ++DLS ++ I A ++ G T LP G +P F +
Sbjct: 240 SSIFRGKYKTTLDLSNGFWAHSITPESYWLTAFTWLGQQYCWTRLPQGFLNSPALFTAD- 298
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
V LL+ V VY+DD + + DPR Q + S+L + G++V+L+KS ++
Sbjct: 299 --VVDLLKEVP-NVQVYVDDIYISHDDPREHLEQLEKVFSLLLNAGYVVSLKKSEIAQHE 355
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
V +FLG ++ + E + LT + L + +L +S+LG L+FA IP
Sbjct: 356 V-EFLGF----NITK----EGRGLTETFKQKLLNITPPRDLKQLQSILGLLNFARNFIP 405
>gi|6650019|gb|AAF21678.1|AF051915_2 pol polyprotein [Passalora fulva]
Length = 1243
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 99/439 (22%), Positives = 167/439 (38%), Gaps = 71/439 (16%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
++E L G ++R S+ G + VPK NG R V + + LN+ ++ L N
Sbjct: 341 LKEKLAKGWIRRSTSSAG--TPCMFVPKANGKLRLVQDYRKLNEITIKNRYPLPNIEEAQ 398
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L D+ IDL A++ + + + A + +P GL AP AS
Sbjct: 399 DRLTGSDWYTKIDLRDAFYAIRMAEGEEWKTAFRTRYGLYEFLVMPMGLTNAP---ASCQ 455
Query: 600 NWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
+ V LR + VV Y+DD L+ +G L L KS A
Sbjct: 456 DLVNETLRDLLDVCVVAYMDDILVYT--------KGSLQEHTKQVQDVFERLTKSGFKTA 507
Query: 659 P---------------VLQFLGIMWDPHLD---RMWLPEDKQLTLGNILRTLLASKTWNL 700
P ++ GI DP R W PE K + +++ L +N
Sbjct: 508 PEKCEFHKKEVKFLGFIISTTGITIDPAKTQSIREW-PEPKTV---KDVQSFLGLANYN- 562
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP 760
R + ++ PM L + + + + A A P L
Sbjct: 563 ---RKFIK--DYSKTAAPMTMLTRKDVNWKWGKEQTEAFKRLKEQCASAPTLR------- 610
Query: 761 LSSPIFPRQVQHFISTDASDLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMF 811
+F + I TDASD+ G+ + + + S + +QN+ I+ KE+
Sbjct: 611 ----LFDGSKEVHIETDASDMAIGACLTQTHDGKRHPVAYYSRKMTTAEQNYDIHDKELL 666
Query: 812 AVHQALSLNLPLLQS-SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
A+ A+ ++ + + SD++ + + + T+ + SE LL Q ++
Sbjct: 667 AIVAAMQHWRVYVEGPPKLTILSDHKNLTYFTTTKELTRRQARWSE-----LLGQ-YKFE 720
Query: 871 ILAQFIPGAYNSVADSLSR 889
I ++ PG N AD+LSR
Sbjct: 721 I--KYTPGTENGPADALSR 737
>gi|15150419|gb|AAK84933.1| SD02026p [Drosophila melanogaster]
Length = 1026
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 78/369 (21%), Positives = 151/369 (40%), Gaps = 40/369 (10%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ + A+ ++E +E ++++ ST+ SR+ +V K +G R ++ + LN +
Sbjct: 190 LSEEEAIAVKKQVEEWVEQSIVRK--STSNVASRIVVVKKKDGTLRVCVDYRKLNTMV-- 245
Query: 528 KKFSLINHFRIPSF------LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
L++ F +P LQ + ++DL +FHV ++ + + A +
Sbjct: 246 ----LMDCFPVPIMEEVLEKLQSAKWFTTMDLQNGFFHVAVEEASKPYTAFVTREGLFEF 301
Query: 582 TCLPFGLATAPQAFASLSNWV-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
PFG +P AF ++ L+ S M+ +Y+DD ++ P + ++ +
Sbjct: 302 NKAPFGFKNSPAAFIRFVQFIFQELINSNIMQ--LYMDDIIVYAATPEECMEKTEMVLKR 359
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS--KTW 698
G + +K + + FLG + E Q+ G + + S
Sbjct: 360 AAEFGLKIKWKKCNFMQRRI-HFLG----------HIIEGGQICPGKEKTSAVNSFGTPQ 408
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLN 757
N+ + + LG F IP +R + LL+ A ++ P+ + KL+ L
Sbjct: 409 NVKAVQGFLGLTGFFRKFIPGYAQIARPL---TDLLKKDAIFNIGPVEQQSVNKLKEILV 465
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSG-----LWSREQQNWHINKKEMFA 812
P+ I+ R+ + + TDAS G G+ + F WSR+ N+ +
Sbjct: 466 NEPVLR-IYSREAETELHTDASKDGLGAVLLQKFEGSFHPVCFWSRKTTKAESNRHSYYL 524
Query: 813 VHQALSLNL 821
+A L L
Sbjct: 525 EVKAAYLAL 533
>gi|125803524|ref|XP_001343088.1| PREDICTED: hypothetical protein LOC100003575 [Danio rerio]
Length = 1496
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 84/182 (46%), Gaps = 19/182 (10%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
+I + P +P VP + L P+ + + MLE +++ ST+ + S +
Sbjct: 1038 KICLTESTPIRQRPYRVP----ESLIKPLKEELKM----MLEMDIIE--PSTSAWSSPIV 1087
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHV 560
+VPK +G R L+ + LN + KF RI +++ Y+ ++DL + Y+ V
Sbjct: 1088 IVPKKDGTLRVCLDFRKLN---AVSKFDAYPMPRIDELVERIGRAKYITTLDLCKGYWQV 1144
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
P++ T + + A + +PFGL AP F L N V LR+ YLDD
Sbjct: 1145 PLEKTSREYTAFRTPVGLYHFKTMPFGLHGAPATFQRLMNQV---LRNCEEYSAAYLDDV 1201
Query: 621 LL 622
++
Sbjct: 1202 VI 1203
>gi|308452573|ref|XP_003089094.1| hypothetical protein CRE_06222 [Caenorhabditis remanei]
gi|308243277|gb|EFO87229.1| hypothetical protein CRE_06222 [Caenorhabditis remanei]
Length = 1388
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 99/460 (21%), Positives = 186/460 (40%), Gaps = 101/460 (21%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSLIN 534
+S I+ + +TGV+ +D + + + + V K NG R + GLN + L
Sbjct: 506 VSTEIERLNQTGVISPVDHSE-WAAPVVAVKKKNGSIRLCADFSTGLNDAIESNNHPLPT 564
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + L G++ IDL++AY V + Q+ L ++ + + LPFG+ +AP
Sbjct: 565 ADDIFAKLNGGNFFTQIDLAEAYLQVEMDPDSQKLLVINTHLGLFTYNRLPFGVKSAPGI 624
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLV-----NQDPRILEIQGKLAVSILGSLGWIVN 649
F + + + + L V YLDD ++ + R+L++ G++ G+ +
Sbjct: 625 FQQIMDTMLNGLEG----VSTYLDDIIICGSTIEEHNERVLKVFGRIQ-----EYGFRIK 675
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPE-DKQLTLGNILRTLLASKTWNLDSARSLLG 708
++K S + +FLG + + R P+ +K + N+ + N+ +S LG
Sbjct: 676 MEKCSFLMEEI-KFLGFIINKQGRR---PDPEKVRHIKNM------PEPTNVSQVKSFLG 725
Query: 709 YLSF-ASFVIPMGRLH---------------SRRIQR---------QASLLRLGAPHLTP 743
+ F FV + RL +R Q+ Q+ LL LT
Sbjct: 726 LIQFYGQFVKQLFRLRQPLDNLTAKDTDFKWNRECQKSFDTIKEILQSDLL------LTH 779
Query: 744 INPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSF----------LS 793
NP LP+ ++ DAS G G+ + F +S
Sbjct: 780 YNP-----------NLPI-----------IVAADASQYGIGATISHRFPDGTEKTIYHIS 817
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSL 853
S+ Q+N+ +KE F + A++ + +++D++ +++ GG K + +
Sbjct: 818 KTLSKTQRNYSQIEKEGFGLITAVTKFHKFIHGRKFTLRTDHKPLLTIF---GGKKGVPV 874
Query: 854 LSEVE----KIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ LL+ D+ I ++I D+LSR
Sbjct: 875 YTANRLQRWATILLNYDFDI----EYINTKDFGQVDALSR 910
>gi|182239993|gb|ACB87156.1| polyprotein [Citrus yellow mosaic virus]
Length = 1979
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 110/464 (23%), Positives = 187/464 (40%), Gaps = 70/464 (15%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
L+H+ + + H++ +L+ G ++ S + + +V G G R
Sbjct: 1395 LKHVTPQMEESFRKHVEALLKIGAIR--PSKSRHRTTAIIVNSGTSINPLTGKEVKGKER 1452
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYFHVPIKTTHQRFL 570
V K LN + ++SL I + LQ KG + S DL + V + +
Sbjct: 1453 MVFIYKRLNDLTNKDQYSLPG---IQTILQRLKGSTIFSKFDLKSGFHQVAMHPDSIEWT 1509
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
A + +PFGL AP F + + + VY+DD L+ ++ R
Sbjct: 1510 AFWVPSGLYEWLVMPFGLKNAPAVFQRKMD---HCFKGTEAFIAVYIDDILVFSKTEREH 1566
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG-------IMWDPHLDRMWLP-EDKQ 682
E ++ +SI G I++ K ++ A + +F G I PH+ + L +KQ
Sbjct: 1567 EEHLQIMLSICQKNGLILSPTKMKIAQAEI-EFPGAIIHKGLIKLQPHIVQKLLTFTNKQ 1625
Query: 683 LTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHL 741
L LR+ L LG L++A IP MGRL S A + G +
Sbjct: 1626 LEEVKGLRSWLG------------LGLLNYARNYIPHMGRLLSPLY---AKVSPTGERRM 1670
Query: 742 TPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG-------SQVDS---- 789
+ A++ K+ + LP L P P I TD GWG +Q DS
Sbjct: 1671 NRQDWALIDKIRAQVQNLPALELP--PADCFIIIETDGCMDGWGGVCKWKLAQYDSRSSE 1728
Query: 790 ---SFLSGLWSREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQ 845
++ SG ++ + + E+ AV +L + + L S + +++D Q ++S+ +
Sbjct: 1729 KVCAYASGKFNPPKSTIDV---EIHAVMNSLNNFKIYYLDKSSLCLRTDCQAIISFFNKS 1785
Query: 846 GGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
K + FL ++I + I G N +AD+LSR
Sbjct: 1786 NVNKPSRVRWIAFTDFLTGLGIPVNI--EHIDGKNNHLADALSR 1827
>gi|308480035|ref|XP_003102225.1| hypothetical protein CRE_05815 [Caenorhabditis remanei]
gi|308262151|gb|EFP06104.1| hypothetical protein CRE_05815 [Caenorhabditis remanei]
Length = 2406
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 90/203 (44%), Gaps = 12/203 (5%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
+A + +QEM+ +++ +ST+ F+S + LV K +G R + + LN
Sbjct: 1425 VALGTQDEVEKQVQEMILLDIIE--ESTSTFISPIVLVRKKDGTYRFTTDFRLLNAVTVK 1482
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
+ + + I G + ++DL Q +F +P++ + A + +P G
Sbjct: 1483 QNYQIPLISDIVDLASDGTFFTNLDLIQGFFQIPLRKEDRPLTAFATPTGTYQYKRMPMG 1542
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSIL---GSL 644
L AP F + V L + ++ YLDD L+V+ LE K +L +
Sbjct: 1543 LCGAPHTFQTA---VRQLQKKTKAKLFCYLDDLLIVSN---TLEQHMKDIEEVLQNIAEI 1596
Query: 645 GWIVNLQKSSLSPAPVLQFLGIM 667
G+ V ++K + P + FLG++
Sbjct: 1597 GFKVKIEKCKFA-QPEVTFLGLL 1618
>gi|82793635|ref|XP_728120.1| RNase H [Plasmodium yoelii yoelii 17XNL]
gi|23484307|gb|EAA19685.1| RNase H, putative [Plasmodium yoelii yoelii]
Length = 962
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 96/195 (49%), Gaps = 16/195 (8%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I+++ + GV+ + +T+ F S ++ V K +G R ++ + LNQ ++P ++ + +
Sbjct: 132 IKDLKDAGVV--VPTTSPFNSPIWPVQKTDGSWRMTVDYRKLNQVVTPIAAAVPD---VV 186
Query: 540 SFLQK-----GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
S L++ G + +IDL+ A+F VP+ HQ+ A S+ G T LP G +P
Sbjct: 187 SLLEQINTSPGTWYAAIDLANAFFSVPVHKDHQKQFAFSWQGQQYTFTVLPQGYINSPAL 246
Query: 595 FASLSNW-VASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLA--VSILGSLGWIVNLQ 651
+L + L + + +V Y+DD +L+ P E+ L V GW +N
Sbjct: 247 CHNLVRRDLDRLDLPQNITLVHYIDDIMLIG--PSXQEVATTLDSLVXHXXIRGWEINPX 304
Query: 652 KSSLSPAPVLQFLGI 666
K P ++FLG+
Sbjct: 305 KIQ-GPXTSVKFLGV 318
>gi|385145298|emb|CCG14716.1| ORF V protein [Strawberry vein banding virus]
Length = 699
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 86/384 (22%), Positives = 162/384 (42%), Gaps = 39/384 (10%)
Query: 479 HIQEMLETGVL---KRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
I+E+L+ G++ K S+ F+ R K G R V+N K LN + L N
Sbjct: 288 QIEELLKLGIIRPSKSPHSSPAFMVRNHAEIK-RGKARMVINYKKLNDNTKGDGYLLPNK 346
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
++ + Y S D ++ V + + A S +PFGL AP F
Sbjct: 347 EQLLQRIGGKTYYSSFDCKSGFWQVRLAPETIQLTAFSCPQGHYEWLVMPFGLKQAPAIF 406
Query: 596 -----ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL 650
SLSN S VY+DD ++ ++ K+ ++ +LG +++
Sbjct: 407 QRHMDESLSNMYPSF-------CAVYVDDIIVFSKTEDEHLGHVKIVLNRCKALGIVLSK 459
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
+K+ L + FLG++ ++R L + L T + + ++ + LG L
Sbjct: 460 KKAQLCKTTI-NFLGLV----IERGNLKVQSHIGLH---LTAFPDQLADRNALQRFLGLL 511
Query: 711 SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQ 769
++ S P ++ + R Q L + T + + K++ + +LP L +P +
Sbjct: 512 NYISAYFP--KIANLRSPLQVKLKKEITWSWTEKDTETVRKIKSLVKSLPDLYNP--SPE 567
Query: 770 VQHFISTDASDLGWGS----------QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSL 819
+ I DASD WG+ +V + SG + ++N+H N+KE+ ++ +A+
Sbjct: 568 DKPIIECDASDDHWGAVLKAKLPEGKEVICRYASGTFKPAEKNYHSNEKEILSIIKAIKA 627
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLR 843
+ +V++DN ++R
Sbjct: 628 FRAYILPYKFLVRTDNTNAAYFVR 651
>gi|294893680|ref|XP_002774593.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239879986|gb|EER06409.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 505
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 144/329 (43%), Gaps = 47/329 (14%)
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
+PFGL +A F L + V + L V VYLDD L+ + D E + + L +
Sbjct: 1 MPFGLCSAGATFQRLMDQVLNGLPF----VRVYLDDILVFSPDAETHEDHLRQVFARLRA 56
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA 703
G ++ +K P + +LG ++D + R P+ ++ ILR + N+
Sbjct: 57 WGLTLSAEKCEFG-CPSVPYLGHIFDGNGMR---PDPTKVE--AILRW---PRPGNVAEI 107
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQAS-----LLRLGAPHLTPINPAVLPKLEWWLNA 758
RS LG + +P +R IQR S L L A + L L+ L A
Sbjct: 108 RSFLGLAGYYRNFVPNFSDVARPIQRLVSEVGSETLALDA-YWGQEQEESLRALKLRLAA 166
Query: 759 LP-LSSPIFPRQVQHFISTDASDLGWGSQVDSS-----FLSGLWSREQQNWHINKKEMFA 812
LP L+ P F + + TDASD G+ + F S + Q NWH +KE +
Sbjct: 167 LPFLAYPDF--GIPFELYTDASDYAIGAVLMQEGRPLGFFSRTLTGSQLNWHTYEKEAYG 224
Query: 813 VHQAL------SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQD 866
+ QAL + PL V V +D++ +++L + G K +E+ L Q
Sbjct: 225 ILQALIYFQHYHIGYPL----TVTVYTDHEP-LTWLAKAGSKK-------LERWLLAMQA 272
Query: 867 WRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ + +++PG N AD+LSR + L D
Sbjct: 273 Y--SFIVKYVPGKKNVCADALSRIRQLDD 299
>gi|154274776|ref|XP_001538239.1| hypothetical protein HCAG_05844 [Ajellomyces capsulatus NAm1]
gi|150414679|gb|EDN10041.1| hypothetical protein HCAG_05844 [Ajellomyces capsulatus NAm1]
Length = 1172
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 79/169 (46%), Gaps = 9/169 (5%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+PP +P+ +L +A+ +++ L+ G ++ S G + + VPK +GG R
Sbjct: 498 ARPPFMPIYNLSETEL---AALREYLKNALDKGWIQPSSSPAG--APILFVPKKDGGLRL 552
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ +GLN+ ++ L + L K +DL AY + I + A
Sbjct: 553 CVDYRGLNRITIKNRYPLPLISELLDRLSKAKVFTKLDLRDAYHRILIAAKDRWKTAFRT 612
Query: 575 NGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLL 622
+PFGLA AP F A ++N ++ LL + VVYLDD L+
Sbjct: 613 RYGHFEYVVMPFGLANAPATFQAYINNALSDLL---DICCVVYLDDILI 658
>gi|154275790|ref|XP_001538740.1| hypothetical protein HCAG_06345 [Ajellomyces capsulatus NAm1]
gi|150413813|gb|EDN09178.1| hypothetical protein HCAG_06345 [Ajellomyces capsulatus NAm1]
Length = 1515
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 79/169 (46%), Gaps = 9/169 (5%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+PP +P+ +L +A+ +++ L+ G ++ S G + + VPK +GG R
Sbjct: 498 ARPPFMPIYNLSETEL---AALREYLKNALDKGWIQPSSSPAG--APILFVPKKDGGLRL 552
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ +GLN+ ++ L + L K +DL AY + I + A
Sbjct: 553 CVDYRGLNRITIKNRYPLPLISELLDRLSKAKVFTKLDLRDAYHRILIAAKDRWKTAFRT 612
Query: 575 NGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLL 622
+PFGLA AP F A ++N ++ LL + VVYLDD L+
Sbjct: 613 RYGHFEYVVMPFGLANAPATFQAYINNALSDLL---DICCVVYLDDILI 658
>gi|270006314|gb|EFA02762.1| hypothetical protein TcasGA2_TC008495 [Tribolium castaneum]
Length = 1365
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 75/176 (42%), Gaps = 5/176 (2%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QE+L+ +++ +S + + S + LV K NG R ++ + LN L
Sbjct: 516 VQELLDNNIVR--ESESNYCSPVLLVKKKNGEQRLCIDYRKLNAQTVKDNHPLPRVDDQI 573
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
LQ G Y S+DL Y +P+ +++ + +PFGL AP+ F
Sbjct: 574 DRLQGGVYFTSLDLRSGYHQIPLSEESKKYTSFVTPFGQYEYNRVPFGLTNAPRTFQRFM 633
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
N +L+ VYLDD L +D + IL S G +NL+K S
Sbjct: 634 N---KILKPARENAAVYLDDVFLHAKDVNEALQNLQKVFEILRSEGLTLNLKKCSF 686
>gi|427780071|gb|JAA55487.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 940
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 93/414 (22%), Positives = 165/414 (39%), Gaps = 51/414 (12%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVS-SAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
RI +G A P KP V +P +S ++EML GV++ +S + + + +
Sbjct: 80 RINTGSAHPIRQKPYRV---------SPTERKVISEQVEEMLRKGVIQ--ESASPWAAPV 128
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPI 562
LV K +G R ++ + LN + L L Y S+DL Y+ +P+
Sbjct: 129 ILVKKKDGSWRFCVDYRRLNSITKKDVYPLPRIDDALDCLHSASYFSSVDLRSGYWQIPM 188
Query: 563 KTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMR---VVVYLDD 619
T + A + +PFGL AP F + + RG++ + YLDD
Sbjct: 189 HTDDKEKTAFVTPDGLFEFNVMPFGLCNAPATFERFMDTIL-----RGLKWEICMCYLDD 243
Query: 620 FLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPE 679
++ + + + +S + ++N +K L LG + +D+ +
Sbjct: 244 VIIFGRTFHEHNERLGIVLSCIQKACLVLNSKKCHFGERQAL-VLGHL----VDKDGVRP 298
Query: 680 DKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP 739
D T + K N+ RS LG S+ IP + SLL A
Sbjct: 299 DPAKT--EAVEAFEPPK--NVKELRSFLGLCSYFRRFIPR---FADVAHPLTSLLHKNA- 350
Query: 740 HL--TPINPAVLPKLEWWLNALPL---SSPIFPRQVQHFISTDASDLGWGSQVDS----- 789
H TP A +L++ L + P+ +P P +V TDAS +G G+ +
Sbjct: 351 HFEWTPECSASFRQLKFLLTSQPILRHFNPTAPTEVH----TDASGVGLGAVLVQRLGNR 406
Query: 790 ----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVV 839
++ S S+ ++N+ + ++E AV A+ L + +D+ ++
Sbjct: 407 EHVIAYASRSLSKPERNYTVTEQECLAVIFAVQRFRSYLYGRQFTIVTDHHSLC 460
>gi|388856666|emb|CCF49783.1| related to Gag-pol polyprotein [Ustilago hordei]
Length = 1106
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 101/392 (25%), Positives = 153/392 (39%), Gaps = 67/392 (17%)
Query: 454 SAKPPLVPLCSLQHLATPVS-SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT 512
KPP PL +L P S + ++ E L+ G ++ S + S + VPK +GG
Sbjct: 172 GGKPPQGPL----YLKGPKEMSELRRYLDENLKKGFIR--PSKSPAQSPVLFVPKKDGGL 225
Query: 513 RPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL 572
R ++ +GLN+ + L L+K +DL AY + I + A
Sbjct: 226 RLCVDYRGLNEITVKNRAPLPLIEEQLFLLRKARIYTKLDLRAAYNLIRIAKGDEWKTAF 285
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILE 631
+ +PFGLA AP F S N + R G+ VVVYLDDFL+ +
Sbjct: 286 GTQLGLYEYLVMPFGLANAPAHFQSFIN---DIFRDIIGIYVVVYLDDFLIFSDTEEAHV 342
Query: 632 IQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT 691
++ L S L K V +FLG + P M PE +RT
Sbjct: 343 KHVTEVLTRLRSNRLFAKLSKCEFHTKTV-EFLGYIIKPTGIEM-DPEK--------VRT 392
Query: 692 LLASKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
+ K W L+S + +L FA+F + R I A + + P+ V P
Sbjct: 393 V---KEWPMLESIHDIQRFLGFANF-------YRRFIAHFARIAK-------PLTALVKP 435
Query: 751 KLEWWLNALPLSS-PIFPRQVQHFIS----------------TDASDLGWGSQVDSSFLS 793
+ LP + F + +Q F S TDASD +
Sbjct: 436 IERFKKFELPEEAQQAFHKLIQAFTSAGVLQHFDYHLPTRLETDASDFAIAGVLKQEH-E 494
Query: 794 GLW------SRE----QQNWHINKKEMFAVHQ 815
G W SR+ ++N+ I+ KE+ AV++
Sbjct: 495 GRWHPVAFYSRKMPSAEKNYEIHDKELLAVYR 526
>gi|261865347|gb|ACY01928.1| hypothetical protein [Beta vulgaris]
Length = 1583
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 102/456 (22%), Positives = 171/456 (37%), Gaps = 61/456 (13%)
Query: 460 VPLCSLQHLATPVS-----------SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG 508
V +LQH PVS + I +ML G++++ S + F S + LV K
Sbjct: 612 VHAINLQHGTNPVSVRPYRYPQSQKDEIEQLIHDMLAAGIIQQ--SHSAFSSPVLLVKKK 669
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
+G R ++ + LN P K+ + + L +DL Y + +K +
Sbjct: 670 DGSWRFCVDYRALNNVTVPDKYPIPIIDELLDELHGACVFSKLDLKSGYHQIKMKPSDVH 729
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVAS-LLRSRGMRVVVYLDDFLLVNQDP 627
A + +PFGL AP F +L N V LR V+V+ DD L+ +
Sbjct: 730 KTAFRTHEGHYEFLVMPFGLTNAPATFQALMNEVFKPYLRK---FVLVFFDDILVYSTSL 786
Query: 628 RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPV------LQFLGIMWDPHLDRMWLPEDK 681
+ + +L + NL+K V + G+ DP
Sbjct: 787 EQHMHHLNVVLGLLATNHLFANLKKCEFGKEEVAYLGHIISSKGVAMDP----------- 835
Query: 682 QLTLGNILRTLLASKTWNLDSA-RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
+ A W++ S R L G+L + + ++ + L+ +
Sbjct: 836 --------SKVQAMMDWSIPSTLRELRGFLGLTGYYRRFVKGYASIAHPLTNQLKKDSFG 887
Query: 741 LTPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSG 794
+P L+ L P L P F + I DAS G G+ + ++ S
Sbjct: 888 WSPAATRAFETLKRALTEAPVLQMPNF--SLPFVIEADASGYGLGAVLLQQGHPIAYFSK 945
Query: 795 LWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYL-RRQGGTKSLSL 853
+ I +KE+ AV A+ L ++ SD Q++ L +R+ G
Sbjct: 946 TLGERARAKSIYEKELMAVVMAVQKWKHFLLGRHFVIHSDQQSLRHLLNQREIGPAYQKW 1005
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ + LL D+ I ++ PG +N VAD+LSR
Sbjct: 1006 VGK-----LLGFDFEI----KYKPGGHNKVADALSR 1032
>gi|415798|emb|CAA81643.1| blastopia polyprotein [Drosophila melanogaster]
Length = 1333
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 78/369 (21%), Positives = 151/369 (40%), Gaps = 40/369 (10%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ + A+ ++E +E ++++ ST+ SR+ +V K +G R ++ + LN +
Sbjct: 496 LSEEEAIAVKKQVEEWVEQSIVRK--STSNVASRIVVVRKKDGTLRVCVDYRKLNTMV-- 551
Query: 528 KKFSLINHFRIPSF------LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAM 581
L++ F +P LQ + ++DL +FHV ++ + + A +
Sbjct: 552 ----LMDCFPVPIMEEVLEKLQSAKWFTTMDLQNGFFHVAVEEASKPYTAFVTREGLFEF 607
Query: 582 TCLPFGLATAPQAFASLSNWV-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
PFG +P AF ++ L+ S M+ +Y+DD ++ P + ++ +
Sbjct: 608 NKAPFGFKNSPAAFIRFVQFIFQELINSNIMQ--LYMDDIIVYAATPEECMEKTEMVLKR 665
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS--KTW 698
G + +K + + FLG + E Q+ G + + S
Sbjct: 666 AAEFGLKIKWKKCNFMQRRI-HFLG----------HIIEGGQICPGKEKTSAVNSFGTPQ 714
Query: 699 NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLN 757
N+ + + LG F IP +R + LL+ A ++ P+ + KL+ L
Sbjct: 715 NVKAVQGFLGLTGFFRKFIPGYAQIARPL---TDLLKKDAIFNIGPVEQQSVNKLKEILV 771
Query: 758 ALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSG-----LWSREQQNWHINKKEMFA 812
P+ I+ R+ + + TDAS G G+ + F WSR+ N+ +
Sbjct: 772 NEPVLR-IYSREAETELHTDASKDGLGAVLLQKFEGSFHPVCFWSRKTTKAESNRHSYYL 830
Query: 813 VHQALSLNL 821
+A L L
Sbjct: 831 EVKAAYLAL 839
>gi|45382873|ref|NP_989963.1| pol-like protein ENS-3 [Gallus gallus]
gi|13194728|gb|AAK15526.1|AF329451_1 pol-like protein ENS-3 [Gallus gallus]
Length = 936
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 112/244 (45%), Gaps = 21/244 (8%)
Query: 457 PPLVP--LCSLQHLATPVS--SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT 512
PPL P L ++ P+ S +S + E+ E G++ + + + F S ++ V K NG
Sbjct: 45 PPLPPSNLTCVKPYPLPLGARSGISPVLAELKEQGIV--IPTHSPFNSPVWPVRKPNGKW 102
Query: 513 RPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGD--YMISIDLSQAYFHVPIKTTHQRFL 570
R ++ + LN P ++ N + + +Q+ +M +ID+ +F VP+ Q
Sbjct: 103 RLTIDYRRLNANTGPLTAAVPNISELIAAIQEQAHPFMATIDVKDMFFMVPLHPDDQLRF 162
Query: 571 ALSYNGDVLAMTCLPFGLATAPQ-AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
A ++ G T LP G +P A +L+ + + G+R+ Y+DD L+
Sbjct: 163 AFTWEGQQYTFTRLPQGFKHSPTLAHYALAKELEQIPLEEGVRLYQYIDDILIGGDHLTP 222
Query: 630 LEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD-----------PHLDRMWLP 678
++I + L LG + K SPA ++FLGI W LD++ +P
Sbjct: 223 VKIMHDKIIKRLEELGLTIPPDKIQ-SPAAEVKFLGIWWKGGMACIPQDTLSALDQLKMP 281
Query: 679 EDKQ 682
E+K+
Sbjct: 282 ENKK 285
>gi|378788721|gb|AFC40210.1| polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 85/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTTEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQWVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D M + E + L +++
Sbjct: 272 DEQF--MKIEESRWKELKTVIK 291
>gi|156847234|ref|XP_001646502.1| Tkp3 protein [Vanderwaltozyma polyspora DSM 70294]
gi|156117179|gb|EDO18644.1| Tkp3 protein [Vanderwaltozyma polyspora DSM 70294]
Length = 1233
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 104/417 (24%), Positives = 173/417 (41%), Gaps = 53/417 (12%)
Query: 492 LDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYM 548
+ S + F S + +V K +G R ++ + LN+ F L R S L K
Sbjct: 299 ISSKSPFSSPIVMVKKKDGTYRLCVDYRTLNKATVKDPFPLP---RTESALAKIGAASIF 355
Query: 549 ISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRS 608
++DL Y +P+K + A T +PFGL AP F S ++A L R
Sbjct: 356 TTLDLHSGYHQIPMKREDRYKTAFVTPSGKYEYTVMPFGLVNAPSTF---SRYMADLFRE 412
Query: 609 RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
V VYLDD L+ + L + L + IV K S + V ++LG +
Sbjct: 413 MKF-VNVYLDDILIFSSSETEHWKHIDLVLQKLKNEQLIVKKPKCSFAEKEV-EYLGYII 470
Query: 669 DPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQ 728
H + P + + T N+ +A+ LG +++ IP ++ IQ
Sbjct: 471 SEHKIK---PVQSKCEAISKFPT-----PNNVKAAQRFLGMINYYRRFIPHCSTIAKPIQ 522
Query: 729 RQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVD 788
+ + L P A + +L+ L P+ P P + ++TDAS G G+ ++
Sbjct: 523 ---DYINEESTWLQPQTDA-MNELKTILTNHPILVPFQPNG-NYRLTTDASKYGIGAVLE 577
Query: 789 SSF-------LSGLWSRE----QQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQT 837
+ G +S+ QQN+ + E+ + QAL LL +++D+ +
Sbjct: 578 EVTPTGNVLGVVGYYSQSLKGAQQNYPAGELELLGIVQALDHFKYLLHGIHFSLRTDHVS 637
Query: 838 VVSYLRRQGG-----TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
++S LR + + L L++ + F LS +IPG N VAD+LSR
Sbjct: 638 LLS-LRNETEPAPRVQRHLDKLADYD--FELS----------YIPGPENVVADALSR 681
>gi|327262238|ref|XP_003215932.1| PREDICTED: hypothetical protein LOC100563775 [Anolis carolinensis]
Length = 907
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 50/98 (51%)
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
LPFG+ TAP+ F V L +G+ V Y+DD+LLV L + +S+L
Sbjct: 493 LPFGICTAPRVFTKCMAIVTCYLHVQGITVFPYIDDWLLVADSWHQLLHNIQFTISLLQD 552
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK 681
LG +VN KS L P +QF+G + R L ED+
Sbjct: 553 LGLVVNGDKSHLQPQQCIQFIGARLNSLSGRACLLEDR 590
>gi|432869982|ref|XP_004071779.1| PREDICTED: uncharacterized protein LOC101172982 [Oryzias latipes]
Length = 1605
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 70/159 (44%), Gaps = 4/159 (2%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P AM ++QE L+ GV++ S G + F V K + G RP ++ +GLN
Sbjct: 39 LSPPERDAMDSYLQECLQAGVIRPSSSPAG--AGFFFVGKRDRGLRPCIDYRGLNSITVR 96
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
+ L L +DL AY V I+ + A + +PFG
Sbjct: 97 NTYPLPLLQTAFDLLSGAQVFSKLDLRNAYHLVRIREGDEWKTAFNTPNGHFEYLVMPFG 156
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
L AP F SL N + + ++ V VYLDD L+ + D
Sbjct: 157 LTNAPAVFQSLINDILKDMINKF--VFVYLDDILIFSPD 193
>gi|341883865|gb|EGT39800.1| hypothetical protein CAEBREN_00607 [Caenorhabditis brenneri]
Length = 1108
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 91/192 (47%), Gaps = 12/192 (6%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHF-R 537
+ EML +++ ST+ F S + LV K +G R + + LN ++ K+ LI
Sbjct: 578 QVNEMLSMDIIE--PSTSTFTSPIVLVKKKDGTFRFTTDFRELNA-VTVKQIYLIPLISD 634
Query: 538 IPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
I G + ++DL Q +F +P++ + ++S +P GL AP F S
Sbjct: 635 IVDLASHGKFFTNLDLIQGFFQIPLRKQDRPLTSVSTPNGTFQYKRMPMGLCGAPHTFQS 694
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
+ + R+ R+ YLDD L+V+ D + +I+ L I ++G+ + +QK
Sbjct: 695 AVQQLQKMTRA---RLFCYLDDLLIVSDSMDQHLKDIEEVLKNII--TIGFKIKIQKCKF 749
Query: 656 SPAPVLQFLGIM 667
P + FLG++
Sbjct: 750 -PQREVTFLGLL 760
>gi|34328897|gb|AAO67369.1| polyprotein 1 [Petunia vein clearing virus]
Length = 1886
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 135/335 (40%), Gaps = 34/335 (10%)
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRF 569
G R V+N + LN FL KF + N + S L K DL ++ + I +
Sbjct: 1438 GKLRLVINYQPLNHFLQDDKFPIPNKLTLFSHLSKAKLFSKFDLKSGFWQLGIHPNERPK 1497
Query: 570 LALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
+PFGL TAP F + + + +VY+DD LL ++
Sbjct: 1498 TGFCIPDRHFQWKVMPFGLKTAPSLFQKA---MIKIFQPILFSALVYIDDILLFSE---T 1551
Query: 630 LEIQGKLA---VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK-QLTL 685
LE KL +S++ G +++ +K L+ + QFLG+ + D + P L L
Sbjct: 1552 LEDHIKLLNQFISLVKKFGVMLSAKKMILAQNKI-QFLGMDF---ADGTFSPAGHISLEL 1607
Query: 686 GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN 745
T L+ K + LG +++ IP H I + +L+ P
Sbjct: 1608 QKFPDTNLSVK-----QIQQFLGIVNYIRDFIPEVTEH---ISPLSDMLKKKPPAWGKCQ 1659
Query: 746 PAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGW---------GSQVDSSFLSGLW 796
+ +L+ A + S P + + + TDASD W G + F SG +
Sbjct: 1660 DNAVKQLKQL--AQQVKSLHIPSEGKKILQTDASDQYWSAVLLEEHNGKRKICGFASGKF 1717
Query: 797 SREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVM 830
+Q++H KE+ AV + N L+ ++ ++
Sbjct: 1718 KVSEQHYHSTFKEILAVKNGIKKFNFFLIHTNFLV 1752
>gi|440472042|gb|ELQ40936.1| hypothetical protein OOU_Y34scaffold00319g1 [Magnaporthe oryzae
Y34]
Length = 969
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 90/402 (22%), Positives = 154/402 (38%), Gaps = 72/402 (17%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ ++ E L+ G ++ S G+ + VPK NG R ++ + LN + L
Sbjct: 566 ETLDKYLDENLKKGYIRPSTSPAGYP--ILFVPKKNGKLRLCVDYRQLNDITVKNCYPLP 623
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ L + + ++DL AY + IK + A +PFGL AP
Sbjct: 624 LIGELRDMLYQAQWFTTLDLKGAYNLIRIKKGEEWKTAFRTRRGHFEYLVMPFGLTNAPA 683
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL---GWIVNL 650
F ++ N V L + + VVVYLDD L+ + + LE + ++L L ++
Sbjct: 684 TFQTMINHV--LRKCLDIFVVVYLDDILVFS---KTLEEHKQHVHTVLQKLQDAKLLIEP 738
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSL 706
+K V FL P RM + + A K W N+ R+
Sbjct: 739 EKCIFHSKKV-DFLEYTIAPGEIRMEASK------------IQAIKEWPQPKNVKDVRAF 785
Query: 707 LGYLSF-ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW-W--------- 755
LG+++F F+ G++ TP+ LE+ W
Sbjct: 786 LGFVNFYRRFIKGYGKI------------------ATPLTNLTKKDLEFKWDKTENQTFE 827
Query: 756 -LNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQVDS----------SFLSGLWSREQQ 801
L + P+ P + F + TDASD G Q+ +F S +
Sbjct: 828 QLRDTVATEPVLRIPDPEKLFEVETDASDYAVGGQLGQKDEKGRLHPCAFFSQKLHGPEL 887
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSY 841
N+ I+ KE+ A+ +A P L + V+V +D++ + +
Sbjct: 888 NYQIHNKELMAIIRAFEEWKPQLSGTKHEVLVYTDHKNLTHF 929
>gi|425856953|gb|AFX98099.1| pol protein [Simian foamy virus]
Length = 1143
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 114/238 (47%), Gaps = 15/238 (6%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I ++L+ GVL + +ST + ++ VPK +G R VL+ + +N+ + + I
Sbjct: 182 INDLLKQGVLVQQNSTMN--TPIYPVPKPDGKWRMVLDYREVNKTIPLIAAQNQHSAGIL 239
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S + +G Y ++DLS ++ PI A ++ G T LP G +P F +
Sbjct: 240 SSIFRGKYKTTLDLSNGFWAHPITPESYWLTAFTWQGQQYCWTRLPQGFLNSPALFTAD- 298
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
V LL+ V Y+DD + + DP Q + S+L + G++V+L+KS ++
Sbjct: 299 --VVDLLKEIP-NVQAYVDDIYISHDDPVEHVQQLEKVFSLLLNAGYVVSLKKSEIAKHE 355
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
V +FLG ++ + E + LT + L + +L +S+LG L+FA I
Sbjct: 356 V-EFLGF----NITK----EGRGLTDTFKQKLLNITPPKDLKQLQSILGLLNFARNFI 404
>gi|149239979|ref|XP_001525865.1| hypothetical protein LELG_02423 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449988|gb|EDK44244.1| hypothetical protein LELG_02423 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 953
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 97/453 (21%), Positives = 196/453 (43%), Gaps = 53/453 (11%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L + ++A+ + +QEM+ G L D+T + + FL+ K +G R +++L+ LN+ +
Sbjct: 75 LGSKRTAAVEI-LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVEL 131
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
+ ++ + + + ++ +ID+ AYF +P+ + + +L LP G
Sbjct: 132 EGGHPLSVDDLTTEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQG 191
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
+ F+S+ + +L V+ ++DD +V P++ E+ L L + +
Sbjct: 192 YINSVSEFSSI---LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEV 246
Query: 648 VNLQKSS---LSPA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLAS 695
L ++ ++PA P FLG H+ P K L G + L L +
Sbjct: 247 FRLLTNAGLKINPAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPN 298
Query: 696 KTWNLDSARSLLGYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLE 753
L+S L+ Y + ++ L + + QA H P ++
Sbjct: 299 TVKQLESFLGLVNY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQII 356
Query: 754 WWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNW 803
L P+ P+ + + + TDAS WG + ++ SG + ++N+
Sbjct: 357 TVLTNQPILQPLNFKDLIT-VHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNY 415
Query: 804 HINKKEMFAVHQALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK-- 859
I +KE+F++++ PLL + V+ + DN+ +V + + + ++ V K
Sbjct: 416 TIYEKELFSIYKTFDAIHPLLFGFTGVIHLYCDNKALVLVMNKP--LDNSHFVNRVYKWL 473
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
F+ + +++IH I G N +AD+LSR +
Sbjct: 474 NFIRTFNYQIH----HIDGLKNIIADALSRCHT 502
>gi|189236883|ref|XP_001807231.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 1063
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 75/176 (42%), Gaps = 5/176 (2%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QE+L+ +++ +S + + S + LV K NG R ++ + LN L
Sbjct: 214 VQELLDNNIVR--ESESNYCSPVLLVKKKNGEQRLCIDYRKLNAQTVKDNHPLPRVDDQI 271
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
LQ G Y S+DL Y +P+ +++ + +PFGL AP+ F
Sbjct: 272 DRLQGGVYFTSLDLRSGYHQIPLSEESKKYTSFVTPFGQYEYNRVPFGLTNAPRTFQRFM 331
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
N +L+ VYLDD L +D + IL S G +NL+K S
Sbjct: 332 N---KILKPARENAAVYLDDVFLHAKDVNEALQNLQKVFEILRSEGLTLNLKKCSF 384
>gi|149246023|ref|XP_001527481.1| hypothetical protein LELG_00001 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146447435|gb|EDK41823.1| hypothetical protein LELG_00001 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1084
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 97/453 (21%), Positives = 196/453 (43%), Gaps = 53/453 (11%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L + ++A+ + +QEM+ G L D+T + + FL+ K +G R +++L+ LN+ +
Sbjct: 206 LGSKRTAAVEI-LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVEL 262
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
+ ++ + + + ++ +ID+ AYF +P+ + + +L LP G
Sbjct: 263 EGGHPLSVDDLTTEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQG 322
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
+ F+S+ + +L V+ ++DD +V P++ E+ L L + +
Sbjct: 323 YINSVSEFSSI---LQKILSPVAKDVMCFIDDIAIVG--PKVDELTDSLVREHLDKIVEV 377
Query: 648 VNLQKSS---LSPA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLAS 695
L ++ ++PA P FLG H+ P K L G + L L +
Sbjct: 378 FRLLTNAGLKINPAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPN 429
Query: 696 KTWNLDSARSLLGYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLE 753
L+S L+ Y + ++ L + + QA H P ++
Sbjct: 430 TVKQLESFLGLVNY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQII 487
Query: 754 WWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNW 803
L P+ P+ + + + TDAS WG + ++ SG + ++N+
Sbjct: 488 TVLTNQPILQPLNFKDLIT-VHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNY 546
Query: 804 HINKKEMFAVHQALSLNLPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK-- 859
I +KE+F++++ PLL + V+ + DN+ +V + + + ++ V K
Sbjct: 547 TIYEKELFSIYKTFDAIHPLLFGFTGVIHLYCDNKALVLVMNKP--LDNSHFVNRVYKWL 604
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
F+ + +++IH I G N +AD+LSR +
Sbjct: 605 NFIRTFNYQIH----HIDGLKNIIADALSRCHT 633
>gi|189234033|ref|XP_001807972.1| PREDICTED: similar to protease, reverse transcriptase, ribonuclease
H, integrase [Tribolium castaneum]
Length = 1300
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 98/475 (20%), Positives = 196/475 (41%), Gaps = 53/475 (11%)
Query: 426 LRRFVDAWIRLGAPAPLV--RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
++ + D + G P P RI +G ++P A PP C + + + + I+ +
Sbjct: 401 VKEYRDLFAEFGPPTPYATHRINTGDSLPV-AVPPYRLTCEKRKV-------LQMEIERL 452
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
L+ G+++ DS + S + +VPK NG R ++ + LN P + L +
Sbjct: 453 LQQGIIEECDS--AWASPVVMVPKANGTIRLCVDYRKLNAVTKPDVYPLPRLDDLLHATG 510
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
K + ++DL Y+ + ++ + + + T +PFGL AP +F L +
Sbjct: 511 KIGCITTLDLQAGYWQIQVEPGDRDKTSFICPFGLYRFTRMPFGLRNAPASFQRLMDKFK 570
Query: 604 SLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQF 663
+ + + ++ YLDD ++++ D + L +N K + + V ++
Sbjct: 571 TGIPD--VPILAYLDDLIIISPDGHTHIRHLRSTFDQLRRFQLRINRNKCLIGCSKV-RY 627
Query: 664 LGIMWDPHLDRMWLPEDKQL-TLGNI--------LRTLLASKTWNLDSARSLLGYLSFAS 714
LG P P++K++ + I L++ L + +W R + FAS
Sbjct: 628 LGHRISPS---GIAPDEKKVEAIKQIPPPRNLKQLQSFLQTCSW----FRRFID--QFAS 678
Query: 715 FVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
P+ L ++ AS A L+ L P+ P Q +
Sbjct: 679 VARPLSEL----TKKNASWKWGNA------QDEAFTTLKEKLTTAPILRAADPTQ-PFIL 727
Query: 775 STDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSD 834
TD+S + + S L + ++N+ ++E A+ A+S + + V +D
Sbjct: 728 RTDSSAYALARR-PVEYASRLLTSSERNYSTTEREALAIVWAISKFRGYVGENSTTVITD 786
Query: 835 NQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+Q + ++ + T L+ + L Q + +++ ++ PG N++AD LSR
Sbjct: 787 HQPLRWFMSLKTPTGRLARWA------LQLQPY--NLVIEYTPGKANTIADFLSR 833
>gi|294942880|ref|XP_002783703.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239896284|gb|EER15499.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 374
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 143/330 (43%), Gaps = 49/330 (14%)
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
+PFGL +A F L + V + L V VYLDD L+ + D E + + L +
Sbjct: 1 MPFGLCSAGATFQRLMDQVLNGLPF----VRVYLDDILVFSPDAETHEDHLRQVFARLRA 56
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA 703
G ++ +K P + +LG ++D + R P+ ++ ILR + N+
Sbjct: 57 WGLTLSAEKCEFG-CPSVPYLGHIFDGNGMR---PDPTKVE--AILRW---PRPGNVAEV 107
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT------PINPAVLPKLEWWLN 757
RS LG + +P +R IQR S +G+ L L+ L
Sbjct: 108 RSFLGLAGYYRNFVPNFSDVARPIQRLVS--EVGSETLALDTYWGQEQEESFRALKLRLA 165
Query: 758 ALP-LSSPIFPRQVQHFISTDASDLGWGSQVDSS-----FLSGLWSREQQNWHINKKEMF 811
ALP L+ P F + + TDASD G+ + F S + Q NWH +KE +
Sbjct: 166 ALPFLAYPDF--GIPFELYTDASDYAIGAVLMQEGRPLGFFSRTLTGSQLNWHTYEKEAY 223
Query: 812 AVHQAL------SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
+ QAL + PL V V +D++ +++L + G K +E+ L Q
Sbjct: 224 GILQALIYFQHYHIGYPL----TVTVYTDHEP-LTWLAKAGSKK-------LERWLLAMQ 271
Query: 866 DWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ + +++PG N AD+LSR + L D
Sbjct: 272 AY--SFIVKYVPGKKNVCADALSRIRQLDD 299
>gi|280492|pir||S23570 pol polyprotein homolog - fungus (Cladosporium fulvum)
retrotransposon CfT-1 (fragment)
gi|2564|emb|CAA77891.1| Reverse Transcriptase [Passalora fulva]
Length = 1045
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 99/439 (22%), Positives = 167/439 (38%), Gaps = 71/439 (16%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
++E L G ++R S+ G + VPK NG R V + + LN+ ++ L N
Sbjct: 341 LKEKLAKGWIRRSTSSAG--TPCMFVPKANGKLRLVQDYRKLNEITIKNRYPLPNIEEAQ 398
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L D+ IDL A++ + + + A + +P GL AP AS
Sbjct: 399 DRLTGSDWYTKIDLRDAFYAIRMAEGEEWKTAFRTRYGLYEFLVMPMGLTNAP---ASCQ 455
Query: 600 NWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
+ V LR + VV Y+DD L+ +G L L KS A
Sbjct: 456 DLVNETLRDLLDVCVVAYMDDILVYT--------KGSLQEHTKQVQDVFERLTKSGFKTA 507
Query: 659 P---------------VLQFLGIMWDPHLD---RMWLPEDKQLTLGNILRTLLASKTWNL 700
P ++ GI DP R W PE K + +++ L +N
Sbjct: 508 PEKCEFHKKEVKFLGFIISTTGITIDPAKTQSIREW-PEPKTV---KDVQSFLGLANYN- 562
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP 760
R + ++ PM L + + + + A A P L
Sbjct: 563 ---RKFIK--DYSKTAAPMTMLTRKDVNWKWGKEQTEAFKRLKEQCASAPTLR------- 610
Query: 761 LSSPIFPRQVQHFISTDASDLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMF 811
+F + I TDASD+ G+ + + + S + +QN+ I+ KE+
Sbjct: 611 ----LFDGSKEVHIETDASDMAIGACLTQTHDGKRHPVAYYSRKMTTAEQNYDIHDKELL 666
Query: 812 AVHQALSLNLPLLQS-SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
A+ A+ ++ + + SD++ + + + T+ + SE LL Q ++
Sbjct: 667 AIVAAMQHWRVYVEGPPKLTILSDHKNLTYFTTTKELTRRQARWSE-----LLGQ-YKFE 720
Query: 871 ILAQFIPGAYNSVADSLSR 889
I ++ PG N AD+LSR
Sbjct: 721 I--KYTPGTENGPADALSR 737
>gi|384485786|gb|EIE77966.1| hypothetical protein RO3G_02670 [Rhizopus delemar RA 99-880]
Length = 228
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 74/171 (43%), Gaps = 34/171 (19%)
Query: 423 GGRLRRFVDAW-IRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQ 481
GG L++F+ AW + P PL I GY I F AK P+ +HL S+ LH Q
Sbjct: 91 GGLLQQFITAWRSTITHPWPLSVIQHGYKIQF-AKQPVPWKLPKKHL----SAEDQLHEQ 145
Query: 482 EMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF 541
KR RP+L+ + LN+FL + F + +
Sbjct: 146 T-------KR---------------------RPILDRQKLNRFLQVEHFQMEGVPTLREI 177
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
+++ DY+ IDL AY +PI Q +L+ G V L FGL+ AP
Sbjct: 178 IEENDYICKIDLKDAYVVIPIHPDSQDYLSFENQGTVYRYKSLAFGLSVAP 228
>gi|378788717|gb|AFC40208.1| polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 85/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTTEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQWVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D M + E + L +++
Sbjct: 272 DEQF--MKIEESRWKELRTVIK 291
>gi|353453394|gb|AER00538.1| ORF III [Citrus yellow mosaic virus]
Length = 1982
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 105/454 (23%), Positives = 182/454 (40%), Gaps = 52/454 (11%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVL---KRLDSTTGFLSRL------FLVPKGNGGTRPV 515
L+H+ + + H++ +L+ G + K TT ++ + G R V
Sbjct: 1400 LKHVTPQMEESFRKHVEALLKIGAIRPSKSRHRTTAIIANSGTSIDPITGKEVKGKERMV 1459
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYFHVPIKTTHQRFLAL 572
N K LN + ++SL I + LQ KG + S DL + V + + A
Sbjct: 1460 FNYKRLNDLTNKDQYSLPG---IQTILQRLKGSTIFSKFDLKSGFHQVAMHPDSIEWTAF 1516
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEI 632
+ +PFGL AP F + + + VY+DD + ++ + E
Sbjct: 1517 WVPSGLYEWLVMPFGLKNAPAVFQRKMD---HCFKGTEAFIAVYIDDIQVFSKTEQDHEE 1573
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL 692
++ +SI G I++ K ++ A + FLG + L ++ Q + L T
Sbjct: 1574 YLQIMLSICQKNGLILSPTKMKIAQAEI-GFLGAIIHKGLIKL------QPHIVQKLLTF 1626
Query: 693 LASKTWNLDSARSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
+ + RS LG L++A IP MGRL S A + G + + A++ K
Sbjct: 1627 TNKQLEEVKGLRSWLGLLNYARSYIPHMGRLLSPLY---AKVSPTGERRMNRQDWALIDK 1683
Query: 752 LEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG-------SQVD-------SSFLSGLW 796
+ + LP L P P I TD GWG +Q D ++ SG +
Sbjct: 1684 IRAQVQNLPALELP--PADCFIIIETDGCMDGWGGVCKWKVAQYDPRSSERVCAYASGKF 1741
Query: 797 SREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
+ + E+ AV +L + + L S + +++D Q ++S+ + K +
Sbjct: 1742 NPPKSTI---DAEIHAVMNSLNNFKIYYLDKSSLCLRTDCQAIISFFNKSNVNKPSRVRW 1798
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
FL ++I + I G N +AD+LSR
Sbjct: 1799 IAFTDFLTGLGIPVNI--EHIDGKNNHLADALSR 1830
>gi|4544430|gb|AAD22339.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1411
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 107/469 (22%), Positives = 180/469 (38%), Gaps = 74/469 (15%)
Query: 452 PFSAK--PPLVPLCSLQHLATPVS-SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG 508
PF+ + P PL + P + + ++++L G ++ ST+ + + + V K
Sbjct: 506 PFTIELEPGTAPLSKAPYRMAPAEMTELKKQLEDLLGKGFIR--PSTSPWGAPVLFVKKK 563
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
+G R ++ +GLN K+ L + L+ IDL+ Y +PI R
Sbjct: 564 DGSFRLCIDYRGLNWVTVKNKYPLPRIDELLDQLRGATCFSKIDLTSGYHQIPIAEADVR 623
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
A +PF L AP AF L N V V++++DD L+ ++ P
Sbjct: 624 KTAFRTRYGHFEFVVMPFALTNAPAAFMRLMNSVFQEFLDEF--VIIFIDDILVYSKSPE 681
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFL-------GIMWDPHLDRMWLPEDK 681
E+ + + L L K S + FL G+ DP
Sbjct: 682 EHEVHLRRVMEKLREQKLFAKLSKCSFWQREI-GFLGHIVSAEGVSVDPE---------- 730
Query: 682 QLTLGNILRTLLASKTW----NLDSARSLL---GYL-----SFASFVIPMGRLHSRRIQR 729
+ A + W N RS L GY FAS PM +L + +
Sbjct: 731 ---------KIEAIRDWPRPTNATEIRSFLRLTGYYRRFVKGFASMAQPMTKLTGKDV-- 779
Query: 730 QASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQ-HFISTDASDLGWGSQV 787
P + +P L+ L + P+ + P Q + + TDAS +G G +
Sbjct: 780 ---------PFVWSPECEEGFVSLKEMLTSTPVLA--LPEHGQPYMVYTDASRVGLGCVL 828
Query: 788 DS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYL 842
++ S + + N+ + EM V AL + L V V +D+++ + Y+
Sbjct: 829 MQRGKVIAYASRQLRKHEGNYPTHDLEMAVVIFALKIWRSYLYGGKVQVFTDHKS-LKYI 887
Query: 843 RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
Q +L + +E L D+ + I + PG N VAD+LSR +
Sbjct: 888 FNQPEL-NLRQMRWME----LVADYDLEI--AYHPGKANVVADALSRKR 929
>gi|20136450|gb|AAM11674.1|AF492764_2 pol protein [Drosophila melanogaster]
Length = 849
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 102/435 (23%), Positives = 186/435 (42%), Gaps = 59/435 (13%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPK-----GNGGTRPVLNLKGLNQFLSPKKFSLIN 534
IQEMLE G+++ +S++ + S +++VPK G R V++ + LN+ ++ + N
Sbjct: 189 IQEMLEQGIIR--ESSSPYCSPIWVVPKKLDASGQQKLRIVIDYRKLNEITINDRYPMPN 246
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I L + +Y +IDL++ + + + + A S T +PFGL AP
Sbjct: 247 IDEILGKLGRSNYFTTIDLAKGFHQIEMDSESIAKTAFSTKYGHYEYTRMPFGLKNAPAT 306
Query: 595 FA-SLSNWVASLLRSRGMRVVVYLDDFLL----VNQDPRILE-IQGKLAVSILG-SLGWI 647
F ++N + LL + VYLDD ++ + + + LE + KL+ + L L
Sbjct: 307 FQRCMNNLLRPLLNKNCL---VYLDDIIVFSTSLEEHLQSLEAVFEKLSQANLKLQLDKC 363
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLL 707
L++ + V+ GI +P + I L SK + + +
Sbjct: 364 EFLRQETTFLGHVITKDGIKPNPE------------KIKAIQDYPLPSKPKEIKAFLGIT 411
Query: 708 GYL-----SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-L 761
GY +F+ P+ + + + ++ H I KL+ ++ P L
Sbjct: 412 GYYRKFIPNFSDIAKPLTKCLKKGV-------KIDTKHKEYI--EAFQKLKLLISEDPIL 462
Query: 762 SSPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQA 816
P F R+ ++TDAS++ G+ + S++S + + N+ +KE+ A+ A
Sbjct: 463 KIPNFERKF--VLTTDASNVALGAVLSQDGHPISYISRTLNEHEVNYSAIEKELLAIVWA 520
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
L + SD+Q + + + L+ KI L D+ I ++I
Sbjct: 521 TKTFRHYLLGRHCEIASDHQPLCWLHKLKEPNSKLTRW----KIRLSEYDFDI----KYI 572
Query: 877 PGAYNSVADSLSRSK 891
G N VAD+LSR K
Sbjct: 573 KGKENHVADALSRIK 587
>gi|357614969|gb|EHJ69396.1| hypothetical protein KGM_03198 [Danaus plexippus]
Length = 457
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 53/103 (51%), Gaps = 6/103 (5%)
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP-AVLPKLEWWLNAL 759
D A SLLG + F LH R +Q + L P+ I P +V +L WWL A+
Sbjct: 359 DQALSLLGDSTLHYF-----GLHCRTLQYHSRHRPLNHPYHQKILPESVALELRWWLKAI 413
Query: 760 PLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQN 802
+ PI V H TDASD+GWG+Q++ + LSG W ++
Sbjct: 414 ASTLPIHLGSVTHHAKTDASDIGWGAQIEETKLSGQWIEDKHG 456
>gi|378788719|gb|AFC40209.1| polymerase, partial [Duck hepatitis B virus]
Length = 292
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 85/202 (42%), Gaps = 19/202 (9%)
Query: 501 RLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF-----------LQKGDYMI 549
+LFLV K + T + +QF K N R P + L G I
Sbjct: 97 KLFLVDKNSRNTTEARLVVDFSQFSKGK-----NAMRFPRYWSPNLSTLRRILPVGMPRI 151
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
S+DLSQA++H+P+ LA+S V P G+ +P + + S + R
Sbjct: 152 SLDLSQAFYHLPLNPASSSRLAVSDGQWVYYFRKAPMGVGLSPFLLHLFTTALGSEISRR 211
Query: 610 -GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMW 668
+ Y+DDFLL + + R L S L LG +N K++ SP ++FLG
Sbjct: 212 FNVWTFTYMDDFLLCHPNARHLNSISHAVCSFLQELGIRINFDKTTPSPVTEIRFLGYQI 271
Query: 669 DPHLDRMWLPEDKQLTLGNILR 690
D M + E + L +++
Sbjct: 272 DEQF--MKIEESRWKELRTVIK 291
>gi|391331107|ref|XP_003739992.1| PREDICTED: uncharacterized protein LOC100907926 [Metaseiulus
occidentalis]
Length = 416
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 99/222 (44%), Gaps = 11/222 (4%)
Query: 677 LPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRL 736
+P DK+ + + TLLA++ L +LG L+ + ++ R H + ASL+
Sbjct: 5 VPTDKRHQIKQDIITLLATQRITLRMLYRILGKLNALTTIVRSLRYHCSSL---ASLVSK 61
Query: 737 GAPHLTPINPAV-LP-----KLEWWLNALPLSS--PIFPRQVQHFISTDASDLGWGSQVD 788
+ V LP L WW L + + PI V I+TD+S GWG+
Sbjct: 62 STRQNCAFDSEVPLPLTGREDLIWWSENLDMIAVGPIKLPLVSLEITTDSSLKGWGAWCG 121
Query: 789 SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGT 848
G W+ QN HIN E+ A+ A+ + + + +++DN T + + G
Sbjct: 122 QRASGGTWNIHDQNLHINALELKAIFLAVQKLADDQKDTTIAIRTDNTTAMHCVNNFGSL 181
Query: 849 KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
S +L S ++ + + I + A IPG N AD LSR+
Sbjct: 182 HSSTLNSLTRSLWAWAFERNIFLKATHIPGTCNDRADLLSRT 223
>gi|302853331|ref|XP_002958181.1| hypothetical protein VOLCADRAFT_99387 [Volvox carteri f.
nagariensis]
gi|300256450|gb|EFJ40715.1| hypothetical protein VOLCADRAFT_99387 [Volvox carteri f.
nagariensis]
Length = 701
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 89/231 (38%), Gaps = 45/231 (19%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
+A P +AM L +Q+ L GV RL+S +G+G R V+NLK +N
Sbjct: 6 VAGPDIAAMLLQMQQDL-AGVRARLNS------------QGSGSWRVVVNLKRMNIAQKA 52
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
K + + + +M+ +DL+ A++H+PI+ +RF + G + M LP G
Sbjct: 53 YKCRYESLRTLRRMGIQNSWMVKVDLADAFYHIPIRAADRRFFVFRFCGVLYQMNALPMG 112
Query: 588 LATAPQAFASLSNWVASLLRS--------------------------------RGMRVVV 615
+P F+ + V R G RV+
Sbjct: 113 WLNSPYWFSKIMRNVVRFWRDPLAAVGGGRRRLSPPLPPHQFFPSDRCARPARLGARVLP 172
Query: 616 YLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGI 666
YLDDFL V + + + LG + K P+ + LGI
Sbjct: 173 YLDDFLFVFASEEQARLGAQWVRESIEFLGLSCHPTKCQWEPSQSVYHLGI 223
>gi|308456302|ref|XP_003090602.1| hypothetical protein CRE_10709 [Caenorhabditis remanei]
gi|308262244|gb|EFP06197.1| hypothetical protein CRE_10709 [Caenorhabditis remanei]
Length = 2287
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/195 (22%), Positives = 90/195 (46%), Gaps = 10/195 (5%)
Query: 473 SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL 532
+ + +++M G+++ +ST+ + S L ++PK NG R V++ + LN + + +
Sbjct: 1764 KTKLEKQVKQMKANGLIE--ESTSPYTSPLLMIPKPNGEIRIVIDYRRLNLITRSRTYIM 1821
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
N I +G D++Q + +P+ H+ A + V +P GL AP
Sbjct: 1822 PNTIDICEEASRGKLFSVFDIAQGFHTIPMHEAHKERTAFCCHMGVFQYRKMPMGLKGAP 1881
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLV--NQDPRILEIQGKLAVSILGSLGWIVNL 650
F +A + R +++Y+DD ++V ++D I ++ + I +G +
Sbjct: 1882 DTF---QRAMAEVERQFSGTLILYVDDLIVVSNDEDQHITHLEEFFQLMI--KMGLKLKA 1936
Query: 651 QKSSLSPAPVLQFLG 665
+KS + + FLG
Sbjct: 1937 EKSQIGRTKI-SFLG 1950
>gi|147775005|emb|CAN70471.1| hypothetical protein VITISV_013478 [Vitis vinifera]
Length = 1122
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 98/422 (23%), Positives = 167/422 (39%), Gaps = 34/422 (8%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ +ML+ G++K ST+ F S + LV K +G R + + LN +F + +
Sbjct: 219 VCDMLKLGLIKA--STSLFSSPVLLVKKKDGTWRFCTDYRALNAVTIKDRFPIPTVDDML 276
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L Y +DL Y +V + A + +PFGL+ AP F ++
Sbjct: 277 DELHGATYFTKLDLRAGYHYVRVHPPDIPKTAFRTHNGHYEYLVMPFGLSNAPSTFQAIM 336
Query: 600 NWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
N S+ R G V+V+ D L+ + + + K A IL + V + K +
Sbjct: 337 N---SIFRPYLGKFVLVFFXDILIYSPNXNMHLEHVKQAFEILRQHQFFVKISKCAFGQX 393
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
L++LG + Q+ G I L + N+ L G+L +
Sbjct: 394 E-LEYLG--------HIVTXXGVQVDXGKIKAMLNWPRPTNISE---LHGFLGLTGYYRK 441
Query: 719 MGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTD 777
R + + +LL+ G T L+ + + P L+ P F I +D
Sbjct: 442 FVRNYGIIARALTNLLKKGQFAWTKDAETAFQALKQAMTSTPTLAMPNFNEPF--VIESD 499
Query: 778 ASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQ 832
A G G+ + +F+S +++W I +EM A+ A+ P L +Q
Sbjct: 500 ALGDGIGAVLTQQGKPIAFMSRALGVSKRSWSIYAREMLAIVHAIQTWRPYLLGRKFYIQ 559
Query: 833 SDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKS 892
+D +++ L ++ T V K LL D+ I + G NS ++LSR S
Sbjct: 560 TDQRSLKYLLEQRIATPEQQ--EWVAK--LLGYDYEI----TYKXGRENSAENALSRVVS 611
Query: 893 LP 894
P
Sbjct: 612 SP 613
>gi|157366222|gb|ABV45226.1| gag-pol polyprotein [Ascogregarina taiwanensis]
Length = 1535
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 105/454 (23%), Positives = 179/454 (39%), Gaps = 75/454 (16%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL--NLKGLN 522
LQ LA P+ ++ + +L+ V+ + PK GG+ PV KG
Sbjct: 630 LQRLANPLQEKLTQQLDPLLKANVI--------------VPPKPAGGSCPVFEKKKKGKG 675
Query: 523 Q----FLSPKKFSLINHFRIPSFLQK------GDYMISIDLSQAYFHVPIKTTHQRFLAL 572
Q + K + + +P L++ + SIDL A +++ I+ +++ A
Sbjct: 676 QMCLDYRQLNKMMKADAYPVPLLLERLQQVAHARWYASIDLQWACWNIKIQEESRQYTAF 735
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEI 632
+ T LPFG+ +P F + + + L + G V +Y DD ++ L
Sbjct: 736 VTSRGSFEFTVLPFGIKNSPAEFQRIMDSIFGDLYTNG--VSIYFDDIVIFADTKATLLE 793
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL 692
+ + + G V L KS L + LG + + R + D + + +R
Sbjct: 794 RLDIVLGRCCDEGLNVKLGKSELMKTEI-SMLGHI----VGRNGIYSDPRKVVA--VRET 846
Query: 693 LASKTWNLDSARSLLG---YLS-----FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPI 744
A K N D RSLLG YL FA V P+ L + + + T
Sbjct: 847 RAPK--NRDELRSLLGTVGYLRRFIPHFAELVFPLTELTKKGTRYEWD---------TAC 895
Query: 745 NPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDS------SFLSGL 795
A E + LSSP I TDAS +G G+ QV F S
Sbjct: 896 QKAFDILKEELATVVLLSSP--KGTGPFIIVTDASSVGIGNALLQVQDGDLVLIEFGSKK 953
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
+ +Q W ++E FA+ ++ +++ V V +D+++ L+ G +
Sbjct: 954 LTLAEQKWDTREREAFAIKWSVKQYEDYVKAGKVFVLTDHES----LKWMDGATT----G 1005
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
V++ L Q + + ++ + G N +AD LSR
Sbjct: 1006 RVQRWALYLQQFDLEVI--HVAGVVNVMADWLSR 1037
>gi|391345509|ref|XP_003747028.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 772
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 51/225 (22%), Positives = 101/225 (44%), Gaps = 8/225 (3%)
Query: 445 IVSGYAIPFSAKPPLVPLC-SLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
+ S + K + P C + LA P+ + ++ +L+ G L +++S+ + + +
Sbjct: 59 LCSKIEVDLRLKDNVTPTCLPARPLALPIRELVDKELERLLDNGTLYKVESSD-WATPIV 117
Query: 504 LVPKGNGGTRPVLNL-KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPI 562
+V K NG R + GLN+ L + L N I S +DL+ AY +P+
Sbjct: 118 VVRKANGQIRMCADYSTGLNEALRDVDYPLPNMEEIMSRFSGNRIFTQLDLADAYLQLPL 177
Query: 563 KTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
+++ ++ + + L FGL TAP F + + L V+VYLDD L+
Sbjct: 178 TAENRKLTTINTHRGLYQYNRLVFGLKTAPAIFQRTIDQALAGLEG----VLVYLDDILV 233
Query: 623 VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIM 667
+ + + + + + L G+ + +K + V ++LG++
Sbjct: 234 MAPEQELHDRRLRQVCEKLQEWGFHLRFEKCRFNSRTV-KYLGLI 277
>gi|82055772|sp|Q6XKE6.1|POLG_PVCV2 RecName: Full=Genome polyprotein; Contains: RecName: Full=Movement
protein; Short=MP; Contains: RecName: Full=Capsid
protein; Short=CP; Contains: RecName: Full=Aspartic
protease; Short=PR; Contains: RecName: Full=Reverse
transcriptase; Short=RT
gi|34328896|gb|AAO67368.1| polyprotein 1 [Petunia vein clearing virus]
Length = 2180
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 135/335 (40%), Gaps = 34/335 (10%)
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRF 569
G R V+N + LN FL KF + N + S L K DL ++ + I +
Sbjct: 1438 GKLRLVINYQPLNHFLQDDKFPIPNKLTLFSHLSKAKLFSKFDLKSGFWQLGIHPNERPK 1497
Query: 570 LALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
+PFGL TAP F + + + +VY+DD LL ++
Sbjct: 1498 TGFCIPDRHFQWKVMPFGLKTAPSLFQKA---MIKIFQPILFSALVYIDDILLFSE---T 1551
Query: 630 LEIQGKLA---VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK-QLTL 685
LE KL +S++ G +++ +K L+ + QFLG+ + D + P L L
Sbjct: 1552 LEDHIKLLNQFISLVKKFGVMLSAKKMILAQNKI-QFLGMDF---ADGTFSPAGHISLEL 1607
Query: 686 GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN 745
T L+ K + LG +++ IP H I + +L+ P
Sbjct: 1608 QKFPDTNLSVK-----QIQQFLGIVNYIRDFIPEVTEH---ISPLSDMLKKKPPAWGKCQ 1659
Query: 746 PAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGW---------GSQVDSSFLSGLW 796
+ +L+ A + S P + + + TDASD W G + F SG +
Sbjct: 1660 DNAVKQLKQL--AQQVKSLHIPSEGKKILQTDASDQYWSAVLLEEHNGKRKICGFASGKF 1717
Query: 797 SREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVM 830
+Q++H KE+ AV + N L+ ++ ++
Sbjct: 1718 KVSEQHYHSTFKEILAVKNGIKKFNFFLIHTNFLV 1752
>gi|291223447|ref|XP_002731721.1| PREDICTED: zinc finger protein-like [Saccoglossus kowalevskii]
Length = 1533
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 91/425 (21%), Positives = 182/425 (42%), Gaps = 40/425 (9%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+++ML+ G+++R DS + + + L+ K +G R + + LNQ + N +
Sbjct: 1129 EVEDMLKMGIIERSDSP--YAAPIVLIKKKDGKIRFCTDFRRLNQITVFDAEPMPNPEEL 1186
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L +G Y+ +DL++ ++ +P+ + A + +PFGL +P F+ +
Sbjct: 1187 FSNLAEGRYLTKLDLTKGFWQIPLTPGSKPKTAFLTPLGLFQYRVMPFGLVNSPSTFSRM 1246
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
+ + +VV +DD L+ N ++ E V +L LQ++ L+
Sbjct: 1247 MRVILGGMHG---KVVNLVDDILIYN---KLWEDHVCTLVEVLQ------RLQEAGLTVK 1294
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
P ++G + + + + I A + + R+LL Y + +P
Sbjct: 1295 PSKCYIGFSQLEFVGHVVGHGELRTMPSKINAMQNARRPETVTQVRALLAYAGYYRKFVP 1354
Query: 719 MGRLHSRRIQRQASLLRLGAPHLT---PINPAVLPKLEWWLNALP-LSSPIFPRQVQHFI 774
+ + L R G P ++ +L+ L+ P L P F R +
Sbjct: 1355 NFTAIAAPL---YDLTRKGLPKKVIWEMVHQLAFEQLKHALSNPPILKLPDFNRTF--VL 1409
Query: 775 STDASDLGWGSQV----DS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ 825
TDA+++G G+ + D +F S ++ + N+ + +KE A+ L L
Sbjct: 1410 RTDAAEVGLGAVLLQYYDEVAFPVAFASRKLTKAEVNYAVIEKECLALIWGLRKFQQYLH 1469
Query: 826 SSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
++ +++D++ ++ Y++ T S + + L Q + I+A I G++N AD
Sbjct: 1470 TASFWIETDHKPLI-YMKSAKFTN-----SRIMRWALYMQSFSYRIIA--IKGSHNIGAD 1521
Query: 886 SLSRS 890
LSRS
Sbjct: 1522 YLSRS 1526
>gi|341891414|gb|EGT47349.1| hypothetical protein CAEBREN_15307 [Caenorhabditis brenneri]
Length = 2250
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 94/469 (20%), Positives = 189/469 (40%), Gaps = 78/469 (16%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNL 518
+P + + + EML G+++ DS F + + LV K + + R ++
Sbjct: 1563 IPQAKIHRTPLEKRKEVETQVNEMLNQGIIRPTDSP--FAAPIVLVRKADKTSWRFTVDF 1620
Query: 519 KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDV 578
+ LN +P + + N I ++D Q + +P++ H + A + +
Sbjct: 1621 RALNAMTTPVQSVIPNIHEILDLCAGKTLYTTLDFQQGFHQIPVEPLHCQRTAFACHLGA 1680
Query: 579 LAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAV 638
+P GL +P F + N +L++ R+ VY+DD +L +++ AV
Sbjct: 1681 FEYVRMPMGLKGSPGTFQRVMN---NLIKEMRARIFVYIDDMVLTSEN----------AV 1727
Query: 639 SILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW 698
L + +++ ++ +G+ P + LPE K L + SK+
Sbjct: 1728 QHLKDVEEVLD----------KIEKIGMKLRPEKCKFALPEIKFL-------GFVISKSG 1770
Query: 699 ---NLDSARSLLGY-----LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
N + R++ Y + I M + R I A+ ++ AP + + P
Sbjct: 1771 IHPNPEKTRAIDEYPTPRTVKEVRAFIGMASFYRRFI---ANFSKIAAP-IMELTKKDKP 1826
Query: 751 KLEWW---------LNALPLSSPIF--PRQVQHF-ISTDASDLGWGSQV----DS----- 789
EW L ++PI P+ + F I D+S G G+ + D+
Sbjct: 1827 -FEWTTECQDAMTELKKALTTNPILMAPKLGKPFQIEVDSSGKGVGAVLMQAQDTEGKDK 1885
Query: 790 ---SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG 846
++ S +++ +Q + + E + A+ P + + ++ +D+ + S L R+
Sbjct: 1886 RVIAYASRVYTGAEQKYPAIELEALGLTYAVQQFRPYIDGAKTLIITDHSPLKSMLYRK- 1944
Query: 847 GTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
L + K ++ Q++ I I ++ PG N V D+LSR+ P+
Sbjct: 1945 -----DLFGRMGKFQIVLQEYDIEI--EYRPGKQNIVCDTLSRNHPRPN 1986
>gi|14575753|ref|NP_127504.1| ORF I polyprotein [Petunia vein clearing virus]
gi|82061579|sp|Q91DM0.1|POLG_PVCV1 RecName: Full=Genome polyprotein; Contains: RecName: Full=Movement
protein; Short=MP; Contains: RecName: Full=Capsid
protein; Short=CP; Contains: RecName: Full=Aspartic
protease; Short=PR; Contains: RecName: Full=Reverse
transcriptase; Short=RT
gi|14574598|gb|AAK68664.1| ORF I polyprotein [petunia vein clearing virus]
Length = 2179
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 135/335 (40%), Gaps = 34/335 (10%)
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRF 569
G R V+N + LN FL KF + N + S L K DL ++ + I +
Sbjct: 1437 GKLRLVINYQPLNHFLQDDKFPIPNKLTLFSHLSKAKLFSKFDLKSGFWQLGIHPNERPK 1496
Query: 570 LALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
+PFGL TAP F + + + +VY+DD LL ++
Sbjct: 1497 TGFCIPDRHFQWKVMPFGLKTAPSLFQKA---MIKIFQPILFSALVYIDDILLFSE---T 1550
Query: 630 LEIQGKLA---VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK-QLTL 685
LE KL +S++ G +++ +K L+ + QFLG+ + D + P L L
Sbjct: 1551 LEDHIKLLNQFISLVKKFGVMLSAKKMILAQNKI-QFLGMDF---ADGTFSPAGHISLEL 1606
Query: 686 GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN 745
T L+ K + LG +++ IP H I + +L+ P
Sbjct: 1607 QKFPDTNLSVK-----QIQQFLGIVNYIRDFIPEVTEH---ISPLSDMLKKKPPAWGKCQ 1658
Query: 746 PAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGW---------GSQVDSSFLSGLW 796
+ +L+ A + S P + + + TDASD W G + F SG +
Sbjct: 1659 DNAVKQLKQL--AQQVKSLHIPSEGKKILQTDASDQYWSAVLLEEHNGKRKICGFASGKF 1716
Query: 797 SREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVM 830
+Q++H KE+ AV + N L+ ++ ++
Sbjct: 1717 KVSEQHYHSTFKEILAVKNGIKKFNFFLIHTNFLV 1751
>gi|270016066|gb|EFA12514.1| hypothetical protein TcasGA2_TC001405 [Tribolium castaneum]
Length = 1453
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 44/203 (21%), Positives = 90/203 (44%), Gaps = 14/203 (6%)
Query: 426 LRRFVDAWIRLGAPAPLV--RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
++ + D + G P P RI +G ++P + P L + + I+ +
Sbjct: 562 VKEYRDLFAEFGPPTPYATHRINTGDSLPVAVSP--------YRLTCEKRKVLQMEIERL 613
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
L+ G+++ DS + S + +VPK NG R ++ + LN P + L +
Sbjct: 614 LQQGIIEECDS--AWASPVVMVPKANGTIRLCVDYRKLNAVTKPDVYPLPRLDDLLHATG 671
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
K + ++DL Y+ + ++ + + + T +PFGL AP +F L +
Sbjct: 672 KIGCITTLDLQAGYWQIQVEPGDRDKTSFICPFGLYRFTRMPFGLRNAPASFQRLMDKFK 731
Query: 604 SLLRSRGMRVVVYLDDFLLVNQD 626
+ + + ++ YLDD ++++ D
Sbjct: 732 TGIPD--VPILAYLDDLIIISPD 752
>gi|326670254|ref|XP_003199175.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Danio rerio]
Length = 1467
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 107/456 (23%), Positives = 176/456 (38%), Gaps = 78/456 (17%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P AMS +I E L G ++ ST+ + F V K +G RP ++ +GLN+
Sbjct: 266 LSQPEHKAMSEYIDEELAKGFIR--PSTSPASAGFFFVKKKDGSLRPCIDYRGLNE---- 319
Query: 528 KKFSLINHFRIP--------SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
+ FR P L+K Y +DL AY V I+ + A S
Sbjct: 320 ----ITVKFRYPLPLVPPALEQLRKARYYTKLDLRSAYNLVRIRAGDEWKTAFSTTRGHY 375
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
T +PFGL+ P F S N V + R + + + + ++ L
Sbjct: 376 EYTVMPFGLSNCPSVFQSFMNDVFRDMLDRWVIIYIDDILIYSNTMKEHVEHVRMVLQRM 435
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
I L L+K + FLG ++ + +T+ + + A + W
Sbjct: 436 IQHRL--YAKLEKCEFHQTQI-AFLG----------YVISAEGITMDDT--KVQAVQRWP 480
Query: 700 L-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWW--- 755
L + + L +L FA+F RR R S + AP LT + KL W
Sbjct: 481 LPQNLKELQRFLGFANFY--------RRFIRGFS--SIAAP-LTAMTKRNSHKLSWSSEA 529
Query: 756 ------LNALPLSSPIFPR---QVQHFISTDASDLGWGSQVD-----------SSFLSGL 795
L ++PI + + DAS+ G G+ + +F S
Sbjct: 530 RQAFSDLKTQFTTAPILRHPNPDLPFIVEVDASNTGVGAVLSQRQGQPSKMYPCAFFSRK 589
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQ--SSVVMVQSDNQTVVSYLRRQGGTKSLSL 853
+ ++N+ + +E+ A+ AL L+ S + +D++ + YLR K L+
Sbjct: 590 LTSAERNYDVGNRELLAMKLALEEWRHWLEGASQQFTILTDHKN-LEYLR---SAKRLNP 645
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+F R + + PG+ NS AD+LSR
Sbjct: 646 RQARWALFFT----RFDFIVTYRPGSKNSKADALSR 677
>gi|425856941|gb|AFX98089.1| pol protein [Simian foamy virus]
Length = 1141
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 116/244 (47%), Gaps = 16/244 (6%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I ++L+ GVL + +ST + ++ VPK +G R VL+ + +N+ + + I
Sbjct: 182 IDDLLKQGVLVQQNSTMN--TPVYPVPKPDGRWRMVLDYREVNKTIPLIAAQNQHSAGIL 239
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + + Y ++DL+ ++ PI A ++ G T LP G +P F +
Sbjct: 240 ATIVRKKYKTTLDLANGFWAHPITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTAD- 298
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
V LL+ V Y+DD L + DP+ Q + IL G++V+L+KS ++
Sbjct: 299 --VVDLLKEIS-NVQAYVDDIYLSHDDPQEHLNQLEKVFQILLQAGYVVSLKKSEIAQKT 355
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA-SFVIP 718
V +FLG ++ + E + LT + L + +L +S+LG L+FA +F++
Sbjct: 356 V-EFLGF----NITK----EGRGLTEAFKAKLLDITPPKDLKQLQSILGLLNFARNFILN 406
Query: 719 MGRL 722
L
Sbjct: 407 FAEL 410
>gi|189242199|ref|XP_001812244.1| PREDICTED: similar to protease, reverse transcriptase, ribonuclease
H, integrase [Tribolium castaneum]
Length = 1437
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 44/203 (21%), Positives = 90/203 (44%), Gaps = 14/203 (6%)
Query: 426 LRRFVDAWIRLGAPAPLV--RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
++ + D + G P P RI +G ++P + P L + + I+ +
Sbjct: 562 VKEYRDLFAEFGPPTPYATHRINTGDSLPVAVSP--------YRLTCEKRKVLQMEIERL 613
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
L+ G+++ DS + S + +VPK NG R ++ + LN P + L +
Sbjct: 614 LQQGIIEECDS--AWASPVVMVPKANGTIRLCVDYRKLNAVTKPDVYPLPRLDDLLHATG 671
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
K + ++DL Y+ + ++ + + + T +PFGL AP +F L +
Sbjct: 672 KIGCITTLDLQAGYWQIQVEPGDRDKTSFICPFGLYRFTRMPFGLRNAPASFQRLMDKFK 731
Query: 604 SLLRSRGMRVVVYLDDFLLVNQD 626
+ + + ++ YLDD ++++ D
Sbjct: 732 TGIPD--VPILAYLDDLIIISPD 752
>gi|18450261|ref|NP_569150.1| polyprotein, cleavage products include viral coat protein and
proteins with homology to an aspartic protease, reverse
transcriptase and RNase H [Banana streak OL virus]
gi|3183637|emb|CAA05264.1| polyprotein, cleavage products include viral coat protein and
proteins with homology to an aspartic protease, reverse
transcriptase and RNase H [Banana streak OL virus]
Length = 1832
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 122/549 (22%), Positives = 208/549 (37%), Gaps = 114/549 (20%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN-----------GGTR 513
L+H+ + M+ H+Q++LE V++ S++ + +V G G R
Sbjct: 1241 LKHVTPTMKETMAKHVQKLLELKVIR--PSSSKHRTTAMIVESGTEVDPMTGKERRGKER 1298
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1299 LVFNYKRLNDNTEKDQYSLPGINTIIKRIGNAKIYSKFDLKSGFHQVAMDPESIPWTAFW 1358
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + R + VY+DD L+ ++ +
Sbjct: 1359 AIDGLYEWLVMPFGLKNAPAIFQRKMD---NCFRGTEDFIAVYIDDILVFSETIHQHKEH 1415
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN------ 687
K ++I G +++ K + + FLG T+GN
Sbjct: 1416 LKKFMTICEKNGLVLSPTKMKIGTRQI-DFLG-----------------ATIGNSKIKLQ 1457
Query: 688 --ILRTLLASKTWNLDSARSL---LGYLSFASFVIP-----MGRLHS-------RRIQRQ 730
I++ ++ K L + L LG L++A IP +G L++ RR+ Q
Sbjct: 1458 PHIIKKIIEMKDEELKEVKGLRKWLGILNYARSYIPKLGKILGPLYAKTSPNGERRMNTQ 1517
Query: 731 ASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----Q 786
+ + A LP+LE LP P + I TD GWG +
Sbjct: 1518 DWKIVKEVKEVV----ANLPELE-----LP------PEKAIMIIETDGCMEGWGGVCKWK 1562
Query: 787 VDSSFLSGLWSREQQNWHINK---------KEMFAVHQALS-LNLPLLQSSVVMVQSDNQ 836
DS L WS + + K E+ AV +L + L +++++D+Q
Sbjct: 1563 TDS--LQPRWSEKICAYASGKFTPIKSTIDAEIQAVINSLDKFKIYYLDKKELIIRTDSQ 1620
Query: 837 TVVSYLRRQGGTK--SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL- 893
+VS+ ++ K + L+ + I + I + I G N +AD+LSR +
Sbjct: 1621 AIVSFYKKSSDHKPSRVRWLAFTDYITGTG----LEIKFEHIDGKDNVLADTLSRLVKII 1676
Query: 894 --PDWHLSR----SATEQIFLKWGVPCIDLFASRVSAVVPNHFQVSRHVAILLLLSSGRR 947
P+ H S +A E++F K RV+ VV + LS G R
Sbjct: 1677 LHPEKHQSEGVLINAVEEVFHKGNTDA----KQRVNDVVKRYED---------WLSKGYR 1723
Query: 948 VHDLTLLSL 956
+H + +L+L
Sbjct: 1724 LHQINVLTL 1732
>gi|270014460|gb|EFA10908.1| hypothetical protein TcasGA2_TC001734 [Tribolium castaneum]
Length = 1600
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/203 (22%), Positives = 93/203 (45%), Gaps = 14/203 (6%)
Query: 426 LRRFVDAWIRLGAPAPLV--RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
++ + D + G P P RI +G ++P A PP C + + + + I+ +
Sbjct: 822 VKEYRDLFAEFGPPTPYATHRINTGDSLPV-AVPPYRLTCEKRKV-------LQMEIERL 873
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
L+ G+++ DS + S + +VPK NG R ++ + LN P + L +
Sbjct: 874 LQQGIIEECDS--AWASPVVMVPKANGTIRLCVDYRKLNAVTKPDVYPLPRLDDLLHATG 931
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
K + ++DL Y+ + ++ + + + T +PFGL AP +F L +
Sbjct: 932 KIGCITTLDLQAGYWQIQVEPGDRDKTSFICPFGLYRFTRMPFGLRNAPASFQRLMDKFK 991
Query: 604 SLLRSRGMRVVVYLDDFLLVNQD 626
+ + + ++ YLDD ++++ D
Sbjct: 992 TGIPD--VPILAYLDDLIIISPD 1012
>gi|440790519|gb|ELR11801.1| hypothetical protein ACA1_362880 [Acanthamoeba castellanii str.
Neff]
Length = 439
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 144/339 (42%), Gaps = 31/339 (9%)
Query: 607 RSRGMRVVVYLDDFLLVNQDPRIL-EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG 665
R +G+ V+ YLDDF+++ D L +I + L LGW+ K + LG
Sbjct: 6 RRQGIIVMSYLDDFVVMAPDCEALQQIHDTVITLTLDRLGWLREPTKGEWELTQCAEVLG 65
Query: 666 IMWDPHLDRMWLPEDKQLTL-GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHS 724
++ D + + +PE K L G LR + +T L+G ++ S + L+
Sbjct: 66 LVVDLGMGLLRVPEPKVEALEGLCLRNVCDHRT--RRQLAELVGSVTAVSRAALVLHLYL 123
Query: 725 RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL--SSPIF-PRQVQHFISTDASDL 781
+ + G ++ A + W + + L SP++ P V F TDA D
Sbjct: 124 HSVYQLIGHNWRGWNRRVLMDVATCGDIAWIGSNIRLVAGSPLWRPSHVMVF-QTDACDT 182
Query: 782 GWGSQVDSSFLS--GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVV 839
GWG+ + L G + ++ W I+ KEM AV A+ + + V +SDN VV
Sbjct: 183 GWGACIPKVGLQARGAFMVDELAWPIHHKEMKAVKLAVEMLGHYVAGWWVEFESDNIMVV 242
Query: 840 SYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILA----------QFIPGAY-NSVADSLS 888
YL GG +++ WR+ + A Q+I G+ N AD LS
Sbjct: 243 VYLCDGGGP----------DLWMTDVMWRVWLRAVAKGCGVYNVQWIHGSMDNREADWLS 292
Query: 889 RSKSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVP 927
+ +W LS ++ ++G +D F + ++A P
Sbjct: 293 HYSNTNNWELSWDIVAELEQQFGGWDVDHFTNNLNAKAP 331
>gi|341889614|gb|EGT45549.1| hypothetical protein CAEBREN_25839 [Caenorhabditis brenneri]
Length = 1384
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/407 (20%), Positives = 163/407 (40%), Gaps = 72/407 (17%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSL 532
+A+S + + GVL +D ++ + + + +V K NG R + GLN + + L
Sbjct: 490 AAVSDELDRLTTQGVLAPVDHSS-WAAPIVIVKKKNGSIRMCADYSTGLNDSIEQHRHPL 548
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I + + G + IDL++AY + + + L+++ + + LPFG+ +AP
Sbjct: 549 PTAEDIFTIINGGKFFTQIDLAEAYLQIELDDQSKNLLSINTHKGIYQFQRLPFGVKSAP 608
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
F + + + + + V YLDD ++ I+ K + + G + L+K
Sbjct: 609 GIFQQVMDQLVNGIEG----VSAYLDDIIITGGTIEEHNIRLKKVMCRINEFGMRMKLEK 664
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
+ +FLG + D + R P+ +++ ++ + A K ++ +S LG + F
Sbjct: 665 CKFLMEEI-RFLGFIVDKNGRR---PDPEKIA---AIKDMPAPK--DVTQVKSFLGLIQF 715
Query: 713 -ASFVIPMGRLH---------------SRRIQ------RQASLLRLGAPHLTPINPAVLP 750
+FV + RL SR Q ++A L H P P +
Sbjct: 716 YGAFVKHLFRLRPPLDALTKKDTPFKWSRDCQHAFDKIKEALQSDLLLTHFDPTKPII-- 773
Query: 751 KLEWWLNALPLSSPIFPRQVQHFISTDASDLGW----------GSQVDSSFLSGLWSREQ 800
++ DAS G GSQ +S ++ Q
Sbjct: 774 -----------------------VAADASKDGIGGVLLHQYPDGSQKAVFHISKALNKAQ 810
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG 847
QN+ +KE FA+ A++ L +++D++ ++S + G
Sbjct: 811 QNYSQIEKEGFALITAVTKFHKYLHGRFFTLKTDHKPLLSIFGDKKG 857
>gi|392718245|gb|AFM82591.1| polyprotein [Cacao swollen shoot virus]
Length = 1872
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 95/452 (21%), Positives = 184/452 (40%), Gaps = 48/452 (10%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVL---KRLDSTTGFL--SRLFLVPKG----NGGTRPV 515
++HL + + HI+ +L+ GV+ K TT F+ S + PK +G R V
Sbjct: 1336 IKHLTPAMEKQFAKHIKALLDIGVIRPSKSRHRTTAFIVESGTTIDPKTKKTIHGKERMV 1395
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
N K LN ++SL I + DL + V + + A
Sbjct: 1396 FNYKRLNDNTEKDQYSLPGIHTILKRVGNKKIFSKFDLKSGFHQVAMAEESIPWTAFWVP 1455
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGK 635
+ +PFGL AP F + + + VY+DD L+ ++ E
Sbjct: 1456 QGLYEWLAMPFGLKNAPAIFQRKMD---VCFKGTEDFIAVYIDDILVFSETMEEHENHIS 1512
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
+ I G +++ K S++ + +FLG + + + + ++++ +++
Sbjct: 1513 RMLEICKRHGLVLSPNKMSIAQEEI-EFLGTI---------ISKGRMKLQAHVIKKIVSK 1562
Query: 696 KTWNLDSA---RSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
L + RS LG L++A IP +GR S A + G L + ++ +
Sbjct: 1563 AQMELSTTKGLRSFLGLLNYARIYIPNLGRKLS---PLYAKVSPTGEKKLNKQDWNLINE 1619
Query: 752 LEWWLNALPLSSPIFPRQVQHFISTDASDLGWG-------SQVDSSFLSGLWSREQQNWH 804
++ + LP I P + I +D GWG ++ DS + + +
Sbjct: 1620 IKQMVQKLP-DLDIPPIESCIVIESDGCMEGWGGICKWKKAKEDSRTTGRICAYASGKFG 1678
Query: 805 INKK----EMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK--SLSLLSEV 857
+ K E++A+ +AL + + L ++V++D Q +V++ + K + ++ +
Sbjct: 1679 VIKSTIDAEIYALIKALDAFKIFYLDKGHLIVRTDCQAIVTFYNKTNTHKPSRIRWITFL 1738
Query: 858 EKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ I L + + + I G N +AD+LSR
Sbjct: 1739 DYITGLG----VSVTIEHIDGKDNHLADTLSR 1766
>gi|254587271|emb|CAX83692.1| Gap-Pol polyprotein [Schistosoma japonicum]
Length = 1350
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/254 (27%), Positives = 119/254 (46%), Gaps = 29/254 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSLINHFRI 538
+Q + + GVL + S + + + + +V K G R + GLN L + L +
Sbjct: 507 LQRLEKQGVLMPV-SFSAWAAPIVVVKKPKGTLRICADFSTGLNDALEQHHYPLPAPDDL 565
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
+ L G + +DL+ AY V ++ T + L ++ + + T LPFG+ TAP F L
Sbjct: 566 FTILNGGSFFAKLDLADAYLQVEVEPTCRELLTINTHRGLFQYTRLPFGVKTAPAIFQQL 625
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI--LGSLGWIVNLQKSSLS 656
+ + LL G V VYLDD L+V + P E+ +LA + + G+ + +K
Sbjct: 626 MDTI--LLDLTG--VAVYLDDILVVAESPN--ELYNRLATVLKQIEDHGFHLRPEKCQFY 679
Query: 657 PAPVLQFLGIMWDPHLDRMWLPED----KQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
V ++LG ++D R PE+ +++ +++ TL RS LG +S+
Sbjct: 680 LTSV-KYLGFIFDKS-GRRPDPENIEAIQKMPAPHVVPTL-----------RSFLGLISY 726
Query: 713 ASFVIPMGRLHSRR 726
S +P LH +R
Sbjct: 727 YSAFLP--SLHEKR 738
>gi|3342816|gb|AAC27711.1| polyprotein [Rice tungro bacilliform virus]
Length = 1675
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 168/393 (42%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++K+ T + F+V R V N K LN + F++ +
Sbjct: 1202 QIKELLDNKLIKKASPTCRHRTAAFIVRNHAEEVAQKPRIVYNYKRLNDNMHTDPFNIPH 1261
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + +QK + DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1262 KISMINLIQKANIFSKFDLKAGFHHMKLKEDFKDWTTFTCSEGLFTWNVCPFGIANAPCA 1321
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + K+ + + +G +++ +KS
Sbjct: 1322 FQRFMQESFGDLKF----ALLYIDDILIASSNEQEHIKHLKIFFNRVKEVGCVLSKRKSK 1377
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + ++ SK L ++ LG L+
Sbjct: 1378 MFLKEV-EYLGVE---------IKEGKISLQPHIVEKIKRFDKSKLSTLKGLQAYLGLLN 1427
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A S++ + +L ++ + G + ++ K+E ++ + PL P
Sbjct: 1428 YARSYIKDLSKLVGPLYKKTG---KSGQRSFNKEDWNIIFKIEREISKIKPLERPKESDY 1484
Query: 770 VQHFISTDASDLGWGSQV----------DSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG+ + D+ ++G S E++ W E+ A+++A
Sbjct: 1485 I--IIETDASEEGWGAVLLCKPDKYASKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1542
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1543 LN-KFQIYLDKDFTIRTDCEAIVKGIKTEDYKK 1574
>gi|313242534|emb|CBY34671.1| unnamed protein product [Oikopleura dioica]
Length = 1339
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 96/432 (22%), Positives = 176/432 (40%), Gaps = 41/432 (9%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSLINHFRI 538
+ + E G+L + G+ + + V K +GG R V+NL +N+ L+ + +
Sbjct: 381 VNTLKEQGILIPCPDSKGWNTPVSAVGKRDGGVRLVMNLNLTINKLLTETDTYSLPYLDE 440
Query: 539 PSFLQKG-DYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+ + G + S+DL+ Y+++ IK Q ++ +NG+ L T PFG+ + F
Sbjct: 441 STEIPVGMKFFGSLDLASGYYNIAIKQEDQVKTSIHWNGEQLMFTRCPFGMRHSGNLFCR 500
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ +++R V V++DD + D + + + ++ G++V +K
Sbjct: 501 ALHHALHKMKNR-QHVTVFVDDLCIHTPDFQSFCSTLRELLQLIHEYGFVVKGRKV---- 555
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
L F I W L R+ E ++ N+ L + N +SLLG L++
Sbjct: 556 --CLLFPEIRW---LGRLISAEGQRPDPENVETILKMNPPTNFKGLQSLLGMLNWVRQFC 610
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP-------KLEWWLNALPLSSP---IFP 767
+ Q ++L+R L IN P + L LSSP FP
Sbjct: 611 SIKSGDDISTQNFSTLIR-PITALVKINRPRGPFTWNREHTAAFNLIKQKLSSPEMIYFP 669
Query: 768 RQVQHFI-STDASDL--GW-------GSQVDSSFLSGLWSREQQNWHINKKEMFAVHQAL 817
F+ TDAS + GW G S ++ Q + ++E A+ A+
Sbjct: 670 DFSLPFVLCTDASSVASGWCLLQIHEGKSRIVRVGSKTFTPAQTRYSATEREALAICTAV 729
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
+ + +++D+Q ++ + L+ + +L D+ I ++P
Sbjct: 730 GDCRTYIFGTPFTIRTDHQALIYIDAKISKNDKLARWAS----YLSQYDFVI----TYLP 781
Query: 878 GAYNSVADSLSR 889
G N VAD LSR
Sbjct: 782 GEENIVADYLSR 793
>gi|224146803|ref|XP_002336340.1| predicted protein [Populus trichocarpa]
gi|222834762|gb|EEE73225.1| predicted protein [Populus trichocarpa]
Length = 556
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/356 (23%), Positives = 147/356 (41%), Gaps = 46/356 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGN----GGTRPVLNLKGLNQ------FLSPKK 529
IQE+L+ +++ S + S FLV K + G R V+N K LN +L P+K
Sbjct: 211 IQELLDANLIE--PSESPHFSPAFLVNKHSEQKRGKRRMVINYKKLNDHTIGDGYLLPRK 268
Query: 530 FSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
L++ R S D ++ V + + Q+ A + +PFGL
Sbjct: 269 DELLDQIRGKKIFS------SFDCKSGFWQVLLDNSSQKLTAFTCPQGHFQWKVMPFGLK 322
Query: 590 TAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN 649
AP F + + + VY+DD ++ + + + + +G I++
Sbjct: 323 QAPSIF---QRHMDQTFKGFELFCRVYVDDIIVFSDNDKEHIGHVTKVLDRCKEVGVILS 379
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA--SKTWNLDSARSLL 707
L K+ L + FLG++ D + + + G+I L A SK + + L
Sbjct: 380 LPKAQLFKESI-NFLGLIID---------KGQIMLQGHIGDNLSAFNSKITDKKQLQRFL 429
Query: 708 GYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFP 767
G L++ S P ++ R QA L + + + + K++ + LP P
Sbjct: 430 GILNYISHFCP--KVAQIRQPLQAKLKKDTIWQWSKEDTDYVEKIKKAVKHLPPVHHPGP 487
Query: 768 RQVQHFISTDASDLGWGS----------QVDSSFLSGLWSREQQNWHINKKEMFAV 813
++ I TDASD WG ++ + SG + +QN+H N+KE+ A+
Sbjct: 488 KEPL-IIETDASDKYWGGILKAQPMEGPELICGYASGTFKTAEQNYHSNEKELLAL 542
>gi|327272201|ref|XP_003220874.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Anolis carolinensis]
Length = 738
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 104/461 (22%), Positives = 169/461 (36%), Gaps = 103/461 (22%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
A+ +P L L P A+ + E L G ++ S T + +F V K G R
Sbjct: 15 AEGAKLPAGRLYALTVPERQALREFLDENLAKGFIRPSSSPTA--APVFFVAKKTGELRL 72
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
V + + LN++ ++ L + S +Q +DL AY + I+ + A +
Sbjct: 73 VCDYRILNKYTIQDRYPLPLISELLSRVQGAKVFTKLDLRGAYNLIRIREGDEWKTAFNT 132
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQG 634
+PFGL AP F N V L + + V+YLDD L+ ++D +
Sbjct: 133 CFGCHEFRVMPFGLCNAPAVFQRFMNDVFRDLIDQFL--VIYLDDILIFSKDEKEHRQHV 190
Query: 635 KLAVSILGSLGWIVNLQKSSLSPAPVLQFLG-------IMWDPHLDRMWLPEDKQLTLGN 687
K + L + G K P ++FLG + DPH
Sbjct: 191 KQVLHRLRANGLFAKASKCVFH-VPEVEFLGHVVSGRELKMDPH---------------- 233
Query: 688 ILRTLLASKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
+ A +W L + + + +L FA++ R P+ P
Sbjct: 234 ---KVDAVNSWQELKTKKDVQRFLGFANY------------------YREFIPNFHP--- 269
Query: 747 AVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG---SQVDSSFL---SGLWSRE- 799
+ P + DAS G SQ DSS G +SR+
Sbjct: 270 -------------DVDKPF-------VVEADASSYALGAVLSQKDSSGTLRPCGFYSRQL 309
Query: 800 ---QQNWHINKKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLL 854
+QN+ I +KE+ A+ A + L+ + ++V+SD+ K+L L
Sbjct: 310 TPFEQNYTIWEKELLAIKVAFEVWRHWLEGARHQIVVRSDH-------------KNLEHL 356
Query: 855 SEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSRS 890
+K+ W R + QF+ G N AD+LSR
Sbjct: 357 QTAKKLNQRQIRWALFFSRFNFKVQFVEGKANLRADALSRK 397
>gi|388855184|emb|CCF51315.1| uncharacterized protein [Ustilago hordei]
Length = 1304
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 123/311 (39%), Gaps = 27/311 (8%)
Query: 552 DLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF---ASLSNWVASLLRS 608
DL A+ HV L SYNG L FG +++P F A +W+ +
Sbjct: 634 DLEDAFRHVVTAERDSHLLGFSYNGVRYRENALTFGGSSSPWLFNLVAEFLHWLVAACLP 693
Query: 609 RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ---KSSLSPAPVLQFLG 665
V YLDD P L + IL + L+ K + + L+ LG
Sbjct: 694 ADWPVNHYLDDTF--GAVPVSHTTHALLPIHILALAANALGLRLSPKKTFGTSTRLEVLG 751
Query: 666 IMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSR 725
I D + + +D++ + +LL + +L + + G L F S V P G+ R
Sbjct: 752 IEIDTVAQTVGITDDRRHHILAQCHSLLQHHSVDLLDMQWIAGLLQFVSQVFPCGKAFLR 811
Query: 726 RIQRQASLLRLGAPHLTPINPAVLPKLE--WWLNALPLSSPI-----FPRQVQHFISTDA 778
R+ G HLT L +LE WW N L S P H I TDA
Sbjct: 812 RLYDTTRRALPGKHHLT-----RLARLELLWWCNILERWSGTSVLSPSPLTAAH-IWTDA 865
Query: 779 SDLGWGSQVD-----SSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQS-SVVMVQ 832
G+G + ++ + + R + +I E AV +AL LPL ++V+V
Sbjct: 866 CPKGYGGYLGLDTSPTAVFAKIVPRRHRRKNIRFLEALAVLEALRCFLPLWSGPTLVVVH 925
Query: 833 SDNQTVVSYLR 843
DN+ V LR
Sbjct: 926 VDNENVEHGLR 936
>gi|321462636|gb|EFX73658.1| hypothetical protein DAPPUDRAFT_252904 [Daphnia pulex]
Length = 562
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 71/146 (48%), Gaps = 3/146 (2%)
Query: 390 GRVSLKVQTLQKPQRCSSPVNPPA-DSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVS- 447
GR + +P R S+ N DS +GGR++ F + W L ++ VS
Sbjct: 275 GREHGRYNGRFQPYRGSNEFNKSVKDSTGDIHHLGGRVKLFSEFWPTLTQDRWVLEAVSL 334
Query: 448 GYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDS-TTGFLSRLFLVP 506
G IPF +P + ++ V + I+ ++E +K + F+S LF++P
Sbjct: 335 GVRIPFLERPTVPFYLDNMRMSEKVMAICDEEIKALIEKEAIKEVAGPEQRFVSGLFVIP 394
Query: 507 KGNGGTRPVLNLKGLNQFLSPKKFSL 532
K +GG RP++NLK LN+ + PK F +
Sbjct: 395 KSSGGYRPIINLKRLNRLVEPKHFKM 420
>gi|348545031|ref|XP_003459984.1| PREDICTED: alpha-2-macroglobulin-like, partial [Oreochromis
niloticus]
Length = 897
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 63/137 (45%), Gaps = 4/137 (2%)
Query: 424 GRLRRFVDAWIRLGAPAPLVRIVS-GYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQE 482
G L ++ W L AP +VR + GY + F+ PP + + I
Sbjct: 40 GPLSARIERWRALAAPEWVVRTLERGYRLQFATTPPRFHRVIFSLAKGESARILQEEITT 99
Query: 483 MLETGVLKRL---DSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+L G + + S F S+ FLVPK GG R +L+L+ LNQ L KF ++ +
Sbjct: 100 LLPKGAAREIAQEQSQCSFYSKYFLVPKKGGGLRLILDLRALNQHLWSYKFRMLTTSTLL 159
Query: 540 SFLQKGDYMISIDLSQA 556
++ GD+ S+DL A
Sbjct: 160 HVVRPGDWFTSMDLKDA 176
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 873 AQFIPGAYNSVADSLSRSK-SLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVPNHFQ 931
A IPGA AD +SR DW L QI+ +G P +DLFASR +A P +F
Sbjct: 190 ATHIPGALKLGADLMSRGNPQYGDWTLHPHVVTQIWTHFGHPQVDLFASRENAQCPLYFS 249
Query: 932 VSRHVAIL 939
+ A L
Sbjct: 250 LKDQEAPL 257
>gi|116195346|ref|XP_001223485.1| hypothetical protein CHGG_04271 [Chaetomium globosum CBS 148.51]
gi|88180184|gb|EAQ87652.1| hypothetical protein CHGG_04271 [Chaetomium globosum CBS 148.51]
Length = 931
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 106/445 (23%), Positives = 171/445 (38%), Gaps = 66/445 (14%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
++++ LE G ++ S G+ + VPK +G R ++ + LN ++ L R+
Sbjct: 29 YLEKNLEIGHIRPSTSPAGYP--VLFVPKKDGKLRMCVDYRQLNNETVKNRYPLPLISRL 86
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF-AS 597
L + +DL AY H+ IK + A +PFGL AP F A+
Sbjct: 87 RDQLSGAQHFTRLDLPTAYAHIRIKEGDEWKTAFRTPNGHYEYLVMPFGLTNAPATFQAA 146
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL---GWIVNLQKSS 654
+ + L V YLDD L+ + + LE + +L +L VN KS
Sbjct: 147 IDQAIRHCL---DKFAVCYLDDILIYS---KTLEEHKEHVRQVLDALHEHKLSVNKDKSE 200
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGYL 710
+ FLG P ++ PE L A +TW N R +G+
Sbjct: 201 FHVKKTV-FLGYEISPGWVKI-EPE-----------KLEAGRTWPTPTNATEVRGFIGFA 247
Query: 711 SFAS-FVIPMGRLHS--RRIQRQASLLRLGAPH---LTPINPAVLPKLEWWLNALPLSSP 764
+F F+ G + + ++ + + H I A+ L LS P
Sbjct: 248 NFVRIFIKNFGEIARPLHELTKKDTTFQWKQEHEQAFQRIRDAITAD-----PVLMLSDP 302
Query: 765 IFPRQVQHFISTDASDLGWGSQVDS----------SFLSGLWSREQQNWHINKKEMFAVH 814
P +V+ DASD G Q+ +F S + N+ I+ KE+ A+
Sbjct: 303 SKPFEVE----ADASDFAIGGQLGQRDKDGKLHPVAFFSKKLEGPRLNYPIHDKELLAII 358
Query: 815 QALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL 872
+A P L + V V +D++ LR TK L+ FL ++ IH
Sbjct: 359 EAFQEWRPYLSGTTHEVQVYTDHKN----LRYFTTTKVLNGRQTRWAEFLSEFNFTIH-- 412
Query: 873 AQFIPGAYNSVADSLSRSKSLPDWH 897
+ G+ N A + RS DW+
Sbjct: 413 --YKKGSEN--ARNAVRSSGNNDWN 433
>gi|28195287|gb|AAO27306.1| TyB3p [Saccharomyces paradoxus]
Length = 1255
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 91/431 (21%), Positives = 176/431 (40%), Gaps = 57/431 (13%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QE+L+ + + S + S + LVPK +G R ++ + LN+ F L +
Sbjct: 339 VQELLDNKFI--VPSKSPCSSPVVLVPKKDGTFRLCVDYRALNKVTISDPFPLPRIDNLL 396
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S + ++DL Y +P+ + A T +PFGL AP FA
Sbjct: 397 SRIGNAQIFTTLDLHSGYHQIPMDPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFAR-- 454
Query: 600 NWVASLLRSRGMRVV-VYLDDFLLVNQDPRILEIQGKLAVSILGSL---GWIVNLQKSSL 655
++A + R +R V VYLDD L+ ++ E K ++LG L IV +K
Sbjct: 455 -YMADIFRD--LRFVNVYLDDILIFSESQ---EEHWKHLDTVLGRLKKENLIVKKKKCKF 508
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
+ + +FLG ++ + ++ + K + + K + A+ LG +++
Sbjct: 509 ASEQI-EFLG--YNIGIQKITPLQHKCAAIRDF------PKPRTVKQAQRFLGMINYYRR 559
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIS 775
IP + +I + L T + KL++ L P+ P F + + ++
Sbjct: 560 FIP----NCSKIAQPIQLFICDKSQWTEEQDKAIEKLKFALCNSPVLVP-FNNKAIYRLT 614
Query: 776 TDASDLGWGSQVDS-----------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS G G+ ++ + S Q+N+ + E+ + +AL +L
Sbjct: 615 TDASKDGIGAVLEEVNAKNALVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYML 674
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW-----RIHILAQFIPGA 879
+++D+ +++S + + + Q W + +++ G
Sbjct: 675 HGKHFTLRTDHISLLSLQNKNEPARRV-------------QRWLDDLATYNFTLEYLAGP 721
Query: 880 YNSVADSLSRS 890
N VAD++SR+
Sbjct: 722 KNVVADAISRA 732
>gi|443918479|gb|ELU38935.1| reverse transcriptase (RNA-dependent DNA polymerase)
domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 1560
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 90/380 (23%), Positives = 159/380 (41%), Gaps = 42/380 (11%)
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
L + + D+ AY +PI + Q +++ ++ C PFG A++ FA +
Sbjct: 725 LPEASMAATFDVDAAYRCIPIHPSDQSSTIVAWKDNLYVDHCAPFGAASSNGLFARCGDA 784
Query: 602 VASLLR-SRGMRVVVYLDDFLLVNQDPRI--LEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
+ +L S G RVV ++D++++V P + + ++ LGW K+ + A
Sbjct: 785 MLMILEASLGCRVVKWVDNYVIVRPPPGFPGGDTSKQDIYNLALPLGWPWKSSKTK-NFA 843
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
FLG W + +P K+ + + T L + +L + L+G L + V+
Sbjct: 844 YAFDFLGFYWCIPKREVSIPLKKRDKFLSKICTWLDADRVSLKLTQQLIGSLVHCTNVVV 903
Query: 719 MGRLHSRRIQRQASLLRLGA--PHLTPINPAVLPKLE-------WWLNALPLSS------ 763
GR A L+R A PH PK WW N L +S
Sbjct: 904 EGR------AWLAGLIRFSAAFPHEHASRFVSRPKPTYAVHDALWWQNRLASASCTRNIS 957
Query: 764 ---PIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNW-----HINKKEMFAVHQ 815
FP + F ++ G VD+ + + W R Q W +I E+ A+
Sbjct: 958 PPPTAFPSE---FFMDASTSFGIAIIVDNHWAA--W-RLLQGWKSGGRNIGWAEISALEM 1011
Query: 816 ALSLNLPL-LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQ 874
L + L+ S++ +SDNQ VV + G +++L + +++IF S + + I
Sbjct: 1012 TLEAAIAYGLRDSLLHFRSDNQGVV-FAMAAGRSRNLEQNNAIKRIFARSSLFGLRIQTS 1070
Query: 875 FIPGAYNSVADSLSRSKSLP 894
+I + ++ AD SR +P
Sbjct: 1071 YI-ASEDNPADPPSRGMPIP 1089
>gi|427798379|gb|JAA64641.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 636
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 100/433 (23%), Positives = 176/433 (40%), Gaps = 47/433 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+++ML V++ S + + S + LV K +G R ++ + LN+ + L
Sbjct: 126 VEDMLRRDVIR--PSHSPWASPVVLVRKKDGSIRFCVDYRRLNKITRKDVYPLPRIDDAI 183
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L ++ S+DL Y+ VP+ ++ A + +PFGL AP F L
Sbjct: 184 DTLHGAEFFSSLDLRSGYWQVPMADADRQKTAFITPDGLYEFNVMPFGLCNAPATFERLM 243
Query: 600 NWVASLLRSRGMR---VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
+ RG++ + YLDD ++ + D ++ + ++ L + G +NL+K +
Sbjct: 244 DNTL-----RGLKWTMCLCYLDDVVVFSHDFPTHLLRLRHVLTCLTNAGLQLNLKKCRFT 298
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
A L LG + H LP+ +L K + RS +G S F F
Sbjct: 299 -ARELVILGHIVSKH---GVLPDPAKLRAVAEF-----PKPTTMKELRSFVGLCSYFRRF 349
Query: 716 VIPMGRLHSRRIQRQASLLRLGA-PHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + S Q + L + H + L +L L P P +V
Sbjct: 350 VRNFASIMSPLTQLLRGDVNLSSWSHACDVAFTTLRRLLTSPPILRHFDPTAPTEVH--- 406
Query: 775 STDASDLGWG----------SQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G S+ ++ S ++ + N+ + +KE A+ AL P L
Sbjct: 407 -TDASGVGLGAVLAQRKPGYSEYVVAYASRTLTKAEANYSVTEKECLALVWALGKFRPYL 465
Query: 825 QSSVVMVQSDNQTV--VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNS 882
+ +D+ + +S L+ G + + L Q++ I ++ + G +S
Sbjct: 466 YGRPFYLVTDHHALCWLSTLKDPSG--------RLARWALRIQEYDIRVV--YRCGRKHS 515
Query: 883 VADSLSRSKSLPD 895
AD+LSRS PD
Sbjct: 516 DADALSRSPLPPD 528
>gi|294949864|ref|XP_002786362.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
gi|239900615|gb|EER18158.1| retrovirus polyprotein, putative [Perkinsus marinus ATCC 50983]
Length = 356
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 143/330 (43%), Gaps = 49/330 (14%)
Query: 584 LPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGS 643
+PFGL +A F L + V + L V VYLDD L+ + D E + + L +
Sbjct: 1 MPFGLCSAGATFQRLMDQVLNGLPF----VRVYLDDILVFSPDAETHEDHLRQVFARLRA 56
Query: 644 LGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSA 703
G ++ +K P + +LG ++D + R P+ ++ ILR + N+
Sbjct: 57 WGLTLSAEKCEFG-CPSVPYLGHIFDGNGMR---PDPTKVE--AILRW---PRPGNVAEI 107
Query: 704 RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT------PINPAVLPKLEWWLN 757
RS LG + +P +R IQR S +G+ L L+ L
Sbjct: 108 RSFLGLAGYYRNFVPNFSDVARPIQRLVS--EVGSETLALDTYWGQEQEESFRALKLRLA 165
Query: 758 ALP-LSSPIFPRQVQHFISTDASDLGWGSQVDSS-----FLSGLWSREQQNWHINKKEMF 811
ALP L+ P F + + TDASD G+ + F S + Q NWH +KE +
Sbjct: 166 ALPFLAYPDF--GIPFELYTDASDYAIGAVLMQEGRPLGFFSRTLTGSQLNWHTYEKEAY 223
Query: 812 AVHQAL------SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQ 865
+ QAL + PL V V +D++ +++L + G K +E+ L Q
Sbjct: 224 GILQALIYFQHYHIGYPL----TVTVYTDHEP-LTWLAKAGSKK-------LERWLLAMQ 271
Query: 866 DWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
+ + +++PG N AD+LSR + L D
Sbjct: 272 AY--SFIVKYVPGKKNVCADALSRIRQLDD 299
>gi|307201692|gb|EFN81403.1| hypothetical protein EAI_09447 [Harpegnathos saltator]
Length = 251
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 76/153 (49%), Gaps = 3/153 (1%)
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQS 833
I +D S GWG+ G WS E + HIN E+ A AL L ++++
Sbjct: 27 IFSDVSLTGWGASCGMQRTHGWWSIEDRALHINALELKAAFHALKCFALHLHDCRILLRI 86
Query: 834 DNQTVVSYLRRQGGTKSLSLLSEVEK-IFLLSQDWRIHILAQFIPGAYNSVADSLSR-SK 891
DN T +SY+ R G + LLS++ + ++ + I + A +I N +AD+ SR S
Sbjct: 87 DNTTAISYINRFGSVQ-YPLLSDLARDMWKWCEKRHILLFASYIASIDNVIADAESRISD 145
Query: 892 SLPDWHLSRSATEQIFLKWGVPCIDLFASRVSA 924
+ +W LS A + +G IDLFAS ++A
Sbjct: 146 TNTEWSLSEQAFRAVEGVFGPFDIDLFASIINA 178
>gi|270014461|gb|EFA10909.1| hypothetical protein TcasGA2_TC001735 [Tribolium castaneum]
Length = 1515
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/203 (22%), Positives = 92/203 (45%), Gaps = 14/203 (6%)
Query: 426 LRRFVDAWIRLGAPAPLV--RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
++ + D + G P P RI +G ++P A PP C + + + + I+ +
Sbjct: 561 VKEYRDLFAEFGPPTPYATHRINTGDSLPV-AVPPYRLTCEKRKV-------LQMEIERL 612
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
L+ G+++ DS + S + +VPK NG R ++ LN P + L +
Sbjct: 613 LQQGIIEECDS--AWASPVVMVPKANGTIRLCVDYHKLNAVTKPDVYPLPRLDDLLHATG 670
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
K + ++DL Y+ + ++ + + + T +PFGL AP +F L +
Sbjct: 671 KIGCITTLDLQAGYWQIQVEPGDRDKTSFICPFGLYRFTRMPFGLRNAPASFQRLMDKFK 730
Query: 604 SLLRSRGMRVVVYLDDFLLVNQD 626
+ + + ++ YLDD ++++ D
Sbjct: 731 TGIPD--VPILAYLDDLIIISPD 751
>gi|440922552|gb|AGC25937.1| hypothetical protein, partial [Piper DNA virus 1]
Length = 2027
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 93/447 (20%), Positives = 179/447 (40%), Gaps = 74/447 (16%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLV----PKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ +++R ST+ S F+V + G R V N K LN ++L +
Sbjct: 937 QIKELLDLKLIRR--STSPHRSAAFIVRNHAEQKRGKARIVYNYKRLNDNTHDDAYNLPH 994
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + +Q DL Y + +K + + A + + L FGL AP
Sbjct: 995 KDSILNLIQNKKIFSKFDLKSGYNQIKMKEEDRPWTAFTCPEGLFEWNVLSFGLKNAPAI 1054
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F ++ SL + +VY+DD L+ + + +L + G +++ +K+
Sbjct: 1055 F---QRFMDSLFKKYEF-CIVYIDDILVASDTVQEHIKHLELVFKTIKEAGIVISKKKTE 1110
Query: 655 LSPAPVLQFLG-------IMWDPHLDRMWLPEDKQLTLGNILRTL-LASKTWNLDSARSL 706
++ + FLG I PH+ + + L K N + +S
Sbjct: 1111 IAKT-YINFLGLKIGKGQIELQPHI---------------VTKALEYPDKIENKNKLQSF 1154
Query: 707 LGYLSFASFVIP-----MGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-P 760
LG L++A IP +G L+S+ L + G + + ++ K++ + L P
Sbjct: 1155 LGLLNYARKFIPNLSKLVGPLYSK-------LRKNGQIYFNNEDVRLVQKIKNEIKQLKP 1207
Query: 761 LSSPIFPRQVQHFISTDASDLGWG----------SQVDSSFLSGLWSR--EQQNWHINKK 808
L P+ I D+S GWG ++ D+ + S + NW
Sbjct: 1208 LELPL--ENYYKVIEVDSSQDGWGAILITKPNEYAEKDTEKICAYASGNFKNNNWTSLDM 1265
Query: 809 EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKS-----LSLLSEVEKIFLL 863
E+ V A++ L + +++D + +V +++ + K L+ + ++ + +
Sbjct: 1266 EIQGVINAIN-TFKLYLNKKFTLRTDCENIVKFMKNENSRKHNSKRWLNFQNAIQGMGYV 1324
Query: 864 SQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + I G N++AD LSR
Sbjct: 1325 -------VKFEHISGKSNTLADILSRK 1344
>gi|315042437|ref|XP_003170595.1| hypothetical protein MGYG_09171 [Arthroderma gypseum CBS 118893]
gi|311345629|gb|EFR04832.1| hypothetical protein MGYG_09171 [Arthroderma gypseum CBS 118893]
Length = 1868
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 69/150 (46%), Gaps = 4/150 (2%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+I + LE G ++ S G+ + VPK +G R ++ + LN ++ L I
Sbjct: 884 YIDKNLEKGYIRESTSPAGYP--IIFVPKKDGSLRLCVDYRQLNDITIKNRYPLPLIDEI 941
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
L+ + ++D+ +AY+ + IK + A + +PFGL AP F +
Sbjct: 942 QDRLKGTTWFTALDIREAYYRIRIKEGEEWKTAFRTRFGLYEYQVMPFGLTNAPATFQAF 1001
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
N V L + V+VYLDD L+ ++ R
Sbjct: 1002 INNV--LREYLDIFVIVYLDDILIFTKEDR 1029
>gi|68362780|ref|XP_690943.1| PREDICTED: hypothetical protein LOC562472 [Danio rerio]
Length = 1496
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 83/182 (45%), Gaps = 19/182 (10%)
Query: 444 RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLF 503
+I + P +P VP + L P+ + + MLE +++ ST+ + S +
Sbjct: 1038 KICLTESTPIRQRPYRVP----ESLIKPLKEELKM----MLEMDIIE--PSTSAWSSPIV 1087
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHV 560
+VPK +G R L+ + LN + KF RI +++ Y+ ++DL + Y+ V
Sbjct: 1088 IVPKKDGTLRVCLDFRKLN---AVSKFDTYPMPRIDELVERIGRAKYITTLDLCKGYWQV 1144
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
P++ T + A + +PFGL AP F L N V LR+ YLDD
Sbjct: 1145 PLEKTSRECTAFRTPVGLYHFKTMPFGLHGAPATFQRLMNQV---LRNCEEYSAAYLDDV 1201
Query: 621 LL 622
++
Sbjct: 1202 VI 1203
>gi|281211420|gb|EFA85584.1| hypothetical protein PPL_01367 [Polysphondylium pallidum PN500]
Length = 1436
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 94/408 (23%), Positives = 166/408 (40%), Gaps = 57/408 (13%)
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
+GG R ++ + LN + L N + + + G IDL Q Y + + Q
Sbjct: 404 DGGWRLCVDYRSLNGITIKDTYPLPNITEVLNNTRDGVLFSKIDLLQGYHQIRVHEKDQS 463
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR--GMRVVVYLDDFLL-VNQ 625
A + V LPFGL AP F L + S+ + +++VYLDD L+ N
Sbjct: 464 KTAFRTSFGVFQYIVLPFGLTNAPACFQRL---MDSIFQRHVIAKKLLVYLDDLLIKTNI 520
Query: 626 DPRILEIQGKLA-VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLT 684
D IQ L V +L + L K L++LG H+ + +K +
Sbjct: 521 DDEDKHIQDVLEIVDLLNQNKLKIKLTKCIFGQYS-LEYLG-----HI----IGHNKLIP 570
Query: 685 LGNILRTLLASKTWNL-DSARSLLGYLS----FASFVIPMGRLHS--RRIQRQASLLRLG 737
+ + +LA K W + R L G+L + F+ + + + I R+ L +
Sbjct: 571 IND---KILAIKNWKQPITKRELRGFLGLTNYYRKFIPKLSEIEAPLIDIARKNKLFKWE 627
Query: 738 APHLTPINPAVLPKLEWWLNALPLSSPIF--PRQVQHFISTDASDLGWG----------S 785
H N L K N + SS +F ++ I DAS+ G G
Sbjct: 628 DIHTETFN---LIK-----NQISDSSFLFIPDYKLTFHIDCDASNDGIGHVIYQYKDNIE 679
Query: 786 QVDSS----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSY 841
Q D+ + S ++ ++++H+ ++E+ A+ AL N +L +++ +D+Q ++
Sbjct: 680 QEDNKQIVLYGSKKFNTTERDYHVFEQEVMAIKHALESNYHMLLGYKIVIHTDHQNILFI 739
Query: 842 LRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ L+ ++ IF + + G+ N +AD LSR
Sbjct: 740 NNKLNDNTKPKLIRWLQYIFSFNP------TLIYKKGSDNVIADGLSR 781
>gi|315041357|ref|XP_003170055.1| hypothetical protein MGYG_09147 [Arthroderma gypseum CBS 118893]
gi|311345089|gb|EFR04292.1| hypothetical protein MGYG_09147 [Arthroderma gypseum CBS 118893]
Length = 1590
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 7/166 (4%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP +PL +L V + ++ ML+ G ++ S G + L PK +G R +
Sbjct: 555 PPHLPLYNLSAKELQV---LREYLDTMLKRGWIRESKSPAG--APLLFAPKADGSLRTCV 609
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN+ + +L + L + +DL +AY V IK + A
Sbjct: 610 DYRGLNKMTIKNRLTLPRVDEMLDRLAGAMFFTKLDLREAYHRVRIKEGDEWKTAFRTRY 669
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
+PFGLA AP F N V + L + +VYLDD L+
Sbjct: 670 GHYEYLVMPFGLANAPATFQGYINRVLTGLVD--IACIVYLDDILI 713
>gi|145220604|gb|ABP48077.1| putative gag-pol protein [Drosophila ananassae]
Length = 1339
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 102/436 (23%), Positives = 179/436 (41%), Gaps = 50/436 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
Q +++ GV + S + + S L +V K G RP + + LN P ++ L H +
Sbjct: 501 FQYLMDMGVCR--PSKSPYASPLHMVRKPTGEWRPCGDYRALNAQTIPDRYPL-PHIQDC 557
Query: 540 SFLQKGDYMIS-IDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
+ + G + S IDL++AY +PI+ A++ + T + FGL A Q F
Sbjct: 558 THVFYGKTIFSKIDLNRAYNQIPIEPKDIPKTAITTPFGLFEFTHMTFGLCNAAQTF--- 614
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
++ + R V VY+DD + + + + I + L +N K S
Sbjct: 615 QRYMHTAFRDLDF-VFVYVDDIAVASANIQQHHIHLRQVFEKLQEYNLTINPSKCSFGKE 673
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
+ +FLG H + Q + I L L R L ++F IP
Sbjct: 674 SI-EFLG-----HKINHQGIKPLQTKVNAITNFPLPKVAKEL---RRFLAMINFYRRFIP 724
Query: 719 MGRLHSRRIQRQASLLRL----GAPHLTPIN--PAVLPKLEWWLNALP----LSSPIFPR 768
IQ QA L++L TPIN + K N+L L+ P
Sbjct: 725 NA------IQHQAPLVQLIPGNKKNDSTPINWTTETIDKFNSCKNSLAQAALLAHPAPNA 778
Query: 769 QVQHFISTDASDLGWGS---QVDSS------FLSGLWSREQQNWHINKKEMFAVHQALSL 819
+ +STDAS++ G+ QV +S F S S Q+ + +E+ A++ +
Sbjct: 779 NLS--LSTDASNIAVGAVLHQVINSEYQPMGFFSIKLSETQRKYSTYDRELLAIYLGIKH 836
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
+L+ V +++D++ +V ++ S L +++ I + + G
Sbjct: 837 FRHMLEGRVFHIRTDHKPLVYAFDQKPEKASPRQLRQLDFIGQFTTS------ITHVRGD 890
Query: 880 YNSVADSLSRSKSLPD 895
N+ AD+LSR +++ D
Sbjct: 891 ENTTADTLSRIEAIGD 906
>gi|427798439|gb|JAA64671.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 926
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 101/450 (22%), Positives = 189/450 (42%), Gaps = 71/450 (15%)
Query: 472 VSSA----MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
VSSA ++ + +M++ G+++ S + + S + LV K +G R ++ + LN+
Sbjct: 85 VSSAERRVINQQVDDMMKRGIIE--PSNSPWASPVVLVKKKDGSIRFCVDYRRLNKITRK 142
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
+ L LQ ++ S+DL Y+ VP+ + A + T +PFG
Sbjct: 143 DVYPLPRIDDALDCLQGAEFFSSLDLRSGYWQVPMAEADRSKTAFVTPDGLYEFTVMPFG 202
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWI 647
L AP F + + A+L + + YLDD ++ + D + + ++ L + G
Sbjct: 203 LCNAPATFERMMD--ATLRGLKWNTCLCYLDDVVVFSTDFASHLTRLEQVLTCLSTAGLQ 260
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLL 707
+NL+K + A L LG + LP+ +L+ K + RS +
Sbjct: 261 LNLKKCRFA-ARKLTILGHVVSKD---GILPDPAKLSAVAAF-----PKPTTIKELRSFV 311
Query: 708 GYLS--------FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
G S FAS + P+ +LL G+ L+ + A + L L
Sbjct: 312 GLCSYFRRFIRDFASIMAPL-----------TNLLAAGS-DLSAWSQACDDAFD-QLRRL 358
Query: 760 PLSSPIFPRQVQHF-------ISTDASDLGWGS----------QVDSSFLSGLWSREQQN 802
+ PI ++HF + TDAS +G G+ + ++ S ++ ++N
Sbjct: 359 LTAPPI----LRHFDPSAPTELHTDASGIGLGAVLAQRKGGFEEYVVAYASRTLTKPEKN 414
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTV--VSYLRRQGGTKSLSLLSEVEKI 860
+ + +KE A+ A+ P L + +D+ + +S L+ G + +
Sbjct: 415 YSVTEKECLAIIWAIGKFRPYLYGRPFHIVTDHHALCWLSSLKDPNG--------RLARW 466
Query: 861 FLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
L Q++ I ++ + G +S AD+LSRS
Sbjct: 467 ALRLQEFDIRVV--YRSGRKHSDADALSRS 494
>gi|427798023|gb|JAA64463.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 1124
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 89/394 (22%), Positives = 161/394 (40%), Gaps = 43/394 (10%)
Query: 463 CSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLN 522
C L+ + + M I ++LE +++ ST+ + S LV K +GG R ++ + LN
Sbjct: 615 CKLRPVNAKKKAIMDSCIADLLEHELIR--PSTSQWTSAPVLVAKKSGGFRLAIDYRPLN 672
Query: 523 QFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMT 582
+ + + + L + + S DLSQ +F +P++ A + T
Sbjct: 673 SRTRVPAYPMPRTDWLLAQLGQAQWFSSFDLSQGFFQIPVREQDIPKTAFICHQGTYEFT 732
Query: 583 CLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSI 640
+PFG+A P F +L + V + R + +LDD L+ + D + IQ L
Sbjct: 733 RMPFGVAGGPATFQTLMDTVLDGVNHRF--AMAFLDDVLVYSDTLDDHVEHIQCVL--ER 788
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL 700
+ G +N K L L+FLG + P R P++ G +L L + +
Sbjct: 789 IRKAGLTINPAKIQLC-RNSLKFLGHVISPGQCR---PDE-----GKVLAVLQYPRPTTI 839
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWL--- 756
++ LG + IP L + + SLL+ AP T +L+ L
Sbjct: 840 KQLQAFLGLAGYYRSFIPQFSLTAHPL---TSLLKKDAPWQWTNQQEDAFGRLKEALAHD 896
Query: 757 ---NALPLSSPIFPRQVQHFISTDASDLGWGSQV---------DSSFLSGLWSREQQNWH 804
N L+ P + TDAS +G + + SF+S + + ++ +
Sbjct: 897 AVVNLPDLNRPF-------VVETDASGIGIAAVLLQASPEELRPVSFISRVLTEAEKQYT 949
Query: 805 INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTV 838
+ + E AV A+ P L+ + + D+ ++
Sbjct: 950 VQEWECLAVVWAVDKFRPYLEFTEFEIHCDHSSL 983
>gi|281211888|gb|EFA86050.1| hypothetical protein PPL_01285 [Polysphondylium pallidum PN500]
Length = 1643
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 106/440 (24%), Positives = 175/440 (39%), Gaps = 42/440 (9%)
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFS 531
+ S +S + ++LE GV++ T F S F V KG R V++ K LN F
Sbjct: 761 LESKISAEVAKLLEIGVIEEAPPNTEFCSPAFFVNKGTSKERMVVDFKHLNSMTVDDVFP 820
Query: 532 LINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATA 591
+ I + ID ++ + + +RF + N + T FGL +
Sbjct: 821 MERLDEIIESIGGAKIFSVIDAKSGFYQMLLNPGSRRFTTFAANKRLYMFTRPCFGLKNS 880
Query: 592 PQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
P F V L +G V VY+DD L+ ++ E K +L V
Sbjct: 881 PAYFNRWLQHVLDPLVKKGF-VRVYVDDILIFSKSVAEHEQHLKQVFELLDKNDVYVAKS 939
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLT-LGNILRTLLA-SKTWNLDSARSLLGY 709
K L V + G M DK + L N + +L S + RS +G
Sbjct: 940 KCHLFKYSV-SYAGHML----------SDKGIKPLYNKVNAILNRSVPTTVKEMRSFIGA 988
Query: 710 LS-FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWL-NALPLSSPIFP 767
++ + ++ MG +R ++ R +LT A ++ L ++ L SP +
Sbjct: 989 INHYRRYLNHMGPQLARLTSTISTKYR--KINLTDQEIADFNDIKTELCSSRCLMSPRYD 1046
Query: 768 RQVQHFISTDASDLGWG---SQVDSS-------FLSGLWSREQQNWHINKKEMFAVHQAL 817
R + TDASD+G G +Q D + F + + Q+N+ +E+ A A+
Sbjct: 1047 RTFH--VYTDASDVGSGLMIAQYDDNNNLRPVLFDARKFDSAQRNYSARDRELLAFIHAV 1104
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
+ LL V +D++ ++ + L SE+ LS R +IP
Sbjct: 1105 TRYGYLLSRPFVF-HTDHKNLIYNSQNDMDNPRLVRWSEI-----LS---RFSFQTSYIP 1155
Query: 878 GAYNSVADSLSRSKSLPDWH 897
G N +AD LSR+ PD++
Sbjct: 1156 GKENCMADYLSRA---PDFY 1172
>gi|388856373|emb|CCF49922.1| uncharacterized protein [Ustilago hordei]
Length = 1214
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 96/391 (24%), Positives = 152/391 (38%), Gaps = 65/391 (16%)
Query: 454 SAKPPLVPLCSLQHLATPVS-SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT 512
KPP PL +L P S + ++ E L+ G ++ S + S + VPK +GG
Sbjct: 380 GGKPPQGPL----YLKGPKEMSELRRYLDENLKKGFIR--PSKSPAQSPVLFVPKKDGGL 433
Query: 513 RPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL 572
R ++ +GLN+ + L L+K +DL AY + I + A
Sbjct: 434 RLCVDYRGLNEITVKNRAPLPLIEEQLFLLRKARIYTKLDLRAAYNLIWIAKGDEWKTAF 493
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILE 631
+ +PFGLA AP F S N + R G+ VVVYLDDFL+ +
Sbjct: 494 GTQLGLYEYLVMPFGLANAPAHFQSFIN---DIFRDIIGIYVVVYLDDFLIFSDTEEAHV 550
Query: 632 IQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRT 691
++ L S L K V +FLG + L + + +K T+
Sbjct: 551 KHVTEVLTRLRSNRLFAKLSKCEFHTKTV-EFLGYIIK--LTGIEMDPEKVCTV------ 601
Query: 692 LLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
K W + +S + +L FA+F + R I A + + P+ V P
Sbjct: 602 ----KEWPMPESIHDIQRFLGFANF-------YRRFIAHFARIAK-------PLTSLVKP 643
Query: 751 KLEWWLNALPLSS-PIFPRQVQHFIS----------------TDASDLGWGSQVDS---- 789
++ LP + F + +Q F S TDASD +
Sbjct: 644 IEQFKKFELPEEAQQAFHKLIQAFTSAGVLQHFDYHLPTRLETDASDFAIAGVLKQEHEG 703
Query: 790 -----SFLSGLWSREQQNWHINKKEMFAVHQ 815
+F S S ++N+ I+ KE+ AV++
Sbjct: 704 RWHPVAFYSRKMSSAEKNYEIHDKELLAVYR 734
>gi|326665908|ref|XP_003198150.1| PREDICTED: hypothetical protein LOC100535957 [Danio rerio]
Length = 1427
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 100/415 (24%), Positives = 170/415 (40%), Gaps = 56/415 (13%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
A+ +++ML+ GV++ S + + S + +VPK +G R + + LN+ + +
Sbjct: 893 AIEEEVKQMLKLGVIE--PSRSPWSSPIVMVPKSDGTLRFCNDFRRLNEISEFDGYPMPR 950
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ L + Y+ ++DL++ Y+ VP+ + A S LPFGL AP
Sbjct: 951 VDELLDRLGRARYISTLDLTKGYWQVPLSEEAKAKTAFSTPSGHWQYRTLPFGLHGAPAT 1010
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L + V LR YLDD ++ ++ + + +S L G N +K
Sbjct: 1011 FQRLMDIV---LRPHQAYAAAYLDDVVIHSETWEDHLDRLRRVLSELRRAGLTANPRKCH 1067
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL---- 710
L+ ++LG L + P++K++ +R A K R+ LG
Sbjct: 1068 LALHEA-KYLGFQVGRGLIQ---PQEKKV---EAIRN--APKPETKTQVRAFLGLAGYYR 1118
Query: 711 ----SFASFVIPMGRLHSRRIQRQASLLRLGAPH---LTPINPAVLPKLEWWLNALP-LS 762
+FAS P+ + L R G P T L K++ L + P L
Sbjct: 1119 CFIPNFASLAAPL-----------SDLTRKGQPEKICWTTAAEEALHKVKMALTSEPVLR 1167
Query: 763 SPIFPRQVQHFISTDASDLGWG---SQVDSS------FLSGLWSREQQNWHINKKEMFAV 813
+P F + TDASD G G SQ+ ++S ++N+ +KE A+
Sbjct: 1168 APDF--ACPFLLQTDASDTGLGAVLSQIQEGEEHPILYISRKLLPAEKNYATVEKEALAI 1225
Query: 814 HQA-LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
A L L LL S +V + + ++ R T + V + FL QD+
Sbjct: 1226 KWAVLELRYYLLGRSFTLV--TDHAPLQWMARAKDTN-----ARVTRWFLALQDF 1273
>gi|329351125|gb|AEB91356.1| polyprotein, partial [Verticillium dahliae VdLs.17]
Length = 1129
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 102/442 (23%), Positives = 173/442 (39%), Gaps = 64/442 (14%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ +++E L G ++ S G+ + VPK NG R ++ + LN + L
Sbjct: 242 DTLDEYLKENLRKGYIRPSTSPAGYP--ILFVPKKNGKERLCVDYRQLNDITIKNCYPLP 299
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ L ++ ++DL AY + IK + A +PFGL AP
Sbjct: 300 LISELRDALAGANWFTALDLKGAYNLIRIKDGEEWKTAFRTRRGHYEYLVMPFGLTNAPA 359
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKL--AVSILGSLGWIVNLQ 651
F ++ N V L + VVVYLDD L+ ++ + E +G + ++ L +V +
Sbjct: 360 TFQNMINDV--LREFLDVFVVVYLDDILIFSKT--VEEHKGHVHQVLTRLHQHELLVEPE 415
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLL 707
K+ V FLG P RM E ++ A + W N+ R+ L
Sbjct: 416 KAKFHTQEV-DFLGYTITPGEIRM---EKSKVA---------AIREWPTPKNVKDVRAFL 462
Query: 708 GYLSFASFVI--------PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
G+++F + P+ L + I+ + + A I AVL + L
Sbjct: 463 GFVNFYRRFLKGYSKTANPLTNLTVKEIEFAWNEPQEKA--FRQIIDAVLSE-----PVL 515
Query: 760 PLSSPIFPRQVQHFISTDASDLGWGSQVDS----------SFLSGLWSREQQNWHINKKE 809
+ P P +V+ TDASD G Q+ +F S + N+ I+ KE
Sbjct: 516 RMIDPEKPMEVE----TDASDFAIGGQLGQRDDQGRLHPVAFFSKKLHGPELNYQIHDKE 571
Query: 810 MFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
+ A+ +A L + V V +D++ + + + K +E F + +
Sbjct: 572 LMAIIEAFKEWRTYLSGARHEVKVYTDHKNLAHFTTNKDLNKRQIRWAEFLSEFNFTIIY 631
Query: 868 RIHILAQFIPGAYNSVADSLSR 889
R G+ N AD LSR
Sbjct: 632 R--------KGSENGRADILSR 645
>gi|7682782|gb|AAF67363.1| Hypothetical protein T32B20.f [Arabidopsis thaliana]
Length = 1504
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 99/231 (42%), Gaps = 26/231 (11%)
Query: 452 PFSAK--PPLVPLCSLQHLATPVSSA-MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG 508
PF+ + P PL + PV A + ++++L G + RL+++ S LF V K
Sbjct: 607 PFTIELEPGTAPLSKAPYRMVPVEMAELKKQLEDLLGKGFI-RLNTSPWRTSVLF-VKKK 664
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
+G R ++ + LN+ K+ L + L+ IDL+ Y +PI R
Sbjct: 665 DGSFRLCIDYRELNRVTVKNKYPLPRIDELLDQLRGATCFSKIDLTSGYHQIPIAEADVR 724
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP- 627
A +PFGL AP AF L N V V++++DD L+ ++ P
Sbjct: 725 KTAFRTRYGHFEFVVMPFGLTNAPAAFMRLMNSVFQEFLDEF--VIIFIDDILVYSKSPE 782
Query: 628 -------RILEI--QGKLAVSI---------LGSLGWIVNLQKSSLSPAPV 660
R++E + KL + +G LG IV+++ S+ P +
Sbjct: 783 EHDVHLRRVMEKLREEKLFAKLSKCSFWQRKMGFLGHIVSVEGVSVDPEKI 833
>gi|327207054|gb|AEA39176.1| polyprotein [Dahlia mosaic virus]
Length = 808
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 115/533 (21%), Positives = 204/533 (38%), Gaps = 83/533 (15%)
Query: 384 QNLEPPGRVSLKVQTLQKPQR-----CS-SPVNPPADSRIGAELVGGRLRRFVDAWIRLG 437
Q + P R L++Q QK + CS +P++P + + +++A I L
Sbjct: 330 QEVIKPERFFLEIQKYQKIEELLEKVCSENPIDPE------------KSKYWMNASIELI 377
Query: 438 APAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTG 497
P +VR+ P P I+E+L+ ++ + S +
Sbjct: 378 DPKTVVRVK-----PMKYSPQ-------------DREEFGKQIKELLDLKLI--IPSKSP 417
Query: 498 FLSRLFLV----PKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+S FLV + G R V+N K +N +L + + L+ + D
Sbjct: 418 HMSPAFLVGNEAERRRGKKRMVVNYKAINAATKGDSHNLPCIQELLTLLRGKTIFSTFDC 477
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRV 613
++ V + Q A + +PFGL AP F + + LR
Sbjct: 478 KSGFWQVLLNEESQLLTAFTCPDGHYQWKVVPFGLKQAPGIF---QRHMQNALRGLENYC 534
Query: 614 VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD--PH 671
VY+DD ++ + + + G I++ +K++L A + FLG+ D H
Sbjct: 535 TVYVDDIIVFSDSEEKHYFHDLSVLKTIEKYGIILSKKKANLFKAKI-NFLGLEIDQGTH 593
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQA 731
+ + E+ L TL K + LG L++A IP +L R Q
Sbjct: 594 CPQKHILEN----LHKFPDTLEDKK-----HLQRFLGVLTYAESYIP--KLAELRKPLQV 642
Query: 732 SLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI-STDASDLGWGSQVDSS 790
L + + + K++ L + P P++ + I TDAS+ WG + +
Sbjct: 643 KLKKDYVWEWKQSDTNYIKKIKKNLISFP--KLYLPKEKEFLIIETDASNDFWGGVLKAK 700
Query: 791 ---------FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSY 841
+ SG + ++N+H N+KE+ AV +S L +V++DN+ +
Sbjct: 701 TADKEEVCRYTSGSFKTAEKNYHSNEKELLAVKNTISKFSIYLTPVKFLVRTDNKNFTYF 760
Query: 842 LRRQ--GGTKSLSLLSEVEKIFLLSQDW--RIHILAQFIPGAYNSVADSLSRS 890
L+ + G K L+ Q W R + + G N +AD L+R
Sbjct: 761 LKTKISGDNKQGRLVR--------WQMWFSRYSFDIEHLEGPKNVLADCLTRD 805
>gi|327268874|ref|XP_003219220.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Anolis carolinensis]
Length = 533
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 58/120 (48%), Gaps = 2/120 (1%)
Query: 507 KGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTH 566
K +GG R + +GLN + K+ L + + L KG +DL +AYF V IK
Sbjct: 298 KKDGGLRLCTDFRGLNAICTTNKYPLPLIKDMLAHLSKGKIFTKLDLREAYFRVRIKEGD 357
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+ A + LPFGL+ AP F L N +G V+VYLDD L+++++
Sbjct: 358 EWKTAFNCPLGQFQYKVLPFGLSGAPGVFMQLINETLHPFLYKG--VLVYLDDILIMSEN 415
>gi|14010621|gb|AAK52055.1|AF364549_1 RNA-directed DNA polymerase [Drosophila melanogaster]
Length = 1013
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 110/461 (23%), Positives = 180/461 (39%), Gaps = 98/461 (21%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPK------GNGGTRPVLNLKGLNQFLSPKKFSLI 533
IQE+L+ G++++ S + + + +++V K GN R VL+ + LN+ P ++ +
Sbjct: 178 IQELLKNGIIQK--SKSPYNNPIWVVDKKGTDDAGNKKMRLVLDFRKLNERTVPDRYPMP 235
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
N I L K Y ++DL Y + + + A + NG LPFGL A
Sbjct: 236 NISMILGNLGKAKYFTTLDLKSGYHQITLAERDREKTAFAVNGGKYEFRRLPFGLRNAAS 295
Query: 594 AFASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
F + +LR + G VY+DD ++ ++D + L ++ +K
Sbjct: 296 IF---QRTIDDILREQIGKFCYVYVDDVIIFSEDENDHVKHVDWVLKSLYDANMRISAEK 352
Query: 653 SSLSPAPVLQFLGIM-------WDPHLDRMW--LPEDKQLTLGNILRTLLASKTWNLDSA 703
S V FLG + DP + PE K N+
Sbjct: 353 SRFFKKSV-SFLGFIVTNNGAATDPEKVKAIKEFPEPK-----------------NVFEV 394
Query: 704 RSLLGYLS--------FASFVIPM-----------GRLHSRRIQRQASLLRLGAPHLTPI 744
RS LG S FAS P+ R SR IQ + S + A
Sbjct: 395 RSFLGLASYYRCFIKDFASIARPISDILKGENGSVSRHRSRSIQVEFSEAQQRA------ 448
Query: 745 NPAVLPKLEWWLNALPLSSPI--FPRQVQHF-ISTDASDLGWGSQVDS-----SFLSGLW 796
E N L I +P + F ++TDAS G G+ + + +S
Sbjct: 449 -------FEKLRNILASEDVILRYPDYKKAFDLTTDASAYGIGAVLSQEGRPITMISRTL 501
Query: 797 SREQQNWHINKKEMFAVHQALS-LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
S + N+ N++E+ A+ AL+ L L + + +D+Q + T ++S +
Sbjct: 502 SDREVNYATNERELLAIVWALAKLRHYLYAVKEINIFTDHQPL---------TFAVSESN 552
Query: 856 EVEKIFLLSQDWRIHILAQ-----FIPGAYNSVADSLSRSK 891
KI + W+ I + PG N VAD+LSR +
Sbjct: 553 PNAKI----KRWKARIDESGARIFYKPGRNNLVADALSRQQ 589
>gi|301605579|ref|XP_002932346.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1542
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 100/452 (22%), Positives = 180/452 (39%), Gaps = 55/452 (12%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P + L+ P + +I E LE G ++ S G + +F V K +G RP ++ +
Sbjct: 514 IPFGKIYPLSEPELKILKDYIDENLEKGFIRPSTSPAG--AGIFFVEKKDGSLRPCIDYR 571
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
LN+ ++ L +P Q+ +DL AY V I+ + A
Sbjct: 572 ELNKITVKNRYPLP---LVPELFQRLRSAKVFSKLDLQGAYNLVRIREGDEWKTAFRTRY 628
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP A+ +++ + R + VVVYLDD L+ + I +
Sbjct: 629 GHFEYLVMPFGLCNAP---ATFQHFINDIFRDFLDLFVVVYLDDILVFSSSLAEHRIHLR 685
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
S L + ++K + +FLG + + + D + + +IL
Sbjct: 686 RVFSRLRTHQLYAKIEKCEFEKTSI-EFLGFI----ISTEGISMDPR-KISSILE----- 734
Query: 696 KTWNLDSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLE 753
W +R + ++ FA+F + SR I +L + L+
Sbjct: 735 --WPTPGSRKAVQRFVGFANFYRKFIKNFSRVIAPITALTSTSKKFFWSREAQGAFENLK 792
Query: 754 WWLNALPL---SSPIFPRQVQHFISTDASDLGWGS----QVDS-------SFLSGLWSRE 799
+ P+ P P V+ DAS++ G+ ++DS +F S S
Sbjct: 793 GRFTSAPILIHPDPSLPFVVE----VDASEVAVGAILSQRMDSLGHLHPVAFFSRKLSSS 848
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEV 857
++N+ + +E+ A+ A L+ ++ V+V SD++ + YLR + L
Sbjct: 849 EKNYDVGDRELLAIKVAFEEWRHFLEGALHPVIVFSDHKN-LEYLR-----SAKRLRPRQ 902
Query: 858 EKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ L + H+ + PG N AD+LSR
Sbjct: 903 ARWALFFSRFNFHV--TYRPGTKNGKADALSR 932
>gi|403175459|ref|XP_003889018.1| hypothetical protein PGTG_22255 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171612|gb|EHS64419.1| hypothetical protein PGTG_22255 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 859
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/314 (22%), Positives = 137/314 (43%), Gaps = 40/314 (12%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGF----LS 500
++ G+ I F P + +L H TP + + S ++ +E + K L + F L
Sbjct: 22 VIEGFRIGFDQGIPQHRVGTLTHY-TPDNHSSSEKVKSKVEDSISKELGAKRMFGPFSLE 80
Query: 501 R------------LFLVPKGNGGTRPVLNL---------KGLNQFLSPKKFSLI-NHFRI 538
+ L V G+G RP+ +L +N F+ +F + F+I
Sbjct: 81 KVIEKFGFCRSNPLGAVVNGDGAIRPINDLSFPRNDPEITSVNSFVDKSEFETTWDDFQI 140
Query: 539 PS-FLQKGDYMISI---DLSQAYFHVPIKTTHQRFLAL-SYNGDVLAMTCLPFGLATAPQ 593
S F K + + + D +AY +P + R+L + +NG+ L T + FG
Sbjct: 141 VSEFFAKDNRKMELALFDWEKAYRQIPTRKEQWRYLMVKDFNGNFLVDTRITFGGVAGCG 200
Query: 594 AFASLSNWVASLLRS--RGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
+F ++ +++S + + + ++DD L V + ++ + S LG + N
Sbjct: 201 SFGRPADAWKLIMKSHFKLLNIFRWVDDNLFVRLQGEDISMEDVVEKS--HHLGVLTN-- 256
Query: 652 KSSLSPAPVLQ-FLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT-WNLDSARSLLGY 709
K+ SP Q F+G +W+ + LP+ K T N ++ L K ++ + A L+G
Sbjct: 257 KTKYSPFQDEQKFIGFIWNGVEKTVRLPDGKVETRLNQIKPFLEDKAMFDYNDAEILVGR 316
Query: 710 LSFASFVIPMGRLH 723
L+ ++++P R H
Sbjct: 317 LNHVAYILPHLRCH 330
>gi|406701213|gb|EKD04365.1| retrotransposon nucleocapsid protein [Trichosporon asahii var. asahii
CBS 8904]
Length = 1687
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 105/456 (23%), Positives = 176/456 (38%), Gaps = 49/456 (10%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP P+ SL V + ++ E L+ G + + S + + + V K +G R +
Sbjct: 718 PPFGPIYSLSEKELGV---LREYLDENLDKGFV--VPSESPAAAPILFVKKKDGSLRLCV 772
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN+ ++ L + L+K IDL AY + I + A
Sbjct: 773 DYRGLNKITVKNRYPLPLIPELLDRLRKAKVFTKIDLRGAYNLLRIAEGDEWKTAFRTRY 832
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKL 636
+ +PFGL AP +F L N + V+VYLDD L+ + KL
Sbjct: 833 GLFEYKVMPFGLTNAPASFQHLMN--HNFRDMLDDFVIVYLDDILVFSNSIEEHTEHVKL 890
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG-NILRTLLA- 694
+ L +G K V +FLG ++ DK +++ ++T+L
Sbjct: 891 VLQRLREVGLYAKASKCEFHTNSV-EFLG----------FVISDKGISMDMKKVQTILDW 939
Query: 695 SKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLE 753
K NL RS LG+ +F I + + + R L R P T L+
Sbjct: 940 PKPCNLHDVRSFLGFCNFYRRFIKGYSVVANPLIR---LTRNDVPFTWTDKEQQAFDALK 996
Query: 754 WWLNALPLSSPIFPRQVQHFI-STDASDLGWGS----QVDS-----SFLSGLWSREQQNW 803
L P QH + TDASD +VD +F S S + N+
Sbjct: 997 SCFTTADLLHHYDPD--QHLVLETDASDYAIAGVLSQEVDKELQPIAFFSRKLSPAELNY 1054
Query: 804 HINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
I+ KEM A+ + + + V +D++++ + + T+ + SE F
Sbjct: 1055 EIHDKEMLAIVACFKEWRHYFEGAAHNITVYTDHRSLEYFTTSKQLTRRQARWSEFLSEF 1114
Query: 862 LLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWH 897
+ +R PG + D+L+R + D+H
Sbjct: 1115 NFTIVYR--------PGLKGTKPDALTRRR---DYH 1139
>gi|77551464|gb|ABA94261.1| retrotransposon protein, putative, unclassified [Oryza sativa
Japonica Group]
Length = 1369
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 100/427 (23%), Positives = 171/427 (40%), Gaps = 53/427 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEML G+++ S S + LV K +G R ++ + LN K+ L +
Sbjct: 559 VQEMLAKGIIQPSSSPF--SSPVLLVKKKDGSWRFCVDYRHLNAITVKNKYPLPIIDELL 616
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS-L 598
L + +DL Y + + + A + +PFGL +AP F S +
Sbjct: 617 DELAGVQWFTKLDLRAGYHQIIMHIEDEHKTAFQTHHGHFEFRVIPFGLTSAPATFQSVM 676
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
+N ++SLLR V+V++DD L+ ++ + + IL V K S +
Sbjct: 677 NNILSSLLRK---CVLVFVDDILIYSRTLEEHLVHLQTVFQILHKHQLKVKKSKCSFAQQ 733
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVI 717
L +LG + P + + DK + ++W + S + L +L A +
Sbjct: 734 K-LAYLGHIISP--NGVSTDSDK----------IAVVQSWPVPSSVKELRSFLGLAGYYR 780
Query: 718 PMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFIS 775
R + + +LL+ G +L T ++ L P L+ P F + +
Sbjct: 781 KFVRNYGILSKPLTNLLKKGQLYLWTSATDQAFQAIKHALVTAPVLAMPDF--SIPFVVE 838
Query: 776 TDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDASD G G+ + +FLS +KE A+ A+ P LQ +
Sbjct: 839 TDASDKGMGAVLMQNNHPIAFLSKALGPRHLGLSTYEKESLAIMMAVDHWRPYLQHAEFF 898
Query: 831 VQS--------DNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNS 882
+++ DNQ + + + + TK L L R I+ + G+ N
Sbjct: 899 IKTDHRSLAFLDNQRLTTPWQHKALTKLLGL--------------RYQII--YKKGSDNR 942
Query: 883 VADSLSR 889
VAD+LSR
Sbjct: 943 VADALSR 949
>gi|391331925|ref|XP_003740390.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 495
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 69/135 (51%), Gaps = 6/135 (4%)
Query: 469 ATPVSSAMSLHIQEMLE----TGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQ 523
A PV+ A+ + E LE GV++ +D + F + + +V K NG R + GLN+
Sbjct: 283 ARPVAYALLPQVVEELERMQKEGVIEAIDHS-DFATPVVIVKKKNGTIRMCADYSTGLNK 341
Query: 524 FLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTC 583
+ + L + + L G Y IDL++AY VP++ Q+ L+++ + +
Sbjct: 342 SIEDDVYPLPTTDAVFAQLNGGHYFSQIDLAEAYLQVPVEEKSQKILSINTVKGLFKVKR 401
Query: 584 LPFGLATAPQAFASL 598
LPFG+ TAP F SL
Sbjct: 402 LPFGIKTAPSQFQSL 416
>gi|242816502|ref|XP_002486791.1| gag/polymerase/env polyprotein, putative [Talaromyces stipitatus
ATCC 10500]
gi|218715130|gb|EED14553.1| gag/polymerase/env polyprotein, putative [Talaromyces stipitatus
ATCC 10500]
Length = 1787
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 85/211 (40%), Gaps = 7/211 (3%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P SL L+ S + ++ L G ++ S G + + VPK +GG R ++ +G
Sbjct: 740 PYRSLYRLSPKESEVLREYLVTNLAKGWIRESKSPAG--APILFVPKKDGGLRLCVDYRG 797
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L L +DL AY + IK + A
Sbjct: 798 LNAITIKNRYPLPLIGETIDRLAGAKIYTQLDLRDAYHRIRIKEGDEWKTAFRTRYGHYE 857
Query: 581 MTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
T +PFGLA AP F A ++ +A LL + V YLDD ++ +QD + +
Sbjct: 858 YTVMPFGLANAPATFQAYVNRALADLL---DICCVAYLDDIIIYSQDESSHTDDVQRVLE 914
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP 670
L L K + + +FLG + P
Sbjct: 915 RLRQYKLYAKLSKCAFEKTEI-RFLGFIVSP 944
>gi|308447841|ref|XP_003087537.1| hypothetical protein CRE_03601 [Caenorhabditis remanei]
gi|308254851|gb|EFO98803.1| hypothetical protein CRE_03601 [Caenorhabditis remanei]
Length = 1274
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 94/416 (22%), Positives = 177/416 (42%), Gaps = 50/416 (12%)
Query: 493 DSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISID 552
+ST+ + S + +VPK NG TR V++ + LN P+ + + + + +G D
Sbjct: 598 ESTSPYTSPILMVPKPNGDTRIVIDYRKLNLITRPRTYIMPHTTDVTEDASRGKIFSVFD 657
Query: 553 LSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMR 612
+ Q + H+ + H+ A + V +P GL +P F + VA R
Sbjct: 658 ICQGFHHIRMYEPHKERTAFCCHLGVFHYKYMPMGLKGSPDTFQRAMSEVA---RQFSGT 714
Query: 613 VVVYLDDFLLV--NQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP 670
+++Y+DD +V N++ I +++ + I +G + +KS + + + LG + +
Sbjct: 715 LILYVDDLTVVSDNEEQHIADLEEFFKLMI--KMGLKLKAEKSQIGRNRI-KLLGFVIE- 770
Query: 671 HLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQ 730
+R P ++ +R K N+ +S LG + RR +
Sbjct: 771 --NRTIQPSGEKT---EAIRNFPIPK--NVSEVKSFLGMSGYF-----------RRFIKD 812
Query: 731 ASLLRLGAPHLTPINPAVL--PKLEWWLNALP---LSSPIF--PRQVQHF-ISTDASDLG 782
++L LT + P+ + L+ + +S PI P F + TDAS +G
Sbjct: 813 YAILAKPLTALTQKENSFKWGPEQQKALDMIKDKLISPPILTTPDMNGDFEMHTDASKIG 872
Query: 783 -----WGSQVDS----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQS 833
+ Q + ++ S ++ +Q + + E A+ L+ P + V V +
Sbjct: 873 IAAILFQKQENQLKVVAYASRPTTKVEQRYPPIELESLAITWGLTHYRPYIFGRKVKVVT 932
Query: 834 DNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
D+Q + + L R+ S L+ E I Q + + I+ + PG N VAD+LSR
Sbjct: 933 DHQPLKALLHRKENNMSGRLMRH-EAII---QQYDVEIV--YRPGRENHVADALSR 982
>gi|254587290|emb|CAX83702.1| Gag-Pol polyprotein [Schistosoma japonicum]
Length = 1345
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 96/431 (22%), Positives = 169/431 (39%), Gaps = 50/431 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ M++ G+++ S + + S L +VPK + RP + + LN P ++ I H
Sbjct: 488 FEHMMQLGIIR--PSNSPWASPLHMVPKKDHDWRPCGDYRRLNALTVPDRYP-IPHLHDF 544
Query: 540 SFLQKGDYMIS-IDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S + G + + IDL +AY +P+ T A+ + +PFGL A Q F
Sbjct: 545 SLMLHGKTVFTKIDLVRAYHQIPVATEDIPKTAIITPFGLFEFLRMPFGLKNAAQTFQRF 604
Query: 599 SNWVASLLRSRGMR-VVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V +RG+ V VY+DD L+ N + + + G +++ K
Sbjct: 605 MDEV-----TRGLDFVFVYIDDVLIANNNMHDHQQHLYVFFQRFQQYGVVIHTNKCIFGV 659
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS-KTWNLDSARSLLGYLSFASFV 716
+ + FLG H L + ++ L++ + + R LG +F
Sbjct: 660 SEI-DFLGHHITTH---------GVTPLADRVQALISYPEPDSFQRLRRFLGMCNFYRRF 709
Query: 717 IPMGRLHSRRIQRQASLL--RLGAPHLTPINPAVLPKLEWWLNALPL-----SSPIFPRQ 769
+P + +Q LL R H + P ++ L + + S+P
Sbjct: 710 VPHC---AHLLQPLTDLLKGRQKNFHFSQEAPTAFNAIKEVLARVTMLTYLDSNP----S 762
Query: 770 VQHFISTDASDLGWGS-----QVDS----SFLSGLWSREQQNWHINKKEMFAVHQALSLN 820
Q + TDAS L G+ Q D +F S Q + +E+ A++ A+
Sbjct: 763 SQLVLCTDASKLAVGAVLQQQQKDELVPLAFFSKRLEPAQTRYSTFGRELLAMYLAVKHF 822
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAY 880
LQ + +D++ + S +++ I + D R FI G
Sbjct: 823 CHFLQGRDFTILTDHKQLCYSFTTSYDKHSPREARQLDCISQFTTDIR------FIQGDA 876
Query: 881 NSVADSLSRSK 891
N VAD+LSR +
Sbjct: 877 NVVADTLSRHE 887
>gi|9628896|ref|NP_043924.1| gag-pol polyprotein [Snakehead retrovirus]
gi|1335769|gb|AAC54861.1| gag-pol polyprotein [Snakehead retrovirus]
Length = 2017
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 145/300 (48%), Gaps = 31/300 (10%)
Query: 446 VSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLH--IQEMLETGVLKRLDSTTGFLSRLF 503
+ G F+A P + ++ P +S S+ ++ +LE GVL++ +ST S ++
Sbjct: 793 LQGMTASFTADHPKM----IKQYPVPDASHASIKETVEALLEQGVLRKCNSTVN--SAIW 846
Query: 504 LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMI--SIDLSQAYFHVP 561
V K +G R ++ + LN +S ++ + + + L+K Y + S+D+S ++ +
Sbjct: 847 PVGKPDGSWRLTIDYRPLNSAVSCPYPTVASTPELFAKLEK-KYQVYSSLDISNGFWSIR 905
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF-ASLSNWVASL---LRSRGMRVVVYL 617
++ Q A +++ T LP G +P F +L N +AS + S+G +++ Y+
Sbjct: 906 LEEECQYLFAFTFDTQQYTWTRLPQGFHASPGIFHQALYNGLASCKTAIESQGCKLLQYV 965
Query: 618 DDFLLVNQDPRILEIQGKLAVSILG--SLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRM 675
DD LL+++D R ++ LA+ + G LG +N +KS V Q+LG+ + D
Sbjct: 966 DDILLMSED-RDHHLR-SLAILLQGLKDLGVKINPKKSHFCKDQV-QYLGV--NVGADTR 1020
Query: 676 WLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLR 735
L + + ++RTL T + RS LG +F IP SR+ Q +L+
Sbjct: 1021 SLIDAR----SQLIRTLDIPLT--VQGLRSALGLFNFCRAWIPE---FSRKTQSLYDMLK 1071
>gi|427797353|gb|JAA64128.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 913
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 103/439 (23%), Positives = 180/439 (41%), Gaps = 69/439 (15%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ +ML+ G+++ S S + LV K G R ++ + LN+ + L
Sbjct: 205 VDDMLQRGIIQPSSSPW--SSPVVLVKKKYGSIRFCVDYRRLNKVTRKDVYPLPRIDDAL 262
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
LQ ++ S++L Y+ VP+ + + AL + +PFGL AP F +
Sbjct: 263 DCLQGAEFFSSLELRSGYWQVPMAPSDRPKTALVTPDGLYEFNVMPFGLCNAPATFERMM 322
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
+ V LL + + YLDD ++ + D + +S L G +N++K A
Sbjct: 323 DSV--LLGLKWKTCLFYLDDVVVFSPDFDSHLRRLNEVLSCLTRAGLQLNIKKCRFG-AR 379
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLA-SKTWNLDSARSLLGYLS------- 711
L LG + + +D L LR + K L + RS +G S
Sbjct: 380 KLTILGHV---------VSKDGVLPDPEKLRAVAEFPKPTTLKALRSFVGLCSYFRRFVK 430
Query: 712 -FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQV 770
FAS + P+ +L + G HL+ +PA L L S PI +
Sbjct: 431 NFASVIAPLTKLLT------------GDGHLSDWSPACDDAFA-TLRHLLTSPPI----L 473
Query: 771 QHF-------ISTDASDLGWG----------SQVDSSFLSGLWSREQQNWHINKKEMFAV 813
+HF + TDAS +G G S+ ++ S ++ + N+ + +KE A+
Sbjct: 474 RHFDPTAPTEVHTDASGIGLGAVLAQRKPGLSEYVVAYASRALTKTECNYSVTEKECLAI 533
Query: 814 HQALSLNLPLLQSSVVMVQSDNQTV--VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI 871
AL P L V +D+ ++ ++ L+ G + + L Q++ I +
Sbjct: 534 VWALQKFRPYLYGHPFDVVTDHHSLCWLASLKDPSG--------RLGRWALRLQEFDIRV 585
Query: 872 LAQFIPGAYNSVADSLSRS 890
+ + G ++ AD+LSRS
Sbjct: 586 V--YRSGRKHADADALSRS 602
>gi|189234037|ref|XP_001808080.1| PREDICTED: similar to protease, reverse transcriptase, ribonuclease
H, integrase [Tribolium castaneum]
Length = 1202
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/203 (22%), Positives = 92/203 (45%), Gaps = 14/203 (6%)
Query: 426 LRRFVDAWIRLGAPAPLV--RIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
++ + D + G P P RI +G ++P A PP C + + + + I+ +
Sbjct: 262 VKEYRDLFAEFGPPTPYATHRINTGDSLPV-AVPPYRLTCEKRKV-------LQMEIERL 313
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
L+ G+++ DS + S + +VPK NG R ++ LN P + L +
Sbjct: 314 LQQGIIEECDS--AWASPVVMVPKANGTIRLCVDYHKLNAVTKPDVYPLPRLDDLLHATG 371
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVA 603
K + ++DL Y+ + ++ + + + T +PFGL AP +F L +
Sbjct: 372 KIGCITTLDLQAGYWQIQVEPGDRDKTSFICPFGLYRFTRMPFGLRNAPASFQRLMDKFK 431
Query: 604 SLLRSRGMRVVVYLDDFLLVNQD 626
+ + + ++ YLDD ++++ D
Sbjct: 432 TGIPD--VPILAYLDDLIIISPD 452
>gi|427779291|gb|JAA55097.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 1155
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 89/377 (23%), Positives = 151/377 (40%), Gaps = 39/377 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+Q+MLE V++ S + + S + LV K +G R ++ + LN + L
Sbjct: 423 QVQKMLEDDVIQ--PSKSPWASPVVLVKKKDGSLRFCIHYRKLNNVTKKDVYPLPRIDDS 480
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
L+ Y S+DL Y+ + + + A + LPFGL +AP F L
Sbjct: 481 LDRLRHARYFSSMDLKSGYWQIEVDERDREKTAFVTPDGLYEFKVLPFGLCSAPATFQRL 540
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL---GWIVNLQKSSL 655
+ V S L+ + +VYLDD ++ + E K +S+L ++ G + +K
Sbjct: 541 MDTVLSGLKWKT--CLVYLDDVIVFSA---TFEEHLKRLLSVLQAIRSAGLTLKPEKCHF 595
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPE-DKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA- 713
LQFLG + R P+ DK + R + + + R LG ++
Sbjct: 596 GFEE-LQFLGHVVSQEGVR---PDPDKTAAVAKFTRPV------DKKAVRRFLGLCAYYR 645
Query: 714 SFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQ 771
F+ + S R+ R+ G N +L L P+ + F
Sbjct: 646 RFIADFAHIASPLTRLTREDVAFVWGEEQEASFN-----ELRQRLQTPPVLAH-FDEDAP 699
Query: 772 HFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLP 822
I TDAS++G G+ + ++ S SR + N+ +KE AV A++ P
Sbjct: 700 TAIHTDASNVGLGAVLVQCQDGAERVVAYASRTLSRAESNYSTTEKECLAVVWAVAKFRP 759
Query: 823 LLQSSVVMVQSDNQTVV 839
L V SD+ ++
Sbjct: 760 YLYGRAFQVVSDHHSLC 776
>gi|327278264|ref|XP_003223882.1| PREDICTED: hypothetical protein LOC100559995 [Anolis carolinensis]
Length = 689
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 90/204 (44%), Gaps = 10/204 (4%)
Query: 543 QKGDY--MISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
+KG + M D+ A+ +P+ L + +P G A + AF + S+
Sbjct: 131 EKGPHALMAKCDIQSAFRLLPVNPADFNLLGFKFQDQWYFDKAMPMGCAVSCAAFETFSS 190
Query: 601 ---WVASLLRSRGMRVVVYLDDFLLVN-QDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
WVA + + YLDDFL V +D + + V++ G + +K+
Sbjct: 191 FLEWVARTF-AHSRFITHYLDDFLFVGGRDSKECTHLLRSFVAMAKVFGIPLAGEKTE-G 248
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFV 716
PA + +LGI D LP DK LG+++R L K L +++LG+L+FA V
Sbjct: 249 PATTITYLGIQLDSVRGVSQLPADKLSRLGDLVREALHRKKITLRELQAILGHLNFACKV 308
Query: 717 IPMGRLHSRRIQRQASLLRLGAPH 740
+ GR + R + + APH
Sbjct: 309 VSPGRPFCAHLARATA--GISAPH 330
>gi|308448685|ref|XP_003087722.1| hypothetical protein CRE_23282 [Caenorhabditis remanei]
gi|308253265|gb|EFO97217.1| hypothetical protein CRE_23282 [Caenorhabditis remanei]
Length = 1135
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 115/255 (45%), Gaps = 28/255 (10%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSLIN 534
+S I+ + +TGV+ +D + + + + V K NG R + GLN + L
Sbjct: 506 VSTEIERLTQTGVISPVDHSE-WAAPVVAVKKKNGSIRLCADFSTGLNDAIESNNHPLPT 564
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + L G++ IDL++AY V + Q+ L ++ + + LPFG+ +AP
Sbjct: 565 ADDIFAKLNGGNFFTQIDLAEAYLQVEMDPDSQKLLVINTHLGLFTYNRLPFGVKSAPGI 624
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLV-----NQDPRILEIQGKLAVSILGSLGWIVN 649
F + + + + L V YLDD ++ + R+L++ G++ G+ +
Sbjct: 625 FQQIMDTMLNGLEG----VSTYLDDIIICGSTIEEHNERVLKVFGRIQ-----EYGFRIK 675
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPE-DKQLTLGNILRTLLASKTWNLDSARSLLG 708
++K S + +FLG + + R P+ +K + N+ + N+ +S LG
Sbjct: 676 MEKCSFLMEEI-KFLGFIINKQGRR---PDPEKVRHIKNM------PEPTNVSQVKSFLG 725
Query: 709 YLSF-ASFVIPMGRL 722
+ F FV + RL
Sbjct: 726 LIQFYGQFVKQLFRL 740
>gi|341900941|gb|EGT56876.1| hypothetical protein CAEBREN_09057 [Caenorhabditis brenneri]
Length = 1390
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/407 (20%), Positives = 163/407 (40%), Gaps = 72/407 (17%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSL 532
+A+S + + GVL +D ++ + + + +V K NG R + GLN + + L
Sbjct: 496 AAVSDELDRLTTQGVLAPVDHSS-WAAPIVIVKKKNGSIRMCADYSTGLNDSIEQHRHPL 554
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I + + G + IDL++AY + + + L+++ + + LPFG+ +AP
Sbjct: 555 PTAEDIFTVINGGKFFTQIDLAEAYLQIELDDQSKNLLSINTHKGIYQFQRLPFGVKSAP 614
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
F + + + + + V YLDD ++ I+ K + + G + L+K
Sbjct: 615 GIFQQVMDQLVNGIEG----VSAYLDDIIITGGTIEEHNIRLKKVMCRINEFGMRMKLEK 670
Query: 653 SSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
+ +FLG + D + R P+ +++ ++ + A K ++ +S LG + F
Sbjct: 671 CKFLMEEI-RFLGFIVDKNGRR---PDPEKIA---AIKDMPAPK--DVTQVKSFLGLIQF 721
Query: 713 -ASFVIPMGRLH---------------SRRIQ------RQASLLRLGAPHLTPINPAVLP 750
+FV + RL SR Q ++A L H P P +
Sbjct: 722 YGAFVKHLFRLRPPLDALTKKDTPFKWSRDCQHAFDKIKEALQSDLLLTHFDPTKPII-- 779
Query: 751 KLEWWLNALPLSSPIFPRQVQHFISTDASDLGW----------GSQVDSSFLSGLWSREQ 800
++ DAS G GSQ +S ++ Q
Sbjct: 780 -----------------------VAADASKDGIGGVLLHQYPDGSQKAVFHISKALNKAQ 816
Query: 801 QNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG 847
QN+ +KE FA+ A++ L +++D++ ++S + G
Sbjct: 817 QNYSQIEKEGFALITAVTKFHKYLHGRSFTLKTDHKPLLSIFGDKKG 863
>gi|557718|gb|AAA50456.1| gag, pol and env protein precursor [Caenorhabditis elegans]
Length = 2272
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 112/479 (23%), Positives = 194/479 (40%), Gaps = 61/479 (12%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
A P KP +PL + + IQ+ML V++ +S + + S + LV K +
Sbjct: 1026 AEPIRQKPRPIPL--------ALKPEIRKMIQKMLNQKVIR--ESKSPWSSPVVLVKKKD 1075
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLIN-HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
G R ++ + +N+ + L N + S K Y + D+ ++ +P+ +
Sbjct: 1076 GSIRMCIDYRKVNKVVKNNAHPLPNIEATLQSLAGKKLYTV-FDMIAGFWQIPLDEKSKE 1134
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP 627
A + ++ LPFGL +P F ++ + LL G+ VY+DD L+ ++D
Sbjct: 1135 ITAFAIGSELFEWNVLPFGLVISPALFQGTMEEIIGDLL---GVCAFVYVDDLLIASKDM 1191
Query: 628 RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN 687
K A++ + G + K ++ V ++LG LD + E K +
Sbjct: 1192 EQHLQDVKEALTRIRKSGMKLRASKCHIAKKEV-EYLG--HKVTLDGVETQEVKTDKMKQ 1248
Query: 688 ILRTLLASKTWNLDSARSLLGY-----LSFASFVIPMGRLHSRRI----QRQASLLRLGA 738
R + L S L+GY L+FA + L S ++ +++ +
Sbjct: 1249 FSR---PTNVKELQSFLGLVGYYRKFILNFAQIASSLTSLISAKVAWIWEKEQEIAFQEL 1305
Query: 739 PHLTPINPAVL-PKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----------QV 787
L P + P +E AL P I TDAS G G+ Q
Sbjct: 1306 KKLVCQTPVLAQPDVE---AALKGDRPF-------MIYTDASRKGIGAVLAQEGPDGQQH 1355
Query: 788 DSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG 847
+F S S + +HI E A+ AL ++ + + V +D++ ++S L+
Sbjct: 1356 PIAFASKALSPAETRYHITDLEALAMMFALRRFKTIIYGTAITVFTDHKPLISLLKGSPL 1415
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQI 906
L S I +L D +I LA G N+VAD+LSR P+ L T+++
Sbjct: 1416 ADRLWRWS----IEILEFDVKIVYLA----GKANAVADALSRGGCPPN-ELEEEQTKEL 1465
>gi|336388947|gb|EGO30091.1| hypothetical protein SERLADRAFT_433232 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1240
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 89/382 (23%), Positives = 148/382 (38%), Gaps = 67/382 (17%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
L+ P +S I E L G ++ S + S +F +PK +G R +++ + +N+
Sbjct: 319 LSAPERDEVSSFIDEQLRKGYIR--PSKSPMTSPVFFIPKKDGKKRMIMDYRYVNEHTVK 376
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
+ L ++ L + +DL Y +V IK A + + + FG
Sbjct: 377 NAYPLPLISQLVDKLAGAKILTKMDLRWGYNNVRIKEEDAWKAAFTCHRGSFEPLVMFFG 436
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL--- 644
L +P F S+ N + + + + VV+Y+DD L+ + E K+ +L L
Sbjct: 437 LTNSPATFQSMMNEIFADMENC---VVIYIDDLLIFTKSDDEAE-HDKIVQEVLRRLQER 492
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NL 700
V +K + V FLG++ + RM PE Q L W +
Sbjct: 493 DLFVKPEKCNFKVKEV-DFLGMIIGQNGIRM-NPEKVQAIL-----------EWPEPTRV 539
Query: 701 DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW-W---- 755
R+ LG +G + R I+ A + R P+N L W W
Sbjct: 540 KGVRAFLG----------LGNFYRRFIENFAKITR-------PLNDLTKKDLVWQWGAKE 582
Query: 756 ------LNALPLSSPI--FPRQVQHF-ISTDASDLGWGSQVDSSFLSGLW---------- 796
L ++PI FP + F + TD+SD G+ + LW
Sbjct: 583 QEAFDKLKQAFTTAPILAFPELDKEFRLETDSSDFATGAVLSIKCPDDLWRPCAYLSHSL 642
Query: 797 SREQQNWHINKKEMFAVHQALS 818
S ++N+ I KEM A+ +AL
Sbjct: 643 SPTERNYQIYDKEMLAIIRALE 664
>gi|3319351|gb|AAC26240.1| contains similarity to reverse transcriptases (PFam: rvt.hmm,
score: 116.22) [Arabidopsis thaliana]
Length = 1322
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 99/440 (22%), Positives = 167/440 (37%), Gaps = 69/440 (15%)
Query: 479 HIQEMLETGVLKRL--DSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHF 536
+++ LE + KR ST+ + + + + K +G R ++ +GLNQ K+ L
Sbjct: 538 ELKKQLEDFLGKRFIRPSTSPWRAPMLFMKKKDGSFRLCIDYRGLNQVTVKNKYPLPRID 597
Query: 537 RIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
+ L+ IDL+ Y +PI R A +PFGL AP AF
Sbjct: 598 ELLDQLRGATCFSKIDLTSDYHQIPIAEADVRKTAFRTRYGHFEFVVMPFGLTNAPAAFM 657
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
L N V V++++DD L+ ++ P E+ + L L K S
Sbjct: 658 RLMNSVFQEFLDEF--VIIFIDDILVYSKSPEEHEVHLRRVKEKLREQKLFAKLSKCSFW 715
Query: 657 P------APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSL 706
++ G+ DP + A + W N RS
Sbjct: 716 QREMGFLGHIVSAEGVSVDPE-------------------KIEAIRDWPRPTNAVEIRSF 756
Query: 707 LGYL--------SFASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLN 757
LG FAS PM +L + + P + +P L+ L
Sbjct: 757 LGLAGYYRRFVKGFASMAQPMTKLTGKDV-----------PFVWSPECEEGFVSLKEMLT 805
Query: 758 ALPLSSPIFPRQVQHF-ISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMF 811
+ P+ + P + + + TDAS +G G + ++ S + + N+ + EM
Sbjct: 806 STPVLA--LPEHGEPYSVYTDASGVGLGCVLMQRGKVIAYASRQLRKHEGNYPTHDLEMA 863
Query: 812 AVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI 871
AV AL + L V V +D+++ + Y+ Q L+L + L D+ + I
Sbjct: 864 AVIFALKIWRSYLYGGKVQVFTDHKS-LKYIFTQ---PELNLRQ--RRWMELVADYDLEI 917
Query: 872 LAQFIPGAYNSVADSLSRSK 891
+ PG N + D+LSR +
Sbjct: 918 --AYHPGKANVIVDALSRKR 935
>gi|313242219|emb|CBY34384.1| unnamed protein product [Oikopleura dioica]
Length = 1188
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 95/454 (20%), Positives = 176/454 (38%), Gaps = 66/454 (14%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSLINHFRI 538
+ + + G+L + G+ + + V K +GG R V+NL +N+ L+ + +
Sbjct: 122 VNTLKQQGILVPCPDSKGWNTPISCVGKRDGGVRLVMNLNLTINKLLTETDTYSLPYLDQ 181
Query: 539 PSFLQKG-DYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS 597
+ + G + +DL+ Y+++ IK Q ++ +NG+ L T PFG+ + F
Sbjct: 182 ATEIPIGMKFFGCLDLASGYYNIAIKKEDQVKTSIHWNGEQLMFTRCPFGMRHSGNIFCR 241
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ +++R V V++DD + D + + ++ G++V +K
Sbjct: 242 ALHHALHTMKNR-QHVTVFVDDLCIHTPDFQSFCSTLSELLRLIREFGFVVKGRKV---- 296
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL------- 710
L F + W L R+ E ++ N+ L S N +SLLG L
Sbjct: 297 --CLLFPEVRW---LGRLISAEGQRPDPDNVDAILKMSPPKNFKGLQSLLGLLNWVRSFC 351
Query: 711 -----------SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
+F++ + P+ L ++ R H +N L
Sbjct: 352 SIKSGDNIADTNFSTLIRPISAL--IKVNRPRGPFTWNREHTAALN----------LIKQ 399
Query: 760 PLSSP---IFPRQVQHFI-STDASDL--GW-------GSQVDSSFLSGLWSREQQNWHIN 806
LSSP FP F+ TDAS + GW G S ++ Q +
Sbjct: 400 KLSSPEMIYFPDFSLPFVLCTDASSVASGWCLLQVHEGKSRIIRVGSKTFTSAQSRYSAT 459
Query: 807 KKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQD 866
++E + A+ L + ++ D+Q +V + L+ + LSQ
Sbjct: 460 EREALGICTAVGDCRTYLFGTPFTIRCDHQALVYIDAKISKNDKLARWAS-----YLSQ- 513
Query: 867 WRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSR 900
+ ++PG N+VAD SR P++ +R
Sbjct: 514 --FDFVLTYLPGDENTVADYFSRP---PNYDYTR 542
>gi|384493731|gb|EIE84222.1| hypothetical protein RO3G_08932 [Rhizopus delemar RA 99-880]
Length = 244
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 64/129 (49%), Gaps = 5/129 (3%)
Query: 435 RLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDS 494
RL A L+R+ + P S S+Q + +SL +L+ G ++ +
Sbjct: 115 RLAAKNDLLRVQNHVRFPSSTSLTTSTYNSVQQQNQLLDHEVSL----LLKKGAMEEVPP 170
Query: 495 TT-GFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
TT GF S +F++PK NGG+RPV NLK LN ++ F + + ++ + + S L
Sbjct: 171 TTPGFYSSMFVIPKKNGGSRPVFNLKKLNNYIQAPHFKMETLQEVTKQIKHSNDLTSTYL 230
Query: 554 SQAYFHVPI 562
S + H+P+
Sbjct: 231 SDDFLHIPV 239
>gi|9626105|ref|NP_056803.1| pol polyprotein [Simian foamy virus]
gi|75651203|sp|Q87040.1|POL_SFVCP RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;
Contains: RecName: Full=Protease/Reverse
transcriptase/ribonuclease H; AltName:
Full=p87Pro-RT-RNaseH; Contains: RecName:
Full=Protease/Reverse transcriptase; AltName:
Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H;
Short=RNase H; Contains: RecName: Full=Integrase;
Short=IN; AltName: Full=p42In
gi|514843|gb|AAA19978.1| N-terminus uncertain, partial [Simian foamy virus]
Length = 1146
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 114/244 (46%), Gaps = 15/244 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
++ + I ++L+ GVL +ST + ++ VPK +G R VL+ + +N+ + +
Sbjct: 177 SIQIVIDDLLKQGVLTPQNSTMN--TPVYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQH 234
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + + + Y ++DL+ ++ PI A ++ G T LP G +P
Sbjct: 235 SAGILATIVRQKYKTTLDLANGFWAHPITPDSYWLTAFTWQGKQYCWTRLPQGFLNSPAL 294
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + + LL+ V VY+DD L + +P Q + IL G++V+L+KS
Sbjct: 295 FTADA---VDLLKEVP-NVQVYVDDIYLSHDNPHEHIQQLEKVFQILLQAGYVVSLKKSE 350
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS 714
+ V +FLG ++ + E + LT + L + +L +S+LG L+FA
Sbjct: 351 IGQRTV-EFLGF----NITK----EGRGLTDTFKTKLLNVTPPKDLKQLQSILGLLNFAR 401
Query: 715 FVIP 718
IP
Sbjct: 402 NFIP 405
>gi|326433863|gb|EGD79433.1| hypothetical protein PTSG_12975 [Salpingoeca sp. ATCC 50818]
Length = 1558
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 113/490 (23%), Positives = 198/490 (40%), Gaps = 70/490 (14%)
Query: 426 LRRFVDAWIRLGAPA-----PLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHI 480
L F DA+ + G P P +RI +G P + +P L H + V + +
Sbjct: 437 LEEFKDAFAKPGEPLTKAMLPSMRIETGDTPPVARRP-----YRLSHHESEVVERI---V 488
Query: 481 QEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPS 540
++ + G+++ S + + S + L+ K G R V++ + LN P + L
Sbjct: 489 KDHIAAGIVR--PSFSPWASPVILIKKKTGEYRLVVDYRRLNAVSVPDAYPLPRLDDTLE 546
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
+ + S+DL+ AY VP+ A V T +PFGL AP F N
Sbjct: 547 AMAGAKFFSSLDLASAYHQVPLHPDDCSKTAFVTKNGVFEYTVVPFGLRNAPGHFQRCIN 606
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAV--SILGSLGWIVNLQ----KSS 654
V + + M YLDD ++ + LA ++L L VNLQ K S
Sbjct: 607 TVLADVAGVSM----YLDDIVIFSP-----TFDAHLATLRTVLERL-RAVNLQLRRDKCS 656
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA- 713
L++LG + H + P ++ + + ++ A K ++ R+ LG F
Sbjct: 657 FIQDE-LEYLGHLVSKHGVK---PNPAKI---DAIFSMPAPK--DVRELRAFLGMAGFYR 707
Query: 714 SFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQV 770
FV + + + R + + G P +L+ L + P L P F R
Sbjct: 708 RFVDKFAEIGAPLYALLRDGTEFKFGEPQQVAFR-----RLKAALASSPVLVYPDFARPF 762
Query: 771 QHFISTDASDLGWGSQVDS-----------SFLSGLWSREQQNWHINKKEMFAVHQALSL 819
++TDAS +G G+ + +F+S + ++ ++ + + ++E A+ A+
Sbjct: 763 T--LATDASGVGLGAVLQQRQDGDGKLRPVAFISRVLNKAERKYSVTEQECLALVWAVKK 820
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
P L V +D++ + + T L + L QD ++ + PG
Sbjct: 821 FRPYLHGQRFTVVTDHRALQWLRNLKDPTGRLG------RWALALQDMDFDVVHK--PGT 872
Query: 880 YNSVADSLSR 889
N VAD+LSR
Sbjct: 873 ENVVADALSR 882
>gi|301615962|ref|XP_002937446.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Xenopus (Silurana) tropicalis]
Length = 1553
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 100/452 (22%), Positives = 180/452 (39%), Gaps = 55/452 (12%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P + L+ P + +I E LE G ++ S G + +F V K +G RP ++ +
Sbjct: 515 IPFGKIYPLSEPELKILKDYIDENLEKGFIRPSTSPAG--AGIFFVEKKDGSLRPCIDYR 572
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
LN+ ++ L +P Q+ +DL AY V I+ + A
Sbjct: 573 ELNKITVKNRYPLP---LVPELFQRLRSAKVFSKLDLQGAYNLVRIREGDEWKTAFRTRY 629
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQDPRILEIQGK 635
+PFGL AP A+ +++ + R + VVVYLDD L+ + I +
Sbjct: 630 GHFEYLVMPFGLCNAP---ATFQHFINDIFRDFLDLFVVVYLDDILVFSSSLAEHRIHLR 686
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
S L + ++K + +FLG + + + D + + +IL
Sbjct: 687 RVFSRLRTHQLYAKIEKCEFEKTSI-EFLGFI----ISTEGISMDPR-KISSILE----- 735
Query: 696 KTWNLDSARSLLG-YLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLE 753
W +R + ++ FA+F + SR I +L + L+
Sbjct: 736 --WPTPGSRKAVQRFVGFANFYRKFIKNFSRVIAPITALTSTSKKFFWSREAQGAFENLK 793
Query: 754 WWLNALPL---SSPIFPRQVQHFISTDASDLGWGS----QVDS-------SFLSGLWSRE 799
+ P+ P P V+ DAS++ G+ ++DS +F S S
Sbjct: 794 GRFTSAPILIHPDPSLPFVVE----VDASEVAVGAILSQRMDSLGHLHPVAFFSRKLSSS 849
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEV 857
++N+ + +E+ A+ A L+ ++ V+V SD++ + YLR + L
Sbjct: 850 EKNYDVGDRELLAIKVAFEEWRHFLEGALHPVIVFSDHKN-LEYLR-----SAKRLRPRQ 903
Query: 858 EKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ L + H+ + PG N AD+LSR
Sbjct: 904 ARWALFFSRFNFHV--TYRPGTKNGKADALSR 933
>gi|165974305|dbj|BAF99128.1| pol polyprotein [Magnaporthe oryzae]
Length = 1305
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 97/431 (22%), Positives = 165/431 (38%), Gaps = 48/431 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I E+L ++R T S LF+ + R ++ + +N+F+ ++ +
Sbjct: 346 IDELLRIDFIERTMEETA-ASTLFVPKPQSKEQRFCVDYRWVNKFIKGRQVLAPDVAGTL 404
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S K M ID+ +A+ + + + A LPFGL P F +
Sbjct: 405 SKCGKARRMTKIDIIRAFNRLLMDPNSRYLTAFKTRQGTFQWKVLPFGLKVGPAWFQAFI 464
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDP--RILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
N A L Y DD L+ +D ++ Q + + L G +++KSS
Sbjct: 465 N--AQLNELLDAFASAYADDVLIYTEDKSEQVHFEQTEEVIYRLHKAGLQGDIKKSSFGV 522
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILR----TLLASKTWNLD---SARSLLGYL 710
+ ++LG++ L +G +R + A +W D S ++ +L
Sbjct: 523 FEI-EYLGLL---------------LEIGKGIRIDPKKVEAITSWQWDDVTSVSAVRSFL 566
Query: 711 SFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPLSSPIFPRQ 769
+FV S + + LL+ G P P A L+ + P+ S F
Sbjct: 567 GLCNFVRTFCHHASEQAEPLTRLLKKGVPFEKGPEQKAAFEALKQLVVTAPVMS-FFKPG 625
Query: 770 VQHFISTDASDLG-----WGSQVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSL 819
+ + TDAS W Q D S + S S +QN+ I +E+ AV L
Sbjct: 626 MPVRMDTDASGRATAGVVWQQQDDGSWKPIGYSSKTMSPAEQNYPIQDQELLAVINTLKD 685
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
P L + V +D+Q ++ + TK L ++ L+ I ++ PG
Sbjct: 686 FEPALLGTKFCVFTDHQALIYW-----STKKLLSARQIRWADYLAN---FDITFKYRPGK 737
Query: 880 YNSVADSLSRS 890
N AD+LSR
Sbjct: 738 DNVAADALSRK 748
>gi|308484320|ref|XP_003104360.1| hypothetical protein CRE_22892 [Caenorhabditis remanei]
gi|308258008|gb|EFP01961.1| hypothetical protein CRE_22892 [Caenorhabditis remanei]
Length = 1386
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 97/447 (21%), Positives = 187/447 (41%), Gaps = 71/447 (15%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSL 532
SA+S + + GVL +D ++ + + V K NG R + GLN + + L
Sbjct: 502 SAVSEELDRLTLQGVLTPVDHSS-WAAPTVTVKKKNGSIRMCADYSTGLNDSIEQHRHPL 560
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I + + G Y IDL++AY + + + L ++ + + LPFG+ +AP
Sbjct: 561 PTADSIFTSINGGKYFTQIDLAEAYLQMELSDDSKELLCINTHKGLYQFNRLPFGVKSAP 620
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLV-----NQDPRILEIQGKLAVSILGSLGWI 647
F L + + + + V YLDD ++ D R++++ +S + G
Sbjct: 621 GIFQQLMDQLINGIEG----VASYLDDVIVTGSTVSEHDDRLMKV-----MSRINEFGLK 671
Query: 648 VNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLL 707
+ L+K + V +FLG + D + R P+ +++ ++ + K ++ +S L
Sbjct: 672 MKLEKCNFLMQEV-RFLGFIVDKNGRR---PDPEKIA---AIKNMPVPK--DVSQVKSFL 722
Query: 708 GYLSF-ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW---WLNALPLSS 763
G + F +FV SL RL P L + P W NA
Sbjct: 723 GLIQFYGAFV--------------KSLFRLRPP-LDALTKKDTP-FRWSRACQNAFDQIK 766
Query: 764 PIFPRQ--VQHF-------ISTDASDLGWGSQVDSSF----------LSGLWSREQQNWH 804
+ + H+ ++ DAS G G+ + + +S ++ QQN+
Sbjct: 767 EVLQSDLLLTHYDPNKPIIVAADASQYGIGAVLSHRYPDGSEKAVFHISKSLNKAQQNYS 826
Query: 805 INKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK--IFL 862
+KE FA+ A++ L ++++D++ ++S + G S + +++ + L
Sbjct: 827 QIEKEGFALVTAVTKFHKYLHGRSFILKTDHKPLLSIFGDKKGVPVYS-ANRLQRWAVIL 885
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSR 889
L+ ++I + G AD+LSR
Sbjct: 886 LNYHFKIEYVNTMSFGQ----ADALSR 908
>gi|327267829|ref|XP_003218701.1| PREDICTED: hypothetical protein LOC100555788 [Anolis carolinensis]
Length = 972
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 102/454 (22%), Positives = 166/454 (36%), Gaps = 103/454 (22%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L L P A+ + E L G ++ S T + +F V K G R V + +
Sbjct: 315 LPAGRLYALTVPERQALREFLDENLAKGFIRPSSSPTA--APVFFVAKKTGELRLVCDYR 372
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN++ ++ L + S +Q +DL AY + I+ + A +
Sbjct: 373 ILNKYTIRDRYPLPLISELLSRVQGAKVFTKLDLRGAYNLIRIREGDEWKTAFNTCFGCH 432
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
+PFGL AP F N V L + + V+YLDD L+ ++D + K +
Sbjct: 433 EFRVMPFGLCNAPAVFQRFMNDVFRDLIDQFL--VIYLDDILIFSKDEKEHRQHVKQVLH 490
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLG-------IMWDPHLDRMWLPEDKQLTLGNILRTL 692
L + G K P ++FLG + DPH +
Sbjct: 491 RLRANGLFAKASKCVFH-VPEVEFLGHVVSGRELKMDPH-------------------KV 530
Query: 693 LASKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
A +W L + + + +L FA++ R P+ P
Sbjct: 531 DAVNSWQELKTKKDVQRFLGFANY------------------YREFIPNFHP-------- 564
Query: 752 LEWWLNALPLSSPIFPRQVQHFISTDASDLGWG---SQVDSSFL---SGLWSRE----QQ 801
+ P + DAS G SQ DSS G +SR+ +Q
Sbjct: 565 --------DVDKPF-------VVEADASSYALGAVLSQKDSSGTLRPCGFYSRQLTPFEQ 609
Query: 802 NWHINKKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
N+ I +KE+ A+ A + L+ + ++V+SD+ K+L L +K
Sbjct: 610 NYTIWEKELLAIKVAFEVWRHWLEGARHQIVVRSDH-------------KNLEHLQTAKK 656
Query: 860 IFLLSQDW-----RIHILAQFIPGAYNSVADSLS 888
+ W R + QF+ G N AD+LS
Sbjct: 657 LNQRQIRWALFFSRFNFKVQFVEGKANLRADALS 690
>gi|308481604|ref|XP_003103007.1| hypothetical protein CRE_31176 [Caenorhabditis remanei]
gi|308260710|gb|EFP04663.1| hypothetical protein CRE_31176 [Caenorhabditis remanei]
Length = 1814
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 100/456 (21%), Positives = 186/456 (40%), Gaps = 62/456 (13%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P C ++ + ++ M + G+++ +ST+ + S L +PK NG R V++ +
Sbjct: 875 IPQCRPYRVSPQQREKLGKELKFMKDNGLIE--ESTSPYTSPLLSIPKANGEIRIVIDYR 932
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN + + + N + +G D++Q + +P+ H+ A + V
Sbjct: 933 RLNLITRSRTYIMPNTIDVTEEASRGKLFSVFDIAQGFHTIPMHEAHKERTAFCCHMGVF 992
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP----RILEIQGK 635
+P GL AP F +A + + +++ +DD ++V++D R LE +
Sbjct: 993 QYRYMPMGLKGAPDTF---QRAMAEVEKQFTGTMILNVDDLIVVSRDEEEHLRNLEEFFQ 1049
Query: 636 LAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLAS 695
L + ++G + +KS + + FLG + + + P ++ +R
Sbjct: 1050 LMI----NMGLKLKAEKSQIGRTKI-SFLGFVIE---NNTIQPSGEKT---EAIRKFPTP 1098
Query: 696 KTWNLDSARSLL---GYL-----SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPA 747
T L +S L GY +A V P+ L + ++ G
Sbjct: 1099 TT--LSEVKSFLGMSGYFRRFIKDYAIIVKPLTTLTQKDVE-----FNWGEEQ-----EK 1146
Query: 748 VLPKLEWWLNALPLSSPIF--PRQVQHF-ISTDASDLGWGS-----QVDSSFLSGLWSR- 798
+++ L +S PI PR F + TDAS +G + Q D + SR
Sbjct: 1147 AFEEVKQRL----ISPPILTTPRMDGDFEMHTDASKIGIAAVLLQKQDDELKVIAYASRP 1202
Query: 799 ---EQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
+Q + + E A+ L+ P + V V +D+Q + S L R+ S LL
Sbjct: 1203 TTPVEQRYAAIESEALAITWGLTHYRPYIFGKKVKVVTDHQPLKSLLHRKEKEMSGRLLR 1262
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
I Q + + I+ + PG N +AD+LSR +
Sbjct: 1263 HQAII----QMYDVEIV--YRPGKENPLADALSRQR 1292
>gi|149236387|ref|XP_001524071.1| hypothetical protein LELG_04884 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452447|gb|EDK46703.1| hypothetical protein LELG_04884 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1345
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 90/431 (20%), Positives = 174/431 (40%), Gaps = 38/431 (8%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ EML+ G L+ S+ + + FL+PK +G R +++L+ LN+ + + + +
Sbjct: 477 LSEMLKNGQLQY--SSAAYRNPWFLIPKKDGRHRMLIDLRELNKHVELEGGHPQSTDELT 534
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S L + ID+ AYF VP+ T + + +L LP G F+S+
Sbjct: 535 SELSGRLFNTLIDVQNAYFQVPLDPTTNDVTSFNSPLGLLKYAVLPQGYLNLVSEFSSI- 593
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN-LQKSSLSPA 658
+ +L V+ ++DD + P + ++ L L + ++ L + L
Sbjct: 594 --LQKILSPVAKDVICFIDDIAICG--PTVEDLSESLMKEHLDKVHQVLQLLAHAGLKIN 649
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS------- 711
P + + L PE K + G + + S LG ++
Sbjct: 650 PAKLKVAVEDCEFLGYRITPEGKTIIRGQVDALTNYPRPTTQKKMESFLGLVNYYRQLIV 709
Query: 712 -FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQV 770
FA P+ L + + LL + + L + P+ P +Q+
Sbjct: 710 GFAELTAPLYDLILKAKEHPKHLLEWDDQTINYFQHII-----RVLTSCPVLQPFNDKQI 764
Query: 771 QHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQALSLN 820
I TDAS WG + ++ SG + ++++ I +KE+F+++ L+
Sbjct: 765 TT-IHTDASTESWGGVLQNTDAHGVTRMVLCYSGKFHGSERHYTIYEKELFSIYLTLNAI 823
Query: 821 LPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
PLL ++ + DN+ +V+ L + ++ ++ K + + I+ I G
Sbjct: 824 QPLLVGYKDILYIYCDNKALVTVLDKP--LENSHFVNRTYKWLNYIRSFNYMII--HIDG 879
Query: 879 AYNSVADSLSR 889
N +AD+LSR
Sbjct: 880 KRNVIADALSR 890
>gi|403161344|ref|XP_003890470.1| hypothetical protein PGTG_20926 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171229|gb|EHS64314.1| hypothetical protein PGTG_20926 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1367
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 93/440 (21%), Positives = 191/440 (43%), Gaps = 58/440 (13%)
Query: 490 KRLDSTTGFLSR--LFLVPKGNGGTRPV---------LNLKGLNQFLSPKKFSLI-NHFR 537
+++++T GF L V G+G RP+ + +K +N +++ F + F+
Sbjct: 568 QQMEATFGFFRSNPLGAVVNGDGKIRPINDLSYPKNDIEVKSVNSYVNKLDFETTWDDFK 627
Query: 538 IPSFLQKGDY----MISIDLSQAYFHVPIKTTHQRFLAL-SYNGDVLAMTCLPFGLATAP 592
S D + D AY +P K ++L + ++G++L T + FG
Sbjct: 628 TVSKFFAEDKRSFELALFDWEGAYRQIPTKQDQWKYLLVQDFDGNLLIDTRITFGGVAGC 687
Query: 593 QAFASLSNWVASLLRSRGMRVVVY--LDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL 650
+F ++ +++S + ++ +DD L + + L ++ + S LG + N+
Sbjct: 688 GSFGRPADAWKLIMKSHFNLITIFRWVDDNLFIKEVGADLSMKDVVLKST--ELGVMTNV 745
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK-QLTLGNILRTLLASKTWNLDSARSLLGY 709
+K S +P +F+G +W+ + LPE K + L I +A T++ + L+G
Sbjct: 746 KKFS-DFSPEQKFIGFVWNGVSKTVRLPEGKIEKRLNQIYPFQVAKATFDYEEVEILVGR 804
Query: 710 LSFASFVIPMGRLHSRRIQRQ-ASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPR 768
L+ ++++P R + + R S A TP++ VL L+ W+ L
Sbjct: 805 LNHVTYILPHLRCNLCSLYRWLKSWFWRKAKRATPVD--VLEDLQIWVETL--------- 853
Query: 769 QVQHFISTDAS------DLGWGSQVDSSFLSGL-----WSREQQNWHINKKEMFAVHQAL 817
+F T D+GW +SF G+ W++ + ++ K+ ++ + +
Sbjct: 854 --NNFEHTRLIRWGPPLDVGWVGDASTSFGIGILVGRHWAQFKLIDPLSNKKRISLLETV 911
Query: 818 SLNLPLL--------QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRI 869
++ L L+ + ++V +DN T + + TK E +KI + +
Sbjct: 912 AIRLGLIMLLKLRDQRGKSLIVWTDNTTTENSINNM-KTKDREANDEWKKIQAILLRESV 970
Query: 870 HILAQFIPGAYNSVADSLSR 889
+++A+ + N AD+LSR
Sbjct: 971 NLIARRVASKDNK-ADALSR 989
>gi|340385330|ref|XP_003391163.1| PREDICTED: hypothetical protein LOC100633740, partial [Amphimedon
queenslandica]
Length = 449
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 79/171 (46%), Gaps = 13/171 (7%)
Query: 452 PFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGG 511
P KP +P L+ + ++EM G++++ S++ + S L +V K +GG
Sbjct: 262 PIRQKPYRIPQAYLKDVMK--------ELEEMERDGIIEK--SSSEWASPLVIVKKKDGG 311
Query: 512 TRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLA 571
R ++ + LNQ + + + + +++ ++DL++ Y+ VP+ + A
Sbjct: 312 IRLCVDYRQLNQVTKFDAYPMPRVEELLDTIGDAEFITTLDLAKGYWQVPVNEKDREKTA 371
Query: 572 LSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
+ + +PFGL+ AP F + + +LR VYLDD ++
Sbjct: 372 FTSPRGLYQFKTMPFGLSGAPATFQRMMD---EILRGTETFAGVYLDDIVI 419
>gi|308475526|ref|XP_003099981.1| hypothetical protein CRE_20844 [Caenorhabditis remanei]
gi|308266033|gb|EFP09986.1| hypothetical protein CRE_20844 [Caenorhabditis remanei]
Length = 355
Score = 61.2 bits (147), Expect = 3e-06, Method: Composition-based stats.
Identities = 50/190 (26%), Positives = 88/190 (46%), Gaps = 14/190 (7%)
Query: 469 ATPVSSAMSLHIQEMLETGVLKRL--DSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
A P+ A+ I++M++ + +R+ +S + + S + LV K +G R ++ + +N +
Sbjct: 150 ARPIPLAIRGEIRKMIQKMLSQRVIRESKSPWASPVVLVKKKDGSVRMCIDYRKVNLLIK 209
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
L N L + DL Y+ +P+K + A + ++ LPF
Sbjct: 210 YNAHPLPNIETTLLSLAGKKVFTTFDLLAGYWQLPLKEESKEITAFAIGSELFEWNVLPF 269
Query: 587 GLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR--------ILEIQGKLA 637
GLAT+P F A++ V LL G V VY+DD L+ +++ + ILE K
Sbjct: 270 GLATSPAIFQAAMECVVGDLL---GTCVFVYVDDLLIASENMKEHAIHVQTILERIEKSG 326
Query: 638 VSILGSLGWI 647
+ + S WI
Sbjct: 327 MKLKASKCWI 336
>gi|58268442|ref|XP_571377.1| retrotransposon nucleocapsid protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|57227612|gb|AAW44070.1| retrotransposon nucleocapsid protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 1484
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 112/521 (21%), Positives = 200/521 (38%), Gaps = 83/521 (15%)
Query: 410 NPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPFS--AKPPLVPLCSLQH 467
+ P AE+V +++D + + A + IP PP P+ +L
Sbjct: 495 DKPPKENTDAEIVPKEYHQYLDVFDKKSADTLPEHRSFDHHIPLEEGKNPPFGPIYNLSE 554
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
A+ ++ E L+ G ++ +S G + + V K +G R ++ +G+N+
Sbjct: 555 TEL---EALREYLDENLKKGFIRPSESPAG--APILFVKKKDGSLRMCVDYRGINKITIK 609
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
++ L + L+ IDL AY + IK + A +PFG
Sbjct: 610 NRYPLPLIAELLDRLKSAKVFTKIDLRGAYNLLRIKAGEEWKTAFRTRYGHFEYLVMPFG 669
Query: 588 LATAPQAFASLSNW-VASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL-- 644
L AP +F L N LL + V++YLDD L+ + D LE + +L L
Sbjct: 670 LTNAPASFQHLMNHNFRDLL---DIFVIIYLDDILIYSPD---LETHQSHVIQVLDRLRQ 723
Query: 645 -GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----N 699
V K V +FLG ++ D+ L++ + + + W N
Sbjct: 724 TQLYVKASKCEFHQTSV-EFLG----------FVVSDQGLSMDT--KKVKSITEWPTPRN 770
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP--------- 750
L +S LG+ +F + R I+ +S+ A L + LP
Sbjct: 771 LRDTQSFLGFCNF----------YRRFIKDYSSI----AKPLIDLTKKDLPFVWEEPQRT 816
Query: 751 ---KLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----QVDS-----SFLSGLWSR 798
L+ ++ L P + Q + TDASD ++D +F S
Sbjct: 817 SFEALKKSFTSVDLLRHYDPTK-QLILETDASDYAIAGILSHEIDKKLEPVAFFSHKMLP 875
Query: 799 EQQNWHINKKEMFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSE 856
+ N+ I+ KEM A+ A + + + V +D++++ ++ + + + SE
Sbjct: 876 AELNYPIHDKEMLAIVSAFKEWRHYFEGARETIRVYTDHRSLEYFMTTKQLNRRQARWSE 935
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWH 897
FL D+ I + PG + D+L+R D+H
Sbjct: 936 ----FLADFDFNI----IYRPGVQGTKPDALTRRH---DYH 965
>gi|310751834|gb|ADP09366.1| pol protein [Human immunodeficiency virus 1]
Length = 402
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 124/282 (43%), Gaps = 29/282 (10%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
A++ +EM + G + ++ + + +F++ + NG R +++L+ LN+ + + +
Sbjct: 131 KALTEICEEMEKEGKISKIGPENPYNTPVFVIKRKNGKWRKLIDLRELNK-RTQDFWEVQ 189
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL---SYNGDVLAM----TCLPF 586
P+ L K M +D+ AYF VP+ +++ A S N + + LP
Sbjct: 190 LGIPHPAGLNKKKSMTVLDVGDAYFSVPLYEDFRKYTAFTIPSINNETPGIRYQYNVLPM 249
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVV--YLDDFLLVNQDPRILEIQGKLAVSILGSL 644
G +P F S + R++ +V+ Y+DD L V D I + + K+ L
Sbjct: 250 GWKGSPSIFQSSMTKILEPFRTKNPEIVICQYMDD-LYVGSDLEIGQHRAKIKELREHLL 308
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
W + P +++G ++ H D+ W + QL +W ++ +
Sbjct: 309 KWGLTTPDQKHQEEPPFRWMG--YELHPDK-WTVQPIQLP---------EKDSWTVNDIQ 356
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
L+G L++AS + P +++ LLR GA LT I P
Sbjct: 357 KLVGKLNWASQIYP-----GIKVRHLCKLLR-GAKALTAIVP 392
>gi|326666839|ref|XP_003198393.1| PREDICTED: hypothetical protein LOC100331523 [Danio rerio]
Length = 1440
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 100/415 (24%), Positives = 169/415 (40%), Gaps = 56/415 (13%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
A+ +++ML+ GV++ S + + S + +VPK +G R + + LN+ + +
Sbjct: 924 AIEEEVEQMLKLGVIE--PSRSPWSSPIVMVPKSDGTLRFCNDFRRLNEISEFDGYPMPR 981
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ L + Y+ ++DL++ Y+ VP+ + A S LPFGL AP
Sbjct: 982 VDELLDRLGRARYISTLDLTKGYWQVPLSEEAKAKTAFSTPSGHWQYRTLPFGLHGAPAT 1041
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L + V LR YLDD ++ ++ + + +S L G N +K
Sbjct: 1042 FQRLMDIV---LRPHQAYAAAYLDDVVIHSETWEDHLDRLRRVLSELRRAGLTANPRKCH 1098
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL---- 710
L+ ++LG L + P++K++ +R A K R+ LG
Sbjct: 1099 LALHEA-KYLGFRVGRGLIQ---PQEKKV---EAIRN--APKPETKTQVRAFLGLAGYYR 1149
Query: 711 ----SFASFVIPMGRLHSRRIQRQASLLRLGAPH---LTPINPAVLPKLEWWLNALP-LS 762
+FAS P+ L R G P T L K++ L + P L
Sbjct: 1150 CFIPNFASLAAPL-----------TDLTRKGQPEKICWTTAAEEALHKVKMALTSEPVLR 1198
Query: 763 SPIFPRQVQHFISTDASDLGWG---SQVDSS------FLSGLWSREQQNWHINKKEMFAV 813
+P F + TDASD G G SQ+ ++S ++N+ +KE A+
Sbjct: 1199 APDF--ACPFLLQTDASDTGLGAVLSQIQEGEEHPILYISRKLLPAEKNYATVEKEALAI 1256
Query: 814 HQA-LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
A L L LL S +V + + ++ R T + V + FL QD+
Sbjct: 1257 KWAVLELRYYLLGRSFTLV--TDHAPLQWMARAKDTN-----ARVTRWFLALQDF 1304
>gi|432888034|ref|XP_004075034.1| PREDICTED: uncharacterized protein LOC101156905 [Oryzias latipes]
Length = 1290
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 85/194 (43%), Gaps = 25/194 (12%)
Query: 483 MLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFL 542
ML G+++ S + + + + LVPK +G R ++ + LN KF RI +
Sbjct: 912 MLSLGIIQ--PSKSEWCNPVVLVPKKDGSIRFCIDFRYLNAM---SKFDSYPTPRIDDLI 966
Query: 543 Q---KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ K Y+ +IDL + Y+ VP+ Q + A + T LPFGL AP F L
Sbjct: 967 ERLGKAKYLTTIDLCKGYWQVPLTARSQEYTAFRTPWGLFEFTVLPFGLHGAPATFQRLM 1026
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKS---- 653
+ V L YLDD ++ + D + ++ + L S+G VN K+
Sbjct: 1027 DQVLGGLDGFA---CAYLDDIVVYSTTWDEHLEHLK---VLECLHSVGLTVNPAKAEAAF 1080
Query: 654 -----SLSPAPVLQ 662
SLS PVL
Sbjct: 1081 RDIQRSLSTNPVLH 1094
>gi|56407688|gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea batatas]
Length = 1358
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 82/195 (42%), Gaps = 5/195 (2%)
Query: 473 SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL 532
+ + + E+L+ G ++ +S + + LVPK +G R ++ + +N ++ +
Sbjct: 674 AKEIQRQVDELLQAGFIQ--ESLSPCAVPVLLVPKKDGTWRMCVDCRAINNITVKYRYPI 731
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
+ L IDL + Y + ++ + A + +PFGL AP
Sbjct: 732 PRLDDMLDELHGAKIFSKIDLRRGYHQIRMQKGDEWKTAFKTKNGLYEWLVMPFGLTNAP 791
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQK 652
F L N V L G VVVY DD L+ ++DP+ I K +L NL+K
Sbjct: 792 STFMRLMNHV--LRNFIGKFVVVYFDDILIYSKDPQKHIIHLKEVFLVLRREQLYANLEK 849
Query: 653 SSLSPAPVLQFLGIM 667
V+ FLG +
Sbjct: 850 CYFGVESVV-FLGFI 863
>gi|9629908|ref|NP_045937.1| Pr gag-pro-pol [Walleye dermal sarcoma virus]
gi|2801525|gb|AAC82611.1| Pr gag-pro-pol [Walleye dermal sarcoma virus]
Length = 1751
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 99/225 (44%), Gaps = 11/225 (4%)
Query: 508 GNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKG-DYMISIDLSQAYFHVPIKTTH 566
G R + +L+ +N ++P + + + S L + IDLS A+F VPI
Sbjct: 815 GRDEYRMIHDLRAINNIVAPLTAVVASPTTVLSNLAPSLHWFTVIDLSNAFFSVPIHKDS 874
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFA-SLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
Q A ++ G T LP G +P F+ +L + + + +Y+DD L+ ++
Sbjct: 875 QYLFAFTFEGHQYTWTVLPQGFIHSPTLFSQALYQSLHKIKFKISSEICIYMDDVLIASK 934
Query: 626 DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTL 685
D + + L S G V+ +K L V+ +LG + P R LP D+++T+
Sbjct: 935 DRDTNLKDTAVMLQHLASEGHKVSKKKLQLCQQEVV-YLGQLLTPE-GRKILP-DRKVTV 991
Query: 686 GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQ 730
+ + R+ LG + + IP +HS+ +++Q
Sbjct: 992 SQF------QQPTTIRQIRAFLGLVGYCRHWIPEFSIHSKFLEKQ 1030
>gi|338819284|sp|O92815.2|POL_WDSV RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix
protein p10; Short=MA; Contains: RecName: Full=p20;
Contains: RecName: Full=Capsid protein p25; Short=CA;
Contains: RecName: Full=Nucleocapsid protein p14;
Short=NC-pol; Contains: RecName: Full=Protease p15;
Short=PR; Contains: RecName: Full=Reverse
transcriptase/ribonuclease H p90; Short=RT; Contains:
RecName: Full=Integrase p46; Short=IN
Length = 1752
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 99/225 (44%), Gaps = 11/225 (4%)
Query: 508 GNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKG-DYMISIDLSQAYFHVPIKTTH 566
G R + +L+ +N ++P + + + S L + IDLS A+F VPI
Sbjct: 816 GRDEYRMIHDLRAINNIVAPLTAVVASPTTVLSNLAPSLHWFTVIDLSNAFFSVPIHKDS 875
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFA-SLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
Q A ++ G T LP G +P F+ +L + + + +Y+DD L+ ++
Sbjct: 876 QYLFAFTFEGHQYTWTVLPQGFIHSPTLFSQALYQSLHKIKFKISSEICIYMDDVLIASK 935
Query: 626 DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTL 685
D + + L S G V+ +K L V+ +LG + P R LP D+++T+
Sbjct: 936 DRDTNLKDTAVMLQHLASEGHKVSKKKLQLCQQEVV-YLGQLLTPE-GRKILP-DRKVTV 992
Query: 686 GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQ 730
+ + R+ LG + + IP +HS+ +++Q
Sbjct: 993 SQF------QQPTTIRQIRAFLGLVGYCRHWIPEFSIHSKFLEKQ 1031
>gi|1326016|emb|CAA86713.1| TY3-2 orfB [Saccharomyces cerevisiae]
Length = 1099
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 89/401 (22%), Positives = 164/401 (40%), Gaps = 39/401 (9%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LVPK +G R ++ + LN+ F L + S + ++DL Y +P
Sbjct: 385 VVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIP 444
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV-VYLDDF 620
++ + A T +PFGL AP FA ++A R +R V VYLDD
Sbjct: 445 MEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFA---RYMADTFRD--LRFVNVYLDDI 499
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L+ ++ P + L + IV +K + +FLG + + ++ +
Sbjct: 500 LIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEET-EFLG--YSIGIQKIAPLQH 556
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
K +R KT + A+ LG +++ IP + +I + L
Sbjct: 557 K----CAAIRDFPTPKT--VKQAQRFLGMINYYRRFIP----NCSKIAQPIQLFICDKSQ 606
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------- 790
T + KL+ L P+ P F + + ++TDAS G G+ +VD+
Sbjct: 607 WTEKQDKAIEKLKAALCNSPVLVP-FNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVV 665
Query: 791 -FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+ S Q+N+ + E+ + +AL +L +++D+ +++S + +
Sbjct: 666 GYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPAR 725
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + L + D+ + LA G N VAD++SR+
Sbjct: 726 RVQRWLDD----LATYDFTLEYLA----GPKNVVADAISRA 758
>gi|301606450|ref|XP_002932846.1| PREDICTED: hypothetical protein LOC100495514 [Xenopus (Silurana)
tropicalis]
Length = 1254
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 69/133 (51%), Gaps = 11/133 (8%)
Query: 407 SPVNPPADSRIGAELVGGRLRRFVDAWIRLGAPAPLVRIVS-GYAIPFSAKPP---LVPL 462
+P+ P + + G + VGGRL++F+ W ++ I+S GY IPF+ + P +P
Sbjct: 415 NPIQPDSTRQRGIQSVGGRLQQFLGTWETHVTDKWVLSIISQGYRIPFTPQLPQGRFLPS 474
Query: 463 CSLQHLATPVSSAMSLHIQEMLETGVLKRL---DSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+ H + A + +L +GV++ + GF S LFLV K G RPVLNLK
Sbjct: 475 STAPHKQHLLQQA----VNTLLLSGVVEDVPEHHKYRGFYSNLFLVRKKEGSFRPVLNLK 530
Query: 520 GLNQFLSPKKFSL 532
LN + ++F +
Sbjct: 531 PLNPMVLNQRFKM 543
>gi|317138877|ref|XP_003189097.1| hypothetical protein AOR_1_1350184 [Aspergillus oryzae RIB40]
Length = 1605
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 74/170 (43%), Gaps = 9/170 (5%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP PL +L V + ++ +MLE G ++ S G + + V K +G R +
Sbjct: 592 PPYGPLYNLSQHELQV---LREYLDKMLERGWIRHSTSAAG--APVLFVRKPDGSLRLCV 646
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN ++ L + L + Y +DL AY + I+ + A
Sbjct: 647 DYRGLNAVTVKNRYPLPRIDELMDRLVEAKYFTKLDLRDAYHRIRIQKGDEWKTAFRTRY 706
Query: 577 DVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
T +PFGL AP F A ++ + +L VVYLDD L+ +Q
Sbjct: 707 GHFEYTVMPFGLCNAPATFQAYINEAMKGILDD---YCVVYLDDILIYSQ 753
>gi|403172329|ref|XP_003889331.1| hypothetical protein PGTG_21973 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375169805|gb|EHS63969.1| hypothetical protein PGTG_21973 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1385
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 106/510 (20%), Positives = 206/510 (40%), Gaps = 70/510 (13%)
Query: 435 RLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDS 494
+ G ++ G+ F P L TP + +L Q+ +E + K +++
Sbjct: 534 KAGLTEEFADVIRGFKEGFDQGIPNHNLGPATPYFTPPNHQSALMAQDKIEQSMKKEIEA 593
Query: 495 TTGF-------LSRLF---------LVPKGNGGTRPVLNLK---------GLNQFLSPKK 529
F L + F G+G RP+ +L +N F++
Sbjct: 594 GRMFGPYTHKQLMKKFSFFRTNPLGAAINGDGSVRPINDLSFPRHDPLTPSVNSFVNKLD 653
Query: 530 FSLI-NHFR-IPSFLQKGD---YMISIDLSQAYFHVPIKTTHQRFLAL-SYNGDVLAMTC 583
++ + F + F ++ + D +AY +P + +L + +NG +L T
Sbjct: 654 YATTWDDFESVSKFFRRQTSPLLLALFDWEKAYRQIPTAKSQWAYLMVRDFNGGILIDTR 713
Query: 584 LPFGLATAPQAFASLSN-WVASLLRSRGMRVVV-YLDDFLLVNQDPRILEIQGKLAVSIL 641
+ FG +F ++ W +L + V ++DD L V +E++ + S
Sbjct: 714 IAFGGVAGCGSFGRPADAWKQLMLHEFDLVTVFRWVDDNLFVKHPDSKVEMEDIVDRS-- 771
Query: 642 GSLGWIVNLQKSSLSPAPVLQ-FLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT-WN 699
LG V + SP Q ++G +W+ + LPEDK+ ++ L T ++
Sbjct: 772 EKLG--VKTNSTKYSPFKEEQKYIGFIWNATKKSVRLPEDKKYQRVQQIKEFLKPDTEFS 829
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQR--QASLLRLGAPHLT-PINPAVLPKLEWWL 756
A + G L+ S+++P R + + R A + R H+ PI V LE WL
Sbjct: 830 FKQAEVMAGRLNHVSYLLPQLRCYINSLYRWMNAWVHR----HIDLPIPKDVRVDLEEWL 885
Query: 757 NALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGL-----WSR----EQQNWHINK 807
L + ++ + D ++GW +S+ G+ W++ E+ N
Sbjct: 886 TTL-----LTFKETRMISDPDPIEIGWMGDASTSYGIGVTIGRRWAQFQLTEEWNQGPEP 940
Query: 808 KEMFAVHQALSLNLPLL--------QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
+ A + +++ L L+ Q V+V +DN T S + ++ +K ++ E +
Sbjct: 941 RRDIAWLETVAIRLGLIALAQLNIKQGKTVIVWTDNTTTESVILKR-KSKHHAVNEEWKI 999
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
I + + + I+++ + N VAD+LSR
Sbjct: 1000 IQRMLVEMELDIVSRRVSSGEN-VADALSR 1028
>gi|254587292|emb|CAX83703.1| Gag-Pol polyprotein [Schistosoma japonicum]
Length = 1367
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 141/338 (41%), Gaps = 21/338 (6%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
AKP P + + A P + + + + GV+ + S + + + + ++ K NG R
Sbjct: 492 AKPVFRPKRPVPYAALP---KVDEELNRLQQQGVITPV-SYSAWAAPIVVIKKPNGSIRI 547
Query: 515 VLNL-KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
+ GLN L + L + + L G + +DL+ AY + + L ++
Sbjct: 548 CADFSTGLNAALEQHHYPLPVPADLFTMLNGGKFFAKLDLADAYLQEEVAEESRELLTIN 607
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ + LPFG+ TAP F L + + + + V YLDD L+V P L +
Sbjct: 608 THRGMFQYNRLPFGVKTAPSIFQQLMDTILAGIAG----VATYLDDILIVATSPEELRER 663
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
+ + G+ + +K V ++LG ++D R P+ + NI L
Sbjct: 664 TTNVLQRISENGFRLRPEKCQFFLESV-KYLGFIFDAKGRR---PDPE-----NIRAIQL 714
Query: 694 ASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
N+ + RS LG +S+ S +P +H R L + + TP A KL+
Sbjct: 715 MPAPTNVSALRSFLGLVSYYSAFVP--SMHVIRSPLNHLLHKDVTWNWTPDCEAAFNKLK 772
Query: 754 WWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSF 791
+++ L + P + ++ DAS G G+ + F
Sbjct: 773 SLISSRLLLTHYDP-SMPIIVAADASSSGLGAVISHQF 809
>gi|126643676|gb|ABO25842.1| gag-pro-pol polyprotein [Walleye dermal sarcoma virus]
Length = 1751
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 99/225 (44%), Gaps = 11/225 (4%)
Query: 508 GNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKG-DYMISIDLSQAYFHVPIKTTH 566
G R + +L+ +N ++P + + + S L + IDLS A+F VPI
Sbjct: 815 GRDEYRMIHDLRAINNIVAPLTAVVASPTTVLSNLAPSLHWFTVIDLSNAFFSVPIHKDS 874
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFA-SLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
Q A ++ G T LP G +P F+ +L + + + +Y+DD L+ ++
Sbjct: 875 QYLFAFTFEGHQYTWTVLPQGFIHSPTLFSQALYQSLHKIKFKISSEICIYMDDVLIASK 934
Query: 626 DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTL 685
D + + L S G V+ +K L V+ +LG + P R LP D+++T+
Sbjct: 935 DRDTNLKDTAVMLQHLASEGHKVSKKKLQLCQQEVV-YLGQLLTPE-GRKILP-DRKVTV 991
Query: 686 GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQ 730
+ + R+ LG + + IP +HS+ +++Q
Sbjct: 992 SQF------QQPTTIRQIRAFLGLVGYCRHWIPEFSIHSKFLEKQ 1030
>gi|84801|pir||S08405 hypothetical protein 2 - silkworm transposon mag
Length = 1178
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 101/225 (44%), Gaps = 8/225 (3%)
Query: 447 SGYAIPFSAKPPLVPL-CSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLV 505
+G +P VP+ C + + + + + ML GV+K +D + + + L +V
Sbjct: 226 TGGTAELIVRPDAVPIYCRARPVPYALRERVDAELDAMLAAGVIKPVDHSD-WATPLVVV 284
Query: 506 PKGNGGTRPVLNLK-GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKT 564
K +GG R + K LN+ L+ +F + + S L + +DLSQAY + +
Sbjct: 285 RKADGGLRICADYKVTLNKVLAIDRFPVPKMEDLFSNLSGNKFFTKLDLSQAYNQIVLSE 344
Query: 565 THQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVN 624
+ ++ + + + L +GLA++P F L + ++ ++ VVV+ DD L+ N
Sbjct: 345 RSSEYTVINTHRGLFKYSRLVYGLASSPGIFQKL---MVNMFKNVP-NVVVFYDDILIRN 400
Query: 625 QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD 669
QD K + IL G + K V ++LG + D
Sbjct: 401 QDLDSHLKSIKEVLDILERYGLKIKRSKCEFMVTEV-RYLGFIID 444
>gi|326484367|gb|EGE08377.1| hypothetical protein TEQG_08797 [Trichophyton equinum CBS 127.97]
Length = 365
Score = 60.8 bits (146), Expect = 4e-06, Method: Composition-based stats.
Identities = 42/149 (28%), Positives = 71/149 (47%), Gaps = 6/149 (4%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
A+ ++++++ G ++ S G + + VPK +GG R ++ +GLNQ ++ L
Sbjct: 150 ALREYLEKVISKGWIRPSKSLAG--APILFVPKKDGGLRMCVDYRGLNQITIKNRYPLPL 207
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ L K ++ I++ AY +PI+ + A +PFGLA AP
Sbjct: 208 ISELLDRLLKAKFLGKINIRDAYTRIPIREKDRWMTAFRTRYGYFEYCIMPFGLANAPTT 267
Query: 595 FASLSNWV-ASLLRSRGMRVVVYLDDFLL 622
F + N + A LL VVYLDD L+
Sbjct: 268 FQAYINEIFADLLDQ---FCVVYLDDILI 293
>gi|315464686|emb|CBQ72271.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 1284
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 151/359 (42%), Gaps = 29/359 (8%)
Query: 552 DLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS---NWVASLLRS 608
DL A+ H+ R L NG L FG ++P F + +WV +
Sbjct: 652 DLQDAFHHIVTCRADARLLGFQLNGIAYQENTLTFGSKSSPWLFNLFTKALHWVVASCLP 711
Query: 609 RGMRVVVYLDDFLL---VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG 665
+ YL+DF +DP + LA +LG+ ++L+K + L+ LG
Sbjct: 712 PSTPLNHYLNDFFGAVPAGEDPTVPVRTLTLACH---ALGFSLSLEK-TFHSCSRLEILG 767
Query: 666 IMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSR 725
I D + + + + ++ + LL +L + + G L F + V+P GR +
Sbjct: 768 IEIDSVVQTVGITDTRRSCILAACDRLLQQPHCSLVELQQIAGLLQFVTQVVPHGRTYLG 827
Query: 726 RIQ---RQASLLRLGAPHLTPINPAVLPKLEWWLNALPL---SSPIFPR-QVQHFISTDA 778
RI R+A R A HL + + + +L WW + L SS + P V I TDA
Sbjct: 828 RIYAALRRAH--RSPASHLR-LAKSTIVELHWWCDLLSSWCGSSILLPSPLVAVHIWTDA 884
Query: 779 SDLGWGSQVD-SSFLSGLWS----REQQNWHINKKEMFAVHQALSLNLPLLQ---SSVVM 830
G +D +F S ++S R ++ +I E AV A+ LP L S ++
Sbjct: 885 LLQGLSGHLDLMTFPSAVFSCSVPRRHRHKNICFLEALAVLDAIRQFLPQLHVRSVSTLV 944
Query: 831 VQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
V DN+ V LR L+ + + +IF L ++ + A N +AD LSR
Sbjct: 945 VHVDNENVEFGLRSGHSCNPLT-QTLLREIFGLCFFHGFSLVPVRMSSADNVLADLLSR 1002
>gi|315056475|ref|XP_003177612.1| hypothetical protein MGYG_08943 [Arthroderma gypseum CBS 118893]
gi|311339458|gb|EFQ98660.1| hypothetical protein MGYG_08943 [Arthroderma gypseum CBS 118893]
Length = 574
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 84/209 (40%), Gaps = 8/209 (3%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP +PL +L V + ++ ML+ G ++ S G + L PK +G R +
Sbjct: 150 PPHLPLYNLSAKELQV---LREYLDTMLKRGWIRESKSPAG--APLLFAPKADGSLRTCV 204
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN+ + +L + L + +DL +AY V IK + A
Sbjct: 205 DYRGLNKMTIKNRLTLPRVDEMLDRLAGAMFFTKLDLREAYHRVRIKEGDEWKTAFRTRY 264
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKL 636
+PFGLA AP F N V + L + +VYLDD L+ + KL
Sbjct: 265 GHYEYLVMPFGLANAPATFQGYINRVLTGLVD--IACIVYLDDILIFSASREKYIRHIKL 322
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLG 665
+ L L K + V FLG
Sbjct: 323 VLDRLRKYRLYAKLSKCAFFQVSV-DFLG 350
>gi|1945323|emb|CAA97115.1| TY3B [Saccharomyces cerevisiae]
gi|1945324|emb|CAA97117.1| TY3B [Saccharomyces cerevisiae]
Length = 1547
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 89/401 (22%), Positives = 164/401 (40%), Gaps = 39/401 (9%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LVPK +G R ++ + LN+ F L + S + ++DL Y +P
Sbjct: 636 VVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIP 695
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV-VYLDDF 620
++ + A T +PFGL AP FA ++A R +R V VYLDD
Sbjct: 696 MEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFAR---YMADTFRD--LRFVNVYLDDI 750
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L+ ++ P + L + IV +K + +FLG + + ++ +
Sbjct: 751 LIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEET-EFLG--YSIGIQKIAPLQH 807
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
K +R KT + A+ LG +++ IP + +I + L
Sbjct: 808 KCAA----IRDFPTPKT--VKQAQRFLGMINYYRRFIP----NCSKIAQPIQLFICDKSQ 857
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------- 790
T + KL+ L P+ P F + + ++TDAS G G+ +VD+
Sbjct: 858 WTEKQDKAIDKLKDALCNSPVLVP-FNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVV 916
Query: 791 -FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+ S Q+N+ + E+ + +AL +L +++D+ +++S + +
Sbjct: 917 GYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPAR 976
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + L + D+ + LA G N VAD++SR+
Sbjct: 977 RVQRWLDD----LATYDFTLEYLA----GPKNVVADAISRA 1009
>gi|158726337|gb|ABW80581.1| polyprotein [Dahlia mosaic virus]
Length = 812
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 97/432 (22%), Positives = 171/432 (39%), Gaps = 47/432 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVP----KGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++ + S + +S FLV K G R V+N K +N +L
Sbjct: 405 QIKELLDLKLI--IPSKSPHMSPAFLVENEAEKRRGKKRRVVNYKAINAATKGDSHNLPC 462
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + L+ + D ++ V + Q A + +PFGL AP
Sbjct: 463 MQELLTLLRGKTIFSTFDCKSGFWQVLLNEESQLLTAFTCPDGHYQWRVVPFGLKQAPSI 522
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + + LR VY+DD ++ + A+ + G I++ +K++
Sbjct: 523 F---QRHMQNALRGLENYCTVYVDDIIVFSDSEEKHYFHVLSALKTIEKYGIILSKKKAN 579
Query: 655 LSPAPVLQFLGIMWD--PHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
L + FLG+ D H + + E+ L TL K + LG L++
Sbjct: 580 LFKTKI-NFLGLEIDQGTHCPQKHILEN----LHKFPDTLEDKK-----HLQRFLGVLTY 629
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQH 772
A IP +L R Q L + + + K++ L + P P++ +
Sbjct: 630 AESYIP--KLAELRKPLQVKLKKDYVWEWKQSDTNYIKKIKKNLTSFP--KLYLPKEKEF 685
Query: 773 FI-STDASDLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMFAVHQALSLNLP 822
I TDAS+ WG + + + SG + ++N+H N+KE+ AV +S
Sbjct: 686 LIIETDASNDFWGGVLKAKTADKEEVCRYTSGSFKTAERNYHSNEKELLAVKNTISKFSI 745
Query: 823 LLQSSVVMVQSDNQTVVSYLRRQ--GGTKSLSLLSEVEKIFLLSQDW--RIHILAQFIPG 878
L +V++DN+ +L+ + G K L+ Q W R + + G
Sbjct: 746 YLTPVKFLVRTDNRNFTYFLKTKISGDNKQGRLVR--------WQMWFSRYSFDIEHLEG 797
Query: 879 AYNSVADSLSRS 890
+ N +AD L R
Sbjct: 798 SKNVLADCLPRD 809
>gi|6321547|ref|NP_011624.1| gag-pol fusion protein [Saccharomyces cerevisiae S288c]
gi|285812302|tpg|DAA08202.1| TPA: gag-pol fusion protein [Saccharomyces cerevisiae S288c]
Length = 1547
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 89/401 (22%), Positives = 164/401 (40%), Gaps = 39/401 (9%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LVPK +G R ++ + LN+ F L + S + ++DL Y +P
Sbjct: 636 VVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIP 695
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV-VYLDDF 620
++ + A T +PFGL AP FA ++A R +R V VYLDD
Sbjct: 696 MEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFAR---YMADTFRD--LRFVNVYLDDI 750
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L+ ++ P + L + IV +K + +FLG + + ++ +
Sbjct: 751 LIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEET-EFLG--YSIGIQKIAPLQH 807
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
K +R KT + A+ LG +++ IP + +I + L
Sbjct: 808 KCAA----IRDFPTPKT--VKQAQRFLGMINYYRRFIP----NCSKIAQPIQLFICDKSQ 857
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------- 790
T + KL+ L P+ P F + + ++TDAS G G+ +VD+
Sbjct: 858 WTEKQDKAIDKLKDALCNSPVLVP-FNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVV 916
Query: 791 -FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+ S Q+N+ + E+ + +AL +L +++D+ +++S + +
Sbjct: 917 GYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPAR 976
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + L + D+ + LA G N VAD++SR+
Sbjct: 977 RVQRWLDD----LATYDFTLEYLA----GPKNVVADAISRA 1009
>gi|134034966|sp|Q99315.3|YG31B_YEAST RecName: Full=Transposon Ty3-G Gag-Pol polyprotein; AltName:
Full=Gag3-Pol3; AltName: Full=Transposon Ty3-1 TYA-TYB
polyprotein; Contains: RecName: Full=Capsid protein;
Short=CA; AltName: Full=p24; Contains: RecName:
Full=Spacer peptide p3; Contains: RecName:
Full=Nucleocapsid protein p11; Short=NC; Contains:
RecName: Full=Ty3 protease; Short=PR; AltName: Full=p16;
Contains: RecName: Full=Spacer peptide J; Contains:
RecName: Full=Reverse transcriptase/ribonuclease H;
Short=RT; Short=RT-RH; AltName: Full=p55; Contains:
RecName: Full=Integrase p61; Short=IN; Contains: RecName:
Full=Integrase p58; Short=IN
Length = 1547
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 89/401 (22%), Positives = 164/401 (40%), Gaps = 39/401 (9%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LVPK +G R ++ + LN+ F L + S + ++DL Y +P
Sbjct: 636 VVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIP 695
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV-VYLDDF 620
++ + A T +PFGL AP FA ++A R +R V VYLDD
Sbjct: 696 MEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFAR---YMADTFRD--LRFVNVYLDDI 750
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L+ ++ P + L + IV +K + +FLG + + ++ +
Sbjct: 751 LIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEET-EFLG--YSIGIQKIAPLQH 807
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
K +R KT + A+ LG +++ IP + +I + L
Sbjct: 808 KCAA----IRDFPTPKT--VKQAQRFLGMINYYRRFIP----NCSKIAQPIQLFICDKSQ 857
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------- 790
T + KL+ L P+ P F + + ++TDAS G G+ +VD+
Sbjct: 858 WTEKQDKAIDKLKDALCNSPVLVP-FNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVV 916
Query: 791 -FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+ S Q+N+ + E+ + +AL +L +++D+ +++S + +
Sbjct: 917 GYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPAR 976
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + L + D+ + LA G N VAD++SR+
Sbjct: 977 RVQRWLDD----LATYDFTLEYLA----GPKNVVADAISRA 1009
>gi|56266281|emb|CAG70342.1| CP, RT, RNaseH and protease polyprotein [Cacao swollen shoot virus]
Length = 1868
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 95/455 (20%), Positives = 178/455 (39%), Gaps = 54/455 (11%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
++HL + H+Q +L+ GV++ S + + F+V G +G R
Sbjct: 1328 IKHLTPAMEKQFKRHVQALLDIGVIR--PSKSKHRTTAFIVESGTTIDPVTKKTVHGKER 1385
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1386 MVFNYKRLNDNTEKDQYSLPGIQTILKRVGNKKVFSKFDLKSGFHQVAMAEESIPWTAFW 1445
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + + VY+DD L+ ++
Sbjct: 1446 VPQGLYEWLVMPFGLKNAPAVFQRKMD---QCFQGTEDFIAVYIDDILVFSETMEEHAEH 1502
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
+ I G +++ K S++ + +FLG + + K ++++ ++
Sbjct: 1503 IATMLKICQKNGLVLSPSKMSIAQREI-EFLGTI---------ISNGKMKLQAHVIKKII 1552
Query: 694 ASKTWNLDSA---RSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAVL 749
+ L++ RS LG L++A IP +GR S + + G L + ++
Sbjct: 1553 SKAQLELETTKGLRSFLGLLNYARVYIPNLGRKLSPLYAKTSP---TGERRLNRQDWRLI 1609
Query: 750 PKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS-------QVDS-------SFLSGL 795
+++ + LP I P Q I +D GWG + DS ++ SG
Sbjct: 1610 NEIKGMVQKLP-DLDIPPAQCCTVIESDGCMEGWGGICKWKTVKEDSRSTERICAYASGK 1668
Query: 796 WSREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLL 854
+S + E+ A+ +AL S + L ++V++D Q +V++ + G K +
Sbjct: 1669 FSTLKSTI---DAEIHALIKALESFKIFYLDKKHLIVRTDCQAIVTFHNKTSGHKPSRIR 1725
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+L + + + I G N +AD+LSR
Sbjct: 1726 WITFSDYLTGLG--VQVTIEHIEGKENYLADTLSR 1758
>gi|38258172|sp|Q8I7P9.1|POL5_DROME RecName: Full=Retrovirus-related Pol polyprotein from transposon
opus; Includes: RecName: Full=Protease; Includes:
RecName: Full=Reverse transcriptase; Includes: RecName:
Full=Endonuclease
gi|27368146|gb|AAN87271.1| ORF2 [Drosophila melanogaster]
Length = 1003
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 103/439 (23%), Positives = 190/439 (43%), Gaps = 56/439 (12%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPK-----GNGGTRPVLNLKGLNQFLSPKKFSLIN 534
I E+L+ G+++ S + + S +++VPK G R V++ K LN P + + +
Sbjct: 143 IDELLQDGIIR--PSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPD 200
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ L Y ++DL+ + + +K + A S LPFGL AP
Sbjct: 201 INATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAI 260
Query: 595 FASLSNWVASLLRSR-GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F + + +LR G VY+DD ++ ++D +L ++ L VNL+KS
Sbjct: 261 FQRM---IDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKS 317
Query: 654 SLSPAPVLQFL-------GIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSL 706
V +FL GI DP R ++ + R L + + + +
Sbjct: 318 HFLDTQV-EFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYR----KFI 372
Query: 707 LGYLSFASFVIPMGR-LHSRRIQRQASL--LRLGAPHLTPINPAVLPKLEWWLNALPLSS 763
Y A + + R L++ Q+S + L L N L ++ SS
Sbjct: 373 QDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFND---------LKSILCSS 423
Query: 764 PI--FPRQVQHF-ISTDASDLGWG---SQVDS------SFLSGLWSREQQNWHINKKEMF 811
I FP + F ++TDAS+ G SQ D +++S ++ ++N+ +KEM
Sbjct: 424 EILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEML 483
Query: 812 AVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIH 870
A+ +L +L L + + V +D+Q + L G ++ + +++++ +++
Sbjct: 484 AIIWSLDNLRAYLYGAGTIKVYTDHQPLTFAL----GNRNFN--AKLKRWKARIEEYNCE 537
Query: 871 ILAQFIPGAYNSVADSLSR 889
++ + PG N VAD+LSR
Sbjct: 538 LI--YKPGKSNVVADALSR 554
>gi|38346034|emb|CAD39763.2| OSJNBa0059D20.6 [Oryza sativa Japonica Group]
Length = 1470
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 82/185 (44%), Gaps = 13/185 (7%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR 501
++ ++ G A P S +P +P+ L+ L I+E+ E G ++ S G ++
Sbjct: 533 IIDLIPGTA-PISKRPYRMPVNELEELKK--------QIRELQEKGFVRPSSSPWG--AQ 581
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ V K +G R ++ + LN+ K+ L + L+ IDL Y +
Sbjct: 582 VLFVKKKDGSMRMCVDYRSLNEVTIKNKYPLPRIDNLFDQLKGAKVFSKIDLRSGYHQLK 641
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
I+T A S + T + FGL AP F +L N V + VVV++DD L
Sbjct: 642 IRTEDIPKTAFSTRYGLYEFTVMSFGLTNAPAYFMNLMNKV--FMEYLDKFVVVFIDDIL 699
Query: 622 LVNQD 626
+ ++D
Sbjct: 700 IYSKD 704
>gi|384486663|gb|EIE78843.1| hypothetical protein RO3G_03548 [Rhizopus delemar RA 99-880]
Length = 419
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
Query: 480 IQEMLETGVLK-----RLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
IQ++L ++ +T GF S +F++PK +GG RPV NLK LNQ+L F +
Sbjct: 337 IQDLLSKQAIEPVSEVEFRTTPGFYSSMFVIPKKDGGIRPVCNLKRLNQYLDAPHFKMET 396
Query: 535 HFRIPSFLQKGDYMISIDLS 554
+ + DY++SIDLS
Sbjct: 397 IREVALMINPNDYLVSIDLS 416
>gi|325668305|gb|ADZ44590.1| polyprotein [Dahlia mosaic virus]
Length = 806
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 96/432 (22%), Positives = 170/432 (39%), Gaps = 47/432 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVP----KGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
I+E+LE ++ + S + +S FLV K R V+N K +N +L
Sbjct: 399 QIKELLELKLI--IPSKSPHMSPAFLVENEAEKRRRKKRMVVNYKAINAATKGDSHNLPC 456
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + L+ Y S D ++ V + Q A + +PFGL AP
Sbjct: 457 MQELLTLLRGKTYFSSFDCKSGFWQVLLDEESQLLTAFTCPNGHYQWKVVPFGLKQAPSI 516
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F + + LR VY+D ++ + + + G I++ +K++
Sbjct: 517 F---QRHMQNALRGLENYCTVYVDGIIVFSDSEEKYYFHVLSILKTIEKYGIILSKKKAN 573
Query: 655 LSPAPVLQFLGIMWD--PHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF 712
L + FLG+ D H + + E+ L TL K + LG L++
Sbjct: 574 LFKTKI-NFLGLEIDQGTHCPQKHILEN----LHKFPDTLEDKK-----HLQRFLGVLTY 623
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQH 772
A IP +L R Q L + + + K++ L + P P++ +
Sbjct: 624 AESYIP--KLAELRRPLQVKLKKDYVWEWKQSDTNYIKKIKKNLTSFP--KLYLPKEKEF 679
Query: 773 -FISTDASDLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMFAVHQALSLNLP 822
I TDAS+ WG + + + SG + ++N+H N+K++ AV +S
Sbjct: 680 PIIETDASNDFWGGVLKAKTADKEAVCRYTSGSFKTAEKNYHSNEKKLLAVKNTISKFSI 739
Query: 823 LLQSSVVMVQSDNQTVVSYLRRQ--GGTKSLSLLSEVEKIFLLSQDW--RIHILAQFIPG 878
L +V++DN+ +L+ + G K L+ Q W R + + G
Sbjct: 740 YLTPVKFLVRTDNRNFTYFLKTKISGDNKQGRLIR--------WQMWFSRYSFDIEHLEG 791
Query: 879 AYNSVADSLSRS 890
+ N +AD L+R
Sbjct: 792 SKNVLADCLTRD 803
>gi|341895838|gb|EGT51773.1| hypothetical protein CAEBREN_12621 [Caenorhabditis brenneri]
Length = 1272
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 56/250 (22%), Positives = 110/250 (44%), Gaps = 16/250 (6%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSLI 533
A+S + + + GVL +D + + + + +V K NG R + GLN + + L
Sbjct: 493 AVSEELDRLTQQGVLTPVDHS-AWAAPVVIVKKKNGSIRMCADYSTGLNDSIEQHRHPLP 551
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
I + + G Y IDL++AY + + + L ++ + + LPFG+ +AP
Sbjct: 552 TADDIFTIINGGKYFTQIDLAEAYLQIELSDQAKDLLCINTHKGIYQFQLLPFGVKSAPG 611
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F + + + + + YLDD ++ + K +S + G + L+K
Sbjct: 612 IFQQVMDQLINGIEG----AAAYLDDIIITGSTIEEHNTRLKKVMSRIHEFGMRMKLEKC 667
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSF- 712
V +FLG + D + R P+ +++ ++ + K ++ +S LG + F
Sbjct: 668 KFLIEEV-RFLGFIVDENGRR---PDPEKV---KAIKNMPVPK--DITQVKSFLGLIQFY 718
Query: 713 ASFVIPMGRL 722
+FV + RL
Sbjct: 719 GAFVNSLFRL 728
>gi|154304149|ref|XP_001552480.1| hypothetical protein BC1G_09710 [Botryotinia fuckeliana B05.10]
Length = 839
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 76/173 (43%), Gaps = 7/173 (4%)
Query: 454 SAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTR 513
+KPP P+ SL + V + ++ E + G ++R S+ G + + VPK GG R
Sbjct: 199 DSKPPWGPIYSLSEEESIV---LREYLVEYQKKGWIRRSISSAG--APIMFVPKKGGGYR 253
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
++ +GLN+ + L L++G +DL AY + I+ + A
Sbjct: 254 LCVDYRGLNRITKKDRTPLPLISESLDRLRQGVVFTKLDLRDAYHRIRIREGDEWKTAFR 313
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+PFGL AP F + N S L VVYLDD L+ ++D
Sbjct: 314 TRYGQFEYLVMPFGLTNAPATFQTYINQALSGLTD--TICVVYLDDILIYSED 364
>gi|156058736|ref|XP_001595291.1| hypothetical protein SS1G_03380 [Sclerotinia sclerotiorum 1980]
gi|154701167|gb|EDO00906.1| hypothetical protein SS1G_03380 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 1004
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/198 (23%), Positives = 78/198 (39%), Gaps = 9/198 (4%)
Query: 476 MSLHIQEMLETGVLKRLD------STTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKK 529
MSL E ++ +L LD S F + + V K NGG R ++ + LN +
Sbjct: 78 MSLDELEAVKAYILDNLDKGFIEPSQAPFAAPILFVKKPNGGLRLCIDYRTLNALTRKDR 137
Query: 530 FSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLA 589
+ L + + +D+ QA+ + + + M LPFGL
Sbjct: 138 YPLPLIDETLARITHAKIFTKLDIRQAFHRIRMDPDSEELTTFRTRYGAYKMKVLPFGLT 197
Query: 590 TAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN 649
P + N V L + YLDD L+ + DP E Q KL + L + G +
Sbjct: 198 NGPATYQRYMNEV--LFDYLDIFCTAYLDDILIYSNDPLEHEYQVKLVLERLRNAGLQAD 255
Query: 650 LQKSSLSPAPVLQFLGIM 667
++K + ++LG +
Sbjct: 256 IKKCEFNITKT-KYLGFI 272
>gi|57863925|gb|AAS55774.2| putative polyprotein [Oryza sativa Japonica Group]
Length = 2108
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 104/438 (23%), Positives = 172/438 (39%), Gaps = 71/438 (16%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+++ML+ G+++ S + F S + LV K +G R ++ + LN K+ L +
Sbjct: 1333 VKDMLQKGIIQ--PSASPFSSPVLLVKKKDGTWRFCVDYRHLNAITVKNKYPLPIIDELM 1390
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L + +DL Y + + + A + LPFGL +AP F +
Sbjct: 1391 DELAGACWFSKLDLRSGYHQIRMAVGEEAKTAFKTHNGHFEFKVLPFGLTSAPATFQGVM 1450
Query: 600 NWV-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
N V A LR V+V++DD L+ + R LE + +L I++ S P
Sbjct: 1451 NTVLADQLRQ---NVLVFVDDILVYS---RTLEEHKNHLRQVFETLRHIISADGVSTDPE 1504
Query: 659 PVLQFLGIMWDPHLDRMW-LPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
+ + R W +P ++ RS LG + +
Sbjct: 1505 KI----------QVVRQWPVPV-------------------SVKDVRSFLGLAGYYRKFV 1535
Query: 718 PMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFIS 775
+ S+ + + LL G P + T + L+ L + P L P F Q +
Sbjct: 1536 RHFGIISKPLTK---LLCKGQPFIWTQHHQEAFDTLKQSLISAPVLVMPDF--QKMFVVE 1590
Query: 776 TDASDLGWGS-----QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDASD G G+ Q +FLS Q +KE A+ A+ P LQ +
Sbjct: 1591 TDASDRGIGAVLMQDQHSVAFLSKALGHRTQVLSTYEKESLAIILAVDHWRPYLQHDDFL 1650
Query: 831 VQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+++D+++ +++L Q T S + + LL +RI + G N AD+LS
Sbjct: 1651 IRTDHRS-LAFLDNQLLTTSWQYKAMTK---LLGLRYRI----VYKKGLENGAADALSHR 1702
Query: 891 KS------------LPDW 896
S LPDW
Sbjct: 1703 SSDGLPILSALSVGLPDW 1720
>gi|65362561|ref|YP_233110.1| putative polyprotein [Banana streak virus strain Acuminata Vietnam]
gi|53830363|gb|AAU95075.1| putative polyprotein [Banana streak virus strain Acuminata Vietnam]
Length = 1898
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 100/466 (21%), Positives = 180/466 (38%), Gaps = 76/466 (16%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN-----------GGTR 513
L+H+ + +M H+ ++LE V++ ST+ + +V G G R
Sbjct: 1301 LKHVTPAMKESMKKHVDKLLELKVIR--PSTSKHRTTAIIVQSGTEIDPLTGKEKRGKER 1358
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I S + K DL + V + + A
Sbjct: 1359 LVFNYKRLNDNTEKDQYSLPGINTIISRIGKSKIYSKFDLKSGFHQVAMDPESIPWTAFW 1418
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + R + VY+DD L+ + +
Sbjct: 1419 AIDGLYEWLVMPFGLKNAPAIFQRKMD---NCFRGTEEFIAVYIDDILIFSDNISDHRKH 1475
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN------ 687
+ I + G +++ K + A + FLG T+GN
Sbjct: 1476 LSKFLEICKANGLVLSPTKMKIG-AKEIDFLG-----------------ATIGNSKIKLQ 1517
Query: 688 --ILRTLLASKTWNLDSARSL---LGYLSFASFVIP-----MGRLHSRRIQRQASLLRLG 737
I++ ++ +K L + L LG L++A IP +G L+S+ S+ G
Sbjct: 1518 PHIIKKIIETKDEELKETKGLRKWLGVLNYARAYIPNLGKTLGPLYSK-----TSI--NG 1570
Query: 738 APHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQV-------DSS 790
+ + V+ ++ + LP I P + + TD GWG D+
Sbjct: 1571 EKKMNSQDWKVVQLIKNQVQNLP-DLDIPPAEATMVLETDGCMEGWGGVCKWKLHPSDTR 1629
Query: 791 FLSGLWSREQQNWHINKK----EMFAVHQALS-LNLPLLQSSVVMVQSDNQTVVSYLRRQ 845
+ + +H K E+ AV +L + L +++++D+Q +V++ ++Q
Sbjct: 1630 LAEKVCAYASGRYHPIKSTIDAEVHAVINSLEKFKIYYLDKKELIIRTDSQAIVAFYKKQ 1689
Query: 846 GGTK--SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
K L ++ I L I++ + I G N +AD+LSR
Sbjct: 1690 ADHKPSRTRWLMLIDYITGLG----INVKFEHIDGKENVLADTLSR 1731
>gi|329351119|gb|AEB91352.1| polyprotein, partial [Verticillium dahliae VdLs.17]
Length = 1129
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 102/442 (23%), Positives = 173/442 (39%), Gaps = 64/442 (14%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ +++E L G ++ S G+ + VPK NG R ++ + LN + L
Sbjct: 242 DTLDEYLKENLRKGYIRPSTSPAGYP--ILFVPKKNGKERLCVDYRQLNDITIKNCYPLP 299
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ L ++ ++DL AY + IK + A +PFGL AP
Sbjct: 300 LISELRDALAGANWFTALDLKGAYNLIRIKDGEEWKTAFRTRRGHYEYLVMPFGLTNAPA 359
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKL--AVSILGSLGWIVNLQ 651
F ++ N V L + VVVYLDD L+ ++ + E +G + ++ L +V +
Sbjct: 360 TFQNMINDV--LREFLDVFVVVYLDDILIFSKT--MEEHKGHVHQVLTRLHQHELLVEPE 415
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLL 707
K+ V FLG P RM E ++ A + W N+ R+ L
Sbjct: 416 KAKFHTQEV-DFLGYTITPGEIRM---EKSKVA---------AIREWPTPKNVKDVRAFL 462
Query: 708 GYLSFASFVI--------PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL 759
G+++F + P+ L + I+ + + A I AVL + L
Sbjct: 463 GFVNFYRRFLKGYSKTANPLTNLTVKEIEFAWNEPQEKA--FRQIIDAVLSE-----PVL 515
Query: 760 PLSSPIFPRQVQHFISTDASDLGWGSQVDS----------SFLSGLWSREQQNWHINKKE 809
+ P P +V+ TDASD G Q+ +F S + N+ I+ KE
Sbjct: 516 RMIDPEKPMEVE----TDASDFAIGGQLGQRDDQGRLHPVAFFSKKLHGPELNYQIHDKE 571
Query: 810 MFAVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDW 867
+ A+ +A L + V V +D++ + + + K +E F + +
Sbjct: 572 LMAIIEAFKEWRTYLSGARHEVKVYTDHKNLAHFTTNKDLNKRQIRWAEFLSEFNFTIIY 631
Query: 868 RIHILAQFIPGAYNSVADSLSR 889
R G+ N AD LSR
Sbjct: 632 R--------KGSENGRADILSR 645
>gi|390340432|ref|XP_003725242.1| PREDICTED: uncharacterized protein LOC100891783 [Strongylocentrotus
purpuratus]
Length = 637
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 57/104 (54%), Gaps = 1/104 (0%)
Query: 839 VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP-DWH 897
++Y+ RQGGT S++L +++ + R+ +A IPG N +AD LSR K LP +W
Sbjct: 1 MAYINRQGGTHSVALNELASQLWAWCKGARVFPIASHIPGEENIIADFLSRGKCLPSEWT 60
Query: 898 LSRSATEQIFLKWGVPCIDLFASRVSAVVPNHFQVSRHVAILLL 941
LS + Q+ +GV IDLFA+ ++ +P R +L
Sbjct: 61 LSPTVFRQLVRVFGVLGIDLFATSLNHRLPRFCSRVREPGAFVL 104
>gi|308464934|ref|XP_003094730.1| hypothetical protein CRE_29046 [Caenorhabditis remanei]
gi|308247003|gb|EFO90955.1| hypothetical protein CRE_29046 [Caenorhabditis remanei]
Length = 2823
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 95/460 (20%), Positives = 187/460 (40%), Gaps = 72/460 (15%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNL 518
VP + + + ++EM++ G+++ DS F + + LV K + + R ++
Sbjct: 1873 VPQGKVYRVPLEKRKEVETQLKEMIKQGIIRPTDSP--FSAPIVLVRKADKTSWRFTVDF 1930
Query: 519 KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDV 578
+ LN P + + N I ++D Q + +P++ H A + +
Sbjct: 1931 RALNALTQPVQSIIPNIHEILDLCAGKILYTTLDFQQGFHQIPVEPAHCARTAFACHMGA 1990
Query: 579 LAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP-----RILEIQ 633
+P GL +P F + N +L++ RV VY+DD +L ++ P I E+
Sbjct: 1991 FEYIRMPMGLKGSPGTFQRVMN---TLIKEIQARVFVYIDDMVLTSESPSQHVRDIEEVL 2047
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
K+ S G + +K + P +++LG + + + + D + T +
Sbjct: 2048 DKIEKS-----GMKLRPEKCKFA-LPEIRYLGFI----ISKSGIHPDPEKT--KAIDEYP 2095
Query: 694 ASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLE 753
KT + R+ +G SF I + ++ AP +T + P E
Sbjct: 2096 TPKT--VKEVRAFIGMASFYRRFI-------------ENFSKIAAPIMT-LTKKDQP-FE 2138
Query: 754 WW---------LNALPLSSPIF--PRQVQHF-ISTDASDLGWG-----SQVDS------- 789
W L A +PI P+ + F I D+S G G +Q D
Sbjct: 2139 WTNECEEAFKELKAALTKNPILVAPKLGKPFVIEVDSSGKGVGAVLFQAQDDEGKDLRVI 2198
Query: 790 SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
++ S +++ ++ + + E + A+ P + + ++ +D+ + S L R+
Sbjct: 2199 AYASRVYNGAEKRYPAIELEGLGLVYAVQQFRPYIDGARTLIITDHAPLKSLLHRK---- 2254
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L+ + K ++ Q++ I I ++ PG N V D+LSR
Sbjct: 2255 --DLIGRMGKYQIVLQEYDIQI--EYRPGKQNIVCDTLSR 2290
>gi|427791991|gb|JAA61447.1| Putative tick transposon, partial [Rhipicephalus pulchellus]
Length = 1140
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 100/430 (23%), Positives = 177/430 (41%), Gaps = 50/430 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+Q ML++G+++ ST+ F S + +VPK +G R + + +N+ F + I
Sbjct: 404 LQGMLDSGIIR--PSTSAFASPITIVPKEDGSLRLCTDYRLINRQTELFPFPMPRIDEII 461
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ IDL + Y+ VP+K ++F A D+ LPFG + F L
Sbjct: 462 EETGGCKWFSRIDLCKGYWQVPLKEETKKFTAFVTPFDIYEYNRLPFGWKNSGAWFQKLM 521
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVN--QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
N V L G VY+DD ++ + ++ I + L L +N++KS
Sbjct: 522 NSV--LNDYIGKFCNVYVDDIIVYSRTKEDHIQHLSDVLEALSRAKLK--INVKKSEFFC 577
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASFV 716
V+ FLG R++ + K ++ R K ++L S R LG F +F+
Sbjct: 578 QTVV-FLG--------RVFNGKTKSTKEESVQRISKLVKPYDLHSLRVFLGLAGHFRTFI 628
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHL--TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
+ ++ + + +L + P + L ++ + L+ P F + +
Sbjct: 629 ----KNYATKTRCLTALTQKEVPFIWTEECERCYLDLVDIISSDPALTLPDFSLPFE--L 682
Query: 775 STDASDLGWGS--------------QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLN 820
TDAS G G+ Q + S +S+ + N+ +KE AV AL
Sbjct: 683 CTDASHYGTGAVLYQHDIKKPTGRQQQVIGYYSYTFSKAEVNYATTEKEALAVVMALRYF 742
Query: 821 LPLLQSSVVMVQSDNQTVVSYLR-RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
L+ + +DNQ + L+ Q + +SE+++ S D + PG
Sbjct: 743 KSYLEGRNFKLFTDNQALTHLLKLAQPKGRLARWISEIQQ---YSFD------VEHRPGL 793
Query: 880 YNSVADSLSR 889
+ AD+LSR
Sbjct: 794 KHRDADALSR 803
>gi|4539436|emb|CAB40024.1| putative reverse-transcriptase-like protein [Arabidopsis thaliana]
gi|7267755|emb|CAB78181.1| putative reverse-transcriptase-like protein [Arabidopsis thaliana]
Length = 1240
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 102/471 (21%), Positives = 180/471 (38%), Gaps = 78/471 (16%)
Query: 452 PFSAK--PPLVPLCSLQHLATPVSSA-MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG 508
PF+ + P PL + P A + ++++L G ++ ST+ + + + V K
Sbjct: 480 PFTIELEPGTAPLSKAPYRMAPAEMAELKKQLKDLLGKGFIR--PSTSPWGAPVLFVKKK 537
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
+G R ++ + LN+ ++ L + L+ IDL+ Y +PI R
Sbjct: 538 DGSFRLCIDYRELNRVTVKNRYPLPRIDELLDQLRGATCFSKIDLTSGYHQIPIAEADVR 597
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
A +PFGL AP F L N V V++++DD L+ ++ P
Sbjct: 598 KTAFRTRYGHFEFVVMPFGLTNAPAVFMRLMNSVFQEFLDEF--VIIFIDDILVYSKSPE 655
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSP------APVLQFLGIMWDPHLDRMWLPEDKQ 682
E+ + + L L K S ++ G+ DP
Sbjct: 656 EQEVHLRRVMEKLREQKLFAKLSKCSFWQREMGFLGHIVSAEGVSVDPE----------- 704
Query: 683 LTLGNILRTLLASKTW----NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGA 738
+ A + W N RS LG+ + + R ++ AS+ A
Sbjct: 705 --------KIEAIRDWPRPTNATEIRSFLGWAGY----------YRRFVKGFASM----A 742
Query: 739 PHLTPINPAVLPKLEWW----------LNALPLSSPI--FPRQVQ-HFISTDASDLGWGS 785
+T + +P + W L + S+P+ P Q + + TDAS +G G
Sbjct: 743 QPMTKLTGKDVPFV--WSQECEEGFVSLKEMLTSTPVLALPEHGQPYMVYTDASRVGLGC 800
Query: 786 QVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVS 840
+ ++ S + + N+ + EM AV AL + L V V +D+++ +
Sbjct: 801 VLMQHGKVIAYASRQLMKHEGNYPTHDLEMAAVIFALKIWRSYLYGGKVQVFTDHKS-LK 859
Query: 841 YLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSK 891
Y+ Q L+L + L D+ + I + PG N V D+LSR +
Sbjct: 860 YIFTQ---PELNLRQ--RRWMELVADYDLEI--AYHPGKANVVVDALSRKR 903
>gi|341898320|gb|EGT54255.1| hypothetical protein CAEBREN_31218 [Caenorhabditis brenneri]
Length = 1014
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 91/417 (21%), Positives = 173/417 (41%), Gaps = 59/417 (14%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
L +VPK NG R V++ + LN P+ + + + + KG D++ + H+
Sbjct: 117 LLIVPKKNGDIRIVIDYRKLNLITRPRTYIMPHTLDVTEEASKGKIFSVFDIASGFHHIR 176
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
+ H++ A + V +P GL +P F + + + + ++VY+DD +
Sbjct: 177 MNEDHKQRTAFCCHLGVFQYRVMPMGLKGSPDTF---NQAMEEVKQKYSGSMIVYVDDIV 233
Query: 622 LVN--QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPE 679
LV+ ++ + +++ + I +G + +KS + + FLG ++ +
Sbjct: 234 LVSETEEQHVKDLEEFFQLMI--KMGLKLKAEKSQIGRTRIT-FLGF----DIENNTIQP 286
Query: 680 DKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP 739
+ + T +R A + N+ + LG S+ IP + I + L + GA
Sbjct: 287 NGEKTKA--IREFPAPR--NITEVKQFLGMCSYFRRFIPGYAILVNPINK---LNKKGA- 338
Query: 740 HLTPINPAVLPKLEW---------WLNALPLSSPIF--PRQVQHF-ISTDASDLGWGS-- 785
+ EW + + +S PI P F + TDAS +G +
Sbjct: 339 -----------EFEWKQEQQEAFEKVKEILMSPPILTTPDMTGTFEMHTDASKIGLSAVL 387
Query: 786 -QVDSSFLSGLWSREQQNWHINKK------EMFAVHQALSLNLPLLQSSVVMVQSDNQTV 838
Q + L + + + + E A+ L + P + + V+V +D+ +
Sbjct: 388 MQKQADLLKVVAYASRPTTAVESRYPPIELEALAITWGLIHHKPYVFNRKVLVVTDHLPL 447
Query: 839 VSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
S L R+ T S L+ E I Q + + I+ + PG N VAD+LSR + +PD
Sbjct: 448 KSLLHRKEKTMSGRLMRH-EAII---QQFDVEIV--YRPGKENYVADALSRQR-VPD 497
>gi|384496879|gb|EIE87370.1| hypothetical protein RO3G_12081 [Rhizopus delemar RA 99-880]
Length = 571
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 89/208 (42%), Gaps = 11/208 (5%)
Query: 420 ELVGGRLRRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQ-HLATPVSSAMSL 478
E+V GR F D +++ + L R V G+ KP P+ S+ L + +
Sbjct: 96 EMVEGR---FADCFVK---NSGLGR-VKGFEHKIVLKPDATPVRSVPFRLTWEENEFLEK 148
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+ MLE G++ R + + S F V K +G R VL+ + LNQ + L +
Sbjct: 149 ELNNMLELGII-RPSKSGAYSSPCFFVKKKDGSRRMVLDYRKLNQMTVSNAYPLPLISEL 207
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
L + ++D++ Y+ VP+ + +PFGL +AP F ++
Sbjct: 208 LDSLGGAKFFTTMDMAFGYWQVPMAEDSIEKTGFVTKKGIYEFLVMPFGLTSAPSTFQAM 267
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQD 626
N + L G +V++DD L+ D
Sbjct: 268 MNSI--LGEYIGKFCLVFIDDVLIFGGD 293
>gi|18157521|dbj|BAB83836.1| LReO_3 [Oryzias latipes]
Length = 1498
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 96/433 (22%), Positives = 177/433 (40%), Gaps = 52/433 (12%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
+ I+ ML+ GV++ ST+ + S + LVPK +G R ++ + LN + + +
Sbjct: 1095 LEKEIELMLKLGVIE--PSTSEWCSPVVLVPKKDGSLRFCIDFRYLNAVSKIQSYPMPRI 1152
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ + K ++ ++DLS+ Y+ V + + A + +PFGL AP F
Sbjct: 1153 DELLERVGKSKFITTLDLSKGYWQVALAQETKELTAFTTPYGKFQFKVMPFGLQGAPATF 1212
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
L + +LR YLDD ++ + R + + ++ + G +N K +
Sbjct: 1213 QRLMD---EILRDFPQFAAAYLDDVIIFSHSWRDHMSHLRHVLHLIKAAGLTINPGKCVV 1269
Query: 656 SPAPVLQFLGI-----MWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
+ V ++LG + P + ++ ++ Q+ R+ +G +
Sbjct: 1270 AQQQV-EYLGHVVGQGLVKPRVGKVEAIQEYQIPTTK-------------KKVRAFVGLV 1315
Query: 711 SFASFVIPMGRLHSRRIQRQASLLRLGAPH---LTPINPAVLPKLEWWLNALP-LSSPIF 766
+ S IP + R L R AP+ T A L+ + + L SP F
Sbjct: 1316 GWYSKFIPH---FADRAAVLTDLTRASAPNKVVWTEDCDAAFRDLKGAITSESVLYSPDF 1372
Query: 767 PRQVQHFISTDASDLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMFAVHQAL 817
R + TDAS +G G+ + FLS + + +KE A+ A+
Sbjct: 1373 TRPF--ILQTDASAVGLGAVLVQEAEGERHPVLFLSRKLLDRETRYSTVEKECLAMKWAI 1430
Query: 818 -SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFI 876
SL LL + ++D++ + +LRR + + + +L Q + + Q+
Sbjct: 1431 DSLRYYLLGRHFCL-ETDHR-ALQWLRRMKDSN-----TRLTAWYLSLQAYDFTV--QYR 1481
Query: 877 PGAYNSVADSLSR 889
G N VAD LSR
Sbjct: 1482 AGKTNCVADCLSR 1494
>gi|388856424|emb|CCF49973.1| related to retrotransposon nucleocapsid protein [Ustilago hordei]
Length = 1391
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 179/445 (40%), Gaps = 53/445 (11%)
Query: 454 SAKPPLVPLCSLQHLATPVS-SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT 512
KPP PL +L P S + ++ E LE G ++ S + S + +PK +GG
Sbjct: 459 GGKPPQGPL----YLKGPKEMSELRRYLDENLEKGFIR--PSKSPAQSPVLFIPKKDGGL 512
Query: 513 RPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL 572
R ++ +GLN+ + L L+K +DL AY + I + A
Sbjct: 513 RLCVDYRGLNEITVKNRAPLPLIEEQLFLLRKARIYTKLDLRAAYNLIRIAKGDEWKTAF 572
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEI 632
+ +PFGLA A F S N + + G+ VVVYLDDFL+ +
Sbjct: 573 GTQLGLYEYLVMPFGLANALAHFQSFINDIFQDII--GIYVVVYLDDFLIFSDTEEAHVK 630
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL 692
++ L S L K V +FLG + L + + +K +RT+
Sbjct: 631 HVTEVLTHLRSNRLFAKLSKCEFHTKTV-EFLGYIIK--LTGIEMDPEK-------VRTV 680
Query: 693 LASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPK 751
K W + +S + +L FA+F + R I A + + P+ V P
Sbjct: 681 ---KEWPMPESIHDIQRFLGFANF-------YRRFIAHFAHIAK-------PLTALVKPI 723
Query: 752 LEWWLNALPLSS-PIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEM 810
++ LP + F + +Q F S G + + S ++N+ I+ KE+
Sbjct: 724 EQFKKCELPEEAQQAFHKLIQAFTSA-----GVLQHFNYHLPTRKMSSAKKNYEIHDKEL 778
Query: 811 FAVHQALSLNLPLLQS--SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWR 868
AV L+ +L S +++ +D++ + Y + Q + ++ I L D+
Sbjct: 779 LAVVACLTQWRHMLAGLPSQLVILTDHE-ALKYFKSQ---RRITGRQARWAILLADSDF- 833
Query: 869 IHILAQFIPGAYNSVADSLSRSKSL 893
+ Q+ PG D+L+R +
Sbjct: 834 ---ILQYRPGDKGGEPDALTRRTDM 855
>gi|189527795|ref|XP_001920303.1| PREDICTED: hypothetical protein LOC793061 [Danio rerio]
Length = 1490
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 103/440 (23%), Positives = 178/440 (40%), Gaps = 53/440 (12%)
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQ------FL 525
+ A+ + ML G+++ S + + + LVPK +G R ++ + LN +
Sbjct: 1067 LQEALKEEVDFMLSLGIIE--PSQSEWCHPVVLVPKKDGNIRFCIDFRYLNSVSQFDCYP 1124
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+P+ SLI+ L K Y+ ++DLS+ Y+ +P+ + A + LP
Sbjct: 1125 TPRIDSLIDR------LGKAVYLTTLDLSKGYWQIPLTERARPLTAFRTPWGLFQFRFLP 1178
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL AP F L + V L YLDD ++ + L G
Sbjct: 1179 FGLHGAPATFQRLMDQVLQGLTF----AAAYLDDIIIYSTTWEEHMQHLHEVFQRLQRAG 1234
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARS 705
N K +++ ++LG + + R + + + L + +T RS
Sbjct: 1235 LTANPAKCAIARKEA-EYLGFVIGNGVVRPQIKKIQALEECPLPQT--------RKELRS 1285
Query: 706 LLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
LG F + IP S R ++ + P+ + + AL ++ +
Sbjct: 1286 FLGMAGFYNRFIPN---FSSRAATLTDMVGVRCPNQCQWTEERMAAFKDIQTALTTNTVL 1342
Query: 766 F-PRQVQHFI-STDASDLGWGSQVDSS---------FLS-GLWSREQQNWHINKKEMFAV 813
+ P + FI TDAS+ G G+ + F+S L+ RE + I +KE AV
Sbjct: 1343 YNPDFTKEFIVQTDASERGLGAVLLQGSPGERRPVVFISRKLFPRETRYSTI-EKECLAV 1401
Query: 814 HQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL 872
AL SL LL ++ ++D++ + +L R T + + +L Q +R +
Sbjct: 1402 KWALDSLRYYLLGREFIL-ETDHK-ALQWLERMRDTN-----GRITRWYLAMQPFRFKV- 1453
Query: 873 AQFIPGAYNSVADSLSRSKS 892
+PG N AD LSR S
Sbjct: 1454 -HHVPGKANVTADYLSRCAS 1472
>gi|281201580|gb|EFA75789.1| hypothetical protein PPL_10844 [Polysphondylium pallidum PN500]
Length = 1798
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 114/477 (23%), Positives = 189/477 (39%), Gaps = 92/477 (19%)
Query: 451 IPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNG 510
IP S P VP+ L+ L I + L G +KR S + F S + V K +G
Sbjct: 669 IPKSKLYP-VPIAHLEELKK--------QINDRLNKGWIKR--SRSPFGSPILFVSKPDG 717
Query: 511 GTRPVLNLKGLNQFLS------PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKT 564
G R ++ + LN+ P+ L+N+ R+ +L K +DL Y + I
Sbjct: 718 GWRLCVDYRELNKITVRDDYPLPRINELLNNTRLAYWLSK------LDLLDGYHQIRINE 771
Query: 565 THQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLV- 623
Q A T PFGLA A F L + + L M++ VYLDD L++
Sbjct: 772 GEQYKTAFKTTFGTFEYTVTPFGLAGAGANFQRLMDHIFQ-LEILNMKICVYLDDILIMT 830
Query: 624 NQDPRILEIQGKLAV-SILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQ 682
N D I + + IL + V L K + + +FLG + + +
Sbjct: 831 NSDSLDDHINDLIEIFKILQKHDFKVKLSKCKFARREI-EFLGHVVGRGVIK-------- 881
Query: 683 LTLGNILRTLLA-SKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL 741
L N + ++L K N + RS +G + + RR S + + P L
Sbjct: 882 -PLHNKIESILNWKKPENKNEMRSFIGLVGYY-----------RRFISNVSSIEI--PLL 927
Query: 742 TPINPAVLPKLEWWLNALPLSSPI--FPRQVQHFIS----------TDASDLGWG----- 784
I + W A + I V++ ++ DAS G G
Sbjct: 928 NMIKDK--SEFVWTEEATNAFNQIKKLVEDVKYLVAPNYTIPFHLECDASKYGIGHALYQ 985
Query: 785 -SQVDS------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQT 837
+++D+ SF S + + N+ + +KE+ ++ AL +N L V++ +D++
Sbjct: 986 INKLDNITRDFISFGSRKLTISEINYTVLEKELLSIIHALKVNYYHLIGHEVIINTDHKN 1045
Query: 838 VVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHI-----LAQFIPGAYNSVADSLSR 889
+ +LR Q ++ + S + + W I Q+I G N +AD LSR
Sbjct: 1046 -IKFLREQY---AIGINSRINR-------WLQFIELFNPTLQYIKGETNVIADGLSR 1091
>gi|6322110|ref|NP_012184.1| gag-pol fusion protein [Saccharomyces cerevisiae S288c]
gi|285812571|tpg|DAA08470.1| TPA: gag-pol fusion protein [Saccharomyces cerevisiae S288c]
Length = 1498
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 89/401 (22%), Positives = 164/401 (40%), Gaps = 39/401 (9%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LVPK +G R ++ + LN+ F L + S + ++DL Y +P
Sbjct: 662 VVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIP 721
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV-VYLDDF 620
++ + A T +PFGL AP FA ++A R +R V VYLDD
Sbjct: 722 MEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFAR---YMADTFRD--LRFVNVYLDDI 776
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L+ ++ P + L + IV +K + +FLG + + ++ +
Sbjct: 777 LIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEET-EFLG--YSIGIQKIAPLQH 833
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
K +R KT + A+ LG +++ IP + +I + L
Sbjct: 834 KCAA----IRDFPTPKT--VKQAQRFLGMINYYRRFIP----NCSKIAQPIQLFICDKSQ 883
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------- 790
T + KL+ L P+ P F + + ++TDAS G G+ +VD+
Sbjct: 884 WTEKQDKAIEKLKAALCNSPVLVP-FNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVV 942
Query: 791 -FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+ S Q+N+ + E+ + +AL +L +++D+ +++S + +
Sbjct: 943 GYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPAR 1002
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + L + D+ + LA G N VAD++SR+
Sbjct: 1003 RVQRWLDD----LATYDFTLEYLA----GPKNVVADAISRA 1035
>gi|328867820|gb|EGG16201.1| hypothetical protein DFA_09229 [Dictyostelium fasciculatum]
Length = 1100
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 87/205 (42%), Gaps = 5/205 (2%)
Query: 466 QHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
HL++ ++ M +++ L +G + S + + S + V K +G R ++ + LN+
Sbjct: 417 NHLSSEENNVMFTTVEKGLASGRIA--PSKSPYNSAVLFVRKKDGTLRMCVDFRALNKQT 474
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+F L ++ + K IDL + + IK H A S T +P
Sbjct: 475 VADRFPLPRIDQLIEKIAKAKIFSKIDLKDGFNQIRIKDEHTHKTAFSTPSGHYEYTVIP 534
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL AP AF N A+ V++Y+DD L+ +++ K + L S
Sbjct: 535 FGLRNAPSAFVRAIN--AAFADILDTFVIIYIDDILIFSENENDHYEHIKQVLDRLRSNK 592
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDP 670
N KSS V +FLG + P
Sbjct: 593 LFANKAKSSFLVKEV-EFLGHLITP 616
>gi|146291076|sp|Q7LHG5.2|YI31B_YEAST RecName: Full=Transposon Ty3-I Gag-Pol polyprotein; AltName:
Full=Gag3-Pol3; AltName: Full=Transposon Ty3-2 TYA-TYB
polyprotein; Contains: RecName: Full=Capsid protein;
Short=CA; AltName: Full=p24; Contains: RecName:
Full=Spacer peptide p3; Contains: RecName:
Full=Nucleocapsid protein p11; Short=NC; Contains:
RecName: Full=Ty3 protease; Short=PR; AltName: Full=p16;
Contains: RecName: Full=Spacer peptide J; Contains:
RecName: Full=Reverse transcriptase/ribonuclease H;
Short=RT; Short=RT-RH; AltName: Full=p55; Contains:
RecName: Full=Integrase p52; Short=IN; Contains: RecName:
Full=Integrase p49; Short=IN
Length = 1498
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 89/401 (22%), Positives = 164/401 (40%), Gaps = 39/401 (9%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LVPK +G R ++ + LN+ F L + S + ++DL Y +P
Sbjct: 662 VVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIP 721
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV-VYLDDF 620
++ + A T +PFGL AP FA ++A R +R V VYLDD
Sbjct: 722 MEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFAR---YMADTFRD--LRFVNVYLDDI 776
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L+ ++ P + L + IV +K + +FLG + + ++ +
Sbjct: 777 LIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEET-EFLG--YSIGIQKIAPLQH 833
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
K +R KT + A+ LG +++ IP + +I + L
Sbjct: 834 KCAA----IRDFPTPKT--VKQAQRFLGMINYYRRFIP----NCSKIAQPIQLFICDKSQ 883
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------- 790
T + KL+ L P+ P F + + ++TDAS G G+ +VD+
Sbjct: 884 WTEKQDKAIEKLKAALCNSPVLVP-FNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVV 942
Query: 791 -FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+ S Q+N+ + E+ + +AL +L +++D+ +++S + +
Sbjct: 943 GYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPAR 1002
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + L + D+ + LA G N VAD++SR+
Sbjct: 1003 RVQRWLDD----LATYDFTLEYLA----GPKNVVADAISRA 1035
>gi|340381314|ref|XP_003389166.1| PREDICTED: hypothetical protein LOC100636756 [Amphimedon
queenslandica]
Length = 1451
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 99/466 (21%), Positives = 176/466 (37%), Gaps = 72/466 (15%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN 509
A P+ P +PL + + IQ M + GV++ +T+ + S + +VPK +
Sbjct: 1015 ASPYRQSPYRIPLGKQEEVRK--------EIQRMEDMGVIR--PTTSDWASPMVIVPKKD 1064
Query: 510 GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK---GDYMISIDLSQAYFHVPIKTTH 566
G R ++ + LN KF R+ + + G ++ ++DL++ Y+ +P++ +
Sbjct: 1065 GSIRLCVDYRKLNNV---SKFDAYPIPRVDEMIDRMGSGKFITTLDLNKGYWQIPVEKSS 1121
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
Q A + +PFGL AP F L + V LR Y+DD + ++
Sbjct: 1122 QEKTAFITPMGLYEFVTMPFGLRGAPATFQRLMDRV---LRGTEQFAGAYIDDVAIHSET 1178
Query: 627 PRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG 686
+ + L G N K VL +LG + PE+ ++
Sbjct: 1179 WEGHLQHLREVLEKLQGAGLTANPSKCKFGMTEVL-YLGHKIGGGRVK---PEESKV--- 1231
Query: 687 NILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
++ KT RS LG + + IP + AP L+ +
Sbjct: 1232 KAVKQYPVPKTKT--EVRSFLGLVGYYRKFIP-------------QFSSIAAP-LSDLTK 1275
Query: 747 AVLPKLEW---------WLNALPLSSPIFPR---QVQHFISTDASDLGWG---SQVDSS- 790
+ W L + SP+ + + + TDAS+ G G SQ+
Sbjct: 1276 KNVKTFVWTEECQASFQLLKKMLCGSPVLRTPDIRKEMILQTDASNRGLGAVLSQIGEDG 1335
Query: 791 ------FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRR 844
++S ++ + + +KE AV A+ L +Q+D+ + R
Sbjct: 1336 EEHPIVYISRKLLPREEKYAVVEKECLAVVWAIQTLKVYLYGQHFRIQTDHHALYWLDRM 1395
Query: 845 QGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + LS S L Q+W + ++ G N AD LSR
Sbjct: 1396 KSKNERLSRWS------LYLQEWNFRV--EYRKGTENGNADGLSRG 1433
>gi|427779883|gb|JAA55393.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 703
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 100/212 (47%), Gaps = 10/212 (4%)
Query: 455 AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRP 514
AKP VP + H A VS + ++ M G++ +++ ++S L +V K GG R
Sbjct: 463 AKPVAVPARRVPH-ARQVS--LREELERMEAEGIIVKMEEPAEWVSPLVIVEKKGGGIRV 519
Query: 515 VLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSY 574
++ + +N+ + +++ L I + L + +D ++ ++ +P+ R S
Sbjct: 520 CMDPRHINENIKRERYELPRRDDIEAELAGARWFSKLDANRGFYQIPLDDASPRICTFST 579
Query: 575 NGDVLAMTCLPFGLATAPQAFA-SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
LPFGL++A + F +S+ +L R G+R VY+DD L+ +
Sbjct: 580 PYGRYRFLRLPFGLSSASEVFQREISD---ALDRIPGVR--VYIDDVLVWGTTKAEHDKL 634
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG 665
+ A++ + S G+ +N +K + +FLG
Sbjct: 635 LRAALAAISSAGFTLNAEKCVFGSQEI-KFLG 665
>gi|341875853|gb|EGT31788.1| hypothetical protein CAEBREN_31619 [Caenorhabditis brenneri]
Length = 2112
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 96/466 (20%), Positives = 192/466 (41%), Gaps = 59/466 (12%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V I + +P +P VP+ + + I +L++G + +S T ++S +
Sbjct: 1041 VHIYTTTEVPVRGRPYRVPV--------KFQADLEKQINGLLKSGRI--TESNTPWISPI 1090
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMI---SIDLSQAYFH 559
+V K NG R L+ + LN+ P + L RI + +++ +M S+D++ Y
Sbjct: 1091 VIVKKKNGSLRVCLDFRKLNEVTIPDNYPLP---RIDAIVERVGHMKFFSSLDMANGYLQ 1147
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDD 619
+ + + V A + LPFGL +A F + ++L V++Y+DD
Sbjct: 1148 LRLDDESSYKCGFTTENRVYAYSHLPFGLKSAASYF---QRALKTVLDGMDHEVMLYIDD 1204
Query: 620 FLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWL 677
L++++ D + ++ L V+ +K L ++ FLG +
Sbjct: 1205 VLIISKTYDEHLDTLERVLQR--FRQYNLKVSPKKCDLVRKSIV-FLGHQIN-------- 1253
Query: 678 PEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLG 737
E+ + N+ + N++ R +G F + + +S +Q L
Sbjct: 1254 EENYEPNKSNVSAIVNMPTPSNINELRRFIGMTGFFRRFV---KDYSEIVQPLNKLTHKN 1310
Query: 738 APHL-TPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQV------DS 789
P + T ++ + KL+ L + P L P + ++ + TDAS + G+ + D
Sbjct: 1311 TPFVWTQVHQDAVQKLKTILTSKPVLCYPDYNKEFHCY--TDASGVAQGAVLMQTKPGDE 1368
Query: 790 S------FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLR 843
S + S S+ + E+ + AL + S V++ +D++ ++ ++
Sbjct: 1369 SKMQAIAYASRTLSQPETRRAAIHNELGGIIFALRAFKVYIYGSKVVIHTDHRPLIFLMK 1428
Query: 844 RQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
R L+ + + Q + I I+ I G N++AD LSR
Sbjct: 1429 RHKVNDVLA------RWLVELQQYNIDIV--HIDGKRNTIADCLSR 1466
>gi|317141088|ref|XP_003189333.1| hypothetical protein AOR_1_2970174 [Aspergillus oryzae RIB40]
Length = 1178
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 73/170 (42%), Gaps = 9/170 (5%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP PL +L V + ++ +MLE G ++ S G + + V K +G R +
Sbjct: 592 PPYGPLYNLSQHELQV---LREYLDKMLERGWIRHSTSAAG--APVLFVRKPDGSLRLCV 646
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN ++ L + L Y +DL AY + I+ + A
Sbjct: 647 DYRGLNAVTVKNRYPLPRIDELMDRLVGAKYFTKLDLRDAYHRIRIQKGDEWKTAFRTRY 706
Query: 577 DVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
T +PFGL AP F A ++ + +L VVYLDD L+ +Q
Sbjct: 707 GHFEYTVMPFGLCNAPATFQAYINEAMKGILDD---YCVVYLDDILIYSQ 753
>gi|95020641|ref|YP_605811.1| ORFIII [Banana streak virus]
gi|68566426|gb|AAY99427.1| ORFIII [Banana streak virus Acuminata Yunnan]
Length = 1900
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 100/466 (21%), Positives = 177/466 (37%), Gaps = 76/466 (16%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN-----------GGTR 513
L+H+ + +M HI ++LE V++ L T+ + +V G G R
Sbjct: 1303 LKHVTPAMKESMKKHIDKLLELKVIRPL--TSKHRTTAIIVQSGTEIDPLTGKERRGKER 1360
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I S + K DL + V + + A
Sbjct: 1361 LVFNYKRLNDNTEKDQYSLPGINTIISRIGKSKIYSKFDLKSGFHQVAMDPESIPWTAFW 1420
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + R + VY+DD L+ + +
Sbjct: 1421 AIDGLYEWLVMPFGLKNAPAIFQRKMD---NCFRGTEEYIAVYIDDILIFSDNVSDHRKH 1477
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN------ 687
+ I + G +++ K + A + FLG T+GN
Sbjct: 1478 LSKFLEICKANGLVLSPTKMKIG-AREIDFLG-----------------ATIGNSRIKLQ 1519
Query: 688 --ILRTLLASKTWNLDSARSL---LGYLSFASFVIP-----MGRLHSRRIQRQASLLRLG 737
I++ ++ +K L + L LG L++A IP +G L+S+ G
Sbjct: 1520 PHIIKKIIETKDEELKETKGLRKWLGVLNYARAYIPNLGKTLGPLYSKTSVN-------G 1572
Query: 738 APHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQV-------DSS 790
+ + V+ ++ + LP I P + TD GWG D+
Sbjct: 1573 EKRMNSQDWKVVQLIKNQVQNLP-DLDIPPAGATMVLETDGCMEGWGGVCKWKLHPSDTR 1631
Query: 791 FLSGLWSREQQNWHINKK----EMFAVHQALS-LNLPLLQSSVVMVQSDNQTVVSYLRRQ 845
+ + +H K E+ AV +L + L +++++D+Q +V++ ++Q
Sbjct: 1632 LAEKVCAYASGRYHPIKSTIDAEVHAVINSLEKFKIYYLDKKELIIRTDSQAIVAFYKKQ 1691
Query: 846 GGTK--SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
K L ++ I L I++ + I G N +AD+LSR
Sbjct: 1692 ADHKPSRTRWLMLIDYITGLG----INVKFEHIDGKENVLADTLSR 1733
>gi|198411777|ref|XP_002121094.1| PREDICTED: similar to polyprotein, partial [Ciona intestinalis]
Length = 1073
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 95/460 (20%), Positives = 187/460 (40%), Gaps = 45/460 (9%)
Query: 448 GYAIPFSAKPPLVPLC--SLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLV 505
G A K P C + + +A P+ + + +Q M +GV+ R+ T + S + +V
Sbjct: 273 GEAYNIKLKDNAKPFCLSTPRRIAHPLLKRVQIELQHMEASGVITRITQPTDWCSGMVVV 332
Query: 506 PKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTT 565
PK NG R ++ LN+ + ++ L + L +D + ++ +P+
Sbjct: 333 PKPNGKVRICVDFTSLNEGVCRERLMLPTVDETLAKLSSAKVFTKLDANSGFWQIPLANE 392
Query: 566 HQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
+ LPFG+ +AP+ F + + + + V+ +DD L++
Sbjct: 393 SKPLTTFITPWGRFCFNRLPFGICSAPEHFQRRMSQILEGIPN----VLCKMDDILIIGS 448
Query: 626 DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP---HLDRMWLPEDKQ 682
D + + L G +N K S + +FLG + D H D + ++
Sbjct: 449 DQAKHDQTLNEVLERLNKAGVTLN-DKCEFSKNSI-KFLGHIVDASGVHADPRRISAIEE 506
Query: 683 LTLGNILRTLLASKTWNLDSARSLLGYLS-FASFVIPMGRLHS--RRIQRQASLLRLGAP 739
+ ++ R LG + A F+ + + R++ ++ +L
Sbjct: 507 MKTPT-----------DVSGVRRFLGMANQLAKFIPGFSDMTAPIRKLLQKCNLWTWEKE 555
Query: 740 HLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----QVDS-----S 790
A L K++ L LP+ + P++ + IS D+S G G+ +V+ +
Sbjct: 556 Q----QDAFL-KIKENLCKLPVLAHYDPKR-ETTISADSSSYGLGAVLLQKVNGFNKPIA 609
Query: 791 FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKS 850
F S + +Q + +KE A A L +++D++ +V+ L G K
Sbjct: 610 FASRALNTTEQKYAQIEKEALATTWACEKFKDYLIGLEFEIETDHRPLVALL----GKKC 665
Query: 851 LSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
++ LS + + W + + +++PG S AD+LSRS
Sbjct: 666 INELSPRIQRMRMRLMWFTYKI-KYVPGKLLSTADALSRS 704
>gi|403174361|ref|XP_003333341.2| hypothetical protein PGTG_15125 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170942|gb|EFP88922.2| hypothetical protein PGTG_15125 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1387
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 107/516 (20%), Positives = 210/516 (40%), Gaps = 64/516 (12%)
Query: 426 LRRFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLE 485
+R + A + G ++ G+ F P L TP + +L Q+ +E
Sbjct: 493 VREWERALEKAGLAQEFSDVIRGFKEGFDQGIPNHNLGPATPYFTPPNHQSALLAQDKIE 552
Query: 486 TGVLKRLDSTTGF-------LSRLF---------LVPKGNGGTRPVLNLKG-LNQFLSPK 528
+ K +++ F L R F G+G RP+ +L N L+P
Sbjct: 553 QSMKKEVEAGRMFGPYTHEQLMRKFSFFRTNPLGAAVNGDGSIRPINDLSFPRNDPLTPS 612
Query: 529 KFSLINHF----------RIPSFL--QKGDYMISI-DLSQAYFHVPIKTTHQRFLAL-SY 574
S ++ + F Q G ++++ D +AY +P + +L + +
Sbjct: 613 VNSFVDKLDYATTWDDFENVSKFFRRQTGPLLLALFDWEKAYRQIPTAKSQWAYLMVRDF 672
Query: 575 NGDVLAMTCLPFGLATAPQAFASLSN-WVASLLRSRGMRVVV-YLDDFLLVNQDPRILEI 632
NG +L T + FG +F ++ W +L + V ++DD L V +E+
Sbjct: 673 NGGILIDTRIAFGGVAGCGSFGRPADAWKQLMLHEFDLVTVFRWVDDNLFVKHPDSNVEM 732
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQ-FLGIMWDPHLDRMWLPEDKQLTLGNILRT 691
+A S SLG V + SP Q ++G +W+ + LP+DK+ ++
Sbjct: 733 DHIVARS--ESLG--VKTNSTKYSPFKEEQKYIGFIWNATRKSVRLPDDKKYQRVQQVKE 788
Query: 692 LLA-SKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
L ++ + G L+ S+++P R + + R + + L P+ P+
Sbjct: 789 FLQIGSEFSFKQVEVMAGRLNHVSYLLPQLRCYLNSLYRWMNAWVYRSKDL-PLPPSARV 847
Query: 751 KLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGL-----WSREQ--QNW 803
L+ WL L + ++ + D ++GW +S+ G+ W++ Q ++W
Sbjct: 848 DLQEWLTTL-----LSFKETRMIRDPDPIEIGWMGDASTSYGIGITIGRRWAQFQLTKDW 902
Query: 804 HI--NKKEMFAVHQALSLNLPLL--------QSSVVMVQSDNQTVVSYLRRQGGTKSLSL 853
K A + +++ L L+ + ++V +DN T S + ++ +K ++
Sbjct: 903 DRGPEPKRDIAWLETVAIRLGLIALAQLSVKRGKTIIVWTDNTTTESAILKR-KSKHQAV 961
Query: 854 LSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
E + I L + + I+++ + N VAD+LSR
Sbjct: 962 NDEWKIIQRLLVEMELDIVSRRVSSGDN-VADALSR 996
>gi|125826203|ref|XP_001339254.1| PREDICTED: hypothetical protein LOC798826 [Danio rerio]
Length = 1490
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 103/440 (23%), Positives = 178/440 (40%), Gaps = 53/440 (12%)
Query: 472 VSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQ------FL 525
+ A+ + ML G+++ S + + + LVPK +G R ++ + LN +
Sbjct: 1067 LQEALKEEVDFMLSLGIIE--PSQSEWCHPVVLVPKKDGNIRFCIDFRYLNSVSQFDCYP 1124
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+P+ SLI+ L K Y+ ++DLS+ Y+ +P+ + A + LP
Sbjct: 1125 TPRIDSLIDR------LGKAVYLTTLDLSKGYWQIPLTERARPLTAFRTPWGLFQFRFLP 1178
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL AP F L + V L YLDD ++ + L G
Sbjct: 1179 FGLHGAPATFQRLMDQVLQGLTF----AAAYLDDIIIYSTTWEEHMQHLHEVFQRLQRAG 1234
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARS 705
N K +++ ++LG + + R + + + L + +T RS
Sbjct: 1235 LTANPAKCAIARKEA-EYLGFVIGNGVVRPQIKKIQALEECPLPQT--------RKELRS 1285
Query: 706 LLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
LG F + IP S R ++ + P+ + + AL ++ +
Sbjct: 1286 FLGMAGFYNRFIPN---FSSRAATLTDMVGVRCPNQCQWTEERIAAFKDIQTALTTNTVL 1342
Query: 766 F-PRQVQHFI-STDASDLGWGSQVDSS---------FLS-GLWSREQQNWHINKKEMFAV 813
+ P + FI TDAS+ G G+ + F+S L+ RE + I +KE AV
Sbjct: 1343 YNPDFTKEFIVQTDASERGLGAVLLQGSPGERRPVVFISRKLFPRETRYSTI-EKECLAV 1401
Query: 814 HQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHIL 872
AL SL LL ++ ++D++ + +L R T + + +L Q +R +
Sbjct: 1402 KWALDSLRYYLLGREFIL-ETDHK-ALQWLERMRDTN-----GRITRWYLAMQPFRFKV- 1453
Query: 873 AQFIPGAYNSVADSLSRSKS 892
+PG N AD LSR S
Sbjct: 1454 -HHVPGKANVTADYLSRCAS 1472
>gi|406702754|gb|EKD05684.1| putative retrotransposon nucleocapsid protein [Trichosporon asahii
var. asahii CBS 8904]
Length = 1367
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 103/461 (22%), Positives = 181/461 (39%), Gaps = 59/461 (12%)
Query: 454 SAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTR 513
+ +P P+ L + V + ++++ L+TG ++ S G S + V K +G R
Sbjct: 367 NQQPKFGPIYGLSEVELRV---LDEYLKDNLKTGFIRPSTSPAG--SPILFVKKKDGSLR 421
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
++ + LN+ ++ L L+ Y IDL AY + IK + A
Sbjct: 422 LCVDYRALNKITRKNRYPLPLIQESLDRLKTAKYFTKIDLRAAYNLIRIKPGDEWKTAFR 481
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP +F L N V L V+VYLDD L+ ++
Sbjct: 482 TRYGLYEYLVMPFGLTNAPASFQYLINDV--LRDYLDNFVIVYLDDILIFSKTHEEHVTH 539
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
K + L +K V FLG ++ DK +++ + +
Sbjct: 540 VKQVLKRLEDNSLWAKAEKCEFFQDSV-DFLG----------YIVSDKGISMDP--KKVE 586
Query: 694 ASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKL 752
A W + + + +L FA+F + +S+ LLR G IN K
Sbjct: 587 AIVDWPVPKNVHDIQVFLGFANFYRRFIKSYSKVTSPLTRLLRKG------INFEWTTKE 640
Query: 753 EWWLNALP---LSSPIFPRQVQHF-------ISTDASDLGWGSQVDSS---------FLS 793
+ + L ++PI +QHF + TDASD + S F S
Sbjct: 641 QEAFDDLKKRFTTAPI----LQHFQPDLPLVLETDASDFAIAGVLSHSIDGKLYPIAFYS 696
Query: 794 GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS-VVMVQSDNQTVVSYLRRQGGTKSLS 852
+ + N+ I KEM A+ +A L+ S + V +D++ + + + + +
Sbjct: 697 RKLNNSELNYEIYDKEMLAIVEAFKHWRAYLEGSPNITVYTDHKNLEYFTTSKVLNRRQA 756
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
+E+ L D++I + PG+ D+L+R + L
Sbjct: 757 RWAEL----LAQYDFKI----VYRPGSKMGKPDALTRRQDL 789
>gi|38346992|emb|CAD40278.2| OSJNBb0062H02.17 [Oryza sativa Japonica Group]
gi|38347666|emb|CAE05600.2| OSJNBa0054D14.1 [Oryza sativa Japonica Group]
Length = 1629
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 104/441 (23%), Positives = 169/441 (38%), Gaps = 60/441 (13%)
Query: 470 TPV-SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPK 528
TP+ + + +QEML G+++ S S + LV K +G R ++ + LN
Sbjct: 688 TPIQKNEIESQVQEMLSKGIIQPSSSPF--SSPVLLVKKKDGSWRFCVDYRHLNAITVKN 745
Query: 529 KFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGL 588
K+ L + L + +DL Y + + + A + LPFGL
Sbjct: 746 KYPLPVIDELLDELAGAQWFSKLDLRSGYHQIRMHPDDEHKTAFQTHHGHFEFRVLPFGL 805
Query: 589 ATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIV 648
+AP F + N V + L R V+V++DD L+ ++ K IL V
Sbjct: 806 TSAPATFQGVMNSVLATLLRRC--VLVFVDDILIYSKSLEEHVQHLKTVFQILLKHQLKV 863
Query: 649 NLQKSSLSPAPVLQFLGIMWDPH---LDRMWLPEDKQLTLGNILRTLLASKTWNL-DSAR 704
K S + L +LG + P+ D PE Q+ + W S +
Sbjct: 864 KRTKCSFAQQE-LAYLGHIIQPNGVSTD----PEKIQVI-----------QHWPAPTSVK 907
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHL--TPINPAVLPKLEWWLNALPLS 762
L +L + + R + + +LLR G ++ A + + AL L+
Sbjct: 908 ELRSFLGLSGYYRKFVRNYGILSKPLTNLLRKGQLYIWTAETEDAFQALKQALITALVLA 967
Query: 763 SPIFPRQVQHFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQAL 817
P F Q + TDASD G G+ + +FLS +KE A+ A+
Sbjct: 968 MPDF--QTPFVVETDASDKGIGAVLMQNNHPLAFLSRALGLRHPGLSTYEKESLAIMLAV 1025
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQ--- 874
P LQ +++D+ +SL+ L+E L+ W+ L +
Sbjct: 1026 DHWRPYLQHDEFFIRTDH-------------RSLAFLTEQR----LTTPWQHKALTKLLG 1068
Query: 875 ------FIPGAYNSVADSLSR 889
F G NS AD+LSR
Sbjct: 1069 LRYKIIFKKGIDNSAADALSR 1089
>gi|149248918|ref|XP_001528813.1| hypothetical protein LELG_05791 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146453354|gb|EDK47610.1| hypothetical protein LELG_05791 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1326
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 90/431 (20%), Positives = 174/431 (40%), Gaps = 38/431 (8%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ EML+ G L+ S+ + + FL+PK +G R +++L+ LN+ + + + +
Sbjct: 458 LSEMLKNGQLQY--SSAAYRNPWFLIPKKDGRHRMLIDLRELNKHVELEGGHPQSTDELT 515
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S L + ID+ AYF VP+ T + + +L LP G F+S+
Sbjct: 516 SELSGRLFNTLIDVQNAYFQVPLDPTTNDVTSFNSPLGLLKYAVLPQGYLNLVSEFSSI- 574
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVN-LQKSSLSPA 658
+ +L V+ ++DD + P + ++ L L + ++ L + L
Sbjct: 575 --LQKILSPVAKDVICFIDDIAICG--PTVEDLSESLMKEHLDKVHQVLQLLAHAGLEIN 630
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS------- 711
P + + L PE K + G + + S LG ++
Sbjct: 631 PAKLKVAVEDCEFLGYRITPEGKTIIRGQVDALTNYPRPTTQKKMESFLGLVNYYRQLIV 690
Query: 712 -FASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQV 770
FA P+ L + + LL + + L + P+ P +Q+
Sbjct: 691 GFAELTAPLYDLILKAKEHPKHLLEWDDQTINYFQHII-----RVLTSCPVLQPFNDKQI 745
Query: 771 QHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQALSLN 820
I TDAS WG + ++ SG + ++++ I +KE+F+++ L+
Sbjct: 746 TT-IHTDASTESWGGVLQNTDAHGVTRMVLCYSGKFHGSERHYTIYEKELFSIYLTLNAI 804
Query: 821 LPLL--QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPG 878
PLL ++ + DN+ +V+ L + ++ ++ K + + I+ I G
Sbjct: 805 QPLLVGYKDILYIYCDNKALVTVLDKP--LENSHFVNRTYKWLNYIRSFNYMII--HIDG 860
Query: 879 AYNSVADSLSR 889
N +AD+LSR
Sbjct: 861 KRNVIADALSR 871
>gi|308450734|ref|XP_003088406.1| hypothetical protein CRE_18358 [Caenorhabditis remanei]
gi|308247718|gb|EFO91670.1| hypothetical protein CRE_18358 [Caenorhabditis remanei]
Length = 666
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 47/195 (24%), Positives = 87/195 (44%), Gaps = 14/195 (7%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
I+ M E G+++ +ST+ + S L ++PK NG R V++ + LN + + + N I
Sbjct: 362 EIKFMKENGLIE--ESTSPYTSPLLMIPKANGDIRIVIDYRRLNLITRSRTYIMPNTLDI 419
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
+G D++Q + + + H+ A + V +P GL AP F
Sbjct: 420 TEEASRGKIFSVFDIAQGFHTIRMHEAHKERTAFCSHMGVFQYRYMPMGLKGAPDTFQRA 479
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVN----QDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
+ V +++Y+DD ++V+ Q R LE KL + +G + +KS
Sbjct: 480 MSEVEKQFSG---TMILYVDDLIVVSKTEEQHIRDLEEFFKLMI----KMGLKLKAEKSQ 532
Query: 655 LSPAPVLQFLGIMWD 669
+ + FLG + +
Sbjct: 533 IGRTRI-SFLGFIIE 546
>gi|156836723|ref|XP_001642409.1| Tkp3 protein [Vanderwaltozyma polyspora DSM 70294]
gi|146189619|emb|CAM91758.1| pol protein [Vanderwaltozyma polyspora]
gi|156112929|gb|EDO14551.1| Tkp3 protein [Vanderwaltozyma polyspora DSM 70294]
Length = 1667
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 107/444 (24%), Positives = 186/444 (41%), Gaps = 62/444 (13%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGF-LSRLFLVPKGNGGTRPVLNLKGLNQFL--SPKKFS 531
A++ I + LE+ ++ +D+ LS +F + + R + +L+ +N L +P+
Sbjct: 733 AINQFITQSLESNMISPIDTDEVVALSPVFPIQQSKDKIRIITDLRKVNTHLLYTPRPIP 792
Query: 532 LINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATA 591
I H I S L S+D+ +AY +PI+ L L LP+GLA+A
Sbjct: 793 PIQH--IFSNLANKTIFSSLDIRKAYQQIPIQGDK---LGLITELGSYKFNRLPYGLASA 847
Query: 592 PQAFAS-LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL 650
P + + N + L +S+ V Y DD ++ + K +++L G ++
Sbjct: 848 PYWWGEFIQNILKQLPQSKDTIVSYYYDDLIIASSTIAEHYTTLKHIMALLAENGLSLSY 907
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
+K ++ VL FLG ++ +R+ + ++K+ T+ R +L + + G++
Sbjct: 908 EKIHIAQPKVL-FLG--YEVSHNRLAIDKEKKNTIA---RWVLPE---DKKAIEKFTGFV 958
Query: 711 SFASFVIPMGRLHSRRIQRQASLLRLG----APHLTPINPAVLPKLEWWLNALPLSS--- 763
+F IP Q A + H PI + K + L L S
Sbjct: 959 NFLRNFIPNAS------QLLAPFYSFATNKPSNHEKPILQKAMKKNFQLIKQLILKSLTL 1012
Query: 764 PIFPRQVQHFISTDASDLGWGSQVDS-------------SFLSGLWSREQQNWHINKKEM 810
+F Q I TDAS G S V +F S ++ QQ + ++E+
Sbjct: 1013 KLFDPQAPTIIYTDASLTGAASIVLQPETVNGKTTLYPITFYSLRFTDTQQRYSTVEREL 1072
Query: 811 FAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGG-----TKSLSLLSEVEKIFLLSQ 865
+AV L L+ SS + + +DNQ ++S + + TK L LL+
Sbjct: 1073 WAVLHTLE-KARLVLSSSITIYTDNQGIISMGKTERATHPRLTKYLDLLN------TYRL 1125
Query: 866 DWRIHILAQFIPGAYNSVADSLSR 889
+W+ +I G N VAD LSR
Sbjct: 1126 NWK------YIKGRDNHVADYLSR 1143
>gi|536873|gb|AAA98435.1| POL3 [Saccharomyces cerevisiae]
Length = 1270
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 89/401 (22%), Positives = 164/401 (40%), Gaps = 39/401 (9%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LVPK +G R ++ + LN+ F L + S + ++DL Y +P
Sbjct: 359 VVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIP 418
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV-VYLDDF 620
++ + A T +PFGL AP FA ++A R +R V VYLDD
Sbjct: 419 MEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFA---RYMADTFRD--LRFVNVYLDDI 473
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L+ ++ P + L + IV +K + +FLG + + ++ +
Sbjct: 474 LIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEET-EFLG--YSIGIQKIAPLQH 530
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
K +R KT + A+ LG +++ IP + +I + L
Sbjct: 531 K----CAAIRDFPTPKT--VKQAQRFLGMINYYRRFIP----NCSKIAQPIQLFICDKSQ 580
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------- 790
T + KL+ L P+ P F + + ++TDAS G G+ +VD+
Sbjct: 581 WTEKQDKAIDKLKDALCNSPVLVP-FNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVV 639
Query: 791 -FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+ S Q+N+ + E+ + +AL +L +++D+ +++S + +
Sbjct: 640 GYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPAR 699
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + L + D+ + LA G N VAD++SR+
Sbjct: 700 RVQRWLDD----LATYDFTLEYLA----GPKNVVADAISRA 732
>gi|317159454|ref|XP_003191074.1| hypothetical protein AOR_1_1498024 [Aspergillus oryzae RIB40]
Length = 1605
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 73/170 (42%), Gaps = 9/170 (5%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP PL +L V + ++ +MLE G ++ S G + + V K +G R +
Sbjct: 592 PPYGPLYNLSQHELQV---LREYLDKMLERGWIRHSTSAAG--APVLFVRKPDGSLRLCV 646
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN ++ L + L Y +DL AY + I+ + A
Sbjct: 647 DYRGLNAVTVKNRYPLPRIDELMDRLVGAKYFTKLDLRDAYHRIRIQKGDEWKTAFRTRY 706
Query: 577 DVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
T +PFGL AP F A ++ + +L VVYLDD L+ +Q
Sbjct: 707 GHFEYTVMPFGLCNAPATFQAYINEAMKGILDD---YCVVYLDDILIYSQ 753
>gi|317158449|ref|XP_003190969.1| hypothetical protein AOR_1_910034 [Aspergillus oryzae RIB40]
Length = 1605
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 73/170 (42%), Gaps = 9/170 (5%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP PL +L V + ++ +MLE G ++ S G + + V K +G R +
Sbjct: 592 PPYGPLYNLSQHELQV---LREYLDKMLERGWIRHSTSAAG--APVLFVRKPDGSLRLCV 646
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN ++ L + L Y +DL AY + I+ + A
Sbjct: 647 DYRGLNAVTVKNRYPLPRIDELMDRLVGAKYFTKLDLRDAYHRIRIQKGDEWKTAFRTRY 706
Query: 577 DVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
T +PFGL AP F A ++ + +L VVYLDD L+ +Q
Sbjct: 707 GHFEYTVMPFGLCNAPATFQAYINEAMKGILDD---YCVVYLDDILIYSQ 753
>gi|149241192|ref|XP_001526282.1| hypothetical protein LELG_02840 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450405|gb|EDK44661.1| hypothetical protein LELG_02840 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1097
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 83/391 (21%), Positives = 165/391 (42%), Gaps = 44/391 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEM+ G L D+T + + FL+ K +G R +++L+ LN+ + + ++ +
Sbjct: 660 LQEMIRQGQLVYSDAT--YRNPWFLISKKDGRHRLLIDLRELNKNVELEGGHPLSVDDLT 717
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + ++ +ID+ AYF +P+ + + +L LP G + F S+
Sbjct: 718 TEISGCWFISTIDVQNAYFQIPLDAATSDVTSFNSPLGLLKYAVLPQGYINSVSEFRSI- 776
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS---LS 656
+ +L V+ ++DDF +V P++ E+ L L + + L ++ ++
Sbjct: 777 --LQKILSPVAKDVMCFIDDFAIVG--PKVDELTDSLVREHLDKIVEVFRLLTNAGLKIN 832
Query: 657 PA------PVLQFLGIMWDPHLDRMWLPEDKQLTLGNI---LRTLLASKTWNLDSARSLL 707
PA P FLG H+ P K L G + L L + L+S L+
Sbjct: 833 PAKLKIAVPECDFLGY----HIS----PAGKTLIRGQVDALLNYPLPNTVKQLESFLGLV 884
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
Y + ++ L + + QA H P ++ L P+ P+
Sbjct: 885 NY--YRQLIVGHAELTAPLYNLVNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPL 942
Query: 766 FPRQVQHFISTDASDLGWGSQVDSS----------FLSGLWSREQQNWHINKKEMFAVHQ 815
+ + + TDAS WG + ++ SG + ++N+ I +KE+F++++
Sbjct: 943 NFKDLI-TVHTDASTDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYK 1001
Query: 816 ALSLNLPLL--QSSVVMVQSDNQTVVSYLRR 844
PLL + V+ + DN+ +V + +
Sbjct: 1002 TFDAIHPLLFGFTGVIHLYCDNKALVLVMNK 1032
>gi|312383757|gb|EFR28710.1| hypothetical protein AND_02962 [Anopheles darlingi]
Length = 304
Score = 60.1 bits (144), Expect = 7e-06, Method: Composition-based stats.
Identities = 45/150 (30%), Positives = 69/150 (46%), Gaps = 10/150 (6%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPK------GNGGTRPVLNLKGLNQFLSPKKFSL 532
I EML+ G+++ DST+ + + + +PK GN R V++ + LN P + +
Sbjct: 116 QIDEMLKLGIIQ--DSTSPWNAPVLCIPKKTVDAQGNKRYRIVVDFRALNVITVPFVYPI 173
Query: 533 INHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP 592
I L Y ++DL ++ +PI A S T +P GL +P
Sbjct: 174 PLIPDILDTLGDSKYFSTLDLKSGFYQIPIHERDAPKTAFSTPYGHYEFTRMPMGLKNSP 233
Query: 593 QAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
F L N V L RG+R VVYLDD ++
Sbjct: 234 STFQKLMNKV--LYEIRGVRAVVYLDDIVV 261
>gi|317158528|ref|XP_003190983.1| hypothetical protein AOR_1_1018034 [Aspergillus oryzae RIB40]
Length = 1585
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 73/170 (42%), Gaps = 9/170 (5%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP PL +L V + ++ +MLE G ++ S G + + V K +G R +
Sbjct: 592 PPYGPLYNLSQHELQV---LREYLDKMLERGWIRHSTSAAG--APVLFVRKPDGSLRLCV 646
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN ++ L + L Y +DL AY + I+ + A
Sbjct: 647 DYRGLNAVTVKNRYPLPRIDELMDRLVGAKYFTKLDLRDAYHRIRIQKGDEWKTAFRTRY 706
Query: 577 DVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
T +PFGL AP F A ++ + +L VVYLDD L+ +Q
Sbjct: 707 GHFEYTVMPFGLCNAPATFQAYINEAMKGILDD---YCVVYLDDILIYSQ 753
>gi|319656526|gb|ADV58678.1| P194 [Rice tungro bacilliform virus]
Length = 1674
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 168/393 (42%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++++ T + F+V + R V N K LN + F++ +
Sbjct: 1195 QIKELLDNKLIRKASPTCRHRTAAFVVRNHSEEVAQKPRIVYNYKRLNDNMVTDPFNIPH 1254
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + LQ+ DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1255 KISMINLLQRARIFSKFDLKAGFHHMKLKEDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1314
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + KL + + +G +++ +KS
Sbjct: 1315 FQRFMQESFGDLKF----ALLYIDDILIASSNEQEHIKHLKLFFTRVKEVGCVLSKKKSK 1370
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + ++ SK L ++ LG L+
Sbjct: 1371 MFLKEV-EYLGVE---------IKEGKISLQPHIVEKIKKFDKSKLSTLKGLQAYLGLLN 1420
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PLS P
Sbjct: 1421 YARGYIKDLSKLVGPLYKKTG---KSGQRTFNKEDWNIIFKIEREVDKIKPLSRPEESDY 1477
Query: 770 VQHFISTDASDLGWGSQV----------DSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG+ + D+ ++G S E++ W E+ A+++A
Sbjct: 1478 I--IIETDASEEGWGAVLICKPDKYASKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1535
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1536 LN-KFQIYLDRDFTIRTDCEAIVKGIKTEDYKK 1567
>gi|9628905|ref|NP_043933.1| hypothetical protein [Strawberry vein banding virus]
gi|1360613|emb|CAA65970.1| ORF V [Strawberry vein banding virus]
Length = 708
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 83/384 (21%), Positives = 160/384 (41%), Gaps = 39/384 (10%)
Query: 479 HIQEMLETGVL---KRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
I+E+L+ G++ K S+ F+ R K G R V+N K LN + L N
Sbjct: 297 QIEELLKLGIIRPSKSPHSSPAFMVRNHAEIK-RGKARMVINYKKLNDHTKGDGYLLPNK 355
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
++ + + S D ++ V + + A S +PFGL AP F
Sbjct: 356 EQLLQRIGGKTFYSSFDCKSGFWQVRLAPETIQLTAFSCPQGHYEWLVMPFGLKQAPAIF 415
Query: 596 -----ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL 650
SLSN VY+DD ++ ++ K+ ++ +LG +++
Sbjct: 416 QRHMDESLSNMYPQF-------CAVYVDDIIVFSKTEEEHLGHVKIVLNRCKALGIVLSK 468
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL 710
+K+ L + FLG++ ++R L + L + + + ++ + LG L
Sbjct: 469 KKAQLCKTTI-NFLGLV----IERGNLKVQSHIGLHLVA---FPDQLSDRNALQRFLGLL 520
Query: 711 SFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQ 769
++ S P ++ + R Q L + T + + K++ + LP L +P +
Sbjct: 521 NYISAYFP--KIANLRSPLQVKLKKEITWSWTEKDTETVRKIKSLVKTLPDLYNP--SPE 576
Query: 770 VQHFISTDASDLGWGS----------QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSL 819
+ I DASD WG+ +V + SG + ++N+H N+KE+ ++ +A+
Sbjct: 577 DKPIIECDASDDHWGAILKAKLPEGKEVICRYASGTFKPAEKNYHSNEKEILSIIKAIKA 636
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLR 843
+ +V++DN ++R
Sbjct: 637 FRAYILPYKFLVRTDNTNAAYFVR 660
>gi|189009867|ref|YP_001931961.1| replicase [Lamium leaf distortion virus]
gi|172041757|gb|ACB69767.1| replicase [Lamium leaf distortion virus]
Length = 696
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 97/429 (22%), Positives = 170/429 (39%), Gaps = 43/429 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVP----KGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
I+E+LE ++ + S + S FLV + G R V+N K +N +L N
Sbjct: 284 IKELLELKII--IPSKSPHQSPAFLVENEAERRRGKKRMVVNYKAINTATIGDAHNLPNK 341
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ + ++ S D ++ V + Q A + +PFGL AP F
Sbjct: 342 DELLTLIRGKSIFSSFDCKSGFWQVLLDEDSQLLTAFTCPQGHYQWIVVPFGLKQAPSIF 401
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
N + R VY+DD L+ + + + + LG I++ +K+ L
Sbjct: 402 QRHMN---NAFRDFASYCCVYVDDILVFSNNIKDHYAHVAQVLRKCAELGIILSKKKAQL 458
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
+ FLG+ D R P++ L +I + +K + + LG L++AS
Sbjct: 459 FKCRI-NFLGLDIDEGTHR---PQNH--ILEHIHK--FPNKIEDKKQLQRFLGILTYASD 510
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFI 774
IP +L S R Q L + + + +++ L P L P ++ I
Sbjct: 511 YIP--QLASMRAPLQEKLKEDVPWNWKHSDTEYVEEIKKSLTDFPKLHHPATDEKL--II 566
Query: 775 STDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ 825
DAS WG + + + SG + + + N+H N+KE+ AV + ++ L
Sbjct: 567 ECDASGKYWGGILKAIHQSEERICRYTSGSFKKAELNYHSNEKEILAVIRVIAKFTIYLT 626
Query: 826 SSVVMVQSDNQTVVSYLRR--QGGTKSLSLLSEVEKIFLLSQDW--RIHILAQFIPGAYN 881
++++DN+ ++ +G K L+ Q W R + I G N
Sbjct: 627 PLEFLIRTDNKNFTFFMNTNVKGDYKQGRLVR--------WQQWLSRYSFKVEHITGVKN 678
Query: 882 SVADSLSRS 890
AD L+R
Sbjct: 679 IFADFLTRE 687
>gi|328868140|gb|EGG16520.1| hypothetical protein DFA_09058 [Dictyostelium fasciculatum]
Length = 1302
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 87/205 (42%), Gaps = 5/205 (2%)
Query: 466 QHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
HL++ ++ M +++ L +G + S + + S + V K +G R ++ + LN+
Sbjct: 383 NHLSSEENNVMFTTVEKGLASGRIA--PSKSPYNSAVLFVRKKDGTLRMCVDFRALNKQT 440
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+F L ++ + K IDL + + IK H A S T +P
Sbjct: 441 VADRFPLPRIDQLIEKIAKAKIFSKIDLKDGFNQIRIKDEHTHKTAFSTPSGHYEYTVIP 500
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL AP AF N A+ V++Y+DD L+ +++ K + L S
Sbjct: 501 FGLRNAPSAFVRAIN--AAFADILDTFVIIYIDDILIFSENENDHYEHIKQVLDRLRSNK 558
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDP 670
N KSS V +FLG + P
Sbjct: 559 LFANKAKSSFLVKEV-EFLGHLITP 582
>gi|147854459|emb|CAN78588.1| hypothetical protein VITISV_043911 [Vitis vinifera]
Length = 2232
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 107/432 (24%), Positives = 173/432 (40%), Gaps = 64/432 (14%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+QEMLE G+++ S + F S + LV K +GG R ++ + LN+ P +F + +
Sbjct: 1259 VQEMLEAGIVR--PSLSPFSSPVLLVKKKDGGWRFCIDYRALNKVTVPDRFPIPVIDELL 1316
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L +DL Y + ++ A + +PFGL AP F SL
Sbjct: 1317 DKLHGATIFSKLDLKSGYHQIRVRQQDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQSLM 1376
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N + V+V+ D L+ ++D + + +SIL + VN K L
Sbjct: 1377 NRI--FWPHLWKFVLVFFYDILVYSKDLKEHCDHLQTVLSILANHQLHVN-GKKCLFAKL 1433
Query: 660 VLQFL-------GIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLG 708
L++L G+ DP+ + A W +L R LG
Sbjct: 1434 QLEYLGHLVSAKGVAADPN-------------------KISAMVEWPTPKSLKELRGFLG 1474
Query: 709 YLSFA-SFVIPMGRLH---SRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSP 764
+ FV G + ++ +++ A L A KL+ + +P+ +
Sbjct: 1475 LTGYYRRFVEGYGAISWPLTQELKKDAFNWNLEA-------EVAFQKLKTTMTTIPVLA- 1526
Query: 765 IFPRQVQHFI-STDASDLGWGSQVDSS------FLSGLWSREQQNWHINKKEMFAVHQAL 817
P Q FI DAS G G+ + S F L +RE+Q I ++E+ A+ A+
Sbjct: 1527 -LPNFSQLFIVEMDASGYGLGTVLMQSHRPVAYFSQVLTARERQK-SIYERELMAIVLAV 1584
Query: 818 SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIP 877
L +V++D Q+ + +L Q S V K+F D+ I QF P
Sbjct: 1585 QKWRHYLLGRHFIVRTD-QSSLKFLLEQRIVNE-SYQKWVAKLF--GYDFEI----QFRP 1636
Query: 878 GAYNSVADSLSR 889
G N AD+LSR
Sbjct: 1637 GXENKAADALSR 1648
>gi|388852905|emb|CCF53353.1| uncharacterized protein [Ustilago hordei]
Length = 1005
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 95/402 (23%), Positives = 166/402 (41%), Gaps = 47/402 (11%)
Query: 517 NLKGLNQFLSPKKFSLINHF---RIPSFLQK--GDYMISIDLSQAYFHVPIKTTHQRFLA 571
L +N+ ++P F++I + +I F+Q+ G ++ DL+ A+ HV R L+
Sbjct: 242 QLLSVNEGITPH-FTMIRYASLAKILDFVQEHLGCHLWKSDLTDAFRHVITTLADARLLS 300
Query: 572 LSYNGDVLAMTCLPFGLATAP---QAFASLSNWVASLLRSRGMRVVVYLDDF---LLVNQ 625
S++G T L F ++P FA +W+ L + G V YLDDF + +
Sbjct: 301 FSFDGHFFMETGLTFRGHSSPWIFNLFAEALHWIVQL--AMGHPVDHYLDDFFGAVPAST 358
Query: 626 DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTL 685
DP Q +++ + + + L+ LGI D + + + + +
Sbjct: 359 DPG----QLLHVLALACLALGLQLALQKTFWDTTKLEILGIQIDSVQQSVSITSEWCICI 414
Query: 686 GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN 745
+ LL + +L + + L F S V+P G+ ++ + H P++
Sbjct: 415 LEAIDNLLHQCSAHLLDWQRVASLLQFVSQVVPHGKAFLHQLYDA-----IKTAHRCPLS 469
Query: 746 ------PAVLPKLEWWLNALPL--------SSPIFPRQVQHFISTDASDLGWGSQVDSSF 791
PA L +L WW + L SP+F R I TD S G+G+ +
Sbjct: 470 LWCVSRPAAL-ELRWWRSTLQAWLGHSLLQPSPLFIRH----IWTDVSKRGFGAHLGPMH 524
Query: 792 L-SGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQG 846
+SRE + H +K E+ AV +AL LPL ++V + T V + G
Sbjct: 525 APEAAFSREVPHCHWSKDICFLEVLAVLEALCTFLPLWSGPHLVVLHVDNTNVKFSLCNG 584
Query: 847 GTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
+ + + +IF + W + + I N +AD LS
Sbjct: 585 RSHDPLTQTLLREIFGICFRWHVTLQPVHIALEDNCLADLLS 626
>gi|48686545|emb|CAD59232.1| polyprotein [Cacao swollen shoot virus]
Length = 1839
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 91/455 (20%), Positives = 182/455 (40%), Gaps = 54/455 (11%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
++HL + HI+ +L+ GV++ S + + F+V G +G R
Sbjct: 1291 IKHLTPAMEKQFQKHIKALLDIGVIR--PSKSKHRTTAFIVESGTVIDPVTKKTIHGKER 1348
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1349 LVFNYKRLNDNTEKDQYSLPGIQTILKRVGNKKIFSKFDLKSGFHQVAMAEESIPWTAFW 1408
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + + VY+DD L+ +++
Sbjct: 1409 VPQGLYEWLVMPFGLKNAPAVFQRKMD---QCFKGTEEFIAVYIDDILVFSENMAEHTKH 1465
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
+ + I G +++ K L+ + +FLG + + + + ++++ ++
Sbjct: 1466 IGIMLKICQENGLVLSPSKICLAQREI-EFLGTV---------ISQGQMKLQAHVIKKIV 1515
Query: 694 ASKTWNLDSA---RSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAVL 749
L++ RS LG L++A IP +GR S + + G + ++
Sbjct: 1516 NKANMELETTKGLRSFLGLLNYARIYIPNLGRKLSPLYAKTSP---TGEKRFNRQDWHLI 1572
Query: 750 PKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWG-------SQVDSSFLSGLWSREQQ 801
+++ + LP L+ P P + I +D GWG ++ DS + +
Sbjct: 1573 KEIKDMVQKLPDLAIP--PARCCIIIESDGCMEGWGAVCKWKLAKEDSRTTEKICAYASG 1630
Query: 802 NWHINKK----EMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK--SLSLL 854
+ + K E++A+ +AL S + L ++V++D Q +V++ + K + +
Sbjct: 1631 KFGVVKSTIDAEIYALIKALESFKIFYLDKKHLVVRTDCQAIVTFYNKTSTHKPSRIRWI 1690
Query: 855 SEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ + I L + + + I G N +AD+LSR
Sbjct: 1691 TFSDYITGLG----VQVTIEHIDGKENQLADTLSR 1721
>gi|189242076|ref|XP_001808495.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 1475
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 38/426 (8%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L
Sbjct: 638 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLPRVDEHL 695
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L+ + ++DL+ YF +P+ T A +PFGLA AP F
Sbjct: 696 DKLKGAKFFTTLDLASGYFQIPMATESIPKTAFVTPDGHCEFVRMPFGLANAPAVFQRAM 755
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N V L+ + Y+DD L+ ++D + +L G + L K +
Sbjct: 756 NKVLGPLQFQT--AFCYIDDLLIPSKDFETGLNNLQTVFQLLRQFGLTLKLSKYCFFGSQ 813
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASFVIP 718
+ ++LG + + K +T K ++ R LG F +V
Sbjct: 814 I-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKYVKD 864
Query: 719 MGRLHSRRIQRQASLLRLGAPH-LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTD 777
+ + SLL+ G+ L + L+ L + P+ + I+ + + + TD
Sbjct: 865 YATIAN----SLTSLLKNGSAFVLEEVQERAFQTLKDILTSRPVLA-IYDAEAETELHTD 919
Query: 778 ASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS 827
AS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 920 ASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYLLGL 979
Query: 828 VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSL 887
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S D+L
Sbjct: 980 QFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHVDAL 1031
Query: 888 SRSKSL 893
SR+ L
Sbjct: 1032 SRNTVL 1037
>gi|211925534|dbj|BAG81990.1| gag-pol polyprotein [Clonorchis sinensis]
Length = 450
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 87/375 (23%), Positives = 153/375 (40%), Gaps = 41/375 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRI 538
+ MLE G+++ S++ + S L +VPK + G RP + LN P ++ + +
Sbjct: 83 FEHMLELGIIR--TSSSHWSSPLHMVPKKSKGDWRPCGDYCSLNNATIPDRYPIPHIHDF 140
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L + +DL +AY+H+P+ A++ + T +PFGL A Q F
Sbjct: 141 ASTLCHTNIFSKLDLVRAYYHIPVAPDDIPKTAITTPFGLFEFTRMPFGLRNAAQTFQRF 200
Query: 599 SNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V RG+ V YLDD L+ + P + L + +N+ K L
Sbjct: 201 MDEVL-----RGLPFVYAYLDDVLIASTSPTEHAAHLRAVFERLSTYSIRLNIDK-CLFG 254
Query: 658 APVLQFLGIMWDPHLDRMWLP--EDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASF 715
L FLG H+D + D+ L L + L R +G +++
Sbjct: 255 VTSLDFLG----HHIDSTGISPLPDRILALESF------PIPTTLTQLRRFIGIINYYRR 304
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPH----LTPINPAVLPKLEWWL-NALPLSSPIFPRQV 770
IP H I + + L LG+ L P+ A + + + +A LS
Sbjct: 305 FIP----HCADILQPLTDL-LGSKEKSVTLPPVAIAAFERAKQAIAHATKLSFLDTHEST 359
Query: 771 QHFISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNL 821
+ ++TDAS+ G+ + +F S Q + +E+ A++ A+
Sbjct: 360 KLILTTDASNAAVGAVLHQVVNNASQPLAFFSQKMQAAQTRYSTFGRELLAIYLAIRHFR 419
Query: 822 PLLQSSVVMVQSDNQ 836
LL+ +Q+D++
Sbjct: 420 HLLEGRSFTIQTDHK 434
>gi|173088|gb|AAA35184.1| has homology to retroviral pol genes; ORF2 TYB3-2 (5' end of coding
region not precisely determined), partial [Saccharomyces
cerevisiae]
Length = 1221
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 89/401 (22%), Positives = 164/401 (40%), Gaps = 39/401 (9%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ LVPK +G R ++ + LN+ F L + S + ++DL Y +P
Sbjct: 385 VVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIP 444
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVV-VYLDDF 620
++ + A T +PFGL AP FA ++A R +R V VYLDD
Sbjct: 445 MEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFA---RYMADTFRD--LRFVNVYLDDI 499
Query: 621 LLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPED 680
L+ ++ P + L + IV +K + +FLG + + ++ +
Sbjct: 500 LIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEET-EFLG--YSIGIQKIAPLQH 556
Query: 681 KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPH 740
K +R KT + A+ LG +++ IP + +I + L
Sbjct: 557 K----CAAIRDFPTPKT--VKQAQRFLGMINYYRRFIP----NCSKIAQPIQLFICDKSQ 606
Query: 741 LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS---QVDSS------- 790
T + KL+ L P+ P F + + ++TDAS G G+ +VD+
Sbjct: 607 WTEKQDKAIEKLKAALCNSPVLVP-FNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVV 665
Query: 791 -FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
+ S Q+N+ + E+ + +AL +L +++D+ +++S + +
Sbjct: 666 GYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPAR 725
Query: 850 SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
+ + L + D+ + LA G N VAD++SR+
Sbjct: 726 RVQRWLDD----LATYDFTLEYLA----GPKNVVADAISRA 758
>gi|328866329|gb|EGG14714.1| hypothetical protein DFA_10972 [Dictyostelium fasciculatum]
Length = 1344
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 87/205 (42%), Gaps = 5/205 (2%)
Query: 466 QHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
HL++ ++ M +++ L +G + S + + S + V K +G R ++ + LN+
Sbjct: 425 NHLSSEENNVMFTTVEKGLASGRIA--PSKSPYNSAVLFVRKKDGTLRMCVDFRALNKQT 482
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+F L ++ + K IDL + + IK H A S T +P
Sbjct: 483 VADRFPLPRIDQLIEKIAKAKIFSKIDLKDGFNQIRIKDEHTHKTAFSTPSGHYEYTVIP 542
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL AP AF N A+ V++Y+DD L+ +++ K + L S
Sbjct: 543 FGLRNAPSAFVRAIN--AAFADILDTFVIIYIDDILIFSENENDHYEHIKQVLDRLRSNK 600
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDP 670
N KSS V +FLG + P
Sbjct: 601 LFANKAKSSFLVKEV-EFLGHLITP 624
>gi|329351122|gb|AEB91354.1| polyprotein, partial [Verticillium dahliae VdLs.17]
Length = 1125
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 100/440 (22%), Positives = 169/440 (38%), Gaps = 60/440 (13%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
+ +++E L G ++ S G+ + VPK NG R ++ + LN + L
Sbjct: 242 DTLDEYLKENLRKGYIRPSTSPAGYP--ILFVPKKNGKERLCVDYRQLNDITIKNCYPLP 299
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ L ++ ++DL AY + IK + A +PFGL AP
Sbjct: 300 LISELRDALAGANWFTALDLKGAYNLIRIKDGEEWKTAFRTRRGHYEYLVMPFGLTNAPA 359
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F ++ N V L + VVVYLDD L+ ++ + ++ L +V +K+
Sbjct: 360 TFQNMINDV--LREFLDVFVVVYLDDILIFSKTMEEHKGHVHQVLTRLHQHELLVEPEKA 417
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGY 709
V FLG P RM E ++ A + W N+ R+ LG+
Sbjct: 418 KFHTQEV-DFLGYTITPGEIRM---EKSKVA---------AIREWPTPKNVKDVRAFLGF 464
Query: 710 LSFASFVI--------PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPL 761
++F + P+ L + I+ + + A I AVL + L +
Sbjct: 465 VNFYRRFLKGYSKTANPLTSLTVKEIEFAWNEPQEKA--FRQIIDAVLSE-----PVLRM 517
Query: 762 SSPIFPRQVQHFISTDASDLGWGSQVDS----------SFLSGLWSREQQNWHINKKEMF 811
P P +V+ TDASD G Q+ +F S + N+ I+ KE+
Sbjct: 518 IDPEKPMEVE----TDASDFAIGGQLGQRDDQGRLHPVAFFSKKLHGPELNYQIHDKELM 573
Query: 812 AVHQALSLNLPLLQSS--VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRI 869
A+ +A L + V V +D++ + + + K +E F + +R
Sbjct: 574 AIIEAFKEWRTYLSGARHEVKVYTDHKNLAHFTTNKDLNKRQIRWAEFLSEFNFTIIYR- 632
Query: 870 HILAQFIPGAYNSVADSLSR 889
G+ N AD LSR
Sbjct: 633 -------KGSENGRADILSR 645
>gi|312381152|gb|EFR26964.1| hypothetical protein AND_06612 [Anopheles darlingi]
Length = 1053
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 76/178 (42%), Gaps = 5/178 (2%)
Query: 494 STTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
S++ + S L +V K +G RP + + LN P ++ + S L IDL
Sbjct: 211 SSSNWASPLHMVLKSDGSWRPCGDYRSLNAQTVPDRYPVPYLQDFTSMLHGAAVFSKIDL 270
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRV 613
+AY VPI + A++ + T + FGL A Q F L + V L V
Sbjct: 271 KKAYHQVPISPSDVPKTAITTPFGLFEFTKMTFGLRNAAQTFQRLIDEVLQGLEY----V 326
Query: 614 VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPH 671
VY+DD ++ ++ +I + L +N K+ L LQFLG + D H
Sbjct: 327 FVYIDDIIVASKTTEEHQIHLRTVFQRLQEHHLQINASKTELGREE-LQFLGHLVDKH 383
>gi|19919894|ref|NP_612577.1| Enzymatic polyprotein [Contains: Aspartic protease; Endonuclease;
Reverse transcriptase] [Carnation etched ring virus]
gi|130593|sp|P05400.1|POL_CERV RecName: Full=Enzymatic polyprotein; Includes: RecName:
Full=Aspartic protease; Includes: RecName:
Full=Endonuclease; Includes: RecName: Full=Reverse
transcriptase
gi|58863|emb|CAA28360.1| unnamed protein product [Carnation etched ring virus]
gi|225356|prf||1301227E ORF 5
Length = 659
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 90/390 (23%), Positives = 154/390 (39%), Gaps = 40/390 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVP----KGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
I+E+LE V+K ST +S FLV + G R V+N K +N+ +L N
Sbjct: 248 IKELLELKVIKPSKST--HMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAHNLPNK 305
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ + ++ S D + V + Q A + +PFGL AP F
Sbjct: 306 DELLTLVRGKKIYSSFDCKSGLWQVLLDKESQLLTAFTCPQGHYQWNVVPFGLKQAPSIF 365
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAV-SILGSLGWIVNLQKSS 654
+ S VY+DD L+ + R L + LG I++ +K+
Sbjct: 366 PK--TYANSHSNQYSKYCCVYVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQ 423
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTL--LASKTWNLDSARSLLGYLSF 712
L + FLG+ D + +IL + + + + LG L++
Sbjct: 424 LFKEKI-NFLGLEID---------QGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTY 473
Query: 713 ASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQH 772
AS IP +L S R Q+ L + + K++ L + P P +
Sbjct: 474 ASDYIP--KLASIRKPLQSKLKEDSTWTWNDTDSQYMAKIKKNLKSFPKLYHPEPND-KL 530
Query: 773 FISTDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPL 823
I TDAS+ WG + + + SG + ++N+H N+KE+ AV + +
Sbjct: 531 VIETDASEEFWGGILKAIHNSHEYICRYASGSFKAAERNYHSNEKELLAVIRVIKKFSIY 590
Query: 824 LQSSVVMVQSDNQTVVSYL-------RRQG 846
L S ++++DN+ ++ R+QG
Sbjct: 591 LTPSRFLIRTDNKNFTHFVNINLKGDRKQG 620
>gi|317146488|ref|XP_003189815.1| hypothetical protein AOR_1_1900144 [Aspergillus oryzae RIB40]
Length = 1605
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 73/170 (42%), Gaps = 9/170 (5%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP PL +L V + ++ +MLE G ++ S G + + V K +G R +
Sbjct: 592 PPYGPLYNLSQHELQV---LREYLDKMLERGWIRHSTSAAG--APVLFVRKPDGSLRLCV 646
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN ++ L + L Y +DL AY + I+ + A
Sbjct: 647 DYRGLNAVTVKNRYPLPRIDELMDRLVGAKYFTKLDLRDAYHRIRIQKGDEWKTAFRTRY 706
Query: 577 DVLAMTCLPFGLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
T +PFGL AP F A ++ + +L VVYLDD L+ +Q
Sbjct: 707 GHFEYTMMPFGLCNAPATFQAYINEAMKGILDD---YCVVYLDDILIYSQ 753
>gi|432962520|ref|XP_004086710.1| PREDICTED: zinc finger protein 407-like [Oryzias latipes]
Length = 1971
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 77/169 (45%), Gaps = 4/169 (2%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L +L+ P AM +IQE L + ++ S G + F V K + RP ++ +
Sbjct: 1567 LPKGRLFNLSGPEKVAMESYIQEALSSEHIRPSSSPVGAV--FFFVEKKDKTLRPCIDYR 1624
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN K+SL + +++ +DL AY+ V +K ++ A +
Sbjct: 1625 ELNLITVKDKYSLPLISSVFVSIKEARIFSKLDLRNAYYLVRVKEGNEWKTAFNTPLGHF 1684
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPR 628
+PFGL AP F L N V +R V VYLDD L+ +++P
Sbjct: 1685 EYLVMPFGLTNAPAVFQRLVNDVLRDFLNRF--VFVYLDDILIYSKNPE 1731
>gi|384495032|gb|EIE85523.1| hypothetical protein RO3G_10233 [Rhizopus delemar RA 99-880]
Length = 1068
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 49/219 (22%), Positives = 93/219 (42%), Gaps = 27/219 (12%)
Query: 428 RFVDAWIRLGAPAPLVRIVSGYAIPFSAKPPL---------VPLCSLQHLATPVSS---- 474
+F I +G P LV ++ Y FS L +PL S ATP+ S
Sbjct: 800 KFDAENITMGVPDELVEVIERYKNCFSEVSGLGRVKNYVMDIPLVSG---ATPIRSKPFR 856
Query: 475 -------AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSP 527
A+ +++E+L+ ++K S + S F +PK +G R V++ + LN+ +
Sbjct: 857 MTWQEEEALDAYLEELLDLDIIKP--SNGLWTSPCFFIPKKDGTLRLVIDYRRLNKMIKQ 914
Query: 528 KKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFG 587
+ L + + + ++D + Y +P+ H + LPFG
Sbjct: 915 DAYPLPHIDELLDAVGGATVFSTLDCTSGYHQLPLNPEHAERTGFVTKKGTFSFNVLPFG 974
Query: 588 LATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+ T + + N V S + G ++++LDD L+ +++
Sbjct: 975 ITTGCSQYQRMMNSVLS--KYVGDFILIFLDDILVYSKN 1011
>gi|270017029|gb|EFA13475.1| hypothetical protein TcasGA2_TC012972 [Tribolium castaneum]
Length = 1293
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 38/426 (8%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L
Sbjct: 638 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLPRVDEHL 695
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L+ + ++DL+ YF +P+ T A +PFGLA AP F
Sbjct: 696 DKLKGAKFFTTLDLASGYFQIPMATESIPKTAFVTPDGHCEFVRMPFGLANAPAVFQRAM 755
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N V L+ + Y+DD L+ ++D + +L G + L K +
Sbjct: 756 NKVLGPLQFQT--AFCYIDDLLIPSKDFETGLNNLQTVFQLLRQFGLTLKLSKYCFFGSQ 813
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASFVIP 718
+ ++LG + + K +T K ++ R LG F +V
Sbjct: 814 I-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKYVKD 864
Query: 719 MGRLHSRRIQRQASLLRLGAPH-LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTD 777
+ + SLL+ G+ L + L+ L + P+ + I+ + + + TD
Sbjct: 865 YATIAN----SLTSLLKNGSAFVLEEVQERAFQTLKDILTSRPVLA-IYDAEAETELHTD 919
Query: 778 ASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS 827
AS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 920 ASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYLLGL 979
Query: 828 VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSL 887
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S D+L
Sbjct: 980 QFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHVDAL 1031
Query: 888 SRSKSL 893
SR+ L
Sbjct: 1032 SRNTVL 1037
>gi|15029035|emb|CAC41321.2| unnamed protein product [Rice tungro bacilliform virus]
Length = 1674
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 168/393 (42%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++++ T + F+V + R V N K LN + F++ +
Sbjct: 1195 QIKELLDNKLIRKASPTCRHRTAAFVVRNHSEEVAQKPRIVYNYKRLNDNMVTDPFNIPH 1254
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + LQ+ DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1255 KISMINLLQRARIFSKFDLKAGFHHMKLKEDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1314
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + KL + + +G +++ +KS
Sbjct: 1315 FQRFMQESFGDLKF----ALLYIDDILIASSNEQEHIKHLKLFFTRVKEVGCVLSKKKSK 1370
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + ++ SK L ++ LG L+
Sbjct: 1371 MFLKEV-EYLGVE---------IKEGKISLQPHIVEKIKKFDKSKLSTLKGLQAYLGLLN 1420
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PLS P
Sbjct: 1421 YARGYIKDLSKLVGPLYKKTG---KSGQRTFNKEDWNIIFKIEREVDKIKPLSRPEESDY 1477
Query: 770 VQHFISTDASDLGWGSQV----------DSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG+ + D+ ++G S E++ W E+ A+++A
Sbjct: 1478 I--IIETDASEEGWGAVLICKPDKYASKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1535
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1536 LN-KFQIYLDRDFTIRTDCEAIVKGIKTEDYKK 1567
>gi|328875770|gb|EGG24134.1| hypothetical protein DFA_06276 [Dictyostelium fasciculatum]
Length = 1320
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 87/205 (42%), Gaps = 5/205 (2%)
Query: 466 QHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
HL++ ++ M +++ L +G + S + + S + V K +G R ++ + LN+
Sbjct: 383 NHLSSEENNVMFTTVEKGLASGRIA--PSKSPYNSAVLFVRKKDGTLRMCVDFRALNKQT 440
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+F L ++ + K IDL + + IK H A S T +P
Sbjct: 441 VADRFPLPRIDQLIEKIAKAKIFSKIDLKDGFNQIRIKDEHTHKTAFSTPSGHYEYTVIP 500
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL AP AF N A+ V++Y+DD L+ +++ K + L S
Sbjct: 501 FGLRNAPSAFVRAIN--AAFADILDTFVIIYIDDILIFSENENDHYEHIKQVLDRLRSNK 558
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDP 670
N KSS V +FLG + P
Sbjct: 559 LFANKAKSSFLVKEV-EFLGHLITP 582
>gi|156335399|ref|XP_001619572.1| hypothetical protein NEMVEDRAFT_v1g224056 [Nematostella vectensis]
gi|156203052|gb|EDO27472.1| predicted protein [Nematostella vectensis]
Length = 213
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 23/132 (17%)
Query: 541 FLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP---FGLATAPQAFAS 597
L++GD + IDL+ A + T + AL LA++ P FGLA+AP+ F
Sbjct: 10 LLRRGDLLNKIDLNNACLTISNFETEPKVPALQVEKPTLAVSSPPPPLFGLASAPRVFTK 69
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V + LR RG+R+++YLDD L I+ +LA+ KS L P
Sbjct: 70 ILKPVVAHLRKRGIRLIIYLDDIL-------IMSASKELAL-------------KSILCP 109
Query: 658 APVLQFLGIMWD 669
L+FLG + +
Sbjct: 110 TRELKFLGKVGN 121
>gi|21070179|gb|AAM34208.1|AF503912_1 polyprotein [Danio rerio]
Length = 2237
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 87/189 (46%), Gaps = 6/189 (3%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLF-LVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
I+E+ VL+R + +G+ + + ++ K G R V +L+ +N+ + N + I
Sbjct: 1069 IKELEAAEVLRR--TVSGWNTPILPVLKKTTGKYRMVHDLRLINEKVLTATLPTPNPYTI 1126
Query: 539 PSFLQ-KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF-A 596
S L K + IDL+ A+F +P+ Q A SY G LP G +P F
Sbjct: 1127 MSKLTPKHSHFTCIDLANAFFCMPLAEQCQGIFAFSYQGAQYTYNRLPQGFILSPGLFNQ 1186
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
+L + S G V+ Y+DD LL + + +++L + G V+ +K +S
Sbjct: 1187 ALRELLDSCTLHEGTIVIQYVDDLLLAAHSNEVCLQDTRKVLTLLSTAGLKVSKEKIQIS 1246
Query: 657 PAPVLQFLG 665
A V FLG
Sbjct: 1247 RATV-HFLG 1254
>gi|77548510|gb|ABA91307.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
Japonica Group]
Length = 1284
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 85/188 (45%), Gaps = 19/188 (10%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR 501
++ ++ G A P S +P +P+ L+ L I+E+ E G ++ S G +
Sbjct: 533 IIDLIPGTA-PISKRPYRMPVNELEELKK--------QIRELQEKGFVRPSSSPWG--AP 581
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ--KGDYMIS-IDLSQAYF 558
+ V K +G R ++ + LN+ K+ L RI KG M S IDL Y
Sbjct: 582 VLFVKKKDGSMRMCVDYRSLNEVTIKNKYPLP---RIDDLFDQLKGAKMFSKIDLRSGYH 638
Query: 559 HVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLD 618
+ I+T ALS + T + FGL AP F +L N V + VVV++D
Sbjct: 639 QLKIRTEDIPKTALSTRYGLYEFTVMSFGLTNAPAYFMNLMNKV--FMDYLDKFVVVFID 696
Query: 619 DFLLVNQD 626
D L+ ++D
Sbjct: 697 DILIYSKD 704
>gi|440800943|gb|ELR21969.1| hypothetical protein ACA1_325410 [Acanthamoeba castellanii str.
Neff]
Length = 305
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 9/103 (8%)
Query: 787 VDSSFLS-GLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQ 845
+DSS S G W Q HINK E+ AVHQAL LP L ++++ +N T V Y+
Sbjct: 86 LDSSTTSMGWWKHCSQ--HINKLELKAVHQALKALLPCLWGKLILLHCNNVTAVVYI--- 140
Query: 846 GGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
K L + KIF L + I +LA +PG N+ AD LS
Sbjct: 141 ---KHLVMNCMTHKIFDLCEHHNIQLLAIHLPGVENNRADHLS 180
>gi|308471499|ref|XP_003097980.1| hypothetical protein CRE_10442 [Caenorhabditis remanei]
gi|308269548|gb|EFP13501.1| hypothetical protein CRE_10442 [Caenorhabditis remanei]
Length = 876
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 56/244 (22%), Positives = 110/244 (45%), Gaps = 27/244 (11%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSLIN 534
+S I+ + +TGV+ +D + + + + V K NG R + GLN + L
Sbjct: 39 VSTEIERLNQTGVISPVDHSE-WAAPVVAVKKKNGSIRLCADFSTGLNDAIESNNHPLPT 97
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I + L G++ IDL++A+ V + Q+ L ++ + + LPFG+ +AP
Sbjct: 98 SDDIFAKLNGGNFFTQIDLAEAHLQVEMDPDSQKLLVINTHLGLFTYNRLPFGVKSAPGI 157
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLV-----NQDPRILEIQGKLAVSILGSLGWIVN 649
F + + + + L V YLDD ++ + R+ ++ G++ G+ +
Sbjct: 158 FQQIMDTMLNGLEG----VSTYLDDIIICGSTIEEHNERVFKVFGRIQ-----EYGFRIK 208
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPE-DKQLTLGNILRTLLASKTWNLDSARSLLG 708
++K S + +FLG + + R P+ +K L + N+ + N+ +S LG
Sbjct: 209 MEKCSFLMEEI-KFLGFIINKQGRR---PDPEKVLHIKNM------PEPTNVSQVKSFLG 258
Query: 709 YLSF 712
+ F
Sbjct: 259 LIQF 262
>gi|189236290|ref|XP_001815266.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 1446
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 96/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 638 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 692
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 693 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 752
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 753 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 810
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 811 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 861
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + S SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 862 VKDYATIAS----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 916
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 917 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 976
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 977 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 1028
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 1029 DALSRN 1034
>gi|211925527|dbj|BAG81987.1| gag-pol polyprotein [Clonorchis sinensis]
Length = 489
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 85/192 (44%), Gaps = 10/192 (5%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRI 538
+ MLE G+++ S++ + S L +VPK + G RP + + LN P ++ + +
Sbjct: 122 FEHMLELGIIR--TSSSHWSSPLHMVPKKSKGDWRPCGDYRSLNNATIPDRYPIPHIHDF 179
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
S L + +DL +AY+H+P+ A++ + T +PFGL A Q F
Sbjct: 180 ASTLCHTNIFSKLDLVRAYYHIPVAPDDIPKTAITTPFGLFEFTRMPFGLRNAAQTFQRF 239
Query: 599 SNWVASLLRSRGM-RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+ V RG+ V YLDD L+ + P + L + +N+ K L
Sbjct: 240 MDEVL-----RGLPFVYAYLDDVLIASTSPTEHAAHLRAVFERLSTYSIRLNIDK-CLFG 293
Query: 658 APVLQFLGIMWD 669
L FLG D
Sbjct: 294 VTSLDFLGHHID 305
>gi|402534616|gb|AFQ62092.1| P194 [Rice tungro bacilliform virus]
Length = 1674
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 78/393 (19%), Positives = 168/393 (42%), Gaps = 42/393 (10%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT----RPVLNLKGLNQFLSPKKFSLIN 534
I+E+L+ ++++ T + F+V + R V N K LN + F++ +
Sbjct: 1195 QIKELLDNKLIRKASPTCRHRTAAFVVRNHSEEVAQKPRIVYNYKRLNDNMVTDPFNIPH 1254
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
+ + LQ+ DL + H+ +K + + + + + PFG+A AP A
Sbjct: 1255 KISMINLLQRARIFSKFDLKAGFHHMKLKEDFKDWTTFTCSEGLYTWNVCPFGIANAPCA 1314
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS 654
F L+ ++Y+DD L+ + + + KL + + +G +++ +KS
Sbjct: 1315 FQRFMQESFGDLKF----ALLYIDDILIASSNEQEHIKHLKLFFTRVKEVGCVLSKKKSK 1370
Query: 655 LSPAPVLQFLGIMWDPHLDRMWLPEDK---QLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ V ++LG+ + E K Q + ++ SK L ++ LG L+
Sbjct: 1371 MFLKEV-EYLGVE---------IKEGKISLQPHIVEKIKKFDKSKLSTLKGLQAYLGLLN 1420
Query: 712 FA-SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQ 769
+A ++ + +L ++ + G + ++ K+E ++ + PLS P
Sbjct: 1421 YARGYIKDLSKLVGPLYKKTG---KSGQRTFNKEDWNIIFKIEREVDKIKPLSRPEESDY 1477
Query: 770 VQHFISTDASDLGWGSQV----------DSSFLSGLWS---REQQNWHINKKEMFAVHQA 816
+ I TDAS+ GWG+ + D+ ++G S E++ W E+ A+++A
Sbjct: 1478 I--IIETDASEEGWGAVLICKPDKYASKDTEKIAGYASGNFGEKKTWTSLDYEIEAINEA 1535
Query: 817 LSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK 849
L+ + +++D + +V ++ + K
Sbjct: 1536 LN-KFQIYLDRDFTIRTDCEAIVKGIKTEDYKK 1567
>gi|270005481|gb|EFA01929.1| hypothetical protein TcasGA2_TC007543 [Tribolium castaneum]
Length = 8815
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 5067 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 5121
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 5122 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 5181
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 5182 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 5239
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 5240 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 5290
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + S SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 5291 VKDYATIAS----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 5345
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 5346 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 5405
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 5406 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 5457
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 5458 DALSRN 5463
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 6492 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 6546
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 6547 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 6606
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 6607 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 6664
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 6665 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 6715
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + S SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 6716 VKDYATIAS----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 6770
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 6771 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 6830
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 6831 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 6882
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 6883 DALSRN 6888
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 607 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQYRMCVDYRQLNSKTIKDRFPLP---RVD 661
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 662 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 721
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 722 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 779
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 780 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 830
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 831 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 885
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 886 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 945
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 946 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 997
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 998 DALSRN 1003
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 3569 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 3623
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 3624 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 3683
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 3684 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLQTVFQLLRQFGLTLKLSKCCFF 3741
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 3742 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 3792
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 3793 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDTEAETEL 3847
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 3848 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 3907
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 3908 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 3959
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 3960 DALSRN 3965
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 93/423 (21%), Positives = 172/423 (40%), Gaps = 50/423 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN ++ + F +P
Sbjct: 7990 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSK------TIKDRFPLP 8041
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
F ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 8042 RFF------TTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQRAM 8095
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N V LR + Y+DD L+ ++D + +L G + L K +
Sbjct: 8096 NKVLGPLRFQT--AYCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFFGSQ 8153
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASFVIP 718
+ ++LG + + K +T K ++ R LG F +V
Sbjct: 8154 I-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKYVKD 8204
Query: 719 MGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTD 777
+ + SLL+ G+ + L+ L + P+ + I+ + + + TD
Sbjct: 8205 YATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETELHTD 8259
Query: 778 ASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS 827
AS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 8260 ASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYLLGL 8319
Query: 828 VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSL 887
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S D+L
Sbjct: 8320 QFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHVDAL 8371
Query: 888 SRS 890
SR+
Sbjct: 8372 SRN 8374
Score = 47.0 bits (110), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 67/150 (44%), Gaps = 10/150 (6%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 2105 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQYRMCVDYRQLNSKTIKDRFPLP---RVD 2159
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 2160 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 2219
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
N + LR + Y+ D L+ ++D
Sbjct: 2220 RAMNKMLDPLRFQT--AFCYIADLLIPSKD 2247
>gi|270014457|gb|EFA10905.1| hypothetical protein TcasGA2_TC001731 [Tribolium castaneum]
Length = 1398
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/398 (22%), Positives = 151/398 (37%), Gaps = 37/398 (9%)
Query: 505 VPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKT 564
V K NG R ++ + LN L LQ Y S+DL Y+ +PI
Sbjct: 583 VEKKNGEKRLCIDYRKLNSMTVKDSHPLPRISEQIDRLQGAKYFSSLDLKSGYYQIPISE 642
Query: 565 THQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVN 624
+ + + +PFGL AP+ F N +LLR VYLDD LL +
Sbjct: 643 NSRHYTSFVTPSGQYEYLRMPFGLTNAPRVFQRFMN---NLLRPVSKIAAVYLDDVLLHS 699
Query: 625 QDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLT 684
+ + +L + G +N QK + V FLG + R L DK
Sbjct: 700 NTEGQALCDLREVLDVLRAEGLTLNFQKCAFLKETV-HFLGFEVSDGIIRPGL--DKIQA 756
Query: 685 LGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTP 743
+ N S N+ R +G + + L +R + +L R G
Sbjct: 757 VKNF------SPPKNVKQIRQFIGLTGYFRHFVKNYALIARPL---TNLTRKGVNWKWDT 807
Query: 744 INPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS-----QVDS-----SFLS 793
+L+ L + P+ S P V + TDAS LG Q D ++ S
Sbjct: 808 EEELAFERLKEILTSRPVLSIYDPTAVTE-LHTDASSLGVAGILLQYQTDGRLHPIAYYS 866
Query: 794 GLWSREQQNWHINKKEMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLS 852
+ ++++H + E AV +++ + LL +V N+ + + Q
Sbjct: 867 RQTNEHERHYHSFELETLAVVESVKKFRIYLLDLEFTIVTDCNELKATSNKSQ------- 919
Query: 853 LLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
L+ + + +L ++R + ++ G S D+LSR+
Sbjct: 920 LIPRIARWWLQLLEFRFKV--KYRAGTQMSHVDALSRN 955
>gi|391331472|ref|XP_003740170.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 756
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 91/201 (45%), Gaps = 7/201 (3%)
Query: 468 LATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLS 526
+A + + + +++ G L ++D + + + + +V K NG R + GLN+ L
Sbjct: 474 VAIALQEQIDKELDRLIQNGTLTKVDFSE-WATPIVVVKKANGSIRVCADYSTGLNEALV 532
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ L N I + +DL+ AY +P+ QR ++ + + T L F
Sbjct: 533 DIEHPLPNMEEIMTKFSGNRVFSQLDLADAYLQLPLDENSQRVTTITTHRGLFQYTRLVF 592
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGW 646
GL TAP F V L+ G V+VYLDD L++ D + + + L G+
Sbjct: 593 GLKTAPSIFQKTIEQV--LMGMEG--VLVYLDDILVMAPDTERHDQRLNRVLQRLQDSGF 648
Query: 647 IVNLQKSSLSPAPVLQFLGIM 667
+ L+K P +++LG++
Sbjct: 649 HLKLEKCYFH-VPKVKYLGMV 668
>gi|77551190|gb|ABA93987.1| retrotransposon protein, putative, unclassified [Oryza sativa
Japonica Group]
Length = 1485
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 129/313 (41%), Gaps = 22/313 (7%)
Query: 476 MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
+ + +ML+ G++K S S + LV K +G R ++ + LN K+ L
Sbjct: 677 IECQVADMLDRGIIKPSSSPF--SSPVLLVKKKDGSWRFCVDYRHLNAITVKNKYPLPII 734
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ L + +DL Y + + + A + LPFGL +AP F
Sbjct: 735 DELLDELAGACWFSKLDLRSGYHQIRMHPDDEHKTAFKTHHGDFEFRVLPFGLTSAPATF 794
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
S+ N V + R V+V+++D L+ + E+ + + IL V K S
Sbjct: 795 QSIMNSVLAPYLRRS--VLVFVNDILVYSHSLAEHEVHLRQVLQILSDNHLKVKQSKCSF 852
Query: 656 SPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFAS 714
P P L +LG + + + EDK ++A + W S + L +L +
Sbjct: 853 -PQPQLAYLGHVISA--NGVATDEDK----------IMAVRNWITPTSVKELRSFLRLSG 899
Query: 715 FVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALP-LSSPIFPRQVQH 772
+ R + + +LLR G + T ++ L+ L + P L+ P F Q Q
Sbjct: 900 YYRKFVRNYGIICKPLTNLLRKGQLFVWTSVHEEAFVTLKSALISAPVLAMPDF--QKQF 957
Query: 773 FISTDASDLGWGS 785
+ TDASD G G+
Sbjct: 958 VVETDASDKGIGA 970
>gi|328871951|gb|EGG20321.1| hypothetical protein DFA_07444 [Dictyostelium fasciculatum]
Length = 1441
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 87/205 (42%), Gaps = 5/205 (2%)
Query: 466 QHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
HL++ ++ M +++ L +G + S + + S + V K +G R ++ + LN+
Sbjct: 515 NHLSSEENNVMFTTVEKGLASGRIA--PSKSPYNSAVLFVRKKDGTLRMCVDFRALNKQT 572
Query: 526 SPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLP 585
+F L ++ + K IDL + + IK H A S T +P
Sbjct: 573 VADRFPLPRIDQLIEKIAKAKIFSKIDLKDGFNQIRIKDEHTHKTAFSTPSGHYEYTVIP 632
Query: 586 FGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLG 645
FGL AP AF N A+ V++Y+DD L+ +++ K + L S
Sbjct: 633 FGLRNAPSAFVRAIN--AAFADILDTFVIIYIDDILIFSENENDHYEHIKQVLDRLRSNK 690
Query: 646 WIVNLQKSSLSPAPVLQFLGIMWDP 670
N KSS V +FLG + P
Sbjct: 691 LFANKAKSSFLVKEV-EFLGHLITP 714
>gi|189242078|ref|XP_001808624.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 1242
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 434 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 488
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 489 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 548
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 549 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 606
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 607 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 657
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 658 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 712
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 713 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 772
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 773 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 824
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 825 DALSRN 830
>gi|169639661|gb|ACA60916.1| pol protein [Thalassiosira pseudonana]
Length = 1239
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 109/473 (23%), Positives = 190/473 (40%), Gaps = 69/473 (14%)
Query: 450 AIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLK-RLDSTTGFLSRLFLVPKG 508
AIP+ K VP + L V Q +++ GVL + DS G + F++PK
Sbjct: 334 AIPYHGKAFPVPFIHKETLMKEV--------QRLVDLGVLIPQNDSEWG--APTFIIPKK 383
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
NG R + + + LN+ + K F + + LQ Y ++DL+ Y+ + + +
Sbjct: 384 NGTVRIISDFRELNKRIKRKPFPIPKISTVLQELQGFTYATALDLNMGYYTIRLDPDASK 443
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFAS-LSNWVASLLRSRGMRVVVYLDDFLLVNQDP 627
+ + LP G+A +P F S +S +A+L R YLDD L++++
Sbjct: 444 LCTIILPWGKYSYARLPMGVAGSPDLFQSKMSALMANLEYVR-----TYLDDLLILSKGT 498
Query: 628 RILEIQGKLAV-SILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWL---PEDKQL 683
++ + V L G VN KS+ + + ++LG + L R + PE Q
Sbjct: 499 FDDHLEKMVEVFERLREAGLRVNAAKSTFATDEI-EYLGYI----LSRAGIKPQPEKVQA 553
Query: 684 TLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRL---GAPH 740
L + N+ R LG + + L +R + A L L G
Sbjct: 554 ILA-------INPPKNVKELRKFLGIVQYYR------DLWEKRSEMLAPLTDLVGEGGVT 600
Query: 741 LTPINPAVLPKLEWW----LNALPLSSPIFPRQV---------QHFISTDASDLGWGSQV 787
T + +W A + R V + I TDAS G+ +
Sbjct: 601 KTKKQKGTVKAPWYWDKKHQQAFENVKAMIARDVVLAYPNFKEEFVIYTDASKRQLGAVI 660
Query: 788 DS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYL 842
+F S S Q + + + E+ A+ + L +L + + +D+ V+ +
Sbjct: 661 TQNNRPIAFFSRKLSEAQSKYSVTELELLAMVECLKEFKGMLWGQKITIHTDH---VNLM 717
Query: 843 RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPD 895
R G S V + LL +++ I+ +I G N+VAD++SR + P+
Sbjct: 718 RDALGLSS----DRVYRWRLLLEEYAPKIV--YIKGEVNTVADAISRLEYNPE 764
>gi|116196940|ref|XP_001224282.1| hypothetical protein CHGG_05068 [Chaetomium globosum CBS 148.51]
gi|88180981|gb|EAQ88449.1| hypothetical protein CHGG_05068 [Chaetomium globosum CBS 148.51]
Length = 1784
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 104/255 (40%), Gaps = 26/255 (10%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
H + + ++++ LE G ++ S G+ + VPK +G R ++ + LN
Sbjct: 857 HTNEKQDAELRSYLEKNLEIGHIRPSTSPAGYP--VLFVPKKDGKLRMCVDYRQLNNERV 914
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
++ L R+ L Y +DL AY H+ IK + A +PF
Sbjct: 915 KNRYPLPLIARLRDQLSGAQYYTRLDLPTAYAHIRIKEGDEWKTAFRTPYGHYEYLVMPF 974
Query: 587 GLATAPQAF-ASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL- 644
GL AP F A++ + + L V YLDD L+ + + LE + +L +L
Sbjct: 975 GLTNAPATFQAAIDHAIRHCL---DKFAVCYLDDILIYS---KTLEEHKEHVRQVLDALH 1028
Query: 645 --GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW-NLD 701
VN KS + FLG P ++ PE L A +TW
Sbjct: 1029 EHKLSVNKDKSEFHVKKTV-FLGYEISPGWVKI-EPE-----------KLEAVRTWPTPT 1075
Query: 702 SARSLLGYLSFASFV 716
+A + G++ FA+FV
Sbjct: 1076 NATEVRGFIGFANFV 1090
>gi|38605839|emb|CAE02919.3| OSJNBb0108J11.11 [Oryza sativa Japonica Group]
Length = 815
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 115/473 (24%), Positives = 192/473 (40%), Gaps = 72/473 (15%)
Query: 457 PPLVPLCSLQHLATPV-SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPV 515
P VP+ + TP + ++EML+ G+++ S S + LV K +G R
Sbjct: 152 PGAVPVNVRPYRYTPAQKDEIEQQVREMLDKGIIQPSSSPF--SSPVLLVKKKDGTWRFC 209
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
++ + LN K+ L + L + + +DL Y + +K + + A +
Sbjct: 210 VDYRHLNAITVKNKYPLPIIDELLDELSRAQWFTKLDLRAGYHQIRMKMSDEHKTAFKTH 269
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL--------VNQDP 627
+PFGL +AP F N + S L R V+V++DD L+ VN
Sbjct: 270 SGHYEFRVIPFGLTSAPATFQGGMNSILSPLLRRC--VLVFVDDILIYSATLEDHVNHLR 327
Query: 628 RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN 687
++ +I K + + S K S + L +LG + P + + +DK + N
Sbjct: 328 QLFQILVKHQLKVKQS--------KCSFAQQR-LSYLGHIITP--NGVSTDDDKIRVVQN 376
Query: 688 ILRTLLASKTW----NLDSARSLLGYLSFA-SFVIPMGRLHSRRIQRQASLLRLGAPHL- 741
W ++ RS LG + FV G L S+ + +LLR G ++
Sbjct: 377 ----------WPVPGSVKELRSFLGLTGYYRKFVCHYGIL-SKPL---TNLLRKGQLYIW 422
Query: 742 TPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGS-----QVDSSFLSGL 795
T A L+ L P L+ P F + TDASD G G+ Q +FLS
Sbjct: 423 TSETEAAFQALKQALITAPVLAMPNFSEPF--IVETDASDKGIGAVLMQHQHPIAFLSKA 480
Query: 796 WSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLS 855
Q +K+ A+ A+ P LQ + +++D+++ +S+L Q T +
Sbjct: 481 LGPRHQGLSTYEKKSLAIMLAVEHWRPYLQHAEFFIRTDHRS-LSFLDDQRLTTPWQHKA 539
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR------------SKSLPDW 896
+ + L R I+ + G N AD+LSR S ++PDW
Sbjct: 540 LTKLLGL-----RYKII--YKKGTDNGAADALSRYPSSATLELSALSVAVPDW 585
>gi|307202550|gb|EFN81895.1| Uncharacterized protein F44E2.2 [Harpegnathos saltator]
Length = 1389
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/217 (22%), Positives = 92/217 (42%), Gaps = 17/217 (7%)
Query: 452 PFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGG 511
PF K +P L+ + ++EM GV+ R + T ++S L V K NG
Sbjct: 857 PFKGKSYPIPQKHLEEV--------RRQLREMENLGVVSR--AATQYISPLVAVTKPNGK 906
Query: 512 TRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLA 571
R L+ + +N + + + + +D++QAY+ +P+ +++
Sbjct: 907 IRICLDARNINDRMENDHAQPPTIEEVLANIGHKSIFSKLDITQAYWQIPLTANSRQYTG 966
Query: 572 LSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR---GMRVVVYLDDFLLVNQDPR 628
S++ LPFG+ TA AS + + + L+ + V+VYLDD L+ +++
Sbjct: 967 FSFDHQTYIFERLPFGIKTAG---ASFTRAIEAALKGKPELRKHVIVYLDDVLIASENET 1023
Query: 629 ILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG 665
L G+ +N K + ++ FLG
Sbjct: 1024 DHLSHLASVFEALQEAGFRLNRDKCEFARDRIV-FLG 1059
>gi|189236288|ref|XP_001815250.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 1523
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 686 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 740
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 741 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 800
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 801 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 858
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 859 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 909
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + S SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 910 VKDYATIAS----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 964
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 965 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 1024
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 1025 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 1076
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 1077 DALSRN 1082
>gi|7493955|pir||T18350 probable pol polyprotein - rice blast fungus gypsy retroelement
(fragment)
gi|538067|gb|AAA21442.1| putative pol polyprotein, partial [Magnaporthe grisea]
Length = 1398
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 102/468 (21%), Positives = 173/468 (36%), Gaps = 70/468 (14%)
Query: 454 SAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTR 513
S K P +P L H+ + I +M++ G ++ S+ + + +V K +GG R
Sbjct: 354 SGKTPALPWGRLYHMPREQLLELRRQIVDMMDKGWIRASSSSA--AAPVLMVRKASGGWR 411
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
++ + LN ++ L L + +D+ A+ + I + A
Sbjct: 412 LCVDYRALNSITMQDRYPLPLIKETIRSLTGARWFTKVDVRAAFHKLRIAEGDEHLTAFR 471
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD------P 627
+ PFGLA AP F N V L + G YLDD L+ +
Sbjct: 472 TRFGLFEWLVCPFGLAGAPATFQRYVNGV--LGDTLGDYASAYLDDILIYSSGSKSDHWS 529
Query: 628 RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN 687
++ + KLA + G ++L KS+ + V ++LG + PE
Sbjct: 530 KVTRVLDKLAAA-----GLNLDLDKSAFAVKEV-KYLGFIVKAGEGVQADPE-------- 575
Query: 688 ILRTLLASKTWNLDSA-RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
+ A + W + R L G+L FA+F +S +L + G P
Sbjct: 576 ---KIKAIRDWEAPTRLRGLRGFLGFANFYRDFIDGYSTLTAPLLALTKKGTPF------ 626
Query: 747 AVLPKLEWWLNALP---LSSPIFPR---QVQHFISTDASDLGWGSQVDSSFLSGLW---- 796
+LE AL L +PI + + TD S G + GLW
Sbjct: 627 RWTEELEGAFEALKHAFLQAPILAQWDDAKDTRMETDCSGAALGGCLSQKGTDGLWRPVA 686
Query: 797 ------SREQQNWHINKKEMFAVHQALSLNLPLLQS--SVVMVQSDNQTVVSYLRRQGGT 848
+ Q+N+ I+ KE+ AV L L+S ++ +D+
Sbjct: 687 FHSAKLTDAQRNYTIHDKELLAVIACLKAWDAELRSVRRPFLILTDH------------- 733
Query: 849 KSLSLLSEVEKIFLLSQDW-----RIHILAQFIPGAYNSVADSLSRSK 891
K+L S+ ++ W + + +F PG V D+LSR +
Sbjct: 734 KALEYFSKPREVSERQMRWAETLSKFNYNLRFRPGRLAGVPDALSRRE 781
>gi|341885905|gb|EGT41840.1| hypothetical protein CAEBREN_11901 [Caenorhabditis brenneri]
Length = 2055
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 98/467 (20%), Positives = 192/467 (41%), Gaps = 61/467 (13%)
Query: 443 VRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRL 502
V I + +P +P VP+ + + I +L++G + +S T ++S +
Sbjct: 1024 VHIYTTTEVPVRGRPYRVPV--------KFQADLEKQINGLLKSGRI--TESNTPWISPI 1073
Query: 503 FLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMI---SIDLSQAYFH 559
+V K NG R L+ + LN+ P + L RI + +++ +M S+D++ Y
Sbjct: 1074 VIVKKKNGSLRVCLDFRKLNEVTIPDNYPLP---RIDAIVERVGHMKFFSSLDMANGYLQ 1130
Query: 560 VPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDD 619
+ + + + A T LPFGL +A F + ++L V++Y+DD
Sbjct: 1131 LRLDDESSYKCGFTTENRIYAYTHLPFGLKSAASYFQRA---LKTVLDGMDHEVMLYIDD 1187
Query: 620 FLLVNQ--DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWL 677
L+ ++ D + ++ L V+ +K L ++ FLG +
Sbjct: 1188 VLIFSKTYDEHLDTLERVL--QRFRQYNLKVSPKKCDLVRKSIV-FLG---------HQI 1235
Query: 678 PEDK-QLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRL 736
ED + N+ + N++ R +G F + + +S +Q L
Sbjct: 1236 NEDNYEPNKSNVSAIVNMPTPSNINELRRFIGMTGFFRRFV---KDYSEIVQPLNKLTHK 1292
Query: 737 GAPHL-TPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQV------D 788
P + T ++ + KL+ L + P L P + ++ + TDAS + G+ + D
Sbjct: 1293 NTPFVWTQVHQDAVQKLKTILTSKPVLCYPDYNKEFHCY--TDASGVAQGAVLMQTKPGD 1350
Query: 789 SS------FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYL 842
S + S S+ + E+ A+ AL + S V++ +D++ ++ +
Sbjct: 1351 ESKMQAIAYASRTLSQPETRRAAIHNELGAIIFALRAFKVYIYGSKVVIHTDHRPLIFLM 1410
Query: 843 RRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+R L+ + + Q + I I+ I G N++AD LSR
Sbjct: 1411 KRHKVNDVLA------RWLVELQQYNIDIV--HIDGKRNTIADCLSR 1449
>gi|327280226|ref|XP_003224853.1| PREDICTED: retrotransposable element Tf2 155 kDa protein type
1-like [Anolis carolinensis]
Length = 827
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 108/466 (23%), Positives = 182/466 (39%), Gaps = 70/466 (15%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK 519
+P L L P A+ ++ E L+ G ++ + T + +F V K G R V + +
Sbjct: 75 LPAGRLYALTVPERQALREYLDENLQKGFIRPSNLPTA--APVFFVAKKTGDLRLVCDYR 132
Query: 520 GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVL 579
LN++ ++ L + S +Q +DL AY V IK + A +
Sbjct: 133 VLNKYTVRDRYPLPLIPELLSRVQGASIFTKLDLRGAYNLVRIKEGDEWKTAFNTCFGAY 192
Query: 580 AMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVS 639
+PFGL AP F N V L + VV+YLDD L+ ++D R + +S
Sbjct: 193 ENLVIPFGLCNAPAVFQRFINDVFRDLLDKF--VVIYLDDILIFSKDAREHGEHVRQVLS 250
Query: 640 ILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWN 699
L + G K + V +FLG + +M + +++ N + L + K
Sbjct: 251 RLRANGLFAKASKCVFHVSEV-EFLGHVISGRELKM---DPRKVDTVNTWQELTSKK--- 303
Query: 700 LDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLT-PINPAVLPK--LEWWL 756
+ LG+ ++ IP HLT P+ + K EW +
Sbjct: 304 --DVQRFLGFANYYREFIPH------------------FAHLTVPLTQLLQKKRPFEWTM 343
Query: 757 NA------LPLSSP-----IFPRQVQHFI-STDASDLGWGSQVDSSFLSGLWSREQQNWH 804
A L S + P Q FI DAS+ G+ + Q++
Sbjct: 344 EAKKAFEQLKCSFQNGNILVHPNVNQPFIVEADASNYALGAVL-----------SQKDPS 392
Query: 805 INKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
+ +KE+ A+ A + L+ + ++V SD++ L + L+ +F
Sbjct: 393 VWEKELLAIKVAFEVWRHWLEGAKHQIVVLSDHKN----LEHLQTARKLNQRQIRWALFF 448
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWHLSRSATEQIFL 908
D+R+ QF+ G N AD+LSR P++ + EQ L
Sbjct: 449 SRFDFRV----QFVEGKQNLRADALSRK---PEFKTAEIPPEQTIL 487
>gi|291239145|ref|XP_002739485.1| PREDICTED: LReO_3-like [Saccoglossus kowalevskii]
Length = 1469
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 114/492 (23%), Positives = 192/492 (39%), Gaps = 73/492 (14%)
Query: 426 LRRFVDAWIRLGAPAPLVR--IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEM 483
L R+ D + LV +V+ P KP +P V S + + +M
Sbjct: 1018 LERYGDVFTDAPGLTGLVEHEVVTCEDFPLRQKPYRLP--------QAVQSTVREELDKM 1069
Query: 484 LETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQ 543
L++G+++ S + + S + LV K +G R ++ + LN S KF R+ ++
Sbjct: 1070 LKSGIIR--PSKSPWASPIVLVGKKDGTVRFCVDYRSLN---SVTKFDAYPMARVEDLIE 1124
Query: 544 K-GD--YMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSN 600
K GD Y+ + DLS+ Y+ +P+ + A + T +PFG+ P A+L
Sbjct: 1125 KFGDACYISTFDLSKGYWQIPLSKSSCEKSAFITQFGLFEFTVMPFGMKNGP---ATLQR 1181
Query: 601 WVASLLRSRGMRVVVYLDDFLL--VNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
+ +L+ Y+DD + V+ +L +Q L L + V K +
Sbjct: 1182 LINQILQGTNEYAGAYMDDIEVHDVSWKEHLLHLQAVL--ERLRNAKLTVKPSKCYIG-G 1238
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL-----SFA 713
P + F+G M + R L DK + N R N+ + L+GY +F+
Sbjct: 1239 PQVSFVGYMAGSGVIRAML--DKVQAVNNFPR---PKTKQNVRAFLGLVGYYRKFIPNFS 1293
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQH 772
+ L ++ Q T KL+ L + P L +P + +
Sbjct: 1294 EIAFYLTELIKKKCSNQVI--------WTEDCETSFCKLKQALTSEPVLHNPDYTKPF-- 1343
Query: 773 FISTDASDLGWGSQVDSSFLSG------LWSRE----QQNWHINKKEMFAVHQALSLNLP 822
+ DA D G G + + G SR+ + N+ +KE A+ A+ P
Sbjct: 1344 VLQVDACDHGIGGVLSQNNDKGEEHPIVYISRKLLPREMNYATIEKECLAIVWAVEALYP 1403
Query: 823 LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF----LLSQDWRIHILAQFIPG 878
L VQSD+ + K L+ L E + L Q ++I+ Q G
Sbjct: 1404 YLYGRAFTVQSDHHPL----------KWLNQLRERNRRLMRWSLTLQGYQINF--QHKKG 1451
Query: 879 AYNSVADSLSRS 890
N AD LSR+
Sbjct: 1452 VENGNADGLSRA 1463
>gi|157652694|gb|ABV59399.1| pol [Spider monkey foamy virus]
Length = 1148
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 119/257 (46%), Gaps = 25/257 (9%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLS 526
H+ ++ + I ++L+ GVL++ ST+ + ++ VPK +G R VL+ + +N+ +
Sbjct: 165 HINPKAKPSIQIVINDLLKQGVLRQ--STSPMNTPVYPVPKPDGKWRMVLDYRAVNKTIP 222
Query: 527 PKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+ I + L + Y +IDLS ++ PI Q A ++ G T LP
Sbjct: 223 LIAAQNQHSLGILTNLIRHKYKSTIDLSNGFWAHPITEDSQWITAFTWEGKQHVWTRLPQ 282
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL-- 644
G +P F + + V L G V VY+DD + + +E ++ SI L
Sbjct: 283 GFLNSPALFTA--DVVDILKEVPG--VSVYVDDIYISSP---TMEEHFQVLDSIFRKLLE 335
Query: 645 -GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT--WNLD 701
G+IV+L+KS+L+ V FLG ++ + L + R L T L
Sbjct: 336 TGYIVSLKKSALARYEV-NFLG----------FVISETGRGLTSEFRERLQEITPPTTLK 384
Query: 702 SARSLLGYLSFASFVIP 718
+S+LG+L+FA +P
Sbjct: 385 QLQSILGFLNFARNFVP 401
>gi|254587279|emb|CAX83696.1| Gap-Pol polyprotein [Schistosoma japonicum]
Length = 1365
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 104/222 (46%), Gaps = 18/222 (8%)
Query: 465 LQHLATPV--------SSAMSLHIQEM---LETGVLKRLDSTTGFLSRLFLVPKGNGGTR 513
L+H ATPV +A+ + QE+ ++GV++ ++ + + + + +V K NG R
Sbjct: 483 LKHGATPVFRPKRPVPYAALPIVEQELDRLQKSGVIEPVNFSE-WAAPIVIVKKPNGSIR 541
Query: 514 PVLNLK-GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL 572
+ GLN+ L ++ L + + L Y +DLS AY +P+ +++L +
Sbjct: 542 LCADYSTGLNEALEAHQYPLPLPEDLFAKLNGRRYFAKLDLSDAYLQIPVAEECKQYLTI 601
Query: 573 SYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEI 632
+ + + LPFG+ TAP F + + + + YLDD L++ D LE
Sbjct: 602 NTHKGLFRYNRLPFGVKTAPSIFQQIMDTMLQDIPG----TAAYLDDILIMGVDQADLEK 657
Query: 633 QGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDR 674
+ ++ +G G+ + +K V ++LG + D + R
Sbjct: 658 KLDSVLTRIGDFGFPLRAEKCDFHLQQV-RYLGFIVDKNGRR 698
>gi|189236296|ref|XP_001815322.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 1271
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 434 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQYRMCVDYRQLNSKTIKDRFPLP---RVD 488
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 489 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 548
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 549 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 606
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 607 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 657
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 658 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 712
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 713 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 772
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 773 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 824
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 825 DALSRN 830
>gi|291232291|ref|XP_002736091.1| PREDICTED: RETRotransposon-like family member (retr-1)-like
[Saccoglossus kowalevskii]
Length = 1096
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 99/467 (21%), Positives = 183/467 (39%), Gaps = 68/467 (14%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ +++ G+++ + T +++ + LV K NG R L+ + LN+F+ + I
Sbjct: 265 LDRLIQLGIIREVQEPTDWVNSIVLVTKPNGSLRICLDPRELNKFIKRPHYYAKTLDDIL 324
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L+ + ++DL Y+++P+ T Q S T LPFGL +A F
Sbjct: 325 PDLKNTQHFSTLDLRSGYWNIPLDTESQLLTTFSTIFGRYCFTRLPFGLISAQDVFQ--- 381
Query: 600 NWVASLLRSRGM--RVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
L R G V DD L+ + + + + +N +K
Sbjct: 382 ---VDLDRIFGQIDNVHCLKDDILIAAETSTQHDKAVQDVLKACRQYNVKLNDEKCQFHQ 438
Query: 658 APVLQF------LGIMWDPH----LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLL 707
V F G+ DP + RM P+DKQ +SLL
Sbjct: 439 DKVKLFGHILSKDGVSPDPSKVKAIQRMEAPKDKQ-------------------ELQSLL 479
Query: 708 GYLSFASFVIPMGRLHS--RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI 765
G +++ S + + L R++ ++ + A H L +++ + P+ +
Sbjct: 480 GLVNYLSKFVKLSPLTEPLRKLLQKGVVFEWSASH-----DKALDQIKQAITKAPVLT-Y 533
Query: 766 FPRQVQHFISTDASDLGWGS---QVDS--SFLSGLWSREQQNWHINKKEMFAVHQALSLN 820
F I +AS G G+ Q D F S + + N+ ++EM AV A
Sbjct: 534 FDASKDIVIQCNASSKGLGAVLLQDDKPIHFASKALTPAEANYSNIEREMLAVVWATRYF 593
Query: 821 LPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAY 880
+ + SD+Q + S ++Q + + ++++ L Q + +I +++PG
Sbjct: 594 KHYVFGRPFTIHSDHQPLESIAKKQIN----KMPARLQRLILQIQGYNYNI--KYVPGKD 647
Query: 881 NSVADSLSRSKSLPDWHLSRSATEQIFLKWGVPCIDLFASRVSAVVP 927
+AD LSR + ++ T QI P +D+ VS + P
Sbjct: 648 VLLADCLSRCIN------TKQTTFQI------PDVDVHIHEVSTMKP 682
>gi|189533024|ref|XP_001336517.2| PREDICTED: disks large-associated protein 2 [Danio rerio]
Length = 1434
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 85/204 (41%), Gaps = 10/204 (4%)
Query: 461 PLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKG 520
P L L+ P +AM ++ E L G+++ S G + F V K +G P ++ +
Sbjct: 531 PRGRLFSLSAPERAAMDKYLTESLAAGIIRHSSSPAG--AGFFFVKKKDGSLCPCIDYRD 588
Query: 521 LNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLA 580
LN ++ L LQ + +DL AY V ++ + A +
Sbjct: 589 LNDITIKNRYPLPLMSSAFDLLQGERFFTKLDLRNAYHLVCMREGEEWKTAFNTATGHFK 648
Query: 581 MTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI 640
LPFGL AP F +L+ ++ V + +LL NQ + + K V
Sbjct: 649 YLVLPFGLTNAPAVF-------QALVSDEHVQHVRRVLQWLLENQ-LYVKTEKCKFHVQS 700
Query: 641 LGSLGWIVNLQKSSLSPAPVLQFL 664
+ LG I++++ + PA + F+
Sbjct: 701 VSFLGHIISVEGLCMDPAKAVHFI 724
>gi|308462401|ref|XP_003093484.1| hypothetical protein CRE_26772 [Caenorhabditis remanei]
gi|308250141|gb|EFO94093.1| hypothetical protein CRE_26772 [Caenorhabditis remanei]
Length = 518
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/400 (24%), Positives = 157/400 (39%), Gaps = 31/400 (7%)
Query: 547 YMISIDLSQAYFHVPIKTTHQRFLALSY----NGDVLAMTCLPFGLATAPQAFASLSNWV 602
+ + D Y HV I+ FLA S LPFGL+TAP F + +
Sbjct: 3 FAATFDFKSGYHHVKIEENSSEFLAFSLTDPPKAPFYKYRALPFGLSTAPWLFTKIFRPI 62
Query: 603 ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQ 662
R G+++ +Y+DD L+V + L + S L LG + +K S P+
Sbjct: 63 VGKWRRDGIKIWLYIDDGLIVAETKEELIRAVSIVRSDLERLGVALADEKCSWEPSSEFT 122
Query: 663 FLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG-- 720
+LG + D + L E + + + L + S + LG LS S ++ G
Sbjct: 123 WLGFVGDLRRKTVTLSEKRYKAVLHRLEVIKGSLAPTVLDRERFLGSLS--SMLLVAGND 180
Query: 721 -RLHSRRIQRQ-ASLLRLGAPHLTPI--NPAVLPKLEWW-LNALPLSSPIFPRQVQ--HF 773
+ SR +Q AS R P I L ++ +W N LSS +
Sbjct: 181 AQARSRHMQMTVASARREQLPETRRIEKTKGELAEIRFWSENIRRLSSTKLEENFRPVWR 240
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQ-----ALSLNLPLLQSSV 828
TDAS G G+ + + + + K E A+ + L+ +
Sbjct: 241 AYTDASADGMGALLKNLEGEVVCRISEVGADTFKSESSAMRELKAMRMLARRIAGWIRGA 300
Query: 829 VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
++ D+Q V+ L++ G+ S L E+++ Q ++ +IP N AD S
Sbjct: 301 IVCYVDSQAAVAILKK--GSMSSELQEVAEQVWDAFQT-VGNVRFLWIPRELNKEADFAS 357
Query: 889 RSKSLPDWHLSRSATEQIFL----KWGVPCIDLFASRVSA 924
R DW + +++FL +WG D FA +A
Sbjct: 358 RDFDFDDWGVD----QKVFLWAQTRWGEFKCDWFADEANA 393
>gi|157366227|gb|ABV45229.1| unknown [Ascogregarina taiwanensis]
Length = 728
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 89/199 (44%), Gaps = 22/199 (11%)
Query: 449 YAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG 508
Y+ F K P V + +++L + + ++ ML GV++ S G S K
Sbjct: 369 YSAGFCVKGPPVKV-KMRYLTPELKEELDRQLEAMLAAGVIQPSKSAWG--SVPVFTKKK 425
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQK------GDYMISIDLSQAYFHVPI 562
+GG R L+ + LN + ++ IP F Q+ + I +D++ ++++P+
Sbjct: 426 DGGWRLCLDYRRLNAQMESDRYP------IPLFWQQIQEAAHHRWYICLDINWGFWNLPL 479
Query: 563 KTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
K + F A + L FG+ +P F + + V S ++G V+ Y+DD ++
Sbjct: 480 KEDSREFTAFLTHRGAFEFRVLLFGVKNSPSEFQRMMDGVLSEFYNKG--VLCYIDDIII 537
Query: 623 VNQDP-----RILEIQGKL 636
DP R+ EI KL
Sbjct: 538 FANDPSQCLGRLEEILQKL 556
>gi|270017030|gb|EFA13476.1| hypothetical protein TcasGA2_TC012973 [Tribolium castaneum]
Length = 1233
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 607 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 661
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 662 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 721
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 722 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 779
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 780 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 830
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 831 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 885
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 886 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 945
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 946 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 997
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 998 DALSRN 1003
>gi|432863505|ref|XP_004070100.1| PREDICTED: uncharacterized protein LOC101166792 [Oryzias latipes]
Length = 1023
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 67/147 (45%), Gaps = 11/147 (7%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
+ ML G++K S + + + LVPK +G R ++ + LN S KF RI
Sbjct: 731 EVDHMLSLGIIK--PSKSEWCHPVVLVPKKDGTIRFCIDFRYLN---SVSKFDSYPTPRI 785
Query: 539 PSFLQ---KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ ++ K ++ +IDL + Y+ VP+ Q A + T LPFGL AP F
Sbjct: 786 DNLIECLGKAKFLTTIDLCKGYWQVPLTERSQELTAFRTPWGLFQFTVLPFGLHGAPATF 845
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLL 622
L + V L+ YLDD ++
Sbjct: 846 QRLMDQVLGGLKDCA---CAYLDDIVV 869
>gi|108864565|gb|ABA94541.2| retrotransposon protein, putative, unclassified [Oryza sativa
Japonica Group]
Length = 1347
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 83/368 (22%), Positives = 154/368 (41%), Gaps = 29/368 (7%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ +ML+ GV++ S + F S + LV K +G R ++ +GLN K+ + +
Sbjct: 605 VTQMLQHGVIQ--PSHSPFASPVLLVKKKDGTWRFCVDYRGLNDITVKNKYPMPVVDELL 662
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L + +DL Y + + + A + +PFGL AP F L
Sbjct: 663 DELSGLKWFTKLDLRSGYHQIRLVEKDEFKTAFKTHQGHYKFRVMPFGLTNAPATFQGLM 722
Query: 600 NWV-ASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
N + A +R V+V++DD L+ ++ K +++L + K S S
Sbjct: 723 NTIFAHAIRK---FVLVFVDDILIYSKTLAEHVCHLKSVLTVLQQHQLYIKRSKCSFS-Q 778
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVI 717
P L++LG + + + + D Q + A + W + + + L G+L A +
Sbjct: 779 PSLEYLGHI----IGAIGVATDPQ--------KVQAIRDWPVPKNLKQLRGFLGLAGYYR 826
Query: 718 PMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALP-LSSPIFPRQVQHFIS 775
R + + LL+ G + + +++ L P L+ P F +Q I
Sbjct: 827 KFIRNYGVITKPLIELLKKGTFFFWSTLEHTAFQEVKQCLQQAPVLAMPDFNQQF--VIE 884
Query: 776 TDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVM 830
TDASD G G+ + +FLS + Q + +KE A+ A+ LQ +
Sbjct: 885 TDASDKGIGAVLMQAGHPIAFLSKALGPKAQGFSTYEKECLAILMAVDKWHQYLQYAEFA 944
Query: 831 VQSDNQTV 838
+ +D++++
Sbjct: 945 ILTDHRSL 952
>gi|308512333|ref|XP_003118349.1| hypothetical protein CRE_00852 [Caenorhabditis remanei]
gi|308238995|gb|EFO82947.1| hypothetical protein CRE_00852 [Caenorhabditis remanei]
Length = 1564
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 105/448 (23%), Positives = 177/448 (39%), Gaps = 84/448 (18%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+Q+ML V++ S + + S + +V K +G R ++ + +N+ + L +
Sbjct: 326 LQKMLAQDVIRV--SKSPWSSPVVIVKKKDGSVRMCVDYRKVNKVVKNNAHPLPHIEATL 383
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF-ASL 598
L ++DL Y+ +P++ + A + ++ LPFGL T+P F A++
Sbjct: 384 QSLTGKKIFTTLDLLAGYWQIPLEERSKEITAFAIGSELFEYNVLPFGLVTSPAVFQATM 443
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI-LGSLGWIVNLQKSSLSP 657
V LL G VY+DD L+ ++ IQ + I L + G + K ++
Sbjct: 444 EAVVGDLL---GKNAFVYVDDLLIASETME-KHIQDLKEILIRLEASGMKLRASKCHIAQ 499
Query: 658 APVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA-SFV 716
V ++LG P + + E K + N R N + RS LG + F+
Sbjct: 500 REV-EYLGHRITP--EGVKTEETKVNKMKNFTRPE------NAEQMRSFLGLTGYYRKFM 550
Query: 717 IPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEW-W----------LNALPLSSPI 765
+ ++ A LTP+ K+ W W L L S+P+
Sbjct: 551 LNYAQV---------------ASELTPLTSV---KVAWVWQAEQEKAFQELIQLICSAPV 592
Query: 766 F--PRQVQ-------HFISTDASDLGWGS----------QVDSSFLSGLWSREQQNWHIN 806
P Q I DAS G G+ Q +F S S ++ +HI
Sbjct: 593 LMQPNIEQALDGSRPFMIYCDASKKGVGAVLAQEGDDGLQHPIAFSSKALSPAEKRYHIT 652
Query: 807 KKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQD 866
E A+ AL + + V+V +D++ ++S L+ G S L+
Sbjct: 653 DLEALAMMSALRKFKTITYGTSVVVFTDHKPLISLLK--GSPLSDRLMR----------- 699
Query: 867 WRIHILA-----QFIPGAYNSVADSLSR 889
W I I+ +I G N VAD+LSR
Sbjct: 700 WSIEIMQFDVKIVYIAGQANVVADALSR 727
>gi|254210620|gb|ACT65332.1| pol protein [Human immunodeficiency virus 1]
Length = 433
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 125/282 (44%), Gaps = 29/282 (10%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
A++ +EM + G + ++ + + +F + + +G R +++L+ LN+ ++ + +
Sbjct: 131 KALTEICKEMEKEGKISKIGPENPYNTPIFAIKRKDGKWRKLIDLRELNK-ITQDFWEVQ 189
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLAL---SYNGDVLAM----TCLPF 586
+P+ L+K + +D+ AYF VP+ + + A S N + + LP
Sbjct: 190 LGIPLPAGLRKNKSVTVLDIGDAYFSVPLYEDFRNYTAFTIPSINNETPGIRYQYNVLPM 249
Query: 587 GLATAPQAFASLSNWVASLLRSRG--MRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL 644
G +P F S + R++ M +V Y+DD L V D I + + K+ L
Sbjct: 250 GWKGSPAIFQSSMTKILEPYRTKNPEMVIVQYMDD-LYVGSDLEIGQHRAKIEELREHLL 308
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
W + P ++G ++ H D+ W + QL + W ++ +
Sbjct: 309 KWGLTTPDRKYQKEPPFLWMG--YELHPDK-WTVQPIQLP---------NKEDWTVNDIQ 356
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
L+G L++AS + P ++++ LLR GA LT I P
Sbjct: 357 KLVGKLNWASQIYP-----GIKVKQLCKLLR-GAKSLTDIVP 392
>gi|58699442|ref|ZP_00374187.1| pol protein [Wolbachia endosymbiont of Drosophila ananassae]
gi|58534040|gb|EAL58294.1| pol protein [Wolbachia endosymbiont of Drosophila ananassae]
Length = 492
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 72/148 (48%), Gaps = 9/148 (6%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPK--GNGGT---RPVLNLKGLNQFLSPKKFSLIN 534
I+EML G++++ S + + + L+LVPK N GT R V++ + LN+ KF + N
Sbjct: 138 IEEMLHQGIIRK--SNSPYNAPLWLVPKKADNSGTKKWRIVIDYRKLNENTVDDKFPIPN 195
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQA 594
I L + Y +IDL++ + + ++ + A S +PFGL AP
Sbjct: 196 IEGILDKLGRAQYFSTIDLAKGFHQILVQEQDREKTAFSTPHGHYEFNRMPFGLKNAPAT 255
Query: 595 FASLSNWVASLLRSRGMRVVVYLDDFLL 622
F L N V L VVYLDD L+
Sbjct: 256 FQRLINTV--LKEEINKICVVYLDDVLI 281
>gi|189236284|ref|XP_001815208.1| PREDICTED: similar to orf [Tribolium castaneum]
gi|270005484|gb|EFA01932.1| hypothetical protein TcasGA2_TC007546 [Tribolium castaneum]
Length = 1475
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 638 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 692
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 693 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 752
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 753 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCCFF 810
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 811 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 861
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 862 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 916
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 917 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 976
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 977 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 1028
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 1029 DALSRN 1034
>gi|266827|sp|Q00962.1|POL_CAMVN RecName: Full=Enzymatic polyprotein; Includes: RecName:
Full=Aspartic protease; Includes: RecName:
Full=Endonuclease; Includes: RecName: Full=Reverse
transcriptase
gi|331571|gb|AAA46358.1| reverse transcriptase [Cauliflower mosaic virus]
gi|445598|prf||1909346E reverse transcriptase
Length = 680
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 102/435 (23%), Positives = 171/435 (39%), Gaps = 51/435 (11%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLV----PKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
I+E+L+ V+K S + ++ FLV G G R V+N K +N+ ++L N
Sbjct: 267 IKELLDLKVIK--PSKSPHMAPAFLVNNEAENGRGNKRMVVNYKAMNKATVGDAYNLPNK 324
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ + ++ S D ++ V + + A + +PFGL AP F
Sbjct: 325 DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 384
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLV--NQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
+ + R VY+DD ++ N++ +L + + + G I++ +K+
Sbjct: 385 QRHMDEAFRVFRK---FCCVYVDDIVVFSNNEEDHLLHVA--MILQKCNQHGIILSKKKA 439
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
L + FLG+ D + P+ L N L K + LG L++A
Sbjct: 440 QLFKKKI-NFLGLEIDEGTHK---PQGHILEHINKFPDTLEDKK----QLQRFLGILTYA 491
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNAL-PLSSPIFPRQVQH 772
S IP L R QA L T + + K++ L PL P+ ++
Sbjct: 492 SDYIP--NLAQMRQPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKL-- 547
Query: 773 FISTDASDLGWG-------------SQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSL 819
I TDASD WG +++ + SG + ++N+H N KE AV +
Sbjct: 548 IIETDASDDYWGGMLKAIKINEGTNTELICRYRSGSFKAAERNYHSNDKETLAVINTIKK 607
Query: 820 NLPLLQSSVVMVQSDNQTVVSY--LRRQGGTKSLSLLSEVEKIFLLSQDWRIH--ILAQF 875
L ++++DN S+ L +G +K + Q W H +
Sbjct: 608 FSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKLGRNIR--------WQAWLSHYSFDVEH 659
Query: 876 IPGAYNSVADSLSRS 890
I G N AD LSR
Sbjct: 660 IKGTDNHFADFLSRE 674
>gi|403167840|ref|XP_003327591.2| hypothetical protein PGTG_09125 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375167222|gb|EFP83172.2| hypothetical protein PGTG_09125 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1375
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 112/522 (21%), Positives = 212/522 (40%), Gaps = 70/522 (13%)
Query: 425 RLRRFVDAWIRLGAPAPLVR----IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHI 480
R V+ W R A L + ++ G+ F P L TP + +L
Sbjct: 487 RCEMDVEEWERSLEKAGLAQEFSDVIRGFKEGFDQGIPNHNLGPATPYFTPPNHQSALLA 546
Query: 481 QEMLETGVLKRLDSTTGF-------LSRLF---------LVPKGNGGTRPVLNLKG-LNQ 523
Q+ +E + K +++ F L + F G+G RP+ +L N
Sbjct: 547 QDKIEQSMRKEVEAGRMFGPYTHEQLMKKFSFFRTNPLGAAVNGDGSIRPINDLSFPRNN 606
Query: 524 FLSPKKFSLINHF----------RIPSFL--QKGDYMISI-DLSQAYFHVPIKTTHQRFL 570
L+P S ++ + F Q G ++++ D +AY +P + +L
Sbjct: 607 PLTPSVNSFVDKLDYATTWDDFENVSKFFKRQTGPLLLALFDWEKAYRQIPTAKSQWAYL 666
Query: 571 AL-SYNGDVLAMTCLPFGLATAPQAFASLSN-WVASLLRSRGMRVVV-YLDDFLLVNQDP 627
+ +NG +L T + FG +F ++ W +L + V ++DD L V
Sbjct: 667 MVRDFNGGILIDTRIAFGGVAGCGSFGRPADAWKQLMLHEFDLVTVFRWVDDNLFVKHPD 726
Query: 628 RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQ-FLGIMWDPHLDRMWLPEDKQLT-L 685
+E+ +A S SLG V + SP Q ++G +W+ + LP+DK+ +
Sbjct: 727 SKVEMDHIVARS--ESLG--VKTNSTKYSPFKEEQKYIGFIWNATRKSVRLPDDKKYQRV 782
Query: 686 GNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPIN 745
I L ++ + G L+ S+++P R + + R + + L P+
Sbjct: 783 QQIKEFLQIGSEFSFKQVEVMAGRLNHVSYLLPQLRCYLNSLYRWMNTWVHRSKDL-PLP 841
Query: 746 PAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGL-----WSREQ 800
P L+ WL L + ++ + D ++GW +S+ G+ W++ Q
Sbjct: 842 PGARVDLQEWLTTL-----LSFKETRMIRDPDPIEIGWMGDASTSYGIGITIGRRWAQFQ 896
Query: 801 --QNWHI--NKKEMFAVHQALSLNLPLL--------QSSVVMVQSDNQTVVS-YLRRQGG 847
++W K A + +++ L L+ + ++V +DN T S L+R+
Sbjct: 897 LTKDWDKGPEPKRDIAWLETVAIRLGLIALAQLSVKRGKTIIVWTDNTTTESAILKRK-- 954
Query: 848 TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+K ++ E + I L + + I+++ + N VAD+LSR
Sbjct: 955 SKHQAVNDEWKIIQRLLIEMELDIVSRRVSSGDN-VADALSR 995
>gi|32489310|emb|CAE03706.1| OSJNBa0060B20.14 [Oryza sativa Japonica Group]
Length = 3200
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 97/430 (22%), Positives = 171/430 (39%), Gaps = 57/430 (13%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
++EML+ G+++ S++ F S LV K +G R ++ + LN + + +
Sbjct: 1982 QVKEMLQNGIIQH--SSSPFSSPALLVKKKDGSWRVCIDYRQLNAITKKGTYPMPIIDEL 2039
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF-AS 597
L +DL Y + + + A + + FGL AP F +
Sbjct: 2040 LDELAGAKIFSKLDLRAGYHQIRMAEGEEFKTAFQTHSGHYEYKVMSFGLTGAPATFQGA 2099
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
+++ + LLR + V+ DD L+ + D K + +L + W V L K +
Sbjct: 2100 MNDTLRPLLRKCAL---VFFDDILIYSPDMNSHLDHLKQVLQLLDTHQWKVKLSKCDFAQ 2156
Query: 658 APV------LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS 711
+ + G+ DP + +I+ + + L L GY
Sbjct: 2157 TQISYLGHIISGQGVSTDPS------------KIQSIVDWAVPTTLKKLRGFLGLAGY-- 2202
Query: 712 FASFVIPMGRLHSRRIQRQASLLRLGAPHL--TPINPAVLPKLEWWLNALP-LSSPIFPR 768
+ FV G L Q LL+ AP + +N A L+ L + P L+ P F
Sbjct: 2203 YRKFVKDFGTLSKPLTQ----LLKKDAPFVWSAEVNQA-FQALKHALTSTPVLALPNF-- 2255
Query: 769 QVQHFISTDASDLGWGS-----QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPL 823
Q I TDASD+G G+ Q +F+S Q +KE A+ A+ P
Sbjct: 2256 QQGFTIETDASDIGIGAVLSQNQHPVAFVSKALGPRTQGLSTYEKECLAIMMAVDHWRPY 2315
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGT----KSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
LQ ++ +D+ +++ ++ T K+ + LS ++ I+ + G
Sbjct: 2316 LQFQEFLIITDHHSLMHLTEQRLHTPWQQKAFTKLSGLQ----------FQIV--YRKGK 2363
Query: 880 YNSVADSLSR 889
+N+ AD+LSR
Sbjct: 2364 HNAAADALSR 2373
>gi|38346970|emb|CAD39728.2| OSJNBb0049I21.6 [Oryza sativa Japonica Group]
Length = 1606
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 82/185 (44%), Gaps = 13/185 (7%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR 501
++ ++ G A P S +P +P+ L+ L I+E+ E G + S++ + +
Sbjct: 466 IIDLIPGTA-PISKRPYRMPVNELEELKK--------QIRELQEKGFV--CPSSSPWGAP 514
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ V K +G R ++ + LN+ K+ L + L+ IDL Y +
Sbjct: 515 VLFVKKKDGSMRMCVDYRSLNEVTIKNKYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLK 574
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
I+T A S + T + FGL AP F +L N V + VVV++DD L
Sbjct: 575 IRTGDIPKTAFSTRYGLYEFTVMSFGLTNAPAYFMNLMNKV--FMDYLDKFVVVFIDDIL 632
Query: 622 LVNQD 626
+ ++D
Sbjct: 633 IYSKD 637
>gi|227438239|gb|ACP30609.1| disease resistance protein [Brassica rapa subsp. pekinensis]
Length = 2726
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 103/432 (23%), Positives = 172/432 (39%), Gaps = 54/432 (12%)
Query: 475 AMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLIN 534
M + EMLE G+++ +ST+ F S + LV K +G R ++ + LN+ P KF +
Sbjct: 1801 VMEQMVCEMLEAGIIR--ESTSPFSSPVLLVKKKDGSWRFCIDYRALNKATIPDKFPIPV 1858
Query: 535 HFRIPSFLQKGDYMISIDLSQAYFHVPIKTT---HQRFLALSYNGDVLAMTCLPFGLATA 591
++ L +DL Y + ++ F + + + L M PFGL A
Sbjct: 1859 IDQLLDELYGASVFSKLDLRSGYHQIRMQEEDIPKTAFRTVEGHYEFLVM---PFGLTNA 1915
Query: 592 PQAFASLSNWVAS-LLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNL 650
P F +L N + LR V+V+ DD L+ ++ +L +S+L + N
Sbjct: 1916 PATFQALMNSIFKPYLRK---FVLVFFDDVLIYSKTVEEHAEHLRLVLSVLQEHKLLANR 1972
Query: 651 QKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGY 709
+K S + ++LG H+ K + ++T K W L S + L G+
Sbjct: 1973 KKCSFGLQQI-EYLG-----HII------SKNGVATDAIKT-QCMKEWPLPKSVKQLRGF 2019
Query: 710 LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPR 768
L + + + + LL+ + L+ + P L+ P F +
Sbjct: 2020 LGLTGYYRHYVKGYGSIARPLTELLKKDGFQWSKEAELAFDSLKKAMVEAPVLALPNFEK 2079
Query: 769 QVQHFISTDASDLGWGSQVDSS------FLSGLWSREQQNWHINKKEMFAVHQALSLNLP 822
I +DAS G G+ + F GL REQ ++E+ AV A+
Sbjct: 2080 PF--VIESDASGFGVGAVLMQDGKPIAFFSHGLTEREQLK-PAYERELMAVVLAVQKWKH 2136
Query: 823 LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILA-QFI----P 877
L +V +D+ +SL L E +++ + W +L FI P
Sbjct: 2137 YLLGRQFVVHTDH-------------RSLKYLLEQKEVNMEYHRWLTKLLGFDFIIVYRP 2183
Query: 878 GAYNSVADSLSR 889
G N AD LSR
Sbjct: 2184 GCDNKAADGLSR 2195
>gi|341902993|gb|EGT58928.1| hypothetical protein CAEBREN_19301 [Caenorhabditis brenneri]
Length = 2472
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 93/461 (20%), Positives = 186/461 (40%), Gaps = 74/461 (16%)
Query: 460 VPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNL 518
+P + + + I EML+ +++ +S F + + LV K + + R ++
Sbjct: 1610 IPQARIHRIPLEKRKEVETQISEMLKQEIIRPTESP--FAAPIVLVRKADKTSWRFTVDF 1667
Query: 519 KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDV 578
+ LN +P + + N I + ++D Q + +P++ H A + +
Sbjct: 1668 RALNAMTTPVQSVIPNIHEILDLCAGKAFYTTLDFQQGFHQIPVEPAHCPRTAFACHMGA 1727
Query: 579 LAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAV 638
+P GL +P F + N SL++ R+ VY+DD ++ ++D A+
Sbjct: 1728 FEYIRMPMGLKGSPGTFQRVMN---SLIKEIRARIFVYIDDMVITSED----------AI 1774
Query: 639 SILGSLGWIVN-LQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKT 697
L + +++ ++KS G+ P + LPE + LG I+ A
Sbjct: 1775 QHLKDIEEVLDQIEKS-----------GMKLRPEKCKFALPE--IIYLGFIISK--AGIR 1819
Query: 698 WNLDSARSLLGY-----LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKL 752
N + R++ Y + I M + R I A+ ++ AP + +
Sbjct: 1820 PNPEKTRAIDEYPTPRTVKEVRAFIGMCSFYRRFI---ANFSKIAAPIMDLTKKEKV--F 1874
Query: 753 EWW---------LNALPLSSPIF--PRQVQHF-ISTDASDLGWG-----SQVDS------ 789
EW L +PI P+ + F I D+S G G +Q D
Sbjct: 1875 EWTKECQEAMEILKEALTKNPILVAPQLGKPFIIEVDSSGRGVGAVLFQAQDDEGKDKRV 1934
Query: 790 -SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGT 848
++ S +++ ++ + + E + A+ P + + ++ +D+ + S L R+
Sbjct: 1935 IAYASRVYTGAEKRYPAIELEALGLTYAVKQFRPYIDGAKTLIITDHSPLKSLLYRK--- 1991
Query: 849 KSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
L+ + K ++ Q++ I I ++ PG N V D+LSR
Sbjct: 1992 ---DLMGRMGKYQIVLQEYDIKI--EYRPGKQNIVCDTLSR 2027
>gi|270012874|gb|EFA09322.1| hypothetical protein TcasGA2_TC001648 [Tribolium castaneum]
Length = 1388
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 62/143 (43%), Gaps = 4/143 (2%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ EML G++ DS + + S + LV K +G R ++ + LN + +
Sbjct: 423 VDEMLSAGIIS--DSNSEYSSPVLLVKKKDGSNRLCIDYRRLNAITVKEYVPMQIIDEQL 480
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L Y ++DL+ Y VP+ + + +PFGL AP F L
Sbjct: 481 DLLSGNGYFTTLDLASGYMQVPVAKESRHLTSFVTTTGQYEFNRMPFGLVNAPSVFNRLM 540
Query: 600 NWVASLLRSRGMRVVVYLDDFLL 622
N V + RG+ V +Y+DD L+
Sbjct: 541 NMVTRKI-GRGV-VTIYMDDILI 561
>gi|328699183|ref|XP_003240855.1| PREDICTED: hypothetical protein LOC100569596 [Acyrthosiphon pisum]
Length = 1663
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 103/444 (23%), Positives = 176/444 (39%), Gaps = 43/444 (9%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
++M + G+ + S++ + S L LVPK +G RP + + LN P ++ L
Sbjct: 449 FEQMQKQGICR--PSSSAWASPLLLVPKKDGSFRPCGDYRRLNSVTVPDRYPLPYLHDFT 506
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ L +DL +AY VPI A++ + + FGL A Q F L
Sbjct: 507 ANLAGKTVFTKLDLVRAYNQVPIAAGDVHKTAVTTPFGLFEFPVMCFGLCNAAQTFQRLV 566
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
N V + L V Y+DD L+ + + + + G +N K + A
Sbjct: 567 NTVLAGLDF----VFAYVDDVLIASTNAEQHVEHVRAVLGRFEEFGIAINPAKCVFA-AS 621
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS-FVIP 718
L FLG + D R P + +I+R T + LG L+F FV
Sbjct: 622 TLTFLGHVVDAQGLR---PNPDSV---DIIRRWPQPNTKK--ELQRFLGSLNFYHRFVHG 673
Query: 719 MGRLHSRRIQRQASLLRLGAPH--LTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIST 776
+ + A++ + P A L E + + L P R ++T
Sbjct: 674 AANVQAPLYDISAAIKKKDGPLAWTDAARKAFLVCREALVTSALLVHP--QRDAPLRLTT 731
Query: 777 DASDLGWGSQVDSS---------FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS 827
DAS++ G+ ++ S F S S Q + +E+ A + A + ++
Sbjct: 732 DASNIAVGAVLEQSVDNEWQPLGFFSRKLSGAQTRYSAYDRELLAAYLAARHFVHAIEGR 791
Query: 828 VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSL 887
V +++D++ ++ ++ Q K + + + LSQ + + + G N V D+L
Sbjct: 792 FVTLRTDHRPLL-FMFSQKAEKLID--RQARHVAFLSQYFH---EVEHVSGELNIVPDAL 845
Query: 888 SR------SKSLPDWHLSRSATEQ 905
SR LPD L + ATEQ
Sbjct: 846 SRLELAPLDNGLPD--LDQWATEQ 867
>gi|391331165|ref|XP_003740021.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1429
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 92/205 (44%), Gaps = 13/205 (6%)
Query: 469 ATPVSSAMSLHIQEMLETGVLKRLDSTTG---FLSRLFLVPKGNGGTRPVLNLK-GLNQF 524
A PV+ A+ I + +E V + + T + + + +V K +G R + GLN
Sbjct: 334 ARPVAYALLPKIVDEIERLVSEDVLEPTAHSKYAAPVVIVQKKDGTIRLCADYSTGLNNS 393
Query: 525 LSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
+ + L I + L G Y +DL++AY +P+++ Q L ++ + M L
Sbjct: 394 IEDDAYPLPTAESIFAKLNGGRYFSQLDLAEAYLQIPVESQSQELLTINTAKGLFKMKRL 453
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI--LG 642
+G+ TAP F L + + + L YLDD L+ + I E +G+LA L
Sbjct: 454 AYGVKTAPSLFQRLMDTITNDLPG----TTAYLDDILVTSST--IEEHEGRLAKVFQRLQ 507
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIM 667
G + +K S V +FLG +
Sbjct: 508 ENGLRIREEKCSFLRTEV-KFLGFI 531
>gi|359367498|gb|AEV42076.1| putative polyprotein [Pineapple bacilliform comosus virus]
Length = 1826
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 110/463 (23%), Positives = 182/463 (39%), Gaps = 71/463 (15%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN-----------GGTR 513
L+H+ + M H+ ++LE V++ ST+ + +V G G R
Sbjct: 1270 LKHVTPALKEVMQKHVDKLLELKVIR--PSTSRHQTTAMIVYSGTEVDPVTKKEKRGKER 1327
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQK-GDYMI--SIDLSQAYFHVPIKTTHQRFL 570
V N K LN ++SL I + LQK G I DL + V + +
Sbjct: 1328 LVFNYKRLNDNTETDQYSLPG---ISTILQKIGHSKIYSKFDLKSGFHQVAMHPDSVPWT 1384
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRIL 630
A + +PFGL AP F + R + VY+DD L+ ++ P
Sbjct: 1385 AFWVINGLYEWLVMPFGLKNAPAVFQRKMD---HCFRGTEDFIAVYIDDILVFSETPEQH 1441
Query: 631 EIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLG-------IMWDPHLDRMWLPEDKQL 683
+ ++ + I+ G +++ K + V FLG I PH+ + K +
Sbjct: 1442 KKHLEVFLQIVRKNGLVLSPTKMKVGVQQV-DFLGATIGNSRIRLQPHIIQ------KVV 1494
Query: 684 TLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLT 742
N + L +K RS LG L++A IP MG+L + + G +
Sbjct: 1495 QFNN--KDLQTTK-----GLRSFLGILNYARSYIPQMGKLLGPLYSKVSP---TGEKRMN 1544
Query: 743 PINPAVLPKLEWWLNALP-LSSPIFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQ 801
+ A++ K++ + LP L P P I TD GWG F ++E+
Sbjct: 1545 KQDWAIIEKIKQMVEQLPELELP--PNGSVIVIETDGCMEGWGGICKWKFPGAPRNQEKV 1602
Query: 802 NWHINKK----------EMFAVHQALS-LNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKS 850
+ + + E+ AV +L + L ++V++D Q +VS+ + K
Sbjct: 1603 CAYASGRFQPIKSTIDAEIQAVINSLDKFKIYYLNQKELVVRTDCQAIVSFYEKMANNKP 1662
Query: 851 LSLLSEVEKIFLLSQDWRIHILA----QFIPGAYNSVADSLSR 889
S V +L D+ I A + I G N +AD+LSR
Sbjct: 1663 ----SRVR--WLTFSDFITGIGAPVKFEHIDGKDNLLADTLSR 1699
>gi|291240975|ref|XP_002740391.1| PREDICTED: putative reverse-transcriptase-like protein-like
[Saccoglossus kowalevskii]
Length = 1408
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/243 (23%), Positives = 108/243 (44%), Gaps = 31/243 (12%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
HI+++L+ G++K+ S + + S + LV K NG R ++ + LNQ P ++++
Sbjct: 227 HIRDLLDAGIIKK--SRSQYASPIVLVRKKNGTLRLCVDYRTLNQRTIPDQYTVPRIQDA 284
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
L + +DL Y+ +P+ + A +P G+ AP F L
Sbjct: 285 LDCLNGCTWFHVLDLKSGYYQIPMDEADKEKTAFICPLGFYQFERMPQGITGAPATFQRL 344
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQ-----DPRILEIQGKLAVSILGSLGWIVNLQKS 653
S + + V+VYLDD ++ + + R++++ +LA G ++ +K
Sbjct: 345 MEKCMSDMHL--LDVLVYLDDLIVFGRTLGEAEDRLMKVLDRLA-----EYGLKLSAEKC 397
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGY 709
L V ++LG H+ + ED + + L KTW N+ +SLLG+
Sbjct: 398 QLFQKSV-KYLG-----HI----VGEDGVKPDPDKIEVL---KTWPTPSNIRELKSLLGF 444
Query: 710 LSF 712
L +
Sbjct: 445 LGY 447
>gi|242760779|ref|XP_002340058.1| retrotransposon polyprotein, putative [Talaromyces stipitatus ATCC
10500]
gi|242776034|ref|XP_002478760.1| retrotransposon polyprotein, putative [Talaromyces stipitatus ATCC
10500]
gi|242796991|ref|XP_002482919.1| gag/polymerase/env polyprotein, putative [Talaromyces stipitatus ATCC
10500]
gi|218719507|gb|EED18927.1| gag/polymerase/env polyprotein, putative [Talaromyces stipitatus ATCC
10500]
gi|218722379|gb|EED21797.1| retrotransposon polyprotein, putative [Talaromyces stipitatus ATCC
10500]
gi|218723254|gb|EED22671.1| retrotransposon polyprotein, putative [Talaromyces stipitatus ATCC
10500]
Length = 1732
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 97/459 (21%), Positives = 172/459 (37%), Gaps = 48/459 (10%)
Query: 474 SAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLI 533
A +I E L G + + S F S + + K GG R ++ + LN ++ L
Sbjct: 780 EAAREYILENLHKGFI--VPSNAPFASPILMAKKPGGGLRFCVDFRKLNSITRKDRYPLP 837
Query: 534 NHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+ L + +D+ Q + + + + +PFG+ P
Sbjct: 838 LIDEVFERLSRAKVFTKLDIRQGFHRIRMHPDSEDLTTFRCRYGTYKYKVMPFGVTNGPA 897
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
F L N + + +V ++DD L+ + + E+ + + L + G + K
Sbjct: 898 TFQRLINDI--FMDCLDKFLVAFVDDLLIYSDNELEHELHVRQVLQRLRNAGLQAAIHKC 955
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
++LG + H E + ILR + + + + S +L F
Sbjct: 956 EFHVTKT-RYLGFIVTEHG-----IEVDPSKIEAILRWGVPTTVFGIQS------FLGFC 1003
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPLSS---PIFPRQ 769
+F + +SR + L P T KL+ L+ P+ S P P +
Sbjct: 1004 NFYRRFIKDYSRIAKPLYRLTHNNVPFEWTKNCQEAFDKLKLCLSTAPVLSHYQPNLPTR 1063
Query: 770 VQHFISTDASDLGWGSQVDSSFLSGLW------SR----EQQNWHINKKEMFAVHQALSL 819
V+ TDASD + GLW SR ++N+ I+ KEM A+ +AL
Sbjct: 1064 VE----TDASDGVIAGILSQLHEEGLWHPVAYFSRTMTPSERNYDIHDKEMLAIVRALEE 1119
Query: 820 NLP----LLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQF 875
P L + + SD++ + ++ TK L+ FL + + +F
Sbjct: 1120 WRPELVGLQREDRFEILSDHRALEYFM----TTKKLNARQARWCEFLTD----YYFVLRF 1171
Query: 876 IPGAYNSVADSLSRSKSLP--DWHLSRSATEQIFLKWGV 912
PG N AD+L+R P + + R+ + FL V
Sbjct: 1172 RPGKANVAADTLTRRDGAPKDEGYRERTILTEDFLDSAV 1210
>gi|427780451|gb|JAA55677.1| Putative tick transposon [Rhipicephalus pulchellus]
Length = 1151
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 82/374 (21%), Positives = 151/374 (40%), Gaps = 35/374 (9%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ +ML+ V++ +S + + + + LV K +G R ++ + LN + L
Sbjct: 319 VCDMLKKNVIQ--ESCSPWAAPVILVKKKDGSWRFCVDYRRLNAITKKDVYPLPRIDDAI 376
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L Y S+DL Y+ +P+ + A + +PFGL AP A+
Sbjct: 377 DCLSSASYFSSVDLRSGYWQIPMHKEDKEKTAFVTPDGLFEFNVMPFGLCNAP---ATFE 433
Query: 600 NWVASLLRSRGMRV-VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
++ ++LR V + YLDD ++ + R + L + L ++N +K
Sbjct: 434 RFMDNILRGLKWEVCMCYLDDVVIFGRTFREHNTRLDLVLDCLSKARLVLNSKKCHFGER 493
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
L LG + D R + + N R+ + RS LG S+ IP
Sbjct: 494 QTL-VLGHLVDKEGIRPDPAKTAAVEAFNQPRS--------VKELRSFLGLCSYFRRFIP 544
Query: 719 MGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLEWWLNALPL---SSPIFPRQVQHFI 774
+ LL+ G TP + +L++ L + P+ P+ P +V
Sbjct: 545 G---FANIAHPLTCLLQKGVRFEFTPECESAFCELKFRLTSHPILRHFDPMAPTEVH--- 598
Query: 775 STDASDLGWGS----QVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ 825
+DAS +G G+ +VDS ++ S S ++N+ + ++E AV A+ L
Sbjct: 599 -SDASAVGVGAVLVQRVDSKEHVVAYASRSLSNSERNYTVTEQECLAVVFAVQRFRSYLY 657
Query: 826 SSVVMVQSDNQTVV 839
V +D+ ++
Sbjct: 658 GRSFTVVTDHHSLC 671
>gi|242075522|ref|XP_002447697.1| hypothetical protein SORBIDRAFT_06g013680 [Sorghum bicolor]
gi|241938880|gb|EES12025.1| hypothetical protein SORBIDRAFT_06g013680 [Sorghum bicolor]
Length = 1360
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 108/478 (22%), Positives = 196/478 (41%), Gaps = 79/478 (16%)
Query: 446 VSGYA---IPFSAKP-----PLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTG 497
+SG++ IP A+P L+ +C + I E+L+ G++++ S+
Sbjct: 912 ISGFSEDKIPTKARPIQMNSRLLEICKSE-------------INELLKKGLIRK--SSAP 956
Query: 498 FLSRLFLVP----KGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+ F V K G R V+N K LN+ L ++ + + +Q D+
Sbjct: 957 WSCAAFYVENAAEKERGVPRLVINYKPLNKVLQWIRYPIPYKHDLIRRIQGSQIYSKFDM 1016
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRV 613
++ + IK + A + + FGL AP F + N + +
Sbjct: 1017 KSGFWQIQIKEEDRYKTAFTTPFGHYEWNVMTFGLKNAPSEFQKIMN---EIFLPYTSFI 1073
Query: 614 VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLD 673
+VY+DD L+ +Q+ + I+ G +++ +K L + QFLG + D
Sbjct: 1074 IVYIDDVLIFSQNIDQHWKHLNIFHKIIIQNGLVISARKMKLFQTNI-QFLG--YKIQHD 1130
Query: 674 RMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFAS-FVIPMGRLHSRRIQRQAS 732
++ LP + + + + K + LG L++ S F + + R+I +
Sbjct: 1131 QV-LPVTRVIEFADKFPDEIKEKK----QLQRFLGCLNYVSDFYERLAK--DRKILTER- 1182
Query: 733 LLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG----SQVD 788
L+ P T + + +++ + LP I I TDASD G+G + D
Sbjct: 1183 -LKKNPPAWTTRHTQAVKQIKDKVKRLPCLY-ILDHDALKIIETDASDFGYGGILKQRKD 1240
Query: 789 SS-----FLSGLWSREQQNWHINKKEMFAVHQALS------LNLPLL-----QSSVVMVQ 832
S F SG W+ Q+N+ KKE+ A+ + +S LN L +++ ++Q
Sbjct: 1241 SKEQLVWFASGTWNDAQRNYSTIKKEILAIVKIVSKFQGELLNQKFLLRIDCKAAKDVLQ 1300
Query: 833 SDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
D + +VS +Q + ++LS D+ I + I G NS+ D LSR
Sbjct: 1301 QDVENLVS---KQIFARWQAILS--------CFDFDI----EHIKGEVNSLPDFLSRE 1343
>gi|38346427|emb|CAD40214.2| OSJNBa0019J05.12 [Oryza sativa Japonica Group]
Length = 1817
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 98/425 (23%), Positives = 170/425 (40%), Gaps = 47/425 (11%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
I +ML++GV++ S + F S LV K +G R V++ + LN + + +
Sbjct: 816 QISDMLKSGVIQ--PSHSAFSSPALLVKKKDGTWRLVIDYRKLNAITVKGTYPMPVIDEL 873
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF-AS 597
L + +DL Y + + + A + T + FGL AP F +
Sbjct: 874 LDELAHAKWFTKLDLRAGYHQIRMAPGEEYKTAFQTHTGHYEYTVMSFGLTGAPATFQGA 933
Query: 598 LSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSP 657
++ ++ +LR + V+ DD L+ + D +S+L W V L K S +
Sbjct: 934 MNETLSPVLRKFAL---VFFDDILIYSPDFSSHLDHIAQVLSLLSKHQWYVKLSKCSFAQ 990
Query: 658 APVLQFLGIMWDP---HLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFA 713
L +LG H D+ G I + W + + + L G+L A
Sbjct: 991 KQ-LTYLGHTISAAGVHTDQ-----------GKIQEVV----NWKVPTTVKKLRGFLGLA 1034
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALP-LSSPIFPRQVQ 771
+ + + +LLR G P + TP A L+ L + P L+ P F Q
Sbjct: 1035 GYYRKFVKGFGVISKPLTNLLRKGVPFIWTPETDAAFHNLKQALVSAPVLALPDF--QKV 1092
Query: 772 HFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQS 826
+ TDASD G G+ + +++S + +KE AV A+ LQ
Sbjct: 1093 FTVETDASDSGIGAVLSQDGHPVAYISKALGPRTKGLSTYEKECMAVLLAVDQWRSYLQL 1152
Query: 827 SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF--LLSQDWRIHILAQFIPGAYNSVA 884
++ +D+ +++ ++ T K F LL ++I + G++N A
Sbjct: 1153 GDFIILTDHHSLMHLSDQRLHTPWQ------HKAFTKLLGLSYKI----CYRRGSHNGAA 1202
Query: 885 DSLSR 889
D+LSR
Sbjct: 1203 DALSR 1207
>gi|9627249|ref|NP_041734.1| polyprotein [Cacao swollen shoot virus]
gi|347871|gb|AAA03171.1| polyprotein [Cacao swollen shoot virus]
Length = 1834
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 90/454 (19%), Positives = 181/454 (39%), Gaps = 52/454 (11%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
++HL + HI+ +L+ GV++ S + + F+V G +G R
Sbjct: 1286 IKHLTPAMEKQFQKHIKALLDIGVIR--PSKSKHRTTAFIVESGTVIDPVTKKTIHGKER 1343
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1344 LVFNYKRLNDNTEKDQYSLPGIQTILKRVGNKKVFSKFDLKSGFHQVAMAEESIPWTAFW 1403
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + + VY+DD L+ +++
Sbjct: 1404 VPQGLYEWLVMPFGLKNAPAVFQRKMD---QCFKGTEEFIAVYIDDILVFSENMAEHTKH 1460
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
+ + I G +++ K L+ + +FLG + + + + ++++ ++
Sbjct: 1461 IGIMLKICQENGLVLSPSKICLAQREI-EFLGTV---------ISQGQMKLQAHVIKKIV 1510
Query: 694 ASKTWNLDSA---RSLLGYLSFASFVIP-MGRLHSRRIQRQASLLRLGAPHLTPINPAVL 749
L++ RS LG L++A IP +GR S + + G + ++
Sbjct: 1511 NKANIELETTKGLRSFLGLLNYARIYIPNLGRKLSPLYAKTSP---TGEKRFNRQDWHLI 1567
Query: 750 PKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG-------SQVDSSFLSGLWSREQQN 802
+++ + LP + I P + I +D GWG ++ DS + +
Sbjct: 1568 KEIKDMVQKLP-NLAIPPARCYIIIESDGCMEGWGAVCKWKLAKEDSRTTEKICAYASGK 1626
Query: 803 WHINKK----EMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK--SLSLLS 855
+ + K E++A+ +AL S + L ++V++D Q +V++ + K + ++
Sbjct: 1627 FGVVKSTIDAEIYALIKALESFKIFYLDKKHLVVRTDCQAIVTFYNKTSTHKPSRIRWIT 1686
Query: 856 EVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ I L + + + I G N +AD+LSR
Sbjct: 1687 FSDYITGLG----VPVTIEHIDGKENQLADTLSR 1716
>gi|56266239|emb|CAE76628.1| polyprotein [Cacao swollen shoot virus]
Length = 1847
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 90/453 (19%), Positives = 178/453 (39%), Gaps = 50/453 (11%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
++HL + H++ +L+ GV++ S + + F+V G +G R
Sbjct: 1293 IKHLTPAMEKQFQKHVKALLDIGVIR--PSKSKHRATAFIVESGTVIDPVTKKTIHGKER 1350
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1351 MVFNYKRLNDNTEKDQYSLPGIQTILKRVGNKKIFSKFDLKSGFHQVAMAEESIPWTAFW 1410
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + + VY+DD L+ ++
Sbjct: 1411 VPQGLYEWLVMPFGLKNAPAVFQRKMD---QCFKGTEEFIAVYIDDILVFSETMAEHTKH 1467
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
+ ++I G +++ K L+ + +FLG + + + + ++++ ++
Sbjct: 1468 IGIMLTICQENGLVLSPNKICLAQREI-EFLGTI---------ISQGQMKLQPHVIKKIV 1517
Query: 694 ASKTWNLDSA---RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
L++ RS LG L++A IP L + A G + ++
Sbjct: 1518 NKADMELETTRGLRSFLGLLNYARIYIP--NLGKKLSPLYAKTSPTGEKKFNRQDWHLIK 1575
Query: 751 KLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG-------SQVDSSFLSGLWSREQQNW 803
+++ + LP + I P + I +D GWG ++ DS + + +
Sbjct: 1576 EIKDMVQKLP-NLAIPPARCCIIIESDGCMEGWGAVCKWKLAKEDSRTTEKICAYASGKF 1634
Query: 804 HINKK----EMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK--SLSLLSE 856
I K E+FA+ +AL S + L ++V++D Q +V++ + K + ++
Sbjct: 1635 GIIKSTIDAEIFALIKALESFKIFYLDKKHLVVRTDCQAIVTFYNKTSTHKPSRIRWITF 1694
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ I L + + + I G N +AD+LSR
Sbjct: 1695 SDYITGLG----VQVTIEHINGKENQLADTLSR 1723
>gi|147864892|emb|CAN79373.1| hypothetical protein VITISV_028502 [Vitis vinifera]
Length = 1439
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 104/428 (24%), Positives = 165/428 (38%), Gaps = 49/428 (11%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
++E+L+ G+++ S + + + K +G R ++ + LN+ K+ + +
Sbjct: 466 QLKELLDAGLIQ--PSRAPYGAPVLFQKKHDGSLRMCVDYRALNKVTIKNKYPIPLAAEL 523
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
L K Y +DL Y+ V + + +PFGL A F +L
Sbjct: 524 FDRLSKASYFTKLDLRSGYWQVRVAAGDEGKTTCVTRYGSYEFLVMPFGLTNALATFCNL 583
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPA 658
N V L VVVYLDD ++ ++ E +L L V +K +
Sbjct: 584 MNDV--LFDYLDAFVVVYLDDIVVYSKTLTEQEKHLRLVFQRLRENRLYVKPEKCEFAQE 641
Query: 659 PVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYL-----SFA 713
+ FLG L RM DK + I+ + SK L S L Y ++
Sbjct: 642 EIT-FLGHKISAGLIRM----DKG-KVHAIMEWIAPSKVTELRSFLGLANYYRRFIKGYS 695
Query: 714 SFVIPMGRLHSRRIQ----RQASLLRLGAPHLTPINPAV-LPKLEWWLNALPLSSPIFPR 768
V P+ L + Q RQ + P + LP L+ P
Sbjct: 696 KTVSPLTDLLKKDNQWDWSRQCQMAFESLKEAMSTEPVLRLPDLD------------LPF 743
Query: 769 QVQHFISTDASDLGWGSQVDS-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPL 823
+VQ TDASD G + +F S + +Q + ++KEM AV L
Sbjct: 744 EVQ----TDASDRALGGVLVQEGHPVAFESRKLNNAEQRYSTHEKEMTAVVHCLRQWRHY 799
Query: 824 LQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSV 883
L S+ V +DN ++ + Q K LS + FL D+ L + PG +N+V
Sbjct: 800 LLGSIFTVVTDN-VANTFFKTQ---KKLSPRQARWQEFL--ADFNFEWLHR--PGRHNTV 851
Query: 884 ADSLSRSK 891
AD LSR +
Sbjct: 852 ADVLSRKE 859
>gi|391326386|ref|XP_003737698.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1509
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 76/169 (44%), Gaps = 10/169 (5%)
Query: 502 LFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHV 560
+ +V K +G R + GLN + + L I + L G Y +DL++AY +
Sbjct: 450 VVIVQKKDGTIRLCADYSTGLNNSIEDDAYPLPTAESIFAKLNGGRYFSQLDLAEAYLQI 509
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
P+++ Q L ++ + M L +G+ TAP F L + + + L YLDD
Sbjct: 510 PVESQSQELLTINTAKGLFKMKRLAYGVKTAPSLFQRLMDTITNDLPG----TTAYLDDI 565
Query: 621 LLVNQDPRILEIQGKLAVSI--LGSLGWIVNLQKSSLSPAPVLQFLGIM 667
L+ + I E +G+LA L G + +K S V +FLG +
Sbjct: 566 LVTSST--IEEHEGRLAKVFQRLQENGLRIREEKCSFLRTEV-KFLGFI 611
>gi|294899773|ref|XP_002776736.1| hypothetical protein Pmar_PMAR017604 [Perkinsus marinus ATCC 50983]
gi|239883937|gb|EER08552.1| hypothetical protein Pmar_PMAR017604 [Perkinsus marinus ATCC 50983]
Length = 1374
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 82/180 (45%), Gaps = 18/180 (10%)
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFL----- 570
L + G +FL K R P+ G MI ID + AY ++PI QR+
Sbjct: 364 LIIHGHKKFLHEK-----GKIRGPNEGDDGQVMIEIDCAAAYRNIPILQEEQRYCMNFIP 418
Query: 571 ALSYNGDVLAMTCLPFGLATAP----QAFASLSNWVASLLRS-RGMRVVVYLDDFLLVNQ 625
+ + G ++ T LPFGL+++ + +A L V LL G VY+DD + +
Sbjct: 419 SKTGKGTLICHTKLPFGLSSSGLQWVRVYAGLVQVVKRLLAYPHGEGAQVYIDDLVYITT 478
Query: 626 DPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTL 685
R + + + ++G ++ K +S PV+ LG W+ L + +P D+Q+ L
Sbjct: 479 R-RAALHRLLAILLLHAAVGINISYNKVRVSSTPVV--LGYEWNTELGTVAVPTDRQIRL 535
>gi|391339345|ref|XP_003744012.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1509
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 76/169 (44%), Gaps = 10/169 (5%)
Query: 502 LFLVPKGNGGTRPVLNLK-GLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHV 560
+ +V K +G R + GLN + + L I + L G Y +DL++AY +
Sbjct: 450 VVIVQKKDGTIRLCADYSTGLNNSIEDDAYPLPTAESIFAKLNGGRYFSQLDLAEAYLQI 509
Query: 561 PIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDF 620
P+++ Q L ++ + M L +G+ TAP F L + + + L YLDD
Sbjct: 510 PVESQSQELLTINTAKGLFKMKRLAYGVKTAPSLFQRLMDTITNDLPG----TTAYLDDI 565
Query: 621 LLVNQDPRILEIQGKLAVSI--LGSLGWIVNLQKSSLSPAPVLQFLGIM 667
L+ + I E +G+LA L G + +K S V +FLG +
Sbjct: 566 LVTSST--IEEHEGRLAKVFQRLQENGLRIREEKCSFLRTEV-KFLGFI 611
>gi|270015530|gb|EFA11978.1| hypothetical protein TcasGA2_TC001426 [Tribolium castaneum]
Length = 830
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/448 (19%), Positives = 190/448 (42%), Gaps = 46/448 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
++E+ ++KR++ +T +L+ LV K +G R L+ K LNQ + + + L + I
Sbjct: 374 LEELQRANIIKRVEGSTEWLNSYVLVKKADGSLRICLDPKYLNQAIINQSYKLPSTDEII 433
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
S L+ + ++D + ++++P+ + +PFG+ A + F
Sbjct: 434 SKLKDSKFFSTLDAANGFWNIPLDDESSKLCTFGTPFGRFRFLRMPFGIKIASEVFQE-- 491
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
+ + G V +Y+DD L+ + +I+ + + N +K L
Sbjct: 492 -YFYDIFSMEG--VEIYIDDILIHAKTKEEHDIKLEKVFQLARKHNIKFNAKKCLLGANE 548
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
V ++LG + + + E+K + N + S T N + LG +++ I
Sbjct: 549 V-KYLGYKFSDK--GVSIDEEKLEAIKN-----MPSPT-NKKEVQRFLGLITYVDRFIKN 599
Query: 720 --GRLHS-RRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI---FPRQVQHF 773
+ H R I ++ ++ G + L L + PI F +
Sbjct: 600 LSEKTHPLREIIKRENIFYWGEEQQKTFDE---------LKNLLANKPILQYFDANKEIT 650
Query: 774 ISTDASDLGWGSQV--DS---SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSV 828
+S DAS G G+ + D+ ++ S ++ QQ + +KE+ + ++ +
Sbjct: 651 LSVDASQKGLGAVLLQDNKPCAYASRAMTQTQQRYAQIEKELLVICFGVNKFYQYVFGKK 710
Query: 829 VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLS 888
V++D++ ++S ++ + ++++ L Q + I+++ + PG +AD LS
Sbjct: 711 FNVETDHKPLISIFKKPLN----DCPARLQRMLLSLQKFDINLI--YKPGKKLIIADHLS 764
Query: 889 RSKSLPDWHLSRSATEQIFLKWGVPCID 916
RS +LS+ T+ + L+ V I+
Sbjct: 765 RS------NLSKEFTDNLNLELQVCLIE 786
>gi|270005488|gb|EFA01936.1| hypothetical protein TcasGA2_TC007550 [Tribolium castaneum]
Length = 1119
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 97/429 (22%), Positives = 171/429 (39%), Gaps = 44/429 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 638 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 692
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 693 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 752
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 753 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCC-- 808
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
F G + +L E + I K ++ R LG F +
Sbjct: 809 ------FFGSQME-YLGHEISAEGIKPGETKIKAVTAFPKPTDVHKLRQFLGLCGYFRKY 861
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 862 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 916
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 917 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 976
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 977 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 1028
Query: 885 DSLSRSKSL 893
D+LSR+ L
Sbjct: 1029 DALSRNTVL 1037
>gi|384500943|gb|EIE91434.1| hypothetical protein RO3G_16145 [Rhizopus delemar RA 99-880]
Length = 454
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 143/368 (38%), Gaps = 60/368 (16%)
Query: 551 IDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRG 610
+D+ AY + I + A + T +PFGL AP +F L N +L
Sbjct: 13 LDIRNAYHRIRIAEGDEWKTAFRTQYGLFEYTVMPFGLTNAPASFQGLIN--DTLREFLD 70
Query: 611 MRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP 670
+ VVVYLDD L+ +++ L + L G +K V +FLG P
Sbjct: 71 LFVVVYLDDILIYSENLTDHYNHVNLVLEKLRGAGLYAKAEKCEFDVTEV-EFLGFKISP 129
Query: 671 HLDRMWLPEDKQLTLGNILRTLLASKTWNL-DSARSLLGYLSFASFVIPMGRLHSRRIQR 729
+++ + K + N W+ S + +L F++F RR +
Sbjct: 130 K--GIFMDQSKVSAITN----------WSTPRSVHDIQVFLGFSNFF--------RRFIQ 169
Query: 730 QASLLRLGAPHLTPINPAVLPKLE-----WWLNALPLSSPIFPRQVQHF-------ISTD 777
S L + LT N + E L S+PI + HF I TD
Sbjct: 170 DYSKLTVPMTALTKKNVPFVWSTEADQSFQQLKTAFTSAPI----LHHFDPSSKIIIETD 225
Query: 778 ASDL---GWGSQVDS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ-- 825
ASD G SQ S +F S + + N+ I KE+ A+ + LQ
Sbjct: 226 ASDFAIAGVLSQYGSDSLLHPVAFYSRKLNTTEVNYDIYDKELLAIIECFKTWRHYLQGA 285
Query: 826 SSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
S + V +D++ + + TKSL+ IF+ S D+ I + PG N AD
Sbjct: 286 SHQITVYTDHKNLEYF----ATTKSLNRRQARWSIFMSSFDFFI----TYRPGTKNPKAD 337
Query: 886 SLSRSKSL 893
+LSR +
Sbjct: 338 ALSRRSDM 345
>gi|390358910|ref|XP_003729361.1| PREDICTED: uncharacterized protein LOC754545 [Strongylocentrotus
purpuratus]
Length = 462
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 72/163 (44%), Gaps = 9/163 (5%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I+ L+ G+++ +ST+ + S L V K +G TRP ++ + LN + L
Sbjct: 108 IETQLQMGIIR--ESTSAWSSPLVYVKKRDGTTRPCVDYRKLNDVTRKDAYPLPRIEDCL 165
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L ++DL Y+ + IK + A S +PFGL AP F
Sbjct: 166 DCLGGAQIFSTLDLQSGYWQIDIKEEDRHKTAFSTRTGHYEYVTMPFGLCNAPGTFERAM 225
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQD-----PRILEIQGKLA 637
+ L+ R + ++YLDD ++++ R+ E+ G+L
Sbjct: 226 ELIMKGLQWRTL--ILYLDDIIVMSSTIGEHINRLDEVLGRLG 266
>gi|270016456|gb|EFA12902.1| hypothetical protein TcasGA2_TC001990 [Tribolium castaneum]
Length = 933
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 96/450 (21%), Positives = 168/450 (37%), Gaps = 83/450 (18%)
Query: 452 PFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGG 511
P ++KP +PL LQ + M+ G+ + S + + S L +VPK +
Sbjct: 416 PVASKPRRLPLDKLQ--------IAKREFEHMMALGICR--PSNSPWASPLHMVPKKDSN 465
Query: 512 TRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMIS-IDLSQAYFHVPIKTTHQRFL 570
RPV + + LN ++ I H + L G+ + S +DL +AY +P++ +
Sbjct: 466 WRPVGDYRRLNAVTKEDRYP-IPHLHDFAHLLAGNTIFSTVDLVRAYHQIPVEASSIPKT 524
Query: 571 ALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVV-YLDDFLLVNQDPRI 629
A + + T + FGL A Q+F + V + G+ YL+D L+ + R
Sbjct: 525 ATTTPFGLFEFTRMQFGLRNAAQSFQRFIHEVLN-----GLHFCFPYLNDILIASTSERE 579
Query: 630 LEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL 689
+ L G +N +K S GN
Sbjct: 580 RTDHLRKVFERLLKYGLTINPEKCS------------------------------FGN-- 607
Query: 690 RTLLASKTWNLDSARSLLGYLSFASFVIPM-GRLHSRRIQRQASLLRLGAPHLTPINPAV 748
S LGY A P+ R+ S+ +R LG P+ + V
Sbjct: 608 ------------SKVKFLGYEVSADGTKPLTDRVKSKTAKRTTKHRYLGPPNRKQHSSCV 655
Query: 749 LPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGSQVDS---------SFLSGLWSRE 799
L A+ L P + ++ DASD G+ ++ SF S +
Sbjct: 656 KHDLA---QAMLLIHPTATDTIS--LTVDASDFAMGAVLEQNQGGSWKLLSFFSKKLTPA 710
Query: 800 QQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEK 859
QQ + +E+ A++ A+ L+ ++ +D++ +V L ++ + ++
Sbjct: 711 QQKYSTYDRELLAIYSAVKALQHFLEGRHFVIYTDHKPLVYALTQKSDKATPRQARHLDY 770
Query: 860 IFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
I + + I G N VAD+LSR
Sbjct: 771 ISQFT------TVINHISGKSNVVADTLSR 794
>gi|156846142|ref|XP_001645959.1| Tkp3 protein [Vanderwaltozyma polyspora DSM 70294]
gi|156116630|gb|EDO18101.1| Tkp3 protein [Vanderwaltozyma polyspora DSM 70294]
Length = 1063
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 94/415 (22%), Positives = 174/415 (41%), Gaps = 53/415 (12%)
Query: 494 STTGFLSRLFLVPKGNGGTRPVLNLKGLNQ--FLSPKKFSLINHFRIPSFLQKGDYMI-- 549
S + F S + +V K +G R ++ + LN+ P LI+H + + G I
Sbjct: 142 SKSPFSSPIVMVKKKDGSYRLCVDYRKLNKATVKDPFPLPLIDH----ALAKIGSATIFT 197
Query: 550 SIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSR 609
++DL Y + +K + A T +PFGL AP F S ++A L R
Sbjct: 198 TLDLHSGYHQISMKEQDRYKTAFVTPNGKYEYTVMPFGLVNAPSTF---SRYMADLFRDL 254
Query: 610 GMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWD 669
V VYLDD L+ + + + L ++G IV +K + + V ++LG
Sbjct: 255 KF-VNVYLDDILIFSTTLNDHWNHLDIVFNRLKNVGLIVKKKKCTFAAEEV-EYLGYNVG 312
Query: 670 PHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLLGYLSFASFVIPMGRLHSR 725
R LP I A K + + A+ LG +++ IP +
Sbjct: 313 V---RGILP---------IQNKCQAVKDFPTPTTIKEAQRFLGLVNYYRRFIPHCADKTE 360
Query: 726 RIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS 785
IQ+ + G + + + +L+ L++ P+ P + + ++TDAS G G+
Sbjct: 361 PIQKYVT----GQTEWSDLQDKAMVELKDILSSEPVLVAFRPDGL-YRLTTDASKNGVGA 415
Query: 786 QVDS-----------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSD 834
++ + S ++N+ + E+ + AL LL ++++D
Sbjct: 416 VLEEVTETGKLKGVVGYFSHSLKGPERNYPAGELELLGIVSALKHFKYLLHGRHFVLRTD 475
Query: 835 NQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ ++S +R++G + V++ L D+ I + Q++ G N VAD++SR
Sbjct: 476 HVGLLS-IRKEGEPS-----TRVQRWLDLLADFDIDL--QYLQGKKNVVADAISR 522
>gi|147861248|emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera]
Length = 1521
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 105/468 (22%), Positives = 174/468 (37%), Gaps = 87/468 (18%)
Query: 459 LVPLCSLQHL----ATPVSSA-MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTR 513
L+P SL +L P A + + E+L G ++ S G + L PK +G R
Sbjct: 617 LIPGASLPNLPAYRMNPTEHAELKRQVDELLTKGFIRESLSPCGVPA--LLTPKKDGSWR 674
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMI------SIDLSQAYFHVPIKTTHQ 567
++ + +N K ++ F IP D M+ IDL Y + I+ +
Sbjct: 675 MCVDSRAIN------KITIKYRFPIPRLDDMLDMMVGSVIFSKIDLRSGYHQIRIRPGDE 728
Query: 568 RFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP 627
+ + +PFGL AP F + V R VVVY DD L+ ++
Sbjct: 729 WKTSFKTKDGLYEWLVMPFGLTNAPSTFMRIMTQVLKPFIGRF--VVVYFDDILIYSRSC 786
Query: 628 RILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN 687
E K + L + + +NL+K + +P + FLG + + + D +
Sbjct: 787 EDHEEHLKQVMRTLRAEKFYINLKKCTFM-SPSVVFLGFV----VSSKGVETDPE----- 836
Query: 688 ILRTLLASKTW----NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTP 743
+ A W N+ RS G +F RR R S + + P
Sbjct: 837 ---KIKAIVDWPVPTNIHEVRSFHGMATFY-----------RRFIRNFSSI------MAP 876
Query: 744 INPAVLPKLEWWLNALP---------------LSSPIFPRQVQHFISTDASDLGWGSQVD 788
I + P L W A L P F + + ++ DAS +G G+ +
Sbjct: 877 ITECMKPGLFIWTKAANKAFEEIKSKMVNPPILRLPDFEKVFE--VACDASHVGIGAVLS 934
Query: 789 S-----SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLR 843
+F S + ++ + E +AV QA+ L ++ SD++ + YL
Sbjct: 935 QEGHPVAFFSEKLNGAKKKYSTYDLEFYAVVQAIRHWQHYLSYKEFVLYSDHE-ALRYLN 993
Query: 844 RQGGTKSL-SLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRS 890
Q S + S ++F + + G N VAD+LSR
Sbjct: 994 SQKKLNSRHAKWSSFLQLFTFN--------LKHCAGIENKVADALSRK 1033
>gi|189236280|ref|XP_001815148.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 3364
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 97/429 (22%), Positives = 171/429 (39%), Gaps = 44/429 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 1237 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 1291
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 1292 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 1351
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 1352 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLRTVFQLLRQFGLTLKLSKCC-- 1407
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
F G + +L E + I K ++ R LG F +
Sbjct: 1408 ------FFGSQME-YLGHEISAEGIKPGETKIKAVTAFPKPTDVHKLRQFLGLCGYFRKY 1460
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 1461 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDAEAETEL 1515
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 1516 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 1575
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 1576 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 1627
Query: 885 DSLSRSKSL 893
D+LSR+ L
Sbjct: 1628 DALSRNTVL 1636
>gi|391326403|ref|XP_003737706.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 1575
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 92/205 (44%), Gaps = 13/205 (6%)
Query: 469 ATPVSSAMSLHIQEMLETGVLKRLDSTTG---FLSRLFLVPKGNGGTRPVLNLK-GLNQF 524
A PV+ A+ I + +E V + + T + + + +V K +G R + GLN
Sbjct: 480 ARPVAYALLPKIVDEIERLVSEDVLEPTAHSKYAAPVVIVQKKDGTIRLCADYSTGLNNS 539
Query: 525 LSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
+ + L I + L G Y +DL++AY +P+++ Q L ++ + M L
Sbjct: 540 IEDDAYPLPTAESIFAKLNGGRYFSQLDLAEAYLQIPVESQSQELLTINTAKGLFKMKRL 599
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSI--LG 642
+G+ TAP F L + + + L YLDD L+ + I E +G+LA L
Sbjct: 600 AYGVKTAPSLFQRLMDTITNDLPG----TTAYLDDILVTSST--IEEHEGRLAKVFQRLQ 653
Query: 643 SLGWIVNLQKSSLSPAPVLQFLGIM 667
G + +K S V +FLG +
Sbjct: 654 ENGLRIREEKCSFLRTEV-KFLGFI 677
>gi|2995405|emb|CAA73042.1| polyprotein [Ananas comosus]
Length = 871
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 109/448 (24%), Positives = 172/448 (38%), Gaps = 60/448 (13%)
Query: 469 ATPVSSA-----------MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLN 517
TP+S A + +Q++L+ G ++ S + + + + V K +G R ++
Sbjct: 15 TTPISKAPYRMAPAELRELRAQLQDLLDKGFIR--PSVSPWGAPVLFVKKKDGSLRLCVD 72
Query: 518 LKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGD 577
+ LN+ K+ L + LQ IDL Y + IK A
Sbjct: 73 YRELNKVTIKNKYPLPRIDDLFDQLQGSCVYSKIDLQSGYHQLKIKPEDVSKTAFRTRYG 132
Query: 578 VLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLA 637
+PFGL AP AF L N V R VVV++DD L+ ++ E ++
Sbjct: 133 HYEFAVMPFGLTNAPTAFMDLMNRVFKPYLDRF--VVVFIDDILVYSRSDADHEEHLRIV 190
Query: 638 VSILGSLGWIVNLQKSS--LSPAPVLQFL----GIMWDPHLDRMWLPEDKQLTLGNILRT 691
+ +L V L+K L L L GI DP +
Sbjct: 191 LQVLREKELYVKLKKCEFWLREVAFLGHLISGSGIAVDP-------------------KK 231
Query: 692 LLASKTW-NLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
+ A K W L S + +L A + R R + L RL + I
Sbjct: 232 IEAIKDWPRLTSVTEIRSFLGLAGY---YRRFVERFAKLSTPLTRLTHKGVKFIWNDACE 288
Query: 751 KLEWWLNALPLSSPIFPRQVQ---HFISTDASDLGWGS---QVDS--SFLSGLWSREQQN 802
+ L ++PI V + + +DAS G G Q D ++ S ++N
Sbjct: 289 RSFQELKQRLTTAPILTLPVAGAGYVVYSDASLNGLGCVLMQDDKVIAYASRQLKEYEKN 348
Query: 803 WHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFL 862
+ + E+ AV AL L L V +D+++ + YL Q K L+L +
Sbjct: 349 YPTHDLELAAVVFALKLWRHYLYGERCEVYTDHKS-LKYLFTQ---KELNLRQ--RRWLE 402
Query: 863 LSQDWRIHILAQFIPGAYNSVADSLSRS 890
L +D+ + IL + PG N VAD+LSR
Sbjct: 403 LLKDYDLTIL--YHPGKANVVADALSRK 428
>gi|189236292|ref|XP_001815280.1| PREDICTED: similar to orf [Tribolium castaneum]
Length = 1505
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 173/426 (40%), Gaps = 44/426 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++L +GV++ DS + S + LV K +G R ++ + LN +F L R+
Sbjct: 668 VNDLLGSGVIRESDSP--YSSPILLVRKKDGQHRMCVDYRQLNSKTIKDRFPLP---RVD 722
Query: 540 SFLQK---GDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
L K + ++DL+ Y+ +P+ T A +PFGLA AP F
Sbjct: 723 EHLDKLNGAKFFTTLDLASGYYQIPMATESIPKTAFVTPDGHYEFVRMPFGLANAPAVFQ 782
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLS 656
N V LR + Y+DD L+ ++D + +L G + L K
Sbjct: 783 RAMNKVLGPLRFQT--AFCYIDDLLIPSKDFETGLNNLQTVFQLLRQFGLTLKLSKCCFF 840
Query: 657 PAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLS-FASF 715
+ + ++LG + + K +T K ++ R LG F +
Sbjct: 841 GSQI-EYLGHEISAEGIKPGETKIKAVT--------AFPKPTDVHKLRQFLGLCGYFRKY 891
Query: 716 VIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFI 774
V + + SLL+ G+ + L+ L + P+ + I+ + + +
Sbjct: 892 VKDYATIAN----SLTSLLKKGSAFVWEEAQERAFQTLKDILTSRPVLA-IYDTEAETEL 946
Query: 775 STDASDLGWGS-----QVDSS-----FLSGLWSREQQNWHINKKEMFAVHQALSLNLPLL 824
TDAS +G G Q D S F S ++E+Q +H + E AV +L L
Sbjct: 947 HTDASKVGIGGILLQRQGDGSLRPVMFFSRQTTKEEQRYHSYELETLAVVCSLKHYRVYL 1006
Query: 825 QSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
V +D + + L ++ L+ + + +LL+ ++ + ++ PG+ S
Sbjct: 1007 LGLQFKVITDCNALRTTLTKR------DLIPRIGRWWLLTSEFDFTV--EYRPGSKMSHV 1058
Query: 885 DSLSRS 890
D+LSR+
Sbjct: 1059 DALSRN 1064
>gi|62734485|gb|AAX96594.1| retrotransposon protein, putative, Ty3-gypsy sub-class [Oryza
sativa Japonica Group]
gi|62734535|gb|AAX96644.1| retrotransposon protein, putative, Ty3-gypsy sub-class [Oryza
sativa Japonica Group]
gi|77550407|gb|ABA93204.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
Japonica Group]
Length = 1158
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 82/185 (44%), Gaps = 13/185 (7%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR 501
++ ++ G A P S +P +P+ L+ L I+E+ E G ++ S G +
Sbjct: 413 IIDLIPGTA-PISKRPYQMPVNELEELKK--------QIRELQEKGFVRPSSSPWG--AP 461
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ V K +G R ++ + LN+ K+ L + L+ IDL Y +
Sbjct: 462 VLFVKKKDGSMRMCVDYRSLNEVTIKNKYPLPRIDDLFDQLKGAKVFSKIDLQSGYHQLK 521
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
I+T A S + + T + FGL AP F +L N V + VVV++DD L
Sbjct: 522 IRTGDIPKTAFSTHYGLYEFTVISFGLTNAPAYFMNLMNKV--FMDYLDKFVVVFIDDIL 579
Query: 622 LVNQD 626
+ ++D
Sbjct: 580 IYSKD 584
>gi|74640|pir||GNLJSP pol polyprotein - human foamy virus
Length = 886
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 100/217 (46%), Gaps = 13/217 (5%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
++ VPK +G R VL+ + +N+ + + I + + + Y ++DL+ ++ P
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
I A ++ G T LP G +P F + V LL+ V VY+DD
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTAD---VVDLLKEIP-NVQVYVDDIY 120
Query: 622 LVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK 681
L + DP+ Q + IL G++V+L+KS + V +FLG ++ + E +
Sbjct: 121 LSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTV-EFLGF----NITK----EGR 171
Query: 682 QLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
LT + L + +L +S+LG L+FA IP
Sbjct: 172 GLTDTFKTKLLNITPPKDLKQLQSILGLLNFARNFIP 208
>gi|77552016|gb|ABA94813.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
Japonica Group]
Length = 1712
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 81/185 (43%), Gaps = 13/185 (7%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR 501
++ ++ G P S +P +P+ L+ L I+E+ E G ++ S G +
Sbjct: 874 IIDLIPG-TTPISKRPYRMPVNELEELKK--------QIRELQEKGFVRPSSSPWG--AP 922
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ V K +G R ++ + LN+ K+ L + L+ +IDL Y +
Sbjct: 923 VLFVKKKDGSMRICVDYRSLNEVTIKNKYPLPRIDDLFDQLKGAKVFSTIDLRSGYHQLK 982
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
I+T A S + T + FGL AP F +L N V + VVV++DD L
Sbjct: 983 IRTEDIPKTAFSTRYGLYEFTVMSFGLTNAPAYFMNLMNKV--FMDYLDKFVVVFIDDIL 1040
Query: 622 LVNQD 626
+ ++D
Sbjct: 1041 IYSKD 1045
>gi|77555663|gb|ABA98459.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
Japonica Group]
Length = 1470
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 81/185 (43%), Gaps = 13/185 (7%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR 501
++ ++ G A P S +P +P+ L+ L I+E+ E G ++ S G +
Sbjct: 533 IIDLIPGTA-PISKRPYRMPVNELEELKK--------QIRELQEKGFVRPSSSPWG--AP 581
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ V K +G R ++ + LN+ K+ L + L+ IDL Y +
Sbjct: 582 VLFVKKKDGSMRMCVDYRSLNEVTIKNKYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLK 641
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
I+T A S + T + FGL AP F +L N V + VVV++DD L
Sbjct: 642 IRTEDIPKTAFSTRYGLYEFTVMSFGLTNAPAYFMNLMNKV--FMDYLDKFVVVFIDDIL 699
Query: 622 LVNQD 626
+ ++D
Sbjct: 700 IYSKD 704
>gi|56266252|emb|CAE81285.1| polyprotein [Cacao swollen shoot virus]
Length = 1770
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/453 (20%), Positives = 178/453 (39%), Gaps = 50/453 (11%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
++HL + H++ +L+ GV++ S + + F+V G +G R
Sbjct: 1216 IKHLTPAMEKQFQKHVKALLDIGVIR--PSKSKHRTTAFIVESGTVIDPVTKKTIHGKER 1273
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1274 MVFNYKRLNDNTEKDQYSLPGIQTILKRVGNKKIFSKFDLKSGFHQVAMAKESIPWTAFW 1333
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + + VY+DD L+ ++
Sbjct: 1334 VPQGLYEWLVMPFGLKNAPAVFQRKMD---QCFKGTEEFIAVYIDDILVFSETMAEHTKH 1390
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
+ ++I G +++ K L+ + +FLG + + + + +I++ ++
Sbjct: 1391 IGIMLTICQENGLVLSPNKICLAQREI-EFLGTI---------ISQGQMKLQPHIIKKIV 1440
Query: 694 ASKTWNLDSA---RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
L++ RS LG L++A IP L + A G + ++
Sbjct: 1441 NKADMELETTKGLRSFLGLLNYARIYIP--NLGKKLSPLYAKTSPTGEKKFNRQDWHLIK 1498
Query: 751 KLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG-------SQVDSSFLSGLWSREQQNW 803
+++ + LP + I P + I +D GWG ++ DS + + +
Sbjct: 1499 EIKNMVQRLP-NLAIPPARCCIIIESDGCMEGWGAVCKWKLAKEDSRTTEKVCAYASGKF 1557
Query: 804 HINKK----EMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK--SLSLLSE 856
I K E+FA+ +AL S + L ++V++D Q +V++ + K + ++
Sbjct: 1558 GIIKSTIDAEIFALIKALESFKIFYLDKKHLVVRTDCQAIVTFYNKTSTHKPSRIRWITF 1617
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ I L + + + I G N +AD+LSR
Sbjct: 1618 SDYITGLG----VQVTIEHINGKENQLADTLSR 1646
>gi|156052661|ref|XP_001592257.1| hypothetical protein SS1G_06497 [Sclerotinia sclerotiorum 1980]
gi|154704276|gb|EDO04015.1| hypothetical protein SS1G_06497 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1582
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 93/423 (21%), Positives = 164/423 (38%), Gaps = 67/423 (15%)
Query: 498 FLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAY 557
F + + V K NG R ++ + LN ++ L + LQ +D+ QA+
Sbjct: 923 FAAPVLFVKKSNGSLRFCIDYRKLNALTRKDRYPLPLIDETLARLQGAKIYTKLDIRQAF 982
Query: 558 FHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYL 617
+ + + + LPFGL P + N + L YL
Sbjct: 983 HRIRMDPASEEYTTFRTRYGAYKCKVLPFGLTNGPATYQRYMNDI--LFDYLDDFCTAYL 1040
Query: 618 DDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKS--SLSPAPVLQFL----GIMWDPH 671
DD L+ ++DP + + + L G +L+KS ++ L F+ GI DP
Sbjct: 1041 DDILIYSEDPSEHDTHVRKVLQRLRDAGLQADLKKSEFDVTKTKYLGFIISTTGIEVDP- 1099
Query: 672 LDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQA 731
D++ + ++ Q L +S LG+ +F IP
Sbjct: 1100 -DKVAIVKEWQY-------------PSTLKGVQSFLGFCNFYRRFIP------------- 1132
Query: 732 SLLRLGAP--HLTPINPAVLPKLEW-----WLNALPLSSPI---FPRQVQHFISTDASDL 781
S + +P HLT N E L L ++PI + +++ + TDASD
Sbjct: 1133 SYGVIASPLTHLTKTNVPFSFNQECKEAFNTLRGLLTTAPILRHYDYKLESMLETDASD- 1191
Query: 782 GWGSQV------DSSFLSGLWSRE----QQNWHINKKEMFAVHQALSL-NLPLLQSSV-V 829
G + V D F +S+ + N+ ++ KE+ A+ ++ L+ S V
Sbjct: 1192 GVIAAVLSQKHDDHWFPVAYFSKTMLPAELNYPVHDKELTAIAKSFGHWRAELIGSPFQV 1251
Query: 830 MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
V +D++ + ++ S L S ++ L D+ I+ + PG N +AD+LSR
Sbjct: 1252 KVYTDHKALEYFM------TSKQLNSRQARVAELLADFNFLIM--YRPGKENPLADALSR 1303
Query: 890 SKS 892
+
Sbjct: 1304 RED 1306
>gi|353231103|emb|CCD77521.1| hypothetical protein Smp_166950 [Schistosoma mansoni]
Length = 1074
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 95/422 (22%), Positives = 181/422 (42%), Gaps = 43/422 (10%)
Query: 483 MLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRIPSF 541
ML+ G+++ DS + S L +VPK N G RP + + LN+ P ++ + + +
Sbjct: 485 MLQLGIIRPSDSQ--WASPLHMVPKKNEGDWRPCGDYRALNRQTVPDRYPIPHIQDFTNG 542
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
LQ + IDL +AY ++P+ A++ + +PFGL A Q F +
Sbjct: 543 LQGMNIFTKIDLVRAYHNIPVADEDIPKTAITTPFGLFEFVRMPFGLRNAAQTF---QRF 599
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS-LSPAPV 660
+ +LLR Y+DD L+ + D + E K + L G +N+ +S +
Sbjct: 600 IDNLLRDMPF-AQGYIDDLLIASPDLQSHEQHVKTVLKRLDEHG--INIHQSKCVFGVQT 656
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP-- 718
L+FLG P + P K+ + I + + S +L RS G ++F IP
Sbjct: 657 LEFLGHTISPEGIK---PIKKE--VDTIKQYPIPS---SLTQLRSFPGLINFYRRFIPGC 708
Query: 719 --MGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFIST 776
+ + + ++R+ +L + + I + KL L + ++P +
Sbjct: 709 AQLMQPLTDSLKRKPKEFKLSSDAVEAIK-QLKDKLA-QATTLMYPNSLYPLALM----V 762
Query: 777 DASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS 827
DASD G ++ +F S + + + +E+ A++ + +L+
Sbjct: 763 DASDKAVGGTLNQLVKNAWKPIAFFSKRLAPAETRYSTFGRELLAIYLTIKHFRHMLEGR 822
Query: 828 VVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSL 887
+V +D++ + + L+ + S + ++ I + D R H+ +Q N AD+L
Sbjct: 823 EFIVFTDHKPLTNALKARADKYSPPEVRHLDYISQFTSDIR-HVKSQ-----DNQAADAL 876
Query: 888 SR 889
SR
Sbjct: 877 SR 878
>gi|321473536|gb|EFX84503.1| hypothetical protein DAPPUDRAFT_238870 [Daphnia pulex]
Length = 736
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 96/241 (39%), Gaps = 16/241 (6%)
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSL 644
P PQ VA LLR + + V D L K+A +L L
Sbjct: 48 PEECTMGPQMEEVCDKEVADLLRKKAIAVA----------PDTPGLLADVKMASDLLQRL 97
Query: 645 GWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSAR 704
G+++N +KS +P L+FLG++ + LP K+ + L + L
Sbjct: 98 GFLINWEKSLPNPTQSLEFLGMILNSLSLAFILPSVKREKTRKLCVQALNNNIIKLRDLA 157
Query: 705 SLLGYLSFASFVIPMGRLHSRRIQRQA--SLLRLGA--PHLTPINPAVLPKLEWWLNALP 760
++ S A +P + H R++Q L R G ++ L WW+ ++
Sbjct: 158 KVIDNFSCAIPAMPFAQAHYRKVQSDLIWMLGRNGGDFEKYLVLSEEAKADLSWWIQSMD 217
Query: 761 LSSP--IFPRQVQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALS 818
S IF + I +DAS GW + + G WS E + HIN+ E+ A AL
Sbjct: 218 SSEGKVIFQGEPDLTIFSDASLSGWVAVCNGITARGPWSAEDASRHINELELLAAFFALQ 277
Query: 819 L 819
+
Sbjct: 278 I 278
>gi|301623455|ref|XP_002941032.1| PREDICTED: hypothetical protein LOC100491299 [Xenopus (Silurana)
tropicalis]
Length = 704
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 79/364 (21%), Positives = 127/364 (34%), Gaps = 81/364 (22%)
Query: 544 KGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAP---QAFASLSN 600
+G + D+ A+ +PI L + G CLP G + + + FA+
Sbjct: 373 QGSLLAKTDIESAFRLLPIHPDSHYLLGFHFQGAYFYDKCLPMGCSISCKYFEMFATFLE 432
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPR-----------ILEIQGKLAVSILGSLGWIVN 649
WV S V YLDDFL + PR L K V I
Sbjct: 433 WVIKF-ESGANFVTHYLDDFLFLG--PRGSNTCSILLNTFLHYSSKFGVPIA-------- 481
Query: 650 LQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGY 709
++ +++P LQFLGI D LPE K L +++ + L +K L +SL+G
Sbjct: 482 -REKTVAPTTSLQFLGIEIDTMHMEFRLPEAKISKLKSLIASALVAKKLKLKHIQSLIGT 540
Query: 710 LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQ 769
F+ N A+
Sbjct: 541 CWQEDFI---------------------------ENSAI--------------------- 552
Query: 770 VQHFISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKK----EMFAVHQALSLNLPLLQ 825
+ A G+G+ + + W E + + E+F V AL + L
Sbjct: 553 --QLFTDAAGSTGFGAYLSGRWCCAAWPSEWRKQELTGNLVLLEIFPVLVALEIWGSWLA 610
Query: 826 SSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVAD 885
+ +++ N VV + KS ++ + + L+ I + A+ IPG N +AD
Sbjct: 611 NRRILLFCHNMGVVQVINNLSA-KSPPVVRVMRHLVFLALMHNIWLRAKHIPGCQNILAD 669
Query: 886 SLSR 889
+LSR
Sbjct: 670 ALSR 673
>gi|91214364|gb|ABE27943.1| ORFIII [Banana streak virus]
Length = 1709
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 111/507 (21%), Positives = 192/507 (37%), Gaps = 101/507 (19%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN-----------GGTR 513
L+H+ + M+ H+Q++LE V++ S++ + +V G G R
Sbjct: 1241 LKHVTPTMKETMAKHVQKLLELKVIR--PSSSKHRTTAMIVESGTEVDPMTGKERRGKER 1298
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1299 LVFNYKRLNDNTEKDQYSLPGINTIIKRIGNAKIYSKFDLKSGFHQVAMDPESIPWTAFW 1358
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + R + VY+DD L+ ++ +
Sbjct: 1359 AIDGLYEWLVMPFGLKNAPAIFQRKMD---NCFRGTEDFIAVYIDDILVFSETIHQHKEH 1415
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN------ 687
K ++I G +++ K + + FLG T+GN
Sbjct: 1416 LKKFMTICEKNGLVLSPTKMKIGTRQI-DFLG-----------------ATIGNSKIKLQ 1457
Query: 688 --ILRTLLASKTWNLDSARSL---LGYLSFASFVIP-----MGRLHS-------RRIQRQ 730
I++ ++ K L + L LG L++A IP +G L++ RR+ Q
Sbjct: 1458 PHIIKKIIEMKDEELKEVKGLRKWLGILNYARSYIPKLGKILGPLYAKTSPNGERRMNTQ 1517
Query: 731 ASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----Q 786
+ + A LP+LE LP P + I TD GWG +
Sbjct: 1518 DWKIVKEVKEVV----ANLPELE-----LP------PEKAIMIIETDGCMEGWGGVCKWK 1562
Query: 787 VDSSFLSGLWSREQQNWHINK---------KEMFAVHQALS-LNLPLLQSSVVMVQSDNQ 836
DS L WS + + K E+ AV +L + L +++++D+Q
Sbjct: 1563 TDS--LQPRWSEKICAYASGKFTPIKSTIDAEIQAVINSLDKFKIYYLDKKELIIRTDSQ 1620
Query: 837 TVVSYLRRQGGTK--SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL- 893
+VS+ ++ K + L+ + I + I + I G N +AD+LSR +
Sbjct: 1621 AIVSFYKKSSDHKPSRVRWLAFTDYITGTG----LEIKFEHIDGKDNVLADTLSRLVKII 1676
Query: 894 --PDWHLSR----SATEQIFLKWGVPC 914
P+ H S +A E++F + C
Sbjct: 1677 LHPEKHQSEGVLINAVEEVFHNYTKGC 1703
>gi|38346969|emb|CAE02265.2| OSJNBb0049I21.5 [Oryza sativa Japonica Group]
Length = 1644
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 81/185 (43%), Gaps = 13/185 (7%)
Query: 442 LVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSR 501
++ ++ G A P S +P +P+ L+ L I+E+ E G ++ S G +
Sbjct: 623 IIDLIPGTA-PISKRPYRMPVNELEELKK--------QIRELQEKGFVRPSSSPWG--AP 671
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
+ V K +G R ++ + LN+ K+ L + L+ IDL Y +
Sbjct: 672 VLFVKKKDGSMRMCVDYRSLNEVTIKNKYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLK 731
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
I+T A S + T + FGL AP F +L N V + M VV++DD L
Sbjct: 732 IRTGDIPKTAFSTRYGLYKFTVMSFGLTNAPAYFMNLMNKVFMDYLDKFM--VVFIDDIL 789
Query: 622 LVNQD 626
+ ++D
Sbjct: 790 IYSKD 794
>gi|350646870|emb|CCD58591.1| choline/ethanolamine kinase, putative [Schistosoma mansoni]
Length = 946
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 92/423 (21%), Positives = 179/423 (42%), Gaps = 46/423 (10%)
Query: 483 MLETGVLKRLDSTTGFLSRLFLVPKGN-GGTRPVLNLKGLNQFLSPKKFSLINHFRIPSF 541
ML+ G+++ DS + S L +VPK N G RP + + LN+ P ++ + + +
Sbjct: 107 MLQLGIIRPSDSQ--WASPLHMVPKKNEGDRRPCGDYRALNRQTVPDRYPIPHIQDFTNG 164
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS-LSN 600
LQ + I L +AY ++P+ A++ + +PFGL A Q F + N
Sbjct: 165 LQGMNIFTKIGLVRAYHNIPVADEDIPKTAITTPFGLFEFVRMPFGLRNAAQTFQRFMDN 224
Query: 601 WVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS-LSPAP 659
+ + ++G Y+DD L+ + D + E K + L G +N+ +S +
Sbjct: 225 LLRDMPFAQG-----YIDDLLIASPDLQSHEQHVKTVLKRLDENG--INIHQSKCVFGVQ 277
Query: 660 VLQFLGIMWDPHLDRMWLPED--KQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVI 717
L+FL P + D KQ + + L L RS LG ++F I
Sbjct: 278 TLEFLSHTISPEGIKPIKKVDTIKQYPIPSSLTQL-----------RSFLGLINFYRRFI 326
Query: 718 PMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQHF-IS 775
P ++ +Q L+ G P ++ + ++ + L +++ ++P + +
Sbjct: 327 PGC---AQLMQPLTDSLK-GKPKEFKLSSDAVEAIKQLKDKLAQVTTLMYPNSLSPLALM 382
Query: 776 TDASDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQS 826
DASD G ++ +F S + + + I +E+ A++ + +L+
Sbjct: 383 VDASDKAVGGTLNQLVKNAWKPIAFFSKRLAPAETRYSIFGRELLAIYLTIKHFRHMLEG 442
Query: 827 SVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADS 886
+V +D++ + + L+ + S + ++ I + D R + G N AD+
Sbjct: 443 REFIVFTDHKPLTNALKARADKYSPREVRHLDYISQFTSDIR------HVKGQDNQAADA 496
Query: 887 LSR 889
LSR
Sbjct: 497 LSR 499
>gi|5001453|gb|AAD37020.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 949
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 74/174 (42%), Gaps = 7/174 (4%)
Query: 452 PFSAK--PPLVPLCSLQHLATPVSSA-MSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG 508
PF+ + P P+ + P A + ++E+L G ++ S G + + V K
Sbjct: 143 PFTIELEPGTTPISKAPYRMAPAEMAELKKQLEELLAKGFIRPSSSPWG--APVLFVKKK 200
Query: 509 NGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQR 568
+G R ++ +GLN+ K+ L + L + IDL+ Y +PI+ T R
Sbjct: 201 DGSFRLCIDYRGLNKVTVKNKYPLPRIDELMDQLGGAQWFSKIDLASGYHQIPIEPTDVR 260
Query: 569 FLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLL 622
A +PFGL AP AF + N V V++++DD L+
Sbjct: 261 KTAFRTRYGHFEFVVMPFGLTNAPAAFMKMMNGVFRDFLDEF--VIIFIDDILV 312
>gi|406696837|gb|EKD00111.1| retrotransposon nucleocapsid protein [Trichosporon asahii var. asahii
CBS 8904]
Length = 1662
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 102/456 (22%), Positives = 171/456 (37%), Gaps = 49/456 (10%)
Query: 457 PPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVL 516
PP P+ SL V + ++ E LE G + + S + + + V K +G R +
Sbjct: 720 PPFGPIYSLSEKELGV---LREYLDENLEKGFI--VPSESPAAAPILFVKKKDGSLRLCV 774
Query: 517 NLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNG 576
+ +GLN+ ++ L + L+K IDL AY + I + A
Sbjct: 775 DYRGLNKITVKNRYPLPLIPELLDRLRKAKVFTKIDLRGAYNLLRIAEGDEWKTAFRTRY 834
Query: 577 DVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKL 636
+ +PFGL AP +F L N + V+ +LDD ++ + E K
Sbjct: 835 GLFEYKVMPFGLTNAPASFQHLMN--HNFRDMLDDFVICFLDDIMVFSDTTEEHEHHVKQ 892
Query: 637 AVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG--NILRTLLA 694
+ L +G K + V +FLG ++ DK + + + L
Sbjct: 893 VLQRLREVGLYAKASKCEFNKDSV-EFLG----------FIISDKGIGMDQKKVATILEW 941
Query: 695 SKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAP-HLTPINPAVLPKLE 753
K NL RS LG+ +F I + +S L R P T ++
Sbjct: 942 PKPCNLHDVRSFLGFCNFYRRFI---KGYSTIAGPLIRLTRNDVPFQWTAKEQQAFDAMK 998
Query: 754 WWLNALPLSSPIFPRQVQHFI-STDASDLGWGSQVDS---------SFLSGLWSREQQNW 803
S P QH + TDASD + +F S S + N+
Sbjct: 999 GCFITAGFLSHYDPN--QHLVLETDASDFAIAGVLSQKINDELRPIAFFSRKLSPAELNY 1056
Query: 804 HINKKEMFAVHQALSLNLPLLQSSV--VMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIF 861
I+ KEM A+ L+ + + V +D++++ + + + + SE F
Sbjct: 1057 EIHDKEMLAIVACFKEWRHYLEGAAHQITVYTDHRSLEYFTTSKQLNRRQARWSE----F 1112
Query: 862 LLSQDWRIHILAQFIPGAYNSVADSLSRSKSLPDWH 897
L D+ I + PG + D+L+R PD+H
Sbjct: 1113 LSEFDFVI----IYRPGLKGTKPDALTRR---PDYH 1141
>gi|403160624|ref|XP_003890497.1| hypothetical protein PGTG_20791 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170326|gb|EHS64088.1| hypothetical protein PGTG_20791 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 532
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 94/444 (21%), Positives = 180/444 (40%), Gaps = 65/444 (14%)
Query: 445 IVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGF----LS 500
++ G+ I F P + L++ TP + + S ++ +E + K L + F L+
Sbjct: 22 VLEGFRIGFDQGIPQHTIPGLRYY-TPDNHSSSEKVKSKVEESIKKELLAKRMFGPFTLN 80
Query: 501 RLF------------LVPKGNGGTRPVLNL---------KGLNQFLSPKKFSLI-NHFRI 538
++ V G+G RP+ +L + +N F++ F + FR+
Sbjct: 81 QVMKKFEFFRSNPLGAVVNGDGAIRPINDLSFPRYDPVVRSVNSFVNKHDFETTWDDFRV 140
Query: 539 PS-FLQKGDYMISI---DLSQAYFHVPIKTTHQRFLAL-SYNGDVLAMTCLPFGLATAPQ 593
S F K + + D +AY +P + +FL + +NG+ L T + FG
Sbjct: 141 VSEFFAKDKRKMELALFDWEKAYRQIPTRMDQWKFLLVKDFNGEFLLDTRITFGGVAGCG 200
Query: 594 AFASLSNWVASLLRSRG--MRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
+F ++ +++ R + + ++DD L + +I + V LG + N
Sbjct: 201 SFGRPADAWKQIMKKRFKLLNIFRWVDDNLFIRLQGD--DISMEKIVEFSTELGVLTN-- 256
Query: 652 KSSLSPAPVLQ-FLGIMWDPHLDRMWLPEDK-QLTLGNILRTLLASKTWNLDSARSLLGY 709
K SP Q F+G +W+ + LP+ K + + I+ L T++ + L+G
Sbjct: 257 KEKYSPFQDEQKFIGFIWNGIQKTVRLPDRKIEKRISQIMPFLEEKATFDYEDVEILIGR 316
Query: 710 LSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQ 769
L+ ++++P + H + R R+ P P L L W+ L +
Sbjct: 317 LNHVAYILPHLKCHLCSLYRWLISWRMRKAR-RPTPPDALEDLSLWVATL--------KS 367
Query: 770 VQHFISTDAS---DLGWGSQVDSSFLSGL-----WSREQQN------WHINKKEMFAVHQ 815
+H + D+GW +SF G+ W++ + + I+ E A+
Sbjct: 368 FEHTRIINYGPPVDIGWVGDASTSFGIGILIGKRWAQFKLHDPKSNPLRISYLETVAIRL 427
Query: 816 ALSLNLPLL--QSSVVMVQSDNQT 837
L + L L + +MV +DN T
Sbjct: 428 GLLMVLKLRTQRGKTLMVWTDNTT 451
>gi|18378611|gb|AAL68643.1|AF458767_1 polyprotein [Oryza sativa Japonica Group]
Length = 775
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 110/449 (24%), Positives = 184/449 (40%), Gaps = 71/449 (15%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
++EML+ G+++ S S + LV K +G R ++ + LN K+ L +
Sbjct: 176 VREMLDKGIIQPSSSPF--SSPVLLVKKKDGTWRFCVDYRHLNAITVKNKYPLPIIDELL 233
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L + + +DL Y + +K + + A + +PFGL +AP F
Sbjct: 234 DELSRAQWFTKLDLRAGYHQIRMKMSDEHKTAFKTHSGHYEFRVIPFGLTSAPATFQGGM 293
Query: 600 NWVASLLRSRGMRVVVYLDDFLL--------VNQDPRILEIQGKLAVSILGSLGWIVNLQ 651
N + S L R V+V++DD L+ VN ++ +I K + + S
Sbjct: 294 NSILSPLLRRC--VLVFVDDILIYSATLEDHVNHLRQLFQILVKHQLKVKQS-------- 343
Query: 652 KSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTW----NLDSARSLL 707
K S + L +LG + P + + +DK + N W ++ RS L
Sbjct: 344 KCSFAQQR-LSYLGHIITP--NGVSTDDDKIRVVQN----------WPVPGSVKELRSFL 390
Query: 708 GYLSFA-SFVIPMGRLHSRRIQRQASLLRLGAPHL-TPINPAVLPKLEWWLNALP-LSSP 764
G + FV G L S+ + +LLR G ++ T A L+ L P L+ P
Sbjct: 391 GLTGYYRKFVCHYGIL-SKPL---TNLLRKGQLYIWTSETEAAFQALKQALITAPVLAMP 446
Query: 765 IFPRQVQHFISTDASDLGWGS-----QVDSSFLSGLWSREQQNWHINKKEMFAVHQALSL 819
F + TDASD G G+ Q +FLS Q +K+ A+ A+
Sbjct: 447 NFSEPF--IVETDASDKGIGAVLMQHQHPIAFLSKALGPRHQGLSTYEKKSLAIMLAVEH 504
Query: 820 NLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGA 879
P LQ + +++D+++ +S+L Q T + + + L R I+ + G
Sbjct: 505 WRPYLQHAEFFIRTDHRS-LSFLDDQRLTTPWQHKALTKLLGL-----RYKII--YKKGT 556
Query: 880 YNSVADSLSR------------SKSLPDW 896
N AD+LSR S ++PDW
Sbjct: 557 DNGAADALSRYPSSATLELSALSVAVPDW 585
>gi|384403237|gb|AFH88829.1| ORFIII [Banana streak virus]
Length = 1709
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 109/507 (21%), Positives = 192/507 (37%), Gaps = 101/507 (19%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN-----------GGTR 513
L+H+ + M+ H+Q++LE V++ S++ + +V G G R
Sbjct: 1241 LKHVTPTMKETMAKHVQKLLELKVIR--PSSSKHRTTAMIVESGTEVDPMTGKERRGKER 1298
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1299 LVFNYKRLNDNTEKDQYSLPGINTIIKRIGNAKIYSKFDLKSGFHQVAMDPESIPWTAFW 1358
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + R + VY+DD L+ ++ +
Sbjct: 1359 AIDGLYEWLVMPFGLKNAPAIFQRKMD---NCFRGTEDFIAVYIDDILVFSETIHQHKEH 1415
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN------ 687
K ++I G +++ K + + FLG T+GN
Sbjct: 1416 LKKFMTICEKNGLVLSPTKMKIGTRQI-DFLG-----------------ATIGNSKIKLQ 1457
Query: 688 --ILRTLLASKTWNLDSARSL---LGYLSFA-SFVIPMGRL-----------HSRRIQRQ 730
I++ ++ K L + L LG L++A S++ +G++ RR+ Q
Sbjct: 1458 PHIIKKIIEMKDEELKEVKGLRKWLGILNYARSYISKLGKILGPLYAKTSPNGERRMNTQ 1517
Query: 731 ASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----Q 786
+ + A LP+LE LP P + I TD GWG +
Sbjct: 1518 DWKIVKEVKEVV----ANLPELE-----LP------PEKAIMIIETDGCMEGWGGVCKWK 1562
Query: 787 VDSSFLSGLWSREQQNWHINK---------KEMFAVHQALS-LNLPLLQSSVVMVQSDNQ 836
DS L WS + + K E+ AV +L + L +++++D+Q
Sbjct: 1563 TDS--LQPRWSEKICAYASGKFTPIKSTIDAEIQAVINSLDKFKIYYLDKKELIIRTDSQ 1620
Query: 837 TVVSYLRRQGGTK--SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL- 893
+VS+ ++ K + L+ + I + I + I G N +AD+LSR +
Sbjct: 1621 AIVSFYKKSSDHKPSRVRWLAFTDYITGTG----LEIKFEHIDGKDNVLADTLSRLVKII 1676
Query: 894 --PDWHLSR----SATEQIFLKWGVPC 914
P+ H S +A E++F + C
Sbjct: 1677 LHPEKHQSEGVLINAVEEVFHNYTKGC 1703
>gi|115481476|ref|NP_001064331.1| Os10g0317000 [Oryza sativa Japonica Group]
gi|15217201|gb|AAK92545.1|AC051624_3 Putative retroelement [Oryza sativa Japonica Group]
gi|31431040|gb|AAP52878.1| retrotransposon protein, putative, unclassified, expressed [Oryza
sativa Japonica Group]
gi|113638940|dbj|BAF26245.1| Os10g0317000 [Oryza sativa Japonica Group]
Length = 1476
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 91/224 (40%), Gaps = 25/224 (11%)
Query: 458 PLVPLCSLQHLATPVSSAMSLHIQ--EMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPV 515
P P +++ PV+ L Q M+E G+++R ST+ F S + LV K +G R
Sbjct: 528 PGAPPVAVRPYRYPVAHKDELERQCAVMMEQGLIRR--STSAFSSPVLLVKKADGSWRFC 585
Query: 516 LNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYN 575
++ + LN + + + L + +DL Y V ++ A +
Sbjct: 586 VDYRALNAITIKDAYPIPVVDELLDELHGAKFFTKLDLRSGYHQVRMRAEDVAKTAFRTH 645
Query: 576 GDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVN----------- 624
+ +PFGL AP F +L N + + R V+V+ DD L+ +
Sbjct: 646 DGLYEFLVMPFGLCNAPATFQALMNDILRIYLRRF--VLVFFDDILIYSNTWADHLRHIR 703
Query: 625 ------QDPRILEIQGKLA--VSILGSLGWIVNLQKSSLSPAPV 660
+ R+ + K A VS + LG I+ S+ PA V
Sbjct: 704 AVLLLLRQHRLFVKRSKCAFGVSSISYLGHIIGATGVSMDPAKV 747
>gi|270356908|gb|ACZ80693.1| putative retrotransposon nucleocapsid protein [Filobasidiella
depauperata]
Length = 1481
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 107/474 (22%), Positives = 181/474 (38%), Gaps = 72/474 (15%)
Query: 449 YAIPFS--AKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVP 506
+AIP +P P+ L + + +++ L G ++ S G S + V
Sbjct: 489 HAIPIKEGTQPKFGPVYRLSEVEL---KTLDGYLKNNLRNGFIRPSTSPAG--SPILFVK 543
Query: 507 KGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTH 566
K +G R ++ + LN + ++ L L + + IDL AY + IK
Sbjct: 544 KSDGSLRLCVDYRNLNDITTKNRYPLPLIGESLDRLSEASWFSKIDLRAAYHLIRIKKGD 603
Query: 567 QRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
+ A + +PFGL AP +F +L N V L + V+VYLDD L+ ++
Sbjct: 604 EWKTAFRTRYGLYEYQVMPFGLTNAPASFQNLINDV--LREYLDLSVIVYLDDILIFSKT 661
Query: 627 PRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLG 686
+ + L N K + V+ FLG + +M + + +T
Sbjct: 662 REEHVVHVNQVLEKLKENQLWANAGKCQFFQSEVV-FLGFIASKDGIKMDPKKVEAITDW 720
Query: 687 NILRTLLASKTWNLDSARSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINP 746
R N+ +S LG+ +F RR + S ++ P
Sbjct: 721 KTPR--------NVKGVQSFLGFANFY-----------RRFIKSYS--KIATPLTALTKK 759
Query: 747 AVLPKLEW---------WLNALPLSSPIFPRQVQHF-------ISTDASDL---GWGSQV 787
V+ EW L ++PI +QHF I TDASD G S
Sbjct: 760 DVM--FEWTEAAEDAFLTLKKAFTTAPI----LQHFSPSQPIVIETDASDYAIAGIISHP 813
Query: 788 DS-------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQ-SSVVMVQSDNQTVV 839
D +F S + + N+ I KEM A+ A L+ S + V +D++ +
Sbjct: 814 DERNQLRPIAFYSRKLTDVELNYEIYDKEMLAIVWAFKEWRAYLEGSKEITVYTDHKNLE 873
Query: 840 SYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
+ + + + +EV L + D++I + GA AD+L+R + L
Sbjct: 874 YFTTSKVLNRRQARWAEV----LANYDFKI----VYRSGAQMGKADALTRRQDL 919
>gi|113120256|gb|ABI30268.1| polyprotein [Banana streak virus]
Length = 1709
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 111/507 (21%), Positives = 192/507 (37%), Gaps = 101/507 (19%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN-----------GGTR 513
L+H+ + M+ H+Q++LE V++ S++ + +V G G R
Sbjct: 1241 LKHVTPTMKETMAKHVQKLLELKVIR--PSSSKHRTTAMIVESGTEVDPMTGKERRGKER 1298
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1299 LVFNYKRLNDNTEKDQYSLPGINTIIKRIGNAKIYSKFDLKSGFHQVAMDPESIPWTAFW 1358
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + R + VY+DD L+ ++ +
Sbjct: 1359 AIDGLYEWLVMPFGLKNAPAIFQRKMD---NCFRGTEDFIAVYIDDILVFSETIHQHKEH 1415
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN------ 687
K ++I G +++ K + + FLG T+GN
Sbjct: 1416 LKKFMTICEKNGLVLSPTKMKIGTRQI-DFLG-----------------ATIGNSKIKLQ 1457
Query: 688 --ILRTLLASKTWNLDSARSL---LGYLSFASFVIP-----MGRLHS-------RRIQRQ 730
I++ ++ K L + L LG L++A IP +G L++ RR+ Q
Sbjct: 1458 PHIIKKIIEMKDEELKEVKGLRKWLGILNYARSYIPKLGKILGPLYAKTSPNGERRMNTQ 1517
Query: 731 ASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----Q 786
+ + A LP+LE LP P + I TD GWG +
Sbjct: 1518 DWKIVKEVKEVV----ANLPELE-----LP------PEKAIMIIETDGCMEGWGGVCKWK 1562
Query: 787 VDSSFLSGLWSREQQNWHINK---------KEMFAVHQALS-LNLPLLQSSVVMVQSDNQ 836
DS L WS + + K E+ AV +L + L +++++D+Q
Sbjct: 1563 TDS--LQPRWSEKICAYASGKFTPIKSTIDAEIQAVINSLDKFKIYYLDKKELIIRTDSQ 1620
Query: 837 TVVSYLRRQGGTK--SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL- 893
+VS+ ++ K + L+ + I + I + I G N +AD+LSR +
Sbjct: 1621 AIVSFYKKSSDHKPSRVRWLAFTDYITGTG----LEIKFEHIDGKDNVLADTLSRLVKII 1676
Query: 894 --PDWHLSR----SATEQIFLKWGVPC 914
P+ H S +A E++F + C
Sbjct: 1677 LHPEKHQSEGVLINAVEEVFHNYTKGC 1703
>gi|67625686|tpe|CAJ00228.1| TPA: gag-pol polyprotein [Schistosoma mansoni]
Length = 816
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 117/285 (41%), Gaps = 24/285 (8%)
Query: 435 RLGAPAPLVRIVSGYAIPFSAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDS 494
R A +R+ SG F K P VP +LQ + + + GV+ + S
Sbjct: 476 RCSAMKTTLRLKSGVKPVFRPKRP-VPYAALQKVEE--------ELNRLQREGVITPV-S 525
Query: 495 TTGFLSRLFLVPKGNGGTRPVLNL-KGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDL 553
+ + + + ++ K NG R + GLN L + L + + L G + +DL
Sbjct: 526 YSAWAAPIVVIKKANGAIRICADFSTGLNAALEQHHYPLAVPADLFTMLNGGKFFAXLDL 585
Query: 554 SQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRV 613
+ AY V + + + L G + LPFG P F L + + S + V
Sbjct: 586 ADAYLQVEVAEDQESYSLLLLIGGLFQYNRLPFGGQDRPIYFQQLMDTILSGIPG----V 641
Query: 614 VVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLD 673
YLDD L+V L + + + G+ + +K L V ++LG ++D
Sbjct: 642 ATYLDDILIVATTSEQLRERTTAVLQRVSDNGFRLRPEKCQLFLKSV-KYLGFIFDAAGR 700
Query: 674 RMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIP 718
R P+ + + +RT+ N+ + RS LG +S+ S +P
Sbjct: 701 R---PDPENI---RAIRTMPTPT--NISTLRSFLGLVSYYSAFVP 737
>gi|67625723|tpe|CAJ00250.1| TPA: pol polyprotein [Schistosoma mansoni]
Length = 1028
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 93/420 (22%), Positives = 179/420 (42%), Gaps = 39/420 (9%)
Query: 483 MLETGVLKRLDSTTGFLSRLFLVPKGNGGT-RPVLNLKGLNQFLSPKKFSLINHFRIPSF 541
ML+ G+++ DS + S L +VPK N G RP + + LN+ P ++ + + +
Sbjct: 188 MLQLGIIRPSDSQ--WASPLHMVPKKNEGDWRPCGDYRALNRQTVPDRYPIPHIQDFTNG 245
Query: 542 LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNW 601
L + IDL +AY ++P+ A++ + +PFGL A Q F +
Sbjct: 246 LHGMNIFTKIDLVRAYHNIPVADEDIPKTAITTPFGLFEFIRMPFGLRNAAQTF---QRF 302
Query: 602 VASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSS-LSPAPV 660
+ +LLR Y+DD L+ + D + E K + L G +N+ +S +
Sbjct: 303 IDNLLRDMPF-AQGYIDDLLIASPDLQSHEQHVKTVLKRLDEHG--INIHQSKCVFGVQT 359
Query: 661 LQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPMG 720
L+FLG + P K+ + I + + S +L RS LG ++F IP
Sbjct: 360 LEFLGHTISSEGIK---PIKKE--VDTIKQYPIPS---SLTQLRSFLGLINFYRRFIPGC 411
Query: 721 RLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPI-FPRQVQHF-ISTDA 778
++ +Q L+ G P ++ + ++ + L ++ + +P + + DA
Sbjct: 412 ---AQLMQPLTDSLK-GKPKEFKLSSDAVEAIKQLKDKLAQATTLMYPNSLSPLALMVDA 467
Query: 779 SDLGWGSQVDS---------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV 829
SD G ++ +F S + + + +E+ A++ + +L+
Sbjct: 468 SDKAVGGTLNQLVKNAWKPIAFFSKRLAPAETRYSTFGRELLAIYLTIKHFRHMLEGREF 527
Query: 830 MVQSDNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
MV +D++ + + L+ + S + ++ I + D R + G N AD+LSR
Sbjct: 528 MVFTDHKPLTNALKARADKYSPREVRHLDYISQFTSDIR------HVKGQDNQAADALSR 581
>gi|425766248|gb|EKV04872.1| hypothetical protein PDIG_86730 [Penicillium digitatum PHI26]
Length = 1465
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 68/147 (46%), Gaps = 4/147 (2%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRI 538
++++L+ G+++ S GF + V K G R ++ +GLN+ + + L +
Sbjct: 243 QVKDLLDRGLIQVSSSPWGFP--VVFVKKPGGEWRMCIDYRGLNELTAKNGYPLPRIQDL 300
Query: 539 PSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASL 598
+ + Y+ IDL+ Y+ V + A + +PFGL AP F +L
Sbjct: 301 LDIVGQAKYLSKIDLAAGYWQVRMADDAVPKTAFNTVWGKYEWRAMPFGLCNAPATFQTL 360
Query: 599 SNWVASLLRSRGMRVVVYLDDFLLVNQ 625
N +L G VVVYLDD L+ +Q
Sbjct: 361 MN--ETLRPYLGRSVVVYLDDILVYSQ 385
>gi|270003675|gb|EFA00123.1| hypothetical protein TcasGA2_TC002939 [Tribolium castaneum]
Length = 2951
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 92/457 (20%), Positives = 197/457 (43%), Gaps = 54/457 (11%)
Query: 454 SAKPPLVPLCSLQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTR 513
+AKP + P+ +++ + ++E+ + ++K+++ ++ +++ LV K +G R
Sbjct: 468 NAKPVICPI---RNVPFALRDKFKTCLEELEQAQIIKKVEGSSEWVNSYVLVKKQDGSLR 524
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
L+ + LN+ + K+ + N I + L ++D + ++++P+ T +
Sbjct: 525 VCLDPQNLNKVIKNHKYKIPNIDEITNKLNGSKIYSTLDAASGFWNIPLDETSSKLCTFG 584
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDP----RI 629
+P G+ A + F + + + G V VY+DD L+ ++ +I
Sbjct: 585 TPYGRYRFLRMPMGIKVASEVFQE---YFSEIFNIPG--VEVYVDDILIYAKNKTEHDKI 639
Query: 630 LEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNIL 689
LE I NL K + ++LG + + + E+K + N
Sbjct: 640 LE----QVFQIAKEKNIKFNLSKCRFGLNEI-KYLGHKFSAA--GISVDEEKIDAIKN-- 690
Query: 690 RTLLASKTWNLDSARSLLGYLSF-ASFVIPMGR--LHSRRIQRQASLLRLGAPHLTPINP 746
+ S T D R LG +++ F+ + H R++ +Q N
Sbjct: 691 ---MPSPTCKKDIER-FLGLVTYVGKFINNLSEKTYHLRKLLKQDVCFEWEQEQQEAFNN 746
Query: 747 AVLPKLEWWLNALPLSSP---IFPRQVQHFISTDASDLGWGSQV--DS---SFLSGLWSR 798
L ++ +S P F + + IS DAS G G+ + D+ +F S +
Sbjct: 747 ---------LKSIIVSKPCLQFFDPKKEITISVDASQNGLGAVLLQDNKPCAFASRAMTE 797
Query: 799 EQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTKSLSLL-SEV 857
Q+ + +KE+ A+H ++ + V++D++ ++S + KSL+ + +
Sbjct: 798 TQKRYAQIEKELLAIHFGVNKFYQYIFGREFNVETDHKPLISIFK-----KSLNDCPARL 852
Query: 858 EKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSLP 894
+++ L Q + + + ++ PG VAD+LSR+ +LP
Sbjct: 853 QRMRLSLQKFDLSL--KYKPGKDLIVADTLSRA-TLP 886
>gi|56266245|emb|CAE81279.1| polyprotein [Cacao swollen shoot virus]
Length = 1816
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 90/453 (19%), Positives = 177/453 (39%), Gaps = 50/453 (11%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKG-----------NGGTR 513
++HL + H++ +L+ GV++ S + + F+V G +G R
Sbjct: 1292 IKHLTPAMEKQFQKHVKALLDIGVIR--PSKSKHRTTAFIVESGTVIDPVTKKTIHGKER 1349
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1350 MVFNYKRLNDNTEKDQYSLPGIQTILKRVGNKKIFSKFDLKSGFHQVAMAKESIPWTAFW 1409
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + + VY+DD L+ ++
Sbjct: 1410 VPQGLYEWLVMPFGLKNAPAVFQRKMD---QCFKGTEEFIAVYIDDILVFSETMAEHTKH 1466
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLL 693
+ ++I G +++ K L+ + +FLG + + + + +I++ ++
Sbjct: 1467 IGIMLTICQENGLVLSPNKICLAQREI-EFLGTI---------ISQGQMKLQPHIIKKIV 1516
Query: 694 ASKTWNLDSA---RSLLGYLSFASFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLP 750
L++ RS LG L++A IP L + A G + ++
Sbjct: 1517 NKADMELETTKGLRSFLGLLNYARIYIP--NLGKKLSPLYAKTSPTGEKKFNRQDWHLIK 1574
Query: 751 KLEWWLNALPLSSPIFPRQVQHFISTDASDLGWG-------SQVDSSFLSGLWSREQQNW 803
+++ + LP + I P + I +D GWG ++ DS + + +
Sbjct: 1575 EIKNMVQKLP-NLAIPPARCCIIIESDGCMEGWGAVCKWKLAKEDSRTTEKICAYASGKF 1633
Query: 804 HINKK----EMFAVHQAL-SLNLPLLQSSVVMVQSDNQTVVSYLRRQGGTK--SLSLLSE 856
I K E+FA+ +AL S + L ++ ++D Q +V++ + K + ++
Sbjct: 1634 GIIKSTIDAEIFALIKALESFKIFYLDKKHLVARTDCQAIVTFYNKTSTHKPSRIRWITF 1693
Query: 857 VEKIFLLSQDWRIHILAQFIPGAYNSVADSLSR 889
+ I L + + + I G N +AD+LSR
Sbjct: 1694 SDYITGLG----VQVTIEHINGKENQLADTLSR 1722
>gi|110739791|dbj|BAF01802.1| hypothetical protein [Arabidopsis thaliana]
Length = 576
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 73/179 (40%), Gaps = 4/179 (2%)
Query: 479 HIQEMLETGVLKRLD--STTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHF 536
+++ LE + KR ST+ + + + + K +G R ++ +GLNQ K+ L
Sbjct: 382 ELKKQLEDFLGKRFIRPSTSPWRAPMLFMKKKDGSFRLCIDYRGLNQVTVKNKYPLPRID 441
Query: 537 RIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFA 596
+ L+ IDL+ Y +PI R A +PFGL AP AF
Sbjct: 442 ELLDQLRGATCFSKIDLTSDYHQIPIAEADVRKTAFRTRYGHFEFVVMPFGLTNAPAAFM 501
Query: 597 SLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSL 655
L N V V++++DD L+ ++ P E+ + L L K S
Sbjct: 502 RLMNSVFQEFLDEF--VIIFIDDILVYSKSPEEHEVHLRRVKEKLREQKLFAKLSKCSF 558
>gi|313240429|emb|CBY32766.1| unnamed protein product [Oikopleura dioica]
Length = 2001
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 79/153 (51%), Gaps = 9/153 (5%)
Query: 479 HIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLK-GLNQFL-SPKKFSLIN-- 534
I ++ + GVL D++ G+ + L V K NGGTR +LNL +N L + F++ N
Sbjct: 594 EIDKLKQIGVLVPSDNSCGWNTPLGAVTKSNGGTRLILNLNLTVNPLLRNADTFAIPNID 653
Query: 535 -HFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQ 593
+P ++ Y +D++ Y+++ ++ + Q L++ +N + L + LPFGL ++
Sbjct: 654 ASMELPLGMR---YFGVMDIANGYWNIRVRESDQVKLSIFWNDECLKFSRLPFGLKSSGH 710
Query: 594 AFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
F ++ R V +++DD L+ +D
Sbjct: 711 LFVRAITHALKGMKYRD-NVKIFVDDALIFAKD 742
>gi|71029838|ref|XP_764562.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351516|gb|EAN32279.1| hypothetical telomeric SfiI fragment 20 protein 3 [Theileria parva]
Length = 3300
Score = 57.8 bits (138), Expect = 4e-05, Method: Composition-based stats.
Identities = 52/210 (24%), Positives = 77/210 (36%), Gaps = 29/210 (13%)
Query: 38 QAITARKSSTLDQSPAVSDPGMASGVSDQSPPLSSAPANPVQASAPVQAAHQPSATVQAA 97
+ +T K++T V+ P A +PP + A P ++ P ++A T A
Sbjct: 629 EVVTPAKAAT------VTTPAKAPSPKVPTPPTADESATP--STTPDESATPVVTTPAKA 680
Query: 98 PGSSASVLAAPLPSSSGQPFISPPAQ--SAAFLAQPASTASLPPSAAHHLYPLPFFCDPS 155
P A V P P S P ++ PA+ S P + S PS P P+
Sbjct: 681 P--DAKVTTPPTPDESATPVVTTPAKAPSPKVPTPPTADESATPSTTPDESATPVVTTPA 738
Query: 156 YYGHYLQSAMKASRGQVAQPPPPSESVTPIPLSPVSSDQEDFSEEDEVVDCNPPALFSFA 215
KA +V PP P ES TP+ +P + D V P S
Sbjct: 739 ----------KAPDAKVTTPPTPDESATPVVTTPAKA-------PDAKVTTPPTPDESAT 781
Query: 216 PSTKEREPSIPDPDSELASQGVVCQKLGSP 245
PST E + P + A+ + +P
Sbjct: 782 PSTTADESATPAKATPSATPSTTADESATP 811
>gi|384403233|gb|AFH88826.1| ORFIII [Banana streak virus]
Length = 1709
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 111/507 (21%), Positives = 192/507 (37%), Gaps = 101/507 (19%)
Query: 465 LQHLATPVSSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGN-----------GGTR 513
L+H+ + M+ H+Q++LE V++ S++ + +V G G R
Sbjct: 1241 LKHVTPTMKETMAKHVQKLLELKVIR--PSSSKHRTTAMIVESGTEVDPMTGKERRGKER 1298
Query: 514 PVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALS 573
V N K LN ++SL I + DL + V + + A
Sbjct: 1299 LVFNYKRLNDNTEKDQYSLPGINTIIKRIGNAKIYSKFDLKSGFHQVAMDPESIPWTAFW 1358
Query: 574 YNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQ 633
+ +PFGL AP F + + R + VY+DD L+ ++ +
Sbjct: 1359 AIDGLYEWLVMPFGLKNAPAIFQRKMD---NCFRGTEDFIAVYIDDILVFSETIHQHKEH 1415
Query: 634 GKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGN------ 687
K ++I G +++ K + + FLG T+GN
Sbjct: 1416 LKKFMTICEKNGLVLSPTKMKIGTRQI-DFLG-----------------ATIGNSKIKLQ 1457
Query: 688 --ILRTLLASKTWNLDSARSL---LGYLSFASFVIP-----MGRLHS-------RRIQRQ 730
I++ ++ K L + L LG L++A IP +G L++ RR+ Q
Sbjct: 1458 PHIIKKIIEMKDEELKEVKGLRKWLGILNYARSYIPKLGKILGPLYAKTSPNGERRMNTQ 1517
Query: 731 ASLLRLGAPHLTPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTDASDLGWGS----Q 786
+ + A LP+LE LP P + I TD GWG +
Sbjct: 1518 DWKIVKEVKEVV----ANLPELE-----LP------PEKAIMIIETDGCMEGWGGVCKWK 1562
Query: 787 VDSSFLSGLWSREQQNWHINK---------KEMFAVHQAL-SLNLPLLQSSVVMVQSDNQ 836
DS L WS + + K E+ AV +L + L +++++D+Q
Sbjct: 1563 TDS--LQPRWSEKICAYASGKFTPTKSTIDAEIQAVINSLHKFKIYYLDKKELIIRTDSQ 1620
Query: 837 TVVSYLRRQGGTK--SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL- 893
+VS+ ++ K + L+ + I + I + I G N +AD+LSR +
Sbjct: 1621 AIVSFYKKSSDHKPSRVRWLAFTDYITGTG----LEIKFEHIDGKDNVLADTLSRLVKII 1676
Query: 894 --PDWHLSR----SATEQIFLKWGVPC 914
P+ H S +A E++F + C
Sbjct: 1677 LHPEKHQSEGVLINAVEEVFHNYTKGC 1703
>gi|241956792|ref|XP_002421116.1| retrotransposon reverse transcriptase, pseudogene, putative
[Candida dubliniensis CD36]
gi|223644459|emb|CAX41275.1| retrotransposon reverse transcriptase, pseudogene, putative
[Candida dubliniensis CD36]
Length = 230
Score = 57.4 bits (137), Expect = 4e-05, Method: Composition-based stats.
Identities = 40/160 (25%), Positives = 72/160 (45%), Gaps = 16/160 (10%)
Query: 473 SSAMSLHIQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSL 532
S + +++ L+ G ++ + + F + + +VPK NG R + +GLN K ++
Sbjct: 40 SDELQRQLKQYLDAGFIE--PTVSPFGAPIVMVPKANGEVRLCNDFRGLN------KLTI 91
Query: 533 INHFRIPSF------LQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPF 586
+HF +P+ ++ Y SIDL Q Y V I + A +PF
Sbjct: 92 ADHFHLPNMEELLMEVKNSTYYSSIDLCQGYHQVLINEADKEKTAFHTPFGSFHWVVMPF 151
Query: 587 GLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQD 626
GL AP +F + V M ++YLDD ++ +++
Sbjct: 152 GLINAPASFQRMMEQVFREYNHDFM--LIYLDDLIIYSKN 189
>gi|189242337|ref|XP_001810078.1| PREDICTED: similar to orf [Tribolium castaneum]
gi|270016528|gb|EFA12974.1| hypothetical protein TcasGA2_TC004277 [Tribolium castaneum]
Length = 1399
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 62/143 (43%), Gaps = 4/143 (2%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ +ML G++ DS + + S + LV K +G R ++ + LN + +
Sbjct: 530 VDDMLSAGIIS--DSNSEYSSPVLLVKKKDGSNRLCIDYRRLNAITVKEYVPMQIIDEQL 587
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L Y ++DL+ Y VP+ + + +PFGL AP F L
Sbjct: 588 DLLSGNGYFTTLDLASGYMQVPVAKESRHLTSFVTTTGQYEFNRMPFGLVNAPSVFNRLM 647
Query: 600 NWVASLLRSRGMRVVVYLDDFLL 622
N V + RG+ V +Y+DD L+
Sbjct: 648 NMVTRKI-GRGV-VTIYMDDILI 668
>gi|449672269|ref|XP_004207675.1| PREDICTED: uncharacterized protein LOC101237757 [Hydra
magnipapillata]
Length = 647
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/161 (22%), Positives = 74/161 (45%), Gaps = 2/161 (1%)
Query: 467 HLATPVSSAMSLHIQEMLETGVLKRLDST-TGFLSRLFLVPKGNGGTRPVLNLKGLNQFL 525
H++ V S + I E + G LK + G +S + ++ + V++ + LNQF+
Sbjct: 125 HISEDVKSEYATEISEWIAQGWLKLFEGKYNGIISLMAVIQRNKLKVSLVMDYRELNQFV 184
Query: 526 SPKKFS-LINHFRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCL 584
S + ++ ++ + G+ + IDL +AY + + ++ + Y G +T L
Sbjct: 185 SSHTADGDVCSTKLRNWRKLGENLEIIDLKKAYLQIRVDEALWKYQVVEYEGQRYCLTRL 244
Query: 585 PFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFLLVNQ 625
FGL AP+ + V S+ + G ++DD ++ N
Sbjct: 245 GFGLNVAPRIMTKILKKVLSIDKIVGSGTDSFIDDIIVNNN 285
>gi|367012495|ref|XP_003680748.1| hypothetical protein TDEL_0C06480 [Torulaspora delbrueckii]
gi|359748407|emb|CCE91537.1| hypothetical protein TDEL_0C06480 [Torulaspora delbrueckii]
Length = 1374
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 94/424 (22%), Positives = 176/424 (41%), Gaps = 44/424 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
+ ++LE G ++ S + + S + LV K +G R ++ + LN+ F L +
Sbjct: 522 VNDLLEKGFIE--PSKSPYSSPVVLVKKKDGSFRLCVDYRVLNEATIKDPFPLPRIEVLL 579
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
+ + K ++DL Y +P+K A + + +PFGL AP F S
Sbjct: 580 AKIGKASIFSTLDLHSGYHQIPVKPEDVPKTAFTTHNGKYQYRVMPFGLVNAPSTF---S 636
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAP 659
++A + R V+VYLDD L+++ + + L I +K
Sbjct: 637 RYMADIFRDLPF-VLVYLDDILVISTSEKQHIEHLNTVLGRLQEHQLIAKEKKCQFLQTE 695
Query: 660 VLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFASFVIPM 719
V +FLG H R P + + + + K+ + A+ LG +++ IP
Sbjct: 696 V-EFLGYHISEHCIR---PIKGKC---DAIHAIPPCKS--IKDAQRFLGMINYYRRFIP- 745
Query: 720 GRLHSRRIQRQASLLRLGAPHL--TPINPAVLPKLEWWLNALPLSSPIFPRQVQHFISTD 777
H I L+ A T + +L+ L A PL P F + + ++TD
Sbjct: 746 ---HCSTIAH--PLIDYAAKKTPWTTLQTNAFNELKRLLVAAPLLVP-FCTEHSYRLTTD 799
Query: 778 ASDLGWGSQVDS----------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSS 827
AS G G+ ++ + S Q N+ + E+ + ++L LL
Sbjct: 800 ASKSGLGAVLEQMEGKKVVGVVGYFSKSLQGAQNNYPAGELELLGIIESLRHFKYLLHGK 859
Query: 828 VVMVQSDNQTVVSYLRRQGGTK-SLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADS 886
+++D+ +S L + T+ S+ L ++++ ++ I + +++ G N VAD+
Sbjct: 860 RFTLRTDH---ISLLALKNKTEPSIRLARWLDEL----AEYEIDL--EYLKGPDNVVADT 910
Query: 887 LSRS 890
LSR+
Sbjct: 911 LSRN 914
>gi|2801486|gb|AAC82578.1| Pr125 [Human spumaretrovirus]
Length = 556
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 103/219 (47%), Gaps = 17/219 (7%)
Query: 502 LFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIPSFLQKGDYMISIDLSQAYFHVP 561
++ VPK +G R VL+ + +N+ + + I + + + Y ++DL+ ++ P
Sbjct: 5 VYPVPKPDGRWRMVLDYREVNKTIPLTAAQNQHSAGILATIVRQKYKTTLDLANGFWAHP 64
Query: 562 IKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLSNWVASLLRSRGMRVVVYLDDFL 621
I A ++ G T LP G +P F + V LL+ V VY+DD
Sbjct: 65 ITPESYWLTAFTWQGKQYCWTRLPQGFLNSPALFTAD---VVDLLKEIP-NVQVYVDDIY 120
Query: 622 LVNQDPRILEIQGKLAVSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDPHLDRMWLPEDK 681
L + DP+ Q + IL G++V+L+KS + V +FLG ++ + E +
Sbjct: 121 LSHDDPKEHVQQLEKVFQILLQAGYVVSLKKSEIGQKTV-EFLGF----NITK----EGR 171
Query: 682 QLTLGNILRTLLASKT--WNLDSARSLLGYLSFASFVIP 718
LT + +T L + T +L +S+LG L+FA IP
Sbjct: 172 GLT--DTFKTKLLNITPPKDLKQLQSILGLLNFARNFIP 208
>gi|331554|gb|AAA21736.1| reverse transcriptase [Cauliflower mosaic virus]
Length = 680
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 159/392 (40%), Gaps = 41/392 (10%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLV----PKGNGGTRPVLNLKGLNQFLSPKKFSLINH 535
I+E+L+ V+K S + ++ FLV K G R V+N K +N+ ++L N
Sbjct: 267 IKELLDLKVIK--PSKSPHMAPAFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNLPNK 324
Query: 536 FRIPSFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAF 595
+ + ++ S D ++ V + + A + +PFGL AP F
Sbjct: 325 DELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIF 384
Query: 596 ASLSNWVASLLRSRGMRVVVYLDDFLLV--NQDPRILEIQGKLAVSILGSLGWIVNLQKS 653
+ + R VY+DD L+ N++ +L + + + G I++ +K+
Sbjct: 385 QRHMDEAFRVFRKF---CCVYVDDILVFSNNEEDHLLHVA--MILQKCNQHGIILSKKKA 439
Query: 654 SLSPAPVLQFLGIMWDPHLDRMWLPEDKQLTLGNILRTLLASKTWNLDSARSLLGYLSFA 713
L + FLG+ D + P+ L N L K + LG L++A
Sbjct: 440 QLFKKKI-NFLGLEIDEGTHK---PQGHILEHINKFPDTLEDKK----QLQRFLGILTYA 491
Query: 714 SFVIPMGRLHSRRIQRQASLLRLGAPHLTPINPAVLPKLEWWLNALP-LSSPIFPRQVQH 772
S IP +L R QA L T + + K++ L P L P+ ++
Sbjct: 492 SDYIP--KLAQIRKPLQAKLKENVPWKWTKEDTLYMQKVKKNLQGFPPLHHPLPEEKL-- 547
Query: 773 FISTDASDLGWG-------------SQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSL 819
I TDASD WG +++ + SG + ++N+H N KE AV +
Sbjct: 548 IIETDASDDYWGGMLKAIKINEGINTELICRYASGSFKAAERNYHSNDKETLAVINTIKK 607
Query: 820 NLPLLQSSVVMVQSDNQTVVSY--LRRQGGTK 849
L + + ++DN S+ L +G +K
Sbjct: 608 FSIYLTPAHFLTRTDNTHFKSFVNLNYKGDSK 639
>gi|307212135|gb|EFN87992.1| hypothetical protein EAI_06111 [Harpegnathos saltator]
Length = 213
Score = 57.4 bits (137), Expect = 5e-05, Method: Composition-based stats.
Identities = 46/151 (30%), Positives = 71/151 (47%), Gaps = 1/151 (0%)
Query: 774 ISTDASDLGWGSQVDSSFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVVMVQS 833
I +DAS WG+ S WS + + HIN E+ A AL L V+++
Sbjct: 6 IFSDASLNRWGASCGDSRTHVWWSADDRALHINTLELKAAFNALRCFTADLSDCDVLLRI 65
Query: 834 DNQTVVSYLRRQGGTKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVADSLSRSKSL 893
DN T ++Y+ + G + L + +I ++ I I A I N +AD SR K
Sbjct: 66 DNTTALAYINKFGSVQYPRLPAISGEIGCWCEERNIFIFALTISSMENFIADCESRCKDP 125
Query: 894 -PDWHLSRSATEQIFLKWGVPCIDLFASRVS 923
+W LS A +Q+ +G I+LFAS ++
Sbjct: 126 GTEWCLSDEAFQQVNKAFGPFDINLFASAIN 156
>gi|156846146|ref|XP_001645961.1| Tkp3 protein [Vanderwaltozyma polyspora DSM 70294]
gi|156116632|gb|EDO18103.1| Tkp3 protein [Vanderwaltozyma polyspora DSM 70294]
Length = 1665
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 102/425 (24%), Positives = 179/425 (42%), Gaps = 73/425 (17%)
Query: 499 LSRLFLVPKGNGGTRPVLNLKGLNQFL--SPKKFSLINHFRIPSFLQKGDYMISIDLSQA 556
LS +F + + R + +L+ +N L +P+ I H I S L S+D+ +A
Sbjct: 780 LSPVFPIQQSKDKIRIITDLRKVNNHLQYTPRPIPPIQH--IFSNLANKTIFSSLDIRKA 837
Query: 557 YFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFAS-LSNWVASLLRSRGMRVVV 615
Y +PIK L L LP+GLA+AP + + N + L S+ V
Sbjct: 838 YQQIPIKGDK---LGLITEFGSYKFNRLPYGLASAPYWWGEFIQNLLKQLPTSKNTTVSY 894
Query: 616 YLDDFLLVNQDPRILEIQGKLA-----VSILGSLGWIVNLQKSSLSPAPVLQFLGIMWDP 670
Y DD ++ + L I + + +L G ++ +K ++ + + FLG ++
Sbjct: 895 YYDDLVIAS-----LTIADHYSTLQNIMRLLSDHGLSLSYEKIHIAESKI-HFLG--YEI 946
Query: 671 HLDRMWLPEDKQLTLGNILRTLLASKTWNL----DSARSLLGYLSFASFVIPMGRLHSRR 726
+R+ + +DK+ T+ W L + G+++F IP S+
Sbjct: 947 SHNRLAIDKDKKNTIAQ----------WELPQDKKAIEKFTGFVNFLRNFIPNA---SKL 993
Query: 727 IQ--RQASLLRLGAPHLTPINPAVLPKLEWWLNAL--PLSSPIFPRQVQHFISTDASDLG 782
+Q Q + + H+ A+ A+ ++ +F Q I TDAS G
Sbjct: 994 LQPFYQFATGKTPTQHINTTKSAMTTNFHLIKQAILKSITLKLFDPQAPTIIYTDASLTG 1053
Query: 783 WGS---QVDS----------SFLSGLWSREQQNWHINKKEMFAVHQALSLNLPLLQSSVV 829
S Q ++ +F S ++ QQ + ++E++AV L LL SS +
Sbjct: 1054 AASILLQPENQNGNTILYPIAFYSIRFNPTQQRYSTVERELWAVLHTLE-KARLLLSSNI 1112
Query: 830 MVQSDNQTVVSYLRRQGG-----TKSLSLLSEVEKIFLLSQDWRIHILAQFIPGAYNSVA 884
+ +DNQ ++S + + TK L LL+ + L+ W+ +I G+ N VA
Sbjct: 1113 TIYTDNQGIISIGKTERATHPRLTKYLDLLN----TYRLT--WK------YIKGSQNYVA 1160
Query: 885 DSLSR 889
D LSR
Sbjct: 1161 DYLSR 1165
>gi|391337694|ref|XP_003743200.1| PREDICTED: uncharacterized protein K02A2.6-like [Metaseiulus
occidentalis]
Length = 624
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 70/150 (46%), Gaps = 5/150 (3%)
Query: 480 IQEMLETGVLKRLDSTTGFLSRLFLVPKGNGGTRPVLNLKGLNQFLSPKKFSLINHFRIP 539
I + + GV++R+ ++ ++S + + K NG R ++L+ +N+ + F L + +
Sbjct: 464 IDRLEQEGVIERIQASE-WVSPIVVAEKKNGDVRLCVDLREVNKAVVQDAFPLPHIEDLM 522
Query: 540 SFLQKGDYMISIDLSQAYFHVPIKTTHQRFLALSYNGDVLAMTCLPFGLATAPQAFASLS 599
L KG IDL AY +P+ + + A + T + FGLA+AP AF +
Sbjct: 523 QRLAKGRAFSKIDLRSAYHQIPLHESSRDLTAFVSPWGLFRYTRVCFGLASAPAAFQAFM 582
Query: 600 NWVASLLRSRGMRVVVYLDDFLLVNQDPRI 629
L V+ YLDD L + + R
Sbjct: 583 EETLKDLEG----VICYLDDVLALAKRGRF 608
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.133 0.399
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,489,563,292
Number of Sequences: 23463169
Number of extensions: 759418728
Number of successful extensions: 3255978
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 630
Number of HSP's successfully gapped in prelim test: 41049
Number of HSP's that attempted gapping in prelim test: 3116272
Number of HSP's gapped (non-prelim): 103651
length of query: 1097
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 943
effective length of database: 8,745,867,341
effective search space: 8247352902563
effective search space used: 8247352902563
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 83 (36.6 bits)