BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013748
(437 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|302144065|emb|CBI23170.3| unnamed protein product [Vitis vinifera]
Length = 678
Score = 746 bits (1925), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/437 (80%), Positives = 388/437 (88%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWLL +CP LAKIPHVLVIHGE DGTL+HMK+NKP NWILHKPPLPISFGTHHSKA
Sbjct: 242 MVDIDWLLSSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPNWILHKPPLPISFGTHHSKA 301
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K Q LS+ C FENDLIDYLS L
Sbjct: 302 MLLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQKELSKGCAFENDLIDYLSVL 361
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF+ANLPA G+F IN SFFKKF++S+A VRLIASVPGYHTGS+LKKWGHMKL +VLQ
Sbjct: 362 KWPEFTANLPALGSFNINSSFFKKFDYSNAVVRLIASVPGYHTGSNLKKWGHMKLCSVLQ 421
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
EC F+K F+KSPL YQFSSLGSLDEKWM EL+SSMSSG +DKTPLG+G+PLI+WPTVED
Sbjct: 422 ECIFDKEFQKSPLAYQFSSLGSLDEKWMTELASSMSSGSCDDKTPLGLGKPLIIWPTVED 481
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCSLEGYAAGNAIPSPQKNV+K+FLKKYWAKWKA+HTGR RAMPHIKT+ RYNGQ LAW
Sbjct: 482 VRCSLEGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTYTRYNGQNLAW 541
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
FLLTSANLSKAAWGALQKNNSQLMIRSYELGVL LPS G GFSCT N PS+ K G
Sbjct: 542 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPINRGQGFSCTDNGSPSKNKCGL 601
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
+E ++ Q+TKLVTLTW G+ + +SSEV+ LPVPYELPP++YSSEDVPWSWD+RY KKDV
Sbjct: 602 SENTKSQRTKLVTLTWEGNRSSDSSSEVIPLPVPYELPPKQYSSEDVPWSWDRRYYKKDV 661
Query: 421 YGQVWPRHFQLYAFQDS 437
GQVWPRH QLY+ DS
Sbjct: 662 CGQVWPRHVQLYSSPDS 678
>gi|359483320|ref|XP_002265078.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Vitis vinifera]
Length = 621
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/437 (80%), Positives = 388/437 (88%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWLL +CP LAKIPHVLVIHGE DGTL+HMK+NKP NWILHKPPLPISFGTHHSKA
Sbjct: 185 MVDIDWLLSSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPNWILHKPPLPISFGTHHSKA 244
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K Q LS+ C FENDLIDYLS L
Sbjct: 245 MLLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQKELSKGCAFENDLIDYLSVL 304
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF+ANLPA G+F IN SFFKKF++S+A VRLIASVPGYHTGS+LKKWGHMKL +VLQ
Sbjct: 305 KWPEFTANLPALGSFNINSSFFKKFDYSNAVVRLIASVPGYHTGSNLKKWGHMKLCSVLQ 364
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
EC F+K F+KSPL YQFSSLGSLDEKWM EL+SSMSSG +DKTPLG+G+PLI+WPTVED
Sbjct: 365 ECIFDKEFQKSPLAYQFSSLGSLDEKWMTELASSMSSGSCDDKTPLGLGKPLIIWPTVED 424
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCSLEGYAAGNAIPSPQKNV+K+FLKKYWAKWKA+HTGR RAMPHIKT+ RYNGQ LAW
Sbjct: 425 VRCSLEGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTYTRYNGQNLAW 484
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
FLLTSANLSKAAWGALQKNNSQLMIRSYELGVL LPS G GFSCT N PS+ K G
Sbjct: 485 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPINRGQGFSCTDNGSPSKNKCGL 544
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
+E ++ Q+TKLVTLTW G+ + +SSEV+ LPVPYELPP++YSSEDVPWSWD+RY KKDV
Sbjct: 545 SENTKSQRTKLVTLTWEGNRSSDSSSEVIPLPVPYELPPKQYSSEDVPWSWDRRYYKKDV 604
Query: 421 YGQVWPRHFQLYAFQDS 437
GQVWPRH QLY+ DS
Sbjct: 605 CGQVWPRHVQLYSSPDS 621
>gi|255554997|ref|XP_002518536.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
gi|223542381|gb|EEF43923.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
Length = 665
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/438 (77%), Positives = 381/438 (86%), Gaps = 3/438 (0%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWL+ ACP LAK+P+VLV+HGE DGTLEHMKR KPANWILHKPPLPISFGTHHSKA
Sbjct: 230 MVDIDWLMSACPALAKVPNVLVLHGEGDGTLEHMKRTKPANWILHKPPLPISFGTHHSKA 289
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YPRG+RIIVHTANLI+VDWNNK+QGLWMQDFP KD+ + ++ CGFENDL+DYL+TL
Sbjct: 290 MLLVYPRGMRIIVHTANLIYVDWNNKTQGLWMQDFPWKDEKSQTKGCGFENDLVDYLNTL 349
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF+ LPA G+F INPSFFKKF++S+AAVRLIASVPGYHTG +LKKWGHMKLR+VLQ
Sbjct: 350 KWPEFTVKLPALGSFTINPSFFKKFDYSTAAVRLIASVPGYHTGPNLKKWGHMKLRSVLQ 409
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
ECTF K FK SPL YQFSSLGSLD KWM EL++S+SSG SED+TPLG+GEP I+WPTVED
Sbjct: 410 ECTFRKEFKNSPLAYQFSSLGSLDAKWMTELATSLSSGLSEDRTPLGLGEPRIIWPTVED 469
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCSLEGYAAGNAIPSP KNV+KD LKKYW+KWKA+H+GR RAMPHIKTF RYNGQKLAW
Sbjct: 470 VRCSLEGYAAGNAIPSPLKNVEKDILKKYWSKWKATHSGRCRAMPHIKTFTRYNGQKLAW 529
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSG 359
LLTSANLSKAAWGALQKNNSQLMIRSYELGVL LPS+ K HGC SCT + SE + G
Sbjct: 530 LLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSSYKNHGCRLSCTDHGARSEDEYG 589
Query: 360 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKD 419
S+ KT+LVTL W G D SS+V+ LPVPYELPPQ YSSEDVPWSWD+RY+KKD
Sbjct: 590 LLADSEEPKTELVTLMWQGPKD--PSSQVIPLPVPYELPPQPYSSEDVPWSWDRRYSKKD 647
Query: 420 VYGQVWPRHFQLYAFQDS 437
VYGQVWPR QLY DS
Sbjct: 648 VYGQVWPRLVQLYTSLDS 665
>gi|224078752|ref|XP_002305614.1| predicted protein [Populus trichocarpa]
gi|222848578|gb|EEE86125.1| predicted protein [Populus trichocarpa]
Length = 599
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/428 (77%), Positives = 374/428 (87%), Gaps = 3/428 (0%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+DWLL ACP +AK+P+V+VIHGE DGTLEHMKR KPANWILHKP LPISFGTHHSKA
Sbjct: 174 MVDMDWLLSACPTIAKVPNVMVIHGEGDGTLEHMKRRKPANWILHKPRLPISFGTHHSKA 233
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
M L+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K++ + CGFENDL+DYLS L
Sbjct: 234 MFLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKEEKKPGKGCGFENDLVDYLSML 293
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF+ LP G+ IN SFFKKF++S AAVRLIASVPGYHTG++L+KWGHMKL++VLQ
Sbjct: 294 KWPEFTVKLPNLGSISINASFFKKFDYSHAAVRLIASVPGYHTGANLRKWGHMKLQSVLQ 353
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
ECTF+ FK+SPLVYQFSSLGSLDEKWM EL+ SMSSG++EDKTPLG+G P I+WPTVED
Sbjct: 354 ECTFDNEFKRSPLVYQFSSLGSLDEKWMTELAISMSSGYAEDKTPLGLGVPQIIWPTVED 413
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCSLEGYAAGNAIP P KNV+K FLKKYWAKWKASH+GR RAMPHIKTF RYNGQKLAW
Sbjct: 414 VRCSLEGYAAGNAIPGPLKNVEKGFLKKYWAKWKASHSGRCRAMPHIKTFTRYNGQKLAW 473
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSG 359
FLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS+ +R+G GFSCTSN PS G
Sbjct: 474 FLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSSIRRYGSGFSCTSNGGPSMDNCG 533
Query: 360 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKD 419
S S+ +T LVTL W G+SD ++S+V+ LPVPYELPP YSSEDVPWSWD+RY+KKD
Sbjct: 534 SLVDSEELRTTLVTLKWQGTSD--SASKVIPLPVPYELPPIPYSSEDVPWSWDRRYSKKD 591
Query: 420 VYGQVWPR 427
VYGQVWPR
Sbjct: 592 VYGQVWPR 599
>gi|449479663|ref|XP_004155668.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cucumis sativus]
Length = 959
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/431 (74%), Positives = 368/431 (85%), Gaps = 3/431 (0%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWL+PACP LAKIP VLVIHGE DGTL++MKR KPANWILHKPPLPISFGTHHSKA
Sbjct: 530 MVDIDWLIPACPTLAKIPQVLVIHGEGDGTLDNMKRKKPANWILHKPPLPISFGTHHSKA 589
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
+ L+YPRG+R++VHTANLI+VDWNNKSQGLWMQDFP KDQN+ S C FE+DL+DYLS L
Sbjct: 590 IFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNSSSRGCAFEDDLVDYLSAL 649
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF A+ P HGNF INP FF+KF++S AAVRLIASVPGYHTG LKKWGHMKLR+VLQ
Sbjct: 650 KWPEFPASFPGHGNFNINPYFFRKFDYSKAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ 709
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
EC F+K F++SPLVYQFSSLGSL+EKWMAE ++S+SSGF+ DKTPLG+GEPLIVWPTVED
Sbjct: 710 ECIFDKEFQRSPLVYQFSSLGSLNEKWMAEFAASLSSGFTPDKTPLGLGEPLIVWPTVED 769
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCSLEGYAAG+AIPSP KNV+K FL+KYWAKW + H+GR AMPHIKTFARYNGQKLAW
Sbjct: 770 VRCSLEGYAAGSAIPSPLKNVEKGFLRKYWAKWNSFHSGRCHAMPHIKTFARYNGQKLAW 829
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
+LTS+NLS+AAWGALQKNNSQLMIRSYELGVL LP KR+ FSCT N ++ KS
Sbjct: 830 LVLTSSNLSQAAWGALQKNNSQLMIRSYELGVLFLPQ-KRNDYSFSCTKNGGSAQNKSTV 888
Query: 361 TETSQI--QKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK 418
+ S+ KT+LVTL W + + SEV+ LP+PYELPPQ Y EDVPWSWD+RYT+K
Sbjct: 889 SRPSETLEGKTELVTLAWQENKKRESLSEVIQLPIPYELPPQPYGPEDVPWSWDRRYTQK 948
Query: 419 DVYGQVWPRHF 429
DV+G VWPR F
Sbjct: 949 DVHGAVWPRQF 959
>gi|449434370|ref|XP_004134969.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cucumis sativus]
Length = 613
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/429 (73%), Positives = 363/429 (84%), Gaps = 1/429 (0%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWL+PACP LAK+P VLVIHGE DGTL++MKR KPANWILHKPPLPISFGTHHSKA
Sbjct: 186 MVDIDWLIPACPALAKVPQVLVIHGEGDGTLDNMKRKKPANWILHKPPLPISFGTHHSKA 245
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
+ L+YPRG+R++VHTANLI+VDWNNKSQGLWMQDFP KDQN+ S C FE+DL+DYLS L
Sbjct: 246 IFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNSSSRGCAFEDDLVDYLSAL 305
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF A+ P HGNF INP FF+KF++S AAVRLIASVPGYHTG LKKWGHMKLR+VLQ
Sbjct: 306 KWPEFPASFPGHGNFNINPYFFRKFDYSKAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ 365
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
EC F+K F++SPLVYQFSSLGSL+EKWMAE ++S+SSGF+ DKTPLG+GEPLIVWPTVED
Sbjct: 366 ECIFDKEFQRSPLVYQFSSLGSLNEKWMAEFAASLSSGFTPDKTPLGLGEPLIVWPTVED 425
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCSLEGYAAG+A+PSP KNV+K FL KYWAKW + H+GR AMPHIKTFARYNGQKLAW
Sbjct: 426 VRCSLEGYAAGSAVPSPLKNVEKGFLTKYWAKWNSFHSGRCHAMPHIKTFARYNGQKLAW 485
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
+LTS+NLS+AAWGALQKNNSQLMIRSYELGVL LP KR+ FSCT N ++
Sbjct: 486 LVLTSSNLSQAAWGALQKNNSQLMIRSYELGVLFLPQ-KRNDYSFSCTKNGGSAQSTVSR 544
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
+ KT+LVTL W + + SEV+ LP+PYELPPQ Y EDVPWSW++RYT+KDV
Sbjct: 545 PSETLEGKTELVTLAWQENKKRESLSEVIQLPIPYELPPQPYGPEDVPWSWERRYTQKDV 604
Query: 421 YGQVWPRHF 429
+G VWPR F
Sbjct: 605 HGAVWPRQF 613
>gi|356507526|ref|XP_003522515.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like isoform 2 [Glycine
max]
Length = 599
Score = 672 bits (1735), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/429 (77%), Positives = 377/429 (87%), Gaps = 2/429 (0%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWL+PACP L+K+PHVLVIHGESDG ++++KR+KPANWILHKP LPISFGTHHSKA
Sbjct: 170 MVDIDWLVPACPALSKVPHVLVIHGESDGRVDYIKRSKPANWILHKPSLPISFGTHHSKA 229
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
M+LIYP+GVR+IVHTANLI+VDWNNKSQGLWMQDFP KDQN+LS+ GFENDL++YLS L
Sbjct: 230 MMLIYPQGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKDQNSLSKGSGFENDLVEYLSVL 289
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEFS NLP G+ I PSFF+KF++S A VRLIASVPGYH+GSSLKKWGHMKLR++LQ
Sbjct: 290 KWPEFSVNLPFLGSVSICPSFFRKFDYSDARVRLIASVPGYHSGSSLKKWGHMKLRSLLQ 349
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
ECTF++ FKKSPLVYQFSSLGSLDEKWM EL+SSMS+G SEDKTPLG+GEP I+WPTVED
Sbjct: 350 ECTFDEEFKKSPLVYQFSSLGSLDEKWMTELASSMSAGLSEDKTPLGMGEPQIIWPTVED 409
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCSLEGYAAGNA+PSP KNV+K FLKKYWAKWKA HTGR RAMPHIKTFARY Q LAW
Sbjct: 410 VRCSLEGYAAGNAVPSPLKNVEKTFLKKYWAKWKADHTGRCRAMPHIKTFARYKNQSLAW 469
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSG 359
FLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LPS KRH FSCTSN+ SE K
Sbjct: 470 FLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPSLFKRHESVFSCTSNVTVSEDKCP 529
Query: 360 STETSQIQKTKLVTLTWHGSSDAGASSEVVY-LPVPYELPPQRYSSEDVPWSWDKRYTKK 418
+ E+S+++KTKLVTLT +SSEV+ LP+PYELPP YSS+D+PWSWD++Y KK
Sbjct: 530 ARESSEMKKTKLVTLTGIKKESMHSSSEVIIPLPLPYELPPLPYSSQDIPWSWDRQYNKK 589
Query: 419 DVYGQVWPR 427
DVYG VWPR
Sbjct: 590 DVYGHVWPR 598
>gi|356507524|ref|XP_003522514.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like isoform 1 [Glycine
max]
Length = 610
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/429 (77%), Positives = 377/429 (87%), Gaps = 2/429 (0%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWL+PACP L+K+PHVLVIHGESDG ++++KR+KPANWILHKP LPISFGTHHSKA
Sbjct: 181 MVDIDWLVPACPALSKVPHVLVIHGESDGRVDYIKRSKPANWILHKPSLPISFGTHHSKA 240
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
M+LIYP+GVR+IVHTANLI+VDWNNKSQGLWMQDFP KDQN+LS+ GFENDL++YLS L
Sbjct: 241 MMLIYPQGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKDQNSLSKGSGFENDLVEYLSVL 300
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEFS NLP G+ I PSFF+KF++S A VRLIASVPGYH+GSSLKKWGHMKLR++LQ
Sbjct: 301 KWPEFSVNLPFLGSVSICPSFFRKFDYSDARVRLIASVPGYHSGSSLKKWGHMKLRSLLQ 360
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
ECTF++ FKKSPLVYQFSSLGSLDEKWM EL+SSMS+G SEDKTPLG+GEP I+WPTVED
Sbjct: 361 ECTFDEEFKKSPLVYQFSSLGSLDEKWMTELASSMSAGLSEDKTPLGMGEPQIIWPTVED 420
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCSLEGYAAGNA+PSP KNV+K FLKKYWAKWKA HTGR RAMPHIKTFARY Q LAW
Sbjct: 421 VRCSLEGYAAGNAVPSPLKNVEKTFLKKYWAKWKADHTGRCRAMPHIKTFARYKNQSLAW 480
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSG 359
FLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LPS KRH FSCTSN+ SE K
Sbjct: 481 FLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPSLFKRHESVFSCTSNVTVSEDKCP 540
Query: 360 STETSQIQKTKLVTLTWHGSSDAGASSEVVY-LPVPYELPPQRYSSEDVPWSWDKRYTKK 418
+ E+S+++KTKLVTLT +SSEV+ LP+PYELPP YSS+D+PWSWD++Y KK
Sbjct: 541 ARESSEMKKTKLVTLTGIKKESMHSSSEVIIPLPLPYELPPLPYSSQDIPWSWDRQYNKK 600
Query: 419 DVYGQVWPR 427
DVYG VWPR
Sbjct: 601 DVYGHVWPR 609
>gi|297811655|ref|XP_002873711.1| hypothetical protein ARALYDRAFT_488358 [Arabidopsis lyrata subsp.
lyrata]
gi|297319548|gb|EFH49970.1| hypothetical protein ARALYDRAFT_488358 [Arabidopsis lyrata subsp.
lyrata]
Length = 612
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/430 (73%), Positives = 361/430 (83%), Gaps = 7/430 (1%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+DWL+ ACP LA IP V+VIHGE DG E+++R KP NWILHKP LPISFGTHHSKA
Sbjct: 187 MVDVDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKPVNWILHKPRLPISFGTHHSKA 246
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN-NLSEECGFENDLIDYLST 119
+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD + + + CGFE DLIDYL+
Sbjct: 247 IFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKDDDKDPPKGCGFEGDLIDYLTV 306
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
LKWPEFSANLP GN KIN +FFKKF++S A VRLIASVPGYHTG +LKKWGHMKLRT+L
Sbjct: 307 LKWPEFSANLPGRGNVKINAAFFKKFDYSDAKVRLIASVPGYHTGLNLKKWGHMKLRTIL 366
Query: 180 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 239
QEC F++ F +SPLVYQFSSLGSLDEKW+AE +S+SSG SEDKTPLG G+PLI+WPTVE
Sbjct: 367 QECIFDREFCRSPLVYQFSSLGSLDEKWLAEFGNSLSSGISEDKTPLGPGDPLIIWPTVE 426
Query: 240 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 299
DVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+W A H+ R RAMPHIKTF RYN QKLA
Sbjct: 427 DVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWTADHSARGRAMPHIKTFTRYNDQKLA 486
Query: 300 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS-AKRHGCGFSCTSNIVPSEIKS 358
WFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS K GC FSCT + PS +K+
Sbjct: 487 WFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSPIKTQGCIFSCTES-NPSTMKA 545
Query: 359 GSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 417
+ +K +KLVT+TW G D S E++ LP+PYELPP+ YS+EDVPWSWD+ Y+K
Sbjct: 546 KQERKDEAEKRSKLVTMTWQGDRD---SPEIISLPIPYELPPKPYSAEDVPWSWDRGYSK 602
Query: 418 KDVYGQVWPR 427
KDVYGQVWPR
Sbjct: 603 KDVYGQVWPR 612
>gi|22326821|ref|NP_197021.2| tyrosyl-DNA phosphodiesterase 1 [Arabidopsis thaliana]
gi|23297734|gb|AAN13014.1| unknown protein [Arabidopsis thaliana]
gi|226511716|gb|ACO60340.1| tyrosyl-DNA phosphodiesterase I [Arabidopsis thaliana]
gi|332004741|gb|AED92124.1| tyrosyl-DNA phosphodiesterase 1 [Arabidopsis thaliana]
Length = 605
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/430 (72%), Positives = 361/430 (83%), Gaps = 7/430 (1%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWL+ ACP LA IP V+VIHGE DG E+++R KPANWILHKP LPISFGTHHSKA
Sbjct: 180 MVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKPANWILHKPRLPISFGTHHSKA 239
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN-NLSEECGFENDLIDYLST 119
+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD + + + CGFE DLIDYL+
Sbjct: 240 IFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKDDDKDPPKGCGFEGDLIDYLNV 299
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASVPGYHTG +L KWGHMKLRT+L
Sbjct: 300 LKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASVPGYHTGFNLNKWGHMKLRTIL 359
Query: 180 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 239
QEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG +EDKTPLG G+ LI+WPTVE
Sbjct: 360 QECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSGITEDKTPLGPGDSLIIWPTVE 419
Query: 240 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 299
DVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+WKA H+ R RAMPHIKTF RYN QK+A
Sbjct: 420 DVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWKADHSARGRAMPHIKTFTRYNDQKIA 479
Query: 300 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS-AKRHGCGFSCTSNIVPSEIKS 358
WFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS K GC FSCT + PS +K+
Sbjct: 480 WFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSPIKTQGCVFSCTES-NPSVMKA 538
Query: 359 GSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 417
+++K +KLVT+TW G D E++ LPVPY+LPP+ YS EDVPWSWD+ Y+K
Sbjct: 539 KQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQLPPKPYSPEDVPWSWDRGYSK 595
Query: 418 KDVYGQVWPR 427
KDVYGQVWPR
Sbjct: 596 KDVYGQVWPR 605
>gi|17381098|gb|AAL36361.1| unknown protein [Arabidopsis thaliana]
Length = 605
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/430 (72%), Positives = 361/430 (83%), Gaps = 7/430 (1%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWL+ ACP LA IP V+VIHGE DG E+++R KPANWILHKP LPISFGTHHSKA
Sbjct: 180 MVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKPANWILHKPRLPISFGTHHSKA 239
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN-NLSEECGFENDLIDYLST 119
+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD + + + CGFE DLIDYL+
Sbjct: 240 IFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKDDDKDPPKGCGFEGDLIDYLNV 299
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASVPGYHTG +L KWGHMKLRT+L
Sbjct: 300 LKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASVPGYHTGFNLNKWGHMKLRTIL 359
Query: 180 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 239
QEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG +EDKTPLG G+ LI+WPTVE
Sbjct: 360 QECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSGITEDKTPLGPGDSLIIWPTVE 419
Query: 240 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 299
DVRCSLEGYAAGNAIPSP KNV++ FLKKYWA+WKA H+ R RAMPHIKTF RYN QK+A
Sbjct: 420 DVRCSLEGYAAGNAIPSPLKNVEEPFLKKYWARWKADHSARGRAMPHIKTFTRYNDQKIA 479
Query: 300 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS-AKRHGCGFSCTSNIVPSEIKS 358
WFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS K GC FSCT + PS +K+
Sbjct: 480 WFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSPIKTQGCVFSCTES-NPSVMKA 538
Query: 359 GSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 417
+++K +KLVT+TW G D E++ LPVPY+LPP+ YS EDVPWSWD+ Y+K
Sbjct: 539 KQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQLPPKPYSPEDVPWSWDRGYSK 595
Query: 418 KDVYGQVWPR 427
KDVYGQVWPR
Sbjct: 596 KDVYGQVWPR 605
>gi|326504850|dbj|BAK06716.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 669
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 299/428 (69%), Positives = 348/428 (81%), Gaps = 3/428 (0%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+DWLL ACP L K+PHVLV+HGE +LE +K+ KP NWILHKPPLPISFGTHHSKA
Sbjct: 244 MVDMDWLLTACPSLRKVPHVLVLHGEDGASLERLKKTKPTNWILHKPPLPISFGTHHSKA 303
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YP+G+R++VHTANLIHVDWNNKSQGLW QDFP K+ N++S GFENDL+DYL L
Sbjct: 304 MLLVYPQGIRVVVHTANLIHVDWNNKSQGLWAQDFPWKEANDMSTNIGFENDLVDYLRAL 363
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF NLP G+ IN +FF+KF++SS+ VRLI SVPGYH G ++KKWGHMKLR+VL+
Sbjct: 364 KWPEFRVNLPVVGDVNINAAFFRKFDYSSSTVRLIGSVPGYHVGPNMKKWGHMKLRSVLE 423
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
EC FEK F KSPL+YQFSSLGSLDEKWM+E + S+S+G ++D + LGIG+PLIVWPTVED
Sbjct: 424 ECVFEKQFCKSPLIYQFSSLGSLDEKWMSEFACSLSAGKADDGSQLGIGKPLIVWPTVED 483
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCS+EGYAAG+ IPSPQKNV+KDFL+KYW++WKA H GR RAMPHIKTF RYNGQ +AW
Sbjct: 484 VRCSIEGYAAGSCIPSPQKNVEKDFLRKYWSRWKADHVGRCRAMPHIKTFTRYNGQNIAW 543
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
FLLTS+NLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT S
Sbjct: 544 FLLTSSNLSKAAWGALQKNNTQLMIRSYELGVLFLPKTLQSVPQFSCTDK---SRSNLDK 600
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
+ KTKLVTL W G + S+EVV LPVPY+LPPQ Y EDVPWSWD+RYTKKDV
Sbjct: 601 LALGKNIKTKLVTLCWKGDEEKDPSAEVVRLPVPYQLPPQLYGPEDVPWSWDRRYTKKDV 660
Query: 421 YGQVWPRH 428
YG VW RH
Sbjct: 661 YGSVWSRH 668
>gi|357122586|ref|XP_003562996.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Brachypodium
distachyon]
Length = 671
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 296/428 (69%), Positives = 351/428 (82%), Gaps = 3/428 (0%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+DWLL ACP L K+PHVLV+HGE +LEH+K++KPANWILHKPPLPI+FGTHHSKA
Sbjct: 246 MVDMDWLLTACPSLRKVPHVLVLHGEDGASLEHLKKSKPANWILHKPPLPITFGTHHSKA 305
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YP+G+R++VHTANLIHVDWNNKSQGLW QDFP KD ++++ FE+DL+DYLS L
Sbjct: 306 MLLVYPQGIRVVVHTANLIHVDWNNKSQGLWTQDFPWKDTKDMNKNISFESDLVDYLSAL 365
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF LP G+ IN +FF+KF++SS+ VRLI SVPGYH G ++KKWGHMKLR+VL+
Sbjct: 366 KWPEFRIKLPVAGDVNINAAFFRKFDYSSSTVRLIGSVPGYHVGPNIKKWGHMKLRSVLE 425
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
C FEK F KSPL+YQFSSLGSLDEKWM E + S+S+G ++D +PLGIG+PLIVWPTVED
Sbjct: 426 GCVFEKQFCKSPLIYQFSSLGSLDEKWMTEFACSLSAGKADDGSPLGIGKPLIVWPTVED 485
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCS+EGYAAG+ IPSPQKNV+KDFL+KYW++WKA H GR AMPHIKTFARYNGQ +AW
Sbjct: 486 VRCSIEGYAAGSCIPSPQKNVEKDFLRKYWSRWKADHVGRCHAMPHIKTFARYNGQNIAW 545
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
FLLTS+NLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT + G+
Sbjct: 546 FLLTSSNLSKAAWGALQKNNTQLMIRSYELGVLFLPKTLQSVSRFSCTEK---NHSNLGN 602
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
+ KTKLVTL W + S+EV+ LPVPY+LPPQ Y EDVPWSWD+RYTKKDV
Sbjct: 603 LTLGKTIKTKLVTLCWKDDEEKEPSAEVIRLPVPYQLPPQLYGPEDVPWSWDRRYTKKDV 662
Query: 421 YGQVWPRH 428
YG VWPRH
Sbjct: 663 YGAVWPRH 670
>gi|242050414|ref|XP_002462951.1| hypothetical protein SORBIDRAFT_02g035180 [Sorghum bicolor]
gi|241926328|gb|EER99472.1| hypothetical protein SORBIDRAFT_02g035180 [Sorghum bicolor]
Length = 689
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 297/428 (69%), Positives = 346/428 (80%), Gaps = 6/428 (1%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWLL ACP L K+PHVLV+HG+ +LE MK+ KPANWILHKPPLPISFGTHHSKA
Sbjct: 267 MVDIDWLLTACPSLKKVPHVLVLHGQDGASLELMKKLKPANWILHKPPLPISFGTHHSKA 326
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YP+G+RI+VHTANLIHVDWN KSQGLWMQDFP KD N+++ + FENDL+DYLS L
Sbjct: 327 MLLVYPQGIRIVVHTANLIHVDWNYKSQGLWMQDFPWKDTNDMNNKVPFENDLVDYLSAL 386
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEFS NLP G+ IN +FF+KF++ ++ VRLI SVPGYH G +++KWGHMKLR VL
Sbjct: 387 KWPEFSVNLPEVGDVNINAAFFRKFDYRNSMVRLIGSVPGYHVGPNIRKWGHMKLRNVLD 446
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
E TF K F KSPL+YQFSSLGSLDEKWM+E + S+S+G S+D + LGIG+PLIVWPTVED
Sbjct: 447 EITFNKQFCKSPLIYQFSSLGSLDEKWMSEFACSLSAGKSDDGSQLGIGKPLIVWPTVED 506
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCS+EGYAAG+ IPSPQKNV+KDFLKKYW++WKA H GR RAMPHIKTF RY+GQ +AW
Sbjct: 507 VRCSIEGYAAGSCIPSPQKNVEKDFLKKYWSRWKADHVGRCRAMPHIKTFTRYSGQNIAW 566
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
FLLTS+NLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT S
Sbjct: 567 FLLTSSNLSKAAWGALQKNNTQLMIRSYELGVLFLPQTLQSIPQFSCTEK---SRSSRDG 623
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
+ KTKLVTL W G + +V LPVPY+LPPQ Y ++DVPWSWD+RYTKKDV
Sbjct: 624 VAIGRTIKTKLVTLCWKGDEE---DPSIVKLPVPYQLPPQPYGTQDVPWSWDRRYTKKDV 680
Query: 421 YGQVWPRH 428
YG VWPRH
Sbjct: 681 YGSVWPRH 688
>gi|115472491|ref|NP_001059844.1| Os07g0530100 [Oryza sativa Japonica Group]
gi|33146648|dbj|BAC79984.1| putative tyrosyl-DNA phosphodiesterase [Oryza sativa Japonica
Group]
gi|113611380|dbj|BAF21758.1| Os07g0530100 [Oryza sativa Japonica Group]
gi|215697362|dbj|BAG91356.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222637174|gb|EEE67306.1| hypothetical protein OsJ_24533 [Oryza sativa Japonica Group]
Length = 671
Score = 637 bits (1643), Expect = e-180, Method: Compositional matrix adjust.
Identities = 295/436 (67%), Positives = 353/436 (80%), Gaps = 19/436 (4%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD++WLL ACP L K+ HVLVIHGE ++E +K+ KPANWILHKPPLPISFGTHHSKA
Sbjct: 246 MVDMEWLLTACPSLRKVRHVLVIHGEDGASVELLKKVKPANWILHKPPLPISFGTHHSKA 305
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YP+G+R++VHTANLIHVDWNNK+QGLWMQDFP KD +++ FENDL+DYLS +
Sbjct: 306 MLLVYPQGIRVVVHTANLIHVDWNNKTQGLWMQDFPWKDAKDVNRSVSFENDLVDYLSAI 365
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF NLP G+ IN +FF+KF++ S++VRLI SVPGYH G ++KKWGHMKLR+VL+
Sbjct: 366 KWPEFRVNLPVVGDVNINAAFFRKFDYKSSSVRLIGSVPGYHVGPNIKKWGHMKLRSVLE 425
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
CTFE+ F K+P++YQFSSLGSLDEKWM+E + S+S+G S++ +PLGIG+PLIVWPTVED
Sbjct: 426 GCTFEQQFCKAPMIYQFSSLGSLDEKWMSEFAFSLSAGKSDNGSPLGIGKPLIVWPTVED 485
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VR S+EGYAAG+ IPSPQKNV+KDFL+KYWA+WKA H GR RAMPHIKTF RYNGQ +AW
Sbjct: 486 VRTSIEGYAAGSCIPSPQKNVEKDFLRKYWARWKADHVGRCRAMPHIKTFTRYNGQDIAW 545
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT-------SNIVP 353
FLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT +N+ P
Sbjct: 546 FLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPKTHQSVPQFSCTGKNNSNLNNLAP 605
Query: 354 S-EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWD 412
EI KTKLVTL W + S+E++ LPVPY+LPP+ Y +EDVPWSWD
Sbjct: 606 GKEI-----------KTKLVTLCWKSDEEKEQSTEIIRLPVPYQLPPKPYGTEDVPWSWD 654
Query: 413 KRYTKKDVYGQVWPRH 428
KRYTKKDVYG VWPRH
Sbjct: 655 KRYTKKDVYGSVWPRH 670
>gi|218199747|gb|EEC82174.1| hypothetical protein OsI_26284 [Oryza sativa Indica Group]
Length = 843
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 295/441 (66%), Positives = 353/441 (80%), Gaps = 19/441 (4%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD++WLL ACP L K+ HVLVIHGE ++E +K+ KPANWILHKPPLPISFGTHHSKA
Sbjct: 246 MVDMEWLLTACPSLRKVRHVLVIHGEDGASVELLKKVKPANWILHKPPLPISFGTHHSKA 305
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YP+G+R++VHTANLIHVDWNNK+QGLWMQDFP KD +++ FENDL+DYLS +
Sbjct: 306 MLLVYPQGIRVVVHTANLIHVDWNNKTQGLWMQDFPWKDAKDVNRIVSFENDLVDYLSAI 365
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF NLP G+ IN +FF+KF++ S+ VRLI SVPGYH G ++KKWGHMKLR+VL+
Sbjct: 366 KWPEFRVNLPVVGDVNINAAFFRKFDYKSSLVRLIGSVPGYHVGPNIKKWGHMKLRSVLE 425
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
CTFE+ F K+P++YQFSSLGSLDEKWM+E + S+S+G S++ +PLGIG+PLIVWPTVED
Sbjct: 426 GCTFEQQFCKAPMIYQFSSLGSLDEKWMSEFACSLSAGKSDNGSPLGIGKPLIVWPTVED 485
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VR S+EGYAAG+ IPSPQKNV+KDFL+KYWA+WKA H GR RAMPHIKTF RYNGQ +AW
Sbjct: 486 VRTSIEGYAAGSCIPSPQKNVEKDFLRKYWARWKADHVGRCRAMPHIKTFTRYNGQDIAW 545
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT-------SNIVP 353
FLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT +N+ P
Sbjct: 546 FLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPKTHQSVPQFSCTGKNNSNLNNLAP 605
Query: 354 S-EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWD 412
EI KTKLVTL W + S+E++ LPVPY+LPP+ Y +ED PWSWD
Sbjct: 606 GKEI-----------KTKLVTLCWKSDEEKEQSTEIIRLPVPYQLPPKPYGTEDDPWSWD 654
Query: 413 KRYTKKDVYGQVWPRHFQLYA 433
KRYTKKDVYG VWPRH + A
Sbjct: 655 KRYTKKDVYGSVWPRHGGIQA 675
>gi|293331809|ref|NP_001169273.1| uncharacterized protein LOC100383136 [Zea mays]
gi|224028313|gb|ACN33232.1| unknown [Zea mays]
gi|414886956|tpg|DAA62970.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
gi|414886957|tpg|DAA62971.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
Length = 665
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 296/428 (69%), Positives = 348/428 (81%), Gaps = 6/428 (1%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWLL ACP L K+PHVLV+HG+ +LE MK+ KPANWILH+PPLPISFGTHHSKA
Sbjct: 243 MVDIDWLLTACPSLRKVPHVLVLHGQDGASLELMKKLKPANWILHRPPLPISFGTHHSKA 302
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YP+G+RI+VHTANLIHVDWN KSQGLWMQDFP KD +++++ FENDL+DYLS L
Sbjct: 303 MLLVYPQGIRIVVHTANLIHVDWNYKSQGLWMQDFPWKDTVDMNKKTAFENDLVDYLSAL 362
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF NLP G+ IN +FF+KF++S++ VRLI SVPGYH GS+++KWGHMKLR VL
Sbjct: 363 KWPEFRVNLPGVGDVNINAAFFRKFDYSNSMVRLIGSVPGYHVGSNIRKWGHMKLRNVLD 422
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
E F K F KSPL+YQFSSLGSLDEKWM+E + S+S+G S+D + LGIG+PLIVWPTVED
Sbjct: 423 EIMFNKQFCKSPLIYQFSSLGSLDEKWMSEFACSLSAGKSDDGSQLGIGKPLIVWPTVED 482
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VRCS+EGYAAG+ IPSPQKNV++DFLKKYW++WKA H GR RAMPHIKTF RY+GQ +AW
Sbjct: 483 VRCSIEGYAAGSCIPSPQKNVERDFLKKYWSRWKADHVGRCRAMPHIKTFTRYSGQNIAW 542
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
FLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT I+ G
Sbjct: 543 FLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPQTLQSVPQFSCTEK--SRSIRDGV 600
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
I KTKLVTL W G + +V LPVPY+LPPQ Y ++DVPWSWD+RYTKKDV
Sbjct: 601 ALGKTI-KTKLVTLCWKGDEE---DPSIVRLPVPYQLPPQPYGTQDVPWSWDRRYTKKDV 656
Query: 421 YGQVWPRH 428
YG VWPR+
Sbjct: 657 YGSVWPRY 664
>gi|7671486|emb|CAB89327.1| putative protein [Arabidopsis thaliana]
Length = 627
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 293/408 (71%), Positives = 340/408 (83%), Gaps = 7/408 (1%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWL+ ACP LA IP V+VIHGE DG E+++R KPANWILHKP LPISFGTHHSKA
Sbjct: 180 MVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKPANWILHKPRLPISFGTHHSKA 239
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN-NLSEECGFENDLIDYLST 119
+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD + + + CGFE DLIDYL+
Sbjct: 240 IFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKDDDKDPPKGCGFEGDLIDYLNV 299
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASVPGYHTG +L KWGHMKLRT+L
Sbjct: 300 LKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASVPGYHTGFNLNKWGHMKLRTIL 359
Query: 180 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 239
QEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG +EDKTPLG G+ LI+WPTVE
Sbjct: 360 QECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSGITEDKTPLGPGDSLIIWPTVE 419
Query: 240 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 299
DVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+WKA H+ R RAMPHIKTF RYN QK+A
Sbjct: 420 DVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWKADHSARGRAMPHIKTFTRYNDQKIA 479
Query: 300 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS-AKRHGCGFSCTSNIVPSEIKS 358
WFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS K GC FSCT + PS +K+
Sbjct: 480 WFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSPIKTQGCVFSCTES-NPSVMKA 538
Query: 359 GSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
+++K +KLVT+TW G D E++ LPVPY+LPP+ YS E
Sbjct: 539 KQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQLPPKPYSPE 583
>gi|147781461|emb|CAN76118.1| hypothetical protein VITISV_033882 [Vitis vinifera]
Length = 592
Score = 567 bits (1461), Expect = e-159, Method: Compositional matrix adjust.
Identities = 281/388 (72%), Positives = 307/388 (79%), Gaps = 47/388 (12%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWLL +CP LAKIPHVLVIHGE DGTL+HMK+NKP NWILHKPPLPISFGTHHSKA
Sbjct: 189 MVDIDWLLSSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPNWILHKPPLPISFGTHHSKA 248
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K Q LS+ C FENDLIDYLS L
Sbjct: 249 MLLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQKELSKGCAFENDLIDYLSVL 308
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEF+ANLPA G+F IN SFFKKF++S+A VRLIASVPGYHTGS+LKKWGHMKL +VLQ
Sbjct: 309 KWPEFTANLPALGSFNINSSFFKKFDYSNAVVRLIASVPGYHTGSNLKKWGHMKLXSVLQ 368
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
EC F+K F+KSPL YQFSSLGSLDEKWM EL+SSMSSG +DKTPLG+G+PLI+WPTVED
Sbjct: 369 ECIFDKEFQKSPLAYQFSSLGSLDEKWMTELASSMSSGSCDDKTPLGLGKPLIIWPTVED 428
Query: 241 VRCSLE-----------------------------GYAAGNAIPSPQKNVDKDFLKKYWA 271
VRCSLE GYAAGNAIPSPQKNV+K+FLKKYWA
Sbjct: 429 VRCSLEAHITCWIPGYLLGFYMCKFALHQSYYIVQGYAAGNAIPSPQKNVEKEFLKKYWA 488
Query: 272 KWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 331
KWKA+HTGR WFLLTSANLSKAAWGALQKNNSQLMIRSYELG
Sbjct: 489 KWKATHTGR------------------CWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
Query: 332 VLILPSAKRHGCGFSCTSNIVPSEIKSG 359
VL LPS G GFSCT N PS++ G
Sbjct: 531 VLFLPSPINRGQGFSCTDNGSPSKMFPG 558
>gi|168038405|ref|XP_001771691.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676998|gb|EDQ63474.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 598
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 261/444 (58%), Positives = 331/444 (74%), Gaps = 9/444 (2%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDIDWLL ACP L +P V++ HGES G+LE ++ KP +W+LHKPPL +S+GTHH+KA
Sbjct: 154 MVDIDWLLEACPRLKTVPSVVIFHGESGGSLELLQARKPNSWLLHKPPLRLSYGTHHTKA 213
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD-QNNLSEECGFENDLIDYLST 119
M L+YP G+RI+VHTANLI++DWNNKSQGLW QDFP K+ S+ FENDL++YL
Sbjct: 214 MFLLYPTGIRIVVHTANLIYIDWNNKSQGLWTQDFPYKNVAAGESKPSPFENDLVEYLQA 273
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
L+W A + G ++ +FF+KF++SSA VRL+ASVPGYH G +L KWGH+KLRT+L
Sbjct: 274 LEWTGCIAIISGIGEVHVDAAFFRKFDYSSAMVRLVASVPGYHLGRNLTKWGHLKLRTIL 333
Query: 180 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 239
QE FE+ FK SP VYQFSSLGSLDEKWM E SS+ +G + LG G IVWPTVE
Sbjct: 334 QEQHFEEHFKGSPCVYQFSSLGSLDEKWMGEFGSSIQAGSTFGNEQLGPGPVQIVWPTVE 393
Query: 240 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 299
D+R SLEGYAAG A+PSP KNV++ FL KYW +W+A HTGRSRA+PHIKTF RYN Q+LA
Sbjct: 394 DIRNSLEGYAAGGAVPSPLKNVERAFLSKYWYRWQADHTGRSRAIPHIKTFLRYNDQRLA 453
Query: 300 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG---FSCT--SNIVPS 354
WFLLTS+NLSKAAWG LQKN SQLMIRSYELGVL LPS + FSCT S+I+P
Sbjct: 454 WFLLTSSNLSKAAWGVLQKNGSQLMIRSYELGVLFLPSLVGNNSNVTPFSCTYSSSILPR 513
Query: 355 EIKSGSTETS--QIQKTKLVTLTWHGSSDAGASSEV-VYLPVPYELPPQRYSSEDVPWSW 411
E+++ + Q++ TKLVTL+W S+ + ++ V LP+PY LPP +Y +D+PWSW
Sbjct: 514 ELQNREDDGGKRQLRHTKLVTLSWKSSNHEKSDMDIFVRLPIPYALPPVKYDPKDIPWSW 573
Query: 412 DKRYTKKDVYGQVWPRHFQLYAFQ 435
D++Y + D++G+VWPR + Y Q
Sbjct: 574 DRQYREPDMFGEVWPRQVRRYTMQ 597
>gi|302774643|ref|XP_002970738.1| hypothetical protein SELMODRAFT_11047 [Selaginella moellendorffii]
gi|300161449|gb|EFJ28064.1| hypothetical protein SELMODRAFT_11047 [Selaginella moellendorffii]
Length = 478
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 255/429 (59%), Positives = 321/429 (74%), Gaps = 6/429 (1%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDI+WLL ACP+L IP V++IHGES+ + ++ KP+NW+L KP L IS+GTHHSKA
Sbjct: 53 MVDIEWLLSACPLLRSIPQVVMIHGESN--VSQLQSVKPSNWLLFKPRLWISYGTHHSKA 110
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YP GVR++VHTANLI++DWNNK+QGLWMQDFP K ++ FENDL+DYL+ L
Sbjct: 111 MLLVYPTGVRVVVHTANLINIDWNNKTQGLWMQDFPFKSMTGITTASDFENDLVDYLTAL 170
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
+W + ++ HG KIN +F+ F+FS+AAVRLI S+PGYH+G L KWGHMKLR++L+
Sbjct: 171 EWSGCTVDVQHHGQMKINAIYFRNFDFSNAAVRLIGSIPGYHSGPQLNKWGHMKLRSILK 230
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
E F+K F+ SPLVYQFSSLGSLDEKWM E SSS+S G + D LG+GE I++PTVED
Sbjct: 231 EEKFDKKFQNSPLVYQFSSLGSLDEKWMEEFSSSLSEGSTLDGRRLGLGEAQIIFPTVED 290
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VR SLEGY AG AIPSP KNV+K LKKYW++W+A HTGRSRAMPHIKTF R+ LAW
Sbjct: 291 VRQSLEGYRAGAAIPSPAKNVEKPLLKKYWSRWQAEHTGRSRAMPHIKTFVRFRENALAW 350
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSG 359
LTS+NLSKAAWGALQKN +QLMIRSYELGV+ LPS + +SCT ++ P ++
Sbjct: 351 VCLTSSNLSKAAWGALQKNKTQLMIRSYELGVVFLPSMLSKFKNRYSCTEDL-PLINENE 409
Query: 360 STETSQIQKTKLVTLTWHGSSD--AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 417
+ ET + KL TL S D +++++ LP+PY LPP RYSS+D PW WDK+Y
Sbjct: 410 ACETGEAPNVKLYTLAATESVDEEEDTNAKIIRLPLPYALPPPRYSSQDEPWKWDKQYLH 469
Query: 418 KDVYGQVWP 426
DVYG+ WP
Sbjct: 470 PDVYGKRWP 478
>gi|302771966|ref|XP_002969401.1| hypothetical protein SELMODRAFT_170833 [Selaginella moellendorffii]
gi|300162877|gb|EFJ29489.1| hypothetical protein SELMODRAFT_170833 [Selaginella moellendorffii]
Length = 491
Score = 506 bits (1304), Expect = e-141, Method: Compositional matrix adjust.
Identities = 256/430 (59%), Positives = 323/430 (75%), Gaps = 9/430 (2%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDI+WLL ACP+L IP V++IHGES+ + ++ KP+NW+L KP L IS+GTHHSKA
Sbjct: 66 MVDIEWLLSACPLLRSIPQVVMIHGESN--VSQLQSVKPSNWLLFKPRLWISYGTHHSKA 123
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL+YP GVR++VHTANLI++DWNNK+QGLWMQDFPLK ++ FENDL+DYL+ L
Sbjct: 124 MLLVYPTGVRVVVHTANLINIDWNNKTQGLWMQDFPLKSMTGITTASDFENDLVDYLTAL 183
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
+W + ++ HG KIN S+F+ F+FS+AAVRLI S+PGYH+G L KWGHMKLR++L+
Sbjct: 184 EWSGCTVDVQHHGQMKINASYFRNFDFSNAAVRLIGSIPGYHSGPQLNKWGHMKLRSILK 243
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
E F+K F+ SPLVYQFSSLGSLDEKWM E SSS+S G + D LG+GE I++PTVED
Sbjct: 244 EEKFDKKFQNSPLVYQFSSLGSLDEKWMEEFSSSLSEGSTLDGRRLGLGEAQIIFPTVED 303
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 300
VR SLEGY AG AIPSP KNV+K LKKYW++W+A HTGRSRAMPHIKTF R+ LAW
Sbjct: 304 VRQSLEGYRAGAAIPSPAKNVEKPLLKKYWSRWQAEHTGRSRAMPHIKTFVRFRENALAW 363
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNI-VPSEIKS 358
LTS+NLSKAAWGALQKN +QLMIRSYELGV+ LPS + +SCT ++ + +E ++
Sbjct: 364 VCLTSSNLSKAAWGALQKNKTQLMIRSYELGVVFLPSMLSKFKNRYSCTEDLPLINENEA 423
Query: 359 GSTETSQIQKTKLVTLTWHGSSD--AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYT 416
T + KL TL S D +++++ LP+PY LPP RYSS+D PW WDK+Y
Sbjct: 424 CKTGAPNV---KLYTLAATESMDEEEDTNAKIIRLPLPYALPPPRYSSQDEPWKWDKQYL 480
Query: 417 KKDVYGQVWP 426
DVYG+ WP
Sbjct: 481 HPDVYGKRWP 490
>gi|357504797|ref|XP_003622687.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
gi|355497702|gb|AES78905.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
Length = 849
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 192/246 (78%), Positives = 221/246 (89%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+DWL+PACP L+K+PHVLV+HGESD + +KR+KP NWILHKPPLPISFGTHHSKA
Sbjct: 206 MVDVDWLVPACPALSKVPHVLVLHGESDERVACIKRSKPKNWILHKPPLPISFGTHHSKA 265
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
M L+YPRGVR+I+HTANLI+VDWNNKSQGLWMQDFP KDQN+ S+ FENDL++YLS L
Sbjct: 266 MFLVYPRGVRVIIHTANLIYVDWNNKSQGLWMQDFPWKDQNSPSKGSRFENDLVEYLSAL 325
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
KWPEFS NLP+ GNF I PSFFKKF++S A VRLIASVPGYH+G+ LKKWGHMKLR+VLQ
Sbjct: 326 KWPEFSVNLPSLGNFSICPSFFKKFDYSDAMVRLIASVPGYHSGNGLKKWGHMKLRSVLQ 385
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
ECTF+K FKKSPLVYQFSSLGSLDEKWM EL+SSMS+G SEDK PLG+GEP I+WPTVE+
Sbjct: 386 ECTFDKEFKKSPLVYQFSSLGSLDEKWMVELASSMSAGLSEDKVPLGMGEPQIIWPTVEE 445
Query: 241 VRCSLE 246
VRCS+E
Sbjct: 446 VRCSIE 451
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 133/175 (76%), Positives = 147/175 (84%), Gaps = 1/175 (0%)
Query: 254 IPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAW 313
IPSP KNV+K FLKKYWAKWKA+HTGR+RAMPHIKTFARYN Q LAWF LTS+NLSKAAW
Sbjct: 633 IPSPMKNVEKAFLKKYWAKWKANHTGRTRAMPHIKTFARYNNQNLAWFCLTSSNLSKAAW 692
Query: 314 GALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVT 373
GALQKNNSQLMIRSYELGVL LPS + GCGFSCTSN+ S+ KS + ETS+++KTKLVT
Sbjct: 693 GALQKNNSQLMIRSYELGVLFLPSLLKPGCGFSCTSNVKQSKDKSPAQETSKMKKTKLVT 752
Query: 374 LTWHGSSDAGASSEVVY-LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 427
LT +SSEV+ LPVPYELPP YSSEDVPWSWD+RY KKD YGQVWPR
Sbjct: 753 LTAPTRDTTHSSSEVIIQLPVPYELPPLPYSSEDVPWSWDRRYFKKDDYGQVWPR 807
>gi|384252305|gb|EIE25781.1| tyrosyl-DNA phosphodiesterase [Coccomyxa subellipsoidea C-169]
Length = 502
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 177/450 (39%), Positives = 257/450 (57%), Gaps = 40/450 (8%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPISFGT 55
M+D+ W + A P + V V+HGE ++ + +P W++H+ P+ +G
Sbjct: 45 MIDMRWFVSAAPSVLDADRVTVVHGEKSNPTSVSWMQQIAAGRP--WVIHQARCPLQYGV 102
Query: 56 HHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDL 113
HHSKA L+ + RG+R++VHTANLIH D N K+QGLW QDFP KD+ + + FE L
Sbjct: 103 HHSKAFLVQFDRGLRVVVHTANLIHQDCNCKTQGLWYQDFPRKDERSPQDNASRLFETTL 162
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
DY++ L+ P A H I + +FSSA LI SVPGYH G++ +K+GHM
Sbjct: 163 SDYIAALRLPAREAQ---HAQQVI-----AQHDFSSARAHLIPSVPGYHQGAAKQKYGHM 214
Query: 174 KLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL- 232
+R++L F+ F++SP+V QFSSLGS+ W++E S+++G D P G L
Sbjct: 215 LVRSLLARQRFDPVFRRSPIVAQFSSLGSITGAWLSEFRESLAAGDCWDSNPSGSAGRLG 274
Query: 233 ------IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-------FLKKYWAKWKAS--H 277
+VWPTVE+V+ S+EG+ AG +IP NV K L+ +W ++ +
Sbjct: 275 PAADFRVVWPTVEEVKNSVEGWFAGCSIPGTHANVLKTDKGLSTPILQPFWCRFDGAPAT 334
Query: 278 TGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS 337
GR AMPHIK++ R++GQ+LA+ +LTS NLSKAAWG LQKNN+QL I YELGVL+LPS
Sbjct: 335 AGRQHAMPHIKSYLRHSGQRLAYIVLTSHNLSKAAWGVLQKNNTQLHIMHYELGVLLLPS 394
Query: 338 A----KRHG-CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 392
+RH GFSCT+ S + + + S+++ S +E + +
Sbjct: 395 LEESYRRHRHFGFSCTAPA--SHKPAAAAQPSRVEFWAADGAAAGSSEALSTGAEKLEIL 452
Query: 393 VPYELPPQRYSSEDVPWSWDKRYTKKDVYG 422
+PY+LPP RY +D PW + D G
Sbjct: 453 LPYQLPPVRYGPQDQPWMTGVEFPGLDSQG 482
>gi|302833870|ref|XP_002948498.1| hypothetical protein VOLCADRAFT_88920 [Volvox carteri f.
nagariensis]
gi|300266185|gb|EFJ50373.1| hypothetical protein VOLCADRAFT_88920 [Volvox carteri f.
nagariensis]
Length = 1521
Score = 298 bits (762), Expect = 4e-78, Method: Composition-based stats.
Identities = 169/395 (42%), Positives = 222/395 (56%), Gaps = 57/395 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP----ANWILHKPPLPISFGTH 56
M+D+ WLL CP LAK V+HGE M++ A+ LH+PPLPI +GTH
Sbjct: 162 MIDMGWLLSCCPDLAKARQFFVVHGEGPDAEPEMRQQAAEAGAAHVRLHRPPLPIMYGTH 221
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-FENDLID 115
HSKA LL Y G+R+I+HTAN ++ D N+K+QGLW+QDFP KD + FE DL+
Sbjct: 222 HSKAFLLAYSTGLRLIIHTANCVYPDCNDKTQGLWVQDFPRKDTVAAAAPVSTFEQDLVA 281
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSS-LKKWGH 172
Y L P PA N P F +FS A L+ASVPGYH G++ ++ +GH
Sbjct: 282 YFRALALP------PAMAN----PLFEAIAMHDFSFARGTLVASVPGYHRGTAAVQSYGH 331
Query: 173 MKLRTVLQECTFEKGFKKSP----------------LVYQFSSLGSLDEKWMA-ELSSSM 215
M+LR +L++ F L+ Q SS+GS D+ W+ E+ +S+
Sbjct: 332 MRLRRLLEQVPLPSCFAAEGSSCGTASSSSAVPPEGLIIQCSSMGSFDQAWLVDEMGASL 391
Query: 216 SS--------------------GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP 255
++ G +VWPTVE+VR S+EG+ AG +IP
Sbjct: 392 AACRRQPPPPPPPPRPLAAAPPPRPSGPPGCGPLPLAVVWPTVEEVRNSIEGWNAGRSIP 451
Query: 256 SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGA 315
P +NV K F+ +Y+A+W GR RAMPHIKT+ RY GQ+LAWFL+TS NLSKAAWG
Sbjct: 452 GPSRNVSKPFMGRYYARWGGEAVGRQRAMPHIKTYTRYRGQQLAWFLVTSHNLSKAAWGE 511
Query: 316 LQKNNSQLMIRSYELGVLILPS--AKRHGCGFSCT 348
LQKN SQLMIRSYELGVL+ P+ A G S T
Sbjct: 512 LQKNGSQLMIRSYELGVLVTPALEAAYRAKGLSAT 546
>gi|255087474|ref|XP_002505660.1| predicted protein [Micromonas sp. RCC299]
gi|226520930|gb|ACO66918.1| predicted protein [Micromonas sp. RCC299]
Length = 536
Score = 297 bits (761), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 174/466 (37%), Positives = 244/466 (52%), Gaps = 46/466 (9%)
Query: 1 MVDIDWLLP--ACPVLAKIPHVLVIHGESDGTL----EHMKRNKPANWILHKPPLPISFG 54
M+D+ WLL CP L +IP V+ I E E ++ +W + PP P FG
Sbjct: 63 MIDLPWLLSPDGCPELLRIPKVVWIGDERSSPTPRDPEFLRLKGERDWTVVNPPCP-KFG 121
Query: 55 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 114
THH+K +L+Y GVR+ VHTANLIH D ++ W QDFP K +L FE DL
Sbjct: 122 THHTKCFILVYDTGVRVCVHTANLIHGDVRKRTNAAWCQDFPNKSAAHLGRSSEFERDLG 181
Query: 115 DYLSTLKWPEFSANLP-AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
YL+TL W + + LP A G+ + PS +F+FS A +LIASVPG GS++ +GH
Sbjct: 182 RYLATLGWKDETCALPGAGGDVVVGPSAMSRFDFSGAGAKLIASVPGRWVGSAMMNYGHT 241
Query: 174 KLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP-------- 225
+R L TF FK++P+V QF+S+G+ EKWM E++ S +G +E
Sbjct: 242 SVRHALAGMTFPGVFKRAPVVCQFTSVGATTEKWMGEMARSFGAGATETDDANEWPGGPC 301
Query: 226 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA---------- 275
LG G+ +VWPT+ +VR S GY G +IP + ++ +++ +W+
Sbjct: 302 LGDGDLRLVWPTMGEVRGSNLGYVTGGSIPGATDKISREHVRRRLHRWRGDVGATRGTKL 361
Query: 276 --------SHTGRSRAMPHIKTFARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLM 324
TGR R MPH+KTFARY LAW ++ S NLS AAWG L+KN +Q+
Sbjct: 362 LDHPPASTDPTGRGRVMPHVKTFARYAPNAPHHLAWVIVGSHNLSGAAWGRLEKNETQIA 421
Query: 325 IRSYELGVLILPSA---KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSD 381
I SYELGVL+ P + R F+CT V G + ++ + G D
Sbjct: 422 ILSYELGVLLSPRSIGKTRVAAPFTCTPGAVSHR---GEVVPRCLGGVRISAASDDGPGD 478
Query: 382 A--GASSE-VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 424
+ G S E V + P+PY +PP Y+ D PW+ D D YG+V
Sbjct: 479 SPPGDSREFVAFAPLPYRVPPVPYAPSDAPWAVDAWDETPDKYGRV 524
>gi|303279543|ref|XP_003059064.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458900|gb|EEH56196.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 520
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 174/491 (35%), Positives = 254/491 (51%), Gaps = 80/491 (16%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAM 61
VD+DW L ACP L V++++G + + P +W HKPP P +GTHH+KA
Sbjct: 41 VDLDWFLAACPALRTARRVILMYGNMHPGVAEI----PKHWSTHKPPCP-QYGTHHTKAF 95
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK 121
+L Y GVR+++HTANL H D+N Q +W QDFPLK +++ FENDL+ Y+S L+
Sbjct: 96 ILAYDAGVRVVIHTANLTHHDFNKSCQAVWYQDFPLKRESS-PPGSAFENDLVRYVSRLQ 154
Query: 122 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE 181
W S + +++P ++++FS A V+LIASVPG H G L++WGHM +RT L+
Sbjct: 155 WSGESVD-----GERVSPEALRRYDFSGAGVKLIASVPGRHAGEELRRWGHMAVRTALER 209
Query: 182 CTFEKGFKKSPLVYQFSSLGSLDEKWMAE------------LSSSMSSGFSEDKTPLGIG 229
T + FK S ++ Q++S GSL +KW+ E S G + + LG G
Sbjct: 210 ETHDDAFKGSSVLCQYTSTGSLPKKWLDEEFRDSLCAGACAGGGGGSVGGNANDRSLGPG 269
Query: 230 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWK---------ASHTGR 280
E ++WPTVE++R GYAAG +IP KNV + L + + KW A GR
Sbjct: 270 EMQLLWPTVEEIRTCDVGYAAGGSIPGNGKNVRRPHLTEKFHKWAKPNDDDDDDAHPMGR 329
Query: 281 SRAMPHIKTFARY-----------------NGQKLAWFLLTSANLSKAAWGALQKNNSQL 323
+ MPHIKTF+RY G K A+ ++ S NLS AAWG L+ SQ+
Sbjct: 330 RKHMPHIKTFSRYYDALTPYQKKRGGGGGVAGAKFAYVIVCSHNLSGAAWGKLEHGGSQI 389
Query: 324 MIRSYELGVLILPS-------------AKRHGCGFSCTSNIVP------SEIKSGSTETS 364
+ SYELGV+ LPS + F C + + P + + ++E +
Sbjct: 390 HVYSYELGVMFLPSLIGARTAKPFSALSATEADPFRCLAAVRPRATTTATATATATSEGA 449
Query: 365 QIQKTKLVTLTWHGSSDA----GASSEVVYLPVPYELPPQRYS--------SEDVPWSWD 412
+ L G++ A G S+ + P+PY +PP RY+ D PW WD
Sbjct: 450 VVLTHALTLARPPGAATATTASGPSATLALCPLPYNVPPLRYNLDDNAPLLERDEPWVWD 509
Query: 413 KRYTKKDVYGQ 423
+RY D +G+
Sbjct: 510 QRYDVADEWGR 520
>gi|410962801|ref|XP_003987957.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Felis catus]
Length = 608
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 174/440 (39%), Positives = 244/440 (55%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPQFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ + Q + F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRVVHGTQRSGDSTTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + ++ + S V LI S PG GS WGH +LR
Sbjct: 327 LMAYNAPSLKEWI----------DVIQEHDLSETNVYLIGSTPGRFQGSQKDHWGHFRLR 376
Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+L+E + KG + P+V QFSS+GS+ + KW+ +E S+ + E +TP
Sbjct: 377 KLLKEHASSIPKG-ESWPIVGQFSSIGSMGADESKWLCSEFKESLVTQGKESRTPGKSAA 435
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIK
Sbjct: 436 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIK 495
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + ++AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 496 TYMRLSPDFSQIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FG 549
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
S V + SGS E + PVPY+LPP+ Y S+D
Sbjct: 550 LDSFRVKQKFFSGSKEPTS------------------------SFPVPYDLPPELYGSKD 585
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ YTK D +G +W
Sbjct: 586 RPWIWNIPYTKAPDTHGNMW 605
>gi|296215712|ref|XP_002754236.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Callithrix
jacchus]
Length = 606
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/439 (40%), Positives = 245/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ KP N L + L I+FGTHH+K
Sbjct: 205 DVDWLVKQYPREFRKKPILLVHGDKREAKAHLHAEAKPYENISLCQAKLDIAFGTHHTKM 264
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 265 MLLLYEEGLRVVIHTSNLIHADWHQKTQGVWLSPLYPRIVDGTHKSGESITHFKADLISY 324
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + A + + S V LI S PG GS WGH +LR
Sbjct: 325 LMAYNAPSLKEWIDA----------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 374
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
VL++ ++S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 375 KVLKDHASSIPNEESWPVVGQFSSIGSLGADESKWLCSEFKESMLALGKESKTPGKSSVP 434
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 435 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 494
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 495 YMRPSPDFSKIAWFLITSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 548
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 549 DSFKVKQKFFAGSQEP------------------------MTTFPVPYDLPPELYGSKDR 584
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 585 PWIWNIPYVKAPDTHGNMW 603
>gi|346467109|gb|AEO33399.1| hypothetical protein [Amblyomma maculatum]
Length = 423
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 173/441 (39%), Positives = 245/441 (55%), Gaps = 64/441 (14%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--ANWILHKPPLPISFGTHHSKA 60
DI WL+ P + +L++HGE + ++ + N + L I +GTHH+K
Sbjct: 20 DIPWLVEQYPPEFRSFPLLIVHGEQREAKKELEASAADFKNLSFVQAKLEIVYGTHHTKM 79
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQNNLSEECGFENDLIDYL 117
MLL+Y G+RI++HTANL+ DW K+Q +W+ + D E GF+ DL+ YL
Sbjct: 80 MLLLYKDGLRIVIHTANLVASDWAQKTQAIWVSPVCTRLASDSKGGDSETGFKADLLTYL 139
Query: 118 STLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
S A+G+ +IN + + +FS+ V L+ SVPG HTG +GH++L
Sbjct: 140 S------------AYGDPRINEWCHYIRSHDFSAVKVFLVGSVPGRHTGPRKSSFGHLRL 187
Query: 176 RTVLQECTFEKGFKKS--PLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIG 229
RT+L + K S PLV QFSS+GSL + W+ E SS+S+ S TP +
Sbjct: 188 RTLLNQHGPSKDLVSSHWPLVAQFSSIGSLGTSAQAWLTGEFLSSLSATKSSGSTPQSV- 246
Query: 230 EPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHI 287
PL +V+P+V+DVRCSLEGY AG +IP K +L Y+ +WK+ GR+ A PHI
Sbjct: 247 -PLKLVFPSVDDVRCSLEGYPAGASIPYSIVTASKQRWLDSYFYRWKSERLGRTAASPHI 305
Query: 288 KTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R + G++ AWFL+TSANLSKAAWGA +KN SQLMIRSYELGVL+ P++ F
Sbjct: 306 KTYTRLSPSGKQAAWFLVTSANLSKAAWGAFEKNGSQLMIRSYELGVLLFPASFGQATTF 365
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
IV SD SS +YLP+PY+LP Y+S+
Sbjct: 366 -----IV---------------------------SDESCSSSALYLPLPYDLPLVPYTSD 393
Query: 406 DVPWSWDKRYTK-KDVYGQVW 425
D PW+WD ++ + D +G +W
Sbjct: 394 DEPWTWDSQHRELPDRFGNMW 414
>gi|403298195|ref|XP_003939917.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403298197|ref|XP_003939918.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 605
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 179/439 (40%), Positives = 245/439 (55%), Gaps = 57/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + VL++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 205 DVDWLVKQYPREFRKKPVLLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 264
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 265 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISY 324
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +LR
Sbjct: 325 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 374
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 375 KLLKDHASSIPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 434
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRSRAMPHIKT
Sbjct: 435 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSRAMPHIKT 494
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + ++AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 495 YMRPSPDFSRIAWFLITSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 548
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 549 DSFKVKQKFFAGSQEP-------------------------MPFPVPYDLPPELYGSKDR 583
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 584 PWIWNIPYVKAPDTHGNMW 602
>gi|351706738|gb|EHB09657.1| Tyrosyl-DNA phosphodiesterase 1 [Heterocephalus glaber]
Length = 655
Score = 274 bits (701), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 179/462 (38%), Positives = 255/462 (55%), Gaps = 55/462 (11%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP AN L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYANISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKD---QNNLSEECGFENDLIDY 116
MLL+Y G+R+++HT+N+I DW+ K+QG+W+ +P D Q + + F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNIIREDWHQKTQGIWLSPLYPRIDHGTQGSGESKTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 327 LTAYNAPPLQEWI----------DTIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 376
Query: 177 TVLQE-CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E T + PLV QFSS+GSL + KW+ +E S+ + +E+KTP P
Sbjct: 377 KLLKEHGTSIPKAECWPLVGQFSSIGSLGADESKWLCSEFKESLLTQGAENKTPGKSSIP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R N ++AWFL+TSANLSKAAWG L+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRLSPNSSRIAWFLVTSANLSKAAWGVLEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETS-----------QIQKTK------------LVTLTWHGSSDAGA 384
S V + SGS E + ++ +K L + +G+
Sbjct: 551 ASFKVKQKFSSGSQELAPPFPVPYDLPPELYGSKGETWAQGTMGGGLASFKVKQKFSSGS 610
Query: 385 SSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 425
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 611 QELAPPFPVPYDLPPELYGSKDRPWIWNIPYVKAPDRHGNMW 652
>gi|449280745|gb|EMC87981.1| Tyrosyl-DNA phosphodiesterase 1 [Columba livia]
Length = 604
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 170/440 (38%), Positives = 245/440 (55%), Gaps = 55/440 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKA 60
D+ WL+ P + +L++HGE + E + + +P I + L I+FGTHH+K
Sbjct: 200 DVGWLVRQYPQEFRKKPLLIVHGEKRESKAELVAQARPYEHISFCQAKLDIAFGTHHTKM 259
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNL----SEECGFENDLID 115
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P Q E F++DLI
Sbjct: 260 MLLLYEEGLRVVIHTSNLIAEDWHQKTQGIWLSPLYPRLPQGTTGSAGESETNFKSDLIS 319
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL+ P + ++ + S V L+ S PG + GS +KWGH++L
Sbjct: 320 YLTAYNSPTLKEWI----------DLIQEHDLSETRVYLLGSTPGRYQGSDKEKWGHLRL 369
Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +L++ ++S P+V QFSS+GSL KW+ +E S+ + S TPL
Sbjct: 370 RKLLKDHASSIPARESWPVVGQFSSIGSLGVDGSKWLCSEFQESLVAAGSSVTTPLKCDV 429
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHIK 288
P+ +V+PTV++VR SLEGY AG ++P + K L Y+ KW AS +GRS A+PHIK
Sbjct: 430 PIHLVYPTVDNVRQSLEGYPAGGSLPYSIQTAQKQLWLHSYFHKWAASISGRSHAIPHIK 489
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + QK+AWFL+T ANLSKAAWGAL+K+ +QLMIRSYELGVL LPSA G+
Sbjct: 490 TYMRPSPDFQKIAWFLVTLANLSKAAWGALEKSGTQLMIRSYELGVLFLPSAFGLDKGYF 549
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
C SE K +T Y PVPY+LPP++Y S+D
Sbjct: 550 CVRGKTLSESKESAT----------------------------YFPVPYDLPPEQYGSKD 581
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ +T D +G +W
Sbjct: 582 QPWIWNIPHTDAPDTHGNMW 601
>gi|397525717|ref|XP_003832802.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Pan paniscus]
gi|397525719|ref|XP_003832803.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Pan paniscus]
Length = 608
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 377 KLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFEESMLTLGKESKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 551 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 586
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605
>gi|297695684|ref|XP_002825063.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Pongo abelii]
gi|297695686|ref|XP_002825064.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Pongo abelii]
Length = 608
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPQIVDGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E+KTP P
Sbjct: 377 KLLKDHASSMPDAESWPVVGQFSSIGSLGSDESKWLCSEFKESMLTLGKENKTPGKTSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + GS E + PVPY+LPP+ Y S+D
Sbjct: 551 DSFKVKQKFFVGSQEP------------------------MATFPVPYDLPPELYGSKDR 586
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605
>gi|20127586|ref|NP_060789.2| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|57242805|ref|NP_001008744.1| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|37999797|sp|Q9NUW8.2|TYDP1_HUMAN RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1
gi|15930062|gb|AAH15474.1| Tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|85725382|gb|ABC79301.1| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|119601820|gb|EAW81414.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
gi|119601821|gb|EAW81415.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
gi|119601822|gb|EAW81416.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
Length = 608
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 377 KLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 551 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 586
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605
>gi|350539189|ref|NP_001233557.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|343962149|dbj|BAK62662.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410225564|gb|JAA10001.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410265878|gb|JAA20905.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410301400|gb|JAA29300.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
Length = 608
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 377 KLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 551 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 586
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605
>gi|49258603|pdb|1QZQ|A Chain A, Human Tyrosyl Dna Phosphodiesterase
gi|49258604|pdb|1QZQ|B Chain B, Human Tyrosyl Dna Phosphodiesterase
Length = 483
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 82 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 141
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 142 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 201
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 202 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 251
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 252 KLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 311
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 312 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 371
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 372 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 425
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 426 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 461
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 462 PWIWNIPYVKAPDTHGNMW 480
>gi|7023536|dbj|BAA91997.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 377 KLLKDHASSMPNPESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 551 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 586
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605
>gi|332223510|ref|XP_003260916.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Nomascus
leucogenys]
Length = 608
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 177/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKTPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTPKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DIIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E+KTP P
Sbjct: 377 KLLKDHASSMPDAESWPVVGQFSSIGSLGGDESKWLCSEFKESMLTLGKENKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 551 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 586
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605
>gi|189054943|dbj|BAG37927.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 176/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E +M + E KTP P
Sbjct: 377 KLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKENMLTLGKESKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 550
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 551 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 586
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 PWIWNIPYVKAPDTHGNMW 605
>gi|73964387|ref|XP_547950.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Canis lupus familiaris]
Length = 609
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 176/440 (40%), Positives = 245/440 (55%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ-NNLSEECG--FENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P Q + S E F+ DLI Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRMAQATHRSGESATHFKADLISY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L + + + S V LI S PG GS WGH +LR
Sbjct: 328 LMAYNAAPLKEWIDT----------IHEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLR 377
Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+L+E + KG + P+V QFSS+GS+ D KW+ +E S+ + E +TP
Sbjct: 378 KLLREHASSITKG-ESWPIVGQFSSIGSMGADDSKWLCSEFKESLVTLGKESRTPGKSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWMADTSGRSNAMPHIK 496
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + ++AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 TYMRSSPDFSQIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FG 550
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
S V + SGS E + PVPY+LPP+ Y ++D
Sbjct: 551 LDSFKVKQKFFSGSKEPA------------------------AAFPVPYDLPPELYGNKD 586
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ YTK D +G +W
Sbjct: 587 RPWIWNIPYTKAPDTHGNMW 606
>gi|402876919|ref|XP_003902197.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Papio anubis]
gi|402876921|ref|XP_003902198.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Papio anubis]
Length = 603
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 176/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 202 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 261
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 262 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISY 321
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + + + S V LI S PG GS WGH +LR
Sbjct: 322 LMAYNAPSLKEWIDT----------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 371
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 372 KLLKDHASSIPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 431
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 432 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 491
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 492 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 545
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
+ V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 546 DNFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 581
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 582 PWIWNIPYVKAPDTHGNMW 600
>gi|355778790|gb|EHH63826.1| hypothetical protein EGM_16873 [Macaca fascicularis]
Length = 603
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 176/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 202 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 261
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 262 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTHESGESTTHFKADLISY 321
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + + + S V LI S PG GS WGH +LR
Sbjct: 322 LMAYNAPSLKEWIDT----------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 371
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 372 KLLKDHASSIPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 431
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 432 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 491
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 492 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 545
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
+ V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 546 DNFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 581
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 582 PWIWNIPYVKAPDTHGNMW 600
>gi|383873205|ref|NP_001244708.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
gi|355693501|gb|EHH28104.1| hypothetical protein EGK_18452 [Macaca mulatta]
gi|380814614|gb|AFE79181.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
gi|383419927|gb|AFH33177.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
Length = 603
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 176/439 (40%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 202 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 261
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 262 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISY 321
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + + + S V LI S PG GS WGH +LR
Sbjct: 322 LMAYNAPSLKEWIDT----------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 371
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 372 KLLKDHASSIPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 431
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 432 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 491
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 492 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 545
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
+ V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 546 DNFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 581
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 582 PWIWNIPYVKAPDTHGNMW 600
>gi|417403294|gb|JAA48458.1| Putative tyrosyl-dna phosphodiesterase [Desmodus rotundus]
Length = 611
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 174/441 (39%), Positives = 244/441 (55%), Gaps = 60/441 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ KP N L + L I+FGTHH+K
Sbjct: 210 DVDWLIKQYPPEFRKKPILLVHGDKREAKAHLHAEAKPYENISLCQAKLDIAFGTHHTKM 269
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ----NNLSEECG--FENDLI 114
MLL+Y G+R+++HTANLI DW+ K+QG+W+ PL + ++S E F+ DLI
Sbjct: 270 MLLLYEEGLRVVIHTANLICADWHQKTQGIWLS--PLYPRVACGTHMSGESATHFKADLI 327
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL+ P + + + + S V LI S PG GS WGH +
Sbjct: 328 SYLTAYNAPPLNEWI----------DIIRDHDLSETNVYLIGSTPGRFQGSQKDNWGHFR 377
Query: 175 LRTVLQE-CTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIG 229
LR +L+E + G + P+V QFSS+GS+ KW+ +E ++++ E + P
Sbjct: 378 LRKLLKEHASSTPGAEAWPVVGQFSSIGSMGADASKWLCSEFKETLATLGKESRAPGKGV 437
Query: 230 EPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 287
PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHI
Sbjct: 438 TPLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSHAMPHI 497
Query: 288 KTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R + ++AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 498 KTYMRPSPDFGRIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------F 551
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
S V SGS E + PVPY+LPP+ Y S+
Sbjct: 552 GLDSFQVKQRFFSGSQEPA------------------------ASFPVPYDLPPELYGSK 587
Query: 406 DVPWSWDKRYTKK-DVYGQVW 425
D PW W+ YTK D +G +W
Sbjct: 588 DRPWIWNIPYTKAPDTHGNMW 608
>gi|28373796|pdb|1MU7|A Chain A, Crystal Structure Of A Human Tyrosyl-dna Phosphodiesterase
(tdp1)- Tungstate Complex
gi|28373797|pdb|1MU7|B Chain B, Crystal Structure Of A Human Tyrosyl-dna Phosphodiesterase
(tdp1)- Tungstate Complex
gi|28373798|pdb|1MU9|A Chain A, Crystal Structure Of A Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)- Vanadate Complex
gi|28373799|pdb|1MU9|B Chain B, Crystal Structure Of A Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)- Vanadate Complex
gi|29726730|pdb|1NOP|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1) In Complex With Vanadate, Dna And A Human
Topoisomerase I-Derived Peptide
gi|29726731|pdb|1NOP|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1) In Complex With Vanadate, Dna And A Human
Topoisomerase I-Derived Peptide
gi|46015472|pdb|1RFF|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octapeptide Klnyydpr, And
Tetranucleotide Agtt.
gi|46015473|pdb|1RFF|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octapeptide Klnyydpr, And
Tetranucleotide Agtt.
gi|46015478|pdb|1RFI|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Pentapeptide Klnyk, And
Tetranucleotide Agtc
gi|46015479|pdb|1RFI|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Pentapeptide Klnyk, And
Tetranucleotide Agtc
gi|46015488|pdb|1RG1|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtt
gi|46015489|pdb|1RG1|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtt
gi|46015492|pdb|1RG2|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agta
gi|46015493|pdb|1RG2|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agta
gi|46015502|pdb|1RGT|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtc
gi|46015503|pdb|1RGT|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtc
gi|46015506|pdb|1RGU|A Chain A, The Crystal Structure Of Human Tyrosyl-dna
Phosphodiesterase Complexed With Vanadate, Octopamine,
And Tetranucleotide Agtg
gi|46015507|pdb|1RGU|B Chain B, The Crystal Structure Of Human Tyrosyl-dna
Phosphodiesterase Complexed With Vanadate, Octopamine,
And Tetranucleotide Agtg
gi|46015511|pdb|1RH0|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine And Trinucleotide
Gtt
gi|46015512|pdb|1RH0|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine And Trinucleotide
Gtt
Length = 485
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 175/439 (39%), Positives = 244/439 (55%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 84 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 143
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ +LI Y
Sbjct: 144 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKANLISY 203
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + K + S V LI S PG GS WGH +L+
Sbjct: 204 LTAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 253
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 254 KLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 313
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 314 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 373
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 374 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------LGL 427
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 428 DSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPELYGSKDR 463
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 464 PWIWNIPYVKAPDTHGNMW 482
>gi|281340418|gb|EFB16002.1| hypothetical protein PANDA_009635 [Ailuropoda melanoleuca]
Length = 388
Score = 268 bits (684), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 171/421 (40%), Positives = 235/421 (55%), Gaps = 56/421 (13%)
Query: 20 VLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTAN 77
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+N
Sbjct: 6 ILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSN 65
Query: 78 LIHVDWNNKSQGLWMQDF--PLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHG 133
LIH DW+ K+QG+W+ P+ + S E F+ DLI YL P +
Sbjct: 66 LIHADWHQKTQGIWLSPLYPPIIHGTHRSGESTTHFKADLISYLMAYNAPSLKEWI---- 121
Query: 134 NFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPL 193
+ + S V LI S PG GS WGH +LR +L+E KG + P+
Sbjct: 122 ------DIIHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASPKG-ESWPV 174
Query: 194 VYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGY 248
V QFSS+GS+ D KW+ +E S+ + E +TP PL +++P+VE+VR SLEGY
Sbjct: 175 VGQFSSIGSMGADDSKWLCSEFKESLVTLGKESQTPGRSAVPLHLIYPSVENVRTSLEGY 234
Query: 249 AAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTS 305
AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+TS
Sbjct: 235 PAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYMRPSPDFSEIAWFLVTS 294
Query: 306 ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 365
ANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + GS E +
Sbjct: 295 ANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFFGSKEPA- 347
Query: 366 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQV 424
PVPY+LPP+ Y S+D PW W+ YTK D +G +
Sbjct: 348 -----------------------AAFPVPYDLPPELYGSKDRPWIWNIPYTKAPDTHGNM 384
Query: 425 W 425
W
Sbjct: 385 W 385
>gi|50748586|ref|XP_421313.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Gallus gallus]
Length = 606
Score = 267 bits (682), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 169/441 (38%), Positives = 242/441 (54%), Gaps = 55/441 (12%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSK 59
+D+ WL+ P + +L++HGE + E + + +P N + L I+FGTHH+K
Sbjct: 201 IDVAWLVRQYPQEYRKKPLLIVHGEKRESKAELLAQARPFENISFCQAKLDIAFGTHHTK 260
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSE-ECGFENDLI 114
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ P ++ E E F++DLI
Sbjct: 261 MMLLLYEEGLRVVIHTSNLIAEDWHQKTQGIWLSPLYPRLPQGSSDSAGESETNFKSDLI 320
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL P + ++ + S V L+ S PG + G +KWGH+K
Sbjct: 321 SYLMAYSSPVLKEWI----------DLIREHDLSETRVYLLGSTPGRYQGIDKEKWGHLK 370
Query: 175 LRTVLQECTFEKGFKKS-PLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIG 229
LR +L++ ++S P+V QFSS+GSL KW+ +E S+ + S L
Sbjct: 371 LRKLLKDHASSIPAQESWPVVGQFSSIGSLGADGSKWLCSEFQESLVAAGSGVAALLKCD 430
Query: 230 EPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHI 287
P+ +V+PTV +VR SLEGY AG ++P + K L Y+ KW A +GRS AMPHI
Sbjct: 431 VPIHLVYPTVSNVRQSLEGYPAGGSLPYSIQTAQKQLWLHSYFHKWSAEVSGRSHAMPHI 490
Query: 288 KTFAR--YNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R ++ QK+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA G+
Sbjct: 491 KTYMRPSHDFQKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSAFGLDKGY 550
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
+ SE K +T PVP++LPP+RY S+
Sbjct: 551 FHVKGNMLSEGKDSATS----------------------------FPVPFDLPPERYGSK 582
Query: 406 DVPWSWDKRYTKK-DVYGQVW 425
D PW W+ YT D +G +W
Sbjct: 583 DQPWIWNIPYTSAPDTHGNMW 603
>gi|224051603|ref|XP_002200587.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Taeniopygia guttata]
Length = 609
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 168/443 (37%), Positives = 242/443 (54%), Gaps = 57/443 (12%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSK 59
+D+ WL+ P + +L++HGE + E + + +P N + L I+FGTHH+K
Sbjct: 202 IDVGWLVRQYPQEFRKKPLLIVHGEKRESKAELIAQARPYENISFCQAKLDIAFGTHHTK 261
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLI 114
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ + S G F++DLI
Sbjct: 262 MMLLLYEEGLRVVIHTSNLIAEDWHQKTQGIWLSPLYPRLSKGTSGSAGESATNFKSDLI 321
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL+ P + ++ + S V L+ S PG + G+ +KWGH++
Sbjct: 322 SYLAAYNSPALREWI----------DLIQEHDLSETRVYLLGSTPGRYQGNDKEKWGHLR 371
Query: 175 LRTVLQECTFEKGFKKS---PLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLG 227
LR +L+E ++S PLV QFSS+GS+ KW+ +E S+ + S T
Sbjct: 372 LRKLLKEHALPIPAQESWPLPLVGQFSSIGSMGADGSKWLCSEFQESLVAAGSSVTTFRK 431
Query: 228 IGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMP 285
P+ +V+PTV +VR SLEGY AG ++P + K L Y+ KW A TGR+ A+P
Sbjct: 432 CDVPIHLVYPTVNNVRQSLEGYPAGGSLPYSIQTAQKQLWLHSYFHKWSADVTGRTHAIP 491
Query: 286 HIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 343
HIKT+ R + QK+AWFL+TSANLSKAAWGAL+KN SQLMIRSYELGVL LPSA
Sbjct: 492 HIKTYMRLSPDFQKIAWFLVTSANLSKAAWGALEKNGSQLMIRSYELGVLFLPSA----- 546
Query: 344 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 403
I + L + GS ++ Y PVPY+LPP++Y
Sbjct: 547 --------------------FGIFRLDLRKKFFTGSEQPATTT---YFPVPYDLPPEQYG 583
Query: 404 SEDVPWSWDKRYTKK-DVYGQVW 425
S+D PW W+ YT D +G +W
Sbjct: 584 SKDQPWIWNIPYTDAPDTHGNMW 606
>gi|354478467|ref|XP_003501436.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Cricetulus griseus]
gi|344235810|gb|EGV91913.1| Tyrosyl-DNA phosphodiesterase 1 [Cricetulus griseus]
Length = 609
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 170/441 (38%), Positives = 242/441 (54%), Gaps = 60/441 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP AN L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRNKPILIVHGDKREDKAHLHAQAKPYANISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSE---ECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DLI Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRLDQGSHTSGESSTHFKADLISY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L + P + ++ + S V L+ S PG GS WGH +LR
Sbjct: 328 LMSYNAPSLQEWIDT----------IQEHDLSETNVYLVGSTPGRFQGSHKDNWGHFRLR 377
Query: 177 TVLQECTFEKGFKKS---PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIG 229
+L+ T K P+V QFSS+GSL + KW+ +E S+ + + +TP
Sbjct: 378 KLLR--THAPSVPKDECWPIVGQFSSIGSLGPDESKWLCSEFKESLLALREDGRTPGKSA 435
Query: 230 EPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHI 287
PL +++P+VE+VR SLEGY AG ++P + ++ ++L Y+ KW A +GRS AMPHI
Sbjct: 436 VPLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAERQNWLHSYFHKWSAETSGRSNAMPHI 495
Query: 288 KTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R + KLAWFL+TSANLSKAAWG L+KN +QLMIRSYELGVL LPSA F
Sbjct: 496 KTYMRPSSDFNKLAWFLVTSANLSKAAWGTLEKNGTQLMIRSYELGVLFLPSA------F 549
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
+ V + S S E + PVPY+LPP+ Y S+
Sbjct: 550 GLDAFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYGSK 585
Query: 406 DVPWSWDKRYTKK-DVYGQVW 425
D PW W+ Y K D +G +W
Sbjct: 586 DRPWIWNIPYVKAPDTHGNMW 606
>gi|311261437|ref|XP_003128731.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Sus scrofa]
Length = 606
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 169/438 (38%), Positives = 237/438 (54%), Gaps = 55/438 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + VL++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 206 DVDWLVKQYPPEFRKKPVLLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 265
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWM----QDFPLKDQNNLSEECGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ Q + F+ DLI Y
Sbjct: 266 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYQRIVPGSHRSGESATHFKADLISY 325
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
LS + ++ + S V LI S PG G WGH +LR
Sbjct: 326 LSAYNAAALKEWI----------DTIQEHDLSETNVYLIGSTPGRFQGDQKDNWGHFRLR 375
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E +S P+V QFSS+ S+ + KW+ +E S+ + E +TP G
Sbjct: 376 KLLKENGSSIPKAESWPVVGQFSSISSMGADESKWLCSEFKESLVTLGKESRTPGGAVPL 435
Query: 232 LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTF 290
+++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A+ +GRS AMPHIKT+
Sbjct: 436 HLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQTWLHSYFHKWSAATSGRSNAMPHIKTY 495
Query: 291 ARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT 348
R + ++AWFL+TSANLSKAAWGAL+KN SQLMIRSYELGVL LP+A F
Sbjct: 496 MRPSPDFSQIAWFLVTSANLSKAAWGALEKNGSQLMIRSYELGVLFLPAA------FGLD 549
Query: 349 SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVP 408
S V + SGS E + PVPY+LPP+ Y S+D P
Sbjct: 550 SFRVKQKFFSGSQEPT------------------------ASFPVPYDLPPELYGSKDRP 585
Query: 409 WSWDKRYTKK-DVYGQVW 425
W W+ Y K D +G +W
Sbjct: 586 WIWNIPYMKAPDTHGNMW 603
>gi|126282139|ref|XP_001366471.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Monodelphis domestica]
Length = 608
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 174/440 (39%), Positives = 242/440 (55%), Gaps = 57/440 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
DIDWL+ P+ + +L++HG+ + ++ KP N L + L I+FGTHH+K
Sbjct: 206 DIDWLIRQYPLEFRKKPILLVHGDKREAKARLQEQAKPYENISLCQAKLDIAFGTHHTKM 265
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFENDLID 115
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ P + E F++DLI
Sbjct: 266 MLLLYEEGLRVVIHTSNLIQADWHQKTQGIWLSPLYPRLPYGTPSTSGESSTNFKSDLIR 325
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL T P + K ++ + S V LI S PG GS + WGH +L
Sbjct: 326 YLMTYNAP----------SLKEWADIIQEHDLSETRVYLIGSTPGRFQGSHKEDWGHFRL 375
Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +L+E T ++S P+V QFSS+GSL + KW+ AE S+ + K+
Sbjct: 376 RKLLKEHTSLVPEQQSWPIVGQFSSIGSLGADESKWLCAEFKESLVVLGNCGKSQGQQDV 435
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHIK 288
PL +++PTVE+VR SLEGY AG ++P + +K L Y+ KW A +GRS AMPHIK
Sbjct: 436 PLYLIYPTVENVRKSLEGYPAGGSLPYSLQTAEKQLWLHSYFHKWSAETSGRSHAMPHIK 495
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPS F
Sbjct: 496 TYMRLSPDFAKMAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPST------FG 549
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ V ++ S + E V PVPY+LPP Y S+D
Sbjct: 550 MDTFKVKKKVFSENREP------------------------VTSFPVPYDLPPNIYDSKD 585
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ YTK D +G +W
Sbjct: 586 RPWIWNIPYTKAPDTHGNMW 605
>gi|149737576|ref|XP_001496143.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Equus caballus]
Length = 611
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 170/441 (38%), Positives = 241/441 (54%), Gaps = 60/441 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 210 DVDWLVKQYPPEFRKTPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 269
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLI 114
MLL+Y G+R+++HT+NL+H DW+ K+QG+W+ PL + ++ F+ DLI
Sbjct: 270 MLLLYEEGLRVVIHTSNLLHADWHQKTQGIWLS--PLYSRIVHGTHSSGESTTHFKADLI 327
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL P + ++ + S V LI S PG GS WGH +
Sbjct: 328 SYLMAYNAPSLKEWI----------DVIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFR 377
Query: 175 LRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIG 229
LR +L+E +S P+V QFSS+GS+ + KW+ +E S+ + E KTP
Sbjct: 378 LRKLLKEHASSIPKAESWPIVGQFSSIGSMGADESKWLCSEFKESLVTLGKESKTPGKSV 437
Query: 230 EPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 287
P +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHI
Sbjct: 438 SPFHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHI 497
Query: 288 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R + ++AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 498 KTYMRPSPDFSQIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------F 551
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
S V + S + E + PVPY+LPP+ Y S+
Sbjct: 552 GLDSFKVKQKFFSDNQEPT------------------------ASFPVPYDLPPELYGSK 587
Query: 406 DVPWSWDKRYTKK-DVYGQVW 425
D PW W+ Y K D +G +W
Sbjct: 588 DRPWIWNIPYIKAPDTHGNMW 608
>gi|348573481|ref|XP_003472519.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cavia porcellus]
Length = 607
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 173/439 (39%), Positives = 240/439 (54%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ + + + KP AN L + L I+FGTHH+K
Sbjct: 206 DVDWLVKQYPPEFRKKPILLVHGDKREAKADLHAQAKPYANVSLCQAKLDIAFGTHHTKM 265
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKD---QNNLSEECGFENDLIDY 116
MLL+Y G R+++HT+N+I DW+ K+QG+W+ +P D Q + F+ DLI Y
Sbjct: 266 MLLLYEEGFRVVIHTSNIIREDWHQKTQGIWLSPLYPRLDPGSQKSGESRTHFKADLISY 325
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + ++ + S V LI S PG GS WGH KLR
Sbjct: 326 LMAYNAPPLKEWIDT----------IREHDLSETNVYLIGSTPGRFQGSQKDNWGHFKLR 375
Query: 177 TVLQE-CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E T + PLV QFSS+GSL + KW+ +E S+ + E+K P P
Sbjct: 376 KLLKEHGTPVPKTECWPLVGQFSSIGSLGADESKWLCSEFKESLLTLGPENKIPGKSSVP 435
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q + +L Y+ KW A +GRS AMPHIKT
Sbjct: 436 LHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQKWLHSYFHKWSAETSGRSNAMPHIKT 495
Query: 290 FARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + ++AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPS F
Sbjct: 496 YMRPSPDFSRIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSV------FGL 549
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + SGS + + PVPY+LPP+ Y S+D
Sbjct: 550 DSFKVKQKFFSGSQDPT------------------------TAFPVPYDLPPELYGSKDR 585
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 586 PWIWNIPYVKAPDTHGNMW 604
>gi|40796186|gb|AAH65162.1| Tdp1 protein [Mus musculus]
Length = 609
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 172/440 (39%), Positives = 242/440 (55%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ + + + KP AN L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRKNPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DL Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 328 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 377
Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 378 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 550
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ V + S S E + PVPY+LPP+ Y S+D
Sbjct: 551 LDTFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYGSKD 586
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606
>gi|348500374|ref|XP_003437748.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Oreochromis
niloticus]
Length = 616
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 168/448 (37%), Positives = 242/448 (54%), Gaps = 77/448 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLP----------IS 52
DI W++ P + VL++HG+ KR A I P P I+
Sbjct: 218 DIAWMVKQYPSEFRDRPVLIVHGD--------KREAKARLIQQAQPFPHVRFCQAKLDIA 269
Query: 53 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG---- 108
FGTHH+K MLL Y G R+I+ T+NLI DW K+QG+WM + S G
Sbjct: 270 FGTHHTKMMLLWYEEGFRVIILTSNLIRADWYQKTQGMWMSPLYPRLPKESSASAGESPT 329
Query: 109 -FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 167
F+ DL++YL++ + PE + K+ + S V L+ S PG + GS +
Sbjct: 330 FFKRDLLEYLASYRAPELEEWI----------QRIKEHDLSETRVYLVGSTPGRYVGSDM 379
Query: 168 KKWGHMKLRTVLQECTFE-KGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSED 222
++WGH++LR +L E T G ++ P++ QFSS+GS+ KW+A E ++++
Sbjct: 380 ERWGHLRLRKLLYEHTNPIPGEERWPVIGQFSSIGSMGLDKSKWLAGEFQRTLTT---LG 436
Query: 223 KTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGR 280
K+ L P+ +++P+VEDVR SLEGY AG ++P + K L Y+ +WKA TGR
Sbjct: 437 KSSLRPDPPMHLLYPSVEDVRMSLEGYPAGGSLPYSIQTAQKQLWLHSYFHRWKAEATGR 496
Query: 281 SRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 338
S AMPHIKT+ R + +LAWFL+TSANLSKAAWGAL+KNN+Q+M+RSYELGVL LPSA
Sbjct: 497 SHAMPHIKTYMRASPDFSQLAWFLVTSANLSKAAWGALEKNNTQMMVRSYELGVLYLPSA 556
Query: 339 KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELP 398
FS N P V+ ++ G PVP++LP
Sbjct: 557 FGMKT-FSVDKNPFP------------------VSASFSG------------FPVPFDLP 585
Query: 399 PQRYSSEDVPWSWDKRYTKK-DVYGQVW 425
P Y+++D PW W+ Y++ D +G +W
Sbjct: 586 PTSYTTKDQPWIWNIPYSQAPDTHGNIW 613
>gi|37999670|sp|Q8BJ37.2|TYDP1_MOUSE RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1; AltName: Full=Protein expressed in
male leptotene and zygotene spermatocytes 501;
Short=MLZ-501
Length = 609
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 172/440 (39%), Positives = 242/440 (55%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ + + + KP AN L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRKNPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DL Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 328 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 377
Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 378 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 550
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ V + S S E + PVPY+LPP+ Y S+D
Sbjct: 551 LDTFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYRSKD 586
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606
>gi|162417986|ref|NP_082630.2| tyrosyl-DNA phosphodiesterase 1 [Mus musculus]
gi|148686961|gb|EDL18908.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_b [Mus musculus]
Length = 609
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 172/440 (39%), Positives = 242/440 (55%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ + + + KP AN L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRKNPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DL Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 328 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 377
Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 378 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 550
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ V + S S E + PVPY+LPP+ Y S+D
Sbjct: 551 LDTFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYGSKD 586
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606
>gi|125841556|ref|XP_700174.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Danio rerio]
Length = 615
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 171/451 (37%), Positives = 238/451 (52%), Gaps = 80/451 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLP----------IS 52
DI W++ P + V+++HGE KR A I P P I+
Sbjct: 214 DIPWMVEQYPPEFRNKPVVLVHGE--------KRESKACLIEQAKPYPHISFCQAKLDIA 265
Query: 53 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-C 107
FGTHH+K MLL Y G R+I+ T+NLI DW K+QG+WM P E
Sbjct: 266 FGTHHTKMMLLWYEEGFRVIILTSNLIRADWYQKTQGMWMSPLYPRLPQGSPGTAGESLT 325
Query: 108 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 167
GF+ DL++YL + PE + + K+ + S V LI S PG + G ++
Sbjct: 326 GFKRDLLEYLEAYRAPELANWI----------ERIKQHDLSETRVYLIGSTPGRYQGPAM 375
Query: 168 KKWGHMKLRTVLQECTFE-KGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSED 222
+KWGH++LR +L E T + ++ ++ QFSS+GS+ KW+A E ++++
Sbjct: 376 EKWGHLRLRKLLSEHTQPMQNEERWHVLGQFSSIGSMGLDKTKWLAAEFQRTLTTLGKAG 435
Query: 223 KTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASH 277
K+ + P L+++P+VE+VR SLEGY AG ++P + K L Y+ W A
Sbjct: 436 KS---LASPETQMLLIYPSVENVRTSLEGYPAGGSLPYSIQTAQKQLWLHSYFHGWHADV 492
Query: 278 TGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 335
TGRS AMPHIKT+ R + +LAWFL+TSANLSKAAWGAL+KNN+Q+M+RSYELGVL L
Sbjct: 493 TGRSNAMPHIKTYMRISPDFTQLAWFLVTSANLSKAAWGALEKNNTQIMVRSYELGVLYL 552
Query: 336 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 395
PSA F N+ P A S + PVP+
Sbjct: 553 PSAFNMST-FPVEKNVFP------------------------------ACSSSIGFPVPF 581
Query: 396 ELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 425
+LPPQRYSS+D PW W+ YT+ D +G VW
Sbjct: 582 DLPPQRYSSKDRPWIWNIPYTQAPDTHGNVW 612
>gi|327259270|ref|XP_003214461.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Anolis
carolinensis]
Length = 603
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 169/444 (38%), Positives = 247/444 (55%), Gaps = 58/444 (13%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--ANWILHKPPLPISFGTHHSK 59
+D+ WL+ P + +L++HGE + ++ N L + L I+FGTHH+K
Sbjct: 200 IDLGWLVKQYPKEFREKPLLIVHGEKRESKAELQEEASLYDNVRLCQAKLDIAFGTHHTK 259
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEECGFENDLI 114
MLL Y G+R+++HT+NLI DW K+QG+W+ P ++ F++DLI
Sbjct: 260 MMLLHYEEGLRVVIHTSNLIADDWYQKTQGIWLSPLYPRLPPGASASDGESHTMFKSDLI 319
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL + K PA G + K+ +FS V L+ S PG + S +KWGH++
Sbjct: 320 SYLMSYK-------SPALGKWA---ETIKQHDFSETRVYLLGSTPGRYQNSDKEKWGHLR 369
Query: 175 LRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIG 229
L+ +L++ + + S P++ QFSS+GS+ KW+ +E S++S ++ K
Sbjct: 370 LKKLLKDHVMQVSDQDSWPVIGQFSSIGSMGADQSKWLCSEFRDSLTSLGNDTKALTNRD 429
Query: 230 EPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHI 287
P+ +V+PTVE+VR SLEGY AG ++P + K L Y+ KW A +GRSRAMPHI
Sbjct: 430 IPIHLVYPTVENVRQSLEGYPAGGSLPYSIETAKKQLWLHAYFHKWSAETSGRSRAMPHI 489
Query: 288 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R + QK+AWFL+TSANLSKAAWGA +K +QLMIRSYELGVL LPS F
Sbjct: 490 KTYMRASPDFQKIAWFLVTSANLSKAAWGAFEKKGTQLMIRSYELGVLFLPSE------F 543
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
S Q++++ S+ +SS PVPY+LPP++Y +
Sbjct: 544 GLNSGYF------------QVKESMF--------SNEPSSS----FPVPYDLPPKKYEGK 579
Query: 406 DVPWSWDKRYTKK-DVYGQVW-PR 427
D PW W+ YT+ D YG +W PR
Sbjct: 580 DRPWIWNIPYTRAPDTYGNMWVPR 603
>gi|72255547|ref|NP_001026827.1| tyrosyl-DNA phosphodiesterase 1 [Rattus norvegicus]
gi|123781898|sp|Q4G056.1|TYDP1_RAT RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1
gi|71051114|gb|AAH98739.1| Tyrosyl-DNA phosphodiesterase 1 [Rattus norvegicus]
gi|149025341|gb|EDL81708.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Rattus norvegicus]
Length = 609
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 171/440 (38%), Positives = 238/440 (54%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
D++WL+ P + +L++HG E+ L H + AN L + L I+FGTHH+K
Sbjct: 208 DVNWLIKQYPPEFRKKPILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTK 266
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSE---ECGFENDLID 115
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P Q N + F+ DL
Sbjct: 267 MMLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTS 326
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL P + ++ + S V LI S PG GS WGH +L
Sbjct: 327 YLMAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRL 376
Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +LQ + P+V QFSS+GSL + KW+ +E S+ + E +TP
Sbjct: 377 RKLLQAHAPSAPRGECWPVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FG 550
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ V + S S+E + PVPY+LPP+ Y S+D
Sbjct: 551 LDTFKVKQKFFSSSSEP------------------------MASFPVPYDLPPELYGSKD 586
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606
>gi|426233768|ref|XP_004010886.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Ovis aries]
Length = 612
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 167/439 (38%), Positives = 238/439 (54%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + VL++HG+ H+ KP N L + L I+FGTHH+K
Sbjct: 211 DVDWLVRQYPPEFRKKPVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKM 270
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ + + F+ DLI Y
Sbjct: 271 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATHFKADLISY 330
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ + ++ + S V LIAS PG G+ WGH +LR
Sbjct: 331 LAAYNAAPLKEWI----------DTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLR 380
Query: 177 TVLQE-CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E + G + P++ QFSS+GS+ + KW+ +E S+ + E +T LG P
Sbjct: 381 KLLKEHASPAPGAESWPVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAVP 439
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 440 LHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKT 499
Query: 290 FARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + ++AWFL+TSANLSKAAWGAL+K +QLMIRSYELGVL LPSA F
Sbjct: 500 YLRPSPDFSQIAWFLVTSANLSKAAWGALEKGGTQLMIRSYELGVLFLPSA------FGL 553
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + SGS++ PVPY+LPP+ Y D
Sbjct: 554 DSFKVKQKFFSGSSQ-----------------------EPTASFPVPYDLPPELYGDRDR 590
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 591 PWIWNIPYVKAPDTHGNMW 609
>gi|291406635|ref|XP_002719650.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Oryctolagus cuniculus]
Length = 609
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 173/440 (39%), Positives = 241/440 (54%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRRKPILLVHGDKREAKAHLHAQAKPYENIALCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P L + S E F+ DLI Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIHEDWHQKTQGIWLSPLYPRLVHGTHRSGESTTHFKADLISY 327
Query: 117 LSTLKWPEFSANLPA-HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
L P + HG+ + S V LI S PG G+ WGH +L
Sbjct: 328 LMAYNAPSLQEWIDTIHGH-----------DLSETNVYLIGSTPGRFQGNQKDNWGHFRL 376
Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +L+E T +S P+V QFSS+GSL + KW+ +E S+ + +T
Sbjct: 377 RKLLKEHTSSVPQAESWPIVGQFSSIGSLGADESKWLCSEFKESLLTLGQASRTAGKSTV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LP+ F
Sbjct: 497 TYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPAT------FG 550
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
S V + S E + PVPY+LPP+ Y S+D
Sbjct: 551 LDSFNVKQKFFSSHQEPA------------------------AAFPVPYDLPPELYGSKD 586
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 587 RPWIWNIPYVKAPDTHGNMW 606
>gi|300798259|ref|NP_001180084.1| tyrosyl-DNA phosphodiesterase 1 [Bos taurus]
gi|296482871|tpg|DAA24986.1| TPA: tyrosyl-DNA phosphodiesterase 1 [Bos taurus]
Length = 612
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 167/439 (38%), Positives = 238/439 (54%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + VL++HG+ H+ KP N L + L I+FGTHH+K
Sbjct: 211 DVDWLIRQYPPEFRKKPVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKM 270
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ + + F+ DLI Y
Sbjct: 271 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATNFKADLISY 330
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ + ++ + S V LIAS PG G+ WGH +LR
Sbjct: 331 LAAYNAAPLKEWI----------DTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLR 380
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E +S P++ QFSS+GS+ + KW+ +E S+ + E +T LG P
Sbjct: 381 KLLKEHASPMPKAESWPVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAAP 439
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 440 LHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKT 499
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + ++AWFL+TSANLSKAAWGAL+K+ +QLMIRSYELGVL LPSA F
Sbjct: 500 YLRPSPDFSQIAWFLVTSANLSKAAWGALEKSGTQLMIRSYELGVLFLPSA------FGL 553
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + SGS++ PVPY+LPP+ Y D
Sbjct: 554 DSFKVKQKFFSGSSQ-----------------------EPTASFPVPYDLPPEVYGDRDR 590
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 591 PWIWNIPYVKAPDTHGNMW 609
>gi|440911964|gb|ELR61579.1| Tyrosyl-DNA phosphodiesterase 1, partial [Bos grunniens mutus]
Length = 616
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 168/439 (38%), Positives = 238/439 (54%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + VL++HG+ H+ KP N L + L I+FGTHH+K
Sbjct: 215 DVDWLVRQYPPEFRKKPVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKM 274
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ + + F+ DLI Y
Sbjct: 275 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATNFKADLISY 334
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ K ++ + S V LIAS PG G+ WGH +LR
Sbjct: 335 LAAYN----------AAPLKEWIDTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLR 384
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E +S P++ QFSS+GS+ + KW+ +E S+ + E +T LG P
Sbjct: 385 KLLKEHASPMPKAESWPVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAAP 443
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 444 LHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKT 503
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + ++AWFL+TSANLSKAAWGAL+K+ +QLMIRSYELGVL LPSA F
Sbjct: 504 YLRPSPDFSQIAWFLVTSANLSKAAWGALEKSGTQLMIRSYELGVLFLPSA------FGL 557
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + SGS++ PVPY+LPP+ Y D
Sbjct: 558 DSFKVKQKFFSGSSQ-----------------------EPTASFPVPYDLPPELYGDRDR 594
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 595 PWIWNIPYVKAPDTHGNMW 613
>gi|395503746|ref|XP_003756224.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Sarcophilus harrisii]
Length = 612
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 167/440 (37%), Positives = 241/440 (54%), Gaps = 57/440 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
++DWL+ P+ + +L++HG+ + ++ KP N L + L I+FGTHH+K
Sbjct: 210 EVDWLVRQYPLEFRKKPILLVHGDKREAKARLQEKAKPYENISLCQAKLDIAFGTHHTKM 269
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFENDLID 115
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ P + E F++DLI
Sbjct: 270 MLLLYEEGLRVVIHTSNLIQADWHQKTQGIWLSPLYPRLPYGTPSTHGESSTNFKSDLIS 329
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL P + +K + S V LI S PG G ++ WGH +L
Sbjct: 330 YLMAYNAPPLKEWI----------DIVQKHDLSETRVYLIGSTPGRFQGKHIEDWGHFRL 379
Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +L+E T ++S P+V QFSS+GSL + KW+ +E S+ + K
Sbjct: 380 RKLLKEHTSLLPEQQSWPIVGQFSSIGSLGADESKWLCSEFKDSLVILGNHGKNQGQHNV 439
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++PTVE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 440 PLHLIYPTVENVRNSLEGYPAGGSLPYSLQTAEKQVWLHSYFHKWSAETSGRSNAMPHIK 499
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 500 TYMRLSPDFAKMAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 553
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ + ++ S E + PVPY+LPP+ Y+S+D
Sbjct: 554 MDTFKIKRKVFSEKQEPA------------------------TSFPVPYDLPPEIYNSKD 589
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 590 RPWIWNIPYVKAPDTHGNMW 609
>gi|344274118|ref|XP_003408865.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Loxodonta africana]
Length = 612
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 167/439 (38%), Positives = 236/439 (53%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ KP N L + L I+FGTHH+K
Sbjct: 211 DVDWLVKQYPPEFRNKPILLVHGDKREAKAHLHAEAKPYENISLCQAKLDIAFGTHHTKM 270
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNL----SEECGFENDLIDY 116
MLL+Y G+R+++HTANLIH DW+ K+QG+W+ + + F+ DL+ Y
Sbjct: 271 MLLLYEEGLRVVIHTANLIHADWHQKTQGIWLSPLYPRIVHGTHGPGESPTHFKADLVSY 330
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + ++ + S V LI S PG G WGH +LR
Sbjct: 331 LMAYNAPPLKGWI----------DTIQEHDLSETNVYLIGSTPGRFQGDQKDNWGHFRLR 380
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E T ++ P+V QFSS+GS+ + KW+ +E S+ + + +T P
Sbjct: 381 KLLREHTSPIPKAEAWPIVGQFSSIGSMGTDESKWLCSEFKESLLTLGKDGRTLGKSTAP 440
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 441 LHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSSAMPHIKT 500
Query: 290 FAR--YNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + +AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPS F
Sbjct: 501 YMRPSPDFSSIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSV------FGL 554
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + SGS E + PVPY+LPP+ Y S+D
Sbjct: 555 DSFKVRQKFFSGSQEL------------------------MASFPVPYDLPPELYGSKDR 590
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 591 PWIWNIPYVKAPDTHGNMW 609
>gi|410911974|ref|XP_003969465.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Takifugu rubripes]
Length = 614
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 165/441 (37%), Positives = 242/441 (54%), Gaps = 62/441 (14%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
DI W++ P + VL++HG E+ L + P + + L I+FGTHH+K
Sbjct: 215 DIAWMVKQYPEEFRDRPVLIVHGDKREAKARLVQQAQGFP-HIQFCQAKLDIAFGTHHTK 273
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP----LKDQNNLSEECGFENDLI 114
MLL Y G R+IV T+NLI DW K+QG+WM FP ++ F+ DL+
Sbjct: 274 MMLLWYEEGFRVIVLTSNLIRADWYQKTQGMWMSPLFPRLPEGSSASSGESPTYFKRDLL 333
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
+YL++ + PE + K+ + S +V L+ S PG + GS +++WGH++
Sbjct: 334 EYLASYRAPELEEWI----------QRIKEHDLSETSVYLVGSTPGRYVGSDMERWGHLR 383
Query: 175 LRTVLQECTFE-KGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIG 229
LR +L E T G ++ P++ QFSS+GS+ KW+A E +M++ K+ +
Sbjct: 384 LRKLLSEHTEAFPGEERWPVIGQFSSIGSMGLDKTKWLAGEFQRTMTT---MGKSTVRSD 440
Query: 230 EPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHI 287
P+ +++P++EDVR SLEGY AG ++P + K L ++ +WKA TGRS AMPHI
Sbjct: 441 PPMQLLYPSIEDVRTSLEGYPAGGSLPYSIQTAQKQLWLHSFFHRWKADSTGRSHAMPHI 500
Query: 288 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R N +LAWF +TSANLSKAAWGAL+KNN+Q+MIRSYELGVL +PSA
Sbjct: 501 KTYMRVSPNFTELAWFFMTSANLSKAAWGALEKNNTQMMIRSYELGVLFVPSA------- 553
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
K+ T + S +SS PVP++LPP YS +
Sbjct: 554 -----------------------FKMKTFPVNKSPFLVSSSSFSGFPVPFDLPPTAYSPK 590
Query: 406 DVPWSWDKRYTKK-DVYGQVW 425
D PW W+ Y++ D +G +W
Sbjct: 591 DQPWIWNIPYSQAPDTHGNIW 611
>gi|395827684|ref|XP_003787027.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Otolemur garnettii]
Length = 608
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 171/440 (38%), Positives = 240/440 (54%), Gaps = 58/440 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
D+DWL+ P + +L++HG E+ L H + N L + L I+FGTHH+K
Sbjct: 207 DVDWLIKQYPPEFRKKPILLVHGDKREAKADL-HAQAKPYGNISLCQAKLDIAFGTHHTK 265
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLID 115
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + + S E F+ DLI
Sbjct: 266 MMLLLYEEGLRVVIHTSNLIHEDWHQKTQGIWLSPLYPRIVHGTHKSGESVTHFKADLIS 325
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL + + + S V LI+S PG GS WGH +L
Sbjct: 326 YLMAYNASPLKEWI----------DLIHEHDLSETNVYLISSTPGRFQGSQKDNWGHFRL 375
Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGE 230
R +L+E +S P+V QFSS+GSL + KW++ E S+ + E K P
Sbjct: 376 RKLLKEHASSIPAAESWPIVGQFSSIGSLGADESKWLSSEFKESLLTLGKESKAPGKSTV 435
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K ++L Y+ KW A +GRS AMPHIK
Sbjct: 436 PLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQNWLHSYFHKWSAETSGRSHAMPHIK 495
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 496 TYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FG 549
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
S V + S + E + PVPY+LPP+ Y ++D
Sbjct: 550 LDSFKVKQKFFSANKEP------------------------MATFPVPYDLPPELYGNKD 585
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 586 RPWIWNIPYVKAPDTHGNMW 605
>gi|340383155|ref|XP_003390083.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Amphimedon
queenslandica]
Length = 535
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 165/441 (37%), Positives = 238/441 (53%), Gaps = 65/441 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK--PANWILHKPPLPISFGTHHS 58
M DI WLL P + +L++HG E ++ + N L + L + FGTHHS
Sbjct: 141 MFDIKWLLDQYPEDKRSLPLLIVHGFQGREFESLRMDSLPHPNIKLLQAKLDL-FGTHHS 199
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
K MLL Y G+R+++HTANLI DW+ K+QG+WM P+ ++ + C F++DL+ YL
Sbjct: 200 KMMLLSYNEGLRVVIHTANLIQKDWDQKTQGVWMS--PVFPKSTVKRSCKFQDDLLSYLD 257
Query: 119 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV 178
T ++ K+ K + SS +IASVPG HTG ++ KWGHMKLR V
Sbjct: 258 T-----YTGAAMNEWKEKV-----KSHDMSSCRAHIIASVPGPHTGLNIFKWGHMKLRKV 307
Query: 179 LQE--CTFEKGFKKSPLVYQFSSLGSL--------DEKWMAELSSSMSSGFSED-KTPLG 227
L+E + K P++ QFSS+GSL +W+ LSS +G + ++ +
Sbjct: 308 LEEHGPSASTTTKDWPVIGQFSSIGSLGPAPSSWLTSEWLTSLSSCWKTGTVKTLRSEIP 367
Query: 228 IGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPH 286
G+ +V+PTVE+++ SLEGY AG ++P + Q + + +L ++ +W A GRSRA PH
Sbjct: 368 KGKLQLVFPTVENIKNSLEGYMAGGSVPYASQTALKQPYLTTFFNQWVAEGYGRSRASPH 427
Query: 287 IKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 344
IKT+ R + +LAWFLLTSANLSKAAWG +K +QL IRSYE+GVL+LP
Sbjct: 428 IKTYMRVSPTCDRLAWFLLTSANLSKAAWGGFEKKGTQLRIRSYEIGVLLLP-------- 479
Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
+ +SG+ + +SS LP+P +LP Y +
Sbjct: 480 ----------DDESGTLMVGE------------------SSSNNSMLPIPIDLPLTDYKT 511
Query: 405 EDVPWSWDKRYTKKDVYGQVW 425
D PW W+ RY D G VW
Sbjct: 512 TDRPWIWNDRYLAPDCKGNVW 532
>gi|113931582|ref|NP_001039242.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus (Silurana) tropicalis]
gi|89273341|emb|CAJ81457.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus (Silurana) tropicalis]
Length = 597
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 165/440 (37%), Positives = 242/440 (55%), Gaps = 57/440 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK-PANWI-LHKPPLPISFGTHHSKA 60
DI WL+ P + +L++HGE + + + P I L + L I+FGTHH+K
Sbjct: 195 DIKWLVKQYPEEFRDKPLLIVHGEKRESKAKLHEDAHPYEHIRLCQAKLDIAFGTHHTKM 254
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLID 115
MLL+Y G+R+++HT+NLIH DW K+QG+W+ + S G F +DL+
Sbjct: 255 MLLLYTEGLRVVIHTSNLIHEDWYQKTQGIWLSPLYPRLPEGASVSAGESSTNFRSDLVA 314
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL++ P + K+ + S V LI S PG G+ KWGH +L
Sbjct: 315 YLASYNSPSLREWM----------DIIKQHDLSETRVYLIGSTPGRFQGNDKDKWGHFRL 364
Query: 176 RTVLQECTFEK-GFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +L+E T G + P++ QFSS+GS+ KW+ +E + S+++ K+
Sbjct: 365 RKLLRENTSAAPGQETWPVIGQFSSIGSMGVDKTKWLCSEFTESLTTLGKSIKSLQKTEI 424
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+V++VR SLEGY AG ++P S Q + +L Y+ KWKA + RS+AMPHIK
Sbjct: 425 PLHLIYPSVDNVRTSLEGYPAGGSLPYSIQTAQKQPWLHSYFHKWKAETSRRSQAMPHIK 484
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + Q LAWFL+TSANLSKAAWG+L+KN +QL IRSYELGVL LPSA
Sbjct: 485 TYMRLSPDSQHLAWFLVTSANLSKAAWGSLEKNGAQLFIRSYELGVLFLPSA-------- 536
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
ET+ V L + S++ +++ PVPY+LPP+ Y ++D
Sbjct: 537 --------------FETNTFN----VKLNIYASNEPSSNA----FPVPYDLPPEHYGAKD 574
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y D +G +W
Sbjct: 575 RPWVWNIPYVNAPDTHGNIW 594
>gi|432115827|gb|ELK36975.1| Tyrosyl-DNA phosphodiesterase 1 [Myotis davidii]
Length = 610
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 172/444 (38%), Positives = 240/444 (54%), Gaps = 66/444 (14%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ KP N L + L I+FGTHH+K
Sbjct: 209 DVDWLVRQYPPEFRKKPILLVHGDKREAKAHLHAEAKPYPNVSLCQAKLDIAFGTHHTKM 268
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLI 114
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ PL + + F+ DLI
Sbjct: 269 MLLLYEEGLRVVIHTSNLIREDWHQKTQGMWVS--PLYPRMAHGTPGSGESTTHFKADLI 326
Query: 115 DYLSTLKWP---EFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 171
YL P E+ + AH + S V LI S PG G+ WG
Sbjct: 327 SYLMAYNAPPLQEWVDVIHAH-------------DLSETNVYLIGSTPGRFQGNQKDNWG 373
Query: 172 HMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPL 226
H +LR VL+E ++ P++ QFSS+GS+ + KW+ AE ++ + E + P
Sbjct: 374 HFRLRKVLKEHASSIPKAEAWPVIGQFSSIGSMGADESKWLCAEFKETLVTLGKESRAPG 433
Query: 227 GIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAM 284
PL +++P+VE+VR SLEGY AG ++P S Q + +L Y+ KW A +GRS AM
Sbjct: 434 RSPAPLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQSWLHAYFHKWSAETSGRSNAM 493
Query: 285 PHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 342
PHIKT+ R + ++AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 494 PHIKTYMRPSPDFSQIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA---- 549
Query: 343 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 402
F S V + SGS E + PVPY+LPP+ Y
Sbjct: 550 --FGLDSFRVKPKFFSGSQEPT------------------------ASFPVPYDLPPELY 583
Query: 403 SSEDVPWSWDKRYTKK-DVYGQVW 425
S+D PW W+ Y K D +G +W
Sbjct: 584 GSKDRPWIWNIPYVKAPDTHGNMW 607
>gi|301123067|ref|XP_002909260.1| tyrosyl-DNA phosphodiesterase, putative [Phytophthora infestans
T30-4]
gi|262100022|gb|EEY58074.1| tyrosyl-DNA phosphodiesterase, putative [Phytophthora infestans
T30-4]
Length = 1123
Score = 257 bits (656), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 200/353 (56%), Gaps = 52/353 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
M D+ WL CP L ++P VLV HGE D + +N PPLPI +GTHH+K
Sbjct: 64 MFDLPWLFTECPRLKEVPVVLV-HGERDRQGMTKECRDYSNVTPVAPPLPIPYGTHHTKM 122
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE---------CGFEN 111
++ +YP VR+ + TAN + DWN K+QGLW QDF LK + EE FE
Sbjct: 123 LVALYPERVRVAIFTANFLSNDWNTKTQGLWYQDFGLKVLTDSDEEEKEAVAKSSSDFEA 182
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 171
DL+ YLS+L P K+ K+F+FSSA V L+ SVPG H G ++K+G
Sbjct: 183 DLVHYLSSLGAP-----------VKLFCGELKRFDFSSARVALVPSVPGVHKGKDMEKYG 231
Query: 172 HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSE-DKTPLGIG 229
H+++R +LGSLDEKW+ E + S+ G T + +
Sbjct: 232 HLRVR----------------------NLGSLDEKWLFGEFAESLLPGKKHISSTSMPVQ 269
Query: 230 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAMPHIK 288
++WP VEDVR SLEG+ +G +IP P KN+ K FL KY KW + R AMPHIK
Sbjct: 270 ALHVIWPAVEDVRNSLEGWNSGRSIPCPLKNM-KPFLHKYLRKWMPPAELHRQNAMPHIK 328
Query: 289 TFARYNGQ-----KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
++AR+N +L W ++TS+NLSKAAWG+LQKN +Q MIRSYELGV+ LP
Sbjct: 329 SYARFNASEDKAGELDWAIVTSSNLSKAAWGSLQKNKTQFMIRSYELGVMFLP 381
>gi|452821653|gb|EME28681.1| tyrosyl-DNA phosphodiesterase 1 [Galdieria sulphuraria]
Length = 452
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 161/439 (36%), Positives = 234/439 (53%), Gaps = 50/439 (11%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSK 59
M D+ WL P+L + +L++HG+ + + P ++I HKP LP +GTHH+K
Sbjct: 45 MFDLSWLFQRVPILLTVERLLIVHGDE----QVYQPFSPYHFITFHKPRLPFPYGTHHTK 100
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
++L YP VR ++ TAN+I DW K+QG++++DFP K + C F + DYLS
Sbjct: 101 LIILFYPTKVRFVLTTANMIQSDWEYKTQGMFLKDFPQKTGE--LKSCPFLETMDDYLSA 158
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRT-V 178
L P + S +++FS A V LI SVPGYH G +L K+GH L + +
Sbjct: 159 LGEP-----------LRYYRSLLCQYDFSKAGVVLIPSVPGYHGGRNLDKYGHRSLHSNI 207
Query: 179 LQECTF--EKGFKKSP------LVYQFSSLGSLDEKWM-AELSSSMSSGFSEDKTPLGIG 229
Q C E+ ++ L+ Q SS+GS+ EKW+ EL SM S + +
Sbjct: 208 SQYCCISDEQRIRRKTTHSTIRLLLQCSSMGSISEKWLKQELFHSMVSSCWKQEDWQYCF 267
Query: 230 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
E ++WP+V+ VR S++GYA+G A P +KN + F + W A R+ +PH+K+
Sbjct: 268 EWDLIWPSVQQVRNSIQGYASGAAFPWTKKNY-RSFQSSHLCLWNAYFFRRNAWLPHMKS 326
Query: 290 FARY-NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC- 347
+ Y + WFLLTSANLS AAWG L +N SQL IRSYELGVL P C ++C
Sbjct: 327 YMAYEESGNIFWFLLTSANLSTAAWGRLVRNQSQLFIRSYELGVLWTPML----CSYTCP 382
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
N++ ++ + TS + K ++ + LP+P++LPPQ Y S D
Sbjct: 383 MDNVI--QLTTPQHITSYYPREK-------------NNNILFCLPLPFQLPPQHYDSNDS 427
Query: 408 PWSWDKRYTKKDVYGQVWP 426
PW WD Y D G VWP
Sbjct: 428 PWLWDAIYKSPDRLGNVWP 446
>gi|427789081|gb|JAA59992.1| Putative tyrosyl-dna phosphodiesterase [Rhipicephalus pulchellus]
Length = 614
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 164/443 (37%), Positives = 244/443 (55%), Gaps = 68/443 (15%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH----KPPLPISFGTHHS 58
DI WL+ P + +LV+HGE + ++ + A+ H + L I +GTHH+
Sbjct: 211 DIPWLVEQYPTEFRNLPLLVVHGEQREAKKALETS--ASGFQHVSFAQAKLEIVYGTHHT 268
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQNNLSEECGFENDLID 115
K MLL+Y G+R+++HTAN+I DW K+Q +W+ + N E GF DL++
Sbjct: 269 KMMLLLYKEGLRVVIHTANMIPTDWAQKTQAIWVGPVCPRLAPGSNGGDSETGFRADLLN 328
Query: 116 YLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
YLS A+G+ IN + + +FS+ V L+ SVPG HTG +GH+
Sbjct: 329 YLS------------AYGDTHINEWCHYIRTHDFSAVKVFLVGSVPGRHTGPRKSCFGHL 376
Query: 174 KLRTVLQECTFEKGFKKS--PLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLG 227
+LR +L + K + PLV QFSS+GSL E W+ E SS+S+ T
Sbjct: 377 RLRNLLSQHGPSKDLVSNHWPLVAQFSSIGSLGASAESWLLGEFLSSLSTTKGSVVTARS 436
Query: 228 IGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMP 285
+ PL +V+P+V+DVRCSLEGY AG +IP DK +L ++ +WK+ GR+ A P
Sbjct: 437 V--PLKLVFPSVDDVRCSLEGYPAGASIPYSIVTADKQRWLDSFFHRWKSERLGRTAASP 494
Query: 286 HIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 343
HIKT+ R + +++AW L+TSANLSKAAWGAL+KN SQLMIRSYELG+L+ P+
Sbjct: 495 HIKTYTRLSPSSKQIAWLLVTSANLSKAAWGALEKNGSQLMIRSYELGILLFPA------ 548
Query: 344 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 403
F + V SE +G++ ++LP+PY++P Y+
Sbjct: 549 NFGQATTFVVSEGANGNS--------------------------ALFLPLPYDVPLVPYT 582
Query: 404 SEDVPWSWDKRYTK-KDVYGQVW 425
+D PW+WD ++ + D +G +W
Sbjct: 583 KDDEPWTWDSQHRELPDRFGNMW 605
>gi|148237298|ref|NP_001087094.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus laevis]
gi|49903395|gb|AAH76790.1| Tdp1-prov protein [Xenopus laevis]
Length = 597
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 165/440 (37%), Positives = 237/440 (53%), Gaps = 57/440 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK-PANWI-LHKPPLPISFGTHHSKA 60
DI+WL+ P + +L++HGE + + + P I L + L I++GTHH+K
Sbjct: 195 DIEWLVKQYPEEFRNKPLLIVHGEKRESKTKLHEDAHPYEHIRLCQAKLDIAYGTHHTKM 254
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLID 115
MLL+Y G+R+++HT+NLI DW K+QG+W+ + S G F +DLI
Sbjct: 255 MLLLYTEGLRVVIHTSNLIREDWYQKTQGIWLSPLYPRLPEGASVSAGESSTNFRSDLIA 314
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL++ P + K+ + S V LI S PG G KWGH +L
Sbjct: 315 YLASYNSPSLREWM----------DIIKQHDLSETRVYLIGSTPGRFQGKDKDKWGHFRL 364
Query: 176 RTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +L+E T K+ P++ QFSS+GS+ KW+ +E + S+ + K+
Sbjct: 365 RKLLRENTSAGPDKEMWPVIGQFSSIGSMGVDKTKWLCSEFTESLKTLGKSIKSLQKSEI 424
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+V++VR SLEGY AG ++P S Q + +L Y+ KWKA +GRS+A+PHIK
Sbjct: 425 PLRLIYPSVDNVRTSLEGYPAGGSLPYSIQTAQKQPWLHSYFHKWKAETSGRSQAIPHIK 484
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R+ + Q LAWFL+TSANLSKAAWG+L+KN +QL IRSYELGVL LPSA F+
Sbjct: 485 TYMRFSPDFQNLAWFLVTSANLSKAAWGSLEKNGAQLFIRSYELGVLFLPSAFDTNT-FN 543
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
NI SG+ PVPY+LPP+ Y S+D
Sbjct: 544 VKVNIYSHNEPSGNA-----------------------------FPVPYDLPPEHYGSKD 574
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y D +G +W
Sbjct: 575 RPWVWNIPYVNAPDTHGNIW 594
>gi|348675737|gb|EGZ15555.1| hypothetical protein PHYSODRAFT_505563 [Phytophthora sojae]
Length = 1258
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 201/356 (56%), Gaps = 55/356 (15%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
M D+ WL CP L +P VL++HGE D + + AN PPLPI++GTHH+K
Sbjct: 69 MYDLPWLFAECPRLRDVP-VLLVHGERDRQGMMKECREYANVTPVAPPLPIAYGTHHTKM 127
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE------------CG 108
++ +YP VR+ + TAN + DWN K+QG+W QDF LK + +E
Sbjct: 128 LVALYPEKVRVAIFTANFLSNDWNTKTQGVWFQDFGLKVLDGSEDEEKDAVADNSTAIND 187
Query: 109 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 168
FE DL+ YLS+L K+ +F+FS+A V L+ SVPG H G ++
Sbjct: 188 FEADLVHYLSSLG-----------AQVKLFCGELMRFDFSAARVALVPSVPGVHKGKDME 236
Query: 169 KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSE-DKTPL 226
K+GH+++R +LGSLDEKW+ E + SM G T +
Sbjct: 237 KYGHLRVR----------------------NLGSLDEKWLFGEFAESMLPGKKNVSPTSM 274
Query: 227 GIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAMP 285
+ I+WP+V+DVR SLEG+ +G +IP P KN+ K FL KY KW R AMP
Sbjct: 275 PVQALHIIWPSVDDVRNSLEGWNSGRSIPCPLKNM-KPFLHKYLRKWTPPEELHRQNAMP 333
Query: 286 HIKTFARYN-----GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
HIK++AR+N +L W ++TS+NLSKAAWGALQKN +QLMIRSYELGV+ LP
Sbjct: 334 HIKSYARFNPSDEKAGELDWVIVTSSNLSKAAWGALQKNKTQLMIRSYELGVMFLP 389
>gi|20150581|pdb|1JY1|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)
Length = 464
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 171/439 (38%), Positives = 238/439 (54%), Gaps = 56/439 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 63 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKX 122
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
LL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ +LI Y
Sbjct: 123 XLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKANLISY 182
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + K + S V LI S PG GS WGH +L+
Sbjct: 183 LTAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 232
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E S + E KTP P
Sbjct: 233 KLLKDHASSXPNAESWPVVGQFSSVGSLGADESKWLCSEFKESXLTLGKESKTPGKSSVP 292
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS A PHIKT
Sbjct: 293 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAXPHIKT 352
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QL IRSYELGVL LPSA
Sbjct: 353 YXRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLXIRSYELGVLFLPSA------LGL 406
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
S V + +GS E PVPY+LPP+ Y S+D
Sbjct: 407 DSFKVKQKFFAGSQEPXAT------------------------FPVPYDLPPELYGSKDR 442
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G W
Sbjct: 443 PWIWNIPYVKAPDTHGNXW 461
>gi|395746171|ref|XP_003778400.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Pongo abelii]
Length = 589
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 212/351 (60%), Gaps = 25/351 (7%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPQIVDGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E+KTP P
Sbjct: 377 KLLKDHASSMPDAESWPVVGQFSSIGSLGSDESKWLCSEFKESMLTLGKENKTPGKTSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 338
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA 547
>gi|79154014|gb|AAI07878.1| TDP1 protein [Homo sapiens]
Length = 589
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 211/351 (60%), Gaps = 25/351 (7%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 377 KLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 338
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA 547
>gi|397525721|ref|XP_003832804.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 3 [Pan paniscus]
Length = 589
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 211/351 (60%), Gaps = 25/351 (7%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 327 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 377 KLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFEESMLTLGKESKTPGKSSVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 338
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 497 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA 547
>gi|426377770|ref|XP_004055628.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Gorilla gorilla
gorilla]
Length = 608
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 170/446 (38%), Positives = 233/446 (52%), Gaps = 70/446 (15%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS---- 58
D+DWL+ P + +L++HG+ H+ KP IS
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQA-------KPYENISLCQLSEIGKR 259
Query: 59 -----KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGF 109
K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F
Sbjct: 260 FLLCEKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHF 319
Query: 110 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 169
+ DLI YL P + K + S V LI S PG GS
Sbjct: 320 KADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDN 369
Query: 170 WGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKT 224
WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM + E KT
Sbjct: 370 WGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKT 429
Query: 225 PLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSR 282
P PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS
Sbjct: 430 PGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSN 489
Query: 283 AMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKR 340
AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 490 AMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA-- 547
Query: 341 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 400
F S V + +GS E + PVPY+LPP+
Sbjct: 548 ----FGLDSFKVKQKFFAGSQEP------------------------MATFPVPYDLPPE 579
Query: 401 RYSSEDVPWSWDKRYTKK-DVYGQVW 425
Y S+D PW W+ Y K D +G +W
Sbjct: 580 LYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|431839199|gb|ELK01126.1| Tyrosyl-DNA phosphodiesterase 1 [Pteropus alecto]
Length = 709
Score = 245 bits (625), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 151/351 (43%), Positives = 213/351 (60%), Gaps = 25/351 (7%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLIKQYPPEFRKKPILLVHGDKREAKAHLHAEAKPYGNISLCQAKLEIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P + N S E F+ DL+ Y
Sbjct: 267 MLLLYEEGLRVVIHTSNLIRADWHQKTQGIWLSPLYPRIAPGTNTSGESTTHFKADLVSY 326
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L + N PA K ++ + S V LI S PG GS WGH +LR
Sbjct: 327 L-------MAYNAPA---LKEWIDVIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 376
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E +S P+V QFSS+GS+ + KW+ +E ++++ E KTP P
Sbjct: 377 KLLKEHASSIPKAESWPVVGQFSSIGSMGADESKWLCSEFKETLATLGRESKTPGKSAVP 436
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 496
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 338
+ R + ++AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 497 YMRPSPDFSQIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA 547
Score = 45.1 bits (105), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 382 AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 425
+G+ PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 662 SGSQEPAASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 706
>gi|26329523|dbj|BAC28500.1| unnamed protein product [Mus musculus]
gi|148686960|gb|EDL18907.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Mus musculus]
Length = 579
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 154/368 (41%), Positives = 216/368 (58%), Gaps = 35/368 (9%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ + + + KP AN L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRKNPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DL Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 328 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 377
Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 378 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 497 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA-------- 548
Query: 347 CTSNIVPS 354
SNIVP+
Sbjct: 549 FVSNIVPA 556
>gi|28071068|emb|CAD61915.1| unnamed protein product [Homo sapiens]
Length = 369
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 157/381 (41%), Positives = 212/381 (55%), Gaps = 54/381 (14%)
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLI 114
K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI
Sbjct: 26 KMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLI 85
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL P + K + S V LI S PG GS WGH +
Sbjct: 86 SYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFR 135
Query: 175 LRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIG 229
L+ +L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP
Sbjct: 136 LKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSS 195
Query: 230 EPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 287
PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHI
Sbjct: 196 VPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHI 255
Query: 288 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 256 KTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------F 309
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
S V + +GS E + PVPY+LPP+ Y S+
Sbjct: 310 GLDSFKVKQKFFAGSQE------------------------PMATFPVPYDLPPELYGSK 345
Query: 406 DVPWSWDKRYTKK-DVYGQVW 425
D PW W+ Y K D +G +W
Sbjct: 346 DRPWIWNIPYVKAPDTHGNMW 366
>gi|195997043|ref|XP_002108390.1| hypothetical protein TRIADDRAFT_19546 [Trichoplax adhaerens]
gi|190589166|gb|EDV29188.1| hypothetical protein TRIADDRAFT_19546 [Trichoplax adhaerens]
Length = 569
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 163/445 (36%), Positives = 236/445 (53%), Gaps = 69/445 (15%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
M D+ WLL P + VL++HG +S LE + P N H+ L +++GTHH
Sbjct: 155 MFDVSWLLDQYPEDYRKNPVLIVHGYSGQSRNNLEQQGQPFP-NVKFHQAKLEMAYGTHH 213
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN----NLSEECGFENDL 113
SK M L+Y G+RI++HTANLI DW ++QG+W+ LK + N++++ GF+ DL
Sbjct: 214 SKMMFLLYSNGLRIVIHTANLIPQDWGRRTQGIWISPLFLKRSDKSEMNIADDTGFKQDL 273
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
+DY+++ PA ++ S + + SS V LIASVPG H G ++ KWGH+
Sbjct: 274 LDYVASYG--------PALFEWR---SRIMEHDMSSVNVFLIASVPGRHAGKNIDKWGHL 322
Query: 174 KLRTVLQECTFEKGFKKS--PLVYQFSSLGSLDEK---WM-AELSSSMSSGFSEDKTPLG 227
KLR +L+ K + P + QFSS+GSL K W+ +E +S+SS + + LG
Sbjct: 323 KLRKILKRNGPSKDDVSANWPAICQFSSIGSLGSKRDAWLYSEFRTSLSSTSTTRLSQLG 382
Query: 228 --IGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
+ +++P+VE+VR LEGY G+ +P + +K +L W A TGR RA
Sbjct: 383 ERKADVKLIFPSVENVRNCLEGYKGGSCLPYNRGTANKQPWLNSLLHNWAAKKTGRHRAS 442
Query: 285 PHIKTFARY--NGQKLAWFLLTS--ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKR 340
PHIKT+ R + +LAWFL+T ANLSKAAWG ++KN +QLMIRSYE+GVL LP
Sbjct: 443 PHIKTYTRVSPDNTELAWFLITRQVANLSKAAWGTMEKNETQLMIRSYEIGVLFLPKQFG 502
Query: 341 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 400
G F KT + W +PY+LP
Sbjct: 503 DGKTF----------------------KTCDLKTNW---------------LIPYDLPLI 525
Query: 401 RYSSEDVPWSWDKRYTKKDVYGQVW 425
Y +D PW+WD + + D +G W
Sbjct: 526 PYGLQDSPWTWDTPHLEPDTHGAQW 550
>gi|47220883|emb|CAG03090.1| unnamed protein product [Tetraodon nigroviridis]
Length = 607
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 165/446 (36%), Positives = 246/446 (55%), Gaps = 72/446 (16%)
Query: 7 LLPACP--------VLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTH 56
LL ACP L + VL++HG+ + + A + + L I+FGTH
Sbjct: 204 LLQACPRRQSPHQWCLRRDRPVLIVHGDKREAKARLVQQAQAFPHVQFCQAKLDIAFGTH 263
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE----CGFEN 111
H+K MLL Y G R+++ T+NLI DW K+QG+WM FP + + + F+
Sbjct: 264 HTKMMLLWYEEGFRVVILTSNLIRADWYQKTQGMWMSPLFPRLPEGSGARAGESPTSFKR 323
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 171
DL++YL++ + + + ++ + S A+V L+ S PG + G+ +++WG
Sbjct: 324 DLLEYLASYRAAQLEEWM----------ERIQEHDLSEASVYLVGSTPGRYVGADMERWG 373
Query: 172 HMKLRTVLQECT-FEKGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSS-GFSEDKT- 224
H++LR +L+E T G + P+V QFSS+GS+ KW+A E ++S+ G S ++
Sbjct: 374 HLRLRKLLREHTETPAGQDRWPVVGQFSSIGSMGLDKSKWLAGEFQHTLSTLGQSSARSD 433
Query: 225 -PLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSR 282
PL L+++P+VEDVR SLEGY AG ++P S Q + +L ++ +W+A TGRS
Sbjct: 434 PPL-----LLLYPSVEDVRTSLEGYPAGGSLPYSIQTAQRQLWLHAFFHRWRADSTGRSH 488
Query: 283 AMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKR 340
AMPHIKT+ R + +LAWFL+TSANLSKAAWGAL+KNN+Q+MIRSYELGVL LP+A
Sbjct: 489 AMPHIKTYMRASPGYTELAWFLVTSANLSKAAWGALEKNNTQVMIRSYELGVLFLPAA-- 546
Query: 341 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 400
+ T + S +SS PVP++LPP
Sbjct: 547 ----------------------------FNMKTFPVNTSPFPVSSSSFSGFPVPFDLPPT 578
Query: 401 RYSSEDVPWSWDKRYTKK-DVYGQVW 425
YS +D PW W+ Y++ D +G VW
Sbjct: 579 AYSPKDQPWIWNIPYSQAPDTHGNVW 604
>gi|256073128|ref|XP_002572884.1| tyrosyl-DNA phosphodiesterase [Schistosoma mansoni]
Length = 1234
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 163/449 (36%), Positives = 246/449 (54%), Gaps = 71/449 (15%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIH---GESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
M DI WL P + + ++H G+ +L+ K +N + + + +G HH
Sbjct: 830 MFDIPWLREQYPERFRSLPLTIVHDFQGKMKKSLDE-SVAKYSNIRTCQADIRLPYGVHH 888
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSE---ECGFE 110
+K M+L Y G++II+HTAN+I DW+ ++QG+WM ++ Q NL++ + F
Sbjct: 889 TKMMMLKYKDGLKIIIHTANMISDDWDRRTQGIWMSPKLKLLSVEQQKNLNDTDSKTNFR 948
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSF-----FKKFNFSSAAVRLIASVPGYHTGS 165
DL++YL + + +L + +P F ++F V LIASV G H G
Sbjct: 949 ADLLEYLKS-----YGRDLTQSTS---SPLFEWINCLHSYDFRPIKVVLIASVSGRHAGE 1000
Query: 166 SLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSLDEK----WMAELSSSMSSGFS 220
SLKK+GH +L VLQ C + S P++ QFSS+GSL K + E SSS++
Sbjct: 1001 SLKKFGHTRLGEVLQTCNSQ--IPSSWPVIGQFSSIGSLGPKPTDWFTTEWSSSLAG--- 1055
Query: 221 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTG 279
K G+ +++P+VEDVR SLEGY AG +P + +K +L +++ +W+A +
Sbjct: 1056 --KGARGL---RMIYPSVEDVRNSLEGYFAGGCLPYTKTTAEKQPWLCQFFYRWQAFN-- 1108
Query: 280 RSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS 337
SRA PHIK++ R +GQ++ WFLLTSANLSK+AWGA +K+ SQLMIRSYELGVL LP+
Sbjct: 1109 HSRAAPHIKSYTRMSPDGQQIGWFLLTSANLSKSAWGAYEKSKSQLMIRSYELGVLFLPT 1168
Query: 338 A-KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 396
K F EI + + SQ ++ E++ P+PYE
Sbjct: 1169 NYKESAHSF---------EILKNNAKYSQ-----------------SSTDELLPFPIPYE 1202
Query: 397 LPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
LPP +Y S D PW DK ++ D++G++W
Sbjct: 1203 LPPVKYQSNDEPWILDKPHSLPDIFGRIW 1231
>gi|67971950|dbj|BAE02317.1| unnamed protein product [Macaca fascicularis]
gi|67971954|dbj|BAE02319.1| unnamed protein product [Macaca fascicularis]
Length = 343
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 155/379 (40%), Positives = 211/379 (55%), Gaps = 54/379 (14%)
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 2 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISY 61
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + + + S V LI S PG GS WGH +LR
Sbjct: 62 LMAYNAPSLKEWIDT----------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 111
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 112 KLLKDHASSIPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 171
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 172 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 231
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 232 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 285
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
+ V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 286 DNFKVKQKFFAGSQE------------------------PMATFPVPYDLPPELYGSKDR 321
Query: 408 PWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 322 PWIWNIPYVKAPDTHGNMW 340
>gi|405964823|gb|EKC30268.1| Tyrosyl-DNA phosphodiesterase 1 [Crassostrea gigas]
Length = 461
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 156/441 (35%), Positives = 230/441 (52%), Gaps = 58/441 (13%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRN--KPANWILHKPPLPISFGTHHS 58
M +I WL+ P + +L +HG G ++ + K N + L + +GTHH+
Sbjct: 60 MFEIPWLIQQYPASFRQKPLLCVHGFQGGQKAGLEADARKFTNIKFCQAKLEMPYGTHHT 119
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDL 113
K M L+Y G+R+++HTANLI DW+ K+QG+W+ K ++ S G F+ DL
Sbjct: 120 KMMFLLYDNGLRVVIHTANLIERDWHQKTQGIWISPVFPKLKSGPSPTQGDSPTHFKRDL 179
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
+ Y++ K K + + SSA V ++ SVPG H +GHM
Sbjct: 180 LQYVAAYK----------AYQLKDWQDHISRHDLSSANVFIVGSVPGRHMAEKKHWFGHM 229
Query: 174 KLRTVLQE-CTFEKGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGI 228
KLR +L E ++ K P++ QFSS+GSL E W++ E S+++ PL
Sbjct: 230 KLRKLLNENGPVKEQASKWPVIGQFSSIGSLGASKENWLSVEFLQSLATVKGTSSVPLAP 289
Query: 229 GEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHI 287
E +++PTV++VR SLEGY AG +IP K +L Y+ +WK+ GR+RAMPHI
Sbjct: 290 VEFKLIFPTVDNVRTSLEGYPAGGSIPYSINVAKKQPWLHSYFHQWKSEGRGRNRAMPHI 349
Query: 288 KTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R + ++ AWFL+TS+NLSKAAWGAL+K SQLMIRSYE+GVL +P F
Sbjct: 350 KTYCRPSPTWEEAAWFLVTSSNLSKAAWGALEKKGSQLMIRSYEIGVLFIPKYLVENAVF 409
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
C+S + +AG + V +PY+LPP+ Y+
Sbjct: 410 ECSSKV----------------------------KEAGQKTFV----LPYDLPPRAYTKS 437
Query: 406 DVPWSWDKRYTK-KDVYGQVW 425
D PW WD + + D G +W
Sbjct: 438 DKPWIWDIAHKELPDSNGNMW 458
>gi|443688556|gb|ELT91218.1| hypothetical protein CAPTEDRAFT_131694, partial [Capitella teleta]
Length = 374
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 142/348 (40%), Positives = 204/348 (58%), Gaps = 25/348 (7%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH----KPPLPISFGTHH 57
+DI WL+ PV + +LV+HG + +++R A H + L + +GTHH
Sbjct: 5 IDIPWLVAQYPVHHRTKPLLVVHGSTRQEKANLERE--ARLFTHVDLCQAKLEMIYGTHH 62
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQN-NLSEECGFENDLI 114
+K M+L Y GVR+I+HTANLIH DW+ K+QG+WM PL Q+ N F+ DL+
Sbjct: 63 TKMMILSYVNGVRVIIHTANLIHSDWHQKTQGVWMSPLFPPLAPQSRNGDSPTNFKRDLL 122
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
Y++ K + + S K+ +FS+A V LIASVPG H+G+SL ++GH+K
Sbjct: 123 QYINAYKSQSLNEWI----------SIIKRHDFSTAKVFLIASVPGRHSGASLNEFGHLK 172
Query: 175 LRTVLQEC-TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLI 233
L+ VL++ K+ P++ QFSS+GSL + LSS + + FS + +P +
Sbjct: 173 LKKVLRQFGPSSDACKQWPVLAQFSSIGSLGPTPESWLSSELLTSFSATRGSGSQSKPRL 232
Query: 234 --VWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTF 290
++P DVR SLEGY AG ++P K + + +W++ GR++A PHIKT+
Sbjct: 233 HLMYPCRHDVRLSLEGYGAGGSLPYSINTAKKQPWFRTICNRWRSECNGRTKACPHIKTY 292
Query: 291 ARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
R + LAWF LTSANLSKAAWG L+K SQLM+RSYELGVL LP
Sbjct: 293 LRASPDWHNLAWFTLTSANLSKAAWGMLEKQGSQLMVRSYELGVLFLP 340
>gi|349604421|gb|AEP99976.1| Tyrosyl-DNA phosphodiesterase 1-like protein, partial [Equus
caballus]
Length = 345
Score = 234 bits (596), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 149/384 (38%), Positives = 210/384 (54%), Gaps = 58/384 (15%)
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFEN 111
+K MLL+Y G+R+++HT+NL+H DW+ K+QG+W+ PL + ++ F+
Sbjct: 1 TKMMLLLYEEGLRVVIHTSNLLHADWHQKTQGIWLS--PLYSRIVHGTHSSGESTTHFKA 58
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 171
DLI YL P + ++ + S V LI S PG GS WG
Sbjct: 59 DLISYLMAYNAPSLKEWI----------DVIQEHDLSETNVYLIGSTPGRFQGSQKDNWG 108
Query: 172 HMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPL 226
H +LR +L+E +S P+V QFSS+GS+ + KW+ +E S+ + E KTP
Sbjct: 109 HFRLRALLKEHASSIPKAESWPIVGQFSSIGSMGADESKWLCSEFKESLVTLGKESKTPG 168
Query: 227 GIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAM 284
P +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AM
Sbjct: 169 KSVSPFHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAM 228
Query: 285 PHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 342
PHIKT+ R + ++AWFL+TSANLSKAAWGAL++N +QLMIRSYELGVL LPSA
Sbjct: 229 PHIKTYMRPSPDFSQIAWFLVTSANLSKAAWGALERNGAQLMIRSYELGVLFLPSA---- 284
Query: 343 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 402
F S V + S + E + PVPY+LPP+ Y
Sbjct: 285 --FGLDSFKVKQKFFSDNQEPT------------------------ASFPVPYDLPPELY 318
Query: 403 SSEDVPWSWDKRYTKK-DVYGQVW 425
S+D PW W+ Y K D +G +W
Sbjct: 319 GSKDRPWIWNIPYIKAPDTHGNMW 342
>gi|18044048|gb|AAH19804.1| Tdp1 protein [Mus musculus]
Length = 343
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 152/380 (40%), Positives = 209/380 (55%), Gaps = 56/380 (14%)
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DLI Y
Sbjct: 2 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHAAGESSTRFKADLISY 61
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 62 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 111
Query: 177 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 112 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 170
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 171 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 230
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 231 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 284
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+ V + S S E + PVPY+LPP+ Y S+D
Sbjct: 285 LDTFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYRSKD 320
Query: 407 VPWSWDKRYTKK-DVYGQVW 425
PW W+ Y K D +G +W
Sbjct: 321 RPWIWNIPYVKAPDTHGNMW 340
>gi|241556145|ref|XP_002399612.1| tyrosyl-DNA phosphodiesterase, putative [Ixodes scapularis]
gi|215499691|gb|EEC09185.1| tyrosyl-DNA phosphodiesterase, putative [Ixodes scapularis]
Length = 624
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 153/441 (34%), Positives = 230/441 (52%), Gaps = 61/441 (13%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKA 60
DI WL+ P + +L++HGE ++ + + + + L I +GTHH+K
Sbjct: 218 DIPWLVERYPAEFRNLPLLIVHGEQRDAKRELEASASSFKHVSFAQAKLEIVYGTHHTKM 277
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG---FENDLIDYL 117
MLL+Y G+R+++HT+NL+ DW K+Q W+ K F DL++YL
Sbjct: 278 MLLLYKEGMRVVIHTSNLVESDWAQKTQAAWIGPLCPKASGGAGGGDSATGFRADLLEYL 337
Query: 118 STLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
+ +G+ KIN + + +FS+ V L+ SVPG HTG+ +GH+KL
Sbjct: 338 GS------------YGDPKINEWCHYLRAHDFSAVKVFLVGSVPGRHTGARKSSFGHLKL 385
Query: 176 RTVLQECTFEKGFKKS--PLVYQFSSLGSLD---EKWM-AELSSSMSS-GFSEDKTPLGI 228
R +L K S P + QFSS+GSL + W+ AE +S+++ TP
Sbjct: 386 RKLLSLHGPPKELVSSYWPAIAQFSSIGSLGTGPDNWLRAEFLTSLAAVKGGPPLTPSST 445
Query: 229 GEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHI 287
+V+P+V+DVRCSLEGY AG +IP +K +L Y+ +W++ GR+ A PH+
Sbjct: 446 VPVKLVFPSVDDVRCSLEGYPAGASIPYSISTANKQRWLDAYFFRWRSGRFGRTHASPHV 505
Query: 288 KTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
K++AR + G++ AW L+TSANLSKAAWGA +K+ SQLMIRSYELGVL P
Sbjct: 506 KSYARLSPSGKQTAWLLVTSANLSKAAWGAFEKSGSQLMIRSYELGVLFFPG-------- 557
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
Q T T G S AG ++ VP+++P Y +
Sbjct: 558 ---------------------QFGDARTFTVGGDSMAGKGCLPLF--VPFDVPLTPYGQD 594
Query: 406 DVPWSWDKRYTKK-DVYGQVW 425
DVPW+WD ++ + D +G +W
Sbjct: 595 DVPWTWDSQHREAPDRFGNMW 615
>gi|432853024|ref|XP_004067503.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Oryzias latipes]
Length = 614
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 159/441 (36%), Positives = 233/441 (52%), Gaps = 65/441 (14%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKA 60
DI W++ P + VL++HG+ + + A + + L I+FGTHH+K
Sbjct: 218 DIPWMVQQYPPEFRDRPVLIVHGDKREAKARLLQQAQAFPHVRFCQAKLDIAFGTHHTKM 277
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLID 115
MLL Y G R+I+ T+NLI DW K+QG+WM + G F+ DL+D
Sbjct: 278 MLLWYEEGFRVIILTSNLIRADWYQKTQGMWMSPLFPRLPAGSGWSAGESPTFFKRDLLD 337
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL++ + PE + K+ + S V L+ S PG G +++WGH++L
Sbjct: 338 YLTSYRAPELEEWI----------QRIKEHDLSETRVYLVGSTPGRFVGPDMERWGHLRL 387
Query: 176 RTVLQECTFE-KGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGE 230
R +L E T G +K P++ QFSS+GS+ KW+A E +M++ P +
Sbjct: 388 RKLLYEHTNPIPGEEKWPVIGQFSSIGSMGLDKTKWLAGEFQRTMTTLGKSSSRP----D 443
Query: 231 P--LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHI 287
P L+++P VEDVR SLEGY AG ++P + K L Y+ +WKA+ TGRS AMPHI
Sbjct: 444 PPVLLLYPAVEDVRMSLEGYPAGGSLPYSIQTAQKQLWLHGYFHRWKANATGRSHAMPHI 503
Query: 288 KTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
KT+ R + +LAWFL+T LS AWGAL+KNNSQ+M+RSYELGVL +PSA
Sbjct: 504 KTYMRVSPDFTELAWFLVTRCLLS--AWGALEKNNSQVMVRSYELGVLYVPSA------- 554
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
L T S+ +SS +L VP++LPP Y+++
Sbjct: 555 -----------------------FNLKTFPVDKSAFPVSSSSSGFL-VPFDLPPTPYAAK 590
Query: 406 DVPWSWDKRYTKK-DVYGQVW 425
D PW W+ Y+++ D +G +W
Sbjct: 591 DQPWIWNIPYSQEPDTHGNIW 611
>gi|198414495|ref|XP_002123899.1| PREDICTED: similar to tyrosyl-DNA phosphodiesterase 1 [Ciona
intestinalis]
Length = 471
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 148/346 (42%), Positives = 210/346 (60%), Gaps = 33/346 (9%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAM 61
+D+DWL+ PV + + +IHG G + + N L K LP +GTHH+K M
Sbjct: 146 IDVDWLIQQYPVSCQGKPLTIIHG---GNVS--PNPQYPNITLVKVNLP-PYGTHHTKMM 199
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDL-IDYLSTL 120
LL Y G+R+++ T NL+ DW K+QG WM P+ + ++ F+ ++Y+S+
Sbjct: 200 LLHYTSGLRVVILTTNLVPQDWGQKTQGFWMS--PIFPKTTPTKTSKFKPRFGLEYVSSY 257
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
K + + + + + SSA V LI S+PG HTG +L WGHM+LR VL+
Sbjct: 258 K----------NKSLQRWVDHIRSHDMSSANVILIGSIPGRHTGHNLSTWGHMRLRKVLK 307
Query: 181 ECTFEKGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEPL-IVW 235
T +K P++ QFSS+GSL ++KW+ E +S+SS T LG PL +++
Sbjct: 308 NET-KKIDSSWPVIGQFSSIGSLGSSNQKWLCNEWLTSLSSC---SNTTLGASPPLKLIF 363
Query: 236 PTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR-- 292
P+V+DVR SLEGY AG +IP S + + +L+ Y KW A+H GR++A PHIK++AR
Sbjct: 364 PSVDDVRMSLEGYPAGASIPYSRNIALKQPWLRPYLHKWVATHAGRTQAAPHIKSYARIS 423
Query: 293 -YNGQ-KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
YN +L WFLLTSANLSKAAWG+L+KNNSQL I+SYELGVL LP
Sbjct: 424 PYNTNIRLPWFLLTSANLSKAAWGSLEKNNSQLSIKSYELGVLFLP 469
>gi|339256684|ref|XP_003370218.1| 7 transmembrane receptor [Trichinella spiralis]
gi|316965617|gb|EFV50306.1| 7 transmembrane receptor [Trichinella spiralis]
Length = 478
Score = 224 bits (570), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 151/407 (37%), Positives = 214/407 (52%), Gaps = 58/407 (14%)
Query: 38 KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP 96
K N L LPI FGTHHSK LL Y +G+++ +HTANLI DW K+QG+++ FP
Sbjct: 109 KATNVSLVDARLPIPFGTHHSKMTLLRYRKGLKVAIHTANLIEYDWCEKTQGMYISPLFP 168
Query: 97 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF------FKKFNFSSA 150
L + N ++ DY S F A+L A+ N NP+ + ++ A
Sbjct: 169 LIENNTGTD---------DYDSKTN---FKADLIAYLNAYTNPAVKAWAEEIENYDMREA 216
Query: 151 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQ-ECTFEKGFKKSPLVYQFSSLGSLD---EK 206
V ++AS+PG H ++ WGH+KL +L+ ++ P+V QFSS+GSL EK
Sbjct: 217 NVFIVASIPGRHRDVAMYNWGHLKLGRILKTHLNYDAIDANWPVVCQFSSIGSLGTKPEK 276
Query: 207 WM-AELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNV 261
W+ E ++S+ E + EP +V+P+VE+VRCS EGY G +P +
Sbjct: 277 WLLGEFAASLGRTAFECS---ALQEPFRNLKLVYPSVENVRCSSEGYYGGTCLPYTEAVA 333
Query: 262 DKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQK 318
K +L+++ +W GRS A+PHIKT+ RY+ QKLAWFLLTSANLSKAAWG +K
Sbjct: 334 SKQQYLQQFMHRWMCECFGRSHAVPHIKTYFRYSPCFQKLAWFLLTSANLSKAAWGVTEK 393
Query: 319 NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHG 378
+N Q IRSYE+GVL +P F C NI +Q K T+ H
Sbjct: 394 SNQQFNIRSYEIGVLFIPE-------FFCERNI-----------NFFLQGLKAFTI--HR 433
Query: 379 SSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+ + ++ P+P +LP YS D W D Y + D +G W
Sbjct: 434 NVETPSAE----FPLPMDLPLVPYSQNDKMWIIDIPYGEADAHGITW 476
>gi|360045261|emb|CCD82809.1| putative tyrosyl-DNA phosphodiesterase [Schistosoma mansoni]
Length = 483
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 159/467 (34%), Positives = 243/467 (52%), Gaps = 87/467 (18%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIH---GESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
M DI WL P + + ++H G+ +L+ K +N + + + +G HH
Sbjct: 59 MFDIPWLREQYPERFRSLPLTIVHDFQGKMKKSLDE-SVAKYSNIRTCQADIRLPYGVHH 117
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSE---ECGFE 110
+K M+L Y G++II+HTAN+I DW+ ++QG+WM ++ Q NL++ + F
Sbjct: 118 TKMMMLKYKDGLKIIIHTANMISDDWDRRTQGIWMSPKLKLLSVEQQKNLNDTDSKTNFR 177
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSF-----FKKFNFSSAAVRLIASVPGYHTGS 165
DL++YL + + +L + +P F ++F V LIASV G H G
Sbjct: 178 ADLLEYLKS-----YGRDLTQSTS---SPLFEWINCLHSYDFRPIKVVLIASVSGRHAGE 229
Query: 166 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK----WMAELSSSMSSGFSE 221
SLKK+GH +L VLQ C + P++ QFSS+GSL K + E SSS++
Sbjct: 230 SLKKFGHTRLGEVLQTCNSQIP-SSWPVIGQFSSIGSLGPKPTDWFTTEWSSSLAG---- 284
Query: 222 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 280
K G+ +++P+VEDVR SLEGY AG +P + +K +L +++ +W+A +
Sbjct: 285 -KGARGLR---MIYPSVEDVRNSLEGYFAGGCLPYTKTTAEKQPWLCQFFYRWQAFN--H 338
Query: 281 SRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 338
SRA PHIK++ R +GQ++ WFLLTSANLSK+AWGA +K+ SQLMIRSYELGVL LP+
Sbjct: 339 SRAAPHIKSYTRMSPDGQQIGWFLLTSANLSKSAWGAYEKSKSQLMIRSYELGVLFLPTN 398
Query: 339 KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELP 398
+ EI + + SQ ++ E++ P+PYELP
Sbjct: 399 YKESAH--------SFEILKNNAKYSQ-----------------SSTDELLPFPIPYELP 433
Query: 399 PQRYSSED--------------------VPWSWDKRYTKKDVYGQVW 425
P +Y S PW DK ++ D++G++W
Sbjct: 434 PVKYQSNGKKLYMCIIIFLSLFFAMDKYEPWILDKPHSLPDIFGRIW 480
>gi|320165079|gb|EFW41978.1| tyrosyl-DNA phosphodiesterase 1 [Capsaspora owczarzaki ATCC 30864]
Length = 622
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 148/373 (39%), Positives = 203/373 (54%), Gaps = 49/373 (13%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+DWL+ P + + V+HG ++ K + +PPLPI+FGTHH+K
Sbjct: 232 MVDLDWLMTIFPRELQARPMTVVHGLTESADVLQAAGKKWGKTIIRPPLPIAFGTHHTKM 291
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLK----DQNNLSEECGFENDLID 115
M L Y +RI++HTAN+I DW K++G+W FPLK Q + S FE L
Sbjct: 292 MFLFYSDSMRIVIHTANIIPSDWYAKTEGVWCSPKFPLKASTAQQASSSTGRAFEQTLNK 351
Query: 116 YLSTLKWPEFSANLPAHGN-FKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL+ A+G+ + K++FS+A V LIASVPG H G + +WGHM+
Sbjct: 352 YLT------------AYGSCIRQVREQAMKYDFSAANVALIASVPGRHAGLAKSEWGHMQ 399
Query: 175 LRTV-LQECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIG 229
LR + L + L+ QFSS+GSL E W+ +E S S+S+ ++ +P I
Sbjct: 400 LRKLPLPANVASQPVNTHQLIGQFSSIGSLGASPETWLTSEFSVSLSAHKAQGLSP-PIA 458
Query: 230 EP---LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMP 285
P +++P+VE+VR SLEGY AG A+P K +L +++ W A+ +GR AMP
Sbjct: 459 HPRALRLIFPSVENVRLSLEGYLAGGALPYRLATHSKQAWLDQFFCTWNATRSGRQHAMP 518
Query: 286 HIKTFARY------------------NGQKLAWFLLTSANLSKAAWGALQKNNS---QLM 324
HIK++AR L WFLLTSANLSKAAWG LQK + QL
Sbjct: 519 HIKSYARIAVSPKTADSAQQAEATDSTNVALGWFLLTSANLSKAAWGTLQKKGTAAEQLE 578
Query: 325 IRSYELGVLILPS 337
IRSYELGVL PS
Sbjct: 579 IRSYELGVLFHPS 591
>gi|358337002|dbj|GAA55434.1| tyrosyl-DNA phosphodiesterase 1, partial [Clonorchis sinensis]
Length = 1156
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 153/421 (36%), Positives = 223/421 (52%), Gaps = 51/421 (12%)
Query: 1 MVDIDWLLP-------ACPVLAKIPHVLVIHGESDGTLEHM--KRNKPANWILHKPPLPI 51
M D+DWL+ +CP+L V HG+ L + K + H + +
Sbjct: 771 MFDVDWLMQQYPKQFRSCPLLL----VHAYHGQDKAALNSVVSKYENIRQCVAH---IRL 823
Query: 52 SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE---ECG 108
FGTHH+K M L Y G+RI++HTAN+I DW+ ++QG+W+ L+ SE +
Sbjct: 824 PFGTHHTKMMFLKYADGLRIVIHTANMIPDDWDRRTQGIWLSPKLLRKSGTSSETDSDTK 883
Query: 109 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 168
F L++YL + A P+ + + ++FS V L+ SV G H GSSLK
Sbjct: 884 FRETLVNYLR--GYGSTVAGTPSSPLGEWIEELLQ-YDFSPIRVFLVGSVSGMHGGSSLK 940
Query: 169 KWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLG 227
+GH +L +LQ+ T E S PL+ QFSS+GSL + L++ SS + K G
Sbjct: 941 HFGHPRLANLLQDYTLE--VPSSWPLIGQFSSIGSLGAQPTTWLTTQWSSSLA-GKGARG 997
Query: 228 IGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPH 286
+ +++P V+DVR SLEGYAAG +P ++ +K +L+++ +W A SRA PH
Sbjct: 998 L---RMIFPCVDDVRNSLEGYAAGGCLPYSRQTAEKQPWLRQFLHRWCAGP--HSRAAPH 1052
Query: 287 IKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 344
IK++ R +G +WFLLTSANLSKAAWG+ K+ SQLMIRSYELGVL +P +
Sbjct: 1053 IKSYTRISNDGTHASWFLLTSANLSKAAWGSFVKDGSQLMIRSYELGVLFVPGQFQEKA- 1111
Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
+C + PS + S QI AG + + PVPY+LPP Y +
Sbjct: 1112 -NCFRLVTPSRTTTPSDALKQI---------------AGMRTHSIPFPVPYDLPPVLYDT 1155
Query: 405 E 405
+
Sbjct: 1156 D 1156
>gi|325180643|emb|CCA15048.1| tyrosylDNA phosphodiesterase putative [Albugo laibachii Nc14]
Length = 489
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 152/397 (38%), Positives = 211/397 (53%), Gaps = 59/397 (14%)
Query: 47 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS-- 104
P LPI FGTHHSK M++ Y VR+ + TAN + +DWNNK+QG+W QDF LK + + S
Sbjct: 132 PYLPIPFGTHHSKMMIIWYAEKVRVAIFTANFLPIDWNNKTQGIWFQDFGLKSETSASSR 191
Query: 105 -----EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 159
E FE DLIDYL + G + +K++FS+A V L+ASVP
Sbjct: 192 TNLWPERIDFEADLIDYL-------IHVDKIHLGELCLT---LEKYDFSTANVALVASVP 241
Query: 160 GYHTGSS----LKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA-ELSSS 214
G H + + K+GH+++R +LQ T E + PL+ QFSSLGSL E W+ E + S
Sbjct: 242 GTHKNRAIWIDMHKYGHLRMRRLLQ--TLEAWNNEYPLICQFSSLGSLTEPWLYHEFTES 299
Query: 215 MSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWK 274
+ + + + P ++WP+ E VR S+EG+ AG AIP P KN+ K FL K+ W
Sbjct: 300 LQAHSTTKQRP----ALHLIWPSAEQVRNSIEGWNAGRAIPCPLKNM-KPFLHKFLRTWN 354
Query: 275 -ASHTGRSRAMPHIKTFARYN----GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYE 329
RS AMPHIK++A+++ L W LL+S+NLS AAWG+ QK +Q MIRS+E
Sbjct: 355 PPPKLHRSNAMPHIKSYAQFDPTALDGTLRWALLSSSNLSSAAWGSYQKQKNQFMIRSFE 414
Query: 330 LGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 389
+GVL P R+ CT +V V T +D AS +
Sbjct: 415 IGVLFHPKVYRNDK--LCTDPLV-------------------VIGT---PADEAASQNAI 450
Query: 390 YLPVPYELPPQRYSS-EDVPWSWDKRYTKKDVYGQVW 425
P PY P Q Y + +D PW W+ + D G +
Sbjct: 451 RFPAPYNFPLQAYDTKQDEPWIWNLAWDLPDSTGACY 487
>gi|440800948|gb|ELR21974.1| tyrosylDNA phosphodiesterase-related, putative [Acanthamoeba
castellanii str. Neff]
Length = 601
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 153/427 (35%), Positives = 213/427 (49%), Gaps = 92/427 (21%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAM 61
VD+DWL+ CPVL P V + +KP W+L +G HH K M
Sbjct: 260 VDMDWLMRRCPVLPHPPPPNVHY------------HKP--WVL-------DYGCHHGKMM 298
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK 121
LL + + TANLI D+ K+QG+W+QDFP K + FE+ L+DY
Sbjct: 299 LLFWK-----AITTANLIQKDYERKTQGIWLQDFPKKRGD-------FEDTLVDYF---- 342
Query: 122 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE 181
++ + PS + +++S+ V L+ SVPGYH+ ++L ++GHM+LR +L
Sbjct: 343 -----GHMGNERQLQFQPSSLRHYDYSAVRVALVTSVPGYHSRATLNRYGHMRLRGLLSR 397
Query: 182 CTFEKGF-KKSPLVYQFSSLGSLDEKWMAEL--SSSMSSGFSEDKTPLGIGEPLIVWPTV 238
T ++S + QFSS+GSL KW+ E S M+S S D E +VWPTV
Sbjct: 398 VTMPAEIERRSSVACQFSSVGSLTAKWVEEEFGQSLMASAGSSDSKKEAQVE--LVWPTV 455
Query: 239 EDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKL 298
+ VR S++GYAAG ++ + N KDF+ + ++KA R R PHIK
Sbjct: 456 DYVRSSIDGYAAGGSLCFGESNR-KDFMTPLFRQYKAMPESRGRVTPHIKV--------- 505
Query: 299 AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKS 358
LTSANLSKAAWGALQK N+QLMIR++E+GVL LPS F + I
Sbjct: 506 ---CLTSANLSKAAWGALQKGNTQLMIRNFEIGVLFLPSH------FDDRTFIA------ 550
Query: 359 GSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP-QRYSSEDVPWSWDKRYTK 417
GS+ A S + V +P+PY + P +RY D PW WD +
Sbjct: 551 -------------------GSAPAALSKDSVVIPLPYRIEPLERYGPRDEPWIWDLPRPE 591
Query: 418 KDVYGQV 424
D GQ
Sbjct: 592 PDALGQT 598
>gi|428172199|gb|EKX41110.1| hypothetical protein GUITHDRAFT_142267 [Guillardia theta CCMP2712]
Length = 465
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 116/298 (38%), Positives = 170/298 (57%), Gaps = 15/298 (5%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MV WLL +L+ IP V+ ++ ++ + + PP P +G HHSK
Sbjct: 163 MVQERWLLSEIALLSSIPRVVFMY---PFLSSLASPPSSSSIVRYAPPTP-QYGVHHSKV 218
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
MLL Y GVR++V TAN IH D + + LW QDFPLK + E FE+DL+ Y
Sbjct: 219 MLLGYNTGVRVVVMTANHIHGDHYDMTDALWAQDFPLKGEGE--ERSEFEDDLVSYFQAT 276
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
+W LP K++ + ++++F +A +++ASVPG H G + WGHMK+R +L
Sbjct: 277 QWK--GTTLPC--GSKLDAQYLRRYSFKNARAKIVASVPGRHQGEKMHMWGHMKMRRILS 332
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGE--PLIVWPTV 238
TF+ F K P+V+Q +S+GSL EKW+ E +SS+ G + + +G E P +WPT+
Sbjct: 333 RETFDPLFNKCPMVWQCTSIGSLSEKWIEEFTSSLCEGKNTEGKNIGRPEEPPHFIWPTM 392
Query: 239 EDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG---RSRAMPHIKTFARY 293
E+VR S +GY G +IP KNV K FL K + +W + + R RAMPHIKT+ R+
Sbjct: 393 EEVRTSSKGYTMGESIPGFSKNVHKPFLLKMFCRWSSGSSDPQLRRRAMPHIKTWLRF 450
>gi|302790465|ref|XP_002977000.1| hypothetical protein SELMODRAFT_416931 [Selaginella moellendorffii]
gi|300155478|gb|EFJ22110.1| hypothetical protein SELMODRAFT_416931 [Selaginella moellendorffii]
Length = 301
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 99/175 (56%), Positives = 130/175 (74%), Gaps = 8/175 (4%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVDI+WLL ACP+L I V++IHGES+ + ++ KP+N +L KP L I++GT HS
Sbjct: 129 MVDIEWLLSACPLLRTILQVVMIHGESN--VSQLQSVKPSNRLLFKPRLWIAYGTPHS-- 184
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
LL+YP GV+++VHTANLI++DWNNK+QGLWMQDFP K + S+ FENDL+DYL+ L
Sbjct: 185 -LLVYPTGVQVVVHTANLINIDWNNKNQGLWMQDFPFKSKTGASD---FENDLVDYLTAL 240
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
+W + ++ HG KIN F+ F FS+AAVRL+ASVPGYH+G L KWGHMKL
Sbjct: 241 EWLGCTVDVQHHGKMKINVGHFRNFYFSNAAVRLVASVPGYHSGPQLNKWGHMKL 295
>gi|324510072|gb|ADY44216.1| Tyrosyl-DNA phosphodiesterase [Ascaris suum]
Length = 452
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 139/439 (31%), Positives = 213/439 (48%), Gaps = 76/439 (17%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPISFGT 55
M+D+ WLL P + +I GE++GT +R K N + + L + +GT
Sbjct: 75 MIDLHWLLSQYPERCSAYPISIIVGENNGTNHLDVRAEARRCKADNVSVGRARLVLPYGT 134
Query: 56 HHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 114
HHSK ++ + +++ TANL+ DW++K+Q + P+ + + F DLI
Sbjct: 135 HHSKLSIFETDSEMIHVVISTANLLQNDWDSKTQAFYHCSAPIVNGEVEEGQNNFRKDLI 194
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL+ ++ G + +FS R+I+S+PGYH G ++GH++
Sbjct: 195 SYLNAY------SSSSDFGMIEYWRDRIANADFSDVNARIISSIPGYHVGDQKDRYGHLR 248
Query: 175 LRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK---WM-AELSSSMSSGFSEDKTPLGIGE 230
LR VL+ + KK V QFSS+GSL K W+ A+ S++ G P+
Sbjct: 249 LRRVLRSLQLD--LKKPSFVAQFSSIGSLGPKPDSWLTAQFLQSLAGGI-----PVPESS 301
Query: 231 PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKT 289
+++P VEDVR S+EGY AG A+P + + +L + KW+ GR+RAMPHIK+
Sbjct: 302 LRLIYPCVEDVRNSVEGYMAGGALPYQRNTAARQPYLLERMHKWRCERFGRTRAMPHIKS 361
Query: 290 FARYNGQKL--AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
++ ++ + +W L+TSANLSKAAWG LQK SQL IRSYELGVL+
Sbjct: 362 YSAFSDGRCLPSWLLITSANLSKAAWGELQKKESQLAIRSYELGVLL------------- 408
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
T+ +Q +PY++P ++ D
Sbjct: 409 -------------TDEDSLQL------------------------LPYDMPLTKFEPGDQ 431
Query: 408 PWSWDKRYTKKDVYGQVWP 426
PW D YTK D++G WP
Sbjct: 432 PWVCDDTYTKPDIHGATWP 450
>gi|71423941|ref|XP_812626.1| tyrosyl-DNA phosphodiesterase (Tdp1) [Trypanosoma cruzi strain CL
Brener]
gi|70877431|gb|EAN90775.1| tyrosyl-DNA phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 158/491 (32%), Positives = 241/491 (49%), Gaps = 79/491 (16%)
Query: 1 MVDIDWLLPACPVLAKIPH-VLVIHGE--------SDGTLEHMKRNKPANWILHKPPLPI 51
M+DI+WL+ P L + + ++ GE S ++K K + +P LP+
Sbjct: 50 MIDIEWLVRVAPSLLQTKQQIFIVSGEKEYEKKIQSSFLFRYIKAEKVR---IVEPKLPL 106
Query: 52 SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSE 105
FG HHSK +L + G+R+ V TAN I DW KSQG+++QDFP K DQ NL+
Sbjct: 107 PFGVHHSKLVLCVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQNSPKTDQANLTF 166
Query: 106 ECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 158
G F+N+L+ YL+ + N A I + F + +FS+ V +I S+
Sbjct: 167 SAGNEIRGNKFKNELLRYLNCYGIISNTENTVA-----IPSTLFDEIDFSAVCVEIITSI 221
Query: 159 PGYHTGSSLKKWGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 216
PGYH + + +G ++ VL E + L++QFSS G L ++ L ++MS
Sbjct: 222 PGYHRYTDIHSFGLGRIPKVLHSIDTELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMS 281
Query: 217 SGFSE----DKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW 270
+ + +K PL PL IV+PT +VR SLEG+ G ++P + ++ +
Sbjct: 282 TEWKSIEEANKKPL---RPLVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINRRL 337
Query: 271 AKWKASHTG-------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNS 321
+W G R RA+PH+KT+ R N +K + WF+LTSANLS+AAWG QK
Sbjct: 338 HRWGQGTRGLCKIEFLRRRALPHLKTYMRLNEKKDGIKWFILTSANLSRAAWGEWQKKGD 397
Query: 322 QLMIRSYELGVLILPS---AKRHGCGFSCTSNI---VPSEIK-SGSTETSQIQKTKLVTL 374
QL IRSYE GV+ + G FS T + +PS ++ G E Q K
Sbjct: 398 QLAIRSYEFGVVYGKGSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK---- 453
Query: 375 TWHGSSDAGASSEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDV 420
+ + G S + Y P+ PY ++ QR +++D+PW D + KDV
Sbjct: 454 ---QNIEEGPSLFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDV 510
Query: 421 YGQVWPRHFQL 431
+G+ R +L
Sbjct: 511 FGKEIHRAMEL 521
>gi|219113113|ref|XP_002186140.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582990|gb|ACI65610.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 305
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 175/304 (57%), Gaps = 20/304 (6%)
Query: 51 ISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQNNLSEE 106
I +G HHSK L+ Y + +RII+HTAN+ + D + K+Q + QDF LK + N++
Sbjct: 1 IPYGVHHSKFFLVGYADQSLRIIIHTANIRYDDIHCKAQAAFFQDFGLKSPENFTNVANT 60
Query: 107 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 166
C FE DLIDYL + ++ + K F ++++FSSA L+ S PGYH
Sbjct: 61 CEFEEDLIDYLDSYRYTRLHKWTKSGSKTKSLGQFVREYDFSSAKAVLVPSTPGYHRLDE 120
Query: 167 LKKWGHMKLRTVLQECTF--EKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKT 224
+ GH K+R + T E+ P+V QFSS+GSL E+++ EL +SM S D+
Sbjct: 121 KHRRGHWKMRQTIPSHTEAPEEETICDPIVCQFSSIGSLTERYLLELQTSMDMKQSRDRG 180
Query: 225 PLGIGE--PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG--- 279
G E +V+PTVE++R S+EGY G ++P +NV K FLK+ + +W A +
Sbjct: 181 RPGRLELSLKLVYPTVEEIRTSVEGYRGGGSVPGTMRNVGKPFLKRLFCRWSALSSSDMN 240
Query: 280 ---RSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNN----SQLMIRSYEL 330
+ R +PH+KT+ + N + L WF+LTS NLSKAAWG +Q ++ +L +R +EL
Sbjct: 241 PLWKGRNVPHMKTYFQTNSTTETLHWFVLTSHNLSKAAWGEIQTSSRYGGRRLFVRHWEL 300
Query: 331 GVLI 334
GV +
Sbjct: 301 GVFL 304
>gi|397627380|gb|EJK68455.1| hypothetical protein THAOC_10361 [Thalassiosira oceanica]
Length = 656
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 150/496 (30%), Positives = 234/496 (47%), Gaps = 98/496 (19%)
Query: 1 MVDIDWLLP-ACPVLAKIPHVLVIHGES-----------DGTLEHMKR---------NKP 39
++D +L A P L + V+V +G S + LE R + P
Sbjct: 186 LIDFSYLFQRASPELLQFQRVVVFYGTSGQACPAVMRQWERLLEGTGRTVAFVQLLPSDP 245
Query: 40 ANWILHKPPLPISFGTHHSKAMLLIYP------RGVRIIVHTANLIHVDWNNKSQGLWMQ 93
N + P+ I +G HH+K L+ Y + +HT+N++H D KSQG++ Q
Sbjct: 246 PNSRANPLPVKIEYGVHHTKMFLMGYEDEESGISKCHVSIHTSNILHSDAELKSQGVYAQ 305
Query: 94 DFPLK------DQNNLSEECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPS 140
DFPLK N S+E FE+DL+ Y+ + ++ + + +F ++
Sbjct: 306 DFPLKVAPGKSTGNPYSKEEDASKTPRQFEDDLVTYMESYRYQARQSWCSSSASFGLSNQ 365
Query: 141 ------FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR-TVLQECTFEKGFKKSPL 193
+ ++FS+A LI SVPG H + + ++G++KLR V+Q + SPL
Sbjct: 366 PMTILQLIRAYDFSTAYCVLIPSVPGRHRANDMHEYGYLKLRKAVIQHA---RSQTNSPL 422
Query: 194 VYQFSSLGSLDEKWMAELSSSMSSGF--------SEDKTPLGIGEPL----IVWPTVEDV 241
+ QFSSLGSL+ KW+++ S + S S+ K G + IVWP+VE+V
Sbjct: 423 LLQFSSLGSLNGKWLSQFLSCLDSSAQSFDPVTESDKKKSKGTSDLASRMKIVWPSVEEV 482
Query: 242 RCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR------SRAMPHIKTFAR--Y 293
R +EGY+ G AIP KN++K FL + +W + + S+ PHIKTF +
Sbjct: 483 RTCVEGYSGGGAIPGRTKNLEKAFLMPLYHRWSSRNPNNEGPLKTSKHAPHIKTFVQPSS 542
Query: 294 NGQKLAWFLLTSANLSKAAWGALQKNNSQ-----LMIRSYELGVLILPSAKRHGCGFSCT 348
+G ++ W LL S NLS AA G +QK + L IR +ELGV I P + +
Sbjct: 543 DGTEIEWMLLGSHNLSIAALGQIQKRHKDSSEKILFIRHWELGVFISPRTLKQAGNYD-- 600
Query: 349 SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVP 408
K VTL + + SE V +P+PY+L P Y++EDV
Sbjct: 601 --------------------GKDVTLVPYRGGGMSSGSE-VQVPLPYDLNPTPYNNEDVT 639
Query: 409 WSWDKRYTKKDVYGQV 424
W+ D+ D +G++
Sbjct: 640 WAVDRTTFLPDRFGRI 655
>gi|343477672|emb|CCD11565.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 548
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 134/367 (36%), Positives = 200/367 (54%), Gaps = 47/367 (12%)
Query: 1 MVDIDWLLPAC-PVLAKIPHVLVIHGESDGTL---------EHMKRNKPANWILHKPPLP 50
++D++WL P+L +++I GE G L + RN+ + +P LP
Sbjct: 49 VIDVEWLFRVSGPLLMSKCTIVLISGEK-GFLHKYRHLVLHDRFGRNRVK---IVEPCLP 104
Query: 51 ISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-LKDQN-----NLS 104
I FG HHSK ML I G+R+ V TAN I DWN K+QG++ QDFP LK Q+ N+S
Sbjct: 105 IPFGVHHSKMMLCINNNGIRVAVLTANFIEDDWNYKTQGIYFQDFPRLKTQSENIVLNIS 164
Query: 105 EECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 160
G F N++ YLS + ++++P G + S +F+FS A V LIASVPG
Sbjct: 165 SIEGKGMRFRNEIKRYLSCIG---VASSMPKDGCIPL--SLLDEFDFSGACVELIASVPG 219
Query: 161 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP--LVYQFSSLGSLDEKWMAELSSSMSSG 218
YH S + +G KL+++LQ ++P L +QF+S G L ++ + MS
Sbjct: 220 YHRCSDAQHYGLGKLKSILQSMQLPSSLDRNPPVLTWQFTSQGLLTANFLNSMKQIMS-- 277
Query: 219 FSEDKTPLGIG--EPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWK 274
+ + P G +P+ +V+PT +V+ SLEG+ G ++P + ++ + +W
Sbjct: 278 -IDARNPTGEDKMDPVVRVVYPTETEVKNSLEGWRGGLSLPVTLRCC-HSYINERLFRWG 335
Query: 275 ASHTG------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNSQLMIR 326
G RS+ +PH+KT+ R + L+WFLLTSANLS+AAWG Q +QL+IR
Sbjct: 336 TVPQGSEVENERSKGLPHLKTYTRLTESEDGLSWFLLTSANLSRAAWGEWQHGGTQLLIR 395
Query: 327 SYELGVL 333
SYELGVL
Sbjct: 396 SYELGVL 402
>gi|340055492|emb|CCC49811.1| putative tyrosyl-DNA Phosphodiesterase (Tdp1) [Trypanosoma vivax
Y486]
Length = 548
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 160/482 (33%), Positives = 223/482 (46%), Gaps = 75/482 (15%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI------LHKPPLPISFG 54
++D +WLL P + L I G H + A + + +PP+P+ FG
Sbjct: 48 LIDPEWLLRVAPAITCTSRQLFIITGERGFAHHFASSTMAAHMGAGRVTVIEPPMPLPFG 107
Query: 55 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-----------LKDQNNL 103
HH+K +L I RG+R+ V TAN I DW+ K+QG++MQDFP L
Sbjct: 108 VHHTKLVLGINSRGLRVAVLTANFIEEDWDMKAQGIYMQDFPRSLTPDKEGRYTAQSATL 167
Query: 104 SEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 161
E G F ++L YL + + +G I PS F +FSSA+V LIASVPGY
Sbjct: 168 QEGRGERFRSELRRYLHS-----YGLLSDENGLKGIPPSHFDGIDFSSASVELIASVPGY 222
Query: 162 HTGSSLKKWGHMKLRTVLQECTFEKGFK--KSPLVYQFSSLGSLDEKWMAELSSSMSSGF 219
H G +G +L V+Q K L +QFSS G L EK++ L +M
Sbjct: 223 HRGGEAYSFGMGRLLKVVQSVQMGPILDGGKPILTWQFSSQGLLTEKFLKSLEDAMLGNH 282
Query: 220 ---SEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWK 274
+ D+ P EP +V+PT +V+ SLEG+ G ++P + ++ +W
Sbjct: 283 AVGATDRRP----EPEVRVVYPTESEVKNSLEGWRGGMSLPVRLRCCHP-YINARMHRW- 336
Query: 275 ASHTG---------RSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQL 323
H G R RAMPH+KT+ R L WFLLTSANLS+AAWG Q+N SQL
Sbjct: 337 -CHRGVSEAVNKPVRGRAMPHLKTYMRLAEGEDSLHWFLLTSANLSRAAWGEWQRNGSQL 395
Query: 324 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH-GSSDA 382
IRSYELGVL S C + PS S ++ L+ L G++D
Sbjct: 396 AIRSYELGVL-YDSKSFINCAEGELFVVTPSR---RIPLPSSVEGDGLLRLHIRAGANDI 451
Query: 383 GASSEVVYLPV------PYELPPQR---------------YSSEDVPWSWDKRYTKKDVY 421
+ V++LP PYE Q S++DVPW D + +D
Sbjct: 452 IGEAPVLFLPYDALHPEPYESTLQLRKNHGSSVENESHAPLSTKDVPWVVDAPHHGRDAL 511
Query: 422 GQ 423
G+
Sbjct: 512 GK 513
>gi|407867395|gb|EKG08563.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 158/491 (32%), Positives = 240/491 (48%), Gaps = 79/491 (16%)
Query: 1 MVDIDWLLPACPVLAKIPHVL-VIHGE--------SDGTLEHMKRNKPANWILHKPPLPI 51
M+DI+WL+ P L + L ++ GE S ++K K + +P LP+
Sbjct: 50 MIDIEWLVRVAPSLLQTKQQLFIVSGEKEYEKKIQSSFLFRYIKAKKIR---IVEPKLPL 106
Query: 52 SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSE 105
FG HHSK +L + G+R+ V TAN I DW KSQG+++QDFP K D+ NL+
Sbjct: 107 PFGVHHSKLVLCVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQNSPKTDRANLTF 166
Query: 106 ECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 158
G F+N+L+ YL+ + N A I + F + +FS+ V +I S+
Sbjct: 167 SAGNEIRGNNFKNELLRYLNCYDIISNTENTEA-----IPSTLFDEIDFSAVCVEIITSI 221
Query: 159 PGYHTGSSLKKWGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 216
PGYH + + +G ++ VL E + L++QFSS G L ++ L ++MS
Sbjct: 222 PGYHRYTDIHSFGLGRIPKVLHSIDTELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMS 281
Query: 217 SGFSE----DKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW 270
+ + +K PL PL IV+PT +VR SLEG+ G ++P + ++
Sbjct: 282 TEWKSIEEANKKPL---RPLVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINGRL 337
Query: 271 AKWKASHTG-------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNS 321
+W G R RA+PH+KT+ R N +K + WF+LTSANLS+AAWG QK
Sbjct: 338 HRWGQGTRGLCKIEFLRRRALPHLKTYMRLNEKKDGIKWFILTSANLSRAAWGEWQKKGD 397
Query: 322 QLMIRSYELGVLILPS---AKRHGCGFSCTSNI---VPSEIK-SGSTETSQIQKTKLVTL 374
QL IRSYE GV+ + G FS T + +PS ++ G E Q K
Sbjct: 398 QLAIRSYEFGVVYGKGSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK---- 453
Query: 375 TWHGSSDAGASSEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDV 420
+ + G S + Y P+ PY ++ QR +++D+PW D + KDV
Sbjct: 454 ---QNIEEGPSLFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDV 510
Query: 421 YGQVWPRHFQL 431
+G+ R +L
Sbjct: 511 FGKEIHRAMEL 521
>gi|71404281|ref|XP_804861.1| tyrosyl-DNA Phosphodiesterase (Tdp1) [Trypanosoma cruzi strain CL
Brener]
gi|70868036|gb|EAN83010.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 154/483 (31%), Positives = 238/483 (49%), Gaps = 79/483 (16%)
Query: 1 MVDIDWLLPACPVLAKIP-HVLVIHGE--------SDGTLEHMKRNKPANWILHKPPLPI 51
M+DI+WL+ P L + + ++ GE S ++K K + +P LP+
Sbjct: 50 MIDIEWLVRVAPSLLQTKKQLFIVSGEKEYEKKIQSSFLFRYIKAEKVR---IVEPKLPL 106
Query: 52 SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSE 105
FG HHSK +L + G+R+ V TAN I DW KSQG+++QDFP K D+ NL+
Sbjct: 107 PFGVHHSKLVLCVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQTSPKTDRANLTF 166
Query: 106 ECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 158
G F+N+L+ YL+ + N A I + F + +FS+ V +I S+
Sbjct: 167 SAGNEIRGNKFKNELLRYLNCYGIISNTENTVA-----IPSTLFDEIDFSAVCVEIITSI 221
Query: 159 PGYHTGSSLKKWGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 216
PGYH + + +G ++ VL E + L++QFSS G L ++ L ++MS
Sbjct: 222 PGYHRYTDIHSFGLGRIPKVLHSIDMELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMS 281
Query: 217 SGFSE----DKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW 270
+ + +K PL P+ IV+PT +VR SLEG+ G ++P + ++ +
Sbjct: 282 TEWKSIEEANKKPL---RPVVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINRRL 337
Query: 271 AKWKASHTG-------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNS 321
+W G R RA+PH+KT+ R +K + WF+LTSANLS+AAWG QK
Sbjct: 338 HRWGQGTRGLCKMEFLRRRALPHLKTYMRLTEKKDGIKWFILTSANLSRAAWGEWQKKGD 397
Query: 322 QLMIRSYELGVLILPS---AKRHGCGFSCTSNI---VPSEIK-SGSTETSQIQKTKLVTL 374
QL IRSYE GV+ S + G FS T + +PS ++ G E Q K
Sbjct: 398 QLAIRSYEFGVVYGKSSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK---- 453
Query: 375 TWHGSSDAGASSEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDV 420
+ + G S + Y P+ PY ++ QR +++D+PW D + KDV
Sbjct: 454 ---QNIEKGPSLFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDV 510
Query: 421 YGQ 423
+G+
Sbjct: 511 FGK 513
>gi|67609723|ref|XP_667058.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54658157|gb|EAL36834.1| hypothetical protein Chro.70273 [Cryptosporidium hominis]
Length = 511
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 133/391 (34%), Positives = 203/391 (51%), Gaps = 58/391 (14%)
Query: 45 HKPPLPISFGTHHSKAMLLIY-----PRG--VRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
+ P L + +G H K +LL++ P+ VR +V +ANLI DW K Q +W+QDF
Sbjct: 149 YSPYLKVPYGVFHPKFILLVFEHLVQPKKNFVRFVVTSANLIQQDWELKIQSIWVQDFF- 207
Query: 98 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSAAVRLIA 156
N ++C F +DYL EF N+ K S ++FNF A V+L+A
Sbjct: 208 --HNIERKDCEF----LDYLQ-----EFLKNILNGSKLKDFWLSKVQEFNFEDATVKLVA 256
Query: 157 SVPGYHTGSSLKKWGHMKLRTVL-------QECTFEKGFKKSPLVYQFSSLGSLDEKWM- 208
SVPGY G + WGH+++R+++ Q + E G K+ ++ QFSSLG + EKW+
Sbjct: 257 SVPGYFFGDEMFMWGHLRVRSLIKRFVSKDQGKSDELGEKRERIILQFSSLGRISEKWLY 316
Query: 209 AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLK 267
EL+SS+S + P G L I++PTVE V S+EG G ++P ++ + K ++K
Sbjct: 317 TELASSLS------EIP---GTKLEIIFPTVEQVVNSIEGINGGGSLPVKKEYICKPWIK 367
Query: 268 KYWAKWKASHTGRS----RAMPHIKTFARY----NGQKLAWFLLTSANLSKAAWGALQKN 319
K KW ++ + +PHIKTF +Y N K+ W + S NLS AAWG +QK+
Sbjct: 368 KLLHKWGTGTMEKNATDEKVIPHIKTFLKYKIFDNAIKIIWLVQGSYNLSNAAWGQIQKD 427
Query: 320 NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 379
SQ IR+YELG+ I H F +E E + + ++
Sbjct: 428 GSQFCIRNYELGIFI------HKDQFEFERYFKLNE------EFPKFFWKRKSNFSFISE 475
Query: 380 SDAGASSEVVYLPVPYELPPQRYSSEDVPWS 410
+A ++ P+P++LPP+RYS+ D PW+
Sbjct: 476 INANKPIRLLNFPLPFKLPPKRYSNSDHPWN 506
>gi|209879936|ref|XP_002141408.1| tyrosyl-DNA phosphodiesterase family protein [Cryptosporidium muris
RN66]
gi|209557014|gb|EEA07059.1| tyrosyl-DNA phosphodiesterase family protein [Cryptosporidium muris
RN66]
Length = 513
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 132/461 (28%), Positives = 220/461 (47%), Gaps = 87/461 (18%)
Query: 1 MVDIDWLLPAC---PVLAKIPHVLVIHGES---DGTLEHMKRNKPANWILHKPPLPISFG 54
++DI WL + K+ +L+IHG S D T E N N+ + P +P+ +G
Sbjct: 80 IIDIKWLFKEVRLNKIDEKLNRLLIIHGGSCNLDDTTEIQILNIAKNYEIQCPTMPLPYG 139
Query: 55 THHSKAMLLIYPRG----------VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
H K ++L + + +R+++ TAN + DW K+Q +W+QDF L + +N +
Sbjct: 140 VFHPKFLILKFSKQDPIIKKEESFIRLVITTANFLESDWKFKTQAVWVQDFLLANNSNGA 199
Query: 105 EE---CGFENDLIDYL-STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 160
+ C + ++++ S ++ +F ++L K++++ +A V L+ASVPG
Sbjct: 200 MKNPFCEYFGMFLNHIISKIEHKKFWSDL------------IKQYDYDNATVDLVASVPG 247
Query: 161 YHTGSSLKKWGHMKLRTVLQE----------------CTFEK-----GFKKSPLVYQFSS 199
YH G ++K WGH++++ +++ C E+ +S ++ QFSS
Sbjct: 248 YHKGENMKLWGHLRMKEIMKYKTDLNSTLNIEQPNRICKVEQYNNEYRHVESRIICQFSS 307
Query: 200 LGSLDEKWMA-ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQ 258
LG EKW+ E S+++ +E T +V+PT E V SLEG G +IP
Sbjct: 308 LGKFSEKWLTQEFGDSLNTCINEYTTKSSFE---LVYPTAEQVYKSLEGIYGGGSIPVKH 364
Query: 259 KNVDKDFLKKYWAKWKASHTG----RSRAMPHIKTFARY--NGQK----LAWFLLTSANL 308
N+ K ++ K W + R ++PHIKTF RY N + + W S NL
Sbjct: 365 NNITKSWISKILHLWGSGTLSNPSIRDLSVPHIKTFLRYLWNSDRKTVSIPWIFYGSHNL 424
Query: 309 SKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQK 368
AAWG LQ N +Q+ IR+YELGV+I P + + I++ T +
Sbjct: 425 GPAAWGQLQNNQTQMCIRNYELGVIITPYTLYNNVKY----------IRTKRNRTPKFIW 474
Query: 369 TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
TK+ T S+ + VP+ +PP +Y + D PW
Sbjct: 475 TKMET----------KSTPNYNIRVPFSIPPIQYKTNDTPW 505
>gi|170587939|ref|XP_001898731.1| Tyrosyl-DNA phosphodiesterase family protein [Brugia malayi]
gi|158592944|gb|EDP31539.1| Tyrosyl-DNA phosphodiesterase family protein [Brugia malayi]
Length = 454
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 129/349 (36%), Positives = 177/349 (50%), Gaps = 26/349 (7%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA-----NWILHKPPLPISFGT 55
M+D+ WLL P + + +I GE GT + R N + + L I FGT
Sbjct: 75 MIDLRWLLEQYPARLRQGPITLIVGERMGTDFTLTRTAVKQCGVNNVTVGRARLMIPFGT 134
Query: 56 HHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FEND 112
HHSK + G V I++ TANL+ DWN K+Q + + +N G F+ D
Sbjct: 135 HHSKISIFESSTGRVHIVISTANLLENDWNFKTQAFYHCSGIERSADNRCNPNGSDFQAD 194
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
+ YL+ K + G + N S R++ SVPG H G L K+GH
Sbjct: 195 FVKYLNEYKTSQ------DWGLIEYWRDRVASINLSHVKARIVYSVPGAHKGVQLTKYGH 248
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGI 228
+LR +L+E + QFSSLGSL + W+ + +S++ G D L
Sbjct: 249 PRLRVILKELFGNVKMDEFTYHVQFSSLGSLGAAPQYWLTGQFLNSLAGGAETDGKHL-- 306
Query: 229 GEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 287
I++P VEDVR S EGY AG + P + V + +L + KW+++H GRSRAMPHI
Sbjct: 307 ---RIIYPCVEDVRNSNEGYQAGGSFPYNNSVAVKQPYLLDFMYKWRSNHLGRSRAMPHI 363
Query: 288 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
KT+A + N K W L+TSANLSKAAWG Q +QL IRSYE GVL
Sbjct: 364 KTYAAFAKNSLKPLWLLVTSANLSKAAWGDYQLKKTQLTIRSYEFGVLF 412
>gi|84043866|ref|XP_951723.1| tyrosyl-DNA phosphodiesterase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|33348708|gb|AAQ16032.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
gi|62358538|gb|AAX78999.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei]
Length = 553
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 158/488 (32%), Positives = 235/488 (48%), Gaps = 88/488 (18%)
Query: 1 MVDIDWL--LPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI------LHKPPLPIS 52
++D++W+ + C L+ HV+++ GE +G E + A + + KP LP+
Sbjct: 51 LIDLEWVFDMATCLQLSNC-HVMIVSGE-EGLAERYAASPLAGLLGKERVEIIKPKLPLP 108
Query: 53 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-----------LKDQN 101
FG HH K +L + +GVRI V TAN I DW K+QG+++QDFP +
Sbjct: 109 FGVHHGKLILCVNSKGVRISVLTANFIESDWGKKTQGIYVQDFPRLVTSSASSNSMGSLQ 168
Query: 102 NLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 159
L G F+ ++ YLS + A G I S + ++S A V L++SVP
Sbjct: 169 ALRRCRGTRFKEEIKRYLSCI-----GAISSTTGTNCIPLSLLDEVDYSGACVELVSSVP 223
Query: 160 GYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 217
G H S ++G +L+ VL+ + + G LV+QFSS G+L ++ L M+
Sbjct: 224 GCHRNSDAYRFGMGRLQEVLRAMQISSPSGENSPTLVWQFSSQGTLTSNFLRSLERVMT- 282
Query: 218 GFSEDKTPL-GIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKW- 273
S D TPL P I++PT +V+ S EG+ G ++P + ++ + +W
Sbjct: 283 -ISTDNTPLPDTKSPTVRIIYPTEAEVKGSFEGWHGGLSLPVRLRCCHP-YVNERLYRWG 340
Query: 274 -----KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIR 326
+ + GR+RAMPHIKT+ R NG L WF+LTSANLS+AAWG QK +Q++IR
Sbjct: 341 QRPYAEGADRGRNRAMPHIKTYMRLTENGDGLKWFMLTSANLSRAAWGEWQKGGTQILIR 400
Query: 327 SYELGVL------ILPSAKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTWH 377
SYELGV+ I P+ G FS T + VPS I + + K+ TL
Sbjct: 401 SYELGVVYGTDSFINPA---DGGLFSATPSKPIPVPSSIGGDG-----LVRVKIKTL--- 449
Query: 378 GSSDAGASSEVVYLPVPYELPPQRY----------------------SSEDVPWSWDKRY 415
S++ ++LP L PQ Y SS DVPW D +
Sbjct: 450 -PSESDRDEPTLFLPY-NPLNPQPYVSTLQMQQREHRHTGHSCVSQLSSLDVPWLVDLPH 507
Query: 416 TKKDVYGQ 423
KD G+
Sbjct: 508 RGKDCLGK 515
>gi|402592672|gb|EJW86599.1| tyrosyl-DNA phosphodiesterase [Wuchereria bancrofti]
Length = 453
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 131/349 (37%), Positives = 177/349 (50%), Gaps = 26/349 (7%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPISFGT 55
M+D+ WLL P + + +I GE GT +K+ N I+ + L I FGT
Sbjct: 74 MIDLRWLLEQYPARLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVIVGRARLMIPFGT 133
Query: 56 HHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FEND 112
HHSK + G V I++ TANL+ DWN K+Q + +N G F+ D
Sbjct: 134 HHSKISIFESSTGRVHIVISTANLLENDWNFKTQAFYHCSGIELSADNRCNPNGSDFQAD 193
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
+ YL+ K + G + N S R++ SVPG H G L K+GH
Sbjct: 194 FVKYLNEYKTSQ------DWGLIEYWRDRVASINLSHVKARVVYSVPGAHKGVQLTKYGH 247
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGI 228
+LR +L+E + QFSSLGSL + W+ + +S+S G D L
Sbjct: 248 PRLRVILKELFGNVKMDEFTYHAQFSSLGSLGAAPQYWLTGQFLNSLSGGAETDGKHL-- 305
Query: 229 GEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 287
I++P VEDVR S EGY AG + P + V + +L + KW++ H GRSRAMPHI
Sbjct: 306 ---RIIYPCVEDVRNSNEGYQAGGSFPYNNSVAVKQPYLLDFMHKWRSDHLGRSRAMPHI 362
Query: 288 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
KT+A + N K W L+TSANLSKAAWG Q +QL IRSYE GVL
Sbjct: 363 KTYAAFAKNSLKPLWLLVTSANLSKAAWGNYQLKKTQLTIRSYEFGVLF 411
>gi|66362892|ref|XP_628412.1| tyrosyl-DNA phodphodiesterase 1 (tdp1) [Cryptosporidium parvum Iowa
II]
gi|46229443|gb|EAK90261.1| tyrosyl-DNA phodphodiesterase 1 (tdp1) [Cryptosporidium parvum Iowa
II]
Length = 511
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 130/390 (33%), Positives = 199/390 (51%), Gaps = 56/390 (14%)
Query: 45 HKPPLPISFGTHHSKAMLLIY-----PRG--VRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
+ P L + +G H K +LL++ P+ VR +V +ANLI DW K Q +W+QDF
Sbjct: 149 YSPYLKVPYGVFHPKFILLVFEHLVQPKKNFVRFVVTSANLIQQDWELKIQSIWVQDFFH 208
Query: 98 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSAAVRLIA 156
+ ++C F +DYL EF N+ K S ++FNF A V+L+A
Sbjct: 209 SIE---RKDCEF----LDYLQ-----EFLKNILNGSKLKDFWLSKVQEFNFEDATVKLVA 256
Query: 157 SVPGYHTGSSLKKWGHMKLRTVL-------QECTFEKGFKKSPLVYQFSSLGSLDEKWM- 208
SVPGY G + WGH+++R+++ Q+ + E K+ +V QFSSLG + EKW+
Sbjct: 257 SVPGYFFGDEMFMWGHLRVRSLIKRFVSKDQDKSDELREKRERIVLQFSSLGRISEKWLY 316
Query: 209 AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK 268
EL+SS+S + E I++PTVE V S+EG G ++P ++ + K ++KK
Sbjct: 317 TELASSLSE--------IPGTELEIIFPTVEQVVNSIEGINGGGSLPVKKEYICKPWIKK 368
Query: 269 YWAKWKASHTGRS----RAMPHIKTFARY----NGQKLAWFLLTSANLSKAAWGALQKNN 320
KW ++ + +PHIKTF +Y N K+ W + S NLS AAWG +QK+
Sbjct: 369 LLHKWGTGTMKKNATDEKVIPHIKTFLKYKIFGNAIKIIWLVQGSYNLSNAAWGQIQKDG 428
Query: 321 SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSS 380
SQ IR+YELG+ I F P + S I +
Sbjct: 429 SQFCIRNYELGIFIHKDQFEFERYFKLNEEF-PKFFWKRKSNCSLISEI----------- 476
Query: 381 DAGASSEVVYLPVPYELPPQRYSSEDVPWS 410
+A + ++ P+P++LPP+RYS+ D PW+
Sbjct: 477 NANQPNVLLNFPLPFKLPPKRYSNSDHPWN 506
>gi|261326666|emb|CBH09628.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei
gambiense DAL972]
Length = 553
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 158/488 (32%), Positives = 235/488 (48%), Gaps = 88/488 (18%)
Query: 1 MVDIDWL--LPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI------LHKPPLPIS 52
++D++W+ + C L+ HV+++ GE +G E + A + + KP LP+
Sbjct: 51 LIDLEWVFDMATCLQLSSC-HVMIVSGE-EGLAERYAASPLAGLLGKERVEIIKPKLPLP 108
Query: 53 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-----------LKDQN 101
FG HH K +L + +GVRI V TAN I DW K+QG+++QDFP +
Sbjct: 109 FGVHHGKLILCVNSKGVRISVLTANFIESDWGKKTQGIYVQDFPRLVTSSASSNSMGSLQ 168
Query: 102 NLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 159
L G F+ ++ YLS + A G I S + ++S A V L++SVP
Sbjct: 169 ALRRCRGTRFKEEIKRYLSCI-----GAISSTTGTNCIPLSLLDEVDYSGACVELVSSVP 223
Query: 160 GYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 217
G H S ++G +L+ VL+ + + G LV+QFSS G+L ++ L M+
Sbjct: 224 GCHRNSDAYRFGMGRLQEVLRAMQISSPSGENSPTLVWQFSSQGTLTSNFLRSLERVMT- 282
Query: 218 GFSEDKTPL-GIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKW- 273
S D TPL P I++PT +V+ S EG+ G ++P + ++ + +W
Sbjct: 283 -ISTDNTPLPDTKSPTVRIIYPTEAEVKGSFEGWHGGLSLPV-RLRCCHPYVNERLYRWG 340
Query: 274 -----KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIR 326
+ + GR+RAMPHIKT+ R NG L WF+LTSANLS+AAWG QK +Q++IR
Sbjct: 341 QRPYAEGADRGRNRAMPHIKTYMRLTENGDGLKWFMLTSANLSRAAWGEWQKGGTQILIR 400
Query: 327 SYELGVL------ILPSAKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTWH 377
SYELGV+ I P+ G FS T + VPS I + + K+ TL
Sbjct: 401 SYELGVVYGTDSFINPA---DGGLFSATPSKPIPVPSSIGGDG-----LVRVKIKTL--- 449
Query: 378 GSSDAGASSEVVYLPVPYELPPQRY----------------------SSEDVPWSWDKRY 415
S++ ++LP L PQ Y SS DVPW D +
Sbjct: 450 -PSESDRDEPTLFLPY-NPLNPQPYVSTLQMQQRERRHTGHSCVSQLSSLDVPWLVDLPH 507
Query: 416 TKKDVYGQ 423
KD G+
Sbjct: 508 RGKDCLGK 515
>gi|345304821|ref|XP_003428263.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Ornithorhynchus
anatinus]
Length = 580
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 117/331 (35%), Positives = 185/331 (55%), Gaps = 24/331 (7%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ + + ++ KP N L + L I+FGTHH+K
Sbjct: 203 DVDWLIKQYPPEFRNKPLLLVHGDKREAKAQLHEQAKPYENICLCQAKLDIAFGTHHTKM 262
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP--LKDQNNLSEECG-FENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P +++ ++ + F+ DLI+Y
Sbjct: 263 MLLLYEEGMRVVIHTSNLIHADWHQKTQGIWLSPLYPRLVRETHSSGDSVTHFKTDLINY 322
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K+ + S V LI S PG G + WGH +LR
Sbjct: 323 LMAYNSPSLKEWI----------DIIKEHDLSETRVYLIGSTPGRFQGQKKEDWGHFRLR 372
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+L+E + ++S P+V QFSS+GS+ + KW+ +E S+ K+ G
Sbjct: 373 KLLEEHSSSIPEEESWPIVGQFSSIGSMGADESKWLCSEFKDSLVMLGKSGKSQGGHVPI 432
Query: 232 LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHIKTF 290
+++PTV++VR SLEGY AG ++P + K L Y+ KW A +GRS AMPHIKT+
Sbjct: 433 HLIYPTVDNVRKSLEGYPAGGSLPYSIQTAQKQLWLHSYFHKWSAEISGRSHAMPHIKTY 492
Query: 291 ARY--NGQKLAWFLLTSANLSKAAWGALQKN 319
R + Q++AWFL+T A+ G L +N
Sbjct: 493 MRLSPDFQQIAWFLVTRASAFDVTGGFLTEN 523
>gi|414886955|tpg|DAA62969.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
Length = 140
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 94/145 (64%), Positives = 106/145 (73%), Gaps = 6/145 (4%)
Query: 284 MPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 343
MPHIKTF RY+GQ +AWFLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LP +
Sbjct: 1 MPHIKTFTRYSGQNIAWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPQTLQSVP 60
Query: 344 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 403
FSCT I+ G I KTKLVTL W G + +V LPVPY+LPPQ Y
Sbjct: 61 QFSCTEK--SRSIRDGVALGKTI-KTKLVTLCWKGDEE---DPSIVRLPVPYQLPPQPYG 114
Query: 404 SEDVPWSWDKRYTKKDVYGQVWPRH 428
++DVPWSWD+RYTKKDVYG VWPR+
Sbjct: 115 TQDVPWSWDRRYTKKDVYGSVWPRY 139
>gi|322787271|gb|EFZ13407.1| hypothetical protein SINV_04400 [Solenopsis invicta]
Length = 647
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 134/421 (31%), Positives = 211/421 (50%), Gaps = 63/421 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+ WL + + +L+++G+ ++H K + +N + + +P FG HH+K
Sbjct: 268 MVDVGWLCLQYLLAGQRTDMLILYGDR---VDHEKLH--SNITMIEVQMPTQFGCHHTKI 322
Query: 61 MLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKDQNNLSE---ECGFENDLI 114
M+L Y G+R++V TANL DW N++QGLW+ P L + N S+ GF+ DL
Sbjct: 323 MILQYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPRLPESANPSDGESPTGFKKDLE 382
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL+ ++P+ + + A ++ NFS V L+ASVPG H + WGH K
Sbjct: 383 RYLNKYRFPDLTQWISA----------VRRANFSDVKVFLVASVPGTHKDNEADSWGHKK 432
Query: 175 LRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP-- 231
L VL + T + P+V Q SS+GSL + + LS + S + T P
Sbjct: 433 LAHVLSRHATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKEIIPCMSRETTKGLKSHPHF 492
Query: 232 LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTF 290
++P++++ + S + +P S + + + +++ Y +WKA TGR RAMPHIK++
Sbjct: 493 QFIYPSIDNYKQSFDCRNLSCCLPYSAKTHSKQQWIESYLYQWKAKRTGRDRAMPHIKSY 552
Query: 291 ARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT 348
R + + ++WF+LTSANLSKAAWG +Q+NN +M SYE GV+ +P
Sbjct: 553 TRISPDLRSISWFVLTSANLSKAAWG-MQRNNHYIM--SYEAGVVFIP------------ 597
Query: 349 SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVP 408
K +T T + V P+PY+LP RY S D P
Sbjct: 598 ---------------------KFITGTTTFPIEDEEDPAVPVFPIPYDLPLCRYESSDRP 636
Query: 409 W 409
+
Sbjct: 637 F 637
>gi|407394035|gb|EKF26770.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi
marinkellei]
Length = 551
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 147/484 (30%), Positives = 231/484 (47%), Gaps = 82/484 (16%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH---------KPPLPI 51
M+DI+WL+ P L + L I G E+ K+ + ++ + +P LP+
Sbjct: 50 MIDIEWLVCVAPSLLQTKQKLFI---VSGEKEYEKKIQSSSLFAYIKAEKVRIVEPKLPL 106
Query: 52 SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSE 105
FG HHSK +L + +G+R+ V TAN I DW KSQG+++QDFP + D+ NL+
Sbjct: 107 PFGVHHSKLVLCVNAKGIRVAVLTANFIQDDWVCKSQGIYVQDFPRRQNLPKTDRANLTF 166
Query: 106 ECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 158
G F+N+L+ YL+ + A I + F + +FS+A V +I S+
Sbjct: 167 SAGSEIRGSEFKNELLRYLNC-----YGIISNAENTVAIPSTLFDEIDFSAACVEIITSI 221
Query: 159 PGYHTGSSLKKWGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 216
PGY+ + + +G ++ VL E + L++QFSS G L ++ L ++MS
Sbjct: 222 PGYYRYNDVHSFGLGRIPKVLHSIDMELSDSIQVPLLIWQFSSQGKLTNSFLVALENAMS 281
Query: 217 ----SGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW 270
S +K PL P+ IV+PT +V+ SLEG+ G ++P + ++ +
Sbjct: 282 TEGKSNEEANKKPLC---PVVQIVYPTESEVKESLEGWRGGLSLPLRLSSCHP-YINRRL 337
Query: 271 AKWKASHTG------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNSQ 322
+W G R RA+PH+KT+ R +K + W +LTSANLS+AAWG QK +Q
Sbjct: 338 HRWGQGTRGTCKIELRRRALPHLKTYMRLTEKKDGIKWLILTSANLSRAAWGEWQKKGNQ 397
Query: 323 LMIRSYELGVLILPS---AKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTW 376
L IRSYE GV+ + G FS T + +PS ++ I +
Sbjct: 398 LAIRSYEFGVVYGKDSFISFLEGEPFSVTPSRKIPLPSLVEGDGLAEVHIDQ-------- 449
Query: 377 HGSSDAGASSEVVYLPV-PYELPP---------QR-------YSSEDVPWSWDKRYTKKD 419
G ++LP P L P QR +++D+PW D + KD
Sbjct: 450 -GGKKDIEEGPTLFLPYDPLHLEPYASTVQMQNQRGNNCDSSINTDDIPWVIDMPHFGKD 508
Query: 420 VYGQ 423
V+G+
Sbjct: 509 VFGK 512
>gi|357630668|gb|EHJ78636.1| hypothetical protein KGM_17628 [Danaus plexippus]
Length = 581
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 138/431 (32%), Positives = 209/431 (48%), Gaps = 67/431 (15%)
Query: 1 MVDIDWLLPACPVLA-KIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
MVD WLL + +++GE L ++ KP N H+ + FG HH+K
Sbjct: 202 MVDAGWLLAHYYFAGYSAKKLTILYGEESAELRNISAKKP-NVEAHQVKMATPFGKHHTK 260
Query: 60 AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFENDL 113
MLL Y G +R++V TANL DW N++QGLW+ P + ++ E GF+ L
Sbjct: 261 MMLLCYEDGSLRVVVSTANLYMDDWENRTQGLWLSPSCPQLPAESPSHSGESPTGFKRSL 320
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
+DYL + P+ + + ++ +FS V L+ SVPG H +S WG +
Sbjct: 321 LDYLHHYRLPQLAVYV----------HRVQRCDFSHINVFLVCSVPGTHYSAS---WGFL 367
Query: 174 KLRTVLQ-ECTFEKGFKKS-PLVYQFSSLGSLDEKWMAELSSSMSSGFSEDK-TPLGIGE 230
++ +L+ C +S PL+ Q SSLGS + + L+ F++ K P +
Sbjct: 368 RVGALLRAHCAVPPQETRSWPLIAQASSLGSYGKDPGSWLTGDFLHHFTKIKDQPQTLTP 427
Query: 231 P---LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPH 286
P +++P++E+V+ S +G G +P S +V + +LK + +W+A H+ R RAMPH
Sbjct: 428 PPDLKLIYPSLENVKSSHDGLLGGGCLPYSAAVHVKQPWLKDFLYQWRALHSERDRAMPH 487
Query: 287 IKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 344
IK++ R + + A++LLTS N+SKAAWG K+ L + SYE GVL LP
Sbjct: 488 IKSYTRVSPDNSRAAFYLLTSGNVSKAAWGVRNKDGG-LRLMSYEAGVLFLPR------- 539
Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
F S+ P S + LPVPY+LPPQRYS
Sbjct: 540 FVINSDFFPL-----------------------------CPSSALRLPVPYDLPPQRYSP 570
Query: 405 EDVPWSWDKRY 415
+ PW D Y
Sbjct: 571 DMSPWVSDYLY 581
>gi|148686962|gb|EDL18909.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_c [Mus musculus]
Length = 542
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 122/331 (36%), Positives = 183/331 (55%), Gaps = 28/331 (8%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ + + + KP AN L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRKNPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DL Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 328 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 377
Query: 177 TVLQ--ECTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 230
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 378 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 436
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 437 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 496
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQ 317
T+ R + KLAWFL+T K WG ++
Sbjct: 497 TYMRPSPDFSKLAWFLVTRQPAFK-YWGPVR 526
>gi|393910432|gb|EJD75879.1| TDP1 protein [Loa loa]
Length = 672
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 133/357 (37%), Positives = 180/357 (50%), Gaps = 41/357 (11%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPISFGT 55
M+D+ WLL P + + +I GE GT +K+ N + + L I FGT
Sbjct: 75 MIDLRWLLTQYPGRLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVNVGRARLMIPFGT 134
Query: 56 HHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLW--------MQDFPLKDQNNLSEE 106
HHSK + G V II+ TANL+ DWN K+Q + D P D+N
Sbjct: 135 HHSKISIFESNTGRVHIIIATANLLESDWNFKTQAFFHCSGNELAAGDCP--DRNG---- 188
Query: 107 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 166
F+ DL+ YL K + L H +++ + S R++ SVPG H G
Sbjct: 189 SDFQTDLVKYLDEYKTSQ-DWGLIEHWRDRVS-----NIDLSQVKARVVYSVPGTHKGVQ 242
Query: 167 LKKWGHMKLRTVLQECTFE----KGFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSE 221
L K+GH +LR +L+E + GF SLG+ + W+ + +S+S G
Sbjct: 243 LTKYGHPRLRVILKELFGDVKNMDGFTYHAQCSSLGSLGAAPQYWLTGQFLNSLSGGAET 302
Query: 222 DKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTG 279
D GE L I++P VEDVR S EGYAAG + P S V + +L + KW + H G
Sbjct: 303 D------GEHLRIIYPCVEDVRNSNEGYAAGGSFPYSNSVAVKQPYLLNFMHKWSSDHLG 356
Query: 280 RSRAMPHIKTFARYNGQKL--AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
RSRAMPHIKT+A + L +W L+TSANLSKAAWG Q QL IRSYE G+L
Sbjct: 357 RSRAMPHIKTYAAFAKDSLKPSWLLITSANLSKAAWGDYQSKKPQLTIRSYEFGLLF 413
Score = 38.1 bits (87), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 14/34 (41%), Positives = 20/34 (58%)
Query: 393 VPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 426
+PY+LP +Y D W DK Y K D++ + WP
Sbjct: 422 LPYDLPLTKYDDNDRVWIVDKTYRKPDIFRKTWP 455
>gi|307188952|gb|EFN73469.1| Probable tyrosyl-DNA phosphodiesterase [Camponotus floridanus]
Length = 666
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 132/422 (31%), Positives = 208/422 (49%), Gaps = 65/422 (15%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK-PANWILHKPPLPISFGTHHSK 59
MVD+ WL + + +++++GE + R K +N + +P+ FG HHSK
Sbjct: 286 MVDVGWLCLQYLLAGQRTDMMILYGE------RVDREKLGSNITMIHVDMPVRFGCHHSK 339
Query: 60 AMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL----KDQNNLSEECGFENDL 113
M+ Y G+R++V TANL DW+N++QGLW+ PL + ++ GF+ DL
Sbjct: 340 IMIFQYKDDGIRVVVSTANLYSDDWDNRTQGLWISPHLPLLPESANPSDGESPTGFKKDL 399
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
YLS + P + + A ++ NFS+ V L+ASVPG H + + WGH
Sbjct: 400 ERYLSKYRHPALTQWIWA----------VRRANFSAVNVFLVASVPGTHKDAEVDSWGHR 449
Query: 174 KLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP- 231
KL VL + T + P+V Q SS+GSL + + LS + S + T P
Sbjct: 450 KLAYVLSRHATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKDIIPCMSRETTKGLKSHPN 509
Query: 232 -LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
++P++E+ + S + +P S Q + + +++ Y +W+A T R RAMPHIK+
Sbjct: 510 FQFIYPSIENYKHSFDCRNLSCCLPYSAQVHSKQQWIESYLYQWRAKRTRRDRAMPHIKS 569
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + +++ WF+LTSANLSKAAWG +Q++N +M SYE GV+ +P
Sbjct: 570 YTRISPDLKRIPWFVLTSANLSKAAWG-VQRSNHYIM--SYEAGVIFIP----------- 615
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
K +T T + V P+PY+LP +RY S D
Sbjct: 616 ----------------------KFITQTTTFPIEDEEDPAVPIFPIPYDLPLRRYDSSDS 653
Query: 408 PW 409
P+
Sbjct: 654 PF 655
>gi|301791029|ref|XP_002930517.1| PREDICTED: LOW QUALITY PROTEIN: tyrosyl-DNA phosphodiesterase
1-like [Ailuropoda melanoleuca]
Length = 473
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 138/382 (36%), Positives = 196/382 (51%), Gaps = 57/382 (14%)
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLI 114
K MLL+Y G+ +++HT++LIH D + K+QG W+ +P + + S E F+ DLI
Sbjct: 131 KMMLLLYEEGLWVVIHTSDLIHADCHQKTQGEWLTPLYPRIIHGXHRSGESATHFKADLI 190
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL P + K + S V LI S PG GS GH +
Sbjct: 191 SYLMAYNAPSLKEWI----------DTVHKHDISETNVYLIGSTPGCFQGSRKDHXGHFR 240
Query: 175 LRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGI 228
LR +L+E + KG + P+V QFSS+GSL D KW+ +E S+++ E +TP
Sbjct: 241 LRKLLKEHASSIPKG-ESWPIVGQFSSIGSLGADDLKWLCSEFKESLATLGKESRTPGKS 299
Query: 229 GEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPH 286
PL +++P+VE+V+ SLE Y AG+++PS + +K + L Y+ K A +G + AMPH
Sbjct: 300 AVPLHLIYPSVENVQTSLEEYPAGDSLPSSIQIAEKQNCLHSYFHKXVADTSGCNNAMPH 359
Query: 287 IKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 344
IK + R + ++ W L+TS NLSK GAL+KN QLMI SYE GVL L SA
Sbjct: 360 IKRYMRPSPDFSQIVWLLVTSTNLSKTTXGALEKNGXQLMIHSYEXGVLFLLSA------ 413
Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
F S V K KL +G+ PVPY+LPP+ Y S
Sbjct: 414 FGLDSFKV---------------KQKL----------SGSKEPAATFPVPYDLPPELYGS 448
Query: 405 EDVPWSWDKRYTK-KDVYGQVW 425
+D P + YTK D +G +W
Sbjct: 449 KDRPXIXNIPYTKVPDTHGNMW 470
>gi|149025342|gb|EDL81709.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_b [Rattus norvegicus]
Length = 542
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 117/317 (36%), Positives = 174/317 (54%), Gaps = 25/317 (7%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKA 60
D++WL+ P + +L++HG+ + + + KP AN L + L I+FGTHH+K
Sbjct: 208 DVNWLIKQYPPEFRKKPILLVHGDKREAKADLHAQAKPYANISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 116
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P Q N + F+ DL Y
Sbjct: 268 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTSY 327
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + ++ + S V LI S PG GS WGH +LR
Sbjct: 328 LMAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLR 377
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 231
+LQ + P+V QFSS+GSL + KW+ +E S+ + E +TP P
Sbjct: 378 KLLQAHAPSAPRGECWPVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAVP 437
Query: 232 L-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKT 289
L +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT
Sbjct: 438 LHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGRSNAMPHIKT 497
Query: 290 FARYNGQ--KLAWFLLT 304
+ R + KLAWFL+T
Sbjct: 498 YMRPSPDFSKLAWFLVT 514
>gi|332029124|gb|EGI69135.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 667
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 133/422 (31%), Positives = 207/422 (49%), Gaps = 65/422 (15%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSK 59
MVD+ WL + + +++++G+ + R K N I + + +P FG HH+K
Sbjct: 290 MVDVGWLCLQYLLAGQCTDMMILYGD------RVDREKLNNNITMIEVDMPTKFGCHHTK 343
Query: 60 AMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKDQNNLSE---ECGFENDL 113
M+L Y G+R++V TANL DW N++QGLW+ P L + N S+ GF+ DL
Sbjct: 344 IMILQYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPRLPESANPSDGESPTGFKKDL 403
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
Y + + P + + A ++ +FS V L+ASVPG H + WG+
Sbjct: 404 ERYFNKYRHPALTQWICA----------IRRADFSDVNVFLVASVPGTHKDNEADSWGYK 453
Query: 174 KLRTVLQE-CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP- 231
KL VL T + P+V Q SS+GSL + + LS + S + T P
Sbjct: 454 KLAHVLSRYATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKDIIPCMSRETTKGLKSHPH 513
Query: 232 -LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
++P++E+ + S + +P S + + + +++ Y +WKA TGR RAMPHIK+
Sbjct: 514 FQFIYPSIENYKQSFDCRNLSCCLPYSTKVHSKQQWIESYLYQWKAKRTGRDRAMPHIKS 573
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R + ++++WF+LTSANLSKAAWG +Q+NN +M SYE GV+ +P
Sbjct: 574 YTRISPDLKRISWFVLTSANLSKAAWG-VQRNNHYIM--SYEAGVIFIP----------- 619
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
KL+T T + V P+PY+LP RY S D
Sbjct: 620 ----------------------KLITGTTTFPIEEEEDPAVPVFPIPYDLPLCRYESSDS 657
Query: 408 PW 409
P+
Sbjct: 658 PF 659
>gi|383853604|ref|XP_003702312.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Megachile
rotundata]
Length = 701
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 135/434 (31%), Positives = 213/434 (49%), Gaps = 75/434 (17%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKP-PLPISFGTHHSK 59
MVD+ WL + + +L+++G+ + K + I P +P FG HH+K
Sbjct: 325 MVDVGWLCLQYLLAGQRTDMLILYGD------RVDEEKLSLNITMIPVQMPTKFGCHHTK 378
Query: 60 AMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNLSE---ECGFENDL 113
M+L Y G+R++V TANL DW N++QGLW+ PL + N ++ GF+ DL
Sbjct: 379 IMILKYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPPLPESANTNDGESPTGFKKDL 438
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
+ YL+ + P + A ++ +FSS V IASVPG H G WGH
Sbjct: 439 LLYLNKYRQPAITEWTSA----------VRRADFSSVNVFFIASVPGRHKGVEYDSWGHR 488
Query: 174 KLRTVL-QECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGI 228
KL VL + T + LV Q SS+GSL E W+ E++SSMS ++P +
Sbjct: 489 KLGYVLSKHATLPPDAPRWTLVAQSSSIGSLGPSYESWLLKEITSSMSK-----ESPSNL 543
Query: 229 GEP---LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAM 284
++P++ + + S + +P S Q + +++++ Y +WKA+ T R +AM
Sbjct: 544 KSHPNFQFIYPSINNYKQSFDCRVGSCCLPYSLQTHSKQEWIESYMYQWKATRTARDKAM 603
Query: 285 PHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 342
PHIK++ R+ + +K+ WF+LTSANLSKAAWG + K++ +M +YE GV+ +P
Sbjct: 604 PHIKSYTRFSPDMKKIPWFVLTSANLSKAAWGTVGKDSHYIM--NYEGGVIFIPK----- 656
Query: 343 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 402
F S P + + V P+PY+LPP +Y
Sbjct: 657 --FIIGSTTFPVQEEENG---------------------------VPVFPIPYDLPPTKY 687
Query: 403 SSEDVPWSWDKRYT 416
S D P+ + Y+
Sbjct: 688 QSGDKPFVMEFFYS 701
>gi|397613425|gb|EJK62211.1| hypothetical protein THAOC_17185, partial [Thalassiosira oceanica]
Length = 576
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 144/512 (28%), Positives = 231/512 (45%), Gaps = 114/512 (22%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDG-TLEHMKR--------NKPANWILHKP---- 47
++D+++L P + K V+V +G +G +++ M++ K +I P
Sbjct: 61 LLDVEYLFEELPEIIKYQKVIVYYGSVEGNSMQAMRQWEQVLGNSGKTVEFIRLVPSDPP 120
Query: 48 -----PLP--ISFGTHHSKAMLLIYPRG----VRIIVHTANLIHVDWNNKSQGLWMQDF- 95
PLP + +G HHSK L Y RI +H+ANL D K+QG+++QDF
Sbjct: 121 YSATNPLPFKLPYGVHHSKFFLSGYEEEGKHMCRIGIHSANLRRSDIERKTQGIYVQDFP 180
Query: 96 -------------PLK-----DQNNLSEECGFENDLIDYLSTLKWPE-----FSANLPAH 132
P K + ++L + FE+DLI Y+ + ++ FS +
Sbjct: 181 AKAPKKQAAAAVNPYKRAKVDEDDDLRQ---FEDDLITYMESYRYYVRGQIWFSPSTTQS 237
Query: 133 GNFKINP----SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQEC-TFEKG 187
G + ++++FS A L+ SVPGYH + K+G+ K+ ++ + G
Sbjct: 238 GGLTDRSHSILTLLRRYDFSCAYAVLVPSVPGYHQARDMPKFGYYKIHKAVKNARSGRAG 297
Query: 188 FKKS---------PLVYQFSSLGSLDEKWMAELSSSMSSGFSEDK----------TPLGI 228
+S P+++Q SSLG++ +W+ +L +++ S + P G
Sbjct: 298 SNQSSSGETETPKPIIFQVSSLGTIQNRWLIKLLAAIDSNCHRNDPSTYLPAGKSIPQGK 357
Query: 229 GEPL-----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG---- 279
PL +VWPTVE+VR +EGYA G AIP + +DKDFL + +W T
Sbjct: 358 TPPLETRMKLVWPTVEEVRTCVEGYAGGGAIPGTTEKLDKDFLLPLYHRWSNPDTNILGP 417
Query: 280 --RSRAMPHIKTFAR-YNGQKLAWFLLTSANLSKAAWGALQ----KNNSQLMIRSYELGV 332
+R PHIKTF + +G ++ W +LTS NLSK + G Q N +LMI+ +ELGV
Sbjct: 418 LRTARYAPHIKTFVQPGDGDEIHWVVLTSHNLSKPSLGEFQTDTKTNERRLMIQHWELGV 477
Query: 333 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 392
P + ++P E E Q G DA +P
Sbjct: 478 FFSPETLTKMTSDNSPLRMIPFE------EAGQC-----------GIKDA------ALVP 514
Query: 393 VPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 424
+PY L P RY + W+ D+ + D +G+V
Sbjct: 515 LPYSLHPSRYDENEEAWATDRPASTPDAFGRV 546
>gi|313236496|emb|CBY11811.1| unnamed protein product [Oikopleura dioica]
Length = 495
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 134/439 (30%), Positives = 212/439 (48%), Gaps = 80/439 (18%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
M+D++++L P +KI L + G D + + P N P+P FGTHH+K
Sbjct: 120 MIDLEFVLKHHPNSSKI---LFVSG--DTLFQPGRDGIPDNIFQSVVPVP-QFGTHHTKM 173
Query: 61 MLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDFP--LKDQNNLSEECGFENDLIDYL 117
+L + G+R+ +++ANL+ DW ++Q +W+ LK+++ S E FE DL++Y+
Sbjct: 174 SILKFRNIGLRVAIYSANLLDYDWRERTQVIWLSPLLPLLKEKSKTSSE--FETDLVEYI 231
Query: 118 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRT 177
+ ++ L + F+K++FSS R I S PG +GH+KLR
Sbjct: 232 DSYSLAPLNSLLQS----------FEKYDFSSIKARFIGSSPGRRRDKEKWIFGHLKLRK 281
Query: 178 VLQECTFEKGFKKSPLVYQFSSLGSLDEK-------WMAEL--SSSMSSGFSEDKTPLGI 228
VL++ + K LV Q SS+GSL + ++A L S +S +++D +
Sbjct: 282 VLKKIS--NCAKNDKLVAQCSSIGSLRSRDSWLYNEFLASLMTCSDAASYYTKDNDAFSL 339
Query: 229 GEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASH-TGRSRAMPH 286
V+PTVE +RCS GY++G + P S + + + ++ Y +KW+ TGRSR MPH
Sbjct: 340 -----VYPTVEQIRCSKFGYSSGGSFPYSAKTHESQKWIIYYMSKWEPDEKTGRSRVMPH 394
Query: 287 IKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
K + R + K+ WFL S NLSKAAWG +K ++QL IRS+E VL++P +
Sbjct: 395 SKIYQRVSDGKVKWFLSGSHNLSKAAWGQYEKGDTQLHIRSFEASVLLIPE------DYG 448
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
S P+ + E Q RYS D
Sbjct: 449 LESFNFPAFPNFHNFEKIQ-----------------------------------RYSDND 473
Query: 407 VPWSWDKRYTKKDVYGQVW 425
PW +D +Y + D + Q W
Sbjct: 474 FPWLYDNKYLQPDDFNQTW 492
>gi|156400100|ref|XP_001638838.1| predicted protein [Nematostella vectensis]
gi|156225962|gb|EDO46775.1| predicted protein [Nematostella vectensis]
Length = 260
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 158/289 (54%), Gaps = 47/289 (16%)
Query: 152 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLVYQFSSLGSLDE---- 205
VRLIASVPG H G + KWGH+KLR +LQE + P++ QFSS+GSL
Sbjct: 1 VRLIASVPGRHAGLNKNKWGHLKLRKILQEHGPPSSDVTTNWPVIGQFSSIGSLGPDKNK 60
Query: 206 ----KWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKN 260
+W+ L+++ F G PL +V+PTV++VR +L +AG +IP K
Sbjct: 61 WLCGEWLQSLAATCGRTF-------GSNAPLKLVFPTVDNVRTTLWFISAGGSIPYSHKT 113
Query: 261 VDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQ 317
+K +L ++ W A+ GRSRA PHIKT+ R + +LAWF++TS+NLSKAAWG L+
Sbjct: 114 AEKQPYLPSFFCSWNATSRGRSRASPHIKTYMRTSPDHSRLAWFMVTSSNLSKAAWGVLE 173
Query: 318 KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH 377
K SQLMIRSYE+GVL LP+ + T+ I + + +
Sbjct: 174 KGGSQLMIRSYEIGVLFLPADQ--------------------VTDREAIDQCRDIL---- 209
Query: 378 GSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 425
+ + ++ VP++LPP YS ++ PW WD RY K D G +W
Sbjct: 210 -GGNRLSDEPCTHVHVPFDLPPSPYSDDEKPWMWDVRYLDKPDTNGNIW 257
>gi|50550131|ref|XP_502538.1| YALI0D07590p [Yarrowia lipolytica]
gi|49648406|emb|CAG80726.1| YALI0D07590p [Yarrowia lipolytica CLIB122]
Length = 471
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 126/394 (31%), Positives = 188/394 (47%), Gaps = 76/394 (19%)
Query: 53 FGTHHSKAMLLIY-----PRGVRIIVHTANLIHVDWNNKSQGLWM-QDFPLKDQNNLSEE 106
F THH+K M+L + R ++++HTAN+IH DW+N +QG+W Q K + N
Sbjct: 116 FATHHTKMMVLFFKEDKGERSAQVVIHTANMIHHDWDNMTQGVWKSQKVKEKRKTNTEGS 175
Query: 107 CG-FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 165
FE DL+ YLS + S + F ++F++SS R++ SVPG H
Sbjct: 176 TSTFETDLVAYLSEYQLDTTSKLIK----------FLQRFDWSSETARVVGSVPGTHKD- 224
Query: 166 SLKKWGHMKLRTVLQECTFE-----KGFKKSPLVYQFSSLGSL--DEKWMA-ELSSSMSS 217
KKWG ++ +L E + +G + +V Q SS+GSL +KW+ +L ++
Sbjct: 225 --KKWGLTRVADLLDEHKEDHKSDYEGSESDTIVLQSSSIGSLGVTDKWITPQLVGALDG 282
Query: 218 GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW 273
D+ G+ IVWPTVE+VR S +GY G +I S ++K+ W
Sbjct: 283 RSPRDRDGHGLPASQIVWPTVENVRRSFDGYDLGMSIHFKNESDTHRKQYAYMKERMNVW 342
Query: 274 KASHTGRSRAMPHIKTFARYN-GQKLAWFLLTSANLSKAAWGALQ-KNNSQLMIRSYELG 331
KA + R+RAMPHIKT+ R+ KL W LLTSAN+SK AWG++ S+ I S+ELG
Sbjct: 343 KADNKHRTRAMPHIKTYTRFTRAGKLRWVLLTSANISKYAWGSVSAAKESKFSIPSWELG 402
Query: 332 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 391
VL+ P A F ++
Sbjct: 403 VLLFPQAVGKAV-FDLKDSV---------------------------------------- 421
Query: 392 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+PY+ P YS++D PW+ + + +KD G W
Sbjct: 422 -IPYDWPLTNYSAKDEPWTKNADHLEKDTNGFPW 454
>gi|302422748|ref|XP_003009204.1| tyrosyl-DNA phosphodiesterase [Verticillium albo-atrum VaMs.102]
gi|261352350|gb|EEY14778.1| tyrosyl-DNA phosphodiesterase [Verticillium albo-atrum VaMs.102]
Length = 527
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 143/457 (31%), Positives = 209/457 (45%), Gaps = 80/457 (17%)
Query: 20 VLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 73
V V+HG DG ++ A N LH P+P FGTHH+K M+L + ++I+
Sbjct: 103 VHVVHGFWKREDGNRVALQEEAAAWKNVELHTAPMPEMFGTHHTKMMILFRHDDTAQVII 162
Query: 74 HTANLIHVDWNNKSQGLWMQDF-PLKDQNN-----------LSEECG----FENDLIDYL 117
HTAN+I DW N + G+W PL Q N +E+ G F++DL+ YL
Sbjct: 163 HTANMIAKDWTNMTNGVWRSPLLPLGPQPNGGKLEDGEVYEANEDFGSGRKFKSDLLRYL 222
Query: 118 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKL 175
+ + ++ +++F+ LIASVPG H +S WG L
Sbjct: 223 RAYDARKIT--------LRLLTEQLARYDFAGVRAVLIASVPGRHAIHDTSQTAWGWPAL 274
Query: 176 RTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL---SSSMSSGFSEDKTPLGIGE 230
+ L+ + G KS +V Q SS+ +L + W+ + S S+S G S P
Sbjct: 275 KRALRRVPVQTG--KSEIVVQISSIATLGGTDSWLQKCLFDSLSLSKGSSIGPRPAF--- 329
Query: 231 PLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS---------- 276
+V+PT +++R SL+GYA+G +I SPQ+ +LK + W
Sbjct: 330 -KVVFPTADEIRRSLDGYASGGSIHTKIASPQQAKQLAYLKSIFCHWANDAPGGKELSKD 388
Query: 277 ----HTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 332
GR RA PHIKT+ RY Q + W LLTSANLSK AWG ++ I S+E GV
Sbjct: 389 TLLRDAGRQRAAPHIKTYIRYGTQSIDWALLTSANLSKQAWGEAASAAQEVRIASWEAGV 448
Query: 333 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS-EVVYL 391
L+ PS + +G+ E + + K S A +S+ VV L
Sbjct: 449 LVWPS------------------LVTGTDEATMVGTFKTDAPGEEAPSSAPSSTGNVVGL 490
Query: 392 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 428
+PY LP Q Y +++PW K D G+V R
Sbjct: 491 RMPYSLPLQLYGKDEIPWVLRMSIPKPDWAGRVCLRE 527
>gi|158293221|ref|XP_558110.3| AGAP010577-PA [Anopheles gambiae str. PEST]
gi|157016854|gb|EAL40355.3| AGAP010577-PA [Anopheles gambiae str. PEST]
Length = 584
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 143/431 (33%), Positives = 204/431 (47%), Gaps = 70/431 (16%)
Query: 1 MVDIDWLLPACPVLAKIPHV--LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 58
MVDI WLL A A +V L+++G+ L + + KP N K + FG HH+
Sbjct: 199 MVDIGWLL-AHYFFAGYENVPLLILYGDETPELRMVSQKKP-NVTAVKVEIKTPFGVHHT 256
Query: 59 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSE-ECGFEND 112
K L Y G +R++V TANL DW+N++QGLW+ P E F +
Sbjct: 257 KMGLYGYRDGSMRVVVSTANLYEDDWHNRTQGLWISPRLPAVPEGSDTTYGESRSDFRSS 316
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-WG 171
L+ YL K P+ + + +K +FS V L+ASVPG HT ++ WG
Sbjct: 317 LLTYLDAYKLPQLQPWM----------ARIRKTDFSDVKVFLVASVPGGHTNTAKGPLWG 366
Query: 172 HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWMAELSSSMSSGFSEDKTPLGI 228
H +L +L + PLV Q SS+GSL E W+ L M+S F +D P+GI
Sbjct: 367 HPRLGYLLSQHAAPID-DSCPLVAQSSSIGSLGPSPESWV--LGEIMAS-FRKDSAPVGI 422
Query: 229 GEP---LIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAM 284
+++P+ +VR S +G G +P + +V +++LK Y +W + R++AM
Sbjct: 423 RRLPGFRMIYPSFSNVRQSHDGMMGGGCLPYVRSTHVKQEWLKDYLQQWCSRARHRNKAM 482
Query: 285 PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGVLILPSAKRH 341
PHIKT+ R++ + L WFLLTSANLSKAAWG K L I SYE GVL LP
Sbjct: 483 PHIKTYCRWSHRGLYWFLLTSANLSKAAWGVYNKTGRFEKPLRINSYEAGVLFLPK---- 538
Query: 342 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 401
N P E A+ + P+PY++P
Sbjct: 539 ---LLLDENFFPME----------------------------ANKKHPQFPMPYDVPTIP 567
Query: 402 YSSEDVPWSWD 412
Y+ ED P+ D
Sbjct: 568 YAPEDTPFFMD 578
>gi|299115351|emb|CBN74172.1| Tyrosyl-DNA phosphodiesterase [Ectocarpus siliculosus]
Length = 607
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 139/432 (32%), Positives = 206/432 (47%), Gaps = 103/432 (23%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK-RNKPA---NWILHKPPLPISFGTH 56
MVD L+ P L +P V ++HG GT + + R++ A L P LP +GT+
Sbjct: 118 MVDYALLVRCAPRLGSVP-VTIVHGFKPGTQDEVNLRSQCAVNPGVKLRYPELP-EYGTN 175
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDY 116
H+K ++L +P G+R+ V TAN I VD +KSQG+W QDFP + S C F+ DL+ +
Sbjct: 176 HAKMIILKFPTGIRVAVLTANFIVVDVTDKSQGVWYQDFPKR----TSGSCAFQEDLMGF 231
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY-----------HTGS 165
L F PA S +++F A V L+ SVPG H G
Sbjct: 232 L-------FKVGGPASAF----ASTLGEYDFRGARVALVPSVPGTGGNTPGTGGKPHKGR 280
Query: 166 SLKKWGHMKLRTVLQE-------CTFEKGFKKSPLVYQFSSLGSLDE---KWMAELSSSM 215
L K+GHM++R +L ++G K ++ Q SSL SL + +W++E+ +S
Sbjct: 281 DLHKYGHMRVRALLAREKEDGTGAKLKEGGHK--VLCQISSLASLTKTPNRWLSEILASF 338
Query: 216 -------------SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAI------ 254
SED+ + E +VWP+VE VR S +G+ AG +I
Sbjct: 339 MPLEDEGKKAEPTRRSVSEDEAQATLLEQHLRVVWPSVEAVRTSSQGWIAGGSICCNTVN 398
Query: 255 -----------PSPQKNVDKDFLKKYWAKWKAS-HTGRSRAMPHIKTFARY--------- 293
+ + N L+ KWK + R+R PHIK++ RY
Sbjct: 399 MYGGKYKWPNMDNYRSNTPLPELRPLLRKWKGNPAVNRTRDAPHIKSYLRYREVAGENGT 458
Query: 294 ----NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS------------ 337
+G ++AWFLLTS+NLS++AWG L K ++ L +RS+E+GV+ LPS
Sbjct: 459 ETRVDGDEVAWFLLTSSNLSRSAWGYLNKASTDLTLRSFEMGVMFLPSLLRSPSQDSDDG 518
Query: 338 -AKRHGCGFSCT 348
A GF+CT
Sbjct: 519 NAAAKASGFTCT 530
>gi|157129902|ref|XP_001661809.1| tyrosyl-dna phosphodiesterase [Aedes aegypti]
gi|108872048|gb|EAT36273.1| AAEL011629-PA [Aedes aegypti]
Length = 536
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 137/427 (32%), Positives = 204/427 (47%), Gaps = 60/427 (14%)
Query: 1 MVDIDWLLPACPVLA-KIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
MVDI WLL + +L+++G+ L+ + KP N K + FG HH+K
Sbjct: 151 MVDIGWLLGHYFFAGYEDRPLLILYGDETPELKMVSSKKP-NVTAVKVHIATPFGVHHTK 209
Query: 60 AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNL---SEECGFENDL 113
L Y G +R++V TANL DW+N++QGLW+ P+ + ++ + GF +L
Sbjct: 210 MGLYGYTDGSMRVVVSTANLYEDDWHNRTQGLWVSPRLPPMPEDSDTGAGDSKTGFRENL 269
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-WGH 172
I YL++ K G+ + + +K NFS V L+ASVPG H + WGH
Sbjct: 270 ITYLNSYKI----------GHLQPWVARIQKTNFSEVNVFLVASVPGGHLNTPKGPLWGH 319
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP- 231
++ +L + + PLV Q SS+GSL + + S + + F D P+G+
Sbjct: 320 PRMGYLLGKHSAPID-DSCPLVAQSSSIGSLGPNPQSWVLSEVLASFRRDSAPIGLRRVP 378
Query: 232 --LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHIK 288
+++P+ +VR S + G +P + DK LK Y +WK+ R++A+PHIK
Sbjct: 379 AFKMIFPSFSNVRNSHDHLLGGGCLPYMKATHDKQVWLKDYLHQWKSDSRNRTKAVPHIK 438
Query: 289 TFARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGVLILPSAKRHGCGF 345
T+ R++ + L WFLLTSANLSKAAWG K+ + L I SYE GVL LP F
Sbjct: 439 TYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSYEAGVLFLPK-------F 491
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
N P E K G P+PY++P Y+ E
Sbjct: 492 VIEENFFPMESKPGQQHPQ--------------------------FPMPYDVPIIPYALE 525
Query: 406 DVPWSWD 412
D P+ D
Sbjct: 526 DTPFFMD 532
>gi|380026209|ref|XP_003696847.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Apis
florea]
Length = 695
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 143/434 (32%), Positives = 208/434 (47%), Gaps = 89/434 (20%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--ANWILHKPPLPISFGTHHS 58
MVDI WL + + ++ ++ GE T P +N +P FG HH+
Sbjct: 318 MVDIGWLHVQYMLAEQNTNMSILLGERVDT-------GPVGSNVTTFYVDMPTKFGCHHT 370
Query: 59 KAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNLSE---ECGFEND 112
K M+L Y G+R++V TANL DW N++QG+W+ PL + N SE GF+ D
Sbjct: 371 KIMILKYKDDGIRVVVSTANLYMDDWENRTQGVWISPHLPPLSESANSSEGESPTGFKKD 430
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
L YL+ + P + A ++ +FSS V +ASVPG HT WGH
Sbjct: 431 LERYLNRYRQPGITEWTCA----------VRRADFSSVNVFFLASVPGRHTDMEYDSWGH 480
Query: 173 MKLRTVLQECTFEKGFKKSP------LVYQFSSLGSLD---EKWMA-ELSSSMSSGFSED 222
KL ++L K K P LV Q SS+GSL E W+ E++SSMS
Sbjct: 481 RKLGSILS-----KHAKLPPDAPQWILVAQSSSIGSLGPNYESWLQKEITSSMSK----- 530
Query: 223 KTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHT 278
+ P+G+ ++P++ + + S + +P S Q + + +++ Y +WKA T
Sbjct: 531 ENPVGLKSHPNFHFIYPSLNNYKQSFDCRVGSCCLPYSLQTHSKQKWIESYMYQWKAKQT 590
Query: 279 GRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
GR RAMPHIKT+ R + +++ WF+LTSANLSKAAWG + KN+ +M +YE GV+ +P
Sbjct: 591 GRDRAMPHIKTYTRISPDLKRIPWFVLTSANLSKAAWGTVGKNSHYIM--NYEGGVVFIP 648
Query: 337 SAKRHGCGFSCTSNIVP-SEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 395
S F S+ P E + G V PVPY
Sbjct: 649 S-------FITGSSTFPIKEEEPG----------------------------VPIFPVPY 673
Query: 396 ELPPQRYSSEDVPW 409
+LP RY D P+
Sbjct: 674 DLPLTRYEKNDSPF 687
>gi|346323354|gb|EGX92952.1| tyrosyl-DNA phosphodiesterase [Cordyceps militaris CM01]
Length = 515
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 135/420 (32%), Positives = 200/420 (47%), Gaps = 66/420 (15%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRG--VRIIVHTANLIHVDWNNKSQGLWM------ 92
N LH P+P FGTHHSK ML+++ R ++I+HTAN+I DW N + W+
Sbjct: 125 NVKLHVAPMPEMFGTHHSK-MLIVFRRDDTAQVIIHTANMIAKDWTNMTNAAWISPILPK 183
Query: 93 -----QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF 147
+D P + F+ DL+ YL++ + P + K ++F
Sbjct: 184 LNTAPKDSPRPENMTPGSGPRFQFDLLSYLTS-----YDRMRPTCTGLVQS---LKVYDF 235
Query: 148 SSAAVRLIASVPG---YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL- 203
SS L+ASVPG HT + WG + L++ + G KS + Q SS+ +L
Sbjct: 236 SSVKGSLVASVPGTHEVHTEAGATAWGWSAMGKCLEQIPCQAG--KSEVTVQVSSIATLG 293
Query: 204 -DEKWM-AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSP 257
++ W+ L ++S G S T + +V+PT +++R SL+GYA+G +I S
Sbjct: 294 GNDGWLRGTLFKALSKGKSA-TTAAAAPQFKVVFPTADEIRASLDGYASGGSIHTKIQSK 352
Query: 258 QKNVDKDFLKKYWAKWKASHT----------GRSRAMPHIKTFARYNGQK-LAWFLLTSA 306
Q+ + +L+ + W A GR RA PHIKT+ R N + + W L+TSA
Sbjct: 353 QQEMQLRYLRPIFHYWMADDASKAASSFRDAGRDRAAPHIKTYIRTNEKNTMDWALVTSA 412
Query: 307 NLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSGSTETSQ 365
NLSK AWG K Q I S+E+GVL+ PS K+ C + VP GS E
Sbjct: 413 NLSKQAWGEAAKPTGQFRIASWEIGVLVWPSLFKKDAIMKGCFKSDVP-----GSAEGHG 467
Query: 366 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
Q+ G + VV +PY LP ++YS E +PW + K+D GQ W
Sbjct: 468 GQR--------------GEAETVVGFRMPYSLPLRKYSREAMPWVATMSHEKEDCLGQSW 513
>gi|157109623|ref|XP_001650753.1| tyrosyl-dna phosphodiesterase [Aedes aegypti]
gi|108868427|gb|EAT32652.1| AAEL015141-PA [Aedes aegypti]
Length = 624
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 138/427 (32%), Positives = 205/427 (48%), Gaps = 60/427 (14%)
Query: 1 MVDIDWLLPACPVLA-KIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
MVDI WLL + +L+++G+ L+ + KP N K + FG HH+K
Sbjct: 239 MVDIGWLLGHYFFAGYEDRPLLILYGDETPELKMVSSKKP-NVTAVKVHIATPFGVHHTK 297
Query: 60 AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNL---SEECGFENDL 113
L Y G +R++V TANL DW+N++QGLW+ P+ + ++ + GF +L
Sbjct: 298 MGLYGYTDGSMRVVVSTANLYEDDWHNRTQGLWVSPRLPPMPEDSDTGAGDSKTGFRENL 357
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-WGH 172
I YL++ K G+ + + +K NFS V L+ASVPG H + WGH
Sbjct: 358 ITYLNSYKI----------GHLQPWVARIQKTNFSEVNVFLVASVPGGHLNTPKGPLWGH 407
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGE-P 231
++ +L + + PLV Q SS+GSL + + S + + F D P+G+ P
Sbjct: 408 PRMGYLLGKHSAPID-DSCPLVAQSSSIGSLGPNPQSWVLSEVLASFRRDSAPIGLRRVP 466
Query: 232 L--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHIK 288
+++P+ +VR S + G +P + DK LK Y +WK+ R++A+PHIK
Sbjct: 467 AFKMIFPSFSNVRNSHDHLLGGGCLPYMKATHDKQVWLKDYLHQWKSDSRNRTKAVPHIK 526
Query: 289 TFARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGVLILPSAKRHGCGF 345
T+ R++ + L WFLLTSANLSKAAWG K+ + L I SYE GVL LP F
Sbjct: 527 TYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSYEAGVLFLPK-------F 579
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
N P E K G P+PY++P Y+ E
Sbjct: 580 VIEENFFPMESKPGQQHPQ--------------------------FPMPYDVPIIPYALE 613
Query: 406 DVPWSWD 412
D P+ D
Sbjct: 614 DTPFFMD 620
>gi|194855370|ref|XP_001968528.1| GG24923 [Drosophila erecta]
gi|190660395|gb|EDV57587.1| GG24923 [Drosophila erecta]
Length = 580
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/357 (35%), Positives = 187/357 (52%), Gaps = 35/357 (9%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
MVDI WLL +L K +LV++G+ L + + KP + + +P F T H
Sbjct: 192 MVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKPQVTAI-RVRMPTPFATSH 248
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSEE-CGFEN 111
+K M L Y G +R+++ TANL DW+N++QGLW+ P E GF+
Sbjct: 249 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPEDADTGAGESLTGFKQ 308
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + +K +FS+ V + SVPG H SS++
Sbjct: 309 DLMLYLVEYKITQLQPWI----------ARIRKSDFSAINVFFLGSVPGGHRESSVRGHP 358
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG 229
WGH +L ++L + + P+V Q SS+GSL A + + +D TP+G
Sbjct: 359 WGHARLGSLLSKHAAPID-DRIPVVCQSSSIGSLGVSVQAWIQQDFVNSLKKDSTPVGKL 417
Query: 230 EPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
+ +++P+ +V S +G G +P + DK +LK Y +WK+S RSRAM
Sbjct: 418 RQMPPFKMIYPSFGNVSGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRYRSRAM 477
Query: 285 PHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
PHIK++ R+N Q + WF+LTSANLSKAAWG KN++ L I +YE+GVL LP
Sbjct: 478 PHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEVGVLFLP 534
>gi|194771042|ref|XP_001967588.1| GF20606 [Drosophila ananassae]
gi|190615089|gb|EDV30613.1| GF20606 [Drosophila ananassae]
Length = 576
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 130/358 (36%), Positives = 187/358 (52%), Gaps = 37/358 (10%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP-ANWILHKPPLPISFGTH 56
MVDI WLL +L K +LV++G+ L + + KP I K P P F T
Sbjct: 188 MVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKPQVTAIGVKMPTP--FATS 243
Query: 57 HSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DFPL----KDQNNLSEECGFE 110
H+K MLL Y G +R+++ TANL DW+N++QG+W+ P D + GF+
Sbjct: 244 HTKMMLLAYNDGSMRVVISTANLYEDDWHNRTQGVWISPKLPELHEDADTGAGESQTGFK 303
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK- 169
DL+ YL K + + + +K +FS+ V + SVPG H S+++
Sbjct: 304 QDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFLGSVPGGHRESTVRGH 353
Query: 170 -WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGI 228
WGH +L +L + + P+V Q SS+GSL A + + +D TPLG
Sbjct: 354 PWGHARLGALLAKHATPIN-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTPLGK 412
Query: 229 GEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRA 283
+ +++P+ +V S +G G +P + DK +LK + +WK++ RSRA
Sbjct: 413 LRQMPTFKMIYPSFGNVSGSHDGMLGGGCLPYGKNTNDKQPWLKDHLHQWKSNDRYRSRA 472
Query: 284 MPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKN-NSQ--LMIRSYELGVLILP 336
MPHIKT+ RYN Q + WF+LTSANLSKAAWG KN N Q L I +YE GVL LP
Sbjct: 473 MPHIKTYTRYNLEDQSVYWFVLTSANLSKAAWGCFNKNSNVQPCLRIANYEAGVLFLP 530
>gi|48094884|ref|XP_392205.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Apis
mellifera]
Length = 692
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 141/434 (32%), Positives = 208/434 (47%), Gaps = 89/434 (20%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--ANWILHKPPLPISFGTHHS 58
MVDI WL + + ++ ++ GE T P +N +P FG HH+
Sbjct: 315 MVDIGWLHVQYMLAEQNTNMSILLGERVDT-------GPVGSNVTTFYVDMPTKFGCHHT 367
Query: 59 KAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNLSE---ECGFEND 112
K M+L Y G+R++V TANL DW N++QG+W+ PL + N SE GF+ D
Sbjct: 368 KIMILKYKDDGIRVVVSTANLYMDDWENRTQGVWISPHLPPLSESANSSEGESPTGFKKD 427
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
L YL+ + P + A ++ +FSS V +ASVPG HT WGH
Sbjct: 428 LERYLNRYRQPGITEWTCA----------VRRADFSSVNVFFLASVPGRHTDMEYDSWGH 477
Query: 173 MKLRTVLQECTFEKGFKKSP------LVYQFSSLGSLD---EKWMA-ELSSSMSSGFSED 222
KL ++L K K P LV Q SS+GSL E W+ E++SSMS
Sbjct: 478 RKLGSILS-----KHAKLPPDAPQWTLVAQSSSIGSLGPNYESWLQKEITSSMSK----- 527
Query: 223 KTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHT 278
+ P+G+ ++P++ + + S + +P S Q + + +++ Y +WKA T
Sbjct: 528 ENPVGLKSHPNFHFIYPSLNNYKRSFDCRVGSCCLPYSLQTHSKQKWIESYMYQWKAKQT 587
Query: 279 GRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
GR +AMPHIKT+ R + +++ WF+LTSANLSKAAWG + KN+ +M +YE GV+ +P
Sbjct: 588 GRDKAMPHIKTYTRISPDLKRIPWFVLTSANLSKAAWGTVGKNSHYIM--NYEGGVVFIP 645
Query: 337 SAKRHGCGFSCTSNIVP-SEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 395
S F S+ P E + G V P+PY
Sbjct: 646 S-------FITGSSTFPIKEEEPG----------------------------VPVFPIPY 670
Query: 396 ELPPQRYSSEDVPW 409
+LP RY D P+
Sbjct: 671 DLPLTRYEKNDSPF 684
>gi|125984342|ref|XP_001355935.1| GA28884 [Drosophila pseudoobscura pseudoobscura]
gi|54644254|gb|EAL32995.1| GA28884 [Drosophila pseudoobscura pseudoobscura]
Length = 576
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 130/360 (36%), Positives = 189/360 (52%), Gaps = 41/360 (11%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP-ANWILHKPPLPISFGTH 56
MVDI WLL +L K +LV++G+ L + + KP I K P P F T
Sbjct: 188 MVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQVTAIGVKMPTP--FATS 243
Query: 57 HSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL-----KDQNNLSEE--CG 108
H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL +D + + E G
Sbjct: 244 HTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLLPALSEDADTAAGESLTG 301
Query: 109 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 168
F DL+ YL K + + + +K +FS+ V + SVPG H S++
Sbjct: 302 FRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFVGSVPGGHREGSVR 351
Query: 169 K--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPL 226
WGH +L ++L + + P+V Q SS+GSL A + + +D +P
Sbjct: 352 GHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLRKDSSPG 410
Query: 227 GIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRS 281
G + +++P+ +V S +G G +P + DK +LK + +WK+S RS
Sbjct: 411 GKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQPWLKAHLQQWKSSDRHRS 470
Query: 282 RAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQ---LMIRSYELGVLILP 336
RAMPHIKT+ RYN Q + WF+LTSANLSKAAWG+ KN + L I +YE GVL LP
Sbjct: 471 RAMPHIKTYTRYNLTDQSVYWFVLTSANLSKAAWGSFNKNTNLQPCLRIANYEAGVLFLP 530
>gi|195118058|ref|XP_002003557.1| GI21930 [Drosophila mojavensis]
gi|193914132|gb|EDW12999.1| GI21930 [Drosophila mojavensis]
Length = 596
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 143/435 (32%), Positives = 213/435 (48%), Gaps = 73/435 (16%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
M+DI WLL +L+K +LV++G D L + + KP + K + F T H
Sbjct: 208 MIDIGWLLGHYYFAGILSK--PLLVLYGADDPNLVDIGKFKPQVTAI-KVQMQSPFATSH 264
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PL-KDQNNLSEE--CGFEN 111
+K MLL Y G +R+++ TANL DW+N++QGLWM PL +D + + E GF+
Sbjct: 265 TKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWMSPRLPPLPEDADTAAGESPTGFKQ 324
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + +K +FS+ V I SVPG H S+++
Sbjct: 325 DLMLYLVEYKLSQLQPWI----------ARIRKSDFSAINVFFIGSVPGGHRESAVRGHP 374
Query: 170 WGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLG 227
WG +L ++L + E P+V Q SS+GSL A + + S F +D +P+G
Sbjct: 375 WGCARLGSLLAKHAAPVEPNI---PVVCQSSSIGSLGANVQAWIEQDILSNFRKDSSPIG 431
Query: 228 IGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSR 282
L +++P+ +V S +G G +P + DK +LK Y +WK+ RS+
Sbjct: 432 RLSQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYGKSTNDKQPWLKNYLHQWKSGDRHRSQ 491
Query: 283 AMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGAL-QKNNSQ--LMIRSYELGVLILPS 337
AMPHIK++ R+N Q + WF+LTSANLSKAAWGA +K+N Q L I +YE GVL LP
Sbjct: 492 AMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGAFNKKSNLQPCLRIFNYEAGVLFLPK 551
Query: 338 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYEL 397
F + P A + V P+PY++
Sbjct: 552 -------FVTGEDTFPL---------------------------GNARNGVPAFPLPYDV 577
Query: 398 PPQRYSSEDVPWSWD 412
P Y +D P+ D
Sbjct: 578 PLTPYGPDDTPFLMD 592
>gi|429856258|gb|ELA31180.1| tyrosyl-dna phosphodiesterase [Colletotrichum gloeosporioides Nara
gc5]
Length = 517
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 128/421 (30%), Positives = 202/421 (47%), Gaps = 73/421 (17%)
Query: 40 ANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL- 97
+N LH +P FGTHHSK M+L+ + ++++HTAN+I DW N + +WM PL
Sbjct: 132 SNVELHGAYMPEMFGTHHSKMMILVRHDDSAQVVIHTANMIAKDWTNMTNAVWMS--PLL 189
Query: 98 -----KDQNNLSEECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 148
KD + + G F++DL+ YL ++ P + +++FS
Sbjct: 190 RLLKEKDSTSCEDAIGTGQRFKHDLLSYLKA-----YNVRRPTLRDLV---DKLSQYDFS 241
Query: 149 SAAVRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--D 204
S LIASVPG H+ +S WG L+ VL+ + G KS +V Q SS+ +L
Sbjct: 242 SVKAALIASVPGRHSIHDTSQTSWGWPALKHVLRHVPVQDG--KSEIVVQISSIATLGAT 299
Query: 205 EKWMAE-LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI----PSPQ 258
+ W+ + L + +S S DK P +V+PT +++R SL+GYA+G +I S Q
Sbjct: 300 DNWIQKCLFNPLSE--SSDKGPKKTKPTFKVVFPTADEIRRSLDGYASGGSIHTKIQSQQ 357
Query: 259 KNVDKDFLKKYWAKW--------------KASHTGRSRAMPHIKTFARYNGQKLAWFLLT 304
+ +L ++ W GR RA PHIKT+ RY + + W L+T
Sbjct: 358 QAKQLAYLHPFFCHWGNDAPNGKALPETATVREAGRKRAAPHIKTYIRYGEKSIDWALVT 417
Query: 305 SANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 364
SAN+SK AWG + + ++ I S+E+GVL+ P T +++ S +TE
Sbjct: 418 SANISKQAWGEVAGASQEVRIASWEIGVLVWPEMMAEKATMMST---FQTDLPSNNTE-- 472
Query: 365 QIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 424
S+ VV + +PY LP Q Y+ +++PW + + D G+
Sbjct: 473 -------------------GSNPVVGVRIPYNLPLQHYAKDEIPWVATMAHAEPDNMGRF 513
Query: 425 W 425
W
Sbjct: 514 W 514
>gi|346970364|gb|EGY13816.1| tyrosyl-DNA phosphodiesterase [Verticillium dahliae VdLs.17]
Length = 527
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 140/457 (30%), Positives = 205/457 (44%), Gaps = 80/457 (17%)
Query: 20 VLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 73
V V+HG DG ++ A N LH P+P FGTHH+K M+L + ++I+
Sbjct: 103 VHVVHGFWKREDGNRMALQEEAAAWKNLELHNAPMPEMFGTHHTKMMILFRFDDTAQVII 162
Query: 74 HTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG---------------FENDLIDYL 117
HTAN+I DW N + G+W PL Q + + F++DL+ YL
Sbjct: 163 HTANMIAKDWTNMTNGVWRSPLLPLGPQPDSGKPEAEEESEADEDFGSGRKFKSDLLSYL 222
Query: 118 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKL 175
+ + + K++F+ IASVPG H +S WG L
Sbjct: 223 RAYDARKIT--------LRPLTEQLVKYDFAGIRAVFIASVPGRHAIHDTSQTAWGWPAL 274
Query: 176 RTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL---SSSMSSGFSEDKTPLGIGE 230
+ L+ + G KS +V Q SS+ +L + W+ + S S+S G S P
Sbjct: 275 KRALRRVPVQAG--KSEVVVQISSIATLGGTDSWLQKCLFDSLSLSKGSSISPRPAF--- 329
Query: 231 PLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS---------- 276
+V+PT +++R SL+GYA+G +I SPQ+ +LK + W
Sbjct: 330 -RVVFPTADEIRRSLDGYASGGSIHTKIASPQQAKQLAYLKPIFCHWANDAPGGKEISKD 388
Query: 277 ----HTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 332
GR RA PHIKT+ RY Q + W LLTSANLSK AWG ++ I S+E GV
Sbjct: 389 TALQDAGRQRAAPHIKTYIRYGTQSIDWALLTSANLSKQAWGEAASAAQEVRIASWEAGV 448
Query: 333 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS-EVVYL 391
L+ PS + +G+ E + K S A +S+ VV L
Sbjct: 449 LVWPS------------------LVAGTDEAIMVGTFKTDAPGEEAPSGAPSSTGNVVGL 490
Query: 392 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 428
+PY LP Q Y +++PW +T+ D G+V R
Sbjct: 491 RMPYSLPLQLYGKDEIPWVASNEHTEPDWAGRVCLRQ 527
>gi|195470993|ref|XP_002087790.1| GE18215 [Drosophila yakuba]
gi|194173891|gb|EDW87502.1| GE18215 [Drosophila yakuba]
Length = 582
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 124/357 (34%), Positives = 186/357 (52%), Gaps = 35/357 (9%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
MVDI WLL +L K +LV++G+ L + + KP + + +P F T H
Sbjct: 192 MVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKPQVTAI-RVRMPTPFATSH 248
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 111
+K M L Y G +R+++ TANL DW+N++QGLW+ P E GF+
Sbjct: 249 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPEDADTGAGESLTGFKQ 308
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + +K +FS+ V + SVPG H SS++
Sbjct: 309 DLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFLGSVPGGHRESSVRGHP 358
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG 229
WGH +L ++L + + P++ Q SS+GSL A + + +D TP G
Sbjct: 359 WGHARLGSLLSKHATPID-DRIPVICQSSSIGSLGANVQAWIQQDFVNSLKKDSTPAGKL 417
Query: 230 EPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
+ +++P+ +V S +G G +P + DK +LK Y +WK+S RSRAM
Sbjct: 418 RQMPPFKMIYPSFGNVSGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRYRSRAM 477
Query: 285 PHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
PHIK++ R+N Q + WF+LTSANLSKAAWG KN++ L I +YE+GVL LP
Sbjct: 478 PHIKSYTRFNLEEQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEVGVLFLP 534
>gi|350415522|ref|XP_003490669.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Bombus
impatiens]
Length = 697
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 123/373 (32%), Positives = 189/373 (50%), Gaps = 58/373 (15%)
Query: 49 LPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL----KDQNN 102
+P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL + ++
Sbjct: 364 MPTKFGCHHTKVMILKYKDDGIRVVVSTANLYSDDWENRTQGVWISPHLPLLAESANPSD 423
Query: 103 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH 162
GF+ DL YL + P + + A K+ NFSS V +ASVPG H
Sbjct: 424 GESPTGFKRDLERYLHKYEQPALTEWISA----------VKRANFSSVNVFFVASVPGRH 473
Query: 163 TGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 221
TG WG+ KL VL + + LV Q SS+GSL + + + + S S+
Sbjct: 474 TGVEYDYWGYRKLGHVLSKHAKLPPDAPQWTLVVQSSSIGSLGPNYESWIQKEIISSMSK 533
Query: 222 DKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHT 278
+ P P ++P++ + + S + +P S Q + +++++ Y +WKA+ T
Sbjct: 534 ENPPGLKSCPNFRFIYPSLNNYKQSFDCQVGSCCLPYSIQTHSKQEWVESYMYQWKATRT 593
Query: 279 GRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
R +A+PHIKT+ R N +K+ WF+LTSANLSKAAWG ++K++ ++ +YE GV+ +P
Sbjct: 594 ARDKAIPHIKTYTRISPNLEKIPWFVLTSANLSKAAWGMVRKDSHHIL--NYEAGVIFIP 651
Query: 337 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 396
+GST T I+K +AG V P+PY+
Sbjct: 652 ------------------HFVTGST-TFPIKK-----------EEAG----VPVFPIPYD 677
Query: 397 LPPQRYSSEDVPW 409
LP RY S D P+
Sbjct: 678 LPLTRYGSGDKPF 690
>gi|195435334|ref|XP_002065649.1| GK15563 [Drosophila willistoni]
gi|194161734|gb|EDW76635.1| GK15563 [Drosophila willistoni]
Length = 572
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 191/359 (53%), Gaps = 39/359 (10%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
MVDI WLL +LAK ++V++G+ L ++ + KP + K +P F T H
Sbjct: 184 MVDIGWLLGHYYFAGILAK--PLIVLYGDESPELLNISKLKPQVTAI-KVQMPTPFATSH 240
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DFPL----KDQNNLSEECGFEN 111
+K MLL Y G +R+++ TANL DW+N++QG+W+ P D + GF+
Sbjct: 241 TKMMLLAYTDGSMRVVISTANLYEDDWHNRTQGVWISPRLPALSEEADTAAGESKTGFKQ 300
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK--K 169
DL+ YL K + + + +K +FS+ V LIASVPG H S++
Sbjct: 301 DLMLYLVEYKLTQLQPWI----------ARIRKSDFSAINVFLIASVPGGHREGSVRGHP 350
Query: 170 WGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLG 227
WGH +L ++L + E + P+V Q SS+GSL A + + +D + +G
Sbjct: 351 WGHARLGSLLAKHAAPIED---RIPVVCQSSSIGSLGPNVQAWIQQDFVNSLRKDSSTVG 407
Query: 228 IGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSR 282
L +++P+ +V S +G G +P + DK +LK++ +WK+ R++
Sbjct: 408 RLRQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYGKNTNDKQPWLKEHLQQWKSGDRYRNQ 467
Query: 283 AMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
AMPHIK + RYN Q + WF+LTSANLSKAAWG+ KN++ L I +YE GVL LP
Sbjct: 468 AMPHIKCYTRYNLENQSVYWFVLTSANLSKAAWGSFNKNSNIQPCLRIANYEAGVLFLP 526
>gi|308462649|ref|XP_003093606.1| hypothetical protein CRE_02619 [Caenorhabditis remanei]
gi|308249623|gb|EFO93575.1| hypothetical protein CRE_02619 [Caenorhabditis remanei]
Length = 462
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 133/441 (30%), Positives = 204/441 (46%), Gaps = 82/441 (18%)
Query: 1 MVDIDWLLPACP--VLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 58
M+D ++L+ + P + P LV+ L P N +H LPI FGTHHS
Sbjct: 86 MIDFEFLVNSYPPSLRTTTPITLVVGAPDVSDLRKSTLQYP-NVTVHSASLPIPFGTHHS 144
Query: 59 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 117
K +L G + +IV TANLI DW K+Q + ++ ++ E F+ DLI+YL
Sbjct: 145 KLSILESDDGFIHVIVSTANLISDDWEFKTQQFYYA-MGMRREDEF-ERSPFQEDLIEYL 202
Query: 118 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS-LKKWGHMKLR 176
S P + +FS+ RLI S PGYHT + + GH +L
Sbjct: 203 SYYSNP-----------LSTWKKLIESTDFSTVTDRLIFSTPGYHTDPQHVSRLGHPRLS 251
Query: 177 TVL-QECTFEKGFK---KSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP- 231
T+L Q+ F+ ++ + + Q SS+GSL + E P +P
Sbjct: 252 TILSQKFPFDPKYEHTDRCTFIAQCSSIGSLGSAPSSWFRGQFLKSL-EAANPAPKNKPP 310
Query: 232 --LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
+V+P VEDVR S +GYA G ++P D+ +L+ + KW+++ R++A+PH K
Sbjct: 311 KMYLVFPCVEDVRNSCQGYAGGGSVPYRNSVHDRQKWLQDFMCKWRSNTKRRTKAVPHCK 370
Query: 289 TFARYNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLILPSAKRHGCG 344
T+ +Y+ + W LLTSAN+SKAAWG + +KN QLMIRS+E+GVLI
Sbjct: 371 TYVKYDQKIAQWQLLTSANVSKAAWGEMSFSKKKNVDQLMIRSWEIGVLI---------- 420
Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
T+ S+ +P++ P YS
Sbjct: 421 ----------------TDPSRFN-------------------------IPFDYPCVPYSP 439
Query: 405 EDVPWSWDKRYTKKDVYGQVW 425
D P++ D+++ + D+ G VW
Sbjct: 440 TDRPFTTDQKHEQPDILGCVW 460
>gi|24581359|ref|NP_523465.2| glaikit [Drosophila melanogaster]
gi|37999816|sp|Q9VQM4.1|TYDP1_DROME RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase; AltName: Full=Protein glaikit
gi|7295840|gb|AAF51141.1| glaikit [Drosophila melanogaster]
gi|15292079|gb|AAK93308.1| LD37277p [Drosophila melanogaster]
gi|220946228|gb|ACL85657.1| gkt-PA [synthetic construct]
Length = 580
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 126/357 (35%), Positives = 182/357 (50%), Gaps = 35/357 (9%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
MVDI WLL +L K P +L+ ES L K + I K P P F T H
Sbjct: 192 MVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQVTAIRVKMPTP--FATSH 248
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 111
+K M L Y G +R+++ TANL DW+N++QGLW+ P+ E GF+
Sbjct: 249 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQ 308
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + + +FS+ V + SVPG H S++
Sbjct: 309 DLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHP 358
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG 229
WGH +L ++L + + P+V Q SS+GSL A + + +D TP+G
Sbjct: 359 WGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTPVGKL 417
Query: 230 EPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
+ +++P+ +V S +G G +P + DK +LK Y +WK+S RSRAM
Sbjct: 418 RQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRFRSRAM 477
Query: 285 PHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
PHIK++ R+N Q + WF+LTSANLSKAAWG KN++ L I +YE GVL LP
Sbjct: 478 PHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 534
>gi|367053563|ref|XP_003657160.1| hypothetical protein THITE_2122630 [Thielavia terrestris NRRL 8126]
gi|347004425|gb|AEO70824.1| hypothetical protein THITE_2122630 [Thielavia terrestris NRRL 8126]
Length = 548
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 140/474 (29%), Positives = 213/474 (44%), Gaps = 78/474 (16%)
Query: 3 DIDWLLPAC-PVLAKIPHVLVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTH 56
DID+L+ A P + + V V+HG E LE ++ N LH +P FGTH
Sbjct: 104 DIDFLMAAFDPDVRGLVQVHVVHGFWKREDPSRLELQAAASRYENVTLHNAYMPEMFGTH 163
Query: 57 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEE---- 106
HSK M+L+ + +I++HTAN+I DW N +Q +W+ P + N +E
Sbjct: 164 HSKMMILLRHDDTAQIVIHTANMIVRDWTNMTQAVWLSPRLPLIKPAQQAVNQAEARTGS 223
Query: 107 -CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--T 163
F+ D ++YL + + + K +++FS LIASVPG H +
Sbjct: 224 GAKFKMDFLNYLRSYDTRKSTC--------KPIIEQLLRYDFSEIRASLIASVPGRHKFS 275
Query: 164 GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-LSSSMSSGFS 220
+S +WG + L+ + KS + Q SS+ +L + W+ + ++S G
Sbjct: 276 ENSPTRWGWAAMEEALKAVPVSQA--KSEIAIQISSIATLGPTDSWLKDTFFRALSRGRR 333
Query: 221 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK-- 274
P + +V+PT +++R SL+GYA+G +I SPQ+ +L+ W
Sbjct: 334 GTGPPSAPPDFKVVFPTPDEIRKSLDGYASGGSIHTKIQSPQQVKQLQYLRPMLCHWAND 393
Query: 275 ------------ASHTGRSRAMPHIKTFARYNGQ-------KLAWFLLTSANLSKAAWG- 314
GR RA PH+KT+ RY G + W LLTSANLSK AWG
Sbjct: 394 SPHGVELEAGAAVQEAGRKRAAPHVKTYIRYRGDGPPHGPITIDWALLTSANLSKQAWGE 453
Query: 315 ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL 374
A ++ I SYE+GVL+ P + + G + + + + G + V L
Sbjct: 454 AANAKTGEIRISSYEIGVLVWP--ELYAPGATMQATFLTDTLAEGERRDAAAAAATAVPL 511
Query: 375 TWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 428
VPY LP Q Y +VPW Y+++D GQVW RH
Sbjct: 512 R-----------------VPYNLPLQPYGKGEVPWVATASYSERDWMGQVW-RH 547
>gi|340710910|ref|XP_003394026.1| PREDICTED: LOW QUALITY PROTEIN: probable tyrosyl-DNA
phosphodiesterase-like [Bombus terrestris]
Length = 697
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 131/422 (31%), Positives = 206/422 (48%), Gaps = 65/422 (15%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPP-LPISFGTHHSK 59
MVD+ WL + + + +++G + + K + I P +P FG HH+K
Sbjct: 321 MVDVGWLCLQYLLAGQRTDMSIMYGS------RVDKEKLSLNITMIPVWIPTKFGCHHTK 374
Query: 60 AMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL----KDQNNLSEECGFENDL 113
M+L Y G+R++V TANL DW N++QG+W+ PL + ++ GF+ DL
Sbjct: 375 VMILKYKDDGIRVVVSTANLYSCDWENRTQGVWISPHLPLLAESANPSDGESPTGFKRDL 434
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
YL + + A ++ NFSS V +ASVPG HTG WG+
Sbjct: 435 ERYLHKYHQRGLTEWISA----------VRRANFSSVNVFFLASVPGKHTGVEYDYWGYR 484
Query: 174 KLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP- 231
KL VL + + LV Q SS+GS + + + + S S++ P +P
Sbjct: 485 KLGQVLSKHAKLPPDAPQWTLVAQSSSIGSFGPNYESWIQKEIVSSMSKENPPGLKSQPN 544
Query: 232 -LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
++P++ + + S + +P S + + +++L+ Y +WKA+ T R +A+PHIKT
Sbjct: 545 FQFIYPSINNYKQSFDCQVGSCCLPYSIETHSKQEWLESYMYQWKATRTARDKAIPHIKT 604
Query: 290 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ R N +K+ WF+LTSANLSKAAWG ++ ++ L I +YE GV+ +P
Sbjct: 605 YTRISPNLEKIPWFVLTSANLSKAAWGIVRVDS--LHILNYEAGVIFIP----------- 651
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
+GST T I+K +AG V P+PY+LP RY SED
Sbjct: 652 -------HFVTGST-TFPIKK-----------EEAG----VPVFPIPYDLPLTRYGSEDK 688
Query: 408 PW 409
P+
Sbjct: 689 PF 690
>gi|408398119|gb|EKJ77253.1| hypothetical protein FPSE_02528 [Fusarium pseudograminearum CS3096]
Length = 513
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 131/414 (31%), Positives = 192/414 (46%), Gaps = 61/414 (14%)
Query: 44 LHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 102
+H P+P FGTHHSK M+L + ++I+HTAN+I DW N + G+W + N
Sbjct: 128 IHIAPMPEMFGTHHSKMMVLFRHDDTAQVIIHTANMIPKDWTNMTNGVWKSPLLPRMSNT 187
Query: 103 LSEECGFENDL--------IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 154
E L ID L+ LK+ + + + K+ ++++FS+ L
Sbjct: 188 QILTSSPEEFLVGSGERFKIDLLNYLKFYDKRKIVCKPLSDKL-----QQYDFSTVKAAL 242
Query: 155 IASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAE 210
IASVPG H + + WG L+ L+ + S +V Q SS+ +L K W
Sbjct: 243 IASVPGRHDVHDMSETSWGWAALKRCLRHVPCHQ-HGDSDIVVQVSSIATLGPKDDW--- 298
Query: 211 LSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAG----NAIPSPQKNVDKD 264
L ++ S K G+G P +V+PT +++R SL+GYA+G I SPQ+ +
Sbjct: 299 LQKTLFDHLSRCKD-TGLGRPRFKVVFPTADEIRRSLDGYASGLSIHTKIQSPQQAKQLE 357
Query: 265 FLKKYWAKWKAS-------------HTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKA 311
+L+ + W +GR RA PHIKT+ R N + W LLTSAN+SK
Sbjct: 358 YLRPMFHHWANDSPGGTKLPDGPVLESGRKRAAPHIKTYVRSNKSSIDWGLLTSANISKQ 417
Query: 312 AWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKL 371
AWG + ++ I S+E+GVLI P G T E+ E +
Sbjct: 418 AWGEAAQLTGEMRIASWEVGVLIWPELLEPGSVMVGTYKTDVPEVSRSPKEDEE------ 471
Query: 372 VTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
S VV L +PY P QRY+SE+VPW +T+ D GQ W
Sbjct: 472 -------------SLPVVGLRIPYNTPLQRYTSEEVPWVVSMSHTEPDWAGQSW 512
>gi|341892674|gb|EGT48609.1| hypothetical protein CAEBREN_24547 [Caenorhabditis brenneri]
Length = 451
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 125/355 (35%), Positives = 179/355 (50%), Gaps = 45/355 (12%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHS 58
M++ D+L+ P + + ++ GE D ++ ++R+ A N + LPI +GTHHS
Sbjct: 73 MIEPDYLMNCYPQSIRSNPITLVVGEPD--VKDLRRSMHAYKNVTVIGASLPIPYGTHHS 130
Query: 59 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 117
K +L G + +IV +AN+I DW K+Q W + +K + ++ F+NDLI+YL
Sbjct: 131 KLSILEGEDGTIHVIVSSANMISEDWEFKTQQFWY-GYGVKKETQVTGS-EFQNDLIEYL 188
Query: 118 -----STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
S W E K +FS RLI SVPGYH GH
Sbjct: 189 GYYPSSMNSWTEL----------------IKCTDFSEVKDRLIFSVPGYHKAKK-NSLGH 231
Query: 173 MKLRTVL-QECTFEKGF---KKSPLVYQFSSLGSLD---EKWMAE--LSSSMSSGFSEDK 223
M LR++L F+ F ++ Q SS+GSL W L S +
Sbjct: 232 MALRSILIDRFPFDPNFVHTDRTTFFCQCSSIGSLGPTPANWFRGQFLKSLEGAATPPQN 291
Query: 224 TPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSR 282
P + +++P VEDVR S EGYA G ++P + L+ + +WKA R+R
Sbjct: 292 KPARL---FVLFPRVEDVRMSAEGYAGGKSVPYRNSVHQRQLWLQHAFCRWKADKKKRTR 348
Query: 283 AMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLI 334
A+PH KT+ + + W LLTSANLSKAAWG LQK N+ QLMIRSYE+GVL+
Sbjct: 349 AIPHCKTYMKIDKDGQKWQLLTSANLSKAAWGELQKVNTANEQLMIRSYEMGVLV 403
>gi|307211789|gb|EFN87770.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 645
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 184/348 (52%), Gaps = 30/348 (8%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+ WL + + +++++G+ + + N + +P +FG HH+K
Sbjct: 267 MVDVGWLCLQYLLAGQRTDMMILYGDRVD-----QESLGCNITMIHVDMPSAFGCHHTKI 321
Query: 61 MLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL----KDQNNLSEECGFENDLI 114
M+L Y G+RI+V TANL DW N++QGLW+ PL + N+ F+ D
Sbjct: 322 MILQYKDDGIRIVVSTANLYSDDWENRTQGLWISPHLPLLPESANSNDGESPTNFKKDFE 381
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YLS + P + + +K +FS+ V +ASVPG H + WGH K
Sbjct: 382 RYLSKYRHPALTQWI----------WIVRKADFSAVNVYFVASVPGTHKNVDVDFWGHRK 431
Query: 175 LRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP-- 231
L +L Q T + ++ Q SS+GSL + + LS + S S + T P
Sbjct: 432 LAQILSQHATLPPDAPQWSIIAQSSSIGSLGPNYESWLSREIVSSMSRETTQGLKSHPKF 491
Query: 232 LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTF 290
V+P++E+ + S + + +P S + + + +++ Y +WKA+ TGR+RA+PHIK++
Sbjct: 492 QFVYPSIENYKRSFDFQTLSSCLPYSLKVHSKQQWIESYLYQWKATRTGRNRAIPHIKSY 551
Query: 291 ARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
R + + + WF+LTSANLSKAAWGA Q++N +M +YE GV+ LP
Sbjct: 552 TRISPDLKSIPWFVLTSANLSKAAWGA-QRSNYYIM--NYEAGVVFLP 596
>gi|195034799|ref|XP_001988977.1| GH11458 [Drosophila grimshawi]
gi|193904977|gb|EDW03844.1| GH11458 [Drosophila grimshawi]
Length = 590
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 141/433 (32%), Positives = 211/433 (48%), Gaps = 69/433 (15%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
M+DI WLL +L K +LV++G+ L + + KP + + +P F T H
Sbjct: 202 MIDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQVTAV-RVKMPTPFATSH 258
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DFPL--KDQNNLSEE--CGFEN 111
+K MLL Y G +R+++ TANL DW+N++QGLW+ P +D + + E GF+
Sbjct: 259 TKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWISPRLPALAEDADTAAGESATGFKQ 318
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + +K +FS+ V LI SVPG H +++
Sbjct: 319 DLMLYLVEYKLSQLQPWI----------ARIRKSDFSAVNVFLIGSVPGGHREGAVRGHP 368
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG 229
WG +L ++L + + P+V Q SS+GSL A + S +D TPLG
Sbjct: 369 WGCARLGSLLAKHATPVE-DRIPVVCQSSSIGSLGANVQAWIQQDFVSNLRKDSTPLGRL 427
Query: 230 EPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
L +++P+ +V S +G G +P + DK +LK + +WK+ RS+AM
Sbjct: 428 RQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYGRNTNDKQPWLKAHLQQWKSGDRHRSQAM 487
Query: 285 PHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKN-NSQ--LMIRSYELGVLILPSAK 339
PHIK++ R+N Q + WF+LTSANLSKAAWG+ KN N Q L I +YE GVL LP
Sbjct: 488 PHIKSYTRFNLEEQCIYWFVLTSANLSKAAWGSFNKNPNIQPCLRIANYEAGVLFLPR-- 545
Query: 340 RHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 399
F P G+S G V P+PY++P
Sbjct: 546 -----FVTGEETFPL-----------------------GNSRNG----VPAFPLPYDVPL 573
Query: 400 QRYSSEDVPWSWD 412
Y ++D P+ D
Sbjct: 574 TPYGADDKPFLMD 586
>gi|195576262|ref|XP_002077995.1| GD23212 [Drosophila simulans]
gi|194190004|gb|EDX03580.1| GD23212 [Drosophila simulans]
Length = 580
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/357 (35%), Positives = 182/357 (50%), Gaps = 35/357 (9%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
MVDI WLL +L K P +L+ ES L K + I K P P F T H
Sbjct: 192 MVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQVTAIRVKMPTP--FATSH 248
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 111
+K M L Y G +R+++ TANL DW+N++QGLW+ P+ E GF+
Sbjct: 249 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQ 308
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + + +FS+ V + SVPG H S++
Sbjct: 309 DLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHP 358
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG 229
WGH +L ++L + + P+V Q SS+GSL A + + +D TP+G
Sbjct: 359 WGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTPVGKL 417
Query: 230 EPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
+ +++P+ +V S +G G +P + DK +LK Y +WK+S RSRAM
Sbjct: 418 RQMPPFKMIYPSYGNVSGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRYRSRAM 477
Query: 285 PHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
PHIK++ R+N Q + WF+LTSANLSKAAWG K+++ L I +YE GVL LP
Sbjct: 478 PHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKSSNIQPCLRIANYEAGVLFLP 534
>gi|345487640|ref|XP_001604652.2| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Nasonia
vitripennis]
Length = 690
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 128/426 (30%), Positives = 200/426 (46%), Gaps = 63/426 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MV+I WL + A+ P + + G ++ P+N L + +P +FG HHSK
Sbjct: 310 MVEIGWLCLQYLLAAQNPKMTIFCG----SVCDPNVALPSNITLVEVNMPAAFGCHHSKI 365
Query: 61 MLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNLSE---ECGFENDLI 114
+ Y G +RI+V TAN+ DW N++QGLWM PL + N S+ F+
Sbjct: 366 SVFKYSDGGIRIVVSTANIYSDDWENRTQGLWMSPHLPPLPNSANPSDGESPTNFKKSFR 425
Query: 115 DYLSTLKWPEFSA--NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
+YL+ + P+ NL K+ + S+ V +AS+PG H G SL WGH
Sbjct: 426 EYLNAYRNPKLVEWENL------------VKRADCSAVNVFFVASIPGSHKGLSLNSWGH 473
Query: 173 MKLRTVLQE-CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP 231
+L +L E + ++ Q SS+G+L + + + S++ S +K P
Sbjct: 474 RRLAAILNEHAVLPPDAPQWTIIAQSSSIGNLGPTFDSWIQSNIVFSLSREKAKGIKSNP 533
Query: 232 --LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIK 288
V+P++ + S + A +P +K+ +K ++LK Y +WKA TGR++AMPH+K
Sbjct: 534 NFHFVYPSLRNYEGSFDCKAGSCCLPYSRKSHEKQEWLKNYLYQWKADETGRTKAMPHVK 593
Query: 289 TFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
++ R + ++ WF+LTSANLSK AWG K I +YE GV+ +P F
Sbjct: 594 SYTRISPDLTQIPWFVLTSANLSKGAWGTTAKTGVSHYIMNYEAGVVFIPK-------FV 646
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
P IK+ S S ++ +PY+LP RY D
Sbjct: 647 INQQTFP--IKTSS------------------------SPDIPVFRLPYDLPLTRYRQND 680
Query: 407 VPWSWD 412
VP+ D
Sbjct: 681 VPFVID 686
>gi|242006203|ref|XP_002423943.1| tyrosyl-DNA phosphodiesterase, putative [Pediculus humanus
corporis]
gi|212507213|gb|EEB11205.1| tyrosyl-DNA phosphodiesterase, putative [Pediculus humanus
corporis]
Length = 447
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 134/432 (31%), Positives = 205/432 (47%), Gaps = 75/432 (17%)
Query: 1 MVDIDWLLPACPVLAKI-PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
MV++ WL+ + P + +++ DG L ++ + I K P P FG HH+K
Sbjct: 73 MVELPWLMAQYAINDLFNPSMTILYDVQDGDLANIPEHLNIKAIKIKSPYP--FGHHHTK 130
Query: 60 AMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWM--------QDFPLKDQNNLSEECGFE 110
+ Y R +R ++TANLI DW +++QG+W+ D P+ N + F+
Sbjct: 131 MSIFFYTDRSIRFAIYTANLIESDWEDRTQGVWISPKCPYLGDDVPI---NYGESDTLFK 187
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 170
+++ YL + K PE L KI + + S V ++SVPG S + +
Sbjct: 188 FEILQYLISYKLPEIRNLL-----IKIQET-----DCSLIKVFFVSSVPG----SVIDNF 233
Query: 171 GHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL----DEKWMAELSSSMSSGFSEDKTPL 226
G++KL +++E E K +V Q SS+GSL D + E S SS S +
Sbjct: 234 GYIKLGKIIKEHAVENSEDKERIVIQCSSIGSLGPAPDSWLLNEFVKSTSSKLSSPQVS- 292
Query: 227 GIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMP 285
IV+P+V +V S+ G + G +P S ++ + +L KY +W H RS+A+P
Sbjct: 293 ------IVYPSVRNVASSIYGLSGGGCLPYSSGTHIKQLWLNKYLMQWYCEHRKRSKAVP 346
Query: 286 HIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 343
HIKT+AR N K ++WFLLTSANLSKAAWG K + L I SYE GVL LP +
Sbjct: 347 HIKTYARINEDKEEISWFLLTSANLSKAAWGKKLK-SGMLQIMSYEAGVLFLPKLLINKN 405
Query: 344 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 403
F +I+K ++G E P+PY++P Y
Sbjct: 406 VF-------------------KIKKF---------GYNSGNDDE---FPIPYDIPLTSYQ 434
Query: 404 SEDVPWSWDKRY 415
D + +DK +
Sbjct: 435 ETDRLFLFDKNF 446
>gi|17540580|ref|NP_500149.1| Protein F52C12.1 [Caenorhabditis elegans]
gi|37999811|sp|Q9TXV7.1|TYDP1_CAEEL RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase
gi|351063437|emb|CCD71624.1| Protein F52C12.1 [Caenorhabditis elegans]
Length = 451
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 133/441 (30%), Positives = 206/441 (46%), Gaps = 83/441 (18%)
Query: 1 MVDIDWLLPACP-VLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
M+D ++L+ + P L + P LV+ L +N+ ++ LPI FGTHH+K
Sbjct: 75 MLDFEFLIGSYPPSLREYPITLVVGAPDAPDLLKCTKNQKLVTVVGAS-LPIPFGTHHTK 133
Query: 60 AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
+L G +IV TANL+ DW K+Q + +F +K + F++DL++YLS
Sbjct: 134 MSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIASGTVPRSDFQDDLLEYLS 192
Query: 119 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV 178
+ +K +FS + RLI S PGYHT ++ GH +L +
Sbjct: 193 MYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGYHTDPPTQRPGHPRLFRI 241
Query: 179 LQE-CTFEKGFK---KSPLVYQFSSLGSLDE---KWMAE--LSSSMSSGFSEDKTPLGIG 229
L E F+ ++ + V Q SS+GSL W L S + S + P +
Sbjct: 242 LSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQSLEGANPSPKQKPAKM- 300
Query: 230 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
+V+P+VEDVR S +GYA G ++P + + +L+ KW+++ R+ A+PH K
Sbjct: 301 --YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMCKWRSNAKRRTNAVPHCK 358
Query: 289 TFARYNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLILPSAKRHGCG 344
T+ +Y+ + W LLTSANLSKAAWG + KN QLMIRS+E+GVLI
Sbjct: 359 TYVKYDKKVAIWQLLTSANLSKAAWGEVSFNKSKNVEQLMIRSWEMGVLI---------- 408
Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
T+ S+ +P++ P YS+
Sbjct: 409 ----------------TDPSRFN-------------------------IPFDYPLVPYSA 427
Query: 405 EDVPWSWDKRYTKKDVYGQVW 425
D P+ DK++ K D+ G +W
Sbjct: 428 TDEPFVTDKKHEKPDILGCIW 448
>gi|195388525|ref|XP_002052930.1| GJ17827 [Drosophila virilis]
gi|194149387|gb|EDW65085.1| GJ17827 [Drosophila virilis]
Length = 592
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 136/433 (31%), Positives = 203/433 (46%), Gaps = 69/433 (15%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
M+DI WLL +L K +LV++G+ L + + KP + K +P F T H
Sbjct: 204 MIDIGWLLGHYYFAGILDK--PLLVLYGDESPDLLGIGKFKPQVTAI-KVNMPTPFATSH 260
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 111
+K MLL Y G +R+++ TANL DW+N++QGLW+ P E GF+
Sbjct: 261 TKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWISPRLPALPEGADTAAGESPTGFKQ 320
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + +K +FS+ V LI SVPG H S+++
Sbjct: 321 DLMLYLVEYKVSQLQPWI----------ARIRKSDFSAVNVFLIGSVPGGHRESAVRGHP 370
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG 229
WG +L ++L + + P+V Q SS+GSL A + + +D TP+G
Sbjct: 371 WGCARLGSLLAKHAAPVD-DRIPVVCQSSSIGSLGANVQAWIQQDFVNNLRKDSTPVGRL 429
Query: 230 EPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 284
L +++P+ +V S +G G +P + DK +LK + +WK+ RS+AM
Sbjct: 430 RQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYSKNTNDKQPWLKAHLQQWKSGDRHRSQAM 489
Query: 285 PHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILPSAK 339
PHIK++ R+N Q + WF+LTSANLSKAAWG+ KN+ L I +YE GVL LP
Sbjct: 490 PHIKSYTRFNLEQQCVYWFVLTSANLSKAAWGSFNKNSQIQPCLRIANYEAGVLFLPR-- 547
Query: 340 RHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 399
F P A V P+PY++P
Sbjct: 548 -----FVTGEETFPL---------------------------GNARDGVPAFPLPYDVPL 575
Query: 400 QRYSSEDVPWSWD 412
Y +D P+ D
Sbjct: 576 TPYGPDDTPFLMD 588
>gi|310798351|gb|EFQ33244.1| tyrosyl-DNA phosphodiesterase [Glomerella graminicola M1.001]
Length = 517
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 128/425 (30%), Positives = 199/425 (46%), Gaps = 80/425 (18%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW-------- 91
N LH +P FGTHHSK M+LI + ++++HTAN+I DW N + +W
Sbjct: 130 NVELHSAFMPEMFGTHHSKMMILIRHDDSAQVVIHTANMIAKDWTNMTNAVWRSPMLPLL 189
Query: 92 ----MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF 147
++D P D + E F++DL+ YL ++A P K ++F
Sbjct: 190 PNNYVEDAPTNDHPFGTGE-RFKHDLLGYLRA-----YNARRP---TLKSLVDQICHYDF 240
Query: 148 SSAAVRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL-- 203
SS +LIASVPG H +S WG L+ L+ ++G KS +V Q SS+ +L
Sbjct: 241 SSVRAKLIASVPGRHPIHDTSQTAWGWPALKRALRSVPVQEG--KSEVVVQVSSIATLGS 298
Query: 204 DEKWMAE-----LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSP- 257
+ W + L+ S ++ S + + V+PT +++R SL+GYA+G +I +
Sbjct: 299 SDSWTQKCLFDSLAVSKNNSSSNPRPKFKV-----VFPTADEIRRSLDGYASGGSIHTKI 353
Query: 258 ---QKNVDKDFLKKYWAKWKAS--------------HTGRSRAMPHIKTFARYNGQKLAW 300
Q+ +L+ + W GR RA PHIKT+ RY + + W
Sbjct: 354 QSQQQAKQLQYLRSMFCHWANDAPDGEPLPETATIREAGRQRAAPHIKTYIRYGEKSIDW 413
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
L+TSAN+SK AWG + + ++ I S+E+GVL+ PS I G+
Sbjct: 414 ALVTSANISKQAWGEAARPSQEVRIASWEIGVLVWPSI------------IAEKATMIGA 461
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
E+ QK DAG VV + +PY +P Q Y +++PW +T+ D
Sbjct: 462 FESDMPQK------------DAGDGDPVVGIRIPYSIPLQSYGKDEIPWVASMVHTEPDS 509
Query: 421 YGQVW 425
G+ W
Sbjct: 510 MGRFW 514
>gi|268553849|ref|XP_002634911.1| Hypothetical protein CBG22509 [Caenorhabditis briggsae]
Length = 421
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 117/349 (33%), Positives = 180/349 (51%), Gaps = 32/349 (9%)
Query: 1 MVDIDWLLPACP-VLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
M+D +LL + P L P LV+ G SD + N + PLPI FGTHH+K
Sbjct: 50 MIDFQYLLNSYPPSLRTTPMTLVV-GASDKAALSRECAAHKNVTVIGAPLPIPFGTHHTK 108
Query: 60 AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
++ G V +IV TANL+ DW K+Q + +D ++ C F++DL++YLS
Sbjct: 109 MSIMESEDGRVHVIVSTANLVPDDWEFKTQQFYYACGLRRDGE--AQRCPFQSDLLEYLS 166
Query: 119 TLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
F NL + P + +FSS RLI S PGYHT + +G R
Sbjct: 167 ------FYRNL-------LTPWRELIQSTDFSSITDRLIFSTPGYHTHVARLNFGPRLAR 213
Query: 177 TVLQECTFEKGFK---KSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP-- 231
+ ++ F+ ++ + + Q SS+GS+ ++ + E P +P
Sbjct: 214 ILTEKFPFDPSYEHTERCTFISQCSSIGSIGKQPIDWFRGQFLKSL-EGANPAPKSKPAK 272
Query: 232 -LIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIKT 289
+++P VEDVR S +GYA G ++P +V + +L+ KW+++ R+ A+PH KT
Sbjct: 273 MYLIFPCVEDVRTSCQGYAGGGSVPYRNSVHVRQKWLQGVMCKWRSNAKRRTHAVPHCKT 332
Query: 290 FARYNGQKLAWFLLTSANLSKAAWG----ALQKNNSQLMIRSYELGVLI 334
+ +++ + W L+TSANLSKAAWG + K QLM+RSYE+GVLI
Sbjct: 333 YVKFDKKVPQWQLVTSANLSKAAWGEASFSKAKKTDQLMVRSYEMGVLI 381
>gi|170040309|ref|XP_001847946.1| tyrosyl-dna phosphodiesterase [Culex quinquefasciatus]
gi|167863873|gb|EDS27256.1| tyrosyl-dna phosphodiesterase [Culex quinquefasciatus]
Length = 615
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 132/427 (30%), Positives = 206/427 (48%), Gaps = 58/427 (13%)
Query: 1 MVDIDWLLPACPVLA-KIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
MVDI WLL + +L+++G+ L+ + KP N K + FG HH+K
Sbjct: 228 MVDIGWLLGHYFFAGYEDRPLLILYGDESPELKTVSTKKP-NVTALKVHIATPFGVHHTK 286
Query: 60 AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DFPL----KDQNNLSEECGFENDL 113
L Y G +R+++ TANL D++N++QGLW+ P D GF L
Sbjct: 287 MGLYGYTDGSMRVVISTANLYEDDFHNRTQGLWISPRLPALAEDADTGAGESRTGFRESL 346
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-WGH 172
I YL++ K+ + +A + S ++ +F V +AS+PG H ++ WGH
Sbjct: 347 ITYLNSYKFAQLAAWV----------SRIQRTDFGEVNVFFVASIPGGHLNTAKGPLWGH 396
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP- 231
+L +L + + PLV Q SS+GSL + + S + + F D P+G+
Sbjct: 397 PRLGYLLGKHSAPID-DACPLVAQSSSIGSLGPNPQSWVLSEIMASFRRDSAPVGLRRVP 455
Query: 232 --LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 288
+++P+ +VR S + G +P + +K +LK + +WK+ R++A+PHIK
Sbjct: 456 SFRMIFPSFSNVRNSHDNLLGGGCLPYMRATHEKQPWLKDHLHQWKSDCRNRTKAVPHIK 515
Query: 289 TFARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGVLILPSAKRHGCGF 345
T+ R++ + L WFLLTSANLSKAAWG K+ + L I SYE+GVL LP F
Sbjct: 516 TYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSYEVGVLFLPK-------F 568
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
N P E KS +G + + P+PY++P Y+ E
Sbjct: 569 VIDENFFPMESKS-----------------------SGDNKHPAF-PMPYDVPIIPYAPE 604
Query: 406 DVPWSWD 412
D P+ D
Sbjct: 605 DSPFFMD 611
>gi|7529314|emb|CAB86488.1| Glaikit protein [Drosophila melanogaster]
Length = 580
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 126/358 (35%), Positives = 183/358 (51%), Gaps = 37/358 (10%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHG-ESDGTLEHMKRNKPANWILHKPPLPISFGTH 56
MVDI WLL +L K +LV++G ES L K + I K P P F T
Sbjct: 192 MVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKQQVTAIRVKMPTP--FATS 247
Query: 57 HSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSEE-CGFE 110
H+K M L Y G +R+++ TANL DW+N++QGLW+ P+ E GF+
Sbjct: 248 HTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADTGARESLTGFK 307
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK- 169
D + YL K + +P + +FS+ V + SVPG H S++
Sbjct: 308 QDRMLYLVEYKISQLQPWIPR----------IRNSDFSAINVFFLGSVPGGHREGSVRGH 357
Query: 170 -WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGI 228
WGH +L ++L + + P+V Q SS+GSL A + + +D TP+G
Sbjct: 358 PWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSPKKDSTPVGK 416
Query: 229 GEPL----IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRA 283
+ +++P+ +V S +G G +P N ++ +LK Y +WK+S RSRA
Sbjct: 417 LRQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDNQPWLKDYLQQWKSSDRFRSRA 476
Query: 284 MPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
MPHIK++ R+N Q + WF+LTSANLSKAAWG KN++ L I +YE GVL LP
Sbjct: 477 MPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 534
>gi|301770839|ref|XP_002920828.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Ailuropoda
melanoleuca]
Length = 205
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 136/232 (58%), Gaps = 36/232 (15%)
Query: 200 LGSLDEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-S 256
+G+ D KW+ +E S+ + E +TP PL +++P+VE+VR SLEGY AG ++P S
Sbjct: 1 MGADDSKWLCSEFKESLVTLGKESQTPGRSAVPLHLIYPSVENVRTSLEGYPAGGSLPYS 60
Query: 257 PQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWG 314
Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+TSANLSKAAWG
Sbjct: 61 IQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYMRPSPDFSEIAWFLVTSANLSKAAWG 120
Query: 315 ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL 374
AL+KN +QLMIRSYELGVL LPSA F S V + GS E +
Sbjct: 121 ALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFFGSKEPAAA-------- 166
Query: 375 TWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 425
PVPY+LPP+ Y S+D PW W+ YTK D +G +W
Sbjct: 167 ----------------FPVPYDLPPELYGSKDRPWIWNIPYTKAPDTHGNMW 202
>gi|321478262|gb|EFX89219.1| hypothetical protein DAPPUDRAFT_310135 [Daphnia pulex]
Length = 580
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 124/364 (34%), Positives = 184/364 (50%), Gaps = 44/364 (12%)
Query: 1 MVDIDWLLPA-CPVLAKIPHVLVIHG-ESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 58
MV++ WLL C + +LVI+G ES+ R + I KP P FG+HH+
Sbjct: 194 MVELGWLLAQYCQHKVQRKPMLVIYGTESEELAAAQSRVPTLHTIRVKPKYP--FGSHHT 251
Query: 59 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNN-----------LS 104
K ++ Y G +RI+VHT NLI DW +++QGLW+ PL ++N
Sbjct: 252 KMSMMSYEDGNLRIVVHTGNLIESDWEDRTQGLWISPSCPPLSSKDNEKIGDGDSIGDGD 311
Query: 105 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 164
GF+ DLI YL + + ++ + SS V I S PG H
Sbjct: 312 SITGFKRDLIRYLESYSLSALKPWIEK----------IRQADMSSIKVCFIPSSPGSHAI 361
Query: 165 SS-----LKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWMA-ELSSSM 215
S + KWGH+ L +LQ+ + ++ Q SS+GSL W+A EL SM
Sbjct: 362 QSEANEKVPKWGHLHLSWLLQQHASSEA--DDSIIMQCSSIGSLGPSPSSWLAGELGVSM 419
Query: 216 SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWK 274
G S T LG +V+P +DV+ S+ G G +P S Q + + + + KW+
Sbjct: 420 --GASSGVTKLGQKNVQVVYPCFQDVKSSIHGLLGGGCLPYSHQGHNKQTWFTGFLHKWR 477
Query: 275 ASHTGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 332
+ R+ AMPHIK++AR + + ++F+LTSAN+SKAAWG +++LMI+S+E GV
Sbjct: 478 SDSRLRTTAMPHIKSYARVSSDMSRASFFVLTSANVSKAAWGMRINKDTKLMIQSFEAGV 537
Query: 333 LILP 336
L LP
Sbjct: 538 LFLP 541
>gi|389628810|ref|XP_003712058.1| hypothetical protein MGG_06176 [Magnaporthe oryzae 70-15]
gi|351644390|gb|EHA52251.1| hypothetical protein MGG_06176 [Magnaporthe oryzae 70-15]
gi|440474085|gb|ELQ42852.1| tyrosyl-DNA phosphodiesterase 1 [Magnaporthe oryzae Y34]
gi|440485911|gb|ELQ65827.1| tyrosyl-DNA phosphodiesterase 1 [Magnaporthe oryzae P131]
Length = 555
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 135/424 (31%), Positives = 196/424 (46%), Gaps = 69/424 (16%)
Query: 38 KPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWM---- 92
K N +LH LP FGTHHSK ++L+ + ++I+HTAN+I DW N + G+W+
Sbjct: 165 KHKNVVLHTAFLPEIFGTHHSKMLVLLRHDDTAQVIIHTANMIPKDWTNMTNGIWLSPRL 224
Query: 93 -----QDFPLKDQ-NNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 144
QD Q NL+E G F+ DL++YL + + N +K
Sbjct: 225 PLLQGQDPADASQYENLAEGTGYKFKIDLLNYLRA-----YDDKRVVCRDLVTN---LEK 276
Query: 145 FNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGS 202
++FSS LIASVPG H T S WG + ++ L+ + G KS +V Q SS+ +
Sbjct: 277 YDFSSIRGTLIASVPGRHDFTDLSTSAWGWVAIKRALRSVPLQVG--KSEVVTQISSIAT 334
Query: 203 LD--EKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----P 255
L + W+ L SM G + P + I++PT +++R SL+GY +G +I
Sbjct: 335 LGPTDTWLQRTLFESMCRGKTTGVAPRP--QFKIIFPTADEIRRSLDGYGSGGSIHTKIQ 392
Query: 256 SPQKNVDKDFLKKYWAKWKAS--------------HTGRSRAMPHIKTFARYNGQKLAWF 301
S Q+ + K W GR+RA PHIKT+ RY + W
Sbjct: 393 SSQQAKQLIYQKPLLCHWANDSPHGQDLGQNIPILDAGRNRAAPHIKTYIRYGANSIDWA 452
Query: 302 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 361
LL+SANLSK AWG SQ I S+E+GVL+ P ++ + +K
Sbjct: 453 LLSSANLSKQAWGDATGAGSQTRISSWEIGVLVWPE-------LFAKDALMTTVVKK--- 502
Query: 362 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVY 421
+T + T L VV L PY LP Q+Y + +VPW Y++ D
Sbjct: 503 DTPSRETTNLC-----------PGRPVVGLRSPYSLPVQKYGNGEVPWVATLSYSEPDWA 551
Query: 422 GQVW 425
G W
Sbjct: 552 GNTW 555
>gi|164425147|ref|XP_962379.2| hypothetical protein NCU06345 [Neurospora crassa OR74A]
gi|157070809|gb|EAA33143.2| hypothetical protein NCU06345 [Neurospora crassa OR74A]
Length = 527
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 165/514 (32%), Positives = 231/514 (44%), Gaps = 101/514 (19%)
Query: 3 DIDWLLPACPVLAK-IPHVLVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTH 56
DID+L+ A + + V VIHG E L+ + N H LP FGTH
Sbjct: 26 DIDFLMGAFDSDVRHLIKVHVIHGFWKKEDPNRLQIQSDAARYPNITTHHAYLPEPFGTH 85
Query: 57 HSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEE---- 106
HSK M+L+ II+HTANLI DW+N +Q W+ P QN S
Sbjct: 86 HSKMMVLLRADDTAEIIIHTANLIPRDWSNMTQAAWISPRLPLLKPDAQQNTSSTRSPPP 145
Query: 107 --CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH 162
CG F+ D ++YL + + A N I+ K++FSS LIASVPG H
Sbjct: 146 AGCGEKFKIDFLNYLRSYR---------AACNPLIDQ--LAKYDFSSIRGSLIASVPGRH 194
Query: 163 T--GSSLKKWGHMKLRTVLQECTFEKG------------FKKSPLVYQFSSLGSLD--EK 206
+ +WG ++ L+ + +K +V Q SS+ +L +
Sbjct: 195 SLVDDFPTRWGWAAMKETLKSVPVRQAGDRVQGGGDVDDSEKPEVVIQISSIATLGPTDN 254
Query: 207 WMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAI----PSPQ 258
W+ SG KT L +P I++PT +++R SL+GYA+G +I S Q
Sbjct: 255 WLKNTLFEALSGSQGPKTLLS-SKPKPDFKIIFPTPDEIRKSLDGYASGGSIHTKIQSAQ 313
Query: 259 KNVDKDFLKKYWAKWKAS--------------HTGRSRAMPHIKTFARYNGQK----LAW 300
+ +L+ + W GR+RA PHIKTF R+ K + W
Sbjct: 314 QAKQLQYLRPMFCHWANDSADGVGTTTTTPIREAGRNRAAPHIKTFIRFANHKTKNTIDW 373
Query: 301 FLLTSANLSKAAWGALQ-KNNS---QLMIRSYELGVLILPSAKRHGCGFSCTSNI----- 351
LLTSANLSK AWG Q KNN+ Q+ I SYE+GVL+ P G S S +
Sbjct: 374 ALLTSANLSKQAWGDAQSKNNAGEPQVHICSYEIGVLVWPELFADSDGTSSGSKMGQKAV 433
Query: 352 -VPSEIKS-----GSTE---TSQIQKTKLVTLTWHGSSDAGASSE--------VVYLPVP 394
VP+ +K GS + +S +K + + +G D E VV L +P
Sbjct: 434 MVPTFLKDTPAIHGSEKDRPSSLGEKQGPTSTSRNGEKDGKGDDEKEEKSSTVVVGLRMP 493
Query: 395 YELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 428
Y LP QRY ++VPW + + D GQVW RH
Sbjct: 494 YNLPLQRYGLQEVPWVATANHLEPDWMGQVW-RH 526
>gi|195342204|ref|XP_002037691.1| GM18399 [Drosophila sechellia]
gi|194132541|gb|EDW54109.1| GM18399 [Drosophila sechellia]
Length = 539
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 182/359 (50%), Gaps = 39/359 (10%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
MVDI WLL +L K P +L+ ES L K + I K P P F T H
Sbjct: 162 MVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLNIGKFKQQVTAIRVKMPTP--FATSH 218
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 111
+K M L Y G +R+++ TANL DW+N++QGLW+ P+ E GF+
Sbjct: 219 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQ 278
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 169
DL+ YL K + + + + +FS+ V + SVPG H S++
Sbjct: 279 DLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHP 328
Query: 170 WGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLG 227
WGH +L +++ + E + P+V Q SS+GSL A + + +D T +G
Sbjct: 329 WGHARLASLVAKHAAPIED---RIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTSVG 385
Query: 228 IGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSR 282
+ +++P+ +V S +G G +P + DK +LK Y +WK+S RSR
Sbjct: 386 KLRQMPPFKMIYPSYGNVSGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRYRSR 445
Query: 283 AMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
AMPHIK++ R+N Q + WF+LTSANLSKAAWG K+++ L I +YE GVL LP
Sbjct: 446 AMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKSSNIQPCLRIANYEAGVLFLP 504
>gi|340960785|gb|EGS21966.1| hypothetical protein CTHT_0038420 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 487
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 138/466 (29%), Positives = 203/466 (43%), Gaps = 73/466 (15%)
Query: 1 MVDIDWLLPACPVLAK-IPHVLVIHG-----ESDGTLEHMKRNKPANWILHKPPLPISFG 54
M DID+L+ A + + V V+HG + H + + N LH +P FG
Sbjct: 51 MHDIDFLMSAFDEDTRHLVKVHVVHGFWKREDLSRVTLHEQAARYPNVALHAAYMPEMFG 110
Query: 55 THHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPL----KDQNNLSEE-- 106
THHSK M+L+ + RI++HTAN+I DW N +Q +WM PL Q N+ E
Sbjct: 111 THHSKMMILLRHDDTARIVIHTANMIVRDWTNMTQAVWMSPWLPLMKGPSQQENVHEAKP 170
Query: 107 ---CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFK--KFNFSSAAVRLIASVPGY 161
F+ DL++YL + G P K +F+FS LIASVPG
Sbjct: 171 GSGAKFKVDLLNYLRAYD---------SRGRETCKPIIEKLMRFDFSEVKGALIASVPGR 221
Query: 162 H--TGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGF 219
H SS +WG + L+ + + + + ++LG D S ++S G
Sbjct: 222 HKLNDSSPTRWGWAAMEQALKTVPVHQQAEIAIQISSIATLGPTDNWLKNTFSRALSGGR 281
Query: 220 SEDKTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA 275
+ + +P +++PT +++R SL+GYA+G +I + ++ + + K
Sbjct: 282 G-----VSLSQPPPSFKVIFPTADEIRKSLDGYASGGSIHTKIQSPQQVKQLQQADKSAV 336
Query: 276 SHTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWG-------------ALQKN 319
+GR RA PHIKT+ RY Q + W LLTSANLSK AWG
Sbjct: 337 LDSGRKRAAPHIKTYIRYGNKSHQTIDWALLTSANLSKQAWGEAASAPGGSKGKSTASSG 396
Query: 320 NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 379
+ ++ I SYE+GVL+ P T G T Q K
Sbjct: 397 DREVRIASYEIGVLVWPELWGEDAAMKATFMTDNLGDSRGGEFTEQEGKV---------- 446
Query: 380 SDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
V L +PY LP Q Y + +VPW + + D GQVW
Sbjct: 447 --------TVALRMPYSLPLQPYDNAEVPWVATTNHEEPDWMGQVW 484
>gi|336471045|gb|EGO59206.1| hypothetical protein NEUTE1DRAFT_145272 [Neurospora tetrasperma
FGSC 2508]
gi|350292122|gb|EGZ73317.1| phospholipase D/nuclease, partial [Neurospora tetrasperma FGSC
2509]
Length = 619
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 160/513 (31%), Positives = 229/513 (44%), Gaps = 99/513 (19%)
Query: 3 DIDWLLPACPVLAK-IPHVLVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTH 56
DID+L+ A + + V VIHG E+ L+ + N H LP FGTH
Sbjct: 118 DIDFLMSAFDSDVRHLIKVHVIHGFWKKENTNRLQIQSDAARYPNITTHHAYLPEPFGTH 177
Query: 57 HSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEECG-- 108
HSK M+L+ II+HTANLI DW+N +Q W+ P QNN S
Sbjct: 178 HSKMMVLLRADDTAEIIIHTANLIPRDWSNMTQAAWISPRLPLLKPDAQQNNSSPRSSLP 237
Query: 109 ------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH 162
F+ D ++YL + + A N I+ K++FSS LIASVPG H
Sbjct: 238 AGSGEKFKIDFLNYLRSYR---------AACNPLIDQ--LAKYDFSSIRGSLIASVPGRH 286
Query: 163 T--GSSLKKWGHMKLRTVLQECTFEKG------------FKKSPLVYQFSSLGSLD--EK 206
+ +WG ++ L+ + +K +V Q SS+ +L +
Sbjct: 287 SLVDDFPTRWGWAAMKETLKSVPVRQAGDRVQGGGDVDDSEKPEVVIQISSIATLGPTDN 346
Query: 207 WMAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAI----PSPQK 259
W+ SG KT L I++PT +++R SL+GYA+G +I S Q+
Sbjct: 347 WLKNTLFEALSGSQGPKTLLSSKSKPDFKIIFPTPDEIRKSLDGYASGGSIHTKIQSAQQ 406
Query: 260 NVDKDFLKKYWAKWKAS--------------HTGRSRAMPHIKTFARYNGQK----LAWF 301
+L+ + W GR+RA PHIKTF R+ + W
Sbjct: 407 AKQLQYLRPIFCHWANDSADGVGTTTTTPIREAGRNRAAPHIKTFIRFANHNTKNSIDWA 466
Query: 302 LLTSANLSKAAWGALQ-KNNS---QLMIRSYELGVLILPSAKRHGCGFSCTSN------I 351
LLTSANLSK AWG Q KNN+ Q+ I SYE+GVL+ P G S S +
Sbjct: 467 LLTSANLSKQAWGDAQSKNNAGEPQVRICSYEIGVLVWPELFADSDGTSSGSKTGQKAVM 526
Query: 352 VPSEI-KSGSTETSQIQKTKLV-------TLTWHGSSDAGASSE--------VVYLPVPY 395
VP+ + + ++ S+ +T L+ + + +G D E VV L +PY
Sbjct: 527 VPTFLTDTPASHGSEKDRTSLLGEKQGSASTSGNGEEDGKGDDEKEEKSSTVVVGLRMPY 586
Query: 396 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 428
LP QRY ++VPW + + D GQVW RH
Sbjct: 587 NLPLQRYGLQEVPWVATANHLEPDWMGQVW-RH 618
>gi|312069908|ref|XP_003137901.1| tyrosyl-DNA phosphodiesterase [Loa loa]
Length = 426
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 134/440 (30%), Positives = 190/440 (43%), Gaps = 102/440 (23%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDG-----TLEHMKRNKPANWILHKPPLPISFGT 55
M+D+ WLL P + + +I GE G T +K+ N + + L I FGT
Sbjct: 75 MIDLRWLLTQYPGRLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVNVGRARLMIPFGT 134
Query: 56 HHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLID 115
HHSK + + + + L D P ++ ++ F+ DL+
Sbjct: 135 HHSKISI--------------------FESNTGRLAAGDCPDRNGSD------FQTDLVK 168
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YL K + L H +++ + S R++ SVPG H G L K+GH +L
Sbjct: 169 YLDEYKTSQ-DWGLIEHWRDRVS-----NIDLSQVKARVVYSVPGTHKGVQLTKYGHPRL 222
Query: 176 RTVLQECTFE----KGFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSEDKTPLGIGE 230
R +L+E + GF SLG+ + W+ + +S+S G D GE
Sbjct: 223 RVILKELFGDVKNMDGFTYHAQCSSLGSLGAAPQYWLTGQFLNSLSGGAETD------GE 276
Query: 231 PL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
L I++P VEDVR S EGYAAG + P S V + +L + KW + H GRSRAMPHIK
Sbjct: 277 HLRIIYPCVEDVRNSNEGYAAGGSFPYSNSVAVKQPYLLNFMHKWSSDHLGRSRAMPHIK 336
Query: 289 TFARYNGQKL--AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
T+A + L +W L+TSANLSKAAWG Q QL IRSYE G+L
Sbjct: 337 TYAAFAKDSLKPSWLLITSANLSKAAWGDYQSKKPQLTIRSYEFGLLF------------ 384
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
SD + + Y +LP +Y D
Sbjct: 385 ---------------------------------SDPESLDMLPY-----DLPLTKYDDND 406
Query: 407 VPWSWDKRYTKKDVYGQVWP 426
W DK Y K D++ + WP
Sbjct: 407 RVWIVDKTYRKPDIFRKTWP 426
>gi|291233547|ref|XP_002736713.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Saccoglossus
kowalevskii]
Length = 431
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 106/285 (37%), Positives = 152/285 (53%), Gaps = 41/285 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHG--ESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 58
M DI WL+ P + +L+IHG +D T H ++ N L + L I +GTHHS
Sbjct: 157 MFDIPWLVQQYPEQFRSKPLLIIHGSQRADKTTLHENAHRYPNITLCQAKLDIMYGTHHS 216
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE---CGFENDL 113
K M L+Y G+R+++HTAN+IH DW K+QG+W+ FP L +LS+ F DL
Sbjct: 217 KMMFLLYDNGMRVVIHTANIIHNDWYQKTQGVWISPLFPKLASDQDLSQGDSVTQFRKDL 276
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSF------FKKFNFSSAAVRLIASVPGYHTGSSL 167
++YL G + N ++ + SSA V +I SVPG HTG+S
Sbjct: 277 LEYL---------------GAYGTNKHLQEWQETIRQHDMSSAKVFIIGSVPGRHTGASK 321
Query: 168 KKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGS--------LDEKWMAELSSSMSSG 218
KWGH+KLR VLQE + K P++ QFSS+GS L +W+ LS+ ++G
Sbjct: 322 MKWGHLKLRKVLQEHGPDGSTVKDWPVIGQFSSVGSLGSGPENWLSSEWLESLSTVQANG 381
Query: 219 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK 263
+ P + +++P VE+VR SLEGY AG ++P KN K
Sbjct: 382 IVKLSKP----KLNLIFPCVENVRRSLEGYPAGASLPYSIKNARK 422
>gi|358365748|dbj|GAA82370.1| tyrosyl-DNA phosphodiesterase [Aspergillus kawachii IFO 4308]
Length = 585
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 123/417 (29%), Positives = 190/417 (45%), Gaps = 67/417 (16%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P +FGTHHSK M+L+ + ++++HTAN+I DW N Q +W PL ++ SE
Sbjct: 194 MPEAFGTHHSKMMILLRHDDLAQVVIHTANMIAGDWANMCQAVWRSPLLPLCSNSSGSES 253
Query: 107 CG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVP 159
F+ DL+ YL +G K P + +K +FS+ L+ASVP
Sbjct: 254 IATPGTRFKRDLLSYLR------------EYGPKKTGPLVAQLEKHDFSTVRAALVASVP 301
Query: 160 GYH-----TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL 211
T S+ K WG + LR VL+ ++ + +V Q SS+ SL +KW+ ++
Sbjct: 302 SKQKIRESTDSTRKTLWGWLALRDVLRSVPIDRSEDRPHIVTQISSVASLGQTDKWLKDV 361
Query: 212 SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLK 267
+ S S P I++PT +++R SL GY +G +I S + +++
Sbjct: 362 FFTSLSPSSNTPKPRFS----IIFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMR 417
Query: 268 KYWAKWKAS----------HTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWG 314
Y W GR RA PHIKT+ RY+ ++ W ++TSANLS AWG
Sbjct: 418 SYLCHWAGDGAEDEVKVKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWG 477
Query: 315 ALQKNNSQLMIRSYELGVLILPS------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQK 368
A N ++ I S+E+GV++ P A+ C VP + +
Sbjct: 478 AAVNANGEVRICSWEIGVVVWPELVTGAGAEGRSVMVPCFRRDVPDADAVAAAGAAANAN 537
Query: 369 TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
K + T V +PY+LP RYS D+PW +++ D GQ W
Sbjct: 538 VKEIPTT-----------TTVGFRMPYDLPLTRYSETDIPWCATASHSEPDWLGQTW 583
>gi|322706849|gb|EFY98429.1| tyrosyl-DNA phosphodiesterase 1 [Metarhizium anisopliae ARSEF 23]
Length = 517
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 134/444 (30%), Positives = 204/444 (45%), Gaps = 90/444 (20%)
Query: 18 PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTA 76
PH L + ES G +++K LH P+P FGTHHSK M+L + II+HTA
Sbjct: 126 PHRLALTAESSG-FDNVK--------LHVAPMPEMFGTHHSKMMVLFRHDNTAEIIIHTA 176
Query: 77 NLIHVDWNNKSQGLWMQDFPLKDQ-----NNLSEECG--------FENDLIDYLSTLKWP 123
N+I DW N + +W P Q L E C F+ DL++YL +
Sbjct: 177 NMIPKDWTNMTNAVWRT--PRLSQLPPGFRQLQEYCDLPIGSGERFKADLLNYLKSYDSR 234
Query: 124 EFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK--KWGHMKLRTVLQE 181
+ + + +++FSS LIASVPG H L +G ++ L
Sbjct: 235 KLTC--------RTLIDRLVQYDFSSVKGALIASVPGKHDIHDLSGTAYGWSGVKRYLSS 286
Query: 182 CTFEKGFKKSPLVYQ-FSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
++G K + L F SL + ++ S FS IV+PT ++
Sbjct: 287 VPCKEGAKDTWLQKTLFDSLAT------SKTKSLQRPKFS------------IVFPTADE 328
Query: 241 VRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------------KASHTGRSR 282
+R SL+GYA+G +I S Q+ +L++ W K + GR R
Sbjct: 329 IRQSLDGYASGASIHTKIQSSQQAQQLGYLRRILHHWANDSPDGIASSPEIKTRNGGRDR 388
Query: 283 AMPHIKTFARYNGQ-KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 341
A PHIKT+ RYN + + W +LTSAN+SK AWG + + +L + S+E+GVL+ P
Sbjct: 389 AAPHIKTYIRYNEEGSIDWAMLTSANISKQAWGEASRPSGELRVASWEIGVLVWP----- 443
Query: 342 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 401
+V ++ T S + K SS A AS ++ + +PY LP QR
Sbjct: 444 --------GLVGQDVSMVGTFQSDVPKKP----KEQASSKADASGVLMGVRIPYSLPLQR 491
Query: 402 YSSEDVPWSWDKRYTKKDVYGQVW 425
Y +E+VPW ++++ D +G+ W
Sbjct: 492 YGAEEVPWVATMQHSEPDRFGRQW 515
>gi|317027510|ref|XP_001399437.2| tyrosyl-DNA phosphodiesterase [Aspergillus niger CBS 513.88]
Length = 568
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 124/411 (30%), Positives = 188/411 (45%), Gaps = 68/411 (16%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P +FGTHHSK M+L+ + ++++HTAN+I DW N Q +W PL + SE
Sbjct: 190 MPEAFGTHHSKMMILLRHDDLAQVVIHTANMIAGDWANMCQAVWRSPLLPLCSDGSGSEN 249
Query: 107 CG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVP 159
F+ DL+ YL +G K P + +K +FS+ LIASVP
Sbjct: 250 IATPGARFKRDLLSYLRE------------YGQRKTGPLVAQLEKHDFSAVRAALIASVP 297
Query: 160 GYH-----TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL 211
T S+ K WG + LR VL+ + +V Q SS+ SL +KW+ ++
Sbjct: 298 SKQKIRESTDSNQKTLWGWLALRDVLRSVPVSPSEDRPHIVTQISSVASLGQTDKWLKDV 357
Query: 212 SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLK 267
+ S S + P IV+PT +++R SL GY +G +I S + +++
Sbjct: 358 FFASLSPSSNNPKPRFS----IVFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMR 413
Query: 268 KYWAKWKAS----------HTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWG 314
Y W GR RA PHIKT+ RY+ ++ W ++TSANLS AWG
Sbjct: 414 PYLCHWAGDVAEDEVKMKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWG 473
Query: 315 ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL 374
A N ++ I S+E+GV++ P G G S ++P + ++I T V
Sbjct: 474 AAVNANGEVRICSWEIGVVVWPELI-AGAGAEGRSVMMPCFRRDMPDADAEIPTTTTVGF 532
Query: 375 TWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+PY+LP RY D+PW +++ D GQ W
Sbjct: 533 R-----------------MPYDLPLTRYGETDIPWCATASHSEPDWLGQTW 566
>gi|312378421|gb|EFR25002.1| hypothetical protein AND_10059 [Anopheles darlingi]
Length = 436
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 134/425 (31%), Positives = 196/425 (46%), Gaps = 58/425 (13%)
Query: 1 MVDIDWLLPACPVLAKIPHV--LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 58
MVDI WLL A A +V L+++G+ L + + KP N K + G HH+
Sbjct: 53 MVDIGWLL-AHYYFAGYENVPLLILYGDETPELRMVSKKKP-NVTAVKVDIKTPVGVHHT 110
Query: 59 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 117
K L Y G +RI++ TANL DW+N++QGLW+ P + F + D+
Sbjct: 111 KMGLYGYRDGSMRIVISTANLYEDDWHNRTQGLWIS--PRLPAVPEDADTAFGESVTDFR 168
Query: 118 STLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK-WGHMK 174
S L A L A+ ++ P + ++ +FS V L+ASVPG H + WGH +
Sbjct: 169 SNLL-----AYLDAYKLTQLQPWIARIRRTDFSDIKVCLVASVPGGHVNTPKGPLWGHAR 223
Query: 175 LRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP--- 231
L +L + PLV Q SS+GSL + + + + F +D P+GI
Sbjct: 224 LGYLLTKYAAPID-DSCPLVAQSSSIGSLGPSPESWVLGEIMANFRKDSAPIGIRRMPGF 282
Query: 232 LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKTF 290
+++P+ +VR S + G +P + K ++LK Y +W R++AMPHIKT+
Sbjct: 283 RMIYPSYSNVRQSHDSLLGGGCLPYGRATHSKQEWLKTYLHQWFCRSRHRNKAMPHIKTY 342
Query: 291 ARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGVLILPSAKRHGCGFSC 347
R++ + L WFLLTSANLSK+AWG K L I SYE GVL LP
Sbjct: 343 CRWSHRGLYWFLLTSANLSKSAWGVYNKAGRFEKPLRINSYEAGVLFLPK-------LLL 395
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
N P E A + P+PY++P Y+ ED
Sbjct: 396 DENFFPME----------------------------AGKKDPQFPMPYDVPIIPYAPEDT 427
Query: 408 PWSWD 412
P+ D
Sbjct: 428 PFFMD 432
>gi|402082685|gb|EJT77703.1| hypothetical protein GGTG_02808 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 583
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 133/453 (29%), Positives = 213/453 (47%), Gaps = 77/453 (16%)
Query: 20 VLVIHG---ESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 73
V VIHG + D ++R+ + N LH +P FGTHHSK ++L+ + ++++
Sbjct: 160 VNVIHGFWKKDDRRRIDLQRDAAQNKNLTLHTAFMPEMFGTHHSKMLILLRHDDTAQVVI 219
Query: 74 HTANLIHVDWNNKSQGLWMQ-DFPLK----------DQNNLSEECG--FENDLIDYLSTL 120
HTAN+I DW N +Q +W+ PL+ D +L E G F+ DL+ YL
Sbjct: 220 HTANMIPKDWTNMTQSIWLSPRLPLQKPTAPAPAHVDYESLPEGSGEKFKLDLLSYLRAY 279
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTV 178
+ ++++FSS L+ASVPG H S WG +R
Sbjct: 280 D--------KRRAICRPLVQELQRYDFSSVRATLVASVPGRHQIHDRSAATWGWAAIRRA 331
Query: 179 LQECTFEKGFKKSP-LVYQFSSLGSL--DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-- 232
L+ + ++P +V Q SS+ +L + W+ L SMS G + +P
Sbjct: 332 LESVPLQTAAGRTPEVVVQVSSIATLGPTDSWLRGALFDSMSRG---KAAAVAAPKPRFK 388
Query: 233 IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------------ 276
+++PT +++R SL+GYAAG +I S Q+ +LK + W
Sbjct: 389 VIFPTPDEIRASLDGYAAGASIHTKIQSAQQVKQLMYLKPLFCHWANDSALGNEKDENAP 448
Query: 277 --HTGRSRAMPHIKTFARY-NGQK-LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 332
GR+RA PH+KT+ RY +G++ L W L+TSANLSK AWG ++ I S+E+GV
Sbjct: 449 IRDAGRNRAAPHVKTYIRYGDGERSLDWALMTSANLSKQAWGEAVNAMGEVRIASWEIGV 508
Query: 333 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 392
L+ PS F+ + + P + + +++ + G V+ L
Sbjct: 509 LVWPSL------FAEKARMAP------------VFGSDRLSVEEADEARQGGGP-VMGLR 549
Query: 393 VPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+PY LP Q Y +++PW +Y + D G+ W
Sbjct: 550 IPYNLPVQAYGRDEIPWVATAKYDELDCKGRKW 582
>gi|336270704|ref|XP_003350111.1| hypothetical protein SMAC_01002 [Sordaria macrospora k-hell]
Length = 624
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 155/507 (30%), Positives = 225/507 (44%), Gaps = 96/507 (18%)
Query: 3 DIDWLLPACPV-LAKIPHVLVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTH 56
+ID+L+ A + + V V+HG E L+ ++ N H LP FGTH
Sbjct: 132 NIDFLMNAFDEDIRHLVKVHVVHGFWKKEDPNRLQIQSDTDRYPNITTHHAYLPEPFGTH 191
Query: 57 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEECG------ 108
HSK M+L II+HTANLI DW N + G W+ PL +
Sbjct: 192 HSKLMVLFRLDDTAEIIIHTANLIPKDWGNMTNGAWISPRLPLLKADTQQPASSTRSSPP 251
Query: 109 -------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 161
FE D ++YL + + +A P K++FSS LIASVPG
Sbjct: 252 AAGSGEKFEIDFLNYLRSYR----TACKPLVDQLS-------KYDFSSIRGSLIASVPGR 300
Query: 162 HT--GSSLKKWGHMKLRTVLQECTFEKGF-------KKSPLVYQFSSLGSL--DEKWMAE 210
H+ + +WG ++ L+ + +K+ +V Q SS+ +L + W
Sbjct: 301 HSLVDNFPTRWGWAAMKETLKSVPVRQTADRDHNKSEKAEMVIQISSIATLGPTDNW--- 357
Query: 211 LSSSMSSGFSEDKTPLGIGEP------LIVWPTVEDVRCSLEGYAAGNAI----PSPQKN 260
L S++ S + P + +++PT +++R SL+GY++G +I S Q+
Sbjct: 358 LKSTLFEALSGSQGPKTLSSSSKKPDFKVIFPTPDEIRKSLDGYSSGGSIHTKIQSAQQA 417
Query: 261 VDKDFLKKYWAKWKAS---------------HTGRSRAMPHIKTFARYNGQK----LAWF 301
+L+ + W GR RA PHIKTF RY QK + W
Sbjct: 418 KQLQYLRPIFCHWANDSADGGDDTTTTVPIREAGRQRAAPHIKTFIRYTNQKTKDRIDWA 477
Query: 302 LLTSANLSKAAWGALQ-KNNS---QLMIRSYELGVLILPSA-KRHGCGFSCTSNIVP--- 353
LLTSANLSK AWG Q KNN+ Q+ I SYE+GV++ P G G + +VP
Sbjct: 478 LLTSANLSKQAWGDAQSKNNAGEPQVRICSYEIGVMVWPELFADSGGGEKRKAVMVPTFL 537
Query: 354 -------SEIKSGSTETSQIQKTKLVT-----LTWHGSSDAGASSEVVYLPVPYELPPQR 401
S K G++ + TK T G + S+ VV L +PY LP QR
Sbjct: 538 TDTPTGLSSSKDGTSLAGERGGTKSATRDGEDGGAGGDEEEDESTVVVGLRMPYNLPLQR 597
Query: 402 YSSEDVPWSWDKRYTKKDVYGQVWPRH 428
Y ++VPW + + D GQVW RH
Sbjct: 598 YGPQEVPWVATANHLEPDWMGQVW-RH 623
>gi|380479741|emb|CCF42843.1| tyrosyl-DNA phosphodiesterase [Colletotrichum higginsianum]
Length = 520
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 130/450 (28%), Positives = 205/450 (45%), Gaps = 83/450 (18%)
Query: 20 VLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 73
V V+HG + D ++++ A N LH +P FGTHHSK M+LI + ++I+
Sbjct: 107 VHVVHGFWKKEDPNRLALQKDAEAYPNVELHGAFMPEMFGTHHSKMMVLIRHDDSAQVII 166
Query: 74 HTANLIHVDWNNKSQGLW-------MQDFPLKDQNNLSEECG----FENDLIDYLSTLKW 122
HTAN+I DW N + +W + D +D + G F++DL+ YL
Sbjct: 167 HTANMIVRDWTNMTNAVWRSPLLPLLSDEHAEDTSATDHPFGTGKRFKHDLLSYLRA--- 223
Query: 123 PEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLRTVLQ 180
++A P ++FSS IASVPG H +S WG L+ L
Sbjct: 224 --YNARRPITRTLVAQ---LCNYDFSSVRATFIASVPGRHPILDTSQTAWGWPALKRALG 278
Query: 181 ECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-----LSSSMSSGFSEDKTPLGIGEPLI 233
++G +S +V Q SS+ +L + W+ + L+ S + S K +
Sbjct: 279 SVPVQEG--ESEIVIQVSSIATLGPTDSWIQKCLFDSLAVSKNKSSSRPKPKFKV----- 331
Query: 234 VWPTVEDVRCSLEGYAAGNAIPSP----QKNVDKDFLKKYWAKWK--------------A 275
V+PT +++R SL+GYA+G +I + Q+ +L+ + W
Sbjct: 332 VFPTADEIRQSLDGYASGGSIHTKIQSQQQMKQLQYLRPIFCHWANDAPEGKILSETAAI 391
Query: 276 SHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 335
GR RA PHIKT+ RY + + W L+TSAN+SK AWG + ++ + S+E+GVL+
Sbjct: 392 QKAGRERAAPHIKTYIRYGEKSIDWALVTSANISKQAWGEAMGASQEVRVASWEVGVLVW 451
Query: 336 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 395
PS I + G+ ET + + G+ VV L +PY
Sbjct: 452 PSI------------ITDNATMVGTFETDMPPR------------EGGSGDTVVGLRIPY 487
Query: 396 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
LP Q Y +++PW +T+ D G+ W
Sbjct: 488 NLPLQSYGKDEIPWVASMAHTEPDRMGRFW 517
>gi|380095505|emb|CCC06978.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 666
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 155/507 (30%), Positives = 225/507 (44%), Gaps = 96/507 (18%)
Query: 3 DIDWLLPACPV-LAKIPHVLVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTH 56
+ID+L+ A + + V V+HG E L+ ++ N H LP FGTH
Sbjct: 174 NIDFLMNAFDEDIRHLVKVHVVHGFWKKEDPNRLQIQSDTDRYPNITTHHAYLPEPFGTH 233
Query: 57 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEECG------ 108
HSK M+L II+HTANLI DW N + G W+ PL +
Sbjct: 234 HSKLMVLFRLDDTAEIIIHTANLIPKDWGNMTNGAWISPRLPLLKADTQQPASSTRSSPP 293
Query: 109 -------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 161
FE D ++YL + + +A P K++FSS LIASVPG
Sbjct: 294 AAGSGEKFEIDFLNYLRSYR----TACKPLVDQLS-------KYDFSSIRGSLIASVPGR 342
Query: 162 HT--GSSLKKWGHMKLRTVLQECTFEKGF-------KKSPLVYQFSSLGSL--DEKWMAE 210
H+ + +WG ++ L+ + +K+ +V Q SS+ +L + W
Sbjct: 343 HSLVDNFPTRWGWAAMKETLKSVPVRQTADRDHNKSEKAEMVIQISSIATLGPTDNW--- 399
Query: 211 LSSSMSSGFSEDKTPLGIGEP------LIVWPTVEDVRCSLEGYAAGNAI----PSPQKN 260
L S++ S + P + +++PT +++R SL+GY++G +I S Q+
Sbjct: 400 LKSTLFEALSGSQGPKTLSSSSKKPDFKVIFPTPDEIRKSLDGYSSGGSIHTKIQSAQQA 459
Query: 261 VDKDFLKKYWAKWKAS---------------HTGRSRAMPHIKTFARYNGQK----LAWF 301
+L+ + W GR RA PHIKTF RY QK + W
Sbjct: 460 KQLQYLRPIFCHWANDSADGGDDTTTTVPIREAGRQRAAPHIKTFIRYTNQKTKDRIDWA 519
Query: 302 LLTSANLSKAAWGALQ-KNNS---QLMIRSYELGVLILPSA-KRHGCGFSCTSNIVP--- 353
LLTSANLSK AWG Q KNN+ Q+ I SYE+GV++ P G G + +VP
Sbjct: 520 LLTSANLSKQAWGDAQSKNNAGEPQVRICSYEIGVMVWPELFADSGGGEKRKAVMVPTFL 579
Query: 354 -------SEIKSGSTETSQIQKTKLVT-----LTWHGSSDAGASSEVVYLPVPYELPPQR 401
S K G++ + TK T G + S+ VV L +PY LP QR
Sbjct: 580 TDTPTGLSSSKDGTSLAGERGGTKSATRDGEDGGAGGDEEEDESTVVVGLRMPYNLPLQR 639
Query: 402 YSSEDVPWSWDKRYTKKDVYGQVWPRH 428
Y ++VPW + + D GQVW RH
Sbjct: 640 YGPQEVPWVATANHLEPDWMGQVW-RH 665
>gi|134056346|emb|CAK47581.1| unnamed protein product [Aspergillus niger]
Length = 559
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 124/420 (29%), Positives = 188/420 (44%), Gaps = 70/420 (16%)
Query: 49 LPISFGTHHSKAMLLIYPRGV----RIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNL 103
+P +FGTHHSK M+L+ + R+++HTAN+I DW N Q +W PL +
Sbjct: 165 MPEAFGTHHSKMMILLRHDDLAHEHRVVIHTANMIAGDWANMCQAVWRSPLLPLCSDGSG 224
Query: 104 SEECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIA 156
SE F+ DL+ YL +G K P + +K +FS+ LIA
Sbjct: 225 SENIATPGARFKRDLLSYLRE------------YGQRKTGPLVAQLEKHDFSAVRAALIA 272
Query: 157 SVPGYH-----TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWM 208
SVP T S+ K WG + LR VL+ + +V Q SS+ SL +KW+
Sbjct: 273 SVPSKQKIRESTDSNQKTLWGWLALRDVLRSVPVSPSEDRPHIVTQISSVASLGQTDKWL 332
Query: 209 AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKD 264
++ + S S + P IV+PT +++R SL GY +G +I S +
Sbjct: 333 KDVFFASLSPSSNNPKPRFS----IVFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQ 388
Query: 265 FLKKYWAKWKAS----------HTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKA 311
+++ Y W GR RA PHIKT+ RY+ ++ W ++TSANLS
Sbjct: 389 YMRPYLCHWAGDVAEDEVKMKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQ 448
Query: 312 AWGALQKNNSQLMIRSYELGVLILPS------AKRHGCGFSCTSNIVPSEIKSGSTETSQ 365
AWGA N ++ I S+E+GV++ P A+ C +P + + +
Sbjct: 449 AWGAAVNANGEVRICSWEIGVVVWPELIAGAGAEGRSVMMPCFRRDMPDADAVAAADANA 508
Query: 366 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
K + T V +PY+LP RY D+PW +++ D GQ W
Sbjct: 509 NADKKEIPTT-----------TTVGFRMPYDLPLTRYGETDIPWCATASHSEPDWLGQTW 557
>gi|7648685|gb|AAF65624.1|AF182003_1 tyrosyl-DNA phosphodiesterase protein [Homo sapiens]
Length = 189
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 96/210 (45%), Positives = 123/210 (58%), Gaps = 35/210 (16%)
Query: 221 EDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHT 278
E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +
Sbjct: 7 ESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETS 66
Query: 279 GRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
GRS AMPHIKT+ R + K+AWF +TSANLSKAAWGAL+KN +QLMIRSYELGVL LP
Sbjct: 67 GRSNAMPHIKTYMRPSPDFSKIAWFRVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLP 126
Query: 337 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 396
SA F S V + +GS E + PVPY+
Sbjct: 127 SA------FGLDSFKVKQKFFAGSQE------------------------PMATFPVPYD 156
Query: 397 LPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 425
LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 157 LPPELYGSKDRPWIWNIPYVKAPDTHGNMW 186
>gi|350634393|gb|EHA22755.1| hypothetical protein ASPNIDRAFT_174927 [Aspergillus niger ATCC
1015]
Length = 581
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 123/417 (29%), Positives = 188/417 (45%), Gaps = 67/417 (16%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P +FGTHHSK M+L+ + ++++HTAN+I DW N Q +W PL + SE
Sbjct: 190 MPEAFGTHHSKMMILLRHDDLAQLVIHTANMIAGDWANMCQAVWRSPLLPLCSDGSGSEN 249
Query: 107 CG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVP 159
F+ DL+ YL +G K P + +K +FS+ LIASVP
Sbjct: 250 IATPGARFKRDLLSYLRE------------YGQRKTGPLVAQLEKHDFSAVRAALIASVP 297
Query: 160 GYH-----TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL 211
T S+ K WG + LR VL+ + +V Q SS+ SL +KW+ ++
Sbjct: 298 SKQKIRESTDSNQKTLWGWLALRDVLRSVPVSPSEDRPHIVTQISSVASLGQTDKWLKDV 357
Query: 212 SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLK 267
+ S S + P IV+PT +++R SL GY +G +I S + +++
Sbjct: 358 FFASLSPSSNNPKPRFS----IVFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMR 413
Query: 268 KYWAKWKAS----------HTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWG 314
Y W GR RA PHIKT+ RY+ ++ W ++TSANLS AWG
Sbjct: 414 PYLCHWAGDVAEDEVKMKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWG 473
Query: 315 ALQKNNSQLMIRSYELGVLILPS------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQK 368
A N ++ I S+E+GV++ P A+ C +P + + +
Sbjct: 474 AAVNANGEVRICSWEIGVVVWPELIAGAGAEGRSVMMPCFRRDMPDADAVAAADANANAD 533
Query: 369 TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
K + T V +PY+LP RY D+PW +++ D GQ W
Sbjct: 534 KKEIPTT-----------TTVGFRMPYDLPLTRYGETDIPWCATASHSEPDWLGQTW 579
>gi|195177151|ref|XP_002028871.1| GL22360 [Drosophila persimilis]
gi|194104354|gb|EDW26397.1| GL22360 [Drosophila persimilis]
Length = 946
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 118/334 (35%), Positives = 175/334 (52%), Gaps = 38/334 (11%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP-ANWILHKPPLPISFGTH 56
MVDI WLL +L K +LV++G+ L + + KP I K P P F T
Sbjct: 189 MVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQVTAIGVKMPTP--FATS 244
Query: 57 HSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL-----KDQNNLSEE--CG 108
H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL +D + + E G
Sbjct: 245 HTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLLPALSEDADTAAGESLTG 302
Query: 109 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 168
F DL+ YL K + + + +K +FS+ V + SVPG H S++
Sbjct: 303 FRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFVGSVPGGHREGSVR 352
Query: 169 K--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPL 226
WGH +L ++L + + P+V Q SS+GSL A + + +D +P
Sbjct: 353 GHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLRKDSSPG 411
Query: 227 GIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRS 281
G + +++P+ +V S +G G +P + DK +LK + +WK+S RS
Sbjct: 412 GKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQPWLKAHLQQWKSSDRHRS 471
Query: 282 RAMPHIKTFARYN--GQKLAWFLLTSANLSKAAW 313
RAMPHIKT++RYN Q + WF+LTSANLSKAAW
Sbjct: 472 RAMPHIKTYSRYNLTDQSIYWFVLTSANLSKAAW 505
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 85/274 (31%), Positives = 133/274 (48%), Gaps = 35/274 (12%)
Query: 1 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP-ANWILHKPPLPISFGTH 56
MVDI WLL +L K +LV++G+ L + + KP I K P P F T
Sbjct: 668 MVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQVTAIGVKMPTP--FATS 723
Query: 57 HSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL-----KDQNNLSEE--CG 108
H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL +D + + E G
Sbjct: 724 HTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLLPALSEDADTAAGESLTG 781
Query: 109 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 168
F DL+ YL K + + + +K +FS+ V + SVPG H S++
Sbjct: 782 FRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFVGSVPGGHREGSVR 831
Query: 169 K--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPL 226
WGH +L ++L + + P+V Q SS+GSL A + + +D +P
Sbjct: 832 GHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLRKDSSPG 890
Query: 227 GIGEPL----IVWPTVEDVRCSLEGYAAGNAIPS 256
G + +++P+ +V S +G G +PS
Sbjct: 891 GKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPS 924
>gi|258577075|ref|XP_002542719.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902985|gb|EEP77386.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 669
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 133/453 (29%), Positives = 201/453 (44%), Gaps = 93/453 (20%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSE- 105
+P FGTHHSK M+LI + ++++HTAN+I DW N Q +W PL NN E
Sbjct: 231 MPEPFGTHHSKMMVLIRHDDCAQVVIHTANMIPGDWANMCQAVWKSPLLPLLSPNNDREP 290
Query: 106 ----ECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLI 155
E G F+ DL+ YL A+G K P K + F LI
Sbjct: 291 SITGEIGSGPRFKRDLLAYLE------------AYGRKKTGPLVEQLKNYGFDGIRAALI 338
Query: 156 ASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEK----GFKKSPLVYQFSSLGSL--D 204
ASVP SL WG L+ VL+ K K+S +V Q SS+ SL
Sbjct: 339 ASVPSRQRFPSLDSRKETIWGWPALQDVLRRIPIHKQQPLQSKRSRIVIQISSIASLGQS 398
Query: 205 EKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA----IPSPQK 259
+KW+ E +S+ + D P + I++PT +++R SL GY +G + I S +
Sbjct: 399 DKWLKETFFASLYPHSAADGAP----QLSIIFPTPDEIRRSLNGYGSGGSIHMKIQSSAQ 454
Query: 260 NVDKDFLKKYWAKWKAS-------------------------------HTGRSRAMPHIK 288
D+++ Y W GR RA PHIK
Sbjct: 455 QKQLDYMRPYLCHWAGDSENNQTPVSATDVLTHDSAIDRYPPKATPVREAGRRRAAPHIK 514
Query: 289 TFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP------SAK 339
T+ R++ + + W ++TSANLS AWGA ++ I S+E+GVL+ P S +
Sbjct: 515 TYIRFSDEDMRTIDWAMVTSANLSTQAWGAAINAKQEVRICSWEIGVLVWPDLFCNGSER 574
Query: 340 RHGCGF-------SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 392
R+ G S + ++P + S S++++ ++ + + + G S +V
Sbjct: 575 RNESGEENKDKAKSDYARMIPC-FRRDSPCLSEVERYEIEETSKKDADNTGVLSTLVGFR 633
Query: 393 VPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+PY+LP + YS DVPW + + D GQ W
Sbjct: 634 MPYDLPLKPYSPRDVPWCATASHKEPDWLGQTW 666
>gi|307211794|gb|EFN87775.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 441
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 123/420 (29%), Positives = 196/420 (46%), Gaps = 65/420 (15%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
++D++WL + + ++ +++GE E + N A +H +P FG HHSK
Sbjct: 66 ILDVEWLCLQYLLAGQSTNMTILYGERRDE-EELDDNITA---IHMK-MPFEFGCHHSKI 120
Query: 61 MLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEECGFENDLID 115
M+L Y G+R++V TANL DW N +QG+W+ ++N F+ DL
Sbjct: 121 MILQYKDNGIRVVVSTANLFFEDWQNSTQGIWISPHLPRLSKAAKHNGESLTNFKKDLQR 180
Query: 116 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 175
YLS+ + P K KK +FS+ V LIAS+PG H ++ WG+ KL
Sbjct: 181 YLSSYRNPA----------LKRWRKLVKKTDFSAINVCLIASIPG-HFEHTVDLWGYKKL 229
Query: 176 RTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP--L 232
VL Q T K ++ Q S++GS K+ + LS + + + P
Sbjct: 230 ANVLSQHVTLPPDALKWSIIAQSSAVGSFGPKYGSWLSKEIVWSMTRETERDLNNYPKFQ 289
Query: 233 IVWPTVEDVRCSLEGYAAGNAIPSPQKNV--DKDFLKKYWAKWKASHTGRSRAMPHIKTF 290
++P+V++ S + Y G + S + V + ++K Y +WKA+ T R +AMPHIK++
Sbjct: 290 FIYPSVKNYEQSFD-YQNGTSCFSYSREVHSKQQWIKSYLYQWKAARTERDQAMPHIKSY 348
Query: 291 ARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT 348
R + +++AWF+LTSANLSK AWG ++++ I +YE+G+ LP F T
Sbjct: 349 TRISSDLKRIAWFVLTSANLSKGAWGVQREDD--YYITNYEVGIAFLPKFITRITTFPIT 406
Query: 349 SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVP 408
+ + I P+PY+LP Y S D P
Sbjct: 407 DEDLTNSI----------------------------------FPIPYDLPLCPYDSSDSP 432
>gi|193659893|ref|XP_001947945.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like isoform 1
[Acyrthosiphon pisum]
Length = 684
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 129/434 (29%), Positives = 211/434 (48%), Gaps = 67/434 (15%)
Query: 1 MVDIDWLLPACPVL-AKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPL-PISFGTHHS 58
MV++ WL + + + +++ D ++ + + K + HK + +FG HS
Sbjct: 298 MVELGWLFAQYFITDQRGKKMTLLYERCDEDIDELHKKKKLLNVRHKKIINKNAFGHQHS 357
Query: 59 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSE---ECGFENDL 113
K + Y G +R++V +ANL DW +QG+W+ FPLK++++ S+ + F+ D+
Sbjct: 358 KVSMFAYADGSLRVVVMSANLCEDDWTKYAQGIWVSPKFPLKEEDDKSDGNSQTDFKIDI 417
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
+ YL++ + P + +K +FS A V I SVPG HT WGH+
Sbjct: 418 LRYLNSFREPSLVPWIQK----------IEKVDFSQANVFFIPSVPGKHTEPL---WGHL 464
Query: 174 KLRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLG 227
L+ +L++ C + P++ Q SSLGSL DE+W+ +E S+S+ D T
Sbjct: 465 YLKNILKKHACLPFCVPSEWPIIAQCSSLGSLGTTDEEWLKSEFVESLSASTYCDDTDTD 524
Query: 228 IGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRA 283
+P+ +++P+V++V S +G G +P + +K LKKY W+ R++A
Sbjct: 525 -NDPIPFHLIYPSVKNVLNSWDGALGGICLPYNKILHEKQLWLKKYMCLWQCHSRKRTKA 583
Query: 284 MPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQL-MIRSYELGVLILPSAKR 340
MPHIKT+ R + +++WFLL SANLSKAAWG K++ Q I ++E GVL LP
Sbjct: 584 MPHIKTYCRISPCLTEMSWFLLGSANLSKAAWGRKLKSDEQSNFIMAHEAGVLFLPQ--- 640
Query: 341 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 400
F S+ P D ++ Y +P++LP
Sbjct: 641 ----FLIGSDTFP--------------------------IDETEPNKFPYFSLPFDLPLA 670
Query: 401 RYSSEDVPWSWDKR 414
YS D PW+ R
Sbjct: 671 GYSDTDQPWTISTR 684
>gi|406865596|gb|EKD18637.1| tyrosyl-DNA phosphodiesterase [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 532
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 132/442 (29%), Positives = 194/442 (43%), Gaps = 72/442 (16%)
Query: 20 VLVIHG------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV-RII 72
V V+HG S L+ + P N LH +P FGTHHSK ++L+ +I+
Sbjct: 125 VHVVHGFWKSEDASRLNLQAQAKKYP-NITLHTAYMPEMFGTHHSKMLVLLRKYDTAQIV 183
Query: 73 VHTANLIHVDWNNKSQGLWM--------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPE 124
+HTAN+ DW+N +Q W+ + L+D + F+ D ++YL
Sbjct: 184 IHTANMQAFDWDNMTQAAWISPLLPQIREKELLEDTEPIGSGSRFKFDFLNYLRAYDTKR 243
Query: 125 FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTVLQEC 182
P G K NFS+ L+ASVPG + S K WG L+ L+
Sbjct: 244 VICK-PLVGKLM-------KHNFSAIRGALVASVPGKQSIKSDSKTLWGWAGLKKALEAV 295
Query: 183 TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVR 242
K+ +V Q SS+ +L EKW+ + + ++ + + IV+PT +++R
Sbjct: 296 PVRS--KEGEIVIQISSIATLSEKWIDK--TLFAAMSTSKSHGSSKSKFKIVFPTADEIR 351
Query: 243 CSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA------------SHTGRSRAMPH 286
SL GY +G+AI S + LK W S GR RA PH
Sbjct: 352 RSLNGYNSGSAIHTKIQSHAQARQLQLLKPMLCHWAGDSDEKGPSSAPVSDAGRKRAAPH 411
Query: 287 IKTFARY---NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 343
IKTF R+ + W L+TSANLSK AWG + I SYE+GVL+ P
Sbjct: 412 IKTFIRFPDATRSTIDWMLVTSANLSKQAWGEGTNAAGDVRICSYEIGVLVWPGL----- 466
Query: 344 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 403
F + +VP+ K+ + + S A +E+V +PY+LP Y
Sbjct: 467 -FGDNATMVPT-FKTDNPDASA----------------AKPGTELVGARMPYDLPLVPYG 508
Query: 404 SEDVPWSWDKRYTKKDVYGQVW 425
+D+PW Y + D GQVW
Sbjct: 509 KDDLPWCATSSYEEPDWKGQVW 530
>gi|401428160|ref|XP_003878563.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494811|emb|CBZ30114.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 682
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 155/595 (26%), Positives = 234/595 (39%), Gaps = 177/595 (29%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGT---------------------------LEH 33
+ D+ WLL P L+ + LV+ GT +
Sbjct: 65 VTDLRWLLATVPELSAVTGKLVVLSGEKGTATLRRTTGDPSSPYTATSPLMDRVNPFMAA 124
Query: 34 MKRNKPANWILH-----------KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVD 82
++ A LH +PPLP++FGTHH+K L + RG+RI + TANL+ D
Sbjct: 125 LREQARATSALHTTLSRERLAVLEPPLPVAFGTHHTKMALCVNSRGLRISIFTANLVEQD 184
Query: 83 WNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST------------LKWPEFSANL- 129
W KSQG+++QDFP K S + ++ ++ K EF A+L
Sbjct: 185 WCWKSQGIYLQDFPWKAATECSNDVAAGATVVKTAASSTSKGGNGSNTLTKGAEFVAHLR 244
Query: 130 ---------------------PAHGNFKI-NPSFFKKFNFSSAAVRLIASVPG---YHTG 164
A G I F +FS+AAV LI+SVPG Y
Sbjct: 245 NYLMQCGVSLTTACASPTDAVSAAGPLGIFETDFLSHIDFSAAAVWLISSVPGTCAYGEV 304
Query: 165 SSLKKWGHMKLRTVLQEC--TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SG 218
+ + G +L VL+ T L +Q+SS GSL+ ++ L ++M S
Sbjct: 305 APGYRVGLCRLAEVLRRSALTMATAPASVDLSWQYSSQGSLNLAFLNSLQAAMCGESVSV 364
Query: 219 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 278
TP G+ + +V+PT E+VR S EG+ G ++P + +F+ +W +S
Sbjct: 365 IESGDTPRGVRDVQVVYPTEEEVRNSWEGWRGGGSLPL-RVQCCHEFVNARLHRWGSSEE 423
Query: 279 G------------------------------------------------RSRAMPHIKTF 290
G R A+PHIK++
Sbjct: 424 GHTAKRAFPRPAKVAAAHASREDAVDVDGVDSDGGEGTTASLTCSCAAYRQFALPHIKSY 483
Query: 291 ARYNGQK--LAWFLLTSANLSKAAWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGC 343
A + + WFLLTSANLS+AAWG+L Q+ + Q ++RSYELGV+ + H
Sbjct: 484 AAVAPDRSCVRWFLLTSANLSQAAWGSLSRKMNQRGSRQQLVRSYELGVIYDSHSAIHPS 543
Query: 344 GFSCTSNIVPSEIKSGSTETSQ--IQKTKLVTLTWHGSSDAGASSEVVYLPVPYE-LPPQ 400
S S + ++I+ S S+ + +T L G ++ V L PY L P
Sbjct: 544 ASSWFSVVSKTKIELPSARNSRAMLYETPL-----------GVETQNVCLYTPYNLLCPT 592
Query: 401 RYSS-------------------------EDVPWSWDKRYTKKDVYGQVWPRHFQ 430
Y+S DVPW D + +D YG + F+
Sbjct: 593 PYASTAALRARRDAPVEGEQAVAGSTLDCSDVPWVLDMPHRGRDAYGLDFEEAFE 647
>gi|332029127|gb|EGI69138.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 542
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 127/424 (29%), Positives = 194/424 (45%), Gaps = 72/424 (16%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPI------SFGT 55
VD+ WL L+ +D T+ + R P + L K I F +
Sbjct: 159 VDVGWLYL---------QYLLAGQRTDMTILYKYRVCPCHEELSKNITIIHVDGQHEFSS 209
Query: 56 HHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSEE-CGFE 110
HH+ M+L Y G+R++V TA L DW N++QGLW+ P + + E GF+
Sbjct: 210 HHANIMILQYSNGIRVVVSTAALYSDDWKNRTQGLWISPHLPYLPESAKPSDGESPTGFK 269
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 170
DL YLS + P + + A + +FS V L+ASVPG H G W
Sbjct: 270 KDLERYLSKYEQPALTQWIRA----------VQMADFSDVNVFLVASVPGIHKGYEDDFW 319
Query: 171 GHMKLRTVLQECTFEKGFKKSPLVYQFSSLG--SLDEKWMAELSSSMSSGFSEDKTPLGI 228
G+ KL VL ++ P+V Q S +G L E W+ ++ MS S+D
Sbjct: 320 GYRKLAHVLSCYVTLPRNEQWPIVAQSSGVGCFGLFENWLEDIIWCMSKETSKDSNNYPH 379
Query: 229 GEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKKYWAKWKASHTGRSRAMPHI 287
+ ++P++ + + S + + +N + +L+ Y +WKA TGR RAMP+I
Sbjct: 380 FQ--FIYPSIANYKQSFDFRVLSTPLSYNTENHFKQQWLESYLYQWKAKRTGRDRAMPNI 437
Query: 288 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
K++ R + +K+ WFLLTSANLSKAAWG+ ++ + I +YE GVL +P
Sbjct: 438 KSYTRISPDLKKIPWFLLTSANLSKAAWGSNKQYD--YSIGNYEAGVLFIP--------- 486
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
+ +G+T T G D G V P+PY+LP +Y +
Sbjct: 487 ---------KFITGTT-----------TFPIGGEEDTG----VPMFPIPYDLPLSQYEFD 522
Query: 406 DVPW 409
D P+
Sbjct: 523 DSPF 526
>gi|320590454|gb|EFX02897.1| tyrosyl-DNA phosphodiesterase [Grosmannia clavigera kw1407]
Length = 553
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 126/433 (29%), Positives = 189/433 (43%), Gaps = 76/433 (17%)
Query: 40 ANWILHKPPLPISFGTHHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW------- 91
AN LH +P FGTHHSK A+L + +++++TAN+I DW N +QG+W
Sbjct: 148 ANVQLHTAFMPEPFGTHHSKMAVLFRHDDTAQVVIYTANMIPHDWANMTQGVWRSPLLPL 207
Query: 92 -MQDFPLKDQNNLSEECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFN 146
D +D++ + G F+ DL+ YL S P +++
Sbjct: 208 LADDVDGEDESEIDGPVGSGRRFKTDLLSYLRAYN-QRRSICRPLVERLA-------RYD 259
Query: 147 FSSAAVRLIASVPGYHT------GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 200
F++ LIASVPG H+ +WG L+ L+ + + +V Q SS+
Sbjct: 260 FAAVQAALIASVPGRHSLIRQPDEKYHTQWGWTALKNTLRSVPVQAVAPSTEIVLQVSSM 319
Query: 201 GSLD--EKW--------MAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 250
+L + W MA SS++ G S K L V+PT +++R SLEGY +
Sbjct: 320 ATLGPTDAWIRHTLFSAMATASSAVDKGGSIGKEELQQPRFRAVFPTADEIRRSLEGYKS 379
Query: 251 GNAIPSP----QKNVDKDFLKKYWAKWKASH--------------TGRSRAMPHIKTFAR 292
G +I + Q+ +++ W GR RA PHIKT+ R
Sbjct: 380 GTSIHTKIQSSQQQRQLQYMRPLLCHWANDSPDGAKLPDGATPIVNGRKRAAPHIKTYVR 439
Query: 293 YNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIV 352
Y + W LLTSANLSK AWG ++ + S+E+GV++ P G + ++
Sbjct: 440 YGQVGVDWALLTSANLSKQAWGEAVTAAGEVRVASWEIGVMVWP-------GLFAETAVM 492
Query: 353 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWD 412
+I GS Q K A VV L VPY+LP Q+Y ++PW
Sbjct: 493 --QIVGGSDSVLQPATGK------------AAGRPVVALRVPYDLPLQQYGKGEIPWVCT 538
Query: 413 KRYTKKDVYGQVW 425
+ D GQ W
Sbjct: 539 LPDEEPDWTGQAW 551
>gi|119178141|ref|XP_001240773.1| hypothetical protein CIMG_07936 [Coccidioides immitis RS]
Length = 531
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 140/509 (27%), Positives = 220/509 (43%), Gaps = 106/509 (20%)
Query: 1 MVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
+ DID+L+ P + + + VIHG +S + E R + I+ P P
Sbjct: 42 LFDIDFLMKQFDPDVKNLIKIRVIHGSWKKDSPNRIYIDEACARYQNVEPIIAYMPEP-- 99
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWM----------QDFPLKDQN 101
FGTHHSK M+LI + +II+HTAN+I DW N QG+W +D+
Sbjct: 100 FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMCQGVWRSPLLPLLPLDRDYDQSISG 159
Query: 102 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVP 159
+ F+ D++ YL A+G K P KK++F LIASVP
Sbjct: 160 IIGSGRRFKRDILAYLD------------AYGRKKTGPLVEQLKKYDFDEVRAALIASVP 207
Query: 160 GYHTGSSLKK-----WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSL--DEKWM 208
+L WG ++ VL++ K KK +V Q SS+ SL +KW+
Sbjct: 208 SRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSCEPKKPRIVIQISSIASLGQTDKWL 267
Query: 209 AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKD 264
+ + F+ P I++PT +++R SL GY +G +I S + D
Sbjct: 268 KD------TFFNALCPPSAAARFSIIFPTPDEIRRSLNGYRSGGSIHMKLQSAAQQKQFD 321
Query: 265 FLKKYWAKWKAS------------------------------HTGRSRAMPHIKTFARYN 294
+++ Y W GR RA PHIKT+ R++
Sbjct: 322 YMRPYLCHWAGDCENNQNADIEKSVSSTVTLDTSTPNNTFVREAGRRRAAPHIKTYIRFS 381
Query: 295 G----QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP--------SAKRHG 342
+ + W ++TSANLS AWGA N ++ + S+E+GVL+ P +A R
Sbjct: 382 DAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVCSWEIGVLVWPDLFLNDPQTADRDD 441
Query: 343 CGFSCT------SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 396
S + ++P + + S++++ +L + G + A +V +PY
Sbjct: 442 KMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELEESS-RGDDKSKAWRTLVGFRMPYN 499
Query: 397 LPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
LP + YSS D+PW +T+ D GQ W
Sbjct: 500 LPLKPYSSRDIPWCATATHTEPDWLGQTW 528
>gi|302662485|ref|XP_003022896.1| hypothetical protein TRV_02978 [Trichophyton verrucosum HKI 0517]
gi|291186867|gb|EFE42278.1| hypothetical protein TRV_02978 [Trichophyton verrucosum HKI 0517]
Length = 587
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/431 (28%), Positives = 194/431 (45%), Gaps = 69/431 (16%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHHSK M+LI + ++I+HTAN+I DW N +Q +W Q + + C
Sbjct: 168 MPEPFGTHHSKMMILIRHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLAQPQVGDTC 227
Query: 108 G-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASV 158
G F+ DL+ YL A+ N IN ++++F + LIASV
Sbjct: 228 GVFGSSTRFKRDLLAYLE------------AYNNKTINTLIRQLQRYDFGAVKAMLIASV 275
Query: 159 PGY-----HTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWM 208
P + WG L+ + ++ ++ ++ Q SS+ +L +KW+
Sbjct: 276 PTRLPVKEFDSNKRTLWGWPALKDAISSIPIDRSSSQAQNPHIIVQVSSIATLGQTDKWL 335
Query: 209 AE--LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 262
E LSS I++PT +++R SL+GY +G +I SP +
Sbjct: 336 KETFLSSLCPQPEVNQSRSTSNARFSIIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQ 395
Query: 263 KDFLKKYWAKW-----------------KASHTGRSRAMPHIKTFARYNGQKL---AWFL 302
+L++Y W + GR RA PHIKT+ R++ + W +
Sbjct: 396 LAYLRRYLCHWAGDAEDPKNSDPATKSDRVREAGRRRAAPHIKTYIRFSDSDMNSIDWAM 455
Query: 303 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK----- 357
+TSANLS AWGA + ++ I S+E+GVL+ P R C+ + + + +K
Sbjct: 456 ITSANLSTQAWGAGANTHGEVRICSWEIGVLMWPDLFREKNIEECSDSSLTNYVKMIPCF 515
Query: 358 ---SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 414
S + Q + +T H SDA + V L +PY+LP Y+ ++VPW
Sbjct: 516 KRNVPSEKPPQTSENDSTKVTLH--SDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATAV 572
Query: 415 YTKKDVYGQVW 425
+ + D GQ W
Sbjct: 573 HREPDWMGQTW 583
>gi|327299128|ref|XP_003234257.1| tyrosyl-DNA phosphodiesterase [Trichophyton rubrum CBS 118892]
gi|326463151|gb|EGD88604.1| tyrosyl-DNA phosphodiesterase [Trichophyton rubrum CBS 118892]
Length = 586
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 119/431 (27%), Positives = 197/431 (45%), Gaps = 69/431 (16%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHHSK M+LI + ++I+HTAN+I DW N +Q +W Q+ + + C
Sbjct: 167 MPEPFGTHHSKMMILIRHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLSQSQVGDAC 226
Query: 108 G-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASV 158
G F+ DL+ YL A+ N IN ++++F + LIASV
Sbjct: 227 GVFGSSARFKRDLLAYLE------------AYNNNTINTLIRQLQQYDFGAVKAVLIASV 274
Query: 159 PGY-----HTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWM 208
P + WG L+ + ++ ++ ++ Q SS+ +L +KW+
Sbjct: 275 PTRLPVKEFDSNRRTLWGWPALKDAIGSIPIDRSSSQAQNPHIIIQVSSIATLGQTDKWL 334
Query: 209 AE--LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 262
E SS S + I++PT +++R SL+GY +G +I SP +
Sbjct: 335 KETFFSSLYSQPEVNQSRSTSKAKFSIIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQ 394
Query: 263 KDFLKKYWAKW-----------------KASHTGRSRAMPHIKTFARYNGQKLA---WFL 302
+L++Y W + GR RA PHIK++ R++ + W +
Sbjct: 395 LAYLRRYLCHWAGDAEGPKNADPTTTSDRVREAGRRRAAPHIKSYIRFSDSDMDSIDWAM 454
Query: 303 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK----- 357
+TSANLS AWGA + ++ I S+E+G+LI P R C+ + + + +K
Sbjct: 455 ITSANLSTQAWGAGANTHGEVRICSWEIGILIWPDLFREENIEECSDSSLTNHVKMIPCF 514
Query: 358 ---SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 414
+ S + Q + + +T H DA + V L +PY+LP Y+ ++VPW
Sbjct: 515 KRNTPSEKPLQTSENDSIKVTLH--LDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATSV 571
Query: 415 YTKKDVYGQVW 425
+ + D GQ W
Sbjct: 572 HREPDWMGQTW 582
>gi|392867268|gb|EAS29510.2| tyrosyl-DNA phosphodiesterase [Coccidioides immitis RS]
Length = 616
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 140/509 (27%), Positives = 220/509 (43%), Gaps = 106/509 (20%)
Query: 1 MVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
+ DID+L+ P + + + VIHG +S + E R + I+ P P
Sbjct: 127 LFDIDFLMKQFDPDVKNLIKIRVIHGSWKKDSPNRIYIDEACARYQNVEPIIAYMPEP-- 184
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWM----------QDFPLKDQN 101
FGTHHSK M+LI + +II+HTAN+I DW N QG+W +D+
Sbjct: 185 FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMCQGVWRSPLLPLLPLDRDYDQSISG 244
Query: 102 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVP 159
+ F+ D++ YL A+G K P KK++F LIASVP
Sbjct: 245 IIGSGRRFKRDILAYLD------------AYGRKKTGPLVEQLKKYDFDEVRAALIASVP 292
Query: 160 GYHTGSSLKK-----WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSL--DEKWM 208
+L WG ++ VL++ K KK +V Q SS+ SL +KW+
Sbjct: 293 SRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSCEPKKPRIVIQISSIASLGQTDKWL 352
Query: 209 AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKD 264
+ + F+ P I++PT +++R SL GY +G +I S + D
Sbjct: 353 KD------TFFNALCPPSAAARFSIIFPTPDEIRRSLNGYRSGGSIHMKLQSAAQQKQFD 406
Query: 265 FLKKYWAKWKAS------------------------------HTGRSRAMPHIKTFARYN 294
+++ Y W GR RA PHIKT+ R++
Sbjct: 407 YMRPYLCHWAGDCENNQNADIEKSVSSTVTLDTSTPNNTFVREAGRRRAAPHIKTYIRFS 466
Query: 295 G----QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP--------SAKRHG 342
+ + W ++TSANLS AWGA N ++ + S+E+GVL+ P +A R
Sbjct: 467 DAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVCSWEIGVLVWPDLFLNDPQTADRDD 526
Query: 343 CGFSCT------SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 396
S + ++P + + S++++ +L + G + A +V +PY
Sbjct: 527 KMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELEESS-RGDDKSKAWRTLVGFRMPYN 584
Query: 397 LPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
LP + YSS D+PW +T+ D GQ W
Sbjct: 585 LPLKPYSSRDIPWCATATHTEPDWLGQTW 613
>gi|326476634|gb|EGE00644.1| tyrosyl-DNA phosphodiesterase [Trichophyton tonsurans CBS 112818]
gi|326478089|gb|EGE02099.1| tyrosyl-DNA phosphodiesterase [Trichophyton equinum CBS 127.97]
Length = 588
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 125/432 (28%), Positives = 198/432 (45%), Gaps = 71/432 (16%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHHSK M+LI + +II+HTAN+I DW N +Q +W Q + + C
Sbjct: 169 MPEPFGTHHSKMMILIRHDNLAQIIIHTANMIPRDWGNMTQAVWRSPLLPLSQAQVCDTC 228
Query: 108 G-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASV 158
G F+ DL+ YL A+ N IN ++++F S LIASV
Sbjct: 229 GGFGSSARFKRDLLAYLE------------AYHNKTINTLIRQLQRYDFGSVKAVLIASV 276
Query: 159 PGY-----HTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWM 208
P + WG L+ + ++ ++ ++ Q SS+ +L ++W+
Sbjct: 277 PTRLPVKEFDSNRRTLWGWPALKDAIGSIPIDRSSSRAQNPHIIVQVSSIATLGQTDRWL 336
Query: 209 AE--LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI------PSPQKN 260
E LSS + I++PT +++R SL+G+ +G +I PS QK
Sbjct: 337 KETFLSSLYPQPEVNQNRSTSNVKFSIIFPTPDEIRRSLDGHGSGGSIHMKIQSPSQQKQ 396
Query: 261 VDKDFLKKYWAKW-----------------KASHTGRSRAMPHIKTFARYNG---QKLAW 300
+ +L++Y W + GR RA PHIKT+ R++ + W
Sbjct: 397 LA--YLRRYLCHWAGDAEGRKNSDPTTKSDRVREAGRRRAAPHIKTYIRFSDSDMDNIDW 454
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKR----HGCGFSCTSN---IVP 353
++TSANLS AWGA + ++ I S+E+GVLI P R GC S +N ++P
Sbjct: 455 AMITSANLSTQAWGAGANTHGEVRICSWEIGVLIWPDLFREEHIEGCSDSSLTNHVKMIP 514
Query: 354 SEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDK 413
K + +Q ++ + SDA + V L +PY+LP Y+ ++VPW
Sbjct: 515 C-FKRNTPSEKPLQSSENDSTKVALHSDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATA 572
Query: 414 RYTKKDVYGQVW 425
+ + D GQ W
Sbjct: 573 VHREPDWMGQTW 584
>gi|320034009|gb|EFW15955.1| tyrosyl-DNA phosphodiesterase [Coccidioides posadasii str.
Silveira]
Length = 559
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 138/509 (27%), Positives = 219/509 (43%), Gaps = 106/509 (20%)
Query: 1 MVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
+ DID+L+ P + + + V+HG +S + E R + I+ P P
Sbjct: 70 LFDIDFLMKQFDPDVKNLIRIRVVHGSWKKDSANRIYIDEACARYQNVEPIIAYMPEP-- 127
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWM----------QDFPLKDQN 101
FGTHHSK M+LI + +II+HTAN+I DW N QG+W +D+
Sbjct: 128 FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMCQGVWRSPLLPLLPLDRDYDQSISG 187
Query: 102 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVP 159
+ F+ D++ YL A+G K P KK++F LIASVP
Sbjct: 188 IIGSGRRFKRDILAYLD------------AYGRKKTGPLVEQLKKYDFDEVRAALIASVP 235
Query: 160 GYHTGSSLKK-----WGHMKLRTVLQECTFEKGFKKSP----LVYQFSSLGSL--DEKWM 208
+L WG ++ VL++ K P +V Q SS+ SL +KW+
Sbjct: 236 SRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSCEPEKPRIVIQISSIASLGQTDKWL 295
Query: 209 AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKD 264
+ + F+ P I++PT +++R SL GY +G +I S + D
Sbjct: 296 KD------TFFNALCPPSAAARFSIIFPTPDEIRRSLNGYRSGGSIHMKLQSAAQQKQFD 349
Query: 265 FLKKYWAKWKAS------------------------------HTGRSRAMPHIKTFARYN 294
+++ Y W GR RA PHIKT+ R++
Sbjct: 350 YMRPYLCHWAGDCENNQNADIEKSVSSTVTLDESTPNNTFVREAGRRRAAPHIKTYIRFS 409
Query: 295 G----QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP--------SAKRHG 342
+ + W ++TSANLS AWGA N ++ + S+E+GVL+ P +A R
Sbjct: 410 DAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVCSWEIGVLVWPDLFLNDPQTADRDD 469
Query: 343 CGFSCT------SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 396
S + ++P + + S++++ +L + G + A +V +PY
Sbjct: 470 KMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELEEPS-RGDDKSKAWKTLVGFRMPYN 527
Query: 397 LPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
LP + YSS D+PW +T+ D GQ W
Sbjct: 528 LPLKPYSSRDIPWCATATHTEPDWLGQTW 556
>gi|302894143|ref|XP_003045952.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256726879|gb|EEU40239.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1086
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 181/384 (47%), Gaps = 70/384 (18%)
Query: 3 DIDWLLPAC-PVLAKIPHVLVIHG------ESDGTLEHMKRNKPANWILHKPPLPISFGT 55
DI +L+ A P + V V+HG ES +E + N +H P+P FGT
Sbjct: 81 DIHFLMDAFDPDTRHLVKVHVVHGFWKREDESRIAIEQAA-AEFNNVQIHIAPMPEMFGT 139
Query: 56 HHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWM------------------QDFP 96
HHSK M+L + ++I+HTAN+I DW N + G+W +D P
Sbjct: 140 HHSKMMILFRHDDTAQVIIHTANMISKDWTNMTNGIWKSPLLPKMTVAPTHTTSSPEDHP 199
Query: 97 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 156
+ + F+ DL++YL + + K ++FSS L+A
Sbjct: 200 VGSGDR------FKIDLLNYLRAYDRRKITC--------KALTDELVHYDFSSIKAALVA 245
Query: 157 SVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELS 212
SVPG H L + WG L+ LQ+ E ++S +V Q SS+ +L E W L
Sbjct: 246 SVPGRHNIRDLSETSWGWAALKRCLQQVPCEDQ-EQSEIVVQISSIATLGAKEDW---LK 301
Query: 213 SSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFL 266
++ S K P +G+P +V+PT +++R SL+GYA+G +I S Q+ ++L
Sbjct: 302 KTLFEPLSRCKNP-SLGKPKFKVVFPTADEIRRSLDGYASGGSIHTKIQSAQQAKQLEYL 360
Query: 267 KKYWAKWK--------------ASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAA 312
+ + W GR RA PHIKT+ R N + W LLTSANLSK A
Sbjct: 361 RPIFHHWANDSPSGAKLPEGATVKDGGRKRAAPHIKTYIRSNKSSIDWALLTSANLSKQA 420
Query: 313 WGALQKNNSQLMIRSYELGVLILP 336
WG + ++ I S+E+GVL+ P
Sbjct: 421 WGEAARPTGEMRIASWEIGVLVWP 444
>gi|156549662|ref|XP_001604678.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Nasonia
vitripennis]
Length = 573
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/361 (30%), Positives = 180/361 (49%), Gaps = 51/361 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI----LHKPPLPISF--- 53
M ++ WL+ + ++P + V++G +W+ +++ P I F
Sbjct: 130 MAEMLWLINEYMLAVQVPKMTVLYG---------------SWLDPDMMYEIPFDIEFVNV 174
Query: 54 -----GTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDF-PL--KDQNNLS 104
G HHSK + Y +RI++ ++N+ DW +++QGLW+ F PL +D N
Sbjct: 175 EMSEFGCHHSKISIFKYTGDKIRIMISSSNIYAEDWQSRTQGLWISPFLPLLPEDANESD 234
Query: 105 EE--CGFENDLIDYLSTLKWPE-FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 161
E F+ D + YLS PE F + H + + S+ V IASVPG+
Sbjct: 235 GESPTNFKRDFLQYLSMYNQPEVFGWSALIH-----------RADCSAINVFFIASVPGH 283
Query: 162 HTGSSLKKWGHMKLRTVLQ-ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFS 220
H GSSL WGH KL +L + +K P++ Q SS+G + + LSSS+ S
Sbjct: 284 HDGSSLDTWGHRKLAALLSAHASLPSDAQKWPVIAQSSSVGVFGPDYQSWLSSSIVRTMS 343
Query: 221 E--DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKKYWAKWKASH 277
+ DK + E ++P+ + S + + + ++N + + +LK Y +WK+
Sbjct: 344 KEKDKKIIIFPEFKFIYPSKNNYNQSYDNQIGSSCLMYNEQNYLKQQWLKDYLYQWKSDK 403
Query: 278 TGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 335
GR++AMPH+K + R + ++AWF LTSANLSK A G + +N + + +YE GVL L
Sbjct: 404 IGRTQAMPHLKCYTRISPDESEMAWFFLTSANLSKGAMGKMLRNCTVQTLCNYEAGVLFL 463
Query: 336 P 336
P
Sbjct: 464 P 464
>gi|303310201|ref|XP_003065113.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240104773|gb|EER22968.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 616
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 137/512 (26%), Positives = 218/512 (42%), Gaps = 112/512 (21%)
Query: 1 MVDIDWLLPAC-PVLAKIPHVLVIHGE----------SDGTLEHMKRNKPANWILHKPPL 49
+ DID+L+ P + + + V+HG D H + +P I+ P
Sbjct: 127 LFDIDFLMKQFDPDVKNLIRIRVVHGSWKKDSANRIYIDEACAHYQNVEP---IIAYMPE 183
Query: 50 PISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWM----------QDFPLK 98
P FGTHHSK M+LI + +II+HTAN+I DW N QG+W +D+
Sbjct: 184 P--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMCQGVWRSPLLPLLPLDRDYDQS 241
Query: 99 DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIA 156
+ F+ D++ YL A+G K P KK++F LIA
Sbjct: 242 ISGIIGSGRRFKRDILAYLD------------AYGRKKTGPLVEQLKKYDFDEVRAALIA 289
Query: 157 SVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFKKSP----LVYQFSSLGSL--DE 205
SVP +L WG ++ VL++ K P +V Q SS+ SL +
Sbjct: 290 SVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSCEPEKPRIVIQISSIASLGQTD 349
Query: 206 KWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNV 261
KW+ + + F+ P +++PT +++R SL GY +G +I S +
Sbjct: 350 KWLKD------TFFNALCPPSAAARFSVIFPTPDEIRRSLNGYRSGGSIHMKLQSAAQQK 403
Query: 262 DKDFLKKYWAKWKAS------------------------------HTGRSRAMPHIKTFA 291
D+++ Y W GR RA PHIKT+
Sbjct: 404 QFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDESTPNNTFVREAGRRRAAPHIKTYI 463
Query: 292 RYNGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP--------SAK 339
R++ + + W ++TSANLS AWGA N ++ + S+E+GVL+ P +A
Sbjct: 464 RFSDAEDMCTIDWAMVTSANLSTQAWGAAINANQEVRVCSWEIGVLVWPDLFLNDPQTAD 523
Query: 340 RHGCGFSCT------SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 393
R S + ++P + + S++++ +L + G + A +V +
Sbjct: 524 RDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELEEPS-RGDDKSKAWKTLVGFRM 581
Query: 394 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
PY LP + YSS D+PW +T+ D GQ W
Sbjct: 582 PYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 613
>gi|367033183|ref|XP_003665874.1| hypothetical protein MYCTH_2310031 [Myceliophthora thermophila ATCC
42464]
gi|347013146|gb|AEO60629.1| hypothetical protein MYCTH_2310031 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 140/501 (27%), Positives = 216/501 (43%), Gaps = 115/501 (22%)
Query: 3 DIDWLLPAC-PVLAKIPHVLVIHG-----ESDGTLEHMKRNKPANWILHKPPLPISFGTH 56
DID+L+ A P + + V V+HG + +G ++ N LH +P +GTH
Sbjct: 112 DIDFLMAAFDPDVRHLVKVHVVHGFWKREDPNGLELQEAASRFQNVTLHSAFMPEMYGTH 171
Query: 57 HSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLS---EECG--- 108
HSK M+L+ +I++HTAN+I DW N +Q +W+ PL + + EE
Sbjct: 172 HSKMMILLRRDDTAQIVIHTANMIIRDWTNMTQAVWLSPRLPLMEPSRCDARPEEVAAGS 231
Query: 109 ---FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--T 163
F+ D ++YL + + K++FS+ LIASVPG H
Sbjct: 232 GAKFKIDFLNYLRAYDTRRTTC--------RPIIDQLSKYDFSAIRGSLIASVPGRHKLD 283
Query: 164 GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSE 221
+S +WG + L+ ++S + Q SS+ +L + W L S+ S
Sbjct: 284 DTSPTRWGWAAMEQALKSVPVSS--RRSDIAIQISSIATLGPTDTW---LKSTFFRSLSG 338
Query: 222 DKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKK---YWA 271
+ + +P +++PT +++R SL+GY++G +I SPQ+ +L+ +WA
Sbjct: 339 GRPGGTLQQPPNFQVIFPTPDEIRKSLDGYSSGASIHTKVQSPQQVKQLAYLRPMLYHWA 398
Query: 272 KWKAS----------------------------------HTGRSRAMPHIKTFARY---N 294
A+ GR RA PHIKT+ RY +
Sbjct: 399 NDSANGADPQEGGGGGERRREDYENDGGDDEGDSAVVVKEAGRKRAAPHIKTYIRYGDKS 458
Query: 295 GQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLILPSAKRHGC---GFSC 347
G + W L+TSANLSK AWG + + I SYE+GVL+ P G G
Sbjct: 459 GPSIDWALVTSANLSKQAWGEAAVRGADGGATMRIASYEIGVLVWPGLYGEGAIMRGTFL 518
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
T ++ E+K G+T V L +PY LP Q Y +V
Sbjct: 519 TDSLGTEEVKEGTT--------------------------AVALRMPYNLPLQPYGKGEV 552
Query: 408 PWSWDKRYTKKDVYGQVWPRH 428
PW Y++ D GQ+W RH
Sbjct: 553 PWVATANYSEPDWKGQIW-RH 572
>gi|398021965|ref|XP_003864145.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania donovani]
gi|322502379|emb|CBZ37463.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania donovani]
Length = 682
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 136/504 (26%), Positives = 212/504 (42%), Gaps = 139/504 (27%)
Query: 46 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 105
+PPLP++FGTHH+K L + RG+R+ + TANL+ DW KSQG+++QDFP K S
Sbjct: 148 EPPLPVAFGTHHTKMALCVNGRGLRVSIFTANLVEQDWCRKSQGIYVQDFPWKTATVRSN 207
Query: 106 ECGFENDLIDYLST------------LKWPEFSANL-----------------PAHGNFK 136
+ + +++ ++ K EF A+L P
Sbjct: 208 DDSADATMVETATSSTSNSNNGSNTFTKGAEFVAHLRHYLMQCGVSLAAACASPTDAASA 267
Query: 137 INP------SFFKKFNFSSAAVRLIASVPGYHTGSSL---KKWGHMKLRTVLQEC--TFE 185
P F +FS+AAV L++SVPG + + + G +L VL+ T
Sbjct: 268 AGPLGIFETDFLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRVGLCRLAEVLRRSALTMA 327
Query: 186 KGFKKSPLVYQFSSLGSLDEKWMAELSSSM----SSGFSEDKTPLGIGEPLIVWPTVEDV 241
L +Q+SS GSL+ ++ L ++M ++ P G+ + +V+PT E+V
Sbjct: 328 TSPASVDLSWQYSSQGSLNPAFLNSLQAAMCGESAAVIESGDAPRGVRDVQVVYPTEEEV 387
Query: 242 RCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG---------------------- 279
R S EG+ G ++P + +F+ +W +S G
Sbjct: 388 RNSWEGWRGGMSLPL-RVQCCHEFVNARLHRWGSSEEGHTAKRAFPRPPKVAAAHASRED 446
Query: 280 --------------------------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKA 311
R A+PHIK++A + + WFLLTSANLS+A
Sbjct: 447 AVDVDGVDIDGGEETTASLAGSCAADRQFALPHIKSYAAVAPDRSCVRWFLLTSANLSQA 506
Query: 312 AWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ- 365
AWG+L Q+ + Q ++RSYELGVL + + S S + S+I+ + S+
Sbjct: 507 AWGSLSRKVNQRGSRQQLVRSYELGVLYDSHSAIYPSASSWFSVVAESKIELPNARNSRA 566
Query: 366 -IQKTKLVTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS------------------- 404
+ +T L G ++ V L +PY L P Y+S
Sbjct: 567 MLYETPL-----------GVDTQDVCLYIPYNLLCPTPYASTAALRAHRHAPDEGEQAVE 615
Query: 405 ------EDVPWSWDKRYTKKDVYG 422
DVPW D + KD YG
Sbjct: 616 EAALDFSDVPWVLDMPHRGKDAYG 639
>gi|307105869|gb|EFN54116.1| hypothetical protein CHLNCDRAFT_13268, partial [Chlorella
variabilis]
Length = 150
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 101/179 (56%), Gaps = 40/179 (22%)
Query: 233 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR 292
+VW TV +V+ S+EG+ AG +IP P KNVD+ FL+ Y+ +W GR RAMPHIK++ R
Sbjct: 10 LVWTTVAEVQNSIEGWMAGRSIPGPAKNVDRPFLQAYYRRWGGEACGRQRAMPHIKSYLR 69
Query: 293 YNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIV 352
Y G +AW + S NLSKAAWG LQK SQLM+RSYELGVL++PS +
Sbjct: 70 YRGDDVAWLYVGSHNLSKAAWGQLQKQGSQLMVRSYELGVLLVPSLE------------- 116
Query: 353 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE--VVYLPVPYELPPQRYSSEDVPW 409
G+ A A + V LP+PY LPPQRY++ D PW
Sbjct: 117 -------------------------GAYQAAARGQELRVPLPIPYTLPPQRYAAGDQPW 150
>gi|146098236|ref|XP_001468366.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania infantum JPCM5]
gi|134072733|emb|CAM71450.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania infantum JPCM5]
Length = 682
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 136/504 (26%), Positives = 211/504 (41%), Gaps = 139/504 (27%)
Query: 46 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 105
+PPLP++FGTHH+K L + RG+R+ + TANL+ DW KSQG+++QDFP K S
Sbjct: 148 EPPLPVAFGTHHTKMALCVNGRGLRVSIFTANLVEQDWCRKSQGIYVQDFPWKTATVRSN 207
Query: 106 ECGFENDLIDYLST------------LKWPEFSANL-----------------PAHGNFK 136
+ + +++ ++ K EF A+L P
Sbjct: 208 DDSADATMVETATSSTSNSNNGSNTFTKGAEFVAHLRHYLMQCGVSLAAACASPTDAASA 267
Query: 137 INP------SFFKKFNFSSAAVRLIASVPGYHTGSSL---KKWGHMKLRTVLQEC--TFE 185
P F +FS+AAV L++SVPG + + + G +L VL+ T
Sbjct: 268 AGPLGIFETDFLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRVGLCRLAEVLRRSALTMA 327
Query: 186 KGFKKSPLVYQFSSLGSLDEKWMAELSSSM----SSGFSEDKTPLGIGEPLIVWPTVEDV 241
L +Q+SS GSL+ ++ L ++M ++ P G+ + +V+PT E+V
Sbjct: 328 TSPASVDLSWQYSSQGSLNPAFLNSLQAAMCGESAAVIESGDAPRGVRDVQVVYPTEEEV 387
Query: 242 RCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG---------------------- 279
R S EG+ G ++P + +F+ +W +S G
Sbjct: 388 RNSWEGWRGGMSLPL-RVQCCHEFVNARLHRWGSSEEGHTAKRAFPRPPKVAAAHASRED 446
Query: 280 --------------------------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKA 311
R A+PHIK++A + + WFLLTSANLS+A
Sbjct: 447 AVDVDGVDIDGGEETTPSLAGSCAADRQFALPHIKSYAAVAPDRSCVRWFLLTSANLSQA 506
Query: 312 AWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ- 365
AWG+L Q+ + Q ++RSYELGVL + + S S + S I+ + S+
Sbjct: 507 AWGSLSRKVNQRGSRQQLVRSYELGVLYDSHSAIYPSASSWFSVVAESRIELPNARNSRA 566
Query: 366 -IQKTKLVTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS------------------- 404
+ +T L G ++ V L +PY L P Y+S
Sbjct: 567 MLYETPL-----------GVDTQDVCLYIPYNLLCPTPYASTAALRAHRHAPDEGEQAVE 615
Query: 405 ------EDVPWSWDKRYTKKDVYG 422
DVPW D + KD YG
Sbjct: 616 EAALDCSDVPWVLDMPHRGKDAYG 639
>gi|296817701|ref|XP_002849187.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma otae CBS 113480]
gi|238839640|gb|EEQ29302.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma otae CBS 113480]
Length = 606
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 120/431 (27%), Positives = 198/431 (45%), Gaps = 66/431 (15%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW------MQDFPLKD-Q 100
+P FGTHHSK M+L+ + +II+HTAN+I DW N +Q +W + F + D +
Sbjct: 184 MPELFGTHHSKMMVLVRHDDLTQIIIHTANMIPRDWGNMTQAVWRSPLLPLSQFKMADSR 243
Query: 101 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASV 158
++ F+ DL+ YL+ A+ N KI+ ++++F LI+SV
Sbjct: 244 GDIGSGARFKRDLLAYLN------------AYNNKKIDMLIDQLQRYDFGEVKAALISSV 291
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFKKS---PLVYQFSSLGSL--DEKWM 208
P L WG L+ + + +V Q SS+ +L +KW+
Sbjct: 292 PSRQPARELDSGKRTLWGWPALKDAISSIPIRGNSSQRLEPQVVVQVSSIATLGQTDKWL 351
Query: 209 AE--LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 262
E SS + D + + + I++PT +++R SL+GYA+G +I S +
Sbjct: 352 KETFFSSLCPQSRASDTSNISSTKFSIIFPTPDEIRRSLDGYASGGSIHMKIQSAAQQKQ 411
Query: 263 KDFLKKYWAKWKAS---------------------HTGRSRAMPHIKTFARYNGQKLA-- 299
+L++Y +W GR RA PHIKT+ R++ +
Sbjct: 412 LQYLRRYLCRWAGDAAGQRDTNPATQPDKGSSIVREAGRKRAAPHIKTYIRFSDSGMTSI 471
Query: 300 -WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS--AKRHGCGFSCTSNIVPSE- 355
W ++TSANLS AWGA ++ I S+E+GVL+ P +R +S I P +
Sbjct: 472 DWAMVTSANLSTQAWGAGANTQGEVRICSWEIGVLVWPDLFRERMTSKDKDSSTIHPVKM 531
Query: 356 IKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV-VYLPVPYELPPQRYSSEDVPWSWDKR 414
I +T + + + + +S +GA++ + L +PY LP Y+ +DVPW
Sbjct: 532 IPCFKCDTPSEKSLLCESDSTNSTSHSGATNMTRIGLRMPYNLPLVPYTHQDVPWCATAV 591
Query: 415 YTKKDVYGQVW 425
+ + D GQ W
Sbjct: 592 HREPDWLGQTW 602
>gi|332029128|gb|EGI69139.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 550
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 174/375 (46%), Gaps = 71/375 (18%)
Query: 53 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSEE-C 107
+ +HH+ M+L Y G+R+IV TA L +DW N++QGLW+ P + + E
Sbjct: 224 YSSHHANIMILQYKNGIRVIVSTAGLYSIDWENRTQGLWISPHLPYLPESAKPSDGESPT 283
Query: 108 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 167
GF+ DL YLS K P + + A + +FS V L+ASVPG +
Sbjct: 284 GFKKDLERYLSKYKQPALTQWIRA----------VQMADFSDVNVFLVASVPGIYKADEA 333
Query: 168 KKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL---------DEKW-MAELSSSMSS 217
WG+ KL VL ++ P+V Q S +G D W M+E++S S
Sbjct: 334 DFWGYRKLAHVLSRYATLPRNEQWPIVAQSSGVGCFGLFKNWLLKDIIWSMSEMTSKASK 393
Query: 218 GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKAS 276
+ + ++P++E+ + S + + S + + + +L+ Y +WKA+
Sbjct: 394 NHPQFQ---------FIYPSIENYKQSFDYQCLITPLTYSAENHSKQQWLESYLYQWKAT 444
Query: 277 HTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
TGR RAMP+IK++ R + +K+ WFLLTSANLSKAAWG+ K I +YE GVL
Sbjct: 445 RTGRDRAMPNIKSYTRISPDLKKIPWFLLTSANLSKAAWGST-KQYKGYSIGNYEAGVLF 503
Query: 335 LPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVP 394
+P K +T T ++ V P+P
Sbjct: 504 IP---------------------------------KFITGTTTFPVGEEKNTGVPVFPIP 530
Query: 395 YELPPQRYSSEDVPW 409
Y+LP +Y S+D P+
Sbjct: 531 YDLPLTQYESDDSPF 545
>gi|290999837|ref|XP_002682486.1| predicted protein [Naegleria gruberi]
gi|284096113|gb|EFC49742.1| predicted protein [Naegleria gruberi]
Length = 320
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 149/286 (52%), Gaps = 35/286 (12%)
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDY 116
H+K ++ + +RI+V +ANL DW+ Q +W+QDFP K+ + + FEN L+++
Sbjct: 2 HAKLFIIEFDDFIRIVVSSANLTDFDWSFFKQCIWIQDFPKKENISNNNTNQFENTLVEF 61
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
W + + +P +F +K+++S+A LI S+PGYHT K+GH+ ++
Sbjct: 62 -----WTKLTDGIPG--------NFLRKYDYSNAKGELIPSIPGYHTNIEKDKYGHLAIK 108
Query: 177 TVLQECTFEK----GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL 232
++ F K K+SPL YQ SS+GS++ W+ ELSSS + +D
Sbjct: 109 KAIERMNFTKNEILNLKQSPLYYQMSSIGSMNLDWIKELSSSF---YLKDCNNFN----- 160
Query: 233 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK----YWAKWKASHTGRSRAMPHIK 288
IV+P++E V S G G I K + K +++ +A+H S+ + H++
Sbjct: 161 IVFPSLESVSSSHFGLRCGGMIHLKSKTFETSTFPKHLMTHYSPNQANHLAHSKILLHLE 220
Query: 289 TFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
K + + S NLS+ A G LQKN +QL I +YELGV+
Sbjct: 221 NL------KNGYIFVGSHNLSQPALGKLQKNGTQLYISNYELGVIF 260
>gi|342883838|gb|EGU84260.1| hypothetical protein FOXB_05217 [Fusarium oxysporum Fo5176]
Length = 1127
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 159/326 (48%), Gaps = 49/326 (15%)
Query: 45 HKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLW----------MQ 93
H P+P FGTHHSK M+L G ++I+HTAN+I DW N S G+W Q
Sbjct: 129 HIAPMPEMFGTHHSKMMILFRHDGTAQVIIHTANMIPKDWTNMSNGVWKSPLLPKLSGAQ 188
Query: 94 DFPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 152
+F + +++ F+ DL++YL + K ++FSS
Sbjct: 189 NFQASPEDHSVGSGQRFKIDLLNYLKAYDRRKIIC--------KPLTDKLTHYDFSSIKA 240
Query: 153 RLIASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WM 208
L+ASVPG H + + WG L+ LQ + S +V Q SS+ +L K W
Sbjct: 241 ALVASVPGKHDARDMSETSWGWAALKRCLQHVPCQD-HGDSDIVVQVSSIATLGAKDDW- 298
Query: 209 AELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 262
L ++ + K P G+G P +V+PT +++R SL+GYA+G +I S Q+
Sbjct: 299 --LQKTLFEPLTRSKNP-GLGRPRFKVVFPTADEIRRSLDGYASGGSIHTKIQSSQQAKQ 355
Query: 263 KDFLKKYWAKWKAS--------------HTGRSRAMPHIKTFARYNGQKLAWFLLTSANL 308
++L+ + W +GR RA PHIKT+ R N + W LLTSAN+
Sbjct: 356 LEYLRPIFHHWANDSPRGAKLPEDTPLRDSGRKRAAPHIKTYIRSNKSSIDWGLLTSANI 415
Query: 309 SKAAWGALQKNNSQLMIRSYELGVLI 334
SK AWG + ++ I S+E+GVLI
Sbjct: 416 SKQAWGEAARPTGEMRIASWEIGVLI 441
>gi|121715578|ref|XP_001275398.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus clavatus NRRL
1]
gi|119403555|gb|EAW13972.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus clavatus NRRL
1]
Length = 576
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/425 (28%), Positives = 193/425 (45%), Gaps = 75/425 (17%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P FGTHHSK M+L+ + ++++HTAN+I DW N Q +W PL+ +++EE
Sbjct: 177 MPEPFGTHHSKMMILLRHDDLAQVVIHTANMIPGDWANMCQAVWRSPLLPLQKVEHIAEE 236
Query: 107 CG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIAS 157
G F+ DL+ YL+ +G K P +F+FSS LIAS
Sbjct: 237 PGTIGSGARFKRDLLAYLN------------EYGAKKTGPLVKQLARFDFSSVRAALIAS 284
Query: 158 VPGYHTGSSLKK-----WGHMKLRTVLQEC--TFEKGFKKSP--LVYQFSSLGSL--DEK 206
VP +SL WG LR ++ T E+G + + ++ Q SS+ +L +K
Sbjct: 285 VPSKQKLASLDLQRKTLWGWPALRETTRQIPLTREQGSETATPHIITQISSIATLGQTDK 344
Query: 207 WMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 262
W+ ++ + S + + TP + IV+PT +++R SL GY +G +I S ++
Sbjct: 345 WLKDVFFN-SLAPTSNPTPPTKSKYSIVFPTPDEIRRSLNGYGSGGSIHMKLQSTTQHKQ 403
Query: 263 KDFLKKYWAKW------------------KASHTGRSRAMPHIKTFARYNG----QKLAW 300
+++ Y W K GR RA PHIKT+ R+ + W
Sbjct: 404 LQYMRPYLRHWAGDSSTHSSDGRGETSTTKTQEAGRRRAAPHIKTYIRFADANRMNAIDW 463
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
++TSANLS AWGA +N ++ I S+E+GV++ P ++ +
Sbjct: 464 AMVTSANLSTQAWGAAVNSNGEVRICSWEVGVMVWPQLFAEKAEQQQQQAMMVPCFRRDL 523
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 420
+Q K L V L +PY+LP Y +++VPW + + D
Sbjct: 524 PVDCPVQPAKCDVL--------------VGLRMPYDLPLTSYRADEVPWCATATHMEPDW 569
Query: 421 YGQVW 425
GQ W
Sbjct: 570 LGQTW 574
>gi|302500932|ref|XP_003012459.1| hypothetical protein ARB_01418 [Arthroderma benhamiae CBS 112371]
gi|291176017|gb|EFE31819.1| hypothetical protein ARB_01418 [Arthroderma benhamiae CBS 112371]
Length = 587
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/428 (27%), Positives = 191/428 (44%), Gaps = 63/428 (14%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHHSK M+LI + ++I+HTAN+I DW N +Q +W Q+ + + C
Sbjct: 168 MPEPFGTHHSKMMILIRHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLSQSQVDDTC 227
Query: 108 G-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASV 158
G F+ DL+ YL A+ N IN ++++F + LIASV
Sbjct: 228 GVFGSSARFKRDLLAYLE------------AYNNKTINILIRQLRRYDFGAVKALLIASV 275
Query: 159 PGY-----HTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWM 208
P + WG L+ + ++ ++ ++ Q SS+ +L +KW+
Sbjct: 276 PTRLPVKEFDSNRRTLWGWPALKDAIGSIPIDRSSSQAQNPHIIVQVSSIATLGQTDKWL 335
Query: 209 AE--LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 262
E L S + I++PT +++R SL+GY +G +I SP +
Sbjct: 336 RETFLRSLCPQPEVNQSRSTSNVKFSIIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQ 395
Query: 263 KDFLKKYWAKW-----------------KASHTGRSRAMPHIKTFARYNGQKL---AWFL 302
+L+ Y W + GR RA PHIKT+ R++ + W +
Sbjct: 396 LAYLRHYLCHWAGDAEDPKNSDPATKSDRVREAGRRRAAPHIKTYIRFSDSDMNSIDWAM 455
Query: 303 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 362
+TSANLS AWGA ++ I S+E+GVLI P R C+ + + + +K
Sbjct: 456 ITSANLSTQAWGAGANTQGEVRICSWEVGVLIWPDLFREENIEECSDSSLTNYVKMIPCF 515
Query: 363 TSQIQKTKLVTLTWHGSSDAGASSEV-----VYLPVPYELPPQRYSSEDVPWSWDKRYTK 417
+ K + + + S+ S+ V L +PY+LP Y+ ++VPW + +
Sbjct: 516 KRNVPSEKPLQTSENDSTKVTLHSDATNMTRVGLRMPYDLPLIPYTPQEVPWCATAVHRE 575
Query: 418 KDVYGQVW 425
D GQ W
Sbjct: 576 PDWMGQTW 583
>gi|169775023|ref|XP_001821979.1| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae RIB40]
gi|83769842|dbj|BAE59977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 570
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 199/418 (47%), Gaps = 73/418 (17%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P +FGTHHSK M+L+ + V++++HTAN+I DW N Q +W PL+ ++ E+
Sbjct: 182 MPEAFGTHHSKMMVLLRHDDLVQVVIHTANMIPGDWTNMCQAVWRSPLLPLQKTDDRVED 241
Query: 107 ------CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASV 158
F+ DL+ YL+ +G K P +K++F + L+ASV
Sbjct: 242 LTLGSGARFKRDLLAYLT------------EYGPKKTGPLVEQLRKYDFGAIRAALVASV 289
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWM 208
P L WG L+ ++++ + K+ +V Q SS+ +L +KW+
Sbjct: 290 PSKQKVDDLDSQKKTLWGWPALKDIMRQIPPAQKTTKATTPHIVTQISSVATLGQTDKWL 349
Query: 209 AELS-SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDK 263
++ +S+S + + P + I++PT +++R SL GY +G +I S +
Sbjct: 350 KDVMFASLSPASTSTRQP----KYSIIFPTADEIRRSLNGYGSGGSIHMKLQSAAQQKQL 405
Query: 264 DFLKKYWAKWKASH------------TGRSRAMPHIKTFARYNGQK----LAWFLLTSAN 307
+++ Y W H GR RA PHIKT+ R++ + + W ++TSAN
Sbjct: 406 QYMRPYLRHWAGDHDTAEPSHTSKQDAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSAN 465
Query: 308 LSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 367
LS AWGA + ++ I S+E+G+++ P ++ +VP+ K + E + +
Sbjct: 466 LSTQAWGAAVNASGEVRICSWEIGIVVWPQLYVQDTE---SATMVPT-FKRDTPEPLENK 521
Query: 368 KTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
++ T V+ L +PY+LP Y++ D PW ++ + D GQ W
Sbjct: 522 DSETTPDT------------VIGLRMPYDLPLTPYAAHDTPWCATAQHLEPDWLGQTW 567
>gi|154344310|ref|XP_001568099.1| putative tyrosyl-DNA phosphodiesterase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065433|emb|CAM40865.1| putative tyrosyl-DNA phosphodiesterase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 680
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 131/467 (28%), Positives = 189/467 (40%), Gaps = 134/467 (28%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTL----------------------------- 31
M D WLL P L+ + LV+ GT
Sbjct: 67 MTDFRWLLRTVPELSAVTGKLVVLSGEKGTATLRCTTGEPLHSYTATSPLLDRVNPFVAS 126
Query: 32 --EHMKRNKPANWILHK-------PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVD 82
EH + +L + PPLPI+FGTHHSK L + RG+R+ + TANL+ D
Sbjct: 127 LREHAQTTSAVGTLLSRERLAVLEPPLPIAFGTHHSKMALCVNSRGLRVSIFTANLLEQD 186
Query: 83 WNNKSQGLWMQDFPLK----------------------DQNNLSEECGFENDLIDYLS-- 118
W KSQG+++QDFP K +N S C D ++L
Sbjct: 187 WCWKSQGIYVQDFPWKTSAKSSKHDSLDATAGTATTGYSSSNFSGVCPKGIDFAEHLRHY 246
Query: 119 --------TLKWPEFSANLPAHGNFKI-NPSFFKKFNFSSAAVRLIASVPGYHTGSSLK- 168
+ A G I F +FS+AAV L++SVPG H +
Sbjct: 247 LIQCGVSLAAAFTSLKAAASLAGPLGIFETDFLSHIDFSAAAVWLVSSVPGTHAHGEVSP 306
Query: 169 --KWGHMKLRTVLQEC--TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFS 220
+ G +L VL+ T L++Q+SS GSL+ ++ L ++M +
Sbjct: 307 GYRVGLCRLAEVLRRSPLTMATTPASVDLIWQYSSQGSLNSTFLNTLQAAMCGEAVTVIE 366
Query: 221 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP------------------------- 255
P G+ + L+V+PT E+VR S EG+ G ++P
Sbjct: 367 SGNAPRGVRDVLVVYPTEEEVRNSWEGWRGGGSLPLRVQCCHEFVNNRLHRWGSRAEDHA 426
Query: 256 ------SPQKNV---------------DKDFLKKYWAKWKASHTG-RSRAMPHIKTFARY 293
P K V D D ++ A AS R A+PHIK++A
Sbjct: 427 VEHGLTQPAKGVAAHASREDAVDVDQADSDRDEEATASLVASCAAYRQFALPHIKSYAAV 486
Query: 294 NGQK--LAWFLLTSANLSKAAWGAL-----QKNNSQLMIRSYELGVL 333
+ + WFLLTSANLS+AAWG++ ++ Q ++RSYELGVL
Sbjct: 487 APDRTCVRWFLLTSANLSQAAWGSVSGKVKKRGLCQQLVRSYELGVL 533
>gi|332029125|gb|EGI69136.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 522
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 174/348 (50%), Gaps = 29/348 (8%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
+VD++WL + + + +++G D N N + K + F HH+K
Sbjct: 130 IVDVEWLCWQYLLAGQCTDMTILYG--DKAYYQTLFN---NITIIKVNIETGFACHHTKI 184
Query: 61 MLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKDQNNLSE---ECGFENDLI 114
M+L Y G+R+IV TANL DW N +QGLW+ P L + N S+ GF+ DL
Sbjct: 185 MILQYKDDGIRVIVSTANLRSTDWENVTQGLWISPHLPRLPESANPSDGESPTGFKKDLE 244
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YLS + P + + A + +FS V LIASVPG + + WG+ K
Sbjct: 245 RYLSKYEQPTLTQWICA----------VQMADFSKVNVFLIASVPGIYQNNEANFWGYKK 294
Query: 175 LRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP-- 231
L VL + T P+V Q SS+G L + + L + S + T G+P
Sbjct: 295 LAHVLSRHVTLPSDVFPWPIVAQSSSIGKLGSSFESWLLKDIIPCMSRESTESTKGQPEF 354
Query: 232 LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTF 290
++P++++ + S P S + + + +L Y +WKA T R RAMPHIK++
Sbjct: 355 KFIYPSIQNYKQSFHYKNLSWCSPYSAEAHSKQQWLDLYLHQWKAKRTERDRAMPHIKSY 414
Query: 291 ARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
R + + + WF+LTSANLSKAAWG+++++ I +YE G++ +P
Sbjct: 415 TRISPDLKSIPWFVLTSANLSKAAWGSIKRHGYS--IENYEAGIIFVP 460
>gi|332029126|gb|EGI69137.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 511
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 175/372 (47%), Gaps = 64/372 (17%)
Query: 53 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQN----NLSEEC 107
F +HH+ M+L Y G+R+IV TA L +W N++QGLW+ P ++ +
Sbjct: 178 FSSHHTNIMILQYKNGIRVIVSTAGLYSAEWENRTQGLWISPHLPYLPESAHPSDGESST 237
Query: 108 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 167
GF+ DL YLS P + + ++ +FS V L+ASVPG H +
Sbjct: 238 GFKKDLERYLSKYDQPVLTQWICT----------VRRVDFSDVNVFLVASVPGIHKSYEI 287
Query: 168 KKWGHMKLRTVLQECTFEKGFKKSPLVYQFS---SLGSLDEKWM-AELSSSMSSGFSEDK 223
WG KL VL ++ P+V Q S + GS E W+ ++ MS +
Sbjct: 288 NFWGCKKLAYVLSRYVTLPSNEQWPIVIQSSGVGNFGSTIESWLLRDIIRCMSK-----E 342
Query: 224 TPLGIG---EPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTG 279
T +G+ + ++P++E+ + S + ++ S + + + +L++Y +WKA TG
Sbjct: 343 TSIGLKNHPQFQFIYPSIENYKQSFDCQDLITSLTYSVEIHSKQQWLEQYLYQWKAKRTG 402
Query: 280 RSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS 337
R AMP IK++ R + +++ WFLLTSANLSKAAWG +++ I +YE GVL +P
Sbjct: 403 RDCAMPGIKSYTRISPDSKRVPWFLLTSANLSKAAWGLIKRYEG-YSIGNYEAGVLFIP- 460
Query: 338 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYEL 397
K++T T + V P+PY+L
Sbjct: 461 --------------------------------KVITGTATFPIGEEEDAAVPTFPIPYDL 488
Query: 398 PPQRYSSEDVPW 409
P RY S+D P+
Sbjct: 489 PLSRYDSDDSPF 500
>gi|340521404|gb|EGR51638.1| predicted protein [Trichoderma reesei QM6a]
Length = 1118
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 171/351 (48%), Gaps = 54/351 (15%)
Query: 44 LHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWM---------Q 93
LH P+P FGTHHSK M++ ++++HTAN+I DW N + +W Q
Sbjct: 130 LHCAPMPEMFGTHHSKMMVIFRSDDTAQVVIHTANMIPKDWTNMTNAVWRSPRLPRLGEQ 189
Query: 94 DFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 151
D + L G F+ DL++YL ++ + + +N F+FSS
Sbjct: 190 DTLFQQGQQLPVGSGTRFKVDLLEYLR--QYELYRPTCKQLVDRLVN------FDFSSIR 241
Query: 152 VRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK--W 207
IASVPG H+ +S WG ++ L+ E+G +S +V Q SS+ +L K W
Sbjct: 242 AAFIASVPGRHSFRDASRPAWGWAAVQRCLRCVPVERG--QSQIVVQISSIATLGAKDDW 299
Query: 208 MAELSSSMSSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNA----IPSPQKNV 261
L ++ + TP G P +V+PTV+++R S++GYA+G + I SPQ+
Sbjct: 300 ---LQRTLFDSLATSLTP-NTGRPGFKVVFPTVDEIRNSIDGYASGRSIHTKIQSPQQIR 355
Query: 262 DKDFLKKYWAKWK---------------ASHTGRSRAMPHIKTFARYN-GQKLAWFLLTS 305
+L+ W + +GR RA PHIKT+ R+N + W +LTS
Sbjct: 356 QLGYLRPILHHWANDSAGGAKLPGEPSISGDSGRDRAAPHIKTYIRFNESNTIDWAMLTS 415
Query: 306 ANLSKAAWG-ALQKNNSQLMIRSYELGVLILPSAK-RHGCGFSCTSNIVPS 354
AN+SK AWG AL + I S+E+GVL+ P G S ++VPS
Sbjct: 416 ANMSKQAWGEALSSTTGNIRIASWEVGVLVWPGLLCEDGAMVSSPKSLVPS 466
>gi|212543739|ref|XP_002152024.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces marneffei ATCC
18224]
gi|210066931|gb|EEA21024.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces marneffei ATCC
18224]
Length = 587
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 123/431 (28%), Positives = 193/431 (44%), Gaps = 81/431 (18%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF----PLKDQNNL 103
+P FGTHHSK M+L+ + ++I+HTAN++ DW N SQ +W P++D +
Sbjct: 182 MPEPFGTHHSKMMILLRHDDHAQVIIHTANMLAGDWTNMSQAVWRSPLLSLSPIRDNSET 241
Query: 104 SEECGF------ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLI 155
++ F + DL+ YL EF +GN K +KF+F + LI
Sbjct: 242 AQAASFGTGARFKRDLLAYL------EF------YGNKKTRSLVDQLRKFDFQAIRAALI 289
Query: 156 ASVPGYHTGSSLKK-----WGHMKLRTVLQECTFE-KGFKKSP-LVYQFSSLGSL--DEK 206
ASVP S WG L+ L++ + + P +V Q SS+ SL +K
Sbjct: 290 ASVPSKERISRADSSRSTLWGWPALKDTLRQVPLRIREKNQCPHVVIQISSIASLGQTDK 349
Query: 207 WMAELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 264
W+ ++ SE + P I++PT +++R SL GY +G +I +++ +
Sbjct: 350 WLKDVLFDSLCLPSELPHTNKMPRPKYSIIFPTPDEIRRSLNGYGSGGSIHMKLQSITQQ 409
Query: 265 ----FLKKYWAKW----------------------KASHTGRSRAMPHIKTFARYNGQK- 297
+++ Y +W + + GR RA PHIKT+ R+ +
Sbjct: 410 KQLQYMRPYLCQWAGDQKQTAMGTLHLNAESVYNSQRTDAGRRRAAPHIKTYIRFADKTK 469
Query: 298 ---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 354
+ W ++TSANLS AWGA +N ++ I S+E+GVL P I
Sbjct: 470 MDTIDWAMITSANLSTQAWGAAANSNGEVRICSWEIGVLFWPEL------------IAGD 517
Query: 355 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 414
ST T + + T S D S +V +PY+LP YS++DVPW
Sbjct: 518 PFNPNSTRTEMVPSFRKDTPDPTESEDV---SSIVGFRMPYDLPLTPYSAQDVPWCATIN 574
Query: 415 YTKKDVYGQVW 425
+ + D GQ W
Sbjct: 575 HPEPDWLGQSW 585
>gi|328721089|ref|XP_003247207.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like isoform 2
[Acyrthosiphon pisum]
Length = 678
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 126/434 (29%), Positives = 209/434 (48%), Gaps = 73/434 (16%)
Query: 1 MVDIDWLLPACPVL-AKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPL-PISFGTHHS 58
MV++ WL + + + +++ D ++ + + K + HK + +FG HS
Sbjct: 298 MVELGWLFAQYFITDQRGKKMTLLYERCDEDIDELHKKKKLLNVRHKKIINKNAFGHQHS 357
Query: 59 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSE---ECGFENDL 113
K + Y G +R++V +ANL DW +QG+W+ FPLK++++ S+ + F+ D+
Sbjct: 358 KVSMFAYADGSLRVVVMSANLCEDDWTKYAQGIWVSPKFPLKEEDDKSDGNSQTDFKIDI 417
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
+ YL++ + P + +K +FS A +VPG HT WGH+
Sbjct: 418 LRYLNSFREPSLVPWIQK----------IEKVDFSQA------NVPGKHTEPL---WGHL 458
Query: 174 KLRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLG 227
L+ +L++ C + P++ Q SSLGSL DE+W+ +E S+S+ D T
Sbjct: 459 YLKNILKKHACLPFCVPSEWPIIAQCSSLGSLGTTDEEWLKSEFVESLSASTYCDDTDTD 518
Query: 228 IGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRA 283
+P+ +++P+V++V S +G G +P + +K LKKY W+ R++A
Sbjct: 519 -NDPIPFHLIYPSVKNVLNSWDGALGGICLPYNKILHEKQLWLKKYMCLWQCHSRKRTKA 577
Query: 284 MPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQL-MIRSYELGVLILPSAKR 340
MPHIKT+ R + +++WFLL SANLSKAAWG K++ Q I ++E GVL LP
Sbjct: 578 MPHIKTYCRISPCLTEMSWFLLGSANLSKAAWGRKLKSDEQSNFIMAHEAGVLFLPQ--- 634
Query: 341 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 400
F S+ P D ++ Y +P++LP
Sbjct: 635 ----FLIGSDTFP--------------------------IDETEPNKFPYFSLPFDLPLA 664
Query: 401 RYSSEDVPWSWDKR 414
YS D PW+ R
Sbjct: 665 GYSDTDQPWTISTR 678
>gi|189242173|ref|XP_970490.2| PREDICTED: similar to tyrosyl-dna phosphodiesterase [Tribolium
castaneum]
Length = 358
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 117/379 (30%), Positives = 173/379 (45%), Gaps = 67/379 (17%)
Query: 53 FGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE- 106
FG HHSK + Y +R+++ TANL + DWN+ +QGLW+ P E
Sbjct: 23 FGCHHSKIGIYFYEDNSLRVVISTANLYYEDWNHYNQGLWLSPPCPQLPETATEKSGESP 82
Query: 107 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 166
GF++ L++YL NLP K + K+ +FS+ V L+ SVPG H +
Sbjct: 83 TGFKSSLLNYLKHY-------NLPV---LKPWIDYVKRADFSAVRVFLVTSVPGKHYPGT 132
Query: 167 LKKWGHMKLRTVLQECTFEKGFKKSP---------LVYQFSSLGSLDEKWMAELSSSMSS 217
H + + C+ K P ++ Q SS+GS+ + L S++
Sbjct: 133 QGSHVHHVGDLLSRHCSLPA--KTGPDSEGPLSWGIIAQASSIGSMGKSPAEWLRSTLLR 190
Query: 218 GFSEDKTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 272
S K + I++P+V++V G +G +P S Q N + +L+ Y +
Sbjct: 191 SLSGHKQTQLVSNSNATLSIIFPSVDNVMNGYFGAESGGCLPYSKQTNEKQRWLQSYLHQ 250
Query: 273 WKASHTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYEL 330
WKA GRSRAMPHIKT+ R + KLAWF +TSANLSK+AWG + + +RSYE
Sbjct: 251 WKADKLGRSRAMPHIKTYCRVSPCLSKLAWFFITSANLSKSAWGGNLQKDKGAYVRSYEA 310
Query: 331 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 390
GV+ LP K E +I+ T +G + ++
Sbjct: 311 GVMFLP--------------------KFFDEEYFEIETTL-----------SGKNKKL-- 337
Query: 391 LPVPYELPPQRYSSEDVPW 409
P Y+LP Y S D PW
Sbjct: 338 FPFMYDLPLTEYKSSDYPW 356
>gi|307109629|gb|EFN57867.1| hypothetical protein CHLNCDRAFT_143337 [Chlorella variabilis]
Length = 370
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 139/272 (51%), Gaps = 44/272 (16%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVI-------------HGESDGTLEHMKRNKPANWIL--- 44
M+D+ WLL ACP L + +L++ G TL+ +R L
Sbjct: 110 MLDLPWLLSACPDLHRAERILLVSHRPWLAKKAKVEEGAKPRTLQARERKLADVRALGLE 169
Query: 45 -----HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 99
++P + GT+HSK L+ Y RG+R+I+ +AN + D NNK+Q L+ QDFP KD
Sbjct: 170 DRASVYEPAIG-GHGTNHSKFFLVDYERGMRVIIMSANAVFSDCNNKTQVLFTQDFPRKD 228
Query: 100 QNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 159
+ + + FE L Y+ L+ P G + +FS+A L+ASVP
Sbjct: 229 EQS-PKTSAFEGALEAYIRELRMP--------CGPTLHLVQLIRSCDFSAARGHLVASVP 279
Query: 160 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA-ELSSSMSSG 218
G H G+ L KWGHM++R VL + F F+ +PL Q SSLG L+E+W+ E S+++G
Sbjct: 280 GRHKGADLHKWGHMRMRAVLCQEAFPARFRGAPLAAQMSSLGLLNERWLVREFRYSLAAG 339
Query: 219 FSEDKT---------PLGIGEPLIVWPTVEDV 241
E T PLG+ +V+PTVE+V
Sbjct: 340 LCEGGTDVLGLPANGPLGLQ---LVYPTVEEV 368
>gi|71001518|ref|XP_755440.1| tyrosyl-DNA phosphodiesterase [Aspergillus fumigatus Af293]
gi|66853078|gb|EAL93402.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus fumigatus
Af293]
gi|159129510|gb|EDP54624.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus fumigatus
A1163]
Length = 564
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 121/431 (28%), Positives = 188/431 (43%), Gaps = 91/431 (21%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P FGTHHSK M+L+ + ++++HTAN+I DW N Q +W PL+ E
Sbjct: 169 MPEPFGTHHSKMMILLRHDDLAQVVIHTANMIPGDWANMCQAVWRSPLLPLRKSGREPEG 228
Query: 107 CG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIAS 157
G F+ DL+ YL+ +G K P ++F+FS+ LIAS
Sbjct: 229 PGAIGSGVRFKRDLLAYLNE------------YGVKKTGPLVRQLERFDFSAVRAALIAS 276
Query: 158 VPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFK----KSPLVYQFSSLGSL--DEK 206
VP SSL WG L+ ++ K +S +V Q SS+ SL +K
Sbjct: 277 VPSKQRLSSLDSQKKTLWGWPALKEATRQIPLTPKGKSQTVQSHIVTQISSIASLGQTDK 336
Query: 207 WMAELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKN 260
W+ ++ S + I +P I++PT +++R SL GY +G +I S +
Sbjct: 337 WLKDV---FFPSLSPTPSMASIPQPKFSIIFPTPDEIRRSLNGYGSGGSIHMKLQSATQQ 393
Query: 261 VDKDFLKKYWAKWKAS------------HTGRSRAMPHIKTFARYNGQK----LAWFLLT 304
+++ Y W GR RA PHIKT+ R++ + + W ++T
Sbjct: 394 KQLQYMRPYLRHWAGDSDSSSSTSTPQREAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVT 453
Query: 305 SANLSKAAWGALQKNNSQLMIRSYELGVLILP--------SAKRH--GCGFSCTSNIVPS 354
SANLS AWGA N ++ I S+E+GV++ P + +RH C +P
Sbjct: 454 SANLSTQAWGAAVNNAGEVRISSWEIGVIVWPQLFVHEDNTTERHQQAVMVPCFKRDIPL 513
Query: 355 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 414
++ D +V L +PY+LP Y + +VPW
Sbjct: 514 QL----------------------PEDMPRCDVLVGLRMPYDLPLIPYKANEVPWCATIA 551
Query: 415 YTKKDVYGQVW 425
+T+ D GQ W
Sbjct: 552 HTEPDWLGQTW 562
>gi|307211795|gb|EFN87776.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 463
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 105/350 (30%), Positives = 173/350 (49%), Gaps = 31/350 (8%)
Query: 1 MVDIDWL-LPACPVLAKIPHVLVIHGESDGT-LEHMKRNKPANWILHKPPLPISFGTHHS 58
+VD++WL L + ++ H D T L P +++ L + THH+
Sbjct: 116 IVDVEWLCLQYALAGQRTDMTILYHNRRDDTDLSDNISIMP----VYEAELVFNSETHHT 171
Query: 59 KAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEECGFEND 112
K M+L Y G+R++V TANL DW N++QGLW+ L ++ F+ D
Sbjct: 172 KIMILQYKDDGIRVVVSTANLYSNDWENRTQGLWISPHLPRLPELASSSDGESPTNFKQD 231
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
YLS P + K +FS+ V +ASVPG +T + WGH
Sbjct: 232 FKRYLSRYWNPALKQWMDV----------VSKADFSAVNVCFVASVPGNYTHFNADYWGH 281
Query: 173 MKL-RTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP 231
KL R + Q T + ++ Q SS+G+L + + LS + S++ + P
Sbjct: 282 RKLARVLFQHTTLPPDAPQWSIIAQSSSIGNLGPNYESWLSKEIVLSMSQETMQMTNRYP 341
Query: 232 LI--VWPTVEDVRCSLEGYAAGNAI-PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
++P+VE+ S + + + + +++ + +++ + +WKA+ TGR RAMPHIK
Sbjct: 342 KFQYIYPSVENYERSFDFRNSISCFYYTAERHSKQQWIEPFLHQWKATRTGRDRAMPHIK 401
Query: 289 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
++ R + ++++WF+LTSANLSK+AWG S I +YE GV+ LP
Sbjct: 402 SYMRISPDLKRISWFMLTSANLSKSAWGV---KRSTYSITNYEAGVVFLP 448
>gi|256575388|gb|ACU87659.1| tyrosyl DNA phosphodiesterase 1 [Leishmania donovani]
Length = 828
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 136/510 (26%), Positives = 209/510 (40%), Gaps = 151/510 (29%)
Query: 46 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 105
+PPLP++FGTHH+K L + RG+R+ + TANL+ DW KSQG+++QDFP K S
Sbjct: 294 EPPLPVAFGTHHTKMALCVNGRGLRVSIFTANLVEQDWCWKSQGIYVQDFPWKTATERSN 353
Query: 106 ECGFENDLIDYLST------------LKWPEFSANLPAH--------------------- 132
+ +++ + K EF A+L +
Sbjct: 354 DDSAGTTMVETAARSTSDSNNGSNAFTKGAEFVAHLRQYLMQCGVSLAAACASPADAASA 413
Query: 133 ----GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK---KWGHMKLRTVLQEC--T 183
G F+ + F +FS+AAV L++SVPG + + + G +L VL+ T
Sbjct: 414 AGPLGIFETD--FLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRVGLCRLAEVLRRSALT 471
Query: 184 FEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFSEDKTPLGIGEPLIVWPTVE 239
L +Q+SS GSL+ ++ L ++M + P G+ + +V+PT +
Sbjct: 472 MATAPASVDLSWQYSSQGSLNPAFLNSLQAAMCGESVAVIESGDAPRGVRDVQVVYPTED 531
Query: 240 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-------------------- 279
+VR S EG+ G ++P + +F+ +W +S G
Sbjct: 532 EVRNSWEGWRGGGSLPL-RVQCCHEFVNARLHRWGSSEAGHTAKRAFPRPAKVAAAHASR 590
Query: 280 ----------------------------RSRAMPHIKTFARYNGQK--LAWFLLTSANLS 309
R A+PHIK++A + + WFLLTSANLS
Sbjct: 591 EDAVDVDGVDSDGGEGTPVSLAGSCAAYRQFALPHIKSYAAVAPDRSCVRWFLLTSANLS 650
Query: 310 KAAWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 364
+AAWG+L Q + Q ++RSYELGVL + S I P S S+ S
Sbjct: 651 QAAWGSLSRKVNQHGSRQQLVRSYELGVL-----------YDSHSAIYP----SASSWFS 695
Query: 365 QIQKTKLVTLTWHGS------SDAGASSEVVYLPVPYE-LPPQRYSS------------- 404
+ K+K+ S + G ++ V L PY L P Y+S
Sbjct: 696 VVAKSKIELPNARNSRAVLYETPLGVDTQDVCLYTPYNLLCPTPYASTAALRAHRDAPDT 755
Query: 405 ------------EDVPWSWDKRYTKKDVYG 422
DVPW D + +D YG
Sbjct: 756 GEQAVAGAALDCSDVPWVLDMPHRGRDAYG 785
>gi|115384578|ref|XP_001208836.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196528|gb|EAU38228.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1250
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 126/430 (29%), Positives = 194/430 (45%), Gaps = 95/430 (22%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PL-KDQNNLSE 105
+P +FGTHHSK M+L+ + ++++HTAN+I DW N Q +W PL KD + SE
Sbjct: 859 MPEAFGTHHSKMMILLRHDDLAQVVIHTANMIPGDWANMCQAVWRSPLLPLRKDIDAESE 918
Query: 106 ECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIA 156
+ F+ DL+ YL +G K P ++++F + L+A
Sbjct: 919 DAAKIGSGMRFKRDLLAYLDH------------YGPKKTGPLVDQLRRYDFDAVRAALVA 966
Query: 157 SVPG---YHTGSSLKK--WGHMKLRTVLQECTFEK-GFKKSP----LVYQFSSLGSL--D 204
SVP +T S + WG L+ V++ G KS +V Q SS+ SL
Sbjct: 967 SVPSKQKINTADSQRTTLWGWPALKDVVRGIPLRAAGGSKSAVTPHIVSQISSVASLGQT 1026
Query: 205 EKWMAE-----LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----- 254
+KW+ E LSS +S +S I++PT +++R SL GY +G +I
Sbjct: 1027 DKWLKEVFFKSLSSDPTSKYS------------IIFPTDDEIRRSLNGYGSGGSIHMKIQ 1074
Query: 255 PSPQKNVDKDFLKKYWAKW---------------KASHTGRSRAMPHIKTFARYNGQK-- 297
+PQ+ +++ Y W + GR RA PHIKT+ +++ K
Sbjct: 1075 SAPQQK-QLQYIRPYLCHWAGDRDDGSSAGTSMSRKRDAGRRRAAPHIKTYIQFSDTKTM 1133
Query: 298 --LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSE 355
+ W ++TSANLS AWGA + ++ I SYE+GV++ P S+
Sbjct: 1134 DSIDWAMVTSANLSTQAWGAAPNASGEIRICSYEIGVVVWPQL------------FADSD 1181
Query: 356 IKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRY 415
+S Q T + S VV L +PY+LP Y+ +D PW +
Sbjct: 1182 AESAVMVPCFKQDTPAF-----AEREGPVPSVVVGLRMPYDLPLTSYTPKDTPWCATATH 1236
Query: 416 TKKDVYGQVW 425
T+ D GQ W
Sbjct: 1237 TEPDWLGQTW 1246
>gi|296424093|ref|XP_002841585.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637828|emb|CAZ85776.1| unnamed protein product [Tuber melanosporum]
Length = 510
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 126/458 (27%), Positives = 204/458 (44%), Gaps = 86/458 (18%)
Query: 1 MVDIDWLLPAC-PVLAKIPHVLVIHG-----ESDGTLEHMKRNKPANWILHKPPLPISFG 54
+ D+DW++ P + V ++HG +++ H + N L +P +G
Sbjct: 104 LFDLDWVMNQFDPDVKDTVKVRIVHGSWRREDANRARIHDQAESYPNVKLVCAFMPEPYG 163
Query: 55 THHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG---- 108
THHSK +L +II+HTAN+I DW N +Q +W PL Q++ S
Sbjct: 164 THHSKMFVLFRTDDHAQIIIHTANMIPFDWQNMTQAVWQSPLLPLLPQDHGSPRAQTFKP 223
Query: 109 ----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG-YHT 163
F+ D++ Y S G + +++F + SVPG +H
Sbjct: 224 IGQRFKTDILAYFSAY----------GEGRTDFLTTQLSRYSFDPVKAVFVGSVPGKFHI 273
Query: 164 GSSLKK---WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAEL--SSSMS 216
+S K WG +L +VL++ K +V Q SS+ +L K W++ + +S +
Sbjct: 274 DASNGKGYEWGWRRLASVLRKVPLRSPEAKGCIVVQVSSIATLGSKNTWLSPVLFASLKT 333
Query: 217 SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKAS 276
S F+ P + +++PT ++R SL GY +G+++ K+ + + +
Sbjct: 334 SRFTASAEP----KFHVIFPTANEIRESLNGYRSGSSL-----------HMKFQSPAQQA 378
Query: 277 HTGRSRAMPHIKTFARY---NGQKLAWFLLTSANLSKAAWGALQK------NNSQLMIRS 327
G +RA PHIKT+ R+ + ++ W LLTSAN+S AWGA +K N+ ++ I S
Sbjct: 379 QLG-ARAAPHIKTYIRFSDTDCTQIDWALLTSANISIQAWGAAEKDPIGRINHREVRICS 437
Query: 328 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 387
YE GVL+ P +P EI G T AG
Sbjct: 438 YEAGVLVYPEILDVEEMVPTFRKDIPDEIGDGGT--------------------AG---- 473
Query: 388 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
L +PY LP ++Y+S ++PW K Y+ D GQ W
Sbjct: 474 ---LRMPYGLPLRKYASNEMPWCAYKSYSDVDWLGQRW 508
>gi|119481099|ref|XP_001260578.1| tyrosyl-DNA phosphodiesterase, putative [Neosartorya fischeri NRRL
181]
gi|119408732|gb|EAW18681.1| tyrosyl-DNA phosphodiesterase, putative [Neosartorya fischeri NRRL
181]
Length = 564
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 121/432 (28%), Positives = 191/432 (44%), Gaps = 93/432 (21%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P FGTHHSK M+L+ + ++++HTAN+I DW N Q +W L+ E
Sbjct: 169 MPEPFGTHHSKMMILLRHDDLAQVVIHTANMIPGDWANMCQAVWRSPLLALRKSEREPEG 228
Query: 107 CG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIAS 157
G F+ DL+ YL+ +G K P ++F+FS+ LIAS
Sbjct: 229 PGAIGSGARFKRDLLAYLNE------------YGVKKTGPLVRQLERFDFSAVRAALIAS 276
Query: 158 VPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFK----KSPLVYQFSSLGSL--DEK 206
VP SSL WG L+ ++ K +S +V Q SS+ SL +K
Sbjct: 277 VPSKQRLSSLDSRKKTLWGWPALKEATRQIPLTPKGKSQTVQSHIVTQISSIASLGQTDK 336
Query: 207 WMAELS-SSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQK 259
W+ ++ +S+S S + P +P I++PT +++R SL GY +G +I S +
Sbjct: 337 WLKDVFFASLSPTSSMESIP----QPKFSIIFPTPDEIRRSLNGYGSGGSIHMKLQSATQ 392
Query: 260 NVDKDFLKKYWAKWKAS------------HTGRSRAMPHIKTFARYNGQK----LAWFLL 303
+++ Y W GR RA PHIKT+ R++ + + W ++
Sbjct: 393 QKQLQYMRPYLRHWAGDSDSSSSTSTPQREAGRRRAAPHIKTYIRFSDAEKMDTIDWAMV 452
Query: 304 TSANLSKAAWGALQKNNSQLMIRSYELGVLILP--------SAKRH--GCGFSCTSNIVP 353
TSANLS AWGA N ++ I S+E+GV++ P + +RH C +P
Sbjct: 453 TSANLSTQAWGAAVNNAGEVRISSWEIGVMVWPQLFVREDNTTERHQQAVMVPCFKRDIP 512
Query: 354 SEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDK 413
++ + +V L +PY+LP Y + +VPW
Sbjct: 513 LQL----------------------PDETPGCDVLVGLRMPYDLPLTPYKANEVPWCATA 550
Query: 414 RYTKKDVYGQVW 425
+T+ D GQ W
Sbjct: 551 AHTEPDWLGQTW 562
>gi|358384803|gb|EHK22400.1| hypothetical protein TRIVIDRAFT_179757 [Trichoderma virens Gv29-8]
Length = 1118
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 170/351 (48%), Gaps = 59/351 (16%)
Query: 44 LHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQ--------- 93
LH P+P FGTHHSK M+L + +I++HTAN+I DW N + +W
Sbjct: 130 LHCAPMPEMFGTHHSKMMILFHSDNTAQIVIHTANMIPKDWTNMTNAVWRSPKLPWRWEL 189
Query: 94 --DFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 151
Q F+ DL+ YL +++ + +N F+FSS
Sbjct: 190 DPRLQQAQQAPFGSGIRFKADLLAYL--MQYDSHRVTCKQLVDRLVN------FDFSSIR 241
Query: 152 VRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK--W 207
LIASVPG + +S WG L+ LQ E G +S +V Q SS+ +L K W
Sbjct: 242 AALIASVPGRYNLYDTSSPAWGWTALKRCLQTVPVETG--ESQIVVQISSIATLGAKDDW 299
Query: 208 MAE-LSSSMSSGFSED-KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF 265
+ + L +S+++ ++D K P + +V+PT +++R SL+GYA+G +I + K+
Sbjct: 300 LQKILFNSLATSRNQDTKKP----DFKVVFPTADEIRNSLDGYASGQSIHTKIKSAQHIR 355
Query: 266 LKKY-------WAKWKAS------------HTGRSRAMPHIKTFARYN-GQKLAWFLLTS 305
Y WA A +GR+RA PHIKT+ R+N + W +LTS
Sbjct: 356 QLHYLHPMLHHWANDSADGVGLLEQPPISGDSGRNRAAPHIKTYTRFNQNNSIDWAMLTS 415
Query: 306 ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEI 356
AN+SK AWG + ++ I S+E+GVL+ P G C + ++ S I
Sbjct: 416 ANMSKQAWGEAPSSTGEVRIASWEVGVLVWP-------GLLCENGVMVSSI 459
>gi|225682330|gb|EEH20614.1| tyrosyl-DNA phosphodiesterase [Paracoccidioides brasiliensis Pb03]
Length = 628
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 126/473 (26%), Positives = 199/473 (42%), Gaps = 109/473 (23%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW-----------MQDFP 96
+P +FGTHHSK M++I + +I++HTAN+I DW N Q +W ++ P
Sbjct: 166 MPEAFGTHHSKMMVIIKHDDQAQIVIHTANMIAGDWANMCQAVWRSPMLPMLSNKRREHP 225
Query: 97 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 156
N++ F+ DL+ Y T H +K++FS+ LIA
Sbjct: 226 SATPNDVGTGSRFKRDLLAYFETY----------GHNKTGALIEQLEKYDFSAIRAALIA 275
Query: 157 SVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFK---KSP-LVYQFSSLGSL--DE 205
S P T L WG L+ +++ F+KG K K P +V Q SS+ +L +
Sbjct: 276 SAPSRQTIDELDSKRRTLWGWPALKDTIRQIPFKKGTKSTEKQPQIVIQISSVATLGQTD 335
Query: 206 KWMAEL--------SSSMSSGF-SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI-- 254
KW+ E S+ S F +E +P I++PT +++R SL GY +G +I
Sbjct: 336 KWLKETLFNSLSPPSARSSELFKTESNSPAKFS---IIFPTPDEIRRSLNGYMSGGSIHM 392
Query: 255 --PSPQKNVDKDFLKKYWAKW--------------------------------------- 273
S + +L+ Y +W
Sbjct: 393 KLQSAAQQKQLQYLRPYLCRWAGDANDDGGVKSAGGPATSKRKRLEGNEVSESVQDGASL 452
Query: 274 KASH-----TGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMI 325
K +H GR RA PHIKT+ R++ + W ++TSANLS AWGA ++ I
Sbjct: 453 KKAHRPIREAGRRRAAPHIKTYIRFSDTDMTTIDWAMVTSANLSLQAWGAAANAKKEIRI 512
Query: 326 RSYELGVLILPSAKRHGCGFSCTSNIVPSEIK-------SGSTETSQIQKTKLVTLTWHG 378
SYE+GVL+ P ++ + K SG T ++ +V
Sbjct: 513 CSYEIGVLVWPDLFVDEEIDDSDEPLMKEKGKDNSRGEISGHKNTKDVKTAVMVPCFKRD 572
Query: 379 SSDAG------ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+A +++ +V +PY+LP Y+++D PW Y++ D GQ W
Sbjct: 573 MPEAAENEARSSNTTLVGFRMPYDLPLHSYAAKDQPWCATATYSEPDWLGQTW 625
>gi|242787594|ref|XP_002481044.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces stipitatus
ATCC 10500]
gi|218721191|gb|EED20610.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces stipitatus
ATCC 10500]
Length = 577
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 120/434 (27%), Positives = 196/434 (45%), Gaps = 87/434 (20%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQ--NNLS 104
+P FGTHHSK M+L+ + ++I+HTAN++ DW N SQ LW PL N +
Sbjct: 172 MPEPFGTHHSKMMILLRHDDLAQVIIHTANMLAGDWTNMSQALWRSPLLPLSSTPYNPAT 231
Query: 105 EECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLI 155
EE F+ DL+ YL EF +G K +KF+F + L+
Sbjct: 232 EEAAVFGTGARFKRDLLAYL------EF------YGRRKTGSLVDQLRKFDFYAIRAVLV 279
Query: 156 ASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKG--FKKSPLVYQFSSLGSL--DEK 206
ASVP S + WG L+ L++ + + +V Q SS+ SL +K
Sbjct: 280 ASVPSKERLSRMNSSQSTLWGWPALKDALRQISLSDNEHIEDPHVVIQVSSIASLGQTDK 339
Query: 207 WMAEL--SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 264
W+ ++ S S + + + IV+PT +++R SL GY +G +I ++V +
Sbjct: 340 WLKDVLFDSLCPSSILPNASKRCNPKFSIVFPTPDEIRRSLNGYGSGGSIHMKLQSVAQQ 399
Query: 265 ----FLKKYWAKW----------------------KASHTGRSRAMPHIKTFARYNGQ-- 296
+++ Y W +++ GR RA PHIKT+ R++ +
Sbjct: 400 KQLQYMRPYLCHWAGDQEQTPVRISRTNAEVPSNIQSTDAGRRRAAPHIKTYIRFSDKTK 459
Query: 297 --KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 354
+ W ++TSANLS AWGA +N ++ I S+E+GVL+ P
Sbjct: 460 MDSIDWVMITSANLSTQAWGAAPNSNGEVRICSWEIGVLVWP------------------ 501
Query: 355 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE---VVYLPVPYELPPQRYSSEDVPWSW 411
++ G + ++ K+V + +++ +V +PY+LP RY +DVPW
Sbjct: 502 QLIVGDSPEPGAERPKMVPCFQKDRPELPNNNDITPIVGFRMPYDLPLARYGVQDVPWCA 561
Query: 412 DKRYTKKDVYGQVW 425
+ + D GQ W
Sbjct: 562 TINHPEPDWLGQSW 575
>gi|46123591|ref|XP_386349.1| hypothetical protein FG06173.1 [Gibberella zeae PH-1]
Length = 1094
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 163/330 (49%), Gaps = 46/330 (13%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL-- 97
N +H P+P FGTHHSK M+L + ++I+HTAN+I DW N + G+W PL
Sbjct: 125 NVNVHIAPMPEMFGTHHSKMMILFRHGDTAQVIIHTANMIPKDWTNMTNGVWKS--PLLP 182
Query: 98 ---KDQNNLSEECGF-----ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSS 149
K Q S F E ID L+ LK+ + + + K+ K+++FS+
Sbjct: 183 RMSKTQTPASSPEEFLVGSGERFKIDLLNYLKFYDKRKIICKPLSDKL-----KQYDFST 237
Query: 150 AAVRLIASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK- 206
LIASVPG H + + WG L+ L+ + S +V Q SS+ +L K
Sbjct: 238 IKAALIASVPGRHDAHDMSETSWGWAALKRCLRHVPCHQ-HGDSDIVVQVSSIATLGPKD 296
Query: 207 -WMAELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAG----NAIPSPQK 259
W L ++ K G+ P +V+PT +++R SL+GYA+G I SPQ+
Sbjct: 297 DW---LQKTLFDHLGRCKD-TGLRRPRFKVVFPTADEIRRSLDGYASGLSIHTKIQSPQQ 352
Query: 260 NVDKDFLKKYWAKWKAS-------------HTGRSRAMPHIKTFARYNGQKLAWFLLTSA 306
++L+ + W +GR RA PHIKT+ R N + W LLTSA
Sbjct: 353 AKQLEYLRPMFHHWANDSPGGTKLPDGPVLESGRKRAAPHIKTYVRSNKSSIDWGLLTSA 412
Query: 307 NLSKAAWGALQKNNSQLMIRSYELGVLILP 336
N+SK AWG + ++ I S+E+GVLI P
Sbjct: 413 NISKQAWGEAARPTGEMRIASWEVGVLIWP 442
>gi|213409511|ref|XP_002175526.1| tyrosyl-DNA phosphodiesterase [Schizosaccharomyces japonicus
yFS275]
gi|212003573|gb|EEB09233.1| tyrosyl-DNA phosphodiesterase [Schizosaccharomyces japonicus
yFS275]
Length = 518
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 137/451 (30%), Positives = 196/451 (43%), Gaps = 80/451 (17%)
Query: 12 PVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYP 66
P + K V V HG S + L K P + LH +P +GTHHSK M+ +
Sbjct: 107 PSVLKQVKVHVTHGYSYDSPRMDVLRQQKTRLPMDIELHSVYVP-QWGTHHSKIMVNFFA 165
Query: 67 R-GVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC------GFENDLIDYLST 119
++++HTAN+I +DW SQ ++ PL + + E F+ D YLS
Sbjct: 166 DDSCQVVIHTANMIQMDWEGMSQAIYKT--PLLWRKTVEREGPPSVGDRFQKDFCSYLSH 223
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
K A L ++++F+S I+SVPG G L WGH +L L
Sbjct: 224 YK---HCAKLICK---------LQRYDFTSVKAIFISSVPGKFGGDKLDSWGHNRLEKEL 271
Query: 180 Q--ECTFE-----KGFKKSPL-VYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIG 229
E E F+ S + V Q SS+GS + ++ E + ++ + K
Sbjct: 272 AAIESMAEFMGPRNKFQDSDICVSQCSSMGSFGARQAFLKEHTKALHCDLTHWK------ 325
Query: 230 EPLIVWPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHTGRSRA 283
+++PTV DVR SL G+ +G++I V++ KWKA +GR R
Sbjct: 326 ---LIFPTVTDVRDSLLGWHSGSSIHFNVTARGAPAQVEELVRHNQLCKWKAMKSGRQRI 382
Query: 284 MPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQ------KNNSQLMIRSYELGVLIL 335
PH+KT+ R N G + W LLTSANLSK AWG L+ K L IRSYE GVL+
Sbjct: 383 APHVKTYMRLNDEGTLIRWVLLTSANLSKPAWGTLEGVAANSKTEHGLRIRSYEAGVLLH 442
Query: 336 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 395
P +C V KS S ++ D S V + +P+
Sbjct: 443 PGLFADDSNSACAFFPV---YKSNSLKSPNF--------------DFPLS---VAIRMPW 482
Query: 396 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 426
+ PPQ Y +D WS + D G WP
Sbjct: 483 DFPPQPYGDKDDIWSPSIPRNETDWLGSKWP 513
>gi|171683299|ref|XP_001906592.1| hypothetical protein [Podospora anserina S mat+]
gi|170941609|emb|CAP67263.1| unnamed protein product [Podospora anserina S mat+]
Length = 569
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 127/453 (28%), Positives = 194/453 (42%), Gaps = 98/453 (21%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL- 97
N LH LP FGTHHSK +L+ + ++++HTANLI DW N +QG W PL
Sbjct: 145 NVTLHAAFLPEMFGTHHSKMFILLRHDDTAQLVIHTANLITRDWTNMTQGAWFSPRLPLL 204
Query: 98 -----KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 152
+ + + F+ D ++YL + P + K++FSS
Sbjct: 205 KPEHDEGRPRIGNGAKFKLDFLNYLRA-----YDTKRPTCKDITTK---LMKYDFSSING 256
Query: 153 RLIASVPGYHT---GSSLKKWGHMKLRTVLQECTFEKGFKKSPLV-YQFSSLGSL--DEK 206
LI+SVPG HT +S +G +++ L + P V Q SS+ +L +
Sbjct: 257 SLISSVPGRHTVTQSTSSTNFGWAAMKSALAAVPIHSTIEHKPEVAIQISSIATLGPTDS 316
Query: 207 WMAE-----LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSP 257
W+ L ++ ++ F +V+PT +++R SL+GY +G +I SP
Sbjct: 317 WLKNTFLHTLGNTPATTFK------------VVFPTPDEIRKSLDGYMSGGSIHTKTQSP 364
Query: 258 QKNVDKDFLKKYWAKW---------------------------------KASHTGRSRAM 284
Q+ +LK + W K ++GR RA
Sbjct: 365 QQVKQLQYLKPLFHHWANDSASGLRMFPPRPLLSPSANAPSPNIAINASKVKNSGRKRAA 424
Query: 285 PHIKTFARYNGQK---------LAWFLLTSANLSKAAWG-ALQKNNSQLMIRSYELGVLI 334
PHIKT+ R + + W LLTSANLSK AWG AL + + I SYE+GVL+
Sbjct: 425 PHIKTYIRSHRPTPESSETDIHIDWALLTSANLSKQAWGEALSAKENTVRISSYEIGVLV 484
Query: 335 LPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV--VYLP 392
P + + + P+ ++ Q + G D EV V L
Sbjct: 485 WPGL------YGENAVMKPAFLEDALPPPEQTRGDG----DGKGKEDYDGKDEVVEVALR 534
Query: 393 VPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+PY+LP Q Y +VPW +T+ D G++W
Sbjct: 535 MPYDLPLQPYGPGEVPWVATASHTEPDWMGRIW 567
>gi|226289717|gb|EEH45201.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 639
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 127/478 (26%), Positives = 199/478 (41%), Gaps = 122/478 (25%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW-----------MQDFP 96
+P +FGTHHSK M++I + +I++HTAN+I DW N Q +W ++ P
Sbjct: 166 MPEAFGTHHSKMMVIIKHDDQAQIVIHTANMIAGDWANMCQAVWRSPMLPMLSNKRREHP 225
Query: 97 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 156
N++ F+ DL+ Y T H +K++FS+ LIA
Sbjct: 226 SATPNDVGTGSRFKRDLLAYFETY----------GHNKTGALIEQLEKYDFSAIRAALIA 275
Query: 157 SVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFK---KSP-LVYQFSSLGSL--DE 205
SVP T L WG L+ +++ F+KG K K P +V Q SS+ +L +
Sbjct: 276 SVPSRQTIDELDSKRRTLWGWPALKDTIRQIPFKKGTKSTEKQPQIVIQISSVATLGQTD 335
Query: 206 KWMAEL--------SSSMSSGF-SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI-- 254
KW+ E S+ S F +E +P I++PT +++R SL GY +G +I
Sbjct: 336 KWLKETLFNSLSPPSARSSELFKTESNSPAKFS---IIFPTPDEIRRSLNGYMSGGSIHM 392
Query: 255 --PSPQKNVDKDFLKKYWAKW--------------------------------------K 274
S + +L+ Y +W K
Sbjct: 393 KLQSAAQQKQLQYLQPYLCRWAGDANDDGVKSAGGPATSKRKRLEGNEVSESVQDGASLK 452
Query: 275 ASH-----TGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIR 326
+H GR RA PHIKT+ R++ + W ++TSANLS AWGA ++ I
Sbjct: 453 KAHRPIREAGRRRAAPHIKTYVRFSDTDMTTIDWAMVTSANLSLQAWGAAANVKKEIRIC 512
Query: 327 SYELGVLILPSAKRHGCGFSCTSNIVPSEIK--------------SGSTETSQIQKTKLV 372
SYE+GVL+ P F I S+ SG T ++ +V
Sbjct: 513 SYEIGVLVWPR-------FIVDEEIDDSDEPLMKEKGKDNSRGEISGHKNTKDVKTAVMV 565
Query: 373 TLTWHGSSDAG------ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 424
+A +++ +V +PY+LP Y+++D PW Y++ D Y +
Sbjct: 566 PCFKRDMPEAAENEARSSNTTLVGFRMPYDLPLHSYAAKDQPWCATATYSEPDCYADM 623
>gi|347837882|emb|CCD52454.1| hypothetical protein [Botryotinia fuckeliana]
Length = 639
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 144/511 (28%), Positives = 213/511 (41%), Gaps = 106/511 (20%)
Query: 3 DIDWLLPACPV-LAKIPHVLVIHG----ESDGTLEHMKR-NKPANWILHKPPLPISFGTH 56
D+D+L+ + + V VIHG E L M++ ++ +N L +P FGTH
Sbjct: 145 DLDFLMEQFDEDVRNLVRVNVIHGFWKREDHSRLNLMEQASRYSNIKLLTAYMPEMFGTH 204
Query: 57 HSKAMLLIYPRG--VRIIVHTANLIHVDWNNKSQGLWM---------QDFPLKDQNNLSE 105
HSK ML+I+ +II+HTAN+I DW N +Q LW + L + + +
Sbjct: 205 HSK-MLIIFRHDCTAQIIIHTANMIPFDWTNMTQALWKSPHLPLLNPKKPTLVEASRIGS 263
Query: 106 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFK---KFNFSSAAVRLIASVPGYH 162
F+ D ++YL I S + K++FS LIASVPG
Sbjct: 264 GSKFKLDFLNYLRAYDTKRI-----------ICKSLIEQLLKYDFSEIKAALIASVPGKQ 312
Query: 163 TGSSLKK----WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMS 216
G+ L WG L L+ + +V Q SS+ SL +KW+ ++S
Sbjct: 313 -GTELSPSQTGWGWAGLTNALKSVPSHHNTQPE-IVIQVSSIASLGPTDKWLTHFFKALS 370
Query: 217 SGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPS----PQKNVDKDFLKKYWA 271
E K+P G I++PT ++VR S+ GYA+GNAI + P + +LK
Sbjct: 371 ----ESKSPRKTGSKFKIIFPTADEVRRSINGYASGNAIHTKILTPAQGKQLAYLKPMLC 426
Query: 272 KW------------------------------KASHTGRSRAMPHIKTFARYNGQK---- 297
W K R RA PHIKT+ R++
Sbjct: 427 HWAGDGAQHSSSSSLSSNPPSESSQSFTSPELKTQEAYRRRAAPHIKTYIRFSSDSTSSS 486
Query: 298 -----LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS---AKRHGCGFS--- 346
+ W L+TSANLSK AWG + ++ I SYE+GVL+ P K++G
Sbjct: 487 SSQKSIDWMLVTSANLSKQAWGESINSADKVRICSYEIGVLVWPDLWEEKQNGKNVKMVP 546
Query: 347 CTSNIVPS--------EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE----VVYLPVP 394
C N PS EI + ++ L D E +V +P
Sbjct: 547 CFGNDTPSIPFVSPSLEIVGQKEIRVEGEEGHLKRKRCDAREDEKRQEESHTIIVGARMP 606
Query: 395 YELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
Y+LP Y +D+PW Y++ D G+ W
Sbjct: 607 YDLPLVSYGKDDIPWCASASYSEPDWMGKTW 637
>gi|307211791|gb|EFN87772.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 530
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 174/351 (49%), Gaps = 38/351 (10%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD WL + + +++++GE K N +P FG HH+K
Sbjct: 170 MVDARWLCLQYLLAGQCTDMMILYGERVD-----KEKLGDNITTVHVEMPFEFGCHHTKI 224
Query: 61 MLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLI 114
M+L Y G+R++V TANL DW N++QG+W+ L + ++ CG F+ DL
Sbjct: 225 MILQYRDNGIRVVVSTANLYSDDWENRTQGMWISPH-LPRLSKAAKRCGESPTNFKKDLQ 283
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL T P K +K +FS+ V LIAS PG ++ WG+ K
Sbjct: 284 RYLGTYHNPA----------LKRWRKLVRKADFSAINVCLIASTPG-RFRHTVNLWGYKK 332
Query: 175 LRTVL-QECTFEKGFKKSPLVYQFSSLGSLD---EKWMA-ELSSSMSSGFSEDKTPLGIG 229
L VL + T + ++ Q SS+G+ E W++ E+ SM+ D
Sbjct: 333 LADVLFRHVTQLPNALEWSIIAQSSSVGNFGPRYEGWLSKEIVRSMAWKTVRDLKDYPKF 392
Query: 230 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF--LKKYWAKWKASHTGRSRAMPHI 287
+ +++P+VE+ S + Y G + + V +K Y +WKA+ TGR++AMP+I
Sbjct: 393 Q--LIYPSVENYEQSFD-YQNGTSCFFYSREVHSKLQWIKSYLYQWKATKTGRNQAMPYI 449
Query: 288 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
K++ R + +++AWF+LTSANL+K AWG + N I +YE+GV LP
Sbjct: 450 KSYTRISPDLKRIAWFVLTSANLNKGAWGVQRSN---YYIANYEVGVAFLP 497
>gi|302797949|ref|XP_002980735.1| hypothetical protein SELMODRAFT_420273 [Selaginella moellendorffii]
gi|300151741|gb|EFJ18386.1| hypothetical protein SELMODRAFT_420273 [Selaginella moellendorffii]
Length = 197
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 69/148 (46%), Positives = 90/148 (60%), Gaps = 28/148 (18%)
Query: 10 ACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV 69
ACP L IP V++IHGES+ + MLL+YP GV
Sbjct: 71 ACPPLRTIPQVVMIHGESNVS-------------------------QLQSVMLLVYPTGV 105
Query: 70 RIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANL 129
R++VHTANLI++DWNNK+QGLWMQDFP K S+ FENDL+DYL+ L+W + ++
Sbjct: 106 RVVVHTANLINIDWNNKNQGLWMQDFPFKSMTGASD---FENDLVDYLTALEWLGCTVDV 162
Query: 130 PAHGNFKINPSFFKKFNFSSAAVRLIAS 157
HG KIN F+ F+FS+AAVRL+AS
Sbjct: 163 QHHGKMKINVGHFQNFDFSNAAVRLVAS 190
>gi|255950552|ref|XP_002566043.1| Pc22g21470 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211593060|emb|CAP99435.1| Pc22g21470 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 520
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 117/426 (27%), Positives = 186/426 (43%), Gaps = 86/426 (20%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P FGTHHSK M+L+ + ++I+HTAN+IH+DW N +Q W PL+ N +
Sbjct: 130 MPEPFGTHHSKMMILLRHDDLAQVIIHTANMIHMDWTNMTQAAWCSPLLPLQKANTAGSQ 189
Query: 107 CG--------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIA 156
F+ DL+ YL A+G K P ++FSS LIA
Sbjct: 190 ADNKIGSGARFKRDLLAYLK------------AYGPKKTGPLVQQLDNYDFSSIRAALIA 237
Query: 157 SVPGY-HTGSSLKK----WGHMKLRTVLQECTFEKGF--KKSPLVYQFSSLGSLDE--KW 207
SVP H S + WG L+ ++ + ++ KK +V Q SS+ +L + KW
Sbjct: 238 SVPSKKHVSDSSSEEDTLWGWPALKDLMSQIPIQQKSPSKKPHVVIQISSVATLGQTNKW 297
Query: 208 MAELSSSMSSGFSEDKTPLGIGEPL---IVWPTVEDVRCSLEGYAAGNAI----PSPQKN 260
+ E+ F + TP +P I++PT +++R SL GY +G++I S +
Sbjct: 298 LKEV-------FFKSLTP----QPTTYSIIFPTPDEIRRSLNGYNSGSSIHMKTQSAAQQ 346
Query: 261 VDKDFLKKYWAKWKASHTGRSRAM------------------PHIKTFARY---NGQKLA 299
+++ + +W + + PHIKT+ R+ + + +
Sbjct: 347 KQLQYMRPHLCQWAGDSLPPGQCIDLSEENPPRREAGRARAAPHIKTYIRFADSDMKTID 406
Query: 300 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 359
W +++SANLS AWGA + ++ I S+E+GV++ P R G G
Sbjct: 407 WAMVSSANLSTQAWGAATNGSGEVRICSWEIGVVVWPDLFRDGA--------------EG 452
Query: 360 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKD 419
G SDA +S VV +PY+LP Y + D PW + D
Sbjct: 453 KAPVPDALMVPCFKRDRPGVSDADTASVVVGFRMPYDLPLTPYGAADEPWCATASHALPD 512
Query: 420 VYGQVW 425
G+ W
Sbjct: 513 WRGESW 518
>gi|307211790|gb|EFN87771.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 553
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 119/428 (27%), Positives = 187/428 (43%), Gaps = 77/428 (17%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
MVD+ WL + + P+++++ + G E N + +P FG HH+K
Sbjct: 182 MVDVGWLCLQYLLAGQRPNMVILCSQRLGEEELGD-----NITVVHVEMPFEFGCHHTKV 236
Query: 61 MLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEEC---------GF 109
M+L Y G+R++V TANL DW N++QG+W+ P LSE F
Sbjct: 237 MILQYKDVGIRVVVSTANLYASDWKNRTQGIWISPHLP-----RLSEAAKWSSGESPTNF 291
Query: 110 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 169
+ DL YL++ + P K +K +FS+ V IAS PG+ +
Sbjct: 292 KKDLQRYLNSYQNPA----------LKRWRDLVRKADFSAVNVCFIASTPGHFRRIDVNL 341
Query: 170 WGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE--DKTPL 226
WG+ KL VL Q K ++ Q S++GS K+ LS + + ++
Sbjct: 342 WGYKKLANVLSQHVMLPPDAPKWSIIAQSSAVGSFGPKYEGWLSKEIVRSMTRETERDLK 401
Query: 227 GIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV--DKDFLKKYWAKWKASHTGRSRAM 284
E ++P+V++ S + Y G++ K V + ++K Y +WKA +G +AM
Sbjct: 402 DYPEFQFIYPSVKNYEQSFD-YQDGSSCFLYMKEVHSKQQWIKSYLYQWKAK-SGCDQAM 459
Query: 285 PHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 342
PHIK++ R + +++AWF+LTSANLSK AWG I +YE+GV LP
Sbjct: 460 PHIKSYTRISPDLKRIAWFVLTSANLSKGAWGV---QRGDYYITNYEVGVAFLPKFITGT 516
Query: 343 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 402
F T + + I P+PY+ P Y
Sbjct: 517 TTFPITDEDLTAPI----------------------------------FPIPYDFPLCPY 542
Query: 403 SSEDVPWS 410
S D P++
Sbjct: 543 DSNDSPFT 550
>gi|315052274|ref|XP_003175511.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma gypseum CBS 118893]
gi|311340826|gb|EFR00029.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma gypseum CBS 118893]
Length = 591
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 126/438 (28%), Positives = 191/438 (43%), Gaps = 79/438 (18%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHHSK M+LI + +II+HTAN+I DW N +Q +W Q ++ +
Sbjct: 168 MPEPFGTHHSKMMILIRHDNLAQIIIHTANMIPRDWGNMTQAVWRSPLLPFSQPHVGDTH 227
Query: 108 G-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASV 158
G F+ DL+ YL A+ N I ++++F + LIASV
Sbjct: 228 GEFGSGARFKRDLLAYLD------------AYNNKTIGLLIHQLQRYDFGAVKAVLIASV 275
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFK---KSPLVYQFSSLGSL--DEKWM 208
P + WG LR ++ + K ++ Q SS+ +L +KW+
Sbjct: 276 PSRLPVKAFDSNRKTLWGWPALRDAIRSIPIDHSSSQTLKPHIIVQVSSIATLGQTDKWL 335
Query: 209 AEL---SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI------PSPQK 259
E S S F++ + I++PT +++R SL+GY +G +I S QK
Sbjct: 336 KETFFGSLCPQSRFNQTISACHANFS-IIFPTPDEIRRSLDGYGSGGSIHMKIQSASQQK 394
Query: 260 NVDKDFLKKYWAKWKAS---------------------HTGRSRAMPHIKTFARYNGQKL 298
+ +L+ Y W GRSRA PHIKT+ R++ +
Sbjct: 395 QLA--YLRHYLCHWAGDAEGQRDPGPATESVKGLAYVREAGRSRAAPHIKTYIRFSDSGM 452
Query: 299 A---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSE 355
+ W ++TSANLS AWGA ++ I S+E+GVLI P R C + +
Sbjct: 453 SSIDWAMVTSANLSTQAWGAGANAQGEVRICSWEIGVLIWPELFRENNIEKCNDSSPINH 512
Query: 356 IK--------SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 407
+K + S E Q ++ LT H DA V + +PY LP Y+ DV
Sbjct: 513 VKMIPCFKRNTPSKEPLQPPESDSTKLTSH--PDATNMIRVGFR-MPYNLPLVPYTPRDV 569
Query: 408 PWSWDKRYTKKDVYGQVW 425
PW + + D GQ W
Sbjct: 570 PWCATAAHREPDWMGQTW 587
>gi|332376511|gb|AEE63395.1| unknown [Dendroctonus ponderosae]
Length = 584
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 117/362 (32%), Positives = 174/362 (48%), Gaps = 41/362 (11%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESD--GTLEHMKRNKPANWILHKPPLPISFGTHHS 58
M+DI WL+ + L I D +E+M+R P N H + FG HH+
Sbjct: 198 MIDIGWLVKQYKAREQDNKPLTILYGDDWPDMVEYMRRFCP-NVKHHFVKMKDPFGCHHT 256
Query: 59 KAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE-----CGFEND 112
K + Y +R++V TANL + DWN+ +QGLW+ K +N +E GF+
Sbjct: 257 KLGIYAYEDESIRVVVSTANLYYEDWNHYNQGLWISPRLAKLPSNSAERDGEAITGFKGH 316
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH----TGSSLK 168
L+DYL + + P + + +F V L+ S PG H GS L
Sbjct: 317 LLDYLRSYQLPILRDWV----------KYVANADFGEVKVALVYSAPGKHYAKQNGSHLH 366
Query: 169 KWGHMKLRTVLQECTF---EKGFKKSPL----VYQFSSLGSLDEKWMAELSSSM-SSGFS 220
+ G + + Q C + PL + Q SS+GS+ + L S+ S S
Sbjct: 367 RVGDL----LSQHCVLPAKTTAQSEGPLSWGILAQASSIGSIGKTAAEWLRGSLLRSLAS 422
Query: 221 EDKTPL-GIGEPLI--VWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKAS 276
++PL G + I V+P+V +V G +G +P S N + +L+ Y +W A
Sbjct: 423 HKQSPLPGNSQATISLVYPSVSNVAHGYFGLESGGCLPYSKATNEKQRWLQTYMHQWIAD 482
Query: 277 HTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
R+RAMPHIK++ R + KLA+FLLTSANLSK+A G + + IRSYE+GV+
Sbjct: 483 ARHRTRAMPHIKSYCRVSPGLDKLAYFLLTSANLSKSARGNNIQKDGGCYIRSYEMGVMF 542
Query: 335 LP 336
LP
Sbjct: 543 LP 544
>gi|67539466|ref|XP_663507.1| hypothetical protein AN5903.2 [Aspergillus nidulans FGSC A4]
gi|40738576|gb|EAA57766.1| hypothetical protein AN5903.2 [Aspergillus nidulans FGSC A4]
gi|259479929|tpe|CBF70601.1| TPA: tyrosyl-DNA phosphodiesterase, putative (AFU_orthologue;
AFUA_2G11070) [Aspergillus nidulans FGSC A4]
Length = 586
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 126/427 (29%), Positives = 198/427 (46%), Gaps = 79/427 (18%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PL----KDQNN 102
+P FGTHHSK M+L+ + ++++HTAN++ DW + Q +W PL +D+N+
Sbjct: 173 MPEPFGTHHSKMMILLRHDDFAQVVIHTANMLAGDWGDMCQAIWRSPLLPLTDGHEDKNS 232
Query: 103 LSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASV 158
+ G F+ DL+ YL A+G K P K++FS+ LIASV
Sbjct: 233 TAWGTGARFKRDLLAYLK------------AYGVKKTGPLVEQLGKYDFSAVRAALIASV 280
Query: 159 PGYH-------TGSSLKKWG----HMKLRTV-LQECTFEKGFKKSP-LVYQFSSLGSL-- 203
P G+S KWG LR V L+E G P +V Q SS+ +L
Sbjct: 281 PSKQKVDASSIDGNSKTKWGWPALKEALRNVPLRENVGADGTATVPHIVTQISSIATLGQ 340
Query: 204 DEKWMAELS-SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQ 258
+KW+ ++ +++++ S KT +++PT E++R SL+GY G +I S
Sbjct: 341 TDKWLKDVFFNALAASSSSTKTRPRYS---VIFPTAEEIRRSLKGYGYGGSIHMKLQSAA 397
Query: 259 KNVDKDFLKKYWAKW----------KASHTGRSRAMPHIKTFARYNGQKLA---WFLLTS 305
+ +L+ Y W + GR RA PHIKT+ R+ Q + W L+TS
Sbjct: 398 QKKQLQYLRPYLCHWAGDVSGQAPKRLQDAGRRRAAPHIKTYIRFADQHMRSIDWALVTS 457
Query: 306 ANLSKAAWGALQKNNSQLMIRSYELGVLI-------LPSAKRHGCGFSCTSNIVPSEIKS 358
ANLS AWGA ++ + S+E+GVL+ P +R S + +VP K
Sbjct: 458 ANLSTQAWGAAANAAGEVRVCSWEIGVLVWPELLTTEPQGQRKHQQQSRSVAMVPCFKKD 517
Query: 359 GSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK 418
+S++ A + ++ +PY+LP YS++D PW + +
Sbjct: 518 KPDPSSKVGN--------------AAPAALIGFRMPYDLPLTPYSTQDEPWCATMSHIEP 563
Query: 419 DVYGQVW 425
D GQ W
Sbjct: 564 DWLGQTW 570
>gi|67484562|ref|XP_657501.1| tyrosyl-DNA phosphodiesterase [Entamoeba histolytica HM-1:IMSS]
gi|56474754|gb|EAL52111.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449702140|gb|EMD42834.1| tyrosylDNA phosphodiesterase, putative [Entamoeba histolytica KU27]
Length = 402
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 108/368 (29%), Positives = 179/368 (48%), Gaps = 44/368 (11%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANW-ILHKPPLPISFGTHHSK 59
+ D+ WL P+L KIP V IH +GTL + + + +P+ G HH K
Sbjct: 45 VFDLQWLFDELPILTKIP-VQFIH---NGTLNYFDQLLIQEYKDFETFSVPLKKGCHHVK 100
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
M+++Y G+R ++ TANLI +D+N KSQG++++DF + + + E G +L+T
Sbjct: 101 IMIILYEGGLRFVLSTANLIPLDYNLKSQGIYIKDFKPSESSTILNEKG-----THFLTT 155
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
L+ S N + S+ F++S+ L+ S+PG H G+ L K+G ++ +L
Sbjct: 156 LQSYFTSVN--------VTISYLSDFDYSTIDGWLLLSIPGIHKGNDLNKYGMKQVYDIL 207
Query: 180 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 239
+ + Q SSLG ++ ELS +++ E K I+WPT +
Sbjct: 208 NNKLHVQFNNHCTIAAQASSLGLFTNQYRRELSLCLTNQ-PESKFQ-------IIWPTED 259
Query: 240 DVRCSLEGYAAGNAIPSPQKNVDKDFLK---KYWAKWKASHTGRSRAMPHIKTFARYNGQ 296
+R S GY + + +F+K Y+ K+ R PHIKT+ Y
Sbjct: 260 FIRTSETGYHGSCSF-----FLRSNFVKTWENYFYKFLPPFP-RHLIQPHIKTYVIYEED 313
Query: 297 KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEI 356
+ +LTS+N+S AAWG + NS L I +YE+G+L + + F+ T +P +I
Sbjct: 314 IPKYGILTSSNISGAAWG--KPTNSSLEINNYEMGMLFIDN-------FTLTRFPLPYDI 364
Query: 357 KSGSTETS 364
K + +S
Sbjct: 365 KQSTKYSS 372
>gi|358393671|gb|EHK43072.1| hypothetical protein TRIATDRAFT_225252 [Trichoderma atroviride IMI
206040]
Length = 1124
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 170/365 (46%), Gaps = 58/365 (15%)
Query: 44 LHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQN 101
LH P+P FGTHHSK M++ +II+HTAN+I DW N + +W PL
Sbjct: 133 LHCAPMPEMFGTHHSKMMVMFRSDNTAQIIIHTANMIPRDWTNMTNAVWQSPKLPLLPVP 192
Query: 102 NLSEECG----------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 151
++ + G F+ DL+ YL +K+ + K F+FSS
Sbjct: 193 DIISQHGQTLPLGSGLRFKADLLSYL--MKYDSYKVTC------KPLADRLGYFDFSSVR 244
Query: 152 VRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKW 207
IASVPG H +S WG L+ LQ G S +V Q SS+ +L ++ W
Sbjct: 245 AAFIASVPGKHDIRDASQPAWGWAGLQRCLQGVPVGPG--GSAIVVQISSIATLGANDDW 302
Query: 208 MAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK------- 259
+ L +S+++ + + +V+PT +++R SL+GYA+GN+I + +
Sbjct: 303 LQRTLFNSLATSLTPNANKPSFK---VVFPTADEIRNSLDGYASGNSIHTKIQSAQHISQ 359
Query: 260 ------------NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN-GQKLAWFLLTSA 306
N KD + +GR+RA PHIKT+ R+N + W +LTSA
Sbjct: 360 LRYLHPILHHWANDSKDGAALFAGASIYGDSGRNRAAPHIKTYIRFNCNTTIDWAMLTSA 419
Query: 307 NLSKAAWG-ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 365
N+SK AWG L+ + I S+E+GVL+ P+ C ++ S +S + S
Sbjct: 420 NMSKQAWGETLKPTTGEFRIASWEVGVLVWPN-------LLCKDGVMLSSFQSDTVNMSP 472
Query: 366 IQKTK 370
+ +
Sbjct: 473 FSQAQ 477
>gi|167389207|ref|XP_001738862.1| tyrosyl-DNA phosphodiesterase [Entamoeba dispar SAW760]
gi|165897690|gb|EDR24772.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba dispar SAW760]
Length = 721
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/338 (28%), Positives = 167/338 (49%), Gaps = 35/338 (10%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
+ D+ WL P+L ++P V IH + + + + ++ P+P+ G HH K
Sbjct: 45 VFDLQWLFNELPILTRVP-VQFIHNGNLSCFDQLLIQQYKDF--QTFPIPLKKGCHHVKI 101
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
M+++Y G+R ++ TANLI +D+N KSQG++++DF + + + E G +L+TL
Sbjct: 102 MIMLYEGGLRFVLSTANLIPIDYNLKSQGIYVKDFKPSESSTVLNEKG-----THFLTTL 156
Query: 121 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 180
+ S N+ S+ F++S+ L+ S+PG H G+ L K+G ++ +L
Sbjct: 157 QNYLASVNVTV--------SYLSDFDYSTIDGWLLLSIPGIHKGNDLNKYGMKQVHDILN 208
Query: 181 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 240
+ + Q SSLG ++ ELS +++ E K I+WPT +
Sbjct: 209 MKLHVQFNNHCTIAAQASSLGLFTSQYRRELSLCLTNQ-PESKFQ-------IIWPTEDF 260
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLK---KYWAKWKASHTGRSRAMPHIKTFARYNGQK 297
+R S GY + + +F+K Y+ K+ R PHIKT+ Y
Sbjct: 261 IRTSETGYHGSCSF-----FLRSNFVKTWENYFYKFLPPFP-RHLIQPHIKTYVIYEEDI 314
Query: 298 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 335
+ +LTS+N+S AAWG + NS L I +YE+G+L +
Sbjct: 315 PKYGILTSSNISGAAWG--KPTNSTLEINNYEIGMLFI 350
>gi|425771231|gb|EKV09680.1| Tyrosyl-DNA phosphodiesterase, putative [Penicillium digitatum Pd1]
gi|425776784|gb|EKV14988.1| Tyrosyl-DNA phosphodiesterase, putative [Penicillium digitatum
PHI26]
Length = 900
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/428 (27%), Positives = 194/428 (45%), Gaps = 70/428 (16%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE 106
+P FGTHHSK M+L+ + ++++HTAN+IH+DW N +Q W+ PL+ ++
Sbjct: 490 MPEPFGTHHSKMMILLRHDDLAQVVIHTANMIHMDWTNMTQAAWLSPLLPLQKATSVESP 549
Query: 107 CG--------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVR--LIA 156
F+ DL+ YL A+G K P + N+ +R LIA
Sbjct: 550 TDAKVGSGARFKRDLLAYLK------------AYGPKKTGPLVQQLDNYDFCPIRAALIA 597
Query: 157 SVPGYHTGSSLKK-----WGHMKLRTVLQECTFEK--GFKKSPLVYQFSSLGSLDE--KW 207
SVP S WG ++ ++ + ++ KK +V Q SS+ +L + KW
Sbjct: 598 SVPSKKHASDSSSDEETLWGWPAVKDLMGQVPIQQKNTSKKPHIVIQTSSVATLGQTNKW 657
Query: 208 MAELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNV 261
+ ++ F + TP +P I++PT +++R SL GY +G +I S +
Sbjct: 658 LKDV-------FFKALTPTHSPQPTYSIIFPTPDEIRRSLNGYNSGVSIHMKIQSAAQQK 710
Query: 262 DKDFLKKYWAKWKAS------------------HTGRSRAMPHIKTFARY---NGQKLAW 300
++ Y +W GR+RA PHIKT+ R+ + + + W
Sbjct: 711 QLQYMSPYLCQWAGDSLPPGQCIDLSEDNPPKREAGRARAAPHIKTYIRFADSDMKTIDW 770
Query: 301 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH-GCGFSCTSNIVPSEIKS- 358
+++SANLS AWGA + ++ I S+E+GV++ P R GC + + + SE ++
Sbjct: 771 AMVSSANLSTQAWGAATNASGEVRICSWEIGVVVWPELFRDGGCDDAASPSASESESRAE 830
Query: 359 GSTETSQIQKTKLVTLTWHGSSD-AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 417
G + SD A +S VV +PY+LP Y + D PW +
Sbjct: 831 GKPPAPDVLMVPCFKRDRPVVSDGAETASMVVGFRMPYDLPLTPYGAGDEPWCATASHAL 890
Query: 418 KDVYGQVW 425
D GQ W
Sbjct: 891 PDWQGQSW 898
>gi|407035177|gb|EKE37579.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba nuttalli P19]
Length = 402
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 179/368 (48%), Gaps = 44/368 (11%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANW-ILHKPPLPISFGTHHSK 59
+ D+ WL P+L +IP V +H +GTL + + + +P+ G HH K
Sbjct: 45 VFDLQWLFDELPILTRIP-VQFVH---NGTLNYFDQLLIQEYKDFETFSVPLKKGCHHVK 100
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
M+++Y G+R ++ TANLI +D+N KSQG++++DF + + + E G +L+T
Sbjct: 101 IMIILYEGGLRFVLSTANLIPLDYNLKSQGIYIKDFKPSESSTVLNEKG-----AHFLTT 155
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
L+ S N + S+ F++S+ L+ S+PG H G+ L K+G ++ +L
Sbjct: 156 LQSYFTSVN--------VTISYLSDFDYSTIDGWLLLSIPGTHKGNDLNKYGMKQVYDIL 207
Query: 180 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 239
+ + Q SSLG ++ ELS +++ E K I+WPT +
Sbjct: 208 NNKLHVQFTNHCTIAAQASSLGLFTNQYRRELSLCLTNQ-PESKFQ-------IIWPTED 259
Query: 240 DVRCSLEGYAAGNAIPSPQKNVDKDFLK---KYWAKWKASHTGRSRAMPHIKTFARYNGQ 296
+R S GY + + +F+K Y+ K+ R PHIKT+ Y
Sbjct: 260 FIRTSETGYHGSCSF-----FLRSNFVKTWENYFYKFLPPFP-RHLIQPHIKTYVIYEED 313
Query: 297 KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEI 356
+ +LTS+N+S AAWG + NS L I +YE+G+L + + F+ T +P +I
Sbjct: 314 IPKYGILTSSNISGAAWG--KPTNSTLEINNYEMGMLFIDN-------FTLTRFPLPYDI 364
Query: 357 KSGSTETS 364
K + +S
Sbjct: 365 KQSTKYSS 372
>gi|157875345|ref|XP_001686067.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania major strain Friedlin]
gi|68129140|emb|CAJ06851.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania major strain Friedlin]
Length = 828
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 134/511 (26%), Positives = 208/511 (40%), Gaps = 153/511 (29%)
Query: 46 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 105
+PPLP++FGT+H+K L I +G+R+ + TANL+ DW KSQG+++QDFP K S
Sbjct: 294 EPPLPVAFGTYHTKMALCINGKGLRVSIFTANLVEQDWCWKSQGIYVQDFPWKPVTERSN 353
Query: 106 ECGFENDLIDYLST------------LKWPEFSANLPAH--------------------- 132
+ +++ + K EF A+L +
Sbjct: 354 DDSAGTIMVETAARSTSNSNNGSNTFTKGAEFVAHLRHYLMRCGVSLASACASPADAASA 413
Query: 133 ----GNFKINPSFFKKFNFSSAAVRLIASVPG----------YHTGSSLKKWGHMKLRTV 178
G F+ + F +F++AAV L++SVPG Y G L + G + R+
Sbjct: 414 AGPLGIFETD--FLSHIDFTAAAVWLVSSVPGTYAHGEVCPVYRVG--LCRLGEVLRRSA 469
Query: 179 LQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFSEDKTPLGIGEPLIV 234
L T L +Q+SS GSL+ ++ L ++M + P G+ + +V
Sbjct: 470 LTTATAPASVD---LSWQYSSQGSLNPAFLNSLQAAMCGESVAVIESGDAPRGVRDVQVV 526
Query: 235 WPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG--------------- 279
+PT E+VR S EG+ G ++P + +F+ W +S G
Sbjct: 527 YPTEEEVRNSWEGWRGGGSLPLCVQCC-HEFVNARLHCWGSSEAGHMAKRAFPRPAKVAA 585
Query: 280 ---------------------------------RSRAMPHIKTFARYNGQK--LAWFLLT 304
R A+PHIK++A + + WFLLT
Sbjct: 586 VHASREDAVDVDGVDSDGGEGTPVSLAGSCAAYRRFALPHIKSYAAVAPDRSCVRWFLLT 645
Query: 305 SANLSKAAWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK-- 357
SANLS+AAWG+L Q + Q ++RSYELGVL + + S S + S+I+
Sbjct: 646 SANLSQAAWGSLSRKVNQHGSRQQLVRSYELGVLYDSHSAIYQSASSWFSVVAKSKIELP 705
Query: 358 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS------------ 404
+ + + +T L G ++ V L PY L P Y+S
Sbjct: 706 NACNSRAMLYETPL-----------GIGTQDVCLYTPYNLLCPTPYASTAALRAHRDAPD 754
Query: 405 -------------EDVPWSWDKRYTKKDVYG 422
DVPW D + +D YG
Sbjct: 755 KGEQAVAGAALDCSDVPWVLDMPHRGRDAYG 785
>gi|391868838|gb|EIT78047.1| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae 3.042]
Length = 389
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 185/397 (46%), Gaps = 72/397 (18%)
Query: 69 VRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE------CGFENDLIDYLSTLK 121
VR+++HTAN+I DW N Q +W PL+ ++ E+ F+ DL+ YL+
Sbjct: 22 VRVVIHTANMIPGDWANMCQAVWRSPLLPLQKTDDRVEDLTLGSGARFKRDLLAYLT--- 78
Query: 122 WPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMK 174
+G K P +K++F + L+ASVP L WG
Sbjct: 79 ---------EYGPKKTGPLVEQLRKYDFGAIRAALVASVPSKQKVDDLDSQKKTLWGWPA 129
Query: 175 LRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWMAELS-SSMSSGFSEDKTPLGI 228
L+ ++++ + K+ +V Q SS+ +L +KW+ ++ +S+S + + P
Sbjct: 130 LKDIMRQIPPAQKTTKATTPHIVTQISSVATLGQTDKWLKDVMFASLSPASTSTRQP--- 186
Query: 229 GEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH------- 277
+ I++PT +++R SL GY +G +I S + +++ Y W H
Sbjct: 187 -KYSIIFPTADEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLRHWAGDHDTAEPSH 245
Query: 278 -----TGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIRSY 328
GR RA PHIKT+ R++ + + W ++TSANLS AWGA + ++ I S+
Sbjct: 246 TSKQDAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNASGEVRICSW 305
Query: 329 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 388
E+G+++ P + ++ +VP+ K + E + + ++ T V
Sbjct: 306 EIGIVVWPQLYVQD---TESATMVPT-FKRDTPEPLENKDSETTPDT------------V 349
Query: 389 VYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+ L +PY+LP Y++ D PW ++ + D GQ W
Sbjct: 350 IGLRMPYDLPLTPYAAHDTPWCATAQHLEPDWLGQTW 386
>gi|156034731|ref|XP_001585784.1| hypothetical protein SS1G_13301 [Sclerotinia sclerotiorum 1980]
gi|154698704|gb|EDN98442.1| hypothetical protein SS1G_13301 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 633
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 133/497 (26%), Positives = 205/497 (41%), Gaps = 111/497 (22%)
Query: 20 VLVIHG----ESDGTLEHMKRN-KPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 73
V V+HG E L M++ K +N L +P FGTHHSK ++L + ++I+
Sbjct: 155 VNVVHGFWKREDQSRLNLMEQALKYSNVKLLTAYMPEMFGTHHSKMLILFRHDSTAQVII 214
Query: 74 HTANLIHVDWNNKSQGLWMQD-FPL--------KDQNNLSEECGFENDLIDYLSTLKWPE 124
HTAN+I DW N +Q +W PL K+ + F+ DL++YL
Sbjct: 215 HTANMIPFDWTNMTQAMWKSPLLPLLDPEKPNPKESGQMGSGSKFKIDLLNYLGAY---- 270
Query: 125 FSANLPAHGNFKINPSFFK---KFNFSSAAVRLIASVPGYHT---GSSLKKWGHMKLRTV 178
H I + K +FS L+AS PG S+ WG L ++
Sbjct: 271 -------HTKRAICKPLIEQLSKHDFSEIRAALVASTPGKQDIELDSTETAWGWAGLSSI 323
Query: 179 LQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWP 236
L+ K + +V Q SS+ SL +KW L+ + S K P + I++P
Sbjct: 324 LKSIPCSK--TQPEIVVQISSIASLGPTDKW---LNQTFFKALSTSKDPSPKPKFKIIFP 378
Query: 237 TVEDVRCSLEGYAAGNAIP----SPQKNVDKDFLKKYWAKWKAS---------------- 276
T +++R S+ GY++G+AI + + +LK W
Sbjct: 379 TADEIRRSINGYSSGSAIHTKILTSAQGKQLAYLKPLLCHWAGDGEQHSSTSQTSSTSES 438
Query: 277 ---------------------HTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAA 312
+ R RA PHIKT+ R++ + + W L+TSANLSK A
Sbjct: 439 ATSSNTSNIALSPHMASPPPQNAHRKRAAPHIKTYIRFSSSSHKTIDWMLVTSANLSKQA 498
Query: 313 WGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVP---SEIKSGSTETSQIQKT 369
WG ++ I SYE+GV++ P G S +VP ++I S TS+++ T
Sbjct: 499 WGENINTAGEVRICSYEIGVIVWPGLWDEG----NKSKMVPCFGTDIPSRPDVTSELEST 554
Query: 370 KLVTLT--------------WHGSSDAGASSE-------VVYLPVPYELPPQRYSSEDVP 408
V T G + SE ++ +PY+LP Y+ D+P
Sbjct: 555 VAVEATSVTADNNNIREKGKGKGREEIEKKSENDTENTILIGARIPYDLPLIPYTKSDIP 614
Query: 409 WSWDKRYTKKDVYGQVW 425
W Y++ D G W
Sbjct: 615 WCASASYSEPDWMGNTW 631
>gi|396459207|ref|XP_003834216.1| hypothetical protein LEMA_P058850.1 [Leptosphaeria maculans JN3]
gi|312210765|emb|CBX90851.1| hypothetical protein LEMA_P058850.1 [Leptosphaeria maculans JN3]
Length = 650
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 120/454 (26%), Positives = 201/454 (44%), Gaps = 92/454 (20%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW--------MQDFPLKD 99
+P FGTHHSK ++L + +II+HTAN+I+ DW+N +Q +W Q +P ++
Sbjct: 209 IPDPFGTHHSKMLILFRHDDTAQIIIHTANMIYRDWSNMTQAVWSSPMLPLSTQKWPTEN 268
Query: 100 QNNLSEECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 155
++ S G F+ DL+ YL+ + K S ++F + I
Sbjct: 269 PDSASHPVGSGLRFKVDLLRYLAAYE-----------RRTKDLVSQLAHYDFFAIRAAFI 317
Query: 156 ASVPGYHTGSSLK-----KWGHMKLRTVLQECTFEKGFKK-SP--LVYQFSSLGSLDEK- 206
SVP + K +G + LR +L + + K SP +V Q SS+ +L +
Sbjct: 318 GSVPSRQNPDASKPSEETSFGWLGLREILTQVPVARRDKSHSPPHIVTQISSIATLGAQP 377
Query: 207 -WMAELSSSMSS----------------GFSEDKTPLGIGEPL--IVWPTVEDVRCSLEG 247
W+ S +SS S P P I++PT E++R L+G
Sbjct: 378 TWLTHFQSVLSSEPKVSNTAVSGSTKTASASPKHAPNNPPPPTFSIIFPTPEELRTCLDG 437
Query: 248 YAAGNAI----PSPQKNVDKDFLKKYWAKW--------------KASHTGRSRAMPHIKT 289
YA+G +I S Q+ ++ + W +A+H R A PHIKT
Sbjct: 438 YASGASIHWKLQSAQQQKQLAYMHPFLRHWHSPAPTSPPQDSPRRAAH--RGPAAPHIKT 495
Query: 290 FARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
+ R++ Q + W LLTSANLSK AWG + +++ ++S+E GV++ P+ H
Sbjct: 496 YIRFSNQDHTTIDWALLTSANLSKQAWGDVVGKKNEMRVQSWEAGVVLWPALFAHNS-VP 554
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSS---------------DAGASSEVVYL 391
+ P+ + + +Q+ L +GS+ ++ + VV
Sbjct: 555 GNRALAPAIMVPVFARDAPLQE-DLAGWLRNGSAAHNHNVCADRVSPVRNSAVNVTVVGF 613
Query: 392 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
+PY+LP Y+++++PW RY + D G W
Sbjct: 614 RMPYDLPLCPYTADEMPWCATMRYAEPDGKGMAW 647
>gi|440634212|gb|ELR04131.1| hypothetical protein GMDG_01435 [Geomyces destructans 20631-21]
Length = 570
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 140/491 (28%), Positives = 218/491 (44%), Gaps = 98/491 (19%)
Query: 1 MVDIDWLLPAC-PVLAKIPHVLVIHG--ESDGTLEHMKRN--KPANWILHKPPLPISFGT 55
M D+D+L+ P + V+HG + + L HMK K N L +P FGT
Sbjct: 110 MHDLDFLMSNMDPDTKDTVKIHVVHGYWKQESGL-HMKSQALKYPNVHLRCAYMPEIFGT 168
Query: 56 HHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEEC-GFEND 112
HH+K M+L+ + +II+HTAN+I DW N SQ W PL L+++ +
Sbjct: 169 HHTKMMVLLRHDDQAQIIIHTANMIPQDWANLSQDAWTSPLLPLLPAEKLADQTLARGSK 228
Query: 113 LIDYLSTLKWP-EFSANLPAHGNFKI--NPSF--FKKFNFSSAAVRLIASVPGYHTGSSL 167
Y S L++ +F L A+ + + P K++FSS L+ VPG H S
Sbjct: 229 SASYGSGLRFKLDFLGYLKAYDSRRTICKPLIEELLKYDFSSIRGALVGHVPGRHHVESD 288
Query: 168 KK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE--LSSSMSSGFSE 221
+G +R +L G K +V Q SS+ +L ++W+ + ++ +S S
Sbjct: 289 NPTLFGWSAIRAILNTIPVHNG-DKPEVVAQVSSIATLGVTDQWLQKTLFAALSASSNSP 347
Query: 222 DKTP-LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD----FLKKYWAKWKAS 276
KTP LG IV+PT +++R SL+GY +G +I + V ++ +LK + W
Sbjct: 348 SKTPKLG-----IVFPTPDEIRKSLDGYNSGGSIHVRIQTVAQEKQLQYLKPLFYHWAGD 402
Query: 277 H---------------------------------------TGRSRAMPHIKTFARYNGQ- 296
+ GR+RA PHIKT+ R+ +
Sbjct: 403 NRPVSPPSTSSPGPSTVASTVREAWQNRAGPSAVASTVREAGRNRAAPHIKTYIRFADEA 462
Query: 297 --KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 354
++ W L+TSANLSK AWG + I SYELGVL+ PS ++ + +VP
Sbjct: 463 KTRIDWALVTSANLSKQAWGERLNAAGDVRICSYELGVLVSPSM------YAEDAVMVP- 515
Query: 355 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 414
T Q + K +A + +PY+LP RY +++ PW K
Sbjct: 516 --------TFQTDRPK----------EAVDGKITIGCRMPYDLPLVRYGADEEPWCATKA 557
Query: 415 YTKKDVYGQVW 425
Y + D G+ +
Sbjct: 558 YEELDWMGRSY 568
>gi|448079213|ref|XP_004194340.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
gi|359375762|emb|CCE86344.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
Length = 575
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 126/431 (29%), Positives = 180/431 (41%), Gaps = 92/431 (21%)
Query: 46 KPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
K LP FGTHH+K M+ Y G II+ T NL +D++ +Q W K ++ +
Sbjct: 182 KVDLPKRFGTHHTKMMVNFYEDGTCEIIIMTCNLQPIDFSALTQMCWRSGRLSKASSSNA 241
Query: 105 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPG-- 160
+ F+ D+I YL + P KIN KF+ S V L+ASVPG
Sbjct: 242 GQNRFKTDIIRYLKRYRKP------------KINELADTLAKFDMSGIDVELVASVPGNF 289
Query: 161 --YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 218
+++G+ KL VL+ G + + Y + + A + +S
Sbjct: 290 NLARATDESEEYGYGKLYQVLKRNDLLLGNEDTDKEYNVLAQATSISYPFALKEKNTASV 349
Query: 219 FSEDKTPLGIGE--------------------------PLIVWPTVEDVRCSLEGYAAGN 252
FS PL P I++P +D+ S G+ +G
Sbjct: 350 FSHIICPLVFSRNSERLFDVLEPGTKSFRDHQIKHSYNPHIIYPCAKDIALSGTGFYSGQ 409
Query: 253 AI------PSPQKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFARYNG---QKLAWF 301
AI + +N + +K Y KW+ASH GR PH+K + NG + L W
Sbjct: 410 AIHFKYDTSAIHRNQYEQNIKPYLYKWRASHKNAGRDETPPHVKLYMCDNGDNWKTLRWV 469
Query: 302 LLTSANLSKAAWGALQK------NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSE 355
L+ S NLSK AWGA ++ + S I SYELGVLI PS H +VP
Sbjct: 470 LMASHNLSKQAWGARRELRYRSADPSAYEISSYELGVLI-PSKSDH--------KLVPVF 520
Query: 356 IKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRY 415
S E S+ G V + +P+ LPP+RYSS+D PWS Y
Sbjct: 521 DSSHQQEVSE-----------QGD---------VPVRIPFILPPERYSSDDKPWSAYSNY 560
Query: 416 -TKKDVYGQVW 425
+ KD +G W
Sbjct: 561 GSLKDKFGNTW 571
>gi|330928975|ref|XP_003302469.1| hypothetical protein PTT_14295 [Pyrenophora teres f. teres 0-1]
gi|311322144|gb|EFQ89422.1| hypothetical protein PTT_14295 [Pyrenophora teres f. teres 0-1]
Length = 621
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 112/444 (25%), Positives = 191/444 (43%), Gaps = 83/444 (18%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL--------- 97
+P FGTHHSK ++L + +II+HTAN+IH DW N +Q +W+ PL
Sbjct: 191 IPDPFGTHHSKMLVLFRHDDTAQIIIHTANMIHRDWANMTQAVWVSPQLPLLSRASQSQS 250
Query: 98 -KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 156
+ N + F++DL+ Y+ + K + + ++FSS I
Sbjct: 251 DTNTNPIGSGERFKSDLLRYIGAYE-----------KRLKGLIAQLEDYDFSSIRAAFIG 299
Query: 157 SVPGYHTGS----SLKKWGHMKLRTVLQECTFEKGFKKSP--LVYQFSSLGSLDE--KWM 208
SVP S +G + L+ +L K SP +V Q SS+ +L W+
Sbjct: 300 SVPSRQKPGRAIPSTTSFGWLGLKEILSTIPISKAKAFSPPHIVAQVSSIATLGAAPTWL 359
Query: 209 AELSSSMSS---------------------GFSEDKTPLGIGEP---LIVWPTVEDVRCS 244
+ L S +SS F++ + I +++P E++R S
Sbjct: 360 SNLQSVLSSYSKATTSVPENTTVSFTKASSFFTKRDDSVRIASSPKFSVIFPNPEEIRNS 419
Query: 245 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASHTG--------------RSRAMPH 286
L+GY +G +I S Q+ +++ W ++ + R A PH
Sbjct: 420 LDGYGSGGSIHWKLQSAQQQKQLEYMHPMLCHWASTPSAPALASTDVPRREAHRGPAAPH 479
Query: 287 IKTFARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 343
IKT+ R++ + + W +LTSANLSK AWG + ++ I+S+E GV++ P+
Sbjct: 480 IKTYIRFSDDEQNTIDWAMLTSANLSKQAWGDVVNKKEEIWIQSWETGVVVWPAL----- 534
Query: 344 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHG-SSDAGASSEVVYLPVPYELPPQRY 402
F+ T+ E+ + + G ++ +V +PY+LP + Y
Sbjct: 535 -FAETTQAAVDEVVMVPMFGKDMPGVDDNGVNLEGKEAEEMRPKTIVGFRMPYDLPLKPY 593
Query: 403 SSEDVPWSWDKRYTKKDVYGQVWP 426
++++ PW YT+ D G WP
Sbjct: 594 TADEKPWCATMAYTEPDRNGHAWP 617
>gi|19075361|ref|NP_587861.1| tyrosyl-DNA phosphodiesterase Tdp1 [Schizosaccharomyces pombe
972h-]
gi|74625832|sp|Q9USG9.1|TYDP1_SCHPO RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase
gi|6066756|emb|CAB58371.1| tyrosyl-DNA phosphodiesterase Tdp1 [Schizosaccharomyces pombe]
Length = 536
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 133/498 (26%), Positives = 207/498 (41%), Gaps = 99/498 (19%)
Query: 2 VDIDWLLPAC------PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGT 55
VD+++LL V +I H +S L + P N L+ +P+ +GT
Sbjct: 62 VDLNFLLENMHASVFPTVDVRITHGYDSKSDSLARLTAQMNHCPVNVKLYSVYVPM-WGT 120
Query: 56 HHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ--------------------- 93
HHSK M+ + +I++HTANL+ DW SQ ++
Sbjct: 121 HHSKIMVNFFKDDSCQIVIHTANLVEPDWIGMSQAIFKTPLLYPKANDSLSTSSVPEYGN 180
Query: 94 ---------DFPLKDQNN---LSEECGFEN----------DLIDYLSTLKWPEFSANLPA 131
+KD N + + FEN D + + +F A L
Sbjct: 181 PSKIRKHEGSLDIKDDRNCDIIDVDSAFENFKHKSDTRSSDDLGVIGRQFQQDFLAYLKN 240
Query: 132 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 191
+ + K ++FS+ I SVPG G WG KL+ +L+ EK KK
Sbjct: 241 YRHTYELIEKLKMYDFSAIRAIFIGSVPGKFEGEEESSWGLGKLKKILK--MLEKDSKKD 298
Query: 192 P---------LVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVR 242
+ Q SS+GS K E + ++ GF + G ++PTV++V+
Sbjct: 299 EKTKFEESDICISQCSSMGSFGPK--QEYIAELTDGFGCQR-----GNWKFLFPTVKEVQ 351
Query: 243 CSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--N 294
S+ G+ +G++I + V+ K KW A GR R PHIKT+ R+ +
Sbjct: 352 QSMLGWQSGSSIHFNILGKTAASQVETLKKGKNLCKWVAMKAGRQRVAPHIKTYMRFSND 411
Query: 295 GQKLAWFLLTSANLSKAAWGALQKNNSQ------LMIRSYELGVLILPSAKRHGCGFSCT 348
G+ L W L+TSANLSK AWG L+ + ++ L IRSYE GVL+ P C
Sbjct: 412 GELLRWVLVTSANLSKPAWGTLEGHKAKSRSTRGLRIRSYEAGVLLYPKLFEESQRAPC- 470
Query: 349 SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVP 408
I+ K+ + + ++ ++G V+ + + ++ PP Y +D
Sbjct: 471 --IMTPTYKTNTPNLDEKRR------EFYG-------KRVIGVRMCWDFPPVEYEDKDEI 515
Query: 409 WSWDKRYTKKDVYGQVWP 426
WS T KD G VWP
Sbjct: 516 WSPVINRTDKDWLGYVWP 533
>gi|327358116|gb|EGE86973.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis ATCC 18188]
Length = 655
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 136/553 (24%), Positives = 215/553 (38%), Gaps = 142/553 (25%)
Query: 1 MVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
M D+D+L+ + + +V ++HG ES + E +R I+ P P
Sbjct: 114 MFDVDFLMSQFDEDVRNLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP-- 171
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW----------MQDFPLKDQN 101
FGTHHSK M+LI + ++++HTAN+I DW N Q +W M+ P +
Sbjct: 172 FGTHHSKMMILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMCHEMKRGPGSTAS 231
Query: 102 N-LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASV 158
N F+ DLI YL A+G K P +K++FS+ L+ASV
Sbjct: 232 NRFGSGIRFKRDLIAYLE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASV 279
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSL--DEKW 207
P L WG L+ +Q+ KG + +V Q SS+ +L +KW
Sbjct: 280 PSRQAIDELDSEKHTLWGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKW 339
Query: 208 MAELSSSMSSGFSEDKTPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----P 255
+ E + S + G+ +P I++PT +++R SL GYA+G +I
Sbjct: 340 LKETFFAALSPSPSRSSSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQ 399
Query: 256 SPQKNVDKDFLKKYWAKWKAS--------------------------------------- 276
S + ++L+ Y +W
Sbjct: 400 SSAQRKQLEYLRPYLCRWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDK 459
Query: 277 ------HTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRS 327
GR RA PHIKT+ R++ L W +++SANLS AWGA ++ I S
Sbjct: 460 NGQPIRQAGRRRAAPHIKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICS 519
Query: 328 YELGVLILPS--------------------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 367
+E+GV++ P G + + ++
Sbjct: 520 WEIGVIVWPDLFVNRKVDDDEDDDEDDDDDDDDDDDGSEWKEKGKGKKARENDRRGARED 579
Query: 368 KTKLVTL---------TWHGSSDAGAS------SEVVYLPVPYELPPQRYSSEDVPWSWD 412
K K+ + D+G+S + V L +PY+LP Y+ +D PW
Sbjct: 580 KNKVAVMLPCFKQDMPEVRVDKDSGSSTTTATTTTFVGLRMPYDLPLSPYTPQDQPWCAT 639
Query: 413 KRYTKKDVYGQVW 425
Y + D GQ W
Sbjct: 640 ASYKETDWLGQTW 652
>gi|189210395|ref|XP_001941529.1| tyrosyl-DNA phosphodiesterase 1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977622|gb|EDU44248.1| tyrosyl-DNA phosphodiesterase 1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 624
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/451 (25%), Positives = 193/451 (42%), Gaps = 98/451 (21%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQ--NNLS 104
+P FGTHHSK ++L + ++++HTAN+IH DW N +Q +W P+ Q +LS
Sbjct: 195 IPDPFGTHHSKMLILFRHDDTAQVVIHTANMIHRDWANMTQAVWASPQLPMLSQASQSLS 254
Query: 105 EECG---------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 155
+ F++DL+ Y+ + K + ++FSS I
Sbjct: 255 DSDKTYPIGSGQRFKSDLLRYIGAYE-----------KRLKGLAAQLGDYDFSSIRAAFI 303
Query: 156 ASVPGYH----TGSSLKKWGHMKLRTVLQECTFEKGFKKSP--LVYQFSSLGSLDE--KW 207
S P SS +G + L+ +L K SP +V Q SS+ +L W
Sbjct: 304 GSAPSRQKPERAVSSNNSFGWLGLKEILSTVPISKARASSPPHIVAQVSSIATLGAAPTW 363
Query: 208 M--------------------AELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCS 244
+ A +SS+ +S F++ T + I++PT E++R S
Sbjct: 364 LSNFQSVLSSHSKATVSVPENATVSSTKASTFTKRDTSVTKAPSPKFSIIFPTPEEIRNS 423
Query: 245 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA--------------SHTGRSRAMPH 286
L GY +G +I S Q+ +++ W + R A PH
Sbjct: 424 LNGYGSGGSIHWKLQSAQQQKQLEYMHPMLCHWTSTPSASASSLTNVSKQEAHRGPAAPH 483
Query: 287 IKTFARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 343
IKT+ R++ ++ + W +LTSAN SK AWG ++ I+S+E GV++ P+
Sbjct: 484 IKTYIRFSDEEQKAIDWAMLTSANFSKQAWGDTVNKKEEIWIQSWETGVVVWPALFAETA 543
Query: 344 GFSCTSNIVP--------SEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 395
++VP E +T+ ++ +T++ T V L +PY
Sbjct: 544 KGVNEVSMVPVFGKDMPKVEDARVNTKGKEVGETRIKT--------------TVGLRMPY 589
Query: 396 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 426
+LP + Y++++ PW YT+ D G WP
Sbjct: 590 DLPLKPYTADEKPWCATMAYTEPDRNGHFWP 620
>gi|195161240|ref|XP_002021476.1| GL26495 [Drosophila persimilis]
gi|194103276|gb|EDW25319.1| GL26495 [Drosophila persimilis]
Length = 511
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 86/242 (35%), Positives = 127/242 (52%), Gaps = 23/242 (9%)
Query: 107 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 166
GF DL+ YL K + + + +K +FS+ V + SVPG H S
Sbjct: 235 TGFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFVGSVPGGHREGS 284
Query: 167 LKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKT 224
++ WGH +L ++L + + P+V Q SS+GSL A + + +D +
Sbjct: 285 VRGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLRKDSS 343
Query: 225 PLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTG 279
P G + +++P+ +V S +G G +P + DK +LK + +WK+S
Sbjct: 344 PGGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQPWLKAHLQQWKSSDRH 403
Query: 280 RSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQ---LMIRSYELGVLI 334
RSRAMPHIKT++RYN Q + WF+LTSANLSKAAWG+ KN + L I +YE GVL
Sbjct: 404 RSRAMPHIKTYSRYNLTDQSIYWFVLTSANLSKAAWGSFNKNTNLQPCLRIANYEAGVLF 463
Query: 335 LP 336
LP
Sbjct: 464 LP 465
>gi|451851539|gb|EMD64837.1| hypothetical protein COCSADRAFT_36213 [Cochliobolus sativus ND90Pr]
Length = 610
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 120/441 (27%), Positives = 187/441 (42%), Gaps = 93/441 (21%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PL-----KDQN 101
+P FGTHHSK ++L + ++++HTAN+IH DW N +Q +W PL +Q+
Sbjct: 198 IPDPFGTHHSKMLILFRHDDTAQVVIHTANMIHRDWANMTQAVWASPLLPLLRHTTSEQS 257
Query: 102 NLSE--ECG----FENDLIDYL-----------STLKWPEFS-----------------A 127
N S+ G F+ DL+ YL S LK+ +FS A
Sbjct: 258 NSSKIHSIGSGERFKVDLLRYLYAYGMRLGALTSQLKYYDFSSIRAAFLGSAPSKQKLTA 317
Query: 128 NLPAHGNF------KINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE 181
P+H F +I S K + S ++ + T + W +++L
Sbjct: 318 AGPSHTAFGWLGLDQILSSIPVKASGDSLRPHIVTQISSVATLGATPTW-LFHFQSILSR 376
Query: 182 CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDV 241
C K +K F+ L K + + + FS +V+PT ++
Sbjct: 377 CPDAKDTEKEEASSSFTKASMLFTKQESNAAEAPEPKFS------------VVFPTPAEI 424
Query: 242 RCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------KASHTGRSRAMPHIKT 289
R L+GY AG +I S Q+ +++ W R A PHIKT
Sbjct: 425 RMPLDGYTAGGSIHWKFQSVQQQKQLEYMHPILCHWTPVSRPDPSQQEAHRGTAAPHIKT 484
Query: 290 FARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
+ R++ + + W LLTSANLSK AWG + N ++ ++S+E GV++ P+ F
Sbjct: 485 YIRFSDETHTTIDWALLTSANLSKQAWGDVMNKNEEIRVQSWETGVVMWPAL---FAEFE 541
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
+S +VP + + ET + HG G VV +PY LP YS+++
Sbjct: 542 HSSTMVPV-FGADNPETGK-----------HGE---GKRETVVGFRMPYNLPLVPYSADE 586
Query: 407 VPWSWDKRYTKKDVYGQVWPR 427
PW Y + D YG W R
Sbjct: 587 RPWCATLAYEEPDRYGLTWAR 607
>gi|354543539|emb|CCE40258.1| hypothetical protein CPAR2_102960 [Candida parapsilosis]
Length = 532
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 112/412 (27%), Positives = 164/412 (39%), Gaps = 87/412 (21%)
Query: 49 LPISFGTHHSKAML-LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHH+K M+ + +I+ + NL +D+ +Q +W + ++
Sbjct: 149 IPSRFGTHHTKMMINFFHGNSAEVIIMSCNLTKLDFGGLTQMIWRSGRLARGNTTGTKSI 208
Query: 108 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGSS 166
F++DLI YL T + P+ A + F+FS V LIAS PG Y +
Sbjct: 209 KFKSDLIGYLRTYEKPQIDTLATA----------LETFSFSGIDVDLIASSPGHYDLNNE 258
Query: 167 LKKWGHMKLRTVLQECT-----------FEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 215
+G+ L + F + S + Y F+ L M
Sbjct: 259 EPHYGYGSLFDACKRNDLLIDNRDKSHHFNVLAQTSAISYPFAVEKGATAGVFTHLLCPM 318
Query: 216 SSGFSEDKTPLGIG-------------EPLIVWPTVEDVRCSLEGYAAGNAI------PS 256
+E L G P IV+P+V++V S G+AAG AI
Sbjct: 319 LFSKNEKFCLLAPGAQSLRRHQSKHNYTPSIVFPSVDEVAASTVGFAAGQAIHFDYSRSY 378
Query: 257 PQKNVDKDFLKKYWAKWKASH----TGRSRAMPHIKTFARYNG---QKLAWFLLTSANLS 309
KN +K Y KW + TGR R MPH+K + NG + + W + S NLS
Sbjct: 379 VHKNYYNQAIKPYHKKWDSGDVKVFTGRERVMPHVKLYMCDNGDNWETIKWCYMGSHNLS 438
Query: 310 KAAWGALQKNN------SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 363
K AWG+ + N SQ + SYELG+L+ P + + PS +
Sbjct: 439 KQAWGSRKGNKFVNNDPSQYEVNSYELGILVTPRP---------NTKMKPSYL------- 482
Query: 364 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRY 415
SDAG V Y+ +P++LPP YS D PWS Y
Sbjct: 483 ----------------SDAGTEGGVTYIRMPFKLPPAAYSDNDKPWSGHVSY 518
>gi|240276539|gb|EER40051.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus H143]
Length = 685
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 122/428 (28%), Positives = 183/428 (42%), Gaps = 109/428 (25%)
Query: 1 MVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
M D+D+L+ + + V +IHG ES + E +R I+ P P
Sbjct: 112 MFDVDFLMSQFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP-- 169
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ---------- 100
FGTHHSK M+LI + ++++HTAN+I DW N Q +W P++ +
Sbjct: 170 FGTHHSKMMILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGHASATL 229
Query: 101 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASV 158
+ + F+ DL+ YL A+GN K P +K++F + LIASV
Sbjct: 230 DGVGRGNRFKRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASV 277
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KW 207
P L WG L+ +Q+ G KK ++ Q SS+ +L + KW
Sbjct: 278 PTRQAIDELDSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKW 337
Query: 208 MAEL-------SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI---- 254
+ E S +S KT P I++PT +++R SL GYA+G +I
Sbjct: 338 LKETFFAALSPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKL 394
Query: 255 PSPQKNVDKDFLKKYWAKW----------KASHT-------------------------- 278
S + ++L+ Y +W A H+
Sbjct: 395 QSAAQRKQLEYLRPYLCRWAGDTGDGSDISAKHSINSGQERCDSKDANESVRKYVTTGKN 454
Query: 279 -------GRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSY 328
GR RA PHIKT+ R++ LA W ++TSANLS AWGA ++ I S+
Sbjct: 455 SQPIRNAGRRRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEIRICSW 514
Query: 329 ELGVLILP 336
E+GVLI P
Sbjct: 515 EIGVLIWP 522
>gi|325092032|gb|EGC45342.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus H88]
Length = 682
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 122/428 (28%), Positives = 183/428 (42%), Gaps = 109/428 (25%)
Query: 1 MVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
M D+D+L+ + + V +IHG ES + E +R I+ P P
Sbjct: 112 MFDVDFLMSQFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP-- 169
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ---------- 100
FGTHHSK M+LI + ++++HTAN+I DW N Q +W P++ +
Sbjct: 170 FGTHHSKMMILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGHASATL 229
Query: 101 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASV 158
+ + F+ DL+ YL A+GN K P +K++F + LIASV
Sbjct: 230 DGVGRGNRFKRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASV 277
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLD--EKW 207
P L WG L+ +Q+ G KK ++ Q SS+ +L +KW
Sbjct: 278 PTRQAIDELDSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKW 337
Query: 208 MAEL-------SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI---- 254
+ E S +S KT P I++PT +++R SL GYA+G +I
Sbjct: 338 LKETFFAALSPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKL 394
Query: 255 PSPQKNVDKDFLKKYWAKW----------KASHT-------------------------- 278
S + ++L+ Y +W A H+
Sbjct: 395 QSAAQRKQLEYLRPYLCRWAGDTGDGSDISAKHSINSGQERRDSKDANESVRKYVTTGKN 454
Query: 279 -------GRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSY 328
GR RA PHIKT+ R++ LA W ++TSANLS AWGA ++ I S+
Sbjct: 455 SQPIRNAGRRRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEIRICSW 514
Query: 329 ELGVLILP 336
E+GVLI P
Sbjct: 515 EIGVLIWP 522
>gi|225555717|gb|EEH04008.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus G186AR]
Length = 637
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 132/512 (25%), Positives = 207/512 (40%), Gaps = 136/512 (26%)
Query: 1 MVDIDWLLPACPV-LAKIPHVLVIHGESDGTLEHMKRNKP----ANWILHKPP------- 48
M D+D+L+ + + V +IHG KR P + H+ P
Sbjct: 112 MFDVDFLMSQFDEDVRDLVKVKIIHGS-------WKRESPNRIRVDEACHRYPNVEPIVA 164
Query: 49 -LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----- 100
+P FGTHHSK M+LI + ++++HTAN+I DW N Q +W P++ +
Sbjct: 165 YMPEPFGTHHSKMMILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGH 224
Query: 101 -----NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVR 153
+ + F+ DL+ YL A+GN K P +K++F +
Sbjct: 225 ASATLDGVGRGNRFKRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAG 272
Query: 154 LIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSL- 203
LIASVP L WG L+ +Q+ G KK ++ Q SS+ +L
Sbjct: 273 LIASVPTRQAIDELDSEKQTLWGWPALKDTIQQIPLGGGNNTVGKKPQIIIQISSVATLG 332
Query: 204 -DEKWMAEL-------SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNA 253
+KW+ E S +S KT P I++PT +++R SL GYA+G +
Sbjct: 333 QTDKWLKETFFAALSPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGS 389
Query: 254 I----PSPQKNVDKDFLKKYWAKWKAS--------------------------------- 276
I S + ++L+ Y +W +
Sbjct: 390 IHMKLQSAAQRKQLEYLRPYLCRWASDTGDGSDISAKHSINSGQERCESKNVNESVQKCV 449
Query: 277 ----------HTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQL 323
+ GR RA PHIKT+ R++ LA W ++TSANLS AWGA ++
Sbjct: 450 ATSKNSQPIRNAGRCRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEI 509
Query: 324 MIRSYELGVLILPS------AKRHGCGFSCTSNIVPSEI-------KSGSTETSQIQ--- 367
I S+E+GVL+ P ++ G G E+ +G + + +
Sbjct: 510 RICSWEIGVLVWPDLFIDREVEKDGGGTGRNGKENGKELPRDDGNKNNGYNKPAAVMLPC 569
Query: 368 -KTKLVTLTWHGSSDAGASSEVVYLPVPYELP 398
K + + S A +S V L +PY+LP
Sbjct: 570 FKQDMPEVPEDNGSGASTTSTFVGLRMPYDLP 601
>gi|261191861|ref|XP_002622338.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis SLH14081]
gi|239589654|gb|EEQ72297.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis SLH14081]
Length = 653
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 116/429 (27%), Positives = 181/429 (42%), Gaps = 107/429 (24%)
Query: 1 MVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
M D+D+L+ + + +V ++HG ES + E +R I+ P P
Sbjct: 114 MFDVDFLMSQFDEDVRNLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP-- 171
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW----------MQDFPLKDQN 101
FGTHHSK M+LI + V++++HTAN+I DW N Q +W M+ P +
Sbjct: 172 FGTHHSKMMILIRHDDQVQVVIHTANMIAGDWANMCQAVWRSPLLPMCPEMEHGPGSTAS 231
Query: 102 N-LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASV 158
N F+ DLI YL A+G K P +K++FS+ L+ASV
Sbjct: 232 NRFGSGIRFKRDLIAYLE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASV 279
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSL--DEKW 207
P L WG L+ +Q+ KG + +V Q SS+ +L +KW
Sbjct: 280 PSRQAIDELDSEKHTLWGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKW 339
Query: 208 MAELSSSMSSGFSEDKTPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----P 255
+ E + S + G+ +P I++PT +++R SL GYA+G +I
Sbjct: 340 LKETFFAALSPSPSRSSSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQ 399
Query: 256 SPQKNVDKDFLKKYWAKWKAS--------------------------------------- 276
S + ++L+ Y +W
Sbjct: 400 SSAQRKQLEYLRPYLCRWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDK 459
Query: 277 ------HTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRS 327
GR RA PHIKT+ R++ L W +++SANLS AWGA ++ I S
Sbjct: 460 NGQPIRQAGRRRAAPHIKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICS 519
Query: 328 YELGVLILP 336
+E+GV++ P
Sbjct: 520 WEIGVIVWP 528
>gi|154273448|ref|XP_001537576.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150416088|gb|EDN11432.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 610
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 119/428 (27%), Positives = 181/428 (42%), Gaps = 109/428 (25%)
Query: 1 MVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
M D+D+L+ + + V +IHG ES + E +R I+ P P
Sbjct: 112 MFDVDFLMSQFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP-- 169
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ---------- 100
FGTHHSK M+LI + ++++HTAN+I DW N Q +W P++ +
Sbjct: 170 FGTHHSKMMILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMENGHSYATL 229
Query: 101 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASV 158
+ + F+ DL+ YL A+GN K P +K++F + LIASV
Sbjct: 230 DGVRRGNRFKRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASV 277
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLD--EKW 207
P L WG L+ +Q+ G KK ++ Q SS+ +L +KW
Sbjct: 278 PTRQAIDELDSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKW 337
Query: 208 MAEL-------SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI---- 254
+ E S +S KT P I++PT +++R SL GYA+G +I
Sbjct: 338 LKETFFAALSPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKL 394
Query: 255 PSPQKNVDKDFLKKYWAKWKAS-------------------------------------- 276
S + ++L+ Y +W
Sbjct: 395 QSAAQRKQLEYLRPYLCRWAGDTGDGSDISAKHPINSGQERCDSKDANESVQKYVTTGKN 454
Query: 277 -----HTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSY 328
+ GR RA PHIKT+ R++ LA W ++TSANLS AWGA ++ I S+
Sbjct: 455 SQPIRNAGRRRAAPHIKTYIRFSDADLATIDWAMVTSANLSVQAWGAAANGKKEIRICSW 514
Query: 329 ELGVLILP 336
E+GVL+ P
Sbjct: 515 EIGVLVWP 522
>gi|260945317|ref|XP_002616956.1| hypothetical protein CLUG_02400 [Clavispora lusitaniae ATCC 42720]
gi|238848810|gb|EEQ38274.1| hypothetical protein CLUG_02400 [Clavispora lusitaniae ATCC 42720]
Length = 748
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 120/419 (28%), Positives = 177/419 (42%), Gaps = 88/419 (21%)
Query: 48 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 106
PLP F +HHSK M+ YP V II+ T NL +D+ +Q +W + +
Sbjct: 369 PLPNRFASHHSKMMINFYPHDQVEIIIMTCNLTQLDFGGLTQSVWRSGKLKRGKTTAKLG 428
Query: 107 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY----H 162
F+ DL YL K + + +N++S V L+AS PG H
Sbjct: 429 SRFKQDLERYLLKYKMATIEKVVQR----------LRDYNYNSVGVELVASAPGTYSIDH 478
Query: 163 TGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS--- 217
+ + +G+ KLR VLQ + + K ++ Q +S+ + +S +S
Sbjct: 479 IDENDETYGYGKLRQVLQRNDLLIKDTEKHHNILAQVTSIAYPYSSRKGDTASILSHLLC 538
Query: 218 --GFSEDKTPLGIG-------------EPLIVWPTVEDVRCSLEGYAAGNAIPSP----- 257
FS K L G +P +V+PTV++V S G+ +G+A+
Sbjct: 539 PLMFSHWKKHLEPGTQSTSKHQEEFKYKPQLVFPTVKEVASSNFGFLSGSAVHFKHSGSL 598
Query: 258 --QKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSK 310
QK +++ +K Y KW TGR R PH+K +A NG L W L+ S NLSK
Sbjct: 599 IHQKQYEQN-VKPYLCKWSTPENVTGRERVTPHVKYYACDNGDGWNTLKWVLVGSHNLSK 657
Query: 311 AAWG--ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQK 368
AWG + + SYEL VL+ S K N+VP K
Sbjct: 658 QAWGYPEAKSKGQTFDVASYELSVLVPGSGK----------NLVPVFKKD---------- 697
Query: 369 TKLVTLTWHGSSDAGASSEVVYLPV--PYELPPQRYSSEDVPWSWDKRYTK-KDVYGQV 424
SS+ + +PV P++LPP RY D+PWS Y K KD +G +
Sbjct: 698 ---------------VSSDTITIPVRFPFKLPPTRYGENDLPWSAGSDYGKLKDRWGNL 741
>gi|451995661|gb|EMD88129.1| hypothetical protein COCHEDRAFT_1227354 [Cochliobolus
heterostrophus C5]
Length = 571
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 121/440 (27%), Positives = 189/440 (42%), Gaps = 94/440 (21%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHHSK ++L Y +II+HTAN+I DW N +Q +W+ ++ SEE
Sbjct: 158 IPDPFGTHHSKMLILFRYDDTAQIIIHTANMIRRDWANMTQAVWVSPLLPLLRHTTSEES 217
Query: 108 G------------FENDLIDYLSTLKWPEFSANLPAHGN-FKINPSFFKKFNFSSAAVRL 154
F+ DL+ YL A+G + S K +NFS
Sbjct: 218 KSTSIHSIGSGERFKVDLLRYLY------------AYGKGTRALTSQLKHYNFSGIRAAF 265
Query: 155 IASVPGYHTGS----SLKKWGHMKLRTVLQECTFEKGFKKSP--LVYQFSSLGSLDEK-- 206
+ S P S S +G + L +L + S +V Q SS+ +L
Sbjct: 266 LGSAPSRQKPSAASPSHTAFGWLGLDQILSGIPAKASEDSSRPHVVTQISSVATLGATPT 325
Query: 207 WMAELSSSMS--------------SGFSEDKT--------PLGIGEPL--IVWPTVEDVR 242
W+ S +S S F+E T +G EP +V+PT +++R
Sbjct: 326 WLFHFQSILSRCSNVNDSEKEEASSSFTEACTLSIQQKTNTVGAPEPKFSVVFPTPDEIR 385
Query: 243 CSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW----------KASHTGRSRAMPHIK 288
SL+GY++G +I S Q+ +++ W + +H RS A PHIK
Sbjct: 386 MSLDGYSSGGSIHWKFESAQQQKQLEYMHPILCHWAPVSQPDQPQRKAH--RSTAAPHIK 443
Query: 289 TFARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 345
T+ R++ + + W LLTS+NLSK AWG + N ++ I+S+E GV++ P+
Sbjct: 444 TYIRFSDETHTTIDWALLTSSNLSKQAWGDVANKNDEIRIQSWETGVVLWPAL---FAEH 500
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
+S I+ + E + K T VV +PY LP YS++
Sbjct: 501 EHSSTIMVPVFGIDNPEADSTYEAKKGT--------------VVGFRMPYNLPLVPYSAD 546
Query: 406 DVPWSWDKRYTKKDVYGQVW 425
+ PW + + D YG+ W
Sbjct: 547 ERPWCATMAHKEPDRYGRTW 566
>gi|223995471|ref|XP_002287409.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976525|gb|EED94852.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 625
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 121/447 (27%), Positives = 191/447 (42%), Gaps = 110/447 (24%)
Query: 76 ANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--------------------------- 108
+NL D KSQG++ Q FPLK + +
Sbjct: 189 SNLWRTDIEYKSQGVYSQVFPLKQKTPADDTVNKLKRKQIYNPYEKKKKPAAGSSSRGWP 248
Query: 109 --------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 160
FE+DL+ YL + + + + +G + ++++FS A LI SVPG
Sbjct: 249 FEDDKSQLFEDDLVGYLESYHYRK-QQSWKMNGESMNLLALIRQYDFSEAYAVLIPSVPG 307
Query: 161 YHTGSSLKKWGHMKLRTVLQE--CTFEKGFK--------KSPLVYQFSSLGSLDEKWM-- 208
YH+ S+ +G++KLR + E C + K PLV Q+SS+GSL W+
Sbjct: 308 YHS-LSIDDFGYLKLRKAIIEWVCNQQSNADSRKSSSNAKPPLVCQYSSVGSLTTAWLDL 366
Query: 209 --AELSSSMSSGF----------------SEDKTPLGIGEPL-IVWPTVEDVRCSLEGYA 249
A L S+ +S ++ K + + E + IVWPTV+++R ++EGY
Sbjct: 367 FTAALDSTSTSAVDPVEYYHEVTKKAKSRAKGKKGVDLSERMKIVWPTVDEIRTTIEGYN 426
Query: 250 AGNAIPSPQKNVDKDFLKKYWAKWKA---SHTGRS---------RAMPHIKTFARYNGQ- 296
G ++P KNV + FL + +W GR+ R +PHIKT+ + +
Sbjct: 427 GGGSVPGRTKNVAQSFLLPLYHRWTKRGNDFIGRTDNVDPLRTARNVPHIKTYVQPSTHV 486
Query: 297 -----KLAWFLLTSANLSKAAWGALQK----NNSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ W +LTS NLSKAAWG ++ ++ L IR +ELGV I P+
Sbjct: 487 IGDTPSIEWMVLTSHNLSKAAWGNIENRSVDDSKVLFIRHWELGVFISPATL-------A 539
Query: 348 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV--VYLPVPYE-LPPQRY-- 402
S E + + L SD G +E V P+PY+ + P Y
Sbjct: 540 NSKFTGGEARRIVPYIGNDIGNSPINL---ADSDDGGDTESRDVVAPLPYDVMNPSIYHH 596
Query: 403 SSEDVPWSWDKRYTKK-----DVYGQV 424
ED+ W+ D +++ D++G V
Sbjct: 597 QGEDMAWTVDGPWSRNGFVLPDLHGVV 623
>gi|7648683|gb|AAF65623.1|AF182002_1 tyrosyl-DNA phosphodiesterase protein [Homo sapiens]
Length = 415
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 80/224 (35%), Positives = 119/224 (53%), Gaps = 21/224 (9%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 202 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 261
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 262 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISY 321
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L P + K + S V LI S PG GS WGH +L+
Sbjct: 322 LMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLK 371
Query: 177 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 215
+L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 372 KLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 415
>gi|281201405|gb|EFA75617.1| protein-tyrosine phosphatase 3 [Polysphondylium pallidum PN500]
Length = 665
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 138/295 (46%), Gaps = 69/295 (23%)
Query: 53 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEND 112
FG HSK MLL+Y +R+++ +AN D+++ Q +W QDFP N+ F++
Sbjct: 390 FGCQHSKLMLLVYDDSIRVVIPSANPTRFDYDDIGQTIWFQDFP--KVNSQPPPSQFQDT 447
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
L ++ + P +F K++FS A V LI S+PGYH G+S+ + GH
Sbjct: 448 LKLFIKSCALPN---------------TFLDKYDFSIAKVHLIVSIPGYHRGASMNQCGH 492
Query: 173 MKLRTVLQECTFEKG-----------FKKSPLVYQFSSLGSLDEKWMAE-LSSSMSSGFS 220
M+LR++L++ +K KK + Q SSLG +++KW + L S+ + S
Sbjct: 493 MQLRSILKKYYTDKENDLKHSDFPIIIKKREVHSQTSSLGLVNDKWSPQFLESTQTLTKS 552
Query: 221 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR 280
+ P G+ I++P KN+
Sbjct: 553 KLVDPTGLLH--ILFP----------------------KNL----------------ILH 572
Query: 281 SRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 335
S+ + F + + W + S NLS AAWG LQK+NSQL I ++E+GVL+L
Sbjct: 573 SKIITGTTKFEHNDKLRFDWVYVGSHNLSPAAWGRLQKDNSQLYISNFEIGVLLL 627
>gi|448516422|ref|XP_003867567.1| hypothetical protein CORT_0B04230 [Candida orthopsilosis Co 90-125]
gi|380351906|emb|CCG22130.1| hypothetical protein CORT_0B04230 [Candida orthopsilosis]
Length = 533
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/423 (26%), Positives = 170/423 (40%), Gaps = 88/423 (20%)
Query: 49 LPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHH+K M+ Y V +I+ + N +D+ +Q +W + ++
Sbjct: 149 IPSRFGTHHTKMMINFYTDESVEVIIMSCNFTRLDFGGLTQMIWRSGRLILGNTTGAKSS 208
Query: 108 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGSS 166
F++DLI YL T P+ + ++FS V LIAS PG Y S
Sbjct: 209 KFKSDLIAYLRTYARPQID----------YLAKLLEPYSFSGIDVELIASSPGKYDLNSE 258
Query: 167 LKKWGHMKLRTVLQECT-----------FEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 215
+G+ L + + + S + Y FS L M
Sbjct: 259 GPHYGYGSLYNACKRNNLLIDNRDKSRHYNVLAQTSAISYPFSVEKGATAGIFTHLLCPM 318
Query: 216 SSGFSEDKTPLGIG-------------EPLIVWPTVEDVRCSLEGYAAGNAIPSP----- 257
+ + L G P I++P V +V S G+AAG AI
Sbjct: 319 LFSKNGEFKLLAPGIQSLRRHQSEHNYTPSIIFPAVSEVVSSTIGFAAGQAIHFDYSRSF 378
Query: 258 -QKNVDKDFLKKYWAKWKASHT----GRSRAMPHIKTFARYNG---QKLAWFLLTSANLS 309
KN + +K Y KW +S + GR + MPH+K + NG + + W + S NLS
Sbjct: 379 IHKNYYQQAIKPYLKKWNSSSSMSLAGREQVMPHVKLYMCDNGDNWRSIKWCYMGSHNLS 438
Query: 310 KAAWGALQKN------NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 363
K AWG+ + N +SQ + SYELGVL++P K + + PS +K
Sbjct: 439 KQAWGSRKGNKFVNDDSSQYEVNSYELGVLVVPKPK---------TEMKPSYLK------ 483
Query: 364 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDVYG 422
D G+ V Y+ +P++LPP YS D PWS Y + +D G
Sbjct: 484 -----------------DLGSEEGVTYVRMPFKLPPTAYSENDKPWSGHASYGELRDSKG 526
Query: 423 QVW 425
+
Sbjct: 527 NTY 529
>gi|239608603|gb|EEQ85590.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis ER-3]
Length = 653
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/429 (26%), Positives = 179/429 (41%), Gaps = 107/429 (24%)
Query: 1 MVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPIS 52
M D+D+L+ + + +V ++HG ES + E +R I+ P P
Sbjct: 114 MFDVDFLMSQFDEDVRNLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP-- 171
Query: 53 FGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW----------MQDFPLKDQN 101
FGTHHSK M+LI + ++++HT N+I DW N Q +W M+ P +
Sbjct: 172 FGTHHSKMMILIRHDDQAQVVIHTTNMIAGDWANMCQAVWRSPLLPMCHEMKRGPGSTAS 231
Query: 102 N-LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASV 158
N F+ DLI YL A+G K P +K++FS+ L+ASV
Sbjct: 232 NRFGSGIRFKRDLIAYLE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASV 279
Query: 159 PGYHTGSSLKK-----WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSL--DEKW 207
P L WG L+ +Q+ KG + +V Q SS+ +L +KW
Sbjct: 280 PSRQAIDELDSEKHTLWGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKW 339
Query: 208 MAELSSSMSSGFSEDKTPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----P 255
+ E + S + G+ +P I++PT +++R SL GYA+G +I
Sbjct: 340 LKETFFAALSPSPSRSSSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQ 399
Query: 256 SPQKNVDKDFLKKYWAKWKAS--------------------------------------- 276
S + ++L+ Y +W
Sbjct: 400 SSAQRKQLEYLRPYLCRWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDK 459
Query: 277 ------HTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRS 327
GR RA PHIKT+ R++ L W +++SANLS AWGA ++ I S
Sbjct: 460 NGQPIRQAGRRRAAPHIKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICS 519
Query: 328 YELGVLILP 336
+E+GV++ P
Sbjct: 520 WEIGVIVWP 528
>gi|326431947|gb|EGD77517.1| hypothetical protein PTSG_08615 [Salpingoeca sp. ATCC 50818]
Length = 594
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 76/195 (38%), Positives = 95/195 (48%), Gaps = 28/195 (14%)
Query: 233 IVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFA 291
+PTVEDVR S EGY G ++P K D F K KW+A R+RA+PHIKTF
Sbjct: 422 FCYPTVEDVRTSYEGYVGGGSLPHAIKYREDHVFFAKEACKWRAGWCYRTRALPHIKTFT 481
Query: 292 RYN--GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTS 349
+N + + W LL S NLSKAAWG LQK SQL I SYELGV + PS +
Sbjct: 482 AWNTAARSIDWMLLGSHNLSKAAWGQLQKQESQLHILSYELGVFLSPSL--------LGA 533
Query: 350 NIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
+ P K S T + + PVPY+ P YS+ D W
Sbjct: 534 TLRPLADKLRSVRRPDKHDT-----------------QTAWAPVPYDYPLTPYSTHDEMW 576
Query: 410 SWDKRYTKKDVYGQV 424
WD Y + D +G+V
Sbjct: 577 YWDGVYMQPDTHGRV 591
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 99/220 (45%), Gaps = 26/220 (11%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
M+D+DWLL P + +++++G + + + P LP +FGTHH+K
Sbjct: 118 MIDVDWLLDQYPAEYRRLPLMIVYGNDQRVSKETEHDTSNVRWFRAPYLP-AFGTHHTKM 176
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWM---------QDFPLKDQNNLSEECGFEN 111
MLL + G++++VHTANLI DWN K+QG+WM + ++D ++ S GF
Sbjct: 177 MLLFFHDGMQVVVHTANLISRDWNLKTQGIWMSPKLPRFSPKRGRVQDISSYS-PTGFGA 235
Query: 112 DLIDYLST--------LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT 163
DL YL + + AH + F ++ L+ P
Sbjct: 236 DLWSYLRAYGDGVQGGVSMRAVRERIAAHDLTHVKVVFACQYERD-----LLPLSPAATA 290
Query: 164 GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL 203
G + WG + + +L + G +V QFSS+G +
Sbjct: 291 GRTKTAWGQHEAQDLLLQQHAAGG--ADVVVCQFSSIGKM 328
>gi|448083780|ref|XP_004195441.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
gi|359376863|emb|CCE85246.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
Length = 576
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 123/431 (28%), Positives = 181/431 (41%), Gaps = 92/431 (21%)
Query: 46 KPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
K LP FGTHH+K M+ Y II+ T NL +D++ +Q W + ++
Sbjct: 182 KVDLPKRFGTHHTKMMVNFYENETCEIIIMTCNLQPIDFSALTQMCWRSGRLSRASSSNP 241
Query: 105 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPG-- 160
+ F+ D+I YL + KIN +F+ S V L+ASVPG
Sbjct: 242 GKPRFKTDIIRYLKRYRKQ------------KINELADTLAEFDMSGIDVELVASVPGNF 289
Query: 161 --YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 218
T +++G+ KL VL+ G + + Y + + A + +S
Sbjct: 290 NLARTADDSEEYGYGKLYQVLKRNDLLLGNEDTDKEYNVLAQATSISYPFALKEKNTASV 349
Query: 219 FSEDKTPLGIGE--------------------------PLIVWPTVEDVRCSLEGYAAGN 252
FS PL P I++P +D+ S G+ +G
Sbjct: 350 FSHIICPLIFSRNSDRLFDVLEPGTKSFRDHQIKHSYNPHIIYPCAKDIALSGTGFYSGQ 409
Query: 253 AI------PSPQKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFARYNG---QKLAWF 301
AI + +N + +K Y KW+ASH GR PH+K + NG + L W
Sbjct: 410 AIHFKYDTSAIHRNQFEQNIKPYLYKWRASHKNAGREETPPHVKLYMCDNGDNWKTLRWV 469
Query: 302 LLTSANLSKAAWGALQK------NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSE 355
L+ S NLSK AWGA ++ + S I SYELGVLI PS+ H +VP
Sbjct: 470 LMASHNLSKQAWGARRELRYRSADPSTYEISSYELGVLI-PSSSDH--------KLVP-- 518
Query: 356 IKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRY 415
S+ Q+ +T G V + +P+ LPP+RYSS+D PWS Y
Sbjct: 519 -----VFDSRHQR----KVTDQGD---------VPVRIPFILPPERYSSDDKPWSAYSNY 560
Query: 416 -TKKDVYGQVW 425
+ KD +G W
Sbjct: 561 GSLKDKFGHTW 571
>gi|390364206|ref|XP_788891.3| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Strongylocentrotus
purpuratus]
Length = 414
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 123/437 (28%), Positives = 190/437 (43%), Gaps = 101/437 (23%)
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG-----FENDLI 114
M L+Y G+R+++HTAN+I DW+ K+QG+W+ FP +N + G F+ DL+
Sbjct: 2 MFLLYADGMRVVIHTANIIESDWHQKTQGVWISPLFPKLPSSNQTATNGESPSFFKRDLL 61
Query: 115 DYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
YL+ + P + P + +FSSA V LI+SVPG H KWGH
Sbjct: 62 AYLTAYRSPS------------LQPWKDHITQHDFSSAKVFLISSVPGRHARELKNKWGH 109
Query: 173 MKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSS--GFSEDKTP 225
+K+R +L++ +K ++ P++ QFSS+GSL KW+ AE SMS+ G S T
Sbjct: 110 LKVRKILRQYGPDKEQVQTWPVIGQFSSIGSLGGDKTKWLCAEFLQSMSTVKGQSGSFTS 169
Query: 226 LGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRA 283
+ +++P ++VR SLEGY AG ++P S Q + +L +++ + G +
Sbjct: 170 NADTRHMKLIFPCSDNVRTSLEGYPAGASLPYSIQTAKKQPYLHQFFLREILRQYGPDKE 229
Query: 284 M----PHIKTFA---RYNGQKLAWF---LLTSANLSKAAWGALQKNNSQLMIRSYELGVL 333
P I F+ G K W L S + K G+ N ++ L
Sbjct: 230 QVQTWPVIGQFSSIGSLGGDKTKWLCAEFLQSMSTVKGQSGSFTSNADTRHMK------L 283
Query: 334 ILPSAKRHGCGFSCTSNIVPS--EIKSGSTETSQIQKTK------------LVTLTWHGS 379
I P C+ N+ S +G++ IQ K L W G+
Sbjct: 284 IFP----------CSDNVRTSLEGYPAGASLPYSIQTAKKQPYLHQFFFANLSKAAW-GA 332
Query: 380 SDAGASS--------EVVYLP----------------------VPYELPPQRYSSEDVPW 409
+ AS V+ +P +P+++P YS D PW
Sbjct: 333 YEKNASQLMIRSYEIGVMMIPSFFDKSRKTFPLTEGRGQKEFSLPWDVPLTPYSKTDRPW 392
Query: 410 SWDKRYTKK-DVYGQVW 425
WD YT K D +G W
Sbjct: 393 IWDIPYTDKPDSHGNAW 409
>gi|116194574|ref|XP_001223099.1| hypothetical protein CHGG_03885 [Chaetomium globosum CBS 148.51]
gi|88179798|gb|EAQ87266.1| hypothetical protein CHGG_03885 [Chaetomium globosum CBS 148.51]
Length = 349
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 139/311 (44%), Gaps = 56/311 (18%)
Query: 145 FNFSSAAVRLIASVPGYHT---GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 201
++FS LIASVPG H S+ WG + L+ KK + Q SS+
Sbjct: 62 YDFSEIRGSLIASVPGRHVFEEEDSITWWGSAAMSRALEAVPISS--KKPEIAIQTSSIA 119
Query: 202 SL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI--- 254
+L + W+ L S+ G S TPL +V+PT +++R SL+GY +G++I
Sbjct: 120 TLGGSDTWLKNILFRSLRGGRS--TTPLAQRPSFKVVFPTPDEIRKSLDGYHSGSSIHTK 177
Query: 255 -PSPQKNVDKDFLKKYWAKWK--------------ASHTGRSRAMPHIKTFARYNG---- 295
SPQ+ +L+ + W GR RA PHIKT+ RY+G
Sbjct: 178 TQSPQQASQLTYLRPMFHHWANDSDRGAPLSYGDIPKEAGRKRAAPHIKTYIRYSGYGPE 237
Query: 296 -QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 354
+ W LLTSANLSK AWG +++ + SYE+GVL+ P + +G G + +
Sbjct: 238 PPTVDWALLTSANLSKQAWGDAPNTRNEVRVASYEIGVLVWP--ELYGEGATMVPTFMTD 295
Query: 355 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 414
+ G ++ V L +PY LP Q Y +VPW ++
Sbjct: 296 SLAEGEVPE--------------------GTATAVALRMPYNLPLQAYGEGEVPWVATEK 335
Query: 415 YTKKDVYGQVW 425
+ + D G+ W
Sbjct: 336 HLEPDWMGRAW 346
>gi|444707427|gb|ELW48704.1| Tyrosyl-DNA phosphodiesterase 1 [Tupaia chinensis]
Length = 389
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 88/241 (36%), Positives = 117/241 (48%), Gaps = 71/241 (29%)
Query: 192 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 246
PLV QFSS+G L + KW+ +E S+ + + K P PL +++P+VE+VR SLE
Sbjct: 210 PLVGQFSSIGFLGADESKWLCSEFKESLLTLGRDSKIPGKSTVPLHLIYPSVENVRTSLE 269
Query: 247 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTS 305
GY AG ++P S Q +++L Y+
Sbjct: 270 GYPAGGSLPYSIQTAEKQNWLHSYF----------------------------------H 295
Query: 306 ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 365
ANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + SGS
Sbjct: 296 ANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGS----- 344
Query: 366 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQV 424
HG + + PVPY+LPP+ Y +D PW W+ Y K D +G +
Sbjct: 345 -----------HGPTAS--------FPVPYDLPPELYGHKDRPWIWNIPYVKAPDTHGNM 385
Query: 425 W 425
W
Sbjct: 386 W 386
>gi|294659254|ref|XP_461609.2| DEHA2G01584p [Debaryomyces hansenii CBS767]
gi|199433821|emb|CAG90056.2| DEHA2G01584p [Debaryomyces hansenii CBS767]
Length = 583
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 121/443 (27%), Positives = 187/443 (42%), Gaps = 112/443 (25%)
Query: 49 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
LP FGTHH+K M+ Y II+ T NL +D+ +Q W + N+S E
Sbjct: 182 LPTRFGTHHTKMMINFYEDDTSEIIIMTCNLQKIDFGGLTQMCWKSGRLHRSNGNISPER 241
Query: 108 G--FENDLIDYLSTLKWPEFSANLPAHGNFKINP-----SFFKKFNFSSAAVRLIASVPG 160
G F+ DL +YL +K NP +++FS + L+AS PG
Sbjct: 242 GARFQKDLKNYLF---------------RYKKNPLRELGKSLDEYDFSPVNIELVASAPG 286
Query: 161 Y----HTGSSLKKWGHMKLRTVLQECTF----EKGFKKSPLVYQFSSLGSLDEKWMAELS 212
+ + + + +G+ KL VL+ KG K ++ Q SS+ A
Sbjct: 287 FFNMAESTNDSEIYGYGKLYQVLRRNNLLIDNSKGENKYNILAQVSSISYP----FATEK 342
Query: 213 SSMSSGFSEDKTPL---GIGE-----------------------PLIVWPTVEDVRCSLE 246
S+ +S FS PL G+ + P I++P+V+DV S
Sbjct: 343 SNTASIFSHLLCPLIFSGMSKASFNLLKPGAASFKSHQNTHNYRPHILYPSVDDVANSNV 402
Query: 247 GYAAGNAI-------PSPQKNVDKDFLKKYWAKWK----ASHTGRSRAMPHIKTFARYNG 295
G+A+G A+ P+ + +++ +K Y +W+ A TGR +PH+K + NG
Sbjct: 403 GFASGQALHFKFTTTPTHRNQYEQN-IKPYLYRWQSGSHADETGRENVVPHVKLYMCDNG 461
Query: 296 QK---LAWFLLTSANLSKAAWGALQKNNSQLM--------IRSYELGVLILPSAKRHGCG 344
L W L+ S NLSK AWGA KN ++ + SYELGVL+
Sbjct: 462 DDWCTLRWVLMGSHNLSKQAWGA--KNETKFTNSDPSVYKVSSYELGVLV---------- 509
Query: 345 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 404
N+ P++ G T L + + A + L +P++LPP +Y
Sbjct: 510 ---PGNMDPND--DGIT---------LKPIYGRDTFPAPQHNNDTPLRIPFKLPPVKYKP 555
Query: 405 EDVPWSWDKRYTK--KDVYGQVW 425
+ PWS Y KD +GQ +
Sbjct: 556 SERPWSALINYGNNLKDRFGQCY 578
>gi|344301196|gb|EGW31508.1| hypothetical protein SPAPADRAFT_154759 [Spathaspora passalidarum
NRRL Y-27907]
Length = 549
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 115/426 (26%), Positives = 175/426 (41%), Gaps = 91/426 (21%)
Query: 49 LPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 107
+P FGTHH+K M+ + + I++ ++N+ +D+ +Q LW K +
Sbjct: 163 IPNRFGTHHTKMMINFFKGDTMEIVIMSSNITRLDFGGLTQMLWRSGRLSKIKPKTIPLV 222
Query: 108 G--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--- 162
G F+ DL++YL+ E + K+++FSS V LIAS PG +
Sbjct: 223 GKRFQKDLMNYLNKYNKVEITQL----------SKRLKQYDFSSVNVELIASAPGSYNLR 272
Query: 163 -TGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 221
+ + +G+ KL L+ + S L Y + S A + + FS
Sbjct: 273 DVTNETEIYGYGKLHQALKRNSLLIDNSISKLKYNIIAQVSAISYPFAVETFQTAGIFSH 332
Query: 222 DKTPLGIGE------------------------PLIVWPTVEDVRCSLEGYAAGNAI--- 254
PL + P+I++PT E+V S G+ AG AI
Sbjct: 333 LLCPLVFSKKEEFKLLEPGTNSFRQHQKDHNYNPIIIFPTPEEVAGSNVGFRAGGAIHFD 392
Query: 255 ---PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNGQK---LAWFLLTSA 306
KN + +K Y KW + + TGR + MPH+K + NG L W + S
Sbjct: 393 YNRSFVHKNYYQQCIKPYLHKWSSRETITGREKVMPHVKLYMCDNGDNWSTLKWVYMGSH 452
Query: 307 NLSKAAWGA------LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 360
NLSK AWG+ L N S I SYELGVL+ P P E
Sbjct: 453 NLSKQAWGSRRGNKFLSSNPSIYDISSYELGVLVYPK---------------PGE----- 492
Query: 361 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KD 419
TL + D+ S+ + + +P++LPP +Y S D+PWS Y D
Sbjct: 493 ------------TLVPNYLGDSIPKSKNIPIRLPFKLPPVKYLSTDLPWSGHVSYGGLAD 540
Query: 420 VYGQVW 425
YG+ +
Sbjct: 541 KYGETY 546
>gi|193785768|dbj|BAG51203.1| unnamed protein product [Homo sapiens]
Length = 118
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 82/145 (56%), Gaps = 33/145 (22%)
Query: 284 MPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 341
MPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 1 MPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA--- 57
Query: 342 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 401
F S V + +GS E + PVPY+LPP+
Sbjct: 58 ---FGLDSFKVKQKFFAGSQE------------------------PMATFPVPYDLPPEL 90
Query: 402 YSSEDVPWSWDKRYTKK-DVYGQVW 425
Y S+D PW W+ Y K D +G +W
Sbjct: 91 YGSKDRPWIWNIPYVKAPDTHGNMW 115
>gi|300121378|emb|CBK21758.2| unnamed protein product [Blastocystis hominis]
Length = 397
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/314 (29%), Positives = 149/314 (47%), Gaps = 45/314 (14%)
Query: 43 ILHKPPLPISF--GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLK 98
++ PP S+ G H+K +LL + +RI++ +ANL DW SQ +WMQDF K
Sbjct: 60 LIVSPPFAQSYLRGCFHAKLLLLRFSDRLRIVISSANLTTEDWTMWSQCVWMQDFFNAPK 119
Query: 99 DQNNLSE---ECGFENDLIDYLSTLKWPE---FSANLPAHGNFKINPSFFKKFNFSSAAV 152
D ++ + F LI +L PE F+A F+ F + +V
Sbjct: 120 DSTRVAAKKLDLEFRTQLISFLRKCCVPEERIFNA--------------FRGVFFENVSV 165
Query: 153 RLIASVPGYHTGSSLKKWGHMKLRTVLQECT--FEKGF---KKSPLVYQFSSLGSLDEKW 207
+L+ASVPG + G + +G ++LR+VL+ EK K P++ Q SS+G+ + W
Sbjct: 166 QLVASVPGVYQGDRMNDYGQLRLRSVLKGLNDYMEKVASLPKNPPILSQCSSIGNPSQNW 225
Query: 208 MAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEG-YAAGNAIPSPQKNVDKDF 265
+ + S G + + + L IV+PT V S+ G AG+ I + K F
Sbjct: 226 ILSMLKSCYGGREIVEKKGKLADLLHIVYPTNVYVNNSIIGPEMAGSLIFMQKVYTAKAF 285
Query: 266 LKKYWAKWKASHTGRSRAMPHIKTFARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQ 322
L++ ++K + GR +PH K +K L W AWG ++K SQ
Sbjct: 286 LREMLKRYKDA-PGRETTLPHSKYLMNVPLKKRPRLPW----------VAWGQIEKKESQ 334
Query: 323 LMIRSYELGVLILP 336
+ I +YE GV++LP
Sbjct: 335 IAICNYECGVVLLP 348
>gi|328868012|gb|EGG16393.1| protein-tyrosine phosphatase 3 [Dictyostelium fasciculatum]
Length = 596
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 146/324 (45%), Gaps = 45/324 (13%)
Query: 51 ISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN--------- 101
+ +G HSK +LL+Y +R++V +AN D+ Q +W QDF K
Sbjct: 236 VLYGCMHSKLILLLYKDYIRVVVPSANPFEEDYIRIGQTIWYQDFQKKLPPPPPPLATTP 295
Query: 102 ------------NLSEECGFENDLIDYLSTLKWPEFSANLPAHGN-FKINPSFFKKFNFS 148
+LS + +T +F +L N FKI F +F+F
Sbjct: 296 TLKPIPSTSKTISLSLKQMTTKKPTTTTTTTTTNDFQISLKTLLNCFKIETKFLDQFDFE 355
Query: 149 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEK---------GFKKSPLVYQFSS 199
A +LI S+PG+H G++L +GH+KLR+VL +K FK+ + Q SS
Sbjct: 356 CAKAQLIISIPGFHNGATLNSYGHLKLRSVLTSYYNQKEKDLNLKIDNFKRD-VFSQCSS 414
Query: 200 LGSLDEKWMAEL--SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPS 256
LG+++ W S + ED I + L I++PTV + + + + + I
Sbjct: 415 LGNVNSGWNQHFLESCRIPKNNLED-----ISKSLHILFPTVSWITSNHKRMQSASIIRF 469
Query: 257 PQKNV-DKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN----GQKLAWFLLTSANLSKA 311
K+ DK F + K H R + H K ++ W + S NLS A
Sbjct: 470 QDKSYDDKTFPRNSMTLIKHRHPHRGNMLLHTKVNVGVTTIGKNKRYDWIYVGSHNLSPA 529
Query: 312 AWGALQKNNSQLMIRSYELGVLIL 335
AWG +QKN +Q+ + +YE+GV++L
Sbjct: 530 AWGKIQKNQTQIQLSNYEIGVVLL 553
>gi|149245486|ref|XP_001527220.1| hypothetical protein LELG_02049 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449614|gb|EDK43870.1| hypothetical protein LELG_02049 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 554
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 117/443 (26%), Positives = 182/443 (41%), Gaps = 110/443 (24%)
Query: 49 LPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 106
+P FGTHH+K M+ + V I++ ++N+ +D+ +Q +W P + +
Sbjct: 154 IPTRFGTHHTKMMINFFEDLSVEIVISSSNITRLDFGGLTQMVWRSGRLPQSGETIGEKG 213
Query: 107 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGS 165
F+ DLI YL+ K+ + + A + +NF S V LIAS PG Y+
Sbjct: 214 IQFKKDLIGYLN--KYKKVPVDKLA--------TRLNLYNFLSVDVELIASAPGKYNLQK 263
Query: 166 SLKKWGHMKLRTVLQ--------------ECTFEKGFKK---------SPLVYQFSS--- 199
+G+ L L+ E +K KK S + Y FS+
Sbjct: 264 DSSLYGYGSLYKALERNNLLLNNKNVEHDEIDNDKHNKKKHYNVLAQVSAISYPFSTEKW 323
Query: 200 -------------LGSLDEKW--MAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 244
+ S DEK+ +A S+ E P I++PTV++V S
Sbjct: 324 ATAGIFTHLLCPLIFSKDEKFRLLAPGKESIKRHQKEHNYT-----PHIIFPTVDEVASS 378
Query: 245 LEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHT----GRSRAMPHIKTFARYN 294
GY AG+AI KN +K Y +KW +S T GR R MPH+K + N
Sbjct: 379 TIGYVAGSAIHFDYTRSFVHKNYFTQAIKPYLSKWDSSDTKEVTGRERVMPHVKLYMCDN 438
Query: 295 G---QKLAWFLLTSANLSKAAWGALQKN------NSQLMIRSYELGVLILPSAKRHGCGF 345
+ + W + S NLSK AWG+ + N + + + SYELGVL P
Sbjct: 439 ADNWKTIKWCYMGSHNLSKQAWGSKKGNKFVNDHSDEYEVSSYELGVLFTP--------- 489
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 405
K G+T ++ K + + ++ +P++LPP YS
Sbjct: 490 -----------KEGTTMVPSYKENK-----------SSIRGDHTFVRMPFQLPPALYSLL 527
Query: 406 DVPWSWDKRYTKK-DVYGQVWPR 427
D+PWS Y K D+ G + +
Sbjct: 528 DMPWSGHVSYGDKLDLMGSTYKK 550
>gi|345570074|gb|EGX52899.1| hypothetical protein AOL_s00007g235 [Arthrobotrys oligospora ATCC
24927]
Length = 651
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 124/462 (26%), Positives = 186/462 (40%), Gaps = 95/462 (20%)
Query: 49 LPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEE 106
+P FGTHH+K ++L Y I+VHTAN+I DW+N +Q +W PL ++L +
Sbjct: 186 MPDMFGTHHTKMLVLFYHDDSAEIVVHTANMIPWDWSNMTQAVWRSPKLPLLADDSLERK 245
Query: 107 CGFENDLIDYLSTLKWPEFSANLPAHG-NFKINPSFFKKFNFSSAAVRLIASVPGYH--T 163
G + Y+ F+A + A+G K K++F + + VPG H
Sbjct: 246 EG-----VGYVFK---EAFTAYVGAYGWRTKSLMEQIVKYDFRAVRAVFVGHVPGDHAIN 297
Query: 164 GSSLKKWGHMKLRTVLQECTFEKGF---KKSPLVY----------QFSSLGSLDEKWMAE 210
G K +G K++ VL G K +VY Q SS+ +L E +
Sbjct: 298 GPENKLFGWSKVKRVLTRIGRGGGHGVNKAGRVVYTVKGGGEIAMQCSSVATLGESYFDS 357
Query: 211 L----------SSSMSSGFSEDKTPLGIG---------EPLIVWPTVEDVRCSLEGYAAG 251
+ + F +TP E +V+PTVE+VR S+ G+ G
Sbjct: 358 VLYPTFSTCRPGGGQLNAFDVLRTPSSSASSSRPSNRPELALVFPTVENVRTSVLGWDGG 417
Query: 252 NAI-PSPQKNVDKDFLK------KYWAK-------WKASHTGRSRAMPHIKTF------- 290
+I QK VDK LK + W + A R +A PHIKT+
Sbjct: 418 GSIFMKSQKPVDKAQLKYVKPMLRVWGQPPIGLSTAIAVEAERGKATPHIKTYNFFSPPR 477
Query: 291 ---------------ARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGV 332
+N + W ++TSANLSK AWG K +S I+SYE G+
Sbjct: 478 MDSKDSDTTDGEDESGAFNIVSMDWAMITSANLSKQAWGNPTKGSGPSSTSKIQSYEAGI 537
Query: 333 LILP----SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 388
LI P + G S + GS + + K+ D +
Sbjct: 538 LIHPGLWKDLLKDEAGAVTMSAV-------GSKDWLVAEGQKIENCDVPEDMDGKCNMVK 590
Query: 389 VYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRHFQ 430
V + + Y+ P + Y +D PW D Y +D G WP ++
Sbjct: 591 VGVRLAYDYPLKPYDEDDEPWCKDMPYEGRDWKGITWPPRWE 632
>gi|154311214|ref|XP_001554937.1| hypothetical protein BC1G_06725 [Botryotinia fuckeliana B05.10]
Length = 405
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 142/349 (40%), Gaps = 72/349 (20%)
Query: 144 KFNFSSAAVRLIASVPGYHTGS---SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 200
K++FS LIASVPG S WG L L+ + +V Q SS+
Sbjct: 60 KYDFSEIKAALIASVPGKQDTELSPSQTGWGWAGLTNALKSVPSHHNTQPE-IVIQVSSI 118
Query: 201 GSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPS- 256
SL +KW+ ++S E K+P G I++PT ++VR S+ GYA+GNAI +
Sbjct: 119 ASLGPTDKWLTHFFKALS----ESKSPRKTGSKFKIIFPTADEVRRSINGYASGNAIHTK 174
Query: 257 ---PQKNVDKDFLKKYWAKW------------------------------KASHTGRSRA 283
P + +LK W K R RA
Sbjct: 175 ILTPAQGKQLAYLKPMLCHWAGDGAQHSSSSSLSSNTPSKSSQSFTSPELKTQEAYRRRA 234
Query: 284 MPHIKTFARYNGQK---------LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
PHIKT+ R++ + W L+TSANLSK AWG + ++ I SYE+GVL+
Sbjct: 235 APHIKTYIRFSSDSTSSSSSQKSIDWMLVTSANLSKQAWGESINSADKVRICSYEIGVLV 294
Query: 335 LP---SAKRHGCGFS---CTSNIVPS--------EIKSGSTETSQIQKTKLVTLTWHGSS 380
P K++G C N PS EI + ++ L
Sbjct: 295 WPDLWEEKQNGKNVKMVPCFGNDTPSIPFVSPSLEIVGQKEIRVEGEEGHLKRKRCDDRE 354
Query: 381 DAGASSE----VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
D E +V +PY+LP Y +D+PW Y++ D G+ W
Sbjct: 355 DEKRQEESHTIIVGARMPYDLPLVSYGKDDIPWCASASYSEPDWMGKTW 403
>gi|150865397|ref|XP_001384596.2| hypothetical protein PICST_67678 [Scheffersomyces stipitis CBS
6054]
gi|149386653|gb|ABN66567.2| putative tyrosyl-DNA phosphodiesterase [Scheffersomyces stipitis
CBS 6054]
Length = 553
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 111/427 (25%), Positives = 181/427 (42%), Gaps = 92/427 (21%)
Query: 49 LPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 106
+P FGTHH+K M+ + + I++ + NL +D +Q LW L+ ++++ E
Sbjct: 165 IPNRFGTHHTKMMVNFFEDKSCEIVIMSFNLNKIDVVGLTQTLWRSGRLQLETEDSVKLE 224
Query: 107 CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 164
G F+ D ++YL P ++ + ++F S V L+AS PG +
Sbjct: 225 RGENFKRDFMNYLKKYNSPVVTSLADR----------LQSYDFHSIDVELLASAPGKYEI 274
Query: 165 SSLKK----WGHMKLRTVLQECTFEKGFKKSPLVYQF---------------SSLGSLDE 205
++L +G+ KL +L+ K +Y F S S+
Sbjct: 275 TNLTDKDEVYGYGKLYQILKRNNLLVDNTKGDKLYNFLSQVTSISYPFNVRGSQTASVFS 334
Query: 206 KWMAELS-SSMSSGF-----SEDKTPLGIGE----PLIVWPTVEDVRCSLEGYAAGNAIP 255
+A L S S+GF D T + P +V+PTV+++ + G+ AG A+
Sbjct: 335 HLLAPLVFSGGSNGFKILLPGSDSTSKHQKDNYYLPHMVYPTVKEIANNNVGFGAGQAVH 394
Query: 256 SPQKNVD------KDFLKKYWAKWKASH----TGRSRAMPHIKTFARYNGQK---LAWFL 302
D + ++ Y KW +S TGR +PH K F NG L W L
Sbjct: 395 MKHTKSDTHRYQYQQNIRPYLRKWNSSGSDIVTGRESVVPHCKYFMCDNGDNFSSLKWAL 454
Query: 303 LTSANLSKAAWGA---LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 359
+ S NLSK AWG+ N ++ I S+ELGV++ P + G +VP+
Sbjct: 455 VGSHNLSKQAWGSPVPKSTNPNKYEISSFELGVVVFP---KEG------EKLVPA----- 500
Query: 360 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS-WDKRYTKK 418
+G D + + L +P+ LPP +Y+++D PWS W K
Sbjct: 501 -----------------YGE-DTVNDDKAIPLRMPFSLPPTKYTAQDEPWSEWVSYGELK 542
Query: 419 DVYGQVW 425
D +GQ +
Sbjct: 543 DKFGQTY 549
>gi|440302433|gb|ELP94746.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba invadens IP1]
Length = 446
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 165/378 (43%), Gaps = 74/378 (19%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
+ DI WLL P+L K V +H DG+L + N + G HH K
Sbjct: 49 VFDIGWLLREVPIL-KTVQVQFVH---DGSLSEDEERLIHNLDFQCIKVSPFRGCHHVKI 104
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLID-YLST 119
M+++Y G+R ++ T NL+ D+ K+ G++++DF K N+ S+ ND+ + +L+T
Sbjct: 105 MVMLYEGGLRFVLSTGNLLEQDYEIKTNGIYVRDFKPK-SNSFSKM----NDIGEHFLTT 159
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
+++ S N + + F+FS+ L+ SVPG G + G +L ++L
Sbjct: 160 MRYYLNSIN--------TDIGYLDDFDFSTIDAWLLLSVPGKFHGDMASEVGLGQLSSLL 211
Query: 180 QECTF---------------------------------EKGFK--------KSPLVYQFS 198
+ +F +KG K ++ ++ Q S
Sbjct: 212 KSFSFGSQKDQKTQEEHKTSALINPVVPTKQSQKTSTSQKGLKSPEIECAEQAVIISQSS 271
Query: 199 SLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQ 258
SLG L + + SS + +WPT + VR S GYA G ++ Q
Sbjct: 272 SLGYLSSNFTEKFKSSFVPNVHHIQLK-------TLWPTEDFVRVSATGYAGGQSLFLTQ 324
Query: 259 KNVDKDF-LKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQ 317
+NV L +Y ++ R PHIKT+ G +LTSAN+S AAWG +
Sbjct: 325 QNVKSGVALYRYEPRFP-----RHYIQPHIKTYLVKVGDTFRCGVLTSANMSAAAWG--K 377
Query: 318 KNNSQLMIRSYELGVLIL 335
+ + I ++E+G+L +
Sbjct: 378 PMSYGIDISNFEMGLLFV 395
>gi|254565439|ref|XP_002489830.1| hypothetical protein [Komagataella pastoris GS115]
gi|238029626|emb|CAY67549.1| hypothetical protein PAS_chr1-1_0480 [Komagataella pastoris GS115]
gi|328350245|emb|CCA36645.1| tyrosyl-DNA phosphodiesterase 1 [Komagataella pastoris CBS 7435]
Length = 562
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 108/400 (27%), Positives = 170/400 (42%), Gaps = 82/400 (20%)
Query: 53 FGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN 111
F THH+K M+ + G +I+V +AN+ +D+ +QGLWM P+ + N E F+N
Sbjct: 192 FATHHTKMMVNFFRDGTAQIVVMSANMTEMDFVGNTQGLWMS--PMLSKGN-GRESSFKN 248
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT----GSSL 167
D + YL + + +L A K ++F + ++SVPG T L
Sbjct: 249 DFLAYLKA--YNKHDLDLLAEE--------LKLYDFGNVKAEFLSSVPGTFTIPEEDDRL 298
Query: 168 KK---WGHMKLRTVLQ-ECTFEKGFKKSPLVYQFSSLGS-LDEKWMAELSSSMSSGFSED 222
K+ +G+ KL +L+ F K + + ++ Q +++ S D + + ++ +
Sbjct: 299 KRSVQYGYGKLFQLLKLNNLFPKATESTDILAQVATIASPFDFRSSNIFTHLLAPLINGT 358
Query: 223 KTPLGIG---------------EPLIVWPTVEDVRCS-LEGYAAG---NAIPSPQK---- 259
K P+ G P +V+PT +V S L+ Y +G N S K
Sbjct: 359 KFPIAGGLEPLQKAINDDVHPFNPFLVFPTKNEVFGSVLKEYTSGIFYNIDDSSHKVPFL 418
Query: 260 NVDKDFLKKYWAKWKASH------TGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKA 311
+ ++K+ +W S GRS PH+KT+ N Q W+LLTSANLSK
Sbjct: 419 TNQHNIIRKFMYRWTNSDPNLNQKAGRSNLAPHVKTYCASNDGFQTFMWYLLTSANLSKQ 478
Query: 312 AWGALQK--NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKT 369
AWG K N + I SYE G+ I P K +G +
Sbjct: 479 AWGYPLKGSNGLKYKISSYEAGIFIHP--KLYGEDY------------------------ 512
Query: 370 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
+L + S VV + VPY P ++Y D PW
Sbjct: 513 QLKPILSRDSFPNRDKDNVVPIRVPYAFPLEKYHDSDEPW 552
>gi|449019998|dbj|BAM83400.1| probable tyrosyl-DNA phosphodiesterase [Cyanidioschyzon merolae
strain 10D]
Length = 615
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 154/349 (44%), Gaps = 73/349 (20%)
Query: 55 THHSKAMLL-IYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDL 113
HHSK M+L + VR+++HT+N I DW K QG++ D PL+ + S GF DL
Sbjct: 208 VHHSKFMILRLRDDRVRLVIHTSNDIAYDWFFKCQGIFAVDLPLRGAGSASPNTGFCADL 267
Query: 114 IDYLS---------------------TLKWPEFSANL-PAHGNFKINPSFFKKFNFSSA- 150
YL T+ P +A+L A +F+ ++S+
Sbjct: 268 QQYLGAYIRAGERALHGGVTSARRFGTMVAPGDAASLVDAVSHFR---RLMTCCDYSAVD 324
Query: 151 AVRLIASVPGYHTGSSLKK--------------WGHMKLRTV----LQECTFEKGFKKS- 191
VRL++SVPG+H S + +GH++L + L+ CT S
Sbjct: 325 GVRLVSSVPGWHRISGQSRTSQTSRTASHAVCAFGHLRLANLVASSLRHCTEAARHPNSL 384
Query: 192 PLVYQFSSLGSLDEK------------WM-AELSSSMSSGFSED----------KTPLGI 228
V Q SSL S+D + W+ +EL S+ G K G
Sbjct: 385 AFVLQGSSLSSVDARCPRAASETLARYWLTSELFRSLCGGDGGGGGVGEESVFAKLAEGS 444
Query: 229 GEPLIVWPTVEDVRCSLEGYAAG-NAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 287
+ +VWPT V S+ G +G I Q +D + +++ +W A R+ MPH+
Sbjct: 445 AQVYLVWPTRTQVLTSIVGIDSGMGLIARAQAFLDPE-IRQLLTRWNADWCARTSVMPHM 503
Query: 288 KTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
KT + ++ + + + L SAN++ AAWG QK S L ++ELGVL
Sbjct: 504 KTISCWDTRTDQCLYCYLGSANVTPAAWGITQKQGSLLRCMNWELGVLF 552
>gi|145533358|ref|XP_001452429.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420117|emb|CAK85032.1| unnamed protein product [Paramecium tetraurelia]
Length = 508
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 164/340 (48%), Gaps = 49/340 (14%)
Query: 27 SDGTLEHMKR-NKPANWILHKPPL--PISFG-THHSKAMLLIYPRGVRIIVHTANLIHVD 82
+D LE ++ N NW + KP I+FG + H K +L +P+ +RI++ + NL D
Sbjct: 147 NDKKLEIIEEFNGHPNWTVIKPSKLSSITFGGSFHPKIWILKFPKFIRIVIGSQNLHVGD 206
Query: 83 WNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSF 141
W SQ +W+QDF + + F+ L ++L + LP+ F+ +
Sbjct: 207 WTVWSQAMWIQDFQIGNSELDEVSKEFKVGLKEFLDNI--------LPSSHKFEDLLKIK 258
Query: 142 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF---KKSPLVYQFS 198
+ ++F + +RLI S+PG TG+ + K+G M++++V+ F K+ + YQ +
Sbjct: 259 YNDYDFQNINIRLITSIPGRFTGNQMNKYGMMRIQSVINSELKSSDFEIPKQVSIAYQTT 318
Query: 199 SLGSLDEKWMAELS--------SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 250
S+G LD ++ + + M E+K+ L +++PT + ++ +A
Sbjct: 319 SIGQLDVNYVDFVQQQQNNKSIAQMLFNQQEEKSILK-----LIYPTSDYIQNQT---SA 370
Query: 251 GNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIKTFARYN-GQKL- 298
G +P Q+ + F K + +++ S H G +PH+K +K+
Sbjct: 371 GPEYANPLFLRKQQYDNPKFPKNIFHRYQGSNYYYWHAGN---IPHLKVMIITGIDEKID 427
Query: 299 --AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
+ S NLS+AAWG L+KN +QL I + ELGVL P
Sbjct: 428 DKTSIYIGSHNLSQAAWGRLEKNATQLFISNTELGVLYPP 467
>gi|260788030|ref|XP_002589054.1| hypothetical protein BRAFLDRAFT_87527 [Branchiostoma floridae]
gi|229274227|gb|EEN45065.1| hypothetical protein BRAFLDRAFT_87527 [Branchiostoma floridae]
Length = 130
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/90 (56%), Positives = 65/90 (72%), Gaps = 3/90 (3%)
Query: 250 AGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSA 306
AG ++P K +L K+ +W +S GR+RA PHIKT+ R + +LAWFL+TSA
Sbjct: 8 AGGSLPYSINTARKQPYLNKFLHQWSSSARGRTRASPHIKTYTRTSPDCSRLAWFLVTSA 67
Query: 307 NLSKAAWGALQKNNSQLMIRSYELGVLILP 336
NLSKAAWGAL+KN +QLMIRSYE+GVL LP
Sbjct: 68 NLSKAAWGALEKNGAQLMIRSYEIGVLFLP 97
>gi|400603196|gb|EJP70794.1| tyrosyl-DNA phosphodiesterase [Beauveria bassiana ARSEF 2860]
Length = 399
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 127/264 (48%), Gaps = 37/264 (14%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRG--VRIIVHTANLIHVDWNNKSQGLW------- 91
N LH P+P FGTHHSK ML+++ R ++I+HTAN+I DW N + +W
Sbjct: 125 NVNLHAAPMPEMFGTHHSK-MLILFRRDDTTQVIIHTANMIAKDWTNMTNAVWTSPVLSK 183
Query: 92 MQDFPLKD--QNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF 147
++ P + ++++ G F++DL+ YL + N K+++F
Sbjct: 184 LKKVPDDPSWREDMAQGSGHRFKSDLLSYLRCYDRMRPTCNALVES--------LKEYDF 235
Query: 148 SSAAVRLIASVPGYHT---GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL- 203
SS LIASVPG H + WG + LQ+ E G S + Q SS+ +L
Sbjct: 236 SSVRGSLIASVPGTHEVHGDPGVTSWGWKSMSKCLQQIPCEPGV--SQVAVQVSSIATLG 293
Query: 204 -DEKWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNA----IPSP 257
++ W L ++ S+ K + +V+PT +++R SL+GYA+G + I S
Sbjct: 294 GNDGW---LRGTLFRALSKGKVATALSPQFKVVFPTADEIRASLDGYASGGSIHTKIQSK 350
Query: 258 QKNVDKDFLKKYWAKWKASHTGRS 281
Q+ + ++L+ + W R+
Sbjct: 351 QQQMQLNYLRPIFHHWMTDDDSRT 374
>gi|299740649|ref|XP_001833897.2| hypothetical protein CC1G_01574 [Coprinopsis cinerea okayama7#130]
gi|298404347|gb|EAU87927.2| hypothetical protein CC1G_01574 [Coprinopsis cinerea okayama7#130]
Length = 627
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 100/363 (27%), Positives = 157/363 (43%), Gaps = 53/363 (14%)
Query: 21 LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLI 79
+++ + D T + +N NWI PPL +G H K MLL + G +R++V TANLI
Sbjct: 227 VIVVAQPDATGQASMKNVLPNWIKTTPPLRGGYGCQHMKFMLLFHKTGRLRVVVSTANLI 286
Query: 80 HVDWNNKSQGLWMQDFPLKDQNN---LSEECGFENDLIDYLSTLKW-PEFSANLPAHGNF 135
DW +W+QD PL+ ++ + F L+ L+ L P + H N
Sbjct: 287 SYDWREMENTVWLQDVPLRSSSSTAPVRATDDFPGTLLYMLAALNVVPALKIMINEHPNL 346
Query: 136 KIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGF---- 188
I +++++S L+ S+ G H G S+ K GH +L V+++ G
Sbjct: 347 PIKTIEELRERWDWSKVKAHLVPSIAGKHEGWPSVIKTGHPRLMAVVRKMAMRTGTGSQA 406
Query: 189 KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED----------KTPLGIGEPL-IVWPT 237
KK L Q SSLG+ +W+ E S +ED K P P+ I++PT
Sbjct: 407 KKLTLECQGSSLGNYTTQWLNEFYYSARGESAEDWLDRSKKQREKQPY---PPVKIIFPT 463
Query: 238 VEDVRCSLEGYAAGNAIPSPQKNVD-KDFLKKYWAKWKASHTGRS-----------RAMP 285
+ V+ S G G I ++ D K+F ++ + K S GRS R
Sbjct: 464 KKTVQESTFGEQGGGTIFCRRRQWDGKNFPRELFHDSK-SKAGRSLMHSKMIIGTLRDST 522
Query: 286 HIKTFARYNGQK------------LAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELG 331
H T + + + W + S N + +AWG L + N L I +YE+G
Sbjct: 523 HASTSQDGSETEDSDDEIQIIQPAVGWAYIGSHNFTPSAWGTLSGSSFNPTLNITNYEVG 582
Query: 332 VLI 334
V+
Sbjct: 583 VVF 585
>gi|145497459|ref|XP_001434718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401846|emb|CAK67321.1| unnamed protein product [Paramecium tetraurelia]
Length = 522
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/335 (27%), Positives = 156/335 (46%), Gaps = 43/335 (12%)
Query: 31 LEHMKR-NKPANWILHKP-PLPISF--GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 86
LE ++R N NW + KP L + G H K +L +P+ +RI++ + NL DW
Sbjct: 153 LEIIERYNNYPNWTVIKPSKLSTNMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIW 212
Query: 87 SQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKF 145
SQG+W+QDF + F++ L ++L + LP F+ + + +
Sbjct: 213 SQGMWIQDFKIGKSELDQTSQEFKSMLREFLYEI--------LPTSHKFEDLLKIKYDDY 264
Query: 146 NFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLV-YQFSSLGS 202
+F +RLI S+PG G+ L K+G M+L++V+ + C + K V YQ +S+G
Sbjct: 265 DFKDVNIRLITSIPGRFVGNQLFKYGMMRLQSVIYQELCNNKMEIPKQVCVTYQTTSIGQ 324
Query: 203 LDEKWMAELSSSMSSGFSEDKTPLGI--------GEPLIVWPTVEDVRCSLE-GYAAGNA 253
+D ++ + +G S K I + +++PT + + G N
Sbjct: 325 MDNNYV-DFVLQCCTGRSTKKINQMILNQQEEEQSKLKLIYPTADYIENQTHGGVDFANP 383
Query: 254 IPSPQKNVDK-DFLKKYWAKWKAS-----HTGRSRAMPHIKTF------ARYNGQKLAWF 301
+ Q++ + F K + K++ S HTG +PH+K N Q +
Sbjct: 384 LHLKQQSYESPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGLDEDINDQTSIY- 439
Query: 302 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
+ S N S+ AWG ++KN +QL I + ELGVL P
Sbjct: 440 -IGSHNFSQGAWGKMEKNATQLFISNTELGVLYPP 473
>gi|324542673|gb|ADY49650.1| Tyrosyl-DNA phosphodiesterase 1, partial [Ascaris suum]
Length = 133
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 85/180 (47%), Gaps = 53/180 (29%)
Query: 250 AGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKL--AWFLLTSA 306
AG A+P + + +L + KW+ GR+RAMPHIK+++ ++ + +W L+TSA
Sbjct: 2 AGGALPYQRNTAARQPYLLERMHKWRCERFGRTRAMPHIKSYSAFSDGRCLPSWLLITSA 61
Query: 307 NLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI 366
NLSKAAWG LQK SQL IRSYELGVL+ T+ +
Sbjct: 62 NLSKAAWGELQKKESQLAIRSYELGVLL--------------------------TDEDSL 95
Query: 367 QKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 426
Q +PY++P ++ D PW D YTK D++G WP
Sbjct: 96 QL------------------------LPYDMPLTKFEPGDQPWVCDDTYTKPDIHGATWP 131
>gi|74830335|emb|CAI39050.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
Length = 521
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 168/350 (48%), Gaps = 56/350 (16%)
Query: 27 SDGTLEHMKR-NKPANWILHKPPL--PISFG-THHSKAMLLIYPRGVRIIVHTANLIHVD 82
+D LE ++ N NW + KP I+FG + H K +L +P+ +RI++ + NL D
Sbjct: 147 NDKKLEIIEEFNGHPNWTVIKPSKLSSITFGGSFHPKIWILKFPKFIRIVIGSQNLHVGD 206
Query: 83 WNNKSQGLWMQDFPLKDQ--NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INP 139
W SQ +W+QDF + + + +S+E F+ L ++L + LP+ F+ +
Sbjct: 207 WTVWSQAMWIQDFQIGNSELDEVSKE--FKVGLKEFLDNI--------LPSSHKFEDLLK 256
Query: 140 SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF---KKSPLVYQ 196
+ ++F + +RLI S+PG TG+ + K+G M++++V+ F K+ + YQ
Sbjct: 257 IKYNDYDFQNINIRLITSIPGRFTGNQMNKYGMMRIQSVINSELKSSDFEIPKQVSIAYQ 316
Query: 197 FSSLGSLDEKWMAELSSSMSSGFSEDKTPL-----GIGEPL-----------IVWPTVED 240
+S+G LD ++ + S + + I + L +++PT +
Sbjct: 317 TTSIGQLDVNYVDFVQQCCSGQQIKQSQKIEQNNKSIAQMLFNQQEEKSILKLIYPTSDY 376
Query: 241 VRCSLEGYAAGNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIKTF 290
++ +AG +P Q+ + F K + +++ S H G +PH+K
Sbjct: 377 IQNQT---SAGPEYANPLFLRKQQYDNPKFPKNIFHRYQGSNYYYWHAGN---IPHLKVM 430
Query: 291 ARYN-GQKL---AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
+K+ + S NLS+AAWG L+KN +QL I + ELGVL P
Sbjct: 431 IITGIDEKIDDKTSIYIGSHNLSQAAWGRLEKNATQLFISNTELGVLYPP 480
>gi|330842084|ref|XP_003293015.1| hypothetical protein DICPUDRAFT_99531 [Dictyostelium purpureum]
gi|325076694|gb|EGC30460.1| hypothetical protein DICPUDRAFT_99531 [Dictyostelium purpureum]
Length = 564
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 140/321 (43%), Gaps = 48/321 (14%)
Query: 47 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 106
PPL S+ T H K +LL++P +RII+ ++N +D+++ +Q +W QDF +K + +
Sbjct: 218 PPLG-SYQTFHGKLILLVFPEFIRIIIPSSNPTQLDYDSLNQTIWFQDFQIKK----APK 272
Query: 107 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH---- 162
+ D+L TLK+ S P+ F +++FS A+ LI SVPG++
Sbjct: 273 QATPSKDNDFLKTLKYFLASIGCPS-------VKFLDEYDFSEASAHLIISVPGFYKHDG 325
Query: 163 TGSSLKK-----WGHMKLRTVLQ-------ECTFEKGFKKS------PLVYQFSSLGSLD 204
GS + + G KL +VL+ E T K+ YQ SS+G
Sbjct: 326 AGSGIIESDKPLMGIYKLESVLKKYYRNQDETTDYTVLDKNNQHCVRDFYYQASSIGGEK 385
Query: 205 EKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 264
+ +S PL I P W D R +A + + N DK
Sbjct: 386 GNFRNNFVKHLSPSIENSDKPLHIIYPTDQWIKSNDHRLQ---HAGCLFLSNKNYNNDKS 442
Query: 265 FLK----KY-WAKWKASHT----GRSRAM--PHIKTFARYNGQKLAWFLLTSANLSKAAW 313
KY + K H+ G S + P T + + K W S N S AAW
Sbjct: 443 CFSYLSPKYDYRKHLVYHSKVLVGTSTRLNKPLKDTLNQRSNIKYDWVYAGSHNFSSAAW 502
Query: 314 GALQKNNSQLMIRSYELGVLI 334
GA QKN +Q+ I +YE+GVL
Sbjct: 503 GAFQKNETQIQISNYEIGVLF 523
>gi|307108295|gb|EFN56535.1| hypothetical protein CHLNCDRAFT_144174 [Chlorella variabilis]
Length = 682
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 87/179 (48%), Gaps = 13/179 (7%)
Query: 6 WLLPACPVLAKI----PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAM 61
WLL ACP L + E+ G +R ++LH PP+P +G HHSK M
Sbjct: 508 WLLSACPDLRPLVTWRTKTRKALREASGAAAEGRR-----FVLHTPPVPDRWGRHHSKMM 562
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK 121
L+ Y GVR I+ T NL ++++Q ++ QDFP K FE L YL+ L+
Sbjct: 563 LIEYATGVRFILPTPNLQFHQLHSQTQAVFFQDFPPKQDGTSPPGSDFETSLARYLAALQ 622
Query: 122 WPEFSANLPAHGNFKIN-PSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
P A H + P ++ +FS+A L+ASVPG H G +GH +L +L
Sbjct: 623 LPGEEAK---HAQAGWHWPELVRRHDFSAARAVLVASVPGSHGGELAAAYGHKRLAALL 678
>gi|74834157|emb|CAI44465.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
Length = 532
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/344 (26%), Positives = 157/344 (45%), Gaps = 51/344 (14%)
Query: 31 LEHMKR-NKPANWILHKP-PLPISF--GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 86
LE ++R N NW + KP L + G H K +L +P+ +RI++ + NL DW
Sbjct: 153 LEIIERYNNYPNWTVIKPSKLSTNMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIW 212
Query: 87 SQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKF 145
SQG+W+QDF + F++ L ++L + LP F+ + + +
Sbjct: 213 SQGMWIQDFKIGKSELDQTSQEFKSMLREFLYEI--------LPTSHKFEDLLKIKYDDY 264
Query: 146 NFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLV-YQFSSLGS 202
+F +RLI S+PG G+ L K+G M+L++V+ + C + K V YQ +S+G
Sbjct: 265 DFKDVNIRLITSIPGRFVGNQLFKYGMMRLQSVIYQELCNNKMEIPKQVCVTYQTTSIGQ 324
Query: 203 LDEKWMAELSSSMSSGFSEDKTP-----LGIGEPL------------IVWPTVEDVRCSL 245
+D ++ + + + + P I + + +++PT + +
Sbjct: 325 MDNNYVDFVLQCCTGRVYKQQLPNEQSTKKINQMILNQQEEEQSKLKLIYPTADYIENQT 384
Query: 246 E-GYAAGNAIPSPQKNVDK-DFLKKYWAKWKAS-----HTGRSRAMPHIKTF------AR 292
G N + Q++ + F K + K++ S HTG +PH+K
Sbjct: 385 HGGVDFANPLHLKQQSYESPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGLDED 441
Query: 293 YNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
N Q + + S N S+ AWG ++KN +QL I + ELGVL P
Sbjct: 442 INDQTSIY--IGSHNFSQGAWGKMEKNATQLFISNTELGVLYPP 483
>gi|340503654|gb|EGR30196.1| tyrosyl-DNA phosphodiesterase family protein, putative
[Ichthyophthirius multifiliis]
Length = 547
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 152/323 (47%), Gaps = 39/323 (12%)
Query: 41 NWILHKPPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
NW L PP S G H K L+ + +R++V + NL DW+ S LW QDFPL
Sbjct: 201 NWTLIHPPKDASVSWGGAFHPKLWLIKFNEFLRVVVGSGNLHICDWSVWSNCLWYQDFPL 260
Query: 98 KDQNNLSEECG---------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 148
K Q N +E F N LID ++ + N+ KI+ +++++S
Sbjct: 261 KKQQNAQKEKNQQQWDFEGDFSNTLIDIVNRM----MPDNVKYQNLLKID---LEEYDYS 313
Query: 149 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 208
+ L+++VPG H +++K G KL ++ F + K+ + Y+ S+LG++D K++
Sbjct: 314 EVKIILLSNVPGRHL--NIQKHGLGKLNAIIN--AFGQQNKQKIITYESSTLGNIDNKFL 369
Query: 209 AELSSSM---SSGF---SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP---SPQK 259
E S+ S F S++ + +++PT + + C Y A P + +
Sbjct: 370 NEFYKSVNLASCDFQKNSKENIKDIQNQFKVIFPTKKYI-CQDTLYGIEYASPVILNEKY 428
Query: 260 NVDKDFLKKYWAKWKA--SHTGRSRAMPHIKTFARYNGQKL----AWFLLTSANLSKAAW 313
++ F+K + +++ + S +PH+K + + + + S N + AAW
Sbjct: 429 YSNEKFIKDVFYQFECPKGYFYHSGVIPHLKVMVVNDKEDQISDDSLIYVGSHNFTGAAW 488
Query: 314 GALQKNNSQLMIRSYELGVLILP 336
G +KN SQ+ + ELGV+ P
Sbjct: 489 GRYEKNYSQIYCMNTELGVVYPP 511
>gi|422293515|gb|EKU20815.1| tyrosyl-dna phosphodiesterase 1, partial [Nannochloropsis gaditana
CCMP526]
Length = 160
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 76/128 (59%), Gaps = 8/128 (6%)
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK 121
LL+Y G+R+++ T+N I VDW+NK+QG+W+QDFP + + +++ F DL +YL L
Sbjct: 3 LLLYEGGIRVMICTSNFIEVDWHNKTQGIWVQDFPKLREEDKADDSLFGRDLREYLQALN 62
Query: 122 -WPEFSANLPAHGNFKINPSF-------FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
+ + H K +P + +FSSA L+ASVPG HTG K+GH+
Sbjct: 63 GFENECGSRGPHSPGKGHPLLTEMIEQELPRIDFSSAQAVLLASVPGKHTGHDKFKFGHL 122
Query: 174 KLRTVLQE 181
KLR +L++
Sbjct: 123 KLRRLLEK 130
>gi|307211793|gb|EFN87774.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 445
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 121/255 (47%), Gaps = 25/255 (9%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
++D++WL + + ++ +++GE E + N A + +P FG+HH+K
Sbjct: 182 ILDVEWLCLQYLLAGQSTNMTILYGERTDE-EELDDNITAVQV----QMPFEFGSHHTKI 236
Query: 61 MLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLI 114
M+L Y G+R++V TANL DW N+ QG+W+ L + ++ CG F+ DL
Sbjct: 237 MILQYKDDGIRVVVSTANLYFEDWQNRMQGMWISPH-LPRLSKAAKRCGESPTNFKKDLQ 295
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL++ + P K +K +FS+ V LIAS PGY + + WG+ K
Sbjct: 296 RYLNSYQNPA----------LKRWRDLVRKADFSAVNVCLIASTPGYFRRTDVDLWGYKK 345
Query: 175 LRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP-- 231
L VL Q +K ++ Q S++GS K+ LS + + + P
Sbjct: 346 LANVLSQHVMLPSNARKWSIIAQSSAVGSFGPKYEGWLSKEIIRSMTRETKRDLKNYPKF 405
Query: 232 LIVWPTVEDVRCSLE 246
++P+V++ S +
Sbjct: 406 QFIYPSVKNYEQSFD 420
>gi|16768278|gb|AAL28358.1| GH27933p [Drosophila melanogaster]
Length = 161
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 70/110 (63%), Gaps = 6/110 (5%)
Query: 233 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFA 291
+++P+ +V S +G G +P + DK +LK Y +WK+S RSRAMPHIK++
Sbjct: 6 MIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRFRSRAMPHIKSYT 65
Query: 292 RYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 336
R+N Q + WF+LTSANLSKAAWG KN++ L I +YE GVL LP
Sbjct: 66 RFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 115
>gi|169620876|ref|XP_001803849.1| hypothetical protein SNOG_13643 [Phaeosphaeria nodorum SN15]
gi|160704126|gb|EAT79090.2| hypothetical protein SNOG_13643 [Phaeosphaeria nodorum SN15]
Length = 384
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 148/338 (43%), Gaps = 62/338 (18%)
Query: 142 FKKFNFSSAAVRLIASVPGYHTGSSLK-----KWGHMKLRTVLQECTFEKGFKKSP---L 193
+ ++FSS I SVP + K +G + L +L KK+ +
Sbjct: 58 LRDYDFSSIKAAFIGSVPSRQKPIATKPAQQTSFGWLGLEEILSNVPITANAKKASAPHI 117
Query: 194 VYQFSSLGSLDEK--WMAELSSSM---SSGFSEDKTPLGIGEPL---------------- 232
V Q SS+ +L W+ + S + ++G E+ +P
Sbjct: 118 VMQVSSIATLGAAPTWLNKFQSVLCRSAAGQLEEAPAASSSKPPKLFSKGGMSSAKQDKP 177
Query: 233 ------IVWPTVEDVRCSLEGYAAGNAIP----SPQKNVDKDFLKKYWAKWKASHTGRSR 282
I++PT ++VR SL+GY +G++I S Q+ ++L + WKA+ S+
Sbjct: 178 LSPKFNIIFPTSDEVRTSLDGYDSGSSIHMKLLSIQQQKQLEYLHPLFCHWKATPDSNSK 237
Query: 283 -------AMPHIKTFARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 332
A PHIKT+ RY+ +K + W ++TSANLSK AWG + + I+S+E GV
Sbjct: 238 GQAMRGPAAPHIKTYIRYSDEKHKTIDWAMVTSANLSKQAWGDVVNKKDETWIQSWEAGV 297
Query: 333 LILPSAKRHGCGFSCTSNIVPSEIKS--GSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 390
++ P S + +VP K G+ + S K G+ + A V+
Sbjct: 298 VVWPEL----FAESKEAIMVPVFGKDMPGTEDVSSQDVNK-------GADEGQAGKTVIG 346
Query: 391 LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 428
+PY+LP Y++++ PW + D G+ WP +
Sbjct: 347 FRMPYDLPLTPYTAKEKPWCAQMPSAEPDWMGRAWPGY 384
>gi|451998304|gb|EMD90769.1| hypothetical protein COCHEDRAFT_1179942 [Cochliobolus
heterostrophus C5]
Length = 567
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 146/343 (42%), Gaps = 34/343 (9%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---------QGLW 91
N +H PP+ + HSK MLL P +RI++ TAN+I DW + ++
Sbjct: 217 NLKIHFPPMDGMIQSMHSKFMLLFGPNKLRIVIPTANMIQTDWGEVANDWQPGVMENSIF 276
Query: 92 MQDFPLKDQNNLSEE---CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 148
+ D P + S + F +L+ +L K PE F+FS
Sbjct: 277 LIDLPRRGNETTSTQENMTRFGQELMYFLEMQKVPEMVLQ------------GILNFDFS 324
Query: 149 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 207
+ + + S+ G H S G L +Q+ + ++ L Y SSLG++++ +
Sbjct: 325 QTSHLAFVHSIGGSHKTESEHPTGLPGLARAIQDLRLDN-VEQIELDYAASSLGAINDSF 383
Query: 208 MAELS-SSMSSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 264
++ L ++ F+ D + I +PT E V S+ G G I Q+ + D
Sbjct: 384 LSRLYLAACGKCFAADTATVSDVRRHIRIYFPTNETVEKSIGGPDCGGIISLSQQRYNAD 443
Query: 265 -FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQ--KNN- 320
F ++ +++S G + R +G+ + W + SANLS++AWG + KN
Sbjct: 444 TFPRECLRDYESSRAGMLSHNKLLLARGRKDGRPVGWVYVGSANLSESAWGGQKVIKNGK 503
Query: 321 -SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 362
L IR++E GV++ R G VP I G+ E
Sbjct: 504 MGSLNIRNWECGVVMTVPEDRLGGRDKDRDKTVPMSIFEGTVE 546
>gi|409075791|gb|EKM76167.1| hypothetical protein AGABI1DRAFT_45345 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 625
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 145/344 (42%), Gaps = 54/344 (15%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 99
W+ PPL FG H K MLL Y G +R+++ TANLI DW + +W+QD P++
Sbjct: 244 TWVKTTPPLRGGFGCQHMKFMLLFYKNGNLRVVISTANLIAYDWRDMENSVWLQDLPMRP 303
Query: 100 QNNLSEECG--FENDLIDYLSTLK-WPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRL 154
Q + F + + L + P LP H N + ++++S V L
Sbjct: 304 QLMPPDPKAKDFPSIMQQVLHAVNVAPALRTMLPDHPNIPLRTIEDLRMRWDWSKVKVHL 363
Query: 155 IASVPGYHTG-SSLKKWGHMKLRTVLQECTFE--KGFKKSPLVY--QFSSLGSLDEKWMA 209
+AS+ G H G S+ K GH +L ++ +G K ++ Q SSLG+ +W+
Sbjct: 364 VASIAGKHEGWPSIVKTGHPRLMMAIRTMGLRPSRGLGKGNMIIECQGSSLGNFTTQWLN 423
Query: 210 ELSSSMSSGFSED--KTPLGIGEPL------IVWPTVEDVRCSLEGYAAGNAIPSPQKN- 260
E S +ED P E L I++PT + V+ S G G I +K
Sbjct: 424 EFHWSARGESAEDWLDEPKRRREKLPYPSVRILFPTKKIVQESASGEPGGGTIFCRRKQW 483
Query: 261 VDKDFLKK--YWAKWKA--------------SHTGRSRAM------------PHIKTFAR 292
K+F + Y +K KA HT + A P +K
Sbjct: 484 AAKNFPRDKFYVSKSKAGPVLMHSKMIIATIQHTNPASASLNREGSDTEEDEPEVKIIEP 543
Query: 293 YNGQKLAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 334
G W + S N + +AWG L + N L I +YE+G++
Sbjct: 544 AVG----WAYVGSHNFTPSAWGTLSGSAFNPILNITNYEIGIVF 583
>gi|238496339|ref|XP_002379405.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
gi|220694285|gb|EED50629.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
Length = 491
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 121/259 (46%), Gaps = 41/259 (15%)
Query: 188 FKKSPLVYQFSSLGSLDEKWMAELS-SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLE 246
FK+ L Y +KW+ ++ +S+S + + P + I++PT +++R SL
Sbjct: 250 FKRDLLAYLTEYGPKKTDKWLKDVMFASLSPASTSTRQP----KYSIIFPTADEIRRSLN 305
Query: 247 GYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH------------TGRSRAMPHIKTF 290
GY +G +I S + +++ Y W H GR RA PHIKT+
Sbjct: 306 GYGSGGSIHMKLQSAAQQKQLQYMRPYLRHWAGDHDTAEPSHTSKQDAGRRRAAPHIKTY 365
Query: 291 ARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 346
R++ + + W ++TSANLS AWGA + ++ I S+E+G+++ P
Sbjct: 366 IRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNASGEVRICSWEIGIVVWPQLYVQDTE-- 423
Query: 347 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 406
++ +VP+ K + E + + ++ T V+ L +PY+LP Y++ D
Sbjct: 424 -SATMVPT-FKRDTPEPLENKDSETTPDT------------VIGLRMPYDLPLTPYAAHD 469
Query: 407 VPWSWDKRYTKKDVYGQVW 425
PW ++ + D GQ W
Sbjct: 470 TPWCATAQHLEPDWLGQTW 488
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 79/150 (52%), Gaps = 25/150 (16%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 106
+P +FGTHHSK M+L+ + V++++HTAN+I DW N Q +W PL+ ++ E+
Sbjct: 182 MPEAFGTHHSKMMVLLRHDDLVQVVIHTANMIPGDWANMCQAVWRSPLLPLQKTDDRVED 241
Query: 107 ------CGFENDLIDYLS------TLKWPE---FSANLPAHGNFKINPSFFKKFNFSSAA 151
F+ DL+ YL+ T KW + F++ PA + + P + F +
Sbjct: 242 LILGSGARFKRDLLAYLTEYGPKKTDKWLKDVMFASLSPASTSTR-QPKYSIIFPTADEI 300
Query: 152 VRLIASVPGYHTGSSLKKWGHMKLRTVLQE 181
R S+ GY +G S+ HMKL++ Q+
Sbjct: 301 RR---SLNGYGSGGSI----HMKLQSAAQQ 323
>gi|355723700|gb|AES07977.1| tyrosyl-DNA phosphodiesterase 1 [Mustela putorius furo]
Length = 381
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 74/129 (57%), Gaps = 10/129 (7%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 208 DVDWLIKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 267
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLI 114
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ PL Q + F+ DLI
Sbjct: 268 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLS--PLYPQIIHGTHRSGESTTHFKADLI 325
Query: 115 DYLSTLKWP 123
YL+ P
Sbjct: 326 SYLTAYNAP 334
>gi|384490985|gb|EIE82181.1| hypothetical protein RO3G_06886 [Rhizopus delemar RA 99-880]
Length = 338
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 141/313 (45%), Gaps = 45/313 (14%)
Query: 41 NWILHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF---- 95
N I+ +PPL + +G H+K MLL +R+++ +AN++ D+ ++MQDF
Sbjct: 18 NRIIIQPPLKDNKYGVFHNKLMLLFRSSSLRVVIGSANMVACDYEELENVVFMQDFPELI 77
Query: 96 -PLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 154
PLK +++ E F D+ D L ++ P K++FS A R+
Sbjct: 78 VPLKSESDFPE---FAKDICDVLDKMRVPTTVKEE------------LLKYDFSKAKARI 122
Query: 155 IASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLV-YQFSSLGSLDEKWMAELS 212
+ASV G G KK+GH +L ++++ T P V Q SSLGSL ++ E+
Sbjct: 123 VASVSGVFEGEEEYKKYGHTRLADIVRDITGPLDPNNYPKVEMQTSSLGSLSVSYLQEIY 182
Query: 213 SSMS--SGFSEDKTPLGIGE-----PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 264
S S FS+ K + P+ I++PT + V S G A ++I
Sbjct: 183 QSFCGISSFSDGKAVRSSLQKNQLPPIDIIFPTRDTVTSSRYGGAGADSIC--------- 233
Query: 265 FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNS--- 321
F W K ++ H + A + + L + S N + +AWG + +
Sbjct: 234 FNTATWRKPTFPKQVMCDSISH-RQGALMHSKALTSMIFRSHNSTTSAWGKFTVSKASKL 292
Query: 322 -QLMIRSYELGVL 333
+L I ++ELGV+
Sbjct: 293 PKLSISNWELGVV 305
>gi|449686459|ref|XP_002156800.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like, partial [Hydra
magnipapillata]
Length = 206
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 114/235 (48%), Gaps = 64/235 (27%)
Query: 49 LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG 108
LPI++GTHH RI W KS ++D +N+
Sbjct: 19 LPIAYGTHH------------RI-----------W--KSPLFAIKDVAYDGKND-----P 48
Query: 109 FENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTGSS 166
F+ DL++YLS+ +GN K+ K+++ SSA V L++SVPG +TG
Sbjct: 49 FKEDLLEYLSS------------YGNSKLGMYAEKLKEYDMSSANVHLVSSVPGRYTGFK 96
Query: 167 LKKWGHMKLRTVLQECTFEKGFKKS--PLVYQFSSLGSLDE--------KWMAELSSSMS 216
+ +WGH+KLR +L K P++ QFSS+GSL +W++ LS+
Sbjct: 97 MHQWGHLKLRKLLLSYGPSKDLVNENWPIIGQFSSIGSLGSESSSWLCGEWLSSLSTCKD 156
Query: 217 SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-----SPQKNVDKDFL 266
E K L +++PT+E+VR SLEGY+AG ++P + ++ KDFL
Sbjct: 157 DELKESKANLK-----LIYPTIENVRNSLEGYSAGCSLPYGIQVAMKQRYLKDFL 206
>gi|440797312|gb|ELR18403.1| Tyrosyl-DNA phosphodiesterase [Acanthamoeba castellanii str. Neff]
Length = 569
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/223 (30%), Positives = 107/223 (47%), Gaps = 33/223 (14%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPI-SFG------- 54
D++W+L P IP LV H E ++ ++ N + PPL + FG
Sbjct: 65 DVEWVLSVIP--PTIPITLVRHWEEPDREGEVRISR--NIRVIHPPLALPGFGGGQAMRA 120
Query: 55 THHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FEN 111
H+K MLL Y +R++V +ANL D+ Q +W QDFP K Q + ++ FE
Sbjct: 121 KMHAKLMLLRYRDNTLRVVVTSANLAQPDYELVGQTVWYQDFPKKQQKSSGQQPASPFEE 180
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS-SLKKW 170
L +L LK E F ++++FS AA L+ SVPG+H G +
Sbjct: 181 TLTQFLVALKADE---------------GFLREYDFSKAAADLVVSVPGFHRGKHKMDAV 225
Query: 171 GHMKLRTVLQECTFEKG--FKKSPLVYQFSSLGSLDEKWMAEL 211
GH +LR +L++ + + + YQ SSLG+L E +++E
Sbjct: 226 GHTRLRALLRDFQWPPADELRDDNIYYQTSSLGALYESFVSEF 268
>gi|324522792|gb|ADY48131.1| Tyrosyl-DNA phosphodiesterase, partial [Ascaris suum]
Length = 306
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 114/230 (49%), Gaps = 20/230 (8%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM------KRNKPANWILHKPPLPISFG 54
M+D+ WLL P + +I GE++GT H+ +R K N + + L + +G
Sbjct: 75 MIDLHWLLSQYPERCSAYPISIIVGENNGT-NHLDVRAEARRCKADNVSVGRARLVLPYG 133
Query: 55 THHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDL 113
THHSK ++ + +++ TANL+ DW++K+Q + P+ + + F DL
Sbjct: 134 THHSKLSIFETDSEMIHVVISTANLLQNDWDSKTQAFYHCSAPIVNGEVEEGQNNFRKDL 193
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 173
I YL+ ++ G + +FS R+I+S+PGYH G ++GH+
Sbjct: 194 ISYLNAY------SSSSDFGMIEYWRDRIANADFSDVNARIISSIPGYHVGDQKDRYGHL 247
Query: 174 KLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK---WM-AELSSSMSSGF 219
+LR VL+ + KK V QFSS+GSL K W+ A+ S++ G
Sbjct: 248 RLRRVLRSLQLD--LKKPSFVAQFSSIGSLGPKPDSWLTAQFLQSLAGGI 295
>gi|145527276|ref|XP_001449438.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|74834160|emb|CAI44466.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
gi|124417026|emb|CAK82041.1| unnamed protein product [Paramecium tetraurelia]
Length = 532
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/345 (26%), Positives = 151/345 (43%), Gaps = 62/345 (17%)
Query: 35 KRNKPANWILHKPPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW 91
K N NW++ KP S G H K +L +P+ +RI++ + NL DW SQ +W
Sbjct: 158 KYNNYPNWMVIKPSKLGSCMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIWSQAMW 217
Query: 92 MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSA 150
+QDF + F+ L ++L + LP F+ + + ++F
Sbjct: 218 IQDFKIGKSELDQGSQEFKTMLREFLYEI--------LPTSHKFEDLLKIKYDDYDFKDV 269
Query: 151 AVRLIASVPGYHTGSSLKKWGHMKLRTVL--QECTFEKGFKKSPLV-YQFSSLGSLDEKW 207
++LI S+PG G+ L K+G M+L++VL + C + K V YQ +S+G LD+ +
Sbjct: 270 NIKLITSIPGRFVGNQLFKYGMMRLQSVLYYELCNNKMEIPKQVCVTYQTTSIGQLDDNY 329
Query: 208 M----------------------AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSL 245
+ +L+ + + E+++ L +++PT + +
Sbjct: 330 IDFALQCCTGKVYKQPLASEQNNKKLNQMILNQQEEEQSKLK-----LIYPTADYIENQT 384
Query: 246 EGYAAGNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIK----TFA 291
G G +P Q + F K + K++ S HTG +PH+K T
Sbjct: 385 HG---GVDFANPLYLKKQLYENPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGL 438
Query: 292 RYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
+ S N S+ AWG ++KN +QL I + ELGVL P
Sbjct: 439 DEEINDYTSIYIGSHNFSQGAWGKMEKNATQLYIANTELGVLYPP 483
>gi|146413473|ref|XP_001482707.1| hypothetical protein PGUG_04662 [Meyerozyma guilliermondii ATCC
6260]
Length = 537
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 112/425 (26%), Positives = 170/425 (40%), Gaps = 100/425 (23%)
Query: 49 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 106
LP FGTHH+K M+ + + +++ T N+ +D +Q W L S
Sbjct: 163 LPDRFGTHHTKMMINFFENQLCEVVIMTCNITKLDIGGLTQMCWRSGRLALGTTKPDSMG 222
Query: 107 CGFENDLIDYLSTLKWPEFS--ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 164
F+ DL DYL K + S AN +++FSS V L+AS PGY
Sbjct: 223 YRFQRDLTDYLKRYKKKKLSELANR------------IMEYDFSSINVELVASAPGYFDM 270
Query: 165 SSL----KKWGHMKLRTVLQECTF--EKGFKKSPLVYQFSSLG--SLDEKWMAELSSSMS 216
+ + +G KL VL+ + K ++ Q SS+ + EK+ S
Sbjct: 271 DDITTNSEVYGFGKLYQVLKRNNLLIKDTSKHHNMLSQVSSIAYPVVSEKFHT------S 324
Query: 217 SGFSEDKTPLGIGEP-----------------------LIVWPTVEDVRCSLEGYAAGNA 253
S F+ PL +P IV+PT ++V + G+ AG +
Sbjct: 325 SIFTHILCPLIFDDPQFSMLSPGRETTRNHQKLYNYTPTIVYPTAQEVSQANVGFGAGAS 384
Query: 254 I------PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNGQK---LAWFL 302
I +N K + Y KW KA GR+ PH+K + NG + + W L
Sbjct: 385 IHFNYTRSHAHENQYKQNILPYLHKWTSKADTAGRNHVPPHVKLYLCDNGDEWKSIKWAL 444
Query: 303 LTSANLSKAAWGALQ-KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 361
L S NLSK AWGA + KN + + SYELGVL+ G + T +K+
Sbjct: 445 LCSHNLSKQAWGAPKSKNGRKYHVASYELGVLV------PGTPHTLTPTYPHDHLKNC-- 496
Query: 362 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDV 420
+ L +P+++PP+ Y D PWS + + KD
Sbjct: 497 --------------------------LAPLRLPFKVPPEPYGDSDQPWSPHMNFGELKDR 530
Query: 421 YGQVW 425
+G +
Sbjct: 531 FGNTY 535
>gi|190348157|gb|EDK40564.2| hypothetical protein PGUG_04662 [Meyerozyma guilliermondii ATCC
6260]
Length = 537
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 115/426 (26%), Positives = 173/426 (40%), Gaps = 102/426 (23%)
Query: 49 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 106
LP FGTHH+K M+ + + +++ T N+ +D +Q W L S
Sbjct: 163 LPDRFGTHHTKMMINFFENQLCEVVIMTCNITKLDIGGLTQMCWRSGRLALGTTKPDSMG 222
Query: 107 CGFENDLIDYLSTLKWPEFS--ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 164
F+ DL DYL K + S AN +++FSS V L+AS PGY
Sbjct: 223 YRFQRDLTDYLKRYKKKKLSELANR------------IMEYDFSSINVELVASAPGYFDM 270
Query: 165 SSL----KKWGHMKLRTVLQECTF--EKGFKKSPLVYQFSSLG--SLDEKWMAELSSSMS 216
+ + +G KL VL+ + K ++ Q SS+ + EK+ S
Sbjct: 271 DDITTNSEVYGFGKLYQVLKRNNLLIKDTSKHHNMLSQVSSIAYPVVSEKFHT------S 324
Query: 217 SGFSEDKTPLGIGEP-----------------------LIVWPTVEDVRCSLEGYAAGNA 253
S F+ PL +P IV+PT ++V + G+ AG +
Sbjct: 325 SIFTHILCPLIFDDPQFSMLSPGRETTRNHQKLYNYTPTIVYPTAQEVSQANVGFGAGAS 384
Query: 254 I------PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNGQK---LAWFL 302
I +N K + Y KW KA GR+ PH+K + NG + + W L
Sbjct: 385 IHFNYTRSHAHENQYKQNILPYLHKWTSKADTAGRNHVPPHVKLYLCDNGDEWKSIKWAL 444
Query: 303 LTSANLSKAAWGALQ-KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 361
L S NLSK AWGA + KN + + SYELGVL+ G+
Sbjct: 445 LCSHNLSKQAWGAPKSKNGRKYHVASYELGVLV-----------------------PGTP 481
Query: 362 ETSQIQKTKLVTLTW-HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KD 419
T +T T+ H S + L +P+++PP+ Y D PWS + + KD
Sbjct: 482 HT--------LTPTYPHDHSKNCLAP----LRLPFKVPPEPYGDSDQPWSPHMNFGELKD 529
Query: 420 VYGQVW 425
+G +
Sbjct: 530 RFGNTY 535
>gi|358056499|dbj|GAA97673.1| hypothetical protein E5Q_04351 [Mixia osmundae IAM 14324]
Length = 686
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 153/354 (43%), Gaps = 43/354 (12%)
Query: 2 VDIDWLLPAC--PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
D+DWL+ P L K +L + G +D + N P + LH PP+ + G H K
Sbjct: 318 TDLDWLVAHVLPPELGKQ-VLLALPGPADAPITSFVPNHP-HIKLHCPPVCRTSGAMHIK 375
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
+L++Y R+ + TANL+ DW +W+QDFP Q +L++ F L L
Sbjct: 376 LILVVYDDFCRVAIPTANLVPYDWQQIENAVWIQDFP--RQGSLAKPTRFAQTLHTTLRL 433
Query: 120 LKWPEFSAN--LPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRT 177
L E S N LP +F + + R+I S PG SS + GH L
Sbjct: 434 LCIEEDSRNAVLPLDVDFS-----------AGISARMILSTPG---SSSSEPNGHKLLGQ 479
Query: 178 VLQECTFEKGFKKS-PLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP---LGIGEPL- 232
LQ+ + L Q SS+G+L+++W+ E SS+ P EPL
Sbjct: 480 ALQDLHLLPARDQDVRLECQGSSIGALNDEWLLEFYSSICGRPVRTMFPKVQTANFEPLR 539
Query: 233 ----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 288
IV+PT+ ++ + G A G + + + K + S + R+ + H K
Sbjct: 540 TLFRIVFPTLRNIENTHLGTAGGGTLFCNRSTWENRHFPKEC--MRQSTSKRAGVVMHTK 597
Query: 289 -TFARYNGQKLA-------WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
A++ + A W + S N + AAWG + S + + ELG+++
Sbjct: 598 MILAQFRMSRHAQSDRPPGWLYVGSHNFTAAAWG--KSTASSFKVSNCELGIVM 649
>gi|170097685|ref|XP_001880062.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164645465|gb|EDR09713.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 609
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 155/376 (41%), Gaps = 57/376 (15%)
Query: 4 IDWL------LPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
+DW+ PA PV ++ + D T + +N +WI P L G H
Sbjct: 208 LDWMWIYQFFDPATPV--------IMVAQPDQTGRAIIKNVLPHWIKTTPYLRGGHGCQH 259
Query: 58 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDY 116
K MLL Y G +R++V TANLI DW + +W+QD PL+ + + + N D+
Sbjct: 260 MKFMLLFYRNGRLRVVVSTANLIEYDWRDMENSVWLQDVPLR-SSPIPHDPKATN---DF 315
Query: 117 LSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHM 173
S ++ S N+ H N + ++++S V L+ S+ G H G ++ K GH
Sbjct: 316 PSIIQRVLNSLNVKPHPNLALKSIEDLRCRWDWSKVKVHLVPSIAGKHEGWPAVIKTGHP 375
Query: 174 KLRTVLQECTFEKGFKKSP---LVYQFSSLGSLDEKWMAELSSSMSSGFSED--KTPLGI 228
+L ++E G K+ L Q SSLG +WM E S +ED P
Sbjct: 376 RLMMAVREMAMRTGKGKAKELILECQGSSLGIYTTQWMNEFHWSARGESAEDWLDEPKKR 435
Query: 229 GEPL------IVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKKYWAKWK------- 274
E L I +P+ V+ S G G I +K K+F + ++ K
Sbjct: 436 REKLPYPPIKIFFPSKRTVQESALGEKGGGTIFCRRKQWSTKNFPRDHFYDSKSKGGPVL 495
Query: 275 -------ASHTGRSRAMPHIKTFARYNGQK-------LAWFLLTSANLSKAAWGALQKN- 319
A+H +R + L W L S N + +AWG L +
Sbjct: 496 MHSKMIIATHQETTRKTLQAAESSSEEDDDIEVVDPPLGWSYLGSHNFTPSAWGNLSGSS 555
Query: 320 -NSQLMIRSYELGVLI 334
N L I +YELG++
Sbjct: 556 FNPVLNIANYELGIVF 571
>gi|118399033|ref|XP_001031843.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila]
gi|89286177|gb|EAR84180.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila SB210]
Length = 562
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 151/349 (43%), Gaps = 53/349 (15%)
Query: 41 NWILHKPP---LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
N+ + PP L ++G HSK +L +P+ +RI++ T NL + W N S +W +DF L
Sbjct: 190 NFTIVYPPKDYLSKTWGCFHSKLWILKFPKFLRIVIGTGNLRILHWTNWSNIIWFKDFEL 249
Query: 98 KDQN-NLSEECGFENDLIDYLST-LKWPEFSANLPAHGNFKINPSF-------------- 141
Q +S+ + N I S +K N + +N F
Sbjct: 250 IPQQIQVSQSLDYFNSNISIGSKGVKVVNLEKNYRNINDVDMNEDFIDVLNEFIDKICPY 309
Query: 142 ----------FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 191
+ + L++S+PG +GS + +G M++R + Q K
Sbjct: 310 FDVKEMLDINLRNYQIKGINFMLVSSLPGKFSGSQIHDYGKMRIRKICQVFNPRNIDSKK 369
Query: 192 PLVYQFSSLGSLDEKWMAE-----LSSSMSSGFS-EDKT----PLGIGEPLIVWPTVEDV 241
L Q +SLG++D ++ E L S +DK P E +++P+ + +
Sbjct: 370 VLYSQSTSLGTIDRTFVNEFLFCFLPYQFCSEIELKDKVKKNDPEKNDEIRLIFPSKDYI 429
Query: 242 RC-SLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKA--------SHTGRSRAMPHIKTF- 290
+ +L+G + + K K+ FLK + +++ S + +PH KT
Sbjct: 430 QNKTLDGAGYSDTLFLTSKRYQKESFLKNIFYQFQCKQMDSLGESQDKQKGIIPHFKTMI 489
Query: 291 -ARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
NG+ + + S N S+AAWG L K+N+QL I + ELG+LI P
Sbjct: 490 VCEQNGEINDDSIIYIGSHNFSEAAWGKLNKDNTQLYISNTELGILIPP 538
>gi|301770841|ref|XP_002920838.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like, partial
[Ailuropoda melanoleuca]
Length = 172
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 74/127 (58%), Gaps = 6/127 (4%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK-RNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 6 DVDWLIKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 65
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNLSEE--CGFENDLIDY 116
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ P+ + S E F+ DLI Y
Sbjct: 66 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPPIIHGTHRSGESTTHFKADLISY 125
Query: 117 LSTLKWP 123
L P
Sbjct: 126 LMAYNAP 132
>gi|426193767|gb|EKV43700.1| hypothetical protein AGABI2DRAFT_121836 [Agaricus bisporus var.
bisporus H97]
Length = 635
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 144/343 (41%), Gaps = 54/343 (15%)
Query: 42 WILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ 100
W+ PPL FG H K MLL Y G +R+++ TANLI DW + +W+QD P++ Q
Sbjct: 255 WVKTTPPLRGGFGCQHMKFMLLFYKNGNLRVVISTANLIAYDWRDMENSVWLQDLPMRPQ 314
Query: 101 NNLSEECG--FENDLIDYLSTLK-WPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLI 155
+ F + + L + P L H N + ++++S V L+
Sbjct: 315 LMPPDPKAKDFPSIMQQVLHAVNVAPALRTMLSDHPNIPLRTIEDLRMRWDWSKVKVHLV 374
Query: 156 ASVPGYHTG-SSLKKWGHMKLRTVLQECTFE--KGFKKSPLVY--QFSSLGSLDEKWMAE 210
AS+ G H G S+ K GH +L ++ +G K ++ Q SSLG+ +W+ E
Sbjct: 375 ASIAGKHEGWPSIVKTGHPRLMMAIRTMGLRPSRGLGKGNMIIECQGSSLGNFTTQWLNE 434
Query: 211 LSSSMSSGFSED--KTPLGIGEPL------IVWPTVEDVRCSLEGYAAGNAIPSPQKN-V 261
S +ED P E L I++PT + V+ S G G I +K
Sbjct: 435 FHWSARGESAEDWLDEPKRRREKLPYPPVRILFPTKKIVQESASGEPGGGTIFCRRKQWA 494
Query: 262 DKDFLKK--YWAKWKA--------------SHTGRSRAM------------PHIKTFARY 293
K+F + Y +K KA HT + A P +K
Sbjct: 495 AKNFPRDKFYVSKSKAGPVLMHSKMIIATIQHTNPASASLNREGSDTEEDEPEVKIIEPA 554
Query: 294 NGQKLAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 334
G W + S N + +AWG L + N L I +YE+G++
Sbjct: 555 VG----WAYVGSHNFTPSAWGTLSGSAFNPILNITNYEIGIVF 593
>gi|392587577|gb|EIW76911.1| phospholipase D nuclease [Coniophora puteana RWD-64-598 SS2]
Length = 667
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 102/441 (23%), Positives = 182/441 (41%), Gaps = 60/441 (13%)
Query: 18 PHVLVIH-GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHT 75
PH VI + D + +N NW++ P L +G H K MLL Y G +R+++ T
Sbjct: 244 PHTPVIFVAQPDSSGNAALKNVLPNWLMTTPFLRNGYGCQHMKFMLLFYKDGRLRVVIST 303
Query: 76 ANLIHVDWNNKSQGLWMQDFPLKDQ---NNLSEECGFENDLIDYLSTLKWPEFSANLPA- 131
ANLI DW + +W+QD P + ++ + F + + + L ++ AN+ A
Sbjct: 304 ANLIDYDWRDIENAVWLQDVPRRPSPIPHDPKAKDDFPSIMQNVLRSVNVRPALANMLAN 363
Query: 132 -HGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKG 187
H N + ++FS V+L+ S+ G H G ++ + GH +L +++ G
Sbjct: 364 DHPNLPLQTIADLRTHWDFSKVKVKLVPSIAGKHEGWPAVVQSGHPRLMKAVRDMGLRTG 423
Query: 188 FKKSP----LVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVW 235
K+ + Q SS+G+ +W+ E S +ED +T L I++
Sbjct: 424 KGKAAKELVVECQGSSIGTYTTQWLNEFHHSARGESAEDWLDAPRSRRTKLPFPPVKIIF 483
Query: 236 PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR----------SRAMP 285
P+++ VR + G G + F K+ A+W+ + R R +
Sbjct: 484 PSLKRVRATALGERGGGTM----------FCKR--AQWEGKNFPRGSFYESESRGGRTLM 531
Query: 286 HIK-TFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 344
H K + L + A SK+A Q +S+ ++ I + G
Sbjct: 532 HTKMIIGTFRSNPL---VSVGAGTSKSAPQKKQLEDSETEPEDDDVDPDIQIVNEPIGWA 588
Query: 345 FSCTSNIVPSE--IKSGST---ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 399
+ + N PS SGS+ + I + + + D S ++ PP
Sbjct: 589 YVGSHNFTPSAWGTLSGSSFNPSLNNINYELGIVMPLYNDEDIDRVS-------CFKHPP 641
Query: 400 QRYSSEDVPWSWDKRYTKKDV 420
++Y S+DVPW D+ +++
Sbjct: 642 KKYGSDDVPWMQDESLILREI 662
>gi|118380757|ref|XP_001023542.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila]
gi|89305309|gb|EAS03297.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila SB210]
Length = 584
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 151/346 (43%), Gaps = 52/346 (15%)
Query: 41 NWILHKPPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
NW L PP +S G H K L+ + +R+++ + NL DW+ S LW QDFPL
Sbjct: 217 NWTLIHPPKDVSSSWGGAFHPKLWLIKFSSFLRVVIGSGNLHVSDWSVWSNCLWYQDFPL 276
Query: 98 K-------DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 150
Q S + FE D L+ L + + KIN +++S
Sbjct: 277 NANKKEKTQQKPSSPKWDFEGDFKITLTELVKKMMPSGINYQDLLKIN---LDDYDYSEV 333
Query: 151 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQECT-FEKGFKKSP----------LVYQFSS 199
+ LI+S+ G HT + K+G K+ ++Q T EK P + YQ +S
Sbjct: 334 KIILISSIVGRHT--DIYKYGRGKMYKIIQAFTQNEKNITNQPNNNLTQNQKIITYQCTS 391
Query: 200 LGSLDEKWMAELSSSMSSG-----FSEDKTPLGIGEPLI------VWPTVEDV-RCSLEG 247
LG++D ++ E + ++ +DK LI ++PT E + ++ G
Sbjct: 392 LGNIDNTFLNEFYTCATANKPITELKKDKANKKQDPNLIEQKFRLIFPTAEYIYEDTIYG 451
Query: 248 YAAGNAIPSPQKNVDKD-FLKKYWAKWKAS-----HTGRSRAMPHIKTFARYNG----QK 297
+ + QK +K+ F K + ++ + HTG A+PH+KT + +
Sbjct: 452 PEYASPVILNQKYYEKESFPKSIFHQFCSPDNYFYHTG---AIPHLKTMVVTDNDLQIKD 508
Query: 298 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 343
+ + S N + AAWG +K+ SQ+ + ELG+ I P + C
Sbjct: 509 DSIVYIGSHNFTAAAWGRFEKDYSQIYNSNTELGI-IYPPMEDSAC 553
>gi|449544019|gb|EMD34993.1| hypothetical protein CERSUDRAFT_54191, partial [Ceriporiopsis
subvermispora B]
Length = 621
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 152/360 (42%), Gaps = 56/360 (15%)
Query: 18 PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTA 76
P ++V H + G+ E +K P NWI P L G H K MLL Y G +R++V TA
Sbjct: 214 PVIMVAH-DQQGSNETIKEVLP-NWIKTTPFLRNGMGCMHIKFMLLFYKSGRLRVVVTTA 271
Query: 77 NLIHVDWNNKSQGLWMQDFPLKDQ--NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN 134
N I DW + W+QD P + N + F I L TL N+ H N
Sbjct: 272 NFIEHDWRDIENTAWVQDIPKRPTPIPNDPKADDFPAAWIRVLRTL-------NI-QHPN 323
Query: 135 FKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFE--KGFK 189
I K++FS AV+L+ S+ G H G ++ K GH L +++ + KG K
Sbjct: 324 LPIQRLEDLRMKWDFSKVAVKLVPSLAGKHEGWPNVIKTGHTGLMKAVRDMGAQVPKG-K 382
Query: 190 KSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVEDV 241
+ L Q SS+G+ +WM E S ++ ++ L +++P++ V
Sbjct: 383 QMVLECQGSSIGTYSTQWMNEFHCSARGESAQSWLDVSRARRSKLPWPAVKLIFPSLRTV 442
Query: 242 RCSLEGYAAGNAIPSPQKNVDK-DFLKKYW------------------AKWKASHTGRSR 282
R S+ G G + + D F K+ + A ++++ T +R
Sbjct: 443 RESVLGEPGGGTMFCRRNQWDAPKFPKELFHDSNSKRGKVLMHSKMIIATFRSASTPFTR 502
Query: 283 AM--------PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGV 332
P + Q + W + S N + +AWG L + N L I +YELG+
Sbjct: 503 GQSETDSETEPESDAEETESRQPIGWAYMGSHNFTPSAWGTLSGSAFNPTLNITNYELGI 562
>gi|392563164|gb|EIW56343.1| phospholipase D/nuclease [Trametes versicolor FP-101664 SS1]
Length = 641
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 147/344 (42%), Gaps = 54/344 (15%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 99
NWI P L FG H K MLL+Y G +R++V TANL+ DW + +W+QD P +
Sbjct: 261 NWIRTTPFLRNGFGCMHMKFMLLLYRDGRLRVVVSTANLVEYDWRDIENSVWVQDIPKRP 320
Query: 100 Q--NNLSEECGFENDLIDYLSTLKWPEFSANL--PAHGNFKIN--PSFFKKFNFSSAAVR 153
++ F + ++ L L N+ H N + ++FS
Sbjct: 321 SPVTQPADTEDFASAMVRVLHALNVAPALINMLRNDHPNLPLQRLEDLRSHWDFSRVKAA 380
Query: 154 LIASVPGYHTG-SSLKKWGHMKLRTVL--QECTFEKGFKKSPLVYQFSSLGSLDEKWMAE 210
L+ SV G H G + GH +L L E T K K+ L Q SS+G+ W+ E
Sbjct: 381 LVPSVAGKHEGWPKVILTGHTRLMKALLDMEATVPKD-KELALECQGSSIGNYSSMWVNE 439
Query: 211 --LSSSMSSGFSEDKTPLGIGEPL------IVWPTVEDVRCSLEGYAAGNAIPSPQKNVD 262
LS+ S S +TP + I++PT + VR S+ G + G + +K +
Sbjct: 440 FFLSARGESTQSWLETPKTRRAKVPYPAVKILFPTAQYVRESVLGESGGGTMFCRRKQWE 499
Query: 263 -KDFLKKYWAKWKASHTGRSRAMPHIK----TFARYNGQ--------------------- 296
+F ++ + + + + R R + H K TF G
Sbjct: 500 GANFPRQLFHQ---TRSKRGRVLMHSKMILGTFKEKTGTLDGHQRASATRSSEVDTDEDA 556
Query: 297 ---KLA-WFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 334
KLA W + S N + +AWG L + N L I +YELGV+I
Sbjct: 557 GSAKLAGWVYVGSHNFTPSAWGTLSGSGFNPSLNINNYELGVVI 600
>gi|451845752|gb|EMD59064.1| hypothetical protein COCSADRAFT_41609 [Cochliobolus sativus ND90Pr]
Length = 568
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 147/351 (41%), Gaps = 49/351 (13%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---------QGLW 91
N +H PP+ + HSK MLL P+ +RI++ TAN+I DW + ++
Sbjct: 217 NLKIHFPPMDGMIQSMHSKLMLLFGPKKLRIVIPTANMIQTDWGEVANDWQPGVMENSIF 276
Query: 92 MQDFPLKDQNNLSEE---CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 148
+ D P + S + F +L+ +L K PE F+FS
Sbjct: 277 LIDLPRRGNETTSTKENMTRFGQELMYFLEMQKVPEMVLQ------------GILNFDFS 324
Query: 149 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 207
+ + + S+ G H S G + L +Q+ + ++ L Y SSLG++++ +
Sbjct: 325 QTSHLAFVHSIGGSHKTESEHPTGLLGLTRAIQDLHLDN-VEQMELDYAASSLGAINDSF 383
Query: 208 MAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVD-- 262
++ L ++ F+ D P I +PT E V+ S+ G G I Q+ +
Sbjct: 384 LSRLYLAACGRCFAADTAMVPDVRNHIRIYFPTNETVKKSIGGPDCGGIISLSQQRYNAA 443
Query: 263 ---KDFLKKYWAKWKASHTGRSRAMPHIKTF----ARYNGQKLAWFLLTSANLSKAAWGA 315
++ L+ Y + R+ + H K + +G+ + W + SANLS++AWG
Sbjct: 444 TFPRECLRDY-------ESLRAGMLSHNKLLLARGRKKDGRPVGWVYVGSANLSESAWGG 496
Query: 316 LQ----KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 362
+ L IR++E GV++ R VP + G+ E
Sbjct: 497 QKVIKDGKMGSLNIRNWECGVVMTVPDDRLAGLDKDKDKTVPMSVFEGTVE 547
>gi|353240852|emb|CCA72701.1| hypothetical protein PIIN_06638 [Piriformospora indica DSM 11827]
Length = 636
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 149/364 (40%), Gaps = 80/364 (21%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 99
NWI+ P L G H K MLL Y G +R+++ TAN I DW + W+QDFP
Sbjct: 245 NWIMTMPFLRGGRGAMHVKLMLLFYRSGRLRLVLPTANFIDYDWRDIENTAWVQDFPPLS 304
Query: 100 QNNLSEEC---GFENDLIDYLSTLKW-PEFSANLPAHGNFKIN--PSFFKKFNFSSAAVR 153
+ + E F + L L+ L P ++ L H N I K +NF+ AAV+
Sbjct: 305 KPAVGREATSSAFASTLQMVLTKLNVSPALASLLTDHPNLPIKFIGDLGKGWNFTKAAVK 364
Query: 154 LIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGF----KKSP-----LVYQFSSLGSL 203
LI S+ G + G + K GH+ L + + +G KK P + Q SS+G+
Sbjct: 365 LIPSMSGKYEGWDQVLKQGHVSLMKGIMDIGAHRGHTKRDKKKPPEELIVECQGSSIGTY 424
Query: 204 DEKWMAELSSSM----------SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGN 252
+W+ E SS S S K P PL I++P+++ V+ S+ G G
Sbjct: 425 SAQWLQEFYSSCCGISPETWLDKSKASRSKLP---KPPLRILFPSLKTVQSSVLGEDGGG 481
Query: 253 AI--PSPQ---KNVDKDFLKKYWAKWKASHTGRSRAMPHIK-----------------TF 290
+ + Q N +D S++ R + + H K T
Sbjct: 482 TMFCRTSQWEGANFPRDLFYD-------SNSKRGKVLMHTKMILGLWRDSSSDERSSTTL 534
Query: 291 ARYNGQK------------------LAWFLLTSANLSKAAWGALQKN--NSQLMIRSYEL 330
+Y QK W + S N + +AWG L + L I +YEL
Sbjct: 535 RKYAKQKEVLEIDSDDEVEIIDPFAAGWLYVGSHNFTPSAWGTLSGSAFTPVLNITNYEL 594
Query: 331 GVLI 334
G+LI
Sbjct: 595 GILI 598
>gi|406602541|emb|CCH45857.1| Tyrosyl-DNA phosphodiesterase 1 [Wickerhamomyces ciferrii]
Length = 587
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 95/420 (22%), Positives = 169/420 (40%), Gaps = 97/420 (23%)
Query: 53 FGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN 111
+ +HH K ++ +Y V++ + + N+ ++W+ +Q +W KD N S++ F+
Sbjct: 212 YSSHHPKLIINVYNDDTVQLFLVSCNMTFMEWSTNNQMIWQSPRLHKDLN--SKDTVFKT 269
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 171
L +Y+ + P+ + KK++F+S ++S T WG
Sbjct: 270 HLFNYIKNYQKPQLDTLV----------VLLKKYDFNSIIGDFVSSATS--TSDKFGFWG 317
Query: 172 --------------HMKLRTVL-QECTFEKGFKKSPLVYQFSSLGS------LDEKWMAE 210
H K R +L Q + + +P + Q +++ + K+
Sbjct: 318 LYNSLLSKGLIPRKHEKERQLLYQTSSIASAIRHTPTINQSANIFTHLLLPLFSGKYTNH 377
Query: 211 LSSSMSSGFSEDKTPLGIG-------------EPLIVWPTVEDVRCSLEGYAAGN-AIPS 256
S+S F PL G +P I++P++ DVR SL GY +G + +
Sbjct: 378 GRLSISRDF-----PLSNGFISVEQFSKEYKVKPYIIYPSLSDVRNSLFGYGSGGWSHFN 432
Query: 257 PQKNVDK---DFLKKYWAKWKASHTGRSRAMP-HIK--TFARYNGQKLAWFLLTSANLSK 310
P +K DFL + S++ + + P H K + N + L W TS N+SK
Sbjct: 433 PHSKWNKPMNDFLTP--KVFHHSYSQQRKTNPSHTKFLIMSSDNFKTLDWVFFTSTNMSK 490
Query: 311 AAWGALQKNNSQLM------IRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 364
AWG L + +YE G+L+ PS +G G
Sbjct: 491 QAWGTPPTKKDLLSLPPKSNVSNYETGILLCPSD--YGSGI------------------- 529
Query: 365 QIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 424
K + L + + + +YLP + LPP++YS++D PW K + D+ G +
Sbjct: 530 -----KFIPLEFGQEKNLEENEVPIYLP--FRLPPEKYSNQDEPWCVSKSHDLPDILGNL 582
>gi|13543875|gb|AAH06083.1| TDP1 protein [Homo sapiens]
Length = 298
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 58/89 (65%), Gaps = 2/89 (2%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKA 60
D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTHH+K
Sbjct: 207 DVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKM 266
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQG 89
MLL+Y G+R+++HT+NLIH DW+ K+QG
Sbjct: 267 MLLLYEEGLRVVIHTSNLIHADWHQKTQG 295
>gi|452985745|gb|EME85501.1| hypothetical protein MYCFIDRAFT_133255 [Pseudocercospora fijiensis
CIRAD86]
Length = 482
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 163/366 (44%), Gaps = 52/366 (14%)
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFPLKDQNNLSEECGFENDL 113
HSK MLL +P +RI + TANL++ DW Q +++ D P G + L
Sbjct: 152 HSKLMLLFHPHKLRIAIPTANLLNFDWGETGQMENSVFLIDLPRYSD-------GLKASL 204
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGH 172
D S + E + G + KF+FS+ + + +V G H + G
Sbjct: 205 EDLPSFGR--ELMYFIQKQGLDQDVRDGVLKFDFSATRDMAFVHTVGGVHYKDEAARTGL 262
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGE 230
+ L + ++E G S L +F SS+G L+E + +L ++ + +
Sbjct: 263 LGLSSAVRELGLSTG---SDLEIEFAASSIGMLNEAQVNDLHTAARGKPQQSSSTTETST 319
Query: 231 P----LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPH 286
I +PT + VR S G +AG + K+F + + +K++ G + H
Sbjct: 320 ARKNVRIYFPTADTVRSSTAG-SAGTICLQRKYFEAKNFPRDIFRDYKSTRRG---LLSH 375
Query: 287 IKTF-ARYNGQKLAWFLLTSANLSKAAWGAL--QKNNSQLMIRSYELGVLILPSAKRHGC 343
K AR +K+AW + SAN+SK+AWG L +++ +++ R++E GV ILP A++
Sbjct: 376 NKILCARSRKEKVAWVYVGSANMSKSAWGELGAKRDENKITCRNWECGV-ILPVARK--- 431
Query: 344 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 403
V E T+ + LV++ A + V+ L P+E+P + Y+
Sbjct: 432 --------VKDENGDEETDDEGEDEKALVSMN--------AFANVIDL--PFEVPGEEYA 473
Query: 404 SEDVPW 409
+ PW
Sbjct: 474 GRE-PW 478
>gi|395329020|gb|EJF61409.1| phospholipase D/nuclease [Dichomitus squalens LYAD-421 SS1]
Length = 656
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/349 (27%), Positives = 140/349 (40%), Gaps = 63/349 (18%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 99
NWI P L FG H K MLL + G +RI+V TANL+ DW + +W+QD P +
Sbjct: 275 NWIRTTPFLRGGFGCMHMKFMLLFFRTGRLRIVVSTANLVEYDWRDIENTVWVQDVPKRP 334
Query: 100 QNNLSEECGFENDLIDYLSTLKWPEFSANL-PAHGNFKIN----------PSFFKKFNFS 148
++ + D+ S L N+ PA N N ++FS
Sbjct: 335 SPEPADP-----KVEDFASALVRMLHGVNVAPALVNHLKNEYPNLPLQRLEELRTHWDFS 389
Query: 149 SAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSLDEK 206
RLI S+ G H G + GH L L++ E K L Q SS+G+
Sbjct: 390 RVKARLIPSIAGKHEGWPKVILTGHTCLMKSLKDIGAETPKDKDLVLECQGSSVGAYTTA 449
Query: 207 WMAELSSSMS--------SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQ 258
W+ E S G + L + I++PT + VR S+ G G + +
Sbjct: 450 WLNEFYCSARGESAQTWLDGPKSRRAKLPLPPIKILFPTAQYVRDSVLGEVGGGTMFCRR 509
Query: 259 KNVD-KDFLKKYWAKWKASHTGRSRAMPHIK----TF----------------------- 290
K + K+F ++ + + + + R R + H K TF
Sbjct: 510 KQWEGKNFPRELFHQ---TRSKRGRVLMHSKMVLGTFRDKRRKQQTLTDSEDEAEDGRNA 566
Query: 291 ---ARYNGQKLAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 334
+R Q W + S N + +AWG L + N L I +YELGVLI
Sbjct: 567 DSGSRDRQQLAGWVYVGSHNFTPSAWGTLTGSAFNPTLNITNYELGVLI 615
>gi|390595745|gb|EIN05149.1| phospholipase D/nuclease, partial [Punctularia strigosozonata
HHB-11173 SS5]
Length = 622
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/393 (24%), Positives = 156/393 (39%), Gaps = 78/393 (19%)
Query: 21 LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLI 79
+V+ + D T + NWI PPL G H K MLL Y G +R+++ TAN I
Sbjct: 220 VVVVAQPDTTGARSVKEVLPNWIRTTPPLRGGRGCMHMKFMLLFYRTGRLRVVISTANFI 279
Query: 80 HVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH-GNFKIN 138
DW + +W+QD PL+ +++ D+ +T + + N+ A IN
Sbjct: 280 DYDWRDIENTVWVQDVPLR-----QTPIRYDHKATDFPATFERVFKALNVEAALQALTIN 334
Query: 139 -------PS---FFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKG 187
PS K++FS L+ASV G H G + + GH L +++ G
Sbjct: 335 DHPDIPLPSVTDLRTKWDFSKVKAHLVASVAGKHEGWPEVIRNGHTALMKAVRDMGARAG 394
Query: 188 -FKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTV 238
++ L Q SS+G+ +WM E S +ED + L IV+P++
Sbjct: 395 KGREVELECQGSSIGTYSTQWMNEFHYSCRGESAEDWLDQPKTRRAKLPWPPVKIVFPSL 454
Query: 239 EDVRCSLEGYAAGNAI--PSPQKNVDKDFLKKYWAKWKASHTGRSRAMP---HIK----T 289
V+ S G G I S Q +K F ++ + H RS+ P H K T
Sbjct: 455 ATVQASRLGEKGGGTIFCRSNQWQAEK-FPRELF------HDSRSKRGPVLMHSKMVLAT 507
Query: 290 FARYNGQK---------------------------------LAWFLLTSANLSKAAWGAL 316
F GQ + W + S N + +AWG L
Sbjct: 508 FRPKGGQSTLVDSDSETESETESESDEEVKIVEPKERKKKLVGWIYVGSHNFTPSAWGNL 567
Query: 317 QKN--NSQLMIRSYELGVLILPSAKRHGCGFSC 347
+ + I +YE+G+++ ++ + +C
Sbjct: 568 SGSAFGPIMNITNYEIGIVLPLTSGKEADAIAC 600
>gi|378727943|gb|EHY54402.1| tyrosyl-DNA phosphodiesterase 1 [Exophiala dermatitidis NIH/UT8656]
Length = 793
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 110/278 (39%), Gaps = 81/278 (29%)
Query: 233 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK----DFLKKYWAKWKASHTG--------- 279
I++PT ++V SL+GYA+G +I + L+ +W S TG
Sbjct: 515 IIFPTPQNVASSLDGYASGGSIHMKAQAASHLNQISLLRPSLCQWTRSQTGASSSSSLSG 574
Query: 280 RSRAMPHIKTFARYNGQ--------KLAWFLLTSANLSKAAWGALQ-----KNNSQLMIR 326
R A PH+KT+ R+ + + W LLTSANLS AWG ++ + +++++
Sbjct: 575 RHLAAPHVKTYIRFKSKPTTQHPTPDIDWALLTSANLSTQAWGVVREPKDKRKEKEVVVQ 634
Query: 327 SYELGVLILP-----------SAKRHGCG-------------FSCTSN------------ 350
S+E+GVL+ P + K+ G G T+N
Sbjct: 635 SFEIGVLVWPGLFGPEFEDEGTIKQDGAGSGRDARMGTGDYDIKNTTNPSKEDQSQNLNS 694
Query: 351 -------------------IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 391
+ P+ I +G E + + ++ +V +
Sbjct: 695 VHSVRMAPVFGTDMPSQLQLQPANIGTGIVEDGTASGNGNENGNVNEKDVSSTTTTLVGI 754
Query: 392 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRHF 429
+PY+LP Y D+PWS Y D +G+ WPR F
Sbjct: 755 RLPYDLPLTPYVETDMPWSPQGVYEVPDRHGRRWPRDF 792
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 61/136 (44%), Gaps = 37/136 (27%)
Query: 49 LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNL 103
+P +FGTHHSK +L + ++++HTAN++H DW N +Q +W P NN
Sbjct: 209 MPDAFGTHHSKMFILFTHDDLAQVVIHTANMLHRDWTNMTQAVWQSPMLPVLPPTTNNNS 268
Query: 104 SEECG-------------------FENDLIDYLSTLKWPEFSANLPAHGN-FKINPSFFK 143
+ G F++D++ YLS A+G K
Sbjct: 269 TGAKGNQPKSTSTSPIGSIGTGSRFKHDMMAYLS------------AYGTKTKSLREQLV 316
Query: 144 KFNFSSAAVRLIASVP 159
+F+FSS L+ASVP
Sbjct: 317 RFDFSSVRGALVASVP 332
>gi|403372152|gb|EJY85968.1| Tyrosyl-DNA phosphodiesterase [Oxytricha trifallax]
Length = 676
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 145/354 (40%), Gaps = 72/354 (20%)
Query: 52 SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF---PLKDQNNLSEECG 108
S+ HSK +L + +R+IV +ANL DW S W QDF L N +S+
Sbjct: 324 SYSVFHSKLILYEFDDRLRVIVTSANLYTQDWELLSNVTWFQDFFKAELGKNNEISQSST 383
Query: 109 FENDLIDYLSTLKWP-----------------EFSANLPAH------GNFKINPSF---- 141
++ + K P +F L + N K+ F
Sbjct: 384 TQSVKVATKEERKNPFNFNEQRPQQQQQPFQNDFKQYLKDYLEVIIPKNVKVREVFRQKI 443
Query: 142 -FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 200
KF+FS+A LIAS+ G H KK+G +L +++ +K +K+ + YQ SS+
Sbjct: 444 DLDKFDFSTANAFLIASINGRHADREFKKYGQARLGELVRNV--DKQHEKT-ITYQTSSI 500
Query: 201 GSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL-----IVWPTVEDVRCSLEGYAAGNAIP 255
G L+ K+M +SM + F + K + E + +++PT+ V S G ++I
Sbjct: 501 GKLNTKFM----TSMYNQFGKSKK---VSEDIHQNFRVIFPTIGYVSTSHLGPENASSII 553
Query: 256 SPQKNVDKDFLKKYW-------AKWKASHTGRSRAMP----HIKTFARYNGQKLAW---- 300
+ YW K G+S+ + H K + K +
Sbjct: 554 ---------LQESYWYDTPGFPRKSFYRQVGKSKLLDKNLYHTKFMIITDKGKESEITDD 604
Query: 301 --FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIV 352
S N S AWG L+KN+SQ+ I ++ELGV+ P +N+V
Sbjct: 605 TVLYFGSHNFSGGAWGNLEKNDSQISISNWELGVVFGPQVGSQEMKQKMINNMV 658
>gi|440632301|gb|ELR02220.1| hypothetical protein GMDG_01013 [Geomyces destructans 20631-21]
Length = 529
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 163/370 (44%), Gaps = 52/370 (14%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPP-LPISFGTHHSKAM 61
D +W+L +A+ +L+ E ++++ P+N P + T HSK
Sbjct: 115 DQEWILSKLD-MARTKLILIAQAVPRDDQEEVRKSAPSNVRFCFPSNKDETVSTMHSKLQ 173
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
LL +P +R++V +ANL+ DW +++ D P N + EN L +
Sbjct: 174 LLAHPSHLRVVVPSANLVPYDWGETGVMENTVFLIDLPRLAANKV---VSIEN-LTPFCR 229
Query: 119 TLKWPEFSANLPAHG-NFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLR 176
L+ F L A G + KI S K F+FS +A + + S+ G HT + K G+ L
Sbjct: 230 ELR--RF---LKAQGLDSKITDSLLK-FDFSQTAGLAFVHSIGGNHTENDWKTIGYPGLG 283
Query: 177 TVLQECTFEKGFKKSPLVYQF--SSLGSLDEKW-------------MAELS--SSMSSGF 219
+ +QE PL F +S+G+L + + + EL+ +S S +
Sbjct: 284 SAIQELGLAN---TGPLNVTFVSASIGALTDDFVLAILLACKGDDGLTELTWRTSTSPAY 340
Query: 220 SEDKTPLGIGEPL------IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 272
+ T I++P+ E VR S G +G I P+ + F K+ +
Sbjct: 341 RKRTTKEETLLMEMEEGFRIMFPSHETVRTSKNGTNSGGTICLDPKYYQREQFPKELFRD 400
Query: 273 WKASHTG---RSRAMPHIKTFARYNGQK-LAWFLLTSANLSKAAWGALQKNNS----QLM 324
K+ G S+ + T +G + AW + SANLS++AWG L KN S +L
Sbjct: 401 CKSKRAGLLLHSKLLFTAPTHMNADGDRGKAWAYVGSANLSESAWGRLTKNKSTKQVKLY 460
Query: 325 IRSYELGVLI 334
R++E GV+I
Sbjct: 461 CRNWECGVVI 470
>gi|393244923|gb|EJD52434.1| phospholipase D/nuclease [Auricularia delicata TFB-10046 SS5]
Length = 628
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 105/403 (26%), Positives = 156/403 (38%), Gaps = 92/403 (22%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-DGT-LEHMKRNKPANWILHKPPLPISFGTHHSKA 60
DI WLL P +P +LV H + DG L ++ N++L P + G H K
Sbjct: 211 DIPWLLTMFP--DTVPVILVNHPVTPDGNDLTYLS----TNFVLVTPSMQQDSGAMHIKL 264
Query: 61 MLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ----NNLSEECGFENDLID 115
MLL Y G +R+ + TAN I DW + +W+QD P +D L +E F L+D
Sbjct: 265 MLLFYKSGRLRVAIPTANFIQYDWRDIENAVWLQDIPKRDAPTPFAKLPKELDFAAQLVD 324
Query: 116 YLSTLKWPEFSANLPAHG---NFKINPSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWG 171
L L + +G + +++S RL+ S+ G H G + + G
Sbjct: 325 TLRALNVGRAVESQMQNGFAPPLRALDELRMWWDWSKVTARLVPSLKGSHEGWPRVTRVG 384
Query: 172 HMKLRTVLQECTFEK-GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE--------- 221
H L L++ + G K L Q SS+G +W + S SE
Sbjct: 385 HTSLLKALRDLGADTPGSCKLLLECQGSSIGQYTRRWTHQFYRSARGEPSEKFSWIAKQS 444
Query: 222 --DKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA--- 275
D P P+ I++P++ V S+ G G + K WKA
Sbjct: 445 AFDNLPY---PPIKIIFPSLRTVEESVLGKPGGGTMFCDPKT------------WKAPKF 489
Query: 276 -------SHTGRSRAMPHIK----TFAR------------------------------YN 294
S++ R R + H K F R
Sbjct: 490 PRENFFDSNSKRGRVLMHTKMILGIFERDTMFTAKGKRRDDPYDTDDDEVTIVEPKSTKK 549
Query: 295 GQKLA-WFLLTSANLSKAAWGALQKNNSQ--LMIRSYELGVLI 334
+KLA W + S N + AAWG L ++ L IR+YELGV++
Sbjct: 550 REKLAGWLYVGSHNFTPAAWGHLSGSSITPILSIRNYELGVVL 592
>gi|403418586|emb|CCM05286.1| predicted protein [Fibroporia radiculosa]
Length = 1675
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 148/379 (39%), Gaps = 53/379 (13%)
Query: 18 PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTA 76
P+ VI D + + NWI P L G H K MLL Y G +RI++ TA
Sbjct: 1274 PNTPVIAVAQDPEGQETIKTILPNWIKTTPFLRNGMGCMHMKFMLLFYKSGRLRIMISTA 1333
Query: 77 NLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN-- 134
N+I DW + W+QD PL+ +S + E+ + L+ + L +H
Sbjct: 1334 NMIEYDWRDIENTAWVQDVPLRSA-PISHDPKAEDFAAAMVRVLRAISVAPALVSHLRND 1392
Query: 135 -----FKINPSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGF 188
+ F K++FS V L+ S+ G H G + GH L L+
Sbjct: 1393 HPDLPLQRLEEFRMKWDFSKVKVSLVPSIAGKHEGWPKVILAGHTALMKALRNLNAAADK 1452
Query: 189 KKSPLVY-QFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVE 239
K ++ Q SS+G+ +WM E S ++ + L I++PT +
Sbjct: 1453 DKEVILECQGSSIGNYSTQWMNEFHCSARGESAQSWLDVSKARRAKLSFPPVKILFPTSQ 1512
Query: 240 DVRCSLEGYAAGNAIPSPQKNVD-KDFLKKYWAKWKASHTGRSRAMPHIKTF-------- 290
VR S G A G + + + F ++ + + S + R + + H K
Sbjct: 1513 YVRDSALGEAGGGTMFCRRNQWEGAKFPRELFHQ---SRSKRGKVLMHSKMILGMFRSRP 1569
Query: 291 ARYNGQK--------------------LAWFLLTSANLSKAAWGALQKN--NSQLMIRSY 328
+ ++G + W + S N + +AWG L + N L I +Y
Sbjct: 1570 SVFSGSSNRSDSETEDEDDPESDQEKLIGWLYVGSHNFTPSAWGTLSGSAFNPTLNITNY 1629
Query: 329 ELGVLILPSAKRHGCGFSC 347
ELG+++ ++ C
Sbjct: 1630 ELGIVLPLRSEEEANRMVC 1648
>gi|336379126|gb|EGO20282.1| hypothetical protein SERLADRAFT_452973 [Serpula lacrymans var.
lacrymans S7.9]
Length = 620
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 147/375 (39%), Gaps = 61/375 (16%)
Query: 21 LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLI 79
++I + D + + +N NWI P L G H K MLL Y G +R+++ TANLI
Sbjct: 207 VIIVAQPDESGQATIKNVLPNWIRTTPFLRYGRGCMHMKFMLLFYKTGRLRVVISTANLI 266
Query: 80 HVDWNNKSQGLWMQDFPLKDQ---NNLSEECGFENDLIDYLSTLK-WPEFSANLPA-HGN 134
D+ + +W+QD PL+ Q N+ F + L L P + +L H N
Sbjct: 267 DYDYRDIENAIWLQDVPLRPQPLPNDPKAVDNFATVMQRVLHALNVRPALATHLKTDHPN 326
Query: 135 FKINP--SFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKS 191
+ +++S V+L+ S+ G H G + GH +L +++ G K+
Sbjct: 327 LPLQSIDHLRSHWDWSKVKVKLVPSIAGKHEGWPKVILTGHTRLMKAIRDMGLRTGKGKA 386
Query: 192 P----LVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVE 239
+ Q SS+G+ +WM E S +ED + L IV+P+++
Sbjct: 387 AKDLVIECQGSSIGTYSTQWMNEFHWSARGESAEDWLDEPKTRRAKLPYPAVKIVFPSLK 446
Query: 240 DVRCSLEGYAAG----------NAIPSPQ-----------------KNVDKDFLKKYWAK 272
V+ S+ G G N P+ K + F +K
Sbjct: 447 TVQTSVLGEPGGGTMFCRGVQWNGAKFPRQLFHDSNSTAGGVLMHTKMIIGTFKQKATTN 506
Query: 273 WKASHT-GRSR----------AMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN-- 319
SH G+ R N + W L S N + +AWG L +
Sbjct: 507 SLDSHDKGKGRQSDADSDTETETEEDDVVEVVNDAPIGWAYLGSHNFTPSAWGTLSGSGF 566
Query: 320 NSQLMIRSYELGVLI 334
N L + +YELG++
Sbjct: 567 NPILNVVNYELGIVF 581
>gi|336366433|gb|EGN94780.1| hypothetical protein SERLA73DRAFT_171190 [Serpula lacrymans var.
lacrymans S7.3]
Length = 607
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 147/375 (39%), Gaps = 61/375 (16%)
Query: 21 LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLI 79
++I + D + + +N NWI P L G H K MLL Y G +R+++ TANLI
Sbjct: 194 VIIVAQPDESGQATIKNVLPNWIRTTPFLRYGRGCMHMKFMLLFYKTGRLRVVISTANLI 253
Query: 80 HVDWNNKSQGLWMQDFPLKDQ---NNLSEECGFENDLIDYLSTLK-WPEFSANLPA-HGN 134
D+ + +W+QD PL+ Q N+ F + L L P + +L H N
Sbjct: 254 DYDYRDIENAIWLQDVPLRPQPLPNDPKAVDNFATVMQRVLHALNVRPALATHLKTDHPN 313
Query: 135 FKINP--SFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKS 191
+ +++S V+L+ S+ G H G + GH +L +++ G K+
Sbjct: 314 LPLQSIDHLRSHWDWSKVKVKLVPSIAGKHEGWPKVILTGHTRLMKAIRDMGLRTGKGKA 373
Query: 192 P----LVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVE 239
+ Q SS+G+ +WM E S +ED + L IV+P+++
Sbjct: 374 AKDLVIECQGSSIGTYSTQWMNEFHWSARGESAEDWLDEPKTRRAKLPYPAVKIVFPSLK 433
Query: 240 DVRCSLEGYAAG----------NAIPSPQ-----------------KNVDKDFLKKYWAK 272
V+ S+ G G N P+ K + F +K
Sbjct: 434 TVQTSVLGEPGGGTMFCRGVQWNGAKFPRQLFHDSNSTAGGVLMHTKMIIGTFKQKATTN 493
Query: 273 WKASHT-GRSR----------AMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN-- 319
SH G+ R N + W L S N + +AWG L +
Sbjct: 494 SLDSHDKGKGRQSDADSDTETETEEDDVVEVVNDAPIGWAYLGSHNFTPSAWGTLSGSGF 553
Query: 320 NSQLMIRSYELGVLI 334
N L + +YELG++
Sbjct: 554 NPILNVVNYELGIVF 568
>gi|295662314|ref|XP_002791711.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279837|gb|EEH35403.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 589
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 121/304 (39%), Gaps = 87/304 (28%)
Query: 194 VYQFSSLGSLDEKWMAEL--------SSSMSSGF-SEDKTPLGIGEPLIVWPTVEDVRCS 244
+ ++LG D KW+ E S+ S F +E +P I++PT +++R S
Sbjct: 192 ISSVATLGQTD-KWLKETLFNSLSPPSARSSELFKTESNSPANFS---IIFPTPDEIRRS 247
Query: 245 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------------------------- 273
L GY +G +I S + +L+ Y +W
Sbjct: 248 LNGYMSGGSIHMKLQSAAQQKQLQYLRPYLCRWAGDANDDGGVKSAGGPATSKRKRLEGN 307
Query: 274 ------------KASH-----TGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAW 313
K H GR RA PHIKT+ R++ + W ++TSANLS AW
Sbjct: 308 DVSESVQDCAALKKEHRPIREAGRRRAAPHIKTYVRFSDTDMTTIDWAMVTSANLSLQAW 367
Query: 314 GALQKNNSQLMIRSYELGVLILPS------------AKRHGCGFSCTSNIVPSEIKSGST 361
GA ++ I SYE+GVL+ P G G + + SG+
Sbjct: 368 GAAANAKKEIRICSYEIGVLVWPDLFVDEEIDDSDEPLTKGKGKDNSRREI-----SGNK 422
Query: 362 ETSQIQKTKLVTL----TWHGSSDAGASSE--VVYLPVPYELPPQRYSSEDVPWSWDKRY 415
T ++ +V + +A SS+ +V +PY+LP Y+++D PW Y
Sbjct: 423 NTKDVKTAVMVPCFKRDMPEAAENAARSSDTTLVGFRMPYDLPLHSYTAKDQPWCATATY 482
Query: 416 TKKD 419
++ D
Sbjct: 483 SEPD 486
>gi|281210780|gb|EFA84946.1| hypothetical protein PPL_01939 [Polysphondylium pallidum PN500]
Length = 493
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 138/311 (44%), Gaps = 44/311 (14%)
Query: 43 ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP----LK 98
I+H P L G HSK +LL Y + +R+++ ++NL DW Q +++ D P
Sbjct: 134 IIHPPLLVSQIGILHSKIILLEYQQIIRVVISSSNLTGSDWEVLGQTIFIVDIPRIKKNN 193
Query: 99 DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLI 155
N + F+ +L+D LS+L + + + +N +F+FS + ++
Sbjct: 194 IDNINDNKDQFKYELVDILSSLGFTD---------DHIVNA--LDQFDFSMIHQHGIHIV 242
Query: 156 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 215
+S+PG + S K+G KL ++ E + K+ VYQ S++G +W++
Sbjct: 243 SSIPGVY---SHNKYGLSKLASLASEY---QSTSKATAVYQSSAIGMTSREWLSSF---- 292
Query: 216 SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL--KKYWAKW 273
K +G + +PT+ + + G + DKD L K +K
Sbjct: 293 -------KAAIGTDNLTLPFPTLNTIDEMITYNPLGATESVTIRYHDKDLLLSNKMLSKL 345
Query: 274 KASHTGRSRAMPHIKTFARY---NGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRS 327
+ ++ + I + + + + L W S N ++A+WG++ K S + I +
Sbjct: 346 QYNNERDPKVDNSITNLSSHPPLHSKVLITDRWIYHGSHNFTEASWGSISKRQSTIKISN 405
Query: 328 YELGVLILPSA 338
+E GV I P+A
Sbjct: 406 FETGVFI-PTA 415
>gi|322701752|gb|EFY93501.1| tyrosyl-DNA phosphodiesterase, putative [Metarhizium acridum CQMa
102]
Length = 267
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 74/158 (46%), Gaps = 20/158 (12%)
Query: 270 WAKWKASHTGRSRAMPHIKTFARYNGQ-KLAWFLLTSANLSKAAWGALQKNNSQLMIRSY 328
W + S+T + T+ RYN + + W +LTSAN+SK AWG ++ + +L + S+
Sbjct: 126 WVIYDPSYTTGPTTVQTALTYIRYNEKGSIDWAMLTSANISKQAWGEAERPSGELRVASW 185
Query: 329 ELGVLILPSAKRHGCGFSCT-SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 387
E+GVL+ P T + VP E K S GA
Sbjct: 186 EIGVLVWPGLVGQDVSMVGTFQSDVPKEPKE------------------QADSKTGAGGV 227
Query: 388 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 425
++ + +PY LP QRY + +VPW ++ + D +G+ W
Sbjct: 228 LIGVRIPYSLPLQRYGAGEVPWVATMKHGEPDRFGRQW 265
>gi|406860446|gb|EKD13504.1| tyrosyl-DNA phosphodiesterase domain protein [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 669
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 135/322 (41%), Gaps = 45/322 (13%)
Query: 22 VIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 81
V+ ++D +++ PAN+ P + + HSK LL +P +R++V +ANL
Sbjct: 247 VLQAKTDAERQNISSKAPANYRFCFPNMEGNINCMHSKLQLLAHPTHLRVVVPSANLTSY 306
Query: 82 DWNNK---SQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN 138
DW ++ D P + F N+L+ ++ + + +A
Sbjct: 307 DWGETGIMENICFLIDLPRLPPGEKTVVTNFANELVYFVEQMGLDQKTA----------- 355
Query: 139 PSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF 197
+ + F+FS A + + S+ G H+GS+ K+ G+ L T +++ + + +
Sbjct: 356 -TSLQNFDFSRTAHLAFVHSIGGSHSGSTWKRTGYCGLGTAIKKLGMATEVDLN-IEFLS 413
Query: 198 SSLGSLDEKWMA--ELSSSMSSGFSE-----DKTPLGIGEPL--------------IVWP 236
+S+GSL++ +M L++ G +E +K G I +P
Sbjct: 414 ASIGSLNDSFMECLYLAAQGDDGATEYRWRTEKPTKSKGRSAAEHKLLGNVNSNCRIYFP 473
Query: 237 TVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNG 295
T E V S G G I K D D F +K K+ G M + FAR
Sbjct: 474 TKETVEASRGGVTGGGTICLQSKWFDSDTFPRKLMRDCKSVRKGI--LMHNKMIFARARD 531
Query: 296 QK----LAWFLLTSANLSKAAW 313
QK +AW + S NLS++AW
Sbjct: 532 QKQYPKIAWAYVGSHNLSESAW 553
>gi|328769090|gb|EGF79135.1| hypothetical protein BATDEDRAFT_90149 [Batrachochytrium
dendrobatidis JAM81]
Length = 554
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 106/478 (22%), Positives = 189/478 (39%), Gaps = 118/478 (24%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWIL-HKPPLPISFGTHHSKA 60
+D DWL C V + + + E + + N IL P + +G H K
Sbjct: 124 IDDDWL---CDVFPSTIKICLARPKPKMVPESVDKLPVTNNILWVFPKMSAGYGAMHIKF 180
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD----QNNLSEECGFENDLIDY 116
LL YP+ +R+++ +ANL+ DW ++ QDFP+ + Q+ SE + ++
Sbjct: 181 QLLWYPKFLRVVITSANLMPHDWQELENVVFYQDFPILNSRVRQSQHSETASSSTN--EF 238
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL--KKWGHMK 174
TL S N+P + +K +FS A L+ S+PG H +S+ +++G M
Sbjct: 239 SKTLYNLLVSMNIPQSVIASV-----QKHDFSKALGMLVVSLPGKHDATSMETRQFGSMG 293
Query: 175 LRTVLQECT--FEKGFKKSPLVYQFSSLGSLDEKWMAELSS------------SMSSGFS 220
L T Q + F +++ + Q +S+GS W+ + S S++S F+
Sbjct: 294 LCTASQVISRQFRFDLEQAIVCMQTASMGSTHPAWLRYMLSAFRGQDVIPETPSLASFFT 353
Query: 221 EDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI-------PSPQKNVDKDFLKK---- 268
+ + + EP+ I++P+ V S G G I + +++ +D + K
Sbjct: 354 QSMSSI---EPITILFPSRRTVETSRNGIPGGGTIFFSSKFWSTFPRHIIRDGVSKTQGI 410
Query: 269 -------------YWAKWKASHTGRSRAMP-HIKTFARYNGQKL-----AWFLLTSANLS 309
Y S ++P H + A + KL + S N +
Sbjct: 411 LMHSKINVVIGIGYIDLLATSQQLDIVSVPIHTQDNAHDHNTKLEKEIHGYIYCGSHNAT 470
Query: 310 KAAWG-----------------ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIV 352
+AAWG ++Q + Q+ I+++ELG+L LP R C
Sbjct: 471 QAAWGSVPVMRSSVSTSSQSCKSIQHGHLQVEIKNWELGIL-LPFRIRDVC--------- 520
Query: 353 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 410
S G + ++ ++ +P+E PP +Y D P+S
Sbjct: 521 -------------------------SHSSVGFNPDLSFV-LPFEYPPAKYGPTDKPFS 552
>gi|320165097|gb|EFW41996.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 545
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/327 (23%), Positives = 144/327 (44%), Gaps = 61/327 (18%)
Query: 54 GTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FE 110
G H + MLL + +R+ V +A+L+ DW + QDFP++ + E G F+
Sbjct: 190 GRLHGRLMLLFHGSDTLRVAVTSASLVPSDWGVLENVTYYQDFPIEAKRPTVTERGLAFQ 249
Query: 111 NDLIDYLSTL-----KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 165
+ L++Y++ L K + PA + K NF + RLI+S P + S
Sbjct: 250 STLMNYVTQLVAHQPKDDDVDDRHPARAARILKE--LKTVNFDTVEARLISSYPEH---S 304
Query: 166 SLK----KWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGF 219
+L+ + G M L LQ T SP++YQ SS+G + + W+ + +++ ++G
Sbjct: 305 NLETNGCRQGLMALEQALQAEYSTLPAQVLNSPIIYQSSSIGQVSDPWVTQFATACNAGA 364
Query: 220 SEDKTPLGIGEPL-----------IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK 268
+ G P ++PT V +L+G+ G+ P + F +
Sbjct: 365 PARISGESRGSPFAIDPADALKLQFIFPTTATVSQALQGFPEGH----PHR---LHFFPR 417
Query: 269 YWAK---------WKASHTGRSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWG-AL 316
Y++ +++ H +P+ K R ++ + + ++ S +L +WG
Sbjct: 418 YFSSTFPRGSLFDYQSKH---GNVLPNSKVLLRVPDEQSTIGYAVIGSHSLGIGSWGNGA 474
Query: 317 QKNNSQL---------MIRSYELGVLI 334
++S+L M+R++EL VLI
Sbjct: 475 VSSDSKLGAKATSKPRMMRNFELSVLI 501
>gi|302695723|ref|XP_003037540.1| hypothetical protein SCHCODRAFT_47163 [Schizophyllum commune H4-8]
gi|300111237|gb|EFJ02638.1| hypothetical protein SCHCODRAFT_47163 [Schizophyllum commune H4-8]
Length = 646
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 148/372 (39%), Gaps = 73/372 (19%)
Query: 20 VLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANL 78
V+++ DG + +N NWI P L +G H K MLL Y G +R+ + TANL
Sbjct: 240 VIIVQQTKDG--DASIKNWLPNWIRASPFLRNGYGCMHMKFMLLFYKTGRLRVYIPTANL 297
Query: 79 IHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN------DLIDYLSTLKWPEFSANLPAH 132
+ D+ + W+QD P + + + E+ +++ L+ + +P H
Sbjct: 298 VQYDYRDIENFAWLQDIPRRPAHKPEPKPNPEDFPSIMQRVLEALNIRPAQLETNTIPQH 357
Query: 133 GNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFK 189
N + + +++S V L+AS+ G + G S+ + GH +L ++ +
Sbjct: 358 PNLPLQSISDLRRLWDWSLVKVHLVASLHGKYEGWPSVLQVGHPRLMKAVRNMGLAVDKE 417
Query: 190 KSPLVY-QFSSLGSLDEKWMAELSSSM----------SSGFSEDKTPLGIGEPLIVWPTV 238
+ V Q SS+G W+ E+ SM ++ + TPL + + IV+PT
Sbjct: 418 REVEVECQGSSIGRCTSVWINEMYGSMRGQSAREWLDATKKRREATPLPLVK--IVYPTK 475
Query: 239 EDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKAS-------HTGRSRAMP---HIK 288
V + G G I F ++ A W+A H +S P H K
Sbjct: 476 ATVHATAWGVNGGGTI----------FCRR--ATWEAKNFPRQLFHDSKSTGGPVLMHTK 523
Query: 289 TFARYNGQK------------------------LAWFLLTSANLSKAAWGALQKN--NSQ 322
K L W + S N +++AWG L + N
Sbjct: 524 LIEAKTSAKPSTTSTNNNDINSTIDDIEVVHPALGWVYVGSHNFTQSAWGTLSGSGFNPV 583
Query: 323 LMIRSYELGVLI 334
L + +YELGV+
Sbjct: 584 LNVTNYELGVVF 595
>gi|403173802|ref|XP_003332829.2| hypothetical protein PGTG_14494 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170701|gb|EFP88410.2| hypothetical protein PGTG_14494 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 583
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 87/365 (23%), Positives = 142/365 (38%), Gaps = 63/365 (17%)
Query: 20 VLVIHGESDGTLEHMKRNKPANWILHKPPL------PISFGTHHSKAMLLIYPRGVRIIV 73
++VI +D K N+ AN L PP+ G H K ++ Y R+ +
Sbjct: 193 IMVIRHHTD--CGSFKVNERANMFLCHPPMLKTANGNAKAGCMHIKFFIIFYDNFCRVAI 250
Query: 74 HTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK--WPEFSANLPA 131
TAN + D+ +W+QDF N + +D+ + TL LP
Sbjct: 251 PTANAVSFDYEFVENAIWIQDFRRFSGNTIGYNSRRSDDVPPFRKTLDDLLDRMGVPLP- 309
Query: 132 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM--KLRTVLQECTFEKGFK 189
F+ K +F SAA L+ S+ G H +S H+ +L+T+ + G +
Sbjct: 310 ---FR---KPLKDHDFGSAAANLVVSIQGTHPANSPMGQAHLAEQLKTLGLQSGPGTG-R 362
Query: 190 KSPLVYQFSSLGSLDEKWMAEL----SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 244
+ L Q SS+GS D KW+ S S + +ED PL +++PT+ VR S
Sbjct: 363 TATLECQGSSIGSYDLKWLNNFYRCASGSPPTASTEDPDLQTKTPPLTVLYPTLHTVRNS 422
Query: 245 LEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKTF------------- 290
G A + + +K +F +A + TG + H+K
Sbjct: 423 HSGKAGAGTLFCNKATWEKANFPTHIFADTMSKRTG---VLMHVKMILGLFNSDSSAKST 479
Query: 291 ----------------ARYNGQKLAWFLLTSANLSKAAWGALQ-----KNNSQLMIRSYE 329
R N + + S N + AAWG +++ L I ++E
Sbjct: 480 SSTLDTASVEKSGARDGRINKDHAGFLYIGSHNFTPAAWGKFNLKSGSDDSTSLEISNWE 539
Query: 330 LGVLI 334
LGV++
Sbjct: 540 LGVVL 544
>gi|298705565|emb|CBJ28816.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 947
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTL-----EHMKRNKPANWILHKPPLPISFGT 55
+VD ++LL A P L +P +L+ + D L +KR PA + P I G
Sbjct: 216 LVDAEFLLNAAPRLKTVPFLLIQGIKEDKPLVVSMKAFLKREHPAAVVYL--PKTIHIGL 273
Query: 56 HHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 98
HHSK +LL Y GVR+++ T N+ DW + Q W QDFP K
Sbjct: 274 HHSKMILLKYKTGVRVVIMTCNMRPDDWGGRCQAAWYQDFPFK 316
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 22/113 (19%)
Query: 109 FENDLIDYLSTLKWPE--FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 166
FE LIDY + P + +L A ++FSSA V LI SVPG H G
Sbjct: 423 FEEILIDYFEHVGGPAAVWGRSLSA-------------YDFSSANVTLIPSVPGRHKGRD 469
Query: 167 LKKWGHMKLRTVL--QECTFEKGFKKSPLVYQFSSLGSLDE---KWMAELSSS 214
L ++GHM++R VL +E G + + +Q +S+ +L KW+ E++ S
Sbjct: 470 LYRYGHMRVRAVLAREEVHVRPGSHR--VAFQAASIMNLSRRPYKWLGEITES 520
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 65/164 (39%), Gaps = 59/164 (35%)
Query: 233 IVWPTVEDVRCSLEGYAAGNAIP----------------SPQKNVDKDFLKKYWAKWK-A 275
+VWPT E VR S G+ +G +P + Q N + LK W A
Sbjct: 658 VVWPTEEAVRTSNLGWESGAGMPCLTTTLYEGGYRKCETNYQLNRVMEELKPLLCTWTGA 717
Query: 276 SHTGRSRAMPHIKTFARY------------NGQKLAWFLLTSANLSKAAWGALQKNN--- 320
R AMPH+ T+ RY + LA+FLL S +L + AWG L+ N
Sbjct: 718 KGMDRGNAMPHLNTYYRYRELPRTDGSLKMSKDGLAYFLLASHSLHRIAWGYLEHRNPPQ 777
Query: 321 ---------------------------SQLMIRSYELGVLILPS 337
+QL I+S+++GV+ LPS
Sbjct: 778 RPRKRRVRMKPIYPPKPENTLPYKEEEAQLDIKSFDMGVMFLPS 821
>gi|189207467|ref|XP_001940067.1| tyrosyl-DNA phosphodiesterase domain containing protein
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187976160|gb|EDU42786.1| tyrosyl-DNA phosphodiesterase domain containing protein
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 564
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/319 (24%), Positives = 138/319 (43%), Gaps = 41/319 (12%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN---------KSQGLW 91
N LH PP+ + HSK MLL +RI + TAN+ DW ++
Sbjct: 213 NMKLHFPPMDGMIVSMHSKFMLLFGKEKLRIAIPTANMTQTDWGEVGNDWQPGVMENSVF 272
Query: 92 MQDFPLKDQNN---LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 148
+ D P + + + E F DLI + LK + + + KF+F+
Sbjct: 273 VIDLPRRSDDGVGKVEELPSFGRDLIFF---LKAQQVESRVTGG---------VLKFDFA 320
Query: 149 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 207
+ + S+ G H + G L ++E ++ + L Y SSLG++++ +
Sbjct: 321 DTKHLAFVHSIGGSHKEELERPTGLPGLANAVRELQYDD-VEHIELDYAASSLGAINDTF 379
Query: 208 MAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 264
++ + ++ F++D P I +PT E V S+ G N I +K +
Sbjct: 380 LSRIHLAARGKNFTQDNAAVPDVRDHFRIYFPTNETVEKSIGGSGCANIISLSKKYYNAS 439
Query: 265 -FLKKYWAKWKASHTGRSRAMPHIK-TFA---RYNGQKLAWFLLTSANLSKAAWGALQKN 319
F K+ + ++ G + H K FA R +G+ AW + SAN+S++AWG +
Sbjct: 440 TFPKECLRDYDSTRRG---MLSHNKLLFARGRRTDGRPFAWVYVGSANISESAWGGQKVL 496
Query: 320 NS----QLMIRSYELGVLI 334
S L +R++E GV++
Sbjct: 497 KSGKVGALNVRNWECGVIV 515
>gi|270017231|gb|EFA13677.1| hypothetical protein TcasGA2_TC001393 [Tribolium castaneum]
Length = 416
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 126/291 (43%), Gaps = 35/291 (12%)
Query: 1 MVDIDWLLPACPV--LAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 58
M+DI WL+ L K P ++ E E +++ P N H + FG HHS
Sbjct: 131 MIDIMWLMERYRERNLGKKPLTILYGDEFPKMKEFIEKFLP-NVSHHYVKMKDPFGCHHS 189
Query: 59 KAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEND 112
K + Y +R+++ TANL + DWN+ +QGLW+ P E GF++
Sbjct: 190 KIGIYFYEDNSLRVVISTANLYYEDWNHYNQGLWLSPPCPQLPETATEKSGESPTGFKSS 249
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
L++YL NLP K + K+ +FS+ V L+ SVPG H + H
Sbjct: 250 LLNYLK-------HYNLPV---LKPWIDYVKRADFSAVRVFLVTSVPGKHYPGTQGSHVH 299
Query: 173 MKLRTVLQECTFEKGFKKSP---------LVYQFSSLGSLDEKWMAELSSSMSSGFSEDK 223
+ + C+ K P ++ Q SS+GS+ + L S++ S K
Sbjct: 300 HVGDLLSRHCSLPA--KTGPDSEGPLSWGIIAQASSIGSMGKSPAEWLRSTLLRSLSGHK 357
Query: 224 TPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKY 269
+ I++P+V++V G +G +P S Q N + +L+ Y
Sbjct: 358 QTQLVSNSNATLSIIFPSVDNVMNGYFGAESGGCLPYSKQTNEKQRWLQSY 408
>gi|393219182|gb|EJD04669.1| phospholipase D/nuclease [Fomitiporia mediterranea MF3/22]
Length = 583
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/278 (27%), Positives = 122/278 (43%), Gaps = 41/278 (14%)
Query: 4 IDWLL----PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 59
+DWL P P+ VLV DG +K P N ++ KP + G H K
Sbjct: 148 VDWLYDFFEPTTPI------VLVNQPGEDGN-SGLKELAP-NILMTKPFIRNGRGCMHIK 199
Query: 60 AMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
+LL Y G +RI + TAN + DW + W+QD P++ + D+
Sbjct: 200 ILLLFYKDGRLRICLPTANFVEYDWRDIENTAWVQDVPMRKTT-----IRHDPKAADFPG 254
Query: 119 TLKWPEFSANLPA------HGNFKINP-----SFFKKFNFSSAAVRLIASVPGYHTG-SS 166
TL+ N+PA GNF P ++++S V+L+AS+ G + G
Sbjct: 255 TLQRVLHKLNVPAALTKLLDGNFPELPIEALSELRMRWDWSKVKVKLVASLAGKYEGWDE 314
Query: 167 LKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE--- 221
+++ GH L +QE T KG K+ L Q SS+G+ +WM E+ S ++
Sbjct: 315 VERTGHPALAKAIQELGVTPPKG-KELVLECQGSSIGTYSRQWMDEIYCSAKGQSAKAWL 373
Query: 222 ---DKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI 254
+ + PL I++P++ V+ S+ G G +
Sbjct: 374 NKPRSQRMKLAWPLIKILFPSLATVKDSVLGMPGGGTM 411
>gi|392580440|gb|EIW73567.1| hypothetical protein TREMEDRAFT_70993 [Tremella mesenterica DSM
1558]
Length = 758
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 112/467 (23%), Positives = 177/467 (37%), Gaps = 119/467 (25%)
Query: 1 MVDIDWLLPACPVLAKIPHVLV------IHGESDGTLEHMKRNKPANWILHKPPLPISFG 54
++D DWL P K+P V+V +H +G ++ + + P + G
Sbjct: 345 VLDDDWLSGILPDPQKVPTVIVRPHPKEMHSTYNGKVQAQVTGE----VFCYPLMLDERG 400
Query: 55 THHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECGFEND 112
H K + Y G +R+++ TAN + DW+ ++QDF P K + G D
Sbjct: 401 AAHMKYAWIFYKTGRLRVMISTANFVPYDWDWIENTTFVQDFLPRKPTSPAPTTKG--ED 458
Query: 113 LIDYLSTL--------------KWPEFSANLPAH--GNFKINPSFFKKFNFSSAAVRLIA 156
+ + +L + ++LP G F+ K+++S +VRLI
Sbjct: 459 FVAHFRSLFIHLKVHKALRYLKDQHKAGSDLPPQVSGAFE----GLDKYDWSRVSVRLIM 514
Query: 157 SVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKW---MAE 210
SV GYH G K+G +L VL++ + K LV +F SSLG + +W +
Sbjct: 515 SVAGYHHGYDQADKYGMTRLGKVLKDEGLVQS-KGERLVAEFQGSSLGQYNIEWYNTFYQ 573
Query: 211 LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 269
L + D PL I++P++ V S G G + K F
Sbjct: 574 LCTGKDVRALVDHPKYKDWPPLKIIFPSLATVEASELGKDGGGTM-----FCGKAFTANT 628
Query: 270 WAKWKASHTGRSRAMPHIK----TFARY------------NGQKLA----------WFLL 303
+ S + R + H K TF +G++ A W +
Sbjct: 629 KHLFHHSESKRGGVLMHTKMLIGTFEPIPRSLGFTSVDCKSGKRKASEMEESPYGGWIYV 688
Query: 304 TSANLSKAAWGALQKNNSQLMIRSYELGVLI-LPSAKRHGCGFSCTSNIVPSEIKSGSTE 362
S N S AAWG + +L IR+YELG+L LP K
Sbjct: 689 GSHNFSAAAWGTMNFKEKRLTIRNYELGILFPLPRDK----------------------- 725
Query: 363 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
A A +++V PY+ P ++YSS D+PW
Sbjct: 726 -------------------ARAMADIV---APYKRPARQYSSNDIPW 750
>gi|307109628|gb|EFN57866.1| hypothetical protein CHLNCDRAFT_143336 [Chlorella variabilis]
Length = 213
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 71/139 (51%), Gaps = 21/139 (15%)
Query: 284 MPHIKTFARY----NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP--- 336
MPH K + R+ +G ++AW + S NLSKAAWG L+ + SQL I SYELGVL+LP
Sbjct: 1 MPHSKAYLRWSHGDHGPEIAWCYVGSHNLSKAAWGCLELDASQLHICSYELGVLLLPRLE 60
Query: 337 SAKRHG--CGFSCTSN------IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 388
+A R CGFSCT ++ + + + L W D+ A+ V
Sbjct: 61 AAYRTSRWCGFSCTGGQPGAAAPRLAQAAAAAGAAGTAAVPSVRFLQWR-QGDSQAAEMV 119
Query: 389 -----VYLPVPYELPPQRY 402
V LPVP+ LPP Y
Sbjct: 120 QGQLGVPLPVPFHLPPVPY 138
>gi|294896960|ref|XP_002775774.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
gi|239882085|gb|EER07590.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
Length = 201
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 85/175 (48%), Gaps = 23/175 (13%)
Query: 54 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--------PLKDQNNLSE 105
GT H+K +++ + +R+ + ++N+ DW SQ +W+ DF P + +
Sbjct: 1 GTMHAKLIIIERAQALRVCISSSNVTPQDWEGVSQCIWVADFKPANDPEAPARKRVKPDH 60
Query: 106 ECGFENDLIDYLSTLKWPEFSANLP---AHGNFKINPSFFKKFNFS-SAAVRLIASVPGY 161
F + L ++ T F ++P + ++ + +FN V LIAS PGY
Sbjct: 61 TSDFGDQLARFIET-----FFRSIPDSSSLSSYWVKVLTGSRFNVKLPKGVELIASAPGY 115
Query: 162 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 216
G WGHM+LR +L + E+ +++Q SS+G L ++A+LS S++
Sbjct: 116 WKGDDRDNWGHMRLRALLSDVHSEE------ILFQCSSIGFLPASFLADLSKSLN 164
>gi|340518445|gb|EGR48686.1| predicted protein [Trichoderma reesei QM6a]
Length = 534
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 103/463 (22%), Positives = 173/463 (37%), Gaps = 116/463 (25%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +W+L + ++ +L+ + + M+ PAN PP+ G HSK L
Sbjct: 127 DEEWMLSKLDI-SRTKLLLLAFAKDEAQKNQMRGIVPANIKFCFPPMH-GVGAMHSKLQL 184
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEE--CGFENDLIDYL 117
L YP +R+++ T NL+ DW +++ D P + + + F +L+ +L
Sbjct: 185 LKYPNRLRVVIPTGNLVPYDWGETGVMENMVFLIDLPRLENPATTPQSPTAFYTELVYFL 244
Query: 118 STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLR 176
A G + ++FS ++ + + ++PG HTG + ++ G+ L
Sbjct: 245 Q------------ATGVGDKMVASLSNYDFSKTSDIAFVHTIPGSHTGKAAERTGYCGLG 292
Query: 177 TVLQECTFEKG-------FKKSPLVYQFSSLGSLDEKWMAEL----------------SS 213
+ + ++ +SLG+L+ +++ + S
Sbjct: 293 ASVAALGLASAEPVEVDLLARCGDLHCCASLGALNHEFIEAIYNACRGRDGIEDFKNKSG 352
Query: 214 SMSSGFSEDKTPLGIGEP------LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLK 267
+ SS K P I +PT V S G AG I
Sbjct: 353 AASSRSKAAKKPDEAASKELQERFRIYFPTERTVAGSRGGRNAGGTI------------- 399
Query: 268 KYWAKWKASHT----------GRSRAMPHIK-TFARYNG------QKLAWFLLTSANLSK 310
AKW S T R R + H K F R G Q+ W + SANLS+
Sbjct: 400 CVQAKWWRSPTFPTELVRDVIARDRLLVHSKMIFVRRVGHDQTTQQRPGWAYVGSANLSE 459
Query: 311 AAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI 366
+AWG L ++ S ++ R++E GV ILP +
Sbjct: 460 SAWGRLSRDRSTKAIKMNCRNWECGV-ILP-----------------------------V 489
Query: 367 QKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
++K V + G A + V PVP ++P Y+S D PW
Sbjct: 490 PESKAVDMARAGGDMAMFAGTV---PVPMQVPGPAYASSDRPW 529
>gi|330927762|ref|XP_003301988.1| hypothetical protein PTT_13657 [Pyrenophora teres f. teres 0-1]
gi|311322883|gb|EFQ89910.1| hypothetical protein PTT_13657 [Pyrenophora teres f. teres 0-1]
Length = 572
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 153/359 (42%), Gaps = 43/359 (11%)
Query: 31 LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----- 85
L+ ++ N LH PP+ + HSK MLL +RI + TAN+ DW
Sbjct: 211 LQEWAESRVPNMRLHFPPMDGMIVSMHSKFMLLFGKEKLRIAIPTANMTPTDWGEVGNDW 270
Query: 86 ----KSQGLWMQDFPLKDQNNLSEECG---FENDLIDYLSTLKWPEFSANLPAHGNFKIN 138
+++ D P + + + + F DL+ + LK E + K+
Sbjct: 271 QPGVMENSVFLIDLPRRSDDGVGKVEDLPPFGRDLVFF---LKAQEVGS--------KVT 319
Query: 139 PSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF 197
KF+F+ + + S+ G H S + G L ++E ++ + L Y
Sbjct: 320 DGVL-KFDFADTKHLAFVHSIGGSHKEESERPTGLPGLANAVRELQYDD-VEHLELDYAA 377
Query: 198 SSLGSLDEKWMAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI 254
SSLG++++ +++ + ++ F++D P I +PT + V S G N I
Sbjct: 378 SSLGAINDTFLSRIYLAARGKSFTKDNAVVPDVRDHIRIYFPTNDTVEKSTGGPDCANII 437
Query: 255 PSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK-TFA---RYNGQKLAWFLLTSANLS 309
+K + F K+ + ++ G + H K FA R NG+ AW + SAN+S
Sbjct: 438 SLSRKYYNASTFPKECLRDYVSTRRG---MLSHNKLLFARGRRTNGKPFAWVYVGSANIS 494
Query: 310 KAAWGALQKNNS----QLMIRSYELGVLI-LPSAKRHGCGFSCTSNIVPSEIKSGSTET 363
++AWG + S L +R++E GV++ +P K + + P + G+ E
Sbjct: 495 ESAWGGQKVLKSGKVGALSVRNWECGVMVPVPDDKLEQVDLKADA-VPPMSVFEGTVEV 552
>gi|261190935|ref|XP_002621876.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis SLH14081]
gi|239590920|gb|EEQ73501.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis SLH14081]
Length = 696
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 117/462 (25%), Positives = 191/462 (41%), Gaps = 85/462 (18%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNKPANWI-LHKPPLPISFGTHH 57
M ++DW+ + K L+I GE D E K + L PP+ H
Sbjct: 262 MWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTKSMGSVRLCFPPMEPQVNCMH 319
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ------DFPLKDQNNLSEECG--F 109
SK MLL +P +RI V +ANL+ DW QG M+ D PLK +L+ G F
Sbjct: 320 SKLMLLFHPSYLRIAVPSANLVPFDWGE--QGGVMENIVFLIDLPLKSP-DLANGPGTSF 376
Query: 110 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFSSAA-VRLIASVPGYHTGS 165
+DL+ +L ++NL + KK F+FS+ + + ++ G HT
Sbjct: 377 LDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFSATKDIAFVHTIGGSHTDP 421
Query: 166 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE--LSSSMSSGFSE-- 221
+K G L + + + + L Y SS+GSL+E+++ L++ SG E
Sbjct: 422 KWRKTGLCGLGSAITALGLQT-TRDINLDYVTSSVGSLNEQFLRSMYLAAQGDSGLKELT 480
Query: 222 ----------------DKTPLG--IGEPLIVWPTVEDVRCSLEGYAAGNAI--------- 254
+T G + +V+P+++ VR S G I
Sbjct: 481 LRTSKRFPSENWGVVTKRTDGGKWKDKFRVVFPSLDTVRKSKGGAENAGTICFQSKWYNS 540
Query: 255 PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWG 314
+ K++ +D + + + R I + + + W + SANLS++AWG
Sbjct: 541 ATFPKDIMRDNISRREGLLMHNKILFVRPEKPITSLKDNSTRYAGWAYVGSANLSESAWG 600
Query: 315 ALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTK 370
L + S +L R++E GV+I RH +S +PS +G T T K
Sbjct: 601 RLVLDRSTTKPKLNCRNWECGVVI---PIRHNDAGKLSS--IPS---TGRTAT---LLAK 649
Query: 371 LVTLTWHGSSDAGASSEVVY---LPVPYELPPQRYSSEDVPW 409
+ +SD G+ V+ +PVP +P RY + P+
Sbjct: 650 SESEDSSANSDDGSEVTTVFEPTIPVPMIVPAPRYHGRNRPF 691
>gi|403159950|ref|XP_003320511.2| hypothetical protein PGTG_02533 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169349|gb|EFP76092.2| hypothetical protein PGTG_02533 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 573
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 86/365 (23%), Positives = 143/365 (39%), Gaps = 63/365 (17%)
Query: 20 VLVIHGESDGTLEHMKRNKPANWILHKPPLPISF------GTHHSKAMLLIYPRGVRIIV 73
++VI +D K N+ AN L PP+ + G H K ++ Y R+ +
Sbjct: 183 IMVIRHHTD--CGSFKVNERANMFLCHPPMLKTANGNAKPGCMHIKFFIIFYDNFCRVAI 240
Query: 74 HTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK--WPEFSANLPA 131
TAN + D+ +W+QDF N + +D+ + TL LP
Sbjct: 241 PTANAVSFDYEFVENAIWIQDFRRFSGNTIGYNSRRSDDVPPFRKTLDDLLDRMGVPLP- 299
Query: 132 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM--KLRTVLQECTFEKGFK 189
F+ + +F SAA L+ SV G H +S H+ +L+T+ + G +
Sbjct: 300 ---FR---KPLEDHDFRSAAANLVVSVQGTHPANSPMGQAHLAEQLKTLGLQSGPGTG-R 352
Query: 190 KSPLVYQFSSLGSLDEKWMAEL----SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 244
+ L Q SS+GS D KW+ S S + +ED PL +++P++ VR S
Sbjct: 353 TATLECQGSSIGSYDLKWLNNFYRCASGSPPTASTEDPDLQTKTPPLSVLYPSLHTVRNS 412
Query: 245 LEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKTF------------- 290
G A + + +K +F +A + TG + H+K
Sbjct: 413 HSGKAGAGTLFCNKATWEKANFPTHIFADTMSKRTG---VLMHVKMILGLFNSDSSAEST 469
Query: 291 ----------------ARYNGQKLAWFLLTSANLSKAAWGALQ-----KNNSQLMIRSYE 329
R N + + S N + AAWG +++ L I ++E
Sbjct: 470 SSTLATASVDKSGARDGRINKDHAGFLYIGSHNFTPAAWGKFNSKSGSDDSTSLEISNWE 529
Query: 330 LGVLI 334
LGV++
Sbjct: 530 LGVVL 534
>gi|327354754|gb|EGE83611.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis ATCC 18188]
Length = 696
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 117/462 (25%), Positives = 190/462 (41%), Gaps = 85/462 (18%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNKPANWI-LHKPPLPISFGTHH 57
M ++DW+ + K L+I GE D E K + L PP+ H
Sbjct: 262 MWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTKSMGSVRLCFPPMEPQVNCMH 319
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ------DFPLKDQNNLSEECG--F 109
SK MLL +P +RI V +ANL+ DW QG M+ D PLK +L+ G F
Sbjct: 320 SKLMLLFHPSYLRIAVPSANLVPFDWGE--QGGVMENIVFLIDLPLKSP-DLANGPGTSF 376
Query: 110 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFSSAA-VRLIASVPGYHTGS 165
+DL+ +L ++NL + KK F+FS+ + + ++ G HT
Sbjct: 377 LDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFSATKDIAFVHTIGGSHTDP 421
Query: 166 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE--LSSSMSSGFSE-- 221
+K G L + + + + L Y SS+GSL+E+++ L++ SG E
Sbjct: 422 KWRKTGLCGLGSAITALGLQT-TRDINLDYVTSSVGSLNEQFLRSMYLAAQGDSGLKELT 480
Query: 222 ----------------DKTPLG--IGEPLIVWPTVEDVRCSLEGYAAGNAI--------- 254
+T G + +V+P++ VR S G I
Sbjct: 481 LRTSKRFPSENWGVVTKRTDGGKWKDKFRVVFPSLNTVRKSKGGAENAGTICFQSKWYNS 540
Query: 255 PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWG 314
+ K++ +D + + + R I + + + W + SANLS++AWG
Sbjct: 541 ATFPKDIMRDNISRREGLLMHNKILFVRPEKPITSLKDNSTRYAGWAYVGSANLSESAWG 600
Query: 315 ALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTK 370
L + S +L R++E GV+I RH +S +PS +G T T K
Sbjct: 601 RLVLDRSTTKPKLNCRNWECGVVI---PIRHNDAGKLSS--IPS---TGRTAT---LLAK 649
Query: 371 LVTLTWHGSSDAGASSEVVY---LPVPYELPPQRYSSEDVPW 409
+ +SD G+ V+ +PVP +P RY + P+
Sbjct: 650 SESEDSSANSDDGSEVTTVFEPTIPVPMIVPAPRYHGRNRPF 691
>gi|367027210|ref|XP_003662889.1| hypothetical protein MYCTH_2304039 [Myceliophthora thermophila ATCC
42464]
gi|347010158|gb|AEO57644.1| hypothetical protein MYCTH_2304039 [Myceliophthora thermophila ATCC
42464]
Length = 646
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 97/428 (22%), Positives = 156/428 (36%), Gaps = 79/428 (18%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +W+L + A+ +LV + E M+ N P + I P G+ HSK ML
Sbjct: 237 DEEWMLSKIDI-ARTKLILVAFAADEAQKEEMRSNVPRDRIRFCFPPMHGIGSMHSKLML 295
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
L Y +RI+V T NL+ DW +++ D P K + E N D L
Sbjct: 296 LKYENYLRIVVPTGNLMSFDWGETGTMENMVFILDLP-KFETAEGREAQKLNRFADQLFY 354
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTV 178
L A G + + ++F+ A + ++PG HTG + G+ L
Sbjct: 355 F--------LRAQGLDEKLVDSLRNYDFTEAGRYEFVHTIPGSHTGDDALRTGYCGLG-- 404
Query: 179 LQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL------------------SSSMSSG 218
Q G + P+ +SLG+++ + L S
Sbjct: 405 -QSVNALVGTRSEPVELDLVCASLGAVNYGLLTSLYYACLGDPLREYEERASGSQRNRDA 463
Query: 219 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAK------ 272
F+ L I +P+ E V S G I L K+W
Sbjct: 464 FTSRAISLVKEHMRIFFPSRETVLRSKGGKDGAGTIC---------LLSKWWQAPTFPRE 514
Query: 273 -WKASHTGRSRAMPHIKTF--------ARYNGQKLAWFLLTSANLSKAAWGALQKNNS-- 321
+ + R + H K ++ +G+ A+ + SANLS++AWG L ++ +
Sbjct: 515 LVRDCKSVRQGVLMHTKALYVRPCSPTSQQSGRCFAY--VGSANLSESAWGRLSRDRASG 572
Query: 322 --QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 379
+L R++E GVL+ CT V +GS V + W G
Sbjct: 573 KPKLTCRNWECGVLL------------CTDRTVEGSSGAGSDNLGVFDGCVPVPMEWPGR 620
Query: 380 SDAGASSE 387
+ +G E
Sbjct: 621 AISGEGGE 628
>gi|452845379|gb|EME47312.1| hypothetical protein DOTSEDRAFT_21105 [Dothistroma septosporum
NZE10]
Length = 584
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 96/409 (23%), Positives = 167/409 (40%), Gaps = 72/409 (17%)
Query: 47 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFP-LKDQNN 102
PP+ + HSK MLL +P +R+ + +ANL++ DW Q ++M D P L +
Sbjct: 208 PPMDGAIYCMHSKLMLLFHPHKLRVAIPSANLLNFDWGETGQMENSVFMIDLPRLAGSTS 267
Query: 103 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGY 161
+ E DL T E + G K F+FS+ + I +V G
Sbjct: 268 QTTE-----DL-----TFFGQELMFFIERQGLDKDLRKGVLGFDFSATEHMAFIHTVGGM 317
Query: 162 -HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS---- 216
+ + + G + L ++ ++ + + SS+G L++ + +L S+ S
Sbjct: 318 NYERTGADRTGLLGLSRAVRYLGLTTDQRELEIDFAASSIGQLNDSQVQDLHSAASGQDL 377
Query: 217 -SGFSEDKTPLG--------------------IGEPLIVW-PTVEDVRCSLEGYAAGNAI 254
+ +E K+ I + L V+ PT E V+ S G AAG
Sbjct: 378 IAQAAEAKSKAATNFFAKKAASSKAASTSERDIKQKLRVYFPTKETVQASTAG-AAGTIC 436
Query: 255 PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWG 314
+ K F + + +K++ G + H K + LAW + SAN+SK+AWG
Sbjct: 437 LQRKYFEGKTFPRAIFRDYKSTRKG---LLSHNKILC-ARSKSLAWLYIGSANMSKSAWG 492
Query: 315 ALQKNNSQLMI--RSYELGVL------ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI 366
+ K+ + I R++E GVL ILP A + T + SE S E +
Sbjct: 493 EIPKDRKERRITCRNWECGVLLPVPKEILPPACKEKARRRHTDDEEDSETDSEDEEPQLV 552
Query: 367 QKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRY 415
+ +L + +P+E+P Y+ + PW + +++
Sbjct: 553 DMSVFSSL----------------VDLPFEVPGDDYNGRE-PWYFTEKH 584
>gi|297806769|ref|XP_002871268.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297317105|gb|EFH47527.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 1083
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 81/188 (43%), Gaps = 32/188 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIH-------GESDGTLEHMKRNKPANWILHKPPLP--ISF 53
DI W L C + + +P + H D N P N + PP P I+F
Sbjct: 411 DILWFLTCCEIPSHLPVTIACHHAERCWSSSPDARSTAPLPNYP-NVTMVFPPFPEEIAF 469
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQN 101
G HH K +L +R+I+ +ANL+ WN+ + +W QDFP + D
Sbjct: 470 GKDRKNRGIACHHPKLFILQREVSIRVIITSANLVARQWNDVTNTVWWQDFPRRADPDVL 529
Query: 102 NLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 155
+L C G + D L+ ++P+ ++ I F K+NF +A L+
Sbjct: 530 SLFGHCRRETNHGLKTDFCAQLAGFA-ASLLTDVPSQAHWIIE---FTKYNFEHSACHLV 585
Query: 156 ASVPGYHT 163
ASVPG H+
Sbjct: 586 ASVPGIHS 593
>gi|307108296|gb|EFN56536.1| hypothetical protein CHLNCDRAFT_144175 [Chlorella variabilis]
Length = 226
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 47/72 (65%), Gaps = 6/72 (8%)
Query: 284 MPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS-----A 338
MPH+KT+ R+ G +AW L S N+SKAAWG L ++ +L ++S+EL VL+LPS
Sbjct: 1 MPHLKTYCRHVGGDVAWLCLGSHNVSKAAWGELLRDG-RLYVKSFELSVLLLPSRELAYQ 59
Query: 339 KRHGCGFSCTSN 350
+ GFSCTS
Sbjct: 60 RSRRRGFSCTSG 71
>gi|453087183|gb|EMF15224.1| phospholipase D/nuclease [Mycosphaerella populorum SO2202]
Length = 629
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 92/408 (22%), Positives = 162/408 (39%), Gaps = 81/408 (19%)
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFPLKDQNNLSEECGFENDL 113
HSK MLL + +RI + TANL++ DW Q +++ D P Q G +NDL
Sbjct: 242 HSKLMLLFHADKLRIAIPTANLLNFDWGETGQMENTVFLIDLPRLPQ-------GQKNDL 294
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGH 172
+ L + + G + F+FS+ A + + +V G H + G
Sbjct: 295 TSFGRELMF-----FIEMQGLDQDVRDGVLNFDFSATADIAFVHTVGGVHYKDQAARTGL 349
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW-----MAELSSSMSSGFSEDKTPLG 227
+ L +++ G + + SS+G+L +K MA + + E ++ G
Sbjct: 350 LGLSRTVRQMDLVAG-PSLEIDFAASSIGALTDKQLNDFHMAARGVDLLAHAREARSKAG 408
Query: 228 IG------------------EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 269
+ + +PT E VR S G AAG + F K+
Sbjct: 409 ASFFKKAGSKTVTATTNVRKKIRVYFPTKETVRSSTAG-AAGTICLQREYYERNSFPKEC 467
Query: 270 WAKWKASHTG-------------RSRAMPH-------IKTFARYNGQKLAWFLLTSANLS 309
+ ++++ G RS A H + N +AW + S+N+S
Sbjct: 468 FRDYRSTRKGLLSHNKILCARGFRSTASEHADPPGVSVAATGSPNSNPVAWVYVGSSNMS 527
Query: 310 KAAWGAL--QKNNSQLMIRSYELGVLI------LPSAKRHGCGFSCTSNIVPSEIKSGST 361
K+AWG L ++ S++ R++E GV++ LPS+ F SE ++
Sbjct: 528 KSAWGELAAERTESKITCRNWECGVILSVPVETLPSSAGE-AAFKQRDANGDSETETEDE 586
Query: 362 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
++Q + V + A ++ L P+ +P + Y S++ PW
Sbjct: 587 TSAQTSTPEFVNIE--------AFRRIIDL--PFSIPGEEYKSQE-PW 623
>gi|212546293|ref|XP_002153300.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210064820|gb|EEA18915.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 684
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 108/472 (22%), Positives = 175/472 (37%), Gaps = 114/472 (24%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMKR--NKPANWILHKPPLPISFGTHHSK 59
D+ W+ K ++V+ + + T L++ + N P N L PP+ HSK
Sbjct: 258 DMQWMFTKFRTPNKTRFLMVMQAKEESTRLQYQEETANMP-NIRLCFPPMDGQVNCMHSK 316
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNLSEECGFENDLID 115
MLL +P +RI+V +AN++ DW + +++ D P K ND D
Sbjct: 317 LMLLFHPEYLRIVVPSANMVPYDWGEQGGVMENTVFLIDLPKKST----------NDAAD 366
Query: 116 YLSTLKWPEFSANLPA---HGNF--KINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 170
T + E S L A H N K++ FK+ N + + ++ G H G SL +
Sbjct: 367 SPKTAFYEELSYFLKASTLHENIIAKLSAFDFKETNRYA----FVHTIGGSHFGESLTRT 422
Query: 171 GHMKLRTVLQECTFEKGFK-KSPLVYQF--SSLGSLDEKWMAELSSSMSSGFSEDKTPLG 227
GH L + G K + P+ F SS+GSL +++M + S +T
Sbjct: 423 GHCGLGKAVTSL----GLKTREPINIDFVTSSIGSLTDEFMRSIYLSAQG----KQTLYS 474
Query: 228 IGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 287
I +I+ +V C L G + NA + F Y ++ S + SR
Sbjct: 475 IIRTIIL-----NVSCRLGGDGSTNAQRTTSSEWKSRFRVYYPSEQTVSQSKGSRRSAGT 529
Query: 288 KTFAR--YNGQKL---------------------------------------AWFLLTSA 306
F + G K W + SA
Sbjct: 530 ICFQEKWFTGPKFPRNTLHDCISRREGLLMHNKMMFVRPEKPINLPGGSNCAGWAYVGSA 589
Query: 307 NLSKAAWGAL----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 362
NLS++AWG + + +L R++E GVL+ + + P+ G +
Sbjct: 590 NLSESAWGKVVHDRVRKEPKLNCRNWECGVLV------------PITELPPAAGSDGEEQ 637
Query: 363 TSQIQKTKLVTLTWHGSSDAGASSEVVYL-----PVPYELPPQRYSSEDVPW 409
K + +GA ++V + PVP +P SE PW
Sbjct: 638 NKDSAKKE---------DKSGAEGDIVEIFGSTVPVPMRVPAPSLGSELKPW 680
>gi|346971357|gb|EGY14809.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium dahliae VdLs.17]
Length = 609
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 107/455 (23%), Positives = 167/455 (36%), Gaps = 98/455 (21%)
Query: 15 AKIPHVLVIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIV 73
A+ V + + ++ E ++ N P++ I L PP+ G HSK LL YP +RI+V
Sbjct: 199 ARTRMVFIAYAKNGAEQETLRANVPSSRIKLCFPPMH-GIGCMHSKLQLLKYPNHLRIVV 257
Query: 74 HTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLP 130
+ NL+ DW +++ D P Q + +D + F L
Sbjct: 258 PSGNLVPYDWGETGVLENIVFLIDLPRIVQAPEDRDAIRGHDAAGVSFGTELRRF---LR 314
Query: 131 AHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFK 189
A G + F+F+ + R I ++ G HT + G+ L + K
Sbjct: 315 AQGLDESLVKSLDNFDFTETERYRFIHTIAGGHTDQLSGETGYHGLSRAVHSMGLSTD-K 373
Query: 190 KSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP------------------ 231
+ Y SSLGS+D ++ + ++ D G+ +P
Sbjct: 374 PISVDYVTSSLGSIDNSFIKTIYTACQG--LNDGQKDGVDQPSRRNTKTALAATATDSDK 431
Query: 232 ------LIVWPTVEDVRCSLEGYAAGNAIPSPQK-----NVDKDFLKKYWAKWKASHTGR 280
I +PT + V S G AAG I +K +D L+ A T R
Sbjct: 432 ALGAKMRIYFPTEDTVAKSRGGKAAGGTICFQEKWWGSATFPRDMLR------DAISTRR 485
Query: 281 SRAMPHIKTFARYNGQ------KLAWFLLTSANLSKAAWGALQK----NNSQLMIRSYEL 330
M F + NG W + SANLS++AWG L K ++L R++E
Sbjct: 486 GVLMHDKIIFVQPNGTGGQDDPGAGWAYVGSANLSESAWGRLTKERGSGRAKLTCRNWEC 545
Query: 331 GVLILP--SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 388
GVL+ + R G S G+ +AG E
Sbjct: 546 GVLVPTGNTGDRSSGGLS-------------------------------GAGEAGKMLEA 574
Query: 389 VY--LPVPYELPPQRY------SSEDVPWSWDKRY 415
+PVP P + Y ++ D PW + KRY
Sbjct: 575 FRGAVPVPMVAPSRAYGASSNDTAADRPWLFMKRY 609
>gi|255719760|ref|XP_002556160.1| KLTH0H06468p [Lachancea thermotolerans]
gi|238942126|emb|CAR30298.1| KLTH0H06468p [Lachancea thermotolerans CBS 6340]
Length = 570
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 102/420 (24%), Positives = 161/420 (38%), Gaps = 72/420 (17%)
Query: 44 LHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 102
L PP F HHSK ++ Y G I + + N H + N Q +W L+ +
Sbjct: 179 LQMPP----FACHHSKMIVNEYRDGSCCIYIPSNNFTHAETNLPQQIVWCSP-RLRRCSE 233
Query: 103 LSEECGFENDLIDYLS----TLK-WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 157
+E F L+ YL+ +LK EF L ++ F F+ +++
Sbjct: 234 AVKESEFRKSLVKYLNAYPVSLKPLIEFLGTLDFTSLDQLGVEFI--FSCPKPFESILSG 291
Query: 158 VPGYHTGSSLKK------WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 211
+P H S ++ G + R + Q T +PL G+L M L
Sbjct: 292 IPLLHKALSSRQHAAGGNTGRERHRYLSQVSTI-----GAPLKTGLEYPGNLFSHLMIPL 346
Query: 212 SSSMSSGFSEDKTPLGIG-----------EPLIVWPTVEDVRCSLEGYAAGNAIPSP-QK 259
S + G + K I EP IV+PT E++R S GY G +
Sbjct: 347 LSGLLVGPRDRKRAYEIPNLHKVFEDYNIEPYIVYPTPEEIRQSPMGYLTGGWFHFHWLR 406
Query: 260 NVDKDFLKKYWAKWKASHTG-------RSRAMPHIKTFARYNG--------QKLAWFLLT 304
N + KW H R R H K + + ++ WFL T
Sbjct: 407 NQATKTVYNTLKKWGVLHKQQPQDCPRRGRTPSHTKFYMKSTTLLDNQAPFSEVDWFLFT 466
Query: 305 SANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 364
+ANLS AWG + ++YE+GVL S R S++V S+ +S T
Sbjct: 467 TANLSLNAWGTTTRKP-----QNYEVGVL-FKSQDRRRITVKSVSDLVYSKFRS----TG 516
Query: 365 QIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 424
QI GSS +++ + + VP+++ P Y D + + Y D++G++
Sbjct: 517 QIL----------GSSKVHSNANICVM-VPFDINPVPYQPGDDAFCVSRSYEAPDIHGKL 565
>gi|317035597|ref|XP_001396653.2| tyrosyl-DNA phosphodiesterase [Aspergillus niger CBS 513.88]
Length = 640
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 103/457 (22%), Positives = 175/457 (38%), Gaps = 75/457 (16%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHS 58
M +++WL + K +LV+ E D T + N L PP+ HS
Sbjct: 204 MWEMEWLFSKFNI-EKTRFILVMQAEDDATKRQYESETATMRNLRLCFPPMGGQVVCMHS 262
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDFPLKDQNNLSEE--CGFEND 112
K MLL +P +R++V TANL DW + +++ D P K N+ E+ F D
Sbjct: 263 KLMLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLIDLPKK---NVLEKPTTHFYED 319
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWG 171
L+ + LK N+ A F+FS ++ + ++ G HT ++ K+ G
Sbjct: 320 LVVF---LKASTLHENIIAK---------LDNFDFSKTSKYAFVHTIGGSHTDTAWKRTG 367
Query: 172 HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--AELSSSMSSGFSEDKTPLGIG 229
+ L ++ + + Y SS+G++ ++++ L+S G +E
Sbjct: 368 YCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFLRCMYLASQGDDGLTEFSIRYAKT 426
Query: 230 EPL-----------------------IVWPTVEDVRCSLEGYAAGNAIPSPQK-----NV 261
P+ + +P+ V S G + K N
Sbjct: 427 FPVPRRDNPSKLLKKDTGSEWSDRFRLYFPSQNTVATSKGGPRCAGTVCFQSKWYNGENF 486
Query: 262 DKDFLKKYWAKWKA--SHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN 319
+ L+ ++ K H P Q AW + SAN+S++AWG L ++
Sbjct: 487 PRHILRDCESQRKGLLMHNKILYVRPDDPIPLSETTQCRAWAYIGSANMSESAWGRLVQD 546
Query: 320 NS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLT 375
S +L R++E GV++ R S++K E K +
Sbjct: 547 RSTKSPKLNCRNWECGVIVPVIEDRTDS----------SDLKDKIHEDKCKGKASEFSSL 596
Query: 376 WHGSSDAGASSEVVY---LPVPYELPPQRYSSEDVPW 409
+D GA+ VV+ +PVP +P RY PW
Sbjct: 597 SSSDNDDGANLPVVFENTIPVPMRVPGARYGGGRKPW 633
>gi|147770909|emb|CAN67540.1| hypothetical protein VITISV_012382 [Vitis vinifera]
Length = 1423
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 39/191 (20%)
Query: 3 DIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPLP--ISFG 54
D+ W L C V +P + H S ++ + N ++ PP P I+FG
Sbjct: 417 DVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPFPEAIAFG 476
Query: 55 ---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP--------- 96
HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 477 RDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRISPPDYSS 536
Query: 97 -----LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 151
+ NL F L ++++L ++P+ ++ + K++F A
Sbjct: 537 IFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTKYDFKGAT 588
Query: 152 VRLIASVPGYH 162
L+ASVPG H
Sbjct: 589 GHLVASVPGIH 599
>gi|409042750|gb|EKM52233.1| hypothetical protein PHACADRAFT_148739 [Phanerochaete carnosa
HHB-10118-sp]
Length = 603
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 91/386 (23%), Positives = 149/386 (38%), Gaps = 85/386 (22%)
Query: 18 PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG--VRIIVHT 75
P V+V + G E +K P +WI P L G H K +++ R +R+++ T
Sbjct: 193 PVVIVTQDPAAGN-ETLKEVLP-DWIKTTPFLRNGRGCQHMKVTFILFYRTSRLRMVIST 250
Query: 76 ANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANL-----P 130
AN I DW + +W+QD P + + ++ + + + ++ L+ + L
Sbjct: 251 ANFIEYDWRDIENSVWLQDVPPR-PSPIAHDSKANDFPMAFMRVLRGVNVAPALLTLTKN 309
Query: 131 AHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE--CTFE 185
H N + K++FS V LI S+ G H G + + GH L LQ+
Sbjct: 310 GHSNLPLKRIEELRMKWDFSKIKVALIPSLAGKHEGWPKVIQTGHTALMKALQDMGARTP 369
Query: 186 KGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPT 237
KG K+ L Q SS+G+ +W+ E + +E + L I++PT
Sbjct: 370 KG-KELVLECQGSSIGTYTTQWLNEFYVTARGESAESWLDQPRARRARLPFPLVKILFPT 428
Query: 238 VEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA----------SHTGRSRAMPHI 287
+ V+ S G G + F ++ A+W+ S + R R + H
Sbjct: 429 RKTVQDSALGEPGGGTM----------FCRR--AQWQGANFPRELFHDSKSKRGRVLMHS 476
Query: 288 K----TFARY---------------------------------NGQKLAWFLLTSANLSK 310
K TF N + W + S N +
Sbjct: 477 KLILATFRDSAFAASSSGSSKRHDTPSTDVSDDEIVEVPPPPGNEDFVGWAYVGSHNFTP 536
Query: 311 AAWGALQKN--NSQLMIRSYELGVLI 334
+AWG L + N L I +YELGVL+
Sbjct: 537 SAWGTLSGSAFNPTLNITNYELGVLV 562
>gi|302142785|emb|CBI20080.3| unnamed protein product [Vitis vinifera]
Length = 1032
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 39/191 (20%)
Query: 3 DIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPLP--ISFG 54
D+ W L C V +P + H S ++ + N ++ PP P I+FG
Sbjct: 373 DVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPFPEAIAFG 432
Query: 55 ---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP--------- 96
HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 433 RDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRISPPDYSS 492
Query: 97 -----LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 151
+ NL F L ++++L ++P+ ++ + K++F A
Sbjct: 493 IFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTKYDFKGAT 544
Query: 152 VRLIASVPGYH 162
L+ASVPG H
Sbjct: 545 GHLVASVPGIH 555
>gi|402224759|gb|EJU04821.1| phospholipase D/nuclease [Dacryopinax sp. DJM-731 SS1]
Length = 955
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/310 (23%), Positives = 132/310 (42%), Gaps = 35/310 (11%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAM 61
D +WL P A +P + + H E + P + ++ P G H K +
Sbjct: 519 TDFEWLRSMIP--AGVPLLSINHPTDRERWEPQIKPLPLDGWIYATPKMNKGGIMHVKLL 576
Query: 62 LLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 120
LL Y G +R+++ TANL+ DW + +++QD P K++++ +E F L +L L
Sbjct: 577 LLFYKNGRLRLVIPTANLVPDDWRDIENTMFLQDIPAKNKDSSAEPHPFPVYLASFLKIL 636
Query: 121 KWPEFSANLPAHGNFKINP-----SFFKKFNFSSAAVRLIASVPG-YHTGSSLKKWGHMK 174
+ L G + P + +++S +L+ S G Y S+++WGH +
Sbjct: 637 NVHNGLSAL-VQGGYPNLPLPSLDALATGWDWSRVTAQLVGSPAGSYEDWDSVRRWGHPR 695
Query: 175 LRTVLQECTFEKGF-KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED---KTPLGIGE 230
L +++ + K+ L YQ SS+G+ +++ + S G S D + P
Sbjct: 696 LGEAVRQLKAQPPTGKRLNLEYQGSSIGNYTTQYLNDFYKS-GCGLSPDVSKRRPKAQPW 754
Query: 231 PL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK-YWAK-------WKASHTGR 280
P IV+P++ V ++ G + F +K YW+K ++ S
Sbjct: 755 PAIQIVYPSLTTVDNTVLGRLGAGSF----------FCRKQYWSKPNAPRKLFRDSRARS 804
Query: 281 SRAMPHIKTF 290
R + H K
Sbjct: 805 GRVLMHTKMI 814
>gi|359493967|ref|XP_002283806.2| PREDICTED: uncharacterized protein LOC100243589 [Vitis vinifera]
Length = 1091
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 39/191 (20%)
Query: 3 DIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPLP--ISFG 54
D+ W L C V +P + H S ++ + N ++ PP P I+FG
Sbjct: 413 DVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPFPEAIAFG 472
Query: 55 ---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP--------- 96
HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 473 RDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRISPPDYSS 532
Query: 97 -----LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 151
+ NL F L ++++L ++P+ ++ + K++F A
Sbjct: 533 IFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTKYDFKGAT 584
Query: 152 VRLIASVPGYH 162
L+ASVPG H
Sbjct: 585 GHLVASVPGIH 595
>gi|7576178|emb|CAB87929.1| hypothetical protein [Arabidopsis thaliana]
Length = 1075
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 79/188 (42%), Gaps = 32/188 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPPLP--ISF 53
DI W L C +P + H D N P N + PP P I+F
Sbjct: 408 DILWFLTCCDTPRHLPVTIACHNAERCWSSNPDARTAVPLPNYP-NVTMVYPPFPEEIAF 466
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQN 101
G HH K +L +R+I+ +ANL+ WN+ + +W QDFP + D
Sbjct: 467 GKDRTNRGIACHHPKLFILQRKDSIRVIITSANLVARQWNDVTNTVWWQDFPRRADPDLL 526
Query: 102 NLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 155
+L C G + D L+ ++P+ ++ + F K+NF +A L+
Sbjct: 527 SLFGHCQRETNHGLKPDFCAQLAGFA-ASLLTDVPSQAHWILE---FTKYNFEHSAGHLV 582
Query: 156 ASVPGYHT 163
ASVPG H+
Sbjct: 583 ASVPGIHS 590
>gi|42567721|ref|NP_196357.2| forkhead-associated domainand FHA domain-containing protein
[Arabidopsis thaliana]
gi|30102672|gb|AAP21254.1| At5g07400 [Arabidopsis thaliana]
gi|110743660|dbj|BAE99667.1| hypothetical protein [Arabidopsis thaliana]
gi|332003770|gb|AED91153.1| forkhead-associated domainand FHA domain-containing protein
[Arabidopsis thaliana]
Length = 1084
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 79/188 (42%), Gaps = 32/188 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPPLP--ISF 53
DI W L C +P + H D N P N + PP P I+F
Sbjct: 408 DILWFLTCCDTPRHLPVTIACHNAERCWSSNPDARTAVPLPNYP-NVTMVYPPFPEEIAF 466
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQN 101
G HH K +L +R+I+ +ANL+ WN+ + +W QDFP + D
Sbjct: 467 GKDRTNRGIACHHPKLFILQRKDSIRVIITSANLVARQWNDVTNTVWWQDFPRRADPDLL 526
Query: 102 NLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 155
+L C G + D L+ ++P+ ++ + F K+NF +A L+
Sbjct: 527 SLFGHCQRETNHGLKPDFCAQLAGFA-ASLLTDVPSQAHWILE---FTKYNFEHSAGHLV 582
Query: 156 ASVPGYHT 163
ASVPG H+
Sbjct: 583 ASVPGIHS 590
>gi|158293223|ref|XP_001237573.2| AGAP010579-PA [Anopheles gambiae str. PEST]
gi|157016855|gb|EAU76764.2| AGAP010579-PA [Anopheles gambiae str. PEST]
Length = 103
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/53 (56%), Positives = 38/53 (71%), Gaps = 1/53 (1%)
Query: 284 MPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
MPHIKT+ R+ + L WFLLTSAN SK+AWG + + + L I +YE GVL LP
Sbjct: 1 MPHIKTYCRWTPEGLQWFLLTSANFSKSAWG-ITRYDKLLYINNYEAGVLFLP 52
>gi|358399116|gb|EHK48459.1| hypothetical protein TRIATDRAFT_290150 [Trichoderma atroviride IMI
206040]
Length = 590
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 97/418 (23%), Positives = 158/418 (37%), Gaps = 81/418 (19%)
Query: 34 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGL 90
M+ N PAN PP+ G HSK LL YP +R+++ T NL+ DW +
Sbjct: 207 MQGNVPANIKFCFPPMH-GVGAMHSKLQLLKYPSHLRVVIPTGNLMPYDWGETGVMENMV 265
Query: 91 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-S 149
++ D P D + + + T + E L A G + + ++FS +
Sbjct: 266 FLIDLPRLDHPVSTHASAARS----HAPTRFYTELVYFLQATGVGEKMVASLANYDFSRT 321
Query: 150 AAVRLIASVPGYHTG--------------------------SSLKKWGHMKLRTVLQECT 183
A + + ++PG H+ +SL +R + C
Sbjct: 322 ADLAFVHTIPGSHSAKNAERIASVADLGLASVDPVDVDLVCASLGALNQQMVRAIYNACR 381
Query: 184 FEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC 243
+ G + SS S + +++++S + L I +PT V
Sbjct: 382 GDDGTDEYHKPASTSSRSSAKKPTTTTTTATVTS-----QEQLLRERFRIYFPTDRTVSQ 436
Query: 244 SLEGYAAGNAIPSPQK-----NVDKDFLKKYWAKWKA---SHTGRSRAMPHIKTFARYNG 295
S G AG I K N ++ ++ ++ + S R P A+
Sbjct: 437 SRGGRNAGGTICVQTKWWRAPNFPRELVRDVISRDRVLMHSKMIFVRRRPGDSGQAQAVR 496
Query: 296 QKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNI 351
Q W + SANLS++AWG + K+ S +L+ R++E GV+I
Sbjct: 497 QSPGWAYVGSANLSESAWGRMSKDKSTGGFKLVCRNWECGVII----------------P 540
Query: 352 VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
VP E+ + KT L T S+D S +PVP ++P Y S D PW
Sbjct: 541 VP--------ESQPVDKTTLPT-----SADDDMSMFAGTVPVPMQVPGPVYRSSDQPW 585
>gi|315045107|ref|XP_003171929.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma gypseum CBS 118893]
gi|311344272|gb|EFR03475.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma gypseum CBS 118893]
Length = 678
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 96/217 (44%), Gaps = 22/217 (10%)
Query: 3 DIDWLLPA-CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKA 60
D+DWLL + ++ GE + + M+ WI L PP+ HSK
Sbjct: 232 DMDWLLAKFTNPKTRFLFIMGAKGE-ERQAQLMRETASMPWIRLCFPPMDGEVHCMHSKL 290
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNLSEECGFENDLIDY 116
MLL +P +RI++ +ANL DW K L++ D P K + ++ F ++L+ +
Sbjct: 291 MLLFHPNHMRIVIPSANLDPYDWGEKGGVMENMLFLIDLPRKAREADEDKTPFRDELVYF 350
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGS-SLKKWGHMK 174
L K N KI +F+FS + + S+ G H GS S ++ GH
Sbjct: 351 LRASKL-----------NEKIIDKML-QFDFSNTTKYAFVHSIGGSHIGSGSYERTGHCG 398
Query: 175 LRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 211
L T ++ E + L Y SS+GSL ++ L
Sbjct: 399 LGTAVKSLGLETS-RPLTLDYITSSVGSLTATFLQNL 434
>gi|367001138|ref|XP_003685304.1| hypothetical protein TPHA_0D02330 [Tetrapisispora phaffii CBS 4417]
gi|357523602|emb|CCE62870.1| hypothetical protein TPHA_0D02330 [Tetrapisispora phaffii CBS 4417]
Length = 563
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 100/419 (23%), Positives = 159/419 (37%), Gaps = 73/419 (17%)
Query: 44 LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 102
+ PP F HHSK ++ IY + ++ + + N + N Q W D N+
Sbjct: 176 FYMPP----FSCHHSKMVINIYEDKSCKLFIPSNNFTFYETNLPQQVCWEGPTLPYDINS 231
Query: 103 LSEECGFENDLIDYLSTLKWPEFSAN---LPAHGNFKINPSFFKKFNFSSAAVRLIASVP 159
+++ F+ +LI Y + N +P N F K N V + S P
Sbjct: 232 KNQKISFKENLISYFQSYPSEVKIMNRTIIPMISNID-----FSKLN----NVEFLYSSP 282
Query: 160 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM-----AELSSS 214
S + K ++ + L C+ + K++ + Q S++G K + L
Sbjct: 283 N-DKDSGISKLLYLLEKNDLLGCSDDIN-KRTHFLCQSSTIGGSLSKTVPLNIFTHLMIP 340
Query: 215 MSSGFSEDKTPLGIGE------------PLIVWPTVEDVRCSLEGYAAGNAIPSPQKN-- 260
SG + L + P IV+PTVE++R S G+ N KN
Sbjct: 341 EFSGIQKSNKKLKTSQELIDIYREKRISPYIVYPTVEELRNSPSGWKCSNWFHFNYKNKA 400
Query: 261 -----VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ---------KLAWFLLTSA 306
+ KDF Y K + + R H K + R KL W + TS+
Sbjct: 401 EYYEVLAKDFKLFYKQKDQLTSKYRKATPSHSKFYIRCTENDSKVPARFSKLDWCIFTSS 460
Query: 307 NLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI 366
NLS AWG L R+YE+G+L+ G +C+S + G + S
Sbjct: 461 NLSFNAWGKLSSK-----PRNYEVGILL---CSNEGQQINCSSFSRKIDEHQGCSRLSDS 512
Query: 367 QKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE-DVPWSWDKRYTKKDVYGQV 424
TK +D + V+ VP+ LP + Y + D + K Y D +G+V
Sbjct: 513 NNTK---------NDGKKNINVM---VPFTLPLEPYDIKYDTAFCIQKSYNLPDCFGEV 559
>gi|239613173|gb|EEQ90160.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis ER-3]
Length = 662
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 110/440 (25%), Positives = 178/440 (40%), Gaps = 75/440 (17%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNKPANWI-LHKPPLPISFGTHH 57
M ++DW+ + K L+I GE D E K + L PP+ H
Sbjct: 262 MWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTKSMGSVRLCFPPMEPQVNCMH 319
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ------DFPLKDQNNLSEECG--F 109
SK MLL +P +RI V +ANL+ DW QG M+ D PLK +L+ G F
Sbjct: 320 SKLMLLFHPSYLRIAVPSANLVPFDWGE--QGGVMENIVFLIDLPLKSP-DLANGPGTSF 376
Query: 110 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFSSAA-VRLIASVPGYHTGS 165
+DL+ +L ++NL + KK F+FS+ + + ++ G HT
Sbjct: 377 LDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFSATKDIAFVHTIGGSHTDP 421
Query: 166 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 225
+K G L + + + + +F S E W ++ G +DK
Sbjct: 422 KWRKTGLCGLGSAITALGLQTTRDINLDYVRFPS-----ENW-GVVTKRTDGGKWKDKF- 474
Query: 226 LGIGEPLIVWPTVEDVRCSLEGYAAGNAI---------PSPQKNVDKDFLKKYWAKWKAS 276
+V+P++ VR S G I + K++ +D + + +
Sbjct: 475 ------RVVFPSLNTVRKSKGGAENAGTICFQSKWYNSATFPKDIMRDNISRREGLLMHN 528
Query: 277 HTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGV 332
R I + + + W + SANLS++AWG L + S +L R++E GV
Sbjct: 529 KILFVRPEKPITSLKDNSTRYAGWAYVGSANLSESAWGRLVLDRSTTKPKLNCRNWECGV 588
Query: 333 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY-- 390
+I RH +S +PS +G T T K + +SD G+ V+
Sbjct: 589 VI---PIRHNDAGKLSS--IPS---TGRTAT---LLAKSESEDSSANSDDGSEVTTVFEP 637
Query: 391 -LPVPYELPPQRYSSEDVPW 409
+PVP +P RY + P+
Sbjct: 638 TIPVPMIVPAPRYHGRNRPF 657
>gi|389632429|ref|XP_003713867.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae 70-15]
gi|351646200|gb|EHA54060.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae 70-15]
Length = 636
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 163/391 (41%), Gaps = 63/391 (16%)
Query: 54 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDL 113
G HSK LL +P +RI+V + NL+ DW ++ G+ + D L E++
Sbjct: 249 GCMHSKLQLLKFPGYLRIVVPSGNLVPYDWGEQN-GIMENSVFIIDLPPLKAGVKLEDNT 307
Query: 114 IDYLSTLKWPEFSANLPAHG-NFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWG 171
+ E S L A G N +I S +K++FS ++ + ++ G HTG ++ G
Sbjct: 308 LTSFGE----ELSYFLTAQGLNERIINS-LRKYDFSQTSRYAFVHTIAGVHTGDKWRRTG 362
Query: 172 HMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAELSSSM--SSGFSE-----D 222
+ L +Q P+ F SS+G+L ++ L ++ SG +
Sbjct: 363 YCGLGRAIQNLGLA---TDEPVEIDFVASSMGALKYGYLLALYNAFQGDSGLKDYQSRAS 419
Query: 223 KTPLGIGEPL------------IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW 270
KT + I +P++ V S G + + L+ W
Sbjct: 420 KTKTSKEDAASAQQAKLRDFFRIYFPSLATVEASRGGTRSAGTL----------CLRSGW 469
Query: 271 AKWKASHTGRS---------RAMPHIK-TFARYNGQKLAWFLLTSANLSKAAWGAL---Q 317
W+A+ R+ A+ H K FAR AW + SAN+S++AWG L
Sbjct: 470 --WEAATFPRALFRDYENPRGALVHSKIVFARPPDASAAWAYVGSANVSESAWGNLLVKD 527
Query: 318 KNNSQ--LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLT 375
+ +SQ + R++E GV I+P + G + ++ I P + +G + + +
Sbjct: 528 RASSQPKMSCRNWECGV-IVPVGEPASPGRTLSTGIDPGDASAGKGGSLHGHQARNSPQE 586
Query: 376 WHGSSDAGASSEVVY---LPVPYELPPQRYS 403
+ S E ++ +P+P +LP + Y+
Sbjct: 587 QNAPVGRSRSIEELFSECVPLPMQLPGRSYA 617
>gi|407927985|gb|EKG20864.1| hypothetical protein MPH_01847 [Macrophomina phaseolina MS6]
Length = 642
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 91/404 (22%), Positives = 161/404 (39%), Gaps = 87/404 (21%)
Query: 49 LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFP-LKDQNNLS 104
L + G +H K ++ +P+ +R+ + TANL DW + +++ D P L + S
Sbjct: 285 LDMKNGHNHGKFLIGSHPKYLRVAITTANLKGHDWGESGKMENTVFIIDLPRLPEGKKTS 344
Query: 105 EE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGY 161
E+ F +L YL +L + L A +F++S + + + S+ G
Sbjct: 345 EDEATAFCQNLRFYLKSL-----NVGLSAR-------DALLRFDWSRTRNLGFVCSLQGA 392
Query: 162 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE-LSSSMSSGFS 220
G ++ G L ++E + + L Y SSLG+L +M + L+++
Sbjct: 393 SIGDDGQRIGLPGLSQAIKELNLKS--NRLALDYATSSLGALSRGFMKQFLTAAKGEELE 450
Query: 221 EDK----TPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW-- 270
K + +G+ L + +PTV+ VR S G AG I FL+K W
Sbjct: 451 ATKEKYDADIKLGDLLKQFRVYFPTVDTVRASKGGEEAGGTI----------FLRKRWYD 500
Query: 271 ------AKWKASHTGRSRAMPHIKTF--------------ARYNGQKLAWFLLTSANLSK 310
A + R+ + H K G+K+AW + S N ++
Sbjct: 501 APSFPKASMHDHKSTRNGILSHNKLIICRGQIGPEDEDNAGATEGKKVAWAYVGSHNFTQ 560
Query: 311 AAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTK 370
AAWG L ++ + ++ + + + CG I+P S + Q K
Sbjct: 561 AAWGTLSRDKNTKTLKV---------NCRNNECGV-----IIPIFRGGASEQVGQEDK-- 604
Query: 371 LVTLTWHGSSDAGASSEVVY--LPVPYELPPQRYSSEDVPWSWD 412
+ D EV + +P+E+P +RY ++ PW D
Sbjct: 605 ------NAEEDGLPGYEVFARKMEIPFEIPGERYGNKK-PWFTD 641
>gi|357520291|ref|XP_003630434.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
gi|355524456|gb|AET04910.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
Length = 1064
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 83/192 (43%), Gaps = 41/192 (21%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPPLP--ISF 53
DI W L C + +P + D + +N P N ++ PP P I+F
Sbjct: 401 DITWFLTYCKIPYHLPVTIACQNTEKCWSSKPDERVFVPYQNYP-NLVVVHPPFPETIAF 459
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL------- 97
G HH K ++L +R+I+ +ANL+ WN+ + +W QDFP
Sbjct: 460 GKDHKRHGIACHHPKLIVLQREDSIRVIITSANLVEKQWNSVTNTIWWQDFPRAILVDYA 519
Query: 98 -------KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 150
D+ + + +C F L ++++L ++P+ ++ K++F SA
Sbjct: 520 SLFRKIDDDEVHRNSKCDFAAQLAGFMASL-----VIDVPSQAHWITQ---LTKYDFGSA 571
Query: 151 AVRLIASVPGYH 162
L+AS+PG H
Sbjct: 572 TGHLVASLPGIH 583
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 95/242 (39%), Gaps = 65/242 (26%)
Query: 149 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 208
+A LIAS+ + +G +L+ VL + + + + S +VY SS+GS++ K++
Sbjct: 746 AAFCSLIASIQ--------RHYGLWRLQEVLNQYRWPESLE-SEIVYGASSIGSVNSKFL 796
Query: 209 AELSS-----SMSSGFSEDKTP----------LGIGEPLIVWPTVEDVRCSLEGYAAGNA 253
A S+ S+ SE+ P L I++PT+E V+ + G
Sbjct: 797 AAFSAAAGKKSLQHFDSEESDPEWGCWNAREELKNPSVKIIFPTIERVKSAYNGILPSRR 856
Query: 254 IPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPH--------------IKTF-ARYNGQKL 298
I F ++ W + K A+PH + F +R +
Sbjct: 857 ILC--------FSERTWQRLKTLDVLHD-AVPHPHERVGHPMHTKVVRRCFWSRGEAPSI 907
Query: 299 AWFLLTSANLSKAAWGALQKN----------------NSQLMIRSYELGVLI-LPSAKRH 341
W S N S AAWG N NS L I +YELG++ P ++ +
Sbjct: 908 GWVYCGSHNFSAAAWGRQISNPFGTKADDPHKGDPSVNSGLHICNYELGIIFTFPPSENN 967
Query: 342 GC 343
C
Sbjct: 968 EC 969
>gi|402219032|gb|EJT99107.1| phospholipase D/nuclease [Dacryopinax sp. DJM-731 SS1]
Length = 680
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 130/296 (43%), Gaps = 44/296 (14%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEH---MKRNKPANWILHKPPLPISFGTHHS 58
D WL P +IP +LV+ + D + H +K +W+ P + S G H
Sbjct: 233 TDTPWLTTFLP--REIPVLLVV--DPDPSQRHDASLKNLGIGDWLRVTPRIWQSRGVMHI 288
Query: 59 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECGFENDLIDY 116
K +LL Y G +R+ + TANL+ DW + +++QD P+ D + + F L
Sbjct: 289 KVLLLFYKSGRLRVAIPTANLVDYDWRDIENTVFVQDLPPITDSSADPQSHDFPTYLWGV 348
Query: 117 LSTLKWPEFSANLPAHG----NFKINPSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWG 171
L +L P NL G + + K+++ RL+ASV G + G +++ +G
Sbjct: 349 LKSLNVPAGLLNLVNSGYPSLPLQSLQNLQDKWDWCKMRARLVASVAGNYEGWYNVRMYG 408
Query: 172 HMKLRTVLQECTFE-KGFKKSPLVYQFSSLGSLDEKWMAELSSS-------------MSS 217
H +L ++++ + K K + Q SS+G+ +++ E+ S MS
Sbjct: 409 HPRLSAIIRDSRAQPKKGKVLNIECQGSSVGNCTTQYLNEVYKSCCGIDPISWIDIPMSR 468
Query: 218 GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK-YWAK 272
+ P+ I++PT++ V S+ G G + F KK YW+K
Sbjct: 469 QVRQPWPPVK-----ILFPTLKTVDDSVFGRNGGGSF----------FCKKPYWSK 509
>gi|400597097|gb|EJP64841.1| ubiquitin interaction domain-containing protein [Beauveria bassiana
ARSEF 2860]
Length = 540
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 87/396 (21%), Positives = 160/396 (40%), Gaps = 73/396 (18%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +WLL +K +L+ S+ + M+ N P N PP+ G+ HSK
Sbjct: 150 DEEWLLSKLNA-SKTRILLLAFAASEEQKQLMRGNVPKNIRFCFPPMN-GPGSMHSKLQF 207
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
L +P+ +R+++ + NL+ DW +++ D P + + F ++ +L
Sbjct: 208 LKFPKYLRLVIPSGNLVPYDWGETGVMENMVFLIDLPRLEASGNRTMTVFGENVARFLK- 266
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTV 178
A G + ++FS+ A + + S+PG H G +L++ G+ L
Sbjct: 267 -----------ASGVDEAMVESIANYDFSATANLGFVYSIPGGHMGEALRQVGYCGLGAT 315
Query: 179 LQECTFEKGFKKSPLVYQF--SSLGSLD-------------EKWMAELSSSMSSGFSEDK 223
++ +P+ +SLGS++ + M E ++ + +
Sbjct: 316 VRGLGLA---TDTPIEVDLACASLGSINYDLINAVYNACQGDDGMQEYNARVGRKLKDKG 372
Query: 224 T-PLG--IGEPLIVWPTVEDVRCSLEGYAAGNAI---------PSPQKNVDKDFLKKYWA 271
T P G + I +PT V S G + I PS K + +D +
Sbjct: 373 TRPTGRLRDQFRIYFPTDRTVSESKGGRQSAGTICVQAKWWRAPSFPKELVRDCVNN--- 429
Query: 272 KWKASHTGRSRAMPHIKTF-------ARYNGQ--KLAWFLLTSANLSKAAWGALQKN--- 319
R + H K A GQ + W + SANLS++AWG + K+
Sbjct: 430 --------RDGLLMHSKIILVRRPAAAELIGQTPAMGWAYIGSANLSESAWGRVVKDRGT 481
Query: 320 -NSQLMIRSYELGVLI-LPSAKRHGCGFSCTSNIVP 353
++++ R++E GV++ + +GC + S +VP
Sbjct: 482 GSAKMSCRNWECGVVVPVHGNPGNGCDITIFSGVVP 517
>gi|255539987|ref|XP_002511058.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
gi|223550173|gb|EEF51660.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
Length = 1148
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 82/193 (42%), Gaps = 41/193 (21%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPPLP--ISF 53
DI W L C + + +P + H D + N P N + PP P I+F
Sbjct: 469 DILWFLSYCEIPSHLPVTIACHNTERCWSSNPDKRISMPYSNFP-NLSVVFPPFPEAIAF 527
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
G HH K ++L +R+I+ +ANL+ W+N + +W QDFP + +LS
Sbjct: 528 GNDRRRQGIACHHPKLLVLQRENSIRVIITSANLVPNQWHNVTNTIWWQDFPRRSTPDLS 587
Query: 105 --------------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 150
F L ++++L ++P+ ++ + K+NF A
Sbjct: 588 SLFTRVSDGEISQDSRSDFAAQLAGFIASL-----VIDVPSQAHWVVE---LTKYNFDGA 639
Query: 151 AVRLIASVPGYHT 163
L+AS+PG H+
Sbjct: 640 LGYLVASIPGIHS 652
>gi|302406010|ref|XP_003000841.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium albo-atrum VaMs.102]
gi|261360099|gb|EEY22527.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium albo-atrum VaMs.102]
Length = 586
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 102/447 (22%), Positives = 167/447 (37%), Gaps = 82/447 (18%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAM 61
D WLL A+ V + + ++ E ++ + P++ I L PP+ G HSK
Sbjct: 188 DEPWLLSKVDT-ARTRMVFIAYAKNGAEQETLRASVPSSRIKLCFPPM-YGIGCMHSKLQ 245
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
LL Y +RI+V + NL+ DW +++ D P Q + + ND
Sbjct: 246 LLKYQNHLRIVVPSGNLVPYDWGETGVLENMVFLIDLPRIVQASGDGDAIRGNDAAGVSF 305
Query: 119 TLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRT 177
+ F L A G + F+F+ + R I ++ G HT + G+ L
Sbjct: 306 GTELRRF---LRAQGLDESLVKSLDNFDFTETERFRFIHTIAGGHTDQLSGETGYHGLSR 362
Query: 178 VLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL-IVWP 236
+ P+ + + ++ + + + + +G + I +P
Sbjct: 363 AVHSLGLS---TDEPITVDYVAQQDQNDGGNQPSRRNTKTALNATDSQKALGVKMRIYFP 419
Query: 237 TVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAK-------WKASHTGRSRAMPHIK- 288
T + V S G AAG I F +K+W + S + R + H K
Sbjct: 420 TEDTVARSRGGKAAGGTIC---------FQEKWWGSATFPREMLRDSISTRPGVLMHDKI 470
Query: 289 TFARYN---GQK---LAWFLLTSANLSKAAWGALQK----NNSQLMIRSYELGVLI--LP 336
F + N GQ W + SANLS++AWG L K ++L R++E GVL+
Sbjct: 471 IFVQPNSTGGQDDPGAGWAYVGSANLSESAWGRLTKERGSGRAKLTCRNWECGVLVPTRT 530
Query: 337 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY--LPVP 394
+ R G S G+ +AG E +PVP
Sbjct: 531 TGDRSSGGLS-------------------------------GAGEAGKMLEAFRGAVPVP 559
Query: 395 YELPPQRY------SSEDVPWSWDKRY 415
P + Y ++ D PW + KRY
Sbjct: 560 MVAPSRAYGTSSNDTAADRPWLFMKRY 586
>gi|317148904|ref|XP_001822999.2| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae RIB40]
Length = 667
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 148/369 (40%), Gaps = 52/369 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHS 58
M +++WL AK LV+ + + T K A N L PP+ HS
Sbjct: 260 MWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAAMSNLRLCFPPMDGQVNCMHS 318
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQDFPLK-DQNNLSEECGFENDL 113
K MLL + VRI+V TANL DW +++ D P + D+++ GF ++L
Sbjct: 319 KLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIIDLPKRTDKDSGFTRTGFYHEL 378
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGH 172
+ LK N+ A ++FS A + + ++ G H G S ++ G+
Sbjct: 379 TYF---LKASTLHENIIAK---------LTDYDFSRTAHIAFVHTIGGSHMGDSWRRTGY 426
Query: 173 MKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMAE--LSSSMSSGFSEDKTPLG 227
L + G + S PL F SS+GSL ++++ L+ G +E
Sbjct: 427 CGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLRSIYLACQGDDGSTEYVLRTA 482
Query: 228 IGEP---------LIVWPTVEDVRCSLEGY-----AAGNAIPSPQKNVDKDFLKKYWAKW 273
P LI T E+ + Y + PQ F +++
Sbjct: 483 KSFPVRSRSNPTQLINKSTAEEWKDRFRVYFPSETTVNDTKGGPQSAGTICFQSRWYTGP 542
Query: 274 K-ASHTGRSRAM---PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMI 325
K H R + P N Q AW + SANLS++AWG L + + +L
Sbjct: 543 KFPRHVLRDCILYVRPDDPATLPDNSQCRAWAYVGSANLSESAWGRLVQERATKEPKLNC 602
Query: 326 RSYELGVLI 334
R++E GVL+
Sbjct: 603 RNWECGVLM 611
>gi|225678545|gb|EEH16829.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 686
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 112/460 (24%), Positives = 182/460 (39%), Gaps = 81/460 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSK 59
D DWL + K ++I GE D E K + L PP+ HSK
Sbjct: 247 DADWLFSKFDI--KKSRFILIMGEKEEDKKRELENDTKSMGSVRLCFPPMEPQVNCMHSK 304
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFP-LKDQNNLSEECGFENDLI 114
MLL + +RI++ +ANLI DW K +++ D P + + + F DL+
Sbjct: 305 LMLLFHLNHLRIVIPSANLIPFDWGEKGGIMENVVFLIDLPRISPSPDATPRTPFLEDLV 364
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA---SVPGYHTGSSLKKWG 171
+L ++NL K NF +A + IA ++ G HT + K+ G
Sbjct: 365 YFLQ-------ASNLDEQ-------IIQKMLNFDFSATKDIAFVHTIGGSHTDPTWKRTG 410
Query: 172 HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE--LSSSMSSGFSE-------- 221
L + + + L Y SS+GSL+E+++ L++ +G E
Sbjct: 411 LCGLGRAITSLGLQTS-QNLNLDYVTSSVGSLNEQFLRSIYLAAQGDTGLKELTFRTSRT 469
Query: 222 -DKTPLGI------GEP-----LIVWPTVEDVRCSLEGYAAGNAIPSPQK-----NVDKD 264
LG+ GE + +P++ V S G I K ++
Sbjct: 470 LPSEKLGVLTTRTDGEKWRDRFKVYFPSLNTVCQSKGGTMNAGTICFQSKWYNSTTFPRN 529
Query: 265 FLKKYWAKWKA--SHTGRSRAMPH--IKTFARYNGQKLAWFLLTSANLSKAAWGALQKNN 320
++ ++ H+ A P I + + Q W + SANLS++AWG L +
Sbjct: 530 VMRNNISRRDGLLMHSKMLFACPDKPITSSKDNSTQYAGWAYVGSANLSESAWGRLVLDR 589
Query: 321 S----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTW 376
S +L R++E GV+I + G G + S+ SGST + KL +
Sbjct: 590 STTKPKLNCRNWECGVVI--PIRHRGSG------QLSSQPSSGST-----LRPKLEPESE 636
Query: 377 HGSSDAGASSEVV-----YLPVPYELPPQRYSSEDVPWSW 411
S S++V +PVP +P + Y D PW +
Sbjct: 637 SASVTVSDGSKLVSVFEPRIPVPMRVPGEPYQPGDKPWYY 676
>gi|343476326|emb|CCD12540.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 173
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 14/102 (13%)
Query: 1 MVDIDWLLPAC-PVLAKIPHVLVIHGESDGTL---------EHMKRNKPANWILHKPPLP 50
++D++WL P+L +++I GE G L + RN+ + +P LP
Sbjct: 49 VMDVEWLFRVSDPLLMSKCTIVLISGEK-GFLHKYRHLVLHDRFGRNRVK---IVEPCLP 104
Query: 51 ISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWM 92
I FG HHSK ML I G+R+ V TAN I DWN K+QG++
Sbjct: 105 IPFGVHHSKMMLCINNNGIRVAVLTANFIEDDWNYKTQGIYF 146
>gi|402072975|gb|EJT68632.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 629
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 110/456 (24%), Positives = 178/456 (39%), Gaps = 98/456 (21%)
Query: 3 DIDWL-LPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI--LHKPPLPISFGTHHSK 59
D DWL P+ KI V E +E + A I L PP+ FG HSK
Sbjct: 226 DTDWLWRKVNPMKTKITLVAYAGNE----VEKAAVVESARGIARLCFPPMN-GFGYMHSK 280
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
LL +P +RI+V + NL+ DW G + D + + G E + +
Sbjct: 281 LQLLKFPGFLRIVVPSGNLVSYDWGET--GTMENVVFIIDLPPVGDLAGSEGNTLTSFGE 338
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTV 178
+ L A G + +K++F+ ++ + S+PG H G S + G+ L
Sbjct: 339 ----DLCYFLKAQGLEESLIKSLRKYDFTETSRYGFVHSIPGSHMGDSWNQTGYCGLGRA 394
Query: 179 LQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL--SSSMSSGFSE-----DKTPLGIG 229
+ + P+ SS+GSL K+ + L + SG E K G+G
Sbjct: 395 VNKLGLA---TDQPIEVDLVASSIGSLTSKFCSALYKACQGDSGIKEHESKGAKAKNGMG 451
Query: 230 EPL------------IVWPTVEDVRCSLEGY-AAGNA--------IPSPQKNVDKDFLKK 268
+ +P+++ V S G +AG +PS + + +D++
Sbjct: 452 GAASTTQAALAQRFRVYFPSLQSVVASRGGRNSAGTTCLQSRWWNLPSFPRELFRDYMNP 511
Query: 269 YWAKWKASHTGRSRAMPHIK-TFARYNGQKLAWFLLTSANLSKAAWGALQKNNS----QL 323
R + H K F R +W + SANLS++AWG L K+ + ++
Sbjct: 512 R------------RVLVHSKIIFVRAPSGGASWAYVGSANLSESAWGKLVKDRTSSSPKM 559
Query: 324 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK---SGSTETSQIQKTKLVTLTWHGSS 380
R++E GV I+P+ H E+K G E + I + V + G
Sbjct: 560 TCRNWESGV-IVPAGSGH-------------ELKHQGHGRAEGAGICGS--VGAVFEGC- 602
Query: 381 DAGASSEVVYLPVPYELPPQRYSSED---VPWSWDK 413
+P+P LP Y+S D +PW D+
Sbjct: 603 ----------VPLPMTLPGTEYASGDGTRLPWFIDQ 628
>gi|134082171|emb|CAK42283.1| unnamed protein product [Aspergillus niger]
Length = 655
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 100/433 (23%), Positives = 172/433 (39%), Gaps = 59/433 (13%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
M +++WL + K +LV+ E D T E N L PP+ HSK
Sbjct: 244 MWEMEWLFSKFNI-EKTRFILVMQAEDDATYESETATM-RNLRLCFPPMGGQVVCMHSKL 301
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDFPLKDQNNLSEE--CGFENDLI 114
MLL +P +R++V TANL DW + +++ D P K N+ E+ F DL+
Sbjct: 302 MLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLIDLPKK---NVLEKPTTHFYEDLV 358
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVP--GYHTGSSLKKWG 171
+ LK N+ A F+FS ++ + ++P G HT ++ K+ G
Sbjct: 359 VF---LKASTLHENIIAK---------LDNFDFSKTSKYAFVHTIPSGGSHTDTAWKRTG 406
Query: 172 HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP 231
+ L ++ + + Y SS+G++ ++++ + + ++ + L +
Sbjct: 407 YCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFLRCMYLASQVPRRDNPSKLLKKDT 465
Query: 232 LIVW--------PTVEDVRCSLEGYAAGNAIPSPQK-----NVDKDFLKKYWAKWKA--S 276
W P+ V S G + K N + L+ ++ K
Sbjct: 466 GSEWSDRFRLYFPSQNTVATSKGGPRCAGTVCFQSKWYNGENFPRHILRDCESQRKGLLM 525
Query: 277 HTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGV 332
H P Q AW + SAN+S++AWG L ++ S +L R++E GV
Sbjct: 526 HNKILYVRPDDPIPLSETTQCRAWAYIGSANMSESAWGRLVQDRSTKSPKLNCRNWECGV 585
Query: 333 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY-- 390
++ R S++K E K + +D GA+ VV+
Sbjct: 586 IVPVIEDRTDS----------SDLKDKIHEDKCKGKASEFSSLSSSDNDDGANLPVVFEN 635
Query: 391 -LPVPYELPPQRY 402
+PVP +P RY
Sbjct: 636 TIPVPMRVPGARY 648
>gi|212546295|ref|XP_002153301.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210064821|gb|EEA18916.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 596
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 115/266 (43%), Gaps = 43/266 (16%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMKR--NKPANWILHKPPLPISFGTHHSK 59
D+ W+ K ++V+ + + T L++ + N P N L PP+ HSK
Sbjct: 258 DMQWMFTKFRTPNKTRFLMVMQAKEESTRLQYQEETANMP-NIRLCFPPMDGQVNCMHSK 316
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNLSEECGFENDLID 115
MLL +P +RI+V +AN++ DW + +++ D P K ND D
Sbjct: 317 LMLLFHPEYLRIVVPSANMVPYDWGEQGGVMENTVFLIDLPKKST----------NDAAD 366
Query: 116 YLSTLKWPEFSANLPA---HGNF--KINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 170
T + E S L A H N K++ FK+ N + + ++ G H G SL +
Sbjct: 367 SPKTAFYEELSYFLKASTLHENIIAKLSAFDFKETNRYA----FVHTIGGSHFGESLTRT 422
Query: 171 GHMKLRTVLQECTFEKGFK-KSPLVYQF--SSLGSLDEKWMAELSSSMSSGFSEDKTPLG 227
GH L + G K + P+ F SS+GSL +++M + S ++ K L
Sbjct: 423 GHCGLGKAVTSL----GLKTREPINIDFVTSSIGSLTDEFMRSIYLS-----AQGKQTLY 473
Query: 228 IGEPLIVWPTVEDVRCSLEGYAAGNA 253
I+ + +V C L G + NA
Sbjct: 474 S----IIRTIILNVSCRLGGDGSTNA 495
>gi|119467668|ref|XP_001257640.1| tyrosyl-DNA phosphodiesterase domain protein [Neosartorya fischeri
NRRL 181]
gi|119405792|gb|EAW15743.1| tyrosyl-DNA phosphodiesterase domain protein [Neosartorya fischeri
NRRL 181]
Length = 676
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 109/460 (23%), Positives = 177/460 (38%), Gaps = 94/460 (20%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA---NWILHKPPLPISFGTHH 57
M DI+WL V K L++ D + + A N L PP+ H
Sbjct: 258 MWDIEWLF--SKVDTKSTRFLLVMQAKDELTKRQYEAETASMSNLRLCFPPMEGQVNCMH 315
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQDFPLK-DQNNLSEECGFEND 112
SK MLL +P +RI+ TANL DW ++ D P K ++ + FE D
Sbjct: 316 SKLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSAFLIDLPRKVATTSVGSKTVFEED 375
Query: 113 LIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKK 169
L+ +L STL+ S +F+FS + + L+ ++ G HTG++ ++
Sbjct: 376 LVYFLRASTLQENIISR--------------LDEFDFSQTSHIMLVHTIGGSHTGNTWRR 421
Query: 170 WGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMAE--LSSSMSSGFSE--- 221
G+ L + G + S P+ F SS+GSL ++++ L+S G ++
Sbjct: 422 TGYCGLGRAVNAL----GLRTSKPINIDFVASSVGSLTDEFLRSIYLASQGDDGITDFTL 477
Query: 222 -------DKTPLGIGEPLIVWPTVEDVRCSLEGY-AAGNAIPSPQKNVDKDFLKKYWAKW 273
+ P + LI T E+ + Y + + + D + +KW
Sbjct: 478 RTSKTFPARNPNDTDQ-LIHKNTAEEWKDRFRVYFPSQTTVEQSRGGPDCAGTICFQSKW 536
Query: 274 -----------KASHTGRSRAMPHIKT-FARYN--------GQKLAWFLLTSANLSKAAW 313
+ + R + H K F R + Q W + SANLS++AW
Sbjct: 537 YEGPKFPRHVLRDCKSRRPGLLMHNKILFIRPDEPIRLPNSSQCRGWAYVGSANLSESAW 596
Query: 314 GALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKT 369
G L ++ + +L R++E GVL+ P + + N SG T
Sbjct: 597 GRLVQDKTTKQPKLNCRNWECGVLV-PILDKDNSLDKVSDN------DSGKRATESADML 649
Query: 370 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
+ T +PVP +P QRY PW
Sbjct: 650 DVFRDT---------------VPVPMTVPGQRYGPGLKPW 674
>gi|358380063|gb|EHK17742.1| hypothetical protein TRIVIDRAFT_82987 [Trichoderma virens Gv29-8]
Length = 528
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 110/466 (23%), Positives = 176/466 (37%), Gaps = 119/466 (25%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +W++ + + +L+ + + M+ N P+N PP+ G HSK L
Sbjct: 118 DEEWMMSKLDI-RRTKILLLAFAKDEAQKNLMRGNVPSNIKFCFPPM-HGPGAMHSKLQL 175
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNL---SEECGFENDLIDY 116
L YP +R+++ T NL+ DW +++ D P GF +L+ +
Sbjct: 176 LKYPDRLRVVIPTGNLVPYDWGETGVMENMVFLIDLPRLGNPATHPPQRPTGFYTELVYF 235
Query: 117 L-STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMK 174
L ST + A+L ++FS ++ + + ++PG H+G++ K+ G+
Sbjct: 236 LQSTGVGDKMVASL-------------SNYDFSKTSDIAFVHTIPGSHSGNAAKRTGYCG 282
Query: 175 LRTVLQECTF-----------EKGFKKSPL---VYQFSSLGSL-----------DEKWMA 209
L + + F S + V S+L SL D
Sbjct: 283 LGASVAALGLASPEPVEVDLVARFFGLSTICGEVANSSTLPSLVGAIYNACRGDDGIEDY 342
Query: 210 ELSSSMSSGFSEDKTPLGIGEP------LIVWPTVEDVRCSLEGYAAGNAI--------- 254
+ SS SS K P I +PT + V S G AG I
Sbjct: 343 KKSSGTSSRSRASKKPAETTSKELKDRFRIYFPTDKTVARSRGGRNAGGTICVQARWWRS 402
Query: 255 PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK-TFARYNG------QKLAWFLLTSAN 307
PS + +D + R R + H K F R G Q W + SAN
Sbjct: 403 PSFPTELVRDVIT------------RDRLLIHSKMIFVRRVGDGQATRQPPGWAYVGSAN 450
Query: 308 LSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 363
LS++AWG L K+ S ++ R++E GV+I VP E+
Sbjct: 451 LSESAWGRLSKDKSTEGIKMSCRNWECGVII----------------PVP--------ES 486
Query: 364 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
+ KT S+D + V PVP ++P Y+S D+PW
Sbjct: 487 KTVDKT-------VASADMAMFAGTV--PVPMQVPGPVYTSNDLPW 523
>gi|302823724|ref|XP_002993511.1| hypothetical protein SELMODRAFT_449151 [Selaginella moellendorffii]
gi|300138642|gb|EFJ05403.1| hypothetical protein SELMODRAFT_449151 [Selaginella moellendorffii]
Length = 920
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 62/134 (46%), Gaps = 23/134 (17%)
Query: 47 PPLPI-----------SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF 95
PP P+ G HH K LL + +R+IV ++NL + W S +W QDF
Sbjct: 312 PPFPVLRRTNDKSRMRGVGCHHPKFFLLKRSKDIRVIVTSSNLNYRQWLQVSNTVWWQDF 371
Query: 96 PLKDQNNLSE-------ECGFEN-DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF 147
PL++ + S E G N D YL+ ++P+ ++ + +NF
Sbjct: 372 PLRNTRDYSSLFSSKITEGGERNGDFAAYLAGF-ISTLVKDVPSEAHWATD---LACYNF 427
Query: 148 SSAAVRLIASVPGY 161
S A V L+ASVPG+
Sbjct: 428 SKATVSLVASVPGF 441
>gi|429855706|gb|ELA30650.1| tyrosyl-dna phosphodiesterase domain-containing protein
[Colletotrichum gloeosporioides Nara gc5]
Length = 620
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 95/386 (24%), Positives = 154/386 (39%), Gaps = 62/386 (16%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +WLL + VLV + +D ++ N PA I P P+ G HSK +
Sbjct: 173 DEEWLLSKIDCR-RTKMVLVAYAANDAEKAVIRSNAPAGLIRFCFP-PMHGGYMHSKLQI 230
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKD--QNNLSEECGFENDLIDYL 117
L Y +R++V + NL+ DW +++ D P + Q E F +L +L
Sbjct: 231 LNY---LRLVVPSGNLVPYDWGETGVLENMVFLIDLPRYETQQTTAGTETLFGKELRRFL 287
Query: 118 STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLR 176
+ L E K+ S ++FS ++ + ++ G H S + G+ L
Sbjct: 288 TALGIGE-----------KLVKS-LDNYDFSETSRYGFVHTISGSHANDSWQHTGYCGLG 335
Query: 177 TVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL----------------------SSS 214
+ + + Y SSLGSL+ ++ + S +
Sbjct: 336 NTARSLGLATDYPVD-VDYVASSLGSLNHGYLTAIYNACQGDSGMKEYEARQSKSTRSKA 394
Query: 215 MSSGFSEDKTPLGIGEPL-----IVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKK 268
SG S +T L I +PT + V S G +A I +K F ++
Sbjct: 395 GRSGPSGSRTITAEAVDLQHHFRIYFPTEKTVSSSRGGRSAAGTICMQEKWWKSSTFPRE 454
Query: 269 YWAKWKASHTGRSRAMPHIKT-FARYNGQKLA-WFLLTSANLSKAAWGALQKN----NSQ 322
+++ TG + H K F R A W + SANLS++AWG L K+ ++
Sbjct: 455 LLRDCESTRTG---LLLHSKAIFVRERACNGAVWAYMGSANLSESAWGRLVKDRESGTAK 511
Query: 323 LMIRSYELGVLILPSAKRHGCGFSCT 348
L R++E GVL+ + GC S T
Sbjct: 512 LSCRNWECGVLV-AVGRTAGCADSGT 536
>gi|320040691|gb|EFW22624.1| hypothetical protein CPSG_00523 [Coccidioides posadasii str.
Silveira]
Length = 651
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 140/332 (42%), Gaps = 62/332 (18%)
Query: 48 PLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNL 103
P+ HSK MLL +P +R++V +ANL+ DW + L++ D P K +
Sbjct: 280 PMGPQVNCMHSKLMLLFHPNHLRLVVPSANLVPYDWGEQGGIIENLLFLIDLPRKILGSQ 339
Query: 104 SEECG-FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGY 161
+ F ++L+ +L E KI +F+F +A + ++ G
Sbjct: 340 EKTSTPFFDELVYFLKASALHE-----------KIIAK-LSEFDFGKTAGFAFVHTIGGS 387
Query: 162 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLV--YQFSSLGSLDEKWM----------- 208
HTGS WG + + + T PL Y SSLGSL++++M
Sbjct: 388 HTGSD---WGKTGVCGLGKAVTMLGLQTPQPLKLDYVTSSLGSLNDQFMRSMYLAAQGDN 444
Query: 209 --AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTVEDVRCSLEGYAAGNAIPSP 257
EL+ S F DK + + + LI +P+++ V+ S + I
Sbjct: 445 GLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSLKTVQGSRARPSGAGTICFQ 504
Query: 258 QKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQKL----------AWFLLTSA 306
K ++ ++ + S + R + H KT F R + K+ W + SA
Sbjct: 505 SKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGKIIGDANTTAYQGWTYVGSA 562
Query: 307 NLSKAAWGALQKNNS----QLMIRSYELGVLI 334
NLS++AWG L + S +L R++E GV+I
Sbjct: 563 NLSESAWGRLVIDRSTTKPKLNCRNWECGVII 594
>gi|392861898|gb|EAS37505.2| tyrosyl-DNA phosphodiesterase [Coccidioides immitis RS]
Length = 672
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 84/330 (25%), Positives = 140/330 (42%), Gaps = 58/330 (17%)
Query: 48 PLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNL 103
P+ HSK MLL +P +R++V +ANL+ DW + L++ D P K +
Sbjct: 301 PMGPQVNCMHSKLMLLFHPNHLRLVVPSANLVPYDWGEQGGIIENLLFLIDLPRKILGSQ 360
Query: 104 SEECG-FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGY 161
+ F ++L+ +L E KI + +F+F +A + ++ G
Sbjct: 361 EKTSTPFFDELVYFLKASALHE-----------KI-IAKLSEFDFGKTAGFAFVHTIGGS 408
Query: 162 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM------------- 208
HTGS K G L + E + L Y SSLGSL++++M
Sbjct: 409 HTGSYWGKTGVCGLGKAVTMLGLETP-QPLKLDYITSSLGSLNDQFMRSMYLAAQGDNGL 467
Query: 209 AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTVEDVRCSLEGYAAGNAIPSPQK 259
EL+ S F DK + + + LI +P+++ V+ S + I K
Sbjct: 468 KELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSLKTVQGSRARPSGAGTICFQSK 527
Query: 260 NVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQKL----------AWFLLTSANL 308
++ ++ + S + R + H KT F R + K+ W + SANL
Sbjct: 528 WYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGKIIGDANTTAYQGWTYVGSANL 585
Query: 309 SKAAWGALQKNNS----QLMIRSYELGVLI 334
S++AWG L + S +L R++E GV+I
Sbjct: 586 SESAWGRLVIDRSTTKPKLNCRNWECGVII 615
>gi|156844717|ref|XP_001645420.1| hypothetical protein Kpol_534p43 [Vanderwaltozyma polyspora DSM
70294]
gi|156116082|gb|EDO17562.1| hypothetical protein Kpol_534p43 [Vanderwaltozyma polyspora DSM
70294]
Length = 568
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 95/421 (22%), Positives = 170/421 (40%), Gaps = 88/421 (20%)
Query: 52 SFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFE 110
+F HHSK ++ Y +I + + N +++ N Q W+ L + + E F+
Sbjct: 184 AFSCHHSKMIINFYEDNSCKIFIPSNNFTYMETNLPQQVCWVSP-RLPEASGTPPENKFK 242
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSS-AAVRLIASVPGYHTGSSLKK 169
+L Y+ + + L S+ ++ +F+S + V + SVP + S K+
Sbjct: 243 KNLFKYIYSYQDKRVRQVL----------SYLREIDFNSLSNVEFVYSVPSKSSVSGFKQ 292
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG-SLDEKW---------------MAELSS 213
+ L+ +E + + Q S++G S+ +K+ + E ++
Sbjct: 293 LAALLLKNSTKEDFSTPTDIQHHYLCQTSTIGGSISKKFPLNLFTGIMIPTFSRLIEFNT 352
Query: 214 SMSSGFSEDKTPLGIGE--------PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF 265
+S S+ +P + E P +V+PTVE++R S G++ + ++ +
Sbjct: 353 EPNSR-SKSASPEDMIEQLNSHNIKPYLVYPTVEEIRNSPSGWSCSGWFNFRYQKNNEQY 411
Query: 266 LK-----KYWAKWKASHTGRSR-AMP-------HIKTFARYNGQK----LAWFLLTSANL 308
L K + K A+ + R A P KT + N L W + TSANL
Sbjct: 412 LSLLNDFKCFYKQNANLISKHRKATPSHSKFYLKSKTSVKSNSNNPFDILDWCVYTSANL 471
Query: 309 SKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQK 368
S +AWG S + R+YE+G+L ST QI+
Sbjct: 472 SVSAWGT-----SSRLARNYEVGILF------------------------QSTPELQIKC 502
Query: 369 TKLVTLTWH-GS--SDAGASSEVVYLPVPYELPPQRY-SSEDVPWSWDKRYTKKDVYGQV 424
V + + GS SD S V + VP+ LP Y +++D + K Y D+ G+
Sbjct: 503 KSFVDVIYRKGSKLSDTAPSCNTVNVMVPFTLPCSPYDTTKDEAFCISKNYDLPDINGEY 562
Query: 425 W 425
+
Sbjct: 563 F 563
>gi|50310989|ref|XP_455517.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49644653|emb|CAG98225.1| KLLA0F09625p [Kluyveromyces lactis]
Length = 497
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 100/420 (23%), Positives = 169/420 (40%), Gaps = 72/420 (17%)
Query: 29 GTLEHMKRNKP----ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDW 83
G L + +P AN +H+ +P +G HHSK + + G +R+ V + NL +
Sbjct: 108 GQLNTINSEQPISHYANLKVHRVDIPSPWGCHHSKIIFSFHQNGTMRMHVPSFNLSREEM 167
Query: 84 NNKSQGLWMQDFPL---KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 140
N Q +W PL K + ++ FE++L++YL++ +S+ +G +
Sbjct: 168 NLVQQTVWTS--PLLYEKSETVPKKKSRFEDELLEYLNS-----YSSYTSLYG-LIASLK 219
Query: 141 FFKKFNFSSAAVRLIASVPGYHTG-----SSLKKWGHMKL------------RTVLQECT 183
+K + + S P Y+ G S L+ G MKL +Q +
Sbjct: 220 RYKWHVLDEQNCQFVYSTP-YNGGLTQLKSCLRASG-MKLHGDEEDDDLSFVNLFIQVSS 277
Query: 184 FEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVR- 242
F+K + Q + L W + E TP + +VWPT +++
Sbjct: 278 MGNPFRKKFDLLQDVMIPYLYTDWFEKDGYDKKLKSKEYTTPF-LAHSTLVWPTKTEIKE 336
Query: 243 CSLEGYAAG----NAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAM--PHIKTFARYNGQ 296
C +G +A ++ V K A+ + ++R M H K + ++ +
Sbjct: 337 CMTQGLSANWFFYKRSEQTERKVVPCLRKHVPLPTNATQSDKNRHMVPSHTKYYIQFTDE 396
Query: 297 ----KLAWFLLTSANLSKAAWG--ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSN 350
+ W LLTS NLS+AAWG L+K +YE G+L + R+ + S
Sbjct: 397 NTLKRPDWILLTSHNLSQAAWGPSPLKKPT------NYECGILYTTTMGRNKVRLTLASA 450
Query: 351 IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 410
P G T S++ + V T V + PY L QRYS+ D P++
Sbjct: 451 QQP----PGRTIGSRVPEDITVLPT-------------VKVVTPYPLKFQRYSATDEPYT 493
>gi|320587853|gb|EFX00328.1| mitochondrial translation optimization protein [Grosmannia clavigera kw1407]
Length = 1223
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 151/374 (40%), Gaps = 53/374 (14%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +W++ V K +L+ + + M+ N P + + P +S G HSK L
Sbjct: 151 DEEWMMQHVDV-RKTKLLLIAYAADENQKVEMRENVPNSNVRFCFPPMLSVGAMHSKLQL 209
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
L Y +RI+V T NL+ DW +++ D P L + G + +L
Sbjct: 210 LKYADYLRIVVPTGNLVPYDWGESGTIENMVFIIDLP-----RLPAQAGRISGKTPFLDD 264
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTV 178
L + L A + ++FS+ A + ++ G H S ++ G+ L
Sbjct: 265 LSY-----FLKAQAVDQSLVQSLDNYDFSATARYAFVHTISGSHAKDSWERTGYCGLGRA 319
Query: 179 LQECTFEKGFKKSPLV--YQFSSLGSLDEKWMAEL--SSSMSSGFSE-----DKTPLGI- 228
++ + + PL Y SS+GSL + + L + +G E +K G+
Sbjct: 320 IKSLGWA---TEEPLQLDYLCSSIGSLGDDLLNALYYACQGDTGMKEYEARANKPKKGVL 376
Query: 229 ---GEP------LIVWPTVEDVRCSLEGYAAGNAIPSPQKN--VDKDFLKKYWAKWKASH 277
EP + +P+ + V S G I ++N F +K ++
Sbjct: 377 ASSSEPDWKSRMRVYFPSHQTVVRSRGGIRGAGTI-CFRRNWWESAKFPRKILRDYQNVK 435
Query: 278 TGRSRAMPHIKTF--ARYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELG 331
G + H K R AW L SANLS++AWG L K+ + +L R++E G
Sbjct: 436 KG---TLAHTKLLFVRREASSAQAWTYLGSANLSESAWGRLVKDRATKEPRLTCRNWECG 492
Query: 332 VLI----LPSAKRH 341
VLI P A+R
Sbjct: 493 VLIPAVPRPEAERR 506
>gi|116192211|ref|XP_001221918.1| hypothetical protein CHGG_05823 [Chaetomium globosum CBS 148.51]
gi|88181736|gb|EAQ89204.1| hypothetical protein CHGG_05823 [Chaetomium globosum CBS 148.51]
Length = 670
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 150/377 (39%), Gaps = 79/377 (20%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D W+L + + +L+ S+ M+ N P N + P G HSK ML
Sbjct: 248 DEHWMLSKIDI-TRTKLMLIAFAASEAQKAEMRANVPKNRVRFCFPPMHGIGAMHSKLML 306
Query: 63 LIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFP---LKDQNNLSEECGFENDLIDY 116
L Y R +RI+V T N + DW +++ D P +Q + F ++L +
Sbjct: 307 LKYERYMRIVVPTGNFMSYDWGETGTMENMVFIIDLPKFETAEQREAQKPDPFSSELFYF 366
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKL 175
L A G + S + ++F+ A+ + + ++PG HT W +
Sbjct: 367 LR------------AQGLDEKLVSSLRNYDFTEASRYKFVHTIPGSHTDED--AWRRTAV 412
Query: 176 RTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL-------------SSSMSSGFS 220
++++ + P+ F +SLG+++ +++ + + + S G
Sbjct: 413 SSLIRAT-------RDPIDIDFVCASLGAINYDFLSAMYYACLGDPLVEYQARTGSKGQR 465
Query: 221 E---DKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA- 275
E D+ + E + + +P+ E V S G I K W W+A
Sbjct: 466 EAFNDRAQFLVKEHMRVFFPSRETVLQSKGGKEGAGTI----------CFKPIW--WQAP 513
Query: 276 ---------SHTGRSRAMPHIKT-FARYNGQKLAW----FLLTSANLSKAAWGALQKNN- 320
+ R + H K + R N + W + SANLS++AWG L ++
Sbjct: 514 TFPQQILRDCKSVRPGVLMHSKVIYIRPNDPGIRWNQCLAYVGSANLSESAWGKLVRDRV 573
Query: 321 ---SQLMIRSYELGVLI 334
++L R++E GVLI
Sbjct: 574 TKKAKLTCRNWECGVLI 590
>gi|224119906|ref|XP_002318192.1| predicted protein [Populus trichocarpa]
gi|222858865|gb|EEE96412.1| predicted protein [Populus trichocarpa]
Length = 1131
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 79/201 (39%), Gaps = 45/201 (22%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHG------ESDGTLEHMKRNKPANWILHKPPLP--ISFG 54
DI W L C + +P + H S + + N ++ PP P I+FG
Sbjct: 467 DILWFLSHCEIPCHLPVTIACHNTERCWSSSPDNRTSVPYSDFPNLVVVFPPFPESIAFG 526
Query: 55 ---------THHSKAMLLIYPRGVRIIVHTANLI------HVDWNNKSQGLWMQDFPLKD 99
HH K ++L +R+I+ +ANL+ H WNN + +W QDFP +
Sbjct: 527 QDRKRRGIACHHPKLLVLQREDSIRVIITSANLVSNQVVAHSKWNNVTNTVWWQDFPARS 586
Query: 100 --------------QNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 145
N F L +++ L N+P+ + S K+
Sbjct: 587 APDPSPLFIRVSDGDANKDSRSDFAAQLAGFMACL-----VINVPSQAYWI---SELTKY 638
Query: 146 NFSSAAVRLIASVPGYHTGSS 166
+F A L+ASVPG H+ S
Sbjct: 639 DFEGANGHLVASVPGIHSRRS 659
>gi|71004940|ref|XP_757136.1| hypothetical protein UM00989.1 [Ustilago maydis 521]
gi|46096766|gb|EAK81999.1| hypothetical protein UM00989.1 [Ustilago maydis 521]
Length = 687
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 99/217 (45%), Gaps = 33/217 (15%)
Query: 55 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNL----------- 103
T H K ++L++ R +R+ + + NL +DW+ ++QDFPL Q ++
Sbjct: 301 TQHIKFLILVHQRFLRVAILSGNLNAIDWDRIENTAYIQDFPLLGQASMINHGSGSSSGS 360
Query: 104 -SEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGY 161
S + F++ L+ L +L P A A +++FS A R++AS P
Sbjct: 361 KSSQNDFKSQLVRVLRSLSMPASHAVYAA----------LDRYDFSLATRARIVASWP-- 408
Query: 162 HTGSSLKKWGHMKLRTV--LQECTFEKGFKKS-PLVYQFSSLGSLDEKWMAELSSSMSSG 218
+SL++W ++ + + L + + G K+S L Q SSL + D KW+ S
Sbjct: 409 -EAASLREWDQIETQGLGRLGKVVRDLGIKESVELECQGSSLANHDVKWIEHFHLLASGV 467
Query: 219 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP 255
PL G+P V P + ++ + GNA+P
Sbjct: 468 EPRGLLPLK-GKPNEVHP---EYASAIGATSKGNALP 500
>gi|115458196|ref|NP_001052698.1| Os04g0403400 [Oryza sativa Japonica Group]
gi|113564269|dbj|BAF14612.1| Os04g0403400 [Oryza sativa Japonica Group]
Length = 1011
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 77/186 (41%), Gaps = 33/186 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPLP--ISFG 54
D+ W L C V +P + H + + A N +L P P I+FG
Sbjct: 328 DVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQFPEEIAFG 387
Query: 55 ---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS- 104
HH K ++L +R+IV +ANL+ W+ + +W QDFP + + S
Sbjct: 388 KDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCRTSTDYSA 447
Query: 105 -------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 157
+ F L+ +++ F N ++ IN K+NF AA LIAS
Sbjct: 448 LFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGAAGYLIAS 499
Query: 158 VPGYHT 163
VPG +
Sbjct: 500 VPGIYA 505
>gi|222628800|gb|EEE60932.1| hypothetical protein OsJ_14671 [Oryza sativa Japonica Group]
Length = 1021
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 77/186 (41%), Gaps = 33/186 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPLP--ISFG 54
D+ W L C V +P + H + + A N +L P P I+FG
Sbjct: 328 DVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQFPEEIAFG 387
Query: 55 ---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS- 104
HH K ++L +R+IV +ANL+ W+ + +W QDFP + + S
Sbjct: 388 KDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCRTSTDYSA 447
Query: 105 -------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 157
+ F L+ +++ F N ++ IN K+NF AA LIAS
Sbjct: 448 LFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGAAGYLIAS 499
Query: 158 VPGYHT 163
VPG +
Sbjct: 500 VPGIYA 505
>gi|38346146|emb|CAD40679.2| OSJNBb0118P14.6 [Oryza sativa Japonica Group]
Length = 989
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 77/186 (41%), Gaps = 33/186 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPLP--ISFG 54
D+ W L C V +P + H + + A N +L P P I+FG
Sbjct: 328 DVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQFPEEIAFG 387
Query: 55 ---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS- 104
HH K ++L +R+IV +ANL+ W+ + +W QDFP + + S
Sbjct: 388 KDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCRTSTDYSA 447
Query: 105 -------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 157
+ F L+ +++ F N ++ IN K+NF AA LIAS
Sbjct: 448 LFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGAAGYLIAS 499
Query: 158 VPGYHT 163
VPG +
Sbjct: 500 VPGIYA 505
>gi|218194792|gb|EEC77219.1| hypothetical protein OsI_15757 [Oryza sativa Indica Group]
Length = 974
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 77/186 (41%), Gaps = 33/186 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPLP--ISFG 54
D+ W L C V +P + H + + A N +L P P I+FG
Sbjct: 329 DVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQFPEEIAFG 388
Query: 55 ---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS- 104
HH K ++L +R+IV +ANL+ W+ + +W QDFP + + S
Sbjct: 389 KDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCRTSTDYSA 448
Query: 105 -------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 157
+ F L+ +++ F N ++ IN K+NF AA LIAS
Sbjct: 449 LFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGAAGYLIAS 500
Query: 158 VPGYHT 163
VPG +
Sbjct: 501 VPGIYA 506
>gi|118785322|ref|XP_001237572.1| AGAP010578-PA [Anopheles gambiae str. PEST]
gi|116128029|gb|EAU76763.1| AGAP010578-PA [Anopheles gambiae str. PEST]
Length = 239
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 48/93 (51%), Gaps = 2/93 (2%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
M+DI+WLL H L+I + LE + +P N K FG HH+K
Sbjct: 94 MIDINWLLEQYSDAGYEQHPLLILYGDESELETISDKQP-NVTAIKIKTKTGFGLHHTKM 152
Query: 61 MLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWM 92
L Y G +R++V TANL DW N++QGLW+
Sbjct: 153 GLYGYCDGSMRVVVSTANLYENDWYNRTQGLWI 185
>gi|326472360|gb|EGD96369.1| hypothetical protein TESG_03817 [Trichophyton tonsurans CBS 112818]
Length = 676
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 87/383 (22%), Positives = 148/383 (38%), Gaps = 67/383 (17%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKA 60
D+DWLL + + ++ + + E + R + L PP+ HSK
Sbjct: 240 DMDWLLMKF-TNPRTRFLFIMGAKGEERREQLLRETASMSRIRLCFPPMDGEVNCMHSKL 298
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNLSEECGFENDLIDY 116
MLL + +RI++ +ANL DW + L++ D P K + + F ++L+ +
Sbjct: 299 MLLFHANHLRIVIPSANLDPYDWGEEGGVMENMLFLIDLPRKANETVDDTTPFRDELVYF 358
Query: 117 L--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGS-SLKKWGH 172
L STL N KI +++FS +A + S+ G H GS S ++ GH
Sbjct: 359 LRASTL-------------NEKIIDKML-QYDFSQTAKYAFVHSIGGSHIGSGSYERTGH 404
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL--SSSMSSGFSEDKTPLG--- 227
L T ++ + L Y SS+GSL ++ L S+ +G + G
Sbjct: 405 CGLGTAVKSLGLATS-RPLKLDYIASSVGSLTATFLQNLYWSAQGDNGTKQLSARAGNPR 463
Query: 228 -----------------------IGEPLIVWPTVEDVRCSLEGYAAGNAI---------P 255
G + +P+ E V S G +A + P
Sbjct: 464 SSSKSSSNNNNNKKSGGRVDDDWTGRMKVYFPSRETVCSSRGGVSAAGTLCLMSKWYNSP 523
Query: 256 SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGA 315
++V +D S R + + W + SANLS++AWG
Sbjct: 524 MFPRDVMRDNRSVREGLLMHSKVLYVRPEGEARKGESRSADCAEWAYVGSANLSESAWGR 583
Query: 316 L----QKNNSQLMIRSYELGVLI 334
L + ++L R++E GV++
Sbjct: 584 LVIDRKTKQAKLNCRNWESGVVV 606
>gi|159464062|ref|XP_001690261.1| predicted protein [Chlamydomonas reinhardtii]
gi|158284249|gb|EDP09999.1| predicted protein [Chlamydomonas reinhardtii]
Length = 424
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/31 (70%), Positives = 28/31 (90%)
Query: 68 GVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 98
G+R+++HTAN I+ D NNKSQGLW+QDFPLK
Sbjct: 174 GLRLVIHTANAIYADCNNKSQGLWVQDFPLK 204
>gi|342320632|gb|EGU12571.1| Endoplasmic reticulum Ca-transporting P-type ATPase [Rhodotorula glutinis ATCC 204091]
Length = 1978
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 90/393 (22%), Positives = 149/393 (37%), Gaps = 84/393 (21%)
Query: 54 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN-LSEECG-FEN 111
G H+K ++ + RI++ TAN + DW+ ++ DFP + + ++EE F+N
Sbjct: 1630 GIMHTKLLIFYHEDFCRIVIPTANAVSYDWSQIDNAFYVHDFPRRRSASPVNEESNPFKN 1689
Query: 112 DLIDYLSTLKWPE-FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 170
S + + +P H + S F+ SS V+L+ S G + K
Sbjct: 1690 PTHTQFSKKSFQVCYYLGIPKH---ILQESLHYDFS-SSTDVQLVHSNQGKFPAADYDKG 1745
Query: 171 GHMKLRTVLQECTFEKGFKKS---PLVYQFSSLGSLDEKWMAELSSSMS---------SG 218
G + L + GF + SS+G W+ ++ ++ S SG
Sbjct: 1746 GGI---AGLAKAVSAFGFASGGHWEIEVTGSSIGQYSSTWLTQMLAACSGIHPSTYFRSG 1802
Query: 219 FSED------KTPLGIGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 269
D KTP G L I++PT +++ S G G I P K + K+
Sbjct: 1803 KGNDVPSQLPKTPSGQPTRLPIKIIFPTQDEILSSPGGAGHGGTIFCPSKTWNSLTFPKH 1862
Query: 270 WAKWKASHTGRSRAMPHIKT------FARYNGQKL--AWFLLTSANLSKAAWGALQ--KN 319
+ + R H K FA+ + + L S N + +AWG LQ K+
Sbjct: 1863 L--FHRGESKRKNIPAHTKIILGLHRFAKAPTPPVHEGFIYLGSHNFTPSAWGRLQNGKD 1920
Query: 320 NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 379
QL +YELGV++ +++ S E + + T+LVT
Sbjct: 1921 GPQLFCNNYELGVVL--------------------TLRASSAEELEAKATELVT------ 1954
Query: 380 SDAGASSEVVYLPVPYELPPQRYSSEDVPWSWD 412
Y+ P +Y DVPW +
Sbjct: 1955 ---------------YKRPLVKYGPNDVPWQQE 1972
>gi|302787823|ref|XP_002975681.1| hypothetical protein SELMODRAFT_415650 [Selaginella moellendorffii]
gi|300156682|gb|EFJ23310.1| hypothetical protein SELMODRAFT_415650 [Selaginella moellendorffii]
Length = 920
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 63/137 (45%), Gaps = 31/137 (22%)
Query: 47 PPLPI-----------SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF 95
PP P+ G HH K LL + +R+IV ++NL + W S +W QDF
Sbjct: 312 PPFPVLRRTNDKSRMRGVGCHHPKFFLLKRSKDIRVIVTSSNLNYRQWLQVSNTVWWQDF 371
Query: 96 PLKDQNNLS-----------EECG-FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFK 143
PL++ + S E G F L ++STL ++P+ ++ +
Sbjct: 372 PLRNTRDYSSLFSSKITDGGERNGDFAAYLAGFISTL-----VKDVPSEAHWATD---LA 423
Query: 144 KFNFSSAAVRLIASVPG 160
+NFS A V L+ASVPG
Sbjct: 424 CYNFSKATVSLVASVPG 440
>gi|169625658|ref|XP_001806232.1| hypothetical protein SNOG_16105 [Phaeosphaeria nodorum SN15]
gi|160705700|gb|EAT76477.2| hypothetical protein SNOG_16105 [Phaeosphaeria nodorum SN15]
Length = 895
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 85/401 (21%), Positives = 155/401 (38%), Gaps = 54/401 (13%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGT----LEHMKRNKPANWILHKPPLPISFGTH 56
M D +WL L K+ + +++ +S + M+ N +H PP+ +
Sbjct: 488 MWDSEWLNKKLSPL-KVKQIWIMNAKSQDVQQRWVREMEDAGIPNLRIHFPPMGGLIHSM 546
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNK---------SQGLWMQDFPLKDQNNLSEEC 107
HSK MLL +R++V TAN+ +DW +K L++ D P + + ++
Sbjct: 547 HSKFMLLFGRDKLRLVVPTANMTPMDWGDKVNNWQPGVMENSLFLVDLPRRSDGVMGKKQ 606
Query: 108 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVR---LIASVPGYHTG 164
+ + L+ E + G K + + F A + + + G H G
Sbjct: 607 DLTTFGKELVCFLEKQELDKKV-IEGVLKFDFTQTDHLAFVHAILEEQSITCTSGGVHKG 665
Query: 165 SSLK-KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDK 223
+ G L +++ + K+ L Y +SLG++++ ++ + +
Sbjct: 666 EQQQLSTGLPGLAKAIRDVHLDD-VKEIELDYASASLGAINDNFLQRIYLAAQ------- 717
Query: 224 TPLGIGEPLIVWPTVEDVRCSLEGY-----AAGNAIPSPQKNVDKDFLKKYWAK------ 272
G+PL V VR Y A N+I P Y+
Sbjct: 718 -----GKPLTTTSAVSQVRRHFRIYFPTDDAVQNSIGGPDCGGIISLSSHYYNAATFPRE 772
Query: 273 -WKASHTGRSRAMPHIKTF----ARYNGQKLAWFLLTSANLSKAAWGALQ----KNNSQL 323
+ + R + H K + +G+ AW + SAN+S++AWGA + L
Sbjct: 773 CLRNYDSTRRGMLSHNKLLFVRGIKNDGRPFAWVYVGSANMSESAWGAQKVLKSGQTGSL 832
Query: 324 MIRSYELGVLI-LPSAKRHGCGFSCTSNIVPSEIKSGSTET 363
IR++E GVL+ +P+ K + I P + G+ E
Sbjct: 833 NIRNWECGVLMPVPNEKMADMKLN-DGAIPPMSVFRGTVEV 872
>gi|46111419|ref|XP_382767.1| hypothetical protein FG02591.1 [Gibberella zeae PH-1]
Length = 676
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/397 (22%), Positives = 149/397 (37%), Gaps = 81/397 (20%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +WL+ L K +L+ +S+ M+ N P P + G HSK L
Sbjct: 164 DDEWLVSKFD-LRKTKLLLLAFADSEAQKSEMRSNAPPGIKFVFPAMN-GPGAMHSKLQL 221
Query: 63 LIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
L YP +R++V +ANL+ DW +++ D P D + F +L +LS
Sbjct: 222 LKYPDYLRVVVPSANLVPYDWGETGVMENMVFLIDLPRLDGSATHRPTPFSTELGRFLSA 281
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
E N + +F S K F + ++PG H G LK+ G+ L +
Sbjct: 282 TGVGETMVNSLTNYDF----SQTKHLGF-------VYTIPGGHQGDELKRIGYSGLGASV 330
Query: 180 QECTFEKGFKKSPLVYQF--SSLGSLDEKWMAELSSSM--SSGFSEDKTPLGIGEPL--- 232
P+ F +SLGSL+ + + ++ G +E K+ G
Sbjct: 331 ASLGLA---TDDPVEVDFVCASLGSLNYDLVGAIYNACRGDDGLAEFKSRTGRAGAAGKN 387
Query: 233 ---------------IVWPTVEDVRCSLEGYAAGNAI---------PSPQKNVDKDFLKK 268
I +PT E V S G A I P+ + +D +
Sbjct: 388 KASNPWQGKLKDRFRIYFPTNETVTRSRGGRNAAGTICVQPKWWRSPTFPTELVRDCVNT 447
Query: 269 -----------YWAKWKASHTGRS--RAMPHIKTFARYNGQ--------------KLAWF 301
++ +A +S + P + R + Q L W
Sbjct: 448 RHGLLMHSKMILVSQTEAGSQNQSQLQTRPQTRREPRGHDQGSASTQRDPKTANKSLGWV 507
Query: 302 LLTSANLSKAAWGALQKNNS----QLMIRSYELGVLI 334
+ SANLS++AWG + K+ + ++ R++E GV++
Sbjct: 508 YVGSANLSESAWGRIVKDRATGQPKMSCRNWESGVVV 544
>gi|388851550|emb|CCF54740.1| uncharacterized protein [Ustilago hordei]
Length = 665
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 78/166 (46%), Gaps = 21/166 (12%)
Query: 55 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC----GFE 110
T H K ++L++ +R+ + + NL VDW+ G+++QDFPLK S G E
Sbjct: 285 TQHMKFLVLVHEGWLRVAIASGNLNEVDWSRIENGVFIQDFPLKGGEGSSARAEGRGGVE 344
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS--SAAVRLIASVPGYHTGSSLK 168
ND + L TL S P+H + + +F+FS A R++AS P SSL+
Sbjct: 345 NDFKEQL-TLVLKSLSVP-PSHPVW----TALDRFDFSLGGARARIVASWP---EASSLQ 395
Query: 169 KW------GHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 208
W G +L V+++ + Q SSL + D KW+
Sbjct: 396 GWDRIETQGLGRLGKVVRDLDIPAVKGGMEVECQGSSLANHDLKWI 441
>gi|50292179|ref|XP_448522.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527834|emb|CAG61483.1| unnamed protein product [Candida glabrata]
Length = 553
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 140/335 (41%), Gaps = 65/335 (19%)
Query: 44 LHKPPLPISFGTHHSKAMLLIYP--RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 101
++ PP + HHSK ++ IY RGVR+ + + N + N Q LW F + +
Sbjct: 182 IYMPP----YSCHHSKMIIGIYRNGRGVRVFLPSNNFTWAETNWPQQVLWSSPF-MSISD 236
Query: 102 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSS-AAVRLIASVPG 160
E GF+ L DYLS K E ++ + + +FS A V I S P
Sbjct: 237 KAVEMNGFQRSLCDYLSFYKLKELNS---------LVKDTIMRTDFSGLADVEFIYSCPK 287
Query: 161 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPL---VYQFSSLG-------SLDEKWMAE 210
G +++ +M L+++ + T + + L + Q S++G
Sbjct: 288 -TKGKNIETGLNMFLKSIEKVETELRDVDQISLNLFLCQSSTIGGPIGRRKDNPSNLFTH 346
Query: 211 LSSSMSSGFSE----DKTPL------GIGEPLIVWPTVEDVRCSLEGY-AAG----NAIP 255
+ + GFSE D+ L P I++P ++++R + G +AG N
Sbjct: 347 VIVPTARGFSEAAKSDQQALLKAYHENKTYPCIIYPCMKEIRDASVGINSAGWFNFNYTR 406
Query: 256 SPQKNVDKDFLK---KYWAKWKASHTGRSRAMP--HIKTFARYN--GQKLA--------- 299
+ + D+L+ K + K+ +T + R H K + R+ Q +A
Sbjct: 407 NDTQLQQYDWLRNKIKVFYKYNRDYTTKQRLTTPSHTKFYLRFRMPSQSMAQGMRVPEHI 466
Query: 300 -WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 333
W L TSANLS AWG L R+YE+GV+
Sbjct: 467 DWCLFTSANLSSNAWGTLGSQP-----RNYEVGVM 496
>gi|345560675|gb|EGX43800.1| hypothetical protein AOL_s00215g536 [Arthrobotrys oligospora ATCC 24927]
Length = 634
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 146/368 (39%), Gaps = 60/368 (16%)
Query: 20 VLVIHGESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTAN 77
VLV+H + D ++H +RN L P + + HSK LL + +R++V TAN
Sbjct: 239 VLVLHAKEDEVVDHYRRNLCNIPRTRLCFPDMSGNVNIMHSKLQLLFHLTHLRVVVPTAN 298
Query: 78 LIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEND--LIDYLSTLKWPEFSANLPAHGNF 135
L DW + S E EN +ID+ K + P+H F
Sbjct: 299 LTSYDWGEAT-------------GTGSNEGVMENSVFIIDFPELPKTSTEGSTNPSHTPF 345
Query: 136 KINPSFFKK---------------FNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 179
N F K ++F+ S + + S+ G H G + G L +
Sbjct: 346 SRNLLHFCKAKGMPSDIIKKVDQVYDFTRSQRLGFVYSIGGSHHGDEALRNGVCGLACAV 405
Query: 180 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLI----VW 235
++ K K+ Y SSLGSL+++++ + ++ G K+ I + I
Sbjct: 406 RDLGL-KTRKRVEADYITSSLGSLNKEFLLRIYRAL-HGDEGKKSVQNIPKTFIGRQVKA 463
Query: 236 PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW---AKWKAS-----HTGRSRAMPHI 287
P E E + + + + N ++ W +K+ S + R + H
Sbjct: 464 PEDESTDSETEEDESDDKV--WRDNGGTICFQRQWFNGSKFPQSLLHDCQSVRRGMLMHN 521
Query: 288 KT----FARYNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLI---LP 336
K R G + W + S NLS++AWG L + + ++ R++E GV++ LP
Sbjct: 522 KIIFVRLPRPRGNSIGWAYVGSHNLSESAWGKLVWDRSEKDFKMSNRNWECGVIVPVALP 581
Query: 337 SAKRHGCG 344
+ H G
Sbjct: 582 DGQEHTRG 589
>gi|342319803|gb|EGU11749.1| Proteophosphoglycan 5 [Rhodotorula glutinis ATCC 204091]
Length = 564
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 71/319 (22%), Positives = 129/319 (40%), Gaps = 41/319 (12%)
Query: 46 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-LKDQNNLS 104
+P P + G HSK LL YP + +++ + N + +D + ++ P +
Sbjct: 211 RPLYPWASGCAHSKFFLLFYPGFLLLVITSCNTMRIDMDLSDNHWYIHALPEIPPGKKRK 270
Query: 105 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA--VRLIASVPGYH 162
+ FE+DL+ ++ L WPE ++ K++F SA V L+ASVPG
Sbjct: 271 AKTTFEHDLLAHMLDLDWPE-----------ELVSRVRGKYDFRSAEGRVHLVASVPGTK 319
Query: 163 TGSSLK-KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 221
+ + +G ++L + ++ + + S+ SL +W+ + +
Sbjct: 320 RATDDEGSYGMLRLNALARQIIPPSVRPDIDMEFCAGSVNSLPPEWIDQTDKLLRGRDLS 379
Query: 222 DKTPL---GIGEP----------LIVWPTVEDV-RCSLEGYAAGNAIPSPQKNVD----K 263
P+ G+ EP IV+PT V CS + A + I N
Sbjct: 380 RAVPVTKPGVPEPPVSLNNLPEWSIVFPTKATVAACSPQVIEAASNIGCCLNNAKWPETS 439
Query: 264 DFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQK-- 318
+ ++ + + + GR M + N A L S NLSKAA G + +
Sbjct: 440 NEVRSMFFDYGSKDPGRLFHMKFYQWKDSRNKDPSAPPLMVYLGSHNLSKAALGEVSRLK 499
Query: 319 ---NNSQLMIRSYELGVLI 334
+ ++ ++ELGV+I
Sbjct: 500 SGAGDVRIKCNNFELGVVI 518
>gi|242072904|ref|XP_002446388.1| hypothetical protein SORBIDRAFT_06g015125 [Sorghum bicolor]
gi|241937571|gb|EES10716.1| hypothetical protein SORBIDRAFT_06g015125 [Sorghum bicolor]
Length = 972
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 79/189 (41%), Gaps = 35/189 (18%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKP---------PLPISF 53
DI W L C + +P + H + D N+ A P P I+F
Sbjct: 303 DISWFLNYCKIPQHLPVTIACHNK-DRCWSASSENRTAAPFESHPKLLLVFPRFPEEIAF 361
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
G HH K ++L +R+IV +ANL+ W+ + +W QDFP + + +
Sbjct: 362 GQDRKKQGVACHHPKLIVLQREDSMRVIVTSANLVPRQWHLITNTVWWQDFPRRTSLDYA 421
Query: 105 --------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 156
++ F L+ +++++ +P+ + IN K++F A LIA
Sbjct: 422 ALFSAAEKQKSDFAAQLVSFIASM-----VNEVPSQA-YLINE--IAKYDFEGAGGYLIA 473
Query: 157 SVPGYHTGS 165
SVPG H S
Sbjct: 474 SVPGIHAQS 482
>gi|242823839|ref|XP_002488140.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218713061|gb|EED12486.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 673
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/220 (22%), Positives = 91/220 (41%), Gaps = 26/220 (11%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMKRNKPA-NWILHKPPLPISFGTHHSKA 60
D +WL K ++V+ + + T L++ + N L PP+ HSK
Sbjct: 255 DTEWLFSKFRTPGKTRFLMVMQAKEESTRLQYQQETADMPNIRLCFPPMEGQIKCMHSKL 314
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNLSE--ECGFENDLI 114
MLL +P +RI+V +ANL+ DW + +++ D P + ++ + + F +L
Sbjct: 315 MLLFHPDYLRIVVPSANLVPYDWGEQGGVMENTVFLIDLPKRSAQDVPDTPKKAFYEELA 374
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF-SSAAVRLIASVPGYHTGSSLKKWGHM 173
+L H N F+F ++ R + ++ G H G ++ GH
Sbjct: 375 FFLQAST---------VHNNIIAK---LSSFDFKETSRYRFVHTIGGSHIGECRRRTGHC 422
Query: 174 KLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL 211
L + P+ F SS+GSL +++M +
Sbjct: 423 GLGQAVSSLGLR---THEPISIDFVTSSIGSLTDEFMRSI 459
>gi|238494160|ref|XP_002378316.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus NRRL3357]
gi|220694966|gb|EED51309.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus NRRL3357]
Length = 679
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 96/222 (43%), Gaps = 28/222 (12%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHS 58
M +++WL AK LV+ + + T K A N L PP+ HS
Sbjct: 260 MWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAAMSNLRLCFPPMDGQVNCMHS 318
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQDFPLK-DQNNLSEECGFENDL 113
K MLL + VRI+V TANL DW +++ D P + D+++ GF ++L
Sbjct: 319 KLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIIDLPKRTDKDSGFTRTGFYDEL 378
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGH 172
+ LK N+ A ++FS A + + ++ G H G S ++ G+
Sbjct: 379 TYF---LKASTLHENIIAK---------LTDYDFSRTAHIAFVHTIGGSHMGDSWRRTGY 426
Query: 173 MKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMAEL 211
L + G + S PL F SS+GSL ++++ +
Sbjct: 427 CGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLRSI 464
>gi|410081624|ref|XP_003958391.1| hypothetical protein KAFR_0G02220 [Kazachstania africana CBS 2517]
gi|372464979|emb|CCF59256.1| hypothetical protein KAFR_0G02220 [Kazachstania africana CBS 2517]
Length = 527
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 91/410 (22%), Positives = 167/410 (40%), Gaps = 78/410 (19%)
Query: 44 LHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 102
++ PP + +HHSK +L Y + V+I + + N H + N Q W P Q
Sbjct: 170 IYMPP----YTSHHSKMILNFYRDKSVKIFIPSNNFTHHETNLPQQICWCS--PSLYQGK 223
Query: 103 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF---------SSAAVR 153
+ F+ +L+ YL + + + + + ++N K +F +S+ ++
Sbjct: 224 -TGSVLFQENLLSYLKSYEDKTLNTTI-YYELLQLNFESLKDVDFVYSCPSKENASSGLK 281
Query: 154 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG--SLDEKWMAEL 211
L+ + H K GH + Q T KS F+ L +L +
Sbjct: 282 LLVELLSKHDND---KSGHY----LCQTSTIGGPLNKSQNSNIFTHLMIPALSNMFGMSN 334
Query: 212 SSSMSSGFSEDKTPLGIG---EPLIVWPTVEDVR-CSLEGYAAG------NAIPSPQKNV 261
SS ++ +E +P I++PTV++++ C + +G + IP + +
Sbjct: 335 SSRLTIPTTEQVLQFNKNNNIKPYILYPTVKELQNCPMGWLPSGWFHFNYDRIPMYYETL 394
Query: 262 DKDFLKKYWAKWKASHTGRSRAMP-HIKTFARYNGQ---KLAWFLLTSANLSKAAWGALQ 317
+ F ++ + S + + RA P H K + + + + +L W L TSANLS +AWG +
Sbjct: 395 KEKF-DIFYKQDAESISIQRRATPSHSKFYMKSSTETFTELDWCLYTSANLSMSAWGKIT 453
Query: 318 KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH 377
R+YE+GVL + C T + L +
Sbjct: 454 TKP-----RNYEVGVLFTGKDRLIRC-------------------------TSFIDLIYK 483
Query: 378 GSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 427
+ S+VV VP+ L Q+Y ++D + K Y D+ G+++ R
Sbjct: 484 RT---DGQSDVV---VPFTLKLQKYEADDEAFCMSKDYGLLDINGRLYER 527
>gi|159122429|gb|EDP47550.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 665
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/224 (25%), Positives = 100/224 (44%), Gaps = 32/224 (14%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHS 58
M DI+WL + +LV+ + D T + + N L PP+ HS
Sbjct: 247 MWDIEWLFSKVDTKS-TRFLLVMQAKDDLTKRQYEAETASMSNLRLCFPPMEGQVNCMHS 305
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQDFPLK-DQNNLSEECGFENDL 113
K MLL +P +RI+ TANL DW ++ D P K ++ + FE +L
Sbjct: 306 KLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSAFLIDLPRKVATTSVGSKTVFEEEL 365
Query: 114 IDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKW 170
+ +L STL+ S +F+FS ++ + L+ ++ G HTG++ ++
Sbjct: 366 VYFLRASTLQENIISR--------------LDEFDFSPTSHIMLVHTIGGSHTGNTWRRT 411
Query: 171 GHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMAEL 211
G+ L + G + S P+ F SS+GSL ++++ +
Sbjct: 412 GYCGLGRAVNAL----GLRTSKPINIDFVASSVGSLTDEFLRSI 451
>gi|344232732|gb|EGV64605.1| phospholipase D/nuclease [Candida tenuis ATCC 10573]
Length = 171
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 66/160 (41%), Gaps = 43/160 (26%)
Query: 266 LKKYWAKWKASH--TGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWGALQ--- 317
+K Y KW H TGR R H+K + NG + L W + S NLSK AWG
Sbjct: 32 IKPYLCKWNNGHEYTGRERNPAHVKLYMCDNGDDFKSLKWLYMGSHNLSKQAWGGGSGFG 91
Query: 318 --KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLT 375
+N ++ + SYELG+LI P + TL
Sbjct: 92 SWQNINEYQVSSYELGILITPENDKD-------------------------------TLK 120
Query: 376 WHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRY 415
SD SSE + +P LPP RYS D+PWS + Y
Sbjct: 121 PVFCSDF--SSEKYPVRMPLYLPPTRYSPTDMPWSKNISY 158
>gi|391872408|gb|EIT81535.1| hypothetical protein Ao3042_01981 [Aspergillus oryzae 3.042]
Length = 679
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 96/222 (43%), Gaps = 28/222 (12%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHS 58
M +++WL AK LV+ + + T K A N L PP+ HS
Sbjct: 260 MWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAAMSNLRLCFPPMDGQVNCMHS 318
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQDFPLK-DQNNLSEECGFENDL 113
K MLL + VRI+V TANL DW +++ D P + D+++ GF ++L
Sbjct: 319 KLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIIDLPKRTDKDSGFTRTGFYHEL 378
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGH 172
+ LK N+ A ++FS A + + ++ G H G S ++ G+
Sbjct: 379 TYF---LKASTLHENIIAK---------LTDYDFSRTAHIAFVHTIGGSHMGDSWRRTGY 426
Query: 173 MKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMAEL 211
L + G + S PL F SS+GSL ++++ +
Sbjct: 427 CGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLRSI 464
>gi|326484528|gb|EGE08538.1| tyrosyl-DNA phosphodiesterase domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 462
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 95/219 (43%), Gaps = 26/219 (11%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKA 60
D+DWLL + + ++ + + E + R + L PP+ HSK
Sbjct: 255 DMDWLLMKF-TNPRTRFLFIMGAKGEERREQLLRETASMSRIRLCFPPMDGEVNCMHSKL 313
Query: 61 MLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNLSEECGFENDLIDY 116
MLL + +RI++ +ANL DW + L++ D P K + + F ++L+ +
Sbjct: 314 MLLFHANHLRIVIPSANLDPYDWGEEGGVMENMLFLIDLPRKANETVDDTTPFRDELVYF 373
Query: 117 L--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGS-SLKKWGH 172
L STL N KI +++FS +A + S+ G H GS S ++ GH
Sbjct: 374 LRASTL-------------NEKIIDKML-QYDFSQTAKYAFVHSIGGSHIGSGSYERTGH 419
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 211
L T ++ + L Y SS+GSL ++ L
Sbjct: 420 CGLGTAVKSLGLATS-RPLKLDYIASSVGSLTATFLQNL 457
>gi|219116995|ref|XP_002179292.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409183|gb|EEC49115.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 708
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 101/438 (23%), Positives = 162/438 (36%), Gaps = 124/438 (28%)
Query: 54 GTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL---KDQNNLSEECGF 109
G HH K M+L+ G V ++V T+NL + S W+Q FP + L EE
Sbjct: 263 GVHHPKFMILLERSGDVVVVVSTSNLTEPRATDAS---WLQRFPAARSSRERKLKEE--- 316
Query: 110 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFF--------------KKFNFSSAAVRLI 155
E+D L+ + + + H + P F K F+FS A V L+
Sbjct: 317 EDDFGIVLTNMLEAQTLSCRKGH----VTPMGFCRQELGWNSLRDLTKHFDFSKAQVHLV 372
Query: 156 ASVPGYH---TGSSLKKWGHMKLRTVLQECTFEKGFKKSP--------LVYQFSSLGSLD 204
A++PG T S + +G ++ V++ + + P L+ Q +SLGS
Sbjct: 373 ATIPGDRLSKTASPSELFGRQRVSAVMKRLSQGPTPRLPPILRSEDDRLIVQPTSLGS-- 430
Query: 205 EKW----MAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPS 256
+W M E+ S D + + + I+WPT ++ G+ AG P+
Sbjct: 431 -EWTRANMTEVVRSYLGHEDRDVSKVRDAQVFPRLDILWPTERFMKAYRTGF-AGRGSPA 488
Query: 257 PQKNVDKDFLKKYWAKWKASH-----------------------------TGRSRAMPHI 287
+ F K +K + RS PHI
Sbjct: 489 SVVCIGDAFDTKELVLFKENEGYLFLSSDTFSKIDLSCLSRMAQYEVSVPLQRSCLPPHI 548
Query: 288 KTFAR-YNGQK---------------LAWFLLTSANLSKAAWG-ALQKNNSQLMIRSY-- 328
K+ R + G ++FLLTSA LS+ A G L + S+ + SY
Sbjct: 549 KSICRLFQGNDYRLRQDYGLPKSEEIFSYFLLTSACLSRGAQGETLTQLGSRETVVSYAN 608
Query: 329 -ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 387
ELGVL +++ G P++ + + +
Sbjct: 609 FELGVLF--TSRLQGRASDRVYGWKPAQCMCRNRPRTSL--------------------- 645
Query: 388 VVYLPVPYELPPQRYSSE 405
++LPVP+ L P RY S+
Sbjct: 646 -IHLPVPFSLRPARYQSD 662
>gi|389739055|gb|EIM80250.1| phospholipase D/nuclease [Stereum hirsutum FP-91666 SS1]
Length = 698
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 80/352 (22%), Positives = 132/352 (37%), Gaps = 65/352 (18%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 99
NWI P L +G H M + Y G +RI + TANL+ DW + +W+QD P +
Sbjct: 280 NWIRVTPFLRSGYGVMH---MKIFYKSGRLRIAISTANLVDFDWKDIENTVWIQDVPQRS 336
Query: 100 Q--NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP-------SFFKKFNFSSA 150
+ + + F L L +L H + P S ++FS
Sbjct: 337 KPIPHDPKADDFPTAFERVLKALNVEPALTSL-VHNDHPTIPLSSLHPGSLRTAYDFSRV 395
Query: 151 AVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSP-------LVYQFSSLGS 202
L+ S+ G H + + G L ++E E G + YQ SS+G+
Sbjct: 396 KAHLVPSLAGKHEHWPQVLRVGETALMKAVREIGCEVGSGSGGGKRGKLRVEYQGSSIGT 455
Query: 203 LDEKWMAELSSSMSSGFSE---DKTPLGIGE------PLIVWPTVEDVRCSLEGYAAGNA 253
+W+ E S E DKT + I++PT E V+ S+ G A G
Sbjct: 456 YSTQWINEFYICASGTSPEKYLDKTKASKSKLPYPDSMTILFPTREWVKGSVLGEAGGGT 515
Query: 254 IPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKT----------------------- 289
+ + D F ++ + + S + R + + H K
Sbjct: 516 MFCRKDQWDAPKFPRELFGQ---SKSKRGKVLMHSKVHESSVTESESESEPEPPQDAEES 572
Query: 290 -----FARYNGQKLAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 334
+ + W + S N + +AWG L + + L I +YELG+++
Sbjct: 573 DSDLEIVEKKAKAVGWAYVGSHNFTPSAWGTLSGSGFHPVLNITNYELGIVL 624
>gi|327295831|ref|XP_003232610.1| hypothetical protein TERG_06602 [Trichophyton rubrum CBS 118892]
gi|326464921|gb|EGD90374.1| hypothetical protein TERG_06602 [Trichophyton rubrum CBS 118892]
Length = 677
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 87/407 (21%), Positives = 148/407 (36%), Gaps = 69/407 (16%)
Query: 47 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNN 102
PP+ HSK MLL + +RI++ +ANL DW K L++ D P K
Sbjct: 284 PPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEKGGVMENMLFLIDLPRKANET 343
Query: 103 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA---SVP 159
+++ F ++L+ +L E + H +N F + S AA S
Sbjct: 344 VNDTTPFRDELVYFLRASTLNEKIIDKMLH---TLNSIFVNSNSLSLAACCCCCCWLSGG 400
Query: 160 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL--SSSMSS 217
+ S ++ GH L T ++ + L Y SS+GSL ++ L S+ +
Sbjct: 401 SHIGSGSYERTGHCGLGTAVKSLGLATS-RPLKLDYITSSVGSLTATFLQNLYWSAQGDN 459
Query: 218 GFSEDKTPLG----------------------IGEPLIVWPTVEDVRCSLEGYAAGNAI- 254
G + G G + +P+ E VR S G +A +
Sbjct: 460 GTKQLSARAGNTRSSNKSNQSSKRSGRGDDDWTGRMKVYFPSRETVRSSRGGVSAAGTLC 519
Query: 255 --------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSA 306
P ++V +D S +R + + W + SA
Sbjct: 520 LMSKWYNSPMFPRDVMRDNRSVREGLLMHSKVLYARPEGEARKGESRSADCAGWAYVGSA 579
Query: 307 NLSKAAWGAL----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 362
NLS++AWG L + ++L R++E GV ++P + S + + E
Sbjct: 580 NLSESAWGRLVIDRKTKQAKLNCRNWESGV-VVPVGRGEDGTQRGASAASAAAGAAPEAE 638
Query: 363 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 409
SQ + +PVP + P + Y+ ++ PW
Sbjct: 639 LSQTFR--------------------AAVPVPMQEPGREYAEDEQPW 665
>gi|449302183|gb|EMC98192.1| hypothetical protein BAUCODRAFT_416098 [Baudoinia compniacensis
UAMH 10762]
Length = 610
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 91/403 (22%), Positives = 162/403 (40%), Gaps = 66/403 (16%)
Query: 3 DIDWLLPAC---PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGT---H 56
D++W+L P + V+ + D + M A + P G+
Sbjct: 164 DVEWVLSKLKVPPNGGTTKCIFVMQAKEDSLRQQMLTETDAMRPFLRLTFPYMGGSVFCM 223
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFP-LKDQNN---LSEECGF 109
HSK MLL +P +RI + +ANL+ DW +++ D P L D+ +++ F
Sbjct: 224 HSKLMLLFHPHKLRIAIPSANLLSFDWGETGMMENSVFIIDLPRLVDEQRARVTADDLTF 283
Query: 110 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLK 168
+ Y LK + ++ F+F++ A + + + G G +
Sbjct: 284 FGKELLYF--LKKQDIDQDVR---------DGVLGFDFAATAHIAFVHTAGGTSFGEEAQ 332
Query: 169 KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSS---------MSSGF 219
+ G L ++ + + + + SS+GSL+++++ + S+ S+
Sbjct: 333 RTGLPGLARAVRSLRLQT--RSLEVDFAASSIGSLNDEFLRSVHSAAKGEDAIALTSAAA 390
Query: 220 SEDKTPLGIGEP--------------LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF 265
S+ K P I +PT E V S G AAG S + + F
Sbjct: 391 SQAKANFFRPSPGKRTSAADNIKTKLRIYFPTQETVTNSTAG-AAGTICLSRKWYENMTF 449
Query: 266 LKKYWAKWKASHTGRSRAMPHIKT-FAR----YNGQKLAWFLLTSANLSKAAWGALQKNN 320
+ + + ++ G + H K +AR Q +AW + SAN+S++AWG L +
Sbjct: 450 PRSVFRDYVSTRPG---LLSHNKILYARGKQKQGTQDVAWAYVGSANMSESAWGKLSYDR 506
Query: 321 S----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 359
++ R++E GVL+ A+R S SN E KSG
Sbjct: 507 KAKVWKVNCRNWECGVLLPVPAERLR---SAASNNNTKEAKSG 546
>gi|307211792|gb|EFN87773.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 95
Score = 52.8 bits (125), Expect = 3e-04, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 37/55 (67%), Gaps = 5/55 (9%)
Query: 284 MPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 336
MPHIK++ R + +++AWF+LTSANLSK+AWG I +YE+GV LP
Sbjct: 1 MPHIKSYTRISPDLKRIAWFVLTSANLSKSAWGV---QRGDYYITNYEVGVAFLP 52
>gi|387220095|gb|AFJ69756.1| tyrosyl-dna phosphodiesterase 1, partial [Nannochloropsis gaditana
CCMP526]
Length = 103
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 42/84 (50%), Gaps = 22/84 (26%)
Query: 265 FLKKYWAKWKASHTGRSRAMPHIKTFARY-------------NGQ---------KLAWFL 302
+LK+ A+W+ GR RAMPH+K+F R+ NG+ +LAW L
Sbjct: 20 YLKERLARWEGGRWGRQRAMPHLKSFLRFSVIREGAGAAPGENGRGQGACKETTRLAWVL 79
Query: 303 LTSANLSKAAWGALQKNNSQLMIR 326
+TS N SK AWG LQ I+
Sbjct: 80 ITSHNYSKPAWGELQSKGEVFKIQ 103
>gi|310793199|gb|EFQ28660.1| ubiquitin interaction domain-containing protein [Glomerella
graminicola M1.001]
Length = 628
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 109/474 (22%), Positives = 175/474 (36%), Gaps = 94/474 (19%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +WLL V + +LV + ++ ++ N P + P P+ G HSK +
Sbjct: 176 DEEWLLSKVDV-RQTRLLLVAYANNEAEKAAIRANAPTGLVRFCFP-PMYGGYMHSKLQI 233
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSE---ECGFENDLIDY 116
L Y +RI++ + NL+ DW +++ D P + + E F +L +
Sbjct: 234 LKYEGYLRIVIPSGNLVPYDWGETGVLENMVFLIDLPKLESTQQAAPPAETLFGTELRRF 293
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKL 175
L L E K+ S ++F+ ++ + S+ G H S W H
Sbjct: 294 LRALGLDE-----------KLVKSL-DSYDFTETSRYGFVHSIAGSHANDS---WQHTGQ 338
Query: 176 RTV----LQECTFEKGFKKSPLV---YQFSSLGSLDEKWMAEL--SSSMSSGFSE----- 221
T L G V Y SSLGSL++ + + + SG E
Sbjct: 339 STRGYCGLGSTVRSLGLATEDAVDIDYVASSLGSLNDASLKAIYYACQGDSGMKEYDARK 398
Query: 222 -------------DKTPLGIGEPL-------IVWPTVEDVRCSLEGYAAGNAIPSPQKNV 261
D + EPL I +PT V S G ++ I
Sbjct: 399 PKPARSKAAKAGLDGSRPVFNEPLQLQRHFRIYFPTEHTVSSSRGGRSSAGTIC------ 452
Query: 262 DKDFLKKYWAK-------WKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWG 314
F +K+W + + RS + H K AW + SANLS++AWG
Sbjct: 453 ---FQEKWWKSSTFPRELLRDCQSVRSGLLLHTKAIFVQARDGAAWAYMGSANLSESAWG 509
Query: 315 ALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTK 370
L K +L R++E GVL+ G + T V + + G S+ +
Sbjct: 510 RLVKERDSGAPKLTCRNWECGVLVAVDGNLPGSADTGTRPGVDQDAQ-GQAPMSKGEGGP 568
Query: 371 LVTLT--------WHGSSDAGASSEVVY---LPVPYELPPQRYSSEDV----PW 409
VT+T D E V+ +P+P ++P RY+S++ PW
Sbjct: 569 AVTVTDSEEKQRHQQLGQDEPRCLEGVFGTTMPIPMKVPAGRYTSDESAASRPW 622
>gi|388580252|gb|EIM20568.1| phospholipase D/nuclease [Wallemia sebi CBS 633.66]
Length = 417
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 70/151 (46%), Gaps = 36/151 (23%)
Query: 54 GTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN----NLSEECG 108
GT+H+K L+ G +R++V TAN I +DW ++MQDFPLK Q + ++
Sbjct: 8 GTYHAKFALIFTTDGWLRVVVTTANFIPIDWMWNENTVFMQDFPLKGQTLGGESSEQKSA 67
Query: 109 FEND----------------LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 152
F++D + D + P A K++FS +
Sbjct: 68 FQSDWTWFLYKLKLNKSLKLVADQMPDTPLPNVDA--------------VNKWDFSRSKA 113
Query: 153 RLIASVPGYHTG-SSLKKWGHMKLRTVLQEC 182
RLI+S+ ++G +++K GH +L ++++
Sbjct: 114 RLISSISETYSGLENIRKVGHFRLADLVRQA 144
>gi|374105912|gb|AEY94823.1| FAAR169Cp [Ashbya gossypii FDAG1]
Length = 540
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 92/390 (23%), Positives = 142/390 (36%), Gaps = 80/390 (20%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
+++WLL P HV V+ GT++ + A +P F +HHSK ++
Sbjct: 110 EMNWLLDLIP-----EHVQVVVTAQKGTVQEADGGRAARVRYRMVWMP-PFSSHHSKMVI 163
Query: 63 LIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK 121
Y + R+++ +AN ++ + Q +WM + + F + L DYL
Sbjct: 164 AFYQDQRCRVVLPSANFTALETSLPQQMMWMTPQLAHSRAAEQQPSRFRSGLQDYLQM-- 221
Query: 122 WPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLIASVPGYHTGSSLKKWGHMKLRTV 178
+PE L +K +F+ + + S PG T + K G +L
Sbjct: 222 YPEPDRELLQR---------LRKIDFAPVDATGAAFVYSAPGARTRA---KTGLAQLAAQ 269
Query: 179 LQECTFEKGFKKSPLVYQFSSLG------------SLDEKWMAELSSSMSSGFSED-KTP 225
L E G + S Q SS+G +L M L S + G + K
Sbjct: 270 LDEGPAAGG-RHSHYFCQSSSIGGPLNSRSAENPRNLFVHLMVPLLSGHTQGLPKSVKDC 328
Query: 226 LGIGE-----------PLIVWPTVEDVRCSLEGYAAG--------------NAIPSPQKN 260
LG E P I++PTVED G+ A N S + N
Sbjct: 329 LGEKEAYALLQRERLHPYILYPTVEDFNECFTGWLASGWFHFHHSRTAATRNHYSSLRDN 388
Query: 261 ----VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNG---------QKLAWFLLTSAN 307
+++ + + R R H K + ++ WFL TSAN
Sbjct: 389 GCFVKQREYELRPGGRTALPIIRRDRVPCHTKFYIKFASASATSWNSLTDCEWFLFTSAN 448
Query: 308 LSKAAWGALQKNNSQLMIRSYELGVLILPS 337
LS AWGA ++YE GVL S
Sbjct: 449 LSTHAWGA----PPSYQPKNYECGVLYTKS 474
>gi|45184994|ref|NP_982712.1| AAR169Cp [Ashbya gossypii ATCC 10895]
gi|44980615|gb|AAS50536.1| AAR169Cp [Ashbya gossypii ATCC 10895]
Length = 540
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 92/390 (23%), Positives = 142/390 (36%), Gaps = 80/390 (20%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
+++WLL P HV V+ GT++ + A +P F +HHSK ++
Sbjct: 110 EMNWLLDLIP-----EHVQVVVTAQKGTVQEADGGRAARVRYRMVWMP-PFSSHHSKMVI 163
Query: 63 LIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK 121
Y + R+++ +AN ++ + Q +WM + + F + L DYL
Sbjct: 164 AFYQDQRCRVVLPSANFTALETSLPQQMMWMTPQLAHSRAAEQQPSRFRSGLQDYLQM-- 221
Query: 122 WPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLIASVPGYHTGSSLKKWGHMKLRTV 178
+PE L +K +F+ + + S PG T + K G +L
Sbjct: 222 YPEPDRELLQR---------LRKIDFAPVDATGAAFVYSAPGARTRA---KTGLAQLAAQ 269
Query: 179 LQECTFEKGFKKSPLVYQFSSLG------------SLDEKWMAELSSSMSSGFSED-KTP 225
L E G + S Q SS+G +L M L S + G + K
Sbjct: 270 LDEGPAAGG-RHSHYFCQSSSIGGPLNSRSAENPRNLFVHLMVPLLSGHTQGLPKSVKDC 328
Query: 226 LGIGE-----------PLIVWPTVEDVRCSLEGYAAG--------------NAIPSPQKN 260
LG E P I++PTVED G+ A N S + N
Sbjct: 329 LGEKEAYALLQRERLHPYILYPTVEDFNECFTGWLASGWFHFHHSRTAATRNHYSSLRDN 388
Query: 261 ----VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNG---------QKLAWFLLTSAN 307
+++ + + R R H K + ++ WFL TSAN
Sbjct: 389 GCFVKQREYELRPGGRTALPIIRRDRVPCHTKFYIKFASASATSWNSLTDCEWFLFTSAN 448
Query: 308 LSKAAWGALQKNNSQLMIRSYELGVLILPS 337
LS AWGA ++YE GVL S
Sbjct: 449 LSTHAWGA----PPSYQPKNYECGVLYTKS 474
>gi|70984252|ref|XP_747643.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66845270|gb|EAL85605.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 277
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 85/183 (46%), Gaps = 29/183 (15%)
Query: 40 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDF 95
+N L PP+ HSK MLL +P +RI+ TANL DW ++ D
Sbjct: 2 SNLRLCFPPMEGQVNCMHSKLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSAFLIDL 61
Query: 96 PLK-DQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 151
P K ++ + FE +L+ +L STL+ S +F+FS ++
Sbjct: 62 PRKVATTSVGSKTVFEEELVYFLRASTLQENIISR--------------LDEFDFSPTSH 107
Query: 152 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWM 208
+ L+ ++ G HTG++ ++ G+ L + G + S P+ F SS+GSL ++++
Sbjct: 108 IMLVHTIGGSHTGNTWRRTGYCGLGRAVNAL----GLRTSKPINIDFVASSVGSLTDEFL 163
Query: 209 AEL 211
+
Sbjct: 164 RSI 166
>gi|293335739|ref|NP_001168462.1| hypothetical protein [Zea mays]
gi|223948435|gb|ACN28301.1| unknown [Zea mays]
gi|414587433|tpg|DAA38004.1| TPA: hypothetical protein ZEAMMB73_810727 [Zea mays]
Length = 989
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 33/189 (17%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGES--------DGTLEHMKRNKPANWILHKPPLPISF 53
+DI W L C + +P + H + + T + + + + P I+F
Sbjct: 315 LDISWFLNYCKIPQHLPVTIACHNKERCWSASSENRTAAPFESHPKLLLVFPRFPEDIAF 374
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
G HH K ++L +R+IV +ANL+ W+ + +W QDFP + + +
Sbjct: 375 GKDRKKQGVACHHPKLIVLQREDSMRVIVTSANLVPRQWHLITNTVWWQDFPCRTSPDYA 434
Query: 105 --------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 156
++ F L+ +++++ N + I K++F A LIA
Sbjct: 435 ALFSAAKKQKSDFAAQLVSFIASM------VNEVRSQAYWITE--VAKYDFEGAGGYLIA 486
Query: 157 SVPGYHTGS 165
SVPG H S
Sbjct: 487 SVPGIHAQS 495
>gi|326521102|dbj|BAJ96754.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 646
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 52/191 (27%), Positives = 78/191 (40%), Gaps = 39/191 (20%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGE-------SDGTLEHMKRNKPANWILHKPPLP--ISF 53
DI W L C + +P + H + S+ N P N +L P P I+F
Sbjct: 312 DISWFLDYCKIPQYLPVTIACHNKDRCWSANSESRTAAPFENHP-NILLVYPRFPEVIAF 370
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
G HH K ++L +R+I+ +ANL+ W+ + +W QDFP
Sbjct: 371 GKDRKNQGVACHHPKLIVLQREDSMRVIISSANLVPRQWHLITNTVWWQDFP-------- 422
Query: 105 EECGFENDLIDYLSTLKWP--EFSANLPAHGNFKIN--PS------FFKKFNFSSAAVRL 154
C D S + P +F+A L + IN PS +++F A L
Sbjct: 423 --CRTSPDYSALFSAFEGPKSDFAAQLVSFIGSLINEVPSQAYWINEIARYDFEGAGGYL 480
Query: 155 IASVPGYHTGS 165
+ASVPG + S
Sbjct: 481 VASVPGLYMPS 491
>gi|380495056|emb|CCF32689.1| ubiquitin interaction domain-containing protein [Colletotrichum
higginsianum]
Length = 641
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 109/481 (22%), Positives = 180/481 (37%), Gaps = 103/481 (21%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +WLL + +L+ + ++ ++ N P + P P+ G HSK +
Sbjct: 174 DEEWLLGKVDAR-QTKMLLIAYANNEAEKATIRANAPTGLVRFCFP-PMHGGYMHSKLQI 231
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL---KDQNNLSEECGFENDLIDY 116
L Y +RI++ + NL+ DW +++ D P Q F +L +
Sbjct: 232 LKYEGYLRIVIPSGNLVPYDWGETGVLENMVFLIDLPRIGGTHQTAPPAGTAFGTELRRF 291
Query: 117 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKL 175
L L E K+ S ++FS ++ + S+ G H S + G+ L
Sbjct: 292 LRALGLDE-----------KLVKS-LDNYDFSKTSRYGFVHSIAGSHANDSWQHTGYCGL 339
Query: 176 RTVLQECTFEKGFKKSP--LVYQFSSLGSLDEKWMAEL--SSSMSSGFSE---------- 221
+ ++ + P + Y SSLGSL ++ + + SG E
Sbjct: 340 GSTVRSLGLA---TEEPVNIDYVASSLGSLTHDYLTAIYHACQGDSGMKEYEARQSKPTR 396
Query: 222 ---DKTPLGIGEPL------------IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL 266
K L PL I +PT + V S G ++ I F
Sbjct: 397 NKAAKAGLAGSRPLGEGTLQWQHHFRIYFPTEKTVSSSRGGRSSAGTIC---------FQ 447
Query: 267 KKYWAK-------WKASHTGRSRAMPHIKT-FARYN-GQKLAWFLLTSANLSKAAWGALQ 317
+K+W + + RS + H K+ F R G AW + SANLS++AWG L
Sbjct: 448 EKWWKSSTFPRELLRDCQSVRSGLLLHSKSIFVRGRAGGDAAWAYVGSANLSESAWGRLV 507
Query: 318 KNN----SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKL-- 371
K+ ++L R++E GVL+ G S T V + S +++Q L
Sbjct: 508 KDRESGAAKLTCRNWECGVLVAVEGNPTGTADSGTRPGVDQDAHSRRHPWARVQAQTLEG 567
Query: 372 -----VTLTWHGSSDAGAS-------------------SEV--VYLPVPYELPPQRYSSE 405
T T G + A A+ EV +P+P ++P RY S+
Sbjct: 568 YARDEETSTSRGVAAATAADSEENRRQQQLDRDESAGLDEVFGTTVPIPMKVPAGRYMSD 627
Query: 406 D 406
+
Sbjct: 628 E 628
>gi|323449457|gb|EGB05345.1| hypothetical protein AURANDRAFT_72265 [Aureococcus anophagefferens]
Length = 1631
Score = 51.6 bits (122), Expect = 9e-04, Method: Composition-based stats.
Identities = 58/207 (28%), Positives = 86/207 (41%), Gaps = 37/207 (17%)
Query: 151 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK-WMA 209
V I SVPG+ G+ +GH +R L +G + + SSLG LD K ++
Sbjct: 850 GVHFIGSVPGFRRGAFADAFGHRAIRRALA----REGLTVARAEFANSSLGRLDNKVFLR 905
Query: 210 ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC----SLEGYAAGNAIPSPQKNVDKDF 265
++S+ D+ IVWP+ + C L +A + Q N D
Sbjct: 906 GFATSLFGAGDLDRLK-------IVWPS-QATACRSSRKLMLHAMTEDKGTAQMNGPDDR 957
Query: 266 LKKYWAKWKASHTGRSR-----------AMPHIKTFARYNG-QKLAWFLLTSANLSKAAW 313
+ W A+ R+R + H K A ++G +L + S N S AAW
Sbjct: 958 I------WNAAGFPRARFHHYHAPSDRQTLHHTKMLACFDGDDRLVAVVGGSHNCSGAAW 1011
Query: 314 GALQKNNSQLMIRSYELGVLILPSAKR 340
G + N S +M SYE GVL+ A R
Sbjct: 1012 GVGEDNMSVIM--SYEAGVLVACGAGR 1036
>gi|414587432|tpg|DAA38003.1| TPA: hypothetical protein ZEAMMB73_810727, partial [Zea mays]
Length = 816
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 33/189 (17%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGES--------DGTLEHMKRNKPANWILHKPPLPISF 53
+DI W L C + +P + H + + T + + + + P I+F
Sbjct: 315 LDISWFLNYCKIPQHLPVTIACHNKERCWSASSENRTAAPFESHPKLLLVFPRFPEDIAF 374
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
G HH K ++L +R+IV +ANL+ W+ + +W QDFP + + +
Sbjct: 375 GKDRKKQGVACHHPKLIVLQREDSMRVIVTSANLVPRQWHLITNTVWWQDFPCRTSPDYA 434
Query: 105 --------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 156
++ F L+ +++++ N + I K++F A LIA
Sbjct: 435 ALFSAAKKQKSDFAAQLVSFIASM------VNEVRSQAYWITE--VAKYDFEGAGGYLIA 486
Query: 157 SVPGYHTGS 165
SVPG H S
Sbjct: 487 SVPGIHAQS 495
>gi|398406723|ref|XP_003854827.1| hypothetical protein MYCGRDRAFT_35953, partial [Zymoseptoria
tritici IPO323]
gi|339474711|gb|EGP89803.1| hypothetical protein MYCGRDRAFT_35953 [Zymoseptoria tritici IPO323]
Length = 266
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 101/253 (39%), Gaps = 45/253 (17%)
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFPLKDQNNLSEEC---GFE 110
HSK MLL +P +RI + TANL++ DW Q ++M D P +SE F
Sbjct: 20 HSKLMLLFHPDKLRIAIPTANLLNFDWGETGQMENSVFMVDLPRLADGKISEAGDLPAFG 79
Query: 111 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKK 169
+LI +L + + KF+FS+ + + +V G H ++
Sbjct: 80 QELIYFLEQQGLDD-----------DVRTGVL-KFDFSATKDMAFVHTVGGMHFRDEAER 127
Query: 170 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSS--------------- 214
G M L +++ + L + SS+G L++ ++ + S+
Sbjct: 128 TGLMGLSKAVKQLNLAT--QDLELDFAASSIGRLNDNYLRDFHSAAKGISLIAQAAEAKS 185
Query: 215 -MSSGFSEDKTPLGIGEP-------LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL 266
+S F + K + +P I +PT VR S G AAG + F
Sbjct: 186 KAASTFFDRKKASTVAKPDNVREKVRIYFPTASTVRVSTAG-AAGTLCIARNYFEGSTFP 244
Query: 267 KKYWAKWKASHTG 279
+ + +K++ TG
Sbjct: 245 RACFRDYKSTRTG 257
>gi|440473340|gb|ELQ42143.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae Y34]
gi|440489437|gb|ELQ69093.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae P131]
Length = 614
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 89/395 (22%), Positives = 161/395 (40%), Gaps = 71/395 (17%)
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 117
++A LL +P +RI+V + NL+ DW ++ G+ + D L E++ +
Sbjct: 223 NEADLLKFPGYLRIVVPSGNLVPYDWGEQN-GIMENSVFIIDLPPLKAGVKLEDNTLTSF 281
Query: 118 STLKWPEFSANLPAHG-NFKINPSFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKL 175
E S L A G N +I S +K++FS ++ + ++ G HTG ++ G+ L
Sbjct: 282 GE----ELSYFLTAQGLNERIINSL-RKYDFSQTSRYAFVHTIAGVHTGDKWRRTGYCGL 336
Query: 176 RTVLQECTF------EKGFKKSPLVYQF---------SSLGSLDEKWMAELSSSM--SSG 218
+Q E F S Y F SS+G+L ++ L ++ SG
Sbjct: 337 GRAIQNLGLATDEPVEIDFVVSGPNYPFLPNYLRQAASSMGALKYGYLLALYNAFQGDSG 396
Query: 219 FSE-----DKTPLGIGEPL------------IVWPTVEDVRCSLEGYAAGNAIPSPQKNV 261
+ KT + I +P++ V S G + +
Sbjct: 397 LKDYQSRASKTKTSKEDAASAQQAKLRDFFRIYFPSLATVEASRGGTRSAGTL------- 449
Query: 262 DKDFLKKYWAKWKASHTGRS---------RAMPHIK-TFARYNGQKLAWFLLTSANLSKA 311
L+ W W+A+ R+ A+ H K FAR AW + SAN+S++
Sbjct: 450 ---CLRSGW--WEAATFPRALFRDYENPRGALVHSKIVFARPPDASAAWAYVGSANVSES 504
Query: 312 AWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKL 371
AW + Q ++ R++E GV I+P + G + ++ I P + +G + + +
Sbjct: 505 AWASSQP---KMSCRNWECGV-IVPVGEPASPGRTLSTGIDPGDASAGKGGSLHGHQARN 560
Query: 372 VTLTWHGSSDAGASSEVVY---LPVPYELPPQRYS 403
+ S E ++ +P+P +LP + Y+
Sbjct: 561 SPQEQNAPVGRSRSIEELFSECVPLPMQLPGRSYA 595
>gi|357167454|ref|XP_003581171.1| PREDICTED: uncharacterized protein LOC100837648 [Brachypodium
distachyon]
Length = 987
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 80/189 (42%), Gaps = 35/189 (18%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPPLP--ISF 53
DI W L C + +P + H + + + N P N +L P P I+F
Sbjct: 315 DICWFLDYCNIPQHLPVTIACHNKERCWSASRESRMAAPFVNHP-NVLLVYPQFPEVIAF 373
Query: 54 G---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 104
G HH K ++L +R+I+ +ANL+ W+ + +W QDFP + + S
Sbjct: 374 GKDRKKQGVACHHPKLIVLQREDSMRVIITSANLVPRQWHLITNTVWWQDFPCRTSPDYS 433
Query: 105 E--------ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 156
+ F L+ ++ +L +P+ + IN K+NF A L+A
Sbjct: 434 AIFSAVEEPKSDFAVQLVSFIGSLI-----NEVPSQA-YWINE--IAKYNFEGAGGYLVA 485
Query: 157 SVPGYHTGS 165
SVPG + S
Sbjct: 486 SVPGLYMPS 494
>gi|121703656|ref|XP_001270092.1| tyrosyl-DNA phosphodiesterase domain protein [Aspergillus clavatus
NRRL 1]
gi|119398236|gb|EAW08666.1| tyrosyl-DNA phosphodiesterase domain protein [Aspergillus clavatus
NRRL 1]
Length = 683
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 100/440 (22%), Positives = 169/440 (38%), Gaps = 84/440 (19%)
Query: 20 VLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTAN 77
+LV+ + D T + + N L PP+ HSK MLL +P +RI+V TAN
Sbjct: 276 LLVMQAKDDATKRQYEAETASMRNLRLCFPPMDGQINCMHSKLMLLFHPEYLRIVVPTAN 335
Query: 78 LIHVDWNN----KSQGLWMQDFP--LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPA 131
L DW ++ D P ++ + F DL+ +LS + E N+ A
Sbjct: 336 LTPYDWGEMGGVMENSAFLIDLPRKSSTLSSSDSKTAFLEDLVFFLSASRLHE---NVIA 392
Query: 132 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 191
K+ F++ + + L+ ++ G H + K G L ++ FK
Sbjct: 393 ----KLGDYDFRE----TKHIMLVHTIGGSHI-ENFSKTGFCGLGRAVKALGLST-FKSI 442
Query: 192 PLVYQFSSLGSLDEKWMAE--LSSSMSSGFSE-----DKT----PLGIGEPLIVWPTVED 240
+ Y SS+GSL ++++ L+ G +E KT P +++ P E+
Sbjct: 443 SIDYVTSSVGSLTDEFLRSIYLACQGDDGMTEHALRTTKTMPARPPTTTSSILLKPAAEE 502
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKD-----------FLKKYWAK-------WKASHTGRSR 282
+ Y PS Q V++ F ++++ + + R
Sbjct: 503 CKDRFRVY-----FPS-QTTVEQSRGGPNCAGTICFQQRWYEGPKFPKHLLRDCKSRRPG 556
Query: 283 AMPHIKTFARY---------NGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYE 329
+ H K Q W + SANLS++AWG L ++ + +L R++E
Sbjct: 557 LLMHNKMLFVTPDEPITLPDTSQCQGWAYVGSANLSESAWGRLVQDRATKRPKLNCRNWE 616
Query: 330 LGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 389
GVLI A+ T+ P E +S + + G + +
Sbjct: 617 CGVLIPVRAE-------ATAENRPKESESKPVDG--------LDKPGEGEVERMLDTFKD 661
Query: 390 YLPVPYELPPQRYSSEDVPW 409
+PVP +P QRY PW
Sbjct: 662 TVPVPMRVPGQRYGPGLKPW 681
>gi|343426865|emb|CBQ70393.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 654
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 73/161 (45%), Gaps = 14/161 (8%)
Query: 55 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 114
T H K ++L++ +R+ + + NL +DW ++QDFPL G
Sbjct: 274 TQHMKFLILVHEGFLRVAILSGNLNQIDWERIENTAFIQDFPLLSSATKPNVAGPSQSTN 333
Query: 115 DYLSTLKWPEFSANLPA-HGNFKINPSFFKKFNFSSA-AVRLIASVPGYHTGSSLKKWGH 172
D+ L S +LPA H + + F+FS+A R++AS P SSL W
Sbjct: 334 DFKLQLIRSLRSLSLPASHAIY----AALDTFDFSAATCARIVASWP---EPSSLADWER 386
Query: 173 MKLRTV--LQECTFEKGFKKSPLV---YQFSSLGSLDEKWM 208
++ + + L + E G + S V Q SSL + D KW+
Sbjct: 387 IETQGLGRLGKVVRELGIRPSQSVEVECQGSSLANHDVKWV 427
>gi|408391841|gb|EKJ71209.1| hypothetical protein FPSE_08715 [Fusarium pseudograminearum CS3096]
Length = 598
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 71/172 (41%), Gaps = 16/172 (9%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +WL+ L K +L+ +S+ M+ N P P + G HSK L
Sbjct: 164 DDEWLVSKFD-LRKTKLLLLAFADSEAQKSEMRSNAPPGIKFVFPAM-NGPGAMHSKLQL 221
Query: 63 LIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
L YP +R++V +ANL+ DW +++ D P D + F +L +LS
Sbjct: 222 LKYPDYLRVVVPSANLVPYDWGETGVMENMVFLIDLPRLDGSATHRPTPFSIELGRFLSA 281
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 171
E N + +F S K F + ++PG H G LK+ G
Sbjct: 282 TGVGETMVNSLTNYDF----SQTKHLGF-------VYTIPGGHQGDELKRIG 322
>gi|302892021|ref|XP_003044892.1| hypothetical protein NECHADRAFT_94592 [Nectria haematococca mpVI
77-13-4]
gi|256725817|gb|EEU39179.1| hypothetical protein NECHADRAFT_94592 [Nectria haematococca mpVI
77-13-4]
Length = 674
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 72/177 (40%), Gaps = 18/177 (10%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D +WLL L + +LV + M+ N P P + G HSK L
Sbjct: 170 DDEWLLSKID-LRRTKLLLVASAADESQKREMQSNTPPGIRFCFPAMN-GPGAMHSKLQL 227
Query: 63 LIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGFENDLIDYLST 119
L YP +R++V TANL+ DW +++ D P + + + F +L +LS
Sbjct: 228 LKYPDYLRVVVPTANLVPYDWGETGVMENMVFLIDLPKLEASVDHQPTHFSTELGRFLSE 287
Query: 120 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKL 175
G S ++FS + + ++PG H G SLK+ G+ L
Sbjct: 288 T------------GVGAGMVSSLSNYDFSRTKHLGFVYTIPGGHVGDSLKRIGYCGL 332
>gi|254582597|ref|XP_002499030.1| ZYRO0E01914p [Zygosaccharomyces rouxii]
gi|238942604|emb|CAR30775.1| ZYRO0E01914p [Zygosaccharomyces rouxii]
Length = 513
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 125/318 (39%), Gaps = 54/318 (16%)
Query: 53 FGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN 111
F HHSK ++ +Y G +++ + + N + + N Q W+ P F++
Sbjct: 153 FTCHHSKLIINVYQDGSLQLFMPSNNFTYAETNYPQQVCWVS--PRLSACASPASSSFQS 210
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSS-AAVRLIASVPGYHTGSSLKKW 170
DL++YL + E N I P +KFNF + S P S +
Sbjct: 211 DLLNYLKSYDLREI--------NRYIIPEV-EKFNFEPLEGTEFVYSTPSKDYLSGFQLL 261
Query: 171 GHMKLRTVLQECTFEKGFKKSPLVYQFSSLG-SLDEKWMAELSSSM-------------- 215
KLR + S + Q SS+G SL K L + M
Sbjct: 262 AQ-KLRYKKENGDTSIKHHLSHYLCQSSSVGNSLSRKEPCNLLTHMIIPVLEGIIPKDSK 320
Query: 216 ----SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGN------AIPSPQKNVDKDF 265
+S ED I P +++PTV+++ S G+ N+ +D
Sbjct: 321 KLPSTSQLLEDYRSHHIV-PYLLYPTVQEIVDSPVGWLCSGWFNFNYNKDMAHYNMLRDE 379
Query: 266 LKKYWAKWKASHTGRSRAMP-----HIKTFARYNGQK----LAWFLLTSANLSKAAWGAL 316
+ + K+ + + RA P ++K+ R +K L W L TSANLS +AWG
Sbjct: 380 FNIFHKQKKSQLSPQRRATPSHSKFYMKSTTRNPNEKPFRELDWCLFTSANLSFSAWGK- 438
Query: 317 QKNNSQLMIRSYELGVLI 334
+ R+YE+G+L+
Sbjct: 439 ----TSAKPRNYEVGILL 452
>gi|430811371|emb|CCJ31122.1| unnamed protein product [Pneumocystis jirovecii]
Length = 402
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/269 (21%), Positives = 99/269 (36%), Gaps = 49/269 (18%)
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPLKDQNNLSEECGFENDL 113
H K LL Y +R+++ +ANL+ DW +++ DFP ++ FE DL
Sbjct: 171 HCKLQLLFYTTYLRVVIPSANLVDYDWGETGVMENSMYIHDFPRRESAFTEFSTNFERDL 230
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS-SLKKWGH 172
Y +P+ +FK+ S + + S+P S LK G+
Sbjct: 231 FHYCKAKNYPDHILKKMQCYDFKM-----------SKNIHFVHSIPARALNSVDLKDTGY 279
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL 232
+ L +Q+ + SSLG L +M + ++ D++ L
Sbjct: 280 LSLARAVQKLGKASKNDIEINIIVTSSLGLLKSAFMTNIYRALKG----DQSIASYNMDL 335
Query: 233 IVW--------PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAM 284
W P++ V S G + I F K++W + +S M
Sbjct: 336 QSWKTSIKVHFPSINTVLSSNGGKESAGTIC---------FQKQFWENLEFP---KSCLM 383
Query: 285 PHIKTFARYNGQKLAWFLLTSANLSKAAW 313
H K+ +SANLS++AW
Sbjct: 384 HH----------KIILVRNSSANLSESAW 402
>gi|325095061|gb|EGC48371.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces capsulatus H88]
Length = 652
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 78/323 (24%), Positives = 128/323 (39%), Gaps = 67/323 (20%)
Query: 137 INPSFFKK---FNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE--KGFKK 190
+N KK F+FS+ + I ++ G HT +K G L + + +
Sbjct: 342 VNEQIIKKMLRFDFSATKDIAFIHTIGGSHTDPKWEKTGLCGLGRAITSLNLQTSQDINL 401
Query: 191 SPLVYQFSSLGSLDEKWM-------------AELSSSMSSGFSEDK----TPLGIGEP-- 231
+V+Q SS+GSL+E+++ EL+ S F +K T G
Sbjct: 402 DYIVFQTSSVGSLNEQFLRSIYLAAQGDNGLKELTLRTSRTFPSEKWGVVTNKSDGAKWK 461
Query: 232 ---LIVWPTVEDVRCSLEGYAAGNAIPSPQK-----NVDKDFLKKYWAKW-------KAS 276
+ +P++ VR S G I K KD ++ ++ K
Sbjct: 462 DKFRVYFPSLNTVRNSKGGIENAGTICFQSKWYNSATFPKDIMRDNISRREGLLMHNKML 521
Query: 277 HTGRSRAMPHIKTFA-RYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELG 331
+ + +K + RY+G W + SANLS++AWG L + + +L R++E G
Sbjct: 522 FVRPDKPITSVKNNSIRYSG----WTYVGSANLSESAWGRLVLDRATTKPKLNCRNWECG 577
Query: 332 VL--ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 389
V+ I + + T I S +SG TS SD G+ V
Sbjct: 578 VVIPIRHNDEEKSSYIPSTRGITTSVAESGGGNTS-------------AGSDDGSRVASV 624
Query: 390 Y---LPVPYELPPQRYSSEDVPW 409
+ +PVP ++P QRY D P+
Sbjct: 625 FEPTVPVPMKVPAQRYHGRDRPF 647
>gi|342884381|gb|EGU84597.1| hypothetical protein FOXB_04892 [Fusarium oxysporum Fo5176]
Length = 632
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 71/181 (39%), Gaps = 31/181 (17%)
Query: 3 DIDWLLPAC-PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAM 61
D +WL+ P K+ +L+ +S+ M+ N P P + G HSK
Sbjct: 168 DDEWLMSKIDPRKTKL--LLLAFADSEAQKSEMRSNAPPGIKFVFPAM-NGPGAMHSKLQ 224
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGFENDLIDYLS 118
LL YP +R++V TANL+ DW +++ D P + F +L +LS
Sbjct: 225 LLKYPDYLRVVVPTANLVPYDWGETGVMENMVFLIDLPRLKDPATYRQTAFSTELGRFLS 284
Query: 119 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV 178
E H F + ++PG H G SLK+ G+ L T
Sbjct: 285 ATGVGEG-----MHLGF-------------------VYTIPGGHQGDSLKRIGYSGLGTT 320
Query: 179 L 179
+
Sbjct: 321 V 321
>gi|350636132|gb|EHA24492.1| hypothetical protein ASPNIDRAFT_183042 [Aspergillus niger ATCC
1015]
Length = 324
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 85/190 (44%), Gaps = 25/190 (13%)
Query: 41 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDFP 96
N L PP+ HSK MLL +P +R++V TANL DW + +++ D P
Sbjct: 3 NLRLCFPPMGGQVVCMHSKLMLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLIDLP 62
Query: 97 LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 153
K N+ E+ F DL+ + LK N+ A F+FS ++
Sbjct: 63 KK---NVLEKPTTHFYEDLVVF---LKASTLHENIIAK---------LDNFDFSKTSKYA 107
Query: 154 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--AEL 211
+ ++ G HT ++ K+ G+ L ++ + + Y SS+G++ ++++ L
Sbjct: 108 FVHTIGGSHTDTAWKRTGYCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFLRCMYL 166
Query: 212 SSSMSSGFSE 221
+S G +E
Sbjct: 167 ASQGDDGLTE 176
>gi|443895439|dbj|GAC72785.1| ras-related GTPase [Pseudozyma antarctica T-34]
Length = 689
Score = 46.2 bits (108), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 83/184 (45%), Gaps = 32/184 (17%)
Query: 55 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ----------NNLS 104
T H K ++L++P +R+ + + NL +DW ++QDFPL ++
Sbjct: 300 TQHMKFLILVHPDFLRVAILSGNLNGIDWERIENTAYIQDFPLNTDTAKAATPAHGSSQG 359
Query: 105 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHT 163
F+ L+ L +L P ++ P + + + +FS A R++AS P
Sbjct: 360 RTNDFKAQLVRILRSLGMP---SSHPVY-------AALDRHDFSQATRARIVASWP---E 406
Query: 164 GSSLKKWGHM------KLRTVLQECTFEKGFKKS-PLVYQFSSLGSLDEKWMAELSSSMS 216
S+L +W M +L V+++ + S L Q SSL + D KW+ E ++
Sbjct: 407 ASNLAEWDRMETQGLGRLGKVVRDLGIQPKRSGSLQLECQGSSLANHDIKWI-EHFHLLA 465
Query: 217 SGFS 220
SGF+
Sbjct: 466 SGFN 469
>gi|115386326|ref|XP_001209704.1| predicted protein [Aspergillus terreus NIH2624]
gi|114190702|gb|EAU32402.1| predicted protein [Aspergillus terreus NIH2624]
Length = 381
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 70/165 (42%), Gaps = 20/165 (12%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--ANWILHKPPLPISFGTHHS 58
M D+DWL + V ++ + D T +R N L PP+ HS
Sbjct: 227 MWDMDWLFSKMDQV-NTRFVFLMQAKDDATKRQYERETADLRNLKLCFPPMEGQVQCMHS 285
Query: 59 KAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQDFPLKDQNNLSEECGFENDLI 114
K M+L +P VRI++ TANL DW +++ D P ++ E F+ +LI
Sbjct: 286 KLMILFHPGHVRIVIPTANLTPYDWGEMGGVMENTVFLIDLPKLHPDSERIETNFKKELI 345
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASV 158
+L A +++ + +++FS A + L+ S+
Sbjct: 346 YFLQ------------ASAAYEMVTTKLNEYDFSKTAHIALVHSI 378
>gi|302653979|ref|XP_003018803.1| hypothetical protein TRV_07162 [Trichophyton verrucosum HKI 0517]
gi|291182481|gb|EFE38158.1| hypothetical protein TRV_07162 [Trichophyton verrucosum HKI 0517]
Length = 429
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 54/124 (43%), Gaps = 13/124 (10%)
Query: 3 DIDWLL-----PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 57
D+DWLL P+ L ++ GE T + + L PP+ H
Sbjct: 230 DMDWLLMKFTNPSTRFL----FIMGAKGEERRTQLLRETASMSRIRLCFPPMDGEVNCMH 285
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQNNLSEECGFENDL 113
SK MLL + +RI++ +ANL DW K L++ D P K + + F ++L
Sbjct: 286 SKLMLLFHANHLRIVIPSANLDPYDWGEKGGVMENMLFLIDLPRKANETIDDTTPFRDEL 345
Query: 114 IDYL 117
+ +L
Sbjct: 346 VYFL 349
>gi|367050628|ref|XP_003655693.1| hypothetical protein THITE_2130975 [Thielavia terrestris NRRL 8126]
gi|347002957|gb|AEO69357.1| hypothetical protein THITE_2130975 [Thielavia terrestris NRRL 8126]
Length = 657
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 1/83 (1%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D+ WLL LA+ +L+ + E M+ P I P G+ HSK L
Sbjct: 262 DVRWLLSKVD-LARTKLILIAFAADEAHKEEMRNAVPRERIRFCFPPMQPVGSMHSKLQL 320
Query: 63 LIYPRGVRIIVHTANLIHVDWNN 85
L Y + +RI+V T NL+ DW
Sbjct: 321 LKYEKYMRIVVPTGNLMSFDWGE 343
>gi|225554729|gb|EEH03024.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces capsulatus G186AR]
Length = 676
Score = 44.7 bits (104), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 80/324 (24%), Positives = 126/324 (38%), Gaps = 72/324 (22%)
Query: 137 INPSFFKK---FNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP 192
+N KK F+FS+ + I ++ G HT +K G L + + +
Sbjct: 369 VNEQIIKKMLRFDFSATKDIAFIHTIGGSHTDPKWEKTGLCGLGRAITSLNLQTS-QDIN 427
Query: 193 LVYQFSSLGSLDEKWM-------------AELSSSMSSGFSEDK----TPLGIGEP---- 231
L Y SS+GSL+E+++ EL+ S F +K T G
Sbjct: 428 LDYITSSVGSLNEQFLRSIYLAAQGDNGLKELTLRTSRTFPSEKWGVVTNKSDGAKWKDK 487
Query: 232 -LIVWPTVEDVRCSLEGYAAGNAIPSPQK-----NVDKDFLKKYWAKW-------KASHT 278
+ +P++ VR S G I K KD ++ ++ K
Sbjct: 488 FRVYFPSLNTVRNSKGGIENAGTICFQSKWYNSATFPKDIMRDNISRREGLLMHNKMLFV 547
Query: 279 GRSRAMPHIKTFA-RYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVL 333
+ + +K + RY+G W + SANLS++AWG L + + +L R++E GV+
Sbjct: 548 RPDKPITSVKNNSIRYSG----WTYVGSANLSESAWGRLVLDRATTKPKLNCRNWECGVV 603
Query: 334 ILPSAKRHGCG-----FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 388
I RH T I S +SG TS SD G+
Sbjct: 604 I---PIRHNDEEKSPYIPSTRGITTSVAESGGGNTS-------------AGSDDGSRVAS 647
Query: 389 VY---LPVPYELPPQRYSSEDVPW 409
V+ +PVP ++P QRY D P+
Sbjct: 648 VFEPTVPVPMKVPAQRYHGRDRPF 671
>gi|296415071|ref|XP_002837215.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633076|emb|CAZ81406.1| unnamed protein product [Tuber melanosporum]
Length = 603
Score = 44.7 bits (104), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 91/221 (41%), Gaps = 27/221 (12%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHG--ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 60
DIDW+L P+ V+V+H E D + + + L PP+ HSK
Sbjct: 258 DIDWVLKKLPLDTIQRLVMVMHAKEEQDRSYKVQQLGSLPRTTLVLPPMQGQVSCMHSKL 317
Query: 61 MLLIYPRG----VRIIVHTANLIHVDWNN----KSQGLWMQDFPLKDQNNLSEECGFEND 112
MLL + G +R+ V +ANL DW +++ D P + N + F +
Sbjct: 318 MLLFHMNGDQRWLRVAVPSANLTDYDWGELGGVMENTVFIIDLPRLPKPN-HNQTHFAKE 376
Query: 113 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 172
L + + PE N G ++ + S K F + S+ G + G ++ G+
Sbjct: 377 LHHFCAAKGMPEDVLN----GLYRYDFSRTKDMAF-------VHSIGGSNAGKDWRRTGY 425
Query: 173 MKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL 211
L T ++ G L + F SSLG+ + +++ +
Sbjct: 426 SGLGTAVKALGLSSG---PGLEFDFVTSSLGAANMGFISNM 463
>gi|66822393|ref|XP_644551.1| hypothetical protein DDB_G0273869 [Dictyostelium discoideum AX4]
gi|66822691|ref|XP_644700.1| hypothetical protein DDB_G0273125 [Dictyostelium discoideum AX4]
gi|60472674|gb|EAL70625.1| hypothetical protein DDB_G0273869 [Dictyostelium discoideum AX4]
gi|60472831|gb|EAL70780.1| hypothetical protein DDB_G0273125 [Dictyostelium discoideum AX4]
Length = 734
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 26/39 (66%)
Query: 297 KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 335
K W S N S +AWGA QKN SQ+ I ++E+GVL+L
Sbjct: 655 KYDWVYTGSHNFSLSAWGAFQKNESQVSISNFEIGVLLL 693
>gi|240276898|gb|EER40409.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces capsulatus H143]
Length = 183
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 58/127 (45%), Gaps = 26/127 (20%)
Query: 292 RYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVL--ILPSAKRHGCGF 345
RY+G W + SANLS++AWG L + + +L R++E GV+ I + +
Sbjct: 69 RYSG----WTYVGSANLSESAWGRLVLDRATTKPKLNCRNWECGVVIPIRHNDEEKSSYI 124
Query: 346 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRY 402
T I S +SG TS SD G+ V+ +PVP ++P QRY
Sbjct: 125 PSTRGITTSVAESGGGNTS-------------AGSDDGSRVASVFEPTVPVPMKVPAQRY 171
Query: 403 SSEDVPW 409
D P+
Sbjct: 172 HGRDRPF 178
>gi|330792943|ref|XP_003284546.1| hypothetical protein DICPUDRAFT_148330 [Dictyostelium purpureum]
gi|325085576|gb|EGC38981.1| hypothetical protein DICPUDRAFT_148330 [Dictyostelium purpureum]
Length = 613
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 45/204 (22%), Positives = 90/204 (44%), Gaps = 19/204 (9%)
Query: 140 SFFKKFNFSSAA---VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLV 194
S+ F+FS + +++++P +S ++ G +KL++V+Q L
Sbjct: 346 SYLDDFDFSICTDNNIHIVSTIPSLSNDNSNQQNGFLKLKSVVQNYNSSNNNPDGVYSLT 405
Query: 195 YQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC--SLEGYAAGN 252
YQ S++GS+ + W + ++ + + IV+PT++ ++ + + A
Sbjct: 406 YQSSAIGSIRKNWFENFTDNLFPNLVRTEKKVS-----IVFPTLDTIQTLSNKDKNLALE 460
Query: 253 AIPSPQKNVDKDFLKKYWAKWKA-SHTGRSRAMP---HIKTFARYNGQKLAWFLLTSANL 308
+I +++ D+LKK + +G ++ +P I F N W S N
Sbjct: 461 SITIRYQDL-TDYLKKKNLLYDYFEESGHNQVIPLHSKIIIFLEENKPNSGWVYHGSHNF 519
Query: 309 SKAAWGALQKNNSQLMIRSYELGV 332
S+ +WG L S + +YE GV
Sbjct: 520 SEGSWGMLS--GSGIKTFNYETGV 541
>gi|154272585|ref|XP_001537145.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150409132|gb|EDN04588.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 478
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 75/176 (42%), Gaps = 31/176 (17%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNKPANWI-LHKPPLPISFGTHH 57
M ++DW+ + K L+I GE D E K + L PP+ H
Sbjct: 306 MWNVDWMFSKFDI--KTTRFLLIMGEKEEDKKRELENDTKSMGSVRLCFPPMEPQVNCMH 363
Query: 58 SKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLK--DQNNLSEECGFEN 111
SK MLL +P +RI+V +ANL+ DW + +++ D P K D +N + F +
Sbjct: 364 SKLMLLFHPDYLRIVVPSANLVPFDWGEQGGVMENIVFLIDLPRKSPDLDN-DPQTSFLD 422
Query: 112 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFSSAA-VRLIASVPGYHT 163
+L+ +L +N KK F+FS+ + I ++ G HT
Sbjct: 423 ELVYFLQA---------------STVNEQIIKKMLRFDFSATKDIAFIHTIGGSHT 463
>gi|443723184|gb|ELU11715.1| hypothetical protein CAPTEDRAFT_223095 [Capitella teleta]
Length = 942
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 61/304 (20%), Positives = 119/304 (39%), Gaps = 39/304 (12%)
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL---KDQNNLS--------- 104
H +LL + +R+I+ +A+L W Q W DFPL K+ + S
Sbjct: 477 HPNLILLRFKHCLRVIITSASLRRRHWEEVVQLGWTADFPLAVDKETDETSWVAMNMMDE 536
Query: 105 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 164
EE E + ++ + L+ F +L G+ + F+ S VRLI S G +
Sbjct: 537 EEARAEAQVTNFGTDLEG--FLKDLQIDGDHLLTGI---DFSVLSPCVRLITSKLGAVSQ 591
Query: 165 SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKT 224
+ + +L++++ ++ K+ + LG ++ + +S +G +
Sbjct: 592 EESENYAVARLKSLISRFPWKANSKRDNVCVS-HRLGLSNDTPLGIISDIFRTG-DRNSP 649
Query: 225 PLGIGEPLIVWPTVEDVR--CSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSR 282
P +++P+ D + CS + + +D D L + H+ +
Sbjct: 650 PFK-----LLYPSEADAKKHCSEVDGLTYEDLATDDTFIDFDIL---FHSHPFLHSSKES 701
Query: 283 AMPHIKTFARYN-------GQKLAWFLLTSANLSKAAWG---ALQKNNSQLMIRSYELGV 332
+ H +Y ++L WF+ S L +WG ++ N ++ ELGV
Sbjct: 702 LVLHANALLKYEDITDDSGSKRLGWFMFGSQVLGLKSWGDSNRRRRRNEVQILERMELGV 761
Query: 333 LILP 336
+ P
Sbjct: 762 GVFP 765
>gi|444315287|ref|XP_004178301.1| hypothetical protein TBLA_0A10020 [Tetrapisispora blattae CBS 6284]
gi|387511340|emb|CCH58782.1| hypothetical protein TBLA_0A10020 [Tetrapisispora blattae CBS 6284]
Length = 566
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 230 EPLIVWPTVEDVRCS-LEGYAAG--NAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPH 286
+P++V+PT ++++ S G AAG + I S K F K+ K T S + +
Sbjct: 405 QPMVVFPTTQEIKDSPTHGDAAGWFHNIGSNSFESQKIFYKQGPNVSKERGTTPSHSKYY 464
Query: 287 IKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 344
+K+ + L W + TS+NLS +AWG +K+ R++E+G++I P ++G
Sbjct: 465 MKSTCTDEDPFKYLDWCIYTSSNLSMSAWGTDRKD-----PRNFEIGIVIKP---KNGGK 516
Query: 345 FSCTS 349
C S
Sbjct: 517 LKCHS 521
>gi|401885055|gb|EJT49186.1| Ran GTPase activator [Trichosporon asahii var. asahii CBS 2479]
Length = 1170
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 14/140 (10%)
Query: 55 THHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN-- 111
+ H K + Y G +R+ + TAN++ DW +++QD L ++ S + +
Sbjct: 428 SEHQKWAFVFYKTGRLRVAIMTANMVDYDWERIENTVFVQDV-LPNKAGHSPDWHLPDFP 486
Query: 112 ----DLIDYLSTLKWPEFSANLPAHGN---FKINPSF--FKKFNFSSAAVRLIASVPGYH 162
DL +L K EF G+ +PS+ F K+++S RL+ S+ G +
Sbjct: 487 QQFADLFKHLKIHKGIEFMRQTHPLGSQVPISSDPSYTDFGKWDWSRVKARLVISIAGKY 546
Query: 163 TG-SSLKKWGHMKLRTVLQE 181
G + KWG +L V+QE
Sbjct: 547 EGFHDMSKWGIGRLGQVVQE 566
>gi|171686654|ref|XP_001908268.1| hypothetical protein [Podospora anserina S mat+]
gi|170943288|emb|CAP68941.1| unnamed protein product [Podospora anserina S mat+]
Length = 438
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 41/81 (50%), Gaps = 2/81 (2%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 62
D DW+L + ++ L+ + +S+ E M+ N P + I P + G HSK ML
Sbjct: 276 DEDWMLSKIDI-SRTKLYLIAYAKSEAQNE-MRNNVPKSRIRFCFPAMQAVGAMHSKLML 333
Query: 63 LIYPRGVRIIVHTANLIHVDW 83
L Y +R++V T N + DW
Sbjct: 334 LKYEGYLRVVVPTGNFMSYDW 354
>gi|296810424|ref|XP_002845550.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma otae CBS 113480]
gi|238842938|gb|EEQ32600.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma otae CBS 113480]
Length = 672
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 37/77 (48%), Gaps = 6/77 (7%)
Query: 47 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQDFPLKDQ-- 100
PP+ HSK MLL +P +RI+ TANL DW K L++ D P K
Sbjct: 376 PPMEGEVNCMHSKLMLLFHPGYLRIVTPTANLDPYDWGEKGGEMENMLFLIDLPRKSDGG 435
Query: 101 NNLSEECGFENDLIDYL 117
+ + F ++L+ +L
Sbjct: 436 TGIDDATPFRDELVYFL 452
>gi|406694621|gb|EKC97945.1| Ran GTPase activator [Trichosporon asahii var. asahii CBS 8904]
Length = 1114
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 64/139 (46%), Gaps = 14/139 (10%)
Query: 56 HHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN--- 111
H K + Y G +R+ + TAN++ DW +++QD L ++ S + +
Sbjct: 381 EHQKWAFIFYKTGRLRVAIMTANMMDYDWERIENTVFLQDV-LPNKAGHSPDWHLPDFPQ 439
Query: 112 ---DLIDYLSTLKWPEFSAN---LPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHT 163
DL +L K EF L + +PS+ F K+++S RL+ S+ G +
Sbjct: 440 QFADLFKHLKIHKGIEFMRQKHPLGSQVPISSDPSYTDFGKWDWSRVKARLVISISGKYE 499
Query: 164 G-SSLKKWGHMKLRTVLQE 181
G + KWG +L V+QE
Sbjct: 500 GFHDMSKWGIGRLGQVVQE 518
>gi|294944973|ref|XP_002784507.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
gi|239897573|gb|EER16303.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
Length = 230
Score = 41.2 bits (95), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 54/123 (43%), Gaps = 17/123 (13%)
Query: 54 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--------PLKDQNNLSE 105
GT H+K +++ + +R+ + ++NL DW SQ +W+ DF P + +
Sbjct: 111 GTMHAKLIIIERAQALRVCISSSNLTPQDWEGVSQCIWVADFKAANDFEAPARKRVKPDH 170
Query: 106 ECGFENDLIDYLSTLKWPEFSANLPAHG---NFKINPSFFKKFNFS-SAAVRLIASVPGY 161
F + L ++ T F ++P ++ + +FN V LIAS PGY
Sbjct: 171 TSDFGDQLARFIET-----FFRSIPDSSSLWSYWVKVLTGSRFNVKLPKGVELIASAPGY 225
Query: 162 HTG 164
G
Sbjct: 226 WKG 228
>gi|330841055|ref|XP_003292520.1| hypothetical protein DICPUDRAFT_89860 [Dictyostelium purpureum]
gi|325077216|gb|EGC30943.1| hypothetical protein DICPUDRAFT_89860 [Dictyostelium purpureum]
Length = 658
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 96/230 (41%), Gaps = 37/230 (16%)
Query: 138 NPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR-------------TVLQECTF 184
N F +F+FS++ +LI S+PG + +S K G +LR TV +
Sbjct: 385 NVQFLDQFDFSTSKAQLIISIPGEYKHTS-NKMGLERLRYHVNNYYKTQENNTVYGDDVK 443
Query: 185 EKGFKKSPLVYQFSSLG---SLDEKWMAELS-----SSMSSGFSEDKTPLGIGEPL---I 233
+ +K YQ SS+G + +++ +++++ + + G+ I
Sbjct: 444 SQSIQKI-FYYQSSSVGLSTFFKQAFVSNFKVNNNITTINTFHTMNSNNNNNGKDKSFHI 502
Query: 234 VWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY-WAKWKASHTGRSRAMPHIKTFAR 292
++PT V+ + G + D + KY ++ ++ H R + H K
Sbjct: 503 IYPTARWVKETQAKQKLGKVLSLAYDIYD---INKYDFSYFQIKHGYRKNTVSHSKIIVG 559
Query: 293 YNGQKLA-------WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 335
+ L W S N+S AAWG+ S L I +YE+G+L+L
Sbjct: 560 VSQNSLKNKELKYDWCYSGSHNISSAAWGSPSSRTSDLSILNYEMGILLL 609
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 14/65 (21%)
Query: 45 HKP-PLPISFGT-------------HHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL 90
HKP P PI F H+K ++L+Y +RI V +AN +++N SQ +
Sbjct: 206 HKPGPHPIPFSNCTIIHPVLSGDQIFHAKLLVLVYDTWIRIAVTSANPSSYEYSNLSQSI 265
Query: 91 WMQDF 95
W QDF
Sbjct: 266 WYQDF 270
>gi|303322280|ref|XP_003071133.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240110832|gb|EER28988.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 608
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 99/231 (42%), Gaps = 45/231 (19%)
Query: 144 KFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLV--YQFSSL 200
+F+F +A + ++ G HTGS WG + + + T PL Y SSL
Sbjct: 326 EFDFGKTAGFAFVHTIGGSHTGSD---WGKTGVCGLGKAVTMLGLQTPQPLKLDYVTSSL 382
Query: 201 GSLDEKWM-------------AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTV 238
GSL++++M EL+ S F DK + + + LI +P++
Sbjct: 383 GSLNDQFMRSMYLAAQGDNGLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSL 442
Query: 239 EDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQK 297
+ V+ S + I K ++ ++ + S + R + H KT F R + K
Sbjct: 443 KTVQGSRARPSGAGTICFQSKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGK 500
Query: 298 L----------AWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLI 334
+ W + SANLS++AWG L + S +L R++E GV+I
Sbjct: 501 IIGDANTTAYQGWTYVGSANLSESAWGRLVIDRSTTKPKLNCRNWECGVII 551
>gi|156603320|ref|XP_001618811.1| hypothetical protein NEMVEDRAFT_v1g224792 [Nematostella vectensis]
gi|156200471|gb|EDO26711.1| predicted protein [Nematostella vectensis]
Length = 208
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 308 LSKAAWGALQKNNSQLMIRSYELGVLILPS 337
+S G L+K SQLMIRSYE+GVL LP+
Sbjct: 1 MSGYTRGVLEKGGSQLMIRSYEIGVLFLPA 30
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 20/24 (83%)
Query: 314 GALQKNNSQLMIRSYELGVLILPS 337
G L+K SQLMIRSYE+GVL LP+
Sbjct: 51 GVLEKGGSQLMIRSYEIGVLFLPA 74
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/24 (70%), Positives = 20/24 (83%)
Query: 314 GALQKNNSQLMIRSYELGVLILPS 337
G L+K SQLMIRSYE+GVL LP+
Sbjct: 95 GVLEKGGSQLMIRSYEIGVLFLPA 118
>gi|323454653|gb|EGB10523.1| hypothetical protein AURANDRAFT_62499 [Aureococcus anophagefferens]
Length = 1848
Score = 40.8 bits (94), Expect = 1.3, Method: Composition-based stats.
Identities = 29/73 (39%), Positives = 38/73 (52%), Gaps = 13/73 (17%)
Query: 285 PHIKTFARYNGQ-KLAWFLLTSANLSKAAWGALQKNNS-----------QLMIRSYELGV 332
PH+ + ++G+ + LLTSANLS AAWG + N L IRS+ELGV
Sbjct: 1744 PHLMLYVLHDGRGAVRRALLTSANLSAAAWGRRRSANDPENADACDAAGALEIRSFELGV 1803
Query: 333 LILPSAKRHGCGF 345
+ P A G GF
Sbjct: 1804 CV-PVAPDAGEGF 1815
>gi|347836693|emb|CCD51265.1| hypothetical protein [Botryotinia fuckeliana]
Length = 638
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 76/356 (21%), Positives = 142/356 (39%), Gaps = 85/356 (23%)
Query: 2 VDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAM 61
+D DW+ K+ + V+ +++ + K P + PP+ + HSK
Sbjct: 309 IDSDWIRSKIQPSTKV--IWVLQAKTEAEKMNFKSLAPETYRFCFPPMEGNVNIMHSKLQ 366
Query: 62 LLIYPRGVRIIVHTANLIHVDWNNKSQGL-----WMQDFP-LKDQNNLSEE--CGFENDL 113
+L +P +R+++ +ANL DW +S G+ ++ D P L + S++ F DL
Sbjct: 367 ILAHPTHLRLVIPSANLTPYDW-GESGGILENVVFLIDLPRLPNGEKASDDQLTPFAQDL 425
Query: 114 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP--GYHTGSSLKKWG 171
+ +L + + P R I S+ G H G++L++ G
Sbjct: 426 LHFLHAM---------------TLTP-------------RTIESLKRGGSHFGTNLQRTG 457
Query: 172 HMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWM-------------------- 208
+ L + C G PL ++ +S+G+LD++++
Sbjct: 458 YPGLGS----CVRSLGLNTDHPLEIEYVTASIGNLDDRFLRTMYLASQGDNGSKEYKWRT 513
Query: 209 -----AELSSSMSSGFSEDKTPLGIGEPLIVW-PTVEDVRCSLEGYAAGNAIPSPQK--N 260
+++ + M + SE+ IG V+ P+ + V+ S G A I K N
Sbjct: 514 EKPARSKMETVMETQLSEE-----IGRRFRVYFPSEQTVKESKGGTNAAGTICFRSKWYN 568
Query: 261 VDKDFLKKYWAKWKASHTG--RSRAMPHIKTFARYNGQK-LAWFLLTSANLSKAAW 313
F ++ ++ G M ++T K +AW + SANLS++AW
Sbjct: 569 ASA-FPRELMRDCQSRREGLLMHNKMLFVRTRRTQKSPKPVAWVYVGSANLSESAW 623
>gi|85109758|ref|XP_963073.1| hypothetical protein NCU06222 [Neurospora crassa OR74A]
gi|28924724|gb|EAA33837.1| predicted protein [Neurospora crassa OR74A]
Length = 657
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 54 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LWMQDFPLKDQNNLSEECG-F 109
G HSK LL Y +RI+V +ANL+ DW L++ D PL D +++ E F
Sbjct: 316 GCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILFIIDLPLLDDPDVTRELTHF 375
Query: 110 ENDLIDYL 117
+L+ +L
Sbjct: 376 GEELLYFL 383
>gi|119196585|ref|XP_001248896.1| hypothetical protein CIMG_02667 [Coccidioides immitis RS]
Length = 629
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 98/229 (42%), Gaps = 41/229 (17%)
Query: 144 KFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGS 202
+F+F +A + ++ G HTGS K G L + E + L Y SSLGS
Sbjct: 347 EFDFGKTAGFAFVHTIGGSHTGSYWGKTGVCGLGKAVTMLGLETP-QPLKLDYITSSLGS 405
Query: 203 LDEKWM-------------AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTVED 240
L++++M EL+ S F DK + + + LI +P+++
Sbjct: 406 LNDQFMRSMYLAAQGDNGLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSLKT 465
Query: 241 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQKL- 298
V+ S + I K ++ ++ + S + R + H KT F R + K+
Sbjct: 466 VQGSRARPSGAGTICFQSKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGKII 523
Query: 299 ---------AWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLI 334
W + SANLS++AWG L + S +L R++E GV+I
Sbjct: 524 GDANTTAYQGWTYVGSANLSESAWGRLVIDRSTTKPKLNCRNWECGVII 572
>gi|435853317|ref|YP_007314636.1| putative membrane-anchored protein [Halobacteroides halobius DSM
5150]
gi|433669728|gb|AGB40543.1| putative membrane-anchored protein [Halobacteroides halobius DSM
5150]
Length = 372
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 4/69 (5%)
Query: 21 LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 80
L++H DGT MKR K N + P P GT AMLL Y +G +IV H
Sbjct: 233 LIVHAYPDGTAPGMKRIKKLNLQAQRIPAP---GTSEDIAMLLAYEKGAELIVAVGTHTH 289
Query: 81 -VDWNNKSQ 88
+D+ K +
Sbjct: 290 MIDFLEKGR 298
>gi|350290891|gb|EGZ72105.1| phospholipase D/nuclease [Neurospora tetrasperma FGSC 2509]
Length = 657
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 54 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LWMQDFPLKDQNNLSEECG-F 109
G HSK LL Y +RI+V +ANL+ DW L++ D PL D +++ E F
Sbjct: 315 GCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILFIIDLPLLDDPDVTRELTHF 374
Query: 110 ENDLIDYL 117
+L+ +L
Sbjct: 375 GEELLYFL 382
>gi|336469464|gb|EGO57626.1| hypothetical protein NEUTE1DRAFT_81347 [Neurospora tetrasperma FGSC
2508]
Length = 656
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 54 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LWMQDFPLKDQNNLSEECG-F 109
G HSK LL Y +RI+V +ANL+ DW L++ D PL D +++ E F
Sbjct: 315 GCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILFIIDLPLLDDPDVTRELTHF 374
Query: 110 ENDLIDYL 117
+L+ +L
Sbjct: 375 GEELLYFL 382
>gi|226294747|gb|EEH50167.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides brasiliensis Pb18]
Length = 589
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 39/87 (44%), Gaps = 5/87 (5%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANW---ILHKPPLPISFGTHHSK 59
D DWL + K ++I GE + + N + L PP+ HSK
Sbjct: 247 DADWLFSKFDI--KKSRFILIMGEKEEDKKRELENDTKSMGSVRLCFPPMEPQVNCMHSK 304
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNK 86
MLL + +RI++ +ANLI DW K
Sbjct: 305 LMLLFHLNHLRIVIPSANLIPFDWGEK 331
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 56/123 (45%), Gaps = 22/123 (17%)
Query: 296 QKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNI 351
Q W + SANLS++AWG L + S +L R++E GV+I + G G
Sbjct: 468 QYAGWAYVGSANLSESAWGRLVLDRSTTKPKLNCRNWECGVVI--PIRHRGSG------Q 519
Query: 352 VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV-----YLPVPYELPPQRYSSED 406
+ S+ SGST + KL + S S++V +PVP +P + Y D
Sbjct: 520 LSSQPSSGST-----LRPKLEPESESASVTVSDGSKLVSVFEPRIPVPMRVPGEPYQPGD 574
Query: 407 VPW 409
PW
Sbjct: 575 KPW 577
>gi|363750352|ref|XP_003645393.1| hypothetical protein Ecym_3064 [Eremothecium cymbalariae
DBVPG#7215]
gi|356889027|gb|AET38576.1| Hypothetical protein Ecym_3064 [Eremothecium cymbalariae
DBVPG#7215]
Length = 561
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 103/489 (21%), Positives = 177/489 (36%), Gaps = 103/489 (21%)
Query: 3 DIDWLLPACP----VLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 58
++DW+L P V+ VI S G +K +L PP F +HHS
Sbjct: 112 EMDWVLSLIPGHVKVVVTAQEGTVIPASSGGGGHDVKFR-----MLRMPP----FCSHHS 162
Query: 59 KAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE---ECGFENDLI 114
K ++ Y R R+++ + N ++ Q +W+ P+ + S + F N+L+
Sbjct: 163 KLVVAFYKNRSCRLMMPSNNFTAMESQIPQQMVWVS--PILEYGGGSSAGPQSLFRNELV 220
Query: 115 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 174
YL P+++ +++ FK + A + S PG G L +
Sbjct: 221 RYLERYPNPDYTLIS------RLSVIDFKPLD--DTAAEFVFSAPG-GGGEDLSGLPLLY 271
Query: 175 LRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG----- 229
R + + K+ + F S+ SS + F+ PL G
Sbjct: 272 QRLQITPPRIRQAACKNQHQHYFCQTSSIGSPVNYRASSDPRNLFTNLMVPLFSGSLSSL 331
Query: 230 ----------------------EPLIVWPTVED-VRCSLEGYAAG--------NAIPSPQ 258
P I++PTV++ +C+ +G I Q
Sbjct: 332 PKSARSCPGAEFIETTLRVKQIHPHILYPTVKEFTQCTPGWLCSGWFHFHYDKQPIAKMQ 391
Query: 259 KNV--DKDFLKKYWAKWKASHTGRSRAMP---------HIKTFARY--------NGQKLA 299
+ + +FL+K + G + A+P H K F ++ N +
Sbjct: 392 YTMLKENNFLEK--QQEYELKPGSTIALPIIRRDKVPCHTKFFFKFTSASARSWNTEDCD 449
Query: 300 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 359
W L TSANLS AWG + ++YE GVL H C + +V ++ +
Sbjct: 450 WALFTSANLSTHAWG----KPPSYVPKNYECGVLY------HSCE-TIKVQVVSAKDIAY 498
Query: 360 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKD 419
S S ++++ S+ + VV + P+ LP YS D + Y + D
Sbjct: 499 SQNRSSHHRSQI-------STSSSRLKTVVNIMTPFWLPTVPYSELDQAFCASTNYVEFD 551
Query: 420 VYGQVWPRH 428
G + H
Sbjct: 552 QNGMQYTCH 560
>gi|257095684|ref|YP_003169325.1| cytochrome c oxidase subunit I [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257048208|gb|ACV37396.1| cytochrome c oxidase, subunit I [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
Length = 535
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 6 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 65
WLLP L +P +L + G DG + W L+ PL + G A+ I+
Sbjct: 123 WLLPPAAALLTLPFILALFGIGDGAVN-------TGWTLYA-PLSVQGGMGVDFAIFSIH 174
Query: 66 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
GV I+ + N+I +N ++ G+ M PL
Sbjct: 175 ILGVSSILGSINIIVTIFNLRAPGMTMMKLPL 206
>gi|71907102|ref|YP_284689.1| cytochrome c oxidase, subunit I [Dechloromonas aromatica RCB]
gi|71846723|gb|AAZ46219.1| Cytochrome c oxidase, subunit I [Dechloromonas aromatica RCB]
Length = 531
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 6 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 65
WLLP +L +P L + G DG L W + PL + G A+L ++
Sbjct: 119 WLLPPAAILLTLPFSLALFGIGDGALA-------TGWTFYA-PLSVQGGMGVDFAILAVH 170
Query: 66 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
G+ I+ + N+I +N ++ G+ M PL
Sbjct: 171 ILGISSIMGSINIIVTIFNMRAPGMTMMKLPL 202
>gi|253995926|ref|YP_003047990.1| cytochrome c oxidase subunit I [Methylotenera mobilis JLW8]
gi|253982605|gb|ACT47463.1| cytochrome c oxidase, subunit I [Methylotenera mobilis JLW8]
Length = 530
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 6 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 65
WLLP +L +P L + G DG L W + PPL I G A+ ++
Sbjct: 118 WLLPPSAILLTLPFTLALFGIGDGALA-------TGWTFY-PPLSIQGGIGVDFAIFAVH 169
Query: 66 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
G+ ++ + N+I +N ++ G+ + P+
Sbjct: 170 LLGISSVLGSINIIVTLFNMRAPGMTLMKMPM 201
>gi|396484884|ref|XP_003842038.1| hypothetical protein LEMA_P077980.1 [Leptosphaeria maculans JN3]
gi|312218614|emb|CBX98559.1| hypothetical protein LEMA_P077980.1 [Leptosphaeria maculans JN3]
Length = 588
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 52/228 (22%), Positives = 90/228 (39%), Gaps = 31/228 (13%)
Query: 1 MVDIDWLLPACPVLAKIPHVLVIHGESDGT----LEHMKRNKPANWILHKPPLPISFGTH 56
M D DWL + K+ + V++ + L+ MK N LH PP+ +
Sbjct: 359 MWDADWLHKKLDPI-KVKQIWVMNAKGKDVQKRWLQEMKDTGVPNLTLHFPPMHGMIQSM 417
Query: 57 HSKAMLLIYPRGVRIIVHTANLIHVDW----NNKSQGLWMQDFPLKDQNNLSEECG---- 108
HSK +LL + +R V TAN+ +DW N+ G+ L D L++
Sbjct: 418 HSKFLLLFGKKKLRFAVPTANMTCIDWGEVANDWQPGVMENSVFLIDLPRLADGVSADHA 477
Query: 109 ----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHT 163
F +LI +L + P K+ F+FS A + + S+ G H
Sbjct: 478 KLTKFGKELIYFLEQQELPR-----------KVIDGVL-NFDFSETAHLAFVHSIGGSHD 525
Query: 164 GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 211
++ G L ++ + Y SS+G++++ + +L
Sbjct: 526 PTTAHPTGLPGLAAAVRGLNL-GNVNNLEIDYAASSIGAVNDNLLQQL 572
>gi|295668965|ref|XP_002795031.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226285724|gb|EEH41290.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 668
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 39/87 (44%), Gaps = 5/87 (5%)
Query: 3 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANW---ILHKPPLPISFGTHHSK 59
D DWL + K ++I GE + + N + L PP+ HSK
Sbjct: 253 DADWLFSKFDI--KKSRFILIMGEKEEDKKRELENDTKSMGSVRLCFPPMEPQVNCMHSK 310
Query: 60 AMLLIYPRGVRIIVHTANLIHVDWNNK 86
MLL + +RI++ +ANLI DW K
Sbjct: 311 LMLLFHLNYLRIVIPSANLIPFDWGEK 337
>gi|322711943|gb|EFZ03516.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Metarhizium anisopliae ARSEF 23]
Length = 496
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 4/53 (7%)
Query: 296 QKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCG 344
+KLAW + SANLS++AWG + + + ++M R++E GV++ A G G
Sbjct: 349 EKLAWAYVGSANLSESAWGRVVTDRASGQKKMMCRNWECGVVLPVRAFEQGSG 401
>gi|297539461|ref|YP_003675230.1| cytochrome c oxidase subunit I [Methylotenera versatilis 301]
gi|297258808|gb|ADI30653.1| cytochrome c oxidase, subunit I [Methylotenera versatilis 301]
Length = 530
Score = 39.3 bits (90), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 23/92 (25%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 6 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 65
WLLP +L +P L + G DG L W + PPL + G A+ ++
Sbjct: 118 WLLPPSAILLTLPFTLALFGIGDGALA-------TGWTFY-PPLSVQGGIGVDFAIFAVH 169
Query: 66 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 97
G+ ++ + N+I +N ++ G+ + P+
Sbjct: 170 LLGISSVLGSINVIVTVFNMRAPGMTLMKMPM 201
>gi|401626756|gb|EJS44678.1| tdp1p [Saccharomyces arboricola H-6]
Length = 539
Score = 39.3 bits (90), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 9/50 (18%)
Query: 298 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI----LPSAKRHGC 343
L W L TSANLS+ AWG + K R+YE+GVL LP ++ C
Sbjct: 451 LEWCLYTSANLSQTAWGTISKKP-----RNYEVGVLYHSGRLPGTRKITC 495
>gi|322700189|gb|EFY91945.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Metarhizium acridum CQMa 102]
Length = 432
Score = 38.9 bits (89), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 4/53 (7%)
Query: 296 QKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCG 344
+K+AW + SANLS++AWG L + + ++M R++E GV++ A G G
Sbjct: 290 KKVAWAYVGSANLSESAWGRLVTDRASGQKKMMCRNWECGVVLPVRAFEQGSG 342
>gi|329901801|ref|ZP_08272900.1| hypothetical protein IMCC9480_3399 [Oxalobacteraceae bacterium
IMCC9480]
gi|327549010|gb|EGF33621.1| hypothetical protein IMCC9480_3399 [Oxalobacteraceae bacterium
IMCC9480]
Length = 658
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 285 PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 334
PH K + GQ L+TSAN S +AWG ++ + L I+++ELGV +
Sbjct: 343 PHAKVYCFTRGQSRR-LLITSANFSPSAWG-IENRHGSLTIKNFELGVCL 390
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.424
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,498,785,969
Number of Sequences: 23463169
Number of extensions: 323144908
Number of successful extensions: 650082
Number of sequences better than 100.0: 501
Number of HSP's better than 100.0 without gapping: 358
Number of HSP's successfully gapped in prelim test: 143
Number of HSP's that attempted gapping in prelim test: 647632
Number of HSP's gapped (non-prelim): 890
length of query: 437
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 291
effective length of database: 8,933,572,693
effective search space: 2599669653663
effective search space used: 2599669653663
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)
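
Note on reading the statistics above: the bit scores in this report can be recomputed from the raw scores with the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The sketch below is only an illustration, not part of the BLAST output. It assumes the second Lambda/K pair (0.267, 0.0410) is the gapped one and the first (0.318, 0.133) the ungapped one, which is the usual layout of this footer. The function names are made up for the example. The E-value estimate uses the printed effective search space and is approximate: the Expect values reported per hit also reflect compositional matrix adjustment, so they are not expected to match exactly.

```python
import math

# Values copied from the footer of the report.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.318, 0.133   # first "Lambda K H" row (assumed ungapped)
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410      # second "Lambda K H" row (assumed gapped)
QUERY_LENGTH = 437
EFFECTIVE_HSP_LENGTH = 146
EFFECTIVE_DB_LENGTH = 8_933_572_693          # as printed in the report
EFFECTIVE_SEARCH_SPACE = 2_599_669_653_663   # as printed in the report


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def approx_expect(bits, search_space):
    """Rough E-value estimate: search_space * 2^(-bits)."""
    return search_space * 2.0 ** (-bits)


# The effective query length is the query length minus the effective HSP length.
effective_query_length = QUERY_LENGTH - EFFECTIVE_HSP_LENGTH

# The printed search space is the product of the two printed effective lengths.
search_space_check = effective_query_length * EFFECTIVE_DB_LENGTH

if __name__ == "__main__":
    print("effective query length:", effective_query_length)
    print("search space matches report:", search_space_check == EFFECTIVE_SEARCH_SPACE)
    for raw in (91, 90, 89, 78):
        bits = bit_score(raw, GAPPED_LAMBDA, GAPPED_K)
        e_est = approx_expect(bits, EFFECTIVE_SEARCH_SPACE)
        print(f"raw {raw}: {bits:.1f} bits, approximate E = {e_est:.2g}")
    # S1 is reported against the ungapped parameters.
    print(f"S1 raw 41: {bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K):.1f} bits")
```

With these inputs the gapped conversion gives the same one-decimal bit scores the report prints for raw scores 91, 90, 89 and for S2 (78), and the ungapped conversion gives the value printed for S1 (41).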