BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006900
(626 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359483320|ref|XP_002265078.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Vitis vinifera]
Length = 621
Score = 938 bits (2424), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/636 (71%), Positives = 521/636 (81%), Gaps = 25/636 (3%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
MS ++IG+LVPL+ NL ED S PKLP+ G NVIGR +I VSDKRLSRKH+TL AS +GS
Sbjct: 1 MSLSQIGFLVPLNRNLEEDTSTPKLPIPTGANVIGRNSISVSDKRLSRKHLTLIASGNGS 60
Query: 61 ASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSND 120
VV+GTNPVVV SG+QRKKL + E I + DIIELIPGH+FFKYVT++
Sbjct: 61 VDAVVEGTNPVVVASGNQRKKLRTGEKAVITNDDIIELIPGHYFFKYVTVA--------- 111
Query: 121 GATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSC 180
GE KK D Q+ E+ N +A+ +F + +D LP T+RLLRV+ LPAWANTS
Sbjct: 112 ----GEKCEKKGNSMDAQNMES--NEVKAIRHFSIPKDNLPLTYRLLRVKDLPAWANTSS 165
Query: 181 VSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPAN 240
VSIRDVIQGD+++A+LSNYMVDIDWLL +CP LAKIPHVLVIHGE DGTL+HMK+NKP N
Sbjct: 166 VSIRDVIQGDVLIAVLSNYMVDIDWLLSSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPN 225
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
WILHKPPLPISFGTHHSKAMLL+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K Q
Sbjct: 226 WILHKPPLPISFGTHHSKAMLLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQK 285
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
LS+ C FENDLIDYLS LKWPEF+ANLPA G+F IN SFFKKF++S+A VRLIASVPGY
Sbjct: 286 ELSKGCAFENDLIDYLSVLKWPEFTANLPALGSFNINSSFFKKFDYSNAVVRLIASVPGY 345
Query: 361 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
HTGS+LKKWGHMKL +VLQEC F+K F+KSPL YQFSSLGSLDEKWM EL+SSMSSG +
Sbjct: 346 HTGSNLKKWGHMKLCSVLQECIFDKEFQKSPLAYQFSSLGSLDEKWMTELASSMSSGSCD 405
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRS 480
DKTPLG+G+PLI+WPTVEDVRCSLEGYAAGNAIPSPQKNV+K+FLKKYWAKWKA+HTGR
Sbjct: 406 DKTPLGLGKPLIIWPTVEDVRCSLEGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTGRC 465
Query: 481 RAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRH 530
RAMPHIKT+ RYNGQ LA KAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 466 RAMPHIKTYTRYNGQNLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPINR 525
Query: 531 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 590
G GFSCT N PS+ K G +E ++ Q+TKLVTLTW G+ + +SSEV+ LPVPYELPP++
Sbjct: 526 GQGFSCTDNGSPSKNKCGLSENTKSQRTKLVTLTWEGNRSSDSSSEVIPLPVPYELPPKQ 585
Query: 591 YSSEDVPWSWDKRYTKKDVYGQVWPRHFQLYAFQDS 626
YSSEDVPWSWD+RY KKDV GQVWPRH QLY+ DS
Sbjct: 586 YSSEDVPWSWDRRYYKKDVCGQVWPRHVQLYSSPDS 621
>gi|302144065|emb|CBI23170.3| unnamed protein product [Vitis vinifera]
Length = 678
Score = 937 bits (2422), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/678 (68%), Positives = 536/678 (79%), Gaps = 52/678 (7%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
MS ++IG+LVPL+ NL ED S PKLP+ G NVIGR +I VSDKRLSRKH+TL AS +GS
Sbjct: 1 MSLSQIGFLVPLNRNLEEDTSTPKLPIPTGANVIGRNSISVSDKRLSRKHLTLIASGNGS 60
Query: 61 ASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLS--RSQKRVS 118
VV+GTNPVVV SG+QRKKL + E I + DIIELIPGH+FFKYVT++ + +K+ +
Sbjct: 61 VDAVVEGTNPVVVASGNQRKKLRTGEKAVITNDDIIELIPGHYFFKYVTVAGEKCEKKGN 120
Query: 119 NDGATNGE-----LSSKKMRQ-----------QDEQDNE---------NGKN-------- 145
+ A N E LS K+MRQ Q E +N+ GK+
Sbjct: 121 SMDAQNMESNEVSLSRKRMRQVSEDEAFARKLQAEMENDVLVQERSLVTGKSGYSQASTA 180
Query: 146 -------SEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSN 198
+ EA+ +F + +D LP T+RLLRV+ LPAWANTS VSIRDVIQGD+++A+LSN
Sbjct: 181 SIPSSHMNSEAIRHFSIPKDNLPLTYRLLRVKDLPAWANTSSVSIRDVIQGDVLIAVLSN 240
Query: 199 YMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 258
YMVDIDWLL +CP LAKIPHVLVIHGE DGTL+HMK+NKP NWILHKPPLPISFGTHHSK
Sbjct: 241 YMVDIDWLLSSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPNWILHKPPLPISFGTHHSK 300
Query: 259 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 318
AMLL+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K Q LS+ C FENDLIDYLS
Sbjct: 301 AMLLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQKELSKGCAFENDLIDYLSV 360
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 378
LKWPEF+ANLPA G+F IN SFFKKF++S+A VRLIASVPGYHTGS+LKKWGHMKL +VL
Sbjct: 361 LKWPEFTANLPALGSFNINSSFFKKFDYSNAVVRLIASVPGYHTGSNLKKWGHMKLCSVL 420
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
QEC F+K F+KSPL YQFSSLGSLDEKWM EL+SSMSSG +DKTPLG+G+PLI+WPTVE
Sbjct: 421 QECIFDKEFQKSPLAYQFSSLGSLDEKWMTELASSMSSGSCDDKTPLGLGKPLIIWPTVE 480
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 498
DVRCSLEGYAAGNAIPSPQKNV+K+FLKKYWAKWKA+HTGR RAMPHIKT+ RYNGQ LA
Sbjct: 481 DVRCSLEGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTYTRYNGQNLA 540
Query: 499 ----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 548
KAAWGALQKNNSQLMIRSYELGVL LPS G GFSCT N PS+ K G
Sbjct: 541 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPINRGQGFSCTDNGSPSKNKCG 600
Query: 549 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKD 608
+E ++ Q+TKLVTLTW G+ + +SSEV+ LPVPYELPP++YSSEDVPWSWD+RY KKD
Sbjct: 601 LSENTKSQRTKLVTLTWEGNRSSDSSSEVIPLPVPYELPPKQYSSEDVPWSWDRRYYKKD 660
Query: 609 VYGQVWPRHFQLYAFQDS 626
V GQVWPRH QLY+ DS
Sbjct: 661 VCGQVWPRHVQLYSSPDS 678
>gi|255554997|ref|XP_002518536.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
gi|223542381|gb|EEF43923.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
Length = 665
Score = 924 bits (2389), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/633 (70%), Positives = 510/633 (80%), Gaps = 40/633 (6%)
Query: 5 KIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLV 64
KIG+LVPL NL ED S+PK+ LS+GPN IGR+++ VSDKRLSR H++LT S DGSA L
Sbjct: 62 KIGFLVPLKLNLEEDTSIPKISLSEGPNAIGRSHVSVSDKRLSRNHLSLTTSVDGSAFLT 121
Query: 65 VDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATN 124
+GTNPVV+KSGDQRKKLS E SI GD+IELIPGHHFFKY +G N
Sbjct: 122 PEGTNPVVIKSGDQRKKLSPGEKASINSGDVIELIPGHHFFKY----------EGEGECN 171
Query: 125 GELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIR 184
G KNSEEA+ F+V+ DKLP TFRL++V+GLPAWANTSCVSI
Sbjct: 172 G-----------------AKNSEEAIGKFNVNDDKLPLTFRLMKVKGLPAWANTSCVSIT 214
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH 244
DVIQGDI+ A+LSNYMVDIDWL+ ACP LAK+P+VLV+HGE DGTLEHMKR KPANWILH
Sbjct: 215 DVIQGDIVFAVLSNYMVDIDWLMSACPALAKVPNVLVLHGEGDGTLEHMKRTKPANWILH 274
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
KPPLPISFGTHHSKAMLL+YPRG+RIIVHTANLI+VDWNNK+QGLWMQDFP KD+ + ++
Sbjct: 275 KPPLPISFGTHHSKAMLLVYPRGMRIIVHTANLIYVDWNNKTQGLWMQDFPWKDEKSQTK 334
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
CGFENDL+DYL+TLKWPEF+ LPA G+F INPSFFKKF++S+AAVRLIASVPGYHTG
Sbjct: 335 GCGFENDLVDYLNTLKWPEFTVKLPALGSFTINPSFFKKFDYSTAAVRLIASVPGYHTGP 394
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
+LKKWGHMKLR+VLQECTF K FK SPL YQFSSLGSLD KWM EL++S+SSG SED+TP
Sbjct: 395 NLKKWGHMKLRSVLQECTFRKEFKNSPLAYQFSSLGSLDAKWMTELATSLSSGLSEDRTP 454
Query: 425 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMP 484
LG+GEP I+WPTVEDVRCSLEGYAAGNAIPSP KNV+KD LKKYW+KWKA+H+GR RAMP
Sbjct: 455 LGLGEPRIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEKDILKKYWSKWKATHSGRCRAMP 514
Query: 485 HIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCG 533
HIKTF RYNGQKLA KAAWGALQKNNSQLMIRSYELGVL LPS+ K HGC
Sbjct: 515 HIKTFTRYNGQKLAWLLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSSYKNHGCR 574
Query: 534 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 593
SCT + SE + G S+ KT+LVTL W G D SS+V+ LPVPYELPPQ YSS
Sbjct: 575 LSCTDHGARSEDEYGLLADSEEPKTELVTLMWQGPKD--PSSQVIPLPVPYELPPQPYSS 632
Query: 594 EDVPWSWDKRYTKKDVYGQVWPRHFQLYAFQDS 626
EDVPWSWD+RY+KKDVYGQVWPR QLY DS
Sbjct: 633 EDVPWSWDRRYSKKDVYGQVWPRLVQLYTSLDS 665
>gi|449479663|ref|XP_004155668.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cucumis sativus]
Length = 959
Score = 887 bits (2292), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/643 (67%), Positives = 500/643 (77%), Gaps = 30/643 (4%)
Query: 5 KIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLV 64
K+GYLVPLD NL DNS K+ LS+GPN IGR+N+ VS+KR+SRKHITLT S DGSA L+
Sbjct: 318 KVGYLVPLDKNLEVDNSGLKIRLSEGPNSIGRSNVLVSEKRISRKHITLTTSTDGSAKLL 377
Query: 65 VDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTL---SR------SQK 115
VDGTNPVV+ SGD RKKL E V I DGD+IELIPGH+ FKY + SR QK
Sbjct: 378 VDGTNPVVINSGDGRKKLGPRESVIIRDGDVIELIPGHYPFKYASHCFNSRPGSEDLGQK 437
Query: 116 RV--------SNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLL 167
RV S A E+ S Q NS EA+ NFH+ D+LP TFRLL
Sbjct: 438 RVRQVAHDKISERVAKRAEMGSPLENMQSGSSKSKEANSVEAIRNFHIPDDRLPMTFRLL 497
Query: 168 RVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
V+GLP WANTSCV I D+IQGDI+ A+LSNYMVDIDWL+PACP LAKIP VLVIHGE D
Sbjct: 498 SVKGLPPWANTSCVRITDIIQGDILFAVLSNYMVDIDWLIPACPTLAKIPQVLVIHGEGD 557
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ 287
GTL++MKR KPANWILHKPPLPISFGTHHSKA+ L+YPRG+R++VHTANLI+VDWNNKSQ
Sbjct: 558 GTLDNMKRKKPANWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQ 617
Query: 288 GLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
GLWMQDFP KDQN+ S C FE+DL+DYLS LKWPEF A+ P HGNF INP FF+KF++S
Sbjct: 618 GLWMQDFPWKDQNSSSRGCAFEDDLVDYLSALKWPEFPASFPGHGNFNINPYFFRKFDYS 677
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
AAVRLIASVPGYHTG LKKWGHMKLR+VLQEC F+K F++SPLVYQFSSLGSL+EKWM
Sbjct: 678 KAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECIFDKEFQRSPLVYQFSSLGSLNEKWM 737
Query: 408 AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK 467
AE ++S+SSGF+ DKTPLG+GEPLIVWPTVEDVRCSLEGYAAG+AIPSP KNV+K FL+K
Sbjct: 738 AEFAASLSSGFTPDKTPLGLGEPLIVWPTVEDVRCSLEGYAAGSAIPSPLKNVEKGFLRK 797
Query: 468 YWAKWKASHTGRSRAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSY 517
YWAKW + H+GR AMPHIKTFARYNGQKLA +AAWGALQKNNSQLMIRSY
Sbjct: 798 YWAKWNSFHSGRCHAMPHIKTFARYNGQKLAWLVLTSSNLSQAAWGALQKNNSQLMIRSY 857
Query: 518 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI--QKTKLVTLTWHGSSDAGASS 575
ELGVL LP KR+ FSCT N ++ KS + S+ KT+LVTL W + + S
Sbjct: 858 ELGVLFLPQ-KRNDYSFSCTKNGGSAQNKSTVSRPSETLEGKTELVTLAWQENKKRESLS 916
Query: 576 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRHF 618
EV+ LP+PYELPPQ Y EDVPWSWD+RYT+KDV+G VWPR F
Sbjct: 917 EVIQLPIPYELPPQPYGPEDVPWSWDRRYTQKDVHGAVWPRQF 959
>gi|449434370|ref|XP_004134969.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cucumis sativus]
Length = 613
Score = 879 bits (2270), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/628 (67%), Positives = 496/628 (78%), Gaps = 25/628 (3%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ ++GYLVPLD NL DNS K+ LS+GPN IGR+N+ VS+KR+SRKHITLT S DGS
Sbjct: 1 MARLQVGYLVPLDKNLEVDNSGLKIRLSEGPNSIGRSNVLVSEKRISRKHITLTTSTDGS 60
Query: 61 ASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSND 120
A L+V+GTNPVV+ SGD RKKL E V I DGD+IELIPGH+ FKY + + + S D
Sbjct: 61 AKLLVEGTNPVVINSGDGRKKLGPRESVIIRDGDVIELIPGHYPFKYASHCFNSRPGSED 120
Query: 121 GATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSC 180
L K++RQ+ NS EA+ NFH+ D+LP TFRLL V+GLP WANTSC
Sbjct: 121 ------LGQKRVRQE--------ANSVEAIRNFHIPDDRLPMTFRLLSVKGLPPWANTSC 166
Query: 181 VSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPAN 240
V I D+IQGDI+ A+LSNYMVDIDWL+PACP LAK+P VLVIHGE DGTL++MKR KPAN
Sbjct: 167 VRITDIIQGDILFAVLSNYMVDIDWLIPACPALAKVPQVLVIHGEGDGTLDNMKRKKPAN 226
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
WILHKPPLPISFGTHHSKA+ L+YPRG+R++VHTANLI+VDWNNKSQGLWMQDFP KDQN
Sbjct: 227 WILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQN 286
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
+ S C FE+DL+DYLS LKWPEF A+ P HGNF INP FF+KF++S AAVRLIASVPGY
Sbjct: 287 SSSRGCAFEDDLVDYLSALKWPEFPASFPGHGNFNINPYFFRKFDYSKAAVRLIASVPGY 346
Query: 361 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
HTG LKKWGHMKLR+VLQEC F+K F++SPLVYQFSSLGSL+EKWMAE ++S+SSGF+
Sbjct: 347 HTGRYLKKWGHMKLRSVLQECIFDKEFQRSPLVYQFSSLGSLNEKWMAEFAASLSSGFTP 406
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRS 480
DKTPLG+GEPLIVWPTVEDVRCSLEGYAAG+A+PSP KNV+K FL KYWAKW + H+GR
Sbjct: 407 DKTPLGLGEPLIVWPTVEDVRCSLEGYAAGSAVPSPLKNVEKGFLTKYWAKWNSFHSGRC 466
Query: 481 RAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRH 530
AMPHIKTFARYNGQKLA +AAWGALQKNNSQLMIRSYELGVL LP KR+
Sbjct: 467 HAMPHIKTFARYNGQKLAWLVLTSSNLSQAAWGALQKNNSQLMIRSYELGVLFLPQ-KRN 525
Query: 531 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 590
FSCT N ++ + KT+LVTL W + + SEV+ LP+PYELPPQ
Sbjct: 526 DYSFSCTKNGGSAQSTVSRPSETLEGKTELVTLAWQENKKRESLSEVIQLPIPYELPPQP 585
Query: 591 YSSEDVPWSWDKRYTKKDVYGQVWPRHF 618
Y EDVPWSW++RYT+KDV+G VWPR F
Sbjct: 586 YGPEDVPWSWERRYTQKDVHGAVWPRQF 613
>gi|356507524|ref|XP_003522514.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like isoform 1 [Glycine
max]
Length = 610
Score = 871 bits (2250), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/624 (70%), Positives = 505/624 (80%), Gaps = 32/624 (5%)
Query: 5 KIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLV 64
++GYLVPL+ N +E+ S+PK +S G NVIGR NIPV DKRLSRKH+TLTAS +GSASL+
Sbjct: 6 QVGYLVPLNRNFKEEASVPKFAVSDGINVIGRNNIPVPDKRLSRKHLTLTASPNGSASLL 65
Query: 65 VDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATN 124
V+GTNP+VV SG++R+KL+ E +I +GDIIELIPGHH FKY L
Sbjct: 66 VEGTNPIVVNSGNKRRKLNPKEEATICNGDIIELIPGHHLFKYQVLGG------------ 113
Query: 125 GELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIR 184
R D + + NS EA+ NFHV D++PSTFRLL VQGLP WANTSCVSI
Sbjct: 114 --------RNADARKSSGEDNSVEAIRNFHVPSDQIPSTFRLLHVQGLPPWANTSCVSIG 165
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH 244
DVIQGDI VAILSNYMVDIDWL+PACP L+K+PHVLVIHGESDG ++++KR+KPANWILH
Sbjct: 166 DVIQGDIKVAILSNYMVDIDWLVPACPALSKVPHVLVIHGESDGRVDYIKRSKPANWILH 225
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
KP LPISFGTHHSKAM+LIYP+GVR+IVHTANLI+VDWNNKSQGLWMQDFP KDQN+LS+
Sbjct: 226 KPSLPISFGTHHSKAMMLIYPQGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKDQNSLSK 285
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
GFENDL++YLS LKWPEFS NLP G+ I PSFF+KF++S A VRLIASVPGYH+GS
Sbjct: 286 GSGFENDLVEYLSVLKWPEFSVNLPFLGSVSICPSFFRKFDYSDARVRLIASVPGYHSGS 345
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
SLKKWGHMKLR++LQECTF++ FKKSPLVYQFSSLGSLDEKWM EL+SSMS+G SEDKTP
Sbjct: 346 SLKKWGHMKLRSLLQECTFDEEFKKSPLVYQFSSLGSLDEKWMTELASSMSAGLSEDKTP 405
Query: 425 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMP 484
LG+GEP I+WPTVEDVRCSLEGYAAGNA+PSP KNV+K FLKKYWAKWKA HTGR RAMP
Sbjct: 406 LGMGEPQIIWPTVEDVRCSLEGYAAGNAVPSPLKNVEKTFLKKYWAKWKADHTGRCRAMP 465
Query: 485 HIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCG 533
HIKTFARY Q LA KAAWGALQKNN+QLMIRSYELGVL LPS KRH
Sbjct: 466 HIKTFARYKNQSLAWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPSLFKRHESV 525
Query: 534 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY-LPVPYELPPQRYS 592
FSCTSN+ SE K + E+S+++KTKLVTLT +SSEV+ LP+PYELPP YS
Sbjct: 526 FSCTSNVTVSEDKCPARESSEMKKTKLVTLTGIKKESMHSSSEVIIPLPLPYELPPLPYS 585
Query: 593 SEDVPWSWDKRYTKKDVYGQVWPR 616
S+D+PWSWD++Y KKDVYG VWPR
Sbjct: 586 SQDIPWSWDRQYNKKDVYGHVWPR 609
>gi|356507526|ref|XP_003522515.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like isoform 2 [Glycine
max]
Length = 599
Score = 865 bits (2234), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/624 (69%), Positives = 504/624 (80%), Gaps = 43/624 (6%)
Query: 5 KIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLV 64
++GYLVPL+ N +E+ S+PK +S G NVIGR NIPV DKRLSRKH+TLTAS +GSASL+
Sbjct: 6 QVGYLVPLNRNFKEEASVPKFAVSDGINVIGRNNIPVPDKRLSRKHLTLTASPNGSASLL 65
Query: 65 VDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATN 124
V+GTNP+VV SG++R+KL+ E +I +GDIIELIPGHH FKY ++
Sbjct: 66 VEGTNPIVVNSGNKRRKLNPKEEATICNGDIIELIPGHHLFKY--------------QSS 111
Query: 125 GELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIR 184
GE NS EA+ NFHV D++PSTFRLL VQGLP WANTSCVSI
Sbjct: 112 GE-----------------DNSVEAIRNFHVPSDQIPSTFRLLHVQGLPPWANTSCVSIG 154
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH 244
DVIQGDI VAILSNYMVDIDWL+PACP L+K+PHVLVIHGESDG ++++KR+KPANWILH
Sbjct: 155 DVIQGDIKVAILSNYMVDIDWLVPACPALSKVPHVLVIHGESDGRVDYIKRSKPANWILH 214
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
KP LPISFGTHHSKAM+LIYP+GVR+IVHTANLI+VDWNNKSQGLWMQDFP KDQN+LS+
Sbjct: 215 KPSLPISFGTHHSKAMMLIYPQGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKDQNSLSK 274
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
GFENDL++YLS LKWPEFS NLP G+ I PSFF+KF++S A VRLIASVPGYH+GS
Sbjct: 275 GSGFENDLVEYLSVLKWPEFSVNLPFLGSVSICPSFFRKFDYSDARVRLIASVPGYHSGS 334
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
SLKKWGHMKLR++LQECTF++ FKKSPLVYQFSSLGSLDEKWM EL+SSMS+G SEDKTP
Sbjct: 335 SLKKWGHMKLRSLLQECTFDEEFKKSPLVYQFSSLGSLDEKWMTELASSMSAGLSEDKTP 394
Query: 425 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMP 484
LG+GEP I+WPTVEDVRCSLEGYAAGNA+PSP KNV+K FLKKYWAKWKA HTGR RAMP
Sbjct: 395 LGMGEPQIIWPTVEDVRCSLEGYAAGNAVPSPLKNVEKTFLKKYWAKWKADHTGRCRAMP 454
Query: 485 HIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCG 533
HIKTFARY Q LA KAAWGALQKNN+QLMIRSYELGVL LPS KRH
Sbjct: 455 HIKTFARYKNQSLAWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPSLFKRHESV 514
Query: 534 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY-LPVPYELPPQRYS 592
FSCTSN+ SE K + E+S+++KTKLVTLT +SSEV+ LP+PYELPP YS
Sbjct: 515 FSCTSNVTVSEDKCPARESSEMKKTKLVTLTGIKKESMHSSSEVIIPLPLPYELPPLPYS 574
Query: 593 SEDVPWSWDKRYTKKDVYGQVWPR 616
S+D+PWSWD++Y KKDVYG VWPR
Sbjct: 575 SQDIPWSWDRQYNKKDVYGHVWPR 598
>gi|224078752|ref|XP_002305614.1| predicted protein [Populus trichocarpa]
gi|222848578|gb|EEE86125.1| predicted protein [Populus trichocarpa]
Length = 599
Score = 845 bits (2184), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/628 (67%), Positives = 495/628 (78%), Gaps = 41/628 (6%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ + I YLVPL +L E+ S+PKLPLS G N IGR +I SDKRLSR H++LT S S
Sbjct: 1 MTHSPIAYLVPLSPSLEENASIPKLPLSNGQNTIGRNDISASDKRLSRNHLSLTLSLT-S 59
Query: 61 ASLVVDGTNPV-VVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSN 119
+++ V+GTNPV VVKSG +R+KL + E I + DIIELIPG++F+KYV + S
Sbjct: 60 STITVEGTNPVAVVKSGKRRRKLRAGEKAEIINDDIIELIPGNYFYKYVEME------SG 113
Query: 120 DGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTS 179
N E EEA+ +F VS D+L TFRLLRV+ LPAWANTS
Sbjct: 114 GPPRNCE--------------------EEAIRDFGVSEDELALTFRLLRVKELPAWANTS 153
Query: 180 CVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
CVSI DVI+GDI+VAILSNYMVD+DWLL ACP +AK+P+V+VIHGE DGTLEHMKR KPA
Sbjct: 154 CVSINDVIKGDILVAILSNYMVDMDWLLSACPTIAKVPNVMVIHGEGDGTLEHMKRRKPA 213
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ 299
NWILHKP LPISFGTHHSKAM L+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K++
Sbjct: 214 NWILHKPRLPISFGTHHSKAMFLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKEE 273
Query: 300 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
+ CGFENDL+DYLS LKWPEF+ LP G+ IN SFFKKF++S AAVRLIASVPG
Sbjct: 274 KKPGKGCGFENDLVDYLSMLKWPEFTVKLPNLGSISINASFFKKFDYSHAAVRLIASVPG 333
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFS 419
YHTG++L+KWGHMKL++VLQECTF+ FK+SPLVYQFSSLGSLDEKWM EL+ SMSSG++
Sbjct: 334 YHTGANLRKWGHMKLQSVLQECTFDNEFKRSPLVYQFSSLGSLDEKWMTELAISMSSGYA 393
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR 479
EDKTPLG+G P I+WPTVEDVRCSLEGYAAGNAIP P KNV+K FLKKYWAKWKASH+GR
Sbjct: 394 EDKTPLGLGVPQIIWPTVEDVRCSLEGYAAGNAIPGPLKNVEKGFLKKYWAKWKASHSGR 453
Query: 480 SRAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSA-K 528
RAMPHIKTF RYNGQKLA KAAWGALQKNNSQLMIRSYELGVL LPS+ +
Sbjct: 454 CRAMPHIKTFTRYNGQKLAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSSIR 513
Query: 529 RHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 588
R+G GFSCTSN PS GS S+ +T LVTL W G+SD ++S+V+ LPVPYELPP
Sbjct: 514 RYGSGFSCTSNGGPSMDNCGSLVDSEELRTTLVTLKWQGTSD--SASKVIPLPVPYELPP 571
Query: 589 QRYSSEDVPWSWDKRYTKKDVYGQVWPR 616
YSSEDVPWSWD+RY+KKDVYGQVWPR
Sbjct: 572 IPYSSEDVPWSWDRRYSKKDVYGQVWPR 599
>gi|297811655|ref|XP_002873711.1| hypothetical protein ARALYDRAFT_488358 [Arabidopsis lyrata subsp.
lyrata]
gi|297319548|gb|EFH49970.1| hypothetical protein ARALYDRAFT_488358 [Arabidopsis lyrata subsp.
lyrata]
Length = 612
Score = 842 bits (2176), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/631 (65%), Positives = 492/631 (77%), Gaps = 34/631 (5%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ +++ YL+PL +L+ED+S P++ LS+GPN IGR N+ + DKRLSRKHIT+ AS GS
Sbjct: 1 MAHSQVAYLIPLKADLKEDDSSPRITLSEGPNFIGRGNVSIVDKRLSRKHITIMASTSGS 60
Query: 61 ASLVVDGTNPVVVKS--GDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
ASL V+GTNPVV++S G +RKK+ E VS+++ D+IELIPGHHFFK V L +K
Sbjct: 61 ASLSVEGTNPVVIRSSGGGERKKVKPREEVSVSNDDLIELIPGHHFFKLVLLPVEKK--- 117
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT 178
+ E ++KK R+ ++ EA+ F +KLPSTFRLL V GLP WANT
Sbjct: 118 ----GSHERATKKARKAEDD--------VEAIRRFCPPNEKLPSTFRLLSVNGLPDWANT 165
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
SCVSI DVI+GDI+ AILSNYMVD+DWL+ ACP LA IP V+VIHGE DG E+++R KP
Sbjct: 166 SCVSINDVIEGDIVAAILSNYMVDVDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKP 225
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
NWILHKP LPISFGTHHSKA+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD
Sbjct: 226 VNWILHKPRLPISFGTHHSKAIFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKD 285
Query: 299 QN-NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ + + CGFE DLIDYL+ LKWPEFSANLP GN KIN +FFKKF++S A VRLIASV
Sbjct: 286 DDKDPPKGCGFEGDLIDYLTVLKWPEFSANLPGRGNVKINAAFFKKFDYSDAKVRLIASV 345
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 417
PGYHTG +LKKWGHMKLRT+LQEC F++ F +SPLVYQFSSLGSLDEKW+AE +S+SSG
Sbjct: 346 PGYHTGLNLKKWGHMKLRTILQECIFDREFCRSPLVYQFSSLGSLDEKWLAEFGNSLSSG 405
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
SEDKTPLG G+PLI+WPTVEDVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+W A H+
Sbjct: 406 ISEDKTPLGPGDPLIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWTADHS 465
Query: 478 GRSRAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSA 527
R RAMPHIKTF RYN QKLA KAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 466 ARGRAMPHIKTFTRYNDQKLAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSP 525
Query: 528 -KRHGCGFSCTSNIVPSEIKSGSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYE 585
K GC FSCT + PS +K+ + +K +KLVT+TW G D S E++ LP+PYE
Sbjct: 526 IKTQGCIFSCTES-NPSTMKAKQERKDEAEKRSKLVTMTWQGDRD---SPEIISLPIPYE 581
Query: 586 LPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 616
LPP+ YS+EDVPWSWD+ Y+KKDVYGQVWPR
Sbjct: 582 LPPKPYSAEDVPWSWDRGYSKKDVYGQVWPR 612
>gi|22326821|ref|NP_197021.2| tyrosyl-DNA phosphodiesterase 1 [Arabidopsis thaliana]
gi|23297734|gb|AAN13014.1| unknown protein [Arabidopsis thaliana]
gi|226511716|gb|ACO60340.1| tyrosyl-DNA phosphodiesterase I [Arabidopsis thaliana]
gi|332004741|gb|AED92124.1| tyrosyl-DNA phosphodiesterase 1 [Arabidopsis thaliana]
Length = 605
Score = 835 bits (2156), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/631 (64%), Positives = 488/631 (77%), Gaps = 41/631 (6%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ +++ YL+PL +L+EDNS P++ LS+GPN+IGR N+ + DKRLSRKHIT+ S GS
Sbjct: 1 MAHSQVAYLIPLKADLKEDNSSPRITLSEGPNIIGRGNVSIVDKRLSRKHITIIVSTSGS 60
Query: 61 ASLVVDGTNPVVVKS-GD-QRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
ASL VDGTNPVV++S GD +RKK+ +E VS+ + D+IELIPGHHFFK V L
Sbjct: 61 ASLSVDGTNPVVIRSSGDGERKKVKPSEEVSVCNDDLIELIPGHHFFKLVLL-------- 112
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT 178
NG + K + +D+ EA+ F +KLPSTFRLL V LP WANT
Sbjct: 113 -----NGRAAKKARKAEDDV---------EAIRRFCPPNEKLPSTFRLLSVDALPDWANT 158
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
SCVSI DVI+GD++ AILSNYMVDIDWL+ ACP LA IP V+VIHGE DG E+++R KP
Sbjct: 159 SCVSINDVIEGDVVAAILSNYMVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKP 218
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
ANWILHKP LPISFGTHHSKA+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD
Sbjct: 219 ANWILHKPRLPISFGTHHSKAIFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKD 278
Query: 299 QN-NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ + + CGFE DLIDYL+ LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASV
Sbjct: 279 DDKDPPKGCGFEGDLIDYLNVLKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASV 338
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 417
PGYHTG +L KWGHMKLRT+LQEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG
Sbjct: 339 PGYHTGFNLNKWGHMKLRTILQECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSG 398
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
+EDKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+WKA H+
Sbjct: 399 ITEDKTPLGPGDSLIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWKADHS 458
Query: 478 GRSRAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSA 527
R RAMPHIKTF RYN QK+A KAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 459 ARGRAMPHIKTFTRYNDQKIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSP 518
Query: 528 -KRHGCGFSCTSNIVPSEIKSGSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYE 585
K GC FSCT + PS +K+ +++K +KLVT+TW G D E++ LPVPY+
Sbjct: 519 IKTQGCVFSCTES-NPSVMKAKQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQ 574
Query: 586 LPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 616
LPP+ YS EDVPWSWD+ Y+KKDVYGQVWPR
Sbjct: 575 LPPKPYSPEDVPWSWDRGYSKKDVYGQVWPR 605
>gi|17381098|gb|AAL36361.1| unknown protein [Arabidopsis thaliana]
Length = 605
Score = 833 bits (2153), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/631 (64%), Positives = 488/631 (77%), Gaps = 41/631 (6%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ +++ YL+PL +L+EDNS P++ LS+GPN+IGR N+ + DKRLSRKHIT+ S GS
Sbjct: 1 MAHSQVAYLIPLKADLKEDNSSPRITLSEGPNIIGRGNVSIVDKRLSRKHITIIVSTSGS 60
Query: 61 ASLVVDGTNPVVVKS-GD-QRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
ASL VDGTNPVV++S GD +RKK+ +E VS+ + D+IELIPGHHFFK V L
Sbjct: 61 ASLSVDGTNPVVIRSSGDGERKKVKPSEEVSVCNDDLIELIPGHHFFKLVLL-------- 112
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT 178
NG + K + +D+ EA+ F +KLPSTFRLL V LP WANT
Sbjct: 113 -----NGRAAKKARKAEDDV---------EAIRRFCPPNEKLPSTFRLLSVDALPDWANT 158
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
SCVSI DVI+GD++ AILSNYMVDIDWL+ ACP LA IP V+VIHGE DG E+++R KP
Sbjct: 159 SCVSINDVIEGDVVAAILSNYMVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKP 218
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
ANWILHKP LPISFGTHHSKA+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD
Sbjct: 219 ANWILHKPRLPISFGTHHSKAIFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKD 278
Query: 299 QN-NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ + + CGFE DLIDYL+ LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASV
Sbjct: 279 DDKDPPKGCGFEGDLIDYLNVLKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASV 338
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 417
PGYHTG +L KWGHMKLRT+LQEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG
Sbjct: 339 PGYHTGFNLNKWGHMKLRTILQECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSG 398
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
+EDKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSP KNV++ FLKKYWA+WKA H+
Sbjct: 399 ITEDKTPLGPGDSLIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEEPFLKKYWARWKADHS 458
Query: 478 GRSRAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSA 527
R RAMPHIKTF RYN QK+A KAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 459 ARGRAMPHIKTFTRYNDQKIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSP 518
Query: 528 -KRHGCGFSCTSNIVPSEIKSGSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYE 585
K GC FSCT + PS +K+ +++K +KLVT+TW G D E++ LPVPY+
Sbjct: 519 IKTQGCVFSCTES-NPSVMKAKQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQ 574
Query: 586 LPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 616
LPP+ YS EDVPWSWD+ Y+KKDVYGQVWPR
Sbjct: 575 LPPKPYSPEDVPWSWDRGYSKKDVYGQVWPR 605
>gi|7671486|emb|CAB89327.1| putative protein [Arabidopsis thaliana]
Length = 627
Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/609 (64%), Positives = 467/609 (76%), Gaps = 41/609 (6%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ +++ YL+PL +L+EDNS P++ LS+GPN+IGR N+ + DKRLSRKHIT+ S GS
Sbjct: 1 MAHSQVAYLIPLKADLKEDNSSPRITLSEGPNIIGRGNVSIVDKRLSRKHITIIVSTSGS 60
Query: 61 ASLVVDGTNPVVVKS-GD-QRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
ASL VDGTNPVV++S GD +RKK+ +E VS+ + D+IELIPGHHFFK V L
Sbjct: 61 ASLSVDGTNPVVIRSSGDGERKKVKPSEEVSVCNDDLIELIPGHHFFKLVLL-------- 112
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT 178
NG + K + +D+ EA+ F +KLPSTFRLL V LP WANT
Sbjct: 113 -----NGRAAKKARKAEDDV---------EAIRRFCPPNEKLPSTFRLLSVDALPDWANT 158
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
SCVSI DVI+GD++ AILSNYMVDIDWL+ ACP LA IP V+VIHGE DG E+++R KP
Sbjct: 159 SCVSINDVIEGDVVAAILSNYMVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKP 218
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
ANWILHKP LPISFGTHHSKA+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD
Sbjct: 219 ANWILHKPRLPISFGTHHSKAIFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKD 278
Query: 299 QN-NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ + + CGFE DLIDYL+ LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASV
Sbjct: 279 DDKDPPKGCGFEGDLIDYLNVLKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASV 338
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 417
PGYHTG +L KWGHMKLRT+LQEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG
Sbjct: 339 PGYHTGFNLNKWGHMKLRTILQECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSG 398
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
+EDKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+WKA H+
Sbjct: 399 ITEDKTPLGPGDSLIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWKADHS 458
Query: 478 GRSRAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPS- 526
R RAMPHIKTF RYN QK+A KAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 459 ARGRAMPHIKTFTRYNDQKIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSP 518
Query: 527 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYE 585
K GC FSCT + PS +K+ +++K +KLVT+TW G D E++ LPVPY+
Sbjct: 519 IKTQGCVFSCTES-NPSVMKAKQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQ 574
Query: 586 LPPQRYSSE 594
LPP+ YS E
Sbjct: 575 LPPKPYSPE 583
>gi|326504850|dbj|BAK06716.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 669
Score = 777 bits (2007), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/671 (56%), Positives = 476/671 (70%), Gaps = 62/671 (9%)
Query: 2 SATKIGYLVP-LDNNLREDNS----LPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTAS 56
S ++G LVP ++ N+ +P +P+ +G NV+GR+N+ DKR+SRKH++L A
Sbjct: 5 SRVRVGTLVPFVEGKSGSPNASSLPMPSIPIFEGSNVVGRSNLVAVDKRVSRKHLSLRAV 64
Query: 57 ADGSASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKR 116
DGS +VV+GTNP+VV+S QR+K+ + + I D++ELIPG +F KYV +S +++
Sbjct: 65 PDGSVEVVVEGTNPIVVRSEGQRRKVCAQQRAKIMPDDVLELIPGEYFMKYVNMS-DERK 123
Query: 117 VSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSR------------------- 157
+S A+ KK ++ E+D+ K + + + + ++R
Sbjct: 124 IS---ASVDSHDLKKGKRHSEEDSVAAKRNRQVMEDEALARTLQESFAEESASVTEVLSS 180
Query: 158 ---------------------DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL 196
D LP +FRL+RVQGLP+W NTS V+I+DVIQG++++A+L
Sbjct: 181 LDSAGSSERNKERTHSVGPLKDVLPLSFRLMRVQGLPSWTNTSTVTIQDVIQGEVLLAVL 240
Query: 197 SNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 256
SNYMVD+DWLL ACP L K+PHVLV+HGE +LE +K+ KP NWILHKPPLPISFGTHH
Sbjct: 241 SNYMVDMDWLLTACPSLRKVPHVLVLHGEDGASLERLKKTKPTNWILHKPPLPISFGTHH 300
Query: 257 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 316
SKAMLL+YP+G+R++VHTANLIHVDWNNKSQGLW QDFP K+ N++S GFENDL+DYL
Sbjct: 301 SKAMLLVYPQGIRVVVHTANLIHVDWNNKSQGLWAQDFPWKEANDMSTNIGFENDLVDYL 360
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRT 376
LKWPEF NLP G+ IN +FF+KF++SS+ VRLI SVPGYH G ++KKWGHMKLR+
Sbjct: 361 RALKWPEFRVNLPVVGDVNINAAFFRKFDYSSSTVRLIGSVPGYHVGPNMKKWGHMKLRS 420
Query: 377 VLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPT 436
VL+EC FEK F KSPL+YQFSSLGSLDEKWM+E + S+S+G ++D + LGIG+PLIVWPT
Sbjct: 421 VLEECVFEKQFCKSPLIYQFSSLGSLDEKWMSEFACSLSAGKADDGSQLGIGKPLIVWPT 480
Query: 437 VEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK 496
VEDVRCS+EGYAAG+ IPSPQKNV+KDFL+KYW++WKA H GR RAMPHIKTF RYNGQ
Sbjct: 481 VEDVRCSIEGYAAGSCIPSPQKNVEKDFLRKYWSRWKADHVGRCRAMPHIKTFTRYNGQN 540
Query: 497 LA----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 546
+A KAAWGALQKNN+QLMIRSYELGVL LP + FSCT S
Sbjct: 541 IAWFLLTSSNLSKAAWGALQKNNTQLMIRSYELGVLFLPKTLQSVPQFSCTDK---SRSN 597
Query: 547 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 606
+ KTKLVTL W G + S+EVV LPVPY+LPPQ Y EDVPWSWD+RYTK
Sbjct: 598 LDKLALGKNIKTKLVTLCWKGDEEKDPSAEVVRLPVPYQLPPQLYGPEDVPWSWDRRYTK 657
Query: 607 KDVYGQVWPRH 617
KDVYG VW RH
Sbjct: 658 KDVYGSVWSRH 668
>gi|357122586|ref|XP_003562996.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Brachypodium
distachyon]
Length = 671
Score = 777 bits (2006), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/669 (56%), Positives = 472/669 (70%), Gaps = 56/669 (8%)
Query: 2 SATKIGYLVPLDNNLRED--NSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVP SLP +P+ +G NV+GR+N+ V DKR+SRKH++L SADG
Sbjct: 5 SRVRVGTLVPFGEGKAGSLGASLPSIPIFEGSNVVGRSNLVVVDKRVSRKHLSLRVSADG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLS---RSQKR 116
S +VV+G NP+VV+S QR+++ + E I D++ELIPG +F KYV + +S
Sbjct: 65 SIEVVVEGPNPIVVQSEGQRRRVCAKERAKIIHDDVLELIPGDYFVKYVNMGDEHKSSTP 124
Query: 117 VSNDGATNG------ELSSKKMRQQDEQDNENGKNSEEALCNFHVS-------------- 156
V ++ G E K +Q +D + +E+ +S
Sbjct: 125 VDSNDLKKGKRHREEECVVAKRNRQIVEDEALARTLQESFAEETMSATGMACVQVSSSLD 184
Query: 157 ------------------RDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSN 198
+D LP TFRL+RVQGLP+W NTS V+I+DVIQG++++A+LSN
Sbjct: 185 SAGSSERNNERMHSAGSLKDVLPLTFRLMRVQGLPSWTNTSAVTIQDVIQGEVLLAVLSN 244
Query: 199 YMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 258
YMVD+DWLL ACP L K+PHVLV+HGE +LEH+K++KPANWILHKPPLPI+FGTHHSK
Sbjct: 245 YMVDMDWLLTACPSLRKVPHVLVLHGEDGASLEHLKKSKPANWILHKPPLPITFGTHHSK 304
Query: 259 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 318
AMLL+YP+G+R++VHTANLIHVDWNNKSQGLW QDFP KD ++++ FE+DL+DYLS
Sbjct: 305 AMLLVYPQGIRVVVHTANLIHVDWNNKSQGLWTQDFPWKDTKDMNKNISFESDLVDYLSA 364
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 378
LKWPEF LP G+ IN +FF+KF++SS+ VRLI SVPGYH G ++KKWGHMKLR+VL
Sbjct: 365 LKWPEFRIKLPVAGDVNINAAFFRKFDYSSSTVRLIGSVPGYHVGPNIKKWGHMKLRSVL 424
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
+ C FEK F KSPL+YQFSSLGSLDEKWM E + S+S+G ++D +PLGIG+PLIVWPTVE
Sbjct: 425 EGCVFEKQFCKSPLIYQFSSLGSLDEKWMTEFACSLSAGKADDGSPLGIGKPLIVWPTVE 484
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 498
DVRCS+EGYAAG+ IPSPQKNV+KDFL+KYW++WKA H GR AMPHIKTFARYNGQ +A
Sbjct: 485 DVRCSIEGYAAGSCIPSPQKNVEKDFLRKYWSRWKADHVGRCHAMPHIKTFARYNGQNIA 544
Query: 499 ----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 548
KAAWGALQKNN+QLMIRSYELGVL LP + FSCT + G
Sbjct: 545 WFLLTSSNLSKAAWGALQKNNTQLMIRSYELGVLFLPKTLQSVSRFSCTEK---NHSNLG 601
Query: 549 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKD 608
+ + KTKLVTL W + S+EV+ LPVPY+LPPQ Y EDVPWSWD+RYTKKD
Sbjct: 602 NLTLGKTIKTKLVTLCWKDDEEKEPSAEVIRLPVPYQLPPQLYGPEDVPWSWDRRYTKKD 661
Query: 609 VYGQVWPRH 617
VYG VWPRH
Sbjct: 662 VYGAVWPRH 670
>gi|293331809|ref|NP_001169273.1| uncharacterized protein LOC100383136 [Zea mays]
gi|224028313|gb|ACN33232.1| unknown [Zea mays]
gi|414886956|tpg|DAA62970.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
gi|414886957|tpg|DAA62971.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
Length = 665
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/668 (56%), Positives = 472/668 (70%), Gaps = 60/668 (8%)
Query: 2 SATKIGYLVPL--DNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVPL DN + S+ +P+ QGPNV+GR ++ V DKR+SRKH++L AS DG
Sbjct: 5 SRVRLGTLVPLTKDNAGSSNGSVSSIPIFQGPNVVGRDHLVVVDKRISRKHLSLHASTDG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQ----- 114
S +VV+G NP++V+S QR+K+ + E IA GD++ELIPG +F KYV +
Sbjct: 65 SIEVVVEGPNPIIVRSKGQRRKVCAKETAKIAHGDVLELIPGDYFVKYVDMGDEHVPMHL 124
Query: 115 -----------------KRV------------------SNDGATNGELSSKKMRQQDEQD 139
KR+ ++D A +G S +K+ D
Sbjct: 125 SDLMKGKRYSEEHGAAVKRIRQIMEDEALAKTLQESFAADDAAVSGMPSGQKISSHDSAG 184
Query: 140 NENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNY 199
+ N + +D LP TFRL+ VQGLP+W NTS V+I+DVIQG++++A+LSNY
Sbjct: 185 SSERNNDRTH--SVGPLKDMLPLTFRLMHVQGLPSWTNTSSVTIQDVIQGEVLLAVLSNY 242
Query: 200 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 259
MVDIDWLL ACP L K+PHVLV+HG+ +LE MK+ KPANWILH+PPLPISFGTHHSKA
Sbjct: 243 MVDIDWLLTACPSLRKVPHVLVLHGQDGASLELMKKLKPANWILHRPPLPISFGTHHSKA 302
Query: 260 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 319
MLL+YP+G+RI+VHTANLIHVDWN KSQGLWMQDFP KD +++++ FENDL+DYLS L
Sbjct: 303 MLLVYPQGIRIVVHTANLIHVDWNYKSQGLWMQDFPWKDTVDMNKKTAFENDLVDYLSAL 362
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 379
KWPEF NLP G+ IN +FF+KF++S++ VRLI SVPGYH GS+++KWGHMKLR VL
Sbjct: 363 KWPEFRVNLPGVGDVNINAAFFRKFDYSNSMVRLIGSVPGYHVGSNIRKWGHMKLRNVLD 422
Query: 380 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 439
E F K F KSPL+YQFSSLGSLDEKWM+E + S+S+G S+D + LGIG+PLIVWPTVED
Sbjct: 423 EIMFNKQFCKSPLIYQFSSLGSLDEKWMSEFACSLSAGKSDDGSQLGIGKPLIVWPTVED 482
Query: 440 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA- 498
VRCS+EGYAAG+ IPSPQKNV++DFLKKYW++WKA H GR RAMPHIKTF RY+GQ +A
Sbjct: 483 VRCSIEGYAAGSCIPSPQKNVERDFLKKYWSRWKADHVGRCRAMPHIKTFTRYSGQNIAW 542
Query: 499 ---------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 549
KAAWGALQKNN+QLMIRSYELGVL LP + FSCT I+ G
Sbjct: 543 FLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPQTLQSVPQFSCTEK--SRSIRDGV 600
Query: 550 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 609
I KTKLVTL W G + +V LPVPY+LPPQ Y ++DVPWSWD+RYTKKDV
Sbjct: 601 ALGKTI-KTKLVTLCWKGDEE---DPSIVRLPVPYQLPPQPYGTQDVPWSWDRRYTKKDV 656
Query: 610 YGQVWPRH 617
YG VWPR+
Sbjct: 657 YGSVWPRY 664
>gi|115472491|ref|NP_001059844.1| Os07g0530100 [Oryza sativa Japonica Group]
gi|33146648|dbj|BAC79984.1| putative tyrosyl-DNA phosphodiesterase [Oryza sativa Japonica
Group]
gi|113611380|dbj|BAF21758.1| Os07g0530100 [Oryza sativa Japonica Group]
gi|215697362|dbj|BAG91356.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222637174|gb|EEE67306.1| hypothetical protein OsJ_24533 [Oryza sativa Japonica Group]
Length = 671
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/677 (55%), Positives = 481/677 (71%), Gaps = 72/677 (10%)
Query: 2 SATKIGYLVPLD--NNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVPL+ N + S+ +P+ G NV+GR ++ V DKR+SRKH++L ASADG
Sbjct: 5 SRVRVGNLVPLNEGNASSSNGSVSSIPIYLGANVVGRNHLVVVDKRVSRKHLSLHASADG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSN 119
S VV+G NP++V+S QR+K+ + E V IA D++ELIPG +F KY+ + + K ++
Sbjct: 65 SIEAVVEGPNPIIVRSEGQRRKVCAQERVKIAHDDVLELIPGEYFVKYLNVGDNHKSSTS 124
Query: 120 DGATNGE----------LSSKKMRQ---------------QDEQDNENGKNSEEALCNFH 154
G+++ + + K+ RQ +E +G ++ L +
Sbjct: 125 MGSSDFKKGKRLCEDDTVVIKRNRQIMEDEALARSLQKSFAEESSTISGLGCDQMLSSLD 184
Query: 155 VS----------------RDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSN 198
+ +D L TFRL+RVQGLP+W NTS V+I+DVIQG++++A+LSN
Sbjct: 185 SAGFSERNNERIHSVDYLKDVLSLTFRLMRVQGLPSWTNTSSVTIQDVIQGEVLLAVLSN 244
Query: 199 YMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 258
YMVD++WLL ACP L K+ HVLVIHGE ++E +K+ KPANWILHKPPLPISFGTHHSK
Sbjct: 245 YMVDMEWLLTACPSLRKVRHVLVIHGEDGASVELLKKVKPANWILHKPPLPISFGTHHSK 304
Query: 259 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 318
AMLL+YP+G+R++VHTANLIHVDWNNK+QGLWMQDFP KD +++ FENDL+DYLS
Sbjct: 305 AMLLVYPQGIRVVVHTANLIHVDWNNKTQGLWMQDFPWKDAKDVNRSVSFENDLVDYLSA 364
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 378
+KWPEF NLP G+ IN +FF+KF++ S++VRLI SVPGYH G ++KKWGHMKLR+VL
Sbjct: 365 IKWPEFRVNLPVVGDVNINAAFFRKFDYKSSSVRLIGSVPGYHVGPNIKKWGHMKLRSVL 424
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
+ CTFE+ F K+P++YQFSSLGSLDEKWM+E + S+S+G S++ +PLGIG+PLIVWPTVE
Sbjct: 425 EGCTFEQQFCKAPMIYQFSSLGSLDEKWMSEFAFSLSAGKSDNGSPLGIGKPLIVWPTVE 484
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 498
DVR S+EGYAAG+ IPSPQKNV+KDFL+KYWA+WKA H GR RAMPHIKTF RYNGQ +A
Sbjct: 485 DVRTSIEGYAAGSCIPSPQKNVEKDFLRKYWARWKADHVGRCRAMPHIKTFTRYNGQDIA 544
Query: 499 ----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT-------SNIV 541
KAAWGALQKNN+QLMIRSYELGVL LP + FSCT +N+
Sbjct: 545 WFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPKTHQSVPQFSCTGKNNSNLNNLA 604
Query: 542 PS-EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSW 600
P EI KTKLVTL W + S+E++ LPVPY+LPP+ Y +EDVPWSW
Sbjct: 605 PGKEI-----------KTKLVTLCWKSDEEKEQSTEIIRLPVPYQLPPKPYGTEDVPWSW 653
Query: 601 DKRYTKKDVYGQVWPRH 617
DKRYTKKDVYG VWPRH
Sbjct: 654 DKRYTKKDVYGSVWPRH 670
>gi|218199747|gb|EEC82174.1| hypothetical protein OsI_26284 [Oryza sativa Indica Group]
Length = 843
Score = 758 bits (1956), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/683 (54%), Positives = 478/683 (69%), Gaps = 74/683 (10%)
Query: 2 SATKIGYLVPLD--NNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVPL+ N + S+ +P+ G NV+GR ++ V DKR+SRKH++L ASADG
Sbjct: 5 SRVRVGNLVPLNEGNASSSNGSVSSIPIYLGANVVGRNHLVVVDKRVSRKHLSLHASADG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYV----------- 108
S VV+G NP++V+S QR+K+ + E V IA D++ELIPG +F KY+
Sbjct: 65 SIEAVVEGPNPIIVRSEGQRRKVCAQERVKIAHDDVLELIPGEYFVKYLNVGDNHKSSTS 124
Query: 109 ------------------------------TLSRS-QKRVSNDGATNGELSSKKMRQQDE 137
L+RS QK + + +T L +M +
Sbjct: 125 MGSSDFKKGKRLCEDDTVVIKRNRQIMEDEALARSLQKSFAEESSTISGLGCDQMLSSLD 184
Query: 138 QDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS 197
+ +N+E + + +D L TFRL+RVQGLP+W NTS V+I+DVIQG++++A+LS
Sbjct: 185 SAGSSERNNER-IHSVDYLKDVLSLTFRLMRVQGLPSWTNTSSVTIQDVIQGEVLLAVLS 243
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 257
NYMVD++WLL ACP L K+ HVLVIHGE ++E +K+ KPANWILHKPPLPISFGTHHS
Sbjct: 244 NYMVDMEWLLTACPSLRKVRHVLVIHGEDGASVELLKKVKPANWILHKPPLPISFGTHHS 303
Query: 258 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 317
KAMLL+YP+G+R++VHTANLIHVDWNNK+QGLWMQDFP KD +++ FENDL+DYLS
Sbjct: 304 KAMLLVYPQGIRVVVHTANLIHVDWNNKTQGLWMQDFPWKDAKDVNRIVSFENDLVDYLS 363
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV 377
+KWPEF NLP G+ IN +FF+KF++ S+ VRLI SVPGYH G ++KKWGHMKLR+V
Sbjct: 364 AIKWPEFRVNLPVVGDVNINAAFFRKFDYKSSLVRLIGSVPGYHVGPNIKKWGHMKLRSV 423
Query: 378 LQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTV 437
L+ CTFE+ F K+P++YQFSSLGSLDEKWM+E + S+S+G S++ +PLGIG+PLIVWPTV
Sbjct: 424 LEGCTFEQQFCKAPMIYQFSSLGSLDEKWMSEFACSLSAGKSDNGSPLGIGKPLIVWPTV 483
Query: 438 EDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKL 497
EDVR S+EGYAAG+ IPSPQKNV+KDFL+KYWA+WKA H GR RAMPHIKTF RYNGQ +
Sbjct: 484 EDVRTSIEGYAAGSCIPSPQKNVEKDFLRKYWARWKADHVGRCRAMPHIKTFTRYNGQDI 543
Query: 498 A----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT-------SNI 540
A KAAWGALQKNN+QLMIRSYELGVL LP + FSCT +N+
Sbjct: 544 AWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPKTHQSVPQFSCTGKNNSNLNNL 603
Query: 541 VPS-EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 599
P EI KTKLVTL W + S+E++ LPVPY+LPP+ Y +ED PWS
Sbjct: 604 APGKEI-----------KTKLVTLCWKSDEEKEQSTEIIRLPVPYQLPPKPYGTEDDPWS 652
Query: 600 WDKRYTKKDVYGQVWPRHFQLYA 622
WDKRYTKKDVYG VWPRH + A
Sbjct: 653 WDKRYTKKDVYGSVWPRHGGIQA 675
>gi|242050414|ref|XP_002462951.1| hypothetical protein SORBIDRAFT_02g035180 [Sorghum bicolor]
gi|241926328|gb|EER99472.1| hypothetical protein SORBIDRAFT_02g035180 [Sorghum bicolor]
Length = 689
Score = 687 bits (1773), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/519 (63%), Positives = 396/519 (76%), Gaps = 18/519 (3%)
Query: 109 TLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLR 168
LS+ + ++ A +G S +K+ D +G+N+E + +D LP TFRL+R
Sbjct: 178 VLSKQESFAEDNTAVSGMTSGQKISSHDSA-GSSGRNNERKH-SIGPLKDMLPLTFRLMR 235
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG 228
VQGLP+W NTS VSI+DVIQG++++A+LSNYMVDIDWLL ACP L K+PHVLV+HG+
Sbjct: 236 VQGLPSWTNTSSVSIQDVIQGEVLLAVLSNYMVDIDWLLTACPSLKKVPHVLVLHGQDGA 295
Query: 229 TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG 288
+LE MK+ KPANWILHKPPLPISFGTHHSKAMLL+YP+G+RI+VHTANLIHVDWN KSQG
Sbjct: 296 SLELMKKLKPANWILHKPPLPISFGTHHSKAMLLVYPQGIRIVVHTANLIHVDWNYKSQG 355
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSS 348
LWMQDFP KD N+++ + FENDL+DYLS LKWPEFS NLP G+ IN +FF+KF++ +
Sbjct: 356 LWMQDFPWKDTNDMNNKVPFENDLVDYLSALKWPEFSVNLPEVGDVNINAAFFRKFDYRN 415
Query: 349 AAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA 408
+ VRLI SVPGYH G +++KWGHMKLR VL E TF K F KSPL+YQFSSLGSLDEKWM+
Sbjct: 416 SMVRLIGSVPGYHVGPNIRKWGHMKLRNVLDEITFNKQFCKSPLIYQFSSLGSLDEKWMS 475
Query: 409 ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 468
E + S+S+G S+D + LGIG+PLIVWPTVEDVRCS+EGYAAG+ IPSPQKNV+KDFLKKY
Sbjct: 476 EFACSLSAGKSDDGSQLGIGKPLIVWPTVEDVRCSIEGYAAGSCIPSPQKNVEKDFLKKY 535
Query: 469 WAKWKASHTGRSRAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYE 518
W++WKA H GR RAMPHIKTF RY+GQ +A KAAWGALQKNN+QLMIRSYE
Sbjct: 536 WSRWKADHVGRCRAMPHIKTFTRYSGQNIAWFLLTSSNLSKAAWGALQKNNTQLMIRSYE 595
Query: 519 LGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 578
LGVL LP + FSCT S + KTKLVTL W G + +V
Sbjct: 596 LGVLFLPQTLQSIPQFSCTEK---SRSSRDGVAIGRTIKTKLVTLCWKGDEE---DPSIV 649
Query: 579 YLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 617
LPVPY+LPPQ Y ++DVPWSWD+RYTKKDVYG VWPRH
Sbjct: 650 KLPVPYQLPPQPYGTQDVPWSWDRRYTKKDVYGSVWPRH 688
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 78/116 (67%), Gaps = 2/116 (1%)
Query: 2 SATKIGYLVPL--DNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVPL DN + S+ +P+ QG NV+GR ++ V DKR+SRKH++L AS DG
Sbjct: 5 SRVRLGTLVPLTKDNAGSSNGSVSNIPIFQGSNVVGRDHLVVVDKRISRKHLSLHASTDG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQK 115
S +VV+G NP++V+S QR+K+ + IA GD++ELIPG +F KYV + K
Sbjct: 65 SIEVVVEGPNPIMVRSNGQRRKVCATGKAKIAHGDVLELIPGDYFVKYVDMGDEHK 120
>gi|357504797|ref|XP_003622687.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
gi|355497702|gb|AES78905.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
Length = 849
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/449 (67%), Positives = 371/449 (82%), Gaps = 4/449 (0%)
Query: 1 MSATKIGYLVPLDNNL--REDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASAD 58
S ++IGYL+PL+ N +E S PKL +S G N+IGR N+PV+DKRLSRKH+T+TASAD
Sbjct: 3 FSHSQIGYLIPLNPNSEEKEKASTPKLTISDGTNIIGRNNVPVNDKRLSRKHLTITASAD 62
Query: 59 GSASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
G+A+L V+GTNPVVV SG++R+KL+S + +I DGD+IELIPGH+ FKY RS K
Sbjct: 63 GTANLHVEGTNPVVVNSGNKRRKLNSKQTAAIFDGDVIELIPGHYLFKYQVSQRSPKVAD 122
Query: 119 NDGATNGELSSKKMRQQDEQDNENG--KNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWA 176
N G+ S+ + + ++G ++ EE + +F V+ D++P TFRLLRVQGLP WA
Sbjct: 123 NKHHERGKNSATQRHDKIAVTQKHGSSRSCEEPIRDFRVADDQIPCTFRLLRVQGLPPWA 182
Query: 177 NTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRN 236
NTSCVSI DVIQGDI+VA+LSNYMVD+DWL+PACP L+K+PHVLV+HGESD + +KR+
Sbjct: 183 NTSCVSISDVIQGDILVAVLSNYMVDVDWLVPACPALSKVPHVLVLHGESDERVACIKRS 242
Query: 237 KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
KP NWILHKPPLPISFGTHHSKAM L+YPRGVR+I+HTANLI+VDWNNKSQGLWMQDFP
Sbjct: 243 KPKNWILHKPPLPISFGTHHSKAMFLVYPRGVRVIIHTANLIYVDWNNKSQGLWMQDFPW 302
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
KDQN+ S+ FENDL++YLS LKWPEFS NLP+ GNF I PSFFKKF++S A VRLIAS
Sbjct: 303 KDQNSPSKGSRFENDLVEYLSALKWPEFSVNLPSLGNFSICPSFFKKFDYSDAMVRLIAS 362
Query: 357 VPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 416
VPGYH+G+ LKKWGHMKLR+VLQECTF+K FKKSPLVYQFSSLGSLDEKWM EL+SSMS+
Sbjct: 363 VPGYHSGNGLKKWGHMKLRSVLQECTFDKEFKKSPLVYQFSSLGSLDEKWMVELASSMSA 422
Query: 417 GFSEDKTPLGIGEPLIVWPTVEDVRCSLE 445
G SEDK PLG+GEP I+WPTVE+VRCS+E
Sbjct: 423 GLSEDKVPLGMGEPQIIWPTVEEVRCSIE 451
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 125/175 (71%), Positives = 138/175 (78%), Gaps = 11/175 (6%)
Query: 453 IPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA----------KAAW 502
IPSP KNV+K FLKKYWAKWKA+HTGR+RAMPHIKTFARYN Q LA KAAW
Sbjct: 633 IPSPMKNVEKAFLKKYWAKWKANHTGRTRAMPHIKTFARYNNQNLAWFCLTSSNLSKAAW 692
Query: 503 GALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVT 562
GALQKNNSQLMIRSYELGVL LPS + GCGFSCTSN+ S+ KS + ETS+++KTKLVT
Sbjct: 693 GALQKNNSQLMIRSYELGVLFLPSLLKPGCGFSCTSNVKQSKDKSPAQETSKMKKTKLVT 752
Query: 563 LTWHGSSDAGASSEVVY-LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 616
LT +SSEV+ LPVPYELPP YSSEDVPWSWD+RY KKD YGQVWPR
Sbjct: 753 LTAPTRDTTHSSSEVIIQLPVPYELPPLPYSSEDVPWSWDRRYFKKDDYGQVWPR 807
>gi|168038405|ref|XP_001771691.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676998|gb|EDQ63474.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 598
Score = 639 bits (1647), Expect = e-180, Method: Compositional matrix adjust.
Identities = 328/622 (52%), Positives = 423/622 (68%), Gaps = 47/622 (7%)
Query: 25 LPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLVVDGTNPVVVKSGDQRKKLSS 84
+ L +GPN IGR ++ ++K++SRKH+ L S+D + L V G NPVV+KSG ++KL
Sbjct: 1 IALFEGPNSIGRDDLVSANKQVSRKHVVLKTSSDCTFELSVIGQNPVVIKSGSGKRKLLP 60
Query: 85 NEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATNGELSSKKMRQQDEQ---DNE 141
N I+ GDIIE +PG +K +TL T ELS + + DE D E
Sbjct: 61 NARALISAGDIIEFLPGKMPYK-LTLE----------PTEDELSPRAANKLDEAFGVDYE 109
Query: 142 NGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMV 201
G S STFRL++V+GLP WAN CV+IR VIQGD+ VA+LSNYMV
Sbjct: 110 AGCRSS--------------STFRLMQVKGLPQWANKGCVNIRGVIQGDVQVALLSNYMV 155
Query: 202 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
DIDWLL ACP L +P V++ HGES G+LE ++ KP +W+LHKPPL +S+GTHH+KAM
Sbjct: 156 DIDWLLEACPRLKTVPSVVIFHGESGGSLELLQARKPNSWLLHKPPLRLSYGTHHTKAMF 215
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD-QNNLSEECGFENDLIDYLSTLK 320
L+YP G+RI+VHTANLI++DWNNKSQGLW QDFP K+ S+ FENDL++YL L+
Sbjct: 216 LLYPTGIRIVVHTANLIYIDWNNKSQGLWTQDFPYKNVAAGESKPSPFENDLVEYLQALE 275
Query: 321 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE 380
W A + G ++ +FF+KF++SSA VRL+ASVPGYH G +L KWGH+KLRT+LQE
Sbjct: 276 WTGCIAIISGIGEVHVDAAFFRKFDYSSAMVRLVASVPGYHLGRNLTKWGHLKLRTILQE 335
Query: 381 CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDV 440
FE+ FK SP VYQFSSLGSLDEKWM E SS+ +G + LG G IVWPTVED+
Sbjct: 336 QHFEEHFKGSPCVYQFSSLGSLDEKWMGEFGSSIQAGSTFGNEQLGPGPVQIVWPTVEDI 395
Query: 441 RCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA-- 498
R SLEGYAAG A+PSP KNV++ FL KYW +W+A HTGRSRA+PHIKTF RYN Q+LA
Sbjct: 396 RNSLEGYAAGGAVPSPLKNVERAFLSKYWYRWQADHTGRSRAIPHIKTFLRYNDQRLAWF 455
Query: 499 --------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG---FSCT--SNIVPSEI 545
KAAWG LQKN SQLMIRSYELGVL LPS + FSCT S+I+P E+
Sbjct: 456 LLTSSNLSKAAWGVLQKNGSQLMIRSYELGVLFLPSLVGNNSNVTPFSCTYSSSILPREL 515
Query: 546 KSGSTETS--QIQKTKLVTLTWHGSSDAGASSEV-VYLPVPYELPPQRYSSEDVPWSWDK 602
++ + Q++ TKLVTL+W S+ + ++ V LP+PY LPP +Y +D+PWSWD+
Sbjct: 516 QNREDDGGKRQLRHTKLVTLSWKSSNHEKSDMDIFVRLPIPYALPPVKYDPKDIPWSWDR 575
Query: 603 RYTKKDVYGQVWPRHFQLYAFQ 624
+Y + D++G+VWPR + Y Q
Sbjct: 576 QYREPDMFGEVWPRQVRRYTMQ 597
>gi|147781461|emb|CAN76118.1| hypothetical protein VITISV_033882 [Vitis vinifera]
Length = 592
Score = 631 bits (1627), Expect = e-178, Method: Compositional matrix adjust.
Identities = 307/430 (71%), Positives = 345/430 (80%), Gaps = 37/430 (8%)
Query: 148 EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL 207
EA+ +F + +D LP T+RLLRV+ LPAWANTS VSIRDVIQGD+++A+LSNYMVDIDWLL
Sbjct: 137 EAIRHFSIPKDNLPLTYRLLRVKDLPAWANTSSVSIRDVIQGDVLIAVLSNYMVDIDWLL 196
Query: 208 PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG 267
+CP LAKIPHVLVIHGE DGTL+HMK+NKP NWILHKPPLPISFGTHHSKAMLL+YPRG
Sbjct: 197 SSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPNWILHKPPLPISFGTHHSKAMLLVYPRG 256
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSAN 327
VR+IVHTANLI+VDWNNKSQGLWMQDFP K Q LS+ C FENDLIDYLS LKWPEF+AN
Sbjct: 257 VRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQKELSKGCAFENDLIDYLSVLKWPEFTAN 316
Query: 328 LPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF 387
LPA G+F IN SFFKKF++S+A VRLIASVPGYHTGS+LKKWGHMKL +VLQEC F+K F
Sbjct: 317 LPALGSFNINSSFFKKFDYSNAVVRLIASVPGYHTGSNLKKWGHMKLXSVLQECIFDKEF 376
Query: 388 KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLE-- 445
+KSPL YQFSSLGSLDEKWM EL+SSMSSG +DKTPLG+G+PLI+WPTVEDVRCSLE
Sbjct: 377 QKSPLAYQFSSLGSLDEKWMTELASSMSSGSCDDKTPLGLGKPLIIWPTVEDVRCSLEAH 436
Query: 446 ---------------------------GYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG 478
GYAAGNAIPSPQKNV+K+FLKKYWAKWKA+HTG
Sbjct: 437 ITCWIPGYLLGFYMCKFALHQSYYIVQGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTG 496
Query: 479 RSRAMPHIKTFARYNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTS 538
R + L+KAAWGALQKNNSQLMIRSYELGVL LPS G GFSCT
Sbjct: 497 RCWFL--------LTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPINRGQGFSCTD 548
Query: 539 NIVPSEIKSG 548
N PS++ G
Sbjct: 549 NGSPSKMFPG 558
>gi|302774643|ref|XP_002970738.1| hypothetical protein SELMODRAFT_11047 [Selaginella moellendorffii]
gi|300161449|gb|EFJ28064.1| hypothetical protein SELMODRAFT_11047 [Selaginella moellendorffii]
Length = 478
Score = 549 bits (1414), Expect = e-153, Method: Compositional matrix adjust.
Identities = 278/476 (58%), Positives = 348/476 (73%), Gaps = 18/476 (3%)
Query: 153 FHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPV 212
H +R P F+LLRVQGLP WAN CV I DVI+GD++VAILSNYMVDI+WLL ACP+
Sbjct: 8 LHSARS--PCGFQLLRVQGLPDWANAGCVRISDVIKGDVLVAILSNYMVDIEWLLSACPL 65
Query: 213 LAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIV 272
L IP V++IHGES+ + ++ KP+NW+L KP L IS+GTHHSKAMLL+YP GVR++V
Sbjct: 66 LRSIPQVVMIHGESN--VSQLQSVKPSNWLLFKPRLWISYGTHHSKAMLLVYPTGVRVVV 123
Query: 273 HTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG 332
HTANLI++DWNNK+QGLWMQDFP K ++ FENDL+DYL+ L+W + ++ HG
Sbjct: 124 HTANLINIDWNNKTQGLWMQDFPFKSMTGITTASDFENDLVDYLTALEWSGCTVDVQHHG 183
Query: 333 NFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPL 392
KIN +F+ F+FS+AAVRLI S+PGYH+G L KWGHMKLR++L+E F+K F+ SPL
Sbjct: 184 QMKINAIYFRNFDFSNAAVRLIGSIPGYHSGPQLNKWGHMKLRSILKEEKFDKKFQNSPL 243
Query: 393 VYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA 452
VYQFSSLGSLDEKWM E SSS+S G + D LG+GE I++PTVEDVR SLEGY AG A
Sbjct: 244 VYQFSSLGSLDEKWMEEFSSSLSEGSTLDGRRLGLGEAQIIFPTVEDVRQSLEGYRAGAA 303
Query: 453 IPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA----------KAAW 502
IPSP KNV+K LKKYW++W+A HTGRSRAMPHIKTF R+ LA KAAW
Sbjct: 304 IPSPAKNVEKPLLKKYWSRWQAEHTGRSRAMPHIKTFVRFRENALAWVCLTSSNLSKAAW 363
Query: 503 GALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLV 561
GALQKN +QLMIRSYELGV+ LPS + +SCT ++ P ++ + ET + KL
Sbjct: 364 GALQKNKTQLMIRSYELGVVFLPSMLSKFKNRYSCTEDL-PLINENEACETGEAPNVKLY 422
Query: 562 TLTWHGSSD--AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 615
TL S D +++++ LP+PY LPP RYSS+D PW WDK+Y DVYG+ WP
Sbjct: 423 TLAATESVDEEEDTNAKIIRLPLPYALPPPRYSSQDEPWKWDKQYLHPDVYGKRWP 478
>gi|302771966|ref|XP_002969401.1| hypothetical protein SELMODRAFT_170833 [Selaginella moellendorffii]
gi|300162877|gb|EFJ29489.1| hypothetical protein SELMODRAFT_170833 [Selaginella moellendorffii]
Length = 491
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 277/469 (59%), Positives = 347/469 (73%), Gaps = 19/469 (4%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
P F+LLRVQGLP WAN CV I DVI+GD++VAILSNYMVDI+WLL ACP+L IP V+
Sbjct: 27 PCGFQLLRVQGLPDWANAGCVRISDVIKGDVLVAILSNYMVDIEWLLSACPLLRSIPQVV 86
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
+IHGES+ + ++ KP+NW+L KP L IS+GTHHSKAMLL+YP GVR++VHTANLI++
Sbjct: 87 MIHGESN--VSQLQSVKPSNWLLFKPRLWISYGTHHSKAMLLVYPTGVRVVVHTANLINI 144
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF 340
DWNNK+QGLWMQDFPLK ++ FENDL+DYL+ L+W + ++ HG KIN S+
Sbjct: 145 DWNNKTQGLWMQDFPLKSMTGITTASDFENDLVDYLTALEWSGCTVDVQHHGQMKINASY 204
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
F+ F+FS+AAVRLI S+PGYH+G L KWGHMKLR++L+E F+K F+ SPLVYQFSSLG
Sbjct: 205 FRNFDFSNAAVRLIGSIPGYHSGPQLNKWGHMKLRSILKEEKFDKKFQNSPLVYQFSSLG 264
Query: 401 SLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV 460
SLDEKWM E SSS+S G + D LG+GE I++PTVEDVR SLEGY AG AIPSP KNV
Sbjct: 265 SLDEKWMEEFSSSLSEGSTLDGRRLGLGEAQIIFPTVEDVRQSLEGYRAGAAIPSPAKNV 324
Query: 461 DKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA----------KAAWGALQKNNS 510
+K LKKYW++W+A HTGRSRAMPHIKTF R+ LA KAAWGALQKN +
Sbjct: 325 EKPLLKKYWSRWQAEHTGRSRAMPHIKTFVRFRENALAWVCLTSSNLSKAAWGALQKNKT 384
Query: 511 QLMIRSYELGVLILPSA-KRHGCGFSCTSNI-VPSEIKSGSTETSQIQKTKLVTLTWHGS 568
QLMIRSYELGV+ LPS + +SCT ++ + +E ++ T + KL TL S
Sbjct: 385 QLMIRSYELGVVFLPSMLSKFKNRYSCTEDLPLINENEACKTGAPNV---KLYTLAATES 441
Query: 569 SD--AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 615
D +++++ LP+PY LPP RYSS+D PW WDK+Y DVYG+ WP
Sbjct: 442 MDEEEDTNAKIIRLPLPYALPPPRYSSQDEPWKWDKQYLHPDVYGKRWP 490
>gi|303279543|ref|XP_003059064.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458900|gb|EEH56196.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 520
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 186/531 (35%), Positives = 271/531 (51%), Gaps = 90/531 (16%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
P FRL +G+ A AN CVSI DV++G + AI+ N+ VD+DW L ACP L V+
Sbjct: 1 PPAFRLWSTEGVTADANAGCVSISDVVRGSVRWAIVMNFTVDLDWFLAACPALRTARRVI 60
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
+++G + + P +W HKPP P +GTHH+KA +L Y GVR+++HTANL H
Sbjct: 61 LMYGNMHPGVAEI----PKHWSTHKPPCP-QYGTHHTKAFILAYDAGVRVVIHTANLTHH 115
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF 340
D+N Q +W QDFPLK +++ FENDL+ Y+S L+W S + +++P
Sbjct: 116 DFNKSCQAVWYQDFPLKRESS-PPGSAFENDLVRYVSRLQWSGESVD-----GERVSPEA 169
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
++++FS A V+LIASVPG H G L++WGHM +RT L+ T + FK S ++ Q++S G
Sbjct: 170 LRRYDFSGAGVKLIASVPGRHAGEELRRWGHMAVRTALERETHDDAFKGSSVLCQYTSTG 229
Query: 401 SLDEKWMAE------------LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYA 448
SL +KW+ E S G + + LG GE ++WPTVE++R GYA
Sbjct: 230 SLPKKWLDEEFRDSLCAGACAGGGGGSVGGNANDRSLGPGEMQLLWPTVEEIRTCDVGYA 289
Query: 449 AGNAIPSPQKNVDKDFLKKYWAKWK---------ASHTGRSRAMPHIKTFARY------- 492
AG +IP KNV + L + + KW A GR + MPHIKTF+RY
Sbjct: 290 AGGSIPGNGKNVRRPHLTEKFHKWAKPNDDDDDDAHPMGRRKHMPHIKTFSRYYDALTPY 349
Query: 493 --------------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPS------ 526
L+ AAWG L+ SQ+ + SYELGV+ LPS
Sbjct: 350 QKKRGGGGGVAGAKFAYVIVCSHNLSGAAWGKLEHGGSQIHVYSYELGVMFLPSLIGART 409
Query: 527 -------AKRHGCGFSCTSNIVP------SEIKSGSTETSQIQKTKLVTLTWHGSSDA-- 571
+ F C + + P + + ++E + + L G++ A
Sbjct: 410 AKPFSALSATEADPFRCLAAVRPRATTTATATATATSEGAVVLTHALTLARPPGAATATT 469
Query: 572 --GASSEVVYLPVPYELPPQRYS--------SEDVPWSWDKRYTKKDVYGQ 612
G S+ + P+PY +PP RY+ D PW WD+RY D +G+
Sbjct: 470 ASGPSATLALCPLPYNVPPLRYNLDDNAPLLERDEPWVWDQRYDVADEWGR 520
>gi|384252305|gb|EIE25781.1| tyrosyl-DNA phosphodiesterase [Coccomyxa subellipsoidea C-169]
Length = 502
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 185/493 (37%), Positives = 274/493 (55%), Gaps = 53/493 (10%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVS--IRDVIQGDIIVAIL-SNYMVDIDWLLPACPVLAKI 216
+P LLRV+GLP + + ++D++ G + +L SN+M+D+ W + A P +
Sbjct: 2 IPPVASLLRVRGLPEQFSRGALGTQLKDLLSGGPMRWLLISNFMIDMRWFVSAAPSVLDA 61
Query: 217 PHVLVIHGE-----SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRII 271
V V+HGE S ++ + +P W++H+ P+ +G HHSKA L+ + RG+R++
Sbjct: 62 DRVTVVHGEKSNPTSVSWMQQIAAGRP--WVIHQARCPLQYGVHHSKAFLVQFDRGLRVV 119
Query: 272 VHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLP 329
VHTANLIH D N K+QGLW QDFP KD+ + + FE L DY++ L+ P A
Sbjct: 120 VHTANLIHQDCNCKTQGLWYQDFPRKDERSPQDNASRLFETTLSDYIAALRLPAREAQ-- 177
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKK 389
H I + +FSSA LI SVPGYH G++ +K+GHM +R++L F+ F++
Sbjct: 178 -HAQQVI-----AQHDFSSARAHLIPSVPGYHQGAAKQKYGHMLVRSLLARQRFDPVFRR 231
Query: 390 SPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL-------IVWPTVEDVRC 442
SP+V QFSSLGS+ W++E S+++G D P G L +VWPTVE+V+
Sbjct: 232 SPIVAQFSSLGSITGAWLSEFRESLAAGDCWDSNPSGSAGRLGPAADFRVVWPTVEEVKN 291
Query: 443 SLEGYAAGNAIPSPQKNVDKD-------FLKKYWAKWKA--SHTGRSRAMPHIKTFARYN 493
S+EG+ AG +IP NV K L+ +W ++ + GR AMPHIK++ R++
Sbjct: 292 SVEGWFAGCSIPGTHANVLKTDKGLSTPILQPFWCRFDGAPATAGRQHAMPHIKSYLRHS 351
Query: 494 GQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSA----KRH-GCGFSCTS 538
GQ+LA KAAWG LQKNN+QL I YELGVL+LPS +RH GFSCT+
Sbjct: 352 GQRLAYIVLTSHNLSKAAWGVLQKNNTQLHIMHYELGVLLLPSLEESYRRHRHFGFSCTA 411
Query: 539 NIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 598
S + + + S+++ S +E + + +PY+LPP RY +D PW
Sbjct: 412 PA--SHKPAAAAQPSRVEFWAADGAAAGSSEALSTGAEKLEILLPYQLPPVRYGPQDQPW 469
Query: 599 SWDKRYTKKDVYG 611
+ D G
Sbjct: 470 MTGVEFPGLDSQG 482
>gi|255087474|ref|XP_002505660.1| predicted protein [Micromonas sp. RCC299]
gi|226520930|gb|ACO66918.1| predicted protein [Micromonas sp. RCC299]
Length = 536
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 183/509 (35%), Positives = 259/509 (50%), Gaps = 60/509 (11%)
Query: 161 PSTFRLLRVQGLPAWANTS----CVSIRDVIQGDIIVAILSNYMVDIDWLLP--ACPVLA 214
P FRLL NTS CVS+RD++ G + ++ N+M+D+ WLL CP L
Sbjct: 20 PPLFRLLTTDPADLNPNTSGNAGCVSLRDIVSGPVRWCVVMNFMIDLPWLLSPDGCPELL 79
Query: 215 KIPHVLVIHGESDGTL----EHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRI 270
+IP V+ I E E ++ +W + PP P FGTHH+K +L+Y GVR+
Sbjct: 80 RIPKVVWIGDERSSPTPRDPEFLRLKGERDWTVVNPPCP-KFGTHHTKCFILVYDTGVRV 138
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLP- 329
VHTANLIH D ++ W QDFP K +L FE DL YL+TL W + + LP
Sbjct: 139 CVHTANLIHGDVRKRTNAAWCQDFPNKSAAHLGRSSEFERDLGRYLATLGWKDETCALPG 198
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKK 389
A G+ + PS +F+FS A +LIASVPG GS++ +GH +R L TF FK+
Sbjct: 199 AGGDVVVGPSAMSRFDFSGAGAKLIASVPGRWVGSAMMNYGHTSVRHALAGMTFPGVFKR 258
Query: 390 SPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP--------LGIGEPLIVWPTVEDVR 441
+P+V QF+S+G+ EKWM E++ S +G +E LG G+ +VWPT+ +VR
Sbjct: 259 APVVCQFTSVGATTEKWMGEMARSFGAGATETDDANEWPGGPCLGDGDLRLVWPTMGEVR 318
Query: 442 CSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA------------------SHTGRSRAM 483
S GY G +IP + ++ +++ +W+ TGR R M
Sbjct: 319 GSNLGYVTGGSIPGATDKISREHVRRRLHRWRGDVGATRGTKLLDHPPASTDPTGRGRVM 378
Query: 484 PHIKTFARY-------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSA--- 527
PH+KTFARY L+ AAWG L+KN +Q+ I SYELGVL+ P +
Sbjct: 379 PHVKTFARYAPNAPHHLAWVIVGSHNLSGAAWGRLEKNETQIAILSYELGVLLSPRSIGK 438
Query: 528 KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA--GASSE-VVYLPVPY 584
R F+CT V G + ++ + G D+ G S E V + P+PY
Sbjct: 439 TRVAAPFTCTPGAVSHR---GEVVPRCLGGVRISAASDDGPGDSPPGDSREFVAFAPLPY 495
Query: 585 ELPPQRYSSEDVPWSWDKRYTKKDVYGQV 613
+PP Y+ D PW+ D D YG+V
Sbjct: 496 RVPPVPYAPSDAPWAVDAWDETPDKYGRV 524
>gi|302833870|ref|XP_002948498.1| hypothetical protein VOLCADRAFT_88920 [Volvox carteri f.
nagariensis]
gi|300266185|gb|EFJ50373.1| hypothetical protein VOLCADRAFT_88920 [Volvox carteri f.
nagariensis]
Length = 1521
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 174/422 (41%), Positives = 233/422 (55%), Gaps = 67/422 (15%)
Query: 162 STFRLLRVQGLPAWANTSC--VSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHV 219
S LLRV+GL NT C V +R V+ G + +A++SNYM+D+ WLL CP LAK
Sbjct: 122 SPVHLLRVRGLSPRYNTGCLGVDLRHVVSGPLQLALVSNYMIDMGWLLSCCPDLAKARQF 181
Query: 220 LVIHGESDGTLEHMKRNKP----ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
V+HGE M++ A+ LH+PPLPI +GTHHSKA LL Y G+R+I+HTA
Sbjct: 182 FVVHGEGPDAEPEMRQQAAEAGAAHVRLHRPPLPIMYGTHHSKAFLLAYSTGLRLIIHTA 241
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-FENDLIDYLSTLKWPEFSANLPAHGNF 334
N ++ D N+K+QGLW+QDFP KD + FE DL+ Y L P AN
Sbjct: 242 NCVYPDCNDKTQGLWVQDFPRKDTVAAAAPVSTFEQDLVAYFRALALPPAMAN------- 294
Query: 335 KINPSF--FKKFNFSSAAVRLIASVPGYHTGSS-LKKWGHMKLRTVLQECTFEKGFKKSP 391
P F +FS A L+ASVPGYH G++ ++ +GHM+LR +L++ F
Sbjct: 295 ---PLFEAIAMHDFSFARGTLVASVPGYHRGTAAVQSYGHMRLRRLLEQVPLPSCFAAEG 351
Query: 392 ----------------LVYQFSSLGSLDEKWMA-ELSSSMSS------------------ 416
L+ Q SS+GS D+ W+ E+ +S+++
Sbjct: 352 SSCGTASSSSAVPPEGLIIQCSSMGSFDQAWLVDEMGASLAACRRQPPPPPPPPRPLAAA 411
Query: 417 --GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA 474
G +VWPTVE+VR S+EG+ AG +IP P +NV K F+ +Y+A+W
Sbjct: 412 PPPRPSGPPGCGPLPLAVVWPTVEEVRNSIEGWNAGRSIPGPSRNVSKPFMGRYYARWGG 471
Query: 475 SHTGRSRAMPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLIL 524
GR RAMPHIKT+ RY GQ+LA KAAWG LQKN SQLMIRSYELGVL+
Sbjct: 472 EAVGRQRAMPHIKTYTRYRGQQLAWFLVTSHNLSKAAWGELQKNGSQLMIRSYELGVLVT 531
Query: 525 PS 526
P+
Sbjct: 532 PA 533
>gi|332223510|ref|XP_003260916.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Nomascus
leucogenys]
Length = 608
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 197/575 (34%), Positives = 291/575 (50%), Gaps = 82/575 (14%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K SS E + S +D ++ +P K V SNDGA +G +
Sbjct: 75 KRQKSSSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISASNDGAAQRTENHGPPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G E + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSG----EVQDIWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKTPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTP 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DIIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPDAESWPVVGQFSSIGSLGGDESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E+KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKENKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|410962801|ref|XP_003987957.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Felis catus]
Length = 608
Score = 275 bits (703), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 177/484 (36%), Positives = 254/484 (52%), Gaps = 71/484 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 163 PFRFYLTRVSGIEPKDNSGALHIKDILSPLFGTLLSSAQFNYCFDVDWLVKQYPPQFRKK 222
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 223 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 282
Query: 276 NLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ + Q + F+ DLI YL P +
Sbjct: 283 NLIHADWHQKTQGIWLSPLYPRVVHGTQRSGDSTTHFKADLISYLMAYNAPSLKEWI--- 339
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKK 389
++ + S V LI S PG GS WGH +LR +L+E + KG +
Sbjct: 340 -------DVIQEHDLSETNVYLIGSTPGRFQGSQKDHWGHFRLRKLLKEHASSIPKG-ES 391
Query: 390 SPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GS+ + KW+ +E S+ + E +TP PL +++P+VE+VR SL
Sbjct: 392 WPIVGQFSSIGSMGADESKWLCSEFKESLVTQGKESRTPGKSAAPLHLIYPSVENVRTSL 451
Query: 445 EGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN---------- 493
EGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R +
Sbjct: 452 EGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRLSPDFSQIAWFL 511
Query: 494 --GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 551
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V + SGS E
Sbjct: 512 VTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFRVKQKFFSGSKE 565
Query: 552 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 610
+ PVPY+LPP+ Y S+D PW W+ YTK D +
Sbjct: 566 PTS------------------------SFPVPYDLPPELYGSKDRPWIWNIPYTKAPDTH 601
Query: 611 GQVW 614
G +W
Sbjct: 602 GNMW 605
>gi|297695684|ref|XP_002825063.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Pongo abelii]
gi|297695686|ref|XP_002825064.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Pongo abelii]
Length = 608
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 194/575 (33%), Positives = 290/575 (50%), Gaps = 82/575 (14%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDGA +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGAAQRTENHGPPT 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSRALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPQIVDGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPDAESWPVVGQFSSIGSLGSDESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E+KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKENKTPGKTSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LPSA F S V + GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFVGSQEP------------------------MATF 570
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|403298195|ref|XP_003939917.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403298197|ref|XP_003939918.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 605
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 182/483 (37%), Positives = 254/483 (52%), Gaps = 70/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 161 PYQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPREFRKK 220
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 221 PVLLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 280
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 281 NLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISYLMAYNAPSLKEWI--- 337
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K + S V LI S PG GS WGH +LR +L++ +S
Sbjct: 338 -------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKDHASSIPNAESW 390
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 391 PVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 450
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRSRAMPHIKT+ R
Sbjct: 451 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSRAMPHIKTYMRPSPDFSRIAWFLI 510
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V + +GS E
Sbjct: 511 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFAGSQEP 564
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 565 -------------------------MPFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 599
Query: 612 QVW 614
+W
Sbjct: 600 NMW 602
>gi|296215712|ref|XP_002754236.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Callithrix
jacchus]
Length = 606
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 181/483 (37%), Positives = 254/483 (52%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 161 PYQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPREFRKK 220
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 221 PILLVHGDKREAKAHLHAEAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 280
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P + A
Sbjct: 281 NLIHADWHQKTQGVWLSPLYPRIVDGTHKSGESITHFKADLISYLMAYNAPSLKEWIDA- 339
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
+ + S V LI S PG GS WGH +LR VL++ ++S
Sbjct: 340 ---------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKVLKDHASSIPNEESW 390
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 391 PVVGQFSSIGSLGADESKWLCSEFKESMLALGKESKTPGKSSVPLYLIYPSVENVRTSLE 450
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 451 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLI 510
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V + +GS E
Sbjct: 511 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFAGSQEP 564
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 565 ------------------------MTTFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 600
Query: 612 QVW 614
+W
Sbjct: 601 NMW 603
>gi|397525717|ref|XP_003832802.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Pan paniscus]
gi|397525719|ref|XP_003832803.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Pan paniscus]
Length = 608
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 193/575 (33%), Positives = 289/575 (50%), Gaps = 82/575 (14%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQRQAEKVVIKKEKDISAPNDGTAQRTENHGPPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFEESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|350539189|ref|NP_001233557.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|343962149|dbj|BAK62662.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410225564|gb|JAA10001.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410265878|gb|JAA20905.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410301400|gb|JAA29300.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
Length = 608
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 193/575 (33%), Positives = 289/575 (50%), Gaps = 82/575 (14%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQRRAEKVVIKKEKDISAPNDGTAQRTENHGPPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|20127586|ref|NP_060789.2| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|57242805|ref|NP_001008744.1| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|37999797|sp|Q9NUW8.2|TYDP1_HUMAN RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1
gi|15930062|gb|AAH15474.1| Tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|85725382|gb|ABC79301.1| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|119601820|gb|EAW81414.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
gi|119601821|gb|EAW81415.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
gi|119601822|gb|EAW81416.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
Length = 608
Score = 271 bits (694), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 193/575 (33%), Positives = 289/575 (50%), Gaps = 82/575 (14%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|7023536|dbj|BAA91997.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 271 bits (694), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 193/575 (33%), Positives = 289/575 (50%), Gaps = 82/575 (14%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNPESWPVVGQFSSVGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|189054943|dbj|BAG37927.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 192/575 (33%), Positives = 289/575 (50%), Gaps = 82/575 (14%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E +M
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKENM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|73964387|ref|XP_547950.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Canis lupus familiaris]
Length = 609
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 178/484 (36%), Positives = 254/484 (52%), Gaps = 71/484 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKK 223
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 224 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 283
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + + S E F+ DLI YL +
Sbjct: 284 NLIHADWHQKTQGIWLSPLYPRMAQATHRSGESATHFKADLISYLMAYNAAPLKEWIDT- 342
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKK 389
+ + S V LI S PG GS WGH +LR +L+E + KG +
Sbjct: 343 ---------IHEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLRKLLREHASSITKG-ES 392
Query: 390 SPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GS+ D KW+ +E S+ + E +TP PL +++P+VE+VR SL
Sbjct: 393 WPIVGQFSSIGSMGADDSKWLCSEFKESLVTLGKESRTPGKSAVPLHLIYPSVENVRTSL 452
Query: 445 EGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------ 491
EGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 453 EGYPAGGSLPYSIQTAEKQNWLHSYFHKWMADTSGRSNAMPHIKTYMRSSPDFSQIAWFL 512
Query: 492 YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 551
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V + SGS E
Sbjct: 513 VTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGSKE 566
Query: 552 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 610
+ PVPY+LPP+ Y ++D PW W+ YTK D +
Sbjct: 567 PA------------------------AAFPVPYDLPPELYGNKDRPWIWNIPYTKAPDTH 602
Query: 611 GQVW 614
G +W
Sbjct: 603 GNMW 606
>gi|417403294|gb|JAA48458.1| Putative tyrosyl-dna phosphodiesterase [Desmodus rotundus]
Length = 611
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 177/485 (36%), Positives = 254/485 (52%), Gaps = 73/485 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N++ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 166 PFQFYLTRVSGIKPKYNSAALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKK 225
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HTA
Sbjct: 226 PILLVHGDKREAKAHLHAEAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTA 285
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQ----NNLSEECG--FENDLIDYLSTLKWPEFSANLP 329
NLI DW+ K+QG+W+ PL + ++S E F+ DLI YL+ P + +
Sbjct: 286 NLICADWHQKTQGIWLS--PLYPRVACGTHMSGESATHFKADLISYLTAYNAPPLNEWI- 342
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFK 388
+ + S V LI S PG GS WGH +LR +L+E + G +
Sbjct: 343 ---------DIIRDHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASSTPGAE 393
Query: 389 KSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GS+ KW+ +E ++++ E + P PL +++P+VE+VR S
Sbjct: 394 AWPVVGQFSSIGSMGADASKWLCSEFKETLATLGKESRAPGKGVTPLHLIYPSVENVRTS 453
Query: 444 LEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR----------- 491
LEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 454 LEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSHAMPHIKTYMRPSPDFGRIAWF 513
Query: 492 -YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V SGS
Sbjct: 514 LVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFQVKQRFFSGSQ 567
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 609
E + PVPY+LPP+ Y S+D PW W+ YTK D
Sbjct: 568 EPA------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYTKAPDT 603
Query: 610 YGQVW 614
+G +W
Sbjct: 604 HGNMW 608
>gi|351706738|gb|EHB09657.1| Tyrosyl-DNA phosphodiesterase 1 [Heterocephalus glaber]
Length = 655
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 184/507 (36%), Positives = 267/507 (52%), Gaps = 70/507 (13%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 163 PFQFYLTRVSGIKPKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 222
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP AN L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 223 PILLVHGDKREAKAHLHAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 282
Query: 276 NLIHVDWNNKSQGLWMQD-FPLKD---QNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
N+I DW+ K+QG+W+ +P D Q + + F+ DLI YL+ P +
Sbjct: 283 NIIREDWHQKTQGIWLSPLYPRIDHGTQGSGESKTHFKADLISYLTAYNAPPLQEWI--- 339
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKS 390
++ + S V LI S PG GS WGH +LR +L+E T +
Sbjct: 340 -------DTIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHGTSIPKAECW 392
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
PLV QFSS+GSL + KW+ +E S+ + +E+KTP PL +++P+VE+VR SLE
Sbjct: 393 PLVGQFSSIGSLGADESKWLCSEFKESLLTQGAENKTPGKSSIPLHLIYPSVENVRTSLE 452
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN----------- 493
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R +
Sbjct: 453 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYMRLSPNSSRIAWFLV 512
Query: 494 -GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWG L+KN +QLMIRSYELGVL LPSA F S V + SGS E
Sbjct: 513 TSANLSKAAWGVLEKNGTQLMIRSYELGVLFLPSA------FGLASFKVKQKFSSGSQEL 566
Query: 553 S-----------QIQKTKLVTLTWHGSSDAGASSEVVY-------------LPVPYELPP 588
+ ++ +K T G+ G +S V PVPY+LPP
Sbjct: 567 APPFPVPYDLPPELYGSKGETWA-QGTMGGGLASFKVKQKFSSGSQELAPPFPVPYDLPP 625
Query: 589 QRYSSEDVPWSWDKRYTKK-DVYGQVW 614
+ Y S+D PW W+ Y K D +G +W
Sbjct: 626 ELYGSKDRPWIWNIPYVKAPDRHGNMW 652
>gi|49258603|pdb|1QZQ|A Chain A, Human Tyrosyl Dna Phosphodiesterase
gi|49258604|pdb|1QZQ|B Chain B, Human Tyrosyl Dna Phosphodiesterase
Length = 483
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 179/483 (37%), Positives = 253/483 (52%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 38 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 97
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 98 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 157
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 158 NLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISYLMAYNAPSLKEWI--- 214
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K + S V LI S PG GS WGH +L+ +L++ +S
Sbjct: 215 -------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESW 267
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 268 PVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 327
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 328 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 387
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V + +GS E
Sbjct: 388 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFAGSQEP 441
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 442 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 477
Query: 612 QVW 614
+W
Sbjct: 478 NMW 480
>gi|449280745|gb|EMC87981.1| Tyrosyl-DNA phosphodiesterase 1 [Columba livia]
Length = 604
Score = 268 bits (685), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 173/484 (35%), Positives = 258/484 (53%), Gaps = 68/484 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L +V G+ N+ + I+D++ G ++ + NY D+ WL+ P +
Sbjct: 156 PFRFFLTKVTGIEQSYNSGALHIKDILSPLFGTLVSSAQFNYCFDVGWLVRQYPQEFRKK 215
Query: 218 HVLVIHGES-DGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HGE + E + + +P I + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 216 PLLIVHGEKRESKAELVAQARPYEHISFCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 275
Query: 276 NLIHVDWNNKSQGLWMQD-FPLKDQNNL----SEECGFENDLIDYLSTLKWPEFSANLPA 330
NLI DW+ K+QG+W+ +P Q E F++DLI YL+ P +
Sbjct: 276 NLIAEDWHQKTQGIWLSPLYPRLPQGTTGSAGESETNFKSDLISYLTAYNSPTLKEWI-- 333
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 390
++ + S V L+ S PG + GS +KWGH++LR +L++ ++S
Sbjct: 334 --------DLIQEHDLSETRVYLLGSTPGRYQGSDKEKWGHLRLRKLLKDHASSIPARES 385
Query: 391 -PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GSL KW+ +E S+ + S TPL P+ +V+PTV++VR SL
Sbjct: 386 WPVVGQFSSIGSLGVDGSKWLCSEFQESLVAAGSSVTTPLKCDVPIHLVYPTVDNVRQSL 445
Query: 445 EGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR--YNGQKLA--- 498
EGY AG ++P + K +L Y+ KW AS +GRS A+PHIKT+ R + QK+A
Sbjct: 446 EGYPAGGSLPYSIQTAQKQLWLHSYFHKWAASISGRSHAIPHIKTYMRPSPDFQKIAWFL 505
Query: 499 -------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 551
KAAWGAL+K+ +QLMIRSYELGVL LPSA G+ C SE K +T
Sbjct: 506 VTLANLSKAAWGALEKSGTQLMIRSYELGVLFLPSAFGLDKGYFCVRGKTLSESKESAT- 564
Query: 552 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 610
Y PVPY+LPP++Y S+D PW W+ +T D +
Sbjct: 565 ---------------------------YFPVPYDLPPEQYGSKDQPWIWNIPHTDAPDTH 597
Query: 611 GQVW 614
G +W
Sbjct: 598 GNMW 601
>gi|383873205|ref|NP_001244708.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
gi|355693501|gb|EHH28104.1| hypothetical protein EGK_18452 [Macaca mulatta]
gi|380814614|gb|AFE79181.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
gi|383419927|gb|AFH33177.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
Length = 603
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 178/483 (36%), Positives = 253/483 (52%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 158 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 217
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 218 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 277
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 278 NLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISYLMAYNAPSLKEWIDT- 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
+ + S V LI S PG GS WGH +LR +L++ +S
Sbjct: 337 ---------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKDHASSIPNAESW 387
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 388 PVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 447
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 448 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 507
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F + V + +GS E
Sbjct: 508 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDNFKVKQKFFAGSQEP 561
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 562 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 597
Query: 612 QVW 614
+W
Sbjct: 598 NMW 600
>gi|402876919|ref|XP_003902197.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Papio anubis]
gi|402876921|ref|XP_003902198.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Papio anubis]
Length = 603
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 178/483 (36%), Positives = 253/483 (52%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 158 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 217
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 218 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 277
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 278 NLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISYLMAYNAPSLKEWIDT- 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
+ + S V LI S PG GS WGH +LR +L++ +S
Sbjct: 337 ---------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKDHASSIPNAESW 387
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 388 PVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 447
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 448 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 507
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F + V + +GS E
Sbjct: 508 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDNFKVKQKFFAGSQEP 561
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 562 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 597
Query: 612 QVW 614
+W
Sbjct: 598 NMW 600
>gi|355778790|gb|EHH63826.1| hypothetical protein EGM_16873 [Macaca fascicularis]
Length = 603
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 178/483 (36%), Positives = 253/483 (52%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 158 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 217
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 218 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 277
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 278 NLIHADWHQKTQGIWLSPLYPRIVDGTHESGESTTHFKADLISYLMAYNAPSLKEWIDT- 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
+ + S V LI S PG GS WGH +LR +L++ +S
Sbjct: 337 ---------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKDHASSIPNAESW 387
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 388 PVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 447
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 448 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 507
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F + V + +GS E
Sbjct: 508 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDNFKVKQKFFAGSQEP 561
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 562 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 597
Query: 612 QVW 614
+W
Sbjct: 598 NMW 600
>gi|301123067|ref|XP_002909260.1| tyrosyl-DNA phosphodiesterase, putative [Phytophthora infestans
T30-4]
gi|262100022|gb|EEY58074.1| tyrosyl-DNA phosphodiesterase, putative [Phytophthora infestans
T30-4]
Length = 1123
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 159/397 (40%), Positives = 215/397 (54%), Gaps = 64/397 (16%)
Query: 158 DKLPST--FRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAK 215
D PS F L R++ PA N + D+++GD +L+NYM D+ WL CP L +
Sbjct: 20 DTTPSELGFYLNRLKTAPASHNLHAKRLSDLLEGDFSRCLLTNYMFDLPWLFTECPRLKE 79
Query: 216 IPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+P VLV HGE D + +N PPLPI +GTHH+K ++ +YP VR+ + TA
Sbjct: 80 VPVVLV-HGERDRQGMTKECRDYSNVTPVAPPLPIPYGTHHTKMLVALYPERVRVAIFTA 138
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQNNLSEE---------CGFENDLIDYLSTLKWPEFSA 326
N + DWN K+QGLW QDF LK + EE FE DL+ YLS+L P
Sbjct: 139 NFLSNDWNTKTQGLWYQDFGLKVLTDSDEEEKEAVAKSSSDFEADLVHYLSSLGAP---- 194
Query: 327 NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKG 386
K+ K+F+FSSA V L+ SVPG H G ++K+GH+++R
Sbjct: 195 -------VKLFCGELKRFDFSSARVALVPSVPGVHKGKDMEKYGHLRVR----------- 236
Query: 387 FKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSE-DKTPLGIGEPLIVWPTVEDVRCSL 444
+LGSLDEKW+ E + S+ G T + + ++WP VEDVR SL
Sbjct: 237 -----------NLGSLDEKWLFGEFAESLLPGKKHISSTSMPVQALHVIWPAVEDVRNSL 285
Query: 445 EGYAAGNAIPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAMPHIKTFARYNGQK------- 496
EG+ +G +IP P KN+ K FL KY KW + R AMPHIK++AR+N +
Sbjct: 286 EGWNSGRSIPCPLKNM-KPFLHKYLRKWMPPAELHRQNAMPHIKSYARFNASEDKAGELD 344
Query: 497 --------LAKAAWGALQKNNSQLMIRSYELGVLILP 525
L+KAAWG+LQKN +Q MIRSYELGV+ LP
Sbjct: 345 WAIVTSSNLSKAAWGSLQKNKTQFMIRSYELGVMFLP 381
>gi|28373796|pdb|1MU7|A Chain A, Crystal Structure Of A Human Tyrosyl-dna Phosphodiesterase
(tdp1)- Tungstate Complex
gi|28373797|pdb|1MU7|B Chain B, Crystal Structure Of A Human Tyrosyl-dna Phosphodiesterase
(tdp1)- Tungstate Complex
gi|28373798|pdb|1MU9|A Chain A, Crystal Structure Of A Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)- Vanadate Complex
gi|28373799|pdb|1MU9|B Chain B, Crystal Structure Of A Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)- Vanadate Complex
gi|29726730|pdb|1NOP|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1) In Complex With Vanadate, Dna And A Human
Topoisomerase I-Derived Peptide
gi|29726731|pdb|1NOP|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1) In Complex With Vanadate, Dna And A Human
Topoisomerase I-Derived Peptide
gi|46015472|pdb|1RFF|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octapeptide Klnyydpr, And
Tetranucleotide Agtt.
gi|46015473|pdb|1RFF|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octapeptide Klnyydpr, And
Tetranucleotide Agtt.
gi|46015478|pdb|1RFI|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Pentapeptide Klnyk, And
Tetranucleotide Agtc
gi|46015479|pdb|1RFI|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Pentapeptide Klnyk, And
Tetranucleotide Agtc
gi|46015488|pdb|1RG1|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtt
gi|46015489|pdb|1RG1|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtt
gi|46015492|pdb|1RG2|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agta
gi|46015493|pdb|1RG2|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agta
gi|46015502|pdb|1RGT|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtc
gi|46015503|pdb|1RGT|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtc
gi|46015506|pdb|1RGU|A Chain A, The Crystal Structure Of Human Tyrosyl-dna
Phosphodiesterase Complexed With Vanadate, Octopamine,
And Tetranucleotide Agtg
gi|46015507|pdb|1RGU|B Chain B, The Crystal Structure Of Human Tyrosyl-dna
Phosphodiesterase Complexed With Vanadate, Octopamine,
And Tetranucleotide Agtg
gi|46015511|pdb|1RH0|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine And Trinucleotide
Gtt
gi|46015512|pdb|1RH0|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine And Trinucleotide
Gtt
Length = 485
Score = 266 bits (679), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 177/483 (36%), Positives = 253/483 (52%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 40 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 99
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 100 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 159
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ +LI YL+ P +
Sbjct: 160 NLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKANLISYLTAYNAPSLKEWI--- 216
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K + S V LI S PG GS WGH +L+ +L++ +S
Sbjct: 217 -------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESW 269
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 270 PVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 329
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 330 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 389
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+KN +QLMIRSYELGVL LPSA S V + +GS E
Sbjct: 390 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------LGLDSFKVKQKFFAGSQEP 443
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 444 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 479
Query: 612 QVW 614
+W
Sbjct: 480 NMW 482
>gi|348573481|ref|XP_003472519.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cavia porcellus]
Length = 607
Score = 265 bits (677), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 183/523 (34%), Positives = 264/523 (50%), Gaps = 76/523 (14%)
Query: 122 ATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCV 181
NG +S +++++DE + S E + + P F L RV G+ N+ +
Sbjct: 128 GNNGLPASHRLKEEDEYET-----SGEGQDIWDMLDKGNPFQFYLTRVSGIKPKYNSKAL 182
Query: 182 SIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKR 235
I+D++ G ++ + NY D+DWL+ P + +L++HG E+ L H +
Sbjct: 183 HIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKADL-HAQA 241
Query: 236 NKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-F 294
AN L + L I+FGTHH+K MLL+Y G R+++HT+N+I DW+ K+QG+W+ +
Sbjct: 242 KPYANVSLCQAKLDIAFGTHHTKMMLLLYEEGFRVVIHTSNIIREDWHQKTQGIWLSPLY 301
Query: 295 PLKD---QNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P D Q + F+ DLI YL P + ++ + S V
Sbjct: 302 PRLDPGSQKSGESRTHFKADLISYLMAYNAPPLKEWIDT----------IREHDLSETNV 351
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKSPLVYQFSSLGSL---DEKWM 407
LI S PG GS WGH KLR +L+E T + PLV QFSS+GSL + KW+
Sbjct: 352 YLIGSTPGRFQGSQKDNWGHFKLRKLLKEHGTPVPKTECWPLVGQFSSIGSLGADESKWL 411
Query: 408 -AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-F 464
+E S+ + E+K P PL +++P+VE+VR SLEGY AG ++P + +K +
Sbjct: 412 CSEFKESLLTLGPENKIPGKSSVPLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQKW 471
Query: 465 LKKYWAKWKASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQL 512
L Y+ KW A +GRS AMPHIKT+ R L+KAAWGAL+KN +QL
Sbjct: 472 LHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSRIAWFLVTSANLSKAAWGALEKNGTQL 531
Query: 513 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAG 572
MIRSYELGVL LPS F S V + SGS + +
Sbjct: 532 MIRSYELGVLFLPSV------FGLDSFKVKQKFFSGSQDPT------------------- 566
Query: 573 ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 567 -----TAFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 604
>gi|354478467|ref|XP_003501436.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Cricetulus griseus]
gi|344235810|gb|EGV91913.1| Tyrosyl-DNA phosphodiesterase 1 [Cricetulus griseus]
Length = 609
Score = 265 bits (677), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 173/485 (35%), Positives = 252/485 (51%), Gaps = 73/485 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + IRD++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIRDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRNK 223
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP AN L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 224 PILIVHGDKREDKAHLHAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 283
Query: 276 NLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ +P DQ + + F+ DLI YL + P +
Sbjct: 284 NLIREDWHQKTQGIWLSPLYPRLDQGSHTSGESSTHFKADLISYLMSYNAPSLQEWIDT- 342
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
++ + S V L+ S PG GS WGH +LR +L+ T K
Sbjct: 343 ---------IQEHDLSETNVYLVGSTPGRFQGSHKDNWGHFRLRKLLR--THAPSVPKDE 391
Query: 391 --PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + + +TP PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKESLLALREDGRTPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKTFAR----------- 491
LEGY AG ++P + ++ ++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 452 LEGYPAGGSLPYGIQTAERQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSSDFNKLAWF 511
Query: 492 -YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+KAAWG L+KN +QLMIRSYELGVL LPSA F + V + S S
Sbjct: 512 LVTSANLSKAAWGTLEKNGTQLMIRSYELGVLFLPSA------FGLDAFKVKQKFFSSSC 565
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 609
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 566 EPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDT 601
Query: 610 YGQVW 614
+G +W
Sbjct: 602 HGNMW 606
>gi|311261437|ref|XP_003128731.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Sus scrofa]
Length = 606
Score = 265 bits (676), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 174/482 (36%), Positives = 246/482 (51%), Gaps = 68/482 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + IRD++ G ++ + NY D+DWL+ P +
Sbjct: 162 PFQFYLTRVSGIKPKYNSGALHIRDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 221
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 222 PVLLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 281
Query: 276 NLIHVDWNNKSQGLWM----QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ Q + F+ DLI YLS
Sbjct: 282 NLIHADWHQKTQGIWLSPLYQRIVPGSHRSGESATHFKADLISYLSAYN----------A 331
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K ++ + S V LI S PG G WGH +LR +L+E +S
Sbjct: 332 AALKEWIDTIQEHDLSETNVYLIGSTPGRFQGDQKDNWGHFRLRKLLKENGSSIPKAESW 391
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEG 446
P+V QFSS+ S+ + KW+ +E S+ + E +TP G +++P+VE+VR SLEG
Sbjct: 392 PVVGQFSSISSMGADESKWLCSEFKESLVTLGKESRTPGGAVPLHLIYPSVENVRTSLEG 451
Query: 447 YAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR------------YN 493
Y AG ++P + +K +L Y+ KW A+ +GRS AMPHIKT+ R
Sbjct: 452 YPAGGSLPYSIQTAEKQTWLHSYFHKWSAATSGRSNAMPHIKTYMRPSPDFSQIAWFLVT 511
Query: 494 GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 553
L+KAAWGAL+KN SQLMIRSYELGVL LP+A F S V + SGS E +
Sbjct: 512 SANLSKAAWGALEKNGSQLMIRSYELGVLFLPAA------FGLDSFRVKQKFFSGSQEPT 565
Query: 554 QIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQ 612
PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 566 ------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYMKAPDTHGN 601
Query: 613 VW 614
+W
Sbjct: 602 MW 603
>gi|348675737|gb|EGZ15555.1| hypothetical protein PHYSODRAFT_505563 [Phytophthora sojae]
Length = 1258
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 155/398 (38%), Positives = 213/398 (53%), Gaps = 65/398 (16%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIP 217
D F L ++ PA N S+ D+++GD +L+NYM D+ WL CP L +P
Sbjct: 27 DARECAFHLTCLKNAPAAPNVHTKSLGDLLEGDFSRCLLTNYMYDLPWLFAECPRLRDVP 86
Query: 218 HVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANL 277
VL++HGE D + + AN PPLPI++GTHH+K ++ +YP VR+ + TAN
Sbjct: 87 -VLLVHGERDRQGMMKECREYANVTPVAPPLPIAYGTHHTKMLVALYPEKVRVAIFTANF 145
Query: 278 IHVDWNNKSQGLWMQDFPLKDQNNLSEE------------CGFENDLIDYLSTLKWPEFS 325
+ DWN K+QG+W QDF LK + +E FE DL+ YLS+L
Sbjct: 146 LSNDWNTKTQGVWFQDFGLKVLDGSEDEEKDAVADNSTAINDFEADLVHYLSSLG----- 200
Query: 326 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEK 385
K+ +F+FS+A V L+ SVPG H G ++K+GH+++R
Sbjct: 201 ------AQVKLFCGELMRFDFSAARVALVPSVPGVHKGKDMEKYGHLRVR---------- 244
Query: 386 GFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSE-DKTPLGIGEPLIVWPTVEDVRCS 443
+LGSLDEKW+ E + SM G T + + I+WP+V+DVR S
Sbjct: 245 ------------NLGSLDEKWLFGEFAESMLPGKKNVSPTSMPVQALHIIWPSVDDVRNS 292
Query: 444 LEGYAAGNAIPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAMPHIKTFARYN--------- 493
LEG+ +G +IP P KN+ K FL KY KW R AMPHIK++AR+N
Sbjct: 293 LEGWNSGRSIPCPLKNM-KPFLHKYLRKWTPPEELHRQNAMPHIKSYARFNPSDEKAGEL 351
Query: 494 ------GQKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
L+KAAWGALQKN +QLMIRSYELGV+ LP
Sbjct: 352 DWVIVTSSNLSKAAWGALQKNKTQLMIRSYELGVMFLP 389
>gi|149737576|ref|XP_001496143.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Equus caballus]
Length = 611
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 173/485 (35%), Positives = 250/485 (51%), Gaps = 73/485 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 166 PFQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKT 225
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 226 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 285
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLIDYLSTLKWPEFSANLP 329
NL+H DW+ K+QG+W+ PL + ++ F+ DLI YL P +
Sbjct: 286 NLLHADWHQKTQGIWLS--PLYSRIVHGTHSSGESTTHFKADLISYLMAYNAPSLKEWI- 342
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKK 389
++ + S V LI S PG GS WGH +LR +L+E +
Sbjct: 343 ---------DVIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASSIPKAE 393
Query: 390 S-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
S P+V QFSS+GS+ + KW+ +E S+ + E KTP P +++P+VE+VR S
Sbjct: 394 SWPIVGQFSSIGSMGADESKWLCSEFKESLVTLGKESKTPGKSVSPFHLIYPSVENVRTS 453
Query: 444 LEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR----------- 491
LEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 454 LEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSQIAWF 513
Query: 492 -YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V + S +
Sbjct: 514 LVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSDNQ 567
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 609
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 568 EPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYIKAPDT 603
Query: 610 YGQVW 614
+G +W
Sbjct: 604 HGNMW 608
>gi|40796186|gb|AAH65162.1| Tdp1 protein [Mus musculus]
Length = 609
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 175/485 (36%), Positives = 251/485 (51%), Gaps = 73/485 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR----------- 491
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 492 -YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F + V + S S
Sbjct: 512 LVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDTFKVKQKFFSSSC 565
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 609
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 566 EPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDT 601
Query: 610 YGQVW 614
+G +W
Sbjct: 602 HGNMW 606
>gi|162417986|ref|NP_082630.2| tyrosyl-DNA phosphodiesterase 1 [Mus musculus]
gi|148686961|gb|EDL18908.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_b [Mus musculus]
Length = 609
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 175/485 (36%), Positives = 251/485 (51%), Gaps = 73/485 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR----------- 491
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 492 -YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F + V + S S
Sbjct: 512 LVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDTFKVKQKFFSSSC 565
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 609
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 566 EPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDT 601
Query: 610 YGQVW 614
+G +W
Sbjct: 602 HGNMW 606
>gi|37999670|sp|Q8BJ37.2|TYDP1_MOUSE RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1; AltName: Full=Protein expressed in
male leptotene and zygotene spermatocytes 501;
Short=MLZ-501
Length = 609
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 175/485 (36%), Positives = 251/485 (51%), Gaps = 73/485 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR----------- 491
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 492 -YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F + V + S S
Sbjct: 512 LVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDTFKVKQKFFSSSC 565
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 609
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 566 EPT------------------------ASFPVPYDLPPELYRSKDRPWIWNIPYVKAPDT 601
Query: 610 YGQVW 614
+G +W
Sbjct: 602 HGNMW 606
>gi|291406635|ref|XP_002719650.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Oryctolagus cuniculus]
Length = 609
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 181/535 (33%), Positives = 271/535 (50%), Gaps = 74/535 (13%)
Query: 110 LSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRV 169
+S + + G +G +S +++++ E +E ++ + + P F L RV
Sbjct: 116 VSSPRDGTAQTGGNHGPAASHRLKEEGEDKHETAGEGQDL---WDMLDRGNPFRFYLTRV 172
Query: 170 QGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES 226
G+ N+ + I+D++ G ++ + NY D+DWL+ P + +L++HG+
Sbjct: 173 SGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRRKPILLVHGDK 232
Query: 227 DGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN 284
H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+
Sbjct: 233 REAKAHLHAQAKPYENIALCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHEDWHQ 292
Query: 285 KSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPA-HGNFKINPS 339
K+QG+W+ +P L + S E F+ DLI YL P + HG+
Sbjct: 293 KTQGIWLSPLYPRLVHGTHRSGESTTHFKADLISYLMAYNAPSLQEWIDTIHGH------ 346
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSS 398
+ S V LI S PG G+ WGH +LR +L+E T +S P+V QFSS
Sbjct: 347 -----DLSETNVYLIGSTPGRFQGNQKDNWGHFRLRKLLKEHTSSVPQAESWPIVGQFSS 401
Query: 399 LGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI 453
+GSL + KW+ +E S+ + +T PL +++P+VE+VR SLEGY AG ++
Sbjct: 402 IGSLGADESKWLCSEFKESLLTLGQASRTAGKSTVPLHLIYPSVENVRTSLEGYPAGGSL 461
Query: 454 P-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------YNGQKLAKA 500
P S Q +++L Y+ KW A +GRS AMPHIKT+ R L+KA
Sbjct: 462 PYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKA 521
Query: 501 AWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKL 560
AWGAL+KN +QLMIRSYELGVL LP+ F S V + S E +
Sbjct: 522 AWGALEKNGTQLMIRSYELGVLFLPAT------FGLDSFNVKQKFFSSHQEPA------- 568
Query: 561 VTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 569 -----------------AAFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 606
>gi|50748586|ref|XP_421313.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Gallus gallus]
Length = 606
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 177/515 (34%), Positives = 266/515 (51%), Gaps = 72/515 (13%)
Query: 134 QQDEQDNENGKNSEEALCNFHVSRDKL----PSTFRLLRVQGLPAWANTSCVSIRDVIQ- 188
+ + NE ++ E L + D L P F L +V+G+ N+ + I+D++
Sbjct: 127 KDEHSKNEKAEDYNEVLGEPQDTWDLLSGGNPFGFFLTKVRGIEQSYNSGALHIKDILSP 186
Query: 189 --GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILH 244
G ++ + NY +D+ WL+ P + +L++HGE + E + + +P N
Sbjct: 187 LFGTLVSSAQFNYCIDVAWLVRQYPQEYRKKPLLIVHGEKRESKAELLAQARPFENISFC 246
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQN 300
+ L I+FGTHH+K MLL+Y G+R+++HT+NLI DW+ K+QG+W+ P +
Sbjct: 247 QAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIAEDWHQKTQGIWLSPLYPRLPQGSSD 306
Query: 301 NLSE-ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
+ E E F++DLI YL P + ++ + S V L+ S PG
Sbjct: 307 SAGESETNFKSDLISYLMAYSSPVLKEWI----------DLIREHDLSETRVYLLGSTPG 356
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
+ G +KWGH+KLR +L++ ++S P+V QFSS+GSL KW+ +E S+
Sbjct: 357 RYQGIDKEKWGHLKLRKLLKDHASSIPAQESWPVVGQFSSIGSLGADGSKWLCSEFQESL 416
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKW 472
+ S L P+ +V+PTV +VR SLEGY AG ++P + K +L Y+ KW
Sbjct: 417 VAAGSGVAALLKCDVPIHLVYPTVSNVRQSLEGYPAGGSLPYSIQTAQKQLWLHSYFHKW 476
Query: 473 KASHTGRSRAMPHIKTFAR--YNGQKLA----------KAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R ++ QK+A KAAWGAL+KN +QLMIRSYELG
Sbjct: 477 SAEVSGRSHAMPHIKTYMRPSHDFQKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 536
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LPSA G+ + SE K +T
Sbjct: 537 VLFLPSAFGLDKGYFHVKGNMLSEGKDSATS----------------------------F 568
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVP++LPP+RY S+D PW W+ YT D +G +W
Sbjct: 569 PVPFDLPPERYGSKDQPWIWNIPYTSAPDTHGNMW 603
>gi|344274118|ref|XP_003408865.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Loxodonta africana]
Length = 612
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 179/513 (34%), Positives = 262/513 (51%), Gaps = 70/513 (13%)
Query: 131 KMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ-- 188
+ R ++E+++E K S E + + P F L RV G+ N + IRD++
Sbjct: 138 RHRLKEEEEDEY-KTSGEGQDIWDMVNKGNPFQFYLTRVSGIKPKYNCGALHIRDILSPL 196
Query: 189 -GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHK 245
G ++ + NY D+DWL+ P + +L++HG+ H+ KP N L +
Sbjct: 197 FGTLVSSAQFNYCFDVDWLVKQYPPEFRNKPILLVHGDKREAKAHLHAEAKPYENISLCQ 256
Query: 246 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP--LKDQNNL 302
L I+FGTHH+K MLL+Y G+R+++HTANLIH DW+ K+QG+W+ +P + +
Sbjct: 257 AKLDIAFGTHHTKMMLLLYEEGLRVVIHTANLIHADWHQKTQGIWLSPLYPRIVHGTHGP 316
Query: 303 SEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH 361
E F+ DL+ YL P + ++ + S V LI S PG
Sbjct: 317 GESPTHFKADLVSYLMAYNAPPLKGWI----------DTIQEHDLSETNVYLIGSTPGRF 366
Query: 362 TGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSS 416
G WGH +LR +L+E T ++ P+V QFSS+GS+ + KW+ +E S+ +
Sbjct: 367 QGDQKDNWGHFRLRKLLREHTSPIPKAEAWPIVGQFSSIGSMGTDESKWLCSEFKESLLT 426
Query: 417 GFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKA 474
+ +T PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A
Sbjct: 427 LGKDGRTLGKSTAPLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSA 486
Query: 475 SHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVL 522
+GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELGVL
Sbjct: 487 ETSGRSSAMPHIKTYMRPSPDFSSIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVL 546
Query: 523 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 582
LPS F S V + SGS E + PV
Sbjct: 547 FLPSV------FGLDSFKVRQKFFSGSQEL------------------------MASFPV 576
Query: 583 PYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 577 PYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 609
>gi|302790465|ref|XP_002977000.1| hypothetical protein SELMODRAFT_416931 [Selaginella moellendorffii]
gi|300155478|gb|EFJ22110.1| hypothetical protein SELMODRAFT_416931 [Selaginella moellendorffii]
Length = 301
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/297 (47%), Positives = 191/297 (64%), Gaps = 35/297 (11%)
Query: 88 VSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSE 147
V I+ GDI++++PG FFK++ L S K + ++ L+S K ++Q E D + +
Sbjct: 24 VQISTGDIVKMLPGDRFFKFM-LCSSLKGKAVASHSDNVLASNKRKRQIEDDEAFARALQ 82
Query: 148 EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ----------GDIIVAILS 197
+ +LLRVQGL WAN CV I DVI+ ++ AILS
Sbjct: 83 Q----------------QLLRVQGLLDWANAGCVRICDVIKVIRALVFLRIRILLFAILS 126
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 257
NYMVDI+WLL ACP+L I V++IHGES+ + ++ KP+N +L KP L I++GT HS
Sbjct: 127 NYMVDIEWLLSACPLLRTILQVVMIHGESN--VSQLQSVKPSNRLLFKPRLWIAYGTPHS 184
Query: 258 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 317
LL+YP GV+++VHTANLI++DWNNK+QGLWMQDFP K + S+ FENDL+DYL+
Sbjct: 185 ---LLVYPTGVQVVVHTANLINIDWNNKNQGLWMQDFPFKSKTGASD---FENDLVDYLT 238
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 374
L+W + ++ HG KIN F+ F FS+AAVRL+ASVPGYH+G L KWGHMKL
Sbjct: 239 ALEWLGCTVDVQHHGKMKINVGHFRNFYFSNAAVRLVASVPGYHSGPQLNKWGHMKL 295
>gi|72255547|ref|NP_001026827.1| tyrosyl-DNA phosphodiesterase 1 [Rattus norvegicus]
gi|123781898|sp|Q4G056.1|TYDP1_RAT RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1
gi|71051114|gb|AAH98739.1| Tyrosyl-DNA phosphodiesterase 1 [Rattus norvegicus]
gi|149025341|gb|EDL81708.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Rattus norvegicus]
Length = 609
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 173/484 (35%), Positives = 248/484 (51%), Gaps = 71/484 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D++WL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVNWLIKQYPPEFRKK 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P Q N + F+ DL YL P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTSYLMAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 390
++ + S V LI S PG GS WGH +LR +LQ +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLRKLLQAHAPSAPRGEC 392
Query: 391 -PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GSL + KW+ +E S+ + E +TP PL +++P+VE+VR SL
Sbjct: 393 WPVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAVPLHLIYPSVENVRTSL 452
Query: 445 EGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR------------ 491
EGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 453 EGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWFL 512
Query: 492 YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 551
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F + V + S S+E
Sbjct: 513 VTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDTFKVKQKFFSSSSE 566
Query: 552 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 610
+ PVPY+LPP+ Y S+D PW W+ Y K D +
Sbjct: 567 P------------------------MASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTH 602
Query: 611 GQVW 614
G +W
Sbjct: 603 GNMW 606
>gi|224051603|ref|XP_002200587.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Taeniopygia guttata]
Length = 609
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 175/518 (33%), Positives = 265/518 (51%), Gaps = 76/518 (14%)
Query: 135 QDEQDNENGKNSE------EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
+D++ +EN K E EA + + P F L +V G+ N+ + I+D++
Sbjct: 127 KDDKLSENLKEEEYNVTPSEAQDTWDLVTGDNPFRFFLTKVSGIEQSYNSGALHIKDILS 186
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWIL 243
G +I + NY +D+ WL+ P + +L++HGE + E + + +P N
Sbjct: 187 PLFGTLISSAQFNYCIDVGWLVRQYPQEFRKKPLLIVHGEKRESKAELIAQARPYENISF 246
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 303
+ L I+FGTHH+K MLL+Y G+R+++HT+NLI DW+ K+QG+W+ + S
Sbjct: 247 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIAEDWHQKTQGIWLSPLYPRLSKGTS 306
Query: 304 EECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 358
G F++DLI YL+ P + ++ + S V L+ S P
Sbjct: 307 GSAGESATNFKSDLISYLAAYNSPALREWI----------DLIQEHDLSETRVYLLGSTP 356
Query: 359 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS---PLVYQFSSLGSL---DEKWM-AELS 411
G + G+ +KWGH++LR +L+E ++S PLV QFSS+GS+ KW+ +E
Sbjct: 357 GRYQGNDKEKWGHLRLRKLLKEHALPIPAQESWPLPLVGQFSSIGSMGADGSKWLCSEFQ 416
Query: 412 SSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYW 469
S+ + S T P+ +V+PTV +VR SLEGY AG ++P + K +L Y+
Sbjct: 417 ESLVAAGSSVTTFRKCDVPIHLVYPTVNNVRQSLEGYPAGGSLPYSIQTAQKQLWLHSYF 476
Query: 470 AKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSY 517
KW A TGR+ A+PHIKT+ R + L+KAAWGAL+KN SQLMIRSY
Sbjct: 477 HKWSADVTGRTHAIPHIKTYMRLSPDFQKIAWFLVTSANLSKAAWGALEKNGSQLMIRSY 536
Query: 518 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 577
ELGVL LPSA F + + +GS + +
Sbjct: 537 ELGVLFLPSA------FGIFRLDLRKKFFTGSEQPAT----------------------T 568
Query: 578 VYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
Y PVPY+LPP++Y S+D PW W+ YT D +G +W
Sbjct: 569 TYFPVPYDLPPEQYGSKDQPWIWNIPYTDAPDTHGNMW 606
>gi|426233768|ref|XP_004010886.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Ovis aries]
Length = 612
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 172/483 (35%), Positives = 247/483 (51%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ NT + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 167 PFQFYLTRVSGIKPKYNTGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVRQYPPEFRKK 226
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 227 PVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 286
Query: 276 NLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ + + F+ DLI YL+
Sbjct: 287 NLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATHFKADLISYLAAYN----------A 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKS 390
K ++ + S V LIAS PG G+ WGH +LR +L+E + G +
Sbjct: 337 APLKEWIDTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLRKLLKEHASPAPGAESW 396
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P++ QFSS+GS+ + KW+ +E S+ + E +T LG PL +++P+VE+VR SLE
Sbjct: 397 PVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAVPLHLIYPSVENVRTSLE 455
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 456 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYLRPSPDFSQIAWFLV 515
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+K +QLMIRSYELGVL LPSA F S V + SGS++
Sbjct: 516 TSANLSKAAWGALEKGGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGSSQ- 568
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
PVPY+LPP+ Y D PW W+ Y K D +G
Sbjct: 569 ----------------------EPTASFPVPYDLPPELYGDRDRPWIWNIPYVKAPDTHG 606
Query: 612 QVW 614
+W
Sbjct: 607 NMW 609
>gi|126282139|ref|XP_001366471.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Monodelphis domestica]
Length = 608
Score = 259 bits (662), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 178/499 (35%), Positives = 260/499 (52%), Gaps = 70/499 (14%)
Query: 146 SEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVD 202
S+E+ + + +K P F L +V G+ N + I+D++ G ++ + NY D
Sbjct: 147 SDESQEPWDLLEEKNPFRFYLTKVSGIMPKYNAGVLHIKDILSPLFGTLLSSAQFNYCFD 206
Query: 203 IDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKAM 260
IDWL+ P+ + +L++HG+ + ++ KP N L + L I+FGTHH+K M
Sbjct: 207 IDWLIRQYPLEFRKKPILLVHGDKREAKARLQEQAKPYENISLCQAKLDIAFGTHHTKMM 266
Query: 261 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSE-ECGFENDLIDY 315
LL+Y G+R+++HT+NLI DW+ K+QG+W+ P + E F++DLI Y
Sbjct: 267 LLLYEEGLRVVIHTSNLIQADWHQKTQGIWLSPLYPRLPYGTPSTSGESSTNFKSDLIRY 326
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 375
L T P + K ++ + S V LI S PG GS + WGH +LR
Sbjct: 327 LMTYNAP----------SLKEWADIIQEHDLSETRVYLIGSTPGRFQGSHKEDWGHFRLR 376
Query: 376 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 430
+L+E T ++S P+V QFSS+GSL + KW+ AE S+ + K+ P
Sbjct: 377 KLLKEHTSLVPEQQSWPIVGQFSSIGSLGADESKWLCAEFKESLVVLGNCGKSQGQQDVP 436
Query: 431 L-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKT 488
L +++PTVE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPTVENVRKSLEGYPAGGSLPYSLQTAEKQLWLHSYFHKWSAETSGRSHAMPHIKT 496
Query: 489 FARYN------------GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 536
+ R + L+KAAWGAL+KN +QLMIRSYELGVL LPS F
Sbjct: 497 YMRLSPDFAKMAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPST------FGM 550
Query: 537 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 596
+ V ++ S + E V PVPY+LPP Y S+D
Sbjct: 551 DTFKVKKKVFSENREP------------------------VTSFPVPYDLPPNIYDSKDR 586
Query: 597 PWSWDKRYTKK-DVYGQVW 614
PW W+ YTK D +G +W
Sbjct: 587 PWIWNIPYTKAPDTHGNMW 605
>gi|440911964|gb|ELR61579.1| Tyrosyl-DNA phosphodiesterase 1, partial [Bos grunniens mutus]
Length = 616
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 172/483 (35%), Positives = 247/483 (51%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ NT + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 171 PFQFYLTRVSGIKPKYNTGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVRQYPPEFRKK 230
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 231 PVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 290
Query: 276 NLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ + + F+ DLI YL+
Sbjct: 291 NLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATNFKADLISYLAAYN----------A 340
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K ++ + S V LIAS PG G+ WGH +LR +L+E +S
Sbjct: 341 APLKEWIDTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLRKLLKEHASPMPKAESW 400
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P++ QFSS+GS+ + KW+ +E S+ + E +T LG PL +++P+VE+VR SLE
Sbjct: 401 PVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAAPLHLIYPSVENVRTSLE 459
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 460 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYLRPSPDFSQIAWFLV 519
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+K+ +QLMIRSYELGVL LPSA F S V + SGS++
Sbjct: 520 TSANLSKAAWGALEKSGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGSSQ- 572
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
PVPY+LPP+ Y D PW W+ Y K D +G
Sbjct: 573 ----------------------EPTASFPVPYDLPPELYGDRDRPWIWNIPYVKAPDTHG 610
Query: 612 QVW 614
+W
Sbjct: 611 NMW 613
>gi|300798259|ref|NP_001180084.1| tyrosyl-DNA phosphodiesterase 1 [Bos taurus]
gi|296482871|tpg|DAA24986.1| TPA: tyrosyl-DNA phosphodiesterase 1 [Bos taurus]
Length = 612
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 172/483 (35%), Positives = 247/483 (51%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ NT + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 167 PFQFYLTRVSGIKPKYNTGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIRQYPPEFRKK 226
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 227 PVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 286
Query: 276 NLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ + + F+ DLI YL+
Sbjct: 287 NLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATNFKADLISYLAAYN----------A 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K ++ + S V LIAS PG G+ WGH +LR +L+E +S
Sbjct: 337 APLKEWIDTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLRKLLKEHASPMPKAESW 396
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P++ QFSS+GS+ + KW+ +E S+ + E +T LG PL +++P+VE+VR SLE
Sbjct: 397 PVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAAPLHLIYPSVENVRTSLE 455
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 456 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYLRPSPDFSQIAWFLV 515
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+K+ +QLMIRSYELGVL LPSA F S V + SGS++
Sbjct: 516 TSANLSKAAWGALEKSGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGSSQ- 568
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
PVPY+LPP+ Y D PW W+ Y K D +G
Sbjct: 569 ----------------------EPTASFPVPYDLPPEVYGDRDRPWIWNIPYVKAPDTHG 606
Query: 612 QVW 614
+W
Sbjct: 607 NMW 609
>gi|125841556|ref|XP_700174.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Danio rerio]
Length = 615
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 172/492 (34%), Positives = 251/492 (51%), Gaps = 93/492 (18%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +V G+P NT + I++++ G + ++ NY DI W++ P + V+
Sbjct: 173 FYLNKVTGIPKKYNTGALHIKEILSPMFGTLKESVQFNYCFDIPWMVEQYPPEFRNKPVV 232
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLP----------ISFGTHHSKAMLLIYPRGVRI 270
++HGE KR A I P P I+FGTHH+K MLL Y G R+
Sbjct: 233 LVHGE--------KRESKACLIEQAKPYPHISFCQAKLDIAFGTHHTKMMLLWYEEGFRV 284
Query: 271 IVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE----CGFENDLIDYLSTLKWPEFS 325
I+ T+NLI DW K+QG+WM +P Q + GF+ DL++YL + PE +
Sbjct: 285 IILTSNLIRADWYQKTQGMWMSPLYPRLPQGSPGTAGESLTGFKRDLLEYLEAYRAPELA 344
Query: 326 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE- 384
+ K+ + S V LI S PG + G +++KWGH++LR +L E T
Sbjct: 345 NWI----------ERIKQHDLSETRVYLIGSTPGRYQGPAMEKWGHLRLRKLLSEHTQPM 394
Query: 385 KGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP----LIVWPT 436
+ ++ ++ QFSS+GS+ KW+ AE ++++ K+ + P L+++P+
Sbjct: 395 QNEERWHVLGQFSSIGSMGLDKTKWLAAEFQRTLTTLGKAGKS---LASPETQMLLIYPS 451
Query: 437 VEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN-- 493
VE+VR SLEGY AG ++P + K +L Y+ W A TGRS AMPHIKT+ R +
Sbjct: 452 VENVRTSLEGYPAGGSLPYSIQTAQKQLWLHSYFHGWHADVTGRSNAMPHIKTYMRISPD 511
Query: 494 ----------GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 543
L+KAAWGAL+KNN+Q+M+RSYELGVL LPSA F N+ P
Sbjct: 512 FTQLAWFLVTSANLSKAAWGALEKNNTQIMVRSYELGVLYLPSAFNMST-FPVEKNVFP- 569
Query: 544 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 603
A S + PVP++LPPQRYSS+D PW W+
Sbjct: 570 -----------------------------ACSSSIGFPVPFDLPPQRYSSKDRPWIWNIP 600
Query: 604 YTKK-DVYGQVW 614
YT+ D +G VW
Sbjct: 601 YTQAPDTHGNVW 612
>gi|348500374|ref|XP_003437748.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Oreochromis
niloticus]
Length = 616
Score = 258 bits (660), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 170/489 (34%), Positives = 251/489 (51%), Gaps = 90/489 (18%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +V GL N+ + IRD++ G + ++ NY DI W++ P + VL
Sbjct: 177 FYLNKVTGLEKKYNSGALHIRDILSPLFGTLKESVQFNYCFDIAWMVKQYPSEFRDRPVL 236
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLP----------ISFGTHHSKAMLLIYPRGVRI 270
++HG+ KR A I P P I+FGTHH+K MLL Y G R+
Sbjct: 237 IVHGD--------KREAKARLIQQAQPFPHVRFCQAKLDIAFGTHHTKMMLLWYEEGFRV 288
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLIDYLSTLKWPEFS 325
I+ T+NLI DW K+QG+WM + S G F+ DL++YL++ + PE
Sbjct: 289 IILTSNLIRADWYQKTQGMWMSPLYPRLPKESSASAGESPTFFKRDLLEYLASYRAPELE 348
Query: 326 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE- 384
+ K+ + S V L+ S PG + GS +++WGH++LR +L E T
Sbjct: 349 EWI----------QRIKEHDLSETRVYLVGSTPGRYVGSDMERWGHLRLRKLLYEHTNPI 398
Query: 385 KGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEPL-IVWPTVED 439
G ++ P++ QFSS+GS+ KW+A E +++ + K+ L P+ +++P+VED
Sbjct: 399 PGEERWPVIGQFSSIGSMGLDKSKWLAGEFQRTLT---TLGKSSLRPDPPMHLLYPSVED 455
Query: 440 VRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR------- 491
VR SLEGY AG ++P + K +L Y+ +WKA TGRS AMPHIKT+ R
Sbjct: 456 VRMSLEGYPAGGSLPYSIQTAQKQLWLHSYFHRWKAEATGRSHAMPHIKTYMRASPDFSQ 515
Query: 492 -----YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 546
L+KAAWGAL+KNN+Q+M+RSYELGVL LPSA FS N P
Sbjct: 516 LAWFLVTSANLSKAAWGALEKNNTQMMVRSYELGVLYLPSAFGMKT-FSVDKNPFP---- 570
Query: 547 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 606
V+ ++ G PVP++LPP Y+++D PW W+ Y++
Sbjct: 571 --------------VSASFSG------------FPVPFDLPPTSYTTKDQPWIWNIPYSQ 604
Query: 607 K-DVYGQVW 614
D +G +W
Sbjct: 605 APDTHGNIW 613
>gi|395827684|ref|XP_003787027.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Otolemur garnettii]
Length = 608
Score = 258 bits (659), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 191/573 (33%), Positives = 286/573 (49%), Gaps = 78/573 (13%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATN--GELSSKK 131
+R++ S E++ S +D ++ P K + V DG G S+
Sbjct: 75 KRQRSDSQEYLGWCLSSSDDELQPETPEKQAKKVIVKEEEDISVPQDGTAQRTGNHSTPA 134
Query: 132 MRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ--- 188
+ E+++E + S E + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEY-ETSGEGQDIWDMLDKGNPFQFYLTRVSGIKPKYNSGALHIKDILSPLF 193
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHK 245
G ++ + NY D+DWL+ P + +L++HG E+ L H + N L +
Sbjct: 194 GTLVSSAQFNYCFDVDWLIKQYPPEFRKKPILLVHGDKREAKADL-HAQAKPYGNISLCQ 252
Query: 246 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLS 303
L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + + S
Sbjct: 253 AKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHEDWHQKTQGIWLSPLYPRIVHGTHKS 312
Query: 304 EE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH 361
E F+ DLI YL ++A+ K + + S V LI+S PG
Sbjct: 313 GESVTHFKADLISYLMA-----YNAS-----PLKEWIDLIHEHDLSETNVYLISSTPGRF 362
Query: 362 TGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSS 416
GS WGH +LR +L+E +S P+V QFSS+GSL + KW+ +E S+ +
Sbjct: 363 QGSQKDNWGHFRLRKLLKEHASSIPAAESWPIVGQFSSIGSLGADESKWLSSEFKESLLT 422
Query: 417 GFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKA 474
E K P PL +++P+VE+VR SLEGY AG ++P + +K ++L Y+ KW A
Sbjct: 423 LGKESKAPGKSTVPLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQNWLHSYFHKWSA 482
Query: 475 SHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVL 522
+GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELGVL
Sbjct: 483 ETSGRSHAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVL 542
Query: 523 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 582
LPSA F S V + S + E + PV
Sbjct: 543 FLPSA------FGLDSFKVKQKFFSANKEP------------------------MATFPV 572
Query: 583 PYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PY+LPP+ Y ++D PW W+ Y K D +G +W
Sbjct: 573 PYDLPPELYGNKDRPWIWNIPYVKAPDTHGNMW 605
>gi|327259270|ref|XP_003214461.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Anolis
carolinensis]
Length = 603
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 173/510 (33%), Positives = 267/510 (52%), Gaps = 71/510 (13%)
Query: 138 QDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVA 194
Q E+ + SE+ + + + P F L +V+G+ + N + I+D++ G ++ +
Sbjct: 134 QSQESSQPSEKVQDTWDLLNGENPFRFFLTKVKGIDSKYNLGALHIKDILSPLFGTLVSS 193
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISF 252
NY +D+ WL+ P + +L++HGE + ++ N L + L I+F
Sbjct: 194 AQFNYCIDLGWLVKQYPKEFREKPLLIVHGEKRESKAELQEEASLYDNVRLCQAKLDIAF 253
Query: 253 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEECG 307
GTHH+K MLL Y G+R+++HT+NLI DW K+QG+W+ P ++
Sbjct: 254 GTHHTKMMLLHYEEGLRVVIHTSNLIADDWYQKTQGIWLSPLYPRLPPGASASDGESHTM 313
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 367
F++DLI YL + K PA G + K+ +FS V L+ S PG + S +
Sbjct: 314 FKSDLISYLMSYK-------SPALGKWA---ETIKQHDFSETRVYLLGSTPGRYQNSDKE 363
Query: 368 KWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDK 422
KWGH++L+ +L++ + + S P++ QFSS+GS+ KW+ +E S++S ++ K
Sbjct: 364 KWGHLRLKKLLKDHVMQVSDQDSWPVIGQFSSIGSMGADQSKWLCSEFRDSLTSLGNDTK 423
Query: 423 TPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRS 480
P+ +V+PTVE+VR SLEGY AG ++P + K +L Y+ KW A +GRS
Sbjct: 424 ALTNRDIPIHLVYPTVENVRQSLEGYPAGGSLPYSIETAKKQLWLHAYFHKWSAETSGRS 483
Query: 481 RAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAK 528
RAMPHIKT+ R L+KAAWGA +K +QLMIRSYELGVL LPS
Sbjct: 484 RAMPHIKTYMRASPDFQKIAWFLVTSANLSKAAWGAFEKKGTQLMIRSYELGVLFLPSE- 542
Query: 529 RHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 588
F S Q++++ S+ +SS PVPY+LPP
Sbjct: 543 -----FGLNSGYF------------QVKESMF--------SNEPSSS----FPVPYDLPP 573
Query: 589 QRYSSEDVPWSWDKRYTKK-DVYGQVW-PR 616
++Y +D PW W+ YT+ D YG +W PR
Sbjct: 574 KKYEGKDRPWIWNIPYTRAPDTYGNMWVPR 603
>gi|346467109|gb|AEO33399.1| hypothetical protein [Amblyomma maculatum]
Length = 423
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 166/454 (36%), Positives = 239/454 (52%), Gaps = 74/454 (16%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--ANWILHKP 246
G ++ + NY DI WL+ P + +L++HGE + ++ + N +
Sbjct: 7 GQLVRSAQFNYCFDIPWLVEQYPPEFRSFPLLIVHGEQREAKKELEASAADFKNLSFVQA 66
Query: 247 PLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQNNLS 303
L I +GTHH+K MLL+Y G+RI++HTANL+ DW K+Q +W+ + D
Sbjct: 67 KLEIVYGTHHTKMMLLLYKDGLRIVIHTANLVASDWAQKTQAIWVSPVCTRLASDSKGGD 126
Query: 304 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH 361
E GF+ DL+ YLS A+G+ +IN + + +FS+ V L+ SVPG H
Sbjct: 127 SETGFKADLLTYLS------------AYGDPRINEWCHYIRSHDFSAVKVFLVGSVPGRH 174
Query: 362 TGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLVYQFSSLGSLD---EKWM-AELSSSMS 415
TG +GH++LRT+L + K S PLV QFSS+GSL + W+ E SS+S
Sbjct: 175 TGPRKSSFGHLRLRTLLNQHGPSKDLVSSHWPLVAQFSSIGSLGTSAQAWLTGEFLSSLS 234
Query: 416 SGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWK 473
+ S TP + PL +V+P+V+DVRCSLEGY AG +IP K +L Y+ +WK
Sbjct: 235 ATKSSGSTPQSV--PLKLVFPSVDDVRCSLEGYPAGASIPYSIVTASKQRWLDSYFYRWK 292
Query: 474 ASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYELGV 521
+ GR+ A PHIKT+ R + L+KAAWGA +KN SQLMIRSYELGV
Sbjct: 293 SERLGRTAASPHIKTYTRLSPSGKQAAWFLVTSANLSKAAWGAFEKNGSQLMIRSYELGV 352
Query: 522 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 581
L+ P+ S Q T + SD SS +YLP
Sbjct: 353 LLFPA--------------------------SFGQATTFIV------SDESCSSSALYLP 380
Query: 582 VPYELPPQRYSSEDVPWSWDKRYTK-KDVYGQVW 614
+PY+LP Y+S+D PW+WD ++ + D +G +W
Sbjct: 381 LPYDLPLVPYTSDDEPWTWDSQHRELPDRFGNMW 414
>gi|432115827|gb|ELK36975.1| Tyrosyl-DNA phosphodiesterase 1 [Myotis davidii]
Length = 610
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 176/489 (35%), Positives = 250/489 (51%), Gaps = 81/489 (16%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 165 PFQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVRQYPPEFRKK 224
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 225 PILLVHGDKREAKAHLHAEAKPYPNVSLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 284
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLIDYLSTLKWP---EFSA 326
NLI DW+ K+QG+W+ PL + + F+ DLI YL P E+
Sbjct: 285 NLIREDWHQKTQGMWVS--PLYPRMAHGTPGSGESTTHFKADLISYLMAYNAPPLQEWVD 342
Query: 327 NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFE 384
+ AH + S V LI S PG G+ WGH +LR VL+E +
Sbjct: 343 VIHAH-------------DLSETNVYLIGSTPGRFQGNQKDNWGHFRLRKVLKEHASSIP 389
Query: 385 KGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVED 439
K + P++ QFSS+GS+ + KW+ AE ++ + E + P PL +++P+VE+
Sbjct: 390 KA-EAWPVIGQFSSIGSMGADESKWLCAEFKETLVTLGKESRAPGRSPAPLHLIYPSVEN 448
Query: 440 VRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------- 491
VR SLEGY AG ++P S Q + +L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 449 VRTSLEGYPAGGSLPYSIQTAEKQSWLHAYFHKWSAETSGRSNAMPHIKTYMRPSPDFSQ 508
Query: 492 -----YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 546
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V +
Sbjct: 509 IAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFRVKPKFF 562
Query: 547 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 606
SGS E + PVPY+LPP+ Y S+D PW W+ Y K
Sbjct: 563 SGSQEPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYVK 598
Query: 607 K-DVYGQVW 614
D +G +W
Sbjct: 599 APDTHGNMW 607
>gi|410911974|ref|XP_003969465.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Takifugu rubripes]
Length = 614
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 167/482 (34%), Positives = 255/482 (52%), Gaps = 75/482 (15%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +V GL NT + IRD++ G + ++ NY DI W++ P + VL
Sbjct: 174 FYLNKVTGLDRKYNTGALHIRDILSPLFGTLKASVQFNYCFDIAWMVKQYPEEFRDRPVL 233
Query: 221 VIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANL 277
++HG E+ L + P + + L I+FGTHH+K MLL Y G R+IV T+NL
Sbjct: 234 IVHGDKREAKARLVQQAQGFP-HIQFCQAKLDIAFGTHHTKMMLLWYEEGFRVIVLTSNL 292
Query: 278 IHVDWNNKSQGLWMQD-FP----LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG 332
I DW K+QG+WM FP ++ F+ DL++YL++ + PE +
Sbjct: 293 IRADWYQKTQGMWMSPLFPRLPEGSSASSGESPTYFKRDLLEYLASYRAPELEEWI---- 348
Query: 333 NFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE-KGFKKSP 391
K+ + S +V L+ S PG + GS +++WGH++LR +L E T G ++ P
Sbjct: 349 ------QRIKEHDLSETSVYLVGSTPGRYVGSDMERWGHLRLRKLLSEHTEAFPGEERWP 402
Query: 392 LVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEG 446
++ QFSS+GS+ KW+A E +M++ K+ + P+ +++P++EDVR SLEG
Sbjct: 403 VIGQFSSIGSMGLDKTKWLAGEFQRTMTT---MGKSTVRSDPPMQLLYPSIEDVRTSLEG 459
Query: 447 YAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN------------ 493
Y AG ++P + K +L ++ +WKA TGRS AMPHIKT+ R +
Sbjct: 460 YPAGGSLPYSIQTAQKQLWLHSFFHRWKADSTGRSHAMPHIKTYMRVSPNFTELAWFFMT 519
Query: 494 GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 553
L+KAAWGAL+KNN+Q+MIRSYELGVL +PSA + +T
Sbjct: 520 SANLSKAAWGALEKNNTQMMIRSYELGVLFVPSAFK--------------------MKTF 559
Query: 554 QIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQ 612
+ K+ + +SS PVP++LPP YS +D PW W+ Y++ D +G
Sbjct: 560 PVNKSPFLV----------SSSSFSGFPVPFDLPPTAYSPKDQPWIWNIPYSQAPDTHGN 609
Query: 613 VW 614
+W
Sbjct: 610 IW 611
>gi|113931582|ref|NP_001039242.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus (Silurana) tropicalis]
gi|89273341|emb|CAJ81457.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus (Silurana) tropicalis]
Length = 597
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 170/518 (32%), Positives = 267/518 (51%), Gaps = 70/518 (13%)
Query: 127 LSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDV 186
+ SKK+++ E + K ++ + + + P F L +V G+ N+ + I+D+
Sbjct: 117 VQSKKIQENIEVKQKKCKTPSDSQDTWDLLQAGEPFRFYLTKVMGIKPKYNSGALHIKDI 176
Query: 187 IQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK-PANWI 242
+ G ++ + NY DI WL+ P + +L++HGE + + + P I
Sbjct: 177 LSPLFGTLVSSAQFNYCFDIKWLVKQYPEEFRDKPLLIVHGEKRESKAKLHEDAHPYEHI 236
Query: 243 -LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
L + L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW K+QG+W+ +
Sbjct: 237 RLCQAKLDIAFGTHHTKMMLLLYTEGLRVVIHTSNLIHEDWYQKTQGIWLSPLYPRLPEG 296
Query: 302 LSEECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
S G F +DL+ YL++ P + K+ + S V LI S
Sbjct: 297 ASVSAGESSTNFRSDLVAYLASYNSPSLREWM----------DIIKQHDLSETRVYLIGS 346
Query: 357 VPGYHTGSSLKKWGHMKLRTVLQECTFEK-GFKKSPLVYQFSSLGSL---DEKWM-AELS 411
PG G+ KWGH +LR +L+E T G + P++ QFSS+GS+ KW+ +E +
Sbjct: 347 TPGRFQGNDKDKWGHFRLRKLLRENTSAAPGQETWPVIGQFSSIGSMGVDKTKWLCSEFT 406
Query: 412 SSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYW 469
S+++ K+ PL +++P+V++VR SLEGY AG ++P S Q + +L Y+
Sbjct: 407 ESLTTLGKSIKSLQKTEIPLHLIYPSVDNVRTSLEGYPAGGSLPYSIQTAQKQPWLHSYF 466
Query: 470 AKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSY 517
KWKA + RS+AMPHIKT+ R + L+KAAWG+L+KN +QL IRSY
Sbjct: 467 HKWKAETSRRSQAMPHIKTYMRLSPDSQHLAWFLVTSANLSKAAWGSLEKNGAQLFIRSY 526
Query: 518 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 577
ELGVL LPSA ET+ V L + S++ +++
Sbjct: 527 ELGVLFLPSA----------------------FETNTFN----VKLNIYASNEPSSNA-- 558
Query: 578 VYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y ++D PW W+ Y D +G +W
Sbjct: 559 --FPVPYDLPPEHYGAKDRPWVWNIPYVNAPDTHGNIW 594
>gi|340383155|ref|XP_003390083.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Amphimedon
queenslandica]
Length = 535
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 168/485 (34%), Positives = 251/485 (51%), Gaps = 80/485 (16%)
Query: 161 PSTFRLLRVQGLPAWANTS--CVSIRDVI---QGDIIVAILSNYMVDIDWLLPACPVLAK 215
P+ F L +V+G+P N V I+D++ G++I + NYM DI WLL P +
Sbjct: 97 PTLFYLTKVRGIPDRYNDPRYTVGIKDILSSTHGNLIGSAQFNYMFDIKWLLDQYPEDKR 156
Query: 216 IPHVLVIHGESDGTLEHMKRNK--PANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVH 273
+L++HG E ++ + N L + L + FGTHHSK MLL Y G+R+++H
Sbjct: 157 SLPLLIVHGFQGREFESLRMDSLPHPNIKLLQAKLDL-FGTHHSKMMLLSYNEGLRVVIH 215
Query: 274 TANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN 333
TANLI DW+ K+QG+WM P+ ++ + C F++DL+ YL T ++
Sbjct: 216 TANLIQKDWDQKTQGVWMS--PVFPKSTVKRSCKFQDDLLSYLDT-----YTGAAMNEWK 268
Query: 334 FKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSP 391
K+ K + SS +IASVPG HTG ++ KWGHMKLR VL+E + K P
Sbjct: 269 EKV-----KSHDMSSCRAHIIASVPGPHTGLNIFKWGHMKLRKVLEEHGPSASTTTKDWP 323
Query: 392 LVYQFSSLGS--------LDEKWMAELSSSMSSGFSED-KTPLGIGEPLIVWPTVEDVRC 442
++ QFSS+GS L +W+ LSS +G + ++ + G+ +V+PTVE+++
Sbjct: 324 VIGQFSSIGSLGPAPSSWLTSEWLTSLSSCWKTGTVKTLRSEIPKGKLQLVFPTVENIKN 383
Query: 443 SLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN-------- 493
SLEGY AG ++P + Q + + +L ++ +W A GRSRA PHIKT+ R +
Sbjct: 384 SLEGYMAGGSVPYASQTALKQPYLTTFFNQWVAEGYGRSRASPHIKTYMRVSPTCDRLAW 443
Query: 494 ----GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 549
L+KAAWG +K +QL IRSYE+GVL+LP + +SG+
Sbjct: 444 FLLTSANLSKAAWGGFEKKGTQLRIRSYEIGVLLLP------------------DDESGT 485
Query: 550 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 609
+ +SS LP+P +LP Y + D PW W+ RY D
Sbjct: 486 LMVGE------------------SSSNNSMLPIPIDLPLTDYKTTDRPWIWNDRYLAPDC 527
Query: 610 YGQVW 614
G VW
Sbjct: 528 KGNVW 532
>gi|427789081|gb|JAA59992.1| Putative tyrosyl-dna phosphodiesterase [Rhipicephalus pulchellus]
Length = 614
Score = 252 bits (644), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 169/481 (35%), Positives = 254/481 (52%), Gaps = 83/481 (17%)
Query: 169 VQGLPAWANTSCV--SIRDVIQGDIIVAILS---NYMVDIDWLLPACPVLAKIPHVLVIH 223
V G+PA NT+ + S+RD++ D+ + S NY DI WL+ P + +LV+H
Sbjct: 173 VTGIPARYNTAQIARSVRDLLSPDMGRLVRSAQFNYCFDIPWLVEQYPTEFRNLPLLVVH 232
Query: 224 GESDGTLEHMKRNKPANWILH----KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 279
GE + ++ + A+ H + L I +GTHH+K MLL+Y G+R+++HTAN+I
Sbjct: 233 GEQREAKKALETS--ASGFQHVSFAQAKLEIVYGTHHTKMMLLLYKEGLRVVIHTANMIP 290
Query: 280 VDWNNKSQGLWMQDFPLK---DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
DW K+Q +W+ + N E GF DL++YLS A+G+ I
Sbjct: 291 TDWAQKTQAIWVGPVCPRLAPGSNGGDSETGFRADLLNYLS------------AYGDTHI 338
Query: 337 NP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PL 392
N + + +FS+ V L+ SVPG HTG +GH++LR +L + K + PL
Sbjct: 339 NEWCHYIRTHDFSAVKVFLVGSVPGRHTGPRKSCFGHLRLRNLLSQHGPSKDLVSNHWPL 398
Query: 393 VYQFSSLGSLD---EKW-MAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGY 447
V QFSS+GSL E W + E SS+S+ T + PL +V+P+V+DVRCSLEGY
Sbjct: 399 VAQFSSIGSLGASAESWLLGEFLSSLSTTKGSVVTARSV--PLKLVFPSVDDVRCSLEGY 456
Query: 448 AAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN------------G 494
AG +IP DK +L ++ +WK+ GR+ A PHIKT+ R +
Sbjct: 457 PAGASIPYSIVTADKQRWLDSFFHRWKSERLGRTAASPHIKTYTRLSPSSKQIAWLLVTS 516
Query: 495 QKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 554
L+KAAWGAL+KN SQLMIRSYELG+L+ P+ F + V SE +G++
Sbjct: 517 ANLSKAAWGALEKNGSQLMIRSYELGILLFPA------NFGQATTFVVSEGANGNS---- 566
Query: 555 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDVYGQV 613
++LP+PY++P Y+ +D PW+WD ++ + D +G +
Sbjct: 567 ----------------------ALFLPLPYDVPLVPYTKDDEPWTWDSQHRELPDRFGNM 604
Query: 614 W 614
W
Sbjct: 605 W 605
>gi|395503746|ref|XP_003756224.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Sarcophilus harrisii]
Length = 612
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 171/507 (33%), Positives = 262/507 (51%), Gaps = 70/507 (13%)
Query: 138 QDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVA 194
+ E+ +EA ++++ +K F L +V G+ N+ + I+D++ G ++ +
Sbjct: 143 KTEEDDVTFDEAQESWNLLDEKNLFRFYLTKVSGILPKYNSGALHIKDILSPLFGTLLSS 202
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISF 252
NY ++DWL+ P+ + +L++HG+ + ++ KP N L + L I+F
Sbjct: 203 AQFNYCFEVDWLVRQYPLEFRKKPILLVHGDKREAKARLQEKAKPYENISLCQAKLDIAF 262
Query: 253 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSE-ECG 307
GTHH+K MLL+Y G+R+++HT+NLI DW+ K+QG+W+ P + E
Sbjct: 263 GTHHTKMMLLLYEEGLRVVIHTSNLIQADWHQKTQGIWLSPLYPRLPYGTPSTHGESSTN 322
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 367
F++DLI YL P + +K + S V LI S PG G ++
Sbjct: 323 FKSDLISYLMAYNAPPLKEWI----------DIVQKHDLSETRVYLIGSTPGRFQGKHIE 372
Query: 368 KWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDK 422
WGH +LR +L+E T ++S P+V QFSS+GSL + KW+ +E S+ + K
Sbjct: 373 DWGHFRLRKLLKEHTSLLPEQQSWPIVGQFSSIGSLGADESKWLCSEFKDSLVILGNHGK 432
Query: 423 TPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRS 480
PL +++PTVE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS
Sbjct: 433 NQGQHNVPLHLIYPTVENVRNSLEGYPAGGSLPYSLQTAEKQVWLHSYFHKWSAETSGRS 492
Query: 481 RAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAK 528
AMPHIKT+ R + L+KAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 493 NAMPHIKTYMRLSPDFAKMAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA- 551
Query: 529 RHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 588
F + + ++ S E + PVPY+LPP
Sbjct: 552 -----FGMDTFKIKRKVFSEKQEPA------------------------TSFPVPYDLPP 582
Query: 589 QRYSSEDVPWSWDKRYTKK-DVYGQVW 614
+ Y+S+D PW W+ Y K D +G +W
Sbjct: 583 EIYNSKDRPWIWNIPYVKAPDTHGNMW 609
>gi|148237298|ref|NP_001087094.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus laevis]
gi|49903395|gb|AAH76790.1| Tdp1-prov protein [Xenopus laevis]
Length = 597
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 165/484 (34%), Positives = 247/484 (51%), Gaps = 70/484 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L +V G+ N+ + I+D++ G ++ + NY DI+WL+ P +
Sbjct: 151 PFRFYLTKVTGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDIEWLVKQYPEEFRNK 210
Query: 218 HVLVIHGESDGTLEHMKRNK-PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HGE + + + P I L + L I++GTHH+K MLL+Y G+R+++HT+
Sbjct: 211 PLLIVHGEKRESKTKLHEDAHPYEHIRLCQAKLDIAYGTHHTKMMLLLYTEGLRVVIHTS 270
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLIDYLSTLKWPEFSANLPA 330
NLI DW K+QG+W+ + S G F +DLI YL++ P +
Sbjct: 271 NLIREDWYQKTQGIWLSPLYPRLPEGASVSAGESSTNFRSDLIAYLASYNSPSLREWM-- 328
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 390
K+ + S V LI S PG G KWGH +LR +L+E T K+
Sbjct: 329 --------DIIKQHDLSETRVYLIGSTPGRFQGKDKDKWGHFRLRKLLRENTSAGPDKEM 380
Query: 391 -PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P++ QFSS+GS+ KW+ +E + S+ + K+ PL +++P+V++VR SL
Sbjct: 381 WPVIGQFSSIGSMGVDKTKWLCSEFTESLKTLGKSIKSLQKSEIPLRLIYPSVDNVRTSL 440
Query: 445 EGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN---------- 493
EGY AG ++P S Q + +L Y+ KWKA +GRS+A+PHIKT+ R++
Sbjct: 441 EGYPAGGSLPYSIQTAQKQPWLHSYFHKWKAETSGRSQAIPHIKTYMRFSPDFQNLAWFL 500
Query: 494 --GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 551
L+KAAWG+L+KN +QL IRSYELGVL LPSA F+ NI SG+
Sbjct: 501 VTSANLSKAAWGSLEKNGAQLFIRSYELGVLFLPSAFDTNT-FNVKVNIYSHNEPSGNA- 558
Query: 552 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 610
PVPY+LPP+ Y S+D PW W+ Y D +
Sbjct: 559 ----------------------------FPVPYDLPPEHYGSKDRPWVWNIPYVNAPDTH 590
Query: 611 GQVW 614
G +W
Sbjct: 591 GNIW 594
>gi|452821653|gb|EME28681.1| tyrosyl-DNA phosphodiesterase 1 [Galdieria sulphuraria]
Length = 452
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 157/457 (34%), Positives = 235/457 (51%), Gaps = 60/457 (13%)
Query: 182 SIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANW 241
S+ ++ Q +L+NYM D+ WL P+L + +L++HG+ + + P ++
Sbjct: 27 SLDEIFQPGFHSVLLTNYMFDLSWLFQRVPILLTVERLLIVHGDE----QVYQPFSPYHF 82
Query: 242 I-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
I HKP LP +GTHH+K ++L YP VR ++ TAN+I DW K+QG++++DFP K
Sbjct: 83 ITFHKPRLPFPYGTHHTKLIILFYPTKVRFVLTTANMIQSDWEYKTQGMFLKDFPQKTGE 142
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
+ C F + DYLS L P + S +++FS A V LI SVPGY
Sbjct: 143 --LKSCPFLETMDDYLSALGEP-----------LRYYRSLLCQYDFSKAGVVLIPSVPGY 189
Query: 361 HTGSSLKKWGHMKLRT-VLQECTF--EKGFKKSP------LVYQFSSLGSLDEKWM-AEL 410
H G +L K+GH L + + Q C E+ ++ L+ Q SS+GS+ EKW+ EL
Sbjct: 190 HGGRNLDKYGHRSLHSNISQYCCISDEQRIRRKTTHSTIRLLLQCSSMGSISEKWLKQEL 249
Query: 411 SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWA 470
SM S + + E ++WP+V+ VR S++GYA+G A P +KN + F +
Sbjct: 250 FHSMVSSCWKQEDWQYCFEWDLIWPSVQQVRNSIQGYASGAAFPWTKKNY-RSFQSSHLC 308
Query: 471 KWKASHTGRSRAMPHIKTFARY-----------NGQKLAKAAWGALQKNNSQLMIRSYEL 519
W A R+ +PH+K++ Y L+ AAWG L +N SQL IRSYEL
Sbjct: 309 LWNAYFFRRNAWLPHMKSYMAYEESGNIFWFLLTSANLSTAAWGRLVRNQSQLFIRSYEL 368
Query: 520 GVLILPSAKRHGCGFSC-TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 578
GVL P C ++C N++ ++ + TS + K ++ +
Sbjct: 369 GVLWTPML----CSYTCPMDNVI--QLTTPQHITSYYPREK-------------NNNILF 409
Query: 579 YLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 615
LP+P++LPPQ Y S D PW WD Y D G VWP
Sbjct: 410 CLPLPFQLPPQHYDSNDSPWLWDAIYKSPDRLGNVWP 446
>gi|20150581|pdb|1JY1|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)
Length = 464
Score = 249 bits (635), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 173/483 (35%), Positives = 247/483 (51%), Gaps = 69/483 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 19 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 78
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K LL+Y G+R+++HT+
Sbjct: 79 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKXXLLLYEEGLRVVIHTS 138
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ +LI YL+ P +
Sbjct: 139 NLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKANLISYLTAYNAPSLKEWI--- 195
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K + S V LI S PG GS WGH +L+ +L++ +S
Sbjct: 196 -------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSXPNAESW 248
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E S + E KTP PL +++P+VE+VR SLE
Sbjct: 249 PVVGQFSSVGSLGADESKWLCSEFKESXLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 308
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS A PHIKT+ R
Sbjct: 309 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAXPHIKTYXRPSPDFSKIAWFLV 368
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
L+KAAWGAL+KN +QL IRSYELGVL LPSA S V + +GS E
Sbjct: 369 TSANLSKAAWGALEKNGTQLXIRSYELGVLFLPSA------LGLDSFKVKQKFFAGSQEP 422
Query: 553 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 611
PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 423 XAT------------------------FPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 458
Query: 612 QVW 614
W
Sbjct: 459 NXW 461
>gi|395746171|ref|XP_003778400.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Pongo abelii]
Length = 589
Score = 249 bits (635), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 170/487 (34%), Positives = 258/487 (52%), Gaps = 51/487 (10%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDGA +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGAAQRTENHGPPT 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSRALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPQIVDGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPDAESWPVVGQFSSIGSLGSDESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E+KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKENKTPGKTSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSA 527
VL LPSA
Sbjct: 541 VLFLPSA 547
>gi|397525721|ref|XP_003832804.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 3 [Pan paniscus]
Length = 589
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 169/487 (34%), Positives = 256/487 (52%), Gaps = 51/487 (10%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQRQAEKVVIKKEKDISAPNDGTAQRTENHGPPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFEESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSA 527
VL LPSA
Sbjct: 541 VLFLPSA 547
>gi|281340418|gb|EFB16002.1| hypothetical protein PANDA_009635 [Ailuropoda melanoleuca]
Length = 388
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 162/421 (38%), Positives = 223/421 (52%), Gaps = 66/421 (15%)
Query: 219 VLVIHGESDGTLEHMK-RNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTAN 276
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+N
Sbjct: 6 ILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSN 65
Query: 277 LIHVDWNNKSQGLWMQDF--PLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHG 332
LIH DW+ K+QG+W+ P+ + S E F+ DLI YL P +
Sbjct: 66 LIHADWHQKTQGIWLSPLYPPIIHGTHRSGESTTHFKADLISYLMAYNAPSLKEWI---- 121
Query: 333 NFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPL 392
+ + S V LI S PG GS WGH +LR +L+E KG + P+
Sbjct: 122 ------DIIHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASPKG-ESWPV 174
Query: 393 VYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGY 447
V QFSS+GS+ D KW+ +E S+ + E +TP PL +++P+VE+VR SLEGY
Sbjct: 175 VGQFSSIGSMGADDSKWLCSEFKESLVTLGKESQTPGRSAVPLHLIYPSVENVRTSLEGY 234
Query: 448 AAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------YNG 494
AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 235 PAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYMRPSPDFSEIAWFLVTS 294
Query: 495 QKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 554
L+KAAWGAL+KN +QLMIRSYELGVL LPSA F S V + GS E +
Sbjct: 295 ANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFFGSKEPAA 348
Query: 555 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQV 613
PVPY+LPP+ Y S+D PW W+ YTK D +G +
Sbjct: 349 A------------------------FPVPYDLPPELYGSKDRPWIWNIPYTKAPDTHGNM 384
Query: 614 W 614
W
Sbjct: 385 W 385
>gi|426377770|ref|XP_004055628.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Gorilla gorilla
gorilla]
Length = 608
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 186/582 (31%), Positives = 278/582 (47%), Gaps = 96/582 (16%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHK 245
G ++ + NY D+DWL+ P + +L++HG+ H+ K
Sbjct: 191 PLFGMLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQA-------K 243
Query: 246 PPLPISFGTHHS---------KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP 295
P IS K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P
Sbjct: 244 PYENISLCQLSEIGKRFLLCEKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYP 303
Query: 296 -LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVR 352
+ D + S E F+ DLI YL P + K + S V
Sbjct: 304 RIADGTHKSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVY 353
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM- 407
LI S PG GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+
Sbjct: 354 LIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLC 413
Query: 408 AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFL 465
+E SM + E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L
Sbjct: 414 SEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWL 473
Query: 466 KKYWAKWKASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLM 513
Y+ KW A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLM
Sbjct: 474 HSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLM 533
Query: 514 IRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGA 573
IRSYELGVL LPSA F S V + +GS E
Sbjct: 534 IRSYELGVLFLPSA------FGLDSFKVKQKFFAGSQEP--------------------- 566
Query: 574 SSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
+ PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 567 ---MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|79154014|gb|AAI07878.1| TDP1 protein [Homo sapiens]
Length = 589
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 169/487 (34%), Positives = 256/487 (52%), Gaps = 51/487 (10%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELG 520
A +GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 521 VLILPSA 527
VL LPSA
Sbjct: 541 VLFLPSA 547
>gi|428172199|gb|EKX41110.1| hypothetical protein GUITHDRAFT_142267 [Guillardia theta CCMP2712]
Length = 465
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/334 (39%), Positives = 191/334 (57%), Gaps = 15/334 (4%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH 223
F L G+ N V +RDV+QGD++ AI +NYMV WLL +L+ IP V+ ++
Sbjct: 127 FWLFHTDGIEEPGNEQAVRLRDVVQGDVLWAIFTNYMVQERWLLSEIALLSSIPRVVFMY 186
Query: 224 GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWN 283
++ + + PP P +G HHSK MLL Y GVR++V TAN IH D
Sbjct: 187 ---PFLSSLASPPSSSSIVRYAPPTP-QYGVHHSKVMLLGYNTGVRVVVMTANHIHGDHY 242
Query: 284 NKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ + LW QDFPLK + E FE+DL+ Y +W LP K++ + ++
Sbjct: 243 DMTDALWAQDFPLKGEGE--ERSEFEDDLVSYFQATQWK--GTTLPCGS--KLDAQYLRR 296
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD 403
++F +A +++ASVPG H G + WGHMK+R +L TF+ F K P+V+Q +S+GSL
Sbjct: 297 YSFKNARAKIVASVPGRHQGEKMHMWGHMKMRRILSRETFDPLFNKCPMVWQCTSIGSLS 356
Query: 404 EKWMAELSSSMSSGFSEDKTPLGIGE--PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVD 461
EKW+ E +SS+ G + + +G E P +WPT+E+VR S +GY G +IP KNV
Sbjct: 357 EKWIEEFTSSLCEGKNTEGKNIGRPEEPPHFIWPTMEEVRTSSKGYTMGESIPGFSKNVH 416
Query: 462 KDFLKKYWAKWKASHTG---RSRAMPHIKTFARY 492
K FL K + +W + + R RAMPHIKT+ R+
Sbjct: 417 KPFLLKMFCRWSSGSSDPQLRRRAMPHIKTWLRF 450
>gi|431839199|gb|ELK01126.1| Tyrosyl-DNA phosphodiesterase 1 [Pteropus alecto]
Length = 709
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 154/395 (38%), Positives = 222/395 (56%), Gaps = 38/395 (9%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 163 PFQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKK 222
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 223 PILLVHGDKREAKAHLHAEAKPYGNISLCQAKLEIAFGTHHTKMMLLLYEEGLRVVIHTS 282
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ +P + N S E F+ DL+ YL + N PA
Sbjct: 283 NLIRADWHQKTQGIWLSPLYPRIAPGTNTSGESTTHFKADLVSYL-------MAYNAPA- 334
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K ++ + S V LI S PG GS WGH +LR +L+E +S
Sbjct: 335 --LKEWIDVIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASSIPKAESW 392
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GS+ + KW+ +E ++++ E KTP PL +++P+VE+VR SLE
Sbjct: 393 PVVGQFSSIGSMGADESKWLCSEFKETLATLGRESKTPGKSAVPLHLIYPSVENVRTSLE 452
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------Y 492
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 453 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSQIAWFLV 512
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSA 527
L+KAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 513 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA 547
Score = 45.8 bits (107), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 571 AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
+G+ PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 662 SGSQEPAASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 706
>gi|26329523|dbj|BAC28500.1| unnamed protein product [Mus musculus]
gi|148686960|gb|EDL18907.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Mus musculus]
Length = 579
Score = 242 bits (617), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 157/413 (38%), Positives = 225/413 (54%), Gaps = 50/413 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR----------- 491
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 492 -YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 543
L+KAAWGAL+KN +QLMIRSYELGVL LPSA SNIVP+
Sbjct: 512 LVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA--------FVSNIVPA 556
>gi|195997043|ref|XP_002108390.1| hypothetical protein TRIADDRAFT_19546 [Trichoplax adhaerens]
gi|190589166|gb|EDV29188.1| hypothetical protein TRIADDRAFT_19546 [Trichoplax adhaerens]
Length = 569
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 180/552 (32%), Positives = 277/552 (50%), Gaps = 99/552 (17%)
Query: 98 LIPGHHFFKYVTLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSR 157
L+ G + VT S +K S+D K QD+ + + +CN +
Sbjct: 63 LLVGEANSREVTESPRKKLKSHDVRVEQPRVETKEHSQDQAE-------PDQMCNKY--- 112
Query: 158 DKLPSTFRLLRVQGLPAWAN--TSCVSIRDVI---QGDIIVAILSNYMVDIDWLLPACPV 212
++ L +V+GL N TS + IR+++ + ++I +I NYM D+ WLL P
Sbjct: 113 -----SYYLSKVRGLNNNYNSRTSSIHIREILALEKSELISSIQFNYMFDVSWLLDQYPE 167
Query: 213 LAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVR 269
+ VL++HG +S LE + P N H+ L +++GTHHSK M L+Y G+R
Sbjct: 168 DYRKNPVLIVHGYSGQSRNNLEQQGQPFP-NVKFHQAKLEMAYGTHHSKMMFLLYSNGLR 226
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDFPL----KDQNNLSEECGFENDLIDYLSTLKWPEFS 325
I++HTANLI DW ++QG+W+ L K + N++++ GF+ DL+DY+++
Sbjct: 227 IVIHTANLIPQDWGRRTQGIWISPLFLKRSDKSEMNIADDTGFKQDLLDYVASYG----- 281
Query: 326 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEK 385
PA ++ S + + SS V LIASVPG H G ++ KWGH+KLR +L+ K
Sbjct: 282 ---PALFEWR---SRIMEHDMSSVNVFLIASVPGRHAGKNIDKWGHLKLRKILKRNGPSK 335
Query: 386 GFKKS--PLVYQFSSLGSLDEK---WM-AELSSSMSSGFSEDKTPLG--IGEPLIVWPTV 437
+ P + QFSS+GSL K W+ +E +S+SS + + LG + +++P+V
Sbjct: 336 DDVSANWPAICQFSSIGSLGSKRDAWLYSEFRTSLSSTSTTRLSQLGERKADVKLIFPSV 395
Query: 438 EDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ- 495
E+VR LEGY G+ +P + +K +L W A TGR RA PHIKT+ R +
Sbjct: 396 ENVRNCLEGYKGGSCLPYNRGTANKQPWLNSLLHNWAAKKTGRHRASPHIKTYTRVSPDN 455
Query: 496 -------------KLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVP 542
L+KAAWG ++KN +QLMIRSYE+GVL LP K+ G G
Sbjct: 456 TELAWFLITRQVANLSKAAWGTMEKNETQLMIRSYEIGVLFLP--KQFGDG--------- 504
Query: 543 SEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDK 602
K+ +T+ + +PY+LP Y +D PW+WD
Sbjct: 505 KTFKTCDLKTNWL--------------------------IPYDLPLIPYGLQDSPWTWDT 538
Query: 603 RYTKKDVYGQVW 614
+ + D +G W
Sbjct: 539 PHLEPDTHGAQW 550
>gi|432853024|ref|XP_004067503.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Oryzias latipes]
Length = 614
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 164/480 (34%), Positives = 246/480 (51%), Gaps = 74/480 (15%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +V GL NT + IRD++ G + ++ NY DI W++ P + VL
Sbjct: 177 FYLNKVTGLDKKYNTGALHIRDILSPLFGTLKESVQFNYCFDIPWMVQQYPPEFRDRPVL 236
Query: 221 VIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLI 278
++HG+ + + A + + L I+FGTHH+K MLL Y G R+I+ T+NLI
Sbjct: 237 IVHGDKREAKARLLQQAQAFPHVRFCQAKLDIAFGTHHTKMMLLWYEEGFRVIILTSNLI 296
Query: 279 HVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLIDYLSTLKWPEFSANLPAHGN 333
DW K+QG+WM + G F+ DL+DYL++ + PE +
Sbjct: 297 RADWYQKTQGMWMSPLFPRLPAGSGWSAGESPTFFKRDLLDYLTSYRAPELEEWI----- 351
Query: 334 FKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE-KGFKKSPL 392
K+ + S V L+ S PG G +++WGH++LR +L E T G +K P+
Sbjct: 352 -----QRIKEHDLSETRVYLVGSTPGRFVGPDMERWGHLRLRKLLYEHTNPIPGEEKWPV 406
Query: 393 VYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEG 446
+ QFSS+GS+ KW+A E +M++ P +P L+++P VEDVR SLEG
Sbjct: 407 IGQFSSIGSMGLDKTKWLAGEFQRTMTTLGKSSSRP----DPPVLLLYPAVEDVRMSLEG 462
Query: 447 YAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ---------- 495
Y AG ++P + K +L Y+ +WKA+ TGRS AMPHIKT+ R +
Sbjct: 463 YPAGGSLPYSIQTAQKQLWLHGYFHRWKANATGRSHAMPHIKTYMRVSPDFTELAWFLVT 522
Query: 496 KLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI 555
+ +AWGAL+KNNSQ+M+RSYELGVL +PSA
Sbjct: 523 RCLLSAWGALEKNNSQVMVRSYELGVLYVPSA---------------------------- 554
Query: 556 QKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
L T S+ +SS +L VP++LPP Y+++D PW W+ Y+++ D +G +W
Sbjct: 555 --FNLKTFPVDKSAFPVSSSSSGFL-VPFDLPPTPYAAKDQPWIWNIPYSQEPDTHGNIW 611
>gi|325180643|emb|CCA15048.1| tyrosylDNA phosphodiesterase putative [Albugo laibachii Nc14]
Length = 489
Score = 235 bits (599), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 166/479 (34%), Positives = 241/479 (50%), Gaps = 79/479 (16%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH 223
F L ++GL A N +++ D++ G+ +LSNYM D+ WL+ V + +
Sbjct: 60 FYLTPIKGLSAAQNQYSIALTDLLDGEFTSCLLSNYMYDVPWLMQQYFV------SIFLF 113
Query: 224 GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWN 283
+S ++H + K N P LPI FGTHHSK M++ Y VR+ + TAN + +DWN
Sbjct: 114 WQS---IKH-QCQKYTNIKTIAPYLPIPFGTHHSKMMIIWYAEKVRVAIFTANFLPIDWN 169
Query: 284 NKSQGLWMQDFPLKDQNNLS-------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
NK+QG+W QDF LK + + S E FE DLIDYL + G +
Sbjct: 170 NKTQGIWFQDFGLKSETSASSRTNLWPERIDFEADLIDYL-------IHVDKIHLGELCL 222
Query: 337 NPSFFKKFNFSSAAVRLIASVPGYHTGSS----LKKWGHMKLRTVLQECTFEKGFKKSPL 392
+K++FS+A V L+ASVPG H + + K+GH+++R +LQ T E + PL
Sbjct: 223 T---LEKYDFSTANVALVASVPGTHKNRAIWIDMHKYGHLRMRRLLQ--TLEAWNNEYPL 277
Query: 393 VYQFSSLGSLDEKWMA-ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGN 451
+ QFSSLGSL E W+ E + S+ + + + P ++WP+ E VR S+EG+ AG
Sbjct: 278 ICQFSSLGSLTEPWLYHEFTESLQAHSTTKQRP----ALHLIWPSAEQVRNSIEGWNAGR 333
Query: 452 AIPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAMPHIKTFARYNGQKL------------- 497
AIP P KN+ K FL K+ W RS AMPHIK++A+++ L
Sbjct: 334 AIPCPLKNM-KPFLHKFLRTWNPPPKLHRSNAMPHIKSYAQFDPTALDGTLRWALLSSSN 392
Query: 498 -AKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 556
+ AAWG+ QK +Q MIRS+E+GVL P R+ CT +V
Sbjct: 393 LSSAAWGSYQKQKNQFMIRSFEIGVLFHPKVYRNDK--LCTDPLV--------------- 435
Query: 557 KTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS-EDVPWSWDKRYTKKDVYGQVW 614
V T +D AS + P PY P Q Y + +D PW W+ + D G +
Sbjct: 436 ----VIGT---PADEAASQNAIRFPAPYNFPLQAYDTKQDEPWIWNLAWDLPDSTGACY 487
>gi|405964823|gb|EKC30268.1| Tyrosyl-DNA phosphodiesterase 1 [Crassostrea gigas]
Length = 461
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 162/485 (33%), Positives = 244/485 (50%), Gaps = 74/485 (15%)
Query: 161 PSTFRLLRVQGLPAWANTS-CVSIRDVI---QGDIIVAILSNYMVDIDWLLPACPVLAKI 216
P +F L +V G+ + N + +S+RD++ G++ + NYM +I WL+ P +
Sbjct: 17 PLSFFLTKVYGISSDYNGAYTMSLRDILSESMGNLQESCQFNYMFEIPWLIQQYPASFRQ 76
Query: 217 PHVLVIHGESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L +HG G ++ + K N + L + +GTHH+K M L+Y G+R+++HT
Sbjct: 77 KPLLCVHGFQGGQKAGLEADARKFTNIKFCQAKLEMPYGTHHTKMMFLLYDNGLRVVIHT 136
Query: 275 ANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLIDYLSTLKWPEFSANLP 329
ANLI DW+ K+QG+W+ K ++ S G F+ DL+ Y++ K
Sbjct: 137 ANLIERDWHQKTQGIWISPVFPKLKSGPSPTQGDSPTHFKRDLLQYVAAYK--------- 187
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFK 388
K + + SSA V ++ SVPG H +GHMKLR +L E ++
Sbjct: 188 -AYQLKDWQDHISRHDLSSANVFIVGSVPGRHMAEKKHWFGHMKLRKLLNENGPVKEQAS 246
Query: 389 KSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSL 444
K P++ QFSS+GSL E W++ E S+++ PL E +++PTV++VR SL
Sbjct: 247 KWPVIGQFSSIGSLGASKENWLSVEFLQSLATVKGTSSVPLAPVEFKLIFPTVDNVRTSL 306
Query: 445 EGYAAGNAIPSPQKNVDKD--FLKKYWAKWKASHTGRSRAMPHIKTFAR----------- 491
EGY AG +IP NV K +L Y+ +WK+ GR+RAMPHIKT+ R
Sbjct: 307 EGYPAGGSIPY-SINVAKKQPWLHSYFHQWKSEGRGRNRAMPHIKTYCRPSPTWEEAAWF 365
Query: 492 -YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+KAAWGAL+K SQLMIRSYE+GVL +P F C+S +
Sbjct: 366 LVTSSNLSKAAWGALEKKGSQLMIRSYEIGVLFIPKYLVENAVFECSSKV---------- 415
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDV 609
+AG + V +PY+LPP+ Y+ D PW WD + + D
Sbjct: 416 ------------------KEAGQKTFV----LPYDLPPRAYTKSDKPWIWDIAHKELPDS 453
Query: 610 YGQVW 614
G +W
Sbjct: 454 NGNMW 458
>gi|440800948|gb|ELR21974.1| tyrosylDNA phosphodiesterase-related, putative [Acanthamoeba
castellanii str. Neff]
Length = 601
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 156/446 (34%), Positives = 223/446 (50%), Gaps = 82/446 (18%)
Query: 172 LPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLE 231
PA AN + IR +I ++ A++ Y VD+DWL+ CPVL P V +
Sbjct: 231 FPADANQGALGIRQIIPENVERAVIVTYQVDMDWLMRRCPVLPHPPPPNVHY-------- 282
Query: 232 HMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWM 291
+KP W+L +G HH K MLL + + TANLI D+ K+QG+W+
Sbjct: 283 ----HKP--WVL-------DYGCHHGKMMLLFWK-----AITTANLIQKDYERKTQGIWL 324
Query: 292 QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
QDFP K + FE+ L+DY ++ + PS + +++S+ V
Sbjct: 325 QDFPKKRGD-------FEDTLVDYF---------GHMGNERQLQFQPSSLRHYDYSAVRV 368
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF-KKSPLVYQFSSLGSLDEKWMAEL 410
L+ SVPGYH+ ++L ++GHM+LR +L T ++S + QFSS+GSL KW+ E
Sbjct: 369 ALVTSVPGYHSRATLNRYGHMRLRGLLSRVTMPAEIERRSSVACQFSSVGSLTAKWVEEE 428
Query: 411 --SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 468
S M+S S D E +VWPTV+ VR S++GYAAG ++ + N KDF+
Sbjct: 429 FGQSLMASAGSSDSKKEAQVE--LVWPTVDYVRSSIDGYAAGGSLCFGESN-RKDFMTPL 485
Query: 469 WAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAK 528
+ ++KA R R PHIK L+KAAWGALQK N+QLMIR++E+GVL LPS
Sbjct: 486 FRQYKAMPESRGRVTPHIKVC--LTSANLSKAAWGALQKGNTQLMIRNFEIGVLFLPSH- 542
Query: 529 RHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 588
F + I GS+ A S + V +P+PY + P
Sbjct: 543 -----FDDRTFIA-------------------------GSAPAALSKDSVVIPLPYRIEP 572
Query: 589 -QRYSSEDVPWSWDKRYTKKDVYGQV 613
+RY D PW WD + D GQ
Sbjct: 573 LERYGPRDEPWIWDLPRPEPDALGQT 598
>gi|241556145|ref|XP_002399612.1| tyrosyl-DNA phosphodiesterase, putative [Ixodes scapularis]
gi|215499691|gb|EEC09185.1| tyrosyl-DNA phosphodiesterase, putative [Ixodes scapularis]
Length = 624
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 156/479 (32%), Positives = 240/479 (50%), Gaps = 76/479 (15%)
Query: 169 VQGLPAWANTSCV--SIRDVI---QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH 223
V+G+PA N + SI D++ G+++ + NY DI WL+ P + +L++H
Sbjct: 180 VKGIPAIYNAPSIARSIEDILSPNMGELVRSAQFNYCFDIPWLVERYPAEFRNLPLLIVH 239
Query: 224 GESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVD 281
GE ++ + + + + L I +GTHH+K MLL+Y G+R+++HT+NL+ D
Sbjct: 240 GEQRDAKRELEASASSFKHVSFAQAKLEIVYGTHHTKMMLLLYKEGMRVVIHTSNLVESD 299
Query: 282 WNNKSQGLWMQDFPLK---DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP 338
W K+Q W+ K GF DL++YL + +G+ KIN
Sbjct: 300 WAQKTQAAWIGPLCPKASGGAGGGDSATGFRADLLEYLGS------------YGDPKINE 347
Query: 339 --SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLVY 394
+ + +FS+ V L+ SVPG HTG+ +GH+KLR +L K S P +
Sbjct: 348 WCHYLRAHDFSAVKVFLVGSVPGRHTGARKSSFGHLKLRKLLSLHGPPKELVSSYWPAIA 407
Query: 395 QFSSLGSLD---EKWM-AELSSSMSS-GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 449
QFSS+GSL + W+ AE +S+++ TP +V+P+V+DVRCSLEGY A
Sbjct: 408 QFSSIGSLGTGPDNWLRAEFLTSLAAVKGGPPLTPSSTVPVKLVFPSVDDVRCSLEGYPA 467
Query: 450 GNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQK 496
G +IP +K +L Y+ +W++ GR+ A PH+K++AR +
Sbjct: 468 GASIPYSISTANKQRWLDAYFFRWRSGRFGRTHASPHVKSYARLSPSGKQTAWLLVTSAN 527
Query: 497 LAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 556
L+KAAWGA +K+ SQLMIRSYELGVL P Q
Sbjct: 528 LSKAAWGAFEKSGSQLMIRSYELGVLFFPG-----------------------------Q 558
Query: 557 KTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
T T G S AG ++ VP+++P Y +DVPW+WD ++ + D +G +W
Sbjct: 559 FGDARTFTVGGDSMAGKGCLPLF--VPFDVPLTPYGQDDVPWTWDSQHREAPDRFGNMW 615
>gi|256073128|ref|XP_002572884.1| tyrosyl-DNA phosphodiesterase [Schistosoma mansoni]
Length = 1234
Score = 224 bits (572), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 156/459 (33%), Positives = 244/459 (53%), Gaps = 79/459 (17%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH---GESDGTLEHMKRNKPANWILHK 245
G+++ +I N+M DI WL P + + ++H G+ +L+ K +N +
Sbjct: 819 GELVSSIQFNFMFDIPWLREQYPERFRSLPLTIVHDFQGKMKKSLDE-SVAKYSNIRTCQ 877
Query: 246 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNN 301
+ + +G HH+K M+L Y G++II+HTAN+I DW+ ++QG+WM ++ Q N
Sbjct: 878 ADIRLPYGVHHTKMMMLKYKDGLKIIIHTANMISDDWDRRTQGIWMSPKLKLLSVEQQKN 937
Query: 302 LSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF-----FKKFNFSSAAVRL 353
L++ + F DL++YL + + +L + +P F ++F V L
Sbjct: 938 LNDTDSKTNFRADLLEYLKS-----YGRDLTQSTS---SPLFEWINCLHSYDFRPIKVVL 989
Query: 354 IASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK----WMAE 409
IASV G H G SLKK+GH +L VLQ C + P++ QFSS+GSL K + E
Sbjct: 990 IASVSGRHAGESLKKFGHTRLGEVLQTCNSQIP-SSWPVIGQFSSIGSLGPKPTDWFTTE 1048
Query: 410 LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKY 468
SSS++ K G+ +++P+VEDVR SLEGY AG +P + +K +L ++
Sbjct: 1049 WSSSLAG-----KGARGL---RMIYPSVEDVRNSLEGYFAGGCLPYTKTTAEKQPWLCQF 1100
Query: 469 WAKWKASHTGRSRAMPHIKTFARY--NGQK----------LAKAAWGALQKNNSQLMIRS 516
+ +W+A + SRA PHIK++ R +GQ+ L+K+AWGA +K+ SQLMIRS
Sbjct: 1101 FYRWQAFN--HSRAAPHIKSYTRMSPDGQQIGWFLLTSANLSKSAWGAYEKSKSQLMIRS 1158
Query: 517 YELGVLILPS-AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 575
YELGVL LP+ K F EI + + SQ ++
Sbjct: 1159 YELGVLFLPTNYKESAHSF---------EILKNNAKYSQ-----------------SSTD 1192
Query: 576 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
E++ P+PYELPP +Y S D PW DK ++ D++G++W
Sbjct: 1193 ELLPFPIPYELPPVKYQSNDEPWILDKPHSLPDIFGRIW 1231
>gi|320165079|gb|EFW41978.1| tyrosyl-DNA phosphodiesterase 1 [Capsaspora owczarzaki ATCC 30864]
Length = 622
Score = 221 bits (563), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 149/410 (36%), Positives = 214/410 (52%), Gaps = 60/410 (14%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVI 222
F+L R G+ W N + S+R ++ D+ ++ NYMVD+DWL+ P + + V+
Sbjct: 195 FQLTRAGGINEWFNRNAFSLRQLLSDMDLQSSVQFNYMVDLDWLMTIFPRELQARPMTVV 254
Query: 223 HGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDW 282
HG ++ K + +PPLPI+FGTHH+K M L Y +RI++HTAN+I DW
Sbjct: 255 HGLTESADVLQAAGKKWGKTIIRPPLPIAFGTHHTKMMFLFYSDSMRIVIHTANIIPSDW 314
Query: 283 NNKSQGLWMQ-DFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN-FKI 336
K++G+W FPLK Q + S FE L YL+ A+G+ +
Sbjct: 315 YAKTEGVWCSPKFPLKASTAQQASSSTGRAFEQTLNKYLT------------AYGSCIRQ 362
Query: 337 NPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV-LQECTFEKGFKKSPLVYQ 395
K++FS+A V LIASVPG H G + +WGHM+LR + L + L+ Q
Sbjct: 363 VREQAMKYDFSAANVALIASVPGRHAGLAKSEWGHMQLRKLPLPANVASQPVNTHQLIGQ 422
Query: 396 FSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYA 448
FSS+GSL E W+ +E S S+S+ ++ +P I P +++P+VE+VR SLEGY
Sbjct: 423 FSSIGSLGASPETWLTSEFSVSLSAHKAQGLSP-PIAHPRALRLIFPSVENVRLSLEGYL 481
Query: 449 AGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR---------------- 491
AG A+P K +L +++ W A+ +GR AMPHIK++AR
Sbjct: 482 AGGALPYRLATHSKQAWLDQFFCTWNATRSGRQHAMPHIKSYARIAVSPKTADSAQQAEA 541
Query: 492 ------------YNGQKLAKAAWGALQKNNS---QLMIRSYELGVLILPS 526
L+KAAWG LQK + QL IRSYELGVL PS
Sbjct: 542 TDSTNVALGWFLLTSANLSKAAWGTLQKKGTAAEQLEIRSYELGVLFHPS 591
>gi|28071068|emb|CAD61915.1| unnamed protein product [Homo sapiens]
Length = 369
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 147/381 (38%), Positives = 200/381 (52%), Gaps = 64/381 (16%)
Query: 258 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLI 313
K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI
Sbjct: 26 KMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLI 85
Query: 314 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 373
YL P + K + S V LI S PG GS WGH +
Sbjct: 86 SYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFR 135
Query: 374 LRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIG 428
L+ +L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP
Sbjct: 136 LKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSS 195
Query: 429 EPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 486
PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHI
Sbjct: 196 VPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHI 255
Query: 487 KTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 534
KT+ R L+KAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 256 KTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------F 309
Query: 535 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 594
S V + +GS E + PVPY+LPP+ Y S+
Sbjct: 310 GLDSFKVKQKFFAGSQE------------------------PMATFPVPYDLPPELYGSK 345
Query: 595 DVPWSWDKRYTKK-DVYGQVW 614
D PW W+ Y K D +G +W
Sbjct: 346 DRPWIWNIPYVKAPDTHGNMW 366
>gi|47220883|emb|CAG03090.1| unnamed protein product [Tetraodon nigroviridis]
Length = 607
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 157/454 (34%), Positives = 233/454 (51%), Gaps = 98/454 (21%)
Query: 206 LLPACP--------VLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLP-------- 249
LL ACP L + VL++HG+ KR A + P
Sbjct: 204 LLQACPRRQSPHQWCLRRDRPVLIVHGD--------KREAKARLVQQAQAFPHVQFCQAK 255
Query: 250 --ISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE- 305
I+FGTHH+K MLL Y G R+++ T+NLI DW K+QG+WM FP + + +
Sbjct: 256 LDIAFGTHHTKMMLLWYEEGFRVVILTSNLIRADWYQKTQGMWMSPLFPRLPEGSGARAG 315
Query: 306 ---CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT 362
F+ DL++YL++ + + + ++ + S A+V L+ S PG +
Sbjct: 316 ESPTSFKRDLLEYLASYRAAQLEEWM----------ERIQEHDLSEASVYLVGSTPGRYV 365
Query: 363 GSSLKKWGHMKLRTVLQECT-FEKGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSS- 416
G+ +++WGH++LR +L+E T G + P+V QFSS+GS+ KW+A E ++S+
Sbjct: 366 GADMERWGHLRLRKLLREHTETPAGQDRWPVVGQFSSIGSMGLDKSKWLAGEFQHTLSTL 425
Query: 417 GFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWK 473
G S ++ PL L+++P+VEDVR SLEGY AG ++P S Q + +L ++ +W+
Sbjct: 426 GQSSARSDPPL-----LLLYPSVEDVRTSLEGYPAGGSLPYSIQTAQRQLWLHAFFHRWR 480
Query: 474 ASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGV 521
A TGRS AMPHIKT+ R L+KAAWGAL+KNN+Q+MIRSYELGV
Sbjct: 481 ADSTGRSHAMPHIKTYMRASPGYTELAWFLVTSANLSKAAWGALEKNNTQVMIRSYELGV 540
Query: 522 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 581
L LP+A + T + S +SS P
Sbjct: 541 LFLPAA------------------------------FNMKTFPVNTSPFPVSSSSFSGFP 570
Query: 582 VPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
VP++LPP YS +D PW W+ Y++ D +G VW
Sbjct: 571 VPFDLPPTAYSPKDQPWIWNIPYSQAPDTHGNVW 604
>gi|67971950|dbj|BAE02317.1| unnamed protein product [Macaca fascicularis]
gi|67971954|dbj|BAE02319.1| unnamed protein product [Macaca fascicularis]
Length = 343
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 144/379 (37%), Positives = 199/379 (52%), Gaps = 64/379 (16%)
Query: 260 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 315
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 2 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISY 61
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 375
L P + + + S V LI S PG GS WGH +LR
Sbjct: 62 LMAYNAPSLKEWIDT----------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 111
Query: 376 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 430
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 112 KLLKDHASSIPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 171
Query: 431 L-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKT 488
L +++P+VE+VR SLEGY AG ++P + +K ++L Y+ KW A +GRS AMPHIKT
Sbjct: 172 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 231
Query: 489 FAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 536
+ R L+KAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 232 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 285
Query: 537 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 596
+ V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 286 DNFKVKQKFFAGSQE------------------------PMATFPVPYDLPPELYGSKDR 321
Query: 597 PWSWDKRYTKK-DVYGQVW 614
PW W+ Y K D +G +W
Sbjct: 322 PWIWNIPYVKAPDTHGNMW 340
>gi|198414495|ref|XP_002123899.1| PREDICTED: similar to tyrosyl-DNA phosphodiesterase 1 [Ciona
intestinalis]
Length = 471
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 145/369 (39%), Positives = 214/369 (57%), Gaps = 46/369 (12%)
Query: 181 VSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK 237
+ I+DV+ G++I ++ NY +D+DWL+ PV + + +IHG G + +
Sbjct: 123 LGIKDVLSEKFGNLIESVQFNYCIDVDWLIQQYPVSCQGKPLTIIHG---GNVS--PNPQ 177
Query: 238 PANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
N L K LP +GTHH+K MLL Y G+R+++ T NL+ DW K+QG WM P+
Sbjct: 178 YPNITLVKVNLP-PYGTHHTKMMLLHYTSGLRVVILTTNLVPQDWGQKTQGFWMS--PIF 234
Query: 298 DQNNLSEECGFENDL-IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
+ ++ F+ ++Y+S+ K + + + + + SSA V LI S
Sbjct: 235 PKTTPTKTSKFKPRFGLEYVSSYK----------NKSLQRWVDHIRSHDMSSANVILIGS 284
Query: 357 VPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL---DEKWMA-ELSS 412
+PG HTG +L WGHM+LR VL+ T +K P++ QFSS+GSL ++KW+ E +
Sbjct: 285 IPGRHTGHNLSTWGHMRLRKVLKNET-KKIDSSWPVIGQFSSIGSLGSSNQKWLCNEWLT 343
Query: 413 SMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWA 470
S+SS T LG PL +++P+V+DVR SLEGY AG +IP S + + +L+ Y
Sbjct: 344 SLSSC---SNTTLGASPPLKLIFPSVDDVRMSLEGYPAGASIPYSRNIALKQPWLRPYLH 400
Query: 471 KWKASHTGRSRAMPHIKTFAR---YN-----------GQKLAKAAWGALQKNNSQLMIRS 516
KW A+H GR++A PHIK++AR YN L+KAAWG+L+KNNSQL I+S
Sbjct: 401 KWVATHAGRTQAAPHIKSYARISPYNTNIRLPWFLLTSANLSKAAWGSLEKNNSQLSIKS 460
Query: 517 YELGVLILP 525
YELGVL LP
Sbjct: 461 YELGVLFLP 469
>gi|443688556|gb|ELT91218.1| hypothetical protein CAPTEDRAFT_131694, partial [Capitella teleta]
Length = 374
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 133/351 (37%), Positives = 196/351 (55%), Gaps = 35/351 (9%)
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH----KPPLPISFG 253
N+ +DI WL+ PV + +LV+HG + +++R A H + L + +G
Sbjct: 2 NFKIDIPWLVAQYPVHHRTKPLLVVHGSTRQEKANLERE--ARLFTHVDLCQAKLEMIYG 59
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQN-NLSEECGFEN 310
THH+K M+L Y GVR+I+HTANLIH DW+ K+QG+WM PL Q+ N F+
Sbjct: 60 THHTKMMILSYVNGVRVIIHTANLIHSDWHQKTQGVWMSPLFPPLAPQSRNGDSPTNFKR 119
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
DL+ Y++ K + + S K+ +FS+A V LIASVPG H+G+SL ++G
Sbjct: 120 DLLQYINAYKSQSLNEWI----------SIIKRHDFSTAKVFLIASVPGRHSGASLNEFG 169
Query: 371 HMKLRTVLQEC-TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGE 429
H+KL+ VL++ K+ P++ QFSS+GSL + LSS + + FS + +
Sbjct: 170 HLKLKKVLRQFGPSSDACKQWPVLAQFSSIGSLGPTPESWLSSELLTSFSATRGSGSQSK 229
Query: 430 PL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHI 486
P +++P DVR SLEGY AG ++P K + + +W++ GR++A PHI
Sbjct: 230 PRLHLMYPCRHDVRLSLEGYGAGGSLPYSINTAKKQPWFRTICNRWRSECNGRTKACPHI 289
Query: 487 KTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
KT+ R L+KAAWG L+K SQLM+RSYELGVL LP
Sbjct: 290 KTYLRASPDWHNLAWFTLTSANLSKAAWGMLEKQGSQLMVRSYELGVLFLP 340
>gi|349604421|gb|AEP99976.1| Tyrosyl-DNA phosphodiesterase 1-like protein, partial [Equus
caballus]
Length = 345
Score = 213 bits (541), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 140/384 (36%), Positives = 198/384 (51%), Gaps = 68/384 (17%)
Query: 257 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFEN 310
+K MLL+Y G+R+++HT+NL+H DW+ K+QG+W+ PL + ++ F+
Sbjct: 1 TKMMLLLYEEGLRVVIHTSNLLHADWHQKTQGIWLS--PLYSRIVHGTHSSGESTTHFKA 58
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
DLI YL P + ++ + S V LI S PG GS WG
Sbjct: 59 DLISYLMAYNAPSLKEWI----------DVIQEHDLSETNVYLIGSTPGRFQGSQKDNWG 108
Query: 371 HMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPL 425
H +LR +L+E +S P+V QFSS+GS+ + KW+ +E S+ + E KTP
Sbjct: 109 HFRLRALLKEHASSIPKAESWPIVGQFSSIGSMGADESKWLCSEFKESLVTLGKESKTPG 168
Query: 426 GIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAM 483
P +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AM
Sbjct: 169 KSVSPFHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAM 228
Query: 484 PHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 531
PHIKT+ R L+KAAWGAL++N +QLMIRSYELGVL LPSA
Sbjct: 229 PHIKTYMRPSPDFSQIAWFLVTSANLSKAAWGALERNGAQLMIRSYELGVLFLPSA---- 284
Query: 532 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 591
F S V + S + E + PVPY+LPP+ Y
Sbjct: 285 --FGLDSFKVKQKFFSDNQEPT------------------------ASFPVPYDLPPELY 318
Query: 592 SSEDVPWSWDKRYTKK-DVYGQVW 614
S+D PW W+ Y K D +G +W
Sbjct: 319 GSKDRPWIWNIPYIKAPDTHGNMW 342
>gi|18044048|gb|AAH19804.1| Tdp1 protein [Mus musculus]
Length = 343
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 141/380 (37%), Positives = 197/380 (51%), Gaps = 66/380 (17%)
Query: 260 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 315
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DLI Y
Sbjct: 2 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHAAGESSTRFKADLISY 61
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 375
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 62 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 111
Query: 376 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 429
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 112 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 170
Query: 430 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 487
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 171 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 230
Query: 488 TFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 535
T+ R L+KAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 231 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 284
Query: 536 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 595
+ V + S S E + PVPY+LPP+ Y S+D
Sbjct: 285 LDTFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYRSKD 320
Query: 596 VPWSWDKRYTKK-DVYGQVW 614
PW W+ Y K D +G +W
Sbjct: 321 RPWIWNIPYVKAPDTHGNMW 340
>gi|358337002|dbj|GAA55434.1| tyrosyl-DNA phosphodiesterase 1, partial [Clonorchis sinensis]
Length = 1156
Score = 209 bits (533), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 146/432 (33%), Positives = 219/432 (50%), Gaps = 59/432 (13%)
Query: 188 QGDIIVAILSNYMVDIDWLLP-------ACPVLAKIPHVLVIHGESDGTLEHM--KRNKP 238
GD++ + NYM D+DWL+ +CP+L V HG+ L + K
Sbjct: 759 HGDLVSSAQFNYMFDVDWLMQQYPKQFRSCPLLL----VHAYHGQDKAALNSVVSKYENI 814
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
+ H + + FGTHH+K M L Y G+RI++HTAN+I DW+ ++QG+W+ L+
Sbjct: 815 RQCVAH---IRLPFGTHHTKMMFLKYADGLRIVIHTANMIPDDWDRRTQGIWLSPKLLRK 871
Query: 299 QNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
SE + F L++YL + A P+ + +++FS V L+
Sbjct: 872 SGTSSETDSDTKFRETLVNYLR--GYGSTVAGTPSSPLGEWIEELL-QYDFSPIRVFLVG 928
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
SV G H GSSLK +GH +L +LQ+ T E PL+ QFSS+GSL + L++ S
Sbjct: 929 SVSGMHGGSSLKHFGHPRLANLLQDYTLEVP-SSWPLIGQFSSIGSLGAQPTTWLTTQWS 987
Query: 416 SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKA 474
S + K G+ +++P V+DVR SLEGYAAG +P ++ +K +L+++ +W A
Sbjct: 988 SSLA-GKGARGL---RMIFPCVDDVRNSLEGYAAGGCLPYSRQTAEKQPWLRQFLHRWCA 1043
Query: 475 SHTGRSRAMPHIKTFARYNGQ------------KLAKAAWGALQKNNSQLMIRSYELGVL 522
SRA PHIK++ R + L+KAAWG+ K+ SQLMIRSYELGVL
Sbjct: 1044 G--PHSRAAPHIKSYTRISNDGTHASWFLLTSANLSKAAWGSFVKDGSQLMIRSYELGVL 1101
Query: 523 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 582
+P + +C + PS + S QI AG + + PV
Sbjct: 1102 FVPGQFQEKA--NCFRLVTPSRTTTPSDALKQI---------------AGMRTHSIPFPV 1144
Query: 583 PYELPPQRYSSE 594
PY+LPP Y ++
Sbjct: 1145 PYDLPPVLYDTD 1156
>gi|360045261|emb|CCD82809.1| putative tyrosyl-DNA phosphodiesterase [Schistosoma mansoni]
Length = 483
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 151/474 (31%), Positives = 240/474 (50%), Gaps = 89/474 (18%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH---GESDGTLEHMKRNKPANWILHK 245
G+++ +I N+M DI WL P + + ++H G+ +L+ K +N +
Sbjct: 48 GELVSSIQFNFMFDIPWLREQYPERFRSLPLTIVHDFQGKMKKSLDE-SVAKYSNIRTCQ 106
Query: 246 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNN 301
+ + +G HH+K M+L Y G++II+HTAN+I DW+ ++QG+WM ++ Q N
Sbjct: 107 ADIRLPYGVHHTKMMMLKYKDGLKIIIHTANMISDDWDRRTQGIWMSPKLKLLSVEQQKN 166
Query: 302 LSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF-----FKKFNFSSAAVRL 353
L++ + F DL++YL + + +L + +P F ++F V L
Sbjct: 167 LNDTDSKTNFRADLLEYLKS-----YGRDLTQSTS---SPLFEWINCLHSYDFRPIKVVL 218
Query: 354 IASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSS 413
IASV G H G SLKK+GH +L VLQ C + P++ QFSS+GSL K ++
Sbjct: 219 IASVSGRHAGESLKKFGHTRLGEVLQTCNSQIP-SSWPVIGQFSSIGSLGPKPTDWFTTE 277
Query: 414 MSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKW 472
SS + K G+ +++P+VEDVR SLEGY AG +P + +K +L +++ +W
Sbjct: 278 WSSSLA-GKGARGLR---MIYPSVEDVRNSLEGYFAGGCLPYTKTTAEKQPWLCQFFYRW 333
Query: 473 KASHTGRSRAMPHIKTFARY--NGQK----------LAKAAWGALQKNNSQLMIRSYELG 520
+A + SRA PHIK++ R +GQ+ L+K+AWGA +K+ SQLMIRSYELG
Sbjct: 334 QAFN--HSRAAPHIKSYTRMSPDGQQIGWFLLTSANLSKSAWGAYEKSKSQLMIRSYELG 391
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL LP+ + EI + + SQ ++ E++
Sbjct: 392 VLFLPTNYKESAH--------SFEILKNNAKYSQ-----------------SSTDELLPF 426
Query: 581 PVPYELPPQRYSSED--------------------VPWSWDKRYTKKDVYGQVW 614
P+PYELPP +Y S PW DK ++ D++G++W
Sbjct: 427 PIPYELPPVKYQSNGKKLYMCIIIFLSLFFAMDKYEPWILDKPHSLPDIFGRIW 480
>gi|339256684|ref|XP_003370218.1| 7 transmembrane receptor [Trichinella spiralis]
gi|316965617|gb|EFV50306.1| 7 transmembrane receptor [Trichinella spiralis]
Length = 478
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 153/487 (31%), Positives = 233/487 (47%), Gaps = 73/487 (14%)
Query: 164 FRLLRVQGLPAWANTSCVSIRD---VIQGD----IIVAILSNYMVDIDWLLPACPVLAKI 216
F L +V GL N + VS+++ + G+ + N+++D W + P +
Sbjct: 27 FYLTKVYGLDEKWNENAVSMKNFNLALLGENPDELEATAQFNFLIDYGWTMAQYPENCRQ 86
Query: 217 PHVLVIHGESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+ ++ + + K N L LPI FGTHHSK LL Y +G+++ +HT
Sbjct: 87 KPLTIVTSSQSSRWNDLVNDVRKATNVSLVDARLPIPFGTHHSKMTLLRYRKGLKVAIHT 146
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE----CGFENDLIDYLSTLKWPEFSANLP 329
ANLI DW K+QG+++ FPL + N +++ F+ DLI YL+ P A
Sbjct: 147 ANLIEYDWCEKTQGMYISPLFPLIENNTGTDDYDSKTNFKADLIAYLNAYTNPAVKAWAE 206
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ-ECTFEKGFK 388
N+ + A V ++AS+PG H ++ WGH+KL +L+ ++
Sbjct: 207 EIENYDMR----------EANVFIVASIPGRHRDVAMYNWGHLKLGRILKTHLNYDAIDA 256
Query: 389 KSPLVYQFSSLGSLD---EKW-MAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDV 440
P+V QFSS+GSL EKW + E ++S+ E + EP +V+P+VE+V
Sbjct: 257 NWPVVCQFSSIGSLGTKPEKWLLGEFAASLGRTAFECS---ALQEPFRNLKLVYPSVENV 313
Query: 441 RCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKL 497
RCS EGY G +P + K +L+++ +W GRS A+PHIKT+ RY+ QKL
Sbjct: 314 RCSSEGYYGGTCLPYTEAVASKQQYLQQFMHRWMCECFGRSHAVPHIKTYFRYSPCFQKL 373
Query: 498 A----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKS 547
A KAAWG +K+N Q IRSYE+GVL +P F C NI
Sbjct: 374 AWFLLTSANLSKAAWGVTEKSNQQFNIRSYEIGVLFIPE-------FFCERNI------- 419
Query: 548 GSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK 607
+Q K T+ H + + ++ P+P +LP YS D W D Y +
Sbjct: 420 ----NFFLQGLKAFTI--HRNVETPSAE----FPLPMDLPLVPYSQNDKMWIIDIPYGEA 469
Query: 608 DVYGQVW 614
D +G W
Sbjct: 470 DAHGITW 476
>gi|148686962|gb|EDL18909.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_c [Mus musculus]
Length = 542
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 129/360 (35%), Positives = 194/360 (53%), Gaps = 31/360 (8%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAW 502
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R +K AW
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMR-PSPDFSKLAW 510
>gi|149025342|gb|EDL81709.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_b [Rattus norvegicus]
Length = 542
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 133/382 (34%), Positives = 198/382 (51%), Gaps = 33/382 (8%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D++WL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVNWLIKQYPPEFRKK 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P Q N + F+ DL YL P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTSYLMAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 390
++ + S V LI S PG GS WGH +LR +LQ +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLRKLLQAHAPSAPRGEC 392
Query: 391 -PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GSL + KW+ +E S+ + E +TP PL +++P+VE+VR SL
Sbjct: 393 WPVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAVPLHLIYPSVENVRTSL 452
Query: 445 EGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAWG 503
EGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R +K AW
Sbjct: 453 EGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGRSNAMPHIKTYMR-PSPDFSKLAWF 511
Query: 504 ALQKNNSQLMIRSYELGVLILP 525
+ + Q R Y V I P
Sbjct: 512 LVTR---QPAFR-YRCAVQICP 529
>gi|291233547|ref|XP_002736713.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Saccoglossus
kowalevskii]
Length = 431
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 134/379 (35%), Positives = 203/379 (53%), Gaps = 43/379 (11%)
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQDE----QDNENGKNSEEALCNFHVSRDKLPSTFRL 166
++S KR +D + LS KK R +DE + ++ ++ E + + + P F L
Sbjct: 60 NQSNKRRRSDEQPSSHLSCKKSRTEDESPQSKKSKTQSSTSEKMSPYENYIEAAPLNFFL 119
Query: 167 LRVQGLPAWANTS-CVSIRDVI---QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVI 222
+V G+P N+S V I+D++ G++I + NYM DI WL+ P + +L+I
Sbjct: 120 TKVFGIPNHYNSSLAVGIKDILSASMGNLISSAQFNYMFDIPWLVQQYPEQFRSKPLLII 179
Query: 223 HG--ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
HG +D T H ++ N L + L I +GTHHSK M L+Y G+R+++HTAN+IH
Sbjct: 180 HGSQRADKTTLHENAHRYPNITLCQAKLDIMYGTHHSKMMFLLYDNGMRVVIHTANIIHN 239
Query: 281 DWNNKSQGLWMQD-FP-LKDQNNLSEE---CGFENDLIDYLSTLKWPEFSANLPAHGNFK 335
DW K+QG+W+ FP L +LS+ F DL++YL A+G K
Sbjct: 240 DWYQKTQGVWISPLFPKLASDQDLSQGDSVTQFRKDLLEYLG------------AYGTNK 287
Query: 336 INPSF---FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-P 391
+ ++ + SSA V +I SVPG HTG+S KWGH+KLR VLQE + K P
Sbjct: 288 HLQEWQETIRQHDMSSAKVFIIGSVPGRHTGASKMKWGHLKLRKVLQEHGPDGSTVKDWP 347
Query: 392 LVYQFSSLGS--------LDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 443
++ QFSS+GS L +W+ LS+ ++G + P + +++P VE+VR S
Sbjct: 348 VIGQFSSVGSLGSGPENWLSSEWLESLSTVQANGIVKLSKP----KLNLIFPCVENVRRS 403
Query: 444 LEGYAAGNAIPSPQKNVDK 462
LEGY AG ++P KN K
Sbjct: 404 LEGYPAGASLPYSIKNARK 422
>gi|345304821|ref|XP_003428263.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Ornithorhynchus
anatinus]
Length = 580
Score = 191 bits (486), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 192/346 (55%), Gaps = 25/346 (7%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L +V+G+ N+ + IRD++ G ++ + NY D+DWL+ P +
Sbjct: 159 PFRFYLTKVKGIMPKYNSGALHIRDILSPLLGTLVSSAQFNYCFDVDWLIKQYPPEFRNK 218
Query: 218 HVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ + + ++ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 219 PLLLVHGDKREAKAQLHEQAKPYENICLCQAKLDIAFGTHHTKMMLLLYEEGMRVVIHTS 278
Query: 276 NLIHVDWNNKSQGLWMQD-FP--LKDQNNLSEECG-FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P +++ ++ + F+ DLI+YL P +
Sbjct: 279 NLIHADWHQKTQGIWLSPLYPRLVRETHSSGDSVTHFKTDLINYLMAYNSPSLKEWI--- 335
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K+ + S V LI S PG G + WGH +LR +L+E + ++S
Sbjct: 336 -------DIIKEHDLSETRVYLIGSTPGRFQGQKKEDWGHFRLRKLLEEHSSSIPEEESW 388
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEG 446
P+V QFSS+GS+ + KW+ +E S+ K+ G +++PTV++VR SLEG
Sbjct: 389 PIVGQFSSIGSMGADESKWLCSEFKDSLVMLGKSGKSQGGHVPIHLIYPTVDNVRKSLEG 448
Query: 447 YAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR 491
Y AG ++P + K +L Y+ KW A +GRS AMPHIKT+ R
Sbjct: 449 YPAGGSLPYSIQTAQKQLWLHSYFHKWSAEISGRSHAMPHIKTYMR 494
>gi|324510072|gb|ADY44216.1| Tyrosyl-DNA phosphodiesterase [Ascaris suum]
Length = 452
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 145/508 (28%), Positives = 236/508 (46%), Gaps = 90/508 (17%)
Query: 134 QQDEQDNENGKNSEEALCNFHVSRDKLPST-FRLLRVQGLPAWANTSCVSIRDVIQG-DI 191
+ D D + + ++ F L S ++ G P +T+ S+ ++++
Sbjct: 7 ENDGDDASSARTPSASMVKFRKQDSPLLSNRLYFTKIVGHPCRYSTNAFSLSELLELISP 66
Query: 192 IVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM------KRNKPANWILHK 245
I +I N+M+D+ WLL P + +I GE++GT H+ +R K N + +
Sbjct: 67 IASIHFNFMIDLHWLLSQYPERCSAYPISIIVGENNGT-NHLDVRAEARRCKADNVSVGR 125
Query: 246 PPLPISFGTHHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
L + +GTHHSK ++ + +++ TANL+ DW++K+Q + P+ +
Sbjct: 126 ARLVLPYGTHHSKLSIFETDSEMIHVVISTANLLQNDWDSKTQAFYHCSAPIVNGEVEEG 185
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
+ F DLI YL+ ++ G + +FS R+I+S+PGYH G
Sbjct: 186 QNNFRKDLISYLNAY------SSSSDFGMIEYWRDRIANADFSDVNARIISSIPGYHVGD 239
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK---WM-AELSSSMSSGFSE 420
++GH++LR VL+ + KK V QFSS+GSL K W+ A+ S++ G
Sbjct: 240 QKDRYGHLRLRRVLRSLQLD--LKKPSFVAQFSSIGSLGPKPDSWLTAQFLQSLAGGIPV 297
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 479
++ L + ++P VEDVR S+EGY AG A+P + + +L + KW+ GR
Sbjct: 298 PESSLRL-----IYPCVEDVRNSVEGYMAGGALPYQRNTAARQPYLLERMHKWRCERFGR 352
Query: 480 SRAMPHIKTFARYNGQK------------LAKAAWGALQKNNSQLMIRSYELGVLILPSA 527
+RAMPHIK+++ ++ + L+KAAWG LQK SQL IRSYELGVL+
Sbjct: 353 TRAMPHIKSYSAFSDGRCLPSWLLITSANLSKAAWGELQKKESQLAIRSYELGVLL---- 408
Query: 528 KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELP 587
T+ +Q +PY++P
Sbjct: 409 ----------------------TDEDSLQL------------------------LPYDMP 422
Query: 588 PQRYSSEDVPWSWDKRYTKKDVYGQVWP 615
++ D PW D YTK D++G WP
Sbjct: 423 LTKFEPGDQPWVCDDTYTKPDIHGATWP 450
>gi|209879936|ref|XP_002141408.1| tyrosyl-DNA phosphodiesterase family protein [Cryptosporidium muris
RN66]
gi|209557014|gb|EEA07059.1| tyrosyl-DNA phosphodiesterase family protein [Cryptosporidium muris
RN66]
Length = 513
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 135/493 (27%), Positives = 228/493 (46%), Gaps = 110/493 (22%)
Query: 181 VSIRDVIQGD-------------IIVAILSNYMVDIDWLLPAC---PVLAKIPHVLVIHG 224
+SI+D+ + D I ++S+Y++DI WL + K+ +L+IHG
Sbjct: 48 LSIKDIFRADCEYCFDGEQDSWLIQDLLVSSYIIDIKWLFKEVRLNKIDEKLNRLLIIHG 107
Query: 225 ES---DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG----------VRII 271
S D T E N N+ + P +P+ +G H K ++L + + +R++
Sbjct: 108 GSCNLDDTTEIQILNIAKNYEIQCPTMPLPYGVFHPKFLILKFSKQDPIIKKEESFIRLV 167
Query: 272 VHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE---CGFENDLIDYL-STLKWPEFSAN 327
+ TAN + DW K+Q +W+QDF L + +N + + C + ++++ S ++ +F ++
Sbjct: 168 ITTANFLESDWKFKTQAVWVQDFLLANNSNGAMKNPFCEYFGMFLNHIISKIEHKKFWSD 227
Query: 328 LPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE------- 380
L K++++ +A V L+ASVPGYH G ++K WGH++++ +++
Sbjct: 228 L------------IKQYDYDNATVDLVASVPGYHKGENMKLWGHLRMKEIMKYKTDLNST 275
Query: 381 ---------CTFEK-----GFKKSPLVYQFSSLGSLDEKWMA-ELSSSMSSGFSEDKTPL 425
C E+ +S ++ QFSSLG EKW+ E S+++ +E T
Sbjct: 276 LNIEQPNRICKVEQYNNEYRHVESRIICQFSSLGKFSEKWLTQEFGDSLNTCINEYTTKS 335
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----RSR 481
+V+PT E V SLEG G +IP N+ K ++ K W + R
Sbjct: 336 SFE---LVYPTAEQVYKSLEGIYGGGSIPVKHNNITKSWISKILHLWGSGTLSNPSIRDL 392
Query: 482 AMPHIKTFAR----------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
++PHIKTF R Y L AAWG LQ N +Q+ IR+YELGV+I P
Sbjct: 393 SVPHIKTFLRYLWNSDRKTVSIPWIFYGSHNLGPAAWGQLQNNQTQMCIRNYELGVIITP 452
Query: 526 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 585
+ + I++ T + TK+ T S+ + VP+
Sbjct: 453 YTLYNNVKY----------IRTKRNRTPKFIWTKMET----------KSTPNYNIRVPFS 492
Query: 586 LPPQRYSSEDVPW 598
+PP +Y + D PW
Sbjct: 493 IPPIQYKTNDTPW 505
>gi|71423941|ref|XP_812626.1| tyrosyl-DNA phosphodiesterase (Tdp1) [Trypanosoma cruzi strain CL
Brener]
gi|70877431|gb|EAN90775.1| tyrosyl-DNA phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 163/540 (30%), Positives = 256/540 (47%), Gaps = 97/540 (17%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNYMVDIDWLLPAC 210
+KL F + RV G+ N S +++ D++ D+ +L+NYM+DI+WL+
Sbjct: 2 NKLLCPFWVNRVDGISV-DNPSALTLGDLLYCDVNDQEEVWSYVLLANYMIDIEWLVRVA 60
Query: 211 PVLAKIPH-VLVIHGE--------SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
P L + + ++ GE S ++K K + +P LP+ FG HHSK +L
Sbjct: 61 PSLLQTKQQIFIVSGEKEYEKKIQSSFLFRYIKAEKVR---IVEPKLPLPFGVHHSKLVL 117
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSEECG-------F 308
+ G+R+ V TAN I DW KSQG+++QDFP K DQ NL+ G F
Sbjct: 118 CVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQNSPKTDQANLTFSAGNEIRGNKF 177
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
+N+L+ YL+ + N A I + F + +FS+ V +I S+PGYH + +
Sbjct: 178 KNELLRYLNCYGIISNTENTVA-----IPSTLFDEIDFSAVCVEIITSIPGYHRYTDIHS 232
Query: 369 WGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGF----SEDK 422
+G ++ VL E + L++QFSS G L ++ L ++MS+ + +K
Sbjct: 233 FGLGRIPKVLHSIDTELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMSTEWKSIEEANK 292
Query: 423 TPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-- 478
PL PL IV+PT +VR SLEG+ G ++P + ++ + +W G
Sbjct: 293 KPL---RPLVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINRRLHRWGQGTRGLC 348
Query: 479 -----RSRAMPHIKTFARYNGQK------------LAKAAWGALQKNNSQLMIRSYELGV 521
R RA+PH+KT+ R N +K L++AAWG QK QL IRSYE GV
Sbjct: 349 KIEFLRRRALPHLKTYMRLNEKKDGIKWFILTSANLSRAAWGEWQKKGDQLAIRSYEFGV 408
Query: 522 LILPS---AKRHGCGFSCTSNI---VPSEIKS-GSTETSQIQKTKLVTLTWHGSSDAGAS 574
+ + G FS T + +PS ++ G E Q K + + G S
Sbjct: 409 VYGKGSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK-------QNIEEGPS 461
Query: 575 SEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDVYGQVWPRHFQL 620
+ Y P+ PY ++ QR +++D+PW D + KDV+G+ R +L
Sbjct: 462 LFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDVFGKEIHRAMEL 521
>gi|397627380|gb|EJK68455.1| hypothetical protein THAOC_10361 [Thalassiosira oceanica]
Length = 656
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 146/501 (29%), Positives = 229/501 (45%), Gaps = 108/501 (21%)
Query: 195 ILSNYMVDIDWLLP-ACPVLAKIPHVLVIHGES-----------DGTLEHMKR------- 235
I+ NY++D +L A P L + V+V +G S + LE R
Sbjct: 181 IICNYLIDFSYLFQRASPELLQFQRVVVFYGTSGQACPAVMRQWERLLEGTGRTVAFVQL 240
Query: 236 --NKPANWILHKPPLPISFGTHHSKAMLLIYP------RGVRIIVHTANLIHVDWNNKSQ 287
+ P N + P+ I +G HH+K L+ Y + +HT+N++H D KSQ
Sbjct: 241 LPSDPPNSRANPLPVKIEYGVHHTKMFLMGYEDEESGISKCHVSIHTSNILHSDAELKSQ 300
Query: 288 GLWMQDFPLK------DQNNLSEECG-------FENDLIDYLSTLKWPEFSANLPAHGNF 334
G++ QDFPLK N S+E FE+DL+ Y+ + ++ + + +F
Sbjct: 301 GVYAQDFPLKVAPGKSTGNPYSKEEDASKTPRQFEDDLVTYMESYRYQARQSWCSSSASF 360
Query: 335 KINPS------FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR-TVLQECTFEKGF 387
++ + ++FS+A LI SVPG H + + ++G++KLR V+Q +
Sbjct: 361 GLSNQPMTILQLIRAYDFSTAYCVLIPSVPGRHRANDMHEYGYLKLRKAVIQHA---RSQ 417
Query: 388 KKSPLVYQFSSLGSLDEKWMAELSSSMSSGF--------SEDKTPLGIGEPL----IVWP 435
SPL+ QFSSLGSL+ KW+++ S + S S+ K G + IVWP
Sbjct: 418 TNSPLLLQFSSLGSLNGKWLSQFLSCLDSSAQSFDPVTESDKKKSKGTSDLASRMKIVWP 477
Query: 436 TVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR------SRAMPHIKTF 489
+VE+VR +EGY+ G AIP KN++K FL + +W + + S+ PHIKTF
Sbjct: 478 SVEEVRTCVEGYSGGGAIPGRTKNLEKAFLMPLYHRWSSRNPNNEGPLKTSKHAPHIKTF 537
Query: 490 AR------------YNGQKLAKAAWGALQKNNSQ-----LMIRSYELGVLILPSAKRHGC 532
+ L+ AA G +QK + L IR +ELGV I P +
Sbjct: 538 VQPSSDGTEIEWMLLGSHNLSIAALGQIQKRHKDSSEKILFIRHWELGVFISPRTLKQAG 597
Query: 533 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 592
+ K VTL + + SE V +P+PY+L P Y+
Sbjct: 598 NYD----------------------GKDVTLVPYRGGGMSSGSE-VQVPLPYDLNPTPYN 634
Query: 593 SEDVPWSWDKRYTKKDVYGQV 613
+EDV W+ D+ D +G++
Sbjct: 635 NEDVTWAVDRTTFLPDRFGRI 655
>gi|340055492|emb|CCC49811.1| putative tyrosyl-DNA Phosphodiesterase (Tdp1) [Trypanosoma vivax
Y486]
Length = 548
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 160/525 (30%), Positives = 234/525 (44%), Gaps = 93/525 (17%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNYMVDIDWLLPACPVLAKI 216
F + R++ LP + S + + D++ D +L+NY++D +WLL P +
Sbjct: 6 FWVNRIKALPT-ESPSAIRLGDILHCDAENPDERWTHVVLANYLIDPEWLLRVAPAITCT 64
Query: 217 PHVLVIHGESDGTLEHMKRNKPANWI------LHKPPLPISFGTHHSKAMLLIYPRGVRI 270
L I G H + A + + +PP+P+ FG HH+K +L I RG+R+
Sbjct: 65 SRQLFIITGERGFAHHFASSTMAAHMGAGRVTVIEPPMPLPFGVHHTKLVLGINSRGLRV 124
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFP-----------LKDQNNLSEECG--FENDLIDYLS 317
V TAN I DW+ K+QG++MQDFP L E G F ++L YL
Sbjct: 125 AVLTANFIEEDWDMKAQGIYMQDFPRSLTPDKEGRYTAQSATLQEGRGERFRSELRRYLH 184
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV 377
+ + +G I PS F +FSSA+V LIASVPGYH G +G +L V
Sbjct: 185 S-----YGLLSDENGLKGIPPSHFDGIDFSSASVELIASVPGYHRGGEAYSFGMGRLLKV 239
Query: 378 LQECTFEKGFK--KSPLVYQFSSLGSLDEKWMAELSSSMSSGF---SEDKTPLGIGEP-- 430
+Q K L +QFSS G L EK++ L +M + D+ P EP
Sbjct: 240 VQSVQMGPILDGGKPILTWQFSSQGLLTEKFLKSLEDAMLGNHAVGATDRRP----EPEV 295
Query: 431 LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG---------RSR 481
+V+PT +V+ SLEG+ G ++P + ++ +W H G R R
Sbjct: 296 RVVYPTESEVKNSLEGWRGGMSLPV-RLRCCHPYINARMHRW--CHRGVSEAVNKPVRGR 352
Query: 482 AMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKR 529
AMPH+KT+ R L++AAWG Q+N SQL IRSYELGVL S
Sbjct: 353 AMPHLKTYMRLAEGEDSLHWFLLTSANLSRAAWGEWQRNGSQLAIRSYELGVL-YDSKSF 411
Query: 530 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH-GSSDAGASSEVVYLPV------ 582
C + PS S ++ L+ L G++D + V++LP
Sbjct: 412 INCAEGELFVVTPSR---RIPLPSSVEGDGLLRLHIRAGANDIIGEAPVLFLPYDALHPE 468
Query: 583 PYELPPQR---------------YSSEDVPWSWDKRYTKKDVYGQ 612
PYE Q S++DVPW D + +D G+
Sbjct: 469 PYESTLQLRKNHGSSVENESHAPLSTKDVPWVVDAPHHGRDALGK 513
>gi|407867395|gb|EKG08563.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 162/540 (30%), Positives = 255/540 (47%), Gaps = 97/540 (17%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNYMVDIDWLLPAC 210
+KL F + RV G+ N S +++ D++ D+ +L+NYM+DI+WL+
Sbjct: 2 NKLLCPFWVNRVDGISV-DNPSALTLGDLLYCDVNDQEEVWSYVLLANYMIDIEWLVRVA 60
Query: 211 PVLAKIPH-VLVIHGE--------SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
P L + + ++ GE S ++K K + +P LP+ FG HHSK +L
Sbjct: 61 PSLLQTKQQLFIVSGEKEYEKKIQSSFLFRYIKAKKIR---IVEPKLPLPFGVHHSKLVL 117
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSEECG-------F 308
+ G+R+ V TAN I DW KSQG+++QDFP K D+ NL+ G F
Sbjct: 118 CVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQNSPKTDRANLTFSAGNEIRGNNF 177
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
+N+L+ YL+ + N A I + F + +FS+ V +I S+PGYH + +
Sbjct: 178 KNELLRYLNCYDIISNTENTEA-----IPSTLFDEIDFSAVCVEIITSIPGYHRYTDIHS 232
Query: 369 WGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGF----SEDK 422
+G ++ VL E + L++QFSS G L ++ L ++MS+ + +K
Sbjct: 233 FGLGRIPKVLHSIDTELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMSTEWKSIEEANK 292
Query: 423 TPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-- 478
PL PL IV+PT +VR SLEG+ G ++P + ++ +W G
Sbjct: 293 KPL---RPLVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINGRLHRWGQGTRGLC 348
Query: 479 -----RSRAMPHIKTFARYNGQK------------LAKAAWGALQKNNSQLMIRSYELGV 521
R RA+PH+KT+ R N +K L++AAWG QK QL IRSYE GV
Sbjct: 349 KIEFLRRRALPHLKTYMRLNEKKDGIKWFILTSANLSRAAWGEWQKKGDQLAIRSYEFGV 408
Query: 522 LILPS---AKRHGCGFSCTSNI---VPSEIKS-GSTETSQIQKTKLVTLTWHGSSDAGAS 574
+ + G FS T + +PS ++ G E Q K + + G S
Sbjct: 409 VYGKGSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK-------QNIEEGPS 461
Query: 575 SEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDVYGQVWPRHFQL 620
+ Y P+ PY ++ QR +++D+PW D + KDV+G+ R +L
Sbjct: 462 LFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDVFGKEIHRAMEL 521
>gi|71404281|ref|XP_804861.1| tyrosyl-DNA Phosphodiesterase (Tdp1) [Trypanosoma cruzi strain CL
Brener]
gi|70868036|gb|EAN83010.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 158/532 (29%), Positives = 253/532 (47%), Gaps = 97/532 (18%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNYMVDIDWLLPAC 210
+KL F + RV G+ N S +++ D++ D+ +L++YM+DI+WL+
Sbjct: 2 NKLLCPFWVNRVDGISV-DNPSALTLGDLLYCDVNDQEEVWSYVLLASYMIDIEWLVRVA 60
Query: 211 PVLAKIP-HVLVIHGE--------SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
P L + + ++ GE S ++K K + +P LP+ FG HHSK +L
Sbjct: 61 PSLLQTKKQLFIVSGEKEYEKKIQSSFLFRYIKAEKVR---IVEPKLPLPFGVHHSKLVL 117
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSEECG-------F 308
+ G+R+ V TAN I DW KSQG+++QDFP K D+ NL+ G F
Sbjct: 118 CVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQTSPKTDRANLTFSAGNEIRGNKF 177
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
+N+L+ YL+ + N A I + F + +FS+ V +I S+PGYH + +
Sbjct: 178 KNELLRYLNCYGIISNTENTVA-----IPSTLFDEIDFSAVCVEIITSIPGYHRYTDIHS 232
Query: 369 WGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE----DK 422
+G ++ VL E + L++QFSS G L ++ L ++MS+ + +K
Sbjct: 233 FGLGRIPKVLHSIDMELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMSTEWKSIEEANK 292
Query: 423 TPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-- 478
PL P+ IV+PT +VR SLEG+ G ++P + ++ + +W G
Sbjct: 293 KPL---RPVVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINRRLHRWGQGTRGLC 348
Query: 479 -----RSRAMPHIKTFARYNGQK------------LAKAAWGALQKNNSQLMIRSYELGV 521
R RA+PH+KT+ R +K L++AAWG QK QL IRSYE GV
Sbjct: 349 KMEFLRRRALPHLKTYMRLTEKKDGIKWFILTSANLSRAAWGEWQKKGDQLAIRSYEFGV 408
Query: 522 LILPS---AKRHGCGFSCTSNI---VPSEIKS-GSTETSQIQKTKLVTLTWHGSSDAGAS 574
+ S + G FS T + +PS ++ G E Q K + + G S
Sbjct: 409 VYGKSSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK-------QNIEKGPS 461
Query: 575 SEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDVYGQ 612
+ Y P+ PY ++ QR +++D+PW D + KDV+G+
Sbjct: 462 LFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDVFGK 513
>gi|343477672|emb|CCD11565.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 548
Score = 178 bits (452), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 128/375 (34%), Positives = 194/375 (51%), Gaps = 61/375 (16%)
Query: 194 AILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH-------- 244
IL Y++D++WL P+L +++I GE G L +K + +LH
Sbjct: 43 VILGGYVIDVEWLFRVSGPLLMSKCTIVLISGEK-GFL-----HKYRHLVLHDRFGRNRV 96
Query: 245 ---KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-LKDQN 300
+P LPI FG HHSK ML I G+R+ V TAN I DWN K+QG++ QDFP LK Q+
Sbjct: 97 KIVEPCLPIPFGVHHSKMMLCINNNGIRVAVLTANFIEDDWNYKTQGIYFQDFPRLKTQS 156
Query: 301 -----NLSEECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
N+S G F N++ YLS + ++++P G + S +F+FS A V
Sbjct: 157 ENIVLNISSIEGKGMRFRNEIKRYLSCIG---VASSMPKDGCIPL--SLLDEFDFSGACV 211
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP--LVYQFSSLGSLDEKWMAE 409
LIASVPGYH S + +G KL+++LQ ++P L +QF+S G L ++
Sbjct: 212 ELIASVPGYHRCSDAQHYGLGKLKSILQSMQLPSSLDRNPPVLTWQFTSQGLLTANFLNS 271
Query: 410 LSSSMSSGFSEDKTPLGIG--EPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL 465
+ MS + + P G +P+ +V+PT +V+ SLEG+ G ++P + ++
Sbjct: 272 MKQIMS---IDARNPTGEDKMDPVVRVVYPTETEVKNSLEGWRGGLSLPVTLRCC-HSYI 327
Query: 466 KKYWAKWKASHTG------RSRAMPHIKTFARYNGQK------------LAKAAWGALQK 507
+ +W G RS+ +PH+KT+ R + L++AAWG Q
Sbjct: 328 NERLFRWGTVPQGSEVENERSKGLPHLKTYTRLTESEDGLSWFLLTSANLSRAAWGEWQH 387
Query: 508 NNSQLMIRSYELGVL 522
+QL+IRSYELGVL
Sbjct: 388 GGTQLLIRSYELGVL 402
>gi|407394035|gb|EKF26770.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi
marinkellei]
Length = 551
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 152/533 (28%), Positives = 247/533 (46%), Gaps = 100/533 (18%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDII-------VAILSNYMVDIDWLLPAC 210
+KL F + RV G+ N S +++ D++ D+ +L++YM+DI+WL+
Sbjct: 2 NKLLCPFWVNRVDGISV-DNPSALTLGDLLYCDVNDQEEVWNYVLLASYMIDIEWLVCVA 60
Query: 211 PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH---------KPPLPISFGTHHSKAML 261
P L + L I G E+ K+ + ++ + +P LP+ FG HHSK +L
Sbjct: 61 PSLLQTKQKLFI---VSGEKEYEKKIQSSSLFAYIKAEKVRIVEPKLPLPFGVHHSKLVL 117
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSEECG-------F 308
+ +G+R+ V TAN I DW KSQG+++QDFP + D+ NL+ G F
Sbjct: 118 CVNAKGIRVAVLTANFIQDDWVCKSQGIYVQDFPRRQNLPKTDRANLTFSAGSEIRGSEF 177
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
+N+L+ YL+ + A I + F + +FS+A V +I S+PGY+ + +
Sbjct: 178 KNELLRYLNC-----YGIISNAENTVAIPSTLFDEIDFSAACVEIITSIPGYYRYNDVHS 232
Query: 369 WGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFSEDK 422
+G ++ VL E + L++QFSS G L ++ L ++MS S +K
Sbjct: 233 FGLGRIPKVLHSIDMELSDSIQVPLLIWQFSSQGKLTNSFLVALENAMSTEGKSNEEANK 292
Query: 423 TPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-- 478
PL P+ IV+PT +V+ SLEG+ G ++P + ++ + +W G
Sbjct: 293 KPLC---PVVQIVYPTESEVKESLEGWRGGLSLPLRLSSCHP-YINRRLHRWGQGTRGTC 348
Query: 479 ----RSRAMPHIKTFARYNGQK------------LAKAAWGALQKNNSQLMIRSYELGVL 522
R RA+PH+KT+ R +K L++AAWG QK +QL IRSYE GV+
Sbjct: 349 KIELRRRALPHLKTYMRLTEKKDGIKWLILTSANLSRAAWGEWQKKGNQLAIRSYEFGVV 408
Query: 523 ILPS---AKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 576
+ G FS T + +PS ++ I + G
Sbjct: 409 YGKDSFISFLEGEPFSVTPSRKIPLPSLVEGDGLAEVHIDQ---------GGKKDIEEGP 459
Query: 577 VVYLPV-PYELPP---------QR-------YSSEDVPWSWDKRYTKKDVYGQ 612
++LP P L P QR +++D+PW D + KDV+G+
Sbjct: 460 TLFLPYDPLHLEPYASTVQMQNQRGNNCDSSINTDDIPWVIDMPHFGKDVFGK 512
>gi|67609723|ref|XP_667058.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54658157|gb|EAL36834.1| hypothetical protein Chro.70273 [Cryptosporidium hominis]
Length = 511
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 135/448 (30%), Positives = 216/448 (48%), Gaps = 76/448 (16%)
Query: 195 ILSNYMVDIDWLLPACP----VLAKIPHVLVIHGESDGTLEHMKRNKPANWIL----HKP 246
+ S+Y+ D++W++ + I +L + D + +N + P
Sbjct: 92 LFSSYLADVNWVINEIGDSELICENIESILFVSHGFDNPQNYKLKNCNKMKNKKISTYSP 151
Query: 247 PLPISFGTHHSKAMLLIY-----PRG--VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ 299
L + +G H K +LL++ P+ VR +V +ANLI DW K Q +W+QDF
Sbjct: 152 YLKVPYGVFHPKFILLVFEHLVQPKKNFVRFVVTSANLIQQDWELKIQSIWVQDF---FH 208
Query: 300 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSAAVRLIASVP 358
N ++C F +DYL EF N+ K S ++FNF A V+L+ASVP
Sbjct: 209 NIERKDCEF----LDYLQ-----EFLKNILNGSKLKDFWLSKVQEFNFEDATVKLVASVP 259
Query: 359 GYHTGSSLKKWGHMKLRTVL-------QECTFEKGFKKSPLVYQFSSLGSLDEKWM-AEL 410
GY G + WGH+++R+++ Q + E G K+ ++ QFSSLG + EKW+ EL
Sbjct: 260 GYFFGDEMFMWGHLRVRSLIKRFVSKDQGKSDELGEKRERIILQFSSLGRISEKWLYTEL 319
Query: 411 SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW 469
+SS+S + P G L I++PTVE V S+EG G ++P ++ + K ++KK
Sbjct: 320 ASSLS------EIP---GTKLEIIFPTVEQVVNSIEGINGGGSLPVKKEYICKPWIKKLL 370
Query: 470 AKWKASHTGRS----RAMPHIKTFARYN--------------GQKLAKAAWGALQKNNSQ 511
KW ++ + +PHIKTF +Y L+ AAWG +QK+ SQ
Sbjct: 371 HKWGTGTMEKNATDEKVIPHIKTFLKYKIFDNAIKIIWLVQGSYNLSNAAWGQIQKDGSQ 430
Query: 512 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 571
IR+YELG+ I H F +E E + + ++ +A
Sbjct: 431 FCIRNYELGIFI------HKDQFEFERYFKLNE------EFPKFFWKRKSNFSFISEINA 478
Query: 572 GASSEVVYLPVPYELPPQRYSSEDVPWS 599
++ P+P++LPP+RYS+ D PW+
Sbjct: 479 NKPIRLLNFPLPFKLPPKRYSNSDHPWN 506
>gi|219113113|ref|XP_002186140.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582990|gb|ACI65610.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 305
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 108/304 (35%), Positives = 166/304 (54%), Gaps = 30/304 (9%)
Query: 250 ISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQNNLSEE 305
I +G HHSK L+ Y + +RII+HTAN+ + D + K+Q + QDF LK + N++
Sbjct: 1 IPYGVHHSKFFLVGYADQSLRIIIHTANIRYDDIHCKAQAAFFQDFGLKSPENFTNVANT 60
Query: 306 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 365
C FE DLIDYL + ++ + K F ++++FSSA L+ S PGYH
Sbjct: 61 CEFEEDLIDYLDSYRYTRLHKWTKSGSKTKSLGQFVREYDFSSAKAVLVPSTPGYHRLDE 120
Query: 366 LKKWGHMKLRTVLQECTF--EKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKT 423
+ GH K+R + T E+ P+V QFSS+GSL E+++ EL +SM S D+
Sbjct: 121 KHRRGHWKMRQTIPSHTEAPEEETICDPIVCQFSSIGSLTERYLLELQTSMDMKQSRDRG 180
Query: 424 PLGIGE--PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG--- 478
G E +V+PTVE++R S+EGY G ++P +NV K FLK+ + +W A +
Sbjct: 181 RPGRLELSLKLVYPTVEEIRTSVEGYRGGGSVPGTMRNVGKPFLKRLFCRWSALSSSDMN 240
Query: 479 ---RSRAMPHIKTFARYN------------GQKLAKAAWGALQKNN----SQLMIRSYEL 519
+ R +PH+KT+ + N L+KAAWG +Q ++ +L +R +EL
Sbjct: 241 PLWKGRNVPHMKTYFQTNSTTETLHWFVLTSHNLSKAAWGEIQTSSRYGGRRLFVRHWEL 300
Query: 520 GVLI 523
GV +
Sbjct: 301 GVFL 304
>gi|357630668|gb|EHJ78636.1| hypothetical protein KGM_17628 [Danaus plexippus]
Length = 581
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 135/452 (29%), Positives = 211/452 (46%), Gaps = 77/452 (17%)
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLA-KIPHVLVIHGESDGTLEHMKRNK 237
+ + I D G++ ++ N+MVD WLL + +++GE L ++ K
Sbjct: 181 TLLEILDSSLGELKCSLQINFMVDAGWLLAHYYFAGYSAKKLTILYGEESAELRNISAKK 240
Query: 238 PANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ---- 292
P N H+ + FG HH+K MLL Y G +R++V TANL DW N++QGLW+
Sbjct: 241 P-NVEAHQVKMATPFGKHHTKMMLLCYEDGSLRVVVSTANLYMDDWENRTQGLWLSPSCP 299
Query: 293 DFPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P + ++ E GF+ L+DYL + P+ + + ++ +FS V
Sbjct: 300 QLPAESPSHSGESPTGFKRSLLDYLHHYRLPQLAVYV----------HRVQRCDFSHINV 349
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQ-ECTFEKGFKKS-PLVYQFSSLGSLDEKWMAE 409
L+ SVPG H +S WG +++ +L+ C +S PL+ Q SSLGS + +
Sbjct: 350 FLVCSVPGTHYSAS---WGFLRVGALLRAHCAVPPQETRSWPLIAQASSLGSYGKDPGSW 406
Query: 410 LSSSMSSGFSEDK-TPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDF 464
L+ F++ K P + P +++P++E+V+ S +G G +P S +V + +
Sbjct: 407 LTGDFLHHFTKIKDQPQTLTPPPDLKLIYPSLENVKSSHDGLLGGGCLPYSAAVHVKQPW 466
Query: 465 LKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQL 512
LK + +W+A H+ R RAMPHIK++ R + ++KAAWG K+ L
Sbjct: 467 LKDFLYQWRALHSERDRAMPHIKSYTRVSPDNSRAAFYLLTSGNVSKAAWGVRNKDGG-L 525
Query: 513 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAG 572
+ SYE GVL LP F S+ P
Sbjct: 526 RLMSYEAGVLFLPR-------FVINSDFFPL----------------------------- 549
Query: 573 ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRY 604
S + LPVPY+LPPQRYS + PW D Y
Sbjct: 550 CPSSALRLPVPYDLPPQRYSPDMSPWVSDYLY 581
>gi|393910432|gb|EJD75879.1| TDP1 protein [Loa loa]
Length = 672
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 153/511 (29%), Positives = 217/511 (42%), Gaps = 102/511 (19%)
Query: 137 EQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQG-DIIVAI 195
E D+ K SE + DKL +V GL N + S ++++ + +I
Sbjct: 15 ECDDLESKGSEGKRMKQNCLMDKL----YFNKVVGLAEQYNVNAFSFAELLELISPVASI 70
Query: 196 LSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPI 250
N+M+D+ WLL P + + +I GE GT +K+ N + + L I
Sbjct: 71 HFNFMIDLRWLLTQYPGRLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVNVGRARLMI 130
Query: 251 SFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC--- 306
FGTHHSK + G V II+ TANL+ DWN K+Q F + +C
Sbjct: 131 PFGTHHSKISIFESNTGRVHIIIATANLLESDWNFKTQAF----FHCSGNELAAGDCPDR 186
Query: 307 ---GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
F+ DL+ YL K + L H +++ + S R++ SVPG H G
Sbjct: 187 NGSDFQTDLVKYLDEYKTSQ-DWGLIEHWRDRVS-----NIDLSQVKARVVYSVPGTHKG 240
Query: 364 SSLKKWGHMKLRTVLQECTFE----KGFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGF 418
L K+GH +LR +L+E + GF SLG+ + W+ + +S+S G
Sbjct: 241 VQLTKYGHPRLRVILKELFGDVKNMDGFTYHAQCSSLGSLGAAPQYWLTGQFLNSLSGGA 300
Query: 419 SEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASH 476
D GE L I++P VEDVR S EGYAAG + P S V + +L + KW + H
Sbjct: 301 ETD------GEHLRIIYPCVEDVRNSNEGYAAGGSFPYSNSVAVKQPYLLNFMHKWSSDH 354
Query: 477 TGRSRAMPHIKTFARY------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLIL 524
GRSRAMPHIKT+A + L+KAAWG Q QL IRSYE G+L
Sbjct: 355 LGRSRAMPHIKTYAAFAKDSLKPSWLLITSANLSKAAWGDYQSKKPQLTIRSYEFGLLF- 413
Query: 525 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 584
SD + + Y
Sbjct: 414 --------------------------------------------SDPESLDMLPY----- 424
Query: 585 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 615
+LP +Y D W DK Y K D++ + WP
Sbjct: 425 DLPLTKYDDNDRVWIVDKTYRKPDIFRKTWP 455
>gi|322787271|gb|EFZ13407.1| hypothetical protein SINV_04400 [Solenopsis invicta]
Length = 647
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 210/438 (47%), Gaps = 73/438 (16%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N+MVD+ WL + + +L+++G+ ++H K + +N
Sbjct: 251 ILDRSLGEIVKSLHLNFMVDVGWLCLQYLLAGQRTDMLILYGDR---VDHEKLH--SNIT 305
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKDQ 299
+ + +P FG HH+K M+L Y G+R++V TANL DW N++QGLW+ P L +
Sbjct: 306 MIEVQMPTQFGCHHTKIMILQYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPRLPES 365
Query: 300 NNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
N S+ GF+ DL YL+ ++P+ + + A ++ NFS V L+AS
Sbjct: 366 ANPSDGESPTGFKKDLERYLNKYRFPDLTQWISA----------VRRANFSDVKVFLVAS 415
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
VPG H + WGH KL VL + T + P+V Q SS+GSL + + LS +
Sbjct: 416 VPGTHKDNEADSWGHKKLAHVLSRHATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKEII 475
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
S + T P ++P++++ + S + +P S + + + +++ Y +W
Sbjct: 476 PCMSRETTKGLKSHPHFQFIYPSIDNYKQSFDCRNLSCCLPYSAKTHSKQQWIESYLYQW 535
Query: 473 KASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYELG 520
KA TGR RAMPHIK++ R + L+KAAWG +Q+NN +M SYE G
Sbjct: 536 KAKRTGRDRAMPHIKSYTRISPDLRSISWFVLTSANLSKAAWG-MQRNNHYIM--SYEAG 592
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
V+ +P K +T T + V
Sbjct: 593 VVFIP---------------------------------KFITGTTTFPIEDEEDPAVPVF 619
Query: 581 PVPYELPPQRYSSEDVPW 598
P+PY+LP RY S D P+
Sbjct: 620 PIPYDLPLCRYESSDRPF 637
>gi|261326666|emb|CBH09628.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei
gambiense DAL972]
Length = 553
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 166/548 (30%), Positives = 252/548 (45%), Gaps = 117/548 (21%)
Query: 147 EEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNY 199
E LC F VSR V GL A + S +++ D++ +I +L+NY
Sbjct: 3 ETKLCPFWVSR-----------VSGL-ATESPSALTLSDLLHCNIEDPSEVWTHVVLANY 50
Query: 200 MVDIDWL--LPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI------LHKPPLPIS 251
++D++W+ + C L+ HV+++ GE +G E + A + + KP LP+
Sbjct: 51 LIDLEWVFDMATCLQLSSC-HVMIVSGE-EGLAERYAASPLAGLLGKERVEIIKPKLPLP 108
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-----------LKDQN 300
FG HH K +L + +GVRI V TAN I DW K+QG+++QDFP +
Sbjct: 109 FGVHHGKLILCVNSKGVRISVLTANFIESDWGKKTQGIYVQDFPRLVTSSASSNSMGSLQ 168
Query: 301 NLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 358
L G F+ ++ YLS + A G I S + ++S A V L++SVP
Sbjct: 169 ALRRCRGTRFKEEIKRYLSCI-----GAISSTTGTNCIPLSLLDEVDYSGACVELVSSVP 223
Query: 359 GYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 416
G H S ++G +L+ VL+ + + G LV+QFSS G+L ++ L M+
Sbjct: 224 GCHRNSDAYRFGMGRLQEVLRAMQISSPSGENSPTLVWQFSSQGTLTSNFLRSLERVMT- 282
Query: 417 GFSEDKTPL-GIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKW- 472
S D TPL P I++PT +V+ S EG+ G ++P + ++ + +W
Sbjct: 283 -ISTDNTPLPDTKSPTVRIIYPTEAEVKGSFEGWHGGLSLPV-RLRCCHPYVNERLYRWG 340
Query: 473 -----KASHTGRSRAMPHIKTFARY--NGQ----------KLAKAAWGALQKNNSQLMIR 515
+ + GR+RAMPHIKT+ R NG L++AAWG QK +Q++IR
Sbjct: 341 QRPYAEGADRGRNRAMPHIKTYMRLTENGDGLKWFMLTSANLSRAAWGEWQKGGTQILIR 400
Query: 516 SYELGVL------ILPSAKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTWH 566
SYELGV+ I P+ G FS T + VPS I + + K+ TL
Sbjct: 401 SYELGVVYGTDSFINPA---DGGLFSATPSKPIPVPSSIGGDG-----LVRVKIKTL--- 449
Query: 567 GSSDAGASSEVVYLPVPYELPPQRY----------------------SSEDVPWSWDKRY 604
S++ ++LP L PQ Y SS DVPW D +
Sbjct: 450 -PSESDRDEPTLFLPY-NPLNPQPYVSTLQMQQRERRHTGHSCVSQLSSLDVPWLVDLPH 507
Query: 605 TKKDVYGQ 612
KD G+
Sbjct: 508 RGKDCLGK 515
>gi|66362892|ref|XP_628412.1| tyrosyl-DNA phodphodiesterase 1 (tdp1) [Cryptosporidium parvum Iowa
II]
gi|46229443|gb|EAK90261.1| tyrosyl-DNA phodphodiesterase 1 (tdp1) [Cryptosporidium parvum Iowa
II]
Length = 511
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 124/390 (31%), Positives = 192/390 (49%), Gaps = 66/390 (16%)
Query: 244 HKPPLPISFGTHHSKAMLLIY-----PRG--VRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
+ P L + +G H K +LL++ P+ VR +V +ANLI DW K Q +W+QDF
Sbjct: 149 YSPYLKVPYGVFHPKFILLVFEHLVQPKKNFVRFVVTSANLIQQDWELKIQSIWVQDFFH 208
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSAAVRLIA 355
+ ++C F +DYL EF N+ K S ++FNF A V+L+A
Sbjct: 209 SIER---KDCEF----LDYLQ-----EFLKNILNGSKLKDFWLSKVQEFNFEDATVKLVA 256
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL-------QECTFEKGFKKSPLVYQFSSLGSLDEKWM- 407
SVPGY G + WGH+++R+++ Q+ + E K+ +V QFSSLG + EKW+
Sbjct: 257 SVPGYFFGDEMFMWGHLRVRSLIKRFVSKDQDKSDELREKRERIVLQFSSLGRISEKWLY 316
Query: 408 AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK 467
EL+SS+S + E I++PTVE V S+EG G ++P ++ + K ++KK
Sbjct: 317 TELASSLSE--------IPGTELEIIFPTVEQVVNSIEGINGGGSLPVKKEYICKPWIKK 368
Query: 468 YWAKWKASHTGRS----RAMPHIKTFARYN--------------GQKLAKAAWGALQKNN 509
KW ++ + +PHIKTF +Y L+ AAWG +QK+
Sbjct: 369 LLHKWGTGTMKKNATDEKVIPHIKTFLKYKIFGNAIKIIWLVQGSYNLSNAAWGQIQKDG 428
Query: 510 SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSS 569
SQ IR+YELG+ I F P + S I +
Sbjct: 429 SQFCIRNYELGIFIHKDQFEFERYFKLNEEF-PKFFWKRKSNCSLISEI----------- 476
Query: 570 DAGASSEVVYLPVPYELPPQRYSSEDVPWS 599
+A + ++ P+P++LPP+RYS+ D PW+
Sbjct: 477 NANQPNVLLNFPLPFKLPPKRYSNSDHPWN 506
>gi|84043866|ref|XP_951723.1| tyrosyl-DNA phosphodiesterase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|33348708|gb|AAQ16032.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
gi|62358538|gb|AAX78999.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei]
Length = 553
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 166/548 (30%), Positives = 252/548 (45%), Gaps = 117/548 (21%)
Query: 147 EEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNY 199
E LC F VSR V GL A + S +++ D++ +I +L+NY
Sbjct: 3 ETKLCPFWVSR-----------VSGL-ATESPSALTLSDLLHCNIEDPSEVWTHVVLANY 50
Query: 200 MVDIDWL--LPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI------LHKPPLPIS 251
++D++W+ + C L+ HV+++ GE +G E + A + + KP LP+
Sbjct: 51 LIDLEWVFDMATCLQLSNC-HVMIVSGE-EGLAERYAASPLAGLLGKERVEIIKPKLPLP 108
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-----------LKDQN 300
FG HH K +L + +GVRI V TAN I DW K+QG+++QDFP +
Sbjct: 109 FGVHHGKLILCVNSKGVRISVLTANFIESDWGKKTQGIYVQDFPRLVTSSASSNSMGSLQ 168
Query: 301 NLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 358
L G F+ ++ YLS + A G I S + ++S A V L++SVP
Sbjct: 169 ALRRCRGTRFKEEIKRYLSCI-----GAISSTTGTNCIPLSLLDEVDYSGACVELVSSVP 223
Query: 359 GYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 416
G H S ++G +L+ VL+ + + G LV+QFSS G+L ++ L M+
Sbjct: 224 GCHRNSDAYRFGMGRLQEVLRAMQISSPSGENSPTLVWQFSSQGTLTSNFLRSLERVMT- 282
Query: 417 GFSEDKTPL-GIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKW- 472
S D TPL P I++PT +V+ S EG+ G ++P + ++ + +W
Sbjct: 283 -ISTDNTPLPDTKSPTVRIIYPTEAEVKGSFEGWHGGLSLPV-RLRCCHPYVNERLYRWG 340
Query: 473 -----KASHTGRSRAMPHIKTFARY--NGQ----------KLAKAAWGALQKNNSQLMIR 515
+ + GR+RAMPHIKT+ R NG L++AAWG QK +Q++IR
Sbjct: 341 QRPYAEGADRGRNRAMPHIKTYMRLTENGDGLKWFMLTSANLSRAAWGEWQKGGTQILIR 400
Query: 516 SYELGVL------ILPSAKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTWH 566
SYELGV+ I P+ G FS T + VPS I + + K+ TL
Sbjct: 401 SYELGVVYGTDSFINPA---DGGLFSATPSKPIPVPSSIGGDG-----LVRVKIKTL--- 449
Query: 567 GSSDAGASSEVVYLPVPYELPPQRY----------------------SSEDVPWSWDKRY 604
S++ ++LP L PQ Y SS DVPW D +
Sbjct: 450 -PSESDRDEPTLFLPY-NPLNPQPYVSTLQMQQREHRHTGHSCVSQLSSLDVPWLVDLPH 507
Query: 605 TKKDVYGQ 612
KD G+
Sbjct: 508 RGKDCLGK 515
>gi|170587939|ref|XP_001898731.1| Tyrosyl-DNA phosphodiesterase family protein [Brugia malayi]
gi|158592944|gb|EDP31539.1| Tyrosyl-DNA phosphodiesterase family protein [Brugia malayi]
Length = 454
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 122/357 (34%), Positives = 172/357 (48%), Gaps = 36/357 (10%)
Query: 192 IVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA-----NWILHKP 246
+ +I N+M+D+ WLL P + + +I GE GT + R N + +
Sbjct: 67 VASIHFNFMIDLRWLLEQYPARLRQGPITLIVGERMGTDFTLTRTAVKQCGVNNVTVGRA 126
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 305
L I FGTHHSK + G V I++ TANL+ DWN K+Q + + +N
Sbjct: 127 RLMIPFGTHHSKISIFESSTGRVHIVISTANLLENDWNFKTQAFYHCSGIERSADNRCNP 186
Query: 306 CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
G F+ D + YL+ K + G + N S R++ SVPG H G
Sbjct: 187 NGSDFQADFVKYLNEYKTSQ------DWGLIEYWRDRVASINLSHVKARIVYSVPGAHKG 240
Query: 364 SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFS 419
L K+GH +LR +L+E + QFSSLGSL + W+ + +S++ G
Sbjct: 241 VQLTKYGHPRLRVILKELFGNVKMDEFTYHVQFSSLGSLGAAPQYWLTGQFLNSLAGGAE 300
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTG 478
D L I++P VEDVR S EGY AG + P V + +L + KW+++H G
Sbjct: 301 TDGKHL-----RIIYPCVEDVRNSNEGYQAGGSFPYNNSVAVKQPYLLDFMYKWRSNHLG 355
Query: 479 RSRAMPHIKTFARY------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLI 523
RSRAMPHIKT+A + L+KAAWG Q +QL IRSYE GVL
Sbjct: 356 RSRAMPHIKTYAAFAKNSLKPLWLLVTSANLSKAAWGDYQLKKTQLTIRSYEFGVLF 412
>gi|402592672|gb|EJW86599.1| tyrosyl-DNA phosphodiesterase [Wuchereria bancrofti]
Length = 453
Score = 172 bits (437), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 124/357 (34%), Positives = 172/357 (48%), Gaps = 36/357 (10%)
Query: 192 IVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKP 246
+ +I N+M+D+ WLL P + + +I GE GT +K+ N I+ +
Sbjct: 66 VASIHFNFMIDLRWLLEQYPARLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVIVGRA 125
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 305
L I FGTHHSK + G V I++ TANL+ DWN K+Q + +N
Sbjct: 126 RLMIPFGTHHSKISIFESSTGRVHIVISTANLLENDWNFKTQAFYHCSGIELSADNRCNP 185
Query: 306 CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
G F+ D + YL+ K + G + N S R++ SVPG H G
Sbjct: 186 NGSDFQADFVKYLNEYKTSQ------DWGLIEYWRDRVASINLSHVKARVVYSVPGAHKG 239
Query: 364 SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFS 419
L K+GH +LR +L+E + QFSSLGSL + W+ + +S+S G
Sbjct: 240 VQLTKYGHPRLRVILKELFGNVKMDEFTYHAQFSSLGSLGAAPQYWLTGQFLNSLSGGAE 299
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTG 478
D L I++P VEDVR S EGY AG + P V + +L + KW++ H G
Sbjct: 300 TDGKHL-----RIIYPCVEDVRNSNEGYQAGGSFPYNNSVAVKQPYLLDFMHKWRSDHLG 354
Query: 479 RSRAMPHIKTFARY------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLI 523
RSRAMPHIKT+A + L+KAAWG Q +QL IRSYE GVL
Sbjct: 355 RSRAMPHIKTYAAFAKNSLKPLWLLVTSANLSKAAWGNYQLKKTQLTIRSYEFGVLF 411
>gi|299115351|emb|CBN74172.1| Tyrosyl-DNA phosphodiesterase [Ectocarpus siliculosus]
Length = 607
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 154/514 (29%), Positives = 232/514 (45%), Gaps = 117/514 (22%)
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPA-WAN 177
N +NG S K ++ DN+ + +K P +RLL P+ A+
Sbjct: 39 NSSNSNGGTSQSKRPASEQGDNKTPSQRKGKRPRSFQPFEK-PPLYRLLSTS--PSDRAS 95
Query: 178 TSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK-RN 236
T V + D++ GD A+L NYMVD L+ P L +P V ++HG GT + + R+
Sbjct: 96 TGSVGLDDLLSGDFESALLCNYMVDYALLVRCAPRLGSVP-VTIVHGFKPGTQDEVNLRS 154
Query: 237 KPA---NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD 293
+ A L P LP +GT+H+K ++L +P G+R+ V TAN I VD +KSQG+W QD
Sbjct: 155 QCAVNPGVKLRYPELP-EYGTNHAKMIILKFPTGIRVAVLTANFIVVDVTDKSQGVWYQD 213
Query: 294 FPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
FP + S C F+ DL+ +L F PA S +++F A V L
Sbjct: 214 FPKR----TSGSCAFQEDLMGFL-------FKVGGPASA----FASTLGEYDFRGARVAL 258
Query: 354 IASVPGY-----------HTGSSLKKWGHMKLRTVLQE-------CTFEKGFKKSPLVYQ 395
+ SVPG H G L K+GHM++R +L ++G K ++ Q
Sbjct: 259 VPSVPGTGGNTPGTGGKPHKGRDLHKYGHMRVRALLAREKEDGTGAKLKEGGHK--VLCQ 316
Query: 396 FSSLGSLDE---KWMAELSSSM-------------SSGFSEDKTPLGIGEP--LIVWPTV 437
SSL SL + +W++E+ +S SED+ + E +VWP+V
Sbjct: 317 ISSLASLTKTPNRWLSEILASFMPLEDEGKKAEPTRRSVSEDEAQATLLEQHLRVVWPSV 376
Query: 438 EDVRCSLEGYAAGNAI-----------------PSPQKNVDKDFLKKYWAKWKAS-HTGR 479
E VR S +G+ AG +I + + N L+ KWK + R
Sbjct: 377 EAVRTSSQGWIAGGSICCNTVNMYGGKYKWPNMDNYRSNTPLPELRPLLRKWKGNPAVNR 436
Query: 480 SRAMPHIKTFARY------NGQK-----------------LAKAAWGALQKNNSQLMIRS 516
+R PHIK++ RY NG + L+++AWG L K ++ L +RS
Sbjct: 437 TRDAPHIKSYLRYREVAGENGTETRVDGDEVAWFLLTSSNLSRSAWGYLNKASTDLTLRS 496
Query: 517 YELGVLILPS-------------AKRHGCGFSCT 537
+E+GV+ LPS A GF+CT
Sbjct: 497 FEMGVMFLPSLLRSPSQDSDDGNAAAKASGFTCT 530
>gi|307188952|gb|EFN73469.1| Probable tyrosyl-DNA phosphodiesterase [Camponotus floridanus]
Length = 666
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 129/439 (29%), Positives = 207/439 (47%), Gaps = 75/439 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK-PANW 241
I D G+I+ ++ N+MVD+ WL + + +++++GE + R K +N
Sbjct: 269 ILDRSLGEIVNSLHMNFMVDVGWLCLQYLLAGQRTDMMILYGE------RVDREKLGSNI 322
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL--- 296
+ +P+ FG HHSK M+ Y G+R++V TANL DW+N++QGLW+ PL
Sbjct: 323 TMIHVDMPVRFGCHHSKIMIFQYKDDGIRVVVSTANLYSDDWDNRTQGLWISPHLPLLPE 382
Query: 297 -KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
+ ++ GF+ DL YLS + P + + A ++ NFS+ V L+A
Sbjct: 383 SANPSDGESPTGFKKDLERYLSKYRHPALTQWIWA----------VRRANFSAVNVFLVA 432
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG H + + WGH KL VL + T + P+V Q SS+GSL + + LS +
Sbjct: 433 SVPGTHKDAEVDSWGHRKLAYVLSRHATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKDI 492
Query: 415 SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 471
S + T P ++P++E+ + S + +P S Q + + +++ Y +
Sbjct: 493 IPCMSRETTKGLKSHPNFQFIYPSIENYKHSFDCRNLSCCLPYSAQVHSKQQWIESYLYQ 552
Query: 472 WKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYEL 519
W+A T R RAMPHIK++ R + L+KAAWG +Q++N +M SYE
Sbjct: 553 WRAKRTRRDRAMPHIKSYTRISPDLKRIPWFVLTSANLSKAAWG-VQRSNHYIM--SYEA 609
Query: 520 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 579
GV+ +P K +T T + V
Sbjct: 610 GVIFIP---------------------------------KFITQTTTFPIEDEEDPAVPI 636
Query: 580 LPVPYELPPQRYSSEDVPW 598
P+PY+LP +RY S D P+
Sbjct: 637 FPIPYDLPLRRYDSSDSPF 655
>gi|346323354|gb|EGX92952.1| tyrosyl-DNA phosphodiesterase [Cordyceps militaris CM01]
Length = 515
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 153/521 (29%), Positives = 235/521 (45%), Gaps = 99/521 (19%)
Query: 154 HVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP- 211
H S D + S FRL ++ L +N +++ D++ +I + NY DI +L+
Sbjct: 32 HKSVDTVSSPFRLTWIRDLDEESNQDAITLTDLLGDPLISECWNFNYQHDIPFLMGTFDR 91
Query: 212 -VLAKIPHVLVIHG---ESDGT---LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 264
+ A + V V+HG DG L + P N LH P+P FGTHHSK ML+++
Sbjct: 92 DIRAHV-QVHVVHGFWKREDGNRLRLVEQAEHFP-NVKLHVAPMPEMFGTHHSK-MLIVF 148
Query: 265 PRG--VRIIVHTANLIHVDWNNKSQGLWM-----------QDFPLKDQNNLSEECGFEND 311
R ++I+HTAN+I DW N + W+ +D P + F+ D
Sbjct: 149 RRDDTAQVIIHTANMIAKDWTNMTNAAWISPILPKLNTAPKDSPRPENMTPGSGPRFQFD 208
Query: 312 LIDYLSTLKWPEFSANLPAHGNFKINPSF------FKKFNFSSAAVRLIASVPG---YHT 362
L+ YL++ ++ P+ K ++FSS L+ASVPG HT
Sbjct: 209 LLSYLTSYD--------------RMRPTCTGLVQSLKVYDFSSVKGSLVASVPGTHEVHT 254
Query: 363 GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWM-AELSSSMSSGFS 419
+ WG + L++ + G KS + Q SS+ +L ++ W+ L ++S G S
Sbjct: 255 EAGATAWGWSAMGKCLEQIPCQAG--KSEVTVQVSSIATLGGNDGWLRGTLFKALSKGKS 312
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS 475
T + +V+PT +++R SL+GYA+G +I S Q+ + +L+ + W A
Sbjct: 313 A-TTAAAAPQFKVVFPTADEIRASLDGYASGGSIHTKIQSKQQEMQLRYLRPIFHYWMAD 371
Query: 476 HT----------GRSRAMPHIKTFARYNGQK-----------LAKAAWGALQKNNSQLMI 514
GR RA PHIKT+ R N + L+K AWG K Q I
Sbjct: 372 DASKAASSFRDAGRDRAAPHIKTYIRTNEKNTMDWALVTSANLSKQAWGEAAKPTGQFRI 431
Query: 515 RSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGA 573
S+E+GVL+ PS K+ C + VP GS E Q+ G
Sbjct: 432 ASWEIGVLVWPSLFKKDAIMKGCFKSDVP-----GSAEGHGGQR--------------GE 472
Query: 574 SSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ VV +PY LP ++YS E +PW + K+D GQ W
Sbjct: 473 AETVVGFRMPYSLPLRKYSREAMPWVATMSHEKEDCLGQSW 513
>gi|302422748|ref|XP_003009204.1| tyrosyl-DNA phosphodiesterase [Verticillium albo-atrum VaMs.102]
gi|261352350|gb|EEY14778.1| tyrosyl-DNA phosphodiesterase [Verticillium albo-atrum VaMs.102]
Length = 527
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 148/513 (28%), Positives = 230/513 (44%), Gaps = 92/513 (17%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IPH 218
PS F+L ++ LP +N V+++D++ +I N++ DI +L+ + +
Sbjct: 43 PSPFQLTHIRDLPTSSNADAVTLKDLLGDPLISECWEFNFLHDIPFLMSHFDEDTRDLVK 102
Query: 219 VLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 272
V V+HG DG ++ A N LH P+P FGTHH+K M+L + ++I+
Sbjct: 103 VHVVHGFWKREDGNRVALQEEAAAWKNVELHTAPMPEMFGTHHTKMMILFRHDDTAQVII 162
Query: 273 HTANLIHVDWNNKSQGLWMQD-FPLKDQNN-----------LSEECG----FENDLIDYL 316
HTAN+I DW N + G+W PL Q N +E+ G F++DL+ YL
Sbjct: 163 HTANMIAKDWTNMTNGVWRSPLLPLGPQPNGGKLEDGEVYEANEDFGSGRKFKSDLLRYL 222
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKL 374
+ + ++ +++F+ LIASVPG H +S WG L
Sbjct: 223 RAYDARKIT--------LRLLTEQLARYDFAGVRAVLIASVPGRHAIHDTSQTAWGWPAL 274
Query: 375 RTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL---SSSMSSGFSEDKTPLGIGE 429
+ L+ + G KS +V Q SS+ +L + W+ + S S+S G S P
Sbjct: 275 KRALRRVPVQTG--KSEIVVQISSIATLGGTDSWLQKCLFDSLSLSKGSSIGPRPAF--- 329
Query: 430 PLIVWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWK------------ 473
+V+PT +++R SL+GYA+G + I SPQ+ +LK + W
Sbjct: 330 -KVVFPTADEIRRSLDGYASGGSIHTKIASPQQAKQLAYLKSIFCHWANDAPGGKELSKD 388
Query: 474 --ASHTGRSRAMPHIKTFARYNGQ----------KLAKAAWGALQKNNSQLMIRSYELGV 521
GR RA PHIKT+ RY Q L+K AWG ++ I S+E GV
Sbjct: 389 TLLRDAGRQRAAPHIKTYIRYGTQSIDWALLTSANLSKQAWGEAASAAQEVRIASWEAGV 448
Query: 522 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS-EVVYL 580
L+ PS + +G+ E + + K S A +S+ VV L
Sbjct: 449 LVWPS------------------LVTGTDEATMVGTFKTDAPGEEAPSSAPSSTGNVVGL 490
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 613
+PY LP Q Y +++PW K D G+V
Sbjct: 491 RMPYSLPLQLYGKDEIPWVLRMSIPKPDWAGRV 523
>gi|383853604|ref|XP_003702312.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Megachile
rotundata]
Length = 701
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 131/450 (29%), Positives = 214/450 (47%), Gaps = 83/450 (18%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N+MVD+ WL + + +L+++G+ ++ K + N
Sbjct: 308 ILDRSLGEIVNSLHINFMVDVGWLCLQYLLAGQRTDMLILYGDR---VDEEKLS--LNIT 362
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQ 299
+ +P FG HH+K M+L Y G+R++V TANL DW N++QGLW+ PL +
Sbjct: 363 MIPVQMPTKFGCHHTKIMILKYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPPLPES 422
Query: 300 NNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
N ++ GF+ DL+ YL+ + P + A ++ +FSS V IAS
Sbjct: 423 ANTNDGESPTGFKKDLLLYLNKYRQPAITEWTSA----------VRRADFSSVNVFFIAS 472
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELS 411
VPG H G WGH KL VL + T + LV Q SS+GSL E W+ E++
Sbjct: 473 VPGRHKGVEYDSWGHRKLGYVLSKHATLPPDAPRWTLVAQSSSIGSLGPSYESWLLKEIT 532
Query: 412 SSMSSGFSEDKTPLGI-GEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKK 467
SSMS ++P + P ++P++ + + S + +P S Q + +++++
Sbjct: 533 SSMSK-----ESPSNLKSHPNFQFIYPSINNYKQSFDCRVGSCCLPYSLQTHSKQEWIES 587
Query: 468 YWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIR 515
Y +WKA+ T R +AMPHIK++ R++ L+KAAWG + K++ +M
Sbjct: 588 YMYQWKATRTARDKAMPHIKSYTRFSPDMKKIPWFVLTSANLSKAAWGTVGKDSHYIM-- 645
Query: 516 SYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 575
+YE GV+ +P F S P + +
Sbjct: 646 NYEGGVIFIPK-------FIIGSTTFPVQEEENG-------------------------- 672
Query: 576 EVVYLPVPYELPPQRYSSEDVPWSWDKRYT 605
V P+PY+LPP +Y S D P+ + Y+
Sbjct: 673 -VPVFPIPYDLPPTKYQSGDKPFVMEFFYS 701
>gi|429856258|gb|ELA31180.1| tyrosyl-dna phosphodiesterase [Colletotrichum gloeosporioides Nara
gc5]
Length = 517
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 147/510 (28%), Positives = 238/510 (46%), Gaps = 94/510 (18%)
Query: 159 KLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAK- 215
++ S F+L ++ LP AN V+++D++ GD ++A NY+ DI +L+ K
Sbjct: 45 RIKSPFQLTWIRDLPEPANRDAVALKDIL-GDPLIAECWEFNYLHDIHFLMSHFDEDTKS 103
Query: 216 IPHVLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
+ V V+HG D ++ A N LH +P FGTHHSK M+L+ + +
Sbjct: 104 LVKVHVVHGFWKREDPNRLALQEEASAYSNVELHGAYMPEMFGTHHSKMMILVRHDDSAQ 163
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDFPL------KDQNNLSEECG----FENDLIDYLSTL 319
+++HTAN+I DW N + +WM PL KD + + G F++DL+ YL
Sbjct: 164 VVIHTANMIAKDWTNMTNAVWMS--PLLRLLKEKDSTSCEDAIGTGQRFKHDLLSYLKA- 220
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTV 377
++ P + +++FSS LIASVPG H+ +S WG L+ V
Sbjct: 221 ----YNVRRPTLRDLVDK---LSQYDFSSVKAALIASVPGRHSIHDTSQTSWGWPALKHV 273
Query: 378 LQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPL-IV 433
L+ + G KS +V Q SS+ +L + W+ + L + +S S DK P +V
Sbjct: 274 LRHVPVQDG--KSEIVVQISSIATLGATDNWIQKCLFNPLSE--SSDKGPKKTKPTFKVV 329
Query: 434 WPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------------KAS 475
+PT +++R SL+GYA+G +I S Q+ +L ++ W
Sbjct: 330 FPTADEIRRSLDGYASGGSIHTKIQSQQQAKQLAYLHPFFCHWGNDAPNGKALPETATVR 389
Query: 476 HTGRSRAMPHIKTFARYNGQK-----------LAKAAWGALQKNNSQLMIRSYELGVLIL 524
GR RA PHIKT+ RY G+K ++K AWG + + ++ I S+E+GVL+
Sbjct: 390 EAGRKRAAPHIKTYIRY-GEKSIDWALVTSANISKQAWGEVAGASQEVRIASWEIGVLVW 448
Query: 525 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 584
P T +++ S +TE S+ VV + +PY
Sbjct: 449 PEMMAEKATMMST---FQTDLPSNNTE---------------------GSNPVVGVRIPY 484
Query: 585 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
LP Q Y+ +++PW + + D G+ W
Sbjct: 485 NLPLQHYAKDEIPWVATMAHAEPDNMGRFW 514
>gi|340960785|gb|EGS21966.1| hypothetical protein CTHT_0038420 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 487
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 147/508 (28%), Positives = 220/508 (43%), Gaps = 86/508 (16%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IP 217
+PS FRL R++ LPA N V+++D++ +I NYM DID+L+ A + +
Sbjct: 10 IPSPFRLTRIRDLPANLNQDTVTLKDLLGDPLISECWEFNYMHDIDFLMSAFDEDTRHLV 69
Query: 218 HVLVIHG------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRI 270
V V+HG S TL P N LH +P FGTHHSK M+L+ + RI
Sbjct: 70 KVHVVHGFWKREDLSRVTLHEQAARYP-NVALHAAYMPEMFGTHHSKMMILLRHDDTARI 128
Query: 271 IVHTANLIHVDWNNKSQGLWMQDF-PL----KDQNNLSEE-----CGFENDLIDYLSTLK 320
++HTAN+I DW N +Q +WM + PL Q N+ E F+ DL++YL
Sbjct: 129 VIHTANMIVRDWTNMTQAVWMSPWLPLMKGPSQQENVHEAKPGSGAKFKVDLLNYLRAYD 188
Query: 321 WPEFSANLPAHGNFKINPSFFK--KFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLRT 376
+ G P K +F+FS LIASVPG H SS +WG +
Sbjct: 189 ---------SRGRETCKPIIEKLMRFDFSEVKGALIASVPGRHKLNDSSPTRWGWAAMEQ 239
Query: 377 VLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP----LI 432
L+ + + + + ++LG D S ++S G + + +P +
Sbjct: 240 ALKTVPVHQQAEIAIQISSIATLGPTDNWLKNTFSRALSGGRG-----VSLSQPPPSFKV 294
Query: 433 VWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY 492
++PT +++R SL+GYA+G +I + ++ + + K +GR RA PHIKT+ RY
Sbjct: 295 IFPTADEIRKSLDGYASGGSIHTKIQSPQQVKQLQQADKSAVLDSGRKRAAPHIKTYIRY 354
Query: 493 NGQ-------------KLAKAAWG-------------ALQKNNSQLMIRSYELGVLILPS 526
+ L+K AWG + ++ I SYE+GVL+ P
Sbjct: 355 GNKSHQTIDWALLTSANLSKQAWGEAASAPGGSKGKSTASSGDREVRIASYEIGVLVWPE 414
Query: 527 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYEL 586
T G T Q K V L +PY L
Sbjct: 415 LWGEDAAMKATFMTDNLGDSRGGEFTEQEGKV------------------TVALRMPYSL 456
Query: 587 PPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
P Q Y + +VPW + + D GQVW
Sbjct: 457 PLQPYDNAEVPWVATTNHEEPDWMGQVW 484
>gi|367053563|ref|XP_003657160.1| hypothetical protein THITE_2122630 [Thielavia terrestris NRRL 8126]
gi|347004425|gb|AEO70824.1| hypothetical protein THITE_2122630 [Thielavia terrestris NRRL 8126]
Length = 548
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 143/515 (27%), Positives = 223/515 (43%), Gaps = 89/515 (17%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHV 219
S F+L +++ LP N +++D++ +I NY+ DID+L+ A P + + V
Sbjct: 63 SPFKLTKIRDLPPELNRDTTTLKDILGDPLISECWEFNYLHDIDFLMAAFDPDVRGLVQV 122
Query: 220 LVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
V+HG E LE ++ N LH +P FGTHHSK M+L+ + +I++H
Sbjct: 123 HVVHGFWKREDPSRLELQAAASRYENVTLHNAYMPEMFGTHHSKMMILLRHDDTAQIVIH 182
Query: 274 TANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEE-----CGFENDLIDYLSTLKWPE 323
TAN+I DW N +Q +W+ P + N +E F+ D ++YL +
Sbjct: 183 TANMIVRDWTNMTQAVWLSPRLPLIKPAQQAVNQAEARTGSGAKFKMDFLNYLRSYD--- 239
Query: 324 FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS--SLKKWGHMKLRTVLQEC 381
K +++FS LIASVPG H S S +WG + L+
Sbjct: 240 -----TRKSTCKPIIEQLLRYDFSEIRASLIASVPGRHKFSENSPTRWGWAAMEEALKAV 294
Query: 382 TFEKGFKKSPLVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
+ KS + Q SS+ +L + W+ + ++S G P + +V+PT +
Sbjct: 295 PVSQA--KSEIAIQISSIATLGPTDSWLKDTFFRALSRGRRGTGPPSAPPDFKVVFPTPD 352
Query: 439 DVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK--------------ASHTGRS 480
++R SL+GYA+G +I SPQ+ +L+ W GR
Sbjct: 353 EIRKSLDGYASGGSIHTKIQSPQQVKQLQYLRPMLCHWANDSPHGVELEAGAAVQEAGRK 412
Query: 481 RAMPHIKTFARYNGQ-----------------KLAKAAWG-ALQKNNSQLMIRSYELGVL 522
RA PH+KT+ RY G L+K AWG A ++ I SYE+GVL
Sbjct: 413 RAAPHVKTYIRYRGDGPPHGPITIDWALLTSANLSKQAWGEAANAKTGEIRISSYEIGVL 472
Query: 523 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 582
+ P + + G + + + + G + V L V
Sbjct: 473 VWP--ELYAPGATMQATFLTDTLAEGERRDAAAAAATAVPLR-----------------V 513
Query: 583 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 617
PY LP Q Y +VPW Y+++D GQVW RH
Sbjct: 514 PYNLPLQPYGKGEVPWVATASYSERDWMGQVW-RH 547
>gi|332029124|gb|EGI69135.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 667
Score = 166 bits (419), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 130/439 (29%), Positives = 205/439 (46%), Gaps = 75/439 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N+MVD+ WL + + +++++G+ + R K N I
Sbjct: 273 ILDRSLGEIVNSLHLNFMVDVGWLCLQYLLAGQCTDMMILYGD------RVDREKLNNNI 326
Query: 243 -LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKD 298
+ + +P FG HH+K M+L Y G+R++V TANL DW N++QGLW+ P L +
Sbjct: 327 TMIEVDMPTKFGCHHTKIMILQYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPRLPE 386
Query: 299 QNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
N S+ GF+ DL Y + + P + + A ++ +FS V L+A
Sbjct: 387 SANPSDGESPTGFKKDLERYFNKYRHPALTQWICA----------IRRADFSDVNVFLVA 436
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG H + WG+ KL VL T + P+V Q SS+GSL + + LS +
Sbjct: 437 SVPGTHKDNEADSWGYKKLAHVLSRYATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKDI 496
Query: 415 SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 471
S + T P ++P++E+ + S + +P S + + + +++ Y +
Sbjct: 497 IPCMSRETTKGLKSHPHFQFIYPSIENYKQSFDCRNLSCCLPYSTKVHSKQQWIESYLYQ 556
Query: 472 WKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYEL 519
WKA TGR RAMPHIK++ R + L+KAAWG +Q+NN +M SYE
Sbjct: 557 WKAKRTGRDRAMPHIKSYTRISPDLKRISWFVLTSANLSKAAWG-VQRNNHYIM--SYEA 613
Query: 520 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 579
GV+ +P KL+T T + V
Sbjct: 614 GVIFIP---------------------------------KLITGTTTFPIEEEEDPAVPV 640
Query: 580 LPVPYELPPQRYSSEDVPW 598
P+PY+LP RY S D P+
Sbjct: 641 FPIPYDLPLCRYESSDSPF 659
>gi|50550131|ref|XP_502538.1| YALI0D07590p [Yarrowia lipolytica]
gi|49648406|emb|CAG80726.1| YALI0D07590p [Yarrowia lipolytica CLIB122]
Length = 471
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 140/510 (27%), Positives = 227/510 (44%), Gaps = 101/510 (19%)
Query: 150 LCNFHVSRDKLPST-----FRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDI 203
+ N V R K+ S +L + LP NT V ++D+I + A+ N+M+D+
Sbjct: 1 MDNDRVKRRKVESESDNGRTQLTAITALPDEENTGSVHLKDLIGSPHLEAMWQFNFMIDL 60
Query: 204 DWLLPAC--PVLAKIPHVLVI---HGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHH 256
++L ++ I V+ GE ++ P N + + L F THH
Sbjct: 61 AFVLDNIHKNAMSNIKCRFVMGDFSGEKIAAFRAQAKSLPIADNIEVGRAKLSNLFATHH 120
Query: 257 SKAMLLIY-----PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE---CGF 308
+K M+L + R ++++HTAN+IH DW+N +QG+W +K++ + E F
Sbjct: 121 TKMMVLFFKEDKGERSAQVVIHTANMIHHDWDNMTQGVWKSQ-KVKEKRKTNTEGSTSTF 179
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
E DL+ YLS + S + F ++F++SS R++ SVPG H KK
Sbjct: 180 ETDLVAYLSEYQLDTTSKLI----------KFLQRFDWSSETARVVGSVPGTHKD---KK 226
Query: 369 WGHMKLRTVLQECTFE-----KGFKKSPLVYQFSSLGSL--DEKWMA-ELSSSMSSGFSE 420
WG ++ +L E + +G + +V Q SS+GSL +KW+ +L ++
Sbjct: 227 WGLTRVADLLDEHKEDHKSDYEGSESDTIVLQSSSIGSLGVTDKWITPQLVGALDGRSPR 286
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH 476
D+ G+ IVWPTVE+VR S +GY G +I S ++K+ WKA +
Sbjct: 287 DRDGHGLPASQIVWPTVENVRRSFDGYDLGMSIHFKNESDTHRKQYAYMKERMNVWKADN 346
Query: 477 TGRSRAMPHIKTFARY-----------NGQKLAKAAWGALQ-KNNSQLMIRSYELGVLIL 524
R+RAMPHIKT+ R+ ++K AWG++ S+ I S+ELGVL+
Sbjct: 347 KHRTRAMPHIKTYTRFTRAGKLRWVLLTSANISKYAWGSVSAAKESKFSIPSWELGVLLF 406
Query: 525 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 584
P A F ++ +PY
Sbjct: 407 PQAVGKAV-FDLKDSV-----------------------------------------IPY 424
Query: 585 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ P YS++D PW+ + + +KD G W
Sbjct: 425 DWPLTNYSAKDEPWTKNADHLEKDTNGFPW 454
>gi|301791029|ref|XP_002930517.1| PREDICTED: LOW QUALITY PROTEIN: tyrosyl-DNA phosphodiesterase
1-like [Ailuropoda melanoleuca]
Length = 473
Score = 165 bits (417), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 132/382 (34%), Positives = 187/382 (48%), Gaps = 67/382 (17%)
Query: 258 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLI 313
K MLL+Y G+ +++HT++LIH D + K+QG W+ +P + + S E F+ DLI
Sbjct: 131 KMMLLLYEEGLWVVIHTSDLIHADCHQKTQGEWLTPLYPRIIHGXHRSGESATHFKADLI 190
Query: 314 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 373
YL P + K + S V LI S PG GS GH +
Sbjct: 191 SYLMAYNAPSLKEWI----------DTVHKHDISETNVYLIGSTPGCFQGSRKDHXGHFR 240
Query: 374 LRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGI 427
LR +L+E + KG + P+V QFSS+GSL D KW+ +E S+++ E +TP
Sbjct: 241 LRKLLKEHASSIPKG-ESWPIVGQFSSIGSLGADDLKWLCSEFKESLATLGKESRTPGKS 299
Query: 428 GEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPH 485
PL +++P+VE+V+ SLE Y AG+++PS + +K + L Y+ K A +G + AMPH
Sbjct: 300 AVPLHLIYPSVENVQTSLEEYPAGDSLPSSIQIAEKQNCLHSYFHKXVADTSGCNNAMPH 359
Query: 486 IKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 533
IK + R L+K GAL+KN QLMI SYE GVL L SA
Sbjct: 360 IKRYMRPSPDFSQIVWLLVTSTNLSKTTXGALEKNGXQLMIHSYEXGVLFLLSA------ 413
Query: 534 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 593
F S V K KL +G+ PVPY+LPP+ Y S
Sbjct: 414 FGLDSFKV---------------KQKL----------SGSKEPAATFPVPYDLPPELYGS 448
Query: 594 EDVPWSWDKRYTK-KDVYGQVW 614
+D P + YTK D +G +W
Sbjct: 449 KDRPXIXNIPYTKVPDTHGNMW 470
>gi|397613425|gb|EJK62211.1| hypothetical protein THAOC_17185, partial [Thalassiosira oceanica]
Length = 576
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 137/517 (26%), Positives = 226/517 (43%), Gaps = 124/517 (23%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG-TLEHMKR--------NKPANWILHK 245
+++++++D+++L P + K V+V +G +G +++ M++ K +I
Sbjct: 56 VITSFLLDVEYLFEELPEIIKYQKVIVYYGSVEGNSMQAMRQWEQVLGNSGKTVEFIRLV 115
Query: 246 P---------PLP--ISFGTHHSKAMLLIYPRG----VRIIVHTANLIHVDWNNKSQGLW 290
P PLP + +G HHSK L Y RI +H+ANL D K+QG++
Sbjct: 116 PSDPPYSATNPLPFKLPYGVHHSKFFLSGYEEEGKHMCRIGIHSANLRRSDIERKTQGIY 175
Query: 291 MQDF--------------PLK-----DQNNLSEECGFENDLIDYLSTLKWPE-----FSA 326
+QDF P K + ++L + FE+DLI Y+ + ++ FS
Sbjct: 176 VQDFPAKAPKKQAAAAVNPYKRAKVDEDDDLRQ---FEDDLITYMESYRYYVRGQIWFSP 232
Query: 327 NLPAHGNFKINP----SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQEC- 381
+ G + ++++FS A L+ SVPGYH + K+G+ K+ ++
Sbjct: 233 STTQSGGLTDRSHSILTLLRRYDFSCAYAVLVPSVPGYHQARDMPKFGYYKIHKAVKNAR 292
Query: 382 TFEKGFKKS---------PLVYQFSSLGSLDEKWMAELSSSMSSGFSED----------K 422
+ G +S P+++Q SSLG++ +W+ +L +++ S +
Sbjct: 293 SGRAGSNQSSSGETETPKPIIFQVSSLGTIQNRWLIKLLAAIDSNCHRNDPSTYLPAGKS 352
Query: 423 TPLGIGEPL-----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
P G PL +VWPTVE+VR +EGYA G AIP + +DKDFL + +W T
Sbjct: 353 IPQGKTPPLETRMKLVWPTVEEVRTCVEGYAGGGAIPGTTEKLDKDFLLPLYHRWSNPDT 412
Query: 478 G------RSRAMPHIKTFAR-----------YNGQKLAKAAWGALQ----KNNSQLMIRS 516
+R PHIKTF + L+K + G Q N +LMI+
Sbjct: 413 NILGPLRTARYAPHIKTFVQPGDGDEIHWVVLTSHNLSKPSLGEFQTDTKTNERRLMIQH 472
Query: 517 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 576
+ELGV P + ++P E E Q G DA
Sbjct: 473 WELGVFFSPETLTKMTSDNSPLRMIPFE------EAGQC-----------GIKDA----- 510
Query: 577 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 613
+P+PY L P RY + W+ D+ + D +G+V
Sbjct: 511 -ALVPLPYSLHPSRYDENEEAWATDRPASTPDAFGRV 546
>gi|313236496|emb|CBY11811.1| unnamed protein product [Oikopleura dioica]
Length = 495
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 130/441 (29%), Positives = 208/441 (47%), Gaps = 90/441 (20%)
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 257
NYM+D++++L P +KI L + G D + + P N P+P FGTHH+
Sbjct: 118 NYMIDLEFVLKHHPNSSKI---LFVSG--DTLFQPGRDGIPDNIFQSVVPVP-QFGTHHT 171
Query: 258 KAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDFP--LKDQNNLSEECGFENDLID 314
K +L + G+R+ +++ANL+ DW ++Q +W+ LK+++ S E FE DL++
Sbjct: 172 KMSILKFRNIGLRVAIYSANLLDYDWRERTQVIWLSPLLPLLKEKSKTSSE--FETDLVE 229
Query: 315 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 374
Y+ + ++ L + F+K++FSS R I S PG +GH+KL
Sbjct: 230 YIDSYSLAPLNSLLQS----------FEKYDFSSIKARFIGSSPGRRRDKEKWIFGHLKL 279
Query: 375 RTVLQECTFEKGFKKSPLVYQFSSLGSLDEK-------WMAEL--SSSMSSGFSEDKTPL 425
R VL++ + K LV Q SS+GSL + ++A L S +S +++D
Sbjct: 280 RKVLKKIS--NCAKNDKLVAQCSSIGSLRSRDSWLYNEFLASLMTCSDAASYYTKDNDAF 337
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASH-TGRSRAM 483
+ V+PTVE +RCS GY++G + P S + + + ++ Y +KW+ TGRSR M
Sbjct: 338 SL-----VYPTVEQIRCSKFGYSSGGSFPYSAKTHESQKWIIYYMSKWEPDEKTGRSRVM 392
Query: 484 PHIKTFARYNGQK----------LAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 533
PH K + R + K L+KAAWG +K ++QL IRS+E VL++P
Sbjct: 393 PHSKIYQRVSDGKVKWFLSGSHNLSKAAWGQYEKGDTQLHIRSFEASVLLIPE------D 446
Query: 534 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 593
+ S P+ + E Q RYS
Sbjct: 447 YGLESFNFPAFPNFHNFEKIQ-----------------------------------RYSD 471
Query: 594 EDVPWSWDKRYTKKDVYGQVW 614
D PW +D +Y + D + Q W
Sbjct: 472 NDFPWLYDNKYLQPDDFNQTW 492
>gi|380026209|ref|XP_003696847.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Apis
florea]
Length = 695
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 141/451 (31%), Positives = 209/451 (46%), Gaps = 99/451 (21%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--AN 240
I D+ G+I+ ++ N+MVDI WL + + ++ ++ GE T P +N
Sbjct: 301 ILDISLGEIVNSLHINFMVDIGWLHVQYMLAEQNTNMSILLGERVDT-------GPVGSN 353
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLK 297
+P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL
Sbjct: 354 VTTFYVDMPTKFGCHHTKIMILKYKDDGIRVVVSTANLYMDDWENRTQGVWISPHLPPLS 413
Query: 298 DQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
+ N SE GF+ DL YL+ + P + A ++ +FSS V +
Sbjct: 414 ESANSSEGESPTGFKKDLERYLNRYRQPGITEWTCA----------VRRADFSSVNVFFL 463
Query: 355 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP------LVYQFSSLGSLD---EK 405
ASVPG HT WGH KL ++L K K P LV Q SS+GSL E
Sbjct: 464 ASVPGRHTDMEYDSWGHRKLGSILS-----KHAKLPPDAPQWILVAQSSSIGSLGPNYES 518
Query: 406 WMA-ELSSSMSSGFSEDKTPLGI-GEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNV 460
W+ E++SSMS + P+G+ P ++P++ + + S + +P S Q +
Sbjct: 519 WLQKEITSSMSK-----ENPVGLKSHPNFHFIYPSLNNYKQSFDCRVGSCCLPYSLQTHS 573
Query: 461 DKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKN 508
+ +++ Y +WKA TGR RAMPHIKT+ R + L+KAAWG + KN
Sbjct: 574 KQKWIESYMYQWKAKQTGRDRAMPHIKTYTRISPDLKRIPWFVLTSANLSKAAWGTVGKN 633
Query: 509 NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVP-SEIKSGSTETSQIQKTKLVTLTWHG 567
+ +M +YE GV+ +PS F S+ P E + G
Sbjct: 634 SHYIM--NYEGGVVFIPS-------FITGSSTFPIKEEEPG------------------- 665
Query: 568 SSDAGASSEVVYLPVPYELPPQRYSSEDVPW 598
V PVPY+LP RY D P+
Sbjct: 666 ---------VPIFPVPYDLPLTRYEKNDSPF 687
>gi|310798351|gb|EFQ33244.1| tyrosyl-DNA phosphodiesterase [Glomerella graminicola M1.001]
Length = 517
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 147/515 (28%), Positives = 238/515 (46%), Gaps = 101/515 (19%)
Query: 159 KLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAK- 215
++ S F+L R++ LP AN V+++D++ GD ++A N++ DI +L+ A+
Sbjct: 42 RIRSPFQLTRIRDLPEAANRDTVALKDIL-GDPLIAECWEFNFLHDIHFLMSHFDADARD 100
Query: 216 IPHVLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
+ V V+HG D ++ A N LH +P FGTHHSK M+LI + +
Sbjct: 101 LVKVHVVHGFWKREDPNRLALQEEADAYPNVELHSAFMPEMFGTHHSKMMILIRHDDSAQ 160
Query: 270 IIVHTANLIHVDWNNKSQGLW------------MQDFPLKDQNNLSEECGFENDLIDYLS 317
+++HTAN+I DW N + +W ++D P D + E F++DL+ YL
Sbjct: 161 VVIHTANMIAKDWTNMTNAVWRSPMLPLLPNNYVEDAPTNDHPFGTGE-RFKHDLLGYLR 219
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLR 375
++A P K ++FSS +LIASVPG H +S WG L+
Sbjct: 220 A-----YNARRP---TLKSLVDQICHYDFSSVRAKLIASVPGRHPIHDTSQTAWGWPALK 271
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-----LSSSMSSGFSEDKTPLGIG 428
L+ ++G KS +V Q SS+ +L + W + L+ S ++ S + +
Sbjct: 272 RALRSVPVQEG--KSEVVVQVSSIATLGSSDSWTQKCLFDSLAVSKNNSSSNPRPKFKV- 328
Query: 429 EPLIVWPTVEDVRCSLEGYAAGNAIPSP----QKNVDKDFLKKYWAKWK----------- 473
V+PT +++R SL+GYA+G +I + Q+ +L+ + W
Sbjct: 329 ----VFPTADEIRRSLDGYASGGSIHTKIQSQQQAKQLQYLRSMFCHWANDAPDGEPLPE 384
Query: 474 ---ASHTGRSRAMPHIKTFARYNGQK-----------LAKAAWGALQKNNSQLMIRSYEL 519
GR RA PHIKT+ RY G+K ++K AWG + + ++ I S+E+
Sbjct: 385 TATIREAGRQRAAPHIKTYIRY-GEKSIDWALVTSANISKQAWGEAARPSQEVRIASWEI 443
Query: 520 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 579
GVL+ PS I G+ E+ QK DAG VV
Sbjct: 444 GVLVWPSI------------IAEKATMIGAFESDMPQK------------DAGDGDPVVG 479
Query: 580 LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ +PY +P Q Y +++PW +T+ D G+ W
Sbjct: 480 IRIPYSIPLQSYGKDEIPWVASMVHTEPDSMGRFW 514
>gi|408398119|gb|EKJ77253.1| hypothetical protein FPSE_02528 [Fusarium pseudograminearum CS3096]
Length = 513
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 147/508 (28%), Positives = 228/508 (44%), Gaps = 86/508 (16%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIP 217
+PS ++L +Q LP N VS++D++ +I N++ DI +L+ A P +
Sbjct: 38 IPSPWQLTWIQDLPESENKDAVSLQDLLGDPLISECWEFNFLHDIPFLMNAFDPDTRHLV 97
Query: 218 HVLVIHGESDGTLEHMKRNKPA---------NWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+V ++HG +H +N+ A N +H P+P FGTHHSK M+L +
Sbjct: 98 NVHLVHG----FWKHEDKNRIALENAAAKFENVNIHIAPMPEMFGTHHSKMMVLFRHDDT 153
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDL--------IDYLSTL 319
++I+HTAN+I DW N + G+W + N E L ID L+ L
Sbjct: 154 AQVIIHTANMIPKDWTNMTNGVWKSPLLPRMSNTQILTSSPEEFLVGSGERFKIDLLNYL 213
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTV 377
K+ + + + K+ ++++FS+ LIASVPG H + + WG L+
Sbjct: 214 KFYDKRKIVCKPLSDKL-----QQYDFSTVKAALIASVPGRHDVHDMSETSWGWAALKRC 268
Query: 378 LQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEPL--IV 433
L+ + S +V Q SS+ +L K W L ++ S K G+G P +V
Sbjct: 269 LRHVPCHQ-HGDSDIVVQVSSIATLGPKDDW---LQKTLFDHLSRCKD-TGLGRPRFKVV 323
Query: 434 WPTVEDVRCSLEGYAAG----NAIPSPQKNVDKDFLKKYWAKWKAS-------------H 476
+PT +++R SL+GYA+G I SPQ+ ++L+ + W
Sbjct: 324 FPTADEIRRSLDGYASGLSIHTKIQSPQQAKQLEYLRPMFHHWANDSPGGTKLPDGPVLE 383
Query: 477 TGRSRAMPHIKTFARYN----------GQKLAKAAWGALQKNNSQLMIRSYELGVLILPS 526
+GR RA PHIKT+ R N ++K AWG + ++ I S+E+GVLI P
Sbjct: 384 SGRKRAAPHIKTYVRSNKSSIDWGLLTSANISKQAWGEAAQLTGEMRIASWEVGVLIWPE 443
Query: 527 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYEL 586
G T E+ E + S VV L +PY
Sbjct: 444 LLEPGSVMVGTYKTDVPEVSRSPKEDEE-------------------SLPVVGLRIPYNT 484
Query: 587 PPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
P QRY+SE+VPW +T+ D GQ W
Sbjct: 485 PLQRYTSEEVPWVVSMSHTEPDWAGQSW 512
>gi|346970364|gb|EGY13816.1| tyrosyl-DNA phosphodiesterase [Verticillium dahliae VdLs.17]
Length = 527
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 145/513 (28%), Positives = 226/513 (44%), Gaps = 92/513 (17%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IPH 218
PS F+L ++ LP +N V+++D++ +I N++ DI +L+ + +
Sbjct: 43 PSPFQLTHIRDLPDSSNADTVTLKDLLGDPLISECWEFNFLHDIPFLMSHFDKDTRDLVK 102
Query: 219 VLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 272
V V+HG DG ++ A N LH P+P FGTHH+K M+L + ++I+
Sbjct: 103 VHVVHGFWKREDGNRMALQEEAAAWKNLELHNAPMPEMFGTHHTKMMILFRFDDTAQVII 162
Query: 273 HTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG---------------FENDLIDYL 316
HTAN+I DW N + G+W PL Q + + F++DL+ YL
Sbjct: 163 HTANMIAKDWTNMTNGVWRSPLLPLGPQPDSGKPEAEEESEADEDFGSGRKFKSDLLSYL 222
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKL 374
+ + + K++F+ IASVPG H +S WG L
Sbjct: 223 RAYDARKIT--------LRPLTEQLVKYDFAGIRAVFIASVPGRHAIHDTSQTAWGWPAL 274
Query: 375 RTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL---SSSMSSGFSEDKTPLGIGE 429
+ L+ + G KS +V Q SS+ +L + W+ + S S+S G S P
Sbjct: 275 KRALRRVPVQAG--KSEVVVQISSIATLGGTDSWLQKCLFDSLSLSKGSSISPRPAF--- 329
Query: 430 PLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK------------ 473
+V+PT +++R SL+GYA+G +I SPQ+ +LK + W
Sbjct: 330 -RVVFPTADEIRRSLDGYASGGSIHTKIASPQQAKQLAYLKPIFCHWANDAPGGKEISKD 388
Query: 474 --ASHTGRSRAMPHIKTFARYNGQ----------KLAKAAWGALQKNNSQLMIRSYELGV 521
GR RA PHIKT+ RY Q L+K AWG ++ I S+E GV
Sbjct: 389 TALQDAGRQRAAPHIKTYIRYGTQSIDWALLTSANLSKQAWGEAASAAQEVRIASWEAGV 448
Query: 522 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS-EVVYL 580
L+ PS + +G+ E + K S A +S+ VV L
Sbjct: 449 LVWPS------------------LVAGTDEAIMVGTFKTDAPGEEAPSGAPSSTGNVVGL 490
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 613
+PY LP Q Y +++PW +T+ D G+V
Sbjct: 491 RMPYSLPLQLYGKDEIPWVASNEHTEPDWAGRV 523
>gi|307109629|gb|EFN57867.1| hypothetical protein CHLNCDRAFT_143337 [Chlorella variabilis]
Length = 370
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 163/314 (51%), Gaps = 46/314 (14%)
Query: 160 LPSTFRLLRVQGLPAWANTSCV--SIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIP 217
L + L+RV+ +P+WAN + S+ ++ G+I ++ N M+D+ WLL ACP L +
Sbjct: 68 LDAPMHLMRVRSIPSWANAGFLGASLSSLVCGNIRWILIQNAMLDLPWLLSACPDLHRAE 127
Query: 218 HVLVI-------------HGESDGTLEHMKRNKPANWIL--------HKPPLPISFGTHH 256
+L++ G TL+ +R L ++P + GT+H
Sbjct: 128 RILLVSHRPWLAKKAKVEEGAKPRTLQARERKLADVRALGLEDRASVYEPAIG-GHGTNH 186
Query: 257 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 316
SK L+ Y RG+R+I+ +AN + D NNK+Q L+ QDFP KD+ + + FE L Y+
Sbjct: 187 SKFFLVDYERGMRVIIMSANAVFSDCNNKTQVLFTQDFPRKDEQS-PKTSAFEGALEAYI 245
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRT 376
L+ P G + +FS+A L+ASVPG H G+ L KWGHM++R
Sbjct: 246 RELRMP--------CGPTLHLVQLIRSCDFSAARGHLVASVPGRHKGADLHKWGHMRMRA 297
Query: 377 VLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA-ELSSSMSSGFSEDKT---------PLG 426
VL + F F+ +PL Q SSLG L+E+W+ E S+++G E T PLG
Sbjct: 298 VLCQEAFPARFRGAPLAAQMSSLGLLNERWLVREFRYSLAAGLCEGGTDVLGLPANGPLG 357
Query: 427 IGEPLIVWPTVEDV 440
+ +V+PTVE+V
Sbjct: 358 LQ---LVYPTVEEV 368
>gi|48094884|ref|XP_392205.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Apis
mellifera]
Length = 692
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 136/446 (30%), Positives = 208/446 (46%), Gaps = 89/446 (19%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--AN 240
I D+ G+I+ ++ N+MVDI WL + + ++ ++ GE T P +N
Sbjct: 298 ILDISLGEIVNSLHINFMVDIGWLHVQYMLAEQNTNMSILLGERVDT-------GPVGSN 350
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLK 297
+P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL
Sbjct: 351 VTTFYVDMPTKFGCHHTKIMILKYKDDGIRVVVSTANLYMDDWENRTQGVWISPHLPPLS 410
Query: 298 DQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
+ N SE GF+ DL YL+ + P + A ++ +FSS V +
Sbjct: 411 ESANSSEGESPTGFKKDLERYLNRYRQPGITEWTCA----------VRRADFSSVNVFFL 460
Query: 355 ASVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLD---EKWMA-E 409
ASVPG HT WGH KL ++L + + LV Q SS+GSL E W+ E
Sbjct: 461 ASVPGRHTDMEYDSWGHRKLGSILSKHAKLPPDAPQWTLVAQSSSIGSLGPNYESWLQKE 520
Query: 410 LSSSMSSGFSEDKTPLGI-GEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFL 465
++SSMS + P+G+ P ++P++ + + S + +P S Q + + ++
Sbjct: 521 ITSSMSK-----ENPVGLKSHPNFHFIYPSLNNYKRSFDCRVGSCCLPYSLQTHSKQKWI 575
Query: 466 KKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLM 513
+ Y +WKA TGR +AMPHIKT+ R + L+KAAWG + KN+ +M
Sbjct: 576 ESYMYQWKAKQTGRDKAMPHIKTYTRISPDLKRIPWFVLTSANLSKAAWGTVGKNSHYIM 635
Query: 514 IRSYELGVLILPSAKRHGCGFSCTSNIVP-SEIKSGSTETSQIQKTKLVTLTWHGSSDAG 572
+YE GV+ +PS F S+ P E + G
Sbjct: 636 --NYEGGVVFIPS-------FITGSSTFPIKEEEPG------------------------ 662
Query: 573 ASSEVVYLPVPYELPPQRYSSEDVPW 598
V P+PY+LP RY D P+
Sbjct: 663 ----VPVFPIPYDLPLTRYEKNDSPF 684
>gi|414886955|tpg|DAA62969.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
Length = 140
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 84/145 (57%), Positives = 96/145 (66%), Gaps = 16/145 (11%)
Query: 483 MPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 532
MPHIKTF RY+GQ +A KAAWGALQKNN+QLMIRSYELGVL LP +
Sbjct: 1 MPHIKTFTRYSGQNIAWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPQTLQSVP 60
Query: 533 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 592
FSCT I+ G I KTKLVTL W G + +V LPVPY+LPPQ Y
Sbjct: 61 QFSCTEK--SRSIRDGVALGKTI-KTKLVTLCWKGDEE---DPSIVRLPVPYQLPPQPYG 114
Query: 593 SEDVPWSWDKRYTKKDVYGQVWPRH 617
++DVPWSWD+RYTKKDVYG VWPR+
Sbjct: 115 TQDVPWSWDRRYTKKDVYGSVWPRY 139
>gi|336471045|gb|EGO59206.1| hypothetical protein NEUTE1DRAFT_145272 [Neurospora tetrasperma
FGSC 2508]
gi|350292122|gb|EGZ73317.1| phospholipase D/nuclease, partial [Neurospora tetrasperma FGSC
2509]
Length = 619
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 172/590 (29%), Positives = 256/590 (43%), Gaps = 117/590 (19%)
Query: 130 KKMRQQDEQDNENGKNSEEAL----CNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRD 185
KK R E+ E +EE C++ R + S F L ++ L +N VS++
Sbjct: 44 KKRRTSPEEGEEESFPAEEQAKKQPCSY---RRVVASPFHLTTIRSLGQNSNKDTVSLKG 100
Query: 186 VIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IPHVLVIHG----ESDGTLE-HMKRNKP 238
++ +I NY+ DID+L+ A + + V VIHG E+ L+ +
Sbjct: 101 LLGDPLIKECWEFNYLHDIDFLMSAFDSDVRHLIKVHVIHGFWKKENTNRLQIQSDAARY 160
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQ-DFPL 296
N H LP FGTHHSK M+L+ II+HTANLI DW+N +Q W+ PL
Sbjct: 161 PNITTHHAYLPEPFGTHHSKMMVLLRADDTAEIIIHTANLIPRDWSNMTQAAWISPRLPL 220
Query: 297 ----KDQNNLSEECG--------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
QNN S F+ D ++YL + + A N I+ K+
Sbjct: 221 LKPDAQQNNSSPRSSLPAGSGEKFKIDFLNYLRSYR---------AACNPLIDQ--LAKY 269
Query: 345 NFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKG------------FKKS 390
+FSS LIASVPG H+ +WG ++ L+ + +K
Sbjct: 270 DFSSIRGSLIASVPGRHSLVDDFPTRWGWAAMKETLKSVPVRQAGDRVQGGGDVDDSEKP 329
Query: 391 PLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLE 445
+V Q SS+ +L + W+ SG KT L I++PT +++R SL+
Sbjct: 330 EVVIQISSIATLGPTDNWLKNTLFEALSGSQGPKTLLSSKSKPDFKIIFPTPDEIRKSLD 389
Query: 446 GYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS--------------HTGRSRAMPHIK 487
GYA+G +I S Q+ +L+ + W GR+RA PHIK
Sbjct: 390 GYASGGSIHTKIQSAQQAKQLQYLRPIFCHWANDSADGVGTTTTTPIREAGRNRAAPHIK 449
Query: 488 TFARY--------------NGQKLAKAAWGALQ-KNNS---QLMIRSYELGVLILPSAKR 529
TF R+ L+K AWG Q KNN+ Q+ I SYE+GVL+ P
Sbjct: 450 TFIRFANHNTKNSIDWALLTSANLSKQAWGDAQSKNNAGEPQVRICSYEIGVLVWPELFA 509
Query: 530 HGCGFSCTSN------IVPSEI-KSGSTETSQIQKTKLVTLTWHGSSDAG---------- 572
G S S +VP+ + + ++ S+ +T L+ +S +G
Sbjct: 510 DSDGTSSGSKTGQKAVMVPTFLTDTPASHGSEKDRTSLLGEKQGSASTSGNGEEDGKGDD 569
Query: 573 -----ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 617
+S+ VV L +PY LP QRY ++VPW + + D GQVW RH
Sbjct: 570 EKEEKSSTVVVGLRMPYNLPLQRYGLQEVPWVATANHLEPDWMGQVW-RH 618
>gi|402082685|gb|EJT77703.1| hypothetical protein GGTG_02808 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 583
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 144/514 (28%), Positives = 235/514 (45%), Gaps = 93/514 (18%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKIPHVL 220
S FRL ++ L N V ++DVI +I I + NY+ DI+++L A + + H++
Sbjct: 101 SPFRLTHIKDLAPQDNVDAVRLKDVIGDPLISEIWNFNYLHDINFVLGA--LDEDVRHMI 158
Query: 221 ---VIHG---ESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRII 271
VIHG + D ++R+ + N LH +P FGTHHSK ++L+ + +++
Sbjct: 159 KVNVIHGFWKKDDRRRIDLQRDAAQNKNLTLHTAFMPEMFGTHHSKMLILLRHDDTAQVV 218
Query: 272 VHTANLIHVDWNNKSQGLWMQ-DFPLK----------DQNNLSEECG--FENDLIDYLST 318
+HTAN+I DW N +Q +W+ PL+ D +L E G F+ DL+ YL
Sbjct: 219 IHTANMIPKDWTNMTQSIWLSPRLPLQKPTAPAPAHVDYESLPEGSGEKFKLDLLSYL-- 276
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRT 376
A + ++++FSS L+ASVPG H S WG +R
Sbjct: 277 ------RAYDKRRAICRPLVQELQRYDFSSVRATLVASVPGRHQIHDRSAATWGWAAIRR 330
Query: 377 VLQECTFEKGFKKSP-LVYQFSSLGSL--DEKWM-AELSSSMSSGFSEDKTPLGIGEPL- 431
L+ + ++P +V Q SS+ +L + W+ L SMS G + +P
Sbjct: 331 ALESVPLQTAAGRTPEVVVQVSSIATLGPTDSWLRGALFDSMSRG---KAAAVAAPKPRF 387
Query: 432 -IVWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWK------------- 473
+++PT +++R SL+GYAAG + I S Q+ +LK + W
Sbjct: 388 KVIFPTPDEIRASLDGYAAGASIHTKIQSAQQVKQLMYLKPLFCHWANDSALGNEKDENA 447
Query: 474 -ASHTGRSRAMPHIKTFARY-NGQK-----------LAKAAWGALQKNNSQLMIRSYELG 520
GR+RA PH+KT+ RY +G++ L+K AWG ++ I S+E+G
Sbjct: 448 PIRDAGRNRAAPHVKTYIRYGDGERSLDWALMTSANLSKQAWGEAVNAMGEVRIASWEIG 507
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
VL+ PS F+ + + P S + + + V+ L
Sbjct: 508 VLVWPSL------FAEKARMAPV-FGSDRLSVEEADEAR------------QGGGPVMGL 548
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+PY LP Q Y +++PW +Y + D G+ W
Sbjct: 549 RIPYNLPVQAYGRDEIPWVATAKYDELDCKGRKW 582
>gi|380479741|emb|CCF42843.1| tyrosyl-DNA phosphodiesterase [Colletotrichum higginsianum]
Length = 520
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 143/515 (27%), Positives = 235/515 (45%), Gaps = 99/515 (19%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAK 215
D++ S F+L R++ LP AN V+++D++ GD ++A N++ DI +L+ +
Sbjct: 44 DRIASPFQLTRIRDLPEAANKDTVTLKDIL-GDPLIAECWEFNFLHDIHFLMSHFDEDTR 102
Query: 216 -IPHVLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGV 268
+ V V+HG + D ++++ A N LH +P FGTHHSK M+LI +
Sbjct: 103 NLVKVHVVHGFWKKEDPNRLALQKDAEAYPNVELHGAFMPEMFGTHHSKMMVLIRHDDSA 162
Query: 269 RIIVHTANLIHVDWNNKSQGLW-------MQDFPLKDQNNLSEECG----FENDLIDYLS 317
++I+HTAN+I DW N + +W + D +D + G F++DL+ YL
Sbjct: 163 QVIIHTANMIVRDWTNMTNAVWRSPLLPLLSDEHAEDTSATDHPFGTGKRFKHDLLSYLR 222
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLR 375
++A P ++FSS IASVPG H +S WG L+
Sbjct: 223 A-----YNARRPITRTLVAQ---LCNYDFSSVRATFIASVPGRHPILDTSQTAWGWPALK 274
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-----LSSSMSSGFSEDKTPLGIG 428
L ++G +S +V Q SS+ +L + W+ + L+ S + S K +
Sbjct: 275 RALGSVPVQEG--ESEIVIQVSSIATLGPTDSWIQKCLFDSLAVSKNKSSSRPKPKFKV- 331
Query: 429 EPLIVWPTVEDVRCSLEGYAAGNAIPSP----QKNVDKDFLKKYWAKWK----------- 473
V+PT +++R SL+GYA+G +I + Q+ +L+ + W
Sbjct: 332 ----VFPTADEIRQSLDGYASGGSIHTKIQSQQQMKQLQYLRPIFCHWANDAPEGKILSE 387
Query: 474 ---ASHTGRSRAMPHIKTFARYNGQK-----------LAKAAWGALQKNNSQLMIRSYEL 519
GR RA PHIKT+ RY G+K ++K AWG + ++ + S+E+
Sbjct: 388 TAAIQKAGRERAAPHIKTYIRY-GEKSIDWALVTSANISKQAWGEAMGASQEVRVASWEV 446
Query: 520 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 579
GVL+ PS I + G+ ET + + G+ VV
Sbjct: 447 GVLVWPSI------------ITDNATMVGTFETDMPPR------------EGGSGDTVVG 482
Query: 580 LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
L +PY LP Q Y +++PW +T+ D G+ W
Sbjct: 483 LRIPYNLPLQSYGKDEIPWVASMAHTEPDRMGRFW 517
>gi|194855370|ref|XP_001968528.1| GG24923 [Drosophila erecta]
gi|190660395|gb|EDV57587.1| GG24923 [Drosophila erecta]
Length = 580
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/374 (32%), Positives = 185/374 (49%), Gaps = 45/374 (12%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLP---ACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K +LV++G+ L + + KP
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKPQ 232
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
+ + +P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 233 VTAI-RVRMPTPFATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P E GF+ DL+ YL K + + + +K +FS+ V
Sbjct: 292 PEDADTGAGESLTGFKQDLMLYLVEYKITQLQPWI----------ARIRKSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H SS++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 342 LGSVPGGHRESSVRGHPWGHARLGSLLSKHAAPID-DRIPVVCQSSSIGSLGVSVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPVGKLRQMPPFKMIYPSFGNVSGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNS---Q 511
Y +WK+S RSRAMPHIK++ R+N L+KAAWG KN++
Sbjct: 461 DYLQQWKSSDRYRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPC 520
Query: 512 LMIRSYELGVLILP 525
L I +YE+GVL LP
Sbjct: 521 LRIANYEVGVLFLP 534
>gi|307211789|gb|EFN87770.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 645
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 184/365 (50%), Gaps = 40/365 (10%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N+MVD+ WL + + +++++G+ + + N
Sbjct: 250 ILDRSLGEIVNSLHLNFMVDVGWLCLQYLLAGQRTDMMILYGDRVD-----QESLGCNIT 304
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL---- 296
+ +P +FG HH+K M+L Y G+RI+V TANL DW N++QGLW+ PL
Sbjct: 305 MIHVDMPSAFGCHHTKIMILQYKDDGIRIVVSTANLYSDDWENRTQGLWISPHLPLLPES 364
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
+ N+ F+ D YLS + P + + +K +FS+ V +AS
Sbjct: 365 ANSNDGESPTNFKKDFERYLSKYRHPALTQWI----------WIVRKADFSAVNVYFVAS 414
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
VPG H + WGH KL +L Q T + ++ Q SS+GSL + + LS +
Sbjct: 415 VPGTHKNVDVDFWGHRKLAQILSQHATLPPDAPQWSIIAQSSSIGSLGPNYESWLSREIV 474
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
S S + T P V+P++E+ + S + + +P S + + + +++ Y +W
Sbjct: 475 SSMSRETTQGLKSHPKFQFVYPSIENYKRSFDFQTLSSCLPYSLKVHSKQQWIESYLYQW 534
Query: 473 KASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYELG 520
KA+ TGR+RA+PHIK++ R + L+KAAWGA Q++N +M +YE G
Sbjct: 535 KATRTGRNRAIPHIKSYTRISPDLKSIPWFVLTSANLSKAAWGA-QRSNYYIM--NYEAG 591
Query: 521 VLILP 525
V+ LP
Sbjct: 592 VVFLP 596
>gi|350415522|ref|XP_003490669.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Bombus
impatiens]
Length = 697
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 127/439 (28%), Positives = 206/439 (46%), Gaps = 75/439 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D+ G+I+ ++ N+MVD+ WL + + + ++ G + K + I
Sbjct: 304 ILDISLGEIVKSLHINFMVDVGWLCLQYLLAGQRTDMSILFGT------RVDEEKLSLNI 357
Query: 243 LHKPP-LPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL--- 296
P +P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL
Sbjct: 358 TMIPVWMPTKFGCHHTKVMILKYKDDGIRVVVSTANLYSDDWENRTQGVWISPHLPLLAE 417
Query: 297 -KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
+ ++ GF+ DL YL + P + + A K+ NFSS V +A
Sbjct: 418 SANPSDGESPTGFKRDLERYLHKYEQPALTEWISA----------VKRANFSSVNVFFVA 467
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG HTG WG+ KL VL + + LV Q SS+GSL + + + +
Sbjct: 468 SVPGRHTGVEYDYWGYRKLGHVLSKHAKLPPDAPQWTLVVQSSSIGSLGPNYESWIQKEI 527
Query: 415 SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 471
S S++ P P ++P++ + + S + +P S Q + +++++ Y +
Sbjct: 528 ISSMSKENPPGLKSCPNFRFIYPSLNNYKQSFDCQVGSCCLPYSIQTHSKQEWVESYMYQ 587
Query: 472 WKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYEL 519
WKA+ T R +A+PHIKT+ R + L+KAAWG ++K++ ++ +YE
Sbjct: 588 WKATRTARDKAIPHIKTYTRISPNLEKIPWFVLTSANLSKAAWGMVRKDSHHIL--NYEA 645
Query: 520 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 579
GV+ +P +GST T I+K +AG V
Sbjct: 646 GVIFIP------------------HFVTGST-TFPIKK-----------EEAG----VPV 671
Query: 580 LPVPYELPPQRYSSEDVPW 598
P+PY+LP RY S D P+
Sbjct: 672 FPIPYDLPLTRYGSGDKPF 690
>gi|345487640|ref|XP_001604652.2| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Nasonia
vitripennis]
Length = 690
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 124/441 (28%), Positives = 200/441 (45%), Gaps = 73/441 (16%)
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH 244
D+ G+I+ ++ N+MV+I WL + A+ P + + G ++ P+N L
Sbjct: 295 DISLGEIVDSLHINFMVEIGWLCLQYLLAAQNPKMTIFCG----SVCDPNVALPSNITLV 350
Query: 245 KPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNN 301
+ +P +FG HHSK + Y G +RI+V TAN+ DW N++QGLWM PL + N
Sbjct: 351 EVNMPAAFGCHHSKISVFKYSDGGIRIVVSTANIYSDDWENRTQGLWMSPHLPPLPNSAN 410
Query: 302 LSE---ECGFENDLIDYLSTLKWPEFSA--NLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
S+ F+ +YL+ + P+ NL K+ + S+ V +AS
Sbjct: 411 PSDGESPTNFKKSFREYLNAYRNPKLVEWENL------------VKRADCSAVNVFFVAS 458
Query: 357 VPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
+PG H G SL WGH +L +L E + ++ Q SS+G+L + + + S++
Sbjct: 459 IPGSHKGLSLNSWGHRRLAAILNEHAVLPPDAPQWTIIAQSSSIGNLGPTFDSWIQSNIV 518
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKW 472
S +K P V+P++ + S + A +P +K+ +K ++LK Y +W
Sbjct: 519 FSLSREKAKGIKSNPNFHFVYPSLRNYEGSFDCKAGSCCLPYSRKSHEKQEWLKNYLYQW 578
Query: 473 KASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYELG 520
KA TGR++AMPH+K++ R + L+K AWG K I +YE G
Sbjct: 579 KADETGRTKAMPHVKSYTRISPDLTQIPWFVLTSANLSKGAWGTTAKTGVSHYIMNYEAG 638
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
V+ +P F P IK+ S S ++
Sbjct: 639 VVFIPK-------FVINQQTFP--IKTSS------------------------SPDIPVF 665
Query: 581 PVPYELPPQRYSSEDVPWSWD 601
+PY+LP RY DVP+ D
Sbjct: 666 RLPYDLPLTRYRQNDVPFVID 686
>gi|195470993|ref|XP_002087790.1| GE18215 [Drosophila yakuba]
gi|194173891|gb|EDW87502.1| GE18215 [Drosophila yakuba]
Length = 582
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 120/374 (32%), Positives = 186/374 (49%), Gaps = 45/374 (12%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K +LV++G+ L + + KP
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKPQ 232
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
+ + +P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 233 VTAI-RVRMPTPFATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P E GF+ DL+ YL K + + + +K +FS+ V
Sbjct: 292 PEDADTGAGESLTGFKQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H SS++ WGH +L ++L + + P++ Q SS+GSL A +
Sbjct: 342 LGSVPGGHRESSVRGHPWGHARLGSLLSKHATPID-DRIPVICQSSSIGSLGANVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPAGKLRQMPPFKMIYPSFGNVSGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYNGQK------------LAKAAWGALQKNNS---Q 511
Y +WK+S RSRAMPHIK++ R+N ++ L+KAAWG KN++
Sbjct: 461 DYLQQWKSSDRYRSRAMPHIKSYTRFNLEEQSVYWFVLTSANLSKAAWGCFNKNSNIQPC 520
Query: 512 LMIRSYELGVLILP 525
L I +YE+GVL LP
Sbjct: 521 LRIANYEVGVLFLP 534
>gi|194771042|ref|XP_001967588.1| GF20606 [Drosophila ananassae]
gi|190615089|gb|EDV30613.1| GF20606 [Drosophila ananassae]
Length = 576
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 124/388 (31%), Positives = 192/388 (49%), Gaps = 50/388 (12%)
Query: 173 PAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGES 226
P + V++++++ G+I + N+MVDI WLL +L K +LV++G+
Sbjct: 158 PTHSEPLSVTLQEILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDE 215
Query: 227 DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNN 284
L + + KP I K P P F T H+K MLL Y G +R+++ TANL DW+N
Sbjct: 216 SPELLSIGKFKPQVTAIGVKMPTP--FATSHTKMMLLAYNDGSMRVVISTANLYEDDWHN 273
Query: 285 KSQGLWMQ-DFPL----KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
++QG+W+ P D + GF+ DL+ YL K + + +
Sbjct: 274 RTQGVWISPKLPELHEDADTGAGESQTGFKQDLMLYLVEYKISQLQPWI----------A 323
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFS 397
+K +FS+ V + SVPG H S+++ WGH +L +L + + P+V Q S
Sbjct: 324 RIRKSDFSAINVFFLGSVPGGHRESTVRGHPWGHARLGALLAKHATPIN-DRIPVVCQSS 382
Query: 398 SLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAI 453
S+GSL A + + +D TPLG + +++P+ +V S +G G +
Sbjct: 383 SIGSLGANVQAWIQQDFVNSLKKDSTPLGKLRQMPTFKMIYPSFGNVSGSHDGMLGGGCL 442
Query: 454 PSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKA 500
P + DK +LK + +WK++ RSRAMPHIKT+ RYN L+KA
Sbjct: 443 PYGKNTNDKQPWLKDHLHQWKSNDRYRSRAMPHIKTYTRYNLEDQSVYWFVLTSANLSKA 502
Query: 501 AWGALQKNNSQ---LMIRSYELGVLILP 525
AWG KN++ L I +YE GVL LP
Sbjct: 503 AWGCFNKNSNVQPCLRIANYEAGVLFLP 530
>gi|195118058|ref|XP_002003557.1| GI21930 [Drosophila mojavensis]
gi|193914132|gb|EDW12999.1| GI21930 [Drosophila mojavensis]
Length = 596
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 139/452 (30%), Positives = 212/452 (46%), Gaps = 83/452 (18%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I ++ N+M+DI WLL +L+K +LV++G D L + + KP
Sbjct: 191 IFDESLGEIESSVQINFMIDIGWLLGHYYFAGILSK--PLLVLYGADDPNLVDIGKFKPQ 248
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PL 296
+ K + F T H+K MLL Y G +R+++ TANL DW+N++QGLWM PL
Sbjct: 249 VTAI-KVQMQSPFATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWMSPRLPPL 307
Query: 297 -KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
+D + + E GF+ DL+ YL K + + + +K +FS+ V
Sbjct: 308 PEDADTAAGESPTGFKQDLMLYLVEYKLSQLQPWI----------ARIRKSDFSAINVFF 357
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAE 409
I SVPG H S+++ WG +L ++L + E P+V Q SS+GSL A
Sbjct: 358 IGSVPGGHRESAVRGHPWGCARLGSLLAKHAAPVEPNI---PVVCQSSSIGSLGANVQAW 414
Query: 410 LSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-F 464
+ + S F +D +P+G L +++P+ +V S +G G +P + DK +
Sbjct: 415 IEQDILSNFRKDSSPIGRLSQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYGKSTNDKQPW 474
Query: 465 LKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGAL-QKNNSQ 511
LK Y +WK+ RS+AMPHIK++ R+N L+KAAWGA +K+N Q
Sbjct: 475 LKNYLHQWKSGDRHRSQAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGAFNKKSNLQ 534
Query: 512 --LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSS 569
L I +YE GVL LP F + P
Sbjct: 535 PCLRIFNYEAGVLFLPK-------FVTGEDTFPL-------------------------- 561
Query: 570 DAGASSEVVYLPVPYELPPQRYSSEDVPWSWD 601
A + V P+PY++P Y +D P+ D
Sbjct: 562 -GNARNGVPAFPLPYDVPLTPYGPDDTPFLMD 592
>gi|125984342|ref|XP_001355935.1| GA28884 [Drosophila pseudoobscura pseudoobscura]
gi|54644254|gb|EAL32995.1| GA28884 [Drosophila pseudoobscura pseudoobscura]
Length = 576
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 126/377 (33%), Positives = 188/377 (49%), Gaps = 51/377 (13%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP- 238
I D G+I ++ N+MVDI WLL +L K +LV++G+ L + + KP
Sbjct: 171 IFDESLGEIESSVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQ 228
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL- 296
I K P P F T H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL
Sbjct: 229 VTAIGVKMPTP--FATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLL 284
Query: 297 ----KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+D + + E GF DL+ YL K + + + +K +FS+
Sbjct: 285 PALSEDADTAAGESLTGFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAIN 334
Query: 351 VRLIASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA 408
V + SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A
Sbjct: 335 VFFVGSVPGGHREGSVRGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQA 393
Query: 409 ELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD- 463
+ + +D +P G + +++P+ +V S +G G +P + DK
Sbjct: 394 WIQQDFVNSLRKDSSPGGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQP 453
Query: 464 FLKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQ 511
+LK + +WK+S RSRAMPHIKT+ RYN L+KAAWG+ KN +
Sbjct: 454 WLKAHLQQWKSSDRHRSRAMPHIKTYTRYNLTDQSVYWFVLTSANLSKAAWGSFNKNTNL 513
Query: 512 ---LMIRSYELGVLILP 525
L I +YE GVL LP
Sbjct: 514 QPCLRIANYEAGVLFLP 530
>gi|308462649|ref|XP_003093606.1| hypothetical protein CRE_02619 [Caenorhabditis remanei]
gi|308249623|gb|EFO93575.1| hypothetical protein CRE_02619 [Caenorhabditis remanei]
Length = 462
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 142/474 (29%), Positives = 219/474 (46%), Gaps = 121/474 (25%)
Query: 182 SIRDVIQGDI--IVAILSNYMVDIDWLLPACP--VLAKIPHVLVIHGESDGTLEHMKRNK 237
S+ D++ DI I ++ N+M+D ++L+ + P + P LV+ L
Sbjct: 67 SLEDIL-ADIRPISSLHMNFMIDFEFLVNSYPPSLRTTTPITLVVGAPDVSDLRKSTLQY 125
Query: 238 PANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
P N +H LPI FGTHHSK +L G + +IV TANLI DW K+Q + +
Sbjct: 126 P-NVTVHSASLPIPFGTHHSKLSILESDDGFIHVIVSTANLISDDWEFKTQQFYYA-MGM 183
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP-SFFKKF----NFSSAAV 351
+ ++ E F+ DLI+YLS + NP S +KK +FS+
Sbjct: 184 RREDEF-ERSPFQEDLIEYLS----------------YYSNPLSTWKKLIESTDFSTVTD 226
Query: 352 RLIASVPGYHTGSS-LKKWGHMKLRTVL-QECTFEKGFK---KSPLVYQFSSLGSLDEKW 406
RLI S PGYHT + + GH +L T+L Q+ F+ ++ + + Q SS+GSL
Sbjct: 227 RLIFSTPGYHTDPQHVSRLGHPRLSTILSQKFPFDPKYEHTDRCTFIAQCSSIGSL---- 282
Query: 407 MAELSSSMSSGFS-------EDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIPSP 456
S+ SS F E P +P +V+P VEDVR S +GYA G ++P
Sbjct: 283 ----GSAPSSWFRGQFLKSLEAANPAPKNKPPKMYLVFPCVEDVRNSCQGYAGGGSVPYR 338
Query: 457 QKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA-----------KAAWGA 504
D+ +L+ + KW+++ R++A+PH KT+ +Y+ QK+A KAAWG
Sbjct: 339 NSVHDRQKWLQDFMCKWRSNTKRRTKAVPHCKTYVKYD-QKIAQWQLLTSANVSKAAWGE 397
Query: 505 L----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKL 560
+ +KN QLMIRS+E+GVLI T+ S+
Sbjct: 398 MSFSKKKNVDQLMIRSWEIGVLI--------------------------TDPSRFN---- 427
Query: 561 VTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+P++ P YS D P++ D+++ + D+ G VW
Sbjct: 428 ---------------------IPFDYPCVPYSPTDRPFTTDQKHEQPDILGCVW 460
>gi|340710910|ref|XP_003394026.1| PREDICTED: LOW QUALITY PROTEIN: probable tyrosyl-DNA
phosphodiesterase-like [Bombus terrestris]
Length = 697
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 126/439 (28%), Positives = 206/439 (46%), Gaps = 75/439 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D+ G+I+ ++ N+MVD+ WL + + + +++G + + K + I
Sbjct: 304 ILDISLGEIVKSLHINFMVDVGWLCLQYLLAGQRTDMSIMYGS------RVDKEKLSLNI 357
Query: 243 LHKPP-LPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL--- 296
P +P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL
Sbjct: 358 TMIPVWIPTKFGCHHTKVMILKYKDDGIRVVVSTANLYSCDWENRTQGVWISPHLPLLAE 417
Query: 297 -KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
+ ++ GF+ DL YL + + A ++ NFSS V +A
Sbjct: 418 SANPSDGESPTGFKRDLERYLHKYHQRGLTEWISA----------VRRANFSSVNVFFLA 467
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG HTG WG+ KL VL + + LV Q SS+GS + + + +
Sbjct: 468 SVPGKHTGVEYDYWGYRKLGQVLSKHAKLPPDAPQWTLVAQSSSIGSFGPNYESWIQKEI 527
Query: 415 SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 471
S S++ P +P ++P++ + + S + +P S + + +++L+ Y +
Sbjct: 528 VSSMSKENPPGLKSQPNFQFIYPSINNYKQSFDCQVGSCCLPYSIETHSKQEWLESYMYQ 587
Query: 472 WKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYEL 519
WKA+ T R +A+PHIKT+ R + L+KAAWG ++ ++ L I +YE
Sbjct: 588 WKATRTARDKAIPHIKTYTRISPNLEKIPWFVLTSANLSKAAWGIVRVDS--LHILNYEA 645
Query: 520 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 579
GV+ +P +GST T I+K +AG V
Sbjct: 646 GVIFIP------------------HFVTGST-TFPIKK-----------EEAG----VPV 671
Query: 580 LPVPYELPPQRYSSEDVPW 598
P+PY+LP RY SED P+
Sbjct: 672 FPIPYDLPLTRYGSEDKPF 690
>gi|158293221|ref|XP_558110.3| AGAP010577-PA [Anopheles gambiae str. PEST]
gi|157016854|gb|EAL40355.3| AGAP010577-PA [Anopheles gambiae str. PEST]
Length = 584
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 138/461 (29%), Positives = 210/461 (45%), Gaps = 83/461 (18%)
Query: 173 PAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHV--LVIHGESD 227
P A V+ ++++ G++ ++ N+MVDI WLL A A +V L+++G+
Sbjct: 169 PTHAEPLSVTFQELLDSSLGELECSVQMNFMVDIGWLL-AHYFFAGYENVPLLILYGDET 227
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKS 286
L + + KP N K + FG HH+K L Y G +R++V TANL DW+N++
Sbjct: 228 PELRMVSQKKP-NVTAVKVEIKTPFGVHHTKMGLYGYRDGSMRVVVSTANLYEDDWHNRT 286
Query: 287 QGLWMQD----FPLKDQNNLSE-ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFF 341
QGLW+ P E F + L+ YL K P+ + +
Sbjct: 287 QGLWISPRLPAVPEGSDTTYGESRSDFRSSLLTYLDAYKLPQLQPWM----------ARI 336
Query: 342 KKFNFSSAAVRLIASVPGYHTGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
+K +FS V L+ASVPG HT ++ WGH +L +L + PLV Q SS+G
Sbjct: 337 RKTDFSDVKVFLVASVPGGHTNTAKGPLWGHPRLGYLLSQHAAPID-DSCPLVAQSSSIG 395
Query: 401 SLD---EKWMAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIP 454
SL E W+ L M+S F +D P+GI +++P+ +VR S +G G +P
Sbjct: 396 SLGPSPESWV--LGEIMAS-FRKDSAPVGIRRLPGFRMIYPSFSNVRQSHDGMMGGGCLP 452
Query: 455 SPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ----------KLAKAAWG 503
+ +V +++LK Y +W + R++AMPHIKT+ R++ + L+KAAWG
Sbjct: 453 YVRSTHVKQEWLKDYLQQWCSRARHRNKAMPHIKTYCRWSHRGLYWFLLTSANLSKAAWG 512
Query: 504 ALQKN---NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKL 560
K L I SYE GVL LP N P E
Sbjct: 513 VYNKTGRFEKPLRINSYEAGVLFLPK-------LLLDENFFPME---------------- 549
Query: 561 VTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWD 601
A+ + P+PY++P Y+ ED P+ D
Sbjct: 550 ------------ANKKHPQFPMPYDVPTIPYAPEDTPFFMD 578
>gi|157109623|ref|XP_001650753.1| tyrosyl-dna phosphodiesterase [Aedes aegypti]
gi|108868427|gb|EAT32652.1| AAEL015141-PA [Aedes aegypti]
Length = 624
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 131/444 (29%), Positives = 205/444 (46%), Gaps = 70/444 (15%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPH-VLVIHGESDGTLEHMKRNKPANW 241
+ D G++ ++ N+MVDI WLL +L+++G+ L+ + KP N
Sbjct: 222 LLDTSLGELECSVQMNFMVDIGWLLGHYFFAGYEDRPLLILYGDETPELKMVSSKKP-NV 280
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKD 298
K + FG HH+K L Y G +R++V TANL DW+N++QGLW+ P+ +
Sbjct: 281 TAVKVHIATPFGVHHTKMGLYGYTDGSMRVVVSTANLYEDDWHNRTQGLWVSPRLPPMPE 340
Query: 299 QNNLS---EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
++ + GF +LI YL++ K G+ + + +K NFS V L+A
Sbjct: 341 DSDTGAGDSKTGFRENLITYLNSYKI----------GHLQPWVARIQKTNFSEVNVFLVA 390
Query: 356 SVPGYHTGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG H + WGH ++ +L + + PLV Q SS+GSL + + S +
Sbjct: 391 SVPGGHLNTPKGPLWGHPRMGYLLGKHSAPID-DSCPLVAQSSSIGSLGPNPQSWVLSEV 449
Query: 415 SSGFSEDKTPLGIGE-PL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWA 470
+ F D P+G+ P +++P+ +VR S + G +P + DK +LK Y
Sbjct: 450 LASFRRDSAPIGLRRVPAFKMIFPSFSNVRNSHDHLLGGGCLPYMKATHDKQVWLKDYLH 509
Query: 471 KWKASHTGRSRAMPHIKTFARYNGQ----------KLAKAAWGALQKN---NSQLMIRSY 517
+WK+ R++A+PHIKT+ R++ + L+KAAWG K+ + L I SY
Sbjct: 510 QWKSDSRNRTKAVPHIKTYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSY 569
Query: 518 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 577
E GVL LP F N P E K G
Sbjct: 570 EAGVLFLPK-------FVIEENFFPMESKPGQQHPQ------------------------ 598
Query: 578 VYLPVPYELPPQRYSSEDVPWSWD 601
P+PY++P Y+ ED P+ D
Sbjct: 599 --FPMPYDVPIIPYALEDTPFFMD 620
>gi|321478262|gb|EFX89219.1| hypothetical protein DAPPUDRAFT_310135 [Daphnia pulex]
Length = 580
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 128/407 (31%), Positives = 197/407 (48%), Gaps = 58/407 (14%)
Query: 161 PSTFRLLRVQGLP-AWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPA-CPVLAK 215
P + L ++ +P W + ++ D++ G + ++ N+MV++ WLL C +
Sbjct: 151 PVCYFLSSIENVPETWDQSLTLTFSDLLHPSLGVLQESVQFNFMVELGWLLAQYCQHKVQ 210
Query: 216 IPHVLVIHG-ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVH 273
+LVI+G ES+ R + I KP P FG+HH+K ++ Y G +RI+VH
Sbjct: 211 RKPMLVIYGTESEELAAAQSRVPTLHTIRVKPKYP--FGSHHTKMSMMSYEDGNLRIVVH 268
Query: 274 TANLIHVDWNNKSQGLWMQDF--PLKDQNN-----------LSEECGFENDLIDYLSTLK 320
T NLI DW +++QGLW+ PL ++N GF+ DLI YL
Sbjct: 269 TGNLIESDWEDRTQGLWISPSCPPLSSKDNEKIGDGDSIGDGDSITGFKRDLIRYLE--- 325
Query: 321 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS-----LKKWGHMKLR 375
S +L A K ++ + SS V I S PG H S + KWGH+ L
Sbjct: 326 ----SYSLSA---LKPWIEKIRQADMSSIKVCFIPSSPGSHAIQSEANEKVPKWGHLHLS 378
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWMA-ELSSSMSSGFSEDKTPLGIGEPL 431
+LQ+ + ++ Q SS+GSL W+A EL SM G S T LG
Sbjct: 379 WLLQQHASSEA--DDSIIMQCSSIGSLGPSPSSWLAGELGVSM--GASSGVTKLGQKNVQ 434
Query: 432 IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFA 490
+V+P +DV+ S+ G G +P S Q + + + + KW++ R+ AMPHIK++A
Sbjct: 435 VVYPCFQDVKSSIHGLLGGGCLPYSHQGHNKQTWFTGFLHKWRSDSRLRTTAMPHIKSYA 494
Query: 491 RYNGQ------------KLAKAAWGALQKNNSQLMIRSYELGVLILP 525
R + ++KAAWG +++LMI+S+E GVL LP
Sbjct: 495 RVSSDMSRASFFVLTSANVSKAAWGMRINKDTKLMIQSFEAGVLFLP 541
>gi|157129902|ref|XP_001661809.1| tyrosyl-dna phosphodiesterase [Aedes aegypti]
gi|108872048|gb|EAT36273.1| AAEL011629-PA [Aedes aegypti]
Length = 536
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 131/444 (29%), Positives = 205/444 (46%), Gaps = 70/444 (15%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPH-VLVIHGESDGTLEHMKRNKPANW 241
+ D G++ ++ N+MVDI WLL +L+++G+ L+ + KP N
Sbjct: 134 LLDTSLGELECSVQMNFMVDIGWLLGHYFFAGYEDRPLLILYGDETPELKMVSSKKP-NV 192
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKD 298
K + FG HH+K L Y G +R++V TANL DW+N++QGLW+ P+ +
Sbjct: 193 TAVKVHIATPFGVHHTKMGLYGYTDGSMRVVVSTANLYEDDWHNRTQGLWVSPRLPPMPE 252
Query: 299 QNNLS---EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
++ + GF +LI YL++ K G+ + + +K NFS V L+A
Sbjct: 253 DSDTGAGDSKTGFRENLITYLNSYKI----------GHLQPWVARIQKTNFSEVNVFLVA 302
Query: 356 SVPGYHTGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG H + WGH ++ +L + + PLV Q SS+GSL + + S +
Sbjct: 303 SVPGGHLNTPKGPLWGHPRMGYLLGKHSAPID-DSCPLVAQSSSIGSLGPNPQSWVLSEV 361
Query: 415 SSGFSEDKTPLGIGE-PL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWA 470
+ F D P+G+ P +++P+ +VR S + G +P + DK +LK Y
Sbjct: 362 LASFRRDSAPIGLRRVPAFKMIFPSFSNVRNSHDHLLGGGCLPYMKATHDKQVWLKDYLH 421
Query: 471 KWKASHTGRSRAMPHIKTFARYNGQ----------KLAKAAWGALQKN---NSQLMIRSY 517
+WK+ R++A+PHIKT+ R++ + L+KAAWG K+ + L I SY
Sbjct: 422 QWKSDSRNRTKAVPHIKTYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSY 481
Query: 518 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 577
E GVL LP F N P E K G
Sbjct: 482 EAGVLFLPK-------FVIEENFFPMESKPGQQHPQ------------------------ 510
Query: 578 VYLPVPYELPPQRYSSEDVPWSWD 601
P+PY++P Y+ ED P+ D
Sbjct: 511 --FPMPYDVPIIPYALEDTPFFMD 532
>gi|156400100|ref|XP_001638838.1| predicted protein [Nematostella vectensis]
gi|156225962|gb|EDO46775.1| predicted protein [Nematostella vectensis]
Length = 260
Score = 155 bits (391), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 103/289 (35%), Positives = 146/289 (50%), Gaps = 57/289 (19%)
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLVYQFSSLGSLDE---- 404
VRLIASVPG H G + KWGH+KLR +LQE + P++ QFSS+GSL
Sbjct: 1 VRLIASVPGRHAGLNKNKWGHLKLRKILQEHGPPSSDVTTNWPVIGQFSSIGSLGPDKNK 60
Query: 405 ----KWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKN 459
+W+ L+++ F G PL +V+PTV++VR +L +AG +IP K
Sbjct: 61 WLCGEWLQSLAATCGRTF-------GSNAPLKLVFPTVDNVRTTLWFISAGGSIPYSHKT 113
Query: 460 VDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWGALQ 506
+K +L ++ W A+ GRSRA PHIKT+ R L+KAAWG L+
Sbjct: 114 AEKQPYLPSFFCSWNATSRGRSRASPHIKTYMRTSPDHSRLAWFMVTSSNLSKAAWGVLE 173
Query: 507 KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH 566
K SQLMIRSYE+GVL LP+ + T+ I + + +
Sbjct: 174 KGGSQLMIRSYEIGVLFLPADQ--------------------VTDREAIDQCRDIL---- 209
Query: 567 GSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
+ + ++ VP++LPP YS ++ PW WD RY K D G +W
Sbjct: 210 -GGNRLSDEPCTHVHVPFDLPPSPYSDDEKPWMWDVRYLDKPDTNGNIW 257
>gi|195435334|ref|XP_002065649.1| GK15563 [Drosophila willistoni]
gi|194161734|gb|EDW76635.1| GK15563 [Drosophila willistoni]
Length = 572
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 198/394 (50%), Gaps = 62/394 (15%)
Query: 173 PAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGES 226
P + V++++++ G+I + N+MVDI WLL +LAK ++V++G+
Sbjct: 154 PTHSEPLSVTLQEILDESLGEIESTVQINFMVDIGWLLGHYYFAGILAK--PLIVLYGDE 211
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNK 285
L ++ + KP + K +P F T H+K MLL Y G +R+++ TANL DW+N+
Sbjct: 212 SPELLNISKLKPQVTAI-KVQMPTPFATSHTKMMLLAYTDGSMRVVISTANLYEDDWHNR 270
Query: 286 SQGLWMQ-DFPLKDQNNLSEEC---------GFENDLIDYLSTLKWPEFSANLPAHGNFK 335
+QG+W+ P LSEE GF+ DL+ YL K + +
Sbjct: 271 TQGVWISPRLPA-----LSEEADTAAGESKTGFKQDLMLYLVEYKLTQLQPWI------- 318
Query: 336 INPSFFKKFNFSSAAVRLIASVPGYHTGSSLK--KWGHMKLRTVLQE--CTFEKGFKKSP 391
+ +K +FS+ V LIASVPG H S++ WGH +L ++L + E + P
Sbjct: 319 ---ARIRKSDFSAINVFLIASVPGGHREGSVRGHPWGHARLGSLLAKHAAPIED---RIP 372
Query: 392 LVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGY 447
+V Q SS+GSL A + + +D + +G L +++P+ +V S +G
Sbjct: 373 VVCQSSSIGSLGPNVQAWIQQDFVNSLRKDSSTVGRLRQLPPFKMIYPSFGNVSRSHDGM 432
Query: 448 AAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN------------G 494
G +P + DK +LK++ +WK+ R++AMPHIK + RYN
Sbjct: 433 LGGGCLPYGKNTNDKQPWLKEHLQQWKSGDRYRNQAMPHIKCYTRYNLENQSVYWFVLTS 492
Query: 495 QKLAKAAWGALQKNNS---QLMIRSYELGVLILP 525
L+KAAWG+ KN++ L I +YE GVL LP
Sbjct: 493 ANLSKAAWGSFNKNSNIQPCLRIANYEAGVLFLP 526
>gi|358365748|dbj|GAA82370.1| tyrosyl-DNA phosphodiesterase [Aspergillus kawachii IFO 4308]
Length = 585
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 138/529 (26%), Positives = 218/529 (41%), Gaps = 116/529 (21%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPA------- 209
+PS +L ++ LPA + NT V +RD++ +I NY+ D+D+L+
Sbjct: 97 IPSPIQLTHIRDLPASSGHNTDTVRLRDILGDPLIRECWQFNYLFDVDFLMSQFDEDVRR 156
Query: 210 --------------CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTH 255
P +I H + +M P +FGTH
Sbjct: 157 LVKVKVVHGSWKRDAPNRQRIDEACTRHPNVEAITAYM---------------PEAFGTH 201
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-----F 308
HSK M+L+ + ++++HTAN+I DW N Q +W PL ++ SE F
Sbjct: 202 HSKMMILLRHDDLAQVVIHTANMIAGDWANMCQAVWRSPLLPLCSNSSGSESIATPGTRF 261
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH----- 361
+ DL+ YL +G K P + +K +FS+ L+ASVP
Sbjct: 262 KRDLLSYLR------------EYGPKKTGPLVAQLEKHDFSTVRAALVASVPSKQKIRES 309
Query: 362 TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGF 418
T S+ K WG + LR VL+ ++ + +V Q SS+ SL +KW+ ++ + S
Sbjct: 310 TDSTRKTLWGWLALRDVLRSVPIDRSEDRPHIVTQISSVASLGQTDKWLKDVFFTSLSPS 369
Query: 419 SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA 474
S P I++PT +++R SL GY +G +I S + +++ Y W
Sbjct: 370 SNTPKPRFS----IIFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRSYLCHWAG 425
Query: 475 S----------HTGRSRAMPHIKTFARYNGQK-------------LAKAAWGALQKNNSQ 511
GR RA PHIKT+ RY+ + L+ AWGA N +
Sbjct: 426 DGAEDEVKVKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWGAAVNANGE 485
Query: 512 LMIRSYELGVLILPS------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTW 565
+ I S+E+GV++ P A+ C VP + + K + T
Sbjct: 486 VRICSWEIGVVVWPELVTGAGAEGRSVMVPCFRRDVPDADAVAAAGAAANANVKEIPTT- 544
Query: 566 HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
V +PY+LP RYS D+PW +++ D GQ W
Sbjct: 545 ----------TTVGFRMPYDLPLTRYSETDIPWCATASHSEPDWLGQTW 583
>gi|389628810|ref|XP_003712058.1| hypothetical protein MGG_06176 [Magnaporthe oryzae 70-15]
gi|351644390|gb|EHA52251.1| hypothetical protein MGG_06176 [Magnaporthe oryzae 70-15]
gi|440474085|gb|ELQ42852.1| tyrosyl-DNA phosphodiesterase 1 [Magnaporthe oryzae Y34]
gi|440485911|gb|ELQ65827.1| tyrosyl-DNA phosphodiesterase 1 [Magnaporthe oryzae P131]
Length = 555
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 147/507 (28%), Positives = 221/507 (43%), Gaps = 88/507 (17%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAKIPHV 219
S FRL R++ L N + + D+I GD ++A NY+ DI++LL A +
Sbjct: 83 SPFRLTRIRDLGEEDNADALGLNDII-GDPLIAECWDFNYLHDIEFLLDALDQDVRDVVK 141
Query: 220 LVI------HGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 272
+ + + L K N +LH LP FGTHHSK ++L+ + ++I+
Sbjct: 142 VHVVHGFWKKDDPSRILLQDDAEKHKNVVLHTAFLPEIFGTHHSKMLVLLRHDDTAQVII 201
Query: 273 HTANLIHVDWNNKSQGLWMQ-DFPL---------KDQNNLSEECG--FENDLIDYLSTLK 320
HTAN+I DW N + G+W+ PL NL+E G F+ DL++YL
Sbjct: 202 HTANMIPKDWTNMTNGIWLSPRLPLLQGQDPADASQYENLAEGTGYKFKIDLLNYLRA-- 259
Query: 321 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLRTVL 378
+ + N +K++FSS LIASVPG H T S WG + ++ L
Sbjct: 260 ---YDDKRVVCRDLVTN---LEKYDFSSIRGTLIASVPGRHDFTDLSTSAWGWVAIKRAL 313
Query: 379 QECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWP 435
+ + G KS +V Q SS+ +L + W+ L SM G + P + I++P
Sbjct: 314 RSVPLQVG--KSEVVTQISSIATLGPTDTWLQRTLFESMCRGKTTGVAPR--PQFKIIFP 369
Query: 436 TVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWKAS--------------HT 477
T +++R SL+GY +G + I S Q+ + K W
Sbjct: 370 TADEIRRSLDGYGSGGSIHTKIQSSQQAKQLIYQKPLLCHWANDSPHGQDLGQNIPILDA 429
Query: 478 GRSRAMPHIKTFARYNGQ----------KLAKAAWGALQKNNSQLMIRSYELGVLILPSA 527
GR+RA PHIKT+ RY L+K AWG SQ I S+E+GVL+ P
Sbjct: 430 GRNRAAPHIKTYIRYGANSIDWALLSSANLSKQAWGDATGAGSQTRISSWEIGVLVWPE- 488
Query: 528 KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELP 587
++ + +K +T + T L VV L PY LP
Sbjct: 489 ------LFAKDALMTTVVKK---DTPSRETTNLC-----------PGRPVVGLRSPYSLP 528
Query: 588 PQRYSSEDVPWSWDKRYTKKDVYGQVW 614
Q+Y + +VPW Y++ D G W
Sbjct: 529 VQKYGNGEVPWVATLSYSEPDWAGNTW 555
>gi|317027510|ref|XP_001399437.2| tyrosyl-DNA phosphodiesterase [Aspergillus niger CBS 513.88]
Length = 568
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 139/523 (26%), Positives = 217/523 (41%), Gaps = 117/523 (22%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPA------- 209
+PS +L ++ LPA + NT V +RD++ +I NY+ D+D+L+
Sbjct: 93 IPSPIQLTHIRDLPASSGHNTDTVRLRDILGDPLIRECWQFNYLFDVDFLMSQFDEDVRR 152
Query: 210 --------------CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTH 255
P +I H + + +M P +FGTH
Sbjct: 153 LVKVKVVHGSWKRDAPNRQRIDEACTRHPNVEAIVAYM---------------PEAFGTH 197
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-----F 308
HSK M+L+ + ++++HTAN+I DW N Q +W PL + SE F
Sbjct: 198 HSKMMILLRHDDLAQVVIHTANMIAGDWANMCQAVWRSPLLPLCSDGSGSENIATPGARF 257
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH----- 361
+ DL+ YL +G K P + +K +FS+ LIASVP
Sbjct: 258 KRDLLSYLRE------------YGQRKTGPLVAQLEKHDFSAVRAALIASVPSKQKIRES 305
Query: 362 TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGF 418
T S+ K WG + LR VL+ + +V Q SS+ SL + KW+ ++ + S
Sbjct: 306 TDSNQKTLWGWLALRDVLRSVPVSPSEDRPHIVTQISSVASLGQTDKWLKDVFFASLSPS 365
Query: 419 SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA 474
S + P IV+PT +++R SL GY +G +I S + +++ Y W
Sbjct: 366 SNNPKPRFS----IVFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLCHWAG 421
Query: 475 S----------HTGRSRAMPHIKTFARYNGQK-------------LAKAAWGALQKNNSQ 511
GR RA PHIKT+ RY+ + L+ AWGA N +
Sbjct: 422 DVAEDEVKMKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWGAAVNANGE 481
Query: 512 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 571
+ I S+E+GV++ P G G S ++P + ++I T V
Sbjct: 482 VRICSWEIGVVVWPELI-AGAGAEGRSVMMPCFRRDMPDADAEIPTTTTVGFR------- 533
Query: 572 GASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+PY+LP RY D+PW +++ D GQ W
Sbjct: 534 ----------MPYDLPLTRYGETDIPWCATASHSEPDWLGQTW 566
>gi|134056346|emb|CAK47581.1| unnamed protein product [Aspergillus niger]
Length = 559
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 137/511 (26%), Positives = 214/511 (41%), Gaps = 102/511 (19%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKI 216
+PS +L ++ LPA + NT V +RD++ +I NY+ D+D+L+
Sbjct: 93 IPSPIQLTHIRDLPASSGHNTDTVRLRDILGDPLIRECWQFNYLFDVDFLMSQ------- 145
Query: 217 PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV----RIIV 272
E + H + +P +FGTHHSK M+L+ + R+++
Sbjct: 146 ------FDEDEACTRHPNVEAIVAY------MPEAFGTHHSKMMILLRHDDLAHEHRVVI 193
Query: 273 HTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-----FENDLIDYLSTLKWPEFSA 326
HTAN+I DW N Q +W PL + SE F+ DL+ YL
Sbjct: 194 HTANMIAGDWANMCQAVWRSPLLPLCSDGSGSENIATPGARFKRDLLSYLRE-------- 245
Query: 327 NLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH-----TGSSLKK-WGHMKLRTVL 378
+G K P + +K +FS+ LIASVP T S+ K WG + LR VL
Sbjct: 246 ----YGQRKTGPLVAQLEKHDFSAVRAALIASVPSKQKIRESTDSNQKTLWGWLALRDVL 301
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIGEPLIVWPT 436
+ + +V Q SS+ SL + KW+ ++ + S S + P IV+PT
Sbjct: 302 RSVPVSPSEDRPHIVTQISSVASLGQTDKWLKDVFFASLSPSSNNPKPRFS----IVFPT 357
Query: 437 VEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS----------HTGRSRA 482
+++R SL GY +G +I S + +++ Y W GR RA
Sbjct: 358 PDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLCHWAGDVAEDEVKMKREAGRRRA 417
Query: 483 MPHIKTFARYNGQK-------------LAKAAWGALQKNNSQLMIRSYELGVLILPS--- 526
PHIKT+ RY+ + L+ AWGA N ++ I S+E+GV++ P
Sbjct: 418 APHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWGAAVNANGEVRICSWEIGVVVWPELIA 477
Query: 527 ---AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVP 583
A+ C +P + + + K + T V +P
Sbjct: 478 GAGAEGRSVMMPCFRRDMPDADAVAAADANANADKKEIPTT-----------TTVGFRMP 526
Query: 584 YELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
Y+LP RY D+PW +++ D GQ W
Sbjct: 527 YDLPLTRYGETDIPWCATASHSEPDWLGQTW 557
>gi|406865596|gb|EKD18637.1| tyrosyl-DNA phosphodiesterase [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 532
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 151/565 (26%), Positives = 239/565 (42%), Gaps = 105/565 (18%)
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNF----HVSRDKLP----- 161
SR ++++S+D ++ + +D+ D N KN ++ + RD+ P
Sbjct: 10 SRKRRKLSSD--------DEETQSEDDTDQNNKKNLPYSITRSISPPPLRRDREPEVQVA 61
Query: 162 ----STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAK 215
S F+L ++ LP N VS+++++ I NY+ D+++L+ A +
Sbjct: 62 KVLKSPFQLTCIKDLPEAVNKDAVSLKNILGDPTITECWEFNYLHDLEFLMEAFHDDVRD 121
Query: 216 IPHVLVIHG------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV- 268
V V+HG S L+ + P N LH +P FGTHHSK ++L+
Sbjct: 122 RTKVHVVHGFWKSEDASRLNLQAQAKKYP-NITLHTAYMPEMFGTHHSKMLVLLRKYDTA 180
Query: 269 RIIVHTANLIHVDWNNKSQGLWMQDFP--------LKDQNNLSEECGFENDLIDYLSTLK 320
+I++HTAN+ DW+N +Q W+ L+D + F+ D ++YL
Sbjct: 181 QIVIHTANMQAFDWDNMTQAAWISPLLPQIREKELLEDTEPIGSGSRFKFDFLNYLRAYD 240
Query: 321 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTVL 378
P G K NFS+ L+ASVPG + S K WG L+ L
Sbjct: 241 TKRVICK-PLVGKLM-------KHNFSAIRGALVASVPGKQSIKSDSKTLWGWAGLKKAL 292
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
+ K+ +V Q SS+ +L EKW+ + + ++ + + IV+PT +
Sbjct: 293 EAVPVRS--KEGEIVIQISSIATLSEKWIDK--TLFAAMSTSKSHGSSKSKFKIVFPTAD 348
Query: 439 DVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA------------SHTGRSRA 482
++R SL GY +G+AI S + LK W S GR RA
Sbjct: 349 EIRRSLNGYNSGSAIHTKIQSHAQARQLQLLKPMLCHWAGDSDEKGPSSAPVSDAGRKRA 408
Query: 483 MPHIKTFARY-------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKR 529
PHIKTF R+ L+K AWG + I SYE+GVL+ P
Sbjct: 409 APHIKTFIRFPDATRSTIDWMLVTSANLSKQAWGEGTNAAGDVRICSYEIGVLVWPGL-- 466
Query: 530 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 589
F + +VP+ K+ + + S A +E+V +PY+LP
Sbjct: 467 ----FGDNATMVPT-FKTDNPDASA----------------AKPGTELVGARMPYDLPLV 505
Query: 590 RYSSEDVPWSWDKRYTKKDVYGQVW 614
Y +D+PW Y + D GQVW
Sbjct: 506 PYGKDDLPWCATSSYEEPDWKGQVW 530
>gi|24581359|ref|NP_523465.2| glaikit [Drosophila melanogaster]
gi|37999816|sp|Q9VQM4.1|TYDP1_DROME RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase; AltName: Full=Protein glaikit
gi|7295840|gb|AAF51141.1| glaikit [Drosophila melanogaster]
gi|15292079|gb|AAK93308.1| LD37277p [Drosophila melanogaster]
gi|220946228|gb|ACL85657.1| gkt-PA [synthetic construct]
Length = 580
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 180/374 (48%), Gaps = 45/374 (12%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K P +L+ ES L K +
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQV 233
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
I K P P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 234 TAIRVKMPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P+ E GF+ DL+ YL K + + + + +FS+ V
Sbjct: 292 PVDADTGAGESLTGFKQDLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 342 LGSVPGGHREGSVRGHPWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPVGKLRQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNS---Q 511
Y +WK+S RSRAMPHIK++ R+N L+KAAWG KN++
Sbjct: 461 DYLQQWKSSDRFRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPC 520
Query: 512 LMIRSYELGVLILP 525
L I +YE GVL LP
Sbjct: 521 LRIANYEAGVLFLP 534
>gi|195034799|ref|XP_001988977.1| GH11458 [Drosophila grimshawi]
gi|193904977|gb|EDW03844.1| GH11458 [Drosophila grimshawi]
Length = 590
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 137/450 (30%), Positives = 211/450 (46%), Gaps = 79/450 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+M+DI WLL +L K +LV++G+ L + + KP
Sbjct: 185 ILDESLGEIESTVQINFMIDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQ 242
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DFPL- 296
+ + +P F T H+K MLL Y G +R+++ TANL DW+N++QGLW+ P
Sbjct: 243 VTAV-RVKMPTPFATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWISPRLPAL 301
Query: 297 -KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
+D + + E GF+ DL+ YL K + + + +K +FS+ V L
Sbjct: 302 AEDADTAAGESATGFKQDLMLYLVEYKLSQLQPWI----------ARIRKSDFSAVNVFL 351
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
I SVPG H +++ WG +L ++L + + P+V Q SS+GSL A +
Sbjct: 352 IGSVPGGHREGAVRGHPWGCARLGSLLAKHATPVE-DRIPVVCQSSSIGSLGANVQAWIQ 410
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
S +D TPLG L +++P+ +V S +G G +P + DK +LK
Sbjct: 411 QDFVSNLRKDSTPLGRLRQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYGRNTNDKQPWLK 470
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYNGQK------------LAKAAWGALQKN-NSQ-- 511
+ +WK+ RS+AMPHIK++ R+N ++ L+KAAWG+ KN N Q
Sbjct: 471 AHLQQWKSGDRHRSQAMPHIKSYTRFNLEEQCIYWFVLTSANLSKAAWGSFNKNPNIQPC 530
Query: 512 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 571
L I +YE GVL LP F P G+S
Sbjct: 531 LRIANYEAGVLFLPR-------FVTGEETFPL-----------------------GNSRN 560
Query: 572 GASSEVVYLPVPYELPPQRYSSEDVPWSWD 601
G V P+PY++P Y ++D P+ D
Sbjct: 561 G----VPAFPLPYDVPLTPYGADDKPFLMD 586
>gi|336270704|ref|XP_003350111.1| hypothetical protein SMAC_01002 [Sordaria macrospora k-hell]
Length = 624
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 156/548 (28%), Positives = 237/548 (43%), Gaps = 107/548 (19%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHV 219
S F L ++ L +N +S++ ++ +I+ NY+ +ID+L+ A + + V
Sbjct: 91 SPFHLTTIRSLGQASNKDTISLKHLLGDPLIIECWEFNYLHNIDFLMNAFDEDIRHLVKV 150
Query: 220 LVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
V+HG E L+ ++ N H LP FGTHHSK M+L II+H
Sbjct: 151 HVVHGFWKKEDPNRLQIQSDTDRYPNITTHHAYLPEPFGTHHSKLMVLFRLDDTAEIIIH 210
Query: 274 TANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEECG-------------FENDLIDYLSTL 319
TANLI DW N + G W+ PL + FE D ++YL +
Sbjct: 211 TANLIPKDWGNMTNGAWISPRLPLLKADTQQPASSTRSSPPAAGSGEKFEIDFLNYLRSY 270
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTV 377
+ +A P K++FSS LIASVPG H+ + +WG ++
Sbjct: 271 R----TACKPLVDQLS-------KYDFSSIRGSLIASVPGRHSLVDNFPTRWGWAAMKET 319
Query: 378 LQECTFEKGF-------KKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIG 428
L+ + +K+ +V Q SS+ +L + W L S++ S + P +
Sbjct: 320 LKSVPVRQTADRDHNKSEKAEMVIQISSIATLGPTDNW---LKSTLFEALSGSQGPKTLS 376
Query: 429 EP------LIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS--- 475
+++PT +++R SL+GY++G +I S Q+ +L+ + W
Sbjct: 377 SSSKKPDFKVIFPTPDEIRKSLDGYSSGGSIHTKIQSAQQAKQLQYLRPIFCHWANDSAD 436
Query: 476 ------------HTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQ-KN 508
GR RA PHIKTF RY QK L+K AWG Q KN
Sbjct: 437 GGDDTTTTVPIREAGRQRAAPHIKTFIRYTNQKTKDRIDWALLTSANLSKQAWGDAQSKN 496
Query: 509 NS---QLMIRSYELGVLILPSA-KRHGCGFSCTSNIVP----------SEIKSGSTETSQ 554
N+ Q+ I SYE+GV++ P G G + +VP S K G++ +
Sbjct: 497 NAGEPQVRICSYEIGVMVWPELFADSGGGEKRKAVMVPTFLTDTPTGLSSSKDGTSLAGE 556
Query: 555 IQKTKLVT-----LTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 609
TK T G + S+ VV L +PY LP QRY ++VPW + + D
Sbjct: 557 RGGTKSATRDGEDGGAGGDEEEDESTVVVGLRMPYNLPLQRYGPQEVPWVATANHLEPDW 616
Query: 610 YGQVWPRH 617
GQVW RH
Sbjct: 617 MGQVW-RH 623
>gi|380095505|emb|CCC06978.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 666
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 156/548 (28%), Positives = 237/548 (43%), Gaps = 107/548 (19%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHV 219
S F L ++ L +N +S++ ++ +I+ NY+ +ID+L+ A + + V
Sbjct: 133 SPFHLTTIRSLGQASNKDTISLKHLLGDPLIIECWEFNYLHNIDFLMNAFDEDIRHLVKV 192
Query: 220 LVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
V+HG E L+ ++ N H LP FGTHHSK M+L II+H
Sbjct: 193 HVVHGFWKKEDPNRLQIQSDTDRYPNITTHHAYLPEPFGTHHSKLMVLFRLDDTAEIIIH 252
Query: 274 TANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEECG-------------FENDLIDYLSTL 319
TANLI DW N + G W+ PL + FE D ++YL +
Sbjct: 253 TANLIPKDWGNMTNGAWISPRLPLLKADTQQPASSTRSSPPAAGSGEKFEIDFLNYLRSY 312
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTV 377
+ +A P K++FSS LIASVPG H+ + +WG ++
Sbjct: 313 R----TACKPLVDQLS-------KYDFSSIRGSLIASVPGRHSLVDNFPTRWGWAAMKET 361
Query: 378 LQECTFEKGF-------KKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIG 428
L+ + +K+ +V Q SS+ +L + W L S++ S + P +
Sbjct: 362 LKSVPVRQTADRDHNKSEKAEMVIQISSIATLGPTDNW---LKSTLFEALSGSQGPKTLS 418
Query: 429 EP------LIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK----- 473
+++PT +++R SL+GY++G +I S Q+ +L+ + W
Sbjct: 419 SSSKKPDFKVIFPTPDEIRKSLDGYSSGGSIHTKIQSAQQAKQLQYLRPIFCHWANDSAD 478
Query: 474 ----------ASHTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQ-KN 508
GR RA PHIKTF RY QK L+K AWG Q KN
Sbjct: 479 GGDDTTTTVPIREAGRQRAAPHIKTFIRYTNQKTKDRIDWALLTSANLSKQAWGDAQSKN 538
Query: 509 NS---QLMIRSYELGVLILPSA-KRHGCGFSCTSNIVP----------SEIKSGSTETSQ 554
N+ Q+ I SYE+GV++ P G G + +VP S K G++ +
Sbjct: 539 NAGEPQVRICSYEIGVMVWPELFADSGGGEKRKAVMVPTFLTDTPTGLSSSKDGTSLAGE 598
Query: 555 IQKTKLVT-----LTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 609
TK T G + S+ VV L +PY LP QRY ++VPW + + D
Sbjct: 599 RGGTKSATRDGEDGGAGGDEEEDESTVVVGLRMPYNLPLQRYGPQEVPWVATANHLEPDW 658
Query: 610 YGQVWPRH 617
GQVW RH
Sbjct: 659 MGQVW-RH 665
>gi|367033183|ref|XP_003665874.1| hypothetical protein MYCTH_2310031 [Myceliophthora thermophila ATCC
42464]
gi|347013146|gb|AEO60629.1| hypothetical protein MYCTH_2310031 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 152/575 (26%), Positives = 243/575 (42%), Gaps = 135/575 (23%)
Query: 130 KKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQG 189
K+ R Q ++ E ++ SR S FRL +++ LP N ++++D++ G
Sbjct: 46 KRRRAQSLEETEPARSPS-------ASRRVFDSPFRLTKIRDLPREMNKDTITLKDIL-G 97
Query: 190 DIIVAIL--SNYMVDIDWLLPAC-PVLAKIPHVLVIHG-----ESDGTLEHMKRNKPANW 241
D ++A NY+ DID+L+ A P + + V V+HG + +G ++ N
Sbjct: 98 DPLIAECWEFNYLHDIDFLMAAFDPDVRHLVKVHVVHGFWKREDPNGLELQEAASRFQNV 157
Query: 242 ILHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQ 299
LH +P +GTHHSK M+L+ +I++HTAN+I DW N +Q +W+ PL +
Sbjct: 158 TLHSAFMPEMYGTHHSKMMILLRRDDTAQIVIHTANMIIRDWTNMTQAVWLSPRLPLMEP 217
Query: 300 NNLS---EECG------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+ EE F+ D ++YL A + K++FS+
Sbjct: 218 SRCDARPEEVAAGSGAKFKIDFLNYL--------RAYDTRRTTCRPIIDQLSKYDFSAIR 269
Query: 351 VRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKW 406
LIASVPG H +S +WG + L+ ++S + Q SS+ +L + W
Sbjct: 270 GSLIASVPGRHKLDDTSPTRWGWAAMEQALKSVPVSS--RRSDIAIQISSIATLGPTDTW 327
Query: 407 MAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAI----PSPQKN 459
L S+ S + + +P +++PT +++R SL+GY++G +I SPQ+
Sbjct: 328 ---LKSTFFRSLSGGRPGGTLQQPPNFQVIFPTPDEIRKSLDGYSSGASIHTKVQSPQQV 384
Query: 460 VDKDFLKK---YWAKWKAS----------------------------------HTGRSRA 482
+L+ +WA A+ GR RA
Sbjct: 385 KQLAYLRPMLYHWANDSANGADPQEGGGGGERRREDYENDGGDDEGDSAVVVKEAGRKRA 444
Query: 483 MPHIKTFARYNGQ-------------KLAKAAWGAL----QKNNSQLMIRSYELGVLILP 525
PHIKT+ RY + L+K AWG + + I SYE+GVL+ P
Sbjct: 445 APHIKTYIRYGDKSGPSIDWALVTSANLSKQAWGEAAVRGADGGATMRIASYEIGVLVWP 504
Query: 526 SAKRHGC---GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 582
G G T ++ E+K G+T V L +
Sbjct: 505 GLYGEGAIMRGTFLTDSLGTEEVKEGTT--------------------------AVALRM 538
Query: 583 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 617
PY LP Q Y +VPW Y++ D GQ+W RH
Sbjct: 539 PYNLPLQPYGKGEVPWVATANYSEPDWKGQIW-RH 572
>gi|195576262|ref|XP_002077995.1| GD23212 [Drosophila simulans]
gi|194190004|gb|EDX03580.1| GD23212 [Drosophila simulans]
Length = 580
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 121/374 (32%), Positives = 180/374 (48%), Gaps = 45/374 (12%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K P +L+ ES L K +
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQV 233
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
I K P P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 234 TAIRVKMPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P+ E GF+ DL+ YL K + + + + +FS+ V
Sbjct: 292 PVDADTGAGESLTGFKQDLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 342 LGSVPGGHREGSVRGHPWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPVGKLRQMPPFKMIYPSYGNVSGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNS---Q 511
Y +WK+S RSRAMPHIK++ R+N L+KAAWG K+++
Sbjct: 461 DYLQQWKSSDRYRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKSSNIQPC 520
Query: 512 LMIRSYELGVLILP 525
L I +YE GVL LP
Sbjct: 521 LRIANYEAGVLFLP 534
>gi|322706849|gb|EFY98429.1| tyrosyl-DNA phosphodiesterase 1 [Metarhizium anisopliae ARSEF 23]
Length = 517
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 143/518 (27%), Positives = 231/518 (44%), Gaps = 114/518 (22%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAK 215
++L S ++L ++ LP N V+++D++ GD +++ NY+ D+ +L+ A +
Sbjct: 51 ERLASPWQLTWIRDLPEELNYDAVTLKDLL-GDPLISDCWEFNYLHDVPFLMDAFDQDTR 109
Query: 216 -IPHVLVIHGESDGTLEHMKRNKP------------ANWILHKPPLPISFGTHHSKAMLL 262
+ +V V+HG KR+ P N LH P+P FGTHHSK M+L
Sbjct: 110 HLVNVHVVHG-------FWKRDDPHRLALTAESSGFDNVKLHVAPMPEMFGTHHSKMMVL 162
Query: 263 I-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ-----NNLSEECG--------F 308
+ II+HTAN+I DW N + +W P Q L E C F
Sbjct: 163 FRHDNTAEIIIHTANMIPKDWTNMTNAVWRT--PRLSQLPPGFRQLQEYCDLPIGSGERF 220
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK- 367
+ DL++YL + + + + +++FSS LIASVPG H L
Sbjct: 221 KADLLNYLKSYDSRKLTC--------RTLIDRLVQYDFSSVKGALIASVPGKHDIHDLSG 272
Query: 368 -KWGHMKLRTVLQECTFEKGFKKSPLVYQ-FSSLGSLDEKWMAELSSSMSSGFSEDKTPL 425
+G ++ L ++G K + L F SL + ++ S FS
Sbjct: 273 TAYGWSGVKRYLSSVPCKEGAKDTWLQKTLFDSLAT------SKTKSLQRPKFS------ 320
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------- 472
IV+PT +++R SL+GYA+G +I S Q+ +L++ W
Sbjct: 321 ------IVFPTADEIRQSLDGYASGASIHTKIQSSQQAQQLGYLRRILHHWANDSPDGIA 374
Query: 473 -----KASHTGRSRAMPHIKTFARYNGQ-----------KLAKAAWGALQKNNSQLMIRS 516
K + GR RA PHIKT+ RYN + ++K AWG + + +L + S
Sbjct: 375 SSPEIKTRNGGRDRAAPHIKTYIRYNEEGSIDWAMLTSANISKQAWGEASRPSGELRVAS 434
Query: 517 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 576
+E+GVL+ P +V ++ T S + K SS A AS
Sbjct: 435 WEIGVLVWP-------------GLVGQDVSMVGTFQSDVPKKP----KEQASSKADASGV 477
Query: 577 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
++ + +PY LP QRY +E+VPW ++++ D +G+ W
Sbjct: 478 LMGVRIPYSLPLQRYGAEEVPWVATMQHSEPDRFGRQW 515
>gi|350634393|gb|EHA22755.1| hypothetical protein ASPNIDRAFT_174927 [Aspergillus niger ATCC
1015]
Length = 581
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 138/529 (26%), Positives = 217/529 (41%), Gaps = 116/529 (21%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPA------- 209
+PS +L ++ LPA + NT V +RD++ +I NY+ D+D+L+
Sbjct: 93 IPSPIQLTHIRDLPASSGHNTDTVRLRDILGDPLIRECWQFNYLFDVDFLMSQFDEDVRR 152
Query: 210 --------------CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTH 255
P +I H + + +M P +FGTH
Sbjct: 153 LVKVKVVHGSWKRDAPNRQRIDEACTRHPNVEAIVAYM---------------PEAFGTH 197
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-----F 308
HSK M+L+ + ++++HTAN+I DW N Q +W PL + SE F
Sbjct: 198 HSKMMILLRHDDLAQLVIHTANMIAGDWANMCQAVWRSPLLPLCSDGSGSENIATPGARF 257
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH----- 361
+ DL+ YL +G K P + +K +FS+ LIASVP
Sbjct: 258 KRDLLSYLRE------------YGQRKTGPLVAQLEKHDFSAVRAALIASVPSKQKIRES 305
Query: 362 TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGF 418
T S+ K WG + LR VL+ + +V Q SS+ SL + KW+ ++ + S
Sbjct: 306 TDSNQKTLWGWLALRDVLRSVPVSPSEDRPHIVTQISSVASLGQTDKWLKDVFFASLSPS 365
Query: 419 SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA 474
S + P IV+PT +++R SL GY +G +I S + +++ Y W
Sbjct: 366 SNNPKPRFS----IVFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLCHWAG 421
Query: 475 S----------HTGRSRAMPHIKTFARYNGQK-------------LAKAAWGALQKNNSQ 511
GR RA PHIKT+ RY+ + L+ AWGA N +
Sbjct: 422 DVAEDEVKMKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWGAAVNANGE 481
Query: 512 LMIRSYELGVLILPS------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTW 565
+ I S+E+GV++ P A+ C +P + + + K + T
Sbjct: 482 VRICSWEIGVVVWPELIAGAGAEGRSVMMPCFRRDMPDADAVAAADANANADKKEIPTT- 540
Query: 566 HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
V +PY+LP RY D+PW +++ D GQ W
Sbjct: 541 ----------TTVGFRMPYDLPLTRYGETDIPWCATASHSEPDWLGQTW 579
>gi|341892674|gb|EGT48609.1| hypothetical protein CAEBREN_24547 [Caenorhabditis brenneri]
Length = 451
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 120/357 (33%), Positives = 177/357 (49%), Gaps = 55/357 (15%)
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTH 255
++M++ D+L+ P + + ++ GE D ++ ++R+ A N + LPI +GTH
Sbjct: 71 SFMIEPDYLMNCYPQSIRSNPITLVVGEPD--VKDLRRSMHAYKNVTVIGASLPIPYGTH 128
Query: 256 HSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLID 314
HSK +L G + +IV +AN+I DW K+Q W + +K + ++ F+NDLI+
Sbjct: 129 HSKLSILEGEDGTIHVIVSSANMISEDWEFKTQQFWY-GYGVKKETQVTGS-EFQNDLIE 186
Query: 315 YL-----STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 369
YL S W E K +FS RLI SVPGYH
Sbjct: 187 YLGYYPSSMNSWTEL----------------IKCTDFSEVKDRLIFSVPGYHKAKK-NSL 229
Query: 370 GHMKLRTVL-QECTFEKGF---KKSPLVYQFSSLGSLD---EKWMAE--LSSSMSSGFSE 420
GHM LR++L F+ F ++ Q SS+GSL W L S +
Sbjct: 230 GHMALRSILIDRFPFDPNFVHTDRTTFFCQCSSIGSLGPTPANWFRGQFLKSLEGAATPP 289
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGR 479
P + +++P VEDVR S EGYA G ++P + L+ + +WKA R
Sbjct: 290 QNKPARL---FVLFPRVEDVRMSAEGYAGGKSVPYRNSVHQRQLWLQHAFCRWKADKKKR 346
Query: 480 SRAMPHIKTFARY--NGQK--------LAKAAWGALQKNNS---QLMIRSYELGVLI 523
+RA+PH KT+ + +GQK L+KAAWG LQK N+ QLMIRSYE+GVL+
Sbjct: 347 TRAIPHCKTYMKIDKDGQKWQLLTSANLSKAAWGELQKVNTANEQLMIRSYEMGVLV 403
>gi|195388525|ref|XP_002052930.1| GJ17827 [Drosophila virilis]
gi|194149387|gb|EDW65085.1| GJ17827 [Drosophila virilis]
Length = 592
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 132/450 (29%), Positives = 203/450 (45%), Gaps = 79/450 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G I ++ N+M+DI WLL +L K +LV++G+ L + + KP
Sbjct: 187 ILDESLGKIESSVQINFMIDIGWLLGHYYFAGILDK--PLLVLYGDESPDLLGIGKFKPQ 244
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
+ K +P F T H+K MLL Y G +R+++ TANL DW+N++QGLW+
Sbjct: 245 VTAI-KVNMPTPFATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWISPRLPAL 303
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P E GF+ DL+ YL K + + + +K +FS+ V L
Sbjct: 304 PEGADTAAGESPTGFKQDLMLYLVEYKVSQLQPWI----------ARIRKSDFSAVNVFL 353
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
I SVPG H S+++ WG +L ++L + + P+V Q SS+GSL A +
Sbjct: 354 IGSVPGGHRESAVRGHPWGCARLGSLLAKHAAPVD-DRIPVVCQSSSIGSLGANVQAWIQ 412
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G L +++P+ +V S +G G +P + DK +LK
Sbjct: 413 QDFVNNLRKDSTPVGRLRQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYSKNTNDKQPWLK 472
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYNGQK------------LAKAAWGALQKNNS---Q 511
+ +WK+ RS+AMPHIK++ R+N ++ L+KAAWG+ KN+
Sbjct: 473 AHLQQWKSGDRHRSQAMPHIKSYTRFNLEQQCVYWFVLTSANLSKAAWGSFNKNSQIQPC 532
Query: 512 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 571
L I +YE GVL LP F P
Sbjct: 533 LRIANYEAGVLFLPR-------FVTGEETFPL---------------------------G 558
Query: 572 GASSEVVYLPVPYELPPQRYSSEDVPWSWD 601
A V P+PY++P Y +D P+ D
Sbjct: 559 NARDGVPAFPLPYDVPLTPYGPDDTPFLMD 588
>gi|7529314|emb|CAB86488.1| Glaikit protein [Drosophila melanogaster]
Length = 580
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 122/375 (32%), Positives = 181/375 (48%), Gaps = 47/375 (12%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHG-ESDGTLEHMKRNKP 238
I D G+I + N+MVDI WLL +L K +LV++G ES L K +
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKQQ 232
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD---- 293
I K P P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 233 VTAIRVKMPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPA 290
Query: 294 FPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVR 352
P+ E GF+ D + YL K + +P + +FS+ V
Sbjct: 291 LPVDADTGARESLTGFKQDRMLYLVEYKISQLQPWIPR----------IRNSDFSAINVF 340
Query: 353 LIASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 410
+ SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 341 FLGSVPGGHREGSVRGHPWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWI 399
Query: 411 SSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFL 465
+ +D TP+G + +++P+ +V S +G G +P N ++ +L
Sbjct: 400 QQDFVNSPKKDSTPVGKLRQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDNQPWL 459
Query: 466 KKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNS--- 510
K Y +WK+S RSRAMPHIK++ R+N L+KAAWG KN++
Sbjct: 460 KDYLQQWKSSDRFRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQP 519
Query: 511 QLMIRSYELGVLILP 525
L I +YE GVL LP
Sbjct: 520 CLRIANYEAGVLFLP 534
>gi|268553849|ref|XP_002634911.1| Hypothetical protein CBG22509 [Caenorhabditis briggsae]
Length = 421
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 116/379 (30%), Positives = 188/379 (49%), Gaps = 45/379 (11%)
Query: 172 LPAWANTSCVSIRDVIQGDI--IVAILSNYMVDIDWLLPACP-VLAKIPHVLVIHGESDG 228
+P + +S+ D++ DI A+ ++M+D +LL + P L P LV+ G SD
Sbjct: 21 VPRQESEGSLSLEDIL-ADIRPTQALHLSFMIDFQYLLNSYPPSLRTTPMTLVV-GASDK 78
Query: 229 TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQ 287
+ N + PLPI FGTHH+K ++ G V +IV TANL+ DW K+Q
Sbjct: 79 AALSRECAAHKNVTVIGAPLPIPFGTHHTKMSIMESEDGRVHVIVSTANLVPDDWEFKTQ 138
Query: 288 GLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFN 345
+ +D ++ C F++DL++YLS F NL + P + +
Sbjct: 139 QFYYACGLRRDGE--AQRCPFQSDLLEYLS------FYRNL-------LTPWRELIQSTD 183
Query: 346 FSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFK---KSPLVYQFSSLGSL 402
FSS RLI S PGYHT + +G R + ++ F+ ++ + + Q SS+GS+
Sbjct: 184 FSSITDRLIFSTPGYHTHVARLNFGPRLARILTEKFPFDPSYEHTERCTFISQCSSIGSI 243
Query: 403 DEKWMAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIPSPQK- 458
++ + E P +P +++P VEDVR S +GYA G ++P
Sbjct: 244 GKQPIDWFRGQFLKSL-EGANPAPKSKPAKMYLIFPCVEDVRTSCQGYAGGGSVPYRNSV 302
Query: 459 NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ----------KLAKAAWG----A 504
+V + +L+ KW+++ R+ A+PH KT+ +++ + L+KAAWG +
Sbjct: 303 HVRQKWLQGVMCKWRSNAKRRTHAVPHCKTYVKFDKKVPQWQLVTSANLSKAAWGEASFS 362
Query: 505 LQKNNSQLMIRSYELGVLI 523
K QLM+RSYE+GVLI
Sbjct: 363 KAKKTDQLMVRSYEMGVLI 381
>gi|164425147|ref|XP_962379.2| hypothetical protein NCU06345 [Neurospora crassa OR74A]
gi|157070809|gb|EAA33143.2| hypothetical protein NCU06345 [Neurospora crassa OR74A]
Length = 527
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 160/518 (30%), Positives = 227/518 (43%), Gaps = 111/518 (21%)
Query: 198 NYMVDIDWLLPACPVLAK-IPHVLVIHG----ESDGTLE-HMKRNKPANWILHKPPLPIS 251
NY+ DID+L+ A + + V VIHG E L+ + N H LP
Sbjct: 22 NYLHDIDFLMGAFDSDVRHLIKVHVIHGFWKKEDPNRLQIQSDAARYPNITTHHAYLPEP 81
Query: 252 FGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQ-DFPL----KDQNNLSEE 305
FGTHHSK M+L+ II+HTANLI DW+N +Q W+ PL QN S
Sbjct: 82 FGTHHSKMMVLLRADDTAEIIIHTANLIPRDWSNMTQAAWISPRLPLLKPDAQQNTSSTR 141
Query: 306 ------CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
CG F+ D ++YL + + A N I+ K++FSS LIASV
Sbjct: 142 SPPPAGCGEKFKIDFLNYLRSYR---------AACNPLIDQ--LAKYDFSSIRGSLIASV 190
Query: 358 PGYHT--GSSLKKWGHMKLRTVLQECTFEKG------------FKKSPLVYQFSSLGSL- 402
PG H+ +WG ++ L+ + +K +V Q SS+ +L
Sbjct: 191 PGRHSLVDDFPTRWGWAAMKETLKSVPVRQAGDRVQGGGDVDDSEKPEVVIQISSIATLG 250
Query: 403 -DEKWMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAI---- 453
+ W+ SG KT L +P I++PT +++R SL+GYA+G +I
Sbjct: 251 PTDNWLKNTLFEALSGSQGPKTLLS-SKPKPDFKIIFPTPDEIRKSLDGYASGGSIHTKI 309
Query: 454 PSPQKNVDKDFLKKYWAKWKAS--------------HTGRSRAMPHIKTFARYNGQK--- 496
S Q+ +L+ + W GR+RA PHIKTF R+ K
Sbjct: 310 QSAQQAKQLQYLRPMFCHWANDSADGVGTTTTTPIREAGRNRAAPHIKTFIRFANHKTKN 369
Query: 497 -----------LAKAAWGALQ-KNNS---QLMIRSYELGVLILPSAKRHGCGFSCTSNI- 540
L+K AWG Q KNN+ Q+ I SYE+GVL+ P G S S +
Sbjct: 370 TIDWALLTSANLSKQAWGDAQSKNNAGEPQVHICSYEIGVLVWPELFADSDGTSSGSKMG 429
Query: 541 -----VPSEIKS-----GSTE---TSQIQKTKLVTLTWHGSSDAGASSE--------VVY 579
VP+ +K GS + +S +K + + +G D E VV
Sbjct: 430 QKAVMVPTFLKDTPAIHGSEKDRPSSLGEKQGPTSTSRNGEKDGKGDDEKEEKSSTVVVG 489
Query: 580 LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 617
L +PY LP QRY ++VPW + + D GQVW RH
Sbjct: 490 LRMPYNLPLQRYGLQEVPWVATANHLEPDWMGQVW-RH 526
>gi|17540580|ref|NP_500149.1| Protein F52C12.1 [Caenorhabditis elegans]
gi|37999811|sp|Q9TXV7.1|TYDP1_CAEEL RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase
gi|351063437|emb|CCD71624.1| Protein F52C12.1 [Caenorhabditis elegans]
Length = 451
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 130/458 (28%), Positives = 208/458 (45%), Gaps = 95/458 (20%)
Query: 185 DVIQGDI--IVAILSNYMVDIDWLLPACP-VLAKIPHVLVIHGESDGTLEHMKRNKPANW 241
D I DI I ++ ++M+D ++L+ + P L + P LV+ L +N+
Sbjct: 58 DEILADIRPINSLHFSFMLDFEFLIGSYPPSLREYPITLVVGAPDAPDLLKCTKNQKLVT 117
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
++ LPI FGTHH+K +L G +IV TANL+ DW K+Q + +F +K +
Sbjct: 118 VVGAS-LPIPFGTHHTKMSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIAS 175
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
F++DL++YLS + +K +FS + RLI S PGY
Sbjct: 176 GTVPRSDFQDDLLEYLSMYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGY 224
Query: 361 HTGSSLKKWGHMKLRTVLQE-CTFEKGF---KKSPLVYQFSSLGSLDE---KWMAE--LS 411
HT ++ GH +L +L E F+ + ++ V Q SS+GSL W L
Sbjct: 225 HTDPPTQRPGHPRLFRILSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQ 284
Query: 412 SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWA 470
S + S + P + +V+P+VEDVR S +GYA G ++P + + +L+
Sbjct: 285 SLEGANPSPKQKPAKM---YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMC 341
Query: 471 KWKASHTGRSRAMPHIKTFARYNGQ----------KLAKAAWGAL----QKNNSQLMIRS 516
KW+++ R+ A+PH KT+ +Y+ + L+KAAWG + KN QLMIRS
Sbjct: 342 KWRSNAKRRTNAVPHCKTYVKYDKKVAIWQLLTSANLSKAAWGEVSFNKSKNVEQLMIRS 401
Query: 517 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 576
+E+GVLI T+ S+
Sbjct: 402 WEMGVLI--------------------------TDPSRFN-------------------- 415
Query: 577 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+P++ P YS+ D P+ DK++ K D+ G +W
Sbjct: 416 -----IPFDYPLVPYSATDEPFVTDKKHEKPDILGCIW 448
>gi|312069908|ref|XP_003137901.1| tyrosyl-DNA phosphodiesterase [Loa loa]
Length = 426
Score = 148 bits (374), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 141/504 (27%), Positives = 207/504 (41%), Gaps = 117/504 (23%)
Query: 137 EQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQG-DIIVAI 195
E D+ K SE + DKL +V GL N + S ++++ + +I
Sbjct: 15 ECDDLESKGSEGKRMKQNCLMDKL----YFNKVVGLAEQYNVNAFSFAELLELISPVASI 70
Query: 196 LSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG-----TLEHMKRNKPANWILHKPPLPI 250
N+M+D+ WLL P + + +I GE G T +K+ N + + L I
Sbjct: 71 HFNFMIDLRWLLTQYPGRLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVNVGRARLMI 130
Query: 251 SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN 310
FGTHHSK + + + + L D P ++ ++ F+
Sbjct: 131 PFGTHHSKISI--------------------FESNTGRLAAGDCPDRNGSD------FQT 164
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
DL+ YL K + L H +++ + S R++ SVPG H G L K+G
Sbjct: 165 DLVKYLDEYKTSQ-DWGLIEHWRDRVS-----NIDLSQVKARVVYSVPGTHKGVQLTKYG 218
Query: 371 HMKLRTVLQECTFE----KGFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSEDKTPL 425
H +LR +L+E + GF SLG+ + W+ + +S+S G D
Sbjct: 219 HPRLRVILKELFGDVKNMDGFTYHAQCSSLGSLGAAPQYWLTGQFLNSLSGGAETD---- 274
Query: 426 GIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAM 483
GE L I++P VEDVR S EGYAAG + P S V + +L + KW + H GRSRAM
Sbjct: 275 --GEHLRIIYPCVEDVRNSNEGYAAGGSFPYSNSVAVKQPYLLNFMHKWSSDHLGRSRAM 332
Query: 484 PHIKTFARY------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 531
PHIKT+A + L+KAAWG Q QL IRSYE G+L
Sbjct: 333 PHIKTYAAFAKDSLKPSWLLITSANLSKAAWGDYQSKKPQLTIRSYEFGLLF-------- 384
Query: 532 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 591
SD + + Y +LP +Y
Sbjct: 385 -------------------------------------SDPESLDMLPY-----DLPLTKY 402
Query: 592 SSEDVPWSWDKRYTKKDVYGQVWP 615
D W DK Y K D++ + WP
Sbjct: 403 DDNDRVWIVDKTYRKPDIFRKTWP 426
>gi|242006203|ref|XP_002423943.1| tyrosyl-DNA phosphodiesterase, putative [Pediculus humanus
corporis]
gi|212507213|gb|EEB11205.1| tyrosyl-DNA phosphodiesterase, putative [Pediculus humanus
corporis]
Length = 447
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 126/434 (29%), Positives = 197/434 (45%), Gaps = 85/434 (19%)
Query: 198 NYMVDIDWLLPACPVLAKI-PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 256
N+MV++ WL+ + P + +++ DG L ++ + I K P P FG HH
Sbjct: 71 NFMVELPWLMAQYAINDLFNPSMTILYDVQDGDLANIPEHLNIKAIKIKSPYP--FGHHH 128
Query: 257 SKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWM--------QDFPLKDQNNLSEECG 307
+K + Y R +R ++TANLI DW +++QG+W+ D P+ N +
Sbjct: 129 TKMSIFFYTDRSIRFAIYTANLIESDWEDRTQGVWISPKCPYLGDDVPI---NYGESDTL 185
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 367
F+ +++ YL + K PE L KI + + S V ++SVPG S +
Sbjct: 186 FKFEILQYLISYKLPEIRNLL-----IKIQET-----DCSLIKVFFVSSVPG----SVID 231
Query: 368 KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL----DEKWMAELSSSMSSGFSEDKT 423
+G++KL +++E E K +V Q SS+GSL D + E S SS S +
Sbjct: 232 NFGYIKLGKIIKEHAVENSEDKERIVIQCSSIGSLGPAPDSWLLNEFVKSTSSKLSSPQV 291
Query: 424 PLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRA 482
IV+P+V +V S+ G + G +P S ++ + +L KY +W H RS+A
Sbjct: 292 S-------IVYPSVRNVASSIYGLSGGGCLPYSSGTHIKQLWLNKYLMQWYCEHRKRSKA 344
Query: 483 MPHIKTFARYNGQK------------LAKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 530
+PHIKT+AR N K L+KAAWG K + L I SYE GVL LP +
Sbjct: 345 VPHIKTYARINEDKEEISWFLLTSANLSKAAWGKKLK-SGMLQIMSYEAGVLFLPKLLIN 403
Query: 531 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 590
F +I+K ++G E P+PY++P
Sbjct: 404 KNVF-------------------KIKKF---------GYNSGNDDE---FPIPYDIPLTS 432
Query: 591 YSSEDVPWSWDKRY 604
Y D + +DK +
Sbjct: 433 YQETDRLFLFDKNF 446
>gi|170040309|ref|XP_001847946.1| tyrosyl-dna phosphodiesterase [Culex quinquefasciatus]
gi|167863873|gb|EDS27256.1| tyrosyl-dna phosphodiesterase [Culex quinquefasciatus]
Length = 615
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 126/439 (28%), Positives = 203/439 (46%), Gaps = 70/439 (15%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPH--VLVIHGESDGTLEHMKRNKPANWILHKP 246
G++ ++ N+MVDI WLL A +L+++G+ L+ + KP N K
Sbjct: 217 GELECSVQMNFMVDIGWLL-GHYFFAGYEDRPLLILYGDESPELKTVSTKKP-NVTALKV 274
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD-FPL----KDQN 300
+ FG HH+K L Y G +R+++ TANL D++N++QGLW+ P D
Sbjct: 275 HIATPFGVHHTKMGLYGYTDGSMRVVISTANLYEDDFHNRTQGLWISPRLPALAEDADTG 334
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
GF LI YL++ K+ + +A + S ++ +F V +AS+PG
Sbjct: 335 AGESRTGFRESLITYLNSYKFAQLAAWV----------SRIQRTDFGEVNVFFVASIPGG 384
Query: 361 HTGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFS 419
H ++ WGH +L +L + + PLV Q SS+GSL + + S + + F
Sbjct: 385 HLNTAKGPLWGHPRLGYLLGKHSAPID-DACPLVAQSSSIGSLGPNPQSWVLSEIMASFR 443
Query: 420 EDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKAS 475
D P+G+ +++P+ +VR S + G +P + +K +LK + +WK+
Sbjct: 444 RDSAPVGLRRVPSFRMIFPSFSNVRNSHDNLLGGGCLPYMRATHEKQPWLKDHLHQWKSD 503
Query: 476 HTGRSRAMPHIKTFARYNGQ----------KLAKAAWGALQKN---NSQLMIRSYELGVL 522
R++A+PHIKT+ R++ + L+KAAWG K+ + L I SYE+GVL
Sbjct: 504 CRNRTKAVPHIKTYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSYEVGVL 563
Query: 523 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 582
LP F N P E KS G + A P+
Sbjct: 564 FLPK-------FVIDENFFPMESKSS------------------GDNKHPA------FPM 592
Query: 583 PYELPPQRYSSEDVPWSWD 601
PY++P Y+ ED P+ D
Sbjct: 593 PYDVPIIPYAPEDSPFFMD 611
>gi|296424093|ref|XP_002841585.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637828|emb|CAZ85776.1| unnamed protein product [Tuber melanosporum]
Length = 510
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 134/502 (26%), Positives = 219/502 (43%), Gaps = 97/502 (19%)
Query: 157 RDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPAC-PVLA 214
R ++ S F+L RV LP N V IRD+++ G + + NY+ D+DW++ P +
Sbjct: 60 RIRVASPFQLTRVDELPESENVDAVGIRDILRRGPLKEVWIFNYLFDLDWVMNQFDPDVK 119
Query: 215 KIPHVLVIHG-----ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-V 268
V ++HG +++ H + N L +P +GTHHSK +L
Sbjct: 120 DTVKVRIVHGSWRREDANRARIHDQAESYPNVKLVCAFMPEPYGTHHSKMFVLFRTDDHA 179
Query: 269 RIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG--------FENDLIDYLSTL 319
+II+HTAN+I DW N +Q +W PL Q++ S F+ D++ Y S
Sbjct: 180 QIIIHTANMIPFDWQNMTQAVWQSPLLPLLPQDHGSPRAQTFKPIGQRFKTDILAYFSAY 239
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGSSLKK---WGHMKLR 375
G + +++F + SVPG +H +S K WG +L
Sbjct: 240 ----------GEGRTDFLTTQLSRYSFDPVKAVFVGSVPGKFHIDASNGKGYEWGWRRLA 289
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAEL--SSSMSSGFSEDKTPLGIGEPL 431
+VL++ K +V Q SS+ +L K W++ + +S +S F+ P +
Sbjct: 290 SVLRKVPLRSPEAKGCIVVQVSSIATLGSKNTWLSPVLFASLKTSRFTASAEP----KFH 345
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR 491
+++PT ++R SL GY +G+++ K+ + + + G +RA PHIKT+ R
Sbjct: 346 VIFPTANEIRESLNGYRSGSSL-----------HMKFQSPAQQAQLG-ARAAPHIKTYIR 393
Query: 492 YN-------------GQKLAKAAWGALQK------NNSQLMIRSYELGVLILPSAKRHGC 532
++ ++ AWGA +K N+ ++ I SYE GVL+ P
Sbjct: 394 FSDTDCTQIDWALLTSANISIQAWGAAEKDPIGRINHREVRICSYEAGVLVYPEILDVEE 453
Query: 533 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 592
+P EI G T AG L +PY LP ++Y+
Sbjct: 454 MVPTFRKDIPDEIGDGGT--------------------AG-------LRMPYGLPLRKYA 486
Query: 593 SEDVPWSWDKRYTKKDVYGQVW 614
S ++PW K Y+ D GQ W
Sbjct: 487 SNEMPWCAYKSYSDVDWLGQRW 508
>gi|326476634|gb|EGE00644.1| tyrosyl-DNA phosphodiesterase [Trichophyton tonsurans CBS 112818]
gi|326478089|gb|EGE02099.1| tyrosyl-DNA phosphodiesterase [Trichophyton equinum CBS 127.97]
Length = 588
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 144/536 (26%), Positives = 236/536 (44%), Gaps = 95/536 (17%)
Query: 156 SRDKL-PSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP 211
SR K+ PS +L ++ + N CV +RD++ +I NY+ D+D+++
Sbjct: 67 SRQKIIPSPIQLTHIRDISDSTGYNEGCVKLRDILGDPMIKECWQFNYLFDVDYIMGQFD 126
Query: 212 VLAK-IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI 263
K + + +IHG E+ + + KR A ++ P P FGTHHSK M+LI
Sbjct: 127 RDVKDLIQLKIIHGSWKKEAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILI 184
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDY 315
+ +II+HTAN+I DW N +Q +W Q + + CG F+ DL+ Y
Sbjct: 185 RHDNLAQIIIHTANMIPRDWGNMTQAVWRSPLLPLSQAQVCDTCGGFGSSARFKRDLLAY 244
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-----HTGSSLKK 368
L A+ N IN ++++F S LIASVP +
Sbjct: 245 LE------------AYHNKTINTLIRQLQRYDFGSVKAVLIASVPTRLPVKEFDSNRRTL 292
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSLDE--KWMAE--LSSSMSSGFSED 421
WG L+ + ++ ++ ++ Q SS+ +L + +W+ E LSS
Sbjct: 293 WGWPALKDAIGSIPIDRSSSRAQNPHIIVQVSSIATLGQTDRWLKETFLSSLYPQPEVNQ 352
Query: 422 KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKW--- 472
+ I++PT +++R SL+G+ +G +I PS QK + +L++Y W
Sbjct: 353 NRSTSNVKFSIIFPTPDEIRRSLDGHGSGGSIHMKIQSPSQQKQL--AYLRRYLCHWAGD 410
Query: 473 --------------KASHTGRSRAMPHIKTFARYN-------------GQKLAKAAWGAL 505
+ GR RA PHIKT+ R++ L+ AWGA
Sbjct: 411 AEGRKNSDPTTKSDRVREAGRRRAAPHIKTYIRFSDSDMDNIDWAMITSANLSTQAWGAG 470
Query: 506 QKNNSQLMIRSYELGVLILPSAKR----HGCGFSCTSN---IVPSEIKSGSTETSQIQKT 558
+ ++ I S+E+GVLI P R GC S +N ++P K + +Q +
Sbjct: 471 ANTHGEVRICSWEIGVLIWPDLFREEHIEGCSDSSLTNHVKMIPC-FKRNTPSEKPLQSS 529
Query: 559 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ + SDA + V L +PY+LP Y+ ++VPW + + D GQ W
Sbjct: 530 ENDSTKVALHSDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATAVHREPDWMGQTW 584
>gi|302662485|ref|XP_003022896.1| hypothetical protein TRV_02978 [Trichophyton verrucosum HKI 0517]
gi|291186867|gb|EFE42278.1| hypothetical protein TRV_02978 [Trichophyton verrucosum HKI 0517]
Length = 587
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 139/535 (25%), Positives = 232/535 (43%), Gaps = 93/535 (17%)
Query: 156 SRDKL-PSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP 211
SR K+ PS +L ++ + N C+ +RD++ +I NY+ D+D+++
Sbjct: 66 SRQKIIPSPIQLTHIRDISDSTGYNEGCIKLRDILGDPMIKECWQFNYLFDVDYIMGQFD 125
Query: 212 VLAK-IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI 263
K + + +IHG E+ + + KR A ++ P P FGTHHSK M+LI
Sbjct: 126 RDVKDLIQLKIIHGSWKKEAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILI 183
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDY 315
+ ++I+HTAN+I DW N +Q +W Q + + CG F+ DL+ Y
Sbjct: 184 RHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLAQPQVGDTCGVFGSSTRFKRDLLAY 243
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-----HTGSSLKK 368
L A+ N IN ++++F + LIASVP +
Sbjct: 244 LE------------AYNNKTINTLIRQLQRYDFGAVKAMLIASVPTRLPVKEFDSNKRTL 291
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWMAE--LSSSMSSGFSED 421
WG L+ + ++ ++ ++ Q SS+ +L +KW+ E LSS
Sbjct: 292 WGWPALKDAISSIPIDRSSSQAQNPHIIVQVSSIATLGQTDKWLKETFLSSLCPQPEVNQ 351
Query: 422 KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKW----- 472
I++PT +++R SL+GY +G + I SP + +L++Y W
Sbjct: 352 SRSTSNARFSIIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQLAYLRRYLCHWAGDAE 411
Query: 473 ------------KASHTGRSRAMPHIKTFARYN-------------GQKLAKAAWGALQK 507
+ GR RA PHIKT+ R++ L+ AWGA
Sbjct: 412 DPKNSDPATKSDRVREAGRRRAAPHIKTYIRFSDSDMNSIDWAMITSANLSTQAWGAGAN 471
Query: 508 NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKS--------GSTETSQIQKTK 559
+ ++ I S+E+GVL+ P R C+ + + + +K S + Q +
Sbjct: 472 THGEVRICSWEIGVLMWPDLFREKNIEECSDSSLTNYVKMIPCFKRNVPSEKPPQTSEND 531
Query: 560 LVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+T H SDA + V L +PY+LP Y+ ++VPW + + D GQ W
Sbjct: 532 STKVTLH--SDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATAVHREPDWMGQTW 583
>gi|171683299|ref|XP_001906592.1| hypothetical protein [Podospora anserina S mat+]
gi|170941609|emb|CAP67263.1| unnamed protein product [Podospora anserina S mat+]
Length = 569
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 145/556 (26%), Positives = 227/556 (40%), Gaps = 129/556 (23%)
Query: 151 CNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPA 209
+H + S F+L +++ LPA N ++RDV+ +I NY+ DID+L+ A
Sbjct: 49 AKYHPPFKSVGSPFQLTKIKDLPAGLNKDTYTLRDVLGDPLISECWEFNYLHDIDFLMSA 108
Query: 210 CPV-LAKIPHVLVIHGESDGTLEHMKRNKPA------------NWILHKPPLPISFGTHH 256
+ + V V+HG KR P N LH LP FGTHH
Sbjct: 109 FDEDVRSLVKVHVVHG-------FWKREDPNRLALQESAARFNNVTLHAAFLPEMFGTHH 161
Query: 257 SKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPL------KDQNNLSEECGF 308
SK +L+ + ++++HTANLI DW N +QG W PL + + + F
Sbjct: 162 SKMFILLRHDDTAQLVIHTANLITRDWTNMTQGAWFSPRLPLLKPEHDEGRPRIGNGAKF 221
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT---GSS 365
+ D ++YL + P + K++FSS LI+SVPG HT +S
Sbjct: 222 KLDFLNYLRA-----YDTKRPTCKDITTK---LMKYDFSSINGSLISSVPGRHTVTQSTS 273
Query: 366 LKKWGHMKLRTVLQECTFEKGFKKSPLV-YQFSSLGSL--DEKWMAE-----LSSSMSSG 417
+G +++ L + P V Q SS+ +L + W+ L ++ ++
Sbjct: 274 STNFGWAAMKSALAAVPIHSTIEHKPEVAIQISSIATLGPTDSWLKNTFLHTLGNTPATT 333
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW- 472
F +V+PT +++R SL+GY +G +I SPQ+ +LK + W
Sbjct: 334 FK------------VVFPTPDEIRKSLDGYMSGGSIHTKTQSPQQVKQLQYLKPLFHHWA 381
Query: 473 --------------------------------KASHTGRSRAMPHIKTFAR--------- 491
K ++GR RA PHIKT+ R
Sbjct: 382 NDSASGLRMFPPRPLLSPSANAPSPNIAINASKVKNSGRKRAAPHIKTYIRSHRPTPESS 441
Query: 492 ----------YNGQKLAKAAWG-ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNI 540
L+K AWG AL + + I SYE+GVL+ P + + +
Sbjct: 442 ETDIHIDWALLTSANLSKQAWGEALSAKENTVRISSYEIGVLVWPGL------YGENAVM 495
Query: 541 VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV--VYLPVPYELPPQRYSSEDVPW 598
P+ ++ Q + G D EV V L +PY+LP Q Y +VPW
Sbjct: 496 KPAFLEDALPPPEQTRGDG----DGKGKEDYDGKDEVVEVALRMPYDLPLQPYGPGEVPW 551
Query: 599 SWDKRYTKKDVYGQVW 614
+T+ D G++W
Sbjct: 552 VATASHTEPDWMGRIW 567
>gi|327299128|ref|XP_003234257.1| tyrosyl-DNA phosphodiesterase [Trichophyton rubrum CBS 118892]
gi|326463151|gb|EGD88604.1| tyrosyl-DNA phosphodiesterase [Trichophyton rubrum CBS 118892]
Length = 586
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 137/535 (25%), Positives = 235/535 (43%), Gaps = 93/535 (17%)
Query: 156 SRDKL-PSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP 211
SR K+ PS +L ++ + N C+ +RD++ +I NY+ D+D+++
Sbjct: 65 SRQKIIPSPIQLTHIRDISDSTGYNEGCIKLRDILGDPMIKECWQFNYLFDVDYVMGQFD 124
Query: 212 VLAK-IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI 263
K + + +IHG E+ + + KR A ++ P P FGTHHSK M+LI
Sbjct: 125 RDVKDLIQLKIIHGSWKKEAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILI 182
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDY 315
+ ++I+HTAN+I DW N +Q +W Q+ + + CG F+ DL+ Y
Sbjct: 183 RHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLSQSQVGDACGVFGSSARFKRDLLAY 242
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-----HTGSSLKK 368
L A+ N IN ++++F + LIASVP +
Sbjct: 243 LE------------AYNNNTINTLIRQLQQYDFGAVKAVLIASVPTRLPVKEFDSNRRTL 290
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSLDE--KWMAE--LSSSMSSGFSED 421
WG L+ + ++ ++ ++ Q SS+ +L + KW+ E SS S
Sbjct: 291 WGWPALKDAIGSIPIDRSSSQAQNPHIIIQVSSIATLGQTDKWLKETFFSSLYSQPEVNQ 350
Query: 422 KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKW----- 472
+ I++PT +++R SL+GY +G + I SP + +L++Y W
Sbjct: 351 SRSTSKAKFSIIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQLAYLRRYLCHWAGDAE 410
Query: 473 ------------KASHTGRSRAMPHIKTFARYN-------------GQKLAKAAWGALQK 507
+ GR RA PHIK++ R++ L+ AWGA
Sbjct: 411 GPKNADPTTTSDRVREAGRRRAAPHIKSYIRFSDSDMDSIDWAMITSANLSTQAWGAGAN 470
Query: 508 NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK--------SGSTETSQIQKTK 559
+ ++ I S+E+G+LI P R C+ + + + +K + S + Q +
Sbjct: 471 THGEVRICSWEIGILIWPDLFREENIEECSDSSLTNHVKMIPCFKRNTPSEKPLQTSEND 530
Query: 560 LVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ +T H DA + V L +PY+LP Y+ ++VPW + + D GQ W
Sbjct: 531 SIKVTLH--LDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATSVHREPDWMGQTW 582
>gi|301770839|ref|XP_002920828.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Ailuropoda
melanoleuca]
Length = 205
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 93/232 (40%), Positives = 124/232 (53%), Gaps = 46/232 (19%)
Query: 399 LGSLDEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-S 455
+G+ D KW+ +E S+ + E +TP PL +++P+VE+VR SLEGY AG ++P S
Sbjct: 1 MGADDSKWLCSEFKESLVTLGKESQTPGRSAVPLHLIYPSVENVRTSLEGYPAGGSLPYS 60
Query: 456 PQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR------------YNGQKLAKAAWG 503
Q +++L Y+ KW A +GRS AMPHIKT+ R L+KAAWG
Sbjct: 61 IQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYMRPSPDFSEIAWFLVTSANLSKAAWG 120
Query: 504 ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL 563
AL+KN +QLMIRSYELGVL LPSA F S V + GS E +
Sbjct: 121 ALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFFGSKEPAAA-------- 166
Query: 564 TWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
PVPY+LPP+ Y S+D PW W+ YTK D +G +W
Sbjct: 167 ----------------FPVPYDLPPELYGSKDRPWIWNIPYTKAPDTHGNMW 202
>gi|193659893|ref|XP_001947945.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like isoform 1
[Acyrthosiphon pisum]
Length = 684
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 126/455 (27%), Positives = 211/455 (46%), Gaps = 77/455 (16%)
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVL-AKIPHVLVIHGESDGTLEHMKRNK 237
S + D GD+ ++ N+MV++ WL + + + +++ D ++ + + K
Sbjct: 277 SFAELLDKSLGDLSESLHLNFMVELGWLFAQYFITDQRGKKMTLLYERCDEDIDELHKKK 336
Query: 238 PANWILHKPPL-PISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DF 294
+ HK + +FG HSK + Y G +R++V +ANL DW +QG+W+ F
Sbjct: 337 KLLNVRHKKIINKNAFGHQHSKVSMFAYADGSLRVVVMSANLCEDDWTKYAQGIWVSPKF 396
Query: 295 PLKDQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
PLK++++ S+ + F+ D++ YL++ + P + +K +FS A V
Sbjct: 397 PLKEEDDKSDGNSQTDFKIDILRYLNSFREPSLVPWIQK----------IEKVDFSQANV 446
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKW 406
I SVPG HT WGH+ L+ +L++ C + P++ Q SSLGSL DE+W
Sbjct: 447 FFIPSVPGKHTEPL---WGHLYLKNILKKHACLPFCVPSEWPIIAQCSSLGSLGTTDEEW 503
Query: 407 M-AELSSSMSSGFSEDKTPLGIGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK 462
+ +E S+S+ D T +P+ +++P+V++V S +G G +P + +K
Sbjct: 504 LKSEFVESLSASTYCDDTDTD-NDPIPFHLIYPSVKNVLNSWDGALGGICLPYNKILHEK 562
Query: 463 DF-LKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNN 509
LKKY W+ R++AMPHIKT+ R + L+KAAWG K++
Sbjct: 563 QLWLKKYMCLWQCHSRKRTKAMPHIKTYCRISPCLTEMSWFLLGSANLSKAAWGRKLKSD 622
Query: 510 SQL-MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 568
Q I ++E GVL LP F S+ P
Sbjct: 623 EQSNFIMAHEAGVLFLPQ-------FLIGSDTFP-------------------------- 649
Query: 569 SDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 603
D ++ Y +P++LP YS D PW+ R
Sbjct: 650 IDETEPNKFPYFSLPFDLPLAGYSDTDQPWTISTR 684
>gi|7648683|gb|AAF65623.1|AF182002_1 tyrosyl-DNA phosphodiesterase protein [Homo sapiens]
Length = 415
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 176/360 (48%), Gaps = 37/360 (10%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 70 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 129
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 130 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 185
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 186 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 245
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 246 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 305
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 306 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 355
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 356 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 415
>gi|225682330|gb|EEH20614.1| tyrosyl-DNA phosphodiesterase [Paracoccidioides brasiliensis Pb03]
Length = 628
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 146/572 (25%), Positives = 236/572 (41%), Gaps = 132/572 (23%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK- 215
+PS +L RV+ PA + NT V +RD++ +I NY+ D+D+L+ +
Sbjct: 69 IPSPIQLTRVRDFPASSENNTDTVKLRDILGDPLIKECWQFNYIFDVDFLMSQFDQDVRG 128
Query: 216 IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ V +IHG ES + E +R ++ +P +FGTHHSK M++I +
Sbjct: 129 LVKVKIIHGSWKRESPNRIHIDEGCRRYPNVEPMVAY--MPEAFGTHHSKMMVIIKHDDQ 186
Query: 268 VRIIVHTANLIHVDWNNKSQGLW-----------MQDFPLKDQNNLSEECGFENDLIDYL 316
+I++HTAN+I DW N Q +W ++ P N++ F+ DL+ Y
Sbjct: 187 AQIVIHTANMIAGDWANMCQAVWRSPMLPMLSNKRREHPSATPNDVGTGSRFKRDLLAYF 246
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGH 371
T H +K++FS+ LIAS P T L WG
Sbjct: 247 ETY----------GHNKTGALIEQLEKYDFSAIRAALIASAPSRQTIDELDSKRRTLWGW 296
Query: 372 MKLRTVLQECTFEKGFK---KSP-LVYQFSSLGSLDE--KWMAEL--------SSSMSSG 417
L+ +++ F+KG K K P +V Q SS+ +L + KW+ E S+ S
Sbjct: 297 PALKDTIRQIPFKKGTKSTEKQPQIVIQISSVATLGQTDKWLKETLFNSLSPPSARSSEL 356
Query: 418 F-SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW 472
F +E +P I++PT +++R SL GY +G +I S + +L+ Y +W
Sbjct: 357 FKTESNSPAKFS---IIFPTPDEIRRSLNGYMSGGSIHMKLQSAAQQKQLQYLRPYLCRW 413
Query: 473 ---------------------------------------KASH-----TGRSRAMPHIKT 488
K +H GR RA PHIKT
Sbjct: 414 AGDANDDGGVKSAGGPATSKRKRLEGNEVSESVQDGASLKKAHRPIREAGRRRAAPHIKT 473
Query: 489 FARYN-------------GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 535
+ R++ L+ AWGA ++ I SYE+GVL+ P
Sbjct: 474 YIRFSDTDMTTIDWAMVTSANLSLQAWGAAANAKKEIRICSYEIGVLVWPDLFVDEEIDD 533
Query: 536 CTSNIVPSEIK-------SGSTETSQIQKTKLVTLTWHGSSDAG------ASSEVVYLPV 582
++ + K SG T ++ +V +A +++ +V +
Sbjct: 534 SDEPLMKEKGKDNSRGEISGHKNTKDVKTAVMVPCFKRDMPEAAENEARSSNTTLVGFRM 593
Query: 583 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
PY+LP Y+++D PW Y++ D GQ W
Sbjct: 594 PYDLPLHSYAAKDQPWCATATYSEPDWLGQTW 625
>gi|392867268|gb|EAS29510.2| tyrosyl-DNA phosphodiesterase [Coccidioides immitis RS]
Length = 616
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 140/533 (26%), Positives = 224/533 (42%), Gaps = 117/533 (21%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL 230
N V+++D++ +I NY+ DID+L+ P + + + VIHG +S +
Sbjct: 103 NCDTVTLQDILGDPLIKECWQFNYLFDIDFLMKQFDPDVKNLIKIRVIHGSWKKDSPNRI 162
Query: 231 ---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKS 286
E R + I+ P P FGTHHSK M+LI + +II+HTAN+I DW N
Sbjct: 163 YIDEACARYQNVEPIIAYMPEP--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMC 220
Query: 287 QGLWM----------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
QG+W +D+ + F+ D++ YL A+G K
Sbjct: 221 QGVWRSPLLPLLPLDRDYDQSISGIIGSGRRFKRDILAYLD------------AYGRKKT 268
Query: 337 NPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGF-- 387
P KK++F LIASVP +L WG ++ VL++ K
Sbjct: 269 GPLVEQLKKYDFDEVRAALIASVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSC 328
Query: 388 --KKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 443
KK +V Q SS+ SL +KW+ + + F+ P I++PT +++R S
Sbjct: 329 EPKKPRIVIQISSIASLGQTDKWLKD------TFFNALCPPSAAARFSIIFPTPDEIRRS 382
Query: 444 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------------------------ 475
L GY +G +I S + D+++ Y W
Sbjct: 383 LNGYRSGGSIHMKLQSAAQQKQFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDTSTP 442
Query: 476 ------HTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQLMIR 515
GR RA PHIKT+ R++ + L+ AWGA N ++ +
Sbjct: 443 NNTFVREAGRRRAAPHIKTYIRFSDAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVC 502
Query: 516 SYELGVLILP--------SAKRHGCGFSCT------SNIVPSEIKSGSTETSQIQKTKLV 561
S+E+GVL+ P +A R S + ++P + + S++++ +L
Sbjct: 503 SWEIGVLVWPDLFLNDPQTADRDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELE 561
Query: 562 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ G + A +V +PY LP + YSS D+PW +T+ D GQ W
Sbjct: 562 ESS-RGDDKSKAWRTLVGFRMPYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 613
>gi|258577075|ref|XP_002542719.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902985|gb|EEP77386.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 669
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 145/533 (27%), Positives = 227/533 (42%), Gaps = 114/533 (21%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG--ESDG---- 228
N + +RD++ +I N++ DID+L+ P + + V V+HG + D
Sbjct: 153 NGDTIKLRDILGDPLIKECWQFNFLFDIDFLMDQFDPDVKNLVKVKVVHGSWKKDAPNRI 212
Query: 229 -TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKS 286
E R + I+ P P FGTHHSK M+LI + ++++HTAN+I DW N
Sbjct: 213 RVDEQCSRYQNVEPIIAYMPEP--FGTHHSKMMVLIRHDDCAQVVIHTANMIPGDWANMC 270
Query: 287 QGLWMQD-FPLKDQNN-----LSEECG----FENDLIDYLSTLKWPEFSANLPAHGNFKI 336
Q +W PL NN ++ E G F+ DL+ YL A+G K
Sbjct: 271 QAVWKSPLLPLLSPNNDREPSITGEIGSGPRFKRDLLAYLE------------AYGRKKT 318
Query: 337 NP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEK---- 385
P K + F LIASVP SL WG L+ VL+ K
Sbjct: 319 GPLVEQLKNYGFDGIRAALIASVPSRQRFPSLDSRKETIWGWPALQDVLRRIPIHKQQPL 378
Query: 386 GFKKSPLVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC 442
K+S +V Q SS+ SL +KW+ E +S+ + D P + I++PT +++R
Sbjct: 379 QSKRSRIVIQISSIASLGQSDKWLKETFFASLYPHSAADGAP----QLSIIFPTPDEIRR 434
Query: 443 SLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWKAS----------------------- 475
SL GY +G + I S + D+++ Y W
Sbjct: 435 SLNGYGSGGSIHMKIQSSAQQKQLDYMRPYLCHWAGDSENNQTPVSATDVLTHDSAIDRY 494
Query: 476 --------HTGRSRAMPHIKTFARYNGQ-------------KLAKAAWGALQKNNSQLMI 514
GR RA PHIKT+ R++ + L+ AWGA ++ I
Sbjct: 495 PPKATPVREAGRRRAAPHIKTYIRFSDEDMRTIDWAMVTSANLSTQAWGAAINAKQEVRI 554
Query: 515 RSYELGVLILP------SAKRHGCGF-------SCTSNIVPSEIKSGSTETSQIQKTKLV 561
S+E+GVL+ P S +R+ G S + ++P + S S++++ ++
Sbjct: 555 CSWEIGVLVWPDLFCNGSERRNESGEENKDKAKSDYARMIPC-FRRDSPCLSEVERYEIE 613
Query: 562 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ + + G S +V +PY+LP + YS DVPW + + D GQ W
Sbjct: 614 ETSKKDADNTGVLSTLVGFRMPYDLPLKPYSPRDVPWCATASHKEPDWLGQTW 666
>gi|119178141|ref|XP_001240773.1| hypothetical protein CIMG_07936 [Coccidioides immitis RS]
Length = 531
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 140/533 (26%), Positives = 224/533 (42%), Gaps = 117/533 (21%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL 230
N V+++D++ +I NY+ DID+L+ P + + + VIHG +S +
Sbjct: 18 NCDTVTLQDILGDPLIKECWQFNYLFDIDFLMKQFDPDVKNLIKIRVIHGSWKKDSPNRI 77
Query: 231 ---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKS 286
E R + I+ P P FGTHHSK M+LI + +II+HTAN+I DW N
Sbjct: 78 YIDEACARYQNVEPIIAYMPEP--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMC 135
Query: 287 QGLWM----------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
QG+W +D+ + F+ D++ YL A+G K
Sbjct: 136 QGVWRSPLLPLLPLDRDYDQSISGIIGSGRRFKRDILAYLD------------AYGRKKT 183
Query: 337 NPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGF-- 387
P KK++F LIASVP +L WG ++ VL++ K
Sbjct: 184 GPLVEQLKKYDFDEVRAALIASVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSC 243
Query: 388 --KKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 443
KK +V Q SS+ SL +KW+ + + F+ P I++PT +++R S
Sbjct: 244 EPKKPRIVIQISSIASLGQTDKWLKD------TFFNALCPPSAAARFSIIFPTPDEIRRS 297
Query: 444 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------------------------ 475
L GY +G +I S + D+++ Y W
Sbjct: 298 LNGYRSGGSIHMKLQSAAQQKQFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDTSTP 357
Query: 476 ------HTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQLMIR 515
GR RA PHIKT+ R++ + L+ AWGA N ++ +
Sbjct: 358 NNTFVREAGRRRAAPHIKTYIRFSDAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVC 417
Query: 516 SYELGVLILP--------SAKRHGCGFSCT------SNIVPSEIKSGSTETSQIQKTKLV 561
S+E+GVL+ P +A R S + ++P + + S++++ +L
Sbjct: 418 SWEIGVLVWPDLFLNDPQTADRDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELE 476
Query: 562 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ G + A +V +PY LP + YSS D+PW +T+ D GQ W
Sbjct: 477 ESS-RGDDKSKAWRTLVGFRMPYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 528
>gi|290999837|ref|XP_002682486.1| predicted protein [Naegleria gruberi]
gi|284096113|gb|EFC49742.1| predicted protein [Naegleria gruberi]
Length = 320
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 145/280 (51%), Gaps = 33/280 (11%)
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDY 315
H+K ++ + +RI+V +ANL DW+ Q +W+QDFP K+ + + FEN L+++
Sbjct: 2 HAKLFIIEFDDFIRIVVSSANLTDFDWSFFKQCIWIQDFPKKENISNNNTNQFENTLVEF 61
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 375
W + + +P +F +K+++S+A LI S+PGYHT K+GH+ ++
Sbjct: 62 -----WTKLTDGIPG--------NFLRKYDYSNAKGELIPSIPGYHTNIEKDKYGHLAIK 108
Query: 376 TVLQECTFEK----GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL 431
++ F K K+SPL YQ SS+GS++ W+ ELSSS + +D
Sbjct: 109 KAIERMNFTKNEILNLKQSPLYYQMSSIGSMNLDWIKELSSSF---YLKDCNNFN----- 160
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK----YWAKWKASHTGRSRAMPHIK 487
IV+P++E V S G G I K + K +++ +A+H S+ + H++
Sbjct: 161 IVFPSLESVSSSHFGLRCGGMIHLKSKTFETSTFPKHLMTHYSPNQANHLAHSKILLHLE 220
Query: 488 T----FARYNGQKLAKAAWGALQKNNSQLMIRSYELGVLI 523
+ L++ A G LQKN +QL I +YELGV+
Sbjct: 221 NLKNGYIFVGSHNLSQPALGKLQKNGTQLYISNYELGVIF 260
>gi|347837882|emb|CCD52454.1| hypothetical protein [Botryotinia fuckeliana]
Length = 639
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 150/560 (26%), Positives = 229/560 (40%), Gaps = 117/560 (20%)
Query: 154 HVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV 212
H + + S F+L ++ LP +N VS++D++ +I NY+ D+D+L+
Sbjct: 96 HTKQRVVKSPFQLTTIRDLPDSSNVDTVSLKDILGDPLISECWEFNYLHDLDFLMEQFDE 155
Query: 213 -LAKIPHVLVIHG----ESDGTLEHMKR-NKPANWILHKPPLPISFGTHHSKAMLLIYPR 266
+ + V VIHG E L M++ ++ +N L +P FGTHHSK ML+I+
Sbjct: 156 DVRNLVRVNVIHGFWKREDHSRLNLMEQASRYSNIKLLTAYMPEMFGTHHSK-MLIIFRH 214
Query: 267 G--VRIIVHTANLIHVDWNNKSQGLWM---------QDFPLKDQNNLSEECGFENDLIDY 315
+II+HTAN+I DW N +Q LW + L + + + F+ D ++Y
Sbjct: 215 DCTAQIIIHTANMIPFDWTNMTQALWKSPHLPLLNPKKPTLVEASRIGSGSKFKLDFLNY 274
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFK---KFNFSSAAVRLIASVPGYHTGSSLKK---- 368
L I S + K++FS LIASVPG G+ L
Sbjct: 275 LRAYDTKRI-----------ICKSLIEQLLKYDFSEIKAALIASVPGKQ-GTELSPSQTG 322
Query: 369 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLG 426
WG L L+ + +V Q SS+ SL +KW+ ++ SE K+P
Sbjct: 323 WGWAGLTNALKSVPSHHNTQPE-IVIQVSSIASLGPTDKWLTHFFKAL----SESKSPRK 377
Query: 427 IGEPL-IVWPTVEDVRCSLEGYAAGNAIP----SPQKNVDKDFLKKYWAKW--------- 472
G I++PT ++VR S+ GYA+GNAI +P + +LK W
Sbjct: 378 TGSKFKIIFPTADEVRRSINGYASGNAIHTKILTPAQGKQLAYLKPMLCHWAGDGAQHSS 437
Query: 473 ---------------------KASHTGRSRAMPHIKTFARY------------------- 492
K R RA PHIKT+ R+
Sbjct: 438 SSSLSSNPPSESSQSFTSPELKTQEAYRRRAAPHIKTYIRFSSDSTSSSSSQKSIDWMLV 497
Query: 493 NGQKLAKAAWGALQKNNSQLMIRSYELGVLILP---SAKRHGCGFS---CTSNIVPS--- 543
L+K AWG + ++ I SYE+GVL+ P K++G C N PS
Sbjct: 498 TSANLSKQAWGESINSADKVRICSYEIGVLVWPDLWEEKQNGKNVKMVPCFGNDTPSIPF 557
Query: 544 -----EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE----VVYLPVPYELPPQRYSSE 594
EI + ++ L D E +V +PY+LP Y +
Sbjct: 558 VSPSLEIVGQKEIRVEGEEGHLKRKRCDAREDEKRQEESHTIIVGARMPYDLPLVSYGKD 617
Query: 595 DVPWSWDKRYTKKDVYGQVW 614
D+PW Y++ D G+ W
Sbjct: 618 DIPWCASASYSEPDWMGKTW 637
>gi|303310201|ref|XP_003065113.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240104773|gb|EER22968.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 616
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 137/536 (25%), Positives = 222/536 (41%), Gaps = 123/536 (22%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHGE--------- 225
N V+++D++ +I NY+ DID+L+ P + + + V+HG
Sbjct: 103 NCDTVTLQDILGDPLIKECWQFNYLFDIDFLMKQFDPDVKNLIRIRVVHGSWKKDSANRI 162
Query: 226 -SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWN 283
D H + +P I+ P P FGTHHSK M+LI + +II+HTAN+I DW
Sbjct: 163 YIDEACAHYQNVEP---IIAYMPEP--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWA 217
Query: 284 NKSQGLWM----------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN 333
N QG+W +D+ + F+ D++ YL A+G
Sbjct: 218 NMCQGVWRSPLLPLLPLDRDYDQSISGIIGSGRRFKRDILAYLD------------AYGR 265
Query: 334 FKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKG 386
K P KK++F LIASVP +L WG ++ VL++ K
Sbjct: 266 KKTGPLVEQLKKYDFDEVRAALIASVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQ 325
Query: 387 FKKSP----LVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDV 440
P +V Q SS+ SL +KW+ + + F+ P +++PT +++
Sbjct: 326 LSCEPEKPRIVIQISSIASLGQTDKWLKD------TFFNALCPPSAAARFSVIFPTPDEI 379
Query: 441 RCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS--------------------- 475
R SL GY +G +I S + D+++ Y W
Sbjct: 380 RRSLNGYRSGGSIHMKLQSAAQQKQFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDE 439
Query: 476 ---------HTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQL 512
GR RA PHIKT+ R++ + L+ AWGA N ++
Sbjct: 440 STPNNTFVREAGRRRAAPHIKTYIRFSDAEDMCTIDWAMVTSANLSTQAWGAAINANQEV 499
Query: 513 MIRSYELGVLILP--------SAKRHGCGFSCT------SNIVPSEIKSGSTETSQIQKT 558
+ S+E+GVL+ P +A R S + ++P + + S++++
Sbjct: 500 RVCSWEIGVLVWPDLFLNDPQTADRDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERL 558
Query: 559 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+L + G + A +V +PY LP + YSS D+PW +T+ D GQ W
Sbjct: 559 ELEEPS-RGDDKSKAWKTLVGFRMPYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 613
>gi|121715578|ref|XP_001275398.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus clavatus NRRL
1]
gi|119403555|gb|EAW13972.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus clavatus NRRL
1]
Length = 576
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 138/524 (26%), Positives = 227/524 (43%), Gaps = 98/524 (18%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS +L ++ L A + N V +RD++ +I N++ D+D+L+ + +
Sbjct: 80 IPSPIQLTHIRDLSAASGNNVDTVRLRDILGDPMIRECWQFNFLFDVDFLMNQFDEDVRR 139
Query: 216 IPHVLVIHG--ESDG-----TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ V V+HG + D E R I+ P P FGTHHSK M+L+ +
Sbjct: 140 LVQVKVVHGSWKKDAPNRIRIEEACPRYPNVEAIVAYMPEP--FGTHHSKMMILLRHDDL 197
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG-------FENDLIDYLSTL 319
++++HTAN+I DW N Q +W PL+ +++EE G F+ DL+ YL+
Sbjct: 198 AQVVIHTANMIPGDWANMCQAVWRSPLLPLQKVEHIAEEPGTIGSGARFKRDLLAYLN-- 255
Query: 320 KWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSL-----KKWGHM 372
+G K P +F+FSS LIASVP +SL WG
Sbjct: 256 ----------EYGAKKTGPLVKQLARFDFSSVRAALIASVPSKQKLASLDLQRKTLWGWP 305
Query: 373 KLRTVLQEC--TFEKGFKKSP--LVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLG 426
LR ++ T E+G + + ++ Q SS+ +L + KW+ ++ + S + + TP
Sbjct: 306 ALRETTRQIPLTREQGSETATPHIITQISSIATLGQTDKWLKDVFFN-SLAPTSNPTPPT 364
Query: 427 IGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW---------- 472
+ IV+PT +++R SL GY +G +I S ++ +++ Y W
Sbjct: 365 KSKYSIVFPTPDEIRRSLNGYGSGGSIHMKLQSTTQHKQLQYMRPYLRHWAGDSSTHSSD 424
Query: 473 --------KASHTGRSRAMPHIKTFARY--------------NGQKLAKAAWGALQKNNS 510
K GR RA PHIKT+ R+ L+ AWGA +N
Sbjct: 425 GRGETSTTKTQEAGRRRAAPHIKTYIRFADANRMNAIDWAMVTSANLSTQAWGAAVNSNG 484
Query: 511 QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSD 570
++ I S+E+GV++ P ++ + +Q K L
Sbjct: 485 EVRICSWEVGVMVWPQLFAEKAEQQQQQAMMVPCFRRDLPVDCPVQPAKCDVL------- 537
Query: 571 AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
V L +PY+LP Y +++VPW + + D GQ W
Sbjct: 538 -------VGLRMPYDLPLTSYRADEVPWCATATHMEPDWLGQTW 574
>gi|302500932|ref|XP_003012459.1| hypothetical protein ARB_01418 [Arthroderma benhamiae CBS 112371]
gi|291176017|gb|EFE31819.1| hypothetical protein ARB_01418 [Arthroderma benhamiae CBS 112371]
Length = 587
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 135/535 (25%), Positives = 230/535 (42%), Gaps = 93/535 (17%)
Query: 156 SRDKL-PSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP 211
SR K+ PS +L ++ + N C+ +RD++ +I NY+ D+D+++
Sbjct: 66 SRQKIIPSPIQLTHIRDISDSTGYNEGCIKLRDILGDPMIKECWQFNYLFDVDYIMGQFD 125
Query: 212 VLAK-IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI 263
K + + +IHG E+ + + KR A ++ P P FGTHHSK M+LI
Sbjct: 126 RDVKDLIQLKIIHGSWKKEAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILI 183
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDY 315
+ ++I+HTAN+I DW N +Q +W Q+ + + CG F+ DL+ Y
Sbjct: 184 RHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLSQSQVDDTCGVFGSSARFKRDLLAY 243
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-----HTGSSLKK 368
L A+ N IN ++++F + LIASVP +
Sbjct: 244 LE------------AYNNKTINILIRQLRRYDFGAVKALLIASVPTRLPVKEFDSNRRTL 291
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWMAE-----LSSSMSSGF 418
WG L+ + ++ ++ ++ Q SS+ +L +KW+ E L
Sbjct: 292 WGWPALKDAIGSIPIDRSSSQAQNPHIIVQVSSIATLGQTDKWLRETFLRSLCPQPEVNQ 351
Query: 419 SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKW-- 472
S + + I++PT +++R SL+GY +G + I SP + +L+ Y W
Sbjct: 352 SRSTSNVKFS---IIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQLAYLRHYLCHWAG 408
Query: 473 ---------------KASHTGRSRAMPHIKTFARYN-------------GQKLAKAAWGA 504
+ GR RA PHIKT+ R++ L+ AWGA
Sbjct: 409 DAEDPKNSDPATKSDRVREAGRRRAAPHIKTYIRFSDSDMNSIDWAMITSANLSTQAWGA 468
Query: 505 LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLT 564
++ I S+E+GVLI P R C+ + + + +K + K + +
Sbjct: 469 GANTQGEVRICSWEVGVLIWPDLFREENIEECSDSSLTNYVKMIPCFKRNVPSEKPLQTS 528
Query: 565 WHGSSDAGASSEV-----VYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ S+ S+ V L +PY+LP Y+ ++VPW + + D GQ W
Sbjct: 529 ENDSTKVTLHSDATNMTRVGLRMPYDLPLIPYTPQEVPWCATAVHREPDWMGQTW 583
>gi|296817701|ref|XP_002849187.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma otae CBS 113480]
gi|238839640|gb|EEQ29302.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma otae CBS 113480]
Length = 606
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 134/530 (25%), Positives = 233/530 (43%), Gaps = 89/530 (16%)
Query: 160 LPSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK- 215
+PS +L V+ +P N C+ +RD++ +I N++ D+D+++ K
Sbjct: 87 IPSPIQLTHVRDIPDSTGYNKDCIRLRDILGDPMIKECWQFNFLFDVDYIMGQFDRDVKD 146
Query: 216 IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ + ++HG E+ + + KR I+ +P FGTHHSK M+L+ +
Sbjct: 147 LVQLKIVHGSWKKEAPNKIAIDDACKRYPNVEAIVAY--MPELFGTHHSKMMVLVRHDDL 204
Query: 268 VRIIVHTANLIHVDWNNKSQGLW------MQDFPLKD-QNNLSEECGFENDLIDYLSTLK 320
+II+HTAN+I DW N +Q +W + F + D + ++ F+ DL+ YL+
Sbjct: 205 TQIIIHTANMIPRDWGNMTQAVWRSPLLPLSQFKMADSRGDIGSGARFKRDLLAYLN--- 261
Query: 321 WPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMK 373
A+ N KI+ ++++F LI+SVP L WG
Sbjct: 262 ---------AYNNKKIDMLIDQLQRYDFGEVKAALISSVPSRQPARELDSGKRTLWGWPA 312
Query: 374 LRTVLQECTFEKGFKKS---PLVYQFSSLGSL--DEKWMAE--LSSSMSSGFSEDKTPLG 426
L+ + + +V Q SS+ +L +KW+ E SS + D + +
Sbjct: 313 LKDAISSIPIRGNSSQRLEPQVVVQVSSIATLGQTDKWLKETFFSSLCPQSRASDTSNIS 372
Query: 427 IGEPLIVWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWKAS------- 475
+ I++PT +++R SL+GYA+G + I S + +L++Y +W
Sbjct: 373 STKFSIIFPTPDEIRRSLDGYASGGSIHMKIQSAAQQKQLQYLRRYLCRWAGDAAGQRDT 432
Query: 476 --------------HTGRSRAMPHIKTFARYN-------------GQKLAKAAWGALQKN 508
GR RA PHIKT+ R++ L+ AWGA
Sbjct: 433 NPATQPDKGSSIVREAGRKRAAPHIKTYIRFSDSGMTSIDWAMVTSANLSTQAWGAGANT 492
Query: 509 NSQLMIRSYELGVLILPS--AKRHGCGFSCTSNIVPSE-IKSGSTETSQIQKTKLVTLTW 565
++ I S+E+GVL+ P +R +S I P + I +T + + +
Sbjct: 493 QGEVRICSWEIGVLVWPDLFRERMTSKDKDSSTIHPVKMIPCFKCDTPSEKSLLCESDST 552
Query: 566 HGSSDAGASSEV-VYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ +S +GA++ + L +PY LP Y+ +DVPW + + D GQ W
Sbjct: 553 NSTSHSGATNMTRIGLRMPYNLPLVPYTHQDVPWCATAVHREPDWLGQTW 602
>gi|320034009|gb|EFW15955.1| tyrosyl-DNA phosphodiesterase [Coccidioides posadasii str.
Silveira]
Length = 559
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 138/533 (25%), Positives = 223/533 (41%), Gaps = 117/533 (21%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL 230
N V+++D++ +I NY+ DID+L+ P + + + V+HG +S +
Sbjct: 46 NCDTVTLQDILGDPLIKECWQFNYLFDIDFLMKQFDPDVKNLIRIRVVHGSWKKDSANRI 105
Query: 231 ---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKS 286
E R + I+ P P FGTHHSK M+LI + +II+HTAN+I DW N
Sbjct: 106 YIDEACARYQNVEPIIAYMPEP--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMC 163
Query: 287 QGLWM----------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
QG+W +D+ + F+ D++ YL A+G K
Sbjct: 164 QGVWRSPLLPLLPLDRDYDQSISGIIGSGRRFKRDILAYLD------------AYGRKKT 211
Query: 337 NPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFKK 389
P KK++F LIASVP +L WG ++ VL++ K
Sbjct: 212 GPLVEQLKKYDFDEVRAALIASVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSC 271
Query: 390 SP----LVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 443
P +V Q SS+ SL +KW+ + + F+ P I++PT +++R S
Sbjct: 272 EPEKPRIVIQISSIASLGQTDKWLKD------TFFNALCPPSAAARFSIIFPTPDEIRRS 325
Query: 444 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------------------------ 475
L GY +G +I S + D+++ Y W
Sbjct: 326 LNGYRSGGSIHMKLQSAAQQKQFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDESTP 385
Query: 476 ------HTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQLMIR 515
GR RA PHIKT+ R++ + L+ AWGA N ++ +
Sbjct: 386 NNTFVREAGRRRAAPHIKTYIRFSDAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVC 445
Query: 516 SYELGVLILP--------SAKRHGCGFSCT------SNIVPSEIKSGSTETSQIQKTKLV 561
S+E+GVL+ P +A R S + ++P + + S++++ +L
Sbjct: 446 SWEIGVLVWPDLFLNDPQTADRDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELE 504
Query: 562 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ G + A +V +PY LP + YSS D+PW +T+ D GQ W
Sbjct: 505 EPS-RGDDKSKAWKTLVGFRMPYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 556
>gi|195342204|ref|XP_002037691.1| GM18399 [Drosophila sechellia]
gi|194132541|gb|EDW54109.1| GM18399 [Drosophila sechellia]
Length = 539
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 172/359 (47%), Gaps = 49/359 (13%)
Query: 200 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 256
MVDI WLL +L K P +L+ ES L K + I K P P F T H
Sbjct: 162 MVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLNIGKFKQQVTAIRVKMPTP--FATSH 218
Query: 257 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 310
+K M L Y G +R+++ TANL DW+N++QGLW+ P+ E GF+
Sbjct: 219 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQ 278
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 368
DL+ YL K + + + + +FS+ V + SVPG H S++
Sbjct: 279 DLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHP 328
Query: 369 WGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLG 426
WGH +L +++ + E + P+V Q SS+GSL A + + +D T +G
Sbjct: 329 WGHARLASLVAKHAAPIED---RIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTSVG 385
Query: 427 IGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSR 481
+ +++P+ +V S +G G +P + DK +LK Y +WK+S RSR
Sbjct: 386 KLRQMPPFKMIYPSYGNVSGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRYRSR 445
Query: 482 AMPHIKTFARYN------------GQKLAKAAWGALQKNNS---QLMIRSYELGVLILP 525
AMPHIK++ R+N L+KAAWG K+++ L I +YE GVL LP
Sbjct: 446 AMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKSSNIQPCLRIANYEAGVLFLP 504
>gi|242787594|ref|XP_002481044.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces stipitatus
ATCC 10500]
gi|218721191|gb|EED20610.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces stipitatus
ATCC 10500]
Length = 577
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 146/568 (25%), Positives = 246/568 (43%), Gaps = 112/568 (19%)
Query: 127 LSSKKMR---QQDEQDNENGKNSEEALCNFHVSRDK-LPSTFRLLRVQGLPAWANTSCVS 182
L+S++ R Q +Q ++ K + E + R + +PS F+L ++ LP+ N V
Sbjct: 40 LTSRERRPPENQHDQHTDHIKRNNETNADIIEGRPRVIPSPFQLTHIRDLPSDKNVDTVQ 99
Query: 183 IRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IPHVLVIHG----ESDGTL---EHM 233
+ D++ +I NY D+D+++ K + V ++HG +S L E
Sbjct: 100 LHDILGDPMIRECWQFNYCFDVDFVMSQFDQDVKDLVQVKIVHGSWKQDSPNRLRIDEAC 159
Query: 234 KRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQ 292
R I+ P P FGTHHSK M+L+ + ++I+HTAN++ DW N SQ LW
Sbjct: 160 ARYPNVEPIVAYMPEP--FGTHHSKMMILLRHDDLAQVIIHTANMLAGDWTNMSQALWRS 217
Query: 293 D-FPLKDQ--NNLSEECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF-- 340
PL N +EE F+ DL+ YL EF +G K
Sbjct: 218 PLLPLSSTPYNPATEEAAVFGTGARFKRDLLAYL------EF------YGRRKTGSLVDQ 265
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKG--FKKSPLV 393
+KF+F + L+ASVP S + WG L+ L++ + + +V
Sbjct: 266 LRKFDFYAIRAVLVASVPSKERLSRMNSSQSTLWGWPALKDALRQISLSDNEHIEDPHVV 325
Query: 394 YQFSSLGSL--DEKWMAEL--SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 449
Q SS+ SL +KW+ ++ S S + + + IV+PT +++R SL GY +
Sbjct: 326 IQVSSIASLGQTDKWLKDVLFDSLCPSSILPNASKRCNPKFSIVFPTPDEIRRSLNGYGS 385
Query: 450 GNAIPSPQKNVDK----DFLKKYWAKW----------------------KASHTGRSRAM 483
G +I ++V + +++ Y W +++ GR RA
Sbjct: 386 GGSIHMKLQSVAQQKQLQYMRPYLCHWAGDQEQTPVRISRTNAEVPSNIQSTDAGRRRAA 445
Query: 484 PHIKTFARYNGQ--------------KLAKAAWGALQKNNSQLMIRSYELGVLILPSAKR 529
PHIKT+ R++ + L+ AWGA +N ++ I S+E+GVL+ P
Sbjct: 446 PHIKTYIRFSDKTKMDSIDWVMITSANLSTQAWGAAPNSNGEVRICSWEIGVLVWP---- 501
Query: 530 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE---VVYLPVPYEL 586
++ G + ++ K+V + +++ +V +PY+L
Sbjct: 502 --------------QLIVGDSPEPGAERPKMVPCFQKDRPELPNNNDITPIVGFRMPYDL 547
Query: 587 PPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
P RY +DVPW + + D GQ W
Sbjct: 548 PLARYGVQDVPWCATINHPEPDWLGQSW 575
>gi|226289717|gb|EEH45201.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 639
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 148/582 (25%), Positives = 236/582 (40%), Gaps = 155/582 (26%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK- 215
+PS +L RV+ PA + NT V +RD++ +I NY+ D+D+L+ +
Sbjct: 69 IPSPIQLTRVRDFPASSENNTDTVKLRDILGDPLIKECWQFNYIFDVDFLMSQFDQDVRG 128
Query: 216 IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ V +IHG ES + E +R ++ +P +FGTHHSK M++I +
Sbjct: 129 LVKVKIIHGSWKRESPNRIHIDEGCRRYPNVEPMVAY--MPEAFGTHHSKMMVIIKHDDQ 186
Query: 268 VRIIVHTANLIHVDWNNKSQGLW-----------MQDFPLKDQNNLSEECGFENDLIDYL 316
+I++HTAN+I DW N Q +W ++ P N++ F+ DL+ Y
Sbjct: 187 AQIVIHTANMIAGDWANMCQAVWRSPMLPMLSNKRREHPSATPNDVGTGSRFKRDLLAYF 246
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGH 371
T H +K++FS+ LIASVP T L WG
Sbjct: 247 ETY----------GHNKTGALIEQLEKYDFSAIRAALIASVPSRQTIDELDSKRRTLWGW 296
Query: 372 MKLRTVLQECTFEKGFK---KSP-LVYQFSSLGSL--DEKWMAEL--------SSSMSSG 417
L+ +++ F+KG K K P +V Q SS+ +L +KW+ E S+ S
Sbjct: 297 PALKDTIRQIPFKKGTKSTEKQPQIVIQISSVATLGQTDKWLKETLFNSLSPPSARSSEL 356
Query: 418 F-SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW 472
F +E +P I++PT +++R SL GY +G +I S + +L+ Y +W
Sbjct: 357 FKTESNSPAKFS---IIFPTPDEIRRSLNGYMSGGSIHMKLQSAAQQKQLQYLQPYLCRW 413
Query: 473 --------------------------------------KASH-----TGRSRAMPHIKTF 489
K +H GR RA PHIKT+
Sbjct: 414 AGDANDDGVKSAGGPATSKRKRLEGNEVSESVQDGASLKKAHRPIREAGRRRAAPHIKTY 473
Query: 490 ARYN-------------GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 536
R++ L+ AWGA ++ I SYE+GVL+ P
Sbjct: 474 VRFSDTDMTTIDWAMVTSANLSLQAWGAAANVKKEIRICSYEIGVLVWPRF--------- 524
Query: 537 TSNIVPSEIK-------------------SGSTETSQIQKTKLVTLTWHGSSDAG----- 572
IV EI SG T ++ +V +A
Sbjct: 525 ---IVDEEIDDSDEPLMKEKGKDNSRGEISGHKNTKDVKTAVMVPCFKRDMPEAAENEAR 581
Query: 573 -ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 613
+++ +V +PY+LP Y+++D PW Y++ D Y +
Sbjct: 582 SSNTTLVGFRMPYDLPLHSYAAKDQPWCATATYSEPDCYADM 623
>gi|401428160|ref|XP_003878563.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494811|emb|CBZ30114.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 682
Score = 139 bits (350), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 158/647 (24%), Positives = 245/647 (37%), Gaps = 208/647 (32%)
Query: 155 VSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIV-------AILSNYMVDIDWLL 207
V + + PS+ LLR +RD+ + D+ +LS+Y+ D+ WLL
Sbjct: 27 VPQGRAPSSCSLLR--------------LRDLFRCDLADPGECWQHILLSSYVTDLRWLL 72
Query: 208 PACPVLAKIPHVLVIHGESDGT---------------------------LEHMKRNKPAN 240
P L+ + LV+ GT + ++ A
Sbjct: 73 ATVPELSAVTGKLVVLSGEKGTATLRRTTGDPSSPYTATSPLMDRVNPFMAALREQARAT 132
Query: 241 WILH-----------KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL 289
LH +PPLP++FGTHH+K L + RG+RI + TANL+ DW KSQG+
Sbjct: 133 SALHTTLSRERLAVLEPPLPVAFGTHHTKMALCVNSRGLRISIFTANLVEQDWCWKSQGI 192
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLST------------LKWPEFSANL--------- 328
++QDFP K S + ++ ++ K EF A+L
Sbjct: 193 YLQDFPWKAATECSNDVAAGATVVKTAASSTSKGGNGSNTLTKGAEFVAHLRNYLMQCGV 252
Query: 329 -------------PAHGNFKI-NPSFFKKFNFSSAAVRLIASVPG---YHTGSSLKKWGH 371
A G I F +FS+AAV LI+SVPG Y + + G
Sbjct: 253 SLTTACASPTDAVSAAGPLGIFETDFLSHIDFSAAAVWLISSVPGTCAYGEVAPGYRVGL 312
Query: 372 MKLRTVLQEC--TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFSEDKTPL 425
+L VL+ T L +Q+SS GSL+ ++ L ++M S TP
Sbjct: 313 CRLAEVLRRSALTMATAPASVDLSWQYSSQGSLNLAFLNSLQAAMCGESVSVIESGDTPR 372
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG------- 478
G+ + +V+PT E+VR S EG+ G ++P + +F+ +W +S G
Sbjct: 373 GVRDVQVVYPTEEEVRNSWEGWRGGGSLPL-RVQCCHEFVNARLHRWGSSEEGHTAKRAF 431
Query: 479 -----------------------------------------RSRAMPHIKTFAR------ 491
R A+PHIK++A
Sbjct: 432 PRPAKVAAAHASREDAVDVDGVDSDGGEGTTASLTCSCAAYRQFALPHIKSYAAVAPDRS 491
Query: 492 ------YNGQKLAKAAWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNI 540
L++AAWG+L Q+ + Q ++RSYELGV+ + H S S +
Sbjct: 492 CVRWFLLTSANLSQAAWGSLSRKMNQRGSRQQLVRSYELGVIYDSHSAIHPSASSWFSVV 551
Query: 541 VPSEIKSGSTETSQ--IQKTKLVTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS---- 593
++I+ S S+ + +T L G ++ V L PY L P Y+S
Sbjct: 552 SKTKIELPSARNSRAMLYETPL-----------GVETQNVCLYTPYNLLCPTPYASTAAL 600
Query: 594 ---------------------EDVPWSWDKRYTKKDVYGQVWPRHFQ 619
DVPW D + +D YG + F+
Sbjct: 601 RARRDAPVEGEQAVAGSTLDCSDVPWVLDMPHRGRDAYGLDFEEAFE 647
>gi|169775023|ref|XP_001821979.1| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae RIB40]
gi|83769842|dbj|BAE59977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 570
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 127/522 (24%), Positives = 235/522 (45%), Gaps = 106/522 (20%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS F+L ++ L A + N V +R+++ +I NY+ D+D+++ + +
Sbjct: 85 IPSPFQLTHIRDLAASSDNNVDTVRLREILGDPMIRECWQFNYLHDVDFIMGQFDEDVRR 144
Query: 216 IPHVLVIHGESDGTLEHMKRNKPANWILHKPP------------LPISFGTHHSKAMLLI 263
+ V ++HG KR+ P + + +P +FGTHHSK M+L+
Sbjct: 145 LVKVKIVHGS-------WKRDAPNRVRIDEACSRYPNVEAVVAYMPEAFGTHHSKMMVLL 197
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE------CGFENDLIDY 315
+ V++++HTAN+I DW N Q +W PL+ ++ E+ F+ DL+ Y
Sbjct: 198 RHDDLVQVVIHTANMIPGDWTNMCQAVWRSPLLPLQKTDDRVEDLTLGSGARFKRDLLAY 257
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK----- 368
L+ +G K P +K++F + L+ASVP L
Sbjct: 258 LT------------EYGPKKTGPLVEQLRKYDFGAIRAALVASVPSKQKVDDLDSQKKTL 305
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDK 422
WG L+ ++++ + K+ +V Q SS+ +L +KW+ + + +S+S + +
Sbjct: 306 WGWPALKDIMRQIPPAQKTTKATTPHIVTQISSVATLGQTDKWLKDVMFASLSPASTSTR 365
Query: 423 TPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH-- 476
P + I++PT +++R SL GY +G +I S + +++ Y W H
Sbjct: 366 QP----KYSIIFPTADEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLRHWAGDHDT 421
Query: 477 ----------TGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQL 512
GR RA PHIKT+ R++ + L+ AWGA + ++
Sbjct: 422 AEPSHTSKQDAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNASGEV 481
Query: 513 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAG 572
I S+E+G+++ P + ++ +VP+ K + E + + ++ T
Sbjct: 482 RICSWEIGIVVWPQLYVQD---TESATMVPT-FKRDTPEPLENKDSETTPDT-------- 529
Query: 573 ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
V+ L +PY+LP Y++ D PW ++ + D GQ W
Sbjct: 530 ----VIGLRMPYDLPLTPYAAHDTPWCATAQHLEPDWLGQTW 567
>gi|307211794|gb|EFN87775.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 441
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 119/437 (27%), Positives = 194/437 (44%), Gaps = 75/437 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D+ G+I+ ++ Y++D++WL + + ++ +++GE E + N A
Sbjct: 49 ILDISFGEIVNSLHLTYILDVEWLCLQYLLAGQSTNMTILYGERRDE-EELDDNITA--- 104
Query: 243 LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLK 297
+H +P FG HHSK M+L Y G+R++V TANL DW N +QG+W+
Sbjct: 105 IHMK-MPFEFGCHHSKIMILQYKDNGIRVVVSTANLFFEDWQNSTQGIWISPHLPRLSKA 163
Query: 298 DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
++N F+ DL YLS+ + P K KK +FS+ V LIAS+
Sbjct: 164 AKHNGESLTNFKKDLQRYLSSYRNPA----------LKRWRKLVKKTDFSAINVCLIASI 213
Query: 358 PGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 416
PG H ++ WG+ KL VL Q T K ++ Q S++GS K+ + LS +
Sbjct: 214 PG-HFEHTVDLWGYKKLANVLSQHVTLPPDALKWSIIAQSSAVGSFGPKYGSWLSKEIVW 272
Query: 417 GFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIPSPQKNV--DKDFLKKYWAKW 472
+ + P ++P+V++ S + Y G + S + V + ++K Y +W
Sbjct: 273 SMTRETERDLNNYPKFQFIYPSVKNYEQSFD-YQNGTSCFSYSREVHSKQQWIKSYLYQW 331
Query: 473 KASHTGRSRAMPHIKTFARYNGQ------------KLAKAAWGALQKNNSQLMIRSYELG 520
KA+ T R +AMPHIK++ R + L+K AWG ++++ I +YE+G
Sbjct: 332 KAARTERDQAMPHIKSYTRISSDLKRIAWFVLTSANLSKGAWGVQREDD--YYITNYEVG 389
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
+ LP F T + + I
Sbjct: 390 IAFLPKFITRITTFPITDEDLTNSI----------------------------------F 415
Query: 581 PVPYELPPQRYSSEDVP 597
P+PY+LP Y S D P
Sbjct: 416 PIPYDLPLCPYDSSDSP 432
>gi|302797949|ref|XP_002980735.1| hypothetical protein SELMODRAFT_420273 [Selaginella moellendorffii]
gi|300151741|gb|EFJ18386.1| hypothetical protein SELMODRAFT_420273 [Selaginella moellendorffii]
Length = 197
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 69/148 (46%), Positives = 90/148 (60%), Gaps = 28/148 (18%)
Query: 209 ACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV 268
ACP L IP V++IHGES+ + MLL+YP GV
Sbjct: 71 ACPPLRTIPQVVMIHGESNVS-------------------------QLQSVMLLVYPTGV 105
Query: 269 RIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANL 328
R++VHTANLI++DWNNK+QGLWMQDFP K S+ FENDL+DYL+ L+W + ++
Sbjct: 106 RVVVHTANLINIDWNNKNQGLWMQDFPFKSMTGASD---FENDLVDYLTALEWLGCTVDV 162
Query: 329 PAHGNFKINPSFFKKFNFSSAAVRLIAS 356
HG KIN F+ F+FS+AAVRL+AS
Sbjct: 163 QHHGKMKINVGHFQNFDFSNAAVRLVAS 190
>gi|212543739|ref|XP_002152024.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces marneffei ATCC
18224]
gi|210066931|gb|EEA21024.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces marneffei ATCC
18224]
Length = 587
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 145/570 (25%), Positives = 235/570 (41%), Gaps = 109/570 (19%)
Query: 125 GELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLR-------VQGLPAWAN 177
G S+ + Q E ++ E + D L FR++R ++ LP N
Sbjct: 45 GRPSNARRDQNAESAPQDFDIKENTQIDIDREDDSLRDKFRIIRSPIQLTHIRDLPNDKN 104
Query: 178 TSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTL- 230
V + D++ +I NY D+D+++ + + V ++HG +S +
Sbjct: 105 IDTVQLHDILGDPMIRECWQFNYCFDVDFVMSQFDQDVRDLVQVKIVHGSWKQDSANRIR 164
Query: 231 --EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQ 287
E R I+ P P FGTHHSK M+L+ + ++I+HTAN++ DW N SQ
Sbjct: 165 IDEACARYPNVESIVAYMPEP--FGTHHSKMMILLRHDDHAQVIIHTANMLAGDWTNMSQ 222
Query: 288 GLWMQDF----PLKDQNNLSEECGF------ENDLIDYLSTLKWPEFSANLPAHGNFKIN 337
+W P++D + ++ F + DL+ YL EF +GN K
Sbjct: 223 AVWRSPLLSLSPIRDNSETAQAASFGTGARFKRDLLAYL------EF------YGNKKTR 270
Query: 338 PSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFE-KGFKK 389
+KF+F + LIASVP S WG L+ L++ + +
Sbjct: 271 SLVDQLRKFDFQAIRAALIASVPSKERISRADSSRSTLWGWPALKDTLRQVPLRIREKNQ 330
Query: 390 SP-LVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSL 444
P +V Q SS+ SL +KW+ ++ SE + P I++PT +++R SL
Sbjct: 331 CPHVVIQISSIASLGQTDKWLKDVLFDSLCLPSELPHTNKMPRPKYSIIFPTPDEIRRSL 390
Query: 445 EGYAAGNAIPSPQKNVDKD----FLKKYWAKW----------------------KASHTG 478
GY +G +I +++ + +++ Y +W + + G
Sbjct: 391 NGYGSGGSIHMKLQSITQQKQLQYMRPYLCQWAGDQKQTAMGTLHLNAESVYNSQRTDAG 450
Query: 479 RSRAMPHIKTFARY--------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLIL 524
R RA PHIKT+ R+ L+ AWGA +N ++ I S+E+GVL
Sbjct: 451 RRRAAPHIKTYIRFADKTKMDTIDWAMITSANLSTQAWGAAANSNGEVRICSWEIGVLFW 510
Query: 525 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 584
P I ST T + + T S D S +V +PY
Sbjct: 511 PEL------------IAGDPFNPNSTRTEMVPSFRKDTPDPTESEDV---SSIVGFRMPY 555
Query: 585 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+LP YS++DVPW + + D GQ W
Sbjct: 556 DLPLTPYSAQDVPWCATINHPEPDWLGQSW 585
>gi|340521404|gb|EGR51638.1| predicted protein [Trichoderma reesei QM6a]
Length = 1118
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 128/439 (29%), Positives = 204/439 (46%), Gaps = 71/439 (16%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKIPHVL 220
S ++L R++ LP N V +RD++ +I N++ DI ++L A + + L
Sbjct: 42 SPWQLTRIRDLPEELNRDTVRLRDILDDPLITECWQFNFLHDIPFVLSAFDDMVRNRVQL 101
Query: 221 -VIHG--ESDGTLEHMKRNKPA---NWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVH 273
V+HG + D + ++ A N LH P+P FGTHHSK M++ ++++H
Sbjct: 102 HVVHGFWKKDDESRIVLSDQAAQFHNVHLHCAPMPEMFGTHHSKMMVIFRSDDTAQVVIH 161
Query: 274 TANLIHVDWNNKSQGLWM---------QDFPLKDQNNLSEECG--FENDLIDYLSTLKWP 322
TAN+I DW N + +W QD + L G F+ DL++YL ++
Sbjct: 162 TANMIPKDWTNMTNAVWRSPRLPRLGEQDTLFQQGQQLPVGSGTRFKVDLLEYLR--QYE 219
Query: 323 EFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTVLQE 380
+ + +N F+FSS IASVPG H+ +S WG ++ L+
Sbjct: 220 LYRPTCKQLVDRLVN------FDFSSIRAAFIASVPGRHSFRDASRPAWGWAAVQRCLRC 273
Query: 381 CTFEKGFKKSPLVYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEP--LIVWPT 436
E+G +S +V Q SS+ +L K W L ++ + TP G P +V+PT
Sbjct: 274 VPVERG--QSQIVVQISSIATLGAKDDW---LQRTLFDSLATSLTP-NTGRPGFKVVFPT 327
Query: 437 VEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWK---------------ASHT 477
V+++R S++GYA+G + I SPQ+ +L+ W + +
Sbjct: 328 VDEIRNSIDGYASGRSIHTKIQSPQQIRQLGYLRPILHHWANDSAGGAKLPGEPSISGDS 387
Query: 478 GRSRAMPHIKTFARYN-----------GQKLAKAAWG-ALQKNNSQLMIRSYELGVLILP 525
GR RA PHIKT+ R+N ++K AWG AL + I S+E+GVL+ P
Sbjct: 388 GRDRAAPHIKTYIRFNESNTIDWAMLTSANMSKQAWGEALSSTTGNIRIASWEVGVLVWP 447
Query: 526 SAK-RHGCGFSCTSNIVPS 543
G S ++VPS
Sbjct: 448 GLLCEDGAMVSSPKSLVPS 466
>gi|302894143|ref|XP_003045952.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256726879|gb|EEU40239.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1086
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 126/428 (29%), Positives = 196/428 (45%), Gaps = 83/428 (19%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPAC-PVLAKI 216
+ S ++L +Q L N VS+RD++ GD ++A N++ DI +L+ A P +
Sbjct: 38 IKSPWQLTWIQDLSEEDNRDAVSLRDLL-GDPLIAECWEFNFLHDIHFLMDAFDPDTRHL 96
Query: 217 PHVLVIHG------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
V V+HG ES +E N +H P+P FGTHHSK M+L + +
Sbjct: 97 VKVHVVHGFWKREDESRIAIEQAAAEF-NNVQIHIAPMPEMFGTHHSKMMILFRHDDTAQ 155
Query: 270 IIVHTANLIHVDWNNKSQGLWM------------------QDFPLKDQNNLSEECGFEND 311
+I+HTAN+I DW N + G+W +D P+ + F+ D
Sbjct: 156 VIIHTANMISKDWTNMTNGIWKSPLLPKMTVAPTHTTSSPEDHPVGSGDR------FKID 209
Query: 312 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--W 369
L++YL + + K ++FSS L+ASVPG H L + W
Sbjct: 210 LLNYLRAYDRRKITC--------KALTDELVHYDFSSIKAALVASVPGRHNIRDLSETSW 261
Query: 370 GHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGI 427
G L+ LQ+ E ++S +V Q SS+ +L E W L ++ S K P +
Sbjct: 262 GWAALKRCLQQVPCEDQ-EQSEIVVQISSIATLGAKEDW---LKKTLFEPLSRCKNP-SL 316
Query: 428 GEP--LIVWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWK-------- 473
G+P +V+PT +++R SL+GYA+G + I S Q+ ++L+ + W
Sbjct: 317 GKPKFKVVFPTADEIRRSLDGYASGGSIHTKIQSAQQAKQLEYLRPIFHHWANDSPSGAK 376
Query: 474 ------ASHTGRSRAMPHIKTFARYN----------GQKLAKAAWGALQKNNSQLMIRSY 517
GR RA PHIKT+ R N L+K AWG + ++ I S+
Sbjct: 377 LPEGATVKDGGRKRAAPHIKTYIRSNKSSIDWALLTSANLSKQAWGEAARPTGEMRIASW 436
Query: 518 ELGVLILP 525
E+GVL+ P
Sbjct: 437 EIGVLVWP 444
>gi|312378421|gb|EFR25002.1| hypothetical protein AND_10059 [Anopheles darlingi]
Length = 436
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 126/436 (28%), Positives = 193/436 (44%), Gaps = 68/436 (15%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHV--LVIHGESDGTLEHMKRNKPANWILHKP 246
G + ++ N+MVDI WLL A A +V L+++G+ L + + KP N K
Sbjct: 42 GQLESSVQMNFMVDIGWLL-AHYYFAGYENVPLLILYGDETPELRMVSKKKP-NVTAVKV 99
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 305
+ G HH+K L Y G +RI++ TANL DW+N++QGLW+ P +
Sbjct: 100 DIKTPVGVHHTKMGLYGYRDGSMRIVISTANLYEDDWHNRTQGLWIS--PRLPAVPEDAD 157
Query: 306 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTG 363
F + D+ S L A L A+ ++ P + ++ +FS V L+ASVPG H
Sbjct: 158 TAFGESVTDFRSNLL-----AYLDAYKLTQLQPWIARIRRTDFSDIKVCLVASVPGGHVN 212
Query: 364 SSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDK 422
+ WGH +L +L + PLV Q SS+GSL + + + + F +D
Sbjct: 213 TPKGPLWGHARLGYLLTKYAAPID-DSCPLVAQSSSIGSLGPSPESWVLGEIMANFRKDS 271
Query: 423 TPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTG 478
P+GI +++P+ +VR S + G +P + K ++LK Y +W
Sbjct: 272 APIGIRRMPGFRMIYPSYSNVRQSHDSLLGGGCLPYGRATHSKQEWLKTYLHQWFCRSRH 331
Query: 479 RSRAMPHIKTFARYNGQ----------KLAKAAWGALQKN---NSQLMIRSYELGVLILP 525
R++AMPHIKT+ R++ + L+K+AWG K L I SYE GVL LP
Sbjct: 332 RNKAMPHIKTYCRWSHRGLYWFLLTSANLSKSAWGVYNKAGRFEKPLRINSYEAGVLFLP 391
Query: 526 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 585
N P E A + P+PY+
Sbjct: 392 K-------LLLDENFFPME----------------------------AGKKDPQFPMPYD 416
Query: 586 LPPQRYSSEDVPWSWD 601
+P Y+ ED P+ D
Sbjct: 417 VPIIPYAPEDTPFFMD 432
>gi|7648685|gb|AAF65624.1|AF182003_1 tyrosyl-DNA phosphodiesterase protein [Homo sapiens]
Length = 189
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 112/210 (53%), Gaps = 45/210 (21%)
Query: 420 EDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHT 477
E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +
Sbjct: 7 ESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETS 66
Query: 478 GRSRAMPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
GRS AMPHIKT+ R L+KAAWGAL+KN +QLMIRSYELGVL LP
Sbjct: 67 GRSNAMPHIKTYMRPSPDFSKIAWFRVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLP 126
Query: 526 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 585
SA F S V + +GS E + PVPY+
Sbjct: 127 SA------FGLDSFKVKQKFFAGSQE------------------------PMATFPVPYD 156
Query: 586 LPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 157 LPPELYGSKDRPWIWNIPYVKAPDTHGNMW 186
>gi|156549662|ref|XP_001604678.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Nasonia
vitripennis]
Length = 573
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 180/378 (47%), Gaps = 61/378 (16%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G++I ++ N+M ++ WL+ + ++P + V++G +W+
Sbjct: 113 IIDYTTGELIDSLHINFMAEMLWLINEYMLAVQVPKMTVLYG---------------SWL 157
Query: 243 ----LHKPPLPISF--------GTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGL 289
+++ P I F G HHSK + Y +RI++ ++N+ DW +++QGL
Sbjct: 158 DPDMMYEIPFDIEFVNVEMSEFGCHHSKISIFKYTGDKIRIMISSSNIYAEDWQSRTQGL 217
Query: 290 WMQDF-PL--KDQNNLSEE--CGFENDLIDYLSTLKWPE-FSANLPAHGNFKINPSFFKK 343
W+ F PL +D N E F+ D + YLS PE F + H +
Sbjct: 218 WISPFLPLLPEDANESDGESPTNFKRDFLQYLSMYNQPEVFGWSALIH-----------R 266
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ-ECTFEKGFKKSPLVYQFSSLGSL 402
+ S+ V IASVPG+H GSSL WGH KL +L + +K P++ Q SS+G
Sbjct: 267 ADCSAINVFFIASVPGHHDGSSLDTWGHRKLAALLSAHASLPSDAQKWPVIAQSSSVGVF 326
Query: 403 DEKWMAELSSSMSSGFS--EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN- 459
+ + LSSS+ S +DK + E ++P+ + S + + + ++N
Sbjct: 327 GPDYQSWLSSSIVRTMSKEKDKKIIIFPEFKFIYPSKNNYNQSYDNQIGSSCLMYNEQNY 386
Query: 460 VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQK 507
+ + +LK Y +WK+ GR++AMPH+K + R + L+K A G + +
Sbjct: 387 LKQQWLKDYLYQWKSDKIGRTQAMPHLKCYTRISPDESEMAWFFLTSANLSKGAMGKMLR 446
Query: 508 NNSQLMIRSYELGVLILP 525
N + + +YE GVL LP
Sbjct: 447 NCTVQTLCNYEAGVLFLP 464
>gi|332029125|gb|EGI69136.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 522
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 174/365 (47%), Gaps = 39/365 (10%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N++VD++WL + + + +++G D N N
Sbjct: 113 ILDCSLGEIVYSLHLNFIVDVEWLCWQYLLAGQCTDMTILYG--DKAYYQTLFN---NIT 167
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKDQ 299
+ K + F HH+K M+L Y G+R+IV TANL DW N +QGLW+ P L +
Sbjct: 168 IIKVNIETGFACHHTKIMILQYKDDGIRVIVSTANLRSTDWENVTQGLWISPHLPRLPES 227
Query: 300 NNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
N S+ GF+ DL YLS + P + + A + +FS V LIAS
Sbjct: 228 ANPSDGESPTGFKKDLERYLSKYEQPTLTQWICA----------VQMADFSKVNVFLIAS 277
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
VPG + + WG+ KL VL + T P+V Q SS+G L + + L +
Sbjct: 278 VPGIYQNNEANFWGYKKLAHVLSRHVTLPSDVFPWPIVAQSSSIGKLGSSFESWLLKDII 337
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
S + T G+P ++P++++ + S P S + + + +L Y +W
Sbjct: 338 PCMSRESTESTKGQPEFKFIYPSIQNYKQSFHYKNLSWCSPYSAEAHSKQQWLDLYLHQW 397
Query: 473 KASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYELG 520
KA T R RAMPHIK++ R + L+KAAWG+++++ I +YE G
Sbjct: 398 KAKRTERDRAMPHIKSYTRISPDLKSIPWFVLTSANLSKAAWGSIKRHGYS--IENYEAG 455
Query: 521 VLILP 525
++ +P
Sbjct: 456 IIFVP 460
>gi|398021965|ref|XP_003864145.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania donovani]
gi|322502379|emb|CBZ37463.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania donovani]
Length = 682
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 146/617 (23%), Positives = 238/617 (38%), Gaps = 194/617 (31%)
Query: 177 NTSCVSIRDVIQGDIIVA-------ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
+ S + +RD+ + D+ +LS+Y+ D+ WLL P L+ + LV+ GT
Sbjct: 35 SCSLLRLRDLFRCDVADPGECWQHILLSSYVTDLPWLLATVPELSAVTGKLVLLSGEKGT 94
Query: 230 ---------------------------LEHMKRNKPANWILH-----------KPPLPIS 251
+ ++ A LH +PPLP++
Sbjct: 95 ATLRRTTGDSSCPYTAASPLMDRVNPFMAALREQARATSALHTTLSRERLAVLEPPLPVA 154
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEND 311
FGTHH+K L + RG+R+ + TANL+ DW KSQG+++QDFP K S + +
Sbjct: 155 FGTHHTKMALCVNGRGLRVSIFTANLVEQDWCRKSQGIYVQDFPWKTATVRSNDDSADAT 214
Query: 312 LIDYLST------------LKWPEFSANL-----------------PAHGNFKINP---- 338
+++ ++ K EF A+L P P
Sbjct: 215 MVETATSSTSNSNNGSNTFTKGAEFVAHLRHYLMQCGVSLAAACASPTDAASAAGPLGIF 274
Query: 339 --SFFKKFNFSSAAVRLIASVPGYHTGSSL---KKWGHMKLRTVLQECTFEKGFKKSP-- 391
F +FS+AAV L++SVPG + + + G +L VL+ +
Sbjct: 275 ETDFLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRVGLCRLAEVLRRSALTMATSPASVD 334
Query: 392 LVYQFSSLGSLDEKWMAELSSSM----SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
L +Q+SS GSL+ ++ L ++M ++ P G+ + +V+PT E+VR S EG+
Sbjct: 335 LSWQYSSQGSLNPAFLNSLQAAMCGESAAVIESGDAPRGVRDVQVVYPTEEEVRNSWEGW 394
Query: 448 AAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----------------------------- 478
G ++P + +F+ +W +S G
Sbjct: 395 RGGMSLPL-RVQCCHEFVNARLHRWGSSEEGHTAKRAFPRPPKVAAAHASREDAVDVDGV 453
Query: 479 -------------------RSRAMPHIKTFAR------------YNGQKLAKAAWGAL-- 505
R A+PHIK++A L++AAWG+L
Sbjct: 454 DIDGGEETTASLAGSCAADRQFALPHIKSYAAVAPDRSCVRWFLLTSANLSQAAWGSLSR 513
Query: 506 ---QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ--IQKTKL 560
Q+ + Q ++RSYELGVL + + S S + S+I+ + S+ + +T L
Sbjct: 514 KVNQRGSRQQLVRSYELGVLYDSHSAIYPSASSWFSVVAESKIELPNARNSRAMLYETPL 573
Query: 561 VTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS-------------------------E 594
G ++ V L +PY L P Y+S
Sbjct: 574 -----------GVDTQDVCLYIPYNLLCPTPYASTAALRAHRHAPDEGEQAVEEAALDFS 622
Query: 595 DVPWSWDKRYTKKDVYG 611
DVPW D + KD YG
Sbjct: 623 DVPWVLDMPHRGKDAYG 639
>gi|156034731|ref|XP_001585784.1| hypothetical protein SS1G_13301 [Sclerotinia sclerotiorum 1980]
gi|154698704|gb|EDN98442.1| hypothetical protein SS1G_13301 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 633
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 154/626 (24%), Positives = 254/626 (40%), Gaps = 145/626 (23%)
Query: 113 SQKRVSNDGATNGELSSKKMRQ--------------------QDEQDNENGKNSEEALCN 152
+QKR D TN +++ K +R+ Q+E E+ S + +
Sbjct: 27 AQKRRKVDDNTNDDINEKGVRRGMNRSISPPPLRRYRKEIPIQEEGSLEHKVESSKQTSS 86
Query: 153 FHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPAC 210
+ + S F+L ++ LPA +N VS++D++ GD +++ NY+ ++D+L+
Sbjct: 87 KITKQKVVKSPFQLTSIRDLPASSNVDTVSLKDIL-GDPLISECWEFNYLHNLDFLMGQF 145
Query: 211 PV-LAKIPHVLVIHG----ESDGTLEHMKRN-KPANWILHKPPLPISFGTHHSKAMLLI- 263
+ + V V+HG E L M++ K +N L +P FGTHHSK ++L
Sbjct: 146 DEDVRNLVKVNVVHGFWKREDQSRLNLMEQALKYSNVKLLTAYMPEMFGTHHSKMLILFR 205
Query: 264 YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPL--------KDQNNLSEECGFENDLID 314
+ ++I+HTAN+I DW N +Q +W PL K+ + F+ DL++
Sbjct: 206 HDSTAQVIIHTANMIPFDWTNMTQAMWKSPLLPLLDPEKPNPKESGQMGSGSKFKIDLLN 265
Query: 315 YLSTLKWPEFSANLPAHGNFKINPSFFK---KFNFSSAAVRLIASVPG---YHTGSSLKK 368
YL H I + K +FS L+AS PG S+
Sbjct: 266 YLGAY-----------HTKRAICKPLIEQLSKHDFSEIRAALVASTPGKQDIELDSTETA 314
Query: 369 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLG 426
WG L ++L+ K + +V Q SS+ SL +KW L+ + S K P
Sbjct: 315 WGWAGLSSILKSIPCSK--TQPEIVVQISSIASLGPTDKW---LNQTFFKALSTSKDPSP 369
Query: 427 IGEPLIVWPTVEDVRCSLEGYAAGNAIP----SPQKNVDKDFLKKYWAKWKA-------- 474
+ I++PT +++R S+ GY++G+AI + + +LK W
Sbjct: 370 KPKFKIIFPTADEIRRSINGYSSGSAIHTKILTSAQGKQLAYLKPLLCHWAGDGEQHSST 429
Query: 475 -----------------------------SHTGRSRAMPHIKTFARYNGQ---------- 495
+ R RA PHIKT+ R++
Sbjct: 430 SQTSSTSESATSSNTSNIALSPHMASPPPQNAHRKRAAPHIKTYIRFSSSSHKTIDWMLV 489
Query: 496 ---KLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVP---SEIKSGS 549
L+K AWG ++ I SYE+GV++ P G S +VP ++I S
Sbjct: 490 TSANLSKQAWGENINTAGEVRICSYEIGVIVWPGLWDEG----NKSKMVPCFGTDIPSRP 545
Query: 550 TETSQIQKTKLVTLT--------------WHGSSDAGASSE-------VVYLPVPYELPP 588
TS+++ T V T G + SE ++ +PY+LP
Sbjct: 546 DVTSELESTVAVEATSVTADNNNIREKGKGKGREEIEKKSENDTENTILIGARIPYDLPL 605
Query: 589 QRYSSEDVPWSWDKRYTKKDVYGQVW 614
Y+ D+PW Y++ D G W
Sbjct: 606 IPYTKSDIPWCASASYSEPDWMGNTW 631
>gi|46123591|ref|XP_386349.1| hypothetical protein FG06173.1 [Gibberella zeae PH-1]
Length = 1094
Score = 135 bits (339), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 127/421 (30%), Positives = 198/421 (47%), Gaps = 71/421 (16%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIP 217
+PS ++L +Q LP N VS+RD++ +I N++ DI +L+ A P +
Sbjct: 38 IPSPWQLTWIQDLPESENKDAVSLRDLLGDPLISECWEFNFLHDIPFLMNAFDPDTRHLV 97
Query: 218 HVLVIHGESDGTLEHMKRNKPA---------NWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+V ++HG +H +N+ A N +H P+P FGTHHSK M+L +
Sbjct: 98 NVHLVHG----FWKHEDKNRIALENAAAKFENVNVHIAPMPEMFGTHHSKMMILFRHGDT 153
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDFPL-----KDQNNLSEECGF-----ENDLIDYLS 317
++I+HTAN+I DW N + G+W PL K Q S F E ID L+
Sbjct: 154 AQVIIHTANMIPKDWTNMTNGVWKS--PLLPRMSKTQTPASSPEEFLVGSGERFKIDLLN 211
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLR 375
LK+ + + + K+ K+++FS+ LIASVPG H + + WG L+
Sbjct: 212 YLKFYDKRKIICKPLSDKL-----KQYDFSTIKAALIASVPGRHDAHDMSETSWGWAALK 266
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEPL-- 431
L+ + S +V Q SS+ +L K W L ++ K G+ P
Sbjct: 267 RCLRHVPCHQ-HGDSDIVVQVSSIATLGPKDDW---LQKTLFDHLGRCKD-TGLRRPRFK 321
Query: 432 IVWPTVEDVRCSLEGYAAG----NAIPSPQKNVDKDFLKKYWAKWK-------------A 474
+V+PT +++R SL+GYA+G I SPQ+ ++L+ + W
Sbjct: 322 VVFPTADEIRRSLDGYASGLSIHTKIQSPQQAKQLEYLRPMFHHWANDSPGGTKLPDGPV 381
Query: 475 SHTGRSRAMPHIKTFARYN----------GQKLAKAAWGALQKNNSQLMIRSYELGVLIL 524
+GR RA PHIKT+ R N ++K AWG + ++ I S+E+GVLI
Sbjct: 382 LESGRKRAAPHIKTYVRSNKSSIDWGLLTSANISKQAWGEAARPTGEMRIASWEVGVLIW 441
Query: 525 P 525
P
Sbjct: 442 P 442
>gi|320590454|gb|EFX02897.1| tyrosyl-DNA phosphodiesterase [Grosmannia clavigera kw1407]
Length = 553
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 141/538 (26%), Positives = 223/538 (41%), Gaps = 98/538 (18%)
Query: 144 KNSEEALCNFHVSRD---KLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NY 199
+N EEA + S D + S F+L ++ LPA N V++ ++ ++ NY
Sbjct: 45 RNGEEA--HDSTSTDAGVRFRSPFQLTAIRDLPAEDNVDTVTVDEIFGSPLVAECWEFNY 102
Query: 200 MVDIDWLLPAC-----PVLAKIPHVLVIHGESDGTLE-HMKRNKPANWILHKPPLPISFG 253
+ DI + + A ++ E LE + + AN LH +P FG
Sbjct: 103 LHDIGFFMDALNEDVRHLVHVHVVHGFWKREDQRRLELEAEAARYANVQLHTAFMPEPFG 162
Query: 254 THHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW--------MQDFPLKDQNNLSE 304
THHSK A+L + +++++TAN+I DW N +QG+W D +D++ +
Sbjct: 163 THHSKMAVLFRHDDTAQVVIYTANMIPHDWANMTQGVWRSPLLPLLADDVDGEDESEIDG 222
Query: 305 ECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
G F+ DL+ YL S P +++F++ LIASVPG
Sbjct: 223 PVGSGRRFKTDLLSYLRAYN-QRRSICRPLVERLA-------RYDFAAVQAALIASVPGR 274
Query: 361 HT------GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD--EKW------ 406
H+ +WG L+ L+ + + +V Q SS+ +L + W
Sbjct: 275 HSLIRQPDEKYHTQWGWTALKNTLRSVPVQAVAPSTEIVLQVSSMATLGPTDAWIRHTLF 334
Query: 407 --MAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSP----QKNV 460
MA SS++ G S K L V+PT +++R SLEGY +G +I + Q+
Sbjct: 335 SAMATASSAVDKGGSIGKEELQQPRFRAVFPTADEIRRSLEGYKSGTSIHTKIQSSQQQR 394
Query: 461 DKDFLKKYWAKWKASH--------------TGRSRAMPHIKTFARY----------NGQK 496
+++ W GR RA PHIKT+ RY
Sbjct: 395 QLQYMRPLLCHWANDSPDGAKLPDGATPIVNGRKRAAPHIKTYVRYGQVGVDWALLTSAN 454
Query: 497 LAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 556
L+K AWG ++ + S+E+GV++ P F+ T+ + +I GS Q
Sbjct: 455 LSKQAWGEAVTAAGEVRVASWEIGVMVWPGL------FAETAVM---QIVGGSDSVLQPA 505
Query: 557 KTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
K A VV L VPY+LP Q+Y ++PW + D GQ W
Sbjct: 506 TGK------------AAGRPVVALRVPYDLPLQQYGKGEIPWVCTLPDEEPDWTGQAW 551
>gi|195177151|ref|XP_002028871.1| GL22360 [Drosophila persimilis]
gi|194104354|gb|EDW26397.1| GL22360 [Drosophila persimilis]
Length = 946
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 167/337 (49%), Gaps = 48/337 (14%)
Query: 197 SNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP-ANWILHKPPLPISF 252
S +MVDI WLL +L K +LV++G+ L + + KP I K P P F
Sbjct: 186 SIFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQVTAIGVKMPTP--F 241
Query: 253 GTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL-----KDQNNLSEE- 305
T H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL +D + + E
Sbjct: 242 ATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLLPALSEDADTAAGES 299
Query: 306 -CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
GF DL+ YL K + + + +K +FS+ V + SVPG H
Sbjct: 300 LTGFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFVGSVPGGHREG 349
Query: 365 SLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDK 422
S++ WGH +L ++L + + P+V Q SS+GSL A + + +D
Sbjct: 350 SVRGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLRKDS 408
Query: 423 TPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHT 477
+P G + +++P+ +V S +G G +P + DK +LK + +WK+S
Sbjct: 409 SPGGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQPWLKAHLQQWKSSDR 468
Query: 478 GRSRAMPHIKTFARYN------------GQKLAKAAW 502
RSRAMPHIKT++RYN L+KAAW
Sbjct: 469 HRSRAMPHIKTYSRYNLTDQSIYWFVLTSANLSKAAW 505
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 142/291 (48%), Gaps = 35/291 (12%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP- 238
I D G+I ++ N+MVDI WLL +L K +LV++G+ L + + KP
Sbjct: 651 ILDESLGEIESSVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQ 708
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL- 296
I K P P F T H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL
Sbjct: 709 VTAIGVKMPTP--FATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLL 764
Query: 297 ----KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+D + + E GF DL+ YL K + + + +K +FS+
Sbjct: 765 PALSEDADTAAGESLTGFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAIN 814
Query: 351 VRLIASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA 408
V + SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A
Sbjct: 815 VFFVGSVPGGHREGSVRGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQA 873
Query: 409 ELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPS 455
+ + +D +P G + +++P+ +V S +G G +PS
Sbjct: 874 WIQQDFVNSLRKDSSPGGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPS 924
>gi|71001518|ref|XP_755440.1| tyrosyl-DNA phosphodiesterase [Aspergillus fumigatus Af293]
gi|66853078|gb|EAL93402.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus fumigatus
Af293]
gi|159129510|gb|EDP54624.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus fumigatus
A1163]
Length = 564
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 135/528 (25%), Positives = 221/528 (41%), Gaps = 110/528 (20%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS +L ++ L A + N V ++D++ +I N++ D+D+L+ + +
Sbjct: 72 IPSPIQLSHIRDLSAASGNNVDTVRLKDILGDPLIRECWQFNFLFDVDFLMSQFDEDVRR 131
Query: 216 IPHVLVIHGESDGTLEHMKRNKPA-----NWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
+ V V+HG + R + A N +P FGTHHSK M+L+ + +
Sbjct: 132 LVQVKVVHGSWKKDAPNRIRIEEACPRYPNVEAITAYMPEPFGTHHSKMMILLRHDDLAQ 191
Query: 270 IIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG-------FENDLIDYLSTLKW 321
+++HTAN+I DW N Q +W PL+ E G F+ DL+ YL+
Sbjct: 192 VVIHTANMIPGDWANMCQAVWRSPLLPLRKSGREPEGPGAIGSGVRFKRDLLAYLN---- 247
Query: 322 PEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKL 374
+G K P ++F+FS+ LIASVP SSL WG L
Sbjct: 248 --------EYGVKKTGPLVRQLERFDFSAVRAALIASVPSKQRLSSLDSQKKTLWGWPAL 299
Query: 375 RTVLQECTFEKGFK----KSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIG 428
+ ++ K +S +V Q SS+ SL + KW+ ++ S + I
Sbjct: 300 KEATRQIPLTPKGKSQTVQSHIVTQISSIASLGQTDKWLKDV---FFPSLSPTPSMASIP 356
Query: 429 EPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------- 475
+P I++PT +++R SL GY +G +I S + +++ Y W
Sbjct: 357 QPKFSIIFPTPDEIRRSLNGYGSGGSIHMKLQSATQQKQLQYMRPYLRHWAGDSDSSSST 416
Query: 476 -----HTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQLMIRS 516
GR RA PHIKT+ R++ + L+ AWGA N ++ I S
Sbjct: 417 STPQREAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNNAGEVRISS 476
Query: 517 YELGVLILP--------SAKRH--GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH 566
+E+GV++ P + +RH C +P ++
Sbjct: 477 WEIGVIVWPQLFVHEDNTTERHQQAVMVPCFKRDIPLQL--------------------- 515
Query: 567 GSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
D +V L +PY+LP Y + +VPW +T+ D GQ W
Sbjct: 516 -PEDMPRCDVLVGLRMPYDLPLIPYKANEVPWCATIAHTEPDWLGQTW 562
>gi|332029127|gb|EGI69138.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 542
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 121/442 (27%), Positives = 192/442 (43%), Gaps = 82/442 (18%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ + VD+ WL L+ +D T+ + R P +
Sbjct: 141 ILDRSLGEIVNSLHLTFTVDVGWLYL---------QYLLAGQRTDMTILYKYRVCPCHEE 191
Query: 243 LHKPPLPI------SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD--- 293
L K I F +HH+ M+L Y G+R++V TA L DW N++QGLW+
Sbjct: 192 LSKNITIIHVDGQHEFSSHHANIMILQYSNGIRVVVSTAALYSDDWKNRTQGLWISPHLP 251
Query: 294 -FPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P + + E GF+ DL YLS + P + + A + +FS V
Sbjct: 252 YLPESAKPSDGESPTGFKKDLERYLSKYEQPALTQWIRA----------VQMADFSDVNV 301
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG--SLDEKWMAE 409
L+ASVPG H G WG+ KL VL ++ P+V Q S +G L E W+ +
Sbjct: 302 FLVASVPGIHKGYEDDFWGYRKLAHVLSCYVTLPRNEQWPIVAQSSGVGCFGLFENWLED 361
Query: 410 LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKKY 468
+ MS S+D + ++P++ + + S + + +N + +L+ Y
Sbjct: 362 IIWCMSKETSKDSNNYPHFQ--FIYPSIANYKQSFDFRVLSTPLSYNTENHFKQQWLESY 419
Query: 469 WAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRS 516
+WKA TGR RAMP+IK++ R + L+KAAWG+ ++ + I +
Sbjct: 420 LYQWKAKRTGRDRAMPNIKSYTRISPDLKKIPWFLLTSANLSKAAWGSNKQYD--YSIGN 477
Query: 517 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 576
YE GVL +P + +G+T T G D G
Sbjct: 478 YEAGVLFIP------------------KFITGTT-----------TFPIGGEEDTG---- 504
Query: 577 VVYLPVPYELPPQRYSSEDVPW 598
V P+PY+LP +Y +D P+
Sbjct: 505 VPMFPIPYDLPLSQYEFDDSPF 526
>gi|119481099|ref|XP_001260578.1| tyrosyl-DNA phosphodiesterase, putative [Neosartorya fischeri NRRL
181]
gi|119408732|gb|EAW18681.1| tyrosyl-DNA phosphodiesterase, putative [Neosartorya fischeri NRRL
181]
Length = 564
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 135/529 (25%), Positives = 224/529 (42%), Gaps = 112/529 (21%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS +L ++ L A + N V ++D++ +I N++ D+D+L+ + +
Sbjct: 72 IPSPIQLTHIRDLSAASGNNVDTVRLKDILGDPMIRECWQFNFLFDVDFLMSQFDEDVRR 131
Query: 216 IPHVLVIHGESDGTLEHMKRNKPA-----NWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
+ V V+HG + R + A N +P FGTHHSK M+L+ + +
Sbjct: 132 LVQVKVVHGSWKKDAPNRIRIEEACPRYPNVEAITAYMPEPFGTHHSKMMILLRHDDLAQ 191
Query: 270 IIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG-------FENDLIDYLSTLKW 321
+++HTAN+I DW N Q +W L+ E G F+ DL+ YL+
Sbjct: 192 VVIHTANMIPGDWANMCQAVWRSPLLALRKSEREPEGPGAIGSGARFKRDLLAYLN---- 247
Query: 322 PEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKL 374
+G K P ++F+FS+ LIASVP SSL WG L
Sbjct: 248 --------EYGVKKTGPLVRQLERFDFSAVRAALIASVPSKQRLSSLDSRKKTLWGWPAL 299
Query: 375 RTVLQECTFEKGFK----KSPLVYQFSSLGSLDE--KWMAEL-SSSMSSGFSEDKTPLGI 427
+ ++ K +S +V Q SS+ SL + KW+ ++ +S+S S + P
Sbjct: 300 KEATRQIPLTPKGKSQTVQSHIVTQISSIASLGQTDKWLKDVFFASLSPTSSMESIP--- 356
Query: 428 GEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------ 475
+P I++PT +++R SL GY +G +I S + +++ Y W
Sbjct: 357 -QPKFSIIFPTPDEIRRSLNGYGSGGSIHMKLQSATQQKQLQYMRPYLRHWAGDSDSSSS 415
Query: 476 ------HTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQLMIR 515
GR RA PHIKT+ R++ + L+ AWGA N ++ I
Sbjct: 416 TSTPQREAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNNAGEVRIS 475
Query: 516 SYELGVLILP--------SAKRH--GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTW 565
S+E+GV++ P + +RH C +P ++
Sbjct: 476 SWEIGVMVWPQLFVREDNTTERHQQAVMVPCFKRDIPLQL-------------------- 515
Query: 566 HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ +V L +PY+LP Y + +VPW +T+ D GQ W
Sbjct: 516 --PDETPGCDVLVGLRMPYDLPLTPYKANEVPWCATAAHTEPDWLGQTW 562
>gi|146098236|ref|XP_001468366.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania infantum JPCM5]
gi|134072733|emb|CAM71450.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania infantum JPCM5]
Length = 682
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 145/617 (23%), Positives = 236/617 (38%), Gaps = 194/617 (31%)
Query: 177 NTSCVSIRDVIQGDIIVA-------ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
+ S + +RD+ + D+ +LS+Y+ D+ WLL P L+ + LV+ GT
Sbjct: 35 SCSLLRLRDLFRCDVADPGECWQHILLSSYVTDLPWLLATVPELSAVTGKLVLLSGEKGT 94
Query: 230 ---------------------------LEHMKRNKPANWILH-----------KPPLPIS 251
+ ++ LH +PPLP++
Sbjct: 95 ATLRRTTGDSSCPYTAASPLMDRVNPFMAALREQARPTSALHTTLSRERLAVLEPPLPVA 154
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEND 311
FGTHH+K L + RG+R+ + TANL+ DW KSQG+++QDFP K S + +
Sbjct: 155 FGTHHTKMALCVNGRGLRVSIFTANLVEQDWCRKSQGIYVQDFPWKTATVRSNDDSADAT 214
Query: 312 LIDYLST------------LKWPEFSANL-----------------PAHGNFKINP---- 338
+++ ++ K EF A+L P P
Sbjct: 215 MVETATSSTSNSNNGSNTFTKGAEFVAHLRHYLMQCGVSLAAACASPTDAASAAGPLGIF 274
Query: 339 --SFFKKFNFSSAAVRLIASVPGYHTGSSL---KKWGHMKLRTVLQECTFEKGFKKSP-- 391
F +FS+AAV L++SVPG + + + G +L VL+ +
Sbjct: 275 ETDFLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRVGLCRLAEVLRRSALTMATSPASVD 334
Query: 392 LVYQFSSLGSLDEKWMAELSSSM----SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
L +Q+SS GSL+ ++ L ++M ++ P G+ + +V+PT E+VR S EG+
Sbjct: 335 LSWQYSSQGSLNPAFLNSLQAAMCGESAAVIESGDAPRGVRDVQVVYPTEEEVRNSWEGW 394
Query: 448 AAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----------------------------- 478
G ++P + +F+ +W +S G
Sbjct: 395 RGGMSLPL-RVQCCHEFVNARLHRWGSSEEGHTAKRAFPRPPKVAAAHASREDAVDVDGV 453
Query: 479 -------------------RSRAMPHIKTFAR------------YNGQKLAKAAWGAL-- 505
R A+PHIK++A L++AAWG+L
Sbjct: 454 DIDGGEETTPSLAGSCAADRQFALPHIKSYAAVAPDRSCVRWFLLTSANLSQAAWGSLSR 513
Query: 506 ---QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ--IQKTKL 560
Q+ + Q ++RSYELGVL + + S S + S I+ + S+ + +T L
Sbjct: 514 KVNQRGSRQQLVRSYELGVLYDSHSAIYPSASSWFSVVAESRIELPNARNSRAMLYETPL 573
Query: 561 VTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS-------------------------E 594
G ++ V L +PY L P Y+S
Sbjct: 574 -----------GVDTQDVCLYIPYNLLCPTPYASTAALRAHRHAPDEGEQAVEEAALDCS 622
Query: 595 DVPWSWDKRYTKKDVYG 611
DVPW D + KD YG
Sbjct: 623 DVPWVLDMPHRGKDAYG 639
>gi|342883838|gb|EGU84260.1| hypothetical protein FOXB_05217 [Fusarium oxysporum Fo5176]
Length = 1127
Score = 132 bits (333), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 123/456 (26%), Positives = 205/456 (44%), Gaps = 66/456 (14%)
Query: 124 NGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSI 183
N + ++M + D Q E+ + S + S ++L ++ LP N V++
Sbjct: 2 NRPVKRQRMEEPDAQTPESLQRSISPPKKRDRKLTVVKSPWQLTWIRDLPEGDNQDAVTL 61
Query: 184 RDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG-----ESDGTLEHMKRN 236
+D++ +I N++ DI +L+ + P + V ++HG +++ +
Sbjct: 62 KDLLSDPLISECWEFNFLHDIPFLMNSFDPDTRHLVKVHLVHGFWKREDANRIALENASS 121
Query: 237 KPANWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLW----- 290
+ N H P+P FGTHHSK M+L G ++I+HTAN+I DW N S G+W
Sbjct: 122 EFENIKTHIAPMPEMFGTHHSKMMILFRHDGTAQVIIHTANMIPKDWTNMSNGVWKSPLL 181
Query: 291 -----MQDFPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
Q+F + +++ F+ DL++YL + K +
Sbjct: 182 PKLSGAQNFQASPEDHSVGSGQRFKIDLLNYLKAYDRRKIIC--------KPLTDKLTHY 233
Query: 345 NFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL 402
+FSS L+ASVPG H + + WG L+ LQ + S +V Q SS+ +L
Sbjct: 234 DFSSIKAALVASVPGKHDARDMSETSWGWAALKRCLQHVPCQD-HGDSDIVVQVSSIATL 292
Query: 403 DEK--WMAELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNA----IP 454
K W L ++ + K P G+G P +V+PT +++R SL+GYA+G + I
Sbjct: 293 GAKDDW---LQKTLFEPLTRSKNP-GLGRPRFKVVFPTADEIRRSLDGYASGGSIHTKIQ 348
Query: 455 SPQKNVDKDFLKKYWAKWK--------------ASHTGRSRAMPHIKTFARYN------- 493
S Q+ ++L+ + W +GR RA PHIKT+ R N
Sbjct: 349 SSQQAKQLEYLRPIFHHWANDSPRGAKLPEDTPLRDSGRKRAAPHIKTYIRSNKSSIDWG 408
Query: 494 ---GQKLAKAAWGALQKNNSQLMIRSYELGVLILPS 526
++K AWG + ++ I S+E+GVLI S
Sbjct: 409 LLTSANISKQAWGEAARPTGEMRIASWEIGVLIWAS 444
>gi|154344310|ref|XP_001568099.1| putative tyrosyl-DNA phosphodiesterase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065433|emb|CAM40865.1| putative tyrosyl-DNA phosphodiesterase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 680
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 131/497 (26%), Positives = 193/497 (38%), Gaps = 151/497 (30%)
Query: 177 NTSCVSIRDVIQGDII-------VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
+ S + +RD+ D+ +LS+YM D WLL P L+ + LV+ GT
Sbjct: 37 SCSLLRLRDLFCCDVADTDECWQYILLSSYMTDFRWLLRTVPELSAVTGKLVVLSGEKGT 96
Query: 230 L-------------------------------EHMKRNKPANWILHK-------PPLPIS 251
EH + +L + PPLPI+
Sbjct: 97 ATLRCTTGEPLHSYTATSPLLDRVNPFVASLREHAQTTSAVGTLLSRERLAVLEPPLPIA 156
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK-------------- 297
FGTHHSK L + RG+R+ + TANL+ DW KSQG+++QDFP K
Sbjct: 157 FGTHHSKMALCVNSRGLRVSIFTANLLEQDWCWKSQGIYVQDFPWKTSAKSSKHDSLDAT 216
Query: 298 --------DQNNLSEEC----GFENDLIDYLSTL------KWPEFSANLPAHGNFKI-NP 338
+N S C F L YL + A G I
Sbjct: 217 AGTATTGYSSSNFSGVCPKGIDFAEHLRHYLIQCGVSLAAAFTSLKAAASLAGPLGIFET 276
Query: 339 SFFKKFNFSSAAVRLIASVPGYHTGSSLK---KWGHMKLRTVLQE--CTFEKGFKKSPLV 393
F +FS+AAV L++SVPG H + + G +L VL+ T L+
Sbjct: 277 DFLSHIDFSAAAVWLVSSVPGTHAHGEVSPGYRVGLCRLAEVLRRSPLTMATTPASVDLI 336
Query: 394 YQFSSLGSLDEKWMAELSSSMS----SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 449
+Q+SS GSL+ ++ L ++M + P G+ + L+V+PT E+VR S EG+
Sbjct: 337 WQYSSQGSLNSTFLNTLQAAMCGEAVTVIESGNAPRGVRDVLVVYPTEEEVRNSWEGWRG 396
Query: 450 GNAIP-------------------------------SPQKNV---------------DKD 463
G ++P P K V D D
Sbjct: 397 GGSLPLRVQCCHEFVNNRLHRWGSRAEDHAVEHGLTQPAKGVAAHASREDAVDVDQADSD 456
Query: 464 FLKKYWAKWKASHTG-RSRAMPHIKTFAR------------YNGQKLAKAAWGAL----- 505
++ A AS R A+PHIK++A L++AAWG++
Sbjct: 457 RDEEATASLVASCAAYRQFALPHIKSYAAVAPDRTCVRWFLLTSANLSQAAWGSVSGKVK 516
Query: 506 QKNNSQLMIRSYELGVL 522
++ Q ++RSYELGVL
Sbjct: 517 KRGLCQQLVRSYELGVL 533
>gi|213409511|ref|XP_002175526.1| tyrosyl-DNA phosphodiesterase [Schizosaccharomyces japonicus
yFS275]
gi|212003573|gb|EEB09233.1| tyrosyl-DNA phosphodiesterase [Schizosaccharomyces japonicus
yFS275]
Length = 518
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 140/506 (27%), Positives = 213/506 (42%), Gaps = 92/506 (18%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAK 215
+K S L ++ LP N C+S+R +I + N+ +D+ +++ P + K
Sbjct: 52 EKQDSPIFLNSIKSLPDEENVHCLSLRQLIGSKNLRETWQFNFCIDLGFIVENMHPSVLK 111
Query: 216 IPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPR-GVR 269
V V HG S + L K P + LH +P +GTHHSK M+ + +
Sbjct: 112 QVKVHVTHGYSYDSPRMDVLRQQKTRLPMDIELHSVYVP-QWGTHHSKIMVNFFADDSCQ 170
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC------GFENDLIDYLSTLKWPE 323
+++HTAN+I +DW SQ ++ PL + + E F+ D YLS K
Sbjct: 171 VVIHTANMIQMDWEGMSQAIYKT--PLLWRKTVEREGPPSVGDRFQKDFCSYLSHYK--- 225
Query: 324 FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--EC 381
A L ++++F+S I+SVPG G L WGH +L L E
Sbjct: 226 HCAKLICK---------LQRYDFTSVKAIFISSVPGKFGGDKLDSWGHNRLEKELAAIES 276
Query: 382 TFE-----KGFKKSPL-VYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEPLIV 433
E F+ S + V Q SS+GS + ++ E + ++ + K ++
Sbjct: 277 MAEFMGPRNKFQDSDICVSQCSSMGSFGARQAFLKEHTKALHCDLTHWK---------LI 327
Query: 434 WPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 487
+PTV DVR SL G+ +G++I V++ KWKA +GR R PH+K
Sbjct: 328 FPTVTDVRDSLLGWHSGSSIHFNVTARGAPAQVEELVRHNQLCKWKAMKSGRQRIAPHVK 387
Query: 488 TFARYNGQ------------KLAKAAWGALQ------KNNSQLMIRSYELGVLILPSAKR 529
T+ R N + L+K AWG L+ K L IRSYE GVL+ P
Sbjct: 388 TYMRLNDEGTLIRWVLLTSANLSKPAWGTLEGVAANSKTEHGLRIRSYEAGVLLHPGLFA 447
Query: 530 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 589
+C V KS S ++ D S V + +P++ PPQ
Sbjct: 448 DDSNSACAFFPV---YKSNSLKSPNF--------------DFPLS---VAIRMPWDFPPQ 487
Query: 590 RYSSEDVPWSWDKRYTKKDVYGQVWP 615
Y +D WS + D G WP
Sbjct: 488 PYGDKDDIWSPSIPRNETDWLGSKWP 513
>gi|307105869|gb|EFN54116.1| hypothetical protein CHLNCDRAFT_13268, partial [Chlorella
variabilis]
Length = 150
Score = 132 bits (332), Expect = 6e-28, Method: Composition-based stats.
Identities = 73/179 (40%), Positives = 95/179 (53%), Gaps = 50/179 (27%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR 491
+VW TV +V+ S+EG+ AG +IP P KNVD+ FL+ Y+ +W GR RAMPHIK++ R
Sbjct: 10 LVWTTVAEVQNSIEGWMAGRSIPGPAKNVDRPFLQAYYRRWGGEACGRQRAMPHIKSYLR 69
Query: 492 YNG----------QKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIV 541
Y G L+KAAWG LQK SQLM+RSYELGVL++PS +
Sbjct: 70 YRGDDVAWLYVGSHNLSKAAWGQLQKQGSQLMVRSYELGVLLVPSLE------------- 116
Query: 542 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV--VYLPVPYELPPQRYSSEDVPW 598
G+ A A + V LP+PY LPPQRY++ D PW
Sbjct: 117 -------------------------GAYQAAARGQELRVPLPIPYTLPPQRYAAGDQPW 150
>gi|67539466|ref|XP_663507.1| hypothetical protein AN5903.2 [Aspergillus nidulans FGSC A4]
gi|40738576|gb|EAA57766.1| hypothetical protein AN5903.2 [Aspergillus nidulans FGSC A4]
gi|259479929|tpe|CBF70601.1| TPA: tyrosyl-DNA phosphodiesterase, putative (AFU_orthologue;
AFUA_2G11070) [Aspergillus nidulans FGSC A4]
Length = 586
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 131/508 (25%), Positives = 218/508 (42%), Gaps = 102/508 (20%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHVLVIHGESDGTLEHMK 234
N V +RD++ +I NY D+D+L+ + + V V+HG E+
Sbjct: 95 NDDTVKLRDILGDPLIRECWQFNYCFDVDFLMDQFDEDVRNLVRVKVVHGSWKKDSENRV 154
Query: 235 RNKPANWILHKPP--------LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNK 285
R + A + P +P FGTHHSK M+L+ + ++++HTAN++ DW +
Sbjct: 155 RIEKA---CQRYPNVEPIVAYMPEPFGTHHSKMMILLRHDDFAQVVIHTANMLAGDWGDM 211
Query: 286 SQGLWMQDF-PL----KDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINP 338
Q +W PL +D+N+ + G F+ DL+ YL A+G K P
Sbjct: 212 CQAIWRSPLLPLTDGHEDKNSTAWGTGARFKRDLLAYLK------------AYGVKKTGP 259
Query: 339 SF--FKKFNFSSAAVRLIASVPGYHT-------GSSLKKWGHMKLRTVLQECTFEK---- 385
K++FS+ LIASVP G+S KWG L+ L+ +
Sbjct: 260 LVEQLGKYDFSAVRAALIASVPSKQKVDASSIDGNSKTKWGWPALKEALRNVPLRENVGA 319
Query: 386 -GFKKSP-LVYQFSSLGSLDE--KWMAELS-SSMSSGFSEDKTPLGIGEPLIVWPTVEDV 440
G P +V Q SS+ +L + KW+ ++ +++++ S KT +++PT E++
Sbjct: 320 DGTATVPHIVTQISSIATLGQTDKWLKDVFFNALAASSSSTKTRPRYS---VIFPTAEEI 376
Query: 441 RCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW----------KASHTGRSRAMPHI 486
R SL+GY G +I S + +L+ Y W + GR RA PHI
Sbjct: 377 RRSLKGYGYGGSIHMKLQSAAQKKQLQYLRPYLCHWAGDVSGQAPKRLQDAGRRRAAPHI 436
Query: 487 KTFARYNGQKLAKAAWGALQKNN-------------SQLMIRSYELGVLILPS------- 526
KT+ R+ Q + W + N ++ + S+E+GVL+ P
Sbjct: 437 KTYIRFADQHMRSIDWALVTSANLSTQAWGAAANAAGEVRVCSWEIGVLVWPELLTTEPQ 496
Query: 527 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYEL 586
+R S + +VP K +S++ A + ++ +PY+L
Sbjct: 497 GQRKHQQQSRSVAMVPCFKKDKPDPSSKVGN--------------AAPAALIGFRMPYDL 542
Query: 587 PPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
P YS++D PW + + D GQ W
Sbjct: 543 PLTPYSTQDEPWCATMSHIEPDWLGQTW 570
>gi|328721089|ref|XP_003247207.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like isoform 2
[Acyrthosiphon pisum]
Length = 678
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 123/455 (27%), Positives = 209/455 (45%), Gaps = 83/455 (18%)
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVL-AKIPHVLVIHGESDGTLEHMKRNK 237
S + D GD+ ++ N+MV++ WL + + + +++ D ++ + + K
Sbjct: 277 SFAELLDKSLGDLSESLHLNFMVELGWLFAQYFITDQRGKKMTLLYERCDEDIDELHKKK 336
Query: 238 PANWILHKPPL-PISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DF 294
+ HK + +FG HSK + Y G +R++V +ANL DW +QG+W+ F
Sbjct: 337 KLLNVRHKKIINKNAFGHQHSKVSMFAYADGSLRVVVMSANLCEDDWTKYAQGIWVSPKF 396
Query: 295 PLKDQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
PLK++++ S+ + F+ D++ YL++ + P + +K +FS A
Sbjct: 397 PLKEEDDKSDGNSQTDFKIDILRYLNSFREPSLVPWIQK----------IEKVDFSQA-- 444
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKW 406
+VPG HT WGH+ L+ +L++ C + P++ Q SSLGSL DE+W
Sbjct: 445 ----NVPGKHTEPL---WGHLYLKNILKKHACLPFCVPSEWPIIAQCSSLGSLGTTDEEW 497
Query: 407 M-AELSSSMSSGFSEDKTPLGIGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK 462
+ +E S+S+ D T +P+ +++P+V++V S +G G +P + +K
Sbjct: 498 LKSEFVESLSASTYCDDTDTD-NDPIPFHLIYPSVKNVLNSWDGALGGICLPYNKILHEK 556
Query: 463 DF-LKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNN 509
LKKY W+ R++AMPHIKT+ R + L+KAAWG K++
Sbjct: 557 QLWLKKYMCLWQCHSRKRTKAMPHIKTYCRISPCLTEMSWFLLGSANLSKAAWGRKLKSD 616
Query: 510 SQL-MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 568
Q I ++E GVL LP F S+ P
Sbjct: 617 EQSNFIMAHEAGVLFLPQ-------FLIGSDTFP-------------------------- 643
Query: 569 SDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 603
D ++ Y +P++LP YS D PW+ R
Sbjct: 644 IDETEPNKFPYFSLPFDLPLAGYSDTDQPWTISTR 678
>gi|67484562|ref|XP_657501.1| tyrosyl-DNA phosphodiesterase [Entamoeba histolytica HM-1:IMSS]
gi|56474754|gb|EAL52111.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449702140|gb|EMD42834.1| tyrosylDNA phosphodiesterase, putative [Entamoeba histolytica KU27]
Length = 402
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/404 (27%), Positives = 191/404 (47%), Gaps = 51/404 (12%)
Query: 164 FRLLRVQGLPAWA-NTSCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +++ P+ +S+ D+ G+I L+ ++ D+ WL P+L KIP V
Sbjct: 6 FHLNKLELTPSLMKEKDTISLHDIFNTPGEIYSVTLTTFVFDLQWLFDELPILTKIP-VQ 64
Query: 221 VIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 279
IH +GTL + + + +P+ G HH K M+++Y G+R ++ TANLI
Sbjct: 65 FIH---NGTLNYFDQLLIQEYKDFETFSVPLKKGCHHVKIMIILYEGGLRFVLSTANLIP 121
Query: 280 VDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
+D+N KSQG++++DF + + + E G +L+TL+ S N + S
Sbjct: 122 LDYNLKSQGIYIKDFKPSESSTILNEKG-----THFLTTLQSYFTSVN--------VTIS 168
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
+ F++S+ L+ S+PG H G+ L K+G ++ +L + + Q SSL
Sbjct: 169 YLSDFDYSTIDGWLLLSIPGIHKGNDLNKYGMKQVYDILNNKLHVQFNNHCTIAAQASSL 228
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN 459
G ++ ELS +++ E K I+WPT + +R S GY G+ + N
Sbjct: 229 GLFTNQYRRELSLCLTNQ-PESKFQ-------IIWPTEDFIRTSETGY-HGSCSFFLRSN 279
Query: 460 VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY----------NGQKLAKAAWGALQKNN 509
K + + Y+ K+ R PHIKT+ Y ++ AAWG + N
Sbjct: 280 FVKTW-ENYFYKFLPPFP-RHLIQPHIKTYVIYEEDIPKYGILTSSNISGAAWG--KPTN 335
Query: 510 SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 553
S L I +YE+G+L + + F+ T +P +IK + +S
Sbjct: 336 SSLEINNYEMGMLFIDN-------FTLTRFPLPYDIKQSTKYSS 372
>gi|255950552|ref|XP_002566043.1| Pc22g21470 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211593060|emb|CAP99435.1| Pc22g21470 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 520
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 130/519 (25%), Positives = 210/519 (40%), Gaps = 128/519 (24%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLL------PACPVLA 214
S +L ++ LP N + +RD++ +I NY+ D+D+L+ AC +
Sbjct: 62 SPIKLTHIRDLPEGNNVDTIRLRDILGDPMIRECWQFNYLFDVDFLMSQFDEDEAC---S 118
Query: 215 KIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
+ P+V I +P FGTHHSK M+L+ + ++I+H
Sbjct: 119 RYPNVEPIVAY----------------------MPEPFGTHHSKMMILLRHDDLAQVIIH 156
Query: 274 TANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG--------FENDLIDYLSTLKWPEF 324
TAN+IH+DW N +Q W PL+ N + F+ DL+ YL
Sbjct: 157 TANMIHMDWTNMTQAAWCSPLLPLQKANTAGSQADNKIGSGARFKRDLLAYLK------- 209
Query: 325 SANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPG-YHTGSSLKK----WGHMKLRTV 377
A+G K P ++FSS LIASVP H S + WG L+ +
Sbjct: 210 -----AYGPKKTGPLVQQLDNYDFSSIRAALIASVPSKKHVSDSSSEEDTLWGWPALKDL 264
Query: 378 LQECTFEKG--FKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIGEPL-- 431
+ + ++ KK +V Q SS+ +L + KW+ E+ F + TP +P
Sbjct: 265 MSQIPIQQKSPSKKPHVVIQISSVATLGQTNKWLKEV-------FFKSLTP----QPTTY 313
Query: 432 -IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASHTGRSRAM--- 483
I++PT +++R SL GY +G++I S + +++ + +W + +
Sbjct: 314 SIIFPTPDEIRRSLNGYNSGSSIHMKTQSAAQQKQLQYMRPHLCQWAGDSLPPGQCIDLS 373
Query: 484 ---------------PHIKTFARY-------------NGQKLAKAAWGALQKNNSQLMIR 515
PHIKT+ R+ + L+ AWGA + ++ I
Sbjct: 374 EENPPRREAGRARAAPHIKTYIRFADSDMKTIDWAMVSSANLSTQAWGAATNGSGEVRIC 433
Query: 516 SYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 575
S+E+GV++ P R G G G SDA +S
Sbjct: 434 SWEIGVVVWPDLFRDGA--------------EGKAPVPDALMVPCFKRDRPGVSDADTAS 479
Query: 576 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
VV +PY+LP Y + D PW + D G+ W
Sbjct: 480 VVVGFRMPYDLPLTPYGAADEPWCATASHALPDWRGESW 518
>gi|315052274|ref|XP_003175511.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma gypseum CBS 118893]
gi|311340826|gb|EFR00029.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma gypseum CBS 118893]
Length = 591
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 141/537 (26%), Positives = 225/537 (41%), Gaps = 102/537 (18%)
Query: 160 LPSTFRLLRVQGL--PAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK- 215
+PS +L ++ + N C+ +RD++ +I NY+ D+D+++ K
Sbjct: 71 IPSPIQLTHIRDINDSTGYNKDCIKLRDILGDPMIKECWQFNYLFDVDYIMSQFDRDVKD 130
Query: 216 IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ + +IHG E+ + + KR A ++ P P FGTHHSK M+LI +
Sbjct: 131 LIQLKIIHGSWKREAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILIRHDNL 188
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDYLSTLK 320
+II+HTAN+I DW N +Q +W Q ++ + G F+ DL+ YL
Sbjct: 189 AQIIIHTANMIPRDWGNMTQAVWRSPLLPFSQPHVGDTHGEFGSGARFKRDLLAYLD--- 245
Query: 321 WPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMK 373
A+ N I ++++F + LIASVP + WG
Sbjct: 246 ---------AYNNKTIGLLIHQLQRYDFGAVKAVLIASVPSRLPVKAFDSNRKTLWGWPA 296
Query: 374 LRTVLQECTFEKGFK---KSPLVYQFSSLGSLDE--KWMAEL---SSSMSSGFSEDKTPL 425
LR ++ + K ++ Q SS+ +L + KW+ E S S F++ +
Sbjct: 297 LRDAIRSIPIDHSSSQTLKPHIIVQVSSIATLGQTDKWLKETFFGSLCPQSRFNQTISAC 356
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKAS---- 475
I++PT +++R SL+GY +G +I S QK + +L+ Y W
Sbjct: 357 HANFS-IIFPTPDEIRRSLDGYGSGGSIHMKIQSASQQKQL--AYLRHYLCHWAGDAEGQ 413
Query: 476 -----------------HTGRSRAMPHIKTFARYN-------------GQKLAKAAWGAL 505
GRSRA PHIKT+ R++ L+ AWGA
Sbjct: 414 RDPGPATESVKGLAYVREAGRSRAAPHIKTYIRFSDSGMSSIDWAMVTSANLSTQAWGAG 473
Query: 506 QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK--------SGSTETSQIQK 557
++ I S+E+GVLI P R C + + +K + S E Q +
Sbjct: 474 ANAQGEVRICSWEIGVLIWPELFRENNIEKCNDSSPINHVKMIPCFKRNTPSKEPLQPPE 533
Query: 558 TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ LT H DA V + +PY LP Y+ DVPW + + D GQ W
Sbjct: 534 SDSTKLTSH--PDATNMIRVGFR-MPYNLPLVPYTPRDVPWCATAAHREPDWMGQTW 587
>gi|167389207|ref|XP_001738862.1| tyrosyl-DNA phosphodiesterase [Entamoeba dispar SAW760]
gi|165897690|gb|EDR24772.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba dispar SAW760]
Length = 721
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 181/374 (48%), Gaps = 42/374 (11%)
Query: 164 FRLLRVQGLPAWA-NTSCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +++ P+ +S+ D+ G+I +L+ ++ D+ WL P+L ++P V
Sbjct: 6 FHLNKLELTPSLMKEKDTISLHDLFNTPGEIYSVVLTTFVFDLQWLFNELPILTRVP-VQ 64
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
IH + + + + ++ P+P+ G HH K M+++Y G+R ++ TANLI +
Sbjct: 65 FIHNGNLSCFDQLLIQQYKDF--QTFPIPLKKGCHHVKIMIMLYEGGLRFVLSTANLIPI 122
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF 340
D+N KSQG++++DF + + + E G +L+TL+ N A N + S+
Sbjct: 123 DYNLKSQGIYVKDFKPSESSTVLNEKG-----THFLTTLQ------NYLASVN--VTVSY 169
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
F++S+ L+ S+PG H G+ L K+G ++ +L + + Q SSLG
Sbjct: 170 LSDFDYSTIDGWLLLSIPGIHKGNDLNKYGMKQVHDILNMKLHVQFNNHCTIAAQASSLG 229
Query: 401 SLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV 460
++ ELS +++ E K I+WPT + +R S GY G+ + N
Sbjct: 230 LFTSQYRRELSLCLTNQ-PESKFQ-------IIWPTEDFIRTSETGY-HGSCSFFLRSNF 280
Query: 461 DKDFLKKYWAKWKASHTGRSRAMPHIKTFARY----------NGQKLAKAAWGALQKNNS 510
K + + Y+ K+ R PHIKT+ Y ++ AAWG + NS
Sbjct: 281 VKTW-ENYFYKFLPPFP-RHLIQPHIKTYVIYEEDIPKYGILTSSNISGAAWG--KPTNS 336
Query: 511 QLMIRSYELGVLIL 524
L I +YE+G+L +
Sbjct: 337 TLEINNYEIGMLFI 350
>gi|407035177|gb|EKE37579.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba nuttalli P19]
Length = 402
Score = 129 bits (325), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 111/404 (27%), Positives = 191/404 (47%), Gaps = 51/404 (12%)
Query: 164 FRLLRVQGLPAWA-NTSCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +++ P+ VS+ D+ G+I L+ ++ D+ WL P+L +IP V
Sbjct: 6 FHLNKLELTPSLMKEKDTVSLHDIFNTPGEIYSVTLTTFVFDLQWLFDELPILTRIP-VQ 64
Query: 221 VIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 279
+H +GTL + + + +P+ G HH K M+++Y G+R ++ TANLI
Sbjct: 65 FVH---NGTLNYFDQLLIQEYKDFETFSVPLKKGCHHVKIMIILYEGGLRFVLSTANLIP 121
Query: 280 VDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
+D+N KSQG++++DF + + + E G +L+TL+ S N + S
Sbjct: 122 LDYNLKSQGIYIKDFKPSESSTVLNEKG-----AHFLTTLQSYFTSVN--------VTIS 168
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
+ F++S+ L+ S+PG H G+ L K+G ++ +L + + Q SSL
Sbjct: 169 YLSDFDYSTIDGWLLLSIPGTHKGNDLNKYGMKQVYDILNNKLHVQFTNHCTIAAQASSL 228
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN 459
G ++ ELS +++ E K I+WPT + +R S GY G+ + N
Sbjct: 229 GLFTNQYRRELSLCLTNQ-PESKFQ-------IIWPTEDFIRTSETGY-HGSCSFFLRSN 279
Query: 460 VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY----------NGQKLAKAAWGALQKNN 509
K + + Y+ K+ R PHIKT+ Y ++ AAWG + N
Sbjct: 280 FVKTW-ENYFYKFLPPFP-RHLIQPHIKTYVIYEEDIPKYGILTSSNISGAAWG--KPTN 335
Query: 510 SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 553
S L I +YE+G+L + + F+ T +P +IK + +S
Sbjct: 336 STLEINNYEMGMLFIDN-------FTLTRFPLPYDIKQSTKYSS 372
>gi|358393671|gb|EHK43072.1| hypothetical protein TRIATDRAFT_225252 [Trichoderma atroviride IMI
206040]
Length = 1124
Score = 129 bits (325), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 129/496 (26%), Positives = 217/496 (43%), Gaps = 82/496 (16%)
Query: 126 ELSSKKMRQQDEQDNENGKNSEEALCN-FHVSRDKL------PSTFRLLRVQGLPAWANT 178
+ + K+ R + D NG + E+L R K S ++L R++ LP N
Sbjct: 2 DFARKRSRDAADGDEGNGDEALESLSRPISPPRKKFRQINIQKSPWQLTRIRDLPDELNK 61
Query: 179 SCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG-----ESDGTLE 231
VS++D++ +I N++ DI +++ + ++ + V+HG + + L
Sbjct: 62 DTVSLQDLLGDPLIRECWQFNFLHDIPFMVNTFDETVRRLVQLHVVHGFWKKSDLNRILL 121
Query: 232 HMKRNKPANWILHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLW 290
+ N LH P+P FGTHHSK M++ +II+HTAN+I DW N + +W
Sbjct: 122 SDAAARYPNVHLHCAPMPEMFGTHHSKMMVMFRSDNTAQIIIHTANMIPRDWTNMTNAVW 181
Query: 291 MQ-DFPLKDQNNLSEECG----------FENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
PL ++ + G F+ DL+ YL +K+ + K
Sbjct: 182 QSPKLPLLPVPDIISQHGQTLPLGSGLRFKADLLSYL--MKYDSYKVTC------KPLAD 233
Query: 340 FFKKFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFS 397
F+FSS IASVPG H +S WG L+ LQ G S +V Q S
Sbjct: 234 RLGYFDFSSVRAAFIASVPGKHDIRDASQPAWGWAGLQRCLQGVPVGPG--GSAIVVQIS 291
Query: 398 SLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP 454
S+ +L ++ W+ L +S+++ + + +V+PT +++R SL+GYA+GN+I
Sbjct: 292 SIATLGANDDWLQRTLFNSLATSLTPNANKPSFK---VVFPTADEIRNSLDGYASGNSIH 348
Query: 455 SPQK-------------------NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN-- 493
+ + N KD + +GR+RA PHIKT+ R+N
Sbjct: 349 TKIQSAQHISQLRYLHPILHHWANDSKDGAALFAGASIYGDSGRNRAAPHIKTYIRFNCN 408
Query: 494 ---------GQKLAKAAWG-ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 543
++K AWG L+ + I S+E+GVL+ P+ C ++ S
Sbjct: 409 TTIDWAMLTSANMSKQAWGETLKPTTGEFRIASWEVGVLVWPN-------LLCKDGVMLS 461
Query: 544 EIKSGSTETSQIQKTK 559
+S + S + +
Sbjct: 462 SFQSDTVNMSPFSQAQ 477
>gi|256575388|gb|ACU87659.1| tyrosyl DNA phosphodiesterase 1 [Leishmania donovani]
Length = 828
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 152/641 (23%), Positives = 241/641 (37%), Gaps = 214/641 (33%)
Query: 171 GLPAWANT--------------SC--VSIRDVIQGDIIVA-------ILSNYMVDIDWLL 207
G+P W N SC + +RD+ + D+ +LS+Y+ D+ WLL
Sbjct: 159 GVPLWVNAIDSFASVPQRHAPLSCSLLRLRDLFRCDVADPGECWQHILLSSYVTDLRWLL 218
Query: 208 PACPVLAKIPHVLVIHGESDGT---------------------------LEHMKRNKPAN 240
P L+ + LV+ GT + ++ A
Sbjct: 219 ATVPELSAVTGKLVVLSGEKGTATLRRSTGDPSSPYTAASPLMDRVNPFMAALREQARAT 278
Query: 241 WILH-----------KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL 289
LH +PPLP++FGTHH+K L + RG+R+ + TANL+ DW KSQG+
Sbjct: 279 SPLHTALSRERLAVLEPPLPVAFGTHHTKMALCVNGRGLRVSIFTANLVEQDWCWKSQGI 338
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLST------------LKWPEFSANLPAH------ 331
++QDFP K S + +++ + K EF A+L +
Sbjct: 339 YVQDFPWKTATERSNDDSAGTTMVETAARSTSDSNNGSNAFTKGAEFVAHLRQYLMQCGV 398
Query: 332 -------------------GNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGSSLKKW-- 369
G F+ + F +FS+AAV L++SVPG Y G +
Sbjct: 399 SLAAACASPADAASAAGPLGIFETD--FLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRV 456
Query: 370 GHMKLRTVLQEC--TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFSEDKT 423
G +L VL+ T L +Q+SS GSL+ ++ L ++M +
Sbjct: 457 GLCRLAEVLRRSALTMATAPASVDLSWQYSSQGSLNPAFLNSLQAAMCGESVAVIESGDA 516
Query: 424 PLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----- 478
P G+ + +V+PT ++VR S EG+ G ++P + +F+ +W +S G
Sbjct: 517 PRGVRDVQVVYPTEDEVRNSWEGWRGGGSLPL-RVQCCHEFVNARLHRWGSSEAGHTAKR 575
Query: 479 -------------------------------------------RSRAMPHIKTFAR---- 491
R A+PHIK++A
Sbjct: 576 AFPRPAKVAAAHASREDAVDVDGVDSDGGEGTPVSLAGSCAAYRQFALPHIKSYAAVAPD 635
Query: 492 --------YNGQKLAKAAWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTS 538
L++AAWG+L Q + Q ++RSYELGVL + + S S
Sbjct: 636 RSCVRWFLLTSANLSQAAWGSLSRKVNQHGSRQQLVRSYELGVLYDSHSAIYPSASSWFS 695
Query: 539 NIVPSEIKSGSTETSQ--IQKTKLVTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS-- 593
+ S+I+ + S+ + +T L G ++ V L PY L P Y+S
Sbjct: 696 VVAKSKIELPNARNSRAVLYETPL-----------GVDTQDVCLYTPYNLLCPTPYASTA 744
Query: 594 -----------------------EDVPWSWDKRYTKKDVYG 611
DVPW D + +D YG
Sbjct: 745 ALRAHRDAPDTGEQAVAGAALDCSDVPWVLDMPHRGRDAYG 785
>gi|115384578|ref|XP_001208836.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196528|gb|EAU38228.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1250
Score = 129 bits (323), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 151/583 (25%), Positives = 247/583 (42%), Gaps = 123/583 (21%)
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQ----DEQDNENGKNSEEALCNFHVSRDK-LPSTFR 165
++ K S+D TN + +R+ + ++ +S N + +PS F+
Sbjct: 708 AKRAKLSSDDSTTNSTTALASLRRSITPPSPRPSKRAASSPAKTTNAQQDTARVIPSPFQ 767
Query: 166 LLRVQGLPAWANTSCVSIR-DVIQGDIIV--AILSNYMVDIDWLLPACPV-LAKIPHVLV 221
L V+ L + + ++R I GD ++ NY+ D+D+L+ + + V V
Sbjct: 768 LTHVRDLAESSGNNADTVRLHNILGDPMIRECWQFNYLFDVDFLMKQFDEDVRSLVKVKV 827
Query: 222 IHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
+HG E+ + E R I+ +P +FGTHHSK M+L+ + ++++H
Sbjct: 828 VHGSWKREAPNRIRIDEACSRYPNVEAIVAY--MPEAFGTHHSKMMILLRHDDLAQVVIH 885
Query: 274 TANLIHVDWNNKSQGLWMQDF-PL-KDQNNLSEECG-------FENDLIDYLSTLKWPEF 324
TAN+I DW N Q +W PL KD + SE+ F+ DL+ YL
Sbjct: 886 TANMIPGDWANMCQAVWRSPLLPLRKDIDAESEDAAKIGSGMRFKRDLLAYLDH------ 939
Query: 325 SANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPG---YHTGSSLKK--WGHMKLRTV 377
+G K P ++++F + L+ASVP +T S + WG L+ V
Sbjct: 940 ------YGPKKTGPLVDQLRRYDFDAVRAALVASVPSKQKINTADSQRTTLWGWPALKDV 993
Query: 378 LQECTFEK-GFKKSP----LVYQFSSLGSLDE--KWMAE-----LSSSMSSGFSEDKTPL 425
++ G KS +V Q SS+ SL + KW+ E LSS +S +S
Sbjct: 994 VRGIPLRAAGGSKSAVTPHIVSQISSVASLGQTDKWLKEVFFKSLSSDPTSKYS------ 1047
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAI-----PSPQKNVDKDFLKKYWAKW-------- 472
I++PT +++R SL GY +G +I +PQ+ +++ Y W
Sbjct: 1048 ------IIFPTDDEIRRSLNGYGSGGSIHMKIQSAPQQK-QLQYIRPYLCHWAGDRDDGS 1100
Query: 473 -------KASHTGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQ 511
+ GR RA PHIKT+ +++ K L+ AWGA + +
Sbjct: 1101 SAGTSMSRKRDAGRRRAAPHIKTYIQFSDTKTMDSIDWAMVTSANLSTQAWGAAPNASGE 1160
Query: 512 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 571
+ I SYE+GV++ P S+ +S Q T +
Sbjct: 1161 IRICSYEIGVVVWPQL------------FADSDAESAVMVPCFKQDTPAF-----AEREG 1203
Query: 572 GASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
S VV L +PY+LP Y+ +D PW +T+ D GQ W
Sbjct: 1204 PVPSVVVGLRMPYDLPLTSYTPKDTPWCATATHTEPDWLGQTW 1246
>gi|425771231|gb|EKV09680.1| Tyrosyl-DNA phosphodiesterase, putative [Penicillium digitatum Pd1]
gi|425776784|gb|EKV14988.1| Tyrosyl-DNA phosphodiesterase, putative [Penicillium digitatum
PHI26]
Length = 900
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 136/523 (26%), Positives = 223/523 (42%), Gaps = 91/523 (17%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHV 219
S +L ++ LP N V +RD++ +I N++ D+D+L+ + + V
Sbjct: 397 SPVQLTHIRDLPDGNNVDAVRLRDILGDPMIRECWQFNFIFDVDFLMAHFDEDVRSLVKV 456
Query: 220 LVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRII 271
V+HG E + E R I+ P P FGTHHSK M+L+ + +++
Sbjct: 457 KVVHGSWRREDSNRIRVEEACSRYPNVEPIVAYMPEP--FGTHHSKMMILLRHDDLAQVV 514
Query: 272 VHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG--------FENDLIDYLSTLKWP 322
+HTAN+IH+DW N +Q W+ PL+ ++ F+ DL+ YL
Sbjct: 515 IHTANMIHMDWTNMTQAAWLSPLLPLQKATSVESPTDAKVGSGARFKRDLLAYLK----- 569
Query: 323 EFSANLPAHGNFKINPSFFKKFNFSSAAVR--LIASVPGYHTGSSLKK-----WGHMKLR 375
A+G K P + N+ +R LIASVP S WG ++
Sbjct: 570 -------AYGPKKTGPLVQQLDNYDFCPIRAALIASVPSKKHASDSSSDEETLWGWPAVK 622
Query: 376 TVLQECTFEK--GFKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIGEPL 431
++ + ++ KK +V Q SS+ +L + KW+ ++ F + TP +P
Sbjct: 623 DLMGQVPIQQKNTSKKPHIVIQTSSVATLGQTNKWLKDV-------FFKALTPTHSPQPT 675
Query: 432 --IVWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWKAS---------- 475
I++PT +++R SL GY +G + I S + ++ Y +W
Sbjct: 676 YSIIFPTPDEIRRSLNGYNSGVSIHMKIQSAAQQKQLQYMSPYLCQWAGDSLPPGQCIDL 735
Query: 476 --------HTGRSRAMPHIKTFARY-------------NGQKLAKAAWGALQKNNSQLMI 514
GR+RA PHIKT+ R+ + L+ AWGA + ++ I
Sbjct: 736 SEDNPPKREAGRARAAPHIKTYIRFADSDMKTIDWAMVSSANLSTQAWGAATNASGEVRI 795
Query: 515 RSYELGVLILPSAKRH-GCGFSCTSNIVPSEIKS-GSTETSQIQKTKLVTLTWHGSSD-A 571
S+E+GV++ P R GC + + + SE ++ G + SD A
Sbjct: 796 CSWEIGVVVWPELFRDGGCDDAASPSASESESRAEGKPPAPDVLMVPCFKRDRPVVSDGA 855
Query: 572 GASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+S VV +PY+LP Y + D PW + D GQ W
Sbjct: 856 ETASMVVGFRMPYDLPLTPYGAGDEPWCATASHALPDWQGQSW 898
>gi|332029128|gb|EGI69139.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 550
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 119/451 (26%), Positives = 191/451 (42%), Gaps = 97/451 (21%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTL--EHMKRNKPAN 240
I D G+I+ ++ +MVD+ WL L+ +D T+ +H ++ N
Sbjct: 157 ILDRSLGEIVNSLHLTFMVDVTWLYL---------QYLLAGQRTDMTILCKHRICHEELN 207
Query: 241 WILHKPPLPI-----SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-- 293
+ I + +HH+ M+L Y G+R+IV TA L +DW N++QGLW+
Sbjct: 208 ICHENVIIEIVGQLDQYSSHHANIMILQYKNGIRVIVSTAGLYSIDWENRTQGLWISPHL 267
Query: 294 --FPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
P + + E GF+ DL YLS K P + + A + +FS
Sbjct: 268 PYLPESAKPSDGESPTGFKKDLERYLSKYKQPALTQWIRA----------VQMADFSDVN 317
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG--SLDEKW-- 406
V L+ASVPG + WG+ KL VL ++ P+V Q S +G L + W
Sbjct: 318 VFLVASVPGIYKADEADFWGYRKLAHVLSRYATLPRNEQWPIVAQSSGVGCFGLFKNWLL 377
Query: 407 ------MAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV 460
M+E++S S + + ++P++E+ + S + + +N
Sbjct: 378 KDIIWSMSEMTSKASKNHPQFQ---------FIYPSIENYKQSFDYQCLITPLTYSAENH 428
Query: 461 DK-DFLKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQK 507
K +L+ Y +WKA+ TGR RAMP+IK++ R + L+KAAWG+ K
Sbjct: 429 SKQQWLESYLYQWKATRTGRDRAMPNIKSYTRISPDLKKIPWFLLTSANLSKAAWGS-TK 487
Query: 508 NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHG 567
I +YE GVL +P K +T T
Sbjct: 488 QYKGYSIGNYEAGVLFIP---------------------------------KFITGTTTF 514
Query: 568 SSDAGASSEVVYLPVPYELPPQRYSSEDVPW 598
++ V P+PY+LP +Y S+D P+
Sbjct: 515 PVGEEKNTGVPVFPIPYDLPLTQYESDDSPF 545
>gi|358384803|gb|EHK22400.1| hypothetical protein TRIVIDRAFT_179757 [Trichoderma virens Gv29-8]
Length = 1118
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 126/446 (28%), Positives = 204/446 (45%), Gaps = 90/446 (20%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGD--IIVAILSNYMVDIDWLLPACPVLAKIPHV 219
S ++L R++ +P N V++ D++ GD I NY+ DI +++ A +
Sbjct: 42 SPWQLTRIRDVPEELNKDTVALGDIL-GDPSITECWQFNYLHDIPFVMNAFDKNVRDSVQ 100
Query: 220 L-VIHG-----------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYP-R 266
L V+HG S+ L+H N LH P+P FGTHHSK M+L +
Sbjct: 101 LHVVHGFWKRNDLNRVILSEHALQH------PNVHLHCAPMPEMFGTHHSKMMILFHSDN 154
Query: 267 GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLK----------DQNNLSEECGFENDLIDY 315
+I++HTAN+I DW N + +W P + Q F+ DL+ Y
Sbjct: 155 TAQIVIHTANMIPKDWTNMTNAVWRSPKLPWRWELDPRLQQAQQAPFGSGIRFKADLLAY 214
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMK 373
L +++ + +N F+FSS LIASVPG + +S WG
Sbjct: 215 L--MQYDSHRVTCKQLVDRLVN------FDFSSIRAALIASVPGRYNLYDTSSPAWGWTA 266
Query: 374 LRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAE-LSSSMSSGFSED-KTPLGIGE 429
L+ LQ E G +S +V Q SS+ +L K W+ + L +S+++ ++D K P +
Sbjct: 267 LKRCLQTVPVETG--ESQIVVQISSIATLGAKDDWLQKILFNSLATSRNQDTKKP----D 320
Query: 430 PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK----DFLKKYWAKWK------------ 473
+V+PT +++R SL+GYA+G +I + K+ +L W
Sbjct: 321 FKVVFPTADEIRNSLDGYASGQSIHTKIKSAQHIRQLHYLHPMLHHWANDSADGVGLLEQ 380
Query: 474 ---ASHTGRSRAMPHIKTFARYN-----------GQKLAKAAWGALQKNNSQLMIRSYEL 519
+ +GR+RA PHIKT+ R+N ++K AWG + ++ I S+E+
Sbjct: 381 PPISGDSGRNRAAPHIKTYTRFNQNNSIDWAMLTSANMSKQAWGEAPSSTGEVRIASWEV 440
Query: 520 GVLILPSAKRHGCGFSCTSNIVPSEI 545
GVL+ P G C + ++ S I
Sbjct: 441 GVLVWP-------GLLCENGVMVSSI 459
>gi|396459207|ref|XP_003834216.1| hypothetical protein LEMA_P058850.1 [Leptosphaeria maculans JN3]
gi|312210765|emb|CBX90851.1| hypothetical protein LEMA_P058850.1 [Leptosphaeria maculans JN3]
Length = 650
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 138/589 (23%), Positives = 255/589 (43%), Gaps = 121/589 (20%)
Query: 120 DGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTS 179
DG +G K Q++ D ++G++ + + NF +PS +L+R++ + A N
Sbjct: 86 DGGLDG-----KGDQEEHPDIKSGRDGDSNI-NF------IPSPIQLIRIEDMGAMQNVD 133
Query: 180 CVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTLEHM 233
+ + D++ +I + NY+ D+ +++ + + V ++HG + + +E +
Sbjct: 134 AIGLGDILGDPLIRECWNFNYLFDLGFVMQHFDSDVRHMVKVKIVHGFWRRDDERRIELL 193
Query: 234 KR-NKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW- 290
+ + N L +P FGTHHSK ++L + +II+HTAN+I+ DW+N +Q +W
Sbjct: 194 EAAERYPNIELLSAYIPDPFGTHHSKMLILFRHDDTAQIIIHTANMIYRDWSNMTQAVWS 253
Query: 291 -------MQDFPLKDQNNLSEECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
Q +P ++ ++ S G F+ DL+ YL+ + K S
Sbjct: 254 SPMLPLSTQKWPTENPDSASHPVGSGLRFKVDLLRYLAAYE-----------RRTKDLVS 302
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLK-----KWGHMKLRTVLQECTFEKGFKK-SP-- 391
++F + I SVP + K +G + LR +L + + K SP
Sbjct: 303 QLAHYDFFAIRAAFIGSVPSRQNPDASKPSEETSFGWLGLREILTQVPVARRDKSHSPPH 362
Query: 392 LVYQFSSLGSLDEK--WMAELSSSMSS----------------GFSEDKTPLGIGEPL-- 431
+V Q SS+ +L + W+ S +SS S P P
Sbjct: 363 IVTQISSIATLGAQPTWLTHFQSVLSSEPKVSNTAVSGSTKTASASPKHAPNNPPPPTFS 422
Query: 432 IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------------K 473
I++PT E++R L+GYA+G +I S Q+ ++ + W +
Sbjct: 423 IIFPTPEELRTCLDGYASGASIHWKLQSAQQQKQLAYMHPFLRHWHSPAPTSPPQDSPRR 482
Query: 474 ASHTGRSRAMPHIKTFARYNGQ-------------KLAKAAWGALQKNNSQLMIRSYELG 520
A+H R A PHIKT+ R++ Q L+K AWG + +++ ++S+E G
Sbjct: 483 AAH--RGPAAPHIKTYIRFSNQDHTTIDWALLTSANLSKQAWGDVVGKKNEMRVQSWEAG 540
Query: 521 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSS----------- 569
V++ P+ H + P+ + + +Q+ L +GS+
Sbjct: 541 VVLWPALFAHNS-VPGNRALAPAIMVPVFARDAPLQE-DLAGWLRNGSAAHNHNVCADRV 598
Query: 570 ----DAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
++ + VV +PY+LP Y+++++PW RY + D G W
Sbjct: 599 SPVRNSAVNVTVVGFRMPYDLPLCPYTADEMPWCATMRYAEPDGKGMAW 647
>gi|440634212|gb|ELR04131.1| hypothetical protein GMDG_01435 [Geomyces destructans 20631-21]
Length = 570
Score = 125 bits (314), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 144/532 (27%), Positives = 230/532 (43%), Gaps = 109/532 (20%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIP 217
+ S F+L R++ P N VS+ +++ +I + NYM D+D+L+ P
Sbjct: 69 ISSPFKLTRIRDSPGSLNNGSVSLGEIVCDPMIREMWQFNYMHDLDFLMSNMDPDTKDTV 128
Query: 218 HVLVIHG--ESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 272
+ V+HG + + L HMK K N L +P FGTHH+K M+L+ + +II+
Sbjct: 129 KIHVVHGYWKQESGL-HMKSQALKYPNVHLRCAYMPEIFGTHHTKMMVLLRHDDQAQIII 187
Query: 273 HTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEEC-GFENDLIDYLSTLKWP-EFSANLP 329
HTAN+I DW N SQ W PL L+++ + Y S L++ +F L
Sbjct: 188 HTANMIPQDWANLSQDAWTSPLLPLLPAEKLADQTLARGSKSASYGSGLRFKLDFLGYLK 247
Query: 330 AHGNFK--INPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTVLQECTF 383
A+ + + P K++FSS L+ VPG H S +G +R +L
Sbjct: 248 AYDSRRTICKPLIEELLKYDFSSIRGALVGHVPGRHHVESDNPTLFGWSAIRAILNTIPV 307
Query: 384 EKGFKKSPLVYQFSSLGSL--DEKWMAE--LSSSMSSGFSEDKTP-LGIGEPLIVWPTVE 438
G K +V Q SS+ +L ++W+ + ++ +S S KTP LG IV+PT +
Sbjct: 308 HNG-DKPEVVAQVSSIATLGVTDQWLQKTLFAALSASSNSPSKTPKLG-----IVFPTPD 361
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKD----FLKKYWAKWKASH------------------ 476
++R SL+GY +G +I + V ++ +LK + W +
Sbjct: 362 EIRKSLDGYNSGGSIHVRIQTVAQEKQLQYLKPLFYHWAGDNRPVSPPSTSSPGPSTVAS 421
Query: 477 ---------------------TGRSRAMPHIKTFARYNGQ-------------KLAKAAW 502
GR+RA PHIKT+ R+ + L+K AW
Sbjct: 422 TVREAWQNRAGPSAVASTVREAGRNRAAPHIKTYIRFADEAKTRIDWALVTSANLSKQAW 481
Query: 503 GALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVT 562
G + I SYELGVL+ PS ++ + +VP T Q + K
Sbjct: 482 GERLNAAGDVRICSYELGVLVSPSM------YAEDAVMVP---------TFQTDRPK--- 523
Query: 563 LTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+A + +PY+LP RY +++ PW K Y + D G+ +
Sbjct: 524 -------EAVDGKITIGCRMPYDLPLVRYGADEEPWCATKAYEELDWMGRSY 568
>gi|307211795|gb|EFN87776.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 463
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 171/367 (46%), Gaps = 41/367 (11%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWL-LPACPVLAKIPHVLVIHGESDGT-LEHMKRNKPAN 240
I D G+I+ ++ ++VD++WL L + ++ H D T L P
Sbjct: 99 ILDKSLGEIVNSLHLTFIVDVEWLCLQYALAGQRTDMTILYHNRRDDTDLSDNISIMP-- 156
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP--- 295
+++ L + THH+K M+L Y G+R++V TANL DW N++QGLW+ P
Sbjct: 157 --VYEAELVFNSETHHTKIMILQYKDDGIRVVVSTANLYSNDWENRTQGLWISPHLPRLP 214
Query: 296 -LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
L ++ F+ D YLS P + K +FS+ V +
Sbjct: 215 ELASSSDGESPTNFKQDFKRYLSRYWNPALKQWMDV----------VSKADFSAVNVCFV 264
Query: 355 ASVPGYHTGSSLKKWGHMKL-RTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSS 413
ASVPG +T + WGH KL R + Q T + ++ Q SS+G+L + + LS
Sbjct: 265 ASVPGNYTHFNADYWGHRKLARVLFQHTTLPPDAPQWSIIAQSSSIGNLGPNYESWLSKE 324
Query: 414 MSSGFSEDKTPLGIGEPLI--VWPTVEDVRCSLEGYAAGNAI-PSPQKNVDKDFLKKYWA 470
+ S++ + P ++P+VE+ S + + + + +++ + +++ +
Sbjct: 325 IVLSMSQETMQMTNRYPKFQYIYPSVENYERSFDFRNSISCFYYTAERHSKQQWIEPFLH 384
Query: 471 KWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYE 518
+WKA+ TGR RAMPHIK++ R + L+K+AWG S I +YE
Sbjct: 385 QWKATRTGRDRAMPHIKSYMRISPDLKRISWFMLTSANLSKSAWGV---KRSTYSITNYE 441
Query: 519 LGVLILP 525
GV+ LP
Sbjct: 442 AGVVFLP 448
>gi|451851539|gb|EMD64837.1| hypothetical protein COCSADRAFT_36213 [Cochliobolus sativus ND90Pr]
Length = 610
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 132/538 (24%), Positives = 217/538 (40%), Gaps = 114/538 (21%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIP 217
+PS RL R++ LP N V + D++ +I + NY+ D+D+++ + +
Sbjct: 103 IPSPVRLTRIEKLPKEKNVDTVGLTDLLGDPLIKECWNFNYLFDLDFIMQHFDRDIRDMV 162
Query: 218 HVLVIHGESDGT-------LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
V ++HG G LE +R N L +P FGTHHSK ++L + +
Sbjct: 163 KVKIVHGFWRGDDKNRIALLETAERY--PNIELISAYIPDPFGTHHSKMLILFRHDDTAQ 220
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG------------FENDLIDYL- 316
+++HTAN+IH DW N +Q +W ++ SE+ F+ DL+ YL
Sbjct: 221 VVIHTANMIHRDWANMTQAVWASPLLPLLRHTTSEQSNSSKIHSIGSGERFKVDLLRYLY 280
Query: 317 ----------STLKWPEFS-----------------ANLPAHGNF------KINPSFFKK 343
S LK+ +FS A P+H F +I S K
Sbjct: 281 AYGMRLGALTSQLKYYDFSSIRAAFLGSAPSKQKLTAAGPSHTAFGWLGLDQILSSIPVK 340
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD 403
+ S ++ + T + W +++L C K +K F+ L
Sbjct: 341 ASGDSLRPHIVTQISSVATLGATPTW-LFHFQSILSRCPDAKDTEKEEASSSFTKASMLF 399
Query: 404 EKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKN 459
K + + + FS +V+PT ++R L+GY AG +I S Q+
Sbjct: 400 TKQESNAAEAPEPKFS------------VVFPTPAEIRMPLDGYTAGGSIHWKFQSVQQQ 447
Query: 460 VDKDFLKKYWAKW--------KASHTGRSRAMPHIKTFARYNGQ-------------KLA 498
+++ W R A PHIKT+ R++ + L+
Sbjct: 448 KQLEYMHPILCHWTPVSRPDPSQQEAHRGTAAPHIKTYIRFSDETHTTIDWALLTSANLS 507
Query: 499 KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKT 558
K AWG + N ++ ++S+E GV++ P+ F +S +VP + + ET +
Sbjct: 508 KQAWGDVMNKNEEIRVQSWETGVVMWPAL---FAEFEHSSTMVPV-FGADNPETGK---- 559
Query: 559 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 616
HG G VV +PY LP YS+++ PW Y + D YG W R
Sbjct: 560 -------HGE---GKRETVVGFRMPYNLPLVPYSADERPWCATLAYEEPDRYGLTWAR 607
>gi|332029126|gb|EGI69137.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 511
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/441 (25%), Positives = 187/441 (42%), Gaps = 79/441 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ + VD+ WL + + + ++ E + N I
Sbjct: 114 ILDRSLGEIVNSLHLTFRVDVTWLYLQYLLAGQCTDMTILCKRKTRIHEKLSEN-----I 168
Query: 243 LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQN- 300
F +HH+ M+L Y G+R+IV TA L +W N++QGLW+ P ++
Sbjct: 169 TIIKVDGHEFSSHHTNIMILQYKNGIRVIVSTAGLYSAEWENRTQGLWISPHLPYLPESA 228
Query: 301 ---NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ GF+ DL YLS P + + ++ +FS V L+ASV
Sbjct: 229 HPSDGESSTGFKKDLERYLSKYDQPVLTQWICT----------VRRVDFSDVNVFLVASV 278
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFS---SLGSLDEKWMA-ELSSS 413
PG H + WG KL VL ++ P+V Q S + GS E W+ ++
Sbjct: 279 PGIHKSYEINFWGCKKLAYVLSRYVTLPSNEQWPIVIQSSGVGNFGSTIESWLLRDIIRC 338
Query: 414 MSSGFSEDKTPLGIG---EPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYW 469
MS +T +G+ + ++P++E+ + S + ++ S + + + +L++Y
Sbjct: 339 MSK-----ETSIGLKNHPQFQFIYPSIENYKQSFDCQDLITSLTYSVEIHSKQQWLEQYL 393
Query: 470 AKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSY 517
+WKA TGR AMP IK++ R + L+KAAWG +++ I +Y
Sbjct: 394 YQWKAKRTGRDCAMPGIKSYTRISPDSKRVPWFLLTSANLSKAAWGLIKRYEG-YSIGNY 452
Query: 518 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 577
E GVL +P K++T T + V
Sbjct: 453 EAGVLFIP---------------------------------KVITGTATFPIGEEEDAAV 479
Query: 578 VYLPVPYELPPQRYSSEDVPW 598
P+PY+LP RY S+D P+
Sbjct: 480 PTFPIPYDLPLSRYDSDDSPF 500
>gi|327358116|gb|EGE86973.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis ATCC 18188]
Length = 655
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 141/597 (23%), Positives = 228/597 (38%), Gaps = 157/597 (26%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIV--AILSNYMVDIDWLLPACPV-LA 214
+PS +L ++ A + N V +RD++ GD ++ + NYM D+D+L+ +
Sbjct: 71 IPSPIQLTHIRDFSASSGNNADAVRLRDIL-GDPLIKESWQFNYMFDVDFLMSQFDEDVR 129
Query: 215 KIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPR 266
+ +V ++HG ES + E +R I+ P P FGTHHSK M+LI +
Sbjct: 130 NLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP--FGTHHSKMMILIRHDD 187
Query: 267 GVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECGFENDLIDY 315
++++HTAN+I DW N Q +W P+ + N F+ DLI Y
Sbjct: 188 QAQVVIHTANMIAGDWANMCQAVWRSPLLPMCHEMKRGPGSTASNRFGSGIRFKRDLIAY 247
Query: 316 LSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK----- 368
L A+G K P +K++FS+ L+ASVP L
Sbjct: 248 LE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASVPSRQAIDELDSEKHTL 295
Query: 369 WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDK 422
WG L+ +Q+ KG + +V Q SS+ +L + KW+ E + S
Sbjct: 296 WGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKWLKETFFAALSPSPSRS 355
Query: 423 TPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWA 470
+ G+ +P I++PT +++R SL GYA+G +I S + ++L+ Y
Sbjct: 356 SSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQSSAQRKQLEYLRPYLC 415
Query: 471 KWKAS---------------------------------------------HTGRSRAMPH 485
+W GR RA PH
Sbjct: 416 RWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDKNGQPIRQAGRRRAAPH 475
Query: 486 IKTFARYNGQKLAK-------------AAWGALQKNNSQLMIRSYELGVLILPS------ 526
IKT+ R++ L AWGA ++ I S+E+GV++ P
Sbjct: 476 IKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICSWEIGVIVWPDLFVNRK 535
Query: 527 --------------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL--------- 563
G + + ++ K K+ +
Sbjct: 536 VDDDEDDDEDDDDDDDDDDDGSEWKEKGKGKKARENDRRGAREDKNKVAVMLPCFKQDMP 595
Query: 564 TWHGSSDAGAS------SEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
D+G+S + V L +PY+LP Y+ +D PW Y + D GQ W
Sbjct: 596 EVRVDKDSGSSTTTATTTTFVGLRMPYDLPLSPYTPQDQPWCATASYKETDWLGQTW 652
>gi|240276539|gb|EER40051.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus H143]
Length = 685
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 129/479 (26%), Positives = 198/479 (41%), Gaps = 122/479 (25%)
Query: 152 NFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLP 208
N +S +PS +L ++ A + NT V +RD++ +I NYM D+D+L+
Sbjct: 61 NAPISSRIIPSPIQLTHIRDFSASSGYNTDSVKLRDILGDPLIKECWQFNYMFDVDFLMS 120
Query: 209 ACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAM 260
+ + V +IHG ES + E +R I+ P P FGTHHSK M
Sbjct: 121 QFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP--FGTHHSKMM 178
Query: 261 LLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECGF 308
+LI + ++++HTAN+I DW N Q +W P++ + + + F
Sbjct: 179 ILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGHASATLDGVGRGNRF 238
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSL 366
+ DL+ YL A+GN K P +K++F + LIASVP L
Sbjct: 239 KRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASVPTRQAIDEL 286
Query: 367 KK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KWMAEL----- 410
WG L+ +Q+ G KK ++ Q SS+ +L + KW+ E
Sbjct: 287 DSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKWLKETFFAAL 346
Query: 411 --SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDK 462
S +S KT P I++PT +++R SL GYA+G +I S +
Sbjct: 347 SPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKLQSAAQRKQL 403
Query: 463 DFLKKYWAKW----------KASHT---------------------------------GR 479
++L+ Y +W A H+ GR
Sbjct: 404 EYLRPYLCRWAGDTGDGSDISAKHSINSGQERCDSKDANESVRKYVTTGKNSQPIRNAGR 463
Query: 480 SRAMPHIKTFARYNGQKLAK-------------AAWGALQKNNSQLMIRSYELGVLILP 525
RA PHIKT+ R++ LA AWGA ++ I S+E+GVLI P
Sbjct: 464 RRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEIRICSWEIGVLIWP 522
>gi|325092032|gb|EGC45342.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus H88]
Length = 682
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 129/479 (26%), Positives = 198/479 (41%), Gaps = 122/479 (25%)
Query: 152 NFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLP 208
N +S +PS +L ++ A + NT V +RD++ +I NYM D+D+L+
Sbjct: 61 NAPISSRIIPSPIQLTHIRDFSASSGYNTDSVKLRDILGDPLIKECWQFNYMFDVDFLMS 120
Query: 209 ACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAM 260
+ + V +IHG ES + E +R I+ P P FGTHHSK M
Sbjct: 121 QFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP--FGTHHSKMM 178
Query: 261 LLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECGF 308
+LI + ++++HTAN+I DW N Q +W P++ + + + F
Sbjct: 179 ILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGHASATLDGVGRGNRF 238
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSL 366
+ DL+ YL A+GN K P +K++F + LIASVP L
Sbjct: 239 KRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASVPTRQAIDEL 286
Query: 367 KK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KWMAEL----- 410
WG L+ +Q+ G KK ++ Q SS+ +L + KW+ E
Sbjct: 287 DSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKWLKETFFAAL 346
Query: 411 --SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDK 462
S +S KT P I++PT +++R SL GYA+G +I S +
Sbjct: 347 SPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKLQSAAQRKQL 403
Query: 463 DFLKKYWAKW----------KASHT---------------------------------GR 479
++L+ Y +W A H+ GR
Sbjct: 404 EYLRPYLCRWAGDTGDGSDISAKHSINSGQERRDSKDANESVRKYVTTGKNSQPIRNAGR 463
Query: 480 SRAMPHIKTFARYNGQKLAK-------------AAWGALQKNNSQLMIRSYELGVLILP 525
RA PHIKT+ R++ LA AWGA ++ I S+E+GVLI P
Sbjct: 464 RRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEIRICSWEIGVLIWP 522
>gi|225555717|gb|EEH04008.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus G186AR]
Length = 637
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 127/484 (26%), Positives = 197/484 (40%), Gaps = 132/484 (27%)
Query: 152 NFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLP 208
N +S +PS +L ++ A + NT V +RD++ +I NYM D+D+L+
Sbjct: 61 NAPISSRIIPSPIQLTHIRDFAASSGYNTDSVKLRDILGDPLIKECWQFNYMFDVDFLMS 120
Query: 209 ACPV-LAKIPHVLVIHGESDGTLEHMKRNKP----ANWILHKPP--------LPISFGTH 255
+ + V +IHG KR P + H+ P +P FGTH
Sbjct: 121 QFDEDVRDLVKVKIIHGS-------WKRESPNRIRVDEACHRYPNVEPIVAYMPEPFGTH 173
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLS 303
HSK M+LI + ++++HTAN+I DW N Q +W P++ + + +
Sbjct: 174 HSKMMILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGHASATLDGVG 233
Query: 304 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYH 361
F+ DL+ YL A+GN K P +K++F + LIASVP
Sbjct: 234 RGNRFKRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASVPTRQ 281
Query: 362 TGSSLKK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KWMAEL 410
L WG L+ +Q+ G KK ++ Q SS+ +L + KW+ E
Sbjct: 282 AIDELDSEKQTLWGWPALKDTIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKWLKET 341
Query: 411 -------SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQ 457
S +S KT P I++PT +++R SL GYA+G +I S
Sbjct: 342 FFAALSPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKLQSAA 398
Query: 458 KNVDKDFLKKYWAKW----------KASHT------------------------------ 477
+ ++L+ Y +W A H+
Sbjct: 399 QRKQLEYLRPYLCRWASDTGDGSDISAKHSINSGQERCESKNVNESVQKCVATSKNSQPI 458
Query: 478 ---GRSRAMPHIKTFARYNGQKLAK-------------AAWGALQKNNSQLMIRSYELGV 521
GR RA PHIKT+ R++ LA AWGA ++ I S+E+GV
Sbjct: 459 RNAGRCRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEIRICSWEIGV 518
Query: 522 LILP 525
L+ P
Sbjct: 519 LVWP 522
>gi|307211791|gb|EFN87772.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 530
Score = 122 bits (306), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 171/368 (46%), Gaps = 48/368 (13%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ +MVD WL + + +++++GE K N
Sbjct: 153 ILDRSLGEIVNSLHLTFMVDARWLCLQYLLAGQCTDMMILYGERVD-----KEKLGDNIT 207
Query: 243 LHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
+P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ L +
Sbjct: 208 TVHVEMPFEFGCHHTKIMILQYRDNGIRVVVSTANLYSDDWENRTQGMWISPH-LPRLSK 266
Query: 302 LSEECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
++ CG F+ DL YL T P K +K +FS+ V LIAS
Sbjct: 267 AAKRCGESPTNFKKDLQRYLGTYHNPA----------LKRWRKLVRKADFSAINVCLIAS 316
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLD---EKWMA-ELS 411
PG ++ WG+ KL VL + T + ++ Q SS+G+ E W++ E+
Sbjct: 317 TPG-RFRHTVNLWGYKKLADVLFRHVTQLPNALEWSIIAQSSSVGNFGPRYEGWLSKEIV 375
Query: 412 SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK--DFLKKYW 469
SM+ D + +++P+VE+ S + Y G + + V ++K Y
Sbjct: 376 RSMAWKTVRDLKDYPKFQ--LIYPSVENYEQSFD-YQNGTSCFFYSREVHSKLQWIKSYL 432
Query: 470 AKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSY 517
+WKA+ TGR++AMP+IK++ R + L K AWG + N I +Y
Sbjct: 433 YQWKATKTGRNQAMPYIKSYTRISPDLKRIAWFVLTSANLNKGAWGVQRSN---YYIANY 489
Query: 518 ELGVLILP 525
E+GV LP
Sbjct: 490 EVGVAFLP 497
>gi|154273448|ref|XP_001537576.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150416088|gb|EDN11432.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 610
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 126/480 (26%), Positives = 196/480 (40%), Gaps = 122/480 (25%)
Query: 151 CNFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLL 207
N +S +PS +L ++ A + NT V +RD++ +I NYM D+D+L+
Sbjct: 60 VNAPISSRVIPSPIQLTHIRDFSASSGYNTDSVKLRDILGDPLIKECWQFNYMFDVDFLM 119
Query: 208 PACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKA 259
+ + V +IHG ES + E +R I+ P P FGTHHSK
Sbjct: 120 SQFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP--FGTHHSKM 177
Query: 260 MLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECG 307
M+LI + ++++HTAN+I DW N Q +W P++ + + +
Sbjct: 178 MILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMENGHSYATLDGVRRGNR 237
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSS 365
F+ DL+ YL A+GN K P +K++F + LIASVP
Sbjct: 238 FKRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASVPTRQAIDE 285
Query: 366 LKK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KWMAEL---- 410
L WG L+ +Q+ G KK ++ Q SS+ +L + KW+ E
Sbjct: 286 LDSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKWLKETFFAA 345
Query: 411 ---SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 461
S +S KT P I++PT +++R SL GYA+G +I S +
Sbjct: 346 LSPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKLQSAAQRKQ 402
Query: 462 KDFLKKYWAKWKAS-------------------------------------------HTG 478
++L+ Y +W + G
Sbjct: 403 LEYLRPYLCRWAGDTGDGSDISAKHPINSGQERCDSKDANESVQKYVTTGKNSQPIRNAG 462
Query: 479 RSRAMPHIKTFARYNGQKLAK-------------AAWGALQKNNSQLMIRSYELGVLILP 525
R RA PHIKT+ R++ LA AWGA ++ I S+E+GVL+ P
Sbjct: 463 RRRAAPHIKTYIRFSDADLATIDWAMVTSANLSVQAWGAAANGKKEIRICSWEIGVLVWP 522
>gi|451995661|gb|EMD88129.1| hypothetical protein COCHEDRAFT_1227354 [Cochliobolus
heterostrophus C5]
Length = 571
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 135/536 (25%), Positives = 223/536 (41%), Gaps = 113/536 (21%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP-VLAKIP 217
+PS +L +++ LP N V + D++ +I + NY+ D+D+++ + K+
Sbjct: 63 IPSPVQLTQIEKLPREKNVDTVCLSDLLGDPLINECWNFNYLFDLDFVMQHFDWDVRKMV 122
Query: 218 HVLVIHGESDG------TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRI 270
+ ++HG G TL P N L +P FGTHHSK ++L Y +I
Sbjct: 123 RIKIVHGFWRGDDKNRMTLLEAAEEYP-NIELISAYIPDPFGTHHSKMLILFRYDDTAQI 181
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG------------FENDLIDYLST 318
I+HTAN+I DW N +Q +W+ ++ SEE F+ DL+ YL
Sbjct: 182 IIHTANMIRRDWANMTQAVWVSPLLPLLRHTTSEESKSTSIHSIGSGERFKVDLLRYLY- 240
Query: 319 LKWPEFSANLPAHGN-FKINPSFFKKFNFSSAAVRLIASVPGYHTGS----SLKKWGHMK 373
A+G + S K +NFS + S P S S +G +
Sbjct: 241 -----------AYGKGTRALTSQLKHYNFSGIRAAFLGSAPSRQKPSAASPSHTAFGWLG 289
Query: 374 LRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDE--KWMAELSSSM--------------S 415
L +L + + +V Q SS+ +L W+ S + S
Sbjct: 290 LDQILSGIPAKASEDSSRPHVVTQISSVATLGATPTWLFHFQSILSRCSNVNDSEKEEAS 349
Query: 416 SGFSEDKT--------PLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 461
S F+E T +G EP +V+PT +++R SL+GY++G +I S Q+
Sbjct: 350 SSFTEACTLSIQQKTNTVGAPEPKFSVVFPTPDEIRMSLDGYSSGGSIHWKFESAQQQKQ 409
Query: 462 KDFLKKYWAKW----------KASHTGRSRAMPHIKTFARYNGQ-------------KLA 498
+++ W + +H RS A PHIKT+ R++ + L+
Sbjct: 410 LEYMHPILCHWAPVSQPDQPQRKAH--RSTAAPHIKTYIRFSDETHTTIDWALLTSSNLS 467
Query: 499 KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKT 558
K AWG + N ++ I+S+E GV++ P+ +S I+ + E +
Sbjct: 468 KQAWGDVANKNDEIRIQSWETGVVLWPAL---FAEHEHSSTIMVPVFGIDNPEADSTYEA 524
Query: 559 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
K T VV +PY LP YS+++ PW + + D YG+ W
Sbjct: 525 KKGT--------------VVGFRMPYNLPLVPYSADERPWCATMAHKEPDRYGRTW 566
>gi|354543539|emb|CCE40258.1| hypothetical protein CPAR2_102960 [Candida parapsilosis]
Length = 532
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 138/560 (24%), Positives = 214/560 (38%), Gaps = 113/560 (20%)
Query: 114 QKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEA--LCNFHVSRDKLPSTFRLLRVQG 171
+KR S+ E +K+ + + E+ ++ + EE L N + S +LL
Sbjct: 3 EKRKSDAFKAASEHWAKRFKNESERVQDDSAHHEETKPLGNNSTTVSCFSSQIKLLHNPS 62
Query: 172 LP----AWANTSCVSIRDVI-QGDIIVAILSNYMVDIDWLLPAC--PVLAKIPHVLVIHG 224
P N V I D+I ++ N+ VD+ + L A+ ++ I G
Sbjct: 63 YPEQDLTRVNQDTVRIHDLIGSSELKETYQFNFNVDLPFFLSFLHPTFTARKRKLVFITG 122
Query: 225 ES--DGTLEHMKRNKPANWILH-KPPLPISFGTHHSKAML-LIYPRGVRIIVHTANLIHV 280
D E K K + I + +P FGTHH+K M+ + +I+ + NL +
Sbjct: 123 NKLLDSADEETKSIKSSYNISEVQANIPSRFGTHHTKMMINFFHGNSAEVIIMSCNLTKL 182
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF 340
D+ +Q +W + ++ F++DLI YL T + P+ A
Sbjct: 183 DFGGLTQMIWRSGRLARGNTTGTKSIKFKSDLIGYLRTYEKPQIDTLATA---------- 232
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECT--------------FEKG 386
+ F+FS V LIAS PG++ ++ + H ++ C F
Sbjct: 233 LETFSFSGIDVDLIASSPGHYDLNNEEP--HYGYGSLFDACKRNDLLIDNRDKSHHFNVL 290
Query: 387 FKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGE-------------PLIV 433
+ S + Y F+ L M +E L G P IV
Sbjct: 291 AQTSAISYPFAVEKGATAGVFTHLLCPMLFSKNEKFCLLAPGAQSLRRHQSKHNYTPSIV 350
Query: 434 WPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKAS----HTGRSRAM 483
+P+V++V S G+AAG AI KN +K Y KW + TGR R M
Sbjct: 351 FPSVDEVAASTVGFAAGQAIHFDYSRSYVHKNYYNQAIKPYHKKWDSGDVKVFTGRERVM 410
Query: 484 PHIKTFARYNG-------------QKLAKAAWGALQKNN------SQLMIRSYELGVLIL 524
PH+K + NG L+K AWG+ + N SQ + SYELG+L+
Sbjct: 411 PHVKLYMCDNGDNWETIKWCYMGSHNLSKQAWGSRKGNKFVNNDPSQYEVNSYELGILVT 470
Query: 525 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 584
P + + PS + SDAG V Y+ +P+
Sbjct: 471 PRP---------NTKMKPSYL-----------------------SDAGTEGGVTYIRMPF 498
Query: 585 ELPPQRYSSEDVPWSWDKRY 604
+LPP YS D PWS Y
Sbjct: 499 KLPPAAYSDNDKPWSGHVSY 518
>gi|189242173|ref|XP_970490.2| PREDICTED: similar to tyrosyl-dna phosphodiesterase [Tribolium
castaneum]
Length = 358
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 163/377 (43%), Gaps = 73/377 (19%)
Query: 252 FGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE- 305
FG HHSK + Y +R+++ TANL + DWN+ +QGLW+ P E
Sbjct: 23 FGCHHSKIGIYFYEDNSLRVVISTANLYYEDWNHYNQGLWLSPPCPQLPETATEKSGESP 82
Query: 306 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 365
GF++ L++YL NLP K + K+ +FS+ V L+ SVPG H +
Sbjct: 83 TGFKSSLLNYLKHY-------NLPV---LKPWIDYVKRADFSAVRVFLVTSVPGKHYPGT 132
Query: 366 LKKWGHMKLRTVLQECTF-------EKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGF 418
H + + C+ +G ++ Q SS+GS+ + L S++
Sbjct: 133 QGSHVHHVGDLLSRHCSLPAKTGPDSEGPLSWGIIAQASSIGSMGKSPAEWLRSTLLRSL 192
Query: 419 SEDKTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWK 473
S K + I++P+V++V G +G +P S Q N + +L+ Y +WK
Sbjct: 193 SGHKQTQLVSNSNATLSIIFPSVDNVMNGYFGAESGGCLPYSKQTNEKQRWLQSYLHQWK 252
Query: 474 ASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMIRSYELGV 521
A GRSRAMPHIKT+ R + L+K+AWG + + +RSYE GV
Sbjct: 253 ADKLGRSRAMPHIKTYCRVSPCLSKLAWFFITSANLSKSAWGGNLQKDKGAYVRSYEAGV 312
Query: 522 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 581
+ LP K E +I+ T +G + ++ P
Sbjct: 313 MFLP--------------------KFFDEEYFEIETTL-----------SGKNKKL--FP 339
Query: 582 VPYELPPQRYSSEDVPW 598
Y+LP Y S D PW
Sbjct: 340 FMYDLPLTEYKSSDYPW 356
>gi|261191861|ref|XP_002622338.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis SLH14081]
gi|239589654|gb|EEQ72297.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis SLH14081]
Length = 653
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 122/473 (25%), Positives = 195/473 (41%), Gaps = 122/473 (25%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIV--AILSNYMVDIDWLLPACPV-LA 214
+PS +L ++ A + N V +RD++ GD ++ + NYM D+D+L+ +
Sbjct: 71 IPSPIQLTHIRDFSASSGNNADAVRLRDIL-GDPLIKESWQFNYMFDVDFLMSQFDEDVR 129
Query: 215 KIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPR 266
+ +V ++HG ES + E +R I+ P P FGTHHSK M+LI +
Sbjct: 130 NLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP--FGTHHSKMMILIRHDD 187
Query: 267 GVRIIVHTANLIHVDWNNKSQGLW----------MQDFPLKDQNN-LSEECGFENDLIDY 315
V++++HTAN+I DW N Q +W M+ P +N F+ DLI Y
Sbjct: 188 QVQVVIHTANMIAGDWANMCQAVWRSPLLPMCPEMEHGPGSTASNRFGSGIRFKRDLIAY 247
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK----- 368
L A+G K P +K++FS+ L+ASVP L
Sbjct: 248 LE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASVPSRQAIDELDSEKHTL 295
Query: 369 WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDK 422
WG L+ +Q+ KG + +V Q SS+ +L + KW+ E + S
Sbjct: 296 WGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKWLKETFFAALSPSPSRS 355
Query: 423 TPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWA 470
+ G+ +P I++PT +++R SL GYA+G +I S + ++L+ Y
Sbjct: 356 SSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQSSAQRKQLEYLRPYLC 415
Query: 471 KWKAS---------------------------------------------HTGRSRAMPH 485
+W GR RA PH
Sbjct: 416 RWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDKNGQPIRQAGRRRAAPH 475
Query: 486 IKTFARYNGQKLAK-------------AAWGALQKNNSQLMIRSYELGVLILP 525
IKT+ R++ L AWGA ++ I S+E+GV++ P
Sbjct: 476 IKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICSWEIGVIVWP 528
>gi|307211790|gb|EFN87771.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 553
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 116/445 (26%), Positives = 186/445 (41%), Gaps = 87/445 (19%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G I+ ++ N MVD+ WL + + P+++++ + G E + N +
Sbjct: 165 ILDRSLGQIVSSLHLNCMVDVGWLCLQYLLAGQRPNMVILCSQRLGE-EELGDNIT---V 220
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQN 300
+H +P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ P
Sbjct: 221 VHVE-MPFEFGCHHTKVMILQYKDVGIRVVVSTANLYASDWKNRTQGIWISPHLP----- 274
Query: 301 NLSEEC---------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
LSE F+ DL YL++ + P K +K +FS+ V
Sbjct: 275 RLSEAAKWSSGESPTNFKKDLQRYLNSYQNPA----------LKRWRDLVRKADFSAVNV 324
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 410
IAS PG+ + WG+ KL VL Q K ++ Q S++GS K+ L
Sbjct: 325 CFIASTPGHFRRIDVNLWGYKKLANVLSQHVMLPPDAPKWSIIAQSSAVGSFGPKYEGWL 384
Query: 411 SSSMSSGFSE--DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV--DKDFLK 466
S + + ++ E ++P+V++ S + Y G++ K V + ++K
Sbjct: 385 SKEIVRSMTRETERDLKDYPEFQFIYPSVKNYEQSFD-YQDGSSCFLYMKEVHSKQQWIK 443
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQLMI 514
Y +WKA +G +AMPHIK++ R + L+K AWG I
Sbjct: 444 SYLYQWKAK-SGCDQAMPHIKSYTRISPDLKRIAWFVLTSANLSKGAWGV---QRGDYYI 499
Query: 515 RSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGAS 574
+YE+GV LP F T + + I
Sbjct: 500 TNYEVGVAFLPKFITGTTTFPITDEDLTAPI----------------------------- 530
Query: 575 SEVVYLPVPYELPPQRYSSEDVPWS 599
P+PY+ P Y S D P++
Sbjct: 531 -----FPIPYDFPLCPYDSNDSPFT 550
>gi|189210395|ref|XP_001941529.1| tyrosyl-DNA phosphodiesterase 1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977622|gb|EDU44248.1| tyrosyl-DNA phosphodiesterase 1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 624
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 129/548 (23%), Positives = 226/548 (41%), Gaps = 119/548 (21%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIP 217
+PS +L R++ L N V + D++ +I + N++ D+D+++ + +
Sbjct: 100 IPSPIQLTRIEKLSDHQNVDTVGLADLLGDPLIKECWNFNFLFDLDFVMQHLDRDVRDMV 159
Query: 218 HVLVIHG---ESDGT----LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
V ++HG D LE +R N L +P FGTHHSK ++L + +
Sbjct: 160 KVKIVHGFWKRDDANRISLLETAERY--PNIELLSAYIPDPFGTHHSKMLILFRHDDTAQ 217
Query: 270 IIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQ--NNLSEECG---------FENDLIDYLS 317
+++HTAN+IH DW N +Q +W P+ Q +LS+ F++DL+ Y+
Sbjct: 218 VVIHTANMIHRDWANMTQAVWASPQLPMLSQASQSLSDSDKTYPIGSGQRFKSDLLRYIG 277
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH----TGSSLKKWGHMK 373
+ K + ++FSS I S P SS +G +
Sbjct: 278 AYE-----------KRLKGLAAQLGDYDFSSIRAAFIGSAPSRQKPERAVSSNNSFGWLG 326
Query: 374 LRTVLQECTFEKGFKKSP--LVYQFSSLGSLDEK--WM--------------------AE 409
L+ +L K SP +V Q SS+ +L W+ A
Sbjct: 327 LKEILSTVPISKARASSPPHIVAQVSSIATLGAAPTWLSNFQSVLSSHSKATVSVPENAT 386
Query: 410 LSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDK 462
+SS+ +S F++ T + I++PT E++R SL GY +G +I S Q+
Sbjct: 387 VSSTKASTFTKRDTSVTKAPSPKFSIIFPTPEEIRNSLNGYGSGGSIHWKLQSAQQQKQL 446
Query: 463 DFLKKYWAKWKA--------------SHTGRSRAMPHIKTFARYNGQK------------ 496
+++ W + R A PHIKT+ R++ ++
Sbjct: 447 EYMHPMLCHWTSTPSASASSLTNVSKQEAHRGPAAPHIKTYIRFSDEEQKAIDWAMLTSA 506
Query: 497 -LAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS--------EIKS 547
+K AWG ++ I+S+E GV++ P+ ++VP E
Sbjct: 507 NFSKQAWGDTVNKKEEIWIQSWETGVVVWPALFAETAKGVNEVSMVPVFGKDMPKVEDAR 566
Query: 548 GSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK 607
+T+ ++ +T++ T V L +PY+LP + Y++++ PW YT+
Sbjct: 567 VNTKGKEVGETRIKT--------------TVGLRMPYDLPLKPYTADEKPWCATMAYTEP 612
Query: 608 DVYGQVWP 615
D G WP
Sbjct: 613 DRNGHFWP 620
>gi|157875345|ref|XP_001686067.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania major strain Friedlin]
gi|68129140|emb|CAJ06851.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania major strain Friedlin]
Length = 828
Score = 118 bits (296), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 143/617 (23%), Positives = 232/617 (37%), Gaps = 198/617 (32%)
Query: 179 SCVSIRDVIQGDIIVA-------ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-- 229
S + +RD+ + D+ +LS+Y+ D+ WLL P L+ + LV+ GT
Sbjct: 183 SLLRLRDLFRCDVADPGECWQHILLSSYVTDLRWLLATVPELSAVTGKLVVLSGEKGTAT 242
Query: 230 -------------------------LEHMKRNKPANWILH-----------KPPLPISFG 253
+ ++ LH +PPLP++FG
Sbjct: 243 LRRTTGDPSSPYTAVPPLMDRVNPFMTALREQASGTSPLHTALSRERLAVLEPPLPVAFG 302
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 313
T+H+K L I +G+R+ + TANL+ DW KSQG+++QDFP K S + ++
Sbjct: 303 TYHTKMALCINGKGLRVSIFTANLVEQDWCWKSQGIYVQDFPWKPVTERSNDDSAGTIMV 362
Query: 314 DYLSTL------------KWPEFSANLPAH-------------------------GNFKI 336
+ + K EF A+L + G F+
Sbjct: 363 ETAARSTSNSNNGSNTFTKGAEFVAHLRHYLMRCGVSLASACASPADAASAAGPLGIFET 422
Query: 337 NPSFFKKFNFSSAAVRLIASVPG-YHTG--SSLKKWGHMKLRTVLQECTFEKGFKKSP-- 391
+ F +F++AAV L++SVPG Y G + + G +L VL+ +
Sbjct: 423 D--FLSHIDFTAAAVWLVSSVPGTYAHGEVCPVYRVGLCRLGEVLRRSALTTATAPASVD 480
Query: 392 LVYQFSSLGSLDEKWMAELSSSMS----SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
L +Q+SS GSL+ ++ L ++M + P G+ + +V+PT E+VR S EG+
Sbjct: 481 LSWQYSSQGSLNPAFLNSLQAAMCGESVAVIESGDAPRGVRDVQVVYPTEEEVRNSWEGW 540
Query: 448 AAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----------------------------- 478
G ++P + +F+ W +S G
Sbjct: 541 RGGGSLPLCVQCC-HEFVNARLHCWGSSEAGHMAKRAFPRPAKVAAVHASREDAVDVDGV 599
Query: 479 -------------------RSRAMPHIKTFAR------------YNGQKLAKAAWGAL-- 505
R A+PHIK++A L++AAWG+L
Sbjct: 600 DSDGGEGTPVSLAGSCAAYRRFALPHIKSYAAVAPDRSCVRWFLLTSANLSQAAWGSLSR 659
Query: 506 ---QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS--EIKSGSTETSQIQKTKL 560
Q + Q ++RSYELGVL + + S S + S E+ + + + +T L
Sbjct: 660 KVNQHGSRQQLVRSYELGVLYDSHSAIYQSASSWFSVVAKSKIELPNACNSRAMLYETPL 719
Query: 561 VTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS-------------------------E 594
G ++ V L PY L P Y+S
Sbjct: 720 -----------GIGTQDVCLYTPYNLLCPTPYASTAALRAHRDAPDKGEQAVAGAALDCS 768
Query: 595 DVPWSWDKRYTKKDVYG 611
DVPW D + +D YG
Sbjct: 769 DVPWVLDMPHRGRDAYG 785
>gi|448079213|ref|XP_004194340.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
gi|359375762|emb|CCE86344.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
Length = 575
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 131/503 (26%), Positives = 202/503 (40%), Gaps = 106/503 (21%)
Query: 177 NTSCVSIRDVI-QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE---SDGTLEH 232
N + V++ D+I D+ + N+ +D+++ L K + + G S +
Sbjct: 110 NYNAVTLSDMIGMSDLQSSFQFNFAIDLEFFLEHVDRSKKSKTITFVLGSDLLSPEVKDE 169
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWM 291
+++ + K LP FGTHH+K M+ Y G II+ T NL +D++ +Q W
Sbjct: 170 VQKRYGVDASDIKVDLPKRFGTHHTKMMVNFYEDGTCEIIIMTCNLQPIDFSALTQMCWR 229
Query: 292 QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSA 349
K ++ + + F+ D+I YL + P KIN KF+ S
Sbjct: 230 SGRLSKASSSNAGQNRFKTDIIRYLKRYRKP------------KINELADTLAKFDMSGI 277
Query: 350 AVRLIASVPG----YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK 405
V L+ASVPG +++G+ KL VL+ G + + Y + +
Sbjct: 278 DVELVASVPGNFNLARATDESEEYGYGKLYQVLKRNDLLLGNEDTDKEYNVLAQATSISY 337
Query: 406 WMAELSSSMSSGFSEDKTPLGIGE--------------------------PLIVWPTVED 439
A + +S FS PL P I++P +D
Sbjct: 338 PFALKEKNTASVFSHIICPLVFSRNSERLFDVLEPGTKSFRDHQIKHSYNPHIIYPCAKD 397
Query: 440 VRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFAR 491
+ S G+ +G AI + +N + +K Y KW+ASH GR PH+K +
Sbjct: 398 IALSGTGFYSGQAIHFKYDTSAIHRNQYEQNIKPYLYKWRASHKNAGRDETPPHVKLYMC 457
Query: 492 YNG-------------QKLAKAAWGALQK------NNSQLMIRSYELGVLILPSAKRHGC 532
NG L+K AWGA ++ + S I SYELGVLI PS H
Sbjct: 458 DNGDNWKTLRWVLMASHNLSKQAWGARRELRYRSADPSAYEISSYELGVLI-PSKSDH-- 514
Query: 533 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 592
+VP S E S+ G V + +P+ LPP+RYS
Sbjct: 515 ------KLVPVFDSSHQQEVSE-----------QGD---------VPVRIPFILPPERYS 548
Query: 593 SEDVPWSWDKRY-TKKDVYGQVW 614
S+D PWS Y + KD +G W
Sbjct: 549 SDDKPWSAYSNYGSLKDKFGNTW 571
>gi|390364206|ref|XP_788891.3| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Strongylocentrotus
purpuratus]
Length = 414
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 119/422 (28%), Positives = 187/422 (44%), Gaps = 81/422 (19%)
Query: 260 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG-----FENDLI 313
M L+Y G+R+++HTAN+I DW+ K+QG+W+ FP +N + G F+ DL+
Sbjct: 2 MFLLYADGMRVVIHTANIIESDWHQKTQGVWISPLFPKLPSSNQTATNGESPSFFKRDLL 61
Query: 314 DYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 371
YL+ + P + P + +FSSA V LI+SVPG H KWGH
Sbjct: 62 AYLTAYRSPS------------LQPWKDHITQHDFSSAKVFLISSVPGRHARELKNKWGH 109
Query: 372 MKLRTVLQECTFEK-GFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSS--GFSEDKTP 424
+K+R +L++ +K + P++ QFSS+GSL KW+ AE SMS+ G S T
Sbjct: 110 LKVRKILRQYGPDKEQVQTWPVIGQFSSIGSLGGDKTKWLCAEFLQSMSTVKGQSGSFTS 169
Query: 425 LGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTG---- 478
+ +++P ++VR SLEGY AG ++P S Q + +L +++ + G
Sbjct: 170 NADTRHMKLIFPCSDNVRTSLEGYPAGASLPYSIQTAKKQPYLHQFFLREILRQYGPDKE 229
Query: 479 RSRAMPHIKTFARYNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRH-GCGFSCT 537
+ + P I F+ K W + S ++ + G + RH F C+
Sbjct: 230 QVQTWPVIGQFSSIGSLGGDKTKWLCAEFLQSMSTVKG-QSGSFTSNADTRHMKLIFPCS 288
Query: 538 SNIVPS--EIKSGSTETSQIQKTK------------LVTLTWHGSSDAGASS-------- 575
N+ S +G++ IQ K L W G+ + AS
Sbjct: 289 DNVRTSLEGYPAGASLPYSIQTAKKQPYLHQFFFANLSKAAW-GAYEKNASQLMIRSYEI 347
Query: 576 EVVYLP----------------------VPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQ 612
V+ +P +P+++P YS D PW WD YT K D +G
Sbjct: 348 GVMMIPSFFDKSRKTFPLTEGRGQKEFSLPWDVPLTPYSKTDRPWIWDIPYTDKPDSHGN 407
Query: 613 VW 614
W
Sbjct: 408 AW 409
>gi|330928975|ref|XP_003302469.1| hypothetical protein PTT_14295 [Pyrenophora teres f. teres 0-1]
gi|311322144|gb|EFQ89422.1| hypothetical protein PTT_14295 [Pyrenophora teres f. teres 0-1]
Length = 621
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 125/542 (23%), Positives = 224/542 (41%), Gaps = 106/542 (19%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIP 217
+PS +L R+ L N V + D++ +I + N++ D+++++ + +
Sbjct: 96 IPSPIQLTRIMKLHGHQNVDTVGLNDLLGDPLIKECWNFNFLFDLEFVMQHFDRDVRDMV 155
Query: 218 HVLVIHG---ESDGT----LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
V ++HG D LE +R N L +P FGTHHSK ++L + +
Sbjct: 156 KVKIVHGFWKRDDANRISLLETAERY--PNIELLSAYIPDPFGTHHSKMLVLFRHDDTAQ 213
Query: 270 IIVHTANLIHVDWNNKSQGLWMQ-DFPL----------KDQNNLSEECGFENDLIDYLST 318
II+HTAN+IH DW N +Q +W+ PL + N + F++DL+ Y+
Sbjct: 214 IIIHTANMIHRDWANMTQAVWVSPQLPLLSRASQSQSDTNTNPIGSGERFKSDLLRYIGA 273
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS----SLKKWGHMKL 374
+ K + + ++FSS I SVP S +G + L
Sbjct: 274 YE-----------KRLKGLIAQLEDYDFSSIRAAFIGSVPSRQKPGRAIPSTTSFGWLGL 322
Query: 375 RTVLQECTFEKGFKKSP--LVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIGEP 430
+ +L K SP +V Q SS+ +L W++ L S +SS +S+ T +
Sbjct: 323 KEILSTIPISKAKAFSPPHIVAQVSSIATLGAAPTWLSNLQSVLSS-YSKATTSVPENTT 381
Query: 431 L-------------------------IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 461
+ +++P E++R SL+GY +G +I S Q+
Sbjct: 382 VSFTKASSFFTKRDDSVRIASSPKFSVIFPNPEEIRNSLDGYGSGGSIHWKLQSAQQQKQ 441
Query: 462 KDFLKKYWAKWKASHTG--------------RSRAMPHIKTFARYNGQK----------- 496
+++ W ++ + R A PHIKT+ R++ +
Sbjct: 442 LEYMHPMLCHWASTPSAPALASTDVPRREAHRGPAAPHIKTYIRFSDDEQNTIDWAMLTS 501
Query: 497 --LAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 554
L+K AWG + ++ I+S+E GV++ P+ F+ T+ E+
Sbjct: 502 ANLSKQAWGDVVNKKEEIWIQSWETGVVVWPAL------FAETTQAAVDEVVMVPMFGKD 555
Query: 555 IQKTKLVTLTWHG-SSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 613
+ + G ++ +V +PY+LP + Y++++ PW YT+ D G
Sbjct: 556 MPGVDDNGVNLEGKEAEEMRPKTIVGFRMPYDLPLKPYTADEKPWCATMAYTEPDRNGHA 615
Query: 614 WP 615
WP
Sbjct: 616 WP 617
>gi|239608603|gb|EEQ85590.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis ER-3]
Length = 653
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 119/473 (25%), Positives = 192/473 (40%), Gaps = 122/473 (25%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIV--AILSNYMVDIDWLLPACPV-LA 214
+PS +L ++ A + N V +RD++ GD ++ + NYM D+D+L+ +
Sbjct: 71 IPSPIQLTHIRDFSASSGNNADAVRLRDIL-GDPLIKESWQFNYMFDVDFLMSQFDEDVR 129
Query: 215 KIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPR 266
+ +V ++HG ES + E +R I+ P P FGTHHSK M+LI +
Sbjct: 130 NLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP--FGTHHSKMMILIRHDD 187
Query: 267 GVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECGFENDLIDY 315
++++HT N+I DW N Q +W P+ + N F+ DLI Y
Sbjct: 188 QAQVVIHTTNMIAGDWANMCQAVWRSPLLPMCHEMKRGPGSTASNRFGSGIRFKRDLIAY 247
Query: 316 LSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK----- 368
L A+G K P +K++FS+ L+ASVP L
Sbjct: 248 LE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASVPSRQAIDELDSEKHTL 295
Query: 369 WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDK 422
WG L+ +Q+ KG + +V Q SS+ +L + KW+ E + S
Sbjct: 296 WGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKWLKETFFAALSPSPSRS 355
Query: 423 TPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWA 470
+ G+ +P I++PT +++R SL GYA+G +I S + ++L+ Y
Sbjct: 356 SSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQSSAQRKQLEYLRPYLC 415
Query: 471 KWKAS---------------------------------------------HTGRSRAMPH 485
+W GR RA PH
Sbjct: 416 RWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDKNGQPIRQAGRRRAAPH 475
Query: 486 IKTFARYNGQKLAK-------------AAWGALQKNNSQLMIRSYELGVLILP 525
IKT+ R++ L AWGA ++ I S+E+GV++ P
Sbjct: 476 IKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICSWEIGVIVWP 528
>gi|281201405|gb|EFA75617.1| protein-tyrosine phosphatase 3 [Polysphondylium pallidum PN500]
Length = 665
Score = 115 bits (288), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 136/285 (47%), Gaps = 59/285 (20%)
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEND 311
FG HSK MLL+Y +R+++ +AN D+++ Q +W QDFP N+ F++
Sbjct: 390 FGCQHSKLMLLVYDDSIRVVIPSANPTRFDYDDIGQTIWFQDFP--KVNSQPPPSQFQDT 447
Query: 312 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 371
L ++ + P +F K++FS A V LI S+PGYH G+S+ + GH
Sbjct: 448 LKLFIKSCALPN---------------TFLDKYDFSIAKVHLIVSIPGYHRGASMNQCGH 492
Query: 372 MKLRTVLQECTFEKG-----------FKKSPLVYQFSSLGSLDEKWMAE-LSSSMSSGFS 419
M+LR++L++ +K KK + Q SSLG +++KW + L S+ + S
Sbjct: 493 MQLRSILKKYYTDKENDLKHSDFPIIIKKREVHSQTSSLGLVNDKWSPQFLESTQTLTKS 552
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR 479
+ P G+ I++P + + I K D L+ W + SH
Sbjct: 553 KLVDPTGLLH--ILFPKNLILHSKI--------ITGTTKFEHNDKLRFDWV-YVGSHN-- 599
Query: 480 SRAMPHIKTFARYNGQKLAKAAWGALQKNNSQLMIRSYELGVLIL 524
L+ AAWG LQK+NSQL I ++E+GVL+L
Sbjct: 600 -----------------LSPAAWGRLQKDNSQLYISNFEIGVLLL 627
>gi|260945317|ref|XP_002616956.1| hypothetical protein CLUG_02400 [Clavispora lusitaniae ATCC 42720]
gi|238848810|gb|EEQ38274.1| hypothetical protein CLUG_02400 [Clavispora lusitaniae ATCC 42720]
Length = 748
Score = 115 bits (288), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 130/495 (26%), Positives = 206/495 (41%), Gaps = 103/495 (20%)
Query: 176 ANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPAC-PVLAKIPHVLV-IHGESDGTLEH 232
N V++ D++ D++ N+ VD+++ L P AK +V + G +
Sbjct: 293 VNVDTVTVHDLVGAPDLLETFQFNFNVDLEYFLTFLHPNFAKNKRKIVFVTGTAYLAGHP 352
Query: 233 MKRNKPANWILHK--PPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGL 289
++ A + + + PLP F +HHSK M+ YP V II+ T NL +D+ +Q +
Sbjct: 353 LREIIKAKYNISECIAPLPNRFASHHSKMMINFYPHDQVEIIIMTCNLTQLDFGGLTQSV 412
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
W + + F+ DL YL K + + +N++S
Sbjct: 413 WRSGKLKRGKTTAKLGSRFKQDLERYLLKYKMATIEKVVQR----------LRDYNYNSV 462
Query: 350 AVRLIASVPGY----HTGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLD 403
V L+AS PG H + + +G+ KLR VLQ + + K ++ Q +S+
Sbjct: 463 GVELVASAPGTYSIDHIDENDETYGYGKLRQVLQRNDLLIKDTEKHHNILAQVTSIAYPY 522
Query: 404 EKWMAELSSSMSS-----GFSEDKTPLGIG-------------EPLIVWPTVEDVRCSLE 445
+ +S +S FS K L G +P +V+PTV++V S
Sbjct: 523 SSRKGDTASILSHLLCPLMFSHWKKHLEPGTQSTSKHQEEFKYKPQLVFPTVKEVASSNF 582
Query: 446 GYAAGNAIPSP-------QKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFARYNG-- 494
G+ +G+A+ QK +++ +K Y KW TGR R PH+K +A NG
Sbjct: 583 GFLSGSAVHFKHSGSLIHQKQYEQN-VKPYLCKWSTPENVTGRERVTPHVKYYACDNGDG 641
Query: 495 -----------QKLAKAAWGALQ-KNNSQLM-IRSYELGVLILPSAKRHGCGFSCTSNIV 541
L+K AWG + K+ Q + SYEL VL+ S K N+V
Sbjct: 642 WNTLKWVLVGSHNLSKQAWGYPEAKSKGQTFDVASYELSVLVPGSGK----------NLV 691
Query: 542 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV--PYELPPQRYSSEDVPWS 599
P K SS+ + +PV P++LPP RY D+PWS
Sbjct: 692 PVFKKD-------------------------VSSDTITIPVRFPFKLPPTRYGENDLPWS 726
Query: 600 WDKRYTK-KDVYGQV 613
Y K KD +G +
Sbjct: 727 AGSDYGKLKDRWGNL 741
>gi|332376511|gb|AEE63395.1| unknown [Dendroctonus ponderosae]
Length = 584
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 171/379 (45%), Gaps = 63/379 (16%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD--GTLEHMKRNKPANWILHKP 246
G + ++ N+M+DI WL+ + L I D +E+M+R P N H
Sbjct: 187 GPLKESLQINFMIDIGWLVKQYKAREQDNKPLTILYGDDWPDMVEYMRRFCP-NVKHHFV 245
Query: 247 PLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 305
+ FG HH+K + Y +R++V TANL + DWN+ +QGLW+ K +N +E
Sbjct: 246 KMKDPFGCHHTKLGIYAYEDESIRVVVSTANLYYEDWNHYNQGLWISPRLAKLPSNSAER 305
Query: 306 -----CGFENDLIDYLSTLK------WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
GF+ L+DYL + + W ++ AN +F V L+
Sbjct: 306 DGEAITGFKGHLLDYLRSYQLPILRDWVKYVANA----------------DFGEVKVALV 349
Query: 355 ASVPGYH----TGSSLKKWGHMKLRTVLQECTF---EKGFKKSPLVY----QFSSLGSLD 403
S PG H GS L + G + + Q C + PL + Q SS+GS+
Sbjct: 350 YSAPGKHYAKQNGSHLHRVGDL----LSQHCVLPAKTTAQSEGPLSWGILAQASSIGSIG 405
Query: 404 EKWMAELSSS-MSSGFSEDKTPL-GIGEPLI--VWPTVEDVRCSLEGYAAGNAIP-SPQK 458
+ L S + S S ++PL G + I V+P+V +V G +G +P S
Sbjct: 406 KTAAEWLRGSLLRSLASHKQSPLPGNSQATISLVYPSVSNVAHGYFGLESGGCLPYSKAT 465
Query: 459 NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN------------GQKLAKAAWGALQ 506
N + +L+ Y +W A R+RAMPHIK++ R + L+K+A G
Sbjct: 466 NEKQRWLQTYMHQWIADARHRTRAMPHIKSYCRVSPGLDKLAYFLLTSANLSKSARGNNI 525
Query: 507 KNNSQLMIRSYELGVLILP 525
+ + IRSYE+GV+ LP
Sbjct: 526 QKDGGCYIRSYEMGVMFLP 544
>gi|391868838|gb|EIT78047.1| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae 3.042]
Length = 389
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/397 (24%), Positives = 177/397 (44%), Gaps = 82/397 (20%)
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE------CGFENDLIDYLSTLK 320
VR+++HTAN+I DW N Q +W PL+ ++ E+ F+ DL+ YL+
Sbjct: 22 VRVVIHTANMIPGDWANMCQAVWRSPLLPLQKTDDRVEDLTLGSGARFKRDLLAYLT--- 78
Query: 321 WPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMK 373
+G K P +K++F + L+ASVP L WG
Sbjct: 79 ---------EYGPKKTGPLVEQLRKYDFGAIRAALVASVPSKQKVDDLDSQKKTLWGWPA 129
Query: 374 LRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGI 427
L+ ++++ + K+ +V Q SS+ +L +KW+ + + +S+S + + P
Sbjct: 130 LKDIMRQIPPAQKTTKATTPHIVTQISSVATLGQTDKWLKDVMFASLSPASTSTRQP--- 186
Query: 428 GEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH------- 476
+ I++PT +++R SL GY +G +I S + +++ Y W H
Sbjct: 187 -KYSIIFPTADEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLRHWAGDHDTAEPSH 245
Query: 477 -----TGRSRAMPHIKTFARYNGQK--------------LAKAAWGALQKNNSQLMIRSY 517
GR RA PHIKT+ R++ + L+ AWGA + ++ I S+
Sbjct: 246 TSKQDAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNASGEVRICSW 305
Query: 518 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 577
E+G+++ P + ++ +VP+ K + E + + ++ T V
Sbjct: 306 EIGIVVWPQLYVQD---TESATMVPT-FKRDTPEPLENKDSETTPDT------------V 349
Query: 578 VYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
+ L +PY+LP Y++ D PW ++ + D GQ W
Sbjct: 350 IGLRMPYDLPLTPYAAHDTPWCATAQHLEPDWLGQTW 386
>gi|448516422|ref|XP_003867567.1| hypothetical protein CORT_0B04230 [Candida orthopsilosis Co 90-125]
gi|380351906|emb|CCG22130.1| hypothetical protein CORT_0B04230 [Candida orthopsilosis]
Length = 533
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 142/572 (24%), Positives = 223/572 (38%), Gaps = 133/572 (23%)
Query: 112 RSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQG 171
+++ + S DG T+ E +RQ D + A+ +F PS +LL
Sbjct: 22 KTESKQSQDGKTDCE----DVRQPD--------TTSVAIASF-------PSQLKLLYNPS 62
Query: 172 LPA----WANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHGE 225
P N + IRD+I ++ N+ VD+ + L P + +V
Sbjct: 63 YPEKELPSVNQDTLRIRDLIGSALLKETYQFNFNVDLPFFLSFLHPTFKREERKIVFITG 122
Query: 226 S---DGTLEHMKRNKPANWILH--KPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIH 279
S D + E + K AN+ + + +P FGTHH+K M+ Y V +I+ + N
Sbjct: 123 SRLLDPSFEETESIK-ANYNISEVQAHIPSRFGTHHTKMMINFYTDESVEVIIMSCNFTR 181
Query: 280 VDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPE--FSANLPAHGNFKIN 337
+D+ +Q +W + ++ F++DLI YL T P+ + A L
Sbjct: 182 LDFGGLTQMIWRSGRLILGNTTGAKSSKFKSDLIAYLRTYARPQIDYLAKL--------- 232
Query: 338 PSFFKKFNFSSAAVRLIASVPG-YHTGSSLKKWGHMKLRTVLQECT-----------FEK 385
+ ++FS V LIAS PG Y S +G+ L + +
Sbjct: 233 ---LEPYSFSGIDVELIASSPGKYDLNSEGPHYGYGSLYNACKRNNLLIDNRDKSRHYNV 289
Query: 386 GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG-------------EPLI 432
+ S + Y FS L M + + L G P I
Sbjct: 290 LAQTSAISYPFSVEKGATAGIFTHLLCPMLFSKNGEFKLLAPGIQSLRRHQSEHNYTPSI 349
Query: 433 VWPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHT----GRSRA 482
++P V +V S G+AAG AI KN + +K Y KW +S + GR +
Sbjct: 350 IFPAVSEVVSSTIGFAAGQAIHFDYSRSFIHKNYYQQAIKPYLKKWNSSSSMSLAGREQV 409
Query: 483 MPHIKTFARYNG-------------QKLAKAAWGALQKN------NSQLMIRSYELGVLI 523
MPH+K + NG L+K AWG+ + N +SQ + SYELGVL+
Sbjct: 410 MPHVKLYMCDNGDNWRSIKWCYMGSHNLSKQAWGSRKGNKFVNDDSSQYEVNSYELGVLV 469
Query: 524 LPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVP 583
+P K + + PS +K D G+ V Y+ +P
Sbjct: 470 VPKPK---------TEMKPSYLK-----------------------DLGSEEGVTYVRMP 497
Query: 584 YELPPQRYSSEDVPWSWDKRYTK-KDVYGQVW 614
++LPP YS D PWS Y + +D G +
Sbjct: 498 FKLPPTAYSENDKPWSGHASYGELRDSKGNTY 529
>gi|19075361|ref|NP_587861.1| tyrosyl-DNA phosphodiesterase Tdp1 [Schizosaccharomyces pombe
972h-]
gi|74625832|sp|Q9USG9.1|TYDP1_SCHPO RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase
gi|6066756|emb|CAB58371.1| tyrosyl-DNA phosphodiesterase Tdp1 [Schizosaccharomyces pombe]
Length = 536
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 133/544 (24%), Positives = 216/544 (39%), Gaps = 110/544 (20%)
Query: 156 SRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC---- 210
S + + S L ++ LP N C+ ++ +I + N+ VD+++LL
Sbjct: 16 SNEIIDSPIFLNKISALPESENVHCLLLKQLIGSPQLKQTWQFNFCVDLNFLLENMHASV 75
Query: 211 --PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPR-G 267
V +I H +S L + P N L+ +P+ +GTHHSK M+ +
Sbjct: 76 FPTVDVRITHGYDSKSDSLARLTAQMNHCPVNVKLYSVYVPM-WGTHHSKIMVNFFKDDS 134
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQ------------------------------DFPLK 297
+I++HTANL+ DW SQ ++ +K
Sbjct: 135 CQIVIHTANLVEPDWIGMSQAIFKTPLLYPKANDSLSTSSVPEYGNPSKIRKHEGSLDIK 194
Query: 298 DQNN---LSEECGFEN----------DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
D N + + FEN D + + +F A L + + K +
Sbjct: 195 DDRNCDIIDVDSAFENFKHKSDTRSSDDLGVIGRQFQQDFLAYLKNYRHTYELIEKLKMY 254
Query: 345 NFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---------LVYQ 395
+FS+ I SVPG G WG KL+ +L+ EK KK + Q
Sbjct: 255 DFSAIRAIFIGSVPGKFEGEEESSWGLGKLKKILK--MLEKDSKKDEKTKFEESDICISQ 312
Query: 396 FSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI-- 453
SS+GS K E + ++ GF + G ++PTV++V+ S+ G+ +G++I
Sbjct: 313 CSSMGSFGPK--QEYIAELTDGFGCQR-----GNWKFLFPTVKEVQQSMLGWQSGSSIHF 365
Query: 454 ----PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ------------KL 497
+ V+ K KW A GR R PHIKT+ R++ L
Sbjct: 366 NILGKTAASQVETLKKGKNLCKWVAMKAGRQRVAPHIKTYMRFSNDGELLRWVLVTSANL 425
Query: 498 AKAAWGALQKNNSQ------LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 551
+K AWG L+ + ++ L IRSYE GVL+ P C I+ K+ +
Sbjct: 426 SKPAWGTLEGHKAKSRSTRGLRIRSYEAGVLLYPKLFEESQRAPC---IMTPTYKTNTPN 482
Query: 552 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYG 611
+ ++ ++G V+ + + ++ PP Y +D WS T KD G
Sbjct: 483 LDEKRR------EFYG-------KRVIGVRMCWDFPPVEYEDKDEIWSPVINRTDKDWLG 529
Query: 612 QVWP 615
VWP
Sbjct: 530 YVWP 533
>gi|300121378|emb|CBK21758.2| unnamed protein product [Blastocystis hominis]
Length = 397
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 148/304 (48%), Gaps = 35/304 (11%)
Query: 242 ILHKPPLPISF--GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLK 297
++ PP S+ G H+K +LL + +RI++ +ANL DW SQ +WMQDF K
Sbjct: 60 LIVSPPFAQSYLRGCFHAKLLLLRFSDRLRIVISSANLTTEDWTMWSQCVWMQDFFNAPK 119
Query: 298 DQNNLSE---ECGFENDLIDYLSTLKWPE---FSANLPAHGNFKINPSFFKKFNFSSAAV 351
D ++ + F LI +L PE F+A F+ F + +V
Sbjct: 120 DSTRVAAKKLDLEFRTQLISFLRKCCVPEERIFNA--------------FRGVFFENVSV 165
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECT--FEKGF---KKSPLVYQFSSLGSLDEKW 406
+L+ASVPG + G + +G ++LR+VL+ EK K P++ Q SS+G+ + W
Sbjct: 166 QLVASVPGVYQGDRMNDYGQLRLRSVLKGLNDYMEKVASLPKNPPILSQCSSIGNPSQNW 225
Query: 407 MAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEG-YAAGNAIPSPQKNVDKDF 464
+ + S G + + + L IV+PT V S+ G AG+ I + K F
Sbjct: 226 ILSMLKSCYGGREIVEKKGKLADLLHIVYPTNVYVNNSIIGPEMAGSLIFMQKVYTAKAF 285
Query: 465 LKKYWAKWKASHTGRSRAMPHIKTFARYNGQK---LAKAAWGALQKNNSQLMIRSYELGV 521
L++ ++K + GR +PH K +K L AWG ++K SQ+ I +YE GV
Sbjct: 286 LREMLKRYKDA-PGRETTLPHSKYLMNVPLKKRPRLPWVAWGQIEKKESQIAICNYECGV 344
Query: 522 LILP 525
++LP
Sbjct: 345 VLLP 348
>gi|444707427|gb|ELW48704.1| Tyrosyl-DNA phosphodiesterase 1 [Tupaia chinensis]
Length = 389
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 113/230 (49%), Gaps = 59/230 (25%)
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
PLV QFSS+G L + KW+ +E S+ + + K P PL +++P+VE+VR SLE
Sbjct: 210 PLVGQFSSIGFLGADESKWLCSEFKESLLTLGRDSKIPGKSTVPLHLIYPSVENVRTSLE 269
Query: 446 GYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAWGAL 505
GY AG ++P + +K W S Y L+KAAWGAL
Sbjct: 270 GYPAGGSLPYSIQTAEKQ-------NWLHS----------------YFHANLSKAAWGAL 306
Query: 506 QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTW 565
+KN +QLMIRSYELGVL LPSA F S V + SGS
Sbjct: 307 EKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGS---------------- 344
Query: 566 HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 614
HG + + PVPY+LPP+ Y +D PW W+ Y K D +G +W
Sbjct: 345 HGPTAS--------FPVPYDLPPELYGHKDRPWIWNIPYVKAPDTHGNMW 386
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 4/82 (4%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV+G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 105 PFQFYLTRVKGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 164
Query: 218 HVLVIHGESDGTLEHM-KRNKP 238
+L++HG+ H+ R KP
Sbjct: 165 PILLVHGDKREAKAHLHARAKP 186
>gi|400603196|gb|EJP70794.1| tyrosyl-DNA phosphodiesterase [Beauveria bassiana ARSEF 2860]
Length = 399
Score = 112 bits (279), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 163/352 (46%), Gaps = 46/352 (13%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKIPH 218
PS FRL V+ L N V++ D++ +I S NY+ I +L+ A + PH
Sbjct: 38 FPSPFRLTWVRDLEEENNKDAVTLSDLLGDPLISECWSFNYLHSISFLMDAFDRDIR-PH 96
Query: 219 VLV--IHG---ESDGTLEHMKRNKP--ANWILHKPPLPISFGTHHSKAMLLIYPRG--VR 269
V V +HG DG + N LH P+P FGTHHSK ML+++ R +
Sbjct: 97 VKVHIVHGFWKREDGNRIGLVEQAALFPNVNLHAAPMPEMFGTHHSK-MLILFRRDDTTQ 155
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDF--PLKD-------QNNLSEECG--FENDLIDYLST 318
+I+HTAN+I DW N + +W LK + ++++ G F++DL+ YL
Sbjct: 156 VIIHTANMIAKDWTNMTNAVWTSPVLSKLKKVPDDPSWREDMAQGSGHRFKSDLLSYLRC 215
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT---GSSLKKWGHMKLR 375
+ N K+++FSS LIASVPG H + WG +
Sbjct: 216 YDRMRPTCNALVES--------LKEYDFSSVRGSLIASVPGTHEVHGDPGVTSWGWKSMS 267
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPL-I 432
LQ+ E G S + Q SS+ +L ++ W L ++ S+ K + +
Sbjct: 268 KCLQQIPCEPGV--SQVAVQVSSIATLGGNDGW---LRGTLFRALSKGKVATALSPQFKV 322
Query: 433 VWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWKASHTGRS 480
V+PT +++R SL+GYA+G + I S Q+ + ++L+ + W R+
Sbjct: 323 VFPTADEIRASLDGYASGGSIHTKIQSKQQQMQLNYLRPIFHHWMTDDDSRT 374
>gi|355723700|gb|AES07977.1| tyrosyl-DNA phosphodiesterase 1 [Mustela putorius furo]
Length = 381
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 95/173 (54%), Gaps = 13/173 (7%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFRFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKK 223
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 224 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 283
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLIDYLSTLKWP 322
NLIH DW+ K+QG+W+ PL Q + F+ DLI YL+ P
Sbjct: 284 NLIHADWHQKTQGIWLS--PLYPQIIHGTHRSGESTTHFKADLISYLTAYNAP 334
>gi|440797312|gb|ELR18403.1| Tyrosyl-DNA phosphodiesterase [Acanthamoeba castellanii str. Neff]
Length = 569
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 81/263 (30%), Positives = 134/263 (50%), Gaps = 35/263 (13%)
Query: 164 FRLLRVQGLP-AWANTSCVSIRDVIQGD-IIVAILSNYMVDIDWLLPACPVLAKIPHVLV 221
F L ++GL A AN+ C+SIR +++ + ++ A+++++ D++W+L P IP LV
Sbjct: 25 FVLNEIKGLRGADANSGCISIRKLVRPESLVAALVTSFTEDVEWVLSVIP--PTIPITLV 82
Query: 222 IHGESDGTLEHMKRNKPANWILHKPPLPI-SFG-------THHSKAMLLIY-PRGVRIIV 272
H E ++ ++ N + PPL + FG H+K MLL Y +R++V
Sbjct: 83 RHWEEPDREGEVRISR--NIRVIHPPLALPGFGGGQAMRAKMHAKLMLLRYRDNTLRVVV 140
Query: 273 HTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPA 330
+ANL D+ Q +W QDFP K Q + ++ FE L +L LK E
Sbjct: 141 TSANLAQPDYELVGQTVWYQDFPKKQQKSSGQQPASPFEETLTQFLVALKADE------- 193
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS-SLKKWGHMKLRTVLQECTFEKG--F 387
F ++++FS AA L+ SVPG+H G + GH +LR +L++ +
Sbjct: 194 --------GFLREYDFSKAAADLVVSVPGFHRGKHKMDAVGHTRLRALLRDFQWPPADEL 245
Query: 388 KKSPLVYQFSSLGSLDEKWMAEL 410
+ + YQ SSLG+L E +++E
Sbjct: 246 RDDNIYYQTSSLGALYESFVSEF 268
>gi|345570074|gb|EGX52899.1| hypothetical protein AOL_s00007g235 [Arthrobotrys oligospora ATCC
24927]
Length = 651
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 146/574 (25%), Positives = 226/574 (39%), Gaps = 124/574 (21%)
Query: 155 VSRDK---LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC 210
VSRD + S F+L +++ LPA N ++I D++ +I I S N+M D++W++
Sbjct: 74 VSRDPTLIISSPFKLTQIRNLPANRNVDTITISDILGSPLIREIWSFNFMHDLEWMVSHL 133
Query: 211 PV-LAKIPHVLVIHG--------------ESDGTLEHMKRNKPANWILHKPPLPISFGTH 255
+AK + +IHG E D ++ + L +P FGTH
Sbjct: 134 DEDVAKDIDIKIIHGNWRKDDMSRKALESERDKLIDLASSDGGYKIELITAYMPDMFGTH 193
Query: 256 HSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEECGFENDLI 313
H+K ++L Y I+VHTAN+I DW+N +Q +W PL ++L + G +
Sbjct: 194 HTKMLVLFYHDDSAEIVVHTANMIPWDWSNMTQAVWRSPKLPLLADDSLERKEG-----V 248
Query: 314 DYLSTLKWPEFSANLPAHG-NFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWG 370
Y+ F+A + A+G K K++F + + VPG H G K +G
Sbjct: 249 GYVFK---EAFTAYVGAYGWRTKSLMEQIVKYDFRAVRAVFVGHVPGDHAINGPENKLFG 305
Query: 371 HMKLRTVLQECTFEKGF---KKSPLVY----------QFSSLGSLDEKWMAEL------- 410
K++ VL G K +VY Q SS+ +L E + +
Sbjct: 306 WSKVKRVLTRIGRGGGHGVNKAGRVVYTVKGGGEIAMQCSSVATLGESYFDSVLYPTFST 365
Query: 411 ---SSSMSSGFSEDKTPLGIG---------EPLIVWPTVEDVRCSLEGYAAGNAI-PSPQ 457
+ F +TP E +V+PTVE+VR S+ G+ G +I Q
Sbjct: 366 CRPGGGQLNAFDVLRTPSSSASSSRPSNRPELALVFPTVENVRTSVLGWDGGGSIFMKSQ 425
Query: 458 KNVDKDFLK------KYWAK-------WKASHTGRSRAMPHIKTF--------------- 489
K VDK LK + W + A R +A PHIKT+
Sbjct: 426 KPVDKAQLKYVKPMLRVWGQPPIGLSTAIAVEAERGKATPHIKTYNFFSPPRMDSKDSDT 485
Query: 490 -----------------ARYNGQKLAKAAWGALQKN---NSQLMIRSYELGVLILPS--- 526
A L+K AWG K +S I+SYE G+LI P
Sbjct: 486 TDGEDESGAFNIVSMDWAMITSANLSKQAWGNPTKGSGPSSTSKIQSYEAGILIHPGLWK 545
Query: 527 -AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 585
+ G S + GS + + K+ D + V + + Y+
Sbjct: 546 DLLKDEAGAVTMSAV-------GSKDWLVAEGQKIENCDVPEDMDGKCNMVKVGVRLAYD 598
Query: 586 LPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRHFQ 619
P + Y +D PW D Y +D G WP ++
Sbjct: 599 YPLKPYDEDDEPWCKDMPYEGRDWKGITWPPRWE 632
>gi|223995471|ref|XP_002287409.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976525|gb|EED94852.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 625
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 113/439 (25%), Positives = 185/439 (42%), Gaps = 120/439 (27%)
Query: 195 ILSNYMVDIDWLLP-ACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFG 253
I+SN+++D +LL P + V+V + E+ +E MK +W + G
Sbjct: 113 IISNFIIDFGYLLEKTLPDILDFHRVVVFYQEAHN-VEAMK-----SW------ENMLAG 160
Query: 254 THHSKAMLLIYP-----RGVRIIVH--TANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 306
T ++ + + P + H +NL D KSQG++ Q FPLK + +
Sbjct: 161 TGNTVEFVRLVPTDPPRSSCNPLSHKFNSNLWRTDIEYKSQGVYSQVFPLKQKTPADDTV 220
Query: 307 G-----------------------------------FENDLIDYLSTLKWPEFSANLPAH 331
FE+DL+ YL + + + + +
Sbjct: 221 NKLKRKQIYNPYEKKKKPAAGSSSRGWPFEDDKSQLFEDDLVGYLESYHYRK-QQSWKMN 279
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFK- 388
G + ++++FS A LI SVPGYH+ S+ +G++KLR + E C +
Sbjct: 280 GESMNLLALIRQYDFSEAYAVLIPSVPGYHS-LSIDDFGYLKLRKAIIEWVCNQQSNADS 338
Query: 389 -------KSPLVYQFSSLGSLDEKWM----AELSSSMSSGF----------------SED 421
K PLV Q+SS+GSL W+ A L S+ +S ++
Sbjct: 339 RKSSSNAKPPLVCQYSSVGSLTTAWLDLFTAALDSTSTSAVDPVEYYHEVTKKAKSRAKG 398
Query: 422 KTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA---SHT 477
K + + E + IVWPTV+++R ++EGY G ++P KNV + FL + +W
Sbjct: 399 KKGVDLSERMKIVWPTVDEIRTTIEGYNGGGSVPGRTKNVAQSFLLPLYHRWTKRGNDFI 458
Query: 478 GRS---------RAMPHIKTFAR----------------YNGQKLAKAAWGALQK----N 508
GR+ R +PHIKT+ + L+KAAWG ++ +
Sbjct: 459 GRTDNVDPLRTARNVPHIKTYVQPSTHVIGDTPSIEWMVLTSHNLSKAAWGNIENRSVDD 518
Query: 509 NSQLMIRSYELGVLILPSA 527
+ L IR +ELGV I P+
Sbjct: 519 SKVLFIRHWELGVFISPAT 537
>gi|326431947|gb|EGD77517.1| hypothetical protein PTSG_08615 [Salpingoeca sp. ATCC 50818]
Length = 594
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 88/195 (45%), Gaps = 38/195 (19%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFA 490
+PTVEDVR S EGY G ++P K D F K KW+A R+RA+PHIKTF
Sbjct: 422 FCYPTVEDVRTSYEGYVGGGSLPHAIKYREDHVFFAKEACKWRAGWCYRTRALPHIKTFT 481
Query: 491 RYN------------GQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTS 538
+N L+KAAWG LQK SQL I SYELGV + PS +
Sbjct: 482 AWNTAARSIDWMLLGSHNLSKAAWGQLQKQESQLHILSYELGVFLSPSL--------LGA 533
Query: 539 NIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 598
+ P K S T + + PVPY+ P YS+ D W
Sbjct: 534 TLRPLADKLRSVRRPDKHDT-----------------QTAWAPVPYDYPLTPYSTHDEMW 576
Query: 599 SWDKRYTKKDVYGQV 613
WD Y + D +G+V
Sbjct: 577 YWDGVYMQPDTHGRV 591
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 134/305 (43%), Gaps = 41/305 (13%)
Query: 124 NGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT----- 178
GEL +K+ + + E + + DKL F+L R++G+ +
Sbjct: 39 GGELETKRAKAAETVRTERVAAATSSRT------DKLDVVFKLSRLRGVGKAGGSLKEAN 92
Query: 179 ---SCVSIRDVI-QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK 234
SI +++ Q ++ ++ NYM+D+DWLL P + +++++G + +
Sbjct: 93 NPLFATSIAEILSQPGLLSSVQFNYMIDVDWLLDQYPAEYRRLPLMIVYGNDQRVSKETE 152
Query: 235 RNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ-D 293
+ P LP +FGTHH+K MLL + G++++VHTANLI DWN K+QG+WM
Sbjct: 153 HDTSNVRWFRAPYLP-AFGTHHTKMMLLFFHDGMQVVVHTANLISRDWNLKTQGIWMSPK 211
Query: 294 FP--------LKDQNNLSEECGFENDLIDYLST--------LKWPEFSANLPAHGNFKIN 337
P ++D ++ S GF DL YL + + AH +
Sbjct: 212 LPRFSPKRGRVQDISSYS-PTGFGADLWSYLRAYGDGVQGGVSMRAVRERIAAHDLTHVK 270
Query: 338 PSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFS 397
F ++ L+ P G + WG + + +L + G +V QFS
Sbjct: 271 VVFACQYERD-----LLPLSPAATAGRTKTAWGQHEAQDLLLQQHAAGG--ADVVVCQFS 323
Query: 398 SLGSL 402
S+G +
Sbjct: 324 SIGKM 328
>gi|448083780|ref|XP_004195441.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
gi|359376863|emb|CCE85246.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
Length = 576
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 127/503 (25%), Positives = 203/503 (40%), Gaps = 106/503 (21%)
Query: 177 NTSCVSIRDVI-QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE---SDGTLEH 232
N + V++ D+I D+ + N+ +D+++ L + + + G S +
Sbjct: 110 NYNAVTLSDMIGMPDLRSSFQFNFAIDLEFFLGHVHRSKESKTITFVLGSDLLSPEVKDE 169
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWM 291
+++ + K LP FGTHH+K M+ Y II+ T NL +D++ +Q W
Sbjct: 170 VQKRYGVDASDIKVDLPKRFGTHHTKMMVNFYENETCEIIIMTCNLQPIDFSALTQMCWR 229
Query: 292 QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSA 349
+ ++ + F+ D+I YL + KIN +F+ S
Sbjct: 230 SGRLSRASSSNPGKPRFKTDIIRYLKRYR------------KQKINELADTLAEFDMSGI 277
Query: 350 AVRLIASVPG----YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK 405
V L+ASVPG T +++G+ KL VL+ G + + Y + +
Sbjct: 278 DVELVASVPGNFNLARTADDSEEYGYGKLYQVLKRNDLLLGNEDTDKEYNVLAQATSISY 337
Query: 406 WMAELSSSMSSGFSEDKTPLGIGE--------------------------PLIVWPTVED 439
A + +S FS PL P I++P +D
Sbjct: 338 PFALKEKNTASVFSHIICPLIFSRNSDRLFDVLEPGTKSFRDHQIKHSYNPHIIYPCAKD 397
Query: 440 VRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFAR 491
+ S G+ +G AI + +N + +K Y KW+ASH GR PH+K +
Sbjct: 398 IALSGTGFYSGQAIHFKYDTSAIHRNQFEQNIKPYLYKWRASHKNAGREETPPHVKLYMC 457
Query: 492 YNG-------------QKLAKAAWGALQK------NNSQLMIRSYELGVLILPSAKRHGC 532
NG L+K AWGA ++ + S I SYELGVLI PS+ H
Sbjct: 458 DNGDNWKTLRWVLMASHNLSKQAWGARRELRYRSADPSTYEISSYELGVLI-PSSSDH-- 514
Query: 533 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 592
+VP S+ Q+ +D G V + +P+ LPP+RYS
Sbjct: 515 ------KLVP-------VFDSRHQRK---------VTDQGD----VPVRIPFILPPERYS 548
Query: 593 SEDVPWSWDKRY-TKKDVYGQVW 614
S+D PWS Y + KD +G W
Sbjct: 549 SDDKPWSAYSNYGSLKDKFGHTW 571
>gi|307108295|gb|EFN56535.1| hypothetical protein CHLNCDRAFT_144174 [Chlorella variabilis]
Length = 682
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 103/211 (48%), Gaps = 15/211 (7%)
Query: 175 WANTSCV--SIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKI----PHVLVIHGESDG 228
WAN + S+ D+++G++ + + + WLL ACP L + E+ G
Sbjct: 476 WANEGFLGLSLGDLVRGEMRWCLYCSMALHARWLLSACPDLRPLVTWRTKTRKALREASG 535
Query: 229 TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG 288
+R ++LH PP+P +G HHSK ML+ Y GVR I+ T NL ++++Q
Sbjct: 536 AAAEGRR-----FVLHTPPVPDRWGRHHSKMMLIEYATGVRFILPTPNLQFHQLHSQTQA 590
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN-PSFFKKFNFS 347
++ QDFP K FE L YL+ L+ P A H + P ++ +FS
Sbjct: 591 VFFQDFPPKQDGTSPPGSDFETSLARYLAALQLPGEEAK---HAQAGWHWPELVRRHDFS 647
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 378
+A L+ASVPG H G +GH +L +L
Sbjct: 648 AARAVLVASVPGSHGGELAAAYGHKRLAALL 678
>gi|195161240|ref|XP_002021476.1| GL26495 [Drosophila persimilis]
gi|194103276|gb|EDW25319.1| GL26495 [Drosophila persimilis]
Length = 511
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/241 (31%), Positives = 117/241 (48%), Gaps = 33/241 (13%)
Query: 307 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 366
GF DL+ YL K + + + +K +FS+ V + SVPG H S+
Sbjct: 236 GFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFVGSVPGGHREGSV 285
Query: 367 K--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
+ WGH +L ++L + + P+V Q SS+GSL A + + +D +P
Sbjct: 286 RGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLRKDSSP 344
Query: 425 LGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 479
G + +++P+ +V S +G G +P + DK +LK + +WK+S R
Sbjct: 345 GGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQPWLKAHLQQWKSSDRHR 404
Query: 480 SRAMPHIKTFARYN------------GQKLAKAAWGALQKNNSQ---LMIRSYELGVLIL 524
SRAMPHIKT++RYN L+KAAWG+ KN + L I +YE GVL L
Sbjct: 405 SRAMPHIKTYSRYNLTDQSIYWFVLTSANLSKAAWGSFNKNTNLQPCLRIANYEAGVLFL 464
Query: 525 P 525
P
Sbjct: 465 P 465
>gi|307211793|gb|EFN87774.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 445
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 75/272 (27%), Positives = 131/272 (48%), Gaps = 25/272 (9%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D+ G+I+ ++ Y++D++WL + + ++ +++GE E + N A +
Sbjct: 165 ILDISFGEIVNSLHLTYILDVEWLCLQYLLAGQSTNMTILYGERTDE-EELDDNITAVQV 223
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
+P FG+HH+K M+L Y G+R++V TANL DW N+ QG+W+ L +
Sbjct: 224 ----QMPFEFGSHHTKIMILQYKDDGIRVVVSTANLYFEDWQNRMQGMWISPH-LPRLSK 278
Query: 302 LSEECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
++ CG F+ DL YL++ + P K +K +FS+ V LIAS
Sbjct: 279 AAKRCGESPTNFKKDLQRYLNSYQNPA----------LKRWRDLVRKADFSAVNVCLIAS 328
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
PGY + + WG+ KL VL Q +K ++ Q S++GS K+ LS +
Sbjct: 329 TPGYFRRTDVDLWGYKKLANVLSQHVMLPSNARKWSIIAQSSAVGSFGPKYEGWLSKEII 388
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLE 445
+ + P ++P+V++ S +
Sbjct: 389 RSMTRETKRDLKNYPKFQFIYPSVKNYEQSFD 420
>gi|328868012|gb|EGG16393.1| protein-tyrosine phosphatase 3 [Dictyostelium fasciculatum]
Length = 596
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 110/421 (26%), Positives = 183/421 (43%), Gaps = 71/421 (16%)
Query: 163 TFRLLRVQGLPAWANTSC----VSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-- 215
+F L R+ G N+S ++ RD+I + ++++ + +D +W++ K
Sbjct: 145 SFYLNRIYGESNDNNSSTTPKTLTFRDIISPSGLESVIAMGFGMDTEWMMNEIIRSQKGR 204
Query: 216 --IPHVLVIH-GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIV 272
IP VI G+ + +N IL + + +G HSK +LL+Y +R++V
Sbjct: 205 KDIPMTFVIDCGDPKKKGTTVIQN--ITLIL----VHVLYGCMHSKLILLLYKDYIRVVV 258
Query: 273 HTANLIHVDWNNKSQGLWMQDFPLKDQN---------------------NLSEECGFEND 311
+AN D+ Q +W QDF K +LS +
Sbjct: 259 PSANPFEEDYIRIGQTIWYQDFQKKLPPPPPPLATTPTLKPIPSTSKTISLSLKQMTTKK 318
Query: 312 LIDYLSTLKWPEFSANLPAHGN-FKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
+T +F +L N FKI F +F+F A +LI S+PG+H G++L +G
Sbjct: 319 PTTTTTTTTTNDFQISLKTLLNCFKIETKFLDQFDFECAKAQLIISIPGFHNGATLNSYG 378
Query: 371 HMKLRTVLQECTFEK---------GFKKSPLVYQFSSLGSLDEKWMAEL--SSSMSSGFS 419
H+KLR+VL +K FK+ + Q SSLG+++ W S +
Sbjct: 379 HLKLRSVLTSYYNQKEKDLNLKIDNFKRD-VFSQCSSLGNVNSGWNQHFLESCRIPKNNL 437
Query: 420 EDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNV-DKDFLKKYWAKWKASHT 477
ED I + L I++PTV + + + + + I K+ DK F + K H
Sbjct: 438 ED-----ISKSLHILFPTVSWITSNHKRMQSASIIRFQDKSYDDKTFPRNSMTLIKHRHP 492
Query: 478 GRSRAMPHIKTFA---------RYN-----GQKLAKAAWGALQKNNSQLMIRSYELGVLI 523
R + H K RY+ L+ AAWG +QKN +Q+ + +YE+GV++
Sbjct: 493 HRGNMLLHTKVNVGVTTIGKNKRYDWIYVGSHNLSPAAWGKIQKNQTQIQLSNYEIGVVL 552
Query: 524 L 524
L
Sbjct: 553 L 553
>gi|299740649|ref|XP_001833897.2| hypothetical protein CC1G_01574 [Coprinopsis cinerea okayama7#130]
gi|298404347|gb|EAU87927.2| hypothetical protein CC1G_01574 [Coprinopsis cinerea okayama7#130]
Length = 627
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 117/441 (26%), Positives = 181/441 (41%), Gaps = 83/441 (18%)
Query: 142 NGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMV 201
NG+ + A D P TFRL +V G + D+ AI+S++ +
Sbjct: 169 NGEFRQTATRGVDPRADGKP-TFRLTQVLGE---------------KKDLTFAIISSFAL 212
Query: 202 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
D+ W+ +P ++V + D T + +N NWI PPL +G H K ML
Sbjct: 213 DLPWIYEFFD--RSVPVIVV--AQPDATGQASMKNVLPNWIKTTPPLRGGYGCQHMKFML 268
Query: 262 LIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN---LSEECGFENDLIDYLS 317
L + G +R++V TANLI DW +W+QD PL+ ++ + F L+ L+
Sbjct: 269 LFHKTGRLRVVVSTANLISYDWREMENTVWLQDVPLRSSSSTAPVRATDDFPGTLLYMLA 328
Query: 318 TLK-WPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMK 373
L P + H N I +++++S L+ S+ G H G S+ K GH +
Sbjct: 329 ALNVVPALKIMINEHPNLPIKTIEELRERWDWSKVKAHLVPSIAGKHEGWPSVIKTGHPR 388
Query: 374 LRTVLQECTFEKGF----KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED-------- 421
L V+++ G KK L Q SSLG+ +W+ E S +ED
Sbjct: 389 LMAVVRKMAMRTGTGSQAKKLTLECQGSSLGNYTTQWLNEFYYSARGESAEDWLDRSKKQ 448
Query: 422 --KTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVD-KDFLKKYWAKWKASHT 477
K P P+ I++PT + V+ S G G I ++ D K+F ++ + K S
Sbjct: 449 REKQPY---PPVKIIFPTKKTVQESTFGEQGGGTIFCRRRQWDGKNFPRELFHDSK-SKA 504
Query: 478 GRS-----------RAMPHIKT----------------------FARYNGQKLAKAAWGA 504
GRS R H T +A +AWG
Sbjct: 505 GRSLMHSKMIIGTLRDSTHASTSQDGSETEDSDDEIQIIQPAVGWAYIGSHNFTPSAWGT 564
Query: 505 LQKN--NSQLMIRSYELGVLI 523
L + N L I +YE+GV+
Sbjct: 565 LSGSSFNPTLNITNYEVGVVF 585
>gi|13543875|gb|AAH06083.1| TDP1 protein [Homo sapiens]
Length = 298
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 79/133 (59%), Gaps = 5/133 (3%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 163 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 222
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 223 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 282
Query: 276 NLIHVDWNNKSQG 288
NLIH DW+ K+QG
Sbjct: 283 NLIHADWHQKTQG 295
>gi|344301196|gb|EGW31508.1| hypothetical protein SPAPADRAFT_154759 [Spathaspora passalidarum
NRRL Y-27907]
Length = 549
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 110/426 (25%), Positives = 170/426 (39%), Gaps = 101/426 (23%)
Query: 248 LPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 306
+P FGTHH+K M+ + + I++ ++N+ +D+ +Q LW K +
Sbjct: 163 IPNRFGTHHTKMMINFFKGDTMEIVIMSSNITRLDFGGLTQMLWRSGRLSKIKPKTIPLV 222
Query: 307 G--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--- 361
G F+ DL++YL+ E + K+++FSS V LIAS PG +
Sbjct: 223 GKRFQKDLMNYLNKYNKVEITQL----------SKRLKQYDFSSVNVELIASAPGSYNLR 272
Query: 362 -TGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
+ + +G+ KL L+ + S L Y + S A + + FS
Sbjct: 273 DVTNETEIYGYGKLHQALKRNSLLIDNSISKLKYNIIAQVSAISYPFAVETFQTAGIFSH 332
Query: 421 DKTPLGIGE------------------------PLIVWPTVEDVRCSLEGYAAGNAI--- 453
PL + P+I++PT E+V S G+ AG AI
Sbjct: 333 LLCPLVFSKKEEFKLLEPGTNSFRQHQKDHNYNPIIIFPTPEEVAGSNVGFRAGGAIHFD 392
Query: 454 ---PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNG-------------Q 495
KN + +K Y KW + + TGR + MPH+K + NG
Sbjct: 393 YNRSFVHKNYYQQCIKPYLHKWSSRETITGREKVMPHVKLYMCDNGDNWSTLKWVYMGSH 452
Query: 496 KLAKAAWGA------LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 549
L+K AWG+ L N S I SYELGVL+ P P E
Sbjct: 453 NLSKQAWGSRRGNKFLSSNPSIYDISSYELGVLVYPK---------------PGE----- 492
Query: 550 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KD 608
TL + D+ S+ + + +P++LPP +Y S D+PWS Y D
Sbjct: 493 ------------TLVPNYLGDSIPKSKNIPIRLPFKLPPVKYLSTDLPWSGHVSYGGLAD 540
Query: 609 VYGQVW 614
YG+ +
Sbjct: 541 KYGETY 546
>gi|324522792|gb|ADY48131.1| Tyrosyl-DNA phosphodiesterase, partial [Ascaris suum]
Length = 306
Score = 98.6 bits (244), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 142/297 (47%), Gaps = 22/297 (7%)
Query: 134 QQDEQDNENGKNSEEALCNFHVSRDKLPST-FRLLRVQGLPAWANTSCVSIRDVIQG-DI 191
+ D D + + ++ F L S ++ G P +T+ S+ ++++
Sbjct: 7 ENDGDDASSARTPSASMVKFRKQDSPLLSNRLYFTKIVGHPCRYSTNAFSLSELLELISP 66
Query: 192 IVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM------KRNKPANWILHK 245
I +I N+M+D+ WLL P + +I GE++GT H+ +R K N + +
Sbjct: 67 IASIHFNFMIDLHWLLSQYPERCSAYPISIIVGENNGT-NHLDVRAEARRCKADNVSVGR 125
Query: 246 PPLPISFGTHHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
L + +GTHHSK ++ + +++ TANL+ DW++K+Q + P+ +
Sbjct: 126 ARLVLPYGTHHSKLSIFETDSEMIHVVISTANLLQNDWDSKTQAFYHCSAPIVNGEVEEG 185
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
+ F DLI YL+ ++ G + +FS R+I+S+PGYH G
Sbjct: 186 QNNFRKDLISYLNAY------SSSSDFGMIEYWRDRIANADFSDVNARIISSIPGYHVGD 239
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK---WM-AELSSSMSSG 417
++GH++LR VL+ + KK V QFSS+GSL K W+ A+ S++ G
Sbjct: 240 QKDRYGHLRLRRVLRSLQLD--LKKPSFVAQFSSIGSLGPKPDSWLTAQFLQSLAGG 294
>gi|294659254|ref|XP_461609.2| DEHA2G01584p [Debaryomyces hansenii CBS767]
gi|199433821|emb|CAG90056.2| DEHA2G01584p [Debaryomyces hansenii CBS767]
Length = 583
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 115/443 (25%), Positives = 181/443 (40%), Gaps = 122/443 (27%)
Query: 248 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 306
LP FGTHH+K M+ Y II+ T NL +D+ +Q W + N+S E
Sbjct: 182 LPTRFGTHHTKMMINFYEDDTSEIIIMTCNLQKIDFGGLTQMCWKSGRLHRSNGNISPER 241
Query: 307 G--FENDLIDYLSTLKWPEFSANLPAHGNFKINP-----SFFKKFNFSSAAVRLIASVPG 359
G F+ DL +YL +K NP +++FS + L+AS PG
Sbjct: 242 GARFQKDLKNYLF---------------RYKKNPLRELGKSLDEYDFSPVNIELVASAPG 286
Query: 360 Y----HTGSSLKKWGHMKLRTVLQECTF----EKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ + + + +G+ KL VL+ KG K ++ Q SS+ A
Sbjct: 287 FFNMAESTNDSEIYGYGKLYQVLRRNNLLIDNSKGENKYNILAQVSSISYP----FATEK 342
Query: 412 SSMSSGFSEDKTPL---GIGE-----------------------PLIVWPTVEDVRCSLE 445
S+ +S FS PL G+ + P I++P+V+DV S
Sbjct: 343 SNTASIFSHLLCPLIFSGMSKASFNLLKPGAASFKSHQNTHNYRPHILYPSVDDVANSNV 402
Query: 446 GYAAGNAI-------PSPQKNVDKDFLKKYWAKWK----ASHTGRSRAMPHIKTFARYNG 494
G+A+G A+ P+ + +++ +K Y +W+ A TGR +PH+K + NG
Sbjct: 403 GFASGQALHFKFTTTPTHRNQYEQN-IKPYLYRWQSGSHADETGRENVVPHVKLYMCDNG 461
Query: 495 -------------QKLAKAAWGALQKNNSQLM--------IRSYELGVLILPSAKRHGCG 533
L+K AWGA KN ++ + SYELGVL+
Sbjct: 462 DDWCTLRWVLMGSHNLSKQAWGA--KNETKFTNSDPSVYKVSSYELGVLV---------- 509
Query: 534 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 593
N+ P++ G T L + + A + L +P++LPP +Y
Sbjct: 510 ---PGNMDPND--DGIT---------LKPIYGRDTFPAPQHNNDTPLRIPFKLPPVKYKP 555
Query: 594 EDVPWSWDKRYTK--KDVYGQVW 614
+ PWS Y KD +GQ +
Sbjct: 556 SERPWSALINYGNNLKDRFGQCY 578
>gi|145533358|ref|XP_001452429.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420117|emb|CAK85032.1| unnamed protein product [Paramecium tetraurelia]
Length = 508
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/340 (26%), Positives = 161/340 (47%), Gaps = 59/340 (17%)
Query: 226 SDGTLEHMKR-NKPANWILHKPPL--PISFG-THHSKAMLLIYPRGVRIIVHTANLIHVD 281
+D LE ++ N NW + KP I+FG + H K +L +P+ +RI++ + NL D
Sbjct: 147 NDKKLEIIEEFNGHPNWTVIKPSKLSSITFGGSFHPKIWILKFPKFIRIVIGSQNLHVGD 206
Query: 282 WNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSF 340
W SQ +W+QDF + + F+ L ++L + LP+ F+ +
Sbjct: 207 WTVWSQAMWIQDFQIGNSELDEVSKEFKVGLKEFLDNI--------LPSSHKFEDLLKIK 258
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF---KKSPLVYQFS 397
+ ++F + +RLI S+PG TG+ + K+G M++++V+ F K+ + YQ +
Sbjct: 259 YNDYDFQNINIRLITSIPGRFTGNQMNKYGMMRIQSVINSELKSSDFEIPKQVSIAYQTT 318
Query: 398 SLGSLDEKWMAELS--------SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 449
S+G LD ++ + + M E+K+ L +++PT + ++ +A
Sbjct: 319 SIGQLDVNYVDFVQQQQNNKSIAQMLFNQQEEKSILK-----LIYPTSDYIQNQT---SA 370
Query: 450 GNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIKTF---------- 489
G +P Q+ + F K + +++ S H G +PH+K
Sbjct: 371 GPEYANPLFLRKQQYDNPKFPKNIFHRYQGSNYYYWHAGN---IPHLKVMIITGIDEKID 427
Query: 490 ---ARYNG-QKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
+ Y G L++AAWG L+KN +QL I + ELGVL P
Sbjct: 428 DKTSIYIGSHNLSQAAWGRLEKNATQLFISNTELGVLYPP 467
>gi|149245486|ref|XP_001527220.1| hypothetical protein LELG_02049 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449614|gb|EDK43870.1| hypothetical protein LELG_02049 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 554
Score = 96.7 bits (239), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 114/443 (25%), Positives = 171/443 (38%), Gaps = 120/443 (27%)
Query: 248 LPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 305
+P FGTHH+K M+ + V I++ ++N+ +D+ +Q +W P + +
Sbjct: 154 IPTRFGTHHTKMMINFFEDLSVEIVISSSNITRLDFGGLTQMVWRSGRLPQSGETIGEKG 213
Query: 306 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGS 364
F+ DLI YL+ K L N +NF S V LIAS PG Y+
Sbjct: 214 IQFKKDLIGYLNKYKKVPVD-KLATRLNL---------YNFLSVDVELIASAPGKYNLQK 263
Query: 365 SLKKWGHMKLRTVLQ--------------ECTFEKGFKK---------SPLVYQFSSLGS 401
+G+ L L+ E +K KK S + Y FS+
Sbjct: 264 DSSLYGYGSLYKALERNNLLLNNKNVEHDEIDNDKHNKKKHYNVLAQVSAISYPFST--- 320
Query: 402 LDEKW-----MAELSSSMSSGFSEDKTPLGIGE-------------PLIVWPTVEDVRCS 443
EKW L + E L G+ P I++PTV++V S
Sbjct: 321 --EKWATAGIFTHLLCPLIFSKDEKFRLLAPGKESIKRHQKEHNYTPHIIFPTVDEVASS 378
Query: 444 LEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHT----GRSRAMPHIKTFARYN 493
GY AG+AI KN +K Y +KW +S T GR R MPH+K + N
Sbjct: 379 TIGYVAGSAIHFDYTRSFVHKNYFTQAIKPYLSKWDSSDTKEVTGRERVMPHVKLYMCDN 438
Query: 494 G-------------QKLAKAAWGALQKN------NSQLMIRSYELGVLILPSAKRHGCGF 534
L+K AWG+ + N + + + SYELGVL P
Sbjct: 439 ADNWKTIKWCYMGSHNLSKQAWGSKKGNKFVNDHSDEYEVSSYELGVLFTP--------- 489
Query: 535 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 594
K G+T ++ K + + ++ +P++LPP YS
Sbjct: 490 -----------KEGTTMVPSYKENK-----------SSIRGDHTFVRMPFQLPPALYSLL 527
Query: 595 DVPWSWDKRYTKK-DVYGQVWPR 616
D+PWS Y K D+ G + +
Sbjct: 528 DMPWSGHVSYGDKLDLMGSTYKK 550
>gi|116194574|ref|XP_001223099.1| hypothetical protein CHGG_03885 [Chaetomium globosum CBS 148.51]
gi|88179798|gb|EAQ87266.1| hypothetical protein CHGG_03885 [Chaetomium globosum CBS 148.51]
Length = 349
Score = 95.9 bits (237), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 87/311 (27%), Positives = 132/311 (42%), Gaps = 66/311 (21%)
Query: 344 FNFSSAAVRLIASVPGYHT---GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
++FS LIASVPG H S+ WG + L+ KK + Q SS+
Sbjct: 62 YDFSEIRGSLIASVPGRHVFEEEDSITWWGSAAMSRALEAVPISS--KKPEIAIQTSSIA 119
Query: 401 SL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI--- 453
+L + W+ L S+ G S TPL +V+PT +++R SL+GY +G++I
Sbjct: 120 TLGGSDTWLKNILFRSLRGGRS--TTPLAQRPSFKVVFPTPDEIRKSLDGYHSGSSIHTK 177
Query: 454 -PSPQKNVDKDFLKKYWAKWK--------------ASHTGRSRAMPHIKTFARYNG---- 494
SPQ+ +L+ + W GR RA PHIKT+ RY+G
Sbjct: 178 TQSPQQASQLTYLRPMFHHWANDSDRGAPLSYGDIPKEAGRKRAAPHIKTYIRYSGYGPE 237
Query: 495 -----------QKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 543
L+K AWG +++ + SYE+GVL+ P G +VP+
Sbjct: 238 PPTVDWALLTSANLSKQAWGDAPNTRNEVRVASYEIGVLVWPELYGEGA------TMVPT 291
Query: 544 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 603
+ E G G ++ V L +PY LP Q Y +VPW ++
Sbjct: 292 FMTDSLAE---------------GEVPEGTATAVA-LRMPYNLPLQAYGEGEVPWVATEK 335
Query: 604 YTKKDVYGQVW 614
+ + D G+ W
Sbjct: 336 HLEPDWMGRAW 346
>gi|440302433|gb|ELP94746.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba invadens IP1]
Length = 446
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 165/389 (42%), Gaps = 84/389 (21%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPL 248
G++ L+ ++ DI WLL P+L K V +H DG+L + N +
Sbjct: 38 GELYACFLTTFVFDIGWLLREVPIL-KTVQVQFVH---DGSLSEDEERLIHNLDFQCIKV 93
Query: 249 PISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGF 308
G HH K M+++Y G+R ++ T NL+ D+ K+ G++++DF K N+ S+
Sbjct: 94 SPFRGCHHVKIMVMLYEGGLRFVLSTGNLLEQDYEIKTNGIYVRDFKPK-SNSFSKM--- 149
Query: 309 ENDLID-YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 367
ND+ + +L+T+++ S N + + F+FS+ L+ SVPG G
Sbjct: 150 -NDIGEHFLTTMRYYLNSIN--------TDIGYLDDFDFSTIDAWLLLSVPGKFHGDMAS 200
Query: 368 KWGHMKLRTVLQECTF---------------------------------EKGFK------ 388
+ G +L ++L+ +F +KG K
Sbjct: 201 EVGLGQLSSLLKSFSFGSQKDQKTQEEHKTSALINPVVPTKQSQKTSTSQKGLKSPEIEC 260
Query: 389 --KSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEG 446
++ ++ Q SSLG L + + SS + +WPT + VR S G
Sbjct: 261 AEQAVIISQSSSLGYLSSNFTEKFKSSFVPNVHHIQLK-------TLWPTEDFVRVSATG 313
Query: 447 YAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHIKTFARYNGQ---------- 495
YA G ++ Q+NV L +Y ++ R PHIKT+ G
Sbjct: 314 YAGGQSLFLTQQNVKSGVALYRYEPRFP-----RHYIQPHIKTYLVKVGDTFRCGVLTSA 368
Query: 496 KLAKAAWGALQKNNSQLMIRSYELGVLIL 524
++ AAWG + + + I ++E+G+L +
Sbjct: 369 NMSAAAWG--KPMSYGIDISNFEMGLLFV 395
>gi|409075791|gb|EKM76167.1| hypothetical protein AGABI1DRAFT_45345 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 625
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 112/432 (25%), Positives = 175/432 (40%), Gaps = 75/432 (17%)
Query: 148 EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL 207
+ F R TFRL +V G N S ++ AILS+Y +D W+
Sbjct: 171 QTATRFAEPRKDGQRTFRLTQVLG-----NKS----------ELAFAILSSYSLDFPWIY 215
Query: 208 PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG 267
+P ++V ++ G +K P W+ PPL FG H K MLL Y G
Sbjct: 216 EFFD--RSVPVIMVAQPDAMGQAA-IKYTFP-TWVKTTPPLRGGFGCQHMKFMLLFYKNG 271
Query: 268 -VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTLK-WPE 323
+R+++ TANLI DW + +W+QD P++ Q + F + + L + P
Sbjct: 272 NLRVVISTANLIAYDWRDMENSVWLQDLPMRPQLMPPDPKAKDFPSIMQQVLHAVNVAPA 331
Query: 324 FSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE 380
LP H N + ++++S V L+AS+ G H G S+ K GH +L ++
Sbjct: 332 LRTMLPDHPNIPLRTIEDLRMRWDWSKVKVHLVASIAGKHEGWPSIVKTGHPRLMMAIRT 391
Query: 381 CTFE--KGFKKSPLVY--QFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIG 428
+G K ++ Q SSLG+ +W+ E S +ED + L
Sbjct: 392 MGLRPSRGLGKGNMIIECQGSSLGNFTTQWLNEFHWSARGESAEDWLDEPKRRREKLPYP 451
Query: 429 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKK--YWAKWKA----------- 474
I++PT + V+ S G G I +K K+F + Y +K KA
Sbjct: 452 SVRILFPTKKIVQESASGEPGGGTIFCRRKQWAAKNFPRDKFYVSKSKAGPVLMHSKMII 511
Query: 475 ---SHTGRSRAM------------PHIKT------FARYNGQKLAKAAWGALQKN--NSQ 511
HT + A P +K +A +AWG L + N
Sbjct: 512 ATIQHTNPASASLNREGSDTEEDEPEVKIIEPAVGWAYVGSHNFTPSAWGTLSGSAFNPI 571
Query: 512 LMIRSYELGVLI 523
L I +YE+G++
Sbjct: 572 LNITNYEIGIVF 583
>gi|170097685|ref|XP_001880062.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164645465|gb|EDR09713.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 609
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 143/331 (43%), Gaps = 47/331 (14%)
Query: 142 NGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMV 201
NG+ + A + +D + +TFRL V G + DI AILS+Y +
Sbjct: 165 NGEFRQTATRHADPRKDNM-ATFRLTEVLGQ---------------KKDIAFAILSSYSL 208
Query: 202 DIDWLL----PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 257
D W+ PA PV ++ + D T + +N +WI P L G H
Sbjct: 209 DWMWIYQFFDPATPV--------IMVAQPDQTGRAIIKNVLPHWIKTTPYLRGGHGCQHM 260
Query: 258 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 316
K MLL Y G +R++V TANLI DW + +W+QD PL+ + + + N D+
Sbjct: 261 KFMLLFYRNGRLRVVVSTANLIEYDWRDMENSVWLQDVPLR-SSPIPHDPKATN---DFP 316
Query: 317 STLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMK 373
S ++ S N+ H N + ++++S V L+ S+ G H G ++ K GH +
Sbjct: 317 SIIQRVLNSLNVKPHPNLALKSIEDLRCRWDWSKVKVHLVPSIAGKHEGWPAVIKTGHPR 376
Query: 374 LRTVLQECTFEKGFKKSP---LVYQFSSLGSLDEKWMAELSSSMSSGFSED--KTPLGIG 428
L ++E G K+ L Q SSLG +WM E S +ED P
Sbjct: 377 LMMAVREMAMRTGKGKAKELILECQGSSLGIYTTQWMNEFHWSARGESAEDWLDEPKKRR 436
Query: 429 EPL------IVWPTVEDVRCSLEGYAAGNAI 453
E L I +P+ V+ S G G I
Sbjct: 437 EKLPYPPIKIFFPSKRTVQESALGEKGGGTI 467
>gi|74830335|emb|CAI39050.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
Length = 521
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 165/350 (47%), Gaps = 66/350 (18%)
Query: 226 SDGTLEHMKR-NKPANWILHKPPL--PISFG-THHSKAMLLIYPRGVRIIVHTANLIHVD 281
+D LE ++ N NW + KP I+FG + H K +L +P+ +RI++ + NL D
Sbjct: 147 NDKKLEIIEEFNGHPNWTVIKPSKLSSITFGGSFHPKIWILKFPKFIRIVIGSQNLHVGD 206
Query: 282 WNNKSQGLWMQDFPL--KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INP 338
W SQ +W+QDF + + + +S+E F+ L ++L + LP+ F+ +
Sbjct: 207 WTVWSQAMWIQDFQIGNSELDEVSKE--FKVGLKEFLDNI--------LPSSHKFEDLLK 256
Query: 339 SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF---KKSPLVYQ 395
+ ++F + +RLI S+PG TG+ + K+G M++++V+ F K+ + YQ
Sbjct: 257 IKYNDYDFQNINIRLITSIPGRFTGNQMNKYGMMRIQSVINSELKSSDFEIPKQVSIAYQ 316
Query: 396 FSSLGSLDEKWMAELSSSMSSGFSEDKTPL-----GIGEPL-----------IVWPTVED 439
+S+G LD ++ + S + + I + L +++PT +
Sbjct: 317 TTSIGQLDVNYVDFVQQCCSGQQIKQSQKIEQNNKSIAQMLFNQQEEKSILKLIYPTSDY 376
Query: 440 VRCSLEGYAAGNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIKTF 489
++ +AG +P Q+ + F K + +++ S H G +PH+K
Sbjct: 377 IQNQT---SAGPEYANPLFLRKQQYDNPKFPKNIFHRYQGSNYYYWHAGN---IPHLKVM 430
Query: 490 -------------ARYNG-QKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
+ Y G L++AAWG L+KN +QL I + ELGVL P
Sbjct: 431 IITGIDEKIDDKTSIYIGSHNLSQAAWGRLEKNATQLFISNTELGVLYPP 480
>gi|449019998|dbj|BAM83400.1| probable tyrosyl-DNA phosphodiesterase [Cyanidioschyzon merolae
strain 10D]
Length = 615
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 96/349 (27%), Positives = 148/349 (42%), Gaps = 83/349 (23%)
Query: 254 THHSKAMLL-IYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDL 312
HHSK M+L + VR+++HT+N I DW K QG++ D PL+ + S GF DL
Sbjct: 208 VHHSKFMILRLRDDRVRLVIHTSNDIAYDWFFKCQGIFAVDLPLRGAGSASPNTGFCADL 267
Query: 313 IDYLS---------------------TLKWPEFSANL-PAHGNFKINPSFFKKFNFSSA- 349
YL T+ P +A+L A +F+ ++S+
Sbjct: 268 QQYLGAYIRAGERALHGGVTSARRFGTMVAPGDAASLVDAVSHFR---RLMTCCDYSAVD 324
Query: 350 AVRLIASVPGYHTGS--------------SLKKWGHMKLRTV----LQECTFEKGFKKS- 390
VRL++SVPG+H S ++ +GH++L + L+ CT S
Sbjct: 325 GVRLVSSVPGWHRISGQSRTSQTSRTASHAVCAFGHLRLANLVASSLRHCTEAARHPNSL 384
Query: 391 PLVYQFSSLGSLDEK------------WM-AELSSSMSSGFSED----------KTPLGI 427
V Q SSL S+D + W+ +EL S+ G K G
Sbjct: 385 AFVLQGSSLSSVDARCPRAASETLARYWLTSELFRSLCGGDGGGGGVGEESVFAKLAEGS 444
Query: 428 GEPLIVWPTVEDVRCSLEGYAAG-NAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 486
+ +VWPT V S+ G +G I Q +D + +++ +W A R+ MPH+
Sbjct: 445 AQVYLVWPTRTQVLTSIVGIDSGMGLIARAQAFLDPE-IRQLLTRWNADWCARTSVMPHM 503
Query: 487 KTFARYNGQ------------KLAKAAWGALQKNNSQLMIRSYELGVLI 523
KT + ++ + + AAWG QK S L ++ELGVL
Sbjct: 504 KTISCWDTRTDQCLYCYLGSANVTPAAWGITQKQGSLLRCMNWELGVLF 552
>gi|145497459|ref|XP_001434718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401846|emb|CAK67321.1| unnamed protein product [Paramecium tetraurelia]
Length = 522
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/335 (27%), Positives = 157/335 (46%), Gaps = 53/335 (15%)
Query: 230 LEHMKR-NKPANWILHKPP---LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
LE ++R N NW + KP + G H K +L +P+ +RI++ + NL DW
Sbjct: 153 LEIIERYNNYPNWTVIKPSKLSTNMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIW 212
Query: 286 SQGLWMQDFPL--KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFK 342
SQG+W+QDF + + + S+E F++ L ++L + LP F+ + +
Sbjct: 213 SQGMWIQDFKIGKSELDQTSQE--FKSMLREFLYEI--------LPTSHKFEDLLKIKYD 262
Query: 343 KFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLV-YQFSSL 399
++F +RLI S+PG G+ L K+G M+L++V+ + C + K V YQ +S+
Sbjct: 263 DYDFKDVNIRLITSIPGRFVGNQLFKYGMMRLQSVIYQELCNNKMEIPKQVCVTYQTTSI 322
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTPLGI--------GEPLIVWPTVEDVRCSLE-GYAAG 450
G +D ++ + +G S K I + +++PT + + G
Sbjct: 323 GQMDNNYV-DFVLQCCTGRSTKKINQMILNQQEEEQSKLKLIYPTADYIENQTHGGVDFA 381
Query: 451 NAIPSPQKNVDK-DFLKKYWAKWKAS-----HTGRSRAMPHIKTF-------------AR 491
N + Q++ + F K + K++ S HTG +PH+K +
Sbjct: 382 NPLHLKQQSYESPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGLDEDINDQTSI 438
Query: 492 YNG-QKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
Y G ++ AWG ++KN +QL I + ELGVL P
Sbjct: 439 YIGSHNFSQGAWGKMEKNATQLFISNTELGVLYPP 473
>gi|298705565|emb|CBJ28816.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 947
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 78/142 (54%), Gaps = 8/142 (5%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
P FR +R+ PA +N VS+ +++ G+ A++++Y+VD ++LL A P L +P +L
Sbjct: 178 PPLFRPVRIPSDPA-SNADGVSLGELLGGEYTEALVASYLVDAEFLLNAAPRLKTVPFLL 236
Query: 221 VIHGESDGTL-----EHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+ + D L +KR PA + P I G HHSK +LL Y GVR+++ T
Sbjct: 237 IQGIKEDKPLVVSMKAFLKREHPAAVVYL--PKTIHIGLHHSKMILLKYKTGVRVVIMTC 294
Query: 276 NLIHVDWNNKSQGLWMQDFPLK 297
N+ DW + Q W QDFP K
Sbjct: 295 NMRPDDWGGRCQAAWYQDFPFK 316
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 57/114 (50%), Gaps = 22/114 (19%)
Query: 308 FENDLIDYLSTLKWPE--FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 365
FE LIDY + P + +L A ++FSSA V LI SVPG H G
Sbjct: 423 FEEILIDYFEHVGGPAAVWGRSLSA-------------YDFSSANVTLIPSVPGRHKGRD 469
Query: 366 LKKWGHMKLRTVL--QECTFEKGFKKSPLVYQFSSLGSLDE---KWMAELSSSM 414
L ++GHM++R VL +E G + + +Q +S+ +L KW+ E++ S
Sbjct: 470 LYRYGHMRVRAVLAREEVHVRPGSHR--VAFQAASIMNLSRRPYKWLGEITESF 521
>gi|422293515|gb|EKU20815.1| tyrosyl-dna phosphodiesterase 1, partial [Nannochloropsis gaditana
CCMP526]
Length = 160
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 80/140 (57%), Gaps = 9/140 (6%)
Query: 261 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK 320
LL+Y G+R+++ T+N I VDW+NK+QG+W+QDFP + + +++ F DL +YL L
Sbjct: 3 LLLYEGGIRVMICTSNFIEVDWHNKTQGIWVQDFPKLREEDKADDSLFGRDLREYLQALN 62
Query: 321 -WPEFSANLPAHGNFKINPSF-------FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 372
+ + H K +P + +FSSA L+ASVPG HTG K+GH+
Sbjct: 63 GFENECGSRGPHSPGKGHPLLTEMIEQELPRIDFSSAQAVLLASVPGKHTGHDKFKFGHL 122
Query: 373 KLRTVLQECTFEKG-FKKSP 391
KLR +L++ G F +P
Sbjct: 123 KLRRLLEKEPMPPGLFPSTP 142
>gi|392587577|gb|EIW76911.1| phospholipase D nuclease [Coniophora puteana RWD-64-598 SS2]
Length = 667
Score = 92.0 bits (227), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 110/468 (23%), Positives = 194/468 (41%), Gaps = 71/468 (15%)
Query: 188 QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH-GESDGTLEHMKRNKPANWILHKP 246
+ +I AILS++ I W+ PH VI + D + +N NW++ P
Sbjct: 220 KSNIEFAILSSFSTSISWIYEFFD-----PHTPVIFVAQPDSSGNAALKNVLPNWLMTTP 274
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ---NNL 302
L +G H K MLL Y G +R+++ TANLI DW + +W+QD P + ++
Sbjct: 275 FLRNGYGCQHMKFMLLFYKDGRLRVVISTANLIDYDWRDIENAVWLQDVPRRPSPIPHDP 334
Query: 303 SEECGFENDLIDYLSTLKWPEFSANLPA--HGNFKIN--PSFFKKFNFSSAAVRLIASVP 358
+ F + + + L ++ AN+ A H N + ++FS V+L+ S+
Sbjct: 335 KAKDDFPSIMQNVLRSVNVRPALANMLANDHPNLPLQTIADLRTHWDFSKVKVKLVPSIA 394
Query: 359 GYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSP----LVYQFSSLGSLDEKWMAELSSS 413
G H G ++ + GH +L +++ G K+ + Q SS+G+ +W+ E S
Sbjct: 395 GKHEGWPAVVQSGHPRLMKAVRDMGLRTGKGKAAKELVVECQGSSIGTYTTQWLNEFHHS 454
Query: 414 MSSGFSED--------KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL 465
+ED +T L I++P+++ VR + G G + F
Sbjct: 455 ARGESAEDWLDAPRSRRTKLPFPPVKIIFPSLKRVRATALGERGGGTM----------FC 504
Query: 466 KKYWAKWKASHTGR----------SRAMPHIK----TFARYN-----GQKLAKAAWGALQ 506
K+ A+W+ + R R + H K TF R N G +K+A Q
Sbjct: 505 KR--AQWEGKNFPRGSFYESESRGGRTLMHTKMIIGTF-RSNPLVSVGAGTSKSAPQKKQ 561
Query: 507 KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSE--IKSGSTETSQIQKTKL---V 561
+S+ ++ I + G + + N PS SGS+ + +
Sbjct: 562 LEDSETEPEDDDVDPDIQIVNEPIGWAYVGSHNFTPSAWGTLSGSSFNPSLNNINYELGI 621
Query: 562 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 609
+ + D S ++ PP++Y S+DVPW D+ +++
Sbjct: 622 VMPLYNDEDIDRVS-------CFKHPPKKYGSDDVPWMQDESLILREI 662
>gi|150865397|ref|XP_001384596.2| hypothetical protein PICST_67678 [Scheffersomyces stipitis CBS
6054]
gi|149386653|gb|ABN66567.2| putative tyrosyl-DNA phosphodiesterase [Scheffersomyces stipitis
CBS 6054]
Length = 553
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/427 (24%), Positives = 175/427 (40%), Gaps = 102/427 (23%)
Query: 248 LPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 305
+P FGTHH+K M+ + + I++ + NL +D +Q LW L+ ++++ E
Sbjct: 165 IPNRFGTHHTKMMVNFFEDKSCEIVIMSFNLNKIDVVGLTQTLWRSGRLQLETEDSVKLE 224
Query: 306 CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
G F+ D ++YL P ++ + ++F S V L+AS PG +
Sbjct: 225 RGENFKRDFMNYLKKYNSPVVTSLADR----------LQSYDFHSIDVELLASAPGKYEI 274
Query: 364 SSLKK----WGHMKLRTVLQECTFEKGFKKSPLVYQF---------------SSLGSLDE 404
++L +G+ KL +L+ K +Y F S S+
Sbjct: 275 TNLTDKDEVYGYGKLYQILKRNNLLVDNTKGDKLYNFLSQVTSISYPFNVRGSQTASVFS 334
Query: 405 KWMAELS-SSMSSGF-----SEDKTPLGIGE----PLIVWPTVEDVRCSLEGYAAGNAIP 454
+A L S S+GF D T + P +V+PTV+++ + G+ AG A+
Sbjct: 335 HLLAPLVFSGGSNGFKILLPGSDSTSKHQKDNYYLPHMVYPTVKEIANNNVGFGAGQAVH 394
Query: 455 SPQKNVD------KDFLKKYWAKWKASH----TGRSRAMPHIKTFARYNG---------- 494
D + ++ Y KW +S TGR +PH K F NG
Sbjct: 395 MKHTKSDTHRYQYQQNIRPYLRKWNSSGSDIVTGRESVVPHCKYFMCDNGDNFSSLKWAL 454
Query: 495 ---QKLAKAAWGA---LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 548
L+K AWG+ N ++ I S+ELGV++ P + G +VP+
Sbjct: 455 VGSHNLSKQAWGSPVPKSTNPNKYEISSFELGVVVFP---KEG------EKLVPA----- 500
Query: 549 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS-WDKRYTKK 607
+G D + + L +P+ LPP +Y+++D PWS W K
Sbjct: 501 -----------------YGE-DTVNDDKAIPLRMPFSLPPTKYTAQDEPWSEWVSYGELK 542
Query: 608 DVYGQVW 614
D +GQ +
Sbjct: 543 DKFGQTY 549
>gi|301770841|ref|XP_002920838.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like, partial
[Ailuropoda melanoleuca]
Length = 172
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 76/131 (58%), Gaps = 6/131 (4%)
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK-RNKP-ANWILHKPPLPISFGTH 255
NY D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTH
Sbjct: 2 NYCFDVDWLIKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTH 61
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNLSEE--CGFEND 311
H+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ P+ + S E F+ D
Sbjct: 62 HTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPPIIHGTHRSGESTTHFKAD 121
Query: 312 LIDYLSTLKWP 322
LI YL P
Sbjct: 122 LISYLMAYNAP 132
>gi|254565439|ref|XP_002489830.1| hypothetical protein [Komagataella pastoris GS115]
gi|238029626|emb|CAY67549.1| hypothetical protein PAS_chr1-1_0480 [Komagataella pastoris GS115]
gi|328350245|emb|CCA36645.1| tyrosyl-DNA phosphodiesterase 1 [Komagataella pastoris CBS 7435]
Length = 562
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 122/548 (22%), Positives = 219/548 (39%), Gaps = 107/548 (19%)
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQDEQ--DNENGKNSEEALCNFHVSRDKLPSTFRLLR 168
++ K D + SK +Q+ EQ D + +++E+ + + S RL
Sbjct: 52 AQGSKEQQVDAQEEPQKHSKTQKQEKEQVIDLTDDQDAEDRPA---IDTTTVQSPIRLFN 108
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
N C+S++D++ + N+ +++D+ L + I+ ++
Sbjct: 109 SPAHKPQDNIDCISLKDLVSSPQLSKTYQFNFCINVDFFLKYITSDPLSTEIYFINS-AE 167
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKS 286
+E ++N+ + H F THH+K M+ + G +I+V +AN+ +D+ +
Sbjct: 168 YLVEMTQQNRMRFKLRHVDIQLERFATHHTKMMVNFFRDGTAQIVVMSANMTEMDFVGNT 227
Query: 287 QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF 346
QGLWM P+ + N E F+ND + YL + + +L A K ++F
Sbjct: 228 QGLWMS--PMLSKGN-GRESSFKNDFLAYLKA--YNKHDLDLLAEE--------LKLYDF 274
Query: 347 SSAAVRLIASVPGYHT----GSSLKK---WGHMKLRTVLQ-ECTFEKGFKKSPLVYQFSS 398
+ ++SVPG T LK+ +G+ KL +L+ F K + + ++ Q ++
Sbjct: 275 GNVKAEFLSSVPGTFTIPEEDDRLKRSVQYGYGKLFQLLKLNNLFPKATESTDILAQVAT 334
Query: 399 LGS-LDEKWMAELSSSMSSGFSEDKTPLGIG---------------EPLIVWPTVEDVRC 442
+ S D + + ++ + K P+ G P +V+PT +V
Sbjct: 335 IASPFDFRSSNIFTHLLAPLINGTKFPIAGGLEPLQKAINDDVHPFNPFLVFPTKNEVFG 394
Query: 443 S-LEGYAAG---------NAIP--SPQKNVDKDFLKKYWAKWKAS------HTGRSRAMP 484
S L+ Y +G + +P + Q N+ ++K+ +W S GRS P
Sbjct: 395 SVLKEYTSGIFYNIDDSSHKVPFLTNQHNI----IRKFMYRWTNSDPNLNQKAGRSNLAP 450
Query: 485 HIKTFARYN------------GQKLAKAAWGALQK--NNSQLMIRSYELGVLILPSAKRH 530
H+KT+ N L+K AWG K N + I SYE G+ I P K +
Sbjct: 451 HVKTYCASNDGFQTFMWYLLTSANLSKQAWGYPLKGSNGLKYKISSYEAGIFIHP--KLY 508
Query: 531 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 590
G + +L + S VV + VPY P ++
Sbjct: 509 GEDY------------------------QLKPILSRDSFPNRDKDNVVPIRVPYAFPLEK 544
Query: 591 YSSEDVPW 598
Y D PW
Sbjct: 545 YHDSDEPW 552
>gi|426193767|gb|EKV43700.1| hypothetical protein AGABI2DRAFT_121836 [Agaricus bisporus var.
bisporus H97]
Length = 635
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 113/432 (26%), Positives = 175/432 (40%), Gaps = 75/432 (17%)
Query: 148 EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL 207
+ F R TFRL +V G N S ++ AILS+Y +D W+
Sbjct: 181 QTATRFAEPRKDGQRTFRLTQVLG-----NKS----------ELAFAILSSYSLDFPWIY 225
Query: 208 PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG 267
+P ++V ++ G +K P W+ PPL FG H K MLL Y G
Sbjct: 226 EF--FDRSVPVIMVAQPDAMGQAA-IKYTFP-TWVKTTPPLRGGFGCQHMKFMLLFYKNG 281
Query: 268 -VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTLK-WPE 323
+R+++ TANLI DW + +W+QD P++ Q + F + + L + P
Sbjct: 282 NLRVVISTANLIAYDWRDMENSVWLQDLPMRPQLMPPDPKAKDFPSIMQQVLHAVNVAPA 341
Query: 324 FSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE 380
L H N + ++++S V L+AS+ G H G S+ K GH +L ++
Sbjct: 342 LRTMLSDHPNIPLRTIEDLRMRWDWSKVKVHLVASIAGKHEGWPSIVKTGHPRLMMAIRT 401
Query: 381 CTFE--KGFKKSPLVY--QFSSLGSLDEKWMAELSSSMSSGFSED--KTPLGIGEPL--- 431
+G K ++ Q SSLG+ +W+ E S +ED P E L
Sbjct: 402 MGLRPSRGLGKGNMIIECQGSSLGNFTTQWLNEFHWSARGESAEDWLDEPKRRREKLPYP 461
Query: 432 ---IVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKK--YWAKWKA----------- 474
I++PT + V+ S G G I +K K+F + Y +K KA
Sbjct: 462 PVRILFPTKKIVQESASGEPGGGTIFCRRKQWAAKNFPRDKFYVSKSKAGPVLMHSKMII 521
Query: 475 ---SHTGRSRAM------------PHIKT------FARYNGQKLAKAAWGALQKN--NSQ 511
HT + A P +K +A +AWG L + N
Sbjct: 522 ATIQHTNPASASLNREGSDTEEDEPEVKIIEPAVGWAYVGSHNFTPSAWGTLSGSAFNPI 581
Query: 512 LMIRSYELGVLI 523
L I +YE+G++
Sbjct: 582 LNITNYEIGIVF 593
>gi|390595745|gb|EIN05149.1| phospholipase D/nuclease, partial [Punctularia strigosozonata
HHB-11173 SS5]
Length = 622
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 151/369 (40%), Gaps = 57/369 (15%)
Query: 113 SQKRVSNDGATNGELSSKKMRQQDEQDNE--NGKNSEEALCNFHVSRDKLPSTFRLLRVQ 170
S++RV D A + + E + NG+ + A + +D P TFRL +
Sbjct: 131 SKRRVRVDPALSSASGPSTSSRTTEMEPMFWNGEIRQTANAHVDPRKDTKP-TFRLTEII 189
Query: 171 GLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHGES 226
G + D+ AI++ Y +D WL P+ PV V+ +
Sbjct: 190 GK---------------KSDVKFAIIAGYCIDWAWLYHFFEPSTPV--------VVVAQP 226
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNK 285
D T + NWI PPL G H K MLL Y G +R+++ TAN I DW +
Sbjct: 227 DTTGARSVKEVLPNWIRTTPPLRGGRGCMHMKFMLLFYRTGRLRVVISTANFIDYDWRDI 286
Query: 286 SQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH-GNFKIN------- 337
+W+QD PL+ +++ D+ +T + + N+ A IN
Sbjct: 287 ENTVWVQDVPLR-----QTPIRYDHKATDFPATFERVFKALNVEAALQALTINDHPDIPL 341
Query: 338 PS---FFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKG-FKKSPL 392
PS K++FS L+ASV G H G + + GH L +++ G ++ L
Sbjct: 342 PSVTDLRTKWDFSKVKAHLVASVAGKHEGWPEVIRNGHTALMKAVRDMGARAGKGREVEL 401
Query: 393 VYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVEDVRCSL 444
Q SS+G+ +WM E S +ED + L IV+P++ V+ S
Sbjct: 402 ECQGSSIGTYSTQWMNEFHYSCRGESAEDWLDQPKTRRAKLPWPPVKIVFPSLATVQASR 461
Query: 445 EGYAAGNAI 453
G G I
Sbjct: 462 LGEKGGGTI 470
>gi|449686459|ref|XP_002156800.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like, partial [Hydra
magnipapillata]
Length = 206
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 114/235 (48%), Gaps = 64/235 (27%)
Query: 248 LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG 307
LPI++GTHH RI W KS ++D +N+
Sbjct: 19 LPIAYGTHH------------RI-----------W--KSPLFAIKDVAYDGKND-----P 48
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTGSS 365
F+ DL++YLS+ +GN K+ K+++ SSA V L++SVPG +TG
Sbjct: 49 FKEDLLEYLSS------------YGNSKLGMYAEKLKEYDMSSANVHLVSSVPGRYTGFK 96
Query: 366 LKKWGHMKLRTVLQECTFEKGFKKS--PLVYQFSSLGSLDE--------KWMAELSSSMS 415
+ +WGH+KLR +L K P++ QFSS+GSL +W++ LS+
Sbjct: 97 MHQWGHLKLRKLLLSYGPSKDLVNENWPIIGQFSSIGSLGSESSSWLCGEWLSSLSTCKD 156
Query: 416 SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-----SPQKNVDKDFL 465
E K L +++PT+E+VR SLEGY+AG ++P + ++ KDFL
Sbjct: 157 DELKESKANLK-----LIYPTIENVRNSLEGYSAGCSLPYGIQVAMKQRYLKDFL 206
>gi|358056499|dbj|GAA97673.1| hypothetical protein E5Q_04351 [Mixia osmundae IAM 14324]
Length = 686
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 159/362 (43%), Gaps = 42/362 (11%)
Query: 110 LSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRV 169
L R+Q R G +++ + N ++ E A FH S L FR V
Sbjct: 228 LQRAQARAQALGLVEPAIATANIPSASTSTNVAHRHLENA---FHPS---LGIYFRKSAV 281
Query: 170 Q-GLPAWANTS--CVSIRDVI--QGDIIVAILSNYMVDIDWLLPAC--PVLAKIPHVLVI 222
+ A+ T+ +S++D+I + I ++S+Y D+DWL+ P L K +L +
Sbjct: 282 RPTFNAFHRTTEDALSLQDIIGPKDRIEKLVMSSYATDLDWLVAHVLPPELGKQ-VLLAL 340
Query: 223 HGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDW 282
G +D + N P + LH PP+ + G H K +L++Y R+ + TANL+ DW
Sbjct: 341 PGPADAPITSFVPNHP-HIKLHCPPVCRTSGAMHIKLILVVYDDFCRVAIPTANLVPYDW 399
Query: 283 NNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSAN--LPAHGNFKINPSF 340
+W+QDFP Q +L++ F L L L E S N LP +F
Sbjct: 400 QQIENAVWIQDFP--RQGSLAKPTRFAQTLHTTLRLLCIEEDSRNAVLPLDVDFS----- 452
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSL 399
+ + R+I S PG SS + GH L LQ+ + L Q SS+
Sbjct: 453 ------AGISARMILSTPG---SSSSEPNGHKLLGQALQDLHLLPARDQDVRLECQGSSI 503
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTP---LGIGEPL-----IVWPTVEDVRCSLEGYAAGN 451
G+L+++W+ E SS+ P EPL IV+PT+ ++ + G A G
Sbjct: 504 GALNDEWLLEFYSSICGRPVRTMFPKVQTANFEPLRTLFRIVFPTLRNIENTHLGTAGGG 563
Query: 452 AI 453
+
Sbjct: 564 TL 565
>gi|74834157|emb|CAI44465.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
Length = 532
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/344 (25%), Positives = 158/344 (45%), Gaps = 61/344 (17%)
Query: 230 LEHMKR-NKPANWILHKPP---LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
LE ++R N NW + KP + G H K +L +P+ +RI++ + NL DW
Sbjct: 153 LEIIERYNNYPNWTVIKPSKLSTNMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIW 212
Query: 286 SQGLWMQDFPL--KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFK 342
SQG+W+QDF + + + S+E F++ L ++L + LP F+ + +
Sbjct: 213 SQGMWIQDFKIGKSELDQTSQE--FKSMLREFLYEI--------LPTSHKFEDLLKIKYD 262
Query: 343 KFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLV-YQFSSL 399
++F +RLI S+PG G+ L K+G M+L++V+ + C + K V YQ +S+
Sbjct: 263 DYDFKDVNIRLITSIPGRFVGNQLFKYGMMRLQSVIYQELCNNKMEIPKQVCVTYQTTSI 322
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTP-----LGIGEPL------------IVWPTVEDVRC 442
G +D ++ + + + + P I + + +++PT + +
Sbjct: 323 GQMDNNYVDFVLQCCTGRVYKQQLPNEQSTKKINQMILNQQEEEQSKLKLIYPTADYIEN 382
Query: 443 SLE-GYAAGNAIPSPQKNVDK-DFLKKYWAKWKAS-----HTGRSRAMPHIKTF------ 489
G N + Q++ + F K + K++ S HTG +PH+K
Sbjct: 383 QTHGGVDFANPLHLKQQSYESPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGLD 439
Query: 490 -------ARYNG-QKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
+ Y G ++ AWG ++KN +QL I + ELGVL P
Sbjct: 440 EDINDQTSIYIGSHNFSQGAWGKMEKNATQLFISNTELGVLYPP 483
>gi|353240852|emb|CCA72701.1| hypothetical protein PIIN_06638 [Piriformospora indica DSM 11827]
Length = 636
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 134/291 (46%), Gaps = 35/291 (12%)
Query: 191 IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPI 250
+ AILS+Y DI WL + + + V++++ ++ +K P NWI+ P L
Sbjct: 200 VAFAILSSYSTDIAWLYG---MFSPMTPVILVNQPTETGNSDVKGILP-NWIMTMPFLRG 255
Query: 251 SFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC--- 306
G H K MLL Y G +R+++ TAN I DW + W+QDFP + + E
Sbjct: 256 GRGAMHVKLMLLFYRSGRLRLVLPTANFIDYDWRDIENTAWVQDFPPLSKPAVGREATSS 315
Query: 307 GFENDLIDYLSTLKW-PEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG 363
F + L L+ L P ++ L H N I K +NF+ AAV+LI S+ G + G
Sbjct: 316 AFASTLQMVLTKLNVSPALASLLTDHPNLPIKFIGDLGKGWNFTKAAVKLIPSMSGKYEG 375
Query: 364 -SSLKKWGHMKLRTVLQECTFEKGF----KKSP-----LVYQFSSLGSLDEKWMAELSSS 413
+ K GH+ L + + +G KK P + Q SS+G+ +W+ E SS
Sbjct: 376 WDQVLKQGHVSLMKGIMDIGAHRGHTKRDKKKPPEELIVECQGSSIGTYSAQWLQEFYSS 435
Query: 414 M----------SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI 453
S S K P PL I++P+++ V+ S+ G G +
Sbjct: 436 CCGISPETWLDKSKASRSKLP---KPPLRILFPSLKTVQSSVLGEDGGGTM 483
>gi|193785768|dbj|BAG51203.1| unnamed protein product [Homo sapiens]
Length = 118
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 70/145 (48%), Gaps = 43/145 (29%)
Query: 483 MPHIKTFAR------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 530
MPHIKT+ R L+KAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 1 MPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA--- 57
Query: 531 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 590
F S V + +GS E + PVPY+LPP+
Sbjct: 58 ---FGLDSFKVKQKFFAGSQE------------------------PMATFPVPYDLPPEL 90
Query: 591 YSSEDVPWSWDKRYTKK-DVYGQVW 614
Y S+D PW W+ Y K D +G +W
Sbjct: 91 YGSKDRPWIWNIPYVKAPDTHGNMW 115
>gi|340503654|gb|EGR30196.1| tyrosyl-DNA phosphodiesterase family protein, putative
[Ichthyophthirius multifiliis]
Length = 547
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 147/323 (45%), Gaps = 49/323 (15%)
Query: 240 NWILHKPPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
NW L PP S G H K L+ + +R++V + NL DW+ S LW QDFPL
Sbjct: 201 NWTLIHPPKDASVSWGGAFHPKLWLIKFNEFLRVVVGSGNLHICDWSVWSNCLWYQDFPL 260
Query: 297 KDQNNLSEECG---------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
K Q N +E F N LID ++ + N+ KI+ +++++S
Sbjct: 261 KKQQNAQKEKNQQQWDFEGDFSNTLIDIVNRM----MPDNVKYQNLLKID---LEEYDYS 313
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
+ L+++VPG H +++K G KL ++ F + K+ + Y+ S+LG++D K++
Sbjct: 314 EVKIILLSNVPGRHL--NIQKHGLGKLNAIIN--AFGQQNKQKIITYESSTLGNIDNKFL 369
Query: 408 AELSSSM---SSGF---SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP---SPQK 458
E S+ S F S++ + +++PT + + C Y A P + +
Sbjct: 370 NEFYKSVNLASCDFQKNSKENIKDIQNQFKVIFPTKKYI-CQDTLYGIEYASPVILNEKY 428
Query: 459 NVDKDFLKKYWAKWKA--SHTGRSRAMPHIKTFARYNGQ--------------KLAKAAW 502
++ F+K + +++ + S +PH+K + + AAW
Sbjct: 429 YSNEKFIKDVFYQFECPKGYFYHSGVIPHLKVMVVNDKEDQISDDSLIYVGSHNFTGAAW 488
Query: 503 GALQKNNSQLMIRSYELGVLILP 525
G +KN SQ+ + ELGV+ P
Sbjct: 489 GRYEKNYSQIYCMNTELGVVYPP 511
>gi|384490985|gb|EIE82181.1| hypothetical protein RO3G_06886 [Rhizopus delemar RA 99-880]
Length = 338
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 136/306 (44%), Gaps = 39/306 (12%)
Query: 240 NWILHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF---- 294
N I+ +PPL + +G H+K MLL +R+++ +AN++ D+ ++MQDF
Sbjct: 18 NRIIIQPPLKDNKYGVFHNKLMLLFRSSSLRVVIGSANMVACDYEELENVVFMQDFPELI 77
Query: 295 -PLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
PLK +++ E F D+ D L ++ P K++FS A R+
Sbjct: 78 VPLKSESDFPE---FAKDICDVLDKMRVPTTVKEE------------LLKYDFSKAKARI 122
Query: 354 IASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLV-YQFSSLGSLDEKWMAELS 411
+ASV G G KK+GH +L ++++ T P V Q SSLGSL ++ E+
Sbjct: 123 VASVSGVFEGEEEYKKYGHTRLADIVRDITGPLDPNNYPKVEMQTSSLGSLSVSYLQEIY 182
Query: 412 SSMS--SGFSEDKTPLGIGE-----PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 463
S S FS+ K + P+ I++PT + V S G A ++I K
Sbjct: 183 QSFCGISSFSDGKAVRSSLQKNQLPPIDIIFPTRDTVTSSRYGGAGADSICFNTATWRKP 242
Query: 464 FLKKYWAKWKASHTGRSRAMPHIKTFAR--YNGQKLAKAAWGALQKNNS----QLMIRSY 517
K SH R A+ H K + +AWG + + +L I ++
Sbjct: 243 TFPKQVMCDSISH--RQGALMHSKALTSMIFRSHNSTTSAWGKFTVSKASKLPKLSISNW 300
Query: 518 ELGVLI 523
ELGV+
Sbjct: 301 ELGVVF 306
>gi|330842084|ref|XP_003293015.1| hypothetical protein DICPUDRAFT_99531 [Dictyostelium purpureum]
gi|325076694|gb|EGC30460.1| hypothetical protein DICPUDRAFT_99531 [Dictyostelium purpureum]
Length = 564
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 106/436 (24%), Positives = 174/436 (39%), Gaps = 82/436 (18%)
Query: 145 NSEEALCNFHVSRDKLPSTFRLLRVQGLPA--WANTSCVSIRDVI-QGDIIVAILSNYMV 201
N C L +TF L ++ P + + + ++I ++ + D+ A++ + +
Sbjct: 113 NEATTFCTIIGENYYLSNTFYLNTIKNQPKNLFNSPTTLTIEHLLLEKDMKSAMVCGFCL 172
Query: 202 DIDWLLPACPVLAKIPHVLV-------IHGESDGTLEHMKRNKPANWILHKPPLPISFGT 254
+ +W+ A+ HV + I E G + K N PPL S+ T
Sbjct: 173 ESEWIYKIF-YEAQGRHVPITFIRHYFISEEKKGIQQINKSTMAIN-----PPLG-SYQT 225
Query: 255 HHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLID 314
H K +LL++P +RII+ ++N +D+++ +Q +W QDF +K + + + D
Sbjct: 226 FHGKLILLVFPEFIRIIIPSSNPTQLDYDSLNQTIWFQDFQIKK----APKQATPSKDND 281
Query: 315 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH----TGSSLKK-- 368
+L TLK+ S P+ F +++FS A+ LI SVPG++ GS + +
Sbjct: 282 FLKTLKYFLASIGCPS-------VKFLDEYDFSEASAHLIISVPGFYKHDGAGSGIIESD 334
Query: 369 ---WGHMKLRTVLQ-------ECTFEKGFKKS------PLVYQFSSLGSLDEKWMAELSS 412
G KL +VL+ E T K+ YQ SS+G +
Sbjct: 335 KPLMGIYKLESVLKKYYRNQDETTDYTVLDKNNQHCVRDFYYQASSIGGEKGNFRNNFVK 394
Query: 413 SMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKW 472
+S PL I P W D R G + KN + D K ++
Sbjct: 395 HLSPSIENSDKPLHIIYPTDQWIKSNDHRLQHAG-----CLFLSNKNYNND--KSCFSYL 447
Query: 473 KASHTGRSRAMPHIK----TFARYN---------------------GQKLAKAAWGALQK 507
+ R + H K T R N + AAWGA QK
Sbjct: 448 SPKYDYRKHLVYHSKVLVGTSTRLNKPLKDTLNQRSNIKYDWVYAGSHNFSSAAWGAFQK 507
Query: 508 NNSQLMIRSYELGVLI 523
N +Q+ I +YE+GVL
Sbjct: 508 NETQIQISNYEIGVLF 523
>gi|154311214|ref|XP_001554937.1| hypothetical protein BC1G_06725 [Botryotinia fuckeliana B05.10]
Length = 405
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 133/349 (38%), Gaps = 82/349 (23%)
Query: 343 KFNFSSAAVRLIASVPGYHTGS---SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
K++FS LIASVPG S WG L L+ + +V Q SS+
Sbjct: 60 KYDFSEIKAALIASVPGKQDTELSPSQTGWGWAGLTNALKSVPSHHNTQPE-IVIQVSSI 118
Query: 400 GSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPS- 455
SL +KW+ ++S E K+P G I++PT ++VR S+ GYA+GNAI +
Sbjct: 119 ASLGPTDKWLTHFFKALS----ESKSPRKTGSKFKIIFPTADEVRRSINGYASGNAIHTK 174
Query: 456 ---PQKNVDKDFLKKYWAKW------------------------------KASHTGRSRA 482
P + +LK W K R RA
Sbjct: 175 ILTPAQGKQLAYLKPMLCHWAGDGAQHSSSSSLSSNTPSKSSQSFTSPELKTQEAYRRRA 234
Query: 483 MPHIKTFARY-------------------NGQKLAKAAWGALQKNNSQLMIRSYELGVLI 523
PHIKT+ R+ L+K AWG + ++ I SYE+GVL+
Sbjct: 235 APHIKTYIRFSSDSTSSSSSQKSIDWMLVTSANLSKQAWGESINSADKVRICSYEIGVLV 294
Query: 524 LP---SAKRHGCGFS---CTSNIVPS--------EIKSGSTETSQIQKTKLVTLTWHGSS 569
P K++G C N PS EI + ++ L
Sbjct: 295 WPDLWEEKQNGKNVKMVPCFGNDTPSIPFVSPSLEIVGQKEIRVEGEEGHLKRKRCDDRE 354
Query: 570 DAGASSE----VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
D E +V +PY+LP Y +D+PW Y++ D G+ W
Sbjct: 355 DEKRQEESHTIIVGARMPYDLPLVSYGKDDIPWCASASYSEPDWMGKTW 403
>gi|145527276|ref|XP_001449438.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|74834160|emb|CAI44466.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
gi|124417026|emb|CAK82041.1| unnamed protein product [Paramecium tetraurelia]
Length = 532
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 150/345 (43%), Gaps = 72/345 (20%)
Query: 234 KRNKPANWILHKPPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW 290
K N NW++ KP S G H K +L +P+ +RI++ + NL DW SQ +W
Sbjct: 158 KYNNYPNWMVIKPSKLGSCMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIWSQAMW 217
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSA 349
+QDF + F+ L ++L + LP F+ + + ++F
Sbjct: 218 IQDFKIGKSELDQGSQEFKTMLREFLYEI--------LPTSHKFEDLLKIKYDDYDFKDV 269
Query: 350 AVRLIASVPGYHTGSSLKKWGHMKLRTVL--QECTFEKGFKKSPLV-YQFSSLGSLDEKW 406
++LI S+PG G+ L K+G M+L++VL + C + K V YQ +S+G LD+ +
Sbjct: 270 NIKLITSIPGRFVGNQLFKYGMMRLQSVLYYELCNNKMEIPKQVCVTYQTTSIGQLDDNY 329
Query: 407 M----------------------AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSL 444
+ +L+ + + E+++ L +++PT + +
Sbjct: 330 IDFALQCCTGKVYKQPLASEQNNKKLNQMILNQQEEEQSKLK-----LIYPTADYIENQT 384
Query: 445 EGYAAGNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIKTF----- 489
G G +P Q + F K + K++ S HTG +PH+K
Sbjct: 385 HG---GVDFANPLYLKKQLYENPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGL 438
Query: 490 --------ARYNG-QKLAKAAWGALQKNNSQLMIRSYELGVLILP 525
+ Y G ++ AWG ++KN +QL I + ELGVL P
Sbjct: 439 DEEINDYTSIYIGSHNFSQGAWGKMEKNATQLYIANTELGVLYPP 483
>gi|336379126|gb|EGO20282.1| hypothetical protein SERLADRAFT_452973 [Serpula lacrymans var.
lacrymans S7.9]
Length = 620
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 134/316 (42%), Gaps = 48/316 (15%)
Query: 163 TFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPH 218
TFRL V G + +I AILS+Y + + W+ P+ PV
Sbjct: 169 TFRLTEVLGK---------------KSEISFAILSSYSLSVSWIYEFFDPSVPV------ 207
Query: 219 VLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANL 277
+I + D + + +N NWI P L G H K MLL Y G +R+++ TANL
Sbjct: 208 --IIVAQPDESGQATIKNVLPNWIRTTPFLRYGRGCMHMKFMLLFYKTGRLRVVISTANL 265
Query: 278 IHVDWNNKSQGLWMQDFPLKDQ---NNLSEECGFENDLIDYLSTLK-WPEFSANLPA-HG 332
I D+ + +W+QD PL+ Q N+ F + L L P + +L H
Sbjct: 266 IDYDYRDIENAIWLQDVPLRPQPLPNDPKAVDNFATVMQRVLHALNVRPALATHLKTDHP 325
Query: 333 NFKINP--SFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKK 389
N + +++S V+L+ S+ G H G + GH +L +++ G K
Sbjct: 326 NLPLQSIDHLRSHWDWSKVKVKLVPSIAGKHEGWPKVILTGHTRLMKAIRDMGLRTGKGK 385
Query: 390 SP----LVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTV 437
+ + Q SS+G+ +WM E S +ED + L IV+P++
Sbjct: 386 AAKDLVIECQGSSIGTYSTQWMNEFHWSARGESAEDWLDEPKTRRAKLPYPAVKIVFPSL 445
Query: 438 EDVRCSLEGYAAGNAI 453
+ V+ S+ G G +
Sbjct: 446 KTVQTSVLGEPGGGTM 461
>gi|336366433|gb|EGN94780.1| hypothetical protein SERLA73DRAFT_171190 [Serpula lacrymans var.
lacrymans S7.3]
Length = 607
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 134/316 (42%), Gaps = 48/316 (15%)
Query: 163 TFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPH 218
TFRL V G + +I AILS+Y + + W+ P+ PV
Sbjct: 156 TFRLTEVLGK---------------KSEISFAILSSYSLSVSWIYEFFDPSVPV------ 194
Query: 219 VLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANL 277
+I + D + + +N NWI P L G H K MLL Y G +R+++ TANL
Sbjct: 195 --IIVAQPDESGQATIKNVLPNWIRTTPFLRYGRGCMHMKFMLLFYKTGRLRVVISTANL 252
Query: 278 IHVDWNNKSQGLWMQDFPLKDQ---NNLSEECGFENDLIDYLSTLK-WPEFSANLPA-HG 332
I D+ + +W+QD PL+ Q N+ F + L L P + +L H
Sbjct: 253 IDYDYRDIENAIWLQDVPLRPQPLPNDPKAVDNFATVMQRVLHALNVRPALATHLKTDHP 312
Query: 333 NFKINP--SFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKK 389
N + +++S V+L+ S+ G H G + GH +L +++ G K
Sbjct: 313 NLPLQSIDHLRSHWDWSKVKVKLVPSIAGKHEGWPKVILTGHTRLMKAIRDMGLRTGKGK 372
Query: 390 SP----LVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTV 437
+ + Q SS+G+ +WM E S +ED + L IV+P++
Sbjct: 373 AAKDLVIECQGSSIGTYSTQWMNEFHWSARGESAEDWLDEPKTRRAKLPYPAVKIVFPSL 432
Query: 438 EDVRCSLEGYAAGNAI 453
+ V+ S+ G G +
Sbjct: 433 KTVQTSVLGEPGGGTM 448
>gi|393219182|gb|EJD04669.1| phospholipase D/nuclease [Fomitiporia mediterranea MF3/22]
Length = 583
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 159/362 (43%), Gaps = 59/362 (16%)
Query: 120 DGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDK-LPSTFRLLRVQGLPAWANT 178
DG+T+ L + ++ + +G+ + + N V RDK + TFRL + G
Sbjct: 81 DGSTSAGLKVSRGKENESDLFWDGELRQ--VANRLVDRDKDVWPTFRLSEIIG------- 131
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHGESDGTLEHMK 234
+ DI +AILS+Y +DWL P P+ VLV DG +K
Sbjct: 132 --------PKSDITLAILSSYSNAVDWLYDFFEPTTPI------VLVNQPGEDGN-SGLK 176
Query: 235 RNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD 293
P N ++ KP + G H K +LL Y G +RI + TAN + DW + W+QD
Sbjct: 177 ELAP-NILMTKPFIRNGRGCMHIKILLLFYKDGRLRICLPTANFVEYDWRDIENTAWVQD 235
Query: 294 FPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPA------HGNFKINP-----SFFK 342
P++ + D+ TL+ N+PA GNF P
Sbjct: 236 VPMRKTT-----IRHDPKAADFPGTLQRVLHKLNVPAALTKLLDGNFPELPIEALSELRM 290
Query: 343 KFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSL 399
++++S V+L+AS+ G + G +++ GH L +QE T KG K+ L Q SS+
Sbjct: 291 RWDWSKVKVKLVASLAGKYEGWDEVERTGHPALAKAIQELGVTPPKG-KELVLECQGSSI 349
Query: 400 GSLDEKWMAELSSSMSSGFSE------DKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGN 451
G+ +WM E+ S ++ + + PL I++P++ V+ S+ G G
Sbjct: 350 GTYSRQWMDEIYCSAKGQSAKAWLNKPRSQRMKLAWPLIKILFPSLATVKDSVLGMPGGG 409
Query: 452 AI 453
+
Sbjct: 410 TM 411
>gi|451998304|gb|EMD90769.1| hypothetical protein COCHEDRAFT_1179942 [Cochliobolus
heterostrophus C5]
Length = 567
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/414 (23%), Positives = 173/414 (41%), Gaps = 52/414 (12%)
Query: 175 WANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLE- 231
+ T ++I +V++ D + A++S++M D +WL PV K V +++ + +
Sbjct: 148 YPRTDDITIDEVLEADTVRTAVISSFMWDSEWLFKKLNPV--KTKQVWIMNAKGKDVQQR 205
Query: 232 ---HMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS-- 286
M+ N +H PP+ + HSK MLL P +RI++ TAN+I DW +
Sbjct: 206 WQKEMEDMGVPNLKIHFPPMDGMIQSMHSKFMLLFGPNKLRIVIPTANMIQTDWGEVAND 265
Query: 287 -------QGLWMQDFPLKDQNNLSEE---CGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
+++ D P + S + F +L+ +L K PE
Sbjct: 266 WQPGVMENSIFLIDLPRRGNETTSTQENMTRFGQELMYFLEMQKVPEMVLQ--------- 316
Query: 337 NPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQ 395
F+FS + + + S+ G H S G L +Q+ + ++ L Y
Sbjct: 317 ---GILNFDFSQTSHLAFVHSIGGSHKTESEHPTGLPGLARAIQDLRLDN-VEQIELDYA 372
Query: 396 FSSLGSLDEKWMAELS-SSMSSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNA 452
SSLG++++ +++ L ++ F+ D + I +PT E V S+ G G
Sbjct: 373 ASSLGAINDSFLSRLYLAACGKCFAADTATVSDVRRHIRIYFPTNETVEKSIGGPDCGGI 432
Query: 453 IPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ----------KLAKAA 501
I Q+ + D F ++ +++S G + R +G+ L+++A
Sbjct: 433 ISLSQQRYNADTFPRECLRDYESSRAGMLSHNKLLLARGRKDGRPVGWVYVGSANLSESA 492
Query: 502 WGALQ--KNN--SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 551
WG + KN L IR++E GV++ R G VP I G+ E
Sbjct: 493 WGGQKVIKNGKMGSLNIRNWECGVVMTVPEDRLGGRDKDRDKTVPMSIFEGTVE 546
>gi|403372152|gb|EJY85968.1| Tyrosyl-DNA phosphodiesterase [Oxytricha trifallax]
Length = 676
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/420 (25%), Positives = 167/420 (39%), Gaps = 94/420 (22%)
Query: 191 IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG----TLEHMKRNKPANWILHKP 246
I AILS + DI+ + KIP + + + D L K N N++ +
Sbjct: 264 IQRAILSTMVFDIELITQLLD--EKIPMTIFLDRDKDDKGPQVLYEEKLN--LNFVFQQK 319
Query: 247 PLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF---PLKDQNNLS 303
S+ HSK +L + +R+IV +ANL DW S W QDF L N +S
Sbjct: 320 WGGNSYSVFHSKLILYEFDDRLRVIVTSANLYTQDWELLSNVTWFQDFFKAELGKNNEIS 379
Query: 304 EEC---------------------------------GFENDLIDYLSTLKWPEFSANLPA 330
+ F+ L DYL + +P
Sbjct: 380 QSSTTQSVKVATKEERKNPFNFNEQRPQQQQQPFQNDFKQYLKDYLEVI--------IPK 431
Query: 331 HGNFKINPSF-----FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEK 385
N K+ F KF+FS+A LIAS+ G H KK+G +L +++ +K
Sbjct: 432 --NVKVREVFRQKIDLDKFDFSTANAFLIASINGRHADREFKKYGQARLGELVRNV--DK 487
Query: 386 GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL-----IVWPTVEDV 440
+K+ + YQ SS+G L+ K+M +SM + F + K + E + +++PT+ V
Sbjct: 488 QHEKT-ITYQTSSIGKLNTKFM----TSMYNQFGKSKK---VSEDIHQNFRVIFPTIGYV 539
Query: 441 RCSLEGYAAGNAI----------PS-PQKNVDKDFLKKYWAKWKASHT--------GRSR 481
S G ++I P P+K+ + K HT G+
Sbjct: 540 STSHLGPENASSIILQESYWYDTPGFPRKSFYRQVGKSKLLDKNLYHTKFMIITDKGKES 599
Query: 482 AMPHIKTFARYNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIV 541
+ T + + AWG L+KN+SQ+ I ++ELGV+ P +N+V
Sbjct: 600 EITD-DTVLYFGSHNFSGGAWGNLEKNDSQISISNWELGVVFGPQVGSQEMKQKMINNMV 658
>gi|403418586|emb|CCM05286.1| predicted protein [Fibroporia radiculosa]
Length = 1675
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 130/308 (42%), Gaps = 36/308 (11%)
Query: 168 RVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIH 223
R LP + T ++ RD DI AI+S Y+ + WL P PV+A +
Sbjct: 1234 RKDTLPTFRLTDILAPRD----DIAFAIVSAYVYNYSWLYSLFSPNTPVIA-------VA 1282
Query: 224 GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDW 282
+ +G E +K P NWI P L G H K MLL Y G +RI++ TAN+I DW
Sbjct: 1283 QDPEGQ-ETIKTILP-NWIKTTPFLRNGMGCMHMKFMLLFYKSGRLRIMISTANMIEYDW 1340
Query: 283 NNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN-------FK 335
+ W+QD PL+ +S + E+ + L+ + L +H +
Sbjct: 1341 RDIENTAWVQDVPLRSA-PISHDPKAEDFAAAMVRVLRAISVAPALVSHLRNDHPDLPLQ 1399
Query: 336 INPSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLVY 394
F K++FS V L+ S+ G H G + GH L L+ K ++
Sbjct: 1400 RLEEFRMKWDFSKVKVSLVPSIAGKHEGWPKVILAGHTALMKALRNLNAAADKDKEVILE 1459
Query: 395 -QFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVEDVRCSLE 445
Q SS+G+ +WM E S ++ + L I++PT + VR S
Sbjct: 1460 CQGSSIGNYSTQWMNEFHCSARGESAQSWLDVSKARRAKLSFPPVKILFPTSQYVRDSAL 1519
Query: 446 GYAAGNAI 453
G A G +
Sbjct: 1520 GEAGGGTM 1527
>gi|118399033|ref|XP_001031843.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila]
gi|89286177|gb|EAR84180.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila SB210]
Length = 562
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 107/468 (22%), Positives = 189/468 (40%), Gaps = 83/468 (17%)
Query: 131 KMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGL----PAWANTSCVSIRDV 186
K RQ ++Q+N+ + N V L + + + L P + +
Sbjct: 81 KFRQNEQQENQPKNKLTDFYMNQLVHHKNLKTNKHFINFRALFYEDPFYKEKNLCP---- 136
Query: 187 IQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRN------KPAN 240
+ +I A L+ +D + +LP +V V+ + + KRN N
Sbjct: 137 -KKTLISAFLTTKGLDEELVLPLVKA-----NVKVVIADDKIKQWNEKRNVIKNHQYFEN 190
Query: 241 WILHKPP---LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
+ + PP L ++G HSK +L +P+ +RI++ T NL + W N S +W +DF L
Sbjct: 191 FTIVYPPKDYLSKTWGCFHSKLWILKFPKFLRIVIGTGNLRILHWTNWSNIIWFKDFELI 250
Query: 298 DQN-NLSEECGFENDLIDYLST-LKWPEFSANLPAHGNFKINPSF--------------- 340
Q +S+ + N I S +K N + +N F
Sbjct: 251 PQQIQVSQSLDYFNSNISIGSKGVKVVNLEKNYRNINDVDMNEDFIDVLNEFIDKICPYF 310
Query: 341 ---------FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP 391
+ + L++S+PG +GS + +G M++R + Q K
Sbjct: 311 DVKEMLDINLRNYQIKGINFMLVSSLPGKFSGSQIHDYGKMRIRKICQVFNPRNIDSKKV 370
Query: 392 LVYQFSSLGSLDEKWMAE-LSSSMSSGFS-----EDKT----PLGIGEPLIVWPTVEDVR 441
L Q +SLG++D ++ E L + F +DK P E +++P+ + ++
Sbjct: 371 LYSQSTSLGTIDRTFVNEFLFCFLPYQFCSEIELKDKVKKNDPEKNDEIRLIFPSKDYIQ 430
Query: 442 C-SLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKA--------SHTGRSRAMPHIKTF-- 489
+L+G + + K K+ FLK + +++ S + +PH KT
Sbjct: 431 NKTLDGAGYSDTLFLTSKRYQKESFLKNIFYQFQCKQMDSLGESQDKQKGIIPHFKTMIV 490
Query: 490 ARYNGQ------------KLAKAAWGALQKNNSQLMIRSYELGVLILP 525
NG+ ++AAWG L K+N+QL I + ELG+LI P
Sbjct: 491 CEQNGEINDDSIIYIGSHNFSEAAWGKLNKDNTQLYISNTELGILIPP 538
>gi|449544019|gb|EMD34993.1| hypothetical protein CERSUDRAFT_54191, partial [Ceriporiopsis
subvermispora B]
Length = 621
Score = 82.0 bits (201), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 132/297 (44%), Gaps = 33/297 (11%)
Query: 173 PAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEH 232
P + T ++ RD ++ AILS Y ++ W+ P ++V H + G+ E
Sbjct: 176 PTFRLTEILAPRDEVE----CAILSAYCINWPWIYSF--FNRDTPVIMVAH-DQQGSNET 228
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWM 291
+K P NWI P L G H K MLL Y G +R++V TAN I DW + W+
Sbjct: 229 IKEVLP-NWIKTTPFLRNGMGCMHIKFMLLFYKSGRLRVVVTTANFIEHDWRDIENTAWV 287
Query: 292 QDFPLKDQ--NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFS 347
QD P + N + F I L TL N+ H N I K++FS
Sbjct: 288 QDIPKRPTPIPNDPKADDFPAAWIRVLRTL-------NI-QHPNLPIQRLEDLRMKWDFS 339
Query: 348 SAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDE 404
AV+L+ S+ G H G ++ K GH L +++ + KG K+ L Q SS+G+
Sbjct: 340 KVAVKLVPSLAGKHEGWPNVIKTGHTGLMKAVRDMGAQVPKG-KQMVLECQGSSIGTYST 398
Query: 405 KWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI 453
+WM E S ++ ++ L +++P++ VR S+ G G +
Sbjct: 399 QWMNEFHCSARGESAQSWLDVSRARRSKLPWPAVKLIFPSLRTVRESVLGEPGGGTM 455
>gi|118380757|ref|XP_001023542.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila]
gi|89305309|gb|EAS03297.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila SB210]
Length = 584
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 169/403 (41%), Gaps = 75/403 (18%)
Query: 190 DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNKPA--NWILHK 245
D+ ++ Y + + L+P +L H ++ + + D +++ + + NW L
Sbjct: 166 DVQSIFMTTYGYETELLMP---ILKSNKHFVLANDKPMHDKSIKDVIKENDGFKNWTLIH 222
Query: 246 PPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK----- 297
PP +S G H K L+ + +R+++ + NL DW+ S LW QDFPL
Sbjct: 223 PPKDVSSSWGGAFHPKLWLIKFSSFLRVVIGSGNLHVSDWSVWSNCLWYQDFPLNANKKE 282
Query: 298 --DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
Q S + FE D L+ L + + KIN +++S + LI+
Sbjct: 283 KTQQKPSSPKWDFEGDFKITLTELVKKMMPSGINYQDLLKIN---LDDYDYSEVKIILIS 339
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQECT-FEKGFKKSP----------LVYQFSSLGSLDE 404
S+ G HT + K+G K+ ++Q T EK P + YQ +SLG++D
Sbjct: 340 SIVGRHT--DIYKYGRGKMYKIIQAFTQNEKNITNQPNNNLTQNQKIITYQCTSLGNIDN 397
Query: 405 KWMAELSSSMSSG-----FSEDKT-----PLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI 453
++ E + ++ +DK P I + +++PT E + E G
Sbjct: 398 TFLNEFYTCATANKPITELKKDKANKKQDPNLIEQKFRLIFPTAEYI---YEDTIYGPEY 454
Query: 454 PSP----QKNVDKD-FLKKYWAKWKA-----SHTGRSRAMPHIKTFARYNG--------- 494
SP QK +K+ F K + ++ + HTG A+PH+KT +
Sbjct: 455 ASPVILNQKYYEKESFPKSIFHQFCSPDNYFYHTG---AIPHLKTMVVTDNDLQIKDDSI 511
Query: 495 -----QKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 532
AAWG +K+ SQ+ + ELG+ I P + C
Sbjct: 512 VYIGSHNFTAAAWGRFEKDYSQIYNSNTELGI-IYPPMEDSAC 553
>gi|324542673|gb|ADY49650.1| Tyrosyl-DNA phosphodiesterase 1, partial [Ascaris suum]
Length = 133
Score = 80.5 bits (197), Expect = 3e-12, Method: Composition-based stats.
Identities = 53/180 (29%), Positives = 77/180 (42%), Gaps = 63/180 (35%)
Query: 449 AGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK----------- 496
AG A+P + + +L + KW+ GR+RAMPHIK+++ ++ +
Sbjct: 2 AGGALPYQRNTAARQPYLLERMHKWRCERFGRTRAMPHIKSYSAFSDGRCLPSWLLITSA 61
Query: 497 -LAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI 555
L+KAAWG LQK SQL IRSYELGVL+ T+ +
Sbjct: 62 NLSKAAWGELQKKESQLAIRSYELGVLL--------------------------TDEDSL 95
Query: 556 QKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 615
Q +PY++P ++ D PW D YTK D++G WP
Sbjct: 96 QL------------------------LPYDMPLTKFEPGDQPWVCDDTYTKPDIHGATWP 131
>gi|393244923|gb|EJD52434.1| phospholipase D/nuclease [Auricularia delicata TFB-10046 SS5]
Length = 628
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 133/314 (42%), Gaps = 49/314 (15%)
Query: 170 QGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-DG 228
Q PA+ + + +D +Q + +LS+Y DI WLL P +P +LV H + DG
Sbjct: 183 QNGPAFRLSQIIGNKDELQ----LVVLSSYSNDIPWLLTMFP--DTVPVILVNHPVTPDG 236
Query: 229 T-LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKS 286
L ++ N++L P + G H K MLL Y G +R+ + TAN I DW +
Sbjct: 237 NDLTYLS----TNFVLVTPSMQQDSGAMHIKLMLLFYKSGRLRVAIPTANFIQYDWRDIE 292
Query: 287 QGLWMQDFPLKDQ----NNLSEECGFENDLIDYLSTLKWPE---------FSANLPAHGN 333
+W+QD P +D L +E F L+D L L F+ L A
Sbjct: 293 NAVWLQDIPKRDAPTPFAKLPKELDFAAQLVDTLRALNVGRAVESQMQNGFAPPLRALDE 352
Query: 334 FKINPSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEK-GFKKSP 391
++ +++S RL+ S+ G H G + + GH L L++ + G K
Sbjct: 353 LRM------WWDWSKVTARLVPSLKGSHEGWPRVTRVGHTSLLKALRDLGADTPGSCKLL 406
Query: 392 LVYQFSSLGSLDEKWMAELSSSMSSGFSE-----------DKTPLGIGEPL-IVWPTVED 439
L Q SS+G +W + S SE D P P+ I++P++
Sbjct: 407 LECQGSSIGQYTRRWTHQFYRSARGEPSEKFSWIAKQSAFDNLPY---PPIKIIFPSLRT 463
Query: 440 VRCSLEGYAAGNAI 453
V S+ G G +
Sbjct: 464 VEESVLGKPGGGTM 477
>gi|392563164|gb|EIW56343.1| phospholipase D/nuclease [Trametes versicolor FP-101664 SS1]
Length = 641
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 119/440 (27%), Positives = 182/440 (41%), Gaps = 57/440 (12%)
Query: 190 DIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHG-ESDGTLEHMKRNKPANWILH 244
DI AI+S + W+ P PV+A V H + T++ + NWI
Sbjct: 216 DIEFAIVSAFCWSYQWMYQLFSPNTPVIA------VDHDPRGNATIKAIL----PNWIRT 265
Query: 245 KPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ--NN 301
P L FG H K MLL+Y G +R++V TANL+ DW + +W+QD P +
Sbjct: 266 TPFLRNGFGCMHMKFMLLLYRDGRLRVVVSTANLVEYDWRDIENSVWVQDIPKRPSPVTQ 325
Query: 302 LSEECGFENDLIDYLSTLKWPEFSANL--PAHGNFKIN--PSFFKKFNFSSAAVRLIASV 357
++ F + ++ L L N+ H N + ++FS L+ SV
Sbjct: 326 PADTEDFASAMVRVLHALNVAPALINMLRNDHPNLPLQRLEDLRSHWDFSRVKAALVPSV 385
Query: 358 PGYHTG-SSLKKWGHMKLRTVL--QECTFEKGFKKSPLVYQFSSLGSLDEKWMAE--LSS 412
G H G + GH +L L E T K K+ L Q SS+G+ W+ E LS+
Sbjct: 386 AGKHEGWPKVILTGHTRLMKALLDMEATVPKD-KELALECQGSSIGNYSSMWVNEFFLSA 444
Query: 413 SMSSGFSEDKTP----LGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVD-KDFL 465
S S +TP + P I++PT + VR S+ G + G + +K + +F
Sbjct: 445 RGESTQSWLETPKTRRAKVPYPAVKILFPTAQYVRESVLGESGGGTMFCRRKQWEGANFP 504
Query: 466 KKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAWGALQKNNSQLMIRSYELGV-LIL 524
++ + + + + R R + H K +K G L + RS E+
Sbjct: 505 RQLFHQ---TRSKRGRVLMHSKMILGTFKEKT-----GTLDGHQRASATRSSEVDTDEDA 556
Query: 525 PSAKRHGCGFSCTSNIVPSE----IKSGSTETSQIQKTKL-VTLTWHGSSDAGASSEVVY 579
SAK G + + N PS SG + I +L V + H + E V
Sbjct: 557 GSAKLAGWVYVGSHNFTPSAWGTLSGSGFNPSLNINNYELGVVIPLH-------TQEEVD 609
Query: 580 LPVPYELPPQRY-SSEDVPW 598
+E PPQ+Y S D PW
Sbjct: 610 KVACWERPPQKYVSGRDEPW 629
>gi|328769090|gb|EGF79135.1| hypothetical protein BATDEDRAFT_90149 [Batrachochytrium
dendrobatidis JAM81]
Length = 554
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/300 (24%), Positives = 132/300 (44%), Gaps = 42/300 (14%)
Query: 194 AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFG 253
A LS++ +D DWL P KI +++ + W+ P + +G
Sbjct: 117 ACLSSFSIDDDWLCDVFPSTIKICLARPKPKMVPESVDKLPVTNNILWVF--PKMSAGYG 174
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD----QNNLSEECGFE 309
H K LL YP+ +R+++ +ANL+ DW ++ QDFP+ + Q+ SE
Sbjct: 175 AMHIKFQLLWYPKFLRVVITSANLMPHDWQELENVVFYQDFPILNSRVRQSQHSETASSS 234
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL--K 367
+ ++ TL S N+P + +K +FS A L+ S+PG H +S+ +
Sbjct: 235 TN--EFSKTLYNLLVSMNIPQSVIASV-----QKHDFSKALGMLVVSLPGKHDATSMETR 287
Query: 368 KWGHMKLRTVLQECT--FEKGFKKSPLVYQFSSLGSLDEKWMAELSS------------S 413
++G M L T Q + F +++ + Q +S+GS W+ + S S
Sbjct: 288 QFGSMGLCTASQVISRQFRFDLEQAIVCMQTASMGSTHPAWLRYMLSAFRGQDVIPETPS 347
Query: 414 MSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKW 472
++S F++ + + EP+ I++P+ V S G G I F K+W+ +
Sbjct: 348 LASFFTQSMSSI---EPITILFPSRRTVETSRNGIPGGGTI---------FFSSKFWSTF 395
>gi|406860446|gb|EKD13504.1| tyrosyl-DNA phosphodiesterase domain protein [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 669
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/369 (23%), Positives = 157/369 (42%), Gaps = 47/369 (12%)
Query: 169 VQGLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
V G+P + + I +V+Q D+ +A+LS + ++ +W+ K+ + V+ ++D
Sbjct: 198 VNGMPRHGDD--IKIEEVLQKNDLELAVLSAFQIEPEWVESKLNQRTKV--IWVLQAKTD 253
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK-- 285
+++ PAN+ P + + HSK LL +P +R++V +ANL DW
Sbjct: 254 AERQNISSKAPANYRFCFPNMEGNINCMHSKLQLLAHPTHLRVVVPSANLTSYDWGETGI 313
Query: 286 -SQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
++ D P + F N+L+ ++ + + +A + + F
Sbjct: 314 MENICFLIDLPRLPPGEKTVVTNFANELVYFVEQMGLDQKTA------------TSLQNF 361
Query: 345 NFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD 403
+FS +A + + S+ G H+GS+ K+ G+ L T +++ + + + +S+GSL+
Sbjct: 362 DFSRTAHLAFVHSIGGSHSGSTWKRTGYCGLGTAIKKLGMATEVDLN-IEFLSASIGSLN 420
Query: 404 EKWMA--ELSSSMSSGFSE-----DKTPLGIGEPL--------------IVWPTVEDVRC 442
+ +M L++ G +E +K G I +PT E V
Sbjct: 421 DSFMECLYLAAQGDDGATEYRWRTEKPTKSKGRSAAEHKLLGNVNSNCRIYFPTKETVEA 480
Query: 443 SLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK-LAKA 500
S G G I K D D F +K K+ G M + FAR QK K
Sbjct: 481 SRGGVTGGGTICLQSKWFDSDTFPRKLMRDCKSVRKG--ILMHNKMIFARARDQKQYPKI 538
Query: 501 AWGALQKNN 509
AW + +N
Sbjct: 539 AWAYVGSHN 547
>gi|260788030|ref|XP_002589054.1| hypothetical protein BRAFLDRAFT_87527 [Branchiostoma floridae]
gi|229274227|gb|EEN45065.1| hypothetical protein BRAFLDRAFT_87527 [Branchiostoma floridae]
Length = 130
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 13/90 (14%)
Query: 449 AGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR------------YNGQ 495
AG ++P K +L K+ +W +S GR+RA PHIKT+ R
Sbjct: 8 AGGSLPYSINTARKQPYLNKFLHQWSSSARGRTRASPHIKTYTRTSPDCSRLAWFLVTSA 67
Query: 496 KLAKAAWGALQKNNSQLMIRSYELGVLILP 525
L+KAAWGAL+KN +QLMIRSYE+GVL LP
Sbjct: 68 NLSKAAWGALEKNGAQLMIRSYEIGVLFLP 97
>gi|270017231|gb|EFA13677.1| hypothetical protein TcasGA2_TC001393 [Tribolium castaneum]
Length = 416
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 133/302 (44%), Gaps = 35/302 (11%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPV--LAKIPHVLVIHGESDGTLEHMKRNKPANWILHKP 246
G++ ++ N+M+DI WL+ L K P ++ E E +++ P N H
Sbjct: 120 GELKCSLQINFMIDIMWLMERYRERNLGKKPLTILYGDEFPKMKEFIEKFLP-NVSHHYV 178
Query: 247 PLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNN 301
+ FG HHSK + Y +R+++ TANL + DWN+ +QGLW+ P
Sbjct: 179 KMKDPFGCHHSKIGIYFYEDNSLRVVISTANLYYEDWNHYNQGLWLSPPCPQLPETATEK 238
Query: 302 LSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
E GF++ L++YL NLP K + K+ +FS+ V L+ SVPG
Sbjct: 239 SGESPTGFKSSLLNYLK-------HYNLPV---LKPWIDYVKRADFSAVRVFLVTSVPGK 288
Query: 361 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---------LVYQFSSLGSLDEKWMAELS 411
H + H + + C+ K P ++ Q SS+GS+ + L
Sbjct: 289 HYPGTQGSHVHHVGDLLSRHCSLPA--KTGPDSEGPLSWGIIAQASSIGSMGKSPAEWLR 346
Query: 412 SSMSSGFSEDKTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLK 466
S++ S K + I++P+V++V G +G +P S Q N + +L+
Sbjct: 347 STLLRSLSGHKQTQLVSNSNATLSIIFPSVDNVMNGYFGAESGGCLPYSKQTNEKQRWLQ 406
Query: 467 KY 468
Y
Sbjct: 407 SY 408
>gi|169620876|ref|XP_001803849.1| hypothetical protein SNOG_13643 [Phaeosphaeria nodorum SN15]
gi|160704126|gb|EAT79090.2| hypothetical protein SNOG_13643 [Phaeosphaeria nodorum SN15]
Length = 384
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 82/338 (24%), Positives = 139/338 (41%), Gaps = 72/338 (21%)
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLK-----KWGHMKLRTVLQECTFEKGFKKSP---L 392
+ ++FSS I SVP + K +G + L +L KK+ +
Sbjct: 58 LRDYDFSSIKAAFIGSVPSRQKPIATKPAQQTSFGWLGLEEILSNVPITANAKKASAPHI 117
Query: 393 VYQFSSLGSLDEK--WMAELSSSM---SSGFSEDKTPLGIGEPL---------------- 431
V Q SS+ +L W+ + S + ++G E+ +P
Sbjct: 118 VMQVSSIATLGAAPTWLNKFQSVLCRSAAGQLEEAPAASSSKPPKLFSKGGMSSAKQDKP 177
Query: 432 ------IVWPTVEDVRCSLEGYAAGNAIP----SPQKNVDKDFLKKYWAKWKAS------ 475
I++PT ++VR SL+GY +G++I S Q+ ++L + WKA+
Sbjct: 178 LSPKFNIIFPTSDEVRTSLDGYDSGSSIHMKLLSIQQQKQLEYLHPLFCHWKATPDSNSK 237
Query: 476 -HTGRSRAMPHIKTFARYNGQK-------------LAKAAWGALQKNNSQLMIRSYELGV 521
R A PHIKT+ RY+ +K L+K AWG + + I+S+E GV
Sbjct: 238 GQAMRGPAAPHIKTYIRYSDEKHKTIDWAMVTSANLSKQAWGDVVNKKDETWIQSWEAGV 297
Query: 522 LILPSAKRHGCGFSCTSNIVPSEIKS--GSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 579
++ P S + +VP K G+ + S K G+ + A V+
Sbjct: 298 VVWPEL----FAESKEAIMVPVFGKDMPGTEDVSSQDVNK-------GADEGQAGKTVIG 346
Query: 580 LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 617
+PY+LP Y++++ PW + D G+ WP +
Sbjct: 347 FRMPYDLPLTPYTAKEKPWCAQMPSAEPDWMGRAWPGY 384
>gi|451845752|gb|EMD59064.1| hypothetical protein COCSADRAFT_41609 [Cochliobolus sativus ND90Pr]
Length = 568
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 93/422 (22%), Positives = 172/422 (40%), Gaps = 67/422 (15%)
Query: 175 WANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEH 232
+ T +I +V++ D + A++S++M D +WL PV K + +++ + +
Sbjct: 148 YPRTDDTTIDEVLEADTVRTAVISSFMWDSEWLFKKLDPV--KTKQLWIMNAKGKDIQQR 205
Query: 233 MKRNKPA----NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS-- 286
++ A N +H PP+ + HSK MLL P+ +RI++ TAN+I DW +
Sbjct: 206 WQKEMEAMGVPNLKIHFPPMDGMIQSMHSKLMLLFGPKKLRIVIPTANMIQTDWGEVAND 265
Query: 287 -------QGLWMQDFPLKDQNNLSEE---CGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
+++ D P + S + F +L+ +L K PE
Sbjct: 266 WQPGVMENSIFLIDLPRRGNETTSTKENMTRFGQELMYFLEMQKVPEMVLQ--------- 316
Query: 337 NPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQ 395
F+FS + + + S+ G H S G + L +Q+ + ++ L Y
Sbjct: 317 ---GILNFDFSQTSHLAFVHSIGGSHKTESEHPTGLLGLTRAIQDLHLDN-VEQMELDYA 372
Query: 396 FSSLGSLDEKWMAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGNA 452
SSLG++++ +++ L ++ F+ D P I +PT E V+ S+ G G
Sbjct: 373 ASSLGAINDSFLSRLYLAACGRCFAADTAMVPDVRNHIRIYFPTNETVKKSIGGPDCGGI 432
Query: 453 IPSPQKNVD-----KDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK----------- 496
I Q+ + ++ L+ Y + R+ + H K +K
Sbjct: 433 ISLSQQRYNAATFPRECLRDY-------ESLRAGMLSHNKLLLARGRKKDGRPVGWVYVG 485
Query: 497 ---LAKAAWGALQ----KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 549
L+++AWG + L IR++E GV++ R VP + G+
Sbjct: 486 SANLSESAWGGQKVIKDGKMGSLNIRNWECGVVMTVPDDRLAGLDKDKDKTVPMSVFEGT 545
Query: 550 TE 551
E
Sbjct: 546 VE 547
>gi|378727943|gb|EHY54402.1| tyrosyl-DNA phosphodiesterase 1 [Exophiala dermatitidis NIH/UT8656]
Length = 793
Score = 75.9 bits (185), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 114/248 (45%), Gaps = 49/248 (19%)
Query: 145 NSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDI 203
+S+ A N H +R + S FRL ++ LP+ N +S+ D++ +I A + NY D+
Sbjct: 100 SSKGAPPNGHAAR-LIASPFRLTSIRDLPSSQNIDTISLHDILGIPLIKEAWIFNYCFDV 158
Query: 204 DWLLPACP--VLAKIPHVLVIHGE---SDGT---LEHMKRNKPANWILHKPPLPISFGTH 255
DWL+ + +++ V V+HG DG +E R P N +P +FGTH
Sbjct: 159 DWLMSYFDEDIRSQV-KVKVVHGSWRAEDGNRLGIEDACRRWP-NVESVTAYMPDAFGTH 216
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSEECG--- 307
HSK +L + ++++HTAN++H DW N +Q +W P NN + G
Sbjct: 217 HSKMFILFTHDDLAQVVIHTANMLHRDWTNMTQAVWQSPMLPVLPPTTNNNSTGAKGNQP 276
Query: 308 ----------------FENDLIDYLSTLKWPEFSANLPAHGN-FKINPSFFKKFNFSSAA 350
F++D++ YLS A+G K +F+FSS
Sbjct: 277 KSTSTSPIGSIGTGSRFKHDMMAYLS------------AYGTKTKSLREQLVRFDFSSVR 324
Query: 351 VRLIASVP 358
L+ASVP
Sbjct: 325 GALVASVP 332
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/278 (21%), Positives = 102/278 (36%), Gaps = 91/278 (32%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK----DFLKKYWAKWKASHT---------G 478
I++PT ++V SL+GYA+G +I + L+ +W S T G
Sbjct: 515 IIFPTPQNVASSLDGYASGGSIHMKAQAASHLNQISLLRPSLCQWTRSQTGASSSSSLSG 574
Query: 479 RSRAMPHIKTFARYNGQ------------------KLAKAAWGAL-----QKNNSQLMIR 515
R A PH+KT+ R+ + L+ AWG + ++ +++++
Sbjct: 575 RHLAAPHVKTYIRFKSKPTTQHPTPDIDWALLTSANLSTQAWGVVREPKDKRKEKEVVVQ 634
Query: 516 SYELGVLILP-----------SAKRHGCG-------------FSCTSN------------ 539
S+E+GVL+ P + K+ G G T+N
Sbjct: 635 SFEIGVLVWPGLFGPEFEDEGTIKQDGAGSGRDARMGTGDYDIKNTTNPSKEDQSQNLNS 694
Query: 540 -------------------IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 580
+ P+ I +G E + + ++ +V +
Sbjct: 695 VHSVRMAPVFGTDMPSQLQLQPANIGTGIVEDGTASGNGNENGNVNEKDVSSTTTTLVGI 754
Query: 581 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRHF 618
+PY+LP Y D+PWS Y D +G+ WPR F
Sbjct: 755 RLPYDLPLTPYVETDMPWSPQGVYEVPDRHGRRWPRDF 792
>gi|440632301|gb|ELR02220.1| hypothetical protein GMDG_01013 [Geomyces destructans 20631-21]
Length = 529
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 172/392 (43%), Gaps = 63/392 (16%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ +A+LS++ D +W+L +A+ +L+ E ++++ P+
Sbjct: 93 IKIEEVLQKNDLDLAVLSSFQWDQEWILSKLD-MARTKLILIAQAVPRDDQEEVRKSAPS 151
Query: 240 NWILHKPP-LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFP 295
N P + T HSK LL +P +R++V +ANL+ DW +++ D P
Sbjct: 152 NVRFCFPSNKDETVSTMHSKLQLLAHPSHLRVVVPSANLVPYDWGETGVMENTVFLIDLP 211
Query: 296 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG-NFKINPSFFKKFNFS-SAAVRL 353
N + EN L + L+ F L A G + KI S KF+FS +A +
Sbjct: 212 RLAANKV---VSIEN-LTPFCRELR--RF---LKAQGLDSKITDSLL-KFDFSQTAGLAF 261
Query: 354 IASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKW----- 406
+ S+ G HT + K G+ L + +QE PL F +S+G+L + +
Sbjct: 262 VHSIGGNHTENDWKTIGYPGLGSAIQELGLA---NTGPLNVTFVSASIGALTDDFVLAIL 318
Query: 407 --------MAELS--SSMSSGFSEDKTPLGIGEPL------IVWPTVEDVRCSLEGYAAG 450
+ EL+ +S S + + T I++P+ E VR S G +G
Sbjct: 319 LACKGDDGLTELTWRTSTSPAYRKRTTKEETLLMEMEEGFRIMFPSHETVRTSKNGTNSG 378
Query: 451 NAIP-SPQKNVDKDFLKKYWAKWKASHTG---RSRAMPHIKTFARYNGQK---------- 496
I P+ + F K+ + K+ G S+ + T +G +
Sbjct: 379 GTICLDPKYYQREQFPKELFRDCKSKRAGLLLHSKLLFTAPTHMNADGDRGKAWAYVGSA 438
Query: 497 -LAKAAWGALQKNNS----QLMIRSYELGVLI 523
L+++AWG L KN S +L R++E GV+I
Sbjct: 439 NLSESAWGRLTKNKSTKQVKLYCRNWECGVVI 470
>gi|238496339|ref|XP_002379405.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
gi|220694285|gb|EED50629.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
Length = 491
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 62/259 (23%), Positives = 113/259 (43%), Gaps = 51/259 (19%)
Query: 387 FKKSPLVYQFSSLGSLDEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLE 445
FK+ L Y +KW+ + + +S+S + + P + I++PT +++R SL
Sbjct: 250 FKRDLLAYLTEYGPKKTDKWLKDVMFASLSPASTSTRQP----KYSIIFPTADEIRRSLN 305
Query: 446 GYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH------------TGRSRAMPHIKTF 489
GY +G +I S + +++ Y W H GR RA PHIKT+
Sbjct: 306 GYGSGGSIHMKLQSAAQQKQLQYMRPYLRHWAGDHDTAEPSHTSKQDAGRRRAAPHIKTY 365
Query: 490 ARYNGQK--------------LAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 535
R++ + L+ AWGA + ++ I S+E+G+++ P
Sbjct: 366 IRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNASGEVRICSWEIGIVVWPQLYVQDTE-- 423
Query: 536 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 595
++ +VP+ K + E + + ++ T V+ L +PY+LP Y++ D
Sbjct: 424 -SATMVPT-FKRDTPEPLENKDSETTPDT------------VIGLRMPYDLPLTPYAAHD 469
Query: 596 VPWSWDKRYTKKDVYGQVW 614
PW ++ + D GQ W
Sbjct: 470 TPWCATAQHLEPDWLGQTW 488
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 122/254 (48%), Gaps = 48/254 (18%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS F+L ++ L A + N V +R+++ +I NY+ D+D+++ + +
Sbjct: 85 IPSPFQLTHIRDLAASSDNNVDTVRLREILGDPMIRECWQFNYLHDVDFIMGQFDEDVRR 144
Query: 216 IPHVLVIHGESDGTLEHMKRNKPANWILHKPP------------LPISFGTHHSKAMLLI 263
+ V ++HG KR+ P + + +P +FGTHHSK M+L+
Sbjct: 145 LVKVKIVHGS-------WKRDAPNRVRIDEACSRYPNVEAVVAYMPEAFGTHHSKMMVLL 197
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE------CGFENDLIDY 315
+ V++++HTAN+I DW N Q +W PL+ ++ E+ F+ DL+ Y
Sbjct: 198 RHDDLVQVVIHTANMIPGDWANMCQAVWRSPLLPLQKTDDRVEDLILGSGARFKRDLLAY 257
Query: 316 LS------TLKWPE---FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 366
L+ T KW + F++ PA + + P + F + R S+ GY +G S+
Sbjct: 258 LTEYGPKKTDKWLKDVMFASLSPASTSTR-QPKYSIIFPTADEIRR---SLNGYGSGGSI 313
Query: 367 KKWGHMKLRTVLQE 380
HMKL++ Q+
Sbjct: 314 ----HMKLQSAAQQ 323
>gi|315045107|ref|XP_003171929.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma gypseum CBS 118893]
gi|311344272|gb|EFR03475.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma gypseum CBS 118893]
Length = 678
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 112/239 (46%), Gaps = 23/239 (9%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPA-CPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
+ + +V+Q D+ +A+LS+++ D+DWLL + ++ GE + + M+
Sbjct: 210 IKLEEVLQQADLELAVLSSFLWDMDWLLAKFTNPKTRFLFIMGAKGE-ERQAQLMRETAS 268
Query: 239 ANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQD 293
WI L PP+ HSK MLL +P +RI++ +ANL DW K L++ D
Sbjct: 269 MPWIRLCFPPMDGEVHCMHSKLMLLFHPNHMRIVIPSANLDPYDWGEKGGVMENMLFLID 328
Query: 294 FPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
P K + ++ F ++L+ +L K N KI +F+FS +
Sbjct: 329 LPRKAREADEDKTPFRDELVYFLRASKL-----------NEKIIDKML-QFDFSNTTKYA 376
Query: 353 LIASVPGYHTGS-SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 410
+ S+ G H GS S ++ GH L T ++ E + L Y SS+GSL ++ L
Sbjct: 377 FVHSIGGSHIGSGSYERTGHCGLGTAVKSLGLETS-RPLTLDYITSSVGSLTATFLQNL 434
>gi|146413473|ref|XP_001482707.1| hypothetical protein PGUG_04662 [Meyerozyma guilliermondii ATCC
6260]
Length = 537
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 106/425 (24%), Positives = 164/425 (38%), Gaps = 110/425 (25%)
Query: 248 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 305
LP FGTHH+K M+ + + +++ T N+ +D +Q W L S
Sbjct: 163 LPDRFGTHHTKMMINFFENQLCEVVIMTCNITKLDIGGLTQMCWRSGRLALGTTKPDSMG 222
Query: 306 CGFENDLIDYLSTLKWPEFS--ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
F+ DL DYL K + S AN +++FSS V L+AS PGY
Sbjct: 223 YRFQRDLTDYLKRYKKKKLSELANR------------IMEYDFSSINVELVASAPGYFDM 270
Query: 364 SSL----KKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGS--LDEKWMAELSSSMS 415
+ + +G KL VL+ + K ++ Q SS+ + EK+ S
Sbjct: 271 DDITTNSEVYGFGKLYQVLKRNNLLIKDTSKHHNMLSQVSSIAYPVVSEKFHT------S 324
Query: 416 SGFSEDKTPLGIGEP-----------------------LIVWPTVEDVRCSLEGYAAGNA 452
S F+ PL +P IV+PT ++V + G+ AG +
Sbjct: 325 SIFTHILCPLIFDDPQFSMLSPGRETTRNHQKLYNYTPTIVYPTAQEVSQANVGFGAGAS 384
Query: 453 I------PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNGQK-------- 496
I +N K + Y KW KA GR+ PH+K + NG +
Sbjct: 385 IHFNYTRSHAHENQYKQNILPYLHKWTSKADTAGRNHVPPHVKLYLCDNGDEWKSIKWAL 444
Query: 497 -----LAKAAWGALQ-KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+K AWGA + KN + + SYELGVL+ G + T +K+
Sbjct: 445 LCSHNLSKQAWGAPKSKNGRKYHVASYELGVLVP------GTPHTLTPTYPHDHLKNC-- 496
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDV 609
+ L +P+++PP+ Y D PWS + + KD
Sbjct: 497 --------------------------LAPLRLPFKVPPEPYGDSDQPWSPHMNFGELKDR 530
Query: 610 YGQVW 614
+G +
Sbjct: 531 FGNTY 535
>gi|190348157|gb|EDK40564.2| hypothetical protein PGUG_04662 [Meyerozyma guilliermondii ATCC
6260]
Length = 537
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 109/426 (25%), Positives = 166/426 (38%), Gaps = 112/426 (26%)
Query: 248 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 305
LP FGTHH+K M+ + + +++ T N+ +D +Q W L S
Sbjct: 163 LPDRFGTHHTKMMINFFENQLCEVVIMTCNITKLDIGGLTQMCWRSGRLALGTTKPDSMG 222
Query: 306 CGFENDLIDYLSTLKWPEFS--ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
F+ DL DYL K + S AN +++FSS V L+AS PGY
Sbjct: 223 YRFQRDLTDYLKRYKKKKLSELANR------------IMEYDFSSINVELVASAPGYFDM 270
Query: 364 SSLKK----WGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGS--LDEKWMAELSSSMS 415
+ +G KL VL+ + K ++ Q SS+ + EK+ S
Sbjct: 271 DDITTNSEVYGFGKLYQVLKRNNLLIKDTSKHHNMLSQVSSIAYPVVSEKFHT------S 324
Query: 416 SGFSEDKTPLGIGEP-----------------------LIVWPTVEDVRCSLEGYAAGNA 452
S F+ PL +P IV+PT ++V + G+ AG +
Sbjct: 325 SIFTHILCPLIFDDPQFSMLSPGRETTRNHQKLYNYTPTIVYPTAQEVSQANVGFGAGAS 384
Query: 453 I------PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNGQK-------- 496
I +N K + Y KW KA GR+ PH+K + NG +
Sbjct: 385 IHFNYTRSHAHENQYKQNILPYLHKWTSKADTAGRNHVPPHVKLYLCDNGDEWKSIKWAL 444
Query: 497 -----LAKAAWGALQ-KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
L+K AWGA + KN + + SYELGVL+ G+
Sbjct: 445 LCSHNLSKQAWGAPKSKNGRKYHVASYELGVLV-----------------------PGTP 481
Query: 551 ETSQIQKTKLVTLTW-HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KD 608
T +T T+ H S + L +P+++PP+ Y D PWS + + KD
Sbjct: 482 HT--------LTPTYPHDHSKNCLAP----LRLPFKVPPEPYGDSDQPWSPHMNFGELKD 529
Query: 609 VYGQVW 614
+G +
Sbjct: 530 RFGNTY 535
>gi|320165097|gb|EFW41996.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 545
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/319 (23%), Positives = 141/319 (44%), Gaps = 42/319 (13%)
Query: 165 RLLRVQGLPAWAN-TSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHV--- 219
RL Q + + N +S ++ +D+I+ ++ A+ S+Y D DW + + +
Sbjct: 100 RLAEKQAMTSITNDSSSITFQDLIKPRELRRALFSSYEADTDWFVQQLAPMVRSRGASVQ 159
Query: 220 LVIHGESDGTLEHMKRNKPANWILHKPPLPI--SFGTHHSKAMLLIY-PRGVRIIVHTAN 276
L + G + N + ++ PL I + G H + MLL + +R+ V +A+
Sbjct: 160 LFVSSSPTG-----RGNTALSPNINMTPLTIGKTSGRLHGRLMLLFHGSDTLRVAVTSAS 214
Query: 277 LIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTL-----KWPEFSANLP 329
L+ DW + QDFP++ + E G F++ L++Y++ L K + P
Sbjct: 215 LVPSDWGVLENVTYYQDFPIEAKRPTVTERGLAFQSTLMNYVTQLVAHQPKDDDVDDRHP 274
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK----KWGHMKLRTVLQE--CTF 383
A + K NF + RLI+S P + S+L+ + G M L LQ T
Sbjct: 275 ARAARILKE--LKTVNFDTVEARLISSYPEH---SNLETNGCRQGLMALEQALQAEYSTL 329
Query: 384 EKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL-----------I 432
SP++YQ SS+G + + W+ + +++ ++G + G P
Sbjct: 330 PAQVLNSPIIYQSSSIGQVSDPWVTQFATACNAGAPARISGESRGSPFAIDPADALKLQF 389
Query: 433 VWPTVEDVRCSLEGYAAGN 451
++PT V +L+G+ G+
Sbjct: 390 IFPTTATVSQALQGFPEGH 408
>gi|402219032|gb|EJT99107.1| phospholipase D/nuclease [Dacryopinax sp. DJM-731 SS1]
Length = 680
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 115/491 (23%), Positives = 201/491 (40%), Gaps = 106/491 (21%)
Query: 177 NTSCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEH-- 232
N I D++ D+ +LS+Y D WL P +IP +LV+ + D + H
Sbjct: 207 NRPRFKITDIVSPASDLEFVLLSSYCTDTPWLTTFLP--REIPVLLVV--DPDPSQRHDA 262
Query: 233 -MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLW 290
+K +W+ P + S G H K +LL Y G +R+ + TANL+ DW + ++
Sbjct: 263 SLKNLGIGDWLRVTPRIWQSRGVMHIKVLLLFYKSGRLRVAIPTANLVDYDWRDIENTVF 322
Query: 291 MQDF-PLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG----NFKINPSFFKKFN 345
+QD P+ D + + F L L +L P NL G + + K++
Sbjct: 323 VQDLPPITDSSADPQSHDFPTYLWGVLKSLNVPAGLLNLVNSGYPSLPLQSLQNLQDKWD 382
Query: 346 FSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFE-KGFKKSPLVYQFSSLGSLD 403
+ RL+ASV G + G +++ +GH +L ++++ + K K + Q SS+G+
Sbjct: 383 WCKMRARLVASVAGNYEGWYNVRMYGHPRLSAIIRDSRAQPKKGKVLNIECQGSSVGNCT 442
Query: 404 EKWMAELSSS-------------MSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAG 450
+++ E+ S MS + P+ I++PT++ V S+ G G
Sbjct: 443 TQYLNEVYKSCCGIDPISWIDIPMSRQVRQPWPPVK-----ILFPTLKTVDDSVFGRNGG 497
Query: 451 NAIPSPQKNVDKDFLKK-YWAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAWGALQKNN 509
+ F KK YW+K G + + + +GQ L +
Sbjct: 498 GSF----------FCKKPYWSK-----LGSPKEL--FYSVKAKDGQVLM----------H 530
Query: 510 SQLMIRSYELGV--------LILPSAKRHGCGFSCTSNIV---PSEIKSGSTETSQIQKT 558
+++++ +Y+ G L LP++ + G IV SE ++ +TE +
Sbjct: 531 TKMIVGTYKTGSLPVLRPAPLALPASGK-GKAKEKPEVIVLSSDSETEASTTEDEEDAGE 589
Query: 559 KLVTLTW-----HGSSDA--GASSEVVYLP------------------------VPYELP 587
+ W H + A G S + +P VP+E P
Sbjct: 590 QKTPEAWVYMGSHNFTMAAWGTVSGSILVPKLNISNFEMGIVLPIEDQKELERIVPWERP 649
Query: 588 PQRYSSEDVPW 598
P+RY +DVPW
Sbjct: 650 PRRYGPKDVPW 660
>gi|395329020|gb|EJF61409.1| phospholipase D/nuclease [Dichomitus squalens LYAD-421 SS1]
Length = 656
Score = 72.4 bits (176), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 108/419 (25%), Positives = 164/419 (39%), Gaps = 80/419 (19%)
Query: 172 LPAWANTSCVSIRDVI--QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
+PA N + +++ + DI AI+S Y D ++ + P + V H T
Sbjct: 210 IPAQDNRPLFRLSEILTLKEDIEFAIISAYCWDYKFVYQLMD--RRTPVIAVDHSP---T 264
Query: 230 LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQG 288
E + NWI P L FG H K MLL + G +RI+V TANL+ DW +
Sbjct: 265 GEASIKAILPNWIRTTPFLRGGFGCMHMKFMLLFFRTGRLRIVVSTANLVEYDWRDIENT 324
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANL-PAHGNFKIN---------- 337
+W+QD P + ++ + D+ S L N+ PA N N
Sbjct: 325 VWVQDVPKRPSPEPADP-----KVEDFASALVRMLHGVNVAPALVNHLKNEYPNLPLQRL 379
Query: 338 PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQ 395
++FS RLI S+ G H G + GH L L++ E K L Q
Sbjct: 380 EELRTHWDFSRVKARLIPSIAGKHEGWPKVILTGHTCLMKSLKDIGAETPKDKDLVLECQ 439
Query: 396 FSSLGSLDEKWMAELSSSMS--------SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
SS+G+ W+ E S G + L + I++PT + VR S+ G
Sbjct: 440 GSSVGAYTTAWLNEFYCSARGESAQTWLDGPKSRRAKLPLPPIKILFPTAQYVRDSVLGE 499
Query: 448 AAGNAIPSPQKNVD-KDFLKKYWAKWKASHTGRSRAMPHIK----TF------------- 489
G + +K + K+F ++ + + + + R R + H K TF
Sbjct: 500 VGGGTMFCRRKQWEGKNFPRELFHQ---TRSKRGRVLMHSKMVLGTFRDKRRKQQTLTDS 556
Query: 490 ------------ARYNGQKLA-----------KAAWGALQKN--NSQLMIRSYELGVLI 523
+ Q+LA +AWG L + N L I +YELGVLI
Sbjct: 557 EDEAEDGRNADSGSRDRQQLAGWVYVGSHNFTPSAWGTLTGSAFNPTLNITNYELGVLI 615
>gi|302695723|ref|XP_003037540.1| hypothetical protein SCHCODRAFT_47163 [Schizophyllum commune H4-8]
gi|300111237|gb|EFJ02638.1| hypothetical protein SCHCODRAFT_47163 [Schizophyllum commune H4-8]
Length = 646
Score = 72.4 bits (176), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 124/287 (43%), Gaps = 28/287 (9%)
Query: 188 QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPP 247
+ +I AILS+Y +D +W + V+++ DG + +N NWI P
Sbjct: 212 KSEIEFAILSSYALDAEWTYS---FFERDTPVIIVQQTKDG--DASIKNWLPNWIRASPF 266
Query: 248 LPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN------ 300
L +G H K MLL Y G +R+ + TANL+ D+ + W+QD P + +
Sbjct: 267 LRNGYGCMHMKFMLLFYKTGRLRVYIPTANLVQYDYRDIENFAWLQDIPRRPAHKPEPKP 326
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVP 358
N + +++ L+ + +P H N + + +++S V L+AS+
Sbjct: 327 NPEDFPSIMQRVLEALNIRPAQLETNTIPQHPNLPLQSISDLRRLWDWSLVKVHLVASLH 386
Query: 359 GYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLV-YQFSSLGSLDEKWMAELSSSM-- 414
G + G S+ + GH +L ++ ++ V Q SS+G W+ E+ SM
Sbjct: 387 GKYEGWPSVLQVGHPRLMKAVRNMGLAVDKEREVEVECQGSSIGRCTSVWINEMYGSMRG 446
Query: 415 --------SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI 453
++ + TPL + + IV+PT V + G G I
Sbjct: 447 QSAREWLDATKKRREATPLPLVK--IVYPTKATVHATAWGVNGGGTI 491
>gi|403173802|ref|XP_003332829.2| hypothetical protein PGTG_14494 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170701|gb|EFP88410.2| hypothetical protein PGTG_14494 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 583
Score = 72.0 bits (175), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 73/290 (25%), Positives = 123/290 (42%), Gaps = 30/290 (10%)
Query: 181 VSIRDVI--QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
+ I D+I + I +A++S+Y++++ W+ + ++VI +D K N+
Sbjct: 156 LRIEDIIGPKDRIKMALVSSYVLELPWIHK---LFNPRTRIMVIRHHTD--CGSFKVNER 210
Query: 239 ANWILHKPPL------PISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ 292
AN L PP+ G H K ++ Y R+ + TAN + D+ +W+Q
Sbjct: 211 ANMFLCHPPMLKTANGNAKAGCMHIKFFIIFYDNFCRVAIPTANAVSFDYEFVENAIWIQ 270
Query: 293 DFPLKDQNNLSEECGFENDLIDYLSTLK--WPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
DF N + +D+ + TL LP K +F SAA
Sbjct: 271 DFRRFSGNTIGYNSRRSDDVPPFRKTLDDLLDRMGVPLPFRKP-------LKDHDFGSAA 323
Query: 351 VRLIASVPGYHTGSSLKKWGHM--KLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA 408
L+ S+ G H +S H+ +L+T+ + G + + L Q SS+GS D KW+
Sbjct: 324 ANLVVSIQGTHPANSPMGQAHLAEQLKTLGLQSGPGTG-RTATLECQGSSIGSYDLKWLN 382
Query: 409 EL----SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI 453
S S + +ED PL +++PT+ VR S G A +
Sbjct: 383 NFYRCASGSPPTASTEDPDLQTKTPPLTVLYPTLHTVRNSHSGKAGAGTL 432
>gi|212546295|ref|XP_002153301.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210064821|gb|EEA18916.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 596
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 129/292 (44%), Gaps = 44/292 (15%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMK 234
N + I +V+Q D+ +A+LS + D+ W+ K ++V+ + + T L++ +
Sbjct: 232 NGDDIKIEEVLQTADLELALLSAFQWDMQWMFTKFRTPNKTRFLMVMQAKEESTRLQYQE 291
Query: 235 R--NKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQG 288
N P N L PP+ HSK MLL +P +RI+V +AN++ DW +
Sbjct: 292 ETANMP-NIRLCFPPMDGQVNCMHSKLMLLFHPEYLRIVVPSANMVPYDWGEQGGVMENT 350
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPA---HGNF--KINPSFFKK 343
+++ D P K ND D T + E S L A H N K++ FK+
Sbjct: 351 VFLIDLPKKST----------NDAADSPKTAFYEELSYFLKASTLHENIIAKLSAFDFKE 400
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFK-KSPLVYQF--SSLG 400
N + + ++ G H G SL + GH L + G K + P+ F SS+G
Sbjct: 401 TNRYA----FVHTIGGSHFGESLTRTGHCGLGKAVTSL----GLKTREPINIDFVTSSIG 452
Query: 401 SLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA 452
SL +++M + S +T I +I+ +V C L G + NA
Sbjct: 453 SLTDEFMRSIYLSAQG----KQTLYSIIRTIIL-----NVSCRLGGDGSTNA 495
>gi|212546293|ref|XP_002153300.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210064820|gb|EEA18915.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 684
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 129/292 (44%), Gaps = 44/292 (15%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMK 234
N + I +V+Q D+ +A+LS + D+ W+ K ++V+ + + T L++ +
Sbjct: 232 NGDDIKIEEVLQTADLELALLSAFQWDMQWMFTKFRTPNKTRFLMVMQAKEESTRLQYQE 291
Query: 235 R--NKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQG 288
N P N L PP+ HSK MLL +P +RI+V +AN++ DW +
Sbjct: 292 ETANMP-NIRLCFPPMDGQVNCMHSKLMLLFHPEYLRIVVPSANMVPYDWGEQGGVMENT 350
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPA---HGNF--KINPSFFKK 343
+++ D P K ND D T + E S L A H N K++ FK+
Sbjct: 351 VFLIDLPKKST----------NDAADSPKTAFYEELSYFLKASTLHENIIAKLSAFDFKE 400
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFK-KSPLVYQF--SSLG 400
N + + ++ G H G SL + GH L + G K + P+ F SS+G
Sbjct: 401 TNRYA----FVHTIGGSHFGESLTRTGHCGLGKAVTSL----GLKTREPINIDFVTSSIG 452
Query: 401 SLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA 452
SL +++M + S +T I +I+ +V C L G + NA
Sbjct: 453 SLTDEFMRSIYLSAQG----KQTLYSIIRTIIL-----NVSCRLGGDGSTNA 495
>gi|16768278|gb|AAL28358.1| GH27933p [Drosophila melanogaster]
Length = 161
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 60/110 (54%), Gaps = 16/110 (14%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFA 490
+++P+ +V S +G G +P + DK +LK Y +WK+S RSRAMPHIK++
Sbjct: 6 MIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRFRSRAMPHIKSYT 65
Query: 491 RYN------------GQKLAKAAWGALQKNNS---QLMIRSYELGVLILP 525
R+N L+KAAWG KN++ L I +YE GVL LP
Sbjct: 66 RFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 115
>gi|297806769|ref|XP_002871268.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297317105|gb|EFH47527.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 1083
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 87/199 (43%), Gaps = 35/199 (17%)
Query: 191 IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH-------GESDGTLEHMKRNKPANWIL 243
I +A L++ DI W L C + + +P + H D N P N +
Sbjct: 403 IFIATLTS---DILWFLTCCEIPSHLPVTIACHHAERCWSSSPDARSTAPLPNYP-NVTM 458
Query: 244 HKPPLP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ 292
PP P I+FG HH K +L +R+I+ +ANL+ WN+ + +W Q
Sbjct: 459 VFPPFPEEIAFGKDRKNRGIACHHPKLFILQREVSIRVIITSANLVARQWNDVTNTVWWQ 518
Query: 293 DFPLK---DQNNLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
DFP + D +L C G + D L+ ++P+ ++ I F K
Sbjct: 519 DFPRRADPDVLSLFGHCRRETNHGLKTDFCAQLAGFA-ASLLTDVPSQAHWIIE---FTK 574
Query: 344 FNFSSAAVRLIASVPGYHT 362
+NF +A L+ASVPG H+
Sbjct: 575 YNFEHSACHLVASVPGIHS 593
>gi|367027210|ref|XP_003662889.1| hypothetical protein MYCTH_2304039 [Myceliophthora thermophila ATCC
42464]
gi|347010158|gb|AEO57644.1| hypothetical protein MYCTH_2304039 [Myceliophthora thermophila ATCC
42464]
Length = 646
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 100/448 (22%), Positives = 161/448 (35%), Gaps = 86/448 (19%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q + +A+LS+Y D +W+L + A+ +LV + E M+ N P
Sbjct: 215 IKIEEVLQKQHLHLAVLSSYQWDEEWMLSKIDI-ARTKLILVAFAADEAQKEEMRSNVPR 273
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
+ I P G+ HSK MLL Y +RI+V T NL+ DW +++ D P
Sbjct: 274 DRIRFCFPPMHGIGSMHSKLMLLKYENYLRIVVPTGNLMSFDWGETGTMENMVFILDLP- 332
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIA 355
K + E N D L L A G + + ++F+ A +
Sbjct: 333 KFETAEGREAQKLNRFADQLFYF--------LRAQGLDEKLVDSLRNYDFTEAGRYEFVH 384
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL--- 410
++PG HTG + G+ L Q G + P+ +SLG+++ + L
Sbjct: 385 TIPGSHTGDDALRTGYCGLG---QSVNALVGTRSEPVELDLVCASLGAVNYGLLTSLYYA 441
Query: 411 ---------------SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPS 455
S F+ L I +P+ E V S G I
Sbjct: 442 CLGDPLREYEERASGSQRNRDAFTSRAISLVKEHMRIFFPSRETVLRSKGGKDGAGTIC- 500
Query: 456 PQKNVDKDFLKKYWAK-------WKASHTGRSRAMPHIKT----------------FARY 492
L K+W + + R + H K FA
Sbjct: 501 --------LLSKWWQAPTFPRELVRDCKSVRQGVLMHTKALYVRPCSPTSQQSGRCFAYV 552
Query: 493 NGQKLAKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 548
L+++AWG L ++ + +L R++E GVL+ CT V +G
Sbjct: 553 GSANLSESAWGRLSRDRASGKPKLTCRNWECGVLL------------CTDRTVEGSSGAG 600
Query: 549 STETSQIQKTKLVTLTWHGSSDAGASSE 576
S V + W G + +G E
Sbjct: 601 SDNLGVFDGCVPVPMEWPGRAISGEGGE 628
>gi|261190935|ref|XP_002621876.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis SLH14081]
gi|239590920|gb|EEQ73501.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis SLH14081]
Length = 696
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 121/489 (24%), Positives = 201/489 (41%), Gaps = 106/489 (21%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNK 237
+ +V+Q D+ +A+LS+YM ++DW+ + K L+I GE D E K
Sbjct: 242 IKFEEVVQKSDLELAVLSSYMWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTK 299
Query: 238 PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL----WMQ 292
+ L PP+ HSK MLL +P +RI V +ANL+ DW + + ++
Sbjct: 300 SMGSVRLCFPPMEPQVNCMHSKLMLLFHPSYLRIAVPSANLVPFDWGEQGGVMENIVFLI 359
Query: 293 DFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFS 347
D PLK +L+ G F +DL+ +L ++NL + KK F+FS
Sbjct: 360 DLPLKSP-DLANGPGTSFLDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFS 403
Query: 348 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 406
+ + + ++ G HT +K G L + + + + L Y SS+GSL+E++
Sbjct: 404 ATKDIAFVHTIGGSHTDPKWRKTGLCGLGSAITALGLQTT-RDINLDYVTSSVGSLNEQF 462
Query: 407 MAE--LSSSMSSGFSE------------------DKTPLG--IGEPLIVWPTVEDVRCSL 444
+ L++ SG E +T G + +V+P+++ VR S
Sbjct: 463 LRSMYLAAQGDSGLKELTLRTSKRFPSENWGVVTKRTDGGKWKDKFRVVFPSLDTVRKSK 522
Query: 445 EGYAAGNAIPSPQK-----NVDKDFLKKYWAKWKASHTGRSRAMPHIKTF---------- 489
G I K KD ++ ++ R + H K
Sbjct: 523 GGAENAGTICFQSKWYNSATFPKDIMRDNISR-------REGLLMHNKILFVRPEKPITS 575
Query: 490 -----ARYNG------QKLAKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGF 534
RY G L+++AWG L + S +L R++E GV+I RH
Sbjct: 576 LKDNSTRYAGWAYVGSANLSESAWGRLVLDRSTTKPKLNCRNWECGVVI---PIRHNDAG 632
Query: 535 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRY 591
+S +PS +G T T K + +SD G+ V+ +PVP +P RY
Sbjct: 633 KLSS--IPS---TGRTAT---LLAKSESEDSSANSDDGSEVTTVFEPTIPVPMIVPAPRY 684
Query: 592 SSEDVPWSW 600
+ P+ +
Sbjct: 685 HGRNRPFFY 693
>gi|392580440|gb|EIW73567.1| hypothetical protein TREMEDRAFT_70993 [Tremella mesenterica DSM
1558]
Length = 758
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 112/477 (23%), Positives = 178/477 (37%), Gaps = 129/477 (27%)
Query: 190 DIIVAILSNYMVDIDWLLPACPVLAKIPHVLV------IHGESDGTLEHMKRNKPANWIL 243
+I + ILS +++D DWL P K+P V+V +H +G ++ + +
Sbjct: 335 EIKLIILSTFVLDDDWLSGILPDPQKVPTVIVRPHPKEMHSTYNGKVQAQVTGE----VF 390
Query: 244 HKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNN 301
P + G H K + Y G +R+++ TAN + DW+ ++QDF P K +
Sbjct: 391 CYPLMLDERGAAHMKYAWIFYKTGRLRVMISTANFVPYDWDWIENTTFVQDFLPRKPTSP 450
Query: 302 LSEECGFENDLIDYLSTL--------------KWPEFSANLPAH--GNFKINPSFFKKFN 345
G D + + +L + ++LP G F+ K++
Sbjct: 451 APTTKG--EDFVAHFRSLFIHLKVHKALRYLKDQHKAGSDLPPQVSGAFE----GLDKYD 504
Query: 346 FSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSL 402
+S +VRLI SV GYH G K+G +L VL++ + K LV +F SSLG
Sbjct: 505 WSRVSVRLIMSVAGYHHGYDQADKYGMTRLGKVLKDEGLVQS-KGERLVAEFQGSSLGQY 563
Query: 403 DEKW---MAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQK 458
+ +W +L + D PL I++P++ V S G G +
Sbjct: 564 NIEWYNTFYQLCTGKDVRALVDHPKYKDWPPLKIIFPSLATVEASELGKDGGGTM----- 618
Query: 459 NVDKDFLKKYWAKWKASHTGRSRAMPHIK----TF------------------------- 489
K F + S + R + H K TF
Sbjct: 619 FCGKAFTANTKHLFHHSESKRGGVLMHTKMLIGTFEPIPRSLGFTSVDCKSGKRKASEME 678
Query: 490 -ARYNG------QKLAKAAWGALQKNNSQLMIRSYELGVLI-LPSAKRHGCGFSCTSNIV 541
+ Y G + AAWG + +L IR+YELG+L LP K
Sbjct: 679 ESPYGGWIYVGSHNFSAAAWGTMNFKEKRLTIRNYELGILFPLPRDK------------- 725
Query: 542 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 598
A A +++V PY+ P ++YSS D+PW
Sbjct: 726 -----------------------------ARAMADIV---APYKRPARQYSSNDIPW 750
>gi|119467668|ref|XP_001257640.1| tyrosyl-DNA phosphodiesterase domain protein [Neosartorya fischeri
NRRL 181]
gi|119405792|gb|EAW15743.1| tyrosyl-DNA phosphodiesterase domain protein [Neosartorya fischeri
NRRL 181]
Length = 676
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/248 (28%), Positives = 114/248 (45%), Gaps = 41/248 (16%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D+ +AILS++M DI+WL V K L++ D E KR A
Sbjct: 238 ITIEEVFQRSDLELAILSSFMWDIEWLF--SKVDTKSTRFLLVMQAKD---ELTKRQYEA 292
Query: 240 ------NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGL 289
N L PP+ HSK MLL +P +RI+ TANL DW
Sbjct: 293 ETASMSNLRLCFPPMEGQVNCMHSKLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSA 352
Query: 290 WMQDFPLK-DQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNF 346
++ D P K ++ + FE DL+ +L STL+ S +F+F
Sbjct: 353 FLIDLPRKVATTSVGSKTVFEEDLVYFLRASTLQENIISR--------------LDEFDF 398
Query: 347 SSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSL 402
S + + L+ ++ G HTG++ ++ G+ L + G + S P+ F SS+GSL
Sbjct: 399 SQTSHIMLVHTIGGSHTGNTWRRTGYCGLGRAVNAL----GLRTSKPINIDFVASSVGSL 454
Query: 403 DEKWMAEL 410
++++ +
Sbjct: 455 TDEFLRSI 462
>gi|403159950|ref|XP_003320511.2| hypothetical protein PGTG_02533 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169349|gb|EFP76092.2| hypothetical protein PGTG_02533 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 573
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 127/292 (43%), Gaps = 30/292 (10%)
Query: 179 SCVSIRDVI--QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRN 236
+ + I D+I + I +A++S+Y++++ W+ + ++VI +D K N
Sbjct: 144 NALRIEDIIGPKDRIKMALVSSYVLELPWIHK---LFNPRTRIMVIRHHTD--CGSFKVN 198
Query: 237 KPANWILHKPPLPISF------GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW 290
+ AN L PP+ + G H K ++ Y R+ + TAN + D+ +W
Sbjct: 199 ERANMFLCHPPMLKTANGNAKPGCMHIKFFIIFYDNFCRVAIPTANAVSFDYEFVENAIW 258
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLK--WPEFSANLPAHGNFKINPSFFKKFNFSS 348
+QDF N + +D+ + TL LP F+ + +F S
Sbjct: 259 IQDFRRFSGNTIGYNSRRSDDVPPFRKTLDDLLDRMGVPLP----FR---KPLEDHDFRS 311
Query: 349 AAVRLIASVPGYHTGSSLKKWGHM--KLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 406
AA L+ SV G H +S H+ +L+T+ + G + + L Q SS+GS D KW
Sbjct: 312 AAANLVVSVQGTHPANSPMGQAHLAEQLKTLGLQSGPGTG-RTATLECQGSSIGSYDLKW 370
Query: 407 MAEL----SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI 453
+ S S + +ED PL +++P++ VR S G A +
Sbjct: 371 LNNFYRCASGSPPTASTEDPDLQTKTPPLSVLYPSLHTVRNSHSGKAGAGTL 422
>gi|327354754|gb|EGE83611.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis ATCC 18188]
Length = 696
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 121/489 (24%), Positives = 200/489 (40%), Gaps = 106/489 (21%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNK 237
+ +V+Q D+ +A+LS+YM ++DW+ + K L+I GE D E K
Sbjct: 242 IKFEEVVQKSDLELAVLSSYMWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTK 299
Query: 238 PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL----WMQ 292
+ L PP+ HSK MLL +P +RI V +ANL+ DW + + ++
Sbjct: 300 SMGSVRLCFPPMEPQVNCMHSKLMLLFHPSYLRIAVPSANLVPFDWGEQGGVMENIVFLI 359
Query: 293 DFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFS 347
D PLK +L+ G F +DL+ +L ++NL + KK F+FS
Sbjct: 360 DLPLKSP-DLANGPGTSFLDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFS 403
Query: 348 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 406
+ + + ++ G HT +K G L + + + + L Y SS+GSL+E++
Sbjct: 404 ATKDIAFVHTIGGSHTDPKWRKTGLCGLGSAITALGLQTT-RDINLDYVTSSVGSLNEQF 462
Query: 407 MAE--LSSSMSSGFSE------------------DKTPLG--IGEPLIVWPTVEDVRCSL 444
+ L++ SG E +T G + +V+P++ VR S
Sbjct: 463 LRSMYLAAQGDSGLKELTLRTSKRFPSENWGVVTKRTDGGKWKDKFRVVFPSLNTVRKSK 522
Query: 445 EGYAAGNAIPSPQK-----NVDKDFLKKYWAKWKASHTGRSRAMPHIKTF---------- 489
G I K KD ++ ++ R + H K
Sbjct: 523 GGAENAGTICFQSKWYNSATFPKDIMRDNISR-------REGLLMHNKILFVRPEKPITS 575
Query: 490 -----ARYNG------QKLAKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGF 534
RY G L+++AWG L + S +L R++E GV+I RH
Sbjct: 576 LKDNSTRYAGWAYVGSANLSESAWGRLVLDRSTTKPKLNCRNWECGVVI---PIRHNDAG 632
Query: 535 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRY 591
+S +PS +G T T K + +SD G+ V+ +PVP +P RY
Sbjct: 633 KLSS--IPS---TGRTAT---LLAKSESEDSSANSDDGSEVTTVFEPTIPVPMIVPAPRY 684
Query: 592 SSEDVPWSW 600
+ P+ +
Sbjct: 685 HGRNRPFFY 693
>gi|409042750|gb|EKM52233.1| hypothetical protein PHACADRAFT_148739 [Phanerochaete carnosa
HHB-10118-sp]
Length = 603
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 112/250 (44%), Gaps = 22/250 (8%)
Query: 173 PAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEH 232
P + T ++ RD DI+ AI+S Y++++ W P V+V + G E
Sbjct: 155 PVFRLTDILAPRD----DIVFAIVSAYVINLPWFYSF--FNRGTPVVIVTQDPAAGN-ET 207
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG--VRIIVHTANLIHVDWNNKSQGLW 290
+K P +WI P L G H K +++ R +R+++ TAN I DW + +W
Sbjct: 208 LKEVLP-DWIKTTPFLRNGRGCQHMKVTFILFYRTSRLRMVISTANFIEYDWRDIENSVW 266
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANL-----PAHGNFKIN--PSFFKK 343
+QD P + + ++ + + + ++ L+ + L H N + K
Sbjct: 267 LQDVPPR-PSPIAHDSKANDFPMAFMRVLRGVNVAPALLTLTKNGHSNLPLKRIEELRMK 325
Query: 344 FNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLG 400
++FS V LI S+ G H G + + GH L LQ+ KG K+ L Q SS+G
Sbjct: 326 WDFSKIKVALIPSLAGKHEGWPKVIQTGHTALMKALQDMGARTPKG-KELVLECQGSSIG 384
Query: 401 SLDEKWMAEL 410
+ +W+ E
Sbjct: 385 TYTTQWLNEF 394
>gi|343476326|emb|CCD12540.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 173
Score = 69.3 bits (168), Expect = 6e-09, Method: Composition-based stats.
Identities = 43/113 (38%), Positives = 60/113 (53%), Gaps = 18/113 (15%)
Query: 195 ILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH--------- 244
IL Y++D++WL P+L +++I GE G L +K + +LH
Sbjct: 44 ILGGYVMDVEWLFRVSDPLLMSKCTIVLISGEK-GFL-----HKYRHLVLHDRFGRNRVK 97
Query: 245 --KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP 295
+P LPI FG HHSK ML I G+R+ V TAN I DWN K+QG++ P
Sbjct: 98 IVEPCLPIPFGVHHSKMMLCINNNGIRVAVLTANFIEDDWNYKTQGIYFFHSP 150
>gi|294896960|ref|XP_002775774.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
gi|239882085|gb|EER07590.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
Length = 201
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 85/175 (48%), Gaps = 23/175 (13%)
Query: 253 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--------PLKDQNNLSE 304
GT H+K +++ + +R+ + ++N+ DW SQ +W+ DF P + +
Sbjct: 1 GTMHAKLIIIERAQALRVCISSSNVTPQDWEGVSQCIWVADFKPANDPEAPARKRVKPDH 60
Query: 305 ECGFENDLIDYLSTLKWPEFSANLP---AHGNFKINPSFFKKFNFS-SAAVRLIASVPGY 360
F + L ++ T F ++P + ++ + +FN V LIAS PGY
Sbjct: 61 TSDFGDQLARFIET-----FFRSIPDSSSLSSYWVKVLTGSRFNVKLPKGVELIASAPGY 115
Query: 361 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
G WGHM+LR +L + E+ +++Q SS+G L ++A+LS S++
Sbjct: 116 WKGDDRDNWGHMRLRALLSDVHSEE------ILFQCSSIGFLPASFLADLSKSLN 164
>gi|159122429|gb|EDP47550.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 665
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 117/244 (47%), Gaps = 33/244 (13%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D+ +AILS++M DI+WL + +LV+ + D T + +
Sbjct: 227 ITIEEVFQRSDLELAILSSFMWDIEWLFSKVDTKS-TRFLLVMQAKDDLTKRQYEAETAS 285
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL +P +RI+ TANL DW ++ D
Sbjct: 286 MSNLRLCFPPMEGQVNCMHSKLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSAFLID 345
Query: 294 FPLK-DQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SA 349
P K ++ + FE +L+ +L STL+ N+ S +F+FS ++
Sbjct: 346 LPRKVATTSVGSKTVFEEELVYFLRASTLQ-----ENI---------ISRLDEFDFSPTS 391
Query: 350 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKW 406
+ L+ ++ G HTG++ ++ G+ L + G + S P+ F SS+GSL +++
Sbjct: 392 HIMLVHTIGGSHTGNTWRRTGYCGLGRAVNAL----GLRTSKPINIDFVASSVGSLTDEF 447
Query: 407 MAEL 410
+ +
Sbjct: 448 LRSI 451
>gi|238494160|ref|XP_002378316.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
gi|220694966|gb|EED51309.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
Length = 679
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 111/242 (45%), Gaps = 29/242 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D+ +A+LS++M +++WL AK LV+ + + T K A
Sbjct: 240 IKIEEVFQKSDLQLAVLSSFMWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAA 298
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL + VRI+V TANL DW +++ D
Sbjct: 299 MSNLRLCFPPMDGQVNCMHSKLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIID 358
Query: 294 FPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
P + D+++ GF ++L + LK N+ A ++FS +A +
Sbjct: 359 LPKRTDKDSGFTRTGFYDELTYF---LKASTLHENIIAK---------LTDYDFSRTAHI 406
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMA 408
+ ++ G H G S ++ G+ L + G + S PL F SS+GSL ++++
Sbjct: 407 AFVHTIGGSHMGDSWRRTGYCGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLR 462
Query: 409 EL 410
+
Sbjct: 463 SI 464
>gi|281210780|gb|EFA84946.1| hypothetical protein PPL_01939 [Polysphondylium pallidum PN500]
Length = 493
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 136/315 (43%), Gaps = 54/315 (17%)
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP----LK 297
I+H P L G HSK +LL Y + +R+++ ++NL DW Q +++ D P
Sbjct: 134 IIHPPLLVSQIGILHSKIILLEYQQIIRVVISSSNLTGSDWEVLGQTIFIVDIPRIKKNN 193
Query: 298 DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLI 354
N + F+ +L+D LS+L + + + +N +F+FS + ++
Sbjct: 194 IDNINDNKDQFKYELVDILSSLGFTD---------DHIVNA--LDQFDFSMIHQHGIHIV 242
Query: 355 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
+S+PG + S K+G KL ++ E + K+ VYQ S++G +W++
Sbjct: 243 SSIPGVY---SHNKYGLSKLASLASEY---QSTSKATAVYQSSAIGMTSREWLSSF---- 292
Query: 415 SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL--KKYWAKW 472
K +G + +PT+ + + G + DKD L K +K
Sbjct: 293 -------KAAIGTDNLTLPFPTLNTIDEMITYNPLGATESVTIRYHDKDLLLSNKMLSKL 345
Query: 473 KASH----------TGRSRAMP-HIKTFAR----YNG-QKLAKAAWGALQKNNSQLMIRS 516
+ ++ T S P H K Y+G +A+WG++ K S + I +
Sbjct: 346 QYNNERDPKVDNSITNLSSHPPLHSKVLITDRWIYHGSHNFTEASWGSISKRQSTIKISN 405
Query: 517 YELGVLILPSAKRHG 531
+E GV I P+A G
Sbjct: 406 FETGVFI-PTALFTG 419
>gi|7576178|emb|CAB87929.1| hypothetical protein [Arabidopsis thaliana]
Length = 1075
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 82/195 (42%), Gaps = 32/195 (16%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPP 247
L+ + DI W L C +P + H D N P N + PP
Sbjct: 401 FLATFTSDILWFLTCCDTPRHLPVTIACHNAERCWSSNPDARTAVPLPNYP-NVTMVYPP 459
Query: 248 LP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
P I+FG HH K +L +R+I+ +ANL+ WN+ + +W QDFP
Sbjct: 460 FPEEIAFGKDRTNRGIACHHPKLFILQRKDSIRVIITSANLVARQWNDVTNTVWWQDFPR 519
Query: 297 K---DQNNLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
+ D +L C G + D L+ ++P+ ++ + F K+NF
Sbjct: 520 RADPDLLSLFGHCQRETNHGLKPDFCAQLAGFA-ASLLTDVPSQAHWILE---FTKYNFE 575
Query: 348 SAAVRLIASVPGYHT 362
+A L+ASVPG H+
Sbjct: 576 HSAGHLVASVPGIHS 590
>gi|391872408|gb|EIT81535.1| hypothetical protein Ao3042_01981 [Aspergillus oryzae 3.042]
Length = 679
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 111/242 (45%), Gaps = 29/242 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D+ +A+LS++M +++WL AK LV+ + + T K A
Sbjct: 240 IKIEEVFQKSDLQLAVLSSFMWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAA 298
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL + VRI+V TANL DW +++ D
Sbjct: 299 MSNLRLCFPPMDGQVNCMHSKLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIID 358
Query: 294 FPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
P + D+++ GF ++L + LK N+ A ++FS +A +
Sbjct: 359 LPKRTDKDSGFTRTGFYHELTYF---LKASTLHENIIAK---------LTDYDFSRTAHI 406
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMA 408
+ ++ G H G S ++ G+ L + G + S PL F SS+GSL ++++
Sbjct: 407 AFVHTIGGSHMGDSWRRTGYCGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLR 462
Query: 409 EL 410
+
Sbjct: 463 SI 464
>gi|317148904|ref|XP_001822999.2| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae RIB40]
Length = 667
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 111/242 (45%), Gaps = 29/242 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D+ +A+LS++M +++WL AK LV+ + + T K A
Sbjct: 240 IKIEEVFQKSDLQLAVLSSFMWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAA 298
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL + VRI+V TANL DW +++ D
Sbjct: 299 MSNLRLCFPPMDGQVNCMHSKLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIID 358
Query: 294 FPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
P + D+++ GF ++L + LK N+ A ++FS +A +
Sbjct: 359 LPKRTDKDSGFTRTGFYHELTYF---LKASTLHENIIAK---------LTDYDFSRTAHI 406
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMA 408
+ ++ G H G S ++ G+ L + G + S PL F SS+GSL ++++
Sbjct: 407 AFVHTIGGSHMGDSWRRTGYCGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLR 462
Query: 409 EL 410
+
Sbjct: 463 SI 464
>gi|42567721|ref|NP_196357.2| forkhead-associated domainand FHA domain-containing protein
[Arabidopsis thaliana]
gi|30102672|gb|AAP21254.1| At5g07400 [Arabidopsis thaliana]
gi|110743660|dbj|BAE99667.1| hypothetical protein [Arabidopsis thaliana]
gi|332003770|gb|AED91153.1| forkhead-associated domainand FHA domain-containing protein
[Arabidopsis thaliana]
Length = 1084
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 82/195 (42%), Gaps = 32/195 (16%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPP 247
L+ + DI W L C +P + H D N P N + PP
Sbjct: 401 FLATFTSDILWFLTCCDTPRHLPVTIACHNAERCWSSNPDARTAVPLPNYP-NVTMVYPP 459
Query: 248 LP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
P I+FG HH K +L +R+I+ +ANL+ WN+ + +W QDFP
Sbjct: 460 FPEEIAFGKDRTNRGIACHHPKLFILQRKDSIRVIITSANLVARQWNDVTNTVWWQDFPR 519
Query: 297 K---DQNNLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
+ D +L C G + D L+ ++P+ ++ + F K+NF
Sbjct: 520 RADPDLLSLFGHCQRETNHGLKPDFCAQLAGFA-ASLLTDVPSQAHWILE---FTKYNFE 575
Query: 348 SAAVRLIASVPGYHT 362
+A L+ASVPG H+
Sbjct: 576 HSAGHLVASVPGIHS 590
>gi|406602541|emb|CCH45857.1| Tyrosyl-DNA phosphodiesterase 1 [Wickerhamomyces ciferrii]
Length = 587
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 101/493 (20%), Positives = 191/493 (38%), Gaps = 110/493 (22%)
Query: 179 SCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHV----LVIHGESDGTLEHM 233
+ V I DV+ ++ L +Y D++++LP H L I ++ L+
Sbjct: 142 NSVIISDVLSSPNLRSCYLFSYQHDLEFILPQF-------HSNNIDLTIVYQTGTVLDSP 194
Query: 234 KRNKPANWILHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ 292
KR N + +P + +HH K ++ +Y V++ + + N+ ++W+ +Q +W
Sbjct: 195 KRALFRNVQFIEVAMP-PYSSHHPKLIINVYNDDTVQLFLVSCNMTFMEWSTNNQMIWQS 253
Query: 293 DFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVR 352
KD N S++ F+ L +Y+ + P+ + KK++F+S
Sbjct: 254 PRLHKDLN--SKDTVFKTHLFNYIKNYQKPQLDTLV----------VLLKKYDFNSIIGD 301
Query: 353 LIASVPGYHTGSSLKKWG--------------HMKLRTVL-QECTFEKGFKKSPLVYQFS 397
++S T WG H K R +L Q + + +P + Q +
Sbjct: 302 FVSSATS--TSDKFGFWGLYNSLLSKGLIPRKHEKERQLLYQTSSIASAIRHTPTINQSA 359
Query: 398 SLGS------LDEKWMAELSSSMSSGFSEDKTPLGIGE--------PLIVWPTVEDVRCS 443
++ + K+ S+S F + + + P I++P++ DVR S
Sbjct: 360 NIFTHLLLPLFSGKYTNHGRLSISRDFPLSNGFISVEQFSKEYKVKPYIIYPSLSDVRNS 419
Query: 444 LEGYAAGN-AIPSPQKNVDK---DFLKKYWAKWKASHTGRSRAMPHIKTF---------- 489
L GY +G + +P +K DFL + S++ + + P F
Sbjct: 420 LFGYGSGGWSHFNPHSKWNKPMNDFLTP--KVFHHSYSQQRKTNPSHTKFLIMSSDNFKT 477
Query: 490 ---ARYNGQKLAKAAWGALQKNNSQLM------IRSYELGVLILPSAKRHGCGFSCTSNI 540
+ ++K AWG L + +YE G+L+ PS +G G
Sbjct: 478 LDWVFFTSTNMSKQAWGTPPTKKDLLSLPPKSNVSNYETGILLCPSD--YGSGI------ 529
Query: 541 VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSW 600
K + L + + + +YLP + LPP++YS++D PW
Sbjct: 530 ------------------KFIPLEFGQEKNLEENEVPIYLP--FRLPPEKYSNQDEPWCV 569
Query: 601 DKRYTKKDVYGQV 613
K + D+ G +
Sbjct: 570 SKSHDLPDILGNL 582
>gi|242823839|ref|XP_002488140.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218713061|gb|EED12486.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 673
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 107/246 (43%), Gaps = 27/246 (10%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMK 234
N + + I +V+Q D+ +A+LS + D +WL K ++V+ + + T L++ +
Sbjct: 229 NNNDIKIEEVLQTADLELAVLSAFQWDTEWLFSKFRTPGKTRFLMVMQAKEESTRLQYQQ 288
Query: 235 RNKPA-NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGL 289
N L PP+ HSK MLL +P +RI+V +ANL+ DW + +
Sbjct: 289 ETADMPNIRLCFPPMEGQIKCMHSKLMLLFHPDYLRIVVPSANLVPYDWGEQGGVMENTV 348
Query: 290 WMQDFPLKDQNNLSE--ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF- 346
++ D P + ++ + + F +L +L H N F+F
Sbjct: 349 FLIDLPKRSAQDVPDTPKKAFYEELAFFLQAST---------VHNNIIAK---LSSFDFK 396
Query: 347 SSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDE 404
++ R + ++ G H G ++ GH L + P+ F SS+GSL +
Sbjct: 397 ETSRYRFVHTIGGSHIGECRRRTGHCGLGQAVSSLGLR---THEPISIDFVTSSIGSLTD 453
Query: 405 KWMAEL 410
++M +
Sbjct: 454 EFMRSI 459
>gi|402224759|gb|EJU04821.1| phospholipase D/nuclease [Dacryopinax sp. DJM-731 SS1]
Length = 955
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 70/296 (23%), Positives = 130/296 (43%), Gaps = 28/296 (9%)
Query: 190 DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLP 249
++ + S + D +WL P A +P + + H E + P + ++ P
Sbjct: 508 ELRFVLTSAFGTDFEWLRSMIP--AGVPLLSINHPTDRERWEPQIKPLPLDGWIYATPKM 565
Query: 250 ISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGF 308
G H K +LL Y G +R+++ TANL+ DW + +++QD P K++++ +E F
Sbjct: 566 NKGGIMHVKLLLLFYKNGRLRLVIPTANLVPDDWRDIENTMFLQDIPAKNKDSSAEPHPF 625
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP-----SFFKKFNFSSAAVRLIASVPG-YHT 362
L +L L + L G + P + +++S +L+ S G Y
Sbjct: 626 PVYLASFLKILNVHNGLSAL-VQGGYPNLPLPSLDALATGWDWSRVTAQLVGSPAGSYED 684
Query: 363 GSSLKKWGHMKLRTVLQECTFEKGF-KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED 421
S+++WGH +L +++ + K+ L YQ SS+G+ +++ + S G S D
Sbjct: 685 WDSVRRWGHPRLGEAVRQLKAQPPTGKRLNLEYQGSSIGNYTTQYLNDFYKS-GCGLSPD 743
Query: 422 ---KTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK-YWAK 471
+ P P IV+P++ V ++ G + F +K YW+K
Sbjct: 744 VSKRRPKAQPWPAIQIVYPSLTTVDNTVLGRLGAGSF----------FCRKQYWSK 789
>gi|302823724|ref|XP_002993511.1| hypothetical protein SELMODRAFT_449151 [Selaginella moellendorffii]
gi|300138642|gb|EFJ05403.1| hypothetical protein SELMODRAFT_449151 [Selaginella moellendorffii]
Length = 920
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 90/208 (43%), Gaps = 33/208 (15%)
Query: 181 VSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLE 231
VS+ D++ DI ++++ DI W + + + +P + H +E
Sbjct: 239 VSVADLLAPLEDIREMFVASFTTDIIWFISSFGLPKTLPVTIACHDSERSWSTAISDRME 298
Query: 232 HMKRNKPANWILHKPPLPI-----------SFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
H P N + PP P+ G HH K LL + +R+IV ++NL +
Sbjct: 299 HPYCEWP-NLKVVYPPFPVLRRTNDKSRMRGVGCHHPKFFLLKRSKDIRVIVTSSNLNYR 357
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSE-------ECGFEN-DLIDYLSTLKWPEFSANLPAHG 332
W S +W QDFPL++ + S E G N D YL+ ++P+
Sbjct: 358 QWLQVSNTVWWQDFPLRNTRDYSSLFSSKITEGGERNGDFAAYLAGF-ISTLVKDVPSEA 416
Query: 333 NFKINPSFFKKFNFSSAAVRLIASVPGY 360
++ + +NFS A V L+ASVPG+
Sbjct: 417 HWATD---LACYNFSKATVSLVASVPGF 441
>gi|358399116|gb|EHK48459.1| hypothetical protein TRIATDRAFT_290150 [Trichoderma atroviride IMI
206040]
Length = 590
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 106/478 (22%), Positives = 187/478 (39%), Gaps = 101/478 (21%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG--TLEH----- 232
++I +V Q D + +A+LS++ D +W+L + +L++ DG LE
Sbjct: 149 ITIEEVFQKDKLELAVLSSFQWDEEWMLSKLDY--RRTKILLLAFARDGAQVLEFIHKTL 206
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGL 289
M+ N PAN PP+ G HSK LL YP +R+++ T NL+ DW +
Sbjct: 207 MQGNVPANIKFCFPPMH-GVGAMHSKLQLLKYPSHLRVVIPTGNLMPYDWGETGVMENMV 265
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-S 348
++ D P D + + + T + E L A G + + ++FS +
Sbjct: 266 FLIDLPRLDHPVSTHASAARS----HAPTRFYTELVYFLQATGVGEKMVASLANYDFSRT 321
Query: 349 AAVRLIASVPGYHTG--------------------------SSLKKWGHMKLRTVLQECT 382
A + + ++PG H+ +SL +R + C
Sbjct: 322 ADLAFVHTIPGSHSAKNAERIASVADLGLASVDPVDVDLVCASLGALNQQMVRAIYNACR 381
Query: 383 FEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC 442
+ G + SS S + +++++S + L I +PT V
Sbjct: 382 GDDGTDEYHKPASTSSRSSAKKPTTTTTTATVTS-----QEQLLRERFRIYFPTDRTVSQ 436
Query: 443 SLEGYAAGNAI---------PSPQKNVDKDFL---------KKYWAKWKASHTGRSRAMP 484
S G AG I P+ + + +D + K + + + +G+++A+
Sbjct: 437 SRGGRNAGGTICVQTKWWRAPNFPRELVRDVISRDRVLMHSKMIFVRRRPGDSGQAQAVR 496
Query: 485 HIKTFARYNGQKLAKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNI 540
+A L+++AWG + K+ S +L+ R++E GV+I
Sbjct: 497 QSPGWAYVGSANLSESAWGRMSKDKSTGGFKLVCRNWECGVII----------------P 540
Query: 541 VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 598
VP E+ + KT L T S+D S +PVP ++P Y S D PW
Sbjct: 541 VP--------ESQPVDKTTLPT-----SADDDMSMFAGTVPVPMQVPGPVYRSSDQPW 585
>gi|326472360|gb|EGD96369.1| hypothetical protein TESG_03817 [Trichophyton tonsurans CBS 112818]
Length = 676
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 111/241 (46%), Gaps = 27/241 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ + +V+Q D+ +A+LS+++ D+DWLL + + ++ + + E + R +
Sbjct: 218 IKLEEVLQPSDLELAVLSSFLWDMDWLLMKF-TNPRTRFLFIMGAKGEERREQLLRETAS 276
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQD 293
L PP+ HSK MLL + +RI++ +ANL DW + L++ D
Sbjct: 277 MSRIRLCFPPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEEGGVMENMLFLID 336
Query: 294 FPLKDQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
P K + + F ++L+ +L STL N KI +++FS +A
Sbjct: 337 LPRKANETVDDTTPFRDELVYFLRASTL-------------NEKIIDKML-QYDFSQTAK 382
Query: 351 VRLIASVPGYHTGS-SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE 409
+ S+ G H GS S ++ GH L T ++ + L Y SS+GSL ++
Sbjct: 383 YAFVHSIGGSHIGSGSYERTGHCGLGTAVKSLGLATS-RPLKLDYIASSVGSLTATFLQN 441
Query: 410 L 410
L
Sbjct: 442 L 442
>gi|326484528|gb|EGE08538.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Trichophyton equinum CBS 127.97]
Length = 462
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 111/241 (46%), Gaps = 27/241 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ + +V+Q D+ +A+LS+++ D+DWLL + + ++ + + E + R +
Sbjct: 233 IKLEEVLQPSDLELAVLSSFLWDMDWLLMKF-TNPRTRFLFIMGAKGEERREQLLRETAS 291
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQD 293
L PP+ HSK MLL + +RI++ +ANL DW + L++ D
Sbjct: 292 MSRIRLCFPPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEEGGVMENMLFLID 351
Query: 294 FPLKDQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
P K + + F ++L+ +L STL N KI +++FS +A
Sbjct: 352 LPRKANETVDDTTPFRDELVYFLRASTL-------------NEKIIDKML-QYDFSQTAK 397
Query: 351 VRLIASVPGYHTGS-SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE 409
+ S+ G H GS S ++ GH L T ++ + L Y SS+GSL ++
Sbjct: 398 YAFVHSIGGSHIGSGSYERTGHCGLGTAVKSLGLATS-RPLKLDYIASSVGSLTATFLQN 456
Query: 410 L 410
L
Sbjct: 457 L 457
>gi|302142785|emb|CBI20080.3| unnamed protein product [Vitis vinifera]
Length = 1032
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 83/198 (41%), Gaps = 39/198 (19%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPL 248
++ + D+ W L C V +P + H S ++ + N ++ PP
Sbjct: 366 FVATFTSDVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPF 425
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-- 295
P I+FG HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 426 PEAIAFGRDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRI 485
Query: 296 ------------LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ NL F L ++++L ++P+ ++ + K
Sbjct: 486 SPPDYSSIFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTK 537
Query: 344 FNFSSAAVRLIASVPGYH 361
++F A L+ASVPG H
Sbjct: 538 YDFKGATGHLVASVPGIH 555
>gi|359493967|ref|XP_002283806.2| PREDICTED: uncharacterized protein LOC100243589 [Vitis vinifera]
Length = 1091
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 83/198 (41%), Gaps = 39/198 (19%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPL 248
++ + D+ W L C V +P + H S ++ + N ++ PP
Sbjct: 406 FVATFTSDVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPF 465
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-- 295
P I+FG HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 466 PEAIAFGRDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRI 525
Query: 296 ------------LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ NL F L ++++L ++P+ ++ + K
Sbjct: 526 SPPDYSSIFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTK 577
Query: 344 FNFSSAAVRLIASVPGYH 361
++F A L+ASVPG H
Sbjct: 578 YDFKGATGHLVASVPGIH 595
>gi|147770909|emb|CAN67540.1| hypothetical protein VITISV_012382 [Vitis vinifera]
Length = 1423
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 83/198 (41%), Gaps = 39/198 (19%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPL 248
++ + D+ W L C V +P + H S ++ + N ++ PP
Sbjct: 410 FVATFTSDVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPF 469
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-- 295
P I+FG HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 470 PEAIAFGRDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRI 529
Query: 296 ------------LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ NL F L ++++L ++P+ ++ + K
Sbjct: 530 SPPDYSSIFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTK 581
Query: 344 FNFSSAAVRLIASVPGYH 361
++F A L+ASVPG H
Sbjct: 582 YDFKGATGHLVASVPGIH 599
>gi|317035597|ref|XP_001396653.2| tyrosyl-DNA phosphodiesterase [Aspergillus niger CBS 513.88]
Length = 640
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 63/252 (25%), Positives = 116/252 (46%), Gaps = 29/252 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ +A++S++M +++WL + K +LV+ E D T +
Sbjct: 184 IKIEEVLQKADLELAVMSSFMWEMEWLFSKFNI-EKTRFILVMQAEDDATKRQYESETAT 242
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQD 293
N L PP+ HSK MLL +P +R++V TANL DW + +++ D
Sbjct: 243 MRNLRLCFPPMGGQVVCMHSKLMLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLID 302
Query: 294 FPLKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
P K N+ E+ F DL+ + LK N+ A F+FS ++
Sbjct: 303 LPKK---NVLEKPTTHFYEDLVVF---LKASTLHENIIAK---------LDNFDFSKTSK 347
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--A 408
+ ++ G HT ++ K+ G+ L ++ + + Y SS+G++ ++++
Sbjct: 348 YAFVHTIGGSHTDTAWKRTGYCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFLRCM 406
Query: 409 ELSSSMSSGFSE 420
L+S G +E
Sbjct: 407 YLASQGDDGLTE 418
>gi|255539987|ref|XP_002511058.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
gi|223550173|gb|EEF51660.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
Length = 1148
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 88/205 (42%), Gaps = 41/205 (20%)
Query: 190 DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWI 242
+I+ ++ + DI W L C + + +P + H D + N P N
Sbjct: 457 NIMRIFIATFTSDILWFLSYCEIPSHLPVTIACHNTERCWSSNPDKRISMPYSNFP-NLS 515
Query: 243 LHKPPLP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWM 291
+ PP P I+FG HH K ++L +R+I+ +ANL+ W+N + +W
Sbjct: 516 VVFPPFPEAIAFGNDRRRQGIACHHPKLLVLQRENSIRVIITSANLVPNQWHNVTNTIWW 575
Query: 292 QDFPLKDQNNLS--------------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN 337
QDFP + +LS F L ++++L ++P+ ++ +
Sbjct: 576 QDFPRRSTPDLSSLFTRVSDGEISQDSRSDFAAQLAGFIASL-----VIDVPSQAHWVVE 630
Query: 338 PSFFKKFNFSSAAVRLIASVPGYHT 362
K+NF A L+AS+PG H+
Sbjct: 631 ---LTKYNFDGALGYLVASIPGIHS 652
>gi|340518445|gb|EGR48686.1| predicted protein [Trichoderma reesei QM6a]
Length = 534
Score = 65.5 bits (158), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 98/472 (20%), Positives = 180/472 (38%), Gaps = 101/472 (21%)
Query: 181 VSIRDVIQGD-IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS++ D +W+L + ++ +L+ + + M+ PA
Sbjct: 105 ITIEEVFQKDHLELALLSSFQWDEEWMLSKLDI-SRTKLLLLAFAKDEAQKNQMRGIVPA 163
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
N PP+ G HSK LL YP +R+++ T NL+ DW +++ D P
Sbjct: 164 NIKFCFPPMH-GVGAMHSKLQLLKYPNRLRVVIPTGNLVPYDWGETGVMENMVFLIDLPR 222
Query: 297 KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRL 353
+ + + F +L+ +L A G + ++FS ++ +
Sbjct: 223 LENPATTPQSPTAFYTELVYFLQ------------ATGVGDKMVASLSNYDFSKTSDIAF 270
Query: 354 IASVPGYHTGSSLKKWGHMKLRTVLQECTFEKG-------FKKSPLVYQFSSLGSLDEKW 406
+ ++PG HTG + ++ G+ L + + ++ +SLG+L+ ++
Sbjct: 271 VHTIPGSHTGKAAERTGYCGLGASVAALGLASAEPVEVDLLARCGDLHCCASLGALNHEF 330
Query: 407 MAEL----------------SSSMSSGFSEDKTPLGIGEP------LIVWPTVEDVRCSL 444
+ + S + SS K P I +PT V S
Sbjct: 331 IEAIYNACRGRDGIEDFKNKSGAASSRSKAAKKPDEAASKELQERFRIYFPTERTVAGSR 390
Query: 445 EGYAAGNAI-------PSP-------QKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFA 490
G AG I SP + + +D L + G + +A
Sbjct: 391 GGRNAGGTICVQAKWWRSPTFPTELVRDVIARDRLLVHSKMIFVRRVGHDQTTQQRPGWA 450
Query: 491 RYNGQKLAKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 546
L+++AWG L ++ S ++ R++E GV ILP
Sbjct: 451 YVGSANLSESAWGRLSRDRSTKAIKMNCRNWECGV-ILP--------------------- 488
Query: 547 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 598
+ ++K V + G A + V PVP ++P Y+S D PW
Sbjct: 489 --------VPESKAVDMARAGGDMAMFAGTV---PVPMQVPGPAYASSDRPW 529
>gi|357520291|ref|XP_003630434.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
gi|355524456|gb|AET04910.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
Length = 1064
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/199 (24%), Positives = 87/199 (43%), Gaps = 41/199 (20%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHG-------ESDGTLEHMKRNKPANWILHKPP 247
++ + DI W L C + +P + + D + +N P N ++ PP
Sbjct: 394 FIATFTSDITWFLTYCKIPYHLPVTIACQNTEKCWSSKPDERVFVPYQNYP-NLVVVHPP 452
Query: 248 LP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
P I+FG HH K ++L +R+I+ +ANL+ WN+ + +W QDFP
Sbjct: 453 FPETIAFGKDHKRHGIACHHPKLIVLQREDSIRVIITSANLVEKQWNSVTNTIWWQDFPR 512
Query: 297 --------------KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFK 342
D+ + + +C F L ++++L ++P+ ++
Sbjct: 513 AILVDYASLFRKIDDDEVHRNSKCDFAAQLAGFMASL-----VIDVPSQAHWITQ---LT 564
Query: 343 KFNFSSAAVRLIASVPGYH 361
K++F SA L+AS+PG H
Sbjct: 565 KYDFGSATGHLVASLPGIH 583
>gi|46111419|ref|XP_382767.1| hypothetical protein FG02591.1 [Gibberella zeae PH-1]
Length = 676
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 17/198 (8%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D + +A+LS+Y D +WL+ L K +L+ +S+ M+ N P
Sbjct: 142 IKIEEVFQKDKLELALLSSYQWDDEWLVSKFD-LRKTKLLLLAFADSEAQKSEMRSNAPP 200
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPL 296
P + G HSK LL YP +R++V +ANL+ DW +++ D P
Sbjct: 201 GIKFVFPAMN-GPGAMHSKLQLLKYPDYLRVVVPSANLVPYDWGETGVMENMVFLIDLPR 259
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
D + F +L +LS E N + +F S K F + +
Sbjct: 260 LDGSATHRPTPFSTELGRFLSATGVGETMVNSLTNYDF----SQTKHLGF-------VYT 308
Query: 357 VPGYHTGSSLKKWGHMKL 374
+PG H G LK+ G+ L
Sbjct: 309 IPGGHQGDELKRIGYSGL 326
>gi|225678545|gb|EEH16829.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 686
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 114/258 (44%), Gaps = 31/258 (12%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHM 233
N + I +VIQ D+ +A+LS+Y+ D DWL + K ++I GE D E
Sbjct: 221 NGDDIKIEEVIQKSDLELAVLSSYIWDADWLFSKFDI--KKSRFILIMGEKEEDKKRELE 278
Query: 234 KRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQG 288
K + L PP+ HSK MLL + +RI++ +ANLI DW K
Sbjct: 279 NDTKSMGSVRLCFPPMEPQVNCMHSKLMLLFHLNHLRIVIPSANLIPFDWGEKGGIMENV 338
Query: 289 LWMQDFP-LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
+++ D P + + + F DL+ +L ++NL K NF
Sbjct: 339 VFLIDLPRISPSPDATPRTPFLEDLVYFLQ-------ASNLDEQ-------IIQKMLNFD 384
Query: 348 SAAVRLIA---SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDE 404
+A + IA ++ G HT + K+ G L + + + L Y SS+GSL+E
Sbjct: 385 FSATKDIAFVHTIGGSHTDPTWKRTGLCGLGRAITSLGLQTS-QNLNLDYVTSSVGSLNE 443
Query: 405 KWMAE--LSSSMSSGFSE 420
+++ L++ +G E
Sbjct: 444 QFLRSIYLAAQGDTGLKE 461
>gi|134082171|emb|CAK42283.1| unnamed protein product [Aspergillus niger]
Length = 655
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 111/237 (46%), Gaps = 28/237 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ +A++S++M +++WL + K +LV+ E D T E
Sbjct: 224 IKIEEVLQKADLELAVMSSFMWEMEWLFSKFNI-EKTRFILVMQAEDDATYESETATM-R 281
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDFP 295
N L PP+ HSK MLL +P +R++V TANL DW + +++ D P
Sbjct: 282 NLRLCFPPMGGQVVCMHSKLMLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLIDLP 341
Query: 296 LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
K N+ E+ F DL+ + LK N+ A F+FS ++
Sbjct: 342 KK---NVLEKPTTHFYEDLVVF---LKASTLHENIIAK---------LDNFDFSKTSKYA 386
Query: 353 LIASVP--GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
+ ++P G HT ++ K+ G+ L ++ + + Y SS+G++ ++++
Sbjct: 387 FVHTIPSGGSHTDTAWKRTGYCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFL 442
>gi|452985745|gb|EME85501.1| hypothetical protein MYCFIDRAFT_133255 [Pseudocercospora fijiensis
CIRAD86]
Length = 482
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 105/448 (23%), Positives = 187/448 (41%), Gaps = 69/448 (15%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLL-----PACPVLAKIPHVLVIHGESDGTLEHMK 234
+ + +V++ + A+LS + DIDWLL P K V+ + D
Sbjct: 70 IKLEEVLEPSSVRTAVLSAFQWDIDWLLRKLKTPLNGGSTKCVFVMQAKEKEDRDQWRED 129
Query: 235 RNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLW 290
+ ++++ P + HSK MLL +P +RI + TANL++ DW Q ++
Sbjct: 130 ASDMSHFLRFCFPNMSGLISCMHSKLMLLFHPHKLRIAIPTANLLNFDWGETGQMENSVF 189
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+ D P G + L D S + E + G + KF+FS+
Sbjct: 190 LIDLPRYSD-------GLKASLEDLPSFGR--ELMYFIQKQGLDQDVRDGVLKFDFSATR 240
Query: 351 -VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWM 407
+ + +V G H + G + L + ++E G S L +F SS+G L+E +
Sbjct: 241 DMAFVHTVGGVHYKDEAARTGLLGLSSAVRELGLSTG---SDLEIEFAASSIGMLNEAQV 297
Query: 408 AELSSSMSSGFSEDKTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 463
+L ++ + + I +PT + VR S G +AG + K+
Sbjct: 298 NDLHTAARGKPQQSSSTTETSTARKNVRIYFPTADTVRSSTAG-SAGTICLQRKYFEAKN 356
Query: 464 FLKKYWAKWKASHTGRSRAMPHIKTF-ARYNGQKLA----------KAAWGAL--QKNNS 510
F + + +K++ G + H K AR +K+A K+AWG L +++ +
Sbjct: 357 FPRDIFRDYKSTRRG---LLSHNKILCARSRKEKVAWVYVGSANMSKSAWGELGAKRDEN 413
Query: 511 QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSD 570
++ R++E GV ILP A++ V E T+ + LV++
Sbjct: 414 KITCRNWECGV-ILPVARK-----------VKDENGDEETDDEGEDEKALVSMN------ 455
Query: 571 AGASSEVVYLPVPYELPPQRYSSEDVPW 598
A + V+ L P+E+P + Y+ + PW
Sbjct: 456 --AFANVIDL--PFEVPGEEYAGRE-PW 478
>gi|302787823|ref|XP_002975681.1| hypothetical protein SELMODRAFT_415650 [Selaginella moellendorffii]
gi|300156682|gb|EFJ23310.1| hypothetical protein SELMODRAFT_415650 [Selaginella moellendorffii]
Length = 920
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 91/211 (43%), Gaps = 41/211 (19%)
Query: 181 VSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLE 231
VS+ D++ DI ++++ DI W + + + +P + H +E
Sbjct: 239 VSVADLLAPLEDIREMFVASFTTDIIWFISSFGLPKTLPVTIACHDSERSWSTAISDRME 298
Query: 232 HMKRNKPANWILHKPPLPI-----------SFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
H P N + PP P+ G HH K LL + +R+IV ++NL +
Sbjct: 299 HPYCEWP-NLKVVYPPFPVLRRTNDKSRMRGVGCHHPKFFLLKRSKDIRVIVTSSNLNYR 357
Query: 281 DWNNKSQGLWMQDFPLKDQNNLS-----------EECG-FENDLIDYLSTLKWPEFSANL 328
W S +W QDFPL++ + S E G F L ++STL ++
Sbjct: 358 QWLQVSNTVWWQDFPLRNTRDYSSLFSSKITDGGERNGDFAAYLAGFISTL-----VKDV 412
Query: 329 PAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
P+ ++ + +NFS A V L+ASVPG
Sbjct: 413 PSEAHWATD---LACYNFSKATVSLVASVPG 440
>gi|71004940|ref|XP_757136.1| hypothetical protein UM00989.1 [Ustilago maydis 521]
gi|46096766|gb|EAK81999.1| hypothetical protein UM00989.1 [Ustilago maydis 521]
Length = 687
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 125/292 (42%), Gaps = 47/292 (16%)
Query: 193 VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR-------------NKPA 239
+A+L+ Y + IDWL P + VL E EH+ R +
Sbjct: 226 LAVLATYDLRIDWLYSLFPRQLPVTLVLPPPKEDYRVNEHVARPGLHPSHIFGGDFTRCP 285
Query: 240 NWILHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
W + P P + T H K ++L++ R +R+ + + NL +DW+ ++QDFPL
Sbjct: 286 GWQICVPNKPKGGWLTQHIKFLILVHQRFLRVAILSGNLNAIDWDRIENTAYIQDFPLLG 345
Query: 299 QNNL------------SEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF 346
Q ++ S + F++ L+ L +L P A A +++F
Sbjct: 346 QASMINHGSGSSSGSKSSQNDFKSQLVRVLRSLSMPASHAVYAA----------LDRYDF 395
Query: 347 SSAA-VRLIASVPGYHTGSSLKKWGHMKLRTV--LQECTFEKGFKKS-PLVYQFSSLGSL 402
S A R++AS P +SL++W ++ + + L + + G K+S L Q SSL +
Sbjct: 396 SLATRARIVASWP---EAASLREWDQIETQGLGRLGKVVRDLGIKESVELECQGSSLANH 452
Query: 403 DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP 454
D KW+ S PL G+P V P + ++ + GNA+P
Sbjct: 453 DVKWIEHFHLLASGVEPRGLLPLK-GKPNEVHP---EYASAIGATSKGNALP 500
>gi|239613173|gb|EEQ90160.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis ER-3]
Length = 662
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 114/465 (24%), Positives = 187/465 (40%), Gaps = 96/465 (20%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNK 237
+ +V+Q D+ +A+LS+YM ++DW+ + K L+I GE D E K
Sbjct: 242 IKFEEVVQKSDLELAVLSSYMWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTK 299
Query: 238 PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL----WMQ 292
+ L PP+ HSK MLL +P +RI V +ANL+ DW + + ++
Sbjct: 300 SMGSVRLCFPPMEPQVNCMHSKLMLLFHPSYLRIAVPSANLVPFDWGEQGGVMENIVFLI 359
Query: 293 DFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFS 347
D PLK +L+ G F +DL+ +L ++NL + KK F+FS
Sbjct: 360 DLPLKSP-DLANGPGTSFLDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFS 403
Query: 348 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 406
+ + + ++ G HT +K G L + + + + +F S E W
Sbjct: 404 ATKDIAFVHTIGGSHTDPKWRKTGLCGLGSAITALGLQTTRDINLDYVRFPS-----ENW 458
Query: 407 MAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-----NVD 461
++ G +DK +V+P++ VR S G I K
Sbjct: 459 -GVVTKRTDGGKWKDKF-------RVVFPSLNTVRKSKGGAENAGTICFQSKWYNSATFP 510
Query: 462 KDFLKKYWAKWKASHTGRSRAMPHIKTF---------------ARYNG------QKLAKA 500
KD ++ ++ R + H K RY G L+++
Sbjct: 511 KDIMRDNISR-------REGLLMHNKILFVRPEKPITSLKDNSTRYAGWAYVGSANLSES 563
Query: 501 AWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 556
AWG L + S +L R++E GV+I RH +S +PS +G T T
Sbjct: 564 AWGRLVLDRSTTKPKLNCRNWECGVVI---PIRHNDAGKLSS--IPS---TGRTAT---L 612
Query: 557 KTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRYSSEDVPW 598
K + +SD G+ V+ +PVP +P RY + P+
Sbjct: 613 LAKSESEDSSANSDDGSEVTTVFEPTIPVPMIVPAPRYHGRNRPF 657
>gi|429855706|gb|ELA30650.1| tyrosyl-dna phosphodiesterase domain-containing protein
[Colletotrichum gloeosporioides Nara gc5]
Length = 620
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 109/482 (22%), Positives = 192/482 (39%), Gaps = 76/482 (15%)
Query: 110 LSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKL-----PSTF 164
L+R KR + + + K ++ D D++ +N+ L + + L F
Sbjct: 77 LARLGKRSATQADLDENFQTSKSQRTDAADSQELRNAAPVLKVQEQAANALDLPFAKGAF 136
Query: 165 RLLRVQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIH 223
R +G P + + I +V+Q + + +A+LS++ D +WLL + VLV +
Sbjct: 137 RRTWARGYPRTGDD--IKIEEVLQKEQLQLAVLSSFQWDEEWLLSKIDC-RRTKMVLVAY 193
Query: 224 GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWN 283
+D ++ N PA I P P+ G HSK +L Y +R++V + NL+ DW
Sbjct: 194 AANDAEKAVIRSNAPAGLIRFCFP-PMHGGYMHSKLQILNY---LRLVVPSGNLVPYDWG 249
Query: 284 NKS---QGLWMQDFPLKD--QNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP 338
+++ D P + Q E F +L +L+ L E K+
Sbjct: 250 ETGVLENMVFLIDLPRYETQQTTAGTETLFGKELRRFLTALGIGE-----------KLVK 298
Query: 339 SFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFS 397
S ++FS ++ + ++ G H S + G+ L + + + Y S
Sbjct: 299 S-LDNYDFSETSRYGFVHTISGSHANDSWQHTGYCGLGNTARSLGLATDYPVD-VDYVAS 356
Query: 398 SLGSLDEKWMAEL----------------------SSSMSSGFSEDKTPLGIGEPL---- 431
SLGSL+ ++ + S + SG S +T L
Sbjct: 357 SLGSLNHGYLTAIYNACQGDSGMKEYEARQSKSTRSKAGRSGPSGSRTITAEAVDLQHHF 416
Query: 432 -IVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTG----------R 479
I +PT + V S G +A I +K F ++ +++ TG R
Sbjct: 417 RIYFPTEKTVSSSRGGRSAAGTICMQEKWWKSSTFPRELLRDCESTRTGLLLHSKAIFVR 476
Query: 480 SRAMPHIKTFARYNGQKLAKAAWGALQKN----NSQLMIRSYELGVLILPSAKRHGCGFS 535
RA + +A L+++AWG L K+ ++L R++E GVL+ + GC S
Sbjct: 477 ERAC-NGAVWAYMGSANLSESAWGRLVKDRESGTAKLSCRNWECGVLV-AVGRTAGCADS 534
Query: 536 CT 537
T
Sbjct: 535 GT 536
>gi|242072904|ref|XP_002446388.1| hypothetical protein SORBIDRAFT_06g015125 [Sorghum bicolor]
gi|241937571|gb|EES10716.1| hypothetical protein SORBIDRAFT_06g015125 [Sorghum bicolor]
Length = 972
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 47/282 (16%)
Query: 117 VSNDGATNGEL---SSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLP 173
V+NDG +GEL SK R + + G +EE + D STF L R+ G
Sbjct: 214 VANDG--DGELPFHGSKGCRDDNAEQPGCGSGNEEQYHSEACYSDG--STFFLNRLVGTG 269
Query: 174 AWANT---SCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG 228
+ S V++ ++ ++ ++ + DI W L C + +P + H + D
Sbjct: 270 SDTRAEPQSGVTLPQLLHPVDSLVRVFIATFTSDISWFLNYCKIPQHLPVTIACHNK-DR 328
Query: 229 TLEHMKRNKPANWILHKP---------PLPISFG---------THHSKAMLLIYPRGVRI 270
N+ A P P I+FG HH K ++L +R+
Sbjct: 329 CWSASSENRTAAPFESHPKLLLVFPRFPEEIAFGQDRKKQGVACHHPKLIVLQREDSMRV 388
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS--------EECGFENDLIDYLSTLKWP 322
IV +ANL+ W+ + +W QDFP + + + ++ F L+ +++++
Sbjct: 389 IVTSANLVPRQWHLITNTVWWQDFPRRTSLDYAALFSAAEKQKSDFAAQLVSFIASM--- 445
Query: 323 EFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
+P+ + IN K++F A LIASVPG H S
Sbjct: 446 --VNEVPSQA-YLINE--IAKYDFEGAGGYLIASVPGIHAQS 482
>gi|115386326|ref|XP_001209704.1| predicted protein [Aspergillus terreus NIH2624]
gi|114190702|gb|EAU32402.1| predicted protein [Aspergillus terreus NIH2624]
Length = 381
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 89/195 (45%), Gaps = 23/195 (11%)
Query: 171 GLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
GLP + + I +V+Q D+ VA+LS++M D+DWL + V ++ + D T
Sbjct: 199 GLPRQGDD--IKIEEVLQRSDLKVAVLSSFMWDMDWLFSKMDQV-NTRFVFLMQAKDDAT 255
Query: 230 LEHMKRNKP--ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN--- 284
+R N L PP+ HSK M+L +P VRI++ TANL DW
Sbjct: 256 KRQYERETADLRNLKLCFPPMEGQVQCMHSKLMILFHPGHVRIVIPTANLTPYDWGEMGG 315
Query: 285 -KSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+++ D P ++ E F+ +LI +L A +++ + +
Sbjct: 316 VMENTVFLIDLPKLHPDSERIETNFKKELIYFLQ------------ASAAYEMVTTKLNE 363
Query: 344 FNFSSAA-VRLIASV 357
++FS A + L+ S+
Sbjct: 364 YDFSKTAHIALVHSI 378
>gi|295662314|ref|XP_002791711.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279837|gb|EEH35403.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 589
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 119/509 (23%), Positives = 197/509 (38%), Gaps = 121/509 (23%)
Query: 145 NSEEALCNFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMV 201
NS+ A V + +PS +L RV+ PA + NT V +RD++ +I NY+
Sbjct: 54 NSKIARQESPVMPNGIPSPIQLTRVRDFPASSENNTDTVKLRDILGDPLIKECWQFNYIF 113
Query: 202 DIDWLLPACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFG 253
DID+L+ + + V +IHG ES + E +R ++ +P +FG
Sbjct: 114 DIDYLMSQFDQDVRDLVKVKIIHGSWKRESPNRIHIDEGCRRYPNVEPMVAY--MPEAFG 171
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 313
THHSK M++I +++ + +K W+++ N+LS ++L
Sbjct: 172 THHSKMMIIIKHDDQAQNHKISSVATLGQTDK----WLKETLF---NSLSPPSARSSELF 224
Query: 314 DYLSTLKWPEFSANLPAHGNFKI---NPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
+ +N PA NF I P ++ S+ GY +G S+
Sbjct: 225 ---------KTESNSPA--NFSIIFPTPDEIRR------------SLNGYMSGGSI---- 257
Query: 371 HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP 430
HMKL++ Q+ Q L +W + ++D G P
Sbjct: 258 HMKLQSAAQQ-------------KQLQYLRPYLCRWAGDA--------NDDGGVKSAGGP 296
Query: 431 LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFA 490
R LEG ++ D LKK + + R A PHIKT+
Sbjct: 297 ------ATSKRKRLEGNDVSESV------QDCAALKKEHRPIREAGRRR--AAPHIKTYV 342
Query: 491 RYN-------------GQKLAKAAWGALQKNNSQLMIRSYELGVLILP------------ 525
R++ L+ AWGA ++ I SYE+GVL+ P
Sbjct: 343 RFSDTDMTTIDWAMVTSANLSLQAWGAAANAKKEIRICSYEIGVLVWPDLFVDEEIDDSD 402
Query: 526 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL----TWHGSSDAGASSE--VVY 579
G G + + SG+ T ++ +V + +A SS+ +V
Sbjct: 403 EPLTKGKGKDNSRREI-----SGNKNTKDVKTAVMVPCFKRDMPEAAENAARSSDTTLVG 457
Query: 580 LPVPYELPPQRYSSEDVPWSWDKRYTKKD 608
+PY+LP Y+++D PW Y++ D
Sbjct: 458 FRMPYDLPLHSYTAKDQPWCATATYSEPD 486
>gi|408391841|gb|EKJ71209.1| hypothetical protein FPSE_08715 [Fusarium pseudograminearum CS3096]
Length = 598
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 85/194 (43%), Gaps = 17/194 (8%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D + +A+LS+Y D +WL+ L K +L+ +S+ M+ N P
Sbjct: 142 IKIEEVFQKDKLELALLSSYQWDDEWLVSKFD-LRKTKLLLLAFADSEAQKSEMRSNAPP 200
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPL 296
P + G HSK LL YP +R++V +ANL+ DW +++ D P
Sbjct: 201 GIKFVFPAM-NGPGAMHSKLQLLKYPDYLRVVVPSANLVPYDWGETGVMENMVFLIDLPR 259
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
D + F +L +LS E N + +F S K F + +
Sbjct: 260 LDGSATHRPTPFSIELGRFLSATGVGETMVNSLTNYDF----SQTKHLGF-------VYT 308
Query: 357 VPGYHTGSSLKKWG 370
+PG H G LK+ G
Sbjct: 309 IPGGHQGDELKRIG 322
>gi|302892021|ref|XP_003044892.1| hypothetical protein NECHADRAFT_94592 [Nectria haematococca mpVI
77-13-4]
gi|256725817|gb|EEU39179.1| hypothetical protein NECHADRAFT_94592 [Nectria haematococca mpVI
77-13-4]
Length = 674
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 86/199 (43%), Gaps = 19/199 (9%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D + +A+LS+Y D +WLL L + +LV + M+ N P
Sbjct: 148 IKIEEVFQKDRLELAVLSSYQWDDEWLLSKID-LRRTKLLLVASAADESQKREMQSNTPP 206
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
P + G HSK LL YP +R++V TANL+ DW +++ D P
Sbjct: 207 GIRFCFPAMN-GPGAMHSKLQLLKYPDYLRVVVPTANLVPYDWGETGVMENMVFLIDLPK 265
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIA 355
+ + + F +L +LS G S ++FS + +
Sbjct: 266 LEASVDHQPTHFSTELGRFLSET------------GVGAGMVSSLSNYDFSRTKHLGFVY 313
Query: 356 SVPGYHTGSSLKKWGHMKL 374
++PG H G SLK+ G+ L
Sbjct: 314 TIPGGHVGDSLKRIGYCGL 332
>gi|118785322|ref|XP_001237572.1| AGAP010578-PA [Anopheles gambiae str. PEST]
gi|116128029|gb|EAU76763.1| AGAP010578-PA [Anopheles gambiae str. PEST]
Length = 239
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 64/138 (46%), Gaps = 7/138 (5%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPL 248
G++ ++ YM+DI+WLL H L+I + LE + +P N K
Sbjct: 83 GELECSLQLTYMIDINWLLEQYSDAGYEQHPLLILYGDESELETISDKQP-NVTAIKIKT 141
Query: 249 PISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD-FPL----KDQNNL 302
FG HH+K L Y G +R++V TANL DW N++QGLW+ P D
Sbjct: 142 KTGFGLHHTKMGLYGYCDGSMRVVVSTANLYENDWYNRTQGLWISPRLPAVPEGSDPTYG 201
Query: 303 SEECGFENDLIDYLSTLK 320
F + L++YL K
Sbjct: 202 ESRTDFRSSLLEYLGAYK 219
>gi|115458196|ref|NP_001052698.1| Os04g0403400 [Oryza sativa Japonica Group]
gi|113564269|dbj|BAF14612.1| Os04g0403400 [Oryza sativa Japonica Group]
Length = 1011
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 80/193 (41%), Gaps = 33/193 (17%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPL 248
++ + D+ W L C V +P + H + + A N +L P
Sbjct: 321 FIATFTSDVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQF 380
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
P I+FG HH K ++L +R+IV +ANL+ W+ + +W QDFP +
Sbjct: 381 PEEIAFGKDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCR 440
Query: 298 DQNNLS--------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
+ S + F L+ +++ F N ++ IN K+NF A
Sbjct: 441 TSTDYSALFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGA 492
Query: 350 AVRLIASVPGYHT 362
A LIASVPG +
Sbjct: 493 AGYLIASVPGIYA 505
>gi|222628800|gb|EEE60932.1| hypothetical protein OsJ_14671 [Oryza sativa Japonica Group]
Length = 1021
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 80/193 (41%), Gaps = 33/193 (17%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPL 248
++ + D+ W L C V +P + H + + A N +L P
Sbjct: 321 FIATFTSDVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQF 380
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
P I+FG HH K ++L +R+IV +ANL+ W+ + +W QDFP +
Sbjct: 381 PEEIAFGKDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCR 440
Query: 298 DQNNLS--------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
+ S + F L+ +++ F N ++ IN K+NF A
Sbjct: 441 TSTDYSALFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGA 492
Query: 350 AVRLIASVPGYHT 362
A LIASVPG +
Sbjct: 493 AGYLIASVPGIYA 505
>gi|400597097|gb|EJP64841.1| ubiquitin interaction domain-containing protein [Beauveria bassiana
ARSEF 2860]
Length = 540
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 119/270 (44%), Gaps = 32/270 (11%)
Query: 110 LSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRV 169
L+R KR N + G M++ Q +E ++S+ L T R
Sbjct: 70 LNRLGKRRRN--SIEGSTQEPDMKRLTSQRSERAESSQPRY---------LQGTVRRTWT 118
Query: 170 QGLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG 228
+G P ++ +++ +++Q D+ +A+LS++ D +WLL +K +L+ S+
Sbjct: 119 RGYPKTSDD--ITVEEILQKDDLQLALLSSFQWDEEWLLSKLNA-SKTRILLLAFAASEE 175
Query: 229 TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS-- 286
+ M+ N P N PP+ G+ HSK L +P+ +R+++ + NL+ DW
Sbjct: 176 QKQLMRGNVPKNIRFCFPPMN-GPGSMHSKLQFLKFPKYLRLVIPSGNLVPYDWGETGVM 234
Query: 287 -QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFN 345
+++ D P + + F ++ +L A G + ++
Sbjct: 235 ENMVFLIDLPRLEASGNRTMTVFGENVARFLK------------ASGVDEAMVESIANYD 282
Query: 346 FSSAA-VRLIASVPGYHTGSSLKKWGHMKL 374
FS+ A + + S+PG H G +L++ G+ L
Sbjct: 283 FSATANLGFVYSIPGGHMGEALRQVGYCGL 312
>gi|38346146|emb|CAD40679.2| OSJNBb0118P14.6 [Oryza sativa Japonica Group]
Length = 989
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 80/193 (41%), Gaps = 33/193 (17%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPL 248
++ + D+ W L C V +P + H + + A N +L P
Sbjct: 321 FIATFTSDVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQF 380
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
P I+FG HH K ++L +R+IV +ANL+ W+ + +W QDFP +
Sbjct: 381 PEEIAFGKDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCR 440
Query: 298 DQNNLS--------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
+ S + F L+ +++ F N ++ IN K+NF A
Sbjct: 441 TSTDYSALFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGA 492
Query: 350 AVRLIASVPGYHT 362
A LIASVPG +
Sbjct: 493 AGYLIASVPGIYA 505
>gi|346971357|gb|EGY14809.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium dahliae VdLs.17]
Length = 609
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 111/491 (22%), Positives = 184/491 (37%), Gaps = 114/491 (23%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V++ D + +A++S++ D W L A+ V + + ++ E ++ N P+
Sbjct: 166 IKIEEVLEKDKLELAVVSSFQWDEPWFLSKVDT-ARTRMVFIAYAKNGAEQETLRANVPS 224
Query: 240 NWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFP 295
+ I L PP+ G HSK LL YP +RI+V + NL+ DW +++ D P
Sbjct: 225 SRIKLCFPPMH-GIGCMHSKLQLLKYPNHLRIVVPSGNLVPYDWGETGVLENIVFLIDLP 283
Query: 296 LKDQNNLSEEC--GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
Q + G + + + + L+ F L A G + F+F+ + R
Sbjct: 284 RIVQAPEDRDAIRGHDAAGVSFGTELR--RF---LRAQGLDESLVKSLDNFDFTETERYR 338
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSS 412
I ++ G HT + G+ L + K + Y SSLGS+D ++ + +
Sbjct: 339 FIHTIAGGHTDQLSGETGYHGLSRAVHSMGLSTD-KPISVDYVTSSLGSIDNSFIKTIYT 397
Query: 413 SMSSGFSEDKTPLGIGEP------------------------LIVWPTVEDVRCSLEGYA 448
+ D G+ +P I +PT + V S G A
Sbjct: 398 ACQG--LNDGQKDGVDQPSRRNTKTALAATATDSDKALGAKMRIYFPTEDTVAKSRGGKA 455
Query: 449 AGNAIPSPQK-----NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ-------- 495
AG I +K +D L+ A T R M F + NG
Sbjct: 456 AGGTICFQEKWWGSATFPRDMLR------DAISTRRGVLMHDKIIFVQPNGTGGQDDPGA 509
Query: 496 --------KLAKAAWGALQK----NNSQLMIRSYELGVLILP--SAKRHGCGFSCTSNIV 541
L+++AWG L K ++L R++E GVL+ + R G S
Sbjct: 510 GWAYVGSANLSESAWGRLTKERGSGRAKLTCRNWECGVLVPTGNTGDRSSGGLS------ 563
Query: 542 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY--LPVPYELPPQRY------SS 593
G+ +AG E +PVP P + Y ++
Sbjct: 564 -------------------------GAGEAGKMLEAFRGAVPVPMVAPSRAYGASSNDTA 598
Query: 594 EDVPWSWDKRY 604
D PW + KRY
Sbjct: 599 ADRPWLFMKRY 609
>gi|345560675|gb|EGX43800.1| hypothetical protein AOL_s00215g536 [Arthrobotrys oligospora ATCC
24927]
Length = 634
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 66/267 (24%), Positives = 113/267 (42%), Gaps = 38/267 (14%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
+QG+ ++ ++I +V+Q D + A+LS Y D W+L + VLV+H + D
Sbjct: 191 IQGVARTSDD--ITIEEVLQKDTLQTAVLSAYQWDFLWILEKIKT-GECDLVLVLHAKED 247
Query: 228 GTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
++H +RN L P + + HSK LL + +R++V TANL DW
Sbjct: 248 EVVDHYRRNLCNIPRTRLCFPDMSGNVNIMHSKLQLLFHLTHLRVVVPTANLTSYDWGEA 307
Query: 286 SQGLWMQDFPLKDQNNLSEECGFEND--LIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ S E EN +ID+ K + P+H F N F K
Sbjct: 308 T-------------GTGSNEGVMENSVFIIDFPELPKTSTEGSTNPSHTPFSRNLLHFCK 354
Query: 344 ---------------FNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF 387
++F+ S + + S+ G H G + G L +++ K
Sbjct: 355 AKGMPSDIIKKVDQVYDFTRSQRLGFVYSIGGSHHGDEALRNGVCGLACAVRDLGL-KTR 413
Query: 388 KKSPLVYQFSSLGSLDEKWMAELSSSM 414
K+ Y SSLGSL+++++ + ++
Sbjct: 414 KRVEADYITSSLGSLNKEFLLRIYRAL 440
>gi|224119906|ref|XP_002318192.1| predicted protein [Populus trichocarpa]
gi|222858865|gb|EEE96412.1| predicted protein [Populus trichocarpa]
Length = 1131
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 82/208 (39%), Gaps = 45/208 (21%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHG------ESDGTLEHMKRNKPANWILHKPPL 248
++ + DI W L C + +P + H S + + N ++ PP
Sbjct: 460 FIATFTSDILWFLSHCEIPCHLPVTIACHNTERCWSSSPDNRTSVPYSDFPNLVVVFPPF 519
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLI------HVDWNNKSQGLWM 291
P I+FG HH K ++L +R+I+ +ANL+ H WNN + +W
Sbjct: 520 PESIAFGQDRKRRGIACHHPKLLVLQREDSIRVIITSANLVSNQVVAHSKWNNVTNTVWW 579
Query: 292 QDFPLKD--------------QNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN 337
QDFP + N F L +++ L N+P+ +
Sbjct: 580 QDFPARSAPDPSPLFIRVSDGDANKDSRSDFAAQLAGFMACL-----VINVPSQAYWI-- 632
Query: 338 PSFFKKFNFSSAAVRLIASVPGYHTGSS 365
S K++F A L+ASVPG H+ S
Sbjct: 633 -SELTKYDFEGANGHLVASVPGIHSRRS 659
>gi|218194792|gb|EEC77219.1| hypothetical protein OsI_15757 [Oryza sativa Indica Group]
Length = 974
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 80/193 (41%), Gaps = 33/193 (17%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPL 248
++ + D+ W L C V +P + H + + A N +L P
Sbjct: 322 FIATFSSDVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQF 381
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
P I+FG HH K ++L +R+IV +ANL+ W+ + +W QDFP +
Sbjct: 382 PEEIAFGKDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCR 441
Query: 298 DQNNLS--------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
+ S + F L+ +++ F N ++ IN K+NF A
Sbjct: 442 TSTDYSALFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGA 493
Query: 350 AVRLIASVPGYHT 362
A LIASVPG +
Sbjct: 494 AGYLIASVPGIYA 506
>gi|342884381|gb|EGU84597.1| hypothetical protein FOXB_04892 [Fusarium oxysporum Fo5176]
Length = 632
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 32/203 (15%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEHMKRNKP 238
+ I +V Q D + +A+LS+Y D +WL+ P K+ +L+ +S+ M+ N P
Sbjct: 146 IKIEEVFQKDKLELALLSSYQWDDEWLMSKIDPRKTKL--LLLAFADSEAQKSEMRSNAP 203
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFP 295
P + G HSK LL YP +R++V TANL+ DW +++ D P
Sbjct: 204 PGIKFVFPAM-NGPGAMHSKLQLLKYPDYLRVVVPTANLVPYDWGETGVMENMVFLIDLP 262
Query: 296 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
+ F +L +LS E H F +
Sbjct: 263 RLKDPATYRQTAFSTELGRFLSATGVGEG-----MHLGF-------------------VY 298
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL 378
++PG H G SLK+ G+ L T +
Sbjct: 299 TIPGGHQGDSLKRIGYSGLGTTV 321
>gi|116192211|ref|XP_001221918.1| hypothetical protein CHGG_05823 [Chaetomium globosum CBS 148.51]
gi|88181736|gb|EAQ89204.1| hypothetical protein CHGG_05823 [Chaetomium globosum CBS 148.51]
Length = 670
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 85/387 (21%), Positives = 161/387 (41%), Gaps = 66/387 (17%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ +A++S++ D W+L + + +L+ S+ M+ N P
Sbjct: 226 IKIEEVLQKNDLKLAVVSSFQWDEHWMLSKIDI-TRTKLMLIAFAASEAQKAEMRANVPK 284
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFP- 295
N + P G HSK MLL Y R +RI+V T N + DW +++ D P
Sbjct: 285 NRVRFCFPPMHGIGAMHSKLMLLKYERYMRIVVPTGNFMSYDWGETGTMENMVFIIDLPK 344
Query: 296 --LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VR 352
+Q + F ++L +L A G + S + ++F+ A+ +
Sbjct: 345 FETAEQREAQKPDPFSSELFYFLR------------AQGLDEKLVSSLRNYDFTEASRYK 392
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL 410
+ ++PG HT W + ++++ + P+ F +SLG+++ +++ +
Sbjct: 393 FVHTIPGSHTDED--AWRRTAVSSLIRAT-------RDPIDIDFVCASLGAINYDFLSAM 443
Query: 411 -------------SSSMSSGFSE---DKTPLGIGEPL-IVWPTVEDVRCSLEG-YAAGNA 452
+ + S G E D+ + E + + +P+ E V S G AG
Sbjct: 444 YYACLGDPLVEYQARTGSKGQREAFNDRAQFLVKEHMRVFFPSRETVLQSKGGKEGAGTI 503
Query: 453 IPSPQKNVDKDFLKKYWAKWKASHTG---RSRAM------PHIK---TFARYNGQKLAKA 500
P F ++ K+ G S+ + P I+ A L+++
Sbjct: 504 CFKPIWWQAPTFPQQILRDCKSVRPGVLMHSKVIYIRPNDPGIRWNQCLAYVGSANLSES 563
Query: 501 AWGALQKNN----SQLMIRSYELGVLI 523
AWG L ++ ++L R++E GVLI
Sbjct: 564 AWGKLVRDRVTKKAKLTCRNWECGVLI 590
>gi|402072975|gb|EJT68632.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 629
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 111/469 (23%), Positives = 190/469 (40%), Gaps = 91/469 (19%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEHMKRNKP 238
++I V+Q D++ +A+LS++ D DWL P+ KI V E +E +
Sbjct: 204 ITIDQVLQKDMLQMAVLSSFQWDTDWLWRKVNPMKTKITLVAYAGNE----VEKAAVVES 259
Query: 239 ANWI--LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
A I L PP+ FG HSK LL +P +RI+V + NL+ DW G +
Sbjct: 260 ARGIARLCFPPMN-GFGYMHSKLQLLKFPGFLRIVVPSGNLVSYDWGE--TGTMENVVFI 316
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIA 355
D + + G E + + + L A G + +K++F+ ++ +
Sbjct: 317 IDLPPVGDLAGSEGNTLTSFGE----DLCYFLKAQGLEESLIKSLRKYDFTETSRYGFVH 372
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL--S 411
S+PG H G S + G+ L + + P+ SS+GSL K+ + L +
Sbjct: 373 SIPGSHMGDSWNQTGYCGLGRAVNKLGLA---TDQPIEVDLVASSIGSLTSKFCSALYKA 429
Query: 412 SSMSSGFSED-----KTPLGIGEPL------------IVWPTVEDVRCSLEGY-AAGNA- 452
SG E K G+G + +P+++ V S G +AG
Sbjct: 430 CQGDSGIKEHESKGAKAKNGMGGAASTTQAALAQRFRVYFPSLQSVVASRGGRNSAGTTC 489
Query: 453 -------IPSPQKNVDKDFL--KKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAWG 503
+PS + + +D++ ++ K RA ++A L+++AWG
Sbjct: 490 LQSRWWNLPSFPRELFRDYMNPRRVLVHSKIIFV---RAPSGGASWAYVGSANLSESAWG 546
Query: 504 ALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK---SGSTETSQIQ 556
L K+ + ++ R++E GV I+P+ H E+K G E + I
Sbjct: 547 KLVKDRTSSSPKMTCRNWESGV-IVPAGSGH-------------ELKHQGHGRAEGAGIC 592
Query: 557 KTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED---VPWSWDK 602
+ V + G +P+P LP Y+S D +PW D+
Sbjct: 593 GS--VGAVFEGC-----------VPLPMTLPGTEYASGDGTRLPWFIDQ 628
>gi|358380063|gb|EHK17742.1| hypothetical protein TRIVIDRAFT_82987 [Trichoderma virens Gv29-8]
Length = 528
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 96/203 (47%), Gaps = 24/203 (11%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS++ D +W++ + + +L+ + + M+ N P+
Sbjct: 96 ITIEEVFQKDQLELAVLSSFQWDEEWMMSKLDI-RRTKILLLAFAKDEAQKNLMRGNVPS 154
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
N PP+ G HSK LL YP +R+++ T NL+ DW +++ D P
Sbjct: 155 NIKFCFPPM-HGPGAMHSKLQLLKYPDRLRVVIPTGNLVPYDWGETGVMENMVFLIDLPR 213
Query: 297 ---KDQNNLSEECGFENDLIDYL-STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
+ GF +L+ +L ST + A+L ++FS ++ +
Sbjct: 214 LGNPATHPPQRPTGFYTELVYFLQSTGVGDKMVASL-------------SNYDFSKTSDI 260
Query: 352 RLIASVPGYHTGSSLKKWGHMKL 374
+ ++PG H+G++ K+ G+ L
Sbjct: 261 AFVHTIPGSHSGNAAKRTGYCGL 283
>gi|430811371|emb|CCJ31122.1| unnamed protein product [Pneumocystis jirovecii]
Length = 402
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 75/336 (22%), Positives = 128/336 (38%), Gaps = 54/336 (16%)
Query: 187 IQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI-LH 244
I+ DI+ A+LS +++D W+L L+K V + H +SD K + N + L
Sbjct: 101 IENDILKAAVLSAFVIDPIWVLSKIQ-LSKTIVVFIHHAKSD------KEKQAINELYLC 153
Query: 245 KPPLPISF------GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFP 295
P + F H K LL Y +R+++ +ANL+ DW +++ DFP
Sbjct: 154 FPNVSAIFPSMEGANCMHCKLQLLFYTTYLRVVIPSANLVDYDWGETGVMENSMYIHDFP 213
Query: 296 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
++ FE DL Y +P+ +FK+ S + +
Sbjct: 214 RRESAFTEFSTNFERDLFHYCKAKNYPDHILKKMQCYDFKM-----------SKNIHFVH 262
Query: 356 SVPGYHTGS-SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
S+P S LK G++ L +Q+ + SSLG L +M + ++
Sbjct: 263 SIPARALNSVDLKDTGYLSLARAVQKLGKASKNDIEINIIVTSSLGLLKSAFMTNIYRAL 322
Query: 415 SSGFSEDKTPLGIGEPLIVW--------PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLK 466
D++ L W P++ V S G + I F K
Sbjct: 323 KG----DQSIASYNMDLQSWKTSIKVHFPSINTVLSSNGGKESAGTIC---------FQK 369
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAW 502
++W + +S M H R + L+++AW
Sbjct: 370 QFWENLEFP---KSCLMHHKIILVRNSSANLSESAW 402
>gi|293335739|ref|NP_001168462.1| hypothetical protein [Zea mays]
gi|223948435|gb|ACN28301.1| unknown [Zea mays]
gi|414587433|tpg|DAA38004.1| TPA: hypothetical protein ZEAMMB73_810727 [Zea mays]
Length = 989
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/278 (22%), Positives = 111/278 (39%), Gaps = 40/278 (14%)
Query: 117 VSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWA 176
V NDG SK R + G +EE + D STF L R+ +
Sbjct: 228 VVNDGDPELFNGSKGCRDDSSEKPGCGSGNEEQYHSEGCYSDG--STFFLNRLADTGSNT 285
Query: 177 NT---SCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES----- 226
T S V++ ++ ++ ++ + +DI W L C + +P + H +
Sbjct: 286 QTEPQSGVTLPQLLHPVNSLVRVFIATFTLDISWFLNYCKIPQHLPVTIACHNKERCWSA 345
Query: 227 ---DGTLEHMKRNKPANWILHKPPLPISFG---------THHSKAMLLIYPRGVRIIVHT 274
+ T + + + + P I+FG HH K ++L +R+IV +
Sbjct: 346 SSENRTAAPFESHPKLLLVFPRFPEDIAFGKDRKKQGVACHHPKLIVLQREDSMRVIVTS 405
Query: 275 ANLIHVDWNNKSQGLWMQDFPLKDQNNLS--------EECGFENDLIDYLSTLKWPEFSA 326
ANL+ W+ + +W QDFP + + + ++ F L+ +++++
Sbjct: 406 ANLVPRQWHLITNTVWWQDFPCRTSPDYAALFSAAKKQKSDFAAQLVSFIASM------V 459
Query: 327 NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
N + I K++F A LIASVPG H S
Sbjct: 460 NEVRSQAYWITE--VAKYDFEGAGGYLIASVPGIHAQS 495
>gi|326521102|dbj|BAJ96754.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 646
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 98/236 (41%), Gaps = 44/236 (18%)
Query: 162 STFRLLRVQGLPAWANT---SCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKI 216
STF L R+ G+ S V++ ++ G ++ ++ + DI W L C + +
Sbjct: 267 STFFLNRLTGIRPEMRAEQHSGVTLPQLLHPVGSLLRVFIATFTSDISWFLDYCKIPQYL 326
Query: 217 PHVLVIHGE-------SDGTLEHMKRNKPANWILHKPPLP--ISFG---------THHSK 258
P + H + S+ N P N +L P P I+FG HH K
Sbjct: 327 PVTIACHNKDRCWSANSESRTAAPFENHP-NILLVYPRFPEVIAFGKDRKNQGVACHHPK 385
Query: 259 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 318
++L +R+I+ +ANL+ W+ + +W QDFP C D S
Sbjct: 386 LIVLQREDSMRVIISSANLVPRQWHLITNTVWWQDFP----------CRTSPDYSALFSA 435
Query: 319 LKWP--EFSANLPAHGNFKIN--PS------FFKKFNFSSAAVRLIASVPGYHTGS 364
+ P +F+A L + IN PS +++F A L+ASVPG + S
Sbjct: 436 FEGPKSDFAAQLVSFIGSLINEVPSQAYWINEIARYDFEGAGGYLVASVPGLYMPS 491
>gi|154272585|ref|XP_001537145.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150409132|gb|EDN04588.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 478
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 89/196 (45%), Gaps = 32/196 (16%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNK 237
+ +V+Q D+ +A+LS+YM ++DW+ + K L+I GE D E K
Sbjct: 286 IKFEEVVQKSDLELAVLSSYMWNVDWMFSKFDI--KTTRFLLIMGEKEEDKKRELENDTK 343
Query: 238 PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL----WMQ 292
+ L PP+ HSK MLL +P +RI+V +ANL+ DW + + ++
Sbjct: 344 SMGSVRLCFPPMEPQVNCMHSKLMLLFHPDYLRIVVPSANLVPFDWGEQGGVMENIVFLI 403
Query: 293 DFPLK--DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFS 347
D P K D +N + F ++L+ +L +N KK F+FS
Sbjct: 404 DLPRKSPDLDN-DPQTSFLDELVYFLQA---------------STVNEQIIKKMLRFDFS 447
Query: 348 SAA-VRLIASVPGYHT 362
+ + I ++ G HT
Sbjct: 448 ATKDIAFIHTIGGSHT 463
>gi|414587432|tpg|DAA38003.1| TPA: hypothetical protein ZEAMMB73_810727, partial [Zea mays]
Length = 816
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/278 (22%), Positives = 111/278 (39%), Gaps = 40/278 (14%)
Query: 117 VSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWA 176
V NDG SK R + G +EE + D STF L R+ +
Sbjct: 228 VVNDGDPELFNGSKGCRDDSSEKPGCGSGNEEQYHSEGCYSDG--STFFLNRLADTGSNT 285
Query: 177 NT---SCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES----- 226
T S V++ ++ ++ ++ + +DI W L C + +P + H +
Sbjct: 286 QTEPQSGVTLPQLLHPVNSLVRVFIATFTLDISWFLNYCKIPQHLPVTIACHNKERCWSA 345
Query: 227 ---DGTLEHMKRNKPANWILHKPPLPISFG---------THHSKAMLLIYPRGVRIIVHT 274
+ T + + + + P I+FG HH K ++L +R+IV +
Sbjct: 346 SSENRTAAPFESHPKLLLVFPRFPEDIAFGKDRKKQGVACHHPKLIVLQREDSMRVIVTS 405
Query: 275 ANLIHVDWNNKSQGLWMQDFPLKDQNNLS--------EECGFENDLIDYLSTLKWPEFSA 326
ANL+ W+ + +W QDFP + + + ++ F L+ +++++
Sbjct: 406 ANLVPRQWHLITNTVWWQDFPCRTSPDYAALFSAAKKQKSDFAAQLVSFIASM------V 459
Query: 327 NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
N + I K++F A LIASVPG H S
Sbjct: 460 NEVRSQAYWITE--VAKYDFEGAGGYLIASVPGIHAQS 495
>gi|357167454|ref|XP_003581171.1| PREDICTED: uncharacterized protein LOC100837648 [Brachypodium
distachyon]
Length = 987
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 86/202 (42%), Gaps = 35/202 (17%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANW 241
G ++ ++ + DI W L C + +P + H + + + N P N
Sbjct: 302 GSLLRVFITTFTSDICWFLDYCNIPQHLPVTIACHNKERCWSASRESRMAAPFVNHP-NV 360
Query: 242 ILHKPPLP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW 290
+L P P I+FG HH K ++L +R+I+ +ANL+ W+ + +W
Sbjct: 361 LLVYPQFPEVIAFGKDRKKQGVACHHPKLIVLQREDSMRVIITSANLVPRQWHLITNTVW 420
Query: 291 MQDFPLKDQNNLSE--------ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFK 342
QDFP + + S + F L+ ++ +L +P+ + IN
Sbjct: 421 WQDFPCRTSPDYSAIFSAVEEPKSDFAVQLVSFIGSLI-----NEVPSQA-YWINE--IA 472
Query: 343 KFNFSSAAVRLIASVPGYHTGS 364
K+NF A L+ASVPG + S
Sbjct: 473 KYNFEGAGGYLVASVPGLYMPS 494
>gi|302653979|ref|XP_003018803.1| hypothetical protein TRV_07162 [Trichophyton verrucosum HKI 0517]
gi|291182481|gb|EFE38158.1| hypothetical protein TRV_07162 [Trichophyton verrucosum HKI 0517]
Length = 429
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 70/146 (47%), Gaps = 14/146 (9%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLL-----PACPVLAKIPHVLVIHGESDGTLEHMK 234
+ + +V+Q D+ +A+LS+++ D+DWLL P+ L ++ GE T +
Sbjct: 208 IKLEEVLQPSDLELAVLSSFLWDMDWLLMKFTNPSTRFL----FIMGAKGEERRTQLLRE 263
Query: 235 RNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLW 290
+ L PP+ HSK MLL + +RI++ +ANL DW K L+
Sbjct: 264 TASMSRIRLCFPPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEKGGVMENMLF 323
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYL 316
+ D P K + + F ++L+ +L
Sbjct: 324 LIDLPRKANETIDDTTPFRDELVYFL 349
>gi|159464062|ref|XP_001690261.1| predicted protein [Chlamydomonas reinhardtii]
gi|158284249|gb|EDP09999.1| predicted protein [Chlamydomonas reinhardtii]
Length = 424
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/31 (70%), Positives = 28/31 (90%)
Query: 267 GVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
G+R+++HTAN I+ D NNKSQGLW+QDFPLK
Sbjct: 174 GLRLVIHTANAIYADCNNKSQGLWVQDFPLK 204
>gi|320040691|gb|EFW22624.1| hypothetical protein CPSG_00523 [Coccidioides posadasii str.
Silveira]
Length = 651
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 97/405 (23%), Positives = 165/405 (40%), Gaps = 84/405 (20%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ +V+Q D+ +A+LS++ ++DWL V K L++ G E KR
Sbjct: 212 IKFEEVVQKDDLELAVLSSFQWNMDWLFTKFNV--KKTRFLLVMGHK---YEEEKRQTQK 266
Query: 240 NWI------LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGL 289
++ L P+ HSK MLL +P +R++V +ANL+ DW + L
Sbjct: 267 DFADIPSIRLCFVPMGPQVNCMHSKLMLLFHPNHLRLVVPSANLVPYDWGEQGGIIENLL 326
Query: 290 WMQDFPLKDQNNLSEECG-FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS- 347
++ D P K + + F ++L+ +L E KI +F+F
Sbjct: 327 FLIDLPRKILGSQEKTSTPFFDELVYFLKASALHE-----------KIIAK-LSEFDFGK 374
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLV--YQFSSLGSLDEK 405
+A + ++ G HTGS WG + + + T PL Y SSLGSL+++
Sbjct: 375 TAGFAFVHTIGGSHTGSD---WGKTGVCGLGKAVTMLGLQTPQPLKLDYVTSSLGSLNDQ 431
Query: 406 WM-------------AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTVEDVRCS 443
+M EL+ S F DK + + + LI +P+++ V+ S
Sbjct: 432 FMRSMYLAAQGDNGLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSLKTVQGS 491
Query: 444 LEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FAR----------- 491
+ I K ++ ++ + S + R + H KT F R
Sbjct: 492 RARPSGAGTICFQSKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGKIIGDAN 549
Query: 492 ---YNG------QKLAKAAWGALQKNNS----QLMIRSYELGVLI 523
Y G L+++AWG L + S +L R++E GV+I
Sbjct: 550 TTAYQGWTYVGSANLSESAWGRLVIDRSTTKPKLNCRNWECGVII 594
>gi|302406010|ref|XP_003000841.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium albo-atrum VaMs.102]
gi|261360099|gb|EEY22527.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium albo-atrum VaMs.102]
Length = 586
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 105/481 (21%), Positives = 180/481 (37%), Gaps = 95/481 (19%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
V G P + I +V++ D + +A++S++ D WLL A+ V + + ++
Sbjct: 156 VHGFPR--TNDDIKIEEVLEKDKLELAVVSSFQWDEPWLLSKVDT-ARTRMVFIAYAKNG 212
Query: 228 GTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS 286
E ++ + P++ I L PP+ G HSK LL Y +RI+V + NL+ DW
Sbjct: 213 AEQETLRASVPSSRIKLCFPPM-YGIGCMHSKLQLLKYQNHLRIVVPSGNLVPYDWGETG 271
Query: 287 ---QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+++ D P Q + + ND + F L A G +
Sbjct: 272 VLENMVFLIDLPRIVQASGDGDAIRGNDAAGVSFGTELRRF---LRAQGLDESLVKSLDN 328
Query: 344 FNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL 402
F+F+ + R I ++ G HT + G+ L + P+ + +
Sbjct: 329 FDFTETERFRFIHTIAGGHTDQLSGETGYHGLSRAVHSLGLS---TDEPITVDYVAQQDQ 385
Query: 403 DEKWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVD 461
++ + + + + +G + I +PT + V S G AAG I
Sbjct: 386 NDGGNQPSRRNTKTALNATDSQKALGVKMRIYFPTEDTVARSRGGKAAGGTIC------- 438
Query: 462 KDFLKKYWAK-------WKASHTGRSRAMPHIK-TFARYN---GQ-------------KL 497
F +K+W + S + R + H K F + N GQ L
Sbjct: 439 --FQEKWWGSATFPREMLRDSISTRPGVLMHDKIIFVQPNSTGGQDDPGAGWAYVGSANL 496
Query: 498 AKAAWGALQK----NNSQLMIRSYELGVLI--LPSAKRHGCGFSCTSNIVPSEIKSGSTE 551
+++AWG L K ++L R++E GVL+ + R G S
Sbjct: 497 SESAWGRLTKERGSGRAKLTCRNWECGVLVPTRTTGDRSSGGLS---------------- 540
Query: 552 TSQIQKTKLVTLTWHGSSDAGASSEVVY--LPVPYELPPQRY------SSEDVPWSWDKR 603
G+ +AG E +PVP P + Y ++ D PW + KR
Sbjct: 541 ---------------GAGEAGKMLEAFRGAVPVPMVAPSRAYGTSSNDTAADRPWLFMKR 585
Query: 604 Y 604
Y
Sbjct: 586 Y 586
>gi|226294747|gb|EEH50167.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides brasiliensis Pb18]
Length = 589
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 56/113 (49%), Gaps = 6/113 (5%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR 235
N + I +VIQ D+ +A+LS+Y+ D DWL + K ++I GE + +
Sbjct: 221 NGDDIKIEEVIQKSDLELAVLSSYIWDADWLFSKFDI--KKSRFILIMGEKEEDKKRELE 278
Query: 236 NKPANW---ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
N + L PP+ HSK MLL + +RI++ +ANLI DW K
Sbjct: 279 NDTKSMGSVRLCFPPMEPQVNCMHSKLMLLFHLNHLRIVIPSANLIPFDWGEK 331
>gi|388851550|emb|CCF54740.1| uncharacterized protein [Ustilago hordei]
Length = 665
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 78/166 (46%), Gaps = 21/166 (12%)
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC----GFE 309
T H K ++L++ +R+ + + NL VDW+ G+++QDFPLK S G E
Sbjct: 285 TQHMKFLVLVHEGWLRVAIASGNLNEVDWSRIENGVFIQDFPLKGGEGSSARAEGRGGVE 344
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS--SAAVRLIASVPGYHTGSSLK 367
ND + L TL S P+H + + +F+FS A R++AS P SSL+
Sbjct: 345 NDFKEQL-TLVLKSLSVP-PSHPVW----TALDRFDFSLGGARARIVASWP---EASSLQ 395
Query: 368 KW------GHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
W G +L V+++ + Q SSL + D KW+
Sbjct: 396 GWDRIETQGLGRLGKVVRDLDIPAVKGGMEVECQGSSLANHDLKWI 441
>gi|389739055|gb|EIM80250.1| phospholipase D/nuclease [Stereum hirsutum FP-91666 SS1]
Length = 698
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 126/314 (40%), Gaps = 45/314 (14%)
Query: 171 GLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHGES 226
G P + TS + + + AI+S+Y + + W+ P+ PV+ ++ E+
Sbjct: 217 GKPVFGLTSIIGDKS----QVAFAIISSYALQLSWIYEFFDPSTPVV-----MVAQPTEA 267
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNK 285
+ + +K P NWI P L +G H M + Y G +RI + TANL+ DW +
Sbjct: 268 EKGQKTIKEILP-NWIRVTPFLRSGYGVMH---MKIFYKSGRLRIAISTANLVDFDWKDI 323
Query: 286 SQGLWMQDFPLKDQ--NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP----- 338
+W+QD P + + + + F L L +L H + P
Sbjct: 324 ENTVWIQDVPQRSKPIPHDPKADDFPTAFERVLKALNVEPALTSL-VHNDHPTIPLSSLH 382
Query: 339 --SFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSP---- 391
S ++FS L+ S+ G H + + G L ++E E G
Sbjct: 383 PGSLRTAYDFSRVKAHLVPSLAGKHEHWPQVLRVGETALMKAVREIGCEVGSGSGGGKRG 442
Query: 392 ---LVYQFSSLGSLDEKWMAELSSSMSSGFSE---DKTPLGIGE------PLIVWPTVED 439
+ YQ SS+G+ +W+ E S E DKT + I++PT E
Sbjct: 443 KLRVEYQGSSIGTYSTQWINEFYICASGTSPEKYLDKTKASKSKLPYPDSMTILFPTREW 502
Query: 440 VRCSLEGYAAGNAI 453
V+ S+ G A G +
Sbjct: 503 VKGSVLGEAGGGTM 516
>gi|171686654|ref|XP_001908268.1| hypothetical protein [Podospora anserina S mat+]
gi|170943288|emb|CAP68941.1| unnamed protein product [Podospora anserina S mat+]
Length = 438
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 3/103 (2%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
V I +V+Q DI+ +A++S++ D DW+L + ++ L+ + +S+ E M+ N P
Sbjct: 254 VKIEEVLQKDILELAVISSFQWDEDWMLSKIDI-SRTKLYLIAYAKSEAQNE-MRNNVPK 311
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDW 282
+ I P + G HSK MLL Y +R++V T N + DW
Sbjct: 312 SRIRFCFPAMQAVGAMHSKLMLLKYEGYLRVVVPTGNFMSYDW 354
>gi|322701752|gb|EFY93501.1| tyrosyl-DNA phosphodiesterase, putative [Metarhizium acridum CQMa
102]
Length = 267
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 66/158 (41%), Gaps = 30/158 (18%)
Query: 469 WAKWKASHTGRSRAMPHIKTFARYNGQ-----------KLAKAAWGALQKNNSQLMIRSY 517
W + S+T + T+ RYN + ++K AWG ++ + +L + S+
Sbjct: 126 WVIYDPSYTTGPTTVQTALTYIRYNEKGSIDWAMLTSANISKQAWGEAERPSGELRVASW 185
Query: 518 ELGVLILPSAKRHGCGFSCT-SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 576
E+GVL+ P T + VP E K S GA
Sbjct: 186 EIGVLVWPGLVGQDVSMVGTFQSDVPKEPKE------------------QADSKTGAGGV 227
Query: 577 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 614
++ + +PY LP QRY + +VPW ++ + D +G+ W
Sbjct: 228 LIGVRIPYSLPLQRYGAGEVPWVATMKHGEPDRFGRQW 265
>gi|367050628|ref|XP_003655693.1| hypothetical protein THITE_2130975 [Thielavia terrestris NRRL 8126]
gi|347002957|gb|AEO69357.1| hypothetical protein THITE_2130975 [Thielavia terrestris NRRL 8126]
Length = 657
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 2/105 (1%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q + +A+LS+Y D+ WLL LA+ +L+ + E M+ P
Sbjct: 240 IKIEEVLQKQQLELAVLSSYQWDVRWLLSKVD-LARTKLILIAFAADEAHKEEMRNAVPR 298
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN 284
I P G+ HSK LL Y + +RI+V T NL+ DW
Sbjct: 299 ERIRFCFPPMQPVGSMHSKLQLLKYEKYMRIVVPTGNLMSFDWGE 343
>gi|327295831|ref|XP_003232610.1| hypothetical protein TERG_06602 [Trichophyton rubrum CBS 118892]
gi|326464921|gb|EGD90374.1| hypothetical protein TERG_06602 [Trichophyton rubrum CBS 118892]
Length = 677
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 70/146 (47%), Gaps = 14/146 (9%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLL-----PACPVLAKIPHVLVIHGESDGTLEHMK 234
+ + +V+Q D+ +A+LS+++ D+DWLL P+ L ++ GE +
Sbjct: 217 IKLEEVLQPSDLELAVLSSFLWDMDWLLMKFTNPSTRFL----FIMGAKGEERRAQLLRE 272
Query: 235 RNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLW 290
+ L PP+ HSK MLL + +RI++ +ANL DW K L+
Sbjct: 273 TASMSRIRLCFPPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEKGGVMENMLF 332
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYL 316
+ D P K +++ F ++L+ +L
Sbjct: 333 LIDLPRKANETVNDTTPFRDELVYFL 358
>gi|121703656|ref|XP_001270092.1| tyrosyl-DNA phosphodiesterase domain protein [Aspergillus clavatus
NRRL 1]
gi|119398236|gb|EAW08666.1| tyrosyl-DNA phosphodiesterase domain protein [Aspergillus clavatus
NRRL 1]
Length = 683
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 105/236 (44%), Gaps = 23/236 (9%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D+ +A+LS+++ D++W +LV+ + D T + +
Sbjct: 238 ITIEEVFQKDDLELAVLSSFIWDMEWFFSKLDT-KHSRFLLVMQAKDDATKRQYEAETAS 296
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL +P +RI+V TANL DW ++ D
Sbjct: 297 MRNLRLCFPPMDGQINCMHSKLMLLFHPEYLRIVVPTANLTPYDWGEMGGVMENSAFLID 356
Query: 294 FP--LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P ++ + F DL+ +LS + E N+ A K+ F++ + +
Sbjct: 357 LPRKSSTLSSSDSKTAFLEDLVFFLSASRLHE---NVIA----KLGDYDFRE----TKHI 405
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
L+ ++ G H + K G L ++ FK + Y SS+GSL ++++
Sbjct: 406 MLVHTIGGSHI-ENFSKTGFCGLGRAVKALGLST-FKSISIDYVTSSVGSLTDEFL 459
>gi|296415071|ref|XP_002837215.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633076|emb|CAZ81406.1| unnamed protein product [Tuber melanosporum]
Length = 603
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 105/243 (43%), Gaps = 28/243 (11%)
Query: 181 VSIRDVIQGD-IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHG--ESDGTLEHMKRNK 237
++ +V+Q + + VA+LS + DIDW+L P+ V+V+H E D + + +
Sbjct: 236 ITFEEVLQKESLCVAVLSAFQWDIDWVLKKLPLDTIQRLVMVMHAKEEQDRSYKVQQLGS 295
Query: 238 PANWILHKPPLPISFGTHHSKAMLLIYPRG----VRIIVHTANLIHVDWNN----KSQGL 289
L PP+ HSK MLL + G +R+ V +ANL DW +
Sbjct: 296 LPRTTLVLPPMQGQVSCMHSKLMLLFHMNGDQRWLRVAVPSANLTDYDWGELGGVMENTV 355
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
++ D P + N + F +L + + PE N G ++ + S K F
Sbjct: 356 FIIDLPRLPKPN-HNQTHFAKELHHFCAAKGMPEDVLN----GLYRYDFSRTKDMAF--- 407
Query: 350 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWM 407
+ S+ G + G ++ G+ L T ++ G L + F SSLG+ + ++
Sbjct: 408 ----VHSIGGSNAGKDWRRTGYSGLGTAVKALGLSSG---PGLEFDFVTSSLGAANMGFI 460
Query: 408 AEL 410
+ +
Sbjct: 461 SNM 463
>gi|307109628|gb|EFN57866.1| hypothetical protein CHLNCDRAFT_143336 [Chlorella variabilis]
Length = 213
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 63/139 (45%), Gaps = 31/139 (22%)
Query: 483 MPHIKTFARYN--------------GQKLAKAAWGALQKNNSQLMIRSYELGVLILP--- 525
MPH K + R++ L+KAAWG L+ + SQL I SYELGVL+LP
Sbjct: 1 MPHSKAYLRWSHGDHGPEIAWCYVGSHNLSKAAWGCLELDASQLHICSYELGVLLLPRLE 60
Query: 526 SAKR--HGCGFSCTSN------IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 577
+A R CGFSCT ++ + + + L W D+ A+ V
Sbjct: 61 AAYRTSRWCGFSCTGGQPGAAAPRLAQAAAAAGAAGTAAVPSVRFLQWR-QGDSQAAEMV 119
Query: 578 -----VYLPVPYELPPQRY 591
V LPVP+ LPP Y
Sbjct: 120 QGQLGVPLPVPFHLPPVPY 138
>gi|389632429|ref|XP_003713867.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae 70-15]
gi|351646200|gb|EHA54060.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae 70-15]
Length = 636
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 113/248 (45%), Gaps = 20/248 (8%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGES 226
+QG P ++ ++I +V+Q D + +A+LS++ D +WL P K + E+
Sbjct: 168 LQGQPR--SSQDITIEEVLQKDQLELAVLSSFAWDPEWLWTKVDPTKTKTTLIAFAGNEA 225
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS 286
D + + + L PP+ + G HSK LL +P +RI+V + NL+ DW ++
Sbjct: 226 D--QKEVTASAQGVARLCFPPMNGN-GCMHSKLQLLKFPGYLRIVVPSGNLVPYDWGEQN 282
Query: 287 QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG-NFKINPSFFKKFN 345
G+ + D L E++ + E S L A G N +I S +K++
Sbjct: 283 -GIMENSVFIIDLPPLKAGVKLEDNTLTSFGE----ELSYFLTAQGLNERIINS-LRKYD 336
Query: 346 FS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSL 402
FS ++ + ++ G HTG ++ G+ L +Q P+ F SS+G+L
Sbjct: 337 FSQTSRYAFVHTIAGVHTGDKWRRTGYCGLGRAIQNLGLA---TDEPVEIDFVASSMGAL 393
Query: 403 DEKWMAEL 410
++ L
Sbjct: 394 KYGYLLAL 401
>gi|392861898|gb|EAS37505.2| tyrosyl-DNA phosphodiesterase [Coccidioides immitis RS]
Length = 672
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 95/400 (23%), Positives = 167/400 (41%), Gaps = 74/400 (18%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRN--- 236
+ +V+Q D+ +A+LS++ ++DWL V K +LV+ + + + +++
Sbjct: 233 IKFEEVVQKDDLELAVLSSFQWNMDWLFTKFNV-KKTRFLLVMGHKYEEEKQQTQKDFAD 291
Query: 237 KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQ 292
P+ + P P HSK MLL +P +R++V +ANL+ DW + L++
Sbjct: 292 IPSIRLCFVPMGP-QVNCMHSKLMLLFHPNHLRLVVPSANLVPYDWGEQGGIIENLLFLI 350
Query: 293 DFPLKDQNNLSEECG-FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
D P K + + F ++L+ +L E KI +F+F +A
Sbjct: 351 DLPRKILGSQEKTSTPFFDELVYFLKASALHE-----------KIIAK-LSEFDFGKTAG 398
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--- 407
+ ++ G HTGS K G L + E + L Y SSLGSL++++M
Sbjct: 399 FAFVHTIGGSHTGSYWGKTGVCGLGKAVTMLGLETP-QPLKLDYITSSLGSLNDQFMRSM 457
Query: 408 ----------AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTVEDVRCSLEGYA 448
EL+ S F DK + + + LI +P+++ V+ S +
Sbjct: 458 YLAAQGDNGLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSLKTVQGSRARPS 517
Query: 449 AGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FAR--------------YN 493
I K ++ ++ + S + R + H KT F R Y
Sbjct: 518 GAGTICFQSKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGKIIGDANTTAYQ 575
Query: 494 G------QKLAKAAWGALQKNNS----QLMIRSYELGVLI 523
G L+++AWG L + S +L R++E GV+I
Sbjct: 576 GWTYVGSANLSESAWGRLVIDRSTTKPKLNCRNWECGVII 615
>gi|295668965|ref|XP_002795031.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226285724|gb|EEH41290.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 668
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +VIQ D+ +A+LS+Y+ D DWL + K ++I GE + + N
Sbjct: 231 IKIEEVIQKSDLELAVLSSYIWDADWLFSKFDI--KKSRFILIMGEKEEDKKRELENDTK 288
Query: 240 NW---ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
+ L PP+ HSK MLL + +RI++ +ANLI DW K
Sbjct: 289 SMGSVRLCFPPMEPQVNCMHSKLMLLFHLNYLRIVIPSANLIPFDWGEK 337
>gi|342320632|gb|EGU12571.1| Endoplasmic reticulum Ca-transporting P-type ATPase [Rhodotorula
glutinis ATCC 204091]
Length = 1978
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 88/386 (22%), Positives = 143/386 (37%), Gaps = 86/386 (22%)
Query: 253 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN-LSEECG-FEN 310
G H+K ++ + RI++ TAN + DW+ ++ DFP + + ++EE F+N
Sbjct: 1630 GIMHTKLLIFYHEDFCRIVIPTANAVSYDWSQIDNAFYVHDFPRRRSASPVNEESNPFKN 1689
Query: 311 DLIDYLSTLKWPE-FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 369
S + + +P H + S F+ SS V+L+ S G + K
Sbjct: 1690 PTHTQFSKKSFQVCYYLGIPKH---ILQESLHYDFS-SSTDVQLVHSNQGKFPAADYDKG 1745
Query: 370 GHMK-LRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL---------SSSMSSGFS 419
G + L + F G + SS+G W+ ++ S+ SG
Sbjct: 1746 GGIAGLAKAVSAFGFASG-GHWEIEVTGSSIGQYSSTWLTQMLAACSGIHPSTYFRSGKG 1804
Query: 420 ED------KTPLGIGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQK-----NVDKDFL 465
D KTP G L I++PT +++ S G G I P K K
Sbjct: 1805 NDVPSQLPKTPSGQPTRLPIKIIFPTQDEILSSPGGAGHGGTIFCPSKTWNSLTFPKHLF 1864
Query: 466 KKYWAKWK--ASHT---------GRSRAMPHIKTFARYNGQKLAKAAWGALQ--KNNSQL 512
+ +K K +HT ++ P + F +AWG LQ K+ QL
Sbjct: 1865 HRGESKRKNIPAHTKIILGLHRFAKAPTPPVHEGFIYLGSHNFTPSAWGRLQNGKDGPQL 1924
Query: 513 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAG 572
+YELGV++ +++ S E + + T+LVT
Sbjct: 1925 FCNNYELGVVL--------------------TLRASSAEELEAKATELVT---------- 1954
Query: 573 ASSEVVYLPVPYELPPQRYSSEDVPW 598
Y+ P +Y DVPW
Sbjct: 1955 -----------YKRPLVKYGPNDVPW 1969
>gi|440802395|gb|ELR23324.1| hypothetical protein ACA1_069080 [Acanthamoeba castellanii str.
Neff]
Length = 675
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 8/95 (8%)
Query: 33 VIGR--TNIPVSDKRLSRKHITLTASADGSASLVVDGTNPVVVK----SGDQRKKLSSNE 86
V+GR +P SDKR SRK L +GS SLV G NP +K G + L NE
Sbjct: 2 VLGRGLCGVPSSDKRCSRKQAELMLGRNGSLSLVPRGVNPAYLKRAADKGGEAVMLQRNE 61
Query: 87 HVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDG 121
S+ DGD+ L+ + + + L SQ+R + +
Sbjct: 62 KYSLEDGDVFTLV--ANCYPFTVLRCSQERPTKEA 94
>gi|189207467|ref|XP_001940067.1| tyrosyl-DNA phosphodiesterase domain containing protein
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187976160|gb|EDU42786.1| tyrosyl-DNA phosphodiesterase domain containing protein
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 564
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 81/391 (20%), Positives = 162/391 (41%), Gaps = 59/391 (15%)
Query: 175 WANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT---- 229
+ T+ ++I ++++ + +A++ ++ D W+ +I + +++ + G
Sbjct: 142 YPRTNDITIDELLEAPQVNIAVICSFQYDSSWMYEKLDP-TRIKQIWLMYSKFRGEDIRE 200
Query: 230 --LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN--- 284
+ ++ N LH PP+ + HSK MLL +RI + TAN+ DW
Sbjct: 201 KLIREWTESRIPNMKLHFPPMDGMIVSMHSKFMLLFGKEKLRIAIPTANMTQTDWGEVGN 260
Query: 285 ------KSQGLWMQDFPLKDQNN---LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK 335
+++ D P + + + E F DLI + LK + + +
Sbjct: 261 DWQPGVMENSVFVIDLPRRSDDGVGKVEELPSFGRDLIFF---LKAQQVESRVTGG---- 313
Query: 336 INPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVY 394
KF+F+ + + S+ G H + G L ++E ++ + L Y
Sbjct: 314 -----VLKFDFADTKHLAFVHSIGGSHKEELERPTGLPGLANAVRELQYDD-VEHIELDY 367
Query: 395 QFSSLGSLDEKWMAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGN 451
SSLG++++ +++ + ++ F++D P I +PT E V S+ G N
Sbjct: 368 AASSLGAINDTFLSRIHLAARGKNFTQDNAAVPDVRDHFRIYFPTNETVEKSIGGSGCAN 427
Query: 452 AIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK-TFAR------------YNGQ-K 496
I +K + F K+ + ++ G + H K FAR Y G
Sbjct: 428 IISLSKKYYNASTFPKECLRDYDSTRRG---MLSHNKLLFARGRRTDGRPFAWVYVGSAN 484
Query: 497 LAKAAWGALQKNNS----QLMIRSYELGVLI 523
++++AWG + S L +R++E GV++
Sbjct: 485 ISESAWGGQKVLKSGKVGALNVRNWECGVIV 515
>gi|330927762|ref|XP_003301988.1| hypothetical protein PTT_13657 [Pyrenophora teres f. teres 0-1]
gi|311322883|gb|EFQ89910.1| hypothetical protein PTT_13657 [Pyrenophora teres f. teres 0-1]
Length = 572
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 90/419 (21%), Positives = 176/419 (42%), Gaps = 57/419 (13%)
Query: 175 WANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT---- 229
+ T+ ++I ++++ + +A++ +Y D W+ K+ + +++ + G
Sbjct: 150 YPRTNDITIDELLEAPHVNIAVICSYQYDSSWMYEKLDP-TKVKQIWLMYAKFRGEDIRE 208
Query: 230 --LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN--- 284
L+ ++ N LH PP+ + HSK MLL +RI + TAN+ DW
Sbjct: 209 KLLQEWAESRVPNMRLHFPPMDGMIVSMHSKFMLLFGKEKLRIAIPTANMTPTDWGEVGN 268
Query: 285 ------KSQGLWMQDFPLKDQNN---LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK 335
+++ D P + + + + F DL+ + LK E + K
Sbjct: 269 DWQPGVMENSVFLIDLPRRSDDGVGKVEDLPPFGRDLVFF---LKAQEVGS--------K 317
Query: 336 INPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVY 394
+ KF+F+ + + S+ G H S + G L ++E ++ + L Y
Sbjct: 318 VTDGVL-KFDFADTKHLAFVHSIGGSHKEESERPTGLPGLANAVRELQYDD-VEHLELDY 375
Query: 395 QFSSLGSLDEKWMAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGN 451
SSLG++++ +++ + ++ F++D P I +PT + V S G N
Sbjct: 376 AASSLGAINDTFLSRIYLAARGKSFTKDNAVVPDVRDHIRIYFPTNDTVEKSTGGPDCAN 435
Query: 452 AIPSPQKNVD-----KDFLKKYWAKWKA--SHT----GRSRAMPHIKTFA-RYNGQ-KLA 498
I +K + K+ L+ Y + + SH R R + K FA Y G ++
Sbjct: 436 IISLSRKYYNASTFPKECLRDYVSTRRGMLSHNKLLFARGRRT-NGKPFAWVYVGSANIS 494
Query: 499 KAAWGALQKNNS----QLMIRSYELGVLI-LPSAKRHGCGFSCTSNIVPSEIKSGSTET 552
++AWG + S L +R++E GV++ +P K + + P + G+ E
Sbjct: 495 ESAWGGQKVLKSGKVGALSVRNWECGVMVPVPDDKLEQVDLKADA-VPPMSVFEGTVEV 552
>gi|342319803|gb|EGU11749.1| Proteophosphoglycan 5 [Rhodotorula glutinis ATCC 204091]
Length = 564
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/323 (22%), Positives = 128/323 (39%), Gaps = 59/323 (18%)
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-LKDQNNLS 303
+P P + G HSK LL YP + +++ + N + +D + ++ P +
Sbjct: 211 RPLYPWASGCAHSKFFLLFYPGFLLLVITSCNTMRIDMDLSDNHWYIHALPEIPPGKKRK 270
Query: 304 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA--VRLIASVPGYH 361
+ FE+DL+ ++ L WPE ++ K++F SA V L+ASVPG
Sbjct: 271 AKTTFEHDLLAHMLDLDWPE-----------ELVSRVRGKYDFRSAEGRVHLVASVPGTK 319
Query: 362 TGSSLK-KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
+ + +G ++L + ++ + + S+ SL +W+ + +
Sbjct: 320 RATDDEGSYGMLRLNALARQIIPPSVRPDIDMEFCAGSVNSLPPEWIDQTDKLLRGRDLS 379
Query: 421 DKTPL---GIGEP----------LIVWPTVEDV-RCSLEGYAAGNAI---------PSPQ 457
P+ G+ EP IV+PT V CS + A + I P
Sbjct: 380 RAVPVTKPGVPEPPVSLNNLPEWSIVFPTKATVAACSPQVIEAASNIGCCLNNAKWPETS 439
Query: 458 KNVDKDFLK------------KYWAKWKASHTGRSRAMPHIKTFARYNGQKLAKAAWGAL 505
V F K++ +WK S A P + +N L+KAA G +
Sbjct: 440 NEVRSMFFDYGSKDPGRLFHMKFY-QWKDSRNKDPSAPPLMVYLGSHN---LSKAALGEV 495
Query: 506 QKNNS-----QLMIRSYELGVLI 523
+ S ++ ++ELGV+I
Sbjct: 496 SRLKSGAGDVRIKCNNFELGVVI 518
>gi|388580252|gb|EIM20568.1| phospholipase D/nuclease [Wallemia sebi CBS 633.66]
Length = 417
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 74/140 (52%), Gaps = 8/140 (5%)
Query: 250 ISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN----NLSE 304
+ GT+H+K L+ G +R++V TAN I +DW ++MQDFPLK Q + +
Sbjct: 5 FAHGTYHAKFALIFTTDGWLRVVVTTANFIPIDWMWNENTVFMQDFPLKGQTLGGESSEQ 64
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHT 362
+ F++D +L LK + + P+ K++FS + RLI+S+ ++
Sbjct: 65 KSAFQSDWTWFLYKLKLNKSLKLVADQMPDTPLPNVDAVNKWDFSRSKARLISSISETYS 124
Query: 363 G-SSLKKWGHMKLRTVLQEC 381
G +++K GH +L ++++
Sbjct: 125 GLENIRKVGHFRLADLVRQA 144
>gi|396484884|ref|XP_003842038.1| hypothetical protein LEMA_P077980.1 [Leptosphaeria maculans JN3]
gi|312218614|emb|CBX98559.1| hypothetical protein LEMA_P077980.1 [Leptosphaeria maculans JN3]
Length = 588
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 6/114 (5%)
Query: 174 AWANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT--- 229
A+ T+ +SI +++Q I +A++S++M D DWL + K+ + V++ +
Sbjct: 332 AYPRTNDISIDELLQTPSIHMAVISSFMWDADWLHKKLDPI-KVKQIWVMNAKGKDVQKR 390
Query: 230 -LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDW 282
L+ MK N LH PP+ + HSK +LL + +R V TAN+ +DW
Sbjct: 391 WLQEMKDTGVPNLTLHFPPMHGMIQSMHSKFLLLFGKKKLRFAVPTANMTCIDW 444
>gi|296810424|ref|XP_002845550.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma otae CBS 113480]
gi|238842938|gb|EEQ32600.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma otae CBS 113480]
Length = 672
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 66/146 (45%), Gaps = 12/146 (8%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D+ +A+LS+++ D+DWLL L I G + + A
Sbjct: 309 IKIEEVFQPSDLELAVLSSFLWDMDWLL--LKFTNPKTRFLFIMGAKGEEKQKQLLEETA 366
Query: 240 NW---ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQ 292
+ L PP+ HSK MLL +P +RI+ TANL DW K L++
Sbjct: 367 SMPRIRLCFPPMEGEVNCMHSKLMLLFHPGYLRIVTPTANLDPYDWGEKGGEMENMLFLI 426
Query: 293 DFPLKDQ--NNLSEECGFENDLIDYL 316
D P K + + F ++L+ +L
Sbjct: 427 DLPRKSDGGTGIDDATPFRDELVYFL 452
>gi|70984252|ref|XP_747643.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66845270|gb|EAL85605.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 277
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 85/183 (46%), Gaps = 29/183 (15%)
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDF 294
+N L PP+ HSK MLL +P +RI+ TANL DW ++ D
Sbjct: 2 SNLRLCFPPMEGQVNCMHSKLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSAFLIDL 61
Query: 295 PLK-DQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
P K ++ + FE +L+ +L STL+ S +F+FS ++
Sbjct: 62 PRKVATTSVGSKTVFEEELVYFLRASTLQENIISR--------------LDEFDFSPTSH 107
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWM 407
+ L+ ++ G HTG++ ++ G+ L + G + S P+ F SS+GSL ++++
Sbjct: 108 IMLVHTIGGSHTGNTWRRTGYCGLGRAVNAL----GLRTSKPINIDFVASSVGSLTDEFL 163
Query: 408 AEL 410
+
Sbjct: 164 RSI 166
>gi|320587853|gb|EFX00328.1| mitochondrial translation optimization protein [Grosmannia
clavigera kw1407]
Length = 1223
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 86/379 (22%), Positives = 151/379 (39%), Gaps = 55/379 (14%)
Query: 193 VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISF 252
+A+LS++ D +W++ V K +L+ + + M+ N P + + P +S
Sbjct: 142 LAVLSSFQWDEEWMMQHVDV-RKTKLLLIAYAADENQKVEMRENVPNSNVRFCFPPMLSV 200
Query: 253 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGFE 309
G HSK LL Y +RI+V T NL+ DW +++ D P L + G
Sbjct: 201 GAMHSKLQLLKYADYLRIVVPTGNLVPYDWGESGTIENMVFIIDLP-----RLPAQAGRI 255
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKK 368
+ +L L + L A + ++FS+ A + ++ G H S ++
Sbjct: 256 SGKTPFLDDLSY-----FLKAQAVDQSLVQSLDNYDFSATARYAFVHTISGSHAKDSWER 310
Query: 369 WGHMKLRTVLQECTFEKGFKKSPLV--YQFSSLGSLDEKWMAEL--SSSMSSGFSE---- 420
G+ L ++ + + PL Y SS+GSL + + L + +G E
Sbjct: 311 TGYCGLGRAIKSLGWA---TEEPLQLDYLCSSIGSLGDDLLNALYYACQGDTGMKEYEAR 367
Query: 421 -DKTPLGI----GEP------LIVWPTVEDVRCSLEGYAAGNAIPSPQK-----NVDKDF 464
+K G+ EP + +P+ + V S G I + +
Sbjct: 368 ANKPKKGVLASSSEPDWKSRMRVYFPSHQTVVRSRGGIRGAGTICFRRNWWESAKFPRKI 427
Query: 465 LKKYW--AKWKASHTGR---SRAMPHIKTFARYNGQKLAKAAWGALQKNNS----QLMIR 515
L+ Y K +HT R + + L+++AWG L K+ + +L R
Sbjct: 428 LRDYQNVKKGTLAHTKLLFVRREASSAQAWTYLGSANLSESAWGRLVKDRATKEPRLTCR 487
Query: 516 SYELGVLI----LPSAKRH 530
++E GVLI P A+R
Sbjct: 488 NWECGVLIPAVPRPEAERR 506
>gi|398406723|ref|XP_003854827.1| hypothetical protein MYCGRDRAFT_35953, partial [Zymoseptoria
tritici IPO323]
gi|339474711|gb|EGP89803.1| hypothetical protein MYCGRDRAFT_35953 [Zymoseptoria tritici IPO323]
Length = 266
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 101/253 (39%), Gaps = 45/253 (17%)
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFPLKDQNNLSEEC---GFE 309
HSK MLL +P +RI + TANL++ DW Q ++M D P +SE F
Sbjct: 20 HSKLMLLFHPDKLRIAIPTANLLNFDWGETGQMENSVFMVDLPRLADGKISEAGDLPAFG 79
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKK 368
+LI +L + + KF+FS+ + + +V G H ++
Sbjct: 80 QELIYFLEQQGLDD-----------DVRTGVL-KFDFSATKDMAFVHTVGGMHFRDEAER 127
Query: 369 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSS--------------- 413
G M L +++ + L + SS+G L++ ++ + S+
Sbjct: 128 TGLMGLSKAVKQLNLAT--QDLELDFAASSIGRLNDNYLRDFHSAAKGISLIAQAAEAKS 185
Query: 414 -MSSGFSEDKTPLGIGEP-------LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL 465
+S F + K + +P I +PT VR S G AAG + F
Sbjct: 186 KAASTFFDRKKASTVAKPDNVREKVRIYFPTASTVRVSTAG-AAGTLCIARNYFEGSTFP 244
Query: 466 KKYWAKWKASHTG 478
+ + +K++ TG
Sbjct: 245 RACFRDYKSTRTG 257
>gi|452845379|gb|EME47312.1| hypothetical protein DOTSEDRAFT_21105 [Dothistroma septosporum
NZE10]
Length = 584
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 104/490 (21%), Positives = 193/490 (39%), Gaps = 107/490 (21%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ + +V++ + A+LS + D +W+L K P ++G S + M+ P
Sbjct: 136 IKLEEVLEPSSVRTAVLSAFQWDTEWVLSKL----KTP----LNGGSTKCVFVMQAKTPD 187
Query: 240 NWILHK--------------PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
++ PP+ + HSK MLL +P +R+ + +ANL++ DW
Sbjct: 188 ERAQYREWASGFEACLRICLPPMDGAIYCMHSKLMLLFHPHKLRVAIPSANLLNFDWGET 247
Query: 286 SQ---GLWMQDFP-LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFF 341
Q ++M D P L + + E DL T E + G K
Sbjct: 248 GQMENSVFMIDLPRLAGSTSQTTE-----DL-----TFFGQELMFFIERQGLDKDLRKGV 297
Query: 342 KKFNFSSAA-VRLIASVPGY-HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
F+FS+ + I +V G + + + G + L ++ ++ + + SS+
Sbjct: 298 LGFDFSATEHMAFIHTVGGMNYERTGADRTGLLGLSRAVRYLGLTTDQRELEIDFAASSI 357
Query: 400 GSLDEKWMAELSSSMS-----SGFSEDKTPLG--------------------IGEPLIVW 434
G L++ + +L S+ S + +E K+ I + L V+
Sbjct: 358 GQLNDSQVQDLHSAASGQDLIAQAAEAKSKAATNFFAKKAASSKAASTSERDIKQKLRVY 417
Query: 435 -PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----------RSRAM 483
PT E V+ S G AAG + K F + + +K++ G RS+++
Sbjct: 418 FPTKETVQASTAG-AAGTICLQRKYFEGKTFPRAIFRDYKSTRKGLLSHNKILCARSKSL 476
Query: 484 PHIKTFARYNGQ-KLAKAAWGALQKNNSQLMI--RSYELGVL------ILPSAKRHGCGF 534
+ Y G ++K+AWG + K+ + I R++E GVL ILP A +
Sbjct: 477 AWL-----YIGSANMSKSAWGEIPKDRKERRITCRNWECGVLLPVPKEILPPACKEKARR 531
Query: 535 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 594
T + SE S E + + +L + +P+E+P Y+
Sbjct: 532 RHTDDEEDSETDSEDEEPQLVDMSVFSSL----------------VDLPFEVPGDDYNGR 575
Query: 595 DVPWSWDKRY 604
+ PW + +++
Sbjct: 576 E-PWYFTEKH 584
>gi|169625658|ref|XP_001806232.1| hypothetical protein SNOG_16105 [Phaeosphaeria nodorum SN15]
gi|160705700|gb|EAT76477.2| hypothetical protein SNOG_16105 [Phaeosphaeria nodorum SN15]
Length = 895
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 77/156 (49%), Gaps = 18/156 (11%)
Query: 178 TSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT----LEH 232
T+ ++I +V+Q + + +A++S++M D +WL L K+ + +++ +S +
Sbjct: 465 TNDITIDEVLQAESVNIAVVSSFMWDSEWLNKKLSPL-KVKQIWIMNAKSQDVQQRWVRE 523
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK------- 285
M+ N +H PP+ + HSK MLL +R++V TAN+ +DW +K
Sbjct: 524 MEDAGIPNLRIHFPPMGGLIHSMHSKFMLLFGRDKLRLVVPTANMTPMDWGDKVNNWQPG 583
Query: 286 --SQGLWMQDFPLKDQNNLSEE---CGFENDLIDYL 316
L++ D P + + ++ F +L+ +L
Sbjct: 584 VMENSLFLVDLPRRSDGVMGKKQDLTTFGKELVCFL 619
>gi|343426865|emb|CBQ70393.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 654
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 73/161 (45%), Gaps = 14/161 (8%)
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 313
T H K ++L++ +R+ + + NL +DW ++QDFPL G
Sbjct: 274 TQHMKFLILVHEGFLRVAILSGNLNQIDWERIENTAFIQDFPLLSSATKPNVAGPSQSTN 333
Query: 314 DYLSTLKWPEFSANLPA-HGNFKINPSFFKKFNFSSA-AVRLIASVPGYHTGSSLKKWGH 371
D+ L S +LPA H + + F+FS+A R++AS P SSL W
Sbjct: 334 DFKLQLIRSLRSLSLPASHAIY----AALDTFDFSAATCARIVASWP---EPSSLADWER 386
Query: 372 MKLRTV--LQECTFEKGFKKSPLV---YQFSSLGSLDEKWM 407
++ + + L + E G + S V Q SSL + D KW+
Sbjct: 387 IETQGLGRLGKVVRELGIRPSQSVEVECQGSSLANHDVKWV 427
>gi|407927985|gb|EKG20864.1| hypothetical protein MPH_01847 [Macrophomina phaseolina MS6]
Length = 642
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 103/473 (21%), Positives = 183/473 (38%), Gaps = 101/473 (21%)
Query: 181 VSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ ILS + D +W V + L I G ++ + PA
Sbjct: 218 IKIEEVLQNHDLKSLILSTFDFDHEWF--GTKVKLDMTRQLWIVGAANDDQRYEWSLAPA 275
Query: 240 NWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFP 295
+ + L + G +H K ++ +P+ +R+ + TANL DW + +++ D P
Sbjct: 276 VYSNVELCVLDMKNGHNHGKFLIGSHPKYLRVAITTANLKGHDWGESGKMENTVFIIDLP 335
Query: 296 -LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
L + SE+ F +L YL +L + L A +F++S + +
Sbjct: 336 RLPEGKKTSEDEATAFCQNLRFYLKSL-----NVGLSAR-------DALLRFDWSRTRNL 383
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE-L 410
+ S+ G G ++ G L ++E + + L Y SSLG+L +M + L
Sbjct: 384 GFVCSLQGASIGDDGQRIGLPGLSQAIKELNLKS--NRLALDYATSSLGALSRGFMKQFL 441
Query: 411 SSSMSSGFSEDK----TPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK 462
+++ K + +G+ L + +PTV+ VR S G AG I
Sbjct: 442 TAAKGEELEATKEKYDADIKLGDLLKQFRVYFPTVDTVRASKGGEEAGGTI--------- 492
Query: 463 DFLKKYW--------AKWKASHTGRSRAMPHIKTF--------------ARYNGQKLA-- 498
FL+K W A + R+ + H K G+K+A
Sbjct: 493 -FLRKRWYDAPSFPKASMHDHKSTRNGILSHNKLIICRGQIGPEDEDNAGATEGKKVAWA 551
Query: 499 --------KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 550
+AAWG L ++ + ++ + + + CG I+P S
Sbjct: 552 YVGSHNFTQAAWGTLSRDKNTKTLKV---------NCRNNECGV-----IIPIFRGGASE 597
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVY--LPVPYELPPQRYSSEDVPWSWD 601
+ Q K + D EV + +P+E+P +RY ++ PW D
Sbjct: 598 QVGQEDK--------NAEEDGLPGYEVFARKMEIPFEIPGERYGNKK-PWFTD 641
>gi|347836693|emb|CCD51265.1| hypothetical protein [Botryotinia fuckeliana]
Length = 638
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 56/253 (22%), Positives = 108/253 (42%), Gaps = 52/253 (20%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIV-AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
QG P + + I +V+Q + AIL + +D DW+ K+ + V+ +++
Sbjct: 279 AQGFPREDD---IKIEEVLQSSTLEHAILGAFQIDSDWIRSKIQPSTKV--IWVLQAKTE 333
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ 287
+ K P + PP+ + HSK +L +P +R+++ +ANL DW +S
Sbjct: 334 AEKMNFKSLAPETYRFCFPPMEGNVNIMHSKLQILAHPTHLRLVIPSANLTPYDW-GESG 392
Query: 288 GL-----WMQDFP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
G+ ++ D P L + S++ F DL+ +L +
Sbjct: 393 GILENVVFLIDLPRLPNGEKASDDQLTPFAQDLLHFLHAM-------------------- 432
Query: 340 FFKKFNFSSAAVRLIASVP--GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF 396
+ R I S+ G H G++L++ G+ L + C G PL ++
Sbjct: 433 --------TLTPRTIESLKRGGSHFGTNLQRTGYPGLGS----CVRSLGLNTDHPLEIEY 480
Query: 397 --SSLGSLDEKWM 407
+S+G+LD++++
Sbjct: 481 VTASIGNLDDRFL 493
>gi|380495056|emb|CCF32689.1| ubiquitin interaction domain-containing protein [Colletotrichum
higginsianum]
Length = 641
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 114/514 (22%), Positives = 193/514 (37%), Gaps = 118/514 (22%)
Query: 177 NTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR 235
N + I +V+Q D + +A+LS++ D +WLL + +L+ + ++ ++
Sbjct: 148 NGEDIKIEEVLQKDKLQLAVLSSFQWDEEWLLGKVDA-RQTKMLLIAYANNEAEKATIRA 206
Query: 236 NKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQ 292
N P + P P+ G HSK +L Y +RI++ + NL+ DW +++
Sbjct: 207 NAPTGLVRFCFP-PMHGGYMHSKLQILKYEGYLRIVIPSGNLVPYDWGETGVLENMVFLI 265
Query: 293 DFPL---KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-S 348
D P Q F +L +L L E K+ S ++FS +
Sbjct: 266 DLPRIGGTHQTAPPAGTAFGTELRRFLRALGLDE-----------KLVKS-LDNYDFSKT 313
Query: 349 AAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP--LVYQFSSLGSLDEKW 406
+ + S+ G H S + G+ L + ++ + P + Y SSLGSL +
Sbjct: 314 SRYGFVHSIAGSHANDSWQHTGYCGLGSTVRSLGLA---TEEPVNIDYVASSLGSLTHDY 370
Query: 407 MAEL--SSSMSSGFSE-------------DKTPLGIGEPL------------IVWPTVED 439
+ + + SG E K L PL I +PT +
Sbjct: 371 LTAIYHACQGDSGMKEYEARQSKPTRNKAAKAGLAGSRPLGEGTLQWQHHFRIYFPTEKT 430
Query: 440 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAK-------WKASHTGRSRAMPHIKT-FAR 491
V S G ++ I F +K+W + + RS + H K+ F R
Sbjct: 431 VSSSRGGRSSAGTIC---------FQEKWWKSSTFPRELLRDCQSVRSGLLLHSKSIFVR 481
Query: 492 ----------YNGQ-KLAKAAWGALQKNN----SQLMIRSYELGVLILPSAKRHGCGFSC 536
Y G L+++AWG L K+ ++L R++E GVL+ G S
Sbjct: 482 GRAGGDAAWAYVGSANLSESAWGRLVKDRESGAAKLTCRNWECGVLVAVEGNPTGTADSG 541
Query: 537 TSNIVPSEIKSGSTETSQIQKTKL-------VTLTWHGSSDAGAS--------------- 574
T V + S +++Q L T T G + A A+
Sbjct: 542 TRPGVDQDAHSRRHPWARVQAQTLEGYARDEETSTSRGVAAATAADSEENRRQQQLDRDE 601
Query: 575 ----SEV--VYLPVPYELPPQRYSSEDV----PW 598
EV +P+P ++P RY S++ PW
Sbjct: 602 SAGLDEVFGTTVPIPMKVPAGRYMSDESAASRPW 635
>gi|310793199|gb|EFQ28660.1| ubiquitin interaction domain-containing protein [Glomerella
graminicola M1.001]
Length = 628
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 110/496 (22%), Positives = 185/496 (37%), Gaps = 105/496 (21%)
Query: 181 VSIRDVIQGD-IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +++Q D + +A+LS++ D +WLL V + +LV + ++ ++ N P
Sbjct: 154 IKIEEILQKDKLQLAVLSSFQWDEEWLLSKVDV-RQTRLLLVAYANNEAEKAAIRANAPT 212
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
+ P P+ G HSK +L Y +RI++ + NL+ DW +++ D P
Sbjct: 213 GLVRFCFP-PMYGGYMHSKLQILKYEGYLRIVIPSGNLVPYDWGETGVLENMVFLIDLPK 271
Query: 297 KDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
+ + E F +L +L L E K+ S ++F+ ++
Sbjct: 272 LESTQQAAPPAETLFGTELRRFLRALGLDE-----------KLVKS-LDSYDFTETSRYG 319
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTV----LQECTFEKGFKKSPLV---YQFSSLGSLDEK 405
+ S+ G H S W H T L G V Y SSLGSL++
Sbjct: 320 FVHSIAGSHANDS---WQHTGQSTRGYCGLGSTVRSLGLATEDAVDIDYVASSLGSLNDA 376
Query: 406 WMAEL--SSSMSSGFSE------------------DKTPLGIGEPL-------IVWPTVE 438
+ + + SG E D + EPL I +PT
Sbjct: 377 SLKAIYYACQGDSGMKEYDARKPKPARSKAAKAGLDGSRPVFNEPLQLQRHFRIYFPTEH 436
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAK-------WKASHTGRSRAMPHIKTF-- 489
V S G ++ I F +K+W + + RS + H K
Sbjct: 437 TVSSSRGGRSSAGTIC---------FQEKWWKSSTFPRELLRDCQSVRSGLLLHTKAIFV 487
Query: 490 --------ARYNGQKLAKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCT 537
A L+++AWG L K +L R++E GVL+ G + T
Sbjct: 488 QARDGAAWAYMGSANLSESAWGRLVKERDSGAPKLTCRNWECGVLVAVDGNLPGSADTGT 547
Query: 538 SNIVPSEIKSGSTETSQIQKTKLVTLT--------WHGSSDAGASSEVVY---LPVPYEL 586
V + + G S+ + VT+T D E V+ +P+P ++
Sbjct: 548 RPGVDQDAQ-GQAPMSKGEGGPAVTVTDSEEKQRHQQLGQDEPRCLEGVFGTTMPIPMKV 606
Query: 587 PPQRYSSEDV----PW 598
P RY+S++ PW
Sbjct: 607 PAGRYTSDESAASRPW 622
>gi|453087183|gb|EMF15224.1| phospholipase D/nuclease [Mycosphaerella populorum SO2202]
Length = 629
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 83/413 (20%), Positives = 156/413 (37%), Gaps = 91/413 (22%)
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFPLKDQNNLSEECGFENDL 312
HSK MLL + +RI + TANL++ DW Q +++ D P Q ++ F +L
Sbjct: 242 HSKLMLLFHADKLRIAIPTANLLNFDWGETGQMENTVFLIDLPRLPQGQKNDLTSFGREL 301
Query: 313 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGH 371
+ ++ G + F+FS+ A + + +V G H + G
Sbjct: 302 MFFIEM------------QGLDQDVRDGVLNFDFSATADIAFVHTVGGVHYKDQAARTGL 349
Query: 372 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW-----MAELSSSMSSGFSEDKTPLG 426
+ L +++ G + + SS+G+L +K MA + + E ++ G
Sbjct: 350 LGLSRTVRQMDLVAG-PSLEIDFAASSIGALTDKQLNDFHMAARGVDLLAHAREARSKAG 408
Query: 427 IG------------------EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 468
+ + +PT E VR S G AAG + F K+
Sbjct: 409 ASFFKKAGSKTVTATTNVRKKIRVYFPTKETVRSSTAG-AAGTICLQREYYERNSFPKEC 467
Query: 469 WAKWKASHTG-------------RSRAMPHIK-----------------TFARYNGQKLA 498
+ ++++ G RS A H + ++
Sbjct: 468 FRDYRSTRKGLLSHNKILCARGFRSTASEHADPPGVSVAATGSPNSNPVAWVYVGSSNMS 527
Query: 499 KAAWGAL--QKNNSQLMIRSYELGVLI------LPSAKRHGCGFSCTSNIVPSEIKSGST 550
K+AWG L ++ S++ R++E GV++ LPS+ F SE ++
Sbjct: 528 KSAWGELAAERTESKITCRNWECGVILSVPVETLPSSAGEA-AFKQRDANGDSETETEDE 586
Query: 551 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 603
++Q + V + A ++ L P+ +P + Y S++ PW + ++
Sbjct: 587 TSAQTSTPEFVNIE--------AFRRIIDL--PFSIPGEEYKSQE-PWYFKEQ 628
>gi|156389579|ref|XP_001635068.1| predicted protein [Nematostella vectensis]
gi|156222158|gb|EDO43005.1| predicted protein [Nematostella vectensis]
Length = 597
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 63/118 (53%), Gaps = 7/118 (5%)
Query: 27 LSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDGTNPVVVK-SGDQR-KKLS 83
L++G IGR + V+DKR+SR H TL + +G +L TNP K SG ++ L
Sbjct: 18 LAEGKTSIGRGPLLSVADKRVSRSHATLDIN-NGKLTLSATHTNPTFFKLSGREKFSALR 76
Query: 84 SNEHVSIADGDIIELIPGHHFFKYVTLS-RSQKRVSNDGATNGE--LSSKKMRQQDEQ 138
+E + GD+I L+P H F+ ++++ + N+GA E L+ + Q+E+
Sbjct: 77 KDESQELKTGDLISLLPDQHVFEIISINPNTHSTAVNNGALTDEKTLAGSTEKSQEEK 134
>gi|85109758|ref|XP_963073.1| hypothetical protein NCU06222 [Neurospora crassa OR74A]
gi|28924724|gb|EAA33837.1| predicted protein [Neurospora crassa OR74A]
Length = 657
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 72/147 (48%), Gaps = 19/147 (12%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS +++D WL ++ K +L + G + +
Sbjct: 245 ITIEEVFQKDKLQLAVLSTFILDEAWLFDKLDLM-KTKLILCRGAPNQG-------EQIS 296
Query: 240 NWI-----LHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LW 290
W+ + K +P++ G HSK LL Y +RI+V +ANL+ DW L+
Sbjct: 297 TWLDGFPTVRKHLVPMNGSGCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILF 356
Query: 291 MQDFPLKDQNNLSEECG-FENDLIDYL 316
+ D PL D +++ E F +L+ +L
Sbjct: 357 IIDLPLLDDPDVTRELTHFGEELLYFL 383
>gi|350290891|gb|EGZ72105.1| phospholipase D/nuclease [Neurospora tetrasperma FGSC 2509]
Length = 657
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 72/147 (48%), Gaps = 19/147 (12%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS +++D WL ++ K +L + G + +
Sbjct: 244 ITIEEVFQKDKLQLAVLSTFILDEAWLFDKLDLM-KTKLILCRGAPNQG-------EQIS 295
Query: 240 NWI-----LHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LW 290
W+ + K +P++ G HSK LL Y +RI+V +ANL+ DW L+
Sbjct: 296 TWLDGFPTVRKHLVPMNGSGCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILF 355
Query: 291 MQDFPLKDQNNLSEECG-FENDLIDYL 316
+ D PL D +++ E F +L+ +L
Sbjct: 356 IIDLPLLDDPDVTRELTHFGEELLYFL 382
>gi|336469464|gb|EGO57626.1| hypothetical protein NEUTE1DRAFT_81347 [Neurospora tetrasperma FGSC
2508]
Length = 656
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 72/147 (48%), Gaps = 19/147 (12%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS +++D WL ++ K +L + G + +
Sbjct: 244 ITIEEVFQKDKLQLAVLSTFILDEAWLFDKLDLM-KTKLILCRGAPNQG-------EQIS 295
Query: 240 NWI-----LHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LW 290
W+ + K +P++ G HSK LL Y +RI+V +ANL+ DW L+
Sbjct: 296 TWLDGFPTVRKHLVPMNGSGCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILF 355
Query: 291 MQDFPLKDQNNLSEECG-FENDLIDYL 316
+ D PL D +++ E F +L+ +L
Sbjct: 356 IIDLPLLDDPDVTRELTHFGEELLYFL 382
>gi|307108296|gb|EFN56536.1| hypothetical protein CHLNCDRAFT_144175 [Chlorella variabilis]
Length = 226
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 41/72 (56%), Gaps = 16/72 (22%)
Query: 483 MPHIKTFARYNGQKLA----------KAAWGALQKNNSQLMIRSYELGVLILPS-----A 527
MPH+KT+ R+ G +A KAAWG L ++ +L ++S+EL VL+LPS
Sbjct: 1 MPHLKTYCRHVGGDVAWLCLGSHNVSKAAWGELLRDG-RLYVKSFELSVLLLPSRELAYQ 59
Query: 528 KRHGCGFSCTSN 539
+ GFSCTS
Sbjct: 60 RSRRRGFSCTSG 71
>gi|367001138|ref|XP_003685304.1| hypothetical protein TPHA_0D02330 [Tetrapisispora phaffii CBS 4417]
gi|357523602|emb|CCE62870.1| hypothetical protein TPHA_0D02330 [Tetrapisispora phaffii CBS 4417]
Length = 563
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 106/488 (21%), Positives = 184/488 (37%), Gaps = 92/488 (18%)
Query: 181 VSIRDVIQGD--IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
+ ++D+ GD + +IL ++ ++++LL L I ++ VI ++ +K+
Sbjct: 109 IRMKDIF-GDNRLKTSILFSFQFEMNFLLSQFN-LDTIENIYVIAQKNTVVPPTLKKFNS 166
Query: 239 A----NWI-LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ 292
N + + PP F HHSK ++ IY + ++ + + N + N Q W
Sbjct: 167 VFDRLNIVEFYMPP----FSCHHSKMVINIYEDKSCKLFIPSNNFTFYETNLPQQVCWEG 222
Query: 293 DFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSAN---LPAHGNFKINPSFFKKFNFSSA 349
D N+ +++ F+ +LI Y + N +P N F K N
Sbjct: 223 PTLPYDINSKNQKISFKENLISYFQSYPSEVKIMNRTIIPMISNID-----FSKLN---- 273
Query: 350 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM-- 407
V + S P S + K ++ + L C+ + K++ + Q S++G K +
Sbjct: 274 NVEFLYSSPN-DKDSGISKLLYLLEKNDLLGCSDDIN-KRTHFLCQSSTIGGSLSKTVPL 331
Query: 408 ---AELSSSMSSGFSEDKTPLGIGE------------PLIVWPTVEDVRCSLEGYAAGNA 452
L SG + L + P IV+PTVE++R S G+ N
Sbjct: 332 NIFTHLMIPEFSGIQKSNKKLKTSQELIDIYREKRISPYIVYPTVEELRNSPSGWKCSNW 391
Query: 453 IPSPQKN-------VDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR-------------- 491
KN + KDF Y K + + R H K + R
Sbjct: 392 FHFNYKNKAEYYEVLAKDFKLFYKQKDQLTSKYRKATPSHSKFYIRCTENDSKVPARFSK 451
Query: 492 -----YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 546
+ L+ AWG L R+YE+G+L+ G +C+S +
Sbjct: 452 LDWCIFTSSNLSFNAWGKLSSK-----PRNYEVGILL---CSNEGQQINCSSFSRKIDEH 503
Query: 547 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE-DVPWSWDKRYT 605
G + S TK +D + V+ VP+ LP + Y + D + K Y
Sbjct: 504 QGCSRLSDSNNTK---------NDGKKNINVM---VPFTLPLEPYDIKYDTAFCIQKSYN 551
Query: 606 KKDVYGQV 613
D +G+V
Sbjct: 552 LPDCFGEV 559
>gi|255719760|ref|XP_002556160.1| KLTH0H06468p [Lachancea thermotolerans]
gi|238942126|emb|CAR30298.1| KLTH0H06468p [Lachancea thermotolerans CBS 6340]
Length = 570
Score = 47.8 bits (112), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 105/494 (21%), Positives = 184/494 (37%), Gaps = 101/494 (20%)
Query: 181 VSIRDVI-QGDIIVAILSNYMVDIDWLLPA------CPVLAKIPHVL---VIHGESDGTL 230
++++++ + + A L ++ ++D++LP ++A+ +L I ++ L
Sbjct: 112 ITLQEIFSESKLTRAWLFSFQYELDFILPMFNESTQITIIAQKGTILPPTRISSKTSKIL 171
Query: 231 EHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGL 289
MK + L PP F HHSK ++ Y G I + + N H + N Q +
Sbjct: 172 SKMKTIE-----LQMPP----FACHHSKMIVNEYRDGSCCIYIPSNNFTHAETNLPQQIV 222
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWP-------EFSANLPAHGNFKINPSFFK 342
W L+ + +E F L+ YL+ +P EF L ++ F
Sbjct: 223 WCSP-RLRRCSEAVKESEFRKSLVKYLNA--YPVSLKPLIEFLGTLDFTSLDQLGVEFI- 278
Query: 343 KFNFSSAAVRLIASVPGYHTGSSLKK------WGHMKLRTVLQECTFEKGFKKSPLVYQF 396
F+ +++ +P H S ++ G + R + Q T +PL
Sbjct: 279 -FSCPKPFESILSGIPLLHKALSSRQHAAGGNTGRERHRYLSQVSTI-----GAPLKTGL 332
Query: 397 SSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG-----------EPLIVWPTVEDVRCSLE 445
G+L M L S + G + K I EP IV+PT E++R S
Sbjct: 333 EYPGNLFSHLMIPLLSGLLVGPRDRKRAYEIPNLHKVFEDYNIEPYIVYPTPEEIRQSPM 392
Query: 446 GYAAGNAIPSP-QKNVDKDFLKKYWAKWKASHTG-------RSRAMPHIKTFAR------ 491
GY G +N + KW H R R H K + +
Sbjct: 393 GYLTGGWFHFHWLRNQATKTVYNTLKKWGVLHKQQPQDCPRRGRTPSHTKFYMKSTTLLD 452
Query: 492 ------------YNGQKLAKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSN 539
+ L+ AWG + ++YE+GVL S R S+
Sbjct: 453 NQAPFSEVDWFLFTTANLSLNAWGTTTRKP-----QNYEVGVL-FKSQDRRRITVKSVSD 506
Query: 540 IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 599
+V S+ +S T QI GSS +++ + + VP+++ P Y D +
Sbjct: 507 LVYSKFRS----TGQIL----------GSSKVHSNANICVM-VPFDINPVPYQPGDDAFC 551
Query: 600 WDKRYTKKDVYGQV 613
+ Y D++G++
Sbjct: 552 VSRSYEAPDIHGKL 565
>gi|443895439|dbj|GAC72785.1| ras-related GTPase [Pseudozyma antarctica T-34]
Length = 689
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 113/271 (41%), Gaps = 49/271 (18%)
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR--- 235
+ S R+ +Q +A+L+ Y + +DWL P + +L E T + R
Sbjct: 216 ATASSRNGLQ----LAVLATYDLRMDWLYSLFPKGLPVTLILPPPKEDYRTDPSVARPGL 271
Query: 236 ---------NKPANWILHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
+ W + P P + T H K ++L++P +R+ + + NL +DW
Sbjct: 272 HRSEIFGDFARCPGWQICVPSKPKGGWLTQHMKFLILVHPDFLRVAILSGNLNGIDWERI 331
Query: 286 SQGLWMQDFPLKDQ----------NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK 335
++QDFPL ++ F+ L+ L +L P ++ P +
Sbjct: 332 ENTAYIQDFPLNTDTAKAATPAHGSSQGRTNDFKAQLVRILRSLGMP---SSHPVY---- 384
Query: 336 INPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHM------KLRTVLQECTFEKGFK 388
+ + +FS A R++AS P S+L +W M +L V+++ +
Sbjct: 385 ---AALDRHDFSQATRARIVASWP---EASNLAEWDRMETQGLGRLGKVVRDLGIQPKRS 438
Query: 389 KS-PLVYQFSSLGSLDEKWMAELSSSMSSGF 418
S L Q SSL + D KW+ E ++SGF
Sbjct: 439 GSLQLECQGSSLANHDIKWI-EHFHLLASGF 468
>gi|255945889|ref|XP_002563712.1| Pc20g12270 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588447|emb|CAP86556.1| Pc20g12270 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 658
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 56/118 (47%), Gaps = 8/118 (6%)
Query: 169 VQGLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
V G P N ++I +VIQ D+ + + S+++ D+ WL + +L I +D
Sbjct: 217 VTGFPRSGNE--ITIEEVIQRDDLELGVFSSFLWDMSWLY--SKFNSSSTRILFIMQAND 272
Query: 228 GTLEHMKRNKPAN---WILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDW 282
+ R +N + L PP+ HSK +L+ +P +RI V +ANL DW
Sbjct: 273 EETQKQYRQDVSNMRNFRLCFPPMEPQVFCMHSKLLLMFHPGYLRIAVPSANLTPTDW 330
>gi|330841055|ref|XP_003292520.1| hypothetical protein DICPUDRAFT_89860 [Dictyostelium purpureum]
gi|325077216|gb|EGC30943.1| hypothetical protein DICPUDRAFT_89860 [Dictyostelium purpureum]
Length = 658
Score = 46.6 bits (109), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 62/136 (45%), Gaps = 32/136 (23%)
Query: 175 WANTSCVSIRDVI-QGDIIVAILSNYMVDIDWLLPACPVL--AKIPHVLVIHGESDGTLE 231
W NT +S D+I + + AI++ Y +DI W++ + KIP + +
Sbjct: 151 WINT--LSFSDLISKPGMKFAIVTGYSIDIKWVMNSFERSQGTKIPITFIRDYD------ 202
Query: 232 HMKRNKPANWILHKPPLPISFGT-------------HHSKAMLLIYPRGVRIIVHTANLI 278
K++KP P PI F H+K ++L+Y +RI V +AN
Sbjct: 203 -QKKHKPG-------PHPIPFSNCTIIHPVLSGDQIFHAKLLVLVYDTWIRIAVTSANPS 254
Query: 279 HVDWNNKSQGLWMQDF 294
+++N SQ +W QDF
Sbjct: 255 SYEYSNLSQSIWYQDF 270
>gi|410917580|ref|XP_003972264.1| PREDICTED: aprataxin and PNK-like factor-like [Takifugu rubripes]
Length = 124
Score = 46.2 bits (108), Expect = 0.054, Method: Composition-based stats.
Identities = 31/87 (35%), Positives = 44/87 (50%), Gaps = 4/87 (4%)
Query: 27 LSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDGTNPVVVKSG--DQRKKLS 83
L G VIGR + V DKR+SR H L + DG L NP ++S D + L
Sbjct: 17 LPPGETVIGRGPLLRVVDKRVSRHH-GLLENIDGCLRLKPTHMNPCFIQSSLTDDPRPLQ 75
Query: 84 SNEHVSIADGDIIELIPGHHFFKYVTL 110
+ S+ DGD+ L+PG ++ VT+
Sbjct: 76 KDSWFSLQDGDLFSLLPGQLIYRVVTV 102
>gi|350636132|gb|EHA24492.1| hypothetical protein ASPNIDRAFT_183042 [Aspergillus niger ATCC
1015]
Length = 324
Score = 46.2 bits (108), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 85/190 (44%), Gaps = 25/190 (13%)
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDFP 295
N L PP+ HSK MLL +P +R++V TANL DW + +++ D P
Sbjct: 3 NLRLCFPPMGGQVVCMHSKLMLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLIDLP 62
Query: 296 LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
K N+ E+ F DL+ + LK N+ A F+FS ++
Sbjct: 63 KK---NVLEKPTTHFYEDLVVF---LKASTLHENIIAK---------LDNFDFSKTSKYA 107
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--AEL 410
+ ++ G HT ++ K+ G+ L ++ + + Y SS+G++ ++++ L
Sbjct: 108 FVHTIGGSHTDTAWKRTGYCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFLRCMYL 166
Query: 411 SSSMSSGFSE 420
+S G +E
Sbjct: 167 ASQGDDGLTE 176
>gi|294944973|ref|XP_002784507.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
gi|239897573|gb|EER16303.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
Length = 230
Score = 45.4 bits (106), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 85/206 (41%), Gaps = 31/206 (15%)
Query: 181 VSIRDVIQGD---IIVAILSNYMVDIDWLLPACPVLAKIPHVLVI-HGESDGTLEHMKRN 236
++ D+I GD I LS++ DI+WLL P VLV + G + +++
Sbjct: 31 LTFADII-GDKTTIKAVFLSSFGCDIEWLLEHFAF--GTPIVLVDDYDRKRGAMAEIQQP 87
Query: 237 KPANWILHKPPLPI-------SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL 289
W K P GT H+K +++ + +R+ + ++NL DW SQ +
Sbjct: 88 FGEVWSQMKIVHPYFETGGLYDSGTMHAKLIIIERAQALRVCISSSNLTPQDWEGVSQCI 147
Query: 290 WMQDF--------PLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG---NFKINP 338
W+ DF P + + F + L ++ T F ++P ++ +
Sbjct: 148 WVADFKAANDFEAPARKRVKPDHTSDFGDQLARFIET-----FFRSIPDSSSLWSYWVKV 202
Query: 339 SFFKKFNFS-SAAVRLIASVPGYHTG 363
+FN V LIAS PGY G
Sbjct: 203 LTGSRFNVKLPKGVELIASAPGYWKG 228
>gi|449302183|gb|EMC98192.1| hypothetical protein BAUCODRAFT_416098 [Baudoinia compniacensis
UAMH 10762]
Length = 610
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 7/109 (6%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPAC---PVLAKIPHVLVIHGESDGTLEHMKRN 236
+ I +V++ + A+LS + D++W+L P + V+ + D + M
Sbjct: 142 IKIEEVLEPRTLRTALLSAFQWDVEWVLSKLKVPPNGGTTKCIFVMQAKEDSLRQQMLTE 201
Query: 237 KPANWILHKPPLPISFGT---HHSKAMLLIYPRGVRIIVHTANLIHVDW 282
A + P G+ HSK MLL +P +RI + +ANL+ DW
Sbjct: 202 TDAMRPFLRLTFPYMGGSVFCMHSKLMLLFHPHKLRIAIPSANLLSFDW 250
>gi|157103380|ref|XP_001647953.1| polynucleotide kinase- 3'-phosphatase [Aedes aegypti]
gi|108884176|gb|EAT48401.1| AAEL000527-PA, partial [Aedes aegypti]
Length = 507
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 6/88 (6%)
Query: 23 PKLPLSQGPNVIGRT-NIPVSDKRLSRKHITLTASADGSASLVVD-GTNPVVVKSGDQRK 80
P + + +IGR+ + D SR+ + L A+ G LV G+NP V+ K
Sbjct: 11 PPIRIDSDRKIIGRSPETLIQDPCCSRQQVCLKANFKGGFVLVKSLGSNPSVLNG----K 66
Query: 81 KLSSNEHVSIADGDIIELIPGHHFFKYV 108
+L N DGDI+EL+PG H + +V
Sbjct: 67 QLEKNMGYEAYDGDILELLPGQHQYTFV 94
>gi|401885055|gb|EJT49186.1| Ran GTPase activator [Trichosporon asahii var. asahii CBS 2479]
Length = 1170
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 14/140 (10%)
Query: 254 THHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN-- 310
+ H K + Y G +R+ + TAN++ DW +++QD L ++ S + +
Sbjct: 428 SEHQKWAFVFYKTGRLRVAIMTANMVDYDWERIENTVFVQDV-LPNKAGHSPDWHLPDFP 486
Query: 311 ----DLIDYLSTLKWPEFSANLPAHGN---FKINPSF--FKKFNFSSAAVRLIASVPGYH 361
DL +L K EF G+ +PS+ F K+++S RL+ S+ G +
Sbjct: 487 QQFADLFKHLKIHKGIEFMRQTHPLGSQVPISSDPSYTDFGKWDWSRVKARLVISIAGKY 546
Query: 362 TG-SSLKKWGHMKLRTVLQE 380
G + KWG +L V+QE
Sbjct: 547 EGFHDMSKWGIGRLGQVVQE 566
>gi|45184994|ref|NP_982712.1| AAR169Cp [Ashbya gossypii ATCC 10895]
gi|44980615|gb|AAS50536.1| AAR169Cp [Ashbya gossypii ATCC 10895]
Length = 540
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 114/295 (38%), Gaps = 51/295 (17%)
Query: 185 DVIQGDIIV--AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
+V+ GD + L ++ +++WLL P HV V+ GT++ + A
Sbjct: 91 EVVLGDTDLERVYLFSFQYEMNWLLDLIP-----EHVQVVVTAQKGTVQEADGGRAARVR 145
Query: 243 LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
+P F +HHSK ++ Y + R+++ +AN ++ + Q +WM +
Sbjct: 146 YRMVWMP-PFSSHHSKMVIAFYQDQRCRVVLPSANFTALETSLPQQMMWMTPQLAHSRAA 204
Query: 302 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLIASVP 358
+ F + L DYL +PE L +K +F+ + + S P
Sbjct: 205 EQQPSRFRSGLQDYLQM--YPEPDRELLQR---------LRKIDFAPVDATGAAFVYSAP 253
Query: 359 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG------------SLDEKW 406
G T + K G +L L E G + S Q SS+G +L
Sbjct: 254 GARTRA---KTGLAQLAAQLDEGPAAGG-RHSHYFCQSSSIGGPLNSRSAENPRNLFVHL 309
Query: 407 MAELSSSMSSGFSED-KTPLGIGE-----------PLIVWPTVEDVRCSLEGYAA 449
M L S + G + K LG E P I++PTVED G+ A
Sbjct: 310 MVPLLSGHTQGLPKSVKDCLGEKEAYALLQRERLHPYILYPTVEDFNECFTGWLA 364
>gi|374105912|gb|AEY94823.1| FAAR169Cp [Ashbya gossypii FDAG1]
Length = 540
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 114/295 (38%), Gaps = 51/295 (17%)
Query: 185 DVIQGDIIV--AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
+V+ GD + L ++ +++WLL P HV V+ GT++ + A
Sbjct: 91 EVVLGDTDLERVYLFSFQYEMNWLLDLIP-----EHVQVVVTAQKGTVQEADGGRAARVR 145
Query: 243 LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
+P F +HHSK ++ Y + R+++ +AN ++ + Q +WM +
Sbjct: 146 YRMVWMP-PFSSHHSKMVIAFYQDQRCRVVLPSANFTALETSLPQQMMWMTPQLAHSRAA 204
Query: 302 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLIASVP 358
+ F + L DYL +PE L +K +F+ + + S P
Sbjct: 205 EQQPSRFRSGLQDYLQM--YPEPDRELLQR---------LRKIDFAPVDATGAAFVYSAP 253
Query: 359 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG------------SLDEKW 406
G T + K G +L L E G + S Q SS+G +L
Sbjct: 254 GARTRA---KTGLAQLAAQLDEGPAAGG-RHSHYFCQSSSIGGPLNSRSAENPRNLFVHL 309
Query: 407 MAELSSSMSSGFSED-KTPLGIGE-----------PLIVWPTVEDVRCSLEGYAA 449
M L S + G + K LG E P I++PTVED G+ A
Sbjct: 310 MVPLLSGHTQGLPKSVKDCLGEKEAYALLQRERLHPYILYPTVEDFNECFTGWLA 364
>gi|323449457|gb|EGB05345.1| hypothetical protein AURANDRAFT_72265 [Aureococcus anophagefferens]
Length = 1631
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 81/207 (39%), Gaps = 47/207 (22%)
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK-WMAE 409
V I SVPG+ G+ +GH +R L +G + + SSLG LD K ++
Sbjct: 851 VHFIGSVPGFRRGAFADAFGHRAIRRALA----REGLTVARAEFANSSLGRLDNKVFLRG 906
Query: 410 LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC----SLEGYAAGNAIPSPQKNVDKDFL 465
++S+ D+ IVWP+ + C L +A + Q N D +
Sbjct: 907 FATSLFGAGDLDRLK-------IVWPS-QATACRSSRKLMLHAMTEDKGTAQMNGPDDRI 958
Query: 466 KKYWAKWKASHTGRSR-----------AMPHIKTFARYNG-----------QKLAKAAWG 503
W A+ R+R + H K A ++G + AAWG
Sbjct: 959 ------WNAAGFPRARFHHYHAPSDRQTLHHTKMLACFDGDDRLVAVVGGSHNCSGAAWG 1012
Query: 504 ALQKNNSQLMIRSYELGVLILPSAKRH 530
+ N S +M SYE GVL+ A R
Sbjct: 1013 VGEDNMSVIM--SYEAGVLVACGAGRR 1037
>gi|291225011|ref|XP_002732503.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 544
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 15/165 (9%)
Query: 25 LPLSQGPNVIGRTN-IPVSDKRLSRKHITLTASADGSASLVVDGTNPVVVK--SGDQRKK 81
+PL G ++GR + +SDKR+SR H L + G ++ NP + D+ +
Sbjct: 18 IPLPPGQTILGRGPFLGISDKRVSRSHAILEVDS-GKLRILPTHINPTFHQRLGTDKLRP 76
Query: 82 LSSNEHVSIADGDIIELIPGHHFFKYV--------TLSRSQKR-VSNDGATNGELSSKKM 132
L+ +E + +G+ LIP H FK V T S S K V + +KK
Sbjct: 77 LAKDEWQELKNGEKFSLIPEFHIFKVVIDEKPINNTSSNSSKTPVEEENGKETITENKKT 136
Query: 133 RQQDEQDNENGKNSEEALCNFHVSR--DKLPSTFRLLRVQGLPAW 175
+ + NG+ S+ + N + DK + R + LP+W
Sbjct: 137 DDVESDEKPNGEKSKPSAGNVQTVKLEDKKEVALPVQRERKLPSW 181
>gi|66822393|ref|XP_644551.1| hypothetical protein DDB_G0273869 [Dictyostelium discoideum AX4]
gi|66822691|ref|XP_644700.1| hypothetical protein DDB_G0273125 [Dictyostelium discoideum AX4]
gi|60472674|gb|EAL70625.1| hypothetical protein DDB_G0273869 [Dictyostelium discoideum AX4]
gi|60472831|gb|EAL70780.1| hypothetical protein DDB_G0273125 [Dictyostelium discoideum AX4]
Length = 734
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 66/149 (44%), Gaps = 21/149 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHV 219
P++F L P + +S +D+I+ ++ A++S + +D +W+ I +
Sbjct: 207 PNSFYLNSTNEQPRICTINTLSFKDLIKKPGMVGALVSGFALDPEWV---------IKEI 257
Query: 220 LVIHGESDGTLEHMKRNKPANWILH---------KPPLPISFGTHHSKAMLLIYPRGVRI 270
HG +KP H PPL ++ +HSK M+ + VR+
Sbjct: 258 RKEHGNKVKFTFVKNYSKPETKGRHAINDFITVINPPL-FNYQLYHSKLMIFTFVDLVRV 316
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQ 299
++ ++N D++ Q +W QDF LK Q
Sbjct: 317 VIPSSNPTKFDYSGWGQTIWFQDF-LKKQ 344
>gi|440797761|gb|ELR18837.1| Poly(ADP-ribose) polymerase catalytic domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 601
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 64/133 (48%), Gaps = 7/133 (5%)
Query: 11 PLDNNLREDNSLPKLPLSQGPNVIGRTNIP-VSDKRLSRKHITLT-ASADGSASLVVDGT 68
P + ++ LP + L G +GR + + D RLSRK +T+ G AS+ V G
Sbjct: 26 PPEAHVHLPQDLPTVSLKHGETDLGRGRLTQLLDPRLSRKQLTVEWDEHSGRASVHVHGM 85
Query: 69 NPVVVKSGDQRKKLSSNEH---VSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATNG 125
NP V + Q++ ++ ++ V + DG +I L+PG + + + R + A G
Sbjct: 86 NPSYVHAQGQQEGVAVSKETGKVEVGDGVVISLLPGLYGYTLRIIDREAS--TAPPANAG 143
Query: 126 ELSSKKMRQQDEQ 138
++S K + + E
Sbjct: 144 HVNSHKRKLEGEH 156
>gi|406694621|gb|EKC97945.1| Ran GTPase activator [Trichosporon asahii var. asahii CBS 8904]
Length = 1114
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 64/139 (46%), Gaps = 14/139 (10%)
Query: 255 HHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN--- 310
H K + Y G +R+ + TAN++ DW +++QD L ++ S + +
Sbjct: 381 EHQKWAFIFYKTGRLRVAIMTANMMDYDWERIENTVFLQDV-LPNKAGHSPDWHLPDFPQ 439
Query: 311 ---DLIDYLSTLKWPEFSAN---LPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHT 362
DL +L K EF L + +PS+ F K+++S RL+ S+ G +
Sbjct: 440 QFADLFKHLKIHKGIEFMRQKHPLGSQVPISSDPSYTDFGKWDWSRVKARLVISISGKYE 499
Query: 363 G-SSLKKWGHMKLRTVLQE 380
G + KWG +L V+QE
Sbjct: 500 GFHDMSKWGIGRLGQVVQE 518
>gi|154298872|ref|XP_001549857.1| hypothetical protein BC1G_11683 [Botryotinia fuckeliana B05.10]
Length = 495
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 56/139 (40%), Gaps = 28/139 (20%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIV-AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
QG P + + I +V+Q + AIL + +D DW+ K+ VL E++
Sbjct: 279 AQGFPREDD---IKIEEVLQSSTLEHAILGAFQIDSDWIRSKIQPSTKVIWVLQAKTEAE 335
Query: 228 GTLEHMKR-------NK-----------------PANWILHKPPLPISFGTHHSKAMLLI 263
H KR NK P + PP+ + HSK +L
Sbjct: 336 SFPRHQKRPEIQLQRNKELARYGGVIKMNFKSLAPETYRFCFPPMEGNVNIMHSKLQILA 395
Query: 264 YPRGVRIIVHTANLIHVDW 282
+P +R+++ +ANL DW
Sbjct: 396 HPTHLRLVIPSANLTPYDW 414
>gi|344232732|gb|EGV64605.1| phospholipase D/nuclease [Candida tenuis ATCC 10573]
Length = 171
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 54/155 (34%), Gaps = 53/155 (34%)
Query: 465 LKKYWAKWKASH--TGRSRAMPHIKTFARYNGQKLAKAAWGALQKNN------------- 509
+K Y KW H TGR R H+K + NG W + +N
Sbjct: 32 IKPYLCKWNNGHEYTGRERNPAHVKLYMCDNGDDFKSLKWLYMGSHNLSKQAWGGGSGFG 91
Query: 510 -----SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLT 564
++ + SYELG+LI P + TL
Sbjct: 92 SWQNINEYQVSSYELGILITPENDKD-------------------------------TLK 120
Query: 565 WHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 599
SD SSE + +P LPP RYS D+PWS
Sbjct: 121 PVFCSDF--SSEKYPVRMPLYLPPTRYSPTDMPWS 153
>gi|219116995|ref|XP_002179292.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409183|gb|EEC49115.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 708
Score = 43.5 bits (101), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 95/234 (40%), Gaps = 50/234 (21%)
Query: 253 GTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL---KDQNNLSEECGF 308
G HH K M+L+ G V ++V T+NL + S W+Q FP + L EE
Sbjct: 263 GVHHPKFMILLERSGDVVVVVSTSNLTEPRATDAS---WLQRFPAARSSRERKLKEE--- 316
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFF--------------KKFNFSSAAVRLI 354
E+D L+ + + + H + P F K F+FS A V L+
Sbjct: 317 EDDFGIVLTNMLEAQTLSCRKGH----VTPMGFCRQELGWNSLRDLTKHFDFSKAQVHLV 372
Query: 355 ASVPG---YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP--------LVYQFSSLGSLD 403
A++PG T S + +G ++ V++ + + P L+ Q +SLGS
Sbjct: 373 ATIPGDRLSKTASPSELFGRQRVSAVMKRLSQGPTPRLPPILRSEDDRLIVQPTSLGS-- 430
Query: 404 EKW----MAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAA 449
+W M E+ S D + + + I+WPT ++ G+A
Sbjct: 431 -EWTRANMTEVVRSYLGHEDRDVSKVRDAQVFPRLDILWPTERFMKAYRTGFAG 483
>gi|50310989|ref|XP_455517.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49644653|emb|CAG98225.1| KLLA0F09625p [Kluyveromyces lactis]
Length = 497
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 89/405 (21%), Positives = 158/405 (39%), Gaps = 78/405 (19%)
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL- 296
AN +H+ +P +G HHSK + + G +R+ V + NL + N Q +W PL
Sbjct: 123 ANLKVHRVDIPSPWGCHHSKIIFSFHQNGTMRMHVPSFNLSREEMNLVQQTVWTS--PLL 180
Query: 297 --KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
K + ++ FE++L++YL++ +S+ +G + +K + +
Sbjct: 181 YEKSETVPKKKSRFEDELLEYLNS-----YSSYTSLYG-LIASLKRYKWHVLDEQNCQFV 234
Query: 355 ASVPGYHTG-----SSLKKWGHMKLR------------TVLQECTFEKGFKKSPLVYQFS 397
S P Y+ G S L+ G MKL +Q + F+K + Q
Sbjct: 235 YSTP-YNGGLTQLKSCLRASG-MKLHGDEEDDDLSFVNLFIQVSSMGNPFRKKFDLLQDV 292
Query: 398 SLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVR-CSLEGYAAGNAIPSP 456
+ L W + E TP + +VWPT +++ C +G +A
Sbjct: 293 MIPYLYTDWFEKDGYDKKLKSKEYTTPF-LAHSTLVWPTKTEIKECMTQGLSANWFFYKR 351
Query: 457 QKNVDKDFL----KKYWAKWKASHTGRSRAM--PHIKTFARYNGQ--------------K 496
+ ++ + K A+ + ++R M H K + ++ +
Sbjct: 352 SEQTERKVVPCLRKHVPLPTNATQSDKNRHMVPSHTKYYIQFTDENTLKRPDWILLTSHN 411
Query: 497 LAKAAWG--ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 554
L++AAWG L+K +YE G+L + R+ + S P G T S+
Sbjct: 412 LSQAAWGPSPLKKPT------NYECGILYTTTMGRNKVRLTLASAQQP----PGRTIGSR 461
Query: 555 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 599
+ + V T V + PY L QRYS+ D P++
Sbjct: 462 VPEDITVLPT-------------VKVVTPYPLKFQRYSATDEPYT 493
>gi|328850417|gb|EGF99582.1| hypothetical protein MELLADRAFT_94260 [Melampsora larici-populina
98AG31]
Length = 286
Score = 42.7 bits (99), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 59/124 (47%), Gaps = 23/124 (18%)
Query: 175 WANTSCVSIR--DVI--QGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHGES 226
W + S +IR D+I + + A++S Y+VDI WL P P+L ++ H +
Sbjct: 132 WHSDSQDAIRAEDIIYPKHKVTKALVSGYVVDIGWLRGLFDPGTPLL------IIKHDKD 185
Query: 227 DGTLEHMKRNKPANWILHKPPLPIS------FGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
GT + +R P ++ H PP+ ++ G H K ++ + VR+ + T N +
Sbjct: 186 AGTFKLKQR--PNTFLCH-PPMKLTAKGSLAHGAMHVKFFIIYFADRVRVAISTGNPVEF 242
Query: 281 DWNN 284
D+
Sbjct: 243 DYQT 246
>gi|404485080|ref|ZP_11020284.1| hypothetical protein HMPREF9448_00695 [Barnesiella intestinihominis
YIT 11860]
gi|404340085|gb|EJZ66516.1| hypothetical protein HMPREF9448_00695 [Barnesiella intestinihominis
YIT 11860]
Length = 172
Score = 42.0 bits (97), Expect = 0.91, Method: Composition-based stats.
Identities = 26/103 (25%), Positives = 48/103 (46%), Gaps = 11/103 (10%)
Query: 4 TKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRT------NIPV--SDKRLSRKHITLTA 55
T +G++ L+N + PL G N+IGR +IP+ SD + R+H +
Sbjct: 54 TSLGFITVLENAF---GYRQEFPLHAGDNIIGRASKGTEVDIPIETSDMSMDRRHCIINV 110
Query: 56 SADGSASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIEL 98
G+ ++ NP + + + + LS E + DGD++ +
Sbjct: 111 KEKGNRPILTVRDNPSLTGTFLRHELLSDRERAVLHDGDVVTI 153
>gi|50292179|ref|XP_448522.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527834|emb|CAG61483.1| unnamed protein product [Candida glabrata]
Length = 553
Score = 42.0 bits (97), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 7/86 (8%)
Query: 243 LHKPPLPISFGTHHSKAMLLIYP--RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
++ PP + HHSK ++ IY RGVR+ + + N + N Q LW F + +
Sbjct: 182 IYMPP----YSCHHSKMIIGIYRNGRGVRVFLPSNNFTWAETNWPQQVLWSSPF-MSISD 236
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSA 326
E GF+ L DYLS K E ++
Sbjct: 237 KAVEMNGFQRSLCDYLSFYKLKELNS 262
>gi|340374112|ref|XP_003385582.1| PREDICTED: aprataxin and PNK-like factor-like [Amphimedon
queenslandica]
Length = 432
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 46/76 (60%), Gaps = 3/76 (3%)
Query: 27 LSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDGTNPVVVK-SG-DQRKKLS 83
LS+G + IGR + ++DKR+SR H T+ + D + S+ TNP K SG D++ +L
Sbjct: 15 LSKGEHTIGRGPLLKITDKRVSRNHATVKVNDDNAVSICPRHTNPCYYKPSGRDEQIQLK 74
Query: 84 SNEHVSIADGDIIELI 99
+ +++DGD I ++
Sbjct: 75 KDVWQTLSDGDQISIL 90
>gi|435853317|ref|YP_007314636.1| putative membrane-anchored protein [Halobacteroides halobius DSM
5150]
gi|433669728|gb|AGB40543.1| putative membrane-anchored protein [Halobacteroides halobius DSM
5150]
Length = 372
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 4/69 (5%)
Query: 220 LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 279
L++H DGT MKR K N + P P GT AMLL Y +G +IV H
Sbjct: 233 LIVHAYPDGTAPGMKRIKKLNLQAQRIPAP---GTSEDIAMLLAYEKGAELIVAVGTHTH 289
Query: 280 -VDWNNKSQ 287
+D+ K +
Sbjct: 290 MIDFLEKGR 298
>gi|146162654|ref|XP_001009833.2| FHA domain containing protein [Tetrahymena thermophila]
gi|146146354|gb|EAR89588.2| FHA domain containing protein [Tetrahymena thermophila SB210]
Length = 561
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 59/260 (22%), Positives = 108/260 (41%), Gaps = 29/260 (11%)
Query: 1 MSATKI----GYLVPLDNNLREDNSLPKLPLSQGPNVIGRT-NIPVSDKRLSRKHITLTA 55
MS+T I G L+P ++E L+Q +++GR ++ V + ++S +H L
Sbjct: 1 MSSTDIQQKWGELIPKGGLVQE-----TFVLNQKEHILGRRGDLKVDNPKVSGQHCVLKY 55
Query: 56 SADGSASLVVDGTNPVVVKSGDQ--RKKLSSNEHVSIADGDIIELIPGH-----HFFKYV 108
+ ++D V +G KKL N+ V + +GD++ L+ + FK V
Sbjct: 56 DYAQKKAYIID-----VSSNGTSLFNKKLEKNKEVELENGDLVNLLQDKSQWIGYIFKLV 110
Query: 109 TLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLR 168
S + D AT K++ Q EQ +N K +E +++L + ++
Sbjct: 111 DNVDSFNKDQQDTATKNNTDQKQLEQSLEQYKQNEKEQQEQNEQIKKMQNELEERLKKVK 170
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIVA-ILSNYMVDIDWLL-----PACPVLAKIPHVLVI 222
+ +CV D++ ++ L NY D L ACP + P +
Sbjct: 171 EDDEHFEKDQTCVVCIDLLYNPYLMTPCLHNYCCDCMCELLKNKDIACPQCREKPISVQK 230
Query: 223 HGESDGTLE-HMKRNKPANW 241
+ + + +E +KRN W
Sbjct: 231 NYQLNNLIEAFIKRNPDKKW 250
>gi|91786388|ref|YP_547340.1| ABC transporter-like protein [Polaromonas sp. JS666]
gi|91695613|gb|ABE42442.1| carbohydrate ABC transporter ATP-binding protein, CUT1 family
[Polaromonas sp. JS666]
Length = 360
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 20 NSLPKLPLSQGPNVIG----RTNIP-VSDKRLSRKHITLTASADGSASLVVDGTNPVVVK 74
N + LP+ QG ++G R +P VS +RL+ TLTA GSA + + VV+
Sbjct: 237 NLIAALPVGQGVQLVGGPVLRMAVPSVSAQRLA----TLTAGIRGSALRIEERAGDVVLA 292
Query: 75 SGDQRKKLSSNE---HVSIADGDIIELIPGHHFF 105
+ ++S ++ HV+ A G+++ + G H+F
Sbjct: 293 GRVELAEISGSDTFVHVATAAGELVAQLTGVHYF 326
>gi|320168830|gb|EFW45729.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 538
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 13/120 (10%)
Query: 6 IGYLVPL--DNNLREDNSLPKLPLSQGPNVIGR---TNIPVSDKRLSRKHITLTASADGS 60
+ LVPL R D + + L +G V+GR TN+ D+RLSR H + DG+
Sbjct: 4 LARLVPLLMPAASRPDPASKVVDLERGETVLGRGPLTNL--EDRRLSRNHAKIQIDHDGA 61
Query: 61 ASLVVDGTNPVVVKSGDQRKKLSSNEH------VSIADGDIIELIPGHHFFKYVTLSRSQ 114
A ++ V+ D S+E VS+ GD++ L+P F+ V L + Q
Sbjct: 62 AHIMSTHKTLCSVRRADAAGGDGSDEQLPLHTWVSLKHGDVLFLMPNAFPFRVVNLVKEQ 121
>gi|440802752|gb|ELR23681.1| hypothetical protein ACA1_073250 [Acanthamoeba castellanii str.
Neff]
Length = 294
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 42/74 (56%), Gaps = 6/74 (8%)
Query: 34 IGRTNIPVSDKRLSRKHITLTASADGSASLVVDGTNPVVV----KSGDQRKKLSSNEHVS 89
+GR + V+DKR+SR+ + ++ A + V+G NPV V K+GD + LS E
Sbjct: 22 LGRGVLGVTDKRISRRQLQISLRGPALA-VTVEGVNPVYVRRAGKAGDG-ELLSRGEEAI 79
Query: 90 IADGDIIELIPGHH 103
+ +GD++ L+ H
Sbjct: 80 LRNGDVVTLLADLH 93
>gi|296223668|ref|XP_002757728.1| PREDICTED: aprataxin and PNK-like factor [Callithrix jacchus]
Length = 478
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 11/105 (10%)
Query: 9 LVPLDNNLREDNSLPKLPLSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDG 67
L PLD P++ L+ G V+GR + ++DKR+SR+H L ADG +
Sbjct: 7 LQPLDGG-------PRVALASGETVVGRGPLLGITDKRVSRRHAILEV-ADGQLRIKPVH 58
Query: 68 TNPVVVKSGDQRK--KLSSNEHVSIADGDIIELIPGHHFFKYVTL 110
TNP +S ++ + L +N + GD L+ + F+ + +
Sbjct: 59 TNPCFYQSSEKSQLVPLKTNLWCCLNPGDSFSLLVDKYTFRVLAI 103
>gi|195572577|ref|XP_002104272.1| GD20873 [Drosophila simulans]
gi|194200199|gb|EDX13775.1| GD20873 [Drosophila simulans]
Length = 523
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 23/120 (19%)
Query: 27 LSQGPNVIGRT-NIPVSDKRLSRKHITLTASADGSA-SLVVDGTNPVVVKSGDQRKKLSS 84
L+ G N +GR+ + D + S++ I L + SL V G NP V +
Sbjct: 37 LTAGENFVGRSRETGIRDSKCSKRQIQLQVDLKKAVVSLKVLGVNPCGVNG----LMVMQ 92
Query: 85 NEHVSIADGDIIELIPGHHFFKYV-----------------TLSRSQKRVSNDGATNGEL 127
N + GD++E++ G H F+ V TLS S+K D A NG+L
Sbjct: 93 NSECELKHGDLVEIVYGRHPFEVVFNPPPEDDKEKAEPLSTTLSHSEKSERWDSAGNGKL 152
>gi|281205023|gb|EFA79217.1| hypothetical protein PPL_08045 [Polysphondylium pallidum PN500]
Length = 487
Score = 39.3 bits (90), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 48/94 (51%), Gaps = 2/94 (2%)
Query: 8 YLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLVVDG 67
+L+ L + + +N L + G IGR ++ +S+K+ SRK I + + L+ +G
Sbjct: 10 HLIHLKSINKAENLLDHTYKATGTYEIGRGSLGISEKKCSRKQILIKLDEHSNYYLISNG 69
Query: 68 TNPVVVKSGDQRK--KLSSNEHVSIADGDIIELI 99
NP +K D+ +++ +E + DGD ++
Sbjct: 70 INPSYLKKYDKDYFVQMTKDEEYVLEDGDSFSML 103
>gi|440791002|gb|ELR12258.1| UBA/TSN domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 615
Score = 39.3 bits (90), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 55/106 (51%), Gaps = 12/106 (11%)
Query: 24 KLPLSQGPNVI-GRTN--IPVSDKRLSRKHITLT-----ASADGSASLVVDGTNPVVV-- 73
++ LS G +++ GR + + +SDKR SR+ LT +D +LV G N V
Sbjct: 14 EVELSAGADIVMGRGSPLLGISDKRCSRRQAVLTFLPPATPSDQPFALVAHGPNTTFVRR 73
Query: 74 KSGDQRKKLSSNEHVSIADGDIIELIPGHH--FFKYVTLSRSQKRV 117
+ ++R+ ++ E + DGD+I L P +H + +++ Q++
Sbjct: 74 RGAEEREGMAKGEVYFLNDGDVIRLPPDYHPIVLRLISVGGEQEQT 119
>gi|145235397|ref|XP_001390347.1| hypothetical protein ANI_1_556034 [Aspergillus niger CBS 513.88]
gi|134058029|emb|CAK38258.1| unnamed protein product [Aspergillus niger]
gi|350632869|gb|EHA21236.1| hypothetical protein ASPNIDRAFT_54717 [Aspergillus niger ATCC 1015]
Length = 387
Score = 39.3 bits (90), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 64/158 (40%), Gaps = 35/158 (22%)
Query: 9 LVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSD----KRLSRKHITLTASADGSASLV 64
+ PLD+ E N LP L+ P + PVS+ R+ + A A +
Sbjct: 183 VTPLDHPHEEINDLPVHRLT-NPQIF----YPVSESRQFNRVDAGRVFSAAPALEHEQVA 237
Query: 65 VDGTNPV--------------VVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTL 110
D NP +V GD+ EH + D+ IP H VT
Sbjct: 238 KDAANPSEAISRVTQNPSHIELVGKGDE-------EHQVLQPADV--RIPHPHM---VTS 285
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEE 148
+R KRV N+GA + EL ++ QQD D E + ++E
Sbjct: 286 TRDIKRVPNEGAKHAELYQARLNQQDAADQERKRLAQE 323
>gi|321474170|gb|EFX85136.1| hypothetical protein DAPPUDRAFT_46356 [Daphnia pulex]
Length = 512
Score = 38.9 bits (89), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 60/123 (48%), Gaps = 12/123 (9%)
Query: 31 PNVIGRTNIP-VSDKRLSRKHITLTASAD-GSASLVVDGTNPVVVKSGDQRKKLSSNEHV 88
P VIGR + + D RLSR H+ L A + G S+ + G N K+G K + +E V
Sbjct: 23 PLVIGRGPLTRIKDPRLSRNHVELVADCEKGLLSVKLIGAN--ACKAGTSIIK-AKDESV 79
Query: 89 SIADGDIIELIPGHHFFKYV------TLSRSQKRVSNDGATNGELSSKKMRQQDE-QDNE 141
+ G+IIEL+ F+ + S+K S + + +KK + +D ++ +
Sbjct: 80 QLKHGEIIELLEKQFPFRVEFSPDPNQVPSSRKSTSAEDVQDPSFFAKKQKMEDTWEEID 139
Query: 142 NGK 144
NGK
Sbjct: 140 NGK 142
>gi|71907102|ref|YP_284689.1| cytochrome c oxidase, subunit I [Dechloromonas aromatica RCB]
gi|71846723|gb|AAZ46219.1| Cytochrome c oxidase, subunit I [Dechloromonas aromatica RCB]
Length = 531
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 205 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 264
WLLP +L +P L + G DG L W + PL + G A+L ++
Sbjct: 119 WLLPPAAILLTLPFSLALFGIGDGALA-------TGWTFYA-PLSVQGGMGVDFAILAVH 170
Query: 265 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
G+ I+ + N+I +N ++ G+ M PL
Sbjct: 171 ILGISSIMGSINIIVTIFNMRAPGMTMMKLPL 202
>gi|253995926|ref|YP_003047990.1| cytochrome c oxidase subunit I [Methylotenera mobilis JLW8]
gi|253982605|gb|ACT47463.1| cytochrome c oxidase, subunit I [Methylotenera mobilis JLW8]
Length = 530
Score = 38.9 bits (89), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 205 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 264
WLLP +L +P L + G DG L W + PPL I G A+ ++
Sbjct: 118 WLLPPSAILLTLPFTLALFGIGDGALA-------TGWTFY-PPLSIQGGIGVDFAIFAVH 169
Query: 265 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
G+ ++ + N+I +N ++ G+ + P+
Sbjct: 170 LLGISSVLGSINIIVTLFNMRAPGMTLMKMPM 201
>gi|257095684|ref|YP_003169325.1| cytochrome c oxidase subunit I [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257048208|gb|ACV37396.1| cytochrome c oxidase, subunit I [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
Length = 535
Score = 38.9 bits (89), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 205 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 264
WLLP L +P +L + G DG + W L+ PL + G A+ I+
Sbjct: 123 WLLPPAAALLTLPFILALFGIGDGAVN-------TGWTLYA-PLSVQGGMGVDFAIFSIH 174
Query: 265 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
GV I+ + N+I +N ++ G+ M PL
Sbjct: 175 ILGVSSILGSINIIVTIFNLRAPGMTMMKLPL 206
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,559,127,745
Number of Sequences: 23463169
Number of extensions: 462129661
Number of successful extensions: 1044282
Number of sequences better than 100.0: 523
Number of HSP's better than 100.0 without gapping: 282
Number of HSP's successfully gapped in prelim test: 241
Number of HSP's that attempted gapping in prelim test: 1041826
Number of HSP's gapped (non-prelim): 863
length of query: 626
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 477
effective length of database: 8,863,183,186
effective search space: 4227738379722
effective search space used: 4227738379722
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)