BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006675
(636 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359483320|ref|XP_002265078.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Vitis vinifera]
Length = 621
Score = 965 bits (2495), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/636 (73%), Positives = 531/636 (83%), Gaps = 15/636 (2%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
MS ++IG+LVPL+ NL ED S PKLP+ G NVIGR +I VSDKRLSRKH+TL AS +GS
Sbjct: 1 MSLSQIGFLVPLNRNLEEDTSTPKLPIPTGANVIGRNSISVSDKRLSRKHLTLIASGNGS 60
Query: 61 ASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSND 120
VV+GTNPVVV SG+QRKKL + E I + DIIELIPGH+FFKYVT++
Sbjct: 61 VDAVVEGTNPVVVASGNQRKKLRTGEKAVITNDDIIELIPGHYFFKYVTVA--------- 111
Query: 121 GATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSC 180
GE KK D Q+ E+ N +A+ +F + +D LP T+RLLRV+ LPAWANTS
Sbjct: 112 ----GEKCEKKGNSMDAQNMES--NEVKAIRHFSIPKDNLPLTYRLLRVKDLPAWANTSS 165
Query: 181 VSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPAN 240
VSIRDVIQGD+++A+LSNYMVDIDWLL +CP LAKIPHVLVIHGE DGTL+HMK+NKP N
Sbjct: 166 VSIRDVIQGDVLIAVLSNYMVDIDWLLSSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPN 225
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
WILHKPPLPISFGTHHSKAMLL+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K Q
Sbjct: 226 WILHKPPLPISFGTHHSKAMLLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQK 285
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
LS+ C FENDLIDYLS LKWPEF+ANLPA G+F IN SFFKKF++S+A VRLIASVPGY
Sbjct: 286 ELSKGCAFENDLIDYLSVLKWPEFTANLPALGSFNINSSFFKKFDYSNAVVRLIASVPGY 345
Query: 361 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
HTGS+LKKWGHMKL +VLQEC F+K F+KSPL YQFSSLGSLDEKWM EL+SSMSSG +
Sbjct: 346 HTGSNLKKWGHMKLCSVLQECIFDKEFQKSPLAYQFSSLGSLDEKWMTELASSMSSGSCD 405
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRS 480
DKTPLG+G+PLI+WPTVEDVRCSLEGYAAGNAIPSPQKNV+K+FLKKYWAKWKA+HTGR
Sbjct: 406 DKTPLGLGKPLIIWPTVEDVRCSLEGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTGRC 465
Query: 481 RAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 540
RAMPHIKT+ RYNGQ LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 466 RAMPHIKTYTRYNGQNLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPINR 525
Query: 541 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 600
G GFSCT N PS+ K G +E ++ Q+TKLVTLTW G+ + +SSEV+ LPVPYELPP++
Sbjct: 526 GQGFSCTDNGSPSKNKCGLSENTKSQRTKLVTLTWEGNRSSDSSSEVIPLPVPYELPPKQ 585
Query: 601 YSSEDVPWSWDKRYTKKDVYGQVWPRHFQLYAFQDS 636
YSSEDVPWSWD+RY KKDV GQVWPRH QLY+ DS
Sbjct: 586 YSSEDVPWSWDRRYYKKDVCGQVWPRHVQLYSSPDS 621
>gi|302144065|emb|CBI23170.3| unnamed protein product [Vitis vinifera]
Length = 678
Score = 964 bits (2492), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/678 (69%), Positives = 546/678 (80%), Gaps = 42/678 (6%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
MS ++IG+LVPL+ NL ED S PKLP+ G NVIGR +I VSDKRLSRKH+TL AS +GS
Sbjct: 1 MSLSQIGFLVPLNRNLEEDTSTPKLPIPTGANVIGRNSISVSDKRLSRKHLTLIASGNGS 60
Query: 61 ASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLS--RSQKRVS 118
VV+GTNPVVV SG+QRKKL + E I + DIIELIPGH+FFKYVT++ + +K+ +
Sbjct: 61 VDAVVEGTNPVVVASGNQRKKLRTGEKAVITNDDIIELIPGHYFFKYVTVAGEKCEKKGN 120
Query: 119 NDGATNGE-----LSSKKMRQ-----------QDEQDNE---------NGKN-------- 145
+ A N E LS K+MRQ Q E +N+ GK+
Sbjct: 121 SMDAQNMESNEVSLSRKRMRQVSEDEAFARKLQAEMENDVLVQERSLVTGKSGYSQASTA 180
Query: 146 -------SEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSN 198
+ EA+ +F + +D LP T+RLLRV+ LPAWANTS VSIRDVIQGD+++A+LSN
Sbjct: 181 SIPSSHMNSEAIRHFSIPKDNLPLTYRLLRVKDLPAWANTSSVSIRDVIQGDVLIAVLSN 240
Query: 199 YMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 258
YMVDIDWLL +CP LAKIPHVLVIHGE DGTL+HMK+NKP NWILHKPPLPISFGTHHSK
Sbjct: 241 YMVDIDWLLSSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPNWILHKPPLPISFGTHHSK 300
Query: 259 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 318
AMLL+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K Q LS+ C FENDLIDYLS
Sbjct: 301 AMLLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQKELSKGCAFENDLIDYLSV 360
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 378
LKWPEF+ANLPA G+F IN SFFKKF++S+A VRLIASVPGYHTGS+LKKWGHMKL +VL
Sbjct: 361 LKWPEFTANLPALGSFNINSSFFKKFDYSNAVVRLIASVPGYHTGSNLKKWGHMKLCSVL 420
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
QEC F+K F+KSPL YQFSSLGSLDEKWM EL+SSMSSG +DKTPLG+G+PLI+WPTVE
Sbjct: 421 QECIFDKEFQKSPLAYQFSSLGSLDEKWMTELASSMSSGSCDDKTPLGLGKPLIIWPTVE 480
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 498
DVRCSLEGYAAGNAIPSPQKNV+K+FLKKYWAKWKA+HTGR RAMPHIKT+ RYNGQ LA
Sbjct: 481 DVRCSLEGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTGRCRAMPHIKTYTRYNGQNLA 540
Query: 499 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 558
WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL LPS G GFSCT N PS+ K G
Sbjct: 541 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPINRGQGFSCTDNGSPSKNKCG 600
Query: 559 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKD 618
+E ++ Q+TKLVTLTW G+ + +SSEV+ LPVPYELPP++YSSEDVPWSWD+RY KKD
Sbjct: 601 LSENTKSQRTKLVTLTWEGNRSSDSSSEVIPLPVPYELPPKQYSSEDVPWSWDRRYYKKD 660
Query: 619 VYGQVWPRHFQLYAFQDS 636
V GQVWPRH QLY+ DS
Sbjct: 661 VCGQVWPRHVQLYSSPDS 678
>gi|255554997|ref|XP_002518536.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
gi|223542381|gb|EEF43923.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
Length = 665
Score = 951 bits (2458), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/633 (72%), Positives = 519/633 (81%), Gaps = 30/633 (4%)
Query: 5 KIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLV 64
KIG+LVPL NL ED S+PK+ LS+GPN IGR+++ VSDKRLSR H++LT S DGSA L
Sbjct: 62 KIGFLVPLKLNLEEDTSIPKISLSEGPNAIGRSHVSVSDKRLSRNHLSLTTSVDGSAFLT 121
Query: 65 VDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATN 124
+GTNPVV+KSGDQRKKLS E SI GD+IELIPGHHFFKY +G N
Sbjct: 122 PEGTNPVVIKSGDQRKKLSPGEKASINSGDVIELIPGHHFFKY----------EGEGECN 171
Query: 125 GELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIR 184
G KNSEEA+ F+V+ DKLP TFRL++V+GLPAWANTSCVSI
Sbjct: 172 G-----------------AKNSEEAIGKFNVNDDKLPLTFRLMKVKGLPAWANTSCVSIT 214
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH 244
DVIQGDI+ A+LSNYMVDIDWL+ ACP LAK+P+VLV+HGE DGTLEHMKR KPANWILH
Sbjct: 215 DVIQGDIVFAVLSNYMVDIDWLMSACPALAKVPNVLVLHGEGDGTLEHMKRTKPANWILH 274
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
KPPLPISFGTHHSKAMLL+YPRG+RIIVHTANLI+VDWNNK+QGLWMQDFP KD+ + ++
Sbjct: 275 KPPLPISFGTHHSKAMLLVYPRGMRIIVHTANLIYVDWNNKTQGLWMQDFPWKDEKSQTK 334
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
CGFENDL+DYL+TLKWPEF+ LPA G+F INPSFFKKF++S+AAVRLIASVPGYHTG
Sbjct: 335 GCGFENDLVDYLNTLKWPEFTVKLPALGSFTINPSFFKKFDYSTAAVRLIASVPGYHTGP 394
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
+LKKWGHMKLR+VLQECTF K FK SPL YQFSSLGSLD KWM EL++S+SSG SED+TP
Sbjct: 395 NLKKWGHMKLRSVLQECTFRKEFKNSPLAYQFSSLGSLDAKWMTELATSLSSGLSEDRTP 454
Query: 425 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMP 484
LG+GEP I+WPTVEDVRCSLEGYAAGNAIPSP KNV+KD LKKYW+KWKA+H+GR RAMP
Sbjct: 455 LGLGEPRIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEKDILKKYWSKWKATHSGRCRAMP 514
Query: 485 HIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCG 543
HIKTF RYNGQKLAW LLTSANLSKAAWGALQKNNSQLMIRSYELGVL LPS+ K HGC
Sbjct: 515 HIKTFTRYNGQKLAWLLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSSYKNHGCR 574
Query: 544 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 603
SCT + SE + G S+ KT+LVTL W G D SS+V+ LPVPYELPPQ YSS
Sbjct: 575 LSCTDHGARSEDEYGLLADSEEPKTELVTLMWQGPKD--PSSQVIPLPVPYELPPQPYSS 632
Query: 604 EDVPWSWDKRYTKKDVYGQVWPRHFQLYAFQDS 636
EDVPWSWD+RY+KKDVYGQVWPR QLY DS
Sbjct: 633 EDVPWSWDRRYSKKDVYGQVWPRLVQLYTSLDS 665
>gi|449479663|ref|XP_004155668.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cucumis sativus]
Length = 959
Score = 910 bits (2352), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/643 (68%), Positives = 509/643 (79%), Gaps = 20/643 (3%)
Query: 5 KIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLV 64
K+GYLVPLD NL DNS K+ LS+GPN IGR+N+ VS+KR+SRKHITLT S DGSA L+
Sbjct: 318 KVGYLVPLDKNLEVDNSGLKIRLSEGPNSIGRSNVLVSEKRISRKHITLTTSTDGSAKLL 377
Query: 65 VDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTL---SR------SQK 115
VDGTNPVV+ SGD RKKL E V I DGD+IELIPGH+ FKY + SR QK
Sbjct: 378 VDGTNPVVINSGDGRKKLGPRESVIIRDGDVIELIPGHYPFKYASHCFNSRPGSEDLGQK 437
Query: 116 RV--------SNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLL 167
RV S A E+ S Q NS EA+ NFH+ D+LP TFRLL
Sbjct: 438 RVRQVAHDKISERVAKRAEMGSPLENMQSGSSKSKEANSVEAIRNFHIPDDRLPMTFRLL 497
Query: 168 RVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
V+GLP WANTSCV I D+IQGDI+ A+LSNYMVDIDWL+PACP LAKIP VLVIHGE D
Sbjct: 498 SVKGLPPWANTSCVRITDIIQGDILFAVLSNYMVDIDWLIPACPTLAKIPQVLVIHGEGD 557
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ 287
GTL++MKR KPANWILHKPPLPISFGTHHSKA+ L+YPRG+R++VHTANLI+VDWNNKSQ
Sbjct: 558 GTLDNMKRKKPANWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQ 617
Query: 288 GLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
GLWMQDFP KDQN+ S C FE+DL+DYLS LKWPEF A+ P HGNF INP FF+KF++S
Sbjct: 618 GLWMQDFPWKDQNSSSRGCAFEDDLVDYLSALKWPEFPASFPGHGNFNINPYFFRKFDYS 677
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
AAVRLIASVPGYHTG LKKWGHMKLR+VLQEC F+K F++SPLVYQFSSLGSL+EKWM
Sbjct: 678 KAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECIFDKEFQRSPLVYQFSSLGSLNEKWM 737
Query: 408 AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK 467
AE ++S+SSGF+ DKTPLG+GEPLIVWPTVEDVRCSLEGYAAG+AIPSP KNV+K FL+K
Sbjct: 738 AEFAASLSSGFTPDKTPLGLGEPLIVWPTVEDVRCSLEGYAAGSAIPSPLKNVEKGFLRK 797
Query: 468 YWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSY 527
YWAKW + H+GR AMPHIKTFARYNGQKLAW +LTS+NLS+AAWGALQKNNSQLMIRSY
Sbjct: 798 YWAKWNSFHSGRCHAMPHIKTFARYNGQKLAWLVLTSSNLSQAAWGALQKNNSQLMIRSY 857
Query: 528 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI--QKTKLVTLTWHGSSDAGASS 585
ELGVL LP KR+ FSCT N ++ KS + S+ KT+LVTL W + + S
Sbjct: 858 ELGVLFLPQ-KRNDYSFSCTKNGGSAQNKSTVSRPSETLEGKTELVTLAWQENKKRESLS 916
Query: 586 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRHF 628
EV+ LP+PYELPPQ Y EDVPWSWD+RYT+KDV+G VWPR F
Sbjct: 917 EVIQLPIPYELPPQPYGPEDVPWSWDRRYTQKDVHGAVWPRQF 959
>gi|449434370|ref|XP_004134969.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cucumis sativus]
Length = 613
Score = 901 bits (2328), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/628 (68%), Positives = 505/628 (80%), Gaps = 15/628 (2%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ ++GYLVPLD NL DNS K+ LS+GPN IGR+N+ VS+KR+SRKHITLT S DGS
Sbjct: 1 MARLQVGYLVPLDKNLEVDNSGLKIRLSEGPNSIGRSNVLVSEKRISRKHITLTTSTDGS 60
Query: 61 ASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSND 120
A L+V+GTNPVV+ SGD RKKL E V I DGD+IELIPGH+ FKY + + + S D
Sbjct: 61 AKLLVEGTNPVVINSGDGRKKLGPRESVIIRDGDVIELIPGHYPFKYASHCFNSRPGSED 120
Query: 121 GATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSC 180
L K++RQ+ NS EA+ NFH+ D+LP TFRLL V+GLP WANTSC
Sbjct: 121 ------LGQKRVRQE--------ANSVEAIRNFHIPDDRLPMTFRLLSVKGLPPWANTSC 166
Query: 181 VSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPAN 240
V I D+IQGDI+ A+LSNYMVDIDWL+PACP LAK+P VLVIHGE DGTL++MKR KPAN
Sbjct: 167 VRITDIIQGDILFAVLSNYMVDIDWLIPACPALAKVPQVLVIHGEGDGTLDNMKRKKPAN 226
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
WILHKPPLPISFGTHHSKA+ L+YPRG+R++VHTANLI+VDWNNKSQGLWMQDFP KDQN
Sbjct: 227 WILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQN 286
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
+ S C FE+DL+DYLS LKWPEF A+ P HGNF INP FF+KF++S AAVRLIASVPGY
Sbjct: 287 SSSRGCAFEDDLVDYLSALKWPEFPASFPGHGNFNINPYFFRKFDYSKAAVRLIASVPGY 346
Query: 361 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
HTG LKKWGHMKLR+VLQEC F+K F++SPLVYQFSSLGSL+EKWMAE ++S+SSGF+
Sbjct: 347 HTGRYLKKWGHMKLRSVLQECIFDKEFQRSPLVYQFSSLGSLNEKWMAEFAASLSSGFTP 406
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRS 480
DKTPLG+GEPLIVWPTVEDVRCSLEGYAAG+A+PSP KNV+K FL KYWAKW + H+GR
Sbjct: 407 DKTPLGLGEPLIVWPTVEDVRCSLEGYAAGSAVPSPLKNVEKGFLTKYWAKWNSFHSGRC 466
Query: 481 RAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 540
AMPHIKTFARYNGQKLAW +LTS+NLS+AAWGALQKNNSQLMIRSYELGVL LP KR+
Sbjct: 467 HAMPHIKTFARYNGQKLAWLVLTSSNLSQAAWGALQKNNSQLMIRSYELGVLFLPQ-KRN 525
Query: 541 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 600
FSCT N ++ + KT+LVTL W + + SEV+ LP+PYELPPQ
Sbjct: 526 DYSFSCTKNGGSAQSTVSRPSETLEGKTELVTLAWQENKKRESLSEVIQLPIPYELPPQP 585
Query: 601 YSSEDVPWSWDKRYTKKDVYGQVWPRHF 628
Y EDVPWSW++RYT+KDV+G VWPR F
Sbjct: 586 YGPEDVPWSWERRYTQKDVHGAVWPRQF 613
>gi|356507524|ref|XP_003522514.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like isoform 1 [Glycine
max]
Length = 610
Score = 899 bits (2324), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/624 (71%), Positives = 515/624 (82%), Gaps = 22/624 (3%)
Query: 5 KIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLV 64
++GYLVPL+ N +E+ S+PK +S G NVIGR NIPV DKRLSRKH+TLTAS +GSASL+
Sbjct: 6 QVGYLVPLNRNFKEEASVPKFAVSDGINVIGRNNIPVPDKRLSRKHLTLTASPNGSASLL 65
Query: 65 VDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATN 124
V+GTNP+VV SG++R+KL+ E +I +GDIIELIPGHH FKY L
Sbjct: 66 VEGTNPIVVNSGNKRRKLNPKEEATICNGDIIELIPGHHLFKYQVLGG------------ 113
Query: 125 GELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIR 184
R D + + NS EA+ NFHV D++PSTFRLL VQGLP WANTSCVSI
Sbjct: 114 --------RNADARKSSGEDNSVEAIRNFHVPSDQIPSTFRLLHVQGLPPWANTSCVSIG 165
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH 244
DVIQGDI VAILSNYMVDIDWL+PACP L+K+PHVLVIHGESDG ++++KR+KPANWILH
Sbjct: 166 DVIQGDIKVAILSNYMVDIDWLVPACPALSKVPHVLVIHGESDGRVDYIKRSKPANWILH 225
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
KP LPISFGTHHSKAM+LIYP+GVR+IVHTANLI+VDWNNKSQGLWMQDFP KDQN+LS+
Sbjct: 226 KPSLPISFGTHHSKAMMLIYPQGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKDQNSLSK 285
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
GFENDL++YLS LKWPEFS NLP G+ I PSFF+KF++S A VRLIASVPGYH+GS
Sbjct: 286 GSGFENDLVEYLSVLKWPEFSVNLPFLGSVSICPSFFRKFDYSDARVRLIASVPGYHSGS 345
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
SLKKWGHMKLR++LQECTF++ FKKSPLVYQFSSLGSLDEKWM EL+SSMS+G SEDKTP
Sbjct: 346 SLKKWGHMKLRSLLQECTFDEEFKKSPLVYQFSSLGSLDEKWMTELASSMSAGLSEDKTP 405
Query: 425 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMP 484
LG+GEP I+WPTVEDVRCSLEGYAAGNA+PSP KNV+K FLKKYWAKWKA HTGR RAMP
Sbjct: 406 LGMGEPQIIWPTVEDVRCSLEGYAAGNAVPSPLKNVEKTFLKKYWAKWKADHTGRCRAMP 465
Query: 485 HIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCG 543
HIKTFARY Q LAWFLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LPS KRH
Sbjct: 466 HIKTFARYKNQSLAWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPSLFKRHESV 525
Query: 544 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY-LPVPYELPPQRYS 602
FSCTSN+ SE K + E+S+++KTKLVTLT +SSEV+ LP+PYELPP YS
Sbjct: 526 FSCTSNVTVSEDKCPARESSEMKKTKLVTLTGIKKESMHSSSEVIIPLPLPYELPPLPYS 585
Query: 603 SEDVPWSWDKRYTKKDVYGQVWPR 626
S+D+PWSWD++Y KKDVYG VWPR
Sbjct: 586 SQDIPWSWDRQYNKKDVYGHVWPR 609
>gi|356507526|ref|XP_003522515.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like isoform 2 [Glycine
max]
Length = 599
Score = 891 bits (2303), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/624 (71%), Positives = 514/624 (82%), Gaps = 33/624 (5%)
Query: 5 KIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLV 64
++GYLVPL+ N +E+ S+PK +S G NVIGR NIPV DKRLSRKH+TLTAS +GSASL+
Sbjct: 6 QVGYLVPLNRNFKEEASVPKFAVSDGINVIGRNNIPVPDKRLSRKHLTLTASPNGSASLL 65
Query: 65 VDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATN 124
V+GTNP+VV SG++R+KL+ E +I +GDIIELIPGHH FKY ++
Sbjct: 66 VEGTNPIVVNSGNKRRKLNPKEEATICNGDIIELIPGHHLFKY--------------QSS 111
Query: 125 GELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIR 184
GE NS EA+ NFHV D++PSTFRLL VQGLP WANTSCVSI
Sbjct: 112 GE-----------------DNSVEAIRNFHVPSDQIPSTFRLLHVQGLPPWANTSCVSIG 154
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH 244
DVIQGDI VAILSNYMVDIDWL+PACP L+K+PHVLVIHGESDG ++++KR+KPANWILH
Sbjct: 155 DVIQGDIKVAILSNYMVDIDWLVPACPALSKVPHVLVIHGESDGRVDYIKRSKPANWILH 214
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
KP LPISFGTHHSKAM+LIYP+GVR+IVHTANLI+VDWNNKSQGLWMQDFP KDQN+LS+
Sbjct: 215 KPSLPISFGTHHSKAMMLIYPQGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKDQNSLSK 274
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
GFENDL++YLS LKWPEFS NLP G+ I PSFF+KF++S A VRLIASVPGYH+GS
Sbjct: 275 GSGFENDLVEYLSVLKWPEFSVNLPFLGSVSICPSFFRKFDYSDARVRLIASVPGYHSGS 334
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
SLKKWGHMKLR++LQECTF++ FKKSPLVYQFSSLGSLDEKWM EL+SSMS+G SEDKTP
Sbjct: 335 SLKKWGHMKLRSLLQECTFDEEFKKSPLVYQFSSLGSLDEKWMTELASSMSAGLSEDKTP 394
Query: 425 LGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMP 484
LG+GEP I+WPTVEDVRCSLEGYAAGNA+PSP KNV+K FLKKYWAKWKA HTGR RAMP
Sbjct: 395 LGMGEPQIIWPTVEDVRCSLEGYAAGNAVPSPLKNVEKTFLKKYWAKWKADHTGRCRAMP 454
Query: 485 HIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-KRHGCG 543
HIKTFARY Q LAWFLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LPS KRH
Sbjct: 455 HIKTFARYKNQSLAWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPSLFKRHESV 514
Query: 544 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY-LPVPYELPPQRYS 602
FSCTSN+ SE K + E+S+++KTKLVTLT +SSEV+ LP+PYELPP YS
Sbjct: 515 FSCTSNVTVSEDKCPARESSEMKKTKLVTLTGIKKESMHSSSEVIIPLPLPYELPPLPYS 574
Query: 603 SEDVPWSWDKRYTKKDVYGQVWPR 626
S+D+PWSWD++Y KKDVYG VWPR
Sbjct: 575 SQDIPWSWDRQYNKKDVYGHVWPR 598
>gi|224078752|ref|XP_002305614.1| predicted protein [Populus trichocarpa]
gi|222848578|gb|EEE86125.1| predicted protein [Populus trichocarpa]
Length = 599
Score = 872 bits (2252), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/628 (69%), Positives = 505/628 (80%), Gaps = 31/628 (4%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ + I YLVPL +L E+ S+PKLPLS G N IGR +I SDKRLSR H++LT S S
Sbjct: 1 MTHSPIAYLVPLSPSLEENASIPKLPLSNGQNTIGRNDISASDKRLSRNHLSLTLSLT-S 59
Query: 61 ASLVVDGTNPV-VVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSN 119
+++ V+GTNPV VVKSG +R+KL + E I + DIIELIPG++F+KYV + S
Sbjct: 60 STITVEGTNPVAVVKSGKRRRKLRAGEKAEIINDDIIELIPGNYFYKYVEME------SG 113
Query: 120 DGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTS 179
N E EEA+ +F VS D+L TFRLLRV+ LPAWANTS
Sbjct: 114 GPPRNCE--------------------EEAIRDFGVSEDELALTFRLLRVKELPAWANTS 153
Query: 180 CVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
CVSI DVI+GDI+VAILSNYMVD+DWLL ACP +AK+P+V+VIHGE DGTLEHMKR KPA
Sbjct: 154 CVSINDVIKGDILVAILSNYMVDMDWLLSACPTIAKVPNVMVIHGEGDGTLEHMKRRKPA 213
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ 299
NWILHKP LPISFGTHHSKAM L+YPRGVR+IVHTANLI+VDWNNKSQGLWMQDFP K++
Sbjct: 214 NWILHKPRLPISFGTHHSKAMFLVYPRGVRVIVHTANLIYVDWNNKSQGLWMQDFPWKEE 273
Query: 300 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
+ CGFENDL+DYLS LKWPEF+ LP G+ IN SFFKKF++S AAVRLIASVPG
Sbjct: 274 KKPGKGCGFENDLVDYLSMLKWPEFTVKLPNLGSISINASFFKKFDYSHAAVRLIASVPG 333
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFS 419
YHTG++L+KWGHMKL++VLQECTF+ FK+SPLVYQFSSLGSLDEKWM EL+ SMSSG++
Sbjct: 334 YHTGANLRKWGHMKLQSVLQECTFDNEFKRSPLVYQFSSLGSLDEKWMTELAISMSSGYA 393
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR 479
EDKTPLG+G P I+WPTVEDVRCSLEGYAAGNAIP P KNV+K FLKKYWAKWKASH+GR
Sbjct: 394 EDKTPLGLGVPQIIWPTVEDVRCSLEGYAAGNAIPGPLKNVEKGFLKKYWAKWKASHSGR 453
Query: 480 SRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA-K 538
RAMPHIKTF RYNGQKLAWFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS+ +
Sbjct: 454 CRAMPHIKTFTRYNGQKLAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSSIR 513
Query: 539 RHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 598
R+G GFSCTSN PS GS S+ +T LVTL W G+SD ++S+V+ LPVPYELPP
Sbjct: 514 RYGSGFSCTSNGGPSMDNCGSLVDSEELRTTLVTLKWQGTSD--SASKVIPLPVPYELPP 571
Query: 599 QRYSSEDVPWSWDKRYTKKDVYGQVWPR 626
YSSEDVPWSWD+RY+KKDVYGQVWPR
Sbjct: 572 IPYSSEDVPWSWDRRYSKKDVYGQVWPR 599
>gi|297811655|ref|XP_002873711.1| hypothetical protein ARALYDRAFT_488358 [Arabidopsis lyrata subsp.
lyrata]
gi|297319548|gb|EFH49970.1| hypothetical protein ARALYDRAFT_488358 [Arabidopsis lyrata subsp.
lyrata]
Length = 612
Score = 869 bits (2246), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/631 (67%), Positives = 502/631 (79%), Gaps = 24/631 (3%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ +++ YL+PL +L+ED+S P++ LS+GPN IGR N+ + DKRLSRKHIT+ AS GS
Sbjct: 1 MAHSQVAYLIPLKADLKEDDSSPRITLSEGPNFIGRGNVSIVDKRLSRKHITIMASTSGS 60
Query: 61 ASLVVDGTNPVVVKS--GDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
ASL V+GTNPVV++S G +RKK+ E VS+++ D+IELIPGHHFFK V L +K
Sbjct: 61 ASLSVEGTNPVVIRSSGGGERKKVKPREEVSVSNDDLIELIPGHHFFKLVLLPVEKK--- 117
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT 178
+ E ++KK R+ ++ EA+ F +KLPSTFRLL V GLP WANT
Sbjct: 118 ----GSHERATKKARKAEDD--------VEAIRRFCPPNEKLPSTFRLLSVNGLPDWANT 165
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
SCVSI DVI+GDI+ AILSNYMVD+DWL+ ACP LA IP V+VIHGE DG E+++R KP
Sbjct: 166 SCVSINDVIEGDIVAAILSNYMVDVDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKP 225
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
NWILHKP LPISFGTHHSKA+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD
Sbjct: 226 VNWILHKPRLPISFGTHHSKAIFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKD 285
Query: 299 QN-NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ + + CGFE DLIDYL+ LKWPEFSANLP GN KIN +FFKKF++S A VRLIASV
Sbjct: 286 DDKDPPKGCGFEGDLIDYLTVLKWPEFSANLPGRGNVKINAAFFKKFDYSDAKVRLIASV 345
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 417
PGYHTG +LKKWGHMKLRT+LQEC F++ F +SPLVYQFSSLGSLDEKW+AE +S+SSG
Sbjct: 346 PGYHTGLNLKKWGHMKLRTILQECIFDREFCRSPLVYQFSSLGSLDEKWLAEFGNSLSSG 405
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
SEDKTPLG G+PLI+WPTVEDVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+W A H+
Sbjct: 406 ISEDKTPLGPGDPLIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWTADHS 465
Query: 478 GRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 537
R RAMPHIKTF RYN QKLAWFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 466 ARGRAMPHIKTFTRYNDQKLAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSP 525
Query: 538 -KRHGCGFSCTSNIVPSEIKSGSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYE 595
K GC FSCT + PS +K+ + +K +KLVT+TW G D S E++ LP+PYE
Sbjct: 526 IKTQGCIFSCTES-NPSTMKAKQERKDEAEKRSKLVTMTWQGDRD---SPEIISLPIPYE 581
Query: 596 LPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 626
LPP+ YS+EDVPWSWD+ Y+KKDVYGQVWPR
Sbjct: 582 LPPKPYSAEDVPWSWDRGYSKKDVYGQVWPR 612
>gi|22326821|ref|NP_197021.2| tyrosyl-DNA phosphodiesterase 1 [Arabidopsis thaliana]
gi|23297734|gb|AAN13014.1| unknown protein [Arabidopsis thaliana]
gi|226511716|gb|ACO60340.1| tyrosyl-DNA phosphodiesterase I [Arabidopsis thaliana]
gi|332004741|gb|AED92124.1| tyrosyl-DNA phosphodiesterase 1 [Arabidopsis thaliana]
Length = 605
Score = 862 bits (2226), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/631 (66%), Positives = 498/631 (78%), Gaps = 31/631 (4%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ +++ YL+PL +L+EDNS P++ LS+GPN+IGR N+ + DKRLSRKHIT+ S GS
Sbjct: 1 MAHSQVAYLIPLKADLKEDNSSPRITLSEGPNIIGRGNVSIVDKRLSRKHITIIVSTSGS 60
Query: 61 ASLVVDGTNPVVVKS-GD-QRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
ASL VDGTNPVV++S GD +RKK+ +E VS+ + D+IELIPGHHFFK V L
Sbjct: 61 ASLSVDGTNPVVIRSSGDGERKKVKPSEEVSVCNDDLIELIPGHHFFKLVLL-------- 112
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT 178
NG + K + +D+ EA+ F +KLPSTFRLL V LP WANT
Sbjct: 113 -----NGRAAKKARKAEDDV---------EAIRRFCPPNEKLPSTFRLLSVDALPDWANT 158
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
SCVSI DVI+GD++ AILSNYMVDIDWL+ ACP LA IP V+VIHGE DG E+++R KP
Sbjct: 159 SCVSINDVIEGDVVAAILSNYMVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKP 218
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
ANWILHKP LPISFGTHHSKA+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD
Sbjct: 219 ANWILHKPRLPISFGTHHSKAIFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKD 278
Query: 299 QN-NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ + + CGFE DLIDYL+ LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASV
Sbjct: 279 DDKDPPKGCGFEGDLIDYLNVLKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASV 338
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 417
PGYHTG +L KWGHMKLRT+LQEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG
Sbjct: 339 PGYHTGFNLNKWGHMKLRTILQECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSG 398
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
+EDKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+WKA H+
Sbjct: 399 ITEDKTPLGPGDSLIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWKADHS 458
Query: 478 GRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 537
R RAMPHIKTF RYN QK+AWFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 459 ARGRAMPHIKTFTRYNDQKIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSP 518
Query: 538 -KRHGCGFSCTSNIVPSEIKSGSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYE 595
K GC FSCT + PS +K+ +++K +KLVT+TW G D E++ LPVPY+
Sbjct: 519 IKTQGCVFSCTES-NPSVMKAKQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQ 574
Query: 596 LPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 626
LPP+ YS EDVPWSWD+ Y+KKDVYGQVWPR
Sbjct: 575 LPPKPYSPEDVPWSWDRGYSKKDVYGQVWPR 605
>gi|17381098|gb|AAL36361.1| unknown protein [Arabidopsis thaliana]
Length = 605
Score = 860 bits (2223), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/631 (66%), Positives = 498/631 (78%), Gaps = 31/631 (4%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ +++ YL+PL +L+EDNS P++ LS+GPN+IGR N+ + DKRLSRKHIT+ S GS
Sbjct: 1 MAHSQVAYLIPLKADLKEDNSSPRITLSEGPNIIGRGNVSIVDKRLSRKHITIIVSTSGS 60
Query: 61 ASLVVDGTNPVVVKS-GD-QRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
ASL VDGTNPVV++S GD +RKK+ +E VS+ + D+IELIPGHHFFK V L
Sbjct: 61 ASLSVDGTNPVVIRSSGDGERKKVKPSEEVSVCNDDLIELIPGHHFFKLVLL-------- 112
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT 178
NG + K + +D+ EA+ F +KLPSTFRLL V LP WANT
Sbjct: 113 -----NGRAAKKARKAEDDV---------EAIRRFCPPNEKLPSTFRLLSVDALPDWANT 158
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
SCVSI DVI+GD++ AILSNYMVDIDWL+ ACP LA IP V+VIHGE DG E+++R KP
Sbjct: 159 SCVSINDVIEGDVVAAILSNYMVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKP 218
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
ANWILHKP LPISFGTHHSKA+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD
Sbjct: 219 ANWILHKPRLPISFGTHHSKAIFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKD 278
Query: 299 QN-NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ + + CGFE DLIDYL+ LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASV
Sbjct: 279 DDKDPPKGCGFEGDLIDYLNVLKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASV 338
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 417
PGYHTG +L KWGHMKLRT+LQEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG
Sbjct: 339 PGYHTGFNLNKWGHMKLRTILQECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSG 398
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
+EDKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSP KNV++ FLKKYWA+WKA H+
Sbjct: 399 ITEDKTPLGPGDSLIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEEPFLKKYWARWKADHS 458
Query: 478 GRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 537
R RAMPHIKTF RYN QK+AWFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 459 ARGRAMPHIKTFTRYNDQKIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSP 518
Query: 538 -KRHGCGFSCTSNIVPSEIKSGSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYE 595
K GC FSCT + PS +K+ +++K +KLVT+TW G D E++ LPVPY+
Sbjct: 519 IKTQGCVFSCTES-NPSVMKAKQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQ 574
Query: 596 LPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 626
LPP+ YS EDVPWSWD+ Y+KKDVYGQVWPR
Sbjct: 575 LPPKPYSPEDVPWSWDRGYSKKDVYGQVWPR 605
>gi|7671486|emb|CAB89327.1| putative protein [Arabidopsis thaliana]
Length = 627
Score = 818 bits (2114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/609 (65%), Positives = 477/609 (78%), Gaps = 31/609 (5%)
Query: 1 MSATKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGS 60
M+ +++ YL+PL +L+EDNS P++ LS+GPN+IGR N+ + DKRLSRKHIT+ S GS
Sbjct: 1 MAHSQVAYLIPLKADLKEDNSSPRITLSEGPNIIGRGNVSIVDKRLSRKHITIIVSTSGS 60
Query: 61 ASLVVDGTNPVVVKS-GD-QRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
ASL VDGTNPVV++S GD +RKK+ +E VS+ + D+IELIPGHHFFK V L
Sbjct: 61 ASLSVDGTNPVVIRSSGDGERKKVKPSEEVSVCNDDLIELIPGHHFFKLVLL-------- 112
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT 178
NG + K + +D+ EA+ F +KLPSTFRLL V LP WANT
Sbjct: 113 -----NGRAAKKARKAEDDV---------EAIRRFCPPNEKLPSTFRLLSVDALPDWANT 158
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
SCVSI DVI+GD++ AILSNYMVDIDWL+ ACP LA IP V+VIHGE DG E+++R KP
Sbjct: 159 SCVSINDVIEGDVVAAILSNYMVDIDWLMSACPKLANIPQVMVIHGEGDGRQEYIQRKKP 218
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
ANWILHKP LPISFGTHHSKA+ L+YPRGVR++VHTANLIHVDWNNKSQGLWMQDFP KD
Sbjct: 219 ANWILHKPRLPISFGTHHSKAIFLVYPRGVRVVVHTANLIHVDWNNKSQGLWMQDFPWKD 278
Query: 299 QN-NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ + + CGFE DLIDYL+ LKWPEF+ANLP GN KIN +FFKKF++S A VRLIASV
Sbjct: 279 DDKDPPKGCGFEGDLIDYLNVLKWPEFTANLPGRGNVKINAAFFKKFDYSDATVRLIASV 338
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSG 417
PGYHTG +L KWGHMKLRT+LQEC F++ F++SPL+YQFSSLGSLDEKW+AE +S+SSG
Sbjct: 339 PGYHTGFNLNKWGHMKLRTILQECIFDREFRRSPLIYQFSSLGSLDEKWLAEFGNSLSSG 398
Query: 418 FSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
+EDKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSP KNV+K FLKKYWA+WKA H+
Sbjct: 399 ITEDKTPLGPGDSLIIWPTVEDVRCSLEGYAAGNAIPSPLKNVEKPFLKKYWARWKADHS 458
Query: 478 GRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS- 536
R RAMPHIKTF RYN QK+AWFLLTS+NLSKAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 459 ARGRAMPHIKTFTRYNDQKIAWFLLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPSP 518
Query: 537 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQK-TKLVTLTWHGSSDAGASSEVVYLPVPYE 595
K GC FSCT + PS +K+ +++K +KLVT+TW G D E++ LPVPY+
Sbjct: 519 IKTQGCVFSCTES-NPSVMKAKQETKDEVEKRSKLVTMTWQGDRDL---PEIISLPVPYQ 574
Query: 596 LPPQRYSSE 604
LPP+ YS E
Sbjct: 575 LPPKPYSPE 583
>gi|326504850|dbj|BAK06716.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 669
Score = 804 bits (2076), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/671 (57%), Positives = 486/671 (72%), Gaps = 52/671 (7%)
Query: 2 SATKIGYLVP-LDNNLREDNS----LPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTAS 56
S ++G LVP ++ N+ +P +P+ +G NV+GR+N+ DKR+SRKH++L A
Sbjct: 5 SRVRVGTLVPFVEGKSGSPNASSLPMPSIPIFEGSNVVGRSNLVAVDKRVSRKHLSLRAV 64
Query: 57 ADGSASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKR 116
DGS +VV+GTNP+VV+S QR+K+ + + I D++ELIPG +F KYV +S +++
Sbjct: 65 PDGSVEVVVEGTNPIVVRSEGQRRKVCAQQRAKIMPDDVLELIPGEYFMKYVNMS-DERK 123
Query: 117 VSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSR------------------- 157
+S A+ KK ++ E+D+ K + + + + ++R
Sbjct: 124 IS---ASVDSHDLKKGKRHSEEDSVAAKRNRQVMEDEALARTLQESFAEESASVTEVLSS 180
Query: 158 ---------------------DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL 196
D LP +FRL+RVQGLP+W NTS V+I+DVIQG++++A+L
Sbjct: 181 LDSAGSSERNKERTHSVGPLKDVLPLSFRLMRVQGLPSWTNTSTVTIQDVIQGEVLLAVL 240
Query: 197 SNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 256
SNYMVD+DWLL ACP L K+PHVLV+HGE +LE +K+ KP NWILHKPPLPISFGTHH
Sbjct: 241 SNYMVDMDWLLTACPSLRKVPHVLVLHGEDGASLERLKKTKPTNWILHKPPLPISFGTHH 300
Query: 257 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 316
SKAMLL+YP+G+R++VHTANLIHVDWNNKSQGLW QDFP K+ N++S GFENDL+DYL
Sbjct: 301 SKAMLLVYPQGIRVVVHTANLIHVDWNNKSQGLWAQDFPWKEANDMSTNIGFENDLVDYL 360
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRT 376
LKWPEF NLP G+ IN +FF+KF++SS+ VRLI SVPGYH G ++KKWGHMKLR+
Sbjct: 361 RALKWPEFRVNLPVVGDVNINAAFFRKFDYSSSTVRLIGSVPGYHVGPNMKKWGHMKLRS 420
Query: 377 VLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPT 436
VL+EC FEK F KSPL+YQFSSLGSLDEKWM+E + S+S+G ++D + LGIG+PLIVWPT
Sbjct: 421 VLEECVFEKQFCKSPLIYQFSSLGSLDEKWMSEFACSLSAGKADDGSQLGIGKPLIVWPT 480
Query: 437 VEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK 496
VEDVRCS+EGYAAG+ IPSPQKNV+KDFL+KYW++WKA H GR RAMPHIKTF RYNGQ
Sbjct: 481 VEDVRCSIEGYAAGSCIPSPQKNVEKDFLRKYWSRWKADHVGRCRAMPHIKTFTRYNGQN 540
Query: 497 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 556
+AWFLLTS+NLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT S
Sbjct: 541 IAWFLLTSSNLSKAAWGALQKNNTQLMIRSYELGVLFLPKTLQSVPQFSCTDK---SRSN 597
Query: 557 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 616
+ KTKLVTL W G + S+EVV LPVPY+LPPQ Y EDVPWSWD+RYTK
Sbjct: 598 LDKLALGKNIKTKLVTLCWKGDEEKDPSAEVVRLPVPYQLPPQLYGPEDVPWSWDRRYTK 657
Query: 617 KDVYGQVWPRH 627
KDVYG VW RH
Sbjct: 658 KDVYGSVWSRH 668
>gi|357122586|ref|XP_003562996.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Brachypodium
distachyon]
Length = 671
Score = 804 bits (2076), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/671 (57%), Positives = 483/671 (71%), Gaps = 50/671 (7%)
Query: 2 SATKIGYLVPLDNNLRED--NSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVP SLP +P+ +G NV+GR+N+ V DKR+SRKH++L SADG
Sbjct: 5 SRVRVGTLVPFGEGKAGSLGASLPSIPIFEGSNVVGRSNLVVVDKRVSRKHLSLRVSADG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSN 119
S +VV+G NP+VV+S QR+++ + E I D++ELIPG +F KYV + K S+
Sbjct: 65 SIEVVVEGPNPIVVQSEGQRRRVCAKERAKIIHDDVLELIPGDYFVKYVNMGDEHK--SS 122
Query: 120 DGATNGELSSKKMRQQDE-----------QDNENGKNSEEALCNFHVS------------ 156
+ +L K +++E +D + +E+ +S
Sbjct: 123 TPVDSNDLKKGKRHREEECVVAKRNRQIVEDEALARTLQESFAEETMSATGMACVQVSSS 182
Query: 157 --------------------RDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL 196
+D LP TFRL+RVQGLP+W NTS V+I+DVIQG++++A+L
Sbjct: 183 LDSAGSSERNNERMHSAGSLKDVLPLTFRLMRVQGLPSWTNTSAVTIQDVIQGEVLLAVL 242
Query: 197 SNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 256
SNYMVD+DWLL ACP L K+PHVLV+HGE +LEH+K++KPANWILHKPPLPI+FGTHH
Sbjct: 243 SNYMVDMDWLLTACPSLRKVPHVLVLHGEDGASLEHLKKSKPANWILHKPPLPITFGTHH 302
Query: 257 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 316
SKAMLL+YP+G+R++VHTANLIHVDWNNKSQGLW QDFP KD ++++ FE+DL+DYL
Sbjct: 303 SKAMLLVYPQGIRVVVHTANLIHVDWNNKSQGLWTQDFPWKDTKDMNKNISFESDLVDYL 362
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRT 376
S LKWPEF LP G+ IN +FF+KF++SS+ VRLI SVPGYH G ++KKWGHMKLR+
Sbjct: 363 SALKWPEFRIKLPVAGDVNINAAFFRKFDYSSSTVRLIGSVPGYHVGPNIKKWGHMKLRS 422
Query: 377 VLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPT 436
VL+ C FEK F KSPL+YQFSSLGSLDEKWM E + S+S+G ++D +PLGIG+PLIVWPT
Sbjct: 423 VLEGCVFEKQFCKSPLIYQFSSLGSLDEKWMTEFACSLSAGKADDGSPLGIGKPLIVWPT 482
Query: 437 VEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK 496
VEDVRCS+EGYAAG+ IPSPQKNV+KDFL+KYW++WKA H GR AMPHIKTFARYNGQ
Sbjct: 483 VEDVRCSIEGYAAGSCIPSPQKNVEKDFLRKYWSRWKADHVGRCHAMPHIKTFARYNGQN 542
Query: 497 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 556
+AWFLLTS+NLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT +
Sbjct: 543 IAWFLLTSSNLSKAAWGALQKNNTQLMIRSYELGVLFLPKTLQSVSRFSCTEK---NHSN 599
Query: 557 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 616
G+ + KTKLVTL W + S+EV+ LPVPY+LPPQ Y EDVPWSWD+RYTK
Sbjct: 600 LGNLTLGKTIKTKLVTLCWKDDEEKEPSAEVIRLPVPYQLPPQLYGPEDVPWSWDRRYTK 659
Query: 617 KDVYGQVWPRH 627
KDVYG VWPRH
Sbjct: 660 KDVYGAVWPRH 670
>gi|293331809|ref|NP_001169273.1| uncharacterized protein LOC100383136 [Zea mays]
gi|224028313|gb|ACN33232.1| unknown [Zea mays]
gi|414886956|tpg|DAA62970.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
gi|414886957|tpg|DAA62971.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
Length = 665
Score = 796 bits (2055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/668 (58%), Positives = 482/668 (72%), Gaps = 50/668 (7%)
Query: 2 SATKIGYLVPL--DNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVPL DN + S+ +P+ QGPNV+GR ++ V DKR+SRKH++L AS DG
Sbjct: 5 SRVRLGTLVPLTKDNAGSSNGSVSSIPIFQGPNVVGRDHLVVVDKRISRKHLSLHASTDG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQ----- 114
S +VV+G NP++V+S QR+K+ + E IA GD++ELIPG +F KYV +
Sbjct: 65 SIEVVVEGPNPIIVRSKGQRRKVCAKETAKIAHGDVLELIPGDYFVKYVDMGDEHVPMHL 124
Query: 115 -----------------KRV------------------SNDGATNGELSSKKMRQQDEQD 139
KR+ ++D A +G S +K+ D
Sbjct: 125 SDLMKGKRYSEEHGAAVKRIRQIMEDEALAKTLQESFAADDAAVSGMPSGQKISSHDSAG 184
Query: 140 NENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNY 199
+ N + +D LP TFRL+ VQGLP+W NTS V+I+DVIQG++++A+LSNY
Sbjct: 185 SSERNNDRTH--SVGPLKDMLPLTFRLMHVQGLPSWTNTSSVTIQDVIQGEVLLAVLSNY 242
Query: 200 MVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKA 259
MVDIDWLL ACP L K+PHVLV+HG+ +LE MK+ KPANWILH+PPLPISFGTHHSKA
Sbjct: 243 MVDIDWLLTACPSLRKVPHVLVLHGQDGASLELMKKLKPANWILHRPPLPISFGTHHSKA 302
Query: 260 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTL 319
MLL+YP+G+RI+VHTANLIHVDWN KSQGLWMQDFP KD +++++ FENDL+DYLS L
Sbjct: 303 MLLVYPQGIRIVVHTANLIHVDWNYKSQGLWMQDFPWKDTVDMNKKTAFENDLVDYLSAL 362
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ 379
KWPEF NLP G+ IN +FF+KF++S++ VRLI SVPGYH GS+++KWGHMKLR VL
Sbjct: 363 KWPEFRVNLPGVGDVNINAAFFRKFDYSNSMVRLIGSVPGYHVGSNIRKWGHMKLRNVLD 422
Query: 380 ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVED 439
E F K F KSPL+YQFSSLGSLDEKWM+E + S+S+G S+D + LGIG+PLIVWPTVED
Sbjct: 423 EIMFNKQFCKSPLIYQFSSLGSLDEKWMSEFACSLSAGKSDDGSQLGIGKPLIVWPTVED 482
Query: 440 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAW 499
VRCS+EGYAAG+ IPSPQKNV++DFLKKYW++WKA H GR RAMPHIKTF RY+GQ +AW
Sbjct: 483 VRCSIEGYAAGSCIPSPQKNVERDFLKKYWSRWKADHVGRCRAMPHIKTFTRYSGQNIAW 542
Query: 500 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 559
FLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT I+ G
Sbjct: 543 FLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPQTLQSVPQFSCTEK--SRSIRDGV 600
Query: 560 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 619
I KTKLVTL W G + +V LPVPY+LPPQ Y ++DVPWSWD+RYTKKDV
Sbjct: 601 ALGKTI-KTKLVTLCWKGDEE---DPSIVRLPVPYQLPPQPYGTQDVPWSWDRRYTKKDV 656
Query: 620 YGQVWPRH 627
YG VWPR+
Sbjct: 657 YGSVWPRY 664
>gi|115472491|ref|NP_001059844.1| Os07g0530100 [Oryza sativa Japonica Group]
gi|33146648|dbj|BAC79984.1| putative tyrosyl-DNA phosphodiesterase [Oryza sativa Japonica
Group]
gi|113611380|dbj|BAF21758.1| Os07g0530100 [Oryza sativa Japonica Group]
gi|215697362|dbj|BAG91356.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222637174|gb|EEE67306.1| hypothetical protein OsJ_24533 [Oryza sativa Japonica Group]
Length = 671
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/677 (56%), Positives = 491/677 (72%), Gaps = 62/677 (9%)
Query: 2 SATKIGYLVPLD--NNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVPL+ N + S+ +P+ G NV+GR ++ V DKR+SRKH++L ASADG
Sbjct: 5 SRVRVGNLVPLNEGNASSSNGSVSSIPIYLGANVVGRNHLVVVDKRVSRKHLSLHASADG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSN 119
S VV+G NP++V+S QR+K+ + E V IA D++ELIPG +F KY+ + + K ++
Sbjct: 65 SIEAVVEGPNPIIVRSEGQRRKVCAQERVKIAHDDVLELIPGEYFVKYLNVGDNHKSSTS 124
Query: 120 DGATNGE----------LSSKKMRQ---------------QDEQDNENGKNSEEALCNFH 154
G+++ + + K+ RQ +E +G ++ L +
Sbjct: 125 MGSSDFKKGKRLCEDDTVVIKRNRQIMEDEALARSLQKSFAEESSTISGLGCDQMLSSLD 184
Query: 155 VS----------------RDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSN 198
+ +D L TFRL+RVQGLP+W NTS V+I+DVIQG++++A+LSN
Sbjct: 185 SAGFSERNNERIHSVDYLKDVLSLTFRLMRVQGLPSWTNTSSVTIQDVIQGEVLLAVLSN 244
Query: 199 YMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSK 258
YMVD++WLL ACP L K+ HVLVIHGE ++E +K+ KPANWILHKPPLPISFGTHHSK
Sbjct: 245 YMVDMEWLLTACPSLRKVRHVLVIHGEDGASVELLKKVKPANWILHKPPLPISFGTHHSK 304
Query: 259 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 318
AMLL+YP+G+R++VHTANLIHVDWNNK+QGLWMQDFP KD +++ FENDL+DYLS
Sbjct: 305 AMLLVYPQGIRVVVHTANLIHVDWNNKTQGLWMQDFPWKDAKDVNRSVSFENDLVDYLSA 364
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 378
+KWPEF NLP G+ IN +FF+KF++ S++VRLI SVPGYH G ++KKWGHMKLR+VL
Sbjct: 365 IKWPEFRVNLPVVGDVNINAAFFRKFDYKSSSVRLIGSVPGYHVGPNIKKWGHMKLRSVL 424
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
+ CTFE+ F K+P++YQFSSLGSLDEKWM+E + S+S+G S++ +PLGIG+PLIVWPTVE
Sbjct: 425 EGCTFEQQFCKAPMIYQFSSLGSLDEKWMSEFAFSLSAGKSDNGSPLGIGKPLIVWPTVE 484
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA 498
DVR S+EGYAAG+ IPSPQKNV+KDFL+KYWA+WKA H GR RAMPHIKTF RYNGQ +A
Sbjct: 485 DVRTSIEGYAAGSCIPSPQKNVEKDFLRKYWARWKADHVGRCRAMPHIKTFTRYNGQDIA 544
Query: 499 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT-------SNIV 551
WFLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT +N+
Sbjct: 545 WFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPKTHQSVPQFSCTGKNNSNLNNLA 604
Query: 552 PS-EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSW 610
P EI KTKLVTL W + S+E++ LPVPY+LPP+ Y +EDVPWSW
Sbjct: 605 PGKEI-----------KTKLVTLCWKSDEEKEQSTEIIRLPVPYQLPPKPYGTEDVPWSW 653
Query: 611 DKRYTKKDVYGQVWPRH 627
DKRYTKKDVYG VWPRH
Sbjct: 654 DKRYTKKDVYGSVWPRH 670
>gi|218199747|gb|EEC82174.1| hypothetical protein OsI_26284 [Oryza sativa Indica Group]
Length = 843
Score = 785 bits (2028), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/683 (56%), Positives = 488/683 (71%), Gaps = 64/683 (9%)
Query: 2 SATKIGYLVPLD--NNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVPL+ N + S+ +P+ G NV+GR ++ V DKR+SRKH++L ASADG
Sbjct: 5 SRVRVGNLVPLNEGNASSSNGSVSSIPIYLGANVVGRNHLVVVDKRVSRKHLSLHASADG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYV----------- 108
S VV+G NP++V+S QR+K+ + E V IA D++ELIPG +F KY+
Sbjct: 65 SIEAVVEGPNPIIVRSEGQRRKVCAQERVKIAHDDVLELIPGEYFVKYLNVGDNHKSSTS 124
Query: 109 ------------------------------TLSRS-QKRVSNDGATNGELSSKKMRQQDE 137
L+RS QK + + +T L +M +
Sbjct: 125 MGSSDFKKGKRLCEDDTVVIKRNRQIMEDEALARSLQKSFAEESSTISGLGCDQMLSSLD 184
Query: 138 QDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS 197
+ +N+E + + +D L TFRL+RVQGLP+W NTS V+I+DVIQG++++A+LS
Sbjct: 185 SAGSSERNNER-IHSVDYLKDVLSLTFRLMRVQGLPSWTNTSSVTIQDVIQGEVLLAVLS 243
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 257
NYMVD++WLL ACP L K+ HVLVIHGE ++E +K+ KPANWILHKPPLPISFGTHHS
Sbjct: 244 NYMVDMEWLLTACPSLRKVRHVLVIHGEDGASVELLKKVKPANWILHKPPLPISFGTHHS 303
Query: 258 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 317
KAMLL+YP+G+R++VHTANLIHVDWNNK+QGLWMQDFP KD +++ FENDL+DYLS
Sbjct: 304 KAMLLVYPQGIRVVVHTANLIHVDWNNKTQGLWMQDFPWKDAKDVNRIVSFENDLVDYLS 363
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV 377
+KWPEF NLP G+ IN +FF+KF++ S+ VRLI SVPGYH G ++KKWGHMKLR+V
Sbjct: 364 AIKWPEFRVNLPVVGDVNINAAFFRKFDYKSSLVRLIGSVPGYHVGPNIKKWGHMKLRSV 423
Query: 378 LQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTV 437
L+ CTFE+ F K+P++YQFSSLGSLDEKWM+E + S+S+G S++ +PLGIG+PLIVWPTV
Sbjct: 424 LEGCTFEQQFCKAPMIYQFSSLGSLDEKWMSEFACSLSAGKSDNGSPLGIGKPLIVWPTV 483
Query: 438 EDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKL 497
EDVR S+EGYAAG+ IPSPQKNV+KDFL+KYWA+WKA H GR RAMPHIKTF RYNGQ +
Sbjct: 484 EDVRTSIEGYAAGSCIPSPQKNVEKDFLRKYWARWKADHVGRCRAMPHIKTFTRYNGQDI 543
Query: 498 AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCT-------SNI 550
AWFLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LP + FSCT +N+
Sbjct: 544 AWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPKTHQSVPQFSCTGKNNSNLNNL 603
Query: 551 VPS-EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 609
P EI KTKLVTL W + S+E++ LPVPY+LPP+ Y +ED PWS
Sbjct: 604 APGKEI-----------KTKLVTLCWKSDEEKEQSTEIIRLPVPYQLPPKPYGTEDDPWS 652
Query: 610 WDKRYTKKDVYGQVWPRHFQLYA 632
WDKRYTKKDVYG VWPRH + A
Sbjct: 653 WDKRYTKKDVYGSVWPRHGGIQA 675
>gi|242050414|ref|XP_002462951.1| hypothetical protein SORBIDRAFT_02g035180 [Sorghum bicolor]
gi|241926328|gb|EER99472.1| hypothetical protein SORBIDRAFT_02g035180 [Sorghum bicolor]
Length = 689
Score = 714 bits (1844), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/519 (64%), Positives = 406/519 (78%), Gaps = 8/519 (1%)
Query: 109 TLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLR 168
LS+ + ++ A +G S +K+ D +G+N+E + +D LP TFRL+R
Sbjct: 178 VLSKQESFAEDNTAVSGMTSGQKISSHDSA-GSSGRNNERKH-SIGPLKDMLPLTFRLMR 235
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG 228
VQGLP+W NTS VSI+DVIQG++++A+LSNYMVDIDWLL ACP L K+PHVLV+HG+
Sbjct: 236 VQGLPSWTNTSSVSIQDVIQGEVLLAVLSNYMVDIDWLLTACPSLKKVPHVLVLHGQDGA 295
Query: 229 TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG 288
+LE MK+ KPANWILHKPPLPISFGTHHSKAMLL+YP+G+RI+VHTANLIHVDWN KSQG
Sbjct: 296 SLELMKKLKPANWILHKPPLPISFGTHHSKAMLLVYPQGIRIVVHTANLIHVDWNYKSQG 355
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSS 348
LWMQDFP KD N+++ + FENDL+DYLS LKWPEFS NLP G+ IN +FF+KF++ +
Sbjct: 356 LWMQDFPWKDTNDMNNKVPFENDLVDYLSALKWPEFSVNLPEVGDVNINAAFFRKFDYRN 415
Query: 349 AAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA 408
+ VRLI SVPGYH G +++KWGHMKLR VL E TF K F KSPL+YQFSSLGSLDEKWM+
Sbjct: 416 SMVRLIGSVPGYHVGPNIRKWGHMKLRNVLDEITFNKQFCKSPLIYQFSSLGSLDEKWMS 475
Query: 409 ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 468
E + S+S+G S+D + LGIG+PLIVWPTVEDVRCS+EGYAAG+ IPSPQKNV+KDFLKKY
Sbjct: 476 EFACSLSAGKSDDGSQLGIGKPLIVWPTVEDVRCSIEGYAAGSCIPSPQKNVEKDFLKKY 535
Query: 469 WAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYE 528
W++WKA H GR RAMPHIKTF RY+GQ +AWFLLTS+NLSKAAWGALQKNN+QLMIRSYE
Sbjct: 536 WSRWKADHVGRCRAMPHIKTFTRYSGQNIAWFLLTSSNLSKAAWGALQKNNTQLMIRSYE 595
Query: 529 LGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 588
LGVL LP + FSCT S + KTKLVTL W G + +V
Sbjct: 596 LGVLFLPQTLQSIPQFSCTEK---SRSSRDGVAIGRTIKTKLVTLCWKGDEE---DPSIV 649
Query: 589 YLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 627
LPVPY+LPPQ Y ++DVPWSWD+RYTKKDVYG VWPRH
Sbjct: 650 KLPVPYQLPPQPYGTQDVPWSWDRRYTKKDVYGSVWPRH 688
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 78/116 (67%), Gaps = 2/116 (1%)
Query: 2 SATKIGYLVPL--DNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADG 59
S ++G LVPL DN + S+ +P+ QG NV+GR ++ V DKR+SRKH++L AS DG
Sbjct: 5 SRVRLGTLVPLTKDNAGSSNGSVSNIPIFQGSNVVGRDHLVVVDKRISRKHLSLHASTDG 64
Query: 60 SASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQK 115
S +VV+G NP++V+S QR+K+ + IA GD++ELIPG +F KYV + K
Sbjct: 65 SIEVVVEGPNPIMVRSNGQRRKVCATGKAKIAHGDVLELIPGDYFVKYVDMGDEHK 120
>gi|168038405|ref|XP_001771691.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676998|gb|EDQ63474.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 598
Score = 664 bits (1714), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/622 (54%), Positives = 433/622 (69%), Gaps = 37/622 (5%)
Query: 25 LPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLVVDGTNPVVVKSGDQRKKLSS 84
+ L +GPN IGR ++ ++K++SRKH+ L S+D + L V G NPVV+KSG ++KL
Sbjct: 1 IALFEGPNSIGRDDLVSANKQVSRKHVVLKTSSDCTFELSVIGQNPVVIKSGSGKRKLLP 60
Query: 85 NEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATNGELSSKKMRQQDEQ---DNE 141
N I+ GDIIE +PG +K +TL T ELS + + DE D E
Sbjct: 61 NARALISAGDIIEFLPGKMPYK-LTLE----------PTEDELSPRAANKLDEAFGVDYE 109
Query: 142 NGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMV 201
G S STFRL++V+GLP WAN CV+IR VIQGD+ VA+LSNYMV
Sbjct: 110 AGCRSS--------------STFRLMQVKGLPQWANKGCVNIRGVIQGDVQVALLSNYMV 155
Query: 202 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
DIDWLL ACP L +P V++ HGES G+LE ++ KP +W+LHKPPL +S+GTHH+KAM
Sbjct: 156 DIDWLLEACPRLKTVPSVVIFHGESGGSLELLQARKPNSWLLHKPPLRLSYGTHHTKAMF 215
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD-QNNLSEECGFENDLIDYLSTLK 320
L+YP G+RI+VHTANLI++DWNNKSQGLW QDFP K+ S+ FENDL++YL L+
Sbjct: 216 LLYPTGIRIVVHTANLIYIDWNNKSQGLWTQDFPYKNVAAGESKPSPFENDLVEYLQALE 275
Query: 321 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE 380
W A + G ++ +FF+KF++SSA VRL+ASVPGYH G +L KWGH+KLRT+LQE
Sbjct: 276 WTGCIAIISGIGEVHVDAAFFRKFDYSSAMVRLVASVPGYHLGRNLTKWGHLKLRTILQE 335
Query: 381 CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDV 440
FE+ FK SP VYQFSSLGSLDEKWM E SS+ +G + LG G IVWPTVED+
Sbjct: 336 QHFEEHFKGSPCVYQFSSLGSLDEKWMGEFGSSIQAGSTFGNEQLGPGPVQIVWPTVEDI 395
Query: 441 RCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWF 500
R SLEGYAAG A+PSP KNV++ FL KYW +W+A HTGRSRA+PHIKTF RYN Q+LAWF
Sbjct: 396 RNSLEGYAAGGAVPSPLKNVERAFLSKYWYRWQADHTGRSRAIPHIKTFLRYNDQRLAWF 455
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG---FSCT--SNIVPSEI 555
LLTS+NLSKAAWG LQKN SQLMIRSYELGVL LPS + FSCT S+I+P E+
Sbjct: 456 LLTSSNLSKAAWGVLQKNGSQLMIRSYELGVLFLPSLVGNNSNVTPFSCTYSSSILPREL 515
Query: 556 KSGSTETS--QIQKTKLVTLTWHGSSDAGASSEV-VYLPVPYELPPQRYSSEDVPWSWDK 612
++ + Q++ TKLVTL+W S+ + ++ V LP+PY LPP +Y +D+PWSWD+
Sbjct: 516 QNREDDGGKRQLRHTKLVTLSWKSSNHEKSDMDIFVRLPIPYALPPVKYDPKDIPWSWDR 575
Query: 613 RYTKKDVYGQVWPRHFQLYAFQ 634
+Y + D++G+VWPR + Y Q
Sbjct: 576 QYREPDMFGEVWPRQVRRYTMQ 597
>gi|147781461|emb|CAN76118.1| hypothetical protein VITISV_033882 [Vitis vinifera]
Length = 592
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/440 (71%), Positives = 352/440 (80%), Gaps = 47/440 (10%)
Query: 148 EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL 207
EA+ +F + +D LP T+RLLRV+ LPAWANTS VSIRDVIQGD+++A+LSNYMVDIDWLL
Sbjct: 137 EAIRHFSIPKDNLPLTYRLLRVKDLPAWANTSSVSIRDVIQGDVLIAVLSNYMVDIDWLL 196
Query: 208 PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG 267
+CP LAKIPHVLVIHGE DGTL+HMK+NKP NWILHKPPLPISFGTHHSKAMLL+YPRG
Sbjct: 197 SSCPTLAKIPHVLVIHGEGDGTLDHMKKNKPPNWILHKPPLPISFGTHHSKAMLLVYPRG 256
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSAN 327
VR+IVHTANLI+VDWNNKSQGLWMQDFP K Q LS+ C FENDLIDYLS LKWPEF+AN
Sbjct: 257 VRVIVHTANLIYVDWNNKSQGLWMQDFPWKVQKELSKGCAFENDLIDYLSVLKWPEFTAN 316
Query: 328 LPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF 387
LPA G+F IN SFFKKF++S+A VRLIASVPGYHTGS+LKKWGHMKL +VLQEC F+K F
Sbjct: 317 LPALGSFNINSSFFKKFDYSNAVVRLIASVPGYHTGSNLKKWGHMKLXSVLQECIFDKEF 376
Query: 388 KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLE-- 445
+KSPL YQFSSLGSLDEKWM EL+SSMSSG +DKTPLG+G+PLI+WPTVEDVRCSLE
Sbjct: 377 QKSPLAYQFSSLGSLDEKWMTELASSMSSGSCDDKTPLGLGKPLIIWPTVEDVRCSLEAH 436
Query: 446 ---------------------------GYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG 478
GYAAGNAIPSPQKNV+K+FLKKYWAKWKA+HTG
Sbjct: 437 ITCWIPGYLLGFYMCKFALHQSYYIVQGYAAGNAIPSPQKNVEKEFLKKYWAKWKATHTG 496
Query: 479 RSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAK 538
R WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL LPS
Sbjct: 497 R------------------CWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLFLPSPI 538
Query: 539 RHGCGFSCTSNIVPSEIKSG 558
G GFSCT N PS++ G
Sbjct: 539 NRGQGFSCTDNGSPSKMFPG 558
>gi|357504797|ref|XP_003622687.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
gi|355497702|gb|AES78905.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
Length = 849
Score = 646 bits (1666), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/449 (67%), Positives = 371/449 (82%), Gaps = 4/449 (0%)
Query: 1 MSATKIGYLVPLDNNL--REDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASAD 58
S ++IGYL+PL+ N +E S PKL +S G N+IGR N+PV+DKRLSRKH+T+TASAD
Sbjct: 3 FSHSQIGYLIPLNPNSEEKEKASTPKLTISDGTNIIGRNNVPVNDKRLSRKHLTITASAD 62
Query: 59 GSASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTLSRSQKRVS 118
G+A+L V+GTNPVVV SG++R+KL+S + +I DGD+IELIPGH+ FKY RS K
Sbjct: 63 GTANLHVEGTNPVVVNSGNKRRKLNSKQTAAIFDGDVIELIPGHYLFKYQVSQRSPKVAD 122
Query: 119 NDGATNGELSSKKMRQQDEQDNENG--KNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWA 176
N G+ S+ + + ++G ++ EE + +F V+ D++P TFRLLRVQGLP WA
Sbjct: 123 NKHHERGKNSATQRHDKIAVTQKHGSSRSCEEPIRDFRVADDQIPCTFRLLRVQGLPPWA 182
Query: 177 NTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRN 236
NTSCVSI DVIQGDI+VA+LSNYMVD+DWL+PACP L+K+PHVLV+HGESD + +KR+
Sbjct: 183 NTSCVSISDVIQGDILVAVLSNYMVDVDWLVPACPALSKVPHVLVLHGESDERVACIKRS 242
Query: 237 KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
KP NWILHKPPLPISFGTHHSKAM L+YPRGVR+I+HTANLI+VDWNNKSQGLWMQDFP
Sbjct: 243 KPKNWILHKPPLPISFGTHHSKAMFLVYPRGVRVIIHTANLIYVDWNNKSQGLWMQDFPW 302
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
KDQN+ S+ FENDL++YLS LKWPEFS NLP+ GNF I PSFFKKF++S A VRLIAS
Sbjct: 303 KDQNSPSKGSRFENDLVEYLSALKWPEFSVNLPSLGNFSICPSFFKKFDYSDAMVRLIAS 362
Query: 357 VPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 416
VPGYH+G+ LKKWGHMKLR+VLQECTF+K FKKSPLVYQFSSLGSLDEKWM EL+SSMS+
Sbjct: 363 VPGYHSGNGLKKWGHMKLRSVLQECTFDKEFKKSPLVYQFSSLGSLDEKWMVELASSMSA 422
Query: 417 GFSEDKTPLGIGEPLIVWPTVEDVRCSLE 445
G SEDK PLG+GEP I+WPTVE+VRCS+E
Sbjct: 423 GLSEDKVPLGMGEPQIIWPTVEEVRCSIE 451
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 133/175 (76%), Positives = 147/175 (84%), Gaps = 1/175 (0%)
Query: 453 IPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAW 512
IPSP KNV+K FLKKYWAKWKA+HTGR+RAMPHIKTFARYN Q LAWF LTS+NLSKAAW
Sbjct: 633 IPSPMKNVEKAFLKKYWAKWKANHTGRTRAMPHIKTFARYNNQNLAWFCLTSSNLSKAAW 692
Query: 513 GALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVT 572
GALQKNNSQLMIRSYELGVL LPS + GCGFSCTSN+ S+ KS + ETS+++KTKLVT
Sbjct: 693 GALQKNNSQLMIRSYELGVLFLPSLLKPGCGFSCTSNVKQSKDKSPAQETSKMKKTKLVT 752
Query: 573 LTWHGSSDAGASSEVVY-LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 626
LT +SSEV+ LPVPYELPP YSSEDVPWSWD+RY KKD YGQVWPR
Sbjct: 753 LTAPTRDTTHSSSEVIIQLPVPYELPPLPYSSEDVPWSWDRRYFKKDDYGQVWPR 807
>gi|302774643|ref|XP_002970738.1| hypothetical protein SELMODRAFT_11047 [Selaginella moellendorffii]
gi|300161449|gb|EFJ28064.1| hypothetical protein SELMODRAFT_11047 [Selaginella moellendorffii]
Length = 478
Score = 570 bits (1469), Expect = e-160, Method: Compositional matrix adjust.
Identities = 285/476 (59%), Positives = 356/476 (74%), Gaps = 8/476 (1%)
Query: 153 FHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPV 212
H +R P F+LLRVQGLP WAN CV I DVI+GD++VAILSNYMVDI+WLL ACP+
Sbjct: 8 LHSARS--PCGFQLLRVQGLPDWANAGCVRISDVIKGDVLVAILSNYMVDIEWLLSACPL 65
Query: 213 LAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIV 272
L IP V++IHGES+ + ++ KP+NW+L KP L IS+GTHHSKAMLL+YP GVR++V
Sbjct: 66 LRSIPQVVMIHGESN--VSQLQSVKPSNWLLFKPRLWISYGTHHSKAMLLVYPTGVRVVV 123
Query: 273 HTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG 332
HTANLI++DWNNK+QGLWMQDFP K ++ FENDL+DYL+ L+W + ++ HG
Sbjct: 124 HTANLINIDWNNKTQGLWMQDFPFKSMTGITTASDFENDLVDYLTALEWSGCTVDVQHHG 183
Query: 333 NFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPL 392
KIN +F+ F+FS+AAVRLI S+PGYH+G L KWGHMKLR++L+E F+K F+ SPL
Sbjct: 184 QMKINAIYFRNFDFSNAAVRLIGSIPGYHSGPQLNKWGHMKLRSILKEEKFDKKFQNSPL 243
Query: 393 VYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA 452
VYQFSSLGSLDEKWM E SSS+S G + D LG+GE I++PTVEDVR SLEGY AG A
Sbjct: 244 VYQFSSLGSLDEKWMEEFSSSLSEGSTLDGRRLGLGEAQIIFPTVEDVRQSLEGYRAGAA 303
Query: 453 IPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAW 512
IPSP KNV+K LKKYW++W+A HTGRSRAMPHIKTF R+ LAW LTS+NLSKAAW
Sbjct: 304 IPSPAKNVEKPLLKKYWSRWQAEHTGRSRAMPHIKTFVRFRENALAWVCLTSSNLSKAAW 363
Query: 513 GALQKNNSQLMIRSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLV 571
GALQKN +QLMIRSYELGV+ LPS + +SCT ++ P ++ + ET + KL
Sbjct: 364 GALQKNKTQLMIRSYELGVVFLPSMLSKFKNRYSCTEDL-PLINENEACETGEAPNVKLY 422
Query: 572 TLTWHGSSD--AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 625
TL S D +++++ LP+PY LPP RYSS+D PW WDK+Y DVYG+ WP
Sbjct: 423 TLAATESVDEEEDTNAKIIRLPLPYALPPPRYSSQDEPWKWDKQYLHPDVYGKRWP 478
>gi|302771966|ref|XP_002969401.1| hypothetical protein SELMODRAFT_170833 [Selaginella moellendorffii]
gi|300162877|gb|EFJ29489.1| hypothetical protein SELMODRAFT_170833 [Selaginella moellendorffii]
Length = 491
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 284/469 (60%), Positives = 355/469 (75%), Gaps = 9/469 (1%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
P F+LLRVQGLP WAN CV I DVI+GD++VAILSNYMVDI+WLL ACP+L IP V+
Sbjct: 27 PCGFQLLRVQGLPDWANAGCVRISDVIKGDVLVAILSNYMVDIEWLLSACPLLRSIPQVV 86
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
+IHGES+ + ++ KP+NW+L KP L IS+GTHHSKAMLL+YP GVR++VHTANLI++
Sbjct: 87 MIHGESN--VSQLQSVKPSNWLLFKPRLWISYGTHHSKAMLLVYPTGVRVVVHTANLINI 144
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF 340
DWNNK+QGLWMQDFPLK ++ FENDL+DYL+ L+W + ++ HG KIN S+
Sbjct: 145 DWNNKTQGLWMQDFPLKSMTGITTASDFENDLVDYLTALEWSGCTVDVQHHGQMKINASY 204
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
F+ F+FS+AAVRLI S+PGYH+G L KWGHMKLR++L+E F+K F+ SPLVYQFSSLG
Sbjct: 205 FRNFDFSNAAVRLIGSIPGYHSGPQLNKWGHMKLRSILKEEKFDKKFQNSPLVYQFSSLG 264
Query: 401 SLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV 460
SLDEKWM E SSS+S G + D LG+GE I++PTVEDVR SLEGY AG AIPSP KNV
Sbjct: 265 SLDEKWMEEFSSSLSEGSTLDGRRLGLGEAQIIFPTVEDVRQSLEGYRAGAAIPSPAKNV 324
Query: 461 DKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNS 520
+K LKKYW++W+A HTGRSRAMPHIKTF R+ LAW LTS+NLSKAAWGALQKN +
Sbjct: 325 EKPLLKKYWSRWQAEHTGRSRAMPHIKTFVRFRENALAWVCLTSSNLSKAAWGALQKNKT 384
Query: 521 QLMIRSYELGVLILPSA-KRHGCGFSCTSNI-VPSEIKSGSTETSQIQKTKLVTLTWHGS 578
QLMIRSYELGV+ LPS + +SCT ++ + +E ++ T + KL TL S
Sbjct: 385 QLMIRSYELGVVFLPSMLSKFKNRYSCTEDLPLINENEACKTGAPNV---KLYTLAATES 441
Query: 579 SD--AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 625
D +++++ LP+PY LPP RYSS+D PW WDK+Y DVYG+ WP
Sbjct: 442 MDEEEDTNAKIIRLPLPYALPPPRYSSQDEPWKWDKQYLHPDVYGKRWP 490
>gi|302833870|ref|XP_002948498.1| hypothetical protein VOLCADRAFT_88920 [Volvox carteri f.
nagariensis]
gi|300266185|gb|EFJ50373.1| hypothetical protein VOLCADRAFT_88920 [Volvox carteri f.
nagariensis]
Length = 1521
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 182/422 (43%), Positives = 242/422 (57%), Gaps = 57/422 (13%)
Query: 162 STFRLLRVQGLPAWANTSC--VSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHV 219
S LLRV+GL NT C V +R V+ G + +A++SNYM+D+ WLL CP LAK
Sbjct: 122 SPVHLLRVRGLSPRYNTGCLGVDLRHVVSGPLQLALVSNYMIDMGWLLSCCPDLAKARQF 181
Query: 220 LVIHGESDGTLEHMKRNKP----ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
V+HGE M++ A+ LH+PPLPI +GTHHSKA LL Y G+R+I+HTA
Sbjct: 182 FVVHGEGPDAEPEMRQQAAEAGAAHVRLHRPPLPIMYGTHHSKAFLLAYSTGLRLIIHTA 241
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-FENDLIDYLSTLKWPEFSANLPAHGNF 334
N ++ D N+K+QGLW+QDFP KD + FE DL+ Y L P AN
Sbjct: 242 NCVYPDCNDKTQGLWVQDFPRKDTVAAAAPVSTFEQDLVAYFRALALPPAMAN------- 294
Query: 335 KINPSF--FKKFNFSSAAVRLIASVPGYHTGSS-LKKWGHMKLRTVLQECTFEKGFKKSP 391
P F +FS A L+ASVPGYH G++ ++ +GHM+LR +L++ F
Sbjct: 295 ---PLFEAIAMHDFSFARGTLVASVPGYHRGTAAVQSYGHMRLRRLLEQVPLPSCFAAEG 351
Query: 392 ----------------LVYQFSSLGSLDEKWMA-ELSSSMSS------------------ 416
L+ Q SS+GS D+ W+ E+ +S+++
Sbjct: 352 SSCGTASSSSAVPPEGLIIQCSSMGSFDQAWLVDEMGASLAACRRQPPPPPPPPRPLAAA 411
Query: 417 --GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA 474
G +VWPTVE+VR S+EG+ AG +IP P +NV K F+ +Y+A+W
Sbjct: 412 PPPRPSGPPGCGPLPLAVVWPTVEEVRNSIEGWNAGRSIPGPSRNVSKPFMGRYYARWGG 471
Query: 475 SHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 534
GR RAMPHIKT+ RY GQ+LAWFL+TS NLSKAAWG LQKN SQLMIRSYELGVL+
Sbjct: 472 EAVGRQRAMPHIKTYTRYRGQQLAWFLVTSHNLSKAAWGELQKNGSQLMIRSYELGVLVT 531
Query: 535 PS 536
P+
Sbjct: 532 PA 533
>gi|384252305|gb|EIE25781.1| tyrosyl-DNA phosphodiesterase [Coccomyxa subellipsoidea C-169]
Length = 502
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 190/493 (38%), Positives = 281/493 (56%), Gaps = 43/493 (8%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVS--IRDVIQGDIIVAIL-SNYMVDIDWLLPACPVLAKI 216
+P LLRV+GLP + + ++D++ G + +L SN+M+D+ W + A P +
Sbjct: 2 IPPVASLLRVRGLPEQFSRGALGTQLKDLLSGGPMRWLLISNFMIDMRWFVSAAPSVLDA 61
Query: 217 PHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRII 271
V V+HGE ++ + +P W++H+ P+ +G HHSKA L+ + RG+R++
Sbjct: 62 DRVTVVHGEKSNPTSVSWMQQIAAGRP--WVIHQARCPLQYGVHHSKAFLVQFDRGLRVV 119
Query: 272 VHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLP 329
VHTANLIH D N K+QGLW QDFP KD+ + + FE L DY++ L+ P A
Sbjct: 120 VHTANLIHQDCNCKTQGLWYQDFPRKDERSPQDNASRLFETTLSDYIAALRLPAREAQ-- 177
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKK 389
H I + +FSSA LI SVPGYH G++ +K+GHM +R++L F+ F++
Sbjct: 178 -HAQQVI-----AQHDFSSARAHLIPSVPGYHQGAAKQKYGHMLVRSLLARQRFDPVFRR 231
Query: 390 SPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL-------IVWPTVEDVRC 442
SP+V QFSSLGS+ W++E S+++G D P G L +VWPTVE+V+
Sbjct: 232 SPIVAQFSSLGSITGAWLSEFRESLAAGDCWDSNPSGSAGRLGPAADFRVVWPTVEEVKN 291
Query: 443 SLEGYAAGNAIPSPQKNVDKD-------FLKKYWAKWKAS--HTGRSRAMPHIKTFARYN 493
S+EG+ AG +IP NV K L+ +W ++ + GR AMPHIK++ R++
Sbjct: 292 SVEGWFAGCSIPGTHANVLKTDKGLSTPILQPFWCRFDGAPATAGRQHAMPHIKSYLRHS 351
Query: 494 GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA----KRH-GCGFSCTS 548
GQ+LA+ +LTS NLSKAAWG LQKNN+QL I YELGVL+LPS +RH GFSCT+
Sbjct: 352 GQRLAYIVLTSHNLSKAAWGVLQKNNTQLHIMHYELGVLLLPSLEESYRRHRHFGFSCTA 411
Query: 549 NIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 608
S + + + S+++ S +E + + +PY+LPP RY +D PW
Sbjct: 412 PA--SHKPAAAAQPSRVEFWAADGAAAGSSEALSTGAEKLEILLPYQLPPVRYGPQDQPW 469
Query: 609 SWDKRYTKKDVYG 621
+ D G
Sbjct: 470 MTGVEFPGLDSQG 482
>gi|303279543|ref|XP_003059064.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458900|gb|EEH56196.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 520
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 192/531 (36%), Positives = 279/531 (52%), Gaps = 80/531 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
P FRL +G+ A AN CVSI DV++G + AI+ N+ VD+DW L ACP L V+
Sbjct: 1 PPAFRLWSTEGVTADANAGCVSISDVVRGSVRWAIVMNFTVDLDWFLAACPALRTARRVI 60
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
+++G + + P +W HKPP P +GTHH+KA +L Y GVR+++HTANL H
Sbjct: 61 LMYGNMHPGVAEI----PKHWSTHKPPCP-QYGTHHTKAFILAYDAGVRVVIHTANLTHH 115
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF 340
D+N Q +W QDFPLK +++ FENDL+ Y+S L+W S + +++P
Sbjct: 116 DFNKSCQAVWYQDFPLKRESS-PPGSAFENDLVRYVSRLQWSGESVD-----GERVSPEA 169
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
++++FS A V+LIASVPG H G L++WGHM +RT L+ T + FK S ++ Q++S G
Sbjct: 170 LRRYDFSGAGVKLIASVPGRHAGEELRRWGHMAVRTALERETHDDAFKGSSVLCQYTSTG 229
Query: 401 SLDEKWMAE------------LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYA 448
SL +KW+ E S G + + LG GE ++WPTVE++R GYA
Sbjct: 230 SLPKKWLDEEFRDSLCAGACAGGGGGSVGGNANDRSLGPGEMQLLWPTVEEIRTCDVGYA 289
Query: 449 AGNAIPSPQKNVDKDFLKKYWAKWK---------ASHTGRSRAMPHIKTFARY------- 492
AG +IP KNV + L + + KW A GR + MPHIKTF+RY
Sbjct: 290 AGGSIPGNGKNVRRPHLTEKFHKWAKPNDDDDDDAHPMGRRKHMPHIKTFSRYYDALTPY 349
Query: 493 ----------NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS------ 536
G K A+ ++ S NLS AAWG L+ SQ+ + SYELGV+ LPS
Sbjct: 350 QKKRGGGGGVAGAKFAYVIVCSHNLSGAAWGKLEHGGSQIHVYSYELGVMFLPSLIGART 409
Query: 537 -------AKRHGCGFSCTSNIVP------SEIKSGSTETSQIQKTKLVTLTWHGSSDA-- 581
+ F C + + P + + ++E + + L G++ A
Sbjct: 410 AKPFSALSATEADPFRCLAAVRPRATTTATATATATSEGAVVLTHALTLARPPGAATATT 469
Query: 582 --GASSEVVYLPVPYELPPQRYS--------SEDVPWSWDKRYTKKDVYGQ 622
G S+ + P+PY +PP RY+ D PW WD+RY D +G+
Sbjct: 470 ASGPSATLALCPLPYNVPPLRYNLDDNAPLLERDEPWVWDQRYDVADEWGR 520
>gi|255087474|ref|XP_002505660.1| predicted protein [Micromonas sp. RCC299]
gi|226520930|gb|ACO66918.1| predicted protein [Micromonas sp. RCC299]
Length = 536
Score = 321 bits (822), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 189/509 (37%), Positives = 266/509 (52%), Gaps = 50/509 (9%)
Query: 161 PSTFRLLRVQGLPAWANTS----CVSIRDVIQGDIIVAILSNYMVDIDWLLP--ACPVLA 214
P FRLL NTS CVS+RD++ G + ++ N+M+D+ WLL CP L
Sbjct: 20 PPLFRLLTTDPADLNPNTSGNAGCVSLRDIVSGPVRWCVVMNFMIDLPWLLSPDGCPELL 79
Query: 215 KIPHVLVIHGESDGTL----EHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRI 270
+IP V+ I E E ++ +W + PP P FGTHH+K +L+Y GVR+
Sbjct: 80 RIPKVVWIGDERSSPTPRDPEFLRLKGERDWTVVNPPCP-KFGTHHTKCFILVYDTGVRV 138
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLP- 329
VHTANLIH D ++ W QDFP K +L FE DL YL+TL W + + LP
Sbjct: 139 CVHTANLIHGDVRKRTNAAWCQDFPNKSAAHLGRSSEFERDLGRYLATLGWKDETCALPG 198
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKK 389
A G+ + PS +F+FS A +LIASVPG GS++ +GH +R L TF FK+
Sbjct: 199 AGGDVVVGPSAMSRFDFSGAGAKLIASVPGRWVGSAMMNYGHTSVRHALAGMTFPGVFKR 258
Query: 390 SPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP--------LGIGEPLIVWPTVEDVR 441
+P+V QF+S+G+ EKWM E++ S +G +E LG G+ +VWPT+ +VR
Sbjct: 259 APVVCQFTSVGATTEKWMGEMARSFGAGATETDDANEWPGGPCLGDGDLRLVWPTMGEVR 318
Query: 442 CSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA------------------SHTGRSRAM 483
S GY G +IP + ++ +++ +W+ TGR R M
Sbjct: 319 GSNLGYVTGGSIPGATDKISREHVRRRLHRWRGDVGATRGTKLLDHPPASTDPTGRGRVM 378
Query: 484 PHIKTFARY---NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA--- 537
PH+KTFARY LAW ++ S NLS AAWG L+KN +Q+ I SYELGVL+ P +
Sbjct: 379 PHVKTFARYAPNAPHHLAWVIVGSHNLSGAAWGRLEKNETQIAILSYELGVLLSPRSIGK 438
Query: 538 KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA--GASSE-VVYLPVPY 594
R F+CT V G + ++ + G D+ G S E V + P+PY
Sbjct: 439 TRVAAPFTCTPGAVSHR---GEVVPRCLGGVRISAASDDGPGDSPPGDSREFVAFAPLPY 495
Query: 595 ELPPQRYSSEDVPWSWDKRYTKKDVYGQV 623
+PP Y+ D PW+ D D YG+V
Sbjct: 496 RVPPVPYAPSDAPWAVDAWDETPDKYGRV 524
>gi|332223510|ref|XP_003260916.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Nomascus
leucogenys]
Length = 608
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 207/575 (36%), Positives = 303/575 (52%), Gaps = 72/575 (12%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K SS E + S +D ++ +P K V SNDGA +G +
Sbjct: 75 KRQKSSSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISASNDGAAQRTENHGPPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G E + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSG----EVQDIWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKTPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTP 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DIIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPDAESWPVVGQFSSIGSLGGDESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E+KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKENKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|410962801|ref|XP_003987957.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Felis catus]
Length = 608
Score = 298 bits (763), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 186/484 (38%), Positives = 265/484 (54%), Gaps = 61/484 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 163 PFRFYLTRVSGIEPKDNSGALHIKDILSPLFGTLLSSAQFNYCFDVDWLVKQYPPQFRKK 222
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 223 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 282
Query: 276 NLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ + Q + F+ DLI YL P +
Sbjct: 283 NLIHADWHQKTQGIWLSPLYPRVVHGTQRSGDSTTHFKADLISYLMAYNAPSLKEWI--- 339
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKK 389
++ + S V LI S PG GS WGH +LR +L+E + KG +
Sbjct: 340 -------DVIQEHDLSETNVYLIGSTPGRFQGSQKDHWGHFRLRKLLKEHASSIPKG-ES 391
Query: 390 SPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GS+ + KW+ +E S+ + E +TP PL +++P+VE+VR SL
Sbjct: 392 WPIVGQFSSIGSMGADESKWLCSEFKESLVTQGKESRTPGKSAAPLHLIYPSVENVRTSL 451
Query: 445 EGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFL 501
EGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL
Sbjct: 452 EGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRLSPDFSQIAWFL 511
Query: 502 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + SGS E
Sbjct: 512 VTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFRVKQKFFSGSKE 565
Query: 562 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 620
+ PVPY+LPP+ Y S+D PW W+ YTK D +
Sbjct: 566 PTS------------------------SFPVPYDLPPELYGSKDRPWIWNIPYTKAPDTH 601
Query: 621 GQVW 624
G +W
Sbjct: 602 GNMW 605
>gi|296215712|ref|XP_002754236.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Callithrix
jacchus]
Length = 606
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 191/483 (39%), Positives = 266/483 (55%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 161 PYQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPREFRKK 220
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 221 PILLVHGDKREAKAHLHAEAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 280
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P + A
Sbjct: 281 NLIHADWHQKTQGVWLSPLYPRIVDGTHKSGESITHFKADLISYLMAYNAPSLKEWIDA- 339
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
+ + S V LI S PG GS WGH +LR VL++ ++S
Sbjct: 340 ---------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKVLKDHASSIPNEESW 390
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 391 PVVGQFSSIGSLGADESKWLCSEFKESMLALGKESKTPGKSSVPLYLIYPSVENVRTSLE 450
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + K+AWFL+
Sbjct: 451 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLI 510
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + +GS E
Sbjct: 511 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFAGSQEP 564
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 565 ------------------------MTTFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 600
Query: 622 QVW 624
+W
Sbjct: 601 NMW 603
>gi|403298195|ref|XP_003939917.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403298197|ref|XP_003939918.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 605
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 191/483 (39%), Positives = 266/483 (55%), Gaps = 60/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 161 PYQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPREFRKK 220
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 221 PVLLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 280
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 281 NLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISYLMAYNAPSLKEWI--- 337
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K + S V LI S PG GS WGH +LR +L++ +S
Sbjct: 338 -------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKDHASSIPNAESW 390
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 391 PVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 450
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRSRAMPHIKT+ R + ++AWFL+
Sbjct: 451 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSRAMPHIKTYMRPSPDFSRIAWFLI 510
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + +GS E
Sbjct: 511 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFAGSQEP 564
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 565 -------------------------MPFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 599
Query: 622 QVW 624
+W
Sbjct: 600 NMW 602
>gi|297695684|ref|XP_002825063.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Pongo abelii]
gi|297695686|ref|XP_002825064.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Pongo abelii]
Length = 608
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 204/575 (35%), Positives = 302/575 (52%), Gaps = 72/575 (12%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDGA +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGAAQRTENHGPPT 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSRALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPQIVDGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPDAESWPVVGQFSSIGSLGSDESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E+KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKENKTPGKTSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LPSA F S V + GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFVGSQEP------------------------MATF 570
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|397525717|ref|XP_003832802.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Pan paniscus]
gi|397525719|ref|XP_003832803.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Pan paniscus]
Length = 608
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 203/575 (35%), Positives = 301/575 (52%), Gaps = 72/575 (12%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQRQAEKVVIKKEKDISAPNDGTAQRTENHGPPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFEESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|350539189|ref|NP_001233557.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|343962149|dbj|BAK62662.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410225564|gb|JAA10001.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410265878|gb|JAA20905.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
gi|410301400|gb|JAA29300.1| tyrosyl-DNA phosphodiesterase 1 [Pan troglodytes]
Length = 608
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 203/575 (35%), Positives = 301/575 (52%), Gaps = 72/575 (12%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQRRAEKVVIKKEKDISAPNDGTAQRTENHGPPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|351706738|gb|EHB09657.1| Tyrosyl-DNA phosphodiesterase 1 [Heterocephalus glaber]
Length = 655
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 194/507 (38%), Positives = 278/507 (54%), Gaps = 60/507 (11%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 163 PFQFYLTRVSGIKPKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 222
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP AN L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 223 PILLVHGDKREAKAHLHAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 282
Query: 276 NLIHVDWNNKSQGLWMQD-FPLKD---QNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
N+I DW+ K+QG+W+ +P D Q + + F+ DLI YL+ P +
Sbjct: 283 NIIREDWHQKTQGIWLSPLYPRIDHGTQGSGESKTHFKADLISYLTAYNAPPLQEWI--- 339
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKS 390
++ + S V LI S PG GS WGH +LR +L+E T +
Sbjct: 340 -------DTIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHGTSIPKAECW 392
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
PLV QFSS+GSL + KW+ +E S+ + +E+KTP PL +++P+VE+VR SLE
Sbjct: 393 PLVGQFSSIGSLGADESKWLCSEFKESLLTQGAENKTPGKSSIPLHLIYPSVENVRTSLE 452
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R N ++AWFL+
Sbjct: 453 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYMRLSPNSSRIAWFLV 512
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWG L+KN +QLMIRSYELGVL LPSA F S V + SGS E
Sbjct: 513 TSANLSKAAWGVLEKNGTQLMIRSYELGVLFLPSA------FGLASFKVKQKFSSGSQEL 566
Query: 563 S-----------QIQKTKLVTLTWHGSSDAGASSEVVY-------------LPVPYELPP 598
+ ++ +K T G+ G +S V PVPY+LPP
Sbjct: 567 APPFPVPYDLPPELYGSKGETWA-QGTMGGGLASFKVKQKFSSGSQELAPPFPVPYDLPP 625
Query: 599 QRYSSEDVPWSWDKRYTKK-DVYGQVW 624
+ Y S+D PW W+ Y K D +G +W
Sbjct: 626 ELYGSKDRPWIWNIPYVKAPDRHGNMW 652
>gi|20127586|ref|NP_060789.2| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|57242805|ref|NP_001008744.1| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|37999797|sp|Q9NUW8.2|TYDP1_HUMAN RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1
gi|15930062|gb|AAH15474.1| Tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|85725382|gb|ABC79301.1| tyrosyl-DNA phosphodiesterase 1 [Homo sapiens]
gi|119601820|gb|EAW81414.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
gi|119601821|gb|EAW81415.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
gi|119601822|gb|EAW81416.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Homo sapiens]
Length = 608
Score = 295 bits (755), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 203/575 (35%), Positives = 301/575 (52%), Gaps = 72/575 (12%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|7023536|dbj|BAA91997.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 295 bits (755), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 203/575 (35%), Positives = 301/575 (52%), Gaps = 72/575 (12%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNPESWPVVGQFSSVGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|189054943|dbj|BAG37927.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 202/575 (35%), Positives = 301/575 (52%), Gaps = 72/575 (12%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E +M
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKENM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LPSA F S V + +GS E +
Sbjct: 541 VLFLPSA------FGLDSFKVKQKFFAGSQEP------------------------MATF 570
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 571 PVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|73964387|ref|XP_547950.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Canis lupus familiaris]
Length = 609
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 187/484 (38%), Positives = 266/484 (54%), Gaps = 61/484 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKK 223
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 224 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 283
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + + S E F+ DLI YL +
Sbjct: 284 NLIHADWHQKTQGIWLSPLYPRMAQATHRSGESATHFKADLISYLMAYNAAPLKEWIDT- 342
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKK 389
+ + S V LI S PG GS WGH +LR +L+E + KG +
Sbjct: 343 ---------IHEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLRKLLREHASSITKG-ES 392
Query: 390 SPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GS+ D KW+ +E S+ + E +TP PL +++P+VE+VR SL
Sbjct: 393 WPIVGQFSSIGSMGADDSKWLCSEFKESLVTLGKESRTPGKSAVPLHLIYPSVENVRTSL 452
Query: 445 EGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFL 501
EGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL
Sbjct: 453 EGYPAGGSLPYSIQTAEKQNWLHSYFHKWMADTSGRSNAMPHIKTYMRSSPDFSQIAWFL 512
Query: 502 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + SGS E
Sbjct: 513 VTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGSKE 566
Query: 562 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 620
+ PVPY+LPP+ Y ++D PW W+ YTK D +
Sbjct: 567 PA------------------------AAFPVPYDLPPELYGNKDRPWIWNIPYTKAPDTH 602
Query: 621 GQVW 624
G +W
Sbjct: 603 GNMW 606
>gi|417403294|gb|JAA48458.1| Putative tyrosyl-dna phosphodiesterase [Desmodus rotundus]
Length = 611
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 186/485 (38%), Positives = 266/485 (54%), Gaps = 63/485 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N++ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 166 PFQFYLTRVSGIKPKYNSAALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKK 225
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HTA
Sbjct: 226 PILLVHGDKREAKAHLHAEAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTA 285
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQ----NNLSEECG--FENDLIDYLSTLKWPEFSANLP 329
NLI DW+ K+QG+W+ PL + ++S E F+ DLI YL+ P + +
Sbjct: 286 NLICADWHQKTQGIWLS--PLYPRVACGTHMSGESATHFKADLISYLTAYNAPPLNEWI- 342
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFK 388
+ + S V LI S PG GS WGH +LR +L+E + G +
Sbjct: 343 ---------DIIRDHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASSTPGAE 393
Query: 389 KSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GS+ KW+ +E ++++ E + P PL +++P+VE+VR S
Sbjct: 394 AWPVVGQFSSIGSMGADASKWLCSEFKETLATLGKESRAPGKGVTPLHLIYPSVENVRTS 453
Query: 444 LEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ--KLAWF 500
LEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWF
Sbjct: 454 LEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSHAMPHIKTYMRPSPDFGRIAWF 513
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
L+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V SGS
Sbjct: 514 LVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFQVKQRFFSGSQ 567
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 619
E + PVPY+LPP+ Y S+D PW W+ YTK D
Sbjct: 568 EPA------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYTKAPDT 603
Query: 620 YGQVW 624
+G +W
Sbjct: 604 HGNMW 608
>gi|49258603|pdb|1QZQ|A Chain A, Human Tyrosyl Dna Phosphodiesterase
gi|49258604|pdb|1QZQ|B Chain B, Human Tyrosyl Dna Phosphodiesterase
Length = 483
Score = 292 bits (747), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 189/483 (39%), Positives = 265/483 (54%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 38 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 97
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 98 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 157
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 158 NLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLISYLMAYNAPSLKEWI--- 214
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K + S V LI S PG GS WGH +L+ +L++ +S
Sbjct: 215 -------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESW 267
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 268 PVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 327
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + K+AWFL+
Sbjct: 328 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 387
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + +GS E
Sbjct: 388 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFAGSQEP 441
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 442 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 477
Query: 622 QVW 624
+W
Sbjct: 478 NMW 480
>gi|449280745|gb|EMC87981.1| Tyrosyl-DNA phosphodiesterase 1 [Columba livia]
Length = 604
Score = 292 bits (747), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 181/484 (37%), Positives = 267/484 (55%), Gaps = 58/484 (11%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L +V G+ N+ + I+D++ G ++ + NY D+ WL+ P +
Sbjct: 156 PFRFFLTKVTGIEQSYNSGALHIKDILSPLFGTLVSSAQFNYCFDVGWLVRQYPQEFRKK 215
Query: 218 HVLVIHGES-DGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HGE + E + + +P I + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 216 PLLIVHGEKRESKAELVAQARPYEHISFCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 275
Query: 276 NLIHVDWNNKSQGLWMQD-FPLKDQNNL----SEECGFENDLIDYLSTLKWPEFSANLPA 330
NLI DW+ K+QG+W+ +P Q E F++DLI YL+ P +
Sbjct: 276 NLIAEDWHQKTQGIWLSPLYPRLPQGTTGSAGESETNFKSDLISYLTAYNSPTLKEWI-- 333
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 390
++ + S V L+ S PG + GS +KWGH++LR +L++ ++S
Sbjct: 334 --------DLIQEHDLSETRVYLLGSTPGRYQGSDKEKWGHLRLRKLLKDHASSIPARES 385
Query: 391 -PLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GSL KW+ +E S+ + S TPL P+ +V+PTV++VR SL
Sbjct: 386 WPVVGQFSSIGSLGVDGSKWLCSEFQESLVAAGSSVTTPLKCDVPIHLVYPTVDNVRQSL 445
Query: 445 EGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFL 501
EGY AG ++P + K +L Y+ KW AS +GRS A+PHIKT+ R + QK+AWFL
Sbjct: 446 EGYPAGGSLPYSIQTAQKQLWLHSYFHKWAASISGRSHAIPHIKTYMRPSPDFQKIAWFL 505
Query: 502 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
+T ANLSKAAWGAL+K+ +QLMIRSYELGVL LPSA G+ C SE K +T
Sbjct: 506 VTLANLSKAAWGALEKSGTQLMIRSYELGVLFLPSAFGLDKGYFCVRGKTLSESKESAT- 564
Query: 562 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 620
Y PVPY+LPP++Y S+D PW W+ +T D +
Sbjct: 565 ---------------------------YFPVPYDLPPEQYGSKDQPWIWNIPHTDAPDTH 597
Query: 621 GQVW 624
G +W
Sbjct: 598 GNMW 601
>gi|402876919|ref|XP_003902197.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 1 [Papio anubis]
gi|402876921|ref|XP_003902198.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 2 [Papio anubis]
Length = 603
Score = 292 bits (747), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 188/483 (38%), Positives = 265/483 (54%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 158 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 217
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 218 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 277
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 278 NLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISYLMAYNAPSLKEWIDT- 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
+ + S V LI S PG GS WGH +LR +L++ +S
Sbjct: 337 ---------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKDHASSIPNAESW 387
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 388 PVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 447
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + K+AWFL+
Sbjct: 448 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 507
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F + V + +GS E
Sbjct: 508 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDNFKVKQKFFAGSQEP 561
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 562 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 597
Query: 622 QVW 624
+W
Sbjct: 598 NMW 600
>gi|383873205|ref|NP_001244708.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
gi|355693501|gb|EHH28104.1| hypothetical protein EGK_18452 [Macaca mulatta]
gi|380814614|gb|AFE79181.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
gi|383419927|gb|AFH33177.1| tyrosyl-DNA phosphodiesterase 1 [Macaca mulatta]
Length = 603
Score = 292 bits (747), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 188/483 (38%), Positives = 265/483 (54%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 158 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 217
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 218 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 277
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 278 NLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISYLMAYNAPSLKEWIDT- 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
+ + S V LI S PG GS WGH +LR +L++ +S
Sbjct: 337 ---------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKDHASSIPNAESW 387
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 388 PVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 447
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + K+AWFL+
Sbjct: 448 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 507
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F + V + +GS E
Sbjct: 508 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDNFKVKQKFFAGSQEP 561
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 562 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 597
Query: 622 QVW 624
+W
Sbjct: 598 NMW 600
>gi|355778790|gb|EHH63826.1| hypothetical protein EGM_16873 [Macaca fascicularis]
Length = 603
Score = 291 bits (746), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 188/483 (38%), Positives = 265/483 (54%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 158 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 217
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 218 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 277
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ DLI YL P +
Sbjct: 278 NLIHADWHQKTQGIWLSPLYPRIVDGTHESGESTTHFKADLISYLMAYNAPSLKEWIDT- 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
+ + S V LI S PG GS WGH +LR +L++ +S
Sbjct: 337 ---------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKDHASSIPNAESW 387
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 388 PVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 447
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + K+AWFL+
Sbjct: 448 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 507
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F + V + +GS E
Sbjct: 508 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDNFKVKQKFFAGSQEP 561
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 562 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 597
Query: 622 QVW 624
+W
Sbjct: 598 NMW 600
>gi|354478467|ref|XP_003501436.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Cricetulus griseus]
gi|344235810|gb|EGV91913.1| Tyrosyl-DNA phosphodiesterase 1 [Cricetulus griseus]
Length = 609
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 184/485 (37%), Positives = 264/485 (54%), Gaps = 63/485 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + IRD++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIRDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRNK 223
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP AN L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 224 PILIVHGDKREDKAHLHAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 283
Query: 276 NLIHVDWNNKSQGLWMQD-FPLKDQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ +P DQ + + F+ DLI YL + P +
Sbjct: 284 NLIREDWHQKTQGIWLSPLYPRLDQGSHTSGESSTHFKADLISYLMSYNAPSLQEWIDT- 342
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
++ + S V L+ S PG GS WGH +LR +L+ T K
Sbjct: 343 ---------IQEHDLSETNVYLVGSTPGRFQGSHKDNWGHFRLRKLLR--THAPSVPKDE 391
Query: 391 --PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + + +TP PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKESLLALREDGRTPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKTFARYNG--QKLAWF 500
LEGY AG ++P + ++ ++L Y+ KW A +GRS AMPHIKT+ R + KLAWF
Sbjct: 452 LEGYPAGGSLPYGIQTAERQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSSDFNKLAWF 511
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
L+TSANLSKAAWG L+KN +QLMIRSYELGVL LPSA F + V + S S
Sbjct: 512 LVTSANLSKAAWGTLEKNGTQLMIRSYELGVLFLPSA------FGLDAFKVKQKFFSSSC 565
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 619
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 566 EPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDT 601
Query: 620 YGQVW 624
+G +W
Sbjct: 602 HGNMW 606
>gi|28373796|pdb|1MU7|A Chain A, Crystal Structure Of A Human Tyrosyl-dna Phosphodiesterase
(tdp1)- Tungstate Complex
gi|28373797|pdb|1MU7|B Chain B, Crystal Structure Of A Human Tyrosyl-dna Phosphodiesterase
(tdp1)- Tungstate Complex
gi|28373798|pdb|1MU9|A Chain A, Crystal Structure Of A Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)- Vanadate Complex
gi|28373799|pdb|1MU9|B Chain B, Crystal Structure Of A Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)- Vanadate Complex
gi|29726730|pdb|1NOP|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1) In Complex With Vanadate, Dna And A Human
Topoisomerase I-Derived Peptide
gi|29726731|pdb|1NOP|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1) In Complex With Vanadate, Dna And A Human
Topoisomerase I-Derived Peptide
gi|46015472|pdb|1RFF|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octapeptide Klnyydpr, And
Tetranucleotide Agtt.
gi|46015473|pdb|1RFF|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octapeptide Klnyydpr, And
Tetranucleotide Agtt.
gi|46015478|pdb|1RFI|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Pentapeptide Klnyk, And
Tetranucleotide Agtc
gi|46015479|pdb|1RFI|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Pentapeptide Klnyk, And
Tetranucleotide Agtc
gi|46015488|pdb|1RG1|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtt
gi|46015489|pdb|1RG1|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtt
gi|46015492|pdb|1RG2|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agta
gi|46015493|pdb|1RG2|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agta
gi|46015502|pdb|1RGT|A Chain A, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtc
gi|46015503|pdb|1RGT|B Chain B, Crystal Structure Of Human Tyrosyl-dna Phosphodiesterase
Complexed With Vanadate, Octopamine, And Tetranucleotide
Agtc
gi|46015506|pdb|1RGU|A Chain A, The Crystal Structure Of Human Tyrosyl-dna
Phosphodiesterase Complexed With Vanadate, Octopamine,
And Tetranucleotide Agtg
gi|46015507|pdb|1RGU|B Chain B, The Crystal Structure Of Human Tyrosyl-dna
Phosphodiesterase Complexed With Vanadate, Octopamine,
And Tetranucleotide Agtg
gi|46015511|pdb|1RH0|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine And Trinucleotide
Gtt
gi|46015512|pdb|1RH0|B Chain B, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
Complexed With Vanadate, Octopamine And Trinucleotide
Gtt
Length = 485
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 187/483 (38%), Positives = 265/483 (54%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 40 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 99
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 100 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 159
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ +LI YL+ P +
Sbjct: 160 NLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKANLISYLTAYNAPSLKEWI--- 216
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K + S V LI S PG GS WGH +L+ +L++ +S
Sbjct: 217 -------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESW 269
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E SM + E KTP PL +++P+VE+VR SLE
Sbjct: 270 PVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 329
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + K+AWFL+
Sbjct: 330 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLV 389
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA S V + +GS E
Sbjct: 390 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------LGLDSFKVKQKFFAGSQEP 443
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
+ PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 444 ------------------------MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 479
Query: 622 QVW 624
+W
Sbjct: 480 NMW 482
>gi|348573481|ref|XP_003472519.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Cavia porcellus]
Length = 607
Score = 288 bits (737), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 192/523 (36%), Positives = 276/523 (52%), Gaps = 66/523 (12%)
Query: 122 ATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCV 181
NG +S +++++DE + S E + + P F L RV G+ N+ +
Sbjct: 128 GNNGLPASHRLKEEDEYET-----SGEGQDIWDMLDKGNPFQFYLTRVSGIKPKYNSKAL 182
Query: 182 SIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKR 235
I+D++ G ++ + NY D+DWL+ P + +L++HG E+ L H +
Sbjct: 183 HIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKADL-HAQA 241
Query: 236 NKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-F 294
AN L + L I+FGTHH+K MLL+Y G R+++HT+N+I DW+ K+QG+W+ +
Sbjct: 242 KPYANVSLCQAKLDIAFGTHHTKMMLLLYEEGFRVVIHTSNIIREDWHQKTQGIWLSPLY 301
Query: 295 PLKD---QNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P D Q + F+ DLI YL P + ++ + S V
Sbjct: 302 PRLDPGSQKSGESRTHFKADLISYLMAYNAPPLKEWIDT----------IREHDLSETNV 351
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKSPLVYQFSSLGSL---DEKWM 407
LI S PG GS WGH KLR +L+E T + PLV QFSS+GSL + KW+
Sbjct: 352 YLIGSTPGRFQGSQKDNWGHFKLRKLLKEHGTPVPKTECWPLVGQFSSIGSLGADESKWL 411
Query: 408 -AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-F 464
+E S+ + E+K P PL +++P+VE+VR SLEGY AG ++P + +K +
Sbjct: 412 CSEFKESLLTLGPENKIPGKSSVPLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQKW 471
Query: 465 LKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQL 522
L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+TSANLSKAAWGAL+KN +QL
Sbjct: 472 LHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSRIAWFLVTSANLSKAAWGALEKNGTQL 531
Query: 523 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAG 582
MIRSYELGVL LPS F S V + SGS + +
Sbjct: 532 MIRSYELGVLFLPSV------FGLDSFKVKQKFFSGSQDPT------------------- 566
Query: 583 ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 567 -----TAFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 604
>gi|50748586|ref|XP_421313.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Gallus gallus]
Length = 606
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 186/515 (36%), Positives = 275/515 (53%), Gaps = 62/515 (12%)
Query: 134 QQDEQDNENGKNSEEALCNFHVSRDKL----PSTFRLLRVQGLPAWANTSCVSIRDVIQ- 188
+ + NE ++ E L + D L P F L +V+G+ N+ + I+D++
Sbjct: 127 KDEHSKNEKAEDYNEVLGEPQDTWDLLSGGNPFGFFLTKVRGIEQSYNSGALHIKDILSP 186
Query: 189 --GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILH 244
G ++ + NY +D+ WL+ P + +L++HGE + E + + +P N
Sbjct: 187 LFGTLVSSAQFNYCIDVAWLVRQYPQEYRKKPLLIVHGEKRESKAELLAQARPFENISFC 246
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQN 300
+ L I+FGTHH+K MLL+Y G+R+++HT+NLI DW+ K+QG+W+ P +
Sbjct: 247 QAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIAEDWHQKTQGIWLSPLYPRLPQGSSD 306
Query: 301 NLSE-ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
+ E E F++DLI YL P + ++ + S V L+ S PG
Sbjct: 307 SAGESETNFKSDLISYLMAYSSPVLKEWI----------DLIREHDLSETRVYLLGSTPG 356
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSLD---EKWM-AELSSSM 414
+ G +KWGH+KLR +L++ ++S P+V QFSS+GSL KW+ +E S+
Sbjct: 357 RYQGIDKEKWGHLKLRKLLKDHASSIPAQESWPVVGQFSSIGSLGADGSKWLCSEFQESL 416
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKW 472
+ S L P+ +V+PTV +VR SLEGY AG ++P + K L Y+ KW
Sbjct: 417 VAAGSGVAALLKCDVPIHLVYPTVSNVRQSLEGYPAGGSLPYSIQTAQKQLWLHSYFHKW 476
Query: 473 KASHTGRSRAMPHIKTFAR--YNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R ++ QK+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 477 SAEVSGRSHAMPHIKTYMRPSHDFQKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 536
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LPSA G+ + SE K +T
Sbjct: 537 VLFLPSAFGLDKGYFHVKGNMLSEGKDSATS----------------------------F 568
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVP++LPP+RY S+D PW W+ YT D +G +W
Sbjct: 569 PVPFDLPPERYGSKDQPWIWNIPYTSAPDTHGNMW 603
>gi|162417986|ref|NP_082630.2| tyrosyl-DNA phosphodiesterase 1 [Mus musculus]
gi|148686961|gb|EDL18908.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_b [Mus musculus]
Length = 609
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 186/485 (38%), Positives = 263/485 (54%), Gaps = 63/485 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWF 500
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R + KLAWF
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
L+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F + V + S S
Sbjct: 512 LVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDTFKVKQKFFSSSC 565
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 619
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 566 EPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDT 601
Query: 620 YGQVW 624
+G +W
Sbjct: 602 HGNMW 606
>gi|311261437|ref|XP_003128731.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Sus scrofa]
Length = 606
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 183/482 (37%), Positives = 258/482 (53%), Gaps = 58/482 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + IRD++ G ++ + NY D+DWL+ P +
Sbjct: 162 PFQFYLTRVSGIKPKYNSGALHIRDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 221
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 222 PVLLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 281
Query: 276 NLIHVDWNNKSQGLWM----QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ Q + F+ DLI YLS
Sbjct: 282 NLIHADWHQKTQGIWLSPLYQRIVPGSHRSGESATHFKADLISYLSAYN----------A 331
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K ++ + S V LI S PG G WGH +LR +L+E +S
Sbjct: 332 AALKEWIDTIQEHDLSETNVYLIGSTPGRFQGDQKDNWGHFRLRKLLKENGSSIPKAESW 391
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEG 446
P+V QFSS+ S+ + KW+ +E S+ + E +TP G +++P+VE+VR SLEG
Sbjct: 392 PVVGQFSSISSMGADESKWLCSEFKESLVTLGKESRTPGGAVPLHLIYPSVENVRTSLEG 451
Query: 447 YAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLT 503
Y AG ++P + +K +L Y+ KW A+ +GRS AMPHIKT+ R + ++AWFL+T
Sbjct: 452 YPAGGSLPYSIQTAEKQTWLHSYFHKWSAATSGRSNAMPHIKTYMRPSPDFSQIAWFLVT 511
Query: 504 SANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 563
SANLSKAAWGAL+KN SQLMIRSYELGVL LP+A F S V + SGS E +
Sbjct: 512 SANLSKAAWGALEKNGSQLMIRSYELGVLFLPAA------FGLDSFRVKQKFFSGSQEPT 565
Query: 564 QIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQ 622
PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 566 ------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYMKAPDTHGN 601
Query: 623 VW 624
+W
Sbjct: 602 MW 603
>gi|40796186|gb|AAH65162.1| Tdp1 protein [Mus musculus]
Length = 609
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 186/485 (38%), Positives = 263/485 (54%), Gaps = 63/485 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWF 500
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R + KLAWF
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
L+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F + V + S S
Sbjct: 512 LVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDTFKVKQKFFSSSC 565
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 619
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 566 EPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDT 601
Query: 620 YGQVW 624
+G +W
Sbjct: 602 HGNMW 606
>gi|37999670|sp|Q8BJ37.2|TYDP1_MOUSE RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1; AltName: Full=Protein expressed in
male leptotene and zygotene spermatocytes 501;
Short=MLZ-501
Length = 609
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 186/485 (38%), Positives = 263/485 (54%), Gaps = 63/485 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWF 500
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R + KLAWF
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
L+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F + V + S S
Sbjct: 512 LVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDTFKVKQKFFSSSC 565
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 619
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 566 EPT------------------------ASFPVPYDLPPELYRSKDRPWIWNIPYVKAPDT 601
Query: 620 YGQVW 624
+G +W
Sbjct: 602 HGNMW 606
>gi|224051603|ref|XP_002200587.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Taeniopygia guttata]
Length = 609
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 186/518 (35%), Positives = 276/518 (53%), Gaps = 66/518 (12%)
Query: 135 QDEQDNENGKNSE------EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
+D++ +EN K E EA + + P F L +V G+ N+ + I+D++
Sbjct: 127 KDDKLSENLKEEEYNVTPSEAQDTWDLVTGDNPFRFFLTKVSGIEQSYNSGALHIKDILS 186
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWIL 243
G +I + NY +D+ WL+ P + +L++HGE + E + + +P N
Sbjct: 187 PLFGTLISSAQFNYCIDVGWLVRQYPQEFRKKPLLIVHGEKRESKAELIAQARPYENISF 246
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS 303
+ L I+FGTHH+K MLL+Y G+R+++HT+NLI DW+ K+QG+W+ + S
Sbjct: 247 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIAEDWHQKTQGIWLSPLYPRLSKGTS 306
Query: 304 EECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 358
G F++DLI YL+ P + ++ + S V L+ S P
Sbjct: 307 GSAGESATNFKSDLISYLAAYNSPALREWI----------DLIQEHDLSETRVYLLGSTP 356
Query: 359 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS---PLVYQFSSLGSLD---EKWM-AELS 411
G + G+ +KWGH++LR +L+E ++S PLV QFSS+GS+ KW+ +E
Sbjct: 357 GRYQGNDKEKWGHLRLRKLLKEHALPIPAQESWPLPLVGQFSSIGSMGADGSKWLCSEFQ 416
Query: 412 SSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYW 469
S+ + S T P+ +V+PTV +VR SLEGY AG ++P + K L Y+
Sbjct: 417 ESLVAAGSSVTTFRKCDVPIHLVYPTVNNVRQSLEGYPAGGSLPYSIQTAQKQLWLHSYF 476
Query: 470 AKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSY 527
KW A TGR+ A+PHIKT+ R + QK+AWFL+TSANLSKAAWGAL+KN SQLMIRSY
Sbjct: 477 HKWSADVTGRTHAIPHIKTYMRLSPDFQKIAWFLVTSANLSKAAWGALEKNGSQLMIRSY 536
Query: 528 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 587
ELGVL LPSA F + + +GS + +
Sbjct: 537 ELGVLFLPSA------FGIFRLDLRKKFFTGSEQPAT----------------------T 568
Query: 588 VYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
Y PVPY+LPP++Y S+D PW W+ YT D +G +W
Sbjct: 569 TYFPVPYDLPPEQYGSKDQPWIWNIPYTDAPDTHGNMW 606
>gi|149737576|ref|XP_001496143.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Equus caballus]
Length = 611
Score = 285 bits (730), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 182/485 (37%), Positives = 262/485 (54%), Gaps = 63/485 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 166 PFQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKT 225
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 226 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 285
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLIDYLSTLKWPEFSANLP 329
NL+H DW+ K+QG+W+ PL + ++ F+ DLI YL P +
Sbjct: 286 NLLHADWHQKTQGIWLS--PLYSRIVHGTHSSGESTTHFKADLISYLMAYNAPSLKEWI- 342
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKK 389
++ + S V LI S PG GS WGH +LR +L+E +
Sbjct: 343 ---------DVIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASSIPKAE 393
Query: 390 S-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
S P+V QFSS+GS+ + KW+ +E S+ + E KTP P +++P+VE+VR S
Sbjct: 394 SWPIVGQFSSIGSMGADESKWLCSEFKESLVTLGKESKTPGKSVSPFHLIYPSVENVRTS 453
Query: 444 LEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWF 500
LEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWF
Sbjct: 454 LEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSQIAWF 513
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
L+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + S +
Sbjct: 514 LVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSDNQ 567
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DV 619
E + PVPY+LPP+ Y S+D PW W+ Y K D
Sbjct: 568 EPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYIKAPDT 603
Query: 620 YGQVW 624
+G +W
Sbjct: 604 HGNMW 608
>gi|291406635|ref|XP_002719650.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Oryctolagus cuniculus]
Length = 609
Score = 285 bits (728), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 191/535 (35%), Positives = 283/535 (52%), Gaps = 64/535 (11%)
Query: 110 LSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRV 169
+S + + G +G +S +++++ E +E ++ + + P F L RV
Sbjct: 116 VSSPRDGTAQTGGNHGPAASHRLKEEGEDKHETAGEGQDL---WDMLDRGNPFRFYLTRV 172
Query: 170 QGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES 226
G+ N+ + I+D++ G ++ + NY D+DWL+ P + +L++HG+
Sbjct: 173 SGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRRKPILLVHGDK 232
Query: 227 DGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN 284
H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+
Sbjct: 233 REAKAHLHAQAKPYENIALCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHEDWHQ 292
Query: 285 KSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPA-HGNFKINPS 339
K+QG+W+ +P L + S E F+ DLI YL P + HG+
Sbjct: 293 KTQGIWLSPLYPRLVHGTHRSGESTTHFKADLISYLMAYNAPSLQEWIDTIHGH------ 346
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSS 398
+ S V LI S PG G+ WGH +LR +L+E T +S P+V QFSS
Sbjct: 347 -----DLSETNVYLIGSTPGRFQGNQKDNWGHFRLRKLLKEHTSSVPQAESWPIVGQFSS 401
Query: 399 LGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI 453
+GSL + KW+ +E S+ + +T PL +++P+VE+VR SLEGY AG ++
Sbjct: 402 IGSLGADESKWLCSEFKESLLTLGQASRTAGKSTVPLHLIYPSVENVRTSLEGYPAGGSL 461
Query: 454 P-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKA 510
P S Q +++L Y+ KW A +GRS AMPHIKT+ R + K+AWFL+TSANLSKA
Sbjct: 462 PYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKA 521
Query: 511 AWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKL 570
AWGAL+KN +QLMIRSYELGVL LP+ F S V + S E +
Sbjct: 522 AWGALEKNGTQLMIRSYELGVLFLPAT------FGLDSFNVKQKFFSSHQEPA------- 568
Query: 571 VTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 569 -----------------AAFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 606
>gi|72255547|ref|NP_001026827.1| tyrosyl-DNA phosphodiesterase 1 [Rattus norvegicus]
gi|123781898|sp|Q4G056.1|TYDP1_RAT RecName: Full=Tyrosyl-DNA phosphodiesterase 1; Short=Tyr-DNA
phosphodiesterase 1
gi|71051114|gb|AAH98739.1| Tyrosyl-DNA phosphodiesterase 1 [Rattus norvegicus]
gi|149025341|gb|EDL81708.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Rattus norvegicus]
Length = 609
Score = 285 bits (728), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 184/484 (38%), Positives = 260/484 (53%), Gaps = 61/484 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D++WL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVNWLIKQYPPEFRKK 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P Q N + F+ DL YL P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTSYLMAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 390
++ + S V LI S PG GS WGH +LR +LQ +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLRKLLQAHAPSAPRGEC 392
Query: 391 -PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GSL + KW+ +E S+ + E +TP PL +++P+VE+VR SL
Sbjct: 393 WPVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAVPLHLIYPSVENVRTSL 452
Query: 445 EGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFL 501
EGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R + KLAWFL
Sbjct: 453 EGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWFL 512
Query: 502 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F + V + S S+E
Sbjct: 513 VTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDTFKVKQKFFSSSSE 566
Query: 562 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 620
+ PVPY+LPP+ Y S+D PW W+ Y K D +
Sbjct: 567 P------------------------MASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTH 602
Query: 621 GQVW 624
G +W
Sbjct: 603 GNMW 606
>gi|327259270|ref|XP_003214461.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Anolis
carolinensis]
Length = 603
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 184/510 (36%), Positives = 280/510 (54%), Gaps = 61/510 (11%)
Query: 138 QDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVA 194
Q E+ + SE+ + + + P F L +V+G+ + N + I+D++ G ++ +
Sbjct: 134 QSQESSQPSEKVQDTWDLLNGENPFRFFLTKVKGIDSKYNLGALHIKDILSPLFGTLVSS 193
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISF 252
NY +D+ WL+ P + +L++HGE + ++ N L + L I+F
Sbjct: 194 AQFNYCIDLGWLVKQYPKEFREKPLLIVHGEKRESKAELQEEASLYDNVRLCQAKLDIAF 253
Query: 253 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEECG 307
GTHH+K MLL Y G+R+++HT+NLI DW K+QG+W+ P ++
Sbjct: 254 GTHHTKMMLLHYEEGLRVVIHTSNLIADDWYQKTQGIWLSPLYPRLPPGASASDGESHTM 313
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 367
F++DLI YL + K PA G + K+ +FS V L+ S PG + S +
Sbjct: 314 FKSDLISYLMSYK-------SPALGKWA---ETIKQHDFSETRVYLLGSTPGRYQNSDKE 363
Query: 368 KWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDK 422
KWGH++L+ +L++ + + S P++ QFSS+GS+ KW+ +E S++S ++ K
Sbjct: 364 KWGHLRLKKLLKDHVMQVSDQDSWPVIGQFSSIGSMGADQSKWLCSEFRDSLTSLGNDTK 423
Query: 423 TPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRS 480
P+ +V+PTVE+VR SLEGY AG ++P + K +L Y+ KW A +GRS
Sbjct: 424 ALTNRDIPIHLVYPTVENVRQSLEGYPAGGSLPYSIETAKKQLWLHAYFHKWSAETSGRS 483
Query: 481 RAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAK 538
RAMPHIKT+ R + QK+AWFL+TSANLSKAAWGA +K +QLMIRSYELGVL LPS
Sbjct: 484 RAMPHIKTYMRASPDFQKIAWFLVTSANLSKAAWGAFEKKGTQLMIRSYELGVLFLPSE- 542
Query: 539 RHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPP 598
F S Q++++ S+ +SS PVPY+LPP
Sbjct: 543 -----FGLNSGYF------------QVKESMF--------SNEPSSS----FPVPYDLPP 573
Query: 599 QRYSSEDVPWSWDKRYTKK-DVYGQVW-PR 626
++Y +D PW W+ YT+ D YG +W PR
Sbjct: 574 KKYEGKDRPWIWNIPYTRAPDTYGNMWVPR 603
>gi|126282139|ref|XP_001366471.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Monodelphis domestica]
Length = 608
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 188/499 (37%), Positives = 271/499 (54%), Gaps = 60/499 (12%)
Query: 146 SEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVD 202
S+E+ + + +K P F L +V G+ N + I+D++ G ++ + NY D
Sbjct: 147 SDESQEPWDLLEEKNPFRFYLTKVSGIMPKYNAGVLHIKDILSPLFGTLLSSAQFNYCFD 206
Query: 203 IDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKAM 260
IDWL+ P+ + +L++HG+ + ++ KP N L + L I+FGTHH+K M
Sbjct: 207 IDWLIRQYPLEFRKKPILLVHGDKREAKARLQEQAKPYENISLCQAKLDIAFGTHHTKMM 266
Query: 261 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFENDLIDY 315
LL+Y G+R+++HT+NLI DW+ K+QG+W+ P + E F++DLI Y
Sbjct: 267 LLLYEEGLRVVIHTSNLIQADWHQKTQGIWLSPLYPRLPYGTPSTSGESSTNFKSDLIRY 326
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 375
L T P + K ++ + S V LI S PG GS + WGH +LR
Sbjct: 327 LMTYNAP----------SLKEWADIIQEHDLSETRVYLIGSTPGRFQGSHKEDWGHFRLR 376
Query: 376 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 430
+L+E T ++S P+V QFSS+GSL + KW+ AE S+ + K+ P
Sbjct: 377 KLLKEHTSLVPEQQSWPIVGQFSSIGSLGADESKWLCAEFKESLVVLGNCGKSQGQQDVP 436
Query: 431 L-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKT 488
L +++PTVE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT
Sbjct: 437 LYLIYPTVENVRKSLEGYPAGGSLPYSLQTAEKQLWLHSYFHKWSAETSGRSHAMPHIKT 496
Query: 489 FARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 546
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPS F
Sbjct: 497 YMRLSPDFAKMAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPST------FGM 550
Query: 547 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 606
+ V ++ S + E V PVPY+LPP Y S+D
Sbjct: 551 DTFKVKKKVFSENREP------------------------VTSFPVPYDLPPNIYDSKDR 586
Query: 607 PWSWDKRYTKK-DVYGQVW 624
PW W+ YTK D +G +W
Sbjct: 587 PWIWNIPYTKAPDTHGNMW 605
>gi|125841556|ref|XP_700174.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Danio rerio]
Length = 615
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 182/492 (36%), Positives = 262/492 (53%), Gaps = 83/492 (16%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +V G+P NT + I++++ G + ++ NY DI W++ P + V+
Sbjct: 173 FYLNKVTGIPKKYNTGALHIKEILSPMFGTLKESVQFNYCFDIPWMVEQYPPEFRNKPVV 232
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLP----------ISFGTHHSKAMLLIYPRGVRI 270
++HGE KR A I P P I+FGTHH+K MLL Y G R+
Sbjct: 233 LVHGE--------KRESKACLIEQAKPYPHISFCQAKLDIAFGTHHTKMMLLWYEEGFRV 284
Query: 271 IVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE----CGFENDLIDYLSTLKWPEFS 325
I+ T+NLI DW K+QG+WM +P Q + GF+ DL++YL + PE +
Sbjct: 285 IILTSNLIRADWYQKTQGMWMSPLYPRLPQGSPGTAGESLTGFKRDLLEYLEAYRAPELA 344
Query: 326 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE- 384
+ K+ + S V LI S PG + G +++KWGH++LR +L E T
Sbjct: 345 NWI----------ERIKQHDLSETRVYLIGSTPGRYQGPAMEKWGHLRLRKLLSEHTQPM 394
Query: 385 KGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEP----LIVWPT 436
+ ++ ++ QFSS+GS+ KW+A E ++++ K+ + P L+++P+
Sbjct: 395 QNEERWHVLGQFSSIGSMGLDKTKWLAAEFQRTLTTLGKAGKS---LASPETQMLLIYPS 451
Query: 437 VEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ 495
VE+VR SLEGY AG ++P + K +L Y+ W A TGRS AMPHIKT+ R +
Sbjct: 452 VENVRTSLEGYPAGGSLPYSIQTAQKQLWLHSYFHGWHADVTGRSNAMPHIKTYMRISPD 511
Query: 496 --KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 553
+LAWFL+TSANLSKAAWGAL+KNN+Q+M+RSYELGVL LPSA F N+ P
Sbjct: 512 FTQLAWFLVTSANLSKAAWGALEKNNTQIMVRSYELGVLYLPSAFNMST-FPVEKNVFP- 569
Query: 554 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 613
A S + PVP++LPPQRYSS+D PW W+
Sbjct: 570 -----------------------------ACSSSIGFPVPFDLPPQRYSSKDRPWIWNIP 600
Query: 614 YTKK-DVYGQVW 624
YT+ D +G VW
Sbjct: 601 YTQAPDTHGNVW 612
>gi|348500374|ref|XP_003437748.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Oreochromis
niloticus]
Length = 616
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 180/489 (36%), Positives = 263/489 (53%), Gaps = 80/489 (16%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +V GL N+ + IRD++ G + ++ NY DI W++ P + VL
Sbjct: 177 FYLNKVTGLEKKYNSGALHIRDILSPLFGTLKESVQFNYCFDIAWMVKQYPSEFRDRPVL 236
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLP----------ISFGTHHSKAMLLIYPRGVRI 270
++HG+ KR A I P P I+FGTHH+K MLL Y G R+
Sbjct: 237 IVHGD--------KREAKARLIQQAQPFPHVRFCQAKLDIAFGTHHTKMMLLWYEEGFRV 288
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLIDYLSTLKWPEFS 325
I+ T+NLI DW K+QG+WM + S G F+ DL++YL++ + PE
Sbjct: 289 IILTSNLIRADWYQKTQGMWMSPLYPRLPKESSASAGESPTFFKRDLLEYLASYRAPELE 348
Query: 326 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE- 384
+ K+ + S V L+ S PG + GS +++WGH++LR +L E T
Sbjct: 349 EWI----------QRIKEHDLSETRVYLVGSTPGRYVGSDMERWGHLRLRKLLYEHTNPI 398
Query: 385 KGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEPL-IVWPTVED 439
G ++ P++ QFSS+GS+ KW+A E ++++ K+ L P+ +++P+VED
Sbjct: 399 PGEERWPVIGQFSSIGSMGLDKSKWLAGEFQRTLTT---LGKSSLRPDPPMHLLYPSVED 455
Query: 440 VRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQK 496
VR SLEGY AG ++P + K +L Y+ +WKA TGRS AMPHIKT+ R + +
Sbjct: 456 VRMSLEGYPAGGSLPYSIQTAQKQLWLHSYFHRWKAEATGRSHAMPHIKTYMRASPDFSQ 515
Query: 497 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 556
LAWFL+TSANLSKAAWGAL+KNN+Q+M+RSYELGVL LPSA FS N P
Sbjct: 516 LAWFLVTSANLSKAAWGALEKNNTQMMVRSYELGVLYLPSAFGMKT-FSVDKNPFP---- 570
Query: 557 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 616
V+ ++ G PVP++LPP Y+++D PW W+ Y++
Sbjct: 571 --------------VSASFSG------------FPVPFDLPPTSYTTKDQPWIWNIPYSQ 604
Query: 617 K-DVYGQVW 624
D +G +W
Sbjct: 605 APDTHGNIW 613
>gi|346467109|gb|AEO33399.1| hypothetical protein [Amblyomma maculatum]
Length = 423
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 176/454 (38%), Positives = 251/454 (55%), Gaps = 64/454 (14%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--ANWILHKP 246
G ++ + NY DI WL+ P + +L++HGE + ++ + N +
Sbjct: 7 GQLVRSAQFNYCFDIPWLVEQYPPEFRSFPLLIVHGEQREAKKELEASAADFKNLSFVQA 66
Query: 247 PLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQNNLS 303
L I +GTHH+K MLL+Y G+RI++HTANL+ DW K+Q +W+ + D
Sbjct: 67 KLEIVYGTHHTKMMLLLYKDGLRIVIHTANLVASDWAQKTQAIWVSPVCTRLASDSKGGD 126
Query: 304 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH 361
E GF+ DL+ YLS A+G+ +IN + + +FS+ V L+ SVPG H
Sbjct: 127 SETGFKADLLTYLS------------AYGDPRINEWCHYIRSHDFSAVKVFLVGSVPGRH 174
Query: 362 TGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLVYQFSSLGSLD---EKWM-AELSSSMS 415
TG +GH++LRT+L + K S PLV QFSS+GSL + W+ E SS+S
Sbjct: 175 TGPRKSSFGHLRLRTLLNQHGPSKDLVSSHWPLVAQFSSIGSLGTSAQAWLTGEFLSSLS 234
Query: 416 SGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWK 473
+ S TP + PL +V+P+V+DVRCSLEGY AG +IP K +L Y+ +WK
Sbjct: 235 ATKSSGSTPQSV--PLKLVFPSVDDVRCSLEGYPAGASIPYSIVTASKQRWLDSYFYRWK 292
Query: 474 ASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
+ GR+ A PHIKT+ R + G++ AWFL+TSANLSKAAWGA +KN SQLMIRSYELGV
Sbjct: 293 SERLGRTAASPHIKTYTRLSPSGKQAAWFLVTSANLSKAAWGAFEKNGSQLMIRSYELGV 352
Query: 532 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 591
L+ P+ S Q T + SD SS +YLP
Sbjct: 353 LLFPA--------------------------SFGQATTFIV------SDESCSSSALYLP 380
Query: 592 VPYELPPQRYSSEDVPWSWDKRYTK-KDVYGQVW 624
+PY+LP Y+S+D PW+WD ++ + D +G +W
Sbjct: 381 LPYDLPLVPYTSDDEPWTWDSQHRELPDRFGNMW 414
>gi|344274118|ref|XP_003408865.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Loxodonta africana]
Length = 612
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 188/513 (36%), Positives = 273/513 (53%), Gaps = 60/513 (11%)
Query: 131 KMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ-- 188
+ R ++E+++E K S E + + P F L RV G+ N + IRD++
Sbjct: 138 RHRLKEEEEDEY-KTSGEGQDIWDMVNKGNPFQFYLTRVSGIKPKYNCGALHIRDILSPL 196
Query: 189 -GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWILHK 245
G ++ + NY D+DWL+ P + +L++HG+ H+ KP N L +
Sbjct: 197 FGTLVSSAQFNYCFDVDWLVKQYPPEFRNKPILLVHGDKREAKAHLHAEAKPYENISLCQ 256
Query: 246 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP--LKDQNNL 302
L I+FGTHH+K MLL+Y G+R+++HTANLIH DW+ K+QG+W+ +P + +
Sbjct: 257 AKLDIAFGTHHTKMMLLLYEEGLRVVIHTANLIHADWHQKTQGIWLSPLYPRIVHGTHGP 316
Query: 303 SEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH 361
E F+ DL+ YL P + ++ + S V LI S PG
Sbjct: 317 GESPTHFKADLVSYLMAYNAPPLKGWI----------DTIQEHDLSETNVYLIGSTPGRF 366
Query: 362 TGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSS 416
G WGH +LR +L+E T ++ P+V QFSS+GS+ + KW+ +E S+ +
Sbjct: 367 QGDQKDNWGHFRLRKLLREHTSPIPKAEAWPIVGQFSSIGSMGTDESKWLCSEFKESLLT 426
Query: 417 GFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKA 474
+ +T PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A
Sbjct: 427 LGKDGRTLGKSTAPLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSA 486
Query: 475 SHTGRSRAMPHIKTFAR--YNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 532
+GRS AMPHIKT+ R + +AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL
Sbjct: 487 ETSGRSSAMPHIKTYMRPSPDFSSIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVL 546
Query: 533 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 592
LPS F S V + SGS E + PV
Sbjct: 547 FLPSV------FGLDSFKVRQKFFSGSQEL------------------------MASFPV 576
Query: 593 PYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 577 PYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 609
>gi|301123067|ref|XP_002909260.1| tyrosyl-DNA phosphodiesterase, putative [Phytophthora infestans
T30-4]
gi|262100022|gb|EEY58074.1| tyrosyl-DNA phosphodiesterase, putative [Phytophthora infestans
T30-4]
Length = 1123
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 165/397 (41%), Positives = 223/397 (56%), Gaps = 54/397 (13%)
Query: 158 DKLPST--FRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAK 215
D PS F L R++ PA N + D+++GD +L+NYM D+ WL CP L +
Sbjct: 20 DTTPSELGFYLNRLKTAPASHNLHAKRLSDLLEGDFSRCLLTNYMFDLPWLFTECPRLKE 79
Query: 216 IPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+P VLV HGE D + +N PPLPI +GTHH+K ++ +YP VR+ + TA
Sbjct: 80 VPVVLV-HGERDRQGMTKECRDYSNVTPVAPPLPIPYGTHHTKMLVALYPERVRVAIFTA 138
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQNNLSEE---------CGFENDLIDYLSTLKWPEFSA 326
N + DWN K+QGLW QDF LK + EE FE DL+ YLS+L P
Sbjct: 139 NFLSNDWNTKTQGLWYQDFGLKVLTDSDEEEKEAVAKSSSDFEADLVHYLSSLGAP---- 194
Query: 327 NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKG 386
K+ K+F+FSSA V L+ SVPG H G ++K+GH+++R
Sbjct: 195 -------VKLFCGELKRFDFSSARVALVPSVPGVHKGKDMEKYGHLRVR----------- 236
Query: 387 FKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSE-DKTPLGIGEPLIVWPTVEDVRCSL 444
+LGSLDEKW+ E + S+ G T + + ++WP VEDVR SL
Sbjct: 237 -----------NLGSLDEKWLFGEFAESLLPGKKHISSTSMPVQALHVIWPAVEDVRNSL 285
Query: 445 EGYAAGNAIPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAMPHIKTFARYNGQ-----KLA 498
EG+ +G +IP P KN+ K FL KY KW + R AMPHIK++AR+N +L
Sbjct: 286 EGWNSGRSIPCPLKNM-KPFLHKYLRKWMPPAELHRQNAMPHIKSYARFNASEDKAGELD 344
Query: 499 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
W ++TS+NLSKAAWG+LQKN +Q MIRSYELGV+ LP
Sbjct: 345 WAIVTSSNLSKAAWGSLQKNKTQFMIRSYELGVMFLP 381
>gi|426233768|ref|XP_004010886.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Ovis aries]
Length = 612
Score = 282 bits (721), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 184/483 (38%), Positives = 263/483 (54%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ NT + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 167 PFQFYLTRVSGIKPKYNTGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVRQYPPEFRKK 226
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 227 PVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 286
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ +P + + S E F+ DLI YL+
Sbjct: 287 NLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATHFKADLISYLAAYN----------A 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKS 390
K ++ + S V LIAS PG G+ WGH +LR +L+E + G +
Sbjct: 337 APLKEWIDTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLRKLLKEHASPAPGAESW 396
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P++ QFSS+GS+ + KW+ +E S+ + E +T LG PL +++P+VE+VR SLE
Sbjct: 397 PVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAVPLHLIYPSVENVRTSLE 455
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+
Sbjct: 456 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYLRPSPDFSQIAWFLV 515
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+K +QLMIRSYELGVL LPSA F S V + SGS++
Sbjct: 516 TSANLSKAAWGALEKGGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGSSQ- 568
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
PVPY+LPP+ Y D PW W+ Y K D +G
Sbjct: 569 ----------------------EPTASFPVPYDLPPELYGDRDRPWIWNIPYVKAPDTHG 606
Query: 622 QVW 624
+W
Sbjct: 607 NMW 609
>gi|395827684|ref|XP_003787027.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Otolemur garnettii]
Length = 608
Score = 282 bits (721), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 201/573 (35%), Positives = 298/573 (52%), Gaps = 68/573 (11%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATN--GELSSKK 131
+R++ S E++ S +D ++ P K + V DG G S+
Sbjct: 75 KRQRSDSQEYLGWCLSSSDDELQPETPEKQAKKVIVKEEEDISVPQDGTAQRTGNHSTPA 134
Query: 132 MRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ--- 188
+ E+++E + S E + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEY-ETSGEGQDIWDMLDKGNPFQFYLTRVSGIKPKYNSGALHIKDILSPLF 193
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHK 245
G ++ + NY D+DWL+ P + +L++HG E+ L H + N L +
Sbjct: 194 GTLVSSAQFNYCFDVDWLIKQYPPEFRKKPILLVHGDKREAKADL-HAQAKPYGNISLCQ 252
Query: 246 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLS 303
L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + + S
Sbjct: 253 AKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHEDWHQKTQGIWLSPLYPRIVHGTHKS 312
Query: 304 EE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH 361
E F+ DLI YL ++A+ K + + S V LI+S PG
Sbjct: 313 GESVTHFKADLISYLMA-----YNAS-----PLKEWIDLIHEHDLSETNVYLISSTPGRF 362
Query: 362 TGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWMA-ELSSSMSS 416
GS WGH +LR +L+E +S P+V QFSS+GSL + KW++ E S+ +
Sbjct: 363 QGSQKDNWGHFRLRKLLKEHASSIPAAESWPIVGQFSSIGSLGADESKWLSSEFKESLLT 422
Query: 417 GFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKA 474
E K P PL +++P+VE+VR SLEGY AG ++P + +K ++L Y+ KW A
Sbjct: 423 LGKESKAPGKSTVPLHLIYPSVENVRTSLEGYPAGGSLPYGIQTAEKQNWLHSYFHKWSA 482
Query: 475 SHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 532
+GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL
Sbjct: 483 ETSGRSHAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVL 542
Query: 533 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 592
LPSA F S V + S + E + PV
Sbjct: 543 FLPSA------FGLDSFKVKQKFFSANKEP------------------------MATFPV 572
Query: 593 PYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PY+LPP+ Y ++D PW W+ Y K D +G +W
Sbjct: 573 PYDLPPELYGNKDRPWIWNIPYVKAPDTHGNMW 605
>gi|440911964|gb|ELR61579.1| Tyrosyl-DNA phosphodiesterase 1, partial [Bos grunniens mutus]
Length = 616
Score = 281 bits (720), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 181/483 (37%), Positives = 259/483 (53%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ NT + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 171 PFQFYLTRVSGIKPKYNTGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVRQYPPEFRKK 230
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 231 PVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 290
Query: 276 NLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ + + F+ DLI YL+
Sbjct: 291 NLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATNFKADLISYLAAYN----------A 340
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K ++ + S V LIAS PG G+ WGH +LR +L+E +S
Sbjct: 341 APLKEWIDTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLRKLLKEHASPMPKAESW 400
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P++ QFSS+GS+ + KW+ +E S+ + E +T LG PL +++P+VE+VR SLE
Sbjct: 401 PVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAAPLHLIYPSVENVRTSLE 459
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+
Sbjct: 460 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYLRPSPDFSQIAWFLV 519
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+K+ +QLMIRSYELGVL LPSA F S V + SGS++
Sbjct: 520 TSANLSKAAWGALEKSGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGSSQ- 572
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
PVPY+LPP+ Y D PW W+ Y K D +G
Sbjct: 573 ----------------------EPTASFPVPYDLPPELYGDRDRPWIWNIPYVKAPDTHG 610
Query: 622 QVW 624
+W
Sbjct: 611 NMW 613
>gi|300798259|ref|NP_001180084.1| tyrosyl-DNA phosphodiesterase 1 [Bos taurus]
gi|296482871|tpg|DAA24986.1| TPA: tyrosyl-DNA phosphodiesterase 1 [Bos taurus]
Length = 612
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 181/483 (37%), Positives = 259/483 (53%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ NT + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 167 PFQFYLTRVSGIKPKYNTGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIRQYPPEFRKK 226
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
VL++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 227 PVLLVHGDKREAKAHLLAEAKPYGNVTLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 286
Query: 276 NLIHVDWNNKSQGLWMQDFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ + + F+ DLI YL+
Sbjct: 287 NLIREDWHQKTQGIWLSPLYPRIVHGTHGSGESATNFKADLISYLAAYN----------A 336
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K ++ + S V LIAS PG G+ WGH +LR +L+E +S
Sbjct: 337 APLKEWIDTIQEHDLSETNVYLIASTPGRFQGNQKDNWGHFRLRKLLKEHASPMPKAESW 396
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P++ QFSS+GS+ + KW+ +E S+ + E +T LG PL +++P+VE+VR SLE
Sbjct: 397 PVIGQFSSIGSMGADESKWLCSEFKESLVTLGKESRT-LGSAAPLHLIYPSVENVRTSLE 455
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+
Sbjct: 456 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYLRPSPDFSQIAWFLV 515
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+K+ +QLMIRSYELGVL LPSA F S V + SGS++
Sbjct: 516 TSANLSKAAWGALEKSGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGSSQ- 568
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
PVPY+LPP+ Y D PW W+ Y K D +G
Sbjct: 569 ----------------------EPTASFPVPYDLPPEVYGDRDRPWIWNIPYVKAPDTHG 606
Query: 622 QVW 624
+W
Sbjct: 607 NMW 609
>gi|348675737|gb|EGZ15555.1| hypothetical protein PHYSODRAFT_505563 [Phytophthora sojae]
Length = 1258
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 161/398 (40%), Positives = 222/398 (55%), Gaps = 55/398 (13%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIP 217
D F L ++ PA N S+ D+++GD +L+NYM D+ WL CP L +P
Sbjct: 27 DARECAFHLTCLKNAPAAPNVHTKSLGDLLEGDFSRCLLTNYMYDLPWLFAECPRLRDVP 86
Query: 218 HVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANL 277
VL++HGE D + + AN PPLPI++GTHH+K ++ +YP VR+ + TAN
Sbjct: 87 -VLLVHGERDRQGMMKECREYANVTPVAPPLPIAYGTHHTKMLVALYPEKVRVAIFTANF 145
Query: 278 IHVDWNNKSQGLWMQDFPLKDQNNLSEE------------CGFENDLIDYLSTLKWPEFS 325
+ DWN K+QG+W QDF LK + +E FE DL+ YLS+L
Sbjct: 146 LSNDWNTKTQGVWFQDFGLKVLDGSEDEEKDAVADNSTAINDFEADLVHYLSSLG----- 200
Query: 326 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEK 385
K+ +F+FS+A V L+ SVPG H G ++K+GH+++R
Sbjct: 201 ------AQVKLFCGELMRFDFSAARVALVPSVPGVHKGKDMEKYGHLRVR---------- 244
Query: 386 GFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSE-DKTPLGIGEPLIVWPTVEDVRCS 443
+LGSLDEKW+ E + SM G T + + I+WP+V+DVR S
Sbjct: 245 ------------NLGSLDEKWLFGEFAESMLPGKKNVSPTSMPVQALHIIWPSVDDVRNS 292
Query: 444 LEGYAAGNAIPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAMPHIKTFARYN-----GQKL 497
LEG+ +G +IP P KN+ K FL KY KW R AMPHIK++AR+N +L
Sbjct: 293 LEGWNSGRSIPCPLKNM-KPFLHKYLRKWTPPEELHRQNAMPHIKSYARFNPSDEKAGEL 351
Query: 498 AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
W ++TS+NLSKAAWGALQKN +QLMIRSYELGV+ LP
Sbjct: 352 DWVIVTSSNLSKAAWGALQKNKTQLMIRSYELGVMFLP 389
>gi|113931582|ref|NP_001039242.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus (Silurana) tropicalis]
gi|89273341|emb|CAJ81457.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus (Silurana) tropicalis]
Length = 597
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 181/518 (34%), Positives = 278/518 (53%), Gaps = 60/518 (11%)
Query: 127 LSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDV 186
+ SKK+++ E + K ++ + + + P F L +V G+ N+ + I+D+
Sbjct: 117 VQSKKIQENIEVKQKKCKTPSDSQDTWDLLQAGEPFRFYLTKVMGIKPKYNSGALHIKDI 176
Query: 187 IQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK-PANWI 242
+ G ++ + NY DI WL+ P + +L++HGE + + + P I
Sbjct: 177 LSPLFGTLVSSAQFNYCFDIKWLVKQYPEEFRDKPLLIVHGEKRESKAKLHEDAHPYEHI 236
Query: 243 -LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
L + L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW K+QG+W+ +
Sbjct: 237 RLCQAKLDIAFGTHHTKMMLLLYTEGLRVVIHTSNLIHEDWYQKTQGIWLSPLYPRLPEG 296
Query: 302 LSEECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
S G F +DL+ YL++ P + K+ + S V LI S
Sbjct: 297 ASVSAGESSTNFRSDLVAYLASYNSPSLREWM----------DIIKQHDLSETRVYLIGS 346
Query: 357 VPGYHTGSSLKKWGHMKLRTVLQECTFEK-GFKKSPLVYQFSSLGSL---DEKWM-AELS 411
PG G+ KWGH +LR +L+E T G + P++ QFSS+GS+ KW+ +E +
Sbjct: 347 TPGRFQGNDKDKWGHFRLRKLLRENTSAAPGQETWPVIGQFSSIGSMGVDKTKWLCSEFT 406
Query: 412 SSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYW 469
S+++ K+ PL +++P+V++VR SLEGY AG ++P S Q + +L Y+
Sbjct: 407 ESLTTLGKSIKSLQKTEIPLHLIYPSVDNVRTSLEGYPAGGSLPYSIQTAQKQPWLHSYF 466
Query: 470 AKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSY 527
KWKA + RS+AMPHIKT+ R + Q LAWFL+TSANLSKAAWG+L+KN +QL IRSY
Sbjct: 467 HKWKAETSRRSQAMPHIKTYMRLSPDSQHLAWFLVTSANLSKAAWGSLEKNGAQLFIRSY 526
Query: 528 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 587
ELGVL LPSA ET+ V L + S++ +++
Sbjct: 527 ELGVLFLPSA----------------------FETNTFN----VKLNIYASNEPSSNA-- 558
Query: 588 VYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y ++D PW W+ Y D +G +W
Sbjct: 559 --FPVPYDLPPEHYGAKDRPWVWNIPYVNAPDTHGNIW 594
>gi|410911974|ref|XP_003969465.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Takifugu rubripes]
Length = 614
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 177/482 (36%), Positives = 265/482 (54%), Gaps = 65/482 (13%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +V GL NT + IRD++ G + ++ NY DI W++ P + VL
Sbjct: 174 FYLNKVTGLDRKYNTGALHIRDILSPLFGTLKASVQFNYCFDIAWMVKQYPEEFRDRPVL 233
Query: 221 VIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANL 277
++HG E+ L + P + + L I+FGTHH+K MLL Y G R+IV T+NL
Sbjct: 234 IVHGDKREAKARLVQQAQGFP-HIQFCQAKLDIAFGTHHTKMMLLWYEEGFRVIVLTSNL 292
Query: 278 IHVDWNNKSQGLWMQD-FP----LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG 332
I DW K+QG+WM FP ++ F+ DL++YL++ + PE +
Sbjct: 293 IRADWYQKTQGMWMSPLFPRLPEGSSASSGESPTYFKRDLLEYLASYRAPELEEWI---- 348
Query: 333 NFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE-KGFKKSP 391
K+ + S +V L+ S PG + GS +++WGH++LR +L E T G ++ P
Sbjct: 349 ------QRIKEHDLSETSVYLVGSTPGRYVGSDMERWGHLRLRKLLSEHTEAFPGEERWP 402
Query: 392 LVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEG 446
++ QFSS+GS+ KW+A E +M++ K+ + P+ +++P++EDVR SLEG
Sbjct: 403 VIGQFSSIGSMGLDKTKWLAGEFQRTMTT---MGKSTVRSDPPMQLLYPSIEDVRTSLEG 459
Query: 447 YAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLT 503
Y AG ++P + K +L ++ +WKA TGRS AMPHIKT+ R N +LAWF +T
Sbjct: 460 YPAGGSLPYSIQTAQKQLWLHSFFHRWKADSTGRSHAMPHIKTYMRVSPNFTELAWFFMT 519
Query: 504 SANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 563
SANLSKAAWGAL+KNN+Q+MIRSYELGVL +PSA + +T
Sbjct: 520 SANLSKAAWGALEKNNTQMMIRSYELGVLFVPSAFK--------------------MKTF 559
Query: 564 QIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQ 622
+ K+ + +SS PVP++LPP YS +D PW W+ Y++ D +G
Sbjct: 560 PVNKSPFLV----------SSSSFSGFPVPFDLPPTAYSPKDQPWIWNIPYSQAPDTHGN 609
Query: 623 VW 624
+W
Sbjct: 610 IW 611
>gi|340383155|ref|XP_003390083.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Amphimedon
queenslandica]
Length = 535
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 179/485 (36%), Positives = 262/485 (54%), Gaps = 70/485 (14%)
Query: 161 PSTFRLLRVQGLPAWANTS--CVSIRDVI---QGDIIVAILSNYMVDIDWLLPACPVLAK 215
P+ F L +V+G+P N V I+D++ G++I + NYM DI WLL P +
Sbjct: 97 PTLFYLTKVRGIPDRYNDPRYTVGIKDILSSTHGNLIGSAQFNYMFDIKWLLDQYPEDKR 156
Query: 216 IPHVLVIHGESDGTLEHMKRNK--PANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVH 273
+L++HG E ++ + N L + L + FGTHHSK MLL Y G+R+++H
Sbjct: 157 SLPLLIVHGFQGREFESLRMDSLPHPNIKLLQAKLDL-FGTHHSKMMLLSYNEGLRVVIH 215
Query: 274 TANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN 333
TANLI DW+ K+QG+WM P+ ++ + C F++DL+ YL T ++
Sbjct: 216 TANLIQKDWDQKTQGVWMS--PVFPKSTVKRSCKFQDDLLSYLDT-----YTGAAMNEWK 268
Query: 334 FKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSP 391
K+ K + SS +IASVPG HTG ++ KWGHMKLR VL+E + K P
Sbjct: 269 EKV-----KSHDMSSCRAHIIASVPGPHTGLNIFKWGHMKLRKVLEEHGPSASTTTKDWP 323
Query: 392 LVYQFSSLGSL--------DEKWMAELSSSMSSGFSED-KTPLGIGEPLIVWPTVEDVRC 442
++ QFSS+GSL +W+ LSS +G + ++ + G+ +V+PTVE+++
Sbjct: 324 VIGQFSSIGSLGPAPSSWLTSEWLTSLSSCWKTGTVKTLRSEIPKGKLQLVFPTVENIKN 383
Query: 443 SLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAW 499
SLEGY AG ++P + Q + + +L ++ +W A GRSRA PHIKT+ R + +LAW
Sbjct: 384 SLEGYMAGGSVPYASQTALKQPYLTTFFNQWVAEGYGRSRASPHIKTYMRVSPTCDRLAW 443
Query: 500 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 559
FLLTSANLSKAAWG +K +QL IRSYE+GVL+LP + +SG+
Sbjct: 444 FLLTSANLSKAAWGGFEKKGTQLRIRSYEIGVLLLP------------------DDESGT 485
Query: 560 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 619
+ +SS LP+P +LP Y + D PW W+ RY D
Sbjct: 486 LMVGE------------------SSSNNSMLPIPIDLPLTDYKTTDRPWIWNDRYLAPDC 527
Query: 620 YGQVW 624
G VW
Sbjct: 528 KGNVW 532
>gi|432115827|gb|ELK36975.1| Tyrosyl-DNA phosphodiesterase 1 [Myotis davidii]
Length = 610
Score = 278 bits (712), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 185/489 (37%), Positives = 262/489 (53%), Gaps = 71/489 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 165 PFQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVRQYPPEFRKK 224
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 225 PILLVHGDKREAKAHLHAEAKPYPNVSLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 284
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLIDYLSTLKWP---EFSA 326
NLI DW+ K+QG+W+ PL + + F+ DLI YL P E+
Sbjct: 285 NLIREDWHQKTQGMWVS--PLYPRMAHGTPGSGESTTHFKADLISYLMAYNAPPLQEWVD 342
Query: 327 NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFE 384
+ AH + S V LI S PG G+ WGH +LR VL+E +
Sbjct: 343 VIHAH-------------DLSETNVYLIGSTPGRFQGNQKDNWGHFRLRKVLKEHASSIP 389
Query: 385 KGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVED 439
K + P++ QFSS+GS+ + KW+ AE ++ + E + P PL +++P+VE+
Sbjct: 390 KA-EAWPVIGQFSSIGSMGADESKWLCAEFKETLVTLGKESRAPGRSPAPLHLIYPSVEN 448
Query: 440 VRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQK 496
VR SLEGY AG ++P S Q + +L Y+ KW A +GRS AMPHIKT+ R + +
Sbjct: 449 VRTSLEGYPAGGSLPYSIQTAEKQSWLHAYFHKWSAETSGRSNAMPHIKTYMRPSPDFSQ 508
Query: 497 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 556
+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V +
Sbjct: 509 IAWFLVTSANLSKAAWGALEKNGAQLMIRSYELGVLFLPSA------FGLDSFRVKPKFF 562
Query: 557 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK 616
SGS E + PVPY+LPP+ Y S+D PW W+ Y K
Sbjct: 563 SGSQEPT------------------------ASFPVPYDLPPELYGSKDRPWIWNIPYVK 598
Query: 617 K-DVYGQVW 624
D +G +W
Sbjct: 599 APDTHGNMW 607
>gi|395503746|ref|XP_003756224.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Sarcophilus harrisii]
Length = 612
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 181/504 (35%), Positives = 272/504 (53%), Gaps = 60/504 (11%)
Query: 141 ENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILS 197
E+ +EA ++++ +K F L +V G+ N+ + I+D++ G ++ +
Sbjct: 146 EDDVTFDEAQESWNLLDEKNLFRFYLTKVSGILPKYNSGALHIKDILSPLFGTLLSSAQF 205
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTH 255
NY ++DWL+ P+ + +L++HG+ + ++ KP N L + L I+FGTH
Sbjct: 206 NYCFEVDWLVRQYPLEFRKKPILLVHGDKREAKARLQEKAKPYENISLCQAKLDIAFGTH 265
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 310
H+K MLL+Y G+R+++HT+NLI DW+ K+QG+W+ P + E F++
Sbjct: 266 HTKMMLLLYEEGLRVVIHTSNLIQADWHQKTQGIWLSPLYPRLPYGTPSTHGESSTNFKS 325
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
DLI YL P + +K + S V LI S PG G ++ WG
Sbjct: 326 DLISYLMAYNAPPLKEWI----------DIVQKHDLSETRVYLIGSTPGRFQGKHIEDWG 375
Query: 371 HMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPL 425
H +LR +L+E T ++S P+V QFSS+GSL + KW+ +E S+ + K
Sbjct: 376 HFRLRKLLKEHTSLLPEQQSWPIVGQFSSIGSLGADESKWLCSEFKDSLVILGNHGKNQG 435
Query: 426 GIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAM 483
PL +++PTVE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AM
Sbjct: 436 QHNVPLHLIYPTVENVRNSLEGYPAGGSLPYSLQTAEKQVWLHSYFHKWSAETSGRSNAM 495
Query: 484 PHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 541
PHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 496 PHIKTYMRLSPDFAKMAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA---- 551
Query: 542 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 601
F + + ++ S E + PVPY+LPP+ Y
Sbjct: 552 --FGMDTFKIKRKVFSEKQEPA------------------------TSFPVPYDLPPEIY 585
Query: 602 SSEDVPWSWDKRYTKK-DVYGQVW 624
+S+D PW W+ Y K D +G +W
Sbjct: 586 NSKDRPWIWNIPYVKAPDTHGNMW 609
>gi|148237298|ref|NP_001087094.1| tyrosyl-DNA phosphodiesterase 1 [Xenopus laevis]
gi|49903395|gb|AAH76790.1| Tdp1-prov protein [Xenopus laevis]
Length = 597
Score = 275 bits (703), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 176/484 (36%), Positives = 258/484 (53%), Gaps = 60/484 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L +V G+ N+ + I+D++ G ++ + NY DI+WL+ P +
Sbjct: 151 PFRFYLTKVTGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDIEWLVKQYPEEFRNK 210
Query: 218 HVLVIHGESDGTLEHMKRNK-PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HGE + + + P I L + L I++GTHH+K MLL+Y G+R+++HT+
Sbjct: 211 PLLIVHGEKRESKTKLHEDAHPYEHIRLCQAKLDIAYGTHHTKMMLLLYTEGLRVVIHTS 270
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLIDYLSTLKWPEFSANLPA 330
NLI DW K+QG+W+ + S G F +DLI YL++ P +
Sbjct: 271 NLIREDWYQKTQGIWLSPLYPRLPEGASVSAGESSTNFRSDLIAYLASYNSPSLREWM-- 328
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 390
K+ + S V LI S PG G KWGH +LR +L+E T K+
Sbjct: 329 --------DIIKQHDLSETRVYLIGSTPGRFQGKDKDKWGHFRLRKLLRENTSAGPDKEM 380
Query: 391 -PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P++ QFSS+GS+ KW+ +E + S+ + K+ PL +++P+V++VR SL
Sbjct: 381 WPVIGQFSSIGSMGVDKTKWLCSEFTESLKTLGKSIKSLQKSEIPLRLIYPSVDNVRTSL 440
Query: 445 EGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFL 501
EGY AG ++P S Q + +L Y+ KWKA +GRS+A+PHIKT+ R+ + Q LAWFL
Sbjct: 441 EGYPAGGSLPYSIQTAQKQPWLHSYFHKWKAETSGRSQAIPHIKTYMRFSPDFQNLAWFL 500
Query: 502 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
+TSANLSKAAWG+L+KN +QL IRSYELGVL LPSA F+ NI SG+
Sbjct: 501 VTSANLSKAAWGSLEKNGAQLFIRSYELGVLFLPSAFDTNT-FNVKVNIYSHNEPSGNA- 558
Query: 562 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVY 620
PVPY+LPP+ Y S+D PW W+ Y D +
Sbjct: 559 ----------------------------FPVPYDLPPEHYGSKDRPWVWNIPYVNAPDTH 590
Query: 621 GQVW 624
G +W
Sbjct: 591 GNIW 594
>gi|427789081|gb|JAA59992.1| Putative tyrosyl-dna phosphodiesterase [Rhipicephalus pulchellus]
Length = 614
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 177/481 (36%), Positives = 265/481 (55%), Gaps = 73/481 (15%)
Query: 169 VQGLPAWANTSCV--SIRDVIQGDIIVAILS---NYMVDIDWLLPACPVLAKIPHVLVIH 223
V G+PA NT+ + S+RD++ D+ + S NY DI WL+ P + +LV+H
Sbjct: 173 VTGIPARYNTAQIARSVRDLLSPDMGRLVRSAQFNYCFDIPWLVEQYPTEFRNLPLLVVH 232
Query: 224 GESDGTLEHMKRNKPANWILH----KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 279
GE + ++ + A+ H + L I +GTHH+K MLL+Y G+R+++HTAN+I
Sbjct: 233 GEQREAKKALETS--ASGFQHVSFAQAKLEIVYGTHHTKMMLLLYKEGLRVVIHTANMIP 290
Query: 280 VDWNNKSQGLWMQDFPLK---DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
DW K+Q +W+ + N E GF DL++YLS A+G+ I
Sbjct: 291 TDWAQKTQAIWVGPVCPRLAPGSNGGDSETGFRADLLNYLS------------AYGDTHI 338
Query: 337 NP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PL 392
N + + +FS+ V L+ SVPG HTG +GH++LR +L + K + PL
Sbjct: 339 NEWCHYIRTHDFSAVKVFLVGSVPGRHTGPRKSCFGHLRLRNLLSQHGPSKDLVSNHWPL 398
Query: 393 VYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGY 447
V QFSS+GSL E W+ E SS+S+ T + PL +V+P+V+DVRCSLEGY
Sbjct: 399 VAQFSSIGSLGASAESWLLGEFLSSLSTTKGSVVTARSV--PLKLVFPSVDDVRCSLEGY 456
Query: 448 AAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTS 504
AG +IP DK +L ++ +WK+ GR+ A PHIKT+ R + +++AW L+TS
Sbjct: 457 PAGASIPYSIVTADKQRWLDSFFHRWKSERLGRTAASPHIKTYTRLSPSSKQIAWLLVTS 516
Query: 505 ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 564
ANLSKAAWGAL+KN SQLMIRSYELG+L+ P+ F + V SE +G++
Sbjct: 517 ANLSKAAWGALEKNGSQLMIRSYELGILLFPA------NFGQATTFVVSEGANGNS---- 566
Query: 565 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDVYGQV 623
++LP+PY++P Y+ +D PW+WD ++ + D +G +
Sbjct: 567 ----------------------ALFLPLPYDVPLVPYTKDDEPWTWDSQHRELPDRFGNM 604
Query: 624 W 624
W
Sbjct: 605 W 605
>gi|20150581|pdb|1JY1|A Chain A, Crystal Structure Of Human Tyrosyl-Dna Phosphodiesterase
(Tdp1)
Length = 464
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 183/483 (37%), Positives = 259/483 (53%), Gaps = 59/483 (12%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 19 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 78
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K LL+Y G+R+++HT+
Sbjct: 79 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKXXLLLYEEGLRVVIHTS 138
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P + D + S E F+ +LI YL+ P +
Sbjct: 139 NLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKANLISYLTAYNAPSLKEWI--- 195
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K + S V LI S PG GS WGH +L+ +L++ +S
Sbjct: 196 -------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSXPNAESW 248
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GSL + KW+ +E S + E KTP PL +++P+VE+VR SLE
Sbjct: 249 PVVGQFSSVGSLGADESKWLCSEFKESXLTLGKESKTPGKSSVPLYLIYPSVENVRTSLE 308
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS A PHIKT+ R + K+AWFL+
Sbjct: 309 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAXPHIKTYXRPSPDFSKIAWFLV 368
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTET 562
TSANLSKAAWGAL+KN +QL IRSYELGVL LPSA S V + +GS E
Sbjct: 369 TSANLSKAAWGALEKNGTQLXIRSYELGVLFLPSA------LGLDSFKVKQKFFAGSQEP 422
Query: 563 SQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYG 621
PVPY+LPP+ Y S+D PW W+ Y K D +G
Sbjct: 423 XAT------------------------FPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHG 458
Query: 622 QVW 624
W
Sbjct: 459 NXW 461
>gi|395746171|ref|XP_003778400.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Pongo abelii]
Length = 589
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 180/487 (36%), Positives = 270/487 (55%), Gaps = 41/487 (8%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDGA +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGAAQRTENHGPPT 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSRALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPQIVDGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPDAESWPVVGQFSSIGSLGSDESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E+KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKENKTPGKTSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSA 537
VL LPSA
Sbjct: 541 VLFLPSA 547
>gi|426377770|ref|XP_004055628.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Gorilla gorilla
gorilla]
Length = 608
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 196/582 (33%), Positives = 290/582 (49%), Gaps = 86/582 (14%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHK 245
G ++ + NY D+DWL+ P + +L++HG+ H+ K
Sbjct: 191 PLFGMLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQA-------K 243
Query: 246 PPLPISFGTHHS---------KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP 295
P IS K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P
Sbjct: 244 PYENISLCQLSEIGKRFLLCEKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYP 303
Query: 296 -LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVR 352
+ D + S E F+ DLI YL P + K + S V
Sbjct: 304 RIADGTHKSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVY 353
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM- 407
LI S PG GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+
Sbjct: 354 LIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLC 413
Query: 408 AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFL 465
+E SM + E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L
Sbjct: 414 SEFKESMLTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWL 473
Query: 466 KKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLM 523
Y+ KW A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLM
Sbjct: 474 HSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLM 533
Query: 524 IRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGA 583
IRSYELGVL LPSA F S V + +GS E
Sbjct: 534 IRSYELGVLFLPSA------FGLDSFKVKQKFFAGSQEP--------------------- 566
Query: 584 SSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
+ PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 567 ---MATFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 605
>gi|397525721|ref|XP_003832804.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 isoform 3 [Pan paniscus]
Length = 589
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 179/487 (36%), Positives = 268/487 (55%), Gaps = 41/487 (8%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQRQAEKVVIKKEKDISAPNDGTAQRTENHGPPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSIGSLGADESKWLCSEFEESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSA 537
VL LPSA
Sbjct: 541 VLFLPSA 547
>gi|79154014|gb|AAI07878.1| TDP1 protein [Homo sapiens]
Length = 589
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 179/487 (36%), Positives = 268/487 (55%), Gaps = 41/487 (8%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 75 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 134
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 135 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 190
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM-KRNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 191 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 250
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 251 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 310
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 311 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 360
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 361 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 420
Query: 415 SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
+ E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW
Sbjct: 421 LTLGKESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKW 480
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A +GRS AMPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELG
Sbjct: 481 SAETSGRSNAMPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELG 540
Query: 531 VLILPSA 537
VL LPSA
Sbjct: 541 VLFLPSA 547
>gi|452821653|gb|EME28681.1| tyrosyl-DNA phosphodiesterase 1 [Galdieria sulphuraria]
Length = 452
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 166/457 (36%), Positives = 244/457 (53%), Gaps = 50/457 (10%)
Query: 182 SIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANW 241
S+ ++ Q +L+NYM D+ WL P+L + +L++HG+ + + P ++
Sbjct: 27 SLDEIFQPGFHSVLLTNYMFDLSWLFQRVPILLTVERLLIVHGDE----QVYQPFSPYHF 82
Query: 242 I-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
I HKP LP +GTHH+K ++L YP VR ++ TAN+I DW K+QG++++DFP K
Sbjct: 83 ITFHKPRLPFPYGTHHTKLIILFYPTKVRFVLTTANMIQSDWEYKTQGMFLKDFPQKTGE 142
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
+ C F + DYLS L P + S +++FS A V LI SVPGY
Sbjct: 143 --LKSCPFLETMDDYLSALGEP-----------LRYYRSLLCQYDFSKAGVVLIPSVPGY 189
Query: 361 HTGSSLKKWGHMKLRT-VLQECTF--EKGFKKSP------LVYQFSSLGSLDEKWM-AEL 410
H G +L K+GH L + + Q C E+ ++ L+ Q SS+GS+ EKW+ EL
Sbjct: 190 HGGRNLDKYGHRSLHSNISQYCCISDEQRIRRKTTHSTIRLLLQCSSMGSISEKWLKQEL 249
Query: 411 SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWA 470
SM S + + E ++WP+V+ VR S++GYA+G A P +KN + F +
Sbjct: 250 FHSMVSSCWKQEDWQYCFEWDLIWPSVQQVRNSIQGYASGAAFPWTKKNY-RSFQSSHLC 308
Query: 471 KWKASHTGRSRAMPHIKTFARY-NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYEL 529
W A R+ +PH+K++ Y + WFLLTSANLS AAWG L +N SQL IRSYEL
Sbjct: 309 LWNAYFFRRNAWLPHMKSYMAYEESGNIFWFLLTSANLSTAAWGRLVRNQSQLFIRSYEL 368
Query: 530 GVLILPSAKRHGCGFSC-TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 588
GVL P C ++C N++ ++ + TS + K ++ +
Sbjct: 369 GVLWTPML----CSYTCPMDNVI--QLTTPQHITSYYPREK-------------NNNILF 409
Query: 589 YLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 625
LP+P++LPPQ Y S D PW WD Y D G VWP
Sbjct: 410 CLPLPFQLPPQHYDSNDSPWLWDAIYKSPDRLGNVWP 446
>gi|281340418|gb|EFB16002.1| hypothetical protein PANDA_009635 [Ailuropoda melanoleuca]
Length = 388
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 171/421 (40%), Positives = 235/421 (55%), Gaps = 56/421 (13%)
Query: 219 VLVIHGESDGTLEHMK-RNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTAN 276
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+N
Sbjct: 6 ILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSN 65
Query: 277 LIHVDWNNKSQGLWMQDF--PLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHG 332
LIH DW+ K+QG+W+ P+ + S E F+ DLI YL P +
Sbjct: 66 LIHADWHQKTQGIWLSPLYPPIIHGTHRSGESTTHFKADLISYLMAYNAPSLKEWI---- 121
Query: 333 NFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPL 392
+ + S V LI S PG GS WGH +LR +L+E KG + P+
Sbjct: 122 ------DIIHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASPKG-ESWPV 174
Query: 393 VYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGY 447
V QFSS+GS+ D KW+ +E S+ + E +TP PL +++P+VE+VR SLEGY
Sbjct: 175 VGQFSSIGSMGADDSKWLCSEFKESLVTLGKESQTPGRSAVPLHLIYPSVENVRTSLEGY 234
Query: 448 AAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTS 504
AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+TS
Sbjct: 235 PAGGSLPYSIQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYMRPSPDFSEIAWFLVTS 294
Query: 505 ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 564
ANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + GS E +
Sbjct: 295 ANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFFGSKEPAA 348
Query: 565 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQV 623
PVPY+LPP+ Y S+D PW W+ YTK D +G +
Sbjct: 349 A------------------------FPVPYDLPPELYGSKDRPWIWNIPYTKAPDTHGNM 384
Query: 624 W 624
W
Sbjct: 385 W 385
>gi|26329523|dbj|BAC28500.1| unnamed protein product [Mus musculus]
gi|148686960|gb|EDL18907.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_a [Mus musculus]
Length = 579
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 168/413 (40%), Positives = 237/413 (57%), Gaps = 40/413 (9%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWF 500
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R + KLAWF
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 553
L+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA SNIVP+
Sbjct: 512 LVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA--------FVSNIVPA 556
>gi|431839199|gb|ELK01126.1| Tyrosyl-DNA phosphodiesterase 1 [Pteropus alecto]
Length = 709
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 163/395 (41%), Positives = 234/395 (59%), Gaps = 28/395 (7%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 163 PFQFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKK 222
Query: 218 HVLVIHGESDGTLEHM-KRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 223 PILLVHGDKREAKAHLHAEAKPYGNISLCQAKLEIAFGTHHTKMMLLLYEEGLRVVIHTS 282
Query: 276 NLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAH 331
NLI DW+ K+QG+W+ +P + N S E F+ DL+ YL + N PA
Sbjct: 283 NLIRADWHQKTQGIWLSPLYPRIAPGTNTSGESTTHFKADLVSYL-------MAYNAPA- 334
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K ++ + S V LI S PG GS WGH +LR +L+E +S
Sbjct: 335 --LKEWIDVIQEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLRKLLKEHASSIPKAESW 392
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
P+V QFSS+GS+ + KW+ +E ++++ E KTP PL +++P+VE+VR SLE
Sbjct: 393 PVVGQFSSIGSMGADESKWLCSEFKETLATLGRESKTPGKSAVPLHLIYPSVENVRTSLE 452
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLL 502
GY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+
Sbjct: 453 GYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSQIAWFLV 512
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 537
TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 513 TSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA 547
Score = 45.8 bits (107), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 581 AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
+G+ PVPY+LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 662 SGSQEPAASFPVPYDLPPELYGSKDRPWIWNIPYVKAPDTHGNMW 706
>gi|302790465|ref|XP_002977000.1| hypothetical protein SELMODRAFT_416931 [Selaginella moellendorffii]
gi|300155478|gb|EFJ22110.1| hypothetical protein SELMODRAFT_416931 [Selaginella moellendorffii]
Length = 301
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/297 (47%), Positives = 191/297 (64%), Gaps = 35/297 (11%)
Query: 88 VSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSE 147
V I+ GDI++++PG FFK++ L S K + ++ L+S K ++Q E D + +
Sbjct: 24 VQISTGDIVKMLPGDRFFKFM-LCSSLKGKAVASHSDNVLASNKRKRQIEDDEAFARALQ 82
Query: 148 EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ----------GDIIVAILS 197
+ +LLRVQGL WAN CV I DVI+ ++ AILS
Sbjct: 83 Q----------------QLLRVQGLLDWANAGCVRICDVIKVIRALVFLRIRILLFAILS 126
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 257
NYMVDI+WLL ACP+L I V++IHGES+ + ++ KP+N +L KP L I++GT HS
Sbjct: 127 NYMVDIEWLLSACPLLRTILQVVMIHGESN--VSQLQSVKPSNRLLFKPRLWIAYGTPHS 184
Query: 258 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLS 317
LL+YP GV+++VHTANLI++DWNNK+QGLWMQDFP K + S+ FENDL+DYL+
Sbjct: 185 ---LLVYPTGVQVVVHTANLINIDWNNKNQGLWMQDFPFKSKTGASD---FENDLVDYLT 238
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 374
L+W + ++ HG KIN F+ F FS+AAVRL+ASVPGYH+G L KWGHMKL
Sbjct: 239 ALEWLGCTVDVQHHGKMKINVGHFRNFYFSNAAVRLVASVPGYHSGPQLNKWGHMKL 295
>gi|195997043|ref|XP_002108390.1| hypothetical protein TRIADDRAFT_19546 [Trichoplax adhaerens]
gi|190589166|gb|EDV29188.1| hypothetical protein TRIADDRAFT_19546 [Trichoplax adhaerens]
Length = 569
Score = 258 bits (660), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 189/552 (34%), Positives = 283/552 (51%), Gaps = 89/552 (16%)
Query: 98 LIPGHHFFKYVTLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSR 157
L+ G + VT S +K S+D K QD+ + + +CN +
Sbjct: 63 LLVGEANSREVTESPRKKLKSHDVRVEQPRVETKEHSQDQAE-------PDQMCNKY--- 112
Query: 158 DKLPSTFRLLRVQGLPAWAN--TSCVSIRDVI---QGDIIVAILSNYMVDIDWLLPACPV 212
++ L +V+GL N TS + IR+++ + ++I +I NYM D+ WLL P
Sbjct: 113 -----SYYLSKVRGLNNNYNSRTSSIHIREILALEKSELISSIQFNYMFDVSWLLDQYPE 167
Query: 213 LAKIPHVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVR 269
+ VL++HG +S LE + P N H+ L +++GTHHSK M L+Y G+R
Sbjct: 168 DYRKNPVLIVHGYSGQSRNNLEQQGQPFP-NVKFHQAKLEMAYGTHHSKMMFLLYSNGLR 226
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDFPL----KDQNNLSEECGFENDLIDYLSTLKWPEFS 325
I++HTANLI DW ++QG+W+ L K + N++++ GF+ DL+DY+++
Sbjct: 227 IVIHTANLIPQDWGRRTQGIWISPLFLKRSDKSEMNIADDTGFKQDLLDYVASYG----- 281
Query: 326 ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEK 385
PA ++ S + + SS V LIASVPG H G ++ KWGH+KLR +L+ K
Sbjct: 282 ---PALFEWR---SRIMEHDMSSVNVFLIASVPGRHAGKNIDKWGHLKLRKILKRNGPSK 335
Query: 386 GFKKS--PLVYQFSSLGSLDEK---WM-AELSSSMSSGFSEDKTPLG--IGEPLIVWPTV 437
+ P + QFSS+GSL K W+ +E +S+SS + + LG + +++P+V
Sbjct: 336 DDVSANWPAICQFSSIGSLGSKRDAWLYSEFRTSLSSTSTTRLSQLGERKADVKLIFPSV 395
Query: 438 EDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NG 494
E+VR LEGY G+ +P + +K +L W A TGR RA PHIKT+ R +
Sbjct: 396 ENVRNCLEGYKGGSCLPYNRGTANKQPWLNSLLHNWAAKKTGRHRASPHIKTYTRVSPDN 455
Query: 495 QKLAWFLLTS--ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVP 552
+LAWFL+T ANLSKAAWG ++KN +QLMIRSYE+GVL LP G F
Sbjct: 456 TELAWFLITRQVANLSKAAWGTMEKNETQLMIRSYEIGVLFLPKQFGDGKTF-------- 507
Query: 553 SEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDK 612
KT + W +PY+LP Y +D PW+WD
Sbjct: 508 --------------KTCDLKTNW---------------LIPYDLPLIPYGLQDSPWTWDT 538
Query: 613 RYTKKDVYGQVW 624
+ + D +G W
Sbjct: 539 PHLEPDTHGAQW 550
>gi|405964823|gb|EKC30268.1| Tyrosyl-DNA phosphodiesterase 1 [Crassostrea gigas]
Length = 461
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 170/485 (35%), Positives = 256/485 (52%), Gaps = 64/485 (13%)
Query: 161 PSTFRLLRVQGLPAWANTS-CVSIRDVI---QGDIIVAILSNYMVDIDWLLPACPVLAKI 216
P +F L +V G+ + N + +S+RD++ G++ + NYM +I WL+ P +
Sbjct: 17 PLSFFLTKVYGISSDYNGAYTMSLRDILSESMGNLQESCQFNYMFEIPWLIQQYPASFRQ 76
Query: 217 PHVLVIHGESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L +HG G ++ + K N + L + +GTHH+K M L+Y G+R+++HT
Sbjct: 77 KPLLCVHGFQGGQKAGLEADARKFTNIKFCQAKLEMPYGTHHTKMMFLLYDNGLRVVIHT 136
Query: 275 ANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLIDYLSTLKWPEFSANLP 329
ANLI DW+ K+QG+W+ K ++ S G F+ DL+ Y++ K
Sbjct: 137 ANLIERDWHQKTQGIWISPVFPKLKSGPSPTQGDSPTHFKRDLLQYVAAYK--------- 187
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFK 388
K + + SSA V ++ SVPG H +GHMKLR +L E ++
Sbjct: 188 -AYQLKDWQDHISRHDLSSANVFIVGSVPGRHMAEKKHWFGHMKLRKLLNENGPVKEQAS 246
Query: 389 KSPLVYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSL 444
K P++ QFSS+GSL E W++ E S+++ PL E +++PTV++VR SL
Sbjct: 247 KWPVIGQFSSIGSLGASKENWLSVEFLQSLATVKGTSSVPLAPVEFKLIFPTVDNVRTSL 306
Query: 445 EGYAAGNAIPSPQKNVDKD--FLKKYWAKWKASHTGRSRAMPHIKTFARYNG--QKLAWF 500
EGY AG +IP NV K +L Y+ +WK+ GR+RAMPHIKT+ R + ++ AWF
Sbjct: 307 EGYPAGGSIPY-SINVAKKQPWLHSYFHQWKSEGRGRNRAMPHIKTYCRPSPTWEEAAWF 365
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
L+TS+NLSKAAWGAL+K SQLMIRSYE+GVL +P F C+S +
Sbjct: 366 LVTSSNLSKAAWGALEKKGSQLMIRSYEIGVLFIPKYLVENAVFECSSKV---------- 415
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDV 619
+AG + V +PY+LPP+ Y+ D PW WD + + D
Sbjct: 416 ------------------KEAGQKTFV----LPYDLPPRAYTKSDKPWIWDIAHKELPDS 453
Query: 620 YGQVW 624
G +W
Sbjct: 454 NGNMW 458
>gi|256073128|ref|XP_002572884.1| tyrosyl-DNA phosphodiesterase [Schistosoma mansoni]
Length = 1234
Score = 251 bits (642), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 166/460 (36%), Positives = 254/460 (55%), Gaps = 71/460 (15%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH---GESDGTLEHMKRNKPANWILHK 245
G+++ +I N+M DI WL P + + ++H G+ +L+ K +N +
Sbjct: 819 GELVSSIQFNFMFDIPWLREQYPERFRSLPLTIVHDFQGKMKKSLDE-SVAKYSNIRTCQ 877
Query: 246 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNN 301
+ + +G HH+K M+L Y G++II+HTAN+I DW+ ++QG+WM ++ Q N
Sbjct: 878 ADIRLPYGVHHTKMMMLKYKDGLKIIIHTANMISDDWDRRTQGIWMSPKLKLLSVEQQKN 937
Query: 302 LSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF-----FKKFNFSSAAVRL 353
L++ + F DL++YL + + +L + +P F ++F V L
Sbjct: 938 LNDTDSKTNFRADLLEYLKS-----YGRDLTQSTS---SPLFEWINCLHSYDFRPIKVVL 989
Query: 354 IASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSLDEK----WMA 408
IASV G H G SLKK+GH +L VLQ C + S P++ QFSS+GSL K +
Sbjct: 990 IASVSGRHAGESLKKFGHTRLGEVLQTCNSQ--IPSSWPVIGQFSSIGSLGPKPTDWFTT 1047
Query: 409 ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKK 467
E SSS++ K G+ +++P+VEDVR SLEGY AG +P + +K +L +
Sbjct: 1048 EWSSSLAG-----KGARGL---RMIYPSVEDVRNSLEGYFAGGCLPYTKTTAEKQPWLCQ 1099
Query: 468 YWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIR 525
++ +W+A + SRA PHIK++ R +GQ++ WFLLTSANLSK+AWGA +K+ SQLMIR
Sbjct: 1100 FFYRWQAFN--HSRAAPHIKSYTRMSPDGQQIGWFLLTSANLSKSAWGAYEKSKSQLMIR 1157
Query: 526 SYELGVLILPS-AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGAS 584
SYELGVL LP+ K F EI + + SQ ++
Sbjct: 1158 SYELGVLFLPTNYKESAHSF---------EILKNNAKYSQ-----------------SST 1191
Query: 585 SEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
E++ P+PYELPP +Y S D PW DK ++ D++G++W
Sbjct: 1192 DELLPFPIPYELPPVKYQSNDEPWILDKPHSLPDIFGRIW 1231
>gi|432853024|ref|XP_004067503.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Oryzias latipes]
Length = 614
Score = 251 bits (641), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 172/482 (35%), Positives = 254/482 (52%), Gaps = 68/482 (14%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +V GL NT + IRD++ G + ++ NY DI W++ P + VL
Sbjct: 177 FYLNKVTGLDKKYNTGALHIRDILSPLFGTLKESVQFNYCFDIPWMVQQYPPEFRDRPVL 236
Query: 221 VIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLI 278
++HG+ + + A + + L I+FGTHH+K MLL Y G R+I+ T+NLI
Sbjct: 237 IVHGDKREAKARLLQQAQAFPHVRFCQAKLDIAFGTHHTKMMLLWYEEGFRVIILTSNLI 296
Query: 279 HVDWNNKSQGLWMQDFPLKDQNNLSEECG-----FENDLIDYLSTLKWPEFSANLPAHGN 333
DW K+QG+WM + G F+ DL+DYL++ + PE +
Sbjct: 297 RADWYQKTQGMWMSPLFPRLPAGSGWSAGESPTFFKRDLLDYLTSYRAPELEEWI----- 351
Query: 334 FKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE-KGFKKSPL 392
K+ + S V L+ S PG G +++WGH++LR +L E T G +K P+
Sbjct: 352 -----QRIKEHDLSETRVYLVGSTPGRFVGPDMERWGHLRLRKLLYEHTNPIPGEEKWPV 406
Query: 393 VYQFSSLGSL---DEKWMA-ELSSSMSSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEG 446
+ QFSS+GS+ KW+A E +M++ P +P L+++P VEDVR SLEG
Sbjct: 407 IGQFSSIGSMGLDKTKWLAGEFQRTMTTLGKSSSRP----DPPVLLLYPAVEDVRMSLEG 462
Query: 447 YAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLT 503
Y AG ++P + K +L Y+ +WKA+ TGRS AMPHIKT+ R + +LAWFL+T
Sbjct: 463 YPAGGSLPYSIQTAQKQLWLHGYFHRWKANATGRSHAMPHIKTYMRVSPDFTELAWFLVT 522
Query: 504 SANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 563
LS AWGAL+KNNSQ+M+RSYELGVL +PSA
Sbjct: 523 RCLLS--AWGALEKNNSQVMVRSYELGVLYVPSA-------------------------- 554
Query: 564 QIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQ 622
L T S+ +SS +L VP++LPP Y+++D PW W+ Y+++ D +G
Sbjct: 555 ----FNLKTFPVDKSAFPVSSSSSGFL-VPFDLPPTPYAAKDQPWIWNIPYSQEPDTHGN 609
Query: 623 VW 624
+W
Sbjct: 610 IW 611
>gi|241556145|ref|XP_002399612.1| tyrosyl-DNA phosphodiesterase, putative [Ixodes scapularis]
gi|215499691|gb|EEC09185.1| tyrosyl-DNA phosphodiesterase, putative [Ixodes scapularis]
Length = 624
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 164/479 (34%), Positives = 250/479 (52%), Gaps = 66/479 (13%)
Query: 169 VQGLPAWANTSCV--SIRDVI---QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH 223
V+G+PA N + SI D++ G+++ + NY DI WL+ P + +L++H
Sbjct: 180 VKGIPAIYNAPSIARSIEDILSPNMGELVRSAQFNYCFDIPWLVERYPAEFRNLPLLIVH 239
Query: 224 GESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVD 281
GE ++ + + + + L I +GTHH+K MLL+Y G+R+++HT+NL+ D
Sbjct: 240 GEQRDAKRELEASASSFKHVSFAQAKLEIVYGTHHTKMMLLLYKEGMRVVIHTSNLVESD 299
Query: 282 WNNKSQGLWMQDFPLKDQNNLSEECG---FENDLIDYLSTLKWPEFSANLPAHGNFKINP 338
W K+Q W+ K F DL++YL + +G+ KIN
Sbjct: 300 WAQKTQAAWIGPLCPKASGGAGGGDSATGFRADLLEYLGS------------YGDPKINE 347
Query: 339 --SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLVY 394
+ + +FS+ V L+ SVPG HTG+ +GH+KLR +L K S P +
Sbjct: 348 WCHYLRAHDFSAVKVFLVGSVPGRHTGARKSSFGHLKLRKLLSLHGPPKELVSSYWPAIA 407
Query: 395 QFSSLGSLD---EKWM-AELSSSMSS-GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 449
QFSS+GSL + W+ AE +S+++ TP +V+P+V+DVRCSLEGY A
Sbjct: 408 QFSSIGSLGTGPDNWLRAEFLTSLAAVKGGPPLTPSSTVPVKLVFPSVDDVRCSLEGYPA 467
Query: 450 GNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSAN 506
G +IP +K +L Y+ +W++ GR+ A PH+K++AR + G++ AW L+TSAN
Sbjct: 468 GASIPYSISTANKQRWLDAYFFRWRSGRFGRTHASPHVKSYARLSPSGKQTAWLLVTSAN 527
Query: 507 LSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 566
LSKAAWGA +K+ SQLMIRSYELGVL P Q
Sbjct: 528 LSKAAWGAFEKSGSQLMIRSYELGVLFFPG-----------------------------Q 558
Query: 567 KTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
T T G S AG ++ VP+++P Y +DVPW+WD ++ + D +G +W
Sbjct: 559 FGDARTFTVGGDSMAGKGCLPLF--VPFDVPLTPYGQDDVPWTWDSQHREAPDRFGNMW 615
>gi|428172199|gb|EKX41110.1| hypothetical protein GUITHDRAFT_142267 [Guillardia theta CCMP2712]
Length = 465
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/334 (39%), Positives = 191/334 (57%), Gaps = 15/334 (4%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH 223
F L G+ N V +RDV+QGD++ AI +NYMV WLL +L+ IP V+ ++
Sbjct: 127 FWLFHTDGIEEPGNEQAVRLRDVVQGDVLWAIFTNYMVQERWLLSEIALLSSIPRVVFMY 186
Query: 224 GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWN 283
++ + + PP P +G HHSK MLL Y GVR++V TAN IH D
Sbjct: 187 ---PFLSSLASPPSSSSIVRYAPPTP-QYGVHHSKVMLLGYNTGVRVVVMTANHIHGDHY 242
Query: 284 NKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ + LW QDFPLK + E FE+DL+ Y +W LP K++ + ++
Sbjct: 243 DMTDALWAQDFPLKGEGE--ERSEFEDDLVSYFQATQWK--GTTLPC--GSKLDAQYLRR 296
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD 403
++F +A +++ASVPG H G + WGHMK+R +L TF+ F K P+V+Q +S+GSL
Sbjct: 297 YSFKNARAKIVASVPGRHQGEKMHMWGHMKMRRILSRETFDPLFNKCPMVWQCTSIGSLS 356
Query: 404 EKWMAELSSSMSSGFSEDKTPLGIGE--PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVD 461
EKW+ E +SS+ G + + +G E P +WPT+E+VR S +GY G +IP KNV
Sbjct: 357 EKWIEEFTSSLCEGKNTEGKNIGRPEEPPHFIWPTMEEVRTSSKGYTMGESIPGFSKNVH 416
Query: 462 KDFLKKYWAKWKASHTG---RSRAMPHIKTFARY 492
K FL K + +W + + R RAMPHIKT+ R+
Sbjct: 417 KPFLLKMFCRWSSGSSDPQLRRRAMPHIKTWLRF 450
>gi|28071068|emb|CAD61915.1| unnamed protein product [Homo sapiens]
Length = 369
Score = 244 bits (624), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 157/381 (41%), Positives = 212/381 (55%), Gaps = 54/381 (14%)
Query: 258 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEE--CGFENDLI 313
K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI
Sbjct: 26 KMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKADLI 85
Query: 314 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 373
YL P + K + S V LI S PG GS WGH +
Sbjct: 86 SYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFR 135
Query: 374 LRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIG 428
L+ +L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP
Sbjct: 136 LKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESMLTLGKESKTPGKSS 195
Query: 429 EPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 486
PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHI
Sbjct: 196 VPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHI 255
Query: 487 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 544
KT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 256 KTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------F 309
Query: 545 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 604
S V + +GS E + PVPY+LPP+ Y S+
Sbjct: 310 GLDSFKVKQKFFAGSQE------------------------PMATFPVPYDLPPELYGSK 345
Query: 605 DVPWSWDKRYTKK-DVYGQVW 624
D PW W+ Y K D +G +W
Sbjct: 346 DRPWIWNIPYVKAPDTHGNMW 366
>gi|320165079|gb|EFW41978.1| tyrosyl-DNA phosphodiesterase 1 [Capsaspora owczarzaki ATCC 30864]
Length = 622
Score = 243 bits (621), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 160/410 (39%), Positives = 226/410 (55%), Gaps = 50/410 (12%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVI 222
F+L R G+ W N + S+R ++ D+ ++ NYMVD+DWL+ P + + V+
Sbjct: 195 FQLTRAGGINEWFNRNAFSLRQLLSDMDLQSSVQFNYMVDLDWLMTIFPRELQARPMTVV 254
Query: 223 HGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDW 282
HG ++ K + +PPLPI+FGTHH+K M L Y +RI++HTAN+I DW
Sbjct: 255 HGLTESADVLQAAGKKWGKTIIRPPLPIAFGTHHTKMMFLFYSDSMRIVIHTANIIPSDW 314
Query: 283 NNKSQGLWMQ-DFPLK----DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN-FKI 336
K++G+W FPLK Q + S FE L YL+ A+G+ +
Sbjct: 315 YAKTEGVWCSPKFPLKASTAQQASSSTGRAFEQTLNKYLT------------AYGSCIRQ 362
Query: 337 NPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTV-LQECTFEKGFKKSPLVYQ 395
K++FS+A V LIASVPG H G + +WGHM+LR + L + L+ Q
Sbjct: 363 VREQAMKYDFSAANVALIASVPGRHAGLAKSEWGHMQLRKLPLPANVASQPVNTHQLIGQ 422
Query: 396 FSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYA 448
FSS+GSL E W+ +E S S+S+ ++ +P I P +++P+VE+VR SLEGY
Sbjct: 423 FSSIGSLGASPETWLTSEFSVSLSAHKAQGLSP-PIAHPRALRLIFPSVENVRLSLEGYL 481
Query: 449 AGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--------NGQK--- 496
AG A+P K +L +++ W A+ +GR AMPHIK++AR + Q+
Sbjct: 482 AGGALPYRLATHSKQAWLDQFFCTWNATRSGRQHAMPHIKSYARIAVSPKTADSAQQAEA 541
Query: 497 -------LAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILPS 536
L WFLLTSANLSKAAWG LQK + QL IRSYELGVL PS
Sbjct: 542 TDSTNVALGWFLLTSANLSKAAWGTLQKKGTAAEQLEIRSYELGVLFHPS 591
>gi|47220883|emb|CAG03090.1| unnamed protein product [Tetraodon nigroviridis]
Length = 607
Score = 243 bits (621), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 167/454 (36%), Positives = 245/454 (53%), Gaps = 88/454 (19%)
Query: 206 LLPACP--------VLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLP-------- 249
LL ACP L + VL++HG+ KR A + P
Sbjct: 204 LLQACPRRQSPHQWCLRRDRPVLIVHGD--------KREAKARLVQQAQAFPHVQFCQAK 255
Query: 250 --ISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE- 305
I+FGTHH+K MLL Y G R+++ T+NLI DW K+QG+WM FP + + +
Sbjct: 256 LDIAFGTHHTKMMLLWYEEGFRVVILTSNLIRADWYQKTQGMWMSPLFPRLPEGSGARAG 315
Query: 306 ---CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT 362
F+ DL++YL++ + + + ++ + S A+V L+ S PG +
Sbjct: 316 ESPTSFKRDLLEYLASYRAAQLEEWM----------ERIQEHDLSEASVYLVGSTPGRYV 365
Query: 363 GSSLKKWGHMKLRTVLQECT-FEKGFKKSPLVYQFSSLGSL---DEKWMA-ELSSSMSS- 416
G+ +++WGH++LR +L+E T G + P+V QFSS+GS+ KW+A E ++S+
Sbjct: 366 GADMERWGHLRLRKLLREHTETPAGQDRWPVVGQFSSIGSMGLDKSKWLAGEFQHTLSTL 425
Query: 417 GFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWK 473
G S ++ PL L+++P+VEDVR SLEGY AG ++P S Q + +L ++ +W+
Sbjct: 426 GQSSARSDPPL-----LLLYPSVEDVRTSLEGYPAGGSLPYSIQTAQRQLWLHAFFHRWR 480
Query: 474 ASHTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
A TGRS AMPHIKT+ R + +LAWFL+TSANLSKAAWGAL+KNN+Q+MIRSYELGV
Sbjct: 481 ADSTGRSHAMPHIKTYMRASPGYTELAWFLVTSANLSKAAWGALEKNNTQVMIRSYELGV 540
Query: 532 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 591
L LP+A + T + S +SS P
Sbjct: 541 LFLPAA------------------------------FNMKTFPVNTSPFPVSSSSFSGFP 570
Query: 592 VPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
VP++LPP YS +D PW W+ Y++ D +G VW
Sbjct: 571 VPFDLPPTAYSPKDQPWIWNIPYSQAPDTHGNVW 604
>gi|67971950|dbj|BAE02317.1| unnamed protein product [Macaca fascicularis]
gi|67971954|dbj|BAE02319.1| unnamed protein product [Macaca fascicularis]
Length = 343
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 155/379 (40%), Positives = 211/379 (55%), Gaps = 54/379 (14%)
Query: 260 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLIDY 315
MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D + S E F+ DLI Y
Sbjct: 2 MLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIVDGTHKSGESTTHFKADLISY 61
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 375
L P + + + S V LI S PG GS WGH +LR
Sbjct: 62 LMAYNAPSLKEWIDT----------IHEHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLR 111
Query: 376 TVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEP 430
+L++ +S P+V QFSS+GSL + KW+ +E SM + E KTP P
Sbjct: 112 KLLKDHASSIPNAESWPVVGQFSSIGSLGADESKWLCSEFKESMLTLGKESKTPGKSSVP 171
Query: 431 L-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT 488
L +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AMPHIKT
Sbjct: 172 LYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAMPHIKT 231
Query: 489 FARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 546
+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 232 YMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGL 285
Query: 547 TSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDV 606
+ V + +GS E + PVPY+LPP+ Y S+D
Sbjct: 286 DNFKVKQKFFAGSQE------------------------PMATFPVPYDLPPELYGSKDR 321
Query: 607 PWSWDKRYTKK-DVYGQVW 624
PW W+ Y K D +G +W
Sbjct: 322 PWIWNIPYVKAPDTHGNMW 340
>gi|198414495|ref|XP_002123899.1| PREDICTED: similar to tyrosyl-DNA phosphodiesterase 1 [Ciona
intestinalis]
Length = 471
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 155/369 (42%), Positives = 224/369 (60%), Gaps = 36/369 (9%)
Query: 181 VSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK 237
+ I+DV+ G++I ++ NY +D+DWL+ PV + + +IHG G + +
Sbjct: 123 LGIKDVLSEKFGNLIESVQFNYCIDVDWLIQQYPVSCQGKPLTIIHG---GNVS--PNPQ 177
Query: 238 PANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
N L K LP +GTHH+K MLL Y G+R+++ T NL+ DW K+QG WM P+
Sbjct: 178 YPNITLVKVNLP-PYGTHHTKMMLLHYTSGLRVVILTTNLVPQDWGQKTQGFWMS--PIF 234
Query: 298 DQNNLSEECGFENDL-IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
+ ++ F+ ++Y+S+ K + + + + + SSA V LI S
Sbjct: 235 PKTTPTKTSKFKPRFGLEYVSSYK----------NKSLQRWVDHIRSHDMSSANVILIGS 284
Query: 357 VPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL---DEKWMA-ELSS 412
+PG HTG +L WGHM+LR VL+ T +K P++ QFSS+GSL ++KW+ E +
Sbjct: 285 IPGRHTGHNLSTWGHMRLRKVLKNET-KKIDSSWPVIGQFSSIGSLGSSNQKWLCNEWLT 343
Query: 413 SMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWA 470
S+SS T LG PL +++P+V+DVR SLEGY AG +IP S + + +L+ Y
Sbjct: 344 SLSSC---SNTTLGASPPLKLIFPSVDDVRMSLEGYPAGASIPYSRNIALKQPWLRPYLH 400
Query: 471 KWKASHTGRSRAMPHIKTFAR---YNGQ-KLAWFLLTSANLSKAAWGALQKNNSQLMIRS 526
KW A+H GR++A PHIK++AR YN +L WFLLTSANLSKAAWG+L+KNNSQL I+S
Sbjct: 401 KWVATHAGRTQAAPHIKSYARISPYNTNIRLPWFLLTSANLSKAAWGSLEKNNSQLSIKS 460
Query: 527 YELGVLILP 535
YELGVL LP
Sbjct: 461 YELGVLFLP 469
>gi|325180643|emb|CCA15048.1| tyrosylDNA phosphodiesterase putative [Albugo laibachii Nc14]
Length = 489
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 173/479 (36%), Positives = 249/479 (51%), Gaps = 69/479 (14%)
Query: 164 FRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH 223
F L ++GL A N +++ D++ G+ +LSNYM D+ WL+ V + +
Sbjct: 60 FYLTPIKGLSAAQNQYSIALTDLLDGEFTSCLLSNYMYDVPWLMQQYFV------SIFLF 113
Query: 224 GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWN 283
+S ++H + K N P LPI FGTHHSK M++ Y VR+ + TAN + +DWN
Sbjct: 114 WQS---IKH-QCQKYTNIKTIAPYLPIPFGTHHSKMMIIWYAEKVRVAIFTANFLPIDWN 169
Query: 284 NKSQGLWMQDFPLKDQNNLS-------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
NK+QG+W QDF LK + + S E FE DLIDYL + G +
Sbjct: 170 NKTQGIWFQDFGLKSETSASSRTNLWPERIDFEADLIDYL-------IHVDKIHLGELCL 222
Query: 337 NPSFFKKFNFSSAAVRLIASVPGYHTGSS----LKKWGHMKLRTVLQECTFEKGFKKSPL 392
+K++FS+A V L+ASVPG H + + K+GH+++R +LQ T E + PL
Sbjct: 223 T---LEKYDFSTANVALVASVPGTHKNRAIWIDMHKYGHLRMRRLLQ--TLEAWNNEYPL 277
Query: 393 VYQFSSLGSLDEKWMA-ELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGN 451
+ QFSSLGSL E W+ E + S+ + + + P ++WP+ E VR S+EG+ AG
Sbjct: 278 ICQFSSLGSLTEPWLYHEFTESLQAHSTTKQRP----ALHLIWPSAEQVRNSIEGWNAGR 333
Query: 452 AIPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAMPHIKTFARYNGQ----KLAWFLLTSAN 506
AIP P KN+ K FL K+ W RS AMPHIK++A+++ L W LL+S+N
Sbjct: 334 AIPCPLKNM-KPFLHKFLRTWNPPPKLHRSNAMPHIKSYAQFDPTALDGTLRWALLSSSN 392
Query: 507 LSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 566
LS AAWG+ QK +Q MIRS+E+GVL P R+ CT +V
Sbjct: 393 LSSAAWGSYQKQKNQFMIRSFEIGVLFHPKVYRNDK--LCTDPLV--------------- 435
Query: 567 KTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS-EDVPWSWDKRYTKKDVYGQVW 624
V T +D AS + P PY P Q Y + +D PW W+ + D G +
Sbjct: 436 ----VIGT---PADEAASQNAIRFPAPYNFPLQAYDTKQDEPWIWNLAWDLPDSTGACY 487
>gi|443688556|gb|ELT91218.1| hypothetical protein CAPTEDRAFT_131694, partial [Capitella teleta]
Length = 374
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 143/351 (40%), Positives = 206/351 (58%), Gaps = 25/351 (7%)
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH----KPPLPISFG 253
N+ +DI WL+ PV + +LV+HG + +++R A H + L + +G
Sbjct: 2 NFKIDIPWLVAQYPVHHRTKPLLVVHGSTRQEKANLERE--ARLFTHVDLCQAKLEMIYG 59
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQN-NLSEECGFEN 310
THH+K M+L Y GVR+I+HTANLIH DW+ K+QG+WM PL Q+ N F+
Sbjct: 60 THHTKMMILSYVNGVRVIIHTANLIHSDWHQKTQGVWMSPLFPPLAPQSRNGDSPTNFKR 119
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
DL+ Y++ K + + S K+ +FS+A V LIASVPG H+G+SL ++G
Sbjct: 120 DLLQYINAYKSQSLNEWI----------SIIKRHDFSTAKVFLIASVPGRHSGASLNEFG 169
Query: 371 HMKLRTVLQEC-TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGE 429
H+KL+ VL++ K+ P++ QFSS+GSL + LSS + + FS + +
Sbjct: 170 HLKLKKVLRQFGPSSDACKQWPVLAQFSSIGSLGPTPESWLSSELLTSFSATRGSGSQSK 229
Query: 430 PLI--VWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHI 486
P + ++P DVR SLEGY AG ++P K + + +W++ GR++A PHI
Sbjct: 230 PRLHLMYPCRHDVRLSLEGYGAGGSLPYSINTAKKQPWFRTICNRWRSECNGRTKACPHI 289
Query: 487 KTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
KT+ R + LAWF LTSANLSKAAWG L+K SQLM+RSYELGVL LP
Sbjct: 290 KTYLRASPDWHNLAWFTLTSANLSKAAWGMLEKQGSQLMVRSYELGVLFLP 340
>gi|440800948|gb|ELR21974.1| tyrosylDNA phosphodiesterase-related, putative [Acanthamoeba
castellanii str. Neff]
Length = 601
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 162/456 (35%), Positives = 228/456 (50%), Gaps = 92/456 (20%)
Query: 172 LPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLE 231
PA AN + IR +I ++ A++ Y VD+DWL+ CPVL P V +
Sbjct: 231 FPADANQGALGIRQIIPENVERAVIVTYQVDMDWLMRRCPVLPHPPPPNVHY-------- 282
Query: 232 HMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWM 291
+KP W+L +G HH K MLL + + TANLI D+ K+QG+W+
Sbjct: 283 ----HKP--WVL-------DYGCHHGKMMLLFWK-----AITTANLIQKDYERKTQGIWL 324
Query: 292 QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
QDFP K + FE+ L+DY ++ + PS + +++S+ V
Sbjct: 325 QDFPKKRGD-------FEDTLVDYF---------GHMGNERQLQFQPSSLRHYDYSAVRV 368
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF-KKSPLVYQFSSLGSLDEKWMAEL 410
L+ SVPGYH+ ++L ++GHM+LR +L T ++S + QFSS+GSL KW+ E
Sbjct: 369 ALVTSVPGYHSRATLNRYGHMRLRGLLSRVTMPAEIERRSSVACQFSSVGSLTAKWVEEE 428
Query: 411 --SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 468
S M+S S D E +VWPTV+ VR S++GYAAG ++ + N KDF+
Sbjct: 429 FGQSLMASAGSSDSKKEAQVE--LVWPTVDYVRSSIDGYAAGGSLCFGESN-RKDFMTPL 485
Query: 469 WAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYE 528
+ ++KA R R PHIK LTSANLSKAAWGALQK N+QLMIR++E
Sbjct: 486 FRQYKAMPESRGRVTPHIKV------------CLTSANLSKAAWGALQKGNTQLMIRNFE 533
Query: 529 LGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 588
+GVL LPS F + I GS+ A S + V
Sbjct: 534 IGVLFLPSH------FDDRTFIA-------------------------GSAPAALSKDSV 562
Query: 589 YLPVPYELPP-QRYSSEDVPWSWDKRYTKKDVYGQV 623
+P+PY + P +RY D PW WD + D GQ
Sbjct: 563 VIPLPYRIEPLERYGPRDEPWIWDLPRPEPDALGQT 598
>gi|349604421|gb|AEP99976.1| Tyrosyl-DNA phosphodiesterase 1-like protein, partial [Equus
caballus]
Length = 345
Score = 234 bits (598), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 149/384 (38%), Positives = 210/384 (54%), Gaps = 58/384 (15%)
Query: 257 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFEN 310
+K MLL+Y G+R+++HT+NL+H DW+ K+QG+W+ PL + ++ F+
Sbjct: 1 TKMMLLLYEEGLRVVIHTSNLLHADWHQKTQGIWLS--PLYSRIVHGTHSSGESTTHFKA 58
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
DLI YL P + ++ + S V LI S PG GS WG
Sbjct: 59 DLISYLMAYNAPSLKEWI----------DVIQEHDLSETNVYLIGSTPGRFQGSQKDNWG 108
Query: 371 HMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPL 425
H +LR +L+E +S P+V QFSS+GS+ + KW+ +E S+ + E KTP
Sbjct: 109 HFRLRALLKEHASSIPKAESWPIVGQFSSIGSMGADESKWLCSEFKESLVTLGKESKTPG 168
Query: 426 GIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAM 483
P +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +GRS AM
Sbjct: 169 KSVSPFHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETSGRSNAM 228
Query: 484 PHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 541
PHIKT+ R + ++AWFL+TSANLSKAAWGAL++N +QLMIRSYELGVL LPSA
Sbjct: 229 PHIKTYMRPSPDFSQIAWFLVTSANLSKAAWGALERNGAQLMIRSYELGVLFLPSA---- 284
Query: 542 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 601
F S V + S + E + PVPY+LPP+ Y
Sbjct: 285 --FGLDSFKVKQKFFSDNQEPT------------------------ASFPVPYDLPPELY 318
Query: 602 SSEDVPWSWDKRYTKK-DVYGQVW 624
S+D PW W+ Y K D +G +W
Sbjct: 319 GSKDRPWIWNIPYIKAPDTHGNMW 342
>gi|18044048|gb|AAH19804.1| Tdp1 protein [Mus musculus]
Length = 343
Score = 234 bits (598), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 152/380 (40%), Positives = 209/380 (55%), Gaps = 56/380 (14%)
Query: 260 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDY 315
MLL+Y G+R+++HT+NLI DW+ K+QG+W+ +P DQ + + F+ DLI Y
Sbjct: 2 MLLLYEEGLRVVIHTSNLIREDWHQKTQGIWLSPLYPRIDQGSHAAGESSTRFKADLISY 61
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 375
L+ P + ++ + S V LI S PG GS WGH +LR
Sbjct: 62 LTAYNAPPLQEWI----------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLR 111
Query: 376 TVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGE 429
+LQ + KG + P+V QFSS+GSL + KW+ +E S+ + E + P
Sbjct: 112 KLLQAHAPSTPKG-ECWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAV 170
Query: 430 PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK 487
PL +++P+VE+VR SLEGY AG ++P + +K +L Y+ KW A +GRS AMPHIK
Sbjct: 171 PLHLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIK 230
Query: 488 TFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 545
T+ R + KLAWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F
Sbjct: 231 TYMRPSPDFSKLAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FG 284
Query: 546 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 605
+ V + S S E + PVPY+LPP+ Y S+D
Sbjct: 285 LDTFKVKQKFFSSSCEPT------------------------ASFPVPYDLPPELYRSKD 320
Query: 606 VPWSWDKRYTKK-DVYGQVW 624
PW W+ Y K D +G +W
Sbjct: 321 RPWIWNIPYVKAPDTHGNMW 340
>gi|358337002|dbj|GAA55434.1| tyrosyl-DNA phosphodiesterase 1, partial [Clonorchis sinensis]
Length = 1156
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 156/432 (36%), Positives = 229/432 (53%), Gaps = 49/432 (11%)
Query: 188 QGDIIVAILSNYMVDIDWLLP-------ACPVLAKIPHVLVIHGESDGTLEHM--KRNKP 238
GD++ + NYM D+DWL+ +CP+L V HG+ L + K
Sbjct: 759 HGDLVSSAQFNYMFDVDWLMQQYPKQFRSCPLLL----VHAYHGQDKAALNSVVSKYENI 814
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
+ H + + FGTHH+K M L Y G+RI++HTAN+I DW+ ++QG+W+ L+
Sbjct: 815 RQCVAH---IRLPFGTHHTKMMFLKYADGLRIVIHTANMIPDDWDRRTQGIWLSPKLLRK 871
Query: 299 QNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
SE + F L++YL + A P+ + + ++FS V L+
Sbjct: 872 SGTSSETDSDTKFRETLVNYLR--GYGSTVAGTPSSPLGEWIEELLQ-YDFSPIRVFLVG 928
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
SV G H GSSLK +GH +L +LQ+ T E PL+ QFSS+GSL + L++ S
Sbjct: 929 SVSGMHGGSSLKHFGHPRLANLLQDYTLEVP-SSWPLIGQFSSIGSLGAQPTTWLTTQWS 987
Query: 416 SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKA 474
S + K G+ +++P V+DVR SLEGYAAG +P ++ +K +L+++ +W A
Sbjct: 988 SSLA-GKGARGL---RMIFPCVDDVRNSLEGYAAGGCLPYSRQTAEKQPWLRQFLHRWCA 1043
Query: 475 SHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 532
SRA PHIK++ R +G +WFLLTSANLSKAAWG+ K+ SQLMIRSYELGVL
Sbjct: 1044 G--PHSRAAPHIKSYTRISNDGTHASWFLLTSANLSKAAWGSFVKDGSQLMIRSYELGVL 1101
Query: 533 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 592
+P + +C + PS + S QI AG + + PV
Sbjct: 1102 FVPGQFQEKA--NCFRLVTPSRTTTPSDALKQI---------------AGMRTHSIPFPV 1144
Query: 593 PYELPPQRYSSE 604
PY+LPP Y ++
Sbjct: 1145 PYDLPPVLYDTD 1156
>gi|360045261|emb|CCD82809.1| putative tyrosyl-DNA phosphodiesterase [Schistosoma mansoni]
Length = 483
Score = 232 bits (591), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 160/474 (33%), Positives = 249/474 (52%), Gaps = 79/474 (16%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH---GESDGTLEHMKRNKPANWILHK 245
G+++ +I N+M DI WL P + + ++H G+ +L+ K +N +
Sbjct: 48 GELVSSIQFNFMFDIPWLREQYPERFRSLPLTIVHDFQGKMKKSLDE-SVAKYSNIRTCQ 106
Query: 246 PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNN 301
+ + +G HH+K M+L Y G++II+HTAN+I DW+ ++QG+WM ++ Q N
Sbjct: 107 ADIRLPYGVHHTKMMMLKYKDGLKIIIHTANMISDDWDRRTQGIWMSPKLKLLSVEQQKN 166
Query: 302 LSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF-----FKKFNFSSAAVRL 353
L++ + F DL++YL + + +L + +P F ++F V L
Sbjct: 167 LNDTDSKTNFRADLLEYLKS-----YGRDLTQSTS---SPLFEWINCLHSYDFRPIKVVL 218
Query: 354 IASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSS 413
IASV G H G SLKK+GH +L VLQ C + P++ QFSS+GSL K ++
Sbjct: 219 IASVSGRHAGESLKKFGHTRLGEVLQTCNSQIP-SSWPVIGQFSSIGSLGPKPTDWFTTE 277
Query: 414 MSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKW 472
SS + K G+ +++P+VEDVR SLEGY AG +P + +K +L +++ +W
Sbjct: 278 WSSSLA-GKGARGLR---MIYPSVEDVRNSLEGYFAGGCLPYTKTTAEKQPWLCQFFYRW 333
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
+A + SRA PHIK++ R +GQ++ WFLLTSANLSK+AWGA +K+ SQLMIRSYELG
Sbjct: 334 QAFN--HSRAAPHIKSYTRMSPDGQQIGWFLLTSANLSKSAWGAYEKSKSQLMIRSYELG 391
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL LP+ + EI + + SQ ++ E++
Sbjct: 392 VLFLPTNYKESAH--------SFEILKNNAKYSQ-----------------SSTDELLPF 426
Query: 591 PVPYELPPQRYSSED--------------------VPWSWDKRYTKKDVYGQVW 624
P+PYELPP +Y S PW DK ++ D++G++W
Sbjct: 427 PIPYELPPVKYQSNGKKLYMCIIIFLSLFFAMDKYEPWILDKPHSLPDIFGRIW 480
>gi|339256684|ref|XP_003370218.1| 7 transmembrane receptor [Trichinella spiralis]
gi|316965617|gb|EFV50306.1| 7 transmembrane receptor [Trichinella spiralis]
Length = 478
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 163/487 (33%), Positives = 243/487 (49%), Gaps = 63/487 (12%)
Query: 164 FRLLRVQGLPAWANTSCVSIRD---VIQGD----IIVAILSNYMVDIDWLLPACPVLAKI 216
F L +V GL N + VS+++ + G+ + N+++D W + P +
Sbjct: 27 FYLTKVYGLDEKWNENAVSMKNFNLALLGENPDELEATAQFNFLIDYGWTMAQYPENCRQ 86
Query: 217 PHVLVIHGESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+ ++ + + K N L LPI FGTHHSK LL Y +G+++ +HT
Sbjct: 87 KPLTIVTSSQSSRWNDLVNDVRKATNVSLVDARLPIPFGTHHSKMTLLRYRKGLKVAIHT 146
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE----CGFENDLIDYLSTLKWPEFSANLP 329
ANLI DW K+QG+++ FPL + N +++ F+ DLI YL+ P A
Sbjct: 147 ANLIEYDWCEKTQGMYISPLFPLIENNTGTDDYDSKTNFKADLIAYLNAYTNPAVKAWAE 206
Query: 330 AHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ-ECTFEKGFK 388
N+ + A V ++AS+PG H ++ WGH+KL +L+ ++
Sbjct: 207 EIENYDMR----------EANVFIVASIPGRHRDVAMYNWGHLKLGRILKTHLNYDAIDA 256
Query: 389 KSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDV 440
P+V QFSS+GSL EKW+ E ++S+ E + EP +V+P+VE+V
Sbjct: 257 NWPVVCQFSSIGSLGTKPEKWLLGEFAASLGRTAFECS---ALQEPFRNLKLVYPSVENV 313
Query: 441 RCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKL 497
RCS EGY G +P + K +L+++ +W GRS A+PHIKT+ RY+ QKL
Sbjct: 314 RCSSEGYYGGTCLPYTEAVASKQQYLQQFMHRWMCECFGRSHAVPHIKTYFRYSPCFQKL 373
Query: 498 AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKS 557
AWFLLTSANLSKAAWG +K+N Q IRSYE+GVL +P F C NI
Sbjct: 374 AWFLLTSANLSKAAWGVTEKSNQQFNIRSYEIGVLFIPE-------FFCERNI------- 419
Query: 558 GSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK 617
+Q K T+ H + + ++ P+P +LP YS D W D Y +
Sbjct: 420 ----NFFLQGLKAFTI--HRNVETPSAE----FPLPMDLPLVPYSQNDKMWIIDIPYGEA 469
Query: 618 DVYGQVW 624
D +G W
Sbjct: 470 DAHGITW 476
>gi|324510072|gb|ADY44216.1| Tyrosyl-DNA phosphodiesterase [Ascaris suum]
Length = 452
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 154/508 (30%), Positives = 246/508 (48%), Gaps = 80/508 (15%)
Query: 134 QQDEQDNENGKNSEEALCNFHVSRDKLPST-FRLLRVQGLPAWANTSCVSIRDVIQG-DI 191
+ D D + + ++ F L S ++ G P +T+ S+ ++++
Sbjct: 7 ENDGDDASSARTPSASMVKFRKQDSPLLSNRLYFTKIVGHPCRYSTNAFSLSELLELISP 66
Query: 192 IVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM------KRNKPANWILHK 245
I +I N+M+D+ WLL P + +I GE++GT H+ +R K N + +
Sbjct: 67 IASIHFNFMIDLHWLLSQYPERCSAYPISIIVGENNGT-NHLDVRAEARRCKADNVSVGR 125
Query: 246 PPLPISFGTHHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
L + +GTHHSK ++ + +++ TANL+ DW++K+Q + P+ +
Sbjct: 126 ARLVLPYGTHHSKLSIFETDSEMIHVVISTANLLQNDWDSKTQAFYHCSAPIVNGEVEEG 185
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
+ F DLI YL+ ++ G + +FS R+I+S+PGYH G
Sbjct: 186 QNNFRKDLISYLNAY------SSSSDFGMIEYWRDRIANADFSDVNARIISSIPGYHVGD 239
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK---WM-AELSSSMSSGFSE 420
++GH++LR VL+ + KK V QFSS+GSL K W+ A+ S++ G
Sbjct: 240 QKDRYGHLRLRRVLRSLQLD--LKKPSFVAQFSSIGSLGPKPDSWLTAQFLQSLAGGIPV 297
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 479
++ L +++P VEDVR S+EGY AG A+P + + +L + KW+ GR
Sbjct: 298 PESSL-----RLIYPCVEDVRNSVEGYMAGGALPYQRNTAARQPYLLERMHKWRCERFGR 352
Query: 480 SRAMPHIKTFARY-NGQKL-AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 537
+RAMPHIK+++ + +G+ L +W L+TSANLSKAAWG LQK SQL IRSYELGVL+
Sbjct: 353 TRAMPHIKSYSAFSDGRCLPSWLLITSANLSKAAWGELQKKESQLAIRSYELGVLL---- 408
Query: 538 KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELP 597
T+ +Q +PY++P
Sbjct: 409 ----------------------TDEDSLQL------------------------LPYDMP 422
Query: 598 PQRYSSEDVPWSWDKRYTKKDVYGQVWP 625
++ D PW D YTK D++G WP
Sbjct: 423 LTKFEPGDQPWVCDDTYTKPDIHGATWP 450
>gi|71423941|ref|XP_812626.1| tyrosyl-DNA phosphodiesterase (Tdp1) [Trypanosoma cruzi strain CL
Brener]
gi|70877431|gb|EAN90775.1| tyrosyl-DNA phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 171/540 (31%), Positives = 265/540 (49%), Gaps = 87/540 (16%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNYMVDIDWLLPAC 210
+KL F + RV G+ N S +++ D++ D+ +L+NYM+DI+WL+
Sbjct: 2 NKLLCPFWVNRVDGISV-DNPSALTLGDLLYCDVNDQEEVWSYVLLANYMIDIEWLVRVA 60
Query: 211 PVLAKIPH-VLVIHGE--------SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
P L + + ++ GE S ++K K + +P LP+ FG HHSK +L
Sbjct: 61 PSLLQTKQQIFIVSGEKEYEKKIQSSFLFRYIKAEKVR---IVEPKLPLPFGVHHSKLVL 117
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSEECG-------F 308
+ G+R+ V TAN I DW KSQG+++QDFP K DQ NL+ G F
Sbjct: 118 CVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQNSPKTDQANLTFSAGNEIRGNKF 177
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
+N+L+ YL+ + N A I + F + +FS+ V +I S+PGYH + +
Sbjct: 178 KNELLRYLNCYGIISNTENTVA-----IPSTLFDEIDFSAVCVEIITSIPGYHRYTDIHS 232
Query: 369 WGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGF----SEDK 422
+G ++ VL E + L++QFSS G L ++ L ++MS+ + +K
Sbjct: 233 FGLGRIPKVLHSIDTELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMSTEWKSIEEANK 292
Query: 423 TPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-- 478
PL PL IV+PT +VR SLEG+ G ++P + ++ + +W G
Sbjct: 293 KPL---RPLVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINRRLHRWGQGTRGLC 348
Query: 479 -----RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
R RA+PH+KT+ R N +K + WF+LTSANLS+AAWG QK QL IRSYE GV
Sbjct: 349 KIEFLRRRALPHLKTYMRLNEKKDGIKWFILTSANLSRAAWGEWQKKGDQLAIRSYEFGV 408
Query: 532 LILPS---AKRHGCGFSCTSNI---VPSEIKS-GSTETSQIQKTKLVTLTWHGSSDAGAS 584
+ + G FS T + +PS ++ G E Q K + + G S
Sbjct: 409 VYGKGSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK-------QNIEEGPS 461
Query: 585 SEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDVYGQVWPRHFQL 630
+ Y P+ PY ++ QR +++D+PW D + KDV+G+ R +L
Sbjct: 462 LFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDVFGKEIHRAMEL 521
>gi|148686962|gb|EDL18909.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_c [Mus musculus]
Length = 542
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 136/376 (36%), Positives = 204/376 (54%), Gaps = 33/376 (8%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKN 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P DQ + + F+ DL YL+ P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIDQGSHTAGESSTRFKADLTSYLTAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFK 388
++ + S V LI S PG GS WGH +LR +LQ + KG +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHRDNWGHFRLRKLLQAHAPSTPKG-E 391
Query: 389 KSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCS 443
P+V QFSS+GSL + KW+ +E S+ + E + P PL +++P+VE+VR S
Sbjct: 392 CWPIVGQFSSIGSLGPDESKWLCSEFKDSLLALREEGRPPGKSAVPLHLIYPSVENVRTS 451
Query: 444 LEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR--YNGQKLAWF 500
LEGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R + KLAWF
Sbjct: 452 LEGYPAGGSLPYSIQTAEKQRWLHSYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWF 511
Query: 501 LLTSANLSKAAWGALQ 516
L+T K WG ++
Sbjct: 512 LVTRQPAFK-YWGPVR 526
>gi|340055492|emb|CCC49811.1| putative tyrosyl-DNA Phosphodiesterase (Tdp1) [Trypanosoma vivax
Y486]
Length = 548
Score = 205 bits (522), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 169/521 (32%), Positives = 241/521 (46%), Gaps = 83/521 (15%)
Query: 168 RVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNYMVDIDWLLPACPVLAKIPHVL 220
R++ LP + S + + D++ D +L+NY++D +WLL P + L
Sbjct: 10 RIKALPT-ESPSAIRLGDILHCDAENPDERWTHVVLANYLIDPEWLLRVAPAITCTSRQL 68
Query: 221 VIHGESDGTLEHMKRNKPANWI------LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
I G H + A + + +PP+P+ FG HH+K +L I RG+R+ V T
Sbjct: 69 FIITGERGFAHHFASSTMAAHMGAGRVTVIEPPMPLPFGVHHTKLVLGINSRGLRVAVLT 128
Query: 275 ANLIHVDWNNKSQGLWMQDFP-----------LKDQNNLSEECG--FENDLIDYLSTLKW 321
AN I DW+ K+QG++MQDFP L E G F ++L YL +
Sbjct: 129 ANFIEEDWDMKAQGIYMQDFPRSLTPDKEGRYTAQSATLQEGRGERFRSELRRYLHS--- 185
Query: 322 PEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQEC 381
+ +G I PS F +FSSA+V LIASVPGYH G +G +L V+Q
Sbjct: 186 --YGLLSDENGLKGIPPSHFDGIDFSSASVELIASVPGYHRGGEAYSFGMGRLLKVVQSV 243
Query: 382 TFEKGFK--KSPLVYQFSSLGSLDEKWMAELSSSMSSGF---SEDKTPLGIGEP--LIVW 434
K L +QFSS G L EK++ L +M + D+ P EP +V+
Sbjct: 244 QMGPILDGGKPILTWQFSSQGLLTEKFLKSLEDAMLGNHAVGATDRRP----EPEVRVVY 299
Query: 435 PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG---------RSRAMPH 485
PT +V+ SLEG+ G ++P + ++ +W H G R RAMPH
Sbjct: 300 PTESEVKNSLEGWRGGMSLPV-RLRCCHPYINARMHRW--CHRGVSEAVNKPVRGRAMPH 356
Query: 486 IKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 543
+KT+ R L WFLLTSANLS+AAWG Q+N SQL IRSYELGVL S C
Sbjct: 357 LKTYMRLAEGEDSLHWFLLTSANLSRAAWGEWQRNGSQLAIRSYELGVL-YDSKSFINCA 415
Query: 544 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH-GSSDAGASSEVVYLPV------PYEL 596
+ PS S ++ L+ L G++D + V++LP PYE
Sbjct: 416 EGELFVVTPSR---RIPLPSSVEGDGLLRLHIRAGANDIIGEAPVLFLPYDALHPEPYES 472
Query: 597 PPQR---------------YSSEDVPWSWDKRYTKKDVYGQ 622
Q S++DVPW D + +D G+
Sbjct: 473 TLQLRKNHGSSVENESHAPLSTKDVPWVVDAPHHGRDALGK 513
>gi|345304821|ref|XP_003428263.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1 [Ornithorhynchus
anatinus]
Length = 580
Score = 205 bits (522), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 129/375 (34%), Positives = 208/375 (55%), Gaps = 27/375 (7%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L +V+G+ N+ + IRD++ G ++ + NY D+DWL+ P +
Sbjct: 159 PFRFYLTKVKGIMPKYNSGALHIRDILSPLLGTLVSSAQFNYCFDVDWLIKQYPPEFRNK 218
Query: 218 HVLVIHGES-DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ + + ++ KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 219 PLLLVHGDKREAKAQLHEQAKPYENICLCQAKLDIAFGTHHTKMMLLLYEEGMRVVIHTS 278
Query: 276 NLIHVDWNNKSQGLWMQD-FP--LKDQNNLSEECG-FENDLIDYLSTLKWPEFSANLPAH 331
NLIH DW+ K+QG+W+ +P +++ ++ + F+ DLI+YL P +
Sbjct: 279 NLIHADWHQKTQGIWLSPLYPRLVRETHSSGDSVTHFKTDLINYLMAYNSPSLKEWI--- 335
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS- 390
K+ + S V LI S PG G + WGH +LR +L+E + ++S
Sbjct: 336 -------DIIKEHDLSETRVYLIGSTPGRFQGQKKEDWGHFRLRKLLEEHSSSIPEEESW 388
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEG 446
P+V QFSS+GS+ + KW+ +E S+ K+ G +++PTV++VR SLEG
Sbjct: 389 PIVGQFSSIGSMGADESKWLCSEFKDSLVMLGKSGKSQGGHVPIHLIYPTVDNVRKSLEG 448
Query: 447 YAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLT 503
Y AG ++P + K +L Y+ KW A +GRS AMPHIKT+ R + Q++AWFL+T
Sbjct: 449 YPAGGSLPYSIQTAQKQLWLHSYFHKWSAEISGRSHAMPHIKTYMRLSPDFQQIAWFLVT 508
Query: 504 SANLSKAAWGALQKN 518
A+ G L +N
Sbjct: 509 RASAFDVTGGFLTEN 523
>gi|407867395|gb|EKG08563.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 205 bits (521), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 171/540 (31%), Positives = 264/540 (48%), Gaps = 87/540 (16%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNYMVDIDWLLPAC 210
+KL F + RV G+ N S +++ D++ D+ +L+NYM+DI+WL+
Sbjct: 2 NKLLCPFWVNRVDGISV-DNPSALTLGDLLYCDVNDQEEVWSYVLLANYMIDIEWLVRVA 60
Query: 211 PVLAKIPHVL-VIHGE--------SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
P L + L ++ GE S ++K K + +P LP+ FG HHSK +L
Sbjct: 61 PSLLQTKQQLFIVSGEKEYEKKIQSSFLFRYIKAKKIR---IVEPKLPLPFGVHHSKLVL 117
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSEECG-------F 308
+ G+R+ V TAN I DW KSQG+++QDFP K D+ NL+ G F
Sbjct: 118 CVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQNSPKTDRANLTFSAGNEIRGNNF 177
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
+N+L+ YL+ + N A I + F + +FS+ V +I S+PGYH + +
Sbjct: 178 KNELLRYLNCYDIISNTENTEA-----IPSTLFDEIDFSAVCVEIITSIPGYHRYTDIHS 232
Query: 369 WGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE----DK 422
+G ++ VL E + L++QFSS G L ++ L ++MS+ + +K
Sbjct: 233 FGLGRIPKVLHSIDTELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMSTEWKSIEEANK 292
Query: 423 TPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-- 478
PL PL IV+PT +VR SLEG+ G ++P + ++ +W G
Sbjct: 293 KPL---RPLVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINGRLHRWGQGTRGLC 348
Query: 479 -----RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
R RA+PH+KT+ R N +K + WF+LTSANLS+AAWG QK QL IRSYE GV
Sbjct: 349 KIEFLRRRALPHLKTYMRLNEKKDGIKWFILTSANLSRAAWGEWQKKGDQLAIRSYEFGV 408
Query: 532 LILPS---AKRHGCGFSCTSNI---VPSEIKS-GSTETSQIQKTKLVTLTWHGSSDAGAS 584
+ + G FS T + +PS ++ G E Q K + + G S
Sbjct: 409 VYGKGSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK-------QNIEEGPS 461
Query: 585 SEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDVYGQVWPRHFQL 630
+ Y P+ PY ++ QR +++D+PW D + KDV+G+ R +L
Sbjct: 462 LFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDVFGKEIHRAMEL 521
>gi|397627380|gb|EJK68455.1| hypothetical protein THAOC_10361 [Thalassiosira oceanica]
Length = 656
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 151/501 (30%), Positives = 236/501 (47%), Gaps = 98/501 (19%)
Query: 195 ILSNYMVDIDWLLP-ACPVLAKIPHVLVIHGESDGTLEHMKR------------------ 235
I+ NY++D +L A P L + V+V +G S + R
Sbjct: 181 IICNYLIDFSYLFQRASPELLQFQRVVVFYGTSGQACPAVMRQWERLLEGTGRTVAFVQL 240
Query: 236 --NKPANWILHKPPLPISFGTHHSKAMLLIYP------RGVRIIVHTANLIHVDWNNKSQ 287
+ P N + P+ I +G HH+K L+ Y + +HT+N++H D KSQ
Sbjct: 241 LPSDPPNSRANPLPVKIEYGVHHTKMFLMGYEDEESGISKCHVSIHTSNILHSDAELKSQ 300
Query: 288 GLWMQDFPLK------DQNNLSEECG-------FENDLIDYLSTLKWPEFSANLPAHGNF 334
G++ QDFPLK N S+E FE+DL+ Y+ + ++ + + +F
Sbjct: 301 GVYAQDFPLKVAPGKSTGNPYSKEEDASKTPRQFEDDLVTYMESYRYQARQSWCSSSASF 360
Query: 335 KINPS------FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR-TVLQECTFEKGF 387
++ + ++FS+A LI SVPG H + + ++G++KLR V+Q +
Sbjct: 361 GLSNQPMTILQLIRAYDFSTAYCVLIPSVPGRHRANDMHEYGYLKLRKAVIQHA---RSQ 417
Query: 388 KKSPLVYQFSSLGSLDEKWMAELSSSMSSGF--------SEDKTPLGIGEPL----IVWP 435
SPL+ QFSSLGSL+ KW+++ S + S S+ K G + IVWP
Sbjct: 418 TNSPLLLQFSSLGSLNGKWLSQFLSCLDSSAQSFDPVTESDKKKSKGTSDLASRMKIVWP 477
Query: 436 TVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR------SRAMPHIKTF 489
+VE+VR +EGY+ G AIP KN++K FL + +W + + S+ PHIKTF
Sbjct: 478 SVEEVRTCVEGYSGGGAIPGRTKNLEKAFLMPLYHRWSSRNPNNEGPLKTSKHAPHIKTF 537
Query: 490 AR--YNGQKLAWFLLTSANLSKAAWGALQKNNSQ-----LMIRSYELGVLILPSAKRHGC 542
+ +G ++ W LL S NLS AA G +QK + L IR +ELGV I P +
Sbjct: 538 VQPSSDGTEIEWMLLGSHNLSIAALGQIQKRHKDSSEKILFIRHWELGVFISPRTLKQAG 597
Query: 543 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 602
+ K VTL + + SE V +P+PY+L P Y+
Sbjct: 598 NYD----------------------GKDVTLVPYRGGGMSSGSE-VQVPLPYDLNPTPYN 634
Query: 603 SEDVPWSWDKRYTKKDVYGQV 623
+EDV W+ D+ D +G++
Sbjct: 635 NEDVTWAVDRTTFLPDRFGRI 655
>gi|149025342|gb|EDL81709.1| tyrosyl-DNA phosphodiesterase 1, isoform CRA_b [Rattus norvegicus]
Length = 542
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 131/362 (36%), Positives = 195/362 (53%), Gaps = 30/362 (8%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ A N+ + I+D++ G ++ + NY D++WL+ P +
Sbjct: 164 PFQFYLTRVSGIKAKYNSKALHIKDILSPLFGTLVSSAQFNYCFDVNWLIKQYPPEFRKK 223
Query: 218 HVLVIHG---ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHT 274
+L++HG E+ L H + AN L + L I+FGTHH+K MLL+Y G+R+++HT
Sbjct: 224 PILLVHGDKREAKADL-HAQAKPYANISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHT 282
Query: 275 ANLIHVDWNNKSQGLWMQD-FPLKDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPA 330
+NLI DW+ K+QG+W+ +P Q N + F+ DL YL P +
Sbjct: 283 SNLIREDWHQKTQGIWLSPLYPRIYQGNHTSGESSTHFKADLTSYLMAYNAPPLQEWI-- 340
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS 390
++ + S V LI S PG GS WGH +LR +LQ +
Sbjct: 341 --------DIIQEHDLSETNVYLIGSTPGRFQGSHKDNWGHFRLRKLLQAHAPSAPRGEC 392
Query: 391 -PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSL 444
P+V QFSS+GSL + KW+ +E S+ + E +TP PL +++P+VE+VR SL
Sbjct: 393 WPVVGQFSSIGSLGPDESKWLCSEFKESLLAVREEGRTPGRSAVPLHLIYPSVENVRTSL 452
Query: 445 EGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFAR--YNGQKLAWFL 501
EGY AG ++P + +K +L Y+ KW A +GRS AMPHIKT+ R + KLAWFL
Sbjct: 453 EGYPAGGSLPYGIQTAEKQRWLHPYFHKWSAETSGRSNAMPHIKTYMRPSPDFSKLAWFL 512
Query: 502 LT 503
+T
Sbjct: 513 VT 514
>gi|343477672|emb|CCD11565.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 548
Score = 202 bits (513), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 204/375 (54%), Gaps = 51/375 (13%)
Query: 194 AILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH-------- 244
IL Y++D++WL P+L +++I GE G L +K + +LH
Sbjct: 43 VILGGYVIDVEWLFRVSGPLLMSKCTIVLISGEK-GFL-----HKYRHLVLHDRFGRNRV 96
Query: 245 ---KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-LKDQN 300
+P LPI FG HHSK ML I G+R+ V TAN I DWN K+QG++ QDFP LK Q+
Sbjct: 97 KIVEPCLPIPFGVHHSKMMLCINNNGIRVAVLTANFIEDDWNYKTQGIYFQDFPRLKTQS 156
Query: 301 -----NLSEECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
N+S G F N++ YLS + ++++P G + S +F+FS A V
Sbjct: 157 ENIVLNISSIEGKGMRFRNEIKRYLSCIG---VASSMPKDGCIPL--SLLDEFDFSGACV 211
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP--LVYQFSSLGSLDEKWMAE 409
LIASVPGYH S + +G KL+++LQ ++P L +QF+S G L ++
Sbjct: 212 ELIASVPGYHRCSDAQHYGLGKLKSILQSMQLPSSLDRNPPVLTWQFTSQGLLTANFLNS 271
Query: 410 LSSSMSSGFSEDKTPLGIG--EPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL 465
+ MS + + P G +P+ +V+PT +V+ SLEG+ G ++P + ++
Sbjct: 272 MKQIMS---IDARNPTGEDKMDPVVRVVYPTETEVKNSLEGWRGGLSLPVTLRCC-HSYI 327
Query: 466 KKYWAKWKASHTG------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQK 517
+ +W G RS+ +PH+KT+ R + L+WFLLTSANLS+AAWG Q
Sbjct: 328 NERLFRWGTVPQGSEVENERSKGLPHLKTYTRLTESEDGLSWFLLTSANLSRAAWGEWQH 387
Query: 518 NNSQLMIRSYELGVL 532
+QL+IRSYELGVL
Sbjct: 388 GGTQLLIRSYELGVL 402
>gi|71404281|ref|XP_804861.1| tyrosyl-DNA Phosphodiesterase (Tdp1) [Trypanosoma cruzi strain CL
Brener]
gi|70868036|gb|EAN83010.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi]
Length = 551
Score = 201 bits (511), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 166/532 (31%), Positives = 262/532 (49%), Gaps = 87/532 (16%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNYMVDIDWLLPAC 210
+KL F + RV G+ N S +++ D++ D+ +L++YM+DI+WL+
Sbjct: 2 NKLLCPFWVNRVDGISV-DNPSALTLGDLLYCDVNDQEEVWSYVLLASYMIDIEWLVRVA 60
Query: 211 PVLAKIP-HVLVIHGE--------SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
P L + + ++ GE S ++K K + +P LP+ FG HHSK +L
Sbjct: 61 PSLLQTKKQLFIVSGEKEYEKKIQSSFLFRYIKAEKVR---IVEPKLPLPFGVHHSKLVL 117
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSEECG-------F 308
+ G+R+ V TAN I DW KSQG+++QDFP K D+ NL+ G F
Sbjct: 118 CVNANGIRVAVLTANFIQDDWAYKSQGIYVQDFPRKQTSPKTDRANLTFSAGNEIRGNKF 177
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
+N+L+ YL+ + N A I + F + +FS+ V +I S+PGYH + +
Sbjct: 178 KNELLRYLNCYGIISNTENTVA-----IPSTLFDEIDFSAVCVEIITSIPGYHRYTDIHS 232
Query: 369 WGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE----DK 422
+G ++ VL E + L++QFSS G L ++ L ++MS+ + +K
Sbjct: 233 FGLGRIPKVLHSIDMELSDSIRAPLLIWQFSSQGKLTNSFLNALENAMSTEWKSIEEANK 292
Query: 423 TPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-- 478
PL P+ IV+PT +VR SLEG+ G ++P + ++ + +W G
Sbjct: 293 KPL---RPVVQIVYPTESEVRESLEGWRGGLSLPLRLSSCHP-YINRRLHRWGQGTRGLC 348
Query: 479 -----RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
R RA+PH+KT+ R +K + WF+LTSANLS+AAWG QK QL IRSYE GV
Sbjct: 349 KMEFLRRRALPHLKTYMRLTEKKDGIKWFILTSANLSRAAWGEWQKKGDQLAIRSYEFGV 408
Query: 532 LILPS---AKRHGCGFSCTSNI---VPSEIKS-GSTETSQIQKTKLVTLTWHGSSDAGAS 584
+ S + G FS T + +PS ++ G E Q K + + G S
Sbjct: 409 VYGKSSFISFLEGEPFSVTPSRKIPLPSLVEGDGLVEVHIDQGGK-------QNIEKGPS 461
Query: 585 SEVVYLPV---PY----ELPPQR-------YSSEDVPWSWDKRYTKKDVYGQ 622
+ Y P+ PY ++ QR +++D+PW D + KDV+G+
Sbjct: 462 LFLPYDPLHLEPYASTVQMQDQRGNNCESWINTDDIPWVIDMPHFGKDVFGK 513
>gi|84043866|ref|XP_951723.1| tyrosyl-DNA phosphodiesterase [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|33348708|gb|AAQ16032.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
gi|62358538|gb|AAX78999.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei]
Length = 553
Score = 199 bits (507), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 175/548 (31%), Positives = 261/548 (47%), Gaps = 107/548 (19%)
Query: 147 EEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNY 199
E LC F VSR V GL A + S +++ D++ +I +L+NY
Sbjct: 3 ETKLCPFWVSR-----------VSGL-ATESPSALTLSDLLHCNIEDPSEVWTHVVLANY 50
Query: 200 MVDIDWL--LPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI------LHKPPLPIS 251
++D++W+ + C L+ HV+++ GE +G E + A + + KP LP+
Sbjct: 51 LIDLEWVFDMATCLQLSNC-HVMIVSGE-EGLAERYAASPLAGLLGKERVEIIKPKLPLP 108
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-----------LKDQN 300
FG HH K +L + +GVRI V TAN I DW K+QG+++QDFP +
Sbjct: 109 FGVHHGKLILCVNSKGVRISVLTANFIESDWGKKTQGIYVQDFPRLVTSSASSNSMGSLQ 168
Query: 301 NLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 358
L G F+ ++ YLS + A G I S + ++S A V L++SVP
Sbjct: 169 ALRRCRGTRFKEEIKRYLSCI-----GAISSTTGTNCIPLSLLDEVDYSGACVELVSSVP 223
Query: 359 GYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 416
G H S ++G +L+ VL+ + + G LV+QFSS G+L ++ L M+
Sbjct: 224 GCHRNSDAYRFGMGRLQEVLRAMQISSPSGENSPTLVWQFSSQGTLTSNFLRSLERVMT- 282
Query: 417 GFSEDKTPL-GIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKW- 472
S D TPL P I++PT +V+ S EG+ G ++P + ++ + +W
Sbjct: 283 -ISTDNTPLPDTKSPTVRIIYPTEAEVKGSFEGWHGGLSLPV-RLRCCHPYVNERLYRWG 340
Query: 473 -----KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIR 525
+ + GR+RAMPHIKT+ R NG L WF+LTSANLS+AAWG QK +Q++IR
Sbjct: 341 QRPYAEGADRGRNRAMPHIKTYMRLTENGDGLKWFMLTSANLSRAAWGEWQKGGTQILIR 400
Query: 526 SYELGVL------ILPSAKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTWH 576
SYELGV+ I P+ G FS T + VPS I + + K+ TL
Sbjct: 401 SYELGVVYGTDSFINPA---DGGLFSATPSKPIPVPSSIGGDG-----LVRVKIKTL--- 449
Query: 577 GSSDAGASSEVVYLPVPYELPPQRY----------------------SSEDVPWSWDKRY 614
S++ ++LP L PQ Y SS DVPW D +
Sbjct: 450 -PSESDRDEPTLFLPY-NPLNPQPYVSTLQMQQREHRHTGHSCVSQLSSLDVPWLVDLPH 507
Query: 615 TKKDVYGQ 622
KD G+
Sbjct: 508 RGKDCLGK 515
>gi|261326666|emb|CBH09628.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma brucei
gambiense DAL972]
Length = 553
Score = 199 bits (507), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 174/548 (31%), Positives = 259/548 (47%), Gaps = 107/548 (19%)
Query: 147 EEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDI-------IVAILSNY 199
E LC F VSR V GL A + S +++ D++ +I +L+NY
Sbjct: 3 ETKLCPFWVSR-----------VSGL-ATESPSALTLSDLLHCNIEDPSEVWTHVVLANY 50
Query: 200 MVDIDWL--LPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI------LHKPPLPIS 251
++D++W+ + C L+ HV+++ GE +G E + A + + KP LP+
Sbjct: 51 LIDLEWVFDMATCLQLSSC-HVMIVSGE-EGLAERYAASPLAGLLGKERVEIIKPKLPLP 108
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP----------LKDQNN 301
FG HH K +L + +GVRI V TAN I DW K+QG+++QDFP
Sbjct: 109 FGVHHGKLILCVNSKGVRISVLTANFIESDWGKKTQGIYVQDFPRLVTSSASSNSMGSLQ 168
Query: 302 LSEEC---GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVP 358
C F+ ++ YLS + A G I S + ++S A V L++SVP
Sbjct: 169 ALRRCRGTRFKEEIKRYLSCI-----GAISSTTGTNCIPLSLLDEVDYSGACVELVSSVP 223
Query: 359 GYHTGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 416
G H S ++G +L+ VL+ + + G LV+QFSS G+L ++ L M+
Sbjct: 224 GCHRNSDAYRFGMGRLQEVLRAMQISSPSGENSPTLVWQFSSQGTLTSNFLRSLERVMT- 282
Query: 417 GFSEDKTPL-GIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKW- 472
S D TPL P I++PT +V+ S EG+ G ++P + ++ + +W
Sbjct: 283 -ISTDNTPLPDTKSPTVRIIYPTEAEVKGSFEGWHGGLSLPV-RLRCCHPYVNERLYRWG 340
Query: 473 -----KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIR 525
+ + GR+RAMPHIKT+ R NG L WF+LTSANLS+AAWG QK +Q++IR
Sbjct: 341 QRPYAEGADRGRNRAMPHIKTYMRLTENGDGLKWFMLTSANLSRAAWGEWQKGGTQILIR 400
Query: 526 SYELGVL------ILPSAKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTWH 576
SYELGV+ I P+ G FS T + VPS I + + K+ TL
Sbjct: 401 SYELGVVYGTDSFINPA---DGGLFSATPSKPIPVPSSIGGDG-----LVRVKIKTL--- 449
Query: 577 GSSDAGASSEVVYLPVPYELPPQRY----------------------SSEDVPWSWDKRY 614
S++ ++LP L PQ Y SS DVPW D +
Sbjct: 450 -PSESDRDEPTLFLPY-NPLNPQPYVSTLQMQQRERRHTGHSCVSQLSSLDVPWLVDLPH 507
Query: 615 TKKDVYGQ 622
KD G+
Sbjct: 508 RGKDCLGK 515
>gi|322787271|gb|EFZ13407.1| hypothetical protein SINV_04400 [Solenopsis invicta]
Length = 647
Score = 196 bits (498), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 139/438 (31%), Positives = 221/438 (50%), Gaps = 63/438 (14%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N+MVD+ WL + + +L+++G+ ++H K + +N
Sbjct: 251 ILDRSLGEIVKSLHLNFMVDVGWLCLQYLLAGQRTDMLILYGDR---VDHEKLH--SNIT 305
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKDQ 299
+ + +P FG HH+K M+L Y G+R++V TANL DW N++QGLW+ P L +
Sbjct: 306 MIEVQMPTQFGCHHTKIMILQYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPRLPES 365
Query: 300 NNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
N S+ GF+ DL YL+ ++P+ + + A ++ NFS V L+AS
Sbjct: 366 ANPSDGESPTGFKKDLERYLNKYRFPDLTQWISA----------VRRANFSDVKVFLVAS 415
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
VPG H + WGH KL VL + T + P+V Q SS+GSL + + LS +
Sbjct: 416 VPGTHKDNEADSWGHKKLAHVLSRHATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKEII 475
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
S + T P ++P++++ + S + +P S + + + +++ Y +W
Sbjct: 476 PCMSRETTKGLKSHPHFQFIYPSIDNYKQSFDCRNLSCCLPYSAKTHSKQQWIESYLYQW 535
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
KA TGR RAMPHIK++ R + + ++WF+LTSANLSKAAWG +Q+NN +M SYE G
Sbjct: 536 KAKRTGRDRAMPHIKSYTRISPDLRSISWFVLTSANLSKAAWG-MQRNNHYIM--SYEAG 592
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
V+ +P K +T T + V
Sbjct: 593 VVFIP---------------------------------KFITGTTTFPIEDEEDPAVPVF 619
Query: 591 PVPYELPPQRYSSEDVPW 608
P+PY+LP RY S D P+
Sbjct: 620 PIPYDLPLCRYESSDRPF 637
>gi|209879936|ref|XP_002141408.1| tyrosyl-DNA phosphodiesterase family protein [Cryptosporidium muris
RN66]
gi|209557014|gb|EEA07059.1| tyrosyl-DNA phosphodiesterase family protein [Cryptosporidium muris
RN66]
Length = 513
Score = 196 bits (498), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 139/493 (28%), Positives = 234/493 (47%), Gaps = 100/493 (20%)
Query: 181 VSIRDVIQGD-------------IIVAILSNYMVDIDWLLPAC---PVLAKIPHVLVIHG 224
+SI+D+ + D I ++S+Y++DI WL + K+ +L+IHG
Sbjct: 48 LSIKDIFRADCEYCFDGEQDSWLIQDLLVSSYIIDIKWLFKEVRLNKIDEKLNRLLIIHG 107
Query: 225 ES---DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG----------VRII 271
S D T E N N+ + P +P+ +G H K ++L + + +R++
Sbjct: 108 GSCNLDDTTEIQILNIAKNYEIQCPTMPLPYGVFHPKFLILKFSKQDPIIKKEESFIRLV 167
Query: 272 VHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE---CGFENDLIDYL-STLKWPEFSAN 327
+ TAN + DW K+Q +W+QDF L + +N + + C + ++++ S ++ +F ++
Sbjct: 168 ITTANFLESDWKFKTQAVWVQDFLLANNSNGAMKNPFCEYFGMFLNHIISKIEHKKFWSD 227
Query: 328 LPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE------- 380
L K++++ +A V L+ASVPGYH G ++K WGH++++ +++
Sbjct: 228 L------------IKQYDYDNATVDLVASVPGYHKGENMKLWGHLRMKEIMKYKTDLNST 275
Query: 381 ---------CTFEK-----GFKKSPLVYQFSSLGSLDEKWMA-ELSSSMSSGFSEDKTPL 425
C E+ +S ++ QFSSLG EKW+ E S+++ +E T
Sbjct: 276 LNIEQPNRICKVEQYNNEYRHVESRIICQFSSLGKFSEKWLTQEFGDSLNTCINEYTTKS 335
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----RSR 481
+V+PT E V SLEG G +IP N+ K ++ K W + R
Sbjct: 336 SFE---LVYPTAEQVYKSLEGIYGGGSIPVKHNNITKSWISKILHLWGSGTLSNPSIRDL 392
Query: 482 AMPHIKTFARY--NGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
++PHIKTF RY N + + W S NL AAWG LQ N +Q+ IR+YELGV+I P
Sbjct: 393 SVPHIKTFLRYLWNSDRKTVSIPWIFYGSHNLGPAAWGQLQNNQTQMCIRNYELGVIITP 452
Query: 536 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 595
+ + I++ T + TK+ T S+ + VP+
Sbjct: 453 YTLYNNVKY----------IRTKRNRTPKFIWTKMET----------KSTPNYNIRVPFS 492
Query: 596 LPPQRYSSEDVPW 608
+PP +Y + D PW
Sbjct: 493 IPPIQYKTNDTPW 505
>gi|299115351|emb|CBN74172.1| Tyrosyl-DNA phosphodiesterase [Ectocarpus siliculosus]
Length = 607
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 162/514 (31%), Positives = 242/514 (47%), Gaps = 107/514 (20%)
Query: 119 NDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPA-WAN 177
N +NG S K ++ DN+ + +K P +RLL P+ A+
Sbjct: 39 NSSNSNGGTSQSKRPASEQGDNKTPSQRKGKRPRSFQPFEK-PPLYRLLSTS--PSDRAS 95
Query: 178 TSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK-RN 236
T V + D++ GD A+L NYMVD L+ P L +P V ++HG GT + + R+
Sbjct: 96 TGSVGLDDLLSGDFESALLCNYMVDYALLVRCAPRLGSVP-VTIVHGFKPGTQDEVNLRS 154
Query: 237 KPA---NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD 293
+ A L P LP +GT+H+K ++L +P G+R+ V TAN I VD +KSQG+W QD
Sbjct: 155 QCAVNPGVKLRYPELP-EYGTNHAKMIILKFPTGIRVAVLTANFIVVDVTDKSQGVWYQD 213
Query: 294 FPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
FP + S C F+ DL+ +L F PA S +++F A V L
Sbjct: 214 FPKR----TSGSCAFQEDLMGFL-------FKVGGPASAF----ASTLGEYDFRGARVAL 258
Query: 354 IASVPGY-----------HTGSSLKKWGHMKLRTVLQE-------CTFEKGFKKSPLVYQ 395
+ SVPG H G L K+GHM++R +L ++G K ++ Q
Sbjct: 259 VPSVPGTGGNTPGTGGKPHKGRDLHKYGHMRVRALLAREKEDGTGAKLKEGGHK--VLCQ 316
Query: 396 FSSLGSLDE---KWMAELSSSM-------------SSGFSEDKTPLGIGEP--LIVWPTV 437
SSL SL + +W++E+ +S SED+ + E +VWP+V
Sbjct: 317 ISSLASLTKTPNRWLSEILASFMPLEDEGKKAEPTRRSVSEDEAQATLLEQHLRVVWPSV 376
Query: 438 EDVRCSLEGYAAGNAI-----------------PSPQKNVDKDFLKKYWAKWKAS-HTGR 479
E VR S +G+ AG +I + + N L+ KWK + R
Sbjct: 377 EAVRTSSQGWIAGGSICCNTVNMYGGKYKWPNMDNYRSNTPLPELRPLLRKWKGNPAVNR 436
Query: 480 SRAMPHIKTFARY-------------NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRS 526
+R PHIK++ RY +G ++AWFLLTS+NLS++AWG L K ++ L +RS
Sbjct: 437 TRDAPHIKSYLRYREVAGENGTETRVDGDEVAWFLLTSSNLSRSAWGYLNKASTDLTLRS 496
Query: 527 YELGVLILPS-------------AKRHGCGFSCT 547
+E+GV+ LPS A GF+CT
Sbjct: 497 FEMGVMFLPSLLRSPSQDSDDGNAAAKASGFTCT 530
>gi|219113113|ref|XP_002186140.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582990|gb|ACI65610.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 305
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 175/304 (57%), Gaps = 20/304 (6%)
Query: 250 ISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK---DQNNLSEE 305
I +G HHSK L+ Y + +RII+HTAN+ + D + K+Q + QDF LK + N++
Sbjct: 1 IPYGVHHSKFFLVGYADQSLRIIIHTANIRYDDIHCKAQAAFFQDFGLKSPENFTNVANT 60
Query: 306 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 365
C FE DLIDYL + ++ + K F ++++FSSA L+ S PGYH
Sbjct: 61 CEFEEDLIDYLDSYRYTRLHKWTKSGSKTKSLGQFVREYDFSSAKAVLVPSTPGYHRLDE 120
Query: 366 LKKWGHMKLRTVLQECTF--EKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKT 423
+ GH K+R + T E+ P+V QFSS+GSL E+++ EL +SM S D+
Sbjct: 121 KHRRGHWKMRQTIPSHTEAPEEETICDPIVCQFSSIGSLTERYLLELQTSMDMKQSRDRG 180
Query: 424 PLGIGE--PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG--- 478
G E +V+PTVE++R S+EGY G ++P +NV K FLK+ + +W A +
Sbjct: 181 RPGRLELSLKLVYPTVEEIRTSVEGYRGGGSVPGTMRNVGKPFLKRLFCRWSALSSSDMN 240
Query: 479 ---RSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNN----SQLMIRSYEL 529
+ R +PH+KT+ + N + L WF+LTS NLSKAAWG +Q ++ +L +R +EL
Sbjct: 241 PLWKGRNVPHMKTYFQTNSTTETLHWFVLTSHNLSKAAWGEIQTSSRYGGRRLFVRHWEL 300
Query: 530 GVLI 533
GV +
Sbjct: 301 GVFL 304
>gi|407394035|gb|EKF26770.1| tyrosyl-DNA Phosphodiesterase (Tdp1), putative [Trypanosoma cruzi
marinkellei]
Length = 551
Score = 195 bits (496), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 159/533 (29%), Positives = 255/533 (47%), Gaps = 90/533 (16%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDII-------VAILSNYMVDIDWLLPAC 210
+KL F + RV G+ N S +++ D++ D+ +L++YM+DI+WL+
Sbjct: 2 NKLLCPFWVNRVDGISV-DNPSALTLGDLLYCDVNDQEEVWNYVLLASYMIDIEWLVCVA 60
Query: 211 PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH---------KPPLPISFGTHHSKAML 261
P L + L I G E+ K+ + ++ + +P LP+ FG HHSK +L
Sbjct: 61 PSLLQTKQKLFI---VSGEKEYEKKIQSSSLFAYIKAEKVRIVEPKLPLPFGVHHSKLVL 117
Query: 262 LIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK------DQNNLSEECG-------F 308
+ +G+R+ V TAN I DW KSQG+++QDFP + D+ NL+ G F
Sbjct: 118 CVNAKGIRVAVLTANFIQDDWVCKSQGIYVQDFPRRQNLPKTDRANLTFSAGSEIRGSEF 177
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK 368
+N+L+ YL+ + A I + F + +FS+A V +I S+PGY+ + +
Sbjct: 178 KNELLRYLNC-----YGIISNAENTVAIPSTLFDEIDFSAACVEIITSIPGYYRYNDVHS 232
Query: 369 WGHMKLRTVLQECTFE--KGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFSEDK 422
+G ++ VL E + L++QFSS G L ++ L ++MS S +K
Sbjct: 233 FGLGRIPKVLHSIDMELSDSIQVPLLIWQFSSQGKLTNSFLVALENAMSTEGKSNEEANK 292
Query: 423 TPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG-- 478
PL P+ IV+PT +V+ SLEG+ G ++P + ++ + +W G
Sbjct: 293 KPLC---PVVQIVYPTESEVKESLEGWRGGLSLPLRLSSCHP-YINRRLHRWGQGTRGTC 348
Query: 479 ----RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 532
R RA+PH+KT+ R +K + W +LTSANLS+AAWG QK +QL IRSYE GV+
Sbjct: 349 KIELRRRALPHLKTYMRLTEKKDGIKWLILTSANLSRAAWGEWQKKGNQLAIRSYEFGVV 408
Query: 533 ILPS---AKRHGCGFSCTSNI---VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 586
+ G FS T + +PS ++ I + G
Sbjct: 409 YGKDSFISFLEGEPFSVTPSRKIPLPSLVEGDGLAEVHIDQ---------GGKKDIEEGP 459
Query: 587 VVYLPV-PYELPP---------QR-------YSSEDVPWSWDKRYTKKDVYGQ 622
++LP P L P QR +++D+PW D + KDV+G+
Sbjct: 460 TLFLPYDPLHLEPYASTVQMQNQRGNNCDSSINTDDIPWVIDMPHFGKDVFGK 512
>gi|393910432|gb|EJD75879.1| TDP1 protein [Loa loa]
Length = 672
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 161/511 (31%), Positives = 226/511 (44%), Gaps = 92/511 (18%)
Query: 137 EQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQG-DIIVAI 195
E D+ K SE + DKL +V GL N + S ++++ + +I
Sbjct: 15 ECDDLESKGSEGKRMKQNCLMDKL----YFNKVVGLAEQYNVNAFSFAELLELISPVASI 70
Query: 196 LSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPI 250
N+M+D+ WLL P + + +I GE GT +K+ N + + L I
Sbjct: 71 HFNFMIDLRWLLTQYPGRLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVNVGRARLMI 130
Query: 251 SFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC--- 306
FGTHHSK + G V II+ TANL+ DWN K+Q F + +C
Sbjct: 131 PFGTHHSKISIFESNTGRVHIIIATANLLESDWNFKTQAF----FHCSGNELAAGDCPDR 186
Query: 307 ---GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
F+ DL+ YL K + L H +++ + S R++ SVPG H G
Sbjct: 187 NGSDFQTDLVKYLDEYKTSQ-DWGLIEHWRDRVS-----NIDLSQVKARVVYSVPGTHKG 240
Query: 364 SSLKKWGHMKLRTVLQECTFE----KGFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGF 418
L K+GH +LR +L+E + GF SLG+ + W+ + +S+S G
Sbjct: 241 VQLTKYGHPRLRVILKELFGDVKNMDGFTYHAQCSSLGSLGAAPQYWLTGQFLNSLSGGA 300
Query: 419 SEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASH 476
D GE L I++P VEDVR S EGYAAG + P S V + +L + KW + H
Sbjct: 301 ETD------GEHLRIIYPCVEDVRNSNEGYAAGGSFPYSNSVAVKQPYLLNFMHKWSSDH 354
Query: 477 TGRSRAMPHIKTFARYNGQKL--AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 534
GRSRAMPHIKT+A + L +W L+TSANLSKAAWG Q QL IRSYE G+L
Sbjct: 355 LGRSRAMPHIKTYAAFAKDSLKPSWLLITSANLSKAAWGDYQSKKPQLTIRSYEFGLLF- 413
Query: 535 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 594
SD + + Y
Sbjct: 414 --------------------------------------------SDPESLDMLPY----- 424
Query: 595 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 625
+LP +Y D W DK Y K D++ + WP
Sbjct: 425 DLPLTKYDDNDRVWIVDKTYRKPDIFRKTWP 455
>gi|357630668|gb|EHJ78636.1| hypothetical protein KGM_17628 [Danaus plexippus]
Length = 581
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 142/452 (31%), Positives = 220/452 (48%), Gaps = 67/452 (14%)
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLA-KIPHVLVIHGESDGTLEHMKRNK 237
+ + I D G++ ++ N+MVD WLL + +++GE L ++ K
Sbjct: 181 TLLEILDSSLGELKCSLQINFMVDAGWLLAHYYFAGYSAKKLTILYGEESAELRNISAKK 240
Query: 238 PANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ---- 292
P N H+ + FG HH+K MLL Y G +R++V TANL DW N++QGLW+
Sbjct: 241 P-NVEAHQVKMATPFGKHHTKMMLLCYEDGSLRVVVSTANLYMDDWENRTQGLWLSPSCP 299
Query: 293 DFPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P + ++ E GF+ L+DYL + P+ + + ++ +FS V
Sbjct: 300 QLPAESPSHSGESPTGFKRSLLDYLHHYRLPQLAVYV----------HRVQRCDFSHINV 349
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQ-ECTFEKGFKKS-PLVYQFSSLGSLDEKWMAE 409
L+ SVPG H +S WG +++ +L+ C +S PL+ Q SSLGS + +
Sbjct: 350 FLVCSVPGTHYSAS---WGFLRVGALLRAHCAVPPQETRSWPLIAQASSLGSYGKDPGSW 406
Query: 410 LSSSMSSGFSEDK-TPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDF 464
L+ F++ K P + P +++P++E+V+ S +G G +P S +V + +
Sbjct: 407 LTGDFLHHFTKIKDQPQTLTPPPDLKLIYPSLENVKSSHDGLLGGGCLPYSAAVHVKQPW 466
Query: 465 LKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQL 522
LK + +W+A H+ R RAMPHIK++ R + + A++LLTS N+SKAAWG K+ L
Sbjct: 467 LKDFLYQWRALHSERDRAMPHIKSYTRVSPDNSRAAFYLLTSGNVSKAAWGVRNKDGG-L 525
Query: 523 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAG 582
+ SYE GVL LP F S+ P
Sbjct: 526 RLMSYEAGVLFLPR-------FVINSDFFPL----------------------------- 549
Query: 583 ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRY 614
S + LPVPY+LPPQRYS + PW D Y
Sbjct: 550 CPSSALRLPVPYDLPPQRYSPDMSPWVSDYLY 581
>gi|170587939|ref|XP_001898731.1| Tyrosyl-DNA phosphodiesterase family protein [Brugia malayi]
gi|158592944|gb|EDP31539.1| Tyrosyl-DNA phosphodiesterase family protein [Brugia malayi]
Length = 454
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 131/357 (36%), Positives = 181/357 (50%), Gaps = 26/357 (7%)
Query: 192 IVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA-----NWILHKP 246
+ +I N+M+D+ WLL P + + +I GE GT + R N + +
Sbjct: 67 VASIHFNFMIDLRWLLEQYPARLRQGPITLIVGERMGTDFTLTRTAVKQCGVNNVTVGRA 126
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 305
L I FGTHHSK + G V I++ TANL+ DWN K+Q + + +N
Sbjct: 127 RLMIPFGTHHSKISIFESSTGRVHIVISTANLLENDWNFKTQAFYHCSGIERSADNRCNP 186
Query: 306 CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
G F+ D + YL+ K + G + N S R++ SVPG H G
Sbjct: 187 NGSDFQADFVKYLNEYKTSQ------DWGLIEYWRDRVASINLSHVKARIVYSVPGAHKG 240
Query: 364 SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFS 419
L K+GH +LR +L+E + QFSSLGSL + W+ + +S++ G
Sbjct: 241 VQLTKYGHPRLRVILKELFGNVKMDEFTYHVQFSSLGSLGAAPQYWLTGQFLNSLAGGAE 300
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTG 478
D L I++P VEDVR S EGY AG + P V + +L + KW+++H G
Sbjct: 301 TDGKHL-----RIIYPCVEDVRNSNEGYQAGGSFPYNNSVAVKQPYLLDFMYKWRSNHLG 355
Query: 479 RSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 533
RSRAMPHIKT+A + N K W L+TSANLSKAAWG Q +QL IRSYE GVL
Sbjct: 356 RSRAMPHIKTYAAFAKNSLKPLWLLVTSANLSKAAWGDYQLKKTQLTIRSYEFGVLF 412
>gi|307188952|gb|EFN73469.1| Probable tyrosyl-DNA phosphodiesterase [Camponotus floridanus]
Length = 666
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 137/439 (31%), Positives = 218/439 (49%), Gaps = 65/439 (14%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNK-PANW 241
I D G+I+ ++ N+MVD+ WL + + +++++GE + R K +N
Sbjct: 269 ILDRSLGEIVNSLHMNFMVDVGWLCLQYLLAGQRTDMMILYGE------RVDREKLGSNI 322
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL--- 296
+ +P+ FG HHSK M+ Y G+R++V TANL DW+N++QGLW+ PL
Sbjct: 323 TMIHVDMPVRFGCHHSKIMIFQYKDDGIRVVVSTANLYSDDWDNRTQGLWISPHLPLLPE 382
Query: 297 -KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
+ ++ GF+ DL YLS + P + + A ++ NFS+ V L+A
Sbjct: 383 SANPSDGESPTGFKKDLERYLSKYRHPALTQWIWA----------VRRANFSAVNVFLVA 432
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG H + + WGH KL VL + T + P+V Q SS+GSL + + LS +
Sbjct: 433 SVPGTHKDAEVDSWGHRKLAYVLSRHATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKDI 492
Query: 415 SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 471
S + T P ++P++E+ + S + +P S Q + + +++ Y +
Sbjct: 493 IPCMSRETTKGLKSHPNFQFIYPSIENYKHSFDCRNLSCCLPYSAQVHSKQQWIESYLYQ 552
Query: 472 WKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYEL 529
W+A T R RAMPHIK++ R + +++ WF+LTSANLSKAAWG +Q++N +M SYE
Sbjct: 553 WRAKRTRRDRAMPHIKSYTRISPDLKRIPWFVLTSANLSKAAWG-VQRSNHYIM--SYEA 609
Query: 530 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 589
GV+ +P K +T T + V
Sbjct: 610 GVIFIP---------------------------------KFITQTTTFPIEDEEDPAVPI 636
Query: 590 LPVPYELPPQRYSSEDVPW 608
P+PY+LP +RY S D P+
Sbjct: 637 FPIPYDLPLRRYDSSDSPF 655
>gi|402592672|gb|EJW86599.1| tyrosyl-DNA phosphodiesterase [Wuchereria bancrofti]
Length = 453
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 133/357 (37%), Positives = 181/357 (50%), Gaps = 26/357 (7%)
Query: 192 IVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-----LEHMKRNKPANWILHKP 246
+ +I N+M+D+ WLL P + + +I GE GT +K+ N I+ +
Sbjct: 66 VASIHFNFMIDLRWLLEQYPARLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVIVGRA 125
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 305
L I FGTHHSK + G V I++ TANL+ DWN K+Q + +N
Sbjct: 126 RLMIPFGTHHSKISIFESSTGRVHIVISTANLLENDWNFKTQAFYHCSGIELSADNRCNP 185
Query: 306 CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
G F+ D + YL+ K + G + N S R++ SVPG H G
Sbjct: 186 NGSDFQADFVKYLNEYKTSQ------DWGLIEYWRDRVASINLSHVKARVVYSVPGAHKG 239
Query: 364 SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELSSSMSSGFS 419
L K+GH +LR +L+E + QFSSLGSL + W+ + +S+S G
Sbjct: 240 VQLTKYGHPRLRVILKELFGNVKMDEFTYHAQFSSLGSLGAAPQYWLTGQFLNSLSGGAE 299
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTG 478
D L I++P VEDVR S EGY AG + P V + +L + KW++ H G
Sbjct: 300 TDGKHL-----RIIYPCVEDVRNSNEGYQAGGSFPYNNSVAVKQPYLLDFMHKWRSDHLG 354
Query: 479 RSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 533
RSRAMPHIKT+A + N K W L+TSANLSKAAWG Q +QL IRSYE GVL
Sbjct: 355 RSRAMPHIKTYAAFAKNSLKPLWLLVTSANLSKAAWGNYQLKKTQLTIRSYEFGVLF 411
>gi|291233547|ref|XP_002736713.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Saccoglossus
kowalevskii]
Length = 431
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 134/379 (35%), Positives = 203/379 (53%), Gaps = 43/379 (11%)
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQDE----QDNENGKNSEEALCNFHVSRDKLPSTFRL 166
++S KR +D + LS KK R +DE + ++ ++ E + + + P F L
Sbjct: 60 NQSNKRRRSDEQPSSHLSCKKSRTEDESPQSKKSKTQSSTSEKMSPYENYIEAAPLNFFL 119
Query: 167 LRVQGLPAWANTS-CVSIRDVI---QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVI 222
+V G+P N+S V I+D++ G++I + NYM DI WL+ P + +L+I
Sbjct: 120 TKVFGIPNHYNSSLAVGIKDILSASMGNLISSAQFNYMFDIPWLVQQYPEQFRSKPLLII 179
Query: 223 HG--ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
HG +D T H ++ N L + L I +GTHHSK M L+Y G+R+++HTAN+IH
Sbjct: 180 HGSQRADKTTLHENAHRYPNITLCQAKLDIMYGTHHSKMMFLLYDNGMRVVIHTANIIHN 239
Query: 281 DWNNKSQGLWMQD-FP-LKDQNNLSEE---CGFENDLIDYLSTLKWPEFSANLPAHGNFK 335
DW K+QG+W+ FP L +LS+ F DL++YL A+G K
Sbjct: 240 DWYQKTQGVWISPLFPKLASDQDLSQGDSVTQFRKDLLEYLG------------AYGTNK 287
Query: 336 INPSF---FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-P 391
+ ++ + SSA V +I SVPG HTG+S KWGH+KLR VLQE + K P
Sbjct: 288 HLQEWQETIRQHDMSSAKVFIIGSVPGRHTGASKMKWGHLKLRKVLQEHGPDGSTVKDWP 347
Query: 392 LVYQFSSLGS--------LDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 443
++ QFSS+GS L +W+ LS+ ++G + P + +++P VE+VR S
Sbjct: 348 VIGQFSSVGSLGSGPENWLSSEWLESLSTVQANGIVKLSKP----KLNLIFPCVENVRRS 403
Query: 444 LEGYAAGNAIPSPQKNVDK 462
LEGY AG ++P KN K
Sbjct: 404 LEGYPAGASLPYSIKNARK 422
>gi|67609723|ref|XP_667058.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54658157|gb|EAL36834.1| hypothetical protein Chro.70273 [Cryptosporidium hominis]
Length = 511
Score = 191 bits (486), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 141/448 (31%), Positives = 223/448 (49%), Gaps = 66/448 (14%)
Query: 195 ILSNYMVDIDWLLPACP----VLAKIPHVLVIHGESDGTLEHMKRNKPANWIL----HKP 246
+ S+Y+ D++W++ + I +L + D + +N + P
Sbjct: 92 LFSSYLADVNWVINEIGDSELICENIESILFVSHGFDNPQNYKLKNCNKMKNKKISTYSP 151
Query: 247 PLPISFGTHHSKAMLLIY-----PRG--VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ 299
L + +G H K +LL++ P+ VR +V +ANLI DW K Q +W+QDF
Sbjct: 152 YLKVPYGVFHPKFILLVFEHLVQPKKNFVRFVVTSANLIQQDWELKIQSIWVQDF---FH 208
Query: 300 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSAAVRLIASVP 358
N ++C F +DYL EF N+ K S ++FNF A V+L+ASVP
Sbjct: 209 NIERKDCEF----LDYLQ-----EFLKNILNGSKLKDFWLSKVQEFNFEDATVKLVASVP 259
Query: 359 GYHTGSSLKKWGHMKLRTVL-------QECTFEKGFKKSPLVYQFSSLGSLDEKWM-AEL 410
GY G + WGH+++R+++ Q + E G K+ ++ QFSSLG + EKW+ EL
Sbjct: 260 GYFFGDEMFMWGHLRVRSLIKRFVSKDQGKSDELGEKRERIILQFSSLGRISEKWLYTEL 319
Query: 411 SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYW 469
+SS+S + P G L I++PTVE V S+EG G ++P ++ + K ++KK
Sbjct: 320 ASSLS------EIP---GTKLEIIFPTVEQVVNSIEGINGGGSLPVKKEYICKPWIKKLL 370
Query: 470 AKWKASHTGRS----RAMPHIKTFARY----NGQKLAWFLLTSANLSKAAWGALQKNNSQ 521
KW ++ + +PHIKTF +Y N K+ W + S NLS AAWG +QK+ SQ
Sbjct: 371 HKWGTGTMEKNATDEKVIPHIKTFLKYKIFDNAIKIIWLVQGSYNLSNAAWGQIQKDGSQ 430
Query: 522 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 581
IR+YELG+ I H F +E E + + ++ +A
Sbjct: 431 FCIRNYELGIFI------HKDQFEFERYFKLNE------EFPKFFWKRKSNFSFISEINA 478
Query: 582 GASSEVVYLPVPYELPPQRYSSEDVPWS 609
++ P+P++LPP+RYS+ D PW+
Sbjct: 479 NKPIRLLNFPLPFKLPPKRYSNSDHPWN 506
>gi|302422748|ref|XP_003009204.1| tyrosyl-DNA phosphodiesterase [Verticillium albo-atrum VaMs.102]
gi|261352350|gb|EEY14778.1| tyrosyl-DNA phosphodiesterase [Verticillium albo-atrum VaMs.102]
Length = 527
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 156/513 (30%), Positives = 238/513 (46%), Gaps = 82/513 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IPH 218
PS F+L ++ LP +N V+++D++ +I N++ DI +L+ + +
Sbjct: 43 PSPFQLTHIRDLPTSSNADAVTLKDLLGDPLISECWEFNFLHDIPFLMSHFDEDTRDLVK 102
Query: 219 VLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 272
V V+HG DG ++ A N LH P+P FGTHH+K M+L + ++I+
Sbjct: 103 VHVVHGFWKREDGNRVALQEEAAAWKNVELHTAPMPEMFGTHHTKMMILFRHDDTAQVII 162
Query: 273 HTANLIHVDWNNKSQGLWMQDF-PLKDQNN-----------LSEECG----FENDLIDYL 316
HTAN+I DW N + G+W PL Q N +E+ G F++DL+ YL
Sbjct: 163 HTANMIAKDWTNMTNGVWRSPLLPLGPQPNGGKLEDGEVYEANEDFGSGRKFKSDLLRYL 222
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKL 374
+ + ++ +++F+ LIASVPG H +S WG L
Sbjct: 223 RAYDARKIT--------LRLLTEQLARYDFAGVRAVLIASVPGRHAIHDTSQTAWGWPAL 274
Query: 375 RTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL---SSSMSSGFSEDKTPLGIGE 429
+ L+ + G KS +V Q SS+ +L + W+ + S S+S G S P
Sbjct: 275 KRALRRVPVQTG--KSEIVVQISSIATLGGTDSWLQKCLFDSLSLSKGSSIGPRPAF--- 329
Query: 430 PLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS---------- 475
+V+PT +++R SL+GYA+G +I SPQ+ +LK + W
Sbjct: 330 -KVVFPTADEIRRSLDGYASGGSIHTKIASPQQAKQLAYLKSIFCHWANDAPGGKELSKD 388
Query: 476 ----HTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
GR RA PHIKT+ RY Q + W LLTSANLSK AWG ++ I S+E GV
Sbjct: 389 TLLRDAGRQRAAPHIKTYIRYGTQSIDWALLTSANLSKQAWGEAASAAQEVRIASWEAGV 448
Query: 532 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS-EVVYL 590
L+ PS + +G+ E + + K S A +S+ VV L
Sbjct: 449 LVWPS------------------LVTGTDEATMVGTFKTDAPGEEAPSSAPSSTGNVVGL 490
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 623
+PY LP Q Y +++PW K D G+V
Sbjct: 491 RMPYSLPLQLYGKDEIPWVLRMSIPKPDWAGRV 523
>gi|383853604|ref|XP_003702312.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Megachile
rotundata]
Length = 701
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 140/450 (31%), Positives = 225/450 (50%), Gaps = 73/450 (16%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N+MVD+ WL + + +L+++G+ ++ K + N
Sbjct: 308 ILDRSLGEIVNSLHINFMVDVGWLCLQYLLAGQRTDMLILYGDR---VDEEKLS--LNIT 362
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQ 299
+ +P FG HH+K M+L Y G+R++V TANL DW N++QGLW+ PL +
Sbjct: 363 MIPVQMPTKFGCHHTKIMILKYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPPLPES 422
Query: 300 NNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
N ++ GF+ DL+ YL+ + P + A ++ +FSS V IAS
Sbjct: 423 ANTNDGESPTGFKKDLLLYLNKYRQPAITEWTSA----------VRRADFSSVNVFFIAS 472
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLD---EKWM-AELS 411
VPG H G WGH KL VL + T + LV Q SS+GSL E W+ E++
Sbjct: 473 VPGRHKGVEYDSWGHRKLGYVLSKHATLPPDAPRWTLVAQSSSIGSLGPSYESWLLKEIT 532
Query: 412 SSMSSGFSEDKTPLGI-GEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKK 467
SSMS ++P + P ++P++ + + S + +P S Q + +++++
Sbjct: 533 SSMSK-----ESPSNLKSHPNFQFIYPSINNYKQSFDCRVGSCCLPYSLQTHSKQEWIES 587
Query: 468 YWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIR 525
Y +WKA+ T R +AMPHIK++ R+ + +K+ WF+LTSANLSKAAWG + K++ +M
Sbjct: 588 YMYQWKATRTARDKAMPHIKSYTRFSPDMKKIPWFVLTSANLSKAAWGTVGKDSHYIM-- 645
Query: 526 SYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 585
+YE GV+ +P F S P + +
Sbjct: 646 NYEGGVIFIPK-------FIIGSTTFPVQEEENG-------------------------- 672
Query: 586 EVVYLPVPYELPPQRYSSEDVPWSWDKRYT 615
V P+PY+LPP +Y S D P+ + Y+
Sbjct: 673 -VPVFPIPYDLPPTKYQSGDKPFVMEFFYS 701
>gi|66362892|ref|XP_628412.1| tyrosyl-DNA phodphodiesterase 1 (tdp1) [Cryptosporidium parvum Iowa
II]
gi|46229443|gb|EAK90261.1| tyrosyl-DNA phodphodiesterase 1 (tdp1) [Cryptosporidium parvum Iowa
II]
Length = 511
Score = 189 bits (479), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 138/447 (30%), Positives = 219/447 (48%), Gaps = 64/447 (14%)
Query: 195 ILSNYMVDIDWLLPACP----VLAKIPHVLVIHGESDGTLEHMKRNKPANWIL----HKP 246
+ S+Y+ D++W++ + I +L + D + +N + P
Sbjct: 92 LFSSYLADVNWVINEIGDSELICENIESILFVSHGFDNPQNYKLKNFNKVKNKKISTYSP 151
Query: 247 PLPISFGTHHSKAMLLIY-----PRG--VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ 299
L + +G H K +LL++ P+ VR +V +ANLI DW K Q +W+QDF +
Sbjct: 152 YLKVPYGVFHPKFILLVFEHLVQPKKNFVRFVVTSANLIQQDWELKIQSIWVQDFFHSIE 211
Query: 300 NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSAAVRLIASVP 358
++C F +DYL EF N+ K S ++FNF A V+L+ASVP
Sbjct: 212 R---KDCEF----LDYLQ-----EFLKNILNGSKLKDFWLSKVQEFNFEDATVKLVASVP 259
Query: 359 GYHTGSSLKKWGHMKLRTVL-------QECTFEKGFKKSPLVYQFSSLGSLDEKWM-AEL 410
GY G + WGH+++R+++ Q+ + E K+ +V QFSSLG + EKW+ EL
Sbjct: 260 GYFFGDEMFMWGHLRVRSLIKRFVSKDQDKSDELREKRERIVLQFSSLGRISEKWLYTEL 319
Query: 411 SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWA 470
+SS+S + E I++PTVE V S+EG G ++P ++ + K ++KK
Sbjct: 320 ASSLSE--------IPGTELEIIFPTVEQVVNSIEGINGGGSLPVKKEYICKPWIKKLLH 371
Query: 471 KWKASHTGRS----RAMPHIKTFARY----NGQKLAWFLLTSANLSKAAWGALQKNNSQL 522
KW ++ + +PHIKTF +Y N K+ W + S NLS AAWG +QK+ SQ
Sbjct: 372 KWGTGTMKKNATDEKVIPHIKTFLKYKIFGNAIKIIWLVQGSYNLSNAAWGQIQKDGSQF 431
Query: 523 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAG 582
IR+YELG+ I F P + S I + +A
Sbjct: 432 CIRNYELGIFIHKDQFEFERYFKLNEEF-PKFFWKRKSNCSLISEI-----------NAN 479
Query: 583 ASSEVVYLPVPYELPPQRYSSEDVPWS 609
+ ++ P+P++LPP+RYS+ D PW+
Sbjct: 480 QPNVLLNFPLPFKLPPKRYSNSDHPWN 506
>gi|332029124|gb|EGI69135.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 667
Score = 188 bits (478), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 138/439 (31%), Positives = 217/439 (49%), Gaps = 65/439 (14%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N+MVD+ WL + + +++++G+ + R K N I
Sbjct: 273 ILDRSLGEIVNSLHLNFMVDVGWLCLQYLLAGQCTDMMILYGD------RVDREKLNNNI 326
Query: 243 -LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKD 298
+ + +P FG HH+K M+L Y G+R++V TANL DW N++QGLW+ P L +
Sbjct: 327 TMIEVDMPTKFGCHHTKIMILQYKDDGIRVVVSTANLYSDDWENRTQGLWISPHLPRLPE 386
Query: 299 QNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
N S+ GF+ DL Y + + P + + A ++ +FS V L+A
Sbjct: 387 SANPSDGESPTGFKKDLERYFNKYRHPALTQWICA----------IRRADFSDVNVFLVA 436
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG H + WG+ KL VL T + P+V Q SS+GSL + + LS +
Sbjct: 437 SVPGTHKDNEADSWGYKKLAHVLSRYATLPPDAPQWPIVAQSSSIGSLGPNFESWLSKDI 496
Query: 415 SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 471
S + T P ++P++E+ + S + +P S + + + +++ Y +
Sbjct: 497 IPCMSRETTKGLKSHPHFQFIYPSIENYKQSFDCRNLSCCLPYSTKVHSKQQWIESYLYQ 556
Query: 472 WKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYEL 529
WKA TGR RAMPHIK++ R + ++++WF+LTSANLSKAAWG +Q+NN +M SYE
Sbjct: 557 WKAKRTGRDRAMPHIKSYTRISPDLKRISWFVLTSANLSKAAWG-VQRNNHYIM--SYEA 613
Query: 530 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 589
GV+ +P KL+T T + V
Sbjct: 614 GVIFIP---------------------------------KLITGTTTFPIEEEEDPAVPV 640
Query: 590 LPVPYELPPQRYSSEDVPW 608
P+PY+LP RY S D P+
Sbjct: 641 FPIPYDLPLCRYESSDSPF 659
>gi|346323354|gb|EGX92952.1| tyrosyl-DNA phosphodiesterase [Cordyceps militaris CM01]
Length = 515
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 160/521 (30%), Positives = 243/521 (46%), Gaps = 89/521 (17%)
Query: 154 HVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP- 211
H S D + S FRL ++ L +N +++ D++ +I + NY DI +L+
Sbjct: 32 HKSVDTVSSPFRLTWIRDLDEESNQDAITLTDLLGDPLISECWNFNYQHDIPFLMGTFDR 91
Query: 212 -VLAKIPHVLVIHG---ESDGT---LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 264
+ A + V V+HG DG L + P N LH P+P FGTHHSK ML+++
Sbjct: 92 DIRAHV-QVHVVHGFWKREDGNRLRLVEQAEHFP-NVKLHVAPMPEMFGTHHSK-MLIVF 148
Query: 265 PRG--VRIIVHTANLIHVDWNNKSQGLWM-----------QDFPLKDQNNLSEECGFEND 311
R ++I+HTAN+I DW N + W+ +D P + F+ D
Sbjct: 149 RRDDTAQVIIHTANMIAKDWTNMTNAAWISPILPKLNTAPKDSPRPENMTPGSGPRFQFD 208
Query: 312 LIDYLSTLKWPEFSANLPAHGNFKINPSF------FKKFNFSSAAVRLIASVPG---YHT 362
L+ YL++ ++ P+ K ++FSS L+ASVPG HT
Sbjct: 209 LLSYLTSYD--------------RMRPTCTGLVQSLKVYDFSSVKGSLVASVPGTHEVHT 254
Query: 363 GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWM-AELSSSMSSGFS 419
+ WG + L++ + G KS + Q SS+ +L ++ W+ L ++S G S
Sbjct: 255 EAGATAWGWSAMGKCLEQIPCQAG--KSEVTVQVSSIATLGGNDGWLRGTLFKALSKGKS 312
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS 475
T + +V+PT +++R SL+GYA+G +I S Q+ + +L+ + W A
Sbjct: 313 A-TTAAAAPQFKVVFPTADEIRASLDGYASGGSIHTKIQSKQQEMQLRYLRPIFHYWMAD 371
Query: 476 HT----------GRSRAMPHIKTFARYNGQK-LAWFLLTSANLSKAAWGALQKNNSQLMI 524
GR RA PHIKT+ R N + + W L+TSANLSK AWG K Q I
Sbjct: 372 DASKAASSFRDAGRDRAAPHIKTYIRTNEKNTMDWALVTSANLSKQAWGEAAKPTGQFRI 431
Query: 525 RSYELGVLILPSA-KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGA 583
S+E+GVL+ PS K+ C + VP GS E Q+ G
Sbjct: 432 ASWEIGVLVWPSLFKKDAIMKGCFKSDVP-----GSAEGHGGQR--------------GE 472
Query: 584 SSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ VV +PY LP ++YS E +PW + K+D GQ W
Sbjct: 473 AETVVGFRMPYSLPLRKYSREAMPWVATMSHEKEDCLGQSW 513
>gi|50550131|ref|XP_502538.1| YALI0D07590p [Yarrowia lipolytica]
gi|49648406|emb|CAG80726.1| YALI0D07590p [Yarrowia lipolytica CLIB122]
Length = 471
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 151/509 (29%), Positives = 234/509 (45%), Gaps = 89/509 (17%)
Query: 150 LCNFHVSRDKLPST-----FRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDI 203
+ N V R K+ S +L + LP NT V ++D+I + A+ N+M+D+
Sbjct: 1 MDNDRVKRRKVESESDNGRTQLTAITALPDEENTGSVHLKDLIGSPHLEAMWQFNFMIDL 60
Query: 204 DWLLPAC--PVLAKIPHVLVI---HGESDGTLEHMKRNKPA--NWILHKPPLPISFGTHH 256
++L ++ I V+ GE ++ P N + + L F THH
Sbjct: 61 AFVLDNIHKNAMSNIKCRFVMGDFSGEKIAAFRAQAKSLPIADNIEVGRAKLSNLFATHH 120
Query: 257 SKAMLLIY-----PRGVRIIVHTANLIHVDWNNKSQGLWM-QDFPLKDQNNLSEECG-FE 309
+K M+L + R ++++HTAN+IH DW+N +QG+W Q K + N FE
Sbjct: 121 TKMMVLFFKEDKGERSAQVVIHTANMIHHDWDNMTQGVWKSQKVKEKRKTNTEGSTSTFE 180
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 369
DL+ YLS + S + F ++F++SS R++ SVPG H KKW
Sbjct: 181 TDLVAYLSEYQLDTTSKLI----------KFLQRFDWSSETARVVGSVPGTHKD---KKW 227
Query: 370 GHMKLRTVLQECTFE-----KGFKKSPLVYQFSSLGSL--DEKWMA-ELSSSMSSGFSED 421
G ++ +L E + +G + +V Q SS+GSL +KW+ +L ++ D
Sbjct: 228 GLTRVADLLDEHKEDHKSDYEGSESDTIVLQSSSIGSLGVTDKWITPQLVGALDGRSPRD 287
Query: 422 KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASHT 477
+ G+ IVWPTVE+VR S +GY G +I S ++K+ WKA +
Sbjct: 288 RDGHGLPASQIVWPTVENVRRSFDGYDLGMSIHFKNESDTHRKQYAYMKERMNVWKADNK 347
Query: 478 GRSRAMPHIKTFARYN-GQKLAWFLLTSANLSKAAWGALQ-KNNSQLMIRSYELGVLILP 535
R+RAMPHIKT+ R+ KL W LLTSAN+SK AWG++ S+ I S+ELGVL+ P
Sbjct: 348 HRTRAMPHIKTYTRFTRAGKLRWVLLTSANISKYAWGSVSAAKESKFSIPSWELGVLLFP 407
Query: 536 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 595
A F ++ +PY+
Sbjct: 408 QAVGKAV-FDLKDSV-----------------------------------------IPYD 425
Query: 596 LPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
P YS++D PW+ + + +KD G W
Sbjct: 426 WPLTNYSAKDEPWTKNADHLEKDTNGFPW 454
>gi|429856258|gb|ELA31180.1| tyrosyl-dna phosphodiesterase [Colletotrichum gloeosporioides Nara
gc5]
Length = 517
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 152/509 (29%), Positives = 244/509 (47%), Gaps = 82/509 (16%)
Query: 159 KLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAK- 215
++ S F+L ++ LP AN V+++D++ GD ++A NY+ DI +L+ K
Sbjct: 45 RIKSPFQLTWIRDLPEPANRDAVALKDIL-GDPLIAECWEFNYLHDIHFLMSHFDEDTKS 103
Query: 216 IPHVLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
+ V V+HG D ++ A N LH +P FGTHHSK M+L+ + +
Sbjct: 104 LVKVHVVHGFWKREDPNRLALQEEASAYSNVELHGAYMPEMFGTHHSKMMILVRHDDSAQ 163
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDFPL------KDQNNLSEECG----FENDLIDYLSTL 319
+++HTAN+I DW N + +WM PL KD + + G F++DL+ YL
Sbjct: 164 VVIHTANMIAKDWTNMTNAVWMS--PLLRLLKEKDSTSCEDAIGTGQRFKHDLLSYLKA- 220
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTV 377
++ P + +++FSS LIASVPG H+ +S WG L+ V
Sbjct: 221 ----YNVRRPTLRDLVDK---LSQYDFSSVKAALIASVPGRHSIHDTSQTSWGWPALKHV 273
Query: 378 LQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPL-IV 433
L+ + G KS +V Q SS+ +L + W+ + L + +S S DK P +V
Sbjct: 274 LRHVPVQDG--KSEIVVQISSIATLGATDNWIQKCLFNPLSE--SSDKGPKKTKPTFKVV 329
Query: 434 WPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------------KAS 475
+PT +++R SL+GYA+G +I S Q+ +L ++ W
Sbjct: 330 FPTADEIRRSLDGYASGGSIHTKIQSQQQAKQLAYLHPFFCHWGNDAPNGKALPETATVR 389
Query: 476 HTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
GR RA PHIKT+ RY + + W L+TSAN+SK AWG + + ++ I S+E+GVL+ P
Sbjct: 390 EAGRKRAAPHIKTYIRYGEKSIDWALVTSANISKQAWGEVAGASQEVRIASWEIGVLVWP 449
Query: 536 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 595
T +++ S +TE S+ VV + +PY
Sbjct: 450 EMMAEKATMMST---FQTDLPSNNTE---------------------GSNPVVGVRIPYN 485
Query: 596 LPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
LP Q Y+ +++PW + + D G+ W
Sbjct: 486 LPLQHYAKDEIPWVATMAHAEPDNMGRFW 514
>gi|414886955|tpg|DAA62969.1| TPA: hypothetical protein ZEAMMB73_115946 [Zea mays]
Length = 140
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 94/145 (64%), Positives = 106/145 (73%), Gaps = 6/145 (4%)
Query: 483 MPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 542
MPHIKTF RY+GQ +AWFLLTSANLSKAAWGALQKNN+QLMIRSYELGVL LP +
Sbjct: 1 MPHIKTFTRYSGQNIAWFLLTSANLSKAAWGALQKNNTQLMIRSYELGVLFLPQTLQSVP 60
Query: 543 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 602
FSCT I+ G I KTKLVTL W G + +V LPVPY+LPPQ Y
Sbjct: 61 QFSCTEK--SRSIRDGVALGKTI-KTKLVTLCWKGDEE---DPSIVRLPVPYQLPPQPYG 114
Query: 603 SEDVPWSWDKRYTKKDVYGQVWPRH 627
++DVPWSWD+RYTKKDVYG VWPR+
Sbjct: 115 TQDVPWSWDRRYTKKDVYGSVWPRY 139
>gi|380026209|ref|XP_003696847.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Apis
florea]
Length = 695
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 149/451 (33%), Positives = 220/451 (48%), Gaps = 89/451 (19%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--AN 240
I D+ G+I+ ++ N+MVDI WL + + ++ ++ GE T P +N
Sbjct: 301 ILDISLGEIVNSLHINFMVDIGWLHVQYMLAEQNTNMSILLGERVDT-------GPVGSN 353
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLK 297
+P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL
Sbjct: 354 VTTFYVDMPTKFGCHHTKIMILKYKDDGIRVVVSTANLYMDDWENRTQGVWISPHLPPLS 413
Query: 298 DQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
+ N SE GF+ DL YL+ + P + A ++ +FSS V +
Sbjct: 414 ESANSSEGESPTGFKKDLERYLNRYRQPGITEWTCA----------VRRADFSSVNVFFL 463
Query: 355 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP------LVYQFSSLGSLD---EK 405
ASVPG HT WGH KL ++L K K P LV Q SS+GSL E
Sbjct: 464 ASVPGRHTDMEYDSWGHRKLGSILS-----KHAKLPPDAPQWILVAQSSSIGSLGPNYES 518
Query: 406 WMA-ELSSSMSSGFSEDKTPLGI-GEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNV 460
W+ E++SSMS + P+G+ P ++P++ + + S + +P S Q +
Sbjct: 519 WLQKEITSSMSK-----ENPVGLKSHPNFHFIYPSLNNYKQSFDCRVGSCCLPYSLQTHS 573
Query: 461 DKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKN 518
+ +++ Y +WKA TGR RAMPHIKT+ R + +++ WF+LTSANLSKAAWG + KN
Sbjct: 574 KQKWIESYMYQWKAKQTGRDRAMPHIKTYTRISPDLKRIPWFVLTSANLSKAAWGTVGKN 633
Query: 519 NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVP-SEIKSGSTETSQIQKTKLVTLTWHG 577
+ +M +YE GV+ +PS F S+ P E + G
Sbjct: 634 SHYIM--NYEGGVVFIPS-------FITGSSTFPIKEEEPG------------------- 665
Query: 578 SSDAGASSEVVYLPVPYELPPQRYSSEDVPW 608
V PVPY+LP RY D P+
Sbjct: 666 ---------VPIFPVPYDLPLTRYEKNDSPF 687
>gi|340960785|gb|EGS21966.1| hypothetical protein CTHT_0038420 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 487
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 156/508 (30%), Positives = 228/508 (44%), Gaps = 76/508 (14%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IP 217
+PS FRL R++ LPA N V+++D++ +I NYM DID+L+ A + +
Sbjct: 10 IPSPFRLTRIRDLPANLNQDTVTLKDLLGDPLISECWEFNYMHDIDFLMSAFDEDTRHLV 69
Query: 218 HVLVIHG------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRI 270
V V+HG S TL P N LH +P FGTHHSK M+L+ + RI
Sbjct: 70 KVHVVHGFWKREDLSRVTLHEQAARYP-NVALHAAYMPEMFGTHHSKMMILLRHDDTARI 128
Query: 271 IVHTANLIHVDWNNKSQGLWMQDF-PL----KDQNNLSEE-----CGFENDLIDYLSTLK 320
++HTAN+I DW N +Q +WM + PL Q N+ E F+ DL++YL
Sbjct: 129 VIHTANMIVRDWTNMTQAVWMSPWLPLMKGPSQQENVHEAKPGSGAKFKVDLLNYLRAYD 188
Query: 321 WPEFSANLPAHGNFKINPSFFK--KFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLRT 376
+ G P K +F+FS LIASVPG H SS +WG +
Sbjct: 189 ---------SRGRETCKPIIEKLMRFDFSEVKGALIASVPGRHKLNDSSPTRWGWAAMEQ 239
Query: 377 VLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP----LI 432
L+ + + + + ++LG D S ++S G + + +P +
Sbjct: 240 ALKTVPVHQQAEIAIQISSIATLGPTDNWLKNTFSRALSGGRG-----VSLSQPPPSFKV 294
Query: 433 VWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY 492
++PT +++R SL+GYA+G +I + ++ + + K +GR RA PHIKT+ RY
Sbjct: 295 IFPTADEIRKSLDGYASGGSIHTKIQSPQQVKQLQQADKSAVLDSGRKRAAPHIKTYIRY 354
Query: 493 NG---QKLAWFLLTSANLSKAAWG-------------ALQKNNSQLMIRSYELGVLILPS 536
Q + W LLTSANLSK AWG + ++ I SYE+GVL+ P
Sbjct: 355 GNKSHQTIDWALLTSANLSKQAWGEAASAPGGSKGKSTASSGDREVRIASYEIGVLVWPE 414
Query: 537 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYEL 596
T G T Q K V L +PY L
Sbjct: 415 LWGEDAAMKATFMTDNLGDSRGGEFTEQEGKV------------------TVALRMPYSL 456
Query: 597 PPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
P Q Y + +VPW + + D GQVW
Sbjct: 457 PLQPYDNAEVPWVATTNHEEPDWMGQVW 484
>gi|310798351|gb|EFQ33244.1| tyrosyl-DNA phosphodiesterase [Glomerella graminicola M1.001]
Length = 517
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 152/514 (29%), Positives = 244/514 (47%), Gaps = 89/514 (17%)
Query: 159 KLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAK- 215
++ S F+L R++ LP AN V+++D++ GD ++A N++ DI +L+ A+
Sbjct: 42 RIRSPFQLTRIRDLPEAANRDTVALKDIL-GDPLIAECWEFNFLHDIHFLMSHFDADARD 100
Query: 216 IPHVLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
+ V V+HG D ++ A N LH +P FGTHHSK M+LI + +
Sbjct: 101 LVKVHVVHGFWKREDPNRLALQEEADAYPNVELHSAFMPEMFGTHHSKMMILIRHDDSAQ 160
Query: 270 IIVHTANLIHVDWNNKSQGLW------------MQDFPLKDQNNLSEECGFENDLIDYLS 317
+++HTAN+I DW N + +W ++D P D + E F++DL+ YL
Sbjct: 161 VVIHTANMIAKDWTNMTNAVWRSPMLPLLPNNYVEDAPTNDHPFGTGE-RFKHDLLGYLR 219
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLR 375
++A P K ++FSS +LIASVPG H +S WG L+
Sbjct: 220 A-----YNARRP---TLKSLVDQICHYDFSSVRAKLIASVPGRHPIHDTSQTAWGWPALK 271
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-----LSSSMSSGFSEDKTPLGIG 428
L+ ++G KS +V Q SS+ +L + W + L+ S ++ S + +
Sbjct: 272 RALRSVPVQEG--KSEVVVQVSSIATLGSSDSWTQKCLFDSLAVSKNNSSSNPRPKFKV- 328
Query: 429 EPLIVWPTVEDVRCSLEGYAAGNAIPSP----QKNVDKDFLKKYWAKWK----------- 473
V+PT +++R SL+GYA+G +I + Q+ +L+ + W
Sbjct: 329 ----VFPTADEIRRSLDGYASGGSIHTKIQSQQQAKQLQYLRSMFCHWANDAPDGEPLPE 384
Query: 474 ---ASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
GR RA PHIKT+ RY + + W L+TSAN+SK AWG + + ++ I S+E+G
Sbjct: 385 TATIREAGRQRAAPHIKTYIRYGEKSIDWALVTSANISKQAWGEAARPSQEVRIASWEIG 444
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL+ PS I G+ E+ QK DAG VV +
Sbjct: 445 VLVWPSI------------IAEKATMIGAFESDMPQK------------DAGDGDPVVGI 480
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+PY +P Q Y +++PW +T+ D G+ W
Sbjct: 481 RIPYSIPLQSYGKDEIPWVASMVHTEPDSMGRFW 514
>gi|346970364|gb|EGY13816.1| tyrosyl-DNA phosphodiesterase [Verticillium dahliae VdLs.17]
Length = 527
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 153/513 (29%), Positives = 234/513 (45%), Gaps = 82/513 (15%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IPH 218
PS F+L ++ LP +N V+++D++ +I N++ DI +L+ + +
Sbjct: 43 PSPFQLTHIRDLPDSSNADTVTLKDLLGDPLISECWEFNFLHDIPFLMSHFDKDTRDLVK 102
Query: 219 VLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 272
V V+HG DG ++ A N LH P+P FGTHH+K M+L + ++I+
Sbjct: 103 VHVVHGFWKREDGNRMALQEEAAAWKNLELHNAPMPEMFGTHHTKMMILFRFDDTAQVII 162
Query: 273 HTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG---------------FENDLIDYL 316
HTAN+I DW N + G+W PL Q + + F++DL+ YL
Sbjct: 163 HTANMIAKDWTNMTNGVWRSPLLPLGPQPDSGKPEAEEESEADEDFGSGRKFKSDLLSYL 222
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKL 374
+ + + K++F+ IASVPG H +S WG L
Sbjct: 223 RAYDARKIT--------LRPLTEQLVKYDFAGIRAVFIASVPGRHAIHDTSQTAWGWPAL 274
Query: 375 RTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAEL---SSSMSSGFSEDKTPLGIGE 429
+ L+ + G KS +V Q SS+ +L + W+ + S S+S G S P
Sbjct: 275 KRALRRVPVQAG--KSEVVVQISSIATLGGTDSWLQKCLFDSLSLSKGSSISPRPAF--- 329
Query: 430 PLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK------------ 473
+V+PT +++R SL+GYA+G +I SPQ+ +LK + W
Sbjct: 330 -RVVFPTADEIRRSLDGYASGGSIHTKIASPQQAKQLAYLKPIFCHWANDAPGGKEISKD 388
Query: 474 --ASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
GR RA PHIKT+ RY Q + W LLTSANLSK AWG ++ I S+E GV
Sbjct: 389 TALQDAGRQRAAPHIKTYIRYGTQSIDWALLTSANLSKQAWGEAASAAQEVRIASWEAGV 448
Query: 532 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS-EVVYL 590
L+ PS + +G+ E + K S A +S+ VV L
Sbjct: 449 LVWPS------------------LVAGTDEAIMVGTFKTDAPGEEAPSGAPSSTGNVVGL 490
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 623
+PY LP Q Y +++PW +T+ D G+V
Sbjct: 491 RMPYSLPLQLYGKDEIPWVASNEHTEPDWAGRV 523
>gi|367053563|ref|XP_003657160.1| hypothetical protein THITE_2122630 [Thielavia terrestris NRRL 8126]
gi|347004425|gb|AEO70824.1| hypothetical protein THITE_2122630 [Thielavia terrestris NRRL 8126]
Length = 548
Score = 184 bits (468), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 151/515 (29%), Positives = 231/515 (44%), Gaps = 79/515 (15%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHV 219
S F+L +++ LP N +++D++ +I NY+ DID+L+ A P + + V
Sbjct: 63 SPFKLTKIRDLPPELNRDTTTLKDILGDPLISECWEFNYLHDIDFLMAAFDPDVRGLVQV 122
Query: 220 LVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
V+HG E LE ++ N LH +P FGTHHSK M+L+ + +I++H
Sbjct: 123 HVVHGFWKREDPSRLELQAAASRYENVTLHNAYMPEMFGTHHSKMMILLRHDDTAQIVIH 182
Query: 274 TANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEE-----CGFENDLIDYLSTLKWPE 323
TAN+I DW N +Q +W+ P + N +E F+ D ++YL +
Sbjct: 183 TANMIVRDWTNMTQAVWLSPRLPLIKPAQQAVNQAEARTGSGAKFKMDFLNYLRSYD--- 239
Query: 324 FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS--SLKKWGHMKLRTVLQEC 381
K +++FS LIASVPG H S S +WG + L+
Sbjct: 240 -----TRKSTCKPIIEQLLRYDFSEIRASLIASVPGRHKFSENSPTRWGWAAMEEALKAV 294
Query: 382 TFEKGFKKSPLVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
+ KS + Q SS+ +L + W+ + ++S G P + +V+PT +
Sbjct: 295 PVSQA--KSEIAIQISSIATLGPTDSWLKDTFFRALSRGRRGTGPPSAPPDFKVVFPTPD 352
Query: 439 DVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK--------------ASHTGRS 480
++R SL+GYA+G +I SPQ+ +L+ W GR
Sbjct: 353 EIRKSLDGYASGGSIHTKIQSPQQVKQLQYLRPMLCHWANDSPHGVELEAGAAVQEAGRK 412
Query: 481 RAMPHIKTFARYNGQ-------KLAWFLLTSANLSKAAWG-ALQKNNSQLMIRSYELGVL 532
RA PH+KT+ RY G + W LLTSANLSK AWG A ++ I SYE+GVL
Sbjct: 413 RAAPHVKTYIRYRGDGPPHGPITIDWALLTSANLSKQAWGEAANAKTGEIRISSYEIGVL 472
Query: 533 ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 592
+ P + + G + + + + G + V L V
Sbjct: 473 VWP--ELYAPGATMQATFLTDTLAEGERRDAAAAAATAVPLR-----------------V 513
Query: 593 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 627
PY LP Q Y +VPW Y+++D GQVW RH
Sbjct: 514 PYNLPLQPYGKGEVPWVATASYSERDWMGQVW-RH 547
>gi|48094884|ref|XP_392205.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Apis
mellifera]
Length = 692
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 144/446 (32%), Positives = 219/446 (49%), Gaps = 79/446 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP--AN 240
I D+ G+I+ ++ N+MVDI WL + + ++ ++ GE T P +N
Sbjct: 298 ILDISLGEIVNSLHINFMVDIGWLHVQYMLAEQNTNMSILLGERVDT-------GPVGSN 350
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDF--PLK 297
+P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL
Sbjct: 351 VTTFYVDMPTKFGCHHTKIMILKYKDDGIRVVVSTANLYMDDWENRTQGVWISPHLPPLS 410
Query: 298 DQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
+ N SE GF+ DL YL+ + P + A ++ +FSS V +
Sbjct: 411 ESANSSEGESPTGFKKDLERYLNRYRQPGITEWTCA----------VRRADFSSVNVFFL 460
Query: 355 ASVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLD---EKWMA-E 409
ASVPG HT WGH KL ++L + + LV Q SS+GSL E W+ E
Sbjct: 461 ASVPGRHTDMEYDSWGHRKLGSILSKHAKLPPDAPQWTLVAQSSSIGSLGPNYESWLQKE 520
Query: 410 LSSSMSSGFSEDKTPLGI-GEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFL 465
++SSMS + P+G+ P ++P++ + + S + +P S Q + + ++
Sbjct: 521 ITSSMSK-----ENPVGLKSHPNFHFIYPSLNNYKRSFDCRVGSCCLPYSLQTHSKQKWI 575
Query: 466 KKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLM 523
+ Y +WKA TGR +AMPHIKT+ R + +++ WF+LTSANLSKAAWG + KN+ +M
Sbjct: 576 ESYMYQWKAKQTGRDKAMPHIKTYTRISPDLKRIPWFVLTSANLSKAAWGTVGKNSHYIM 635
Query: 524 IRSYELGVLILPSAKRHGCGFSCTSNIVP-SEIKSGSTETSQIQKTKLVTLTWHGSSDAG 582
+YE GV+ +PS F S+ P E + G
Sbjct: 636 --NYEGGVVFIPS-------FITGSSTFPIKEEEPG------------------------ 662
Query: 583 ASSEVVYLPVPYELPPQRYSSEDVPW 608
V P+PY+LP RY D P+
Sbjct: 663 ----VPVFPIPYDLPLTRYEKNDSPF 684
>gi|408398119|gb|EKJ77253.1| hypothetical protein FPSE_02528 [Fusarium pseudograminearum CS3096]
Length = 513
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 155/508 (30%), Positives = 236/508 (46%), Gaps = 76/508 (14%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIP 217
+PS ++L +Q LP N VS++D++ +I N++ DI +L+ A P +
Sbjct: 38 IPSPWQLTWIQDLPESENKDAVSLQDLLGDPLISECWEFNFLHDIPFLMNAFDPDTRHLV 97
Query: 218 HVLVIHGESDGTLEHMKRNKPA---------NWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+V ++HG +H +N+ A N +H P+P FGTHHSK M+L +
Sbjct: 98 NVHLVHG----FWKHEDKNRIALENAAAKFENVNIHIAPMPEMFGTHHSKMMVLFRHDDT 153
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDL--------IDYLSTL 319
++I+HTAN+I DW N + G+W + N E L ID L+ L
Sbjct: 154 AQVIIHTANMIPKDWTNMTNGVWKSPLLPRMSNTQILTSSPEEFLVGSGERFKIDLLNYL 213
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTV 377
K+ + + + K+ ++++FS+ LIASVPG H + + WG L+
Sbjct: 214 KFYDKRKIVCKPLSDKL-----QQYDFSTVKAALIASVPGRHDVHDMSETSWGWAALKRC 268
Query: 378 LQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEPL--IV 433
L+ + S +V Q SS+ +L K W L ++ S K G+G P +V
Sbjct: 269 LRHVPCHQ-HGDSDIVVQVSSIATLGPKDDW---LQKTLFDHLSRCKD-TGLGRPRFKVV 323
Query: 434 WPTVEDVRCSLEGYAAG----NAIPSPQKNVDKDFLKKYWAKWKAS-------------H 476
+PT +++R SL+GYA+G I SPQ+ ++L+ + W
Sbjct: 324 FPTADEIRRSLDGYASGLSIHTKIQSPQQAKQLEYLRPMFHHWANDSPGGTKLPDGPVLE 383
Query: 477 TGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS 536
+GR RA PHIKT+ R N + W LLTSAN+SK AWG + ++ I S+E+GVLI P
Sbjct: 384 SGRKRAAPHIKTYVRSNKSSIDWGLLTSANISKQAWGEAAQLTGEMRIASWEVGVLIWPE 443
Query: 537 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYEL 596
G T E+ E + S VV L +PY
Sbjct: 444 LLEPGSVMVGTYKTDVPEVSRSPKEDEE-------------------SLPVVGLRIPYNT 484
Query: 597 PPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
P QRY+SE+VPW +T+ D GQ W
Sbjct: 485 PLQRYTSEEVPWVVSMSHTEPDWAGQSW 512
>gi|397613425|gb|EJK62211.1| hypothetical protein THAOC_17185, partial [Thalassiosira oceanica]
Length = 576
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 144/517 (27%), Positives = 236/517 (45%), Gaps = 114/517 (22%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG-TLEHMKR--------NKPANWILHK 245
+++++++D+++L P + K V+V +G +G +++ M++ K +I
Sbjct: 56 VITSFLLDVEYLFEELPEIIKYQKVIVYYGSVEGNSMQAMRQWEQVLGNSGKTVEFIRLV 115
Query: 246 P---------PLP--ISFGTHHSKAMLLIYPRG----VRIIVHTANLIHVDWNNKSQGLW 290
P PLP + +G HHSK L Y RI +H+ANL D K+QG++
Sbjct: 116 PSDPPYSATNPLPFKLPYGVHHSKFFLSGYEEEGKHMCRIGIHSANLRRSDIERKTQGIY 175
Query: 291 MQDF--------------PLK-----DQNNLSEECGFENDLIDYLSTLKWPE-----FSA 326
+QDF P K + ++L + FE+DLI Y+ + ++ FS
Sbjct: 176 VQDFPAKAPKKQAAAAVNPYKRAKVDEDDDLRQ---FEDDLITYMESYRYYVRGQIWFSP 232
Query: 327 NLPAHGNFKINP----SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQEC- 381
+ G + ++++FS A L+ SVPGYH + K+G+ K+ ++
Sbjct: 233 STTQSGGLTDRSHSILTLLRRYDFSCAYAVLVPSVPGYHQARDMPKFGYYKIHKAVKNAR 292
Query: 382 TFEKGFKKS---------PLVYQFSSLGSLDEKWMAELSSSMSSGFSED----------K 422
+ G +S P+++Q SSLG++ +W+ +L +++ S +
Sbjct: 293 SGRAGSNQSSSGETETPKPIIFQVSSLGTIQNRWLIKLLAAIDSNCHRNDPSTYLPAGKS 352
Query: 423 TPLGIGEPL-----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT 477
P G PL +VWPTVE+VR +EGYA G AIP + +DKDFL + +W T
Sbjct: 353 IPQGKTPPLETRMKLVWPTVEEVRTCVEGYAGGGAIPGTTEKLDKDFLLPLYHRWSNPDT 412
Query: 478 G------RSRAMPHIKTFAR-YNGQKLAWFLLTSANLSKAAWGALQ----KNNSQLMIRS 526
+R PHIKTF + +G ++ W +LTS NLSK + G Q N +LMI+
Sbjct: 413 NILGPLRTARYAPHIKTFVQPGDGDEIHWVVLTSHNLSKPSLGEFQTDTKTNERRLMIQH 472
Query: 527 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 586
+ELGV P + ++P E E Q G DA
Sbjct: 473 WELGVFFSPETLTKMTSDNSPLRMIPFE------EAGQC-----------GIKDA----- 510
Query: 587 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 623
+P+PY L P RY + W+ D+ + D +G+V
Sbjct: 511 -ALVPLPYSLHPSRYDENEEAWATDRPASTPDAFGRV 546
>gi|313236496|emb|CBY11811.1| unnamed protein product [Oikopleura dioica]
Length = 495
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 136/441 (30%), Positives = 214/441 (48%), Gaps = 80/441 (18%)
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 257
NYM+D++++L P +KI L + G D + + P N P+P FGTHH+
Sbjct: 118 NYMIDLEFVLKHHPNSSKI---LFVSG--DTLFQPGRDGIPDNIFQSVVPVP-QFGTHHT 171
Query: 258 KAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDFP--LKDQNNLSEECGFENDLID 314
K +L + G+R+ +++ANL+ DW ++Q +W+ LK+++ S E FE DL++
Sbjct: 172 KMSILKFRNIGLRVAIYSANLLDYDWRERTQVIWLSPLLPLLKEKSKTSSE--FETDLVE 229
Query: 315 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKL 374
Y+ + ++ L + F+K++FSS R I S PG +GH+KL
Sbjct: 230 YIDSYSLAPLNSLLQS----------FEKYDFSSIKARFIGSSPGRRRDKEKWIFGHLKL 279
Query: 375 RTVLQECTFEKGFKKSPLVYQFSSLGSLDEK-------WMAEL--SSSMSSGFSEDKTPL 425
R VL++ + K LV Q SS+GSL + ++A L S +S +++D
Sbjct: 280 RKVLKKIS--NCAKNDKLVAQCSSIGSLRSRDSWLYNEFLASLMTCSDAASYYTKDNDAF 337
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASH-TGRSRAM 483
+ V+PTVE +RCS GY++G + P S + + + ++ Y +KW+ TGRSR M
Sbjct: 338 SL-----VYPTVEQIRCSKFGYSSGGSFPYSAKTHESQKWIIYYMSKWEPDEKTGRSRVM 392
Query: 484 PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 543
PH K + R + K+ WFL S NLSKAAWG +K ++QL IRS+E VL++P
Sbjct: 393 PHSKIYQRVSDGKVKWFLSGSHNLSKAAWGQYEKGDTQLHIRSFEASVLLIPE------D 446
Query: 544 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 603
+ S P+ + E Q RYS
Sbjct: 447 YGLESFNFPAFPNFHNFEKIQ-----------------------------------RYSD 471
Query: 604 EDVPWSWDKRYTKKDVYGQVW 624
D PW +D +Y + D + Q W
Sbjct: 472 NDFPWLYDNKYLQPDDFNQTW 492
>gi|157109623|ref|XP_001650753.1| tyrosyl-dna phosphodiesterase [Aedes aegypti]
gi|108868427|gb|EAT32652.1| AAEL015141-PA [Aedes aegypti]
Length = 624
Score = 182 bits (462), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 142/443 (32%), Positives = 214/443 (48%), Gaps = 62/443 (13%)
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPH--VLVIHGESDGTLEHMKRNKPANWI 242
D G++ ++ N+MVDI WLL A +L+++G+ L+ + KP N
Sbjct: 224 DTSLGELECSVQMNFMVDIGWLL-GHYFFAGYEDRPLLILYGDETPELKMVSSKKP-NVT 281
Query: 243 LHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQ 299
K + FG HH+K L Y G +R++V TANL DW+N++QGLW+ P+ +
Sbjct: 282 AVKVHIATPFGVHHTKMGLYGYTDGSMRVVVSTANLYEDDWHNRTQGLWVSPRLPPMPED 341
Query: 300 NNLS---EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
++ + GF +LI YL++ K G+ + + +K NFS V L+AS
Sbjct: 342 SDTGAGDSKTGFRENLITYLNSYKI----------GHLQPWVARIQKTNFSEVNVFLVAS 391
Query: 357 VPGYHTGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
VPG H + WGH ++ +L + + PLV Q SS+GSL + + S +
Sbjct: 392 VPGGHLNTPKGPLWGHPRMGYLLGKHSAPID-DSCPLVAQSSSIGSLGPNPQSWVLSEVL 450
Query: 416 SGFSEDKTPLGIGE-PL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAK 471
+ F D P+G+ P +++P+ +VR S + G +P + DK +LK Y +
Sbjct: 451 ASFRRDSAPIGLRRVPAFKMIFPSFSNVRNSHDHLLGGGCLPYMKATHDKQVWLKDYLHQ 510
Query: 472 WKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYE 528
WK+ R++A+PHIKT+ R++ + L WFLLTSANLSKAAWG K+ + L I SYE
Sbjct: 511 WKSDSRNRTKAVPHIKTYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSYE 570
Query: 529 LGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 588
GVL LP F N P E K G
Sbjct: 571 AGVLFLPK-------FVIEENFFPMESKPGQQHPQ------------------------- 598
Query: 589 YLPVPYELPPQRYSSEDVPWSWD 611
P+PY++P Y+ ED P+ D
Sbjct: 599 -FPMPYDVPIIPYALEDTPFFMD 620
>gi|158293221|ref|XP_558110.3| AGAP010577-PA [Anopheles gambiae str. PEST]
gi|157016854|gb|EAL40355.3| AGAP010577-PA [Anopheles gambiae str. PEST]
Length = 584
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 148/461 (32%), Positives = 219/461 (47%), Gaps = 73/461 (15%)
Query: 173 PAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHV--LVIHGESD 227
P A V+ ++++ G++ ++ N+MVDI WLL A A +V L+++G+
Sbjct: 169 PTHAEPLSVTFQELLDSSLGELECSVQMNFMVDIGWLL-AHYFFAGYENVPLLILYGDET 227
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKS 286
L + + KP N K + FG HH+K L Y G +R++V TANL DW+N++
Sbjct: 228 PELRMVSQKKP-NVTAVKVEIKTPFGVHHTKMGLYGYRDGSMRVVVSTANLYEDDWHNRT 286
Query: 287 QGLWMQD----FPLKDQNNLSE-ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFF 341
QGLW+ P E F + L+ YL K P+ + +
Sbjct: 287 QGLWISPRLPAVPEGSDTTYGESRSDFRSSLLTYLDAYKLPQLQPWM----------ARI 336
Query: 342 KKFNFSSAAVRLIASVPGYHTGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
+K +FS V L+ASVPG HT ++ WGH +L +L + PLV Q SS+G
Sbjct: 337 RKTDFSDVKVFLVASVPGGHTNTAKGPLWGHPRLGYLLSQHAAPID-DSCPLVAQSSSIG 395
Query: 401 SLD---EKWMAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIP 454
SL E W+ L M+S F +D P+GI +++P+ +VR S +G G +P
Sbjct: 396 SLGPSPESWV--LGEIMAS-FRKDSAPVGIRRLPGFRMIYPSFSNVRQSHDGMMGGGCLP 452
Query: 455 SPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWG 513
+ +V +++LK Y +W + R++AMPHIKT+ R++ + L WFLLTSANLSKAAWG
Sbjct: 453 YVRSTHVKQEWLKDYLQQWCSRARHRNKAMPHIKTYCRWSHRGLYWFLLTSANLSKAAWG 512
Query: 514 ALQKN---NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKL 570
K L I SYE GVL LP N P E
Sbjct: 513 VYNKTGRFEKPLRINSYEAGVLFLPK-------LLLDENFFPME---------------- 549
Query: 571 VTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWD 611
A+ + P+PY++P Y+ ED P+ D
Sbjct: 550 ------------ANKKHPQFPMPYDVPTIPYAPEDTPFFMD 578
>gi|301791029|ref|XP_002930517.1| PREDICTED: LOW QUALITY PROTEIN: tyrosyl-DNA phosphodiesterase
1-like [Ailuropoda melanoleuca]
Length = 473
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 138/382 (36%), Positives = 196/382 (51%), Gaps = 57/382 (14%)
Query: 258 KAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNNLSEECG--FENDLI 313
K MLL+Y G+ +++HT++LIH D + K+QG W+ +P + + S E F+ DLI
Sbjct: 131 KMMLLLYEEGLWVVIHTSDLIHADCHQKTQGEWLTPLYPRIIHGXHRSGESATHFKADLI 190
Query: 314 DYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMK 373
YL P + K + S V LI S PG GS GH +
Sbjct: 191 SYLMAYNAPSLKEWI----------DTVHKHDISETNVYLIGSTPGCFQGSRKDHXGHFR 240
Query: 374 LRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGI 427
LR +L+E + KG + P+V QFSS+GSL D KW+ +E S+++ E +TP
Sbjct: 241 LRKLLKEHASSIPKG-ESWPIVGQFSSIGSLGADDLKWLCSEFKESLATLGKESRTPGKS 299
Query: 428 GEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPH 485
PL +++P+VE+V+ SLE Y AG+++PS + +K + L Y+ K A +G + AMPH
Sbjct: 300 AVPLHLIYPSVENVQTSLEEYPAGDSLPSSIQIAEKQNCLHSYFHKXVADTSGCNNAMPH 359
Query: 486 IKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 543
IK + R + ++ W L+TS NLSK GAL+KN QLMI SYE GVL L SA
Sbjct: 360 IKRYMRPSPDFSQIVWLLVTSTNLSKTTXGALEKNGXQLMIHSYEXGVLFLLSA------ 413
Query: 544 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 603
F S V K KL +G+ PVPY+LPP+ Y S
Sbjct: 414 FGLDSFKV---------------KQKL----------SGSKEPAATFPVPYDLPPELYGS 448
Query: 604 EDVPWSWDKRYTK-KDVYGQVW 624
+D P + YTK D +G +W
Sbjct: 449 KDRPXIXNIPYTKVPDTHGNMW 470
>gi|350415522|ref|XP_003490669.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Bombus
impatiens]
Length = 697
Score = 181 bits (460), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 137/439 (31%), Positives = 217/439 (49%), Gaps = 65/439 (14%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D+ G+I+ ++ N+MVD+ WL + + + ++ G + K + I
Sbjct: 304 ILDISLGEIVKSLHINFMVDVGWLCLQYLLAGQRTDMSILFGT------RVDEEKLSLNI 357
Query: 243 LHKPP-LPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL--- 296
P +P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL
Sbjct: 358 TMIPVWMPTKFGCHHTKVMILKYKDDGIRVVVSTANLYSDDWENRTQGVWISPHLPLLAE 417
Query: 297 -KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
+ ++ GF+ DL YL + P + + A K+ NFSS V +A
Sbjct: 418 SANPSDGESPTGFKRDLERYLHKYEQPALTEWISA----------VKRANFSSVNVFFVA 467
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG HTG WG+ KL VL + + LV Q SS+GSL + + + +
Sbjct: 468 SVPGRHTGVEYDYWGYRKLGHVLSKHAKLPPDAPQWTLVVQSSSIGSLGPNYESWIQKEI 527
Query: 415 SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 471
S S++ P P ++P++ + + S + +P S Q + +++++ Y +
Sbjct: 528 ISSMSKENPPGLKSCPNFRFIYPSLNNYKQSFDCQVGSCCLPYSIQTHSKQEWVESYMYQ 587
Query: 472 WKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYEL 529
WKA+ T R +A+PHIKT+ R N +K+ WF+LTSANLSKAAWG ++K++ ++ +YE
Sbjct: 588 WKATRTARDKAIPHIKTYTRISPNLEKIPWFVLTSANLSKAAWGMVRKDSHHIL--NYEA 645
Query: 530 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 589
GV+ +P +GST T I+K +AG V
Sbjct: 646 GVIFIP------------------HFVTGST-TFPIKK-----------EEAG----VPV 671
Query: 590 LPVPYELPPQRYSSEDVPW 608
P+PY+LP RY S D P+
Sbjct: 672 FPIPYDLPLTRYGSGDKPF 690
>gi|157129902|ref|XP_001661809.1| tyrosyl-dna phosphodiesterase [Aedes aegypti]
gi|108872048|gb|EAT36273.1| AAEL011629-PA [Aedes aegypti]
Length = 536
Score = 181 bits (460), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 141/444 (31%), Positives = 214/444 (48%), Gaps = 60/444 (13%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPH-VLVIHGESDGTLEHMKRNKPANW 241
+ D G++ ++ N+MVDI WLL +L+++G+ L+ + KP N
Sbjct: 134 LLDTSLGELECSVQMNFMVDIGWLLGHYFFAGYEDRPLLILYGDETPELKMVSSKKP-NV 192
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKD 298
K + FG HH+K L Y G +R++V TANL DW+N++QGLW+ P+ +
Sbjct: 193 TAVKVHIATPFGVHHTKMGLYGYTDGSMRVVVSTANLYEDDWHNRTQGLWVSPRLPPMPE 252
Query: 299 QNNLS---EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
++ + GF +LI YL++ K G+ + + +K NFS V L+A
Sbjct: 253 DSDTGAGDSKTGFRENLITYLNSYKI----------GHLQPWVARIQKTNFSEVNVFLVA 302
Query: 356 SVPGYHTGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG H + WGH ++ +L + + PLV Q SS+GSL + + S +
Sbjct: 303 SVPGGHLNTPKGPLWGHPRMGYLLGKHSAPID-DSCPLVAQSSSIGSLGPNPQSWVLSEV 361
Query: 415 SSGFSEDKTPLGIGE-PL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWA 470
+ F D P+G+ P +++P+ +VR S + G +P + DK +LK Y
Sbjct: 362 LASFRRDSAPIGLRRVPAFKMIFPSFSNVRNSHDHLLGGGCLPYMKATHDKQVWLKDYLH 421
Query: 471 KWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSY 527
+WK+ R++A+PHIKT+ R++ + L WFLLTSANLSKAAWG K+ + L I SY
Sbjct: 422 QWKSDSRNRTKAVPHIKTYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSY 481
Query: 528 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 587
E GVL LP F N P E K G
Sbjct: 482 EAGVLFLPK-------FVIEENFFPMESKPGQQHPQ------------------------ 510
Query: 588 VYLPVPYELPPQRYSSEDVPWSWD 611
P+PY++P Y+ ED P+ D
Sbjct: 511 --FPMPYDVPIIPYALEDTPFFMD 532
>gi|340710910|ref|XP_003394026.1| PREDICTED: LOW QUALITY PROTEIN: probable tyrosyl-DNA
phosphodiesterase-like [Bombus terrestris]
Length = 697
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 136/439 (30%), Positives = 217/439 (49%), Gaps = 65/439 (14%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D+ G+I+ ++ N+MVD+ WL + + + +++G + + K + I
Sbjct: 304 ILDISLGEIVKSLHINFMVDVGWLCLQYLLAGQRTDMSIMYGS------RVDKEKLSLNI 357
Query: 243 LHKPP-LPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL--- 296
P +P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ PL
Sbjct: 358 TMIPVWIPTKFGCHHTKVMILKYKDDGIRVVVSTANLYSCDWENRTQGVWISPHLPLLAE 417
Query: 297 -KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
+ ++ GF+ DL YL + + A ++ NFSS V +A
Sbjct: 418 SANPSDGESPTGFKRDLERYLHKYHQRGLTEWISA----------VRRANFSSVNVFFLA 467
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
SVPG HTG WG+ KL VL + + LV Q SS+GS + + + +
Sbjct: 468 SVPGKHTGVEYDYWGYRKLGQVLSKHAKLPPDAPQWTLVAQSSSIGSFGPNYESWIQKEI 527
Query: 415 SSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAK 471
S S++ P +P ++P++ + + S + +P S + + +++L+ Y +
Sbjct: 528 VSSMSKENPPGLKSQPNFQFIYPSINNYKQSFDCQVGSCCLPYSIETHSKQEWLESYMYQ 587
Query: 472 WKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYEL 529
WKA+ T R +A+PHIKT+ R N +K+ WF+LTSANLSKAAWG ++ ++ L I +YE
Sbjct: 588 WKATRTARDKAIPHIKTYTRISPNLEKIPWFVLTSANLSKAAWGIVRVDS--LHILNYEA 645
Query: 530 GVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 589
GV+ +P +GST T I+K +AG V
Sbjct: 646 GVIFIP------------------HFVTGST-TFPIKK-----------EEAG----VPV 671
Query: 590 LPVPYELPPQRYSSEDVPW 608
P+PY+LP RY SED P+
Sbjct: 672 FPIPYDLPLTRYGSEDKPF 690
>gi|380479741|emb|CCF42843.1| tyrosyl-DNA phosphodiesterase [Colletotrichum higginsianum]
Length = 520
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 148/514 (28%), Positives = 241/514 (46%), Gaps = 87/514 (16%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAK 215
D++ S F+L R++ LP AN V+++D++ GD ++A N++ DI +L+ +
Sbjct: 44 DRIASPFQLTRIRDLPEAANKDTVTLKDIL-GDPLIAECWEFNFLHDIHFLMSHFDEDTR 102
Query: 216 -IPHVLVIHG---ESDGTLEHMKRNKPA--NWILHKPPLPISFGTHHSKAMLLI-YPRGV 268
+ V V+HG + D ++++ A N LH +P FGTHHSK M+LI +
Sbjct: 103 NLVKVHVVHGFWKKEDPNRLALQKDAEAYPNVELHGAFMPEMFGTHHSKMMVLIRHDDSA 162
Query: 269 RIIVHTANLIHVDWNNKSQGLW-------MQDFPLKDQNNLSEECG----FENDLIDYLS 317
++I+HTAN+I DW N + +W + D +D + G F++DL+ YL
Sbjct: 163 QVIIHTANMIVRDWTNMTNAVWRSPLLPLLSDEHAEDTSATDHPFGTGKRFKHDLLSYLR 222
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLR 375
++A P ++FSS IASVPG H +S WG L+
Sbjct: 223 A-----YNARRPITRTLVAQ---LCNYDFSSVRATFIASVPGRHPILDTSQTAWGWPALK 274
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAE-----LSSSMSSGFSEDKTPLGIG 428
L ++G +S +V Q SS+ +L + W+ + L+ S + S K +
Sbjct: 275 RALGSVPVQEG--ESEIVIQVSSIATLGPTDSWIQKCLFDSLAVSKNKSSSRPKPKFKV- 331
Query: 429 EPLIVWPTVEDVRCSLEGYAAGNAIPSP----QKNVDKDFLKKYWAKWK----------- 473
V+PT +++R SL+GYA+G +I + Q+ +L+ + W
Sbjct: 332 ----VFPTADEIRQSLDGYASGGSIHTKIQSQQQMKQLQYLRPIFCHWANDAPEGKILSE 387
Query: 474 ---ASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
GR RA PHIKT+ RY + + W L+TSAN+SK AWG + ++ + S+E+G
Sbjct: 388 TAAIQKAGRERAAPHIKTYIRYGEKSIDWALVTSANISKQAWGEAMGASQEVRVASWEVG 447
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL+ PS I + G+ ET + + G+ VV L
Sbjct: 448 VLVWPSI------------ITDNATMVGTFETDMPPR------------EGGSGDTVVGL 483
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+PY LP Q Y +++PW +T+ D G+ W
Sbjct: 484 RIPYNLPLQSYGKDEIPWVASMAHTEPDRMGRFW 517
>gi|194855370|ref|XP_001968528.1| GG24923 [Drosophila erecta]
gi|190660395|gb|EDV57587.1| GG24923 [Drosophila erecta]
Length = 580
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 130/374 (34%), Positives = 195/374 (52%), Gaps = 35/374 (9%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K +LV++G+ L + + KP
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKPQ 232
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
+ + +P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 233 VTAI-RVRMPTPFATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P E GF+ DL+ YL K + + + +K +FS+ V
Sbjct: 292 PEDADTGAGESLTGFKQDLMLYLVEYKITQLQPWI----------ARIRKSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H SS++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 342 LGSVPGGHRESSVRGHPWGHARLGSLLSKHAAPID-DRIPVVCQSSSIGSLGVSVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPVGKLRQMPPFKMIYPSFGNVSGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---Q 521
Y +WK+S RSRAMPHIK++ R+N Q + WF+LTSANLSKAAWG KN++
Sbjct: 461 DYLQQWKSSDRYRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPC 520
Query: 522 LMIRSYELGVLILP 535
L I +YE+GVL LP
Sbjct: 521 LRIANYEVGVLFLP 534
>gi|194771042|ref|XP_001967588.1| GF20606 [Drosophila ananassae]
gi|190615089|gb|EDV30613.1| GF20606 [Drosophila ananassae]
Length = 576
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 135/388 (34%), Positives = 202/388 (52%), Gaps = 40/388 (10%)
Query: 173 PAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGES 226
P + V++++++ G+I + N+MVDI WLL +L K +LV++G+
Sbjct: 158 PTHSEPLSVTLQEILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDE 215
Query: 227 DGTLEHMKRNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNN 284
L + + KP I K P P F T H+K MLL Y G +R+++ TANL DW+N
Sbjct: 216 SPELLSIGKFKPQVTAIGVKMPTP--FATSHTKMMLLAYNDGSMRVVISTANLYEDDWHN 273
Query: 285 KSQGLWMQ-DFPL----KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
++QG+W+ P D + GF+ DL+ YL K + + +
Sbjct: 274 RTQGVWISPKLPELHEDADTGAGESQTGFKQDLMLYLVEYKISQLQPWI----------A 323
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFS 397
+K +FS+ V + SVPG H S+++ WGH +L +L + + P+V Q S
Sbjct: 324 RIRKSDFSAINVFFLGSVPGGHRESTVRGHPWGHARLGALLAKHATPIN-DRIPVVCQSS 382
Query: 398 SLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAI 453
S+GSL A + + +D TPLG + +++P+ +V S +G G +
Sbjct: 383 SIGSLGANVQAWIQQDFVNSLKKDSTPLGKLRQMPTFKMIYPSFGNVSGSHDGMLGGGCL 442
Query: 454 PSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKA 510
P + DK +LK + +WK++ RSRAMPHIKT+ RYN Q + WF+LTSANLSKA
Sbjct: 443 PYGKNTNDKQPWLKDHLHQWKSNDRYRSRAMPHIKTYTRYNLEDQSVYWFVLTSANLSKA 502
Query: 511 AWGALQKN-NSQ--LMIRSYELGVLILP 535
AWG KN N Q L I +YE GVL LP
Sbjct: 503 AWGCFNKNSNVQPCLRIANYEAGVLFLP 530
>gi|195118058|ref|XP_002003557.1| GI21930 [Drosophila mojavensis]
gi|193914132|gb|EDW12999.1| GI21930 [Drosophila mojavensis]
Length = 596
Score = 178 bits (452), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 148/452 (32%), Positives = 222/452 (49%), Gaps = 73/452 (16%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I ++ N+M+DI WLL +L+K +LV++G D L + + KP
Sbjct: 191 IFDESLGEIESSVQINFMIDIGWLLGHYYFAGILSK--PLLVLYGADDPNLVDIGKFKPQ 248
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PL 296
+ K + F T H+K MLL Y G +R+++ TANL DW+N++QGLWM PL
Sbjct: 249 VTAI-KVQMQSPFATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWMSPRLPPL 307
Query: 297 -KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
+D + + E GF+ DL+ YL K + + + +K +FS+ V
Sbjct: 308 PEDADTAAGESPTGFKQDLMLYLVEYKLSQLQPWI----------ARIRKSDFSAINVFF 357
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAE 409
I SVPG H S+++ WG +L ++L + E P+V Q SS+GSL A
Sbjct: 358 IGSVPGGHRESAVRGHPWGCARLGSLLAKHAAPVEPNI---PVVCQSSSIGSLGANVQAW 414
Query: 410 LSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-F 464
+ + S F +D +P+G L +++P+ +V S +G G +P + DK +
Sbjct: 415 IEQDILSNFRKDSSPIGRLSQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYGKSTNDKQPW 474
Query: 465 LKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGAL-QKNNSQ 521
LK Y +WK+ RS+AMPHIK++ R+N Q + WF+LTSANLSKAAWGA +K+N Q
Sbjct: 475 LKNYLHQWKSGDRHRSQAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGAFNKKSNLQ 534
Query: 522 --LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSS 579
L I +YE GVL LP F + P
Sbjct: 535 PCLRIFNYEAGVLFLPK-------FVTGEDTFPL-------------------------- 561
Query: 580 DAGASSEVVYLPVPYELPPQRYSSEDVPWSWD 611
A + V P+PY++P Y +D P+ D
Sbjct: 562 -GNARNGVPAFPLPYDVPLTPYGPDDTPFLMD 592
>gi|125984342|ref|XP_001355935.1| GA28884 [Drosophila pseudoobscura pseudoobscura]
gi|54644254|gb|EAL32995.1| GA28884 [Drosophila pseudoobscura pseudoobscura]
Length = 576
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 135/377 (35%), Positives = 198/377 (52%), Gaps = 41/377 (10%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP- 238
I D G+I ++ N+MVDI WLL +L K +LV++G+ L + + KP
Sbjct: 171 IFDESLGEIESSVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQ 228
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL- 296
I K P P F T H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL
Sbjct: 229 VTAIGVKMPTP--FATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLL 284
Query: 297 ----KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+D + + E GF DL+ YL K + + + +K +FS+
Sbjct: 285 PALSEDADTAAGESLTGFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAIN 334
Query: 351 VRLIASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA 408
V + SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A
Sbjct: 335 VFFVGSVPGGHREGSVRGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQA 393
Query: 409 ELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD- 463
+ + +D +P G + +++P+ +V S +G G +P + DK
Sbjct: 394 WIQQDFVNSLRKDSSPGGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQP 453
Query: 464 FLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQ 521
+LK + +WK+S RSRAMPHIKT+ RYN Q + WF+LTSANLSKAAWG+ KN +
Sbjct: 454 WLKAHLQQWKSSDRHRSRAMPHIKTYTRYNLTDQSVYWFVLTSANLSKAAWGSFNKNTNL 513
Query: 522 ---LMIRSYELGVLILP 535
L I +YE GVL LP
Sbjct: 514 QPCLRIANYEAGVLFLP 530
>gi|195470993|ref|XP_002087790.1| GE18215 [Drosophila yakuba]
gi|194173891|gb|EDW87502.1| GE18215 [Drosophila yakuba]
Length = 582
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 129/374 (34%), Positives = 194/374 (51%), Gaps = 35/374 (9%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K +LV++G+ L + + KP
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKPQ 232
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
+ + +P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 233 VTAI-RVRMPTPFATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P E GF+ DL+ YL K + + + +K +FS+ V
Sbjct: 292 PEDADTGAGESLTGFKQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H SS++ WGH +L ++L + + P++ Q SS+GSL A +
Sbjct: 342 LGSVPGGHRESSVRGHPWGHARLGSLLSKHATPID-DRIPVICQSSSIGSLGANVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPAGKLRQMPPFKMIYPSFGNVSGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---Q 521
Y +WK+S RSRAMPHIK++ R+N Q + WF+LTSANLSKAAWG KN++
Sbjct: 461 DYLQQWKSSDRYRSRAMPHIKSYTRFNLEEQSVYWFVLTSANLSKAAWGCFNKNSNIQPC 520
Query: 522 LMIRSYELGVLILP 535
L I +YE+GVL LP
Sbjct: 521 LRIANYEVGVLFLP 534
>gi|307211789|gb|EFN87770.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 645
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 119/365 (32%), Positives = 194/365 (53%), Gaps = 30/365 (8%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N+MVD+ WL + + +++++G+ + + N
Sbjct: 250 ILDRSLGEIVNSLHLNFMVDVGWLCLQYLLAGQRTDMMILYGDRVD-----QESLGCNIT 304
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPL---- 296
+ +P +FG HH+K M+L Y G+RI+V TANL DW N++QGLW+ PL
Sbjct: 305 MIHVDMPSAFGCHHTKIMILQYKDDGIRIVVSTANLYSDDWENRTQGLWISPHLPLLPES 364
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
+ N+ F+ D YLS + P + + +K +FS+ V +AS
Sbjct: 365 ANSNDGESPTNFKKDFERYLSKYRHPALTQWI----------WIVRKADFSAVNVYFVAS 414
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
VPG H + WGH KL +L Q T + ++ Q SS+GSL + + LS +
Sbjct: 415 VPGTHKNVDVDFWGHRKLAQILSQHATLPPDAPQWSIIAQSSSIGSLGPNYESWLSREIV 474
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
S S + T P V+P++E+ + S + + +P S + + + +++ Y +W
Sbjct: 475 SSMSRETTQGLKSHPKFQFVYPSIENYKRSFDFQTLSSCLPYSLKVHSKQQWIESYLYQW 534
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
KA+ TGR+RA+PHIK++ R + + + WF+LTSANLSKAAWGA Q++N +M +YE G
Sbjct: 535 KATRTGRNRAIPHIKSYTRISPDLKSIPWFVLTSANLSKAAWGA-QRSNYYIM--NYEAG 591
Query: 531 VLILP 535
V+ LP
Sbjct: 592 VVFLP 596
>gi|402082685|gb|EJT77703.1| hypothetical protein GGTG_02808 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 583
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 152/514 (29%), Positives = 243/514 (47%), Gaps = 83/514 (16%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKIPHVL 220
S FRL ++ L N V ++DVI +I I + NY+ DI+++L A + + H++
Sbjct: 101 SPFRLTHIKDLAPQDNVDAVRLKDVIGDPLISEIWNFNYLHDINFVLGA--LDEDVRHMI 158
Query: 221 ---VIHG---ESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRII 271
VIHG + D ++R+ + N LH +P FGTHHSK ++L+ + +++
Sbjct: 159 KVNVIHGFWKKDDRRRIDLQRDAAQNKNLTLHTAFMPEMFGTHHSKMLILLRHDDTAQVV 218
Query: 272 VHTANLIHVDWNNKSQGLWMQ-DFPLK----------DQNNLSEECG--FENDLIDYLST 318
+HTAN+I DW N +Q +W+ PL+ D +L E G F+ DL+ YL
Sbjct: 219 IHTANMIPKDWTNMTQSIWLSPRLPLQKPTAPAPAHVDYESLPEGSGEKFKLDLLSYLR- 277
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRT 376
A + ++++FSS L+ASVPG H S WG +R
Sbjct: 278 -------AYDKRRAICRPLVQELQRYDFSSVRATLVASVPGRHQIHDRSAATWGWAAIRR 330
Query: 377 VLQECTFEKGFKKSP-LVYQFSSLGSL--DEKWM-AELSSSMSSGFSEDKTPLGIGEPL- 431
L+ + ++P +V Q SS+ +L + W+ L SMS G + +P
Sbjct: 331 ALESVPLQTAAGRTPEVVVQVSSIATLGPTDSWLRGALFDSMSRGKAAAVA---APKPRF 387
Query: 432 -IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK------------- 473
+++PT +++R SL+GYAAG +I S Q+ +LK + W
Sbjct: 388 KVIFPTPDEIRASLDGYAAGASIHTKIQSAQQVKQLMYLKPLFCHWANDSALGNEKDENA 447
Query: 474 -ASHTGRSRAMPHIKTFARY-NGQK-LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
GR+RA PH+KT+ RY +G++ L W L+TSANLSK AWG ++ I S+E+G
Sbjct: 448 PIRDAGRNRAAPHVKTYIRYGDGERSLDWALMTSANLSKQAWGEAVNAMGEVRIASWEIG 507
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
VL+ PS F+ + + P S + + + V+ L
Sbjct: 508 VLVWPSL------FAEKARMAPV-FGSDRLSVEEADEAR------------QGGGPVMGL 548
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+PY LP Q Y +++PW +Y + D G+ W
Sbjct: 549 RIPYNLPVQAYGRDEIPWVATAKYDELDCKGRKW 582
>gi|345487640|ref|XP_001604652.2| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Nasonia
vitripennis]
Length = 690
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 132/441 (29%), Positives = 210/441 (47%), Gaps = 63/441 (14%)
Query: 185 DVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH 244
D+ G+I+ ++ N+MV+I WL + A+ P + + G ++ P+N L
Sbjct: 295 DISLGEIVDSLHINFMVEIGWLCLQYLLAAQNPKMTIFCG----SVCDPNVALPSNITLV 350
Query: 245 KPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNN 301
+ +P +FG HHSK + Y G +RI+V TAN+ DW N++QGLWM PL + N
Sbjct: 351 EVNMPAAFGCHHSKISVFKYSDGGIRIVVSTANIYSDDWENRTQGLWMSPHLPPLPNSAN 410
Query: 302 LSE---ECGFENDLIDYLSTLKWPEFSA--NLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
S+ F+ +YL+ + P+ NL K+ + S+ V +AS
Sbjct: 411 PSDGESPTNFKKSFREYLNAYRNPKLVEWENL------------VKRADCSAVNVFFVAS 458
Query: 357 VPGYHTGSSLKKWGHMKLRTVLQE-CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
+PG H G SL WGH +L +L E + ++ Q SS+G+L + + + S++
Sbjct: 459 IPGSHKGLSLNSWGHRRLAAILNEHAVLPPDAPQWTIIAQSSSIGNLGPTFDSWIQSNIV 518
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKW 472
S +K P V+P++ + S + A +P +K+ +K ++LK Y +W
Sbjct: 519 FSLSREKAKGIKSNPNFHFVYPSLRNYEGSFDCKAGSCCLPYSRKSHEKQEWLKNYLYQW 578
Query: 473 KASHTGRSRAMPHIKTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
KA TGR++AMPH+K++ R + ++ WF+LTSANLSK AWG K I +YE G
Sbjct: 579 KADETGRTKAMPHVKSYTRISPDLTQIPWFVLTSANLSKGAWGTTAKTGVSHYIMNYEAG 638
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
V+ +P F P IK+ S S ++
Sbjct: 639 VVFIPK-------FVINQQTFP--IKTSS------------------------SPDIPVF 665
Query: 591 PVPYELPPQRYSSEDVPWSWD 611
+PY+LP RY DVP+ D
Sbjct: 666 RLPYDLPLTRYRQNDVPFVID 686
>gi|156400100|ref|XP_001638838.1| predicted protein [Nematostella vectensis]
gi|156225962|gb|EDO46775.1| predicted protein [Nematostella vectensis]
Length = 260
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 158/289 (54%), Gaps = 47/289 (16%)
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLVYQFSSLGSLDE---- 404
VRLIASVPG H G + KWGH+KLR +LQE + P++ QFSS+GSL
Sbjct: 1 VRLIASVPGRHAGLNKNKWGHLKLRKILQEHGPPSSDVTTNWPVIGQFSSIGSLGPDKNK 60
Query: 405 ----KWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKN 459
+W+ L+++ F G PL +V+PTV++VR +L +AG +IP K
Sbjct: 61 WLCGEWLQSLAATCGRTF-------GSNAPLKLVFPTVDNVRTTLWFISAGGSIPYSHKT 113
Query: 460 VDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQ 516
+K +L ++ W A+ GRSRA PHIKT+ R + +LAWF++TS+NLSKAAWG L+
Sbjct: 114 AEKQPYLPSFFCSWNATSRGRSRASPHIKTYMRTSPDHSRLAWFMVTSSNLSKAAWGVLE 173
Query: 517 KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH 576
K SQLMIRSYE+GVL LP+ + T+ I + + +
Sbjct: 174 KGGSQLMIRSYEIGVLFLPADQ--------------------VTDREAIDQCRDIL---- 209
Query: 577 GSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
+ + ++ VP++LPP YS ++ PW WD RY K D G +W
Sbjct: 210 -GGNRLSDEPCTHVHVPFDLPPSPYSDDEKPWMWDVRYLDKPDTNGNIW 257
>gi|336471045|gb|EGO59206.1| hypothetical protein NEUTE1DRAFT_145272 [Neurospora tetrasperma
FGSC 2508]
gi|350292122|gb|EGZ73317.1| phospholipase D/nuclease, partial [Neurospora tetrasperma FGSC
2509]
Length = 619
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 180/590 (30%), Positives = 264/590 (44%), Gaps = 107/590 (18%)
Query: 130 KKMRQQDEQDNENGKNSEEAL----CNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRD 185
KK R E+ E +EE C++ R + S F L ++ L +N VS++
Sbjct: 44 KKRRTSPEEGEEESFPAEEQAKKQPCSY---RRVVASPFHLTTIRSLGQNSNKDTVSLKG 100
Query: 186 VIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IPHVLVIHG----ESDGTLE-HMKRNKP 238
++ +I NY+ DID+L+ A + + V VIHG E+ L+ +
Sbjct: 101 LLGDPLIKECWEFNYLHDIDFLMSAFDSDVRHLIKVHVIHGFWKKENTNRLQIQSDAARY 160
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQ-DFPL 296
N H LP FGTHHSK M+L+ II+HTANLI DW+N +Q W+ PL
Sbjct: 161 PNITTHHAYLPEPFGTHHSKMMVLLRADDTAEIIIHTANLIPRDWSNMTQAAWISPRLPL 220
Query: 297 ----KDQNNLSEECG--------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
QNN S F+ D ++YL + + A N I+ K+
Sbjct: 221 LKPDAQQNNSSPRSSLPAGSGEKFKIDFLNYLRSYR---------AACNPLIDQ--LAKY 269
Query: 345 NFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKG------------FKKS 390
+FSS LIASVPG H+ +WG ++ L+ + +K
Sbjct: 270 DFSSIRGSLIASVPGRHSLVDDFPTRWGWAAMKETLKSVPVRQAGDRVQGGGDVDDSEKP 329
Query: 391 PLVYQFSSLGSLD--EKWMAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLE 445
+V Q SS+ +L + W+ SG KT L I++PT +++R SL+
Sbjct: 330 EVVIQISSIATLGPTDNWLKNTLFEALSGSQGPKTLLSSKSKPDFKIIFPTPDEIRKSLD 389
Query: 446 GYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS--------------HTGRSRAMPHIK 487
GYA+G +I S Q+ +L+ + W GR+RA PHIK
Sbjct: 390 GYASGGSIHTKIQSAQQAKQLQYLRPIFCHWANDSADGVGTTTTTPIREAGRNRAAPHIK 449
Query: 488 TFARYNGQK----LAWFLLTSANLSKAAWGALQ-KNNS---QLMIRSYELGVLILPSAKR 539
TF R+ + W LLTSANLSK AWG Q KNN+ Q+ I SYE+GVL+ P
Sbjct: 450 TFIRFANHNTKNSIDWALLTSANLSKQAWGDAQSKNNAGEPQVRICSYEIGVLVWPELFA 509
Query: 540 HGCGFSCTSN------IVPSEI-KSGSTETSQIQKTKLVTLTWHGSSDAG---------- 582
G S S +VP+ + + ++ S+ +T L+ +S +G
Sbjct: 510 DSDGTSSGSKTGQKAVMVPTFLTDTPASHGSEKDRTSLLGEKQGSASTSGNGEEDGKGDD 569
Query: 583 -----ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 627
+S+ VV L +PY LP QRY ++VPW + + D GQVW RH
Sbjct: 570 EKEEKSSTVVVGLRMPYNLPLQRYGLQEVPWVATANHLEPDWMGQVW-RH 618
>gi|195435334|ref|XP_002065649.1| GK15563 [Drosophila willistoni]
gi|194161734|gb|EDW76635.1| GK15563 [Drosophila willistoni]
Length = 572
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 133/394 (33%), Positives = 208/394 (52%), Gaps = 52/394 (13%)
Query: 173 PAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGES 226
P + V++++++ G+I + N+MVDI WLL +LAK ++V++G+
Sbjct: 154 PTHSEPLSVTLQEILDESLGEIESTVQINFMVDIGWLLGHYYFAGILAK--PLIVLYGDE 211
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNK 285
L ++ + KP + K +P F T H+K MLL Y G +R+++ TANL DW+N+
Sbjct: 212 SPELLNISKLKPQVTAI-KVQMPTPFATSHTKMMLLAYTDGSMRVVISTANLYEDDWHNR 270
Query: 286 SQGLWMQ-DFPLKDQNNLSEEC---------GFENDLIDYLSTLKWPEFSANLPAHGNFK 335
+QG+W+ P LSEE GF+ DL+ YL K + +
Sbjct: 271 TQGVWISPRLPA-----LSEEADTAAGESKTGFKQDLMLYLVEYKLTQLQPWI------- 318
Query: 336 INPSFFKKFNFSSAAVRLIASVPGYHTGSSLK--KWGHMKLRTVLQE--CTFEKGFKKSP 391
+ +K +FS+ V LIASVPG H S++ WGH +L ++L + E + P
Sbjct: 319 ---ARIRKSDFSAINVFLIASVPGGHREGSVRGHPWGHARLGSLLAKHAAPIED---RIP 372
Query: 392 LVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGY 447
+V Q SS+GSL A + + +D + +G L +++P+ +V S +G
Sbjct: 373 VVCQSSSIGSLGPNVQAWIQQDFVNSLRKDSSTVGRLRQLPPFKMIYPSFGNVSRSHDGM 432
Query: 448 AAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTS 504
G +P + DK +LK++ +WK+ R++AMPHIK + RYN Q + WF+LTS
Sbjct: 433 LGGGCLPYGKNTNDKQPWLKEHLQQWKSGDRYRNQAMPHIKCYTRYNLENQSVYWFVLTS 492
Query: 505 ANLSKAAWGALQKNNS---QLMIRSYELGVLILP 535
ANLSKAAWG+ KN++ L I +YE GVL LP
Sbjct: 493 ANLSKAAWGSFNKNSNIQPCLRIANYEAGVLFLP 526
>gi|24581359|ref|NP_523465.2| glaikit [Drosophila melanogaster]
gi|37999816|sp|Q9VQM4.1|TYDP1_DROME RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase; AltName: Full=Protein glaikit
gi|7295840|gb|AAF51141.1| glaikit [Drosophila melanogaster]
gi|15292079|gb|AAK93308.1| LD37277p [Drosophila melanogaster]
gi|220946228|gb|ACL85657.1| gkt-PA [synthetic construct]
Length = 580
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 131/374 (35%), Positives = 190/374 (50%), Gaps = 35/374 (9%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K P +L+ ES L K +
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQV 233
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
I K P P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 234 TAIRVKMPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P+ E GF+ DL+ YL K + + + + +FS+ V
Sbjct: 292 PVDADTGAGESLTGFKQDLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 342 LGSVPGGHREGSVRGHPWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPVGKLRQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---Q 521
Y +WK+S RSRAMPHIK++ R+N Q + WF+LTSANLSKAAWG KN++
Sbjct: 461 DYLQQWKSSDRFRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPC 520
Query: 522 LMIRSYELGVLILP 535
L I +YE GVL LP
Sbjct: 521 LRIANYEAGVLFLP 534
>gi|308462649|ref|XP_003093606.1| hypothetical protein CRE_02619 [Caenorhabditis remanei]
gi|308249623|gb|EFO93575.1| hypothetical protein CRE_02619 [Caenorhabditis remanei]
Length = 462
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 147/473 (31%), Positives = 225/473 (47%), Gaps = 109/473 (23%)
Query: 182 SIRDVIQGDI--IVAILSNYMVDIDWLLPACP--VLAKIPHVLVIHGESDGTLEHMKRNK 237
S+ D++ DI I ++ N+M+D ++L+ + P + P LV+ L
Sbjct: 67 SLEDIL-ADIRPISSLHMNFMIDFEFLVNSYPPSLRTTTPITLVVGAPDVSDLRKSTLQY 125
Query: 238 PANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
P N +H LPI FGTHHSK +L G + +IV TANLI DW K+Q + +
Sbjct: 126 P-NVTVHSASLPIPFGTHHSKLSILESDDGFIHVIVSTANLISDDWEFKTQQFYYA-MGM 183
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP-SFFKKF----NFSSAAV 351
+ ++ E F+ DLI+YLS + NP S +KK +FS+
Sbjct: 184 RREDEF-ERSPFQEDLIEYLS----------------YYSNPLSTWKKLIESTDFSTVTD 226
Query: 352 RLIASVPGYHTGSS-LKKWGHMKLRTVL-QECTFEKGFK---KSPLVYQFSSLGSLDEKW 406
RLI S PGYHT + + GH +L T+L Q+ F+ ++ + + Q SS+GSL
Sbjct: 227 RLIFSTPGYHTDPQHVSRLGHPRLSTILSQKFPFDPKYEHTDRCTFIAQCSSIGSL---- 282
Query: 407 MAELSSSMSSGFS-------EDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIPSP 456
S+ SS F E P +P +V+P VEDVR S +GYA G ++P
Sbjct: 283 ----GSAPSSWFRGQFLKSLEAANPAPKNKPPKMYLVFPCVEDVRNSCQGYAGGGSVPYR 338
Query: 457 QKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGAL 515
D+ +L+ + KW+++ R++A+PH KT+ +Y+ + W LLTSAN+SKAAWG +
Sbjct: 339 NSVHDRQKWLQDFMCKWRSNTKRRTKAVPHCKTYVKYDQKIAQWQLLTSANVSKAAWGEM 398
Query: 516 ----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLV 571
+KN QLMIRS+E+GVLI T+ S+
Sbjct: 399 SFSKKKNVDQLMIRSWEIGVLI--------------------------TDPSRFN----- 427
Query: 572 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+P++ P YS D P++ D+++ + D+ G VW
Sbjct: 428 --------------------IPFDYPCVPYSPTDRPFTTDQKHEQPDILGCVW 460
>gi|389628810|ref|XP_003712058.1| hypothetical protein MGG_06176 [Magnaporthe oryzae 70-15]
gi|351644390|gb|EHA52251.1| hypothetical protein MGG_06176 [Magnaporthe oryzae 70-15]
gi|440474085|gb|ELQ42852.1| tyrosyl-DNA phosphodiesterase 1 [Magnaporthe oryzae Y34]
gi|440485911|gb|ELQ65827.1| tyrosyl-DNA phosphodiesterase 1 [Magnaporthe oryzae P131]
Length = 555
Score = 175 bits (443), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 154/507 (30%), Positives = 229/507 (45%), Gaps = 78/507 (15%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAKIPHV 219
S FRL R++ L N + + D+I GD ++A NY+ DI++LL A +
Sbjct: 83 SPFRLTRIRDLGEEDNADALGLNDII-GDPLIAECWDFNYLHDIEFLLDALDQDVRDVVK 141
Query: 220 LVI------HGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 272
+ + + L K N +LH LP FGTHHSK ++L+ + ++I+
Sbjct: 142 VHVVHGFWKKDDPSRILLQDDAEKHKNVVLHTAFLPEIFGTHHSKMLVLLRHDDTAQVII 201
Query: 273 HTANLIHVDWNNKSQGLWMQ-DFPL---------KDQNNLSEECG--FENDLIDYLSTLK 320
HTAN+I DW N + G+W+ PL NL+E G F+ DL++YL
Sbjct: 202 HTANMIPKDWTNMTNGIWLSPRLPLLQGQDPADASQYENLAEGTGYKFKIDLLNYLRA-- 259
Query: 321 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLRTVL 378
+ + N +K++FSS LIASVPG H T S WG + ++ L
Sbjct: 260 ---YDDKRVVCRDLVTN---LEKYDFSSIRGTLIASVPGRHDFTDLSTSAWGWVAIKRAL 313
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLD--EKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWP 435
+ + G KS +V Q SS+ +L + W+ L SM G + P + I++P
Sbjct: 314 RSVPLQVG--KSEVVTQISSIATLGPTDTWLQRTLFESMCRGKTTGVAPRP--QFKIIFP 369
Query: 436 TVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS--------------HT 477
T +++R SL+GY +G +I S Q+ + K W
Sbjct: 370 TADEIRRSLDGYGSGGSIHTKIQSSQQAKQLIYQKPLLCHWANDSPHGQDLGQNIPILDA 429
Query: 478 GRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSA 537
GR+RA PHIKT+ RY + W LL+SANLSK AWG SQ I S+E+GVL+ P
Sbjct: 430 GRNRAAPHIKTYIRYGANSIDWALLSSANLSKQAWGDATGAGSQTRISSWEIGVLVWPE- 488
Query: 538 KRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELP 597
++ + +K +T + T L VV L PY LP
Sbjct: 489 ------LFAKDALMTTVVKK---DTPSRETTNLC-----------PGRPVVGLRSPYSLP 528
Query: 598 PQRYSSEDVPWSWDKRYTKKDVYGQVW 624
Q+Y + +VPW Y++ D G W
Sbjct: 529 VQKYGNGEVPWVATLSYSEPDWAGNTW 555
>gi|195034799|ref|XP_001988977.1| GH11458 [Drosophila grimshawi]
gi|193904977|gb|EDW03844.1| GH11458 [Drosophila grimshawi]
Length = 590
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 146/450 (32%), Positives = 219/450 (48%), Gaps = 69/450 (15%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+M+DI WLL +L K +LV++G+ L + + KP
Sbjct: 185 ILDESLGEIESTVQINFMIDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQ 242
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DFPL- 296
+ + +P F T H+K MLL Y G +R+++ TANL DW+N++QGLW+ P
Sbjct: 243 VTAV-RVKMPTPFATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWISPRLPAL 301
Query: 297 -KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
+D + + E GF+ DL+ YL K + + + +K +FS+ V L
Sbjct: 302 AEDADTAAGESATGFKQDLMLYLVEYKLSQLQPWI----------ARIRKSDFSAVNVFL 351
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
I SVPG H +++ WG +L ++L + + P+V Q SS+GSL A +
Sbjct: 352 IGSVPGGHREGAVRGHPWGCARLGSLLAKHATPVE-DRIPVVCQSSSIGSLGANVQAWIQ 410
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
S +D TPLG L +++P+ +V S +G G +P + DK +LK
Sbjct: 411 QDFVSNLRKDSTPLGRLRQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYGRNTNDKQPWLK 470
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKN-NSQ-- 521
+ +WK+ RS+AMPHIK++ R+N Q + WF+LTSANLSKAAWG+ KN N Q
Sbjct: 471 AHLQQWKSGDRHRSQAMPHIKSYTRFNLEEQCIYWFVLTSANLSKAAWGSFNKNPNIQPC 530
Query: 522 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 581
L I +YE GVL LP F P G+S
Sbjct: 531 LRIANYEAGVLFLPR-------FVTGEETFPL-----------------------GNSRN 560
Query: 582 GASSEVVYLPVPYELPPQRYSSEDVPWSWD 611
G V P+PY++P Y ++D P+ D
Sbjct: 561 G----VPAFPLPYDVPLTPYGADDKPFLMD 586
>gi|195576262|ref|XP_002077995.1| GD23212 [Drosophila simulans]
gi|194190004|gb|EDX03580.1| GD23212 [Drosophila simulans]
Length = 580
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 130/374 (34%), Positives = 190/374 (50%), Gaps = 35/374 (9%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G+I + N+MVDI WLL +L K P +L+ ES L K +
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLSIGKFKQQV 233
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
I K P P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 234 TAIRVKMPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPAL 291
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P+ E GF+ DL+ YL K + + + + +FS+ V
Sbjct: 292 PVDADTGAGESLTGFKQDLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFF 341
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 342 LGSVPGGHREGSVRGHPWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWIQ 400
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G + +++P+ +V S +G G +P + DK +LK
Sbjct: 401 QDFVNSLKKDSTPVGKLRQMPPFKMIYPSYGNVSGSHDGMLGGGCLPYGKNTNDKQPWLK 460
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---Q 521
Y +WK+S RSRAMPHIK++ R+N Q + WF+LTSANLSKAAWG K+++
Sbjct: 461 DYLQQWKSSDRYRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKSSNIQPC 520
Query: 522 LMIRSYELGVLILP 535
L I +YE GVL LP
Sbjct: 521 LRIANYEAGVLFLP 534
>gi|321478262|gb|EFX89219.1| hypothetical protein DAPPUDRAFT_310135 [Daphnia pulex]
Length = 580
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 135/407 (33%), Positives = 207/407 (50%), Gaps = 48/407 (11%)
Query: 161 PSTFRLLRVQGLP-AWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPA-CPVLAK 215
P + L ++ +P W + ++ D++ G + ++ N+MV++ WLL C +
Sbjct: 151 PVCYFLSSIENVPETWDQSLTLTFSDLLHPSLGVLQESVQFNFMVELGWLLAQYCQHKVQ 210
Query: 216 IPHVLVIHG-ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVH 273
+LVI+G ES+ R + I KP P FG+HH+K ++ Y G +RI+VH
Sbjct: 211 RKPMLVIYGTESEELAAAQSRVPTLHTIRVKPKYP--FGSHHTKMSMMSYEDGNLRIVVH 268
Query: 274 TANLIHVDWNNKSQGLWMQDF--PLKDQNN-----------LSEECGFENDLIDYLSTLK 320
T NLI DW +++QGLW+ PL ++N GF+ DLI YL
Sbjct: 269 TGNLIESDWEDRTQGLWISPSCPPLSSKDNEKIGDGDSIGDGDSITGFKRDLIRYLE--- 325
Query: 321 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS-----LKKWGHMKLR 375
S +L A K ++ + SS V I S PG H S + KWGH+ L
Sbjct: 326 ----SYSLSA---LKPWIEKIRQADMSSIKVCFIPSSPGSHAIQSEANEKVPKWGHLHLS 378
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSLD---EKWMA-ELSSSMSSGFSEDKTPLGIGEPL 431
+LQ+ + ++ Q SS+GSL W+A EL SM G S T LG
Sbjct: 379 WLLQQHASSEA--DDSIIMQCSSIGSLGPSPSSWLAGELGVSM--GASSGVTKLGQKNVQ 434
Query: 432 IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFA 490
+V+P +DV+ S+ G G +P S Q + + + + KW++ R+ AMPHIK++A
Sbjct: 435 VVYPCFQDVKSSIHGLLGGGCLPYSHQGHNKQTWFTGFLHKWRSDSRLRTTAMPHIKSYA 494
Query: 491 RYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
R + + ++F+LTSAN+SKAAWG +++LMI+S+E GVL LP
Sbjct: 495 RVSSDMSRASFFVLTSANVSKAAWGMRINKDTKLMIQSFEAGVLFLP 541
>gi|170040309|ref|XP_001847946.1| tyrosyl-dna phosphodiesterase [Culex quinquefasciatus]
gi|167863873|gb|EDS27256.1| tyrosyl-dna phosphodiesterase [Culex quinquefasciatus]
Length = 615
Score = 172 bits (436), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 135/438 (30%), Positives = 211/438 (48%), Gaps = 58/438 (13%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPH-VLVIHGESDGTLEHMKRNKPANWILHKPP 247
G++ ++ N+MVDI WLL +L+++G+ L+ + KP N K
Sbjct: 217 GELECSVQMNFMVDIGWLLGHYFFAGYEDRPLLILYGDESPELKTVSTKKP-NVTALKVH 275
Query: 248 LPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD-FPL----KDQNN 301
+ FG HH+K L Y G +R+++ TANL D++N++QGLW+ P D
Sbjct: 276 IATPFGVHHTKMGLYGYTDGSMRVVISTANLYEDDFHNRTQGLWISPRLPALAEDADTGA 335
Query: 302 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH 361
GF LI YL++ K+ + +A + S ++ +F V +AS+PG H
Sbjct: 336 GESRTGFRESLITYLNSYKFAQLAAWV----------SRIQRTDFGEVNVFFVASIPGGH 385
Query: 362 TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
++ WGH +L +L + + PLV Q SS+GSL + + S + + F
Sbjct: 386 LNTAKGPLWGHPRLGYLLGKHSAPID-DACPLVAQSSSIGSLGPNPQSWVLSEIMASFRR 444
Query: 421 DKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASH 476
D P+G+ +++P+ +VR S + G +P + +K +LK + +WK+
Sbjct: 445 DSAPVGLRRVPSFRMIFPSFSNVRNSHDNLLGGGCLPYMRATHEKQPWLKDHLHQWKSDC 504
Query: 477 TGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGVLI 533
R++A+PHIKT+ R++ + L WFLLTSANLSKAAWG K+ + L I SYE+GVL
Sbjct: 505 RNRTKAVPHIKTYCRWSHRGLYWFLLTSANLSKAAWGVYNKSAKFEAPLRINSYEVGVLF 564
Query: 534 LPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVP 593
LP F N P E KS G + A P+P
Sbjct: 565 LPK-------FVIDENFFPMESKSS------------------GDNKHPA------FPMP 593
Query: 594 YELPPQRYSSEDVPWSWD 611
Y++P Y+ ED P+ D
Sbjct: 594 YDVPIIPYAPEDSPFFMD 611
>gi|7529314|emb|CAB86488.1| Glaikit protein [Drosophila melanogaster]
Length = 580
Score = 171 bits (434), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 131/375 (34%), Positives = 191/375 (50%), Gaps = 37/375 (9%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHG-ESDGTLEHMKRNKP 238
I D G+I + N+MVDI WLL +L K +LV++G ES L K +
Sbjct: 175 ILDESLGEIESTVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLSIGKFKQQ 232
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD---- 293
I K P P F T H+K M L Y G +R+++ TANL DW+N++QGLW+
Sbjct: 233 VTAIRVKMPTP--FATSHTKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPA 290
Query: 294 FPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVR 352
P+ E GF+ D + YL K + +P + +FS+ V
Sbjct: 291 LPVDADTGARESLTGFKQDRMLYLVEYKISQLQPWIPR----------IRNSDFSAINVF 340
Query: 353 LIASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 410
+ SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A +
Sbjct: 341 FLGSVPGGHREGSVRGHPWGHARLASLLAKHAAPID-DRIPVVCQSSSIGSLGANVQAWI 399
Query: 411 SSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFL 465
+ +D TP+G + +++P+ +V S +G G +P N ++ +L
Sbjct: 400 QQDFVNSPKKDSTPVGKLRQMPPFKMIYPSYGNVAGSHDGMLGGGCLPYGKNTNDNQPWL 459
Query: 466 KKYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS--- 520
K Y +WK+S RSRAMPHIK++ R+N Q + WF+LTSANLSKAAWG KN++
Sbjct: 460 KDYLQQWKSSDRFRSRAMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQP 519
Query: 521 QLMIRSYELGVLILP 535
L I +YE GVL LP
Sbjct: 520 CLRIANYEAGVLFLP 534
>gi|195388525|ref|XP_002052930.1| GJ17827 [Drosophila virilis]
gi|194149387|gb|EDW65085.1| GJ17827 [Drosophila virilis]
Length = 592
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 141/450 (31%), Positives = 211/450 (46%), Gaps = 69/450 (15%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
I D G I ++ N+M+DI WLL +L K +LV++G+ L + + KP
Sbjct: 187 ILDESLGKIESSVQINFMIDIGWLLGHYYFAGILDK--PLLVLYGDESPDLLGIGKFKPQ 244
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD----F 294
+ K +P F T H+K MLL Y G +R+++ TANL DW+N++QGLW+
Sbjct: 245 VTAI-KVNMPTPFATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWISPRLPAL 303
Query: 295 PLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
P E GF+ DL+ YL K + + + +K +FS+ V L
Sbjct: 304 PEGADTAAGESPTGFKQDLMLYLVEYKVSQLQPWI----------ARIRKSDFSAVNVFL 353
Query: 354 IASVPGYHTGSSLK--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELS 411
I SVPG H S+++ WG +L ++L + + P+V Q SS+GSL A +
Sbjct: 354 IGSVPGGHRESAVRGHPWGCARLGSLLAKHAAPVD-DRIPVVCQSSSIGSLGANVQAWIQ 412
Query: 412 SSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLK 466
+ +D TP+G L +++P+ +V S +G G +P + DK +LK
Sbjct: 413 QDFVNNLRKDSTPVGRLRQLPPFKMIYPSFGNVSRSHDGMLGGGCLPYSKNTNDKQPWLK 472
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---Q 521
+ +WK+ RS+AMPHIK++ R+N Q + WF+LTSANLSKAAWG+ KN+
Sbjct: 473 AHLQQWKSGDRHRSQAMPHIKSYTRFNLEQQCVYWFVLTSANLSKAAWGSFNKNSQIQPC 532
Query: 522 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 581
L I +YE GVL LP F P
Sbjct: 533 LRIANYEAGVLFLPR-------FVTGEETFPL---------------------------G 558
Query: 582 GASSEVVYLPVPYELPPQRYSSEDVPWSWD 611
A V P+PY++P Y +D P+ D
Sbjct: 559 NARDGVPAFPLPYDVPLTPYGPDDTPFLMD 588
>gi|336270704|ref|XP_003350111.1| hypothetical protein SMAC_01002 [Sordaria macrospora k-hell]
Length = 624
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 164/548 (29%), Positives = 245/548 (44%), Gaps = 97/548 (17%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHV 219
S F L ++ L +N +S++ ++ +I+ NY+ +ID+L+ A + + V
Sbjct: 91 SPFHLTTIRSLGQASNKDTISLKHLLGDPLIIECWEFNYLHNIDFLMNAFDEDIRHLVKV 150
Query: 220 LVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
V+HG E L+ ++ N H LP FGTHHSK M+L II+H
Sbjct: 151 HVVHGFWKKEDPNRLQIQSDTDRYPNITTHHAYLPEPFGTHHSKLMVLFRLDDTAEIIIH 210
Query: 274 TANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEECG-------------FENDLIDYLSTL 319
TANLI DW N + G W+ PL + FE D ++YL +
Sbjct: 211 TANLIPKDWGNMTNGAWISPRLPLLKADTQQPASSTRSSPPAAGSGEKFEIDFLNYLRSY 270
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTV 377
+ +A P K++FSS LIASVPG H+ + +WG ++
Sbjct: 271 R----TACKPLVDQLS-------KYDFSSIRGSLIASVPGRHSLVDNFPTRWGWAAMKET 319
Query: 378 LQECTFEKGF-------KKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIG 428
L+ + +K+ +V Q SS+ +L + W L S++ S + P +
Sbjct: 320 LKSVPVRQTADRDHNKSEKAEMVIQISSIATLGPTDNW---LKSTLFEALSGSQGPKTLS 376
Query: 429 EP------LIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS--- 475
+++PT +++R SL+GY++G +I S Q+ +L+ + W
Sbjct: 377 SSSKKPDFKVIFPTPDEIRKSLDGYSSGGSIHTKIQSAQQAKQLQYLRPIFCHWANDSAD 436
Query: 476 ------------HTGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQ-KN 518
GR RA PHIKTF RY QK + W LLTSANLSK AWG Q KN
Sbjct: 437 GGDDTTTTVPIREAGRQRAAPHIKTFIRYTNQKTKDRIDWALLTSANLSKQAWGDAQSKN 496
Query: 519 NS---QLMIRSYELGVLILPSA-KRHGCGFSCTSNIVP----------SEIKSGSTETSQ 564
N+ Q+ I SYE+GV++ P G G + +VP S K G++ +
Sbjct: 497 NAGEPQVRICSYEIGVMVWPELFADSGGGEKRKAVMVPTFLTDTPTGLSSSKDGTSLAGE 556
Query: 565 IQKTKLVT-----LTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 619
TK T G + S+ VV L +PY LP QRY ++VPW + + D
Sbjct: 557 RGGTKSATRDGEDGGAGGDEEEDESTVVVGLRMPYNLPLQRYGPQEVPWVATANHLEPDW 616
Query: 620 YGQVWPRH 627
GQVW RH
Sbjct: 617 MGQVW-RH 623
>gi|367033183|ref|XP_003665874.1| hypothetical protein MYCTH_2310031 [Myceliophthora thermophila ATCC
42464]
gi|347013146|gb|AEO60629.1| hypothetical protein MYCTH_2310031 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 160/575 (27%), Positives = 252/575 (43%), Gaps = 125/575 (21%)
Query: 130 KKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQG 189
K+ R Q ++ E ++ SR S FRL +++ LP N ++++D++ G
Sbjct: 46 KRRRAQSLEETEPARSPS-------ASRRVFDSPFRLTKIRDLPREMNKDTITLKDIL-G 97
Query: 190 DIIVAIL--SNYMVDIDWLLPAC-PVLAKIPHVLVIHG-----ESDGTLEHMKRNKPANW 241
D ++A NY+ DID+L+ A P + + V V+HG + +G ++ N
Sbjct: 98 DPLIAECWEFNYLHDIDFLMAAFDPDVRHLVKVHVVHGFWKREDPNGLELQEAASRFQNV 157
Query: 242 ILHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQ 299
LH +P +GTHHSK M+L+ +I++HTAN+I DW N +Q +W+ PL +
Sbjct: 158 TLHSAFMPEMYGTHHSKMMILLRRDDTAQIVIHTANMIIRDWTNMTQAVWLSPRLPLMEP 217
Query: 300 NNLS---EECG------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+ EE F+ D ++YL A + K++FS+
Sbjct: 218 SRCDARPEEVAAGSGAKFKIDFLNYL--------RAYDTRRTTCRPIIDQLSKYDFSAIR 269
Query: 351 VRLIASVPGYHT--GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKW 406
LIASVPG H +S +WG + L+ ++S + Q SS+ +L + W
Sbjct: 270 GSLIASVPGRHKLDDTSPTRWGWAAMEQALKSVPVSS--RRSDIAIQISSIATLGPTDTW 327
Query: 407 MAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAI----PSPQKN 459
L S+ S + + +P +++PT +++R SL+GY++G +I SPQ+
Sbjct: 328 ---LKSTFFRSLSGGRPGGTLQQPPNFQVIFPTPDEIRKSLDGYSSGASIHTKVQSPQQV 384
Query: 460 VDKDFLKK---YWAKWKAS----------------------------------HTGRSRA 482
+L+ +WA A+ GR RA
Sbjct: 385 KQLAYLRPMLYHWANDSANGADPQEGGGGGERRREDYENDGGDDEGDSAVVVKEAGRKRA 444
Query: 483 MPHIKTFARY---NGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLILP 535
PHIKT+ RY +G + W L+TSANLSK AWG + + I SYE+GVL+ P
Sbjct: 445 APHIKTYIRYGDKSGPSIDWALVTSANLSKQAWGEAAVRGADGGATMRIASYEIGVLVWP 504
Query: 536 SAKRHGC---GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV 592
G G T ++ E+K G+T V L +
Sbjct: 505 GLYGEGAIMRGTFLTDSLGTEEVKEGTT--------------------------AVALRM 538
Query: 593 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 627
PY LP Q Y +VPW Y++ D GQ+W RH
Sbjct: 539 PYNLPLQPYGKGEVPWVATANYSEPDWKGQIW-RH 572
>gi|406865596|gb|EKD18637.1| tyrosyl-DNA phosphodiesterase [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 532
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 158/565 (27%), Positives = 247/565 (43%), Gaps = 95/565 (16%)
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNF----HVSRDKLP----- 161
SR ++++S+D ++ + +D+ D N KN ++ + RD+ P
Sbjct: 10 SRKRRKLSSD--------DEETQSEDDTDQNNKKNLPYSITRSISPPPLRRDREPEVQVA 61
Query: 162 ----STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAK 215
S F+L ++ LP N VS+++++ I NY+ D+++L+ A +
Sbjct: 62 KVLKSPFQLTCIKDLPEAVNKDAVSLKNILGDPTITECWEFNYLHDLEFLMEAFHDDVRD 121
Query: 216 IPHVLVIHG------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV- 268
V V+HG S L+ + P N LH +P FGTHHSK ++L+
Sbjct: 122 RTKVHVVHGFWKSEDASRLNLQAQAKKYP-NITLHTAYMPEMFGTHHSKMLVLLRKYDTA 180
Query: 269 RIIVHTANLIHVDWNNKSQGLWMQDFP--------LKDQNNLSEECGFENDLIDYLSTLK 320
+I++HTAN+ DW+N +Q W+ L+D + F+ D ++YL
Sbjct: 181 QIVIHTANMQAFDWDNMTQAAWISPLLPQIREKELLEDTEPIGSGSRFKFDFLNYLRAYD 240
Query: 321 WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTVL 378
P G K NFS+ L+ASVPG + S K WG L+ L
Sbjct: 241 TKRVICK-PLVGKLM-------KHNFSAIRGALVASVPGKQSIKSDSKTLWGWAGLKKAL 292
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVE 438
+ K+ +V Q SS+ +L EKW+ + + ++ + + IV+PT +
Sbjct: 293 EAVPVRS--KEGEIVIQISSIATLSEKWIDK--TLFAAMSTSKSHGSSKSKFKIVFPTAD 348
Query: 439 DVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA------------SHTGRSRA 482
++R SL GY +G+AI S + LK W S GR RA
Sbjct: 349 EIRRSLNGYNSGSAIHTKIQSHAQARQLQLLKPMLCHWAGDSDEKGPSSAPVSDAGRKRA 408
Query: 483 MPHIKTFARY---NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKR 539
PHIKTF R+ + W L+TSANLSK AWG + I SYE+GVL+ P
Sbjct: 409 APHIKTFIRFPDATRSTIDWMLVTSANLSKQAWGEGTNAAGDVRICSYEIGVLVWPGL-- 466
Query: 540 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 599
F + +VP+ K+ + + S A +E+V +PY+LP
Sbjct: 467 ----FGDNATMVPT-FKTDNPDASA----------------AKPGTELVGARMPYDLPLV 505
Query: 600 RYSSEDVPWSWDKRYTKKDVYGQVW 624
Y +D+PW Y + D GQVW
Sbjct: 506 PYGKDDLPWCATSSYEEPDWKGQVW 530
>gi|317027510|ref|XP_001399437.2| tyrosyl-DNA phosphodiesterase [Aspergillus niger CBS 513.88]
Length = 568
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 145/523 (27%), Positives = 225/523 (43%), Gaps = 107/523 (20%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPA------- 209
+PS +L ++ LPA + NT V +RD++ +I NY+ D+D+L+
Sbjct: 93 IPSPIQLTHIRDLPASSGHNTDTVRLRDILGDPLIRECWQFNYLFDVDFLMSQFDEDVRR 152
Query: 210 --------------CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTH 255
P +I H + + +M P +FGTH
Sbjct: 153 LVKVKVVHGSWKRDAPNRQRIDEACTRHPNVEAIVAYM---------------PEAFGTH 197
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-----F 308
HSK M+L+ + ++++HTAN+I DW N Q +W PL + SE F
Sbjct: 198 HSKMMILLRHDDLAQVVIHTANMIAGDWANMCQAVWRSPLLPLCSDGSGSENIATPGARF 257
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH----- 361
+ DL+ YL +G K P + +K +FS+ LIASVP
Sbjct: 258 KRDLLSYLRE------------YGQRKTGPLVAQLEKHDFSAVRAALIASVPSKQKIRES 305
Query: 362 TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGF 418
T S+ K WG + LR VL+ + +V Q SS+ SL + KW+ ++ + S
Sbjct: 306 TDSNQKTLWGWLALRDVLRSVPVSPSEDRPHIVTQISSVASLGQTDKWLKDVFFASLSPS 365
Query: 419 SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA 474
S + P IV+PT +++R SL GY +G +I S + +++ Y W
Sbjct: 366 SNNPKPRFS----IVFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLCHWAG 421
Query: 475 S----------HTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWGALQKNNSQ 521
GR RA PHIKT+ RY+ ++ W ++TSANLS AWGA N +
Sbjct: 422 DVAEDEVKMKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWGAAVNANGE 481
Query: 522 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 581
+ I S+E+GV++ P G G S ++P + ++I T V
Sbjct: 482 VRICSWEIGVVVWPELI-AGAGAEGRSVMMPCFRRDMPDADAEIPTTTTVGFR------- 533
Query: 582 GASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+PY+LP RY D+PW +++ D GQ W
Sbjct: 534 ----------MPYDLPLTRYGETDIPWCATASHSEPDWLGQTW 566
>gi|380095505|emb|CCC06978.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 666
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 164/548 (29%), Positives = 245/548 (44%), Gaps = 97/548 (17%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHV 219
S F L ++ L +N +S++ ++ +I+ NY+ +ID+L+ A + + V
Sbjct: 133 SPFHLTTIRSLGQASNKDTISLKHLLGDPLIIECWEFNYLHNIDFLMNAFDEDIRHLVKV 192
Query: 220 LVIHG----ESDGTLE-HMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
V+HG E L+ ++ N H LP FGTHHSK M+L II+H
Sbjct: 193 HVVHGFWKKEDPNRLQIQSDTDRYPNITTHHAYLPEPFGTHHSKLMVLFRLDDTAEIIIH 252
Query: 274 TANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEECG-------------FENDLIDYLSTL 319
TANLI DW N + G W+ PL + FE D ++YL +
Sbjct: 253 TANLIPKDWGNMTNGAWISPRLPLLKADTQQPASSTRSSPPAAGSGEKFEIDFLNYLRSY 312
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTV 377
+ +A P K++FSS LIASVPG H+ + +WG ++
Sbjct: 313 R----TACKPLVDQLS-------KYDFSSIRGSLIASVPGRHSLVDNFPTRWGWAAMKET 361
Query: 378 LQECTFEKGF-------KKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIG 428
L+ + +K+ +V Q SS+ +L + W L S++ S + P +
Sbjct: 362 LKSVPVRQTADRDHNKSEKAEMVIQISSIATLGPTDNW---LKSTLFEALSGSQGPKTLS 418
Query: 429 EP------LIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK----- 473
+++PT +++R SL+GY++G +I S Q+ +L+ + W
Sbjct: 419 SSSKKPDFKVIFPTPDEIRKSLDGYSSGGSIHTKIQSAQQAKQLQYLRPIFCHWANDSAD 478
Query: 474 ----------ASHTGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQ-KN 518
GR RA PHIKTF RY QK + W LLTSANLSK AWG Q KN
Sbjct: 479 GGDDTTTTVPIREAGRQRAAPHIKTFIRYTNQKTKDRIDWALLTSANLSKQAWGDAQSKN 538
Query: 519 NS---QLMIRSYELGVLILPSA-KRHGCGFSCTSNIVP----------SEIKSGSTETSQ 564
N+ Q+ I SYE+GV++ P G G + +VP S K G++ +
Sbjct: 539 NAGEPQVRICSYEIGVMVWPELFADSGGGEKRKAVMVPTFLTDTPTGLSSSKDGTSLAGE 598
Query: 565 IQKTKLVT-----LTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 619
TK T G + S+ VV L +PY LP QRY ++VPW + + D
Sbjct: 599 RGGTKSATRDGEDGGAGGDEEEDESTVVVGLRMPYNLPLQRYGPQEVPWVATANHLEPDW 658
Query: 620 YGQVWPRH 627
GQVW RH
Sbjct: 659 MGQVW-RH 665
>gi|358365748|dbj|GAA82370.1| tyrosyl-DNA phosphodiesterase [Aspergillus kawachii IFO 4308]
Length = 585
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 144/529 (27%), Positives = 226/529 (42%), Gaps = 106/529 (20%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPA------- 209
+PS +L ++ LPA + NT V +RD++ +I NY+ D+D+L+
Sbjct: 97 IPSPIQLTHIRDLPASSGHNTDTVRLRDILGDPLIRECWQFNYLFDVDFLMSQFDEDVRR 156
Query: 210 --------------CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTH 255
P +I H + +M P +FGTH
Sbjct: 157 LVKVKVVHGSWKRDAPNRQRIDEACTRHPNVEAITAYM---------------PEAFGTH 201
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-----F 308
HSK M+L+ + ++++HTAN+I DW N Q +W PL ++ SE F
Sbjct: 202 HSKMMILLRHDDLAQVVIHTANMIAGDWANMCQAVWRSPLLPLCSNSSGSESIATPGTRF 261
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH----- 361
+ DL+ YL +G K P + +K +FS+ L+ASVP
Sbjct: 262 KRDLLSYLR------------EYGPKKTGPLVAQLEKHDFSTVRAALVASVPSKQKIRES 309
Query: 362 TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGF 418
T S+ K WG + LR VL+ ++ + +V Q SS+ SL +KW+ ++ + S
Sbjct: 310 TDSTRKTLWGWLALRDVLRSVPIDRSEDRPHIVTQISSVASLGQTDKWLKDVFFTSLSPS 369
Query: 419 SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA 474
S P I++PT +++R SL GY +G +I S + +++ Y W
Sbjct: 370 SNTPKPRFS----IIFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRSYLCHWAG 425
Query: 475 S----------HTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWGALQKNNSQ 521
GR RA PHIKT+ RY+ ++ W ++TSANLS AWGA N +
Sbjct: 426 DGAEDEVKVKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWGAAVNANGE 485
Query: 522 LMIRSYELGVLILPS------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTW 575
+ I S+E+GV++ P A+ C VP + + K + T
Sbjct: 486 VRICSWEIGVVVWPELVTGAGAEGRSVMVPCFRRDVPDADAVAAAGAAANANVKEIPTT- 544
Query: 576 HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
V +PY+LP RYS D+PW +++ D GQ W
Sbjct: 545 ----------TTVGFRMPYDLPLTRYSETDIPWCATASHSEPDWLGQTW 583
>gi|134056346|emb|CAK47581.1| unnamed protein product [Aspergillus niger]
Length = 559
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 143/511 (27%), Positives = 222/511 (43%), Gaps = 92/511 (18%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKI 216
+PS +L ++ LPA + NT V +RD++ +I NY+ D+D+L+
Sbjct: 93 IPSPIQLTHIRDLPASSGHNTDTVRLRDILGDPLIRECWQFNYLFDVDFLMSQ------- 145
Query: 217 PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV----RIIV 272
E + H + +P +FGTHHSK M+L+ + R+++
Sbjct: 146 ------FDEDEACTRHPNVEAIVAY------MPEAFGTHHSKMMILLRHDDLAHEHRVVI 193
Query: 273 HTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-----FENDLIDYLSTLKWPEFSA 326
HTAN+I DW N Q +W PL + SE F+ DL+ YL
Sbjct: 194 HTANMIAGDWANMCQAVWRSPLLPLCSDGSGSENIATPGARFKRDLLSYLRE-------- 245
Query: 327 NLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH-----TGSSLKK-WGHMKLRTVL 378
+G K P + +K +FS+ LIASVP T S+ K WG + LR VL
Sbjct: 246 ----YGQRKTGPLVAQLEKHDFSAVRAALIASVPSKQKIRESTDSNQKTLWGWLALRDVL 301
Query: 379 QECTFEKGFKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIGEPLIVWPT 436
+ + +V Q SS+ SL + KW+ ++ + S S + P IV+PT
Sbjct: 302 RSVPVSPSEDRPHIVTQISSVASLGQTDKWLKDVFFASLSPSSNNPKPRFS----IVFPT 357
Query: 437 VEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS----------HTGRSRA 482
+++R SL GY +G +I S + +++ Y W GR RA
Sbjct: 358 PDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLCHWAGDVAEDEVKMKREAGRRRA 417
Query: 483 MPHIKTFARYNG---QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS--- 536
PHIKT+ RY+ ++ W ++TSANLS AWGA N ++ I S+E+GV++ P
Sbjct: 418 APHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWGAAVNANGEVRICSWEIGVVVWPELIA 477
Query: 537 ---AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVP 593
A+ C +P + + + K + T V +P
Sbjct: 478 GAGAEGRSVMMPCFRRDMPDADAVAAADANANADKKEIPTT-----------TTVGFRMP 526
Query: 594 YELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
Y+LP RY D+PW +++ D GQ W
Sbjct: 527 YDLPLTRYGETDIPWCATASHSEPDWLGQTW 557
>gi|242006203|ref|XP_002423943.1| tyrosyl-DNA phosphodiesterase, putative [Pediculus humanus
corporis]
gi|212507213|gb|EEB11205.1| tyrosyl-DNA phosphodiesterase, putative [Pediculus humanus
corporis]
Length = 447
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 135/434 (31%), Positives = 207/434 (47%), Gaps = 75/434 (17%)
Query: 198 NYMVDIDWLLPACPVLAKI-PHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 256
N+MV++ WL+ + P + +++ DG L ++ + I K P P FG HH
Sbjct: 71 NFMVELPWLMAQYAINDLFNPSMTILYDVQDGDLANIPEHLNIKAIKIKSPYP--FGHHH 128
Query: 257 SKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWM--------QDFPLKDQNNLSEECG 307
+K + Y R +R ++TANLI DW +++QG+W+ D P+ N +
Sbjct: 129 TKMSIFFYTDRSIRFAIYTANLIESDWEDRTQGVWISPKCPYLGDDVPI---NYGESDTL 185
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 367
F+ +++ YL + K PE L KI + + S V ++SVPG S +
Sbjct: 186 FKFEILQYLISYKLPEIRNLL-----IKIQET-----DCSLIKVFFVSSVPG----SVID 231
Query: 368 KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL----DEKWMAELSSSMSSGFSEDKT 423
+G++KL +++E E K +V Q SS+GSL D + E S SS S +
Sbjct: 232 NFGYIKLGKIIKEHAVENSEDKERIVIQCSSIGSLGPAPDSWLLNEFVKSTSSKLSSPQV 291
Query: 424 PLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRA 482
IV+P+V +V S+ G + G +P S ++ + +L KY +W H RS+A
Sbjct: 292 S-------IVYPSVRNVASSIYGLSGGGCLPYSSGTHIKQLWLNKYLMQWYCEHRKRSKA 344
Query: 483 MPHIKTFARYNGQK--LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 540
+PHIKT+AR N K ++WFLLTSANLSKAAWG K + L I SYE GVL LP +
Sbjct: 345 VPHIKTYARINEDKEEISWFLLTSANLSKAAWGKKLK-SGMLQIMSYEAGVLFLPKLLIN 403
Query: 541 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 600
F +I+K ++G E P+PY++P
Sbjct: 404 KNVF-------------------KIKKF---------GYNSGNDDE---FPIPYDIPLTS 432
Query: 601 YSSEDVPWSWDKRY 614
Y D + +DK +
Sbjct: 433 YQETDRLFLFDKNF 446
>gi|17540580|ref|NP_500149.1| Protein F52C12.1 [Caenorhabditis elegans]
gi|37999811|sp|Q9TXV7.1|TYDP1_CAEEL RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase
gi|351063437|emb|CCD71624.1| Protein F52C12.1 [Caenorhabditis elegans]
Length = 451
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 138/458 (30%), Positives = 215/458 (46%), Gaps = 85/458 (18%)
Query: 185 DVIQGDI--IVAILSNYMVDIDWLLPACP-VLAKIPHVLVIHGESDGTLEHMKRNKPANW 241
D I DI I ++ ++M+D ++L+ + P L + P LV+ L +N+
Sbjct: 58 DEILADIRPINSLHFSFMLDFEFLIGSYPPSLREYPITLVVGAPDAPDLLKCTKNQKLVT 117
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
++ LPI FGTHH+K +L G +IV TANL+ DW K+Q + +F +K +
Sbjct: 118 VVGAS-LPIPFGTHHTKMSILEDEDGRFHVIVSTANLVPDDWEFKTQQFYY-NFGVKIAS 175
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
F++DL++YLS + +K +FS + RLI S PGY
Sbjct: 176 GTVPRSDFQDDLLEYLSMYR-----------NQLDTWKQLLQKVDFSQISDRLIFSTPGY 224
Query: 361 HTGSSLKKWGHMKLRTVLQE-CTFEKGFK---KSPLVYQFSSLGSLDE---KWMAE--LS 411
HT ++ GH +L +L E F+ ++ + V Q SS+GSL W L
Sbjct: 225 HTDPPTQRPGHPRLFRILSEKFPFDASYEHTERCTFVAQCSSIGSLGSAPINWFRGQFLQ 284
Query: 412 SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWA 470
S + S + P + +V+P+VEDVR S +GYA G ++P + + +L+
Sbjct: 285 SLEGANPSPKQKPAKM---YLVFPSVEDVRTSCQGYAGGCSVPYRNSVHARQKWLQGNMC 341
Query: 471 KWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRS 526
KW+++ R+ A+PH KT+ +Y+ + W LLTSANLSKAAWG + KN QLMIRS
Sbjct: 342 KWRSNAKRRTNAVPHCKTYVKYDKKVAIWQLLTSANLSKAAWGEVSFNKSKNVEQLMIRS 401
Query: 527 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 586
+E+GVLI T+ S+
Sbjct: 402 WEMGVLI--------------------------TDPSRFN-------------------- 415
Query: 587 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+P++ P YS+ D P+ DK++ K D+ G +W
Sbjct: 416 -----IPFDYPLVPYSATDEPFVTDKKHEKPDILGCIW 448
>gi|322706849|gb|EFY98429.1| tyrosyl-DNA phosphodiesterase 1 [Metarhizium anisopliae ARSEF 23]
Length = 517
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 150/518 (28%), Positives = 239/518 (46%), Gaps = 104/518 (20%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACPVLAK 215
++L S ++L ++ LP N V+++D++ GD +++ NY+ D+ +L+ A +
Sbjct: 51 ERLASPWQLTWIRDLPEELNYDAVTLKDLL-GDPLISDCWEFNYLHDVPFLMDAFDQDTR 109
Query: 216 -IPHVLVIHGESDGTLEHMKRNKP------------ANWILHKPPLPISFGTHHSKAMLL 262
+ +V V+HG KR+ P N LH P+P FGTHHSK M+L
Sbjct: 110 HLVNVHVVHG-------FWKRDDPHRLALTAESSGFDNVKLHVAPMPEMFGTHHSKMMVL 162
Query: 263 I-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ-----NNLSEECG--------F 308
+ II+HTAN+I DW N + +W P Q L E C F
Sbjct: 163 FRHDNTAEIIIHTANMIPKDWTNMTNAVWRT--PRLSQLPPGFRQLQEYCDLPIGSGERF 220
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK- 367
+ DL++YL + + + + +++FSS LIASVPG H L
Sbjct: 221 KADLLNYLKSYDSRKLTC--------RTLIDRLVQYDFSSVKGALIASVPGKHDIHDLSG 272
Query: 368 -KWGHMKLRTVLQECTFEKGFKKSPLVYQ-FSSLGSLDEKWMAELSSSMSSGFSEDKTPL 425
+G ++ L ++G K + L F SL + ++ S FS
Sbjct: 273 TAYGWSGVKRYLSSVPCKEGAKDTWLQKTLFDSLAT------SKTKSLQRPKFS------ 320
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------- 472
IV+PT +++R SL+GYA+G +I S Q+ +L++ W
Sbjct: 321 ------IVFPTADEIRQSLDGYASGASIHTKIQSSQQAQQLGYLRRILHHWANDSPDGIA 374
Query: 473 -----KASHTGRSRAMPHIKTFARYNGQ-KLAWFLLTSANLSKAAWGALQKNNSQLMIRS 526
K + GR RA PHIKT+ RYN + + W +LTSAN+SK AWG + + +L + S
Sbjct: 375 SSPEIKTRNGGRDRAAPHIKTYIRYNEEGSIDWAMLTSANISKQAWGEASRPSGELRVAS 434
Query: 527 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 586
+E+GVL+ P +V ++ T S + K SS A AS
Sbjct: 435 WEIGVLVWP-------------GLVGQDVSMVGTFQSDVPKKP----KEQASSKADASGV 477
Query: 587 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
++ + +PY LP QRY +E+VPW ++++ D +G+ W
Sbjct: 478 LMGVRIPYSLPLQRYGAEEVPWVATMQHSEPDRFGRQW 515
>gi|350634393|gb|EHA22755.1| hypothetical protein ASPNIDRAFT_174927 [Aspergillus niger ATCC
1015]
Length = 581
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 144/529 (27%), Positives = 225/529 (42%), Gaps = 106/529 (20%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPA------- 209
+PS +L ++ LPA + NT V +RD++ +I NY+ D+D+L+
Sbjct: 93 IPSPIQLTHIRDLPASSGHNTDTVRLRDILGDPLIRECWQFNYLFDVDFLMSQFDEDVRR 152
Query: 210 --------------CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTH 255
P +I H + + +M P +FGTH
Sbjct: 153 LVKVKVVHGSWKRDAPNRQRIDEACTRHPNVEAIVAYM---------------PEAFGTH 197
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-----F 308
HSK M+L+ + ++++HTAN+I DW N Q +W PL + SE F
Sbjct: 198 HSKMMILLRHDDLAQLVIHTANMIAGDWANMCQAVWRSPLLPLCSDGSGSENIATPGARF 257
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYH----- 361
+ DL+ YL +G K P + +K +FS+ LIASVP
Sbjct: 258 KRDLLSYLRE------------YGQRKTGPLVAQLEKHDFSAVRAALIASVPSKQKIRES 305
Query: 362 TGSSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGF 418
T S+ K WG + LR VL+ + +V Q SS+ SL + KW+ ++ + S
Sbjct: 306 TDSNQKTLWGWLALRDVLRSVPVSPSEDRPHIVTQISSVASLGQTDKWLKDVFFASLSPS 365
Query: 419 SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKA 474
S + P IV+PT +++R SL GY +G +I S + +++ Y W
Sbjct: 366 SNNPKPRFS----IVFPTPDEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLCHWAG 421
Query: 475 S----------HTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWGALQKNNSQ 521
GR RA PHIKT+ RY+ ++ W ++TSANLS AWGA N +
Sbjct: 422 DVAEDEVKMKREAGRRRAAPHIKTYIRYSSSEMDRIDWAMVTSANLSTQAWGAAVNANGE 481
Query: 522 LMIRSYELGVLILPS------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTW 575
+ I S+E+GV++ P A+ C +P + + + K + T
Sbjct: 482 VRICSWEIGVVVWPELIAGAGAEGRSVMMPCFRRDMPDADAVAAADANANADKKEIPTT- 540
Query: 576 HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
V +PY+LP RY D+PW +++ D GQ W
Sbjct: 541 ----------TTVGFRMPYDLPLTRYGETDIPWCATASHSEPDWLGQTW 579
>gi|312069908|ref|XP_003137901.1| tyrosyl-DNA phosphodiesterase [Loa loa]
Length = 426
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 149/504 (29%), Positives = 216/504 (42%), Gaps = 107/504 (21%)
Query: 137 EQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQG-DIIVAI 195
E D+ K SE + DKL +V GL N + S ++++ + +I
Sbjct: 15 ECDDLESKGSEGKRMKQNCLMDKL----YFNKVVGLAEQYNVNAFSFAELLELISPVASI 70
Query: 196 LSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG-----TLEHMKRNKPANWILHKPPLPI 250
N+M+D+ WLL P + + +I GE G T +K+ N + + L I
Sbjct: 71 HFNFMIDLRWLLTQYPGRLRQGPITLIVGERMGTDFTLTKTAVKQCGVNNVNVGRARLMI 130
Query: 251 SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN 310
FGTHHSK + + + + L D P ++ ++ F+
Sbjct: 131 PFGTHHSKISI--------------------FESNTGRLAAGDCPDRNGSD------FQT 164
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
DL+ YL K + L H +++ + S R++ SVPG H G L K+G
Sbjct: 165 DLVKYLDEYKTSQ-DWGLIEHWRDRVS-----NIDLSQVKARVVYSVPGTHKGVQLTKYG 218
Query: 371 HMKLRTVLQECTFE----KGFKKSPLVYQFSSLGSLDEKWM-AELSSSMSSGFSEDKTPL 425
H +LR +L+E + GF SLG+ + W+ + +S+S G D
Sbjct: 219 HPRLRVILKELFGDVKNMDGFTYHAQCSSLGSLGAAPQYWLTGQFLNSLSGGAETD---- 274
Query: 426 GIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAM 483
GE L I++P VEDVR S EGYAAG + P S V + +L + KW + H GRSRAM
Sbjct: 275 --GEHLRIIYPCVEDVRNSNEGYAAGGSFPYSNSVAVKQPYLLNFMHKWSSDHLGRSRAM 332
Query: 484 PHIKTFARYNGQKL--AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHG 541
PHIKT+A + L +W L+TSANLSKAAWG Q QL IRSYE G+L
Sbjct: 333 PHIKTYAAFAKDSLKPSWLLITSANLSKAAWGDYQSKKPQLTIRSYEFGLLF-------- 384
Query: 542 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 601
SD + + Y +LP +Y
Sbjct: 385 -------------------------------------SDPESLDMLPY-----DLPLTKY 402
Query: 602 SSEDVPWSWDKRYTKKDVYGQVWP 625
D W DK Y K D++ + WP
Sbjct: 403 DDNDRVWIVDKTYRKPDIFRKTWP 426
>gi|341892674|gb|EGT48609.1| hypothetical protein CAEBREN_24547 [Caenorhabditis brenneri]
Length = 451
Score = 168 bits (426), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 125/357 (35%), Positives = 181/357 (50%), Gaps = 45/357 (12%)
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA--NWILHKPPLPISFGTH 255
++M++ D+L+ P + + ++ GE D ++ ++R+ A N + LPI +GTH
Sbjct: 71 SFMIEPDYLMNCYPQSIRSNPITLVVGEPD--VKDLRRSMHAYKNVTVIGASLPIPYGTH 128
Query: 256 HSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLID 314
HSK +L G + +IV +AN+I DW K+Q W + +K + ++ F+NDLI+
Sbjct: 129 HSKLSILEGEDGTIHVIVSSANMISEDWEFKTQQFWY-GYGVKKETQVTGS-EFQNDLIE 186
Query: 315 YL-----STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 369
YL S W E K +FS RLI SVPGYH
Sbjct: 187 YLGYYPSSMNSWTEL----------------IKCTDFSEVKDRLIFSVPGYHKAKK-NSL 229
Query: 370 GHMKLRTVL-QECTFEKGF---KKSPLVYQFSSLGSLD---EKWMAE--LSSSMSSGFSE 420
GHM LR++L F+ F ++ Q SS+GSL W L S +
Sbjct: 230 GHMALRSILIDRFPFDPNFVHTDRTTFFCQCSSIGSLGPTPANWFRGQFLKSLEGAATPP 289
Query: 421 DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGR 479
P + +++P VEDVR S EGYA G ++P + L+ + +WKA R
Sbjct: 290 QNKPARL---FVLFPRVEDVRMSAEGYAGGKSVPYRNSVHQRQLWLQHAFCRWKADKKKR 346
Query: 480 SRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLI 533
+RA+PH KT+ + + W LLTSANLSKAAWG LQK N+ QLMIRSYE+GVL+
Sbjct: 347 TRAIPHCKTYMKIDKDGQKWQLLTSANLSKAAWGELQKVNTANEQLMIRSYEMGVLV 403
>gi|268553849|ref|XP_002634911.1| Hypothetical protein CBG22509 [Caenorhabditis briggsae]
Length = 421
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 123/379 (32%), Positives = 195/379 (51%), Gaps = 35/379 (9%)
Query: 172 LPAWANTSCVSIRDVIQGDI--IVAILSNYMVDIDWLLPACP-VLAKIPHVLVIHGESDG 228
+P + +S+ D++ DI A+ ++M+D +LL + P L P LV+ G SD
Sbjct: 21 VPRQESEGSLSLEDIL-ADIRPTQALHLSFMIDFQYLLNSYPPSLRTTPMTLVV-GASDK 78
Query: 229 TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQ 287
+ N + PLPI FGTHH+K ++ G V +IV TANL+ DW K+Q
Sbjct: 79 AALSRECAAHKNVTVIGAPLPIPFGTHHTKMSIMESEDGRVHVIVSTANLVPDDWEFKTQ 138
Query: 288 GLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFN 345
+ +D ++ C F++DL++YLS F NL + P + +
Sbjct: 139 QFYYACGLRRDGE--AQRCPFQSDLLEYLS------FYRNL-------LTPWRELIQSTD 183
Query: 346 FSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFK---KSPLVYQFSSLGSL 402
FSS RLI S PGYHT + +G R + ++ F+ ++ + + Q SS+GS+
Sbjct: 184 FSSITDRLIFSTPGYHTHVARLNFGPRLARILTEKFPFDPSYEHTERCTFISQCSSIGSI 243
Query: 403 DEKWMAELSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIPSPQK- 458
++ + E P +P +++P VEDVR S +GYA G ++P
Sbjct: 244 GKQPIDWFRGQFLKSL-EGANPAPKSKPAKMYLIFPCVEDVRTSCQGYAGGGSVPYRNSV 302
Query: 459 NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWG----A 514
+V + +L+ KW+++ R+ A+PH KT+ +++ + W L+TSANLSKAAWG +
Sbjct: 303 HVRQKWLQGVMCKWRSNAKRRTHAVPHCKTYVKFDKKVPQWQLVTSANLSKAAWGEASFS 362
Query: 515 LQKNNSQLMIRSYELGVLI 533
K QLM+RSYE+GVLI
Sbjct: 363 KAKKTDQLMVRSYEMGVLI 381
>gi|164425147|ref|XP_962379.2| hypothetical protein NCU06345 [Neurospora crassa OR74A]
gi|157070809|gb|EAA33143.2| hypothetical protein NCU06345 [Neurospora crassa OR74A]
Length = 527
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 167/518 (32%), Positives = 234/518 (45%), Gaps = 101/518 (19%)
Query: 198 NYMVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTLE-HMKRNKPANWILHKPPLPIS 251
NY+ DID+L+ A + + V VIHG E L+ + N H LP
Sbjct: 22 NYLHDIDFLMGAFDSDVRHLIKVHVIHGFWKKEDPNRLQIQSDAARYPNITTHHAYLPEP 81
Query: 252 FGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQDF-----PLKDQNNLSEE 305
FGTHHSK M+L+ II+HTANLI DW+N +Q W+ P QN S
Sbjct: 82 FGTHHSKMMVLLRADDTAEIIIHTANLIPRDWSNMTQAAWISPRLPLLKPDAQQNTSSTR 141
Query: 306 ------CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
CG F+ D ++YL + + A N I+ K++FSS LIASV
Sbjct: 142 SPPPAGCGEKFKIDFLNYLRSYR---------AACNPLIDQ--LAKYDFSSIRGSLIASV 190
Query: 358 PGYHT--GSSLKKWGHMKLRTVLQECTFEKG------------FKKSPLVYQFSSLGSLD 403
PG H+ +WG ++ L+ + +K +V Q SS+ +L
Sbjct: 191 PGRHSLVDDFPTRWGWAAMKETLKSVPVRQAGDRVQGGGDVDDSEKPEVVIQISSIATLG 250
Query: 404 --EKWMAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAI---- 453
+ W+ SG KT L +P I++PT +++R SL+GYA+G +I
Sbjct: 251 PTDNWLKNTLFEALSGSQGPKTLLS-SKPKPDFKIIFPTPDEIRKSLDGYASGGSIHTKI 309
Query: 454 PSPQKNVDKDFLKKYWAKWKAS--------------HTGRSRAMPHIKTFARYNGQK--- 496
S Q+ +L+ + W GR+RA PHIKTF R+ K
Sbjct: 310 QSAQQAKQLQYLRPMFCHWANDSADGVGTTTTTPIREAGRNRAAPHIKTFIRFANHKTKN 369
Query: 497 -LAWFLLTSANLSKAAWGALQ-KNNS---QLMIRSYELGVLILPSAKRHGCGFSCTSNI- 550
+ W LLTSANLSK AWG Q KNN+ Q+ I SYE+GVL+ P G S S +
Sbjct: 370 TIDWALLTSANLSKQAWGDAQSKNNAGEPQVHICSYEIGVLVWPELFADSDGTSSGSKMG 429
Query: 551 -----VPSEIKS-----GSTE---TSQIQKTKLVTLTWHGSSDAGASSE--------VVY 589
VP+ +K GS + +S +K + + +G D E VV
Sbjct: 430 QKAVMVPTFLKDTPAIHGSEKDRPSSLGEKQGPTSTSRNGEKDGKGDDEKEEKSSTVVVG 489
Query: 590 LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 627
L +PY LP QRY ++VPW + + D GQVW RH
Sbjct: 490 LRMPYNLPLQRYGLQEVPWVATANHLEPDWMGQVW-RH 526
>gi|301770839|ref|XP_002920828.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Ailuropoda
melanoleuca]
Length = 205
Score = 165 bits (418), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 136/232 (58%), Gaps = 36/232 (15%)
Query: 399 LGSLDEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-S 455
+G+ D KW+ +E S+ + E +TP PL +++P+VE+VR SLEGY AG ++P S
Sbjct: 1 MGADDSKWLCSEFKESLVTLGKESQTPGRSAVPLHLIYPSVENVRTSLEGYPAGGSLPYS 60
Query: 456 PQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWG 513
Q +++L Y+ KW A +GRS AMPHIKT+ R + ++AWFL+TSANLSKAAWG
Sbjct: 61 IQTAEKQNWLHSYFHKWSADTSGRSNAMPHIKTYMRPSPDFSEIAWFLVTSANLSKAAWG 120
Query: 514 ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL 573
AL+KN +QLMIRSYELGVL LPSA F S V + GS E +
Sbjct: 121 ALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFFGSKEPAAA-------- 166
Query: 574 TWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
PVPY+LPP+ Y S+D PW W+ YTK D +G +W
Sbjct: 167 ----------------FPVPYDLPPELYGSKDRPWIWNIPYTKAPDTHGNMW 202
>gi|296424093|ref|XP_002841585.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637828|emb|CAZ85776.1| unnamed protein product [Tuber melanosporum]
Length = 510
Score = 165 bits (418), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 142/502 (28%), Positives = 228/502 (45%), Gaps = 87/502 (17%)
Query: 157 RDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPAC-PVLA 214
R ++ S F+L RV LP N V IRD+++ G + + NY+ D+DW++ P +
Sbjct: 60 RIRVASPFQLTRVDELPESENVDAVGIRDILRRGPLKEVWIFNYLFDLDWVMNQFDPDVK 119
Query: 215 KIPHVLVIHG-----ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-V 268
V ++HG +++ H + N L +P +GTHHSK +L
Sbjct: 120 DTVKVRIVHGSWRREDANRARIHDQAESYPNVKLVCAFMPEPYGTHHSKMFVLFRTDDHA 179
Query: 269 RIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG--------FENDLIDYLSTL 319
+II+HTAN+I DW N +Q +W PL Q++ S F+ D++ Y S
Sbjct: 180 QIIIHTANMIPFDWQNMTQAVWQSPLLPLLPQDHGSPRAQTFKPIGQRFKTDILAYFSAY 239
Query: 320 KWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGSSLKK---WGHMKLR 375
G + +++F + SVPG +H +S K WG +L
Sbjct: 240 ----------GEGRTDFLTTQLSRYSFDPVKAVFVGSVPGKFHIDASNGKGYEWGWRRLA 289
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAEL--SSSMSSGFSEDKTPLGIGEPL 431
+VL++ K +V Q SS+ +L K W++ + +S +S F+ P +
Sbjct: 290 SVLRKVPLRSPEAKGCIVVQVSSIATLGSKNTWLSPVLFASLKTSRFTASAEP----KFH 345
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR 491
+++PT ++R SL GY +G+++ K+ + + + G +RA PHIKT+ R
Sbjct: 346 VIFPTANEIRESLNGYRSGSSL-----------HMKFQSPAQQAQLG-ARAAPHIKTYIR 393
Query: 492 Y---NGQKLAWFLLTSANLSKAAWGALQK------NNSQLMIRSYELGVLILPSAKRHGC 542
+ + ++ W LLTSAN+S AWGA +K N+ ++ I SYE GVL+ P
Sbjct: 394 FSDTDCTQIDWALLTSANISIQAWGAAEKDPIGRINHREVRICSYEAGVLVYPEILDVEE 453
Query: 543 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 602
+P EI G T AG L +PY LP ++Y+
Sbjct: 454 MVPTFRKDIPDEIGDGGT--------------------AG-------LRMPYGLPLRKYA 486
Query: 603 SEDVPWSWDKRYTKKDVYGQVW 624
S ++PW K Y+ D GQ W
Sbjct: 487 SNEMPWCAYKSYSDVDWLGQRW 508
>gi|312378421|gb|EFR25002.1| hypothetical protein AND_10059 [Anopheles darlingi]
Length = 436
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 136/436 (31%), Positives = 202/436 (46%), Gaps = 58/436 (13%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHV--LVIHGESDGTLEHMKRNKPANWILHKP 246
G + ++ N+MVDI WLL A A +V L+++G+ L + + KP N K
Sbjct: 42 GQLESSVQMNFMVDIGWLL-AHYYFAGYENVPLLILYGDETPELRMVSKKKP-NVTAVKV 99
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 305
+ G HH+K L Y G +RI++ TANL DW+N++QGLW+ P +
Sbjct: 100 DIKTPVGVHHTKMGLYGYRDGSMRIVISTANLYEDDWHNRTQGLWIS--PRLPAVPEDAD 157
Query: 306 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTG 363
F + D+ S L A L A+ ++ P + ++ +FS V L+ASVPG H
Sbjct: 158 TAFGESVTDFRSNLL-----AYLDAYKLTQLQPWIARIRRTDFSDIKVCLVASVPGGHVN 212
Query: 364 SSLKK-WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDK 422
+ WGH +L +L + PLV Q SS+GSL + + + + F +D
Sbjct: 213 TPKGPLWGHARLGYLLTKYAAPID-DSCPLVAQSSSIGSLGPSPESWVLGEIMANFRKDS 271
Query: 423 TPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTG 478
P+GI +++P+ +VR S + G +P + K ++LK Y +W
Sbjct: 272 APIGIRRMPGFRMIYPSYSNVRQSHDSLLGGGCLPYGRATHSKQEWLKTYLHQWFCRSRH 331
Query: 479 RSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGVLILP 535
R++AMPHIKT+ R++ + L WFLLTSANLSK+AWG K L I SYE GVL LP
Sbjct: 332 RNKAMPHIKTYCRWSHRGLYWFLLTSANLSKSAWGVYNKAGRFEKPLRINSYEAGVLFLP 391
Query: 536 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 595
N P E A + P+PY+
Sbjct: 392 K-------LLLDENFFPME----------------------------AGKKDPQFPMPYD 416
Query: 596 LPPQRYSSEDVPWSWD 611
+P Y+ ED P+ D
Sbjct: 417 VPIIPYAPEDTPFFMD 432
>gi|195342204|ref|XP_002037691.1| GM18399 [Drosophila sechellia]
gi|194132541|gb|EDW54109.1| GM18399 [Drosophila sechellia]
Length = 539
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 182/359 (50%), Gaps = 39/359 (10%)
Query: 200 MVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHH 256
MVDI WLL +L K P +L+ ES L K + I K P P F T H
Sbjct: 162 MVDIGWLLGHYYFAGILDK-PLLLLYGDESPELLNIGKFKQQVTAIRVKMPTP--FATSH 218
Query: 257 SKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE-CGFEN 310
+K M L Y G +R+++ TANL DW+N++QGLW+ P+ E GF+
Sbjct: 219 TKMMFLGYSDGSMRVVISTANLYEDDWHNRTQGLWISPKLPALPVDADTGAGESLTGFKQ 278
Query: 311 DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-- 368
DL+ YL K + + + + +FS+ V + SVPG H S++
Sbjct: 279 DLMLYLVEYKISQLQPWI----------ARIRNSDFSAINVFFLGSVPGGHREGSVRGHP 328
Query: 369 WGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLG 426
WGH +L +++ + E + P+V Q SS+GSL A + + +D T +G
Sbjct: 329 WGHARLASLVAKHAAPIED---RIPVVCQSSSIGSLGANVQAWIQQDFVNSLKKDSTSVG 385
Query: 427 IGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSR 481
+ +++P+ +V S +G G +P + DK +LK Y +WK+S RSR
Sbjct: 386 KLRQMPPFKMIYPSYGNVSGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRYRSR 445
Query: 482 AMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 535
AMPHIK++ R+N Q + WF+LTSANLSKAAWG K+++ L I +YE GVL LP
Sbjct: 446 AMPHIKSYTRFNLEDQSVYWFVLTSANLSKAAWGCFNKSSNIQPCLRIANYEAGVLFLP 504
>gi|307109629|gb|EFN57867.1| hypothetical protein CHLNCDRAFT_143337 [Chlorella variabilis]
Length = 370
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 163/314 (51%), Gaps = 46/314 (14%)
Query: 160 LPSTFRLLRVQGLPAWANTSCV--SIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIP 217
L + L+RV+ +P+WAN + S+ ++ G+I ++ N M+D+ WLL ACP L +
Sbjct: 68 LDAPMHLMRVRSIPSWANAGFLGASLSSLVCGNIRWILIQNAMLDLPWLLSACPDLHRAE 127
Query: 218 HVLVI-------------HGESDGTLEHMKRNKPANWIL--------HKPPLPISFGTHH 256
+L++ G TL+ +R L ++P + GT+H
Sbjct: 128 RILLVSHRPWLAKKAKVEEGAKPRTLQARERKLADVRALGLEDRASVYEPAIG-GHGTNH 186
Query: 257 SKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 316
SK L+ Y RG+R+I+ +AN + D NNK+Q L+ QDFP KD+ + + FE L Y+
Sbjct: 187 SKFFLVDYERGMRVIIMSANAVFSDCNNKTQVLFTQDFPRKDEQS-PKTSAFEGALEAYI 245
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRT 376
L+ P G + +FS+A L+ASVPG H G+ L KWGHM++R
Sbjct: 246 RELRMP--------CGPTLHLVQLIRSCDFSAARGHLVASVPGRHKGADLHKWGHMRMRA 297
Query: 377 VLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA-ELSSSMSSGFSEDKT---------PLG 426
VL + F F+ +PL Q SSLG L+E+W+ E S+++G E T PLG
Sbjct: 298 VLCQEAFPARFRGAPLAAQMSSLGLLNERWLVREFRYSLAAGLCEGGTDVLGLPANGPLG 357
Query: 427 IGEPLIVWPTVEDV 440
+ +V+PTVE+V
Sbjct: 358 LQ---LVYPTVEEV 368
>gi|193659893|ref|XP_001947945.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like isoform 1
[Acyrthosiphon pisum]
Length = 684
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 134/455 (29%), Positives = 221/455 (48%), Gaps = 67/455 (14%)
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVL-AKIPHVLVIHGESDGTLEHMKRNK 237
S + D GD+ ++ N+MV++ WL + + + +++ D ++ + + K
Sbjct: 277 SFAELLDKSLGDLSESLHLNFMVELGWLFAQYFITDQRGKKMTLLYERCDEDIDELHKKK 336
Query: 238 PANWILHKPPL-PISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DF 294
+ HK + +FG HSK + Y G +R++V +ANL DW +QG+W+ F
Sbjct: 337 KLLNVRHKKIINKNAFGHQHSKVSMFAYADGSLRVVVMSANLCEDDWTKYAQGIWVSPKF 396
Query: 295 PLKDQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
PLK++++ S+ + F+ D++ YL++ + P + +K +FS A V
Sbjct: 397 PLKEEDDKSDGNSQTDFKIDILRYLNSFREPSLVPWIQK----------IEKVDFSQANV 446
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKW 406
I SVPG HT WGH+ L+ +L++ C + P++ Q SSLGSL DE+W
Sbjct: 447 FFIPSVPGKHTEPL---WGHLYLKNILKKHACLPFCVPSEWPIIAQCSSLGSLGTTDEEW 503
Query: 407 M-AELSSSMSSGFSEDKTPLGIGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK 462
+ +E S+S+ D T +P+ +++P+V++V S +G G +P + +K
Sbjct: 504 LKSEFVESLSASTYCDDTDTD-NDPIPFHLIYPSVKNVLNSWDGALGGICLPYNKILHEK 562
Query: 463 DF-LKKYWAKWKASHTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNN 519
LKKY W+ R++AMPHIKT+ R + +++WFLL SANLSKAAWG K++
Sbjct: 563 QLWLKKYMCLWQCHSRKRTKAMPHIKTYCRISPCLTEMSWFLLGSANLSKAAWGRKLKSD 622
Query: 520 SQL-MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 578
Q I ++E GVL LP F S+ P
Sbjct: 623 EQSNFIMAHEAGVLFLPQ-------FLIGSDTFP-------------------------- 649
Query: 579 SDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 613
D ++ Y +P++LP YS D PW+ R
Sbjct: 650 IDETEPNKFPYFSLPFDLPLAGYSDTDQPWTISTR 684
>gi|307211794|gb|EFN87775.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 441
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 128/437 (29%), Positives = 206/437 (47%), Gaps = 65/437 (14%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D+ G+I+ ++ Y++D++WL + + ++ +++GE E + N A
Sbjct: 49 ILDISFGEIVNSLHLTYILDVEWLCLQYLLAGQSTNMTILYGERRDE-EELDDNITA--- 104
Query: 243 LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLK 297
+H +P FG HHSK M+L Y G+R++V TANL DW N +QG+W+
Sbjct: 105 IHMK-MPFEFGCHHSKIMILQYKDNGIRVVVSTANLFFEDWQNSTQGIWISPHLPRLSKA 163
Query: 298 DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
++N F+ DL YLS+ + P K KK +FS+ V LIAS+
Sbjct: 164 AKHNGESLTNFKKDLQRYLSSYRNPA----------LKRWRKLVKKTDFSAINVCLIASI 213
Query: 358 PGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSS 416
PG H ++ WG+ KL VL Q T K ++ Q S++GS K+ + LS +
Sbjct: 214 PG-HFEHTVDLWGYKKLANVLSQHVTLPPDALKWSIIAQSSAVGSFGPKYGSWLSKEIVW 272
Query: 417 GFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIPSPQKNV--DKDFLKKYWAKW 472
+ + P ++P+V++ S + Y G + S + V + ++K Y +W
Sbjct: 273 SMTRETERDLNNYPKFQFIYPSVKNYEQSFD-YQNGTSCFSYSREVHSKQQWIKSYLYQW 331
Query: 473 KASHTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
KA+ T R +AMPHIK++ R + +++AWF+LTSANLSK AWG ++++ I +YE+G
Sbjct: 332 KAARTERDQAMPHIKSYTRISSDLKRIAWFVLTSANLSKGAWGVQREDD--YYITNYEVG 389
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
+ LP F T + + I
Sbjct: 390 IAFLPKFITRITTFPITDEDLTNSI----------------------------------F 415
Query: 591 PVPYELPPQRYSSEDVP 607
P+PY+LP Y S D P
Sbjct: 416 PIPYDLPLCPYDSSDSP 432
>gi|326476634|gb|EGE00644.1| tyrosyl-DNA phosphodiesterase [Trichophyton tonsurans CBS 112818]
gi|326478089|gb|EGE02099.1| tyrosyl-DNA phosphodiesterase [Trichophyton equinum CBS 127.97]
Length = 588
Score = 161 bits (408), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 150/536 (27%), Positives = 244/536 (45%), Gaps = 85/536 (15%)
Query: 156 SRDKL-PSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP 211
SR K+ PS +L ++ + N CV +RD++ +I NY+ D+D+++
Sbjct: 67 SRQKIIPSPIQLTHIRDISDSTGYNEGCVKLRDILGDPMIKECWQFNYLFDVDYIMGQFD 126
Query: 212 VLAK-IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI 263
K + + +IHG E+ + + KR A ++ P P FGTHHSK M+LI
Sbjct: 127 RDVKDLIQLKIIHGSWKKEAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILI 184
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDY 315
+ +II+HTAN+I DW N +Q +W Q + + CG F+ DL+ Y
Sbjct: 185 RHDNLAQIIIHTANMIPRDWGNMTQAVWRSPLLPLSQAQVCDTCGGFGSSARFKRDLLAY 244
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-----HTGSSLKK 368
L A+ N IN ++++F S LIASVP +
Sbjct: 245 LE------------AYHNKTINTLIRQLQRYDFGSVKAVLIASVPTRLPVKEFDSNRRTL 292
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSLDE--KWMAE--LSSSMSSGFSED 421
WG L+ + ++ ++ ++ Q SS+ +L + +W+ E LSS
Sbjct: 293 WGWPALKDAIGSIPIDRSSSRAQNPHIIVQVSSIATLGQTDRWLKETFLSSLYPQPEVNQ 352
Query: 422 KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKW--- 472
+ I++PT +++R SL+G+ +G +I PS QK + +L++Y W
Sbjct: 353 NRSTSNVKFSIIFPTPDEIRRSLDGHGSGGSIHMKIQSPSQQKQLA--YLRRYLCHWAGD 410
Query: 473 --------------KASHTGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWGAL 515
+ GR RA PHIKT+ R++ + W ++TSANLS AWGA
Sbjct: 411 AEGRKNSDPTTKSDRVREAGRRRAAPHIKTYIRFSDSDMDNIDWAMITSANLSTQAWGAG 470
Query: 516 QKNNSQLMIRSYELGVLILPSAKR----HGCGFSCTSN---IVPSEIKSGSTETSQIQKT 568
+ ++ I S+E+GVLI P R GC S +N ++P K + +Q +
Sbjct: 471 ANTHGEVRICSWEIGVLIWPDLFREEHIEGCSDSSLTNHVKMIPC-FKRNTPSEKPLQSS 529
Query: 569 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ + SDA + V L +PY+LP Y+ ++VPW + + D GQ W
Sbjct: 530 ENDSTKVALHSDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATAVHREPDWMGQTW 584
>gi|171683299|ref|XP_001906592.1| hypothetical protein [Podospora anserina S mat+]
gi|170941609|emb|CAP67263.1| unnamed protein product [Podospora anserina S mat+]
Length = 569
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 154/557 (27%), Positives = 239/557 (42%), Gaps = 121/557 (21%)
Query: 151 CNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLP 208
+H + S F+L +++ LPA N ++RDV+ GD +++ NY+ DID+L+
Sbjct: 49 AKYHPPFKSVGSPFQLTKIKDLPAGLNKDTYTLRDVL-GDPLISECWEFNYLHDIDFLMS 107
Query: 209 ACPV-LAKIPHVLVIHGESDGTLEHMKRNKPA------------NWILHKPPLPISFGTH 255
A + + V V+HG KR P N LH LP FGTH
Sbjct: 108 AFDEDVRSLVKVHVVHG-------FWKREDPNRLALQESAARFNNVTLHAAFLPEMFGTH 160
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPL------KDQNNLSEECG 307
HSK +L+ + ++++HTANLI DW N +QG W PL + + +
Sbjct: 161 HSKMFILLRHDDTAQLVIHTANLITRDWTNMTQGAWFSPRLPLLKPEHDEGRPRIGNGAK 220
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT---GS 364
F+ D ++YL + P + K++FSS LI+SVPG HT +
Sbjct: 221 FKLDFLNYLRA-----YDTKRPTCKDITTK---LMKYDFSSINGSLISSVPGRHTVTQST 272
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLV-YQFSSLGSL--DEKWMAE-----LSSSMSS 416
S +G +++ L + P V Q SS+ +L + W+ L ++ ++
Sbjct: 273 SSTNFGWAAMKSALAAVPIHSTIEHKPEVAIQISSIATLGPTDSWLKNTFLHTLGNTPAT 332
Query: 417 GFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW 472
F +V+PT +++R SL+GY +G +I SPQ+ +LK + W
Sbjct: 333 TFK------------VVFPTPDEIRKSLDGYMSGGSIHTKTQSPQQVKQLQYLKPLFHHW 380
Query: 473 ---------------------------------KASHTGRSRAMPHIKTFARYNGQK--- 496
K ++GR RA PHIKT+ R +
Sbjct: 381 ANDSASGLRMFPPRPLLSPSANAPSPNIAINASKVKNSGRKRAAPHIKTYIRSHRPTPES 440
Query: 497 ------LAWFLLTSANLSKAAWG-ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSN 549
+ W LLTSANLSK AWG AL + + I SYE+GVL+ P + +
Sbjct: 441 SETDIHIDWALLTSANLSKQAWGEALSAKENTVRISSYEIGVLVWPGL------YGENAV 494
Query: 550 IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV--VYLPVPYELPPQRYSSEDVP 607
+ P+ ++ Q + G D EV V L +PY+LP Q Y +VP
Sbjct: 495 MKPAFLEDALPPPEQTRGDG----DGKGKEDYDGKDEVVEVALRMPYDLPLQPYGPGEVP 550
Query: 608 WSWDKRYTKKDVYGQVW 624
W +T+ D G++W
Sbjct: 551 WVATASHTEPDWMGRIW 567
>gi|401428160|ref|XP_003878563.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494811|emb|CBZ30114.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 682
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 167/647 (25%), Positives = 255/647 (39%), Gaps = 198/647 (30%)
Query: 155 VSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVA-------ILSNYMVDIDWLL 207
V + + PS+ LLR +RD+ + D+ +LS+Y+ D+ WLL
Sbjct: 27 VPQGRAPSSCSLLR--------------LRDLFRCDLADPGECWQHILLSSYVTDLRWLL 72
Query: 208 PACPVLAKIPHVLVIHGESDGT---------------------------LEHMKRNKPAN 240
P L+ + LV+ GT + ++ A
Sbjct: 73 ATVPELSAVTGKLVVLSGEKGTATLRRTTGDPSSPYTATSPLMDRVNPFMAALREQARAT 132
Query: 241 WILH-----------KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL 289
LH +PPLP++FGTHH+K L + RG+RI + TANL+ DW KSQG+
Sbjct: 133 SALHTTLSRERLAVLEPPLPVAFGTHHTKMALCVNSRGLRISIFTANLVEQDWCWKSQGI 192
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLST------------LKWPEFSANL--------- 328
++QDFP K S + ++ ++ K EF A+L
Sbjct: 193 YLQDFPWKAATECSNDVAAGATVVKTAASSTSKGGNGSNTLTKGAEFVAHLRNYLMQCGV 252
Query: 329 -------------PAHGNFKI-NPSFFKKFNFSSAAVRLIASVPG---YHTGSSLKKWGH 371
A G I F +FS+AAV LI+SVPG Y + + G
Sbjct: 253 SLTTACASPTDAVSAAGPLGIFETDFLSHIDFSAAAVWLISSVPGTCAYGEVAPGYRVGL 312
Query: 372 MKLRTVLQEC--TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFSEDKTPL 425
+L VL+ T L +Q+SS GSL+ ++ L ++M S TP
Sbjct: 313 CRLAEVLRRSALTMATAPASVDLSWQYSSQGSLNLAFLNSLQAAMCGESVSVIESGDTPR 372
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG------- 478
G+ + +V+PT E+VR S EG+ G ++P + +F+ +W +S G
Sbjct: 373 GVRDVQVVYPTEEEVRNSWEGWRGGGSLPL-RVQCCHEFVNARLHRWGSSEEGHTAKRAF 431
Query: 479 -----------------------------------------RSRAMPHIKTFARYNGQK- 496
R A+PHIK++A +
Sbjct: 432 PRPAKVAAAHASREDAVDVDGVDSDGGEGTTASLTCSCAAYRQFALPHIKSYAAVAPDRS 491
Query: 497 -LAWFLLTSANLSKAAWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNI 550
+ WFLLTSANLS+AAWG+L Q+ + Q ++RSYELGV+ + H S S +
Sbjct: 492 CVRWFLLTSANLSQAAWGSLSRKMNQRGSRQQLVRSYELGVIYDSHSAIHPSASSWFSVV 551
Query: 551 VPSEIKSGSTETSQ--IQKTKLVTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS---- 603
++I+ S S+ + +T L G ++ V L PY L P Y+S
Sbjct: 552 SKTKIELPSARNSRAMLYETPL-----------GVETQNVCLYTPYNLLCPTPYASTAAL 600
Query: 604 ---------------------EDVPWSWDKRYTKKDVYGQVWPRHFQ 629
DVPW D + +D YG + F+
Sbjct: 601 RARRDAPVEGEQAVAGSTLDCSDVPWVLDMPHRGRDAYGLDFEEAFE 647
>gi|302662485|ref|XP_003022896.1| hypothetical protein TRV_02978 [Trichophyton verrucosum HKI 0517]
gi|291186867|gb|EFE42278.1| hypothetical protein TRV_02978 [Trichophyton verrucosum HKI 0517]
Length = 587
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 145/535 (27%), Positives = 240/535 (44%), Gaps = 83/535 (15%)
Query: 156 SRDKL-PSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP 211
SR K+ PS +L ++ + N C+ +RD++ +I NY+ D+D+++
Sbjct: 66 SRQKIIPSPIQLTHIRDISDSTGYNEGCIKLRDILGDPMIKECWQFNYLFDVDYIMGQFD 125
Query: 212 VLAK-IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI 263
K + + +IHG E+ + + KR A ++ P P FGTHHSK M+LI
Sbjct: 126 RDVKDLIQLKIIHGSWKKEAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILI 183
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDY 315
+ ++I+HTAN+I DW N +Q +W Q + + CG F+ DL+ Y
Sbjct: 184 RHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLAQPQVGDTCGVFGSSTRFKRDLLAY 243
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-----HTGSSLKK 368
L A+ N IN ++++F + LIASVP +
Sbjct: 244 LE------------AYNNKTINTLIRQLQRYDFGAVKAMLIASVPTRLPVKEFDSNKRTL 291
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSLDE--KWMAE--LSSSMSSGFSED 421
WG L+ + ++ ++ ++ Q SS+ +L + KW+ E LSS
Sbjct: 292 WGWPALKDAISSIPIDRSSSQAQNPHIIVQVSSIATLGQTDKWLKETFLSSLCPQPEVNQ 351
Query: 422 KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW----- 472
I++PT +++R SL+GY +G +I SP + +L++Y W
Sbjct: 352 SRSTSNARFSIIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQLAYLRRYLCHWAGDAE 411
Query: 473 ------------KASHTGRSRAMPHIKTFARYNGQKL---AWFLLTSANLSKAAWGALQK 517
+ GR RA PHIKT+ R++ + W ++TSANLS AWGA
Sbjct: 412 DPKNSDPATKSDRVREAGRRRAAPHIKTYIRFSDSDMNSIDWAMITSANLSTQAWGAGAN 471
Query: 518 NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKS--------GSTETSQIQKTK 569
+ ++ I S+E+GVL+ P R C+ + + + +K S + Q +
Sbjct: 472 THGEVRICSWEIGVLMWPDLFREKNIEECSDSSLTNYVKMIPCFKRNVPSEKPPQTSEND 531
Query: 570 LVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+T H SDA + V L +PY+LP Y+ ++VPW + + D GQ W
Sbjct: 532 STKVTLH--SDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATAVHREPDWMGQTW 583
>gi|327299128|ref|XP_003234257.1| tyrosyl-DNA phosphodiesterase [Trichophyton rubrum CBS 118892]
gi|326463151|gb|EGD88604.1| tyrosyl-DNA phosphodiesterase [Trichophyton rubrum CBS 118892]
Length = 586
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 143/535 (26%), Positives = 243/535 (45%), Gaps = 83/535 (15%)
Query: 156 SRDKL-PSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP 211
SR K+ PS +L ++ + N C+ +RD++ +I NY+ D+D+++
Sbjct: 65 SRQKIIPSPIQLTHIRDISDSTGYNEGCIKLRDILGDPMIKECWQFNYLFDVDYVMGQFD 124
Query: 212 VLAK-IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI 263
K + + +IHG E+ + + KR A ++ P P FGTHHSK M+LI
Sbjct: 125 RDVKDLIQLKIIHGSWKKEAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILI 182
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDY 315
+ ++I+HTAN+I DW N +Q +W Q+ + + CG F+ DL+ Y
Sbjct: 183 RHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLSQSQVGDACGVFGSSARFKRDLLAY 242
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-----HTGSSLKK 368
L A+ N IN ++++F + LIASVP +
Sbjct: 243 LE------------AYNNNTINTLIRQLQQYDFGAVKAVLIASVPTRLPVKEFDSNRRTL 290
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSLDE--KWMAE--LSSSMSSGFSED 421
WG L+ + ++ ++ ++ Q SS+ +L + KW+ E SS S
Sbjct: 291 WGWPALKDAIGSIPIDRSSSQAQNPHIIIQVSSIATLGQTDKWLKETFFSSLYSQPEVNQ 350
Query: 422 KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW----- 472
+ I++PT +++R SL+GY +G +I SP + +L++Y W
Sbjct: 351 SRSTSKAKFSIIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQLAYLRRYLCHWAGDAE 410
Query: 473 ------------KASHTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQK 517
+ GR RA PHIK++ R++ + W ++TSANLS AWGA
Sbjct: 411 GPKNADPTTTSDRVREAGRRRAAPHIKSYIRFSDSDMDSIDWAMITSANLSTQAWGAGAN 470
Query: 518 NNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK--------SGSTETSQIQKTK 569
+ ++ I S+E+G+LI P R C+ + + + +K + S + Q +
Sbjct: 471 THGEVRICSWEIGILIWPDLFREENIEECSDSSLTNHVKMIPCFKRNTPSEKPLQTSEND 530
Query: 570 LVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ +T H DA + V L +PY+LP Y+ ++VPW + + D GQ W
Sbjct: 531 SIKVTLH--LDATNMTR-VGLRMPYDLPLIPYTPQEVPWCATSVHREPDWMGQTW 582
>gi|302894143|ref|XP_003045952.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256726879|gb|EEU40239.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1086
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 134/428 (31%), Positives = 204/428 (47%), Gaps = 73/428 (17%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPAC-PVLAKI 216
+ S ++L +Q L N VS+RD++ GD ++A N++ DI +L+ A P +
Sbjct: 38 IKSPWQLTWIQDLSEEDNRDAVSLRDLL-GDPLIAECWEFNFLHDIHFLMDAFDPDTRHL 96
Query: 217 PHVLVIHG------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
V V+HG ES +E N +H P+P FGTHHSK M+L + +
Sbjct: 97 VKVHVVHGFWKREDESRIAIEQAAAEF-NNVQIHIAPMPEMFGTHHSKMMILFRHDDTAQ 155
Query: 270 IIVHTANLIHVDWNNKSQGLWM------------------QDFPLKDQNNLSEECGFEND 311
+I+HTAN+I DW N + G+W +D P+ + F+ D
Sbjct: 156 VIIHTANMISKDWTNMTNGIWKSPLLPKMTVAPTHTTSSPEDHPVGSGDR------FKID 209
Query: 312 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--W 369
L++YL + + K ++FSS L+ASVPG H L + W
Sbjct: 210 LLNYLRAYDRRKITC--------KALTDELVHYDFSSIKAALVASVPGRHNIRDLSETSW 261
Query: 370 GHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGI 427
G L+ LQ+ E ++S +V Q SS+ +L E W L ++ S K P +
Sbjct: 262 GWAALKRCLQQVPCEDQ-EQSEIVVQISSIATLGAKEDW---LKKTLFEPLSRCKNP-SL 316
Query: 428 GEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWK-------- 473
G+P +V+PT +++R SL+GYA+G +I S Q+ ++L+ + W
Sbjct: 317 GKPKFKVVFPTADEIRRSLDGYASGGSIHTKIQSAQQAKQLEYLRPIFHHWANDSPSGAK 376
Query: 474 ------ASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSY 527
GR RA PHIKT+ R N + W LLTSANLSK AWG + ++ I S+
Sbjct: 377 LPEGATVKDGGRKRAAPHIKTYIRSNKSSIDWALLTSANLSKQAWGEAARPTGEMRIASW 436
Query: 528 ELGVLILP 535
E+GVL+ P
Sbjct: 437 EIGVLVWP 444
>gi|156549662|ref|XP_001604678.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like [Nasonia
vitripennis]
Length = 573
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 115/378 (30%), Positives = 190/378 (50%), Gaps = 51/378 (13%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G++I ++ N+M ++ WL+ + ++P + V++G +W+
Sbjct: 113 IIDYTTGELIDSLHINFMAEMLWLINEYMLAVQVPKMTVLYG---------------SWL 157
Query: 243 ----LHKPPLPISF--------GTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGL 289
+++ P I F G HHSK + Y +RI++ ++N+ DW +++QGL
Sbjct: 158 DPDMMYEIPFDIEFVNVEMSEFGCHHSKISIFKYTGDKIRIMISSSNIYAEDWQSRTQGL 217
Query: 290 WMQDF-PL--KDQNNLSEE--CGFENDLIDYLSTLKWPE-FSANLPAHGNFKINPSFFKK 343
W+ F PL +D N E F+ D + YLS PE F + H +
Sbjct: 218 WISPFLPLLPEDANESDGESPTNFKRDFLQYLSMYNQPEVFGWSALIH-----------R 266
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ-ECTFEKGFKKSPLVYQFSSLGSL 402
+ S+ V IASVPG+H GSSL WGH KL +L + +K P++ Q SS+G
Sbjct: 267 ADCSAINVFFIASVPGHHDGSSLDTWGHRKLAALLSAHASLPSDAQKWPVIAQSSSVGVF 326
Query: 403 DEKWMAELSSSMSSGFSE--DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN- 459
+ + LSSS+ S+ DK + E ++P+ + S + + + ++N
Sbjct: 327 GPDYQSWLSSSIVRTMSKEKDKKIIIFPEFKFIYPSKNNYNQSYDNQIGSSCLMYNEQNY 386
Query: 460 VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQK 517
+ + +LK Y +WK+ GR++AMPH+K + R + ++AWF LTSANLSK A G + +
Sbjct: 387 LKQQWLKDYLYQWKSDKIGRTQAMPHLKCYTRISPDESEMAWFFLTSANLSKGAMGKMLR 446
Query: 518 NNSQLMIRSYELGVLILP 535
N + + +YE GVL LP
Sbjct: 447 NCTVQTLCNYEAGVLFLP 464
>gi|347837882|emb|CCD52454.1| hypothetical protein [Botryotinia fuckeliana]
Length = 639
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 158/561 (28%), Positives = 241/561 (42%), Gaps = 109/561 (19%)
Query: 154 HVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPACP 211
H + + S F+L ++ LP +N VS++D++ GD +++ NY+ D+D+L+
Sbjct: 96 HTKQRVVKSPFQLTTIRDLPDSSNVDTVSLKDIL-GDPLISECWEFNYLHDLDFLMEQFD 154
Query: 212 V-LAKIPHVLVIHG----ESDGTLEHMKR-NKPANWILHKPPLPISFGTHHSKAMLLIYP 265
+ + V VIHG E L M++ ++ +N L +P FGTHHSK ML+I+
Sbjct: 155 EDVRNLVRVNVIHGFWKREDHSRLNLMEQASRYSNIKLLTAYMPEMFGTHHSK-MLIIFR 213
Query: 266 RG--VRIIVHTANLIHVDWNNKSQGLWM---------QDFPLKDQNNLSEECGFENDLID 314
+II+HTAN+I DW N +Q LW + L + + + F+ D ++
Sbjct: 214 HDCTAQIIIHTANMIPFDWTNMTQALWKSPHLPLLNPKKPTLVEASRIGSGSKFKLDFLN 273
Query: 315 YLSTLKWPEFSANLPAHGNFKINPSFFK---KFNFSSAAVRLIASVPGYHTGSSLKK--- 368
YL I S + K++FS LIASVPG G+ L
Sbjct: 274 YLRAYDTKRI-----------ICKSLIEQLLKYDFSEIKAALIASVPGKQ-GTELSPSQT 321
Query: 369 -WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPL 425
WG L L+ + +V Q SS+ SL +KW+ ++S E K+P
Sbjct: 322 GWGWAGLTNALKSVPSHHNTQPE-IVIQVSSIASLGPTDKWLTHFFKALS----ESKSPR 376
Query: 426 GIGEPL-IVWPTVEDVRCSLEGYAAGNAIPS----PQKNVDKDFLKKYWAKW-------- 472
G I++PT ++VR S+ GYA+GNAI + P + +LK W
Sbjct: 377 KTGSKFKIIFPTADEVRRSINGYASGNAIHTKILTPAQGKQLAYLKPMLCHWAGDGAQHS 436
Query: 473 ----------------------KASHTGRSRAMPHIKTFARYNGQK---------LAWFL 501
K R RA PHIKT+ R++ + W L
Sbjct: 437 SSSSLSSNPPSESSQSFTSPELKTQEAYRRRAAPHIKTYIRFSSDSTSSSSSQKSIDWML 496
Query: 502 LTSANLSKAAWGALQKNNSQLMIRSYELGVLILP---SAKRHGCGFS---CTSNIVPS-- 553
+TSANLSK AWG + ++ I SYE+GVL+ P K++G C N PS
Sbjct: 497 VTSANLSKQAWGESINSADKVRICSYEIGVLVWPDLWEEKQNGKNVKMVPCFGNDTPSIP 556
Query: 554 ------EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE----VVYLPVPYELPPQRYSS 603
EI + ++ L D E +V +PY+LP Y
Sbjct: 557 FVSPSLEIVGQKEIRVEGEEGHLKRKRCDAREDEKRQEESHTIIVGARMPYDLPLVSYGK 616
Query: 604 EDVPWSWDKRYTKKDVYGQVW 624
+D+PW Y++ D G+ W
Sbjct: 617 DDIPWCASASYSEPDWMGKTW 637
>gi|225682330|gb|EEH20614.1| tyrosyl-DNA phosphodiesterase [Paracoccidioides brasiliensis Pb03]
Length = 628
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 152/572 (26%), Positives = 244/572 (42%), Gaps = 122/572 (21%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK- 215
+PS +L RV+ PA + NT V +RD++ +I NY+ D+D+L+ +
Sbjct: 69 IPSPIQLTRVRDFPASSENNTDTVKLRDILGDPLIKECWQFNYIFDVDFLMSQFDQDVRG 128
Query: 216 IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ V +IHG ES + E +R ++ +P +FGTHHSK M++I +
Sbjct: 129 LVKVKIIHGSWKRESPNRIHIDEGCRRYPNVEPMVAY--MPEAFGTHHSKMMVIIKHDDQ 186
Query: 268 VRIIVHTANLIHVDWNNKSQGLW-----------MQDFPLKDQNNLSEECGFENDLIDYL 316
+I++HTAN+I DW N Q +W ++ P N++ F+ DL+ Y
Sbjct: 187 AQIVIHTANMIAGDWANMCQAVWRSPMLPMLSNKRREHPSATPNDVGTGSRFKRDLLAYF 246
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGH 371
T H +K++FS+ LIAS P T L WG
Sbjct: 247 ETY----------GHNKTGALIEQLEKYDFSAIRAALIASAPSRQTIDELDSKRRTLWGW 296
Query: 372 MKLRTVLQECTFEKGFK---KSP-LVYQFSSLGSLDE--KWMAEL--------SSSMSSG 417
L+ +++ F+KG K K P +V Q SS+ +L + KW+ E S+ S
Sbjct: 297 PALKDTIRQIPFKKGTKSTEKQPQIVIQISSVATLGQTDKWLKETLFNSLSPPSARSSEL 356
Query: 418 F-SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW 472
F +E +P I++PT +++R SL GY +G +I S + +L+ Y +W
Sbjct: 357 FKTESNSPAKFS---IIFPTPDEIRRSLNGYMSGGSIHMKLQSAAQQKQLQYLRPYLCRW 413
Query: 473 ---------------------------------------KASH-----TGRSRAMPHIKT 488
K +H GR RA PHIKT
Sbjct: 414 AGDANDDGGVKSAGGPATSKRKRLEGNEVSESVQDGASLKKAHRPIREAGRRRAAPHIKT 473
Query: 489 FARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 545
+ R++ + W ++TSANLS AWGA ++ I SYE+GVL+ P
Sbjct: 474 YIRFSDTDMTTIDWAMVTSANLSLQAWGAAANAKKEIRICSYEIGVLVWPDLFVDEEIDD 533
Query: 546 CTSNIVPSEIK-------SGSTETSQIQKTKLVTLTWHGSSDAG------ASSEVVYLPV 592
++ + K SG T ++ +V +A +++ +V +
Sbjct: 534 SDEPLMKEKGKDNSRGEISGHKNTKDVKTAVMVPCFKRDMPEAAENEARSSNTTLVGFRM 593
Query: 593 PYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
PY+LP Y+++D PW Y++ D GQ W
Sbjct: 594 PYDLPLHSYAAKDQPWCATATYSEPDWLGQTW 625
>gi|258577075|ref|XP_002542719.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902985|gb|EEP77386.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 669
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 152/533 (28%), Positives = 234/533 (43%), Gaps = 104/533 (19%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG--ESDG---- 228
N + +RD++ +I N++ DID+L+ P + + V V+HG + D
Sbjct: 153 NGDTIKLRDILGDPLIKECWQFNFLFDIDFLMDQFDPDVKNLVKVKVVHGSWKKDAPNRI 212
Query: 229 -TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKS 286
E R + I+ P P FGTHHSK M+LI + ++++HTAN+I DW N
Sbjct: 213 RVDEQCSRYQNVEPIIAYMPEP--FGTHHSKMMVLIRHDDCAQVVIHTANMIPGDWANMC 270
Query: 287 QGLWMQD-FPLKDQNNLSE-----ECG----FENDLIDYLSTLKWPEFSANLPAHGNFKI 336
Q +W PL NN E E G F+ DL+ YL A+G K
Sbjct: 271 QAVWKSPLLPLLSPNNDREPSITGEIGSGPRFKRDLLAYLE------------AYGRKKT 318
Query: 337 NP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEK---- 385
P K + F LIASVP SL WG L+ VL+ K
Sbjct: 319 GPLVEQLKNYGFDGIRAALIASVPSRQRFPSLDSRKETIWGWPALQDVLRRIPIHKQQPL 378
Query: 386 GFKKSPLVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC 442
K+S +V Q SS+ SL +KW+ E +S+ + D P + I++PT +++R
Sbjct: 379 QSKRSRIVIQISSIASLGQSDKWLKETFFASLYPHSAADGAP----QLSIIFPTPDEIRR 434
Query: 443 SLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWKAS----------------------- 475
SL GY +G + I S + D+++ Y W
Sbjct: 435 SLNGYGSGGSIHMKIQSSAQQKQLDYMRPYLCHWAGDSENNQTPVSATDVLTHDSAIDRY 494
Query: 476 --------HTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMI 524
GR RA PHIKT+ R++ + + W ++TSANLS AWGA ++ I
Sbjct: 495 PPKATPVREAGRRRAAPHIKTYIRFSDEDMRTIDWAMVTSANLSTQAWGAAINAKQEVRI 554
Query: 525 RSYELGVLILP------SAKRHGCGF-------SCTSNIVPSEIKSGSTETSQIQKTKLV 571
S+E+GVL+ P S +R+ G S + ++P + S S++++ ++
Sbjct: 555 CSWEIGVLVWPDLFCNGSERRNESGEENKDKAKSDYARMIPC-FRRDSPCLSEVERYEIE 613
Query: 572 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ + + G S +V +PY+LP + YS DVPW + + D GQ W
Sbjct: 614 ETSKKDADNTGVLSTLVGFRMPYDLPLKPYSPRDVPWCATASHKEPDWLGQTW 666
>gi|296817701|ref|XP_002849187.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma otae CBS 113480]
gi|238839640|gb|EEQ29302.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma otae CBS 113480]
Length = 606
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 140/530 (26%), Positives = 241/530 (45%), Gaps = 79/530 (14%)
Query: 160 LPSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK- 215
+PS +L V+ +P N C+ +RD++ +I N++ D+D+++ K
Sbjct: 87 IPSPIQLTHVRDIPDSTGYNKDCIRLRDILGDPMIKECWQFNFLFDVDYIMGQFDRDVKD 146
Query: 216 IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ + ++HG E+ + + KR I+ +P FGTHHSK M+L+ +
Sbjct: 147 LVQLKIVHGSWKKEAPNKIAIDDACKRYPNVEAIVAY--MPELFGTHHSKMMVLVRHDDL 204
Query: 268 VRIIVHTANLIHVDWNNKSQGLW------MQDFPLKD-QNNLSEECGFENDLIDYLSTLK 320
+II+HTAN+I DW N +Q +W + F + D + ++ F+ DL+ YL+
Sbjct: 205 TQIIIHTANMIPRDWGNMTQAVWRSPLLPLSQFKMADSRGDIGSGARFKRDLLAYLN--- 261
Query: 321 WPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMK 373
A+ N KI+ ++++F LI+SVP L WG
Sbjct: 262 ---------AYNNKKIDMLIDQLQRYDFGEVKAALISSVPSRQPARELDSGKRTLWGWPA 312
Query: 374 LRTVLQECTFEKGFKKS---PLVYQFSSLGSL--DEKWMAE--LSSSMSSGFSEDKTPLG 426
L+ + + +V Q SS+ +L +KW+ E SS + D + +
Sbjct: 313 LKDAISSIPIRGNSSQRLEPQVVVQVSSIATLGQTDKWLKETFFSSLCPQSRASDTSNIS 372
Query: 427 IGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------- 475
+ I++PT +++R SL+GYA+G +I S + +L++Y +W
Sbjct: 373 STKFSIIFPTPDEIRRSLDGYASGGSIHMKIQSAAQQKQLQYLRRYLCRWAGDAAGQRDT 432
Query: 476 --------------HTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKN 518
GR RA PHIKT+ R++ + W ++TSANLS AWGA
Sbjct: 433 NPATQPDKGSSIVREAGRKRAAPHIKTYIRFSDSGMTSIDWAMVTSANLSTQAWGAGANT 492
Query: 519 NSQLMIRSYELGVLILPS--AKRHGCGFSCTSNIVPSE-IKSGSTETSQIQKTKLVTLTW 575
++ I S+E+GVL+ P +R +S I P + I +T + + +
Sbjct: 493 QGEVRICSWEIGVLVWPDLFRERMTSKDKDSSTIHPVKMIPCFKCDTPSEKSLLCESDST 552
Query: 576 HGSSDAGASSEV-VYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ +S +GA++ + L +PY LP Y+ +DVPW + + D GQ W
Sbjct: 553 NSTSHSGATNMTRIGLRMPYNLPLVPYTHQDVPWCATAVHREPDWLGQTW 602
>gi|392867268|gb|EAS29510.2| tyrosyl-DNA phosphodiesterase [Coccidioides immitis RS]
Length = 616
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 146/533 (27%), Positives = 232/533 (43%), Gaps = 107/533 (20%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL 230
N V+++D++ +I NY+ DID+L+ P + + + VIHG +S +
Sbjct: 103 NCDTVTLQDILGDPLIKECWQFNYLFDIDFLMKQFDPDVKNLIKIRVIHGSWKKDSPNRI 162
Query: 231 ---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKS 286
E R + I+ P P FGTHHSK M+LI + +II+HTAN+I DW N
Sbjct: 163 YIDEACARYQNVEPIIAYMPEP--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMC 220
Query: 287 QGLWM----------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
QG+W +D+ + F+ D++ YL A+G K
Sbjct: 221 QGVWRSPLLPLLPLDRDYDQSISGIIGSGRRFKRDILAYLD------------AYGRKKT 268
Query: 337 NP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGF-- 387
P KK++F LIASVP +L WG ++ VL++ K
Sbjct: 269 GPLVEQLKKYDFDEVRAALIASVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSC 328
Query: 388 --KKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 443
KK +V Q SS+ SL +KW+ + + F+ P I++PT +++R S
Sbjct: 329 EPKKPRIVIQISSIASLGQTDKWLKD------TFFNALCPPSAAARFSIIFPTPDEIRRS 382
Query: 444 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------------------------ 475
L GY +G +I S + D+++ Y W
Sbjct: 383 LNGYRSGGSIHMKLQSAAQQKQFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDTSTP 442
Query: 476 ------HTGRSRAMPHIKTFARYNG----QKLAWFLLTSANLSKAAWGALQKNNSQLMIR 525
GR RA PHIKT+ R++ + + W ++TSANLS AWGA N ++ +
Sbjct: 443 NNTFVREAGRRRAAPHIKTYIRFSDAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVC 502
Query: 526 SYELGVLILP--------SAKRHGCGFSCT------SNIVPSEIKSGSTETSQIQKTKLV 571
S+E+GVL+ P +A R S + ++P + + S++++ +L
Sbjct: 503 SWEIGVLVWPDLFLNDPQTADRDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELE 561
Query: 572 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ G + A +V +PY LP + YSS D+PW +T+ D GQ W
Sbjct: 562 ESS-RGDDKSKAWRTLVGFRMPYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 613
>gi|302500932|ref|XP_003012459.1| hypothetical protein ARB_01418 [Arthroderma benhamiae CBS 112371]
gi|291176017|gb|EFE31819.1| hypothetical protein ARB_01418 [Arthroderma benhamiae CBS 112371]
Length = 587
Score = 158 bits (400), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 141/535 (26%), Positives = 238/535 (44%), Gaps = 83/535 (15%)
Query: 156 SRDKL-PSTFRLLRVQGLP--AWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP 211
SR K+ PS +L ++ + N C+ +RD++ +I NY+ D+D+++
Sbjct: 66 SRQKIIPSPIQLTHIRDISDSTGYNEGCIKLRDILGDPMIKECWQFNYLFDVDYIMGQFD 125
Query: 212 VLAK-IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI 263
K + + +IHG E+ + + KR A ++ P P FGTHHSK M+LI
Sbjct: 126 RDVKDLIQLKIIHGSWKKEAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILI 183
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDY 315
+ ++I+HTAN+I DW N +Q +W Q+ + + CG F+ DL+ Y
Sbjct: 184 RHDNLAQVIIHTANMIPRDWGNMTQAVWRSPLLPLSQSQVDDTCGVFGSSARFKRDLLAY 243
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-----HTGSSLKK 368
L A+ N IN ++++F + LIASVP +
Sbjct: 244 LE------------AYNNKTINILIRQLRRYDFGAVKALLIASVPTRLPVKEFDSNRRTL 291
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSLDE--KWMAE-----LSSSMSSGF 418
WG L+ + ++ ++ ++ Q SS+ +L + KW+ E L
Sbjct: 292 WGWPALKDAIGSIPIDRSSSQAQNPHIIVQVSSIATLGQTDKWLRETFLRSLCPQPEVNQ 351
Query: 419 SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW-- 472
S + + I++PT +++R SL+GY +G +I SP + +L+ Y W
Sbjct: 352 SRSTSNVKFS---IIFPTPDEIRRSLDGYGSGGSIHMKIQSPPQQKQLAYLRHYLCHWAG 408
Query: 473 ---------------KASHTGRSRAMPHIKTFARYNGQKL---AWFLLTSANLSKAAWGA 514
+ GR RA PHIKT+ R++ + W ++TSANLS AWGA
Sbjct: 409 DAEDPKNSDPATKSDRVREAGRRRAAPHIKTYIRFSDSDMNSIDWAMITSANLSTQAWGA 468
Query: 515 LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLT 574
++ I S+E+GVLI P R C+ + + + +K + K + +
Sbjct: 469 GANTQGEVRICSWEVGVLIWPDLFREENIEECSDSSLTNYVKMIPCFKRNVPSEKPLQTS 528
Query: 575 WHGSSDAGASSEV-----VYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ S+ S+ V L +PY+LP Y+ ++VPW + + D GQ W
Sbjct: 529 ENDSTKVTLHSDATNMTRVGLRMPYDLPLIPYTPQEVPWCATAVHREPDWMGQTW 583
>gi|119178141|ref|XP_001240773.1| hypothetical protein CIMG_07936 [Coccidioides immitis RS]
Length = 531
Score = 158 bits (400), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 146/533 (27%), Positives = 232/533 (43%), Gaps = 107/533 (20%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL 230
N V+++D++ +I NY+ DID+L+ P + + + VIHG +S +
Sbjct: 18 NCDTVTLQDILGDPLIKECWQFNYLFDIDFLMKQFDPDVKNLIKIRVIHGSWKKDSPNRI 77
Query: 231 ---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKS 286
E R + I+ P P FGTHHSK M+LI + +II+HTAN+I DW N
Sbjct: 78 YIDEACARYQNVEPIIAYMPEP--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMC 135
Query: 287 QGLWM----------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
QG+W +D+ + F+ D++ YL A+G K
Sbjct: 136 QGVWRSPLLPLLPLDRDYDQSISGIIGSGRRFKRDILAYLD------------AYGRKKT 183
Query: 337 NPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGF-- 387
P KK++F LIASVP +L WG ++ VL++ K
Sbjct: 184 GPLVEQLKKYDFDEVRAALIASVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSC 243
Query: 388 --KKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 443
KK +V Q SS+ SL +KW+ + + F+ P I++PT +++R S
Sbjct: 244 EPKKPRIVIQISSIASLGQTDKWLKD------TFFNALCPPSAAARFSIIFPTPDEIRRS 297
Query: 444 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------------------------ 475
L GY +G +I S + D+++ Y W
Sbjct: 298 LNGYRSGGSIHMKLQSAAQQKQFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDTSTP 357
Query: 476 ------HTGRSRAMPHIKTFARYNG----QKLAWFLLTSANLSKAAWGALQKNNSQLMIR 525
GR RA PHIKT+ R++ + + W ++TSANLS AWGA N ++ +
Sbjct: 358 NNTFVREAGRRRAAPHIKTYIRFSDAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVC 417
Query: 526 SYELGVLILP--------SAKRHGCGFSCT------SNIVPSEIKSGSTETSQIQKTKLV 571
S+E+GVL+ P +A R S + ++P + + S++++ +L
Sbjct: 418 SWEIGVLVWPDLFLNDPQTADRDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELE 476
Query: 572 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ G + A +V +PY LP + YSS D+PW +T+ D GQ W
Sbjct: 477 ESS-RGDDKSKAWRTLVGFRMPYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 528
>gi|7648685|gb|AAF65624.1|AF182003_1 tyrosyl-DNA phosphodiesterase protein [Homo sapiens]
Length = 189
Score = 158 bits (399), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 96/210 (45%), Positives = 123/210 (58%), Gaps = 35/210 (16%)
Query: 420 EDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHT 477
E KTP PL +++P+VE+VR SLEGY AG ++P S Q +++L Y+ KW A +
Sbjct: 7 ESKTPGKSSVPLYLIYPSVENVRTSLEGYPAGGSLPYSIQTAEKQNWLHSYFHKWSAETS 66
Query: 478 GRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
GRS AMPHIKT+ R + K+AWF +TSANLSKAAWGAL+KN +QLMIRSYELGVL LP
Sbjct: 67 GRSNAMPHIKTYMRPSPDFSKIAWFRVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLP 126
Query: 536 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 595
SA F S V + +GS E + PVPY+
Sbjct: 127 SA------FGLDSFKVKQKFFAGSQE------------------------PMATFPVPYD 156
Query: 596 LPPQRYSSEDVPWSWDKRYTKK-DVYGQVW 624
LPP+ Y S+D PW W+ Y K D +G +W
Sbjct: 157 LPPELYGSKDRPWIWNIPYVKAPDTHGNMW 186
>gi|303310201|ref|XP_003065113.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240104773|gb|EER22968.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 616
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 143/536 (26%), Positives = 230/536 (42%), Gaps = 113/536 (21%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHGE--------- 225
N V+++D++ +I NY+ DID+L+ P + + + V+HG
Sbjct: 103 NCDTVTLQDILGDPLIKECWQFNYLFDIDFLMKQFDPDVKNLIRIRVVHGSWKKDSANRI 162
Query: 226 -SDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWN 283
D H + +P I+ P P FGTHHSK M+LI + +II+HTAN+I DW
Sbjct: 163 YIDEACAHYQNVEP---IIAYMPEP--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWA 217
Query: 284 NKSQGLWM----------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN 333
N QG+W +D+ + F+ D++ YL A+G
Sbjct: 218 NMCQGVWRSPLLPLLPLDRDYDQSISGIIGSGRRFKRDILAYLD------------AYGR 265
Query: 334 FKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKG 386
K P KK++F LIASVP +L WG ++ VL++ K
Sbjct: 266 KKTGPLVEQLKKYDFDEVRAALIASVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQ 325
Query: 387 FKKSP----LVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDV 440
P +V Q SS+ SL +KW+ + + F+ P +++PT +++
Sbjct: 326 LSCEPEKPRIVIQISSIASLGQTDKWLKD------TFFNALCPPSAAARFSVIFPTPDEI 379
Query: 441 RCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS--------------------- 475
R SL GY +G +I S + D+++ Y W
Sbjct: 380 RRSLNGYRSGGSIHMKLQSAAQQKQFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDE 439
Query: 476 ---------HTGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQL 522
GR RA PHIKT+ R++ + + W ++TSANLS AWGA N ++
Sbjct: 440 STPNNTFVREAGRRRAAPHIKTYIRFSDAEDMCTIDWAMVTSANLSTQAWGAAINANQEV 499
Query: 523 MIRSYELGVLILP--------SAKRHGCGFSCT------SNIVPSEIKSGSTETSQIQKT 568
+ S+E+GVL+ P +A R S + ++P + + S++++
Sbjct: 500 RVCSWEIGVLVWPDLFLNDPQTADRDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERL 558
Query: 569 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+L + G + A +V +PY LP + YSS D+PW +T+ D GQ W
Sbjct: 559 ELEEPS-RGDDKSKAWKTLVGFRMPYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 613
>gi|242787594|ref|XP_002481044.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces stipitatus
ATCC 10500]
gi|218721191|gb|EED20610.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces stipitatus
ATCC 10500]
Length = 577
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 152/568 (26%), Positives = 254/568 (44%), Gaps = 102/568 (17%)
Query: 127 LSSKKMR---QQDEQDNENGKNSEEALCNFHVSRDK-LPSTFRLLRVQGLPAWANTSCVS 182
L+S++ R Q +Q ++ K + E + R + +PS F+L ++ LP+ N V
Sbjct: 40 LTSRERRPPENQHDQHTDHIKRNNETNADIIEGRPRVIPSPFQLTHIRDLPSDKNVDTVQ 99
Query: 183 IRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-IPHVLVIHG----ESDGTL---EHM 233
+ D++ +I NY D+D+++ K + V ++HG +S L E
Sbjct: 100 LHDILGDPMIRECWQFNYCFDVDFVMSQFDQDVKDLVQVKIVHGSWKQDSPNRLRIDEAC 159
Query: 234 KRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQ 292
R I+ P P FGTHHSK M+L+ + ++I+HTAN++ DW N SQ LW
Sbjct: 160 ARYPNVEPIVAYMPEP--FGTHHSKMMILLRHDDLAQVIIHTANMLAGDWTNMSQALWRS 217
Query: 293 DF-PLKDQ--NNLSEECG-------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSF-- 340
PL N +EE F+ DL+ YL EF +G K
Sbjct: 218 PLLPLSSTPYNPATEEAAVFGTGARFKRDLLAYL------EF------YGRRKTGSLVDQ 265
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKG--FKKSPLV 393
+KF+F + L+ASVP S + WG L+ L++ + + +V
Sbjct: 266 LRKFDFYAIRAVLVASVPSKERLSRMNSSQSTLWGWPALKDALRQISLSDNEHIEDPHVV 325
Query: 394 YQFSSLGSL--DEKWMAEL--SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 449
Q SS+ SL +KW+ ++ S S + + + IV+PT +++R SL GY +
Sbjct: 326 IQVSSIASLGQTDKWLKDVLFDSLCPSSILPNASKRCNPKFSIVFPTPDEIRRSLNGYGS 385
Query: 450 GNAIPSPQKNVDKD----FLKKYWAKW----------------------KASHTGRSRAM 483
G +I ++V + +++ Y W +++ GR RA
Sbjct: 386 GGSIHMKLQSVAQQKQLQYMRPYLCHWAGDQEQTPVRISRTNAEVPSNIQSTDAGRRRAA 445
Query: 484 PHIKTFARYNGQ----KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKR 539
PHIKT+ R++ + + W ++TSANLS AWGA +N ++ I S+E+GVL+ P
Sbjct: 446 PHIKTYIRFSDKTKMDSIDWVMITSANLSTQAWGAAPNSNGEVRICSWEIGVLVWP---- 501
Query: 540 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE---VVYLPVPYEL 596
++ G + ++ K+V + +++ +V +PY+L
Sbjct: 502 --------------QLIVGDSPEPGAERPKMVPCFQKDRPELPNNNDITPIVGFRMPYDL 547
Query: 597 PPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
P RY +DVPW + + D GQ W
Sbjct: 548 PLARYGVQDVPWCATINHPEPDWLGQSW 575
>gi|320034009|gb|EFW15955.1| tyrosyl-DNA phosphodiesterase [Coccidioides posadasii str.
Silveira]
Length = 559
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 144/533 (27%), Positives = 231/533 (43%), Gaps = 107/533 (20%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG----ESDGTL 230
N V+++D++ +I NY+ DID+L+ P + + + V+HG +S +
Sbjct: 46 NCDTVTLQDILGDPLIKECWQFNYLFDIDFLMKQFDPDVKNLIRIRVVHGSWKKDSANRI 105
Query: 231 ---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKS 286
E R + I+ P P FGTHHSK M+LI + +II+HTAN+I DW N
Sbjct: 106 YIDEACARYQNVEPIIAYMPEP--FGTHHSKMMILIRHDDCAQIIIHTANMIPGDWANMC 163
Query: 287 QGLWM----------QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
QG+W +D+ + F+ D++ YL A+G K
Sbjct: 164 QGVWRSPLLPLLPLDRDYDQSISGIIGSGRRFKRDILAYLD------------AYGRKKT 211
Query: 337 NPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFEKGFKK 389
P KK++F LIASVP +L WG ++ VL++ K
Sbjct: 212 GPLVEQLKKYDFDEVRAALIASVPSRQEIPNLDSQKKTIWGWPAVQDVLRQIPTHKQLSC 271
Query: 390 SP----LVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCS 443
P +V Q SS+ SL +KW+ + + F+ P I++PT +++R S
Sbjct: 272 EPEKPRIVIQISSIASLGQTDKWLKD------TFFNALCPPSAAARFSIIFPTPDEIRRS 325
Query: 444 LEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------------------------ 475
L GY +G +I S + D+++ Y W
Sbjct: 326 LNGYRSGGSIHMKLQSAAQQKQFDYMRPYLCHWAGDCENNQNADIEKSVSSTVTLDESTP 385
Query: 476 ------HTGRSRAMPHIKTFARYNG----QKLAWFLLTSANLSKAAWGALQKNNSQLMIR 525
GR RA PHIKT+ R++ + + W ++TSANLS AWGA N ++ +
Sbjct: 386 NNTFVREAGRRRAAPHIKTYIRFSDAEDMRTIDWAMVTSANLSTQAWGAAINANQEVRVC 445
Query: 526 SYELGVLILP--------SAKRHGCGFSCT------SNIVPSEIKSGSTETSQIQKTKLV 571
S+E+GVL+ P +A R S + ++P + + S++++ +L
Sbjct: 446 SWEIGVLVWPDLFLNDPQTADRDDKMLSKAYERGEYAQMIPC-FRQNAPCLSEVERLELE 504
Query: 572 TLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ G + A +V +PY LP + YSS D+PW +T+ D GQ W
Sbjct: 505 EPS-RGDDKSKAWKTLVGFRMPYNLPLKPYSSRDIPWCATATHTEPDWLGQTW 556
>gi|226289717|gb|EEH45201.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 639
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 154/582 (26%), Positives = 244/582 (41%), Gaps = 145/582 (24%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK- 215
+PS +L RV+ PA + NT V +RD++ +I NY+ D+D+L+ +
Sbjct: 69 IPSPIQLTRVRDFPASSENNTDTVKLRDILGDPLIKECWQFNYIFDVDFLMSQFDQDVRG 128
Query: 216 IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ V +IHG ES + E +R ++ +P +FGTHHSK M++I +
Sbjct: 129 LVKVKIIHGSWKRESPNRIHIDEGCRRYPNVEPMVAY--MPEAFGTHHSKMMVIIKHDDQ 186
Query: 268 VRIIVHTANLIHVDWNNKSQGLW-----------MQDFPLKDQNNLSEECGFENDLIDYL 316
+I++HTAN+I DW N Q +W ++ P N++ F+ DL+ Y
Sbjct: 187 AQIVIHTANMIAGDWANMCQAVWRSPMLPMLSNKRREHPSATPNDVGTGSRFKRDLLAYF 246
Query: 317 STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGH 371
T H +K++FS+ LIASVP T L WG
Sbjct: 247 ETY----------GHNKTGALIEQLEKYDFSAIRAALIASVPSRQTIDELDSKRRTLWGW 296
Query: 372 MKLRTVLQECTFEKGFK---KSP-LVYQFSSLGSL--DEKWMAEL--------SSSMSSG 417
L+ +++ F+KG K K P +V Q SS+ +L +KW+ E S+ S
Sbjct: 297 PALKDTIRQIPFKKGTKSTEKQPQIVIQISSVATLGQTDKWLKETLFNSLSPPSARSSEL 356
Query: 418 F-SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW 472
F +E +P I++PT +++R SL GY +G +I S + +L+ Y +W
Sbjct: 357 FKTESNSPAKFS---IIFPTPDEIRRSLNGYMSGGSIHMKLQSAAQQKQLQYLQPYLCRW 413
Query: 473 --------------------------------------KASH-----TGRSRAMPHIKTF 489
K +H GR RA PHIKT+
Sbjct: 414 AGDANDDGVKSAGGPATSKRKRLEGNEVSESVQDGASLKKAHRPIREAGRRRAAPHIKTY 473
Query: 490 ARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSC 546
R++ + W ++TSANLS AWGA ++ I SYE+GVL+ P
Sbjct: 474 VRFSDTDMTTIDWAMVTSANLSLQAWGAAANVKKEIRICSYEIGVLVWPRF--------- 524
Query: 547 TSNIVPSEIK-------------------SGSTETSQIQKTKLVTLTWHGSSDAG----- 582
IV EI SG T ++ +V +A
Sbjct: 525 ---IVDEEIDDSDEPLMKEKGKDNSRGEISGHKNTKDVKTAVMVPCFKRDMPEAAENEAR 581
Query: 583 -ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 623
+++ +V +PY+LP Y+++D PW Y++ D Y +
Sbjct: 582 SSNTTLVGFRMPYDLPLHSYAAKDQPWCATATYSEPDCYADM 623
>gi|195177151|ref|XP_002028871.1| GL22360 [Drosophila persimilis]
gi|194104354|gb|EDW26397.1| GL22360 [Drosophila persimilis]
Length = 946
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/337 (35%), Positives = 177/337 (52%), Gaps = 38/337 (11%)
Query: 197 SNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP-ANWILHKPPLPISF 252
S +MVDI WLL +L K +LV++G+ L + + KP I K P P F
Sbjct: 186 SIFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQVTAIGVKMPTP--F 241
Query: 253 GTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL-----KDQNNLSEE- 305
T H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL +D + + E
Sbjct: 242 ATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLLPALSEDADTAAGES 299
Query: 306 -CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
GF DL+ YL K + + + +K +FS+ V + SVPG H
Sbjct: 300 LTGFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFVGSVPGGHREG 349
Query: 365 SLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDK 422
S++ WGH +L ++L + + P+V Q SS+GSL A + + +D
Sbjct: 350 SVRGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLRKDS 408
Query: 423 TPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHT 477
+P G + +++P+ +V S +G G +P + DK +LK + +WK+S
Sbjct: 409 SPGGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQPWLKAHLQQWKSSDR 468
Query: 478 GRSRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAW 512
RSRAMPHIKT++RYN Q + WF+LTSANLSKAAW
Sbjct: 469 HRSRAMPHIKTYSRYNLTDQSIYWFVLTSANLSKAAW 505
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 142/291 (48%), Gaps = 35/291 (12%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPA---CPVLAKIPHVLVIHGESDGTLEHMKRNKP- 238
I D G+I ++ N+MVDI WLL +L K +LV++G+ L + + KP
Sbjct: 651 ILDESLGEIESSVQINFMVDIGWLLGHYYFAGILDK--PLLVLYGDESPELLGIGKFKPQ 708
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL- 296
I K P P F T H+K MLL Y G +R+++ TANL DW+N++QGLW+ PL
Sbjct: 709 VTAIGVKMPTP--FATSHTKMMLLGYADGSMRVVISTANLYEDDWHNRTQGLWIS--PLL 764
Query: 297 ----KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+D + + E GF DL+ YL K + + + +K +FS+
Sbjct: 765 PALSEDADTAAGESLTGFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAIN 814
Query: 351 VRLIASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMA 408
V + SVPG H S++ WGH +L ++L + + P+V Q SS+GSL A
Sbjct: 815 VFFVGSVPGGHREGSVRGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQA 873
Query: 409 ELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPS 455
+ + +D +P G + +++P+ +V S +G G +PS
Sbjct: 874 WIQQDFVNSLRKDSSPGGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPS 924
>gi|398021965|ref|XP_003864145.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania donovani]
gi|322502379|emb|CBZ37463.1| tyrosyl-DNA phosphodiesterase-like protein [Leishmania donovani]
Length = 682
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 155/617 (25%), Positives = 248/617 (40%), Gaps = 184/617 (29%)
Query: 177 NTSCVSIRDVIQGDIIVA-------ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
+ S + +RD+ + D+ +LS+Y+ D+ WLL P L+ + LV+ GT
Sbjct: 35 SCSLLRLRDLFRCDVADPGECWQHILLSSYVTDLPWLLATVPELSAVTGKLVLLSGEKGT 94
Query: 230 ---------------------------LEHMKRNKPANWILH-----------KPPLPIS 251
+ ++ A LH +PPLP++
Sbjct: 95 ATLRRTTGDSSCPYTAASPLMDRVNPFMAALREQARATSALHTTLSRERLAVLEPPLPVA 154
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEND 311
FGTHH+K L + RG+R+ + TANL+ DW KSQG+++QDFP K S + +
Sbjct: 155 FGTHHTKMALCVNGRGLRVSIFTANLVEQDWCRKSQGIYVQDFPWKTATVRSNDDSADAT 214
Query: 312 LIDYLST------------LKWPEFSANL-----------------PAHGNFKINP---- 338
+++ ++ K EF A+L P P
Sbjct: 215 MVETATSSTSNSNNGSNTFTKGAEFVAHLRHYLMQCGVSLAAACASPTDAASAAGPLGIF 274
Query: 339 --SFFKKFNFSSAAVRLIASVPGYHTGSSL---KKWGHMKLRTVLQECTFEKGFKKSP-- 391
F +FS+AAV L++SVPG + + + G +L VL+ +
Sbjct: 275 ETDFLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRVGLCRLAEVLRRSALTMATSPASVD 334
Query: 392 LVYQFSSLGSLDEKWMAELSSSM----SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
L +Q+SS GSL+ ++ L ++M ++ P G+ + +V+PT E+VR S EG+
Sbjct: 335 LSWQYSSQGSLNPAFLNSLQAAMCGESAAVIESGDAPRGVRDVQVVYPTEEEVRNSWEGW 394
Query: 448 AAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----------------------------- 478
G ++P + +F+ +W +S G
Sbjct: 395 RGGMSLPL-RVQCCHEFVNARLHRWGSSEEGHTAKRAFPRPPKVAAAHASREDAVDVDGV 453
Query: 479 -------------------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGAL-- 515
R A+PHIK++A + + WFLLTSANLS+AAWG+L
Sbjct: 454 DIDGGEETTASLAGSCAADRQFALPHIKSYAAVAPDRSCVRWFLLTSANLSQAAWGSLSR 513
Query: 516 ---QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ--IQKTKL 570
Q+ + Q ++RSYELGVL + + S S + S+I+ + S+ + +T L
Sbjct: 514 KVNQRGSRQQLVRSYELGVLYDSHSAIYPSASSWFSVVAESKIELPNARNSRAMLYETPL 573
Query: 571 VTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS-------------------------E 604
G ++ V L +PY L P Y+S
Sbjct: 574 -----------GVDTQDVCLYIPYNLLCPTPYASTAALRAHRHAPDEGEQAVEEAALDFS 622
Query: 605 DVPWSWDKRYTKKDVYG 621
DVPW D + KD YG
Sbjct: 623 DVPWVLDMPHRGKDAYG 639
>gi|121715578|ref|XP_001275398.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus clavatus NRRL
1]
gi|119403555|gb|EAW13972.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus clavatus NRRL
1]
Length = 576
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 144/524 (27%), Positives = 235/524 (44%), Gaps = 88/524 (16%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS +L ++ L A + N V +RD++ +I N++ D+D+L+ + +
Sbjct: 80 IPSPIQLTHIRDLSAASGNNVDTVRLRDILGDPMIRECWQFNFLFDVDFLMNQFDEDVRR 139
Query: 216 IPHVLVIHG--ESDG-----TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ V V+HG + D E R I+ P P FGTHHSK M+L+ +
Sbjct: 140 LVQVKVVHGSWKKDAPNRIRIEEACPRYPNVEAIVAYMPEP--FGTHHSKMMILLRHDDL 197
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-------FENDLIDYLSTL 319
++++HTAN+I DW N Q +W PL+ +++EE G F+ DL+ YL+
Sbjct: 198 AQVVIHTANMIPGDWANMCQAVWRSPLLPLQKVEHIAEEPGTIGSGARFKRDLLAYLN-- 255
Query: 320 KWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHM 372
+G K P +F+FSS LIASVP +SL WG
Sbjct: 256 ----------EYGAKKTGPLVKQLARFDFSSVRAALIASVPSKQKLASLDLQRKTLWGWP 305
Query: 373 KLRTVLQEC--TFEKGFKKSP--LVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLG 426
LR ++ T E+G + + ++ Q SS+ +L + KW+ ++ + S + + TP
Sbjct: 306 ALRETTRQIPLTREQGSETATPHIITQISSIATLGQTDKWLKDVFFN-SLAPTSNPTPPT 364
Query: 427 IGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW---------- 472
+ IV+PT +++R SL GY +G +I S ++ +++ Y W
Sbjct: 365 KSKYSIVFPTPDEIRRSLNGYGSGGSIHMKLQSTTQHKQLQYMRPYLRHWAGDSSTHSSD 424
Query: 473 --------KASHTGRSRAMPHIKTFARYNG----QKLAWFLLTSANLSKAAWGALQKNNS 520
K GR RA PHIKT+ R+ + W ++TSANLS AWGA +N
Sbjct: 425 GRGETSTTKTQEAGRRRAAPHIKTYIRFADANRMNAIDWAMVTSANLSTQAWGAAVNSNG 484
Query: 521 QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSD 580
++ I S+E+GV++ P ++ + +Q K L
Sbjct: 485 EVRICSWEVGVMVWPQLFAEKAEQQQQQAMMVPCFRRDLPVDCPVQPAKCDVL------- 537
Query: 581 AGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
V L +PY+LP Y +++VPW + + D GQ W
Sbjct: 538 -------VGLRMPYDLPLTSYRADEVPWCATATHMEPDWLGQTW 574
>gi|332029127|gb|EGI69138.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 542
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 131/442 (29%), Positives = 203/442 (45%), Gaps = 72/442 (16%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ + VD+ WL L+ +D T+ + R P +
Sbjct: 141 ILDRSLGEIVNSLHLTFTVDVGWLYL---------QYLLAGQRTDMTILYKYRVCPCHEE 191
Query: 243 LHKPPLPI------SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD--- 293
L K I F +HH+ M+L Y G+R++V TA L DW N++QGLW+
Sbjct: 192 LSKNITIIHVDGQHEFSSHHANIMILQYSNGIRVVVSTAALYSDDWKNRTQGLWISPHLP 251
Query: 294 -FPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P + + E GF+ DL YLS + P + + A + +FS V
Sbjct: 252 YLPESAKPSDGESPTGFKKDLERYLSKYEQPALTQWIRA----------VQMADFSDVNV 301
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG--SLDEKWMAE 409
L+ASVPG H G WG+ KL VL ++ P+V Q S +G L E W+ +
Sbjct: 302 FLVASVPGIHKGYEDDFWGYRKLAHVLSCYVTLPRNEQWPIVAQSSGVGCFGLFENWLED 361
Query: 410 LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKKY 468
+ MS S+D + ++P++ + + S + + +N + +L+ Y
Sbjct: 362 IIWCMSKETSKDSNNYPHFQ--FIYPSIANYKQSFDFRVLSTPLSYNTENHFKQQWLESY 419
Query: 469 WAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRS 526
+WKA TGR RAMP+IK++ R + +K+ WFLLTSANLSKAAWG+ ++ + I +
Sbjct: 420 LYQWKAKRTGRDRAMPNIKSYTRISPDLKKIPWFLLTSANLSKAAWGSNKQYD--YSIGN 477
Query: 527 YELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 586
YE GVL +P + +G+T T G D G
Sbjct: 478 YEAGVLFIP------------------KFITGTT-----------TFPIGGEEDTG---- 504
Query: 587 VVYLPVPYELPPQRYSSEDVPW 608
V P+PY+LP +Y +D P+
Sbjct: 505 VPMFPIPYDLPLSQYEFDDSPF 526
>gi|46123591|ref|XP_386349.1| hypothetical protein FG06173.1 [Gibberella zeae PH-1]
Length = 1094
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 135/421 (32%), Positives = 206/421 (48%), Gaps = 61/421 (14%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIP 217
+PS ++L +Q LP N VS+RD++ +I N++ DI +L+ A P +
Sbjct: 38 IPSPWQLTWIQDLPESENKDAVSLRDLLGDPLISECWEFNFLHDIPFLMNAFDPDTRHLV 97
Query: 218 HVLVIHGESDGTLEHMKRNKPA---------NWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+V ++HG +H +N+ A N +H P+P FGTHHSK M+L +
Sbjct: 98 NVHLVHG----FWKHEDKNRIALENAAAKFENVNVHIAPMPEMFGTHHSKMMILFRHGDT 153
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDFPL-----KDQNNLSEECGF-----ENDLIDYLS 317
++I+HTAN+I DW N + G+W PL K Q S F E ID L+
Sbjct: 154 AQVIIHTANMIPKDWTNMTNGVWKS--PLLPRMSKTQTPASSPEEFLVGSGERFKIDLLN 211
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLR 375
LK+ + + + K+ K+++FS+ LIASVPG H + + WG L+
Sbjct: 212 YLKFYDKRKIICKPLSDKL-----KQYDFSTIKAALIASVPGRHDAHDMSETSWGWAALK 266
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEPL-- 431
L+ + S +V Q SS+ +L K W L ++ K G+ P
Sbjct: 267 RCLRHVPCHQ-HGDSDIVVQVSSIATLGPKDDW---LQKTLFDHLGRCKD-TGLRRPRFK 321
Query: 432 IVWPTVEDVRCSLEGYAAG----NAIPSPQKNVDKDFLKKYWAKWK-------------A 474
+V+PT +++R SL+GYA+G I SPQ+ ++L+ + W
Sbjct: 322 VVFPTADEIRRSLDGYASGLSIHTKIQSPQQAKQLEYLRPMFHHWANDSPGGTKLPDGPV 381
Query: 475 SHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 534
+GR RA PHIKT+ R N + W LLTSAN+SK AWG + ++ I S+E+GVLI
Sbjct: 382 LESGRKRAAPHIKTYVRSNKSSIDWGLLTSANISKQAWGEAARPTGEMRIASWEVGVLIW 441
Query: 535 P 535
P
Sbjct: 442 P 442
>gi|332029125|gb|EGI69136.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 522
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 184/365 (50%), Gaps = 29/365 (7%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ N++VD++WL + + + +++G D N N
Sbjct: 113 ILDCSLGEIVYSLHLNFIVDVEWLCWQYLLAGQCTDMTILYG--DKAYYQTLFN---NIT 167
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP-LKDQ 299
+ K + F HH+K M+L Y G+R+IV TANL DW N +QGLW+ P L +
Sbjct: 168 IIKVNIETGFACHHTKIMILQYKDDGIRVIVSTANLRSTDWENVTQGLWISPHLPRLPES 227
Query: 300 NNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
N S+ GF+ DL YLS + P + + A + +FS V LIAS
Sbjct: 228 ANPSDGESPTGFKKDLERYLSKYEQPTLTQWICA----------VQMADFSKVNVFLIAS 277
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
VPG + + WG+ KL VL + T P+V Q SS+G L + + L +
Sbjct: 278 VPGIYQNNEANFWGYKKLAHVLSRHVTLPSDVFPWPIVAQSSSIGKLGSSFESWLLKDII 337
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKW 472
S + T G+P ++P++++ + S P S + + + +L Y +W
Sbjct: 338 PCMSRESTESTKGQPEFKFIYPSIQNYKQSFHYKNLSWCSPYSAEAHSKQQWLDLYLHQW 397
Query: 473 KASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
KA T R RAMPHIK++ R + + + WF+LTSANLSKAAWG+++++ I +YE G
Sbjct: 398 KAKRTERDRAMPHIKSYTRISPDLKSIPWFVLTSANLSKAAWGSIKRHGYS--IENYEAG 455
Query: 531 VLILP 535
++ +P
Sbjct: 456 IIFVP 460
>gi|320590454|gb|EFX02897.1| tyrosyl-DNA phosphodiesterase [Grosmannia clavigera kw1407]
Length = 553
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 148/538 (27%), Positives = 230/538 (42%), Gaps = 88/538 (16%)
Query: 144 KNSEEALCNFHVSRD---KLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NY 199
+N EEA + S D + S F+L ++ LPA N V++ ++ ++ NY
Sbjct: 45 RNGEEA--HDSTSTDAGVRFRSPFQLTAIRDLPAEDNVDTVTVDEIFGSPLVAECWEFNY 102
Query: 200 MVDIDWLLPAC-----PVLAKIPHVLVIHGESDGTLE-HMKRNKPANWILHKPPLPISFG 253
+ DI + + A ++ E LE + + AN LH +P FG
Sbjct: 103 LHDIGFFMDALNEDVRHLVHVHVVHGFWKREDQRRLELEAEAARYANVQLHTAFMPEPFG 162
Query: 254 THHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW--------MQDFPLKDQNNLSE 304
THHSK A+L + +++++TAN+I DW N +QG+W D +D++ +
Sbjct: 163 THHSKMAVLFRHDDTAQVVIYTANMIPHDWANMTQGVWRSPLLPLLADDVDGEDESEIDG 222
Query: 305 ECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
G F+ DL+ YL S P +++F++ LIASVPG
Sbjct: 223 PVGSGRRFKTDLLSYLRAYN-QRRSICRPLVERLA-------RYDFAAVQAALIASVPGR 274
Query: 361 HT------GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD--EKW------ 406
H+ +WG L+ L+ + + +V Q SS+ +L + W
Sbjct: 275 HSLIRQPDEKYHTQWGWTALKNTLRSVPVQAVAPSTEIVLQVSSMATLGPTDAWIRHTLF 334
Query: 407 --MAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSP----QKNV 460
MA SS++ G S K L V+PT +++R SLEGY +G +I + Q+
Sbjct: 335 SAMATASSAVDKGGSIGKEELQQPRFRAVFPTADEIRRSLEGYKSGTSIHTKIQSSQQQR 394
Query: 461 DKDFLKKYWAKWKASH--------------TGRSRAMPHIKTFARYNGQKLAWFLLTSAN 506
+++ W GR RA PHIKT+ RY + W LLTSAN
Sbjct: 395 QLQYMRPLLCHWANDSPDGAKLPDGATPIVNGRKRAAPHIKTYVRYGQVGVDWALLTSAN 454
Query: 507 LSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQ 566
LSK AWG ++ + S+E+GV++ P G + ++ +I GS Q
Sbjct: 455 LSKQAWGEAVTAAGEVRVASWEIGVMVWP-------GLFAETAVM--QIVGGSDSVLQPA 505
Query: 567 KTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
K A VV L VPY+LP Q+Y ++PW + D GQ W
Sbjct: 506 TGK------------AAGRPVVALRVPYDLPLQQYGKGEIPWVCTLPDEEPDWTGQAW 551
>gi|169775023|ref|XP_001821979.1| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae RIB40]
gi|83769842|dbj|BAE59977.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 570
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 133/522 (25%), Positives = 243/522 (46%), Gaps = 96/522 (18%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS F+L ++ L A + N V +R+++ +I NY+ D+D+++ + +
Sbjct: 85 IPSPFQLTHIRDLAASSDNNVDTVRLREILGDPMIRECWQFNYLHDVDFIMGQFDEDVRR 144
Query: 216 IPHVLVIHGESDGTLEHMKRNKPANWILHKPP------------LPISFGTHHSKAMLLI 263
+ V ++HG KR+ P + + +P +FGTHHSK M+L+
Sbjct: 145 LVKVKIVHGS-------WKRDAPNRVRIDEACSRYPNVEAVVAYMPEAFGTHHSKMMVLL 197
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE------CGFENDLIDY 315
+ V++++HTAN+I DW N Q +W PL+ ++ E+ F+ DL+ Y
Sbjct: 198 RHDDLVQVVIHTANMIPGDWTNMCQAVWRSPLLPLQKTDDRVEDLTLGSGARFKRDLLAY 257
Query: 316 LSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK----- 368
L+ +G K P +K++F + L+ASVP L
Sbjct: 258 LT------------EYGPKKTGPLVEQLRKYDFGAIRAALVASVPSKQKVDDLDSQKKTL 305
Query: 369 WGHMKLRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDK 422
WG L+ ++++ + K+ +V Q SS+ +L +KW+ + + +S+S + +
Sbjct: 306 WGWPALKDIMRQIPPAQKTTKATTPHIVTQISSVATLGQTDKWLKDVMFASLSPASTSTR 365
Query: 423 TPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH-- 476
P + I++PT +++R SL GY +G +I S + +++ Y W H
Sbjct: 366 QP----KYSIIFPTADEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLRHWAGDHDT 421
Query: 477 ----------TGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQL 522
GR RA PHIKT+ R++ + + W ++TSANLS AWGA + ++
Sbjct: 422 AEPSHTSKQDAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNASGEV 481
Query: 523 MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAG 582
I S+E+G+++ P + ++ +VP+ K + E + + ++ T
Sbjct: 482 RICSWEIGIVVWPQLYVQD---TESATMVPT-FKRDTPEPLENKDSETTPDT-------- 529
Query: 583 ASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
V+ L +PY+LP Y++ D PW ++ + D GQ W
Sbjct: 530 ----VIGLRMPYDLPLTPYAAHDTPWCATAQHLEPDWLGQTW 567
>gi|340521404|gb|EGR51638.1| predicted protein [Trichoderma reesei QM6a]
Length = 1118
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 135/439 (30%), Positives = 212/439 (48%), Gaps = 61/439 (13%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKIPHVL 220
S ++L R++ LP N V +RD++ +I N++ DI ++L A + + L
Sbjct: 42 SPWQLTRIRDLPEELNRDTVRLRDILDDPLITECWQFNFLHDIPFVLSAFDDMVRNRVQL 101
Query: 221 -VIHG--ESDGTLEHMKRNKPA---NWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVH 273
V+HG + D + ++ A N LH P+P FGTHHSK M++ ++++H
Sbjct: 102 HVVHGFWKKDDESRIVLSDQAAQFHNVHLHCAPMPEMFGTHHSKMMVIFRSDDTAQVVIH 161
Query: 274 TANLIHVDWNNKSQGLWM---------QDFPLKDQNNLSEECG--FENDLIDYLSTLKWP 322
TAN+I DW N + +W QD + L G F+ DL++YL ++
Sbjct: 162 TANMIPKDWTNMTNAVWRSPRLPRLGEQDTLFQQGQQLPVGSGTRFKVDLLEYLR--QYE 219
Query: 323 EFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMKLRTVLQE 380
+ + +N F+FSS IASVPG H+ +S WG ++ L+
Sbjct: 220 LYRPTCKQLVDRLVN------FDFSSIRAAFIASVPGRHSFRDASRPAWGWAAVQRCLRC 273
Query: 381 CTFEKGFKKSPLVYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEP--LIVWPT 436
E+G +S +V Q SS+ +L K W L ++ + TP G P +V+PT
Sbjct: 274 VPVERG--QSQIVVQISSIATLGAKDDW---LQRTLFDSLATSLTP-NTGRPGFKVVFPT 327
Query: 437 VEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWK---------------ASHT 477
V+++R S++GYA+G + I SPQ+ +L+ W + +
Sbjct: 328 VDEIRNSIDGYASGRSIHTKIQSPQQIRQLGYLRPILHHWANDSAGGAKLPGEPSISGDS 387
Query: 478 GRSRAMPHIKTFARYN-GQKLAWFLLTSANLSKAAWG-ALQKNNSQLMIRSYELGVLILP 535
GR RA PHIKT+ R+N + W +LTSAN+SK AWG AL + I S+E+GVL+ P
Sbjct: 388 GRDRAAPHIKTYIRFNESNTIDWAMLTSANMSKQAWGEALSSTTGNIRIASWEVGVLVWP 447
Query: 536 SAK-RHGCGFSCTSNIVPS 553
G S ++VPS
Sbjct: 448 GLLCEDGAMVSSPKSLVPS 466
>gi|146098236|ref|XP_001468366.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania infantum JPCM5]
gi|134072733|emb|CAM71450.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania infantum JPCM5]
Length = 682
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 154/617 (24%), Positives = 246/617 (39%), Gaps = 184/617 (29%)
Query: 177 NTSCVSIRDVIQGDIIVA-------ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
+ S + +RD+ + D+ +LS+Y+ D+ WLL P L+ + LV+ GT
Sbjct: 35 SCSLLRLRDLFRCDVADPGECWQHILLSSYVTDLPWLLATVPELSAVTGKLVLLSGEKGT 94
Query: 230 ---------------------------LEHMKRNKPANWILH-----------KPPLPIS 251
+ ++ LH +PPLP++
Sbjct: 95 ATLRRTTGDSSCPYTAASPLMDRVNPFMAALREQARPTSALHTTLSRERLAVLEPPLPVA 154
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEND 311
FGTHH+K L + RG+R+ + TANL+ DW KSQG+++QDFP K S + +
Sbjct: 155 FGTHHTKMALCVNGRGLRVSIFTANLVEQDWCRKSQGIYVQDFPWKTATVRSNDDSADAT 214
Query: 312 LIDYLST------------LKWPEFSANL-----------------PAHGNFKINP---- 338
+++ ++ K EF A+L P P
Sbjct: 215 MVETATSSTSNSNNGSNTFTKGAEFVAHLRHYLMQCGVSLAAACASPTDAASAAGPLGIF 274
Query: 339 --SFFKKFNFSSAAVRLIASVPGYHTGSSL---KKWGHMKLRTVLQECTFEKGFKKSP-- 391
F +FS+AAV L++SVPG + + + G +L VL+ +
Sbjct: 275 ETDFLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRVGLCRLAEVLRRSALTMATSPASVD 334
Query: 392 LVYQFSSLGSLDEKWMAELSSSM----SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
L +Q+SS GSL+ ++ L ++M ++ P G+ + +V+PT E+VR S EG+
Sbjct: 335 LSWQYSSQGSLNPAFLNSLQAAMCGESAAVIESGDAPRGVRDVQVVYPTEEEVRNSWEGW 394
Query: 448 AAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----------------------------- 478
G ++P + +F+ +W +S G
Sbjct: 395 RGGMSLPL-RVQCCHEFVNARLHRWGSSEEGHTAKRAFPRPPKVAAAHASREDAVDVDGV 453
Query: 479 -------------------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGAL-- 515
R A+PHIK++A + + WFLLTSANLS+AAWG+L
Sbjct: 454 DIDGGEETTPSLAGSCAADRQFALPHIKSYAAVAPDRSCVRWFLLTSANLSQAAWGSLSR 513
Query: 516 ---QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ--IQKTKL 570
Q+ + Q ++RSYELGVL + + S S + S I+ + S+ + +T L
Sbjct: 514 KVNQRGSRQQLVRSYELGVLYDSHSAIYPSASSWFSVVAESRIELPNARNSRAMLYETPL 573
Query: 571 VTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS-------------------------E 604
G ++ V L +PY L P Y+S
Sbjct: 574 -----------GVDTQDVCLYIPYNLLCPTPYASTAALRAHRHAPDEGEQAVEEAALDCS 622
Query: 605 DVPWSWDKRYTKKDVYG 621
DVPW D + KD YG
Sbjct: 623 DVPWVLDMPHRGKDAYG 639
>gi|212543739|ref|XP_002152024.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces marneffei ATCC
18224]
gi|210066931|gb|EEA21024.1| tyrosyl-DNA phosphodiesterase, putative [Talaromyces marneffei ATCC
18224]
Length = 587
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 151/570 (26%), Positives = 244/570 (42%), Gaps = 99/570 (17%)
Query: 125 GELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLR-------VQGLPAWAN 177
G S+ + Q E ++ E + D L FR++R ++ LP N
Sbjct: 45 GRPSNARRDQNAESAPQDFDIKENTQIDIDREDDSLRDKFRIIRSPIQLTHIRDLPNDKN 104
Query: 178 TSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTL- 230
V + D++ +I NY D+D+++ + + V ++HG +S +
Sbjct: 105 IDTVQLHDILGDPMIRECWQFNYCFDVDFVMSQFDQDVRDLVQVKIVHGSWKQDSANRIR 164
Query: 231 --EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQ 287
E R I+ P P FGTHHSK M+L+ + ++I+HTAN++ DW N SQ
Sbjct: 165 IDEACARYPNVESIVAYMPEP--FGTHHSKMMILLRHDDHAQVIIHTANMLAGDWTNMSQ 222
Query: 288 GLWMQDF----PLKDQNNLSEECGF------ENDLIDYLSTLKWPEFSANLPAHGNFKIN 337
+W P++D + ++ F + DL+ YL EF +GN K
Sbjct: 223 AVWRSPLLSLSPIRDNSETAQAASFGTGARFKRDLLAYL------EF------YGNKKTR 270
Query: 338 PSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKLRTVLQECTFE-KGFKK 389
+KF+F + LIASVP S WG L+ L++ + +
Sbjct: 271 SLVDQLRKFDFQAIRAALIASVPSKERISRADSSRSTLWGWPALKDTLRQVPLRIREKNQ 330
Query: 390 SP-LVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSL 444
P +V Q SS+ SL +KW+ ++ SE + P I++PT +++R SL
Sbjct: 331 CPHVVIQISSIASLGQTDKWLKDVLFDSLCLPSELPHTNKMPRPKYSIIFPTPDEIRRSL 390
Query: 445 EGYAAGNAIPSPQKNVDKD----FLKKYWAKW----------------------KASHTG 478
GY +G +I +++ + +++ Y +W + + G
Sbjct: 391 NGYGSGGSIHMKLQSITQQKQLQYMRPYLCQWAGDQKQTAMGTLHLNAESVYNSQRTDAG 450
Query: 479 RSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 534
R RA PHIKT+ R+ + + W ++TSANLS AWGA +N ++ I S+E+GVL
Sbjct: 451 RRRAAPHIKTYIRFADKTKMDTIDWAMITSANLSTQAWGAAANSNGEVRICSWEIGVLFW 510
Query: 535 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 594
P I ST T + + T S D S +V +PY
Sbjct: 511 PEL------------IAGDPFNPNSTRTEMVPSFRKDTPDPTESEDV---SSIVGFRMPY 555
Query: 595 ELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+LP YS++DVPW + + D GQ W
Sbjct: 556 DLPLTPYSAQDVPWCATINHPEPDWLGQSW 585
>gi|342883838|gb|EGU84260.1| hypothetical protein FOXB_05217 [Fusarium oxysporum Fo5176]
Length = 1127
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 131/456 (28%), Positives = 213/456 (46%), Gaps = 56/456 (12%)
Query: 124 NGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSI 183
N + ++M + D Q E+ + S + S ++L ++ LP N V++
Sbjct: 2 NRPVKRQRMEEPDAQTPESLQRSISPPKKRDRKLTVVKSPWQLTWIRDLPEGDNQDAVTL 61
Query: 184 RDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG-----ESDGTLEHMKRN 236
+D++ +I N++ DI +L+ + P + V ++HG +++ +
Sbjct: 62 KDLLSDPLISECWEFNFLHDIPFLMNSFDPDTRHLVKVHLVHGFWKREDANRIALENASS 121
Query: 237 KPANWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLW----- 290
+ N H P+P FGTHHSK M+L G ++I+HTAN+I DW N S G+W
Sbjct: 122 EFENIKTHIAPMPEMFGTHHSKMMILFRHDGTAQVIIHTANMIPKDWTNMSNGVWKSPLL 181
Query: 291 -----MQDFPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
Q+F + +++ F+ DL++YL + K +
Sbjct: 182 PKLSGAQNFQASPEDHSVGSGQRFKIDLLNYLKAYDRRKIIC--------KPLTDKLTHY 233
Query: 345 NFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL 402
+FSS L+ASVPG H + + WG L+ LQ + S +V Q SS+ +L
Sbjct: 234 DFSSIKAALVASVPGKHDARDMSETSWGWAALKRCLQHVPCQD-HGDSDIVVQVSSIATL 292
Query: 403 DEK--WMAELSSSMSSGFSEDKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI----P 454
K W L ++ + K P G+G P +V+PT +++R SL+GYA+G +I
Sbjct: 293 GAKDDW---LQKTLFEPLTRSKNP-GLGRPRFKVVFPTADEIRRSLDGYASGGSIHTKIQ 348
Query: 455 SPQKNVDKDFLKKYWAKWK--------------ASHTGRSRAMPHIKTFARYNGQKLAWF 500
S Q+ ++L+ + W +GR RA PHIKT+ R N + W
Sbjct: 349 SSQQAKQLEYLRPIFHHWANDSPRGAKLPEDTPLRDSGRKRAAPHIKTYIRSNKSSIDWG 408
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS 536
LLTSAN+SK AWG + ++ I S+E+GVLI S
Sbjct: 409 LLTSANISKQAWGEAARPTGEMRIASWEIGVLIWAS 444
>gi|156034731|ref|XP_001585784.1| hypothetical protein SS1G_13301 [Sclerotinia sclerotiorum 1980]
gi|154698704|gb|EDN98442.1| hypothetical protein SS1G_13301 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 633
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 161/626 (25%), Positives = 263/626 (42%), Gaps = 135/626 (21%)
Query: 113 SQKRVSNDGATNGELSSKKMRQ--------------------QDEQDNENGKNSEEALCN 152
+QKR D TN +++ K +R+ Q+E E+ S + +
Sbjct: 27 AQKRRKVDDNTNDDINEKGVRRGMNRSISPPPLRRYRKEIPIQEEGSLEHKVESSKQTSS 86
Query: 153 FHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAIL--SNYMVDIDWLLPAC 210
+ + S F+L ++ LPA +N VS++D++ GD +++ NY+ ++D+L+
Sbjct: 87 KITKQKVVKSPFQLTSIRDLPASSNVDTVSLKDIL-GDPLISECWEFNYLHNLDFLMGQF 145
Query: 211 PV-LAKIPHVLVIHG----ESDGTLEHMKRN-KPANWILHKPPLPISFGTHHSKAMLLI- 263
+ + V V+HG E L M++ K +N L +P FGTHHSK ++L
Sbjct: 146 DEDVRNLVKVNVVHGFWKREDQSRLNLMEQALKYSNVKLLTAYMPEMFGTHHSKMLILFR 205
Query: 264 YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPL--------KDQNNLSEECGFENDLID 314
+ ++I+HTAN+I DW N +Q +W PL K+ + F+ DL++
Sbjct: 206 HDSTAQVIIHTANMIPFDWTNMTQAMWKSPLLPLLDPEKPNPKESGQMGSGSKFKIDLLN 265
Query: 315 YLSTLKWPEFSANLPAHGNFKINPSFFK---KFNFSSAAVRLIASVPGYHT---GSSLKK 368
YL H I + K +FS L+AS PG S+
Sbjct: 266 YLGAY-----------HTKRAICKPLIEQLSKHDFSEIRAALVASTPGKQDIELDSTETA 314
Query: 369 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLG 426
WG L ++L+ K + +V Q SS+ SL +KW L+ + S K P
Sbjct: 315 WGWAGLSSILKSIPCSK--TQPEIVVQISSIASLGPTDKW---LNQTFFKALSTSKDPSP 369
Query: 427 IGEPLIVWPTVEDVRCSLEGYAAGNAIP----SPQKNVDKDFLKKYWAKWKA-------- 474
+ I++PT +++R S+ GY++G+AI + + +LK W
Sbjct: 370 KPKFKIIFPTADEIRRSINGYSSGSAIHTKILTSAQGKQLAYLKPLLCHWAGDGEQHSST 429
Query: 475 -----------------------------SHTGRSRAMPHIKTFARYNG---QKLAWFLL 502
+ R RA PHIKT+ R++ + + W L+
Sbjct: 430 SQTSSTSESATSSNTSNIALSPHMASPPPQNAHRKRAAPHIKTYIRFSSSSHKTIDWMLV 489
Query: 503 TSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVP---SEIKSGS 559
TSANLSK AWG ++ I SYE+GV++ P G S +VP ++I S
Sbjct: 490 TSANLSKQAWGENINTAGEVRICSYEIGVIVWPGLWDEG----NKSKMVPCFGTDIPSRP 545
Query: 560 TETSQIQKTKLVTLT--------------WHGSSDAGASSE-------VVYLPVPYELPP 598
TS+++ T V T G + SE ++ +PY+LP
Sbjct: 546 DVTSELESTVAVEATSVTADNNNIREKGKGKGREEIEKKSENDTENTILIGARIPYDLPL 605
Query: 599 QRYSSEDVPWSWDKRYTKKDVYGQVW 624
Y+ D+PW Y++ D G W
Sbjct: 606 IPYTKSDIPWCASASYSEPDWMGNTW 631
>gi|154344310|ref|XP_001568099.1| putative tyrosyl-DNA phosphodiesterase [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065433|emb|CAM40865.1| putative tyrosyl-DNA phosphodiesterase [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 680
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 138/497 (27%), Positives = 203/497 (40%), Gaps = 141/497 (28%)
Query: 177 NTSCVSIRDVIQGDII-------VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
+ S + +RD+ D+ +LS+YM D WLL P L+ + LV+ GT
Sbjct: 37 SCSLLRLRDLFCCDVADTDECWQYILLSSYMTDFRWLLRTVPELSAVTGKLVVLSGEKGT 96
Query: 230 L-------------------------------EHMKRNKPANWILHK-------PPLPIS 251
EH + +L + PPLPI+
Sbjct: 97 ATLRCTTGEPLHSYTATSPLLDRVNPFVASLREHAQTTSAVGTLLSRERLAVLEPPLPIA 156
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK-------------- 297
FGTHHSK L + RG+R+ + TANL+ DW KSQG+++QDFP K
Sbjct: 157 FGTHHSKMALCVNSRGLRVSIFTANLLEQDWCWKSQGIYVQDFPWKTSAKSSKHDSLDAT 216
Query: 298 --------DQNNLSEECGFENDLIDYLS----------TLKWPEFSANLPAHGNFKI-NP 338
+N S C D ++L + A G I
Sbjct: 217 AGTATTGYSSSNFSGVCPKGIDFAEHLRHYLIQCGVSLAAAFTSLKAAASLAGPLGIFET 276
Query: 339 SFFKKFNFSSAAVRLIASVPGYHTGSSLK---KWGHMKLRTVLQE--CTFEKGFKKSPLV 393
F +FS+AAV L++SVPG H + + G +L VL+ T L+
Sbjct: 277 DFLSHIDFSAAAVWLVSSVPGTHAHGEVSPGYRVGLCRLAEVLRRSPLTMATTPASVDLI 336
Query: 394 YQFSSLGSLDEKWMAELSSSMS----SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAA 449
+Q+SS GSL+ ++ L ++M + P G+ + L+V+PT E+VR S EG+
Sbjct: 337 WQYSSQGSLNSTFLNTLQAAMCGEAVTVIESGNAPRGVRDVLVVYPTEEEVRNSWEGWRG 396
Query: 450 GNAIP-------------------------------SPQKNV---------------DKD 463
G ++P P K V D D
Sbjct: 397 GGSLPLRVQCCHEFVNNRLHRWGSRAEDHAVEHGLTQPAKGVAAHASREDAVDVDQADSD 456
Query: 464 FLKKYWAKWKASHTG-RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGAL----- 515
++ A AS R A+PHIK++A + + WFLLTSANLS+AAWG++
Sbjct: 457 RDEEATASLVASCAAYRQFALPHIKSYAAVAPDRTCVRWFLLTSANLSQAAWGSVSGKVK 516
Query: 516 QKNNSQLMIRSYELGVL 532
++ Q ++RSYELGVL
Sbjct: 517 KRGLCQQLVRSYELGVL 533
>gi|328721089|ref|XP_003247207.1| PREDICTED: probable tyrosyl-DNA phosphodiesterase-like isoform 2
[Acyrthosiphon pisum]
Length = 678
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 131/455 (28%), Positives = 219/455 (48%), Gaps = 73/455 (16%)
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVL-AKIPHVLVIHGESDGTLEHMKRNK 237
S + D GD+ ++ N+MV++ WL + + + +++ D ++ + + K
Sbjct: 277 SFAELLDKSLGDLSESLHLNFMVELGWLFAQYFITDQRGKKMTLLYERCDEDIDELHKKK 336
Query: 238 PANWILHKPPL-PISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQ-DF 294
+ HK + +FG HSK + Y G +R++V +ANL DW +QG+W+ F
Sbjct: 337 KLLNVRHKKIINKNAFGHQHSKVSMFAYADGSLRVVVMSANLCEDDWTKYAQGIWVSPKF 396
Query: 295 PLKDQNNLSE---ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
PLK++++ S+ + F+ D++ YL++ + P + +K +FS A
Sbjct: 397 PLKEEDDKSDGNSQTDFKIDILRYLNSFREPSLVPWIQK----------IEKVDFSQA-- 444
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSL---DEKW 406
+VPG HT WGH+ L+ +L++ C + P++ Q SSLGSL DE+W
Sbjct: 445 ----NVPGKHTEPL---WGHLYLKNILKKHACLPFCVPSEWPIIAQCSSLGSLGTTDEEW 497
Query: 407 M-AELSSSMSSGFSEDKTPLGIGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK 462
+ +E S+S+ D T +P+ +++P+V++V S +G G +P + +K
Sbjct: 498 LKSEFVESLSASTYCDDTDTD-NDPIPFHLIYPSVKNVLNSWDGALGGICLPYNKILHEK 556
Query: 463 DF-LKKYWAKWKASHTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNN 519
LKKY W+ R++AMPHIKT+ R + +++WFLL SANLSKAAWG K++
Sbjct: 557 QLWLKKYMCLWQCHSRKRTKAMPHIKTYCRISPCLTEMSWFLLGSANLSKAAWGRKLKSD 616
Query: 520 SQL-MIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 578
Q I ++E GVL LP F S+ P
Sbjct: 617 EQSNFIMAHEAGVLFLPQ-------FLIGSDTFP-------------------------- 643
Query: 579 SDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 613
D ++ Y +P++LP YS D PW+ R
Sbjct: 644 IDETEPNKFPYFSLPFDLPLAGYSDTDQPWTISTR 678
>gi|213409511|ref|XP_002175526.1| tyrosyl-DNA phosphodiesterase [Schizosaccharomyces japonicus
yFS275]
gi|212003573|gb|EEB09233.1| tyrosyl-DNA phosphodiesterase [Schizosaccharomyces japonicus
yFS275]
Length = 518
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 149/506 (29%), Positives = 221/506 (43%), Gaps = 82/506 (16%)
Query: 158 DKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAK 215
+K S L ++ LP N C+S+R +I + N+ +D+ +++ P + K
Sbjct: 52 EKQDSPIFLNSIKSLPDEENVHCLSLRQLIGSKNLRETWQFNFCIDLGFIVENMHPSVLK 111
Query: 216 IPHVLVIHGESDGT-----LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPR-GVR 269
V V HG S + L K P + LH +P +GTHHSK M+ + +
Sbjct: 112 QVKVHVTHGYSYDSPRMDVLRQQKTRLPMDIELHSVYVP-QWGTHHSKIMVNFFADDSCQ 170
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC------GFENDLIDYLSTLKWPE 323
+++HTAN+I +DW SQ ++ PL + + E F+ D YLS K
Sbjct: 171 VVIHTANMIQMDWEGMSQAIYKT--PLLWRKTVEREGPPSVGDRFQKDFCSYLSHYK--- 225
Query: 324 FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQ--EC 381
A L ++++F+S I+SVPG G L WGH +L L E
Sbjct: 226 HCAKLICK---------LQRYDFTSVKAIFISSVPGKFGGDKLDSWGHNRLEKELAAIES 276
Query: 382 TFE-----KGFKKSPL-VYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEPLIV 433
E F+ S + V Q SS+GS + ++ E + ++ + K ++
Sbjct: 277 MAEFMGPRNKFQDSDICVSQCSSMGSFGARQAFLKEHTKALHCDLTHWK---------LI 327
Query: 434 WPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK 487
+PTV DVR SL G+ +G++I V++ KWKA +GR R PH+K
Sbjct: 328 FPTVTDVRDSLLGWHSGSSIHFNVTARGAPAQVEELVRHNQLCKWKAMKSGRQRIAPHVK 387
Query: 488 TFARYN--GQKLAWFLLTSANLSKAAWGALQ------KNNSQLMIRSYELGVLILPSAKR 539
T+ R N G + W LLTSANLSK AWG L+ K L IRSYE GVL+ P
Sbjct: 388 TYMRLNDEGTLIRWVLLTSANLSKPAWGTLEGVAANSKTEHGLRIRSYEAGVLLHPGLFA 447
Query: 540 HGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQ 599
+C V KS S ++ D S V + +P++ PPQ
Sbjct: 448 DDSNSACAFFPV---YKSNSLKSPNF--------------DFPLS---VAIRMPWDFPPQ 487
Query: 600 RYSSEDVPWSWDKRYTKKDVYGQVWP 625
Y +D WS + D G WP
Sbjct: 488 PYGDKDDIWSPSIPRNETDWLGSKWP 513
>gi|332029128|gb|EGI69139.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 550
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 128/450 (28%), Positives = 200/450 (44%), Gaps = 85/450 (18%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVI------HGESDGTLEHMKRN 236
I D G+I+ ++ +MVD+ WL + + + ++ H E + E
Sbjct: 157 ILDRSLGEIVNSLHLTFMVDVTWLYLQYLLAGQRTDMTILCKHRICHEELNICHE----- 211
Query: 237 KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD--- 293
N I+ + +HH+ M+L Y G+R+IV TA L +DW N++QGLW+
Sbjct: 212 ---NVIIEIVGQLDQYSSHHANIMILQYKNGIRVIVSTAGLYSIDWENRTQGLWISPHLP 268
Query: 294 -FPLKDQNNLSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P + + E GF+ DL YLS K P + + A + +FS V
Sbjct: 269 YLPESAKPSDGESPTGFKKDLERYLSKYKQPALTQWIRA----------VQMADFSDVNV 318
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL--------- 402
L+ASVPG + WG+ KL VL ++ P+V Q S +G
Sbjct: 319 FLVASVPGIYKADEADFWGYRKLAHVLSRYATLPRNEQWPIVAQSSGVGCFGLFKNWLLK 378
Query: 403 DEKW-MAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVD 461
D W M+E++S S + + ++P++E+ + S + + +N
Sbjct: 379 DIIWSMSEMTSKASKNHPQFQ---------FIYPSIENYKQSFDYQCLITPLTYSAENHS 429
Query: 462 K-DFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKN 518
K +L+ Y +WKA+ TGR RAMP+IK++ R + +K+ WFLLTSANLSKAAWG+ K
Sbjct: 430 KQQWLESYLYQWKATRTGRDRAMPNIKSYTRISPDLKKIPWFLLTSANLSKAAWGST-KQ 488
Query: 519 NSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGS 578
I +YE GVL +P K +T T
Sbjct: 489 YKGYSIGNYEAGVLFIP---------------------------------KFITGTTTFP 515
Query: 579 SDAGASSEVVYLPVPYELPPQRYSSEDVPW 608
++ V P+PY+LP +Y S+D P+
Sbjct: 516 VGEEKNTGVPVFPIPYDLPLTQYESDDSPF 545
>gi|71001518|ref|XP_755440.1| tyrosyl-DNA phosphodiesterase [Aspergillus fumigatus Af293]
gi|66853078|gb|EAL93402.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus fumigatus
Af293]
gi|159129510|gb|EDP54624.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus fumigatus
A1163]
Length = 564
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 141/528 (26%), Positives = 229/528 (43%), Gaps = 100/528 (18%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS +L ++ L A + N V ++D++ +I N++ D+D+L+ + +
Sbjct: 72 IPSPIQLSHIRDLSAASGNNVDTVRLKDILGDPLIRECWQFNFLFDVDFLMSQFDEDVRR 131
Query: 216 IPHVLVIHGESDGTLEHMKRNKPA-----NWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
+ V V+HG + R + A N +P FGTHHSK M+L+ + +
Sbjct: 132 LVQVKVVHGSWKKDAPNRIRIEEACPRYPNVEAITAYMPEPFGTHHSKMMILLRHDDLAQ 191
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-------FENDLIDYLSTLKW 321
+++HTAN+I DW N Q +W PL+ E G F+ DL+ YL+
Sbjct: 192 VVIHTANMIPGDWANMCQAVWRSPLLPLRKSGREPEGPGAIGSGVRFKRDLLAYLNE--- 248
Query: 322 PEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKL 374
+G K P ++F+FS+ LIASVP SSL WG L
Sbjct: 249 ---------YGVKKTGPLVRQLERFDFSAVRAALIASVPSKQRLSSLDSQKKTLWGWPAL 299
Query: 375 RTVLQECTFEKGFK----KSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIG 428
+ ++ K +S +V Q SS+ SL + KW+ ++ S + I
Sbjct: 300 KEATRQIPLTPKGKSQTVQSHIVTQISSIASLGQTDKWLKDV---FFPSLSPTPSMASIP 356
Query: 429 EPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------- 475
+P I++PT +++R SL GY +G +I S + +++ Y W
Sbjct: 357 QPKFSIIFPTPDEIRRSLNGYGSGGSIHMKLQSATQQKQLQYMRPYLRHWAGDSDSSSST 416
Query: 476 -----HTGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIRS 526
GR RA PHIKT+ R++ + + W ++TSANLS AWGA N ++ I S
Sbjct: 417 STPQREAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNNAGEVRISS 476
Query: 527 YELGVLILP--------SAKRH--GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWH 576
+E+GV++ P + +RH C +P ++
Sbjct: 477 WEIGVIVWPQLFVHEDNTTERHQQAVMVPCFKRDIPLQL--------------------- 515
Query: 577 GSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
D +V L +PY+LP Y + +VPW +T+ D GQ W
Sbjct: 516 -PEDMPRCDVLVGLRMPYDLPLIPYKANEVPWCATIAHTEPDWLGQTW 562
>gi|256575388|gb|ACU87659.1| tyrosyl DNA phosphodiesterase 1 [Leishmania donovani]
Length = 828
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 161/641 (25%), Positives = 251/641 (39%), Gaps = 204/641 (31%)
Query: 171 GLPAWANT--------------SC--VSIRDVIQGDIIVA-------ILSNYMVDIDWLL 207
G+P W N SC + +RD+ + D+ +LS+Y+ D+ WLL
Sbjct: 159 GVPLWVNAIDSFASVPQRHAPLSCSLLRLRDLFRCDVADPGECWQHILLSSYVTDLRWLL 218
Query: 208 PACPVLAKIPHVLVIHGESDGT---------------------------LEHMKRNKPAN 240
P L+ + LV+ GT + ++ A
Sbjct: 219 ATVPELSAVTGKLVVLSGEKGTATLRRSTGDPSSPYTAASPLMDRVNPFMAALREQARAT 278
Query: 241 WILH-----------KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL 289
LH +PPLP++FGTHH+K L + RG+R+ + TANL+ DW KSQG+
Sbjct: 279 SPLHTALSRERLAVLEPPLPVAFGTHHTKMALCVNGRGLRVSIFTANLVEQDWCWKSQGI 338
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLST------------LKWPEFSANLPAH------ 331
++QDFP K S + +++ + K EF A+L +
Sbjct: 339 YVQDFPWKTATERSNDDSAGTTMVETAARSTSDSNNGSNAFTKGAEFVAHLRQYLMQCGV 398
Query: 332 -------------------GNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGSSLKKW-- 369
G F+ + F +FS+AAV L++SVPG Y G +
Sbjct: 399 SLAAACASPADAASAAGPLGIFETD--FLSHIDFSAAAVWLVSSVPGTYAHGEVCPGYRV 456
Query: 370 GHMKLRTVLQEC--TFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS----SGFSEDKT 423
G +L VL+ T L +Q+SS GSL+ ++ L ++M +
Sbjct: 457 GLCRLAEVLRRSALTMATAPASVDLSWQYSSQGSLNPAFLNSLQAAMCGESVAVIESGDA 516
Query: 424 PLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----- 478
P G+ + +V+PT ++VR S EG+ G ++P + +F+ +W +S G
Sbjct: 517 PRGVRDVQVVYPTEDEVRNSWEGWRGGGSLPL-RVQCCHEFVNARLHRWGSSEAGHTAKR 575
Query: 479 -------------------------------------------RSRAMPHIKTFARYNGQ 495
R A+PHIK++A
Sbjct: 576 AFPRPAKVAAAHASREDAVDVDGVDSDGGEGTPVSLAGSCAAYRQFALPHIKSYAAVAPD 635
Query: 496 K--LAWFLLTSANLSKAAWGAL-----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTS 548
+ + WFLLTSANLS+AAWG+L Q + Q ++RSYELGVL + + S S
Sbjct: 636 RSCVRWFLLTSANLSQAAWGSLSRKVNQHGSRQQLVRSYELGVLYDSHSAIYPSASSWFS 695
Query: 549 NIVPSEIKSGSTETSQ--IQKTKLVTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS-- 603
+ S+I+ + S+ + +T L G ++ V L PY L P Y+S
Sbjct: 696 VVAKSKIELPNARNSRAVLYETPL-----------GVDTQDVCLYTPYNLLCPTPYASTA 744
Query: 604 -----------------------EDVPWSWDKRYTKKDVYG 621
DVPW D + +D YG
Sbjct: 745 ALRAHRDAPDTGEQAVAGAALDCSDVPWVLDMPHRGRDAYG 785
>gi|307105869|gb|EFN54116.1| hypothetical protein CHLNCDRAFT_13268, partial [Chlorella
variabilis]
Length = 150
Score = 149 bits (377), Expect = 4e-33, Method: Composition-based stats.
Identities = 78/179 (43%), Positives = 101/179 (56%), Gaps = 40/179 (22%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFAR 491
+VW TV +V+ S+EG+ AG +IP P KNVD+ FL+ Y+ +W GR RAMPHIK++ R
Sbjct: 10 LVWTTVAEVQNSIEGWMAGRSIPGPAKNVDRPFLQAYYRRWGGEACGRQRAMPHIKSYLR 69
Query: 492 YNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIV 551
Y G +AW + S NLSKAAWG LQK SQLM+RSYELGVL++PS +
Sbjct: 70 YRGDDVAWLYVGSHNLSKAAWGQLQKQGSQLMVRSYELGVLLVPSLE------------- 116
Query: 552 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV--VYLPVPYELPPQRYSSEDVPW 608
G+ A A + V LP+PY LPPQRY++ D PW
Sbjct: 117 -------------------------GAYQAAARGQELRVPLPIPYTLPPQRYAAGDQPW 150
>gi|119481099|ref|XP_001260578.1| tyrosyl-DNA phosphodiesterase, putative [Neosartorya fischeri NRRL
181]
gi|119408732|gb|EAW18681.1| tyrosyl-DNA phosphodiesterase, putative [Neosartorya fischeri NRRL
181]
Length = 564
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 141/529 (26%), Positives = 232/529 (43%), Gaps = 102/529 (19%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS +L ++ L A + N V ++D++ +I N++ D+D+L+ + +
Sbjct: 72 IPSPIQLTHIRDLSAASGNNVDTVRLKDILGDPMIRECWQFNFLFDVDFLMSQFDEDVRR 131
Query: 216 IPHVLVIHGESDGTLEHMKRNKPA-----NWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
+ V V+HG + R + A N +P FGTHHSK M+L+ + +
Sbjct: 132 LVQVKVVHGSWKKDAPNRIRIEEACPRYPNVEAITAYMPEPFGTHHSKMMILLRHDDLAQ 191
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG-------FENDLIDYLSTLKW 321
+++HTAN+I DW N Q +W L+ E G F+ DL+ YL+
Sbjct: 192 VVIHTANMIPGDWANMCQAVWRSPLLALRKSEREPEGPGAIGSGARFKRDLLAYLNE--- 248
Query: 322 PEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMKL 374
+G K P ++F+FS+ LIASVP SSL WG L
Sbjct: 249 ---------YGVKKTGPLVRQLERFDFSAVRAALIASVPSKQRLSSLDSRKKTLWGWPAL 299
Query: 375 RTVLQECTFEKGFK----KSPLVYQFSSLGSLDE--KWMAELS-SSMSSGFSEDKTPLGI 427
+ ++ K +S +V Q SS+ SL + KW+ ++ +S+S S + P
Sbjct: 300 KEATRQIPLTPKGKSQTVQSHIVTQISSIASLGQTDKWLKDVFFASLSPTSSMESIP--- 356
Query: 428 GEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS------ 475
+P I++PT +++R SL GY +G +I S + +++ Y W
Sbjct: 357 -QPKFSIIFPTPDEIRRSLNGYGSGGSIHMKLQSATQQKQLQYMRPYLRHWAGDSDSSSS 415
Query: 476 ------HTGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIR 525
GR RA PHIKT+ R++ + + W ++TSANLS AWGA N ++ I
Sbjct: 416 TSTPQREAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNNAGEVRIS 475
Query: 526 SYELGVLILP--------SAKRH--GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTW 575
S+E+GV++ P + +RH C +P ++
Sbjct: 476 SWEIGVMVWPQLFVREDNTTERHQQAVMVPCFKRDIPLQL-------------------- 515
Query: 576 HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ +V L +PY+LP Y + +VPW +T+ D GQ W
Sbjct: 516 --PDETPGCDVLVGLRMPYDLPLTPYKANEVPWCATAAHTEPDWLGQTW 562
>gi|307211795|gb|EFN87776.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 463
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 183/367 (49%), Gaps = 31/367 (8%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWL-LPACPVLAKIPHVLVIHGESDGT-LEHMKRNKPAN 240
I D G+I+ ++ ++VD++WL L + ++ H D T L P
Sbjct: 99 ILDKSLGEIVNSLHLTFIVDVEWLCLQYALAGQRTDMTILYHNRRDDTDLSDNISIMP-- 156
Query: 241 WILHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFP--- 295
+++ L + THH+K M+L Y G+R++V TANL DW N++QGLW+ P
Sbjct: 157 --VYEAELVFNSETHHTKIMILQYKDDGIRVVVSTANLYSNDWENRTQGLWISPHLPRLP 214
Query: 296 -LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
L ++ F+ D YLS P + K +FS+ V +
Sbjct: 215 ELASSSDGESPTNFKQDFKRYLSRYWNPALKQWMDV----------VSKADFSAVNVCFV 264
Query: 355 ASVPGYHTGSSLKKWGHMKL-RTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSS 413
ASVPG +T + WGH KL R + Q T + ++ Q SS+G+L + + LS
Sbjct: 265 ASVPGNYTHFNADYWGHRKLARVLFQHTTLPPDAPQWSIIAQSSSIGNLGPNYESWLSKE 324
Query: 414 MSSGFSEDKTPLGIGEPLI--VWPTVEDVRCSLEGYAAGNAI-PSPQKNVDKDFLKKYWA 470
+ S++ + P ++P+VE+ S + + + + +++ + +++ +
Sbjct: 325 IVLSMSQETMQMTNRYPKFQYIYPSVENYERSFDFRNSISCFYYTAERHSKQQWIEPFLH 384
Query: 471 KWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYE 528
+WKA+ TGR RAMPHIK++ R + ++++WF+LTSANLSK+AWG S I +YE
Sbjct: 385 QWKATRTGRDRAMPHIKSYMRISPDLKRISWFMLTSANLSKSAWGV---KRSTYSITNYE 441
Query: 529 LGVLILP 535
GV+ LP
Sbjct: 442 AGVVFLP 448
>gi|315052274|ref|XP_003175511.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma gypseum CBS 118893]
gi|311340826|gb|EFR00029.1| tyrosyl-DNA phosphodiesterase 1 [Arthroderma gypseum CBS 118893]
Length = 591
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 147/537 (27%), Positives = 234/537 (43%), Gaps = 92/537 (17%)
Query: 160 LPSTFRLLRVQGL--PAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK- 215
+PS +L ++ + N C+ +RD++ +I NY+ D+D+++ K
Sbjct: 71 IPSPIQLTHIRDINDSTGYNKDCIKLRDILGDPMIKECWQFNYLFDVDYIMSQFDRDVKD 130
Query: 216 IPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRG 267
+ + +IHG E+ + + KR A ++ P P FGTHHSK M+LI +
Sbjct: 131 LIQLKIIHGSWKREAPNRIAIDDACKRYPNAEAVVAYMPEP--FGTHHSKMMILIRHDNL 188
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG-------FENDLIDYLSTLK 320
+II+HTAN+I DW N +Q +W Q ++ + G F+ DL+ YL
Sbjct: 189 AQIIIHTANMIPRDWGNMTQAVWRSPLLPFSQPHVGDTHGEFGSGARFKRDLLAYLD--- 245
Query: 321 WPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMK 373
A+ N I ++++F + LIASVP + WG
Sbjct: 246 ---------AYNNKTIGLLIHQLQRYDFGAVKAVLIASVPSRLPVKAFDSNRKTLWGWPA 296
Query: 374 LRTVLQECTFEKGFK---KSPLVYQFSSLGSLDE--KWMAEL---SSSMSSGFSEDKTPL 425
LR ++ + K ++ Q SS+ +L + KW+ E S S F++ +
Sbjct: 297 LRDAIRSIPIDHSSSQTLKPHIIVQVSSIATLGQTDKWLKETFFGSLCPQSRFNQTISAC 356
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKAS---- 475
I++PT +++R SL+GY +G +I S QK + +L+ Y W
Sbjct: 357 HANFS-IIFPTPDEIRRSLDGYGSGGSIHMKIQSASQQKQLA--YLRHYLCHWAGDAEGQ 413
Query: 476 -----------------HTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGAL 515
GRSRA PHIKT+ R++ ++ W ++TSANLS AWGA
Sbjct: 414 RDPGPATESVKGLAYVREAGRSRAAPHIKTYIRFSDSGMSSIDWAMVTSANLSTQAWGAG 473
Query: 516 QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK--------SGSTETSQIQK 567
++ I S+E+GVLI P R C + + +K + S E Q +
Sbjct: 474 ANAQGEVRICSWEIGVLIWPELFRENNIEKCNDSSPINHVKMIPCFKRNTPSKEPLQPPE 533
Query: 568 TKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ LT H DA V + +PY LP Y+ DVPW + + D GQ W
Sbjct: 534 SDSTKLTSH--PDATNMIRVGFR-MPYNLPLVPYTPRDVPWCATAAHREPDWMGQTW 587
>gi|332029126|gb|EGI69137.1| Putative tyrosyl-DNA phosphodiesterase [Acromyrmex echinatior]
Length = 511
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/441 (27%), Positives = 198/441 (44%), Gaps = 69/441 (15%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ + VD+ WL + + + ++ E + N I
Sbjct: 114 ILDRSLGEIVNSLHLTFRVDVTWLYLQYLLAGQCTDMTILCKRKTRIHEKLSEN-----I 168
Query: 243 LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQN- 300
F +HH+ M+L Y G+R+IV TA L +W N++QGLW+ P ++
Sbjct: 169 TIIKVDGHEFSSHHTNIMILQYKNGIRVIVSTAGLYSAEWENRTQGLWISPHLPYLPESA 228
Query: 301 ---NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASV 357
+ GF+ DL YLS P + + ++ +FS V L+ASV
Sbjct: 229 HPSDGESSTGFKKDLERYLSKYDQPVLTQWICT----------VRRVDFSDVNVFLVASV 278
Query: 358 PGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFS---SLGSLDEKWMA-ELSSS 413
PG H + WG KL VL ++ P+V Q S + GS E W+ ++
Sbjct: 279 PGIHKSYEINFWGCKKLAYVLSRYVTLPSNEQWPIVIQSSGVGNFGSTIESWLLRDIIRC 338
Query: 414 MSSGFSEDKTPLGIG---EPLIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYW 469
MS +T +G+ + ++P++E+ + S + ++ S + + + +L++Y
Sbjct: 339 MSK-----ETSIGLKNHPQFQFIYPSIENYKQSFDCQDLITSLTYSVEIHSKQQWLEQYL 393
Query: 470 AKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSY 527
+WKA TGR AMP IK++ R + +++ WFLLTSANLSKAAWG +++ I +Y
Sbjct: 394 YQWKAKRTGRDCAMPGIKSYTRISPDSKRVPWFLLTSANLSKAAWGLIKRYEG-YSIGNY 452
Query: 528 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 587
E GVL +P K++T T + V
Sbjct: 453 EAGVLFIP---------------------------------KVITGTATFPIGEEEDAAV 479
Query: 588 VYLPVPYELPPQRYSSEDVPW 608
P+PY+LP RY S+D P+
Sbjct: 480 PTFPIPYDLPLSRYDSDDSPF 500
>gi|290999837|ref|XP_002682486.1| predicted protein [Naegleria gruberi]
gi|284096113|gb|EFC49742.1| predicted protein [Naegleria gruberi]
Length = 320
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 149/286 (52%), Gaps = 35/286 (12%)
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDY 315
H+K ++ + +RI+V +ANL DW+ Q +W+QDFP K+ + + FEN L+++
Sbjct: 2 HAKLFIIEFDDFIRIVVSSANLTDFDWSFFKQCIWIQDFPKKENISNNNTNQFENTLVEF 61
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR 375
W + + +P +F +K+++S+A LI S+PGYHT K+GH+ ++
Sbjct: 62 -----WTKLTDGIPG--------NFLRKYDYSNAKGELIPSIPGYHTNIEKDKYGHLAIK 108
Query: 376 TVLQECTFEK----GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL 431
++ F K K+SPL YQ SS+GS++ W+ ELSSS + +D
Sbjct: 109 KAIERMNFTKNEILNLKQSPLYYQMSSIGSMNLDWIKELSSSF---YLKDCNNFN----- 160
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK----YWAKWKASHTGRSRAMPHIK 487
IV+P++E V S G G I K + K +++ +A+H S+ + H++
Sbjct: 161 IVFPSLESVSSSHFGLRCGGMIHLKSKTFETSTFPKHLMTHYSPNQANHLAHSKILLHLE 220
Query: 488 TFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 533
K + + S NLS+ A G LQKN +QL I +YELGV+
Sbjct: 221 NL------KNGYIFVGSHNLSQPALGKLQKNGTQLYISNYELGVIF 260
>gi|358393671|gb|EHK43072.1| hypothetical protein TRIATDRAFT_225252 [Trichoderma atroviride IMI
206040]
Length = 1124
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 136/496 (27%), Positives = 225/496 (45%), Gaps = 72/496 (14%)
Query: 126 ELSSKKMRQQDEQDNENGKNSEEALCN-FHVSRDKL------PSTFRLLRVQGLPAWANT 178
+ + K+ R + D NG + E+L R K S ++L R++ LP N
Sbjct: 2 DFARKRSRDAADGDEGNGDEALESLSRPISPPRKKFRQINIQKSPWQLTRIRDLPDELNK 61
Query: 179 SCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHG-----ESDGTLE 231
VS++D++ +I N++ DI +++ + ++ + V+HG + + L
Sbjct: 62 DTVSLQDLLGDPLIRECWQFNFLHDIPFMVNTFDETVRRLVQLHVVHGFWKKSDLNRILL 121
Query: 232 HMKRNKPANWILHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLW 290
+ N LH P+P FGTHHSK M++ +II+HTAN+I DW N + +W
Sbjct: 122 SDAAARYPNVHLHCAPMPEMFGTHHSKMMVMFRSDNTAQIIIHTANMIPRDWTNMTNAVW 181
Query: 291 MQ-DFPLKDQNNLSEECG----------FENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
PL ++ + G F+ DL+ YL +K+ + K
Sbjct: 182 QSPKLPLLPVPDIISQHGQTLPLGSGLRFKADLLSYL--MKYDSYKVTC------KPLAD 233
Query: 340 FFKKFNFSSAAVRLIASVPGYH--TGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFS 397
F+FSS IASVPG H +S WG L+ LQ G S +V Q S
Sbjct: 234 RLGYFDFSSVRAAFIASVPGKHDIRDASQPAWGWAGLQRCLQGVPVGPG--GSAIVVQIS 291
Query: 398 SLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP 454
S+ +L ++ W+ L +S+++ + + +V+PT +++R SL+GYA+GN+I
Sbjct: 292 SIATLGANDDWLQRTLFNSLATSLTPNANKPSFK---VVFPTADEIRNSLDGYASGNSIH 348
Query: 455 SPQK-------------------NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN-G 494
+ + N KD + +GR+RA PHIKT+ R+N
Sbjct: 349 TKIQSAQHISQLRYLHPILHHWANDSKDGAALFAGASIYGDSGRNRAAPHIKTYIRFNCN 408
Query: 495 QKLAWFLLTSANLSKAAWG-ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 553
+ W +LTSAN+SK AWG L+ + I S+E+GVL+ P+ C ++ S
Sbjct: 409 TTIDWAMLTSANMSKQAWGETLKPTTGEFRIASWEVGVLVWPN-------LLCKDGVMLS 461
Query: 554 EIKSGSTETSQIQKTK 569
+S + S + +
Sbjct: 462 SFQSDTVNMSPFSQAQ 477
>gi|255950552|ref|XP_002566043.1| Pc22g21470 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211593060|emb|CAP99435.1| Pc22g21470 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 520
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 135/519 (26%), Positives = 219/519 (42%), Gaps = 118/519 (22%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLL------PACPVLA 214
S +L ++ LP N + +RD++ +I NY+ D+D+L+ AC +
Sbjct: 62 SPIKLTHIRDLPEGNNVDTIRLRDILGDPMIRECWQFNYLFDVDFLMSQFDEDEAC---S 118
Query: 215 KIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
+ P+V I +P FGTHHSK M+L+ + ++I+H
Sbjct: 119 RYPNVEPIVAY----------------------MPEPFGTHHSKMMILLRHDDLAQVIIH 156
Query: 274 TANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG--------FENDLIDYLSTLKWPEF 324
TAN+IH+DW N +Q W PL+ N + F+ DL+ YL
Sbjct: 157 TANMIHMDWTNMTQAAWCSPLLPLQKANTAGSQADNKIGSGARFKRDLLAYLK------- 209
Query: 325 SANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGY-HTGSSLKK----WGHMKLRTV 377
A+G K P ++FSS LIASVP H S + WG L+ +
Sbjct: 210 -----AYGPKKTGPLVQQLDNYDFSSIRAALIASVPSKKHVSDSSSEEDTLWGWPALKDL 264
Query: 378 LQECTFEKGF--KKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIGEPL-- 431
+ + ++ KK +V Q SS+ +L + KW+ E+ F + TP +P
Sbjct: 265 MSQIPIQQKSPSKKPHVVIQISSVATLGQTNKWLKEV-------FFKSLTP----QPTTY 313
Query: 432 -IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASHTGRSRAM--- 483
I++PT +++R SL GY +G++I S + +++ + +W + +
Sbjct: 314 SIIFPTPDEIRRSLNGYNSGSSIHMKTQSAAQQKQLQYMRPHLCQWAGDSLPPGQCIDLS 373
Query: 484 ---------------PHIKTFARY---NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIR 525
PHIKT+ R+ + + + W +++SANLS AWGA + ++ I
Sbjct: 374 EENPPRREAGRARAAPHIKTYIRFADSDMKTIDWAMVSSANLSTQAWGAATNGSGEVRIC 433
Query: 526 SYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASS 585
S+E+GV++ P R G G G SDA +S
Sbjct: 434 SWEIGVVVWPDLFRDGA--------------EGKAPVPDALMVPCFKRDRPGVSDADTAS 479
Query: 586 EVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
VV +PY+LP Y + D PW + D G+ W
Sbjct: 480 VVVGFRMPYDLPLTPYGAADEPWCATASHALPDWRGESW 518
>gi|307211791|gb|EFN87772.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 530
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 184/368 (50%), Gaps = 38/368 (10%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G+I+ ++ +MVD WL + + +++++GE K N
Sbjct: 153 ILDRSLGEIVNSLHLTFMVDARWLCLQYLLAGQCTDMMILYGERVD-----KEKLGDNIT 207
Query: 243 LHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
+P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ L +
Sbjct: 208 TVHVEMPFEFGCHHTKIMILQYRDNGIRVVVSTANLYSDDWENRTQGMWISPH-LPRLSK 266
Query: 302 LSEECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
++ CG F+ DL YL T P K +K +FS+ V LIAS
Sbjct: 267 AAKRCGESPTNFKKDLQRYLGTYHNPA----------LKRWRKLVRKADFSAINVCLIAS 316
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLD---EKWMA-ELS 411
PG ++ WG+ KL VL + T + ++ Q SS+G+ E W++ E+
Sbjct: 317 TPG-RFRHTVNLWGYKKLADVLFRHVTQLPNALEWSIIAQSSSVGNFGPRYEGWLSKEIV 375
Query: 412 SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK--DFLKKYW 469
SM+ D + +++P+VE+ S + Y G + + V ++K Y
Sbjct: 376 RSMAWKTVRDLKDYPKFQ--LIYPSVENYEQSFD-YQNGTSCFFYSREVHSKLQWIKSYL 432
Query: 470 AKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSY 527
+WKA+ TGR++AMP+IK++ R + +++AWF+LTSANL+K AWG + N I +Y
Sbjct: 433 YQWKATKTGRNQAMPYIKSYTRISPDLKRIAWFVLTSANLNKGAWGVQRSN---YYIANY 489
Query: 528 ELGVLILP 535
E+GV LP
Sbjct: 490 EVGVAFLP 497
>gi|115384578|ref|XP_001208836.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196528|gb|EAU38228.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1250
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 157/583 (26%), Positives = 255/583 (43%), Gaps = 113/583 (19%)
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQ----DEQDNENGKNSEEALCNFHVSRDK-LPSTFR 165
++ K S+D TN + +R+ + ++ +S N + +PS F+
Sbjct: 708 AKRAKLSSDDSTTNSTTALASLRRSITPPSPRPSKRAASSPAKTTNAQQDTARVIPSPFQ 767
Query: 166 LLRVQGLPAWANTSCVSIR-DVIQGDIIV--AILSNYMVDIDWLLPACPV-LAKIPHVLV 221
L V+ L + + ++R I GD ++ NY+ D+D+L+ + + V V
Sbjct: 768 LTHVRDLAESSGNNADTVRLHNILGDPMIRECWQFNYLFDVDFLMKQFDEDVRSLVKVKV 827
Query: 222 IHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVH 273
+HG E+ + E R I+ +P +FGTHHSK M+L+ + ++++H
Sbjct: 828 VHGSWKREAPNRIRIDEACSRYPNVEAIVAY--MPEAFGTHHSKMMILLRHDDLAQVVIH 885
Query: 274 TANLIHVDWNNKSQGLWMQDF-PL-KDQNNLSEECG-------FENDLIDYLSTLKWPEF 324
TAN+I DW N Q +W PL KD + SE+ F+ DL+ YL
Sbjct: 886 TANMIPGDWANMCQAVWRSPLLPLRKDIDAESEDAAKIGSGMRFKRDLLAYLDH------ 939
Query: 325 SANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPG---YHTGSSLKK--WGHMKLRTV 377
+G K P ++++F + L+ASVP +T S + WG L+ V
Sbjct: 940 ------YGPKKTGPLVDQLRRYDFDAVRAALVASVPSKQKINTADSQRTTLWGWPALKDV 993
Query: 378 LQECTFEK-GFKKSP----LVYQFSSLGSLDE--KWMAE-----LSSSMSSGFSEDKTPL 425
++ G KS +V Q SS+ SL + KW+ E LSS +S +S
Sbjct: 994 VRGIPLRAAGGSKSAVTPHIVSQISSVASLGQTDKWLKEVFFKSLSSDPTSKYS------ 1047
Query: 426 GIGEPLIVWPTVEDVRCSLEGYAAGNAI-----PSPQKNVDKDFLKKYWAKW-------- 472
I++PT +++R SL GY +G +I +PQ+ +++ Y W
Sbjct: 1048 ------IIFPTDDEIRRSLNGYGSGGSIHMKIQSAPQQK-QLQYIRPYLCHWAGDRDDGS 1100
Query: 473 -------KASHTGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQ 521
+ GR RA PHIKT+ +++ K + W ++TSANLS AWGA + +
Sbjct: 1101 SAGTSMSRKRDAGRRRAAPHIKTYIQFSDTKTMDSIDWAMVTSANLSTQAWGAAPNASGE 1160
Query: 522 LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDA 581
+ I SYE+GV++ P S+ +S Q T +
Sbjct: 1161 IRICSYEIGVVVWPQL------------FADSDAESAVMVPCFKQDTPAF-----AEREG 1203
Query: 582 GASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
S VV L +PY+LP Y+ +D PW +T+ D GQ W
Sbjct: 1204 PVPSVVVGLRMPYDLPLTSYTPKDTPWCATATHTEPDWLGQTW 1246
>gi|396459207|ref|XP_003834216.1| hypothetical protein LEMA_P058850.1 [Leptosphaeria maculans JN3]
gi|312210765|emb|CBX90851.1| hypothetical protein LEMA_P058850.1 [Leptosphaeria maculans JN3]
Length = 650
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 146/589 (24%), Positives = 263/589 (44%), Gaps = 111/589 (18%)
Query: 120 DGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTS 179
DG +G K Q++ D ++G++ + + NF +PS +L+R++ + A N
Sbjct: 86 DGGLDG-----KGDQEEHPDIKSGRDGDSNI-NF------IPSPIQLIRIEDMGAMQNVD 133
Query: 180 CVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHVLVIHG----ESDGTLEHM 233
+ + D++ +I + NY+ D+ +++ + + V ++HG + + +E +
Sbjct: 134 AIGLGDILGDPLIRECWNFNYLFDLGFVMQHFDSDVRHMVKVKIVHGFWRRDDERRIELL 193
Query: 234 KR-NKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLW- 290
+ + N L +P FGTHHSK ++L + +II+HTAN+I+ DW+N +Q +W
Sbjct: 194 EAAERYPNIELLSAYIPDPFGTHHSKMLILFRHDDTAQIIIHTANMIYRDWSNMTQAVWS 253
Query: 291 -------MQDFPLKDQNNLSEECG----FENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
Q +P ++ ++ S G F+ DL+ YL+ + K S
Sbjct: 254 SPMLPLSTQKWPTENPDSASHPVGSGLRFKVDLLRYLAAYE-----------RRTKDLVS 302
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLK-----KWGHMKLRTVLQECTFEKGFKK-SP-- 391
++F + I SVP + K +G + LR +L + + K SP
Sbjct: 303 QLAHYDFFAIRAAFIGSVPSRQNPDASKPSEETSFGWLGLREILTQVPVARRDKSHSPPH 362
Query: 392 LVYQFSSLGSLDEK--WMAELSSSMSS----------------GFSEDKTPLGIGEPL-- 431
+V Q SS+ +L + W+ S +SS S P P
Sbjct: 363 IVTQISSIATLGAQPTWLTHFQSVLSSEPKVSNTAVSGSTKTASASPKHAPNNPPPPTFS 422
Query: 432 IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW--------------K 473
I++PT E++R L+GYA+G +I S Q+ ++ + W +
Sbjct: 423 IIFPTPEELRTCLDGYASGASIHWKLQSAQQQKQLAYMHPFLRHWHSPAPTSPPQDSPRR 482
Query: 474 ASHTGRSRAMPHIKTFARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELG 530
A+H R A PHIKT+ R++ Q + W LLTSANLSK AWG + +++ ++S+E G
Sbjct: 483 AAH--RGPAAPHIKTYIRFSNQDHTTIDWALLTSANLSKQAWGDVVGKKNEMRVQSWEAG 540
Query: 531 VLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSS----------- 579
V++ P+ H + P+ + + +Q+ L +GS+
Sbjct: 541 VVLWPALFAHNS-VPGNRALAPAIMVPVFARDAPLQE-DLAGWLRNGSAAHNHNVCADRV 598
Query: 580 ----DAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
++ + VV +PY+LP Y+++++PW RY + D G W
Sbjct: 599 SPVRNSAVNVTVVGFRMPYDLPLCPYTADEMPWCATMRYAEPDGKGMAW 647
>gi|358384803|gb|EHK22400.1| hypothetical protein TRIVIDRAFT_179757 [Trichoderma virens Gv29-8]
Length = 1118
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 133/446 (29%), Positives = 212/446 (47%), Gaps = 80/446 (17%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGD--IIVAILSNYMVDIDWLLPACPVLAKIPHV 219
S ++L R++ +P N V++ D++ GD I NY+ DI +++ A +
Sbjct: 42 SPWQLTRIRDVPEELNKDTVALGDIL-GDPSITECWQFNYLHDIPFVMNAFDKNVRDSVQ 100
Query: 220 L-VIHG-----------ESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYP-R 266
L V+HG S+ L+H N LH P+P FGTHHSK M+L +
Sbjct: 101 LHVVHGFWKRNDLNRVILSEHALQH------PNVHLHCAPMPEMFGTHHSKMMILFHSDN 154
Query: 267 GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLK----------DQNNLSEECGFENDLIDY 315
+I++HTAN+I DW N + +W P + Q F+ DL+ Y
Sbjct: 155 TAQIVIHTANMIPKDWTNMTNAVWRSPKLPWRWELDPRLQQAQQAPFGSGIRFKADLLAY 214
Query: 316 LSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWGHMK 373
L +++ + +N F+FSS LIASVPG + +S WG
Sbjct: 215 L--MQYDSHRVTCKQLVDRLVN------FDFSSIRAALIASVPGRYNLYDTSSPAWGWTA 266
Query: 374 LRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK--WMAE-LSSSMSSGFSED-KTPLGIGE 429
L+ LQ E G +S +V Q SS+ +L K W+ + L +S+++ ++D K P +
Sbjct: 267 LKRCLQTVPVETG--ESQIVVQISSIATLGAKDDWLQKILFNSLATSRNQDTKKP----D 320
Query: 430 PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK----DFLKKYWAKWK------------ 473
+V+PT +++R SL+GYA+G +I + K+ +L W
Sbjct: 321 FKVVFPTADEIRNSLDGYASGQSIHTKIKSAQHIRQLHYLHPMLHHWANDSADGVGLLEQ 380
Query: 474 ---ASHTGRSRAMPHIKTFARYN-GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYEL 529
+ +GR+RA PHIKT+ R+N + W +LTSAN+SK AWG + ++ I S+E+
Sbjct: 381 PPISGDSGRNRAAPHIKTYTRFNQNNSIDWAMLTSANMSKQAWGEAPSSTGEVRIASWEV 440
Query: 530 GVLILPSAKRHGCGFSCTSNIVPSEI 555
GVL+ P G C + ++ S I
Sbjct: 441 GVLVWP-------GLLCENGVMVSSI 459
>gi|440634212|gb|ELR04131.1| hypothetical protein GMDG_01435 [Geomyces destructans 20631-21]
Length = 570
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 151/532 (28%), Positives = 239/532 (44%), Gaps = 99/532 (18%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIP 217
+ S F+L R++ P N VS+ +++ +I + NYM D+D+L+ P
Sbjct: 69 ISSPFKLTRIRDSPGSLNNGSVSLGEIVCDPMIREMWQFNYMHDLDFLMSNMDPDTKDTV 128
Query: 218 HVLVIHG--ESDGTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRIIV 272
+ V+HG + + L HMK K N L +P FGTHH+K M+L+ + +II+
Sbjct: 129 KIHVVHGYWKQESGL-HMKSQALKYPNVHLRCAYMPEIFGTHHTKMMVLLRHDDQAQIII 187
Query: 273 HTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEEC-GFENDLIDYLSTLKWP-EFSANLP 329
HTAN+I DW N SQ W PL L+++ + Y S L++ +F L
Sbjct: 188 HTANMIPQDWANLSQDAWTSPLLPLLPAEKLADQTLARGSKSASYGSGLRFKLDFLGYLK 247
Query: 330 AHGNFK--INPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK--WGHMKLRTVLQECTF 383
A+ + + P K++FSS L+ VPG H S +G +R +L
Sbjct: 248 AYDSRRTICKPLIEELLKYDFSSIRGALVGHVPGRHHVESDNPTLFGWSAIRAILNTIPV 307
Query: 384 EKGFKKSPLVYQFSSLGSL--DEKWMAE--LSSSMSSGFSEDKTP-LGIGEPLIVWPTVE 438
G K +V Q SS+ +L ++W+ + ++ +S S KTP LG IV+PT +
Sbjct: 308 HNG-DKPEVVAQVSSIATLGVTDQWLQKTLFAALSASSNSPSKTPKLG-----IVFPTPD 361
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKD----FLKKYWAKWKASH------------------ 476
++R SL+GY +G +I + V ++ +LK + W +
Sbjct: 362 EIRKSLDGYNSGGSIHVRIQTVAQEKQLQYLKPLFYHWAGDNRPVSPPSTSSPGPSTVAS 421
Query: 477 ---------------------TGRSRAMPHIKTFARYNGQ---KLAWFLLTSANLSKAAW 512
GR+RA PHIKT+ R+ + ++ W L+TSANLSK AW
Sbjct: 422 TVREAWQNRAGPSAVASTVREAGRNRAAPHIKTYIRFADEAKTRIDWALVTSANLSKQAW 481
Query: 513 GALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVT 572
G + I SYELGVL+ PS ++ + +VP T Q + K
Sbjct: 482 GERLNAAGDVRICSYELGVLVSPSM------YAEDAVMVP---------TFQTDRPK--- 523
Query: 573 LTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+A + +PY+LP RY +++ PW K Y + D G+ +
Sbjct: 524 -------EAVDGKITIGCRMPYDLPLVRYGADEEPWCATKAYEELDWMGRSY 568
>gi|67484562|ref|XP_657501.1| tyrosyl-DNA phosphodiesterase [Entamoeba histolytica HM-1:IMSS]
gi|56474754|gb|EAL52111.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449702140|gb|EMD42834.1| tyrosylDNA phosphodiesterase, putative [Entamoeba histolytica KU27]
Length = 402
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 117/404 (28%), Positives = 198/404 (49%), Gaps = 41/404 (10%)
Query: 164 FRLLRVQGLPAWA-NTSCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +++ P+ +S+ D+ G+I L+ ++ D+ WL P+L KIP V
Sbjct: 6 FHLNKLELTPSLMKEKDTISLHDIFNTPGEIYSVTLTTFVFDLQWLFDELPILTKIP-VQ 64
Query: 221 VIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 279
IH +GTL + + + +P+ G HH K M+++Y G+R ++ TANLI
Sbjct: 65 FIH---NGTLNYFDQLLIQEYKDFETFSVPLKKGCHHVKIMIILYEGGLRFVLSTANLIP 121
Query: 280 VDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
+D+N KSQG++++DF + + + E G +L+TL+ S N + S
Sbjct: 122 LDYNLKSQGIYIKDFKPSESSTILNEKG-----THFLTTLQSYFTSVN--------VTIS 168
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
+ F++S+ L+ S+PG H G+ L K+G ++ +L + + Q SSL
Sbjct: 169 YLSDFDYSTIDGWLLLSIPGIHKGNDLNKYGMKQVYDILNNKLHVQFNNHCTIAAQASSL 228
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN 459
G ++ ELS +++ E K I+WPT + +R S GY G+ + N
Sbjct: 229 GLFTNQYRRELSLCLTNQ-PESKFQ-------IIWPTEDFIRTSETGY-HGSCSFFLRSN 279
Query: 460 VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNN 519
K + + Y+ K+ R PHIKT+ Y + +LTS+N+S AAWG + N
Sbjct: 280 FVKTW-ENYFYKFLPPFP-RHLIQPHIKTYVIYEEDIPKYGILTSSNISGAAWG--KPTN 335
Query: 520 SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 563
S L I +YE+G+L + + F+ T +P +IK + +S
Sbjct: 336 SSLEINNYEMGMLFIDN-------FTLTRFPLPYDIKQSTKYSS 372
>gi|425771231|gb|EKV09680.1| Tyrosyl-DNA phosphodiesterase, putative [Penicillium digitatum Pd1]
gi|425776784|gb|EKV14988.1| Tyrosyl-DNA phosphodiesterase, putative [Penicillium digitatum
PHI26]
Length = 900
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 141/523 (26%), Positives = 232/523 (44%), Gaps = 81/523 (15%)
Query: 162 STFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHV 219
S +L ++ LP N V +RD++ +I N++ D+D+L+ + + V
Sbjct: 397 SPVQLTHIRDLPDGNNVDAVRLRDILGDPMIRECWQFNFIFDVDFLMAHFDEDVRSLVKV 456
Query: 220 LVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRII 271
V+HG E + E R I+ P P FGTHHSK M+L+ + +++
Sbjct: 457 KVVHGSWRREDSNRIRVEEACSRYPNVEPIVAYMPEP--FGTHHSKMMILLRHDDLAQVV 514
Query: 272 VHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEECG--------FENDLIDYLSTLKWP 322
+HTAN+IH+DW N +Q W+ PL+ ++ F+ DL+ YL
Sbjct: 515 IHTANMIHMDWTNMTQAAWLSPLLPLQKATSVESPTDAKVGSGARFKRDLLAYLK----- 569
Query: 323 EFSANLPAHGNFKINPSFFKKFNFSSAAVR--LIASVPGYHTGSSLKK-----WGHMKLR 375
A+G K P + N+ +R LIASVP S WG ++
Sbjct: 570 -------AYGPKKTGPLVQQLDNYDFCPIRAALIASVPSKKHASDSSSDEETLWGWPAVK 622
Query: 376 TVLQECTFEK--GFKKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDKTPLGIGEPL 431
++ + ++ KK +V Q SS+ +L + KW+ ++ F + TP +P
Sbjct: 623 DLMGQVPIQQKNTSKKPHIVIQTSSVATLGQTNKWLKDV-------FFKALTPTHSPQPT 675
Query: 432 --IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKAS---------- 475
I++PT +++R SL GY +G +I S + ++ Y +W
Sbjct: 676 YSIIFPTPDEIRRSLNGYNSGVSIHMKIQSAAQQKQLQYMSPYLCQWAGDSLPPGQCIDL 735
Query: 476 --------HTGRSRAMPHIKTFARY---NGQKLAWFLLTSANLSKAAWGALQKNNSQLMI 524
GR+RA PHIKT+ R+ + + + W +++SANLS AWGA + ++ I
Sbjct: 736 SEDNPPKREAGRARAAPHIKTYIRFADSDMKTIDWAMVSSANLSTQAWGAATNASGEVRI 795
Query: 525 RSYELGVLILPSAKRH-GCGFSCTSNIVPSEIKS-GSTETSQIQKTKLVTLTWHGSSD-A 581
S+E+GV++ P R GC + + + SE ++ G + SD A
Sbjct: 796 CSWEIGVVVWPELFRDGGCDDAASPSASESESRAEGKPPAPDVLMVPCFKRDRPVVSDGA 855
Query: 582 GASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+S VV +PY+LP Y + D PW + D GQ W
Sbjct: 856 ETASMVVGFRMPYDLPLTPYGAGDEPWCATASHALPDWQGQSW 898
>gi|167389207|ref|XP_001738862.1| tyrosyl-DNA phosphodiesterase [Entamoeba dispar SAW760]
gi|165897690|gb|EDR24772.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba dispar SAW760]
Length = 721
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 187/377 (49%), Gaps = 38/377 (10%)
Query: 164 FRLLRVQGLPAWA-NTSCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +++ P+ +S+ D+ G+I +L+ ++ D+ WL P+L ++P V
Sbjct: 6 FHLNKLELTPSLMKEKDTISLHDLFNTPGEIYSVVLTTFVFDLQWLFNELPILTRVP-VQ 64
Query: 221 VIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
IH + + + + ++ P+P+ G HH K M+++Y G+R ++ TANLI +
Sbjct: 65 FIHNGNLSCFDQLLIQQYKDF--QTFPIPLKKGCHHVKIMIMLYEGGLRFVLSTANLIPI 122
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF 340
D+N KSQG++++DF + + + E G +L+TL+ N A N + S+
Sbjct: 123 DYNLKSQGIYVKDFKPSESSTVLNEKG-----THFLTTLQ------NYLASVN--VTVSY 169
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
F++S+ L+ S+PG H G+ L K+G ++ +L + + Q SSLG
Sbjct: 170 LSDFDYSTIDGWLLLSIPGIHKGNDLNKYGMKQVHDILNMKLHVQFNNHCTIAAQASSLG 229
Query: 401 SLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV 460
++ ELS +++ E K I+WPT + +R S GY + +
Sbjct: 230 LFTSQYRRELSLCLTNQ-PESKFQ-------IIWPTEDFIRTSETGYHGSCSF-----FL 276
Query: 461 DKDFLK---KYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQK 517
+F+K Y+ K+ R PHIKT+ Y + +LTS+N+S AAWG +
Sbjct: 277 RSNFVKTWENYFYKFLPPFP-RHLIQPHIKTYVIYEEDIPKYGILTSSNISGAAWG--KP 333
Query: 518 NNSQLMIRSYELGVLIL 534
NS L I +YE+G+L +
Sbjct: 334 TNSTLEINNYEIGMLFI 350
>gi|189242173|ref|XP_970490.2| PREDICTED: similar to tyrosyl-dna phosphodiesterase [Tribolium
castaneum]
Length = 358
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 116/377 (30%), Positives = 173/377 (45%), Gaps = 63/377 (16%)
Query: 252 FGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNNLSEE- 305
FG HHSK + Y +R+++ TANL + DWN+ +QGLW+ P E
Sbjct: 23 FGCHHSKIGIYFYEDNSLRVVISTANLYYEDWNHYNQGLWLSPPCPQLPETATEKSGESP 82
Query: 306 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 365
GF++ L++YL NLP K + K+ +FS+ V L+ SVPG H +
Sbjct: 83 TGFKSSLLNYLKHY-------NLPV---LKPWIDYVKRADFSAVRVFLVTSVPGKHYPGT 132
Query: 366 LKKWGHMKLRTVLQECTF-------EKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGF 418
H + + C+ +G ++ Q SS+GS+ + L S++
Sbjct: 133 QGSHVHHVGDLLSRHCSLPAKTGPDSEGPLSWGIIAQASSIGSMGKSPAEWLRSTLLRSL 192
Query: 419 SEDKTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWK 473
S K + I++P+V++V G +G +P S Q N + +L+ Y +WK
Sbjct: 193 SGHKQTQLVSNSNATLSIIFPSVDNVMNGYFGAESGGCLPYSKQTNEKQRWLQSYLHQWK 252
Query: 474 ASHTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
A GRSRAMPHIKT+ R + KLAWF +TSANLSK+AWG + + +RSYE GV
Sbjct: 253 ADKLGRSRAMPHIKTYCRVSPCLSKLAWFFITSANLSKSAWGGNLQKDKGAYVRSYEAGV 312
Query: 532 LILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLP 591
+ LP K E +I+ T +G + ++ P
Sbjct: 313 MFLP--------------------KFFDEEYFEIETTL-----------SGKNKKL--FP 339
Query: 592 VPYELPPQRYSSEDVPW 608
Y+LP Y S D PW
Sbjct: 340 FMYDLPLTEYKSSDYPW 356
>gi|67539466|ref|XP_663507.1| hypothetical protein AN5903.2 [Aspergillus nidulans FGSC A4]
gi|40738576|gb|EAA57766.1| hypothetical protein AN5903.2 [Aspergillus nidulans FGSC A4]
gi|259479929|tpe|CBF70601.1| TPA: tyrosyl-DNA phosphodiesterase, putative (AFU_orthologue;
AFUA_2G11070) [Aspergillus nidulans FGSC A4]
Length = 586
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 141/508 (27%), Positives = 228/508 (44%), Gaps = 92/508 (18%)
Query: 177 NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIPHVLVIHGESDGTLEHMK 234
N V +RD++ +I NY D+D+L+ + + V V+HG E+
Sbjct: 95 NDDTVKLRDILGDPLIRECWQFNYCFDVDFLMDQFDEDVRNLVRVKVVHGSWKKDSENRV 154
Query: 235 RNKPANWILHKPP--------LPISFGTHHSKAMLLI-YPRGVRIIVHTANLIHVDWNNK 285
R + A + P +P FGTHHSK M+L+ + ++++HTAN++ DW +
Sbjct: 155 RIEKA---CQRYPNVEPIVAYMPEPFGTHHSKMMILLRHDDFAQVVIHTANMLAGDWGDM 211
Query: 286 SQGLWMQDF-PL----KDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINP 338
Q +W PL +D+N+ + G F+ DL+ YL A+G K P
Sbjct: 212 CQAIWRSPLLPLTDGHEDKNSTAWGTGARFKRDLLAYLK------------AYGVKKTGP 259
Query: 339 SF--FKKFNFSSAAVRLIASVPGYHT-------GSSLKKWGHMKLRTVLQECTFEK---- 385
K++FS+ LIASVP G+S KWG L+ L+ +
Sbjct: 260 LVEQLGKYDFSAVRAALIASVPSKQKVDASSIDGNSKTKWGWPALKEALRNVPLRENVGA 319
Query: 386 -GFKKSP-LVYQFSSLGSLDE--KWMAELS-SSMSSGFSEDKTPLGIGEPLIVWPTVEDV 440
G P +V Q SS+ +L + KW+ ++ +++++ S KT +++PT E++
Sbjct: 320 DGTATVPHIVTQISSIATLGQTDKWLKDVFFNALAASSSSTKTRPRYS---VIFPTAEEI 376
Query: 441 RCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKW----------KASHTGRSRAMPHI 486
R SL+GY G +I S + +L+ Y W + GR RA PHI
Sbjct: 377 RRSLKGYGYGGSIHMKLQSAAQKKQLQYLRPYLCHWAGDVSGQAPKRLQDAGRRRAAPHI 436
Query: 487 KTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS------- 536
KT+ R+ Q + W L+TSANLS AWGA ++ + S+E+GVL+ P
Sbjct: 437 KTYIRFADQHMRSIDWALVTSANLSTQAWGAAANAAGEVRVCSWEIGVLVWPELLTTEPQ 496
Query: 537 AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYEL 596
+R S + +VP K +S++ A + ++ +PY+L
Sbjct: 497 GQRKHQQQSRSVAMVPCFKKDKPDPSSKVGN--------------AAPAALIGFRMPYDL 542
Query: 597 PPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
P YS++D PW + + D GQ W
Sbjct: 543 PLTPYSTQDEPWCATMSHIEPDWLGQTW 570
>gi|451851539|gb|EMD64837.1| hypothetical protein COCSADRAFT_36213 [Cochliobolus sativus ND90Pr]
Length = 610
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 144/538 (26%), Positives = 229/538 (42%), Gaps = 104/538 (19%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIP 217
+PS RL R++ LP N V + D++ +I + NY+ D+D+++ + +
Sbjct: 103 IPSPVRLTRIEKLPKEKNVDTVGLTDLLGDPLIKECWNFNYLFDLDFIMQHFDRDIRDMV 162
Query: 218 HVLVIHGESDGT-------LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
V ++HG G LE +R N L +P FGTHHSK ++L + +
Sbjct: 163 KVKIVHGFWRGDDKNRIALLETAERY--PNIELISAYIPDPFGTHHSKMLILFRHDDTAQ 220
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDF-PL-----KDQNNLSE--ECG----FENDLIDYL- 316
+++HTAN+IH DW N +Q +W PL +Q+N S+ G F+ DL+ YL
Sbjct: 221 VVIHTANMIHRDWANMTQAVWASPLLPLLRHTTSEQSNSSKIHSIGSGERFKVDLLRYLY 280
Query: 317 ----------STLKWPEFS-----------------ANLPAHGNF------KINPSFFKK 343
S LK+ +FS A P+H F +I S K
Sbjct: 281 AYGMRLGALTSQLKYYDFSSIRAAFLGSAPSKQKLTAAGPSHTAFGWLGLDQILSSIPVK 340
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD 403
+ S ++ + T + W +++L C K +K F+ L
Sbjct: 341 ASGDSLRPHIVTQISSVATLGATPTW-LFHFQSILSRCPDAKDTEKEEASSSFTKASMLF 399
Query: 404 EKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKN 459
K + + + FS +V+PT ++R L+GY AG +I S Q+
Sbjct: 400 TKQESNAAEAPEPKFS------------VVFPTPAEIRMPLDGYTAGGSIHWKFQSVQQQ 447
Query: 460 VDKDFLKKYWAKW--------KASHTGRSRAMPHIKTFARYNGQK---LAWFLLTSANLS 508
+++ W R A PHIKT+ R++ + + W LLTSANLS
Sbjct: 448 KQLEYMHPILCHWTPVSRPDPSQQEAHRGTAAPHIKTYIRFSDETHTTIDWALLTSANLS 507
Query: 509 KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKT 568
K AWG + N ++ ++S+E GV++ P+ F +S +VP + + ET +
Sbjct: 508 KQAWGDVMNKNEEIRVQSWETGVVMWPAL---FAEFEHSSTMVPV-FGADNPETGK---- 559
Query: 569 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPR 626
HG G VV +PY LP YS+++ PW Y + D YG W R
Sbjct: 560 -------HGE---GKRETVVGFRMPYNLPLVPYSADERPWCATLAYEEPDRYGLTWAR 607
>gi|407035177|gb|EKE37579.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba nuttalli P19]
Length = 402
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 116/404 (28%), Positives = 198/404 (49%), Gaps = 41/404 (10%)
Query: 164 FRLLRVQGLPAWA-NTSCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
F L +++ P+ VS+ D+ G+I L+ ++ D+ WL P+L +IP V
Sbjct: 6 FHLNKLELTPSLMKEKDTVSLHDIFNTPGEIYSVTLTTFVFDLQWLFDELPILTRIP-VQ 64
Query: 221 VIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 279
+H +GTL + + + +P+ G HH K M+++Y G+R ++ TANLI
Sbjct: 65 FVH---NGTLNYFDQLLIQEYKDFETFSVPLKKGCHHVKIMIILYEGGLRFVLSTANLIP 121
Query: 280 VDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
+D+N KSQG++++DF + + + E G +L+TL+ S N + S
Sbjct: 122 LDYNLKSQGIYIKDFKPSESSTVLNEKG-----AHFLTTLQSYFTSVN--------VTIS 168
Query: 340 FFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
+ F++S+ L+ S+PG H G+ L K+G ++ +L + + Q SSL
Sbjct: 169 YLSDFDYSTIDGWLLLSIPGTHKGNDLNKYGMKQVYDILNNKLHVQFTNHCTIAAQASSL 228
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN 459
G ++ ELS +++ E K I+WPT + +R S GY G+ + N
Sbjct: 229 GLFTNQYRRELSLCLTNQ-PESKFQ-------IIWPTEDFIRTSETGY-HGSCSFFLRSN 279
Query: 460 VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNN 519
K + + Y+ K+ R PHIKT+ Y + +LTS+N+S AAWG + N
Sbjct: 280 FVKTW-ENYFYKFLPPFP-RHLIQPHIKTYVIYEEDIPKYGILTSSNISGAAWG--KPTN 335
Query: 520 SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETS 563
S L I +YE+G+L + + F+ T +P +IK + +S
Sbjct: 336 STLEINNYEMGMLFIDN-------FTLTRFPLPYDIKQSTKYSS 372
>gi|307211790|gb|EFN87771.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 553
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 198/445 (44%), Gaps = 77/445 (17%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D G I+ ++ N MVD+ WL + + P+++++ + G E + N +
Sbjct: 165 ILDRSLGQIVSSLHLNCMVDVGWLCLQYLLAGQRPNMVILCSQRLGE-EELGDNIT---V 220
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQN 300
+H +P FG HH+K M+L Y G+R++V TANL DW N++QG+W+ P
Sbjct: 221 VHVE-MPFEFGCHHTKVMILQYKDVGIRVVVSTANLYASDWKNRTQGIWISPHLP----- 274
Query: 301 NLSEEC---------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
LSE F+ DL YL++ + P K +K +FS+ V
Sbjct: 275 RLSEAAKWSSGESPTNFKKDLQRYLNSYQNPA----------LKRWRDLVRKADFSAVNV 324
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 410
IAS PG+ + WG+ KL VL Q K ++ Q S++GS K+ L
Sbjct: 325 CFIASTPGHFRRIDVNLWGYKKLANVLSQHVMLPPDAPKWSIIAQSSAVGSFGPKYEGWL 384
Query: 411 SSSMSSGFSE--DKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV--DKDFLK 466
S + + ++ E ++P+V++ S + Y G++ K V + ++K
Sbjct: 385 SKEIVRSMTRETERDLKDYPEFQFIYPSVKNYEQSFD-YQDGSSCFLYMKEVHSKQQWIK 443
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMI 524
Y +WKA +G +AMPHIK++ R + +++AWF+LTSANLSK AWG I
Sbjct: 444 SYLYQWKAK-SGCDQAMPHIKSYTRISPDLKRIAWFVLTSANLSKGAWGV---QRGDYYI 499
Query: 525 RSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGAS 584
+YE+GV LP F T + + I
Sbjct: 500 TNYEVGVAFLPKFITGTTTFPITDEDLTAPI----------------------------- 530
Query: 585 SEVVYLPVPYELPPQRYSSEDVPWS 609
P+PY+ P Y S D P++
Sbjct: 531 -----FPIPYDFPLCPYDSNDSPFT 550
>gi|7648683|gb|AAF65623.1|AF182002_1 tyrosyl-DNA phosphodiesterase protein [Homo sapiens]
Length = 415
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 176/360 (48%), Gaps = 37/360 (10%)
Query: 78 QRKKLSSNEHV----SIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGAT-----NGELS 128
+R+K S E + S +D ++ +P K V NDG +G +
Sbjct: 70 KRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHGAPA 129
Query: 129 SKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQ 188
++++++++ +G+ + + + P F L RV G+ N+ + I+D++
Sbjct: 130 CHRLKEEEDEYETSGEGQD----IWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILS 185
Query: 189 ---GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK-RNKP-ANWIL 243
G ++ + NY D+DWL+ P + +L++HG+ H+ + KP N L
Sbjct: 186 PLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISL 245
Query: 244 HKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FP-LKDQNN 301
+ L I+FGTHH+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ +P + D +
Sbjct: 246 CQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTH 305
Query: 302 LSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
S E F+ DLI YL P + K + S V LI S PG
Sbjct: 306 KSGESPTHFKADLISYLMAYNAPSLKEWI----------DVIHKHDLSETNVYLIGSTPG 355
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSM 414
GS WGH +L+ +L++ +S P+V QFSS+GSL + KW+ +E SM
Sbjct: 356 RFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVGSLGADESKWLCSEFKESM 415
>gi|240276539|gb|EER40051.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus H143]
Length = 685
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 136/479 (28%), Positives = 207/479 (43%), Gaps = 112/479 (23%)
Query: 152 NFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLP 208
N +S +PS +L ++ A + NT V +RD++ +I NYM D+D+L+
Sbjct: 61 NAPISSRIIPSPIQLTHIRDFSASSGYNTDSVKLRDILGDPLIKECWQFNYMFDVDFLMS 120
Query: 209 ACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAM 260
+ + V +IHG ES + E +R I+ P P FGTHHSK M
Sbjct: 121 QFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP--FGTHHSKMM 178
Query: 261 LLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECGF 308
+LI + ++++HTAN+I DW N Q +W P++ + + + F
Sbjct: 179 ILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGHASATLDGVGRGNRF 238
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSL 366
+ DL+ YL A+GN K P +K++F + LIASVP L
Sbjct: 239 KRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASVPTRQAIDEL 286
Query: 367 KK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KWMAEL----- 410
WG L+ +Q+ G KK ++ Q SS+ +L + KW+ E
Sbjct: 287 DSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKWLKETFFAAL 346
Query: 411 --SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDK 462
S +S KT P I++PT +++R SL GYA+G +I S +
Sbjct: 347 SPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKLQSAAQRKQL 403
Query: 463 DFLKKYWAKW----------KASHT---------------------------------GR 479
++L+ Y +W A H+ GR
Sbjct: 404 EYLRPYLCRWAGDTGDGSDISAKHSINSGQERCDSKDANESVRKYVTTGKNSQPIRNAGR 463
Query: 480 SRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
RA PHIKT+ R++ LA W ++TSANLS AWGA ++ I S+E+GVLI P
Sbjct: 464 RRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEIRICSWEIGVLIWP 522
>gi|325092032|gb|EGC45342.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus H88]
Length = 682
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 136/479 (28%), Positives = 207/479 (43%), Gaps = 112/479 (23%)
Query: 152 NFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLP 208
N +S +PS +L ++ A + NT V +RD++ +I NYM D+D+L+
Sbjct: 61 NAPISSRIIPSPIQLTHIRDFSASSGYNTDSVKLRDILGDPLIKECWQFNYMFDVDFLMS 120
Query: 209 ACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAM 260
+ + V +IHG ES + E +R I+ P P FGTHHSK M
Sbjct: 121 QFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP--FGTHHSKMM 178
Query: 261 LLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECGF 308
+LI + ++++HTAN+I DW N Q +W P++ + + + F
Sbjct: 179 ILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGHASATLDGVGRGNRF 238
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSL 366
+ DL+ YL A+GN K P +K++F + LIASVP L
Sbjct: 239 KRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASVPTRQAIDEL 286
Query: 367 KK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KWMAEL----- 410
WG L+ +Q+ G KK ++ Q SS+ +L + KW+ E
Sbjct: 287 DSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKWLKETFFAAL 346
Query: 411 --SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDK 462
S +S KT P I++PT +++R SL GYA+G +I S +
Sbjct: 347 SPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKLQSAAQRKQL 403
Query: 463 DFLKKYWAKW----------KASHT---------------------------------GR 479
++L+ Y +W A H+ GR
Sbjct: 404 EYLRPYLCRWAGDTGDGSDISAKHSINSGQERRDSKDANESVRKYVTTGKNSQPIRNAGR 463
Query: 480 SRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
RA PHIKT+ R++ LA W ++TSANLS AWGA ++ I S+E+GVLI P
Sbjct: 464 RRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEIRICSWEIGVLIWP 522
>gi|225555717|gb|EEH04008.1| tyrosyl-DNA phosphodiesterase [Ajellomyces capsulatus G186AR]
Length = 637
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 134/484 (27%), Positives = 206/484 (42%), Gaps = 122/484 (25%)
Query: 152 NFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLP 208
N +S +PS +L ++ A + NT V +RD++ +I NYM D+D+L+
Sbjct: 61 NAPISSRIIPSPIQLTHIRDFAASSGYNTDSVKLRDILGDPLIKECWQFNYMFDVDFLMS 120
Query: 209 ACPV-LAKIPHVLVIHGESDGTLEHMKRNKP----ANWILHKPP--------LPISFGTH 255
+ + V +IHG KR P + H+ P +P FGTH
Sbjct: 121 QFDEDVRDLVKVKIIHGS-------WKRESPNRIRVDEACHRYPNVEPIVAYMPEPFGTH 173
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLS 303
HSK M+LI + ++++HTAN+I DW N Q +W P++ + + +
Sbjct: 174 HSKMMILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMESGHASATLDGVG 233
Query: 304 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYH 361
F+ DL+ YL A+GN K P +K++F + LIASVP
Sbjct: 234 RGNRFKRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASVPTRQ 281
Query: 362 TGSSLKK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KWMAEL 410
L WG L+ +Q+ G KK ++ Q SS+ +L + KW+ E
Sbjct: 282 AIDELDSEKQTLWGWPALKDTIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKWLKET 341
Query: 411 -------SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQ 457
S +S KT P I++PT +++R SL GYA+G +I S
Sbjct: 342 FFAALSPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKLQSAA 398
Query: 458 KNVDKDFLKKYWAKW----------KASHT------------------------------ 477
+ ++L+ Y +W A H+
Sbjct: 399 QRKQLEYLRPYLCRWASDTGDGSDISAKHSINSGQERCESKNVNESVQKCVATSKNSQPI 458
Query: 478 ---GRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
GR RA PHIKT+ R++ LA W ++TSANLS AWGA ++ I S+E+GV
Sbjct: 459 RNAGRCRAAPHIKTYIRFSDANLATIDWAMVTSANLSVQAWGAAANGKKEIRICSWEIGV 518
Query: 532 LILP 535
L+ P
Sbjct: 519 LVWP 522
>gi|327358116|gb|EGE86973.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis ATCC 18188]
Length = 655
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 147/597 (24%), Positives = 237/597 (39%), Gaps = 147/597 (24%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIV--AILSNYMVDIDWLLPACPV-LA 214
+PS +L ++ A + N V +RD++ GD ++ + NYM D+D+L+ +
Sbjct: 71 IPSPIQLTHIRDFSASSGNNADAVRLRDIL-GDPLIKESWQFNYMFDVDFLMSQFDEDVR 129
Query: 215 KIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPR 266
+ +V ++HG ES + E +R I+ P P FGTHHSK M+LI +
Sbjct: 130 NLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP--FGTHHSKMMILIRHDD 187
Query: 267 GVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECGFENDLIDY 315
++++HTAN+I DW N Q +W P+ + N F+ DLI Y
Sbjct: 188 QAQVVIHTANMIAGDWANMCQAVWRSPLLPMCHEMKRGPGSTASNRFGSGIRFKRDLIAY 247
Query: 316 LSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK----- 368
L A+G K P +K++FS+ L+ASVP L
Sbjct: 248 LE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASVPSRQAIDELDSEKHTL 295
Query: 369 WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDK 422
WG L+ +Q+ KG + +V Q SS+ +L + KW+ E + S
Sbjct: 296 WGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKWLKETFFAALSPSPSRS 355
Query: 423 TPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWA 470
+ G+ +P I++PT +++R SL GYA+G +I S + ++L+ Y
Sbjct: 356 SSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQSSAQRKQLEYLRPYLC 415
Query: 471 KWKAS---------------------------------------------HTGRSRAMPH 485
+W GR RA PH
Sbjct: 416 RWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDKNGQPIRQAGRRRAAPH 475
Query: 486 IKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS------ 536
IKT+ R++ L W +++SANLS AWGA ++ I S+E+GV++ P
Sbjct: 476 IKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICSWEIGVIVWPDLFVNRK 535
Query: 537 --------------AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL--------- 573
G + + ++ K K+ +
Sbjct: 536 VDDDEDDDEDDDDDDDDDDDGSEWKEKGKGKKARENDRRGAREDKNKVAVMLPCFKQDMP 595
Query: 574 TWHGSSDAGAS------SEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
D+G+S + V L +PY+LP Y+ +D PW Y + D GQ W
Sbjct: 596 EVRVDKDSGSSTTTATTTTFVGLRMPYDLPLSPYTPQDQPWCATASYKETDWLGQTW 652
>gi|157875345|ref|XP_001686067.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania major strain Friedlin]
gi|68129140|emb|CAJ06851.1| tyrosyl-DNA phosphodiesterase 1 [Leishmania major strain Friedlin]
Length = 828
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 152/617 (24%), Positives = 243/617 (39%), Gaps = 188/617 (30%)
Query: 179 SCVSIRDVIQGDIIVA-------ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-- 229
S + +RD+ + D+ +LS+Y+ D+ WLL P L+ + LV+ GT
Sbjct: 183 SLLRLRDLFRCDVADPGECWQHILLSSYVTDLRWLLATVPELSAVTGKLVVLSGEKGTAT 242
Query: 230 -------------------------LEHMKRNKPANWILH-----------KPPLPISFG 253
+ ++ LH +PPLP++FG
Sbjct: 243 LRRTTGDPSSPYTAVPPLMDRVNPFMTALREQASGTSPLHTALSRERLAVLEPPLPVAFG 302
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 313
T+H+K L I +G+R+ + TANL+ DW KSQG+++QDFP K S + ++
Sbjct: 303 TYHTKMALCINGKGLRVSIFTANLVEQDWCWKSQGIYVQDFPWKPVTERSNDDSAGTIMV 362
Query: 314 DYLST------------LKWPEFSANLPAH-------------------------GNFKI 336
+ + K EF A+L + G F+
Sbjct: 363 ETAARSTSNSNNGSNTFTKGAEFVAHLRHYLMRCGVSLASACASPADAASAAGPLGIFET 422
Query: 337 NPSFFKKFNFSSAAVRLIASVPG-YHTG--SSLKKWGHMKLRTVLQECTFEKGFKKSP-- 391
+ F +F++AAV L++SVPG Y G + + G +L VL+ +
Sbjct: 423 D--FLSHIDFTAAAVWLVSSVPGTYAHGEVCPVYRVGLCRLGEVLRRSALTTATAPASVD 480
Query: 392 LVYQFSSLGSLDEKWMAELSSSMS----SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
L +Q+SS GSL+ ++ L ++M + P G+ + +V+PT E+VR S EG+
Sbjct: 481 LSWQYSSQGSLNPAFLNSLQAAMCGESVAVIESGDAPRGVRDVQVVYPTEEEVRNSWEGW 540
Query: 448 AAGNAIPSPQKNVDKDFLKKYWAKWKASHTG----------------------------- 478
G ++P + +F+ W +S G
Sbjct: 541 RGGGSLPLCVQCC-HEFVNARLHCWGSSEAGHMAKRAFPRPAKVAAVHASREDAVDVDGV 599
Query: 479 -------------------RSRAMPHIKTFARYNGQK--LAWFLLTSANLSKAAWGAL-- 515
R A+PHIK++A + + WFLLTSANLS+AAWG+L
Sbjct: 600 DSDGGEGTPVSLAGSCAAYRRFALPHIKSYAAVAPDRSCVRWFLLTSANLSQAAWGSLSR 659
Query: 516 ---QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK--SGSTETSQIQKTKL 570
Q + Q ++RSYELGVL + + S S + S+I+ + + + +T L
Sbjct: 660 KVNQHGSRQQLVRSYELGVLYDSHSAIYQSASSWFSVVAKSKIELPNACNSRAMLYETPL 719
Query: 571 VTLTWHGSSDAGASSEVVYLPVPYE-LPPQRYSS-------------------------E 604
G ++ V L PY L P Y+S
Sbjct: 720 -----------GIGTQDVCLYTPYNLLCPTPYASTAALRAHRDAPDKGEQAVAGAALDCS 768
Query: 605 DVPWSWDKRYTKKDVYG 621
DVPW D + +D YG
Sbjct: 769 DVPWVLDMPHRGRDAYG 785
>gi|154273448|ref|XP_001537576.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150416088|gb|EDN11432.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 610
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 133/480 (27%), Positives = 205/480 (42%), Gaps = 112/480 (23%)
Query: 151 CNFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLL 207
N +S +PS +L ++ A + NT V +RD++ +I NYM D+D+L+
Sbjct: 60 VNAPISSRVIPSPIQLTHIRDFSASSGYNTDSVKLRDILGDPLIKECWQFNYMFDVDFLM 119
Query: 208 PACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKA 259
+ + V +IHG ES + E +R I+ P P FGTHHSK
Sbjct: 120 SQFDEDVRDLVKVKIIHGSWKRESPNRIRVDEACRRYPNVEPIVAYMPEP--FGTHHSKM 177
Query: 260 MLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECG 307
M+LI + ++++HTAN+I DW N Q +W P++ + + +
Sbjct: 178 MILIRHDDQAQVVIHTANMIAGDWANMCQAVWRSPLLPMRPEMENGHSYATLDGVRRGNR 237
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSS 365
F+ DL+ YL A+GN K P +K++F + LIASVP
Sbjct: 238 FKRDLLAYLE------------AYGNKKTGPLVDQLEKYDFGAVRAGLIASVPTRQAIDE 285
Query: 366 LKK-----WGHMKLRTVLQECTFEKG----FKKSPLVYQFSSLGSLDE--KWMAEL---- 410
L WG L+ +Q+ G KK ++ Q SS+ +L + KW+ E
Sbjct: 286 LDSEKQTLWGWPALKDAIQQIPLGGGNNTVGKKPQIIIQISSVATLGQTDKWLKETFFAA 345
Query: 411 ---SSSMSSGFSEDKT--PLGIGEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 461
S +S KT P I++PT +++R SL GYA+G +I S +
Sbjct: 346 LSPSRPRASNLFNPKTDPPTKFS---IIFPTPDEIRRSLNGYASGGSIHMKLQSAAQRKQ 402
Query: 462 KDFLKKYWAKWKAS-------------------------------------------HTG 478
++L+ Y +W + G
Sbjct: 403 LEYLRPYLCRWAGDTGDGSDISAKHPINSGQERCDSKDANESVQKYVTTGKNSQPIRNAG 462
Query: 479 RSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
R RA PHIKT+ R++ LA W ++TSANLS AWGA ++ I S+E+GVL+ P
Sbjct: 463 RRRAAPHIKTYIRFSDADLATIDWAMVTSANLSVQAWGAAANGKKEIRICSWEIGVLVWP 522
>gi|332376511|gb|AEE63395.1| unknown [Dendroctonus ponderosae]
Length = 584
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 121/379 (31%), Positives = 182/379 (48%), Gaps = 53/379 (13%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD--GTLEHMKRNKPANWILHKP 246
G + ++ N+M+DI WL+ + L I D +E+M+R P N H
Sbjct: 187 GPLKESLQINFMIDIGWLVKQYKAREQDNKPLTILYGDDWPDMVEYMRRFCP-NVKHHFV 245
Query: 247 PLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEE 305
+ FG HH+K + Y +R++V TANL + DWN+ +QGLW+ K +N +E
Sbjct: 246 KMKDPFGCHHTKLGIYAYEDESIRVVVSTANLYYEDWNHYNQGLWISPRLAKLPSNSAER 305
Query: 306 -----CGFENDLIDYLSTLK------WPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
GF+ L+DYL + + W ++ AN +F V L+
Sbjct: 306 DGEAITGFKGHLLDYLRSYQLPILRDWVKYVANA----------------DFGEVKVALV 349
Query: 355 ASVPGYH----TGSSLKKWGHMKLRTVLQECTF---EKGFKKSPLVY----QFSSLGSLD 403
S PG H GS L + G + + Q C + PL + Q SS+GS+
Sbjct: 350 YSAPGKHYAKQNGSHLHRVGDL----LSQHCVLPAKTTAQSEGPLSWGILAQASSIGSIG 405
Query: 404 EKWMAELSSSM-SSGFSEDKTPL-GIGEPLI--VWPTVEDVRCSLEGYAAGNAIP-SPQK 458
+ L S+ S S ++PL G + I V+P+V +V G +G +P S
Sbjct: 406 KTAAEWLRGSLLRSLASHKQSPLPGNSQATISLVYPSVSNVAHGYFGLESGGCLPYSKAT 465
Query: 459 NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQ 516
N + +L+ Y +W A R+RAMPHIK++ R + KLA+FLLTSANLSK+A G
Sbjct: 466 NEKQRWLQTYMHQWIADARHRTRAMPHIKSYCRVSPGLDKLAYFLLTSANLSKSARGNNI 525
Query: 517 KNNSQLMIRSYELGVLILP 535
+ + IRSYE+GV+ LP
Sbjct: 526 QKDGGCYIRSYEMGVMFLP 544
>gi|302797949|ref|XP_002980735.1| hypothetical protein SELMODRAFT_420273 [Selaginella moellendorffii]
gi|300151741|gb|EFJ18386.1| hypothetical protein SELMODRAFT_420273 [Selaginella moellendorffii]
Length = 197
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 69/148 (46%), Positives = 90/148 (60%), Gaps = 28/148 (18%)
Query: 209 ACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV 268
ACP L IP V++IHGES+ + MLL+YP GV
Sbjct: 71 ACPPLRTIPQVVMIHGESNVS-------------------------QLQSVMLLVYPTGV 105
Query: 269 RIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANL 328
R++VHTANLI++DWNNK+QGLWMQDFP K S+ FENDL+DYL+ L+W + ++
Sbjct: 106 RVVVHTANLINIDWNNKNQGLWMQDFPFKSMTGASD---FENDLVDYLTALEWLGCTVDV 162
Query: 329 PAHGNFKINPSFFKKFNFSSAAVRLIAS 356
HG KIN F+ F+FS+AAVRL+AS
Sbjct: 163 QHHGKMKINVGHFQNFDFSNAAVRLVAS 190
>gi|451995661|gb|EMD88129.1| hypothetical protein COCHEDRAFT_1227354 [Cochliobolus
heterostrophus C5]
Length = 571
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 142/536 (26%), Positives = 231/536 (43%), Gaps = 103/536 (19%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACP-VLAKIP 217
+PS +L +++ LP N V + D++ +I + NY+ D+D+++ + K+
Sbjct: 63 IPSPVQLTQIEKLPREKNVDTVCLSDLLGDPLINECWNFNYLFDLDFVMQHFDWDVRKMV 122
Query: 218 HVLVIHGESDG------TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVRI 270
+ ++HG G TL P N L +P FGTHHSK ++L Y +I
Sbjct: 123 RIKIVHGFWRGDDKNRMTLLEAAEEYP-NIELISAYIPDPFGTHHSKMLILFRYDDTAQI 181
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG------------FENDLIDYLST 318
I+HTAN+I DW N +Q +W+ ++ SEE F+ DL+ YL
Sbjct: 182 IIHTANMIRRDWANMTQAVWVSPLLPLLRHTTSEESKSTSIHSIGSGERFKVDLLRYLY- 240
Query: 319 LKWPEFSANLPAHGN-FKINPSFFKKFNFSSAAVRLIASVPGYHTGS----SLKKWGHMK 373
A+G + S K +NFS + S P S S +G +
Sbjct: 241 -----------AYGKGTRALTSQLKHYNFSGIRAAFLGSAPSRQKPSAASPSHTAFGWLG 289
Query: 374 LRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLDEK--WMAELSSSMS-------------- 415
L +L + + +V Q SS+ +L W+ S +S
Sbjct: 290 LDQILSGIPAKASEDSSRPHVVTQISSVATLGATPTWLFHFQSILSRCSNVNDSEKEEAS 349
Query: 416 SGFSEDKT--------PLGIGEPL--IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 461
S F+E T +G EP +V+PT +++R SL+GY++G +I S Q+
Sbjct: 350 SSFTEACTLSIQQKTNTVGAPEPKFSVVFPTPDEIRMSLDGYSSGGSIHWKFESAQQQKQ 409
Query: 462 KDFLKKYWAKW----------KASHTGRSRAMPHIKTFARYNGQK---LAWFLLTSANLS 508
+++ W + +H RS A PHIKT+ R++ + + W LLTS+NLS
Sbjct: 410 LEYMHPILCHWAPVSQPDQPQRKAH--RSTAAPHIKTYIRFSDETHTTIDWALLTSSNLS 467
Query: 509 KAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKT 568
K AWG + N ++ I+S+E GV++ P+ +S I+ + E +
Sbjct: 468 KQAWGDVANKNDEIRIQSWETGVVLWPAL---FAEHEHSSTIMVPVFGIDNPEADSTYEA 524
Query: 569 KLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
K T VV +PY LP YS+++ PW + + D YG+ W
Sbjct: 525 KKGT--------------VVGFRMPYNLPLVPYSADERPWCATMAHKEPDRYGRTW 566
>gi|189210395|ref|XP_001941529.1| tyrosyl-DNA phosphodiesterase 1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977622|gb|EDU44248.1| tyrosyl-DNA phosphodiesterase 1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 624
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 136/548 (24%), Positives = 234/548 (42%), Gaps = 109/548 (19%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIP 217
+PS +L R++ L N V + D++ +I + N++ D+D+++ + +
Sbjct: 100 IPSPIQLTRIEKLSDHQNVDTVGLADLLGDPLIKECWNFNFLFDLDFVMQHLDRDVRDMV 159
Query: 218 HVLVIHG---ESDGT----LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
V ++HG D LE +R N L +P FGTHHSK ++L + +
Sbjct: 160 KVKIVHGFWKRDDANRISLLETAERY--PNIELLSAYIPDPFGTHHSKMLILFRHDDTAQ 217
Query: 270 IIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQ--NNLSEECG---------FENDLIDYLS 317
+++HTAN+IH DW N +Q +W P+ Q +LS+ F++DL+ Y+
Sbjct: 218 VVIHTANMIHRDWANMTQAVWASPQLPMLSQASQSLSDSDKTYPIGSGQRFKSDLLRYIG 277
Query: 318 TLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH----TGSSLKKWGHMK 373
+ K + ++FSS I S P SS +G +
Sbjct: 278 AYE-----------KRLKGLAAQLGDYDFSSIRAAFIGSAPSRQKPERAVSSNNSFGWLG 326
Query: 374 LRTVLQECTFEKGFKKSP--LVYQFSSLGSLDEK--WM--------------------AE 409
L+ +L K SP +V Q SS+ +L W+ A
Sbjct: 327 LKEILSTVPISKARASSPPHIVAQVSSIATLGAAPTWLSNFQSVLSSHSKATVSVPENAT 386
Query: 410 LSSSMSSGFSEDKTPLGIGEP---LIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDK 462
+SS+ +S F++ T + I++PT E++R SL GY +G +I S Q+
Sbjct: 387 VSSTKASTFTKRDTSVTKAPSPKFSIIFPTPEEIRNSLNGYGSGGSIHWKLQSAQQQKQL 446
Query: 463 DFLKKYWAKWKA--------------SHTGRSRAMPHIKTFARYNGQK---LAWFLLTSA 505
+++ W + R A PHIKT+ R++ ++ + W +LTSA
Sbjct: 447 EYMHPMLCHWTSTPSASASSLTNVSKQEAHRGPAAPHIKTYIRFSDEEQKAIDWAMLTSA 506
Query: 506 NLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVP--------SEIKS 557
N SK AWG ++ I+S+E GV++ P+ ++VP E
Sbjct: 507 NFSKQAWGDTVNKKEEIWIQSWETGVVVWPALFAETAKGVNEVSMVPVFGKDMPKVEDAR 566
Query: 558 GSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK 617
+T+ ++ +T++ T V L +PY+LP + Y++++ PW YT+
Sbjct: 567 VNTKGKEVGETRIKT--------------TVGLRMPYDLPLKPYTADEKPWCATMAYTEP 612
Query: 618 DVYGQVWP 625
D G WP
Sbjct: 613 DRNGHFWP 620
>gi|261191861|ref|XP_002622338.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis SLH14081]
gi|239589654|gb|EEQ72297.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis SLH14081]
Length = 653
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 128/473 (27%), Positives = 204/473 (43%), Gaps = 112/473 (23%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIV--AILSNYMVDIDWLLPACPV-LA 214
+PS +L ++ A + N V +RD++ GD ++ + NYM D+D+L+ +
Sbjct: 71 IPSPIQLTHIRDFSASSGNNADAVRLRDIL-GDPLIKESWQFNYMFDVDFLMSQFDEDVR 129
Query: 215 KIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPR 266
+ +V ++HG ES + E +R I+ P P FGTHHSK M+LI +
Sbjct: 130 NLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP--FGTHHSKMMILIRHDD 187
Query: 267 GVRIIVHTANLIHVDWNNKSQGLW----------MQDFPLKDQNN-LSEECGFENDLIDY 315
V++++HTAN+I DW N Q +W M+ P +N F+ DLI Y
Sbjct: 188 QVQVVIHTANMIAGDWANMCQAVWRSPLLPMCPEMEHGPGSTASNRFGSGIRFKRDLIAY 247
Query: 316 LSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK----- 368
L A+G K P +K++FS+ L+ASVP L
Sbjct: 248 LE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASVPSRQAIDELDSEKHTL 295
Query: 369 WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDK 422
WG L+ +Q+ KG + +V Q SS+ +L + KW+ E + S
Sbjct: 296 WGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKWLKETFFAALSPSPSRS 355
Query: 423 TPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWA 470
+ G+ +P I++PT +++R SL GYA+G +I S + ++L+ Y
Sbjct: 356 SSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQSSAQRKQLEYLRPYLC 415
Query: 471 KWKAS---------------------------------------------HTGRSRAMPH 485
+W GR RA PH
Sbjct: 416 RWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDKNGQPIRQAGRRRAAPH 475
Query: 486 IKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
IKT+ R++ L W +++SANLS AWGA ++ I S+E+GV++ P
Sbjct: 476 IKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICSWEIGVIVWP 528
>gi|19075361|ref|NP_587861.1| tyrosyl-DNA phosphodiesterase Tdp1 [Schizosaccharomyces pombe
972h-]
gi|74625832|sp|Q9USG9.1|TYDP1_SCHPO RecName: Full=Probable tyrosyl-DNA phosphodiesterase; Short=Tyr-DNA
phosphodiesterase
gi|6066756|emb|CAB58371.1| tyrosyl-DNA phosphodiesterase Tdp1 [Schizosaccharomyces pombe]
Length = 536
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 142/544 (26%), Positives = 226/544 (41%), Gaps = 100/544 (18%)
Query: 156 SRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC---- 210
S + + S L ++ LP N C+ ++ +I + N+ VD+++LL
Sbjct: 16 SNEIIDSPIFLNKISALPESENVHCLLLKQLIGSPQLKQTWQFNFCVDLNFLLENMHASV 75
Query: 211 --PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPR-G 267
V +I H +S L + P N L+ +P+ +GTHHSK M+ +
Sbjct: 76 FPTVDVRITHGYDSKSDSLARLTAQMNHCPVNVKLYSVYVPM-WGTHHSKIMVNFFKDDS 134
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQ------------------------------DFPLK 297
+I++HTANL+ DW SQ ++ +K
Sbjct: 135 CQIVIHTANLVEPDWIGMSQAIFKTPLLYPKANDSLSTSSVPEYGNPSKIRKHEGSLDIK 194
Query: 298 DQNN---LSEECGFEN----------DLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
D N + + FEN D + + +F A L + + K +
Sbjct: 195 DDRNCDIIDVDSAFENFKHKSDTRSSDDLGVIGRQFQQDFLAYLKNYRHTYELIEKLKMY 254
Query: 345 NFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---------LVYQ 395
+FS+ I SVPG G WG KL+ +L+ EK KK + Q
Sbjct: 255 DFSAIRAIFIGSVPGKFEGEEESSWGLGKLKKILK--MLEKDSKKDEKTKFEESDICISQ 312
Query: 396 FSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI-- 453
SS+GS K E + ++ GF + G ++PTV++V+ S+ G+ +G++I
Sbjct: 313 CSSMGSFGPK--QEYIAELTDGFGCQR-----GNWKFLFPTVKEVQQSMLGWQSGSSIHF 365
Query: 454 ----PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSANL 507
+ V+ K KW A GR R PHIKT+ R+ +G+ L W L+TSANL
Sbjct: 366 NILGKTAASQVETLKKGKNLCKWVAMKAGRQRVAPHIKTYMRFSNDGELLRWVLVTSANL 425
Query: 508 SKAAWGALQKNNSQ------LMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
SK AWG L+ + ++ L IRSYE GVL+ P C I+ K+ +
Sbjct: 426 SKPAWGTLEGHKAKSRSTRGLRIRSYEAGVLLYPKLFEESQRAPC---IMTPTYKTNTPN 482
Query: 562 TSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYG 621
+ ++ ++G V+ + + ++ PP Y +D WS T KD G
Sbjct: 483 LDEKRR------EFYG-------KRVIGVRMCWDFPPVEYEDKDEIWSPVINRTDKDWLG 529
Query: 622 QVWP 625
VWP
Sbjct: 530 YVWP 533
>gi|330928975|ref|XP_003302469.1| hypothetical protein PTT_14295 [Pyrenophora teres f. teres 0-1]
gi|311322144|gb|EFQ89422.1| hypothetical protein PTT_14295 [Pyrenophora teres f. teres 0-1]
Length = 621
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 132/542 (24%), Positives = 232/542 (42%), Gaps = 96/542 (17%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAKIP 217
+PS +L R+ L N V + D++ +I + N++ D+++++ + +
Sbjct: 96 IPSPIQLTRIMKLHGHQNVDTVGLNDLLGDPLIKECWNFNFLFDLEFVMQHFDRDVRDMV 155
Query: 218 HVLVIHG---ESDGT----LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPRGVR 269
V ++HG D LE +R N L +P FGTHHSK ++L + +
Sbjct: 156 KVKIVHGFWKRDDANRISLLETAERY--PNIELLSAYIPDPFGTHHSKMLVLFRHDDTAQ 213
Query: 270 IIVHTANLIHVDWNNKSQGLWMQ-DFPL----------KDQNNLSEECGFENDLIDYLST 318
II+HTAN+IH DW N +Q +W+ PL + N + F++DL+ Y+
Sbjct: 214 IIIHTANMIHRDWANMTQAVWVSPQLPLLSRASQSQSDTNTNPIGSGERFKSDLLRYIGA 273
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS----SLKKWGHMKL 374
+ K + + ++FSS I SVP S +G + L
Sbjct: 274 YE-----------KRLKGLIAQLEDYDFSSIRAAFIGSVPSRQKPGRAIPSTTSFGWLGL 322
Query: 375 RTVLQECTFEKGFKKSP--LVYQFSSLGSLDEK--WMAELSSSMSSGFSEDKTPLGIGEP 430
+ +L K SP +V Q SS+ +L W++ L S +SS +S+ T +
Sbjct: 323 KEILSTIPISKAKAFSPPHIVAQVSSIATLGAAPTWLSNLQSVLSS-YSKATTSVPENTT 381
Query: 431 L-------------------------IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVD 461
+ +++P E++R SL+GY +G +I S Q+
Sbjct: 382 VSFTKASSFFTKRDDSVRIASSPKFSVIFPNPEEIRNSLDGYGSGGSIHWKLQSAQQQKQ 441
Query: 462 KDFLKKYWAKWKASHTG--------------RSRAMPHIKTFARYNGQK---LAWFLLTS 504
+++ W ++ + R A PHIKT+ R++ + + W +LTS
Sbjct: 442 LEYMHPMLCHWASTPSAPALASTDVPRREAHRGPAAPHIKTYIRFSDDEQNTIDWAMLTS 501
Query: 505 ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 564
ANLSK AWG + ++ I+S+E GV++ P+ F+ T+ E+
Sbjct: 502 ANLSKQAWGDVVNKKEEIWIQSWETGVVVWPAL------FAETTQAAVDEVVMVPMFGKD 555
Query: 565 IQKTKLVTLTWHG-SSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQV 623
+ + G ++ +V +PY+LP + Y++++ PW YT+ D G
Sbjct: 556 MPGVDDNGVNLEGKEAEEMRPKTIVGFRMPYDLPLKPYTADEKPWCATMAYTEPDRNGHA 615
Query: 624 WP 625
WP
Sbjct: 616 WP 617
>gi|239608603|gb|EEQ85590.1| tyrosyl-DNA phosphodiesterase [Ajellomyces dermatitidis ER-3]
Length = 653
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 125/473 (26%), Positives = 201/473 (42%), Gaps = 112/473 (23%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIV--AILSNYMVDIDWLLPACPV-LA 214
+PS +L ++ A + N V +RD++ GD ++ + NYM D+D+L+ +
Sbjct: 71 IPSPIQLTHIRDFSASSGNNADAVRLRDIL-GDPLIKESWQFNYMFDVDFLMSQFDEDVR 129
Query: 215 KIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFGTHHSKAMLLI-YPR 266
+ +V ++HG ES + E +R I+ P P FGTHHSK M+LI +
Sbjct: 130 NLVNVKIVHGSWKRESPNRIHIDESCRRYPNVEPIVAYMPEP--FGTHHSKMMILIRHDD 187
Query: 267 GVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQ----------NNLSEECGFENDLIDY 315
++++HT N+I DW N Q +W P+ + N F+ DLI Y
Sbjct: 188 QAQVVIHTTNMIAGDWANMCQAVWRSPLLPMCHEMKRGPGSTASNRFGSGIRFKRDLIAY 247
Query: 316 LSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKK----- 368
L A+G K P +K++FS+ L+ASVP L
Sbjct: 248 LE------------AYGRKKTGPLVDQLEKYDFSTVRAGLVASVPSRQAIDELDSEKHTL 295
Query: 369 WGHMKLRTVLQECTFEKGF----KKSPLVYQFSSLGSLDE--KWMAELSSSMSSGFSEDK 422
WG L+ +Q+ KG + +V Q SS+ +L + KW+ E + S
Sbjct: 296 WGWPALKDAIQQIPLNKGTNTTGNRPQIVIQISSVATLGQTDKWLKETFFAALSPSPSRS 355
Query: 423 TPLGIGEPL--------IVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWA 470
+ G+ +P I++PT +++R SL GYA+G +I S + ++L+ Y
Sbjct: 356 SSSGLFKPKTNPPAKFSIIFPTPDEIRRSLNGYASGGSIHMKLQSSAQRKQLEYLRPYLC 415
Query: 471 KWKAS---------------------------------------------HTGRSRAMPH 485
+W GR RA PH
Sbjct: 416 RWAGDGDASDGSDISGQGSTNTNARRERGKDASESSQKHATIDKNGQPIRQAGRRRAAPH 475
Query: 486 IKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
IKT+ R++ L W +++SANLS AWGA ++ I S+E+GV++ P
Sbjct: 476 IKTYIRFSDADLTTIDWAMVSSANLSLQAWGAAANGKKEIRICSWEIGVIVWP 528
>gi|448079213|ref|XP_004194340.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
gi|359375762|emb|CCE86344.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
Length = 575
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 137/503 (27%), Positives = 209/503 (41%), Gaps = 96/503 (19%)
Query: 177 NTSCVSIRDVI-QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE---SDGTLEH 232
N + V++ D+I D+ + N+ +D+++ L K + + G S +
Sbjct: 110 NYNAVTLSDMIGMSDLQSSFQFNFAIDLEFFLEHVDRSKKSKTITFVLGSDLLSPEVKDE 169
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWM 291
+++ + K LP FGTHH+K M+ Y G II+ T NL +D++ +Q W
Sbjct: 170 VQKRYGVDASDIKVDLPKRFGTHHTKMMVNFYEDGTCEIIIMTCNLQPIDFSALTQMCWR 229
Query: 292 QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSA 349
K ++ + + F+ D+I YL + P KIN KF+ S
Sbjct: 230 SGRLSKASSSNAGQNRFKTDIIRYLKRYRKP------------KINELADTLAKFDMSGI 277
Query: 350 AVRLIASVPG----YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK 405
V L+ASVPG +++G+ KL VL+ G + + Y + +
Sbjct: 278 DVELVASVPGNFNLARATDESEEYGYGKLYQVLKRNDLLLGNEDTDKEYNVLAQATSISY 337
Query: 406 WMAELSSSMSSGFSEDKTPLGIGE--------------------------PLIVWPTVED 439
A + +S FS PL P I++P +D
Sbjct: 338 PFALKEKNTASVFSHIICPLVFSRNSERLFDVLEPGTKSFRDHQIKHSYNPHIIYPCAKD 397
Query: 440 VRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFAR 491
+ S G+ +G AI + +N + +K Y KW+ASH GR PH+K +
Sbjct: 398 IALSGTGFYSGQAIHFKYDTSAIHRNQYEQNIKPYLYKWRASHKNAGRDETPPHVKLYMC 457
Query: 492 YNG---QKLAWFLLTSANLSKAAWGALQK------NNSQLMIRSYELGVLILPSAKRHGC 542
NG + L W L+ S NLSK AWGA ++ + S I SYELGVLI PS H
Sbjct: 458 DNGDNWKTLRWVLMASHNLSKQAWGARRELRYRSADPSAYEISSYELGVLI-PSKSDH-- 514
Query: 543 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 602
+VP S E S+ G V + +P+ LPP+RYS
Sbjct: 515 ------KLVPVFDSSHQQEVSE-----------QGD---------VPVRIPFILPPERYS 548
Query: 603 SEDVPWSWDKRY-TKKDVYGQVW 624
S+D PWS Y + KD +G W
Sbjct: 549 SDDKPWSAYSNYGSLKDKFGNTW 571
>gi|354543539|emb|CCE40258.1| hypothetical protein CPAR2_102960 [Candida parapsilosis]
Length = 532
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 142/560 (25%), Positives = 220/560 (39%), Gaps = 103/560 (18%)
Query: 114 QKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEA--LCNFHVSRDKLPSTFRLLRVQG 171
+KR S+ E +K+ + + E+ ++ + EE L N + S +LL
Sbjct: 3 EKRKSDAFKAASEHWAKRFKNESERVQDDSAHHEETKPLGNNSTTVSCFSSQIKLLHNPS 62
Query: 172 LP----AWANTSCVSIRDVI-QGDIIVAILSNYMVDIDWLLPAC--PVLAKIPHVLVIHG 224
P N V I D+I ++ N+ VD+ + L A+ ++ I G
Sbjct: 63 YPEQDLTRVNQDTVRIHDLIGSSELKETYQFNFNVDLPFFLSFLHPTFTARKRKLVFITG 122
Query: 225 ES--DGTLEHMKRNKPANWILH-KPPLPISFGTHHSKAML-LIYPRGVRIIVHTANLIHV 280
D E K K + I + +P FGTHH+K M+ + +I+ + NL +
Sbjct: 123 NKLLDSADEETKSIKSSYNISEVQANIPSRFGTHHTKMMINFFHGNSAEVIIMSCNLTKL 182
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF 340
D+ +Q +W + ++ F++DLI YL T + P+ A
Sbjct: 183 DFGGLTQMIWRSGRLARGNTTGTKSIKFKSDLIGYLRTYEKPQIDTLATA---------- 232
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECT--------------FEKG 386
+ F+FS V LIAS PG++ ++ + H ++ C F
Sbjct: 233 LETFSFSGIDVDLIASSPGHYDLNNEEP--HYGYGSLFDACKRNDLLIDNRDKSHHFNVL 290
Query: 387 FKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGE-------------PLIV 433
+ S + Y F+ L M +E L G P IV
Sbjct: 291 AQTSAISYPFAVEKGATAGVFTHLLCPMLFSKNEKFCLLAPGAQSLRRHQSKHNYTPSIV 350
Query: 434 WPTVEDVRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKAS----HTGRSRAM 483
+P+V++V S G+AAG AI KN +K Y KW + TGR R M
Sbjct: 351 FPSVDEVAASTVGFAAGQAIHFDYSRSYVHKNYYNQAIKPYHKKWDSGDVKVFTGRERVM 410
Query: 484 PHIKTFARYNG---QKLAWFLLTSANLSKAAWGALQKNN------SQLMIRSYELGVLIL 534
PH+K + NG + + W + S NLSK AWG+ + N SQ + SYELG+L+
Sbjct: 411 PHVKLYMCDNGDNWETIKWCYMGSHNLSKQAWGSRKGNKFVNNDPSQYEVNSYELGILVT 470
Query: 535 PSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPY 594
P + + PS + SDAG V Y+ +P+
Sbjct: 471 PRP---------NTKMKPSYL-----------------------SDAGTEGGVTYIRMPF 498
Query: 595 ELPPQRYSSEDVPWSWDKRY 614
+LPP YS D PWS Y
Sbjct: 499 KLPPAAYSDNDKPWSGHVSY 518
>gi|391868838|gb|EIT78047.1| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae 3.042]
Length = 389
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 185/397 (46%), Gaps = 72/397 (18%)
Query: 268 VRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNNLSEE------CGFENDLIDYLSTLK 320
VR+++HTAN+I DW N Q +W PL+ ++ E+ F+ DL+ YL+
Sbjct: 22 VRVVIHTANMIPGDWANMCQAVWRSPLLPLQKTDDRVEDLTLGSGARFKRDLLAYLT--- 78
Query: 321 WPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHTGSSLKK-----WGHMK 373
+G K P +K++F + L+ASVP L WG
Sbjct: 79 ---------EYGPKKTGPLVEQLRKYDFGAIRAALVASVPSKQKVDDLDSQKKTLWGWPA 129
Query: 374 LRTVLQECTFEKGFKKSP---LVYQFSSLGSL--DEKWMAE-LSSSMSSGFSEDKTPLGI 427
L+ ++++ + K+ +V Q SS+ +L +KW+ + + +S+S + + P
Sbjct: 130 LKDIMRQIPPAQKTTKATTPHIVTQISSVATLGQTDKWLKDVMFASLSPASTSTRQP--- 186
Query: 428 GEPLIVWPTVEDVRCSLEGYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH------- 476
+ I++PT +++R SL GY +G +I S + +++ Y W H
Sbjct: 187 -KYSIIFPTADEIRRSLNGYGSGGSIHMKLQSAAQQKQLQYMRPYLRHWAGDHDTAEPSH 245
Query: 477 -----TGRSRAMPHIKTFARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIRSY 527
GR RA PHIKT+ R++ + + W ++TSANLS AWGA + ++ I S+
Sbjct: 246 TSKQDAGRRRAAPHIKTYIRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNASGEVRICSW 305
Query: 528 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 587
E+G+++ P + ++ +VP+ K + E + + ++ T V
Sbjct: 306 EIGIVVWPQLYVQD---TESATMVPT-FKRDTPEPLENKDSETTPDT------------V 349
Query: 588 VYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
+ L +PY+LP Y++ D PW ++ + D GQ W
Sbjct: 350 IGLRMPYDLPLTPYAAHDTPWCATAQHLEPDWLGQTW 386
>gi|260945317|ref|XP_002616956.1| hypothetical protein CLUG_02400 [Clavispora lusitaniae ATCC 42720]
gi|238848810|gb|EEQ38274.1| hypothetical protein CLUG_02400 [Clavispora lusitaniae ATCC 42720]
Length = 748
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 136/495 (27%), Positives = 212/495 (42%), Gaps = 93/495 (18%)
Query: 176 ANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPAC-PVLAKIPHVLV-IHGESDGTLEH 232
N V++ D++ D++ N+ VD+++ L P AK +V + G +
Sbjct: 293 VNVDTVTVHDLVGAPDLLETFQFNFNVDLEYFLTFLHPNFAKNKRKIVFVTGTAYLAGHP 352
Query: 233 MKRNKPANWILHK--PPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGL 289
++ A + + + PLP F +HHSK M+ YP V II+ T NL +D+ +Q +
Sbjct: 353 LREIIKAKYNISECIAPLPNRFASHHSKMMINFYPHDQVEIIIMTCNLTQLDFGGLTQSV 412
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
W + + F+ DL YL K + + +N++S
Sbjct: 413 WRSGKLKRGKTTAKLGSRFKQDLERYLLKYKMATIEKVVQR----------LRDYNYNSV 462
Query: 350 AVRLIASVPGY----HTGSSLKKWGHMKLRTVLQ--ECTFEKGFKKSPLVYQFSSLGSLD 403
V L+AS PG H + + +G+ KLR VLQ + + K ++ Q +S+
Sbjct: 463 GVELVASAPGTYSIDHIDENDETYGYGKLRQVLQRNDLLIKDTEKHHNILAQVTSIAYPY 522
Query: 404 EKWMAELSSSMSS-----GFSEDKTPLGIG-------------EPLIVWPTVEDVRCSLE 445
+ +S +S FS K L G +P +V+PTV++V S
Sbjct: 523 SSRKGDTASILSHLLCPLMFSHWKKHLEPGTQSTSKHQEEFKYKPQLVFPTVKEVASSNF 582
Query: 446 GYAAGNAIPSP-------QKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFARYNG-- 494
G+ +G+A+ QK +++ +K Y KW TGR R PH+K +A NG
Sbjct: 583 GFLSGSAVHFKHSGSLIHQKQYEQN-VKPYLCKWSTPENVTGRERVTPHVKYYACDNGDG 641
Query: 495 -QKLAWFLLTSANLSKAAWGALQ-KNNSQLM-IRSYELGVLILPSAKRHGCGFSCTSNIV 551
L W L+ S NLSK AWG + K+ Q + SYEL VL+ S K N+V
Sbjct: 642 WNTLKWVLVGSHNLSKQAWGYPEAKSKGQTFDVASYELSVLVPGSGK----------NLV 691
Query: 552 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPV--PYELPPQRYSSEDVPWS 609
P K SS+ + +PV P++LPP RY D+PWS
Sbjct: 692 PVFKKD-------------------------VSSDTITIPVRFPFKLPPTRYGENDLPWS 726
Query: 610 WDKRYTK-KDVYGQV 623
Y K KD +G +
Sbjct: 727 AGSDYGKLKDRWGNL 741
>gi|345570074|gb|EGX52899.1| hypothetical protein AOL_s00007g235 [Arthrobotrys oligospora ATCC 24927]
Length = 651
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 152/574 (26%), Positives = 235/574 (40%), Gaps = 114/574 (19%)
Query: 155 VSRDK---LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC 210
VSRD + S F+L +++ LPA N ++I D++ +I I S N+M D++W++
Sbjct: 74 VSRDPTLIISSPFKLTQIRNLPANRNVDTITISDILGSPLIREIWSFNFMHDLEWMVSHL 133
Query: 211 PV-LAKIPHVLVIHG--------------ESDGTLEHMKRNKPANWILHKPPLPISFGTH 255
+AK + +IHG E D ++ + L +P FGTH
Sbjct: 134 DEDVAKDIDIKIIHGNWRKDDMSRKALESERDKLIDLASSDGGYKIELITAYMPDMFGTH 193
Query: 256 HSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ-DFPLKDQNNLSEECGFENDLI 313
H+K ++L Y I+VHTAN+I DW+N +Q +W PL ++L + G +
Sbjct: 194 HTKMLVLFYHDDSAEIVVHTANMIPWDWSNMTQAVWRSPKLPLLADDSLERKEG-----V 248
Query: 314 DYLSTLKWPEFSANLPAHG-NFKINPSFFKKFNFSSAAVRLIASVPGYHT--GSSLKKWG 370
Y+ F+A + A+G K K++F + + VPG H G K +G
Sbjct: 249 GYVFK---EAFTAYVGAYGWRTKSLMEQIVKYDFRAVRAVFVGHVPGDHAINGPENKLFG 305
Query: 371 HMKLRTVLQECTFEKGF---KKSPLVY----------QFSSLGSLDEKWMAEL------- 410
K++ VL G K +VY Q SS+ +L E + +
Sbjct: 306 WSKVKRVLTRIGRGGGHGVNKAGRVVYTVKGGGEIAMQCSSVATLGESYFDSVLYPTFST 365
Query: 411 ---SSSMSSGFSEDKTPLGIG---------EPLIVWPTVEDVRCSLEGYAAGNAI-PSPQ 457
+ F +TP E +V+PTVE+VR S+ G+ G +I Q
Sbjct: 366 CRPGGGQLNAFDVLRTPSSSASSSRPSNRPELALVFPTVENVRTSVLGWDGGGSIFMKSQ 425
Query: 458 KNVDKDFLK------KYWAK-------WKASHTGRSRAMPHIKTF--------------- 489
K VDK LK + W + A R +A PHIKT+
Sbjct: 426 KPVDKAQLKYVKPMLRVWGQPPIGLSTAIAVEAERGKATPHIKTYNFFSPPRMDSKDSDT 485
Query: 490 -------ARYNGQKLAWFLLTSANLSKAAWGALQKN---NSQLMIRSYELGVLILPS--- 536
+N + W ++TSANLSK AWG K +S I+SYE G+LI P
Sbjct: 486 TDGEDESGAFNIVSMDWAMITSANLSKQAWGNPTKGSGPSSTSKIQSYEAGILIHPGLWK 545
Query: 537 -AKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYE 595
+ G S + GS + + K+ D + V + + Y+
Sbjct: 546 DLLKDEAGAVTMSAV-------GSKDWLVAEGQKIENCDVPEDMDGKCNMVKVGVRLAYD 598
Query: 596 LPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRHFQ 629
P + Y +D PW D Y +D G WP ++
Sbjct: 599 YPLKPYDEDDEPWCKDMPYEGRDWKGITWPPRWE 632
>gi|195161240|ref|XP_002021476.1| GL26495 [Drosophila persimilis]
gi|194103276|gb|EDW25319.1| GL26495 [Drosophila persimilis]
Length = 511
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 86/241 (35%), Positives = 127/241 (52%), Gaps = 23/241 (9%)
Query: 307 GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 366
GF DL+ YL K + + + +K +FS+ V + SVPG H S+
Sbjct: 236 GFRQDLMLYLVEYKISQLQPWI----------ARIRKSDFSAINVFFVGSVPGGHREGSV 285
Query: 367 K--KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTP 424
+ WGH +L ++L + + P+V Q SS+GSL A + + +D +P
Sbjct: 286 RGHPWGHARLGSLLAKHATPID-DRIPVVCQSSSIGSLGANVQAWIQQDFVNSLRKDSSP 344
Query: 425 LGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGR 479
G + +++P+ +V S +G G +P + DK +LK + +WK+S R
Sbjct: 345 GGKLRQMPPFKMIYPSFNNVSGSHDGMIGGGCLPYGKNTNDKQPWLKAHLQQWKSSDRHR 404
Query: 480 SRAMPHIKTFARYN--GQKLAWFLLTSANLSKAAWGALQKNNSQ---LMIRSYELGVLIL 534
SRAMPHIKT++RYN Q + WF+LTSANLSKAAWG+ KN + L I +YE GVL L
Sbjct: 405 SRAMPHIKTYSRYNLTDQSIYWFVLTSANLSKAAWGSFNKNTNLQPCLRIANYEAGVLFL 464
Query: 535 P 535
P
Sbjct: 465 P 465
>gi|448516422|ref|XP_003867567.1| hypothetical protein CORT_0B04230 [Candida orthopsilosis Co 90-125]
gi|380351906|emb|CCG22130.1| hypothetical protein CORT_0B04230 [Candida orthopsilosis]
Length = 533
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 146/572 (25%), Positives = 229/572 (40%), Gaps = 123/572 (21%)
Query: 112 RSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQG 171
+++ + S DG T+ E +RQ D + A+ +F PS +LL
Sbjct: 22 KTESKQSQDGKTDCE----DVRQPD--------TTSVAIASF-------PSQLKLLYNPS 62
Query: 172 LPA----WANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPAC-PVLAKIPHVLVIHGE 225
P N + IRD+I ++ N+ VD+ + L P + +V
Sbjct: 63 YPEKELPSVNQDTLRIRDLIGSALLKETYQFNFNVDLPFFLSFLHPTFKREERKIVFITG 122
Query: 226 S---DGTLEHMKRNKPANWILH--KPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIH 279
S D + E + K AN+ + + +P FGTHH+K M+ Y V +I+ + N
Sbjct: 123 SRLLDPSFEETESIK-ANYNISEVQAHIPSRFGTHHTKMMINFYTDESVEVIIMSCNFTR 181
Query: 280 VDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPE--FSANLPAHGNFKIN 337
+D+ +Q +W + ++ F++DLI YL T P+ + A L
Sbjct: 182 LDFGGLTQMIWRSGRLILGNTTGAKSSKFKSDLIAYLRTYARPQIDYLAKL--------- 232
Query: 338 PSFFKKFNFSSAAVRLIASVPG-YHTGSSLKKWGHMKLRTVLQECT-----------FEK 385
+ ++FS V LIAS PG Y S +G+ L + +
Sbjct: 233 ---LEPYSFSGIDVELIASSPGKYDLNSEGPHYGYGSLYNACKRNNLLIDNRDKSRHYNV 289
Query: 386 GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG-------------EPLI 432
+ S + Y FS L M + + L G P I
Sbjct: 290 LAQTSAISYPFSVEKGATAGIFTHLLCPMLFSKNGEFKLLAPGIQSLRRHQSEHNYTPSI 349
Query: 433 VWPTVEDVRCSLEGYAAGNAIPSP------QKNVDKDFLKKYWAKWKASHT----GRSRA 482
++P V +V S G+AAG AI KN + +K Y KW +S + GR +
Sbjct: 350 IFPAVSEVVSSTIGFAAGQAIHFDYSRSFIHKNYYQQAIKPYLKKWNSSSSMSLAGREQV 409
Query: 483 MPHIKTFARYNG---QKLAWFLLTSANLSKAAWGALQKN------NSQLMIRSYELGVLI 533
MPH+K + NG + + W + S NLSK AWG+ + N +SQ + SYELGVL+
Sbjct: 410 MPHVKLYMCDNGDNWRSIKWCYMGSHNLSKQAWGSRKGNKFVNDDSSQYEVNSYELGVLV 469
Query: 534 LPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVP 593
+P K + + PS +K D G+ V Y+ +P
Sbjct: 470 VPKPK---------TEMKPSYLK-----------------------DLGSEEGVTYVRMP 497
Query: 594 YELPPQRYSSEDVPWSWDKRYTK-KDVYGQVW 624
++LPP YS D PWS Y + +D G +
Sbjct: 498 FKLPPTAYSENDKPWSGHASYGELRDSKGNTY 529
>gi|223995471|ref|XP_002287409.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976525|gb|EED94852.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 625
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 119/439 (27%), Positives = 193/439 (43%), Gaps = 110/439 (25%)
Query: 195 ILSNYMVDIDWLLP-ACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFG 253
I+SN+++D +LL P + V+V + E+ +E MK +W + G
Sbjct: 113 IISNFIIDFGYLLEKTLPDILDFHRVVVFYQEAHN-VEAMK-----SW------ENMLAG 160
Query: 254 THHSKAMLLIYP-----RGVRIIVH--TANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 306
T ++ + + P + H +NL D KSQG++ Q FPLK + +
Sbjct: 161 TGNTVEFVRLVPTDPPRSSCNPLSHKFNSNLWRTDIEYKSQGVYSQVFPLKQKTPADDTV 220
Query: 307 G-----------------------------------FENDLIDYLSTLKWPEFSANLPAH 331
FE+DL+ YL + + + + +
Sbjct: 221 NKLKRKQIYNPYEKKKKPAAGSSSRGWPFEDDKSQLFEDDLVGYLESYHYRK-QQSWKMN 279
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFK- 388
G + ++++FS A LI SVPGYH+ S+ +G++KLR + E C +
Sbjct: 280 GESMNLLALIRQYDFSEAYAVLIPSVPGYHS-LSIDDFGYLKLRKAIIEWVCNQQSNADS 338
Query: 389 -------KSPLVYQFSSLGSLDEKWM----AELSSSMSSGF----------------SED 421
K PLV Q+SS+GSL W+ A L S+ +S ++
Sbjct: 339 RKSSSNAKPPLVCQYSSVGSLTTAWLDLFTAALDSTSTSAVDPVEYYHEVTKKAKSRAKG 398
Query: 422 KTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA---SHT 477
K + + E + IVWPTV+++R ++EGY G ++P KNV + FL + +W
Sbjct: 399 KKGVDLSERMKIVWPTVDEIRTTIEGYNGGGSVPGRTKNVAQSFLLPLYHRWTKRGNDFI 458
Query: 478 GRS---------RAMPHIKTFARYNGQ------KLAWFLLTSANLSKAAWGALQK----N 518
GR+ R +PHIKT+ + + + W +LTS NLSKAAWG ++ +
Sbjct: 459 GRTDNVDPLRTARNVPHIKTYVQPSTHVIGDTPSIEWMVLTSHNLSKAAWGNIENRSVDD 518
Query: 519 NSQLMIRSYELGVLILPSA 537
+ L IR +ELGV I P+
Sbjct: 519 SKVLFIRHWELGVFISPAT 537
>gi|326431947|gb|EGD77517.1| hypothetical protein PTSG_08615 [Salpingoeca sp. ATCC 50818]
Length = 594
Score = 122 bits (305), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 76/195 (38%), Positives = 95/195 (48%), Gaps = 28/195 (14%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFA 490
+PTVEDVR S EGY G ++P K D F K KW+A R+RA+PHIKTF
Sbjct: 422 FCYPTVEDVRTSYEGYVGGGSLPHAIKYREDHVFFAKEACKWRAGWCYRTRALPHIKTFT 481
Query: 491 RYN--GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTS 548
+N + + W LL S NLSKAAWG LQK SQL I SYELGV + PS +
Sbjct: 482 AWNTAARSIDWMLLGSHNLSKAAWGQLQKQESQLHILSYELGVFLSPSL--------LGA 533
Query: 549 NIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 608
+ P K S T + + PVPY+ P YS+ D W
Sbjct: 534 TLRPLADKLRSVRRPDKHDT-----------------QTAWAPVPYDYPLTPYSTHDEMW 576
Query: 609 SWDKRYTKKDVYGQV 623
WD Y + D +G+V
Sbjct: 577 YWDGVYMQPDTHGRV 591
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 134/305 (43%), Gaps = 41/305 (13%)
Query: 124 NGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANT----- 178
GEL +K+ + + E + + DKL F+L R++G+ +
Sbjct: 39 GGELETKRAKAAETVRTERVAAATSSRT------DKLDVVFKLSRLRGVGKAGGSLKEAN 92
Query: 179 ---SCVSIRDVI-QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK 234
SI +++ Q ++ ++ NYM+D+DWLL P + +++++G + +
Sbjct: 93 NPLFATSIAEILSQPGLLSSVQFNYMIDVDWLLDQYPAEYRRLPLMIVYGNDQRVSKETE 152
Query: 235 RNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ-D 293
+ P LP +FGTHH+K MLL + G++++VHTANLI DWN K+QG+WM
Sbjct: 153 HDTSNVRWFRAPYLP-AFGTHHTKMMLLFFHDGMQVVVHTANLISRDWNLKTQGIWMSPK 211
Query: 294 FP--------LKDQNNLSEECGFENDLIDYLST--------LKWPEFSANLPAHGNFKIN 337
P ++D ++ S GF DL YL + + AH +
Sbjct: 212 LPRFSPKRGRVQDISSYS-PTGFGADLWSYLRAYGDGVQGGVSMRAVRERIAAHDLTHVK 270
Query: 338 PSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFS 397
F ++ L+ P G + WG + + +L + G +V QFS
Sbjct: 271 VVFACQYERD-----LLPLSPAATAGRTKTAWGQHEAQDLLLQQHAAGG--ADVVVCQFS 323
Query: 398 SLGSL 402
S+G +
Sbjct: 324 SIGKM 328
>gi|281201405|gb|EFA75617.1| protein-tyrosine phosphatase 3 [Polysphondylium pallidum PN500]
Length = 665
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 138/295 (46%), Gaps = 69/295 (23%)
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEND 311
FG HSK MLL+Y +R+++ +AN D+++ Q +W QDFP N+ F++
Sbjct: 390 FGCQHSKLMLLVYDDSIRVVIPSANPTRFDYDDIGQTIWFQDFP--KVNSQPPPSQFQDT 447
Query: 312 LIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 371
L ++ + P +F K++FS A V LI S+PGYH G+S+ + GH
Sbjct: 448 LKLFIKSCALPN---------------TFLDKYDFSIAKVHLIVSIPGYHRGASMNQCGH 492
Query: 372 MKLRTVLQECTFEKG-----------FKKSPLVYQFSSLGSLDEKWMAE-LSSSMSSGFS 419
M+LR++L++ +K KK + Q SSLG +++KW + L S+ + S
Sbjct: 493 MQLRSILKKYYTDKENDLKHSDFPIIIKKREVHSQTSSLGLVNDKWSPQFLESTQTLTKS 552
Query: 420 EDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGR 479
+ P G+ I++P KN+
Sbjct: 553 KLVDPTGLLH--ILFP----------------------KNL----------------ILH 572
Query: 480 SRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 534
S+ + F + + W + S NLS AAWG LQK+NSQL I ++E+GVL+L
Sbjct: 573 SKIITGTTKFEHNDKLRFDWVYVGSHNLSPAAWGRLQKDNSQLYISNFEIGVLLL 627
>gi|448083780|ref|XP_004195441.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
gi|359376863|emb|CCE85246.1| Piso0_004828 [Millerozyma farinosa CBS 7064]
Length = 576
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 133/503 (26%), Positives = 210/503 (41%), Gaps = 96/503 (19%)
Query: 177 NTSCVSIRDVI-QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE---SDGTLEH 232
N + V++ D+I D+ + N+ +D+++ L + + + G S +
Sbjct: 110 NYNAVTLSDMIGMPDLRSSFQFNFAIDLEFFLGHVHRSKESKTITFVLGSDLLSPEVKDE 169
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWM 291
+++ + K LP FGTHH+K M+ Y II+ T NL +D++ +Q W
Sbjct: 170 VQKRYGVDASDIKVDLPKRFGTHHTKMMVNFYENETCEIIIMTCNLQPIDFSALTQMCWR 229
Query: 292 QDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSA 349
+ ++ + F+ D+I YL + KIN +F+ S
Sbjct: 230 SGRLSRASSSNPGKPRFKTDIIRYLKRYR------------KQKINELADTLAEFDMSGI 277
Query: 350 AVRLIASVPG----YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK 405
V L+ASVPG T +++G+ KL VL+ G + + Y + +
Sbjct: 278 DVELVASVPGNFNLARTADDSEEYGYGKLYQVLKRNDLLLGNEDTDKEYNVLAQATSISY 337
Query: 406 WMAELSSSMSSGFSEDKTPLGIGE--------------------------PLIVWPTVED 439
A + +S FS PL P I++P +D
Sbjct: 338 PFALKEKNTASVFSHIICPLIFSRNSDRLFDVLEPGTKSFRDHQIKHSYNPHIIYPCAKD 397
Query: 440 VRCSLEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASH--TGRSRAMPHIKTFAR 491
+ S G+ +G AI + +N + +K Y KW+ASH GR PH+K +
Sbjct: 398 IALSGTGFYSGQAIHFKYDTSAIHRNQFEQNIKPYLYKWRASHKNAGREETPPHVKLYMC 457
Query: 492 YNG---QKLAWFLLTSANLSKAAWGALQK------NNSQLMIRSYELGVLILPSAKRHGC 542
NG + L W L+ S NLSK AWGA ++ + S I SYELGVLI PS+ H
Sbjct: 458 DNGDNWKTLRWVLMASHNLSKQAWGARRELRYRSADPSTYEISSYELGVLI-PSSSDH-- 514
Query: 543 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 602
+VP S+ Q+ +D G V + +P+ LPP+RYS
Sbjct: 515 ------KLVP-------VFDSRHQRK---------VTDQGD----VPVRIPFILPPERYS 548
Query: 603 SEDVPWSWDKRY-TKKDVYGQVW 624
S+D PWS Y + KD +G W
Sbjct: 549 SDDKPWSAYSNYGSLKDKFGHTW 571
>gi|390364206|ref|XP_788891.3| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like [Strongylocentrotus purpuratus]
Length = 414
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 120/428 (28%), Positives = 192/428 (44%), Gaps = 83/428 (19%)
Query: 260 MLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEECG-----FENDLI 313
M L+Y G+R+++HTAN+I DW+ K+QG+W+ FP +N + G F+ DL+
Sbjct: 2 MFLLYADGMRVVIHTANIIESDWHQKTQGVWISPLFPKLPSSNQTATNGESPSFFKRDLL 61
Query: 314 DYLSTLKWPEFSANLPAHGNFKINP--SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGH 371
YL+ + P + P + +FSSA V LI+SVPG H KWGH
Sbjct: 62 AYLTAYRSPS------------LQPWKDHITQHDFSSAKVFLISSVPGRHARELKNKWGH 109
Query: 372 MKLRTVLQECTFEKGFKKS-PLVYQFSSLGSL---DEKWM-AELSSSMSS--GFSEDKTP 424
+K+R +L++ +K ++ P++ QFSS+GSL KW+ AE SMS+ G S T
Sbjct: 110 LKVRKILRQYGPDKEQVQTWPVIGQFSSIGSLGGDKTKWLCAEFLQSMSTVKGQSGSFTS 169
Query: 425 LGIGEPL-IVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTG---- 478
+ +++P ++VR SLEGY AG ++P S Q + +L +++ + G
Sbjct: 170 NADTRHMKLIFPCSDNVRTSLEGYPAGASLPYSIQTAKKQPYLHQFFLREILRQYGPDKE 229
Query: 479 RSRAMPHIKTFA---RYNGQKLAWF---LLTSANLSKAAWGALQKNNSQLMIRSYELGVL 532
+ + P I F+ G K W L S + K G+ N ++ L
Sbjct: 230 QVQTWPVIGQFSSIGSLGGDKTKWLCAEFLQSMSTVKGQSGSFTSNADTRHMK------L 283
Query: 533 ILPSA---KRHGCGFSCTSNIVPSEIKSGSTE--TSQIQKTKLVTLTWHGSSDAGASS-- 585
I P + + G+ +++ P I++ + Q L W G+ + AS
Sbjct: 284 IFPCSDNVRTSLEGYPAGASL-PYSIQTAKKQPYLHQFFFANLSKAAW-GAYEKNASQLM 341
Query: 586 ------EVVYLP----------------------VPYELPPQRYSSEDVPWSWDKRYTKK 617
V+ +P +P+++P YS D PW WD YT K
Sbjct: 342 IRSYEIGVMMIPSFFDKSRKTFPLTEGRGQKEFSLPWDVPLTPYSKTDRPWIWDIPYTDK 401
Query: 618 -DVYGQVW 624
D +G W
Sbjct: 402 PDSHGNAW 409
>gi|254565439|ref|XP_002489830.1| hypothetical protein [Komagataella pastoris GS115]
gi|238029626|emb|CAY67549.1| hypothetical protein PAS_chr1-1_0480 [Komagataella pastoris GS115]
gi|328350245|emb|CCA36645.1| tyrosyl-DNA phosphodiesterase 1 [Komagataella pastoris CBS 7435]
Length = 562
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 131/548 (23%), Positives = 228/548 (41%), Gaps = 97/548 (17%)
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQDEQ--DNENGKNSEEALCNFHVSRDKLPSTFRLLR 168
++ K D + SK +Q+ EQ D + +++E+ + + S RL
Sbjct: 52 AQGSKEQQVDAQEEPQKHSKTQKQEKEQVIDLTDDQDAEDRPA---IDTTTVQSPIRLFN 108
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
N C+S++D++ + N+ +++D+ L + I+ ++
Sbjct: 109 SPAHKPQDNIDCISLKDLVSSPQLSKTYQFNFCINVDFFLKYITSDPLSTEIYFINS-AE 167
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKS 286
+E ++N+ + H F THH+K M+ + G +I+V +AN+ +D+ +
Sbjct: 168 YLVEMTQQNRMRFKLRHVDIQLERFATHHTKMMVNFFRDGTAQIVVMSANMTEMDFVGNT 227
Query: 287 QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF 346
QGLWM P+ + N E F+ND + YL + + +L A K ++F
Sbjct: 228 QGLWMS--PMLSKGN-GRESSFKNDFLAYLKA--YNKHDLDLLAEE--------LKLYDF 274
Query: 347 SSAAVRLIASVPGYHT----GSSLKK---WGHMKLRTVLQ-ECTFEKGFKKSPLVYQFSS 398
+ ++SVPG T LK+ +G+ KL +L+ F K + + ++ Q ++
Sbjct: 275 GNVKAEFLSSVPGTFTIPEEDDRLKRSVQYGYGKLFQLLKLNNLFPKATESTDILAQVAT 334
Query: 399 LGS-LDEKWMAELSSSMSSGFSEDKTPLGIG---------------EPLIVWPTVEDVRC 442
+ S D + + ++ + K P+ G P +V+PT +V
Sbjct: 335 IASPFDFRSSNIFTHLLAPLINGTKFPIAGGLEPLQKAINDDVHPFNPFLVFPTKNEVFG 394
Query: 443 S-LEGYAAG---------NAIP--SPQKNVDKDFLKKYWAKWKAS------HTGRSRAMP 484
S L+ Y +G + +P + Q N+ ++K+ +W S GRS P
Sbjct: 395 SVLKEYTSGIFYNIDDSSHKVPFLTNQHNI----IRKFMYRWTNSDPNLNQKAGRSNLAP 450
Query: 485 HIKTFARYNG--QKLAWFLLTSANLSKAAWGALQK--NNSQLMIRSYELGVLILPSAKRH 540
H+KT+ N Q W+LLTSANLSK AWG K N + I SYE G+ I P K +
Sbjct: 451 HVKTYCASNDGFQTFMWYLLTSANLSKQAWGYPLKGSNGLKYKISSYEAGIFIHP--KLY 508
Query: 541 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 600
G + +L + S VV + VPY P ++
Sbjct: 509 GEDY------------------------QLKPILSRDSFPNRDKDNVVPIRVPYAFPLEK 544
Query: 601 YSSEDVPW 608
Y D PW
Sbjct: 545 YHDSDEPW 552
>gi|116194574|ref|XP_001223099.1| hypothetical protein CHGG_03885 [Chaetomium globosum CBS 148.51]
gi|88179798|gb|EAQ87266.1| hypothetical protein CHGG_03885 [Chaetomium globosum CBS 148.51]
Length = 349
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 140/311 (45%), Gaps = 56/311 (18%)
Query: 344 FNFSSAAVRLIASVPGYHT---GSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG 400
++FS LIASVPG H S+ WG + L+ KK + Q SS+
Sbjct: 62 YDFSEIRGSLIASVPGRHVFEEEDSITWWGSAAMSRALEAVPISS--KKPEIAIQTSSIA 119
Query: 401 SL--DEKWMAE-LSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI--- 453
+L + W+ L S+ G S TPL +V+PT +++R SL+GY +G++I
Sbjct: 120 TLGGSDTWLKNILFRSLRGGRS--TTPLAQRPSFKVVFPTPDEIRKSLDGYHSGSSIHTK 177
Query: 454 -PSPQKNVDKDFLKKYWAKWK--------------ASHTGRSRAMPHIKTFARYNG---- 494
SPQ+ +L+ + W GR RA PHIKT+ RY+G
Sbjct: 178 TQSPQQASQLTYLRPMFHHWANDSDRGAPLSYGDIPKEAGRKRAAPHIKTYIRYSGYGPE 237
Query: 495 -QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPS 553
+ W LLTSANLSK AWG +++ + SYE+GVL+ P G +VP+
Sbjct: 238 PPTVDWALLTSANLSKQAWGDAPNTRNEVRVASYEIGVLVWPELYGEGA------TMVPT 291
Query: 554 EIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 613
+ E G G ++ V L +PY LP Q Y +VPW ++
Sbjct: 292 FMTDSLAE---------------GEVPEGTATAVA-LRMPYNLPLQAYGEGEVPWVATEK 335
Query: 614 YTKKDVYGQVW 624
+ + D G+ W
Sbjct: 336 HLEPDWMGRAW 346
>gi|444707427|gb|ELW48704.1| Tyrosyl-DNA phosphodiesterase 1 [Tupaia chinensis]
Length = 389
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 88/241 (36%), Positives = 117/241 (48%), Gaps = 71/241 (29%)
Query: 391 PLVYQFSSLGSL---DEKWM-AELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLE 445
PLV QFSS+G L + KW+ +E S+ + + K P PL +++P+VE+VR SLE
Sbjct: 210 PLVGQFSSIGFLGADESKWLCSEFKESLLTLGRDSKIPGKSTVPLHLIYPSVENVRTSLE 269
Query: 446 GYAAGNAIP-SPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTS 504
GY AG ++P S Q +++L Y+
Sbjct: 270 GYPAGGSLPYSIQTAEKQNWLHSYF----------------------------------H 295
Query: 505 ANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 564
ANLSKAAWGAL+KN +QLMIRSYELGVL LPSA F S V + SGS
Sbjct: 296 ANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA------FGLDSFKVKQKFFSGS----- 344
Query: 565 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKK-DVYGQV 623
HG + + PVPY+LPP+ Y +D PW W+ Y K D +G +
Sbjct: 345 -----------HGPTAS--------FPVPYDLPPELYGHKDRPWIWNIPYVKAPDTHGNM 385
Query: 624 W 624
W
Sbjct: 386 W 386
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 4/82 (4%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV+G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 105 PFQFYLTRVKGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 164
Query: 218 HVLVIHGESDGTLEHM-KRNKP 238
+L++HG+ H+ R KP
Sbjct: 165 PILLVHGDKREAKAHLHARAKP 186
>gi|294659254|ref|XP_461609.2| DEHA2G01584p [Debaryomyces hansenii CBS767]
gi|199433821|emb|CAG90056.2| DEHA2G01584p [Debaryomyces hansenii CBS767]
Length = 583
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 121/443 (27%), Positives = 187/443 (42%), Gaps = 112/443 (25%)
Query: 248 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 306
LP FGTHH+K M+ Y II+ T NL +D+ +Q W + N+S E
Sbjct: 182 LPTRFGTHHTKMMINFYEDDTSEIIIMTCNLQKIDFGGLTQMCWKSGRLHRSNGNISPER 241
Query: 307 G--FENDLIDYLSTLKWPEFSANLPAHGNFKINP-----SFFKKFNFSSAAVRLIASVPG 359
G F+ DL +YL +K NP +++FS + L+AS PG
Sbjct: 242 GARFQKDLKNYLF---------------RYKKNPLRELGKSLDEYDFSPVNIELVASAPG 286
Query: 360 Y----HTGSSLKKWGHMKLRTVLQECTF----EKGFKKSPLVYQFSSLGSLDEKWMAELS 411
+ + + + +G+ KL VL+ KG K ++ Q SS+ A
Sbjct: 287 FFNMAESTNDSEIYGYGKLYQVLRRNNLLIDNSKGENKYNILAQVSSISYP----FATEK 342
Query: 412 SSMSSGFSEDKTPL---GIGE-----------------------PLIVWPTVEDVRCSLE 445
S+ +S FS PL G+ + P I++P+V+DV S
Sbjct: 343 SNTASIFSHLLCPLIFSGMSKASFNLLKPGAASFKSHQNTHNYRPHILYPSVDDVANSNV 402
Query: 446 GYAAGNAI-------PSPQKNVDKDFLKKYWAKWK----ASHTGRSRAMPHIKTFARYNG 494
G+A+G A+ P+ + +++ +K Y +W+ A TGR +PH+K + NG
Sbjct: 403 GFASGQALHFKFTTTPTHRNQYEQN-IKPYLYRWQSGSHADETGRENVVPHVKLYMCDNG 461
Query: 495 QK---LAWFLLTSANLSKAAWGALQKNNSQLM--------IRSYELGVLILPSAKRHGCG 543
L W L+ S NLSK AWGA KN ++ + SYELGVL+
Sbjct: 462 DDWCTLRWVLMGSHNLSKQAWGA--KNETKFTNSDPSVYKVSSYELGVLV---------- 509
Query: 544 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 603
N+ P++ G T L + + A + L +P++LPP +Y
Sbjct: 510 ---PGNMDPND--DGIT---------LKPIYGRDTFPAPQHNNDTPLRIPFKLPPVKYKP 555
Query: 604 EDVPWSWDKRYTK--KDVYGQVW 624
+ PWS Y KD +GQ +
Sbjct: 556 SERPWSALINYGNNLKDRFGQCY 578
>gi|193785768|dbj|BAG51203.1| unnamed protein product [Homo sapiens]
Length = 118
Score = 112 bits (280), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 82/145 (56%), Gaps = 33/145 (22%)
Query: 483 MPHIKTFARY--NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 540
MPHIKT+ R + K+AWFL+TSANLSKAAWGAL+KN +QLMIRSYELGVL LPSA
Sbjct: 1 MPHIKTYMRPSPDFSKIAWFLVTSANLSKAAWGALEKNGTQLMIRSYELGVLFLPSA--- 57
Query: 541 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 600
F S V + +GS E + PVPY+LPP+
Sbjct: 58 ---FGLDSFKVKQKFFAGSQE------------------------PMATFPVPYDLPPEL 90
Query: 601 YSSEDVPWSWDKRYTKK-DVYGQVW 624
Y S+D PW W+ Y K D +G +W
Sbjct: 91 YGSKDRPWIWNIPYVKAPDTHGNMW 115
>gi|400603196|gb|EJP70794.1| tyrosyl-DNA phosphodiesterase [Beauveria bassiana ARSEF 2860]
Length = 399
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 164/352 (46%), Gaps = 46/352 (13%)
Query: 160 LPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAKIPH 218
PS FRL V+ L N V++ D++ +I S NY+ I +L+ A + PH
Sbjct: 38 FPSPFRLTWVRDLEEENNKDAVTLSDLLGDPLISECWSFNYLHSISFLMDAFDRDIR-PH 96
Query: 219 VLV--IHG---ESDGTLEHMKRNKP--ANWILHKPPLPISFGTHHSKAMLLIYPRG--VR 269
V V +HG DG + N LH P+P FGTHHSK ML+++ R +
Sbjct: 97 VKVHIVHGFWKREDGNRIGLVEQAALFPNVNLHAAPMPEMFGTHHSK-MLILFRRDDTTQ 155
Query: 270 IIVHTANLIHVDWNNKSQGLW-------MQDFPLKD--QNNLSEECG--FENDLIDYLST 318
+I+HTAN+I DW N + +W ++ P + ++++ G F++DL+ YL
Sbjct: 156 VIIHTANMIAKDWTNMTNAVWTSPVLSKLKKVPDDPSWREDMAQGSGHRFKSDLLSYLRC 215
Query: 319 LKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHT---GSSLKKWGHMKLR 375
+ N K+++FSS LIASVPG H + WG +
Sbjct: 216 YDRMRPTCNALVES--------LKEYDFSSVRGSLIASVPGTHEVHGDPGVTSWGWKSMS 267
Query: 376 TVLQECTFEKGFKKSPLVYQFSSLGSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPL-I 432
LQ+ E G S + Q SS+ +L ++ W L ++ S+ K + +
Sbjct: 268 KCLQQIPCEPGV--SQVAVQVSSIATLGGNDGW---LRGTLFRALSKGKVATALSPQFKV 322
Query: 433 VWPTVEDVRCSLEGYAAGNA----IPSPQKNVDKDFLKKYWAKWKASHTGRS 480
V+PT +++R SL+GYA+G + I S Q+ + ++L+ + W R+
Sbjct: 323 VFPTADEIRASLDGYASGGSIHTKIQSKQQQMQLNYLRPIFHHWMTDDDSRT 374
>gi|344301196|gb|EGW31508.1| hypothetical protein SPAPADRAFT_154759 [Spathaspora passalidarum NRRL Y-27907]
Length = 549
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 115/426 (26%), Positives = 175/426 (41%), Gaps = 91/426 (21%)
Query: 248 LPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 306
+P FGTHH+K M+ + + I++ ++N+ +D+ +Q LW K +
Sbjct: 163 IPNRFGTHHTKMMINFFKGDTMEIVIMSSNITRLDFGGLTQMLWRSGRLSKIKPKTIPLV 222
Query: 307 G--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH--- 361
G F+ DL++YL+ E + K+++FSS V LIAS PG +
Sbjct: 223 GKRFQKDLMNYLNKYNKVEITQL----------SKRLKQYDFSSVNVELIASAPGSYNLR 272
Query: 362 -TGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
+ + +G+ KL L+ + S L Y + S A + + FS
Sbjct: 273 DVTNETEIYGYGKLHQALKRNSLLIDNSISKLKYNIIAQVSAISYPFAVETFQTAGIFSH 332
Query: 421 DKTPLGIGE------------------------PLIVWPTVEDVRCSLEGYAAGNAI--- 453
PL + P+I++PT E+V S G+ AG AI
Sbjct: 333 LLCPLVFSKKEEFKLLEPGTNSFRQHQKDHNYNPIIIFPTPEEVAGSNVGFRAGGAIHFD 392
Query: 454 ---PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNGQK---LAWFLLTSA 505
KN + +K Y KW + + TGR + MPH+K + NG L W + S
Sbjct: 393 YNRSFVHKNYYQQCIKPYLHKWSSRETITGREKVMPHVKLYMCDNGDNWSTLKWVYMGSH 452
Query: 506 NLSKAAWGA------LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 559
NLSK AWG+ L N S I SYELGVL+ P P E
Sbjct: 453 NLSKQAWGSRRGNKFLSSNPSIYDISSYELGVLVYPK---------------PGE----- 492
Query: 560 TETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KD 618
TL + D+ S+ + + +P++LPP +Y S D+PWS Y D
Sbjct: 493 ------------TLVPNYLGDSIPKSKNIPIRLPFKLPPVKYLSTDLPWSGHVSYGGLAD 540
Query: 619 VYGQVW 624
YG+ +
Sbjct: 541 KYGETY 546
>gi|300121378|emb|CBK21758.2| unnamed protein product [Blastocystis hominis]
Length = 397
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 148/311 (47%), Gaps = 39/311 (12%)
Query: 242 ILHKPPLPISF--GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLK 297
++ PP S+ G H+K +LL + +RI++ +ANL DW SQ +WMQDF K
Sbjct: 60 LIVSPPFAQSYLRGCFHAKLLLLRFSDRLRIVISSANLTTEDWTMWSQCVWMQDFFNAPK 119
Query: 298 DQNNLSE---ECGFENDLIDYLSTLKWPE---FSANLPAHGNFKINPSFFKKFNFSSAAV 351
D ++ + F LI +L PE F+A F+ F + +V
Sbjct: 120 DSTRVAAKKLDLEFRTQLISFLRKCCVPEERIFNA--------------FRGVFFENVSV 165
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECT--FEKGF---KKSPLVYQFSSLGSLDEKW 406
+L+ASVPG + G + +G ++LR+VL+ EK K P++ Q SS+G+ + W
Sbjct: 166 QLVASVPGVYQGDRMNDYGQLRLRSVLKGLNDYMEKVASLPKNPPILSQCSSIGNPSQNW 225
Query: 407 MAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEG-YAAGNAIPSPQKNVDKDF 464
+ + S G + + + L IV+PT V S+ G AG+ I + K F
Sbjct: 226 ILSMLKSCYGGREIVEKKGKLADLLHIVYPTNVYVNNSIIGPEMAGSLIFMQKVYTAKAF 285
Query: 465 LKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMI 524
L++ ++K + GR +PH K +K L AWG ++K SQ+ I
Sbjct: 286 LREMLKRYKDA-PGRETTLPHSKYLMNVPLKK-------RPRLPWVAWGQIEKKESQIAI 337
Query: 525 RSYELGVLILP 535
+YE GV++LP
Sbjct: 338 CNYECGVVLLP 348
>gi|440302433|gb|ELP94746.1| tyrosyl-DNA phosphodiesterase, putative [Entamoeba invadens IP1]
Length = 446
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 171/389 (43%), Gaps = 74/389 (19%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPL 248
G++ L+ ++ DI WLL P+L K V +H DG+L + N +
Sbjct: 38 GELYACFLTTFVFDIGWLLREVPIL-KTVQVQFVH---DGSLSEDEERLIHNLDFQCIKV 93
Query: 249 PISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGF 308
G HH K M+++Y G+R ++ T NL+ D+ K+ G++++DF K N+ S+
Sbjct: 94 SPFRGCHHVKIMVMLYEGGLRFVLSTGNLLEQDYEIKTNGIYVRDFKPK-SNSFSKM--- 149
Query: 309 ENDLID-YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK 367
ND+ + +L+T+++ S N + + F+FS+ L+ SVPG G
Sbjct: 150 -NDIGEHFLTTMRYYLNSIN--------TDIGYLDDFDFSTIDAWLLLSVPGKFHGDMAS 200
Query: 368 KWGHMKLRTVLQECTF---------------------------------EKGFK------ 388
+ G +L ++L+ +F +KG K
Sbjct: 201 EVGLGQLSSLLKSFSFGSQKDQKTQEEHKTSALINPVVPTKQSQKTSTSQKGLKSPEIEC 260
Query: 389 --KSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEG 446
++ ++ Q SSLG L + + SS + +WPT + VR S G
Sbjct: 261 AEQAVIISQSSSLGYLSSNFTEKFKSSFVPNVHHIQLK-------TLWPTEDFVRVSATG 313
Query: 447 YAAGNAIPSPQKNVDKDF-LKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSA 505
YA G ++ Q+NV L +Y ++ R PHIKT+ G +LTSA
Sbjct: 314 YAGGQSLFLTQQNVKSGVALYRYEPRFP-----RHYIQPHIKTYLVKVGDTFRCGVLTSA 368
Query: 506 NLSKAAWGALQKNNSQLMIRSYELGVLIL 534
N+S AAWG + + + I ++E+G+L +
Sbjct: 369 NMSAAAWG--KPMSYGIDISNFEMGLLFV 395
>gi|355723700|gb|AES07977.1| tyrosyl-DNA phosphodiesterase 1 [Mustela putorius furo]
Length = 381
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 95/173 (54%), Gaps = 13/173 (7%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 164 PFRFYLTRVSGIKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLIKQYPPEFRKK 223
Query: 218 HVLVIHGESDGTLEHMK-RNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 224 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 283
Query: 276 NLIHVDWNNKSQGLWMQDFPLKDQ------NNLSEECGFENDLIDYLSTLKWP 322
NLIH DW+ K+QG+W+ PL Q + F+ DLI YL+ P
Sbjct: 284 NLIHADWHQKTQGIWLS--PLYPQIIHGTHRSGESTTHFKADLISYLTAYNAP 334
>gi|328868012|gb|EGG16393.1| protein-tyrosine phosphatase 3 [Dictyostelium fasciculatum]
Length = 596
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 112/421 (26%), Positives = 186/421 (44%), Gaps = 61/421 (14%)
Query: 163 TFRLLRVQGLPAWANTSC----VSIRDVIQGDIIVAILS-NYMVDIDWLLPACPVLAK-- 215
+F L R+ G N+S ++ RD+I + ++++ + +D +W++ K
Sbjct: 145 SFYLNRIYGESNDNNSSTTPKTLTFRDIISPSGLESVIAMGFGMDTEWMMNEIIRSQKGR 204
Query: 216 --IPHVLVIH-GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIV 272
IP VI G+ + +N IL + + +G HSK +LL+Y +R++V
Sbjct: 205 KDIPMTFVIDCGDPKKKGTTVIQN--ITLIL----VHVLYGCMHSKLILLLYKDYIRVVV 258
Query: 273 HTANLIHVDWNNKSQGLWMQDFPLKDQN---------------------NLSEECGFEND 311
+AN D+ Q +W QDF K +LS +
Sbjct: 259 PSANPFEEDYIRIGQTIWYQDFQKKLPPPPPPLATTPTLKPIPSTSKTISLSLKQMTTKK 318
Query: 312 LIDYLSTLKWPEFSANLPAHGN-FKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
+T +F +L N FKI F +F+F A +LI S+PG+H G++L +G
Sbjct: 319 PTTTTTTTTTNDFQISLKTLLNCFKIETKFLDQFDFECAKAQLIISIPGFHNGATLNSYG 378
Query: 371 HMKLRTVLQECTFEK---------GFKKSPLVYQFSSLGSLDEKWMAEL--SSSMSSGFS 419
H+KLR+VL +K FK+ + Q SSLG+++ W S +
Sbjct: 379 HLKLRSVLTSYYNQKEKDLNLKIDNFKRD-VFSQCSSLGNVNSGWNQHFLESCRIPKNNL 437
Query: 420 EDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNV-DKDFLKKYWAKWKASHT 477
ED I + L I++PTV + + + + + I K+ DK F + K H
Sbjct: 438 ED-----ISKSLHILFPTVSWITSNHKRMQSASIIRFQDKSYDDKTFPRNSMTLIKHRHP 492
Query: 478 GRSRAMPHIKTFARYN----GQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 533
R + H K ++ W + S NLS AAWG +QKN +Q+ + +YE+GV++
Sbjct: 493 HRGNMLLHTKVNVGVTTIGKNKRYDWIYVGSHNLSPAAWGKIQKNQTQIQLSNYEIGVVL 552
Query: 534 L 534
L
Sbjct: 553 L 553
>gi|440797312|gb|ELR18403.1| Tyrosyl-DNA phosphodiesterase [Acanthamoeba castellanii str. Neff]
Length = 569
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 81/263 (30%), Positives = 134/263 (50%), Gaps = 35/263 (13%)
Query: 164 FRLLRVQGLP-AWANTSCVSIRDVIQGD-IIVAILSNYMVDIDWLLPACPVLAKIPHVLV 221
F L ++GL A AN+ C+SIR +++ + ++ A+++++ D++W+L P IP LV
Sbjct: 25 FVLNEIKGLRGADANSGCISIRKLVRPESLVAALVTSFTEDVEWVLSVIP--PTIPITLV 82
Query: 222 IHGESDGTLEHMKRNKPANWILHKPPLPI-SFG-------THHSKAMLLIY-PRGVRIIV 272
H E ++ ++ N + PPL + FG H+K MLL Y +R++V
Sbjct: 83 RHWEEPDREGEVRISR--NIRVIHPPLALPGFGGGQAMRAKMHAKLMLLRYRDNTLRVVV 140
Query: 273 HTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPA 330
+ANL D+ Q +W QDFP K Q + ++ FE L +L LK E
Sbjct: 141 TSANLAQPDYELVGQTVWYQDFPKKQQKSSGQQPASPFEETLTQFLVALKADE------- 193
Query: 331 HGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS-SLKKWGHMKLRTVLQECTFEKG--F 387
F ++++FS AA L+ SVPG+H G + GH +LR +L++ +
Sbjct: 194 --------GFLREYDFSKAAADLVVSVPGFHRGKHKMDAVGHTRLRALLRDFQWPPADEL 245
Query: 388 KKSPLVYQFSSLGSLDEKWMAEL 410
+ + YQ SSLG+L E +++E
Sbjct: 246 RDDNIYYQTSSLGALYESFVSEF 268
>gi|149245486|ref|XP_001527220.1| hypothetical protein LELG_02049 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449614|gb|EDK43870.1| hypothetical protein LELG_02049 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 554
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 118/443 (26%), Positives = 177/443 (39%), Gaps = 110/443 (24%)
Query: 248 LPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 305
+P FGTHH+K M+ + V I++ ++N+ +D+ +Q +W P + +
Sbjct: 154 IPTRFGTHHTKMMINFFEDLSVEIVISSSNITRLDFGGLTQMVWRSGRLPQSGETIGEKG 213
Query: 306 CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPG-YHTGS 364
F+ DLI YL+ K L N +NF S V LIAS PG Y+
Sbjct: 214 IQFKKDLIGYLNKYKKVPVD-KLATRLNL---------YNFLSVDVELIASAPGKYNLQK 263
Query: 365 SLKKWGHMKLRTVLQ--------------ECTFEKGFKK---------SPLVYQFSSLGS 401
+G+ L L+ E +K KK S + Y FS+
Sbjct: 264 DSSLYGYGSLYKALERNNLLLNNKNVEHDEIDNDKHNKKKHYNVLAQVSAISYPFST--- 320
Query: 402 LDEKW-----MAELSSSMSSGFSEDKTPLGIGE-------------PLIVWPTVEDVRCS 443
EKW L + E L G+ P I++PTV++V S
Sbjct: 321 --EKWATAGIFTHLLCPLIFSKDEKFRLLAPGKESIKRHQKEHNYTPHIIFPTVDEVASS 378
Query: 444 LEGYAAGNAI------PSPQKNVDKDFLKKYWAKWKASHT----GRSRAMPHIKTFARYN 493
GY AG+AI KN +K Y +KW +S T GR R MPH+K + N
Sbjct: 379 TIGYVAGSAIHFDYTRSFVHKNYFTQAIKPYLSKWDSSDTKEVTGRERVMPHVKLYMCDN 438
Query: 494 G---QKLAWFLLTSANLSKAAWGALQKN------NSQLMIRSYELGVLILPSAKRHGCGF 544
+ + W + S NLSK AWG+ + N + + + SYELGVL P
Sbjct: 439 ADNWKTIKWCYMGSHNLSKQAWGSKKGNKFVNDHSDEYEVSSYELGVLFTP--------- 489
Query: 545 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 604
K G+T ++ K + + ++ +P++LPP YS
Sbjct: 490 -----------KEGTTMVPSYKENK-----------SSIRGDHTFVRMPFQLPPALYSLL 527
Query: 605 DVPWSWDKRYTKK-DVYGQVWPR 626
D+PWS Y K D+ G + +
Sbjct: 528 DMPWSGHVSYGDKLDLMGSTYKK 550
>gi|299740649|ref|XP_001833897.2| hypothetical protein CC1G_01574 [Coprinopsis cinerea okayama7#130]
gi|298404347|gb|EAU87927.2| hypothetical protein CC1G_01574 [Coprinopsis cinerea okayama7#130]
Length = 627
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 119/441 (26%), Positives = 187/441 (42%), Gaps = 73/441 (16%)
Query: 142 NGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMV 201
NG+ + A D P TFRL +V G + D+ AI+S++ +
Sbjct: 169 NGEFRQTATRGVDPRADGKP-TFRLTQVLGE---------------KKDLTFAIISSFAL 212
Query: 202 DIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAML 261
D+ W+ +P ++V + D T + +N NWI PPL +G H K ML
Sbjct: 213 DLPWIYEFFD--RSVPVIVV--AQPDATGQASMKNVLPNWIKTTPPLRGGYGCQHMKFML 268
Query: 262 LIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN---LSEECGFENDLIDYLS 317
L + G +R++V TANLI DW +W+QD PL+ ++ + F L+ L+
Sbjct: 269 LFHKTGRLRVVVSTANLISYDWREMENTVWLQDVPLRSSSSTAPVRATDDFPGTLLYMLA 328
Query: 318 TLKW-PEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMK 373
L P + H N I +++++S L+ S+ G H G S+ K GH +
Sbjct: 329 ALNVVPALKIMINEHPNLPIKTIEELRERWDWSKVKAHLVPSIAGKHEGWPSVIKTGHPR 388
Query: 374 LRTVLQECTFEKGF----KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED-------- 421
L V+++ G KK L Q SSLG+ +W+ E S +ED
Sbjct: 389 LMAVVRKMAMRTGTGSQAKKLTLECQGSSLGNYTTQWLNEFYYSARGESAEDWLDRSKKQ 448
Query: 422 --KTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVD-KDFLKKYWAKWKASHT 477
K P P+ I++PT + V+ S G G I ++ D K+F ++ + K S
Sbjct: 449 REKQPY---PPVKIIFPTKKTVQESTFGEQGGGTIFCRRRQWDGKNFPRELFHDSK-SKA 504
Query: 478 GRS-----------RAMPHIKTFARYNGQK------------LAWFLLTSANLSKAAWGA 514
GRS R H T + + + W + S N + +AWG
Sbjct: 505 GRSLMHSKMIIGTLRDSTHASTSQDGSETEDSDDEIQIIQPAVGWAYIGSHNFTPSAWGT 564
Query: 515 LQKN--NSQLMIRSYELGVLI 533
L + N L I +YE+GV+
Sbjct: 565 LSGSSFNPTLNITNYEVGVVF 585
>gi|150865397|ref|XP_001384596.2| hypothetical protein PICST_67678 [Scheffersomyces stipitis CBS
6054]
gi|149386653|gb|ABN66567.2| putative tyrosyl-DNA phosphodiesterase [Scheffersomyces stipitis
CBS 6054]
Length = 553
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 111/427 (25%), Positives = 181/427 (42%), Gaps = 92/427 (21%)
Query: 248 LPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 305
+P FGTHH+K M+ + + I++ + NL +D +Q LW L+ ++++ E
Sbjct: 165 IPNRFGTHHTKMMVNFFEDKSCEIVIMSFNLNKIDVVGLTQTLWRSGRLQLETEDSVKLE 224
Query: 306 CG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
G F+ D ++YL P ++ + ++F S V L+AS PG +
Sbjct: 225 RGENFKRDFMNYLKKYNSPVVTSLADR----------LQSYDFHSIDVELLASAPGKYEI 274
Query: 364 SSLKK----WGHMKLRTVLQECTFEKGFKKSPLVYQF---------------SSLGSLDE 404
++L +G+ KL +L+ K +Y F S S+
Sbjct: 275 TNLTDKDEVYGYGKLYQILKRNNLLVDNTKGDKLYNFLSQVTSISYPFNVRGSQTASVFS 334
Query: 405 KWMAELS-SSMSSGF-----SEDKTPLGIGE----PLIVWPTVEDVRCSLEGYAAGNAIP 454
+A L S S+GF D T + P +V+PTV+++ + G+ AG A+
Sbjct: 335 HLLAPLVFSGGSNGFKILLPGSDSTSKHQKDNYYLPHMVYPTVKEIANNNVGFGAGQAVH 394
Query: 455 SPQKNVD------KDFLKKYWAKWKASH----TGRSRAMPHIKTFARYNGQK---LAWFL 501
D + ++ Y KW +S TGR +PH K F NG L W L
Sbjct: 395 MKHTKSDTHRYQYQQNIRPYLRKWNSSGSDIVTGRESVVPHCKYFMCDNGDNFSSLKWAL 454
Query: 502 LTSANLSKAAWGA---LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSG 558
+ S NLSK AWG+ N ++ I S+ELGV++ P + G +VP+
Sbjct: 455 VGSHNLSKQAWGSPVPKSTNPNKYEISSFELGVVVFP---KEG------EKLVPA----- 500
Query: 559 STETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS-WDKRYTKK 617
+G D + + L +P+ LPP +Y+++D PWS W K
Sbjct: 501 -----------------YGE-DTVNDDKAIPLRMPFSLPPTKYTAQDEPWSEWVSYGELK 542
Query: 618 DVYGQVW 624
D +GQ +
Sbjct: 543 DKFGQTY 549
>gi|307108295|gb|EFN56535.1| hypothetical protein CHLNCDRAFT_144174 [Chlorella variabilis]
Length = 682
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 103/211 (48%), Gaps = 15/211 (7%)
Query: 175 WANTSCV--SIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKI----PHVLVIHGESDG 228
WAN + S+ D+++G++ + + + WLL ACP L + E+ G
Sbjct: 476 WANEGFLGLSLGDLVRGEMRWCLYCSMALHARWLLSACPDLRPLVTWRTKTRKALREASG 535
Query: 229 TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG 288
+R ++LH PP+P +G HHSK ML+ Y GVR I+ T NL ++++Q
Sbjct: 536 AAAEGRR-----FVLHTPPVPDRWGRHHSKMMLIEYATGVRFILPTPNLQFHQLHSQTQA 590
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN-PSFFKKFNFS 347
++ QDFP K FE L YL+ L+ P A H + P ++ +FS
Sbjct: 591 VFFQDFPPKQDGTSPPGSDFETSLARYLAALQLPGEEAK---HAQAGWHWPELVRRHDFS 647
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVL 378
+A L+ASVPG H G +GH +L +L
Sbjct: 648 AARAVLVASVPGSHGGELAAAYGHKRLAALL 678
>gi|154311214|ref|XP_001554937.1| hypothetical protein BC1G_06725 [Botryotinia fuckeliana B05.10]
Length = 405
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 142/349 (40%), Gaps = 72/349 (20%)
Query: 343 KFNFSSAAVRLIASVPGYHTGS---SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
K++FS LIASVPG S WG L L+ + +V Q SS+
Sbjct: 60 KYDFSEIKAALIASVPGKQDTELSPSQTGWGWAGLTNALKSVPSHHNTQPE-IVIQVSSI 118
Query: 400 GSL--DEKWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPS- 455
SL +KW+ ++S E K+P G I++PT ++VR S+ GYA+GNAI +
Sbjct: 119 ASLGPTDKWLTHFFKALS----ESKSPRKTGSKFKIIFPTADEVRRSINGYASGNAIHTK 174
Query: 456 ---PQKNVDKDFLKKYWAKW------------------------------KASHTGRSRA 482
P + +LK W K R RA
Sbjct: 175 ILTPAQGKQLAYLKPMLCHWAGDGAQHSSSSSLSSNTPSKSSQSFTSPELKTQEAYRRRA 234
Query: 483 MPHIKTFARYNGQK---------LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 533
PHIKT+ R++ + W L+TSANLSK AWG + ++ I SYE+GVL+
Sbjct: 235 APHIKTYIRFSSDSTSSSSSQKSIDWMLVTSANLSKQAWGESINSADKVRICSYEIGVLV 294
Query: 534 LP---SAKRHGCGFS---CTSNIVPS--------EIKSGSTETSQIQKTKLVTLTWHGSS 579
P K++G C N PS EI + ++ L
Sbjct: 295 WPDLWEEKQNGKNVKMVPCFGNDTPSIPFVSPSLEIVGQKEIRVEGEEGHLKRKRCDDRE 354
Query: 580 DAGASSE----VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
D E +V +PY+LP Y +D+PW Y++ D G+ W
Sbjct: 355 DEKRQEESHTIIVGARMPYDLPLVSYGKDDIPWCASASYSEPDWMGKTW 403
>gi|449019998|dbj|BAM83400.1| probable tyrosyl-DNA phosphodiesterase [Cyanidioschyzon merolae
strain 10D]
Length = 615
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 155/349 (44%), Gaps = 73/349 (20%)
Query: 254 THHSKAMLL-IYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDL 312
HHSK M+L + VR+++HT+N I DW K QG++ D PL+ + S GF DL
Sbjct: 208 VHHSKFMILRLRDDRVRLVIHTSNDIAYDWFFKCQGIFAVDLPLRGAGSASPNTGFCADL 267
Query: 313 IDYLS---------------------TLKWPEFSANL-PAHGNFKINPSFFKKFNFSSA- 349
YL T+ P +A+L A +F+ ++S+
Sbjct: 268 QQYLGAYIRAGERALHGGVTSARRFGTMVAPGDAASLVDAVSHFR---RLMTCCDYSAVD 324
Query: 350 AVRLIASVPGYHTGS--------------SLKKWGHMKLRTV----LQECTFEKGFKKS- 390
VRL++SVPG+H S ++ +GH++L + L+ CT S
Sbjct: 325 GVRLVSSVPGWHRISGQSRTSQTSRTASHAVCAFGHLRLANLVASSLRHCTEAARHPNSL 384
Query: 391 PLVYQFSSLGSLDEK------------WM-AELSSSMSSGFSED----------KTPLGI 427
V Q SSL S+D + W+ +EL S+ G K G
Sbjct: 385 AFVLQGSSLSSVDARCPRAASETLARYWLTSELFRSLCGGDGGGGGVGEESVFAKLAEGS 444
Query: 428 GEPLIVWPTVEDVRCSLEGYAAG-NAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHI 486
+ +VWPT V S+ G +G I Q +D + +++ +W A R+ MPH+
Sbjct: 445 AQVYLVWPTRTQVLTSIVGIDSGMGLIARAQAFLDPE-IRQLLTRWNADWCARTSVMPHM 503
Query: 487 KTFARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 533
KT + ++ + + + L SAN++ AAWG QK S L ++ELGVL
Sbjct: 504 KTISCWDTRTDQCLYCYLGSANVTPAAWGITQKQGSLLRCMNWELGVLF 552
>gi|307211793|gb|EFN87774.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 445
Score = 102 bits (253), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 75/272 (27%), Positives = 131/272 (48%), Gaps = 25/272 (9%)
Query: 183 IRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
I D+ G+I+ ++ Y++D++WL + + ++ +++GE E + N A +
Sbjct: 165 ILDISFGEIVNSLHLTYILDVEWLCLQYLLAGQSTNMTILYGERTDE-EELDDNITAVQV 223
Query: 243 LHKPPLPISFGTHHSKAMLLIYPR-GVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
+P FG+HH+K M+L Y G+R++V TANL DW N+ QG+W+ L +
Sbjct: 224 ----QMPFEFGSHHTKIMILQYKDDGIRVVVSTANLYFEDWQNRMQGMWISPH-LPRLSK 278
Query: 302 LSEECG-----FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
++ CG F+ DL YL++ + P K +K +FS+ V LIAS
Sbjct: 279 AAKRCGESPTNFKKDLQRYLNSYQNPA----------LKRWRDLVRKADFSAVNVCLIAS 328
Query: 357 VPGYHTGSSLKKWGHMKLRTVL-QECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
PGY + + WG+ KL VL Q +K ++ Q S++GS K+ LS +
Sbjct: 329 TPGYFRRTDVDLWGYKKLANVLSQHVMLPSNARKWSIIAQSSAVGSFGPKYEGWLSKEII 388
Query: 416 SGFSEDKTPLGIGEP--LIVWPTVEDVRCSLE 445
+ + P ++P+V++ S +
Sbjct: 389 RSMTRETKRDLKNYPKFQFIYPSVKNYEQSFD 420
>gi|451998304|gb|EMD90769.1| hypothetical protein COCHEDRAFT_1179942 [Cochliobolus
heterostrophus C5]
Length = 567
Score = 102 bits (253), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 102/414 (24%), Positives = 179/414 (43%), Gaps = 42/414 (10%)
Query: 175 WANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLE- 231
+ T ++I +V++ D + A++S++M D +WL PV K V +++ + +
Sbjct: 148 YPRTDDITIDEVLEADTVRTAVISSFMWDSEWLFKKLNPV--KTKQVWIMNAKGKDVQQR 205
Query: 232 ---HMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS-- 286
M+ N +H PP+ + HSK MLL P +RI++ TAN+I DW +
Sbjct: 206 WQKEMEDMGVPNLKIHFPPMDGMIQSMHSKFMLLFGPNKLRIVIPTANMIQTDWGEVAND 265
Query: 287 -------QGLWMQDFPLKDQNNLSEE---CGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
+++ D P + S + F +L+ +L K PE
Sbjct: 266 WQPGVMENSIFLIDLPRRGNETTSTQENMTRFGQELMYFLEMQKVPEMVLQ--------- 316
Query: 337 NPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQ 395
F+FS + + + S+ G H S G L +Q+ + ++ L Y
Sbjct: 317 ---GILNFDFSQTSHLAFVHSIGGSHKTESEHPTGLPGLARAIQDLRLDN-VEQIELDYA 372
Query: 396 FSSLGSLDEKWMAELS-SSMSSGFSEDKTPLGIGEP--LIVWPTVEDVRCSLEGYAAGNA 452
SSLG++++ +++ L ++ F+ D + I +PT E V S+ G G
Sbjct: 373 ASSLGAINDSFLSRLYLAACGKCFAADTATVSDVRRHIRIYFPTNETVEKSIGGPDCGGI 432
Query: 453 IPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAA 511
I Q+ + D F ++ +++S G + R +G+ + W + SANLS++A
Sbjct: 433 ISLSQQRYNADTFPRECLRDYESSRAGMLSHNKLLLARGRKDGRPVGWVYVGSANLSESA 492
Query: 512 WGALQ--KNN--SQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
WG + KN L IR++E GV++ R G VP I G+ E
Sbjct: 493 WGGQKVIKNGKMGSLNIRNWECGVVMTVPEDRLGGRDKDRDKTVPMSIFEGTVE 546
>gi|145533358|ref|XP_001452429.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420117|emb|CAK85032.1| unnamed protein product [Paramecium tetraurelia]
Length = 508
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 169/342 (49%), Gaps = 53/342 (15%)
Query: 226 SDGTLEHMKR-NKPANWILHKPPL--PISFG-THHSKAMLLIYPRGVRIIVHTANLIHVD 281
+D LE ++ N NW + KP I+FG + H K +L +P+ +RI++ + NL D
Sbjct: 147 NDKKLEIIEEFNGHPNWTVIKPSKLSSITFGGSFHPKIWILKFPKFIRIVIGSQNLHVGD 206
Query: 282 WNNKSQGLWMQDFPL--KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INP 338
W SQ +W+QDF + + + +S+E F+ L ++L + LP+ F+ +
Sbjct: 207 WTVWSQAMWIQDFQIGNSELDEVSKE--FKVGLKEFLDNI--------LPSSHKFEDLLK 256
Query: 339 SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF---KKSPLVYQ 395
+ ++F + +RLI S+PG TG+ + K+G M++++V+ F K+ + YQ
Sbjct: 257 IKYNDYDFQNINIRLITSIPGRFTGNQMNKYGMMRIQSVINSELKSSDFEIPKQVSIAYQ 316
Query: 396 FSSLGSLDEKWMAELS--------SSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
+S+G LD ++ + + M E+K+ L +++PT + ++
Sbjct: 317 TTSIGQLDVNYVDFVQQQQNNKSIAQMLFNQQEEKSILK-----LIYPTSDYIQNQT--- 368
Query: 448 AAGNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIKTFARYN-GQK 496
+AG +P Q+ + F K + +++ S H G +PH+K +K
Sbjct: 369 SAGPEYANPLFLRKQQYDNPKFPKNIFHRYQGSNYYYWHAGN---IPHLKVMIITGIDEK 425
Query: 497 L---AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
+ + S NLS+AAWG L+KN +QL I + ELGVL P
Sbjct: 426 IDDKTSIYIGSHNLSQAAWGRLEKNATQLFISNTELGVLYPP 467
>gi|170097685|ref|XP_001880062.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164645465|gb|EDR09713.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 609
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 118/435 (27%), Positives = 180/435 (41%), Gaps = 71/435 (16%)
Query: 142 NGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMV 201
NG+ + A + +D + +TFRL V G + DI AILS+Y +
Sbjct: 165 NGEFRQTATRHADPRKDNM-ATFRLTEVLGQ---------------KKDIAFAILSSYSL 208
Query: 202 DIDWLL----PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHS 257
D W+ PA PV ++ + D T + +N +WI P L G H
Sbjct: 209 DWMWIYQFFDPATPV--------IMVAQPDQTGRAIIKNVLPHWIKTTPYLRGGHGCQHM 260
Query: 258 KAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYL 316
K MLL Y G +R++V TANLI DW + +W+QD PL+ + + + N D+
Sbjct: 261 KFMLLFYRNGRLRVVVSTANLIEYDWRDMENSVWLQDVPLR-SSPIPHDPKATN---DFP 316
Query: 317 STLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMK 373
S ++ S N+ H N + ++++S V L+ S+ G H G ++ K GH +
Sbjct: 317 SIIQRVLNSLNVKPHPNLALKSIEDLRCRWDWSKVKVHLVPSIAGKHEGWPAVIKTGHPR 376
Query: 374 LRTVLQECTFEKGFKKSP---LVYQFSSLGSLDEKWMAELSSSMSSGFSED--KTPLGIG 428
L ++E G K+ L Q SSLG +WM E S +ED P
Sbjct: 377 LMMAVREMAMRTGKGKAKELILECQGSSLGIYTTQWMNEFHWSARGESAEDWLDEPKKRR 436
Query: 429 EPL------IVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKKYWAKWK-------- 473
E L I +P+ V+ S G G I +K K+F + ++ K
Sbjct: 437 EKLPYPPIKIFFPSKRTVQESALGEKGGGTIFCRRKQWSTKNFPRDHFYDSKSKGGPVLM 496
Query: 474 ------ASHTGRSRAMPHIKTFARYNGQK-------LAWFLLTSANLSKAAWGALQKN-- 518
A+H +R + L W L S N + +AWG L +
Sbjct: 497 HSKMIIATHQETTRKTLQAAESSSEEDDDIEVVDPPLGWSYLGSHNFTPSAWGNLSGSSF 556
Query: 519 NSQLMIRSYELGVLI 533
N L I +YELG++
Sbjct: 557 NPVLNIANYELGIVF 571
>gi|260788030|ref|XP_002589054.1| hypothetical protein BRAFLDRAFT_87527 [Branchiostoma floridae]
gi|229274227|gb|EEN45065.1| hypothetical protein BRAFLDRAFT_87527 [Branchiostoma floridae]
Length = 130
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/90 (56%), Positives = 65/90 (72%), Gaps = 3/90 (3%)
Query: 449 AGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARY--NGQKLAWFLLTSA 505
AG ++P K +L K+ +W +S GR+RA PHIKT+ R + +LAWFL+TSA
Sbjct: 8 AGGSLPYSINTARKQPYLNKFLHQWSSSARGRTRASPHIKTYTRTSPDCSRLAWFLVTSA 67
Query: 506 NLSKAAWGALQKNNSQLMIRSYELGVLILP 535
NLSKAAWGAL+KN +QLMIRSYE+GVL LP
Sbjct: 68 NLSKAAWGALEKNGAQLMIRSYEIGVLFLP 97
>gi|409075791|gb|EKM76167.1| hypothetical protein AGABI1DRAFT_45345 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 625
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 115/436 (26%), Positives = 179/436 (41%), Gaps = 73/436 (16%)
Query: 148 EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL 207
+ F R TFRL +V G N S ++ AILS+Y +D W+
Sbjct: 171 QTATRFAEPRKDGQRTFRLTQVLG-----NKS----------ELAFAILSSYSLDFPWIY 215
Query: 208 PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG 267
+P ++V ++ G +K P W+ PPL FG H K MLL Y G
Sbjct: 216 EFFD--RSVPVIMVAQPDAMGQAA-IKYTFP-TWVKTTPPLRGGFGCQHMKFMLLFYKNG 271
Query: 268 -VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTLK-WPE 323
+R+++ TANLI DW + +W+QD P++ Q + F + + L + P
Sbjct: 272 NLRVVISTANLIAYDWRDMENSVWLQDLPMRPQLMPPDPKAKDFPSIMQQVLHAVNVAPA 331
Query: 324 FSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE 380
LP H N + ++++S V L+AS+ G H G S+ K GH +L ++
Sbjct: 332 LRTMLPDHPNIPLRTIEDLRMRWDWSKVKVHLVASIAGKHEGWPSIVKTGHPRLMMAIRT 391
Query: 381 CTFE--KGFKKSPLVY--QFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIG 428
+G K ++ Q SSLG+ +W+ E S +ED + L
Sbjct: 392 MGLRPSRGLGKGNMIIECQGSSLGNFTTQWLNEFHWSARGESAEDWLDEPKRRREKLPYP 451
Query: 429 EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKK--YWAKWKA----------- 474
I++PT + V+ S G G I +K K+F + Y +K KA
Sbjct: 452 SVRILFPTKKIVQESASGEPGGGTIFCRRKQWAAKNFPRDKFYVSKSKAGPVLMHSKMII 511
Query: 475 ---SHTGRSRAM------------PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN- 518
HT + A P +K G W + S N + +AWG L +
Sbjct: 512 ATIQHTNPASASLNREGSDTEEDEPEVKIIEPAVG----WAYVGSHNFTPSAWGTLSGSA 567
Query: 519 -NSQLMIRSYELGVLI 533
N L I +YE+G++
Sbjct: 568 FNPILNITNYEIGIVF 583
>gi|13543875|gb|AAH06083.1| TDP1 protein [Homo sapiens]
Length = 298
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 79/133 (59%), Gaps = 5/133 (3%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQ---GDIIVAILSNYMVDIDWLLPACPVLAKIP 217
P F L RV G+ N+ + I+D++ G ++ + NY D+DWL+ P +
Sbjct: 163 PFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYCFDVDWLVKQYPPEFRKK 222
Query: 218 HVLVIHGESDGTLEHMK-RNKP-ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+L++HG+ H+ + KP N L + L I+FGTHH+K MLL+Y G+R+++HT+
Sbjct: 223 PILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTKMMLLLYEEGLRVVIHTS 282
Query: 276 NLIHVDWNNKSQG 288
NLIH DW+ K+QG
Sbjct: 283 NLIHADWHQKTQG 295
>gi|324542673|gb|ADY49650.1| Tyrosyl-DNA phosphodiesterase 1, partial [Ascaris suum]
Length = 133
Score = 99.0 bits (245), Expect = 7e-18, Method: Composition-based stats.
Identities = 60/180 (33%), Positives = 85/180 (47%), Gaps = 53/180 (29%)
Query: 449 AGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKL--AWFLLTSA 505
AG A+P + + +L + KW+ GR+RAMPHIK+++ ++ + +W L+TSA
Sbjct: 2 AGGALPYQRNTAARQPYLLERMHKWRCERFGRTRAMPHIKSYSAFSDGRCLPSWLLITSA 61
Query: 506 NLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQI 565
NLSKAAWG LQK SQL IRSYELGVL+ T+ +
Sbjct: 62 NLSKAAWGELQKKESQLAIRSYELGVLL--------------------------TDEDSL 95
Query: 566 QKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWP 625
Q +PY++P ++ D PW D YTK D++G WP
Sbjct: 96 QL------------------------LPYDMPLTKFEPGDQPWVCDDTYTKPDIHGATWP 131
>gi|145497459|ref|XP_001434718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401846|emb|CAK67321.1| unnamed protein product [Paramecium tetraurelia]
Length = 522
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/337 (27%), Positives = 160/337 (47%), Gaps = 47/337 (13%)
Query: 230 LEHMKR-NKPANWILHKPP---LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
LE ++R N NW + KP + G H K +L +P+ +RI++ + NL DW
Sbjct: 153 LEIIERYNNYPNWTVIKPSKLSTNMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIW 212
Query: 286 SQGLWMQDFPL--KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFK 342
SQG+W+QDF + + + S+E F++ L ++L + LP F+ + +
Sbjct: 213 SQGMWIQDFKIGKSELDQTSQE--FKSMLREFLYEI--------LPTSHKFEDLLKIKYD 262
Query: 343 KFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLV-YQFSSL 399
++F +RLI S+PG G+ L K+G M+L++V+ + C + K V YQ +S+
Sbjct: 263 DYDFKDVNIRLITSIPGRFVGNQLFKYGMMRLQSVIYQELCNNKMEIPKQVCVTYQTTSI 322
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTPLGI--------GEPLIVWPTVEDVRCSLE-GYAAG 450
G +D ++ + +G S K I + +++PT + + G
Sbjct: 323 GQMDNNYV-DFVLQCCTGRSTKKINQMILNQQEEEQSKLKLIYPTADYIENQTHGGVDFA 381
Query: 451 NAIPSPQKNVDK-DFLKKYWAKWKAS-----HTGRSRAMPHIKTF------ARYNGQKLA 498
N + Q++ + F K + K++ S HTG +PH+K N Q
Sbjct: 382 NPLHLKQQSYESPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGLDEDINDQTSI 438
Query: 499 WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
+ + S N S+ AWG ++KN +QL I + ELGVL P
Sbjct: 439 Y--IGSHNFSQGAWGKMEKNATQLFISNTELGVLYPP 473
>gi|324522792|gb|ADY48131.1| Tyrosyl-DNA phosphodiesterase, partial [Ascaris suum]
Length = 306
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 142/297 (47%), Gaps = 22/297 (7%)
Query: 134 QQDEQDNENGKNSEEALCNFHVSRDKLPST-FRLLRVQGLPAWANTSCVSIRDVIQ-GDI 191
+ D D + + ++ F L S ++ G P +T+ S+ ++++
Sbjct: 7 ENDGDDASSARTPSASMVKFRKQDSPLLSNRLYFTKIVGHPCRYSTNAFSLSELLELISP 66
Query: 192 IVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHM------KRNKPANWILHK 245
I +I N+M+D+ WLL P + +I GE++GT H+ +R K N + +
Sbjct: 67 IASIHFNFMIDLHWLLSQYPERCSAYPISIIVGENNGT-NHLDVRAEARRCKADNVSVGR 125
Query: 246 PPLPISFGTHHSK-AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSE 304
L + +GTHHSK ++ + +++ TANL+ DW++K+Q + P+ +
Sbjct: 126 ARLVLPYGTHHSKLSIFETDSEMIHVVISTANLLQNDWDSKTQAFYHCSAPIVNGEVEEG 185
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
+ F DLI YL+ ++ G + +FS R+I+S+PGYH G
Sbjct: 186 QNNFRKDLISYLNAY------SSSSDFGMIEYWRDRIANADFSDVNARIISSIPGYHVGD 239
Query: 365 SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK---WM-AELSSSMSSG 417
++GH++LR VL+ + KK V QFSS+GSL K W+ A+ S++ G
Sbjct: 240 QKDRYGHLRLRRVLRSLQLD--LKKPSFVAQFSSIGSLGPKPDSWLTAQFLQSLAGG 294
>gi|358056499|dbj|GAA97673.1| hypothetical protein E5Q_04351 [Mixia osmundae IAM 14324]
Length = 686
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 119/450 (26%), Positives = 194/450 (43%), Gaps = 54/450 (12%)
Query: 110 LSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRV 169
L R+Q R G +++ + N ++ E A FH S L FR V
Sbjct: 228 LQRAQARAQALGLVEPAIATANIPSASTSTNVAHRHLENA---FHPS---LGIYFRKSAV 281
Query: 170 Q-GLPAWANTS--CVSIRDVI--QGDIIVAILSNYMVDIDWLLPAC--PVLAKIPHVLVI 222
+ A+ T+ +S++D+I + I ++S+Y D+DWL+ P L K +L +
Sbjct: 282 RPTFNAFHRTTEDALSLQDIIGPKDRIEKLVMSSYATDLDWLVAHVLPPELGKQ-VLLAL 340
Query: 223 HGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDW 282
G +D + N P + LH PP+ + G H K +L++Y R+ + TANL+ DW
Sbjct: 341 PGPADAPITSFVPNHP-HIKLHCPPVCRTSGAMHIKLILVVYDDFCRVAIPTANLVPYDW 399
Query: 283 NNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSAN--LPAHGNFKINPSF 340
+W+QDFP Q +L++ F L L L E S N LP +F
Sbjct: 400 QQIENAVWIQDFP--RQGSLAKPTRFAQTLHTTLRLLCIEEDSRNAVLPLDVDFS----- 452
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQFSSL 399
+ + R+I S PG SS + GH L LQ+ + L Q SS+
Sbjct: 453 ------AGISARMILSTPG---SSSSEPNGHKLLGQALQDLHLLPARDQDVRLECQGSSI 503
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTP---LGIGEPL-----IVWPTVEDVRCSLEGYAAGN 451
G+L+++W+ E SS+ P EPL IV+PT+ ++ + G A G
Sbjct: 504 GALNDEWLLEFYSSICGRPVRTMFPKVQTANFEPLRTLFRIVFPTLRNIENTHLGTAGGG 563
Query: 452 AIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQKLA-------WFLLT 503
+ + + K + S + R+ + H K A++ + A W +
Sbjct: 564 TLFCNRSTWENRHFPKEC--MRQSTSKRAGVVMHTKMILAQFRMSRHAQSDRPPGWLYVG 621
Query: 504 SANLSKAAWGALQKNNSQLMIRSYELGVLI 533
S N + AAWG + S + + ELG+++
Sbjct: 622 SHNFTAAAWG--KSTASSFKVSNCELGIVM 649
>gi|74830335|emb|CAI39050.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
Length = 521
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 168/350 (48%), Gaps = 56/350 (16%)
Query: 226 SDGTLEHMKR-NKPANWILHKPPL--PISFG-THHSKAMLLIYPRGVRIIVHTANLIHVD 281
+D LE ++ N NW + KP I+FG + H K +L +P+ +RI++ + NL D
Sbjct: 147 NDKKLEIIEEFNGHPNWTVIKPSKLSSITFGGSFHPKIWILKFPKFIRIVIGSQNLHVGD 206
Query: 282 WNNKSQGLWMQDFPL--KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INP 338
W SQ +W+QDF + + + +S+E F+ L ++L + LP+ F+ +
Sbjct: 207 WTVWSQAMWIQDFQIGNSELDEVSKE--FKVGLKEFLDNI--------LPSSHKFEDLLK 256
Query: 339 SFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF---KKSPLVYQ 395
+ ++F + +RLI S+PG TG+ + K+G M++++V+ F K+ + YQ
Sbjct: 257 IKYNDYDFQNINIRLITSIPGRFTGNQMNKYGMMRIQSVINSELKSSDFEIPKQVSIAYQ 316
Query: 396 FSSLGSLDEKWMAELSSSMSSGFSEDKTPL-----GIGEPL-----------IVWPTVED 439
+S+G LD ++ + S + + I + L +++PT +
Sbjct: 317 TTSIGQLDVNYVDFVQQCCSGQQIKQSQKIEQNNKSIAQMLFNQQEEKSILKLIYPTSDY 376
Query: 440 VRCSLEGYAAGNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIKTF 489
++ +AG +P Q+ + F K + +++ S H G +PH+K
Sbjct: 377 IQNQT---SAGPEYANPLFLRKQQYDNPKFPKNIFHRYQGSNYYYWHAGN---IPHLKVM 430
Query: 490 ARYN-GQKL---AWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
+K+ + S NLS+AAWG L+KN +QL I + ELGVL P
Sbjct: 431 IITGIDEKIDDKTSIYIGSHNLSQAAWGRLEKNATQLFISNTELGVLYPP 480
>gi|330842084|ref|XP_003293015.1| hypothetical protein DICPUDRAFT_99531 [Dictyostelium purpureum]
gi|325076694|gb|EGC30460.1| hypothetical protein DICPUDRAFT_99531 [Dictyostelium purpureum]
Length = 564
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 112/432 (25%), Positives = 181/432 (41%), Gaps = 64/432 (14%)
Query: 145 NSEEALCNFHVSRDKLPSTFRLLRVQGLPA--WANTSCVSIRDVI-QGDIIVAILSNYMV 201
N C L +TF L ++ P + + + ++I ++ + D+ A++ + +
Sbjct: 113 NEATTFCTIIGENYYLSNTFYLNTIKNQPKNLFNSPTTLTIEHLLLEKDMKSAMVCGFCL 172
Query: 202 DIDWLLPACPVLAKIPHVLV-------IHGESDGTLEHMKRNKPANWILHKPPLPISFGT 254
+ +W+ A+ HV + I E G + K N PPL S+ T
Sbjct: 173 ESEWIYKIF-YEAQGRHVPITFIRHYFISEEKKGIQQINKSTMAIN-----PPLG-SYQT 225
Query: 255 HHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLID 314
H K +LL++P +RII+ ++N +D+++ +Q +W QDF +K + + + D
Sbjct: 226 FHGKLILLVFPEFIRIIIPSSNPTQLDYDSLNQTIWFQDFQIKK----APKQATPSKDND 281
Query: 315 YLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYH----TGSSLKK-- 368
+L TLK+ S P+ F +++FS A+ LI SVPG++ GS + +
Sbjct: 282 FLKTLKYFLASIGCPS-------VKFLDEYDFSEASAHLIISVPGFYKHDGAGSGIIESD 334
Query: 369 ---WGHMKLRTVLQ-------ECTFEKGFKKS------PLVYQFSSLGSLDEKWMAELSS 412
G KL +VL+ E T K+ YQ SS+G +
Sbjct: 335 KPLMGIYKLESVLKKYYRNQDETTDYTVLDKNNQHCVRDFYYQASSIGGEKGNFRNNFVK 394
Query: 413 SMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLK----KY 468
+S PL I P W D R +A + + N DK KY
Sbjct: 395 HLSPSIENSDKPLHIIYPTDQWIKSNDHRLQ---HAGCLFLSNKNYNNDKSCFSYLSPKY 451
Query: 469 -WAKWKASHT----GRSRAM--PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQ 521
+ K H+ G S + P T + + K W S N S AAWGA QKN +Q
Sbjct: 452 DYRKHLVYHSKVLVGTSTRLNKPLKDTLNQRSNIKYDWVYAGSHNFSSAAWGAFQKNETQ 511
Query: 522 LMIRSYELGVLI 533
+ I +YE+GVL
Sbjct: 512 IQISNYEIGVLF 523
>gi|426193767|gb|EKV43700.1| hypothetical protein AGABI2DRAFT_121836 [Agaricus bisporus var.
bisporus H97]
Length = 635
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 116/436 (26%), Positives = 179/436 (41%), Gaps = 73/436 (16%)
Query: 148 EALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL 207
+ F R TFRL +V G N S ++ AILS+Y +D W+
Sbjct: 181 QTATRFAEPRKDGQRTFRLTQVLG-----NKS----------ELAFAILSSYSLDFPWIY 225
Query: 208 PACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG 267
+P ++V ++ G +K P W+ PPL FG H K MLL Y G
Sbjct: 226 EF--FDRSVPVIMVAQPDAMGQAA-IKYTFP-TWVKTTPPLRGGFGCQHMKFMLLFYKNG 281
Query: 268 -VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTLK-WPE 323
+R+++ TANLI DW + +W+QD P++ Q + F + + L + P
Sbjct: 282 NLRVVISTANLIAYDWRDMENSVWLQDLPMRPQLMPPDPKAKDFPSIMQQVLHAVNVAPA 341
Query: 324 FSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE 380
L H N + ++++S V L+AS+ G H G S+ K GH +L ++
Sbjct: 342 LRTMLSDHPNIPLRTIEDLRMRWDWSKVKVHLVASIAGKHEGWPSIVKTGHPRLMMAIRT 401
Query: 381 CTFE--KGFKKSPLVY--QFSSLGSLDEKWMAELSSSMSSGFSED--KTPLGIGEPL--- 431
+G K ++ Q SSLG+ +W+ E S +ED P E L
Sbjct: 402 MGLRPSRGLGKGNMIIECQGSSLGNFTTQWLNEFHWSARGESAEDWLDEPKRRREKLPYP 461
Query: 432 ---IVWPTVEDVRCSLEGYAAGNAIPSPQKN-VDKDFLKK--YWAKWKA----------- 474
I++PT + V+ S G G I +K K+F + Y +K KA
Sbjct: 462 PVRILFPTKKIVQESASGEPGGGTIFCRRKQWAAKNFPRDKFYVSKSKAGPVLMHSKMII 521
Query: 475 ---SHTGRSRAM------------PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKN- 518
HT + A P +K G W + S N + +AWG L +
Sbjct: 522 ATIQHTNPASASLNREGSDTEEDEPEVKIIEPAVG----WAYVGSHNFTPSAWGTLSGSA 577
Query: 519 -NSQLMIRSYELGVLI 533
N L I +YE+G++
Sbjct: 578 FNPILNITNYEIGIVF 593
>gi|74834157|emb|CAI44465.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
Length = 532
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 161/346 (46%), Gaps = 55/346 (15%)
Query: 230 LEHMKR-NKPANWILHKPP---LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
LE ++R N NW + KP + G H K +L +P+ +RI++ + NL DW
Sbjct: 153 LEIIERYNNYPNWTVIKPSKLSTNMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIW 212
Query: 286 SQGLWMQDFPL--KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFK 342
SQG+W+QDF + + + S+E F++ L ++L + LP F+ + +
Sbjct: 213 SQGMWIQDFKIGKSELDQTSQE--FKSMLREFLYEI--------LPTSHKFEDLLKIKYD 262
Query: 343 KFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE--CTFEKGFKKSPLV-YQFSSL 399
++F +RLI S+PG G+ L K+G M+L++V+ + C + K V YQ +S+
Sbjct: 263 DYDFKDVNIRLITSIPGRFVGNQLFKYGMMRLQSVIYQELCNNKMEIPKQVCVTYQTTSI 322
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTP-----LGIGEPL------------IVWPTVEDVRC 442
G +D ++ + + + + P I + + +++PT + +
Sbjct: 323 GQMDNNYVDFVLQCCTGRVYKQQLPNEQSTKKINQMILNQQEEEQSKLKLIYPTADYIEN 382
Query: 443 SLE-GYAAGNAIPSPQKNVDK-DFLKKYWAKWKAS-----HTGRSRAMPHIKTF------ 489
G N + Q++ + F K + K++ S HTG +PH+K
Sbjct: 383 QTHGGVDFANPLHLKQQSYESPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGLD 439
Query: 490 ARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
N Q + + S N S+ AWG ++KN +QL I + ELGVL P
Sbjct: 440 EDINDQTSIY--IGSHNFSQGAWGKMEKNATQLFISNTELGVLYPP 483
>gi|353240852|emb|CCA72701.1| hypothetical protein PIIN_06638 [Piriformospora indica DSM 11827]
Length = 636
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 111/413 (26%), Positives = 171/413 (41%), Gaps = 84/413 (20%)
Query: 191 IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPI 250
+ AILS+Y DI WL + + + V++++ ++ +K P NWI+ P L
Sbjct: 200 VAFAILSSYSTDIAWLYG---MFSPMTPVILVNQPTETGNSDVKGILP-NWIMTMPFLRG 255
Query: 251 SFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC--- 306
G H K MLL Y G +R+++ TAN I DW + W+QDFP + + E
Sbjct: 256 GRGAMHVKLMLLFYRSGRLRLVLPTANFIDYDWRDIENTAWVQDFPPLSKPAVGREATSS 315
Query: 307 GFENDLIDYLSTLKW-PEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTG 363
F + L L+ L P ++ L H N I K +NF+ AAV+LI S+ G + G
Sbjct: 316 AFASTLQMVLTKLNVSPALASLLTDHPNLPIKFIGDLGKGWNFTKAAVKLIPSMSGKYEG 375
Query: 364 -SSLKKWGHMKLRTVLQECTFEKGF----KKSP-----LVYQFSSLGSLDEKWMAELSSS 413
+ K GH+ L + + +G KK P + Q SS+G+ +W+ E SS
Sbjct: 376 WDQVLKQGHVSLMKGIMDIGAHRGHTKRDKKKPPEELIVECQGSSIGTYSAQWLQEFYSS 435
Query: 414 M----------SSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQK---- 458
S S K P PL I++P+++ V+ S+ G G +
Sbjct: 436 CCGISPETWLDKSKASRSKLP---KPPLRILFPSLKTVQSSVLGEDGGGTMFCRTSQWEG 492
Query: 459 -NVDKDFLKKYWAKWKASHTGRSRAMPHIK-----------------TFARYNGQK---- 496
N +D S++ R + + H K T +Y QK
Sbjct: 493 ANFPRDLFYD-------SNSKRGKVLMHTKMILGLWRDSSSDERSSTTLRKYAKQKEVLE 545
Query: 497 --------------LAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 533
W + S N + +AWG L + L I +YELG+LI
Sbjct: 546 IDSDDEVEIIDPFAAGWLYVGSHNFTPSAWGTLSGSAFTPVLNITNYELGILI 598
>gi|340503654|gb|EGR30196.1| tyrosyl-DNA phosphodiesterase family protein, putative
[Ichthyophthirius multifiliis]
Length = 547
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 152/323 (47%), Gaps = 39/323 (12%)
Query: 240 NWILHKPPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
NW L PP S G H K L+ + +R++V + NL DW+ S LW QDFPL
Sbjct: 201 NWTLIHPPKDASVSWGGAFHPKLWLIKFNEFLRVVVGSGNLHICDWSVWSNCLWYQDFPL 260
Query: 297 KDQNNLSEECG---------FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
K Q N +E F N LID ++ + N+ KI+ +++++S
Sbjct: 261 KKQQNAQKEKNQQQWDFEGDFSNTLIDIVNRM----MPDNVKYQNLLKID---LEEYDYS 313
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
+ L+++VPG H +++K G KL ++ F + K+ + Y+ S+LG++D K++
Sbjct: 314 EVKIILLSNVPGRHL--NIQKHGLGKLNAIIN--AFGQQNKQKIITYESSTLGNIDNKFL 369
Query: 408 AELSSSM---SSGF---SEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP---SPQK 458
E S+ S F S++ + +++PT + + C Y A P + +
Sbjct: 370 NEFYKSVNLASCDFQKNSKENIKDIQNQFKVIFPTKKYI-CQDTLYGIEYASPVILNEKY 428
Query: 459 NVDKDFLKKYWAKWKA--SHTGRSRAMPHIKTFARYNGQKL----AWFLLTSANLSKAAW 512
++ F+K + +++ + S +PH+K + + + + S N + AAW
Sbjct: 429 YSNEKFIKDVFYQFECPKGYFYHSGVIPHLKVMVVNDKEDQISDDSLIYVGSHNFTGAAW 488
Query: 513 GALQKNNSQLMIRSYELGVLILP 535
G +KN SQ+ + ELGV+ P
Sbjct: 489 GRYEKNYSQIYCMNTELGVVYPP 511
>gi|298705565|emb|CBJ28816.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 947
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 78/142 (54%), Gaps = 8/142 (5%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVL 220
P FR +R+ PA +N VS+ +++ G+ A++++Y+VD ++LL A P L +P +L
Sbjct: 178 PPLFRPVRIPSDPA-SNADGVSLGELLGGEYTEALVASYLVDAEFLLNAAPRLKTVPFLL 236
Query: 221 VIHGESDGTL-----EHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTA 275
+ + D L +KR PA + P I G HHSK +LL Y GVR+++ T
Sbjct: 237 IQGIKEDKPLVVSMKAFLKREHPAAVVYL--PKTIHIGLHHSKMILLKYKTGVRVVIMTC 294
Query: 276 NLIHVDWNNKSQGLWMQDFPLK 297
N+ DW + Q W QDFP K
Sbjct: 295 NMRPDDWGGRCQAAWYQDFPFK 316
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 65/164 (39%), Gaps = 59/164 (35%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIP----------------SPQKNVDKDFLKKYWAKWK-A 474
+VWPT E VR S G+ +G +P + Q N + LK W A
Sbjct: 658 VVWPTEEAVRTSNLGWESGAGMPCLTTTLYEGGYRKCETNYQLNRVMEELKPLLCTWTGA 717
Query: 475 SHTGRSRAMPHIKTFARY------------NGQKLAWFLLTSANLSKAAWGALQKNN--- 519
R AMPH+ T+ RY + LA+FLL S +L + AWG L+ N
Sbjct: 718 KGMDRGNAMPHLNTYYRYRELPRTDGSLKMSKDGLAYFLLASHSLHRIAWGYLEHRNPPQ 777
Query: 520 ---------------------------SQLMIRSYELGVLILPS 536
+QL I+S+++GV+ LPS
Sbjct: 778 RPRKRRVRMKPIYPPKPENTLPYKEEEAQLDIKSFDMGVMFLPS 821
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 57/114 (50%), Gaps = 22/114 (19%)
Query: 308 FENDLIDYLSTLKWPE--FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSS 365
FE LIDY + P + +L A ++FSSA V LI SVPG H G
Sbjct: 423 FEEILIDYFEHVGGPAAVWGRSLSA-------------YDFSSANVTLIPSVPGRHKGRD 469
Query: 366 LKKWGHMKLRTVL--QECTFEKGFKKSPLVYQFSSLGSLDE---KWMAELSSSM 414
L ++GHM++R VL +E G + + +Q +S+ +L KW+ E++ S
Sbjct: 470 LYRYGHMRVRAVLAREEVHVRPGSHR--VAFQAASIMNLSRRPYKWLGEITESF 521
>gi|422293515|gb|EKU20815.1| tyrosyl-dna phosphodiesterase 1, partial [Nannochloropsis gaditana
CCMP526]
Length = 160
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 80/140 (57%), Gaps = 9/140 (6%)
Query: 261 LLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK 320
LL+Y G+R+++ T+N I VDW+NK+QG+W+QDFP + + +++ F DL +YL L
Sbjct: 3 LLLYEGGIRVMICTSNFIEVDWHNKTQGIWVQDFPKLREEDKADDSLFGRDLREYLQALN 62
Query: 321 -WPEFSANLPAHGNFKINPSF-------FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHM 372
+ + H K +P + +FSSA L+ASVPG HTG K+GH+
Sbjct: 63 GFENECGSRGPHSPGKGHPLLTEMIEQELPRIDFSSAQAVLLASVPGKHTGHDKFKFGHL 122
Query: 373 KLRTVLQECTFEKG-FKKSP 391
KLR +L++ G F +P
Sbjct: 123 KLRRLLEKEPMPPGLFPSTP 142
>gi|16768278|gb|AAL28358.1| GH27933p [Drosophila melanogaster]
Length = 161
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/110 (46%), Positives = 70/110 (63%), Gaps = 6/110 (5%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFA 490
+++P+ +V S +G G +P + DK +LK Y +WK+S RSRAMPHIK++
Sbjct: 6 MIYPSYGNVAGSHDGMLGGGCLPYGKNTNDKQPWLKDYLQQWKSSDRFRSRAMPHIKSYT 65
Query: 491 RYN--GQKLAWFLLTSANLSKAAWGALQKNNS---QLMIRSYELGVLILP 535
R+N Q + WF+LTSANLSKAAWG KN++ L I +YE GVL LP
Sbjct: 66 RFNLEDQSVYWFVLTSANLSKAAWGCFNKNSNIQPCLRIANYEAGVLFLP 115
>gi|451845752|gb|EMD59064.1| hypothetical protein COCSADRAFT_41609 [Cochliobolus sativus ND90Pr]
Length = 568
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 98/422 (23%), Positives = 180/422 (42%), Gaps = 57/422 (13%)
Query: 175 WANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEH 232
+ T +I +V++ D + A++S++M D +WL PV K + +++ + +
Sbjct: 148 YPRTDDTTIDEVLEADTVRTAVISSFMWDSEWLFKKLDPV--KTKQLWIMNAKGKDIQQR 205
Query: 233 MKRNKPA----NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS-- 286
++ A N +H PP+ + HSK MLL P+ +RI++ TAN+I DW +
Sbjct: 206 WQKEMEAMGVPNLKIHFPPMDGMIQSMHSKLMLLFGPKKLRIVIPTANMIQTDWGEVAND 265
Query: 287 -------QGLWMQDFPLKDQNNLSEE---CGFENDLIDYLSTLKWPEFSANLPAHGNFKI 336
+++ D P + S + F +L+ +L K PE
Sbjct: 266 WQPGVMENSIFLIDLPRRGNETTSTKENMTRFGQELMYFLEMQKVPEMVLQ--------- 316
Query: 337 NPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQ 395
F+FS + + + S+ G H S G + L +Q+ + ++ L Y
Sbjct: 317 ---GILNFDFSQTSHLAFVHSIGGSHKTESEHPTGLLGLTRAIQDLHLDN-VEQMELDYA 372
Query: 396 FSSLGSLDEKWMAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGNA 452
SSLG++++ +++ L ++ F+ D P I +PT E V+ S+ G G
Sbjct: 373 ASSLGAINDSFLSRLYLAACGRCFAADTAMVPDVRNHIRIYFPTNETVKKSIGGPDCGGI 432
Query: 453 IPSPQKNVD-----KDFLKKYWAKWKASHTGRSRAMPHIKTF----ARYNGQKLAWFLLT 503
I Q+ + ++ L+ Y + R+ + H K + +G+ + W +
Sbjct: 433 ISLSQQRYNAATFPRECLRDY-------ESLRAGMLSHNKLLLARGRKKDGRPVGWVYVG 485
Query: 504 SANLSKAAWGALQ----KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGS 559
SANLS++AWG + L IR++E GV++ R VP + G+
Sbjct: 486 SANLSESAWGGQKVIKDGKMGSLNIRNWECGVVMTVPDDRLAGLDKDKDKTVPMSVFEGT 545
Query: 560 TE 561
E
Sbjct: 546 VE 547
>gi|169620876|ref|XP_001803849.1| hypothetical protein SNOG_13643 [Phaeosphaeria nodorum SN15]
gi|160704126|gb|EAT79090.2| hypothetical protein SNOG_13643 [Phaeosphaeria nodorum SN15]
Length = 384
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 147/338 (43%), Gaps = 62/338 (18%)
Query: 341 FKKFNFSSAAVRLIASVPGYHTGSSLK-----KWGHMKLRTVLQECTFEKGFKKSP---L 392
+ ++FSS I SVP + K +G + L +L KK+ +
Sbjct: 58 LRDYDFSSIKAAFIGSVPSRQKPIATKPAQQTSFGWLGLEEILSNVPITANAKKASAPHI 117
Query: 393 VYQFSSLGSLDEK--WMAELSSSM---SSGFSEDKTPLGIGEPL---------------- 431
V Q SS+ +L W+ + S + ++G E+ +P
Sbjct: 118 VMQVSSIATLGAAPTWLNKFQSVLCRSAAGQLEEAPAASSSKPPKLFSKGGMSSAKQDKP 177
Query: 432 ------IVWPTVEDVRCSLEGYAAGNAIP----SPQKNVDKDFLKKYWAKWKAS------ 475
I++PT ++VR SL+GY +G++I S Q+ ++L + WKA+
Sbjct: 178 LSPKFNIIFPTSDEVRTSLDGYDSGSSIHMKLLSIQQQKQLEYLHPLFCHWKATPDSNSK 237
Query: 476 -HTGRSRAMPHIKTFARYNGQK---LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGV 531
R A PHIKT+ RY+ +K + W ++TSANLSK AWG + + I+S+E GV
Sbjct: 238 GQAMRGPAAPHIKTYIRYSDEKHKTIDWAMVTSANLSKQAWGDVVNKKDETWIQSWEAGV 297
Query: 532 LILPSAKRHGCGFSCTSNIVPSEIKS--GSTETSQIQKTKLVTLTWHGSSDAGASSEVVY 589
++ P S + +VP K G+ + S K G+ + A V+
Sbjct: 298 VVWPEL----FAESKEAIMVPVFGKDMPGTEDVSSQDVNK-------GADEGQAGKTVIG 346
Query: 590 LPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRH 627
+PY+LP Y++++ PW + D G+ WP +
Sbjct: 347 FRMPYDLPLTPYTAKEKPWCAQMPSAEPDWMGRAWPGY 384
>gi|238496339|ref|XP_002379405.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
gi|220694285|gb|EED50629.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
Length = 491
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 121/259 (46%), Gaps = 41/259 (15%)
Query: 387 FKKSPLVYQFSSLGSLDEKWMAE-LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLE 445
FK+ L Y +KW+ + + +S+S + + P + I++PT +++R SL
Sbjct: 250 FKRDLLAYLTEYGPKKTDKWLKDVMFASLSPASTSTRQP----KYSIIFPTADEIRRSLN 305
Query: 446 GYAAGNAI----PSPQKNVDKDFLKKYWAKWKASH------------TGRSRAMPHIKTF 489
GY +G +I S + +++ Y W H GR RA PHIKT+
Sbjct: 306 GYGSGGSIHMKLQSAAQQKQLQYMRPYLRHWAGDHDTAEPSHTSKQDAGRRRAAPHIKTY 365
Query: 490 ARYNGQK----LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 545
R++ + + W ++TSANLS AWGA + ++ I S+E+G+++ P
Sbjct: 366 IRFSDAEKMDTIDWAMVTSANLSTQAWGAAVNASGEVRICSWEIGIVVWPQLYVQDTE-- 423
Query: 546 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 605
++ +VP+ K + E + + ++ T V+ L +PY+LP Y++ D
Sbjct: 424 -SATMVPT-FKRDTPEPLENKDSETTPDT------------VIGLRMPYDLPLTPYAAHD 469
Query: 606 VPWSWDKRYTKKDVYGQVW 624
PW ++ + D GQ W
Sbjct: 470 TPWCATAQHLEPDWLGQTW 488
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 122/254 (48%), Gaps = 48/254 (18%)
Query: 160 LPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMVDIDWLLPACPV-LAK 215
+PS F+L ++ L A + N V +R+++ +I NY+ D+D+++ + +
Sbjct: 85 IPSPFQLTHIRDLAASSDNNVDTVRLREILGDPMIRECWQFNYLHDVDFIMGQFDEDVRR 144
Query: 216 IPHVLVIHGESDGTLEHMKRNKPANWILHKPP------------LPISFGTHHSKAMLLI 263
+ V ++HG KR+ P + + +P +FGTHHSK M+L+
Sbjct: 145 LVKVKIVHGS-------WKRDAPNRVRIDEACSRYPNVEAVVAYMPEAFGTHHSKMMVLL 197
Query: 264 -YPRGVRIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE------CGFENDLIDY 315
+ V++++HTAN+I DW N Q +W PL+ ++ E+ F+ DL+ Y
Sbjct: 198 RHDDLVQVVIHTANMIPGDWANMCQAVWRSPLLPLQKTDDRVEDLILGSGARFKRDLLAY 257
Query: 316 LS------TLKWPE---FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL 366
L+ T KW + F++ PA + + P + F + R S+ GY +G S+
Sbjct: 258 LTEYGPKKTDKWLKDVMFASLSPASTSTR-QPKYSIIFPTADEIRR---SLNGYGSGGSI 313
Query: 367 KKWGHMKLRTVLQE 380
HMKL++ Q+
Sbjct: 314 ----HMKLQSAAQQ 323
>gi|392587577|gb|EIW76911.1| phospholipase D nuclease [Coniophora puteana RWD-64-598 SS2]
Length = 667
Score = 92.4 bits (228), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 108/470 (22%), Positives = 193/470 (41%), Gaps = 65/470 (13%)
Query: 188 QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH-GESDGTLEHMKRNKPANWILHKP 246
+ +I AILS++ I W+ PH VI + D + +N NW++ P
Sbjct: 220 KSNIEFAILSSFSTSISWIYEFFD-----PHTPVIFVAQPDSSGNAALKNVLPNWLMTTP 274
Query: 247 PLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ---NNL 302
L +G H K MLL Y G +R+++ TANLI DW + +W+QD P + ++
Sbjct: 275 FLRNGYGCQHMKFMLLFYKDGRLRVVISTANLIDYDWRDIENAVWLQDVPRRPSPIPHDP 334
Query: 303 SEECGFENDLIDYLSTLKWPEFSANLPA--HGNFKIN--PSFFKKFNFSSAAVRLIASVP 358
+ F + + + L ++ AN+ A H N + ++FS V+L+ S+
Sbjct: 335 KAKDDFPSIMQNVLRSVNVRPALANMLANDHPNLPLQTIADLRTHWDFSKVKVKLVPSIA 394
Query: 359 GYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSP----LVYQFSSLGSLDEKWMAELSSS 413
G H G ++ + GH +L +++ G K+ + Q SS+G+ +W+ E S
Sbjct: 395 GKHEGWPAVVQSGHPRLMKAVRDMGLRTGKGKAAKELVVECQGSSIGTYTTQWLNEFHHS 454
Query: 414 MSSGFSED--------KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL 465
+ED +T L I++P+++ VR + G G + F
Sbjct: 455 ARGESAEDWLDAPRSRRTKLPFPPVKIIFPSLKRVRATALGERGGGTM----------FC 504
Query: 466 KKYWAKWKASHTGR----------SRAMPHIKT-FARYNGQKLAWFLLTSANLSKAAWGA 514
K+ A+W+ + R R + H K + L + A SK+A
Sbjct: 505 KR--AQWEGKNFPRGSFYESESRGGRTLMHTKMIIGTFRSNPL---VSVGAGTSKSAPQK 559
Query: 515 LQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSE--IKSGSTETSQIQKTKL-- 570
Q +S+ ++ I + G + + N PS SGS+ +
Sbjct: 560 KQLEDSETEPEDDDVDPDIQIVNEPIGWAYVGSHNFTPSAWGTLSGSSFNPSLNNINYEL 619
Query: 571 -VTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDV 619
+ + + D S ++ PP++Y S+DVPW D+ +++
Sbjct: 620 GIVMPLYNDEDIDRVS-------CFKHPPKKYGSDDVPWMQDESLILREI 662
>gi|301770841|ref|XP_002920838.1| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like, partial
[Ailuropoda melanoleuca]
Length = 172
Score = 92.0 bits (227), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 76/131 (58%), Gaps = 6/131 (4%)
Query: 198 NYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMK-RNKP-ANWILHKPPLPISFGTH 255
NY D+DWL+ P + +L++HG+ H+ + KP N L + L I+FGTH
Sbjct: 2 NYCFDVDWLIKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTH 61
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--PLKDQNNLSEE--CGFEND 311
H+K MLL+Y G+R+++HT+NLIH DW+ K+QG+W+ P+ + S E F+ D
Sbjct: 62 HTKMMLLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPPIIHGTHRSGESTTHFKAD 121
Query: 312 LIDYLSTLKWP 322
LI YL P
Sbjct: 122 LISYLMAYNAP 132
>gi|390595745|gb|EIN05149.1| phospholipase D/nuclease, partial [Punctularia strigosozonata
HHB-11173 SS5]
Length = 622
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 122/507 (24%), Positives = 198/507 (39%), Gaps = 110/507 (21%)
Query: 113 SQKRVSNDGATNGELSSKKMRQQDEQDNE--NGKNSEEALCNFHVSRDKLPSTFRLLRVQ 170
S++RV D A + + E + NG+ + A + +D P TFRL +
Sbjct: 131 SKRRVRVDPALSSASGPSTSSRTTEMEPMFWNGEIRQTANAHVDPRKDTKP-TFRLTEII 189
Query: 171 GLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHGES 226
G + D+ AI++ Y +D WL P+ PV V+ +
Sbjct: 190 GK---------------KSDVKFAIIAGYCIDWAWLYHFFEPSTPV--------VVVAQP 226
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNK 285
D T + NWI PPL G H K MLL Y G +R+++ TAN I DW +
Sbjct: 227 DTTGARSVKEVLPNWIRTTPPLRGGRGCMHMKFMLLFYRTGRLRVVISTANFIDYDWRDI 286
Query: 286 SQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPA---------HGNFKI 336
+W+QD PL+ +++ D+ +T + + N+ A H + +
Sbjct: 287 ENTVWVQDVPLR-----QTPIRYDHKATDFPATFERVFKALNVEAALQALTINDHPDIPL 341
Query: 337 NPS---FFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKG-FKKSP 391
PS K++FS L+ASV G H G + + GH L +++ G ++
Sbjct: 342 -PSVTDLRTKWDFSKVKAHLVASVAGKHEGWPEVIRNGHTALMKAVRDMGARAGKGREVE 400
Query: 392 LVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVEDVRCS 443
L Q SS+G+ +WM E S +ED + L IV+P++ V+ S
Sbjct: 401 LECQGSSIGTYSTQWMNEFHYSCRGESAEDWLDQPKTRRAKLPWPPVKIVFPSLATVQAS 460
Query: 444 LEGYAAGNAI--PSPQKNVDKDFLKKYWAKWKASHTGRSRAMP---HIK----TFARYNG 494
G G I S Q +K F ++ + H RS+ P H K TF G
Sbjct: 461 RLGEKGGGTIFCRSNQWQAEK-FPRELF------HDSRSKRGPVLMHSKMVLATFRPKGG 513
Query: 495 QK---------------------------------LAWFLLTSANLSKAAWGALQKN--N 519
Q + W + S N + +AWG L +
Sbjct: 514 QSTLVDSDSETESETESESDEEVKIVEPKERKKKLVGWIYVGSHNFTPSAWGNLSGSAFG 573
Query: 520 SQLMIRSYELGVLILPSAKRHGCGFSC 546
+ I +YE+G+++ ++ + +C
Sbjct: 574 PIMNITNYEIGIVLPLTSGKEADAIAC 600
>gi|145527276|ref|XP_001449438.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|74834160|emb|CAI44466.1| Tyrosyl-DNA phosphodiesterase [Paramecium tetraurelia]
gi|124417026|emb|CAK82041.1| unnamed protein product [Paramecium tetraurelia]
Length = 532
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/345 (26%), Positives = 151/345 (43%), Gaps = 62/345 (17%)
Query: 234 KRNKPANWILHKPPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW 290
K N NW++ KP S G H K +L +P+ +RI++ + NL DW SQ +W
Sbjct: 158 KYNNYPNWMVIKPSKLGSCMFGGAFHPKIWILKFPKFIRIVIGSQNLHVGDWTIWSQAMW 217
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK-INPSFFKKFNFSSA 349
+QDF + F+ L ++L + LP F+ + + ++F
Sbjct: 218 IQDFKIGKSELDQGSQEFKTMLREFLYEI--------LPTSHKFEDLLKIKYDDYDFKDV 269
Query: 350 AVRLIASVPGYHTGSSLKKWGHMKLRTVL--QECTFEKGFKKSPLV-YQFSSLGSLDEKW 406
++LI S+PG G+ L K+G M+L++VL + C + K V YQ +S+G LD+ +
Sbjct: 270 NIKLITSIPGRFVGNQLFKYGMMRLQSVLYYELCNNKMEIPKQVCVTYQTTSIGQLDDNY 329
Query: 407 M----------------------AELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSL 444
+ +L+ + + E+++ L +++PT + +
Sbjct: 330 IDFALQCCTGKVYKQPLASEQNNKKLNQMILNQQEEEQSKLK-----LIYPTADYIENQT 384
Query: 445 EGYAAGNAIPSP-----QKNVDKDFLKKYWAKWKAS-----HTGRSRAMPHIK----TFA 490
G G +P Q + F K + K++ S HTG +PH+K T
Sbjct: 385 HG---GVDFANPLYLKKQLYENPKFPKHLFYKYQGSDHYYWHTGN---IPHLKVMIITGL 438
Query: 491 RYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
+ S N S+ AWG ++KN +QL I + ELGVL P
Sbjct: 439 DEEINDYTSIYIGSHNFSQGAWGKMEKNATQLYIANTELGVLYPP 483
>gi|449544019|gb|EMD34993.1| hypothetical protein CERSUDRAFT_54191, partial [Ceriporiopsis
subvermispora B]
Length = 621
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 108/406 (26%), Positives = 167/406 (41%), Gaps = 66/406 (16%)
Query: 173 PAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEH 232
P + T ++ RD ++ AILS Y ++ W+ P ++V H + G+ E
Sbjct: 176 PTFRLTEILAPRDEVE----CAILSAYCINWPWIYSF--FNRDTPVIMVAH-DQQGSNET 228
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWM 291
+K P NWI P L G H K MLL Y G +R++V TAN I DW + W+
Sbjct: 229 IKEVLP-NWIKTTPFLRNGMGCMHIKFMLLFYKSGRLRVVVTTANFIEHDWRDIENTAWV 287
Query: 292 QDFPLKDQ--NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFS 347
QD P + N + F I L TL N+ H N I K++FS
Sbjct: 288 QDIPKRPTPIPNDPKADDFPAAWIRVLRTL-------NI-QHPNLPIQRLEDLRMKWDFS 339
Query: 348 SAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGSLDE 404
AV+L+ S+ G H G ++ K GH L +++ KG K+ L Q SS+G+
Sbjct: 340 KVAVKLVPSLAGKHEGWPNVIKTGHTGLMKAVRDMGAQVPKG-KQMVLECQGSSIGTYST 398
Query: 405 KWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSP 456
+WM E S ++ ++ L +++P++ VR S+ G G +
Sbjct: 399 QWMNEFHCSARGESAQSWLDVSRARRSKLPWPAVKLIFPSLRTVRESVLGEPGGGTMFCR 458
Query: 457 QKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-----------FARYNG----------- 494
+ D K + S++ R + + H K F R
Sbjct: 459 RNQWDAPKFPK--ELFHDSNSKRGKVLMHSKMIIATFRSASTPFTRGQSETDSETEPESD 516
Query: 495 -------QKLAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGV 531
Q + W + S N + +AWG L + N L I +YELG+
Sbjct: 517 AEETESRQPIGWAYMGSHNFTPSAWGTLSGSAFNPTLNITNYELGI 562
>gi|449686459|ref|XP_002156800.2| PREDICTED: tyrosyl-DNA phosphodiesterase 1-like, partial [Hydra
magnipapillata]
Length = 206
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 114/235 (48%), Gaps = 64/235 (27%)
Query: 248 LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECG 307
LPI++GTHH RI W KS ++D +N+
Sbjct: 19 LPIAYGTHH------------RI-----------W--KSPLFAIKDVAYDGKND-----P 48
Query: 308 FENDLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVPGYHTGSS 365
F+ DL++YLS+ +GN K+ K+++ SSA V L++SVPG +TG
Sbjct: 49 FKEDLLEYLSS------------YGNSKLGMYAEKLKEYDMSSANVHLVSSVPGRYTGFK 96
Query: 366 LKKWGHMKLRTVLQECTFEKGFKKS--PLVYQFSSLGSLDE--------KWMAELSSSMS 415
+ +WGH+KLR +L K P++ QFSS+GSL +W++ LS+
Sbjct: 97 MHQWGHLKLRKLLLSYGPSKDLVNENWPIIGQFSSIGSLGSESSSWLCGEWLSSLSTCKD 156
Query: 416 SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP-----SPQKNVDKDFL 465
E K L +++PT+E+VR SLEGY+AG ++P + ++ KDFL
Sbjct: 157 DELKESKANLK-----LIYPTIENVRNSLEGYSAGCSLPYGIQVAMKQRYLKDFL 206
>gi|403418586|emb|CCM05286.1| predicted protein [Fibroporia radiculosa]
Length = 1675
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 106/419 (25%), Positives = 171/419 (40%), Gaps = 70/419 (16%)
Query: 168 RVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIH 223
R LP + T ++ RD DI AI+S Y+ + WL P PV+A +
Sbjct: 1234 RKDTLPTFRLTDILAPRD----DIAFAIVSAYVYNYSWLYSLFSPNTPVIA-------VA 1282
Query: 224 GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDW 282
+ +G E +K P NWI P L G H K MLL Y G +RI++ TAN+I DW
Sbjct: 1283 QDPEGQ-ETIKTILP-NWIKTTPFLRNGMGCMHMKFMLLFYKSGRLRIMISTANMIEYDW 1340
Query: 283 NNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGN-------FK 335
+ W+QD PL+ +S + E+ + L+ + L +H +
Sbjct: 1341 RDIENTAWVQDVPLRSA-PISHDPKAEDFAAAMVRVLRAISVAPALVSHLRNDHPDLPLQ 1399
Query: 336 INPSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLVY 394
F K++FS V L+ S+ G H G + GH L L+ K ++
Sbjct: 1400 RLEEFRMKWDFSKVKVSLVPSIAGKHEGWPKVILAGHTALMKALRNLNAAADKDKEVILE 1459
Query: 395 -QFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTVEDVRCSLE 445
Q SS+G+ +WM E S ++ + L I++PT + VR S
Sbjct: 1460 CQGSSIGNYSTQWMNEFHCSARGESAQSWLDVSKARRAKLSFPPVKILFPTSQYVRDSAL 1519
Query: 446 GYAAGNAIPSPQKNVD-KDFLKKYWAKWKASHTGRSRAMPHIKTF--------ARYNGQK 496
G A G + + + F ++ + + S + R + + H K + ++G
Sbjct: 1520 GEAGGGTMFCRRNQWEGAKFPRELFHQ---SRSKRGKVLMHSKMILGMFRSRPSVFSGSS 1576
Query: 497 --------------------LAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 533
+ W + S N + +AWG L + N L I +YELG+++
Sbjct: 1577 NRSDSETEDEDDPESDQEKLIGWLYVGSHNFTPSAWGTLSGSAFNPTLNITNYELGIVL 1635
>gi|190348157|gb|EDK40564.2| hypothetical protein PGUG_04662 [Meyerozyma guilliermondii ATCC
6260]
Length = 537
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 113/422 (26%), Positives = 171/422 (40%), Gaps = 94/422 (22%)
Query: 248 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 305
LP FGTHH+K M+ + + +++ T N+ +D +Q W L S
Sbjct: 163 LPDRFGTHHTKMMINFFENQLCEVVIMTCNITKLDIGGLTQMCWRSGRLALGTTKPDSMG 222
Query: 306 CGFENDLIDYLSTLKWPEFS--ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
F+ DL DYL K + S AN +++FSS V L+AS PGY
Sbjct: 223 YRFQRDLTDYLKRYKKKKLSELANR------------IMEYDFSSINVELVASAPGYFDM 270
Query: 364 SSL----KKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGS---------------- 401
+ + +G KL VL+ + K ++ Q SS+
Sbjct: 271 DDITTNSEVYGFGKLYQVLKRNNLLIKDTSKHHNMLSQVSSIAYPVVSEKFHTSSIFTHI 330
Query: 402 -----LDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI--- 453
D+ + LS + + K L P IV+PT ++V + G+ AG +I
Sbjct: 331 LCPLIFDDPQFSMLSPGRETTRNHQK--LYNYTPTIVYPTAQEVSQANVGFGAGASIHFN 388
Query: 454 ---PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNGQK---LAWFLLTSA 505
+N K + Y KW KA GR+ PH+K + NG + + W LL S
Sbjct: 389 YTRSHAHENQYKQNILPYLHKWTSKADTAGRNHVPPHVKLYLCDNGDEWKSIKWALLCSH 448
Query: 506 NLSKAAWGALQ-KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 564
NLSK AWGA + KN + + SYELGVL+ G+ T
Sbjct: 449 NLSKQAWGAPKSKNGRKYHVASYELGVLV-----------------------PGTPHT-- 483
Query: 565 IQKTKLVTLTW-HGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDVYGQ 622
+T T+ H S + L +P+++PP+ Y D PWS + + KD +G
Sbjct: 484 ------LTPTYPHDHSKNCLAP----LRLPFKVPPEPYGDSDQPWSPHMNFGELKDRFGN 533
Query: 623 VW 624
+
Sbjct: 534 TY 535
>gi|146413473|ref|XP_001482707.1| hypothetical protein PGUG_04662 [Meyerozyma guilliermondii ATCC
6260]
Length = 537
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 112/425 (26%), Positives = 170/425 (40%), Gaps = 100/425 (23%)
Query: 248 LPISFGTHHSKAMLLIYPRGV-RIIVHTANLIHVDWNNKSQGLWMQD-FPLKDQNNLSEE 305
LP FGTHH+K M+ + + +++ T N+ +D +Q W L S
Sbjct: 163 LPDRFGTHHTKMMINFFENQLCEVVIMTCNITKLDIGGLTQMCWRSGRLALGTTKPDSMG 222
Query: 306 CGFENDLIDYLSTLKWPEFS--ANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
F+ DL DYL K + S AN +++FSS V L+AS PGY
Sbjct: 223 YRFQRDLTDYLKRYKKKKLSELANR------------IMEYDFSSINVELVASAPGYFDM 270
Query: 364 SSL----KKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLGS--LDEKWMAELSSSMS 415
+ + +G KL VL+ + K ++ Q SS+ + EK+ S
Sbjct: 271 DDITTNSEVYGFGKLYQVLKRNNLLIKDTSKHHNMLSQVSSIAYPVVSEKFHT------S 324
Query: 416 SGFSEDKTPLGIGEP-----------------------LIVWPTVEDVRCSLEGYAAGNA 452
S F+ PL +P IV+PT ++V + G+ AG +
Sbjct: 325 SIFTHILCPLIFDDPQFSMLSPGRETTRNHQKLYNYTPTIVYPTAQEVSQANVGFGAGAS 384
Query: 453 I------PSPQKNVDKDFLKKYWAKW--KASHTGRSRAMPHIKTFARYNGQK---LAWFL 501
I +N K + Y KW KA GR+ PH+K + NG + + W L
Sbjct: 385 IHFNYTRSHAHENQYKQNILPYLHKWTSKADTAGRNHVPPHVKLYLCDNGDEWKSIKWAL 444
Query: 502 LTSANLSKAAWGALQ-KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
L S NLSK AWGA + KN + + SYELGVL+ G + T +K+
Sbjct: 445 LCSHNLSKQAWGAPKSKNGRKYHVASYELGVLVP------GTPHTLTPTYPHDHLKNC-- 496
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKRYTK-KDV 619
+ L +P+++PP+ Y D PWS + + KD
Sbjct: 497 --------------------------LAPLRLPFKVPPEPYGDSDQPWSPHMNFGELKDR 530
Query: 620 YGQVW 624
+G +
Sbjct: 531 FGNTY 535
>gi|384490985|gb|EIE82181.1| hypothetical protein RO3G_06886 [Rhizopus delemar RA 99-880]
Length = 338
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 141/314 (44%), Gaps = 45/314 (14%)
Query: 240 NWILHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF---- 294
N I+ +PPL + +G H+K MLL +R+++ +AN++ D+ ++MQDF
Sbjct: 18 NRIIIQPPLKDNKYGVFHNKLMLLFRSSSLRVVIGSANMVACDYEELENVVFMQDFPELI 77
Query: 295 -PLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRL 353
PLK +++ E F D+ D L ++ P K++FS A R+
Sbjct: 78 VPLKSESDFPE---FAKDICDVLDKMRVPTTVKEE------------LLKYDFSKAKARI 122
Query: 354 IASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLV-YQFSSLGSLDEKWMAELS 411
+ASV G G KK+GH +L ++++ T P V Q SSLGSL ++ E+
Sbjct: 123 VASVSGVFEGEEEYKKYGHTRLADIVRDITGPLDPNNYPKVEMQTSSLGSLSVSYLQEIY 182
Query: 412 SSMS--SGFSEDKTPLGIGE-----PL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 463
S S FS+ K + P+ I++PT + V S G A ++I
Sbjct: 183 QSFCGISSFSDGKAVRSSLQKNQLPPIDIIFPTRDTVTSSRYGGAGADSIC--------- 233
Query: 464 FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNS--- 520
F W K ++ H + A + + L + S N + +AWG + +
Sbjct: 234 FNTATWRKPTFPKQVMCDSISH-RQGALMHSKALTSMIFRSHNSTTSAWGKFTVSKASKL 292
Query: 521 -QLMIRSYELGVLI 533
+L I ++ELGV+
Sbjct: 293 PKLSISNWELGVVF 306
>gi|440632301|gb|ELR02220.1| hypothetical protein GMDG_01013 [Geomyces destructans 20631-21]
Length = 529
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 106/392 (27%), Positives = 178/392 (45%), Gaps = 53/392 (13%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ +A+LS++ D +W+L +A+ +L+ E ++++ P+
Sbjct: 93 IKIEEVLQKNDLDLAVLSSFQWDQEWILSKLD-MARTKLILIAQAVPRDDQEEVRKSAPS 151
Query: 240 NWILHKPP-LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFP 295
N P + T HSK LL +P +R++V +ANL+ DW +++ D P
Sbjct: 152 NVRFCFPSNKDETVSTMHSKLQLLAHPSHLRVVVPSANLVPYDWGETGVMENTVFLIDLP 211
Query: 296 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG-NFKINPSFFKKFNFS-SAAVRL 353
N + EN L + L+ F L A G + KI S K F+FS +A +
Sbjct: 212 RLAANKV---VSIEN-LTPFCRELR--RF---LKAQGLDSKITDSLLK-FDFSQTAGLAF 261
Query: 354 IASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKW----- 406
+ S+ G HT + K G+ L + +QE PL F +S+G+L + +
Sbjct: 262 VHSIGGNHTENDWKTIGYPGLGSAIQELGLAN---TGPLNVTFVSASIGALTDDFVLAIL 318
Query: 407 --------MAELS--SSMSSGFSEDKTPLGIGEPL------IVWPTVEDVRCSLEGYAAG 450
+ EL+ +S S + + T I++P+ E VR S G +G
Sbjct: 319 LACKGDDGLTELTWRTSTSPAYRKRTTKEETLLMEMEEGFRIMFPSHETVRTSKNGTNSG 378
Query: 451 NAIP-SPQKNVDKDFLKKYWAKWKASHTG---RSRAMPHIKTFARYNGQK-LAWFLLTSA 505
I P+ + F K+ + K+ G S+ + T +G + AW + SA
Sbjct: 379 GTICLDPKYYQREQFPKELFRDCKSKRAGLLLHSKLLFTAPTHMNADGDRGKAWAYVGSA 438
Query: 506 NLSKAAWGALQKNNS----QLMIRSYELGVLI 533
NLS++AWG L KN S +L R++E GV+I
Sbjct: 439 NLSESAWGRLTKNKSTKQVKLYCRNWECGVVI 470
>gi|118399033|ref|XP_001031843.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila]
gi|89286177|gb|EAR84180.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila SB210]
Length = 562
Score = 89.0 bits (219), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 110/468 (23%), Positives = 193/468 (41%), Gaps = 73/468 (15%)
Query: 131 KMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGL----PAWANTSCVSIRDV 186
K RQ ++Q+N+ + N V L + + + L P + +
Sbjct: 81 KFRQNEQQENQPKNKLTDFYMNQLVHHKNLKTNKHFINFRALFYEDPFYKEKNLCP---- 136
Query: 187 IQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP------AN 240
+ +I A L+ +D + +LP +V V+ + + KRN N
Sbjct: 137 -KKTLISAFLTTKGLDEELVLPLVKA-----NVKVVIADDKIKQWNEKRNVIKNHQYFEN 190
Query: 241 WILHKPP---LPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
+ + PP L ++G HSK +L +P+ +RI++ T NL + W N S +W +DF L
Sbjct: 191 FTIVYPPKDYLSKTWGCFHSKLWILKFPKFLRIVIGTGNLRILHWTNWSNIIWFKDFELI 250
Query: 298 DQN-NLSEECGFENDLIDYLST-LKWPEFSANLPAHGNFKINPSF--------------- 340
Q +S+ + N I S +K N + +N F
Sbjct: 251 PQQIQVSQSLDYFNSNISIGSKGVKVVNLEKNYRNINDVDMNEDFIDVLNEFIDKICPYF 310
Query: 341 ---------FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP 391
+ + L++S+PG +GS + +G M++R + Q K
Sbjct: 311 DVKEMLDINLRNYQIKGINFMLVSSLPGKFSGSQIHDYGKMRIRKICQVFNPRNIDSKKV 370
Query: 392 LVYQFSSLGSLDEKWMAE-LSSSMSSGFS-----EDKT----PLGIGEPLIVWPTVEDVR 441
L Q +SLG++D ++ E L + F +DK P E +++P+ + ++
Sbjct: 371 LYSQSTSLGTIDRTFVNEFLFCFLPYQFCSEIELKDKVKKNDPEKNDEIRLIFPSKDYIQ 430
Query: 442 C-SLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKA--------SHTGRSRAMPHIKTF-- 489
+L+G + + K K+ FLK + +++ S + +PH KT
Sbjct: 431 NKTLDGAGYSDTLFLTSKRYQKESFLKNIFYQFQCKQMDSLGESQDKQKGIIPHFKTMIV 490
Query: 490 ARYNGQ--KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
NG+ + + S N S+AAWG L K+N+QL I + ELG+LI P
Sbjct: 491 CEQNGEINDDSIIYIGSHNFSEAAWGKLNKDNTQLYISNTELGILIPP 538
>gi|393244923|gb|EJD52434.1| phospholipase D/nuclease [Auricularia delicata TFB-10046 SS5]
Length = 628
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 115/441 (26%), Positives = 174/441 (39%), Gaps = 108/441 (24%)
Query: 170 QGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-DG 228
Q PA+ + + +D +Q + +LS+Y DI WLL P +P +LV H + DG
Sbjct: 183 QNGPAFRLSQIIGNKDELQ----LVVLSSYSNDIPWLLTMFP--DTVPVILVNHPVTPDG 236
Query: 229 T-LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKS 286
L ++ N++L P + G H K MLL Y G +R+ + TAN I DW +
Sbjct: 237 NDLTYLS----TNFVLVTPSMQQDSGAMHIKLMLLFYKSGRLRVAIPTANFIQYDWRDIE 292
Query: 287 QGLWMQDFPLKDQ----NNLSEECGFENDLIDYLSTLKWPE---------FSANLPAHGN 333
+W+QD P +D L +E F L+D L L F+ L A
Sbjct: 293 NAVWLQDIPKRDAPTPFAKLPKELDFAAQLVDTLRALNVGRAVESQMQNGFAPPLRALDE 352
Query: 334 FKINPSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEK-GFKKSP 391
++ +++S RL+ S+ G H G + + GH L L++ + G K
Sbjct: 353 LRM------WWDWSKVTARLVPSLKGSHEGWPRVTRVGHTSLLKALRDLGADTPGSCKLL 406
Query: 392 LVYQFSSLGSLDEKWMAELSSSMSSGFSE-----------DKTPLGIGEPL-IVWPTVED 439
L Q SS+G +W + S SE D P P+ I++P++
Sbjct: 407 LECQGSSIGQYTRRWTHQFYRSARGEPSEKFSWIAKQSAFDNLPY---PPIKIIFPSLRT 463
Query: 440 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKA----------SHTGRSRAMPHIKT- 488
V S+ G G + K WKA S++ R R + H K
Sbjct: 464 VEESVLGKPGGGTMFCDPKT------------WKAPKFPRENFFDSNSKRGRVLMHTKMI 511
Query: 489 ---FAR------------------------------YNGQKLA-WFLLTSANLSKAAWGA 514
F R +KLA W + S N + AAWG
Sbjct: 512 LGIFERDTMFTAKGKRRDDPYDTDDDEVTIVEPKSTKKREKLAGWLYVGSHNFTPAAWGH 571
Query: 515 LQKNNSQ--LMIRSYELGVLI 533
L ++ L IR+YELGV++
Sbjct: 572 LSGSSITPILSIRNYELGVVL 592
>gi|392563164|gb|EIW56343.1| phospholipase D/nuclease [Trametes versicolor FP-101664 SS1]
Length = 641
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 169/399 (42%), Gaps = 69/399 (17%)
Query: 190 DIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHG-ESDGTLEHMKRNKPANWILH 244
DI AI+S + W+ P PV+A V H + T++ + NWI
Sbjct: 216 DIEFAIVSAFCWSYQWMYQLFSPNTPVIA------VDHDPRGNATIKAIL----PNWIRT 265
Query: 245 KPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQ--NN 301
P L FG H K MLL+Y G +R++V TANL+ DW + +W+QD P +
Sbjct: 266 TPFLRNGFGCMHMKFMLLLYRDGRLRVVVSTANLVEYDWRDIENSVWVQDIPKRPSPVTQ 325
Query: 302 LSEECGFENDLIDYLSTLKWPEFSANL--PAHGNFKIN--PSFFKKFNFSSAAVRLIASV 357
++ F + ++ L L N+ H N + ++FS L+ SV
Sbjct: 326 PADTEDFASAMVRVLHALNVAPALINMLRNDHPNLPLQRLEDLRSHWDFSRVKAALVPSV 385
Query: 358 PGYHTG-SSLKKWGHMKLRTVL--QECTFEKGFKKSPLVYQFSSLGSLDEKWMAE--LSS 412
G H G + GH +L L E T K K+ L Q SS+G+ W+ E LS+
Sbjct: 386 AGKHEGWPKVILTGHTRLMKALLDMEATVPKD-KELALECQGSSIGNYSSMWVNEFFLSA 444
Query: 413 SMSSGFSEDKTP----LGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVD-KDFL 465
S S +TP + P I++PT + VR S+ G + G + +K + +F
Sbjct: 445 RGESTQSWLETPKTRRAKVPYPAVKILFPTAQYVRESVLGESGGGTMFCRRKQWEGANFP 504
Query: 466 KKYWAKWKASHTGRSRAMPHIK----TFARYNGQ------------------------KL 497
++ + + + + R R + H K TF G KL
Sbjct: 505 RQLFHQ---TRSKRGRVLMHSKMILGTFKEKTGTLDGHQRASATRSSEVDTDEDAGSAKL 561
Query: 498 A-WFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 533
A W + S N + +AWG L + N L I +YELGV+I
Sbjct: 562 AGWVYVGSHNFTPSAWGTLSGSGFNPSLNINNYELGVVI 600
>gi|118380757|ref|XP_001023542.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila]
gi|89305309|gb|EAS03297.1| Tyrosyl-DNA phosphodiesterase family protein [Tetrahymena
thermophila SB210]
Length = 584
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 101/403 (25%), Positives = 175/403 (43%), Gaps = 65/403 (16%)
Query: 190 DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNKPA--NWILHK 245
D+ ++ Y + + L+P +L H ++ + + D +++ + + NW L
Sbjct: 166 DVQSIFMTTYGYETELLMP---ILKSNKHFVLANDKPMHDKSIKDVIKENDGFKNWTLIH 222
Query: 246 PPLPISF---GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK----- 297
PP +S G H K L+ + +R+++ + NL DW+ S LW QDFPL
Sbjct: 223 PPKDVSSSWGGAFHPKLWLIKFSSFLRVVIGSGNLHVSDWSVWSNCLWYQDFPLNANKKE 282
Query: 298 --DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
Q S + FE D L+ L + + KIN +++S + LI+
Sbjct: 283 KTQQKPSSPKWDFEGDFKITLTELVKKMMPSGINYQDLLKIN---LDDYDYSEVKIILIS 339
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQECT-FEKGFKKSP----------LVYQFSSLGSLDE 404
S+ G HT + K+G K+ ++Q T EK P + YQ +SLG++D
Sbjct: 340 SIVGRHT--DIYKYGRGKMYKIIQAFTQNEKNITNQPNNNLTQNQKIITYQCTSLGNIDN 397
Query: 405 KWMAELSSSMSSG-----FSEDKT-----PLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI 453
++ E + ++ +DK P I + +++PT E + E G
Sbjct: 398 TFLNEFYTCATANKPITELKKDKANKKQDPNLIEQKFRLIFPTAEYI---YEDTIYGPEY 454
Query: 454 PSP----QKNVDKD-FLKKYWAKWKA-----SHTGRSRAMPHIKTFARYNG----QKLAW 499
SP QK +K+ F K + ++ + HTG A+PH+KT + + +
Sbjct: 455 ASPVILNQKYYEKESFPKSIFHQFCSPDNYFYHTG---AIPHLKTMVVTDNDLQIKDDSI 511
Query: 500 FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGC 542
+ S N + AAWG +K+ SQ+ + ELG+ I P + C
Sbjct: 512 VYIGSHNFTAAAWGRFEKDYSQIYNSNTELGI-IYPPMEDSAC 553
>gi|336379126|gb|EGO20282.1| hypothetical protein SERLADRAFT_452973 [Serpula lacrymans var.
lacrymans S7.9]
Length = 620
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 134/316 (42%), Gaps = 48/316 (15%)
Query: 163 TFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPH 218
TFRL V G + +I AILS+Y + + W+ P+ PV
Sbjct: 169 TFRLTEVLGK---------------KSEISFAILSSYSLSVSWIYEFFDPSVPV------ 207
Query: 219 VLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANL 277
+I + D + + +N NWI P L G H K MLL Y G +R+++ TANL
Sbjct: 208 --IIVAQPDESGQATIKNVLPNWIRTTPFLRYGRGCMHMKFMLLFYKTGRLRVVISTANL 265
Query: 278 IHVDWNNKSQGLWMQDFPLKDQ---NNLSEECGFENDLIDYLSTLK-WPEFSANLPA-HG 332
I D+ + +W+QD PL+ Q N+ F + L L P + +L H
Sbjct: 266 IDYDYRDIENAIWLQDVPLRPQPLPNDPKAVDNFATVMQRVLHALNVRPALATHLKTDHP 325
Query: 333 NFKINP--SFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKK 389
N + +++S V+L+ S+ G H G + GH +L +++ G K
Sbjct: 326 NLPLQSIDHLRSHWDWSKVKVKLVPSIAGKHEGWPKVILTGHTRLMKAIRDMGLRTGKGK 385
Query: 390 SP----LVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTV 437
+ + Q SS+G+ +WM E S +ED + L IV+P++
Sbjct: 386 AAKDLVIECQGSSIGTYSTQWMNEFHWSARGESAEDWLDEPKTRRAKLPYPAVKIVFPSL 445
Query: 438 EDVRCSLEGYAAGNAI 453
+ V+ S+ G G +
Sbjct: 446 KTVQTSVLGEPGGGTM 461
>gi|336366433|gb|EGN94780.1| hypothetical protein SERLA73DRAFT_171190 [Serpula lacrymans var.
lacrymans S7.3]
Length = 607
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 134/316 (42%), Gaps = 48/316 (15%)
Query: 163 TFRLLRVQGLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPH 218
TFRL V G + +I AILS+Y + + W+ P+ PV
Sbjct: 156 TFRLTEVLGK---------------KSEISFAILSSYSLSVSWIYEFFDPSVPV------ 194
Query: 219 VLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANL 277
+I + D + + +N NWI P L G H K MLL Y G +R+++ TANL
Sbjct: 195 --IIVAQPDESGQATIKNVLPNWIRTTPFLRYGRGCMHMKFMLLFYKTGRLRVVISTANL 252
Query: 278 IHVDWNNKSQGLWMQDFPLKDQ---NNLSEECGFENDLIDYLSTLK-WPEFSANLPA-HG 332
I D+ + +W+QD PL+ Q N+ F + L L P + +L H
Sbjct: 253 IDYDYRDIENAIWLQDVPLRPQPLPNDPKAVDNFATVMQRVLHALNVRPALATHLKTDHP 312
Query: 333 NFKINP--SFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKK 389
N + +++S V+L+ S+ G H G + GH +L +++ G K
Sbjct: 313 NLPLQSIDHLRSHWDWSKVKVKLVPSIAGKHEGWPKVILTGHTRLMKAIRDMGLRTGKGK 372
Query: 390 SP----LVYQFSSLGSLDEKWMAELSSSMSSGFSED--------KTPLGIGEPLIVWPTV 437
+ + Q SS+G+ +WM E S +ED + L IV+P++
Sbjct: 373 AAKDLVIECQGSSIGTYSTQWMNEFHWSARGESAEDWLDEPKTRRAKLPYPAVKIVFPSL 432
Query: 438 EDVRCSLEGYAAGNAI 453
+ V+ S+ G G +
Sbjct: 433 KTVQTSVLGEPGGGTM 448
>gi|403372152|gb|EJY85968.1| Tyrosyl-DNA phosphodiesterase [Oxytricha trifallax]
Length = 676
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 108/428 (25%), Positives = 169/428 (39%), Gaps = 100/428 (23%)
Query: 191 IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG----TLEHMKRNKPANWILHKP 246
I AILS + DI+ + KIP + + + D L K N N++ +
Sbjct: 264 IQRAILSTMVFDIELITQLLD--EKIPMTIFLDRDKDDKGPQVLYEEKLN--LNFVFQQK 319
Query: 247 PLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF---PLKDQNNLS 303
S+ HSK +L + +R+IV +ANL DW S W QDF L N +S
Sbjct: 320 WGGNSYSVFHSKLILYEFDDRLRVIVTSANLYTQDWELLSNVTWFQDFFKAELGKNNEIS 379
Query: 304 EEC---------------------------------GFENDLIDYLSTLKWPEFSANLPA 330
+ F+ L DYL + +P
Sbjct: 380 QSSTTQSVKVATKEERKNPFNFNEQRPQQQQQPFQNDFKQYLKDYLEVI--------IPK 431
Query: 331 HGNFKINPSF-----FKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEK 385
N K+ F KF+FS+A LIAS+ G H KK+G +L +++ +K
Sbjct: 432 --NVKVREVFRQKIDLDKFDFSTANAFLIASINGRHADREFKKYGQARLGELVRNV--DK 487
Query: 386 GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL-----IVWPTVEDV 440
+K+ + YQ SS+G L+ K+M +SM + F + K + E + +++PT+ V
Sbjct: 488 QHEKT-ITYQTSSIGKLNTKFM----TSMYNQFGKSKK---VSEDIHQNFRVIFPTIGYV 539
Query: 441 RCSLEGYAAGNAIPSPQKNVDKDFLKKYW-------AKWKASHTGRSRAMP----HIKTF 489
S G ++I + YW K G+S+ + H K
Sbjct: 540 STSHLGPENASSII---------LQESYWYDTPGFPRKSFYRQVGKSKLLDKNLYHTKFM 590
Query: 490 ARYNGQKLAW------FLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 543
+ K + S N S AWG L+KN+SQ+ I ++ELGV+ P
Sbjct: 591 IITDKGKESEITDDTVLYFGSHNFSGGAWGNLEKNDSQISISNWELGVVFGPQVGSQEMK 650
Query: 544 FSCTSNIV 551
+N+V
Sbjct: 651 QKMINNMV 658
>gi|406860446|gb|EKD13504.1| tyrosyl-DNA phosphodiesterase domain protein [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 669
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/375 (24%), Positives = 163/375 (43%), Gaps = 50/375 (13%)
Query: 169 VQGLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
V G+P + + I +V+Q D+ +A+LS + ++ +W+ K+ + V+ ++D
Sbjct: 198 VNGMPRHGDD--IKIEEVLQKNDLELAVLSAFQIEPEWVESKLNQRTKV--IWVLQAKTD 253
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK-- 285
+++ PAN+ P + + HSK LL +P +R++V +ANL DW
Sbjct: 254 AERQNISSKAPANYRFCFPNMEGNINCMHSKLQLLAHPTHLRVVVPSANLTSYDWGETGI 313
Query: 286 -SQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKF 344
++ D P + F N+L+ ++ + + +A + + F
Sbjct: 314 MENICFLIDLPRLPPGEKTVVTNFANELVYFVEQMGLDQKTA------------TSLQNF 361
Query: 345 NFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLD 403
+FS +A + + S+ G H+GS+ K+ G+ L T +++ + + + +S+GSL+
Sbjct: 362 DFSRTAHLAFVHSIGGSHSGSTWKRTGYCGLGTAIKKLGMATEVDLN-IEFLSASIGSLN 420
Query: 404 EKWMA--ELSSSMSSGFSE-----DKTPLGIGEPL--------------IVWPTVEDVRC 442
+ +M L++ G +E +K G I +PT E V
Sbjct: 421 DSFMECLYLAAQGDDGATEYRWRTEKPTKSKGRSAAEHKLLGNVNSNCRIYFPTKETVEA 480
Query: 443 SLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK----L 497
S G G I K D D F +K K+ G M + FAR QK +
Sbjct: 481 SRGGVTGGGTICLQSKWFDSDTFPRKLMRDCKSVRKGI--LMHNKMIFARARDQKQYPKI 538
Query: 498 AWFLLTSANLSKAAW 512
AW + S NLS++AW
Sbjct: 539 AWAYVGSHNLSESAW 553
>gi|393219182|gb|EJD04669.1| phospholipase D/nuclease [Fomitiporia mediterranea MF3/22]
Length = 583
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 159/362 (43%), Gaps = 59/362 (16%)
Query: 120 DGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDK-LPSTFRLLRVQGLPAWANT 178
DG+T+ L + ++ + +G+ + + N V RDK + TFRL + G
Sbjct: 81 DGSTSAGLKVSRGKENESDLFWDGELRQ--VANRLVDRDKDVWPTFRLSEIIG------- 131
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHGESDGTLEHMK 234
+ DI +AILS+Y +DWL P P+ VLV DG +K
Sbjct: 132 --------PKSDITLAILSSYSNAVDWLYDFFEPTTPI------VLVNQPGEDGN-SGLK 176
Query: 235 RNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD 293
P N ++ KP + G H K +LL Y G +RI + TAN + DW + W+QD
Sbjct: 177 ELAP-NILMTKPFIRNGRGCMHIKILLLFYKDGRLRICLPTANFVEYDWRDIENTAWVQD 235
Query: 294 FPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPA------HGNFKINP-----SFFK 342
P++ + D+ TL+ N+PA GNF P
Sbjct: 236 VPMRKTT-----IRHDPKAADFPGTLQRVLHKLNVPAALTKLLDGNFPELPIEALSELRM 290
Query: 343 KFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSL 399
++++S V+L+AS+ G + G +++ GH L +QE T KG K+ L Q SS+
Sbjct: 291 RWDWSKVKVKLVASLAGKYEGWDEVERTGHPALAKAIQELGVTPPKG-KELVLECQGSSI 349
Query: 400 GSLDEKWMAELSSSMSSGFSE------DKTPLGIGEPL--IVWPTVEDVRCSLEGYAAGN 451
G+ +WM E+ S ++ + + PL I++P++ V+ S+ G G
Sbjct: 350 GTYSRQWMDEIYCSAKGQSAKAWLNKPRSQRMKLAWPLIKILFPSLATVKDSVLGMPGGG 409
Query: 452 AI 453
+
Sbjct: 410 TM 411
>gi|395329020|gb|EJF61409.1| phospholipase D/nuclease [Dichomitus squalens LYAD-421 SS1]
Length = 656
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 110/419 (26%), Positives = 167/419 (39%), Gaps = 70/419 (16%)
Query: 172 LPAWANTSCVSIRDVI--QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
+PA N + +++ + DI AI+S Y D ++ + P + V H T
Sbjct: 210 IPAQDNRPLFRLSEILTLKEDIEFAIISAYCWDYKFVYQLMD--RRTPVIAVDHSP---T 264
Query: 230 LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQG 288
E + NWI P L FG H K MLL + G +RI+V TANL+ DW +
Sbjct: 265 GEASIKAILPNWIRTTPFLRGGFGCMHMKFMLLFFRTGRLRIVVSTANLVEYDWRDIENT 324
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANL-PAHGNFKIN---------- 337
+W+QD P + ++ + D+ S L N+ PA N N
Sbjct: 325 VWVQDVPKRPSPEPADP-----KVEDFASALVRMLHGVNVAPALVNHLKNEYPNLPLQRL 379
Query: 338 PSFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQ 395
++FS RLI S+ G H G + GH L L++ E K L Q
Sbjct: 380 EELRTHWDFSRVKARLIPSIAGKHEGWPKVILTGHTCLMKSLKDIGAETPKDKDLVLECQ 439
Query: 396 FSSLGSLDEKWMAELSSSMS--------SGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY 447
SS+G+ W+ E S G + L + I++PT + VR S+ G
Sbjct: 440 GSSVGAYTTAWLNEFYCSARGESAQTWLDGPKSRRAKLPLPPIKILFPTAQYVRDSVLGE 499
Query: 448 AAGNAIPSPQKNVD-KDFLKKYWAKWKASHTGRSRAMPHIK----TF------------- 489
G + +K + K+F ++ + + + + R R + H K TF
Sbjct: 500 VGGGTMFCRRKQWEGKNFPRELFHQ---TRSKRGRVLMHSKMVLGTFRDKRRKQQTLTDS 556
Query: 490 -------------ARYNGQKLAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 533
+R Q W + S N + +AWG L + N L I +YELGVLI
Sbjct: 557 EDEAEDGRNADSGSRDRQQLAGWVYVGSHNFTPSAWGTLTGSAFNPTLNITNYELGVLI 615
>gi|452985745|gb|EME85501.1| hypothetical protein MYCFIDRAFT_133255 [Pseudocercospora fijiensis
CIRAD86]
Length = 482
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 110/448 (24%), Positives = 194/448 (43%), Gaps = 59/448 (13%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLL-----PACPVLAKIPHVLVIHGESDGTLEHMK 234
+ + +V++ + A+LS + DIDWLL P K V+ + D
Sbjct: 70 IKLEEVLEPSSVRTAVLSAFQWDIDWLLRKLKTPLNGGSTKCVFVMQAKEKEDRDQWRED 129
Query: 235 RNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLW 290
+ ++++ P + HSK MLL +P +RI + TANL++ DW Q ++
Sbjct: 130 ASDMSHFLRFCFPNMSGLISCMHSKLMLLFHPHKLRIAIPTANLLNFDWGETGQMENSVF 189
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+ D P G + L D S + E + G + KF+FS+
Sbjct: 190 LIDLPRYSD-------GLKASLEDLPSFGR--ELMYFIQKQGLDQDVRDGVLKFDFSATR 240
Query: 351 -VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWM 407
+ + +V G H + G + L + ++E G S L +F SS+G L+E +
Sbjct: 241 DMAFVHTVGGVHYKDEAARTGLLGLSSAVRELGLSTG---SDLEIEFAASSIGMLNEAQV 297
Query: 408 AELSSSMSSGFSEDKTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKD 463
+L ++ + + I +PT + VR S G +AG + K+
Sbjct: 298 NDLHTAARGKPQQSSSTTETSTARKNVRIYFPTADTVRSSTAG-SAGTICLQRKYFEAKN 356
Query: 464 FLKKYWAKWKASHTGRSRAMPHIKTF-ARYNGQKLAWFLLTSANLSKAAWGAL--QKNNS 520
F + + +K++ G + H K AR +K+AW + SAN+SK+AWG L +++ +
Sbjct: 357 FPRDIFRDYKSTRRG---LLSHNKILCARSRKEKVAWVYVGSANMSKSAWGELGAKRDEN 413
Query: 521 QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSD 580
++ R++E GV ILP A++ V E T+ + LV++
Sbjct: 414 KITCRNWECGV-ILPVARK-----------VKDENGDEETDDEGEDEKALVSMN------ 455
Query: 581 AGASSEVVYLPVPYELPPQRYSSEDVPW 608
A + V+ L P+E+P + Y+ + PW
Sbjct: 456 --AFANVIDL--PFEVPGEEYAGRE-PW 478
>gi|406602541|emb|CCH45857.1| Tyrosyl-DNA phosphodiesterase 1 [Wickerhamomyces ciferrii]
Length = 587
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 111/498 (22%), Positives = 199/498 (39%), Gaps = 110/498 (22%)
Query: 179 SCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHV----LVIHGESDGTLEHM 233
+ V I DV+ ++ L +Y D++++LP H L I ++ L+
Sbjct: 142 NSVIISDVLSSPNLRSCYLFSYQHDLEFILPQF-------HSNNIDLTIVYQTGTVLDSP 194
Query: 234 KRNKPANWILHKPPLPISFGTHHSKAMLLIY-PRGVRIIVHTANLIHVDWNNKSQGLWMQ 292
KR N + +P + +HH K ++ +Y V++ + + N+ ++W+ +Q +W
Sbjct: 195 KRALFRNVQFIEVAMP-PYSSHHPKLIINVYNDDTVQLFLVSCNMTFMEWSTNNQMIWQS 253
Query: 293 DFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVR 352
KD N S++ F+ L +Y+ + P+ + KK++F+S
Sbjct: 254 PRLHKDLN--SKDTVFKTHLFNYIKNYQKPQLDTLV----------VLLKKYDFNSIIGD 301
Query: 353 LIASVPGYHTGSSLKKWG--------------HMKLRTVL-QECTFEKGFKKSPLVYQFS 397
++S T WG H K R +L Q + + +P + Q +
Sbjct: 302 FVSSATS--TSDKFGFWGLYNSLLSKGLIPRKHEKERQLLYQTSSIASAIRHTPTINQSA 359
Query: 398 SLGS------LDEKWMAELSSSMSSGFSEDKTPLGIG-------------EPLIVWPTVE 438
++ + K+ S+S F PL G +P I++P++
Sbjct: 360 NIFTHLLLPLFSGKYTNHGRLSISRDF-----PLSNGFISVEQFSKEYKVKPYIIYPSLS 414
Query: 439 DVRCSLEGYAAGN-AIPSPQKNVDK---DFLKKYWAKWKASHTGRSRAMPHIKTF---AR 491
DVR SL GY +G + +P +K DFL + S++ + + P F +
Sbjct: 415 DVRNSLFGYGSGGWSHFNPHSKWNKPMNDFLTP--KVFHHSYSQQRKTNPSHTKFLIMSS 472
Query: 492 YNGQKLAWFLLTSANLSKAAWGALQKNNSQLM------IRSYELGVLILPSAKRHGCGFS 545
N + L W TS N+SK AWG L + +YE G+L+ PS +G G
Sbjct: 473 DNFKTLDWVFFTSTNMSKQAWGTPPTKKDLLSLPPKSNVSNYETGILLCPSD--YGSGI- 529
Query: 546 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 605
K + L + + + +YLP + LPP++YS++D
Sbjct: 530 -----------------------KFIPLEFGQEKNLEENEVPIYLP--FRLPPEKYSNQD 564
Query: 606 VPWSWDKRYTKKDVYGQV 623
PW K + D+ G +
Sbjct: 565 EPWCVSKSHDLPDILGNL 582
>gi|261190935|ref|XP_002621876.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis SLH14081]
gi|239590920|gb|EEQ73501.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis SLH14081]
Length = 696
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 121/482 (25%), Positives = 206/482 (42%), Gaps = 82/482 (17%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNK 237
+ +V+Q D+ +A+LS+YM ++DW+ + K L+I GE D E K
Sbjct: 242 IKFEEVVQKSDLELAVLSSYMWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTK 299
Query: 238 PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL----WMQ 292
+ L PP+ HSK MLL +P +RI V +ANL+ DW + + ++
Sbjct: 300 SMGSVRLCFPPMEPQVNCMHSKLMLLFHPSYLRIAVPSANLVPFDWGEQGGVMENIVFLI 359
Query: 293 DFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFS 347
D PLK +L+ G F +DL+ +L ++NL + KK F+FS
Sbjct: 360 DLPLKSP-DLANGPGTSFLDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFS 403
Query: 348 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 406
+ + + ++ G HT +K G L + + + + L Y SS+GSL+E++
Sbjct: 404 ATKDIAFVHTIGGSHTDPKWRKTGLCGLGSAITALGLQTT-RDINLDYVTSSVGSLNEQF 462
Query: 407 MAE--LSSSMSSGFSE------------------DKTPLG--IGEPLIVWPTVEDVRCSL 444
+ L++ SG E +T G + +V+P+++ VR S
Sbjct: 463 LRSMYLAAQGDSGLKELTLRTSKRFPSENWGVVTKRTDGGKWKDKFRVVFPSLDTVRKSK 522
Query: 445 EGYAAGNAI---------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ 495
G I + K++ +D + + + R I + + +
Sbjct: 523 GGAENAGTICFQSKWYNSATFPKDIMRDNISRREGLLMHNKILFVRPEKPITSLKDNSTR 582
Query: 496 KLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIV 551
W + SANLS++AWG L + S +L R++E GV+I RH +S +
Sbjct: 583 YAGWAYVGSANLSESAWGRLVLDRSTTKPKLNCRNWECGVVI---PIRHNDAGKLSS--I 637
Query: 552 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRYSSEDVPW 608
PS +G T T K + +SD G+ V+ +PVP +P RY + P+
Sbjct: 638 PS---TGRTAT---LLAKSESEDSSANSDDGSEVTTVFEPTIPVPMIVPAPRYHGRNRPF 691
Query: 609 SW 610
+
Sbjct: 692 FY 693
>gi|327354754|gb|EGE83611.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis ATCC 18188]
Length = 696
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 121/482 (25%), Positives = 205/482 (42%), Gaps = 82/482 (17%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNK 237
+ +V+Q D+ +A+LS+YM ++DW+ + K L+I GE D E K
Sbjct: 242 IKFEEVVQKSDLELAVLSSYMWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTK 299
Query: 238 PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL----WMQ 292
+ L PP+ HSK MLL +P +RI V +ANL+ DW + + ++
Sbjct: 300 SMGSVRLCFPPMEPQVNCMHSKLMLLFHPSYLRIAVPSANLVPFDWGEQGGVMENIVFLI 359
Query: 293 DFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFS 347
D PLK +L+ G F +DL+ +L ++NL + KK F+FS
Sbjct: 360 DLPLKSP-DLANGPGTSFLDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFS 403
Query: 348 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 406
+ + + ++ G HT +K G L + + + + L Y SS+GSL+E++
Sbjct: 404 ATKDIAFVHTIGGSHTDPKWRKTGLCGLGSAITALGLQTT-RDINLDYVTSSVGSLNEQF 462
Query: 407 MAE--LSSSMSSGFSE------------------DKTPLG--IGEPLIVWPTVEDVRCSL 444
+ L++ SG E +T G + +V+P++ VR S
Sbjct: 463 LRSMYLAAQGDSGLKELTLRTSKRFPSENWGVVTKRTDGGKWKDKFRVVFPSLNTVRKSK 522
Query: 445 EGYAAGNAI---------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ 495
G I + K++ +D + + + R I + + +
Sbjct: 523 GGAENAGTICFQSKWYNSATFPKDIMRDNISRREGLLMHNKILFVRPEKPITSLKDNSTR 582
Query: 496 KLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIV 551
W + SANLS++AWG L + S +L R++E GV+I RH +S +
Sbjct: 583 YAGWAYVGSANLSESAWGRLVLDRSTTKPKLNCRNWECGVVI---PIRHNDAGKLSS--I 637
Query: 552 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRYSSEDVPW 608
PS +G T T K + +SD G+ V+ +PVP +P RY + P+
Sbjct: 638 PS---TGRTAT---LLAKSESEDSSANSDDGSEVTTVFEPTIPVPMIVPAPRYHGRNRPF 691
Query: 609 SW 610
+
Sbjct: 692 FY 693
>gi|295662314|ref|XP_002791711.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279837|gb|EEH35403.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 589
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 125/509 (24%), Positives = 205/509 (40%), Gaps = 111/509 (21%)
Query: 145 NSEEALCNFHVSRDKLPSTFRLLRVQGLPAWA--NTSCVSIRDVIQGDIIVAILS-NYMV 201
NS+ A V + +PS +L RV+ PA + NT V +RD++ +I NY+
Sbjct: 54 NSKIARQESPVMPNGIPSPIQLTRVRDFPASSENNTDTVKLRDILGDPLIKECWQFNYIF 113
Query: 202 DIDWLLPACPV-LAKIPHVLVIHG----ESDGTL---EHMKRNKPANWILHKPPLPISFG 253
DID+L+ + + V +IHG ES + E +R ++ +P +FG
Sbjct: 114 DIDYLMSQFDQDVRDLVKVKIIHGSWKRESPNRIHIDEGCRRYPNVEPMVAY--MPEAFG 171
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 313
THHSK M++I +++ + +K W+++ N+LS ++L
Sbjct: 172 THHSKMMIIIKHDDQAQNHKISSVATLGQTDK----WLKETLF---NSLSPPSARSSELF 224
Query: 314 DYLSTLKWPEFSANLPAHGNFKI---NPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWG 370
+ +N PA NF I P ++ S+ GY +G S+
Sbjct: 225 ---------KTESNSPA--NFSIIFPTPDEIRR------------SLNGYMSGGSI---- 257
Query: 371 HMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEP 430
HMKL++ Q+ Q L +W + ++D G P
Sbjct: 258 HMKLQSAAQQ-------------KQLQYLRPYLCRWAGDA--------NDDGGVKSAGGP 296
Query: 431 LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFA 490
R LEG ++ D LKK + + R A PHIKT+
Sbjct: 297 ------ATSKRKRLEGNDVSESV------QDCAALKKEHRPIREAGRRR--AAPHIKTYV 342
Query: 491 RYNGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP------------ 535
R++ + W ++TSANLS AWGA ++ I SYE+GVL+ P
Sbjct: 343 RFSDTDMTTIDWAMVTSANLSLQAWGAAANAKKEIRICSYEIGVLVWPDLFVDEEIDDSD 402
Query: 536 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL----TWHGSSDAGASSE--VVY 589
G G + + SG+ T ++ +V + +A SS+ +V
Sbjct: 403 EPLTKGKGKDNSRREI-----SGNKNTKDVKTAVMVPCFKRDMPEAAENAARSSDTTLVG 457
Query: 590 LPVPYELPPQRYSSEDVPWSWDKRYTKKD 618
+PY+LP Y+++D PW Y++ D
Sbjct: 458 FRMPYDLPLHSYTAKDQPWCATATYSEPD 486
>gi|392580440|gb|EIW73567.1| hypothetical protein TREMEDRAFT_70993 [Tremella mesenterica DSM
1558]
Length = 758
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 116/477 (24%), Positives = 184/477 (38%), Gaps = 119/477 (24%)
Query: 190 DIIVAILSNYMVDIDWLLPACPVLAKIPHVLV------IHGESDGTLEHMKRNKPANWIL 243
+I + ILS +++D DWL P K+P V+V +H +G ++ + +
Sbjct: 335 EIKLIILSTFVLDDDWLSGILPDPQKVPTVIVRPHPKEMHSTYNGKVQAQVTGE----VF 390
Query: 244 HKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDF-PLKDQNN 301
P + G H K + Y G +R+++ TAN + DW+ ++QDF P K +
Sbjct: 391 CYPLMLDERGAAHMKYAWIFYKTGRLRVMISTANFVPYDWDWIENTTFVQDFLPRKPTSP 450
Query: 302 LSEECGFENDLIDYLSTL--------------KWPEFSANLPAH--GNFKINPSFFKKFN 345
G D + + +L + ++LP G F+ K++
Sbjct: 451 APTTKG--EDFVAHFRSLFIHLKVHKALRYLKDQHKAGSDLPPQVSGAFE----GLDKYD 504
Query: 346 FSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSL 402
+S +VRLI SV GYH G K+G +L VL++ + K LV +F SSLG
Sbjct: 505 WSRVSVRLIMSVAGYHHGYDQADKYGMTRLGKVLKDEGLVQS-KGERLVAEFQGSSLGQY 563
Query: 403 DEKW---MAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQK 458
+ +W +L + D PL I++P++ V S G G +
Sbjct: 564 NIEWYNTFYQLCTGKDVRALVDHPKYKDWPPLKIIFPSLATVEASELGKDGGGTM----- 618
Query: 459 NVDKDFLKKYWAKWKASHTGRSRAMPHIK----TFARY------------NGQKLA---- 498
K F + S + R + H K TF +G++ A
Sbjct: 619 FCGKAFTANTKHLFHHSESKRGGVLMHTKMLIGTFEPIPRSLGFTSVDCKSGKRKASEME 678
Query: 499 ------WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI-LPSAKRHGCGFSCTSNIV 551
W + S N S AAWG + +L IR+YELG+L LP K
Sbjct: 679 ESPYGGWIYVGSHNFSAAAWGTMNFKEKRLTIRNYELGILFPLPRDK------------- 725
Query: 552 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 608
A A +++V PY+ P ++YSS D+PW
Sbjct: 726 -----------------------------ARAMADIV---APYKRPARQYSSNDIPW 750
>gi|302695723|ref|XP_003037540.1| hypothetical protein SCHCODRAFT_47163 [Schizophyllum commune H4-8]
gi|300111237|gb|EFJ02638.1| hypothetical protein SCHCODRAFT_47163 [Schizophyllum commune H4-8]
Length = 646
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 96/403 (23%), Positives = 162/403 (40%), Gaps = 76/403 (18%)
Query: 188 QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPP 247
+ +I AILS+Y +D +W + V+++ DG + +N NWI P
Sbjct: 212 KSEIEFAILSSYALDAEWTYS---FFERDTPVIIVQQTKDG--DASIKNWLPNWIRASPF 266
Query: 248 LPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC 306
L +G H K MLL Y G +R+ + TANL+ D+ + W+QD P + + +
Sbjct: 267 LRNGYGCMHMKFMLLFYKTGRLRVYIPTANLVQYDYRDIENFAWLQDIPRRPAHKPEPKP 326
Query: 307 GFEN------DLIDYLSTLKWPEFSANLPAHGNFKIN--PSFFKKFNFSSAAVRLIASVP 358
E+ +++ L+ + +P H N + + +++S V L+AS+
Sbjct: 327 NPEDFPSIMQRVLEALNIRPAQLETNTIPQHPNLPLQSISDLRRLWDWSLVKVHLVASLH 386
Query: 359 GYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSPLV-YQFSSLGSLDEKWMAELSSSM-- 414
G + G S+ + GH +L ++ ++ V Q SS+G W+ E+ SM
Sbjct: 387 GKYEGWPSVLQVGHPRLMKAVRNMGLAVDKEREVEVECQGSSIGRCTSVWINEMYGSMRG 446
Query: 415 --------SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLK 466
++ + TPL + + IV+PT V + G G I F +
Sbjct: 447 QSAREWLDATKKRREATPLPLVK--IVYPTKATVHATAWGVNGGGTI----------FCR 494
Query: 467 KYWAKWKAS-------HTGRSRAMP---HIKTFARYNGQK-------------------- 496
+ A W+A H +S P H K K
Sbjct: 495 R--ATWEAKNFPRQLFHDSKSTGGPVLMHTKLIEAKTSAKPSTTSTNNNDINSTIDDIEV 552
Query: 497 ----LAWFLLTSANLSKAAWGALQKN--NSQLMIRSYELGVLI 533
L W + S N +++AWG L + N L + +YELGV+
Sbjct: 553 VHPALGWVYVGSHNFTQSAWGTLSGSGFNPVLNVTNYELGVVF 595
>gi|340518445|gb|EGR48686.1| predicted protein [Trichoderma reesei QM6a]
Length = 534
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 110/485 (22%), Positives = 188/485 (38%), Gaps = 117/485 (24%)
Query: 181 VSIRDVIQGD-IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS++ D +W+L + ++ +L+ + + M+ PA
Sbjct: 105 ITIEEVFQKDHLELALLSSFQWDEEWMLSKLDI-SRTKLLLLAFAKDEAQKNQMRGIVPA 163
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
N PP+ G HSK LL YP +R+++ T NL+ DW +++ D P
Sbjct: 164 NIKFCFPPMH-GVGAMHSKLQLLKYPNRLRVVIPTGNLVPYDWGETGVMENMVFLIDLPR 222
Query: 297 KDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRL 353
+ + + F +L+ +L A G + ++FS ++ +
Sbjct: 223 LENPATTPQSPTAFYTELVYFLQ------------ATGVGDKMVASLSNYDFSKTSDIAF 270
Query: 354 IASVPGYHTGSSLKKWGHMKLRTVLQECTFEKG-------FKKSPLVYQFSSLGSLDEKW 406
+ ++PG HTG + ++ G+ L + + ++ +SLG+L+ ++
Sbjct: 271 VHTIPGSHTGKAAERTGYCGLGASVAALGLASAEPVEVDLLARCGDLHCCASLGALNHEF 330
Query: 407 MAEL----------------SSSMSSGFSEDKTPLGIGEP------LIVWPTVEDVRCSL 444
+ + S + SS K P I +PT V S
Sbjct: 331 IEAIYNACRGRDGIEDFKNKSGAASSRSKAAKKPDEAASKELQERFRIYFPTERTVAGSR 390
Query: 445 EGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHT----------GRSRAMPHIKT-FARYN 493
G AG I AKW S T R R + H K F R
Sbjct: 391 GGRNAGGTI-------------CVQAKWWRSPTFPTELVRDVIARDRLLVHSKMIFVRRV 437
Query: 494 G------QKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCG 543
G Q+ W + SANLS++AWG L ++ S ++ R++E GV ILP
Sbjct: 438 GHDQTTQQRPGWAYVGSANLSESAWGRLSRDRSTKAIKMNCRNWECGV-ILP-------- 488
Query: 544 FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSS 603
+ ++K V + G A + V PVP ++P Y+S
Sbjct: 489 ---------------------VPESKAVDMARAGGDMAMFAGTV---PVPMQVPGPAYAS 524
Query: 604 EDVPW 608
D PW
Sbjct: 525 SDRPW 529
>gi|320165097|gb|EFW41996.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 545
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 97/420 (23%), Positives = 186/420 (44%), Gaps = 69/420 (16%)
Query: 165 RLLRVQGLPAWAN-TSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPA-CPVLAKIPHVLV 221
RL Q + + N +S ++ +D+I+ ++ A+ S+Y D DW + P++ +
Sbjct: 100 RLAEKQAMTSITNDSSSITFQDLIKPRELRRALFSSYEADTDWFVQQLAPMVRSRGASVQ 159
Query: 222 IHGESDGTLEHMKRNKPANWILHKPPLPI--SFGTHHSKAMLLIY-PRGVRIIVHTANLI 278
+ S T + N + ++ PL I + G H + MLL + +R+ V +A+L+
Sbjct: 160 LFVSSSPT---GRGNTALSPNINMTPLTIGKTSGRLHGRLMLLFHGSDTLRVAVTSASLV 216
Query: 279 HVDWNNKSQGLWMQDFPLKDQNNLSEECG--FENDLIDYLSTL-----KWPEFSANLPAH 331
DW + QDFP++ + E G F++ L++Y++ L K + PA
Sbjct: 217 PSDWGVLENVTYYQDFPIEAKRPTVTERGLAFQSTLMNYVTQLVAHQPKDDDVDDRHPAR 276
Query: 332 GNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLK----KWGHMKLRTVLQE--CTFEK 385
+ K NF + RLI+S P + S+L+ + G M L LQ T
Sbjct: 277 AARILKE--LKTVNFDTVEARLISSYPEH---SNLETNGCRQGLMALEQALQAEYSTLPA 331
Query: 386 GFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPL-----------IVW 434
SP++YQ SS+G + + W+ + +++ ++G + G P ++
Sbjct: 332 QVLNSPIIYQSSSIGQVSDPWVTQFATACNAGAPARISGESRGSPFAIDPADALKLQFIF 391
Query: 435 PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAK---------WKASHTGRSRAMPH 485
PT V +L+G+ G+ P + F +Y++ +++ H +P+
Sbjct: 392 PTTATVSQALQGFPEGH----PHR---LHFFPRYFSSTFPRGSLFDYQSKH---GNVLPN 441
Query: 486 IKTFARYNGQK--LAWFLLTSANLSKAAWG-ALQKNNSQL---------MIRSYELGVLI 533
K R ++ + + ++ S +L +WG ++S+L M+R++EL VLI
Sbjct: 442 SKVLLRVPDEQSTIGYAVIGSHSLGIGSWGNGAVSSDSKLGAKATSKPRMMRNFELSVLI 501
>gi|328769090|gb|EGF79135.1| hypothetical protein BATDEDRAFT_90149 [Batrachochytrium
dendrobatidis JAM81]
Length = 554
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 107/484 (22%), Positives = 192/484 (39%), Gaps = 116/484 (23%)
Query: 194 AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFG 253
A LS++ +D DWL P KI +++ + W+ P + +G
Sbjct: 117 ACLSSFSIDDDWLCDVFPSTIKICLARPKPKMVPESVDKLPVTNNILWVF--PKMSAGYG 174
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD----QNNLSEECGFE 309
H K LL YP+ +R+++ +ANL+ DW ++ QDFP+ + Q+ SE
Sbjct: 175 AMHIKFQLLWYPKFLRVVITSANLMPHDWQELENVVFYQDFPILNSRVRQSQHSETASSS 234
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSL--K 367
+ ++ TL S N+P + +K +FS A L+ S+PG H +S+ +
Sbjct: 235 TN--EFSKTLYNLLVSMNIPQSVIASV-----QKHDFSKALGMLVVSLPGKHDATSMETR 287
Query: 368 KWGHMKLRTVLQECT--FEKGFKKSPLVYQFSSLGSLDEKWMAELSS------------S 413
++G M L T Q + F +++ + Q +S+GS W+ + S S
Sbjct: 288 QFGSMGLCTASQVISRQFRFDLEQAIVCMQTASMGSTHPAWLRYMLSAFRGQDVIPETPS 347
Query: 414 MSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI-------PSPQKNVDKDFL 465
++S F++ + + EP+ I++P+ V S G G I + +++ +D +
Sbjct: 348 LASFFTQSMSSI---EPITILFPSRRTVETSRNGIPGGGTIFFSSKFWSTFPRHIIRDGV 404
Query: 466 KK-----------------YWAKWKASHTGRSRAMP-HIKTFARYNGQKL-----AWFLL 502
K Y S ++P H + A + KL +
Sbjct: 405 SKTQGILMHSKINVVIGIGYIDLLATSQQLDIVSVPIHTQDNAHDHNTKLEKEIHGYIYC 464
Query: 503 TSANLSKAAWG-----------------ALQKNNSQLMIRSYELGVLILPSAKRHGCGFS 545
S N ++AAWG ++Q + Q+ I+++ELG+L LP R C
Sbjct: 465 GSHNATQAAWGSVPVMRSSVSTSSQSCKSIQHGHLQVEIKNWELGIL-LPFRIRDVC--- 520
Query: 546 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 605
S G + ++ ++ +P+E PP +Y D
Sbjct: 521 -------------------------------SHSSVGFNPDLSFV-LPFEYPPAKYGPTD 548
Query: 606 VPWS 609
P+S
Sbjct: 549 KPFS 552
>gi|378727943|gb|EHY54402.1| tyrosyl-DNA phosphodiesterase 1 [Exophiala dermatitidis NIH/UT8656]
Length = 793
Score = 79.0 bits (193), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 110/278 (39%), Gaps = 81/278 (29%)
Query: 432 IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK----DFLKKYWAKWKASHTG--------- 478
I++PT ++V SL+GYA+G +I + L+ +W S TG
Sbjct: 515 IIFPTPQNVASSLDGYASGGSIHMKAQAASHLNQISLLRPSLCQWTRSQTGASSSSSLSG 574
Query: 479 RSRAMPHIKTFARYNGQ--------KLAWFLLTSANLSKAAWGAL-----QKNNSQLMIR 525
R A PH+KT+ R+ + + W LLTSANLS AWG + ++ +++++
Sbjct: 575 RHLAAPHVKTYIRFKSKPTTQHPTPDIDWALLTSANLSTQAWGVVREPKDKRKEKEVVVQ 634
Query: 526 SYELGVLILP-----------SAKRHGCG-------------FSCTSN------------ 549
S+E+GVL+ P + K+ G G T+N
Sbjct: 635 SFEIGVLVWPGLFGPEFEDEGTIKQDGAGSGRDARMGTGDYDIKNTTNPSKEDQSQNLNS 694
Query: 550 -------------------IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL 590
+ P+ I +G E + + ++ +V +
Sbjct: 695 VHSVRMAPVFGTDMPSQLQLQPANIGTGIVEDGTASGNGNENGNVNEKDVSSTTTTLVGI 754
Query: 591 PVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVWPRHF 628
+PY+LP Y D+PWS Y D +G+ WPR F
Sbjct: 755 RLPYDLPLTPYVETDMPWSPQGVYEVPDRHGRRWPRDF 792
Score = 75.5 bits (184), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 114/248 (45%), Gaps = 49/248 (19%)
Query: 145 NSEEALCNFHVSRDKLPSTFRLLRVQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDI 203
+S+ A N H +R + S FRL ++ LP+ N +S+ D++ +I A + NY D+
Sbjct: 100 SSKGAPPNGHAAR-LIASPFRLTSIRDLPSSQNIDTISLHDILGIPLIKEAWIFNYCFDV 158
Query: 204 DWLLPACP--VLAKIPHVLVIHGE---SDGT---LEHMKRNKPANWILHKPPLPISFGTH 255
DWL+ + +++ V V+HG DG +E R P N +P +FGTH
Sbjct: 159 DWLMSYFDEDIRSQV-KVKVVHGSWRAEDGNRLGIEDACRRWP-NVESVTAYMPDAFGTH 216
Query: 256 HSKAMLLI-YPRGVRIIVHTANLIHVDWNNKSQGLWMQD----FPLKDQNNLSEECG--- 307
HSK +L + ++++HTAN++H DW N +Q +W P NN + G
Sbjct: 217 HSKMFILFTHDDLAQVVIHTANMLHRDWTNMTQAVWQSPMLPVLPPTTNNNSTGAKGNQP 276
Query: 308 ----------------FENDLIDYLSTLKWPEFSANLPAHGN-FKINPSFFKKFNFSSAA 350
F++D++ YLS A+G K +F+FSS
Sbjct: 277 KSTSTSPIGSIGTGSRFKHDMMAYLS------------AYGTKTKSLREQLVRFDFSSVR 324
Query: 351 VRLIASVP 358
L+ASVP
Sbjct: 325 GALVASVP 332
>gi|403173802|ref|XP_003332829.2| hypothetical protein PGTG_14494 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170701|gb|EFP88410.2| hypothetical protein PGTG_14494 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 583
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/413 (23%), Positives = 164/413 (39%), Gaps = 70/413 (16%)
Query: 173 PAWANTSCVSIRDVI--QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTL 230
P NT + I D+I + I +A++S+Y++++ W+ + ++VI +D
Sbjct: 150 PDCPNT--LRIEDIIGPKDRIKMALVSSYVLELPWIHK---LFNPRTRIMVIRHHTD--C 202
Query: 231 EHMKRNKPANWILHKPPL------PISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN 284
K N+ AN L PP+ G H K ++ Y R+ + TAN + D+
Sbjct: 203 GSFKVNERANMFLCHPPMLKTANGNAKAGCMHIKFFIIFYDNFCRVAIPTANAVSFDYEF 262
Query: 285 KSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLK--WPEFSANLPAHGNFKINPSFFK 342
+W+QDF N + +D+ + TL LP K
Sbjct: 263 VENAIWIQDFRRFSGNTIGYNSRRSDDVPPFRKTLDDLLDRMGVPLPFRKP-------LK 315
Query: 343 KFNFSSAAVRLIASVPGYHTGSSLKKWGHM--KLRTVLQECTFEKGFKKSPLVYQFSSLG 400
+F SAA L+ S+ G H +S H+ +L+T+ + G + + L Q SS+G
Sbjct: 316 DHDFGSAAANLVVSIQGTHPANSPMGQAHLAEQLKTLGLQSGPGTG-RTATLECQGSSIG 374
Query: 401 SLDEKWMAEL----SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPS 455
S D KW+ S S + +ED PL +++PT+ VR S G A +
Sbjct: 375 SYDLKWLNNFYRCASGSPPTASTEDPDLQTKTPPLTVLYPTLHTVRNSHSGKAGAGTLFC 434
Query: 456 PQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKTF------------------------- 489
+ +K +F +A + TG + H+K
Sbjct: 435 NKATWEKANFPTHIFADTMSKRTG---VLMHVKMILGLFNSDSSAKSTSSTLDTASVEKS 491
Query: 490 ----ARYNGQKLAWFLLTSANLSKAAWGALQ-----KNNSQLMIRSYELGVLI 533
R N + + S N + AAWG +++ L I ++ELGV++
Sbjct: 492 GARDGRINKDHAGFLYIGSHNFTPAAWGKFNLKSGSDDSTSLEISNWELGVVL 544
>gi|367027210|ref|XP_003662889.1| hypothetical protein MYCTH_2304039 [Myceliophthora thermophila ATCC
42464]
gi|347010158|gb|AEO57644.1| hypothetical protein MYCTH_2304039 [Myceliophthora thermophila ATCC
42464]
Length = 646
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 104/450 (23%), Positives = 170/450 (37%), Gaps = 80/450 (17%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q + +A+LS+Y D +W+L + A+ +LV + E M+ N P
Sbjct: 215 IKIEEVLQKQHLHLAVLSSYQWDEEWMLSKIDI-ARTKLILVAFAADEAQKEEMRSNVPR 273
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
+ I P G+ HSK MLL Y +RI+V T NL+ DW +++ D P
Sbjct: 274 DRIRFCFPPMHGIGSMHSKLMLLKYENYLRIVVPTGNLMSFDWGETGTMENMVFILDLP- 332
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIA 355
K + E N D L L A G + + ++F+ A +
Sbjct: 333 KFETAEGREAQKLNRFADQLFYF--------LRAQGLDEKLVDSLRNYDFTEAGRYEFVH 384
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL--- 410
++PG HTG + G+ L Q G + P+ +SLG+++ + L
Sbjct: 385 TIPGSHTGDDALRTGYCGLG---QSVNALVGTRSEPVELDLVCASLGAVNYGLLTSLYYA 441
Query: 411 ---------------SSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPS 455
S F+ L I +P+ E V S G I
Sbjct: 442 CLGDPLREYEERASGSQRNRDAFTSRAISLVKEHMRIFFPSRETVLRSKGGKDGAGTIC- 500
Query: 456 PQKNVDKDFLKKYWAK-------WKASHTGRSRAMPHIKTF--------ARYNGQKLAWF 500
L K+W + + R + H K ++ +G+ A+
Sbjct: 501 --------LLSKWWQAPTFPRELVRDCKSVRQGVLMHTKALYVRPCSPTSQQSGRCFAY- 551
Query: 501 LLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 556
+ SANLS++AWG L ++ + +L R++E GVL+ CT V
Sbjct: 552 -VGSANLSESAWGRLSRDRASGKPKLTCRNWECGVLL------------CTDRTVEGSSG 598
Query: 557 SGSTETSQIQKTKLVTLTWHGSSDAGASSE 586
+GS V + W G + +G E
Sbjct: 599 AGSDNLGVFDGCVPVPMEWPGRAISGEGGE 628
>gi|317035597|ref|XP_001396653.2| tyrosyl-DNA phosphodiesterase [Aspergillus niger CBS 513.88]
Length = 640
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 109/477 (22%), Positives = 190/477 (39%), Gaps = 76/477 (15%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ +A++S++M +++WL + K +LV+ E D T +
Sbjct: 184 IKIEEVLQKADLELAVMSSFMWEMEWLFSKFNI-EKTRFILVMQAEDDATKRQYESETAT 242
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQD 293
N L PP+ HSK MLL +P +R++V TANL DW + +++ D
Sbjct: 243 MRNLRLCFPPMGGQVVCMHSKLMLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLID 302
Query: 294 FPLKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
P K N+ E+ F DL+ + LK N+ A F+FS ++
Sbjct: 303 LPKK---NVLEKPTTHFYEDLVVF---LKASTLHENIIAK---------LDNFDFSKTSK 347
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--A 408
+ ++ G HT ++ K+ G+ L ++ + + Y SS+G++ ++++
Sbjct: 348 YAFVHTIGGSHTDTAWKRTGYCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFLRCM 406
Query: 409 ELSSSMSSGFSEDKTPLGIGEPL-----------------------IVWPTVEDVRCSLE 445
L+S G +E P+ + +P+ V S
Sbjct: 407 YLASQGDDGLTEFSIRYAKTFPVPRRDNPSKLLKKDTGSEWSDRFRLYFPSQNTVATSKG 466
Query: 446 GYAAGNAIPSPQK-----NVDKDFLKKYWAKWKA--SHTGRSRAMPHIKTFARYNGQKLA 498
G + K N + L+ ++ K H P Q A
Sbjct: 467 GPRCAGTVCFQSKWYNGENFPRHILRDCESQRKGLLMHNKILYVRPDDPIPLSETTQCRA 526
Query: 499 WFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSE 554
W + SAN+S++AWG L ++ S +L R++E GV++ R S+
Sbjct: 527 WAYIGSANMSESAWGRLVQDRSTKSPKLNCRNWECGVIVPVIEDRTDS----------SD 576
Query: 555 IKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRYSSEDVPW 608
+K E K + +D GA+ VV+ +PVP +P RY PW
Sbjct: 577 LKDKIHEDKCKGKASEFSSLSSSDNDDGANLPVVFENTIPVPMRVPGARYGGGRKPW 633
>gi|212546293|ref|XP_002153300.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210064820|gb|EEA18915.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 684
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 116/498 (23%), Positives = 190/498 (38%), Gaps = 115/498 (23%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMK 234
N + I +V+Q D+ +A+LS + D+ W+ K ++V+ + + T L++ +
Sbjct: 232 NGDDIKIEEVLQTADLELALLSAFQWDMQWMFTKFRTPNKTRFLMVMQAKEESTRLQYQE 291
Query: 235 R--NKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQG 288
N P N L PP+ HSK MLL +P +RI+V +AN++ DW +
Sbjct: 292 ETANMP-NIRLCFPPMDGQVNCMHSKLMLLFHPEYLRIVVPSANMVPYDWGEQGGVMENT 350
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPA---HGNF--KINPSFFKK 343
+++ D P K ND D T + E S L A H N K++ FK+
Sbjct: 351 VFLIDLPKKST----------NDAADSPKTAFYEELSYFLKASTLHENIIAKLSAFDFKE 400
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFK-KSPLVYQF--SSLG 400
N + + ++ G H G SL + GH L + G K + P+ F SS+G
Sbjct: 401 TNRYA----FVHTIGGSHFGESLTRTGHCGLGKAVTSL----GLKTREPINIDFVTSSIG 452
Query: 401 SLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNV 460
SL +++M + S +T I +I+ +V C L G + NA +
Sbjct: 453 SLTDEFMRSIYLSAQG----KQTLYSIIRTIIL-----NVSCRLGGDGSTNAQRTTSSEW 503
Query: 461 DKDFLKKYWAKWKASHTGRSRAMPHIKTFAR--YNGQKL--------------------- 497
F Y ++ S + SR F + G K
Sbjct: 504 KSRFRVYYPSEQTVSQSKGSRRSAGTICFQEKWFTGPKFPRNTLHDCISRREGLLMHNKM 563
Query: 498 ------------------AWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLILP 535
W + SANLS++AWG + + +L R++E GVL+
Sbjct: 564 MFVRPEKPINLPGGSNCAGWAYVGSANLSESAWGKVVHDRVRKEPKLNCRNWECGVLV-- 621
Query: 536 SAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYL----- 590
+ + P+ G + K + +GA ++V +
Sbjct: 622 ----------PITELPPAAGSDGEEQNKDSAKKE---------DKSGAEGDIVEIFGSTV 662
Query: 591 PVPYELPPQRYSSEDVPW 608
PVP +P SE PW
Sbjct: 663 PVPMRVPAPSLGSELKPW 680
>gi|270017231|gb|EFA13677.1| hypothetical protein TcasGA2_TC001393 [Tribolium castaneum]
Length = 416
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 133/302 (44%), Gaps = 35/302 (11%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPV--LAKIPHVLVIHGESDGTLEHMKRNKPANWILHKP 246
G++ ++ N+M+DI WL+ L K P ++ E E +++ P N H
Sbjct: 120 GELKCSLQINFMIDIMWLMERYRERNLGKKPLTILYGDEFPKMKEFIEKFLP-NVSHHYV 178
Query: 247 PLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ----DFPLKDQNN 301
+ FG HHSK + Y +R+++ TANL + DWN+ +QGLW+ P
Sbjct: 179 KMKDPFGCHHSKIGIYFYEDNSLRVVISTANLYYEDWNHYNQGLWLSPPCPQLPETATEK 238
Query: 302 LSEE-CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGY 360
E GF++ L++YL NLP K + K+ +FS+ V L+ SVPG
Sbjct: 239 SGESPTGFKSSLLNYLK-------HYNLPV---LKPWIDYVKRADFSAVRVFLVTSVPGK 288
Query: 361 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSP---------LVYQFSSLGSLDEKWMAELS 411
H + H + + C+ K P ++ Q SS+GS+ + L
Sbjct: 289 HYPGTQGSHVHHVGDLLSRHCSLPA--KTGPDSEGPLSWGIIAQASSIGSMGKSPAEWLR 346
Query: 412 SSMSSGFSEDKTPLGIGEP----LIVWPTVEDVRCSLEGYAAGNAIP-SPQKNVDKDFLK 466
S++ S K + I++P+V++V G +G +P S Q N + +L+
Sbjct: 347 STLLRSLSGHKQTQLVSNSNATLSIIFPSVDNVMNGYFGAESGGCLPYSKQTNEKQRWLQ 406
Query: 467 KY 468
Y
Sbjct: 407 SY 408
>gi|225678545|gb|EEH16829.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 686
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 120/486 (24%), Positives = 199/486 (40%), Gaps = 82/486 (16%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR 235
N + I +VIQ D+ +A+LS+Y+ D DWL + K ++I GE + +
Sbjct: 221 NGDDIKIEEVIQKSDLELAVLSSYIWDADWLFSKFDI--KKSRFILIMGEKEEDKKRELE 278
Query: 236 NKPANW---ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQG 288
N + L PP+ HSK MLL + +RI++ +ANLI DW K
Sbjct: 279 NDTKSMGSVRLCFPPMEPQVNCMHSKLMLLFHLNHLRIVIPSANLIPFDWGEKGGIMENV 338
Query: 289 LWMQDFP-LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
+++ D P + + + F DL+ +L ++NL K NF
Sbjct: 339 VFLIDLPRISPSPDATPRTPFLEDLVYFLQ-------ASNLDEQ-------IIQKMLNFD 384
Query: 348 SAAVRLIA---SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDE 404
+A + IA ++ G HT + K+ G L + + + L Y SS+GSL+E
Sbjct: 385 FSATKDIAFVHTIGGSHTDPTWKRTGLCGLGRAITSLGLQTS-QNLNLDYVTSSVGSLNE 443
Query: 405 KWMAE--LSSSMSSGFSE---------DKTPLGI------GEP-----LIVWPTVEDVRC 442
+++ L++ +G E LG+ GE + +P++ V
Sbjct: 444 QFLRSIYLAAQGDTGLKELTFRTSRTLPSEKLGVLTTRTDGEKWRDRFKVYFPSLNTVCQ 503
Query: 443 SLEGYAAGNAIPSPQK-----NVDKDFLKKYWAKWKA--SHTGRSRAMPH--IKTFARYN 493
S G I K ++ ++ ++ H+ A P I + +
Sbjct: 504 SKGGTMNAGTICFQSKWYNSTTFPRNVMRNNISRRDGLLMHSKMLFACPDKPITSSKDNS 563
Query: 494 GQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSN 549
Q W + SANLS++AWG L + S +L R++E GV+I + G G
Sbjct: 564 TQYAGWAYVGSANLSESAWGRLVLDRSTTKPKLNCRNWECGVVI--PIRHRGSG------ 615
Query: 550 IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV-----YLPVPYELPPQRYSSE 604
+ S+ SGST + KL + S S++V +PVP +P + Y
Sbjct: 616 QLSSQPSSGST-----LRPKLEPESESASVTVSDGSKLVSVFEPRIPVPMRVPGEPYQPG 670
Query: 605 DVPWSW 610
D PW +
Sbjct: 671 DKPWYY 676
>gi|403159950|ref|XP_003320511.2| hypothetical protein PGTG_02533 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169349|gb|EFP76092.2| hypothetical protein PGTG_02533 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 573
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 94/407 (23%), Positives = 165/407 (40%), Gaps = 68/407 (16%)
Query: 179 SCVSIRDVI--QGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRN 236
+ + I D+I + I +A++S+Y++++ W+ + ++VI +D K N
Sbjct: 144 NALRIEDIIGPKDRIKMALVSSYVLELPWIHK---LFNPRTRIMVIRHHTD--CGSFKVN 198
Query: 237 KPANWILHKPPLPISF------GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW 290
+ AN L PP+ + G H K ++ Y R+ + TAN + D+ +W
Sbjct: 199 ERANMFLCHPPMLKTANGNAKPGCMHIKFFIIFYDNFCRVAIPTANAVSFDYEFVENAIW 258
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLK--WPEFSANLPAHGNFKINPSFFKKFNFSS 348
+QDF N + +D+ + TL LP F+ + +F S
Sbjct: 259 IQDFRRFSGNTIGYNSRRSDDVPPFRKTLDDLLDRMGVPLP----FR---KPLEDHDFRS 311
Query: 349 AAVRLIASVPGYHTGSSLKKWGHM--KLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 406
AA L+ SV G H +S H+ +L+T+ + G + + L Q SS+GS D KW
Sbjct: 312 AAANLVVSVQGTHPANSPMGQAHLAEQLKTLGLQSGPGTG-RTATLECQGSSIGSYDLKW 370
Query: 407 MAEL----SSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVD 461
+ S S + +ED PL +++P++ VR S G A + + +
Sbjct: 371 LNNFYRCASGSPPTASTEDPDLQTKTPPLSVLYPSLHTVRNSHSGKAGAGTLFCNKATWE 430
Query: 462 K-DFLKKYWAKWKASHTGRSRAMPHIKTF-----------------------------AR 491
K +F +A + TG + H+K R
Sbjct: 431 KANFPTHIFADTMSKRTG---VLMHVKMILGLFNSDSSAESTSSTLATASVDKSGARDGR 487
Query: 492 YNGQKLAWFLLTSANLSKAAWGALQK-----NNSQLMIRSYELGVLI 533
N + + S N + AAWG +++ L I ++ELGV++
Sbjct: 488 INKDHAGFLYIGSHNFTPAAWGKFNSKSGSDDSTSLEISNWELGVVL 534
>gi|239613173|gb|EEQ90160.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces dermatitidis ER-3]
Length = 662
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 114/460 (24%), Positives = 193/460 (41%), Gaps = 72/460 (15%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNK 237
+ +V+Q D+ +A+LS+YM ++DW+ + K L+I GE D E K
Sbjct: 242 IKFEEVVQKSDLELAVLSSYMWNVDWMFSKFDI--KQTRFLLIMGEKEEDKKRELENDTK 299
Query: 238 PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL----WMQ 292
+ L PP+ HSK MLL +P +RI V +ANL+ DW + + ++
Sbjct: 300 SMGSVRLCFPPMEPQVNCMHSKLMLLFHPSYLRIAVPSANLVPFDWGEQGGVMENIVFLI 359
Query: 293 DFPLKDQNNLSEECG--FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFS 347
D PLK +L+ G F +DL+ +L ++NL + KK F+FS
Sbjct: 360 DLPLKSP-DLANGPGTSFLDDLVYFLQ-------ASNL--------HDQIIKKMLQFDFS 403
Query: 348 SAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW 406
+ + + ++ G HT +K G L + + + + +F S E W
Sbjct: 404 ATKDIAFVHTIGGSHTDPKWRKTGLCGLGSAITALGLQTTRDINLDYVRFPS-----ENW 458
Query: 407 MAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAI---------PSPQ 457
++ G +DK +V+P++ VR S G I +
Sbjct: 459 -GVVTKRTDGGKWKDKF-------RVVFPSLNTVRKSKGGAENAGTICFQSKWYNSATFP 510
Query: 458 KNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQK 517
K++ +D + + + R I + + + W + SANLS++AWG L
Sbjct: 511 KDIMRDNISRREGLLMHNKILFVRPEKPITSLKDNSTRYAGWAYVGSANLSESAWGRLVL 570
Query: 518 NNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTL 573
+ S +L R++E GV+I RH +S +PS +G T T K +
Sbjct: 571 DRSTTKPKLNCRNWECGVVI---PIRHNDAGKLSS--IPS---TGRTAT---LLAKSESE 619
Query: 574 TWHGSSDAGASSEVVY---LPVPYELPPQRYSSEDVPWSW 610
+SD G+ V+ +PVP +P RY + P+ +
Sbjct: 620 DSSANSDDGSEVTTVFEPTIPVPMIVPAPRYHGRNRPFFY 659
>gi|317148904|ref|XP_001822999.2| tyrosyl-DNA phosphodiesterase [Aspergillus oryzae RIB40]
Length = 667
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 163/389 (41%), Gaps = 53/389 (13%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D+ +A+LS++M +++WL AK LV+ + + T K A
Sbjct: 240 IKIEEVFQKSDLQLAVLSSFMWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAA 298
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL + VRI+V TANL DW +++ D
Sbjct: 299 MSNLRLCFPPMDGQVNCMHSKLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIID 358
Query: 294 FPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
P + D+++ GF ++L + LK N+ A ++FS +A +
Sbjct: 359 LPKRTDKDSGFTRTGFYHELTYF---LKASTLHENIIAK---------LTDYDFSRTAHI 406
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMA 408
+ ++ G H G S ++ G+ L + G + S PL F SS+GSL ++++
Sbjct: 407 AFVHTIGGSHMGDSWRRTGYCGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLR 462
Query: 409 E--LSSSMSSGFSEDKTPLGIGEP---------LIVWPTVEDVRCSLEGY-----AAGNA 452
L+ G +E P LI T E+ + Y +
Sbjct: 463 SIYLACQGDDGSTEYVLRTAKSFPVRSRSNPTQLINKSTAEEWKDRFRVYFPSETTVNDT 522
Query: 453 IPSPQKNVDKDFLKKYWAKWK-ASHTGRSRAM---PHIKTFARYNGQKLAWFLLTSANLS 508
PQ F +++ K H R + P N Q AW + SANLS
Sbjct: 523 KGGPQSAGTICFQSRWYTGPKFPRHVLRDCILYVRPDDPATLPDNSQCRAWAYVGSANLS 582
Query: 509 KAAWGALQKNNS----QLMIRSYELGVLI 533
++AWG L + + +L R++E GVL+
Sbjct: 583 ESAWGRLVQERATKEPKLNCRNWECGVLM 611
>gi|315045107|ref|XP_003171929.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma gypseum CBS 118893]
gi|311344272|gb|EFR03475.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma gypseum CBS 118893]
Length = 678
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 112/239 (46%), Gaps = 23/239 (9%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPA-CPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
+ + +V+Q D+ +A+LS+++ D+DWLL + ++ GE + + M+
Sbjct: 210 IKLEEVLQQADLELAVLSSFLWDMDWLLAKFTNPKTRFLFIMGAKGE-ERQAQLMRETAS 268
Query: 239 ANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQD 293
WI L PP+ HSK MLL +P +RI++ +ANL DW K L++ D
Sbjct: 269 MPWIRLCFPPMDGEVHCMHSKLMLLFHPNHMRIVIPSANLDPYDWGEKGGVMENMLFLID 328
Query: 294 FPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
P K + ++ F ++L+ +L K N KI +F+FS +
Sbjct: 329 LPRKAREADEDKTPFRDELVYFLRASKL-----------NEKIIDKML-QFDFSNTTKYA 376
Query: 353 LIASVPGYHTGS-SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL 410
+ S+ G H GS S ++ GH L T ++ E + L Y SS+GSL ++ L
Sbjct: 377 FVHSIGGSHIGSGSYERTGHCGLGTAVKSLGLETS-RPLTLDYITSSVGSLTATFLQNL 434
>gi|409042750|gb|EKM52233.1| hypothetical protein PHACADRAFT_148739 [Phanerochaete carnosa
HHB-10118-sp]
Length = 603
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 102/432 (23%), Positives = 171/432 (39%), Gaps = 95/432 (21%)
Query: 173 PAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEH 232
P + T ++ RD DI+ AI+S Y++++ W P V+V + G E
Sbjct: 155 PVFRLTDILAPRD----DIVFAIVSAYVINLPWFYSF--FNRGTPVVIVTQDPAAGN-ET 207
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG--VRIIVHTANLIHVDWNNKSQGLW 290
+K P +WI P L G H K +++ R +R+++ TAN I DW + +W
Sbjct: 208 LKEVLP-DWIKTTPFLRNGRGCQHMKVTFILFYRTSRLRMVISTANFIEYDWRDIENSVW 266
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANL-----PAHGNFKIN--PSFFKK 343
+QD P + + ++ + + + ++ L+ + L H N + K
Sbjct: 267 LQDVPPR-PSPIAHDSKANDFPMAFMRVLRGVNVAPALLTLTKNGHSNLPLKRIEELRMK 325
Query: 344 FNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQE--CTFEKGFKKSPLVYQFSSLG 400
++FS V LI S+ G H G + + GH L LQ+ KG K+ L Q SS+G
Sbjct: 326 WDFSKIKVALIPSLAGKHEGWPKVIQTGHTALMKALQDMGARTPKG-KELVLECQGSSIG 384
Query: 401 SLDEKWMAELSSSMSSGFSED----------KTPLGIGEPLIVWPTVEDVRCSLEGYAAG 450
+ +W+ E + +E + P + + I++PT + V+ S G G
Sbjct: 385 TYTTQWLNEFYVTARGESAESWLDQPRARRARLPFPLVK--ILFPTRKTVQDSALGEPGG 442
Query: 451 NAIPSPQKNVDKDFLKKYWAKWKA----------SHTGRSRAMPHIK----TFARY---- 492
+ F ++ A+W+ S + R R + H K TF
Sbjct: 443 GTM----------FCRR--AQWQGANFPRELFHDSKSKRGRVLMHSKLILATFRDSAFAA 490
Query: 493 -----------------------------NGQKLAWFLLTSANLSKAAWGALQKN--NSQ 521
N + W + S N + +AWG L + N
Sbjct: 491 SSSGSSKRHDTPSTDVSDDEIVEVPPPPGNEDFVGWAYVGSHNFTPSAWGTLSGSAFNPT 550
Query: 522 LMIRSYELGVLI 533
L I +YELGVL+
Sbjct: 551 LNITNYELGVLV 562
>gi|346971357|gb|EGY14809.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium dahliae VdLs.17]
Length = 609
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 116/491 (23%), Positives = 189/491 (38%), Gaps = 104/491 (21%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V++ D + +A++S++ D W L A+ V + + ++ E ++ N P+
Sbjct: 166 IKIEEVLEKDKLELAVVSSFQWDEPWFLSKVDT-ARTRMVFIAYAKNGAEQETLRANVPS 224
Query: 240 NWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFP 295
+ I L PP+ G HSK LL YP +RI+V + NL+ DW +++ D P
Sbjct: 225 SRIKLCFPPMH-GIGCMHSKLQLLKYPNHLRIVVPSGNLVPYDWGETGVLENIVFLIDLP 283
Query: 296 LKDQNNLSEEC--GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
Q + G + + + + L+ F L A G + F+F+ + R
Sbjct: 284 RIVQAPEDRDAIRGHDAAGVSFGTELR--RF---LRAQGLDESLVKSLDNFDFTETERYR 338
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSS 412
I ++ G HT + G+ L + K + Y SSLGS+D ++ + +
Sbjct: 339 FIHTIAGGHTDQLSGETGYHGLSRAVHSMGLSTD-KPISVDYVTSSLGSIDNSFIKTIYT 397
Query: 413 SMSSGFSEDKTPLGIGEP------------------------LIVWPTVEDVRCSLEGYA 448
+ D G+ +P I +PT + V S G A
Sbjct: 398 ACQG--LNDGQKDGVDQPSRRNTKTALAATATDSDKALGAKMRIYFPTEDTVAKSRGGKA 455
Query: 449 AGNAIPSPQK-----NVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ------KL 497
AG I +K +D L+ A T R M F + NG
Sbjct: 456 AGGTICFQEKWWGSATFPRDMLR------DAISTRRGVLMHDKIIFVQPNGTGGQDDPGA 509
Query: 498 AWFLLTSANLSKAAWGALQK----NNSQLMIRSYELGVLILP--SAKRHGCGFSCTSNIV 551
W + SANLS++AWG L K ++L R++E GVL+ + R G S
Sbjct: 510 GWAYVGSANLSESAWGRLTKERGSGRAKLTCRNWECGVLVPTGNTGDRSSGGLS------ 563
Query: 552 PSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY--LPVPYELPPQRY------SS 603
G+ +AG E +PVP P + Y ++
Sbjct: 564 -------------------------GAGEAGKMLEAFRGAVPVPMVAPSRAYGASSNDTA 598
Query: 604 EDVPWSWDKRY 614
D PW + KRY
Sbjct: 599 ADRPWLFMKRY 609
>gi|119467668|ref|XP_001257640.1| tyrosyl-DNA phosphodiesterase domain protein [Neosartorya fischeri
NRRL 181]
gi|119405792|gb|EAW15743.1| tyrosyl-DNA phosphodiesterase domain protein [Neosartorya fischeri
NRRL 181]
Length = 676
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 120/483 (24%), Positives = 193/483 (39%), Gaps = 101/483 (20%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D+ +AILS++M DI+WL V K L++ D E KR A
Sbjct: 238 ITIEEVFQRSDLELAILSSFMWDIEWLF--SKVDTKSTRFLLVMQAKD---ELTKRQYEA 292
Query: 240 ------NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGL 289
N L PP+ HSK MLL +P +RI+ TANL DW
Sbjct: 293 ETASMSNLRLCFPPMEGQVNCMHSKLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSA 352
Query: 290 WMQDFPLK-DQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNF 346
++ D P K ++ + FE DL+ +L STL+ S +F+F
Sbjct: 353 FLIDLPRKVATTSVGSKTVFEEDLVYFLRASTLQENIISR--------------LDEFDF 398
Query: 347 SSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSL 402
S + + L+ ++ G HTG++ ++ G+ L + G + S P+ F SS+GSL
Sbjct: 399 SQTSHIMLVHTIGGSHTGNTWRRTGYCGLGRAVNAL----GLRTSKPINIDFVASSVGSL 454
Query: 403 DEKWMAE--LSSSMSSGFSE----------DKTPLGIGEPLIVWPTVEDVRCSLEGY-AA 449
++++ L+S G ++ + P + LI T E+ + Y +
Sbjct: 455 TDEFLRSIYLASQGDDGITDFTLRTSKTFPARNPNDTDQ-LIHKNTAEEWKDRFRVYFPS 513
Query: 450 GNAIPSPQKNVDKDFLKKYWAKW-----------KASHTGRSRAMPHIKT-FARYN---- 493
+ + D + +KW + + R + H K F R +
Sbjct: 514 QTTVEQSRGGPDCAGTICFQSKWYEGPKFPRHVLRDCKSRRPGLLMHNKILFIRPDEPIR 573
Query: 494 ----GQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFS 545
Q W + SANLS++AWG L ++ + +L R++E GVL+ P +
Sbjct: 574 LPNSSQCRGWAYVGSANLSESAWGRLVQDKTTKQPKLNCRNWECGVLV-PILDKDNSLDK 632
Query: 546 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 605
+ N SG T + T +PVP +P QRY
Sbjct: 633 VSDN------DSGKRATESADMLDVFRDT---------------VPVPMTVPGQRYGPGL 671
Query: 606 VPW 608
PW
Sbjct: 672 KPW 674
>gi|134082171|emb|CAK42283.1| unnamed protein product [Aspergillus niger]
Length = 655
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 108/453 (23%), Positives = 188/453 (41%), Gaps = 60/453 (13%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ +A++S++M +++WL + K +LV+ E D T E
Sbjct: 224 IKIEEVLQKADLELAVMSSFMWEMEWLFSKFNI-EKTRFILVMQAEDDATYESETATM-R 281
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDFP 295
N L PP+ HSK MLL +P +R++V TANL DW + +++ D P
Sbjct: 282 NLRLCFPPMGGQVVCMHSKLMLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLIDLP 341
Query: 296 LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
K N+ E+ F DL+ + LK N+ A F+FS ++
Sbjct: 342 KK---NVLEKPTTHFYEDLVVF---LKASTLHENIIAK---------LDNFDFSKTSKYA 386
Query: 353 LIASVP--GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--A 408
+ ++P G HT ++ K+ G+ L ++ + + Y SS+G++ ++++
Sbjct: 387 FVHTIPSGGSHTDTAWKRTGYCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFLRCM 445
Query: 409 ELSSSM------SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQK---- 458
L+S + S +D + +P+ V S G + K
Sbjct: 446 YLASQVPRRDNPSKLLKKDTGSEWSDRFRLYFPSQNTVATSKGGPRCAGTVCFQSKWYNG 505
Query: 459 -NVDKDFLKKYWAKWKA--SHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAWGAL 515
N + L+ ++ K H P Q AW + SAN+S++AWG L
Sbjct: 506 ENFPRHILRDCESQRKGLLMHNKILYVRPDDPIPLSETTQCRAWAYIGSANMSESAWGRL 565
Query: 516 QKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLV 571
++ S +L R++E GV++ R S++K E K
Sbjct: 566 VQDRSTKSPKLNCRNWECGVIVPVIEDRTDS----------SDLKDKIHEDKCKGKASEF 615
Query: 572 TLTWHGSSDAGASSEVVY---LPVPYELPPQRY 601
+ +D GA+ VV+ +PVP +P RY
Sbjct: 616 SSLSSSDNDDGANLPVVFENTIPVPMRVPGARY 648
>gi|429855706|gb|ELA30650.1| tyrosyl-dna phosphodiesterase domain-containing protein
[Colletotrichum gloeosporioides Nara gc5]
Length = 620
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 115/484 (23%), Positives = 197/484 (40%), Gaps = 70/484 (14%)
Query: 110 LSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKL-----PSTF 164
L+R KR + + + K ++ D D++ +N+ L + + L F
Sbjct: 77 LARLGKRSATQADLDENFQTSKSQRTDAADSQELRNAAPVLKVQEQAANALDLPFAKGAF 136
Query: 165 RLLRVQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIH 223
R +G P + + I +V+Q + + +A+LS++ D +WLL + VLV +
Sbjct: 137 RRTWARGYPRTGDD--IKIEEVLQKEQLQLAVLSSFQWDEEWLLSKIDC-RRTKMVLVAY 193
Query: 224 GESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWN 283
+D ++ N PA I P P+ G HSK +L Y +R++V + NL+ DW
Sbjct: 194 AANDAEKAVIRSNAPAGLIRFCFP-PMHGGYMHSKLQILNY---LRLVVPSGNLVPYDWG 249
Query: 284 NKS---QGLWMQDFPLKD--QNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP 338
+++ D P + Q E F +L +L+ L E K+
Sbjct: 250 ETGVLENMVFLIDLPRYETQQTTAGTETLFGKELRRFLTALGIGE-----------KLVK 298
Query: 339 SFFKKFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFS 397
S ++FS ++ + ++ G H S + G+ L + + + Y S
Sbjct: 299 S-LDNYDFSETSRYGFVHTISGSHANDSWQHTGYCGLGNTARSLGLATDYPVD-VDYVAS 356
Query: 398 SLGSLDEKWMAEL----------------------SSSMSSGFSEDKTPLGIGEPL---- 431
SLGSL+ ++ + S + SG S +T L
Sbjct: 357 SLGSLNHGYLTAIYNACQGDSGMKEYEARQSKSTRSKAGRSGPSGSRTITAEAVDLQHHF 416
Query: 432 -IVWPTVEDVRCSLEGYAAGNAIPSPQK-NVDKDFLKKYWAKWKASHTGRSRAMPHIKT- 488
I +PT + V S G +A I +K F ++ +++ TG + H K
Sbjct: 417 RIYFPTEKTVSSSRGGRSAAGTICMQEKWWKSSTFPRELLRDCESTRTG---LLLHSKAI 473
Query: 489 FARYNGQKLA-WFLLTSANLSKAAWGALQKN----NSQLMIRSYELGVLILPSAKRHGCG 543
F R A W + SANLS++AWG L K+ ++L R++E GVL+ + GC
Sbjct: 474 FVRERACNGAVWAYMGSANLSESAWGRLVKDRESGTAKLSCRNWECGVLV-AVGRTAGCA 532
Query: 544 FSCT 547
S T
Sbjct: 533 DSGT 536
>gi|322701752|gb|EFY93501.1| tyrosyl-DNA phosphodiesterase, putative [Metarhizium acridum CQMa
102]
Length = 267
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 74/158 (46%), Gaps = 20/158 (12%)
Query: 469 WAKWKASHTGRSRAMPHIKTFARYNGQ-KLAWFLLTSANLSKAAWGALQKNNSQLMIRSY 527
W + S+T + T+ RYN + + W +LTSAN+SK AWG ++ + +L + S+
Sbjct: 126 WVIYDPSYTTGPTTVQTALTYIRYNEKGSIDWAMLTSANISKQAWGEAERPSGELRVASW 185
Query: 528 ELGVLILPSAKRHGCGFSCT-SNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSE 586
E+GVL+ P T + VP E K S GA
Sbjct: 186 EIGVLVWPGLVGQDVSMVGTFQSDVPKEPKE------------------QADSKTGAGGV 227
Query: 587 VVYLPVPYELPPQRYSSEDVPWSWDKRYTKKDVYGQVW 624
++ + +PY LP QRY + +VPW ++ + D +G+ W
Sbjct: 228 LIGVRIPYSLPLQRYGAGEVPWVATMKHGEPDRFGRQW 265
>gi|281210780|gb|EFA84946.1| hypothetical protein PPL_01939 [Polysphondylium pallidum PN500]
Length = 493
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 138/311 (44%), Gaps = 44/311 (14%)
Query: 242 ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP----LK 297
I+H P L G HSK +LL Y + +R+++ ++NL DW Q +++ D P
Sbjct: 134 IIHPPLLVSQIGILHSKIILLEYQQIIRVVISSSNLTGSDWEVLGQTIFIVDIPRIKKNN 193
Query: 298 DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLI 354
N + F+ +L+D LS+L + + + +N +F+FS + ++
Sbjct: 194 IDNINDNKDQFKYELVDILSSLGFTD---------DHIVNA--LDQFDFSMIHQHGIHIV 242
Query: 355 ASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
+S+PG + S K+G KL ++ E + K+ VYQ S++G +W++
Sbjct: 243 SSIPGVY---SHNKYGLSKLASLASEY---QSTSKATAVYQSSAIGMTSREWLSSF---- 292
Query: 415 SSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL--KKYWAKW 472
K +G + +PT+ + + G + DKD L K +K
Sbjct: 293 -------KAAIGTDNLTLPFPTLNTIDEMITYNPLGATESVTIRYHDKDLLLSNKMLSKL 345
Query: 473 KASHTGRSRAMPHIKTFARY---NGQKLA---WFLLTSANLSKAAWGALQKNNSQLMIRS 526
+ ++ + I + + + + L W S N ++A+WG++ K S + I +
Sbjct: 346 QYNNERDPKVDNSITNLSSHPPLHSKVLITDRWIYHGSHNFTEASWGSISKRQSTIKISN 405
Query: 527 YELGVLILPSA 537
+E GV I P+A
Sbjct: 406 FETGVFI-PTA 415
>gi|400597097|gb|EJP64841.1| ubiquitin interaction domain-containing protein [Beauveria bassiana
ARSEF 2860]
Length = 540
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 107/489 (21%), Positives = 202/489 (41%), Gaps = 87/489 (17%)
Query: 110 LSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRV 169
L+R KR N + G M++ Q +E ++S+ L T R
Sbjct: 70 LNRLGKRRRN--SIEGSTQEPDMKRLTSQRSERAESSQPRY---------LQGTVRRTWT 118
Query: 170 QGLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG 228
+G P ++ +++ +++Q D+ +A+LS++ D +WLL +K +L+ S+
Sbjct: 119 RGYPKTSDD--ITVEEILQKDDLQLALLSSFQWDEEWLLSKLNA-SKTRILLLAFAASEE 175
Query: 229 TLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS-- 286
+ M+ N P N PP+ G+ HSK L +P+ +R+++ + NL+ DW
Sbjct: 176 QKQLMRGNVPKNIRFCFPPMN-GPGSMHSKLQFLKFPKYLRLVIPSGNLVPYDWGETGVM 234
Query: 287 -QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFN 345
+++ D P + + F ++ +L A G + ++
Sbjct: 235 ENMVFLIDLPRLEASGNRTMTVFGENVARFLK------------ASGVDEAMVESIANYD 282
Query: 346 FSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSL 402
FS+ A + + S+PG H G +L++ G+ L ++ +P+ +SLGS+
Sbjct: 283 FSATANLGFVYSIPGGHMGEALRQVGYCGLGATVRGLGLA---TDTPIEVDLACASLGSI 339
Query: 403 D-------------EKWMAELSSSMSSGFSEDKT-PLG--IGEPLIVWPTVEDVRCSLEG 446
+ + M E ++ + + T P G + I +PT V S G
Sbjct: 340 NYDLINAVYNACQGDDGMQEYNARVGRKLKDKGTRPTGRLRDQFRIYFPTDRTVSESKGG 399
Query: 447 YAAGNAI---------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTF-------A 490
+ I PS K + +D + R + H K A
Sbjct: 400 RQSAGTICVQAKWWRAPSFPKELVRDCVNN-----------RDGLLMHSKIILVRRPAAA 448
Query: 491 RYNGQ--KLAWFLLTSANLSKAAWGALQKN----NSQLMIRSYELGVLI-LPSAKRHGCG 543
GQ + W + SANLS++AWG + K+ ++++ R++E GV++ + +GC
Sbjct: 449 ELIGQTPAMGWAYIGSANLSESAWGRVVKDRGTGSAKMSCRNWECGVVVPVHGNPGNGCD 508
Query: 544 FSCTSNIVP 552
+ S +VP
Sbjct: 509 ITIFSGVVP 517
>gi|330927762|ref|XP_003301988.1| hypothetical protein PTT_13657 [Pyrenophora teres f. teres 0-1]
gi|311322883|gb|EFQ89910.1| hypothetical protein PTT_13657 [Pyrenophora teres f. teres 0-1]
Length = 572
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 94/421 (22%), Positives = 181/421 (42%), Gaps = 51/421 (12%)
Query: 175 WANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT---- 229
+ T+ ++I ++++ + +A++ +Y D W+ K+ + +++ + G
Sbjct: 150 YPRTNDITIDELLEAPHVNIAVICSYQYDSSWMYEKLDP-TKVKQIWLMYAKFRGEDIRE 208
Query: 230 --LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN--- 284
L+ ++ N LH PP+ + HSK MLL +RI + TAN+ DW
Sbjct: 209 KLLQEWAESRVPNMRLHFPPMDGMIVSMHSKFMLLFGKEKLRIAIPTANMTPTDWGEVGN 268
Query: 285 ------KSQGLWMQDFPLKDQNNLSEECG---FENDLIDYLSTLKWPEFSANLPAHGNFK 335
+++ D P + + + + F DL+ + LK E + K
Sbjct: 269 DWQPGVMENSVFLIDLPRRSDDGVGKVEDLPPFGRDLVFF---LKAQEVGS--------K 317
Query: 336 INPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVY 394
+ KF+F+ + + S+ G H S + G L ++E ++ + L Y
Sbjct: 318 VTDGVL-KFDFADTKHLAFVHSIGGSHKEESERPTGLPGLANAVRELQYDD-VEHLELDY 375
Query: 395 QFSSLGSLDEKWMAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGN 451
SSLG++++ +++ + ++ F++D P I +PT + V S G N
Sbjct: 376 AASSLGAINDTFLSRIYLAARGKSFTKDNAVVPDVRDHIRIYFPTNDTVEKSTGGPDCAN 435
Query: 452 AIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK-TFA---RYNGQKLAWFLLTSAN 506
I +K + F K+ + ++ G + H K FA R NG+ AW + SAN
Sbjct: 436 IISLSRKYYNASTFPKECLRDYVSTRRG---MLSHNKLLFARGRRTNGKPFAWVYVGSAN 492
Query: 507 LSKAAWGALQKNNS----QLMIRSYELGVLI-LPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
+S++AWG + S L +R++E GV++ +P K + + P + G+ E
Sbjct: 493 ISESAWGGQKVLKSGKVGALSVRNWECGVMVPVPDDKLEQVDLKADA-VPPMSVFEGTVE 551
Query: 562 T 562
Sbjct: 552 V 552
>gi|402072975|gb|EJT68632.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 629
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 117/478 (24%), Positives = 194/478 (40%), Gaps = 99/478 (20%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWL-LPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
++I V+Q D++ +A+LS++ D DWL P+ KI V E +E +
Sbjct: 204 ITIDQVLQKDMLQMAVLSSFQWDTDWLWRKVNPMKTKITLVAYAGNE----VEKAAVVES 259
Query: 239 ANWI--LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
A I L PP+ FG HSK LL +P +RI+V + NL+ DW G +
Sbjct: 260 ARGIARLCFPPMN-GFGYMHSKLQLLKFPGFLRIVVPSGNLVSYDWGET--GTMENVVFI 316
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVRLIA 355
D + + G E + + + L A G + +K++F+ ++ +
Sbjct: 317 IDLPPVGDLAGSEGNTLTSFGE----DLCYFLKAQGLEESLIKSLRKYDFTETSRYGFVH 372
Query: 356 SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL--S 411
S+PG H G S + G+ L + + P+ SS+GSL K+ + L +
Sbjct: 373 SIPGSHMGDSWNQTGYCGLGRAVNKLGLA---TDQPIEVDLVASSIGSLTSKFCSALYKA 429
Query: 412 SSMSSGFSED-----KTPLGIGEPL------------IVWPTVEDVRCSLEGY-AAGNA- 452
SG E K G+G + +P+++ V S G +AG
Sbjct: 430 CQGDSGIKEHESKGAKAKNGMGGAASTTQAALAQRFRVYFPSLQSVVASRGGRNSAGTTC 489
Query: 453 -------IPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIK-TFARYNGQKLAWFLLTS 504
+PS + + +D++ R + H K F R +W + S
Sbjct: 490 LQSRWWNLPSFPRELFRDYMNPR------------RVLVHSKIIFVRAPSGGASWAYVGS 537
Query: 505 ANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK---S 557
ANLS++AWG L K+ + ++ R++E GV I+P+ H E+K
Sbjct: 538 ANLSESAWGKLVKDRTSSSPKMTCRNWESGV-IVPAGSGH-------------ELKHQGH 583
Query: 558 GSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED---VPWSWDK 612
G E + I + V + G +P+P LP Y+S D +PW D+
Sbjct: 584 GRAEGAGICGS--VGAVFEGC-----------VPLPMTLPGTEYASGDGTRLPWFIDQ 628
>gi|189207467|ref|XP_001940067.1| tyrosyl-DNA phosphodiesterase domain containing protein
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187976160|gb|EDU42786.1| tyrosyl-DNA phosphodiesterase domain containing protein
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 564
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 86/391 (21%), Positives = 169/391 (43%), Gaps = 49/391 (12%)
Query: 175 WANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT---- 229
+ T+ ++I ++++ + +A++ ++ D W+ +I + +++ + G
Sbjct: 142 YPRTNDITIDELLEAPQVNIAVICSFQYDSSWMYEKLDP-TRIKQIWLMYSKFRGEDIRE 200
Query: 230 --LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN--- 284
+ ++ N LH PP+ + HSK MLL +RI + TAN+ DW
Sbjct: 201 KLIREWTESRIPNMKLHFPPMDGMIVSMHSKFMLLFGKEKLRIAIPTANMTQTDWGEVGN 260
Query: 285 ------KSQGLWMQDFPLKDQNN---LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK 335
+++ D P + + + E F DLI + LK + + +
Sbjct: 261 DWQPGVMENSVFVIDLPRRSDDGVGKVEELPSFGRDLIFF---LKAQQVESRVTGG---- 313
Query: 336 INPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVY 394
KF+F+ + + S+ G H + G L ++E ++ + L Y
Sbjct: 314 -----VLKFDFADTKHLAFVHSIGGSHKEELERPTGLPGLANAVRELQYDD-VEHIELDY 367
Query: 395 QFSSLGSLDEKWMAELS-SSMSSGFSEDK--TPLGIGEPLIVWPTVEDVRCSLEGYAAGN 451
SSLG++++ +++ + ++ F++D P I +PT E V S+ G N
Sbjct: 368 AASSLGAINDTFLSRIHLAARGKNFTQDNAAVPDVRDHFRIYFPTNETVEKSIGGSGCAN 427
Query: 452 AIPSPQKNVDKD-FLKKYWAKWKASHTGRSRAMPHIK-TFA---RYNGQKLAWFLLTSAN 506
I +K + F K+ + ++ G + H K FA R +G+ AW + SAN
Sbjct: 428 IISLSKKYYNASTFPKECLRDYDSTRRG---MLSHNKLLFARGRRTDGRPFAWVYVGSAN 484
Query: 507 LSKAAWGALQKNNS----QLMIRSYELGVLI 533
+S++AWG + S L +R++E GV++
Sbjct: 485 ISESAWGGQKVLKSGKVGALNVRNWECGVIV 515
>gi|358399116|gb|EHK48459.1| hypothetical protein TRIATDRAFT_290150 [Trichoderma atroviride IMI
206040]
Length = 590
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 116/486 (23%), Positives = 189/486 (38%), Gaps = 107/486 (22%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG--TLEH----- 232
++I +V Q D + +A+LS++ D +W+L + +L++ DG LE
Sbjct: 149 ITIEEVFQKDKLELAVLSSFQWDEEWMLSKLDY--RRTKILLLAFARDGAQVLEFIHKTL 206
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGL 289
M+ N PAN PP+ G HSK LL YP +R+++ T NL+ DW +
Sbjct: 207 MQGNVPANIKFCFPPMH-GVGAMHSKLQLLKYPSHLRVVIPTGNLMPYDWGETGVMENMV 265
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-S 348
++ D P D + + + T + E L A G + + ++FS +
Sbjct: 266 FLIDLPRLDHPVSTHASAARS----HAPTRFYTELVYFLQATGVGEKMVASLANYDFSRT 321
Query: 349 AAVRLIASVPGYHTG--------------------------SSLKKWGHMKLRTVLQECT 382
A + + ++PG H+ +SL +R + C
Sbjct: 322 ADLAFVHTIPGSHSAKNAERIASVADLGLASVDPVDVDLVCASLGALNQQMVRAIYNACR 381
Query: 383 FEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC 442
+ G + SS S + +++++S + L I +PT V
Sbjct: 382 GDDGTDEYHKPASTSSRSSAKKPTTTTTTATVTS-----QEQLLRERFRIYFPTDRTVSQ 436
Query: 443 SLEGYAAGNAIPSPQK-----NVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARY---- 492
S G AG I K N ++ ++ R R + H K F R
Sbjct: 437 SRGGRNAGGTICVQTKWWRAPNFPRELVRDV--------ISRDRVLMHSKMIFVRRRPGD 488
Query: 493 NGQKLA------WFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGC 542
+GQ A W + SANLS++AWG + K+ S +L+ R++E GV+I
Sbjct: 489 SGQAQAVRQSPGWAYVGSANLSESAWGRMSKDKSTGGFKLVCRNWECGVII--------- 539
Query: 543 GFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYS 602
VP E+ + KT L T S+D S +PVP ++P Y
Sbjct: 540 -------PVP--------ESQPVDKTTLPT-----SADDDMSMFAGTVPVPMQVPGPVYR 579
Query: 603 SEDVPW 608
S D PW
Sbjct: 580 SSDQPW 585
>gi|389632429|ref|XP_003713867.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae 70-15]
gi|351646200|gb|EHA54060.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae 70-15]
Length = 636
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 115/488 (23%), Positives = 206/488 (42%), Gaps = 73/488 (14%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGES 226
+QG P ++ ++I +V+Q D + +A+LS++ D +WL P K + E+
Sbjct: 168 LQGQPR--SSQDITIEEVLQKDQLELAVLSSFAWDPEWLWTKVDPTKTKTTLIAFAGNEA 225
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS 286
D + + + L PP+ + G HSK LL +P +RI+V + NL+ DW ++
Sbjct: 226 D--QKEVTASAQGVARLCFPPMNGN-GCMHSKLQLLKFPGYLRIVVPSGNLVPYDWGEQN 282
Query: 287 QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG-NFKINPSFFKKFN 345
G+ + D L E++ + E S L A G N +I S +K++
Sbjct: 283 -GIMENSVFIIDLPPLKAGVKLEDNTLTSFGE----ELSYFLTAQGLNERIINS-LRKYD 336
Query: 346 FS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSL 402
FS ++ + ++ G HTG ++ G+ L +Q P+ F SS+G+L
Sbjct: 337 FSQTSRYAFVHTIAGVHTGDKWRRTGYCGLGRAIQNLGLA---TDEPVEIDFVASSMGAL 393
Query: 403 DEKWMAELSSSMS--SGFSE-----DKTPLGIGEPL------------IVWPTVEDVRCS 443
++ L ++ SG + KT + I +P++ V S
Sbjct: 394 KYGYLLALYNAFQGDSGLKDYQSRASKTKTSKEDAASAQQAKLRDFFRIYFPSLATVEAS 453
Query: 444 LEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRS---------RAMPHIK-TFARYN 493
G + + L+ W W+A+ R+ A+ H K FAR
Sbjct: 454 RGGTRSAGTL----------CLRSGW--WEAATFPRALFRDYENPRGALVHSKIVFARPP 501
Query: 494 GQKLAWFLLTSANLSKAAWGAL---QKNNSQ--LMIRSYELGVLILPSAKRHGCGFSCTS 548
AW + SAN+S++AWG L + +SQ + R++E GV I+P + G + ++
Sbjct: 502 DASAAWAYVGSANVSESAWGNLLVKDRASSQPKMSCRNWECGV-IVPVGEPASPGRTLST 560
Query: 549 NIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRYS--- 602
I P + +G + + + + S E ++ +P+P +LP + Y+
Sbjct: 561 GIDPGDASAGKGGSLHGHQARNSPQEQNAPVGRSRSIEELFSECVPLPMQLPGRSYALAH 620
Query: 603 SEDVPWSW 610
VP W
Sbjct: 621 GGKVPHPW 628
>gi|212546295|ref|XP_002153301.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210064821|gb|EEA18916.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 596
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 129/292 (44%), Gaps = 44/292 (15%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMK 234
N + I +V+Q D+ +A+LS + D+ W+ K ++V+ + + T L++ +
Sbjct: 232 NGDDIKIEEVLQTADLELALLSAFQWDMQWMFTKFRTPNKTRFLMVMQAKEESTRLQYQE 291
Query: 235 R--NKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQG 288
N P N L PP+ HSK MLL +P +RI+V +AN++ DW +
Sbjct: 292 ETANMP-NIRLCFPPMDGQVNCMHSKLMLLFHPEYLRIVVPSANMVPYDWGEQGGVMENT 350
Query: 289 LWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPA---HGNF--KINPSFFKK 343
+++ D P K ND D T + E S L A H N K++ FK+
Sbjct: 351 VFLIDLPKKST----------NDAADSPKTAFYEELSYFLKASTLHENIIAKLSAFDFKE 400
Query: 344 FNFSSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFK-KSPLVYQF--SSLG 400
N + + ++ G H G SL + GH L + G K + P+ F SS+G
Sbjct: 401 TNRYA----FVHTIGGSHFGESLTRTGHCGLGKAVTSL----GLKTREPINIDFVTSSIG 452
Query: 401 SLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNA 452
SL +++M + S +T I +I+ +V C L G + NA
Sbjct: 453 SLTDEFMRSIYLSAQG----KQTLYSIIRTIIL-----NVSCRLGGDGSTNA 495
>gi|307109628|gb|EFN57866.1| hypothetical protein CHLNCDRAFT_143336 [Chlorella variabilis]
Length = 213
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 71/139 (51%), Gaps = 21/139 (15%)
Query: 483 MPHIKTFARY----NGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP--- 535
MPH K + R+ +G ++AW + S NLSKAAWG L+ + SQL I SYELGVL+LP
Sbjct: 1 MPHSKAYLRWSHGDHGPEIAWCYVGSHNLSKAAWGCLELDASQLHICSYELGVLLLPRLE 60
Query: 536 SAKR--HGCGFSCTSN------IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 587
+A R CGFSCT ++ + + + L W D+ A+ V
Sbjct: 61 AAYRTSRWCGFSCTGGQPGAAAPRLAQAAAAAGAAGTAAVPSVRFLQWR-QGDSQAAEMV 119
Query: 588 -----VYLPVPYELPPQRY 601
V LPVP+ LPP Y
Sbjct: 120 QGQLGVPLPVPFHLPPVPY 138
>gi|402219032|gb|EJT99107.1| phospholipase D/nuclease [Dacryopinax sp. DJM-731 SS1]
Length = 680
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 142/322 (44%), Gaps = 46/322 (14%)
Query: 177 NTSCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEH-- 232
N I D++ D+ +LS+Y D WL P +IP +LV+ + D + H
Sbjct: 207 NRPRFKITDIVSPASDLEFVLLSSYCTDTPWLTTFLP--REIPVLLVV--DPDPSQRHDA 262
Query: 233 -MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLW 290
+K +W+ P + S G H K +LL Y G +R+ + TANL+ DW + ++
Sbjct: 263 SLKNLGIGDWLRVTPRIWQSRGVMHIKVLLLFYKSGRLRVAIPTANLVDYDWRDIENTVF 322
Query: 291 MQDF-PLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG----NFKINPSFFKKFN 345
+QD P+ D + + F L L +L P NL G + + K++
Sbjct: 323 VQDLPPITDSSADPQSHDFPTYLWGVLKSLNVPAGLLNLVNSGYPSLPLQSLQNLQDKWD 382
Query: 346 FSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFE-KGFKKSPLVYQFSSLGSLD 403
+ RL+ASV G + G +++ +GH +L ++++ + K K + Q SS+G+
Sbjct: 383 WCKMRARLVASVAGNYEGWYNVRMYGHPRLSAIIRDSRAQPKKGKVLNIECQGSSVGNCT 442
Query: 404 EKWMAELSSS-------------MSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAG 450
+++ E+ S MS + P+ I++PT++ V S+ G G
Sbjct: 443 TQYLNEVYKSCCGIDPISWIDIPMSRQVRQPWPPVK-----ILFPTLKTVDDSVFGRNGG 497
Query: 451 NAIPSPQKNVDKDFLKK-YWAK 471
+ F KK YW+K
Sbjct: 498 GSF----------FCKKPYWSK 509
>gi|297806769|ref|XP_002871268.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297317105|gb|EFH47527.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 1083
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 87/199 (43%), Gaps = 35/199 (17%)
Query: 191 IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIH-------GESDGTLEHMKRNKPANWIL 243
I +A L++ DI W L C + + +P + H D N P N +
Sbjct: 403 IFIATLTS---DILWFLTCCEIPSHLPVTIACHHAERCWSSSPDARSTAPLPNYP-NVTM 458
Query: 244 HKPPLP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQ 292
PP P I+FG HH K +L +R+I+ +ANL+ WN+ + +W Q
Sbjct: 459 VFPPFPEEIAFGKDRKNRGIACHHPKLFILQREVSIRVIITSANLVARQWNDVTNTVWWQ 518
Query: 293 DFPLK---DQNNLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
DFP + D +L C G + D L+ ++P+ ++ I F K
Sbjct: 519 DFPRRADPDVLSLFGHCRRETNHGLKTDFCAQLAGFA-ASLLTDVPSQAHWIIE---FTK 574
Query: 344 FNFSSAAVRLIASVPGYHT 362
+NF +A L+ASVPG H+
Sbjct: 575 YNFEHSACHLVASVPGIHS 593
>gi|320040691|gb|EFW22624.1| hypothetical protein CPSG_00523 [Coccidioides posadasii str.
Silveira]
Length = 651
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 101/405 (24%), Positives = 171/405 (42%), Gaps = 74/405 (18%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ +V+Q D+ +A+LS++ ++DWL V K L++ G E KR
Sbjct: 212 IKFEEVVQKDDLELAVLSSFQWNMDWLFTKFNV--KKTRFLLVMGHK---YEEEKRQTQK 266
Query: 240 NWI------LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGL 289
++ L P+ HSK MLL +P +R++V +ANL+ DW + L
Sbjct: 267 DFADIPSIRLCFVPMGPQVNCMHSKLMLLFHPNHLRLVVPSANLVPYDWGEQGGIIENLL 326
Query: 290 WMQDFPLKDQNNLSEECG-FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS- 347
++ D P K + + F ++L+ +L E KI +F+F
Sbjct: 327 FLIDLPRKILGSQEKTSTPFFDELVYFLKASALHE-----------KIIAK-LSEFDFGK 374
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLV--YQFSSLGSLDEK 405
+A + ++ G HTGS WG + + + T PL Y SSLGSL+++
Sbjct: 375 TAGFAFVHTIGGSHTGSD---WGKTGVCGLGKAVTMLGLQTPQPLKLDYVTSSLGSLNDQ 431
Query: 406 WM-------------AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTVEDVRCS 443
+M EL+ S F DK + + + LI +P+++ V+ S
Sbjct: 432 FMRSMYLAAQGDNGLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSLKTVQGS 491
Query: 444 LEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQKL----- 497
+ I K ++ ++ + S + R + H KT F R + K+
Sbjct: 492 RARPSGAGTICFQSKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGKIIGDAN 549
Query: 498 -----AWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLI 533
W + SANLS++AWG L + S +L R++E GV+I
Sbjct: 550 TTAYQGWTYVGSANLSESAWGRLVIDRSTTKPKLNCRNWECGVII 594
>gi|302406010|ref|XP_003000841.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium albo-atrum VaMs.102]
gi|261360099|gb|EEY22527.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Verticillium albo-atrum VaMs.102]
Length = 586
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 110/481 (22%), Positives = 185/481 (38%), Gaps = 85/481 (17%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
V G P + I +V++ D + +A++S++ D WLL A+ V + + ++
Sbjct: 156 VHGFPR--TNDDIKIEEVLEKDKLELAVVSSFQWDEPWLLSKVDT-ARTRMVFIAYAKNG 212
Query: 228 GTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS 286
E ++ + P++ I L PP+ G HSK LL Y +RI+V + NL+ DW
Sbjct: 213 AEQETLRASVPSSRIKLCFPPM-YGIGCMHSKLQLLKYQNHLRIVVPSGNLVPYDWGETG 271
Query: 287 ---QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+++ D P Q + + ND + F L A G +
Sbjct: 272 VLENMVFLIDLPRIVQASGDGDAIRGNDAAGVSFGTELRRF---LRAQGLDESLVKSLDN 328
Query: 344 FNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSL 402
F+F+ + R I ++ G HT + G+ L + P+ + +
Sbjct: 329 FDFTETERFRFIHTIAGGHTDQLSGETGYHGLSRAVHSLGLS---TDEPITVDYVAQQDQ 385
Query: 403 DEKWMAELSSSMSSGFSEDKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPQKNVD 461
++ + + + + +G + I +PT + V S G AAG I
Sbjct: 386 NDGGNQPSRRNTKTALNATDSQKALGVKMRIYFPTEDTVARSRGGKAAGGTIC------- 438
Query: 462 KDFLKKYWAK-------WKASHTGRSRAMPHIK-TFARYN---GQK---LAWFLLTSANL 507
F +K+W + S + R + H K F + N GQ W + SANL
Sbjct: 439 --FQEKWWGSATFPREMLRDSISTRPGVLMHDKIIFVQPNSTGGQDDPGAGWAYVGSANL 496
Query: 508 SKAAWGALQK----NNSQLMIRSYELGVLI--LPSAKRHGCGFSCTSNIVPSEIKSGSTE 561
S++AWG L K ++L R++E GVL+ + R G S
Sbjct: 497 SESAWGRLTKERGSGRAKLTCRNWECGVLVPTRTTGDRSSGGLS---------------- 540
Query: 562 TSQIQKTKLVTLTWHGSSDAGASSEVVY--LPVPYELPPQRY------SSEDVPWSWDKR 613
G+ +AG E +PVP P + Y ++ D PW + KR
Sbjct: 541 ---------------GAGEAGKMLEAFRGAVPVPMVAPSRAYGTSSNDTAADRPWLFMKR 585
Query: 614 Y 614
Y
Sbjct: 586 Y 586
>gi|116192211|ref|XP_001221918.1| hypothetical protein CHGG_05823 [Chaetomium globosum CBS 148.51]
gi|88181736|gb|EAQ89204.1| hypothetical protein CHGG_05823 [Chaetomium globosum CBS 148.51]
Length = 670
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 90/399 (22%), Positives = 165/399 (41%), Gaps = 80/399 (20%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ +A++S++ D W+L + + +L+ S+ M+ N P
Sbjct: 226 IKIEEVLQKNDLKLAVVSSFQWDEHWMLSKIDI-TRTKLMLIAFAASEAQKAEMRANVPK 284
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFP- 295
N + P G HSK MLL Y R +RI+V T N + DW +++ D P
Sbjct: 285 NRVRFCFPPMHGIGAMHSKLMLLKYERYMRIVVPTGNFMSYDWGETGTMENMVFIIDLPK 344
Query: 296 --LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VR 352
+Q + F ++L +L A G + S + ++F+ A+ +
Sbjct: 345 FETAEQREAQKPDPFSSELFYFLR------------AQGLDEKLVSSLRNYDFTEASRYK 392
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWMAEL 410
+ ++PG HT W + ++++ + P+ F +SLG+++ +++ +
Sbjct: 393 FVHTIPGSHTDED--AWRRTAVSSLIRAT-------RDPIDIDFVCASLGAINYDFLSAM 443
Query: 411 -------------SSSMSSGFSE---DKTPLGIGEPL-IVWPTVEDVRCSLEGYAAGNAI 453
+ + S G E D+ + E + + +P+ E V S G I
Sbjct: 444 YYACLGDPLVEYQARTGSKGQREAFNDRAQFLVKEHMRVFFPSRETVLQSKGGKEGAGTI 503
Query: 454 PSPQKNVDKDFLKKYWAKWKA----------SHTGRSRAMPHIKT-FARYNGQKLAW--- 499
K W W+A + R + H K + R N + W
Sbjct: 504 ----------CFKPIW--WQAPTFPQQILRDCKSVRPGVLMHSKVIYIRPNDPGIRWNQC 551
Query: 500 -FLLTSANLSKAAWGALQKNN----SQLMIRSYELGVLI 533
+ SANLS++AWG L ++ ++L R++E GVLI
Sbjct: 552 LAYVGSANLSESAWGKLVRDRVTKKAKLTCRNWECGVLI 590
>gi|294896960|ref|XP_002775774.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
gi|239882085|gb|EER07590.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
Length = 201
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 85/175 (48%), Gaps = 23/175 (13%)
Query: 253 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDF--------PLKDQNNLSE 304
GT H+K +++ + +R+ + ++N+ DW SQ +W+ DF P + +
Sbjct: 1 GTMHAKLIIIERAQALRVCISSSNVTPQDWEGVSQCIWVADFKPANDPEAPARKRVKPDH 60
Query: 305 ECGFENDLIDYLSTLKWPEFSANLP---AHGNFKINPSFFKKFNFS-SAAVRLIASVPGY 360
F + L ++ T F ++P + ++ + +FN V LIAS PGY
Sbjct: 61 TSDFGDQLARFIET-----FFRSIPDSSSLSSYWVKVLTGSRFNVKLPKGVELIASAPGY 115
Query: 361 HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMS 415
G WGHM+LR +L + E+ +++Q SS+G L ++A+LS S++
Sbjct: 116 WKGDDRDNWGHMRLRALLSDVHSEE------ILFQCSSIGFLPASFLADLSKSLN 164
>gi|343476326|emb|CCD12540.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 173
Score = 68.9 bits (167), Expect = 7e-09, Method: Composition-based stats.
Identities = 43/113 (38%), Positives = 60/113 (53%), Gaps = 18/113 (15%)
Query: 195 ILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEHMKRNKPANWILH--------- 244
IL Y++D++WL P+L +++I GE G L +K + +LH
Sbjct: 44 ILGGYVMDVEWLFRVSDPLLMSKCTIVLISGEK-GFL-----HKYRHLVLHDRFGRNRVK 97
Query: 245 --KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP 295
+P LPI FG HHSK ML I G+R+ V TAN I DWN K+QG++ P
Sbjct: 98 IVEPCLPIPFGVHHSKMMLCINNNGIRVAVLTANFIEDDWNYKTQGIYFFHSP 150
>gi|326472360|gb|EGD96369.1| hypothetical protein TESG_03817 [Trichophyton tonsurans CBS 112818]
Length = 676
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 93/405 (22%), Positives = 164/405 (40%), Gaps = 68/405 (16%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ + +V+Q D+ +A+LS+++ D+DWLL + + ++ + + E + R +
Sbjct: 218 IKLEEVLQPSDLELAVLSSFLWDMDWLLMKF-TNPRTRFLFIMGAKGEERREQLLRETAS 276
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQD 293
L PP+ HSK MLL + +RI++ +ANL DW + L++ D
Sbjct: 277 MSRIRLCFPPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEEGGVMENMLFLID 336
Query: 294 FPLKDQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
P K + + F ++L+ +L STL N KI +++FS +A
Sbjct: 337 LPRKANETVDDTTPFRDELVYFLRASTL-------------NEKIIDKML-QYDFSQTAK 382
Query: 351 VRLIASVPGYHTGS-SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE 409
+ S+ G H GS S ++ GH L T ++ + L Y SS+GSL ++
Sbjct: 383 YAFVHSIGGSHIGSGSYERTGHCGLGTAVKSLGLATS-RPLKLDYIASSVGSLTATFLQN 441
Query: 410 L--SSSMSSGFSEDKTPLG--------------------------IGEPLIVWPTVEDVR 441
L S+ +G + G G + +P+ E V
Sbjct: 442 LYWSAQGDNGTKQLSARAGNPRSSSKSSSNNNNNKKSGGRVDDDWTGRMKVYFPSRETVC 501
Query: 442 CSLEGYAAGNAI---------PSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARY 492
S G +A + P ++V +D S R +
Sbjct: 502 SSRGGVSAAGTLCLMSKWYNSPMFPRDVMRDNRSVREGLLMHSKVLYVRPEGEARKGESR 561
Query: 493 NGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLI 533
+ W + SANLS++AWG L + ++L R++E GV++
Sbjct: 562 SADCAEWAYVGSANLSESAWGRLVIDRKTKQAKLNCRNWESGVVV 606
>gi|159122429|gb|EDP47550.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 665
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 115/244 (47%), Gaps = 33/244 (13%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D+ +AILS++M DI+WL + +LV+ + D T + +
Sbjct: 227 ITIEEVFQRSDLELAILSSFMWDIEWLFSKVDTKS-TRFLLVMQAKDDLTKRQYEAETAS 285
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL +P +RI+ TANL DW ++ D
Sbjct: 286 MSNLRLCFPPMEGQVNCMHSKLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSAFLID 345
Query: 294 FPLK-DQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SA 349
P K ++ + FE +L+ +L STL+ S +F+FS ++
Sbjct: 346 LPRKVATTSVGSKTVFEEELVYFLRASTLQENIISR--------------LDEFDFSPTS 391
Query: 350 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKW 406
+ L+ ++ G HTG++ ++ G+ L + G + S P+ F SS+GSL +++
Sbjct: 392 HIMLVHTIGGSHTGNTWRRTGYCGLGRAVNAL----GLRTSKPINIDFVASSVGSLTDEF 447
Query: 407 MAEL 410
+ +
Sbjct: 448 LRSI 451
>gi|345560675|gb|EGX43800.1| hypothetical protein AOL_s00215g536 [Arthrobotrys oligospora ATCC
24927]
Length = 634
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 99/419 (23%), Positives = 171/419 (40%), Gaps = 64/419 (15%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
+QG+ ++ ++I +V+Q D + A+LS Y D W+L + VLV+H + D
Sbjct: 191 IQGVARTSDD--ITIEEVLQKDTLQTAVLSAYQWDFLWILEKIKT-GECDLVLVLHAKED 247
Query: 228 GTLEHMKRN--KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
++H +RN L P + + HSK LL + +R++V TANL DW
Sbjct: 248 EVVDHYRRNLCNIPRTRLCFPDMSGNVNIMHSKLQLLFHLTHLRVVVPTANLTSYDWGEA 307
Query: 286 SQGLWMQDFPLKDQNNLSEECGFEND--LIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ S E EN +ID+ K + P+H F N F K
Sbjct: 308 T-------------GTGSNEGVMENSVFIIDFPELPKTSTEGSTNPSHTPFSRNLLHFCK 354
Query: 344 ---------------FNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGF 387
++F+ S + + S+ G H G + G L +++ K
Sbjct: 355 AKGMPSDIIKKVDQVYDFTRSQRLGFVYSIGGSHHGDEALRNGVCGLACAVRDLGL-KTR 413
Query: 388 KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLI----VWPTVEDVRCS 443
K+ Y SSLGSL+++++ + ++ G K+ I + I P E
Sbjct: 414 KRVEADYITSSLGSLNKEFLLRIYRAL-HGDEGKKSVQNIPKTFIGRQVKAPEDESTDSE 472
Query: 444 LEGYAAGNAIPSPQKNVDKDFLKKYW---AKWKAS-----HTGRSRAMPHIKT----FAR 491
E + + + + N ++ W +K+ S + R + H K R
Sbjct: 473 TEEDESDDKV--WRDNGGTICFQRQWFNGSKFPQSLLHDCQSVRRGMLMHNKIIFVRLPR 530
Query: 492 YNGQKLAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLI---LPSAKRHGCG 543
G + W + S NLS++AWG L + + ++ R++E GV++ LP + H G
Sbjct: 531 PRGNSIGWAYVGSHNLSESAWGKLVWDRSEKDFKMSNRNWECGVIVPVALPDGQEHTRG 589
>gi|392861898|gb|EAS37505.2| tyrosyl-DNA phosphodiesterase [Coccidioides immitis RS]
Length = 672
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 99/400 (24%), Positives = 173/400 (43%), Gaps = 64/400 (16%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRN--- 236
+ +V+Q D+ +A+LS++ ++DWL V K +LV+ + + + +++
Sbjct: 233 IKFEEVVQKDDLELAVLSSFQWNMDWLFTKFNV-KKTRFLLVMGHKYEEEKQQTQKDFAD 291
Query: 237 KPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQ 292
P+ + P P HSK MLL +P +R++V +ANL+ DW + L++
Sbjct: 292 IPSIRLCFVPMGP-QVNCMHSKLMLLFHPNHLRLVVPSANLVPYDWGEQGGIIENLLFLI 350
Query: 293 DFPLKDQNNLSEECG-FENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
D P K + + F ++L+ +L E KI +F+F +A
Sbjct: 351 DLPRKILGSQEKTSTPFFDELVYFLKASALHE-----------KIIAK-LSEFDFGKTAG 398
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--- 407
+ ++ G HTGS K G L + E + L Y SSLGSL++++M
Sbjct: 399 FAFVHTIGGSHTGSYWGKTGVCGLGKAVTMLGLETP-QPLKLDYITSSLGSLNDQFMRSM 457
Query: 408 ----------AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTVEDVRCSLEGYA 448
EL+ S F DK + + + LI +P+++ V+ S +
Sbjct: 458 YLAAQGDNGLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSLKTVQGSRARPS 517
Query: 449 AGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQKL---------- 497
I K ++ ++ + S + R + H KT F R + K+
Sbjct: 518 GAGTICFQSKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGKIIGDANTTAYQ 575
Query: 498 AWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLI 533
W + SANLS++AWG L + S +L R++E GV+I
Sbjct: 576 GWTYVGSANLSESAWGRLVIDRSTTKPKLNCRNWECGVII 615
>gi|238494160|ref|XP_002378316.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
gi|220694966|gb|EED51309.1| tyrosyl-DNA phosphodiesterase, putative [Aspergillus flavus
NRRL3357]
Length = 679
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 111/242 (45%), Gaps = 29/242 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D+ +A+LS++M +++WL AK LV+ + + T K A
Sbjct: 240 IKIEEVFQKSDLQLAVLSSFMWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAA 298
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL + VRI+V TANL DW +++ D
Sbjct: 299 MSNLRLCFPPMDGQVNCMHSKLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIID 358
Query: 294 FPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
P + D+++ GF ++L + LK N+ A ++FS +A +
Sbjct: 359 LPKRTDKDSGFTRTGFYDELTYF---LKASTLHENIIAK---------LTDYDFSRTAHI 406
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMA 408
+ ++ G H G S ++ G+ L + G + S PL F SS+GSL ++++
Sbjct: 407 AFVHTIGGSHMGDSWRRTGYCGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLR 462
Query: 409 EL 410
+
Sbjct: 463 SI 464
>gi|452845379|gb|EME47312.1| hypothetical protein DOTSEDRAFT_21105 [Dothistroma septosporum
NZE10]
Length = 584
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 109/489 (22%), Positives = 196/489 (40%), Gaps = 95/489 (19%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ + +V++ + A+LS + D +W+L K P ++G S + M+ P
Sbjct: 136 IKLEEVLEPSSVRTAVLSAFQWDTEWVLSKL----KTP----LNGGSTKCVFVMQAKTPD 187
Query: 240 NWILHK--------------PPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
++ PP+ + HSK MLL +P +R+ + +ANL++ DW
Sbjct: 188 ERAQYREWASGFEACLRICLPPMDGAIYCMHSKLMLLFHPHKLRVAIPSANLLNFDWGET 247
Query: 286 SQ---GLWMQDFP-LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFF 341
Q ++M D P L + + E DL T E + G K
Sbjct: 248 GQMENSVFMIDLPRLAGSTSQTTE-----DL-----TFFGQELMFFIERQGLDKDLRKGV 297
Query: 342 KKFNFSSAA-VRLIASVPGY-HTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
F+FS+ + I +V G + + + G + L ++ ++ + + SS+
Sbjct: 298 LGFDFSATEHMAFIHTVGGMNYERTGADRTGLLGLSRAVRYLGLTTDQRELEIDFAASSI 357
Query: 400 GSLDEKWMAELSSSMS-----SGFSEDKTPLG--------------------IGEPLIVW 434
G L++ + +L S+ S + +E K+ I + L V+
Sbjct: 358 GQLNDSQVQDLHSAASGQDLIAQAAEAKSKAATNFFAKKAASSKAASTSERDIKQKLRVY 417
Query: 435 -PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYN 493
PT E V+ S G AAG + K F + + +K++ G + H K
Sbjct: 418 FPTKETVQASTAG-AAGTICLQRKYFEGKTFPRAIFRDYKSTRKG---LLSHNKILC-AR 472
Query: 494 GQKLAWFLLTSANLSKAAWGALQKNNSQLMI--RSYELGVL------ILPSAKRHGCGFS 545
+ LAW + SAN+SK+AWG + K+ + I R++E GVL ILP A +
Sbjct: 473 SKSLAWLYIGSANMSKSAWGEIPKDRKERRITCRNWECGVLLPVPKEILPPACKEKARRR 532
Query: 546 CTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSED 605
T + SE S E + + +L + +P+E+P Y+ +
Sbjct: 533 HTDDEEDSETDSEDEEPQLVDMSVFSSL----------------VDLPFEVPGDDYNGRE 576
Query: 606 VPWSWDKRY 614
PW + +++
Sbjct: 577 -PWYFTEKH 584
>gi|7576178|emb|CAB87929.1| hypothetical protein [Arabidopsis thaliana]
Length = 1075
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 82/195 (42%), Gaps = 32/195 (16%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPP 247
L+ + DI W L C +P + H D N P N + PP
Sbjct: 401 FLATFTSDILWFLTCCDTPRHLPVTIACHNAERCWSSNPDARTAVPLPNYP-NVTMVYPP 459
Query: 248 LP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
P I+FG HH K +L +R+I+ +ANL+ WN+ + +W QDFP
Sbjct: 460 FPEEIAFGKDRTNRGIACHHPKLFILQRKDSIRVIITSANLVARQWNDVTNTVWWQDFPR 519
Query: 297 K---DQNNLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
+ D +L C G + D L+ ++P+ ++ + F K+NF
Sbjct: 520 RADPDLLSLFGHCQRETNHGLKPDFCAQLAGFA-ASLLTDVPSQAHWILE---FTKYNFE 575
Query: 348 SAAVRLIASVPGYHT 362
+A L+ASVPG H+
Sbjct: 576 HSAGHLVASVPGIHS 590
>gi|42567721|ref|NP_196357.2| forkhead-associated domainand FHA domain-containing protein
[Arabidopsis thaliana]
gi|30102672|gb|AAP21254.1| At5g07400 [Arabidopsis thaliana]
gi|110743660|dbj|BAE99667.1| hypothetical protein [Arabidopsis thaliana]
gi|332003770|gb|AED91153.1| forkhead-associated domainand FHA domain-containing protein
[Arabidopsis thaliana]
Length = 1084
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 82/195 (42%), Gaps = 32/195 (16%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWILHKPP 247
L+ + DI W L C +P + H D N P N + PP
Sbjct: 401 FLATFTSDILWFLTCCDTPRHLPVTIACHNAERCWSSNPDARTAVPLPNYP-NVTMVYPP 459
Query: 248 LP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
P I+FG HH K +L +R+I+ +ANL+ WN+ + +W QDFP
Sbjct: 460 FPEEIAFGKDRTNRGIACHHPKLFILQRKDSIRVIITSANLVARQWNDVTNTVWWQDFPR 519
Query: 297 K---DQNNLSEEC------GFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS 347
+ D +L C G + D L+ ++P+ ++ + F K+NF
Sbjct: 520 RADPDLLSLFGHCQRETNHGLKPDFCAQLAGFA-ASLLTDVPSQAHWILE---FTKYNFE 575
Query: 348 SAAVRLIASVPGYHT 362
+A L+ASVPG H+
Sbjct: 576 HSAGHLVASVPGIHS 590
>gi|391872408|gb|EIT81535.1| hypothetical protein Ao3042_01981 [Aspergillus oryzae 3.042]
Length = 679
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 111/242 (45%), Gaps = 29/242 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D+ +A+LS++M +++WL AK LV+ + + T K A
Sbjct: 240 IKIEEVFQKSDLQLAVLSSFMWEMEWLFSKLNT-AKTRFYLVMQAKDESTKLQYKSETAA 298
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL + VRI+V TANL DW +++ D
Sbjct: 299 MSNLRLCFPPMDGQVNCMHSKLMLLFHSGYVRIVVPTANLTPYDWGEIGGLMENSVFIID 358
Query: 294 FPLK-DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
P + D+++ GF ++L + LK N+ A ++FS +A +
Sbjct: 359 LPKRTDKDSGFTRTGFYHELTYF---LKASTLHENIIAK---------LTDYDFSRTAHI 406
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWMA 408
+ ++ G H G S ++ G+ L + G + S PL F SS+GSL ++++
Sbjct: 407 AFVHTIGGSHMGDSWRRTGYCGLGRAVNSL----GLRTSKPLNIDFVTSSVGSLTDEFLR 462
Query: 409 EL 410
+
Sbjct: 463 SI 464
>gi|242823839|ref|XP_002488140.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218713061|gb|EED12486.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 673
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 107/246 (43%), Gaps = 27/246 (10%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT-LEHMK 234
N + + I +V+Q D+ +A+LS + D +WL K ++V+ + + T L++ +
Sbjct: 229 NNNDIKIEEVLQTADLELAVLSAFQWDTEWLFSKFRTPGKTRFLMVMQAKEESTRLQYQQ 288
Query: 235 RNKPA-NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGL 289
N L PP+ HSK MLL +P +RI+V +ANL+ DW + +
Sbjct: 289 ETADMPNIRLCFPPMEGQIKCMHSKLMLLFHPDYLRIVVPSANLVPYDWGEQGGVMENTV 348
Query: 290 WMQDFPLKDQNNLSE--ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF- 346
++ D P + ++ + + F +L +L H N F+F
Sbjct: 349 FLIDLPKRSAQDVPDTPKKAFYEELAFFLQAST---------VHNNIIAK---LSSFDFK 396
Query: 347 SSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDE 404
++ R + ++ G H G ++ GH L + P+ F SS+GSL +
Sbjct: 397 ETSRYRFVHTIGGSHIGECRRRTGHCGLGQAVSSLGLR---THEPISIDFVTSSIGSLTD 453
Query: 405 KWMAEL 410
++M +
Sbjct: 454 EFMRSI 459
>gi|402224759|gb|EJU04821.1| phospholipase D/nuclease [Dacryopinax sp. DJM-731 SS1]
Length = 955
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 70/296 (23%), Positives = 130/296 (43%), Gaps = 28/296 (9%)
Query: 190 DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLP 249
++ + S + D +WL P A +P + + H E + P + ++ P
Sbjct: 508 ELRFVLTSAFGTDFEWLRSMIP--AGVPLLSINHPTDRERWEPQIKPLPLDGWIYATPKM 565
Query: 250 ISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGF 308
G H K +LL Y G +R+++ TANL+ DW + +++QD P K++++ +E F
Sbjct: 566 NKGGIMHVKLLLLFYKNGRLRLVIPTANLVPDDWRDIENTMFLQDIPAKNKDSSAEPHPF 625
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINP-----SFFKKFNFSSAAVRLIASVPG-YHT 362
L +L L + L G + P + +++S +L+ S G Y
Sbjct: 626 PVYLASFLKILNVHNGLSAL-VQGGYPNLPLPSLDALATGWDWSRVTAQLVGSPAGSYED 684
Query: 363 GSSLKKWGHMKLRTVLQECTFEKGF-KKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSED 421
S+++WGH +L +++ + K+ L YQ SS+G+ +++ + S G S D
Sbjct: 685 WDSVRRWGHPRLGEAVRQLKAQPPTGKRLNLEYQGSSIGNYTTQYLNDFYKS-GCGLSPD 743
Query: 422 ---KTPLGIGEPL--IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKK-YWAK 471
+ P P IV+P++ V ++ G + F +K YW+K
Sbjct: 744 VSKRRPKAQPWPAIQIVYPSLTTVDNTVLGRLGAGSF----------FCRKQYWSK 789
>gi|302823724|ref|XP_002993511.1| hypothetical protein SELMODRAFT_449151 [Selaginella moellendorffii]
gi|300138642|gb|EFJ05403.1| hypothetical protein SELMODRAFT_449151 [Selaginella moellendorffii]
Length = 920
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 90/208 (43%), Gaps = 33/208 (15%)
Query: 181 VSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLE 231
VS+ D++ DI ++++ DI W + + + +P + H +E
Sbjct: 239 VSVADLLAPLEDIREMFVASFTTDIIWFISSFGLPKTLPVTIACHDSERSWSTAISDRME 298
Query: 232 HMKRNKPANWILHKPPLPI-----------SFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
H P N + PP P+ G HH K LL + +R+IV ++NL +
Sbjct: 299 HPYCEWP-NLKVVYPPFPVLRRTNDKSRMRGVGCHHPKFFLLKRSKDIRVIVTSSNLNYR 357
Query: 281 DWNNKSQGLWMQDFPLKDQNNLSE-------ECGFEN-DLIDYLSTLKWPEFSANLPAHG 332
W S +W QDFPL++ + S E G N D YL+ ++P+
Sbjct: 358 QWLQVSNTVWWQDFPLRNTRDYSSLFSSKITEGGERNGDFAAYLAGF-ISTLVKDVPSEA 416
Query: 333 NFKINPSFFKKFNFSSAAVRLIASVPGY 360
++ + +NFS A V L+ASVPG+
Sbjct: 417 HWATD---LACYNFSKATVSLVASVPGF 441
>gi|169625658|ref|XP_001806232.1| hypothetical protein SNOG_16105 [Phaeosphaeria nodorum SN15]
gi|160705700|gb|EAT76477.2| hypothetical protein SNOG_16105 [Phaeosphaeria nodorum SN15]
Length = 895
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 89/400 (22%), Positives = 167/400 (41%), Gaps = 54/400 (13%)
Query: 178 TSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT----LEH 232
T+ ++I +V+Q + + +A++S++M D +WL L K+ + +++ +S +
Sbjct: 465 TNDITIDEVLQAESVNIAVVSSFMWDSEWLNKKLSPL-KVKQIWIMNAKSQDVQQRWVRE 523
Query: 233 MKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK------- 285
M+ N +H PP+ + HSK MLL +R++V TAN+ +DW +K
Sbjct: 524 MEDAGIPNLRIHFPPMGGLIHSMHSKFMLLFGRDKLRLVVPTANMTPMDWGDKVNNWQPG 583
Query: 286 --SQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
L++ D P + + ++ + + L+ E + G K + +
Sbjct: 584 VMENSLFLVDLPRRSDGVMGKKQDLTTFGKELVCFLEKQELDKKV-IEGVLKFDFTQTDH 642
Query: 344 FNFSSAAVR---LIASVPGYHTGSSLK-KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
F A + + + G H G + G L +++ + K+ L Y +SL
Sbjct: 643 LAFVHAILEEQSITCTSGGVHKGEQQQLSTGLPGLAKAIRDVHLDD-VKEIELDYASASL 701
Query: 400 GSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGY-----AAGNAIP 454
G++++ ++ + + G+PL V VR Y A N+I
Sbjct: 702 GAINDNFLQRIYLAAQ------------GKPLTTTSAVSQVRRHFRIYFPTDDAVQNSIG 749
Query: 455 SPQKNVDKDFLKKYWAK-------WKASHTGRSRAMPHIK-TFAR---YNGQKLAWFLLT 503
P Y+ + + R + H K F R +G+ AW +
Sbjct: 750 GPDCGGIISLSSHYYNAATFPRECLRNYDSTRRGMLSHNKLLFVRGIKNDGRPFAWVYVG 809
Query: 504 SANLSKAAWGALQ----KNNSQLMIRSYELGVLI-LPSAK 538
SAN+S++AWGA + L IR++E GVL+ +P+ K
Sbjct: 810 SANMSESAWGAQKVLKSGQTGSLNIRNWECGVLMPVPNEK 849
>gi|307108296|gb|EFN56536.1| hypothetical protein CHLNCDRAFT_144175 [Chlorella variabilis]
Length = 226
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 47/72 (65%), Gaps = 6/72 (8%)
Query: 483 MPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS-----A 537
MPH+KT+ R+ G +AW L S N+SKAAWG L ++ +L ++S+EL VL+LPS
Sbjct: 1 MPHLKTYCRHVGGDVAWLCLGSHNVSKAAWGELLRDG-RLYVKSFELSVLLLPSRELAYQ 59
Query: 538 KRHGCGFSCTSN 549
+ GFSCTS
Sbjct: 60 RSRRRGFSCTSG 71
>gi|302142785|emb|CBI20080.3| unnamed protein product [Vitis vinifera]
Length = 1032
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 83/198 (41%), Gaps = 39/198 (19%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPL 248
++ + D+ W L C V +P + H S ++ + N ++ PP
Sbjct: 366 FVATFTSDVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPF 425
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-- 295
P I+FG HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 426 PEAIAFGRDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRI 485
Query: 296 ------------LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ NL F L ++++L ++P+ ++ + K
Sbjct: 486 SPPDYSSIFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTK 537
Query: 344 FNFSSAAVRLIASVPGYH 361
++F A L+ASVPG H
Sbjct: 538 YDFKGATGHLVASVPGIH 555
>gi|359493967|ref|XP_002283806.2| PREDICTED: uncharacterized protein LOC100243589 [Vitis vinifera]
Length = 1091
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 83/198 (41%), Gaps = 39/198 (19%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPL 248
++ + D+ W L C V +P + H S ++ + N ++ PP
Sbjct: 406 FVATFTSDVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPF 465
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-- 295
P I+FG HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 466 PEAIAFGRDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRI 525
Query: 296 ------------LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ NL F L ++++L ++P+ ++ + K
Sbjct: 526 SPPDYSSIFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTK 577
Query: 344 FNFSSAAVRLIASVPGYH 361
++F A L+ASVPG H
Sbjct: 578 YDFKGATGHLVASVPGIH 595
>gi|147770909|emb|CAN67540.1| hypothetical protein VITISV_012382 [Vitis vinifera]
Length = 1423
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 83/198 (41%), Gaps = 39/198 (19%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIH------GESDGTLEHMKRNKPANWILHKPPL 248
++ + D+ W L C V +P + H S ++ + N ++ PP
Sbjct: 410 FVATFTSDVLWFLSYCKVPGHLPVTIACHHTERCWSSSADKRAYVPYSDYPNLVIVHPPF 469
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-- 295
P I+FG HH K ++L +RII+ +ANL+ WN+ + +W QDFP
Sbjct: 470 PEAIAFGRDRKKLGVACHHPKLLVLQREDSIRIIITSANLVAKQWNSVTNTVWWQDFPRI 529
Query: 296 ------------LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+ NL F L ++++L ++P+ ++ + K
Sbjct: 530 SPPDYSSIFTQFCDGEINLDSRSDFAAQLAGFMASL-----VIDVPSQAHWIME---LTK 581
Query: 344 FNFSSAAVRLIASVPGYH 361
++F A L+ASVPG H
Sbjct: 582 YDFKGATGHLVASVPGIH 599
>gi|255539987|ref|XP_002511058.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
gi|223550173|gb|EEF51660.1| tyrosyl-DNA phosphodiesterase, putative [Ricinus communis]
Length = 1148
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 88/205 (42%), Gaps = 41/205 (20%)
Query: 190 DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANWI 242
+I+ ++ + DI W L C + + +P + H D + N P N
Sbjct: 457 NIMRIFIATFTSDILWFLSYCEIPSHLPVTIACHNTERCWSSNPDKRISMPYSNFP-NLS 515
Query: 243 LHKPPLP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWM 291
+ PP P I+FG HH K ++L +R+I+ +ANL+ W+N + +W
Sbjct: 516 VVFPPFPEAIAFGNDRRRQGIACHHPKLLVLQRENSIRVIITSANLVPNQWHNVTNTIWW 575
Query: 292 QDFPLKDQNNLS--------------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN 337
QDFP + +LS F L ++++L ++P+ ++ +
Sbjct: 576 QDFPRRSTPDLSSLFTRVSDGEISQDSRSDFAAQLAGFIASL-----VIDVPSQAHWVVE 630
Query: 338 PSFFKKFNFSSAAVRLIASVPGYHT 362
K+NF A L+AS+PG H+
Sbjct: 631 ---LTKYNFDGALGYLVASIPGIHS 652
>gi|326484528|gb|EGE08538.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Trichophyton equinum CBS 127.97]
Length = 462
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 111/241 (46%), Gaps = 27/241 (11%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ + +V+Q D+ +A+LS+++ D+DWLL + + ++ + + E + R +
Sbjct: 233 IKLEEVLQPSDLELAVLSSFLWDMDWLLMKF-TNPRTRFLFIMGAKGEERREQLLRETAS 291
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQD 293
L PP+ HSK MLL + +RI++ +ANL DW + L++ D
Sbjct: 292 MSRIRLCFPPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEEGGVMENMLFLID 351
Query: 294 FPLKDQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
P K + + F ++L+ +L STL N KI +++FS +A
Sbjct: 352 LPRKANETVDDTTPFRDELVYFLRASTL-------------NEKIIDKML-QYDFSQTAK 397
Query: 351 VRLIASVPGYHTGS-SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE 409
+ S+ G H GS S ++ GH L T ++ + L Y SS+GSL ++
Sbjct: 398 YAFVHSIGGSHIGSGSYERTGHCGLGTAVKSLGLATS-RPLKLDYIASSVGSLTATFLQN 456
Query: 410 L 410
L
Sbjct: 457 L 457
>gi|453087183|gb|EMF15224.1| phospholipase D/nuclease [Mycosphaerella populorum SO2202]
Length = 629
Score = 65.5 bits (158), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 92/413 (22%), Positives = 165/413 (39%), Gaps = 81/413 (19%)
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFPLKDQNNLSEECGFENDL 312
HSK MLL + +RI + TANL++ DW Q +++ D P Q G +NDL
Sbjct: 242 HSKLMLLFHADKLRIAIPTANLLNFDWGETGQMENTVFLIDLPRLPQ-------GQKNDL 294
Query: 313 IDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGH 371
+ L + + G + F+FS+ A + + +V G H + G
Sbjct: 295 TSFGRELMF-----FIEMQGLDQDVRDGVLNFDFSATADIAFVHTVGGVHYKDQAARTGL 349
Query: 372 MKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKW-----MAELSSSMSSGFSEDKTPLG 426
+ L +++ G + + SS+G+L +K MA + + E ++ G
Sbjct: 350 LGLSRTVRQMDLVAG-PSLEIDFAASSIGALTDKQLNDFHMAARGVDLLAHAREARSKAG 408
Query: 427 IG------------------EPLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY 468
+ + +PT E VR S G AAG + F K+
Sbjct: 409 ASFFKKAGSKTVTATTNVRKKIRVYFPTKETVRSSTAG-AAGTICLQREYYERNSFPKEC 467
Query: 469 WAKWKASHTG-------------RSRAMPH-------IKTFARYNGQKLAWFLLTSANLS 508
+ ++++ G RS A H + N +AW + S+N+S
Sbjct: 468 FRDYRSTRKGLLSHNKILCARGFRSTASEHADPPGVSVAATGSPNSNPVAWVYVGSSNMS 527
Query: 509 KAAWGAL--QKNNSQLMIRSYELGVLI------LPSAKRHGCGFSCTSNIVPSEIKSGST 560
K+AWG L ++ S++ R++E GV++ LPS+ F SE ++
Sbjct: 528 KSAWGELAAERTESKITCRNWECGVILSVPVETLPSSAGEA-AFKQRDANGDSETETEDE 586
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWSWDKR 613
++Q + V + A ++ L P+ +P + Y S++ PW + ++
Sbjct: 587 TSAQTSTPEFVNIE--------AFRRIIDL--PFSIPGEEYKSQE-PWYFKEQ 628
>gi|320587853|gb|EFX00328.1| mitochondrial translation optimization protein [Grosmannia
clavigera kw1407]
Length = 1223
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 161/384 (41%), Gaps = 55/384 (14%)
Query: 193 VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP-ANWILHKPPLPIS 251
+A+LS++ D +W++ V K +L+ + + M+ N P +N PP+ +S
Sbjct: 142 LAVLSSFQWDEEWMMQHVDV-RKTKLLLIAYAADENQKVEMRENVPNSNVRFCFPPM-LS 199
Query: 252 FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPLKDQNNLSEECGF 308
G HSK LL Y +RI+V T NL+ DW +++ D P L + G
Sbjct: 200 VGAMHSKLQLLKYADYLRIVVPTGNLVPYDWGESGTIENMVFIIDLP-----RLPAQAGR 254
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLK 367
+ +L L + L A + ++FS+ A + ++ G H S +
Sbjct: 255 ISGKTPFLDDLSY-----FLKAQAVDQSLVQSLDNYDFSATARYAFVHTISGSHAKDSWE 309
Query: 368 KWGHMKLRTVLQECTFEKGFKKSPLV--YQFSSLGSLDEKWMAEL--SSSMSSGFSE--- 420
+ G+ L ++ + + PL Y SS+GSL + + L + +G E
Sbjct: 310 RTGYCGLGRAIKSLGWA---TEEPLQLDYLCSSIGSLGDDLLNALYYACQGDTGMKEYEA 366
Query: 421 --DKTPLGI----GEP------LIVWPTVEDVRCSLEGYAAGNAIPSPQKN--VDKDFLK 466
+K G+ EP + +P+ + V S G I ++N F +
Sbjct: 367 RANKPKKGVLASSSEPDWKSRMRVYFPSHQTVVRSRGGIRGAGTI-CFRRNWWESAKFPR 425
Query: 467 KYWAKWKASHTGRSRAMPHIK-TFARYNGQKL-AWFLLTSANLSKAAWGALQKNNS---- 520
K ++ G + H K F R AW L SANLS++AWG L K+ +
Sbjct: 426 KILRDYQNVKKG---TLAHTKLLFVRREASSAQAWTYLGSANLSESAWGRLVKDRATKEP 482
Query: 521 QLMIRSYELGVLI----LPSAKRH 540
+L R++E GVLI P A+R
Sbjct: 483 RLTCRNWECGVLIPAVPRPEAERR 506
>gi|255719760|ref|XP_002556160.1| KLTH0H06468p [Lachancea thermotolerans]
gi|238942126|emb|CAR30298.1| KLTH0H06468p [Lachancea thermotolerans CBS 6340]
Length = 570
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 112/494 (22%), Positives = 192/494 (38%), Gaps = 91/494 (18%)
Query: 181 VSIRDVI-QGDIIVAILSNYMVDIDWLLPA------CPVLAKIPHVL---VIHGESDGTL 230
++++++ + + A L ++ ++D++LP ++A+ +L I ++ L
Sbjct: 112 ITLQEIFSESKLTRAWLFSFQYELDFILPMFNESTQITIIAQKGTILPPTRISSKTSKIL 171
Query: 231 EHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGL 289
MK + L PP F HHSK ++ Y G I + + N H + N Q +
Sbjct: 172 SKMKTIE-----LQMPP----FACHHSKMIVNEYRDGSCCIYIPSNNFTHAETNLPQQIV 222
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWP-------EFSANLPAHGNFKINPSFFK 342
W L+ + +E F L+ YL+ +P EF L ++ F
Sbjct: 223 WCSP-RLRRCSEAVKESEFRKSLVKYLNA--YPVSLKPLIEFLGTLDFTSLDQLGVEFI- 278
Query: 343 KFNFSSAAVRLIASVPGYHTGSSLKK------WGHMKLRTVLQECTFEKGFKKSPLVYQF 396
F+ +++ +P H S ++ G + R + Q T +PL
Sbjct: 279 -FSCPKPFESILSGIPLLHKALSSRQHAAGGNTGRERHRYLSQVSTI-----GAPLKTGL 332
Query: 397 SSLGSLDEKWMAELSSSMSSGFSEDKTPLGIG-----------EPLIVWPTVEDVRCSLE 445
G+L M L S + G + K I EP IV+PT E++R S
Sbjct: 333 EYPGNLFSHLMIPLLSGLLVGPRDRKRAYEIPNLHKVFEDYNIEPYIVYPTPEEIRQSPM 392
Query: 446 GYAAGNAIPSP-QKNVDKDFLKKYWAKWKASHTG-------RSRAMPHIKTFARYNG--- 494
GY G +N + KW H R R H K + +
Sbjct: 393 GYLTGGWFHFHWLRNQATKTVYNTLKKWGVLHKQQPQDCPRRGRTPSHTKFYMKSTTLLD 452
Query: 495 -----QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSN 549
++ WFL T+ANLS AWG + ++YE+GVL S R S+
Sbjct: 453 NQAPFSEVDWFLFTTANLSLNAWGTTTRKP-----QNYEVGVL-FKSQDRRRITVKSVSD 506
Query: 550 IVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 609
+V S+ +S T QI GSS +++ + + VP+++ P Y D +
Sbjct: 507 LVYSKFRS----TGQIL----------GSSKVHSNANICVM-VPFDINPVPYQPGDDAFC 551
Query: 610 WDKRYTKKDVYGQV 623
+ Y D++G++
Sbjct: 552 VSRSYEAPDIHGKL 565
>gi|46111419|ref|XP_382767.1| hypothetical protein FG02591.1 [Gibberella zeae PH-1]
Length = 676
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 17/198 (8%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D + +A+LS+Y D +WL+ L K +L+ +S+ M+ N P
Sbjct: 142 IKIEEVFQKDKLELALLSSYQWDDEWLVSKFD-LRKTKLLLLAFADSEAQKSEMRSNAPP 200
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPL 296
P + G HSK LL YP +R++V +ANL+ DW +++ D P
Sbjct: 201 GIKFVFPAMN-GPGAMHSKLQLLKYPDYLRVVVPSANLVPYDWGETGVMENMVFLIDLPR 259
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
D + F +L +LS E N + +F S K F + +
Sbjct: 260 LDGSATHRPTPFSTELGRFLSATGVGETMVNSLTNYDF----SQTKHLGF-------VYT 308
Query: 357 VPGYHTGSSLKKWGHMKL 374
+PG H G LK+ G+ L
Sbjct: 309 IPGGHQGDELKRIGYSGL 326
>gi|357520291|ref|XP_003630434.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
gi|355524456|gb|AET04910.1| Tyrosyl-DNA phosphodiesterase [Medicago truncatula]
Length = 1064
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/199 (24%), Positives = 87/199 (43%), Gaps = 41/199 (20%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHG-------ESDGTLEHMKRNKPANWILHKPP 247
++ + DI W L C + +P + + D + +N P N ++ PP
Sbjct: 394 FIATFTSDITWFLTYCKIPYHLPVTIACQNTEKCWSSKPDERVFVPYQNYP-NLVVVHPP 452
Query: 248 LP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
P I+FG HH K ++L +R+I+ +ANL+ WN+ + +W QDFP
Sbjct: 453 FPETIAFGKDHKRHGIACHHPKLIVLQREDSIRVIITSANLVEKQWNSVTNTIWWQDFPR 512
Query: 297 --------------KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFK 342
D+ + + +C F L ++++L ++P+ ++
Sbjct: 513 AILVDYASLFRKIDDDEVHRNSKCDFAAQLAGFMASL-----VIDVPSQAHWITQ---LT 564
Query: 343 KFNFSSAAVRLIASVPGYH 361
K++F SA L+AS+PG H
Sbjct: 565 KYDFGSATGHLVASLPGIH 583
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 70/306 (22%), Positives = 112/306 (36%), Gaps = 100/306 (32%)
Query: 348 SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
+A LIAS+ + +G +L+ VL + + + + S +VY SS+GS++ K++
Sbjct: 746 AAFCSLIASIQ--------RHYGLWRLQEVLNQYRWPESLE-SEIVYGASSIGSVNSKFL 796
Query: 408 AELSS-----SMSSGFSEDKTP----------LGIGEPLIVWPTVEDVRCSLEGYAAGNA 452
A S+ S+ SE+ P L I++PT+E V+ + G
Sbjct: 797 AAFSAAAGKKSLQHFDSEESDPEWGCWNAREELKNPSVKIIFPTIERVKSAYNGILPSRR 856
Query: 453 IPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPH--------------IKTF-ARYNGQKL 497
I F ++ W + K A+PH + F +R +
Sbjct: 857 ILC--------FSERTWQRLKTLDVLHD-AVPHPHERVGHPMHTKVVRRCFWSRGEAPSI 907
Query: 498 AWFLLTSANLSKAAWGALQKN----------------NSQLMIRSYELGVLILPSAKRHG 541
W S N S AAWG N NS L I +YELG++
Sbjct: 908 GWVYCGSHNFSAAAWGRQISNPFGTKADDPHKGDPSVNSGLHICNYELGIIF-------- 959
Query: 542 CGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRY 601
PSE + E +++ TKL + +PY +P +Y
Sbjct: 960 -------TFPPSE----NNECPKVKSTKLDDIV-----------------LPYVVPAPKY 991
Query: 602 SSEDVP 607
S D P
Sbjct: 992 GSLDKP 997
>gi|380495056|emb|CCF32689.1| ubiquitin interaction domain-containing protein [Colletotrichum
higginsianum]
Length = 641
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 119/514 (23%), Positives = 198/514 (38%), Gaps = 108/514 (21%)
Query: 177 NTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR 235
N + I +V+Q D + +A+LS++ D +WLL + +L+ + ++ ++
Sbjct: 148 NGEDIKIEEVLQKDKLQLAVLSSFQWDEEWLLGKVDA-RQTKMLLIAYANNEAEKATIRA 206
Query: 236 NKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQ 292
N P + P P+ G HSK +L Y +RI++ + NL+ DW +++
Sbjct: 207 NAPTGLVRFCFP-PMHGGYMHSKLQILKYEGYLRIVIPSGNLVPYDWGETGVLENMVFLI 265
Query: 293 DFPL---KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-S 348
D P Q F +L +L L E K+ S ++FS +
Sbjct: 266 DLPRIGGTHQTAPPAGTAFGTELRRFLRALGLDE-----------KLVKS-LDNYDFSKT 313
Query: 349 AAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP--LVYQFSSLGSLDEKW 406
+ + S+ G H S + G+ L + ++ + P + Y SSLGSL +
Sbjct: 314 SRYGFVHSIAGSHANDSWQHTGYCGLGSTVRSLGLA---TEEPVNIDYVASSLGSLTHDY 370
Query: 407 MAEL--SSSMSSGFSE-------------DKTPLGIGEPL------------IVWPTVED 439
+ + + SG E K L PL I +PT +
Sbjct: 371 LTAIYHACQGDSGMKEYEARQSKPTRNKAAKAGLAGSRPLGEGTLQWQHHFRIYFPTEKT 430
Query: 440 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAK-------WKASHTGRSRAMPHIKT-FAR 491
V S G ++ I F +K+W + + RS + H K+ F R
Sbjct: 431 VSSSRGGRSSAGTIC---------FQEKWWKSSTFPRELLRDCQSVRSGLLLHSKSIFVR 481
Query: 492 -YNGQKLAWFLLTSANLSKAAWGALQKNN----SQLMIRSYELGVLILPSAKRHGCGFSC 546
G AW + SANLS++AWG L K+ ++L R++E GVL+ G S
Sbjct: 482 GRAGGDAAWAYVGSANLSESAWGRLVKDRESGAAKLTCRNWECGVLVAVEGNPTGTADSG 541
Query: 547 TSNIVPSEIKSGSTETSQIQKTKL-------VTLTWHGSSDAGAS--------------- 584
T V + S +++Q L T T G + A A+
Sbjct: 542 TRPGVDQDAHSRRHPWARVQAQTLEGYARDEETSTSRGVAAATAADSEENRRQQQLDRDE 601
Query: 585 ----SEV--VYLPVPYELPPQRYSSEDV----PW 608
EV +P+P ++P RY S++ PW
Sbjct: 602 SAGLDEVFGTTVPIPMKVPAGRYMSDESAASRPW 635
>gi|407927985|gb|EKG20864.1| hypothetical protein MPH_01847 [Macrophomina phaseolina MS6]
Length = 642
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 106/473 (22%), Positives = 188/473 (39%), Gaps = 91/473 (19%)
Query: 181 VSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q D+ ILS + D +W V + L I G ++ + PA
Sbjct: 218 IKIEEVLQNHDLKSLILSTFDFDHEWF--GTKVKLDMTRQLWIVGAANDDQRYEWSLAPA 275
Query: 240 NWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFP 295
+ + L + G +H K ++ +P+ +R+ + TANL DW + +++ D P
Sbjct: 276 VYSNVELCVLDMKNGHNHGKFLIGSHPKYLRVAITTANLKGHDWGESGKMENTVFIIDLP 335
Query: 296 -LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAV 351
L + SE+ F +L YL +L + L A +F++S + +
Sbjct: 336 RLPEGKKTSEDEATAFCQNLRFYLKSL-----NVGLSAR-------DALLRFDWSRTRNL 383
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE-L 410
+ S+ G G ++ G L ++E + + L Y SSLG+L +M + L
Sbjct: 384 GFVCSLQGASIGDDGQRIGLPGLSQAIKELNLKS--NRLALDYATSSLGALSRGFMKQFL 441
Query: 411 SSSMSSGFSEDK----TPLGIGEPL----IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDK 462
+++ K + +G+ L + +PTV+ VR S G AG I
Sbjct: 442 TAAKGEELEATKEKYDADIKLGDLLKQFRVYFPTVDTVRASKGGEEAGGTI--------- 492
Query: 463 DFLKKYW--------AKWKASHTGRSRAMPHIKTF--------------ARYNGQKLAWF 500
FL+K W A + R+ + H K G+K+AW
Sbjct: 493 -FLRKRWYDAPSFPKASMHDHKSTRNGILSHNKLIICRGQIGPEDEDNAGATEGKKVAWA 551
Query: 501 LLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGST 560
+ S N ++AAWG L ++ + ++ + + + CG I+P S
Sbjct: 552 YVGSHNFTQAAWGTLSRDKNTKTLKV---------NCRNNECGV-----IIPIFRGGASE 597
Query: 561 ETSQIQKTKLVTLTWHGSSDAGASSEVVY--LPVPYELPPQRYSSEDVPWSWD 611
+ Q K + D EV + +P+E+P +RY ++ PW D
Sbjct: 598 QVGQEDK--------NAEEDGLPGYEVFARKMEIPFEIPGERYGNKK-PWFTD 641
>gi|302787823|ref|XP_002975681.1| hypothetical protein SELMODRAFT_415650 [Selaginella moellendorffii]
gi|300156682|gb|EFJ23310.1| hypothetical protein SELMODRAFT_415650 [Selaginella moellendorffii]
Length = 920
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 91/211 (43%), Gaps = 41/211 (19%)
Query: 181 VSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLE 231
VS+ D++ DI ++++ DI W + + + +P + H +E
Sbjct: 239 VSVADLLAPLEDIREMFVASFTTDIIWFISSFGLPKTLPVTIACHDSERSWSTAISDRME 298
Query: 232 HMKRNKPANWILHKPPLPI-----------SFGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
H P N + PP P+ G HH K LL + +R+IV ++NL +
Sbjct: 299 HPYCEWP-NLKVVYPPFPVLRRTNDKSRMRGVGCHHPKFFLLKRSKDIRVIVTSSNLNYR 357
Query: 281 DWNNKSQGLWMQDFPLKDQNNLS-----------EECG-FENDLIDYLSTLKWPEFSANL 328
W S +W QDFPL++ + S E G F L ++STL ++
Sbjct: 358 QWLQVSNTVWWQDFPLRNTRDYSSLFSSKITDGGERNGDFAAYLAGFISTL-----VKDV 412
Query: 329 PAHGNFKINPSFFKKFNFSSAAVRLIASVPG 359
P+ ++ + +NFS A V L+ASVPG
Sbjct: 413 PSEAHWATD---LACYNFSKATVSLVASVPG 440
>gi|71004940|ref|XP_757136.1| hypothetical protein UM00989.1 [Ustilago maydis 521]
gi|46096766|gb|EAK81999.1| hypothetical protein UM00989.1 [Ustilago maydis 521]
Length = 687
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 125/292 (42%), Gaps = 47/292 (16%)
Query: 193 VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR-------------NKPA 239
+A+L+ Y + IDWL P + VL E EH+ R +
Sbjct: 226 LAVLATYDLRIDWLYSLFPRQLPVTLVLPPPKEDYRVNEHVARPGLHPSHIFGGDFTRCP 285
Query: 240 NWILHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKD 298
W + P P + T H K ++L++ R +R+ + + NL +DW+ ++QDFPL
Sbjct: 286 GWQICVPNKPKGGWLTQHIKFLILVHQRFLRVAILSGNLNAIDWDRIENTAYIQDFPLLG 345
Query: 299 QNNL------------SEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNF 346
Q ++ S + F++ L+ L +L P A A +++F
Sbjct: 346 QASMINHGSGSSSGSKSSQNDFKSQLVRVLRSLSMPASHAVYAA----------LDRYDF 395
Query: 347 SSAA-VRLIASVPGYHTGSSLKKWGHMKLRTV--LQECTFEKGFKKS-PLVYQFSSLGSL 402
S A R++AS P +SL++W ++ + + L + + G K+S L Q SSL +
Sbjct: 396 SLATRARIVASWP---EAASLREWDQIETQGLGRLGKVVRDLGIKESVELECQGSSLANH 452
Query: 403 DEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRCSLEGYAAGNAIP 454
D KW+ S PL G+P V P + ++ + GNA+P
Sbjct: 453 DVKWIEHFHLLASGVEPRGLLPLK-GKPNEVHP---EYASAIGATSKGNALP 500
>gi|327295831|ref|XP_003232610.1| hypothetical protein TERG_06602 [Trichophyton rubrum CBS 118892]
gi|326464921|gb|EGD90374.1| hypothetical protein TERG_06602 [Trichophyton rubrum CBS 118892]
Length = 677
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 105/480 (21%), Positives = 184/480 (38%), Gaps = 83/480 (17%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLL-----PACPVLAKIPHVLVIHGESDGTLEHMK 234
+ + +V+Q D+ +A+LS+++ D+DWLL P+ L ++ GE +
Sbjct: 217 IKLEEVLQPSDLELAVLSSFLWDMDWLLMKFTNPSTRFL----FIMGAKGEERRAQLLRE 272
Query: 235 RNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLW 290
+ L PP+ HSK MLL + +RI++ +ANL DW K L+
Sbjct: 273 TASMSRIRLCFPPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEKGGVMENMLF 332
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA 350
+ D P K +++ F ++L+ +L E + H +N F + S AA
Sbjct: 333 LIDLPRKANETVNDTTPFRDELVYFLRASTLNEKIIDKMLH---TLNSIFVNSNSLSLAA 389
Query: 351 VRLIA---SVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
S + S ++ GH L T ++ + L Y SS+GSL ++
Sbjct: 390 CCCCCCWLSGGSHIGSGSYERTGHCGLGTAVKSLGLATS-RPLKLDYITSSVGSLTATFL 448
Query: 408 AEL--SSSMSSGFSEDKTPLG----------------------IGEPLIVWPTVEDVRCS 443
L S+ +G + G G + +P+ E VR S
Sbjct: 449 QNLYWSAQGDNGTKQLSARAGNTRSSNKSNQSSKRSGRGDDDWTGRMKVYFPSRETVRSS 508
Query: 444 LEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQK------ 496
G +A + K + + + + + R + H K +AR G+
Sbjct: 509 RGGVSAAGTLCLMSKWYNSPMFPR--DVMRDNRSVREGLLMHSKVLYARPEGEARKGESR 566
Query: 497 ----LAWFLLTSANLSKAAWGAL----QKNNSQLMIRSYELGVLILPSAKRHGCGFSCTS 548
W + SANLS++AWG L + ++L R++E GV ++P + S
Sbjct: 567 SADCAGWAYVGSANLSESAWGRLVIDRKTKQAKLNCRNWESGV-VVPVGRGEDGTQRGAS 625
Query: 549 NIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 608
+ + E SQ + +PVP + P + Y+ ++ PW
Sbjct: 626 AASAAAGAAPEAELSQTFR--------------------AAVPVPMQEPGREYAEDEQPW 665
>gi|310793199|gb|EFQ28660.1| ubiquitin interaction domain-containing protein [Glomerella
graminicola M1.001]
Length = 628
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 115/496 (23%), Positives = 190/496 (38%), Gaps = 95/496 (19%)
Query: 181 VSIRDVIQGD-IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +++Q D + +A+LS++ D +WLL V + +LV + ++ ++ N P
Sbjct: 154 IKIEEILQKDKLQLAVLSSFQWDEEWLLSKVDV-RQTRLLLVAYANNEAEKAAIRANAPT 212
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPL 296
+ P P+ G HSK +L Y +RI++ + NL+ DW +++ D P
Sbjct: 213 GLVRFCFP-PMYGGYMHSKLQILKYEGYLRIVIPSGNLVPYDWGETGVLENMVFLIDLPK 271
Query: 297 KDQNNLS---EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
+ + E F +L +L L E K+ S ++F+ ++
Sbjct: 272 LESTQQAAPPAETLFGTELRRFLRALGLDE-----------KLVKS-LDSYDFTETSRYG 319
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTV----LQECTFEKGFKKSPLV---YQFSSLGSLDEK 405
+ S+ G H S W H T L G V Y SSLGSL++
Sbjct: 320 FVHSIAGSHANDS---WQHTGQSTRGYCGLGSTVRSLGLATEDAVDIDYVASSLGSLNDA 376
Query: 406 WMAEL--SSSMSSGFSE------------------DKTPLGIGEPL-------IVWPTVE 438
+ + + SG E D + EPL I +PT
Sbjct: 377 SLKAIYYACQGDSGMKEYDARKPKPARSKAAKAGLDGSRPVFNEPLQLQRHFRIYFPTEH 436
Query: 439 DVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAK-------WKASHTGRSRAMPHIKTFAR 491
V S G ++ I F +K+W + + RS + H K
Sbjct: 437 TVSSSRGGRSSAGTIC---------FQEKWWKSSTFPRELLRDCQSVRSGLLLHTKAIFV 487
Query: 492 YNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCT 547
AW + SANLS++AWG L K +L R++E GVL+ G + T
Sbjct: 488 QARDGAAWAYMGSANLSESAWGRLVKERDSGAPKLTCRNWECGVLVAVDGNLPGSADTGT 547
Query: 548 SNIVPSEIKSGSTETSQIQKTKLVTLT--------WHGSSDAGASSEVVY---LPVPYEL 596
V + + G S+ + VT+T D E V+ +P+P ++
Sbjct: 548 RPGVDQDAQ-GQAPMSKGEGGPAVTVTDSEEKQRHQQLGQDEPRCLEGVFGTTMPIPMKV 606
Query: 597 PPQRYSSEDV----PW 608
P RY+S++ PW
Sbjct: 607 PAGRYTSDESAASRPW 622
>gi|242072904|ref|XP_002446388.1| hypothetical protein SORBIDRAFT_06g015125 [Sorghum bicolor]
gi|241937571|gb|EES10716.1| hypothetical protein SORBIDRAFT_06g015125 [Sorghum bicolor]
Length = 972
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 47/282 (16%)
Query: 117 VSNDGATNGEL---SSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLP 173
V+NDG +GEL SK R + + G +EE + D STF L R+ G
Sbjct: 214 VANDG--DGELPFHGSKGCRDDNAEQPGCGSGNEEQYHSEACYSDG--STFFLNRLVGTG 269
Query: 174 AWANT---SCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDG 228
+ S V++ ++ ++ ++ + DI W L C + +P + H + D
Sbjct: 270 SDTRAEPQSGVTLPQLLHPVDSLVRVFIATFTSDISWFLNYCKIPQHLPVTIACHNK-DR 328
Query: 229 TLEHMKRNKPANWILHKP---------PLPISFG---------THHSKAMLLIYPRGVRI 270
N+ A P P I+FG HH K ++L +R+
Sbjct: 329 CWSASSENRTAAPFESHPKLLLVFPRFPEEIAFGQDRKKQGVACHHPKLIVLQREDSMRV 388
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLS--------EECGFENDLIDYLSTLKWP 322
IV +ANL+ W+ + +W QDFP + + + ++ F L+ +++++
Sbjct: 389 IVTSANLVPRQWHLITNTVWWQDFPRRTSLDYAALFSAAEKQKSDFAAQLVSFIASM--- 445
Query: 323 EFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
+P+ + IN K++F A LIASVPG H S
Sbjct: 446 --VNEVPSQA-YLINE--IAKYDFEGAGGYLIASVPGIHAQS 482
>gi|367001138|ref|XP_003685304.1| hypothetical protein TPHA_0D02330 [Tetrapisispora phaffii CBS 4417]
gi|357523602|emb|CCE62870.1| hypothetical protein TPHA_0D02330 [Tetrapisispora phaffii CBS 4417]
Length = 563
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 113/488 (23%), Positives = 191/488 (39%), Gaps = 82/488 (16%)
Query: 181 VSIRDVIQGD--IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKP 238
+ ++D+ GD + +IL ++ ++++LL L I ++ VI ++ +K+
Sbjct: 109 IRMKDIF-GDNRLKTSILFSFQFEMNFLLSQFN-LDTIENIYVIAQKNTVVPPTLKKFNS 166
Query: 239 A----NWI-LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQ 292
N + + PP F HHSK ++ IY + ++ + + N + N Q W
Sbjct: 167 VFDRLNIVEFYMPP----FSCHHSKMVINIYEDKSCKLFIPSNNFTFYETNLPQQVCWEG 222
Query: 293 DFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSAN---LPAHGNFKINPSFFKKFNFSSA 349
D N+ +++ F+ +LI Y + N +P N F K N
Sbjct: 223 PTLPYDINSKNQKISFKENLISYFQSYPSEVKIMNRTIIPMISNID-----FSKLN---- 273
Query: 350 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM-- 407
V + S P S + K ++ + L C+ + K++ + Q S++G K +
Sbjct: 274 NVEFLYSSPN-DKDSGISKLLYLLEKNDLLGCSDDIN-KRTHFLCQSSTIGGSLSKTVPL 331
Query: 408 ---AELSSSMSSGFSEDKTPLGIGE------------PLIVWPTVEDVRCSLEGYAAGNA 452
L SG + L + P IV+PTVE++R S G+ N
Sbjct: 332 NIFTHLMIPEFSGIQKSNKKLKTSQELIDIYREKRISPYIVYPTVEELRNSPSGWKCSNW 391
Query: 453 IPSPQKN-------VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQ---------K 496
KN + KDF Y K + + R H K + R K
Sbjct: 392 FHFNYKNKAEYYEVLAKDFKLFYKQKDQLTSKYRKATPSHSKFYIRCTENDSKVPARFSK 451
Query: 497 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIK 556
L W + TS+NLS AWG L R+YE+G+L+ G +C+S +
Sbjct: 452 LDWCIFTSSNLSFNAWGKLSSK-----PRNYEVGILL---CSNEGQQINCSSFSRKIDEH 503
Query: 557 SGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE-DVPWSWDKRYT 615
G + S TK +D + V+ VP+ LP + Y + D + K Y
Sbjct: 504 QGCSRLSDSNNTK---------NDGKKNINVM---VPFTLPLEPYDIKYDTAFCIQKSYN 551
Query: 616 KKDVYGQV 623
D +G+V
Sbjct: 552 LPDCFGEV 559
>gi|158293223|ref|XP_001237573.2| AGAP010579-PA [Anopheles gambiae str. PEST]
gi|157016855|gb|EAU76764.2| AGAP010579-PA [Anopheles gambiae str. PEST]
Length = 103
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/53 (56%), Positives = 38/53 (71%), Gaps = 1/53 (1%)
Query: 483 MPHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILP 535
MPHIKT+ R+ + L WFLLTSAN SK+AWG + + + L I +YE GVL LP
Sbjct: 1 MPHIKTYCRWTPEGLQWFLLTSANFSKSAWG-ITRYDKLLYINNYEAGVLFLP 52
>gi|115386326|ref|XP_001209704.1| predicted protein [Aspergillus terreus NIH2624]
gi|114190702|gb|EAU32402.1| predicted protein [Aspergillus terreus NIH2624]
Length = 381
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 89/195 (45%), Gaps = 23/195 (11%)
Query: 171 GLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT 229
GLP + + I +V+Q D+ VA+LS++M D+DWL + V ++ + D T
Sbjct: 199 GLPRQGDD--IKIEEVLQRSDLKVAVLSSFMWDMDWLFSKMDQV-NTRFVFLMQAKDDAT 255
Query: 230 LEHMKRNKP--ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN--- 284
+R N L PP+ HSK M+L +P VRI++ TANL DW
Sbjct: 256 KRQYERETADLRNLKLCFPPMEGQVQCMHSKLMILFHPGHVRIVIPTANLTPYDWGEMGG 315
Query: 285 -KSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK 343
+++ D P ++ E F+ +LI +L A +++ + +
Sbjct: 316 VMENTVFLIDLPKLHPDSERIETNFKKELIYFLQ------------ASAAYEMVTTKLNE 363
Query: 344 FNFSSAA-VRLIASV 357
++FS A + L+ S+
Sbjct: 364 YDFSKTAHIALVHSI 378
>gi|121703656|ref|XP_001270092.1| tyrosyl-DNA phosphodiesterase domain protein [Aspergillus clavatus
NRRL 1]
gi|119398236|gb|EAW08666.1| tyrosyl-DNA phosphodiesterase domain protein [Aspergillus clavatus
NRRL 1]
Length = 683
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 108/473 (22%), Positives = 181/473 (38%), Gaps = 74/473 (15%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D+ +A+LS+++ D++W +LV+ + D T + +
Sbjct: 238 ITIEEVFQKDDLELAVLSSFIWDMEWFFSKLDT-KHSRFLLVMQAKDDATKRQYEAETAS 296
Query: 240 --NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN----KSQGLWMQD 293
N L PP+ HSK MLL +P +RI+V TANL DW ++ D
Sbjct: 297 MRNLRLCFPPMDGQINCMHSKLMLLFHPEYLRIVVPTANLTPYDWGEMGGVMENSAFLID 356
Query: 294 FP--LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAV 351
P ++ + F DL+ +LS + E N+ A K+ F++ + +
Sbjct: 357 LPRKSSTLSSSDSKTAFLEDLVFFLSASRLHE---NVIA----KLGDYDFRE----TKHI 405
Query: 352 RLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAE-- 409
L+ ++ G H + K G L ++ FK + Y SS+GSL ++++
Sbjct: 406 MLVHTIGGSHI-ENFSKTGFCGLGRAVKALGLST-FKSISIDYVTSSVGSLTDEFLRSIY 463
Query: 410 LSSSMSSGFSE-----DKT----PLGIGEPLIVWPTVEDVRCSLEGY------------- 447
L+ G +E KT P +++ P E+ + Y
Sbjct: 464 LACQGDDGMTEHALRTTKTMPARPPTTTSSILLKPAAEECKDRFRVYFPSQTTVEQSRGG 523
Query: 448 --AAGNAIPSPQKNVDKDFLKKYWAKWKAS------HTGRSRAMPHIKTFARYNGQKLAW 499
AG + F K K+ H P Q W
Sbjct: 524 PNCAGTICFQQRWYEGPKFPKHLLRDCKSRRPGLLMHNKMLFVTPDEPITLPDTSQCQGW 583
Query: 500 FLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEI 555
+ SANLS++AWG L ++ + +L R++E GVLI A+ T+ P E
Sbjct: 584 AYVGSANLSESAWGRLVQDRATKRPKLNCRNWECGVLIPVRAE-------ATAENRPKES 636
Query: 556 KSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 608
+S + + G + + +PVP +P QRY PW
Sbjct: 637 ESKPVDG--------LDKPGEGEVERMLDTFKDTVPVPMRVPGQRYGPGLKPW 681
>gi|302892021|ref|XP_003044892.1| hypothetical protein NECHADRAFT_94592 [Nectria haematococca mpVI
77-13-4]
gi|256725817|gb|EEU39179.1| hypothetical protein NECHADRAFT_94592 [Nectria haematococca mpVI
77-13-4]
Length = 674
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 86/199 (43%), Gaps = 19/199 (9%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D + +A+LS+Y D +WLL L + +LV + M+ N P
Sbjct: 148 IKIEEVFQKDRLELAVLSSYQWDDEWLLSKID-LRRTKLLLVASAADESQKREMQSNTPP 206
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
P + G HSK LL YP +R++V TANL+ DW +++ D P
Sbjct: 207 GIRFCFPAMN-GPGAMHSKLQLLKYPDYLRVVVPTANLVPYDWGETGVMENMVFLIDLPK 265
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIA 355
+ + + F +L +LS G S ++FS + +
Sbjct: 266 LEASVDHQPTHFSTELGRFLSET------------GVGAGMVSSLSNYDFSRTKHLGFVY 313
Query: 356 SVPGYHTGSSLKKWGHMKL 374
++PG H G SLK+ G+ L
Sbjct: 314 TIPGGHVGDSLKRIGYCGL 332
>gi|408391841|gb|EKJ71209.1| hypothetical protein FPSE_08715 [Fusarium pseudograminearum CS3096]
Length = 598
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 85/194 (43%), Gaps = 17/194 (8%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D + +A+LS+Y D +WL+ L K +L+ +S+ M+ N P
Sbjct: 142 IKIEEVFQKDKLELALLSSYQWDDEWLVSKFD-LRKTKLLLLAFADSEAQKSEMRSNAPP 200
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFPL 296
P + G HSK LL YP +R++V +ANL+ DW +++ D P
Sbjct: 201 GIKFVFPAM-NGPGAMHSKLQLLKYPDYLRVVVPSANLVPYDWGETGVMENMVFLIDLPR 259
Query: 297 KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIAS 356
D + F +L +LS E N + +F S K F + +
Sbjct: 260 LDGSATHRPTPFSIELGRFLSATGVGETMVNSLTNYDF----SQTKHLGF-------VYT 308
Query: 357 VPGYHTGSSLKKWG 370
+PG H G LK+ G
Sbjct: 309 IPGGHQGDELKRIG 322
>gi|118785322|ref|XP_001237572.1| AGAP010578-PA [Anopheles gambiae str. PEST]
gi|116128029|gb|EAU76763.1| AGAP010578-PA [Anopheles gambiae str. PEST]
Length = 239
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 64/138 (46%), Gaps = 7/138 (5%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPL 248
G++ ++ YM+DI+WLL H L+I + LE + +P N K
Sbjct: 83 GELECSLQLTYMIDINWLLEQYSDAGYEQHPLLILYGDESELETISDKQP-NVTAIKIKT 141
Query: 249 PISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQD-FPL----KDQNNL 302
FG HH+K L Y G +R++V TANL DW N++QGLW+ P D
Sbjct: 142 KTGFGLHHTKMGLYGYCDGSMRVVVSTANLYENDWYNRTQGLWISPRLPAVPEGSDPTYG 201
Query: 303 SEECGFENDLIDYLSTLK 320
F + L++YL K
Sbjct: 202 ESRTDFRSSLLEYLGAYK 219
>gi|389739055|gb|EIM80250.1| phospholipase D/nuclease [Stereum hirsutum FP-91666 SS1]
Length = 698
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 95/425 (22%), Positives = 163/425 (38%), Gaps = 79/425 (18%)
Query: 171 GLPAWANTSCVSIRDVIQGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHGES 226
G P + TS + + + AI+S+Y + + W+ P+ PV+ ++ E+
Sbjct: 217 GKPVFGLTSIIGDK----SQVAFAIISSYALQLSWIYEFFDPSTPVV-----MVAQPTEA 267
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNK 285
+ + +K P NWI P L +G H M + Y G +RI + TANL+ DW +
Sbjct: 268 EKGQKTIKEILP-NWIRVTPFLRSGYGVMH---MKIFYKSGRLRIAISTANLVDFDWKDI 323
Query: 286 SQGLWMQDFPLKDQ--NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINP----- 338
+W+QD P + + + + F L L +L H + P
Sbjct: 324 ENTVWIQDVPQRSKPIPHDPKADDFPTAFERVLKALNVEPALTSL-VHNDHPTIPLSSLH 382
Query: 339 --SFFKKFNFSSAAVRLIASVPGYHTG-SSLKKWGHMKLRTVLQECTFEKGFKKSP---- 391
S ++FS L+ S+ G H + + G L ++E E G
Sbjct: 383 PGSLRTAYDFSRVKAHLVPSLAGKHEHWPQVLRVGETALMKAVREIGCEVGSGSGGGKRG 442
Query: 392 ---LVYQFSSLGSLDEKWMAELSSSMSSGFSE---DKTPLGIGE------PLIVWPTVED 439
+ YQ SS+G+ +W+ E S E DKT + I++PT E
Sbjct: 443 KLRVEYQGSSIGTYSTQWINEFYICASGTSPEKYLDKTKASKSKLPYPDSMTILFPTREW 502
Query: 440 VRCSLEGYAAGNAIPSPQKNVDK-DFLKKYWAKWKASHTGRSRAMPHIKT---------- 488
V+ S+ G A G + + D F ++ + + S + R + + H K
Sbjct: 503 VKGSVLGEAGGGTMFCRKDQWDAPKFPRELFGQ---SKSKRGKVLMHSKVHESSVTESES 559
Query: 489 ------------------FARYNGQKLAWFLLTSANLSKAAWGALQKNNSQ--LMIRSYE 528
+ + W + S N + +AWG L + L I +YE
Sbjct: 560 ESEPEPPQDAEESDSDLEIVEKKAKAVGWAYVGSHNFTPSAWGTLSGSGFHPVLNITNYE 619
Query: 529 LGVLI 533
LG+++
Sbjct: 620 LGIVL 624
>gi|115458196|ref|NP_001052698.1| Os04g0403400 [Oryza sativa Japonica Group]
gi|113564269|dbj|BAF14612.1| Os04g0403400 [Oryza sativa Japonica Group]
Length = 1011
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 80/193 (41%), Gaps = 33/193 (17%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPL 248
++ + D+ W L C V +P + H + + A N +L P
Sbjct: 321 FIATFTSDVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQF 380
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
P I+FG HH K ++L +R+IV +ANL+ W+ + +W QDFP +
Sbjct: 381 PEEIAFGKDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCR 440
Query: 298 DQNNLS--------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
+ S + F L+ +++ F N ++ IN K+NF A
Sbjct: 441 TSTDYSALFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGA 492
Query: 350 AVRLIASVPGYHT 362
A LIASVPG +
Sbjct: 493 AGYLIASVPGIYA 505
>gi|38346146|emb|CAD40679.2| OSJNBb0118P14.6 [Oryza sativa Japonica Group]
Length = 989
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 80/193 (41%), Gaps = 33/193 (17%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPL 248
++ + D+ W L C V +P + H + + A N +L P
Sbjct: 321 FIATFTSDVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQF 380
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
P I+FG HH K ++L +R+IV +ANL+ W+ + +W QDFP +
Sbjct: 381 PEEIAFGKDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCR 440
Query: 298 DQNNLS--------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
+ S + F L+ +++ F N ++ IN K+NF A
Sbjct: 441 TSTDYSALFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGA 492
Query: 350 AVRLIASVPGYHT 362
A LIASVPG +
Sbjct: 493 AGYLIASVPGIYA 505
>gi|222628800|gb|EEE60932.1| hypothetical protein OsJ_14671 [Oryza sativa Japonica Group]
Length = 1021
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 80/193 (41%), Gaps = 33/193 (17%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPL 248
++ + D+ W L C V +P + H + + A N +L P
Sbjct: 321 FIATFTSDVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQF 380
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
P I+FG HH K ++L +R+IV +ANL+ W+ + +W QDFP +
Sbjct: 381 PEEIAFGKDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCR 440
Query: 298 DQNNLS--------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
+ S + F L+ +++ F N ++ IN K+NF A
Sbjct: 441 TSTDYSALFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGA 492
Query: 350 AVRLIASVPGYHT 362
A LIASVPG +
Sbjct: 493 AGYLIASVPGIYA 505
>gi|224119906|ref|XP_002318192.1| predicted protein [Populus trichocarpa]
gi|222858865|gb|EEE96412.1| predicted protein [Populus trichocarpa]
Length = 1131
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 82/208 (39%), Gaps = 45/208 (21%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHG------ESDGTLEHMKRNKPANWILHKPPL 248
++ + DI W L C + +P + H S + + N ++ PP
Sbjct: 460 FIATFTSDILWFLSHCEIPCHLPVTIACHNTERCWSSSPDNRTSVPYSDFPNLVVVFPPF 519
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLI------HVDWNNKSQGLWM 291
P I+FG HH K ++L +R+I+ +ANL+ H WNN + +W
Sbjct: 520 PESIAFGQDRKRRGIACHHPKLLVLQREDSIRVIITSANLVSNQVVAHSKWNNVTNTVWW 579
Query: 292 QDFPLKD--------------QNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKIN 337
QDFP + N F L +++ L N+P+ +
Sbjct: 580 QDFPARSAPDPSPLFIRVSDGDANKDSRSDFAAQLAGFMACL-----VINVPSQAYWI-- 632
Query: 338 PSFFKKFNFSSAAVRLIASVPGYHTGSS 365
S K++F A L+ASVPG H+ S
Sbjct: 633 -SELTKYDFEGANGHLVASVPGIHSRRS 659
>gi|218194792|gb|EEC77219.1| hypothetical protein OsI_15757 [Oryza sativa Indica Group]
Length = 974
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 80/193 (41%), Gaps = 33/193 (17%)
Query: 195 ILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA------NWILHKPPL 248
++ + D+ W L C V +P + H + + A N +L P
Sbjct: 322 FIATFSSDVSWFLDYCKVPQNLPVTIACHNKERCWSASRESRTAAPFGSYPNLLLVYPQF 381
Query: 249 P--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
P I+FG HH K ++L +R+IV +ANL+ W+ + +W QDFP +
Sbjct: 382 PEEIAFGKDRKKQGVACHHPKLLVLQRKDSMRVIVTSANLVPRQWHLITNTVWWQDFPCR 441
Query: 298 DQNNLS--------EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
+ S + F L+ +++ F N ++ IN K+NF A
Sbjct: 442 TSTDYSALFSKVEESKSDFATQLVSFIA------FLINEVPSQSYWINE--IAKYNFEGA 493
Query: 350 AVRLIASVPGYHT 362
A LIASVPG +
Sbjct: 494 AGYLIASVPGIYA 506
>gi|156844717|ref|XP_001645420.1| hypothetical protein Kpol_534p43 [Vanderwaltozyma polyspora DSM
70294]
gi|156116082|gb|EDO17562.1| hypothetical protein Kpol_534p43 [Vanderwaltozyma polyspora DSM
70294]
Length = 568
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 95/421 (22%), Positives = 170/421 (40%), Gaps = 88/421 (20%)
Query: 251 SFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFE 309
+F HHSK ++ Y +I + + N +++ N Q W+ L + + E F+
Sbjct: 184 AFSCHHSKMIINFYEDNSCKIFIPSNNFTYMETNLPQQVCWVSP-RLPEASGTPPENKFK 242
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKK 368
+L Y+ + + L S+ ++ +F+S + V + SVP + S K+
Sbjct: 243 KNLFKYIYSYQDKRVRQVL----------SYLREIDFNSLSNVEFVYSVPSKSSVSGFKQ 292
Query: 369 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG-SLDEKW---------------MAELSS 412
+ L+ +E + + Q S++G S+ +K+ + E ++
Sbjct: 293 LAALLLKNSTKEDFSTPTDIQHHYLCQTSTIGGSISKKFPLNLFTGIMIPTFSRLIEFNT 352
Query: 413 SMSSGFSEDKTPLGIGE--------PLIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDF 464
+S S+ +P + E P +V+PTVE++R S G++ + ++ +
Sbjct: 353 EPNSR-SKSASPEDMIEQLNSHNIKPYLVYPTVEEIRNSPSGWSCSGWFNFRYQKNNEQY 411
Query: 465 LK-----KYWAKWKASHTGRSR-AMP-------HIKTFARYNGQK----LAWFLLTSANL 507
L K + K A+ + R A P KT + N L W + TSANL
Sbjct: 412 LSLLNDFKCFYKQNANLISKHRKATPSHSKFYLKSKTSVKSNSNNPFDILDWCVYTSANL 471
Query: 508 SKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQK 567
S +AWG S + R+YE+G+L ST QI+
Sbjct: 472 SVSAWGT-----SSRLARNYEVGILF------------------------QSTPELQIKC 502
Query: 568 TKLVTLTWH-GS--SDAGASSEVVYLPVPYELPPQRY-SSEDVPWSWDKRYTKKDVYGQV 623
V + + GS SD S V + VP+ LP Y +++D + K Y D+ G+
Sbjct: 503 KSFVDVIYRKGSKLSDTAPSCNTVNVMVPFTLPCSPYDTTKDEAFCISKNYDLPDINGEY 562
Query: 624 W 624
+
Sbjct: 563 F 563
>gi|342884381|gb|EGU84597.1| hypothetical protein FOXB_04892 [Fusarium oxysporum Fo5176]
Length = 632
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 85/203 (41%), Gaps = 32/203 (15%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGESDGTLEHMKRNKP 238
+ I +V Q D + +A+LS+Y D +WL+ P K+ +L+ +S+ M+ N P
Sbjct: 146 IKIEEVFQKDKLELALLSSYQWDDEWLMSKIDPRKTKL--LLLAFADSEAQKSEMRSNAP 203
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFP 295
P + G HSK LL YP +R++V TANL+ DW +++ D P
Sbjct: 204 PGIKFVFPAM-NGPGAMHSKLQLLKYPDYLRVVVPTANLVPYDWGETGVMENMVFLIDLP 262
Query: 296 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
+ F +L +LS E H F +
Sbjct: 263 RLKDPATYRQTAFSTELGRFLSATGVGEG-----MHLGF-------------------VY 298
Query: 356 SVPGYHTGSSLKKWGHMKLRTVL 378
++PG H G SLK+ G+ L T +
Sbjct: 299 TIPGGHQGDSLKRIGYSGLGTTV 321
>gi|449302183|gb|EMC98192.1| hypothetical protein BAUCODRAFT_416098 [Baudoinia compniacensis
UAMH 10762]
Length = 610
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 97/431 (22%), Positives = 171/431 (39%), Gaps = 79/431 (18%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPAC---PVLAKIPHVLVIHGESDGTLEHMKRN 236
+ I +V++ + A+LS + D++W+L P + V+ + D + M
Sbjct: 142 IKIEEVLEPRTLRTALLSAFQWDVEWVLSKLKVPPNGGTTKCIFVMQAKEDSLRQQMLTE 201
Query: 237 KPANWILHKPPLPISFGT---HHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQD 293
A + P G+ HSK MLL +P +RI + +ANL+ DW
Sbjct: 202 TDAMRPFLRLTFPYMGGSVFCMHSKLMLLFHPHKLRIAIPSANLLSFDWG---------- 251
Query: 294 FPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---------- 343
+ + E F DL + + + +L G + F KK
Sbjct: 252 -----ETGMMENSVFIIDLPRLVDEQRARVTADDLTFFGKELL--YFLKKQDIDQDVRDG 304
Query: 344 ---FNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSL 399
F+F++ A + + + G G ++ G L ++ + + + + SS+
Sbjct: 305 VLGFDFAATAHIAFVHTAGGTSFGEEAQRTGLPGLARAVRSLRLQT--RSLEVDFAASSI 362
Query: 400 GSLDEKWMAELSSS---------MSSGFSEDKTPLGIGEP--------------LIVWPT 436
GSL+++++ + S+ S+ S+ K P I +PT
Sbjct: 363 GSLNDEFLRSVHSAAKGEDAIALTSAAASQAKANFFRPSPGKRTSAADNIKTKLRIYFPT 422
Query: 437 VEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FAR---- 491
E V S G AAG S + + F + + + ++ G + H K +AR
Sbjct: 423 QETVTNSTAG-AAGTICLSRKWYENMTFPRSVFRDYVSTRPG---LLSHNKILYARGKQK 478
Query: 492 YNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCT 547
Q +AW + SAN+S++AWG L + ++ R++E GVL+ A+R S
Sbjct: 479 QGTQDVAWAYVGSANMSESAWGKLSYDRKAKVWKVNCRNWECGVLLPVPAERLR---SAA 535
Query: 548 SNIVPSEIKSG 558
SN E KSG
Sbjct: 536 SNNNTKEAKSG 546
>gi|358380063|gb|EHK17742.1| hypothetical protein TRIVIDRAFT_82987 [Trichoderma virens Gv29-8]
Length = 528
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 95/203 (46%), Gaps = 24/203 (11%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS++ D +W++ + + +L+ + + M+ N P+
Sbjct: 96 ITIEEVFQKDQLELAVLSSFQWDEEWMMSKLDI-RRTKILLLAFAKDEAQKNLMRGNVPS 154
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS---QGLWMQDFPL 296
N PP+ G HSK LL YP +R+++ T NL+ DW +++ D P
Sbjct: 155 NIKFCFPPM-HGPGAMHSKLQLLKYPDRLRVVIPTGNLVPYDWGETGVMENMVFLIDLPR 213
Query: 297 ---KDQNNLSEECGFENDLIDYL-STLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-V 351
+ GF +L+ +L ST + A+L ++FS + +
Sbjct: 214 LGNPATHPPQRPTGFYTELVYFLQSTGVGDKMVASL-------------SNYDFSKTSDI 260
Query: 352 RLIASVPGYHTGSSLKKWGHMKL 374
+ ++PG H+G++ K+ G+ L
Sbjct: 261 AFVHTIPGSHSGNAAKRTGYCGL 283
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 60/141 (42%), Gaps = 44/141 (31%)
Query: 479 RSRAMPHIKT-FARYNG------QKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSY 527
R R + H K F R G Q W + SANLS++AWG L K+ S ++ R++
Sbjct: 416 RDRLLIHSKMIFVRRVGDGQATRQPPGWAYVGSANLSESAWGRLSKDKSTEGIKMSCRNW 475
Query: 528 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 587
E GV+I VP E+ + KT S+D +
Sbjct: 476 ECGVII----------------PVP--------ESKTVDKTV-------ASADMAMFAGT 504
Query: 588 VYLPVPYELPPQRYSSEDVPW 608
V PVP ++P Y+S D+PW
Sbjct: 505 V--PVPMQVPGPVYTSNDLPW 523
>gi|50310989|ref|XP_455517.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49644653|emb|CAG98225.1| KLLA0F09625p [Kluyveromyces lactis]
Length = 497
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 97/405 (23%), Positives = 164/405 (40%), Gaps = 68/405 (16%)
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL- 296
AN +H+ +P +G HHSK + + G +R+ V + NL + N Q +W PL
Sbjct: 123 ANLKVHRVDIPSPWGCHHSKIIFSFHQNGTMRMHVPSFNLSREEMNLVQQTVWTS--PLL 180
Query: 297 --KDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLI 354
K + ++ FE++L++YL++ +S+ +G + +K + +
Sbjct: 181 YEKSETVPKKKSRFEDELLEYLNS-----YSSYTSLYG-LIASLKRYKWHVLDEQNCQFV 234
Query: 355 ASVPGYHTG-----SSLKKWGHMKLR------------TVLQECTFEKGFKKSPLVYQFS 397
S P Y+ G S L+ G MKL +Q + F+K + Q
Sbjct: 235 YSTP-YNGGLTQLKSCLRASG-MKLHGDEEDDDLSFVNLFIQVSSMGNPFRKKFDLLQDV 292
Query: 398 SLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVR-CSLEGYAAG----NA 452
+ L W + E TP + +VWPT +++ C +G +A
Sbjct: 293 MIPYLYTDWFEKDGYDKKLKSKEYTTPF-LAHSTLVWPTKTEIKECMTQGLSANWFFYKR 351
Query: 453 IPSPQKNVDKDFLKKYWAKWKASHTGRSRAM--PHIKTFARYNGQ----KLAWFLLTSAN 506
++ V K A+ + ++R M H K + ++ + + W LLTS N
Sbjct: 352 SEQTERKVVPCLRKHVPLPTNATQSDKNRHMVPSHTKYYIQFTDENTLKRPDWILLTSHN 411
Query: 507 LSKAAWG--ALQKNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQ 564
LS+AAWG L+K +YE G+L + R+ + S P G T S+
Sbjct: 412 LSQAAWGPSPLKKPT------NYECGILYTTTMGRNKVRLTLASAQQP----PGRTIGSR 461
Query: 565 IQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 609
+ + V T V + PY L QRYS+ D P++
Sbjct: 462 VPEDITVLPT-------------VKVVTPYPLKFQRYSATDEPYT 493
>gi|342320632|gb|EGU12571.1| Endoplasmic reticulum Ca-transporting P-type ATPase [Rhodotorula
glutinis ATCC 204091]
Length = 1978
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 90/388 (23%), Positives = 147/388 (37%), Gaps = 80/388 (20%)
Query: 253 GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN-LSEECG-FEN 310
G H+K ++ + RI++ TAN + DW+ ++ DFP + + ++EE F+N
Sbjct: 1630 GIMHTKLLIFYHEDFCRIVIPTANAVSYDWSQIDNAFYVHDFPRRRSASPVNEESNPFKN 1689
Query: 311 DLIDYLSTLKWPE-FSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKW 369
S + + +P H + S F+ SS V+L+ S G + K
Sbjct: 1690 PTHTQFSKKSFQVCYYLGIPKH---ILQESLHYDFS-SSTDVQLVHSNQGKFPAADYDKG 1745
Query: 370 GHMK-LRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAEL---------SSSMSSGFS 419
G + L + F G + SS+G W+ ++ S+ SG
Sbjct: 1746 GGIAGLAKAVSAFGFASG-GHWEIEVTGSSIGQYSSTWLTQMLAACSGIHPSTYFRSGKG 1804
Query: 420 ED------KTPLGIGEPL---IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWA 470
D KTP G L I++PT +++ S G G I P K + K+
Sbjct: 1805 NDVPSQLPKTPSGQPTRLPIKIIFPTQDEILSSPGGAGHGGTIFCPSKTWNSLTFPKHL- 1863
Query: 471 KWKASHTGRSRAMPHIKT------FARYNGQKL--AWFLLTSANLSKAAWGALQ--KNNS 520
+ + R H K FA+ + + L S N + +AWG LQ K+
Sbjct: 1864 -FHRGESKRKNIPAHTKIILGLHRFAKAPTPPVHEGFIYLGSHNFTPSAWGRLQNGKDGP 1922
Query: 521 QLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSD 580
QL +YELGV++ +++ S E + + T+LVT
Sbjct: 1923 QLFCNNYELGVVL--------------------TLRASSAEELEAKATELVT-------- 1954
Query: 581 AGASSEVVYLPVPYELPPQRYSSEDVPW 608
Y+ P +Y DVPW
Sbjct: 1955 -------------YKRPLVKYGPNDVPW 1969
>gi|293335739|ref|NP_001168462.1| hypothetical protein [Zea mays]
gi|223948435|gb|ACN28301.1| unknown [Zea mays]
gi|414587433|tpg|DAA38004.1| TPA: hypothetical protein ZEAMMB73_810727 [Zea mays]
Length = 989
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/278 (22%), Positives = 111/278 (39%), Gaps = 40/278 (14%)
Query: 117 VSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWA 176
V NDG SK R + G +EE + D STF L R+ +
Sbjct: 228 VVNDGDPELFNGSKGCRDDSSEKPGCGSGNEEQYHSEGCYSDG--STFFLNRLADTGSNT 285
Query: 177 NT---SCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES----- 226
T S V++ ++ ++ ++ + +DI W L C + +P + H +
Sbjct: 286 QTEPQSGVTLPQLLHPVNSLVRVFIATFTLDISWFLNYCKIPQHLPVTIACHNKERCWSA 345
Query: 227 ---DGTLEHMKRNKPANWILHKPPLPISFG---------THHSKAMLLIYPRGVRIIVHT 274
+ T + + + + P I+FG HH K ++L +R+IV +
Sbjct: 346 SSENRTAAPFESHPKLLLVFPRFPEDIAFGKDRKKQGVACHHPKLIVLQREDSMRVIVTS 405
Query: 275 ANLIHVDWNNKSQGLWMQDFPLKDQNNLS--------EECGFENDLIDYLSTLKWPEFSA 326
ANL+ W+ + +W QDFP + + + ++ F L+ +++++
Sbjct: 406 ANLVPRQWHLITNTVWWQDFPCRTSPDYAALFSAAKKQKSDFAAQLVSFIASM------V 459
Query: 327 NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
N + I K++F A LIASVPG H S
Sbjct: 460 NEVRSQAYWITE--VAKYDFEGAGGYLIASVPGIHAQS 495
>gi|326521102|dbj|BAJ96754.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 646
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 98/236 (41%), Gaps = 44/236 (18%)
Query: 162 STFRLLRVQGLPAWANT---SCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKI 216
STF L R+ G+ S V++ ++ G ++ ++ + DI W L C + +
Sbjct: 267 STFFLNRLTGIRPEMRAEQHSGVTLPQLLHPVGSLLRVFIATFTSDISWFLDYCKIPQYL 326
Query: 217 PHVLVIHGE-------SDGTLEHMKRNKPANWILHKPPLP--ISFG---------THHSK 258
P + H + S+ N P N +L P P I+FG HH K
Sbjct: 327 PVTIACHNKDRCWSANSESRTAAPFENHP-NILLVYPRFPEVIAFGKDRKNQGVACHHPK 385
Query: 259 AMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLST 318
++L +R+I+ +ANL+ W+ + +W QDFP C D S
Sbjct: 386 LIVLQREDSMRVIISSANLVPRQWHLITNTVWWQDFP----------CRTSPDYSALFSA 435
Query: 319 LKWP--EFSANLPAHGNFKIN--PS------FFKKFNFSSAAVRLIASVPGYHTGS 364
+ P +F+A L + IN PS +++F A L+ASVPG + S
Sbjct: 436 FEGPKSDFAAQLVSFIGSLINEVPSQAYWINEIARYDFEGAGGYLVASVPGLYMPS 491
>gi|154272585|ref|XP_001537145.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150409132|gb|EDN04588.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 478
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 89/196 (45%), Gaps = 32/196 (16%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGE--SDGTLEHMKRNK 237
+ +V+Q D+ +A+LS+YM ++DW+ + K L+I GE D E K
Sbjct: 286 IKFEEVVQKSDLELAVLSSYMWNVDWMFSKFDI--KTTRFLLIMGEKEEDKKRELENDTK 343
Query: 238 PANWI-LHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL----WMQ 292
+ L PP+ HSK MLL +P +RI+V +ANL+ DW + + ++
Sbjct: 344 SMGSVRLCFPPMEPQVNCMHSKLMLLFHPDYLRIVVPSANLVPFDWGEQGGVMENIVFLI 403
Query: 293 DFPLK--DQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKK---FNFS 347
D P K D +N + F ++L+ +L +N KK F+FS
Sbjct: 404 DLPRKSPDLDN-DPQTSFLDELVYFLQA---------------STVNEQIIKKMLRFDFS 447
Query: 348 SAA-VRLIASVPGYHT 362
+ + I ++ G HT
Sbjct: 448 ATKDIAFIHTIGGSHT 463
>gi|414587432|tpg|DAA38003.1| TPA: hypothetical protein ZEAMMB73_810727, partial [Zea mays]
Length = 816
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/278 (22%), Positives = 111/278 (39%), Gaps = 40/278 (14%)
Query: 117 VSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLRVQGLPAWA 176
V NDG SK R + G +EE + D STF L R+ +
Sbjct: 228 VVNDGDPELFNGSKGCRDDSSEKPGCGSGNEEQYHSEGCYSDG--STFFLNRLADTGSNT 285
Query: 177 NT---SCVSIRDVIQ--GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES----- 226
T S V++ ++ ++ ++ + +DI W L C + +P + H +
Sbjct: 286 QTEPQSGVTLPQLLHPVNSLVRVFIATFTLDISWFLNYCKIPQHLPVTIACHNKERCWSA 345
Query: 227 ---DGTLEHMKRNKPANWILHKPPLPISFG---------THHSKAMLLIYPRGVRIIVHT 274
+ T + + + + P I+FG HH K ++L +R+IV +
Sbjct: 346 SSENRTAAPFESHPKLLLVFPRFPEDIAFGKDRKKQGVACHHPKLIVLQREDSMRVIVTS 405
Query: 275 ANLIHVDWNNKSQGLWMQDFPLKDQNNLS--------EECGFENDLIDYLSTLKWPEFSA 326
ANL+ W+ + +W QDFP + + + ++ F L+ +++++
Sbjct: 406 ANLVPRQWHLITNTVWWQDFPCRTSPDYAALFSAAKKQKSDFAAQLVSFIASM------V 459
Query: 327 NLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTGS 364
N + I K++F A LIASVPG H S
Sbjct: 460 NEVRSQAYWITE--VAKYDFEGAGGYLIASVPGIHAQS 495
>gi|430811371|emb|CCJ31122.1| unnamed protein product [Pneumocystis jirovecii]
Length = 402
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 131/346 (37%), Gaps = 64/346 (18%)
Query: 187 IQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI-LH 244
I+ DI+ A+LS +++D W+L L+K V + H +SD K + N + L
Sbjct: 101 IENDILKAAVLSAFVIDPIWVLSKIQ-LSKTIVVFIHHAKSD------KEKQAINELYLC 153
Query: 245 KPPLPISF------GTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK---SQGLWMQDFP 295
P + F H K LL Y +R+++ +ANL+ DW +++ DFP
Sbjct: 154 FPNVSAIFPSMEGANCMHCKLQLLFYTTYLRVVIPSANLVDYDWGETGVMENSMYIHDFP 213
Query: 296 LKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIA 355
++ FE DL Y +P+ +FK+ S + +
Sbjct: 214 RRESAFTEFSTNFERDLFHYCKAKNYPDHILKKMQCYDFKM-----------SKNIHFVH 262
Query: 356 SVPGYHTGS-SLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSM 414
S+P S LK G++ L +Q+ + SSLG L +M + ++
Sbjct: 263 SIPARALNSVDLKDTGYLSLARAVQKLGKASKNDIEINIIVTSSLGLLKSAFMTNIYRAL 322
Query: 415 SSGFSEDKTPLGIGEPLIVW--------PTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLK 466
D++ L W P++ V S G + I F K
Sbjct: 323 KG----DQSIASYNMDLQSWKTSIKVHFPSINTVLSSNGGKESAGTIC---------FQK 369
Query: 467 KYWAKWKASHTGRSRAMPHIKTFARYNGQKLAWFLLTSANLSKAAW 512
++W + +S M H R +SANLS++AW
Sbjct: 370 QFWENLEFP---KSCLMHHKIILVRN----------SSANLSESAW 402
>gi|302653979|ref|XP_003018803.1| hypothetical protein TRV_07162 [Trichophyton verrucosum HKI 0517]
gi|291182481|gb|EFE38158.1| hypothetical protein TRV_07162 [Trichophyton verrucosum HKI 0517]
Length = 429
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 70/146 (47%), Gaps = 14/146 (9%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLL-----PACPVLAKIPHVLVIHGESDGTLEHMK 234
+ + +V+Q D+ +A+LS+++ D+DWLL P+ L ++ GE T +
Sbjct: 208 IKLEEVLQPSDLELAVLSSFLWDMDWLLMKFTNPSTRFL----FIMGAKGEERRTQLLRE 263
Query: 235 RNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLW 290
+ L PP+ HSK MLL + +RI++ +ANL DW K L+
Sbjct: 264 TASMSRIRLCFPPMDGEVNCMHSKLMLLFHANHLRIVIPSANLDPYDWGEKGGVMENMLF 323
Query: 291 MQDFPLKDQNNLSEECGFENDLIDYL 316
+ D P K + + F ++L+ +L
Sbjct: 324 LIDLPRKANETIDDTTPFRDELVYFL 349
>gi|357167454|ref|XP_003581171.1| PREDICTED: uncharacterized protein LOC100837648 [Brachypodium
distachyon]
Length = 987
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 86/202 (42%), Gaps = 35/202 (17%)
Query: 189 GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGES-------DGTLEHMKRNKPANW 241
G ++ ++ + DI W L C + +P + H + + + N P N
Sbjct: 302 GSLLRVFITTFTSDICWFLDYCNIPQHLPVTIACHNKERCWSASRESRMAAPFVNHP-NV 360
Query: 242 ILHKPPLP--ISFG---------THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLW 290
+L P P I+FG HH K ++L +R+I+ +ANL+ W+ + +W
Sbjct: 361 LLVYPQFPEVIAFGKDRKKQGVACHHPKLIVLQREDSMRVIITSANLVPRQWHLITNTVW 420
Query: 291 MQDFPLKDQNNLSE--------ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFK 342
QDFP + + S + F L+ ++ +L +P+ + IN
Sbjct: 421 WQDFPCRTSPDYSAIFSAVEEPKSDFAVQLVSFIGSLI-----NEVPSQA-YWINE--IA 472
Query: 343 KFNFSSAAVRLIASVPGYHTGS 364
K+NF A L+ASVPG + S
Sbjct: 473 KYNFEGAGGYLVASVPGLYMPS 494
>gi|159464062|ref|XP_001690261.1| predicted protein [Chlamydomonas reinhardtii]
gi|158284249|gb|EDP09999.1| predicted protein [Chlamydomonas reinhardtii]
Length = 424
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/31 (70%), Positives = 28/31 (90%)
Query: 267 GVRIIVHTANLIHVDWNNKSQGLWMQDFPLK 297
G+R+++HTAN I+ D NNKSQGLW+QDFPLK
Sbjct: 174 GLRLVIHTANAIYADCNNKSQGLWVQDFPLK 204
>gi|50292179|ref|XP_448522.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527834|emb|CAG61483.1| unnamed protein product [Candida glabrata]
Length = 553
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 141/335 (42%), Gaps = 65/335 (19%)
Query: 243 LHKPPLPISFGTHHSKAMLLIYP--RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN 300
++ PP + HHSK ++ IY RGVR+ + + N + N Q LW F + +
Sbjct: 182 IYMPP----YSCHHSKMIIGIYRNGRGVRVFLPSNNFTWAETNWPQQVLWSSPF-MSISD 236
Query: 301 NLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPG 359
E GF+ L DYLS K E ++ + + +FS A V I S P
Sbjct: 237 KAVEMNGFQRSLCDYLSFYKLKELNS---------LVKDTIMRTDFSGLADVEFIYSCPK 287
Query: 360 YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPL---VYQFSSLGSLDEK-------WMAE 409
G +++ +M L+++ + T + + L + Q S++G +
Sbjct: 288 -TKGKNIETGLNMFLKSIEKVETELRDVDQISLNLFLCQSSTIGGPIGRRKDNPSNLFTH 346
Query: 410 LSSSMSSGFSE----DKTPL------GIGEPLIVWPTVEDVRCSLEGY-AAG----NAIP 454
+ + GFSE D+ L P I++P ++++R + G +AG N
Sbjct: 347 VIVPTARGFSEAAKSDQQALLKAYHENKTYPCIIYPCMKEIRDASVGINSAGWFNFNYTR 406
Query: 455 SPQKNVDKDFLK---KYWAKWKASHTGRSRAMP--HIKTFARYN--GQKLA--------- 498
+ + D+L+ K + K+ +T + R H K + R+ Q +A
Sbjct: 407 NDTQLQQYDWLRNKIKVFYKYNRDYTTKQRLTTPSHTKFYLRFRMPSQSMAQGMRVPEHI 466
Query: 499 -WFLLTSANLSKAAWGALQKNNSQLMIRSYELGVL 532
W L TSANLS AWG L R+YE+GV+
Sbjct: 467 DWCLFTSANLSSNAWGTLGSQP-----RNYEVGVM 496
>gi|226294747|gb|EEH50167.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides brasiliensis Pb18]
Length = 589
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 56/113 (49%), Gaps = 6/113 (5%)
Query: 177 NTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR 235
N + I +VIQ D+ +A+LS+Y+ D DWL + K ++I GE + +
Sbjct: 221 NGDDIKIEEVIQKSDLELAVLSSYIWDADWLFSKFDI--KKSRFILIMGEKEEDKKRELE 278
Query: 236 NKPANW---ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
N + L PP+ HSK MLL + +RI++ +ANLI DW K
Sbjct: 279 NDTKSMGSVRLCFPPMEPQVNCMHSKLMLLFHLNHLRIVIPSANLIPFDWGEK 331
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 57/125 (45%), Gaps = 22/125 (17%)
Query: 495 QKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCGFSCTSNI 550
Q W + SANLS++AWG L + S +L R++E GV+I + G G
Sbjct: 468 QYAGWAYVGSANLSESAWGRLVLDRSTTKPKLNCRNWECGVVI--PIRHRGSG------Q 519
Query: 551 VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV-----YLPVPYELPPQRYSSED 605
+ S+ SGST + KL + S S++V +PVP +P + Y D
Sbjct: 520 LSSQPSSGST-----LRPKLEPESESASVTVSDGSKLVSVFEPRIPVPMRVPGEPYQPGD 574
Query: 606 VPWSW 610
PW +
Sbjct: 575 KPWYY 579
>gi|410081624|ref|XP_003958391.1| hypothetical protein KAFR_0G02220 [Kazachstania africana CBS 2517]
gi|372464979|emb|CCF59256.1| hypothetical protein KAFR_0G02220 [Kazachstania africana CBS 2517]
Length = 527
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 107/502 (21%), Positives = 205/502 (40%), Gaps = 90/502 (17%)
Query: 163 TFRLLRVQ----GLPAWANTSC--VSIRDVI-QGDIIVAILSNYMVDIDWLL----PACP 211
+F+L++ + LP +S +S++D+ ++ +L +Y ++D+LL P+
Sbjct: 78 SFKLIKSEYYDLNLPENIRSSSDFISLKDIFGNSNLESTVLFSYQFNLDFLLDQFHPSIK 137
Query: 212 VLAKIPHVLVIHGESDGTLEHMKRNKPANWI-LHKPPLPISFGTHHSKAMLLIY-PRGVR 269
+ + I+ S + I ++ PP + +HHSK +L Y + V+
Sbjct: 138 SITMVAQKGTINPVSPESFHLFPILDKCKIIDIYMPP----YTSHHSKMILNFYRDKSVK 193
Query: 270 IIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLP 329
I + + N H + N Q W P Q + F+ +L+ YL + + + +
Sbjct: 194 IFIPSNNFTHHETNLPQQICWCS--PSLYQGK-TGSVLFQENLLSYLKSYEDKTLNTTI- 249
Query: 330 AHGNFKINPSFFKKFNF---------SSAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQE 380
+ ++N K +F +S+ ++L+ + H K GH + Q
Sbjct: 250 YYELLQLNFESLKDVDFVYSCPSKENASSGLKLLVELLSKHDND---KSGHY----LCQT 302
Query: 381 CTFEKGFKKSPLVYQFSSLG--SLDEKWMAELSSSMSSGFSEDKTPLGIG---EPLIVWP 435
T KS F+ L +L + SS ++ +E +P I++P
Sbjct: 303 STIGGPLNKSQNSNIFTHLMIPALSNMFGMSNSSRLTIPTTEQVLQFNKNNNIKPYILYP 362
Query: 436 TVEDVR-CSLEGYAAG------NAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMP-HIK 487
TV++++ C + +G + IP + + + F ++ + S + + RA P H K
Sbjct: 363 TVKELQNCPMGWLPSGWFHFNYDRIPMYYETLKEKF-DIFYKQDAESISIQRRATPSHSK 421
Query: 488 TFARYNGQ---KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCGF 544
+ + + + +L W L TSANLS +AWG + R+YE+GVL + C
Sbjct: 422 FYMKSSTETFTELDWCLYTSANLSMSAWGKITTKP-----RNYEVGVLFTGKDRLIRC-- 474
Query: 545 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSE 604
T + L + + S+VV VP+ L Q+Y ++
Sbjct: 475 -----------------------TSFIDLIYKRT---DGQSDVV---VPFTLKLQKYEAD 505
Query: 605 DVPWSWDKRYTKKDVYGQVWPR 626
D + K Y D+ G+++ R
Sbjct: 506 DEAFCMSKDYGLLDINGRLYER 527
>gi|388851550|emb|CCF54740.1| uncharacterized protein [Ustilago hordei]
Length = 665
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 78/166 (46%), Gaps = 21/166 (12%)
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEEC----GFE 309
T H K ++L++ +R+ + + NL VDW+ G+++QDFPLK S G E
Sbjct: 285 TQHMKFLVLVHEGWLRVAIASGNLNEVDWSRIENGVFIQDFPLKGGEGSSARAEGRGGVE 344
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS--SAAVRLIASVPGYHTGSSLK 367
ND + L TL S P+H + + +F+FS A R++AS P SSL+
Sbjct: 345 NDFKEQL-TLVLKSLSVP-PSHPVW----TALDRFDFSLGGARARIVASWP---EASSLQ 395
Query: 368 KW------GHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM 407
W G +L V+++ + Q SSL + D KW+
Sbjct: 396 GWDRIETQGLGRLGKVVRDLDIPAVKGGMEVECQGSSLANHDLKWI 441
>gi|171686654|ref|XP_001908268.1| hypothetical protein [Podospora anserina S mat+]
gi|170943288|emb|CAP68941.1| unnamed protein product [Podospora anserina S mat+]
Length = 438
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 57/105 (54%), Gaps = 3/105 (2%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
V I +V+Q DI+ +A++S++ D DW+L + ++ L+ + +S+ E M+ N P
Sbjct: 254 VKIEEVLQKDILELAVISSFQWDEDWMLSKIDI-SRTKLYLIAYAKSEAQNE-MRNNVPK 311
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN 284
+ I P + G HSK MLL Y +R++V T N + DW
Sbjct: 312 SRIRFCFPAMQAVGAMHSKLMLLKYEGYLRVVVPTGNFMSYDWGE 356
>gi|367050628|ref|XP_003655693.1| hypothetical protein THITE_2130975 [Thielavia terrestris NRRL 8126]
gi|347002957|gb|AEO69357.1| hypothetical protein THITE_2130975 [Thielavia terrestris NRRL 8126]
Length = 657
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 2/105 (1%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V+Q + +A+LS+Y D+ WLL LA+ +L+ + E M+ P
Sbjct: 240 IKIEEVLQKQQLELAVLSSYQWDVRWLLSKVD-LARTKLILIAFAADEAHKEEMRNAVPR 298
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN 284
I P G+ HSK LL Y + +RI+V T NL+ DW
Sbjct: 299 ERIRFCFPPMQPVGSMHSKLQLLKYEKYMRIVVPTGNLMSFDWGE 343
>gi|374105912|gb|AEY94823.1| FAAR169Cp [Ashbya gossypii FDAG1]
Length = 540
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 96/409 (23%), Positives = 151/409 (36%), Gaps = 82/409 (20%)
Query: 185 DVIQGDIIV--AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
+V+ GD + L ++ +++WLL P HV V+ GT++ + A
Sbjct: 91 EVVLGDTDLERVYLFSFQYEMNWLLDLIP-----EHVQVVVTAQKGTVQEADGGRAARVR 145
Query: 243 LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
+P F +HHSK ++ Y + R+++ +AN ++ + Q +WM +
Sbjct: 146 YRMVWMP-PFSSHHSKMVIAFYQDQRCRVVLPSANFTALETSLPQQMMWMTPQLAHSRAA 204
Query: 302 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLIASVP 358
+ F + L DYL +PE L +K +F+ + + S P
Sbjct: 205 EQQPSRFRSGLQDYLQM--YPEPDRELLQR---------LRKIDFAPVDATGAAFVYSAP 253
Query: 359 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG------------SLDEKW 406
G T + K G +L L E G + S Q SS+G +L
Sbjct: 254 GARTRA---KTGLAQLAAQLDEGPAAGG-RHSHYFCQSSSIGGPLNSRSAENPRNLFVHL 309
Query: 407 MAELSSSMSSGFSED-KTPLGIGE-----------PLIVWPTVEDVRCSLEGYAAG---- 450
M L S + G + K LG E P I++PTVED G+ A
Sbjct: 310 MVPLLSGHTQGLPKSVKDCLGEKEAYALLQRERLHPYILYPTVEDFNECFTGWLASGWFH 369
Query: 451 ----------NAIPSPQKN----VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK 496
N S + N +++ + + R R H K + ++
Sbjct: 370 FHHSRTAATRNHYSSLRDNGCFVKQREYELRPGGRTALPIIRRDRVPCHTKFYIKFASAS 429
Query: 497 ---------LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS 536
WFL TSANLS AWGA ++YE GVL S
Sbjct: 430 ATSWNSLTDCEWFLFTSANLSTHAWGA----PPSYQPKNYECGVLYTKS 474
>gi|387220095|gb|AFJ69756.1| tyrosyl-dna phosphodiesterase 1, partial [Nannochloropsis gaditana
CCMP526]
Length = 103
Score = 55.8 bits (133), Expect = 7e-05, Method: Composition-based stats.
Identities = 31/84 (36%), Positives = 42/84 (50%), Gaps = 22/84 (26%)
Query: 464 FLKKYWAKWKASHTGRSRAMPHIKTFARY-------------NGQ---------KLAWFL 501
+LK+ A+W+ GR RAMPH+K+F R+ NG+ +LAW L
Sbjct: 20 YLKERLARWEGGRWGRQRAMPHLKSFLRFSVIREGAGAAPGENGRGQGACKETTRLAWVL 79
Query: 502 LTSANLSKAAWGALQKNNSQLMIR 525
+TS N SK AWG LQ I+
Sbjct: 80 ITSHNYSKPAWGELQSKGEVFKIQ 103
>gi|45184994|ref|NP_982712.1| AAR169Cp [Ashbya gossypii ATCC 10895]
gi|44980615|gb|AAS50536.1| AAR169Cp [Ashbya gossypii ATCC 10895]
Length = 540
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 96/409 (23%), Positives = 151/409 (36%), Gaps = 82/409 (20%)
Query: 185 DVIQGDIIV--AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWI 242
+V+ GD + L ++ +++WLL P HV V+ GT++ + A
Sbjct: 91 EVVLGDTDLERVYLFSFQYEMNWLLDLIP-----EHVQVVVTAQKGTVQEADGGRAARVR 145
Query: 243 LHKPPLPISFGTHHSKAMLLIYP-RGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNN 301
+P F +HHSK ++ Y + R+++ +AN ++ + Q +WM +
Sbjct: 146 YRMVWMP-PFSSHHSKMVIAFYQDQRCRVVLPSANFTALETSLPQQMMWMTPQLAHSRAA 204
Query: 302 LSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS---SAAVRLIASVP 358
+ F + L DYL +PE L +K +F+ + + S P
Sbjct: 205 EQQPSRFRSGLQDYLQM--YPEPDRELLQR---------LRKIDFAPVDATGAAFVYSAP 253
Query: 359 GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLG------------SLDEKW 406
G T + K G +L L E G + S Q SS+G +L
Sbjct: 254 GARTRA---KTGLAQLAAQLDEGPAAGG-RHSHYFCQSSSIGGPLNSRSAENPRNLFVHL 309
Query: 407 MAELSSSMSSGFSED-KTPLGIGE-----------PLIVWPTVEDVRCSLEGYAAG---- 450
M L S + G + K LG E P I++PTVED G+ A
Sbjct: 310 MVPLLSGHTQGLPKSVKDCLGEKEAYALLQRERLHPYILYPTVEDFNECFTGWLASGWFH 369
Query: 451 ----------NAIPSPQKN----VDKDFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQK 496
N S + N +++ + + R R H K + ++
Sbjct: 370 FHHSRTAATRNHYSSLRDNGCFVKQREYELRPGGRTALPIIRRDRVPCHTKFYIKFASAS 429
Query: 497 ---------LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPS 536
WFL TSANLS AWGA ++YE GVL S
Sbjct: 430 ATSWNSLTDCEWFLFTSANLSTHAWGA----PPSYQPKNYECGVLYTKS 474
>gi|296415071|ref|XP_002837215.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633076|emb|CAZ81406.1| unnamed protein product [Tuber melanosporum]
Length = 603
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 105/243 (43%), Gaps = 28/243 (11%)
Query: 181 VSIRDVIQGD-IIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHG--ESDGTLEHMKRNK 237
++ +V+Q + + VA+LS + DIDW+L P+ V+V+H E D + + +
Sbjct: 236 ITFEEVLQKESLCVAVLSAFQWDIDWVLKKLPLDTIQRLVMVMHAKEEQDRSYKVQQLGS 295
Query: 238 PANWILHKPPLPISFGTHHSKAMLLIYPRG----VRIIVHTANLIHVDWNN----KSQGL 289
L PP+ HSK MLL + G +R+ V +ANL DW +
Sbjct: 296 LPRTTLVLPPMQGQVSCMHSKLMLLFHMNGDQRWLRVAVPSANLTDYDWGELGGVMENTV 355
Query: 290 WMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSA 349
++ D P + N + F +L + + PE N G ++ + S K F
Sbjct: 356 FIIDLPRLPKPN-HNQTHFAKELHHFCAAKGMPEDVLN----GLYRYDFSRTKDMAF--- 407
Query: 350 AVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQF--SSLGSLDEKWM 407
+ S+ G + G ++ G+ L T ++ G L + F SSLG+ + ++
Sbjct: 408 ----VHSIGGSNAGKDWRRTGYSGLGTAVKALGLSSG---PGLEFDFVTSSLGAANMGFI 460
Query: 408 AEL 410
+ +
Sbjct: 461 SNM 463
>gi|342319803|gb|EGU11749.1| Proteophosphoglycan 5 [Rhodotorula glutinis ATCC 204091]
Length = 564
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/319 (22%), Positives = 129/319 (40%), Gaps = 41/319 (12%)
Query: 245 KPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFP-LKDQNNLS 303
+P P + G HSK LL YP + +++ + N + +D + ++ P +
Sbjct: 211 RPLYPWASGCAHSKFFLLFYPGFLLLVITSCNTMRIDMDLSDNHWYIHALPEIPPGKKRK 270
Query: 304 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA--VRLIASVPGYH 361
+ FE+DL+ ++ L WPE ++ K++F SA V L+ASVPG
Sbjct: 271 AKTTFEHDLLAHMLDLDWPE-----------ELVSRVRGKYDFRSAEGRVHLVASVPGTK 319
Query: 362 TGSSLK-KWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSE 420
+ + +G ++L + ++ + + S+ SL +W+ + +
Sbjct: 320 RATDDEGSYGMLRLNALARQIIPPSVRPDIDMEFCAGSVNSLPPEWIDQTDKLLRGRDLS 379
Query: 421 DKTPL---GIGEP----------LIVWPTVEDV-RCSLEGYAAGNAIPSPQKNVD----K 462
P+ G+ EP IV+PT V CS + A + I N
Sbjct: 380 RAVPVTKPGVPEPPVSLNNLPEWSIVFPTKATVAACSPQVIEAASNIGCCLNNAKWPETS 439
Query: 463 DFLKKYWAKWKASHTGRSRAMPHIKTFARYNGQKLA---WFLLTSANLSKAAWGALQKNN 519
+ ++ + + + GR M + N A L S NLSKAA G + +
Sbjct: 440 NEVRSMFFDYGSKDPGRLFHMKFYQWKDSRNKDPSAPPLMVYLGSHNLSKAALGEVSRLK 499
Query: 520 S-----QLMIRSYELGVLI 533
S ++ ++ELGV+I
Sbjct: 500 SGAGDVRIKCNNFELGVVI 518
>gi|295668965|ref|XP_002795031.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226285724|gb|EEH41290.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 668
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +VIQ D+ +A+LS+Y+ D DWL + K ++I GE + + N
Sbjct: 231 IKIEEVIQKSDLELAVLSSYIWDADWLFSKFDI--KKSRFILIMGEKEEDKKRELENDTK 288
Query: 240 NW---ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
+ L PP+ HSK MLL + +RI++ +ANLI DW K
Sbjct: 289 SMGSVRLCFPPMEPQVNCMHSKLMLLFHLNYLRIVIPSANLIPFDWGEK 337
>gi|440802395|gb|ELR23324.1| hypothetical protein ACA1_069080 [Acanthamoeba castellanii str.
Neff]
Length = 675
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 8/95 (8%)
Query: 33 VIGR--TNIPVSDKRLSRKHITLTASADGSASLVVDGTNPVVVK----SGDQRKKLSSNE 86
V+GR +P SDKR SRK L +GS SLV G NP +K G + L NE
Sbjct: 2 VLGRGLCGVPSSDKRCSRKQAELMLGRNGSLSLVPRGVNPAYLKRAADKGGEAVMLQRNE 61
Query: 87 HVSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDG 121
S+ DGD+ L+ + + + L SQ+R + +
Sbjct: 62 KYSLEDGDVFTLV--ANCYPFTVLRCSQERPTKEA 94
>gi|219116995|ref|XP_002179292.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409183|gb|EEC49115.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 708
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 100/437 (22%), Positives = 164/437 (37%), Gaps = 122/437 (27%)
Query: 253 GTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPL---KDQNNLSEECGF 308
G HH K M+L+ G V ++V T+NL + S W+Q FP + L EE
Sbjct: 263 GVHHPKFMILLERSGDVVVVVSTSNLTEPRATDAS---WLQRFPAARSSRERKLKEE--- 316
Query: 309 ENDLIDYLSTLKWPEFSANLPAHGNFKINPSFF--------------KKFNFSSAAVRLI 354
E+D L+ + + + H + P F K F+FS A V L+
Sbjct: 317 EDDFGIVLTNMLEAQTLSCRKGH----VTPMGFCRQELGWNSLRDLTKHFDFSKAQVHLV 372
Query: 355 ASVPG---YHTGSSLKKWGHMKLRTVLQECTFEKGFKKSP--------LVYQFSSLGSLD 403
A++PG T S + +G ++ V++ + + P L+ Q +SLGS
Sbjct: 373 ATIPGDRLSKTASPSELFGRQRVSAVMKRLSQGPTPRLPPILRSEDDRLIVQPTSLGS-- 430
Query: 404 EKW----MAELSSSMSSGFSEDKTPLGIGEPL----IVWPTVEDVRCSLEGYAA------ 449
+W M E+ S D + + + I+WPT ++ G+A
Sbjct: 431 -EWTRANMTEVVRSYLGHEDRDVSKVRDAQVFPRLDILWPTERFMKAYRTGFAGRGSPAS 489
Query: 450 ----GNAIPSPQ------------------KNVDKDFLKKYWAKWKASHTGRSRAMPHIK 487
G+A + + +D L + + RS PHIK
Sbjct: 490 VVCIGDAFDTKELVLFKENEGYLFLSSDTFSKIDLSCLSRMAQYEVSVPLQRSCLPPHIK 549
Query: 488 TFAR-YNG---------------QKLAWFLLTSANLSKAAWG-ALQKNNSQLMIRSY--- 527
+ R + G + ++FLLTSA LS+ A G L + S+ + SY
Sbjct: 550 SICRLFQGNDYRLRQDYGLPKSEEIFSYFLLTSACLSRGAQGETLTQLGSRETVVSYANF 609
Query: 528 ELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEV 587
ELGVL +++ G P++ + + +
Sbjct: 610 ELGVLF--TSRLQGRASDRVYGWKPAQCMCRNRPRTSL---------------------- 645
Query: 588 VYLPVPYELPPQRYSSE 604
++LPVP+ L P RY S+
Sbjct: 646 IHLPVPFSLRPARYQSD 662
>gi|440473340|gb|ELQ42143.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae Y34]
gi|440489437|gb|ELQ69093.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Magnaporthe oryzae P131]
Length = 614
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 105/485 (21%), Positives = 190/485 (39%), Gaps = 108/485 (22%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDII-VAILSNYMVDIDWLLPAC-PVLAKIPHVLVIHGES 226
+QG P ++ ++I +V+Q D + +A+LS++ D +WL P K + E+
Sbjct: 168 LQGQPR--SSQDITIEEVLQKDQLELAVLSSFAWDPEWLWTKVDPTKTKTTLIAFAGNEA 225
Query: 227 DGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS 286
D LL +P +RI+V + NL+ DW ++
Sbjct: 226 D---------------------------------LLKFPGYLRIVVPSGNLVPYDWGEQN 252
Query: 287 QGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG-NFKINPSFFKKFN 345
G+ + D L E++ + E S L A G N +I S +K++
Sbjct: 253 -GIMENSVFIIDLPPLKAGVKLEDNTLTSFGE----ELSYFLTAQGLNERIINSL-RKYD 306
Query: 346 FS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTF------EKGFKKSPLVYQF-- 396
FS ++ + ++ G HTG ++ G+ L +Q E F S Y F
Sbjct: 307 FSQTSRYAFVHTIAGVHTGDKWRRTGYCGLGRAIQNLGLATDEPVEIDFVVSGPNYPFLP 366
Query: 397 -------SSLGSLDEKWMAELSSSMS--SGFSE-----DKTPLGIGEPL----------- 431
SS+G+L ++ L ++ SG + KT +
Sbjct: 367 NYLRQAASSMGALKYGYLLALYNAFQGDSGLKDYQSRASKTKTSKEDAASAQQAKLRDFF 426
Query: 432 -IVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRS---------R 481
I +P++ V S G + + L+ W W+A+ R+
Sbjct: 427 RIYFPSLATVEASRGGTRSAGTL----------CLRSGW--WEAATFPRALFRDYENPRG 474
Query: 482 AMPHIK-TFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 540
A+ H K FAR AW + SAN+S++AW + Q ++ R++E GV I+P +
Sbjct: 475 ALVHSKIVFARPPDASAAWAYVGSANVSESAWASSQP---KMSCRNWECGV-IVPVGEPA 530
Query: 541 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELP 597
G + ++ I P + +G + + + + S E ++ +P+P +LP
Sbjct: 531 SPGRTLSTGIDPGDASAGKGGSLHGHQARNSPQEQNAPVGRSRSIEELFSECVPLPMQLP 590
Query: 598 PQRYS 602
+ Y+
Sbjct: 591 GRSYA 595
>gi|344232732|gb|EGV64605.1| phospholipase D/nuclease [Candida tenuis ATCC 10573]
Length = 171
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 64/155 (41%), Gaps = 43/155 (27%)
Query: 465 LKKYWAKWKASH--TGRSRAMPHIKTFARYNG---QKLAWFLLTSANLSKAAWGALQ--- 516
+K Y KW H TGR R H+K + NG + L W + S NLSK AWG
Sbjct: 32 IKPYLCKWNNGHEYTGRERNPAHVKLYMCDNGDDFKSLKWLYMGSHNLSKQAWGGGSGFG 91
Query: 517 --KNNSQLMIRSYELGVLILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLT 574
+N ++ + SYELG+LI P + TL
Sbjct: 92 SWQNINEYQVSSYELGILITPENDKD-------------------------------TLK 120
Query: 575 WHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPWS 609
SD SSE + +P LPP RYS D+PWS
Sbjct: 121 PVFCSDF--SSEKYPVRMPLYLPPTRYSPTDMPWS 153
>gi|307211792|gb|EFN87773.1| Probable tyrosyl-DNA phosphodiesterase [Harpegnathos saltator]
Length = 95
Score = 53.9 bits (128), Expect = 2e-04, Method: Composition-based stats.
Identities = 40/127 (31%), Positives = 56/127 (44%), Gaps = 39/127 (30%)
Query: 483 MPHIKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRH 540
MPHIK++ R + +++AWF+LTSANLSK+AWG I +YE+GV LP
Sbjct: 1 MPHIKSYTRISPDLKRIAWFVLTSANLSKSAWGV---QRGDYYITNYEVGVAFLPKF--- 54
Query: 541 GCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQR 600
I T+ +T D + + P+PY+LP
Sbjct: 55 ------------------------ITGTRTFPIT-----DEDLTGPI--FPIPYDLPLCP 83
Query: 601 YSSEDVP 607
Y S D P
Sbjct: 84 YDSSDSP 90
>gi|388580252|gb|EIM20568.1| phospholipase D/nuclease [Wallemia sebi CBS 633.66]
Length = 417
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 74/140 (52%), Gaps = 8/140 (5%)
Query: 250 ISFGTHHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQN----NLSE 304
+ GT+H+K L+ G +R++V TAN I +DW ++MQDFPLK Q + +
Sbjct: 5 FAHGTYHAKFALIFTTDGWLRVVVTTANFIPIDWMWNENTVFMQDFPLKGQTLGGESSEQ 64
Query: 305 ECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHT 362
+ F++D +L LK + + P+ K++FS + RLI+S+ ++
Sbjct: 65 KSAFQSDWTWFLYKLKLNKSLKLVADQMPDTPLPNVDAVNKWDFSRSKARLISSISETYS 124
Query: 363 G-SSLKKWGHMKLRTVLQEC 381
G +++K GH +L ++++
Sbjct: 125 GLENIRKVGHFRLADLVRQA 144
>gi|396484884|ref|XP_003842038.1| hypothetical protein LEMA_P077980.1 [Leptosphaeria maculans JN3]
gi|312218614|emb|CBX98559.1| hypothetical protein LEMA_P077980.1 [Leptosphaeria maculans JN3]
Length = 588
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 60/114 (52%), Gaps = 6/114 (5%)
Query: 174 AWANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGT--- 229
A+ T+ +SI +++Q I +A++S++M D DWL + K+ + V++ +
Sbjct: 332 AYPRTNDISIDELLQTPSIHMAVISSFMWDADWLHKKLDPI-KVKQIWVMNAKGKDVQKR 390
Query: 230 -LEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDW 282
L+ MK N LH PP+ + HSK +LL + +R V TAN+ +DW
Sbjct: 391 WLQEMKDTGVPNLTLHFPPMHGMIQSMHSKFLLLFGKKKLRFAVPTANMTCIDW 444
>gi|323449457|gb|EGB05345.1| hypothetical protein AURANDRAFT_72265 [Aureococcus anophagefferens]
Length = 1631
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 86/207 (41%), Gaps = 37/207 (17%)
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEK-WMAE 409
V I SVPG+ G+ +GH +R L +G + + SSLG LD K ++
Sbjct: 851 VHFIGSVPGFRRGAFADAFGHRAIRRALA----REGLTVARAEFANSSLGRLDNKVFLRG 906
Query: 410 LSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC----SLEGYAAGNAIPSPQKNVDKDFL 465
++S+ D+ IVWP+ + C L +A + Q N D +
Sbjct: 907 FATSLFGAGDLDRLK-------IVWPS-QATACRSSRKLMLHAMTEDKGTAQMNGPDDRI 958
Query: 466 KKYWAKWKASHTGRSR-----------AMPHIKTFARYNG-QKLAWFLLTSANLSKAAWG 513
W A+ R+R + H K A ++G +L + S N S AAWG
Sbjct: 959 ------WNAAGFPRARFHHYHAPSDRQTLHHTKMLACFDGDDRLVAVVGGSHNCSGAAWG 1012
Query: 514 ALQKNNSQLMIRSYELGVLILPSAKRH 540
+ N S +M SYE GVL+ A R
Sbjct: 1013 VGEDNMSVIM--SYEAGVLVACGAGRR 1037
>gi|296810424|ref|XP_002845550.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma otae CBS 113480]
gi|238842938|gb|EEQ32600.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Arthroderma otae CBS 113480]
Length = 672
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 66/146 (45%), Gaps = 12/146 (8%)
Query: 181 VSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
+ I +V Q D+ +A+LS+++ D+DWLL L I G + + A
Sbjct: 309 IKIEEVFQPSDLELAVLSSFLWDMDWLL--LKFTNPKTRFLFIMGAKGEEKQKQLLEETA 366
Query: 240 NW---ILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK----SQGLWMQ 292
+ L PP+ HSK MLL +P +RI+ TANL DW K L++
Sbjct: 367 SMPRIRLCFPPMEGEVNCMHSKLMLLFHPGYLRIVTPTANLDPYDWGEKGGEMENMLFLI 426
Query: 293 DFPLKDQ--NNLSEECGFENDLIDYL 316
D P K + + F ++L+ +L
Sbjct: 427 DLPRKSDGGTGIDDATPFRDELVYFL 452
>gi|347836693|emb|CCD51265.1| hypothetical protein [Botryotinia fuckeliana]
Length = 638
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 86/388 (22%), Positives = 154/388 (39%), Gaps = 87/388 (22%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIV-AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
QG P + + I +V+Q + AIL + +D DW+ K+ + V+ +++
Sbjct: 279 AQGFPREDD---IKIEEVLQSSTLEHAILGAFQIDSDWIRSKIQPSTKV--IWVLQAKTE 333
Query: 228 GTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ 287
+ K P + PP+ + HSK +L +P +R+++ +ANL DW +S
Sbjct: 334 AEKMNFKSLAPETYRFCFPPMEGNVNIMHSKLQILAHPTHLRLVIPSANLTPYDW-GESG 392
Query: 288 GL-----WMQDFP-LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPS 339
G+ ++ D P L + S++ F DL+ +L +
Sbjct: 393 GILENVVFLIDLPRLPNGEKASDDQLTPFAQDLLHFLHAM-------------------- 432
Query: 340 FFKKFNFSSAAVRLIASVP--GYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF 396
+ R I S+ G H G++L++ G+ L + C G PL ++
Sbjct: 433 --------TLTPRTIESLKRGGSHFGTNLQRTGYPGLGS----CVRSLGLNTDHPLEIEY 480
Query: 397 --SSLGSLDE-------------------KWMAE------LSSSMSSGFSEDKTPLGIGE 429
+S+G+LD+ KW E + + M + SE+ IG
Sbjct: 481 VTASIGNLDDRFLRTMYLASQGDNGSKEYKWRTEKPARSKMETVMETQLSEE-----IGR 535
Query: 430 PLIVW-PTVEDVRCSLEGYAAGNAIPSPQKNVDKD-FLKKYWAKWKASHTG--RSRAMPH 485
V+ P+ + V+ S G A I K + F ++ ++ G M
Sbjct: 536 RFRVYFPSEQTVKESKGGTNAAGTICFRSKWYNASAFPRELMRDCQSRREGLLMHNKMLF 595
Query: 486 IKTFARYNGQK-LAWFLLTSANLSKAAW 512
++T K +AW + SANLS++AW
Sbjct: 596 VRTRRTQKSPKPVAWVYVGSANLSESAW 623
>gi|70984252|ref|XP_747643.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66845270|gb|EAL85605.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 277
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 85/183 (46%), Gaps = 29/183 (15%)
Query: 239 ANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDF 294
+N L PP+ HSK MLL +P +RI+ TANL DW ++ D
Sbjct: 2 SNLRLCFPPMEGQVNCMHSKLMLLFHPGYLRIVAPTANLTPYDWGEMGGVMENSAFLIDL 61
Query: 295 PLK-DQNNLSEECGFENDLIDYL--STLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAA 350
P K ++ + FE +L+ +L STL+ S +F+FS ++
Sbjct: 62 PRKVATTSVGSKTVFEEELVYFLRASTLQENIISR--------------LDEFDFSPTSH 107
Query: 351 VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS-PLVYQF--SSLGSLDEKWM 407
+ L+ ++ G HTG++ ++ G+ L + G + S P+ F SS+GSL ++++
Sbjct: 108 IMLVHTIGGSHTGNTWRRTGYCGLGRAVNAL----GLRTSKPINIDFVASSVGSLTDEFL 163
Query: 408 AEL 410
+
Sbjct: 164 RSI 166
>gi|398406723|ref|XP_003854827.1| hypothetical protein MYCGRDRAFT_35953, partial [Zymoseptoria
tritici IPO323]
gi|339474711|gb|EGP89803.1| hypothetical protein MYCGRDRAFT_35953 [Zymoseptoria tritici IPO323]
Length = 266
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 101/253 (39%), Gaps = 45/253 (17%)
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQ---GLWMQDFPLKDQNNLSEEC---GFE 309
HSK MLL +P +RI + TANL++ DW Q ++M D P +SE F
Sbjct: 20 HSKLMLLFHPDKLRIAIPTANLLNFDWGETGQMENSVFMVDLPRLADGKISEAGDLPAFG 79
Query: 310 NDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKK 368
+LI +L + + KF+FS+ + + +V G H ++
Sbjct: 80 QELIYFLEQQGLDD-----------DVRTGVL-KFDFSATKDMAFVHTVGGMHFRDEAER 127
Query: 369 WGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSS--------------- 413
G M L +++ + L + SS+G L++ ++ + S+
Sbjct: 128 TGLMGLSKAVKQLNLAT--QDLELDFAASSIGRLNDNYLRDFHSAAKGISLIAQAAEAKS 185
Query: 414 -MSSGFSEDKTPLGIGEP-------LIVWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFL 465
+S F + K + +P I +PT VR S G AAG + F
Sbjct: 186 KAASTFFDRKKASTVAKPDNVREKVRIYFPTASTVRVSTAG-AAGTLCIARNYFEGSTFP 244
Query: 466 KKYWAKWKASHTG 478
+ + +K++ TG
Sbjct: 245 RACFRDYKSTRTG 257
>gi|343426865|emb|CBQ70393.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 654
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 93/418 (22%), Positives = 153/418 (36%), Gaps = 109/418 (26%)
Query: 254 THHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFENDLI 313
T H K ++L++ +R+ + + NL +DW ++QDFPL G
Sbjct: 274 TQHMKFLILVHEGFLRVAILSGNLNQIDWERIENTAFIQDFPLLSSATKPNVAGPSQSTN 333
Query: 314 DYLSTLKWPEFSANLPA-HGNFKINPSFFKKFNFSSA-AVRLIASVPGYHTGSSLKKWGH 371
D+ L S +LPA H + + F+FS+A R++AS P SSL W
Sbjct: 334 DFKLQLIRSLRSLSLPASHAIY----AALDTFDFSAATCARIVASWP---EPSSLADWER 386
Query: 372 MKLRTV--LQECTFEKGFKKSPLV---YQFSSLGSLDEKWMAELSSSMSSGFSEDKTPL- 425
++ + + L + E G + S V Q SSL + D KW+ + K PL
Sbjct: 387 IETQGLGRLGKVVRELGIRPSQSVEVECQGSSLANHDVKWVEHFHMLAAGVEPRGKLPLK 446
Query: 426 -----------------GIGEPLIVWP--------TVEDVRCSL------EGYAAGNAIP 454
G+ + +P TVE +L E +AA + P
Sbjct: 447 GKANEAHAEYARLMGQDGLPPVKVCFPSHRYVEERTVEGPLGALSFFGKAETFAASSIKP 506
Query: 455 ---SPQKN----------------VDKDFLKKYWAKWKASHTGRSRAMP---HIKTFARY 492
+PQ + + + ++ + A P H + AR
Sbjct: 507 LYHTPQSRRGDIMIHAKSILALTAAGTALVNQAFTAASDAYISNTAARPVPSHAWSGARP 566
Query: 493 NGQKLAWFLLTSANLSKAAWGALQKNNSQ--LMIRSYELGVLILPSAKRHGCGFSCTSNI 550
Q + W L S+N ++AA G + + S+ + ++ELGV +LP +
Sbjct: 567 AEQPIGWTYLGSSNFTRAAHGTISGSASKPTMSCMNWELGV-VLP--------------V 611
Query: 551 VPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVYLPVPYELPPQRYSSEDVPW 608
SE+++ E ++ V Y P QRY+ D PW
Sbjct: 612 YASEVEACGVEAEGLRA------------------------VVYHRPVQRYAVGDAPW 645
>gi|156389579|ref|XP_001635068.1| predicted protein [Nematostella vectensis]
gi|156222158|gb|EDO43005.1| predicted protein [Nematostella vectensis]
Length = 597
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 63/118 (53%), Gaps = 7/118 (5%)
Query: 27 LSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDGTNPVVVK-SGDQR-KKLS 83
L++G IGR + V+DKR+SR H TL + +G +L TNP K SG ++ L
Sbjct: 18 LAEGKTSIGRGPLLSVADKRVSRSHATLDIN-NGKLTLSATHTNPTFFKLSGREKFSALR 76
Query: 84 SNEHVSIADGDIIELIPGHHFFKYVTLS-RSQKRVSNDGATNGE--LSSKKMRQQDEQ 138
+E + GD+I L+P H F+ ++++ + N+GA E L+ + Q+E+
Sbjct: 77 KDESQELKTGDLISLLPDQHVFEIISINPNTHSTAVNNGALTDEKTLAGSTEKSQEEK 134
>gi|255945889|ref|XP_002563712.1| Pc20g12270 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588447|emb|CAP86556.1| Pc20g12270 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 658
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 93/410 (22%), Positives = 165/410 (40%), Gaps = 70/410 (17%)
Query: 169 VQGLPAWANTSCVSIRDVIQ-GDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
V G P N ++I +VIQ D+ + + S+++ D+ WL + +L I +D
Sbjct: 217 VTGFPRSGNE--ITIEEVIQRDDLELGVFSSFLWDMSWLY--SKFNSSSTRILFIMQAND 272
Query: 228 GTLEHMKRNKPAN---WILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNN 284
+ R +N + L PP+ HSK +L+ +P +RI V +ANL DW
Sbjct: 273 EETQKQYRQDVSNMRNFRLCFPPMEPQVFCMHSKLLLMFHPGYLRIAVPSANLTPTDWG- 331
Query: 285 KSQGLWMQDFPLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNF-------KIN 337
++ L E F LID L L+ PE + P + +++
Sbjct: 332 --------------EDRLMENTVF---LID-LPRLEVPE-AGKTPFYEELVYFLQASELH 372
Query: 338 PSFFKK---FNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLV 393
+ KK F+F+ + + +V G +T ++ G L ++ E + +
Sbjct: 373 RNIIKKLDNFDFTETKRYAFVHTVGGSNTDGKWQRTGFSGLGRAIKSLGLETNAPVN-VD 431
Query: 394 YQFSSLGSLDEKWM-----------AELSSSMSSGFSEDKTPLGI----GEPL----IVW 434
Y SSLGS++ ++ A L + + + P + E L I +
Sbjct: 432 YVASSLGSINTPFLRSIYLACKGDNALLDYELRTANRRREPPAEVLAYNQECLDHFRIYF 491
Query: 435 PTVEDVRCSLEGY--AAGNAIPSPQ----KNVDKDFLKKYWAKWKA-SHTGRSRAMPHIK 487
P+ E R A G +P N +D L+ ++ H + P
Sbjct: 492 PSDETARAVHPNAKDAIGTICFNPAWWSGANFPRDTLRDCVSERGVLMHNKLAFVHPSTP 551
Query: 488 TFARYNGQKLAWFLLTSANLSKAAWGALQKN----NSQLMIRSYELGVLI 533
N + W + SANLS++AWG + K+ + ++ R++E GV++
Sbjct: 552 IEMPDNKECHGWAYVGSANLSESAWGRIVKDPKTKSLKMNCRNWECGVIV 601
>gi|85109758|ref|XP_963073.1| hypothetical protein NCU06222 [Neurospora crassa OR74A]
gi|28924724|gb|EAA33837.1| predicted protein [Neurospora crassa OR74A]
Length = 657
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 72/147 (48%), Gaps = 19/147 (12%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS +++D WL ++ K +L + G + +
Sbjct: 245 ITIEEVFQKDKLQLAVLSTFILDEAWLFDKLDLM-KTKLILCRGAPNQG-------EQIS 296
Query: 240 NWI-----LHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LW 290
W+ + K +P++ G HSK LL Y +RI+V +ANL+ DW L+
Sbjct: 297 TWLDGFPTVRKHLVPMNGSGCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILF 356
Query: 291 MQDFPLKDQNNLSEECG-FENDLIDYL 316
+ D PL D +++ E F +L+ +L
Sbjct: 357 IIDLPLLDDPDVTRELTHFGEELLYFL 383
>gi|336469464|gb|EGO57626.1| hypothetical protein NEUTE1DRAFT_81347 [Neurospora tetrasperma FGSC
2508]
Length = 656
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 72/147 (48%), Gaps = 19/147 (12%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS +++D WL ++ K +L + G + +
Sbjct: 244 ITIEEVFQKDKLQLAVLSTFILDEAWLFDKLDLM-KTKLILCRGAPNQG-------EQIS 295
Query: 240 NWI-----LHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LW 290
W+ + K +P++ G HSK LL Y +RI+V +ANL+ DW L+
Sbjct: 296 TWLDGFPTVRKHLVPMNGSGCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILF 355
Query: 291 MQDFPLKDQNNLSEECG-FENDLIDYL 316
+ D PL D +++ E F +L+ +L
Sbjct: 356 IIDLPLLDDPDVTRELTHFGEELLYFL 382
>gi|350290891|gb|EGZ72105.1| phospholipase D/nuclease [Neurospora tetrasperma FGSC 2509]
Length = 657
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 72/147 (48%), Gaps = 19/147 (12%)
Query: 181 VSIRDVIQGDII-VAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPA 239
++I +V Q D + +A+LS +++D WL ++ K +L + G + +
Sbjct: 244 ITIEEVFQKDKLQLAVLSTFILDEAWLFDKLDLM-KTKLILCRGAPNQG-------EQIS 295
Query: 240 NWI-----LHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQG---LW 290
W+ + K +P++ G HSK LL Y +RI+V +ANL+ DW L+
Sbjct: 296 TWLDGFPTVRKHLVPMNGSGCMHSKLQLLKYKDHLRIVVPSANLVSFDWGETGDMENILF 355
Query: 291 MQDFPLKDQNNLSEECG-FENDLIDYL 316
+ D PL D +++ E F +L+ +L
Sbjct: 356 IIDLPLLDDPDVTRELTHFGEELLYFL 382
>gi|325095061|gb|EGC48371.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces capsulatus H88]
Length = 652
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 76/325 (23%), Positives = 129/325 (39%), Gaps = 67/325 (20%)
Query: 336 INPSFFKK---FNFSSAA-VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFE--KGFKK 389
+N KK F+FS+ + I ++ G HT +K G L + + +
Sbjct: 342 VNEQIIKKMLRFDFSATKDIAFIHTIGGSHTDPKWEKTGLCGLGRAITSLNLQTSQDINL 401
Query: 390 SPLVYQFSSLGSLDEKWM-------------AELSSSMSSGFSEDKTPLGIGEP------ 430
+V+Q SS+GSL+E+++ EL+ S F +K + +
Sbjct: 402 DYIVFQTSSVGSLNEQFLRSIYLAAQGDNGLKELTLRTSRTFPSEKWGVVTNKSDGAKWK 461
Query: 431 ---LIVWPTVEDVRCSLEGYAAGNAIPSPQK-----NVDKDFLKKYWAKW-------KAS 475
+ +P++ VR S G I K KD ++ ++ K
Sbjct: 462 DKFRVYFPSLNTVRNSKGGIENAGTICFQSKWYNSATFPKDIMRDNISRREGLLMHNKML 521
Query: 476 HTGRSRAMPHIKTFA-RYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELG 530
+ + +K + RY+G W + SANLS++AWG L + + +L R++E G
Sbjct: 522 FVRPDKPITSVKNNSIRYSG----WTYVGSANLSESAWGRLVLDRATTKPKLNCRNWECG 577
Query: 531 VL--ILPSAKRHGCGFSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVV 588
V+ I + + T I S +SG TS SD G+ V
Sbjct: 578 VVIPIRHNDEEKSSYIPSTRGITTSVAESGGGNTS-------------AGSDDGSRVASV 624
Query: 589 Y---LPVPYELPPQRYSSEDVPWSW 610
+ +PVP ++P QRY D P+ +
Sbjct: 625 FEPTVPVPMKVPAQRYHGRDRPFFY 649
>gi|443895439|dbj|GAC72785.1| ras-related GTPase [Pseudozyma antarctica T-34]
Length = 689
Score = 47.4 bits (111), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 112/271 (41%), Gaps = 49/271 (18%)
Query: 179 SCVSIRDVIQGDIIVAILSNYMVDIDWLLPACPVLAKIPHVLVIHGESDGTLEHMKR--- 235
+ S R+ +Q +A+L+ Y + +DWL P + +L E T + R
Sbjct: 216 ATASSRNGLQ----LAVLATYDLRMDWLYSLFPKGLPVTLILPPPKEDYRTDPSVARPGL 271
Query: 236 ---------NKPANWILHKPPLPIS-FGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNK 285
+ W + P P + T H K ++L++P +R+ + + NL +DW
Sbjct: 272 HRSEIFGDFARCPGWQICVPSKPKGGWLTQHMKFLILVHPDFLRVAILSGNLNGIDWERI 331
Query: 286 SQGLWMQDFPLKDQ----------NNLSEECGFENDLIDYLSTLKWPEFSANLPAHGNFK 335
++QDFPL ++ F+ L+ L +L P +H +
Sbjct: 332 ENTAYIQDFPLNTDTAKAATPAHGSSQGRTNDFKAQLVRILRSLGMPS------SHPVY- 384
Query: 336 INPSFFKKFNFSSAA-VRLIASVPGYHTGSSLKKWGHM------KLRTVLQECTFEKGFK 388
+ + +FS A R++AS P S+L +W M +L V+++ +
Sbjct: 385 ---AALDRHDFSQATRARIVASWP---EASNLAEWDRMETQGLGRLGKVVRDLGIQPKRS 438
Query: 389 KS-PLVYQFSSLGSLDEKWMAELSSSMSSGF 418
S L Q SSL + D KW+ E ++SGF
Sbjct: 439 GSLQLECQGSSLANHDIKWI-EHFHLLASGF 468
>gi|330841055|ref|XP_003292520.1| hypothetical protein DICPUDRAFT_89860 [Dictyostelium purpureum]
gi|325077216|gb|EGC30943.1| hypothetical protein DICPUDRAFT_89860 [Dictyostelium purpureum]
Length = 658
Score = 46.2 bits (108), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 62/136 (45%), Gaps = 32/136 (23%)
Query: 175 WANTSCVSIRDVI-QGDIIVAILSNYMVDIDWLLPACPVL--AKIPHVLVIHGESDGTLE 231
W NT +S D+I + + AI++ Y +DI W++ + KIP + +
Sbjct: 151 WINT--LSFSDLISKPGMKFAIVTGYSIDIKWVMNSFERSQGTKIPITFIRDYD------ 202
Query: 232 HMKRNKPANWILHKPPLPISFGT-------------HHSKAMLLIYPRGVRIIVHTANLI 278
K++KP P PI F H+K ++L+Y +RI V +AN
Sbjct: 203 -QKKHKPG-------PHPIPFSNCTIIHPVLSGDQIFHAKLLVLVYDTWIRIAVTSANPS 254
Query: 279 HVDWNNKSQGLWMQDF 294
+++N SQ +W QDF
Sbjct: 255 SYEYSNLSQSIWYQDF 270
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 96/230 (41%), Gaps = 37/230 (16%)
Query: 337 NPSFFKKFNFSSAAVRLIASVPGYHTGSSLKKWGHMKLR-------------TVLQECTF 383
N F +F+FS++ +LI S+PG + +S K G +LR TV +
Sbjct: 385 NVQFLDQFDFSTSKAQLIISIPGEYKHTS-NKMGLERLRYHVNNYYKTQENNTVYGDDVK 443
Query: 384 EKGFKKSPLVYQFSSLG---SLDEKWMAELS-----SSMSSGFSEDKTPLGIGEPL---I 432
+ +K YQ SS+G + +++ +++++ + + G+ I
Sbjct: 444 SQSIQKI-FYYQSSSVGLSTFFKQAFVSNFKVNNNITTINTFHTMNSNNNNNGKDKSFHI 502
Query: 433 VWPTVEDVRCSLEGYAAGNAIPSPQKNVDKDFLKKY-WAKWKASHTGRSRAMPHIKTFA- 490
++PT V+ + G + D + KY ++ ++ H R + H K
Sbjct: 503 IYPTARWVKETQAKQKLGKVLSLAYDIYD---INKYDFSYFQIKHGYRKNTVSHSKIIVG 559
Query: 491 ------RYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 534
+ K W S N+S AAWG+ S L I +YE+G+L+L
Sbjct: 560 VSQNSLKNKELKYDWCYSGSHNISSAAWGSPSSRTSDLSILNYEMGILLL 609
>gi|410917580|ref|XP_003972264.1| PREDICTED: aprataxin and PNK-like factor-like [Takifugu rubripes]
Length = 124
Score = 46.2 bits (108), Expect = 0.050, Method: Composition-based stats.
Identities = 31/87 (35%), Positives = 44/87 (50%), Gaps = 4/87 (4%)
Query: 27 LSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDGTNPVVVKSG--DQRKKLS 83
L G VIGR + V DKR+SR H L + DG L NP ++S D + L
Sbjct: 17 LPPGETVIGRGPLLRVVDKRVSRHH-GLLENIDGCLRLKPTHMNPCFIQSSLTDDPRPLQ 75
Query: 84 SNEHVSIADGDIIELIPGHHFFKYVTL 110
+ S+ DGD+ L+PG ++ VT+
Sbjct: 76 KDSWFSLQDGDLFSLLPGQLIYRVVTV 102
>gi|350636132|gb|EHA24492.1| hypothetical protein ASPNIDRAFT_183042 [Aspergillus niger ATCC
1015]
Length = 324
Score = 45.8 bits (107), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 85/190 (44%), Gaps = 25/190 (13%)
Query: 240 NWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKS----QGLWMQDFP 295
N L PP+ HSK MLL +P +R++V TANL DW + +++ D P
Sbjct: 3 NLRLCFPPMGGQVVCMHSKLMLLFHPEYLRLVVPTANLTPYDWGEMNGVMENSVFLIDLP 62
Query: 296 LKDQNNLSEE--CGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFS-SAAVR 352
K N+ E+ F DL+ + LK N+ A F+FS ++
Sbjct: 63 KK---NVLEKPTTHFYEDLVVF---LKASTLHENIIAK---------LDNFDFSKTSKYA 107
Query: 353 LIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWM--AEL 410
+ ++ G HT ++ K+ G+ L ++ + + Y SS+G++ ++++ L
Sbjct: 108 FVHTIGGSHTDTAWKRTGYCGLGRAVERLNLCTSIPLN-IDYIASSVGAITDQFLRCMYL 166
Query: 411 SSSMSSGFSE 420
+S G +E
Sbjct: 167 ASQGDDGLTE 176
>gi|294944973|ref|XP_002784507.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
gi|239897573|gb|EER16303.1| tyrosyl-dna phosphodiesterase, putative [Perkinsus marinus ATCC
50983]
Length = 230
Score = 45.8 bits (107), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 85/206 (41%), Gaps = 31/206 (15%)
Query: 181 VSIRDVIQGD---IIVAILSNYMVDIDWLLPACPVLAKIPHVLVI-HGESDGTLEHMKRN 236
++ D+I GD I LS++ DI+WLL P VLV + G + +++
Sbjct: 31 LTFADII-GDKTTIKAVFLSSFGCDIEWLLEHFAF--GTPIVLVDDYDRKRGAMAEIQQP 87
Query: 237 KPANWILHKPPLPI-------SFGTHHSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGL 289
W K P GT H+K +++ + +R+ + ++NL DW SQ +
Sbjct: 88 FGEVWSQMKIVHPYFETGGLYDSGTMHAKLIIIERAQALRVCISSSNLTPQDWEGVSQCI 147
Query: 290 WMQDF--------PLKDQNNLSEECGFENDLIDYLSTLKWPEFSANLPAHG---NFKINP 338
W+ DF P + + F + L ++ T F ++P ++ +
Sbjct: 148 WVADFKAANDFEAPARKRVKPDHTSDFGDQLARFIET-----FFRSIPDSSSLWSYWVKV 202
Query: 339 SFFKKFNFS-SAAVRLIASVPGYHTG 363
+FN V LIAS PGY G
Sbjct: 203 LTGSRFNVKLPKGVELIASAPGYWKG 228
>gi|157103380|ref|XP_001647953.1| polynucleotide kinase- 3'-phosphatase [Aedes aegypti]
gi|108884176|gb|EAT48401.1| AAEL000527-PA, partial [Aedes aegypti]
Length = 507
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 6/88 (6%)
Query: 23 PKLPLSQGPNVIGRT-NIPVSDKRLSRKHITLTASADGSASLVVD-GTNPVVVKSGDQRK 80
P + + +IGR+ + D SR+ + L A+ G LV G+NP V+ K
Sbjct: 11 PPIRIDSDRKIIGRSPETLIQDPCCSRQQVCLKANFKGGFVLVKSLGSNPSVLNG----K 66
Query: 81 KLSSNEHVSIADGDIIELIPGHHFFKYV 108
+L N DGDI+EL+PG H + +V
Sbjct: 67 QLEKNMGYEAYDGDILELLPGQHQYTFV 94
>gi|66822393|ref|XP_644551.1| hypothetical protein DDB_G0273869 [Dictyostelium discoideum AX4]
gi|66822691|ref|XP_644700.1| hypothetical protein DDB_G0273125 [Dictyostelium discoideum AX4]
gi|60472674|gb|EAL70625.1| hypothetical protein DDB_G0273869 [Dictyostelium discoideum AX4]
gi|60472831|gb|EAL70780.1| hypothetical protein DDB_G0273125 [Dictyostelium discoideum AX4]
Length = 734
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 26/39 (66%)
Query: 496 KLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLIL 534
K W S N S +AWGA QKN SQ+ I ++E+GVL+L
Sbjct: 655 KYDWVYTGSHNFSLSAWGAFQKNESQVSISNFEIGVLLL 693
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 66/149 (44%), Gaps = 21/149 (14%)
Query: 161 PSTFRLLRVQGLPAWANTSCVSIRDVIQG-DIIVAILSNYMVDIDWLLPACPVLAKIPHV 219
P++F L P + +S +D+I+ ++ A++S + +D +W+ I +
Sbjct: 207 PNSFYLNSTNEQPRICTINTLSFKDLIKKPGMVGALVSGFALDPEWV---------IKEI 257
Query: 220 LVIHGESDGTLEHMKRNKPANWILH---------KPPLPISFGTHHSKAMLLIYPRGVRI 270
HG +KP H PPL ++ +HSK M+ + VR+
Sbjct: 258 RKEHGNKVKFTFVKNYSKPETKGRHAINDFITVINPPL-FNYQLYHSKLMIFTFVDLVRV 316
Query: 271 IVHTANLIHVDWNNKSQGLWMQDFPLKDQ 299
++ ++N D++ Q +W QDF LK Q
Sbjct: 317 VIPSSNPTKFDYSGWGQTIWFQDF-LKKQ 344
>gi|401885055|gb|EJT49186.1| Ran GTPase activator [Trichosporon asahii var. asahii CBS 2479]
Length = 1170
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 65/140 (46%), Gaps = 14/140 (10%)
Query: 254 THHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN-- 310
+ H K + Y G +R+ + TAN++ DW +++QD L ++ S + +
Sbjct: 428 SEHQKWAFVFYKTGRLRVAIMTANMVDYDWERIENTVFVQDV-LPNKAGHSPDWHLPDFP 486
Query: 311 ----DLIDYLSTLKWPEFSANLPAHGN---FKINPSF--FKKFNFSSAAVRLIASVPGYH 361
DL +L K EF G+ +PS+ F K+++S RL+ S+ G +
Sbjct: 487 QQFADLFKHLKIHKGIEFMRQTHPLGSQVPISSDPSYTDFGKWDWSRVKARLVISIAGKY 546
Query: 362 TG-SSLKKWGHMKLRTVLQE 380
G + KWG +L V+QE
Sbjct: 547 EGFHDMSKWGIGRLGQVVQE 566
>gi|240276898|gb|EER40409.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces capsulatus H143]
Length = 183
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 58/127 (45%), Gaps = 26/127 (20%)
Query: 491 RYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVL--ILPSAKRHGCGF 544
RY+G W + SANLS++AWG L + + +L R++E GV+ I + +
Sbjct: 69 RYSG----WTYVGSANLSESAWGRLVLDRATTKPKLNCRNWECGVVIPIRHNDEEKSSYI 124
Query: 545 SCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPPQRY 601
T I S +SG TS SD G+ V+ +PVP ++P QRY
Sbjct: 125 PSTRGITTSVAESGGGNTS-------------AGSDDGSRVASVFEPTVPVPMKVPAQRY 171
Query: 602 SSEDVPW 608
D P+
Sbjct: 172 HGRDRPF 178
>gi|291225011|ref|XP_002732503.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 544
Score = 44.3 bits (103), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 15/165 (9%)
Query: 25 LPLSQGPNVIGRTN-IPVSDKRLSRKHITLTASADGSASLVVDGTNPVVVK--SGDQRKK 81
+PL G ++GR + +SDKR+SR H L + G ++ NP + D+ +
Sbjct: 18 IPLPPGQTILGRGPFLGISDKRVSRSHAILEVDS-GKLRILPTHINPTFHQRLGTDKLRP 76
Query: 82 LSSNEHVSIADGDIIELIPGHHFFKYV--------TLSRSQKR-VSNDGATNGELSSKKM 132
L+ +E + +G+ LIP H FK V T S S K V + +KK
Sbjct: 77 LAKDEWQELKNGEKFSLIPEFHIFKVVIDEKPINNTSSNSSKTPVEEENGKETITENKKT 136
Query: 133 RQQDEQDNENGKNSEEALCNFHVSR--DKLPSTFRLLRVQGLPAW 175
+ + NG+ S+ + N + DK + R + LP+W
Sbjct: 137 DDVESDEKPNGEKSKPSAGNVQTVKLEDKKEVALPVQRERKLPSW 181
>gi|440797761|gb|ELR18837.1| Poly(ADP-ribose) polymerase catalytic domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 601
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 64/133 (48%), Gaps = 7/133 (5%)
Query: 11 PLDNNLREDNSLPKLPLSQGPNVIGRTNIP-VSDKRLSRKHITLT-ASADGSASLVVDGT 68
P + ++ LP + L G +GR + + D RLSRK +T+ G AS+ V G
Sbjct: 26 PPEAHVHLPQDLPTVSLKHGETDLGRGRLTQLLDPRLSRKQLTVEWDEHSGRASVHVHGM 85
Query: 69 NPVVVKSGDQRKKLSSNEH---VSIADGDIIELIPGHHFFKYVTLSRSQKRVSNDGATNG 125
NP V + Q++ ++ ++ V + DG +I L+PG + + + R + A G
Sbjct: 86 NPSYVHAQGQQEGVAVSKETGKVEVGDGVVISLLPGLYGYTLRIIDREAS--TAPPANAG 143
Query: 126 ELSSKKMRQQDEQ 138
++S K + + E
Sbjct: 144 HVNSHKRKLEGEH 156
>gi|225554729|gb|EEH03024.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Ajellomyces capsulatus G186AR]
Length = 676
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 59/132 (44%), Gaps = 32/132 (24%)
Query: 491 RYNGQKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCG--- 543
RY+G W + SANLS++AWG L + + +L R++E GV+I RH
Sbjct: 562 RYSG----WTYVGSANLSESAWGRLVLDRATTKPKLNCRNWECGVVI---PIRHNDEEKS 614
Query: 544 --FSCTSNIVPSEIKSGSTETSQIQKTKLVTLTWHGSSDAGASSEVVY---LPVPYELPP 598
T I S +SG TS SD G+ V+ +PVP ++P
Sbjct: 615 PYIPSTRGITTSVAESGGGNTS-------------AGSDDGSRVASVFEPTVPVPMKVPA 661
Query: 599 QRYSSEDVPWSW 610
QRY D P+ +
Sbjct: 662 QRYHGRDRPFFY 673
>gi|406694621|gb|EKC97945.1| Ran GTPase activator [Trichosporon asahii var. asahii CBS 8904]
Length = 1114
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 64/139 (46%), Gaps = 14/139 (10%)
Query: 255 HHSKAMLLIYPRG-VRIIVHTANLIHVDWNNKSQGLWMQDFPLKDQNNLSEECGFEN--- 310
H K + Y G +R+ + TAN++ DW +++QD L ++ S + +
Sbjct: 381 EHQKWAFIFYKTGRLRVAIMTANMMDYDWERIENTVFLQDV-LPNKAGHSPDWHLPDFPQ 439
Query: 311 ---DLIDYLSTLKWPEFSAN---LPAHGNFKINPSF--FKKFNFSSAAVRLIASVPGYHT 362
DL +L K EF L + +PS+ F K+++S RL+ S+ G +
Sbjct: 440 QFADLFKHLKIHKGIEFMRQKHPLGSQVPISSDPSYTDFGKWDWSRVKARLVISISGKYE 499
Query: 363 G-SSLKKWGHMKLRTVLQE 380
G + KWG +L V+QE
Sbjct: 500 GFHDMSKWGIGRLGQVVQE 518
>gi|154298872|ref|XP_001549857.1| hypothetical protein BC1G_11683 [Botryotinia fuckeliana B05.10]
Length = 495
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 56/139 (40%), Gaps = 28/139 (20%)
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIV-AILSNYMVDIDWLLPACPVLAKIPHVLVIHGESD 227
QG P + + I +V+Q + AIL + +D DW+ K+ VL E++
Sbjct: 279 AQGFPREDD---IKIEEVLQSSTLEHAILGAFQIDSDWIRSKIQPSTKVIWVLQAKTEAE 335
Query: 228 GTLEHMKR-------NK-----------------PANWILHKPPLPISFGTHHSKAMLLI 263
H KR NK P + PP+ + HSK +L
Sbjct: 336 SFPRHQKRPEIQLQRNKELARYGGVIKMNFKSLAPETYRFCFPPMEGNVNIMHSKLQILA 395
Query: 264 YPRGVRIIVHTANLIHVDW 282
+P +R+++ +ANL DW
Sbjct: 396 HPTHLRLVIPSANLTPYDW 414
>gi|444315287|ref|XP_004178301.1| hypothetical protein TBLA_0A10020 [Tetrapisispora blattae CBS 6284]
gi|387511340|emb|CCH58782.1| hypothetical protein TBLA_0A10020 [Tetrapisispora blattae CBS 6284]
Length = 566
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 429 EPLIVWPTVEDVRCS-LEGYAAG--NAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPH 485
+P++V+PT ++++ S G AAG + I S K F K+ K T S + +
Sbjct: 405 QPMVVFPTTQEIKDSPTHGDAAGWFHNIGSNSFESQKIFYKQGPNVSKERGTTPSHSKYY 464
Query: 486 IKTFARYNG--QKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLILPSAKRHGCG 543
+K+ + L W + TS+NLS +AWG +K+ R++E+G++I P ++G
Sbjct: 465 MKSTCTDEDPFKYLDWCIYTSSNLSMSAWGTDRKDP-----RNFEIGIVIKP---KNGGK 516
Query: 544 FSCTS 548
C S
Sbjct: 517 LKCHS 521
>gi|328850417|gb|EGF99582.1| hypothetical protein MELLADRAFT_94260 [Melampsora larici-populina
98AG31]
Length = 286
Score = 42.7 bits (99), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 59/124 (47%), Gaps = 23/124 (18%)
Query: 175 WANTSCVSIR--DVI--QGDIIVAILSNYMVDIDWLL----PACPVLAKIPHVLVIHGES 226
W + S +IR D+I + + A++S Y+VDI WL P P+L ++ H +
Sbjct: 132 WHSDSQDAIRAEDIIYPKHKVTKALVSGYVVDIGWLRGLFDPGTPLL------IIKHDKD 185
Query: 227 DGTLEHMKRNKPANWILHKPPLPIS------FGTHHSKAMLLIYPRGVRIIVHTANLIHV 280
GT + +R P ++ H PP+ ++ G H K ++ + VR+ + T N +
Sbjct: 186 AGTFKLKQR--PNTFLCH-PPMKLTAKGSLAHGAMHVKFFIIYFADRVRVAISTGNPVEF 242
Query: 281 DWNN 284
D+
Sbjct: 243 DYQT 246
>gi|303322280|ref|XP_003071133.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240110832|gb|EER28988.1| Tyrosyl-DNA phosphodiesterase family protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 608
Score = 42.4 bits (98), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 99/231 (42%), Gaps = 45/231 (19%)
Query: 343 KFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLV--YQFSSL 399
+F+F +A + ++ G HTGS WG + + + T PL Y SSL
Sbjct: 326 EFDFGKTAGFAFVHTIGGSHTGSD---WGKTGVCGLGKAVTMLGLQTPQPLKLDYVTSSL 382
Query: 400 GSLDEKWM-------------AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTV 437
GSL++++M EL+ S F DK + + + LI +P++
Sbjct: 383 GSLNDQFMRSMYLAAQGDNGLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSL 442
Query: 438 EDVRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQK 496
+ V+ S + I K ++ ++ + S + R + H KT F R + K
Sbjct: 443 KTVQGSRARPSGAGTICFQSKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGK 500
Query: 497 L----------AWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLI 533
+ W + SANLS++AWG L + S +L R++E GV+I
Sbjct: 501 IIGDANTTAYQGWTYVGSANLSESAWGRLVIDRSTTKPKLNCRNWECGVII 551
>gi|323454653|gb|EGB10523.1| hypothetical protein AURANDRAFT_62499 [Aureococcus anophagefferens]
Length = 1848
Score = 42.4 bits (98), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 38/73 (52%), Gaps = 13/73 (17%)
Query: 484 PHIKTFARYNGQ-KLAWFLLTSANLSKAAWGALQKNN-----------SQLMIRSYELGV 531
PH+ + ++G+ + LLTSANLS AAWG + N L IRS+ELGV
Sbjct: 1744 PHLMLYVLHDGRGAVRRALLTSANLSAAAWGRRRSANDPENADACDAAGALEIRSFELGV 1803
Query: 532 LILPSAKRHGCGF 544
+ P A G GF
Sbjct: 1804 CV-PVAPDAGEGF 1815
>gi|119196585|ref|XP_001248896.1| hypothetical protein CIMG_02667 [Coccidioides immitis RS]
Length = 629
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 98/229 (42%), Gaps = 41/229 (17%)
Query: 343 KFNFS-SAAVRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGS 401
+F+F +A + ++ G HTGS K G L + E + L Y SSLGS
Sbjct: 347 EFDFGKTAGFAFVHTIGGSHTGSYWGKTGVCGLGKAVTMLGLETP-QPLKLDYITSSLGS 405
Query: 402 LDEKWM-------------AELSSSMSSGFSEDKTPLGIGEP---------LIVWPTVED 439
L++++M EL+ S F DK + + + LI +P+++
Sbjct: 406 LNDQFMRSMYLAAQGDNGLKELTLRTSKTFPSDKWGVTVKKADGAEWKDRFLIYFPSLKT 465
Query: 440 VRCSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSRAMPHIKT-FARYNGQKL- 497
V+ S + I K ++ ++ + S + R + H KT F R + K+
Sbjct: 466 VQGSRARPSGAGTICFQSKWYNRAEFPRH--TLRDSLSRRHGILMHSKTIFVRPDNGKII 523
Query: 498 ---------AWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLI 533
W + SANLS++AWG L + S +L R++E GV+I
Sbjct: 524 GDANTTAYQGWTYVGSANLSESAWGRLVIDRSTTKPKLNCRNWECGVII 572
>gi|404485080|ref|ZP_11020284.1| hypothetical protein HMPREF9448_00695 [Barnesiella intestinihominis
YIT 11860]
gi|404340085|gb|EJZ66516.1| hypothetical protein HMPREF9448_00695 [Barnesiella intestinihominis
YIT 11860]
Length = 172
Score = 42.0 bits (97), Expect = 0.93, Method: Composition-based stats.
Identities = 26/103 (25%), Positives = 48/103 (46%), Gaps = 11/103 (10%)
Query: 4 TKIGYLVPLDNNLREDNSLPKLPLSQGPNVIGRT------NIPV--SDKRLSRKHITLTA 55
T +G++ L+N + PL G N+IGR +IP+ SD + R+H +
Sbjct: 54 TSLGFITVLENAF---GYRQEFPLHAGDNIIGRASKGTEVDIPIETSDMSMDRRHCIINV 110
Query: 56 SADGSASLVVDGTNPVVVKSGDQRKKLSSNEHVSIADGDIIEL 98
G+ ++ NP + + + + LS E + DGD++ +
Sbjct: 111 KEKGNRPILTVRDNPSLTGTFLRHELLSDRERAVLHDGDVVTI 153
>gi|330792943|ref|XP_003284546.1| hypothetical protein DICPUDRAFT_148330 [Dictyostelium purpureum]
gi|325085576|gb|EGC38981.1| hypothetical protein DICPUDRAFT_148330 [Dictyostelium purpureum]
Length = 613
Score = 42.0 bits (97), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 45/204 (22%), Positives = 90/204 (44%), Gaps = 19/204 (9%)
Query: 339 SFFKKFNFSSAA---VRLIASVPGYHTGSSLKKWGHMKLRTVLQECTFEKGFKKS--PLV 393
S+ F+FS + +++++P +S ++ G +KL++V+Q L
Sbjct: 346 SYLDDFDFSICTDNNIHIVSTIPSLSNDNSNQQNGFLKLKSVVQNYNSSNNNPDGVYSLT 405
Query: 394 YQFSSLGSLDEKWMAELSSSMSSGFSEDKTPLGIGEPLIVWPTVEDVRC--SLEGYAAGN 451
YQ S++GS+ + W + ++ + + IV+PT++ ++ + + A
Sbjct: 406 YQSSAIGSIRKNWFENFTDNLFPNLVRTEKKVS-----IVFPTLDTIQTLSNKDKNLALE 460
Query: 452 AIPSPQKNVDKDFLKKYWAKWKA-SHTGRSRAMP---HIKTFARYNGQKLAWFLLTSANL 507
+I +++ D+LKK + +G ++ +P I F N W S N
Sbjct: 461 SITIRYQDL-TDYLKKKNLLYDYFEESGHNQVIPLHSKIIIFLEENKPNSGWVYHGSHNF 519
Query: 508 SKAAWGALQKNNSQLMIRSYELGV 531
S+ +WG L S + +YE GV
Sbjct: 520 SEGSWGMLS--GSGIKTFNYETGV 541
>gi|443723184|gb|ELU11715.1| hypothetical protein CAPTEDRAFT_223095 [Capitella teleta]
Length = 942
Score = 42.0 bits (97), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 61/304 (20%), Positives = 119/304 (39%), Gaps = 39/304 (12%)
Query: 256 HSKAMLLIYPRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL---KDQNNLS--------- 303
H +LL + +R+I+ +A+L W Q W DFPL K+ + S
Sbjct: 477 HPNLILLRFKHCLRVIITSASLRRRHWEEVVQLGWTADFPLAVDKETDETSWVAMNMMDE 536
Query: 304 EECGFENDLIDYLSTLKWPEFSANLPAHGNFKINPSFFKKFNFSSAAVRLIASVPGYHTG 363
EE E + ++ + L+ F +L G+ + F+ S VRLI S G +
Sbjct: 537 EEARAEAQVTNFGTDLE--GFLKDLQIDGDHLLTGI---DFSVLSPCVRLITSKLGAVSQ 591
Query: 364 SSLKKWGHMKLRTVLQECTFEKGFKKSPLVYQFSSLGSLDEKWMAELSSSMSSGFSEDKT 423
+ + +L++++ ++ K+ + LG ++ + +S +G +
Sbjct: 592 EESENYAVARLKSLISRFPWKANSKRDNVCVS-HRLGLSNDTPLGIISDIFRTG-DRNSP 649
Query: 424 PLGIGEPLIVWPTVEDVR--CSLEGYAAGNAIPSPQKNVDKDFLKKYWAKWKASHTGRSR 481
P +++P+ D + CS + + +D D L + H+ +
Sbjct: 650 PFK-----LLYPSEADAKKHCSEVDGLTYEDLATDDTFIDFDIL---FHSHPFLHSSKES 701
Query: 482 AMPHIKTFARYN-------GQKLAWFLLTSANLSKAAWG---ALQKNNSQLMIRSYELGV 531
+ H +Y ++L WF+ S L +WG ++ N ++ ELGV
Sbjct: 702 LVLHANALLKYEDITDDSGSKRLGWFMFGSQVLGLKSWGDSNRRRRRNEVQILERMELGV 761
Query: 532 LILP 535
+ P
Sbjct: 762 GVFP 765
>gi|340374112|ref|XP_003385582.1| PREDICTED: aprataxin and PNK-like factor-like [Amphimedon
queenslandica]
Length = 432
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 46/76 (60%), Gaps = 3/76 (3%)
Query: 27 LSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDGTNPVVVK-SG-DQRKKLS 83
LS+G + IGR + ++DKR+SR H T+ + D + S+ TNP K SG D++ +L
Sbjct: 15 LSKGEHTIGRGPLLKITDKRVSRNHATVKVNDDNAVSICPRHTNPCYYKPSGRDEQIQLK 74
Query: 84 SNEHVSIADGDIIELI 99
+ +++DGD I ++
Sbjct: 75 KDVWQTLSDGDQISIL 90
>gi|435853317|ref|YP_007314636.1| putative membrane-anchored protein [Halobacteroides halobius DSM
5150]
gi|433669728|gb|AGB40543.1| putative membrane-anchored protein [Halobacteroides halobius DSM
5150]
Length = 372
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 4/69 (5%)
Query: 220 LVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIYPRGVRIIVHTANLIH 279
L++H DGT MKR K N + P P GT AMLL Y +G +IV H
Sbjct: 233 LIVHAYPDGTAPGMKRIKKLNLQAQRIPAP---GTSEDIAMLLAYEKGAELIVAVGTHTH 289
Query: 280 -VDWNNKSQ 287
+D+ K +
Sbjct: 290 MIDFLEKGR 298
>gi|91786388|ref|YP_547340.1| ABC transporter-like protein [Polaromonas sp. JS666]
gi|91695613|gb|ABE42442.1| carbohydrate ABC transporter ATP-binding protein, CUT1 family
[Polaromonas sp. JS666]
Length = 360
Score = 40.4 bits (93), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 20 NSLPKLPLSQGPNVIG----RTNIP-VSDKRLSRKHITLTASADGSASLVVDGTNPVVVK 74
N + LP+ QG ++G R +P VS +RL+ TLTA GSA + + VV+
Sbjct: 237 NLIAALPVGQGVQLVGGPVLRMAVPSVSAQRLA----TLTAGIRGSALRIEERAGDVVLA 292
Query: 75 SGDQRKKLSSNE---HVSIADGDIIELIPGHHFF 105
+ ++S ++ HV+ A G+++ + G H+F
Sbjct: 293 GRVELAEISGSDTFVHVATAAGELVAQLTGVHYF 326
>gi|146162654|ref|XP_001009833.2| FHA domain containing protein [Tetrahymena thermophila]
gi|146146354|gb|EAR89588.2| FHA domain containing protein [Tetrahymena thermophila SB210]
Length = 561
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 59/260 (22%), Positives = 108/260 (41%), Gaps = 29/260 (11%)
Query: 1 MSATKI----GYLVPLDNNLREDNSLPKLPLSQGPNVIGRT-NIPVSDKRLSRKHITLTA 55
MS+T I G L+P ++E L+Q +++GR ++ V + ++S +H L
Sbjct: 1 MSSTDIQQKWGELIPKGGLVQE-----TFVLNQKEHILGRRGDLKVDNPKVSGQHCVLKY 55
Query: 56 SADGSASLVVDGTNPVVVKSGDQ--RKKLSSNEHVSIADGDIIELIPGH-----HFFKYV 108
+ ++D V +G KKL N+ V + +GD++ L+ + FK V
Sbjct: 56 DYAQKKAYIID-----VSSNGTSLFNKKLEKNKEVELENGDLVNLLQDKSQWIGYIFKLV 110
Query: 109 TLSRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEEALCNFHVSRDKLPSTFRLLR 168
S + D AT K++ Q EQ +N K +E +++L + ++
Sbjct: 111 DNVDSFNKDQQDTATKNNTDQKQLEQSLEQYKQNEKEQQEQNEQIKKMQNELEERLKKVK 170
Query: 169 VQGLPAWANTSCVSIRDVIQGDIIVA-ILSNYMVDIDWLL-----PACPVLAKIPHVLVI 222
+ +CV D++ ++ L NY D L ACP + P +
Sbjct: 171 EDDEHFEKDQTCVVCIDLLYNPYLMTPCLHNYCCDCMCELLKNKDIACPQCREKPISVQK 230
Query: 223 HGESDGTLE-HMKRNKPANW 241
+ + + +E +KRN W
Sbjct: 231 NYQLNNLIEAFIKRNPDKKW 250
>gi|322711943|gb|EFZ03516.1| tyrosyl-DNA phosphodiesterase domain-containing protein
[Metarhizium anisopliae ARSEF 23]
Length = 496
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 33/53 (62%), Gaps = 4/53 (7%)
Query: 495 QKLAWFLLTSANLSKAAWGALQKNNS----QLMIRSYELGVLILPSAKRHGCG 543
+KLAW + SANLS++AWG + + + ++M R++E GV++ A G G
Sbjct: 349 EKLAWAYVGSANLSESAWGRVVTDRASGQKKMMCRNWECGVVLPVRAFEQGSG 401
>gi|320168830|gb|EFW45729.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 538
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 13/120 (10%)
Query: 6 IGYLVPL--DNNLREDNSLPKLPLSQGPNVIGR---TNIPVSDKRLSRKHITLTASADGS 60
+ LVPL R D + + L +G V+GR TN+ D+RLSR H + DG+
Sbjct: 4 LARLVPLLMPAASRPDPASKVVDLERGETVLGRGPLTNL--EDRRLSRNHAKIQIDHDGA 61
Query: 61 ASLVVDGTNPVVVKSGDQRKKLSSNEH------VSIADGDIIELIPGHHFFKYVTLSRSQ 114
A ++ V+ D S+E VS+ GD++ L+P F+ V L + Q
Sbjct: 62 AHIMSTHKTLCSVRRADAAGGDGSDEQLPLHTWVSLKHGDVLFLMPNAFPFRVVNLVKEQ 121
>gi|440802752|gb|ELR23681.1| hypothetical protein ACA1_073250 [Acanthamoeba castellanii str.
Neff]
Length = 294
Score = 40.0 bits (92), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 42/74 (56%), Gaps = 6/74 (8%)
Query: 34 IGRTNIPVSDKRLSRKHITLTASADGSASLVVDGTNPVVV----KSGDQRKKLSSNEHVS 89
+GR + V+DKR+SR+ + ++ A + V+G NPV V K+GD + LS E
Sbjct: 22 LGRGVLGVTDKRISRRQLQISLRGPALA-VTVEGVNPVYVRRAGKAGDG-ELLSRGEEAI 79
Query: 90 IADGDIIELIPGHH 103
+ +GD++ L+ H
Sbjct: 80 LRNGDVVTLLADLH 93
>gi|401626756|gb|EJS44678.1| tdp1p [Saccharomyces arboricola H-6]
Length = 539
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 9/50 (18%)
Query: 497 LAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI----LPSAKRHGC 542
L W L TSANLS+ AWG + K R+YE+GVL LP ++ C
Sbjct: 451 LEWCLYTSANLSQTAWGTISKKP-----RNYEVGVLYHSGRLPGTRKITC 495
>gi|296223668|ref|XP_002757728.1| PREDICTED: aprataxin and PNK-like factor [Callithrix jacchus]
Length = 478
Score = 39.7 bits (91), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 11/105 (10%)
Query: 9 LVPLDNNLREDNSLPKLPLSQGPNVIGRTNI-PVSDKRLSRKHITLTASADGSASLVVDG 67
L PLD P++ L+ G V+GR + ++DKR+SR+H L ADG +
Sbjct: 7 LQPLDGG-------PRVALASGETVVGRGPLLGITDKRVSRRHAILEV-ADGQLRIKPVH 58
Query: 68 TNPVVVKSGDQRK--KLSSNEHVSIADGDIIELIPGHHFFKYVTL 110
TNP +S ++ + L +N + GD L+ + F+ + +
Sbjct: 59 TNPCFYQSSEKSQLVPLKTNLWCCLNPGDSFSLLVDKYTFRVLAI 103
>gi|195572577|ref|XP_002104272.1| GD20873 [Drosophila simulans]
gi|194200199|gb|EDX13775.1| GD20873 [Drosophila simulans]
Length = 523
Score = 39.7 bits (91), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 23/120 (19%)
Query: 27 LSQGPNVIGRT-NIPVSDKRLSRKHITLTASADGSA-SLVVDGTNPVVVKSGDQRKKLSS 84
L+ G N +GR+ + D + S++ I L + SL V G NP V +
Sbjct: 37 LTAGENFVGRSRETGIRDSKCSKRQIQLQVDLKKAVVSLKVLGVNPCGVNG----LMVMQ 92
Query: 85 NEHVSIADGDIIELIPGHHFFKYV-----------------TLSRSQKRVSNDGATNGEL 127
N + GD++E++ G H F+ V TLS S+K D A NG+L
Sbjct: 93 NSECELKHGDLVEIVYGRHPFEVVFNPPPEDDKEKAEPLSTTLSHSEKSERWDSAGNGKL 152
>gi|281205023|gb|EFA79217.1| hypothetical protein PPL_08045 [Polysphondylium pallidum PN500]
Length = 487
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 48/94 (51%), Gaps = 2/94 (2%)
Query: 8 YLVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSDKRLSRKHITLTASADGSASLVVDG 67
+L+ L + + +N L + G IGR ++ +S+K+ SRK I + + L+ +G
Sbjct: 10 HLIHLKSINKAENLLDHTYKATGTYEIGRGSLGISEKKCSRKQILIKLDEHSNYYLISNG 69
Query: 68 TNPVVVKSGDQRK--KLSSNEHVSIADGDIIELI 99
NP +K D+ +++ +E + DGD ++
Sbjct: 70 INPSYLKKYDKDYFVQMTKDEEYVLEDGDSFSML 103
>gi|145235397|ref|XP_001390347.1| hypothetical protein ANI_1_556034 [Aspergillus niger CBS 513.88]
gi|134058029|emb|CAK38258.1| unnamed protein product [Aspergillus niger]
gi|350632869|gb|EHA21236.1| hypothetical protein ASPNIDRAFT_54717 [Aspergillus niger ATCC 1015]
Length = 387
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 64/158 (40%), Gaps = 35/158 (22%)
Query: 9 LVPLDNNLREDNSLPKLPLSQGPNVIGRTNIPVSD----KRLSRKHITLTASADGSASLV 64
+ PLD+ E N LP L+ P + PVS+ R+ + A A +
Sbjct: 183 VTPLDHPHEEINDLPVHRLT-NPQIF----YPVSESRQFNRVDAGRVFSAAPALEHEQVA 237
Query: 65 VDGTNPV--------------VVKSGDQRKKLSSNEHVSIADGDIIELIPGHHFFKYVTL 110
D NP +V GD+ EH + D+ IP H VT
Sbjct: 238 KDAANPSEAISRVTQNPSHIELVGKGDE-------EHQVLQPADV--RIPHPHM---VTS 285
Query: 111 SRSQKRVSNDGATNGELSSKKMRQQDEQDNENGKNSEE 148
+R KRV N+GA + EL ++ QQD D E + ++E
Sbjct: 286 TRDIKRVPNEGAKHAELYQARLNQQDAADQERKRLAQE 323
>gi|329901801|ref|ZP_08272900.1| hypothetical protein IMCC9480_3399 [Oxalobacteraceae bacterium
IMCC9480]
gi|327549010|gb|EGF33621.1| hypothetical protein IMCC9480_3399 [Oxalobacteraceae bacterium
IMCC9480]
Length = 658
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 484 PHIKTFARYNGQKLAWFLLTSANLSKAAWGALQKNNSQLMIRSYELGVLI 533
PH K + GQ L+TSAN S +AWG ++ + L I+++ELGV +
Sbjct: 343 PHAKVYCFTRGQSRR-LLITSANFSPSAWG-IENRHGSLTIKNFELGVCL 390
>gi|440791002|gb|ELR12258.1| UBA/TSN domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 615
Score = 39.3 bits (90), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 55/106 (51%), Gaps = 12/106 (11%)
Query: 24 KLPLSQGPNVI-GRTN--IPVSDKRLSRKHITLT-----ASADGSASLVVDGTNPVVV-- 73
++ LS G +++ GR + + +SDKR SR+ LT +D +LV G N V
Sbjct: 14 EVELSAGADIVMGRGSPLLGISDKRCSRRQAVLTFLPPATPSDQPFALVAHGPNTTFVRR 73
Query: 74 KSGDQRKKLSSNEHVSIADGDIIELIPGHH--FFKYVTLSRSQKRV 117
+ ++R+ ++ E + DGD+I L P +H + +++ Q++
Sbjct: 74 RGAEEREGMAKGEVYFLNDGDVIRLPPDYHPIVLRLISVGGEQEQT 119
>gi|321474170|gb|EFX85136.1| hypothetical protein DAPPUDRAFT_46356 [Daphnia pulex]
Length = 512
Score = 38.9 bits (89), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 60/123 (48%), Gaps = 12/123 (9%)
Query: 31 PNVIGRTNIP-VSDKRLSRKHITLTASAD-GSASLVVDGTNPVVVKSGDQRKKLSSNEHV 88
P VIGR + + D RLSR H+ L A + G S+ + G N K+G K + +E V
Sbjct: 23 PLVIGRGPLTRIKDPRLSRNHVELVADCEKGLLSVKLIGAN--ACKAGTSIIK-AKDESV 79
Query: 89 SIADGDIIELIPGHHFFKYV------TLSRSQKRVSNDGATNGELSSKKMRQQDE-QDNE 141
+ G+IIEL+ F+ + S+K S + + +KK + +D ++ +
Sbjct: 80 QLKHGEIIELLEKQFPFRVEFSPDPNQVPSSRKSTSAEDVQDPSFFAKKQKMEDTWEEID 139
Query: 142 NGK 144
NGK
Sbjct: 140 NGK 142
>gi|253995926|ref|YP_003047990.1| cytochrome c oxidase subunit I [Methylotenera mobilis JLW8]
gi|253982605|gb|ACT47463.1| cytochrome c oxidase, subunit I [Methylotenera mobilis JLW8]
Length = 530
Score = 38.9 bits (89), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 205 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 264
WLLP +L +P L + G DG L W + PPL I G A+ ++
Sbjct: 118 WLLPPSAILLTLPFTLALFGIGDGALA-------TGWTFY-PPLSIQGGIGVDFAIFAVH 169
Query: 265 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
G+ ++ + N+I +N ++ G+ + P+
Sbjct: 170 LLGISSVLGSINIIVTLFNMRAPGMTLMKMPM 201
>gi|71907102|ref|YP_284689.1| cytochrome c oxidase, subunit I [Dechloromonas aromatica RCB]
gi|71846723|gb|AAZ46219.1| Cytochrome c oxidase, subunit I [Dechloromonas aromatica RCB]
Length = 531
Score = 38.9 bits (89), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 205 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 264
WLLP +L +P L + G DG L W + PL + G A+L ++
Sbjct: 119 WLLPPAAILLTLPFSLALFGIGDGALA-------TGWTFYA-PLSVQGGMGVDFAILAVH 170
Query: 265 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
G+ I+ + N+I +N ++ G+ M PL
Sbjct: 171 ILGISSIMGSINIIVTIFNMRAPGMTMMKLPL 202
>gi|257095684|ref|YP_003169325.1| cytochrome c oxidase subunit I [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257048208|gb|ACV37396.1| cytochrome c oxidase, subunit I [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
Length = 535
Score = 38.9 bits (89), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 205 WLLPACPVLAKIPHVLVIHGESDGTLEHMKRNKPANWILHKPPLPISFGTHHSKAMLLIY 264
WLLP L +P +L + G DG + W L+ PL + G A+ I+
Sbjct: 123 WLLPPAAALLTLPFILALFGIGDGAVN-------TGWTLYA-PLSVQGGMGVDFAIFSIH 174
Query: 265 PRGVRIIVHTANLIHVDWNNKSQGLWMQDFPL 296
GV I+ + N+I +N ++ G+ M PL
Sbjct: 175 ILGVSSILGSINIIVTIFNLRAPGMTMMKLPL 206
>gi|195330722|ref|XP_002032052.1| GM26347 [Drosophila sechellia]
gi|194120995|gb|EDW43038.1| GM26347 [Drosophila sechellia]
Length = 523
Score = 38.5 bits (88), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 23/120 (19%)
Query: 27 LSQGPNVIGRT-NIPVSDKRLSRKHITLTASADGSA-SLVVDGTNPVVVKSGDQRKKLSS 84
L+ G N +GR+ + D + S++ I L + SL V G NP V +
Sbjct: 37 LTAGENFVGRSRETGIRDSKCSKRQIQLQVDLKKAVVSLKVLGVNPCGVNG----LMVMQ 92
Query: 85 NEHVSIADGDIIELIPGHHFFKYV-----------------TLSRSQKRVSNDGATNGEL 127
N + GD++E++ G H F+ V TLS S+K D A NG+L
Sbjct: 93 NSECELKHGDLVEIVYGRHPFEVVFNPPPEDDKEKTEPLSTTLSPSEKSERWDSAGNGKL 152
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,700,365,106
Number of Sequences: 23463169
Number of extensions: 467421005
Number of successful extensions: 1056615
Number of sequences better than 100.0: 542
Number of HSP's better than 100.0 without gapping: 343
Number of HSP's successfully gapped in prelim test: 199
Number of HSP's that attempted gapping in prelim test: 1053957
Number of HSP's gapped (non-prelim): 973
length of query: 636
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 487
effective length of database: 8,863,183,186
effective search space: 4316370211582
effective search space used: 4316370211582
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)