BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001299
(1105 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356537037|ref|XP_003537037.1| PREDICTED: uncharacterized protein LOC100806794 [Glycine max]
Length = 1903
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/1070 (44%), Positives = 611/1070 (57%), Gaps = 137/1070 (12%)
Query: 99 QLPLQCNKADGPSNVHNAVVPYQVGPSSEHG--ALVPHQIKEKRKKLKPEVVLDPGTLRM 156
QL L D + NA+VPY+ S HG +VP IK ++ L+P+V LD T R+
Sbjct: 850 QLNLNAEARDLAFHEENALVPYKQKNSLIHGDGVIVPFHIK--KQHLRPKVNLDDETDRV 907
Query: 157 WNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVV 216
W L++ +D + ++ED KWW++ER VF GR SF ARMHL+ GDRRF WKGSVV
Sbjct: 908 WKLLL-LDINSHGIDGTDEDKAKWWEEERNVFRGRADSFIARMHLVQGDRRFSRWKGSVV 966
Query: 217 DSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTN---DHTPSDENLRTTAS----LEPIG 269
DSVVGV+LTQNV+D+LSSSA+MSLAARFP S++ H D L +EP
Sbjct: 967 DSVVGVFLTQNVTDHLSSSAFMSLAARFPKNSSSMCKRHHKEDTRLVVNEPQVHIVEPEE 1026
Query: 270 SNSTS----NGAVYDS---------EGNMYFVTEPE----PDRCCELKDRDDAFDSRIQR 312
S N +VYD G V E P L D ++ S + +
Sbjct: 1027 STEWDVKLLNQSVYDQTSTIDMAEHSGEKEAVNSNESCGTPSSVISLTDESNSRLSELPQ 1086
Query: 313 KALQE-----------------------NGDIKVMTDAVPSQ--------AFDTSSVQSL 341
K ++E NGD K + D V SQ + D S+ Q+
Sbjct: 1087 KNIKEHCSPTRSGILSATIEEGEEKSCYNGDRKELNDIVSSQGSVFSSQISGDFSNDQNP 1146
Query: 342 DRTQLFPTGNSKADVASSTKTSNAESFITQFSHTGNLKKNSVNQLFPTVNSKADVASPSQ 401
++ NS+ +V SST N T FS L+ S + + + K++ S
Sbjct: 1147 EKIGSCSDSNSEVEVLSSTAKYNHFGSNTSFSKL--LEMVSSTKFYEDNSQKSE----SI 1200
Query: 402 NHITQSSVTQFWP--TGNSTADVASPSKTCIKESSIAASTEIPQLENTALLQDKVDGILF 459
+ V F P T ST+D+ + + SS+ + Q+ T
Sbjct: 1201 ENSGMLEVNGFDPFKTEASTSDLKKKDENGMNRSSLQTTEPAGQVAITH----------- 1249
Query: 460 CDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSIS--SSDFNDPFEISVLPSNRESFRT 517
+ + P N Q S F +IS + D S L + + R
Sbjct: 1250 -SQSIASQVHPREQSNHQQQS----------FFNISGQTQDLMQKERGSGLGEQKNATRN 1298
Query: 518 GMPQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLR---RI 574
G + A K +GK K ++ +WD LR +
Sbjct: 1299 GTNEISSAPIKLKTKEQGKEKKDDF------------------------NWDSLRIDAQA 1334
Query: 575 YSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID 634
+ E++ + MDS+DW+AVRCAD SEIA+ IKERG N +A RIK FLNRLVE H SID
Sbjct: 1335 KAGKREKTENTMDSLDWDAVRCADVSEIAETIKERGMNNRLAERIKNFLNRLVEEHESID 1394
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
LEWLR+VPPDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQP
Sbjct: 1395 LEWLRDVPPDKAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1454
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
LP L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K PNC AC
Sbjct: 1455 LPESLQLHLLELYPVLESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKSKPNCNAC 1514
Query: 755 PLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN----PIPVIRIEAD 810
P+R EC+HFASA ASARFALPGP +K IV++ + I Q+P +++ P P + D
Sbjct: 1515 PMRAECRHFASAFASARFALPGPEQKSIVSTTGNSVINQNPSEIISQLHLPPPENTAQED 1574
Query: 811 PVS--------ESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRL 862
+ ES ++IN C+PIIEEPR+P P+C + ES+ D +D D EIPT+ L
Sbjct: 1575 EIQLTEVSRQLESKFEINICQPIIEEPRTPEPECLQ--ESQTDIEDAFYEDSSEIPTINL 1632
Query: 863 QEREFKENFPNFMEMNKVML--QDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
EF N N+M+ N + + S ALVAL +AAS+P KLK LRTEH VY LPD
Sbjct: 1633 NIEEFTLNLQNYMQENMELQGGEMSKALVALNPQAASIPMPKLKNVGRLRTEHCVYELPD 1692
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD--SKLCNSEICYSCNS 978
H LL+G+D R+PDDP YLLAIW+PGE+ NS+ P+ C+S++ +LCN + C+SCNS
Sbjct: 1693 THPLLQGWDTREPDDPGKYLLAIWTPGETANSIQPPESNCSSQEECGQLCNEKECFSCNS 1752
Query: 979 VCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRR 1038
E N+ IVRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S +PI+VPR I NL R
Sbjct: 1753 FREANSQIVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLNPISVPRSWIWNLNR 1812
Query: 1039 EIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLH 1088
V FG+S TT+F L ++ Q+ FWRG++CVRGFDR R PRPL+ RLH
Sbjct: 1813 RTVYFGTSVTTIFKGLTTQETQQCFWRGYICVRGFDREARAPRPLMARLH 1862
>gi|449456184|ref|XP_004145830.1| PREDICTED: uncharacterized protein LOC101216331 [Cucumis sativus]
Length = 1810
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/1041 (42%), Positives = 590/1041 (56%), Gaps = 107/1041 (10%)
Query: 106 KADGPSNVHNA-VVPY-----QVGPSSEH---------GALVPH--QIKEKRKKLKPEVV 148
+ +G ++VH ++PY + EH GA+VPH + +KK V
Sbjct: 789 QKEGTASVHEKQIIPYARKGGKKNSKHEHNPNSLDGMQGAIVPHPKSLNSTKKKEFGRVY 848
Query: 149 LDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRF 208
L+P + +W +++ D + K++ E+WW+ ER+VF GRI +F A MHLILGDRRF
Sbjct: 849 LEPRDITVWKVLIENDSNSEKEKID----EEWWENERKVFRGRINAFNAIMHLILGDRRF 904
Query: 209 KPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDH-----------TPSDE 257
PWKGSVVDSVVGV+LTQNVSD+LSSSAYMSLAA FPL T ++ T +E
Sbjct: 905 SPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLLETENYHGEEIFCIQQSTQRNE 964
Query: 258 NLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSR-IQRKALQ 316
L S S +N A D P+ EL DDA S+ Q +++
Sbjct: 965 RLFLRESKWNNDSMMETNKATGD------------PEEAKELMSADDAISSQGCQGSSIK 1012
Query: 317 ENGDIKVMTDAVPSQAFDTSSVQSLDRTQLFPTGNSKADVASSTKTSNAESFITQFSHTG 376
EN D+ +++ T ++LD T + K+ +SN +S ++ G
Sbjct: 1013 ENHDLTLLSSTCLEDDCGTCLFKNLDDTDNLALHSDKSTFEKEPYSSNQDSTLS----CG 1068
Query: 377 NLKKNSV-----------NQLFPTVNSKADVASPSQNHITQSSVTQFWPTGNSTADVASP 425
+ KKN N + +N V +P SS + + + A
Sbjct: 1069 SNKKNRTSEYKEVGWRDQNPISGNLNPSDTVHTPRSLGKCYSSAECISKSKSGLENNAED 1128
Query: 426 SKTCI------------KESSIAASTEIPQLENTALLQDKVDGILFCDECLDGYTKPTRI 473
S +C K S E Q + L+ D C E +G ++
Sbjct: 1129 SNSCEEMAVDLQFTPNEKSQGFIGSIEKFQNQEIQLIGDVNAKCSLCSESNEG-----KM 1183
Query: 474 DNGNQASTSGRND---LKSDFRSISSSDFNDPFEISVLPSNRES-FRTGMPQAHDATTTS 529
+ G+Q S+ N + D + S + SV+P++ + +
Sbjct: 1184 EAGSQFSSDIDNSSLLVDFDVERVQSDE-------SVVPASENTNKAKEKEKKEVKGYLE 1236
Query: 530 KKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSV 589
++P KE +D + K P+ + N++++ WD LRR + E RS D MDSV
Sbjct: 1237 DRNPNHLNDEKE--TDKGKAKKSKMKPEVDW-NSLREKWDSLRRKHPPCEPRSRDHMDSV 1293
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DWEAVR A+ ++IADAIKERGQ NIIAGRIK+FL+R LHG IDLEWLR+ PP VKEY
Sbjct: 1294 DWEAVRFAEPTKIADAIKERGQHNIIAGRIKQFLDRTARLHGCIDLEWLRHAPPKDVKEY 1353
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
LL+I+GLGLKSVEC+RLL+LQ +AFPVD+NVGRIAVRLGWVPL+PLP ++ +HLL+ +P+
Sbjct: 1354 LLEIDGLGLKSVECIRLLALQQVAFPVDINVGRIAVRLGWVPLEPLPEEVQMHLLETFPM 1413
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
MD IQ YLWPRL LDQ+TLYELHYQ+ITFGK+FC KR PNC ACPLR +C+H+ASA AS
Sbjct: 1414 MDSIQKYLWPRLSMLDQRTLYELHYQLITFGKVFCTKRKPNCNACPLRADCRHYASAYAS 1473
Query: 770 ARFALPGPSEKGIVTSEFGNGIGQSPPLVVNP-IPVIRIEADPVSESGYQINNCEPIIEE 828
AR ALPG EKGIV++ + N ++ I+A+P S + +NCEPI+E
Sbjct: 1474 ARLALPGLQEKGIVST-----MSSEKSFEGNTRASLLHIDANPFS-AECSTDNCEPIVEA 1527
Query: 829 PRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSAL 888
P SP P ES ++I +D D +++P +RL +F N ++ N + AL
Sbjct: 1528 PPSPEPAHDESQLTDI--EDLYEYDSDDVPIIRLSSGQFTTTSLNCVD-NSI-----GAL 1579
Query: 889 VALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGE 948
V L AS+P RKLK LRTEH VY LPD H LL +RRDP+DP PYLL I SPGE
Sbjct: 1580 VPLHPRVASIPMRKLKHIERLRTEHQVYELPDIHPLLSKLERRDPNDPCPYLLCILSPGE 1639
Query: 949 SPNSVGSPQYRCNSKD-SKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNG 1007
+ +S P C ++ ++C+ C SCN V ++N+G V GTILIPCRTA RG+FPLNG
Sbjct: 1640 TVDSCEPPNTICMYREIGEICSEGSCSSCNIVRQQNSGAVHGTILIPCRTAMRGKFPLNG 1699
Query: 1008 TYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGF 1067
TYFQVNEVFAD ETS +PI +PR I +L R I FG+S TT+F L+ IQ F +GF
Sbjct: 1700 TYFQVNEVFADDETSKNPIQIPREWIWDLPRRIAYFGTSTTTIFRGLEKEDIQYCFQKGF 1759
Query: 1068 VCVRGFDRRYRCPRPLVNRLH 1088
+CVRGF+RR R P+ L RLH
Sbjct: 1760 ICVRGFNRRTRTPKRLTERLH 1780
>gi|449530536|ref|XP_004172250.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus]
Length = 1176
Score = 700 bits (1806), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/1021 (42%), Positives = 590/1021 (57%), Gaps = 98/1021 (9%)
Query: 106 KADGPSNVHNA-VVPY-----QVGPSSEH---------GALVPH--QIKEKRKKLKPEVV 148
+ +G ++VH ++PY + EH GA+VPH + +KK V
Sbjct: 186 QKEGTASVHEKQIIPYARKGGKKNSKHEHNPNSLDGMQGAIVPHPKSLNSTKKKEFGRVY 245
Query: 149 LDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRF 208
L+P + +W +++ D + K++ E+WW+ ER+VF GRI +F A MHLILGDRRF
Sbjct: 246 LEPRDITVWKVLIENDSNSEKEKID----EEWWENERKVFRGRINAFNAIMHLILGDRRF 301
Query: 209 KPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDH-----------TPSDE 257
PWKGSVVDSVVGV+LTQNVSD+LSSSAYMSLAA FPL T ++ T +E
Sbjct: 302 SPWKGSVVDSVVGVFLTQNVSDHLSSSAYMSLAATFPLLETENYHGEEVFCIQQSTQRNE 361
Query: 258 NLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSR-IQRKALQ 316
L S S +N A D P+ EL DDA S+ Q +++
Sbjct: 362 RLFLCESKWNNDSMMETNKATGD------------PEEAKELMSADDAISSQGCQGSSIK 409
Query: 317 ENGDIKVMTDAVPSQAFDTSSVQSLDRTQLFPTGNSKADVASSTKTSNAESFITQFSHTG 376
EN D+ +++ T ++LD T + K+ +SN +S ++ S
Sbjct: 410 ENHDLTLLSSTCLEDDCGTCLCKNLDDTDNLALHSDKSTFEKEPYSSNQDSTLSCGS--- 466
Query: 377 NLKKNSVNQLFPTVNSK-ADVASPSQNHITQSSVTQFWPTGNSTADVASPSKTCIKESS- 434
N+ N+K A+ S S++ + ++ NS ++A + E S
Sbjct: 467 -------NKKIELQNTKSAECISKSKSGLENNA-----EDSNSCEEMAVDLQFTPNEKSQ 514
Query: 435 -IAASTEIPQLENTALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRND---LKSD 490
S E Q + L+ D C E +G +++ G+Q S+ N + D
Sbjct: 515 GFIGSIEKFQNQEIELIGDVNAKCSLCSESNEG-----KMEAGSQFSSDIDNSSLLVDFD 569
Query: 491 FRSISSSDFNDPFEISVLPSNRES-FRTGMPQAHDATTTSKKSPRGKGKSKEYKSDMKND 549
+ S + SV+P++ + + ++P KE +D
Sbjct: 570 VERVQSDE-------SVVPASENTNKAKEKEKKEVKGYLEDRNPNHLNDEKE--TDKGKA 620
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKER 609
+ K P+ + N++++ WD LRR + E RS D MDSVDWEAVR A+ ++IADAIKER
Sbjct: 621 KKSKMKPEVDW-NSLREKWDSLRRKHPPCEPRSRDHMDSVDWEAVRFAEPTKIADAIKER 679
Query: 610 GQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL 669
GQ NIIAGRIK+FL+R LHG IDLEWLR+ PP VKEYLL+I+GLGLKSVEC+RLL+L
Sbjct: 680 GQHNIIAGRIKQFLDRTARLHGCIDLEWLRHAPPKDVKEYLLEIDGLGLKSVECIRLLAL 739
Query: 670 QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTL 729
Q +AFPVD+NVGRIAVRLGWVPL+PLP ++ +HLL+ +P+MD IQ YLWPRL LDQ+TL
Sbjct: 740 QQVAFPVDINVGRIAVRLGWVPLEPLPEEVQMHLLETFPMMDSIQKYLWPRLSMLDQRTL 799
Query: 730 YELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGN 789
YELHYQ+ITFGK+FC KR PNC ACPLR +C+H+ASA ASAR ALPG EKGIV++
Sbjct: 800 YELHYQLITFGKVFCTKRKPNCNACPLRADCRHYASAYASARLALPGLQEKGIVST---- 855
Query: 790 GIGQSPPLVVNP-IPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDD 848
+ N ++ I+A+P S + +NCEPI+E P SP P ES ++I +D
Sbjct: 856 -MSSGKSFEGNTRASLLHIDANPFS-AECSTDNCEPIVEAPPSPEPAHDESQLTDI--ED 911
Query: 849 FSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAH 908
D +++P +RL +F N ++ + ALV L AS+P RKLK
Sbjct: 912 LYEYDSDDVPIIRLSSGQFTTTSLNCVDGS------IGALVPLHPRVASIPMRKLKHIER 965
Query: 909 LRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKL 967
LRTEH VY LPD H LL +RRDP+DP PYLL I SPGE+ +S P C ++ ++
Sbjct: 966 LRTEHQVYELPDIHPLLSKLERRDPNDPCPYLLCILSPGETVDSCEPPNTICMYREIGEI 1025
Query: 968 CNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPIN 1027
C+ C SCN V ++N+G V GTILIPCRTA RG+FPLNGTYFQVNEVFAD ETS +PI
Sbjct: 1026 CSEGSCSSCNIVRQQNSGAVHGTILIPCRTAMRGKFPLNGTYFQVNEVFADDETSKNPIQ 1085
Query: 1028 VPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRL 1087
+PR I +L R I FG+S TT+F L+ IQ F +GF+CVRGF+RR R P+ L RL
Sbjct: 1086 IPREWIWDLPRRIAYFGTSTTTIFRGLEKEDIQYCFQKGFICVRGFNRRTRTPKRLTERL 1145
Query: 1088 H 1088
H
Sbjct: 1146 H 1146
>gi|356546049|ref|XP_003541444.1| PREDICTED: protein ROS1-like [Glycine max]
Length = 1881
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/559 (59%), Positives = 408/559 (72%), Gaps = 23/559 (4%)
Query: 567 DWDLLR---RIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR + + E++ + MDS+DW+AVRCAD SEIA+ IKERG N +A RIK FL
Sbjct: 1295 DWDSLRIEAQAKAGKREKTDNTMDSLDWDAVRCADVSEIAETIKERGMNNRLADRIKNFL 1354
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLVE HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1355 NRLVEEHGSIDLEWLRDVPPDKAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRI 1414
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+F
Sbjct: 1415 AVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQETLYELHYQMITFGKVF 1474
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN--- 800
C K PNC ACP+RGEC+HFASA ASAR ALPGP +K IV++ + I Q+P +++
Sbjct: 1475 CTKSKPNCNACPMRGECRHFASAFASARLALPGPEQKSIVSTTGNSVIDQNPSEIISQLH 1534
Query: 801 -PIPVIRIEADPVS--------ESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSI 851
P P +AD + ES +IN C+PIIEEP +P P+C + +++I+D +
Sbjct: 1535 LPPPESTAQADDIQLTEVSRQLESKSEINICQPIIEEPTTPEPECLQVSQTDIEDAFYE- 1593
Query: 852 GDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAH 908
D+ EIPT+ L EF N N+M+ K+ LQ+ S ALVAL EAAS+P KLK +
Sbjct: 1594 -DLCEIPTINLNIEEFTMNLQNYMQ-EKMELQEAEMSKALVALNPEAASIPMPKLKNVSR 1651
Query: 909 LRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD--SK 966
LRTEH VY LPD H LL+G+D R+PDDP YLLAIW+PGE+ NS+ P+ +C+S++ +
Sbjct: 1652 LRTEHCVYELPDTHPLLQGWDTREPDDPGKYLLAIWTPGETANSIQPPESKCSSQEECGQ 1711
Query: 967 LCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPI 1026
LCN C+SCNS E N+ IVRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S +PI
Sbjct: 1712 LCNENECFSCNSFREANSQIVRGTLLIPCRTATRGSFPLNGTYFQVNEVFADHDSSLNPI 1771
Query: 1027 NVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNR 1086
+VPR I NL R V FG+S TT+F L ++IQ+ FWRG+VCVRGFDR R PRPL+ R
Sbjct: 1772 SVPRSWIWNLNRRTVYFGTSVTTIFKGLTTQEIQQCFWRGYVCVRGFDREARAPRPLMAR 1831
Query: 1087 LHCSPSKKEAGSKRSRDES 1105
LH SK +R++ ES
Sbjct: 1832 LHFPASKLAKTKERTKKES 1850
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 80/149 (53%), Positives = 102/149 (68%), Gaps = 5/149 (3%)
Query: 99 QLPLQCNKADGPSNVHNAVVPYQVGPSSEHG--ALVPHQIKEKRKKLKPEVVLDPGTLRM 156
QL L D + NA+VPY+ S HG +VP IK ++ L+P+V LD T R+
Sbjct: 772 QLNLNTEVRDLAFHEQNALVPYKQQNSLIHGDGVIVPFHIK--KQHLRPKVDLDDETDRV 829
Query: 157 WNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVV 216
W L++ +D + ++ED KWW++ER VF GR SF ARMHL+ GDRRF WKGSVV
Sbjct: 830 WKLLL-LDINSHGIDGTDEDKAKWWEEERNVFRGRADSFIARMHLVQGDRRFSRWKGSVV 888
Query: 217 DSVVGVYLTQNVSDNLSSSAYMSLAARFP 245
DSVVGV+LTQNV+D+LSSSA+MSLAARFP
Sbjct: 889 DSVVGVFLTQNVTDHLSSSAFMSLAARFP 917
>gi|225449724|ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera]
Length = 2198
Score = 677 bits (1746), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/534 (60%), Positives = 400/534 (74%), Gaps = 10/534 (1%)
Query: 567 DWDLLRR-IYSTGE--ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR+ + + G ERS D MDS+D+EA+RCA + I++AIKERG N++A RIK+FL
Sbjct: 1659 DWDSLRKQVQANGRKRERSKDTMDSLDYEAIRCAHVNVISEAIKERGMNNMLAERIKDFL 1718
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLV HGSIDLEWLR+ PPDK K+YLL I GLGLKSVECVRLL+L +AFPVD NVGRI
Sbjct: 1719 NRLVREHGSIDLEWLRDSPPDKAKDYLLSIRGLGLKSVECVRLLTLHQLAFPVDTNVGRI 1778
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQ+ITFGK+F
Sbjct: 1779 AVRLGWVPLQPLPESLQLHLLELYPMLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVF 1838
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC ACP+RGEC+HFASA ASAR ALP P EK IV+S + ++P +NPIP
Sbjct: 1839 CTKHKPNCNACPMRGECRHFASAFASARLALPAPEEKSIVSSTAPSVADRNPTAFINPIP 1898
Query: 804 VIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
+ +E++ + + + CEPIIE P +P PQC E+ ES+I+D + D +EIPT++L
Sbjct: 1899 LPSLESNLLGKEEQDTSKCEPIIEVPATPEPQCIETLESDIEDAFYE--DPDEIPTIKLN 1956
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
EF N N+M+ N + LQ+ S ALVAL +A S+PT KLK + LRTEH VY LPD
Sbjct: 1957 FEEFTLNLQNYMQEN-MELQEGDMSKALVALDPKATSIPTPKLKNVSRLRTEHQVYELPD 2015
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKLCNSEICYSCNSV 979
+H LL+G D R+PDDPSPYLLAIW+PGE+ NS P+ RC S++ KLCN + C+SCNS+
Sbjct: 2016 SHPLLKGMDIREPDDPSPYLLAIWTPGETANSSQPPERRCESQEPGKLCNEKTCFSCNSL 2075
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E N+ VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S +PI+VPR I NL R
Sbjct: 2076 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSSINPIDVPRAWIWNLPRR 2135
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
V FG+S T++F L IQ FWRGFVCVRGFD++ R PRPL+ RLH S +K
Sbjct: 2136 TVYFGTSVTSIFRGLPTEGIQYCFWRGFVCVRGFDQKSRAPRPLMARLHLSANK 2189
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 82/171 (47%), Positives = 111/171 (64%), Gaps = 21/171 (12%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKKL 143
II + K L + D ++ S NA++ Y+ GA++P++IK ++K
Sbjct: 1107 IIERLKSLELNDTSNETV---------SQEENAIILYK-----GDGAIIPYEIK--KRKP 1150
Query: 144 KPEVVLDPGTLRMWNLIMNIDDGTTKD-QTSNEDMEKWWQKEREVFEGRIQSFTARMHLI 202
+P+V LD T R+W L+M G +D S+E KWW++EREVF GR SF ARMHL+
Sbjct: 1151 RPKVDLDLETERVWKLLM----GAEQDVGDSDERKAKWWEEEREVFRGRADSFIARMHLV 1206
Query: 203 LGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHT 253
GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSL +RFPL ++ T
Sbjct: 1207 QGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLVSRFPLHPESNKT 1257
>gi|449512809|ref|XP_004164145.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus]
Length = 1736
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/552 (58%), Positives = 403/552 (73%), Gaps = 11/552 (1%)
Query: 557 KKNSDNTVQQDWDLLRR-IYSTGE--ERSHDKMDSVDWEAVRCADESEIADAIKERGQQN 613
K NS+ DWD LR+ + + G+ E+ D MDS+D+EA+R AD EI++AIKERG N
Sbjct: 1192 KVNSEKKGGMDWDSLRKQVEANGQIKEKGKDAMDSIDYEAIRLADVREISNAIKERGMNN 1251
Query: 614 IIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
++A RIKEFLNRLV HGSIDLEWLR+VPPDK K+YLL + GLGLKSVECVRLL+L H+A
Sbjct: 1252 MLAERIKEFLNRLVTDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLA 1311
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
FPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELH
Sbjct: 1312 FPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELH 1371
Query: 734 YQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQ 793
YQ+ITFGK+FC K PNC ACP+RGECKHFASA ASAR ALP P EKGIV S +
Sbjct: 1372 YQLITFGKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKGIVASTNPMSTEK 1431
Query: 794 SPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGD 853
PP+V NP+P++ E +E+ + CEPI+E P +P P+ +E ES+I+D + D
Sbjct: 1432 QPPIVTNPLPILPPEGSTYAENTSGPSKCEPIVEVPATPEPEPNEITESDIEDAFYE--D 1489
Query: 854 IEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEH 913
+EIPT++L EFK +++ + S ALVAL EAA +PT KLK + LRTEH
Sbjct: 1490 PDEIPTIKLSMEEFKTTLQHYIPEGDM----SKALVALNPEAAFIPTPKLKNVSRLRTEH 1545
Query: 914 HVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKLCNSEI 972
VY LPD+H LLR DRR+PDDPSPYLLAIW+PGE+ NS+ P+ C S+D ++LCN
Sbjct: 1546 QVYELPDSHPLLREMDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPNRLCNEIT 1605
Query: 973 CYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHT 1032
C++CNS E N+ VRGT+L+PCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR
Sbjct: 1606 CFTCNSRREANSQTVRGTLLVPCRTAMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRKW 1665
Query: 1033 IANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPS 1092
+ NL R V FG+S +T+F L +IQ+ FWRGFVCVRGFD++ R PRPL+ RLH P+
Sbjct: 1666 LWNLPRRTVYFGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFDQKTRAPRPLIARLHF-PA 1724
Query: 1093 KKEAGSKRSRDE 1104
K A K + E
Sbjct: 1725 SKLAKVKNGQTE 1736
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 113/311 (36%), Positives = 177/311 (56%), Gaps = 37/311 (11%)
Query: 83 DIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKE--KR 140
+I+ K K L++ D + + NA+VPY+ +GA+VP+ E ++
Sbjct: 654 EILHKMKRLSLNDHEVSI----------RSEQNAIVPYK-----GNGAVVPYVESEYLRK 698
Query: 141 KKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMH 200
+K +P V +DP T R+WNL+M +G+ ++ +D EKWW++ER+VF GR SF ARMH
Sbjct: 699 RKARPRVDIDPETERIWNLLMG-KEGSEGIESHEKDKEKWWEEERKVFRGRADSFIARMH 757
Query: 201 LILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLR 260
L+ GDRRF WKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP+KS + NLR
Sbjct: 758 LVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPVKSAS-------NLR 810
Query: 261 TTASLE-PIGSNSTSNGAVYDSEGNMYFVTEPEPDRC----CELKDRDDAFDSRIQRKAL 315
T +E I +N ++ +Y +E + V E R + ++ +S ++
Sbjct: 811 TQGEVETSIVANESAACVLYPAESIRWHVQELSVPRFEMPQTSINHQNQIANSGTEKIFT 870
Query: 316 QENGDIKVMTDAVPSQ-AFDTSSVQSLDRTQLFPTGNSKAD---VASSTKTSNAESF--I 369
+ G I V + + SQ +FD++ Q + NS+A+ V+ ++ +++ +F I
Sbjct: 871 ELGGQI-VEEEVISSQDSFDSTITQGTAGARSCSGSNSEAEEPIVSYNSSSTHYSNFTDI 929
Query: 370 TQFSHTGNLKK 380
Q T ++K
Sbjct: 930 KQMETTATIQK 940
>gi|330370553|gb|AEC12446.1| repressor of gene silencing 1 [Gossypium hirsutum]
Length = 1758
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/559 (57%), Positives = 406/559 (72%), Gaps = 24/559 (4%)
Query: 567 DWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR+ + E++ MDS+DWEAVRCA+ EIA+ IKERG N++A RIK+FL
Sbjct: 1203 DWDSLRKQTEVNGRKREKTERTMDSLDWEAVRCAEVHEIAETIKERGMNNVLAQRIKDFL 1262
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLV HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1263 NRLVRDHGSIDLEWLRDVPPDKAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRI 1322
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+F
Sbjct: 1323 AVRLGWVPLQPLPESLQLHLLELYPILESIQKYLWPRLCKLDQRTLYELHYQMITFGKVF 1382
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC ACP+RGEC+HFASA ASAR ALPGP EK IV++ ++P ++++ +
Sbjct: 1383 CTKGKPNCNACPMRGECRHFASAFASARLALPGPEEKSIVSATENGTSDRNPAVIIDQLA 1442
Query: 804 VIRIEADPVSESGYQ------------INNCEPIIEEPRSPGPQCSESPESEIDDDDFSI 851
+ +++ + + YQ +N C+PIIEEP SP P+C++ E++I +D FS
Sbjct: 1443 LPLPQSNELLDRNYQSEANQHLQAASTVNKCDPIIEEPASPEPECTQVAENDI-EDMFS- 1500
Query: 852 GDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAH 908
D +EIPT++L EF + N+M+ N + LQ+ S ALVALTAEAAS+PT +LK
Sbjct: 1501 EDPDEIPTIKLNMEEFTQTLQNYMQ-NNIELQEGDMSKALVALTAEAASIPTPRLKNVNR 1559
Query: 909 LRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKL 967
LRTEH VY LPD+H LL D+R+PDDP YLLAIW+PGE+ NS+ P+ RCNS++ KL
Sbjct: 1560 LRTEHQVYELPDSHPLLNELDKREPDDPCKYLLAIWTPGETANSIQQPERRCNSQEHGKL 1619
Query: 968 CNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPIN 1027
C+ E C+SCNS+ E + IVRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S +PI
Sbjct: 1620 CDDETCFSCNSIQEAESQIVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLNPIA 1679
Query: 1028 VPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRL 1087
VPR + NL R +V FG+S ++F L IQ FWRG+VCVRGFD++ R PRPL+ RL
Sbjct: 1680 VPREWLWNLPRRMVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARL 1739
Query: 1088 HCSPSK--KEAGSKRSRDE 1104
H S+ K G DE
Sbjct: 1740 HFPVSRLAKAKGKGAGEDE 1758
Score = 149 bits (377), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 85/174 (48%), Positives = 113/174 (64%), Gaps = 23/174 (13%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNV----HNAVVPYQVGPSSEHGALVPHQIKEK 139
++ +F L I GS + Q A P N+ HNA+V Y+ G +VP + +
Sbjct: 623 LLEQFNQLDINREGSAIACQEQNALVPYNMIYQEHNALVVYR------DGTIVPF-VPTR 675
Query: 140 RKKLKPEVVLDPGTLRMWNLIM-NID----DGTTKDQTSNEDMEKWWQKEREVFEGRIQS 194
+++ +P+V LD T R+W L++ NI+ DGT +E+ KWW ER VF GR S
Sbjct: 676 KRRPRPKVDLDEETNRVWKLLLENINSEGIDGT------DEEKAKWW-AERRVFSGRADS 728
Query: 195 FTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
F ARMHL+ GDRRF PWKGSV+DSV+GV+LTQNVSD+LSSSA+MSLAARFP+KS
Sbjct: 729 FIARMHLVQGDRRFSPWKGSVLDSVIGVFLTQNVSDHLSSSAFMSLAARFPIKS 782
>gi|449465553|ref|XP_004150492.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus]
Length = 1679
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/552 (58%), Positives = 403/552 (73%), Gaps = 11/552 (1%)
Query: 557 KKNSDNTVQQDWDLLRR-IYSTGE--ERSHDKMDSVDWEAVRCADESEIADAIKERGQQN 613
K NS+ DWD LR+ + + G+ E+ D MDS+D+EA+R AD EI++AIKERG N
Sbjct: 1135 KVNSEKKGGMDWDSLRKQVEANGQIKEKGKDAMDSIDYEAIRLADVREISNAIKERGMNN 1194
Query: 614 IIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
++A RIKEFLNRLV HGSIDLEWLR+VPPDK K+YLL + GLGLKSVECVRLL+L H+A
Sbjct: 1195 MLAERIKEFLNRLVTDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLA 1254
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
FPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELH
Sbjct: 1255 FPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELH 1314
Query: 734 YQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQ 793
YQ+ITFGK+FC K PNC ACP+RGECKHFASA ASAR ALP P EKGIV S +
Sbjct: 1315 YQLITFGKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKGIVASTNPMSTEK 1374
Query: 794 SPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGD 853
PP+V NP+P++ E +E+ + CEPI+E P +P P+ +E ES+I+D + D
Sbjct: 1375 QPPIVTNPLPILPPEGSTYAENTSGPSKCEPIVEVPATPEPEPNEITESDIEDAFYE--D 1432
Query: 854 IEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEH 913
+EIPT++L EFK +++ + S ALVAL EAA +PT KLK + LRTEH
Sbjct: 1433 PDEIPTIKLSMEEFKTTLQHYIPEGDM----SKALVALNPEAAFIPTPKLKNVSRLRTEH 1488
Query: 914 HVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKLCNSEI 972
VY LPD+H LLR DRR+PDDPSPYLLAIW+PGE+ NS+ P+ C S+D ++LCN
Sbjct: 1489 QVYELPDSHPLLREMDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPNRLCNEIT 1548
Query: 973 CYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHT 1032
C++CNS E N+ VRGT+L+PCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR
Sbjct: 1549 CFTCNSRREANSQTVRGTLLVPCRTAMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRKW 1608
Query: 1033 IANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPS 1092
+ NL R V FG+S +T+F L +IQ+ FWRGFVCVRGFD++ R PRPL+ RLH P+
Sbjct: 1609 LWNLPRRTVYFGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFDQKTRAPRPLIARLHF-PA 1667
Query: 1093 KKEAGSKRSRDE 1104
K A K + E
Sbjct: 1668 SKLAKVKNGQTE 1679
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 113/312 (36%), Positives = 178/312 (57%), Gaps = 37/312 (11%)
Query: 83 DIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKE--KR 140
+I+ K K L++ D + + NA+VPY+ +GA+VP+ E ++
Sbjct: 597 EILHKMKRLSLNDHEVSI----------RSEQNAIVPYK-----GNGAVVPYVESEYLRK 641
Query: 141 KKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMH 200
+K +P V +DP T R+WNL+M +G+ ++ +D EKWW++ER+VF GR SF ARMH
Sbjct: 642 RKARPRVDIDPETERIWNLLMG-KEGSEGIESHEKDKEKWWEEERKVFRGRADSFIARMH 700
Query: 201 LILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLR 260
L+ GDRRF WKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP+KS + NLR
Sbjct: 701 LVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPVKSAS-------NLR 753
Query: 261 TTASLE-PIGSNSTSNGAVYDSEGNMYFVTEPEPDRC----CELKDRDDAFDSRIQRKAL 315
T +E I +N ++ +Y +E + V E R + ++ +S ++
Sbjct: 754 TQGEVETSIVANESAACVLYPAESIRWHVQELSVPRFEMPQTSINHQNQIANSGTEKIFT 813
Query: 316 QENGDIKVMTDAVPSQ-AFDTSSVQSLDRTQLFPTGNSKAD---VASSTKTSNAESF--I 369
+ G I V + + SQ +FD++ Q + NS+A+ V+ ++ +++ +F I
Sbjct: 814 ELGGQI-VEEEVISSQDSFDSTITQGTAGARSCSGSNSEAEEPIVSYNSSSTHYSNFTDI 872
Query: 370 TQFSHTGNLKKN 381
Q T ++K+
Sbjct: 873 KQMETTATIQKS 884
>gi|296090400|emb|CBI40219.3| unnamed protein product [Vitis vinifera]
Length = 1621
Score = 674 bits (1740), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/534 (60%), Positives = 400/534 (74%), Gaps = 10/534 (1%)
Query: 567 DWDLLRR-IYSTGE--ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR+ + + G ERS D MDS+D+EA+RCA + I++AIKERG N++A RIK+FL
Sbjct: 1082 DWDSLRKQVQANGRKRERSKDTMDSLDYEAIRCAHVNVISEAIKERGMNNMLAERIKDFL 1141
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLV HGSIDLEWLR+ PPDK K+YLL I GLGLKSVECVRLL+L +AFPVD NVGRI
Sbjct: 1142 NRLVREHGSIDLEWLRDSPPDKAKDYLLSIRGLGLKSVECVRLLTLHQLAFPVDTNVGRI 1201
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQ+ITFGK+F
Sbjct: 1202 AVRLGWVPLQPLPESLQLHLLELYPMLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVF 1261
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC ACP+RGEC+HFASA ASAR ALP P EK IV+S + ++P +NPIP
Sbjct: 1262 CTKHKPNCNACPMRGECRHFASAFASARLALPAPEEKSIVSSTAPSVADRNPTAFINPIP 1321
Query: 804 VIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
+ +E++ + + + CEPIIE P +P PQC E+ ES+I+D + D +EIPT++L
Sbjct: 1322 LPSLESNLLGKEEQDTSKCEPIIEVPATPEPQCIETLESDIEDAFYE--DPDEIPTIKLN 1379
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
EF N N+M+ N + LQ+ S ALVAL +A S+PT KLK + LRTEH VY LPD
Sbjct: 1380 FEEFTLNLQNYMQEN-MELQEGDMSKALVALDPKATSIPTPKLKNVSRLRTEHQVYELPD 1438
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKLCNSEICYSCNSV 979
+H LL+G D R+PDDPSPYLLAIW+PGE+ NS P+ RC S++ KLCN + C+SCNS+
Sbjct: 1439 SHPLLKGMDIREPDDPSPYLLAIWTPGETANSSQPPERRCESQEPGKLCNEKTCFSCNSL 1498
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E N+ VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S +PI+VPR I NL R
Sbjct: 1499 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSSINPIDVPRAWIWNLPRR 1558
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
V FG+S T++F L IQ FWRGFVCVRGFD++ R PRPL+ RLH S +K
Sbjct: 1559 TVYFGTSVTSIFRGLPTEGIQYCFWRGFVCVRGFDQKSRAPRPLMARLHLSANK 1612
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 82/171 (47%), Positives = 111/171 (64%), Gaps = 21/171 (12%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKKL 143
II + K L + D ++ S NA++ Y+ GA++P++IK ++K
Sbjct: 782 IIERLKSLELNDTSNETV---------SQEENAIILYKGD-----GAIIPYEIK--KRKP 825
Query: 144 KPEVVLDPGTLRMWNLIMNIDDGTTKD-QTSNEDMEKWWQKEREVFEGRIQSFTARMHLI 202
+P+V LD T R+W L+M G +D S+E KWW++EREVF GR SF ARMHL+
Sbjct: 826 RPKVDLDLETERVWKLLM----GAEQDVGDSDERKAKWWEEEREVFRGRADSFIARMHLV 881
Query: 203 LGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHT 253
GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSL +RFPL ++ T
Sbjct: 882 QGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLVSRFPLHPESNKT 932
>gi|138996985|dbj|BAF52855.1| repressor of silencing 1 [Nicotiana tabacum]
Length = 1796
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/585 (56%), Positives = 405/585 (69%), Gaps = 45/585 (7%)
Query: 548 NDRTKKTTPKKNSDNTVQQDWDLLR-RIYSTGE--ERSHDKMDSVDWEAVRCADESEIAD 604
N + KK P K N DWD LR + + G+ ER+ + MDS+DWEAVRCAD +EIA
Sbjct: 1230 NTKAKKERPGKEKQNV---DWDSLRLQAQNNGKKRERTANTMDSLDWEAVRCADVNEIAH 1286
Query: 605 AIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECV 664
I+ERG N++A RIK+FLNR+ HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECV
Sbjct: 1287 TIRERGMNNMLAERIKDFLNRIFREHGSIDLEWLRDVPPDKAKEYLLSIRGLGLKSVECV 1346
Query: 665 RLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYL 724
RLL+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ IQ YLWPRLC L
Sbjct: 1347 RLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKL 1406
Query: 725 DQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVT 784
DQ+TLYELHY MITFGK+FC K PNC ACPLRGEC+HFASA ASAR ALP P EK IV+
Sbjct: 1407 DQRTLYELHYHMITFGKVFCTKSKPNCNACPLRGECRHFASAFASARLALPAPEEKSIVS 1466
Query: 785 SEFGNGIGQSP---------PL------------VVNPIPVIRIEADPVSESGYQINNCE 823
+ GQ+P PL ++N P+I + A P E
Sbjct: 1467 ATENKAAGQNPFQNFSQLLLPLPQADQTPLEHSKLINSAPIIEVPATP-----------E 1515
Query: 824 PIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQ 883
PI+EEP SP P+ +PE +I+D F D EIPT+ L EF +N FME N + Q
Sbjct: 1516 PIVEEPASPEPE-QNAPEVDIEDAYFE--DPNEIPTITLNMAEFTQNVKKFMENNMELQQ 1572
Query: 884 --DSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLL 941
S ALVALT EAAS+P KLK + LRTEH VY L D+H LL GFD+R+PDDP YLL
Sbjct: 1573 VEMSKALVALTPEAASIPVPKLKHISRLRTEHQVYELSDSHPLLEGFDKREPDDPCSYLL 1632
Query: 942 AIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANR 1000
AIW+PGE+ +S+ P +CNS+++ +LC+ E C++CNS+ E ++ VRGTILIPCRTA R
Sbjct: 1633 AIWTPGETADSIHPPAIKCNSQEAGRLCDDETCFACNSLREAHSQTVRGTILIPCRTAMR 1692
Query: 1001 GRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQ 1060
G FPLNGTYFQVNEVFADH++S +PI+VPR + NL R V FG+S T+F L IQ
Sbjct: 1693 GSFPLNGTYFQVNEVFADHDSSLNPIDVPRDWLWNLPRRTVYFGTSIPTIFKGLTTESIQ 1752
Query: 1061 EYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSRDES 1105
FWRGFVCVRGFD++ R PRPL+ RLH P+ + + +K DE+
Sbjct: 1753 HCFWRGFVCVRGFDKKTRAPRPLMARLHF-PASRLSRTKGKPDEN 1796
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 76/135 (56%), Positives = 97/135 (71%), Gaps = 7/135 (5%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIKE-KRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTS 173
NA+V Y+ G++VP +R+K +P+V LD T R+W L++ + D T
Sbjct: 737 NALVIYR-----RDGSIVPFAGSFIRRRKPRPKVDLDDETTRVWKLLLQDINSEGIDGT- 790
Query: 174 NEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLS 233
+ED KWW++ER VF GR+ SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LS
Sbjct: 791 DEDKAKWWEEERGVFHGRVDSFVARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLS 850
Query: 234 SSAYMSLAARFPLKS 248
SSA+MSLAA+FPLKS
Sbjct: 851 SSAFMSLAAQFPLKS 865
>gi|225440992|ref|XP_002277401.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera]
Length = 1942
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/551 (58%), Positives = 399/551 (72%), Gaps = 11/551 (1%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAI 606
+ KK ++ NT+ WD LR+ + ER+ + MDS+DWEAVRC+D +EIA+ I
Sbjct: 1385 KAKKGKARREEKNTLH--WDNLRKEAQVNGRKRERTVNTMDSLDWEAVRCSDVNEIANTI 1442
Query: 607 KERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
KERG N++A RIK+FLNRLV HGSIDLEWLR+VPPDK KEYLL GLGLKSVECVRL
Sbjct: 1443 KERGMNNMLAERIKDFLNRLVRDHGSIDLEWLRDVPPDKAKEYLLSFRGLGLKSVECVRL 1502
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
L+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ
Sbjct: 1503 LTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQ 1562
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSE 786
+TLYELHYQMITFGK+FC K PNC ACP+RGEC+HFASA ASAR AL GP E+ IV++
Sbjct: 1563 RTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLALTGPEERSIVSTN 1622
Query: 787 FGNGIGQSPPLVVNPIPVIRIEADP-VSESGYQINNCEPIIEEPRSPGPQCSESPESEID 845
+ +P + +NP+P+ SE+ INNCEPI+E P +P + + ES+I+
Sbjct: 1623 ANESMDGNPDVTINPLPLPPPLPQKQSSEANPGINNCEPIVEVPATPEQEHPQILESDIE 1682
Query: 846 DDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD--SSALVALTAEAASVPTRKL 903
D + D +EIPT++L EF N N+M+ N + + S ALVALT E AS+P KL
Sbjct: 1683 DTLYE--DPDEIPTIKLNIEEFTHNLQNYMQRNMELQESDMSKALVALTPEVASIPMPKL 1740
Query: 904 KRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSK 963
K + LRTEHHVY LPD+H LL G D+R+PDDP YLLAIW+PGE+ NS+ P+ C+S+
Sbjct: 1741 KNVSRLRTEHHVYELPDSHPLLEGLDKREPDDPCSYLLAIWTPGETANSIQPPERTCSSQ 1800
Query: 964 DS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETS 1022
+S LC+ + C+SCNS+ E N+ VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S
Sbjct: 1801 ESGGLCDEKTCFSCNSIREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSS 1860
Query: 1023 HHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRP 1082
+PI+VPR I NL R V FG+S T+F L IQ FWRGFVCVRGFD++ R PRP
Sbjct: 1861 LNPIDVPRAWIWNLPRRTVYFGTSIPTIFKGLSTEDIQYCFWRGFVCVRGFDQKTRAPRP 1920
Query: 1083 LVNRLHCSPSK 1093
L+ RLH S+
Sbjct: 1921 LMARLHFPASR 1931
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/181 (48%), Positives = 113/181 (62%), Gaps = 16/181 (8%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNVH----NAVVPYQVGPSSEHGALVPHQIK-- 137
II + K L I + S++ Q A P N++ NA+V Y+ G +VP +
Sbjct: 822 IIEQLKHLDI-NRESKISYQEQNALVPYNMNKEEKNALVLYK-----RDGTIVPFEDSFG 875
Query: 138 -EKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 196
K+++ +P V LD T R+W L+M + D T +E+ KWW++ER VF GR SF
Sbjct: 876 LVKKRRPRPRVDLDEETSRVWKLLMGNINSEGIDGT-DEEKAKWWEEERNVFRGRADSFI 934
Query: 197 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSD 256
ARMHL+ GDRRF WKGSVVDSVVGV+LTQNVSD+LSSSA+MSLAA FP K +H PS
Sbjct: 935 ARMHLVQGDRRFSKWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLAAHFPCKC--NHRPST 992
Query: 257 E 257
E
Sbjct: 993 E 993
>gi|297740062|emb|CBI30244.3| unnamed protein product [Vitis vinifera]
Length = 1470
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/551 (58%), Positives = 399/551 (72%), Gaps = 11/551 (1%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAI 606
+ KK ++ NT+ WD LR+ + ER+ + MDS+DWEAVRC+D +EIA+ I
Sbjct: 913 KAKKGKARREEKNTLH--WDNLRKEAQVNGRKRERTVNTMDSLDWEAVRCSDVNEIANTI 970
Query: 607 KERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
KERG N++A RIK+FLNRLV HGSIDLEWLR+VPPDK KEYLL GLGLKSVECVRL
Sbjct: 971 KERGMNNMLAERIKDFLNRLVRDHGSIDLEWLRDVPPDKAKEYLLSFRGLGLKSVECVRL 1030
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
L+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ
Sbjct: 1031 LTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQ 1090
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSE 786
+TLYELHYQMITFGK+FC K PNC ACP+RGEC+HFASA ASAR AL GP E+ IV++
Sbjct: 1091 RTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLALTGPEERSIVSTN 1150
Query: 787 FGNGIGQSPPLVVNPIPVIRIEADP-VSESGYQINNCEPIIEEPRSPGPQCSESPESEID 845
+ +P + +NP+P+ SE+ INNCEPI+E P +P + + ES+I+
Sbjct: 1151 ANESMDGNPDVTINPLPLPPPLPQKQSSEANPGINNCEPIVEVPATPEQEHPQILESDIE 1210
Query: 846 DDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD--SSALVALTAEAASVPTRKL 903
D + D +EIPT++L EF N N+M+ N + + S ALVALT E AS+P KL
Sbjct: 1211 DTLYE--DPDEIPTIKLNIEEFTHNLQNYMQRNMELQESDMSKALVALTPEVASIPMPKL 1268
Query: 904 KRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSK 963
K + LRTEHHVY LPD+H LL G D+R+PDDP YLLAIW+PGE+ NS+ P+ C+S+
Sbjct: 1269 KNVSRLRTEHHVYELPDSHPLLEGLDKREPDDPCSYLLAIWTPGETANSIQPPERTCSSQ 1328
Query: 964 DS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETS 1022
+S LC+ + C+SCNS+ E N+ VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S
Sbjct: 1329 ESGGLCDEKTCFSCNSIREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSS 1388
Query: 1023 HHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRP 1082
+PI+VPR I NL R V FG+S T+F L IQ FWRGFVCVRGFD++ R PRP
Sbjct: 1389 LNPIDVPRAWIWNLPRRTVYFGTSIPTIFKGLSTEDIQYCFWRGFVCVRGFDQKTRAPRP 1448
Query: 1083 LVNRLHCSPSK 1093
L+ RLH S+
Sbjct: 1449 LMARLHFPASR 1459
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/181 (48%), Positives = 113/181 (62%), Gaps = 16/181 (8%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNVH----NAVVPYQVGPSSEHGALVPHQIK-- 137
II + K L I + S++ Q A P N++ NA+V Y+ G +VP +
Sbjct: 556 IIEQLKHLDI-NRESKISYQEQNALVPYNMNKEEKNALVLYK-----RDGTIVPFEDSFG 609
Query: 138 -EKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 196
K+++ +P V LD T R+W L+M + D T +E+ KWW++ER VF GR SF
Sbjct: 610 LVKKRRPRPRVDLDEETSRVWKLLMGNINSEGIDGT-DEEKAKWWEEERNVFRGRADSFI 668
Query: 197 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSD 256
ARMHL+ GDRRF WKGSVVDSVVGV+LTQNVSD+LSSSA+MSLAA FP K +H PS
Sbjct: 669 ARMHLVQGDRRFSKWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLAAHFPCKC--NHRPST 726
Query: 257 E 257
E
Sbjct: 727 E 727
>gi|218187756|gb|EEC70183.1| hypothetical protein OsI_00912 [Oryza sativa Indica Group]
Length = 1952
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/576 (56%), Positives = 397/576 (68%), Gaps = 28/576 (4%)
Query: 549 DRTKKTTPKKNSDNTVQQDWDLLRR--IYSTG-EERSHDKMDSVDWEAVRCADESEIADA 605
D +K P+ + DWD+LR+ +YS G +ERS + DS+DWE +R A+ EI+D
Sbjct: 1384 DGSKAKRPRVGAGKKKTYDWDMLRKEVLYSHGNKERSQNAKDSIDWETIRQAEVKEISDT 1443
Query: 606 IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
I+ERG N++A RIK+FLNRLV HGSIDLEWLR V DK K+YLL I GLGLKSVECVR
Sbjct: 1444 IRERGMNNMLAERIKDFLNRLVRDHGSIDLEWLRYVDSDKAKDYLLSIRGLGLKSVECVR 1503
Query: 666 LLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLD 725
LL+L H+AFPVD NVGRI VRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LD
Sbjct: 1504 LLTLHHMAFPVDTNVGRICVRLGWVPLQPLPESLQLHLLEMYPMLENIQKYLWPRLCKLD 1563
Query: 726 QKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS 785
Q+TLYELHYQMITFGK+FC K PNC ACP+R ECKHFASA ASAR ALPGP EK +VTS
Sbjct: 1564 QRTLYELHYQMITFGKVFCTKSKPNCNACPMRAECKHFASAFASARLALPGPEEKSLVTS 1623
Query: 786 EFGNGIGQSPPLVVNPIPVIRIEADPV--------SESGYQINNCEPIIEEPRSPGPQCS 837
P+ I + PV + + +NN +PIIEEP SP P+
Sbjct: 1624 --------GTPIAAETFHQTYISSRPVVSQLEWNSNTCHHGMNNRQPIIEEPASPEPEHE 1675
Query: 838 ESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAE 894
E +D + D EEIPT++L EF +N ++M+ N + ++D S ALVA+T E
Sbjct: 1676 TEEMKECAIEDSFVDDPEEIPTIKLNFEEFTQNLKSYMQANNIEIEDADMSKALVAITPE 1735
Query: 895 AASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVG 954
AS+PT KLK + LRTEH VY LPD+H LL GF++R+PDDP PYLL+IW+PGE+ S
Sbjct: 1736 VASIPTPKLKNVSRLRTEHQVYELPDSHPLLEGFNQREPDDPCPYLLSIWTPGETAQSTD 1795
Query: 955 SPQYRCNSKDS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVN 1013
+P+ CNS+++ +LC S C+SCNS+ E A VRGT+LIPCRTA RG FPLNGTYFQVN
Sbjct: 1796 APKSVCNSQENGELCASNTCFSCNSIREAQAQKVRGTLLIPCRTAMRGSFPLNGTYFQVN 1855
Query: 1014 EVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGF 1073
EVFADH++S +PI+VPR I NL R V FG+S T+F L +IQ FWRGFVCVRGF
Sbjct: 1856 EVFADHDSSRNPIDVPRSWIWNLPRRTVYFGTSIPTIFKGLTTEEIQHCFWRGFVCVRGF 1915
Query: 1074 DRRYRCPRPLVNRLHCSPS-----KKEAGSKRSRDE 1104
DR R PRPL RLH S KK AGS RD+
Sbjct: 1916 DRTSRAPRPLYARLHFPASKITRNKKSAGSAPGRDD 1951
Score = 137 bits (345), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 110/164 (67%), Gaps = 7/164 (4%)
Query: 102 LQCNKADGPSNV--HNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNL 159
L NK++ P H A+VPY + E G +VP + K KRK+ + +V LDP T MW L
Sbjct: 824 LDINKSEDPVTAEPHGALVPY----NGEFGPIVPFEGKVKRKRSRAKVDLDPVTALMWKL 879
Query: 160 IMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSV 219
+M D + ++D EKW +ER++F+GR+ SF ARMHL+ GDRRF PWKGSVVDSV
Sbjct: 880 LMGPDMSDCAEGM-DKDKEKWLNEERKIFQGRVDSFIARMHLVQGDRRFSPWKGSVVDSV 938
Query: 220 VGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTA 263
VGV+LTQNVSD+LSSSA+M+LAA+FP+K P++ T +
Sbjct: 939 VGVFLTQNVSDHLSSSAFMALAAKFPVKPEASEKPANVMFHTIS 982
>gi|449440467|ref|XP_004138006.1| PREDICTED: uncharacterized protein LOC101222935 [Cucumis sativus]
Length = 1849
Score = 650 bits (1678), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/583 (56%), Positives = 410/583 (70%), Gaps = 25/583 (4%)
Query: 540 KEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRRIYSTG---EERSHDKMDSVDWEAVRC 596
KE K ++K+ +K++ N WD LR+ T +R+ D MDS+DWEA+RC
Sbjct: 1274 KELKPASSRSQSKQVAKEKDNIN-----WDNLRKRTETNGKTRQRTEDTMDSLDWEAIRC 1328
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +EIA AI+ERG N++A RIK+FLNRLV+ HGSIDLEWLR+V PD+ KEYLL I GL
Sbjct: 1329 ADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKEYLLSIRGL 1388
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
GLKSVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ IQ Y
Sbjct: 1389 GLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKY 1448
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPG 776
LWPRLC LDQ+TLYELHYQMITFGK+FC K PNC ACP+RGEC+HFASA ASAR LP
Sbjct: 1449 LWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLGLPA 1508
Query: 777 PSEKGIVTS----EFGNGIGQS---PPLVVNPIPVIRIEADPV----SESGYQINNCEPI 825
P +K IV++ E N ++ P L + P + +E P S+ C PI
Sbjct: 1509 PEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSVEIKPSESHQSDGKTTAGACVPI 1568
Query: 826 IEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD- 884
IEEP +P Q + + ++ ID +D D +EIPT++L EF +N N+++ N + LQ+
Sbjct: 1569 IEEPATP-EQETATQDAIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQKN-MELQEG 1626
Query: 885 --SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLA 942
S AL+ALT EAAS+PT KLK + LRTEH VY LPD H LL DRR+PDDPS YLLA
Sbjct: 1627 DMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLDRREPDDPSSYLLA 1686
Query: 943 IWSPGESPNSVGSPQYRCNSKD-SKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRG 1001
IW+PGE+ NS+ P+ RC+S++ +LC E C SCNSV E N+ +VRGT+LIPCRTA RG
Sbjct: 1687 IWTPGETANSIQLPEKRCSSQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAMRG 1746
Query: 1002 RFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQE 1061
FPLNGTYFQVNEVFADHE+S +PI+VPR I NL R V FG+S T+F L + IQ
Sbjct: 1747 SFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGIQH 1806
Query: 1062 YFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSRDE 1104
FWRGFVCVRGFD++ R PRPL+ RLH SK G ++ D+
Sbjct: 1807 CFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1849
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/135 (57%), Positives = 96/135 (71%), Gaps = 7/135 (5%)
Query: 114 HNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTS 173
HNA+V Y G +VP +KR+ +P+V LD T R+W L+M + D T
Sbjct: 786 HNAIVVY-----GRDGTIVPFNPIKKRRP-RPKVELDEETGRVWKLLMGNINSKGIDGTD 839
Query: 174 NEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLS 233
E++ KWW++ER+VF+GR SF ARMHL+ GDRRF WKGSVVDSVVGV+LTQNVSD+LS
Sbjct: 840 EENI-KWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLS 898
Query: 234 SSAYMSLAARFPLKS 248
SSA+MSLAARFP KS
Sbjct: 899 SSAFMSLAARFPPKS 913
>gi|356533903|ref|XP_003535497.1| PREDICTED: transcriptional activator DEMETER-like [Glycine max]
Length = 1915
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/558 (56%), Positives = 400/558 (71%), Gaps = 13/558 (2%)
Query: 543 KSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADE 599
KS +K + +K P+ ++ + DWD LR+ + T +ERS D MDS+D+EAVRCA
Sbjct: 1356 KSQIKVSKARKEKPE--TEKKLASDWDKLRKEVQVNGTEKERSMDTMDSLDYEAVRCASV 1413
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IKERG N++A RIKEFL+RLV HGSIDLEWLR+VP DK K++LL GLGLK
Sbjct: 1414 KEISKTIKERGMNNLLAERIKEFLDRLVTEHGSIDLEWLRHVPQDKAKDFLLSFRGLGLK 1473
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
SVECVRLL+LQ+IAFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ +Q YLWP
Sbjct: 1474 SVECVRLLTLQNIAFPVDTNVGRIAVRLGWVPLQPLPEALQLHLLELYPVLEAVQKYLWP 1533
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSE 779
RLC LDQ+TLYELHYQMITFGK+FC K+ P C ACP+R EC+HFASA ASAR ALPGP E
Sbjct: 1534 RLCKLDQRTLYELHYQMITFGKVFCTKKKPKCNACPMRAECRHFASAFASARLALPGPEE 1593
Query: 780 KGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSES 839
K IV+ ++ + NP+ + +E + + Q CEPIIEEP +P + +E+
Sbjct: 1594 KHIVSMHVPIAAERNYFVNENPMVLPLLENNLSRQVSPQSWQCEPIIEEPATPEREWTEA 1653
Query: 840 PESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFM----EMNKVMLQDSSALVALTAEA 895
ES++ +DF D +EI ++ L ++ N N++ E N+ + S ALVAL +
Sbjct: 1654 EESDM--EDFFKEDSDEILSIDLNAKKSTVNVQNYLQEYNEHNEGCM--SKALVALNPRS 1709
Query: 896 ASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGS 955
AS+PT KLK + LRTEH VY LPD+H LL D+R+PDDPSPYLLAIW+PGE+PNSV
Sbjct: 1710 ASIPTPKLKNVSRLRTEHQVYELPDSHPLLEKMDKREPDDPSPYLLAIWTPGETPNSVEP 1769
Query: 956 PQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEV 1015
P+ RC S+DS LCN C+SCNS+ E N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+
Sbjct: 1770 PERRCGSQDSALCNDNTCFSCNSIREANSQTVRGTLLIPCRTATRGSFPLNGTYFQVNEL 1829
Query: 1016 FADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDR 1075
FADH +S PI++PR I NL R V FG+S +++F L ++IQ FWRGFVCVRGFD+
Sbjct: 1830 FADHASSVQPIDIPREWIWNLPRRTVYFGTSVSSIFKDLSTQEIQHCFWRGFVCVRGFDQ 1889
Query: 1076 RYRCPRPLVNRLHCSPSK 1093
+ R PRPL RLH S S+
Sbjct: 1890 KERAPRPLQARLHFSASR 1907
Score = 148 bits (374), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/179 (48%), Positives = 112/179 (62%), Gaps = 14/179 (7%)
Query: 83 DIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKK 142
+II + DL +R+ ++ KA P N +VVPYQ E L K+ K
Sbjct: 821 EIICRPNDLKLRESNMS-EMEGLKALVPYNGDRSVVPYQ-----EFELL-------KKHK 867
Query: 143 LKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLI 202
+P+V LD T R W L+M G+ + ++++ EKWW KER VF GR+ SF ARMHLI
Sbjct: 868 PRPKVDLDAETERTWKLLMG-KVGSEGLEETDKEKEKWWDKERNVFHGRVDSFIARMHLI 926
Query: 203 LGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRT 261
GDRRF WKGSVVDSV+GV+LTQNVSD+LSSSA+MSLA+RFPL+S + D + T
Sbjct: 927 QGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLASRFPLQSKSSKKTYDVDTNT 985
>gi|356574657|ref|XP_003555462.1| PREDICTED: transcriptional activator DEMETER-like [Glycine max]
Length = 1851
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/557 (57%), Positives = 395/557 (70%), Gaps = 10/557 (1%)
Query: 543 KSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADE 599
KS +K + +K P+ ++ DWD LR+ I +ERS D MDS+D+EA+RCA
Sbjct: 1279 KSKIKVSKERKAKPE--TEKKHASDWDKLRKEVQINRIEKERSTDTMDSLDYEAIRCASV 1336
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IKERG N++A RIKEFLNRLV HGSIDLEWLR+VP DK K+YLL GLGLK
Sbjct: 1337 KEISKTIKERGMNNMLAERIKEFLNRLVTEHGSIDLEWLRHVPQDKAKDYLLSFRGLGLK 1396
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
SVECVRLL+L HIAFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ +Q YLWP
Sbjct: 1397 SVECVRLLTLHHIAFPVDTNVGRIAVRLGWVPLQPLPEALQLHLLELYPVLETVQKYLWP 1456
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSE 779
RLC LDQ+TLYELHYQMITFGK+FC K+ PNC ACP+R EC+HFASA ASAR ALPGP E
Sbjct: 1457 RLCKLDQRTLYELHYQMITFGKVFCTKKKPNCNACPMRAECRHFASAFASARLALPGPEE 1516
Query: 780 KGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSES 839
K IV+ ++ + NP+ + +E + + + CEPIIEEP +P + E+
Sbjct: 1517 KRIVSMHVPIATERNYFVNENPMVLPLLENNLSRQVNPESWQCEPIIEEPATPEREWKEA 1576
Query: 840 PESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD--SSALVALTAEAAS 897
ES++ +DF D +EI ++ L +E N N ++ K + S ALVAL E+AS
Sbjct: 1577 EESDM--EDFLKVDSDEILSIGLNAKESTVNVQNHLQEYKEHNEGDMSKALVALNPESAS 1634
Query: 898 VPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQ 957
+PT KLK + LRTEH VY LPD+H LL D+R+PDDPSPYLLAIW+PGE+PNSV P+
Sbjct: 1635 IPTPKLKNVSRLRTEHQVYELPDSHPLLEKMDKREPDDPSPYLLAIWTPGETPNSVEPPE 1694
Query: 958 YRCNSKDS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVF 1016
RC S+DS LCN C+SCNS+ E N+ VRGTILIPCRTA RG FPLNGTYFQVNE+F
Sbjct: 1695 RRCESQDSANLCNDSTCFSCNSIREANSQTVRGTILIPCRTATRGSFPLNGTYFQVNELF 1754
Query: 1017 ADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRR 1076
ADH +S PI++PR I NL R FG+S +++F L +QIQ FWRGFVCVRGFD++
Sbjct: 1755 ADHASSVQPIDIPREWIWNLPRRTAYFGTSVSSIFKGLSTQQIQHCFWRGFVCVRGFDQK 1814
Query: 1077 YRCPRPLVNRLHCSPSK 1093
R PRPL RLH S S+
Sbjct: 1815 ERAPRPLQARLHFSASR 1831
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/175 (47%), Positives = 111/175 (63%), Gaps = 14/175 (8%)
Query: 74 QQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVP 133
Q+ + +II + DL + + + ++ KA P N +VVPYQ E L
Sbjct: 735 HQIYSNLIDEIICQLNDLKLGES-NMTEMEGQKALVPYNGDRSVVPYQ-----EFELL-- 786
Query: 134 HQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQ 193
K+ K +P+V LD T R W L+M G+ + ++++ EKWW +ER VF GR+
Sbjct: 787 -----KKHKPRPKVDLDAETERTWKLLMG-KGGSEGLEGTDKEKEKWWDEERNVFHGRVD 840
Query: 194 SFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
SF ARMHLI GDRRF WKGSVVDSV+GV+LTQNVSD+LSSSA+MSLA+RFPL+S
Sbjct: 841 SFIARMHLIQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLASRFPLQS 895
>gi|356503779|ref|XP_003520681.1| PREDICTED: protein ROS1-like [Glycine max]
Length = 1764
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/557 (56%), Positives = 403/557 (72%), Gaps = 21/557 (3%)
Query: 567 DWDLLR---RIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR + E++ + MDS+DW+AVR AD SEIA+AIKERG N++A RI+ FL
Sbjct: 1182 DWDSLRIQAEAKAGKREKTENTMDSLDWDAVRRADVSEIANAIKERGMNNMLAERIQSFL 1241
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
N LV+ HG IDLEWLR+VPPD+ KE+LL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1242 NLLVDKHGGIDLEWLRDVPPDQAKEFLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRI 1301
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHYQ+ITFGK+F
Sbjct: 1302 AVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVF 1361
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC ACP+RGEC+HFASA ASAR ALPG +K IV + N Q+P LV+N +P
Sbjct: 1362 CTKSKPNCNACPMRGECRHFASAFASARLALPGSEQKSIVITTGNNATEQNPSLVINQLP 1421
Query: 804 VI-----------RIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIG 852
++ + E E+ +IN +PIIEEP +P P+CS+ E++I +D F+
Sbjct: 1422 LLLPENINQAELQQTEVIRQLEAKSEINISQPIIEEPATPEPECSQVSENDI-EDTFNEE 1480
Query: 853 DIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHL 909
EIPT++L EF N N+M+ N + LQ+ S ALVAL AA +PT KLK + L
Sbjct: 1481 SC-EIPTIKLDIEEFTLNLQNYMQEN-MELQEGEMSKALVALHPGAACIPTPKLKNVSRL 1538
Query: 910 RTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKLC 968
RTEH+VY LPD+H LL G+++R+PDDP YLLAIW+PGE+ +S+ P+ +C+S++ +LC
Sbjct: 1539 RTEHYVYELPDSHPLLNGWNKREPDDPGKYLLAIWTPGETADSIQPPESKCSSQECGRLC 1598
Query: 969 NSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINV 1028
N C+SCNS E ++ IVRGT+LIPCRTA RG FPLNGTYFQVNEVFAD+++S +PI+V
Sbjct: 1599 NENECFSCNSFREASSQIVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADNDSSLNPISV 1658
Query: 1029 PRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLH 1088
PR I NL R V FG+S ++F L R+IQ+ FWRG+VCVRGFDR R PRPL+ RLH
Sbjct: 1659 PRSWIWNLDRRTVYFGTSIPSIFKGLSTREIQQCFWRGYVCVRGFDREKRAPRPLLARLH 1718
Query: 1089 CSPSKKEAGSKRSRDES 1105
S+ +++ ES
Sbjct: 1719 FPVSRLPKNRDKTQQES 1735
Score = 149 bits (376), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 81/178 (45%), Positives = 113/178 (63%), Gaps = 11/178 (6%)
Query: 80 TTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ---- 135
T + +F+ L I ++ L P N N V E+G ++P +
Sbjct: 651 TVDTLTEQFERLNIYREAREIVLHGQNMLVPYNQQNQKNKRLV---HEYGTIIPFEGPFD 707
Query: 136 -IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQS 194
I+++R + P+V LD T R+W L+M +D + ++ED KWW+ ER VF GR +S
Sbjct: 708 PIRKQRPR--PKVDLDEETNRVWKLLM-LDINSHGINGTDEDKAKWWEDERNVFRGRAES 764
Query: 195 FTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDH 252
F ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFPL+S++++
Sbjct: 765 FIARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPLRSSSNY 822
>gi|449501064|ref|XP_004161268.1| PREDICTED: uncharacterized protein LOC101230418 [Cucumis sativus]
Length = 1768
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/585 (56%), Positives = 410/585 (70%), Gaps = 27/585 (4%)
Query: 540 KEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRRIYSTG---EERSHDKMDSVDWEAVRC 596
KE K ++K+ +K++ N WD LR+ T +R+ D MDS+DWEA+RC
Sbjct: 1191 KELKPASSRSQSKQVAKEKDNIN-----WDNLRKRTETNGKTRQRTEDTMDSLDWEAIRC 1245
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +EIA AI+ERG N++A RIK+FLNRLV+ HGSIDLEWLR+V PD+ KEYLL I GL
Sbjct: 1246 ADVNEIAHAIRERGMNNMLAERIKDFLNRLVKDHGSIDLEWLRDVEPDQAKEYLLSIRGL 1305
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
GLKSVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YPV++ IQ Y
Sbjct: 1306 GLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKY 1365
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPG 776
LWPRLC LDQ+TLYELHYQMITFGK+FC K PNC ACP+RGEC+HFASA ASAR LP
Sbjct: 1366 LWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLGLPA 1425
Query: 777 PSEKGIVTS----EFGNGIGQS---PPLVVNPIPVIRIEADPV----SESGYQINNCEPI 825
P +K IV++ E N ++ P L + P + +E P S+ C PI
Sbjct: 1426 PEDKRIVSTTECREPDNNQPRTIDQPMLSLPPSTISSVEIKPSESHQSDGKTTAGACVPI 1485
Query: 826 IEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD- 884
IEEP +P Q + + ++ ID +D D +EIPT++L EF +N N+++ N + LQ+
Sbjct: 1486 IEEPATP-EQETATQDAIIDIEDAFYEDPDEIPTIKLNIEEFSQNLQNYVQKN-MELQEG 1543
Query: 885 --SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLR--GFDRRDPDDPSPYL 940
S AL+ALT EAAS+PT KLK + LRTEH VY LPD H LL DRR+PDDPS YL
Sbjct: 1544 DMSKALIALTPEAASIPTPKLKNVSRLRTEHQVYELPDNHPLLEKLKLDRREPDDPSSYL 1603
Query: 941 LAIWSPGESPNSVGSPQYRCNSKD-SKLCNSEICYSCNSVCERNAGIVRGTILIPCRTAN 999
LAIW+PGE+ NS+ P+ RC+S++ +LC E C SCNSV E N+ +VRGT+LIPCRTA
Sbjct: 1604 LAIWTPGETANSIQLPEKRCSSQEHHQLCCEEECLSCNSVREANSFMVRGTLLIPCRTAM 1663
Query: 1000 RGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQI 1059
RG FPLNGTYFQVNEVFADHE+S +PI+VPR I NL R V FG+S T+F L + I
Sbjct: 1664 RGSFPLNGTYFQVNEVFADHESSLNPIDVPRDWIWNLPRRTVYFGTSIPTIFKGLSTQGI 1723
Query: 1060 QEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSRDE 1104
Q FWRGFVCVRGFD++ R PRPL+ RLH SK G ++ D+
Sbjct: 1724 QHCFWRGFVCVRGFDQKTRAPRPLMARLHFPASKLNRGRGKTEDQ 1768
Score = 125 bits (313), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/135 (57%), Positives = 96/135 (71%), Gaps = 7/135 (5%)
Query: 114 HNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTS 173
HNA+V Y G +VP +KR+ +P+V LD T R+W L+M + D T
Sbjct: 703 HNAIVVY-----GRDGTIVPFNPIKKRRP-RPKVELDEETGRVWKLLMGNINSKGIDGTD 756
Query: 174 NEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLS 233
E++ KWW++ER+VF+GR SF ARMHL+ GDRRF WKGSVVDSVVGV+LTQNVSD+LS
Sbjct: 757 EENI-KWWEEERKVFQGRADSFIARMHLVQGDRRFSQWKGSVVDSVVGVFLTQNVSDHLS 815
Query: 234 SSAYMSLAARFPLKS 248
SSA+MSLAARFP KS
Sbjct: 816 SSAFMSLAARFPPKS 830
>gi|357155497|ref|XP_003577140.1| PREDICTED: uncharacterized protein LOC100841571 [Brachypodium
distachyon]
Length = 1974
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/597 (55%), Positives = 405/597 (67%), Gaps = 44/597 (7%)
Query: 524 DATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR---IYSTGEE 580
D ++ +KK+ G GK + Y DWD+LR+ + EE
Sbjct: 1407 DGSSKAKKARVGTGKKRAY------------------------DWDILRKEVLVNHGNEE 1442
Query: 581 RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRN 640
R+++ D++DWE +R D EI+D I+ERG N++A RIK FLNRLV HGSIDLEWLR
Sbjct: 1443 RANNAKDALDWETIRQIDVKEISDTIRERGMNNMLAERIKAFLNRLVTDHGSIDLEWLRY 1502
Query: 641 VPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLH 700
V PDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRI VRLGWVPLQPLP L
Sbjct: 1503 VDPDKAKEYLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRICVRLGWVPLQPLPESLQ 1562
Query: 701 IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K PNC ACP+R EC
Sbjct: 1563 LHLLELYPMLENIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRAEC 1622
Query: 761 KHFASAVASARFALPGPSEKGIVTSEFGNGI----GQSPPLVVNPIPVIRIEADPVSESG 816
KHFASA ASAR ALPGP EK +VTS GN I Q P + P+ + A P
Sbjct: 1623 KHFASAFASARLALPGPEEKSLVTS--GNPIVAESCQQPYISSRPLNQLDWNAHP---HD 1677
Query: 817 YQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFME 876
+ + N +PIIEEP SP P+ + E +D D EEIPT++L EF +N N+M+
Sbjct: 1678 HVLGNRQPIIEEPASPEPEPETAELKEGAIEDMFFDDPEEIPTIKLNFEEFAQNLKNYMQ 1737
Query: 877 MNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDP 933
+N + ++D SSALVA+T EAAS+PT +LK + LRTEH VY LPD+H LL GFD+R+P
Sbjct: 1738 VNNIDIEDADMSSALVAITPEAASIPTPRLKNVSRLRTEHQVYELPDSHPLLEGFDQREP 1797
Query: 934 DDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCERNAGIVRGTIL 992
DDP PYLL+IW+PGE+ S +P CNS +S KLC+S C+SCNS+ E A VRGT+L
Sbjct: 1798 DDPCPYLLSIWTPGETAQSADAPMTSCNSHESGKLCDSSACFSCNSIREVQAQKVRGTLL 1857
Query: 993 IPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFN 1052
IPCRTA RG FPLNGTYFQVNEVFADH++S +PI+VPR I NL R V FG+S T+F
Sbjct: 1858 IPCRTAMRGSFPLNGTYFQVNEVFADHDSSRNPIDVPRSWIWNLPRRTVYFGTSVPTIFK 1917
Query: 1053 ALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK----KEAGSKRSRDES 1105
L IQ FWRGFVCVRGFDR R PRPL RLH SK K+ + +RD++
Sbjct: 1918 GLTTEDIQHCFWRGFVCVRGFDRISRAPRPLYARLHFPASKITRNKKPTASAARDDA 1974
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/209 (48%), Positives = 132/209 (63%), Gaps = 30/209 (14%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKAD--GPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRK 141
II+K K L I NK+D G +H+A+VPY+ E GA+VP++ K KRK
Sbjct: 826 IIQKIKVLDI-----------NKSDDTGAVELHSALVPYK----GEVGAVVPYEGKVKRK 870
Query: 142 KLKPEVVLDPGTLRMWNLIMNID--DGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARM 199
+ + +V LDP T MW L+M D DG+ + ++D EKW +ER++F+GR+ SF ARM
Sbjct: 871 RARAKVSLDPVTALMWKLLMEPDMVDGS---EEMDKDKEKWLDEERKIFQGRVDSFIARM 927
Query: 200 HLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENL 259
HL+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LSSSA+M+LAA+FP KS PSD
Sbjct: 928 HLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAFMALAAKFPAKSEVSKIPSDRMF 987
Query: 260 RTTASLEPIGSNSTSNGAVYDS---EGNM 285
T P N +G DS +GN+
Sbjct: 988 HT-----PSEKNGGCSGLFGDSVKLQGNI 1011
>gi|193082832|emb|CAQ58412.1| putative transcriptional activator DEMETER [Hordeum vulgare subsp.
vulgare]
Length = 1981
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/554 (57%), Positives = 404/554 (72%), Gaps = 14/554 (2%)
Query: 549 DRTKKTTPKKNSDNTVQQDWDLLRR--IYSTG-EERSHDKMDSVDWEAVRCADESEIADA 605
DR+K T K + DWD+LR+ + + G EERS + D++DWE +R + EI++
Sbjct: 1417 DRSK--TKKTRAGKKRTYDWDILRKEVLANRGNEERSENAKDALDWETIRQINVKEISNT 1474
Query: 606 IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
I+ERG N++A RIK+FLNR+V HGSIDLEWLR+V PDK KEYLL I GLGLKSVECVR
Sbjct: 1475 IRERGMNNMLAERIKDFLNRVVRDHGSIDLEWLRHVDPDKAKEYLLSIRGLGLKSVECVR 1534
Query: 666 LLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLD 725
LL+L H+AFPVD NVGRI VRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LD
Sbjct: 1535 LLTLHHMAFPVDTNVGRICVRLGWVPLQPLPESLQLHLLELYPMLENIQKYLWPRLCKLD 1594
Query: 726 QKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS 785
Q+TLYELHYQMITFGK+FC K PNC ACP+R ECKHFASA ASAR ALPGP EK +VTS
Sbjct: 1595 QRTLYELHYQMITFGKVFCTKSKPNCNACPMRAECKHFASAFASARLALPGPEEKSLVTS 1654
Query: 786 --EFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESE 843
+G Q P ++P+ + +++ + + + ++NC+PIIEEP SP P+ + E
Sbjct: 1655 GNPIASGSCQQP--YISPMRLNQLDWNAHAHD-HILDNCQPIIEEPASPEPEPETAEMRE 1711
Query: 844 IDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPT 900
+D + D EEIPT++L EF +N N+M++N + ++D SSALVA+T EAAS+PT
Sbjct: 1712 SAIEDIFLDDPEEIPTIKLNFEEFAQNLKNYMQVNNIEMEDADMSSALVAITPEAASIPT 1771
Query: 901 RKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC 960
+LK + LRTEH VY LPD+H LL G+D+R+PDDP PYLL+IW+PGE+ S+ +P+ C
Sbjct: 1772 PRLKNVSRLRTEHQVYELPDSHPLLEGYDQREPDDPCPYLLSIWTPGETAQSIDAPKTAC 1831
Query: 961 NSKDS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADH 1019
NS +S KLC+S C+SCNS+ E A VRGTIL+PCRTA RG FPLNGTYFQVNEVFADH
Sbjct: 1832 NSNESGKLCDSSACFSCNSMREAQAQTVRGTILVPCRTAMRGSFPLNGTYFQVNEVFADH 1891
Query: 1020 ETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRC 1079
++S +P++VPR I +L R V FG+S ++F L IQ+ FWRGFVCVRGFDR R
Sbjct: 1892 DSSRNPVDVPRRWIWDLPRRTVYFGTSVPSIFKGLTTEDIQQCFWRGFVCVRGFDRTSRA 1951
Query: 1080 PRPLVNRLHCSPSK 1093
PRPL RLH SK
Sbjct: 1952 PRPLYARLHFPASK 1965
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/184 (49%), Positives = 118/184 (64%), Gaps = 22/184 (11%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNV--HNAVVPYQVGPSSEHGALVPHQIKEKRK 141
II+K K L I NK+DG + H A+VPY+ E GA++P++ K KRK
Sbjct: 839 IIQKIKVLDI-----------NKSDGIGSAEPHGALVPYK----GEFGAIIPYEGKGKRK 883
Query: 142 KLKPEVVLDPGTLRMWNLIMNID--DGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARM 199
+ +V LDP T MW L+M D DG+ + ++D EKW +ER++F GRI SF ARM
Sbjct: 884 YARAKVNLDPVTALMWKLLMEPDMVDGS---EGMDKDKEKWLDEERKIFRGRIDSFIARM 940
Query: 200 HLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENL 259
HL+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LSSSA+M+LAA+FP K +D
Sbjct: 941 HLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAFMALAAKFPAKPEVSKISADRMF 1000
Query: 260 RTTA 263
T +
Sbjct: 1001 HTAS 1004
>gi|413925773|gb|AFW65705.1| hypothetical protein ZEAMMB73_319662 [Zea mays]
Length = 1904
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 339/650 (52%), Positives = 427/650 (65%), Gaps = 20/650 (3%)
Query: 458 LFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEISVLPSNRESFRT 517
L C + YT T G + S SG +D R ++S ++ + S L N+ S
Sbjct: 1259 LVCSNPQEVYTTSTD-QMGAEQSQSGCGQKYNDARVQTAS--HERHQSSTLCDNQNSCSV 1315
Query: 518 GMPQ-AHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR-IY 575
+ A D+T + +G K+++ S K + PKK +D DWD LR+ +
Sbjct: 1316 VLQGVASDSTQKFVDTQKGPSKARQNGSKAK----VRGRPKKKTD-----DWDSLRKKVL 1366
Query: 576 STG--EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSI 633
S G ++RSHD D+VDWEAVR A+ EI++ I+ERG N++A RIKEFLNRLV HGSI
Sbjct: 1367 SNGGDKQRSHDARDTVDWEAVRQAEVREISETIRERGMNNMLAERIKEFLNRLVTDHGSI 1426
Query: 634 DLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
DLEWLR+V PDK K++LL I GLGLKSVECVRLL+L H+AFPVD NVGRI VRLGWVPLQ
Sbjct: 1427 DLEWLRDVQPDKAKDFLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRICVRLGWVPLQ 1486
Query: 694 PLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGA 753
PLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K PNC +
Sbjct: 1487 PLPESLQLHLLEMYPMLEHIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNS 1546
Query: 754 CPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVS 813
CP+R ECKHFASA ASAR ALP P EK +VT E N + S +N V ++E
Sbjct: 1547 CPMRAECKHFASAFASARLALPAPEEKCLVTLEDPNVVEFSHQTYINSGSVGQLEWSANY 1606
Query: 814 ESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPN 873
N +PIIEEP SP + E +DF + +EIPT+ L EF +N +
Sbjct: 1607 PKHAVSGNHQPIIEEPLSPECETENIEAHEGAIEDFFCEESDEIPTINLNIEEFTQNLKD 1666
Query: 874 FMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDR 930
+M+ N V ++ S ALVA+T +AAS+PT KLK LRTEH VY LPD+H LL GF++
Sbjct: 1667 YMQANNVEIEYADMSKALVAITPDAASIPTPKLKNVNRLRTEHQVYELPDSHPLLEGFEQ 1726
Query: 931 RDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCERNAGIVRG 989
R+PDDP PYLL+IW+PGE+ S+ +P+ C+S ++ +LC S C+SCN++ E A VRG
Sbjct: 1727 REPDDPCPYLLSIWTPGETAQSIDAPKTFCDSGETGRLCGSSTCFSCNNIREMQAQKVRG 1786
Query: 990 TILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATT 1049
T+LIPCRTA RG FPLNGTYFQVNEVFADH +S +PI+VPR I +L R V FG+S T
Sbjct: 1787 TLLIPCRTAMRGSFPLNGTYFQVNEVFADHCSSQNPIDVPRSWIWDLPRRTVYFGTSVPT 1846
Query: 1050 LFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSK 1099
+F L +IQ FWRGFVCVRGFDR R PRPL RLH SK G K
Sbjct: 1847 IFRGLTTEEIQRCFWRGFVCVRGFDRTVRAPRPLYARLHFPVSKVVRGKK 1896
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/167 (49%), Positives = 108/167 (64%), Gaps = 20/167 (11%)
Query: 84 IIRKFKDLTIR---DGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKR 140
II K K LTI G S++P NA+VPY+ E AL+ + K K+
Sbjct: 799 IIEKIKLLTINGPDKGVSEVP------------KNALVPYE----GEFSALIAFEGKAKK 842
Query: 141 KKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMH 200
+ + +V +DP T MWNL+M D G + ++D EKW +ER VF GR+ SF ARMH
Sbjct: 843 SRPRAKVNIDPVTTMMWNLLMGPDMGDDAEGL-DKDKEKWLDEERRVFRGRVDSFIARMH 901
Query: 201 LILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 247
L+ GDRRF WKGSVVDSVVGV+LTQNVSD+LSSSA+M++AA+FP+K
Sbjct: 902 LVQGDRRFSRWKGSVVDSVVGVFLTQNVSDHLSSSAFMAVAAKFPVK 948
>gi|224113515|ref|XP_002316517.1| DNA glycosylase [Populus trichocarpa]
gi|222865557|gb|EEF02688.1| DNA glycosylase [Populus trichocarpa]
Length = 519
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 317/515 (61%), Positives = 383/515 (74%), Gaps = 12/515 (2%)
Query: 586 MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK 645
MDS+D+EAVR A EI+DAIKERG N++A RI+EFLNRLV HGSIDLEWLR+VPPDK
Sbjct: 1 MDSLDYEAVRSARVKEISDAIKERGMNNMLAERIQEFLNRLVREHGSIDLEWLRDVPPDK 60
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
K+YLL I GLGLKSVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+
Sbjct: 61 AKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLE 120
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K PNC ACP+R EC+HFAS
Sbjct: 121 LYPILESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSRPNCNACPMRAECRHFAS 180
Query: 766 AVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPI 825
A ASAR ALPGP EKGI TS +SP + +NP+P+ E +P G I +C PI
Sbjct: 181 AFASARLALPGPEEKGITTSTVPFMPERSPGIGINPMPLPPPEDNPHKRHGSDIGSCVPI 240
Query: 826 IEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD- 884
IEEP +P + +E E++I+D D +EIPT++L EF EN N+M N + LQ+
Sbjct: 241 IEEPATPDQENTELTETDIEDFG---EDPDEIPTIKLNMEEFTENLQNYMHTN-LELQEG 296
Query: 885 --SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRG---FDRRDPDDPSPY 939
S ALVAL AS+PT KLK + LRTEH VY LPD+H LL G DRR+PDDPSPY
Sbjct: 297 DMSKALVALNPN-ASIPTPKLKNVSRLRTEHQVYELPDSHPLLEGNLQMDRREPDDPSPY 355
Query: 940 LLAIWSPGESPNSVGSPQYRCNSKD-SKLCNSEICYSCNSVCERNAGIVRGTILIPCRTA 998
LLAIW+PGE+ NS+ P +C S++ +KLC+ + C+SCNS+ E N+ VRGT+LIPCRTA
Sbjct: 356 LLAIWTPGETANSIEPPDQQCQSREPNKLCDEKTCFSCNSIREANSQTVRGTLLIPCRTA 415
Query: 999 NRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQ 1058
RG FPLNGTYFQVNE+FADHE+S +PI+VPR I NL R IV FG+S +++F L
Sbjct: 416 MRGSFPLNGTYFQVNEMFADHESSLNPIDVPRSLIWNLPRRIVYFGTSVSSIFKGLSTEG 475
Query: 1059 IQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
IQ FWRGFVCVRGFD++ R PRPL RLH S+
Sbjct: 476 IQFCFWRGFVCVRGFDQKTRAPRPLKARLHFPASR 510
>gi|333471379|gb|AEF38423.1| 5-methylcytosine DNA glycosylase [Triticum aestivum]
Length = 1975
Score = 634 bits (1634), Expect = e-178, Method: Compositional matrix adjust.
Identities = 311/536 (58%), Positives = 394/536 (73%), Gaps = 12/536 (2%)
Query: 567 DWDLLRR--IYSTG-EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD+LR+ + S G EER + D++DWE +R D EI++AI+ERG N+++ RI++FL
Sbjct: 1427 DWDILRKEVLASRGNEERGENAKDALDWETIRQIDVKEISNAIRERGMNNMLSERIQDFL 1486
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NR+V HGSIDLEWLR V PDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1487 NRVVRDHGSIDLEWLRYVDPDKAKEYLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRI 1546
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
VRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+F
Sbjct: 1547 CVRLGWVPLQPLPESLQLHLLELYPMLENIQKYLWPRLCKLDQRTLYELHYQMITFGKVF 1606
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS--EFGNGIGQSPPLVVNP 801
C K PNC ACP+R ECKHFASA ASAR ALPGP EK +VTS +G Q P ++
Sbjct: 1607 CTKSKPNCNACPMRAECKHFASAFASARLALPGPEEKSLVTSGNPIASGSCQQP--YISS 1664
Query: 802 IPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLR 861
+ + +++ + + + ++N +PIIEEP SP P+ + E +D + D EEIPT++
Sbjct: 1665 MRLNQLDWN-ANAHDHILDNRQPIIEEPASPEPEPETAEMRESAIEDIFLDDPEEIPTIK 1723
Query: 862 LQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVL 918
L EF +N N+M++N + ++D SSALVA+T EAAS+PT +LK + LRTEH VY L
Sbjct: 1724 LNFEEFAQNLKNYMQVNNIEMEDADMSSALVAITPEAASIPTPRLKNVSRLRTEHQVYEL 1783
Query: 919 PDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCN 977
PD+H LL G+D+R+PDDP PYLL+IW+PGE+ S+ +P+ CNS +S KLC+S C+SCN
Sbjct: 1784 PDSHPLLEGYDQREPDDPCPYLLSIWTPGETAQSIDAPKTACNSNESGKLCDSSACFSCN 1843
Query: 978 SVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLR 1037
S+ E A VRGTIL+PCRTA RG FPLNGTYFQVNEVFADH++S +P++VPR I +L
Sbjct: 1844 SMREAQAQTVRGTILVPCRTAMRGSFPLNGTYFQVNEVFADHDSSRNPVDVPRRWIWDLP 1903
Query: 1038 REIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
R V FG+S ++F L IQ+ FWRGFVCVRGFDR R PRPL RLH SK
Sbjct: 1904 RRTVYFGTSVPSIFKGLTTEDIQQCFWRGFVCVRGFDRTSRAPRPLYARLHFPASK 1959
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/184 (49%), Positives = 119/184 (64%), Gaps = 22/184 (11%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKAD--GPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRK 141
II+K K L I NK+D G + H A+VPY+ E GA++P++ K KRK
Sbjct: 833 IIQKIKILDI-----------NKSDDTGSAEPHGALVPYK----GEFGAIIPYEGKGKRK 877
Query: 142 KLKPEVVLDPGTLRMWNLIMNID--DGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARM 199
+ +V LDP T MW L+M D DG+ + ++D EKW ++ER++F GRI SF ARM
Sbjct: 878 YARAKVNLDPVTALMWKLLMEPDMVDGS---EGMDKDKEKWLEEERKIFRGRIDSFIARM 934
Query: 200 HLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENL 259
HL+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LSSSA+M+LAA+FP K +D
Sbjct: 935 HLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAFMALAAKFPAKPEVSKISADRMF 994
Query: 260 RTTA 263
T +
Sbjct: 995 HTAS 998
>gi|333471383|gb|AEF38425.1| 5-methylcytosine DNA glycosylase [Triticum aestivum]
Length = 1981
Score = 633 bits (1632), Expect = e-178, Method: Compositional matrix adjust.
Identities = 311/536 (58%), Positives = 394/536 (73%), Gaps = 12/536 (2%)
Query: 567 DWDLLRR--IYSTG-EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD+LR+ + S G EER + D++DWE +R D EI++AI+ERG N+++ RI++FL
Sbjct: 1433 DWDILRKEVLASRGNEERGENAKDALDWETIRQIDVKEISNAIRERGMNNMLSERIQDFL 1492
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NR+V HGSIDLEWLR V PDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1493 NRVVRDHGSIDLEWLRYVDPDKAKEYLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRI 1552
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
VRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+F
Sbjct: 1553 CVRLGWVPLQPLPESLQLHLLELYPMLENIQKYLWPRLCKLDQRTLYELHYQMITFGKVF 1612
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS--EFGNGIGQSPPLVVNP 801
C K PNC ACP+R ECKHFASA ASAR ALPGP EK +VTS +G Q P ++
Sbjct: 1613 CTKSKPNCNACPMRAECKHFASAFASARLALPGPEEKSLVTSGNPIASGSCQQP--YISS 1670
Query: 802 IPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLR 861
+ + +++ + + + ++N +PIIEEP SP P+ + E +D + D EEIPT++
Sbjct: 1671 MRLNQLDWN-ANAHDHILDNRQPIIEEPASPEPEPETAEMRESAIEDIFLDDPEEIPTIK 1729
Query: 862 LQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVL 918
L EF +N N+M++N + ++D SSALVA+T EAAS+PT +LK + LRTEH VY L
Sbjct: 1730 LNFEEFAQNLKNYMQVNNIEMEDADMSSALVAITPEAASIPTPRLKNVSRLRTEHQVYEL 1789
Query: 919 PDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCN 977
PD+H LL G+D+R+PDDP PYLL+IW+PGE+ S+ +P+ CNS +S KLC+S C+SCN
Sbjct: 1790 PDSHPLLEGYDQREPDDPCPYLLSIWTPGETAQSIDAPKTACNSNESGKLCDSSACFSCN 1849
Query: 978 SVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLR 1037
S+ E A VRGTIL+PCRTA RG FPLNGTYFQVNEVFADH++S +P++VPR I +L
Sbjct: 1850 SMREAQAQTVRGTILVPCRTAMRGSFPLNGTYFQVNEVFADHDSSRNPVDVPRRWIWDLP 1909
Query: 1038 REIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
R V FG+S ++F L IQ+ FWRGFVCVRGFDR R PRPL RLH SK
Sbjct: 1910 RRTVYFGTSVPSIFKGLTTEDIQQCFWRGFVCVRGFDRTSRAPRPLYARLHFPASK 1965
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/166 (51%), Positives = 113/166 (68%), Gaps = 11/166 (6%)
Query: 102 LQCNKAD--GPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNL 159
L NK+D G + H A+VPY+ E GA++P++ K KRK + +V LDP T MW L
Sbjct: 846 LDINKSDDTGSAEPHGALVPYK----GEFGAIIPYEGKGKRKYARAKVNLDPVTALMWKL 901
Query: 160 IMNID--DGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVD 217
+M D DG+ + ++D EKW ++ER++F GRI SF ARMHL+ GDRRF PWKGSVVD
Sbjct: 902 LMEPDMVDGS---EGMDKDKEKWLEEERKIFRGRIDSFIARMHLVQGDRRFSPWKGSVVD 958
Query: 218 SVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTA 263
SVVGV+LTQNVSD+LSSSA+M+LAA+FP K +D T +
Sbjct: 959 SVVGVFLTQNVSDHLSSSAFMALAAKFPAKPEVSKISADRMFHTAS 1004
>gi|30680560|ref|NP_196076.2| transcriptional activator DEMETER [Arabidopsis thaliana]
gi|332003377|gb|AED90760.1| transcriptional activator DEMETER [Arabidopsis thaliana]
Length = 1729
Score = 630 bits (1625), Expect = e-177, Method: Compositional matrix adjust.
Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)
Query: 568 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR+ E ER+ + MDS+D+EA+R A SEI++AIKERG N++A RIK+FL
Sbjct: 1192 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1251
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+V+ HG IDLEWLR PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1252 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1311
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
VR+GWVPLQPLP L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1312 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1371
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 804
K PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++ PP+ + I +
Sbjct: 1372 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIEL 1431
Query: 805 -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
+ +E S + NCEPIIEEP SPG +C+E ES+I+D ++ D +EIPT++L
Sbjct: 1432 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1490
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
+F ME N + LQ+ S ALVAL S+PT KLK + LRTEH VY LPD
Sbjct: 1491 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1549
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL G D+R+PDDPSPYLLAIW+PGE+ NS P+ +C K S K+C E C CNS+
Sbjct: 1550 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1609
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR I +L R
Sbjct: 1610 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1669
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
V FG+S T++F L QIQ FW+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 1670 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1723
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 122/194 (62%), Gaps = 20/194 (10%)
Query: 76 LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 135
L Q + A+II + ++L + D + NA+V Y+ GALVP++
Sbjct: 646 LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 688
Query: 136 IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 195
K++K +P+V +D T R+WNL+M D D+ ++ EKWW++ER VF GR SF
Sbjct: 689 --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 746
Query: 196 TARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS 255
ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP K ++
Sbjct: 747 IARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSRE-D 805
Query: 256 DENLRTTASLEPIG 269
+ N+R+ +P G
Sbjct: 806 ERNVRSVVVEDPEG 819
>gi|255580114|ref|XP_002530889.1| conserved hypothetical protein [Ricinus communis]
gi|223529542|gb|EEF31495.1| conserved hypothetical protein [Ricinus communis]
Length = 1876
Score = 630 bits (1625), Expect = e-177, Method: Compositional matrix adjust.
Identities = 308/535 (57%), Positives = 382/535 (71%), Gaps = 10/535 (1%)
Query: 543 KSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADE 599
K+++ + +K K S DWD LR+ + +ERS MDS+D+EA+R A
Sbjct: 1318 KANLNRNASKARKAKAESGQKDAVDWDSLRKQVLVNGRKKERSESAMDSLDYEAMRSAHV 1377
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
+EI+D IKERG N++A RIK+FLNRLV HGSIDLEWLR+VPPDK KEYLL I GLGLK
Sbjct: 1378 NEISDTIKERGMNNMLAERIKDFLNRLVREHGSIDLEWLRDVPPDKAKEYLLSIRGLGLK 1437
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
SVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWP
Sbjct: 1438 SVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPILESIQKYLWP 1497
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSE 779
RLC LDQ+TLYELHYQMITFGK+FC K PNC ACP+R EC+HFASA ASAR ALPGP +
Sbjct: 1498 RLCKLDQRTLYELHYQMITFGKVFCTKSRPNCNACPMRAECRHFASAFASARLALPGPED 1557
Query: 780 KGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSES 839
K IVT+ +SP +V++P+P+ E + ++ G I +C PIIEEP +P + +E
Sbjct: 1558 KSIVTATVPLTTERSPGIVIDPLPLPPAEDNLLTRRGSDIVSCVPIIEEPATPEQEHTEV 1617
Query: 840 PESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAA 896
ES+I +D D +EIPT++L E N N+M+ N + LQ+ S ALVAL EAA
Sbjct: 1618 IESDI--EDIFDEDPDEIPTIKLNMEELTVNLQNYMQAN-MELQECDMSKALVALNPEAA 1674
Query: 897 SVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSP 956
S+PT KLK + LRTEH VY LPD+H LL D+R PDDPSPYLLAIW+PGE+ NS+ P
Sbjct: 1675 SIPTPKLKNVSRLRTEHQVYELPDSHPLLNRMDKRQPDDPSPYLLAIWTPGETANSIQPP 1734
Query: 957 QYRCNSKD-SKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEV 1015
+ C + KLCN + C+SCNS+ E N+ VRGT+LIPCRTA RG FPLNGTYFQVNEV
Sbjct: 1735 ERHCQFQGPDKLCNEQTCFSCNSIRETNSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEV 1794
Query: 1016 FADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCV 1070
FADHE+S +PI+VPR I NL R +V FG+S +T+F L IQ FW+G V
Sbjct: 1795 FADHESSLNPIDVPRAWIWNLPRRMVYFGTSVSTIFKGLSTEGIQYCFWKGAYAV 1849
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/140 (55%), Positives = 101/140 (72%), Gaps = 8/140 (5%)
Query: 116 AVVPYQVGPSSEHGALVPHQIKE--KRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTS 173
A+VPY+ GAL+P+ E K++K +P+V LDP T R+W L+M + G + T
Sbjct: 831 AIVPYKGD-----GALIPYDGFEIIKKRKPRPKVDLDPETERVWKLLMWKEGGEGLEGTD 885
Query: 174 NEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLS 233
E ++WW++ER VF GR SF ARMHL+ GDRRF WKGSVVDSV+GV+LTQNVSD+LS
Sbjct: 886 QEK-KQWWEEERRVFGGRADSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLS 944
Query: 234 SSAYMSLAARFPLKSTNDHT 253
SSA+M+LAA+FPLKS + T
Sbjct: 945 SSAFMNLAAKFPLKSMRNRT 964
>gi|21743571|gb|AAM77215.1| DEMETER protein [Arabidopsis thaliana]
Length = 1729
Score = 630 bits (1624), Expect = e-177, Method: Compositional matrix adjust.
Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)
Query: 568 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR+ E ER+ + MDS+D+EA+R A SEI++AIKERG N++A RIK+FL
Sbjct: 1192 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1251
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+V+ HG IDLEWLR PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1252 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1311
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
VR+GWVPLQPLP L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1312 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1371
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 804
K PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++ PP+ + I +
Sbjct: 1372 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESFPPVAIPMIEL 1431
Query: 805 -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
+ +E S + NCEPIIEEP SPG +C+E ES+I+D ++ D +EIPT++L
Sbjct: 1432 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1490
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
+F ME N + LQ+ S ALVAL S+PT KLK + LRTEH VY LPD
Sbjct: 1491 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1549
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL G D+R+PDDPSPYLLAIW+PGE+ NS P+ +C K S K+C E C CNS+
Sbjct: 1550 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1609
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR I +L R
Sbjct: 1610 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1669
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
V FG+S T++F L QIQ FW+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 1670 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1723
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 122/194 (62%), Gaps = 20/194 (10%)
Query: 76 LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 135
L Q + A+II + ++L + D + NA+V Y+ GALVP++
Sbjct: 646 LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 688
Query: 136 IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 195
K++K +P+V +D T R+WNL+M D D+ ++ EKWW++ER VF GR SF
Sbjct: 689 --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 746
Query: 196 TARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS 255
ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP K ++
Sbjct: 747 IARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSRE-D 805
Query: 256 DENLRTTASLEPIG 269
+ N+R+ +P G
Sbjct: 806 ERNVRSVVVEDPEG 819
>gi|333471381|gb|AEF38424.1| 5-methylcytosine DNA glycosylase [Triticum aestivum]
Length = 1982
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 309/536 (57%), Positives = 392/536 (73%), Gaps = 12/536 (2%)
Query: 567 DWDLLRR--IYSTGEE-RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD+LR+ + S G E R + D++DWE +R D EI++ I+ERG N+++ RI++FL
Sbjct: 1434 DWDILRKEVLASRGNEGRGENAKDALDWETIRQIDVKEISNTIRERGMNNMLSERIQDFL 1493
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NR+V HGSIDLEWLR V PDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1494 NRVVRDHGSIDLEWLRYVDPDKAKEYLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRI 1553
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
VRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+F
Sbjct: 1554 CVRLGWVPLQPLPESLQLHLLELYPMLENIQKYLWPRLCKLDQRTLYELHYQMITFGKVF 1613
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS--EFGNGIGQSPPLVVNP 801
C K PNC ACP+R ECKHFASA ASAR ALPGP EK +VTS +G Q P ++
Sbjct: 1614 CTKSKPNCNACPMRAECKHFASAFASARLALPGPEEKSLVTSGNPIASGSCQQP--YISS 1671
Query: 802 IPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLR 861
+ + +++ + + + ++N +PIIEEP SP P+ + E +D + D EEIPT++
Sbjct: 1672 MRLNQLDWN-ANAHDHILDNRQPIIEEPASPEPEPETAEMRESAIEDIFLDDPEEIPTIK 1730
Query: 862 LQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVL 918
L EF +N N+M++N + ++D SSALVA+T EAAS+PT +LK + LRTEH VY L
Sbjct: 1731 LNFEEFAQNLKNYMQVNNIEMEDADMSSALVAITPEAASIPTPRLKNVSRLRTEHQVYEL 1790
Query: 919 PDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCN 977
PD+H LL G+D+R+PDDP PYLL+IW+PGE+ S+ +P+ CNS +S KLC+S C+SCN
Sbjct: 1791 PDSHPLLEGYDQREPDDPCPYLLSIWTPGETAQSIDAPKTACNSNESGKLCDSSACFSCN 1850
Query: 978 SVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLR 1037
S+ E A VRGTIL+PCRTA RG FPLNGTYFQVNEVFADH++S +P++VPR I +L
Sbjct: 1851 SMREAQAQTVRGTILVPCRTAMRGSFPLNGTYFQVNEVFADHDSSRNPVDVPRRWIWDLP 1910
Query: 1038 REIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
R V FG+S ++F L IQ+ FWRGFVCVRGFDR R PRPL RLH SK
Sbjct: 1911 RRTVYFGTSVPSIFKGLTTEDIQQCFWRGFVCVRGFDRTSRAPRPLYARLHFPASK 1966
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/184 (49%), Positives = 118/184 (64%), Gaps = 22/184 (11%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKAD--GPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRK 141
II+K K L I NK+D G + H A+VPY+ E GA++P++ K KRK
Sbjct: 840 IIQKIKVLDI-----------NKSDDTGSAEPHGALVPYK----GEFGAIIPYEGKGKRK 884
Query: 142 KLKPEVVLDPGTLRMWNLIMNID--DGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARM 199
+ +V LDP T MW L+M D DG+ + ++D EKW ++ER +F GRI SF ARM
Sbjct: 885 YARAKVNLDPVTALMWKLLMEPDMVDGS---EGMDKDKEKWLEEERRIFRGRIDSFIARM 941
Query: 200 HLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENL 259
HL+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LSSSA+M+LAA+FP K +D
Sbjct: 942 HLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAFMALAAKFPAKPEVSKISADRMF 1001
Query: 260 RTTA 263
T +
Sbjct: 1002 HTAS 1005
>gi|145334291|ref|NP_001078527.1| transcriptional activator DEMETER [Arabidopsis thaliana]
gi|108935833|sp|Q8LK56.2|DME_ARATH RecName: Full=Transcriptional activator DEMETER; AltName: Full=DNA
glycosylase-related protein DME
gi|84782664|gb|ABC61677.1| DNA glycosylase DEMETER [Arabidopsis thaliana]
gi|332003378|gb|AED90761.1| transcriptional activator DEMETER [Arabidopsis thaliana]
Length = 1987
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)
Query: 568 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR+ E ER+ + MDS+D+EA+R A SEI++AIKERG N++A RIK+FL
Sbjct: 1450 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1509
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+V+ HG IDLEWLR PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1510 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1569
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
VR+GWVPLQPLP L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1570 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1629
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 804
K PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++ PP+ + I +
Sbjct: 1630 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIEL 1689
Query: 805 -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
+ +E S + NCEPIIEEP SPG +C+E ES+I+D ++ D +EIPT++L
Sbjct: 1690 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1748
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
+F ME N + LQ+ S ALVAL S+PT KLK + LRTEH VY LPD
Sbjct: 1749 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1807
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL G D+R+PDDPSPYLLAIW+PGE+ NS P+ +C K S K+C E C CNS+
Sbjct: 1808 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1867
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR I +L R
Sbjct: 1868 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1927
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
V FG+S T++F L QIQ FW+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 1928 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1981
Score = 137 bits (344), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 122/194 (62%), Gaps = 20/194 (10%)
Query: 76 LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 135
L Q + A+II + ++L + D + NA+V Y+ GALVP++
Sbjct: 904 LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 946
Query: 136 IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 195
K++K +P+V +D T R+WNL+M D D+ ++ EKWW++ER VF GR SF
Sbjct: 947 --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 1004
Query: 196 TARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS 255
ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP K ++
Sbjct: 1005 IARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSRE-D 1063
Query: 256 DENLRTTASLEPIG 269
+ N+R+ +P G
Sbjct: 1064 ERNVRSVVVEDPEG 1077
>gi|255573046|ref|XP_002527453.1| conserved hypothetical protein [Ricinus communis]
gi|223533188|gb|EEF34945.1| conserved hypothetical protein [Ricinus communis]
Length = 1712
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 322/580 (55%), Positives = 394/580 (67%), Gaps = 32/580 (5%)
Query: 544 SDMKNDR-------TKKTTPKKNSDNTVQQD--WDLLRRIYSTGEERSHDKMDSVDWEAV 594
+ MKN R T+KTT K + N V++ W+ L R YS RS DS+DWEAV
Sbjct: 1143 TSMKNKRSGLVPGETRKTT--KGTKNVVKEKSYWNDLGRKYS--RPRSSAATDSIDWEAV 1198
Query: 595 RCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIE 654
R A E+EIADAIK RGQ NI+A +IK+ LNR+++ HGSIDLEWLR+ P D VK YLL+IE
Sbjct: 1199 RQAPETEIADAIKSRGQHNIMARKIKKSLNRILDYHGSIDLEWLRHAPDDDVKVYLLEIE 1258
Query: 655 GLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
GLGLKSVEC+RLL+L H AFPVD NV RIAVRLGWVPL+PLPG L +HLL+EYPVMD IQ
Sbjct: 1259 GLGLKSVECLRLLTLYHDAFPVDTNVARIAVRLGWVPLEPLPGVLQLHLLEEYPVMDTIQ 1318
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFAL 774
YLWPRLC LDQKTLYELHYQMITFGK+FC K PNCG CP+R EC+H ASA+AS L
Sbjct: 1319 KYLWPRLCKLDQKTLYELHYQMITFGKVFCTKLKPNCGVCPMRAECRHLASAIASENLCL 1378
Query: 775 PGPSEKGIVTSEFGN------GIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEE 828
PG ++G S+ N + + L+VNP V SES ++ +CEP+IEE
Sbjct: 1379 PGIPKRGEERSKVPNMSLESSAVDANDALIVNPTAVSLSGYVKASESKFETQSCEPLIEE 1438
Query: 829 PRSPGPQCSESPESEIDD----DDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQ- 883
P+SP P ++I+D + I D EEIP ++L F+ N FM+ + LQ
Sbjct: 1439 PKSP------EPVADIEDFEIANGIDINDGEEIPIIQLSNEPFRANVQYFMDEYRNNLQT 1492
Query: 884 DSS--ALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLL 941
DSS ALV L+ SVP RKLK + LRTEH VY +PD HELL G R D +D SPYLL
Sbjct: 1493 DSSSRALVPLSVNVDSVPVRKLKNISRLRTEHQVYEIPDDHELLIGLPRPDRNDQSPYLL 1552
Query: 942 AIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRG 1001
AIW+PGESP S P+ RCNS+ ++LC E C+ C S+ E VRGTIL+PCRTA RG
Sbjct: 1553 AIWTPGESPASCQPPEKRCNSQGAELCKDETCFYCQSIWEERTETVRGTILVPCRTAMRG 1612
Query: 1002 RFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQE 1061
RFPLNGTYFQVNEVFADHE+S++ I VPR +I LRR IV G+S +F A +++IQE
Sbjct: 1613 RFPLNGTYFQVNEVFADHESSYNAIIVPRSSIWYLRRRIVYCGTSPNAIFKACSLKEIQE 1672
Query: 1062 YFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRS 1101
FW+GF+CVRG+D + P+PL R HC PSK S +S
Sbjct: 1673 NFWKGFICVRGWDAKTGAPKPLAKRFHCPPSKMVKNSSKS 1712
Score = 134 bits (336), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 111/264 (42%), Positives = 163/264 (61%), Gaps = 22/264 (8%)
Query: 86 RKFKDLTIRDGGSQLPLQCNK------ADGPSNVH--NAVVPYQVGPSSEHG---ALVPH 134
RK K+L I D G C+ D P+ + +A+VPYQ S++ G A+V
Sbjct: 776 RKHKNLLIDDIGEGRLDVCDADAIVSYQDPPTKIEGSSAIVPYQDPSSTKIGGSSAIV-- 833
Query: 135 QIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQS 194
K K+KKL +V+LD T+R WNL++ ID+ ++ +++ EK W++E E+F GRI+S
Sbjct: 834 LHKHKKKKLTAQVLLDKETIRQWNLLVKIDN-VLGEKEEDKEREKKWEEEVEIFHGRIKS 892
Query: 195 FTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTP 254
FT+RMH+ILGDRRFKPWKGSVVDSVVGV+LTQNV+D LSS+AYMSLA+++P++S ++
Sbjct: 893 FTSRMHVILGDRRFKPWKGSVVDSVVGVFLTQNVTDFLSSAAYMSLASKYPIQSKSNQEA 952
Query: 255 SDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTEPEPDRCCE------LKDRDDAFDS 308
SD+ L S EPIG + G + N YF+TE E + L D D ++
Sbjct: 953 SDDELDYKNSQEPIGDDIVCTGTTENLHANGYFITEFEEGTSVDIVEEISLVDVKDIGNA 1012
Query: 309 RIQRKALQENGDIKVMTDAVPSQA 332
+Q+++ ++ +IK +T P A
Sbjct: 1013 ILQKQSYRQ--EIKALTHENPENA 1034
>gi|297806437|ref|XP_002871102.1| hypothetical protein ARALYDRAFT_487239 [Arabidopsis lyrata subsp.
lyrata]
gi|297316939|gb|EFH47361.1| hypothetical protein ARALYDRAFT_487239 [Arabidopsis lyrata subsp.
lyrata]
Length = 1997
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 312/537 (58%), Positives = 384/537 (71%), Gaps = 14/537 (2%)
Query: 568 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR+ E ERS D MDS+D+EA+R A SEI+DAIKERG N++A RIK+FL
Sbjct: 1458 WDSLRKDVEVNEGRKERSKDSMDSIDYEAIRRASISEISDAIKERGMNNMLAVRIKDFLE 1517
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R V+ HG IDLEWLR+VPPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1518 RTVKDHGGIDLEWLRDVPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1577
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
VRLGWVPLQPLP L +HLL+ YPV++ IQ +LWPRLC LDQ TLYELHYQ+ITFGK+FC
Sbjct: 1578 VRLGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQPTLYELHYQLITFGKVFC 1637
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPS--EKGIVTSEFGNGIGQSPPLVVNPI 802
K PNC ACP+RGEC+HFASA ASAR ALP P+ E+ + ++ PP+ + P+
Sbjct: 1638 TKSRPNCNACPMRGECRHFASAYASARLALPAPAPDERSLTSATIPVPPKSFPPVAI-PM 1696
Query: 803 PVIRIEADPVSESGYQIN--NCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTL 860
+ + + G N NCEPIIEEP SP +C+E ES+I+D ++ D +EIPT+
Sbjct: 1697 IELPLPLEKALARGAPSNRENCEPIIEEPASPEQECTEITESDIEDAYYN-EDPDEIPTI 1755
Query: 861 RLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYV 917
+L +F ME N + LQ+ S ALVAL A S+PT KLK + LRTEH VY
Sbjct: 1756 KLNIEQFGMTLREHMERN-MELQEGDMSKALVALNPTATSIPTPKLKNISRLRTEHQVYQ 1814
Query: 918 LPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSC 976
LPD+H LL G D+R+PDDPSPYLLAIW+PGE+ NS P+ +C K S K+C E C C
Sbjct: 1815 LPDSHPLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSEC 1874
Query: 977 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1036
N+V E N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S +PI+VPR I +L
Sbjct: 1875 NNVREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLNPIDVPRDWIWDL 1934
Query: 1037 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
R V FG+S T++F L QIQ FW+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 1935 PRRTVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1991
Score = 136 bits (343), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 86/190 (45%), Positives = 120/190 (63%), Gaps = 21/190 (11%)
Query: 80 TTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEK 139
+ A+II + ++L + D + NA+V Y+ GA+VP++ K
Sbjct: 913 SIAEIIYRMQNLNLGDKNRE------------QEQNALVLYR-----GDGAVVPYE--SK 953
Query: 140 RKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARM 199
++K +P+V LD T R+WNL+M + D+ ++ EKWW++ER VF GR SF ARM
Sbjct: 954 KQKPRPKVDLDDETTRIWNLLMGKGE-KEGDEEMDKKKEKWWEEERRVFRGRADSFIARM 1012
Query: 200 HLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENL 259
HL+ GDRRF PWKGSVVDSV+GV+LTQNV+D+LSSSA+MSLAARFP KS N + N+
Sbjct: 1013 HLVQGDRRFSPWKGSVVDSVIGVFLTQNVTDHLSSSAFMSLAARFPPKS-NSSREDERNV 1071
Query: 260 RTTASLEPIG 269
R+ +P G
Sbjct: 1072 RSVVVEDPEG 1081
>gi|449451249|ref|XP_004143374.1| PREDICTED: protein ROS1-like [Cucumis sativus]
Length = 1782
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 325/598 (54%), Positives = 415/598 (69%), Gaps = 35/598 (5%)
Query: 514 SFRTGMP--QAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLL 571
S+ TG P HD + +SK + D K+D K + + V DWD L
Sbjct: 1188 SWETGEPAQNKHDHSLSSKFN------------DPKSDIIKPNRERVKKEKRVGVDWDSL 1235
Query: 572 RR-IYSTGE-ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVEL 629
R+ + +TG +R+ + MDS+DWEAVRCAD +IA I+ERG N +A RIK+FL+RLV+
Sbjct: 1236 RKQVEATGRRDRTTNTMDSLDWEAVRCADIDDIAYTIRERGMNNRLAERIKDFLDRLVKD 1295
Query: 630 HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGW 689
HGS DLEWLR+VPPD+ KEYLL I GLGLKSVECVRLL+LQ +AFPVD NVGRIAVRLGW
Sbjct: 1296 HGSTDLEWLRDVPPDQAKEYLLSIRGLGLKSVECVRLLTLQQVAFPVDTNVGRIAVRLGW 1355
Query: 690 VPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSP 749
VPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K P
Sbjct: 1356 VPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKP 1415
Query: 750 NCGACPLRGECKHFASAVASARFALPGPSEKGIV-TSEFGNGIGQSPPLVVNPIPVIRIE 808
NC ACP+RGEC+HFASA ASAR +LP P EK ++ +E I Q+ + P+ + + E
Sbjct: 1416 NCNACPMRGECRHFASAFASARLSLPAPEEKSLINATERKADINQAVVVHQQPLALTQ-E 1474
Query: 809 ADPVSESGYQI-------NNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLR 861
++P+ ES Q+ +N +PIIEEP++P P+C + SEID +D D +EIPT++
Sbjct: 1475 SEPI-ESIQQLISVKSGGSNKDPIIEEPQTPEPECPQI--SEIDIEDTLYEDPDEIPTIK 1531
Query: 862 LQEREFKENFPNFMEMNKVMLQDSS---ALVALTAEAASVPTRKLKRCAHLRTEHHVYVL 918
L F ++ N+M+ N + LQ+ S ALV L+ EAAS+P KLK + LRTEH VY L
Sbjct: 1532 LNIEAFTKHVQNYMQEN-MELQEGSMSKALVLLSPEAASIPMPKLKNISRLRTEHQVYEL 1590
Query: 919 PDAHELLRG--FDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYS 975
PD+H LL +RR+PDDP YLLAIW+PGE+ NSV C+S++S LC + C+S
Sbjct: 1591 PDSHPLLEKLQLERREPDDPCFYLLAIWTPGETANSVEQLHAHCSSQESGGLCGEKECFS 1650
Query: 976 CNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIAN 1035
CNSV E ++ +VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S +PI+VPR +
Sbjct: 1651 CNSVREPDSEVVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLNPIDVPRSWLWK 1710
Query: 1036 LRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
L R V FG+S T+F L +IQ FW+G+VCVRGFD+ R PRPL+ RLH SK
Sbjct: 1711 LSRRTVYFGTSIPTIFKGLSTEEIQGCFWKGYVCVRGFDQTTRAPRPLIARLHFPASK 1768
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/283 (33%), Positives = 151/283 (53%), Gaps = 20/283 (7%)
Query: 114 HNAVVPYQVGPSSEHGALVPHQ-----IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTT 168
H+A+V Y+ G++VP + IK++R+ K V LD T+R+W L+M D +
Sbjct: 704 HSALVVYRGD-----GSIVPFEGALDPIKKRRRFAK--VDLDEETVRVWKLLM---DNSN 753
Query: 169 KD--QTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQ 226
K+ + S+E +KWW++ER VF GR SF A+MHLI GDR F WKGSV+DSV+GV+LTQ
Sbjct: 754 KELVEGSDEAKDKWWEEERSVFSGRTDSFIAKMHLIQGDRGFSQWKGSVLDSVIGVFLTQ 813
Query: 227 NVSDNLSSSAYMSLAARFPLKSTNDHTPS-DENLRTTASLEPIGSNSTSNGAVYDSEGNM 285
NVSD+LSSSA+MSLAAR+PLKS + H S DE + + + ++ + +
Sbjct: 814 NVSDHLSSSAFMSLAARYPLKSKSLHESSVDEQTSLILNESQVTLCQAEDSVIWAKQISD 873
Query: 286 YFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPSQAFDTSSVQSLDRTQ 345
+ + CE+ ++ F + + G + ++ + + L+ +
Sbjct: 874 QSICKQSCTTVCEIDQAEENFLTSSDSSGSKSAGVTSMRGYQCSVTSYSSKKIVELEDRR 933
Query: 346 LFPTGNSKADVASSTKTSNAESFITQFSHTGNLKKNSVNQLFP 388
L N+ + S A++ I+ S + ++S+N L P
Sbjct: 934 LTTEINTTVEACSLGNEKTADAAIS--SQMSVVSEHSINSLCP 974
>gi|222617989|gb|EEE54121.1| hypothetical protein OsJ_00892 [Oryza sativa Japonica Group]
Length = 1964
Score = 625 bits (1612), Expect = e-176, Method: Compositional matrix adjust.
Identities = 317/589 (53%), Positives = 391/589 (66%), Gaps = 42/589 (7%)
Query: 549 DRTKKTTPKKNSDNTVQQDWDLLRR--IYSTG-EERSHDKMDSVDWEAVRCADESEIADA 605
D +K P+ + DWD+LR+ +YS G +ERS + DS+DWE +R A+ EI+D
Sbjct: 1384 DGSKAKRPRVGAGKKKTYDWDMLRKEVLYSHGNKERSQNAKDSIDWETIRQAEVKEISDT 1443
Query: 606 IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVEC-- 663
I+ERG N++A RIK+FLNRLV HGSIDLEWLR V DK K+YLL I GLGLKSVEC
Sbjct: 1444 IRERGMNNMLAERIKDFLNRLVRDHGSIDLEWLRYVDSDKAKDYLLSIRGLGLKSVECNM 1503
Query: 664 -----------VRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
R+ S+ + VD NVGRI VRLGWVPLQPLP L +HLL+ YP+++
Sbjct: 1504 CEAWMGATSAPTRVSSVAPVG-DVDTNVGRICVRLGWVPLQPLPESLQLHLLEMYPMLEN 1562
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARF 772
IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K PNC ACP+R ECKHFASA ASAR
Sbjct: 1563 IQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRAECKHFASAFASARL 1622
Query: 773 ALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPV--------SESGYQINNCEP 824
ALPGP EK +VTS P+ I + PV + + +NN +P
Sbjct: 1623 ALPGPEEKSLVTS--------GTPIAAETFHQTYISSRPVVSQLEWNSNTCHHGMNNRQP 1674
Query: 825 IIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD 884
IIEEP SP P+ E +D + D EEIPT++L EF +N ++M+ N + ++D
Sbjct: 1675 IIEEPASPEPEHETEEMKECAIEDSFVDDPEEIPTIKLNFEEFTQNLKSYMQANNIEIED 1734
Query: 885 ---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLL 941
S ALVA+T E AS+PT KLK + LRTEH VY LPD+H LL GF++R+PDDP PYLL
Sbjct: 1735 ADMSKALVAITPEVASIPTPKLKNVSRLRTEHQVYELPDSHPLLEGFNQREPDDPCPYLL 1794
Query: 942 AIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANR 1000
+IW+PGE+ S +P+ CNS+++ +LC S C+SCNS+ E A VRGT+LIPCRTA R
Sbjct: 1795 SIWTPGETAQSTDAPKSVCNSQENGELCASNTCFSCNSIREAQAQKVRGTLLIPCRTAMR 1854
Query: 1001 GRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQ 1060
G FPLNGTYFQVNEVFADH++S +PI+VPR I NL R V FG+S T+F L +IQ
Sbjct: 1855 GSFPLNGTYFQVNEVFADHDSSRNPIDVPRSWIWNLPRRTVYFGTSIPTIFKGLTTEEIQ 1914
Query: 1061 EYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPS-----KKEAGSKRSRDE 1104
FWRGFVCVRGFDR R PRPL RLH S KK AGS RD+
Sbjct: 1915 HCFWRGFVCVRGFDRTSRAPRPLYARLHFPASKITRNKKSAGSAPGRDD 1963
Score = 137 bits (345), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 110/164 (67%), Gaps = 7/164 (4%)
Query: 102 LQCNKADGPSNV--HNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNL 159
L NK++ P H A+VPY + E G +VP + K KRK+ + +V LDP T MW L
Sbjct: 824 LDINKSEDPVTAEPHGALVPY----NGEFGPIVPFEGKVKRKRSRAKVDLDPVTALMWKL 879
Query: 160 IMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSV 219
+M D + ++D EKW +ER++F+GR+ SF ARMHL+ GDRRF PWKGSVVDSV
Sbjct: 880 LMGPDMSDCAEGM-DKDKEKWLNEERKIFQGRVDSFIARMHLVQGDRRFSPWKGSVVDSV 938
Query: 220 VGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTA 263
VGV+LTQNVSD+LSSSA+M+LAA+FP+K P++ T +
Sbjct: 939 VGVFLTQNVSDHLSSSAFMALAAKFPVKPEASEKPANVMFHTIS 982
>gi|357142343|ref|XP_003572540.1| PREDICTED: uncharacterized protein LOC100823274 [Brachypodium
distachyon]
Length = 1946
Score = 625 bits (1611), Expect = e-176, Method: Compositional matrix adjust.
Identities = 308/534 (57%), Positives = 381/534 (71%), Gaps = 7/534 (1%)
Query: 567 DWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DW+ LR+ S ++RSH+ DSVDWEAVR AD +I++ I+ERG N++A RIKEFL
Sbjct: 1403 DWENLRKEVSCNGGNKQRSHNTKDSVDWEAVRQADVRDISETIRERGMNNVLAERIKEFL 1462
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLV HGSIDLEWLR++ PDK K+YLL I GLGLKS ECVRLL+L H+AFPVD NV RI
Sbjct: 1463 NRLVSDHGSIDLEWLRDLQPDKAKDYLLSIRGLGLKSAECVRLLTLHHMAFPVDTNVARI 1522
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
VRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ TLYELHYQMITFGK+F
Sbjct: 1523 CVRLGWVPLQPLPESLQLHLLELYPMLEHIQKYLWPRLCELDQLTLYELHYQMITFGKVF 1582
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC +CP+R ECKHFASA ASAR +LPGP EK +V SE N ++P P
Sbjct: 1583 CTKSKPNCNSCPMRAECKHFASAFASARLSLPGPEEKSLVASEATNAAESCHQTYIDPRP 1642
Query: 804 VIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
V ++E + N +PI+EEP SP P+ + ++ +DF + +EIPT+ L
Sbjct: 1643 VGQLEWNTNDCRHPGSGNHQPIVEEPSSPEPEPEIAETKQVSIEDFFTEEPDEIPTISLN 1702
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
+EFK+N ++M+ N + ++D S ALVA+T EAAS+PT +LK + LRTEH VY LPD
Sbjct: 1703 IKEFKQNLKSYMQANNIEIEDADMSRALVAITPEAASIPTPRLKNVSRLRTEHQVYELPD 1762
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL GFDRR DDP PYLL+IW+PGE+ S +P+ CNS+++ KLC S C+SC+S
Sbjct: 1763 SHPLLEGFDRRQTDDPCPYLLSIWTPGETAQSTDAPKTFCNSEETGKLCGSSTCFSCSSA 1822
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E A VR TILIPCRTA RG FPLNGTYFQVNE+FADH +S +PI+V R I NL R
Sbjct: 1823 REVQARKVRATILIPCRTAMRGSFPLNGTYFQVNELFADHYSSQNPIDVERSLIWNLPRR 1882
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
V FG+S T+F L +IQ+ FWRGFVCVRGFDR+ R PRPL RLH SK
Sbjct: 1883 TVYFGTSIPTIFRGLTTEEIQQCFWRGFVCVRGFDRKLRAPRPLFPRLHFPASK 1936
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/140 (55%), Positives = 101/140 (72%), Gaps = 13/140 (9%)
Query: 129 GALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNID--DGTTKDQTSNEDMEKWWQKERE 186
GALVP + K+K+ + +V +DP T MWNL+M D DG + ++D EKW ++ER+
Sbjct: 842 GALVPFEGNVKKKRSRAKVNMDPVTALMWNLLMAPDMCDGA---EGMDKDKEKWLEEERK 898
Query: 187 VFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 246
VF GRI SF ARMHL+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LSSSA+MS+A++FP+
Sbjct: 899 VFRGRIDSFIARMHLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLSSSAFMSVASKFPV 958
Query: 247 KSTN--------DHTPSDEN 258
K + HTP ++N
Sbjct: 959 KLEDPEKPAARVSHTPPEQN 978
>gi|330370551|gb|AEC12445.1| DNA N-glycosylase/DNA-(apurinic or apyrimidinic site) lyase
[Gossypium hirsutum]
Length = 2055
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 311/540 (57%), Positives = 382/540 (70%), Gaps = 20/540 (3%)
Query: 544 SDMKNDRTKKTTPKKNSDNTVQQDWDLLRR-IYSTG--EERSHDKMDSVDWEAVRCADES 600
S +K R K K N+ DWD LR+ + + G +ERS D MDS+D+EA+R A+ +
Sbjct: 1527 SSLKPKRRKAQEGKNNAT-----DWDQLRKQVQANGLKKERSKDTMDSLDYEAMRNANVN 1581
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
EI++ IKERG N++A RIK+FLNRLV H SIDLEWLR+VPPDK K+YLL I GLGLKS
Sbjct: 1582 EISNTIKERGMNNMLAERIKDFLNRLVRDHESIDLEWLRDVPPDKAKDYLLSIRGLGLKS 1641
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
VECVRLL+L H+AFPVD NVGRIAVRLGWVPLQP P L +HLL+ YP+++ IQ YLWPR
Sbjct: 1642 VECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPPPESLQLHLLELYPILESIQKYLWPR 1701
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEK 780
LC LDQ TLYELHYQMITFGK+FC K PNC ACP+RGEC+HFA A ASARFALPGP E+
Sbjct: 1702 LCKLDQYTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFAGAFASARFALPGPEER 1761
Query: 781 GIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVS---ESGYQINNCEPIIEEPRSPGPQCS 837
I +S +P VN IP+ PV + G + N EPIIEEP +P P+ +
Sbjct: 1762 SITSSTAPMISETNPTRAVNQIPL----PPPVHNLLKVGPNVGNNEPIIEEPTTPEPEHA 1817
Query: 838 ESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD--SSALVALTAEA 895
E ES D +D D +EIPT++L EF N ++M+ N + S ALVAL A
Sbjct: 1818 EGSES--DTEDACYDDPDEIPTIKLNIEEFTANLQHYMQGNMEPQEGDLSKALVALNPNA 1875
Query: 896 ASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGS 955
AS+PT KLK + LRTEH VY LPD H LL+ ++R+PDDPSPYLLAIW+PGE+ NS+
Sbjct: 1876 ASIPTPKLKNVSRLRTEHCVYELPDKHPLLKQMEKREPDDPSPYLLAIWTPGETANSIQP 1935
Query: 956 PQYRCNSKD-SKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNE 1014
P+ C S++ +LCN + C++CNSV E N VRGTILIPCRTA RG FPLNGTYFQVNE
Sbjct: 1936 PEQSCGSQEPGRLCNEKTCFACNSVREANTETVRGTILIPCRTAMRGSFPLNGTYFQVNE 1995
Query: 1015 VFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFD 1074
VFADH++S +P++VPR I NL R V FG+S +++F L IQ FW+GFVCVRGFD
Sbjct: 1996 VFADHDSSLNPVDVPREWIWNLPRRTVYFGTSVSSIFKGLSTEGIQYCFWKGFVCVRGFD 2055
Score = 162 bits (410), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/169 (51%), Positives = 113/169 (66%), Gaps = 21/169 (12%)
Query: 83 DIIRKFKDLTIRDGGSQLPLQCNKADGP-SNVHNAVVPYQVGPSSEHGALVPHQIKEK-R 140
+II +FK LT+ + K + P + + NA+V Y G +VP + E +
Sbjct: 969 EIINRFKGLTLEE----------KNNKPKAELQNALVLYNGA-----GTVVPFEGFESIK 1013
Query: 141 KKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMH 200
KK++P V LDP T R+WNL+M G + T D EKWW++ER VF GR+ SF ARMH
Sbjct: 1014 KKVRPRVDLDPETNRVWNLLM----GKEGEDTEGTDKEKWWEEERRVFHGRVDSFIARMH 1069
Query: 201 LILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKST 249
L+ GDRRF WKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAA+FPLKS+
Sbjct: 1070 LVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFPLKSS 1118
>gi|42569673|ref|NP_181190.3| protein ROS1 [Arabidopsis thaliana]
gi|71658826|sp|Q9SJQ6.2|ROS1_ARATH RecName: Full=Protein ROS1; AltName: Full=DEMETER-like protein 1;
AltName: Full=Repressor of silencing 1
gi|30909281|gb|AAP37178.1| ROS1 [Arabidopsis thaliana]
gi|330254169|gb|AEC09263.1| protein ROS1 [Arabidopsis thaliana]
Length = 1393
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 312/538 (57%), Positives = 374/538 (69%), Gaps = 22/538 (4%)
Query: 567 DWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LRR + E++ MD+VDW+A+R AD E+A+ IK RG + +A RI+ FL
Sbjct: 858 DWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFL 917
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
+RLV HGSIDLEWLR+VPPDK KEYLL GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 918 DRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRI 977
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQKTLYELHYQMITFGK+F
Sbjct: 978 AVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVF 1037
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN-PI 802
C K PNC ACP++GEC+HFASA ASAR ALP ++E G G PL ++ P
Sbjct: 1038 CTKSKPNCNACPMKGECRHFASAFASARLALP--------STEKGMGTPDKNPLPLHLPE 1089
Query: 803 PVIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPT 859
P R + V SE ++ CEPIIEEP SP P+ +E ++I++ F D EEIPT
Sbjct: 1090 PFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSIADIEEAFFE--DPEEIPT 1147
Query: 860 LRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVY 916
+RL F N ME NK LQD SSALVALTAE AS+P KLK + LRTEH VY
Sbjct: 1148 IRLNMDAFTSNLKKIMEHNK-ELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVY 1206
Query: 917 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC-NSKDSKLCNSEICYS 975
LPD H LL ++R+PDDP YLLAIW+PGE+ +S+ C + LC+ E C+S
Sbjct: 1207 ELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFS 1266
Query: 976 CNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIAN 1035
CNS+ E + IVRGTILIPCRTA RG FPLNGTYFQVNEVFADH +S +PINVPR I
Sbjct: 1267 CNSIKETRSQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWE 1326
Query: 1036 LRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
L R V FG+S T+F L +IQ FW+G+VCVRGFDR+ R P+PL+ RLH SK
Sbjct: 1327 LPRRTVYFGTSVPTIFKGLSTEKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASK 1384
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 72/136 (52%), Positives = 93/136 (68%), Gaps = 7/136 (5%)
Query: 116 AVVPYQVGP-----SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKD 170
A+VPY + GA+VP K+ + +P+V LD T R+W L++ + D
Sbjct: 494 ALVPYTMNSQIVLFGGGAGAIVP-VTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVD 552
Query: 171 QTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSD 230
S+E KWW++ER VF GR SF ARMHL+ GDRRF PWKGSVVDSVVGV+LTQNVSD
Sbjct: 553 -GSDEQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSD 611
Query: 231 NLSSSAYMSLAARFPL 246
+LSSSA+MSLA++FP+
Sbjct: 612 HLSSSAFMSLASQFPV 627
>gi|297827135|ref|XP_002881450.1| hypothetical protein ARALYDRAFT_482629 [Arabidopsis lyrata subsp.
lyrata]
gi|297327289|gb|EFH57709.1| hypothetical protein ARALYDRAFT_482629 [Arabidopsis lyrata subsp.
lyrata]
Length = 1432
Score = 620 bits (1600), Expect = e-174, Method: Compositional matrix adjust.
Identities = 317/540 (58%), Positives = 382/540 (70%), Gaps = 20/540 (3%)
Query: 567 DWDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LRR E E++ MD+VDWEA+R AD SE+A+ IK RG + +A RI+ FL
Sbjct: 897 DWDSLRREAQGREGIREKTARTMDTVDWEAIRAADVSEVAETIKSRGMNHKLAERIQGFL 956
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
+RLV+ HGSIDLEWLR+VPPDK KEYLL GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 957 DRLVDDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRI 1016
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQKTLYELHYQMITFGK+F
Sbjct: 1017 AVRLGWVPLQPLPESLQLHLLEMYPILESIQKYLWPRLCKLDQKTLYELHYQMITFGKVF 1076
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC ACP+RGEC+HFASA ASAR ALP +EKG+ T + ++P + P P
Sbjct: 1077 CTKSKPNCNACPMRGECRHFASAFASARLALPS-TEKGMATPD------KNPLPLHLPEP 1129
Query: 804 VIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTL 860
+ R + V SE ++ EPIIEEP SP P+ ++ ++I+D F D EEIPT+
Sbjct: 1130 LQREQGSEVVQHSEPAKKVTCSEPIIEEPASPEPESAQVSIADIEDAFFE--DPEEIPTI 1187
Query: 861 RLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYV 917
RL F N ME NK LQD SSALVALTAEAAS+P KLK + LRTEH VY
Sbjct: 1188 RLNMDAFTSNLKKLMEHNK-ELQDGNMSSALVALTAEAASLPIPKLKNISQLRTEHQVYE 1246
Query: 918 LPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC-NSKDSKLCNSEICYSC 976
LPD H LL ++R+PDDP YLLAIW+PGE+ +S+ +C + KLC+ E C+SC
Sbjct: 1247 LPDEHPLLVHLEKREPDDPCSYLLAIWTPGETADSIQPAVSKCIFQANGKLCDEETCFSC 1306
Query: 977 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1036
NS+ E IVRGTILIPCRTA RG FPLNGTYFQVNEVFADH +S +PI+VPR I +L
Sbjct: 1307 NSIKEARTQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPIDVPREWIWDL 1366
Query: 1037 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEA 1096
R V FG+S T+F L IQ+ FW+G+VCVRGFDR R P+PL+ RLH SK +A
Sbjct: 1367 PRRTVYFGTSIPTIFKGLPTETIQQCFWKGYVCVRGFDRTTRGPKPLIARLHFPASKLKA 1426
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/145 (51%), Positives = 94/145 (64%), Gaps = 10/145 (6%)
Query: 112 NVHNAVVPYQVGPSSEH--------GALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNI 163
N A++PY + GA+VP +KR+ +P+V LD T R+W L++
Sbjct: 491 NSETALIPYSMRSQGNQIILFGGGAGAIVPVTPVKKRRP-RPKVDLDDETDRVWKLLLEN 549
Query: 164 DDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVY 223
+ D S+E KWW++ER VF GR SF ARMHL+ GDRRF PWKGSVVDSVVGV+
Sbjct: 550 INSEGVD-GSDEQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVF 608
Query: 224 LTQNVSDNLSSSAYMSLAARFPLKS 248
LTQNVSD+LSSSA+MSLAA FP S
Sbjct: 609 LTQNVSDHLSSSAFMSLAAEFPAPS 633
>gi|297827133|ref|XP_002881449.1| hypothetical protein ARALYDRAFT_902767 [Arabidopsis lyrata subsp.
lyrata]
gi|297327288|gb|EFH57708.1| hypothetical protein ARALYDRAFT_902767 [Arabidopsis lyrata subsp.
lyrata]
Length = 1619
Score = 619 bits (1595), Expect = e-174, Method: Compositional matrix adjust.
Identities = 311/537 (57%), Positives = 384/537 (71%), Gaps = 20/537 (3%)
Query: 567 DWDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LRR E E++ MDSVDWEA+R AD SE+A+ IK+RG +++A RI+ FL
Sbjct: 1076 DWDSLRREAEGREGKREKTTRTMDSVDWEAIRTADVSEVAETIKKRGMNHMLAERIQGFL 1135
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLV HGSIDLEWLR++PPDK KEYLL GLGLKSVECVRLL+L H+AFPVD NV RI
Sbjct: 1136 NRLVNEHGSIDLEWLRDIPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVARI 1195
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQKTLYELHYQMITFGK+F
Sbjct: 1196 AVRLGWVPLQPLPESLQLHLLEMYPILESIQKYLWPRLCKLDQKTLYELHYQMITFGKVF 1255
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC ACP+RGEC+HFASA SAR ALP +EKG+ T + ++P + P P
Sbjct: 1256 CTKSKPNCNACPMRGECRHFASAFVSARLALPS-TEKGMGTPD------KNPLPLHLPEP 1308
Query: 804 VIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTL 860
+ R + V SE ++ CEPIIEEP SP P+ ++ ++I+D F D EEIPT+
Sbjct: 1309 LYREQGSEVKQHSEPAKKVTCCEPIIEEPASPEPESAQVSIADIEDAFFE--DPEEIPTI 1366
Query: 861 RLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYV 917
RL F N +E N+ LQD S+ALVALTAEAA +P KLK + LRTEH VY
Sbjct: 1367 RLNTDAFTSNLKKIIEQNQ-ELQDGNMSTALVALTAEAAYLPMPKLKNISQLRTEHQVYE 1425
Query: 918 LPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSC 976
LPD+H LL ++R+PDDP YLLAIW+PGE+ +S+ +C S+++ ++C+ E C+SC
Sbjct: 1426 LPDSHPLLVQLEKREPDDPCSYLLAIWTPGETADSIQPTVSKCISQENGQICDEETCFSC 1485
Query: 977 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1036
NS+ E + VRGTILIPCRTA RG FPLNGTYFQVNEVFADH +S +PI+VPR T+ +L
Sbjct: 1486 NSIKEARSQTVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPIDVPRETLWDL 1545
Query: 1037 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
R V FG+S T+F L IQ+ FWRG+VCVRGFDR R P+PL+ RLH SK
Sbjct: 1546 TRRAVYFGTSIPTIFKGLSTETIQQCFWRGYVCVRGFDRETRGPKPLIARLHFPVSK 1602
Score = 97.4 bits (241), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 72/176 (40%), Positives = 94/176 (53%), Gaps = 41/176 (23%)
Query: 112 NVHNAVVPYQVGPSSEH--------GALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNI 163
N A+VPY + GA+VP +KR+ +P+V LD T ++W L++
Sbjct: 600 NSETALVPYSMKTQGNQIVLFGGGAGAIVPVTPVKKRRP-RPKVDLDDETEKVWKLLLEN 658
Query: 164 DDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVY 223
+ D S++ KWW++ER VF GR SF ARMHL+ GDRRF PWKGSVVDSVVGV+
Sbjct: 659 INSEGID-GSDDQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVF 717
Query: 224 LTQNVSDNLS-------------------------------SSAYMSLAARFPLKS 248
LTQNVSD+LS SSA+MSL + FP+ S
Sbjct: 718 LTQNVSDHLSRYKHLRNSTSKPNPTQQQVTKLTCFVFFCPCSSAFMSLVSEFPVTS 773
>gi|218196882|gb|EEC79309.1| hypothetical protein OsI_20143 [Oryza sativa Indica Group]
Length = 1873
Score = 616 bits (1589), Expect = e-173, Method: Compositional matrix adjust.
Identities = 311/568 (54%), Positives = 387/568 (68%), Gaps = 23/568 (4%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAI 606
+ +TT KKNS+N DWD LRR +ER D+ DSVDWEAVRCAD I+ AI
Sbjct: 1313 KKSRTTSKKNSENF---DWDKLRRQACGNYQMKERIFDRRDSVDWEAVRCADVQRISHAI 1369
Query: 607 KERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
+ERG N++A RI++FLNRLV HGSIDLEWLR+VPPD K+YLL I GLGLKSVECVRL
Sbjct: 1370 RERGMNNVLAERIQKFLNRLVTDHGSIDLEWLRDVPPDSAKDYLLSIRGLGLKSVECVRL 1429
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
L+L H+AFPVD NVGRI VRLGWVP+QPLP L +HLL+ YPV++ IQ YLWPRLC LDQ
Sbjct: 1430 LTLHHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYLWPRLCKLDQ 1489
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSE 786
+TLYELHYQMITFGK+FC K +PNC ACP+R EC+HFASA ASAR ALP P +K +V
Sbjct: 1490 QTLYELHYQMITFGKVFCTKSTPNCNACPMRSECRHFASAFASARLALPSPQDKRLVNLS 1549
Query: 787 ----FGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPES 842
F NG +P N P+ ++E + NN PIIEEP SP +
Sbjct: 1550 NQFAFHNGTMPTP----NSTPLPQLEGS-IHARDVHANNTNPIIEEPASPREEECRELLE 1604
Query: 843 EIDDDDFSIGDIEEIPTLRLQEREFKENFPNFM-EMNKVMLQD--SSALVALTAEAASVP 899
D +DF D +EIPT++L F +N N + E NK D + ALVA++ EAAS+P
Sbjct: 1605 N-DIEDFD-EDTDEIPTIKLNMEAFSQNLENCIKESNKDFQSDDITKALVAISNEAASIP 1662
Query: 900 TRKLKRCAHLRTEHHVYVLPDAHELLR--GFDRRDPDDPSPYLLAIWSPGESPNSVGSPQ 957
KLK LRTEH+VY LPD+H L++ ++R+PDDPSPYLLAIW+P E ++ +P+
Sbjct: 1663 VPKLKNVHRLRTEHYVYELPDSHPLMQQLALEQREPDDPSPYLLAIWTPDELKDTREAPK 1722
Query: 958 YRCNSK-DSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVF 1016
CN + + LC++E+C++C S E VRGT+L+PCRTA RG FPLNGTYFQVNEVF
Sbjct: 1723 PCCNPQTEGGLCSNEMCHNCVSERENQYRYVRGTVLVPCRTAMRGSFPLNGTYFQVNEVF 1782
Query: 1017 ADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRR 1076
ADH +SH+PIN+PR + NL R +V FG+S T+F L +IQ FWRGFVCVRGF+
Sbjct: 1783 ADHSSSHNPINIPREQLWNLHRRMVYFGTSVPTIFKGLTTEEIQHCFWRGFVCVRGFNME 1842
Query: 1077 YRCPRPLVNRLHCSPSKKEAGSKRSRDE 1104
R PRPL H + SK SK++ E
Sbjct: 1843 TRAPRPLCPHFHLAASKLRRSSKKAATE 1870
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/185 (45%), Positives = 116/185 (62%), Gaps = 23/185 (12%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIK---EKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQ 171
NA+VPY VG GA+VP+Q + K+++ + +V LD T R+WNL+M G D
Sbjct: 836 NALVPY-VG-----GAVVPYQTQFHLVKKQRPRAKVDLDFETTRVWNLLM----GKAADP 885
Query: 172 TSNEDM--EKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVS 229
D+ E+WW++EREVF+GR SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNV+
Sbjct: 886 VDGTDVDKERWWKQEREVFQGRANSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVA 945
Query: 230 DNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVT 289
D+LSSSAYM+LAA FP T H ++ + + E I +++ + ++ +F
Sbjct: 946 DHLSSSAYMALAASFP---TGSHGNCNDGIAGQDNEEIISTSAVGDRGTFE-----FFYN 997
Query: 290 EPEPD 294
PD
Sbjct: 998 GSRPD 1002
>gi|222631766|gb|EEE63898.1| hypothetical protein OsJ_18723 [Oryza sativa Japonica Group]
Length = 1857
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 311/568 (54%), Positives = 385/568 (67%), Gaps = 23/568 (4%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAI 606
+ +TT KKNS+N DWD LRR +ER D+ DSVDWEAVRCAD I+ AI
Sbjct: 1297 KKSRTTSKKNSENF---DWDKLRRQACGNYQMKERIFDRRDSVDWEAVRCADVQRISHAI 1353
Query: 607 KERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
+ERG N++A RI++FLNRLV HGSIDLEWLR+VPPD K+YLL I GLGLKSVECVRL
Sbjct: 1354 RERGMNNVLAERIQKFLNRLVTDHGSIDLEWLRDVPPDSAKDYLLSIRGLGLKSVECVRL 1413
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
L+L H+AFPVD NVGRI VRLGWVP+QPLP L +HLL+ YPV++ IQ YLWPRLC LDQ
Sbjct: 1414 LTLHHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYLWPRLCKLDQ 1473
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSE 786
+TLYELHYQMITFGK+FC K PNC ACP+R EC+HFASA ASAR ALP P +K +V
Sbjct: 1474 QTLYELHYQMITFGKVFCTKSKPNCNACPMRSECRHFASAFASARLALPSPQDKRLVNLS 1533
Query: 787 ----FGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPES 842
F NG +P N P+ ++E + NN PIIEEP SP +
Sbjct: 1534 NQFAFHNGTMPTP----NSTPLPQLEGS-IHARDVHANNTNPIIEEPASPREEECRELLE 1588
Query: 843 EIDDDDFSIGDIEEIPTLRLQEREFKENFPNFM-EMNKVMLQD--SSALVALTAEAASVP 899
D +DF D +EIP ++L F +N N + E NK D + ALVA++ EAAS+P
Sbjct: 1589 N-DIEDFD-EDTDEIPIIKLNMEAFSQNLENCIKESNKDFQSDDITKALVAISNEAASIP 1646
Query: 900 TRKLKRCAHLRTEHHVYVLPDAHELLR--GFDRRDPDDPSPYLLAIWSPGESPNSVGSPQ 957
KLK LRTEH+VY LPD+H L++ D+R+PDDPSPYLLAIW+P E ++ +P+
Sbjct: 1647 VPKLKNVHRLRTEHYVYELPDSHPLMQQLALDQREPDDPSPYLLAIWTPDELKDTREAPK 1706
Query: 958 YRCNSK-DSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVF 1016
CN + + LC++E+C++C S E VRGT+L+PCRTA RG FPLNGTYFQVNEVF
Sbjct: 1707 PCCNPQTEGGLCSNEMCHNCVSERENQYRYVRGTVLVPCRTAMRGSFPLNGTYFQVNEVF 1766
Query: 1017 ADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRR 1076
ADH +SH+PIN+PR + NL R +V FG+S T+F L +IQ FWRGFVCVRGF+
Sbjct: 1767 ADHSSSHNPINIPREQLWNLHRRMVYFGTSVPTIFKGLTTEEIQHCFWRGFVCVRGFNME 1826
Query: 1077 YRCPRPLVNRLHCSPSKKEAGSKRSRDE 1104
R PRPL H + SK SK++ E
Sbjct: 1827 TRAPRPLCPHFHLAASKLRRSSKKAATE 1854
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/185 (45%), Positives = 116/185 (62%), Gaps = 23/185 (12%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIK---EKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQ 171
NA+VPY VG GA+VP+Q + K+++ + +V LD T R+WNL+M G D
Sbjct: 820 NALVPY-VG-----GAVVPYQTQFHLVKKQRPRAKVDLDFETTRVWNLLM----GKAADP 869
Query: 172 TSNEDM--EKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVS 229
D+ E+WW++EREVF+GR SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNV+
Sbjct: 870 VDGTDVDKERWWKQEREVFQGRANSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVA 929
Query: 230 DNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVT 289
D+LSSSAYM+LAA FP T H ++ + + E I +++ + ++ +F
Sbjct: 930 DHLSSSAYMALAASFP---TGSHGNCNDGIAGQDNEEIISTSAVGDRGTFE-----FFYN 981
Query: 290 EPEPD 294
PD
Sbjct: 982 GSRPD 986
>gi|220682961|gb|ACL80319.1| DNA glycosylase/lyase 701 [Oryza sativa Japonica Group]
Length = 1812
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 311/568 (54%), Positives = 385/568 (67%), Gaps = 23/568 (4%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAI 606
+ +TT KKNS+N DWD LRR +ER D+ DSVDWEAVRCAD I+ AI
Sbjct: 1252 KKSRTTSKKNSENF---DWDKLRRQACGNYQMKERIFDRRDSVDWEAVRCADVQRISHAI 1308
Query: 607 KERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
+ERG N++A RI++FLNRLV HGSIDLEWLR+VPPD K+YLL I GLGLKSVECVRL
Sbjct: 1309 RERGMNNVLAERIQKFLNRLVTDHGSIDLEWLRDVPPDSAKDYLLSIRGLGLKSVECVRL 1368
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
L+L H+AFPVD NVGRI VRLGWVP+QPLP L +HLL+ YPV++ IQ YLWPRLC LDQ
Sbjct: 1369 LTLHHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYLWPRLCKLDQ 1428
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSE 786
+TLYELHYQMITFGK+FC K PNC ACP+R EC+HFASA ASAR ALP P +K +V
Sbjct: 1429 QTLYELHYQMITFGKVFCTKSKPNCNACPMRSECRHFASAFASARLALPSPQDKRLVNLS 1488
Query: 787 ----FGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPES 842
F NG +P N P+ ++E + NN PIIEEP SP +
Sbjct: 1489 NQFAFHNGTMPTP----NSTPLPQLEGS-IHARDVHANNTNPIIEEPASPREEECRELLE 1543
Query: 843 EIDDDDFSIGDIEEIPTLRLQEREFKENFPNFM-EMNKVMLQD--SSALVALTAEAASVP 899
D +DF D +EIP ++L F +N N + E NK D + ALVA++ EAAS+P
Sbjct: 1544 N-DIEDFD-EDTDEIPIIKLNMEAFSQNLENCIKESNKDFQSDDITKALVAISNEAASIP 1601
Query: 900 TRKLKRCAHLRTEHHVYVLPDAHELLR--GFDRRDPDDPSPYLLAIWSPGESPNSVGSPQ 957
KLK LRTEH+VY LPD+H L++ D+R+PDDPSPYLLAIW+P E ++ +P+
Sbjct: 1602 VPKLKNVHRLRTEHYVYELPDSHPLMQQLALDQREPDDPSPYLLAIWTPDELKDTREAPK 1661
Query: 958 YRCNSK-DSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVF 1016
CN + + LC++E+C++C S E VRGT+L+PCRTA RG FPLNGTYFQVNEVF
Sbjct: 1662 PCCNPQTEGGLCSNEMCHNCVSERENQYRYVRGTVLVPCRTAMRGSFPLNGTYFQVNEVF 1721
Query: 1017 ADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRR 1076
ADH +SH+PIN+PR + NL R +V FG+S T+F L +IQ FWRGFVCVRGF+
Sbjct: 1722 ADHSSSHNPINIPREQLWNLHRRMVYFGTSVPTIFKGLTTEEIQHCFWRGFVCVRGFNME 1781
Query: 1077 YRCPRPLVNRLHCSPSKKEAGSKRSRDE 1104
R PRPL H + SK SK++ E
Sbjct: 1782 TRAPRPLCPHFHLAASKLRRSSKKAATE 1809
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/185 (45%), Positives = 116/185 (62%), Gaps = 23/185 (12%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIK---EKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQ 171
NA+VPY VG GA+VP+Q + K+++ + +V LD T R+WNL+M G D
Sbjct: 775 NALVPY-VG-----GAVVPYQTQFHLVKKQRPRAKVDLDFETTRVWNLLM----GKAADP 824
Query: 172 TSNEDM--EKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVS 229
D+ E+WW++EREVF+GR SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNV+
Sbjct: 825 VDGTDVDKERWWKQEREVFQGRANSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVA 884
Query: 230 DNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVT 289
D+LSSSAYM+LAA FP T H ++ + + E I +++ + ++ +F
Sbjct: 885 DHLSSSAYMALAASFP---TGSHGNCNDGIAGQDNEEIISTSAVGDRGTFE-----FFYN 936
Query: 290 EPEPD 294
PD
Sbjct: 937 GSRPD 941
>gi|413945495|gb|AFW78144.1| hypothetical protein ZEAMMB73_419140 [Zea mays]
Length = 909
Score = 600 bits (1548), Expect = e-168, Method: Compositional matrix adjust.
Identities = 303/570 (53%), Positives = 381/570 (66%), Gaps = 16/570 (2%)
Query: 528 TSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR-IYSTGE--ERSHD 584
TSK S + + + +TT KN++N DW+ LRR S G+ +RS +
Sbjct: 330 TSKSSFTSYNGVPDTAAQASKPKKTRTTTAKNTENF---DWEKLRRQACSEGQMKQRSFE 386
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
+ DS+DWEAVRCAD I+ AI+ERG N++A RI+ FLNRLV HGSIDLEWLR +PPD
Sbjct: 387 RRDSIDWEAVRCADVRRISHAIRERGMNNVLAERIQNFLNRLVRDHGSIDLEWLRYIPPD 446
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K+YLL I GLGLKSVECVRLL+L H+AFPVD NVGRI VRLGWVP+QPLP L +HLL
Sbjct: 447 SAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLL 506
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC KR PNC ACP+R ECKHFA
Sbjct: 507 ELYPILETIQKYLWPRLCKLDQQTLYELHYQMITFGKVFCTKRQPNCNACPMRSECKHFA 566
Query: 765 SAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEP 824
SA ASAR ALP P E+ +V S +N + R+E + + N EP
Sbjct: 567 SAFASARLALPAPQEESLVKLSNPFAFQNSSMHAMNSTHLPRLEGS-IHSREFLPKNSEP 625
Query: 825 IIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD 884
IIEEP SP E P +++D + EIPT++L F +N N ++ + LQ
Sbjct: 626 IIEEPASPR---EERPPETMENDIEDFYEDGEIPTIKLNMEAFAQNLENCIKESNNELQS 682
Query: 885 ---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLR--GFDRRDPDDPSPY 939
+ ALVA++ EAAS+P KLK LRTEH+VY LPDAH LL+ G D+R+ DDP+PY
Sbjct: 683 DDIAKALVAISTEAASIPVPKLKNVLRLRTEHYVYELPDAHPLLQQLGLDQREHDDPTPY 742
Query: 940 LLAIWSPGESPNSVGSPQYRCNSK-DSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTA 998
LLAIW+P +P+ C+ + LCN+E+C++C + E + VRGTIL+PCRTA
Sbjct: 743 LLAIWTPDGIKEITKTPKPCCDPQMGGDLCNNEMCHNCTAEKENQSRYVRGTILVPCRTA 802
Query: 999 NRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQ 1058
RG FPLNGTYFQVNEVFADH +SH+PI+V R + NL+R +V FG+S T+F L +
Sbjct: 803 MRGSFPLNGTYFQVNEVFADHRSSHNPIHVEREMLWNLQRRMVFFGTSVPTIFKGLRTEE 862
Query: 1059 IQEYFWRGFVCVRGFDRRYRCPRPLVNRLH 1088
IQ+ FWRGFVCVRGFD R PRPL LH
Sbjct: 863 IQQCFWRGFVCVRGFDMETRAPRPLCPHLH 892
>gi|297734851|emb|CBI17085.3| unnamed protein product [Vitis vinifera]
Length = 906
Score = 600 bits (1547), Expect = e-168, Method: Compositional matrix adjust.
Identities = 306/541 (56%), Positives = 374/541 (69%), Gaps = 23/541 (4%)
Query: 567 DWDLLRRIYSTG--EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
DWD LRR YS + R HD DS+DWEAVR A+ + +A AI RG N++A RIK+FL+
Sbjct: 375 DWDSLRRFYSNRGRKSRIHDARDSLDWEAVRDANVTHVAKAISTRGMNNVLAARIKDFLD 434
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
RLV HG +DLEWLR+VPP+K K+YLL I GLGLKSVECVRLL+L H AFPVD NVGRI
Sbjct: 435 RLVRDHGKLDLEWLRDVPPEKAKDYLLSIRGLGLKSVECVRLLTLHHHAFPVDTNVGRIT 494
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
VRLGWVPLQPLP D+++H L +YP MD IQ YLWPRLC LDQKTLYELHYQMITFGK+FC
Sbjct: 495 VRLGWVPLQPLPSDVYLHSLDQYPPMDTIQKYLWPRLCTLDQKTLYELHYQMITFGKVFC 554
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVV-NPIP 803
K++P C ACPLRG+CKH+AS VAS + P E+GI N +G + LVV N +P
Sbjct: 555 TKKNPYCDACPLRGDCKHYASVVASRFVCISAPKEEGI-----PNYVGSN--LVVDNHVP 607
Query: 804 VIRI---EADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTL 860
++ + E P + G NNCEPIIE P SP P+ E D +DF D +EIPT+
Sbjct: 608 LVSVSILEDGPNYQPGGTYNNCEPIIEMPTSPRPKSEEK-----DIEDFCDEDFDEIPTI 662
Query: 861 RLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
RL E EF N M ++ AL + A S+PTRKLK + LRTEHHVY LPD
Sbjct: 663 RLDE-EFGAGIQNCPHMYPTPQEEDHGSQALVSVATSIPTRKLKNISRLRTEHHVYELPD 721
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL R+ DP P+L AIWSPGE+ +S PQ RCN + S +LC E C CN+
Sbjct: 722 SHHLLTEVSLRESCDPCPFLFAIWSPGETADSFEQPQRRCNLEGSGELCKDETCSFCNAT 781
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E+++ I+RGT+LIPCRTA RG FPLNGTYFQVNEVFAD E+S +PI V R I NL R
Sbjct: 782 REQDSQIIRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADDESSRNPIIVRRTEIWNLPRR 841
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPS---KKEA 1096
+ FG+SAT++F L + +IQ FW+GF+CVRGF+R+ R P+PLV RLH SPS K +A
Sbjct: 842 MAYFGTSATSIFRGLSIMEIQNCFWQGFICVRGFNRKTRAPKPLVRRLHISPSHMGKAKA 901
Query: 1097 G 1097
G
Sbjct: 902 G 902
Score = 144 bits (362), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 90/196 (45%), Positives = 119/196 (60%), Gaps = 29/196 (14%)
Query: 56 PTQKMPPRYSRGLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHN 115
PT PPR S + E+ + S +I+K + L+I D +QL
Sbjct: 157 PTSTTPPRQS---SWEAEEVRVDS----LIQKLELLSISDKINQL--------------- 194
Query: 116 AVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNE 175
VPYQ + + LV +Q K K P+V LDP TL+MWNLI+ + ++ ++
Sbjct: 195 --VPYQKS-APKGNQLVRYQAK----KPAPKVDLDPETLKMWNLIVLKNRSEGVEEEMDK 247
Query: 176 DMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSS 235
EK W++EREVF GR++SF A MHL+ GDRRF PWKGSV+DSVVGV+LTQNVSD LSSS
Sbjct: 248 KDEKRWEEEREVFRGRVKSFLACMHLLQGDRRFHPWKGSVLDSVVGVFLTQNVSDKLSSS 307
Query: 236 AYMSLAARFPLKSTND 251
A+M+LAA FPL ST +
Sbjct: 308 AFMALAALFPLPSTGN 323
>gi|359479390|ref|XP_002270885.2| PREDICTED: DEMETER-like protein 3-like [Vitis vinifera]
Length = 1372
Score = 593 bits (1530), Expect = e-166, Method: Compositional matrix adjust.
Identities = 322/620 (51%), Positives = 397/620 (64%), Gaps = 60/620 (9%)
Query: 509 PSNRESFRTGMPQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDW 568
P ESF + Q ++ T K+ + K K++ K D + DW
Sbjct: 778 PHCMESFTSYSEQRNEMGETENKNLNNSNEGKRGKTNDKEDHSF--------------DW 823
Query: 569 DLLRRIYSTGEERS--HDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 626
D LRR YS +S HD DS+DWEAVR A+ + +A AI RG N++A RIK+FL+RL
Sbjct: 824 DSLRRFYSNRGRKSRIHDARDSLDWEAVRDANVTHVAKAISTRGMNNVLAARIKDFLDRL 883
Query: 627 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
V HG +DLEWLR+VPP+K K+YLL I GLGLKSVECVRLL+L H AFPVD NVGRI VR
Sbjct: 884 VRDHGKLDLEWLRDVPPEKAKDYLLSIRGLGLKSVECVRLLTLHHHAFPVDTNVGRITVR 943
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
LGWVPLQPLP D+++H L +YP MD IQ YLWPRLC LDQKTLYELHYQMITFGK+FC K
Sbjct: 944 LGWVPLQPLPSDVYLHSLDQYPPMDTIQKYLWPRLCTLDQKTLYELHYQMITFGKVFCTK 1003
Query: 747 RSPNCGACPLRGECKHFASAVASARFAL-------------------PGPSEKGIVTSEF 787
++P C ACPLRG+CKH+AS VAS R L P E+GI
Sbjct: 1004 KNPYCDACPLRGDCKHYASVVAS-RLPLIKNGGDGQLDLASPIENQPAAPKEEGIP---- 1058
Query: 788 GNGIGQSPPLVV-NPIPVIRI---EADPVSESGYQINNCEPIIEEPRSPGPQCSESPESE 843
N +G + LVV N +P++ + E P + G NNCEPIIE P SP P+ E
Sbjct: 1059 -NYVGSN--LVVDNHVPLVSVSILEDGPNYQPGGTYNNCEPIIEMPTSPRPKSEEK---- 1111
Query: 844 IDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKL 903
D +DF D +EIPT+RL E EF N M ++ AL + A S+PTRKL
Sbjct: 1112 -DIEDFCDEDFDEIPTIRLDE-EFGAGIQNCPHMYPTPQEEDHGSQALVSVATSIPTRKL 1169
Query: 904 KRCAHLRTEHHVYVLPDAHELL--RGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN 961
K + LRTEHHVY LPD+H LL RG RR+ DP P+L AIWSPGE+ +S PQ RCN
Sbjct: 1170 KNISRLRTEHHVYELPDSHHLLTERGM-RRESCDPCPFLFAIWSPGETADSFEQPQRRCN 1228
Query: 962 SKDS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHE 1020
+ S +LC E C CN+ E+++ I+RGT+LIPCRTA RG FPLNGTYFQVNEVFAD E
Sbjct: 1229 LEGSGELCKDETCSFCNATREQDSQIIRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADDE 1288
Query: 1021 TSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCP 1080
+S +PI V R I NL R + FG+SAT++F L + +IQ FW+GF+CVRGF+R+ R P
Sbjct: 1289 SSRNPIIVRRTEIWNLPRRMAYFGTSATSIFRGLSIMEIQNCFWQGFICVRGFNRKTRAP 1348
Query: 1081 RPLVNRLHCSPS---KKEAG 1097
+PLV RLH SPS K +AG
Sbjct: 1349 KPLVRRLHISPSHMGKAKAG 1368
Score = 143 bits (361), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 76/137 (55%), Positives = 98/137 (71%), Gaps = 5/137 (3%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSN 174
N +VPYQ + + LV +Q K K P+V LDP TL+MWNLI+ + ++ +
Sbjct: 613 NQLVPYQKS-APKGNQLVRYQAK----KPAPKVDLDPETLKMWNLIVLKNRSEGVEEEMD 667
Query: 175 EDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSS 234
+ EK W++EREVF GR++SF A MHL+ GDRRF PWKGSV+DSVVGV+LTQNVSD LSS
Sbjct: 668 KKDEKRWEEEREVFRGRVKSFLACMHLLQGDRRFHPWKGSVLDSVVGVFLTQNVSDKLSS 727
Query: 235 SAYMSLAARFPLKSTND 251
SA+M+LAA FPL ST +
Sbjct: 728 SAFMALAALFPLPSTGN 744
>gi|357152016|ref|XP_003575981.1| PREDICTED: uncharacterized protein LOC100830449 [Brachypodium
distachyon]
Length = 1717
Score = 591 bits (1523), Expect = e-166, Method: Compositional matrix adjust.
Identities = 299/553 (54%), Positives = 377/553 (68%), Gaps = 22/553 (3%)
Query: 553 KTTPKKNSDNTVQQDWDLLRR-IYSTGE--ERSHDKMDSVDWEAVRCADESEIADAIKER 609
+TT KKN +N DWD RR Y G ERS ++ DSV+WEAVRCAD I+ AI+ER
Sbjct: 1141 RTTNKKNMENF---DWDKFRRQAYDDGHMNERSFERRDSVNWEAVRCADVQRISHAIRER 1197
Query: 610 GQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL 669
G N++A RI+ FL RLV HGSIDLEWLR++PPD K+YLL I GLGLKSVECVRLL+L
Sbjct: 1198 GMNNVLAERIQNFLKRLVRDHGSIDLEWLRDIPPDSAKDYLLSIRGLGLKSVECVRLLTL 1257
Query: 670 QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTL 729
H+AFPVD NVGRI VRLGWVP+QPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TL
Sbjct: 1258 HHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYLWPRLCKLDQQTL 1317
Query: 730 YELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS--EF 787
YELHYQMITFGK+FC K PNC ACP+R ECKHFASA ASAR ALP P EK +V S +F
Sbjct: 1318 YELHYQMITFGKVFCTKSKPNCNACPMRSECKHFASAFASARLALPAPQEKSLVKSSNQF 1377
Query: 788 GNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDD- 846
G P +P ++E + NN EPI+EEP SP + E PE+ ++D
Sbjct: 1378 SFENGGLPTRNSTVLP--QLEGSTFGRD-FPANNSEPIVEEPASPREE--ECPETLVNDI 1432
Query: 847 DDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQ---DSSALVALTAEAASVPTRKL 903
+D+ + D EIPT++L F N N ++ + LQ + ALVA++ +AAS+P KL
Sbjct: 1433 EDYDV-DTGEIPTIKLNMETFARNLENCIKESNNDLQFGDIAKALVAISTQAASIPAPKL 1491
Query: 904 KRCAHLRTEHHVYVLPDAHELLRG--FDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN 961
K LRTEH+VY LPD+H L++ D+R+PDDPSPYLL IW + + + C+
Sbjct: 1492 KNVHRLRTEHYVYELPDSHSLVQQLELDQREPDDPSPYLLTIWMEDDIKEMSKASKSCCD 1551
Query: 962 SK-DSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHE 1020
S+ ++ CN+E C+ C E + VRGTIL+PCRTA +G FPLNGTYFQVNEVFADH+
Sbjct: 1552 SEVETGFCNNE-CHYCVPERENQSRYVRGTILVPCRTAMKGSFPLNGTYFQVNEVFADHK 1610
Query: 1021 TSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCP 1080
+SH PI V R + NL+R +V FG+S ++F L ++Q+ FWRGFVCVRGFD R P
Sbjct: 1611 SSHDPIYVAREQLWNLQRRMVYFGTSVPSIFKGLTTEKVQQCFWRGFVCVRGFDMETRAP 1670
Query: 1081 RPLVNRLHCSPSK 1093
RPL LH + S+
Sbjct: 1671 RPLCPHLHLAASR 1683
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/133 (57%), Positives = 92/133 (69%), Gaps = 8/133 (6%)
Query: 113 VHNAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQT 172
V N++VPY P H IK++R + K V LD T R+WNL+M T D T
Sbjct: 705 VSNSLVPYSGLPYER----PLHLIKKQRPRAK--VDLDYETTRVWNLLMG-KAAETVDGT 757
Query: 173 SNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNL 232
+ D E++WQKER+VF+GR SF ARM L+ GDR F PWKGSVVDSVVGV+LTQNVSD+L
Sbjct: 758 -DVDKERFWQKERDVFQGRANSFIARMRLVQGDRHFSPWKGSVVDSVVGVFLTQNVSDHL 816
Query: 233 SSSAYMSLAARFP 245
SSSAYM+LAA FP
Sbjct: 817 SSSAYMALAASFP 829
>gi|222631769|gb|EEE63901.1| hypothetical protein OsJ_18726 [Oryza sativa Japonica Group]
Length = 1837
Score = 591 bits (1523), Expect = e-166, Method: Compositional matrix adjust.
Identities = 304/564 (53%), Positives = 375/564 (66%), Gaps = 31/564 (5%)
Query: 553 KTTPKKNSDNTVQQDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKER 609
+TT KKNS+N DWD LRR +ER D+ DSVDWEAVRCAD I+ AI+ER
Sbjct: 1290 RTTSKKNSENF---DWDKLRRQACGNYQMKERIFDRRDSVDWEAVRCADVQRISHAIRER 1346
Query: 610 GQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL 669
G N++A RI++FLNRLV HGSIDLEWLR+VPPD K+YLL I GLGLKSVECVRLL+L
Sbjct: 1347 GMNNVLAERIQKFLNRLVTDHGSIDLEWLRDVPPDSAKDYLLSIRGLGLKSVECVRLLTL 1406
Query: 670 QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTL 729
H+AFPVD NVGRI VRLGWVP+QPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TL
Sbjct: 1407 HHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYLWPRLCKLDQQTL 1466
Query: 730 YELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS---- 785
YELHYQMITFGK+FC K PNC ACP+R ECKHFASA ASAR ALP P +K +V
Sbjct: 1467 YELHYQMITFGKVFCTKSKPNCNACPMRSECKHFASAFASARLALPSPQDKRLVNMSNQF 1526
Query: 786 EFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEID 845
+F NG +P + P++++E + NN PIIEEP SP + D
Sbjct: 1527 DFQNGTMPTP----HSTPLLQLEGS-IHARDVHANNTNPIIEEPASPREEECRELLEN-D 1580
Query: 846 DDDFSIGDIEEIPTLRLQEREFKENFPNFM-EMNKVMLQD--SSALVALTAEAASVPTRK 902
+DF D +EIPT++L F +N N + E NK D + ALVA++ EAAS+P K
Sbjct: 1581 IEDFD-EDTDEIPTIKLNMEAFAQNLENCIKESNKDFQSDDITKALVAISNEAASIPVPK 1639
Query: 903 LKRCAHLRTEHHVYVLPDAHELLR--GFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC 960
LK LRTEH+VY LPD+H L++ D+R+PDDP+ + E+P +PQ
Sbjct: 1640 LKNVHRLRTEHYVYELPDSHPLMQQLALDQREPDDPN----ELKDTREAPKPCCNPQ--- 1692
Query: 961 NSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHE 1020
+ LC++E+C++C S E VRGT+L+PCRTA RG FPLNGTYFQVNEVFADH
Sbjct: 1693 --TEGGLCSNEMCHNCVSERENQYRYVRGTVLVPCRTAMRGSFPLNGTYFQVNEVFADHS 1750
Query: 1021 TSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCP 1080
+SH+PIN+PR + NL R +V FG+S T+F L +IQ FWRGFVCVRGFD R P
Sbjct: 1751 SSHNPINIPREQLWNLHRRMVYFGTSVPTIFKGLTTEEIQHCFWRGFVCVRGFDMETRAP 1810
Query: 1081 RPLVNRLHCSPSKKEAGSKRSRDE 1104
RPL H + SK SK + E
Sbjct: 1811 RPLCPHFHLAASKLRRSSKTAATE 1834
Score = 124 bits (310), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 77/136 (56%), Positives = 98/136 (72%), Gaps = 15/136 (11%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIK---EKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQ 171
NA+VPY VG GA+VP+Q + K+++ + +V LD T R+WNL+M G D
Sbjct: 809 NALVPY-VG-----GAVVPYQTQFHLVKKQRPRAKVDLDFETTRVWNLLM----GKAADP 858
Query: 172 TSNEDM--EKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVS 229
D+ E+WW++EREVF+GR SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNV+
Sbjct: 859 VDGTDVDKERWWKQEREVFQGRANSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVA 918
Query: 230 DNLSSSAYMSLAARFP 245
D+LSSSAYM+LAA FP
Sbjct: 919 DHLSSSAYMALAASFP 934
>gi|242090707|ref|XP_002441186.1| hypothetical protein SORBIDRAFT_09g021920 [Sorghum bicolor]
gi|241946471|gb|EES19616.1| hypothetical protein SORBIDRAFT_09g021920 [Sorghum bicolor]
Length = 1704
Score = 587 bits (1512), Expect = e-164, Method: Compositional matrix adjust.
Identities = 309/619 (49%), Positives = 386/619 (62%), Gaps = 67/619 (10%)
Query: 484 RNDLKSDFRSISSSDFNDPF---EISVLPSNRE---SFRTGMPQAHDATTTSKKSPRGKG 537
R+D +S F + + F EI+V + +E + ++G + T+ ++P+ K
Sbjct: 1122 RHDKRSSFEAPDLKEHESVFPTHEIAVEATRKEYEYTSKSGFTSCNGVPDTAAQAPKAK- 1180
Query: 538 KSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR-IYSTGE--ERSHDKMDSVDWEAV 594
+T KK+++N DWD LRR YS G+ RS ++ DSVDWEAV
Sbjct: 1181 -------------KTRTATKKDAENF---DWDKLRRQAYSEGQMKTRSVERRDSVDWEAV 1224
Query: 595 RCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIE 654
RCAD I+ AI+ERG NI+A RI+ FLNRLV HGSIDLEWLR +PPD K+YLL I
Sbjct: 1225 RCADAQRISHAIRERGMNNILAERIQNFLNRLVRDHGSIDLEWLRYIPPDSAKDYLLSIR 1284
Query: 655 GLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
GLGLKSVECVRLL+L H+AFPVD NVGRI VRLGWVP+QPLP L +HLL+ YP+++ IQ
Sbjct: 1285 GLGLKSVECVRLLTLHHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPILETIQ 1344
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFAL 774
YLWPRLC LDQ+TLYELHYQMITFGK+FC K PNC ACP+R ECKHFASA ASAR AL
Sbjct: 1345 KYLWPRLCKLDQQTLYELHYQMITFGKVFCTKSKPNCNACPMRSECKHFASAFASARLAL 1404
Query: 775 PGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGP 834
P P EK +V S +N + R+E + + N EPIIEEP SP
Sbjct: 1405 PAPQEKSLVKLSNQFAFQNSSMHTMNSTHLPRLEGS-LHSREFLPKNSEPIIEEPASPR- 1462
Query: 835 QCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVAL 891
E P I++D + EIPT++L F +N N ++ + LQ + ALVA+
Sbjct: 1463 --EEGPPETIENDIEDFYEDGEIPTIKLNMEVFAQNLENCIKESNNELQSDDIAKALVAI 1520
Query: 892 TAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLR--GFDRRDPDDPSPYLLAIWSPGES 949
+ EAAS+P KLK LRTEH+VY LPDAH LL+ G D+R+PDDP+PYLLAIW+P +
Sbjct: 1521 STEAASIPVPKLKNVHRLRTEHYVYELPDAHPLLQQLGLDQREPDDPTPYLLAIWTPEK- 1579
Query: 950 PNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTY 1009
E + VRGTIL+PCRTA RG FPLNGTY
Sbjct: 1580 -------------------------------ENQSRYVRGTILVPCRTAMRGSFPLNGTY 1608
Query: 1010 FQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVC 1069
FQVNEVFADH +SH+PI+V R + NL+R +V FG+S T+F L +IQ+ FWRGFVC
Sbjct: 1609 FQVNEVFADHRSSHNPIHVQREQLWNLQRRMVFFGTSVPTIFKGLTTEEIQQCFWRGFVC 1668
Query: 1070 VRGFDRRYRCPRPLVNRLH 1088
VRGFD R PRPL LH
Sbjct: 1669 VRGFDMETRAPRPLCPHLH 1687
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/191 (47%), Positives = 119/191 (62%), Gaps = 30/191 (15%)
Query: 108 DGPSNVHNAVVPYQVGPSSEHGALVPHQ-----IKEKRKKLKPEVVLDPGTLRMWNLIMN 162
+G S NA+VPY +VP++ +K++R + K V LD T R+WNL+M
Sbjct: 700 EGTSGPSNALVPYG------GAVMVPYERPLQLVKKQRPRAK--VDLDFETTRVWNLLMG 751
Query: 163 ID---DGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSV 219
DGT ++ E+WWQ+EREVF+GR SF ARM L+ GDRRF PWKGSVVDSV
Sbjct: 752 KSAEPDGTDVEK------ERWWQQEREVFQGRANSFIARMRLVQGDRRFSPWKGSVVDSV 805
Query: 220 VGVYLTQNVSDNLSSSAYMSLAARFPLKSTN----DHTPSDENLRTTASLEPIGSNST-- 273
VGV+LTQNV+D+LSSSAYM+LAA FP +S N D +++N +TT++ +G S
Sbjct: 806 VGVFLTQNVADHLSSSAYMALAASFPSRSVNNSCKDDATTEDNEQTTSTSALVGEKSVFD 865
Query: 274 --SNGAVYDSE 282
NGA D E
Sbjct: 866 LFYNGARPDLE 876
>gi|240255311|ref|NP_187612.5| protein demeter-like 2 [Arabidopsis thaliana]
gi|378405159|sp|Q9SR66.2|DML2_ARATH RecName: Full=DEMETER-like protein 2
gi|332641327|gb|AEE74848.1| protein demeter-like 2 [Arabidopsis thaliana]
Length = 1332
Score = 580 bits (1496), Expect = e-162, Method: Compositional matrix adjust.
Identities = 323/717 (45%), Positives = 428/717 (59%), Gaps = 39/717 (5%)
Query: 404 ITQSSVTQFWP-TGNSTADVASPSKT---------------CIKESSIAASTEIPQLENT 447
+TQ ++ P TG ST + +P++ C +ESS + + I + +
Sbjct: 613 VTQETILNLDPRTGVSTPRIRNPTRVIIEEIDDDENDIDAVCSQESSKTSDSSITSADQS 672
Query: 448 -ALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEIS 506
+L D + +L ++ K G+ T NDL +SS+ + ++
Sbjct: 673 KTMLLDPFNTVLMNEQVDSQMVK----GKGHIPYTDDLNDLSQGISMVSSASTHCELNLN 728
Query: 507 VLPSNRESFRTGM-PQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQ 565
+P E P++ T ++S R + K K + KK+ S
Sbjct: 729 EVPPEVELCSHQQDPESTIQTQDQQESTRTEDVKKNRKKPTTSKPKKKSKESAKSTQKKS 788
Query: 566 QDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEF 622
DWD LR+ +G ER+ MD+VDW+A+RC D +IA+ I +RG N++A RIK F
Sbjct: 789 VDWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAF 848
Query: 623 LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 682
LNRLV+ HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLLSL IAFPVD NVGR
Sbjct: 849 LNRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGR 908
Query: 683 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
IAVRLGWVPLQPLP +L +HLL+ YPV++ +Q YLWPRLC LDQKTLYELHY MITFGK+
Sbjct: 909 IAVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKV 968
Query: 743 FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPI 802
FC K PNC ACP++ EC+H++SA ASAR ALP P E + + P+VVN
Sbjct: 969 FCTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFR 1028
Query: 803 PVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEE------ 856
P + + + E+ + NCEPIIEEP SP P+ E + D ++G E+
Sbjct: 1029 PSLFLYQEKEQEA-QRSQNCEPIIEEPASPEPEYIEHDIEDYPRDKNNVGTSEDPWENKD 1087
Query: 857 -IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHV 915
IPT+ L + + + + +NK S LV L+ AA++P RKLK LRTEHHV
Sbjct: 1088 VIPTIILNK---EAGTSHDLVVNKEA-GTSHDLVVLSTYAAAIPRRKLKIKEKLRTEHHV 1143
Query: 916 YVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN--SKDSKLCNSEIC 973
+ LPD H +L GF+RR+ +D PYLLAIW+PGE+ NS+ P+ RC ++ LCN C
Sbjct: 1144 FELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNENKC 1203
Query: 974 YSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTI 1033
+ CN E + VRGTILIPCRTA RG FPLNGTYFQ NEVFADH++S +PI+VP I
Sbjct: 1204 FQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTELI 1263
Query: 1034 ANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCS 1090
+L+R + GSS +++ L + I+ F G+VCVRGFDR R P+ LV RLHCS
Sbjct: 1264 WDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLHCS 1320
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/171 (46%), Positives = 105/171 (61%), Gaps = 17/171 (9%)
Query: 78 QSTTADIIRK-FKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQI 136
+S T D I K F++L I G LP N A++ Y+ E A+V +
Sbjct: 443 RSLTIDAITKLFEELDINKEGLCLP---------HNRETALILYKKS-YEEQKAIVKYS- 491
Query: 137 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 196
KK KP+V LDP T R+W L+M+ D D S+E+ KWW++ER +F GR SF
Sbjct: 492 ----KKQKPKVQLDPETSRVWKLLMSSIDCDGVD-GSDEEKRKWWEEERNMFHGRANSFI 546
Query: 197 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 247
ARM ++ G+R F PWKGSVVDSVVGV+LTQNV+D+ SSSAYM LAA FP++
Sbjct: 547 ARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVADHSSSSAYMDLAAEFPVE 597
>gi|242065150|ref|XP_002453864.1| hypothetical protein SORBIDRAFT_04g019820 [Sorghum bicolor]
gi|241933695|gb|EES06840.1| hypothetical protein SORBIDRAFT_04g019820 [Sorghum bicolor]
gi|333471385|gb|AEF38426.1| 5-methylcytosine DNA glycosylase [Sorghum bicolor]
Length = 1891
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 300/540 (55%), Positives = 364/540 (67%), Gaps = 30/540 (5%)
Query: 567 DWDLLRR-IYSTG--EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR+ + S G ++RSHD D+VDWEAVR A+ EI++ I+ERG N++A RIKEFL
Sbjct: 1367 DWDSLRKEVLSNGGDKQRSHDARDTVDWEAVRQAEVREISETIRERGMNNMLAERIKEFL 1426
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLV HGSIDLEWLR+V PDK K++LL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1427 NRLVTDHGSIDLEWLRDVQPDKAKDFLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRI 1486
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
VRLGWVPLQPLP L +HLL+ +YELHYQMITFGK+F
Sbjct: 1487 CVRLGWVPLQPLPESLQLHLLE-----------------------MYELHYQMITFGKVF 1523
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC +CP+R ECKHFASA ASAR ALP P EK + TSE N + +N
Sbjct: 1524 CTKSKPNCNSCPMRAECKHFASAFASARLALPAPEEKRLATSEDANVVEFCHQTYINSGT 1583
Query: 804 VIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
V ++E N +PIIEEP SP P+ E +DF D +EIPT+ L
Sbjct: 1584 VGQLEWSANYPKHAVSGNHQPIIEEPLSPEPEPENVEAKEGAIEDFFCEDPDEIPTINLN 1643
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
EF +N N+M+ N V ++ S ALVA+T +AAS+PT KLK + LRTEH VY LPD
Sbjct: 1644 IEEFTQNLKNYMQANNVDIEYADMSKALVAITPDAASIPTPKLKNISRLRTEHQVYELPD 1703
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL GF++R+PDDP PYLL+IW+PGE+ S +P+ C+S ++ +LC S C+SCNS+
Sbjct: 1704 SHPLLDGFEQREPDDPCPYLLSIWTPGETAQSTDAPKTFCDSGETGRLCGSSTCFSCNSI 1763
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E A VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH +S +PI+VPR I +L R
Sbjct: 1764 REMQAQKVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHYSSQNPIDVPRSWIWDLPRR 1823
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSK 1099
V FG+S T+F L QIQ+ FWRGFVCVRGFDR R PRPL RLH SK G K
Sbjct: 1824 TVYFGTSVPTIFRGLTTEQIQQCFWRGFVCVRGFDRTVRAPRPLYARLHFPASKVVRGKK 1883
Score = 130 bits (326), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 83/165 (50%), Positives = 111/165 (67%), Gaps = 14/165 (8%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKRKKL 143
II+K K L+I +G ++ + K NA+VPY+ E GALV + K K+ +
Sbjct: 808 IIQKIKLLSI-NGPDKIVAEVPK--------NALVPYE----GEFGALVAFEGKTKKSRS 854
Query: 144 KPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLIL 203
+ +V +DP T MWNL+M D G + ++D EKW +ER VF GR+ SF ARMHL+
Sbjct: 855 RAKVNIDPVTTMMWNLLMGPDMGDGAEGL-DKDKEKWLDEERRVFRGRVDSFIARMHLVQ 913
Query: 204 GDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
GDRRF WKGSVVDSVVGV+LTQNVSD+LSSSA+M++AA+FP K+
Sbjct: 914 GDRRFSRWKGSVVDSVVGVFLTQNVSDHLSSSAFMAVAAKFPAKT 958
>gi|242085358|ref|XP_002443104.1| hypothetical protein SORBIDRAFT_08g008620 [Sorghum bicolor]
gi|241943797|gb|EES16942.1| hypothetical protein SORBIDRAFT_08g008620 [Sorghum bicolor]
Length = 1856
Score = 576 bits (1485), Expect = e-161, Method: Compositional matrix adjust.
Identities = 300/549 (54%), Positives = 371/549 (67%), Gaps = 34/549 (6%)
Query: 567 DWDLLRR-IYSTG--EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR+ + S G ++RSHD D+VDWEAVR A+ EI++ I+ERG N++A RIKEFL
Sbjct: 1331 DWDSLRKEVLSNGGDKQRSHDARDTVDWEAVRQAEVREISETIRERGMNNMLAERIKEFL 1390
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
+RLV HGSIDLEWLR+V PDK K++LL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1391 DRLVTDHGSIDLEWLRDVQPDKAKDFLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRI 1450
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
VRLGWVPLQPLP L +HLL+ +YELHYQMITFGK+F
Sbjct: 1451 CVRLGWVPLQPLPESLQLHLLE-----------------------MYELHYQMITFGKVF 1487
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC +CP+R ECKHFASA ASAR ALP P EK + TSE N P+ +N
Sbjct: 1488 CTKSKPNCNSCPMRVECKHFASAFASARLALPAPEEKRLATSEDPNVAEFCHPIYINSGT 1547
Query: 804 VIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
V ++E +N +PIIEEP SP P+ E +DF D +EIPT+ L
Sbjct: 1548 VGQLEWTANYPKHAVSDNHQPIIEEPLSPEPEPENVEAKEGAIEDFFCEDPDEIPTINLN 1607
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
EF +N N+M+ N V ++ S ALVA+T +AAS+PT KLK + LRTEH VY LPD
Sbjct: 1608 IEEFTQNLKNYMQANNVEIEYADMSKALVAITPDAASIPTPKLKNVSRLRTEHQVYELPD 1667
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL GF++R+PDDP PYLL+IW+PGE+ S +P+ C+S ++ +LC S C+SCNS+
Sbjct: 1668 SHPLLEGFEQREPDDPCPYLLSIWTPGETAQSTDAPKTFCDSGETGRLCGSSTCFSCNSI 1727
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E A VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH +S +PI+VPR I +L R
Sbjct: 1728 RETQAQKVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHYSSQNPIDVPRSWIWDLPRR 1787
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK----KE 1095
V FG+S T+F L +IQ+ FWRGFVCVRGFDR R PRPL RLH SK ++
Sbjct: 1788 TVYFGTSVPTIFRGLTTEEIQQCFWRGFVCVRGFDRTVRAPRPLYARLHFPASKVVRGRK 1847
Query: 1096 AGSKRSRDE 1104
G+ R +E
Sbjct: 1848 PGAAREEEE 1856
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 88/183 (48%), Positives = 111/183 (60%), Gaps = 24/183 (13%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNV-----HNAVVPYQVGPSSEHGALVPHQIKE 138
II+K K L+I +GP V NA+VPYQ E GALV + K
Sbjct: 771 IIQKIKLLSI--------------NGPDKVVAEVPKNALVPYQ----GEFGALVAFKGKT 812
Query: 139 KRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTAR 198
K+ + + +V +DP T MWNL+M D G + ++D EKW +ER VF GR+ SF AR
Sbjct: 813 KKSRSRAKVNIDPVTTMMWNLLMGPDMGDGAEGL-DKDKEKWLDEERRVFRGRVDSFIAR 871
Query: 199 MHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDEN 258
MHL+ GDRRF WKGSVVDSVVGV+LTQNVSD+LSSSA+M +AA+FP K+ P E
Sbjct: 872 MHLVQGDRRFSRWKGSVVDSVVGVFLTQNVSDHLSSSAFMGVAAKFPAKTEVPEKPVAEM 931
Query: 259 LRT 261
T
Sbjct: 932 CHT 934
>gi|226499668|ref|NP_001146555.1| uncharacterized protein LOC100280151 [Zea mays]
gi|219887797|gb|ACL54273.1| unknown [Zea mays]
Length = 501
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 287/491 (58%), Positives = 348/491 (70%), Gaps = 4/491 (0%)
Query: 613 NIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI 672
N++A RIKEFLNRLV HGSIDLEWLR+V PDK K++LL I GLGLKSVECVRLL+L H+
Sbjct: 3 NMLAERIKEFLNRLVTDHGSIDLEWLRDVQPDKAKDFLLSIRGLGLKSVECVRLLTLHHM 62
Query: 673 AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYEL 732
AFPVD NVGRI VRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYEL
Sbjct: 63 AFPVDTNVGRICVRLGWVPLQPLPESLQLHLLEMYPMLEHIQKYLWPRLCKLDQRTLYEL 122
Query: 733 HYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIG 792
HYQMITFGK+FC K PNC +CP+R ECKHFASA ASAR ALP P EK +VT E N +
Sbjct: 123 HYQMITFGKVFCTKSKPNCNSCPMRAECKHFASAFASARLALPAPEEKCLVTLEDPNVVE 182
Query: 793 QSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIG 852
S +N V ++E N +PIIEEP SP + E +DF
Sbjct: 183 FSHQTYINSGSVGQLEWSANYPKHAVSGNHQPIIEEPLSPECETENIEAHEGAIEDFFCE 242
Query: 853 DIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHL 909
+ +EIPT+ L EF +N ++M+ N V ++ S ALVA+T +AAS+PT KLK L
Sbjct: 243 ESDEIPTINLNIEEFTQNLKDYMQANNVEIEYADMSKALVAITPDAASIPTPKLKNVNRL 302
Query: 910 RTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLC 968
RTEH VY LPD+H LL GF++R+PDDP PYLL+IW+PGE+ S+ +P+ C+S ++ +LC
Sbjct: 303 RTEHQVYELPDSHPLLEGFEQREPDDPCPYLLSIWTPGETAQSIDAPKTFCDSGETGRLC 362
Query: 969 NSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINV 1028
S IC+SCN++ E A VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH +S +PI+V
Sbjct: 363 GSSICFSCNNIREMQAQKVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHCSSQNPIDV 422
Query: 1029 PRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLH 1088
PR I +L R V FG+S T+F L +IQ FWRGFVCVRGFDR R PRPL RLH
Sbjct: 423 PRSWIWDLPRRTVYFGTSVPTIFRGLTTEEIQRCFWRGFVCVRGFDRTVRAPRPLYARLH 482
Query: 1089 CSPSKKEAGSK 1099
SK G K
Sbjct: 483 FPVSKVVRGKK 493
>gi|449527101|ref|XP_004170551.1| PREDICTED: LOW QUALITY PROTEIN: protein ROS1-like, partial [Cucumis
sativus]
Length = 987
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 306/577 (53%), Positives = 392/577 (67%), Gaps = 42/577 (7%)
Query: 514 SFRTGMP--QAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLL 571
S+ TG P HD + +SK + D K+D K + + V DWD L
Sbjct: 428 SWETGEPAQNKHDHSLSSKFN------------DPKSDIIKPNRERVKKEKRVGVDWDSL 475
Query: 572 RR-IYSTGE-ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVEL 629
R+ + +TG +R+ + MDS+DWEAVRCAD +IA I+ERG N +A RIK+FL+RLV+
Sbjct: 476 RKQVEATGRRDRTTNTMDSLDWEAVRCADIDDIAYTIRERGMNNRLAERIKDFLDRLVKD 535
Query: 630 HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGW 689
HGS DLEWLR+VPPD+ KEYLL I GLGLKSVECVRLL+LQ +AFPVD NVGRIAVRLGW
Sbjct: 536 HGSTDLEWLRDVPPDQAKEYLLSIRGLGLKSVECVRLLTLQQVAFPVDTNVGRIAVRLGW 595
Query: 690 VPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSP 749
VPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K P
Sbjct: 596 VPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKP 655
Query: 750 NCGACPLRGECKHFASAVASARFALPGPSEKGIV-TSEFGNGIGQSPPLVVNPIPVIRIE 808
NC ACP+RGEC+HFASA ASAR +LP P EK ++ +E I Q+ + P+ + + E
Sbjct: 656 NCNACPMRGECRHFASAFASARLSLPAPEEKSLINATERKADINQAVVVHQQPLALTQ-E 714
Query: 809 ADPVSESGYQI-------NNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLR 861
++P+ ES Q+ +N +PIIEEP++P P+C + SEID +D D +EIPT++
Sbjct: 715 SEPI-ESIQQLISVKSGGSNKDPIIEEPQTPEPECPQI--SEIDIEDTLYEDPDEIPTIK 771
Query: 862 LQEREFKENFPNFMEMNKVMLQDSS---ALVALTAEAASVPTRKLKRCAHLRTEHHVYVL 918
L F ++ N+M+ N + LQ+ S ALV L+ EAAS+P KLK + LRTEH VY L
Sbjct: 772 LNIEAFTKHVQNYMQEN-MELQEGSMSKALVLLSPEAASIPMPKLKNISRLRTEHQVYEL 830
Query: 919 PDAHELLRG--FDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYS 975
PD+H LL +RR+PDDP YLLAIW+PGE+ NSV C+S++S LC + C+S
Sbjct: 831 PDSHPLLEKLQLERREPDDPCFYLLAIWTPGETANSVEQLHAHCSSQESGGLCGEKECFS 890
Query: 976 CNSVCERNAGIVRGTI-------LIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINV 1028
CNSV E ++ +VRGT IPCRTA RG FPLNGTYFQVNEVFADH++S +PI+V
Sbjct: 891 CNSVREPDSEVVRGTTSANQLQXXIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLNPIDV 950
Query: 1029 PRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWR 1065
PR + L R V FG+S T+F L +IQ FW+
Sbjct: 951 PRSWLWKLSRRTVYFGTSIPTIFKGLSTEEIQGCFWK 987
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 91/185 (49%), Gaps = 3/185 (1%)
Query: 205 DRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS-DENLRTTA 263
DR F WKGSV+DSV+GV+LTQNVSD+LSSSA+MSLAAR+PLKS + H S DE
Sbjct: 28 DRGFSQWKGSVLDSVIGVFLTQNVSDHLSSSAFMSLAARYPLKSKSLHESSVDEQTSLIL 87
Query: 264 SLEPIGSNSTSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKV 323
+ + + ++ + + + + CE+ ++ F + + G +
Sbjct: 88 NESQVTLCQAEDSVIWAKQISDQSICKQSCTTVCEIDQAEENFLTSSDSSGSKTAGVTSM 147
Query: 324 MTDAVPSQAFDTSSVQSLDRTQLFPTGNSKADVASSTKTSNAESFITQFSHTGNLKKNSV 383
++ + + L+ +L N+ + S A++ I+ S + ++S+
Sbjct: 148 RGYQCSVTSYSSKKIVELEDRRLTTEINTTVEACSLGNEKTADAAIS--SQMSVVSEHSI 205
Query: 384 NQLFP 388
N L P
Sbjct: 206 NSLCP 210
>gi|413936924|gb|AFW71475.1| hypothetical protein ZEAMMB73_049283 [Zea mays]
Length = 1906
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 297/540 (55%), Positives = 363/540 (67%), Gaps = 30/540 (5%)
Query: 567 DWDLLRR-IYSTG--EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR+ ++S G ++R++D D+VDWEAVR A+ EI++ I+ERG N++A RIKEFL
Sbjct: 1382 DWDSLRKEVFSNGGDKQRNNDARDTVDWEAVRQAEVREISETIRERGMNNMLAERIKEFL 1441
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLV HG IDLEWLR+VPPDK K++LL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1442 NRLVTDHGGIDLEWLRDVPPDKAKDFLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRI 1501
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
VRLGWVPLQPLP L +HLL+ +YELHYQMITFGK+F
Sbjct: 1502 CVRLGWVPLQPLPESLQLHLLE-----------------------MYELHYQMITFGKVF 1538
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIP 803
C K PNC +CPLR ECKHFASA ASAR ALP P EK + TSE N + +N
Sbjct: 1539 CTKSKPNCNSCPLRAECKHFASAFASARLALPAPEEKRLATSEDPNVVEFCHQTYINSGA 1598
Query: 804 VIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
V +E N +P IEEP SP P+ + +DF D +EIPT+ L
Sbjct: 1599 VGELEWSANYPKHAVCGNLQPFIEEPLSPEPEPENVEAKDGAIEDFFNEDPDEIPTINLN 1658
Query: 864 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
EF +N N+M+ N V ++ S ALVA+T EAAS+PT KLK + LRTEH VY LPD
Sbjct: 1659 IEEFTQNLKNYMQANHVEIEYADMSKALVAITPEAASIPTPKLKNVSRLRTEHQVYELPD 1718
Query: 921 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
+H LL GF++R+PDDP PYLL+IW+PGE+ S +P+ C+S ++ +LC S C+SCNS+
Sbjct: 1719 SHPLLEGFEQREPDDPCPYLLSIWTPGETAQSTNAPKTFCDSGETGQLCGSLTCFSCNSL 1778
Query: 980 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
E A VRGT+LIPCRTA RG FPLNGTYFQVNEVFADH +S +PI+VPR I +L R
Sbjct: 1779 REMQAQKVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHCSSQNPIDVPRSWIWDLPRR 1838
Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSK 1099
V FG+S T+F L +IQ+ FWRGFVCVRGFDR R PRPL RLH SK G K
Sbjct: 1839 TVYFGTSVPTIFRGLTTEEIQQCFWRGFVCVRGFDRTVRAPRPLYARLHFPASKVVRGKK 1898
Score = 127 bits (318), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 74/133 (55%), Positives = 97/133 (72%), Gaps = 5/133 (3%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSN 174
+A+VPY+ E GALV + K K+ + + +V +DP T MWNL+M D G + +
Sbjct: 842 DALVPYE----GEFGALVAFEGKTKKNRSRAKVNIDPVTTMMWNLLMGPDMGDGAEGL-D 896
Query: 175 EDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSS 234
+D EKW +ER+VF GR+ SF ARMHL+ GDRRF WKGSVVDSVVGV+LTQNVSD+LSS
Sbjct: 897 KDKEKWLDEERKVFRGRVDSFIARMHLVQGDRRFSRWKGSVVDSVVGVFLTQNVSDHLSS 956
Query: 235 SAYMSLAARFPLK 247
SA+M++AA+FP K
Sbjct: 957 SAFMAVAAKFPAK 969
>gi|339699499|gb|AEJ87849.1| hypothetical protein [Fragaria x ananassa]
Length = 496
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 287/498 (57%), Positives = 358/498 (71%), Gaps = 10/498 (2%)
Query: 580 ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLR 639
E++ + MDSVDWEAVR A+ ++IA IKERG N++A RIKEFLNRL+ HGS+DLEWLR
Sbjct: 2 EKTRNTMDSVDWEAVRTANVNDIAQTIKERGMNNMLAERIKEFLNRLLREHGSVDLEWLR 61
Query: 640 NVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDL 699
+VPPD+ KEYLL GLGLKSVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQPLP L
Sbjct: 62 DVPPDQAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESL 121
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
+HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHY+MITFGK+FC K PNC ACP+RGE
Sbjct: 122 QLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYRMITFGKVFCTKSKPNCNACPMRGE 181
Query: 760 CKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQI 819
C+HFASA ASAR ALPGP +K IV+S ++P +V N + + E P+ + G
Sbjct: 182 CRHFASAFASARLALPGPGQKSIVSSTSSPVAEKNPAVVGNIMSLPPPENYPLQKVGAY- 240
Query: 820 NNCEPIIEEPRSPGPQCSESPESEIDD---DDFSIGDIEEIPTLRLQEREFKENFPNFME 876
EPIIEEP SP + + SEI+D ++ D ++I + + + + + M+
Sbjct: 241 -RSEPIIEEPVSPEQEFTNLSHSEIEDLLNEEEYDDDPDDITHINISMEQLTDTLQDHMQ 299
Query: 877 MNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDP 933
N + L+D S ALVAL A S+PT KLK + LRTEH VY LPD+H LL G DR +P
Sbjct: 300 -NNMDLKDGDLSKALVALNPAATSLPTPKLKDVSRLRTEHQVYELPDSHPLLEGVDRLEP 358
Query: 934 DDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKLCNSEICYSCNSVCERNAGIVRGTIL 992
DDPSPYLLAIW+PGE+ NS P+ C SK+ + LC+ + C++CNS+ E + IVRGTIL
Sbjct: 359 DDPSPYLLAIWTPGETANSFQPPESSCGSKEGNTLCSEKTCFTCNSIREEKSQIVRGTIL 418
Query: 993 IPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFN 1052
IPCRTA RG FPLNGTYFQVNE+FADH +S +PI+VPR I NL R V FG+S T++F
Sbjct: 419 IPCRTAMRGSFPLNGTYFQVNEMFADHASSANPIDVPRGWIWNLPRRPVYFGTSVTSIFR 478
Query: 1053 ALDMRQIQEYFWRGFVCV 1070
L IQ FW+G+VCV
Sbjct: 479 GLTTEGIQYCFWKGYVCV 496
>gi|357129088|ref|XP_003566199.1| PREDICTED: uncharacterized protein LOC100844361 [Brachypodium
distachyon]
Length = 1546
Score = 530 bits (1366), Expect = e-147, Method: Compositional matrix adjust.
Identities = 265/487 (54%), Positives = 332/487 (68%), Gaps = 16/487 (3%)
Query: 620 KEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVN 679
K FLNRLV HGSIDLEWLR++PPD K+YLL I GLGLKSVECVRLL+L H+AFPVD N
Sbjct: 1041 KSFLNRLVRDHGSIDLEWLRDIPPDSAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTN 1100
Query: 680 VGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITF 739
VGRI VRLGWVP+QPLP L +HLL+ YPV++ IQ Y+WPRLC LDQ+TLYELHYQMITF
Sbjct: 1101 VGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYIWPRLCKLDQQTLYELHYQMITF 1160
Query: 740 GKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS----EFGNGIGQSP 795
GK+FC K PNC ACP+R ECKHFASA ASAR ALP P EK +V S F N +
Sbjct: 1161 GKVFCTKSKPNCNACPMRSECKHFASAFASARLALPAPQEKSLVKSSNQFSFENSGLPTR 1220
Query: 796 PLVVNPIPVIRIEADPVSESGYQI--NNCEPIIEEPRSP-GPQCSESPESEIDDDDFSIG 852
N + + S G + EPI+EEP SP +C E+ E++I+D D
Sbjct: 1221 NHAWNSTVLSVLPQLEGSTYGRDVLATYSEPIVEEPASPREEECPETLENDIEDYD---A 1277
Query: 853 DIEEIPTLRLQEREFKENFPNFM-EMNKVMLQD--SSALVALTAEAASVPTRKLKRCAHL 909
D EIPT++L F +N N + E NK + D + ALVA++ EAAS+P KLK L
Sbjct: 1278 DNGEIPTIKLNMEAFAQNLENCIKESNKDLQSDDIAKALVAISTEAASIPVPKLKNVRRL 1337
Query: 910 RTEHHVYVLPDAHELLRGFD--RRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSK-DSK 966
RTEH+VY LPD+H L++ + RR+PDDPSPYLL IW + +P+ C+S+ ++
Sbjct: 1338 RTEHYVYELPDSHSLVQQLELARREPDDPSPYLLTIWMEDDIKEMSKAPKSCCDSQMEAD 1397
Query: 967 LCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPI 1026
CN+ C+ C E + VRGTIL+PCRTA +G FPLNGTYFQVNEVFADH++SH PI
Sbjct: 1398 FCNNGKCHYCVPERENQSRYVRGTILVPCRTAMKGSFPLNGTYFQVNEVFADHKSSHDPI 1457
Query: 1027 NVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNR 1086
+V R + +L+R +V FG+S +T+F L ++Q+ FWRGFVCVRGFD + PRPL
Sbjct: 1458 HVAREQLWSLQRRMVYFGTSVSTIFKGLTTEEVQQCFWRGFVCVRGFDMETKAPRPLCPH 1517
Query: 1087 LHCSPSK 1093
LH + SK
Sbjct: 1518 LHLAASK 1524
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 75/149 (50%), Positives = 99/149 (66%), Gaps = 11/149 (7%)
Query: 134 HQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQ 193
H +K++R + K V LD T R+WNL+M T ++ D E+WWQ+EREVF+GR
Sbjct: 721 HLVKKQRPRAK--VNLDYETTRVWNLLMG--KATEPVDGTDVDKERWWQQEREVFQGRAN 776
Query: 194 SFTARMHLIL----GDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKST 249
SF ARM L+ GDR F PWKGSVVDSVVGV+LTQNVSD+LSSSA+M+LAA FP S
Sbjct: 777 SFIARMRLVQVYRNGDRHFSPWKGSVVDSVVGVFLTQNVSDHLSSSAFMALAASFPPGSV 836
Query: 250 NDHTPSDENLRTTASLEPIGSNSTSNGAV 278
H+ ++++ + A+ E I S S G +
Sbjct: 837 --HSNCEDDITSQAN-EEIFSMSAVGGDI 862
>gi|297833776|ref|XP_002884770.1| hypothetical protein ARALYDRAFT_341139 [Arabidopsis lyrata subsp.
lyrata]
gi|297330610|gb|EFH61029.1| hypothetical protein ARALYDRAFT_341139 [Arabidopsis lyrata subsp.
lyrata]
Length = 1078
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 306/662 (46%), Positives = 391/662 (59%), Gaps = 73/662 (11%)
Query: 446 NTALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSS------DF 499
NT L+ ++VD + + G G+ T N + +SS+ +
Sbjct: 461 NTVLMNEQVDS-----QMVKG--------KGHMPYTDDLNGMSEGISMVSSTSTHCELNL 507
Query: 500 ND-PFEISVLPSNRESFRTGMPQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKK 558
N+ P E+ + RE T PQ D ++ K + K S +K KK+T
Sbjct: 508 NEVPPEVELCSHQRELESTIQPQ--DQQENTRTEDVKKNRKKPTTSKLK----KKSTEPA 561
Query: 559 NSDNTVQQDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNII 615
S+ DWD LR+ +G ER+ MD+VDW+A+RC D ++IA+ I +RG N++
Sbjct: 562 KSNKKKSFDWDSLRKQAESGGRKRERTERTMDTVDWDALRCTDVNKIANIIIKRGMNNML 621
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
A RIK FLNRLV HGSIDLEWLR++ PD+ KEYLL I GLGLKSVECVRLLSL IAFP
Sbjct: 622 AERIKAFLNRLVRKHGSIDLEWLRDIHPDQAKEYLLSINGLGLKSVECVRLLSLHQIAFP 681
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQ 735
VD NVGRIAVRLGWVPLQPLP +L +HLL+ YPV++ +Q YLWPRLC LDQKTLYELHY
Sbjct: 682 VDTNVGRIAVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYH 741
Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSP 795
MITFGK+FC K PNC ACP++ EC+H+ASA ASAR ALP P E T +
Sbjct: 742 MITFGKVFCTKVKPNCNACPMKAECRHYASARASARLALPEPEESDRTTVMVHERRYKRK 801
Query: 796 PLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIE 855
P VVN P + + + E+ + NCEPIIEEP SP P+ E E ++ ++G +
Sbjct: 802 PFVVNFRPSLFLFQEKEHEAQ-RSQNCEPIIEEPASPEPEYIERDIEEYPWNNNNVGTSK 860
Query: 856 E-------IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAH 908
+ IPT+ L KE + + +NK S LV L+ AA++P RKLK
Sbjct: 861 DPWENKDVIPTIMLN----KEAGTSHLVVNKEA-GTSQDLVVLSTYAAAIPRRKLKIKEK 915
Query: 909 LRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLC 968
LRTEH+VY LP H +L GF+R + DD PYLLAI +PGE
Sbjct: 916 LRTEHNVYELPKYHSILEGFERHEDDDLVPYLLAICTPGEE------------------- 956
Query: 969 NSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINV 1028
E + VRGTILIPCRTA RG FPLNGTYFQ NEVFADH +S +PI V
Sbjct: 957 ------------EESHNTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHGSSINPIEV 1004
Query: 1029 PRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLH 1088
P +I +LRR I FGSS +++ L +++I++ F G+VCVRGFDR R P+ LV RLH
Sbjct: 1005 PTQSIWDLRRRIAYFGSSVSSICKGLSVKEIEDNFQEGYVCVRGFDRENRKPKSLVKRLH 1064
Query: 1089 CS 1090
CS
Sbjct: 1065 CS 1066
Score = 92.8 bits (229), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 87/197 (44%), Positives = 113/197 (57%), Gaps = 26/197 (13%)
Query: 78 QSTTADIIRK-FKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQI 136
+S T D I K F++L I S LP N A++ Y+ +E A+V +
Sbjct: 223 RSLTIDAITKLFEELDINKESSCLP---------HNRETALILYKKA-YAEQQAIVKYS- 271
Query: 137 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 196
KK KP+V LDP T R+W L+M D D S+E+ KWW++ER +F GR SF
Sbjct: 272 ----KKQKPKVQLDPETSRVWKLLMLSIDSDGVD-GSDEEKRKWWEEERNMFHGRANSFI 326
Query: 197 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSD 256
ARM ++ G+R F PWKGSVVDSVVGV+LTQNV+D+ SSSAYM LAA FP+ D
Sbjct: 327 ARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVADHSSSSAYMDLAAEFPV---------D 377
Query: 257 ENLRTTASLEPIGSNST 273
N + LE GS+ T
Sbjct: 378 WNFNKGSCLEEWGSSVT 394
>gi|255575365|ref|XP_002528585.1| Protein ROS1, putative [Ricinus communis]
gi|223531981|gb|EEF33793.1| Protein ROS1, putative [Ricinus communis]
Length = 1634
Score = 516 bits (1330), Expect = e-143, Method: Compositional matrix adjust.
Identities = 274/559 (49%), Positives = 345/559 (61%), Gaps = 88/559 (15%)
Query: 567 DWDLLRR-IYSTGE--ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR+ + + G ER+ + MDS+DWEAVRCA+ ++IA+ IKERG N++A RIK FL
Sbjct: 1103 DWDALRKQVEANGRKRERTPNTMDSLDWEAVRCAEVNDIANTIKERGMNNVLAERIKNFL 1162
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NRLV HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1163 NRLVREHGSIDLEWLRDVPPDKAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRI 1222
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+F
Sbjct: 1223 AVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVF 1282
Query: 744 CKKRSPNCGACPLRGECKHFASAVA----------------------------------- 768
C K PNC ACP+RGEC+HFASA A
Sbjct: 1283 CTKSKPNCNACPMRGECRHFASAFARSKNLNVIKETSYNWSFRLSTMNSMCKQSTGLFPV 1342
Query: 769 --------SARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADP--------- 811
SAR ALPGP EKG+V++ +P +V +P+ +A
Sbjct: 1343 SKDHHVKCSARLALPGPEEKGMVSATENRTNEPNPAAMVGQLPLPLPQATEQSEENQLSK 1402
Query: 812 ---VSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFK 868
SE+ Y +N+CEPI+EEP SP P+ + E++++D D +EIPT++L EF
Sbjct: 1403 SIQQSEAQYGVNSCEPIVEEPSSPEPERIQVTENDMEDT--FCEDPDEIPTIKLNIEEFT 1460
Query: 869 ENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGF 928
+N N+M+ N + LQ+ ALT + ++ H +Y L +
Sbjct: 1461 QNLQNYMQ-NNMELQECDMSKALTGSS-------------IQYGHPLYTLTLFQSHI--- 1503
Query: 929 DRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN-SKDSKLCNSEICYSCNSVCERNAGIV 987
PS + A E+ NS+ P+ CN +D KLC+ + C+SCNS+ E N+ IV
Sbjct: 1504 ------SPSYWTCAC----ETANSIQPPENNCNLQEDGKLCDEKTCFSCNSIREANSQIV 1553
Query: 988 RGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSA 1047
RGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S +PI+VPR I NL R V FG+S
Sbjct: 1554 RGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLNPIDVPRSWIWNLPRRTVYFGTSI 1613
Query: 1048 TTLFNALDMRQIQEYFWRG 1066
T+F L IQ FWRG
Sbjct: 1614 PTIFKGLTTEGIQHCFWRG 1632
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 82/151 (54%), Positives = 102/151 (67%), Gaps = 24/151 (15%)
Query: 114 HNAVVPYQVGPSSEH--------GALVPHQ---IKEKRKKLKPEVVLDPGTLRMWNLIM- 161
NA+VPY G ++ G +VP+ IK++R + P+V LD T ++W L+M
Sbjct: 739 QNALVPYSAGNEQQNALVLYRSNGTVVPYTDSLIKKRRSR--PQVDLDEETNKVWRLLMA 796
Query: 162 NID----DGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVD 217
NI+ +GTT ED KWW++ER VF GR SF ARMHL+ GDRRF WKGSVVD
Sbjct: 797 NINSEGINGTT------EDKAKWWEEERNVFRGRANSFIARMHLVQGDRRFSQWKGSVVD 850
Query: 218 SVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
SVVGV+LTQNVSD+LSSSA+MSLAA FPLKS
Sbjct: 851 SVVGVFLTQNVSDHLSSSAFMSLAAHFPLKS 881
>gi|79496905|ref|NP_195132.3| DEMETER-like protein 3 [Arabidopsis thaliana]
gi|84028203|sp|O49498.2|DML3_ARATH RecName: Full=DEMETER-like protein 3
gi|332660916|gb|AEE86316.1| DEMETER-like protein 3 [Arabidopsis thaliana]
Length = 1044
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 268/538 (49%), Positives = 342/538 (63%), Gaps = 14/538 (2%)
Query: 567 DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 626
DW+ LRR+Y+ R MDSV+W VR + ++ + IK+RGQ I++ RI +FLN
Sbjct: 501 DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRILSERILKFLNDE 560
Query: 627 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
V +G+IDLEWLRN P VK YLL+IEG+GLKS ECVRLL L+H AFPVD NVGRIAVR
Sbjct: 561 VNQNGNIDLEWLRNAPSHLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVR 620
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
LG VPL+PLP + +H L EYP MD IQ YLWPRLC L Q+TLYELHYQMITFGK+FC K
Sbjct: 621 LGLVPLEPLPNGVQMHQLFEYPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTK 680
Query: 747 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 806
PNC ACP++ ECK+FASA S++ L P EK + F N Q + V+ I
Sbjct: 681 TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 738
Query: 807 IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 863
+ + VS C +P++E P SP + ES ++I+D F +P +
Sbjct: 739 LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 796
Query: 864 EREFKENFPNFMEMNKVMLQD----SSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 916
K++ + + ++ M S ALV T E A +P RK+K LRTEH VY
Sbjct: 797 LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 856
Query: 917 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSC 976
VLPD HELL F+RR DDPSPYLLAIW PGE+ +S P+ +C+S SKLC + C C
Sbjct: 857 VLPDNHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSSDGSKLCKIKNCSYC 916
Query: 977 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1036
++ E+N+ I RGTILIPCRTA RG FPLNGTYFQ NEVFADHETS +PI R L
Sbjct: 917 WTIREQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGL 976
Query: 1037 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKK 1094
+ + GS+ T++F LD R+I+ FW GF+C+R FDR+ R P+ LV RLH P ++
Sbjct: 977 EKRALYCGSTVTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTPPDER 1034
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 82/115 (71%), Gaps = 2/115 (1%)
Query: 132 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 191
V IK +K + +V LDP T++ W+++M ++D ++ E E W+KERE+F+ R
Sbjct: 334 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSYDDKE-TEAKWKKEREIFQTR 391
Query: 192 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 246
I F RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 392 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 446
>gi|52354425|gb|AAU44533.1| hypothetical protein AT4G34060 [Arabidopsis thaliana]
Length = 1044
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 268/538 (49%), Positives = 342/538 (63%), Gaps = 14/538 (2%)
Query: 567 DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 626
DW+ LRR+Y+ R MDSV+W VR + ++ + IK+RGQ I++ RI +FLN
Sbjct: 501 DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRILSERILKFLNDE 560
Query: 627 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
V +G+IDLEWLRN P VK YLL+IEG+GLKS ECVRLL L+H AFPVD NVGRIAVR
Sbjct: 561 VNQNGNIDLEWLRNAPSHLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVR 620
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
LG VPL+PLP + +H L EYP MD IQ YLWPRLC L Q+TLYELHYQMITFGK+FC K
Sbjct: 621 LGLVPLEPLPNGVQMHQLFEYPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTK 680
Query: 747 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 806
PNC ACP++ ECK+FASA S++ L P EK + F N Q + V+ I
Sbjct: 681 TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 738
Query: 807 IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 863
+ + VS C +P++E P SP + ES ++I+D F +P +
Sbjct: 739 LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 796
Query: 864 EREFKENFPNFMEMNKVMLQD----SSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 916
K++ + + ++ M S ALV T E A +P RK+K LRTEH VY
Sbjct: 797 LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 856
Query: 917 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSC 976
VLPD HELL F+RR DDPSPYLLAIW PGE+ +S P+ +C+S SKLC + C C
Sbjct: 857 VLPDNHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSSDGSKLCKIKNCSYC 916
Query: 977 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1036
++ E+N+ I RGTILIPCRTA RG FPLNGTYFQ NEVFADHETS +PI R L
Sbjct: 917 WTIREQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGL 976
Query: 1037 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKK 1094
+ + GS+ T++F LD R+I+ FW GF+C+R FDR+ R P+ LV RLH P ++
Sbjct: 977 EKRALYCGSTVTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTPPDER 1034
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 83/115 (72%), Gaps = 2/115 (1%)
Query: 132 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 191
V IK +K + +V LDP T++ W+++M ++D ++ +++ E +KERE+F+ R
Sbjct: 334 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSY-DDKETEAKXKKEREIFQTR 391
Query: 192 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 246
I F RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 392 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 446
>gi|115464171|ref|NP_001055685.1| Os05g0445900 [Oryza sativa Japonica Group]
gi|113579236|dbj|BAF17599.1| Os05g0445900, partial [Oryza sativa Japonica Group]
Length = 473
Score = 508 bits (1307), Expect = e-140, Method: Compositional matrix adjust.
Identities = 259/477 (54%), Positives = 322/477 (67%), Gaps = 17/477 (3%)
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
LR+VPPD K+YLL I GLGLKSVECVRLL+L H+AFPVD NVGRI VRLGWVP+QPLP
Sbjct: 1 LRDVPPDSAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRICVRLGWVPIQPLPE 60
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
L +HLL+ YPV++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K PNC ACP+R
Sbjct: 61 SLQLHLLELYPVLETIQKYLWPRLCKLDQQTLYELHYQMITFGKVFCTKSKPNCNACPMR 120
Query: 758 GECKHFASAVASARFALPGPSEKGIVTSE----FGNGIGQSPPLVVNPIPVIRIEADPVS 813
EC+HFASA ASAR ALP P +K +V F NG +P N P+ ++E +
Sbjct: 121 SECRHFASAFASARLALPSPQDKRLVNLSNQFAFHNGTMPTP----NSTPLPQLEGS-IH 175
Query: 814 ESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPN 873
NN PIIEEP SP + D +DF D +EIP ++L F +N N
Sbjct: 176 ARDVHANNTNPIIEEPASPREEECRELLEN-DIEDFD-EDTDEIPIIKLNMEAFSQNLEN 233
Query: 874 FM-EMNKVMLQD--SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLR--GF 928
+ E NK D + ALVA++ EAAS+P KLK LRTEH+VY LPD+H L++
Sbjct: 234 CIKESNKDFQSDDITKALVAISNEAASIPVPKLKNVHRLRTEHYVYELPDSHPLMQQLAL 293
Query: 929 DRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSK-DSKLCNSEICYSCNSVCERNAGIV 987
D+R+PDDPSPYLLAIW+P E ++ +P+ CN + + LC++E+C++C S E V
Sbjct: 294 DQREPDDPSPYLLAIWTPDELKDTREAPKPCCNPQTEGGLCSNEMCHNCVSERENQYRYV 353
Query: 988 RGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSA 1047
RGT+L+PCRTA RG FPLNGTYFQVNEVFADH +SH+PIN+PR + NL R +V FG+S
Sbjct: 354 RGTVLVPCRTAMRGSFPLNGTYFQVNEVFADHSSSHNPINIPREQLWNLHRRMVYFGTSV 413
Query: 1048 TTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSRDE 1104
T+F L +IQ FWRGFVCVRGF+ R PRPL H + SK SK++ E
Sbjct: 414 PTIFKGLTTEEIQHCFWRGFVCVRGFNMETRAPRPLCPHFHLAASKLRRSSKKAATE 470
>gi|6143875|gb|AAF04422.1|AC010927_15 hypothetical protein [Arabidopsis thaliana]
Length = 1309
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 291/698 (41%), Positives = 393/698 (56%), Gaps = 63/698 (9%)
Query: 404 ITQSSVTQFWP-TGNSTADVASPSKT---------------CIKESSIAASTEIPQLENT 447
+TQ ++ P TG ST + +P++ C +ESS + + I + +
Sbjct: 613 VTQETILNLDPRTGVSTPRIRNPTRVIIEEIDDDENDIDAVCSQESSKTSDSSITSADQS 672
Query: 448 -ALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEIS 506
+L D + +L ++ K G+ T NDL +SS+ + ++
Sbjct: 673 KTMLLDPFNTVLMNEQVDSQMVK----GKGHIPYTDDLNDLSQGISMVSSASTHCELNLN 728
Query: 507 VLPSNRESFRTGM-PQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQ 565
+P E P++ T ++S R + K K + KK+ S
Sbjct: 729 EVPPEVELCSHQQDPESTIQTQDQQESTRTEDVKKNRKKPTTSKPKKKSKESAKSTQKKS 788
Query: 566 QDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEF 622
DWD LR+ +G ER+ MD+VDW+A+RC D +IA+ I +RG N++A RIK F
Sbjct: 789 VDWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAF 848
Query: 623 LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 682
LNRLV+ HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLLSL IAFPVD NVGR
Sbjct: 849 LNRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGR 908
Query: 683 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
IAVRLGWVPLQPLP +L +HLL+ LYELHY MITFGK+
Sbjct: 909 IAVRLGWVPLQPLPDELQMHLLE-----------------------LYELHYHMITFGKV 945
Query: 743 FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPI 802
FC K PNC ACP++ EC+H++SA ASAR ALP P E + + P+VVN
Sbjct: 946 FCTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFR 1005
Query: 803 PVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEE------ 856
P + + + E+ + NCEPIIEEP SP P+ E + D ++G E+
Sbjct: 1006 PSLFLYQEKEQEA-QRSQNCEPIIEEPASPEPEYIEHDIEDYPRDKNNVGTSEDPWENKD 1064
Query: 857 -IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHV 915
IPT+ L + + + + +NK S LV L+ AA++P RKLK LRTEHHV
Sbjct: 1065 VIPTIILNK---EAGTSHDLVVNKEA-GTSHDLVVLSTYAAAIPRRKLKIKEKLRTEHHV 1120
Query: 916 YVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN--SKDSKLCNSEIC 973
+ LPD H +L GF+RR+ +D PYLLAIW+PGE+ NS+ P+ RC ++ LCN C
Sbjct: 1121 FELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNENKC 1180
Query: 974 YSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTI 1033
+ CN E + VRGTILIPCRTA RG FPLNGTYFQ NEVFADH++S +PI+VP I
Sbjct: 1181 FQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTELI 1240
Query: 1034 ANLRREIVCFGSSATTLFNALDMRQIQEYFWRG-FVCV 1070
+L+R + GSS +++ L + I+ F G +C+
Sbjct: 1241 WDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGMLICL 1278
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/171 (46%), Positives = 105/171 (61%), Gaps = 17/171 (9%)
Query: 78 QSTTADIIRK-FKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQI 136
+S T D I K F++L I G LP N A++ Y+ E A+V +
Sbjct: 443 RSLTIDAITKLFEELDINKEGLCLP---------HNRETALILYKKS-YEEQKAIVKYS- 491
Query: 137 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 196
KK KP+V LDP T R+W L+M+ D D S+E+ KWW++ER +F GR SF
Sbjct: 492 ----KKQKPKVQLDPETSRVWKLLMSSIDCDGVD-GSDEEKRKWWEEERNMFHGRANSFI 546
Query: 197 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 247
ARM ++ G+R F PWKGSVVDSVVGV+LTQNV+D+ SSSAYM LAA FP++
Sbjct: 547 ARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVADHSSSSAYMDLAAEFPVE 597
>gi|125552526|gb|EAY98235.1| hypothetical protein OsI_20146 [Oryza sativa Indica Group]
Length = 1802
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 272/559 (48%), Positives = 341/559 (61%), Gaps = 56/559 (10%)
Query: 553 KTTPKKNSDNTVQQDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKER 609
+TT KKNS+N DWD LRR +ER D+ DSVDWEAVRCAD I+ AI+ER
Sbjct: 1290 RTTSKKNSENF---DWDKLRRQACGNYQMKERIFDRRDSVDWEAVRCADVQRISHAIRER 1346
Query: 610 GQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL 669
G N++A RI++FLNRLV HGSIDLEWLR+VPPD K+YLL I GLGLKSVECVRLL+L
Sbjct: 1347 GMNNVLAERIQKFLNRLVTDHGSIDLEWLRDVPPDSAKDYLLSIRGLGLKSVECVRLLTL 1406
Query: 670 QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTL 729
H+AFPVD NVGRI VRLGWVP+QPLP L +HLL+ YPV++ IQ YLWPRLC LDQ+TL
Sbjct: 1407 HHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYLWPRLCKLDQQTL 1466
Query: 730 YELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGN 789
YELHYQMITFGK CK C F+S + + +F N
Sbjct: 1467 YELHYQMITFGK--CKT-------------CTSFSSGQKVGEYE---------QSFDFQN 1502
Query: 790 GIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDF 849
G +P + P++++E + NN PIIEEP SP + D +DF
Sbjct: 1503 GTMPTP----HSTPLLQLEGS-IHARDVHANNTNPIIEEPASPREEECRELLEN-DIEDF 1556
Query: 850 SIGDIEEIPTLRLQEREFKENFPNFM-EMNKVMLQD--SSALVALTAEAASVPTRKLKRC 906
D +EIPT++L F +N N + E NK D + ALVA++ EAAS+P KLK
Sbjct: 1557 D-EDTDEIPTIKLNMEAFAQNLENCIKESNKDFQSDDITKALVAISNEAASIPVPKLKNV 1615
Query: 907 AHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSK-DS 965
LRTEH+V+ R SP+ + E ++ +P+ CN + +
Sbjct: 1616 HRLRTEHYVFC----------SQLRTSRFTSPH-----ATDELKDTREAPKPCCNPQTEG 1660
Query: 966 KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHP 1025
LC++E+C++C S E VRGT+L+PCRTA RG FPLNGTYFQVNEVFADH +SH+P
Sbjct: 1661 GLCSNEMCHNCVSERENQYRYVRGTVLVPCRTAMRGSFPLNGTYFQVNEVFADHSSSHNP 1720
Query: 1026 INVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVN 1085
IN+PR + NL R +V FG+S T+F L +IQ FWRGFVCVRGFD R PRPL
Sbjct: 1721 INIPREQLWNLHRRMVYFGTSVPTIFKGLTTEEIQHCFWRGFVCVRGFDMETRAPRPLCP 1780
Query: 1086 RLHCSPSKKEAGSKRSRDE 1104
H + SK SK + E
Sbjct: 1781 HFHLAASKLRRSSKTAATE 1799
Score = 124 bits (310), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 77/136 (56%), Positives = 98/136 (72%), Gaps = 15/136 (11%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIK---EKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQ 171
NA+VPY VG GA+VP+Q + K+++ + +V LD T R+WNL+M G D
Sbjct: 809 NALVPY-VG-----GAVVPYQTQFHLVKKQRPRAKVDLDFETTRVWNLLM----GKAADP 858
Query: 172 TSNEDM--EKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVS 229
D+ E+WW++EREVF+GR SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNV+
Sbjct: 859 VDGTDVDKERWWKQEREVFQGRANSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVA 918
Query: 230 DNLSSSAYMSLAARFP 245
D+LSSSAYM+LAA FP
Sbjct: 919 DHLSSSAYMALAASFP 934
>gi|224100667|ref|XP_002311968.1| predicted protein [Populus trichocarpa]
gi|222851788|gb|EEE89335.1| predicted protein [Populus trichocarpa]
Length = 411
Score = 483 bits (1244), Expect = e-133, Method: Compositional matrix adjust.
Identities = 235/406 (57%), Positives = 296/406 (72%), Gaps = 9/406 (2%)
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
LGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TLYELHYQMITFGK+FC K
Sbjct: 1 LGWVPLQPLPESLQLHLLELYPILESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTK 60
Query: 747 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 806
PNC ACP+R EC+HFASA SAR ALPGP K I TS +SP +V+NP+P++
Sbjct: 61 SRPNCNACPMRAECRHFASAFTSARLALPGPETKDITTSTVPFMPEKSPSIVINPMPLLP 120
Query: 807 IEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQERE 866
E + G+ I +CEPIIEEP +P + +E E++I+D D +EIPT++L E
Sbjct: 121 PEDNQHKSVGFDIGSCEPIIEEPVTPDQEQTELAETDIEDFG---EDPDEIPTIKLNMEE 177
Query: 867 FKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHE 923
F EN N++ N + LQ+ S ALVAL AS+PT KLK + LRTEH VY LPD+H
Sbjct: 178 FTENLQNYIHSN-MELQEYDMSKALVALDPN-ASIPTPKLKNVSRLRTEHQVYELPDSHP 235
Query: 924 LLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKD-SKLCNSEICYSCNSVCER 982
LL+G D+R+PDDPSPYLLAIW+PGE+ NS+ P+ +C S++ +KLCN +IC+SCNS+ E
Sbjct: 236 LLQGMDKREPDDPSPYLLAIWTPGETANSIEPPEQQCQSREPNKLCNEKICFSCNSIREA 295
Query: 983 NAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVC 1042
N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S +PI+VPR I NL R IV
Sbjct: 296 NSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSLNPIDVPRSLIWNLPRRIVY 355
Query: 1043 FGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLH 1088
FG+S ++F L +Q FW+GFVCVRGFD++ R PRPL RLH
Sbjct: 356 FGTSVPSIFKGLSTEGVQHCFWKGFVCVRGFDQKTRAPRPLKARLH 401
>gi|138996988|dbj|BAF52856.1| repressor of silencing 2a [Nicotiana tabacum]
Length = 1673
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 245/461 (53%), Positives = 304/461 (65%), Gaps = 44/461 (9%)
Query: 543 KSDMKNDRTKKTTPKKNSDNTVQQDWDLLR---RIYSTGEERSHDKMDSVDWEAVRCADE 599
K D ND K + DWD LR + E++ + +DS+DWEAVRCA+
Sbjct: 1222 KVDSVNDNPSKAKKGQLGKEKENIDWDSLRLEAQANGKKREKTANTLDSLDWEAVRCANV 1281
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
+EIA I+ERG N +A RIK FLNR+V HGSIDLEWLR+VPPDK KEYLL I GLGLK
Sbjct: 1282 NEIAHTIRERGMNNKLAERIKNFLNRIVSEHGSIDLEWLRDVPPDKAKEYLLSIRGLGLK 1341
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
SVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWP
Sbjct: 1342 SVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPILESIQKYLWP 1401
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSE 779
RLC LDQ+TLYELHY MITFGK+FC K PNC ACPLRGEC+HFASA ASAR ALP P E
Sbjct: 1402 RLCKLDQRTLYELHYHMITFGKVFCTKSKPNCNACPLRGECRHFASAFASARLALPAPEE 1461
Query: 780 KGIVTSEFGNGIGQSP-------PL--------------VVNPIPVIRIEADPVSESGYQ 818
K IV++ +P PL ++N P+I + A P
Sbjct: 1462 KSIVSATENKASNNNPRENFTHLPLPLPPGNQQPVEHQKLINSAPIIEVPATP------- 1514
Query: 819 INNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNF---- 874
EPI+E P +P + ++PE +I+D F D EIPT+ L EF +N +
Sbjct: 1515 ----EPIVELPATPEQEQIQAPEIDIEDTYFE--DPCEIPTIELNMAEFTQNLKKYVKNN 1568
Query: 875 MEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPD 934
ME+++V + S+ALVALT+EAAS+P KLK + LRTEH VY LPD+H LL G D+R+PD
Sbjct: 1569 MELHQVEI--SNALVALTSEAASIPVPKLKNVSRLRTEHQVYELPDSHPLLEGLDKREPD 1626
Query: 935 DPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICY 974
DP YLLAIW+PGE+ NS P+ +CNS++S KLC +E +
Sbjct: 1627 DPCSYLLAIWTPGETANSFQPPETQCNSQESGKLCENETMF 1667
Score = 114 bits (286), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 74/135 (54%), Positives = 93/135 (68%), Gaps = 7/135 (5%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIKEKRK-KLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTS 173
+A+V YQ G++VP RK K +P+V +D T R+W L++ + D T
Sbjct: 807 SALVLYQ-----RDGSIVPFGSSLVRKRKPRPKVDVDDETDRVWKLLLQDINSEGVDGT- 860
Query: 174 NEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLS 233
+ED KWW++ER VF R SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LS
Sbjct: 861 DEDKAKWWEEERRVFNSRADSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLS 920
Query: 234 SSAYMSLAARFPLKS 248
SSA+MSLAA FPLK+
Sbjct: 921 SSAFMSLAAHFPLKT 935
>gi|138996990|dbj|BAF52857.1| repressor of silencing 2b [Nicotiana tabacum]
Length = 1673
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 245/461 (53%), Positives = 304/461 (65%), Gaps = 44/461 (9%)
Query: 543 KSDMKNDRTKKTTPKKNSDNTVQQDWDLLR---RIYSTGEERSHDKMDSVDWEAVRCADE 599
K D ND K + DWD LR + E++ + +DS+DWEAVRCA+
Sbjct: 1222 KVDSVNDNPSKAKKGQLGKEKENIDWDSLRLEAQANGKKREKTANTLDSLDWEAVRCANV 1281
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
+EIA I+ERG N +A RIK FLNR+V HGSIDLEWLR+VPPDK KEYLL I GLGLK
Sbjct: 1282 NEIAHTIRERGMNNKLAERIKNFLNRIVSEHGSIDLEWLRDVPPDKAKEYLLSIRGLGLK 1341
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
SVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWP
Sbjct: 1342 SVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPILESIQKYLWP 1401
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSE 779
RLC LDQ+TLYELHY MITFGK+FC K PNC ACPLRGEC+HFASA ASAR ALP P E
Sbjct: 1402 RLCKLDQRTLYELHYHMITFGKVFCTKSKPNCNACPLRGECRHFASAFASARLALPAPEE 1461
Query: 780 KGIVTSEFGNGIGQSP-------PL--------------VVNPIPVIRIEADPVSESGYQ 818
K IV++ +P PL ++N P+I + A P
Sbjct: 1462 KSIVSATENKASNNNPRENFTHLPLPLPPGNQQPVEHQKLINSAPIIEVPATP------- 1514
Query: 819 INNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNF---- 874
EPI+E P +P + ++PE +I+D F D EIPT+ L EF +N +
Sbjct: 1515 ----EPIVELPATPEQEQIQAPEIDIEDTYFE--DPCEIPTIELNMAEFTQNLKKYVKNN 1568
Query: 875 MEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPD 934
ME+++V + S+ALVALT+EAAS+P KLK + LRTEH VY LPD+H LL G D+R+PD
Sbjct: 1569 MELHQVEI--SNALVALTSEAASIPVPKLKNVSRLRTEHQVYELPDSHPLLEGLDKREPD 1626
Query: 935 DPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICY 974
DP YLLAIW+PGE+ NS P+ +CNS++S KLC +E +
Sbjct: 1627 DPCSYLLAIWTPGETANSFQPPETQCNSQESGKLCENETMF 1667
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 74/135 (54%), Positives = 93/135 (68%), Gaps = 7/135 (5%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIKEKRK-KLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTS 173
+A+V YQ G++VP RK K +P+V +D T R+W L++ + D T
Sbjct: 807 SALVLYQ-----RDGSIVPFGSSLVRKRKPRPKVDVDDETDRVWKLLLQDINSEGVDGT- 860
Query: 174 NEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLS 233
+ED KWW++ER VF R SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LS
Sbjct: 861 DEDKAKWWEEERRVFNSRADSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLS 920
Query: 234 SSAYMSLAARFPLKS 248
SSA+MSLAA FPLK+
Sbjct: 921 SSAFMSLAAHFPLKT 935
>gi|45935143|gb|AAS79601.1| putative endonuclease III protein [Ipomoea trifida]
gi|118562896|dbj|BAF37786.1| hypothetical protein [Ipomoea trifida]
Length = 1687
Score = 467 bits (1201), Expect = e-128, Method: Compositional matrix adjust.
Identities = 282/634 (44%), Positives = 351/634 (55%), Gaps = 125/634 (19%)
Query: 543 KSDMKNDRTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEI 602
KS K + K PK + D W+ LRR Y MDS+DW AVRCA EI
Sbjct: 1098 KSTEKKGKVTKMMPKPDVD------WEELRRTYYNPNRTPGTLMDSIDWNAVRCAPVGEI 1151
Query: 603 ADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVE 662
A I+ RG N++A +IK FL+RLVE HGSIDLEWL++VPP+K KE+LL I G+GLKS E
Sbjct: 1152 AKVIENRGMNNVLAEKIKAFLDRLVEDHGSIDLEWLKDVPPEKAKEFLLSIRGVGLKSTE 1211
Query: 663 CVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLC 722
CVRLL+L H AFPVD N+ RI VRLGWVPL+PLPGDL IHLL +
Sbjct: 1212 CVRLLTLGHHAFPVDTNIARIVVRLGWVPLEPLPGDLQIHLLDQ---------------- 1255
Query: 723 YLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEK-G 781
Y LHYQ+ITFGK+ C K++PNC ACP+R ECKHFASA AS+R L G EK G
Sbjct: 1256 -------YVLHYQLITFGKVICTKKNPNCNACPMRAECKHFASAFASSRLRLRGAPEKPG 1308
Query: 782 IVTSE---------------FGNG-IGQSPPLVVNPIPVIRIEADPV------------- 812
+ S+ F + I Q P + I + +ADP
Sbjct: 1309 MSNSQPMLPHIPDIEDFPYKFKDSHIRQQPSALA--IEIASQDADPTRGLVRDIEDFPYE 1366
Query: 813 ----------SESGYQINN---------CEPIIEEPRS-------------------PGP 834
E Y+++N CEPIIE P S P P
Sbjct: 1367 LGNKYEILASQEFPYELDNKDEKSYTQTCEPIIEVPASLEPESQTCEPIIEVPASPEPEP 1426
Query: 835 QCSESPESEI--------DDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSS 886
+ + S E +I DDDD EIP ++L EFK N F+ + S+
Sbjct: 1427 ESTVSLERDISNILHETEDDDD-------EIPHIKLDTEEFKRNLKTFLNSEFEDEEVSN 1479
Query: 887 ALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSP 946
ALVALT + ++P K+K LRT H VY++PD+H LL GF+RR+ DDP PYLLAIW P
Sbjct: 1480 ALVALTPQDTTIPAPKIKSVERLRTRHRVYIVPDSHPLLIGFERRELDDPCPYLLAIW-P 1538
Query: 947 GESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLN 1006
ES N S C S+DS +C+ ++ ++ E N V GTILIPCRTAN+G FPLN
Sbjct: 1539 QESLNVKES----C-SQDSLICSGDL---DSAFQETNNQTVCGTILIPCRTANKGSFPLN 1590
Query: 1007 GTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRG 1066
GTYFQVNEVFADHE+S PIN+PR I NL + + GS+A+ + + M +IQ FW G
Sbjct: 1591 GTYFQVNEVFADHESSVCPINIPRKWIWNLTQRYLYCGSTASAIARGMQMDEIQHCFWNG 1650
Query: 1067 FVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKR 1100
FVCVR FDR+ R P L R H PS AG K+
Sbjct: 1651 FVCVRAFDRQTRNPIHLSKRFHELPSL--AGKKK 1682
Score = 99.8 bits (247), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 87/236 (36%), Positives = 133/236 (56%), Gaps = 26/236 (11%)
Query: 83 DIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPH--QIKEKR 140
+II+K + L I D QL ++ N+H AV+P+ + GA++P+ ++++KR
Sbjct: 719 EIIQKLQRLQISDEHGQLVVR------DLNLHGAVIPFN---NKFDGAIIPYNNKLEKKR 769
Query: 141 KKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMH 200
K P+V LDP +LR WNL+M D + + +E +K W++++E+F R++ F MH
Sbjct: 770 KMPMPKVDLDPESLRRWNLLMETD-ASELPEDEDEKKKKDWEEQKEIFCSRLKEFITCMH 828
Query: 201 LILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLS----SSAYMSLAARFPLKSTNDHTPSD 256
++ GDRRF PWKGSVVDSVVGV+LTQNVSD LS S+A+M LAA++ + ST+ D
Sbjct: 829 VLQGDRRFSPWKGSVVDSVVGVFLTQNVSDYLSRFFYSNAFMYLAAKYGVTSTSCDIQVD 888
Query: 257 ENLRTTASLEPIGSNSTSNGAVYDSE--------GNMYFVTEP--EPDRCCELKDR 302
++ P G N+ + + S G+ V P D CC D+
Sbjct: 889 SPCSQESTSNPTGPNNVQSHETFISSAEKEPGSLGDDRVVKHPPVNHDFCCSSPDK 944
>gi|26451047|dbj|BAC42629.1| unknown protein [Arabidopsis thaliana]
gi|28950995|gb|AAO63421.1| At5g04570 [Arabidopsis thaliana]
Length = 416
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 229/412 (55%), Positives = 287/412 (69%), Gaps = 7/412 (1%)
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
+GWVPLQPLP L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC K
Sbjct: 1 MGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFCTK 60
Query: 747 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV-I 805
PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++ PP+ + I + +
Sbjct: 61 SRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIELPL 120
Query: 806 RIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQER 865
+E S + NCEPIIEEP SPG +C+E ES+I+D ++ D +EIPT++L
Sbjct: 121 PLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLNIE 179
Query: 866 EFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAH 922
+F ME N + LQ+ S ALVAL S+PT KLK + LRTEH VY LPD+H
Sbjct: 180 QFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPDSH 238
Query: 923 ELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCE 981
LL G D+R+PDDPSPYLLAIW+PGE+ NS P+ +C K S K+C E C CNS+ E
Sbjct: 239 RLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSLRE 298
Query: 982 RNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIV 1041
N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR I +L R V
Sbjct: 299 ANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRRTV 358
Query: 1042 CFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
FG+S T++F L QIQ FW+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 359 YFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 410
>gi|357495073|ref|XP_003617825.1| DNA glycosylase [Medicago truncatula]
gi|355519160|gb|AET00784.1| DNA glycosylase [Medicago truncatula]
Length = 1004
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 244/551 (44%), Positives = 329/551 (59%), Gaps = 64/551 (11%)
Query: 566 QDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNR 625
Q W+ LR+I+S ++ D DSVDWEAVR A+ +E+A AIK+RGQQNIIA +I+ L
Sbjct: 452 QHWETLRKIHSKSDQHI-DHADSVDWEAVRDANVNEVAAAIKKRGQQNIIAYKIQVALKG 510
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAV 685
+ HGS++LEWL+++PP++ KEY L I GLGLKSVEC+RLL+LQHI+FPVDVNVGRI V
Sbjct: 511 FMVNHGSMNLEWLKDIPPNEAKEYFLSIFGLGLKSVECLRLLTLQHISFPVDVNVGRIVV 570
Query: 686 RLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK 745
RLGWVPLQPLP + IH L+++P KIQ YLWPRLC LD +TLYELHYQ+ITFGK+FC
Sbjct: 571 RLGWVPLQPLPESIQIHNLEKFPDPIKIQQYLWPRLCKLDHQTLYELHYQLITFGKVFCT 630
Query: 746 KRSPNCGACPLRGECKHFASAVASARFALPGPS--EKGIVTSEFGNGIGQSPPLVVNPIP 803
KR+PNC ACP++ CK++AS++A + ALP S ++ IV ++ + S +
Sbjct: 631 KRNPNCNACPMKDGCKYYASSLARTKLALPPKSTTDQSIVATQMDHCFPYSDYWSNSTST 690
Query: 804 VIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDI--------- 854
+ E+ CEPI+E P SP IDDD+ D
Sbjct: 691 LFTKES----------KECEPIVEMPASP---------ELIDDDEEIYHDYTYESDEEDI 731
Query: 855 --------------EEIPTLRLQEREFKENFP---NFMEMNKVMLQDSSALVALTAEAAS 897
E+IPT + +E P N E + S +LVA A +
Sbjct: 732 ESDEEDIESDEEDIEDIPTFNISSQESSSCLPMYGNSFEEFDHGVNASQSLVAFHPNATN 791
Query: 898 VPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQ 957
K+K + L+TE VYVL D H LL + R+ DDPSPYLL +W P E +S S +
Sbjct: 792 SHLSKMKNASRLKTERTVYVLTDNHPLLAEYPSREHDDPSPYLLVVWRPAELESSGESSK 851
Query: 958 YRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFA 1017
+ ++S + V GT+LIPCRTA R RFPLNGTYFQVNEVFA
Sbjct: 852 TDLHEEESSQTET----------------VPGTLLIPCRTAMRARFPLNGTYFQVNEVFA 895
Query: 1018 DHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRY 1077
D+ + PI+VPR I +L+++I FG+ A+++ L M +I+++F +GF+CVR D
Sbjct: 896 DYASMKKPIHVPRKWIWSLKKQIAYFGTGASSMTRGLSMEEIKDFFCKGFICVRAIDTNT 955
Query: 1078 RCPRPLVNRLH 1088
PRP+ + LH
Sbjct: 956 GAPRPISSILH 966
Score = 113 bits (283), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/102 (52%), Positives = 76/102 (74%), Gaps = 4/102 (3%)
Query: 146 EVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGD 205
EV+LD TLR+WNL++ K + ++E ++W+ R+++ + SF ++H + GD
Sbjct: 289 EVMLDEETLRVWNLLV----VENKHEENDEHKRRYWEMIRKLYHKMVISFLDQVHTVQGD 344
Query: 206 RRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 247
RRF PWKGSV+DSV GV+LTQNVSD+LSSSA+MSLAARFP+K
Sbjct: 345 RRFLPWKGSVLDSVGGVFLTQNVSDHLSSSAFMSLAARFPVK 386
>gi|147785279|emb|CAN77395.1| hypothetical protein VITISV_035357 [Vitis vinifera]
Length = 1824
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 263/551 (47%), Positives = 332/551 (60%), Gaps = 74/551 (13%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAI 606
+ KK ++ NT+ WD LR+ + ER+ + MDS+DWEAVRC+D +EIA+ I
Sbjct: 1262 KAKKGKARREEKNTLH--WDNLRKEAQVNGRKRERTVNTMDSLDWEAVRCSDVNEIANTI 1319
Query: 607 KERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
KERG N++A RIK+FLNRLV HGSIDLEWLR+VPPDK KEYLL GLGLKSVECVRL
Sbjct: 1320 KERGMNNMLAERIKDFLNRLVRDHGSIDLEWLRDVPPDKAKEYLLSFRGLGLKSVECVRL 1379
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
L+L H+AFPVD NVGRIAVRLGWVPLQPLP L L
Sbjct: 1380 LTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQ-----------------------LHL 1416
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSE 786
LYELHYQMITFGK+FC K PNC ACP+RGEC+HFASA ASAR AL GP E+ IV++
Sbjct: 1417 LELYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLALTGPEERSIVSTN 1476
Query: 787 FGNGIGQSPPLVVNPIPVIRIEADP-VSESGYQINNCEPIIEEPRSPGPQCSESPESEID 845
+ +P + +NP+P+ SE+ INNCEPI+E P +P + + ES+I+
Sbjct: 1477 ANESMDGNPDVTINPLPLPPPLPQKQSSEANPGINNCEPIVEVPATPEQEHPQILESDIE 1536
Query: 846 DDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD--SSALVALTAEAASVPTRKL 903
D + D +EIPT++L EF N N+M+ N + + S ALVALT E AS+P KL
Sbjct: 1537 DTLYE--DPDEIPTIKLNIEEFTHNLQNYMQRNMELQESDMSKALVALTPEVASIPMPKL 1594
Query: 904 KRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSK 963
K + LRTEHHV+ + + F SP+ PQ +
Sbjct: 1595 KNVSRLRTEHHVWTKENLMIHVHTF--------SPF---------------GPQVKLQIP 1631
Query: 964 DSKLCNSEICYSC-NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETS 1022
+ L + + V +R+A +IPCRTA RG FPLNGTYFQVNEVFADH++S
Sbjct: 1632 FNHLKEXVAXRNLEDYVMKRHA----SRAIIPCRTAMRGSFPLNGTYFQVNEVFADHDSS 1687
Query: 1023 HHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRP 1082
+PI+VPR I NL R V FG+S T+F +GFVCVRGFD++ R PRP
Sbjct: 1688 LNPIDVPRAWIWNLPRRTVYFGTSIPTIF-------------KGFVCVRGFDQKTRAPRP 1734
Query: 1083 LVNRLHCSPSK 1093
L+ RLH S+
Sbjct: 1735 LMARLHFPASR 1745
Score = 100 bits (249), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 86/181 (47%), Positives = 112/181 (61%), Gaps = 16/181 (8%)
Query: 84 IIRKFKDLTIRDGGSQLPLQCNKADGPSNVH----NAVVPYQVGPSSEHGALVPHQIK-- 137
II + K L I + S++ Q A P N++ NA+V Y+ G +VP +
Sbjct: 725 IIEQLKHLDI-NRESKISYQEQNALVPYNMNKEEKNALVLYK-----RDGTIVPFEDSFG 778
Query: 138 -EKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 196
K+++ +P V LD T R+W L+M + D T +E+ KWW++ER VF GR SF
Sbjct: 779 LVKKRRPRPRVDLDEETSRVWKLLMGNINSEGIDGT-DEEKAKWWEEERNVFRGRADSFI 837
Query: 197 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSD 256
ARMHL+ GDRRF W GSVVDSVVGV+LTQNVSD+LSSSA+MSLAA FP K +H PS
Sbjct: 838 ARMHLVQGDRRFSKWXGSVVDSVVGVFLTQNVSDHLSSSAFMSLAAHFPCKC--NHRPST 895
Query: 257 E 257
E
Sbjct: 896 E 896
>gi|186516091|ref|NP_001119112.1| DEMETER-like protein 3 [Arabidopsis thaliana]
gi|332660917|gb|AEE86317.1| DEMETER-like protein 3 [Arabidopsis thaliana]
Length = 957
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 231/458 (50%), Positives = 291/458 (63%), Gaps = 14/458 (3%)
Query: 567 DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 626
DW+ LRR+Y+ R MDSV+W VR + ++ + IK+RGQ I++ RI +FLN
Sbjct: 501 DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRILSERILKFLNDE 560
Query: 627 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
V +G+IDLEWLRN P VK YLL+IEG+GLKS ECVRLL L+H AFPVD NVGRIAVR
Sbjct: 561 VNQNGNIDLEWLRNAPSHLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVR 620
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
LG VPL+PLP + +H L EYP MD IQ YLWPRLC L Q+TLYELHYQMITFGK+FC K
Sbjct: 621 LGLVPLEPLPNGVQMHQLFEYPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTK 680
Query: 747 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 806
PNC ACP++ ECK+FASA S++ L P EK + F N Q + V+ I
Sbjct: 681 TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 738
Query: 807 IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 863
+ + VS C +P++E P SP + ES ++I+D F +P +
Sbjct: 739 LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 796
Query: 864 EREFKENFPNFMEMNKVMLQD----SSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 916
K++ + + ++ M S ALV T E A +P RK+K LRTEH VY
Sbjct: 797 LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 856
Query: 917 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSC 976
VLPD HELL F+RR DDPSPYLLAIW PGE+ +S P+ +C+S SKLC + C C
Sbjct: 857 VLPDNHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSSDGSKLCKIKNCSYC 916
Query: 977 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNE 1014
++ E+N+ I RGTILIPCRTA RG FPLNGTYFQ NE
Sbjct: 917 WTIREQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNE 954
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 82/115 (71%), Gaps = 2/115 (1%)
Query: 132 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 191
V IK +K + +V LDP T++ W+++M ++D ++ E E W+KERE+F+ R
Sbjct: 334 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSYDDKE-TEAKWKKEREIFQTR 391
Query: 192 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 246
I F RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 392 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 446
>gi|302818184|ref|XP_002990766.1| hypothetical protein SELMODRAFT_161136 [Selaginella moellendorffii]
gi|300141504|gb|EFJ08215.1| hypothetical protein SELMODRAFT_161136 [Selaginella moellendorffii]
Length = 470
Score = 421 bits (1081), Expect = e-114, Method: Compositional matrix adjust.
Identities = 239/499 (47%), Positives = 303/499 (60%), Gaps = 55/499 (11%)
Query: 613 NIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI 672
NI+AGR+K FL+R+ E HGSIDLEW+R+VPP K++LL I GLGLKSVEC+RLL+L H+
Sbjct: 3 NILAGRMKGFLDRVYEEHGSIDLEWIRDVPPQDAKDFLLSIRGLGLKSVECIRLLALDHL 62
Query: 673 AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYEL 732
AFPVD NVGRI VRLGWVP+QPLP +L +HLL+ YPV + +Q Y+WPRLC LD+ TLYEL
Sbjct: 63 AFPVDTNVGRILVRLGWVPIQPLPEELELHLLELYPVQETVQKYIWPRLCTLDRLTLYEL 122
Query: 733 HYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIG 792
HYQMITFGK+FC K PNC ACP+R EC+HFASA ASAR ALP P + + G+ G
Sbjct: 123 HYQMITFGKVFCTKTRPNCNACPMRMECRHFASAYASARLALPAPPTQ---CQQQGDKAG 179
Query: 793 QS---------PPLVVNPIPVIRIEADPVSESGYQINNCEPII------EEPRSPGPQCS 837
PP+ + P + + A P CEPII E
Sbjct: 180 PQQQEEVLTLPPPVSLPPATLTNVVATP--------KKCEPIIEEPGSPEPEPESESCEG 231
Query: 838 ESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAAS 897
S +++D F+ G TL L E + P + V S L+ L E A
Sbjct: 232 SSTCPDMEDLLFTQGS----ETLHLNLIEQTQAPPQY----TVASASSKELMVLPPEFAY 283
Query: 898 VPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQ 957
+P KLK LRT H+VY LPD H L++ + R+ DDP YLLA+WS E P
Sbjct: 284 IPVPKLKNIFRLRTVHYVYELPDHHPLVQQLEPREKDDPCFYLLALWSQDEQP------- 336
Query: 958 YRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFA 1017
++S+L N + + V+GT+LIPCRTA RG FPLNGTYFQVNEVFA
Sbjct: 337 -----QNSQLENQQ---------RVDDEFVKGTLLIPCRTAMRGSFPLNGTYFQVNEVFA 382
Query: 1018 DHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRY 1077
D ET + P+NV R + NL R+ V FG+S +F L ++Q FW+G+VCVRGFDR+
Sbjct: 383 DSETGNFPLNVSRSLLWNLHRKFVYFGTSVPAIFRGLTADEVQACFWKGYVCVRGFDRKS 442
Query: 1078 RCPRPLVNRLHCSPSKKEA 1096
R PRPL RLH +PS ++
Sbjct: 443 RSPRPLAARLHIAPSNRKG 461
>gi|302809902|ref|XP_002986643.1| hypothetical protein SELMODRAFT_450920 [Selaginella moellendorffii]
gi|302809904|ref|XP_002986644.1| hypothetical protein SELMODRAFT_450920 [Selaginella moellendorffii]
gi|300145531|gb|EFJ12206.1| hypothetical protein SELMODRAFT_450920 [Selaginella moellendorffii]
gi|300145532|gb|EFJ12207.1| hypothetical protein SELMODRAFT_450920 [Selaginella moellendorffii]
Length = 469
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 239/498 (47%), Positives = 303/498 (60%), Gaps = 54/498 (10%)
Query: 613 NIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI 672
NI+AGR+K FL+R+ E HGSIDLEW+R+VPP K++LL I GLGLKSVEC+RLL+L H+
Sbjct: 3 NILAGRMKGFLDRVYEEHGSIDLEWIRDVPPQDAKDFLLSIRGLGLKSVECIRLLALDHL 62
Query: 673 AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYEL 732
AFPVD NVGRI VRLGWVP+QPLP +L +HLL+ YPV + +Q Y+WPRLC LD+ TLYEL
Sbjct: 63 AFPVDTNVGRILVRLGWVPIQPLPEELELHLLELYPVQETVQKYVWPRLCTLDRLTLYEL 122
Query: 733 HYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIG 792
HYQMITFGK+FC K PNC ACP+R EC+HFASA ASAR ALP P + + G+ G
Sbjct: 123 HYQMITFGKVFCTKTRPNCNACPMRMECRHFASAYASARLALPAPPTQ---CEQQGDKAG 179
Query: 793 QS--------PPLVVNPIPVIRIEADPVSESGYQINNCEPII------EEPRSPGPQCSE 838
PP+ + P + + A P CEPII E
Sbjct: 180 AQQQEVLTLPPPVSLPPATLTNVVATP--------KKCEPIIEEPRSPEPEPESESCEGS 231
Query: 839 SPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASV 898
S +++D F+ G TL L E + P + V S L+ L E A +
Sbjct: 232 STCPDMEDLLFTQGS----ETLHLNLIEQPQAPPQY----TVASASSKELMVLPPEFAYI 283
Query: 899 PTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQY 958
P KLK LRT H+VY LPD H L++ + R+ DDP YLLA+WS E P
Sbjct: 284 PVPKLKNIFRLRTVHYVYELPDHHPLVQQLEPREKDDPCFYLLALWSQDEQP-------- 335
Query: 959 RCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFAD 1018
++S+L N + + V+GT+LIPCRTA RG FPLNGTYFQVNEVFAD
Sbjct: 336 ----QNSQLENQQ---------RVDDEFVKGTLLIPCRTAMRGSFPLNGTYFQVNEVFAD 382
Query: 1019 HETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYR 1078
ET + P+NV R + NL R+ V FG+S +F L ++Q FW+G+VCVRGFDR+ R
Sbjct: 383 SETGNFPLNVSRSLLWNLHRKFVYFGTSVPAIFRGLTADEVQACFWKGYVCVRGFDRKSR 442
Query: 1079 CPRPLVNRLHCSPSKKEA 1096
PRPL RLH +PS ++
Sbjct: 443 SPRPLAARLHIAPSNRKG 460
>gi|138996994|dbj|BAF52858.1| repressor of silencing 3 [Nicotiana tabacum]
Length = 1662
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 219/455 (48%), Positives = 277/455 (60%), Gaps = 80/455 (17%)
Query: 567 DWDLLR---RIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DWD LR + ER+ + MDS+DWEAVRCA+ +EIA IKERG N +A RIK FL
Sbjct: 1260 DWDSLRLEAQANGKKRERTANTMDSLDWEAVRCANVNEIAHTIKERGMNNKLAERIKNFL 1319
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
NR+V HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 1320 NRIVSEHGSIDLEWLRDVPPDKAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRI 1379
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQ+T
Sbjct: 1380 AVRLGWVPLQPLPESLQLHLLELYPILESIQKYLWPRLCKLDQRTF-------------- 1425
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSP-------P 796
AR ALP EK IV++ +P P
Sbjct: 1426 --------------------------ARLALPAQEEKSIVSATENKASNNNPREKFTHLP 1459
Query: 797 L--------------VVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPES 842
L ++N P+I + A P EPI+E P +P + ++PE
Sbjct: 1460 LPLPPGNQQPVEHQKLINSAPIIEVLATP-----------EPIVELPATPEQEQMQAPEI 1508
Query: 843 EIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVM--LQDSSALVALTAEAASVPT 900
+I+D + D EIPT+ + EF +N ++E N + ++ S+ALVALT+EAAS+P
Sbjct: 1509 DIEDTYYE--DPCEIPTIEVNMAEFTQNLKKYVENNMELHQVEMSNALVALTSEAASIPV 1566
Query: 901 RKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC 960
KLK + LRTEH VY LPD+H LL G D+R+PDDP YLLAIW+PGE+ NS P+ +C
Sbjct: 1567 PKLKNVSRLRTEHQVYELPDSHPLLEGLDKREPDDPCSYLLAIWTPGETANSFQPPETQC 1626
Query: 961 NSKDS-KLCNSEICYSCNSVCERNAGIVRGTILIP 994
NS++S KLC +E C SCNS+ E ++ VRGT+LIP
Sbjct: 1627 NSQESGKLCENETCSSCNSIREAHSQTVRGTLLIP 1661
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 74/135 (54%), Positives = 93/135 (68%), Gaps = 7/135 (5%)
Query: 115 NAVVPYQVGPSSEHGALVPHQIKEKRK-KLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTS 173
+A+V YQ G++VP RK K +P+V +D T R+W L++ + D T
Sbjct: 821 SALVLYQ-----RDGSIVPFGSSLVRKRKPRPKVDVDDETDRVWKLLLQDINSEGVDGT- 874
Query: 174 NEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLS 233
+ED KWW++ER VF R SF ARM L+ GDRRF PWKGSVVDSVVGV+LTQNVSD+LS
Sbjct: 875 DEDKAKWWEEERRVFNSRADSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVSDHLS 934
Query: 234 SSAYMSLAARFPLKS 248
SSA+MSLAA FPLK+
Sbjct: 935 SSAFMSLAAHFPLKT 949
>gi|115446223|ref|NP_001046891.1| Os02g0496500 [Oryza sativa Japonica Group]
gi|48716417|dbj|BAD23025.1| transcriptional activator-like [Oryza sativa Japonica Group]
gi|48716542|dbj|BAD23146.1| transcriptional activator-like [Oryza sativa Japonica Group]
gi|113536422|dbj|BAF08805.1| Os02g0496500 [Oryza sativa Japonica Group]
Length = 1207
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 235/552 (42%), Positives = 304/552 (55%), Gaps = 71/552 (12%)
Query: 568 WDLLRR-IYSTGEE-RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNR 625
WD LR YS G + + DS DWEAV A EIA++I RGQ +IA RI+ FL R
Sbjct: 677 WDKLREEAYSKGYKIKGTGISDSADWEAVLHAPAVEIANSIAVRGQHYVIALRIQAFLKR 736
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAV 685
+ + HG+ DL+WLR VP + K YL+ I GLG KSV+C+RLLSL+H FPVDVNV RI
Sbjct: 737 VKKDHGNFDLDWLRYVPRESAKNYLISILGLGDKSVDCIRLLSLKHKGFPVDVNVARIVT 796
Query: 686 RLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK 745
RLGWV LQPLP HL+ YP+M +Q YLWPRLC + ++ LYELH MITFGK C
Sbjct: 797 RLGWVKLQPLPFSAEFHLVGLYPIMRDVQKYLWPRLCTISKEKLYELHCLMITFGKAICT 856
Query: 746 KRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNP---- 801
K SPNC ACP +CK++ S++ AR +LP P+E G+ G+ P
Sbjct: 857 KVSPNCRACPFSAKCKYYNSSL--ARLSLP-PAEG------HGHEYGEEQASTATPGRLL 907
Query: 802 --------------IPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESP-ESEIDD 846
P I+I ES Y+ CEPIIE P SP + ESP E E+ +
Sbjct: 908 LSNDSHIAGFQQVCQPQIKINMPAGRESIYK---CEPIIEIPPSPEHEYEESPYEQELYE 964
Query: 847 DDFSIGDIEE-IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKR 905
DD DIE+ IP L+ + + + L + ++ ASV ++LK
Sbjct: 965 DDLC--DIEDTIPELQYDFEIDLCSLKHTVNNGSWTPNSGKDLALINSQHASVQNKRLKN 1022
Query: 906 CAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS 965
LRTEH+ YVLPD H +L F+ R P+DP PYLL + S
Sbjct: 1023 IGRLRTEHNAYVLPDDHAILEEFEDRVPEDPCPYLLVVIS-------------------- 1062
Query: 966 KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHP 1025
C+ E V+GTILIPCRTA RG FPLNGTYFQ +EVFADH +S P
Sbjct: 1063 --CSDE-------------HTVKGTILIPCRTATRGNFPLNGTYFQDHEVFADHSSSRSP 1107
Query: 1026 INVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVN 1085
I +PR I NL R IV FGSS ++ + I++ + +G++CVRGFDR R P+P+
Sbjct: 1108 ITIPRECIWNLDRCIVYFGSSIQSIMKGQTRQDIEDCYKKGYICVRGFDRNTRYPKPICA 1167
Query: 1086 RLHCSPSKKEAG 1097
+LH + + G
Sbjct: 1168 KLHATNERNGTG 1179
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 88/149 (59%), Gaps = 18/149 (12%)
Query: 112 NVHNAVVPYQVGPS-SEHGALVPHQ------IKEKR---KKLKPEVV-LDPGTLRMWNLI 160
N +A+VPYQ P+ + ALVP Q I R K L+ +V+ LD TL+++N++
Sbjct: 408 NGESALVPYQKAPAATSSSALVPIQNSTQLAIVHHRNHLKNLRTKVLGLDEKTLQVYNVL 467
Query: 161 MNIDDGTTKDQTSNEDME----KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVV 216
D+ D S E ++ W + R FE + F A +H +LG RRF W GSV
Sbjct: 468 RKWDE---TDSESFEGVDIGSGPEWNETRRHFEHYVDVFIATVHGLLGPRRFSEWGGSVT 524
Query: 217 DSVVGVYLTQNVSDNLSSSAYMSLAARFP 245
DSVVG +LTQNV+DNLSS+A+++L A+FP
Sbjct: 525 DSVVGTFLTQNVADNLSSNAFLNLVAKFP 553
>gi|218190787|gb|EEC73214.1| hypothetical protein OsI_07297 [Oryza sativa Indica Group]
Length = 1165
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 222/488 (45%), Positives = 285/488 (58%), Gaps = 87/488 (17%)
Query: 535 GKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR--IYSTGEERSHDKM-DSVDW 591
GK K E + +N R + DWD LR+ + + G + DK D++DW
Sbjct: 727 GKSKHTEKRPKARNVRGRT--------KMKHYDWDNLRKEVLRNHGNRQRSDKAKDTIDW 778
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA +FLNRLV HGSIDLEWLR++ PDK K++LL
Sbjct: 779 EA---------------------------DFLNRLVRDHGSIDLEWLRDIEPDKAKDFLL 811
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I GLGLKS ECVRLL+L +AFPVD NV RI VRLGWVPLQPLP L +HLL+ YP+++
Sbjct: 812 SIRGLGLKSTECVRLLTLHQMAFPVDTNVARICVRLGWVPLQPLPESLQLHLLELYPMLE 871
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASAR 771
IQ Y+WPRLC LDQ LYELHYQMITFGK+FC K PNC +CP+R ECKHFASA ASAR
Sbjct: 872 HIQKYIWPRLCKLDQLILYELHYQMITFGKVFCSKSKPNCNSCPMRAECKHFASAFASAR 931
Query: 772 FALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRS 831
ALPGPS+K E+ N S + + P+ + + N+
Sbjct: 932 LALPGPSKK-TSKPEYPNDAESSHKKYTH--------SRPMGQLSWNTNH---------- 972
Query: 832 PGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSAL 888
PG E+ E+EI +DF D +EIP + L +EF +N +++ N + ++D S+AL
Sbjct: 973 PG-HVYEAREAEI--EDFFSEDPDEIPIINLNVKEFAQNLKSYIHANNIEIEDADMSNAL 1029
Query: 889 VALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSP-- 946
VA++ +AASVPT KLK LRTEH VY LPD+H LL GFD+R+PDDPSPYLL+IW+P
Sbjct: 1030 VAISPQAASVPTSKLKNVNRLRTEHQVYELPDSHPLLEGFDQREPDDPSPYLLSIWTPGK 1089
Query: 947 ---------------------GESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCERNA 984
GE+ S +P+ CNSK++ K C S C+SCNS E +
Sbjct: 1090 LMCSHPTFTLIQVILMIKISTGETAQSTDAPKTFCNSKETGKFCESSTCFSCNSTREMQS 1149
Query: 985 GIVRGTIL 992
VRGT+L
Sbjct: 1150 QKVRGTLL 1157
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 82/162 (50%), Positives = 104/162 (64%), Gaps = 2/162 (1%)
Query: 129 GALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVF 188
GALVP + K K++ EV +DP T MWNL+M +D + +ED E+ ++ER +F
Sbjct: 298 GALVPLEGKVKKRGSWAEVKIDPVTNLMWNLLMALDKCEGVEGI-DEDKERLLEEERRMF 356
Query: 189 EGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
GRI SF ARMHL+ GDRRF PWKGSVVDSVV V+LTQNVSD+LSSSA+M+LAARFP+KS
Sbjct: 357 RGRIDSFIARMHLVQGDRRFSPWKGSVVDSVVDVFLTQNVSDHLSSSAFMALAARFPVKS 416
Query: 249 TNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTE 290
P+ T + S S G +GN +FV E
Sbjct: 417 EGPEKPAAVEKSTPTPPKQKDSCSGVLGESAKLQGN-FFVEE 457
>gi|297802586|ref|XP_002869177.1| hypothetical protein ARALYDRAFT_353424 [Arabidopsis lyrata subsp.
lyrata]
gi|297315013|gb|EFH45436.1| hypothetical protein ARALYDRAFT_353424 [Arabidopsis lyrata subsp.
lyrata]
Length = 1072
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 234/557 (42%), Positives = 310/557 (55%), Gaps = 69/557 (12%)
Query: 561 DNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIK 620
D TV DW LR++Y+ R MDSV+W VR + + I+ RGQ I++ RI
Sbjct: 553 DETV--DWKTLRKMYTKEGSRPKMHMDSVNWSDVRLSGQKVFETTIRRRGQFRILSERIL 610
Query: 621 EFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNV 680
+FLN V+ +G++DLEWLRN P D VK YLL+IEG+GLKS ECVRLL L+H AFPVD NV
Sbjct: 611 KFLNDEVQHNGTLDLEWLRNAPSDLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNV 670
Query: 681 GRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFG 740
GRIAVRLGWVPL+PLP + +H Q++ +LC+L + ++HY TF
Sbjct: 671 GRIAVRLGWVPLEPLPNGVQMH-----------QLF---QLCFLIN-LIKKIHY---TFT 712
Query: 741 ------------KIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFG 788
++FC K PNC ACP++ ECK+FASA S++ L GP EK F
Sbjct: 713 NNYLLSIKTNSFQVFCTKVIPNCNACPMKSECKYFASAYVSSKVLLEGPEEKMHEPDTFM 772
Query: 789 NGIGQSPPL-VVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDD 847
N Q + + + I +I P Q C+P++E P SP + ES ++I+D
Sbjct: 773 NAYSQDADVDMTSNINLIEECVSPGCSD--QAICCKPLVEFPSSPRAEIPES--TDIEDI 828
Query: 848 DF--SIGDIEEIPTLRLQEREFKENFPNFMEMNKVML-----QDSSALVALTAEAASVPT 900
F +P + K+N + + ML + S ALV LT E A +P
Sbjct: 829 PFINPYQLYARVPKIDFDMDALKKNVEDALVKGGRMLSSSDEEISKALVVLTPENACIPI 888
Query: 901 ---RKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQ 957
RK+K LRTEH VYVLPD HELL F+RR+ DDPSPYLLAIW PGE+ +S P+
Sbjct: 889 KPPRKMKYYDRLRTEHVVYVLPDNHELLHDFERRELDDPSPYLLAIWQPGETSSSFIPPK 948
Query: 958 YRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFA 1017
+CNS SKLC + C C ++ E+++ RGTIL VFA
Sbjct: 949 KKCNSDGSKLCKIKSCSYCWTIREQSSNTFRGTIL----------------------VFA 986
Query: 1018 DHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRY 1077
DHETS PI PR L + + GSS T++F LD R+IQ FW GF+C+R FDR+
Sbjct: 987 DHETSLKPIVFPRELCDGLEKRALYCGSSVTSIFRFLDTRRIQLCFWTGFLCMRAFDRKQ 1046
Query: 1078 RCPRPLVNRLHCSPSKK 1094
R P+ LV RLH P ++
Sbjct: 1047 RYPKELVRRLHTPPDER 1063
Score = 100 bits (248), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 72/183 (39%), Positives = 110/183 (60%), Gaps = 17/183 (9%)
Query: 69 QVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEH 128
Q+ ++ S+ AD+ +D ++ S LP Q K ++ + E
Sbjct: 311 QISYQKYHSLSSLADVPLHIEDTLMKSASSVLPEQPIKKT--KDIAKLI--------KEM 360
Query: 129 GAL-----VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQK 183
G L V IK +K + +V LDP T++ W L+M ++D ++ ++++ E W+K
Sbjct: 361 GRLKINKRVTTMIKSAKKLVTAKVSLDPETIKEWELLM-VNDLPSR-SYADKETEAKWKK 418
Query: 184 EREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAAR 243
ERE+F+ RI F RMHL+ G+R+FK WKGSVVDSVVGV+LTQNV+D LSS+A+MS+AA+
Sbjct: 419 EREIFQSRIDLFINRMHLLQGNRKFKQWKGSVVDSVVGVFLTQNVTDYLSSNAFMSVAAK 478
Query: 244 FPL 246
FP+
Sbjct: 479 FPV 481
>gi|357510919|ref|XP_003625748.1| DNA glycosylase [Medicago truncatula]
gi|355500763|gb|AES81966.1| DNA glycosylase [Medicago truncatula]
Length = 497
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 215/448 (47%), Positives = 270/448 (60%), Gaps = 81/448 (18%)
Query: 730 YELHYQMITFGKI--------------------------FCKKRSPNCGACPLRGECKHF 763
YELHYQ+ITFGK+ FC K PNC ACP+RGEC+HF
Sbjct: 16 YELHYQLITFGKVNYISHASETDHILVKSSLNLNQCMQVFCTKSKPNCNACPMRGECRHF 75
Query: 764 ASAVASARFALPGPSEK---GIVTSEFGNGIGQSPPLVVNPIPV---------------- 804
ASA ASAR ALPGP +K +V + N Q+P VN +P+
Sbjct: 76 ASAFASARLALPGPEQKEQKNLVITTGNNATDQNPSTTVNQLPLSLPGLEQKNIEITTGT 135
Query: 805 --------IRIEADPVS--------------------ESGYQINNCEPIIEEPRSPGPQC 836
+ I P+S E +INNC+PIIEEP +P P+C
Sbjct: 136 DATDQNSSVVINQLPLSLSENKNQTEELQQTVAIRQLEMNSEINNCQPIIEEPTTPEPEC 195
Query: 837 SESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTA 893
S E++I+D F + EIPT+ L EF N N+M+ N + LQ+ S ALVAL
Sbjct: 196 SHVSENDIED--FFYEESNEIPTINLDIEEFTLNLQNYMQEN-MELQEGEMSKALVALNQ 252
Query: 894 EAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSV 953
EAA +PT KLK + LRTEH VY LPD+H LL G+++R+PDDP YLLAIW+PGE+ NS+
Sbjct: 253 EAAYIPTTKLKNVSRLRTEHSVYELPDSHPLLEGWEKREPDDPGKYLLAIWTPGETANSI 312
Query: 954 GSPQYRCNSKD-SKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQV 1012
P RC+++D +LCN E C+SCNS E N+ IVRGTILIPCRTA RG FPLNGTYFQV
Sbjct: 313 QPPDRRCSAQDCGQLCNEEECFSCNSFREANSQIVRGTILIPCRTAMRGSFPLNGTYFQV 372
Query: 1013 NEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRG 1072
NEVFADHE+S +PI+VPR I NL R V FG+S T++F L +IQ+ FWRGFVCVRG
Sbjct: 373 NEVFADHESSLNPISVPRSLIWNLDRRTVHFGTSVTSIFKGLATPEIQQCFWRGFVCVRG 432
Query: 1073 FDRRYRCPRPLVNRLHCSPSKKEAGSKR 1100
F+R R PRPL+ RLH P+ + A +K+
Sbjct: 433 FERSTRAPRPLMARLH-FPASRLAKNKK 459
>gi|115446213|ref|NP_001046886.1| Os02g0494700 [Oryza sativa Japonica Group]
gi|113536417|dbj|BAF08800.1| Os02g0494700 [Oryza sativa Japonica Group]
Length = 1648
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 225/490 (45%), Positives = 292/490 (59%), Gaps = 79/490 (16%)
Query: 535 GKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR--IYSTGEERSHDKM-DSVDW 591
GK K E + +N R + DWD LR+ +++ G + DK D++DW
Sbjct: 1198 GKSKHTEKRPKARNVRGRT--------KMKHYDWDNLRKEVLHNHGNRQRSDKAKDTIDW 1249
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA +FLNRLV HGSIDLEWLR++ PDK K +LL
Sbjct: 1250 EA---------------------------DFLNRLVRDHGSIDLEWLRDIEPDKAKGFLL 1282
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I GLGLKS ECVRLL+L +AFPVD NV RI VRLGWVPLQPLP L +HLL+ YP+++
Sbjct: 1283 SIRGLGLKSTECVRLLTLHQMAFPVDTNVARICVRLGWVPLQPLPESLQLHLLELYPLLE 1342
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASAR 771
IQ Y+WPRLC LDQ LYELHYQMITFGK+FC K PNC +CP+R ECKHFASA ASAR
Sbjct: 1343 HIQKYIWPRLCKLDQLILYELHYQMITFGKVFCSKSKPNCNSCPMRAECKHFASAFASAR 1402
Query: 772 FALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRS 831
ALPGPS+K E+ N S + P+ ++ + + G+ + EP +
Sbjct: 1403 LALPGPSKK-TSKPEYPNDAESSHKKYTHSRPMGQLSWN-TNHPGH--------VYEPST 1452
Query: 832 PGPQ--CSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SS 886
P P+ +E+ E+EI +DF D +EIP + L EF +N +++ N + ++D S+
Sbjct: 1453 PEPEPDIAEAREAEI--EDFFSEDPDEIPIINLNVEEFAQNLKSYIHANNIEIEDADMSN 1510
Query: 887 ALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSP 946
ALVA++ +AASVPT KLK LRTEH VY LPD+H LL GFD+R+PDDPSPYLL+IW+P
Sbjct: 1511 ALVAISPQAASVPTSKLKNVNRLRTEHQVYELPDSHPLLEGFDQREPDDPSPYLLSIWTP 1570
Query: 947 -----------------------GESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCER 982
GE+ S +P+ CNSK++ KLC S C+SCNS E
Sbjct: 1571 GKLMCSHPTFTLIQVILMIKISTGETAQSTDAPKTFCNSKETGKLCESSTCFSCNSTREM 1630
Query: 983 NAGIVRGTIL 992
+ VRGT+L
Sbjct: 1631 QSQKVRGTLL 1640
Score = 142 bits (359), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 71/127 (55%), Positives = 92/127 (72%), Gaps = 1/127 (0%)
Query: 129 GALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVF 188
GALVP + K K++ + EV +DP T MWNL+M +D + +ED E+ ++ER +F
Sbjct: 769 GALVPLEGKVKKRGSRAEVKIDPVTNLMWNLLMALDKCEGVEGI-DEDKERLLEEERRMF 827
Query: 189 EGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
GRI SF A MHL+ GDRRF PWKGS+VDSVV V+LTQNVSD+LSSSA+M+LAARFP+KS
Sbjct: 828 RGRIDSFIAHMHLVQGDRRFSPWKGSIVDSVVDVFLTQNVSDHLSSSAFMALAARFPVKS 887
Query: 249 TNDHTPS 255
P+
Sbjct: 888 EGPEKPA 894
>gi|222622896|gb|EEE57028.1| hypothetical protein OsJ_06806 [Oryza sativa Japonica Group]
Length = 1615
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 223/488 (45%), Positives = 285/488 (58%), Gaps = 87/488 (17%)
Query: 535 GKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR--IYSTGEERSHDKM-DSVDW 591
GK K E + +N R + DWD LR+ +++ G + DK D++DW
Sbjct: 1177 GKSKHTEKRPKARNVRGRT--------KMKHYDWDNLRKEVLHNHGNRQRSDKAKDTIDW 1228
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA +FLNRLV HGSIDLEWLR++ PDK K +LL
Sbjct: 1229 EA---------------------------DFLNRLVRDHGSIDLEWLRDIEPDKAKGFLL 1261
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I GLGLKS ECVRLL+L +AFPVD NV RI VRLGWVPLQPLP L +HLL+ YP+++
Sbjct: 1262 SIRGLGLKSTECVRLLTLHQMAFPVDTNVARICVRLGWVPLQPLPESLQLHLLELYPLLE 1321
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASAR 771
IQ Y+WPRLC LDQ LYELHYQMITFGK+FC K PNC +CP+R ECKHFASA ASAR
Sbjct: 1322 HIQKYIWPRLCKLDQLILYELHYQMITFGKVFCSKSKPNCNSCPMRAECKHFASAFASAR 1381
Query: 772 FALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRS 831
ALPGPS+K E+ N S + + P+ + + N+
Sbjct: 1382 LALPGPSKK-TSKPEYPNDAESSHKKYTH--------SRPMGQLSWNTNH---------- 1422
Query: 832 PGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSAL 888
PG E+ E+EI +DF D +EIP + L EF +N +++ N + ++D S+AL
Sbjct: 1423 PG-HVYEAREAEI--EDFFSEDPDEIPIINLNVEEFAQNLKSYIHANNIEIEDADMSNAL 1479
Query: 889 VALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSP-- 946
VA++ +AASVPT KLK LRTEH VY LPD+H LL GFD+R+PDDPSPYLL+IW+P
Sbjct: 1480 VAISPQAASVPTSKLKNVNRLRTEHQVYELPDSHPLLEGFDQREPDDPSPYLLSIWTPGK 1539
Query: 947 ---------------------GESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCERNA 984
GE+ S +P+ CNSK++ KLC S C+SCNS E +
Sbjct: 1540 LMCSHPTFTLIQVILMIKISTGETAQSTDAPKTFCNSKETGKLCESSTCFSCNSTREMQS 1599
Query: 985 GIVRGTIL 992
VRGT+L
Sbjct: 1600 QKVRGTLL 1607
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 80/162 (49%), Positives = 104/162 (64%), Gaps = 2/162 (1%)
Query: 129 GALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVF 188
GALVP + K K++ + EV +DP T MWNL+M +D + +ED E+ ++ER +F
Sbjct: 748 GALVPLEGKVKKRGSRAEVKIDPVTNLMWNLLMALDKCEGVEGI-DEDKERLLEEERRMF 806
Query: 189 EGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
GRI SF A MHL+ GDRRF PWKGS+VDSVV V+LTQNVSD+LSSSA+M+LAARFP+KS
Sbjct: 807 RGRIDSFIAHMHLVQGDRRFSPWKGSIVDSVVDVFLTQNVSDHLSSSAFMALAARFPVKS 866
Query: 249 TNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTE 290
P+ T + S S G +GN +FV E
Sbjct: 867 EGPEKPAAVEKSTPTPPKQKDSCSGVLGESAKLQGN-FFVEE 907
>gi|242077266|ref|XP_002448569.1| hypothetical protein SORBIDRAFT_06g029335 [Sorghum bicolor]
gi|241939752|gb|EES12897.1| hypothetical protein SORBIDRAFT_06g029335 [Sorghum bicolor]
Length = 901
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 228/551 (41%), Positives = 298/551 (54%), Gaps = 65/551 (11%)
Query: 567 DWDLLRR-IYSTG--EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
DW LR Y G ++ + D VDWE+V A SE+A I RGQ NI++ RI+E L
Sbjct: 385 DWKELREEAYKKGYNDKSGTETSDVVDWESVLNAPFSEVAKCIAVRGQHNILSVRIRELL 444
Query: 624 NRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 682
+ + + GS DL+WLR + +K K+ LL I G G+KSV+C+ LLSL+H AFPVDVNV R
Sbjct: 445 DHVQKAQDGSFDLDWLRFISHEKAKKILLSIHGFGVKSVDCICLLSLRHRAFPVDVNVAR 504
Query: 683 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
I RLGWV LQPL G HL+ YP++D +Q YLWPRLC +D++ LYELH MITFGK+
Sbjct: 505 IVTRLGWVKLQPLNG-ADFHLINLYPLLDDVQRYLWPRLCTIDKEKLYELHCLMITFGKV 563
Query: 743 FCKKRSPNCGACPLRGECKHFASAVASARFALP-------GPSEKGIVTSEFGNGIGQSP 795
C K++PNC ACP G C ++ S +A LP G + ++ S G+
Sbjct: 564 VCTKQNPNCRACPFSGSCIYYNSLLARKPLPLPEKHEYEQGEQQAKMIAS--GSCTPSFQ 621
Query: 796 PLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESP-ESEIDDDDFSIGDI 854
+ IP+ P I+NCEPIIE P SP + + +P E E D+ + DI
Sbjct: 622 QMYQYQIPISSTTETP------PIHNCEPIIEMPPSPEYEHNRTPNEQENSYGDYYVCDI 675
Query: 855 EEIPTLRLQ-EREFK----ENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHL 909
E+ +Q + E + PN N L + + + V +KLK L
Sbjct: 676 EDFAPEGVQYDAEINICSSKRVPN---SNSWTPSRGKDLAVINPKGSFVRNKKLKNIGRL 732
Query: 910 RTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCN 969
RTEH+ YVLPD H +L F+ R P D PYLL + S C
Sbjct: 733 RTEHNAYVLPDDHRILEEFEDRVPGDTCPYLLVVLS----------------------CP 770
Query: 970 SEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVP 1029
+ V GT+LIPCRTANRG+FPLNGTYFQ NEVFAD+ +S +PI +P
Sbjct: 771 DDF-------------TVEGTVLIPCRTANRGKFPLNGTYFQENEVFADYSSSRNPITIP 817
Query: 1030 RHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHC 1089
R I L+R IV FGSS ++ IQE F +G+VCVR F RR R P L LH
Sbjct: 818 RECIGLLKRSIVYFGSSIHSITKGQTREDIQECFKKGYVCVRAFHRRSRVPLRLCATLHA 877
Query: 1090 SPS-KKEAGSK 1099
+ + KK AG K
Sbjct: 878 TNTIKKPAGDK 888
Score = 77.0 bits (188), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 90/178 (50%), Gaps = 34/178 (19%)
Query: 116 AVVPYQVGPSSEHGALVP--HQIK------EKRKKLKPEVV-LDPGTLRMWNLIMNIDDG 166
A++PYQ ++ AL P H + + KL+ +V+ L TLR++ ++
Sbjct: 125 ALLPYQRTVAASF-ALAPLWHSTELDIPRPQNHGKLQAKVLDLTDETLRVFAVL------ 177
Query: 167 TTKDQTSNEDMEKW-------WQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSV 219
T D++ +E E + W ER FE + F M ++G R++ W GS++DSV
Sbjct: 178 TEWDRSDSESFEGFDIGNGPEWDTERRTFENYVDIFIDEMFALIGPRKYSRWGGSLIDSV 237
Query: 220 VGVYLTQNVSDNLS-----------SSAYMSLAARFPLKSTNDHTPSDENLRTTASLE 266
VG +LTQN +DNLS S A+M+LAA+FP K D+N+ + +L+
Sbjct: 238 VGTFLTQNSADNLSSYIFHSLSMENSQAFMNLAAKFPPKKRKKLDGDDKNMNSNEALD 295
>gi|413936930|gb|AFW71481.1| hypothetical protein ZEAMMB73_696780 [Zea mays]
Length = 650
Score = 377 bits (969), Expect = e-101, Method: Compositional matrix adjust.
Identities = 221/531 (41%), Positives = 296/531 (55%), Gaps = 64/531 (12%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDK 645
D VDWE+V A SE+A I RGQ NI+A RI FL+ + + GS DL+WLR V +K
Sbjct: 148 DVVDWESVLNAPFSEVAKCIAIRGQHNILAVRILAFLDHVKKAQDGSFDLDWLRFVSREK 207
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
K++LL I G+G+KS +C+ LLSL+H AFPVDVNV RI RLGWV LQPL G HL+
Sbjct: 208 AKKFLLSIHGIGVKSADCICLLSLRHRAFPVDVNVARIVTRLGWVKLQPLNG-ADFHLID 266
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
YP++D +Q YLWPRLC +D++ LYELH MITFGK+ C+K++PNCGACP CK++ S
Sbjct: 267 LYPILDDVQRYLWPRLCTIDKEKLYELHCLMITFGKVVCRKKNPNCGACPFSASCKYYNS 326
Query: 766 AVASARFALPGPSEKG---IVTSEFGNGIGQSPPLVVNPIPV--IRIEADPVSESGYQIN 820
++A LP E+G +G+ S + I + E P I+
Sbjct: 327 SLAKKSLPLPEKHEQGEQQTSMVAYGSCTPSSQQMYRYQIAISSTTTETPP-------IH 379
Query: 821 NCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKV 880
+CEPI+E P SP + +E+P + D + DIE+ + E + NK
Sbjct: 380 SCEPIVEMPPSPEYEYNETPNEQEDFYEDYACDIEDFAPGVQYDSEIN------ICSNKH 433
Query: 881 MLQDSS-------ALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDP 933
L ++S L + + + ++KLK LRTEH+ YVLPD H +L F+ R P
Sbjct: 434 TLNNNSWTPNCGKDLAVINPKCSFGQSKKLKNTGRLRTEHNAYVLPDGHVILEEFEDRVP 493
Query: 934 DDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILI 993
D PYLL + S P+ D K V+GT+LI
Sbjct: 494 GDRCPYLLVVIS---CPD------------DYK--------------------VKGTVLI 518
Query: 994 PCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNA 1053
PCRTANRG+FPLNGTYFQ NEVFAD+ +S +PI VPR I L R IV FGSS ++
Sbjct: 519 PCRTANRGKFPLNGTYFQENEVFADYSSSRNPITVPRECIGMLERSIVYFGSSIHSITKG 578
Query: 1054 LDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK--KEAGSKRSR 1102
+ IQE G++CVR F R+ R P L + LH + +K +E KR+R
Sbjct: 579 QTRQDIQECLKEGYICVRSFHRKTRIPLRLCSTLHATNTKPAREKPMKRTR 629
>gi|293331251|ref|NP_001168988.1| uncharacterized protein LOC100382817 [Zea mays]
gi|223974285|gb|ACN31330.1| unknown [Zea mays]
Length = 650
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/531 (41%), Positives = 295/531 (55%), Gaps = 64/531 (12%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDK 645
D VDWE+V A SE+A I RGQ NI+A RI FL+ + + GS DL+WLR V +K
Sbjct: 148 DVVDWESVLNAPFSEVAKCIAIRGQHNILAVRILAFLDHVKKAQDGSFDLDWLRFVSREK 207
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
K++LL I G+G+KS +C+ LLSL+H AFPVDVNV RI RLGWV LQPL G HL+
Sbjct: 208 AKKFLLSIHGIGVKSADCICLLSLRHRAFPVDVNVARIVTRLGWVKLQPLNG-ADFHLID 266
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
YP++D +Q YLWPRLC +D++ LYE H MITFGK+ C+K++PNCGACP CK++ S
Sbjct: 267 LYPILDDVQRYLWPRLCTIDKEKLYEPHCLMITFGKVVCRKKNPNCGACPFSASCKYYNS 326
Query: 766 AVASARFALPGPSEKG---IVTSEFGNGIGQSPPLVVNPIPV--IRIEADPVSESGYQIN 820
++A LP E+G +G+ S + I + E P I+
Sbjct: 327 SLAKKSLPLPEKHEQGEQQTSMVAYGSCTPSSQQMYRYQIAISSTTTETPP-------IH 379
Query: 821 NCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKV 880
+CEPI+E P SP + +E+P + D + DIE+ + E + NK
Sbjct: 380 SCEPIVEMPPSPEYEYNETPNEQEDFYEDYACDIEDFAPGVQYDSEIN------ICSNKH 433
Query: 881 MLQDSS-------ALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDP 933
L ++S L + + + ++KLK LRTEH+ YVLPD H +L F+ R P
Sbjct: 434 TLNNNSWTPNCGKDLAVINPKCSFGQSKKLKNTGRLRTEHNAYVLPDGHVILEEFEDRVP 493
Query: 934 DDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILI 993
D PYLL + S P+ D K V+GT+LI
Sbjct: 494 GDRCPYLLVVIS---CPD------------DYK--------------------VKGTVLI 518
Query: 994 PCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNA 1053
PCRTANRG+FPLNGTYFQ NEVFAD+ +S +PI VPR I L R IV FGSS ++
Sbjct: 519 PCRTANRGKFPLNGTYFQENEVFADYSSSRNPITVPRECIGMLERSIVYFGSSIHSITKG 578
Query: 1054 LDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK--KEAGSKRSR 1102
+ IQE G++CVR F R+ R P L + LH + +K +E KR+R
Sbjct: 579 QTRQDIQECLKEGYICVRSFHRKTRIPLRLCSTLHATNTKPAREKPMKRTR 629
>gi|383100759|emb|CCG47990.1| conserved hypothetical protein, expressed [Triticum aestivum]
Length = 950
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 227/538 (42%), Positives = 302/538 (56%), Gaps = 43/538 (7%)
Query: 568 WDLLRR-IYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR+ Y G + D V+WEAV+ A ++A I RGQ ++A RI+ FL
Sbjct: 431 WDKLRKEAYGKGYDNGSKTRITDKVNWEAVQKASFVDVAKCIAGRGQHYLLALRIQAFLT 490
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+ + HGS DL+WLR +P + K+YLL I GLG KSV+C+RLLSL+H AFPVDVNV RI
Sbjct: 491 RIKKDHGSFDLDWLRCLPRESAKKYLLSINGLGAKSVDCIRLLSLEHKAFPVDVNVARIV 550
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RL WV LQ + H L YP+M +Q YLWPRLC +D++ LYELH MITFGK+ C
Sbjct: 551 TRLQWVELQCCSEEFHSVDL--YPLMQDVQSYLWPRLCTIDKEKLYELHCLMITFGKVIC 608
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 804
K PNC ACP RG+C+++ S + R LP P+E+ + E I S L+ N +
Sbjct: 609 TKVDPNCNACPFRGDCRYYKSKL--TRPLLP-PAEEHVRGGEEKTSIVTSERLLSNGSCM 665
Query: 805 IRIEADPVSESGYQIN-NCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEI-PTLRL 862
+ + +G Q + +CEPIIE P SP + E E ++D + DIE+I P +
Sbjct: 666 PSHQIEESRTAGRQPSRSCEPIIEVPPSPEYEYEALDEQEYPNED-DLVDIEDIMPGVHY 724
Query: 863 Q-EREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDA 921
E N P M N + +AL+ + +RK+K LRTEH YVLPD
Sbjct: 725 DVEINLCSNKP--MVSNCSWTPNHGKDLALSNSQHT--SRKMKHIGRLRTEHLAYVLPDD 780
Query: 922 HELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCE 981
H LL F+ R P+DPSPYLL + ++P E C
Sbjct: 781 HPLLEEFEERVPEDPSPYLLVLHPCPDNPPPGA---------------VENC-------- 817
Query: 982 RNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIV 1041
+V+GTILIPCRTA+RG FPLNGTYFQ +EVFAD +S PI+ + +L + IV
Sbjct: 818 ----MVKGTILIPCRTASRGNFPLNGTYFQDHEVFADDSSSRLPIDFSSECLNDLGKCIV 873
Query: 1042 CFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSK 1099
FGSS ++ + I++ + +G+VC+RGFDRR R PR L LH KKE SK
Sbjct: 874 YFGSSIHSITKGQTRQDIEDCYKKGYVCIRGFDRRTRSPRRLCAALHSINEKKEDNSK 931
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 81/144 (56%), Gaps = 18/144 (12%)
Query: 116 AVVPYQVGPSSE--------HGALVPHQIKEKRKKLKPEVV-LDPGTLRMWNLIMNIDDG 166
AVVPYQ ++ G L PH +++P+V+ LD TLR+ ++ ++
Sbjct: 211 AVVPYQNAAPTDASCSALVPFGGLGPHG--NHSNQVRPKVLGLDVETLRVCGVLTKWEE- 267
Query: 167 TTKDQTSNEDME----KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGV 222
D S E ++ W + R ++ + SF A + + G R + W GSV+DSVVG
Sbjct: 268 --IDSESFEGVDIGSGPGWAETRRDYKKLVDSFIATVKDLFGLREYSRWGGSVIDSVVGT 325
Query: 223 YLTQNVSDNLSSSAYMSLAARFPL 246
+LTQNV+D+LSS+A+M LAA+FPL
Sbjct: 326 FLTQNVADHLSSNAFMILAAKFPL 349
>gi|357142346|ref|XP_003572541.1| PREDICTED: protein ROS1-like [Brachypodium distachyon]
Length = 992
Score = 363 bits (931), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 223/556 (40%), Positives = 300/556 (53%), Gaps = 73/556 (13%)
Query: 568 WDLLR--RIYSTGEERSHDKM-DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR + RS ++ D VDWEAV A E+A I RGQ ++A RI+ FL
Sbjct: 476 WDRLRGEACRKGYDNRSETRITDKVDWEAVLHAPLIEVAKCIAGRGQHYLLALRIQAFLA 535
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+ + HGS DL+WL+ VP + K YLL + GLG KSV+C+RLLSL+ AFPVDVNV RI
Sbjct: 536 RIKKDHGSFDLDWLKYVPRESAKNYLLSVNGLGAKSVDCIRLLSLKQKAFPVDVNVSRIV 595
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RL WV L+ P + HL+ YP+M IQ YLWPRLC + ++ LYELH MITFGK+ C
Sbjct: 596 TRLEWVELECSPEEF--HLVDLYPLMKDIQTYLWPRLCTIGKEKLYELHCLMITFGKVIC 653
Query: 745 KKRSPNCGACPLRGECKHFASAVAS-----ARFALPGPSEK--GIVTSE---FGNGIGQS 794
K +PNC ACP R C+++ S +A A ++ GP E+ +VTSE NG
Sbjct: 654 TKAAPNCKACPFRARCRYYKSNLARSLLPPAEESVHGPGEEQTSMVTSERLLLPNGSCTP 713
Query: 795 PPLVVNPIPVIRIEADPVSESGYQ----INNCEPIIEEPRSPGPQCSESPESEIDDDDFS 850
LV + + ES NCEPIIE P P P+C E E D+
Sbjct: 714 GHLVCQ---------NQIKESKTAGRVPTRNCEPIIEVP--PSPEC----EHEALDEQEQ 758
Query: 851 IGDIEEIPTLRLQEREFKENFPNFMEMNKVML---QDSSALVALTAEAASVPTRKLKRCA 907
DIE++ + ++ + K N ++ M + LV + S + KLK
Sbjct: 759 CLDIEDMMS-DGEQYDAKINLCSYKPMVSIGCWTPNRGKDLVLSNSHHTSYQSPKLKNPG 817
Query: 908 HLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKL 967
LRTEHH YVLPD H +L F++R P+DP PYLL + + P
Sbjct: 818 RLRTEHHAYVLPDDHVILEEFEKRVPEDPCPYLLVV---------IPCP----------- 857
Query: 968 CNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPIN 1027
+ +V+GT+LIPCRTA+RG FPLNGTYFQ +EVFAD+ +S PI
Sbjct: 858 ---------------DDEVVKGTMLIPCRTASRGNFPLNGTYFQDHEVFADYTSSRFPIT 902
Query: 1028 VPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRL 1087
+ R I L R IV FGSS ++ + I++ F +G+VC+R FDR+ R P+ L L
Sbjct: 903 IHRELIWELERCIVYFGSSIHSITKGQTRQDIEDCFKKGYVCIRAFDRQTRYPKRLCATL 962
Query: 1088 HCSPSKKEAGSKRSRD 1103
H + K+E ++ D
Sbjct: 963 HANTGKQEGSEQKEGD 978
Score = 86.7 bits (213), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 84/147 (57%), Gaps = 20/147 (13%)
Query: 116 AVVPYQVGP------------SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNI 163
A+VPYQ P +S A+VP+ + K+ + K + + P T R+++++M
Sbjct: 219 AMVPYQNAPIDSPCSALVPFGNSAQLAMVPYLNRGKKVRAK-LLGITPETKRVYDVLMKW 277
Query: 164 DDGTTKDQTSNEDME----KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSV 219
D+ D S E ++ W K R ++ + SF A + + G R F W GSV+DSV
Sbjct: 278 DE---IDGESFEGLDIGSGPEWDKIRLEYKKHVDSFIAIVKDLFGPREFSQWGGSVIDSV 334
Query: 220 VGVYLTQNVSDNLSSSAYMSLAARFPL 246
VG +LTQNV+D+LSS+A+M LAA+FP+
Sbjct: 335 VGTFLTQNVADHLSSNAFMVLAAKFPM 361
>gi|300681535|emb|CBH32632.1| conserved hypothetical protein, expressed [Triticum aestivum]
Length = 950
Score = 350 bits (898), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 224/537 (41%), Positives = 300/537 (55%), Gaps = 39/537 (7%)
Query: 568 WDLLR-RIYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR Y G + D V+WEAV+ A ++A I RGQ ++A RI+ FL
Sbjct: 431 WDKLRDEAYRKGYDNGSKTRITDKVNWEAVQQASFVDVAKCIAGRGQHYLLALRIQAFLT 490
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+ + HGS DL+WLR +P + K+YLL I GLG KSV+C+RLLSL+H AFPVDVNV RI
Sbjct: 491 RIKKDHGSFDLDWLRCLPRESAKKYLLSINGLGAKSVDCIRLLSLEHKAFPVDVNVARIV 550
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RL WV LQ + H L YP+M +Q YLWPRLC +D++ LYELH MITFGK+ C
Sbjct: 551 TRLQWVELQCCSEEFHSVDL--YPLMQDVQNYLWPRLCTIDKEKLYELHCLMITFGKVIC 608
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 804
K PNC ACP RG+C+++ S + R LP P+E+ + E I S L+ N +
Sbjct: 609 TKVDPNCNACPFRGDCRYYKSKL--TRPLLP-PAEEHVRGGEEKTSIVTSERLLSNGSSI 665
Query: 805 IRIEADPVSESGYQIN-NCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
+ + +G Q + +CEPIIE P SP + E E ++D + DIE+I +
Sbjct: 666 PSHQIEESRTAGRQPSRSCEPIIEMPPSPEYEYEALDEQEYPNED-DLVDIEDIMSGVHY 724
Query: 864 EREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHE 923
+ E N + +AL+ + +RK+K LRTEH YVLPD H
Sbjct: 725 DVEINLCSNKPTVSNCSWTPNRGKDLALSNSQHT--SRKMKHIGRLRTEHLAYVLPDGHP 782
Query: 924 LLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERN 983
LL F+ R P+DPSPYLL + ++P G+ Q +C
Sbjct: 783 LLEEFEERVPEDPSPYLLVLHPCPDNPPP-GAVQ-----------------NC------- 817
Query: 984 AGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCF 1043
+V+GTILIPCRTA+ G FPLNGTYFQ +EVFAD +S PI + +L + IV F
Sbjct: 818 --MVKGTILIPCRTASTGNFPLNGTYFQDHEVFADDSSSRLPIVFSSECLNDLGKCIVYF 875
Query: 1044 GSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKR 1100
GSS ++ + I++ + +G+VC+RGFDRR R PR L LH KKE GSK
Sbjct: 876 GSSIHSITKGQTRQDIEDCYKKGYVCIRGFDRRTRSPRRLRAALHSINEKKEDGSKH 932
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 80/144 (55%), Gaps = 18/144 (12%)
Query: 116 AVVPYQVGPSSE--------HGALVPHQIKEKRKKLKPEVV-LDPGTLRMWNLIMNIDDG 166
A+VPYQ ++ G L H +++P+V+ LD TLR+ ++ ++
Sbjct: 211 AIVPYQNAAPTDASCSTLVPFGGLGGH--GNHSNQVRPKVLGLDVETLRVCGVLTKWEE- 267
Query: 167 TTKDQTSNEDME----KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGV 222
D S E ++ W + R ++ R+ F A + + G R + W GSV+DSVVG
Sbjct: 268 --IDSESFEGVDIGSGPGWNETRRDYKKRVDWFIATVKDLFGPREYSRWGGSVIDSVVGT 325
Query: 223 YLTQNVSDNLSSSAYMSLAARFPL 246
+LTQNV+D+LSS+A+M LAA+FP+
Sbjct: 326 FLTQNVADHLSSNAFMILAAKFPM 349
>gi|52353713|gb|AAU44279.1| unknow protein [Oryza sativa Japonica Group]
Length = 375
Score = 344 bits (883), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 185/379 (48%), Positives = 238/379 (62%), Gaps = 17/379 (4%)
Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSE----FGNGI 791
MITFGK+FC K PNC ACP+R EC+HFASA ASAR ALP P +K +V F NG
Sbjct: 1 MITFGKVFCTKSKPNCNACPMRSECRHFASAFASARLALPSPQDKRLVNLSNQFAFHNGT 60
Query: 792 GQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSI 851
+P N P+ ++E + NN PIIEEP SP + D +DF
Sbjct: 61 MPTP----NSTPLPQLEGS-IHARDVHANNTNPIIEEPASPREEECRELLEN-DIEDFD- 113
Query: 852 GDIEEIPTLRLQEREFKENFPNFM-EMNKVMLQD--SSALVALTAEAASVPTRKLKRCAH 908
D +EIP ++L F +N N + E NK D + ALVA++ EAAS+P KLK
Sbjct: 114 EDTDEIPIIKLNMEAFSQNLENCIKESNKDFQSDDITKALVAISNEAASIPVPKLKNVHR 173
Query: 909 LRTEHHVYVLPDAHELLR--GFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSK-DS 965
LRTEH+VY LPD+H L++ D+R+PDDPSPYLLAIW+P E ++ +P+ CN + +
Sbjct: 174 LRTEHYVYELPDSHPLMQQLALDQREPDDPSPYLLAIWTPDELKDTREAPKPCCNPQTEG 233
Query: 966 KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHP 1025
LC++E+C++C S E VRGT+L+PCRTA RG FPLNGTYFQVNEVFADH +SH+P
Sbjct: 234 GLCSNEMCHNCVSERENQYRYVRGTVLVPCRTAMRGSFPLNGTYFQVNEVFADHSSSHNP 293
Query: 1026 INVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVN 1085
IN+PR + NL R +V FG+S T+F L +IQ FWRGFVCVRGF+ R PRPL
Sbjct: 294 INIPREQLWNLHRRMVYFGTSVPTIFKGLTTEEIQHCFWRGFVCVRGFNMETRAPRPLCP 353
Query: 1086 RLHCSPSKKEAGSKRSRDE 1104
H + SK SK++ E
Sbjct: 354 HFHLAASKLRRSSKKAATE 372
>gi|168003630|ref|XP_001754515.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694136|gb|EDQ80485.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 413
Score = 337 bits (865), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 196/418 (46%), Positives = 246/418 (58%), Gaps = 26/418 (6%)
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
LGWVPL+PLP +L +HLL+ YPV IQ YLWPRLC LD +TLYELHYQMI+FGK+FC K
Sbjct: 1 LGWVPLEPLPEELQLHLLELYPVQATIQKYLWPRLCKLDHQTLYELHYQMISFGKVFCTK 60
Query: 747 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 806
PNC ACP+R ECKHFASAV+SA+ ALP P P L + P +
Sbjct: 61 SKPNCNACPMRPECKHFASAVSSAKLALPAPERP-----------HDRPTLALPPGTISS 109
Query: 807 IEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQERE 866
D + C+P +EEP +P + + +I+D FS+ ++EE + L +
Sbjct: 110 SGEDYLRPPRTVSQYCQPFVEEPMTPESEANVEDLDDIEDYPFSVEEVEEDAVMNLYQES 169
Query: 867 FKENFPNFM--EMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHEL 924
M + N S +V + EAAS+P KLK LRT H+VY LPD H L
Sbjct: 170 PSTVTEQSMSGDANPNGSSGSQEMVLVPLEAASIPVPKLKNVGRLRTVHYVYELPDNHPL 229
Query: 925 LRGFDRRDPDDPSPYLLAIWSP--------GESPNSVGS-PQYRCNSKDSKLCNSEICYS 975
L G D R+PDDP YLLAIWSP SVG P+ N D+ N E ++
Sbjct: 230 LDGLDSREPDDPCTYLLAIWSPVTRLVLIVAYLELSVGKVPESIPNINDNA--NEENPFA 287
Query: 976 CNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIAN 1035
+S E + I + T+L+PCRTA +G FPLNGTYFQVNEVFADH +S PI VPR + N
Sbjct: 288 -SSGSEGDESI-KATLLVPCRTAMQGSFPLNGTYFQVNEVFADHASSLQPILVPRTLLWN 345
Query: 1036 LRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
LRR V FG+S T++F + +IQ FWRG+VCVRGFDR R P+PLV RLH K
Sbjct: 346 LRRRFVFFGTSVTSIFRGMTAEEIQACFWRGYVCVRGFDRTTRAPKPLVGRLHLQAGK 403
>gi|413945496|gb|AFW78145.1| hypothetical protein ZEAMMB73_419140 [Zea mays]
Length = 565
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 136/208 (65%), Positives = 163/208 (78%), Gaps = 6/208 (2%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRR-IYSTGE--ERSHDKMDSVDWEAVRCADESEIADAI 606
+ +TT KN++N DW+ LRR S G+ +RS ++ DS+DWEAVRCAD I+ AI
Sbjct: 352 KKTRTTTAKNTENF---DWEKLRRQACSEGQMKQRSFERRDSIDWEAVRCADVRRISHAI 408
Query: 607 KERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
+ERG N++A RI+ FLNRLV HGSIDLEWLR +PPD K+YLL I GLGLKSVECVRL
Sbjct: 409 RERGMNNVLAERIQNFLNRLVRDHGSIDLEWLRYIPPDSAKDYLLSIRGLGLKSVECVRL 468
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
L+L H+AFPVD NVGRI VRLGWVP+QPLP L +HLL+ YP+++ IQ YLWPRLC LDQ
Sbjct: 469 LTLHHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPILETIQKYLWPRLCKLDQ 528
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGAC 754
+TLYELHYQMITFGK+FC KR PNC AC
Sbjct: 529 QTLYELHYQMITFGKVFCTKRQPNCNAC 556
>gi|357445859|ref|XP_003593207.1| Repressor of silencing [Medicago truncatula]
gi|355482255|gb|AES63458.1| Repressor of silencing [Medicago truncatula]
Length = 731
Score = 294 bits (752), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 130/204 (63%), Positives = 165/204 (80%), Gaps = 1/204 (0%)
Query: 565 QQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
+Q W+ LR+I+S +R D +DS+DWEAVR A E+A+AIK RGQ NIIA +I+ LN
Sbjct: 485 RQKWEALRKIHSRS-DRHIDHVDSIDWEAVRNAKVGEVAEAIKMRGQHNIIAKKIQLALN 543
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
+ +E HG+ DLEWL+ +PP++ KEYLL+I GLGLKSVEC+RLL+LQHI+FPVDVNVGRI
Sbjct: 544 KFLEHHGTTDLEWLKYIPPNEAKEYLLNIFGLGLKSVECLRLLTLQHISFPVDVNVGRIV 603
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
VRLGWVPLQPLP + IH L+++P KIQ YLWPRLC LD TLYELHYQ+ITFGK+FC
Sbjct: 604 VRLGWVPLQPLPESIQIHNLEQFPDPIKIQQYLWPRLCKLDHHTLYELHYQLITFGKVFC 663
Query: 745 KKRSPNCGACPLRGECKHFASAVA 768
KR+PNC ACP++ CK++AS++A
Sbjct: 664 TKRNPNCNACPMKDNCKYYASSLA 687
Score = 117 bits (294), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 67/166 (40%), Positives = 102/166 (61%), Gaps = 14/166 (8%)
Query: 129 GALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVF 188
G LV ++ R LK V D TLR+WNL++ + KD E ++W++ R+ +
Sbjct: 294 GELVLYE----RPLLKVNVKFDEETLRVWNLLVAENKHDEKD----EHKRRYWEEIRKAY 345
Query: 189 EGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
++SF R+H + GDRRF PWKGSV+DS+ GV+LTQNVSD+LSSSA+MSLAARFP+KS
Sbjct: 346 HEAVESFLTRVHNVQGDRRFLPWKGSVLDSIGGVFLTQNVSDHLSSSAFMSLAARFPVKS 405
Query: 249 -----TNDHTPSD-ENLRTTASLEPIGSNSTSNGAVYDSEGNMYFV 288
+N+ SD ++ + +E +N +S +++ + Y +
Sbjct: 406 VSCEQSNNMIFSDPKSDKKVEEMEAQKANESSKVVNKETQNSSYLI 451
>gi|224068973|ref|XP_002326244.1| DNA glycosylase [Populus trichocarpa]
gi|222833437|gb|EEE71914.1| DNA glycosylase [Populus trichocarpa]
Length = 257
Score = 282 bits (721), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 142/251 (56%), Positives = 179/251 (71%), Gaps = 5/251 (1%)
Query: 847 DDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKL 903
+D D +EIP ++L EF +N N+M+ N + LQ+ S ALVALTAEAAS+P KL
Sbjct: 2 EDTFCEDPDEIPIIKLNIEEFTQNLQNYMQEN-MELQEADMSKALVALTAEAASIPVPKL 60
Query: 904 KRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSK 963
K + LRTEH VY LPD+H LL+ DRR+PDDP YLLAIW+PGE+ NS+ + C+
Sbjct: 61 KNVSRLRTEHQVYELPDSHPLLQRLDRREPDDPCSYLLAIWTPGETANSIQPLERSCSLH 120
Query: 964 D-SKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETS 1022
+ KLC+ + C+SCN++ E N+ IVRGT+LIPCRTA RG FPLNGTYFQVNEVFADH++S
Sbjct: 121 ECGKLCDEKTCFSCNNIREENSQIVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSS 180
Query: 1023 HHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRP 1082
+PI+VPR I NL R V FG+S T+F L IQ FWRG+VCVRGFD++ R PRP
Sbjct: 181 LNPIDVPRAWIWNLPRRTVYFGTSIPTIFKGLTTAGIQHCFWRGYVCVRGFDQKTRAPRP 240
Query: 1083 LVNRLHCSPSK 1093
L+ RLH SK
Sbjct: 241 LMARLHFPASK 251
>gi|300681534|emb|CBH32631.1| conserved hypothetical protein [Triticum aestivum]
Length = 840
Score = 270 bits (690), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 166/382 (43%), Positives = 219/382 (57%), Gaps = 16/382 (4%)
Query: 568 WDLLR-RIYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR Y G + D V+WEAV+ A ++A I RGQ ++A RI+ FL
Sbjct: 431 WDKLRDEAYRKGYDNGSKTRITDKVNWEAVQQASFVDVAKCIAGRGQHYLLALRIQAFLT 490
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+ + HGS DL+WLR +P + K+YLL I GLG KSV+C+RLLSL+H AFPVDVNV RI
Sbjct: 491 RIKKDHGSFDLDWLRCLPRESAKKYLLSINGLGAKSVDCIRLLSLEHKAFPVDVNVARIV 550
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RL WV LQ + H L YP+M +Q YLWPRLC +D++ LYELH MITFGK+ C
Sbjct: 551 TRLQWVELQCCSEEFHSVDL--YPLMQDVQNYLWPRLCTIDKEKLYELHCLMITFGKVIC 608
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN--PI 802
K PNC ACP RG+C+++ S + R LP P+E+ + E I S L+ N I
Sbjct: 609 TKVDPNCNACPFRGDCRYYKSKL--TRPLLP-PAEEHVRGGEEKTSIVTSERLLSNGSSI 665
Query: 803 PVIRIEADPVSESGYQIN-NCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLR 861
P +IE +G Q + +CEPIIE P SP + E E ++D + DIE+I +
Sbjct: 666 PSHQIEES--RTAGRQPSRSCEPIIEMPPSPEYEYEALDEQEYPNED-DLVDIEDIMSGV 722
Query: 862 LQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDA 921
+ E N + +AL+ + +RK+K LRTEH YVLPD
Sbjct: 723 HYDVEINLCSNKPTVSNCSWTPNRGKDLALSNSQHT--SRKMKHIGRLRTEHLAYVLPDG 780
Query: 922 HELLRGFDRRDPDDPSPYLLAI 943
H LL F+ R P+DPSPYLL +
Sbjct: 781 HPLLEEFEERVPEDPSPYLLVL 802
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 80/144 (55%), Gaps = 18/144 (12%)
Query: 116 AVVPYQVGPSSE--------HGALVPHQIKEKRKKLKPEVV-LDPGTLRMWNLIMNIDDG 166
A+VPYQ ++ G L H +++P+V+ LD TLR+ ++ ++
Sbjct: 211 AIVPYQNAAPTDASCSTLVPFGGLGGH--GNHSNQVRPKVLGLDVETLRVCGVLTKWEE- 267
Query: 167 TTKDQTSNEDME----KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGV 222
D S E ++ W + R ++ R+ F A + + G R + W GSV+DSVVG
Sbjct: 268 --IDSESFEGVDIGSGPGWNETRRDYKKRVDWFIATVKDLFGPREYSRWGGSVIDSVVGT 325
Query: 223 YLTQNVSDNLSSSAYMSLAARFPL 246
+LTQNV+D+LSS+A+M LAA+FP+
Sbjct: 326 FLTQNVADHLSSNAFMILAAKFPM 349
>gi|7406462|emb|CAB85564.1| putative protein [Arabidopsis thaliana]
Length = 234
Score = 264 bits (675), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 127/210 (60%), Positives = 152/210 (72%), Gaps = 1/210 (0%)
Query: 885 SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIW 944
S ALVAL S+PT KLK + LRTEH VY LPD+H LL G D+R+PDDPSPYLLAIW
Sbjct: 19 SKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPDSHRLLDGMDKREPDDPSPYLLAIW 78
Query: 945 SPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRF 1003
+PGE+ NS P+ +C K S K+C E C CNS+ E N+ VRGT+LIPCRTA RG F
Sbjct: 79 TPGETANSAQPPEQKCGGKASGKMCFDETCSECNSLREANSQTVRGTLLIPCRTAMRGSF 138
Query: 1004 PLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYF 1063
PLNGTYFQVNE+FADHE+S PI+VPR I +L R V FG+S T++F L QIQ F
Sbjct: 139 PLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRRTVYFGTSVTSIFRGLSTEQIQFCF 198
Query: 1064 WRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
W+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 199 WKGFVCVRGFEQKTRAPRPLMARLHFPASK 228
>gi|4581149|gb|AAD24633.1| hypothetical protein [Arabidopsis thaliana]
Length = 1207
Score = 256 bits (654), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 144/298 (48%), Positives = 180/298 (60%), Gaps = 21/298 (7%)
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSE 786
K YELHYQMITFGK+FC K PNC ACP++GEC+HFASA ASAR ALP ++E
Sbjct: 921 KAKYELHYQMITFGKVFCTKSKPNCNACPMKGECRHFASAFASARLALP--------STE 972
Query: 787 FGNGIGQSPPLVVN-PIPVIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPES 842
G G PL ++ P P R + V SE ++ CEPIIEEP SP P+ +E +
Sbjct: 973 KGMGTPDKNPLPLHLPEPFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSIA 1032
Query: 843 EIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVP 899
+I++ F D EEIPT+RL F N ME NK LQD SSALVALTAE AS+P
Sbjct: 1033 DIEEAFFE--DPEEIPTIRLNMDAFTSNLKKIMEHNK-ELQDGNMSSALVALTAETASLP 1089
Query: 900 TRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYR 959
KLK + LRTEH VY LPD H LL ++R+PDDP YLLAIW+PGE+ +S+
Sbjct: 1090 MPKLKNISQLRTEHRVYELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVST 1149
Query: 960 C-NSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVF 1016
C + LC+ E C+SCNS+ E + IVRGTIL+ R R +F ++ F
Sbjct: 1150 CIFQANGMLCDEETCFSCNSIKETRSQIVRGTILV--YQLRRYRLAFGKVHFVLSSFF 1205
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 72/136 (52%), Positives = 93/136 (68%), Gaps = 7/136 (5%)
Query: 116 AVVPYQVGP-----SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKD 170
A+VPY + GA+VP K+ + +P+V LD T R+W L++ + D
Sbjct: 466 ALVPYTMNSQIVLFGGGAGAIVP-VTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVD 524
Query: 171 QTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSD 230
S+E KWW++ER VF GR SF ARMHL+ GDRRF PWKGSVVDSVVGV+LTQNVSD
Sbjct: 525 -GSDEQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSD 583
Query: 231 NLSSSAYMSLAARFPL 246
+LSSSA+MSLA++FP+
Sbjct: 584 HLSSSAFMSLASQFPV 599
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 66/112 (58%), Gaps = 14/112 (12%)
Query: 567 DWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKE-- 621
DWD LRR + E++ MD+VDW+A+R AD E+A+ IK RG + +A RI+
Sbjct: 830 DWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQYLT 889
Query: 622 --------FLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
FL+RLV HGSIDLEWLR+VPPDK K Y L + + V C +
Sbjct: 890 LNMKIMQGFLDRLVNDHGSIDLEWLRDVPPDKAK-YELHYQMITFGKVFCTK 940
>gi|31296708|gb|AAP46637.1| PG3 [Hordeum vulgare]
Length = 689
Score = 256 bits (653), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 164/389 (42%), Positives = 220/389 (56%), Gaps = 23/389 (5%)
Query: 568 WDLLRR-IYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR Y G + D V+WEAV+ A ++A I RGQ ++A RI+ FL
Sbjct: 251 WDKLREEAYRKGYDNGSKTRITDKVNWEAVQQASFVDVAKCIAGRGQHYVLALRIQAFLT 310
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+ + HGS DL+WLR +P + K+YL+ I GLG KSV+C+RLLSL+H AFPVDVNV RI
Sbjct: 311 RIKKDHGSFDLDWLRYLPRESAKKYLISINGLGAKSVDCIRLLSLEHKAFPVDVNVPRIV 370
Query: 685 VRLGWVPLQPLPGDLH-IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
RL WV LQ + H + L + P KI LWPRLC +D++ LYELH MITFGK+
Sbjct: 371 TRLQWVELQCCSEEFHSVDLYQLMPGCGKIT--LWPRLCTIDKEKLYELHCLMITFGKVI 428
Query: 744 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNP-- 801
C K PNC ACP RG+C+++ S + R LP P+E+ + E + S L+ N
Sbjct: 429 CTKVDPNCNACPFRGDCRYYKSKL--TRPLLP-PAEEHVRGGEEKKSMVTSERLLSNGSC 485
Query: 802 IPVIRIEADPVSES---GYQIN-NCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEI 857
+P +++ + ES G Q + +CEPIIE P SP + E E ++D + DIE+I
Sbjct: 486 MPSLQVWQHQIEESRAAGRQPSRSCEPIIEVPPSPEYEYEALDEQEYPNED-DLVDIEDI 544
Query: 858 PTLRLQ---EREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHH 914
+ R+ E N P M N + AL+ + +R LK LRTEH
Sbjct: 545 MSKRVHYDVEINLCSNKP--MVSNCSWTPNHGKDWALSNSQHT--SRNLKHIGRLRTEHL 600
Query: 915 VYVLPDAHELLRGFDRRDPDDPSPYLLAI 943
YVLPD H LL + R P+DPSPYLL +
Sbjct: 601 AYVLPDGHALLEEVEERVPEDPSPYLLVL 629
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 79/144 (54%), Gaps = 18/144 (12%)
Query: 116 AVVPYQVGPSSEH--------GALVPHQIKEKRKKLKPEVV-LDPGTLRMWNLIMNIDDG 166
A+VPYQ ++ G L PH + +++P+V+ LD TLR++ + ++
Sbjct: 29 AIVPYQNAAQADASCSALVSLGDLGPH--GNRSNQVRPKVLGLDDDTLRVYGALTKWEE- 85
Query: 167 TTKDQTSNEDMEKW----WQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGV 222
D S E ++ W R +E + F A + + G R + W GSV+DSVVG
Sbjct: 86 --IDSESFEGVDIGSGPKWDDIRRKYENLVDWFIATVKDLFGPREYSRWGGSVIDSVVGT 143
Query: 223 YLTQNVSDNLSSSAYMSLAARFPL 246
+LTQNV+D+LS +A+M LAA+FP+
Sbjct: 144 FLTQNVADHLSRNAFMILAAKFPM 167
>gi|413925772|gb|AFW65704.1| hypothetical protein ZEAMMB73_319662 [Zea mays]
Length = 1522
Score = 250 bits (639), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 140/276 (50%), Positives = 180/276 (65%), Gaps = 16/276 (5%)
Query: 458 LFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEISVLPSNRESFRT 517
L C + YT T G + S SG +D R ++S ++ + S L N+ S
Sbjct: 1259 LVCSNPQEVYTTSTD-QMGAEQSQSGCGQKYNDARVQTAS--HERHQSSTLCDNQNSCSV 1315
Query: 518 GMPQ-AHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR-IY 575
+ A D+T + +G K+++ S K + PKK +D DWD LR+ +
Sbjct: 1316 VLQGVASDSTQKFVDTQKGPSKARQNGSKAK----VRGRPKKKTD-----DWDSLRKKVL 1366
Query: 576 STG--EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSI 633
S G ++RSHD D+VDWEAVR A+ EI++ I+ERG N++A RIKEFLNRLV HGSI
Sbjct: 1367 SNGGDKQRSHDARDTVDWEAVRQAEVREISETIRERGMNNMLAERIKEFLNRLVTDHGSI 1426
Query: 634 DLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
DLEWLR+V PDK K++LL I GLGLKSVECVRLL+L H+AFPVD NVGRI VRLGWVPLQ
Sbjct: 1427 DLEWLRDVQPDKAKDFLLSIRGLGLKSVECVRLLTLHHMAFPVDTNVGRICVRLGWVPLQ 1486
Query: 694 PLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTL 729
PLP L +HLL+ YP+++ IQ YLWPRLC LDQ+TL
Sbjct: 1487 PLPESLQLHLLEMYPMLEHIQKYLWPRLCKLDQRTL 1522
Score = 128 bits (322), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/167 (49%), Positives = 108/167 (64%), Gaps = 20/167 (11%)
Query: 84 IIRKFKDLTIR---DGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQIKEKR 140
II K K LTI G S++P NA+VPY+ E AL+ + K K+
Sbjct: 799 IIEKIKLLTINGPDKGVSEVP------------KNALVPYE----GEFSALIAFEGKAKK 842
Query: 141 KKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMH 200
+ + +V +DP T MWNL+M D G + ++D EKW +ER VF GR+ SF ARMH
Sbjct: 843 SRPRAKVNIDPVTTMMWNLLMGPDMGDDAEGL-DKDKEKWLDEERRVFRGRVDSFIARMH 901
Query: 201 LILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 247
L+ GDRRF WKGSVVDSVVGV+LTQNVSD+LSSSA+M++AA+FP+K
Sbjct: 902 LVQGDRRFSRWKGSVVDSVVGVFLTQNVSDHLSSSAFMAVAAKFPVK 948
>gi|224138760|ref|XP_002326683.1| hypothetical protein POPTRDRAFT_423916 [Populus trichocarpa]
gi|222834005|gb|EEE72482.1| hypothetical protein POPTRDRAFT_423916 [Populus trichocarpa]
Length = 210
Score = 246 bits (629), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 114/211 (54%), Positives = 147/211 (69%), Gaps = 2/211 (0%)
Query: 885 SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIW 944
S ALV++ S P K+KR LRTEH VY LPD HE+L G D+R+ DD PYLLAIW
Sbjct: 2 SQALVSVGTTPLSAP--KMKRVTSLRTEHQVYELPDNHEILVGLDKRERDDSVPYLLAIW 59
Query: 945 SPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFP 1004
PGE+PNS P+ C+S+ S+LC+ + C++C + E+ AGIVRGTILIPCRTA +G FP
Sbjct: 60 QPGETPNSSQQPEKLCSSQGSQLCDQKTCFACEGIREQQAGIVRGTILIPCRTALKGSFP 119
Query: 1005 LNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFW 1064
LNGTYFQVNEVFADH++S+ PI VPR + NL + + GSS ++F L +++I + FW
Sbjct: 120 LNGTYFQVNEVFADHKSSYDPIIVPRELLWNLVKRTLYVGSSTKSIFRDLSLKEIHQNFW 179
Query: 1065 RGFVCVRGFDRRYRCPRPLVNRLHCSPSKKE 1095
GF CV+ F+R P+PL R HCS SK E
Sbjct: 180 TGFTCVKAFERGTGAPKPLARRFHCSASKME 210
>gi|222628678|gb|EEE60810.1| hypothetical protein OsJ_14411 [Oryza sativa Japonica Group]
Length = 1072
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 183/571 (32%), Positives = 256/571 (44%), Gaps = 131/571 (22%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
KKN + +D R + G R D D VDWEA+ A +++ D IK+RGQ + +A
Sbjct: 268 KKNDTSHWHSLYDEAR---NRGYIRDDDIPDMVDWEALMNAPFADVVDCIKDRGQHSQMA 324
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAF-- 674
RI FL R+ HG+IDLEWLR +P K K+YL + GLG KSV+C+RLLSL+H AF
Sbjct: 325 FRILAFLIRMKRDHGNIDLEWLRFIPRAKAKQYLHSVIGLGHKSVDCIRLLSLRHRAFPA 384
Query: 675 ----------PVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYL 724
P+D N+ I RLGWV L+PLP +E+ +DK
Sbjct: 385 SYLFSNQIILPIDTNIAHIVTRLGWVQLRPLPSS------QEFHRVDK------------ 426
Query: 725 DQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVT 784
YELH QMITFGK C+K PNCGACP ECK++ S A ALP S++
Sbjct: 427 -----YELHCQMITFGKAICRKSKPNCGACPFTSECKYYKSQFGRAALALPEYSQQDATK 481
Query: 785 SEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSP-----GPQCSES 839
+ ++ L+ +IE +E +N CEP+IE P +P G E
Sbjct: 482 DANMDDPAKTYDLIFKA-HQYQIEYGKNTE----MNYCEPVIEIPPTPLHENRGETSDED 536
Query: 840 PES--EIDDDDFSIG----DIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTA 893
E+ DDD IG D+E+I E + + P ++ ++ +
Sbjct: 537 DENGYYFDDDMEDIGRHDYDMEDIEHDYDMEVDLRSAKPT-TNTSQAGATPGKEMIPINP 595
Query: 894 EAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSV 953
A S P + + LRTE+ ++PD H +L+ FD R P D +PYLL S E
Sbjct: 596 RAKSTP---MVKKFSLRTEYTACIIPDGHIILKKFDPRVPGDRNPYLLVFRSFDEH---- 648
Query: 954 GSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVN 1013
V+ TIL
Sbjct: 649 --------------------------------TVKATIL--------------------- 655
Query: 1014 EVFADHETSHHPINVPRHTIANLRRE--IVCFGSSATTLFNALDMRQIQEYFWRGFVCVR 1071
VFADH +S PI + R + LRR+ IV FG+ ++ +G++C R
Sbjct: 656 -VFADHSSSRSPIEINRDLVWELRRQTCIVHFGTRVHSVT-------------KGYICTR 701
Query: 1072 GFDRRYRCPRPLVNRLHCSPSKKEAGSKRSR 1102
FDRR + P+ L +H + K+ G KR+R
Sbjct: 702 EFDRRTKFPKQLCVEIHATNVNKDIGKKRAR 732
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/79 (53%), Positives = 55/79 (69%)
Query: 181 WQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSL 240
W + R FE ++ F M I+GDR F PW GSVV SVVG +LTQNVSDNLSS+A+M++
Sbjct: 68 WNERRREFERKVDHFMYNMRSIIGDRNFSPWGGSVVTSVVGTFLTQNVSDNLSSNAFMTI 127
Query: 241 AARFPLKSTNDHTPSDENL 259
AARFPLK+ + +N+
Sbjct: 128 AARFPLKNRRNAGHHSDNV 146
>gi|48716531|dbj|BAD23135.1| putative transcriptional activator DEMETER [Oryza sativa Japonica
Group]
Length = 1552
Score = 237 bits (604), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 125/238 (52%), Positives = 150/238 (63%), Gaps = 38/238 (15%)
Query: 535 GKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLRR--IYSTGEERSHDKM-DSVDW 591
GK K E + +N R + DWD LR+ +++ G + DK D++DW
Sbjct: 1174 GKSKHTEKRPKARNVRGRTKMK--------HYDWDNLRKEVLHNHGNRQRSDKAKDTIDW 1225
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA +FLNRLV HGSIDLEWLR++ PDK K +LL
Sbjct: 1226 EA---------------------------DFLNRLVRDHGSIDLEWLRDIEPDKAKGFLL 1258
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I GLGLKS ECVRLL+L +AFPVD NV RI VRLGWVPLQPLP L +HLL+ YP+++
Sbjct: 1259 SIRGLGLKSTECVRLLTLHQMAFPVDTNVARICVRLGWVPLQPLPESLQLHLLELYPLLE 1318
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
IQ Y+WPRLC LDQ LYELHYQMITFGK+FC K PNC +CP+R ECKHFASA AS
Sbjct: 1319 HIQKYIWPRLCKLDQLILYELHYQMITFGKVFCSKSKPNCNSCPMRAECKHFASAFAS 1376
Score = 146 bits (369), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 110/189 (58%), Gaps = 30/189 (15%)
Query: 834 PQCSESP---ESEIDDDDFSIGDIEEIPTLRLQEREFKENFPNFMEMNKVMLQD---SSA 887
P C+ P E + F+ D +EIP + L EF +N +++ N + ++D S+A
Sbjct: 1356 PNCNSCPMRAECKHFASAFASEDPDEIPIINLNVEEFAQNLKSYIHANNIEIEDADMSNA 1415
Query: 888 LVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSP- 946
LVA++ +AASVPT KLK LRTEH VY LPD+H LL GFD+R+PDDPSPYLL+IW+P
Sbjct: 1416 LVAISPQAASVPTSKLKNVNRLRTEHQVYELPDSHPLLEGFDQREPDDPSPYLLSIWTPG 1475
Query: 947 ----------------------GESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSVCERN 983
GE+ S +P+ CNSK++ KLC S C+SCNS E
Sbjct: 1476 KLMCSHPTFTLIQVILMIKISTGETAQSTDAPKTFCNSKETGKLCESSTCFSCNSTREMQ 1535
Query: 984 AGIVRGTIL 992
+ VRGT+L
Sbjct: 1536 SQKVRGTLL 1544
Score = 143 bits (360), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 71/127 (55%), Positives = 92/127 (72%), Gaps = 1/127 (0%)
Query: 129 GALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVF 188
GALVP + K K++ + EV +DP T MWNL+M +D + +ED E+ ++ER +F
Sbjct: 745 GALVPLEGKVKKRGSRAEVKIDPVTNLMWNLLMALDKCEGVEGI-DEDKERLLEEERRMF 803
Query: 189 EGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
GRI SF A MHL+ GDRRF PWKGS+VDSVV V+LTQNVSD+LSSSA+M+LAARFP+KS
Sbjct: 804 RGRIDSFIAHMHLVQGDRRFSPWKGSIVDSVVDVFLTQNVSDHLSSSAFMALAARFPVKS 863
Query: 249 TNDHTPS 255
P+
Sbjct: 864 EGPEKPA 870
>gi|168050578|ref|XP_001777735.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670836|gb|EDQ57397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1992
Score = 224 bits (572), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 195/672 (29%), Positives = 300/672 (44%), Gaps = 143/672 (21%)
Query: 131 LVPHQIKEKRKKLKPEVVLDPGTLRMWN-LIMNIDDGTTKDQTSNEDMEKWWQKEREVFE 189
+V ++ + K KP+V LD T++ + L++ + + D+T ++ W ++R+++
Sbjct: 901 MVSYRGMRPKVKCKPKVFLDEVTIKTFERLLLQGGEASDADRT-----DENWAEQRQLWG 955
Query: 190 GRIQSFTARMHLI-------------------LGDRRFKPWKGSVVDSVVGVYLTQNVSD 230
GR SF A M + GDR F W GSV+DSVVGV+LTQNVSD
Sbjct: 956 GRAHSFNAVMRDVQGWNPDVVVCVLCWTCPGDAGDRTFSKWGGSVLDSVVGVFLTQNVSD 1015
Query: 231 NLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYF--- 287
LSS+A+M L ARFP + ++T+ V E ++
Sbjct: 1016 FLSSNAFMELCARFP----------------------VRRDATTGSCVASKEQSLLSKEM 1053
Query: 288 -----VTEPEPDRCCELKDRDDA----FDSRIQRKALQENGDIKVMTDAVPSQAFDT--- 335
V P+P L R + D Q+ A ++ +V+T+ ++ DT
Sbjct: 1054 SRECPVETPDPYLTRALHPRAEGSAAGIDELQQKTAGAQDDSGQVITEVCVAETPDTLLK 1113
Query: 336 SSVQSLDRTQLFPT---GNSKADVASSTKTSNAESFITQFSHT----------------- 375
++ D PT G+ D++ S+ I S +
Sbjct: 1114 KPQEAKDILVDLPTRDDGHRDLDISDPYSLSDGHGGIEALSRSIAIDNLAVALETVHIQS 1173
Query: 376 --------GNLKKNSVNQLFPT--VNSKADVAS-----------PSQNHITQSSVTQFWP 414
+K + N P V AD AS P N + +S T+
Sbjct: 1174 SAVPKDCCSFMKMTTTNVRAPDEHVTDDADYASRPQATKIVSEMPGDNRVQESIGTEAGT 1233
Query: 415 TGNSTADVA----SPSKTCIKESSIAASTEIPQLENTALLQDKVDGILFCDECLDGYTKP 470
G T DV S + C+ + + T P ++LQDK L D D K
Sbjct: 1234 QG-PTQDVGEKRHSSEQGCLVGNDEKSDTRKPFSRGYSVLQDKFKLSLEEDLTPDRSEK- 1291
Query: 471 TRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEISVLPSNRESFRTGMPQAHDATTTSK 530
+T+ R+ L + ++ +P +I R ++ H + + K
Sbjct: 1292 -------LLATALRHILGKSITAREAAVVKEPLKIE----KRRKLKSSKNARHVSKISVK 1340
Query: 531 KSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQQDWDLLR-RIYSTGEERSHDKMDSV 589
+ G+ + E+ + K + KTT W+ LR ++ S E+ + DSV
Sbjct: 1341 ELTGGQRATYEHGFNYKLNAEMKTT------------WEALRAKVLSDNFEKDYSISDSV 1388
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDKVKE 648
DWEAVR AD + +AD IKERG NI++GRIK L+R+ + GS+DLEW+R + P +
Sbjct: 1389 DWEAVRLADVAVVADLIKERGMNNILSGRIKSLLDRIYRDQDGSLDLEWIRKLSPIDSQN 1448
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L+++ GLG+KSVEC+RLL+L H +FPVD NVGRI VRLGWVPL+PLP + +HLL
Sbjct: 1449 FLINVRGLGIKSVECIRLLTLHHPSFPVDTNVGRILVRLGWVPLEPLPEKIRLHLL---- 1504
Query: 709 VMDKIQMYLWPR 720
+MY PR
Sbjct: 1505 -----EMYFVPR 1511
Score = 162 bits (410), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 114/284 (40%), Positives = 137/284 (48%), Gaps = 80/284 (28%)
Query: 885 SSALVALTAEAASVPTRKLKRCAHLRTEHHVYV--------------------------- 917
S L+ L AAS P LK LRT H+VYV
Sbjct: 1708 SQELILLPPYAASFPAPLLKSVQRLRTIHYVYVPLHGQFLSDFEIEQNVQVRYTVQLSFL 1767
Query: 918 -------------LPDAHELLR--GFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNS 962
LPD H LL D+RD DDP YLLAIW+ E P ++
Sbjct: 1768 ALIHFNLVDCSYELPDHHPLLHQAKMDKRDSDDPCLYLLAIWNADEVPAAM--------P 1819
Query: 963 KDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETS 1022
K S S SCNS E V GTIL+PCRTAN+G FPLNGTYFQVNEVFADH +S
Sbjct: 1820 KISDDDASNPFASCNSGQE-----VPGTILVPCRTANKGSFPLNGTYFQVNEVFADHASS 1874
Query: 1023 HHPINVPRHTIANLRREIVCFGSSATTLFNAL--------------------DMRQ--IQ 1060
H P+ V R + NL+R+ V FG+S T++F L DM Q IQ
Sbjct: 1875 HDPLRVSRTLMWNLKRKFVYFGTSVTSIFRDLKKVRLFASSQKDLTRNGIETDMSQGEIQ 1934
Query: 1061 EYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK---KEAGSKRS 1101
F +G+VCVR FD+ R P+PL RLH S +K AG++RS
Sbjct: 1935 CCFKKGYVCVRAFDQATRKPKPLAPRLHQSAAKIVAARAGAQRS 1978
>gi|326503606|dbj|BAJ86309.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 723
Score = 214 bits (544), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 103/203 (50%), Positives = 133/203 (65%), Gaps = 5/203 (2%)
Query: 568 WDLLRR-IYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR Y G + D V+WEAV+ A ++A I RGQ ++A RI+ FL
Sbjct: 485 WDKLREEAYRKGYDNGSKTRITDKVNWEAVQQASFVDVAKCIAGRGQHYVLALRIQAFLT 544
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+ + HGS DL+WLR +P + K+YL+ I GLG KSV+C+RLLSL+H AFPVDVNV RI
Sbjct: 545 RIKKDHGSFDLDWLRYLPRESAKKYLISINGLGAKSVDCIRLLSLEHKAFPVDVNVARIV 604
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RL WV LQ + H L YP+M +Q YLWPRLC +D++ LYELH MITFGK+ C
Sbjct: 605 TRLQWVELQCCSEEFHSVDL--YPLMQDVQSYLWPRLCTIDKEKLYELHCLMITFGKVIC 662
Query: 745 KKRSPNCGACPLRGECKHFASAV 767
K PNC ACP RG+C+++ S +
Sbjct: 663 TKVDPNCNACPFRGDCRYYKSKL 685
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 79/144 (54%), Gaps = 18/144 (12%)
Query: 116 AVVPYQVGPSSEH--------GALVPHQIKEKRKKLKPEVV-LDPGTLRMWNLIMNIDDG 166
A+VPYQ ++ G L PH + +++P V+ LD TLR++ + ++
Sbjct: 227 AIVPYQNAAQADASCSALVPLGDLGPH--GNRSNQVRPRVLGLDDDTLRVYGALTKWEE- 283
Query: 167 TTKDQTSNEDMEKW----WQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGV 222
D S E ++ W R +E + F A + + G R + W GSV+DSVVG
Sbjct: 284 --IDSESFEGVDIGSGPKWDDIRRKYENLVDWFIATVKDLFGPREYSRWGGSVIDSVVGT 341
Query: 223 YLTQNVSDNLSSSAYMSLAARFPL 246
+LTQNV+D+LSS+A+M LAA+FP+
Sbjct: 342 FLTQNVADHLSSNAFMILAAKFPM 365
>gi|218190781|gb|EEC73208.1| hypothetical protein OsI_07289 [Oryza sativa Indica Group]
Length = 1111
Score = 212 bits (539), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 185/375 (49%), Gaps = 69/375 (18%)
Query: 743 FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNP- 801
C K SPNC ACP +CK++ S++A R +LP P+E G+ G+ P
Sbjct: 758 ICTKVSPNCRACPFSAKCKYYNSSLA--RLSLP-PAEG------HGHEYGEEQASTATPG 808
Query: 802 -----------------IPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESP-ESE 843
P I+I ES Y+ CEPIIE P SP + ESP E E
Sbjct: 809 RLLLSNDSHIAGFQQVCQPQIKINMPAGRESIYK---CEPIIEIPPSPEHEYEESPYEQE 865
Query: 844 IDDDDFSIGDIEE-IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRK 902
+ +DD DIE+ IP L+ + + + L + ++ ASV +K
Sbjct: 866 LYEDDLC--DIEDTIPELQYDFEIDLCSLKHTVNNGSWTPNSGKDLALINSQHASVQNKK 923
Query: 903 LKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNS 962
LK LRTEH+ YVLPD H +L F+ R P+DP PYLL + S
Sbjct: 924 LKNIGRLRTEHNAYVLPDDHAILEEFEDRVPEDPCPYLLVVIS----------------- 966
Query: 963 KDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETS 1022
C+ E V+GTILIPCRTA RG FPLNGTYFQ +EVFADH +S
Sbjct: 967 -----CSDE-------------HTVKGTILIPCRTATRGNFPLNGTYFQDHEVFADHSSS 1008
Query: 1023 HHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRP 1082
PI +PR I NL R IV FGSS ++ + I++ + +G++CVRGFDR R P+P
Sbjct: 1009 RSPITIPRECIWNLDRCIVYFGSSIQSIMKGQTRQDIEDCYKKGYICVRGFDRNTRYPKP 1068
Query: 1083 LVNRLHCSPSKKEAG 1097
+ +LH + + G
Sbjct: 1069 ICAKLHATNERNGTG 1083
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/111 (52%), Positives = 73/111 (65%), Gaps = 2/111 (1%)
Query: 568 WDLLRR-IYSTGEE-RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNR 625
WD LR YS G + + K DS DWEAV A EIA++I RGQ +IA RI+ FL R
Sbjct: 647 WDKLREEAYSKGYKIKGTGKSDSADWEAVLHAPAVEIANSIAVRGQHYVIALRIQAFLKR 706
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPV 676
+ + HG+ DL+WLR VP + K YL+ I GLG KSV+C+RLLSL+H FP
Sbjct: 707 VKKDHGNFDLDWLRYVPRESAKNYLISILGLGDKSVDCIRLLSLKHKGFPA 757
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 88/149 (59%), Gaps = 18/149 (12%)
Query: 112 NVHNAVVPYQVGPS-SEHGALVPHQ------IKEKR---KKLKPEVV-LDPGTLRMWNLI 160
N +A+VPYQ P+ + ALVP Q I R K L+ +V+ LD TL+++N++
Sbjct: 378 NGESALVPYQKAPAATSSSALVPIQNSTQLAIVHHRNHLKNLRTKVLGLDEKTLQVYNVL 437
Query: 161 MNIDDGTTKDQTSNEDME----KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVV 216
D+ D S E ++ W + R FE + F A +H +LG RRF W GSV
Sbjct: 438 RKWDE---TDSESFEGVDIGSGPEWNETRRHFEHYVDVFIATVHGLLGPRRFSEWGGSVT 494
Query: 217 DSVVGVYLTQNVSDNLSSSAYMSLAARFP 245
DSVVG +LTQNV+DNLSS+A+++L A+FP
Sbjct: 495 DSVVGTFLTQNVADNLSSNAFLNLVAKFP 523
>gi|222622899|gb|EEE57031.1| hypothetical protein OsJ_06813 [Oryza sativa Japonica Group]
Length = 1051
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 137/375 (36%), Positives = 185/375 (49%), Gaps = 69/375 (18%)
Query: 743 FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNP- 801
C K SPNC ACP +CK++ S++A R +LP P+E G+ G+ P
Sbjct: 698 ICTKVSPNCRACPFSAKCKYYNSSLA--RLSLP-PAEG------HGHEYGEEQASTATPG 748
Query: 802 -----------------IPVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESP-ESE 843
P I+I ES Y+ CEPIIE P SP + ESP E E
Sbjct: 749 RLLLSNDSHIAGFQQVCQPQIKINMPAGRESIYK---CEPIIEIPPSPEHEYEESPYEQE 805
Query: 844 IDDDDFSIGDIEE-IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRK 902
+ +DD DIE+ IP L+ + + + L + ++ ASV ++
Sbjct: 806 LYEDDLC--DIEDTIPELQYDFEIDLCSLKHTVNNGSWTPNSGKDLALINSQHASVQNKR 863
Query: 903 LKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNS 962
LK LRTEH+ YVLPD H +L F+ R P+DP PYLL + S
Sbjct: 864 LKNIGRLRTEHNAYVLPDDHAILEEFEDRVPEDPCPYLLVVIS----------------- 906
Query: 963 KDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETS 1022
C+ E V+GTILIPCRTA RG FPLNGTYFQ +EVFADH +S
Sbjct: 907 -----CSDE-------------HTVKGTILIPCRTATRGNFPLNGTYFQDHEVFADHSSS 948
Query: 1023 HHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRP 1082
PI +PR I NL R IV FGSS ++ + I++ + +G++CVRGFDR R P+P
Sbjct: 949 RSPITIPRECIWNLDRCIVYFGSSIQSIMKGQTRQDIEDCYKKGYICVRGFDRNTRYPKP 1008
Query: 1083 LVNRLHCSPSKKEAG 1097
+ +LH + + G
Sbjct: 1009 ICAKLHATNERNGTG 1023
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/111 (51%), Positives = 72/111 (64%), Gaps = 2/111 (1%)
Query: 568 WDLLRR-IYSTGEE-RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNR 625
WD LR YS G + + DS DWEAV A EIA++I RGQ +IA RI+ FL R
Sbjct: 587 WDKLREEAYSKGYKIKGTGISDSADWEAVLHAPAVEIANSIAVRGQHYVIALRIQAFLKR 646
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPV 676
+ + HG+ DL+WLR VP + K YL+ I GLG KSV+C+RLLSL+H FP
Sbjct: 647 VKKDHGNFDLDWLRYVPRESAKNYLISILGLGDKSVDCIRLLSLKHKGFPA 697
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 88/149 (59%), Gaps = 18/149 (12%)
Query: 112 NVHNAVVPYQVGPS-SEHGALVPHQ------IKEKR---KKLKPEVV-LDPGTLRMWNLI 160
N +A+VPYQ P+ + ALVP Q I R K L+ +V+ LD TL+++N++
Sbjct: 318 NGESALVPYQKAPAATSSSALVPIQNSTQLAIVHHRNHLKNLRTKVLGLDEKTLQVYNVL 377
Query: 161 MNIDDGTTKDQTSNEDME----KWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVV 216
D+ D S E ++ W + R FE + F A +H +LG RRF W GSV
Sbjct: 378 RKWDE---TDSESFEGVDIGSGPEWNETRRHFEHYVDVFIATVHGLLGPRRFSEWGGSVT 434
Query: 217 DSVVGVYLTQNVSDNLSSSAYMSLAARFP 245
DSVVG +LTQNV+DNLSS+A+++L A+FP
Sbjct: 435 DSVVGTFLTQNVADNLSSNAFLNLVAKFP 463
>gi|168067245|ref|XP_001785533.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662839|gb|EDQ49643.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1894
Score = 209 bits (531), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 128/292 (43%), Positives = 165/292 (56%), Gaps = 36/292 (12%)
Query: 568 WDLLRRIYSTGEERSHDKM--DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNR 625
W+ LR + S D + D VDW AV+ A E+AD IK RGQ N +AGR+K FL+R
Sbjct: 1218 WEPLRARIVAEQTYSRDTLTEDCVDWNAVQQASVHEVADVIKNRGQHNALAGRLKAFLDR 1277
Query: 626 L-VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
+ + +G IDLEW+R +PP+ K +LL G+GLKSVEC+RLL L H +FPVD NVGRIA
Sbjct: 1278 VHRDQNGVIDLEWIRKLPPEDAKAFLLSFRGVGLKSVECIRLLCLHHPSFPVDTNVGRIA 1337
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
VRLGWVPL+PLP + + LYELHY MITFGK+FC
Sbjct: 1338 VRLGWVPLEPLPEETQL-----------------------HLLELYELHYHMITFGKVFC 1374
Query: 745 KKRSPNCGACPLRGECKHFASAVASARFALPGPSE-KGIVTSEFGNGIGQSPPLVVNPIP 803
K PNC ACPLR ECKH+AS+ ASA+ L P + + ++ F P + I
Sbjct: 1375 TKSKPNCNACPLRSECKHYASSYASAKLLLKWPEKPQNKQSTVFALPEHHPAPQAESVIS 1434
Query: 804 VIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIE 855
R+ + Q C PIIEEP SP P P +I+++ F I + E
Sbjct: 1435 DSRL-------TEVQSVACLPIIEEPCSPEPCIDNIP--DIEENPFRIDNNE 1477
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 63/127 (49%), Positives = 84/127 (66%), Gaps = 8/127 (6%)
Query: 987 VRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSS 1046
VRGT+LIPC+TA RG FPLNGTYFQVNEVFADH +S PI+VPR + +LRR V F ++
Sbjct: 1763 VRGTLLIPCKTAMRGWFPLNGTYFQVNEVFADHTSSLQPISVPRTLLWSLRRRFVYFSTA 1822
Query: 1047 ATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK--------KEAGS 1098
+ LF + + +I+ F +G+VC+R FD R P+ L +RLH S +K K G
Sbjct: 1823 VSNLFKDMSLEEIRCCFQKGYVCLRAFDLATRQPKILAHRLHQSGAKIVKAPVPRKGKGI 1882
Query: 1099 KRSRDES 1105
RSR+ +
Sbjct: 1883 GRSRNNA 1889
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/128 (40%), Positives = 79/128 (61%), Gaps = 14/128 (10%)
Query: 137 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEK---WWQKEREVFEGRIQ 193
+++R + KP+V LD T++M+ + T +S+E+ E+ W++ R+ ++ R
Sbjct: 762 RKRRLRYKPKVDLDLNTVKMFKTL-------TMKGSSHEESERNKASWEECRQQWQNRAH 814
Query: 194 SFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK-STND- 251
F + M + G+R F WKGSV+DSV G +LTQNVSD LSS+A+M+L ARFP + T D
Sbjct: 815 QFISIMRQVQGNRAFSRWKGSVMDSVGGAFLTQNVSDFLSSNAFMALRARFPARPGTADE 874
Query: 252 --HTPSDE 257
HTP E
Sbjct: 875 ETHTPLAE 882
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 39/64 (60%)
Query: 885 SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDDPSPYLLAIW 944
S L+ L A P KLK LRT H+VY LPD H LL D R+PDDP YLLAIW
Sbjct: 1603 SQELILLPPCADLAPAPKLKNVQRLRTVHYVYELPDNHPLLFAMDEREPDDPCFYLLAIW 1662
Query: 945 SPGE 948
SPG+
Sbjct: 1663 SPGK 1666
>gi|224068969|ref|XP_002326243.1| predicted protein [Populus trichocarpa]
gi|222833436|gb|EEE71913.1| predicted protein [Populus trichocarpa]
Length = 134
Score = 194 bits (493), Expect = 2e-46, Method: Composition-based stats.
Identities = 88/111 (79%), Positives = 97/111 (87%)
Query: 586 MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK 645
MDS+DWEAVRCAD +EIA+ IKERG NI+A RIK LNRLV HGSIDLEWLR++PPDK
Sbjct: 1 MDSLDWEAVRCADVNEIANTIKERGMNNILAERIKNLLNRLVREHGSIDLEWLRDIPPDK 60
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLP 696
KEYLL I GLGLKSVEC+RLL+L H+AFPVD NVGRIAVRLGWVPLQPLP
Sbjct: 61 AKEYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLP 111
>gi|7406461|emb|CAB85563.1| putative protein [Arabidopsis thaliana]
Length = 555
Score = 193 bits (490), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 91/141 (64%), Positives = 111/141 (78%), Gaps = 3/141 (2%)
Query: 568 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
WD LR+ E ER+ + MDS+D+EA+R A SEI++AIKERG N++A RIK+FL
Sbjct: 414 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 473
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R+V+ HG IDLEWLR PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 474 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 533
Query: 685 VRLGWVPLQPLPGDLHIHLLK 705
VR+GWVPLQPLP L +HLL+
Sbjct: 534 VRMGWVPLQPLPESLQLHLLE 554
>gi|224138764|ref|XP_002326684.1| DNA glycosylase [Populus trichocarpa]
gi|222834006|gb|EEE72483.1| DNA glycosylase [Populus trichocarpa]
Length = 121
Score = 192 bits (489), Expect = 6e-46, Method: Composition-based stats.
Identities = 88/121 (72%), Positives = 103/121 (85%)
Query: 586 MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK 645
MDSVDWEAVR A+ S++A I+ RGQQ IIAGRIK+FL+R+V++H SIDLEWLR PPD
Sbjct: 1 MDSVDWEAVRQAETSKVASIIEGRGQQTIIAGRIKQFLDRVVDMHKSIDLEWLRYAPPDD 60
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
VK+YLL+ GLGLKSVECVRLLSLQ +AFPVDVNV RIAVRLGWVPL+ LPG L HL++
Sbjct: 61 VKDYLLEFMGLGLKSVECVRLLSLQQVAFPVDVNVARIAVRLGWVPLKALPGSLQFHLIE 120
Query: 706 E 706
E
Sbjct: 121 E 121
>gi|2911056|emb|CAA17566.1| hypothetical protein [Arabidopsis thaliana]
gi|7270355|emb|CAB80123.1| hypothetical protein [Arabidopsis thaliana]
Length = 917
Score = 192 bits (489), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 139/391 (35%), Positives = 183/391 (46%), Gaps = 92/391 (23%)
Query: 567 DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 626
DW+ LRR+Y+ R MDSV+W VR + QN++ IK+
Sbjct: 571 DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSG-------------QNVLETTIKK----- 612
Query: 627 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
G R+LS + + VD NVGRIAVR
Sbjct: 613 ---RGQF-------------------------------RILSERIL---VDTNVGRIAVR 635
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
LG VPL+PLP + +H L EY ELHYQMITFGK+FC K
Sbjct: 636 LGLVPLEPLPNGVQMHQLFEY-----------------------ELHYQMITFGKVFCTK 672
Query: 747 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 806
PNC ACP++ ECK+FASA S++ L P EK + F N Q + V+ I
Sbjct: 673 TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 730
Query: 807 IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 863
+ + VS C +P++E P SP + ES ++I+D F +P +
Sbjct: 731 LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 788
Query: 864 EREFKENFPNFMEMNKVML----QDSSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 916
K++ + + ++ M + S ALV T E A +P RK+K LRTEH VY
Sbjct: 789 LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 848
Query: 917 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPG 947
VLPD HELL F+RR DDPSPYLLAIW PG
Sbjct: 849 VLPDNHELLHDFERRKLDDPSPYLLAIWQPG 879
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 82/115 (71%), Gaps = 2/115 (1%)
Query: 132 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 191
V IK +K + +V LDP T++ W+++M ++D ++ E E W+KERE+F+ R
Sbjct: 404 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSYDDKE-TEAKWKKEREIFQTR 461
Query: 192 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 246
I F RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 462 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 516
>gi|14021063|dbj|BAB47587.1| putative FPPsynthase1 [Euphorbia tirucalli]
Length = 119
Score = 189 bits (479), Expect = 9e-45, Method: Composition-based stats.
Identities = 85/108 (78%), Positives = 94/108 (87%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D +DWEAVRCAD EIA+ IKERG N++A RIK+FLNRLV HG+IDLEWLR+VPPDK
Sbjct: 12 DYLDWEAVRCADVDEIANTIKERGMNNVLAKRIKDFLNRLVREHGNIDLEWLRDVPPDKA 71
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
KEYLL I GLGLKSVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQP
Sbjct: 72 KEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 119
>gi|224126427|ref|XP_002329551.1| hypothetical protein POPTRDRAFT_783774 [Populus trichocarpa]
gi|222870260|gb|EEF07391.1| hypothetical protein POPTRDRAFT_783774 [Populus trichocarpa]
Length = 121
Score = 186 bits (472), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 87/121 (71%), Positives = 99/121 (81%)
Query: 586 MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK 645
MDSVDWEAVR A+ SEIA I+ RGQQ IIAG IK FLNR+VE+H SIDLEWLR PPD
Sbjct: 1 MDSVDWEAVRQAETSEIASIIEARGQQTIIAGSIKGFLNRVVEMHKSIDLEWLRYAPPDD 60
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
VK+YLL+ GLGLKSVECVRLLS+QH+AFPVD+NV I RLGW PL+PLPG L HL++
Sbjct: 61 VKDYLLEFTGLGLKSVECVRLLSIQHVAFPVDINVAWIVFRLGWAPLKPLPGSLQFHLIE 120
Query: 706 E 706
E
Sbjct: 121 E 121
>gi|224100671|ref|XP_002311969.1| predicted protein [Populus trichocarpa]
gi|222851789|gb|EEE89336.1| predicted protein [Populus trichocarpa]
Length = 108
Score = 181 bits (458), Expect = 3e-42, Method: Composition-based stats.
Identities = 83/108 (76%), Positives = 94/108 (87%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
DS+D+EAVRCA +EI++AIKERG I+A RI+EFLNRLV HGSIDLEWLR+VPP K
Sbjct: 1 DSLDYEAVRCAGVNEISEAIKERGMNKILAERIQEFLNRLVREHGSIDLEWLRDVPPGKA 60
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
K+YLL I GLGLKSVECVRLL+L H+AFPVD NVGRIAVRLGWVPLQP
Sbjct: 61 KDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 108
>gi|297723055|ref|NP_001173891.1| Os04g0358000 [Oryza sativa Japonica Group]
gi|255675367|dbj|BAH92619.1| Os04g0358000 [Oryza sativa Japonica Group]
Length = 1310
Score = 179 bits (454), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 162/579 (27%), Positives = 235/579 (40%), Gaps = 134/579 (23%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
KKN + +D R + G R D D VDWEA+ A +++ D IK+RGQ + +A
Sbjct: 464 KKNDTSHWHSLYDEAR---NRGYIRDDDIPDMVDWEALMNAPFADVVDCIKDRGQHSQMA 520
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPV 676
RI FL R+ HG+IDLEWLR +P K R+ + A
Sbjct: 521 FRILAFLIRMKRDHGNIDLEWLRFIPRAKANYK--------------ARMGATPSFAIIS 566
Query: 677 DVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT-LYELHYQ 735
V+ GR +Q +PG L + R+C +Q+T YELH Q
Sbjct: 567 GVSSGRQQNSKFSTLIQAMPGYLS-----------------FDRICLTNQRTNRYELHCQ 609
Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSP 795
MITFGK C+K PNCGACP ECK++ S A ALP S++ + ++
Sbjct: 610 MITFGKAICRKSKPNCGACPFTSECKYYKSQFGRAALALPEYSQQDATKDANMDDPAKTY 669
Query: 796 PLVVNPIPVIRIEADPVSESGYQINNCEPIIEEPRSP-----GPQCSESPES--EIDDDD 848
L+ +IE +E +N CEP+IE P +P G E E+ DDD
Sbjct: 670 DLIFKA-HQYQIEYGKNTE----MNYCEPVIEIPPTPLHENRGETSDEDDENGYYFDDDM 724
Query: 849 FSIG----DIEEIPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLK 904
IG D+E+I E + + P ++ ++ + A S P +
Sbjct: 725 EDIGRHDYDMEDIEHDYDMEVDLRSAKPT-TNTSQAGATPGKEMIPINPRAKSTP---MV 780
Query: 905 RCAHLRTEHHVY-------------------VLPDAHELLRGFDRRDPDDPSPYLLAIWS 945
+ LRTE+ ++PD H +L+ FD R P D +PYLL S
Sbjct: 781 KKFSLRTEYTAIFYSWALIILVFFTIAVFRCIIPDGHIILKKFDPRVPGDRNPYLLVFRS 840
Query: 946 PGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPCRTANRGRFPL 1005
E V+ TIL
Sbjct: 841 FDEH------------------------------------TVKATIL------------- 851
Query: 1006 NGTYFQVNEVFADHETSHHPINVPRHTIANLRRE--IVCFGSSATTLFNALDMRQIQEYF 1063
VFADH +S PI + R + LRR+ IV FG+ ++ + ++
Sbjct: 852 ---------VFADHSSSRSPIEINRDLVWELRRQTCIVHFGTRVHSVTKGQTREGLYHFY 902
Query: 1064 WRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSR 1102
G++C R FDRR + P+ L +H + K+ G KR+R
Sbjct: 903 NEGYICTREFDRRTKFPKQLCVEIHATNVNKDIGKKRAR 941
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/79 (53%), Positives = 55/79 (69%)
Query: 181 WQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSL 240
W + R FE ++ F M I+GDR F PW GSVV SVVG +LTQNVSDNLSS+A+M++
Sbjct: 264 WNERRREFERKVDHFMYNMRSIIGDRNFSPWGGSVVTSVVGTFLTQNVSDNLSSNAFMTI 323
Query: 241 AARFPLKSTNDHTPSDENL 259
AARFPLK+ + +N+
Sbjct: 324 AARFPLKNRRNAGHHSDNV 342
>gi|412985467|emb|CCO18913.1| predicted protein [Bathycoccus prasinos]
Length = 2056
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 80/177 (45%), Positives = 112/177 (63%), Gaps = 4/177 (2%)
Query: 587 DSVDWEAV-RCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK 645
D VDW V A +I + I+ RG +++A RIK+ L R+++ G + LE+LR ++
Sbjct: 1123 DCVDWHRVLHEASIDDIIECIRCRGMHSMLAHRIKKILRRVLDERGVLSLEFLREASTEE 1182
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
YL +IEG+G K+ CV LLSL++ FPVDVNVGRI RLGWVPL+ D + LL+
Sbjct: 1183 ANAYLNEIEGMGAKTSACVNLLSLENRDFPVDVNVGRIMARLGWVPLE---DDFKLELLE 1239
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+Y + + +L RL D LYELHY MIT GK+FC KR PNC +CP+ +C++
Sbjct: 1240 QYAPEESVYEFLSERLNTFDVTMLYELHYHMITLGKVFCAKRDPNCASCPMNSDCEY 1296
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 50/97 (51%)
Query: 987 VRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSS 1046
++ + A + FPL+GTYFQVNEVF D ++ P++V T+ + + G+S
Sbjct: 1757 IKCVFFVTALCACKRSFPLHGTYFQVNEVFLDLRSAAKPVSVDIETLKTSPKIVTLLGAS 1816
Query: 1047 ATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPL 1083
++ + ++ F +CVR ++RR PR L
Sbjct: 1817 IGSVTRGMTRTEVTRLFNHQVLCVRAWERRTGYPREL 1853
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 45/71 (63%), Gaps = 1/71 (1%)
Query: 177 MEKWWQKEREVFEGRIQSFTARMHLILGDRRFK-PWKGSVVDSVVGVYLTQNVSDNLSSS 235
++K+ E+ ++ +I F + LG R W GSV+DSV+G LTQNV+D LSS+
Sbjct: 720 IQKFGHDEKRMWRDKIGEFLRKSRKTLGHRPLDVSWGGSVLDSVIGANLTQNVTDVLSST 779
Query: 236 AYMSLAARFPL 246
A ++L A+FP+
Sbjct: 780 AMLNLMAKFPV 790
>gi|255072753|ref|XP_002500051.1| DNA glycosylase [Micromonas sp. RCC299]
gi|226515313|gb|ACO61309.1| DNA glycosylase [Micromonas sp. RCC299]
Length = 2192
Score = 154 bits (388), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/187 (44%), Positives = 114/187 (60%), Gaps = 4/187 (2%)
Query: 581 RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLR 639
RS + D +DW AV A ++ + I+ RG + A RI+ L R+ E G + LE+LR
Sbjct: 1387 RSENTYDMIDWRAVMNAPLGDVVECIRCRGMHFMQARRIQRILRRIHDERKGELSLEFLR 1446
Query: 640 NVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDL 699
N + + YLL +EG G+K+V C+ LLSL FPVDVNVGRI RLGWVPL+ +
Sbjct: 1447 NCTVEIARGYLLSLEGFGVKTVSCITLLSLFRADFPVDVNVGRIMARLGWVPLE---TEQ 1503
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
+ L EY + +L RL +TL+ELHY MIT GK+FC+KR+PNC ACPLR
Sbjct: 1504 ALEELAEYAPEPAVYTFLRERLNSFGLQTLFELHYHMITLGKVFCEKRTPNCRACPLRDM 1563
Query: 760 CKHFASA 766
C++ +S
Sbjct: 1564 CEYASSG 1570
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 2/118 (1%)
Query: 968 CNSEICYSCNSVCERNAG--IVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHP 1025
C + + +AG +V T+++PCR A G+FPL+GTYFQ NEVF D ET+ P
Sbjct: 1847 CGLPSATAAARIASESAGRKMVPLTVMVPCRAAMNGKFPLHGTYFQTNEVFLDAETAVRP 1906
Query: 1026 INVPRHTIANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPL 1083
V + L V GSS ++ + ++ F VCVR ++ PRPL
Sbjct: 1907 KLVAASELETLPTVSVYLGSSVASICRGMSRAEVASSFANRAVCVRSWEPHTGHPRPL 1964
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 47/76 (61%), Gaps = 2/76 (2%)
Query: 176 DMEKWWQKEREVFEGRIQSFTARMHLILGDRRFK--PWKGSVVDSVVGVYLTQNVSDNLS 233
DM + + ER + + F AR LGDRR K W GSV+DSV G LTQNVSD LS
Sbjct: 1133 DMIVFGRDERLEWARKTGMFLARARTALGDRRLKMDAWGGSVLDSVFGALLTQNVSDVLS 1192
Query: 234 SSAYMSLAARFPLKST 249
SSA M+LAA+FP T
Sbjct: 1193 SSAIMNLAAKFPGPGT 1208
>gi|224113519|ref|XP_002316518.1| predicted protein [Populus trichocarpa]
gi|222865558|gb|EEF02689.1| predicted protein [Populus trichocarpa]
Length = 1312
Score = 153 bits (386), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 167/319 (52%), Gaps = 38/319 (11%)
Query: 46 KIIQVERNGLPTQKMPPRYSRGLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQCN 105
KI Q ++ L + P RG +Q ST II + + L + G ++ N
Sbjct: 763 KIPQEPKDDLKVHQQPYAKRRGRPA---KQTFSSTIEQIIYQMEGLRLNAGSKKIE---N 816
Query: 106 KADGPSNVHNAVVPYQVGPSSEHGALVPHQIKE--KRKKLKPEVVLDPGTLRMWNLIMNI 163
K NA+VPY+ G LVP+ E K+ K +P+V LDP + R+W L+M
Sbjct: 817 KE------QNALVPYK-----GDGKLVPYDGFEVVKKHKPRPKVDLDPESDRVWKLLMG- 864
Query: 164 DDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVY 223
+G+ + +++ E+WW +ER+VF GR+ SF ARMHL+ GDRRF WKGSVVDSV+GV+
Sbjct: 865 KEGSQGLEGTDKGKEQWWGEERKVFHGRVDSFIARMHLVQGDRRFSKWKGSVVDSVIGVF 924
Query: 224 LTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRT------TASLEPIGSNSTSNGA 277
LTQNVSD+LSSSA+MSLA+ FPLK + E T L P SN
Sbjct: 925 LTQNVSDHLSSSAFMSLASLFPLKLRSSGACDRERTSIVIEEPDTCILNPNDIKWNSNPL 984
Query: 278 VYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPSQ-AFDTS 336
S + EP KD + F I+R ++ E + + V SQ +FD+S
Sbjct: 985 YNQSSVTHHGSAEPH-------KDSETLF---IERASMVETQSHSLEEEFVLSQDSFDSS 1034
Query: 337 SVQSLDRTQLFPTGNSKAD 355
+VQ+ + + + NS+A+
Sbjct: 1035 TVQA-NGVRSYSGSNSEAE 1052
>gi|224126423|ref|XP_002329550.1| hypothetical protein POPTRDRAFT_783773 [Populus trichocarpa]
gi|222870259|gb|EEF07390.1| hypothetical protein POPTRDRAFT_783773 [Populus trichocarpa]
Length = 213
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 83/185 (44%), Positives = 115/185 (62%), Gaps = 23/185 (12%)
Query: 125 SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKE 184
+SE A+VP+Q R + + EV LDP +LRMWN ++ I G +++ E E+WW++E
Sbjct: 10 TSEIHAVVPYQ---GRGRAEHEVDLDPESLRMWNQLVKIYSGVDEEKEDGE-REQWWRRE 65
Query: 185 REVFEGRIQSFTARMHLIL---------------GDRRFKPWKGSVVDSVVGVYLTQNVS 229
F+GRI F + +H +L GDRRFK W GSV+ SVVGV+LTQNV+
Sbjct: 66 LLTFQGRIDLFISLLHQVLETFYCTIEGHIACFSGDRRFKQWNGSVMGSVVGVFLTQNVA 125
Query: 230 DNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVT 289
D+LSS+AYMSL A+FP++ST+ S T P+ S+ S G +DS+G+ YFVT
Sbjct: 126 DHLSSAAYMSLVAKFPVRSTSIQQES----MTVDDQRPVESSMASIGPTFDSDGSQYFVT 181
Query: 290 EPEPD 294
EPEPD
Sbjct: 182 EPEPD 186
>gi|384251193|gb|EIE24671.1| hypothetical protein COCSUDRAFT_62092 [Coccomyxa subellipsoidea
C-169]
Length = 1568
Score = 147 bits (370), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 67/128 (52%), Positives = 85/128 (66%), Gaps = 3/128 (2%)
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
LEWLR+V D ++YL+ + GLG KSV CV LL+L FPVD NVGRI RLGW+PL
Sbjct: 722 LEWLRDVAEDDARQYLMGVMGLGRKSVACVMLLALCKHDFPVDTNVGRICARLGWIPLD- 780
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
+ + L Y ++ YL RL + D +TLYELHYQMIT GK+FC K+ PNC AC
Sbjct: 781 --AEEALEDLDRYAPEPEVHKYLHSRLMHFDTETLYELHYQMITLGKVFCSKKDPNCRAC 838
Query: 755 PLRGECKH 762
PLR +C++
Sbjct: 839 PLRPQCEY 846
Score = 100 bits (248), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 55/94 (58%)
Query: 990 TILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATT 1049
+L+PCRTA RGRFPLNGTYFQVNEVF D + PI V + R V FG+S
Sbjct: 1282 AVLVPCRTAMRGRFPLNGTYFQVNEVFLDETSLRSPIQVADEVVGAWVRRPVYFGTSVAA 1341
Query: 1050 LFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPL 1083
+ + ++ +F G +CVR FD R PRPL
Sbjct: 1342 VCRGMGQSEVTRFFLDGLLCVRAFDPITRFPRPL 1375
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 2/64 (3%)
Query: 191 RIQSFTARMHLILGDRR--FKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKS 248
R+ SF +LGDRR W GSV+DSV+GV+LTQNVSD LSS A+M+LAA FPL +
Sbjct: 426 RVASFLQAARGVLGDRRRVSGQWAGSVLDSVMGVFLTQNVSDALSSKAWMTLAATFPLAA 485
Query: 249 TNDH 252
++
Sbjct: 486 ADNQ 489
>gi|224138768|ref|XP_002326685.1| hypothetical protein POPTRDRAFT_796365 [Populus trichocarpa]
gi|222834007|gb|EEE72484.1| hypothetical protein POPTRDRAFT_796365 [Populus trichocarpa]
Length = 897
Score = 145 bits (367), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 88/166 (53%), Positives = 115/166 (69%), Gaps = 10/166 (6%)
Query: 130 ALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFE 189
AL+P+ R + + EV LDP +LR WN +M ID G +++ ++WWQ+E+ F
Sbjct: 596 ALIPYW---GRGRARHEVDLDPESLRRWNQLMKIDSGEGEEEDGKR--KEWWQREKITFN 650
Query: 190 GRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKST 249
GRI +FT+R H LGDRRF+ WKGSVVDSVVGV+LTQNVSD+LSS+AYMSL FP++ST
Sbjct: 651 GRIDAFTSRSHQFLGDRRFRQWKGSVVDSVVGVFLTQNVSDHLSSAAYMSLVENFPVQST 710
Query: 250 NDHTPSDENLRTTASLEPIGSNSTSNGAVYDSEGNMYFVTE-PEPD 294
++ S T +PI S+ TS GA YD++GN YFVTE PEPD
Sbjct: 711 SNQQAS----MTIDGQKPIESSMTSIGATYDADGNRYFVTEVPEPD 752
>gi|308803847|ref|XP_003079236.1| DEMETER protein (ISS) [Ostreococcus tauri]
gi|116057691|emb|CAL53894.1| DEMETER protein (ISS), partial [Ostreococcus tauri]
Length = 856
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 67/138 (48%), Positives = 88/138 (63%), Gaps = 3/138 (2%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++ G + LE+LR P D+ +EYLL ++G+G+K+ CV LL+L FPVDVNVGRI
Sbjct: 243 RVMAQRGCLSLEFLREAPTDEAREYLLALDGMGVKTTSCVLLLALHRTDFPVDVNVGRIM 302
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RLGWVPL+ + + L +Y + +L RL LYELHY MIT GK+FC
Sbjct: 303 ARLGWVPLE---SETALEELAQYAPEPAVYTFLRKRLNSFGIDMLYELHYHMITLGKVFC 359
Query: 745 KKRSPNCGACPLRGECKH 762
KR PNCGACPLR C++
Sbjct: 360 GKRLPNCGACPLRDICEY 377
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 52/94 (55%)
Query: 990 TILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATT 1049
IL PC ++ +FPL+GTYFQ NEVF D ++S P+ + + + RR V G+S +
Sbjct: 617 AILTPCASSMGNKFPLHGTYFQTNEVFLDEKSSVVPLKILKGALVKCRRVRVLVGTSIGS 676
Query: 1050 LFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPL 1083
+ + Q+ F +CVR ++R + P PL
Sbjct: 677 ITRGMSRAQVTAAFASQRICVRSWNRVTKQPGPL 710
>gi|357452085|ref|XP_003596319.1| Repressor of silencing [Medicago truncatula]
gi|355485367|gb|AES66570.1| Repressor of silencing [Medicago truncatula]
Length = 684
Score = 144 bits (362), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 69/122 (56%), Positives = 83/122 (68%), Gaps = 19/122 (15%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
+I DAIKERG N +A R++ FL RLV+ HGSIDLEWLR+VPPD+ KEYL
Sbjct: 546 KIVDAIKERGMNNKLAKRVQNFLIRLVDDHGSIDLEWLRDVPPDQTKEYL---------- 595
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
IAFP+D NVGRIAVRLGWVPL PLP L +HLL+ YP+++ IQ +LW R
Sbjct: 596 ---------SSIAFPIDTNVGRIAVRLGWVPLHPLPESLQLHLLELYPLVNSIQKHLWAR 646
Query: 721 LC 722
LC
Sbjct: 647 LC 648
Score = 103 bits (256), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 60/135 (44%), Positives = 80/135 (59%), Gaps = 18/135 (13%)
Query: 557 KKNSDNTVQQDWDLLR---RIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQN 613
+K++ Q D D LR + + E++ + MDSVDW+ VRC D +IADAIKERG N
Sbjct: 91 RKSNAGKEQFDKDSLRLNAQAKAGKREKTENTMDSVDWDDVRCVDCKKIADAIKERGINN 150
Query: 614 IIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
+A I+ FL RLV+ HG IDLE + LGLKS +CV+LL+L A
Sbjct: 151 KLAKHIQNFLIRLVDDHGRIDLEC---------------VAELGLKSAKCVQLLTLHQPA 195
Query: 674 FPVDVNVGRIAVRLG 688
FPV N+GRI++ LG
Sbjct: 196 FPVATNIGRISIHLG 210
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 95/179 (53%), Gaps = 28/179 (15%)
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+L I+ L ++EC+ + S++ + + I + GWVPLQPLP L +HLL+ P+
Sbjct: 307 MLVIQPLRAYALECLVIQSVEGSCPGMFI----IQIVGGWVPLQPLPESLQLHLLELNPL 362
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
++ IQ +LW RL L Y I+ G ++ KK NC ACP+R +C+HFASA AS
Sbjct: 363 VNFIQKHLWARLSTLP--------YDYISKGLMYQKK--TNCNACPMRADCRHFASAFAS 412
Query: 770 ARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESGYQINNCEPIIEE 828
P P + I S GN + P VV +S+ +++N C+PIIEE
Sbjct: 413 TTLMFPRPEQCFIGAS--GNSVTDKIPPVV------------MSQLHFKVNICQPIIEE 457
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/66 (62%), Positives = 52/66 (78%)
Query: 183 KEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAA 242
+E ++ GR +SF ARMH I G +F WKGS+VD VVGV+LTQ+VSD+LS SA+MSLAA
Sbjct: 456 EECKLLRGRAESFIARMHPIQGYIQFSRWKGSIVDLVVGVFLTQHVSDHLSCSAFMSLAA 515
Query: 243 RFPLKS 248
+FP KS
Sbjct: 516 QFPKKS 521
>gi|357445861|ref|XP_003593208.1| Transcriptional activator DEMETER [Medicago truncatula]
gi|355482256|gb|AES63459.1| Transcriptional activator DEMETER [Medicago truncatula]
Length = 251
Score = 144 bits (362), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 95/235 (40%), Positives = 126/235 (53%), Gaps = 23/235 (9%)
Query: 822 CEPIIEEPRSPGPQCSESPESEIDDDDFSIG---DIEEIPTLRL--QEREFKENFPN-FM 875
CEPI+E P SP + +E + E D+ + DIE++ TL L + F + N F
Sbjct: 30 CEPIVEMPASPELESTELIDDEKMYYDYVVENDEDIEDMMTLNLSIESSCFSKICDNSFQ 89
Query: 876 EMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELLRGFDRRDPDD 935
E + M +S LVAL A K+K + L+TE VYVL D H LL + R+ DD
Sbjct: 90 EFDHDMTAPTS-LVALHPNATINRLSKMKNASRLKTERLVYVLTDNHPLLTEYPPREHDD 148
Query: 936 PSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAGIVRGTILIPC 995
PSPYLL IW P E +S S + + ++S V GT+LIPC
Sbjct: 149 PSPYLLVIWLPDELESSDESSKTDLHEEESS----------------QTKTVLGTLLIPC 192
Query: 996 RTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTL 1050
RTA + FPLNGTYFQVNEVFAD+ + PINVPR I +L + I FG+ +T+
Sbjct: 193 RTAMKACFPLNGTYFQVNEVFADYASMIQPINVPRQWIWSLEKRITYFGTGTSTI 247
>gi|218194662|gb|EEC77089.1| hypothetical protein OsI_15494 [Oryza sativa Indica Group]
Length = 1253
Score = 134 bits (337), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 144/539 (26%), Positives = 209/539 (38%), Gaps = 153/539 (28%)
Query: 578 GEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEW 637
G R D D VDWEA+ A +++ D IK+RGQ + +A RI FL R+ HG+IDLEW
Sbjct: 485 GYIRDDDIPDMVDWEALMNAPFADVVDCIKDRGQHSQMAFRILAFLIRMKRDHGNIDLEW 544
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
LR +P K R+ + A V+ GR +Q +PG
Sbjct: 545 LRFIPRAKANYK--------------ARMGATPSFAIISGVSSGRQQNSKFSTLIQAMPG 590
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT-LYELHYQMITFGKIFCKKRSPNCGACPL 756
L + R+C +Q+T YELH QMITFGK
Sbjct: 591 YLS-----------------FDRICLTNQRTNRYELHCQMITFGK--------------- 618
Query: 757 RGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESG 816
A ALP S++ + ++ L+ +IE +E
Sbjct: 619 -------------AALALPEYSQQDATKDANMDDPAKTYDLIFKA-HQYQIEYGKNTE-- 662
Query: 817 YQINNCEPIIEEPRSP-----GPQCSESPES--EIDDDDFSIG----DIEEIPTLRLQER 865
+N CEP+IE P +P G E E+ DDD IG D+E+I E
Sbjct: 663 --MNYCEPVIEIPPTPLHENRGETSDEDDENGYYFDDDMEDIGRHDYDMEDIEHDYDMEV 720
Query: 866 EFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPDAHELL 925
+ + P ++ ++ + A S P + + LRTE+ ++PD H +L
Sbjct: 721 DLRSAKPT-TNTSQAGATPGKEMIPINPRAKSTP---MVKKFSLRTEYTACIIPDGHIIL 776
Query: 926 RGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSCNSVCERNAG 985
+ FD R P D +PYLL S E
Sbjct: 777 KKFDPRVPGDRNPYLLVFRSFDEH------------------------------------ 800
Query: 986 IVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE--IVCF 1043
V+ TIL VFADH +S PI + R + LRR+ IV F
Sbjct: 801 TVKATIL----------------------VFADHSSSRSPIEINRDLVWELRRQTCIVHF 838
Query: 1044 GSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRSR 1102
G+ ++ +G++C R FDRR + P+ L +H + K+ G KR+R
Sbjct: 839 GTRVHSVT-------------KGYICTREFDRRTKFPKQLCVEIHATNVNKDIGKKRAR 884
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/79 (53%), Positives = 55/79 (69%)
Query: 181 WQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSL 240
W + R FE ++ F M I+GDR F PW GSVV SVVG +LTQNVSDNLSS+A+M++
Sbjct: 267 WNERRREFERKVDHFMYNMRSIIGDRNFSPWGGSVVTSVVGTFLTQNVSDNLSSNAFMTI 326
Query: 241 AARFPLKSTNDHTPSDENL 259
AARFPLK+ + +N+
Sbjct: 327 AARFPLKNRRNAGHHSDNV 345
>gi|38346441|emb|CAE04435.2| OSJNBa0018J19.2 [Oryza sativa Japonica Group]
gi|116309571|emb|CAH66631.1| OSIGBa0105P02.5 [Oryza sativa Indica Group]
Length = 798
Score = 129 bits (325), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 108/352 (30%), Positives = 154/352 (43%), Gaps = 52/352 (14%)
Query: 578 GEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEW 637
G R D D VDWEA+ A +++ D IK+RGQ + +A RI FL R+ HG+IDLEW
Sbjct: 482 GYIRDDDIPDMVDWEALMNAPFADVVDCIKDRGQHSQMAFRILAFLIRMKRDHGNIDLEW 541
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
LR +P K R+ + A V+ GR +Q +PG
Sbjct: 542 LRFIPRAKANYK--------------ARMGATPSFAIISGVSSGRQQNSKFSTLIQAMPG 587
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT-LYELHYQMITFGKIFCKKRSPNCGACPL 756
L + R+C +Q+T YELH QMITFGK C+K PNCGACP
Sbjct: 588 YLS-----------------FDRICLTNQRTNRYELHCQMITFGKAICRKSKPNCGACPF 630
Query: 757 RGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIRIEADPVSESG 816
ECK++ S A ALP S++ + ++ L+ +IE +E
Sbjct: 631 TSECKYYKSQFGRAALALPEYSQQDATKDANMDDPAKTYDLIFK-AHQYQIEYGKNTE-- 687
Query: 817 YQINNCEPIIEEPRSP-----GPQCSESPES--EIDDDDFSIG----DIEEIPTLRLQER 865
+N CEP+IE P +P G E E+ DDD IG D+E+I E
Sbjct: 688 --MNYCEPVIEIPPTPLHENRGETSDEDDENGYYFDDDMEDIGRHDYDMEDIEHDYDMEV 745
Query: 866 EFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHVYV 917
+ + P ++ ++ + A S P + + LRTE+ Y
Sbjct: 746 DLRSAKPT-TNTSQAGATPGKEMIPINPRAKSTP---MVKKFSLRTEYTAYA 793
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 51/68 (75%)
Query: 181 WQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSL 240
W + R FE ++ F M I+GDR F PW GSVV SVVG +LTQNVSDNLSS+A+M++
Sbjct: 264 WNERRREFERKVDHFMYNMRSIIGDRNFSPWGGSVVTSVVGTFLTQNVSDNLSSNAFMTI 323
Query: 241 AARFPLKS 248
AARFPLK+
Sbjct: 324 AARFPLKN 331
>gi|413949116|gb|AFW81765.1| hypothetical protein ZEAMMB73_889003 [Zea mays]
Length = 107
Score = 119 bits (299), Expect = 8e-24, Method: Composition-based stats.
Identities = 53/88 (60%), Positives = 64/88 (72%)
Query: 1001 GRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLFNALDMRQIQ 1060
G FPLNGTYFQVNEVF++H +SH+PI+V R + NL+R +V FG+S T+F L +IQ
Sbjct: 3 GSFPLNGTYFQVNEVFSNHRSSHNPIHVKREQLWNLQRCMVFFGTSVPTIFRGLTTEEIQ 62
Query: 1061 EYFWRGFVCVRGFDRRYRCPRPLVNRLH 1088
FWRG VCVRGFDR R PRPL LH
Sbjct: 63 HCFWRGVVCVRGFDRETRAPRPLCPHLH 90
>gi|302856195|ref|XP_002959522.1| hypothetical protein VOLCADRAFT_36766 [Volvox carteri f.
nagariensis]
gi|300254966|gb|EFJ39413.1| hypothetical protein VOLCADRAFT_36766 [Volvox carteri f.
nagariensis]
Length = 108
Score = 115 bits (287), Expect = 2e-22, Method: Composition-based stats.
Identities = 53/107 (49%), Positives = 72/107 (67%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D VDWEAVR A + +AD I+ RG ++A RIK FLN++ +I LEWLRN ++
Sbjct: 2 DIVDWEAVRTAPDELLADLIRCRGMHVMLARRIKAFLNQVRTGRSTISLEWLRNANVEEA 61
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
YL+ +EGLG KSV C+ LL+L FPVD+NV R+ RLGW+P++
Sbjct: 62 TNYLMAVEGLGRKSVACIVLLALHGKEFPVDINVARVFARLGWIPIE 108
>gi|145346471|ref|XP_001417710.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577938|gb|ABO96003.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 107
Score = 108 bits (270), Expect = 2e-20, Method: Composition-based stats.
Identities = 53/107 (49%), Positives = 68/107 (63%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D VDW AV A EI I+ RG ++A RIK L R++ G + LE+LR+ P
Sbjct: 1 DQVDWYAVLDAPLVEIVKCIRCRGMHWMLARRIKGILKRVMAQRGCLSLEFLRDTPTRDA 60
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
EYLL ++G+G+K+ CV LL+L FPVDVNVGRI RLGWVPL+
Sbjct: 61 NEYLLALDGMGVKTTSCVLLLALHRTDFPVDVNVGRIMARLGWVPLE 107
>gi|303277163|ref|XP_003057875.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460532|gb|EEH57826.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 108
Score = 107 bits (268), Expect = 3e-20, Method: Composition-based stats.
Identities = 52/108 (48%), Positives = 72/108 (66%), Gaps = 1/108 (0%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDK 645
D +DW AV A ++ + I+ RG ++A RIK L R+ E G++ LE+LR+VP +
Sbjct: 1 DLIDWNAVLKAPIEDVVECIRCRGMHFMLARRIKNLLRRIASERDGALSLEFLRDVPTEL 60
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
+ YLL +EG G+K+V C+ LL+L FPVDVNVGRI RLGWVPL+
Sbjct: 61 ARGYLLSLEGYGVKTVSCILLLALYRADFPVDVNVGRIMARLGWVPLE 108
>gi|86605550|ref|YP_474313.1| base excision DNA repair protein [Synechococcus sp. JA-3-3Ab]
gi|86554092|gb|ABC99050.1| base excision DNA repair protein, HhH-GPD family [Synechococcus sp.
JA-3-3Ab]
Length = 222
Score = 105 bits (262), Expect = 2e-19, Method: Composition-based stats.
Identities = 63/170 (37%), Positives = 84/170 (49%), Gaps = 18/170 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WEAV AD +++A I+ G + A RI+E L +V+ GS+ L++LR++ YL
Sbjct: 63 WEAVLAADPTKLAQVIRPGGLAQLKAARIQEILAAIVKRQGSLSLDFLRDLDDADALAYL 122
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L ++G+GLK+ CV L L PVD +V R+A RLG V + P D L P
Sbjct: 123 LSLKGVGLKTAACVLLFGLGRDLCPVDTHVHRVANRLGLVRARH-PDDTFAQLSPRIPA- 180
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
Y LH +I GK CK R P CG CPLR EC
Sbjct: 181 ----------------GKAYSLHVNLIRLGKRICKARMPECGRCPLRHEC 214
>gi|440791540|gb|ELR12778.1| base excision DNA repair protein, HhHGPD subfamily protein
[Acanthamoeba castellanii str. Neff]
Length = 494
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
DWE VR A S++AD I+ G G I+ LN+L E G + LE LR++P ++VK
Sbjct: 228 DWERVRTAKPSQLADVIRTGGLAQRKGGIIQGLLNQLHEERDGELSLEHLRDMPTEEVKH 287
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L +G+G K+ CV + +++ FPVD +V R++ RLG+V
Sbjct: 288 ALTKYKGIGPKTASCVLMFNMKRADFPVDTHVHRVSTRLGFVR----------------- 330
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ Q Y + D+ ++H +I GK CK RSP CG C LR +C++
Sbjct: 331 GTTREQTYEFMNATMPDE-IKRDMHVLLIRHGKQVCKARSPKCGECVLRDDCRY 383
>gi|302837608|ref|XP_002950363.1| hypothetical protein VOLCADRAFT_90672 [Volvox carteri f.
nagariensis]
gi|300264368|gb|EFJ48564.1| hypothetical protein VOLCADRAFT_90672 [Volvox carteri f.
nagariensis]
Length = 1442
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 63/97 (64%), Gaps = 3/97 (3%)
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
++ YL+ +EGLG KSV C+ LL+L FPVD+NV R+ RLGW+P++ + + L
Sbjct: 25 LRSYLMAVEGLGRKSVACIVLLALHGKEFPVDINVARVFARLGWIPIE---AEATLEQLD 81
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
YP ++ YL RL + D +TLYELHYQ IT GK+
Sbjct: 82 SYPQEPEVHKYLRSRLMHFDLETLYELHYQSITLGKL 118
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 68/167 (40%), Gaps = 49/167 (29%)
Query: 987 VRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHP----------INVPRHTIANL 1036
+R +L+PCR A RG FPLNGTYFQ NEVF T+ P I V +A L
Sbjct: 1269 LRAAVLMPCRVALRGGFPLNGTYFQTNEVFLVEHTAQQPLRLRLGQLLGIGV-EAGLAAL 1327
Query: 1037 R------------------------------REI------VCFGSSATTLFNALDMRQIQ 1060
RE+ V FG + +++ + ++
Sbjct: 1328 EPPSPSSLPTPSLPQNDAVEMAAAFGPTIVPREVRHECTAVYFGHATSSITRDMTAAEVA 1387
Query: 1061 EYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKK--EAGSKRSRDES 1105
E F G VCVR F PRP+ L + + AG++RS E+
Sbjct: 1388 ELFEEGAVCVRAFCLFTGNPRPIPAFLGAPSAGRGPRAGARRSAIEA 1434
>gi|86609598|ref|YP_478360.1| base excision DNA repair protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86558140|gb|ABD03097.1| base excision DNA repair protein, HhH-GPD family [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 212
Score = 101 bits (252), Expect = 2e-18, Method: Composition-based stats.
Identities = 60/170 (35%), Positives = 85/170 (50%), Gaps = 18/170 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WEAV AD +++A I+ G ++ A RI+E L + E GS+ L++LR++ ++L
Sbjct: 55 WEAVLAADPADLAQVIRSGGLAHLKAARIQEILAAIQERQGSLSLDFLRDLDDAGALDFL 114
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L ++G+GLK+ CV L L PVD +V R+A RLG V + P D L + P
Sbjct: 115 LSLKGVGLKTATCVLLFGLGRDLCPVDTHVHRVANRLGLVRAKH-PDDTFAQLSPQIP-- 171
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ Y LH + GK CK R P CG CPLR C
Sbjct: 172 ---------------RGKAYSLHVNFVRLGKRICKARMPECGRCPLRPTC 206
>gi|365873799|ref|ZP_09413332.1| putative endoIII-related endonuclease [Thermanaerovibrio velox DSM
12556]
gi|363983886|gb|EHM10093.1| putative endoIII-related endonuclease [Thermanaerovibrio velox DSM
12556]
Length = 242
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 86/170 (50%), Gaps = 17/170 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W +V A E+A+AIK G N A RI+E L + G +E +++ PPD+V+E+L
Sbjct: 83 WGSVAVAPVEEVAEAIKVAGLGNSKAARIREILEVVRGRFGGYTMEPIKHWPPDRVREFL 142
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L + G+G K+ CV L L AFPVD +V R++ RLG+ PL P D+ + L +
Sbjct: 143 LSLPGVGAKTTACVMLFDLGIPAFPVDTHVARLSSRLGFAPLGMEPKDIQVVLEG----L 198
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
K + YL H MI G+ C+ R PNC C + C
Sbjct: 199 LKPERYLGA-------------HVNMICHGRAVCRARRPNCKGCVVTSLC 235
>gi|312880248|ref|ZP_07740048.1| DNA-(apurinic or apyrimidinic site) lyase [Aminomonas paucivorans
DSM 12260]
gi|310783539|gb|EFQ23937.1| DNA-(apurinic or apyrimidinic site) lyase [Aminomonas paucivorans
DSM 12260]
Length = 232
Score = 100 bits (248), Expect = 6e-18, Method: Composition-based stats.
Identities = 61/171 (35%), Positives = 84/171 (49%), Gaps = 17/171 (9%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DWE+V AD E+ +AI+ G N I+ L RL E G+ L LR+ PD+ +EY
Sbjct: 69 DWESVALADPGEVQEAIRIAGLANAKGPSIQAILERLREDWGAPTLVPLRSWKPDRAREY 128
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L + G+G K+ CV + L AFPVD +V RI RLGWVP P + +++E
Sbjct: 129 LEALPGVGPKTAACVMVFDLGFPAFPVDTHVARICRRLGWVPANLPPHRIQ-RVMEETVA 187
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
++ Q H +I G+ C+ RSP C AC L G C
Sbjct: 188 RERFQ----------------GAHLNLIAHGRAVCRARSPRCPACVLVGVC 222
>gi|390556756|ref|ZP_10243163.1| DNA-(Apurinic or apyrimidinic site) lyase [Nitrolancetus
hollandicus Lb]
gi|390174670|emb|CCF82450.1| DNA-(Apurinic or apyrimidinic site) lyase [Nitrolancetus
hollandicus Lb]
Length = 247
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 71/218 (32%), Positives = 104/218 (47%), Gaps = 36/218 (16%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+ SD ++ + LRR Y T WEAVR A +++ADAI+ G A
Sbjct: 53 QHTSDVNSERAFQDLRRRYPT-------------WEAVRTAPVADVADAIRSGGLGARKA 99
Query: 617 GRIKEFLNRLVELHGSIDLEW---LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
RI+ L+R++ HG+ D EW L+ +P + K L+ + G+G K+ CV L + A
Sbjct: 100 PRIQAALDRILS-HGTED-EWSRALKTLPLAEAKARLMALPGVGPKTAACVLLFACGRPA 157
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
PVD +V R++ RLG + +P+ H D+I+ L P + +Y H
Sbjct: 158 LPVDTHVYRVSKRLGLI--EPVVTAEQAH--------DQIEKLLNP-------EDVYSFH 200
Query: 734 YQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASAR 771
MI G+ C R P C CPLR C FA A +R
Sbjct: 201 LNMIAHGRQVCTARKPRCEWCPLRSRCA-FAQGKAGSR 237
>gi|269925878|ref|YP_003322501.1| HhH-GPD family protein [Thermobaculum terrenum ATCC BAA-798]
gi|269789538|gb|ACZ41679.1| HhH-GPD family protein [Thermobaculum terrenum ATCC BAA-798]
Length = 236
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 580 ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLR 639
+R +D + DW+ V A E+A IK G NI A RI+ L + GS DL++L+
Sbjct: 60 DRLYDHFNG-DWKRVMEAPTEEVAQLIKSAGLSNIKAPRIQSVLKEIWSRLGSFDLQFLK 118
Query: 640 NVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDL 699
++P + KE+L I G+G K+ CV + SL PVD +V R+++RLG +P +
Sbjct: 119 DMPINAAKEWLTSINGVGPKTAACVLMFSLGLPVMPVDTHVHRVSLRLGLIPPKTNADRA 178
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
H D + + P + Y H +I G+ CK P C CPL
Sbjct: 179 H----------DILAQIVSP-------ERAYPFHINLIRHGRRVCKAPVPKCTICPLTCL 221
Query: 760 CKHFASAVASAR 771
C +F S R
Sbjct: 222 CVYFHQNSLSIR 233
>gi|392407188|ref|YP_006443796.1| endoIII-related endonuclease [Anaerobaculum mobile DSM 13181]
gi|390620324|gb|AFM21471.1| putative endoIII-related endonuclease [Anaerobaculum mobile DSM
13181]
Length = 253
Score = 93.2 bits (230), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 79/166 (47%), Gaps = 17/166 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V A EIA I+ G N+ A RIKE L + + G L LR + + E+L
Sbjct: 88 WDEVLAATHEEIASVIRPAGLSNVKAMRIKEILKIVSDRFGDCSLRDLRRMKKGDIVEFL 147
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G K+V CV L L AFPVD +VGR+ R+GWV + P P +
Sbjct: 148 SSLPGVGPKTVACVLLFDLGLPAFPVDTHVGRLCKRIGWVSAKCSP-----------PEI 196
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
KI + P Y + H +I+ G+ C R P CG CPL
Sbjct: 197 QKIMESIIPEELY------WSAHLDLISHGRNICLARRPQCGECPL 236
>gi|357419910|ref|YP_004932902.1| HhH-GPD family protein [Thermovirga lienii DSM 17291]
gi|355397376|gb|AER66805.1| HhH-GPD family protein [Thermovirga lienii DSM 17291]
Length = 244
Score = 92.8 bits (229), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 89/179 (49%), Gaps = 17/179 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V A E+ADAI+ G N+ A RIK+ L + + GS L+ L+ + +++ +L
Sbjct: 83 WQQVIDAKVEEVADAIRSAGIANVKAQRIKKILATVNGVWGSCSLKGLKRLEKEEIISFL 142
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G K++ CV L L AFPVD +V R++ RLG+V LP D+
Sbjct: 143 SSLPGVGPKTIACVLLFDLGIPAFPVDTHVARLSKRLGFVGGNTLPEDI----------- 191
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
+KI L P +L H MI G+ CK +P C C L C+H S A+
Sbjct: 192 EKILEELVPEERFLGG------HLNMIAHGRAICKALNPRCQLCSLAHLCEHLNSKGAA 244
>gi|289523004|ref|ZP_06439858.1| endonuclease III [Anaerobaculum hydrogeniformans ATCC BAA-1850]
gi|289503547|gb|EFD24711.1| endonuclease III [Anaerobaculum hydrogeniformans ATCC BAA-1850]
Length = 255
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 81/166 (48%), Gaps = 17/166 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V E+E+A+ IK G NI A RIK L + E GS L+ L+ + +++ ++L
Sbjct: 88 WEDVLVTAETELAEVIKPAGLSNIKASRIKSVLGLITERFGSCSLKPLKGMKKEEIIDFL 147
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G K+V CV L L AFPVD +V R+ R+GWV + P +
Sbjct: 148 SSLPGVGPKTVACVLLFDLGIPAFPVDTHVNRLCKRIGWVSPKSTPEETQ---------- 197
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
KI + P Y + H +I+ G+ C R P C CPL
Sbjct: 198 -KIMGSVIPSDLY------WSAHLDIISHGRNICVSRRPKCTICPL 236
>gi|282856173|ref|ZP_06265456.1| endonuclease III (DNA-(apurinic or apyrimidinic site)lyase)
[Pyramidobacter piscolens W5455]
gi|282585932|gb|EFB91217.1| endonuclease III (DNA-(apurinic or apyrimidinic site)lyase)
[Pyramidobacter piscolens W5455]
Length = 234
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 77/170 (45%), Gaps = 17/170 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V + IA+AIK G N A I L + E G L+ L++ P +++
Sbjct: 77 WDRVAALPQDRIAEAIKPAGICNNKAATILRVLKTVKERFGEYSLKRLKSGTPAAAWDFM 136
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
I+G+G K+ CV L AFPVD +V RI+ RLGW + P ++ L + P
Sbjct: 137 THIQGVGPKTAACVLAFDLGFPAFPVDTHVARISKRLGWADAKETPAEIQARLERLVPDG 196
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
K H +I GK CK R+P CG CPLRG C
Sbjct: 197 MKCAG-----------------HLDVIQHGKNICKARAPKCGECPLRGIC 229
>gi|320159427|ref|YP_004172651.1| putative DNA glycosylase [Anaerolinea thermophila UNI-1]
gi|319993280|dbj|BAJ62051.1| putative DNA glycosylase [Anaerolinea thermophila UNI-1]
Length = 237
Score = 90.1 bits (222), Expect = 6e-15, Method: Composition-based stats.
Identities = 52/171 (30%), Positives = 84/171 (49%), Gaps = 17/171 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE VR A ++ +AI+ G N R++E L ++ G +DL +L+++P ++ + +L
Sbjct: 67 WEDVRDAPLEQVIEAIRPAGLANQKGARLQEVLRQITAERGGLDLSFLQDLPAEEARTWL 126
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L +G+G+K+ V L SL AFPVD +V R++ R+G P Q D H +L + +
Sbjct: 127 LRFKGVGVKTASIVLLFSLNKPAFPVDTHVHRVSGRIGLRPPQMSAEDAHAYLAQVF--- 183
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ H +I G+ C R P C CPLR C+
Sbjct: 184 --------------TPEQYAAGHLNLIRLGREVCHARKPACPRCPLRAVCE 220
>gi|374850043|dbj|BAL53042.1| HhH-GPD family protein [uncultured Chloroflexi bacterium]
Length = 235
Score = 89.7 bits (221), Expect = 7e-15, Method: Composition-based stats.
Identities = 50/172 (29%), Positives = 88/172 (51%), Gaps = 17/172 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WEAVR A E+ +AI+ G A R+++ L + + G+++L++L ++P ++ + +L
Sbjct: 70 WEAVRDAPLDEVIEAIRLAGLAQQKAPRLQQVLREITQQRGALELDFLADLPLEEARRWL 129
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ +G+G K+ V L SL AFPVD ++ R++ R+G P + + H L +P
Sbjct: 130 MQFKGVGPKTAAIVLLFSLGRPAFPVDTHIYRVSGRIGLRPARMSVEEAHAFLEALFP-- 187
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+T Y +H +I G+ C+ R P C CP+R C +
Sbjct: 188 ---------------PETYYAVHLNLIRLGREVCQARRPQCERCPIREWCDY 224
>gi|325968210|ref|YP_004244402.1| DNA-(apurinic or apyrimidinic site) lyase [Vulcanisaeta moutnovskia
768-28]
gi|323707413|gb|ADY00900.1| DNA-(apurinic or apyrimidinic site) lyase [Vulcanisaeta moutnovskia
768-28]
Length = 231
Score = 89.0 bits (219), Expect = 1e-14, Method: Composition-based stats.
Identities = 53/169 (31%), Positives = 91/169 (53%), Gaps = 18/169 (10%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E +A+AIK G I A +I E L+R++ + DL W++++P D+ ++ LL++ G+G
Sbjct: 76 EDALANAIKPAGMHRIRARKIIE-LSRVILENYRGDLTWIKDLPLDEARKALLELPGVGE 134
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ + + L++L +AFPVD ++ RI++RLG I + Y + + W
Sbjct: 135 KTADVI-LVNLGKLAFPVDTHITRISIRLG------------IAKSRNYHEIQRA----W 177
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
R+ D E+H ++I FG+ C R+P C C R C ++ S V
Sbjct: 178 MRILTPDPSRYLEIHLKLIQFGRDICIARNPRCDMCGFREVCTYYISNV 226
>gi|269792590|ref|YP_003317494.1| DNA-(apurinic or apyrimidinic site) lyase [Thermanaerovibrio
acidaminovorans DSM 6589]
gi|269100225|gb|ACZ19212.1| DNA-(apurinic or apyrimidinic site) lyase [Thermanaerovibrio
acidaminovorans DSM 6589]
Length = 232
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 84/170 (49%), Gaps = 17/170 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE+V A ++ AI+ G A RIKE L ++ E G++ L +R+ D+V+ +L
Sbjct: 75 WESVMEAPVEDLEGAIRVAGLGASKARRIKEVLYKVKETLGTLSLGAMRSWRRDEVEAFL 134
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G K+V CV + L AFPVD +VGR++VR+G P P ++ + L
Sbjct: 135 STLPGVGPKTVACVLVFDLGIPAFPVDTHVGRLSVRMGLAPGGMKPWEIQLRLES----- 189
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+D + H +I G+ CK + P CG CPL G C
Sbjct: 190 ------------LIDPERYLGAHVNLIFHGRRICKAQRPRCGDCPLLGTC 227
>gi|116754221|ref|YP_843339.1| HhH-GPD family protein [Methanosaeta thermophila PT]
gi|116665672|gb|ABK14699.1| HhH-GPD family protein [Methanosaeta thermophila PT]
Length = 219
Score = 86.7 bits (213), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 83/169 (49%), Gaps = 23/169 (13%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A E EIADAI+ G +I A RIK L RL + GS+DL L+ + + + YL I G+
Sbjct: 65 ASEEEIADAIRPGGLADIKAARIKGALERLRDDFGSVDLSPLKRMSAVEARNYLKSIPGI 124
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + L A PVD +V R++ R+G VP E +++ Q
Sbjct: 125 GPKTASVLMLFGFGMSAMPVDTHVYRVSRRMGLVP--------------ENASIEETQRI 170
Query: 717 L---WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L P Y+ LH +I G++ CK R+P C C L+G C++
Sbjct: 171 LEEITPHEKYIS------LHINLIRHGRLVCKARNPLCKKCELKGLCRY 213
>gi|167997017|ref|XP_001751215.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697196|gb|EDQ83532.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 272
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 84/177 (47%), Gaps = 33/177 (18%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V AD ++ DAI+ G A RI L+ + GSI LE++R++ D++K L
Sbjct: 96 WEEVHAADPKKVEDAIRCGGLAETKAKRIINILDTIFTERGSICLEYVRSMNVDQIKAEL 155
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + L+ FPVD +V R++ LGWVP ++H+ P
Sbjct: 156 SRFKGVGPKTVACVLMFHLEQNEFPVDTHVFRLSKMLGWVPASADREKTYLHMNSRVP-- 213
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC-----------GACPL 756
D+++ Y+LH ++T G KR P C G CPL
Sbjct: 214 DEVK---------------YDLHCLLVTHG-----KRCPRCAKGGRAQTAPDGPCPL 250
>gi|288575002|ref|ZP_06393359.1| HhH-GPD family protein [Dethiosulfovibrio peptidovorans DSM 11002]
gi|288570743|gb|EFC92300.1| HhH-GPD family protein [Dethiosulfovibrio peptidovorans DSM 11002]
Length = 238
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 85/183 (46%), Gaps = 19/183 (10%)
Query: 580 ERSHDKMDSVD--WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEW 637
+R+ DK+ ++ WE V E+ADAI+ G N+ AGR+ + L + + G L
Sbjct: 67 DRAFDKLKTLYPLWEDVASVTPDELADAIRVAGIANVKAGRMLDVLKIIHDELGEYGLTG 126
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
L+ D V+ +L + G+G K+ CV + + AFPVD +V R R+ WVP P
Sbjct: 127 LKYRDHDGVRAFLEGLPGVGPKTAACVLVFDMDIPAFPVDTHVARFCRRMEWVPRSATPV 186
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
+ ++ K P D++ H +I+ GK CK R P C CPL
Sbjct: 187 RIQEYMEKIVP----------------DERK-KGAHLNIISHGKSICKARKPICQRCPLI 229
Query: 758 GEC 760
C
Sbjct: 230 DLC 232
>gi|298242385|ref|ZP_06966192.1| HhH-GPD family protein [Ktedonobacter racemifer DSM 44963]
gi|297555439|gb|EFH89303.1| HhH-GPD family protein [Ktedonobacter racemifer DSM 44963]
Length = 245
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 83/177 (46%), Gaps = 15/177 (8%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE VR A E+ + I + A RI+ + ++ E G+++L++L ++P ++ +L
Sbjct: 81 WEGVRDAPTKEVEETIANVNFPEVKAPRIQAIMRQITEERGNLNLDFLCSLPVEEAAAWL 140
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+ CV L S + PVD++V R ++R+G + + + H L P
Sbjct: 141 NRFQGIGPKTTACVLLFSCKMPILPVDIHVHRTSIRIGLIGNKVTADNAHTLLQALLPN- 199
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
D +T+Y H ++ G+ C P C CPL C ++ + V
Sbjct: 200 --------------DARTIYNFHKGLLRLGQRICVYERPRCNQCPLTKLCDYYRTVV 242
>gi|255074589|ref|XP_002500969.1| predicted protein [Micromonas sp. RCC299]
gi|226516232|gb|ACO62227.1| predicted protein, partial [Micromonas sp. RCC299]
Length = 216
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 75/153 (49%), Gaps = 17/153 (11%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE VR A +++ DAI+ G + RI+ LN L E G +E+LR++ D VK L
Sbjct: 81 WEQVRTAKPAKVEDAIRCGGLAEVKVSRIQVILNTLKEERGECSMEYLRDMSDDDVKAEL 140
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + L+ FPVD +V +IA LGW+P D + HL + P
Sbjct: 141 SRFKGVGPKTVSCVLMFCLKRPDFPVDTHVWKIAKDLGWIPKGAGREDAYEHLNRRVP-- 198
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
++LH ++ GK++
Sbjct: 199 ---------------DDCKFDLHVLLVEHGKVY 216
>gi|386855256|ref|YP_006259433.1| Endonuclease III [Deinococcus gobiensis I-0]
gi|379998785|gb|AFD23975.1| Endonuclease III [Deinococcus gobiensis I-0]
Length = 223
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 88/192 (45%), Gaps = 16/192 (8%)
Query: 572 RRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHG 631
R ++ E H+ + DW + A +A AI+ A RI+E L R+ + G
Sbjct: 32 RTTHADEEAAYHELRELGDWADIIAAPTEVVAHAIRRSNYPESKAPRIQETLRRIRDAPG 91
Query: 632 SIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
DL++LR++P ++L D+ G+G+K+ V L + FPVD +V RI +R+G +P
Sbjct: 92 GYDLDFLRDLPVKDALKWLTDLPGVGVKTASLVLLFNYHRPVFPVDTHVFRINIRVGTIP 151
Query: 692 LQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
+ ++ L D LYELH +++ G+ C +P C
Sbjct: 152 ----------------QMTEQAAHRALLALLPPDPPFLYELHVNLLSHGRQVCTWTAPKC 195
Query: 752 GACPLRGECKHF 763
G C +R C +
Sbjct: 196 GECFIRERCDAY 207
>gi|448684914|ref|ZP_21693001.1| endonuclease III [Haloarcula japonica DSM 6131]
gi|445782845|gb|EMA33686.1| endonuclease III [Haloarcula japonica DSM 6131]
Length = 277
Score = 83.6 bits (205), Expect = 6e-13, Method: Composition-based stats.
Identities = 72/231 (31%), Positives = 101/231 (43%), Gaps = 25/231 (10%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEE-RSHDKMDSVDWEAVRCADESEIADAIKERGQQNII 615
+ SD Q D L R Y EE S DS A+ AD+ E+A+ I G N
Sbjct: 64 QNTSDKASQPAHDELMRRYGGDEEANSEGDTDSDLARALADADQPELAETISSAGLYNQK 123
Query: 616 AGRIKEFLNRLVELHG--SIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHI 672
+ RI R+ E +G S E++R P+ V+ LLD+ G+G K+ +CV L + +
Sbjct: 124 SERIIALAQRICEEYGGESGFDEFVREEDPEAVRSTLLDMNGVGPKTADCVLLFAGGRGG 183
Query: 673 AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYEL 732
FPVD +V RIA R+G P + +L ++ P +K +
Sbjct: 184 VFPVDTHVHRIARRMGLAPADADHETVREYLERDVPA----------------EKCGFG- 226
Query: 733 HYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASARFALPGPSE 779
H MI FG+ +C R P C ACPL G C AS P +E
Sbjct: 227 HTAMIQFGREYCSARKPACLDDPDACPLAGHCDQIGVYPASGEVVDPAEAE 277
>gi|297815970|ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297321706|gb|EFH52127.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 294
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 82/173 (47%), Gaps = 14/173 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+WE V A+ I AI+ G A IK LNRL G + LE+LR + ++VK
Sbjct: 127 NWEDVLAAESKSIESAIRCGGLAPKKAVCIKNILNRLQTERGVLCLEYLRGLSVEEVKTE 186
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L +G+G K+V CV + +LQH FPVD +V IA LGWVP ++HL + P
Sbjct: 187 LSHFKGIGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWVPKTADRNKTYVHLNRRIPD 246
Query: 710 MDKIQ----MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
K +Y +LC +KT+ K K R + CPL G
Sbjct: 247 ELKFDLNCLLYTHGKLCSNCKKTV----------AKPKAKARVASPDECPLVG 289
>gi|195645374|gb|ACG42155.1| ROS1 protein [Zea mays]
gi|219887377|gb|ACL54063.1| unknown [Zea mays]
Length = 276
Score = 83.2 bits (204), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 82/172 (47%), Gaps = 13/172 (7%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V + + DAI+ G A RI+ L + E G I LE+LR + D+VK+ L
Sbjct: 107 WDQVVDEEGKRLEDAIRCGGLAATKAARIRSMLRDVRERRGKICLEYLRELSVDEVKKEL 166
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + LQ FPVD +V RI +GWVP +IHL + P
Sbjct: 167 SRFKGIGPKTVACVLMFYLQKDDFPVDTHVLRITKAMGWVPATASREKAYIHLNNKIP-- 224
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMI--TFGKIFCKKRSPNCGACPLRGEC 760
D ++ D L+ H ++ K+ KR + ACPL G C
Sbjct: 225 DDLK---------FDLNCLFVTHGKLCQSCTKKVGSDKRKSSNSACPLAGYC 267
>gi|325283930|ref|YP_004256471.1| HhH-GPD family protein [Deinococcus proteolyticus MRP]
gi|324315739|gb|ADY26854.1| HhH-GPD family protein [Deinococcus proteolyticus MRP]
Length = 248
Score = 82.8 bits (203), Expect = 9e-13, Method: Composition-based stats.
Identities = 53/175 (30%), Positives = 83/175 (47%), Gaps = 16/175 (9%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW+A+ A ++A AI+ + A RI+ L + E G DL++L +P + +
Sbjct: 79 DWDAIVAAPVEQVAHAIRRSNYPDQKAPRIQATLRAIREKRGGYDLDFLAELPVAEALSW 138
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L D+ G+G+K+ V L + FPVD +V RI R+G +P G+ H
Sbjct: 139 LTDLPGVGVKTASLVLLFNYARPVFPVDTHVHRINTRVGTIPKM---GEQTAH------- 188
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ + L P D LY+LH ++ G+ C +P CG C LR C +A
Sbjct: 189 --RALLTLLPS----DPPLLYDLHVNLLKHGQQVCTWNNPKCGRCILRERCDAYA 237
>gi|15807425|ref|NP_296158.1| endonuclease III [Deinococcus radiodurans R1]
gi|6460251|gb|AAF11977.1|AE002073_7 endonuclease III [Deinococcus radiodurans R1]
Length = 259
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 92/196 (46%), Gaps = 17/196 (8%)
Query: 570 LLRRIYSTGEERSHDKMDSV-DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE 628
L +R EE ++ ++ ++ DW+A+ A +A AI+ A RI+E L R+
Sbjct: 58 LSQRTTHADEEAAYQELRTLGDWDAITLAPTDAVAHAIRRSNYPESKAPRIQETLRRIKA 117
Query: 629 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLG 688
G DL++LR++P ++L D+ G+G+K+ V L + FPVD +V R++ R+G
Sbjct: 118 APGGYDLDFLRDLPVKDALKWLTDLPGVGVKTASLVLLFNYARPVFPVDTHVHRVSTRVG 177
Query: 689 WVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRS 748
+P + ++ L D LYELH ++ G+ C
Sbjct: 178 VIPR----------------MGEQAAHRALLALLPPDPPYLYELHINFLSHGRQVCTWTR 221
Query: 749 PNCGACPLRGECKHFA 764
P CG C LR C +A
Sbjct: 222 PKCGKCILRERCDAYA 237
>gi|224133424|ref|XP_002321564.1| predicted protein [Populus trichocarpa]
gi|222868560|gb|EEF05691.1| predicted protein [Populus trichocarpa]
Length = 309
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 88/184 (47%), Gaps = 36/184 (19%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V A+ I DAI+ G A I+ L+ L+E G + LE+LR++P ++K L
Sbjct: 144 WENVLAAESKFIEDAIRCGGLAPTKAACIRNILSSLMEKKGRLCLEYLRDLPVAEIKAEL 203
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + +LQ FPVD +V IA +GWVP PV
Sbjct: 204 SHFKGIGPKTVACVLMFNLQKDDFPVDTHVFEIAKAIGWVP----------------PVA 247
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQ----MITFGKIFCKKRSPNCG----------ACPL 756
D+ + YL +L+ + EL + + T GK+ C+K + G +CPL
Sbjct: 248 DRNKTYL-----HLNHRIPKELKFDLNCLLYTHGKL-CRKCTKKSGSQQRKETHDDSCPL 301
Query: 757 RGEC 760
C
Sbjct: 302 LNYC 305
>gi|303272857|ref|XP_003055790.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463764|gb|EEH61042.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 249
Score = 82.0 bits (201), Expect = 2e-12, Method: Composition-based stats.
Identities = 48/127 (37%), Positives = 71/127 (55%), Gaps = 2/127 (1%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE +R A +E+ AIK G I RIK LN LVE G+ +E+LR++ D VK L
Sbjct: 114 WETIRAAPSAEVEAAIKCGGLAEIKTSRIKVILNTLVEERGAPCMEYLRDMSDDDVKSEL 173
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K++ CV + L+ FPVD +V +IA+ LGWVP + HL + P
Sbjct: 174 SRFKGVGPKTIACVLVFCLRR--FPVDAHVWKIAMALGWVPKSASRDAAYEHLNRRVPDA 231
Query: 711 DKIQMYL 717
K+ +++
Sbjct: 232 CKLDLHV 238
>gi|378733828|gb|EHY60287.1| endonuclease III [Exophiala dermatitidis NIH/UT8656]
Length = 442
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 81/184 (44%), Gaps = 24/184 (13%)
Query: 581 RSHDKMDSV-----DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL 635
R+ MD V WEA+ ++++ +AIK G + + I LN++ E +G L
Sbjct: 243 RAKLNMDKVYGGSDKWEAIAAGGQAKLQEAIKSGGLSAVKSKVIISILNQVYEKYGVYSL 302
Query: 636 EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL 695
+ L + + +L +G+G K+ CV L LQ +F VD +V RI LGW P
Sbjct: 303 DHLHSASSEDAMREMLSFQGVGPKTASCVLLFCLQRESFAVDTHVWRITGLLGWRPKSAS 362
Query: 696 PGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF--CKKRSPNCGA 753
+ + HL + P DK Y LH ++T GK CK N G
Sbjct: 363 RDETYAHLDSKIPDEDK-----------------YGLHILLVTHGKRCDECKAGGKNLGR 405
Query: 754 CPLR 757
C LR
Sbjct: 406 CELR 409
>gi|413944180|gb|AFW76829.1| ROS1 protein [Zea mays]
Length = 340
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 82/172 (47%), Gaps = 13/172 (7%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V + + DAI+ G A RI+ L + E G I LE+LR + D+VK+ L
Sbjct: 171 WDQVVDEEGKRLEDAIRCGGLAATKAARIRSMLRDVRERRGKICLEYLRELSVDEVKKEL 230
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + LQ FPVD +V RI +GWVP +IHL + P
Sbjct: 231 SRFKGIGPKTVACVLMFYLQKDDFPVDTHVLRITKAMGWVPATASREKAYIHLNNKIP-- 288
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMI--TFGKIFCKKRSPNCGACPLRGEC 760
D ++ D L+ H ++ K+ KR + ACPL G C
Sbjct: 289 DDLK---------FDLNCLFVTHGKLCQSCTKKVGSDKRKSSNSACPLAGYC 331
>gi|326436846|gb|EGD82416.1| hypothetical protein PTSG_03059 [Salpingoeca sp. ATCC 50818]
Length = 495
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 56/101 (55%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V AD + AI+ G + RIK L+ L HG + LE+LR +P VKE L
Sbjct: 271 WEDVHSADVGALEAAIRSAGLAQTKSRRIKSILDTLHAEHGKLSLEYLRELPSHTVKEQL 330
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
+G+G K++ C+ L ++Q VD +V R+A R GWVP
Sbjct: 331 ARFKGVGPKTISCLLLFAMQRPDMAVDTHVFRLAKRAGWVP 371
>gi|384248095|gb|EIE21580.1| DNA glycosylase, partial [Coccomyxa subellipsoidea C-169]
Length = 257
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/178 (33%), Positives = 80/178 (44%), Gaps = 35/178 (19%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V A ++ +AI+ G +I A RIK L L+E G I +E+LR + D++K L
Sbjct: 95 WEDVLRAPPGKMEEAIRMGGLADIKAERIKAILATLLEERGKICMEYLREMSDDEIKAEL 154
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + L FPVD +V RI+ LGWVP + + HL P
Sbjct: 155 F--KGVGKKTVACVLMFCLAREEFPVDTHVWRISKSLGWVPAKATRDAAYEHLNLHVPA- 211
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC-----------GACPLR 757
Y+LH M+ G KR P C G CPLR
Sbjct: 212 ----------------DVRYDLHVLMVEHG-----KRCPRCASNGKPRKESHGDCPLR 248
>gi|242095290|ref|XP_002438135.1| hypothetical protein SORBIDRAFT_10g008590 [Sorghum bicolor]
gi|241916358|gb|EER89502.1| hypothetical protein SORBIDRAFT_10g008590 [Sorghum bicolor]
Length = 279
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/172 (32%), Positives = 81/172 (47%), Gaps = 13/172 (7%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V + + DAI+ G A RI+ L + E G I LE+LR + D+VK+ L
Sbjct: 109 WDQVVDEEGKRLEDAIRCGGLATTKAARIRAMLRDVRERRGKICLEYLRELSVDEVKKEL 168
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + LQ FPVD +V RI +GWVP +IHL + P
Sbjct: 169 SRFKGIGPKTVACVLMFYLQKDDFPVDTHVLRITKAMGWVPATASREKAYIHLNNKIP-- 226
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMI--TFGKIFCKKRSPNCGACPLRGEC 760
D ++ D L+ H ++ K+ K + ACPL G C
Sbjct: 227 DDLK---------FDLNCLFVTHGKLCQSCTKKVVSDKSKNSNAACPLAGYC 269
>gi|18408516|ref|NP_566893.1| HhH-GPD base excision DNA repair protein-related protein
[Arabidopsis thaliana]
gi|332644814|gb|AEE78335.1| HhH-GPD base excision DNA repair protein-related protein
[Arabidopsis thaliana]
Length = 293
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 86/177 (48%), Gaps = 26/177 (14%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V A+ I +AI+ G A IK LNRL G + LE+LR + ++VK L
Sbjct: 128 WDDVLNAESKSIENAIRCGGLAPKKAVCIKNILNRLQNERGRLCLEYLRGLSVEEVKTEL 187
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + +LQH FPVD +V IA LGWVP ++HL ++ P
Sbjct: 188 SHFKGVGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWVPKTADRNKTYVHLNRKIP-- 245
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF--CKKRSPNCGA-------CPLRG 758
D+++ ++L+ + T GKI CKK A CPL G
Sbjct: 246 DELK---------------FDLNCLLYTHGKICSNCKKNVAKPKAKVASPDDCPLVG 287
>gi|294102031|ref|YP_003553889.1| DNA-(apurinic or apyrimidinic site) lyase [Aminobacterium
colombiense DSM 12261]
gi|293617011|gb|ADE57165.1| DNA-(apurinic or apyrimidinic site) lyase [Aminobacterium
colombiense DSM 12261]
Length = 233
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 92/204 (45%), Gaps = 30/204 (14%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+ +D + ++ LR++Y T WE V AD I +AI+ G +I +
Sbjct: 51 QNTNDRNRDRAYESLRQLYPT-------------WEEVAQADTERIKEAIRVAGLSDIKS 97
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPV 676
RIKE L + + GS ++ LR ++ + +L + G+G K+V CV L L + AFPV
Sbjct: 98 KRIKEILVAVQDAFGSYSIKELRKRGREQARAFLFKLPGVGAKTVACVLLFDLGYPAFPV 157
Query: 677 DVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQM 736
D ++ R + R+GW + P ++ L + P + YL H +
Sbjct: 158 DTHIHRFSKRIGWAHDRCKPEEIEGMLEQVVPE----ERYLGG-------------HINI 200
Query: 737 ITFGKIFCKKRSPNCGACPLRGEC 760
IT G+ C R P C C + C
Sbjct: 201 ITHGRNICLARQPRCDKCSVNDLC 224
>gi|330508003|ref|YP_004384431.1| DNA-(apurinic or apyrimidinic site) lyase [Methanosaeta concilii
GP6]
gi|328928811|gb|AEB68613.1| DNA-(apurinic or apyrimidinic site) lyase, putative [Methanosaeta
concilii GP6]
Length = 224
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 85/179 (47%), Gaps = 17/179 (9%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
K D ++E++ A E+AD I+E G NI A RI+E L + G++D+ +L + D
Sbjct: 57 KRDYGNYESLLLAPTEEVADCIREGGLANIKALRIQEVLLSIKRDRGAMDIGFLEGMDKD 116
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ YLLD+ G+G K+ V L + PVD +V R++ RLG VP P
Sbjct: 117 EAMSYLLDLPGVGPKTASIVLLFAFGMPFMPVDTHVFRVSQRLGLVPENLSPE------- 169
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K +++I P CY + H +I G+ C+ R P C L+ C F
Sbjct: 170 KAQKALERI----VPPECY------HSFHLNLIRHGRQICRARGPKHEECALKECCDCF 218
>gi|255540783|ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis]
gi|223550571|gb|EEF52058.1| Endonuclease III, putative [Ricinus communis]
Length = 291
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 72/142 (50%), Gaps = 4/142 (2%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
K D W+ V A+ I +AI+ G A IK LN L+E G I LE+LR++ D
Sbjct: 112 KSDFPTWQDVLAAEPKWIENAIRCGGLAPAKASCIKNILNCLLEKKGKICLEYLRDMSVD 171
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
++K L +G+G K+V CV + LQ FPVD +V IA LGWVP ++HL
Sbjct: 172 EIKAELSQFKGVGPKTVACVLMFHLQQEDFPVDTHVFEIAKALGWVPEVADRNKTYLHLN 231
Query: 705 KEYPVMDKIQ----MYLWPRLC 722
+ P K +Y +LC
Sbjct: 232 QRIPNELKFDLNCLLYTHGKLC 253
>gi|307596439|ref|YP_003902756.1| HhH-GPD family protein [Vulcanisaeta distributa DSM 14429]
gi|307551640|gb|ADN51705.1| HhH-GPD family protein [Vulcanisaeta distributa DSM 14429]
Length = 232
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 86/169 (50%), Gaps = 18/169 (10%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E +A+AIK G I A +I E ++E +G DL W+ + P D+ ++ LL++ G+G
Sbjct: 76 EDALANAIKPAGMHRIRARKIIELSRVILERYGG-DLTWIVDSPLDEARKALLELPGVGE 134
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ + + L++L FPVD ++ RI++RLG V + Y + K W
Sbjct: 135 KTADVI-LVNLGKPTFPVDTHITRISIRLGIVKS------------RNYREIQKA----W 177
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
+ D E+H ++I FG+ C+ R+P C C + C ++ S V
Sbjct: 178 MGILTPDPSRYLEVHLKLIQFGRDVCRARNPRCDMCGFKEVCNYYMSNV 226
>gi|320333147|ref|YP_004169858.1| HhH-GPD family protein [Deinococcus maricopensis DSM 21211]
gi|319754436|gb|ADV66193.1| HhH-GPD family protein [Deinococcus maricopensis DSM 21211]
Length = 258
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 56/189 (29%), Positives = 88/189 (46%), Gaps = 21/189 (11%)
Query: 579 EERSHDKMDSV-DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEW 637
EE ++ ++ + DW+A+ A ++A AI+ + A RI+ L + G+ DL +
Sbjct: 60 EETAYQELVKIGDWDAIAEAPVEQVAHAIRASNYPEVKAPRIQATLKAICAQRGNYDLSF 119
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
L ++P + ++L D+ G+G+K+ V L + FPVD +V RI R+G +P G
Sbjct: 120 LVDLPHEDGLKWLTDLPGVGVKTASLVLLFNFSKPVFPVDTHVHRITTRVGAIPRM---G 176
Query: 698 DLHIH--LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACP 755
+ H LLK L D LYELH ++ G+ C P C C
Sbjct: 177 EAVAHKALLK---------------LLAPDPPFLYELHINLLKHGQQVCTFSRPRCPKCV 221
Query: 756 LRGECKHFA 764
LR C A
Sbjct: 222 LRDLCDAHA 230
>gi|380494482|emb|CCF33119.1| base excision DNA repair protein [Colletotrichum higginsianum]
Length = 385
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 88/200 (44%), Gaps = 29/200 (14%)
Query: 570 LLRRIYS-----TGEERSHDKMDSV-----DWEAVRCADESEIADAIKERGQQNIIAGRI 619
L+R I S T R+ MD+V +WE + S++ +AIK G + + I
Sbjct: 183 LVRTILSQNTSDTNSTRAKRSMDAVYGGSDEWERIVDGGVSKLQEAIKCGGLSQVKSKVI 242
Query: 620 KEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVN 679
LN++ E +GS L+ L N + + L+ +G+G K+ CV L LQ +F VD +
Sbjct: 243 IGILNQVREKYGSYSLDHLFNASNEDAMQELISFQGVGPKTASCVLLFCLQRESFAVDTH 302
Query: 680 VGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITF 739
V RI +GW P + H HL P DK Y LH ++
Sbjct: 303 VWRITGLMGWRPRSASRDETHAHLDVRIPDEDK-----------------YGLHILLVKH 345
Query: 740 GKIF--CKKRSPNCGACPLR 757
GK+ CK + G C LR
Sbjct: 346 GKVCAECKAGGKSVGKCELR 365
>gi|193215044|ref|YP_001996243.1| HhH-GPD family protein [Chloroherpeton thalassium ATCC 35110]
gi|193088521|gb|ACF13796.1| HhH-GPD family protein [Chloroherpeton thalassium ATCC 35110]
Length = 227
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 80/177 (45%), Gaps = 19/177 (10%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+W + A +EIA +I+ G N A RIK L LV G++ L++L + V E+
Sbjct: 68 EWAILLDAPVAEIAKSIEIGGLANQKAQRIKAILQELVRTQGALSLDFLADFSDKAVLEF 127
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L +G+G+K+ CV L L PVD ++ RI RLG +
Sbjct: 128 LTSFKGVGVKTAGCVLLFGLGRDVCPVDTHIHRILNRLGIFSTKH--------------- 172
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
D+ L P ++ Y LH +I GK C R PNC C L +C+ FA A
Sbjct: 173 ADETFAELQP---HIPTGKAYSLHVNLIRHGKRVCCARKPNCQTCMLAEDCE-FARA 225
>gi|413945491|gb|AFW78140.1| hypothetical protein ZEAMMB73_391413 [Zea mays]
Length = 78
Score = 80.5 bits (197), Expect = 6e-12, Method: Composition-based stats.
Identities = 34/52 (65%), Positives = 42/52 (80%)
Query: 1000 RGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGSSATTLF 1051
RG FPLNGTYFQVNEVFADH +SH+PI+V R + NL+R +V FG+S T+F
Sbjct: 2 RGSFPLNGTYFQVNEVFADHRSSHNPIHVEREMLWNLQRRMVFFGTSVPTIF 53
>gi|310800884|gb|EFQ35777.1| base excision DNA repair protein [Glomerella graminicola M1.001]
Length = 389
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 88/200 (44%), Gaps = 29/200 (14%)
Query: 570 LLRRIYS-----TGEERSHDKMDSV-----DWEAVRCADESEIADAIKERGQQNIIAGRI 619
L+R I S T R+ MD+V +WE + ++ +AIK G + + I
Sbjct: 187 LVRTILSQNTSDTNSTRAKRSMDAVYGGSDEWEKIVEGGVHKLQEAIKCGGLSQVKSKVI 246
Query: 620 KEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVN 679
LN++ E +GS L+ L N ++ + L+ +G+G K+ CV L LQ +F VD +
Sbjct: 247 IGILNQVKEKYGSYTLDHLFNATNEEAMQELISFQGVGPKTASCVLLFCLQRESFAVDTH 306
Query: 680 VGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITF 739
V RI LGW P + H HL P DK Y LH ++
Sbjct: 307 VWRITGLLGWRPKSASRDETHAHLDVRIPDEDK-----------------YGLHILLVKH 349
Query: 740 GKIF--CKKRSPNCGACPLR 757
GK+ CK + G C LR
Sbjct: 350 GKVCDECKAGGKSIGKCELR 369
>gi|307108459|gb|EFN56699.1| hypothetical protein CHLNCDRAFT_11543, partial [Chlorella
variabilis]
Length = 185
Score = 80.1 bits (196), Expect = 6e-12, Method: Composition-based stats.
Identities = 55/172 (31%), Positives = 79/172 (45%), Gaps = 25/172 (14%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE VR A + +ADAI+ G +I AGRI+ L+ L G LE LR +P K L
Sbjct: 38 WEGVRTAPLAAVADAIRVGGLADIKAGRIQAILSTLAAERGECSLEHLRGLPAAAAKAEL 97
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+ CV L +L+ F VD +V I+ LGWVP+ + H+ + P
Sbjct: 98 SRFKGVGKKTAACVLLFALELPEFAVDTHVWEISKALGWVPVNATRDQAYDHMNELVP-- 155
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
D+++ Y+LH ++ GK C AC G K
Sbjct: 156 DELK---------------YDLHVLLVRHGK--------QCPACAKAGSAKQ 184
>gi|258406277|ref|YP_003199019.1| HhH-GPD family protein [Desulfohalobium retbaense DSM 5692]
gi|257798504|gb|ACV69441.1| HhH-GPD family protein [Desulfohalobium retbaense DSM 5692]
Length = 218
Score = 80.1 bits (196), Expect = 6e-12, Method: Composition-based stats.
Identities = 49/159 (30%), Positives = 72/159 (45%), Gaps = 16/159 (10%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
D E+A I+ G + AGRIK FL R E D L+ +P +++E LL + G+G
Sbjct: 70 DTEELAALIRPAGYYRVKAGRIKNFL-RFFEHEAGFDFTVLQALPTPEIRERLLGVNGIG 128
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
++ + + L +L F VD RI R G +P + DL + + P
Sbjct: 129 PETADSIALYALDKPTFVVDTYTARIFGRHGQIPEEISYADLQAYFTEALPE-------- 180
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
D E H Q++ GK +CKK+ P C CPL
Sbjct: 181 -------DTALFNEFHAQIVRVGKHWCKKKQPQCHRCPL 212
>gi|357477261|ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula]
gi|355509971|gb|AES91113.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula]
Length = 280
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 83/164 (50%), Gaps = 11/164 (6%)
Query: 570 LLRRIYS--TGEERSHDKMDSVD-----WEAVRCADESEIADAIKERGQQNIIAGRIKEF 622
L+R I S T E S+ S+ WE V A+ E+ +AI+ G A IK
Sbjct: 87 LVRTILSQNTTEANSNKAFASLKSLFPTWEHVHGAESKELENAIRCGGLAPTKAKCIKNL 146
Query: 623 LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 682
L+ L+E G + LE+LR++ D+VK L +G+G K+V CV + +LQ FPVD ++
Sbjct: 147 LSCLLERKGKMCLEYLRDLSVDEVKAELSLFKGIGPKTVSCVLMFNLQLDDFPVDTHIFE 206
Query: 683 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ----MYLWPRLC 722
IA +GWVP ++HL + P K +Y +LC
Sbjct: 207 IAKTMGWVPAAADRNKTYLHLNQRIPDELKFDLNCLLYTHGKLC 250
>gi|383762204|ref|YP_005441186.1| putative DNA glycosylase [Caldilinea aerophila DSM 14535 = NBRC
104270]
gi|381382472|dbj|BAL99288.1| putative DNA glycosylase [Caldilinea aerophila DSM 14535 = NBRC
104270]
Length = 239
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 77/174 (44%), Gaps = 17/174 (9%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW+AVR A I +AI+ G N A I L R+ GS DL L P ++ Y
Sbjct: 69 DWDAVRTAPLDAIKEAIRPAGMYNQKAPAIVATLERIKADWGSYDLRPLAQKPVEEAFTY 128
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L + G+G K+ V L AFPVD ++ RI+ R+G + Y +
Sbjct: 129 LTSLPGVGHKTASIVILFCFNGAAFPVDTHIQRISQRVG----------ISRRRADAYTI 178
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+W L ++ Y LH +I G+ C+ +P C CPL+ C ++
Sbjct: 179 KR-----IWESL--IEPSAFYALHINLIRHGRRTCQALTPRCELCPLQSMCDYY 225
>gi|260655074|ref|ZP_05860562.1| base excision DNA repair protein, HhH-GPD family [Jonquetella
anthropi E3_33 E1]
gi|424844920|ref|ZP_18269531.1| putative endoIII-related endonuclease [Jonquetella anthropi DSM
22815]
gi|260630185|gb|EEX48379.1| base excision DNA repair protein, HhH-GPD family [Jonquetella
anthropi E3_33 E1]
gi|363986358|gb|EHM13188.1| putative endoIII-related endonuclease [Jonquetella anthropi DSM
22815]
Length = 234
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 70/170 (41%), Gaps = 17/170 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W ++E+ I+ G + + I L +L G L LR P + ++
Sbjct: 77 WADAAALSQAELISLIRPAGLCDSKSATIIRVLGAAKDLTGQYSLGLLRKKKPAEAWNFM 136
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
I+G+G+K+ CV + L AFPVD +V R R+GW P + P + + P
Sbjct: 137 TSIKGVGVKTAACVMVFDLGFPAFPVDTHVARFCRRMGWAPEKASPAAIQEMMEGLVPDS 196
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
K H +IT G+ CK R P CG C LRG C
Sbjct: 197 RKAGA-----------------HLNIITHGRRVCKARGPLCGDCLLRGLC 229
>gi|392966115|ref|ZP_10331534.1| HhH-GPD family protein [Fibrisoma limi BUZ 3]
gi|387845179|emb|CCH53580.1| HhH-GPD family protein [Fibrisoma limi BUZ 3]
Length = 255
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 86/194 (44%), Gaps = 19/194 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE VR A ++ DAIK + A I+ L ++ G + +++LR++ + ++L
Sbjct: 66 WEQVRDAPLPDLIDAIKTANYPEVKAPYIQNVLAHIIRERGDVSIDFLRDLSTEAAMKWL 125
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
D+ G+GLK+ + L + + PVD +V R+ R+G + + + K Y ++
Sbjct: 126 TDLPGIGLKTATLLLLFNFRKPVLPVDTHVHRVTQRVGLIGPK-------VSAEKAYAML 178
Query: 711 DKIQMYLWPRLCYL--DQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
L YL D K L+ H G+ C +P C C LR C ++ S
Sbjct: 179 ----------LAYLPQDAKDLFNFHKHFYWHGQRICTWYTPKCHECVLRDMCDYYQSGGT 228
Query: 769 SARFALPGPSEKGI 782
+ P P K +
Sbjct: 229 MMLSSTPVPKSKRV 242
>gi|449015802|dbj|BAM79204.1| similar to endonuclease III [Cyanidioschyzon merolae strain 10D]
Length = 276
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 79/178 (44%), Gaps = 26/178 (14%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL--------VELHGSIDLEWLRNV 641
DW V A EI + I+ G A I++ L L + + LE+LRNV
Sbjct: 101 DWADVLHAPVGEIEETIRIGGLARTKAQHIQDILRELEHNAATTGTDTDWTQGLEFLRNV 160
Query: 642 PPDKVKEYLLDIEGLGLKSVECVRLLSLQH-IAFPVDVNVGRIAVRLGWVPLQPLPGDLH 700
P +V EYL G+G K+ CV + +L+ +FPVD ++ R+A RLGWVP + H
Sbjct: 161 PTAQVMEYLTRYPGVGPKTAACVAMFTLERDDSFPVDTHIFRLAKRLGWVP------NGH 214
Query: 701 IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
+ + + I L Y +H MI G+ C R P C C L G
Sbjct: 215 TREQAQVDLENLIPSTLH-----------YPMHILMIEHGRKVCTARGPRCQHCKLAG 261
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 41/73 (56%), Gaps = 8/73 (10%)
Query: 185 REVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARF 244
R+V++ +Q++ G+R +P SV++++V L+QN +D LS +AY +L +R+
Sbjct: 48 RQVYQALVQTY--------GERNRRPTGRSVLEALVRTILSQNTTDKLSGAAYRALRSRY 99
Query: 245 PLKSTNDHTPSDE 257
+ H P E
Sbjct: 100 KDWADVLHAPVGE 112
>gi|225457138|ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vinifera]
Length = 310
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 84/172 (48%), Gaps = 12/172 (6%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V AD I +AI+ G A IK+ L+ L+E G + LE+LR++ D++K L
Sbjct: 139 WQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLERKGKLCLEYLRDLTVDEIKTEL 198
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + LQ FPVD +V +I +GWVP ++HL + P
Sbjct: 199 SHFKGIGPKTVACVLMFHLQRDDFPVDTHVIQIGKAIGWVPAVADRKKAYLHLNRRIPDE 258
Query: 711 DKIQ----MYLWPRLCY--------LDQKTLYELHYQMITFGKIFCKKRSPN 750
K ++ +LC+ +K +E ++T+ K SPN
Sbjct: 259 LKFDLNCLLFTHGKLCHECTQKGANQKRKESHESSCPLLTYCGDMFKSSSPN 310
>gi|55378639|ref|YP_136489.1| endonuclease III [Haloarcula marismortui ATCC 43049]
gi|55231364|gb|AAV46783.1| endonuclease III [Haloarcula marismortui ATCC 43049]
Length = 278
Score = 77.8 bits (190), Expect = 3e-11, Method: Composition-based stats.
Identities = 66/214 (30%), Positives = 96/214 (44%), Gaps = 28/214 (13%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEE-RSHDKMDSVDW-EAVRCADESEIADAIKERGQQNI 614
+ SD Q D L Y GE+ S +DS D A+ AD+ E+A+ I G N
Sbjct: 64 QNTSDKASQPAHDDLMARYGGGEDANSEGDIDSTDLARALADADQPELAETISSAGLYNQ 123
Query: 615 IAGRIKEFLNRLVELHGS---IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQ 670
+ RI R+ E +G D ++R+ P+ V+ LLD+ G+G K+ +CV L + +
Sbjct: 124 KSERIIALAQRICEEYGGEAGFD-AFVRDSDPEAVRSTLLDMNGVGPKTADCVLLFAGGR 182
Query: 671 HIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLY 730
FPVD +V RIA R+G P + +L ++ P C
Sbjct: 183 GGVFPVDTHVHRIARRMGLAPADADHETVRAYLERDVPAAK----------CGFG----- 227
Query: 731 ELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
H +I FG+ +C R P C ACPL G C
Sbjct: 228 --HTAIIQFGREYCSARKPACLDDPDACPLAGHC 259
>gi|297733838|emb|CBI15085.3| unnamed protein product [Vitis vinifera]
Length = 310
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 65/118 (55%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V AD I +AI+ G A IK+ L+ L+E G + LE+LR++ D++K L
Sbjct: 139 WQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLERKGKLCLEYLRDLTVDEIKTEL 198
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+G+G K+V CV + LQ FPVD +V +I +GWVP ++HL + P
Sbjct: 199 SHFKGIGPKTVACVLMFHLQRDDFPVDTHVIQIGKAIGWVPAVADRKKAYLHLNRRIP 256
>gi|357124677|ref|XP_003564024.1| PREDICTED: endonuclease III-like [Brachypodium distachyon]
Length = 277
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 83/179 (46%), Gaps = 27/179 (15%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V + + DAI+ G A RI+ L + E G+I LE+LR + D+VK L
Sbjct: 109 WDQVVDEEGKGLEDAIRCGGLAATKAARIRAMLRDVRERRGAICLEYLRELSVDEVKREL 168
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K++ CV + LQ FPVD +V RI +GWVP +IHL K+ P
Sbjct: 169 SQFKGIGPKTMACVLMFYLQKDDFPVDTHVLRITKAIGWVPSTATREKAYIHLNKKIP-- 226
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGA---------CPLRGEC 760
D ++ ++L+ +T GK+ C + GA CPL C
Sbjct: 227 DDLK---------------FDLNCLFVTHGKL-CPSCTKKLGAQKLKVANATCPLASYC 269
>gi|448681086|ref|ZP_21691232.1| endonuclease III [Haloarcula argentinensis DSM 12282]
gi|445768144|gb|EMA19231.1| endonuclease III [Haloarcula argentinensis DSM 12282]
Length = 277
Score = 77.0 bits (188), Expect = 6e-11, Method: Composition-based stats.
Identities = 68/227 (29%), Positives = 100/227 (44%), Gaps = 25/227 (11%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHD-KMDSVDWEAVRCADESEIADAIKERGQQNII 615
+ SD Q D L Y E S + DS A+ AD+ E+A+ I G N
Sbjct: 64 QNTSDKASQPAHDELMARYGGDENASSEGDTDSDLARALADADQPELAETISSAGLYNQK 123
Query: 616 AGRIKEFLNRLVELHG--SIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHI 672
+ RI R+ E +G S E++R P++V+ LLD+ G+G K+ +CV L + +
Sbjct: 124 SERIIALAQRICEEYGGESGFDEFVRESDPEEVRSTLLDMNGVGPKTADCVLLFAGGRGG 183
Query: 673 AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYEL 732
FPVD +V RIA R+G P + +L ++ P +K +
Sbjct: 184 VFPVDTHVHRIARRMGLAPADADHETVREYLERDVPA----------------EKCGFG- 226
Query: 733 HYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASARFALP 775
H MI FG+ +C R P C ACPL G C ++ A P
Sbjct: 227 HTAMIQFGREYCSARKPACLDDPDACPLAGHCDQVGVYPSTGEVADP 273
>gi|374857375|dbj|BAL60228.1| endonuclease III [uncultured candidate division OP1 bacterium]
Length = 225
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 80/171 (46%), Gaps = 20/171 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
++E++ A ++A+AIK G A RI+E L R+ G DL +L+++ +
Sbjct: 60 NYESIVRAPTEKVAEAIKVGGLHQQKAKRIQEILRRIQRERGDYDLSYLKDLSTEHAVRE 119
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
LL +G+G K+ V SL FPVD ++ RI RLG V + P D+ L V
Sbjct: 120 LLKFDGVGKKTAGVVLTFSLNKPYFPVDTHIHRITQRLGLVKQREDPHDVMNAL-----V 174
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D+++ Y+LH +I G+ CK R P C C + C
Sbjct: 175 PDRLK---------------YQLHLHLIWHGRQTCKARKPLCDQCVIADLC 210
>gi|448637456|ref|ZP_21675694.1| endonuclease III [Haloarcula sinaiiensis ATCC 33800]
gi|445764303|gb|EMA15458.1| endonuclease III [Haloarcula sinaiiensis ATCC 33800]
Length = 278
Score = 76.6 bits (187), Expect = 7e-11, Method: Composition-based stats.
Identities = 66/214 (30%), Positives = 96/214 (44%), Gaps = 28/214 (13%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEE-RSHDKMDSVDW-EAVRCADESEIADAIKERGQQNI 614
+ SD Q D L Y GE+ S ++S D A+ AD+ E+A+ I G N
Sbjct: 64 QNTSDKASQPAHDDLMARYGGGEDTNSEGNINSTDLARALADADQPELAETISSAGLYNQ 123
Query: 615 IAGRIKEFLNRLVELHGS---IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQ 670
+ RI R+ E +G D ++R+ +KV+ LLD+ G+G K+ +CV L + +
Sbjct: 124 KSERIIALAQRICEEYGGEAGFD-AFVRDSDSEKVRSTLLDMNGVGPKTADCVLLFAGGR 182
Query: 671 HIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLY 730
FPVD +V RIA R+G P + +L ++ P C
Sbjct: 183 GGVFPVDTHVHRIARRMGLAPADADHETVRAYLERDVPAAK----------CGFG----- 227
Query: 731 ELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
H MI FG+ +C R P C ACPL G C
Sbjct: 228 --HTAMIQFGREYCSARKPACLDDPDACPLAGHC 259
>gi|443896168|dbj|GAC73512.1| nitrogen permease regulator NLRG/NPR2, partial [Pseudozyma
antarctica T-34]
Length = 979
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 74/170 (43%), Gaps = 22/170 (12%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
D +WEA+ A ESE+ +AI+ G N A IK L++ +E HG + L+ L
Sbjct: 528 DHFGRRNWEAIHAAPESELVEAIRCGGLANNKARTIKGILSQTMEKHGKLSLDHLHTATD 587
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
D++ + L+ G+G K CV + + VD +V R+ LGWVP + + HL
Sbjct: 588 DEIMQELVSFNGVGPKVASCVLAFCIGRQSMAVDTHVFRLCKSLGWVPEKANRDQTYYHL 647
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGA 753
+ P K Y LH +I G KR NC A
Sbjct: 648 HERVPGHLK-----------------YPLHVLLIQHG-----KRCANCSA 675
>gi|374852558|dbj|BAL55489.1| endonuclease III [uncultured candidate division OP1 bacterium]
Length = 225
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 79/170 (46%), Gaps = 20/170 (11%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
+E++ A ++A+AIK G A RI+E L R+ G DL +L+++ + L
Sbjct: 61 YESIVRAPTEKVAEAIKVGGLHQQKAKRIQEILRRIQRERGDYDLSYLKDLSTEHAVREL 120
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L +G+G K+ V SL FPVD ++ RI RLG V + P D+ L V
Sbjct: 121 LKFDGVGKKTAGVVLTFSLNKPYFPVDTHIHRITQRLGLVKQREDPHDVMNAL-----VP 175
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D+++ Y+LH +I G+ CK R P C C + C
Sbjct: 176 DRLK---------------YQLHLHLIWHGRQTCKARKPLCDQCVIADLC 210
>gi|298252264|ref|ZP_06976067.1| HhH-GPD family protein [Ktedonobacter racemifer DSM 44963]
gi|297546856|gb|EFH80724.1| HhH-GPD family protein [Ktedonobacter racemifer DSM 44963]
Length = 242
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 82/194 (42%), Gaps = 27/194 (13%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH---------GSIDLEWLRNV 641
WE VR A E+A+AIK G N+ A RI+ L+ L E + + L+
Sbjct: 65 WEEVRDAPTHEVAEAIKSGGLANVKAPRIQSALHTLSEWQRAKGDTRSLSAFLQDELKGQ 124
Query: 642 PPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
P ++ YL + G+G K+ CV L ++ P+D ++ R+ RLG + + H
Sbjct: 125 PLEEAWRYLQQMPGVGPKTAACVLLFNMGRPLMPIDTHLHRLTHRLGLIGPKVSADQAHT 184
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
LK P + Y LH +I G+ C + P C CPL EC
Sbjct: 185 IFLKALP-----------------PEWAYTLHVNLIRHGRTICHAQRPKCPQCPLLSECA 227
Query: 762 HFASAVASARFALP 775
+A +V LP
Sbjct: 228 -YAGSVNPQETMLP 240
>gi|326510563|dbj|BAJ87498.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 282
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 63/118 (53%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V + + DAI+ G A RI+ L + E G+I LE+LR++ D+VK L
Sbjct: 114 WDQVVDEEGMGLEDAIRCGGLAATKAARIRAMLRGVREKRGAICLEYLRDLSVDEVKREL 173
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+G+G K+V CV + LQ FPVD +V RI LGWVP +IH+ + P
Sbjct: 174 SQFKGIGPKTVACVLMFYLQKDDFPVDTHVLRITKALGWVPPTATREKAYIHMNNKIP 231
>gi|297625112|ref|YP_003706546.1| HhH-GPD family protein [Truepera radiovictrix DSM 17093]
gi|297166292|gb|ADI16003.1| HhH-GPD family protein [Truepera radiovictrix DSM 17093]
Length = 241
Score = 75.5 bits (184), Expect = 2e-10, Method: Composition-based stats.
Identities = 55/201 (27%), Positives = 87/201 (43%), Gaps = 16/201 (7%)
Query: 570 LLRRIYSTGEERSHDKMDSV--DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV 627
L +R EER++ +M W VR A +E+A+AI + A I++ + R++
Sbjct: 55 LSQRTTWQNEERAYRRMWERFGSWVGVRDAPVAELAEAIAPSNYPEVKAPNIQKTVARVL 114
Query: 628 ELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 687
E DL +LR++P D+ +L + G+GLK+ V L PVD +V R++ R+
Sbjct: 115 ERSPEADLSFLRDLPLDEALAWLTSLPGVGLKTASLVLLFCFARPVLPVDTHVYRVSQRV 174
Query: 688 GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKR 747
G LL L + LY H ++ G+ C R
Sbjct: 175 G--------------LLSARVKTPTAAHAPLLALLPPEPAVLYNFHMALLVHGQRLCVWR 220
Query: 748 SPNCGACPLRGECKHFASAVA 768
+P C CPL C+ F + V
Sbjct: 221 APRCSRCPLTARCRWFRAQVG 241
>gi|125596638|gb|EAZ36418.1| hypothetical protein OsJ_20749 [Oryza sativa Japonica Group]
Length = 277
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 84/182 (46%), Gaps = 32/182 (17%)
Query: 591 WEAVRCADESE---IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVK 647
W+ V DE E + DAI+ G A RI+ L + E G I LE+LR++ D+VK
Sbjct: 108 WDQV--VDEEEGKRLEDAIRCGGLAATKAARIRAMLRGVRERRGKICLEYLRDLSVDEVK 165
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
L +G+G K+V CV + LQ FPVD +V RI +GWVP ++HL +
Sbjct: 166 TELSRFKGIGPKTVACVLMFYLQKDDFPVDTHVLRITNAIGWVPATASRERAYLHLNSKI 225
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGA---------CPLRG 758
P D ++ ++L+ +T GK+ C+ S G CPL
Sbjct: 226 P--DDLK---------------FDLNCLFVTHGKL-CQSCSKKLGGQKTTGSNSMCPLAS 267
Query: 759 EC 760
C
Sbjct: 268 YC 269
>gi|356513571|ref|XP_003525486.1| PREDICTED: endonuclease III-like [Glycine max]
Length = 284
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 71/137 (51%), Gaps = 4/137 (2%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V A+ ++ +AI+ G A IK L L E G + LE+LR++ D+VK L
Sbjct: 117 WEQVLWAESKDVENAIRCGGLAPTKASCIKNVLRCLRERRGELCLEYLRDLSVDEVKAEL 176
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+G+G K+V CV + +LQ FPVD ++ IA +GWVP ++HL + P
Sbjct: 177 SLFKGIGPKTVACVLMFNLQQDDFPVDTHIFEIAKTMGWVPAVANRNKSYLHLNQRVPNE 236
Query: 711 DKIQ----MYLWPRLCY 723
K +Y +LC+
Sbjct: 237 LKFDLNCLLYTHGKLCH 253
>gi|302680965|ref|XP_003030164.1| hypothetical protein SCHCODRAFT_57851 [Schizophyllum commune H4-8]
gi|300103855|gb|EFI95261.1| hypothetical protein SCHCODRAFT_57851 [Schizophyllum commune H4-8]
Length = 317
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/188 (30%), Positives = 87/188 (46%), Gaps = 27/188 (14%)
Query: 576 STGEERSHDK-MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID 634
S+G +RS D ++ A+ A S +ADAI+ G N A I+ L + HGS
Sbjct: 114 SSGAKRSLDAAFGRNNFAAIADAPTSAVADAIRSGGLANKKAATIQRVLRDIKAKHGSYS 173
Query: 635 LEWLRNVPPD-KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
L+ L +V D +V L+ +G+G K+ CV L L +F VD +V R++ LGWVP
Sbjct: 174 LQHLADVCADAEVMRELMAYDGVGPKTASCVLLFCLGRESFAVDTHVFRLSKLLGWVP-- 231
Query: 694 PLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT----LYELHYQMITFGKIFCKKRSP 749
P D++Q +LD + Y LH MI G+I ++
Sbjct: 232 --------------PKADRVQTQ-----AHLDLRIPGDRKYGLHVLMIDHGRICTGCKTK 272
Query: 750 NCGACPLR 757
G+C L+
Sbjct: 273 GKGSCILK 280
>gi|159042442|ref|YP_001541694.1| DNA-(apurinic or apyrimidinic site) lyase [Caldivirga
maquilingensis IC-167]
gi|157921277|gb|ABW02704.1| DNA-(apurinic or apyrimidinic site) lyase [Caldivirga
maquilingensis IC-167]
Length = 230
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 96/193 (49%), Gaps = 26/193 (13%)
Query: 573 RIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS 632
R+Y ER +++ S+ + +SE+ + ++ G A R+ E N + E +
Sbjct: 55 RVY----ERLKERLGSITPNTILSLSKSELENILRPIGSFRQRARRLIELANTVNEKYNG 110
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL 692
LE++R + D+ + L+++ G+G K+ + V LL+L FPVD ++ RI+ RLG
Sbjct: 111 -SLEFIRGMGTDEARRTLMNLPGVGPKTADVV-LLNLGKPVFPVDTHIMRISHRLG---- 164
Query: 693 QPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
++ Y +K+ + W +L L +H +I FG+ C+ R P C
Sbjct: 165 ----------VMGGY---EKVSAF-WIKL--LKPNEYLMVHLGLIAFGRAICRSRRPLCE 208
Query: 753 ACPLRGECKHFAS 765
CPLR +CK++ S
Sbjct: 209 HCPLRVKCKYYLS 221
>gi|51535148|dbj|BAD37860.1| HhH-GPD base excision DNA repair protein-related-like [Oryza sativa
Japonica Group]
gi|51535812|dbj|BAD37897.1| HhH-GPD base excision DNA repair protein-related-like [Oryza sativa
Japonica Group]
gi|218197872|gb|EEC80299.1| hypothetical protein OsI_22321 [Oryza sativa Indica Group]
Length = 277
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 84/182 (46%), Gaps = 32/182 (17%)
Query: 591 WEAVRCADESE---IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVK 647
W+ V DE E + DAI+ G A RI+ L + E G I LE+LR++ D+VK
Sbjct: 108 WDQV--VDEEEGKRLEDAIRCGGLAATKAARIRAMLRGVRERRGKICLEYLRDLSVDEVK 165
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
L +G+G K+V CV + LQ FPVD +V RI +GWVP ++HL +
Sbjct: 166 TELSRFKGIGPKTVACVLMFYLQKDDFPVDTHVLRITNAIGWVPATASRERAYLHLNSKI 225
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGA---------CPLRG 758
P D ++ ++L+ +T GK+ C+ S G CPL
Sbjct: 226 P--DDLK---------------FDLNCLFVTHGKL-CQSCSKKLGGQKTTGSNSMCPLAS 267
Query: 759 EC 760
C
Sbjct: 268 YC 269
>gi|436837874|ref|YP_007323090.1| HhH-GPD family protein [Fibrella aestuarina BUZ 2]
gi|384069287|emb|CCH02497.1| HhH-GPD family protein [Fibrella aestuarina BUZ 2]
Length = 236
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 81/191 (42%), Gaps = 15/191 (7%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WEAVR A ++ DAI+ I A I+ L L+ G+ ++++LR + ++ ++L
Sbjct: 61 WEAVRDAPLPDLIDAIQTANYPEIKAPYIQNVLRVLIAERGTANIDFLRELSTEEAMQWL 120
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
++ G+G K+ + L + Q PVD +V R+ R+G + + H LL P
Sbjct: 121 TNLPGIGPKTATLLLLFAFQKPVLPVDTHVHRVTQRIGAIGPKVSAAKAHDLLLSYLPA- 179
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASA 770
D L+ H G+ C +P C AC L +C + S
Sbjct: 180 --------------DAPVLFNFHKHFYWHGQRVCTWYNPRCEACVLAPDCDYLQSGKTML 225
Query: 771 RFALPGPSEKG 781
+ P KG
Sbjct: 226 LTSTPSKKPKG 236
>gi|358381882|gb|EHK19556.1| hypothetical protein TRIVIDRAFT_24735, partial [Trichoderma virens
Gv29-8]
Length = 274
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 71/170 (41%), Gaps = 19/170 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+WEA+ ++ I+ G + + I L + +GS L+ L + D
Sbjct: 108 NWEAIVDGGTEKLQRTIRSGGLSVVKSKVIINILRQTKAKYGSYSLDHLLHASNDSAMRE 167
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
LL +G+G K+ CV L LQ +FPVD +V RI LGW P + HL P
Sbjct: 168 LLSFQGVGPKTASCVLLFCLQRDSFPVDTHVYRITGLLGWRPETASREEAQAHLEARVPD 227
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF--CKKRSPNCGACPLR 757
DK Y LH ++T GK C+ S G CPLR
Sbjct: 228 EDK-----------------YGLHILLVTHGKTCNECRAGSTKSGKCPLR 260
>gi|323507809|emb|CBQ67680.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 638
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 71/165 (43%), Gaps = 22/165 (13%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+W A+ A ESE+ DAI+ G N A IK L + +E HG + L+ L + D + +
Sbjct: 199 ANWAAIHAAPESELVDAIRCGGLANNKARTIKGILAQTLEKHGKLSLDHLHDASDDDIMQ 258
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L+ G+G K CV + + VD +V R+ LGWVP + + HL + P
Sbjct: 259 QLVAFNGVGPKVASCVLAFCIGRDSMAVDTHVFRLCKALGWVPDKANRDQTYYHLHERVP 318
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGA 753
K Y LH +I G KR NC A
Sbjct: 319 GHLK-----------------YALHVLLIAHG-----KRCANCSA 341
>gi|297719745|ref|NP_001172234.1| Os01g0217900 [Oryza sativa Japonica Group]
gi|255673002|dbj|BAH90964.1| Os01g0217900, partial [Oryza sativa Japonica Group]
Length = 70
Score = 74.7 bits (182), Expect = 3e-10, Method: Composition-based stats.
Identities = 38/69 (55%), Positives = 42/69 (60%), Gaps = 5/69 (7%)
Query: 1041 VCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPS-----KKE 1095
V FG+S T+F L +IQ FWRGFVCVRGFDR R PRPL RLH S KK
Sbjct: 1 VYFGTSIPTIFKGLTTEEIQHCFWRGFVCVRGFDRTSRAPRPLYARLHFPASKITRNKKS 60
Query: 1096 AGSKRSRDE 1104
AGS RD+
Sbjct: 61 AGSAPGRDD 69
>gi|345560267|gb|EGX43392.1| hypothetical protein AOL_s00215g128 [Arthrobotrys oligospora ATCC
24927]
Length = 531
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 64/121 (52%), Gaps = 8/121 (6%)
Query: 579 EERSHDKM-DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-------LH 630
+E HD + V+W V D E+ AI++ G NI + IK L + E
Sbjct: 162 KEYGHDPVHGGVNWSLVHRRDVKELYKAIEKGGLANIKSKAIKGILEEVCEGTKKRGGKD 221
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV 690
G I L++L D+V E L+ +G+G+K+ CV + L+ AFPVD +V RI+ LGWV
Sbjct: 222 GEISLDYLHEKGDDEVMETLMGFKGVGVKTATCVLMFCLRRSAFPVDTHVFRISKLLGWV 281
Query: 691 P 691
P
Sbjct: 282 P 282
>gi|448626650|ref|ZP_21671429.1| endonuclease III [Haloarcula vallismortis ATCC 29715]
gi|445760262|gb|EMA11526.1| endonuclease III [Haloarcula vallismortis ATCC 29715]
Length = 277
Score = 73.9 bits (180), Expect = 5e-10, Method: Composition-based stats.
Identities = 69/232 (29%), Positives = 102/232 (43%), Gaps = 26/232 (11%)
Query: 553 KTTPKKNSDNTVQQDW--DLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERG 610
+T +N+ + V Q +L+ R S G S D+ A+ AD+ E+A+ I G
Sbjct: 59 RTVLSQNTTDKVSQPAHDELMARYGSDGGPASGGDTDADLARALADADQPELAETISSAG 118
Query: 611 QQNIIAGRIKEFLNRLVELHGSID--LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
N + RI R+ E +G D E++R PD+V+ LLD+ G+G K+ +CV L +
Sbjct: 119 LYNQKSKRIITLAQRICEEYGGEDGFDEFVREGDPDEVRSTLLDMNGVGPKTADCVLLFA 178
Query: 669 -LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQK 727
+ FPVD +V RI R+G P + + +L ++ P C
Sbjct: 179 GGRGGVFPVDTHVHRITRRMGLAPAEADHETVREYLERDVPAAK----------CGFG-- 226
Query: 728 TLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASARFALP 775
H MI FG+ +C R P C ACPL G C AS P
Sbjct: 227 -----HTAMIQFGREYCSARKPACLDDPDACPLAGHCDQVGVYPASGDVVDP 273
>gi|71003520|ref|XP_756426.1| hypothetical protein UM00279.1 [Ustilago maydis 521]
gi|46096031|gb|EAK81264.1| hypothetical protein UM00279.1 [Ustilago maydis 521]
Length = 1875
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 73/169 (43%), Gaps = 25/169 (14%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+W A+ A ESE+ +AI+ G N A IK LN+ ++ HG + L+ L + D++ +
Sbjct: 1435 NWHAIHAAPESELVEAIRCGGLANNKARTIKGILNQTLQRHGKLSLDHLHDATDDEIMQE 1494
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ G+G K CV + + VD +V R+ L WVP + + HL + P
Sbjct: 1495 LVSFNGVGPKVASCVLAFCIGRQSMAVDTHVFRLCKALAWVPEKANRDQTYYHLHERVPG 1554
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
K Y LH +I GK+ C C +G
Sbjct: 1555 PLK-----------------YALHVLLIKHGKM--------CANCSAKG 1578
>gi|170288346|ref|YP_001738584.1| HhH-GPD family protein [Thermotoga sp. RQ2]
gi|170175849|gb|ACB08901.1| HhH-GPD family protein [Thermotoga sp. RQ2]
Length = 220
Score = 73.6 bits (179), Expect = 7e-10, Method: Composition-based stats.
Identities = 51/166 (30%), Positives = 81/166 (48%), Gaps = 19/166 (11%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++A+ IK G NI R+K L L E + +L LR++P ++E LL I+G+G
Sbjct: 68 EEKVAELIKPAGFFNIKTKRLKALLKFLKEY--NYNLSRLRDLPTHILRERLLKIKGIGK 125
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
++ + + L +L+ F VD R+ R+ + L +Y + K+ M +
Sbjct: 126 ETADAILLYALEKPIFVVDSYTRRLLKRIFNIELN------------DYDEVQKLFMTHY 173
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
P D + E H ++ K FC K +P CG CPLR EC H +
Sbjct: 174 PE----DVRLYQEFHGLIVEHAKKFCNK-TPKCGVCPLRKECYHVS 214
>gi|448419884|ref|ZP_21580728.1| endoiii-related endonuclease [Halosarcina pallida JCM 14848]
gi|445674798|gb|ELZ27335.1| endoiii-related endonuclease [Halosarcina pallida JCM 14848]
Length = 283
Score = 73.2 bits (178), Expect = 7e-10, Method: Composition-based stats.
Identities = 67/221 (30%), Positives = 97/221 (43%), Gaps = 33/221 (14%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVD---------WEAVRCADESEIADAIK 607
+ SD Q D L YS+GE RS + + E++ A++SE+A+ I
Sbjct: 60 QNTSDVASQPAHDSLMERYSSGEARSASETRAGSEATRERSDLAESLAAAEQSELAETIS 119
Query: 608 ERGQQNIIAGRIKEFLNRLVELHGSID--LEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
G N + I R+ E G D ++ P +V+E LLDI G+G K+ +CV
Sbjct: 120 SAGLYNQKSEMIIGAAERICEEFGGADGFDAFVTGGDPGEVRERLLDIHGVGPKTADCVL 179
Query: 666 LLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYL 724
L S + FPVD +V RIA R+G P + HL ++ P
Sbjct: 180 LFSGGRGGVFPVDTHVHRIARRMGLAPADADHEAVREHLERDVPA--------------- 224
Query: 725 DQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECK 761
+K + H MI FG+ +C R P C ACPL C+
Sbjct: 225 -EKCGFG-HTAMIQFGREYCTARKPACLDGPEACPLYDLCE 263
>gi|15643148|ref|NP_228192.1| repair endonuclease [Thermotoga maritima MSB8]
gi|4980885|gb|AAD35467.1|AE001718_4 repair endonuclease, putative [Thermotoga maritima MSB8]
Length = 232
Score = 73.2 bits (178), Expect = 7e-10, Method: Composition-based stats.
Identities = 50/166 (30%), Positives = 82/166 (49%), Gaps = 19/166 (11%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++A+ I+ G NI R+KE L L E + +L LR++P ++E LL I+G+G
Sbjct: 80 EEKVAELIRPAGFFNIKTKRLKELLKFLKEY--NYNLSRLRDLPTHILRERLLKIKGIGK 137
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
++ + + L +L+ F VD R+ R+ + L +Y + K+ M +
Sbjct: 138 ETADAILLYALEKPVFVVDSYTRRLLKRIFNIELN------------DYDEVQKLFMTHY 185
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
P D + E H ++ K FC K +P CG CPL+ EC H +
Sbjct: 186 PE----DVRLYQEFHGLIVEHAKKFCSK-TPKCGVCPLKKECCHVS 226
>gi|171184578|ref|YP_001793497.1| DNA-(apurinic or apyrimidinic site) lyase [Pyrobaculum neutrophilum
V24Sta]
gi|170933790|gb|ACB39051.1| DNA-(apurinic or apyrimidinic site) lyase [Pyrobaculum neutrophilum
V24Sta]
Length = 222
Score = 73.2 bits (178), Expect = 8e-10, Method: Composition-based stats.
Identities = 55/181 (30%), Positives = 86/181 (47%), Gaps = 23/181 (12%)
Query: 586 MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK 645
+ S+ EAV E E+A IK G I A +K ++ HG I + L + ++
Sbjct: 59 LGSITPEAVAKLAEEELAALIKPAGMYRIRARALKALAEAFLK-HG-ITPQRLLEMGAER 116
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
+ +L+ + G+G K+ + V L+++ AFPVD ++ RIA R G + +
Sbjct: 117 ARAFLMSLPGVGKKTADVV-LVNIGLPAFPVDTHITRIARRWG--------------IGR 161
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
Y + + M P YL E H ++I FG+ C+ RSP CG CP+ C F S
Sbjct: 162 SYDEISRWFMDRLPPARYL------EFHLKLIQFGRDVCRARSPRCGVCPIGERCPSFKS 215
Query: 766 A 766
A
Sbjct: 216 A 216
>gi|448671097|ref|ZP_21687089.1| endonuclease III [Haloarcula amylolytica JCM 13557]
gi|445766179|gb|EMA17313.1| endonuclease III [Haloarcula amylolytica JCM 13557]
Length = 278
Score = 73.2 bits (178), Expect = 8e-10, Method: Composition-based stats.
Identities = 60/191 (31%), Positives = 86/191 (45%), Gaps = 28/191 (14%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS---IDLEWLRNVPPDKVKEYLLDI 653
AD+ E+A+ I G N + RI R+ E +G D E++R+ P+ V+ LLD+
Sbjct: 106 ADQPELAETISSAGLYNQKSERIIALAQRICEEYGGEAGFD-EFVRDSDPEAVRSTLLDM 164
Query: 654 EGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
G+G K+ +CV L + + FPVD +V RIA R+G P + +L ++ P
Sbjct: 165 NGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRMGLAPPDADHETVREYLERDVPAAK- 223
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVA 768
C H MI FG+ +C R P C ACPL G C V
Sbjct: 224 ---------CGFG-------HTAMIQFGREYCSARKPACLDDPDACPLAGHCDQV--GVY 265
Query: 769 SARFALPGPSE 779
A + PSE
Sbjct: 266 PAADEITDPSE 276
>gi|418046315|ref|ZP_12684409.1| HhH-GPD family protein [Thermotoga maritima MSB8]
gi|351675868|gb|EHA59028.1| HhH-GPD family protein [Thermotoga maritima MSB8]
Length = 220
Score = 73.2 bits (178), Expect = 9e-10, Method: Composition-based stats.
Identities = 50/166 (30%), Positives = 82/166 (49%), Gaps = 19/166 (11%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++A+ I+ G NI R+KE L L E + +L LR++P ++E LL I+G+G
Sbjct: 68 EEKVAELIRPAGFFNIKTKRLKELLKFLKEY--NYNLSRLRDLPTHILRERLLKIKGIGK 125
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
++ + + L +L+ F VD R+ R+ + L +Y + K+ M +
Sbjct: 126 ETADAILLYALEKPVFVVDSYTRRLLKRIFNIELN------------DYDEVQKLFMTHY 173
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
P D + E H ++ K FC K +P CG CPL+ EC H +
Sbjct: 174 PE----DVRLYQEFHGLIVEHAKKFCSK-TPKCGVCPLKKECCHVS 214
>gi|448411957|ref|ZP_21576313.1| endonuclease III [Halosimplex carlsbadense 2-9-1]
gi|445669891|gb|ELZ22499.1| endonuclease III [Halosimplex carlsbadense 2-9-1]
Length = 279
Score = 72.8 bits (177), Expect = 9e-10, Method: Composition-based stats.
Identities = 58/192 (30%), Positives = 86/192 (44%), Gaps = 24/192 (12%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID--LEWLRNVPPDKVKEY 649
E++ AD+ E+A+ I G N + RI R+ E +G D E+++ P+ V+E
Sbjct: 100 ESLAAADQQELAETISSAGLYNQKSERIVALAERICEEYGGADGFDEFVKTGEPNVVRET 159
Query: 650 LLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LLD+ G+G K+ +CV L S + FPVD +V RIA R+G L P D
Sbjct: 160 LLDMNGVGPKTADCVLLFSGGRGGVFPVDTHVHRIARRMG---LAPADAD-------HEG 209
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
V + ++ + C H MI FG+ +C R P C ACPL C
Sbjct: 210 VREALEAAVPEEKCGFG-------HTAMIQFGREYCSARKPACLDDPEACPLADLCDQVG 262
Query: 765 SAVASARFALPG 776
+ + P
Sbjct: 263 VSPGTGEVVDPA 274
>gi|395217179|ref|ZP_10401538.1| HhH-GPD family protein [Pontibacter sp. BAB1700]
gi|394455098|gb|EJF09644.1| HhH-GPD family protein [Pontibacter sp. BAB1700]
Length = 235
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 80/181 (44%), Gaps = 15/181 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DWE V A E E+ADA+ A +I++ L + E G I++++L ++P D+ +
Sbjct: 65 DWEGVLNAPEDELADAVSTTRYPLQKAPQIQQTLRMIKEDRGEINIDFLADLPIDEAMAW 124
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L + G+G K+ + L + + PVD +V R++ R+G + + H LL+ P
Sbjct: 125 LTRLPGVGPKTATLLLLFNFKKPVLPVDTHVFRVSQRVGLIGAKITANKAHDVLLEMLPK 184
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
D L+ H + G+ C P C C L G C +++ A
Sbjct: 185 ---------------DATELFNFHKHLFWHGQKVCTWSYPKCEKCVLNGICNYYSQVRAK 229
Query: 770 A 770
Sbjct: 230 G 230
>gi|340516169|gb|EGR46419.1| predicted protein [Trichoderma reesei QM6a]
Length = 274
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 71/170 (41%), Gaps = 19/170 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+W ++ ++ I G + + I + L++ +G L+ L + D+
Sbjct: 118 NWASIVEGGTEKLQRTIHSGGLSVVKSRVIMDILHQTKARYGKYSLDHLLDASDDEAMRE 177
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
LL +G+G K+ CV L L+ +F VD +V RI+ LGW+P HL P
Sbjct: 178 LLSFQGVGPKTASCVLLFCLRRDSFAVDTHVYRISSLLGWIPPSATREQAQAHLEARVPD 237
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGK--IFCKKRSPNCGACPLR 757
DK Y LH M+T GK CK R N G C LR
Sbjct: 238 EDK-----------------YGLHILMVTHGKRCAECKARGKNAGKCALR 270
>gi|448605866|ref|ZP_21658459.1| endonuclease III [Haloferax sulfurifontis ATCC BAA-897]
gi|445741189|gb|ELZ92693.1| endonuclease III [Haloferax sulfurifontis ATCC BAA-897]
Length = 268
Score = 72.0 bits (175), Expect = 2e-09, Method: Composition-based stats.
Identities = 58/198 (29%), Positives = 92/198 (46%), Gaps = 30/198 (15%)
Query: 592 EAVRCADESEIADAIKERG----QQNIIAGRIKEFLNRLVELHGSIDLE-WLRNVPPDKV 646
E++ AD I DAI+ G + N+I G +E L + D + ++R P V
Sbjct: 90 ESLAAADREGIVDAIRSGGLYNQKSNLIQGVAEEVL---ADFGSEADFDRYVREEDPATV 146
Query: 647 KEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
++ LL+++G+G K+ +CV L + + FPVD +V RIA R+G P + L +
Sbjct: 147 RDRLLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGLAPADADHEGVRAALER 206
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECK 761
+ P D+K + H MI FG+ FCK R+P C GACP+ C+
Sbjct: 207 DIP----------------DEKCGFG-HTAMIQFGREFCKARTPACLDGPGACPMADVCE 249
Query: 762 HFASAVASARFALPGPSE 779
A+ P ++
Sbjct: 250 MVGVDPAAQSVTDPAAAD 267
>gi|449541266|gb|EMD32251.1| hypothetical protein CERSUDRAFT_22472, partial [Ceriporiopsis
subvermispora B]
Length = 275
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 83/186 (44%), Gaps = 36/186 (19%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVK-- 647
D+ A+ A +SE+ +AI+ G N AG I++ L + E HGS L+ L V + K
Sbjct: 83 DFAAIAEAPKSEVVEAIRMGGLANKKAGVIQQVLKEVRERHGSYSLQHLAGVVEKEAKGK 142
Query: 648 --------------EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
+ L+ +G+G K+ CV L L +FPVD +V R++ LGWVP +
Sbjct: 143 AKSSGTPVSDEEAMQELVSYDGVGPKTASCVLLFCLGRASFPVDTHVFRLSKLLGWVPAK 202
Query: 694 PLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK--KRSPNC 751
HL + P Q Y LH MI G+ CK K S +
Sbjct: 203 ADRVTAQAHLDLKIP-----------------QDLKYGLHVLMIAHGR-RCKGCKTSGSR 244
Query: 752 GACPLR 757
+CPL+
Sbjct: 245 TSCPLK 250
>gi|440476191|gb|ELQ44813.1| HhH-GPD family base excision DNA repair protein [Magnaporthe oryzae
Y34]
gi|440486908|gb|ELQ66733.1| HhH-GPD family base excision DNA repair protein [Magnaporthe oryzae
P131]
Length = 375
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 72/170 (42%), Gaps = 19/170 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+W AV ++ +AI+ G + + I L + +GS L+ LR ++
Sbjct: 195 NWPAVVAGGVGKLEEAIRCGGLSVVKSRVIMSILETCAQRYGSYSLDHLREASDEEAMRE 254
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+L +G+G K+ CV L L +F VD +V R+ LGW P + H+HL P
Sbjct: 255 MLAFKGVGPKTASCVLLFCLGRESFAVDTHVHRLTGMLGWRPAGTSREEAHLHLDARIPD 314
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGK--IFCKKRSPNCGACPLR 757
DK Y LH +IT GK CK + G C LR
Sbjct: 315 EDK-----------------YGLHVLLITHGKRCAECKAGGKSSGKCELR 347
>gi|159898923|ref|YP_001545170.1| DNA-(apurinic or apyrimidinic site) lyase [Herpetosiphon
aurantiacus DSM 785]
gi|159891962|gb|ABX05042.1| DNA-(apurinic or apyrimidinic site) lyase [Herpetosiphon
aurantiacus DSM 785]
Length = 222
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 77/173 (44%), Gaps = 17/173 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W AV A+ SE+ + I+ G I A RI+ L ++E G L++LR + + + +L
Sbjct: 61 WAAVLNAESSELEETIRVGGLAKIKAARIQNTLAVILEQRGEFSLDFLRELGLHEARAWL 120
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G K+ CV + A VD ++ R+A R+G + + H L PV
Sbjct: 121 TALPGIGPKTAGCVLCFACNQPAMIVDTHIHRVAKRVGMIGPKVSADAAHDLLESAVPV- 179
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
DQ +Y+ H ++ G+ C + P C CPL C +
Sbjct: 180 --------------DQ--MYQFHVSVLLHGRQICHAQRPACERCPLTEICDFY 216
>gi|402075883|gb|EJT71306.1| hypothetical protein GGTG_10565 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 393
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 81/226 (35%), Gaps = 50/226 (22%)
Query: 560 SDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRI 619
S NT + +R HD +W AV + + DAI+ G + I
Sbjct: 182 SQNTTDANSARAKRAMDAAYGGRHD-----NWAAVAAGGTARLEDAIRCGGLAAAKSRVI 236
Query: 620 KEFLNRLVELHGS--------------------------IDLEWLRNVPPDKVKEYLLDI 653
L RL E H S LE +R++P LL
Sbjct: 237 ISILERLRERHASSSSLSSLSSSPAVAKDDDEKNQNDSIYSLEHIRDLPTADAMRELLSF 296
Query: 654 EGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKI 713
G+G K+ CV L L +F VD +V RIA LGW P + + H+HL P DK
Sbjct: 297 RGVGPKTASCVLLFCLGRDSFAVDTHVHRIAGLLGWRPPRATRDETHLHLDARVPDTDK- 355
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGK--IFCKKRSPNCGACPLR 757
Y LH M+T GK CK G C LR
Sbjct: 356 ----------------YGLHVLMVTHGKRCAECKAGGRAAGRCELR 385
>gi|55376682|ref|YP_134533.1| endonuclease III [Haloarcula marismortui ATCC 43049]
gi|448642537|ref|ZP_21678496.1| endonuclease III [Haloarcula sinaiiensis ATCC 33800]
gi|55229407|gb|AAV44827.1| endonuclease III [Haloarcula marismortui ATCC 43049]
gi|445759337|gb|EMA10615.1| endonuclease III [Haloarcula sinaiiensis ATCC 33800]
Length = 233
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 73/164 (44%), Gaps = 18/164 (10%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ AD E+AD I+ G + A RI+ L + E G+ L +L +P D+ K
Sbjct: 74 DFAAIESADHDELADTIRVAGLPDQKAARIQRALAAIREETGGAYSLAFLDAMPTDEAKG 133
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L DI+G+G K+ V VD +V R++ R G V H L P
Sbjct: 134 WLTDIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVTESATNKRAHDELDAIIP 193
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
D+ T Y LH +IT G+ FC RSP+C
Sbjct: 194 ----------------DELT-YPLHVLLITHGREFCSARSPDCA 220
>gi|358400221|gb|EHK49552.1| hypothetical protein TRIATDRAFT_26345, partial [Trichoderma
atroviride IMI 206040]
Length = 268
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 78/186 (41%), Gaps = 21/186 (11%)
Query: 576 STGEERSHDKM--DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSI 633
ST + S DK S +W+A+ ++ IK G + + I L + + +G+
Sbjct: 86 STRAKLSMDKTYGRSDNWDAIVQGGTDKLQQTIKSGGLSVVKSKVIMNILRQTKDQYGTY 145
Query: 634 DLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
L L N D+ LL +G+G K+ CV L LQ +F VD +V RI+ LGW P +
Sbjct: 146 SLNHLLNASNDEAMRELLSFQGVGPKTASCVLLFCLQRDSFAVDTHVYRISGLLGWRPGE 205
Query: 694 PLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF--CKKRSPNC 751
+ HL P DK Y LH M+T G+ CK
Sbjct: 206 ANREEAQAHLEATVPDEDK-----------------YGLHVLMVTHGRTCDECKAGGKIA 248
Query: 752 GACPLR 757
G C LR
Sbjct: 249 GKCALR 254
>gi|389622827|ref|XP_003709067.1| HhH-GPD family base excision DNA repair protein [Magnaporthe oryzae
70-15]
gi|351648596|gb|EHA56455.1| HhH-GPD family base excision DNA repair protein [Magnaporthe oryzae
70-15]
Length = 382
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 72/170 (42%), Gaps = 19/170 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+W AV ++ +AI+ G + + I L + +GS L+ LR ++
Sbjct: 202 NWPAVVAGGVGKLEEAIRCGGLSVVKSRVIMSILETCAQRYGSYSLDHLREASDEEAMRE 261
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+L +G+G K+ CV L L +F VD +V R+ LGW P + H+HL P
Sbjct: 262 MLAFKGVGPKTASCVLLFCLGRESFAVDTHVHRLTGMLGWRPAGTSREEAHLHLDARIPD 321
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGK--IFCKKRSPNCGACPLR 757
DK Y LH +IT GK CK + G C LR
Sbjct: 322 EDK-----------------YGLHVLLITHGKRCAECKAGGKSSGKCELR 354
>gi|388852390|emb|CCF54005.1| uncharacterized protein [Ustilago hordei]
Length = 635
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 69/153 (45%), Gaps = 17/153 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+WEA+ A ESE+ +AI+ G N A IK L + +E HG + L+ L N D++ +
Sbjct: 193 NWEAIHSAPESELVEAIRCGGLANNKARTIKGILAQTMERHGKLSLDHLHNATDDEIMQE 252
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ G+G K CV + + VD +V R+ L WVP + + HL + P
Sbjct: 253 LVRFNGVGPKVASCVLAFCIGRQSMAVDTHVFRLCKALRWVPEKANRDQTYYHLHERVPG 312
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
K Y LH +I GK+
Sbjct: 313 DLK-----------------YALHVLLIKHGKM 328
>gi|397691245|ref|YP_006528499.1| HhH-GPD family protein [Melioribacter roseus P3M]
gi|395812737|gb|AFN75486.1| HhH-GPD family protein [Melioribacter roseus P3M]
Length = 221
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 86/203 (42%), Gaps = 32/203 (15%)
Query: 560 SDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRI 619
+DN + + L+R YS DW + A EI IK+ G N AG I
Sbjct: 31 NDNNSYEAYQNLKRKYS-------------DWNQLVKAPLKEIEKTIKKAGLTNQKAGAI 77
Query: 620 KEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVN 679
K + L +D+++++ + + EYL +G+G+K+ CV L SL PVD +
Sbjct: 78 KNLVVNLSR-DNRLDMKFIKKMDNKEALEYLTSFKGIGIKTASCVLLFSLYRNVCPVDTH 136
Query: 680 VGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITF 739
V R R+G V + P + + + P + + LH +I
Sbjct: 137 VHRTLNRIGIVN-EKSPEKTFLRINENLP-----------------EGIAHSLHTNLIKL 178
Query: 740 GKIFCKKRSPNCGACPLRGECKH 762
G+ C+ +P+C CP+ C +
Sbjct: 179 GRGICRPTNPDCTVCPISDLCAY 201
>gi|282164121|ref|YP_003356506.1| putative endonuclease III [Methanocella paludicola SANAE]
gi|282156435|dbj|BAI61523.1| putative endonuclease III [Methanocella paludicola SANAE]
Length = 225
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 18/166 (10%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A E +IA+ IK G I A RIKE +++ +G D++++ P+ ++ LL IEG+
Sbjct: 66 APEDKIAELIKIGGLYTIKARRIKEISRLILDDYGG-DIDFVCTANPEAARKELLSIEGV 124
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ +CV L + PVD +V R+ RLG VP + + H L++ P +
Sbjct: 125 GPKTADCVLLFACGDDVIPVDTHVFRVTKRLGIVPEKADHEETHRILMENVPAGKR---- 180
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+H +I FG+ C+ +SP C L C +
Sbjct: 181 -------------GSVHVDLIRFGREICRAQSPKHDECFLIDVCDY 213
>gi|429220080|ref|YP_007181724.1| endoIII-related endonuclease [Deinococcus peraridilitoris DSM
19664]
gi|429130943|gb|AFZ67958.1| putative endoIII-related endonuclease [Deinococcus peraridilitoris
DSM 19664]
Length = 263
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 79/175 (45%), Gaps = 15/175 (8%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
V WE V AD + +A+AI + A I+ L + E G ++++L ++P ++
Sbjct: 96 VSWEGVEQADVTALAEAISPSNFAEVKAPNIQRTLRAIREARGEYNIDFLADLPVEEGLR 155
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+LL + G+GLK+ V L + PVD +V R++ R+G + +
Sbjct: 156 WLLALPGVGLKTATLVLLFCFRKSVLPVDTHVHRVSQRVGLIGPK--------------- 200
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
VM + L L + LY H ++T G+ C P C C LR C ++
Sbjct: 201 VMHEAAHRLLLALLPPEPALLYNFHLNLLTHGQKICTFSRPRCSKCVLREVCDYY 255
>gi|322371204|ref|ZP_08045756.1| DNA-(apurinic or apyrimidinic site) lyase [Haladaptatus
paucihalophilus DX253]
gi|320549194|gb|EFW90856.1| DNA-(apurinic or apyrimidinic site) lyase [Haladaptatus
paucihalophilus DX253]
Length = 268
Score = 70.9 bits (172), Expect = 4e-09, Method: Composition-based stats.
Identities = 62/198 (31%), Positives = 89/198 (44%), Gaps = 26/198 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEY 649
E + AD E+AD I+ G N A I E R++ GS E++R P++V+
Sbjct: 90 ETLAKADHGELADTIRSAGLYNQKADVIIESAERVLADFGSAAAFDEFVREEEPNEVRSE 149
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LLD+ G+G K+ +CV L S Q FPVD +V RI R+G P D+ L +E P
Sbjct: 150 LLDMNGVGPKTADCVLLFSGGQSGVFPVDTHVHRIYRRMGIAPADADHEDVREVLEREVP 209
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
+K + H I FG+ +C R P C ACP+ C+
Sbjct: 210 A----------------EKCGFG-HTASIQFGREYCSARKPACLDGPEACPMFDLCEQV- 251
Query: 765 SAVASARFALPGPSEKGI 782
V R + P+E G+
Sbjct: 252 -GVDVERGEVVDPAESGL 268
>gi|397619559|gb|EJK65312.1| hypothetical protein THAOC_13838, partial [Thalassiosira oceanica]
Length = 527
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 1/103 (0%)
Query: 590 DWEAV-RCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
DWE V ++ S +A AI+ G R++ L ++ E G + E+LR+ D++K+
Sbjct: 334 DWEQVAESSNASRVAAAIRCAGLSKTRTERMQLMLQKVKEERGKANFEFLRDCSSDEIKK 393
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
LL +G+G K+V CV L +L FPVD +V RI+ + WVP
Sbjct: 394 QLLQHKGMGPKTVSCVLLFALLKPEFPVDTHVLRISKQNRWVP 436
>gi|344212676|ref|YP_004796996.1| endonuclease III [Haloarcula hispanica ATCC 33960]
gi|343784031|gb|AEM58008.1| endonuclease III [Haloarcula hispanica ATCC 33960]
Length = 277
Score = 70.1 bits (170), Expect = 6e-09, Method: Composition-based stats.
Identities = 59/189 (31%), Positives = 87/189 (46%), Gaps = 28/189 (14%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS---IDLEWLRNVPPDKVKEYLLDIEG 655
+ E+A+ I G N + RI R+ E +G D E++R+ P+KV+ LLD+ G
Sbjct: 107 QPELAETISSAGLYNQKSERIIALAQRICEEYGGEAGFD-EFVRDSDPEKVRSTLLDMNG 165
Query: 656 LGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
+G K+ +CV L + + FPVD +V RIA R+G P + +L ++ P
Sbjct: 166 VGPKTADCVLLFAGGRGGVFPVDTHVHRIARRMGLAPPDADHETVREYLERDVP------ 219
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASA 770
+K + H MI FG+ +C R P C ACPL G C V A
Sbjct: 220 ----------GEKCGFG-HTAMIQFGREYCSARKPACLDDPDACPLAGHCDQV--GVYPA 266
Query: 771 RFALPGPSE 779
+ PSE
Sbjct: 267 ADEVTDPSE 275
>gi|336375725|gb|EGO04061.1| hypothetical protein SERLA73DRAFT_175790 [Serpula lacrymans var.
lacrymans S7.3]
gi|336388834|gb|EGO29978.1| hypothetical protein SERLADRAFT_458392 [Serpula lacrymans var.
lacrymans S7.9]
Length = 366
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 90/221 (40%), Gaps = 30/221 (13%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIYST---------GEERSHDKMDSV----DWEAVRC 596
R + +P N+ T ++L + T R+ +D+ D+ A+
Sbjct: 117 RLRAPSPSANAARTCGSVPNVLESLIGTILSQNTSSANSTRAKHSLDTAFGRNDFAAIAK 176
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A ++ AI G N A I++ L + + HG+ +L+ L +VP ++ L+ G+
Sbjct: 177 AARPDVVAAIASGGLANKKAKTIQDILESVKQRHGAYNLQHLADVPDEEAMRELVSYNGV 236
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ CV L +F VD +V R++ LGWVP HL P K
Sbjct: 237 GPKTAACVLSFCLGRESFAVDTHVYRLSRLLGWVPQSADRVLAQAHLDVTLPGELK---- 292
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
Y LH M+ G+I + G+CPL+
Sbjct: 293 -------------YALHVMMVKHGRICSGCKKDGKGSCPLK 320
>gi|365174645|ref|ZP_09362085.1| hypothetical protein HMPREF1006_00030 [Synergistes sp. 3_1_syn1]
gi|363614442|gb|EHL65936.1| hypothetical protein HMPREF1006_00030 [Synergistes sp. 3_1_syn1]
Length = 231
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 74/180 (41%), Gaps = 17/180 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V AD E+ + ++ G + RI++ L + E G L+ LR +V+EYL
Sbjct: 65 WEEVAHADLEELKEVLRIAGMSSTKPPRIQQILAAVKEKFGGYTLKELRGWKQPEVREYL 124
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G K+ V L FPVD ++ R++ R GW + P
Sbjct: 125 TSLPGVGPKTSAIVECFDLGMPGFPVDTHITRLSKRFGWADEKSPP-------------- 170
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASA 770
DKIQ L L + H + G+ C R P+C C L C + A A
Sbjct: 171 DKIQARL---EAALPPERFRGGHLNFLDHGRSICSARKPDCARCALAKWCGYGKRAAEEA 227
>gi|18312246|ref|NP_558913.1| DNA-(apurinic or apyrimidinic site) lyase (endonuclease III, PaNth)
[Pyrobaculum aerophilum str. IM2]
gi|7141250|gb|AAF37269.1|AF222334_1 putative DNA glycosylase [Pyrobaculum aerophilum]
gi|18159688|gb|AAL63095.1| DNA-(apurinic or apyrimidinic site) lyase (endonuclease III, PaNth)
[Pyrobaculum aerophilum str. IM2]
Length = 223
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 80/165 (48%), Gaps = 23/165 (13%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+ + IK G A +K ++L + E L + P++ +E LL + G+G K+
Sbjct: 76 LEELIKPAGMYRQRARNLKALAEAFIQL--GLTPERLVEMGPERARELLLSLPGVGKKTA 133
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
+ V L++L AFPVD ++ RIA R G + ++Y + + M PR
Sbjct: 134 DVV-LVNLGLPAFPVDTHITRIAKRWG--------------IGEKYDEISRWFMERLPRD 178
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
YLD H ++I FG+ C+ R+P CG CP+ +C F SA
Sbjct: 179 KYLD------FHLKLIQFGRDVCRARNPKCGQCPIGAKCPSFKSA 217
>gi|374327779|ref|YP_005085979.1| hypothetical protein P186_2339 [Pyrobaculum sp. 1860]
gi|356643048|gb|AET33727.1| hypothetical protein P186_2339 [Pyrobaculum sp. 1860]
Length = 224
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 84/182 (46%), Gaps = 23/182 (12%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
++ V EA+ E+ + I+ G A +K + ++L I + L + P
Sbjct: 60 RLGGVTPEALLALSTEELEELIRPAGMHRQRARNLKALADAFIKL--GITPQRLVEMGPG 117
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ + LL + G+G K+ + V L++L AFPVD ++ RIA R G GD + +
Sbjct: 118 EARRLLLSLPGVGEKTADVV-LVNLGLPAFPVDTHITRIAKRWG-------VGDRYGQIS 169
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ + M + YL E+H ++I FG+ CK R P CG CP+ C +
Sbjct: 170 RWFMEMVPPERYL-------------EIHLKLIQFGRYVCKARDPRCGVCPIGERCPSYR 216
Query: 765 SA 766
SA
Sbjct: 217 SA 218
>gi|385802541|ref|YP_005838941.1| DNA-(apurinic or apyrimidinic site) lyase [Haloquadratum walsbyi
C23]
gi|339728033|emb|CCC39154.1| endonuclease III [Haloquadratum walsbyi C23]
Length = 223
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 73/164 (44%), Gaps = 18/164 (10%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ A E+AD I+ G +N A RI+ L + E G L +L N + +
Sbjct: 58 DYRAMETAPHEELADVIEVAGLKNQKAARIQRALTAIRKETGGEYTLMFLANQSTEAAQS 117
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L DI+G+G K+ V + F VD +V R+A R G + H L + P
Sbjct: 118 WLTDIKGVGPKTASVVLNFHFEKPTFAVDTHVERLAKRFGLLDSTASNKRAHTELNELVP 177
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
D ++ Y LH MIT G+ +C +SPNC
Sbjct: 178 --DDLK---------------YSLHVLMITHGREYCTAQSPNCA 204
>gi|126458694|ref|YP_001054972.1| HhH-GPD family protein [Pyrobaculum calidifontis JCM 11548]
gi|126248415|gb|ABO07506.1| HhH-GPD family protein [Pyrobaculum calidifontis JCM 11548]
Length = 219
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 95/207 (45%), Gaps = 34/207 (16%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+ SD + +D L+R ++ + EAV E E+A+ IK G I A
Sbjct: 44 QNTSDRNAFKAYDQLKR-----------RLGEITPEAVLQLSEDELAELIKPAGMYRIRA 92
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPV 676
I+ + + + E LR + P + +++LL + G+G K+ + + L++L AFPV
Sbjct: 93 RNIRALADAFI--RHKVTPEKLREMGPVEARKFLLSLPGVGEKTADVI-LVNLGLPAFPV 149
Query: 677 DVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQM 736
D ++ RIA R G V G++ ++ P + YL E+H ++
Sbjct: 150 DTHIRRIAKRWGIVGNH---GEISRRFMEAVPP----EKYL-------------EVHLKL 189
Query: 737 ITFGKIFCKKRSPNCGACPLRGECKHF 763
I FG+ C R+P C CP+ +C +
Sbjct: 190 IQFGRDICTARAPKCHICPIGSKCPSY 216
>gi|403252796|ref|ZP_10919102.1| repair endonuclease [Thermotoga sp. EMP]
gi|402811884|gb|EJX26367.1| repair endonuclease [Thermotoga sp. EMP]
Length = 232
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 81/166 (48%), Gaps = 19/166 (11%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++A+ I+ G NI R+KE L L E + DL LR++P ++E L I+G+G
Sbjct: 80 EEKMAELIRPAGFFNIKTKRLKELLKFLKEY--NYDLSRLRDLPTHILRERFLKIKGIGK 137
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
++ + + L +L+ F VD R+ R+ + L +Y + ++ M +
Sbjct: 138 ETADAILLYALEKPIFVVDSYTRRLLKRIFNIELN------------DYDEVQRLFMTHY 185
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
P D + E H ++ K FC K +P C ACPLR EC H +
Sbjct: 186 PE----DVQLYQEFHGLIVEHAKKFCSK-TPKCDACPLRKECYHVS 226
>gi|354611948|ref|ZP_09029900.1| HhH-GPD family protein [Halobacterium sp. DL1]
gi|353191526|gb|EHB57032.1| HhH-GPD family protein [Halobacterium sp. DL1]
Length = 269
Score = 69.3 bits (168), Expect = 1e-08, Method: Composition-based stats.
Identities = 63/212 (29%), Positives = 92/212 (43%), Gaps = 33/212 (15%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+ SD Q D L Y+ GEE + A+ A + E+A+ I G N +
Sbjct: 63 QNTSDVASQPAHDALLERYAPGEETDLAR-------ALADAHQDELAETISSAGLYNQKS 115
Query: 617 GRIKEFLNRLVELHGS---IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
+ ++ E +G D E++R PD V++ LLD++G+G K+ +CV L S
Sbjct: 116 ETLIRLAGQICEEYGGEAGFD-EFVRTEDPDAVRDALLDMKGVGPKTADCVLLFSGGRAG 174
Query: 674 -FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYEL 732
FPVD +V RIA R+G P P H V + ++ + C
Sbjct: 175 VFPVDTHVHRIARRMGLAP----PDADH------EGVREALEAAVPAEACGFG------- 217
Query: 733 HYQMITFGKIFCKKRSPNC----GACPLRGEC 760
H MI FG+ +C R P C ACPL C
Sbjct: 218 HTAMIQFGREYCTARKPACLDGPEACPLYDYC 249
>gi|284036648|ref|YP_003386578.1| HhH-GPD family protein [Spirosoma linguale DSM 74]
gi|283815941|gb|ADB37779.1| HhH-GPD family protein [Spirosoma linguale DSM 74]
Length = 241
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 70/176 (39%), Gaps = 15/176 (8%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE VR A ++ DAI+ I A I+ L L G ++++L + + +L
Sbjct: 65 WEQVRDAPLPDLIDAIRSANYPEIKAPYIQNLLTHLFRETGQANVDFLGQLSTEDAMAWL 124
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G+K+ + L Q PVD +V R+ RLG + + H LL P
Sbjct: 125 TKLPGIGMKTATILLLFKFQKPVLPVDTHVHRVTQRLGLIGPKVSAEKAHTILLSYLP-- 182
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
LD L+ H G+ C P C C L+ C ++ S
Sbjct: 183 -------------LDALVLFNFHKHFYWHGQRVCTWYFPKCSECVLQTMCDYYQSG 225
>gi|148269671|ref|YP_001244131.1| HhH-GPD family protein [Thermotoga petrophila RKU-1]
gi|281411619|ref|YP_003345698.1| HhH-GPD family protein [Thermotoga naphthophila RKU-10]
gi|147735215|gb|ABQ46555.1| DNA-3-methyladenine glycosylase III [Thermotoga petrophila RKU-1]
gi|281372722|gb|ADA66284.1| HhH-GPD family protein [Thermotoga naphthophila RKU-10]
Length = 220
Score = 69.3 bits (168), Expect = 1e-08, Method: Composition-based stats.
Identities = 48/166 (28%), Positives = 81/166 (48%), Gaps = 19/166 (11%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++A+ I+ G NI R+K L L E + +L LR++P ++E LL I+G+G
Sbjct: 68 EEKVAELIRPAGFFNIKTKRLKALLKFLKEY--NYNLSRLRDLPTHILRERLLKIKGIGK 125
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
++ + + L +L+ F VD R+ R+ + L +Y + ++ M +
Sbjct: 126 ETADAILLYALEKPIFVVDSYTRRLLKRIFNIELN------------DYDEVQRLFMTHY 173
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
P D + E H ++ K FC K +P CG CPL+ EC H +
Sbjct: 174 PE----DVRLYQEFHGLIVEHAKKFCSK-TPKCGVCPLKKECCHVS 214
>gi|110667125|ref|YP_656936.1| endonuclease III; DNA-(apurinic or apyrimidinic site) lyase
[Haloquadratum walsbyi DSM 16790]
gi|109624872|emb|CAJ51281.1| endonuclease III [Haloquadratum walsbyi DSM 16790]
Length = 223
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 73/164 (44%), Gaps = 18/164 (10%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ A E+AD I+ G +N A RI+ L + E G L +L N + +
Sbjct: 58 DYRAMETAPHEELADIIEVAGLKNQKAARIQRALTAIRKETGGEYTLMFLANQSTEAAQS 117
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L DI+G+G K+ V + F VD +V R+A R G + H L + P
Sbjct: 118 WLTDIKGVGPKTASVVLNFHFEKPTFAVDTHVERLAKRFGLLDSTASNKRAHTELNELVP 177
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
D ++ Y LH MIT G+ +C +SPNC
Sbjct: 178 --DDLK---------------YSLHVLMITHGREYCTAQSPNCA 204
>gi|313127078|ref|YP_004037348.1| endoiii-related endonuclease [Halogeometricum borinquense DSM
11551]
gi|448288454|ref|ZP_21479653.1| endoiii-related endonuclease [Halogeometricum borinquense DSM
11551]
gi|312293443|gb|ADQ67903.1| predicted endoIII-related endonuclease [Halogeometricum borinquense
DSM 11551]
gi|445569605|gb|ELY24177.1| endoiii-related endonuclease [Halogeometricum borinquense DSM
11551]
Length = 277
Score = 68.9 bits (167), Expect = 1e-08, Method: Composition-based stats.
Identities = 55/170 (32%), Positives = 79/170 (46%), Gaps = 24/170 (14%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEG 655
++SE+A+ I G N + I R+ E G D E++++ PD V++ LLDI G
Sbjct: 106 EQSELAETIASAGLYNQKSEMIIGAAERICESFGGADGFDEFVKDEDPDTVRKRLLDIHG 165
Query: 656 LGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
+G K+ +CV L S + FPVD +V RI R+G D+ HL + P
Sbjct: 166 VGPKTADCVLLFSGGRGGVFPVDTHVHRIGRRMGLASADADHEDVREHLEADVPA----- 220
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
+K + H MI FG+ +CK R P C ACPL C
Sbjct: 221 -----------EKCGFG-HTAMIQFGREYCKARKPACLDGPEACPLYDLC 258
>gi|327310305|ref|YP_004337202.1| DNA-(apurinic or apyrimidinic site) lyase [Thermoproteus uzoniensis
768-20]
gi|326946784|gb|AEA11890.1| DNA-(apurinic or apyrimidinic site) lyase [Thermoproteus uzoniensis
768-20]
Length = 213
Score = 68.9 bits (167), Expect = 1e-08, Method: Composition-based stats.
Identities = 48/169 (28%), Positives = 82/169 (48%), Gaps = 25/169 (14%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
+A+ E +A+ I+ G + A ++ E L + +DL + ++ ++ + +L
Sbjct: 62 QALLSLGEERLAELIRPAGMHRVRARKLIELSRSLSD----VDLSRIADMDVEEARRFLT 117
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ + V L +L AFPVD ++ RIA R G + K Y +
Sbjct: 118 SLPGVGEKTADVV-LANLGKPAFPVDTHITRIARRWG--------------IGKRYGEIS 162
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ M P YL E+H ++I FG+ +C+ RSP CG CP+R C
Sbjct: 163 RWFMERLPPERYL------EVHLKLIQFGRDYCRARSPRCGECPVRDLC 205
>gi|284042265|ref|YP_003392605.1| DNA-(apurinic or apyrimidinic site) lyase [Conexibacter woesei DSM
14684]
gi|283946486|gb|ADB49230.1| DNA-(apurinic or apyrimidinic site) lyase [Conexibacter woesei DSM
14684]
Length = 241
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 79/174 (45%), Gaps = 21/174 (12%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVEL--HGSIDLEWLRNVPPDKVK 647
DW AV A +E+ +AI+ G + + RI++ L + + +DL +LR+ +
Sbjct: 79 DWRAVLEAPNAEVEEAIRPGGISKVKSKRIQQILRAIDDSPEGAGLDLSFLRDASVPDGQ 138
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV-PLQPLPGDLHIHLLKE 706
YL + G+G K+ CV L + PVD +V R+ +RL + P P +LH +L
Sbjct: 139 RYLCSLPGVGRKTAACVLLFAYGLRDVPVDTHVSRVGMRLRLLRPGAPFE-ELHDEMLDL 197
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P ++ E H ++ G+ C R P+C AC LR C
Sbjct: 198 SPRGQEL-----------------EFHVNLLRHGRRTCHARRPDCPACVLRRVC 234
>gi|409048228|gb|EKM57706.1| hypothetical protein PHACADRAFT_251498 [Phanerochaete carnosa
HHB-10118-sp]
Length = 323
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 83/198 (41%), Gaps = 39/198 (19%)
Query: 577 TGEERSHD-KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL 635
T +R+ D K ++EA+ A +SE+ DA++ G N A I+ L+ + E HGS L
Sbjct: 110 TAAKRNLDVKFGRNNFEAIVNAPKSELVDALRTGGLANKKAEVIQRILHEVKERHGSYSL 169
Query: 636 EWLRNV-----PPDKVKEY--------LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 682
+ L V P ++ K + L+ G+G K+ CV L L +F VD +V R
Sbjct: 170 QHLAGVVESQEPGEEAKVFSDEDAMKELVSYNGVGPKTASCVLLFCLGRASFAVDTHVFR 229
Query: 683 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
++ LGWVP + HL P Y LH M+ G+
Sbjct: 230 LSRLLGWVPARANRVTAQAHLDLRVP-----------------NDLKYGLHVLMVGHGR- 271
Query: 743 FCKKRSPNCGACPLRGEC 760
C C RGEC
Sbjct: 272 -------QCNGCRGRGEC 282
>gi|229918737|ref|YP_002887383.1| endonuclease III [Exiguobacterium sp. AT1b]
gi|229470166|gb|ACQ71938.1| endonuclease III [Exiguobacterium sp. AT1b]
Length = 219
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 76/170 (44%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ AD SEI IK G A +K ++V HG I VP D+ L
Sbjct: 63 EAMAAADVSEIEALIKRIGLYRNKAKNVKALSEKIVNEHGGI-------VPSDRAS--LE 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V ++ AF VD +V R++ RLG + + L+K++P
Sbjct: 114 ALPGVGRKTANVVLSVAFHEPAFAVDTHVERVSKRLGICRWKDNVRQVEDTLMKKFP--- 170
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++ +LH+Q I FG+ CK + P C ACPL C+
Sbjct: 171 --------------REEWSQLHHQFIFFGRYHCKAQRPGCEACPLLHMCR 206
>gi|296420260|ref|XP_002839693.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635887|emb|CAZ83884.1| unnamed protein product [Tuber melanosporum]
Length = 441
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/200 (27%), Positives = 87/200 (43%), Gaps = 32/200 (16%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN--RLVELH--------GSIDLEW 637
+VDW+AVR AD ++ ++I++ G RIK L+ R L G + L++
Sbjct: 194 TVDWDAVRRADLDDVVNSIRKGGMAPTKGRRIKNLLDAKRKEPLSNPDKDDEDGDLSLDY 253
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
+ + D+ ++ L +G+G K+ CV L L+ +F VD +V R++ L WVP +
Sbjct: 254 IHKLSDDEARQKLTSFDGVGPKTASCVMLFCLRRDSFAVDTHVFRLSKFLKWVPAKATRE 313
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
+ HL P K Y LH +I G+ CK+ C A P
Sbjct: 314 TTYAHLDVRVPAEHK-----------------YALHNLLIRHGRT-CKE----CKAGPKH 351
Query: 758 GECKHFASAVASARFALPGP 777
K +S + PGP
Sbjct: 352 KGKKRVSSETSLDLSPEPGP 371
>gi|239906402|ref|YP_002953143.1| DNA glycosylase [Desulfovibrio magneticus RS-1]
gi|239796268|dbj|BAH75257.1| putative DNA glycosylase [Desulfovibrio magneticus RS-1]
Length = 216
Score = 68.2 bits (165), Expect = 3e-08, Method: Composition-based stats.
Identities = 45/168 (26%), Positives = 76/168 (45%), Gaps = 16/168 (9%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+D + A +A+ I+ G + AGR+K + +VE G DL L D+ ++
Sbjct: 60 LDPHVLHAASLEAVAERIRPAGHFRVKAGRLKNLMALIVEELGG-DLTALAGYDLDQARD 118
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL ++G+G ++ + + L L AF VD RI R G P + +L + P
Sbjct: 119 KLLSVKGVGPETADSILLYGLNLPAFVVDAYTARICFRHGLAPEEAGYDELRELFMDALP 178
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
+ +++Y E H ++ G +C+ R+P C ACPL
Sbjct: 179 --EDVRLY-------------NEFHALLVRVGNAWCRPRAPKCAACPL 211
>gi|257053763|ref|YP_003131596.1| DNA-(apurinic or apyrimidinic site) lyase [Halorhabdus utahensis
DSM 12940]
gi|256692526|gb|ACV12863.1| DNA-(apurinic or apyrimidinic site) lyase [Halorhabdus utahensis
DSM 12940]
Length = 274
Score = 68.2 bits (165), Expect = 3e-08, Method: Composition-based stats.
Identities = 67/229 (29%), Positives = 98/229 (42%), Gaps = 29/229 (12%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+ SD Q D L Y + +E D +D A+ A+++E+A+ I G N +
Sbjct: 65 QNTSDVASQPAHDALMDRYGSEDEEEVDLVD-----ALADAEQAELAETISGAGLYNQKS 119
Query: 617 GRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHIA 673
RI E R+ E +G D ++R P + V+E LL + G+G K+ +CV L + +
Sbjct: 120 ARIIEIAQRIREEYGGEDDFDAFVREEPAEAVRETLLAMNGVGPKTADCVLLFAGGRDGV 179
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
FPVD +V RI RLG P PG H V + ++ + C H
Sbjct: 180 FPVDTHVHRIYRRLGIAP----PGADH------EAVREVLEKKVPEGKCGFG-------H 222
Query: 734 YQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASARFALPGPS 778
I FG+ +C R P C ACPL C AS P +
Sbjct: 223 TASIQFGREYCSAREPACLDGPEACPLYDLCDRVGIEPASGDVVDPAET 271
>gi|448627973|ref|ZP_21672205.1| endonuclease III [Haloarcula vallismortis ATCC 29715]
gi|445758595|gb|EMA09901.1| endonuclease III [Haloarcula vallismortis ATCC 29715]
Length = 233
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 76/175 (43%), Gaps = 20/175 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+EA+ AD ++AD I+ G + A RI+ L + E G+ L +L +P ++ K
Sbjct: 74 DFEAIESADHDKLADTIRVAGLPDQKAARIQRALAAIREETGGAYSLAFLDAMPAEEAKG 133
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L DI+G+G K+ V VD +V R++ R G V H L P
Sbjct: 134 WLTDIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVTESATNKRAHDELDALIP 193
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG--ACPLRGECK 761
D+ T Y LH +IT G+ FC R P+C C CK
Sbjct: 194 ----------------DELT-YPLHVLLITHGREFCSARRPDCSNPVCERYCSCK 231
>gi|449067484|ref|YP_007434566.1| endonuclease III [Sulfolobus acidocaldarius N8]
gi|449069758|ref|YP_007436839.1| endonuclease III [Sulfolobus acidocaldarius Ron12/I]
gi|449035992|gb|AGE71418.1| endonuclease III [Sulfolobus acidocaldarius N8]
gi|449038266|gb|AGE73691.1| endonuclease III [Sulfolobus acidocaldarius Ron12/I]
Length = 225
Score = 68.2 bits (165), Expect = 3e-08, Method: Composition-based stats.
Identities = 49/172 (28%), Positives = 79/172 (45%), Gaps = 23/172 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
+ ++ AD + I DAI+ G N A IK + +L +I+++ K++++L
Sbjct: 71 DRIKHADINTIIDAIRVAGLGNSKARYIKNVAEVINDLDLNIEIDC------QKLRDFLT 124
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
IEG+G K+ + V L + FP+D ++ R+ RLG+ L P
Sbjct: 125 AIEGIGDKTADVVLLTCFRCREFPIDTHIRRVISRLGF--------------LGSSPKYK 170
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
I Y R D L LH+ +I G+ CK R P C C +R CK++
Sbjct: 171 DISEYFKTRFSSED---LLNLHHLLIAHGRKTCKSRKPICDKCVIRDYCKYY 219
>gi|70607241|ref|YP_256111.1| endonuclease III [Sulfolobus acidocaldarius DSM 639]
gi|68567889|gb|AAY80818.1| endonuclease III [Sulfolobus acidocaldarius DSM 639]
Length = 223
Score = 68.2 bits (165), Expect = 3e-08, Method: Composition-based stats.
Identities = 49/172 (28%), Positives = 79/172 (45%), Gaps = 23/172 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
+ ++ AD + I DAI+ G N A IK + +L +I+++ K++++L
Sbjct: 69 DRIKHADINTIIDAIRVAGLGNSKARYIKNVAEVINDLDLNIEIDC------QKLRDFLT 122
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
IEG+G K+ + V L + FP+D ++ R+ RLG+ L P
Sbjct: 123 AIEGIGDKTADVVLLTCFRCREFPIDTHIRRVISRLGF--------------LGSSPKYK 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
I Y R D L LH+ +I G+ CK R P C C +R CK++
Sbjct: 169 DISEYFKTRFSSED---LLNLHHLLIAHGRKTCKSRKPICDKCVIRDYCKYY 217
>gi|320354681|ref|YP_004196020.1| DNA-3-methyladenine glycosylase III [Desulfobulbus propionicus DSM
2032]
gi|320123183|gb|ADW18729.1| DNA-3-methyladenine glycosylase III [Desulfobulbus propionicus DSM
2032]
Length = 214
Score = 67.8 bits (164), Expect = 3e-08, Method: Composition-based stats.
Identities = 46/157 (29%), Positives = 76/157 (48%), Gaps = 16/157 (10%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+A+ I+ G NI AGR+ L+ + + HG +L+ P +++E LL I+G+G ++
Sbjct: 73 LAEYIRPAGYYNIKAGRLHNLLSTINQQHGG-NLQAFLEQPLPQLREQLLAIKGIGRETA 131
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
+ + L + F VD RI VR H L+ E + IQ L
Sbjct: 132 DSILLYAAGLPIFVVDAYTHRILVR-------------H-QLIDEECGYEAIQELFMDNL 177
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
D + E H ++ G ++CKK+ P+C +CPL+G
Sbjct: 178 A-CDPRLYNEYHALLVRVGNVYCKKKHPDCASCPLQG 213
>gi|406886945|gb|EKD33865.1| hypothetical protein ACD_75C02568G0003 [uncultured bacterium]
Length = 223
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 75/171 (43%), Gaps = 18/171 (10%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+ +E + EIA IK G N+ AGR++ LN + E H L+ +E
Sbjct: 67 MTYETLSLCSAEEIAVYIKPAGYYNLKAGRLRNLLNMIAE-HYDGQLDCFLQEKLAVARE 125
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LLD++G+G ++ + + L S H F VD+ R+ R V + +Y
Sbjct: 126 RLLDVKGIGPETADSILLYSCGHPIFVVDMYTHRVFSRHNLVAEE-----------TDYH 174
Query: 709 VMDKIQM-YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
M + M +L P D E H ++ FCKK +P C CPLRG
Sbjct: 175 AMQALFMDHLSP-----DPVLFNEFHALVVRVAATFCKKTNPQCETCPLRG 220
>gi|399575316|ref|ZP_10769074.1| endoiii-related endonuclease [Halogranum salarium B-1]
gi|399239584|gb|EJN60510.1| endoiii-related endonuclease [Halogranum salarium B-1]
Length = 265
Score = 67.8 bits (164), Expect = 3e-08, Method: Composition-based stats.
Identities = 55/174 (31%), Positives = 84/174 (48%), Gaps = 30/174 (17%)
Query: 597 ADESEIADAIKERGQQN----IIAGRIKEFLNRLVELHGSIDL-EWLRNVPPDKVKEYLL 651
A++SE+A+ I+ G N +I G +E ++ E G+ + E++R P V+ LL
Sbjct: 93 AEQSELAETIQSAGLYNQKSEMIVGAAEEIVD---EFGGADEFDEFVREEDPQTVRTRLL 149
Query: 652 DIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+I G+G K+ +CV L S + FPVD +V RIA R+G P ++ HL ++ P
Sbjct: 150 EIHGVGPKTADCVLLFSGGRGGVFPVDTHVHRIARRMGLAPADADHEEVREHLERDVPA- 208
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
+K + H MI FG+ +C R P C ACPL C
Sbjct: 209 ---------------EKCGFG-HTAMIQFGREYCTARKPACLDGPEACPLYDLC 246
>gi|389846321|ref|YP_006348560.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mediterranei ATCC 33500]
gi|448616059|ref|ZP_21664769.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mediterranei ATCC 33500]
gi|388243627|gb|AFK18573.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mediterranei ATCC 33500]
gi|445750714|gb|EMA02151.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mediterranei ATCC 33500]
Length = 268
Score = 67.8 bits (164), Expect = 3e-08, Method: Composition-based stats.
Identities = 57/195 (29%), Positives = 86/195 (44%), Gaps = 24/195 (12%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEY 649
E++ AD I DAI+ G N + I+ ++ GS D ++R P V++
Sbjct: 90 ESLAAADREGIIDAIRSGGLYNQKSKLIQGVAEEVLADFGSEDDFDRYVREADPSTVRDR 149
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL+++G+G K+ +CV L + + FPVD +V RIA RLG P + L +E P
Sbjct: 150 LLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRLGVAPADADHEGVRAALEREVP 209
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
+K + H MI FG+ FCK R P C CP+ C+
Sbjct: 210 A----------------EKCGFG-HTAMIQFGREFCKARKPVCLDGKDECPMADVCEKVG 252
Query: 765 SAVASARFALPGPSE 779
VA P ++
Sbjct: 253 VDVAGQSVVDPAAAD 267
>gi|358060340|dbj|GAA93745.1| hypothetical protein E5Q_00391 [Mixia osmundae IAM 14324]
Length = 1544
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 78/176 (44%), Gaps = 26/176 (14%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D+ A++ A ++ ++I G N+ A IK+ L+++ E G +DL +L ++ V
Sbjct: 643 DYAAIQAASVEDVTESIATGGLANVKAKVIKKVLDQVQEKRGKLDLTYLHDMEDIDVMRE 702
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ +G+G K+ CV L L +F VD +V R+ L WVP + HL + P
Sbjct: 703 LISFDGVGFKTASCVLLFCLGRESFAVDTHVFRLTKALNWVPAKADRETTFQHLDVKIPG 762
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPN--------CGACPLR 757
K Y LH +I G+ CK S N +CPLR
Sbjct: 763 PLK-----------------YPLHSLLIHHGR-SCKNCSANGVLTEEPVVESCPLR 800
>gi|226357357|ref|YP_002787097.1| DNA-(apurinic or apyrimidinic site) lyase [Deinococcus deserti
VCD115]
gi|226319347|gb|ACO47343.1| putative DNA-(apurinic or apyrimidinic site) lyase (endonuclease
III) [Deinococcus deserti VCD115]
Length = 247
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 76/175 (43%), Gaps = 16/175 (9%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW+A+ A +A AI+ A RI+ L + + G +L++LR +P ++
Sbjct: 72 DWDAIIAAPTEAVAHAIRRSNYPESKAPRIQATLRAIRDAPGGYNLDFLRELPVKDALKW 131
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L D+ G+G+K+ V L + FPVD +V R+ R+G +P G+ H +
Sbjct: 132 LTDLPGVGIKTASLVLLFNYARPVFPVDTHVHRVNTRVGTIPRM---GEQAAHRALLGLL 188
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
LYELH ++ G+ C P C C LR C FA
Sbjct: 189 PPD-------------PPLLYELHINLLKHGQKVCTWSRPRCLQCVLRERCDAFA 230
>gi|229581431|ref|YP_002839830.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
Y.N.15.51]
gi|228012147|gb|ACP47908.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
Y.N.15.51]
Length = 233
Score = 67.8 bits (164), Expect = 4e-08, Method: Composition-based stats.
Identities = 50/188 (26%), Positives = 87/188 (46%), Gaps = 20/188 (10%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
+K V E + AD S+I +A+K G A R+K ++E + + L + P
Sbjct: 61 EKKVGVTPEKLSDADLSDIENALKISGLYKTKAKRLKIISKIILEKYNGLIDNLLNSSNP 120
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
+E LL EG+G K+ + V L + FP+D ++ R++ RLG VP+
Sbjct: 121 ---REELLKFEGIGEKTADVVLLTCRGYEVFPIDTHITRVSKRLGIVPMN---------- 167
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+Y ++ L+ L +LH+ +I G+ CK R P C +C ++ C+++
Sbjct: 168 -AKYELISSTLKELF------SAYDLLQLHHLLIAHGRQICKARKPLCNSCIIKECCEYY 220
Query: 764 ASAVASAR 771
+ AR
Sbjct: 221 SHRDGEAR 228
>gi|167628251|ref|YP_001678750.1| base excision repair protein, hhh-gpd family [Heliobacterium
modesticaldum Ice1]
gi|167590991|gb|ABZ82739.1| base excision repair protein, hhh-gpd family [Heliobacterium
modesticaldum Ice1]
Length = 231
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 83/178 (46%), Gaps = 22/178 (12%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+D A+ A ++A ++ N A R++ F R+V+ +G LE L ++ ++++
Sbjct: 61 LDISALAEAPREQVARLVRSTRYYNQKAERLQGFARRIVDEYGG-RLENLLSLEAGELRK 119
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LLDI+G+G ++ +C+ L Q F VD RI RLG+ E
Sbjct: 120 RLLDIKGIGKETADCIILYGAQQPIFVVDAYTRRIFSRLGY--------------FSEKV 165
Query: 709 VMDKIQMYLWPRL---CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
D++Q + RL YL Q E H Q+ G +C + P+C CPL C ++
Sbjct: 166 GYDEMQAFFAERLEPDLYLFQ----EYHAQIDGLGNRYCLAKGPSCAECPLGDCCTYY 219
>gi|15805952|ref|NP_294652.1| endonuclease III [Deinococcus radiodurans R1]
gi|6458651|gb|AAF10505.1|AE001945_9 endonuclease III, putative [Deinococcus radiodurans R1]
Length = 338
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 89/209 (42%), Gaps = 31/209 (14%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
++N+ Q+ W++L Y E D D + E+ + A G + A
Sbjct: 135 QQNTRRVAQRQWEVLTATYPQWEAALLDGPDGI---------EATLKSAGG--GLSRMKA 183
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPP-----DKVKEYLLDIEGLGLKSVECVRLLSLQH 671
I L L E HG + L +LR P ++ ++ L + G+G K+V V L L+
Sbjct: 184 DYIYGILAHLQEHHGGLSLRFLREFPHTPEGHEQARQALAALPGVGHKTVALVLLFDLRR 243
Query: 672 IAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYE 731
A PVD N+ R A R L+ +P + H ++ + + + D +T +
Sbjct: 244 PAMPVDGNMERAAKR-----LELVPAAWNSHKVERW----------YAEVMPADWETRFA 288
Query: 732 LHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LH + G+ C+ + P C CPLR C
Sbjct: 289 LHISGVRHGRDTCRSKHPLCPQCPLREFC 317
>gi|359417776|ref|ZP_09209845.1| HhH-GPD family protein [Candidatus Haloredivivus sp. G17]
gi|358031869|gb|EHK00704.1| HhH-GPD family protein [Candidatus Haloredivivus sp. G17]
Length = 215
Score = 67.0 bits (162), Expect = 5e-08, Method: Composition-based stats.
Identities = 50/162 (30%), Positives = 75/162 (46%), Gaps = 17/162 (10%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D+E +R AD+ E+ D I+ G A RI+E LN + E GS +E+L + D+ K++
Sbjct: 61 DFEEIRYADKEELIDTIRIAGLGPTKAERIQEALNIIYEEEGSYSVEFLDELSLDEGKKW 120
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L DI G+G K+ + + PVD +V R+A R G VP H L ++ P
Sbjct: 121 LTDIPGIGPKTASVILGFHFKKPTIPVDTHVHRLANRYGLVPEGSSRKRTHDILEEKVP- 179
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
D+I+ Y H +I G+ CK C
Sbjct: 180 -DEIK---------------YSFHRLLIEHGRAECKASKDEC 205
>gi|361069807|gb|AEW09215.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
Length = 137
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 67/130 (51%), Gaps = 21/130 (16%)
Query: 749 PNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEF------GNGIGQSPPLVVNPI 802
PNC ACP+R +CKHFASA SAR L E T+ N S PL P+
Sbjct: 1 PNCNACPMRCDCKHFASAFTSARLLLTSTKESSSDTNPLLTLPYKENENAPSTPL---PL 57
Query: 803 PVIR--------IEADPVS--ESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIG 852
P I+ E P+ ES Q + CEPIIEEP +P P+ E+ + +++ ++
Sbjct: 58 PTIQQDVNIEKPAEVLPLQLLESNPQDHKCEPIIEEPTTPEPEIIENTNTIVEECAYT-- 115
Query: 853 DIEEIPTLRL 862
D +EIP ++L
Sbjct: 116 DPDEIPVIKL 125
>gi|448625246|ref|ZP_21671013.1| endonuclease III [Haloferax denitrificans ATCC 35960]
gi|445749008|gb|EMA00454.1| endonuclease III [Haloferax denitrificans ATCC 35960]
Length = 258
Score = 67.0 bits (162), Expect = 6e-08, Method: Composition-based stats.
Identities = 55/180 (30%), Positives = 84/180 (46%), Gaps = 30/180 (16%)
Query: 592 EAVRCADESEIADAIKERGQQN----IIAGRIKEFLNRLVELHGSIDLE-WLRNVPPDKV 646
E++ AD I DAI+ G N +I G +E L + D + ++R P V
Sbjct: 90 ESLAAADREGIVDAIRSGGLYNQKSKLIQGVAEEVL---ADFGSEADFDRYVREEAPATV 146
Query: 647 KEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
++ LL+++G+G K+ +CV L + + FPVD +V RIA R+G P + L +
Sbjct: 147 RDRLLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGLAPADADHEGVRAALER 206
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECK 761
+ P D+K + H MI FG+ FCK R P C ACP+ C+
Sbjct: 207 DIP----------------DEKCGFG-HTAMIQFGREFCKARKPACLDGPEACPMADVCE 249
>gi|42557689|emb|CAF28664.1| putative endonuclease III [uncultured crenarchaeote]
Length = 219
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 80/178 (44%), Gaps = 27/178 (15%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D + + A+ +I D+I G NI A RIK+ + L+E S VP + E
Sbjct: 69 DIDGISKAELKDIRDSIHSIGFYNIKAKRIKQVVQLLIEKFDS-------KVPSNL--EE 119
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
LL + G+G K+ CV + + A PVDV+V RI+ RLG V + + E +
Sbjct: 120 LLTLPGVGRKTANCVLVYAFNQPAIPVDVHVHRISNRLGIVNTRKVEET-------ELEL 172
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
+ I +W E++ +T+G+ C P C C L+ CK + S V
Sbjct: 173 CNIIDKEMW-----------IEVNDTFVTYGQNVCLPIKPKCNICQLKKMCKFYKSNV 219
>gi|119719059|ref|YP_919554.1| HhH-GPD family protein [Thermofilum pendens Hrk 5]
gi|119524179|gb|ABL77551.1| HhH-GPD family protein [Thermofilum pendens Hrk 5]
Length = 236
Score = 66.6 bits (161), Expect = 7e-08, Method: Composition-based stats.
Identities = 60/210 (28%), Positives = 98/210 (46%), Gaps = 33/210 (15%)
Query: 559 NSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGR 618
N +NT+ + +++ EER V+ E +R A SE+A+AI+ G Q A
Sbjct: 57 NENNTL--------KAFASLEERV-----GVECEKIRKAGLSELAEAIRPAGLQEQKAKA 103
Query: 619 IKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDV 678
IK+ + L E +G D+ L + ++V L IEG+G K+++ V L + + P+D
Sbjct: 104 IKQVASLLYEKYGC-DIGKLLSRGVEEVIRELKQIEGIGDKTID-VLLANYGYPVLPIDT 161
Query: 679 NVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMIT 738
+V R++VRLG PG Y M K + LD H +I
Sbjct: 162 HVRRVSVRLGLAR----PG--------SYRAMQKSLHGFFREEARLDA------HLYLIK 203
Query: 739 FGKIFCKKRSPNCGACPLRGECKHFASAVA 768
G+ C+ ++P C CPL C ++ ++ A
Sbjct: 204 LGRTLCRAKNPLCDECPLSDLCCYYRTSRA 233
>gi|227828269|ref|YP_002830049.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
M.14.25]
gi|229579909|ref|YP_002838308.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
Y.G.57.14]
gi|238620462|ref|YP_002915288.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
M.16.4]
gi|227460065|gb|ACP38751.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
M.14.25]
gi|228010624|gb|ACP46386.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
Y.G.57.14]
gi|238381532|gb|ACR42620.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
M.16.4]
Length = 233
Score = 66.6 bits (161), Expect = 7e-08, Method: Composition-based stats.
Identities = 50/188 (26%), Positives = 87/188 (46%), Gaps = 20/188 (10%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
+K V E + AD S+I +A+K G A R+K ++E + + L + P
Sbjct: 61 EKKVGVTPEKLSDADLSDIENALKISGLYKTKAKRLKIISKIILEKYNGLIDNLLNSSNP 120
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
+E LL EG+G K+ + V L + FP+D ++ R++ RLG VP+
Sbjct: 121 ---REELLKFEGIGEKTADVVLLTCRGYEVFPIDTHITRVSKRLGIVPMN---------- 167
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+Y ++ L+ L +LH+ +I G+ CK R P C +C ++ C+++
Sbjct: 168 -AKYELISSTLKELF------SAYDLLQLHHLLIAHGRQTCKARKPLCNSCIIKECCEYY 220
Query: 764 ASAVASAR 771
+ AR
Sbjct: 221 SHRDGEAR 228
>gi|119872647|ref|YP_930654.1| HhH-GPD family protein [Pyrobaculum islandicum DSM 4184]
gi|119674055|gb|ABL88311.1| HhH-GPD family protein [Pyrobaculum islandicum DSM 4184]
Length = 222
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 85/182 (46%), Gaps = 23/182 (12%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
++ ++ E++ E E+A+ IK G A +K L I + L + +
Sbjct: 58 RLGTITPESLNKMSEGELAELIKPAGMYRQRARVLKNLAETF--LKYDITPQRLLEMGAE 115
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ + +LL + G+G K+ + + L++L AFPVD ++ RIA R G +
Sbjct: 116 RARAFLLTLPGVGKKTADVI-LVNLGLPAFPVDTHITRIARRWG--------------IG 160
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
K Y D+I + RL Q ELH ++I FG+ CK R+P C CP+ C +
Sbjct: 161 KSY---DEISRWFIERL---PQHKYLELHLKLIQFGREICKARNPKCDVCPIGQRCPSYK 214
Query: 765 SA 766
SA
Sbjct: 215 SA 216
>gi|227831027|ref|YP_002832807.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
L.S.2.15]
gi|284998523|ref|YP_003420291.1| HhH-GPD family protein [Sulfolobus islandicus L.D.8.5]
gi|227457475|gb|ACP36162.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
L.S.2.15]
gi|284446419|gb|ADB87921.1| HhH-GPD family protein [Sulfolobus islandicus L.D.8.5]
Length = 233
Score = 66.6 bits (161), Expect = 7e-08, Method: Composition-based stats.
Identities = 50/188 (26%), Positives = 87/188 (46%), Gaps = 20/188 (10%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
+K V E + AD S+I +A+K G A R+K ++E + + L + P
Sbjct: 61 EKKVGVTPEKLSDADLSDIENALKISGLYKTKAKRLKIISKIILEKYNGLIDNLLNSSNP 120
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
+E LL EG+G K+ + V L + FP+D ++ R++ RLG VP+
Sbjct: 121 ---REELLKFEGIGEKTADVVLLTCRGYEVFPIDTHITRVSKRLGIVPMN---------- 167
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+Y ++ L+ L +LH+ +I G+ CK R P C +C ++ C+++
Sbjct: 168 -AKYELISSTLKELF------SAYDLLQLHHLLIAHGRQTCKARKPLCNSCIIKECCEYY 220
Query: 764 ASAVASAR 771
+ AR
Sbjct: 221 SHRDGEAR 228
>gi|307111853|gb|EFN60087.1| hypothetical protein CHLNCDRAFT_14787, partial [Chlorella
variabilis]
Length = 56
Score = 66.2 bits (160), Expect = 9e-08, Method: Composition-based stats.
Identities = 31/58 (53%), Positives = 38/58 (65%), Gaps = 2/58 (3%)
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV 690
+ LEWLR+ P + + YL++I GLG KSV C+ LL L FPVD NVGRI R WV
Sbjct: 1 LSLEWLRDAPDEVARNYLMNINGLGRKSVGCIMLLCLGKKEFPVDTNVGRICAR--WV 56
>gi|390601386|gb|EIN10780.1| DNA glycosylase [Punctularia strigosozonata HHB-11173 SS5]
Length = 359
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 81/195 (41%), Gaps = 32/195 (16%)
Query: 576 STGEERSHDK-MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID 634
STG +RS D ++ A+ A + DAI+ G N AG I+ L + HG
Sbjct: 133 STGAKRSLDAAFGRNNFAAIADAPRERVVDAIRHGGLANKKAGVIQNLLKEIKARHGDYS 192
Query: 635 LEWLRNVPP------DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLG 688
L+ L + P D++ L+ +G+G K+ CV L L+ +F VD +V R++ LG
Sbjct: 193 LQHLASRPSKSALSDDEIMHELVSYDGVGPKTASCVLLFCLERPSFAVDTHVFRLSKMLG 252
Query: 689 WVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRS 748
WVP + HL P K Y LH + G++
Sbjct: 253 WVPAKSDRVLAQAHLDARIPGELK-----------------YGLHCGFVKHGRV-----C 290
Query: 749 PNCGACPLRG---EC 760
P C A P G EC
Sbjct: 291 PACKAGPSGGGKVEC 305
>gi|448651578|ref|ZP_21680628.1| endonuclease III [Haloarcula californiae ATCC 33799]
gi|445770458|gb|EMA21521.1| endonuclease III [Haloarcula californiae ATCC 33799]
Length = 233
Score = 66.2 bits (160), Expect = 9e-08, Method: Composition-based stats.
Identities = 51/164 (31%), Positives = 73/164 (44%), Gaps = 18/164 (10%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ AD E+AD I+ G + A RI+ L + E G+ L +L +P D+ K
Sbjct: 74 DFAAIESADHDELADTIRVAGLPDQKAARIQRALAAIREETGGAYSLAFLDAMPTDEAKG 133
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L DI+G+G K+ V VD +V R++ R G V H L P
Sbjct: 134 WLTDIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVTESATNKRAHDELDAIIP 193
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
D+ T Y LH +IT G+ FC RSP+C
Sbjct: 194 ----------------DELT-YPLHVLLITHGREFCSARSPDCA 220
>gi|389690704|ref|ZP_10179597.1| putative endoIII-related endonuclease [Microvirga sp. WSM3557]
gi|388588947|gb|EIM29236.1| putative endoIII-related endonuclease [Microvirga sp. WSM3557]
Length = 255
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 77/173 (44%), Gaps = 14/173 (8%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
DW A+ A E+ + I+ A R++ L + E HGS+ L++L +P D+ +
Sbjct: 65 ADWRAMMDAPTEEVEETIQGVTWPEQKAPRLQAVLRAIEERHGSLSLDFLDTMPVDEARA 124
Query: 649 YLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
+L I G+G K+ V S L+ A PVD + R+A R G +P G H L
Sbjct: 125 WLESIPGVGPKTSAAVLSFSKLRKAALPVDSHHHRVAQRTGLIPQSTAVGPSHAVLAALL 184
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P W D + +Y+ H M+ G+ C +SP C C + C
Sbjct: 185 P-------EDW------DAQQVYDHHEVMMLHGQRCCFFKSPACERCAILDLC 224
>gi|126466261|ref|YP_001041370.1| HhH-GPD family protein [Staphylothermus marinus F1]
gi|126015084|gb|ABN70462.1| HhH-GPD family protein [Staphylothermus marinus F1]
Length = 228
Score = 66.2 bits (160), Expect = 9e-08, Method: Composition-based stats.
Identities = 56/214 (26%), Positives = 94/214 (43%), Gaps = 34/214 (15%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+ SD + + L++IY E + +K+ S E ++ +A+K G N A
Sbjct: 45 QNTSDKNAIRAYLNLKKIYG---EITPEKILSTSIE--------KLVEALKPAGMYNQRA 93
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE---YLLDIEGLGLKSVECVRLLSLQHIA 673
RI E E + +L L V K++E YL+++ G+GLK+ + V L+ +
Sbjct: 94 QRIVELAKIFTERNVKEELRKL--VEEGKLREARKYLVNLPGVGLKTADVVLLMYYKQPV 151
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
FPVD ++ R++ RLG++ Y + + M L E H
Sbjct: 152 FPVDTHIRRVSKRLGYIEKD------------NYETISRWWMK------QLKPNEYLEAH 193
Query: 734 YQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
+IT G+ CK R P C CP+ CK++ +
Sbjct: 194 LLLITHGRKTCKARKPLCDKCPINKYCKYYCKTI 227
>gi|409074630|gb|EKM75023.1| hypothetical protein AGABI1DRAFT_24137, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 235
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 10/112 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP------ 643
++EA+ A + ++ DAI+ G N A I+ L + E HGS L+ L P
Sbjct: 82 NFEAIAAAPKQKVIDAIRSGGLANKKARMIQGLLQAVKEKHGSYSLQHLARKPESTSSAP 141
Query: 644 ----DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
D++ + LL +G+G K+ CV L + +FPVD +V R++ LGW+P
Sbjct: 142 ALGDDEIMDELLSYDGVGPKTASCVLLFCIGRDSFPVDTHVLRLSHLLGWIP 193
>gi|426195516|gb|EKV45446.1| hypothetical protein AGABI2DRAFT_44066, partial [Agaricus bisporus
var. bisporus H97]
Length = 234
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 10/112 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP------ 643
++EA+ A + ++ DAI+ G N A I+ L + E HGS L+ L P
Sbjct: 81 NFEAIAAAPKQKVIDAIRSGGLANKKARMIQGLLQAVKEKHGSYSLQHLARKPESTSSAP 140
Query: 644 ----DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
D++ + LL +G+G K+ CV L + +FPVD +V R++ LGW+P
Sbjct: 141 ALRDDEIMDELLSYDGVGPKTASCVLLFCIGRDSFPVDTHVLRLSHLLGWIP 192
>gi|385776592|ref|YP_005649160.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
REY15A]
gi|323475340|gb|ADX85946.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
REY15A]
Length = 233
Score = 65.9 bits (159), Expect = 1e-07, Method: Composition-based stats.
Identities = 50/188 (26%), Positives = 86/188 (45%), Gaps = 20/188 (10%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
+K V E + AD S I +A+K G A R+K ++E + + L + P
Sbjct: 61 EKKVGVTPEKLSDADLSNIENALKISGLYKTKAKRLKIISKIILEKYNGLIDNLLNSSNP 120
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
+E LL EG+G K+ + V L + FP+D ++ R++ RLG VP+
Sbjct: 121 ---REELLKFEGIGEKTADVVLLTCRGYEVFPIDTHITRVSKRLGIVPMN---------- 167
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+Y ++ L+ L +LH+ +I G+ CK R P C +C ++ C+++
Sbjct: 168 -AKYELISSTLKELF------SAYDLLQLHHLLIAHGRQTCKARKPLCNSCIIKECCEYY 220
Query: 764 ASAVASAR 771
+ AR
Sbjct: 221 SHRDGEAR 228
>gi|383788526|ref|YP_005473095.1| DNA glycosylase [Caldisericum exile AZM16c01]
gi|381364163|dbj|BAL80992.1| DNA glycosylase [Caldisericum exile AZM16c01]
Length = 241
Score = 65.9 bits (159), Expect = 1e-07, Method: Composition-based stats.
Identities = 49/167 (29%), Positives = 82/167 (49%), Gaps = 21/167 (12%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
D E+++ IKE G + A R+K FL+ + + +L L ++ + ++ LL+I+G+G
Sbjct: 79 DIQELSNYIKESGFYRLKAQRLKNFLDFFKKY--NFELLDLTHIEIENLRNELLNIKGVG 136
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
++ + + L +L+ F VD R AVR G +L+ D+I++
Sbjct: 137 KETADSIILYALEKPIFVVDNYTKRFAVRFG--------------ILENMSSYDEIRLLF 182
Query: 718 WPRLCYLDQKT----LYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L ++K E+H ++ GK FCKK PNC ACPL C
Sbjct: 183 EDALKSENEKETLIRFKEMHALIVELGKNFCKKE-PNCSACPLGNLC 228
>gi|367471322|ref|ZP_09470967.1| Endonuclease III [Patulibacter sp. I11]
gi|365813601|gb|EHN08854.1| Endonuclease III [Patulibacter sp. I11]
Length = 248
Score = 65.9 bits (159), Expect = 1e-07, Method: Composition-based stats.
Identities = 52/187 (27%), Positives = 76/187 (40%), Gaps = 34/187 (18%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS---------------ID 634
DWE V A EI AI+ G + + RI L+ + E G +D
Sbjct: 75 DWELVARAPVEEIEAAIRPGGISKVKSVRIAAILDAIAERSGPTARALGLDPAPDASPLD 134
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLG-WVPLQ 693
L W+ P +++L ++ G+G K+ CV L + PVD +V R+ RL + P
Sbjct: 135 LTWMEAAPVAAGRDFLCELPGVGRKTAACVLLFAYGLHDVPVDTHVQRVGTRLALFRPGA 194
Query: 694 PLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGA 753
PL +LH +L P + ELH ++ G+ C R P C
Sbjct: 195 PL-DELHDEMLWITP-----------------EGAGLELHVNLLRHGRRTCHARRPACSE 236
Query: 754 CPLRGEC 760
C LR C
Sbjct: 237 CALRRMC 243
>gi|386002355|ref|YP_005920654.1| hypothetical protein Mhar_1672 [Methanosaeta harundinacea 6Ac]
gi|357210411|gb|AET65031.1| hypothetical protein Mhar_1672 [Methanosaeta harundinacea 6Ac]
Length = 133
Score = 65.9 bits (159), Expect = 1e-07, Method: Composition-based stats.
Identities = 39/128 (30%), Positives = 62/128 (48%), Gaps = 17/128 (13%)
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL 692
IDLE+LR ++ +++L+ + G+G K+ V L + + PVD +V R++ RLG+VP
Sbjct: 13 IDLEFLRGADAEEARQFLVSLPGVGPKTAAVVLLFAFRMPLLPVDTHVNRLSRRLGFVPA 72
Query: 693 QPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
++I + PR Y H +I G+ C+ RSP+CG
Sbjct: 73 G-----------ASIEEAERILEEITPREKYC------SFHVNLIRHGRAVCRARSPSCG 115
Query: 753 ACPLRGEC 760
AC L C
Sbjct: 116 ACVLAEVC 123
>gi|84489308|ref|YP_447540.1| endonuclease III [Methanosphaera stadtmanae DSM 3091]
gi|84372627|gb|ABC56897.1| putative endonuclease III [Methanosphaera stadtmanae DSM 3091]
Length = 219
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 81/177 (45%), Gaps = 27/177 (15%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
E + A +IA+ +++ G N+ A RIKE N L+E + V PD ++E L
Sbjct: 68 MEEIADAPVDDIANLVRQAGFYNVKAARIKEVSNILLEEYDG--------VVPDTLEE-L 118
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L + G+G K+ CV + Q A PVDV+V RI+ RLG V P D L + P
Sbjct: 119 LKLPGVGRKTANCVLVFGFQKDAIPVDVHVHRISNRLGLVHTDK-PEDTEEVLREIVP-- 175
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
Q Y P ++ M+ FG+ CK +P CP C+ + S +
Sbjct: 176 ---QEYWLP------------INDLMVQFGQNICKPINPQHIECPFTDLCQLYKSEL 217
>gi|390959927|ref|YP_006423684.1| putative endoIII-related endonuclease [Terriglobus roseus DSM
18391]
gi|390414845|gb|AFL90349.1| putative endoIII-related endonuclease [Terriglobus roseus DSM
18391]
Length = 258
Score = 65.9 bits (159), Expect = 1e-07, Method: Composition-based stats.
Identities = 52/175 (29%), Positives = 80/175 (45%), Gaps = 14/175 (8%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D V WE +R A EI D ++ + A ++K+ L R+ E G++ L +L DKV
Sbjct: 89 DEVRWERLRDASIPEIEDTLRLATFPDRKAVQLKQTLERITERAGTLSLAFLGKYRTDKV 148
Query: 647 KEYLLDIEGLGLK-SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
+ ++ ++ G G+K S V SL+ A +D + RIA+RLG VP + L+
Sbjct: 149 RAWIEELPGAGVKASAAVVNFSSLRRRAIAIDGHHQRIAIRLGVVPANATARQVEAALVP 208
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P W + T+ E H + G+ C R +C CPLR C
Sbjct: 209 LAPAD-------WTAV------TMDEHHTLVKKLGQRVCTLREAHCNRCPLREVC 250
>gi|114566352|ref|YP_753506.1| HhH-GPD [Syntrophomonas wolfei subsp. wolfei str. Goettingen]
gi|114337287|gb|ABI68135.1| DNA-3-methyladenine glycosylase III [Syntrophomonas wolfei subsp.
wolfei str. Goettingen]
Length = 222
Score = 65.9 bits (159), Expect = 1e-07, Method: Composition-based stats.
Identities = 49/178 (27%), Positives = 81/178 (45%), Gaps = 24/178 (13%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+D+ A+R E E+AD IK A R+K L+ + E +G D++ + + P +++
Sbjct: 59 LDFPALRSIAEEELADFIKPALYHRQKARRLKILLDFIAENYGG-DIDLMFSEPLPQIRA 117
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV----PLQPLPGDLHIHLL 704
LL + G+G ++ + + L + + F VD RI RLGWV + + G + HL
Sbjct: 118 RLLALWGIGPETADSILLYAGNYPVFVVDAYTIRIFTRLGWVEDKCSYEKMQGLMQNHL- 176
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
PV D + E H ++ G +CKK+ C CPL C +
Sbjct: 177 ---PV---------------DTQIYNEYHALLVALGANYCKKKKALCQECPLAEYCPY 216
>gi|51892440|ref|YP_075131.1| endonuclease III [Symbiobacterium thermophilum IAM 14863]
gi|51856129|dbj|BAD40287.1| endonuclease III [Symbiobacterium thermophilum IAM 14863]
Length = 232
Score = 65.5 bits (158), Expect = 1e-07, Method: Composition-based stats.
Identities = 45/185 (24%), Positives = 77/185 (41%), Gaps = 17/185 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ + + E+ + G + I AG + L + G++ LE LR P D+ +L
Sbjct: 57 WDGLLGLTQVELVRLLWGSGLEEIRAGHLLAVAGLLRDRFGAVTLEPLRQWPDDECLAFL 116
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ + G+G+K+ CV L L FP D + R+ R+G + + L P
Sbjct: 117 MSLPGMGMKTALCVMLYGLDRPVFPADAHCIRVLKRMGVIDAR----------LAHRPAQ 166
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASA 770
++ + L Y+ LH ++ G+ C R P C C + C +A +A
Sbjct: 167 RELARIVPGHLGYV-------LHVNLVAHGQQICTARCPRCAECVVANYCLLGEAAQQAA 219
Query: 771 RFALP 775
LP
Sbjct: 220 EGQLP 224
>gi|452208019|ref|YP_007488141.1| endonuclease III [Natronomonas moolapensis 8.8.11]
gi|452084119|emb|CCQ37452.1| endonuclease III [Natronomonas moolapensis 8.8.11]
Length = 271
Score = 65.5 bits (158), Expect = 2e-07, Method: Composition-based stats.
Identities = 50/169 (29%), Positives = 77/169 (45%), Gaps = 24/169 (14%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSID--LEWLRNVPPDKVKEYLLDIEGLGL 658
E+A+ I G N + RI + R++ +G ++R PP +V+E LL++ G+G
Sbjct: 97 ELAETISSAGLYNQKSKRIIDIAERVLAEYGDTAGFDRFVREEPPSEVRETLLEMNGVGP 156
Query: 659 KSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K+ +CV L + + FPVD +V RIA RLG P V D ++ +
Sbjct: 157 KTADCVLLFAGGRGGVFPVDTHVHRIARRLGIAPADA----------DHEAVRDALETAV 206
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKH 762
C H MI FG+ +C R+P C ACP+ C+
Sbjct: 207 PEEKCGFG-------HTAMIQFGREYCTARTPACLEDPDACPMADVCEQ 248
>gi|413945494|gb|AFW78143.1| hypothetical protein ZEAMMB73_419140 [Zea mays]
Length = 426
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/71 (49%), Positives = 49/71 (69%), Gaps = 6/71 (8%)
Query: 553 KTTPKKNSDNTVQQDWDLLRR-IYSTGE--ERSHDKMDSVDWEAVRCADESEIADAIKER 609
+TT KN++N DW+ LRR S G+ +RS ++ DS+DWEAVRCAD I+ AI+ER
Sbjct: 355 RTTTAKNTENF---DWEKLRRQACSEGQMKQRSFERRDSIDWEAVRCADVRRISHAIRER 411
Query: 610 GQQNIIAGRIK 620
G N++A RI+
Sbjct: 412 GMNNVLAERIQ 422
>gi|222099288|ref|YP_002533856.1| Repair endonuclease [Thermotoga neapolitana DSM 4359]
gi|221571678|gb|ACM22490.1| Repair endonuclease [Thermotoga neapolitana DSM 4359]
Length = 396
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 80/169 (47%), Gaps = 19/169 (11%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + IA I+ G NI A R+K L L E + +L+ L+ +P ++E LL
Sbjct: 238 EKLSSLSTERIAHLIRPAGFFNIKAKRLKALLEFLREY--NFNLKLLKRMPLGALRELLL 295
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I+G+G ++ + + L +L+ F VD R+ R+ + L K+Y +
Sbjct: 296 KIKGIGKETADAILLYALEKPIFVVDSYTKRLLARIFNIEL------------KDYDEIQ 343
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
K+ M +P D + ELH ++ K FC K +P C CPL+ +C
Sbjct: 344 KLFMSCYPH----DVRLYQELHGLIVEHAKRFCSK-NPKCRECPLKKKC 387
>gi|118430904|ref|NP_147001.2| endonuclease III [Aeropyrum pernix K1]
gi|116062229|dbj|BAA79061.2| endonuclease III [Aeropyrum pernix K1]
Length = 229
Score = 65.1 bits (157), Expect = 2e-07, Method: Composition-based stats.
Identities = 51/172 (29%), Positives = 75/172 (43%), Gaps = 25/172 (14%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A ++ +AI+E G A +K +V G +L PP++++E L+
Sbjct: 76 EAILEASYDDLVEAIREAGLPRQKASALKALAEAVVRWGGE---NYLLKAPPEELREKLM 132
Query: 652 DIEGLGLKSVECVRLLSLQHIA---FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
I G+G K+ + LSL A F VD + R+A R G L+ E
Sbjct: 133 SIRGIGPKTADV--FLSLVRKAPGVFAVDTHAARVARRWG--------------LVGEKA 176
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D+I L+ Y E H +I G+ +CK R P C CPLR C
Sbjct: 177 GYDEISKALYN---YFGPGNSEEAHRLIIALGRTYCKARRPRCRECPLRSVC 225
>gi|292655041|ref|YP_003534938.1| endonuclease III [Haloferax volcanii DS2]
gi|448292657|ref|ZP_21483063.1| endonuclease III [Haloferax volcanii DS2]
gi|291372380|gb|ADE04607.1| endonuclease III [Haloferax volcanii DS2]
gi|445572413|gb|ELY26953.1| endonuclease III [Haloferax volcanii DS2]
Length = 268
Score = 65.1 bits (157), Expect = 2e-07, Method: Composition-based stats.
Identities = 56/198 (28%), Positives = 88/198 (44%), Gaps = 30/198 (15%)
Query: 592 EAVRCADESEIADAIKERGQQN----IIAGRIKEFLNRLVELHGSIDLE-WLRNVPPDKV 646
E++ AD I +AI+ G N +I G +E L + D + ++R P V
Sbjct: 90 ESLAAADREGIVEAIRSGGLYNQKSKLIQGVAEEVL---ADFGSEADFDRYVREEAPATV 146
Query: 647 KEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
++ LL+++G+G K+ +CV L + + FPVD +V RIA R+G P + L +
Sbjct: 147 RDRLLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGLAPADADHEGVRAALER 206
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECK 761
+ P D+K + H MI FG+ FCK R P C ACP+ C
Sbjct: 207 DIP----------------DEKCGFG-HTAMIQFGREFCKARKPACLDGPEACPMTDVCD 249
Query: 762 HFASAVASARFALPGPSE 779
A+ P ++
Sbjct: 250 MVGVDAAAQSVTDPAAAD 267
>gi|217076701|ref|YP_002334417.1| nth endonuclease III [Thermosipho africanus TCF52B]
gi|217036554|gb|ACJ75076.1| nth endonuclease III [Thermosipho africanus TCF52B]
Length = 203
Score = 65.1 bits (157), Expect = 2e-07, Method: Composition-based stats.
Identities = 53/165 (32%), Positives = 75/165 (45%), Gaps = 27/165 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A E +I + IK G A RI E +VE + I PD ++E LL + G+
Sbjct: 61 AKEEDIYELIKPAGLYRQKAKRIIEISKIIVEKYSGI--------VPDSLEE-LLKLPGV 111
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V +S A VD +V RI+ RLGWV + P + E+ +M+ +
Sbjct: 112 GRKTANIVLYVSFSKSALAVDTHVHRISNRLGWVNTKT-PEE------TEFKLMEILPKN 164
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
LW ++ M+ FGK CK SPNC CP+ CK
Sbjct: 165 LWG-----------PINGSMVEFGKKVCKPVSPNCKICPISKYCK 198
>gi|410464085|ref|ZP_11317553.1| putative endonuclease III-like protein [Desulfovibrio magneticus
str. Maddingley MBC34]
gi|409982800|gb|EKO39221.1| putative endonuclease III-like protein [Desulfovibrio magneticus
str. Maddingley MBC34]
Length = 216
Score = 65.1 bits (157), Expect = 2e-07, Method: Composition-based stats.
Identities = 46/168 (27%), Positives = 75/168 (44%), Gaps = 16/168 (9%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+D A+ A +A+ I+ G + AGR+K + +VE G DL L + ++
Sbjct: 60 LDPYALHAASLEAVAEPIRPAGHFRVKAGRLKNLMALIVEDLGG-DLTALAGYDLGQARD 118
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL ++G+G ++ + + L L AF VD RI R G VP +L + P
Sbjct: 119 KLLAVKGVGPETADSILLYGLNLPAFVVDAYTARICFRHGLVPEDAGYDELRELFMDALP 178
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
+ + +Y E H ++ G +C+ R+P C ACPL
Sbjct: 179 --EDVGLY-------------NEFHALLVRVGNAWCRPRAPKCAACPL 211
>gi|229585498|ref|YP_002844000.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
M.16.27]
gi|228020548|gb|ACP55955.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
M.16.27]
Length = 233
Score = 64.7 bits (156), Expect = 2e-07, Method: Composition-based stats.
Identities = 49/188 (26%), Positives = 87/188 (46%), Gaps = 20/188 (10%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
+K V E + A+ S+I +A+K G A R+K ++E + + L + P
Sbjct: 61 EKKVGVTPEKLSDANLSDIENALKISGLYKTKAKRLKIISKIILEKYNGLIDNLLNSSNP 120
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
+E LL EG+G K+ + V L + FP+D ++ R++ RLG VP+
Sbjct: 121 ---REELLKFEGIGEKTADVVLLTCRGYEVFPIDTHITRVSKRLGIVPMN---------- 167
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+Y ++ L+ L +LH+ +I G+ CK R P C +C ++ C+++
Sbjct: 168 -AKYELISSTLKELF------SAYDLLQLHHLLIAHGRQTCKARKPLCNSCIIKECCEYY 220
Query: 764 ASAVASAR 771
+ AR
Sbjct: 221 SHRDGEAR 228
>gi|383318935|ref|YP_005379776.1| EndoIII-related endonuclease [Methanocella conradii HZ254]
gi|379320305|gb|AFC99257.1| putative EndoIII-related endonuclease [Methanocella conradii HZ254]
Length = 291
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 81/180 (45%), Gaps = 18/180 (10%)
Query: 581 RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRN 640
R + +D + CAD+ E+ + I+ G A + E ++ E G LEW+R
Sbjct: 118 RLQRRYKGIDVHKMACADKRELEELIRFSGPYK--ASYMIECSRQIEERWGG-SLEWMRR 174
Query: 641 VPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLH 700
P D+ + LL + G+G K+ +CV L SL H PVD ++ R++ RLG L GD
Sbjct: 175 APTDEARRALLSLHGVGPKTADCVLLFSLGHSVVPVDTHICRVSQRLG---LSMSMGDSE 231
Query: 701 IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ V + ++ L K H +I G+ FCK +P CP+ C
Sbjct: 232 A---AKRRVKEDLERGL---------KMPGMAHLLIINLGRDFCKALAPLHHICPVEELC 279
>gi|419759441|ref|ZP_14285740.1| endonuclease III [Thermosipho africanus H17ap60334]
gi|407515451|gb|EKF50196.1| endonuclease III [Thermosipho africanus H17ap60334]
Length = 203
Score = 64.7 bits (156), Expect = 3e-07, Method: Composition-based stats.
Identities = 53/165 (32%), Positives = 75/165 (45%), Gaps = 27/165 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A E +I + IK G A RI E +VE + I PD ++E LL + G+
Sbjct: 61 AKEEDIYELIKPAGLYRQKAKRIIEISKIIVEKYSGI--------VPDSLEE-LLKLPGV 111
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V +S A VD +V RI+ RLGWV + P + E+ +M+ +
Sbjct: 112 GRKTANIVLYVSFSKPALAVDTHVHRISNRLGWVNTKT-PEE------TEFKLMEILPKN 164
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
LW ++ M+ FGK CK SPNC CP+ CK
Sbjct: 165 LWG-----------PINGSMVEFGKKVCKPVSPNCKICPISKYCK 198
>gi|333911156|ref|YP_004484889.1| HhH-GPD family protein [Methanotorris igneus Kol 5]
gi|333751745|gb|AEF96824.1| HhH-GPD family protein [Methanotorris igneus Kol 5]
Length = 220
Score = 64.3 bits (155), Expect = 3e-07, Method: Composition-based stats.
Identities = 52/180 (28%), Positives = 84/180 (46%), Gaps = 19/180 (10%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP- 643
K + +D + D ++ IK G NI A R+K + +V+ +GS E L +P
Sbjct: 52 KENLIDERKILEIDTEKLKKLIKPAGFYNIKAERLKNITHHIVKNYGST--ENLAKLPIK 109
Query: 644 -DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
+ +++ LL+++G+G ++ + + L +L F VD RI RLG I
Sbjct: 110 LEDLRKELLNVKGIGKETADSILLYALDRPIFVVDAYTKRIFSRLGV-----------IE 158
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+EY + I P+ D K E H ++ GK +CKKR+P C CPL C +
Sbjct: 159 GGEEYDEIRHIFEENLPK----DLKIYKEYHALIVELGKHYCKKRNPACEKCPLSDLCDY 214
>gi|374308995|ref|YP_005055425.1| iron-sulfur cluster loop [Granulicella mallensis MP5ACTX8]
gi|358751005|gb|AEU34395.1| iron-sulfur cluster loop [Granulicella mallensis MP5ACTX8]
Length = 261
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 81/191 (42%), Gaps = 15/191 (7%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
+WE+VR A +EI AI + A R+K L + E +GS+ L++L DK++ +
Sbjct: 85 NWESVRDATVAEIERAIAVVTFPEVKAPRLKAALMGITERYGSLTLDFLARYRTDKIRSW 144
Query: 650 LLDIEGLGLK-SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L EG+G + S V +L+ A +D N R+ RL VP +
Sbjct: 145 LEQFEGVGPQVSAAVVNFSTLRRRAISIDANHLRVVQRLCVVP------------RADAA 192
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
+ ++ M L P D + L E H + G+ C P C ACPL C
Sbjct: 193 ITEERLMRLVPE--TWDAEMLDEHHSLVKLHGQTLCTFSEPRCEACPLLDICPTGKRNTG 250
Query: 769 SARFALPGPSE 779
+ P P +
Sbjct: 251 ELKLTAPTPGD 261
>gi|429858184|gb|ELA33014.1| base excision dna repair protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 309
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 68/150 (45%), Gaps = 14/150 (9%)
Query: 570 LLRRIYS--TGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV 627
L+R I S T + S S+D EA++C S + + I LN++
Sbjct: 159 LVRTILSQNTSDTNSTRAKRSMDDEAIKCGGLSAVKSKV------------IISILNQVK 206
Query: 628 ELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 687
E +G L+ L + + + L+ +G+G K+ CV L LQ +F VD +V RI L
Sbjct: 207 EKYGKYSLDHLFSASNEDAMQELISFQGVGPKTASCVLLFCLQRESFAVDTHVWRITGLL 266
Query: 688 GWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
GW P + H HL P DK +++
Sbjct: 267 GWRPKSASRDETHAHLDVRIPDEDKYGLHI 296
>gi|297527052|ref|YP_003669076.1| DNA-(apurinic or apyrimidinic site) lyase [Staphylothermus
hellenicus DSM 12710]
gi|297255968|gb|ADI32177.1| DNA-(apurinic or apyrimidinic site) lyase [Staphylothermus
hellenicus DSM 12710]
Length = 234
Score = 64.3 bits (155), Expect = 3e-07, Method: Composition-based stats.
Identities = 55/210 (26%), Positives = 92/210 (43%), Gaps = 34/210 (16%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+ SD + + L++IY E + DK+ S + ++ +A+K G N A
Sbjct: 45 QNTSDKNAIRAYFNLKKIYG---EITPDKILSTPID--------KLIEALKPAGMYNQRA 93
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE---YLLDIEGLGLKSVECVRLLSLQHIA 673
RI E E +++ E + + K++E YL+ + G+GLK+ + V L+
Sbjct: 94 QRIIELAKIFTE--KNVEEELGKLIEEGKLREARKYLVSLPGVGLKTADVVLLMYYGQPV 151
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
FPVD ++ R+ RLG++ +Y + M L E H
Sbjct: 152 FPVDTHIRRVTKRLGYIGKD------------DYEAISSWWMK------QLKPNDYLETH 193
Query: 734 YQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+IT G+ CK R P C CP+R CK++
Sbjct: 194 LLLITHGRKTCKARKPLCNICPIRKYCKYY 223
>gi|257389114|ref|YP_003178887.1| DNA-(apurinic or apyrimidinic site) lyase [Halomicrobium mukohataei
DSM 12286]
gi|257171421|gb|ACV49180.1| DNA-(apurinic or apyrimidinic site) lyase [Halomicrobium mukohataei
DSM 12286]
Length = 270
Score = 64.3 bits (155), Expect = 3e-07, Method: Composition-based stats.
Identities = 62/229 (27%), Positives = 96/229 (41%), Gaps = 36/229 (15%)
Query: 557 KKNSDNTVQQDWD-LLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNII 615
+ SD Q D L+ R + GE+ + D AD+ +A+ I+ G N
Sbjct: 65 QNTSDKASQPAHDALMERYGADGEDLAATLAD---------ADQQRLAETIQPAGLHNQK 115
Query: 616 AGRIKEFLNRLVELHG---SIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQH 671
+ I R+ E +G D ++R P V+ LLD++G+G K+ +CV L + +
Sbjct: 116 SETIVRLAGRVCEEYGFGAEFD-AFVRGGDPGDVRSALLDMKGVGPKTADCVLLFAGGRG 174
Query: 672 IAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYE 731
FPVD +V RIA R+G P + L + P +K +
Sbjct: 175 GVFPVDTHVHRIARRMGLAPADADHEGVRAALEDDIPA----------------EKCGFG 218
Query: 732 LHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASARFALPG 776
H MI FG+ +C R P C ACPL +C+ + A+ P
Sbjct: 219 -HTAMIQFGREYCSARKPACLDDPEACPLADQCEQLGVSPATGEVFDPA 266
>gi|356565037|ref|XP_003550751.1| PREDICTED: DEMETER-like protein 2-like [Glycine max]
Length = 172
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 52/93 (55%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
A IK L L E G + LE+LR++ D+VK L +G+G K+V CV + +LQ FP
Sbjct: 30 ASCIKNVLRCLRERRGELCLEYLRDLSVDEVKAELSLFKGIGPKTVACVLMFNLQQDDFP 89
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
VD ++ IA +GWVP +IHL + P
Sbjct: 90 VDTHIFEIAKTMGWVPSVADKNKSYIHLNQRIP 122
>gi|424780567|ref|ZP_18207440.1| Endonuclease III [Catellicoccus marimammalium M35/04/3]
gi|422842969|gb|EKU27416.1| Endonuclease III [Catellicoccus marimammalium M35/04/3]
Length = 218
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 72/160 (45%), Gaps = 31/160 (19%)
Query: 612 QNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQH 671
+NIIA +LVE G VP D+ + L+ + G+G K+ V ++
Sbjct: 88 KNIIACS-----QKLVEEFGG-------KVPADR--KQLMSLPGVGRKTANVVLSVAFNI 133
Query: 672 IAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYE 731
AF VD ++ RIA RL VPL + ++K + P+ +T
Sbjct: 134 PAFAVDTHIQRIAKRLQIVPLDASVDE-----------VEKTITSIMPK------ETWNH 176
Query: 732 LHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASAR 771
LH+Q+I FG+ C + P C CP EC ++ S VA +
Sbjct: 177 LHHQLIYFGRYLCTAKKPKCDQCPFTKECLYYESEVAEKK 216
>gi|170101314|ref|XP_001881874.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164643229|gb|EDR07482.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 308
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 90/195 (46%), Gaps = 31/195 (15%)
Query: 580 ERSHDKMDSV----DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL 635
R+ +D+V ++ A+ A + DAI+ G N A I+ L+ + HG L
Sbjct: 120 HRAKTSLDAVFGRNNFVAIAQAPRERLVDAIRSGGLANKKAATIQNLLHSIRGKHGEYSL 179
Query: 636 EWL-------RNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLG 688
+ L R + D++ + L+ +G+G K+ CV L L +F VD +V R++ LG
Sbjct: 180 QHLAAAESSGRRMSDDEIMKELISYDGVGPKTASCVLLFCLGRNSFAVDTHVFRLSKLLG 239
Query: 689 WVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK--K 746
WVP + HL + V D+++ Y+LH MI G++ CK K
Sbjct: 240 WVPQKSDRVLAQAHL--DIRVPDELK---------------YDLHVLMIQHGRL-CKGCK 281
Query: 747 RSPNCGACPLRGECK 761
++ + AC L+ K
Sbjct: 282 KTGSGQACILKTYLK 296
>gi|73668473|ref|YP_304488.1| endonuclease III [Methanosarcina barkeri str. Fusaro]
gi|72395635|gb|AAZ69908.1| endonuclease III [Methanosarcina barkeri str. Fusaro]
Length = 204
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 76/169 (44%), Gaps = 28/169 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E VR AD SEI IK+ G + AGRIKE L+E +G VP D E LL
Sbjct: 59 EMVR-ADVSEIETLIKDVGFYRVKAGRIKEISRLLLEKYGG-------RVPDDM--EALL 108
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
++ G+G K+ CV + A VD +V RI+ RLG V + +P + L K +P
Sbjct: 109 ELPGVGRKTANCVLAHAFLKDALAVDTHVHRISNRLGLVETK-VPEETETELKKIFP--- 164
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
QK ++ ++ G+ C+ SP C C L C
Sbjct: 165 --------------QKYWRHVNLLLVKLGQNTCRPISPRCKTCTLDDIC 199
>gi|448652011|ref|ZP_21681024.1| endonuclease III [Haloarcula californiae ATCC 33799]
gi|445769414|gb|EMA20488.1| endonuclease III [Haloarcula californiae ATCC 33799]
Length = 238
Score = 63.9 bits (154), Expect = 4e-07, Method: Composition-based stats.
Identities = 51/158 (32%), Positives = 78/158 (49%), Gaps = 7/158 (4%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEE-RSHDKMDSVDW-EAVRCADESEIADAIKERGQQNI 614
+ SD Q D L Y GE+ S +DS D A+ AD+ E+A+ I G N
Sbjct: 64 QNTSDKASQSAHDDLMARYGGGEDANSEGDIDSTDLARALADADQPELAETISSAGLYNQ 123
Query: 615 IAGRIKEFLNRLVELHGS---IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQ 670
+ RI R+ E +G D ++R+ +KV+ LLD+ G+G K+ +CV L + +
Sbjct: 124 KSERIIALAQRICEEYGGEAGFD-AFVRDSDSEKVRSTLLDMNGVGPKTADCVLLFAGGR 182
Query: 671 HIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
FPVD +V RIA R+G P + +L ++ P
Sbjct: 183 GGVFPVDTHVHRIARRIGLAPADAAHETVRAYLERDVP 220
>gi|297801424|ref|XP_002868596.1| hypothetical protein ARALYDRAFT_916063 [Arabidopsis lyrata subsp.
lyrata]
gi|297314432|gb|EFH44855.1| hypothetical protein ARALYDRAFT_916063 [Arabidopsis lyrata subsp.
lyrata]
Length = 93
Score = 63.9 bits (154), Expect = 5e-07, Method: Composition-based stats.
Identities = 28/43 (65%), Positives = 34/43 (79%)
Query: 1003 FPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRREIVCFGS 1045
FPLNGTYFQ NEVFAD+++S +PI+VPR I +L R I FGS
Sbjct: 5 FPLNGTYFQTNEVFADYDSSINPIDVPRKLIWDLDRRIAYFGS 47
>gi|258570499|ref|XP_002544053.1| predicted protein [Uncinocarpus reesii 1704]
gi|237904323|gb|EEP78724.1| predicted protein [Uncinocarpus reesii 1704]
Length = 1143
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 24/128 (18%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE---------------LHGS 632
SVDW VR A EI DAIK G +I + IKE L + E HG+
Sbjct: 244 SVDWNKVRHAPVGEIRDAIKSGGLADIKSKHIKEILTMVYEENIARRNELQTDGKAKHGN 303
Query: 633 ---------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
+ L+++ + D+ + + G+G+K+ CV L L+ F VD +V R+
Sbjct: 304 RVDHADEHMLSLDYMHALSKDEAMQRFIKYPGIGVKTAACVVLFCLRRPCFAVDTHVFRL 363
Query: 684 AVRLGWVP 691
+ LGW+P
Sbjct: 364 SKWLGWIP 371
>gi|448572049|ref|ZP_21640138.1| endonuclease III [Haloferax lucentense DSM 14919]
gi|448596871|ref|ZP_21654009.1| endonuclease III [Haloferax alexandrinus JCM 10717]
gi|445721082|gb|ELZ72751.1| endonuclease III [Haloferax lucentense DSM 14919]
gi|445740752|gb|ELZ92257.1| endonuclease III [Haloferax alexandrinus JCM 10717]
Length = 268
Score = 63.9 bits (154), Expect = 5e-07, Method: Composition-based stats.
Identities = 54/195 (27%), Positives = 87/195 (44%), Gaps = 24/195 (12%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS-IDLE-WLRNVPPDKVKEY 649
E++ AD I +AI+ G N + I+ ++ GS D + ++R P V++
Sbjct: 90 ESLAAADREGIVEAIRSGGLYNQKSKLIRGVAEEVLADFGSEADFDRYVREEAPATVRDR 149
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL+++G+G K+ +CV L + + FPVD +V RIA R+G P + L ++ P
Sbjct: 150 LLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGLAPADADHEGVRAALERDIP 209
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
D+K + H MI FG+ FC R P C ACP+ C
Sbjct: 210 ----------------DEKCGFG-HTAMIQFGREFCTARKPACLDGPDACPMADVCDMVG 252
Query: 765 SAVASARFALPGPSE 779
A+ P ++
Sbjct: 253 VDAAAQSVTDPAAAD 267
>gi|269837103|ref|YP_003319331.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerobacter
thermophilus DSM 20745]
gi|269786366|gb|ACZ38509.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerobacter
thermophilus DSM 20745]
Length = 247
Score = 63.9 bits (154), Expect = 5e-07, Method: Composition-based stats.
Identities = 48/181 (26%), Positives = 76/181 (41%), Gaps = 17/181 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V A E+ADAI+ G A RI+ L + L L +P + K L
Sbjct: 77 WDEVVAAPVEEVADAIRSGGLARQKAPRIQAALAAALNSGEDPPLASLFTLPLPEAKRRL 136
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G K+ CV L + A PVD +V R++ R+G + + E
Sbjct: 137 TSLPGIGPKTAACVLLFACGRPALPVDTHVYRVSRRVGLID----------QGVSEAAAH 186
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASA 770
D+++ L P +Y H +I G+ CK P C C + C ++ + S+
Sbjct: 187 DRLEPLLKP-------DEVYPFHVGLIRHGRRVCKATRPRCDECCISDLCDYYQAGCQSS 239
Query: 771 R 771
R
Sbjct: 240 R 240
>gi|226356223|ref|YP_002785963.1| DNA-(apurinic or apyrimidinic site) lyase [Deinococcus deserti
VCD115]
gi|226318213|gb|ACO46209.1| putative DNA-(apurinic or apyrimidinic site) lyase (Endonuclease
III) [Deinococcus deserti VCD115]
Length = 237
Score = 63.9 bits (154), Expect = 5e-07, Method: Composition-based stats.
Identities = 51/189 (26%), Positives = 79/189 (41%), Gaps = 18/189 (9%)
Query: 591 WEAVRCADESEIADAIKERGQ--QNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
WEA I ++ G + AG I L L E G++DL +R + + +
Sbjct: 54 WEAALLDGPDGIETVLRAAGGGLAQMKAGYIHGILVHLDERLGTLDLSAVRKLNDQEART 113
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L + G+G+K+ + L L A PVD N+ RIA RL VP + P
Sbjct: 114 LLEGLPGVGMKTASLILLFDLLRPALPVDTNIERIAKRLELVPQRWTP------------ 161
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
+K++ + + + D H + G++ C+ R P C C LRG C A+ +
Sbjct: 162 --EKVERW-FDAVVRRDWAERATFHVAGVRHGRLTCRPRDPRCDQCVLRGLCPS-AALLG 217
Query: 769 SARFALPGP 777
F GP
Sbjct: 218 PDAFKAAGP 226
>gi|76802666|ref|YP_330761.1| repair DNA N-glycosylase / DNA-(apurinic or apyrimidinic site)
lyase [Natronomonas pharaonis DSM 2160]
gi|76558531|emb|CAI50123.1| endonuclease III [Natronomonas pharaonis DSM 2160]
Length = 268
Score = 63.9 bits (154), Expect = 5e-07, Method: Composition-based stats.
Identities = 51/169 (30%), Positives = 76/169 (44%), Gaps = 24/169 (14%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID--LEWLRNVPPDKVKEYLLDIEGL 656
+ +A+ I G N + R+ +VE +G D ++ PPD+V+E LL++ G+
Sbjct: 95 QPTLAETISGAGLYNQKSTRLIALAEFVVETYGGADGFDGFVTEAPPDEVRETLLELNGV 154
Query: 657 GLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
G K+ +CV L S + FPVD +V RIA R+G P + L + P
Sbjct: 155 GPKTADCVLLFSGGRDGVFPVDTHVHRIARRMGLAPADADHEAVRSALEADVP------- 207
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
+K + H MI FG+ +C R P C ACPL C
Sbjct: 208 ---------GEKCGFG-HTAMIQFGREYCTARKPACLDDPEACPLADRC 246
>gi|116747801|ref|YP_844488.1| HhH-GPD family protein [Syntrophobacter fumaroxidans MPOB]
gi|116696865|gb|ABK16053.1| DNA-3-methyladenine glycosylase III [Syntrophobacter fumaroxidans
MPOB]
Length = 222
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 77/170 (45%), Gaps = 17/170 (10%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+ +EA+ E +A+ I+ G N A ++K F + E G L+ D ++
Sbjct: 62 LSFEALCAVPEPLLAELIRSSGYYNQKARKLKAFCRHVCET-GHAGLDGFLAQDTDTLRS 120
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL I G+G ++ + + L + +F VD R+ R GWV +E P
Sbjct: 121 ELLRIRGVGPETADSIVLYAAHKPSFVVDTYTHRVFSRHGWV--------------QESP 166
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
D+++ + L D ELH ++ G +FC+K +P CG CPL G
Sbjct: 167 SYDELRGFFMDCL-EPDVGLFQELHALLVRTGHLFCRK-TPRCGGCPLEG 214
>gi|94986389|ref|YP_605753.1| HhH-GPD [Deinococcus geothermalis DSM 11300]
gi|94556670|gb|ABF46584.1| HhH-GPD [Deinococcus geothermalis DSM 11300]
Length = 242
Score = 63.5 bits (153), Expect = 6e-07, Method: Composition-based stats.
Identities = 50/196 (25%), Positives = 82/196 (41%), Gaps = 17/196 (8%)
Query: 570 LLRRIYSTGEERSHDKMDSV-DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE 628
L +R EE ++ + + DW+A+ A +A AI+ A RI+ L + +
Sbjct: 51 LSQRTTQQDEELAYQALRQLGDWDAIIAAPTEAVAHAIRRSNYPESKAPRIQATLRAIRD 110
Query: 629 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLG 688
GS DL++L +P ++L + G+G+K+ V L + FPVD +V R+ R+G
Sbjct: 111 ERGSYDLDFLAELPVKDALKWLTALPGVGIKTASLVLLFNYARPVFPVDTHVHRVTTRVG 170
Query: 689 WVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRS 748
+ G+ H + L+ELH ++ G+ C
Sbjct: 171 VIGRM---GEQAAHRALLQLLPPD-------------PPFLFELHLNLLRHGQKVCTWVR 214
Query: 749 PNCGACPLRGECKHFA 764
P C C LR C A
Sbjct: 215 PKCPVCVLRARCDAHA 230
>gi|260892299|ref|YP_003238396.1| DNA-(apurinic or apyrimidinic site) lyase [Ammonifex degensii KC4]
gi|260864440|gb|ACX51546.1| DNA-(apurinic or apyrimidinic site) lyase [Ammonifex degensii KC4]
Length = 240
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 77/164 (46%), Gaps = 23/164 (14%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
+E E+A+ ++ G A ++ +V+ DL+ L N+P + ++ LL + G+
Sbjct: 82 CEEQELAEILRPVGLHRRKAKLLRTLAREVVDF----DLKALSNLPTFEARQRLLRLPGV 137
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + V LL L H FPVD ++ RI RL W + P ++IQ
Sbjct: 138 GPKTAD-VLLLHLGHPLFPVDTHIARITWRLSWA---------------KRPCYEEIQK- 180
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+W L + E H ++I +G+ C+ R P C C LR C
Sbjct: 181 VWMEL--FSPEDYQEAHLRLIQWGREICQARKPRCFTCFLRICC 222
>gi|407465823|ref|YP_006776705.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosopumilus sp. AR2]
gi|407049011|gb|AFS83763.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosopumilus sp. AR2]
Length = 216
Score = 63.2 bits (152), Expect = 7e-07, Method: Composition-based stats.
Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 20/145 (13%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++E+ ID ++ VP + E L+++ G+G K+ CV + + + A PVD++V RI+
Sbjct: 92 RIIEVAKIIDKKYKGKVPDNL--ETLVELPGVGRKTANCVLVYAFEKPAIPVDIHVHRIS 149
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RLG V + P + E +M KIQ W +++ + +G+ C
Sbjct: 150 NRLGLVDTKN-PEE------TEQELMKKIQKKYW-----------IDINDTFVMYGQNIC 191
Query: 745 KKRSPNCGACPLRGECKHFASAVAS 769
K SP C C ++ CK + S AS
Sbjct: 192 KPISPMCNVCKIKKSCKFYKSKNAS 216
>gi|115375240|ref|ZP_01462505.1| endonuclease III (DNA-(apurinic or apyrimidinic site)lyase)
[Stigmatella aurantiaca DW4/3-1]
gi|310820220|ref|YP_003952578.1| endonuclease III [Stigmatella aurantiaca DW4/3-1]
gi|115367707|gb|EAU66677.1| endonuclease III (DNA-(apurinic or apyrimidinic site)lyase)
[Stigmatella aurantiaca DW4/3-1]
gi|309393292|gb|ADO70751.1| Endonuclease III (DNA-(Apurinic or apyrimidinic site)lyase)
[Stigmatella aurantiaca DW4/3-1]
Length = 237
Score = 63.2 bits (152), Expect = 7e-07, Method: Composition-based stats.
Identities = 52/180 (28%), Positives = 83/180 (46%), Gaps = 23/180 (12%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHG-SIDLEWLRNVPPDKVK 647
V WEAVR A +E+ +AI A R++ L + G + L++L +P + +
Sbjct: 67 VTWEAVRDAPCAEVQEAIAPVTWPEQKAPRLQHILREITARRGGDMALDFLGALPVPQAR 126
Query: 648 EYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGW----VPLQPLPGDLHIH 702
+L + G+G K+ V L S L+ A PVD + R+AVRLG +P+ P L
Sbjct: 127 AWLESLPGVGPKTSAAVLLFSRLRRPALPVDSHHYRVAVRLGLLSARIPVGPSHALLAAL 186
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L +E+ + +Y+ H ++ G+ C +SP CG CP+ C H
Sbjct: 187 LPQEW-----------------GAQQVYDHHEVLMLHGQRCCYHQSPACGRCPVLELCPH 229
>gi|222444488|ref|ZP_03607003.1| hypothetical protein METSMIALI_00099 [Methanobrevibacter smithii
DSM 2375]
gi|261350964|ref|ZP_05976381.1| endonuclease III [Methanobrevibacter smithii DSM 2374]
gi|222434053|gb|EEE41218.1| endonuclease III [Methanobrevibacter smithii DSM 2375]
gi|288860304|gb|EFC92602.1| endonuclease III [Methanobrevibacter smithii DSM 2374]
Length = 208
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 76/166 (45%), Gaps = 27/166 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A ++ + I+ G + A RIKE L++ +G PD +KE L+++ G+
Sbjct: 69 APTDDVEELIRCSGFYRVKAARIKEVSRILIDQYGG--------EVPDNLKE-LVELPGV 119
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ CV + + + A PVD +V RI+ R+G V + P + L K P
Sbjct: 120 GRKTANCVLVYAFELPAIPVDTHVHRISNRIGLVNTKT-PEQTEVELAKIAP------KE 172
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LW +L L M+ FG+ CK SP C CP+ C +
Sbjct: 173 LWIKLNDL-----------MVQFGQTICKPMSPQCEMCPISDICDY 207
>gi|448542485|ref|ZP_21624719.1| endonuclease III [Haloferax sp. ATCC BAA-646]
gi|448549955|ref|ZP_21628560.1| endonuclease III [Haloferax sp. ATCC BAA-645]
gi|445707085|gb|ELZ58948.1| endonuclease III [Haloferax sp. ATCC BAA-646]
gi|445713003|gb|ELZ64784.1| endonuclease III [Haloferax sp. ATCC BAA-645]
Length = 270
Score = 63.2 bits (152), Expect = 8e-07, Method: Composition-based stats.
Identities = 54/179 (30%), Positives = 82/179 (45%), Gaps = 30/179 (16%)
Query: 592 EAVRCADESEIADAIKERGQQN----IIAGRIKEFLNRLVELHGSIDLE-WLRNVPPDKV 646
E++ AD I +AI+ G N +I G +E L + D + ++R P V
Sbjct: 90 ESLAAADREGIIEAIRSGGLYNQKSKLIQGVAEEVL---ADFGSEADFDRYVREEAPATV 146
Query: 647 KEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
++ LL+++G+G K+ +CV L + + FPVD +V RIA R+G P + L +
Sbjct: 147 RDRLLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRMGLAPADADHEGVRAALER 206
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
+ P D+K + H MI FG+ FC R P C ACPL C
Sbjct: 207 DIP----------------DEKCGFG-HTAMIQFGREFCTARKPACLDGPDACPLYDLC 248
>gi|449469507|ref|XP_004152461.1| PREDICTED: DEMETER-like protein 2-like [Cucumis sativus]
Length = 234
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 52/90 (57%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V A+ I DAI+ G + A IK L+ L + G + LE+LR++ D++K L
Sbjct: 143 WEDVLSAESKCIEDAIRCGGLASTKASCIKNILSSLSKRRGKLCLEFLRDLSIDEIKAEL 202
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNV 680
+G+G K+V CV + +LQ FPVD +V
Sbjct: 203 STFKGIGPKTVACVLMFNLQKDDFPVDTHV 232
>gi|85375167|ref|YP_459229.1| endonuclease III family protein [Erythrobacter litoralis HTCC2594]
gi|84788250|gb|ABC64432.1| endonuclease III family protein [Erythrobacter litoralis HTCC2594]
Length = 225
Score = 63.2 bits (152), Expect = 9e-07, Method: Composition-based stats.
Identities = 48/180 (26%), Positives = 82/180 (45%), Gaps = 14/180 (7%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WEAV A E+ ++ + ++ A R+K+ LN ++ G++DL L N+ + E+L
Sbjct: 58 WEAVAAAPVEELEVRLQRQTFPSVAAQRLKDCLNAIIAERGAVDLRHLSNLAIEDAMEWL 117
Query: 651 LDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+ G+ K+ V SL + A VD + RI R+G VP + + Y
Sbjct: 118 ERLPGVARKNSAGVMNASLFERKAMVVDGHHRRIMQRMGMVPPKA-------DTARTYDA 170
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
+ I W + E H + G+ C+ R+P+C CP+R +C+ A A+
Sbjct: 171 LMPIVPEEW------SAADMDEHHLLLKKLGQTHCRPRAPHCEGCPVRADCRTGTGARAA 224
>gi|385810254|ref|YP_005846650.1| endonuclease III [Ignavibacterium album JCM 16511]
gi|383802302|gb|AFH49382.1| Putative endonuclease III [Ignavibacterium album JCM 16511]
Length = 236
Score = 63.2 bits (152), Expect = 9e-07, Method: Composition-based stats.
Identities = 44/172 (25%), Positives = 72/172 (41%), Gaps = 18/172 (10%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V S+I IK G A IK L L + HG + L L+ D+V + L
Sbjct: 60 WEDVLSLKTSQIERIIKVAGLGKQKAKAIKNLLKNLKKYHGKLSLTHLKKKSDDEVLDEL 119
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ +G+G+K+ CV L + PVD +V RI R+G V
Sbjct: 120 ILHKGIGVKTASCVLLFAFDRNVCPVDTHVHRILNRVGVVETS-----------NPEKTF 168
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
++I+ ++ + + H ++ G+ +C +P C CP+ C +
Sbjct: 169 NEIK-------SHIPKGAAHSFHTNLLRLGREYCTPTNPKCYECPIENVCNY 213
>gi|361069805|gb|AEW09214.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
Length = 137
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 67/130 (51%), Gaps = 21/130 (16%)
Query: 749 PNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEF------GNGIGQSPPLVVNPI 802
PNC ACP+R +CKHFASA SAR L E T+ N S PL P+
Sbjct: 1 PNCNACPMRCDCKHFASAFTSARLLLTSTKETSSDTNPMLTLPYKENENAPSTPL---PL 57
Query: 803 PVIR--------IEADPVS--ESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIG 852
P I+ E P+ +S + + CEPIIEEP +P P+ E+ + +++ ++
Sbjct: 58 PTIQQDVNIEKPAEVLPLQLLKSDPEDHKCEPIIEEPTTPEPEFIENTNTIVEECAYT-- 115
Query: 853 DIEEIPTLRL 862
D +EIP ++L
Sbjct: 116 DPDEIPVIKL 125
>gi|148642332|ref|YP_001272845.1| endonuclease III [Methanobrevibacter smithii ATCC 35061]
gi|148551349|gb|ABQ86477.1| endonuclease III [Methanobrevibacter smithii ATCC 35061]
Length = 210
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 76/166 (45%), Gaps = 27/166 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A ++ + I+ G + A RIKE L++ +G PD +KE L+++ G+
Sbjct: 71 APTDDVEELIRCSGFYRVKAARIKEVSRILIDQYGG--------EVPDNLKE-LVELPGV 121
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ CV + + + A PVD +V RI+ R+G V + P + L K P
Sbjct: 122 GRKTANCVLVYAFELPAIPVDTHVHRISNRIGLVNTKT-PEQTEVELAKIAP------KE 174
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LW +L L M+ FG+ CK SP C CP+ C +
Sbjct: 175 LWIKLNDL-----------MVQFGQTICKPMSPQCEMCPISDICDY 209
>gi|328858824|gb|EGG07935.1| hypothetical protein MELLADRAFT_77474 [Melampsora larici-populina
98AG31]
Length = 371
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 85/182 (46%), Gaps = 23/182 (12%)
Query: 573 RIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHG 631
R YS ER + ++E +R + E+ + I+ G + I LN+++ + G
Sbjct: 183 RAYSKIIER----YGNANFEDIRKSGIKELTETIRVGGLAERKSKVIITILNQIISKGDG 238
Query: 632 SIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
+ L+ LR + ++V + L++ +G+G+K+ CV + L FPVD +V R++ LGWVP
Sbjct: 239 ILSLDKLRLMSDEQVMQELVEFDGVGIKTGACVSMFCLGRDTFPVDTHVHRLSKSLGWVP 298
Query: 692 LQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
+ HL + P D K Y LH +I G+ C++ SP
Sbjct: 299 PKATRDQTFFHLNLQLPN---------------DLK--YALHILLIRHGQ-SCRQCSPTS 340
Query: 752 GA 753
A
Sbjct: 341 KA 342
>gi|147919100|ref|YP_687169.1| endonuclease III [Methanocella arvoryzae MRE50]
gi|110622565|emb|CAJ37843.1| predicted endonuclease III [Methanocella arvoryzae MRE50]
Length = 243
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 88/201 (43%), Gaps = 21/201 (10%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDKVKEYL 650
E + A ES IAD I+ G A IK ++ E G++D ++ P+ ++ L
Sbjct: 63 EQLASAPESAIADLIRIGGLHEQKAKLIKNISQLVIDEYDGTLD--FVCETDPEVARKEL 120
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L I+G+G K+ +CV L S PVD +V RI RLG VP + H +M
Sbjct: 121 LTIKGVGPKTADCVLLFSCDRDVIPVDTHVFRITKRLGIVPEKA------DHEKARQILM 174
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASA 770
+K+ L H +I FG+ CK ++P C L C +A V +A
Sbjct: 175 EKVPEGLRG-----------STHVALIKFGREICKAQNPRHDQCFLLDLCD-YARQVGTA 222
Query: 771 RFALPGPSEKGIVTSEFGNGI 791
R G +E V S G+
Sbjct: 223 RGPECGRTECDSVHSHRDKGL 243
>gi|88602750|ref|YP_502928.1| HhH-GPD [Methanospirillum hungatei JF-1]
gi|88188212|gb|ABD41209.1| DNA-3-methyladenine glycosylase III [Methanospirillum hungatei
JF-1]
Length = 226
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 80/171 (46%), Gaps = 17/171 (9%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D +D + + A + IA I+ N A +I FL+ +E D+ + ++P D+V
Sbjct: 60 DLMDADILYQAGQDTIAPLIRSSRYYNQKARKIMTFLSWFLETCDG-DVSRMASLPTDQV 118
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E LL++ G G ++V+ + L +L+ F VD RI R GW ++
Sbjct: 119 REELLNLSGFGPETVDSILLYALEKPVFVVDAYTRRIGSRQGW--------------FED 164
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
D++Q + RL D + H Q++ G CKK +P C CP+R
Sbjct: 165 DASYDQMQEFFMNRLS-PDVPLYNDYHAQIVYLGNKLCKK-TPLCSICPVR 213
>gi|337287836|ref|YP_004627308.1| HhH-GPD family protein [Thermodesulfobacterium sp. OPB45]
gi|334901574|gb|AEH22380.1| HhH-GPD family protein [Thermodesulfobacterium geofontis OPF15]
Length = 218
Score = 62.8 bits (151), Expect = 1e-06, Method: Composition-based stats.
Identities = 49/156 (31%), Positives = 72/156 (46%), Gaps = 16/156 (10%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
++ IK G N A R+K F+ L+E + DL L + K +E LL+I+GLG ++V
Sbjct: 74 LSQIIKPSGFYNTKAKRLKNFVKFLIENYQG-DLNILFSKGLKKAREELLNIKGLGKETV 132
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
+ + L + F VD RI R VP + Y M + M P+
Sbjct: 133 DSILLYAGNLPIFVVDAYTYRILHRHSLVPEE-----------ATYEEMQALFMENLPQ- 180
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
D + E H ++ GK FCKK+ P C CPL+
Sbjct: 181 ---DPQLFNEFHALLVACGKNFCKKKEPLCETCPLK 213
>gi|317153228|ref|YP_004121276.1| HhH-GPD family protein [Desulfovibrio aespoeensis Aspo-2]
gi|316943479|gb|ADU62530.1| HhH-GPD family protein [Desulfovibrio aespoeensis Aspo-2]
Length = 219
Score = 62.8 bits (151), Expect = 1e-06, Method: Composition-based stats.
Identities = 47/171 (27%), Positives = 79/171 (46%), Gaps = 16/171 (9%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D + A+ D E+A+ I+ G NI A R++ FLN L + G ++E L+ D++
Sbjct: 58 DLLSARAMHALDTGELAELIRPAGYYNIKAARLRNFLNFLNDEAG-FEIESLKTQGMDEL 116
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+ +L I G+G ++ + + L +L+ F VD R+ R G H +
Sbjct: 117 RSKVLSINGVGPETADSILLYALEMPTFVVDAYTYRMMDRHGLA-----------HEGID 165
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
Y + I M P D E H ++ GK +C+K++ C CPL+
Sbjct: 166 YHGLRSIFMDALPE----DVSLYNEFHALIVRVGKDWCRKKAGLCATCPLQ 212
>gi|433424488|ref|ZP_20406484.1| endonuclease III [Haloferax sp. BAB2207]
gi|432198075|gb|ELK54399.1| endonuclease III [Haloferax sp. BAB2207]
Length = 268
Score = 62.8 bits (151), Expect = 1e-06, Method: Composition-based stats.
Identities = 54/195 (27%), Positives = 86/195 (44%), Gaps = 24/195 (12%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS-IDLE-WLRNVPPDKVKEY 649
E++ AD I +AI+ G N + I+ ++ GS D + ++R P V++
Sbjct: 90 ESLAAADREGIVEAIRSGGLYNQKSKLIRGVAEEVLADFGSEADFDRYVREEAPATVRDR 149
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL+++G+G K+ +CV L + + FPVD +V RIA R+G P + L + P
Sbjct: 150 LLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGLAPADADHEGVRAALEHDIP 209
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
D+K + H MI FG+ FC R P C ACP+ C
Sbjct: 210 ----------------DEKCGFG-HTAMIQFGREFCTARKPACLDGPDACPMADVCDMVG 252
Query: 765 SAVASARFALPGPSE 779
A+ P ++
Sbjct: 253 VDAAAQSVTDPAAAD 267
>gi|15789648|ref|NP_279472.1| endonuclease III [Halobacterium sp. NRC-1]
gi|169235361|ref|YP_001688561.1| DNA-(apurinic or apyrimidinic site) lyase endonuclease III
[Halobacterium salinarum R1]
gi|10580010|gb|AAG18952.1| endonuclease III [Halobacterium sp. NRC-1]
gi|167726427|emb|CAP13212.1| endonuclease III [Halobacterium salinarum R1]
Length = 265
Score = 62.8 bits (151), Expect = 1e-06, Method: Composition-based stats.
Identities = 53/185 (28%), Positives = 77/185 (41%), Gaps = 24/185 (12%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEGLGLK 659
+AD I G N + I +R+ E +G D ++R PD V+ LLD+ G+G K
Sbjct: 98 LADTISMAGLYNQKSATIIRIADRVCEEYGGADGFDAFVREGDPDAVRAALLDMTGVGPK 157
Query: 660 SVECVRLLSLQHIA-FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
+ +CV L + FPVD +V RIA R+G + P D V ++ +
Sbjct: 158 TADCVLLFAGGAPGVFPVDTHVHRIARRMG---IAPAAAD-------HEAVRAALEAAVP 207
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASARFAL 774
C H MI FG+ +C R P C ACPL C A+
Sbjct: 208 DAACGFG-------HTAMIQFGREYCTARDPACLDDPAACPLAEHCDRVGVDAATGAVVD 260
Query: 775 PGPSE 779
P ++
Sbjct: 261 PADAD 265
>gi|400601349|gb|EJP68992.1| base excision DNA repair protein [Beauveria bassiana ARSEF 2860]
Length = 361
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 77/189 (40%), Gaps = 24/189 (12%)
Query: 576 STGEERSHDKMDSV-----DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH 630
S R+ MD+V WEA+ ++++ AI+ G + I + L + +
Sbjct: 186 SANSTRAKQSMDTVYGGSDQWEAIAAGGQAKLQRAIQSGGLAATKSRVIIDILAAVRAKY 245
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV 690
G L+ L + E LL G+G K+ C+ L LQ +F VD +V R+ LGW
Sbjct: 246 GVYSLDHLFTAGDAEAMEELLAFPGVGPKTASCILLFCLQRPSFAVDTHVYRLTGVLGWR 305
Query: 691 PLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI--FCKKRS 748
P + HL + P D+++ Y LH +I G+ C ++
Sbjct: 306 PYEATREQAQAHLDAKVP--DELK---------------YPLHVLLIAHGRTCSACNAKA 348
Query: 749 PNCGACPLR 757
C LR
Sbjct: 349 AAGQTCKLR 357
>gi|374633877|ref|ZP_09706242.1| putative endoIII-related endonuclease [Metallosphaera
yellowstonensis MK1]
gi|373523665|gb|EHP68585.1| putative endoIII-related endonuclease [Metallosphaera
yellowstonensis MK1]
Length = 229
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 80/175 (45%), Gaps = 22/175 (12%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A EIA AI+ G N A I+E +++++ LE + N PDK +E L+ + G+
Sbjct: 67 ASIEEIAKAIRPAGLHNNKAKNIRE-ISKIIHESFHDLLEDILNQSPDKAREILVSMPGI 125
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + V L+ + FPVD ++ RI+ RLG + G + Y + +
Sbjct: 126 GTKTADVVLLICKGYRTFPVDTHIFRISRRLG------IEG-------RNYSEISSV--- 169
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASAR 771
W R K H +IT G+ CK P C C L C+++ + S R
Sbjct: 170 -WVR----HVKDPLNAHLLLITHGRKTCKAVKPKCQECVLIDCCQYYLGVLRSER 219
>gi|220931064|ref|YP_002507972.1| HhH-GPD family protein [Halothermothrix orenii H 168]
gi|219992374|gb|ACL68977.1| HhH-GPD family protein [Halothermothrix orenii H 168]
Length = 224
Score = 62.8 bits (151), Expect = 1e-06, Method: Composition-based stats.
Identities = 44/155 (28%), Positives = 77/155 (49%), Gaps = 18/155 (11%)
Query: 603 ADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVE 662
A IK G N+ A +IK F+N L E +G L+ + P K+++ LL++ G+G ++ +
Sbjct: 84 AKMIKPAGYYNMKARKIKAFINFLFEDYGG-SLDEMFQEPLSKIRDKLLEVYGIGPETAD 142
Query: 663 CVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLC 722
+ L + + F +D RI R+G++ ++ H L++ +MD +
Sbjct: 143 SILLYAGEFPVFVIDAYTKRIFSRIGYIE-----ENIGYHTLQKM-IMDNLPA------- 189
Query: 723 YLDQKTLY-ELHYQMITFGKIFCKKRSPNCGACPL 756
+ +Y E H ++ GK CKK +P C CPL
Sbjct: 190 ---RTGIYNEYHALLVALGKEICKKNNPLCEKCPL 221
>gi|335437951|ref|ZP_08560708.1| DNA-(apurinic or apyrimidinic site) lyase [Halorhabdus tiamatea
SARL4B]
gi|334893555|gb|EGM31766.1| DNA-(apurinic or apyrimidinic site) lyase [Halorhabdus tiamatea
SARL4B]
Length = 281
Score = 62.4 bits (150), Expect = 1e-06, Method: Composition-based stats.
Identities = 61/213 (28%), Positives = 95/213 (44%), Gaps = 26/213 (12%)
Query: 557 KKNSDNTVQQDWDLLRRIYST--GEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNI 614
+ +D Q D L Y + GE+ + ++ +A+ A+++E+A+ I G N
Sbjct: 65 QNTTDTASQPAHDALLERYGSLDGEDADAAESETDLVDALADAEQAELAETISGAGLYNQ 124
Query: 615 IAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQH 671
+ RI + R+ E +G D ++R P + V+E LL + G+G K+ +CV L + +
Sbjct: 125 KSARISQIAERVREKYGGEDEFDAFVREEPAEAVRETLLAMTGVGPKTADCVLLFAGGRD 184
Query: 672 IAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYE 731
FPVD +V RI RLG P P + + E V +K C
Sbjct: 185 GVFPVDTHVHRIYRRLGVAP--PDADHEAVRAVLEETVPEK--------KCGFG------ 228
Query: 732 LHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
H I FG+ +C R P C ACPL C
Sbjct: 229 -HTATIQFGREYCSARKPACLDGPEACPLDDLC 260
>gi|197124333|ref|YP_002136284.1| DNA-cytosine methyltransferase [Anaeromyxobacter sp. K]
gi|196174182|gb|ACG75155.1| DNA-cytosine methyltransferase [Anaeromyxobacter sp. K]
Length = 657
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 108/277 (38%), Gaps = 51/277 (18%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+K ++ Q +D L+R +++ WE + A E E+ + G
Sbjct: 67 RKTREDAYQATYDALKRRFAS-------------WEELLRAPEREVEAIVHRGGLGKRKT 113
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPV 676
+ L LV+ GS L + ++E+L + + KS C+ + S+ FPV
Sbjct: 114 ASLVGALQALVDRFGSCTLRPALQWKDEALEEFLCSLPEISRKSAYCIMMYSMGRSVFPV 173
Query: 677 DVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV--MDKIQMYLWPRLCYLDQKTLYE--- 731
D +VGR+ RLG I+ + + +D Q+ Q+TL +
Sbjct: 174 DTHVGRVLQRLG------------IYKGTGFSLEGLDHKQL----------QRTLADVVP 211
Query: 732 ------LHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTS 785
LH ++ G+ CK +P C AC LR C H+ AS A P+ V
Sbjct: 212 PNLRRSLHINLVLHGREVCKAVAPACDACELRQLCSHYRDHEASRVEASDAPT----VVD 267
Query: 786 EFGNGIGQSPPLVVNPIPVI-RIEADPVSESGYQINN 821
F G S ++ ++ DPV+ +N+
Sbjct: 268 LFCGAGGLSEGFTRAGFRLVAAVDRDPVALKTLWLNH 304
>gi|383158249|gb|AFG61482.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158250|gb|AFG61483.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158251|gb|AFG61484.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158252|gb|AFG61485.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158253|gb|AFG61486.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158254|gb|AFG61487.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158255|gb|AFG61488.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158256|gb|AFG61489.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158257|gb|AFG61490.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158258|gb|AFG61491.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158259|gb|AFG61492.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158260|gb|AFG61493.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158261|gb|AFG61494.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158262|gb|AFG61495.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158263|gb|AFG61496.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158264|gb|AFG61497.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
gi|383158265|gb|AFG61498.1| Pinus taeda anonymous locus UMN_535_01 genomic sequence
Length = 137
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 69/130 (53%), Gaps = 21/130 (16%)
Query: 749 PNCGACPLRGECKHFASAVASARFALPGPSEKG-----IVTSEFG-NGIGQSPPLVVNPI 802
PNC ACP+R +CKHFASA SAR L E ++T + N S PL P+
Sbjct: 1 PNCNACPMRCDCKHFASAFTSARLLLTSTKESSSDINPMLTLPYKENENAPSTPL---PL 57
Query: 803 PVIR--------IEADPVS--ESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIG 852
P I+ E P+ +S + + CEPIIEEP +P P+ E+ + +++ ++
Sbjct: 58 PTIQQDVNIEKPAEVLPLQLLKSDPEDHKCEPIIEEPTTPEPEFIENTNTIVEECAYT-- 115
Query: 853 DIEEIPTLRL 862
D +EIP ++L
Sbjct: 116 DPDEIPVIKL 125
>gi|167043370|gb|ABZ08073.1| putative HhH-GPD superfamily base excision DNA repair protein
[uncultured marine crenarchaeote HF4000_ANIW141O9]
Length = 217
Score = 62.4 bits (150), Expect = 1e-06, Method: Composition-based stats.
Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 29/174 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNR-LVELHGSIDLEWLRNVPPDKVKEYLLDIEG 655
A ++ IK G ++ + RI E N L + HG VP D K L++I G
Sbjct: 72 AKAKDVEKIIKSVGFYHVKSRRIIEVANIILTKYHG--------KVPADIDK--LVEIPG 121
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+G K+ CV + + + A PVD +V RI+ RLG V + P + + L K+ P
Sbjct: 122 VGRKTANCVLVYAFEKPAIPVDTHVHRISNRLGLVDTKT-PEETEMELRKKVP------- 173
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
+K ++ + +G+ CK SP C C +R C +F + AS
Sbjct: 174 ----------KKYWLPINNTFVMYGQNICKPISPMCSVCKIRNSCNYFKTKNAS 217
>gi|159900799|ref|YP_001547046.1| HhH-GPD family protein [Herpetosiphon aurantiacus DSM 785]
gi|159893838|gb|ABX06918.1| HhH-GPD family protein [Herpetosiphon aurantiacus DSM 785]
Length = 224
Score = 62.4 bits (150), Expect = 1e-06, Method: Composition-based stats.
Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 18/184 (9%)
Query: 579 EERSHDKMDSV--DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLE 636
EER + + + WEA+ A +A+AIK A I+ L +++ G I ++
Sbjct: 48 EERGYQNLRATFPTWEAILAAPVEAVAEAIKPANYAPAKANNIQAALAKILAERGEISID 107
Query: 637 WLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLP 696
+L + + +L + G+G K+ V L PVD +V R++ RLG V + P
Sbjct: 108 FLAELSTEAAMAWLTGLRGVGPKTASLVLLFCFSKPILPVDTHVHRVSQRLGLVKAKT-P 166
Query: 697 GDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
+ H LW L + D + L+ H ++ G+ C + P C CPL
Sbjct: 167 TEAH--------------EILWQLLPH-DAEWLFNYHVALLRHGQRICLAKRPRCNQCPL 211
Query: 757 RGEC 760
+C
Sbjct: 212 TAQC 215
>gi|407461561|ref|YP_006772878.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosopumilus koreensis AR1]
gi|407045183|gb|AFS79936.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosopumilus koreensis AR1]
Length = 215
Score = 62.4 bits (150), Expect = 1e-06, Method: Composition-based stats.
Identities = 39/143 (27%), Positives = 72/143 (50%), Gaps = 20/143 (13%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++E+ ID ++ VP D E L+++ G+G K+ CV + + + A PVD++V RI+
Sbjct: 92 RIIEVAKIIDSKYKGKVPEDL--ETLVELPGVGRKTANCVLVYAFEKPAIPVDIHVHRIS 149
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
R+G V + P + L+K+ +D+K +++ + +G+ C
Sbjct: 150 NRIGLVDTKN-PEETEQELMKK-----------------IDKKFWIDINDTFVMYGQNIC 191
Query: 745 KKRSPNCGACPLRGECKHFASAV 767
K SP C C ++ CK + S +
Sbjct: 192 KPISPMCDVCKIKRNCKFYKSKI 214
>gi|403178239|ref|XP_003336685.2| hypothetical protein PGTG_18481 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375164069|gb|EFP92266.2| hypothetical protein PGTG_18481 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 424
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 56/108 (51%), Gaps = 3/108 (2%)
Query: 590 DWEAVRCADESEIADAIKER--GQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKV 646
++ A+R A S I +++ G + I LN + + L+ + LE+LR +P +
Sbjct: 136 NYHAIRRASVSSITAVLQDARVGLAARKSTTIHALLNHIHLNLNPDLSLEFLRYLPDSEA 195
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
E L +G+G K+ CV L L FPVD +V RI LGW+P +P
Sbjct: 196 METLTSFKGVGAKTASCVLLFCLGRNFFPVDTHVFRITKALGWLPPRP 243
>gi|85114220|ref|XP_964657.1| hypothetical protein NCU02035 [Neurospora crassa OR74A]
gi|10303298|emb|CAC10093.1| conserved hypothetical protein [Neurospora crassa]
gi|28926447|gb|EAA35421.1| predicted protein [Neurospora crassa OR74A]
Length = 572
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 68/155 (43%), Gaps = 25/155 (16%)
Query: 560 SDNTVQQDWDLLRRIYSTGEE---RSHDK-MDSVDWEAVRCADESEIADAIKERGQQNII 615
S NT+ D R I + +E R H + S++WE V E ++A AIK G N
Sbjct: 348 SGNTLMAMAD--RAIRNVAQEYGLREHGSGIGSINWEKVASEPEEKLAQAIKVSGNGNQK 405
Query: 616 AGRIKEFLN-------------------RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A IK L+ R V +++L+ + V D+ L+ G+
Sbjct: 406 AKHIKLILDMVALEMAQMAMENKGTGGKREVAFSETLNLDHMHTVTKDEAMAKLVQYPGI 465
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
G+KS CV L L+ F VD +V R LGWVP
Sbjct: 466 GIKSAACVTLFCLRMPCFAVDTHVHRFCRWLGWVP 500
>gi|433638876|ref|YP_007284636.1| putative endoIII-related endonuclease [Halovivax ruber XH-70]
gi|433290680|gb|AGB16503.1| putative endoIII-related endonuclease [Halovivax ruber XH-70]
Length = 296
Score = 62.0 bits (149), Expect = 2e-06, Method: Composition-based stats.
Identities = 55/190 (28%), Positives = 81/190 (42%), Gaps = 24/190 (12%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIE 654
AD EIA+ I G N + I+E + G+ ++R P V++ LL +
Sbjct: 124 ADREEIAETISAAGLYNQKSRMIQEAAEWVRTEFGTAPAFDAYVRETAPATVRDELLSVH 183
Query: 655 GLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKI 713
G+G K+ +CV L + Q +PVD +V RI+ RLG P P G + E V
Sbjct: 184 GVGPKTADCVLLFAGGQDGVYPVDTHVHRISRRLGIAP--PEAGHEGVREALETTVPG-- 239
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVAS 769
P+ + H MI FG+ +C R P C ACPL GEC + +
Sbjct: 240 -----PKCGF--------GHTAMIQFGREYCTARKPACLDGTEACPLAGECDRVGVDIEA 286
Query: 770 ARFALPGPSE 779
P ++
Sbjct: 287 QSVVDPAEAD 296
>gi|118577076|ref|YP_876819.1| EndoIII-related endonuclease [Cenarchaeum symbiosum A]
gi|118195597|gb|ABK78515.1| EndoIII-related endonuclease [Cenarchaeum symbiosum A]
Length = 277
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 71/145 (48%), Gaps = 20/145 (13%)
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
R++E+ ID ++ VP D E L+ + G+G K+ CV + + + A PVD++V RI
Sbjct: 91 RRIMEVARIIDTKYGGRVPDDL--ETLVGLPGVGRKTANCVLVYAFEKPAIPVDIHVHRI 148
Query: 684 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
+ RLG V + P + L K+ P ++ ++ + +G+
Sbjct: 149 SNRLGLVDTR-TPEETEAALTKKVP-----------------KRHWLHVNDIFVMYGQNI 190
Query: 744 CKKRSPNCGACPLRGECKHFASAVA 768
CK SP C C +R CK++A++ +
Sbjct: 191 CKPVSPMCEVCGIRSLCKYYANSAS 215
>gi|380484040|emb|CCF40248.1| base excision DNA repair protein [Colletotrichum higginsianum]
Length = 361
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 25/174 (14%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVK 647
+ ++EA+ ++ DA++ G QN A + L+ + E HG DL+ L N ++
Sbjct: 187 TFNYEAIVEGGMDKLVDALRPGGMQNRKAKILMRLLHDVKERHGKWDLQHLFNASDEEAV 246
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
+ ++ GLG K C+ + L+ AF VD ++ RI GW P HL
Sbjct: 247 KEVVSYWGLGPKCAFCLLSICLKRDAFAVDTHIYRITGLWGWRPKDASKELAQAHLDARI 306
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P + Y LHYQ I G+ C AC G+ K
Sbjct: 307 P-----------------NEIKYALHYQFIVHGR--------QCPACRGNGDSK 335
>gi|18419584|gb|AAL69364.1|AF462201_1 unknown [Narcissus pseudonarcissus]
Length = 53
Score = 61.6 bits (148), Expect = 2e-06, Method: Composition-based stats.
Identities = 28/53 (52%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
Query: 1049 TLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKKEAGSKRS 1101
++F L IQ FWRGFVCVRGFDR R P+PL RLH P+ K + +KR+
Sbjct: 2 SIFKGLTTEGIQHCFWRGFVCVRGFDRMMRAPKPLFARLH-FPASKGSRNKRT 53
>gi|448561313|ref|ZP_21634665.1| endonuclease III [Haloferax prahovense DSM 18310]
gi|445721545|gb|ELZ73213.1| endonuclease III [Haloferax prahovense DSM 18310]
Length = 268
Score = 61.6 bits (148), Expect = 2e-06, Method: Composition-based stats.
Identities = 54/195 (27%), Positives = 87/195 (44%), Gaps = 24/195 (12%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS-IDLE-WLRNVPPDKVKEY 649
E++ AD I +AI+ G N + I+ ++ GS D + ++R P V++
Sbjct: 90 ESLAAADREGIVEAIRSGGLYNQKSKLIQGVAEDVLADFGSEADFDRYVREEVPATVRDR 149
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL+++G+G K+ +CV L + + FPVD +V RIA R+G P + L ++ P
Sbjct: 150 LLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGLAPADADHEGVRAALERDIP 209
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
D+K + H MI FG+ FC R P C ACP+ C
Sbjct: 210 ----------------DEKCGFG-HTAMIQFGREFCTARKPACLDGPEACPMADVCDMVG 252
Query: 765 SAVASARFALPGPSE 779
A+ P ++
Sbjct: 253 VDAAAQAVTDPAAAD 267
>gi|161527564|ref|YP_001581390.1| DNA-(apurinic or apyrimidinic site) lyase [Nitrosopumilus maritimus
SCM1]
gi|160338865|gb|ABX11952.1| DNA-(apurinic or apyrimidinic site) lyase [Nitrosopumilus maritimus
SCM1]
Length = 218
Score = 61.6 bits (148), Expect = 2e-06, Method: Composition-based stats.
Identities = 41/145 (28%), Positives = 72/145 (49%), Gaps = 20/145 (13%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++E+ ID ++ VP D + L+ + G+G K+ CV + + + A PVD++V RI+
Sbjct: 94 RIIEVAKIIDSKYKGKVPEDL--DTLVQLPGVGRKTANCVLVYAFEKPAIPVDIHVHRIS 151
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RLG V + P + L+K+ +D+K +++ + +G+ C
Sbjct: 152 NRLGLVETKN-PEETEQELMKK-----------------VDKKFWIDINDTFVMYGQNIC 193
Query: 745 KKRSPNCGACPLRGECKHFASAVAS 769
K SP C C ++ CK + S AS
Sbjct: 194 KPISPMCDVCKIKRSCKFYKSKNAS 218
>gi|170751703|ref|YP_001757963.1| helix-hairpin-helix DNA-binding motif-containing protein
[Methylobacterium radiotolerans JCM 2831]
gi|170658225|gb|ACB27280.1| helix-hairpin-helix motif protein [Methylobacterium radiotolerans
JCM 2831]
Length = 253
Score = 61.6 bits (148), Expect = 2e-06, Method: Composition-based stats.
Identities = 54/172 (31%), Positives = 77/172 (44%), Gaps = 14/172 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DWEAV AD EI AI + A RI++ L L + G +DL +L ++ + + +
Sbjct: 85 DWEAVIDADVPEIEAAIAGVTWPELKAPRIRDVLRALRDRCGGLDLAFLADMEVEAARVW 144
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L I G+G K+ V S L+ A PVD + R+A RLG + G H L + P
Sbjct: 145 LQAIPGVGPKTSAAVLSFSTLRMPALPVDSHHHRVAQRLGLIGKTVDVGPSHPILRAQLP 204
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
W + LY+ H ++ G+ C R P CG C L C
Sbjct: 205 AD-------W------SAQDLYDNHEILMLHGQKVCHHRRPACGRCVLVDLC 243
>gi|294496431|ref|YP_003542924.1| DNA-(apurinic or apyrimidinic site) lyase [Methanohalophilus mahii
DSM 5219]
gi|292667430|gb|ADE37279.1| DNA-(apurinic or apyrimidinic site) lyase [Methanohalophilus mahii
DSM 5219]
Length = 206
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 70/164 (42%), Gaps = 27/164 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +I IK+ G + AGRIKE L++ E+ VP V+ LL + G+
Sbjct: 63 ADVEKIEILIKDVGFYRVKAGRIKEISQILID-------EYDSQVPASMVE--LLKLPGV 113
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ CV + A VD +V RI+ RLG V P I L K+ PV
Sbjct: 114 GRKTANCVLSYAFLEKAIAVDTHVHRISNRLGLVD-TVTPDQTEIELQKQVPVS------ 166
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
W E++ + FGK CK SP C C + C
Sbjct: 167 YW-----------REVNELFVQFGKTVCKPLSPACEVCAIEDLC 199
>gi|393766633|ref|ZP_10355188.1| helix-hairpin-helix DNA-binding motif-containing protein
[Methylobacterium sp. GXF4]
gi|392727951|gb|EIZ85261.1| helix-hairpin-helix DNA-binding motif-containing protein
[Methylobacterium sp. GXF4]
Length = 253
Score = 61.6 bits (148), Expect = 3e-06, Method: Composition-based stats.
Identities = 58/183 (31%), Positives = 82/183 (44%), Gaps = 17/183 (9%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DWE+V A+ EI AI + A RI+E L + E GS+DL +L ++ + +
Sbjct: 85 DWESVIDAEVPEIQAAIAGVTWPELKAPRIREVLIAVRERCGSLDLAFLGDMDVAAARAW 144
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L I G+G K+ V S L+ A PVD + R+A RLG + + G H L + P
Sbjct: 145 LEAIPGIGPKTSAAVLSFSTLRMPALPVDSHHHRVAQRLGLIGPRVDVGPSHAILRAQLP 204
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
W + LY+ H ++ G+ C R P CG C L C SA
Sbjct: 205 AD-------W------SAQALYDNHEILMLHGQQVCHHRRPACGRCVLVDLC---PSAAL 248
Query: 769 SAR 771
AR
Sbjct: 249 GAR 251
>gi|448582052|ref|ZP_21645556.1| endonuclease III [Haloferax gibbonsii ATCC 33959]
gi|445731700|gb|ELZ83283.1| endonuclease III [Haloferax gibbonsii ATCC 33959]
Length = 268
Score = 61.2 bits (147), Expect = 3e-06, Method: Composition-based stats.
Identities = 54/196 (27%), Positives = 86/196 (43%), Gaps = 26/196 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS---IDLEWLRNVPPDKVKE 648
E++ AD I +AI+ G N + I+ ++ GS D ++R P V++
Sbjct: 90 ESLAAADREGIVEAIRSGGLYNQKSKLIQGVAEDVLADFGSEAGFD-RYVREEDPSTVRD 148
Query: 649 YLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
LL+++G+G K+ +CV L + + FPVD +V RIA R+G P + L ++
Sbjct: 149 RLLEMKGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRIGLAPADADHEGVRAALERDI 208
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHF 763
P D+K + H MI FG+ FC R P C ACP+ C
Sbjct: 209 P----------------DEKCGFG-HTAMIQFGREFCTARKPACLDGPEACPMADVCDMV 251
Query: 764 ASAVASARFALPGPSE 779
A+ P ++
Sbjct: 252 GVDAAAQSVTDPAAAD 267
>gi|408403428|ref|YP_006861411.1| endonuclease III [Candidatus Nitrososphaera gargensis Ga9.2]
gi|408364024|gb|AFU57754.1| putative endonuclease III [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 223
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 27/167 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD EI I G NI A RIK+ L+ G VP D + LL + G+
Sbjct: 72 ADIDEIKKIIHSIGFYNIKAERIKQVSQMLISKFGG-------RVPSDI--DSLLKLPGV 122
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ CV + + A PVDV+V RI+ RLG V + P L K
Sbjct: 123 GRKTANCVLVYAFDKPAIPVDVHVHRISNRLGLVSTKT-PEQTEQDLSK----------- 170
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+D+K +++ + +G+ C PNC AC L+ C+++
Sbjct: 171 ------LVDRKLWTKVNDTFVMYGQNICLPVRPNCKACDLKKMCRYY 211
>gi|392410783|ref|YP_006447390.1| putative endonuclease III-like protein [Desulfomonile tiedjei DSM
6799]
gi|390623919|gb|AFM25126.1| putative endonuclease III-like protein [Desulfomonile tiedjei DSM
6799]
Length = 225
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 80/170 (47%), Gaps = 19/170 (11%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKV 646
++D A+ + ++A I+ G NI A R++ F+N LVE H GS+D + +V +
Sbjct: 62 ALDPLAIYHSSHEQLAAWIRPAGYYNIKAVRLRNFINYLVERHSGSLDSLFSDDV--QVL 119
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+ LL I+G+G ++ + + L + F +D GR+ R G VP
Sbjct: 120 RTNLLSIKGIGKETADSMILYAAHKPIFVIDAYTGRVMKRHGIVPYS-----------AN 168
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
Y VM ++ P +D + + H Q + G +CKK P C CPL
Sbjct: 169 YDVMQELFHENLP----VDVELYNDFHAQFVALGHHYCKK-VPLCHFCPL 213
>gi|448322514|ref|ZP_21511984.1| HhH-GPD family protein [Natronococcus amylolyticus DSM 10524]
gi|445601272|gb|ELY55261.1| HhH-GPD family protein [Natronococcus amylolyticus DSM 10524]
Length = 270
Score = 61.2 bits (147), Expect = 3e-06, Method: Composition-based stats.
Identities = 53/178 (29%), Positives = 78/178 (43%), Gaps = 28/178 (15%)
Query: 592 EAVRCADESEIADAIKERGQQN----IIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVK 647
E++ A++SE+A+ I G N +I G +E L G D ++R P+ V+
Sbjct: 92 ESLADAEQSELAETISSAGLYNQKSAMIVGAAEEILEEFDSAAGFDD--FVREAEPETVR 149
Query: 648 EYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
E LL+I G+G K+ +CV L + + FPVD +V RI RLG P + L ++
Sbjct: 150 ERLLEIHGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRLGIAPADADHEAVRAVLERD 209
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
P C H I FG+ +C R P C ACP+ C
Sbjct: 210 IPAAK----------CGFG-------HTASIQFGREYCTARKPACLEDPDACPMADVC 250
>gi|332796258|ref|YP_004457758.1| DNA-(apurinic or apyrimidinic site) lyase [Acidianus hospitalis W1]
gi|332693993|gb|AEE93460.1| DNA-(apurinic or apyrimidinic site) lyase [Acidianus hospitalis W1]
Length = 232
Score = 61.2 bits (147), Expect = 3e-06, Method: Composition-based stats.
Identities = 47/164 (28%), Positives = 77/164 (46%), Gaps = 20/164 (12%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
+IA I+ G I A RIKE L ++++ + DL + + P + +E LL + G+G K+
Sbjct: 78 DIASCIRNIGIYRIKAERIKE-LAKIIKEKYNGDLNKILDKEPKEAREELLSLPGIGEKT 136
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
+ V L + FPVD ++ RI+ RLG Y + M L+
Sbjct: 137 ADVVLLTCKGYPYFPVDTHIKRISQRLGIAS-------------GSYEQISASLMRLF-- 181
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
D K E H+ +I G+ CK ++P C C L C++++
Sbjct: 182 ----DPKDYLEAHHLLIAHGRNVCKAKNPLCEKCILNDCCEYYS 221
>gi|395325971|gb|EJF58386.1| DNA glycosylase [Dichomitus squalens LYAD-421 SS1]
Length = 353
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 80/193 (41%), Gaps = 42/193 (21%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNV-------- 641
+++A+ A + ++ +AIK G N A I++ L + + HG L+ L V
Sbjct: 137 NFQAIAEASKDQVVEAIKTGGLANKKAAVIQKILREVYDRHGEYSLQHLARVVQSDVDVN 196
Query: 642 ---------------PPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
++ + L+ +G+G K+ CV L L +FPVD +V R++
Sbjct: 197 SKAKAEGSSQGRAAVSDEEAMQELVSYDGVGPKTASCVLLFCLGRSSFPVDTHVFRLSRL 256
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGK--IFC 744
LGWVP + HL + V D ++ Y LH M+ G+ C
Sbjct: 257 LGWVPARADRVTAQAHL--DLKVPDHLK---------------YGLHVLMVGHGRRCKGC 299
Query: 745 KKRSPNCGACPLR 757
K S G C L+
Sbjct: 300 KSTSSGKGECVLK 312
>gi|448737131|ref|ZP_21719183.1| HhH-GPD family protein [Halococcus thailandensis JCM 13552]
gi|445804344|gb|EMA54601.1| HhH-GPD family protein [Halococcus thailandensis JCM 13552]
Length = 267
Score = 60.8 bits (146), Expect = 4e-06, Method: Composition-based stats.
Identities = 53/189 (28%), Positives = 81/189 (42%), Gaps = 24/189 (12%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEG 655
++S +A+ I+ G N + + R+V+ +GS E++ P V+E LLD G
Sbjct: 96 EQSTLAETIESAGLYNQKSETMIAIAERVVDEYGSAAAFDEFVTEEEPATVRETLLDFSG 155
Query: 656 LGLKSVECVRLLSLQHIA-FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
+G K+ +CV L S FPVD +V RI RLG P D+ L + P
Sbjct: 156 VGPKTADCVLLFSGGRAGVFPVDTHVHRIYRRLGIAPADADHEDVREVLEAQVPA----- 210
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASA 770
+K + H I FG+ +C RSP C ACP+ C+ A+
Sbjct: 211 -----------EKCGFG-HTATIQFGREYCTARSPACLDDPDACPMGDRCEQVGVYPATD 258
Query: 771 RFALPGPSE 779
P ++
Sbjct: 259 EVVDPSEAD 267
>gi|448313759|ref|ZP_21503471.1| HhH-GPD family protein [Natronolimnobius innermongolicus JCM 12255]
gi|445597069|gb|ELY51147.1| HhH-GPD family protein [Natronolimnobius innermongolicus JCM 12255]
Length = 286
Score = 60.8 bits (146), Expect = 4e-06, Method: Composition-based stats.
Identities = 51/173 (29%), Positives = 76/173 (43%), Gaps = 24/173 (13%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIE 654
A+ S +A+ I G N + I ++E GS E++++ P+ V+E LLD+
Sbjct: 97 AERSALAETISSAGLYNQKSEVIIATAEWVIEEFGSAVAFDEYVKDEAPETVRETLLDVR 156
Query: 655 GLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKI 713
G+G K+ +CV L + + FPVD +V RI RLG P D+ L +E P
Sbjct: 157 GVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRLGIAPADADHEDVRAVLEREVPAAK-- 214
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKH 762
C H I FG+ +C R P C ACP+ C+
Sbjct: 215 --------CGFG-------HTATIQFGREYCTARKPACLEDPEACPMADRCEQ 252
>gi|282165629|ref|YP_003358014.1| putative endonuclease III [Methanocella paludicola SANAE]
gi|282157943|dbj|BAI63031.1| putative endonuclease III [Methanocella paludicola SANAE]
Length = 293
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 78/176 (44%), Gaps = 18/176 (10%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
+ + +D + AD+ E+ + I G A I E +++E G LEW++ V +
Sbjct: 124 RYNGIDVHKMAGADKKELEELIHYSGPYK--AAYIIECSRQIIERWGG-SLEWMKKVSTE 180
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ +E LL + G+G K+ +CV L SL H PVD ++ R++ RLG L GD
Sbjct: 181 EAREALLSLYGVGPKTADCVLLFSLGHSVTPVDTHICRVSQRLG---LSTSTGDSEAAKR 237
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
K ++K + H +I G+ FCK P CP+ C
Sbjct: 238 KVKEDLEK------------KHRIPGMAHLLIINLGRDFCKALVPLHHICPVEDIC 281
>gi|435848105|ref|YP_007310355.1| putative endoIII-related endonuclease [Natronococcus occultus SP4]
gi|433674373|gb|AGB38565.1| putative endoIII-related endonuclease [Natronococcus occultus SP4]
Length = 270
Score = 60.8 bits (146), Expect = 4e-06, Method: Composition-based stats.
Identities = 53/172 (30%), Positives = 75/172 (43%), Gaps = 28/172 (16%)
Query: 598 DESEIADAIKERGQQN----IIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDI 653
++SE+A+ I G N +I G +E L G D ++R P+ V+E LL+I
Sbjct: 98 EQSELAETISSAGLYNQKSEMIIGAAEEILAEFESAAGFDD--FVREAAPEGVRERLLEI 155
Query: 654 EGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
G+G K+ +CV L + + FPVD +V RI RLG P D+ L +E P
Sbjct: 156 HGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRLGVAPPDADHEDVRAVLEREVPAAK- 214
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
C H I FG+ +C R P C ACP+ C
Sbjct: 215 ---------CGFG-------HTASIQFGREYCTARKPACLEDPDACPMGDVC 250
>gi|20093393|ref|NP_619468.1| DNA-(apurinic or apyrimidinic site) lyase [Methanosarcina
acetivorans C2A]
gi|19918762|gb|AAM07948.1| DNA-(apurinic or apyrimidinic site) lyase [Methanosarcina
acetivorans C2A]
Length = 216
Score = 60.5 bits (145), Expect = 5e-06, Method: Composition-based stats.
Identities = 56/192 (29%), Positives = 78/192 (40%), Gaps = 40/192 (20%)
Query: 582 SHDKMDSVDWEAVR-------------CADESEIADAIKERGQQNIIAGRIKEFLNRLVE 628
SH D V + A R AD EI IK+ G + +GRIKE L+E
Sbjct: 47 SHRTRDDVTYPAARKLFERFSTPEEMVGADVEEIEALIKDVGFYRVKSGRIKEISGILLE 106
Query: 629 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLG 688
E+ VP D E LL + G+G K+ CV + A VD +V RI+ RLG
Sbjct: 107 -------EYDGEVPDDM--ETLLKLPGVGRKTANCVLAHAFLKDALAVDTHVHRISNRLG 157
Query: 689 WVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRS 748
V + P + + L K +P QK ++ ++ G+ C+ S
Sbjct: 158 LVETKT-PEETELELKKIFP-----------------QKYWKHINLLLVKLGQNICRPIS 199
Query: 749 PNCGACPLRGEC 760
P C C L C
Sbjct: 200 PKCEVCVLNDMC 211
>gi|332295602|ref|YP_004437525.1| HhH-GPD family protein [Thermodesulfobium narugense DSM 14796]
gi|332178705|gb|AEE14394.1| HhH-GPD family protein [Thermodesulfobium narugense DSM 14796]
Length = 219
Score = 60.5 bits (145), Expect = 5e-06, Method: Composition-based stats.
Identities = 43/166 (25%), Positives = 75/166 (45%), Gaps = 19/166 (11%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLE-WLRNVPPDKVKEYLLDIEGLGLK 659
+I D+IK G N + + ++ ++E + + + ++ E L I+G+GLK
Sbjct: 71 KIKDSIKIAGLNNQKTKTLHSLGKKFLQNKNYSNIEDYFKKMKISEIVEVFLSIDGIGLK 130
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+V C L L AFPVD ++ RI R+ + D+ I + E + D W
Sbjct: 131 TVSCAILFGLHKPAFPVDTHISRIVQRV--KKKKISKKDIQIEI--EGSIHD------WE 180
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
+ L LH +I G+ C+ + NC CP++ C+ + S
Sbjct: 181 K--------LKALHLYLIELGRNICRAKKQNCQMCPIKELCEDYRS 218
>gi|167044454|gb|ABZ09130.1| putative HhH-GPD superfamily base excision DNA repair protein
[uncultured marine crenarchaeote HF4000_APKG6D9]
Length = 216
Score = 60.5 bits (145), Expect = 5e-06, Method: Composition-based stats.
Identities = 38/145 (26%), Positives = 73/145 (50%), Gaps = 20/145 (13%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++E+ I+ ++ VP + + L+++ G+G K+ CV + + + A PVD++V RI+
Sbjct: 92 RIIEVAKIINTKYKGKVPDNL--DTLVELPGVGRKTANCVLVYAFEKPAIPVDIHVHRIS 149
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RLG V + P + L+K+ P +K +++ + +G+ C
Sbjct: 150 NRLGLVDTKN-PEETEQELMKKIP-----------------KKYWIDINDTFVMYGQNIC 191
Query: 745 KKRSPNCGACPLRGECKHFASAVAS 769
K SP C C ++ CK++ + AS
Sbjct: 192 KPISPMCDVCKIKKNCKYYKTKYAS 216
>gi|145590366|ref|YP_001152368.1| HhH-GPD family protein [Pyrobaculum arsenaticum DSM 13514]
gi|145282134|gb|ABP49716.1| HhH-GPD family protein [Pyrobaculum arsenaticum DSM 13514]
Length = 218
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 80/179 (44%), Gaps = 23/179 (12%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
++ S+ E++R E+ + IK G A +K + + L I E L + D
Sbjct: 59 RLGSITPESLRGISLGELEELIKPAGMYRQRARYLKALADAFITLE--ITPEKLVKMGAD 116
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
++ L+ + G+G K+ + V L +L AFPVD ++ RIA R G +
Sbjct: 117 AARKLLMSLPGVGRKTADVV-LANLGLPAFPVDTHITRIAKRWG--------------VG 161
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
Y + + M P+ YL E H ++I FG+ C+ R+P C CP+ C F
Sbjct: 162 SRYEDISRWFMEQLPKERYL------EFHLKLIQFGRDICRARNPRCEECPIGERCPSF 214
>gi|336463475|gb|EGO51715.1| hypothetical protein NEUTE1DRAFT_89368 [Neurospora tetrasperma FGSC
2508]
gi|350297308|gb|EGZ78285.1| DNA glycosylase [Neurospora tetrasperma FGSC 2509]
Length = 381
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 19/123 (15%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN-------------------RLVE 628
S++WE V E ++A AIK G N A IK L+ R V
Sbjct: 187 SINWEKVASEPEEKLAQAIKVSGNGNQKAKHIKLILDMVALEMAQLAMENNGTGGKREVA 246
Query: 629 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLG 688
+++L+ + V D+ L+ G+G+KS CV L L+ F VD +V R LG
Sbjct: 247 FPETLNLDHMHTVTKDEAMAKLVQYPGIGIKSAACVTLFCLRMPCFAVDTHVHRFCRWLG 306
Query: 689 WVP 691
WVP
Sbjct: 307 WVP 309
>gi|297585115|ref|YP_003700895.1| HhH-GPD family protein [Bacillus selenitireducens MLS10]
gi|297143572|gb|ADI00330.1| HhH-GPD family protein [Bacillus selenitireducens MLS10]
Length = 211
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 76/170 (44%), Gaps = 19/170 (11%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E +R + E+ + I+ G + A R++ FL E HG D+ L+ V D ++E LL
Sbjct: 58 ERIRAMTDEELEERIRPSGFFRMKAQRLRAFLE-WFESHGD-DVRALQQVETDVLREELL 115
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
++G+G ++ + + L +L F +D RI R+G+ K+Y
Sbjct: 116 QVKGIGAETADSILLYALYRPVFVIDAYTHRIMNRIGY------------QFPKKYNQAQ 163
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P+ D+ + H Q + K CKK+ P C CPL EC+
Sbjct: 164 AFFEEALPK----DEALYNDFHAQFVRHAKEHCKKK-PVCEGCPLEPECE 208
>gi|345870606|ref|ZP_08822557.1| HhH-GPD family protein [Thiorhodococcus drewsii AZ1]
gi|343921419|gb|EGV32135.1| HhH-GPD family protein [Thiorhodococcus drewsii AZ1]
Length = 247
Score = 60.5 bits (145), Expect = 6e-06, Method: Composition-based stats.
Identities = 45/169 (26%), Positives = 77/169 (45%), Gaps = 18/169 (10%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E++ D +ADA++ G N+ A R+++F + V G L+ L + D++++ LL
Sbjct: 84 ESILSLDVEALADALRPAGYFNLKAQRLRDFCSFYVASGG---LDALSRIETDRLRDQLL 140
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
++G+G ++ + + L + F VD R+ RLG L GD ++
Sbjct: 141 AVKGIGPETADDMLLYAFDRPVFVVDAYTRRLFSRLG-----QLSGDEGYEGIRA----- 190
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ L P D L E H ++ K C+ R P C C LR +C
Sbjct: 191 AFEAVLGP-----DVAMLKEYHALLVRHAKEACRSRHPRCSTCSLRSDC 234
>gi|170741581|ref|YP_001770236.1| helix-hairpin-helix DNA-binding motif-containing protein
[Methylobacterium sp. 4-46]
gi|168195855|gb|ACA17802.1| helix-hairpin-helix motif [Methylobacterium sp. 4-46]
Length = 239
Score = 60.5 bits (145), Expect = 6e-06, Method: Composition-based stats.
Identities = 50/172 (29%), Positives = 76/172 (44%), Gaps = 14/172 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW + A +E+ I A R++ L + E HGS+ L++LR++ + +
Sbjct: 63 DWAELIAAPVAEVEAVITGVTWPEQKAPRLQAVLRAVRERHGSLSLDFLRDLDVASARAW 122
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L I G+G K+ V S L+ A PVD + R+A RLG + + G H L + P
Sbjct: 123 LEAIPGIGPKTSAAVLSFSTLRRPALPVDSHHHRVAQRLGLIGPKVDVGPAHAILRAQLP 182
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
W + LY+ H M+ G+ C R+P CG C L C
Sbjct: 183 EA-------W------SAQDLYDNHEVMMLHGQRCCFHRAPACGTCVLLDLC 221
>gi|403170480|ref|XP_003329823.2| hypothetical protein PGTG_11760 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168743|gb|EFP85404.2| hypothetical protein PGTG_11760 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 352
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 55/108 (50%), Gaps = 3/108 (2%)
Query: 590 DWEAVRCADESEIADAIKER--GQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKV 646
++ A+R A S I +++ G + I LN + + L+ + LE+LR +P +
Sbjct: 134 NYHAIRRASVSSITAVLQDARVGLAARKSTTIHALLNHIHLNLNPDLSLEFLRYLPDSEA 193
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
E L +G+G K+ CV L L FPVD +V RI LGW+P P
Sbjct: 194 METLTSFKGVGAKTASCVLLFCLGRNFFPVDTHVFRITKALGWLPPGP 241
>gi|147845261|emb|CAN79062.1| hypothetical protein VITISV_036428 [Vitis vinifera]
Length = 449
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/67 (53%), Positives = 45/67 (67%)
Query: 677 DVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQM 736
D N+GRI +LG VP QPL L ++LL+ +++ IQ YL RLC LD TLYE YQ+
Sbjct: 343 DTNIGRIXEKLGXVPFQPLLEPLQLYLLELCHMLEFIQKYLLLRLCKLDXXTLYEQCYQL 402
Query: 737 ITFGKIF 743
ITFGKI
Sbjct: 403 ITFGKIL 409
>gi|352682089|ref|YP_004892613.1| DNA endonuclease III [Thermoproteus tenax Kra 1]
gi|350274888|emb|CCC81534.1| DNA endonuclease III [Thermoproteus tenax Kra 1]
Length = 224
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 90/179 (50%), Gaps = 31/179 (17%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSI---DLEWLRNV 641
++ ++ +++ E E++ I+ G GR + +L+EL S+ LE + +
Sbjct: 63 RLGIIEPQSILSLSEEELSQLIRPAGMHR---GRAR----KLIELSRSLLGHPLEEISRM 115
Query: 642 PPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
++ +++L+D+ G+G K+ + V LL++ AFPVD ++ RIA R W
Sbjct: 116 SFEEARKFLMDLPGVGEKTAD-VLLLNMGKPAFPVDTHIKRIAAR--W------------ 160
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ K Y + + W R + + L E+H ++I FG+ +C+ + P C CP+R C
Sbjct: 161 SIGKSYREISE-----WFRKNFPPTRYL-EIHLKLIQFGRDYCRAQRPRCDVCPVRHMC 213
>gi|410721172|ref|ZP_11360516.1| putative endoIII-related endonuclease [Methanobacterium sp.
Maddingley MBC34]
gi|410599623|gb|EKQ54169.1| putative endoIII-related endonuclease [Methanobacterium sp.
Maddingley MBC34]
Length = 213
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 79/168 (47%), Gaps = 29/168 (17%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDKVKEYLLDIEG 655
AD + + I+ G ++ A RI E +L E G + PD +K LL++ G
Sbjct: 66 ADPALLEPLIRPAGFYHVKAQRIVEVSRKLQDEFKGQV---------PDYMKG-LLELPG 115
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+G K+ CV + Q A PVDV+V RI+ RLG V + P + L K
Sbjct: 116 VGRKTANCVLVYGFQKPAIPVDVHVHRISNRLGLVDTKH-PEETEAELEK---------- 164
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
L PR ++ EL+ M+ FG+ C+ +SP CPL+ C ++
Sbjct: 165 -LVPREYWI------ELNDLMVQFGQTICRPQSPRHEECPLQELCDYY 205
>gi|448590303|ref|ZP_21650179.1| endonuclease III [Haloferax elongans ATCC BAA-1513]
gi|445734936|gb|ELZ86491.1| endonuclease III [Haloferax elongans ATCC BAA-1513]
Length = 267
Score = 60.1 bits (144), Expect = 7e-06, Method: Composition-based stats.
Identities = 51/195 (26%), Positives = 85/195 (43%), Gaps = 24/195 (12%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS-IDLE-WLRNVPPDKVKEY 649
E++ AD + +AI+ G N + I ++ GS D + ++R P+ V+E
Sbjct: 89 ESLAAADREALVEAIRSAGLYNQKSKLIIGVAEAVLADFGSEADFDTYIREADPETVRER 148
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL+++G+G K+ +CV L + + FPVD +V RI+ R+G P + L ++ P
Sbjct: 149 LLEMKGVGPKTADCVLLFAGGRDGVFPVDTHVHRISRRIGLAPADADHEGVRERLERDVP 208
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
C H MI FG+ +C R P C ACP+ C
Sbjct: 209 ----------GEACGFG-------HTAMIQFGREYCTARKPACLDGPEACPMADVCDMVG 251
Query: 765 SAVASARFALPGPSE 779
V++ P ++
Sbjct: 252 VDVSTGAVTDPASAD 266
>gi|374622424|ref|ZP_09694949.1| HhH-GPD family protein [Ectothiorhodospira sp. PHS-1]
gi|373941550|gb|EHQ52095.1| HhH-GPD family protein [Ectothiorhodospira sp. PHS-1]
Length = 221
Score = 60.1 bits (144), Expect = 7e-06, Method: Composition-based stats.
Identities = 49/188 (26%), Positives = 83/188 (44%), Gaps = 33/188 (17%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+D E++ +++A I+ G N+ AGR++ F + +E G L R + ++
Sbjct: 59 LDAESLAALPRNDLAGLIRPSGYFNVKAGRLQAFCHWYLEQGGETAL---RGWSTEALRR 115
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL----GWVPLQPLPGDLHIHLL 704
LL ++G+G ++ + + L + + F VD R+ RL G P L + L
Sbjct: 116 GLLSVKGVGPETADDILLYAFERPVFVVDAYTRRLFARLELAGGDEPYDVLRTGVEAVLG 175
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ PV++ E H ++ GK FC+ R P+C CPLR C+
Sbjct: 176 ADVPVLN-------------------EFHALIVAHGKTFCRPR-PSCEGCPLRAMCQ--- 212
Query: 765 SAVASARF 772
AS RF
Sbjct: 213 ---ASLRF 217
>gi|421874622|ref|ZP_16306225.1| endonuclease III [Brevibacillus laterosporus GI-9]
gi|372456478|emb|CCF15774.1| endonuclease III [Brevibacillus laterosporus GI-9]
Length = 190
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 70/160 (43%), Gaps = 26/160 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ D I++ G A I+ L+E +G VP E L+ + G+G K+
Sbjct: 40 ELQDDIRQIGLYKNKAKHIQSLCRLLLEEYGG-------EVPNKH--EELVKLPGVGRKT 90
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V RI+ RLG+ P P + L+K+ P D W
Sbjct: 91 ANVVVSVAFGVPAIAVDTHVERISKRLGFCPDDATPVQVEEILMKKIPKKD------WS- 143
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ H++MI FG+ CK +SP C CPL C
Sbjct: 144 ----------DTHHRMIFFGRYHCKAQSPQCDICPLEDVC 173
>gi|254424749|ref|ZP_05038467.1| hypothetical protein S7335_4909 [Synechococcus sp. PCC 7335]
gi|196192238|gb|EDX87202.1| hypothetical protein S7335_4909 [Synechococcus sp. PCC 7335]
Length = 254
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 77/172 (44%), Gaps = 14/172 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW+ VR A + AI A RI++ L ++ + G++ +++L ++P + + +
Sbjct: 83 DWKLVRDAKTEAVQAAIAPCTWPEQKAPRIQQILQQITKERGTLSIDFLADIPVAEARAW 142
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L + G+G K+ V S L+ A PVD + R+AVR +P + G H L + P
Sbjct: 143 LETLTGVGPKTSAAVLAFSTLRRRALPVDSHHHRVAVRTELIPKKVTVGPSHAILEAQLP 202
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
Q +Y+ H ++ G+ C R+P C C + C
Sbjct: 203 EDWSAQQ-------------VYDNHEVLMLHGQHCCHYRNPTCDRCVILDLC 241
>gi|160902966|ref|YP_001568547.1| HhH-GPD family protein [Petrotoga mobilis SJ95]
gi|160360610|gb|ABX32224.1| HhH-GPD family protein [Petrotoga mobilis SJ95]
Length = 216
Score = 60.1 bits (144), Expect = 7e-06, Method: Composition-based stats.
Identities = 45/174 (25%), Positives = 84/174 (48%), Gaps = 19/174 (10%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D ++ E + E ++A I+ G N+ + R+K FL L + + D++ ++N +
Sbjct: 55 DLLEPEKLSKIKEDDLAQLIRSSGFYNLKSKRLKNFLEWLKKY--NYDIDKIKNKSVTSL 112
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E LL I+G+G ++ + + L + + F +D R+ RLG + L +E
Sbjct: 113 REELLSIKGIGKETADSILLYAFEMPVFVIDAYTKRMFSRLGLI------------LSRE 160
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
Y D+ Q + L D + E H ++ K++CKK +P C C L+ +C
Sbjct: 161 Y---DEFQDFFEKNLTK-DVQLYNEYHALIVKHSKVYCKK-TPKCSDCFLKEKC 209
>gi|408381370|ref|ZP_11178919.1| endonuclease III [Methanobacterium formicicum DSM 3637]
gi|407815837|gb|EKF86400.1| endonuclease III [Methanobacterium formicicum DSM 3637]
Length = 213
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 79/167 (47%), Gaps = 27/167 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD + + I+ G ++ A RI E +L L+ + PD +K LL++ G+
Sbjct: 66 ADPTLLEPLIRPAGFYHVKAQRIVEVSRKL--------LDKFKGQVPDDMKN-LLELPGV 116
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ CV + Q A PVDV+V RI+ RLG V + P + L K
Sbjct: 117 GRKTANCVLVYGFQIPAIPVDVHVHRISNRLGLVNTKT-PEETEAELEK----------- 164
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ PR ++ EL+ M+ FG+ C+ +SP CPL+ C ++
Sbjct: 165 IVPREYWI------ELNDLMVQFGQTICRPQSPRHEECPLQELCDYY 205
>gi|85375166|ref|YP_459228.1| endonuclease III [Erythrobacter litoralis HTCC2594]
gi|84788249|gb|ABC64431.1| endonuclease III [Erythrobacter litoralis HTCC2594]
Length = 224
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 71/176 (40%), Gaps = 33/176 (18%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
D+ IA AIK G N+ I++F L+ H R V PD +E LL + G+G
Sbjct: 76 DDEAIAQAIKPCGLYNMKTRNIRKFNQALLAEH--------RGVVPD-TREGLLSLPGIG 126
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K + V + VD +V R+ R+G L + DK L
Sbjct: 127 RKCADIVMSFTFGKDVIAVDTHVHRVCNRIG---------------LTDAKTADKTAEQL 171
Query: 718 ---WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASA 770
PR + D H+ +I FGK C R+P C CP+ C +A A A
Sbjct: 172 EERAPRWAHADG------HFWLIQFGKRVCTSRAPKCERCPVSDLCLWYAEQQAQA 221
>gi|407925000|gb|EKG18022.1| hypothetical protein MPH_04712 [Macrophomina phaseolina MS6]
Length = 517
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 64/140 (45%), Gaps = 36/140 (25%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS--------------- 632
SV+W+AVR AD E+ +AIK G ++ + IK+ L + E + +
Sbjct: 258 SVNWDAVRVADTKEVFEAIKSGGLADVKSKDIKKILQMVYEENQARHDALTKDSATTPAG 317
Query: 633 --------------------IDLEWLRNVPP-DKVKEYLLDIEGLGLKSVECVRLLSLQH 671
+ L++L ++ D++ +LL G+G+K+ CV L Q
Sbjct: 318 AENESVGERKLEVARAEANVLSLDYLHSLSSNDEIMAHLLRYPGIGVKTASCVLLFCFQR 377
Query: 672 IAFPVDVNVGRIAVRLGWVP 691
+F VD +V R+A L WVP
Sbjct: 378 PSFAVDTHVFRLAKWLNWVP 397
>gi|339008005|ref|ZP_08640579.1| putative endonuclease III [Brevibacillus laterosporus LMG 15441]
gi|338775208|gb|EGP34737.1| putative endonuclease III [Brevibacillus laterosporus LMG 15441]
Length = 221
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 70/160 (43%), Gaps = 26/160 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ D I++ G A I+ L+E +G VP E L+ + G+G K+
Sbjct: 71 ELQDDIRQIGLYKNKAKHIQSLCRLLLEEYGG-------EVPNKH--EELVKLPGVGRKT 121
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V RI+ RLG+ P P + L+K+ P D W
Sbjct: 122 ANVVVSVAFGVPAIAVDTHVERISKRLGFCPDDATPVQVEEILMKKIPKKD------WS- 174
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ H++MI FG+ CK +SP C CPL C
Sbjct: 175 ----------DTHHRMIFFGRYHCKAQSPQCDICPLEDVC 204
>gi|337286496|ref|YP_004625969.1| HhH-GPD family protein [Thermodesulfatator indicus DSM 15286]
gi|335359324|gb|AEH45005.1| HhH-GPD family protein [Thermodesulfatator indicus DSM 15286]
Length = 217
Score = 59.7 bits (143), Expect = 9e-06, Method: Composition-based stats.
Identities = 46/158 (29%), Positives = 73/158 (46%), Gaps = 18/158 (11%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
+A I+ G + A R+K F++ LVE HG +LE L ++ +K +E LL ++G+G ++
Sbjct: 74 LAKLIRPAGYFRVKAKRLKNFVHLLVEEYHG--ELEALFSLGLEKAREKLLLVKGIGPET 131
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
+ + L + F +D RI +R G DL +K P
Sbjct: 132 ADSMLLYAGNLPIFVIDAYTRRILLRHGLAHEDMSYEDLQSLFMKNLPK----------- 180
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
D K E H ++ GK FC+ + P C CPL+G
Sbjct: 181 ----DVKLFNEYHALLVACGKNFCRPKKPLCEECPLKG 214
>gi|379005350|ref|YP_005261022.1| putative EndoIII-related endonuclease [Pyrobaculum oguniense TE7]
gi|375160803|gb|AFA40415.1| putative EndoIII-related endonuclease [Pyrobaculum oguniense TE7]
Length = 218
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 83/179 (46%), Gaps = 23/179 (12%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
++ S+ E++R E+ + IK G A +K + + L I E L + D
Sbjct: 59 RLGSITPESLRGISLGELEELIKPAGMYRQRARYLKALADAFITLE--ITPEKLVKMGAD 116
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
++ L+ + G+G K+ + V L++L AFPVD ++ RIA R G G + +
Sbjct: 117 AARKLLMGLPGVGRKTADVV-LVNLGLPAFPVDTHITRIAKRWG-------VGSRYEDIS 168
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ + M+++Q ++ E H ++I FG+ CK R P C CP+ C F
Sbjct: 169 RWF--MEQLQ-----------KERCLEFHLKLIQFGRDICKARKPRCEECPIGERCPSF 214
>gi|448317595|ref|ZP_21507145.1| HhH-GPD family protein [Natronococcus jeotgali DSM 18795]
gi|445602986|gb|ELY56956.1| HhH-GPD family protein [Natronococcus jeotgali DSM 18795]
Length = 271
Score = 59.7 bits (143), Expect = 1e-05, Method: Composition-based stats.
Identities = 56/194 (28%), Positives = 81/194 (41%), Gaps = 28/194 (14%)
Query: 592 EAVRCADESEIADAIKERGQQN----IIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVK 647
E++ A+ SE+A+ I G N +I G +E L G D ++R P+ V+
Sbjct: 92 ESLADAERSELAETISSAGLYNQKSEMIIGAAEEILEEFGSAAGFDD--FVREEEPEDVR 149
Query: 648 EYLLDIEGLGLKSVECVRLL-SLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+ LL+I G+G K+ +CV L S + FPVD +V RI RLG P + L +E
Sbjct: 150 QRLLEIHGVGPKTADCVLLFASGRGGVFPVDTHVHRIYRRLGIAPPDADHEVVRAVLERE 209
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKH 762
P C H I FG+ +C R P C ACP+ C+
Sbjct: 210 VPAAK----------CGFG-------HTASIQFGREYCTARKPACLEDPDACPMADVCEQ 252
Query: 763 FASAVASARFALPG 776
A+ P
Sbjct: 253 VGVYPATGEVVDPA 266
>gi|425773936|gb|EKV12261.1| Base excision DNA repair protein [Penicillium digitatum PHI26]
gi|425782386|gb|EKV20298.1| Base excision DNA repair protein [Penicillium digitatum Pd1]
Length = 307
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 72/178 (40%), Gaps = 19/178 (10%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
D+ A+ E+++ + + G A I L ++ H + L L + ++V E
Sbjct: 142 TDYNAISKGGETKLQNVLSCGGLHIRKAKFIMSILRQVKARHDAYTLNHLWPLDDEQVME 201
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L G+G K+ CV L+L F VD ++ RI LGW PL P HL + P
Sbjct: 202 EFLSYNGVGPKTASCVLALTLNRQRFVVDTHIYRITGFLGWRPLHATPEQARAHLETKIP 261
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF--CKKRSPNCGACPLRGECKHFA 764
D+ + Y LH IT G+ CK G+C LR + A
Sbjct: 262 --DEFK---------------YSLHLLFITHGRECPECKAGGKLKGSCDLRKALRELA 302
>gi|448529547|ref|ZP_21620567.1| HhH-GPD family protein [Halorubrum hochstenium ATCC 700873]
gi|445708716|gb|ELZ60554.1| HhH-GPD family protein [Halorubrum hochstenium ATCC 700873]
Length = 232
Score = 59.3 bits (142), Expect = 1e-05, Method: Composition-based stats.
Identities = 44/163 (26%), Positives = 71/163 (43%), Gaps = 18/163 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ AD E+ + I+ G + A RI+ L + E G+ L +L + D K
Sbjct: 73 DFAAIEAADHEELRETIRVAGLPDQKAARIQRALTAIREETGGAYSLAFLDAMATDDAKA 132
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L +I+G+G K+ V VD +V R++ R G VP G H L ++ P
Sbjct: 133 WLTEIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESASNGRAHEVLDEQVP 192
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
+ +Y LH +I G+ C R+P+C
Sbjct: 193 -----------------DELIYPLHVLLIRHGREHCSARNPDC 218
>gi|15897075|ref|NP_341680.1| DNA endonuclease III, (ntH-1) [Sulfolobus solfataricus P2]
gi|284173420|ref|ZP_06387389.1| DNA endonuclease III, (ntH-1) [Sulfolobus solfataricus 98/2]
gi|384433576|ref|YP_005642934.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus solfataricus
98/2]
gi|1707778|emb|CAA69576.1| endonuclease III [Sulfolobus solfataricus P2]
gi|13813246|gb|AAK40470.1| DNA endonuclease III, probable (ntH-1) [Sulfolobus solfataricus P2]
gi|261601730|gb|ACX91333.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus solfataricus
98/2]
Length = 236
Score = 59.3 bits (142), Expect = 1e-05, Method: Composition-based stats.
Identities = 45/176 (25%), Positives = 81/176 (46%), Gaps = 23/176 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + A+ ++I A+K G A R+KE ++E + + ++ L N ++ LL
Sbjct: 69 EKLSNANLADIESALKISGLYRTKAKRLKEISRIILERYNGL-IDSLLNT--SNARDELL 125
Query: 652 DIEGLGLKSVECVRLLSLQHIA---FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+EG+G K+ + V L + FPVD ++ R++ RLG VP + L + +
Sbjct: 126 KLEGIGEKTADVVLLTCYGYYGYKVFPVDTHITRVSKRLGIVPTNAKYSLISSTLKELFS 185
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
D L LH+ +I G+ CK R P C +C ++ C++++
Sbjct: 186 AYD-----------------LLHLHHMLIAHGRQTCKARKPLCNSCIIKECCEYYS 224
>gi|407477461|ref|YP_006791338.1| endonuclease III [Exiguobacterium antarcticum B7]
gi|407061540|gb|AFS70730.1| Endonuclease III [Exiguobacterium antarcticum B7]
Length = 222
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 72/170 (42%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + A +I + IK G A IK RLV LH VP D+ L
Sbjct: 63 ERLAAAPVEQIEEKIKRLGLYRNKAKNIKALAERLVTLHDG-------EVPTDRAG--LE 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V ++ AF VD +V R++ RLG + + + L++ +
Sbjct: 114 ALPGVGRKTANVVLSVAFDVPAFAVDTHVERVSKRLGICRWKDNVTQVEVTLMRRF---- 169
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++ +LH+Q I FG+ CK + PNC CPL C+
Sbjct: 170 -------------KRERWSKLHHQFIFFGRYHCKAQRPNCLECPLLDMCR 206
>gi|374385131|ref|ZP_09642640.1| endonuclease III [Odoribacter laneus YIT 12061]
gi|373226557|gb|EHP48881.1| endonuclease III [Odoribacter laneus YIT 12061]
Length = 211
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 73/175 (41%), Gaps = 32/175 (18%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSI--DLEWLRNVPPDKVKE 648
EA+ A+ EI + IK N A + +LV + G + D+E L+ +P
Sbjct: 63 EAMAKAEVGEIYELIKSISYPNNKAKHLSAMAKKLVSDFDGKVPDDMELLQTLP------ 116
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
G+G K+ + ++ A PVD +V R+A R+G V P + L+K P
Sbjct: 117 ------GVGRKTANVIEAVAFHRPAMPVDTHVFRVADRIGLVTNATTPLETEKQLVKNIP 170
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ L H+ +I G+ C R P C +C +R C++F
Sbjct: 171 -----------------SEILSTAHHWLILHGRYTCTARRPKCESCGIRDYCRYF 208
>gi|453088940|gb|EMF16980.1| DNA glycosylase [Mycosphaerella populorum SO2202]
Length = 489
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 78/206 (37%), Gaps = 58/206 (28%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS--------------- 632
SVDW AVR A + ++ AI+ G N+ + IK L + E +
Sbjct: 228 SVDWNAVRLAPQQDVFKAIERGGLANVKSKDIKNILQMVYEENQERRAALLTSNDNASES 287
Query: 633 --------------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI 672
I L+ L + + + L+ G+G K+ CV L LQ
Sbjct: 288 QSAPESSTQTEEITKADQDVISLDHLHLMSTNDAIDKLITYPGIGPKTASCVALFCLQRP 347
Query: 673 AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT---- 728
+F VD +V R+ LGWVP G P +D+ Y + D K
Sbjct: 348 SFAVDTHVFRLVQYLGWVPKSTKKGQ---------PKVDRNTTY-----SHCDAKIPDEY 393
Query: 729 LYELHYQMITFGKIFCKKRSPNCGAC 754
Y+LHY +I GK C P C A
Sbjct: 394 KYKLHYLLIKHGKT-C----PRCRAA 414
>gi|257388060|ref|YP_003177833.1| HhH-GPD family protein [Halomicrobium mukohataei DSM 12286]
gi|257170367|gb|ACV48126.1| HhH-GPD family protein [Halomicrobium mukohataei DSM 12286]
Length = 236
Score = 59.3 bits (142), Expect = 1e-05, Method: Composition-based stats.
Identities = 47/163 (28%), Positives = 71/163 (43%), Gaps = 18/163 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ AD E+A+ I+ G + A RI+ L + E G+ L +L +P + K
Sbjct: 74 DFAAIEAADHDELAETIRVAGLPDQKAARIQRALTAIREETGGAYSLAFLDALPTAEAKA 133
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L DI+G+G K+ V VD +V R++ R G VP H L P
Sbjct: 134 WLTDIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPEDASNERAHDALDAVVP 193
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
D+ T Y LH +I G+ C R+P+C
Sbjct: 194 ----------------DELT-YPLHVLLIRHGRTHCSARNPDC 219
>gi|340345847|ref|ZP_08668979.1| DNA-(Apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum koreensis MY1]
gi|339520988|gb|EGP94711.1| DNA-(Apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum koreensis MY1]
Length = 217
Score = 59.3 bits (142), Expect = 1e-05, Method: Composition-based stats.
Identities = 37/139 (26%), Positives = 70/139 (50%), Gaps = 20/139 (14%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++E+ ID ++ + PD + E L+ + G+G K+ CV + + A PVD++V RI+
Sbjct: 94 RIIEVSKIIDSKY-SGIVPDTLDE-LIQLPGVGRKTANCVLVYAFDKPAIPVDIHVHRIS 151
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RLG V + P + + L+K+ P +K +++ + +G+ C
Sbjct: 152 NRLGLVNTKS-PEETELVLMKKIP-----------------KKYWIKINDTFVMYGQNIC 193
Query: 745 KKRSPNCGACPLRGECKHF 763
K +P C C ++ CK++
Sbjct: 194 KPINPMCNVCKIKKNCKYY 212
>gi|336264806|ref|XP_003347179.1| hypothetical protein SMAC_05479 [Sordaria macrospora k-hell]
gi|380093873|emb|CCC08838.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 560
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 65/150 (43%), Gaps = 21/150 (14%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL--------------------- 626
S++WE V + ++A AIK G N A IK L+ +
Sbjct: 371 SINWEKVATEPQEKLAQAIKVSGNGNQKAKNIKLILDMVALEMAQMAREKSVNNGGGEQE 430
Query: 627 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
V +++L+ + + D+ L+ G+G+KS CV L L+ F VD +V R
Sbjct: 431 VAFPETLNLDHMHTLTKDEAMTKLVRYPGIGIKSAACVTLFCLRKPCFGVDTHVHRFCRW 490
Query: 687 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
LGWVP + P D H + P K +++
Sbjct: 491 LGWVPEKANPEDCFRHCDVKVPDHLKYRLH 520
>gi|256830633|ref|YP_003159361.1| HhH-GPD family protein [Desulfomicrobium baculatum DSM 4028]
gi|256579809|gb|ACU90945.1| HhH-GPD family protein [Desulfomicrobium baculatum DSM 4028]
Length = 232
Score = 58.9 bits (141), Expect = 1e-05, Method: Composition-based stats.
Identities = 50/185 (27%), Positives = 80/185 (43%), Gaps = 21/185 (11%)
Query: 595 RCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEW--LRNVPPDKVKEYLLD 652
R DE E+A I+ G + A R++ FL + LH + DL+ LR + ++ LL+
Sbjct: 67 RLTDE-ELATFIRPAGAFRVKAARVRNFL---LFLHRTCDLDMDGLRGETVETLRPALLE 122
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
+ G+G ++ + + L +L +F VD RI R G VP G+L + P
Sbjct: 123 VSGIGPETADSILLYALGLPSFVVDAYTRRILNRHGLVPEDIAYGELREFFMDVLPP--- 179
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARF 772
D E H ++ GK +C KR C CPL + F+ + +
Sbjct: 180 ------------DPALYNEYHALIVRTGKNWCAKRQGKCPDCPLAVFLEEFSPLTCTGKS 227
Query: 773 ALPGP 777
+P P
Sbjct: 228 IIPHP 232
>gi|398399341|ref|XP_003853090.1| hypothetical protein MYCGRDRAFT_40778 [Zymoseptoria tritici IPO323]
gi|339472972|gb|EGP88066.1| hypothetical protein MYCGRDRAFT_40778 [Zymoseptoria tritici IPO323]
Length = 484
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 86/226 (38%), Gaps = 50/226 (22%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKER 609
R T KNS Q L++R G + SVDW+AVR A ++ AI+
Sbjct: 204 RLSANTTNKNSSTAFQ---GLVKRF---GTLKEGVGKGSVDWDAVRTASNHDVFKAIERG 257
Query: 610 GQQNIIAGRIKEFLN--------RLVELHGS--------------------------IDL 635
G I + I+ L R L GS I L
Sbjct: 258 GLAKIKSKDIQSILQIAYEENQERKAALTGSAEDPAGAEHAQESEKQDEIEKAEQNVISL 317
Query: 636 EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL 695
+ L + D +++ G+GLK+ CV L + H +F VD +V R+ LGWVP
Sbjct: 318 DHLHLLSRDDAIRKMVEFPGIGLKTASCVALFCMGHASFAVDTHVFRLCQYLGWVPKSTK 377
Query: 696 PGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGK 741
G+ P +D+ + + D+ Y LH +I GK
Sbjct: 378 KGE---------PKVDRDTTFSHCDVRIPDE-LKYALHQLLIKHGK 413
>gi|11499282|ref|NP_070520.1| endonuclease III [Archaeoglobus fulgidus DSM 4304]
gi|2648861|gb|AAB89556.1| endonuclease III (nth) [Archaeoglobus fulgidus DSM 4304]
Length = 209
Score = 58.9 bits (141), Expect = 1e-05, Method: Composition-based stats.
Identities = 52/161 (32%), Positives = 74/161 (45%), Gaps = 30/161 (18%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E EIA+ IK G + A R+KE +LVE + S VP E L+ + G+G
Sbjct: 71 EEEIAELIKGVGFYRVKAKRLKELAKKLVEDYSS-------EVPLSF--EELVKLPGIGR 121
Query: 659 KSVECVRLLSLQHI-AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
KS V L+ I A PVD +V RIA RLGW P + L + +P +
Sbjct: 122 KSANVV--LAYSDIPAIPVDTHVHRIANRLGWARTTK-PEETEEVLKRLFP------LEF 172
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
W ++ + M+ FG+ CK + P C CP++G
Sbjct: 173 WEKV-----------NRAMVGFGQTVCKPQKPLCDECPIKG 202
>gi|448480701|ref|ZP_21604703.1| HhH-GPD family protein [Halorubrum arcis JCM 13916]
gi|445822018|gb|EMA71793.1| HhH-GPD family protein [Halorubrum arcis JCM 13916]
Length = 233
Score = 58.9 bits (141), Expect = 1e-05, Method: Composition-based stats.
Identities = 50/177 (28%), Positives = 77/177 (43%), Gaps = 20/177 (11%)
Query: 576 STGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSID 634
S E D+ D D+ AV AD E+ + I+ G + A RI+ L + E G+
Sbjct: 61 SRAAEALFDRYD--DFAAVESADHEELKETIRVAGLADQKAARIQRALTAIREETGGAYS 118
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
L +L +P D+ K++L +I+G+G K+ V VD +V R++ R G VP
Sbjct: 119 LAFLDAMPTDEAKDWLTEIKGVGPKTASVVLNFHFGKPTIAVDTHVERVSKRFGLVPESA 178
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
H L ++ P D+ T Y LH +I G+ C R +C
Sbjct: 179 SNERAHEALDEQVP----------------DELT-YPLHVLLIRHGREHCSARGADC 218
>gi|451996449|gb|EMD88916.1| hypothetical protein COCHEDRAFT_1226992 [Cochliobolus
heterostrophus C5]
Length = 525
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 75/191 (39%), Gaps = 41/191 (21%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN--------RLVELHGS------- 632
SVDW VR AD+ EI +AIK G ++ + IK+ L R EL S
Sbjct: 243 SVDWNKVRQADQKEIFEAIKSGGLADVKSKDIKKILQMVWEENQARRKELQSSSHKAPGS 302
Query: 633 -------------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
+ L+ L + + L G+G K+ CV L LQ +
Sbjct: 303 ANEAEEEKNTEIEKANQDVVSLDHLHLLSSEDAFNALTKYPGIGPKTASCVLLFCLQRPS 362
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGD---LHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLY 730
F VD +V R+ LGWV P PGD L + + Y + D Y
Sbjct: 363 FAVDTHVFRLCKWLGWV---PPPGDSRGLAPGAKGTFAGPTRNSTYAHCEVRIPDH-LKY 418
Query: 731 ELHYQMITFGK 741
+LHY +I GK
Sbjct: 419 QLHYLLIKHGK 429
>gi|162447100|ref|YP_001620232.1| endonuclease III [Acholeplasma laidlawii PG-8A]
gi|161985207|gb|ABX80856.1| endonuclease III [Acholeplasma laidlawii PG-8A]
Length = 214
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 26/172 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD ++ D IK G + I RLVE + + VP ++ + L + G+
Sbjct: 67 ADVDDVMDTIKTIGLYKTKSKNIIGLAKRLVEDYDGL-------VPSER--KDLESLPGV 117
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V + A VD ++ RI+ RLG ++ + L K++P
Sbjct: 118 GRKTANVVLSNAFGIPALAVDTHILRISKRLGLADETDDVLEVEMKLNKQFP------KE 171
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
LW ++LH+Q+I FG+ C R PNC C ++ C HF + ++
Sbjct: 172 LW-----------HKLHHQLIFFGRYHCIARKPNCDTCKMQDMCPHFKNVIS 212
>gi|448510854|ref|ZP_21616067.1| HhH-GPD family protein [Halorubrum distributum JCM 9100]
gi|448523661|ref|ZP_21618848.1| HhH-GPD family protein [Halorubrum distributum JCM 10118]
gi|445695608|gb|ELZ47710.1| HhH-GPD family protein [Halorubrum distributum JCM 9100]
gi|445700734|gb|ELZ52725.1| HhH-GPD family protein [Halorubrum distributum JCM 10118]
Length = 233
Score = 58.9 bits (141), Expect = 2e-05, Method: Composition-based stats.
Identities = 50/177 (28%), Positives = 77/177 (43%), Gaps = 20/177 (11%)
Query: 576 STGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSID 634
S E D+ D D+ AV AD E+ + I+ G + A RI+ L + E G+
Sbjct: 61 SRAAEALFDRYD--DFAAVESADHEELKETIRVAGLTDQKAARIQRALTAIREETGGAYS 118
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
L +L +P D+ K++L +I+G+G K+ V VD +V R++ R G VP
Sbjct: 119 LAFLDAMPTDEAKDWLTEIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESA 178
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
H L ++ P D+ T Y LH +I G+ C R +C
Sbjct: 179 SNERAHEALDEQVP----------------DELT-YPLHVLLIRHGREHCSARGADC 218
>gi|389861151|ref|YP_006363391.1| HhH-GPD family protein [Thermogladius cellulolyticus 1633]
gi|388526055|gb|AFK51253.1| HhH-GPD family protein [Thermogladius cellulolyticus 1633]
Length = 227
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 83/181 (45%), Gaps = 19/181 (10%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
++ + E V D E+A IK G A RI E + L + + E +R +
Sbjct: 60 RLGKLTPEKVLSLDVGELAQLIKPAGLHLQRARRIVELASYLRDRLEEFESE-VRRMDVL 118
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ +E L+++ G+G K+ + V L+ AFPVD ++ RI RLG+V G +
Sbjct: 119 EAREVLMNLPGVGDKTADVVLLVYFGKPAFPVDTHIKRITTRLGFV-----KGGSY---- 169
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
K+ + W C K L E H +I G+ CK R P C CP++ C+++
Sbjct: 170 ------KKVSGF-WQS-CLPPDKYL-ETHLLLIQHGRAICKARKPLCHECPIKEFCEYYK 220
Query: 765 S 765
S
Sbjct: 221 S 221
>gi|7406460|emb|CAB85562.1| putative protein [Arabidopsis thaliana]
Length = 1017
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 72/130 (55%), Gaps = 19/130 (14%)
Query: 76 LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 135
L Q + A+II + ++L + D + NA+V Y+ GALVP++
Sbjct: 880 LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 922
Query: 136 IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 195
K++K +P+V +D T R+WNL+M D D+ ++ EKWW++ER VF GR SF
Sbjct: 923 --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 980
Query: 196 TARMHLILGD 205
ARMHL+ G+
Sbjct: 981 IARMHLVQGE 990
>gi|409728179|ref|ZP_11271049.1| HhH-GPD family protein [Halococcus hamelinensis 100A6]
gi|448722675|ref|ZP_21705208.1| HhH-GPD family protein [Halococcus hamelinensis 100A6]
gi|445789100|gb|EMA39793.1| HhH-GPD family protein [Halococcus hamelinensis 100A6]
Length = 270
Score = 58.9 bits (141), Expect = 2e-05, Method: Composition-based stats.
Identities = 62/232 (26%), Positives = 92/232 (39%), Gaps = 35/232 (15%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
+ SD Q D L Y GE+ + A+ D+ ++A+ I+ G N +
Sbjct: 65 QNTSDKASQPAHDALMARYGEGEDLA---------VALSETDQDDLAETIQPAGLYNQKS 115
Query: 617 GRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA- 673
+ R+VE +G E++ P V++ LLD G+G K+ +CV L S
Sbjct: 116 ETLGRIAGRVVEEYGGAAAFDEFVTAEDPATVRDTLLDFSGVGPKTADCVLLFSGGRAGV 175
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
FPVD +V RI R+G P + L + P +K + H
Sbjct: 176 FPVDTHVHRIYRRIGVAPPDADHEGVRAVLEDQVPA----------------EKCGFG-H 218
Query: 734 YQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASARFALP--GPSE 779
I FG+ FCK R P C ACP+ C A+ P P+E
Sbjct: 219 TATIQFGREFCKARKPACLDDPDACPMADLCDQVGVYPATDEVVDPADAPAE 270
>gi|239618511|ref|YP_002941833.1| endonuclease III [Kosmotoga olearia TBF 19.5.1]
gi|239507342|gb|ACR80829.1| endonuclease III [Kosmotoga olearia TBF 19.5.1]
Length = 210
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 89/213 (41%), Gaps = 40/213 (18%)
Query: 564 VQQDWDLLRRIYST---------GEERSHDKMDSV--DWEAVRCADESEIADAIKERGQQ 612
+ QD D R + ST E + K+ SV D A+ A ++ + IK G
Sbjct: 20 IHQDSDPYRVLVSTVLSQRTRDENTEVASKKLFSVYPDVFAIAKAKPEDLYNLIKAAGMY 79
Query: 613 NIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQH 671
A RI E +VE + G + PD ++E L + G+G K+ V +S
Sbjct: 80 RQKAERIVEISKIIVETYNGKV---------PDTLEE-LTKLPGVGRKTANIVLNVSFGK 129
Query: 672 IAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYE 731
A VD +V RI+ RLGW+ + P L K P LW
Sbjct: 130 AALAVDTHVHRISNRLGWIKTKQ-PEQSEFELQKILP------EELWG-----------P 171
Query: 732 LHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
L+ M+ FG+ CK +P C CP+ C++F+
Sbjct: 172 LNGSMVEFGRRVCKPVNPQCNECPINSCCRYFS 204
>gi|167042822|gb|ABZ07540.1| putative HhH-GPD superfamily base excision DNA repair protein
[uncultured marine microorganism HF4000_ANIW137I15]
Length = 257
Score = 58.5 bits (140), Expect = 2e-05, Method: Composition-based stats.
Identities = 49/175 (28%), Positives = 79/175 (45%), Gaps = 13/175 (7%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E +R + +A I+ N+ A R+K F+ L H DL L +P D +++YLL
Sbjct: 72 EGLRRIPKFRLAHLIRPARFFNVKADRLKAFVRFLWVGHQG-DLNLLFGLPADLLRKYLL 130
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+++G+G ++ + + L + + F VD RI RLG P G L+E
Sbjct: 131 ELKGIGPETADSILLYAAEFPVFVVDAYTHRIFSRLGLYQ-GPPSGRKGYEALQEG---- 185
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
+ R D E H ++ GK +C+ R P C CPL C + ++A
Sbjct: 186 ------FHRFLSRDAPLYNEYHALLVELGKEYCRPR-PKCEFCPLSSICAYPSAA 233
>gi|163852137|ref|YP_001640180.1| HhH-GPD family protein [Methylobacterium extorquens PA1]
gi|163663742|gb|ABY31109.1| HhH-GPD family protein [Methylobacterium extorquens PA1]
Length = 254
Score = 58.5 bits (140), Expect = 2e-05, Method: Composition-based stats.
Identities = 51/172 (29%), Positives = 76/172 (44%), Gaps = 14/172 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW + A EI I + A RIK L + E G++ L++L ++ D+ + +
Sbjct: 86 DWTEIEAASVEEIEATIHGVTWPELKAPRIKAVLAAVQERVGALSLDFLGDMSVDEARGW 145
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L I G+G K+ V S L+ A PVD + R+A R G + + G H L + P
Sbjct: 146 LEAIPGIGPKTSAAVLSFSTLRMPALPVDSHHHRVAQRTGLIGAKVDVGPSHAVLRAQLP 205
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D W + LY+ H ++ G+ C RSP CG C L C
Sbjct: 206 --DD-----W------SAQKLYDNHEVLMLHGQRVCFHRSPTCGRCVLLDIC 244
>gi|448613277|ref|ZP_21663157.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mucosum ATCC BAA-1512]
gi|445740174|gb|ELZ91680.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Haloferax mucosum ATCC BAA-1512]
Length = 268
Score = 58.5 bits (140), Expect = 2e-05, Method: Composition-based stats.
Identities = 53/178 (29%), Positives = 81/178 (45%), Gaps = 24/178 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS-IDLE-WLRNVPPDKVKEY 649
E++ AD I DAI+ G N + I+ ++ GS D + ++R P V++
Sbjct: 90 ESLAAADREGIIDAIRSGGLYNRKSKLIQGVAEDVLTNFGSEADFDRYVREEDPSTVRDR 149
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL++ G+G K+ +CV L + + FPVD +V RIA RLG + + L +E P
Sbjct: 150 LLEMNGVGPKTADCVLLFAGGRGGVFPVDTHVHRIARRLGIASVDADHEAVRATLEREVP 209
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKH 762
+K + H MI FG+ FC RSP C CP+ C+
Sbjct: 210 A----------------EKCGFG-HTAMIQFGREFCTARSPVCLDDDDECPMADVCEQ 250
>gi|227494411|ref|ZP_03924727.1| endonuclease III [Actinomyces coleocanis DSM 15436]
gi|226832145|gb|EEH64528.1| endonuclease III [Actinomyces coleocanis DSM 15436]
Length = 226
Score = 58.5 bits (140), Expect = 2e-05, Method: Composition-based stats.
Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 27/172 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +++A + G QN A +++E LV E+ VP ++ E L + G+
Sbjct: 82 ADHADLASITRVLGFQNKRATQLQELSQALVA-------EYAGEVPANR--EALQKLPGV 132
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V + A VD +VGR+ RLGW + P + + K P D
Sbjct: 133 GRKTAHVVLGNAFGIPAITVDTHVGRVTTRLGWSQAKT-PLAIEKDIAKLLPGYD----- 186
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
W LC+ ++I G+ C R P CG CPL+ C A +A
Sbjct: 187 -WTLLCH-----------RLIEHGRAICDARKPLCGQCPLQQLCPASADYLA 226
>gi|448449860|ref|ZP_21591955.1| HhH-GPD family protein [Halorubrum litoreum JCM 13561]
gi|445812626|gb|EMA62617.1| HhH-GPD family protein [Halorubrum litoreum JCM 13561]
Length = 233
Score = 58.5 bits (140), Expect = 2e-05, Method: Composition-based stats.
Identities = 50/177 (28%), Positives = 77/177 (43%), Gaps = 20/177 (11%)
Query: 576 STGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSID 634
S E D+ D D+ AV AD E+ + I+ G + A RI+ L + E G+
Sbjct: 61 SRAAEALFDRYD--DFAAVEGADHEELKETIRVAGLADQKAARIQRALTAIREETGGAYS 118
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
L +L +P D+ K++L +I+G+G K+ V VD +V R++ R G VP
Sbjct: 119 LAFLDAMPTDEAKDWLTEIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESA 178
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
H L ++ P D+ T Y LH +I G+ C R +C
Sbjct: 179 SNERAHEALDEQVP----------------DELT-YPLHVLLIRHGREHCSARGADC 218
>gi|389844120|ref|YP_006346200.1| endonuclease III [Mesotoga prima MesG1.Ag.4.2]
gi|387858866|gb|AFK06957.1| endonuclease III, DNA-(apurinic or apyrimidinic site) lyase
[Mesotoga prima MesG1.Ag.4.2]
Length = 220
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 71/168 (42%), Gaps = 27/168 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A ++ D IK G A RI +VE V PD ++E L+ I G+
Sbjct: 66 AKPEDLYDLIKASGMYRQKAARIINCARMIVESFA--------GVVPDTLEE-LVTIPGV 116
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V +S + A VD +V RIA RLGWV + P D L+K P
Sbjct: 117 GRKTANIVLNVSFKKEALAVDTHVHRIANRLGWVKTK-TPDDTEFALMKILP------PS 169
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+W ++ M+ FG+ C+ P C C + C++F+
Sbjct: 170 IWG-----------PVNGSMVEFGREICRPIGPKCNLCGISQCCEYFS 206
>gi|303324187|ref|XP_003072081.1| HhH-GPD family base excision DNA repair protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240111791|gb|EER29936.1| HhH-GPD family base excision DNA repair protein [Coccidioides
posadasii C735 delta SOWgp]
gi|320037075|gb|EFW19013.1| helix-hairpin-helix domain-containing protein [Coccidioides
posadasii str. Silveira]
Length = 444
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 57/127 (44%), Gaps = 23/127 (18%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
SV+W VR A EI DAIK G + + IKE L + E
Sbjct: 238 SVNWNKVREATVQEIRDAIKSGGLADTKSKHIKEILTMVYEQNIARRKDLVDAKQGGDET 297
Query: 629 LHGS----IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
+H S + L+ + + D+ + + G+G+K+ CV L L+ F VD +V R+
Sbjct: 298 VHLSDEHMLSLDHMHGLSKDEAMQEFIKYPGIGVKTAACVVLFCLRRPCFAVDTHVFRLC 357
Query: 685 VRLGWVP 691
LGWVP
Sbjct: 358 KWLGWVP 364
>gi|333987518|ref|YP_004520125.1| HhH-GPD family protein [Methanobacterium sp. SWAN-1]
gi|333825662|gb|AEG18324.1| HhH-GPD family protein [Methanobacterium sp. SWAN-1]
Length = 212
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 75/170 (44%), Gaps = 35/170 (20%)
Query: 602 IADA--------IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDI 653
IADA IK+ G ++ A R+KE I E +V PD +KE LL +
Sbjct: 64 IADAPIENIEILIKKSGFYHVKAKRVKEV--------SKIIHEEYDDVVPDDMKE-LLSL 114
Query: 654 EGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKI 713
G+G K+ CV + A PVDV+V RI+ R+G V + P + L+K P
Sbjct: 115 PGVGRKTANCVLVYGFHKDAIPVDVHVHRISNRIGLVDTKT-PEQTEVELMKTVP----- 168
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ Y P L+ ++ FG+ C+ P CP+ C ++
Sbjct: 169 RKYWLP------------LNDLLVQFGQTICRPIGPKHEICPIADLCDYY 206
>gi|452988619|gb|EME88374.1| hypothetical protein MYCFIDRAFT_25275 [Pseudocercospora fijiensis
CIRAD86]
Length = 470
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 70/189 (37%), Gaps = 44/189 (23%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
SVDW+AVR A E+ AI+ G I + I+E L E
Sbjct: 227 SVDWDAVRRAPNHEVFKAIERGGLAKIKSKDIQEILQMAYEENQERRSALTSPSGDPAGA 286
Query: 629 ---------------LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
I L+ L + ++ E +L G+G K+ CV L LQ +
Sbjct: 287 EHEPESEKQEEIAKAKQNVISLDHLHLLSTNEAIEKMLSFPGIGPKTASCVALFCLQRPS 346
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
F VD +V R+ LGWVP G+ + Y D + + Y LH
Sbjct: 347 FAVDTHVFRLCQYLGWVPKSVKKGEPKVDRNTTYSHCD----------ARIPDEFKYPLH 396
Query: 734 YQMITFGKI 742
+I GK+
Sbjct: 397 QLLIKHGKV 405
>gi|390443653|ref|ZP_10231441.1| endonuclease III [Nitritalea halalkaliphila LW7]
gi|389666256|gb|EIM77710.1| endonuclease III [Nitritalea halalkaliphila LW7]
Length = 233
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 19/148 (12%)
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV 690
G + +E ++ P V E L+ + G+G K+ + + VD +V R++ RLG
Sbjct: 94 GKMLVEEFQSEVPQTVNE-LIKLPGVGRKTANVITSVIWHQPNMAVDTHVFRVSRRLGLA 152
Query: 691 PLQP-LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSP 749
PL P ++ HL++ P ++ +++ H+ +I G+ C RSP
Sbjct: 153 PLTAKTPLEVEKHLIRHIP-----------------KEHVHQAHHWLILHGRYTCLARSP 195
Query: 750 NCGACPLRGECKHFASAVASARFALPGP 777
C AC + CKHF + + R P P
Sbjct: 196 KCSACSITHLCKHFERGLRAKRKKDPLP 223
>gi|392869327|gb|EAS27228.2| HhH-GPD family base excision DNA repair protein [Coccidioides
immitis RS]
Length = 444
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 57/127 (44%), Gaps = 23/127 (18%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
SV+W VR A EI DAIK G + + IKE L + E
Sbjct: 238 SVNWNKVREAPVQEIRDAIKSGGLADTKSKHIKEILTMVFEQNIARRKDLVDAKQGGNET 297
Query: 629 LHGS----IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
+H S + L+ + + D+ + + G+G+K+ CV L L+ F VD +V R+
Sbjct: 298 VHLSDEHMLSLDHMHGLSKDEAMQEFIKYPGIGVKTAACVVLFCLRRPCFAVDTHVFRLC 357
Query: 685 VRLGWVP 691
LGWVP
Sbjct: 358 KWLGWVP 364
>gi|119173285|ref|XP_001239121.1| hypothetical protein CIMG_10143 [Coccidioides immitis RS]
Length = 434
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 57/127 (44%), Gaps = 23/127 (18%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
SV+W VR A EI DAIK G + + IKE L + E
Sbjct: 228 SVNWNKVREAPVQEIRDAIKSGGLADTKSKHIKEILTMVFEQNIARRKDLVDAKQGGNET 287
Query: 629 LHGS----IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
+H S + L+ + + D+ + + G+G+K+ CV L L+ F VD +V R+
Sbjct: 288 VHLSDEHMLSLDHMHGLSKDEAMQEFIKYPGIGVKTAACVVLFCLRRPCFAVDTHVFRLC 347
Query: 685 VRLGWVP 691
LGWVP
Sbjct: 348 KWLGWVP 354
>gi|392530023|ref|ZP_10277160.1| endonuclease III [Carnobacterium maltaromaticum ATCC 35586]
Length = 214
Score = 58.2 bits (139), Expect = 3e-05, Method: Composition-based stats.
Identities = 48/170 (28%), Positives = 72/170 (42%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA I D IK G A I+ +L+E E+ VP K ++ L
Sbjct: 63 EAFLAVPVEAIMDDIKTIGLFRNKARHIQGCCRKLIE-------EFNGEVP--KTRKELT 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G+K+ V + A VD +V R+ RLG L+ G++ L+K+ P
Sbjct: 114 SLPGVGIKTANVVLGDAFGVPAIAVDTHVERVTKRLGICRLKDSVGEVEATLMKKLP--- 170
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ E H+++I FG+ C RSP C CPL EC+
Sbjct: 171 --------------ESMWVEAHHRLIFFGRYHCTARSPKCDICPLLEECR 206
>gi|406911545|gb|EKD51317.1| Helix-hairpin-helix motif:HhH-GPD [uncultured bacterium]
Length = 212
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 72/171 (42%), Gaps = 17/171 (9%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+ +E + +A+ IK G NI R+K FL + G D ++ ++E
Sbjct: 58 LSFERLLATPAQSVAELIKPAGYFNIKTKRLKSFLGAIKREQGDFDTL--SSLSTSSLRE 115
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+LL I G+G ++ + + L + + F VD R+ R L+ E
Sbjct: 116 FLLSISGIGPETADSMMLYAFERPVFVVDAYTNRMLTRHS--------------LVDEEA 161
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
++Q Y L D E H ++ GK FC+K +P C CPLR E
Sbjct: 162 DYFRLQEYFEDHL-EKDVALFNEYHALIVKLGKEFCRKSNPKCEECPLREE 211
>gi|94268173|ref|ZP_01291108.1| Helix-hairpin-helix motif:HhH-GPD [delta proteobacterium MLMS-1]
gi|93451700|gb|EAT02475.1| Helix-hairpin-helix motif:HhH-GPD [delta proteobacterium MLMS-1]
Length = 217
Score = 58.2 bits (139), Expect = 3e-05, Method: Composition-based stats.
Identities = 45/158 (28%), Positives = 72/158 (45%), Gaps = 19/158 (12%)
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
+E+A I+ G NI AGR++ L R+ H S L ++ LL+I+G+G +
Sbjct: 74 AELAALIRPAGYYNIKAGRLQNLLGRIHAEHDS--LAAFLAQQSTTLRRQLLEIKGIGPE 131
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+ + + L + Q F VD RI R G +P + +Y + +I P
Sbjct: 132 TADSIMLYAAQRPVFVVDTYTHRIFSRHGLLPEE-----------ADYHQVQEIFHDALP 180
Query: 720 RLCYLDQKTLY-ELHYQMITFGKIFCKKRSPNCGACPL 756
+ LY E H ++ GK +C+K +P C CPL
Sbjct: 181 A-----EAPLYNEYHALIVRLGKEYCRKSNPRCPTCPL 213
>gi|347525379|ref|YP_004832127.1| endonuclease III / DNA lyase [Lactobacillus ruminis ATCC 27782]
gi|345284338|gb|AEN78191.1| endonuclease III / DNA lyase [Lactobacillus ruminis ATCC 27782]
Length = 213
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/169 (31%), Positives = 74/169 (43%), Gaps = 30/169 (17%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD SE+ IK G + A + LVE VP K +E L+D+ G+
Sbjct: 68 ADPSEVEKYIKTIGLYHNKAKYLVGCAKDLVERFDG-------KVP--KTREELMDLTGV 118
Query: 657 GLKS--VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
G K+ VE + AF VD +V R+A RL V +P L I E +M+++
Sbjct: 119 GRKTANVELAECFGIP--AFAVDTHVSRVANRLALV--EPTKNVLEI----ERQLMEQVD 170
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
W E H+ +I +G+ C R P C CPL ECK+F
Sbjct: 171 ESRW-----------IEAHHLLIAWGRNQCLARKPKCETCPLSFECKYF 208
>gi|448726638|ref|ZP_21709032.1| HhH-GPD family protein [Halococcus morrhuae DSM 1307]
gi|445793968|gb|EMA44532.1| HhH-GPD family protein [Halococcus morrhuae DSM 1307]
Length = 267
Score = 58.2 bits (139), Expect = 3e-05, Method: Composition-based stats.
Identities = 51/189 (26%), Positives = 80/189 (42%), Gaps = 24/189 (12%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEG 655
++S +A+ I+ G N + + R+V+ +GS E++ + P V+E LLD G
Sbjct: 96 EQSTLAETIEPAGLYNQKSETMIAIAERVVDEYGSAAAFDEFVTDEEPATVRETLLDFSG 155
Query: 656 LGLKSVECVRLLSLQHIA-FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
+G K+ +CV L S FPVD +V RI RLG P V + ++
Sbjct: 156 VGPKTADCVLLFSGGRAGVFPVDTHVHRIYRRLGIAPADA----------DHEKVREVLE 205
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVASA 770
+ C H I FG+ +C RSP C ACP+ C+ A+
Sbjct: 206 AQVPAEKCGFG-------HTATIQFGREYCTARSPACLDDPDACPMGDLCEQVGVYPATD 258
Query: 771 RFALPGPSE 779
P ++
Sbjct: 259 EVVDPAEAD 267
>gi|448428859|ref|ZP_21584485.1| HhH-GPD family protein [Halorubrum terrestre JCM 10247]
gi|445675837|gb|ELZ28365.1| HhH-GPD family protein [Halorubrum terrestre JCM 10247]
Length = 233
Score = 58.2 bits (139), Expect = 3e-05, Method: Composition-based stats.
Identities = 49/177 (27%), Positives = 77/177 (43%), Gaps = 20/177 (11%)
Query: 576 STGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSID 634
S E D+ D D+ A+ AD E+ + I+ G + A RI+ L + E G+
Sbjct: 61 SRAAEALFDRYD--DFAAIEGADHEELKETIRVAGLADQKAARIQRALTAIREETGGAYS 118
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
L +L +P D+ K++L +I+G+G K+ V VD +V R++ R G VP
Sbjct: 119 LAFLDAMPTDEAKDWLTEIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESA 178
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
H L ++ P D+ T Y LH +I G+ C R +C
Sbjct: 179 SNERAHEALDEQVP----------------DELT-YPLHVLLIRHGREHCSARGADC 218
>gi|313886400|ref|ZP_07820122.1| endonuclease III [Porphyromonas asaccharolytica PR426713P-I]
gi|312924146|gb|EFR34933.1| endonuclease III [Porphyromonas asaccharolytica PR426713P-I]
Length = 219
Score = 58.2 bits (139), Expect = 3e-05, Method: Composition-based stats.
Identities = 36/121 (29%), Positives = 54/121 (44%), Gaps = 18/121 (14%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL-PGDLHIHLLK 705
+E L + G+G KS + + + A PVD +V R+A R+G + P + L+K
Sbjct: 109 REELEALPGVGRKSASVMLAVCFETPAMPVDTHVYRVAKRIGLASSRATTPLAVEQALVK 168
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
P Q L H+Q+I G+ CK R P C C L C+H+A+
Sbjct: 169 RIP-----------------QAQLIRAHHQLILLGRYICKARKPLCAECTLHACCRHYAT 211
Query: 766 A 766
A
Sbjct: 212 A 212
>gi|255305554|ref|ZP_05349726.1| endonuclease III [Clostridium difficile ATCC 43255]
Length = 405
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 29/164 (17%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IK G + +IK+ +L EL+ G + PD + E L+ + G+G K
Sbjct: 263 EISKEIKSCGLYKSKSQKIKDTSEQLCELYDGEV---------PDSL-EKLIKLPGVGRK 312
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+ V + H A VD +V R++ R+G V +P P E+ +M+ I W
Sbjct: 313 TAGVVLSNAFNHPAIAVDTHVFRVSNRIGIVD-EPNPQKT------EFALMEAIPKERWS 365
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
H+ +I G+ CK R+P C +CP++ +C ++
Sbjct: 366 HS-----------HHVLIFHGRRMCKARNPECASCPIKEDCNYY 398
>gi|433590937|ref|YP_007280433.1| putative endoIII-related endonuclease [Natrinema pellirubrum DSM
15624]
gi|448334470|ref|ZP_21523645.1| HhH-GPD family protein [Natrinema pellirubrum DSM 15624]
gi|433305717|gb|AGB31529.1| putative endoIII-related endonuclease [Natrinema pellirubrum DSM
15624]
gi|445619802|gb|ELY73319.1| HhH-GPD family protein [Natrinema pellirubrum DSM 15624]
Length = 272
Score = 57.8 bits (138), Expect = 3e-05, Method: Composition-based stats.
Identities = 52/189 (27%), Positives = 86/189 (45%), Gaps = 26/189 (13%)
Query: 579 EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--E 636
E D +D V E++ A++S +A+ I G N + I + ++E GS
Sbjct: 81 ERYDGDDIDLV--ESLAAAEQSTLAETISGAGLYNQKSETIIDTAEWVLEEFGSAAAFDA 138
Query: 637 WLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPL 695
++++ PP +V+E LL++ G+G K+ +CV L + + FPVD +V RI R+G P
Sbjct: 139 FVKDEPPAEVRETLLEVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAPPDAD 198
Query: 696 PGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC---- 751
+ L ++ P C H I FG+ +C+ R+P C
Sbjct: 199 HEAVREVLERDVPAAK----------CGFG-------HTATIQFGREYCRARTPACLEDP 241
Query: 752 GACPLRGEC 760
ACP+ C
Sbjct: 242 DACPMADRC 250
>gi|448501053|ref|ZP_21612062.1| HhH-GPD family protein [Halorubrum coriense DSM 10284]
gi|445695794|gb|ELZ47894.1| HhH-GPD family protein [Halorubrum coriense DSM 10284]
Length = 233
Score = 57.8 bits (138), Expect = 3e-05, Method: Composition-based stats.
Identities = 46/176 (26%), Positives = 74/176 (42%), Gaps = 20/176 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ AD E+ + I+ G + A RI+ L + E G+ L +L +P D K
Sbjct: 73 DFAAIEAADHEELRETIRVAGLPDQKAARIQRSLTAIREETGGAYSLAFLDALPTDDAKA 132
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L +I+G+G K+ V VD +V R++ R G VP H L + P
Sbjct: 133 WLTEIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESASNERAHDVLDERVP 192
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG--ACPLRGECKH 762
+ +Y LH +I G+ C R+ +C C +C+H
Sbjct: 193 -----------------DELIYPLHVLLIRHGREHCSARNADCDNPVCEAYCDCEH 231
>gi|305664312|ref|YP_003860600.1| HhH-GPD family protein [Ignisphaera aggregans DSM 17230]
gi|304378881|gb|ADM28720.1| HhH-GPD family protein [Ignisphaera aggregans DSM 17230]
Length = 239
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 77/179 (43%), Gaps = 27/179 (15%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID-----LEWLRNV 641
D + + + D S I +AI +AG L L+EL I L+N+
Sbjct: 77 DVITPDRILSIDPSVIINAIN-------VAGLANRRLQSLLELSRHIKENPKFFNDLKNL 129
Query: 642 PPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
D ++ LL I G+G K+ + L+ + FP+D ++ R+ RLG V
Sbjct: 130 SVDDARKALLSIYGIGYKTADVFLLMIYKKPTFPIDTHIMRVLKRLGIV----------- 178
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
H Y + K + + + + L LH +I G++ CK R+P C CP+ +C
Sbjct: 179 HEDMGYEDIRKFILGVVEH----NPEELLSLHISLIAHGRMICKARNPRCSECPINTKC 233
>gi|126698144|ref|YP_001087041.1| endonuclease III [Clostridium difficile 630]
gi|260682265|ref|YP_003213550.1| endonuclease iii [Clostridium difficile CD196]
gi|260685864|ref|YP_003216997.1| endonuclease iii [Clostridium difficile R20291]
gi|115249581|emb|CAJ67398.1| Endonuclease III [Clostridium difficile 630]
gi|260208428|emb|CBA60983.1| endonuclease iii [Clostridium difficile CD196]
gi|260211880|emb|CBE02318.1| endonuclease iii [Clostridium difficile R20291]
Length = 201
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 29/164 (17%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IK G + +IK+ +L EL+ G + PD + E L+ + G+G K
Sbjct: 59 EISKEIKSCGLYKSKSQKIKDTSEQLCELYDGEV---------PDSL-EKLIKLPGVGRK 108
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+ V + H A VD +V R++ R+G V +P P E+ +M+ I W
Sbjct: 109 TAGVVLSNAFNHPAIAVDTHVFRVSNRIGIVD-EPNPQK------TEFALMEAIPKERWS 161
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
H+ +I G+ CK R+P C +CP++ +C ++
Sbjct: 162 HS-----------HHVLIFHGRRMCKARNPECASCPIKEDCNYY 194
>gi|255099671|ref|ZP_05328648.1| endonuclease III [Clostridium difficile QCD-63q42]
Length = 405
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 29/164 (17%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IK G + +IK+ +L EL+ G + PD + E L+ + G+G K
Sbjct: 263 EISKEIKSCGLYKSKSQKIKDTSEQLCELYDGEV---------PDSL-EKLIKLPGVGRK 312
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+ V + H A VD +V R++ R+G V +P P E+ +M+ I W
Sbjct: 313 TAGVVLSNAFNHPAIAVDTHVFRVSNRIGIVD-EPNPQKT------EFALMEAIPKERWS 365
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
H+ +I G+ CK R+P C +CP++ +C ++
Sbjct: 366 HS-----------HHVLIFHGRRMCKARNPECASCPIKEDCNYY 398
>gi|448473653|ref|ZP_21601795.1| HhH-GPD family protein [Halorubrum aidingense JCM 13560]
gi|445819165|gb|EMA69014.1| HhH-GPD family protein [Halorubrum aidingense JCM 13560]
Length = 232
Score = 57.4 bits (137), Expect = 4e-05, Method: Composition-based stats.
Identities = 48/182 (26%), Positives = 76/182 (41%), Gaps = 22/182 (12%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVP 642
D+ D D+ A+ AD E+A+ I+ G + A RI+ L + E G+ L +L +
Sbjct: 69 DRYD--DFAAIEAADHDELAETIRVAGLADQKAARIQRALAAIREETGGAYSLAFLDAMA 126
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
D K +L DI+G+G K+ V VD +V R++ R G VP H
Sbjct: 127 TDAAKAWLTDIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPASASNQAAHNR 186
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG--ACPLRGEC 760
L + P + +Y LH +I G+ C R +C C +C
Sbjct: 187 LDEIVP-----------------DELIYPLHVLLIRHGREHCSARGADCDNPVCATYCDC 229
Query: 761 KH 762
++
Sbjct: 230 EY 231
>gi|451850889|gb|EMD64190.1| hypothetical protein COCSADRAFT_36763 [Cochliobolus sativus ND90Pr]
Length = 526
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 59/145 (40%), Gaps = 37/145 (25%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL--------NRLVELHGS------- 632
SVDW VR AD+ EI +AIK G ++ + IK L R EL S
Sbjct: 245 SVDWNKVRQADQKEIFEAIKSGGLADVKSKDIKRILEMVWEENQTRRKELQSSSNKAPGS 304
Query: 633 -------------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
I L+ L + D L G+G K+ CV L LQ +
Sbjct: 305 ANEAEEEKSAEIEKASQDVISLDHLHLLSNDDAFNALTKYPGIGPKTASCVLLFCLQRPS 364
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGD 698
F VD +V R+ LGWV P PGD
Sbjct: 365 FAVDTHVFRLCQWLGWV---PPPGD 386
>gi|218530896|ref|YP_002421712.1| endonuclease III FCL domain-containing protein [Methylobacterium
extorquens CM4]
gi|218523199|gb|ACK83784.1| Endonuclease III FCL domain protein [Methylobacterium extorquens
CM4]
Length = 254
Score = 57.4 bits (137), Expect = 4e-05, Method: Composition-based stats.
Identities = 51/172 (29%), Positives = 76/172 (44%), Gaps = 14/172 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW + A EI I + A RIK L + E G++ L++L ++ D+ + +
Sbjct: 86 DWTEIDAASVKEIEATIHGVTWPELKAPRIKAVLAAVQERVGALSLDFLGDMSVDEARGW 145
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L I G+G K+ V S L+ A PVD + R+A R G + + G H L + P
Sbjct: 146 LEAIPGIGPKTSAAVLSFSTLRMPALPVDSHHHRVAQRTGLIGAKVDVGPSHAVLRAQLP 205
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D W + LY+ H ++ G+ C RSP CG C L C
Sbjct: 206 --DD-----W------SAQKLYDNHEVLMLHGQRVCFHRSPACGRCVLLDIC 244
>gi|448432620|ref|ZP_21585676.1| HhH-GPD family protein [Halorubrum tebenquichense DSM 14210]
gi|445686876|gb|ELZ39180.1| HhH-GPD family protein [Halorubrum tebenquichense DSM 14210]
Length = 232
Score = 57.4 bits (137), Expect = 4e-05, Method: Composition-based stats.
Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 18/163 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ AD + + I+ G + A RI+ L + E G+ L +L + D K
Sbjct: 73 DFAAIEAADHEALRETIRVAGLPDQKAARIQRALAAIREETGGAYSLAFLDAMATDDAKA 132
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L DI+G+G K+ V VD +V R++ R G VP G H L + P
Sbjct: 133 WLTDIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESASNGRAHEVLDERVP 192
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
+ +Y LH +I G+ C R+P+C
Sbjct: 193 -----------------DELIYPLHVLLIRHGREHCSARNPDC 218
>gi|413944179|gb|AFW76828.1| hypothetical protein ZEAMMB73_573219 [Zea mays]
Length = 264
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 48/89 (53%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
W+ V + + DAI+ G A RI+ L + E G I LE+LR + D+VK+ L
Sbjct: 171 WDQVVDEEGKRLEDAIRCGGLAATKAARIRSMLRDVRERRGKICLEYLRELSVDEVKKEL 230
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVN 679
+G+G K+V CV + LQ FPVD +
Sbjct: 231 SRFKGIGPKTVACVLMFYLQKDDFPVDTH 259
>gi|312623064|ref|YP_004024677.1| hhh-gpd family protein [Caldicellulosiruptor kronotskyensis 2002]
gi|312203531|gb|ADQ46858.1| HhH-GPD family protein [Caldicellulosiruptor kronotskyensis 2002]
Length = 225
Score = 57.4 bits (137), Expect = 5e-05, Method: Composition-based stats.
Identities = 48/180 (26%), Positives = 80/180 (44%), Gaps = 17/180 (9%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
K++ + E + E ++A+ IK G N A R+KEF N L S DLE L +
Sbjct: 59 KLNILSVEGILQTPEEKLAELIKAAGYYNQKAKRLKEFCNFLKREFNS-DLEKLFALDIS 117
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+++ LL +G+G ++ + + L + F VD R+ RLG + + +
Sbjct: 118 SLRQVLLSQKGIGFETADSIILYGAEKPIFVVDSYTKRLFYRLGLIESEKI--------- 168
Query: 705 KEYPVMDKIQMY-LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
Y + I M L P+ + + E H ++ K CK + P C C LR C ++
Sbjct: 169 -SYNDLQAIIMANLTPQTKFFN-----EFHALIVKHCKEICKSKKPICNKCCLRLICNYY 222
>gi|288560832|ref|YP_003424318.1| endonuclease III Nth [Methanobrevibacter ruminantium M1]
gi|288543542|gb|ADC47426.1| endonuclease III Nth [Methanobrevibacter ruminantium M1]
Length = 215
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 75/178 (42%), Gaps = 27/178 (15%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D AV A + + IK G + A RI E L++ +G VP + +
Sbjct: 64 DIYAVADAPVEHVQELIKPAGFYRVKAARILEVSRILIDQYGG-------EVP--REMDE 114
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+L + G+G K+ CV + + Q A PVD +V RI+ R G D E +
Sbjct: 115 MLKLPGVGRKTANCVIVFAFQDAAIPVDTHVHRISNRWGI-------ADTKDPEETEQVL 167
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
M+K+ LW +L+ M+ FG+ C+ P C CP+ C + S +
Sbjct: 168 MEKVPKDLW-----------VDLNDLMVQFGQTICRPIGPQCDKCPISDLCDYDVSKL 214
>gi|414083690|ref|YP_006992398.1| endonuclease III [Carnobacterium maltaromaticum LMA28]
gi|412997274|emb|CCO11083.1| endonuclease III [Carnobacterium maltaromaticum LMA28]
Length = 214
Score = 57.4 bits (137), Expect = 5e-05, Method: Composition-based stats.
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 17/117 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K ++ L + G+G+K+ V + A VD +V R+ RLG L+ G++ L+
Sbjct: 107 KTRKELTSLPGVGIKTANVVLGDAFGVPAIAVDTHVERVTKRLGICRLKDSVGEVEATLM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ P + E H+++I FG+ C RSP C CPL EC+
Sbjct: 167 KKLP-----------------ESMWVEAHHRLIFFGRYHCTARSPKCDICPLLEECR 206
>gi|393796715|ref|ZP_10380079.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum limnia BG20]
Length = 175
Score = 57.4 bits (137), Expect = 5e-05, Method: Composition-based stats.
Identities = 38/145 (26%), Positives = 71/145 (48%), Gaps = 20/145 (13%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++E+ I+ ++ VP + E L+++ G+G K+ CV + + A PVD++V RI+
Sbjct: 51 RIIEVAKIINSKYKGKVPDNL--EKLVELPGVGRKTANCVLVYAFDKPAIPVDIHVHRIS 108
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RLG V + P + L++ P +K +++ + +G+ C
Sbjct: 109 NRLGLVDTKT-PEETEQELMRIIP-----------------KKFWIDINDTFVMYGQNIC 150
Query: 745 KKRSPNCGACPLRGECKHFASAVAS 769
K SP C C ++ +CK + + AS
Sbjct: 151 KPISPMCSVCKIKNDCKFYKTKNAS 175
>gi|344341598|ref|ZP_08772516.1| HhH-GPD family protein [Thiocapsa marina 5811]
gi|343798530|gb|EGV16486.1| HhH-GPD family protein [Thiocapsa marina 5811]
Length = 242
Score = 57.4 bits (137), Expect = 5e-05, Method: Composition-based stats.
Identities = 55/188 (29%), Positives = 87/188 (46%), Gaps = 24/188 (12%)
Query: 577 TGEERSHDKM---DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSI 633
T ER+ +++ S+D E++ D E+ADA++ G N+ + R++ F +E G
Sbjct: 44 TNVERALERLRLRSSLDAESILALDPPELADALRPSGYFNVKSQRVRGFCAAYLEAGGHP 103
Query: 634 DLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ 693
L L PD ++ LL I+G+G ++ + + L + F VD RI RLG
Sbjct: 104 GLAAL--ATPD-LRARLLAIKGIGPETADDILLYAFDRPVFVVDAYTRRIFERLGL---- 156
Query: 694 PLPGDLHIHLLKEYPVMDK-IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
L G + Y V+ + + L P D L E H +++ GK C+ R P C
Sbjct: 157 -LAGG------ETYEVIRRGFEQALGP-----DVPMLKEYHALIVSQGKEVCRTR-PACE 203
Query: 753 ACPLRGEC 760
C LR C
Sbjct: 204 RCALRRTC 211
>gi|78043065|ref|YP_359967.1| endonuclease III [Carboxydothermus hydrogenoformans Z-2901]
gi|77995180|gb|ABB14079.1| endonuclease III [Carboxydothermus hydrogenoformans Z-2901]
Length = 210
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 76/168 (45%), Gaps = 37/168 (22%)
Query: 599 ESEIADAIKERG-----QQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDI 653
E E++ I+ G +N+I +I E L+R E HG + PD E LL +
Sbjct: 69 EEELSRQIRSLGLYRNKARNLI--KIAEILDR--EYHGQV---------PDSFAE-LLKL 114
Query: 654 EGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKI 713
G+G K+ E + + +FPVD +V R+A RLG + P + L K +P
Sbjct: 115 PGVGPKTAEVIVGVGFNKPSFPVDTHVFRVARRLGLSKART-PEGVSFDLKKIFP----- 168
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ +LH+++I FG+ CK + P+C CP C+
Sbjct: 169 ------------PNSWIDLHHRLIFFGRRICKAQKPSCNICPFPEFCQ 204
>gi|341583175|ref|YP_004763667.1| endonuclease III [Thermococcus sp. 4557]
gi|340810833|gb|AEK73990.1| endonuclease III [Thermococcus sp. 4557]
Length = 238
Score = 57.0 bits (136), Expect = 5e-05, Method: Composition-based stats.
Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 18/120 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD +KE L+ + G+G K V A PVD +V RI+ RLG P + P
Sbjct: 126 PDDIKE-LMKLPGIGRKCANIVLAYGFGRQAIPVDTHVNRISKRLGLAPPRVQP------ 178
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+K++ YL + Y +K +Y +++ M+ GK C+ P C CPLR C +
Sbjct: 179 --------EKVEEYLAELIPY--EKWIY-VNHAMVDHGKSICRPIGPKCDECPLRELCPY 227
>gi|336121084|ref|YP_004575859.1| HhH-GPD family protein [Methanothermococcus okinawensis IH1]
gi|334855605|gb|AEH06081.1| HhH-GPD family protein [Methanothermococcus okinawensis IH1]
Length = 217
Score = 57.0 bits (136), Expect = 5e-05, Method: Composition-based stats.
Identities = 44/175 (25%), Positives = 81/175 (46%), Gaps = 15/175 (8%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D +D + + + ++ + IK G NI A R+K +V+ + + D + +
Sbjct: 51 DLIDEKRILKINTEKLKELIKPAGFYNIKAERLKNITKYIVDNYKTTDELAKTEKDTNIL 110
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+ LL+I+G+G ++ + + L +L F VD RI R G + + + D I L+ E
Sbjct: 111 RNELLNIKGVGKETADSILLYALDRKIFVVDAYTRRIFSRYGIIN-KDMDYD-EIRLIFE 168
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++ + +Y E H ++ GK +CKK++P C CPL C+
Sbjct: 169 DNIIPSLNIY-------------KEYHALIVELGKNYCKKKNPLCNTCPLNLYCR 210
>gi|172057780|ref|YP_001814240.1| endonuclease III [Exiguobacterium sibiricum 255-15]
gi|171990301|gb|ACB61223.1| endonuclease III [Exiguobacterium sibiricum 255-15]
Length = 222
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
EI + IK G A IK +L+ LHG VP D+ L + G+G K+
Sbjct: 72 EIEEKIKRLGLYRNKAKNIKALAEQLLVLHGG-------EVPTDRAG--LEALPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ AF VD +V R++ RLG + D + + E +M + + W +
Sbjct: 123 ANVVLSVAFDVPAFAVDTHVERVSKRLGICRWK----DNVMQV--EQTLMKRFKRERWSK 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L H+Q I FG+ CK + PNC CPL C+
Sbjct: 177 L-----------HHQFIFFGRYHCKAQRPNCLECPLLDMCR 206
>gi|15643134|ref|NP_228177.1| endonuclease III [Thermotoga maritima MSB8]
gi|403252815|ref|ZP_10919120.1| endonuclease III [Thermotoga sp. EMP]
gi|418046300|ref|ZP_12684394.1| endonuclease III [Thermotoga maritima MSB8]
gi|8134433|sp|Q9WYK0.1|END3_THEMA RecName: Full=Endonuclease III; AltName: Full=DNA-(apurinic or
apyrimidinic site) lyase
gi|4980869|gb|AAD35453.1|AE001716_16 endonuclease III [Thermotoga maritima MSB8]
gi|351675853|gb|EHA59013.1| endonuclease III [Thermotoga maritima MSB8]
gi|402811577|gb|EJX26061.1| endonuclease III [Thermotoga sp. EMP]
Length = 213
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 27/167 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A ++ D IKE G A RI E LVE +G PD ++E LL + G+
Sbjct: 60 AKPEDLYDLIKESGMYRQKAERIVEISRILVEKYGG--------RVPDSLEE-LLKLPGV 110
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V + + A VD +V RI+ RLGWV + P + L K P
Sbjct: 111 GRKTANIVLWVGFKKPALAVDTHVHRISNRLGWVKTRT-PEETEEALKKLLP------ED 163
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
LW ++ M+ FG+ CK ++P C C L+ C+ +
Sbjct: 164 LWG-----------PINGSMVEFGRRICKPQNPLCEECFLKNHCEFY 199
>gi|452209864|ref|YP_007489978.1| Endonuclease III [Methanosarcina mazei Tuc01]
gi|452099766|gb|AGF96706.1| Endonuclease III [Methanosarcina mazei Tuc01]
Length = 205
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 74/165 (44%), Gaps = 28/165 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +I + I++ G + AGRIKE L+E + VP D E LL + G+
Sbjct: 63 ADVEDIEELIRDVGFYRVKAGRIKEISRILLEDYNG-------RVPDDM--ETLLKLPGV 113
Query: 657 GLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
G K+ CV + L+ A VD +V RI+ RLG V + P + + L K P
Sbjct: 114 GRKTANCVLAHAFLKEDALAVDTHVHRISNRLGLVVTKN-PEETEMELKKLLP------- 165
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
QK ++ ++ FG+ C+ SP CG C L C
Sbjct: 166 ----------QKYWRHVNILLVKFGQNVCRPISPRCGICVLNDIC 200
>gi|403667443|ref|ZP_10932749.1| endonuclease III [Kurthia sp. JC8E]
Length = 219
Score = 57.0 bits (136), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A I++ RL++++G +P V+E L+ + G+G K+
Sbjct: 72 ELEQDIRSIGLFRSKAKNIQKLCQRLLDVYGG-------EIPA--VREELVTLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V RIA RLG + ++ E +M K W +
Sbjct: 123 ANVVLSVAFGEPAMAVDTHVERIAKRLGLCRWKDNVLEV------EQTIMKKTPRERWSK 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+QMI FG+ CK + P CG CPL C+
Sbjct: 177 A-----------HHQMIFFGRYHCKSQKPGCGECPLFTLCR 206
>gi|383319923|ref|YP_005380764.1| DNA-3-methyladenine glycosylase III [Methanocella conradii HZ254]
gi|379321293|gb|AFD00246.1| DNA-3-methyladenine glycosylase III [Methanocella conradii HZ254]
Length = 240
Score = 57.0 bits (136), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 76/162 (46%), Gaps = 20/162 (12%)
Query: 596 CADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEG 655
C+ E +IA IK N A +IK F+ + +G DL + + +++ LL ++G
Sbjct: 77 CSVE-DIAPLIKPARFYNSKAVKIKNFMEFFFKEYGG-DLAAMSSEDSASLRKKLLAVKG 134
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
LG ++V+C+ L + F VD RI +R G L GD P D+IQ
Sbjct: 135 LGKETVDCILLYACGKPVFVVDAYTKRIFLRYG-----ILNGD---------PTYDEIQG 180
Query: 716 YLWPRLCYLDQKTLY-ELHYQMITFGKIFCKKRSPNCGACPL 756
Y L LY + H Q++ G C+ + P CG+CP+
Sbjct: 181 YFMASL--PPNAALYNDYHAQIVHLGSSICRPK-PLCGSCPI 219
>gi|222479474|ref|YP_002565711.1| HhH-GPD family protein [Halorubrum lacusprofundi ATCC 49239]
gi|222452376|gb|ACM56641.1| HhH-GPD family protein [Halorubrum lacusprofundi ATCC 49239]
Length = 233
Score = 56.6 bits (135), Expect = 7e-05, Method: Composition-based stats.
Identities = 46/169 (27%), Positives = 72/169 (42%), Gaps = 20/169 (11%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVP 642
D+ D D+ A+ AD E+ + I+ G + A RI+ L + E G+ L +L +
Sbjct: 69 DRYD--DFAAIEAADHEELKETIRVAGLADQKAARIQRALAAIREETGGAYSLAFLDAMA 126
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
D KE+L++I+G+G K+ V VD +V R++ R G VP H
Sbjct: 127 TDDAKEWLMEIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESASNQAAHDA 186
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
L K P + +Y LH +I G+ C R +C
Sbjct: 187 LDKLVP-----------------DELIYPLHVLLIRHGRERCSARGADC 218
>gi|338731336|ref|YP_004660728.1| endonuclease III [Thermotoga thermarum DSM 5069]
gi|335365687|gb|AEH51632.1| endonuclease III [Thermotoga thermarum DSM 5069]
Length = 208
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 72/176 (40%), Gaps = 27/176 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E++ A EI D IK G A RI E ++E + VP D + LL
Sbjct: 59 ESLAKAKPEEIYDLIKPSGMYRQKAERIIEVSKIILEKYNG-------KVPSDL--DELL 109
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V A VD +V RI+ RLG+V + P L K P
Sbjct: 110 KLPGVGRKTANIVLFQGFSIPAIAVDTHVHRISNRLGFVKTK-TPEQTEEELSKVLP--- 165
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
LW ++ M+ FG+ C R+P C CP ECK+ SAV
Sbjct: 166 ---KRLWG-----------PINVAMVNFGRNICLPRNPRCEKCPFSKECKYHNSAV 207
>gi|448574894|ref|ZP_21641417.1| endonuclease III [Haloferax larsenii JCM 13917]
gi|445732573|gb|ELZ84155.1| endonuclease III [Haloferax larsenii JCM 13917]
Length = 267
Score = 56.6 bits (135), Expect = 7e-05, Method: Composition-based stats.
Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 22/130 (16%)
Query: 636 EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQP 694
+W+R P+ V+E LL+++G+G K+ +CV L + + FPVD +V RI+ R+G P
Sbjct: 135 QWVREGDPETVRERLLEMKGVGPKTADCVLLFAGGRDGVFPVDTHVHRISRRIGLAPADA 194
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC--- 751
+ L ++ P C H MI FG+ +C R P C
Sbjct: 195 DHEGVRERLERDVP----------GEACGFG-------HTAMIQFGREYCTARKPACLDG 237
Query: 752 -GACPLRGEC 760
ACP+ C
Sbjct: 238 PEACPMDDVC 247
>gi|386857015|ref|YP_006261192.1| Endonuclease III [Deinococcus gobiensis I-0]
gi|380000544|gb|AFD25734.1| Endonuclease III, putative [Deinococcus gobiensis I-0]
Length = 213
Score = 56.6 bits (135), Expect = 7e-05, Method: Composition-based stats.
Identities = 52/224 (23%), Positives = 86/224 (38%), Gaps = 27/224 (12%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIA 616
++N+ + W+ L Y E D D ++ R G I A
Sbjct: 8 QQNTRAVATRQWEALTLTYPVWEAALLDGPDGIETTLRRAGG-----------GLTRIKA 56
Query: 617 GRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPV 676
+ L L G + L +L ++ ++ + L + G+G ++ V L L A PV
Sbjct: 57 DYLYGILAALEGSRGELSLRFLHDLGDEEARAVLEGLPGVGQRTASLVLLFDLVRPAMPV 116
Query: 677 DVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQM 736
D N+ R+A RL VP E ++ + + + ++ D +T Y LH
Sbjct: 117 DTNIARMAARLDLVP--------------ETWSTNRTEAW-FGQVIARDWETRYALHLSG 161
Query: 737 ITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEK 780
+ G C R P CG C LR C A+ P P+++
Sbjct: 162 VRHGHETCTPRRPLCGRCVLRDLCPS-AALFLEGEVPEPSPTKR 204
>gi|332299912|ref|YP_004441833.1| endonuclease III [Porphyromonas asaccharolytica DSM 20707]
gi|332176975|gb|AEE12665.1| endonuclease III [Porphyromonas asaccharolytica DSM 20707]
Length = 219
Score = 56.6 bits (135), Expect = 7e-05, Method: Composition-based stats.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 18/121 (14%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL-PGDLHIHLLK 705
+E L + G+G KS + + + A PVD +V R+A R+G + P + L+K
Sbjct: 109 REELEALPGVGRKSASVMLAVCFETPAMPVDTHVYRVAKRIGLASSRATTPLAVEQALVK 168
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
P Q L H+Q+I G+ CK R P C C L C+H+A
Sbjct: 169 RIP-----------------QAQLIRAHHQLILLGRYICKARKPLCAECTLHACCRHYAV 211
Query: 766 A 766
A
Sbjct: 212 A 212
>gi|320334742|ref|YP_004171453.1| iron-sulfur cluster loop [Deinococcus maricopensis DSM 21211]
gi|319756031|gb|ADV67788.1| iron-sulfur cluster loop [Deinococcus maricopensis DSM 21211]
Length = 247
Score = 56.6 bits (135), Expect = 7e-05, Method: Composition-based stats.
Identities = 55/213 (25%), Positives = 92/213 (43%), Gaps = 27/213 (12%)
Query: 548 NDRTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIK 607
+D + ++N+ T ++ + LRR Y T + D D + E+ + DA
Sbjct: 52 DDLIEAVLNQQNTRATTERQYAALRRAYPTWDAALADGPDGI---------EAVLRDA-- 100
Query: 608 ERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLL 667
G + A + L L+E G + L+ LR++ + L + G+G+K+ + L
Sbjct: 101 GGGLARVKADYVWNILYALLE-RGDLSLQHLRHLDDADARTALESLPGVGMKTASALLLF 159
Query: 668 SLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQK 727
L A PVD ++ R++ RL +P + ++LK D++ PR D
Sbjct: 160 DLARPAMPVDGHIDRVSKRLHLIPER-------WNVLKAERWYDEV----LPR----DWA 204
Query: 728 TLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
Y H I G+ C R+P C AC LR C
Sbjct: 205 QRYAYHVATIRHGRETCLTRAPRCNACVLRDLC 237
>gi|335430781|ref|ZP_08557667.1| endonuclease III [Haloplasma contractile SSD-17B]
gi|334887321|gb|EGM25653.1| endonuclease III [Haloplasma contractile SSD-17B]
Length = 210
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 18/121 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD +E L + G+G K+ V + AF VD +V R++ RLG P +
Sbjct: 105 PD-TREELEKLPGVGRKTANVVLSVCFDQPAFAVDTHVARVSKRLGIANENDTPLKIEKK 163
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L+ +P ++ LH+QMI FG+ C R PNC C L+ C
Sbjct: 164 LIDVFP-----------------EEKWCALHHQMIFFGRYHCTARKPNCDECKLKDICTE 206
Query: 763 F 763
F
Sbjct: 207 F 207
>gi|431931186|ref|YP_007244232.1| endonuclease III-like protein [Thioflavicoccus mobilis 8321]
gi|431829489|gb|AGA90602.1| putative endonuclease III-like protein [Thioflavicoccus mobilis
8321]
Length = 215
Score = 56.6 bits (135), Expect = 8e-05, Method: Composition-based stats.
Identities = 48/169 (28%), Positives = 74/169 (43%), Gaps = 19/169 (11%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ D +ADAI+ G N+ A R++ F + VE G L L ++ ++E LL
Sbjct: 62 EAILALDHETLADAIRPAGYFNVKARRLRAFCHAFVEEGG---LAGLSSLDTKALRERLL 118
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I G+G ++ + + L + + F VD R+ RLG L GD ++ D
Sbjct: 119 AINGVGPETADDMLLYAFERPVFVVDAYTRRLFARLGL-----LDGDEGYETIR-----D 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ L P D + E H ++ K C+ R P C C L C
Sbjct: 169 AFERALGP-----DTECFNEYHALIVRHAKEVCRTR-PRCAGCCLGELC 211
>gi|448352659|ref|ZP_21541440.1| HhH-GPD family protein [Natrialba hulunbeirensis JCM 10989]
gi|445641938|gb|ELY95009.1| HhH-GPD family protein [Natrialba hulunbeirensis JCM 10989]
Length = 278
Score = 56.6 bits (135), Expect = 8e-05, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 77/172 (44%), Gaps = 24/172 (13%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEG 655
++S++A+ I G N +G + + + + GS ++ + PD V+E LLD+ G
Sbjct: 106 EQSQLAETISSAGLYNQKSGMLIDAAEWVCDEFGSAAEFDRFVTDEAPDTVRETLLDVRG 165
Query: 656 LGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
+G K+ +CV L + ++ FPVD +V RI R+G P + ++ L E P
Sbjct: 166 VGPKTADCVLLFAGGRNGVFPVDTHVHRIYRRMGIAPPEADHEEVRAVLEAEVPAAK--- 222
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKH 762
C H I FG+ FC R P C ACP+ C+
Sbjct: 223 -------CGFG-------HTATIQFGREFCTARKPACLEDPEACPMADLCEQ 260
>gi|335420680|ref|ZP_08551717.1| HhH-GPD family protein [Salinisphaera shabanensis E1L3A]
gi|334894416|gb|EGM32612.1| HhH-GPD family protein [Salinisphaera shabanensis E1L3A]
Length = 230
Score = 56.6 bits (135), Expect = 9e-05, Method: Composition-based stats.
Identities = 46/165 (27%), Positives = 66/165 (40%), Gaps = 19/165 (11%)
Query: 606 IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
I+ G NI A R++ VE G E L+ D ++ LL + G+G ++ + +
Sbjct: 83 IRRAGPHNIKAKRLRALCAWFVEQGG---FEGLQTRDSDTLRRELLALHGIGHETADVIL 139
Query: 666 LLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLD 725
L + Q AF D RI R GW + Y M + + P D
Sbjct: 140 LYAFQRPAFVADAYAFRIMERYGWX-----------RGTRNYERMRRSIEAVGPE----D 184
Query: 726 QKTLY-ELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
Y ELH ++ K C KR P+C C L C H VA+
Sbjct: 185 DAPFYDELHALIVAHAKQRCHKREPDCACCVLNDCCDHGRGWVAA 229
>gi|167645377|ref|YP_001683040.1| HhH-GPD family protein [Caulobacter sp. K31]
gi|167347807|gb|ABZ70542.1| HhH-GPD family protein [Caulobacter sp. K31]
Length = 242
Score = 56.6 bits (135), Expect = 9e-05, Method: Composition-based stats.
Identities = 48/173 (27%), Positives = 75/173 (43%), Gaps = 17/173 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHG--SIDLEWLRNVPPDKVKE 648
WE + +E+ D IK+ A + L RL+++ + L+ L + D +
Sbjct: 66 WEDLAETPAAEVQDLIKDVTFPEEKARHLPHAL-RLIQVRSGWKLSLDHLAELELDSARW 124
Query: 649 YLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
+L + G+G+K V S L A VD +V R+A R+G VP + L+
Sbjct: 125 WLQGLPGVGVKVAASVLNFSPLNMRALVVDTHVHRVASRMGLVPASYDTAHAYRALMDLV 184
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P W + LYELH+ M G++ C +P CGAC L+ C
Sbjct: 185 PDS-------W------TAEDLYELHWLMKGLGQLLCSHHAPRCGACALKATC 224
>gi|78356480|ref|YP_387929.1| HhH-GPD family protein [Desulfovibrio alaskensis G20]
gi|78218885|gb|ABB38234.1| HhH-GPD family protein [Desulfovibrio alaskensis G20]
Length = 221
Score = 56.2 bits (134), Expect = 9e-05, Method: Composition-based stats.
Identities = 41/138 (29%), Positives = 63/138 (45%), Gaps = 18/138 (13%)
Query: 623 LNRLVEL---HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVN 679
L RLV+ D+ L D ++ LLDI+G+G ++ + + L +L +F VD
Sbjct: 90 LRRLVQFLYEEADGDIRNLAGTSMDSLRARLLDIKGIGPETADSILLYALDMPSFVVDAY 149
Query: 680 VGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITF 739
RI VR G LP D+ ++E+ MD + D E H ++
Sbjct: 150 TRRICVRHGL-----LPEDVQYAEMREF-FMDVLPA---------DVSVYNEYHALLVRV 194
Query: 740 GKIFCKKRSPNCGACPLR 757
K +C+K P C +CPLR
Sbjct: 195 AKEWCRKTHPRCDSCPLR 212
>gi|340975657|gb|EGS22772.1| hypothetical protein CTHT_0012470 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 628
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 61/139 (43%), Gaps = 24/139 (17%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-------VEL----------- 629
S+DW+ VR E+A+ IK G A IK+ L+ + +E
Sbjct: 376 SIDWDKVRLGSHRELAETIKIAGNGPKKASHIKQILDMVYAENLEHIEAQTVDKGSEPGG 435
Query: 630 -HGS-----IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
HG+ + L+++ + D+ L+ G+G+K+ CV L L+ F VD +V +
Sbjct: 436 KHGTDRQDLLSLDYMHRMTKDEAMAKLVTYPGVGIKTAACVTLFCLRLPCFAVDTHVHKF 495
Query: 684 AVRLGWVPLQPLPGDLHIH 702
LGWVP P + H
Sbjct: 496 CRWLGWVPPNADPDNCFRH 514
>gi|294508391|ref|YP_003572449.1| endonuclease III [Salinibacter ruber M8]
gi|294344720|emb|CBH25498.1| Endonuclease III [Salinibacter ruber M8]
Length = 386
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 57/123 (46%), Gaps = 19/123 (15%)
Query: 645 KVKEYLLDIE---GLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
KV E + D+E G+G K+ V ++ A PVD +V R+A R+G
Sbjct: 208 KVPETIDDLETLTGVGRKTARVVAQVAHDADALPVDTHVFRVANRIG------------- 254
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+KE K R+ + + E H+ +I G+ C RSP+C CP+ ECK
Sbjct: 255 -LVKEDATTPKKVEQQLKRV--IPKAEWGEAHHLLILHGRYTCTARSPDCHDCPIHEECK 311
Query: 762 HFA 764
H+
Sbjct: 312 HYG 314
>gi|399069064|ref|ZP_10749265.1| putative endoIII-related endonuclease [Caulobacter sp. AP07]
gi|398045317|gb|EJL38052.1| putative endoIII-related endonuclease [Caulobacter sp. AP07]
Length = 242
Score = 56.2 bits (134), Expect = 9e-05, Method: Composition-based stats.
Identities = 48/173 (27%), Positives = 76/173 (43%), Gaps = 17/173 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHG--SIDLEWLRNVPPDKVKE 648
WE + A ++ + + IKE A + L RL+++ + L+ L + D +
Sbjct: 66 WEDLAEAPDAVVLELIKEVSYPEDKARHLPHAL-RLIQVRSGWKLSLDHLSELELDSARW 124
Query: 649 YLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
+L + G+G+K V S L A VD +V R+A RLG +P + L+
Sbjct: 125 WLQGLPGVGVKVAASVLNFSPLNMRALVVDTHVHRVAGRLGLIPASYDTAHAYRALMDLV 184
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P W + LYELH+ M G++ C +P CGAC L+ C
Sbjct: 185 PDS-------W------TAEDLYELHWLMKGLGQLLCGHHAPRCGACSLKATC 224
>gi|319790406|ref|YP_004152039.1| HhH-GPD family protein [Thermovibrio ammonificans HB-1]
gi|317114908|gb|ADU97398.1| HhH-GPD family protein [Thermovibrio ammonificans HB-1]
Length = 221
Score = 56.2 bits (134), Expect = 9e-05, Method: Composition-based stats.
Identities = 51/170 (30%), Positives = 81/170 (47%), Gaps = 17/170 (10%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ DE E+ + IK G A +KE L+R V G I E L+ P ++ LL
Sbjct: 62 EALSSTDEGELQNLIKPAGFFRQKARYLKE-LSRFVVREGGI--EGLKAQPLKVLRPKLL 118
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK-EYPVM 710
+++G+G ++ + + L +L +F VD R+ RLG + + + + L++ E P
Sbjct: 119 NVKGIGPETADSILLYALDKPSFVVDKYTKRLLYRLGVLEGESVSYNRVKTLVEGEIPPT 178
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
++ K ELH ++ K +CK R PNC CPLR C
Sbjct: 179 EE------------HLKEYKELHALIVELCKRYCKTR-PNCRECPLRELC 215
>gi|148269689|ref|YP_001244149.1| endonuclease III [Thermotoga petrophila RKU-1]
gi|170288365|ref|YP_001738603.1| endonuclease III [Thermotoga sp. RQ2]
gi|281411600|ref|YP_003345679.1| endonuclease III [Thermotoga naphthophila RKU-10]
gi|147735233|gb|ABQ46573.1| endonuclease III [Thermotoga petrophila RKU-1]
gi|170175868|gb|ACB08920.1| endonuclease III [Thermotoga sp. RQ2]
gi|281372703|gb|ADA66265.1| endonuclease III [Thermotoga naphthophila RKU-10]
Length = 213
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 73/167 (43%), Gaps = 27/167 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A ++ + IKE G A RI E LVE +G PD ++E LL + G+
Sbjct: 60 AKPEDLYNLIKESGMYRQKAARIVEISRILVERYGG--------RVPDSLEE-LLKLPGV 110
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V + + A VD +V RI+ RLGWV + P + L K P
Sbjct: 111 GRKTANIVLWVGFRKPALAVDTHVHRISNRLGWVKTRT-PEETEEALKKLLP------ED 163
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
LW ++ M+ FG+ CK ++P C C L+ C+ +
Sbjct: 164 LWG-----------PINGSMVEFGRRICKPQNPLCEECFLKNHCEFY 199
>gi|392588793|gb|EIW78124.1| DNA glycosylase, partial [Coniophora puteana RWD-64-598 SS2]
Length = 276
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 69/165 (41%), Gaps = 21/165 (12%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELH--GSIDLEWLR--NVPPDKVKEYLLD 652
A +++ +AIK G N A I+ L + E H G DL+ L +V ++V L+
Sbjct: 114 APAADVVEAIKHGGLANRKARIIQNLLVSVKEAHPEGRYDLQHLLAGDVSDEEVMRSLVQ 173
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
G+G K+ CV + AF VD +V RI LGWVP HL P
Sbjct: 174 YNGVGPKTAACVLAFCMGRDAFAVDTHVFRITRMLGWVPRHADRVSTQAHLELRVP---- 229
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
P L Y LH + G+ ++ + G CPL+
Sbjct: 230 ------PEL-------KYGLHVMFVKHGRACKGCKNGSRGECPLK 261
>gi|336252280|ref|YP_004595387.1| HhH-GPD family protein [Halopiger xanaduensis SH-6]
gi|335336269|gb|AEH35508.1| HhH-GPD family protein [Halopiger xanaduensis SH-6]
Length = 274
Score = 56.2 bits (134), Expect = 9e-05, Method: Composition-based stats.
Identities = 50/178 (28%), Positives = 83/178 (46%), Gaps = 24/178 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEY 649
E++ A++S +A+ I+ G N + I + ++E GS E++++ P +V+E
Sbjct: 91 ESLAGAEQSILAETIQPAGLYNQKSEIIIDTAEWVLEAFGSAAAFDEFVKDEDPAEVRET 150
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL++ G+G K+ +CV L + + FPVD +V RI RLG P + L +E P
Sbjct: 151 LLEVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRLGIAPADADHEAVRAVLEREVP 210
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKH 762
+K + H I FG+ +C R P C ACP+ C+
Sbjct: 211 A----------------EKCGFG-HTASIQFGREYCSARKPACLEDPEACPMADLCEQ 251
>gi|289580004|ref|YP_003478470.1| HhH-GPD family protein [Natrialba magadii ATCC 43099]
gi|448281243|ref|ZP_21472550.1| HhH-GPD family protein [Natrialba magadii ATCC 43099]
gi|289529557|gb|ADD03908.1| HhH-GPD family protein [Natrialba magadii ATCC 43099]
gi|445579173|gb|ELY33569.1| HhH-GPD family protein [Natrialba magadii ATCC 43099]
Length = 278
Score = 56.2 bits (134), Expect = 1e-04, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 76/172 (44%), Gaps = 24/172 (13%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIEG 655
++S++A+ I G N + + + + + GS D ++ + PD V+E LLD+ G
Sbjct: 106 EQSQLAETISSAGLYNQKSAMLIDAAEWVCDEFGSADEFDRFVTDETPDTVRETLLDVRG 165
Query: 656 LGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
+G K+ +CV L + + FPVD +V RI R+G P + ++ L E P
Sbjct: 166 VGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAPPEADHEEVRTVLEAEVPAAK--- 222
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKH 762
C H I FG+ FC R P C ACP+ C+
Sbjct: 223 -------CGFG-------HTATIQFGREFCTARKPACLEDPEACPMADICEQ 260
>gi|322708253|gb|EFY99830.1| base excision DNA repair protein [Metarhizium anisopliae ARSEF 23]
Length = 364
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 65/169 (38%), Gaps = 19/169 (11%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE + ++ + +I+ G + I L + +G L+ L + + +
Sbjct: 185 WEEIANGGQARLEKSIQTGGLAATKSKVIIGILQQTKAKYGLYSLDHLFEASDEDAMKEM 244
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ +G+G K+ CV L LQ +F VD +V RI LGW P + HL P
Sbjct: 245 ISFQGVGPKTASCVLLFCLQRPSFAVDTHVHRITGLLGWRPAAAGREETQAHLDAVVPDE 304
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF--CKKRSPNCGACPLR 757
+K Y LH +T G+ CK N C LR
Sbjct: 305 EK-----------------YPLHVLFVTHGRQCEECKAGGSNAKTCELR 336
>gi|323339826|ref|ZP_08080095.1| endonuclease III [Lactobacillus ruminis ATCC 25644]
gi|417974103|ref|ZP_12614927.1| endonuclease III [Lactobacillus ruminis ATCC 25644]
gi|323092699|gb|EFZ35302.1| endonuclease III [Lactobacillus ruminis ATCC 25644]
gi|346329558|gb|EGX97853.1| endonuclease III [Lactobacillus ruminis ATCC 25644]
Length = 213
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 58/121 (47%), Gaps = 21/121 (17%)
Query: 645 KVKEYLLDIEGLGLKS--VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
K +E L+++ G+G K+ VE + AF VD +V R+A RL V +P L I
Sbjct: 107 KTREELMELTGVGRKTANVELAECFGIP--AFAVDTHVSRVANRLALV--EPTKNVLEI- 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
E +M+++ W E H+ +I +G+ C R P C CPL ECK+
Sbjct: 162 ---ERQLMEQVDESRW-----------IEAHHLLIAWGRHQCLARKPKCETCPLSFECKY 207
Query: 763 F 763
F
Sbjct: 208 F 208
>gi|16126511|ref|NP_421075.1| endonuclease III family protein [Caulobacter crescentus CB15]
gi|221235291|ref|YP_002517728.1| endonuclease III [Caulobacter crescentus NA1000]
gi|13423785|gb|AAK24243.1| endonuclease III family protein [Caulobacter crescentus CB15]
gi|220964464|gb|ACL95820.1| endonuclease III [Caulobacter crescentus NA1000]
Length = 241
Score = 56.2 bits (134), Expect = 1e-04, Method: Composition-based stats.
Identities = 49/172 (28%), Positives = 75/172 (43%), Gaps = 16/172 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE + A + +A AI++ A R+ L + E G + L L+ + D+ + L
Sbjct: 66 WEDLAEAKTARVAMAIQDVTFPEDKARRLITALRMIEEKVGWLSLSHLKTLEVDQARWEL 125
Query: 651 LDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+ G+G+K CV S L A VD +V R+A R+G V H
Sbjct: 126 QALPGVGVKVAACVLNFSDLAMRALVVDTHVDRVARRIGLVGSGDTTNTYHT-------- 177
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMIT-FGKIFCKKRSPNCGACPLRGEC 760
M + P D L+ELH+ M G++ C +P CGACP++ C
Sbjct: 178 ----LMAMAPASWTADD--LFELHWLMKRGLGQMLCGAEAPKCGACPVKQMC 223
>gi|422327448|ref|ZP_16408475.1| endonuclease III [Erysipelotrichaceae bacterium 6_1_45]
gi|371663288|gb|EHO28478.1| endonuclease III [Erysipelotrichaceae bacterium 6_1_45]
Length = 215
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 56/127 (44%), Gaps = 18/127 (14%)
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
LE V P+ +KE L + G+G K+ VR + + VD +V RI+ RLG +Q
Sbjct: 94 LESFNGVVPESMKE-LTSLAGVGRKTANVVRSVCFDIPSIAVDTHVERISKRLGLAKVQD 152
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
+ E + K++ W R H+ I FG+ FC R+P C C
Sbjct: 153 SVE------VVEQKLKRKLKRERWNRA-----------HHLFIFFGRYFCTARNPKCEEC 195
Query: 755 PLRGECK 761
P + CK
Sbjct: 196 PFKEFCK 202
>gi|313901141|ref|ZP_07834629.1| endonuclease III [Clostridium sp. HGF2]
gi|373121433|ref|ZP_09535301.1| endonuclease III [Erysipelotrichaceae bacterium 21_3]
gi|312954099|gb|EFR35779.1| endonuclease III [Clostridium sp. HGF2]
gi|371665451|gb|EHO30616.1| endonuclease III [Erysipelotrichaceae bacterium 21_3]
Length = 215
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 56/127 (44%), Gaps = 18/127 (14%)
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
LE V P+ +KE L + G+G K+ VR + + VD +V RI+ RLG +Q
Sbjct: 94 LESFNGVVPESMKE-LTSLAGVGRKTANVVRSVCFDIPSIAVDTHVERISKRLGLAKVQD 152
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
+ E + K++ W R H+ I FG+ FC R+P C C
Sbjct: 153 SVE------VVEQKLKRKLKRERWNRA-----------HHLFIFFGRYFCTARNPKCEEC 195
Query: 755 PLRGECK 761
P + CK
Sbjct: 196 PFKEFCK 202
>gi|392561532|gb|EIW54713.1| DNA glycosylase [Trametes versicolor FP-101664 SS1]
Length = 307
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 56/116 (48%), Gaps = 14/116 (12%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNV-------- 641
++ A+ A +++ +AIK G N AG I++ L + HG+ L+ L
Sbjct: 103 NFAAIADAKTADVVEAIKTGGLANKKAGVIQKILAEVHRRHGAYSLQHLAGADVKGKGKA 162
Query: 642 ------PPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
+ + L+ +G+G K+ CV L L +FPVD +V R++ LGWVP
Sbjct: 163 AGGAPASDEAAMQELVSYDGVGPKTASCVLLFCLGRSSFPVDTHVFRLSRLLGWVP 218
>gi|448456097|ref|ZP_21594950.1| HhH-GPD family protein [Halorubrum lipolyticum DSM 21995]
gi|445812932|gb|EMA62918.1| HhH-GPD family protein [Halorubrum lipolyticum DSM 21995]
Length = 233
Score = 56.2 bits (134), Expect = 1e-04, Method: Composition-based stats.
Identities = 47/177 (26%), Positives = 73/177 (41%), Gaps = 20/177 (11%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKEY 649
+EA+ AD E+A+ I+ G + A RI+ L + E G+ L +L + D KE+
Sbjct: 74 FEAIEAADHDELAETIRVAGLADQKAARIQRALAAIREETGGAYSLAFLDAMATDDAKEW 133
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L I+G+G K+ V VD +V R++ R G VP H L + P
Sbjct: 134 LTAIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESASNRAAHDALDELVP- 192
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG--ACPLRGECKHFA 764
+ +Y LH +I G+ C R +C C +C+ A
Sbjct: 193 ----------------DELIYPLHVLLIRHGRTHCSARGADCDNPVCEAHCDCEFCA 233
>gi|335357361|ref|ZP_08549231.1| endonuclease III [Lactobacillus animalis KCTC 3501]
Length = 212
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 59/126 (46%), Gaps = 18/126 (14%)
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
+E V P KE L+ + G+G K+ + V AF VD +V R+A RLG V
Sbjct: 98 VERFNGVVPKTRKE-LMSLSGVGRKTADVVLAECFGIPAFAVDTHVSRVAKRLGMV---- 152
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
P D + L E +M K+ LW + H++MI +G+ C R+P C +C
Sbjct: 153 -PADSDV-LTIEKILMQKVPQDLW-----------IKGHHRMIFWGRYQCMARNPKCSSC 199
Query: 755 PLRGEC 760
PL C
Sbjct: 200 PLLDIC 205
>gi|330934949|ref|XP_003304766.1| hypothetical protein PTT_17442 [Pyrenophora teres f. teres 0-1]
gi|311318473|gb|EFQ87134.1| hypothetical protein PTT_17442 [Pyrenophora teres f. teres 0-1]
Length = 527
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 61/145 (42%), Gaps = 37/145 (25%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH---------------GS 632
SVDW VR AD+ EI +AIK G ++ + IK+ L + E + GS
Sbjct: 246 SVDWNRVRQADQKEIFEAIKSGGLADVKSKDIKKILQMVWEENQARREELLSSSGKAPGS 305
Query: 633 ID-------------------LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
D L+ L + D L G+G K+ CV L LQ +
Sbjct: 306 ADEAEGEKHAEVEKAEQNIVSLDHLHLLSNDDAFNALTKYPGIGPKTASCVLLFCLQRPS 365
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGD 698
F VD +V R+ LGWV P PGD
Sbjct: 366 FAVDTHVFRLCKWLGWV---PPPGD 387
>gi|240139469|ref|YP_002963944.1| hypothetical protein MexAM1_META1p2915 [Methylobacterium extorquens
AM1]
gi|418059134|ref|ZP_12697091.1| iron-sulfur cluster loop [Methylobacterium extorquens DSM 13060]
gi|240009441|gb|ACS40667.1| hypothetical protein MexAM1_META1p2915 [Methylobacterium extorquens
AM1]
gi|373567329|gb|EHP93301.1| iron-sulfur cluster loop [Methylobacterium extorquens DSM 13060]
Length = 254
Score = 55.8 bits (133), Expect = 1e-04, Method: Composition-based stats.
Identities = 50/172 (29%), Positives = 76/172 (44%), Gaps = 14/172 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW + A EI I + A RIK L + E G++ L++L ++ ++ + +
Sbjct: 86 DWTEIEAASVEEIEATIHGVTWPELKAPRIKAVLAAVQERVGALTLDFLGDMSVNEARGW 145
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L I G+G K+ V S L+ A PVD + R+A R G + + G H L + P
Sbjct: 146 LEAIPGIGPKTSAAVLSFSTLRMPALPVDSHHHRVAQRTGLIGAKVDVGPSHAVLRAQLP 205
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D W + LY+ H ++ G+ C RSP CG C L C
Sbjct: 206 --DD-----W------SAQKLYDNHEVLMLHGQRVCFHRSPACGRCVLLDIC 244
>gi|83816451|ref|YP_446456.1| endonuclease III [Salinibacter ruber DSM 13855]
gi|83757845|gb|ABC45958.1| endonuclease III [Salinibacter ruber DSM 13855]
Length = 324
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 19/122 (15%)
Query: 645 KVKEYLLDIE---GLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
KV E + D+E G+G K+ V ++ A PVD +V R+A R+G
Sbjct: 146 KVPETIDDLETLTGVGRKTARVVAQVAHDADALPVDTHVFRVANRIG------------- 192
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+KE K R+ + + E H+ +I G+ C RSP+C CP+ ECK
Sbjct: 193 -LVKEDATTPKKVEQQLKRV--IPKAEWGEAHHLLILHGRYTCTARSPDCHDCPIHEECK 249
Query: 762 HF 763
H+
Sbjct: 250 HY 251
>gi|335996612|ref|ZP_08562529.1| DNA lyase [Lactobacillus ruminis SPM0211]
gi|335351682|gb|EGM53173.1| DNA lyase [Lactobacillus ruminis SPM0211]
Length = 213
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 21/121 (17%)
Query: 645 KVKEYLLDIEGLGLKS--VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
K +E L+++ G+G K+ VE + AF VD +V R+A RL V +P L I
Sbjct: 107 KTREELMELTGVGRKTANVELAECFGIP--AFAVDTHVSRVANRLALV--EPTKNVLEI- 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
E +M+++ D+ E H+ +I +G+ C R P C CPL ECK+
Sbjct: 162 ---ERQLMEQV-----------DENRWIEAHHLLIAWGRHQCLARKPKCETCPLSFECKY 207
Query: 763 F 763
F
Sbjct: 208 F 208
>gi|448702713|ref|ZP_21700146.1| HhH-GPD family protein [Halobiforma lacisalsi AJ5]
gi|445777274|gb|EMA28244.1| HhH-GPD family protein [Halobiforma lacisalsi AJ5]
Length = 268
Score = 55.8 bits (133), Expect = 1e-04, Method: Composition-based stats.
Identities = 50/176 (28%), Positives = 76/176 (43%), Gaps = 24/176 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEY 649
E++ A++S +A+ I G N + I E ++E GS ++++ P V+E
Sbjct: 92 ESLAGAEQSALAETISSAGLYNQKSEIIIETAEWVLEEFGSAAAFDAFVKDENPGTVRET 151
Query: 650 LLDIEGLGLKSVECVRLLSLQHIA-FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL++ G+G K+ +CV L + FPVD +V RI RLG P ++ L E P
Sbjct: 152 LLEVRGVGPKTADCVLLFAGGRTGVFPVDTHVHRIYRRLGIAPADADHEEVRAVLEDEVP 211
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
C H I FG+ +C R P C ACP+ C
Sbjct: 212 AAK----------CGFG-------HTATIQFGREYCTARKPACLEDPDACPMGDLC 250
>gi|189200344|ref|XP_001936509.1| base excision DNA repair protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187983608|gb|EDU49096.1| base excision DNA repair protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 522
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 71/188 (37%), Gaps = 35/188 (18%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN--------RLVELHGS------- 632
SVDW VR AD+ E+ +AIK G ++ + IK+ L R EL S
Sbjct: 241 SVDWNRVRQADQKEVFEAIKSGGLADVKSKDIKKILQMVWEENQARREELLSSSDKAPGS 300
Query: 633 -------------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
I L+ L + D L G+G K+ CV L LQ +
Sbjct: 301 ANEAEGEKHAEVAKAEQNIISLDHLHLLSNDDAFNALTKYPGIGPKTASCVLLFCLQRPS 360
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
F VD +V R+ LGWVP P L + + Y + D Y LH
Sbjct: 361 FAVDTHVFRLCKWLGWVPPPGDPAGLAPGAKGTFAGPTRNSTYAHCEVRVPDH-LKYPLH 419
Query: 734 YQMITFGK 741
+I GK
Sbjct: 420 QLLIRHGK 427
>gi|332800460|ref|YP_004461959.1| HhH-GPD family protein [Tepidanaerobacter acetatoxydans Re1]
gi|438003859|ref|YP_007273602.1| Endonuclease III [Tepidanaerobacter acetatoxydans Re1]
gi|332698195|gb|AEE92652.1| HhH-GPD family protein [Tepidanaerobacter acetatoxydans Re1]
gi|432180653|emb|CCP27626.1| Endonuclease III [Tepidanaerobacter acetatoxydans Re1]
Length = 217
Score = 55.8 bits (133), Expect = 1e-04, Method: Composition-based stats.
Identities = 45/171 (26%), Positives = 74/171 (43%), Gaps = 16/171 (9%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
+A+ D+ ++A IK N A ++K F + + +G D+ L ++ LL
Sbjct: 62 DAILAVDKDKLAALIKSTMYYNQKALKLKNFCRYIKQNYGG-DIYSLFEKSIPNMRAELL 120
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I+G+G ++ + + L + F VD RI RLG+ LP D +Y M
Sbjct: 121 SIKGIGPETADSIILYAAAKPIFVVDAYTRRIFSRLGF-----LPDD------AKYSQMQ 169
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
M P D E H ++ GK +CK + P C CP++ C +
Sbjct: 170 DFFMSNLPS----DVNLFNEYHALIVRLGKDYCKNKKPLCNECPVKNHCSN 216
>gi|385773949|ref|YP_005646516.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
HVE10/4]
gi|323478064|gb|ADX83302.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfolobus islandicus
HVE10/4]
Length = 233
Score = 55.8 bits (133), Expect = 1e-04, Method: Composition-based stats.
Identities = 49/188 (26%), Positives = 86/188 (45%), Gaps = 20/188 (10%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
+K V E + AD S+I +A+K G A R+K ++E + + L + P
Sbjct: 61 EKKVGVTPEKLSDADLSDIENALKISGLYKTKAKRLKIISKIILEKYNGLIDNLLNSSNP 120
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
+E LL EG+G K+ + V L + FP+D ++ R++ RLG VP+ + L
Sbjct: 121 ---REELLKFEGIGEKTADVVLLTCRGYEVFPIDTHITRVSKRLGIVPMNAKYELISSTL 177
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ + D +Q++L +I G+ CK R P C +C ++ C+++
Sbjct: 178 KELFSAYDLLQLHL-----------------LLIAHGRQTCKARKPLCNSCIIKECCEYY 220
Query: 764 ASAVASAR 771
+ AR
Sbjct: 221 SHRDGEAR 228
>gi|330834042|ref|YP_004408770.1| DNA-(apurinic or apyrimidinic site) lyase [Metallosphaera cuprina
Ar-4]
gi|329566181|gb|AEB94286.1| DNA-(apurinic or apyrimidinic site) lyase [Metallosphaera cuprina
Ar-4]
Length = 224
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 80/169 (47%), Gaps = 27/169 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVE--LHGSIDLEWLRNVPPDKVKEYLLDIE 654
AD I IK G N + I+E L R ++ HG D+ L +V P+ ++ L I+
Sbjct: 75 ADLEVIKFCIKSIGLYNNKSITIRE-LARFIQETYHG--DINKLLDVDPELARKELTRIK 131
Query: 655 GLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
G+G K+V+ V L + FPVD ++ RI+ RLG + G+ Y V+ +
Sbjct: 132 GIGNKTVDVVLLTCKGYKTFPVDTHIFRISKRLG------IKGN--------YKVVSE-- 175
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
W Y TL H +IT G+ CK +P C +C + C+++
Sbjct: 176 --FWKNSVY---DTL-NAHLILITHGRKTCKAINPKCESCMINDCCRYY 218
>gi|359406392|ref|ZP_09199085.1| endonuclease III [Prevotella stercorea DSM 18206]
gi|357556157|gb|EHJ37776.1| endonuclease III [Prevotella stercorea DSM 18206]
Length = 212
Score = 55.8 bits (133), Expect = 1e-04, Method: Composition-based stats.
Identities = 46/165 (27%), Positives = 69/165 (41%), Gaps = 27/165 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
ADE E+ + +K N A LVEL I E+ VP D + LL + G+
Sbjct: 68 ADEFEVYEYVKSVSYPNAKA-------RHLVELARMIQNEYGGEVPHDF--DSLLRLPGV 118
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHIHLLKEYPVMDKIQM 715
G K+ + ++ A PVD +V R++ RLG VP P + HL+K P D
Sbjct: 119 GRKTANVMLAVAFGEAAMPVDTHVFRVSHRLGLVPRTADTPLKVEQHLVKHIPADD---- 174
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L + H+ ++ G+ C R +C C + C
Sbjct: 175 -------------LAKAHHWLLLHGRYICTSRKAHCEKCEFKAVC 206
>gi|124027163|ref|YP_001012483.1| EndoIII-related endonuclease [Hyperthermus butylicus DSM 5456]
gi|123977857|gb|ABM80138.1| predicted EndoIII-related endonuclease [Hyperthermus butylicus DSM
5456]
Length = 242
Score = 55.8 bits (133), Expect = 1e-04, Method: Composition-based stats.
Identities = 50/178 (28%), Positives = 79/178 (44%), Gaps = 20/178 (11%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
+M V EAV A E + +AI+ G A ++E R++E G + L +P
Sbjct: 71 EMVGVSPEAVLSAPEDRLIEAIRPAGLARQKARALREAARRILEAGGE---KVLLEMPWR 127
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+++E+LL I G+G K+ + L + F VD + RIA R G L+
Sbjct: 128 ELREFLLSIPGVGKKTADVFLQLVRKAPVFAVDTHAARIAKRWG--------------LV 173
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
E D+ L L + + H +I G+ +C+ R+P C CPLR C +
Sbjct: 174 GEKAGYDETSRAL---LEFFGPERSENAHRLLIALGRTYCRARNPRCDVCPLRDICPY 228
>gi|346314278|ref|ZP_08855799.1| endonuclease III [Erysipelotrichaceae bacterium 2_2_44A]
gi|345906636|gb|EGX76360.1| endonuclease III [Erysipelotrichaceae bacterium 2_2_44A]
Length = 215
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 58/127 (45%), Gaps = 18/127 (14%)
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
LE V P+ +KE L + G+G K+ VR + + VD +V RI+ RLG +Q
Sbjct: 94 LESFDGVVPESMKE-LTSLAGVGRKTANVVRSVCFDIPSIAVDTHVERISKRLGLAKVQD 152
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
+ ++ E + K++ W R H+ I FG+ FC R+P C C
Sbjct: 153 -----SVEVV-EQKLKRKLKRERWNRA-----------HHLFIFFGRYFCTARNPKCEEC 195
Query: 755 PLRGECK 761
P + CK
Sbjct: 196 PFKEFCK 202
>gi|171684933|ref|XP_001907408.1| hypothetical protein [Podospora anserina S mat+]
gi|170942427|emb|CAP68079.1| unnamed protein product [Podospora anserina S mat+]
Length = 813
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 62/135 (45%), Gaps = 20/135 (14%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-------GS-------- 632
S+DWE VR + + ++I+ G A IK L+++ E + G+
Sbjct: 474 SIDWEKVRVSSPQALVNSIRISGNGPKKAQHIKLILDKVYEENLERMKQAGTAENTDKDG 533
Query: 633 -----IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 687
+ L+++ + D+ E + G+G+K+ CV L L+ F VD +V + L
Sbjct: 534 APPDLLSLDYMHAMTKDQAMEKFVSFPGIGIKTAACVSLFCLRMPCFAVDTHVHKFCRWL 593
Query: 688 GWVPLQPLPGDLHIH 702
GW P++ P ++ H
Sbjct: 594 GWTPVKADPDNVFRH 608
>gi|150020336|ref|YP_001305690.1| endonuclease III [Thermosipho melanesiensis BI429]
gi|149792857|gb|ABR30305.1| endonuclease III [Thermosipho melanesiensis BI429]
Length = 203
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 71/166 (42%), Gaps = 29/166 (17%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDKVKEYLLDIEG 655
A E ++ + IK G A RI E +V + G + PD ++E LL + G
Sbjct: 61 AKEEDLYELIKPAGLYRQKAKRIIEISKIIVNKFSGKV---------PDTLEE-LLTLPG 110
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+G K+ V +S A VD +V RI+ RLGW + P + L+K P
Sbjct: 111 VGRKTANIVLYVSFSKPALAVDTHVHRISNRLGWCKTKN-PNETEFALMKLLP------K 163
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
LW ++ M+ FGK C R+P C CP+ CK
Sbjct: 164 DLWG-----------PINGSMVKFGKNVCLPRNPKCDICPIYDYCK 198
>gi|404398986|ref|ZP_10990570.1| endonuclease III [Pseudomonas fuscovaginae UPB0736]
Length = 212
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+ELHGS VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIETCRLLIELHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G P K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTGLAPG------------KNVLEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KQLMKFVPKQYLLDS------HHWLILLGRYVCQARKPRCGSCRIEDLCEY 205
>gi|448385412|ref|ZP_21563918.1| HhH-GPD family protein [Haloterrigena thermotolerans DSM 11522]
gi|445656907|gb|ELZ09739.1| HhH-GPD family protein [Haloterrigena thermotolerans DSM 11522]
Length = 307
Score = 55.5 bits (132), Expect = 2e-04, Method: Composition-based stats.
Identities = 48/176 (27%), Positives = 80/176 (45%), Gaps = 24/176 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEY 649
E++ A++S +A+ I G N + I + ++E GS ++++ PP +V+E
Sbjct: 127 ESLAAAEQSTLAETISGAGLYNQKSETIIDTAEWVLEEFGSAAAFDAFVKDEPPAEVRET 186
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL + G+G K+ +CV L + + FPVD +V RI R+G P + L ++ P
Sbjct: 187 LLGVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAPPDADHEAVREVLERDVP 246
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
C H I FG+ +C+ R+P C ACP+ C
Sbjct: 247 AAK----------CGFG-------HTATIQFGREYCRARTPACLEDPDACPMADRC 285
>gi|448350252|ref|ZP_21539071.1| HhH-GPD family protein [Natrialba taiwanensis DSM 12281]
gi|445637759|gb|ELY90907.1| HhH-GPD family protein [Natrialba taiwanensis DSM 12281]
Length = 276
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 53/127 (41%), Gaps = 22/127 (17%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
P +V+E LLD+ G+G K+ +CV L + + FPVD +V RI RLG P D+
Sbjct: 151 PGEVRETLLDVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRLGIAPPDADHEDVRE 210
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLR 757
+ E P C H I FG+ FC R P C ACP+
Sbjct: 211 VVEDEVPTAK----------CGFG-------HTATIQFGREFCTARKPACLDDPEACPMA 253
Query: 758 GECKHFA 764
C
Sbjct: 254 DICDQVG 260
>gi|456012482|gb|EMF46170.1| Endonuclease III [Planococcus halocryophilus Or1]
Length = 226
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 82/192 (42%), Gaps = 32/192 (16%)
Query: 570 LLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVEL 629
L+ R+ + ++ H+ D V E+ I+ G A I+ L++
Sbjct: 47 LVNRVTADLFQKYHEPQDYV------AVSLEELQQDIRSIGLFRNKAKNIQALSRILIDE 100
Query: 630 HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGW 689
H S+ VP D+ + L+ + G+G K+ V ++ A VD +V R+A RLG
Sbjct: 101 HNSV-------VPADR--DLLMTLPGVGRKTANVVVSVAFGIPALAVDTHVERVAKRLGL 151
Query: 690 VPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSP 749
+ P + ++K+ P D W + H+Q+I FG+ CK ++P
Sbjct: 152 SRWKDNPLQVEETIMKKTPADD------WSKT-----------HHQIIFFGRYHCKSQNP 194
Query: 750 NCGACPLRGECK 761
C CPL C+
Sbjct: 195 GCHICPLFDRCR 206
>gi|255513717|gb|EET89982.1| HhH-GPD family protein [Candidatus Micrarchaeum acidiphilum
ARMAN-2]
Length = 252
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 80/188 (42%), Gaps = 21/188 (11%)
Query: 581 RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRN 640
+SH +D + E + I+ G A R+K ++ GS+ ++
Sbjct: 86 KSHGALDPAK---IASMPEERLQKLIRSSGFYRQKAKRLKAVSKHIISSSGSVKAFLSKD 142
Query: 641 VPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLH 700
+++ LL ++G+G ++ + V L + F VD RI RL + +L
Sbjct: 143 KAA--LRKELLSMDGIGPETADSVLLYAAGKRIFVVDAYTKRIMSRLYGTNPKIGYDELQ 200
Query: 701 IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
H E P + +Y ++H Q++ GK +CK + P C CPL+G C
Sbjct: 201 SHFHAELP--KSVSIY-------------KDMHAQLVELGKNYCKTK-PECSPCPLKGGC 244
Query: 761 KHFASAVA 768
K++ + +A
Sbjct: 245 KYYKANIA 252
>gi|322700299|gb|EFY92055.1| base excision DNA repair protein [Metarhizium acridum CQMa 102]
Length = 364
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 65/169 (38%), Gaps = 19/169 (11%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE + ++ + +I+ G + I L + +G L+ L + + +
Sbjct: 185 WEEIANGGQARLEKSIQTGGLAATKSKVIIGILQQTKAKYGVYSLDHLFEASDEDAMKEM 244
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ +G+G K+ CV L LQ +F VD +V RI LGW P + HL P
Sbjct: 245 ISFQGVGPKTASCVLLFCLQRPSFAVDTHVHRITGLLGWRPAAAGREETQAHLDAVVPDE 304
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF--CKKRSPNCGACPLR 757
+K Y LH +T G+ CK N C LR
Sbjct: 305 EK-----------------YPLHVLFVTHGRQCEECKAGGRNAKICELR 336
>gi|448728587|ref|ZP_21710911.1| endonuclease III [Halococcus saccharolyticus DSM 5350]
gi|445796772|gb|EMA47269.1| endonuclease III [Halococcus saccharolyticus DSM 5350]
Length = 272
Score = 55.1 bits (131), Expect = 2e-04, Method: Composition-based stats.
Identities = 50/188 (26%), Positives = 80/188 (42%), Gaps = 24/188 (12%)
Query: 580 ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EW 637
+R D++D A+ D+ +A+ I+ G N + + R+V+ +GS +
Sbjct: 81 DRYGDRVDGDLAAALANDDQPTLAETIQSAGLYNQKSATMIAIAERVVDEYGSAAEFDAF 140
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLP 696
+ P+ V++ LLD G+G K+ +CV L S + FPVD +V RI RLG P
Sbjct: 141 VTEKDPETVRDALLDFSGVGPKTADCVLLFSGGREGVFPVDTHVHRIYRRLGIAPSDADH 200
Query: 697 GDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----G 752
++ + + P C H I FG+ +C R P C
Sbjct: 201 EEVRTVVEDQVPAAK----------CGFG-------HTASIQFGREYCSARKPACLDDPD 243
Query: 753 ACPLRGEC 760
ACP+ C
Sbjct: 244 ACPMADLC 251
>gi|15678789|ref|NP_275906.1| endonuclease III [Methanothermobacter thermautotrophicus str. Delta
H]
gi|2621854|gb|AAB85267.1| endonuclease III [Methanothermobacter thermautotrophicus str. Delta
H]
Length = 233
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 78/172 (45%), Gaps = 27/172 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E V A EI I++ G ++ A R++E ++R++ LE PD + E LL
Sbjct: 78 EDVAYAPLEEIEALIRKAGFYHVKARRVRE-VSRII-------LEEYDGKVPDDINE-LL 128
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ CV + + A PVD +V RI+ R+G V + P + L+K P
Sbjct: 129 KLPGVGRKTANCVLVYAFGRPAIPVDTHVHRISNRIGLVDTRT-PEETERALMKVIP--- 184
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
++ EL+ M+ FG+ C+ P CP+ C ++
Sbjct: 185 --------------REYWIELNDLMVQFGQDICRPLGPRHEECPIADHCDYY 222
>gi|403383159|ref|ZP_10925216.1| endonuclease III [Kurthia sp. JC30]
Length = 219
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 72/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ + I+ G A I++ RL+E++G +P ++ E L+ + G+G K+
Sbjct: 72 ELENDIRSIGLFRSKAKNIQKLCARLLEVYGG-------EIPAER--EELVTLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V RI RLG + ++ E +M K + W +
Sbjct: 123 ANVVLSVAFNVPAMAVDTHVERITKRLGLCRWKDSVLEV------EQTIMRKTPIERWNK 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+QMI FG+ CK + P C CPL +C+
Sbjct: 177 A-----------HHQMIFFGRYHCKAQRPQCDTCPLFDDCR 206
>gi|403069784|ref|ZP_10911116.1| HhH-GPD family protein [Oceanobacillus sp. Ndiop]
Length = 213
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 71/160 (44%), Gaps = 19/160 (11%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+A I+ G NI A RIK F+N H +I E ++ + ++++ LL + G+G ++
Sbjct: 68 ELAQLIRSSGYFNIKAKRIKAFMNWFKRYHYNI--EEIKKMDSNQLRYELLKVNGIGKET 125
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
+ + L + F VD RI R+G+ G + + K+ P
Sbjct: 126 ADVMLLYAFDKPIFVVDAYARRIFYRIGYDMPTAYDG-FRLDVEKQLP------------ 172
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+L Q E H ++ K+ CK SP C CPL C
Sbjct: 173 -SHLQQYN--EYHALLVEHAKVHCKS-SPICKGCPLHSIC 208
>gi|304314918|ref|YP_003850065.1| endonuclease III [Methanothermobacter marburgensis str. Marburg]
gi|302588377|gb|ADL58752.1| endonuclease III [Methanothermobacter marburgensis str. Marburg]
Length = 215
Score = 55.1 bits (131), Expect = 2e-04, Method: Composition-based stats.
Identities = 47/174 (27%), Positives = 79/174 (45%), Gaps = 27/174 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E V A ++ +++ G ++ A RI+E L+E +G PD + E LL
Sbjct: 61 EDVAYAPVEKLEQLVRKAGFYHVKARRIREVSRILLEEYGG--------RVPDDIDE-LL 111
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ CV + + PVD +V RI+ R+G V + P + +
Sbjct: 112 KLPGVGRKTANCVLVYAFNKPVVPVDTHVHRISNRIGLVNTRT-PEE-----------TE 159
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
++ M + PR K EL+ M+ FG+ C+ P CP+ EC ++ S
Sbjct: 160 RVLMEVIPR------KYWIELNDLMVQFGQDICRPVGPRHEECPIADECDYYKS 207
>gi|448362246|ref|ZP_21550857.1| HhH-GPD family protein [Natrialba asiatica DSM 12278]
gi|445648767|gb|ELZ01715.1| HhH-GPD family protein [Natrialba asiatica DSM 12278]
Length = 293
Score = 55.1 bits (131), Expect = 2e-04, Method: Composition-based stats.
Identities = 52/189 (27%), Positives = 78/189 (41%), Gaps = 24/189 (12%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIE 654
A+++++A+ I G N + I R+ + GS ++ + P +V+E LLD+
Sbjct: 120 AEQTQLAETISSAGLYNQKSEVIIGAAERVCDEFGSAAAFDAFVTDEAPGEVRETLLDVR 179
Query: 655 GLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKI 713
G+G K+ +CV L + + FPVD +V RI RLG P D+ L E P
Sbjct: 180 GVGPKTADCVLLFAGGRSGVFPVDTHVHRIYRRLGIAPPDADHEDVREVLEDEVPTAK-- 237
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVAS 769
C H I FG+ FC R P C ACP+ C A+
Sbjct: 238 --------CGFG-------HTATIQFGREFCTARKPACLDDPEACPMADICDQVGVYPAT 282
Query: 770 ARFALPGPS 778
P +
Sbjct: 283 GEVVDPAET 291
>gi|325106521|ref|YP_004276175.1| endonuclease III [Pedobacter saltans DSM 12145]
gi|324975369|gb|ADY54353.1| endonuclease III [Pedobacter saltans DSM 12145]
Length = 239
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 57/134 (42%), Gaps = 18/134 (13%)
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV 690
I L + P+ VK+ L + G+G K+ + + A VD +V R+A R+G
Sbjct: 94 AKILLNEFNGIVPEDVKD-LQKMPGVGRKTANVISSVIYHAPAMAVDTHVFRVANRIGLT 152
Query: 691 PLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPN 750
P + L+ P DKI + H+ +I G+ C RSP
Sbjct: 153 TNAKTPLAVEKQLVAHLP-QDKIHI----------------AHHWLILHGRYICLARSPK 195
Query: 751 CGACPLRGECKHFA 764
C CPL G CK++A
Sbjct: 196 CDICPLTGFCKYYA 209
>gi|312899629|ref|ZP_07758955.1| endonuclease III [Enterococcus faecalis TX0470]
gi|311293308|gb|EFQ71864.1| endonuclease III [Enterococcus faecalis TX0470]
Length = 215
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 49/170 (28%), Positives = 72/170 (42%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI D IK G A IK +L+E VP + ++ L+
Sbjct: 63 EALAAAPVEEIIDKIKTIGLYRNKAKNIKACAQQLLERFNG-------EVP--QTRDELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + AF VD +V R++ RL L ++ E +M
Sbjct: 114 SLPGVGRKTANVVMGDAFGEPAFAVDTHVERVSKRLRICKLNANVTEV------EQTLMR 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ MI FG+ C R+P C ACPL C+
Sbjct: 168 KVPKELWVKT-----------HHTMIFFGRYHCLARAPKCEACPLLYMCQ 206
>gi|218782481|ref|YP_002433799.1| endonuclease III [Desulfatibacillum alkenivorans AK-01]
gi|218763865|gb|ACL06331.1| endonuclease III [Desulfatibacillum alkenivorans AK-01]
Length = 210
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 48/161 (29%), Positives = 71/161 (44%), Gaps = 27/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ + I G + A R+KE L+E HG + VP D E LL + G+G K+
Sbjct: 73 ELEELIHATGFFHTKAKRVKECAAALMEKHGGV-------VPRDM--ESLLALPGVGRKT 123
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V + + VD +V RI+ RLG+ + P + L+K L P+
Sbjct: 124 ANVVLNAAFEIPGIVVDTHVQRISQRLGFTKFKD-PVKIEFDLMK-----------LLPK 171
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++D + LH +I G+ C R P CG C L CK
Sbjct: 172 ESWID----FSLH--LIYHGRAVCTARKPKCGECTLAEWCK 206
>gi|448391644|ref|ZP_21566739.1| HhH-GPD family protein [Haloterrigena salina JCM 13891]
gi|445665056|gb|ELZ17734.1| HhH-GPD family protein [Haloterrigena salina JCM 13891]
Length = 274
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 49/198 (24%), Positives = 83/198 (41%), Gaps = 26/198 (13%)
Query: 592 EAVRCADESEIADAIKERG---QQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
E++ A+ S +A+ I G Q++ + R E++ + D ++++ P +V++
Sbjct: 90 ESLAAAERSRLAETISSAGLYNQKSEVLIRTAEWILEEFGSAAAFD-AFVKDEEPAEVRD 148
Query: 649 YLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
LL + G+G K+ +CV L + + FPVD +V RI R+G P D+ L +E
Sbjct: 149 TLLSVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAPAAADHEDVRAVLEREV 208
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHF 763
P C H I FG+ +C R P C ACP+ C
Sbjct: 209 PAAK----------CGFG-------HTATIQFGREYCTARKPACLEDPDACPMADVCDQV 251
Query: 764 ASAVASARFALPGPSEKG 781
A+ P + +
Sbjct: 252 GVYPATGEVVDPAEATEA 269
>gi|432328581|ref|YP_007246725.1| putative endoIII-related endonuclease [Aciduliprofundum sp.
MAR08-339]
gi|432135290|gb|AGB04559.1| putative endoIII-related endonuclease [Aciduliprofundum sp.
MAR08-339]
Length = 209
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 20/148 (13%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
AG K+ ++ E+ ID E+ VP + + LL + G+G K+ V
Sbjct: 76 AGFYKQKARKIKEIARIIDEEYDGVVP--RTLDELLKLPGVGRKTANIVLSRCFDQDVIA 133
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQ 735
VDV+V RI+ RLGWV + P + L+K P +K E++
Sbjct: 134 VDVHVHRISNRLGWVSTR-TPEETERELMKILP-----------------KKYWREINEL 175
Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHF 763
++ FG+ C+ P C CP+ C +F
Sbjct: 176 LVMFGRTICRPVGPKCDECPVSDFCDYF 203
>gi|325968642|ref|YP_004244834.1| HhH-GPD family protein [Vulcanisaeta moutnovskia 768-28]
gi|323707845|gb|ADY01332.1| HhH-GPD family protein [Vulcanisaeta moutnovskia 768-28]
Length = 230
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 84/198 (42%), Gaps = 25/198 (12%)
Query: 565 QQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
Q WD++ + + S +K+ E + D + +A+ IK + A R+ +F
Sbjct: 53 QTRWDVVHEVLNRLRRLSLNKL-----EVIANTDPNYLAEVIKGVNYRFTKAQRLVKFAK 107
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
+ + G L LR V+++LL+ EG+G ++ + + L +L P+ R+
Sbjct: 108 NVTMIGGLGKLR-LRG----DVRDFLLNQEGVGRETADSIMLFALNIPTMPISQYTKRVF 162
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RL L + L Y + + L PR D T +H +IT GK +C
Sbjct: 163 SRL-----------LGLKLGDNYDMWKEFLEGLLPR----DLYTYKLIHASIITIGKKYC 207
Query: 745 KKRSPNCGACPLRGECKH 762
P C CPLR C +
Sbjct: 208 LPTDPLCNKCPLRDVCLY 225
>gi|223995635|ref|XP_002287491.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976607|gb|EED94934.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 251
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK-VKEYLLDIEG 655
D + I AI+ G I A RI+ L + + + E+L+ D+ +++ L +G
Sbjct: 95 VDTTRIETAIRVAGLAKIRAARIQGMLKTVQQERNDANFEYLQFYDSDEEIQKELSRFKG 154
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV 690
+G K++ CV L +L FPVD +V RI ++GW+
Sbjct: 155 MGPKTISCVLLFALGRPDFPVDTHVLRITKQIGWI 189
>gi|429853280|gb|ELA28361.1| HhH-GPD family base excision DNA repair protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 502
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/230 (22%), Positives = 85/230 (36%), Gaps = 64/230 (27%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH----------------- 630
S+DW AVR + ++ A+ G + + ++KE L ++ E +
Sbjct: 234 SIDWNAVRLSPIQKLRKAMITGGLSKVKSEQMKEILEKVFEENQLRKAAFVKEKESGEAA 293
Query: 631 ----------------------GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
G + L+ +R +P + +L +G+G+K+ C+ L
Sbjct: 294 SVKGAAKMSQGQKDHQLLKIDNGILTLDHIRTMPAAEAMPEILQYKGIGIKTTSCLLLFC 353
Query: 669 LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT 728
LQ +F VD +V R + LGWVP + H +Y V D+++
Sbjct: 354 LQQPSFAVDTHVWRFCLWLGWVPRKASADKTFKHC--DYRVPDELK-------------- 397
Query: 729 LYELHYQMITFGKIFCKKRSP--------NCGACPLRGECKHFASAVASA 770
Y LH I G+ + RS N CPL F A A
Sbjct: 398 -YGLHQLFIVHGQQCYRCRSGTAKGTAQWNAVVCPLEHLLDRFEKHEAKA 446
>gi|255654585|ref|ZP_05399994.1| endonuclease III [Clostridium difficile QCD-23m63]
gi|296449359|ref|ZP_06891141.1| endonuclease III [Clostridium difficile NAP08]
gi|296880707|ref|ZP_06904659.1| endonuclease III [Clostridium difficile NAP07]
gi|296261829|gb|EFH08642.1| endonuclease III [Clostridium difficile NAP08]
gi|296428280|gb|EFH14175.1| endonuclease III [Clostridium difficile NAP07]
Length = 285
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 29/164 (17%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IK G + +IK+ +L EL+ G + PD + E L+ + G+G K
Sbjct: 143 EISKEIKSCGLYKSKSQKIKDTSEQLCELYDGEV---------PDSL-EKLIKLPGVGRK 192
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+ V + H A VD +V R++ R+G V +P P E+ +M+ I W
Sbjct: 193 TAGVVLSNAFNHPAIAVDTHVFRVSNRIGIVD-EPNPQK------TEFALMEAIPKERWS 245
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
H+ +I G+ CK R+P C +CP++ +C ++
Sbjct: 246 -----------HSHHVLIFHGRRMCKARNPECASCPIKEDCNYY 278
>gi|254974165|ref|ZP_05270637.1| endonuclease III [Clostridium difficile QCD-66c26]
gi|255091566|ref|ZP_05321044.1| endonuclease III [Clostridium difficile CIP 107932]
gi|255313292|ref|ZP_05354875.1| endonuclease III [Clostridium difficile QCD-76w55]
gi|255515983|ref|ZP_05383659.1| endonuclease III [Clostridium difficile QCD-97b34]
gi|255649074|ref|ZP_05395976.1| endonuclease III [Clostridium difficile QCD-37x79]
gi|306519188|ref|ZP_07405535.1| endonuclease III [Clostridium difficile QCD-32g58]
gi|384359832|ref|YP_006197684.1| endonuclease III [Clostridium difficile BI1]
Length = 350
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 29/164 (17%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IK G + +IK+ +L EL+ G + PD + E L+ + G+G K
Sbjct: 208 EISKEIKSCGLYKSKSQKIKDTSEQLCELYDGEV---------PDSL-EKLIKLPGVGRK 257
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+ V + H A VD +V R++ R+G V +P P E+ +M+ I W
Sbjct: 258 TAGVVLSNAFNHPAIAVDTHVFRVSNRIGIVD-EPNPQK------TEFALMEAIPKERWS 310
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
H+ +I G+ CK R+P C +CP++ +C ++
Sbjct: 311 -----------HSHHVLIFHGRRMCKARNPECASCPIKEDCNYY 343
>gi|393796716|ref|ZP_10380080.1| DNA-(apurinic or apyrimidinic site) lyase [Candidatus
Nitrosoarchaeum limnia BG20]
Length = 175
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 37/145 (25%), Positives = 69/145 (47%), Gaps = 20/145 (13%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++E+ I+ ++ VP + E L+++ G+G K+ CV + + A PVD++V RI+
Sbjct: 51 RIIEVAKIINSKYKGKVPDNL--EKLVELPGVGRKTANCVLVYAFDKPAIPVDIHVHRIS 108
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RLG V + P + L++ P +K +++ + +G+ C
Sbjct: 109 NRLGLVNTKT-PEETEQELMRIIP-----------------KKFWIDINDTFVMYGQNIC 150
Query: 745 KKRSPNCGACPLRGECKHFASAVAS 769
K SP C C ++ CK + + S
Sbjct: 151 KPISPMCNVCKIKNNCKFYKTKNVS 175
>gi|423083243|ref|ZP_17071818.1| endonuclease III [Clostridium difficile 002-P50-2011]
gi|423086615|ref|ZP_17075014.1| endonuclease III [Clostridium difficile 050-P50-2011]
gi|357546324|gb|EHJ28255.1| endonuclease III [Clostridium difficile 050-P50-2011]
gi|357546344|gb|EHJ28274.1| endonuclease III [Clostridium difficile 002-P50-2011]
Length = 350
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 29/164 (17%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IK G + +IK+ +L EL+ G + PD + E L+ + G+G K
Sbjct: 208 EISKEIKSCGLYKSKSQKIKDTSEQLCELYDGEV---------PDSL-EKLIKLPGVGRK 257
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+ V + H A VD +V R++ R+G V +P P E+ +M+ I W
Sbjct: 258 TAGVVLSNAFNHPAIAVDTHVFRVSNRIGIVD-EPNPQK------TEFALMEAIPKERWS 310
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
H+ +I G+ CK R+P C +CP++ +C ++
Sbjct: 311 -----------HSHHVLIFHGRRMCKARNPECASCPIKEDCNYY 343
>gi|225849644|ref|YP_002729878.1| endonuclease III [Persephonella marina EX-H1]
gi|225645619|gb|ACO03805.1| endonuclease III [Persephonella marina EX-H1]
Length = 213
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 46/176 (26%), Positives = 76/176 (43%), Gaps = 29/176 (16%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D VD E V C E ++ + IK G + + EF R +++ DK+
Sbjct: 60 DLVDLEKVSCIHEEKLQEIIKPAGFYKRKSRTLIEFSRRF------------KDIEKDKI 107
Query: 647 -KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
++ LL ++G+G ++ + + L +L F VD R+ R+G+
Sbjct: 108 TRDLLLSVKGIGKETADSILLYALNRPYFVVDAYTRRVFSRIGF--------------FD 153
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ D+IQ L+ + D E H ++ GK FC+K+ P C CPL CK
Sbjct: 154 KNLSYDEIQE-LFEKNLPEDTDIYKEYHALIVELGKSFCRKK-PLCKDCPLFANCK 207
>gi|303246513|ref|ZP_07332792.1| HhH-GPD family protein [Desulfovibrio fructosovorans JJ]
gi|302492223|gb|EFL52098.1| HhH-GPD family protein [Desulfovibrio fructosovorans JJ]
Length = 216
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 47/167 (28%), Positives = 75/167 (44%), Gaps = 20/167 (11%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDKVKEYL 650
EA+ E+A+ I+ G + A R++ L +V EL G D+ L + D +E L
Sbjct: 63 EALYALPVEELAELIRPAGYFRVKAARLRNLLALIVHELGG--DITALADGGLDAARERL 120
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L ++G+G ++ + + L L +F VD RI R P D L+E M
Sbjct: 121 LAVKGVGPETADSILLYGLSLPSFVVDAYTARICNRHALAP-----EDAGYEELREL-FM 174
Query: 711 DKIQMYLWPRLCYLDQKTLY-ELHYQMITFGKIFCKKRSPNCGACPL 756
D + + LY E H ++ G +C+ R+P C +CPL
Sbjct: 175 DALP----------EDTALYNEFHALLVRVGNGWCRPRAPRCDSCPL 211
>gi|415719736|ref|ZP_11467775.1| Putative EndoIII-related endonuclease [Gardnerella vaginalis 1500E]
gi|388058996|gb|EIK81763.1| Putative EndoIII-related endonuclease [Gardnerella vaginalis 1500E]
Length = 223
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 22/172 (12%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D ++ A ++ IK G + A I RL + HG + P ++++
Sbjct: 66 DSRSLSAASVKQVESIIKPLGFYRVKAQHIIALSVRLESDFHGVV---------PSRMED 116
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L + G+G K+ VR + FPVD +V R+ RLGW P K+Y
Sbjct: 117 -LTSLPGVGRKTANVVRGNAFGLPGFPVDTHVIRVTGRLGWRRDCNKP--------KKYA 167
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ I+ + Y D L +++I G+ C R+P C +CPL+ C
Sbjct: 168 NPESIEKEI---TSYFDPADWTNLSHRLILHGRKICTARNPRCASCPLKTTC 216
>gi|242780797|ref|XP_002479670.1| HhH-GPD family base excision DNA repair protein [Talaromyces
stipitatus ATCC 10500]
gi|218719817|gb|EED19236.1| HhH-GPD family base excision DNA repair protein [Talaromyces
stipitatus ATCC 10500]
Length = 407
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 58/142 (40%), Gaps = 38/142 (26%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVEL------------------ 629
SVDW AVR A +E+ +A+K G + IKE LN + E
Sbjct: 221 SVDWNAVRVAPINEVFEAMKSGGLATTKSKYIKEILNMVYEENLARKEAHIKSEEEGNSG 280
Query: 630 -----HGS---------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL 669
H S + L+W+ + ++ L+ G+G K+ CV L L
Sbjct: 281 PAGAEHESKAQKEVEIALTDENVLSLDWIHALDKEEAMLELIKFPGIGPKTAACVVLFCL 340
Query: 670 QHIAFPVDVNVGRIAVRLGWVP 691
Q F VD ++ RI LGW+P
Sbjct: 341 QRPCFAVDTHIFRICKWLGWLP 362
>gi|337283826|ref|YP_004623300.1| endonuclease III [Pyrococcus yayanosii CH1]
gi|334899760|gb|AEH24028.1| endonuclease III [Pyrococcus yayanosii CH1]
Length = 234
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 45/164 (27%), Positives = 81/164 (49%), Gaps = 22/164 (13%)
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNR---LVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
++IA+A E+ Q+ + + ++ + ++ +VE+ I LE PD ++E L+ + G+
Sbjct: 81 NDIANASVEKMQEFLRSLKVGLWRSKGRWIVEV-SRIILERYDGRVPDTLEE-LMKLPGI 138
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K V A PVD +V R++ RLG PL+ P +K++ Y
Sbjct: 139 GRKCANIVLAYGFGKPAIPVDTHVNRVSKRLGLAPLEASP--------------EKVEEY 184
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L ++ ++ LY +++ M+ GK C+ P C CP+R C
Sbjct: 185 L--KVLIPKEEWLY-VNHAMVDHGKKVCRPIKPKCNECPVRNLC 225
>gi|344997006|ref|YP_004799349.1| HhH-GPD family protein [Caldicellulosiruptor lactoaceticus 6A]
gi|343965225|gb|AEM74372.1| HhH-GPD family protein [Caldicellulosiruptor lactoaceticus 6A]
Length = 232
Score = 54.7 bits (130), Expect = 3e-04, Method: Composition-based stats.
Identities = 46/173 (26%), Positives = 75/173 (43%), Gaps = 17/173 (9%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + + ++A+ IK G N R+KEF N L S DLE L + +++ LL
Sbjct: 74 EGILQTSDEKLAELIKPAGYYNQKTKRLKEFCNFLKREFNS-DLEKLFALDISSLRQVLL 132
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+G+G ++ + + L + F VD R+ RLG + + + Y +
Sbjct: 133 SQKGIGFETADSIILYGAEKPIFVVDSYTKRLFYRLGLIESEKI----------SYNDLQ 182
Query: 712 KIQMY-LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
I M L P+ + + E H ++ K CK + P C C LR C +F
Sbjct: 183 AIIMANLTPQTKFFN-----EFHALIVKHCKEICKSKKPICNKCCLRLICNYF 230
>gi|346321129|gb|EGX90729.1| HhH-GPD family base excision DNA repair protein [Cordyceps
militaris CM01]
Length = 330
Score = 54.3 bits (129), Expect = 3e-04, Method: Composition-based stats.
Identities = 58/192 (30%), Positives = 82/192 (42%), Gaps = 31/192 (16%)
Query: 576 STGEERSHDKM--DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH--- 630
ST +RS D S WEA+ + ++ AI+ G + A + + +N L +H
Sbjct: 155 STRAKRSMDAAYGRSDAWEAIAAGGQPQLQAAIRCGG---LAATKSRVIVNLLAAVHSKY 211
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV 690
G L+ L + + E LL G+G K+ CV L LQ +F VD +V RIA LGW
Sbjct: 212 GMYSLDHLFHASDTEAMEELLAFPGVGPKTASCVLLFCLQRPSFAVDTHVYRIAGILGWR 271
Query: 691 PLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPN 750
P + HL P D ++ Y LH +I G+ C+
Sbjct: 272 PPEATREQAQAHLDASIP--DALK---------------YPLHVLLIAHGRT-CRA---- 309
Query: 751 CGACPLRGE-CK 761
CGA GE CK
Sbjct: 310 CGAKAAAGESCK 321
>gi|21227382|ref|NP_633304.1| endonuclease III [Methanosarcina mazei Go1]
gi|20905743|gb|AAM30976.1| Endonuclease III [Methanosarcina mazei Go1]
Length = 205
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 74/165 (44%), Gaps = 28/165 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A+ +I + I++ G + AGRIKE L+E + VP D E LL + G+
Sbjct: 63 ANVEDIEELIRDVGFYRVKAGRIKEISRILLEDYNG-------KVPDDM--ETLLKLPGV 113
Query: 657 GLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
G K+ CV + L+ A VD +V RI+ RLG V + P + + L K P
Sbjct: 114 GRKTANCVLAHAFLKEDALAVDTHVHRISNRLGRVVTKN-PEETEMELKKLLP------- 165
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
QK ++ ++ FG+ C+ SP CG C L C
Sbjct: 166 ----------QKYWRHVNILLVKFGQNVCRPISPRCGICILNDIC 200
>gi|146304857|ref|YP_001192173.1| DNA-(apurinic or apyrimidinic site) lyase [Metallosphaera sedula
DSM 5348]
gi|145703107|gb|ABP96249.1| DNA-(apurinic or apyrimidinic site) lyase [Metallosphaera sedula
DSM 5348]
Length = 230
Score = 54.3 bits (129), Expect = 4e-04, Method: Composition-based stats.
Identities = 46/190 (24%), Positives = 89/190 (46%), Gaps = 25/190 (13%)
Query: 578 GEERSHDKMDS---VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID 634
G +++++++D + E + AD I I++ G N IKE +++ +G D
Sbjct: 52 GAKKAYEELDKEVGITAEGLSRADPEVIKRCIRKVGLHNNKTKVIKEVSTKILNEYGG-D 110
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
+ + ++ K +E L+++ G+G K+ + + + + FP+D ++ RI+ RLG
Sbjct: 111 INKVLDLGLPKAREKLVELPGVGKKTADVLLITCRDYPVFPIDTHIFRISKRLG------ 164
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
+ G+ DK+ + W + D L H +IT G+ CK P C C
Sbjct: 165 IDGN-----------YDKVSSF-WREVS--DNLRL-RAHLLLITHGRATCKAIKPKCDTC 209
Query: 755 PLRGECKHFA 764
L C+++A
Sbjct: 210 VLNDCCEYYA 219
>gi|254561880|ref|YP_003068975.1| hypothetical protein METDI3481 [Methylobacterium extorquens DM4]
gi|254269158|emb|CAX25124.1| hypothetical protein METDI3481 [Methylobacterium extorquens DM4]
Length = 254
Score = 54.3 bits (129), Expect = 4e-04, Method: Composition-based stats.
Identities = 50/172 (29%), Positives = 75/172 (43%), Gaps = 14/172 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW + A EI I + A RIK L + E G++ L++L ++ D+ + +
Sbjct: 86 DWTEIGAASVEEIETTIHGVTWPELKAPRIKAVLAAVQERVGALTLDFLGDMSVDEARGW 145
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L I G+G K+ V S L+ A PVD + R+A R G + + G H L + P
Sbjct: 146 LEAIPGIGPKTSAAVLSFSTLRMPALPVDSHHHRVAQRTGLIGAKVDVGPSHAVLRAQLP 205
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D W + LY+ H ++ G+ C RSP C C L C
Sbjct: 206 --DD-----W------SAQKLYDNHEVLMLHGQRVCFHRSPACDRCVLLDIC 244
>gi|423091165|ref|ZP_17079423.1| endonuclease III [Clostridium difficile 70-100-2010]
gi|357555447|gb|EHJ37097.1| endonuclease III [Clostridium difficile 70-100-2010]
Length = 350
Score = 54.3 bits (129), Expect = 4e-04, Method: Composition-based stats.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 29/164 (17%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLK 659
EI+ IK G + +IK+ +L EL+ G + PD + E L+ + G+G K
Sbjct: 208 EISKEIKSCGLYKSKSQKIKDTSEQLCELYDGEV---------PDSL-EKLIKLPGVGRK 257
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+ V + H A VD +V R++ R+G V +P P E+ +M+ I W
Sbjct: 258 TAGVVLSNAFNHPAIAVDTHVFRVSNRIGIVD-EPNPQK------TEFALMETIPKERWS 310
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
H+ +I G+ CK R+P C +CP++ +C ++
Sbjct: 311 -----------HSHHVLIFHGRRMCKARNPECASCPIKEDCNYY 343
>gi|393201411|ref|YP_006463253.1| EndoIII-related endonuclease [Solibacillus silvestris StLB046]
gi|406665654|ref|ZP_11073426.1| Endonuclease III [Bacillus isronensis B3W22]
gi|327440742|dbj|BAK17107.1| predicted EndoIII-related endonuclease [Solibacillus silvestris
StLB046]
gi|405386519|gb|EKB45946.1| Endonuclease III [Bacillus isronensis B3W22]
Length = 219
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ + I+ G A I+ RL+ +G VP + E L+ + G+G K+
Sbjct: 72 ELQNDIRSIGLYRNKAKNIQLLCARLINEYGG-------EVPASR--EELVTLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ E +M K + W R
Sbjct: 123 ANVVLSVAFDIPAMAVDTHVERVSKRLGLCRWKDSVLEV------EETIMKKTPIERWSR 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+Q+I FG+ CK ++P CG CPL +C+
Sbjct: 177 A-----------HHQIIFFGRYHCKAQNPGCGTCPLLDDCR 206
>gi|424811992|ref|ZP_18237232.1| putative EndoIII-related endonuclease [Candidatus Nanosalinarum sp.
J07AB56]
gi|339756214|gb|EGQ39797.1| putative EndoIII-related endonuclease [Candidatus Nanosalinarum sp.
J07AB56]
Length = 219
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 70/161 (43%), Gaps = 18/161 (11%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKEY 649
W V A E E+ D ++ G A RI++ L + E G +++L + + KE+
Sbjct: 61 WREVEEAPEGELTDVVRVAGLGPTKAERIQKALELVREETGGDYSVDFLDEMSTEDAKEW 120
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L + G+G K+ V + FPVD +V R++ R G VP H + E V
Sbjct: 121 LESLPGVGPKTAAIVLCFHMGRAVFPVDTHVHRLSKRWGIVPEN--TSRKKTHRIMERKV 178
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPN 750
D+++ Y +H MI G+ C+ R+P
Sbjct: 179 PDELK---------------YTMHILMIRHGRNDCEARNPT 204
>gi|323489718|ref|ZP_08094945.1| endonuclease III [Planococcus donghaensis MPA1U2]
gi|323396849|gb|EGA89668.1| endonuclease III [Planococcus donghaensis MPA1U2]
Length = 226
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A I+ L++ H S+ VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIRSIGLFRNKAKNIQALSQILIDEHNSV-------VPADR--DLLMTLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R+A RLG + P + ++K+ P D W +
Sbjct: 123 ANVVVSVAFGIPALAVDTHVERVAKRLGLSRWKDNPLQVEETIMKKTPADD------WSK 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+Q+I FG+ CK ++P C CPL C+
Sbjct: 177 T-----------HHQIIFFGRYHCKSQNPGCHICPLFDRCR 206
>gi|325958693|ref|YP_004290159.1| DNA-(apurinic or apyrimidinic site) lyase [Methanobacterium sp.
AL-21]
gi|325330125|gb|ADZ09187.1| DNA-(apurinic or apyrimidinic site) lyase [Methanobacterium sp.
AL-21]
Length = 216
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 77/172 (44%), Gaps = 27/172 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + A E+ IK+ G ++ A R++E ++R+ I ++ VP D + LL
Sbjct: 63 EEIANAPTEEVEKLIKKSGFYHVKASRVRE-VSRI------IHEDYNDTVPEDMAE--LL 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ CV + A PVDV+V RI+ R+G V P + L+K
Sbjct: 114 SLPGVGRKTANCVLVYGFHKDAIPVDVHVHRISNRIGLVN-TGTPDETEEKLMK------ 166
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ P+ +L L+ + FG+ CK P CP+ C ++
Sbjct: 167 -----IVPKKFWLPLNDLF------VQFGQTICKPIGPKHEICPIAEYCDYY 207
>gi|448488148|ref|ZP_21607078.1| HhH-GPD family protein [Halorubrum californiensis DSM 19288]
gi|445696410|gb|ELZ48499.1| HhH-GPD family protein [Halorubrum californiensis DSM 19288]
Length = 233
Score = 54.3 bits (129), Expect = 4e-04, Method: Composition-based stats.
Identities = 48/169 (28%), Positives = 72/169 (42%), Gaps = 20/169 (11%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVP 642
D+ D D+ AV AD E+ + I+ G + A RI+ L + E G+ L +L +P
Sbjct: 69 DRYD--DFAAVEAADHEELKETIRVAGLADQKAARIQRALTAIREETGGAYSLAFLDAMP 126
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
D+ K +L +I+G+G K+ V VD +V R++ R G VP H
Sbjct: 127 TDEAKGWLTEIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESASNERAHEV 186
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
L P D+ T Y LH +I G+ C R +C
Sbjct: 187 LDGRIP----------------DELT-YPLHVLLIRHGRERCSARGADC 218
>gi|448306248|ref|ZP_21496157.1| HhH-GPD family protein [Natronorubrum bangense JCM 10635]
gi|445598662|gb|ELY52717.1| HhH-GPD family protein [Natronorubrum bangense JCM 10635]
Length = 270
Score = 54.3 bits (129), Expect = 4e-04, Method: Composition-based stats.
Identities = 52/182 (28%), Positives = 82/182 (45%), Gaps = 25/182 (13%)
Query: 587 DSVDW-EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPP 643
D +D E++ AD+S +A+ I G N + I + ++E GS ++++ P
Sbjct: 86 DDIDLAESLASADQSMLAETISGAGLYNQKSETIIDTAAWVLEEFGSAAAFDRYVKDETP 145
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
V+E LL + G+G K+ +CV L S + FPVD +V RI R+G P P H
Sbjct: 146 SVVRETLLSVRGVGPKTADCVLLFSGGRSGVFPVDTHVHRIYRRMGIAP----PDADH-- 199
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRG 758
V + ++ + C H I FG+ +C R+P C ACP+
Sbjct: 200 ----EAVRETLEQDVSAAKCGFG-------HTASIQFGREYCTARTPACLEDPDACPMGD 248
Query: 759 EC 760
C
Sbjct: 249 IC 250
>gi|302871235|ref|YP_003839871.1| HhH-GPD family protein [Caldicellulosiruptor obsidiansis OB47]
gi|302574094|gb|ADL41885.1| HhH-GPD family protein [Caldicellulosiruptor obsidiansis OB47]
Length = 234
Score = 54.3 bits (129), Expect = 4e-04, Method: Composition-based stats.
Identities = 47/173 (27%), Positives = 76/173 (43%), Gaps = 17/173 (9%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + + ++A+ IK G N A R+KEF N L S DLE L + +++ LL
Sbjct: 75 EGILQTPDEKLAELIKPAGYYNQKAKRLKEFCNFLKREFNS-DLEKLFALDILSLRKILL 133
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+G+G ++ + + L + F VD R+ RLG + + + Y +
Sbjct: 134 SQKGIGFETADSIILYGAEKPIFVVDSYTKRLFYRLGLIESEKI----------SYSDLQ 183
Query: 712 KIQMY-LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
I M L P+ + + E H ++ K CK + P C C LR C +F
Sbjct: 184 AIIMAKLTPQTKFFN-----EFHALIVKHCKEICKSKKPICNKCCLRLICNYF 231
>gi|23098497|ref|NP_691963.1| DNA-lyase [Oceanobacillus iheyensis HTE831]
gi|22776723|dbj|BAC12998.1| DNA-(apurinic or apyrimidinic site) lyase [Oceanobacillus iheyensis
HTE831]
Length = 222
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 78/164 (47%), Gaps = 19/164 (11%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
+E+E+A+ I+ G I A RIK F+ + + D+ ++ +P DK++ LL I+G+G
Sbjct: 67 EENELAEKIRPSGFYRIKAARIKAFITWFRKY--NYDVSIVQQIPHDKLRSELLSIKGIG 124
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
++ + + + + + AF D RI R+G +++ Y + K+
Sbjct: 125 DETADVMLVYAFKKQAFIADQYANRIFNRIG------------LNVPSTYRSLQKVVERD 172
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P L Q E H ++ KI CK + P C CP++ C+
Sbjct: 173 LPNDSLLYQ----EYHALLVEHAKIHCKVK-PICNTCPVQTICE 211
>gi|390938170|ref|YP_006401908.1| HhH-GPD family protein [Desulfurococcus fermentans DSM 16532]
gi|390191277|gb|AFL66333.1| HhH-GPD family protein [Desulfurococcus fermentans DSM 16532]
Length = 234
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 57/124 (45%), Gaps = 18/124 (14%)
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
++ + L+++ G+G K+ + V L + FPVD ++ RI +R+G+
Sbjct: 125 EEARRLLMELPGVGWKTADVVLLRYFRKPVFPVDTHITRITMRMGFTGS----------- 173
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ Y + + M YLD LH +IT G+ C+ R P C C LR C++
Sbjct: 174 -RSYKHISRFWMDNTSPENYLD------LHLYLITHGRRTCRARKPLCNKCVLRDMCRYG 226
Query: 764 ASAV 767
AV
Sbjct: 227 VDAV 230
>gi|154249096|ref|YP_001409921.1| endonuclease III [Fervidobacterium nodosum Rt17-B1]
gi|154153032|gb|ABS60264.1| endonuclease III [Fervidobacterium nodosum Rt17-B1]
Length = 221
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 69/154 (44%), Gaps = 20/154 (12%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
AG +E R++ + I LE V P+K++E LL++ G+G K+ V +S A
Sbjct: 86 AGLYREKAERII-IVSKILLEKYDGVVPNKLEE-LLELPGVGRKTANIVLHVSFDQAALA 143
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQ 735
VD +V RI+ RLGWV + P L K +M LW ++
Sbjct: 144 VDTHVHRISNRLGWVKTKT-PEQTEEELKK---IMSP---QLWG-----------PINGS 185
Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 769
M+ FGK CK SP C C L C F + S
Sbjct: 186 MVEFGKNICKPISPRCEQCFLTECCDFFKNKYNS 219
>gi|156100939|ref|XP_001616163.1| endonuclease III homologue [Plasmodium vivax Sal-1]
gi|148805037|gb|EDL46436.1| endonuclease III homologue, putative [Plasmodium vivax]
Length = 417
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 58/118 (49%), Gaps = 21/118 (17%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQ-HIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
E LL + G+G K + +L H VD++V RI+ RL WV + E
Sbjct: 306 EGLLKLPGIGQKVAHLILQTALDTHEGIAVDIHVHRISNRLNWVCTK-----------NE 354
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLY-ELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K++ ++ PR TL+ EL+ ++ FG++ CK +SP+C C + CK++
Sbjct: 355 SATQSKLESFV-PR-------TLWSELNKTLVGFGQVVCKAKSPHCNMCAVTDGCKYY 404
>gi|451343149|ref|ZP_21912225.1| endonuclease III [Eggerthia catenaformis OT 569 = DSM 20559]
gi|449338145|gb|EMD17297.1| endonuclease III [Eggerthia catenaformis OT 569 = DSM 20559]
Length = 222
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 70/166 (42%), Gaps = 26/166 (15%)
Query: 596 CADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEG 655
CAD +++ IK G A I+ L+ + + VP V E L+D+ G
Sbjct: 74 CADINDLQRYIKRIGLYRNKAKNIQAMCQVLINDYQGV-------VPS--VMEELIDLPG 124
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+G K+ V + VD +V RIA RL + L+ G + L K+ P
Sbjct: 125 VGRKTANVVLADAFNIPGIAVDTHVTRIAKRLKFCYLKDNVGTIEKKLRKKIP------- 177
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
Q + H+QMI FG+ CK +P+C CPL C+
Sbjct: 178 ----------QDRWIKSHHQMIFFGRYHCKAINPHCYECPLIDICR 213
>gi|312792801|ref|YP_004025724.1| hhh-gpd family protein [Caldicellulosiruptor kristjanssonii 177R1B]
gi|312179941|gb|ADQ40111.1| HhH-GPD family protein [Caldicellulosiruptor kristjanssonii 177R1B]
Length = 232
Score = 53.9 bits (128), Expect = 5e-04, Method: Composition-based stats.
Identities = 45/173 (26%), Positives = 76/173 (43%), Gaps = 17/173 (9%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + + ++A+ +K G N R+KEF N L + S DLE L + +++ LL
Sbjct: 74 EGILQTSDEKLAELVKPAGYYNQKTKRLKEFCNFLKKEFNS-DLEKLFALDISSLRQVLL 132
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+G+G ++ + + L + F VD R+ RLG + + + Y +
Sbjct: 133 SQKGIGFETADSIILYGAEKPIFVVDSYTKRLFYRLGLIESEKI----------SYNDLQ 182
Query: 712 KIQMY-LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
I M L P+ + + E H ++ K CK + P C C LR C +F
Sbjct: 183 AIIMANLTPQTKFFN-----EFHALIVKHCKEICKSKKPICNKCCLRLICNYF 230
>gi|389748166|gb|EIM89344.1| DNA glycosylase [Stereum hirsutum FP-91666 SS1]
Length = 405
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 75/173 (43%), Gaps = 14/173 (8%)
Query: 576 STGEERSHDKMDSV-DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID 634
STG ++S D + + A ++ +AIK G AG I+ L + E G
Sbjct: 187 STGAKKSLDAAFCFHKFSRIANAPTEDVVEAIKSGGLARRKAGIIQGLLKSVKEKWGRYS 246
Query: 635 LEWLRNV--PPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL 692
L++L + + L+ G+G K+ CV L L +F VD +V R++ LGWVP
Sbjct: 247 LQFLMEGRWSDEAIMRELVSYPGVGPKTAACVLLFCLGRDSFAVDTHVFRLSRVLGWVPR 306
Query: 693 QPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK 745
+ I + QM+L ++ Y LH M+T G+ CK
Sbjct: 307 AKSATKVAIDRV-------NTQMHL---DVHIPPHLKYPLHVLMVTHGRA-CK 348
>gi|430750009|ref|YP_007212917.1| endonuclease III [Thermobacillus composti KWC4]
gi|430733974|gb|AGA57919.1| endonuclease III [Thermobacillus composti KWC4]
Length = 215
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 49/161 (30%), Positives = 73/161 (45%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A I++ L+E HG VP D E L+++ G+G K+
Sbjct: 71 ELEQDIRRIGLYRNKAANIQKLCRMLIEKHGG-------EVPRDY--ESLVELPGVGRKT 121
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V + A VD +V R++ RLG P D + + K+ M L PR
Sbjct: 122 ANVVISNAFGVPAIAVDTHVERVSKRLGIAK----PDDSVLEVEKKL-------MRLVPR 170
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
D+ T+ H+++I FG+ CK + PNC CPL CK
Sbjct: 171 ----DEWTIT--HHRLIFFGRYHCKAQRPNCPECPLLDLCK 205
>gi|90961668|ref|YP_535584.1| endonuclease III [Lactobacillus salivarius UCC118]
gi|417788262|ref|ZP_12435945.1| endonuclease III [Lactobacillus salivarius NIAS840]
gi|417810344|ref|ZP_12457023.1| endonuclease III [Lactobacillus salivarius GJ-24]
gi|90820862|gb|ABD99501.1| Endonuclease III [Lactobacillus salivarius UCC118]
gi|334308439|gb|EGL99425.1| endonuclease III [Lactobacillus salivarius NIAS840]
gi|335349140|gb|EGM50640.1| endonuclease III [Lactobacillus salivarius GJ-24]
Length = 213
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 70/165 (42%), Gaps = 26/165 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD E+A+ IK G A + E LVE S+ VP K + L+ + G+
Sbjct: 68 ADPKEVAEYIKTIGLYKNKAKYLVECSKMLVENFNSV-------VP--KTHKELMSLSGV 118
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + V AF VD +V RI+ RL VP + L+ + P D I+
Sbjct: 119 GRKTADVVLAERFGVPAFAVDTHVHRISKRLAIVPEDATVRETEKILMSKVPKEDWIKS- 177
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H++MI +G+ C R+P C CPL C+
Sbjct: 178 ----------------HHRMIFWGRYQCMARAPKCETCPLLEICQ 206
>gi|221058032|ref|XP_002261524.1| endonuclease iii homologue [Plasmodium knowlesi strain H]
gi|194247529|emb|CAQ40929.1| endonuclease iii homologue, putative [Plasmodium knowlesi strain H]
Length = 396
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 57/117 (48%), Gaps = 19/117 (16%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQ-HIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
E LL++ G+G K + +L H VD++V RI+ RL WV + E
Sbjct: 285 EGLLELPGIGQKVAHLILQTALDTHEGIAVDIHVHRISNRLNWVCTK-----------NE 333
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K++ Y+ PR + EL+ ++ FG++ CK +SP+C C + CK++
Sbjct: 334 SITQSKLESYV-PRALW------SELNKTLVGFGQVVCKAKSPHCTMCAVTNCCKYY 383
>gi|169827624|ref|YP_001697782.1| endonuclease III [Lysinibacillus sphaericus C3-41]
gi|168992112|gb|ACA39652.1| Probable endonuclease III [Lysinibacillus sphaericus C3-41]
Length = 220
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 58/115 (50%), Gaps = 17/115 (14%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E L+ + G+G K+ V ++ A VD +V R++ RLG + ++ ++K+
Sbjct: 109 REALVTLPGVGRKTANVVLSVAFDIPALAVDTHVERVSKRLGLCRWKDSVLEVEETIMKK 168
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P MDK W + H+Q+I FG+ CK ++P C ACPL +C+
Sbjct: 169 TP-MDK-----WSKT-----------HHQLIFFGRYHCKAQNPGCHACPLLSDCR 206
>gi|253702180|ref|YP_003023369.1| HhH-GPD family protein [Geobacter sp. M21]
gi|251777030|gb|ACT19611.1| HhH-GPD family protein [Geobacter sp. M21]
Length = 228
Score = 53.9 bits (128), Expect = 5e-04, Method: Composition-based stats.
Identities = 45/173 (26%), Positives = 80/173 (46%), Gaps = 26/173 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL----EWLRNVPPDKVK 647
EA+R DE +A++I+ G N+ + R+K F+ L+E +GS+D +W+R ++
Sbjct: 70 EALREIDEGRLAESIRPSGFFNVKSARLKGFVEWLLERYGSLDAMFEGDWVR------LR 123
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G ++ + + L + +F VD R+ RLG V + + +
Sbjct: 124 EELSAVRGIGPETCDSILLYAGGKPSFVVDAYTRRLFSRLGLVREEDDYHQVRALFMDHL 183
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P K+ ++ E H ++ K C+K+ P C CPL C
Sbjct: 184 PA--KVPLF-------------NEYHALIVEQCKRHCRKK-PLCEGCPLSRFC 220
>gi|374636071|ref|ZP_09707655.1| HhH-GPD family protein [Methanotorris formicicus Mc-S-70]
gi|373560329|gb|EHP86596.1| HhH-GPD family protein [Methanotorris formicicus Mc-S-70]
Length = 218
Score = 53.9 bits (128), Expect = 6e-04, Method: Composition-based stats.
Identities = 44/176 (25%), Positives = 81/176 (46%), Gaps = 15/176 (8%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
K + +D + D ++ IK G NI A R+K +VE + + + ++ +
Sbjct: 52 KENLIDEIKILNTDIEKLKKLIKPAGFYNIKAERLKNITKFIVENYKNTENLAKLSIKLE 111
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+++ +L+++G+G ++ + + L +L F +D RI R L+ + G
Sbjct: 112 DLRKEILNVKGIGKETADSILLYALDREIFVIDAYTRRIFSR-----LRIIEGG------ 160
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+EY + I P+ D K E H ++ GK +CKK++P C CPL C
Sbjct: 161 EEYDEIRGIFEKNLPK----DLKIYKEYHALIVELGKHYCKKKNPVCEKCPLHNLC 212
>gi|383785899|ref|YP_005470468.1| DNA-(apurinic or apyrimidinic site) lyase [Fervidobacterium
pennivorans DSM 9078]
gi|383108746|gb|AFG34349.1| endonuclease III [Fervidobacterium pennivorans DSM 9078]
Length = 220
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 64/148 (43%), Gaps = 20/148 (13%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
+G KE R++ + I LE PD ++E LL + G+G K+ V +S A
Sbjct: 89 SGLYKEKAERIIAV-SKIILEKYGGRVPDNLEE-LLSLPGVGRKTANIVLHVSFGQQALA 146
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQ 735
VD +V RI+ RLGWV + P L K LD ++
Sbjct: 147 VDTHVHRISNRLGWVNTKT-PEQTEEELKK-----------------ILDPNLWGPVNGS 188
Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHF 763
M+ FGK C+ SP C C L CK+F
Sbjct: 189 MVEFGKQICRPISPKCEECFLTACCKYF 216
>gi|300088667|ref|YP_003759189.1| HhH-GPD family protein [Dehalogenimonas lykanthroporepellens
BL-DC-9]
gi|299528400|gb|ADJ26868.1| HhH-GPD family protein [Dehalogenimonas lykanthroporepellens
BL-DC-9]
Length = 224
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 73/167 (43%), Gaps = 26/167 (15%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+A I+ G N A ++K + L D+E L++ P + + LL + G+G ++
Sbjct: 76 ELAPVIRSSGYYNAKASKLKALADWLAGYDD--DIESLKDRDPAEFRRELLAVHGVGPET 133
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
+ + L +L F +D R+ RLG VP P D + W R
Sbjct: 134 ADSILLYALDVPVFVIDAYTRRLFSRLGIVP----PRDTYDE---------------WQR 174
Query: 721 L--CYLDQKT--LYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
L L+Q+ E H ++ K C+ R P+C C L GEC++
Sbjct: 175 LFETNLEQQAGLFNEYHALIVRHAKEVCRSR-PDCAECCLAGECRYL 220
>gi|354603229|ref|ZP_09021228.1| hypothetical protein HMPREF9450_00143 [Alistipes indistinctus YIT
12060]
gi|353349106|gb|EHB93372.1| hypothetical protein HMPREF9450_00143 [Alistipes indistinctus YIT
12060]
Length = 242
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 32/182 (17%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSI---DLEWLRNVPPDKVKE 648
+A+ AD EI I+ N A + L E G D+E L+ +P
Sbjct: 65 QAMADADPEEIYPYIRSISYPNNKAKNLSGMARMLCEEFGGAVPEDIEQLQRLP------ 118
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
G+G K+ V +++ A PVD +V R+A RLG P + L + +P
Sbjct: 119 ------GVGRKTANVVGIVAFGKRAMPVDTHVFRVADRLGLSTGAKTPLQTEMQLTEGFP 172
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
+ L H+ +I G+ C R P+C AC L C+++A+
Sbjct: 173 -----------------PEVLPLAHHWLILHGRYVCTARKPHCEACGLTPWCRYYATQQK 215
Query: 769 SA 770
+A
Sbjct: 216 TA 217
>gi|406893033|gb|EKD38207.1| hypothetical protein ACD_75C00822G0003 [uncultured bacterium]
Length = 223
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/168 (25%), Positives = 72/168 (42%), Gaps = 16/168 (9%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
+E++ EIA IK G N+ A R++ L+ ++ H L+ + +E L
Sbjct: 69 YESLSQCGVDEIAQYIKPSGYYNLKARRLRNLLD-MIGNHYDGQLDRFLDDDAQAAREML 127
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L ++G+G ++ + + L + H F VD+ R+ R V + +Y M
Sbjct: 128 LGVKGIGPETADSILLYACGHPIFVVDMYTHRVFSRHNMVDEE-----------TDYQTM 176
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRG 758
+ M P D + E H ++ +CKK P C CPL+G
Sbjct: 177 QAVFMNQLP----CDAELFNEFHALVVRVAVTYCKKTIPLCDRCPLQG 220
>gi|302419493|ref|XP_003007577.1| HhH-GPD family base excision DNA repair protein [Verticillium
albo-atrum VaMs.102]
gi|261353228|gb|EEY15656.1| HhH-GPD family base excision DNA repair protein [Verticillium
albo-atrum VaMs.102]
Length = 497
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/208 (23%), Positives = 77/208 (37%), Gaps = 68/208 (32%)
Query: 588 SVDWEAVRCADESEIADAIKERG------------------------------------- 610
S+ W+ VR ++ESE+ DA+K G
Sbjct: 249 SIAWDKVRLSEESEVVDAVKRAGLGPTKGRDIKSILDMVHRDNQIRLAAYVKEKETGEMA 308
Query: 611 ----QQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
QN+ G+ +N++ G + L+ +R + D+ G+G+K+ C+ L
Sbjct: 309 DILGAQNLTQGQKDHQVNKMES--GILTLDHVRGLTSDEAMLEFTKYPGIGVKTSSCLIL 366
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
LQ +F VD +V R L WVP + D ++H E + D+++
Sbjct: 367 FCLQQPSFAVDTHVWRFCKWLKWVPPKASRDDTYMH--GEVRIPDRLK------------ 412
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGAC 754
Y LH I GK CG C
Sbjct: 413 ---YGLHQLFIRHGK--------ECGRC 429
>gi|188582085|ref|YP_001925530.1| HhH-GPD family protein [Methylobacterium populi BJ001]
gi|179345583|gb|ACB80995.1| HhH-GPD family protein [Methylobacterium populi BJ001]
Length = 259
Score = 53.5 bits (127), Expect = 6e-04, Method: Composition-based stats.
Identities = 49/172 (28%), Positives = 76/172 (44%), Gaps = 14/172 (8%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW + A +I I+ + A RIK L + E G++ L++L ++ D+ + +
Sbjct: 86 DWAEIEQASVEDIEATIRSVTWPELKAPRIKAVLAAVRERVGALSLDFLGDMSVDEARGW 145
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L I G+G K+ V S L+ A PVD + R+A R G + + G H L + P
Sbjct: 146 LEAIPGIGPKTSAAVLSFSTLRMPALPVDSHHHRVAQRTGLIGSKVDVGPSHAVLRAQLP 205
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D W + LY+ H ++ G+ C SP CG C L C
Sbjct: 206 --DD-----W------SAQKLYDNHEVLMLHGQRVCFHHSPACGRCVLLDIC 244
>gi|452851214|ref|YP_007492898.1| conserved protein of unknown function [Desulfovibrio piezophilus]
gi|451894868|emb|CCH47747.1| conserved protein of unknown function [Desulfovibrio piezophilus]
Length = 217
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 79/174 (45%), Gaps = 24/174 (13%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
+KM S+ +E E+++ I G NI RIK FL + + S D+E L +
Sbjct: 63 EKMYSLSFE--------ELSNLIHPAGYYNIKTRRIKNFL-EFLNIEASFDIENLASQEI 113
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
++ +L + G+G ++ + + L +LQ F VD RI R DL +H
Sbjct: 114 QDIRPKILGVNGIGPETADAILLYALQLPVFVVDAYTQRILHR----------HDL-VHD 162
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
+Y + + M P+ D E H ++ GK +CKK+ C +CPL+
Sbjct: 163 GIDYHELQALFMDSLPQ----DVSVYNEFHALIVRTGKQWCKKKVGVCDSCPLQ 212
>gi|298530217|ref|ZP_07017619.1| HhH-GPD family protein [Desulfonatronospira thiodismutans ASO3-1]
gi|298509591|gb|EFI33495.1| HhH-GPD family protein [Desulfonatronospira thiodismutans ASO3-1]
Length = 215
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 78/165 (47%), Gaps = 16/165 (9%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + D+ +A+ I+ G I A R+K L + L + L L + ++++ LL
Sbjct: 63 EKMASLDDELLAELIRPSGYFRIKARRLKNLL-EFLRLECAYHLPDLSSQDLQQLRDKLL 121
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
++G+G ++ + + L +L+ +F VD RI R V D+ H L+++ MD
Sbjct: 122 QVKGIGPETADSILLYALEKPSFVVDAYTSRILNRHLLVH-----EDIDYHELRDF-FMD 175
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
++ PR D E H ++ GK +C K +P C CPL
Sbjct: 176 RL-----PR----DVALYNEYHALLVRTGKKWCNKNNPRCDGCPL 211
>gi|336436197|ref|ZP_08615910.1| endonuclease III [Lachnospiraceae bacterium 1_4_56FAA]
gi|336008237|gb|EGN38256.1| endonuclease III [Lachnospiraceae bacterium 1_4_56FAA]
Length = 211
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 53/120 (44%), Gaps = 18/120 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G K+ +R + VD +V RI+ RLG Q P + L+KE
Sbjct: 109 EDLTSLAGVGRKTANVIRGNIYHDASVVVDTHVKRISNRLGLTKNQD-PEKIEQDLMKEL 167
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
P D LY + Q+ITFG+ C RSP C C LR CK + ++
Sbjct: 168 PK---------------DHWILYNI--QIITFGRSICTARSPKCEECFLRAYCKTISKSI 210
>gi|374300499|ref|YP_005052138.1| HhH-GPD family protein [Desulfovibrio africanus str. Walvis Bay]
gi|332553435|gb|EGJ50479.1| HhH-GPD family protein [Desulfovibrio africanus str. Walvis Bay]
Length = 227
Score = 53.5 bits (127), Expect = 7e-04, Method: Composition-based stats.
Identities = 50/186 (26%), Positives = 78/186 (41%), Gaps = 28/186 (15%)
Query: 581 RSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVEL-----HGSIDL 635
R+HD + A+ E+ +A+AI+ G I AGR+ L L HG DL
Sbjct: 55 RAHDLLTPA---AMLDISEAALAEAIRPAGYYRIKAGRLANLLRFLRAEAEEFGHGEADL 111
Query: 636 E-----WLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV 690
L+ +++E LL + G+G ++ + + L +L F VD RIA+R G
Sbjct: 112 HDPALPMLQGRNARELRERLLTVRGIGPETADSILLYALGLPIFVVDAYTARIALRHGLA 171
Query: 691 PLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPN 750
D H L+E ++ D E H ++ G +C+K+ P
Sbjct: 172 -----FEDAGYHELQE----------IFTDALPEDAALFNEYHALLVRVGHEWCRKKEPR 216
Query: 751 CGACPL 756
C CPL
Sbjct: 217 CRDCPL 222
>gi|425736476|ref|ZP_18854780.1| DNA-(apurinic or apyrimidinic site) lyase [Brevibacterium casei
S18]
gi|425478146|gb|EKU45347.1| DNA-(apurinic or apyrimidinic site) lyase [Brevibacterium casei
S18]
Length = 245
Score = 53.5 bits (127), Expect = 7e-04, Method: Composition-based stats.
Identities = 47/176 (26%), Positives = 72/176 (40%), Gaps = 29/176 (16%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D A+ A+ E+ + I+ G A I + N LV+ ++ VP + E
Sbjct: 77 DAHALAVANLGEVEELIRSTGFYRAKARNIVKLANDLVD-------DYDGEVP--RTMEE 127
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHIHLLKEYP 708
L+ + G+G K+ V + VD ++GR+A R GW P+ + I L +P
Sbjct: 128 LVKLAGVGRKTANVVLGNAFDTPGITVDTHMGRLARRFGWTTETDPVKAEEDIAAL--FP 185
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
K L L +++I G+ C R P CGACPL C F
Sbjct: 186 T-----------------KELTLLSHRVIFHGRRICHSRKPACGACPLMALCPSFG 224
>gi|302921021|ref|XP_003053199.1| hypothetical protein NECHADRAFT_77955 [Nectria haematococca mpVI
77-13-4]
gi|256734139|gb|EEU47486.1| hypothetical protein NECHADRAFT_77955 [Nectria haematococca mpVI
77-13-4]
Length = 485
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 75/218 (34%), Gaps = 64/218 (29%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
SVDW VR A + I +AIK G + IK L + E
Sbjct: 261 SVDWNKVRTAPLTTIVEAIKTGGLAQVKGKDIKAILELVHEENVKRRDAFIQERKSGKMS 320
Query: 629 --------LHGSIDLEWLR------------NVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
G DLE L+ + PD+ + L G+G+K+ CV L
Sbjct: 321 GAFKADGKTQGQKDLEILKTEQDILSLDHIHGMHPDEAMQTLTKFPGIGVKTASCVILFC 380
Query: 669 LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT 728
LQ +F VD +V R+ L W+P + HL P K
Sbjct: 381 LQQPSFAVDTHVHRLTGWLKWMPPKATRDQTFSHLEVRIPNHLK---------------- 424
Query: 729 LYELHYQMITFGK--IFCKKRSP------NCGACPLRG 758
Y LH + G+ I C+ + N CPL G
Sbjct: 425 -YGLHKLFVQHGRNCIRCRANTSEGSEEWNNEECPLEG 461
>gi|218281634|ref|ZP_03488042.1| hypothetical protein EUBIFOR_00609 [Eubacterium biforme DSM 3989]
gi|218217269|gb|EEC90807.1| hypothetical protein EUBIFOR_00609 [Eubacterium biforme DSM 3989]
Length = 208
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 71/166 (42%), Gaps = 28/166 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYLLDIEG 655
A SE+ IK G A I LVE HG + P K+ L+ + G
Sbjct: 64 ATVSELEPYIKRIGLYRNKARSISNLSKDLVERYHGQV---------PYTYKD-LMSLAG 113
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+G K+ VR ++ +F VD +V R++ RLG L K ++K++
Sbjct: 114 VGRKTANVVRSVAFDIPSFAVDTHVNRVSKRLG--------------LAKYNDSVEKVEE 159
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L ++ D+ + H+ I FG+ C R+P C CP + CK
Sbjct: 160 KLKRKI---DRSRWNQGHHDFIFFGRYLCHSRNPECERCPFKSFCK 202
>gi|342873258|gb|EGU75465.1| hypothetical protein FOXB_14013 [Fusarium oxysporum Fo5176]
Length = 552
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 55/143 (38%), Gaps = 39/143 (27%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
SVDW VR A I ++IK G + IK L + E
Sbjct: 326 SVDWNKVRVAPLPTIVESIKTGGLAQVKGKDIKAILELVHEENTKRREAFMQEKKGGNLS 385
Query: 629 --------LHGSIDLEWLR------------NVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
G DLE L+ + PD+ + L G+G+K+ CV L
Sbjct: 386 GITGADNKTQGQKDLEILKTEQDILSLDHIHGMAPDEAMQTLTKFPGIGVKTASCVILFC 445
Query: 669 LQHIAFPVDVNVGRIAVRLGWVP 691
LQ +F VD +V RI+ L W+P
Sbjct: 446 LQQPSFAVDTHVHRISGWLKWIP 468
>gi|309775704|ref|ZP_07670702.1| endonuclease III [Erysipelotrichaceae bacterium 3_1_53]
gi|308916543|gb|EFP62285.1| endonuclease III [Erysipelotrichaceae bacterium 3_1_53]
Length = 215
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 58/127 (45%), Gaps = 18/127 (14%)
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
L+ V P+ +K+ L + G+G K+ VR + + VD +V RI+ RLG +Q
Sbjct: 94 LDSFDGVVPESMKD-LTSLAGVGRKTANVVRSVCFDIPSIAVDTHVERISKRLGLAKVQD 152
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
+ ++ E + K++ W R H+ I FG+ FC R+P C C
Sbjct: 153 -----SVEVV-EQKLKRKLKRERWNRA-----------HHLFIFFGRYFCTARNPKCEEC 195
Query: 755 PLRGECK 761
P R CK
Sbjct: 196 PFREFCK 202
>gi|376295286|ref|YP_005166516.1| HhH-GPD family protein [Desulfovibrio desulfuricans ND132]
gi|323457847|gb|EGB13712.1| HhH-GPD family protein [Desulfovibrio desulfuricans ND132]
Length = 217
Score = 53.5 bits (127), Expect = 7e-04, Method: Composition-based stats.
Identities = 44/158 (27%), Positives = 70/158 (44%), Gaps = 20/158 (12%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+A+ I+ G NI A RI FL L + DL L++ +++ +L I G+G ++
Sbjct: 73 LAELIRPAGYYNIKAKRIHNFLQFLKD-EAEFDLLALKDRELAELRPKVLSINGIGPETG 131
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVR--LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWP 719
+C+ L +L F VD RI R L W + +Y + I M P
Sbjct: 132 DCILLYALDFPTFVVDAYTARILGRHGLAWEDI-------------DYHGLQSIFMDALP 178
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
D E H ++ G +C+K++ C ACPL+
Sbjct: 179 E----DVALYNEYHALIVRVGANWCRKKAGLCDACPLQ 212
>gi|210623502|ref|ZP_03293847.1| hypothetical protein CLOHIR_01797 [Clostridium hiranonis DSM 13275]
gi|210153560|gb|EEA84566.1| hypothetical protein CLOHIR_01797 [Clostridium hiranonis DSM 13275]
Length = 213
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 68/169 (40%), Gaps = 27/169 (15%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E EI + IK G A +IK + I + PD ++E L+ + G+G
Sbjct: 71 EEEIGEMIKTCGLYKSKAKKIK--------MTSEILYNDYNSEVPDSLEE-LIKLPGVGR 121
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V + H A PVD +V RI R+G V P L+K P
Sbjct: 122 KTAGVVLSNAFGHPAIPVDTHVFRIVNRIGIVETST-PEKTEFELMKVLP---------- 170
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
++ + H+ I G+ CK R P C CP++ C +F V
Sbjct: 171 -------KERWSKAHHLFIFLGRRMCKARKPECTDCPIKKHCNYFLEIV 212
>gi|448733380|ref|ZP_21715625.1| HhH-GPD family protein [Halococcus salifodinae DSM 8989]
gi|445803114|gb|EMA53414.1| HhH-GPD family protein [Halococcus salifodinae DSM 8989]
Length = 279
Score = 53.1 bits (126), Expect = 8e-04, Method: Composition-based stats.
Identities = 46/171 (26%), Positives = 75/171 (43%), Gaps = 24/171 (14%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIE 654
AD+ +A+ I+ G N + + R+V+ +G + ++ P+ V++ LLD
Sbjct: 105 ADQPTLAETIESAGLYNQKSATMIAIAERIVDEYGGAEEFNGFVAEGDPETVRDALLDFS 164
Query: 655 GLGLKSVECVRLLSLQHIA-FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKI 713
G+G K+ +CV L S FPVD +V RI RLG P + ++ + + P
Sbjct: 165 GVGPKTADCVLLFSGGRAGVFPVDTHVHRIYRRLGIAPPEADHEEVRAVVEDQVPA---- 220
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
+K + H I FG+ +C R P C ACP+ C
Sbjct: 221 ------------EKCGFG-HTASIQFGREYCTARKPACLDDPDACPMADLC 258
>gi|240103334|ref|YP_002959643.1| Endonuclease III (nth) [Thermococcus gammatolerans EJ3]
gi|239910888|gb|ACS33779.1| Endonuclease III (nth) [Thermococcus gammatolerans EJ3]
Length = 230
Score = 53.1 bits (126), Expect = 8e-04, Method: Composition-based stats.
Identities = 34/120 (28%), Positives = 56/120 (46%), Gaps = 18/120 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD +KE L+ + G+G K V A PVD +V RI+ RLG P + P + +
Sbjct: 118 PDDIKE-LMKLPGIGRKCANIVLAYGFGRQAIPVDTHVNRISKRLGLAPPRVPPEKVEEY 176
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L++ P +K +Y +++ M+ G+ C+ P C +CPL+ C +
Sbjct: 177 LMELIP----------------KEKWIY-VNHAMVDHGRSICRPIRPKCESCPLKELCPY 219
>gi|156055880|ref|XP_001593864.1| hypothetical protein SS1G_05292 [Sclerotinia sclerotiorum 1980]
gi|154703076|gb|EDO02815.1| hypothetical protein SS1G_05292 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 551
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 58/143 (40%), Gaps = 39/143 (27%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNR-------LVEL----------- 629
SV WEAV A E ++ AIK G N+ IK+ L + L+E+
Sbjct: 273 SVSWEAVFAAPEEDVVKAIKSGGLANVKGSNIKKILKKVYDQNTELLEILLKEVDTDVSV 332
Query: 630 ---------------------HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
+ ++++ + + L+D+ G+G+K+ CV L
Sbjct: 333 PFIGKVLETKEQKEAEIKSLGENMLSIDYIHALDKPAAMDVLMDLPGIGVKTAACVALFC 392
Query: 669 LQHIAFPVDVNVGRIAVRLGWVP 691
L +F VD +V R LGWVP
Sbjct: 393 LGRPSFAVDTHVWRHCKWLGWVP 415
>gi|46128543|ref|XP_388825.1| hypothetical protein FG08649.1 [Gibberella zeae PH-1]
Length = 486
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 77/216 (35%), Gaps = 64/216 (29%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
SVDW VR A I ++IK G + IK L + E
Sbjct: 261 SVDWNKVRTAPLPTIVESIKTGGLSQVKGKDIKAILELVYEENTKRREAFLEEKSGGKAT 320
Query: 629 --------LHGSIDLEWLR------------NVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
G DLE L+ + PD+ + L G+G+K+ CV L
Sbjct: 321 GITGAEGKTQGQKDLEILKTDQEILSLDHIHGMVPDEAMQTLTKFPGIGVKTASCVILFC 380
Query: 669 LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT 728
LQ +F VD +V RI+ L W+P + HL E + D ++
Sbjct: 381 LQQPSFAVDTHVHRISGWLKWMPRKATRDQTFSHL--EVRIPDHLK-------------- 424
Query: 729 LYELHYQMITFGK--IFCKKRSP------NCGACPL 756
Y LH + G+ I C+ + N CPL
Sbjct: 425 -YGLHKLFVQHGRSCIRCRANTSEGSEEWNKSECPL 459
>gi|347828944|emb|CCD44641.1| hypothetical protein [Botryotinia fuckeliana]
Length = 548
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 59/143 (41%), Gaps = 39/143 (27%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKE-------------------------- 621
SV WEAV + + ++ +AI+ G + AG IK
Sbjct: 273 SVSWEAVLASSKEDVEEAIRSGGLAKVKAGYIKAILQVVFDKNTDLLETLLKGVETDVPV 332
Query: 622 -FLNRLVEL------------HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
F+ +++E + L+++ + L+D+ G+G+K+ CV L
Sbjct: 333 PFVGKVLETKEQKEAEIKSLRENMLSLDYVHTLDKPAAMRVLMDLPGIGVKTAACVALFC 392
Query: 669 LQHIAFPVDVNVGRIAVRLGWVP 691
L +F VD +V R + LGWVP
Sbjct: 393 LGRPSFAVDTHVWRHCMWLGWVP 415
>gi|156041124|ref|XP_001587548.1| hypothetical protein SS1G_11541 [Sclerotinia sclerotiorum 1980]
gi|154695924|gb|EDN95662.1| hypothetical protein SS1G_11541 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 347
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 56/114 (49%), Gaps = 16/114 (14%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH----------------GS 632
VDWEAVR A ++ +AIK G N+ + K L+ + E +
Sbjct: 213 VDWEAVRLAPLEKLYEAIKTEGMGNVKSRSCKAILDMVHEENVIRRQRGKVATNSRGIDL 272
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
+ LE +R++ D+ + + G+G K+ C+ + +QH AF VD +V +I+ R
Sbjct: 273 LSLEHMRSLSKDEAFDKFITFPGVGPKTAACIISICMQHNAFAVDTHVYQISDR 326
>gi|257486632|ref|ZP_05640673.1| endonuclease III [Pseudomonas syringae pv. tabaci str. ATCC 11528]
gi|422680768|ref|ZP_16739039.1| endonuclease III [Pseudomonas syringae pv. tabaci str. ATCC 11528]
gi|331010113|gb|EGH90169.1| endonuclease III [Pseudomonas syringae pv. tabaci str. ATCC 11528]
Length = 212
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 69/161 (42%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH S VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNS-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + +A VD ++ R++ R G P K ++K M P+
Sbjct: 123 NVVLNTAFRQVAMAVDTHIFRVSNRTGIAPG------------KNVVEVEKQLMKFVPKN 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C+ R P CG+C + C +
Sbjct: 171 YLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLCDY 205
>gi|154302382|ref|XP_001551601.1| hypothetical protein BC1G_09975 [Botryotinia fuckeliana B05.10]
Length = 548
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 59/143 (41%), Gaps = 39/143 (27%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD--- 644
SV WEAV + + ++ +AI+ G + AG IK L + + + + L+ V D
Sbjct: 273 SVSWEAVLASSKEDVEEAIRSGGLAKVKAGYIKAILQVVFDKNTDLLEALLKGVETDVPV 332
Query: 645 ----KVKE--------------------------------YLLDIEGLGLKSVECVRLLS 668
KV E L+D+ G+G+K+ CV L
Sbjct: 333 PFVGKVLETKEQKEAEIKSLRENMLSLDYVHTLDKPAAMRVLMDLPGIGVKTAACVALFC 392
Query: 669 LQHIAFPVDVNVGRIAVRLGWVP 691
L +F VD +V R + LGWVP
Sbjct: 393 LGRPSFAVDTHVWRHCMWLGWVP 415
>gi|71735109|ref|YP_273633.1| endonuclease III [Pseudomonas syringae pv. phaseolicola 1448A]
gi|289626585|ref|ZP_06459539.1| endonuclease III [Pseudomonas syringae pv. aesculi str. NCPPB 3681]
gi|289647569|ref|ZP_06478912.1| endonuclease III [Pseudomonas syringae pv. aesculi str. 2250]
gi|416028815|ref|ZP_11571704.1| endonuclease III [Pseudomonas syringae pv. glycinea str. race 4]
gi|422404345|ref|ZP_16481398.1| endonuclease III [Pseudomonas syringae pv. glycinea str. race 4]
gi|422582407|ref|ZP_16657543.1| endonuclease III [Pseudomonas syringae pv. aesculi str. 0893_23]
gi|422594538|ref|ZP_16668828.1| endonuclease III [Pseudomonas syringae pv. lachrymans str. M301315]
gi|71555662|gb|AAZ34873.1| endonuclease III [Pseudomonas syringae pv. phaseolicola 1448A]
gi|298156815|gb|EFH97906.1| Endonuclease III [Pseudomonas savastanoi pv. savastanoi NCPPB 3335]
gi|320327082|gb|EFW83096.1| endonuclease III [Pseudomonas syringae pv. glycinea str. race 4]
gi|330867250|gb|EGH01959.1| endonuclease III [Pseudomonas syringae pv. aesculi str. 0893_23]
gi|330876898|gb|EGH11047.1| endonuclease III [Pseudomonas syringae pv. glycinea str. race 4]
gi|330984845|gb|EGH82948.1| endonuclease III [Pseudomonas syringae pv. lachrymans str. M301315]
Length = 212
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 69/161 (42%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH S VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNS-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + +A VD ++ R++ R G P K ++K M P+
Sbjct: 123 NVVLNTAFRQVAMAVDTHIFRVSNRTGIAPG------------KNVVEVEKQLMKFVPKN 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C+ R P CG+C + C +
Sbjct: 171 YLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLCDY 205
>gi|448369885|ref|ZP_21556338.1| HhH-GPD family protein [Natrialba aegyptia DSM 13077]
gi|445650325|gb|ELZ03249.1| HhH-GPD family protein [Natrialba aegyptia DSM 13077]
Length = 276
Score = 53.1 bits (126), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 55/138 (39%), Gaps = 22/138 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
P +V+E LLD+ G+G K+ +CV L + + FPVD +V RI RLG P D+
Sbjct: 151 PGEVRETLLDVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRLGIAPPDADHEDVRE 210
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLR 757
+ E P C H I FG+ C R P C ACP+
Sbjct: 211 VVEDEVPTAK----------CGFG-------HTATIQFGRELCTARKPACLDDPEACPMA 253
Query: 758 GECKHFASAVASARFALP 775
C A+ P
Sbjct: 254 DICDQVGVYPATGEVIDP 271
>gi|416014967|ref|ZP_11562684.1| endonuclease III [Pseudomonas syringae pv. glycinea str. B076]
gi|320325635|gb|EFW81697.1| endonuclease III [Pseudomonas syringae pv. glycinea str. B076]
Length = 212
Score = 53.1 bits (126), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 71/161 (44%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH S VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNS-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + +A VD ++ R++ R G P + + ++ L+K P
Sbjct: 123 NVVLNTAFRQVAMAVDTHIFRVSNRTGIAPGKNVV-EVEKQLMKFVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ L + H+ +I G+ C+ R P CG+C + C +
Sbjct: 169 ----ENYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCDY 205
>gi|147920603|ref|YP_685600.1| endonuclease III [Methanocella arvoryzae MRE50]
gi|110620996|emb|CAJ36274.1| predicted endonuclease III [Methanocella arvoryzae MRE50]
Length = 307
Score = 53.1 bits (126), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 77/175 (44%), Gaps = 22/175 (12%)
Query: 588 SVDWEAVRCADESEIADAIKERG--QQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK 645
+D + AD+ E+ + I+ G + + I +E ++R GS LEW+R P +
Sbjct: 137 GIDVYKMASADKKELEELIRTSGPYKADFIIRCSQEIIDRW---GGS--LEWMRTAPTQE 191
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
+E L+ + G+G K+ +CV L +L H VD ++ R++ R G L GD K
Sbjct: 192 AREALMSLYGVGPKTADCVLLFALGHSVVAVDTHICRVSERTG---LSLATGDSEA--AK 246
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D + + P + H +I G+ FCK P CP+ C
Sbjct: 247 RRVKEDLERKHRIPGMA----------HLLIINLGRDFCKAVLPLHHECPVEDIC 291
>gi|284165866|ref|YP_003404145.1| HhH-GPD family protein [Haloterrigena turkmenica DSM 5511]
gi|284015521|gb|ADB61472.1| HhH-GPD family protein [Haloterrigena turkmenica DSM 5511]
Length = 269
Score = 53.1 bits (126), Expect = 9e-04, Method: Composition-based stats.
Identities = 49/190 (25%), Positives = 79/190 (41%), Gaps = 26/190 (13%)
Query: 597 ADESEIADAIKERG---QQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDI 653
A++S +A+ I G Q++ + R E++ + D ++++ P V+E LL I
Sbjct: 96 AEQSRLAETISSAGLYNQKSEVLIRTAEWVLEEFGSAAAFD-AFVKDEDPAAVRETLLSI 154
Query: 654 EGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
G+G K+ +CV L + + FPVD +V RI R+G P + L +E P
Sbjct: 155 RGIGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAPAAADHEGVRAVLEREVPAAK- 213
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFASAVA 768
C H I FG+ +C R P C ACP+ C+ A
Sbjct: 214 ---------CGFG-------HTATIQFGREYCTARKPACLEDPDACPMADVCEQVGVYPA 257
Query: 769 SARFALPGPS 778
+ P +
Sbjct: 258 TGEVVDPAAT 267
>gi|433462947|ref|ZP_20420515.1| endonuclease III [Halobacillus sp. BAB-2008]
gi|432188084|gb|ELK45302.1| endonuclease III [Halobacillus sp. BAB-2008]
Length = 224
Score = 53.1 bits (126), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ D IK G A IK+ L+ L++ VPP ++ L + G+G K+
Sbjct: 72 ELQDDIKRIGLFRNKAKNIKKLSETLI-------LQFDGQVPP--TQKELESLAGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ E +M KI W
Sbjct: 123 ANVVASVAFGEPAIAVDTHVERVSKRLGICRWKDSVLEV------EKTLMRKIPREEWS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ H++MI FG+ CK + PNC ACPL C+
Sbjct: 176 ----------DTHHRMIFFGRYHCKAQRPNCEACPLLDLCR 206
>gi|408388582|gb|EKJ68263.1| hypothetical protein FPSE_11566 [Fusarium pseudograminearum CS3096]
Length = 486
Score = 53.1 bits (126), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 77/216 (35%), Gaps = 64/216 (29%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
SVDW VR A I ++IK G + IK L + E
Sbjct: 261 SVDWNKVRTAPLPTIVESIKTGGLSQVKGKDIKTILELVYEENTKRREAFLEEKSGGKAT 320
Query: 629 --------LHGSIDLEWLR------------NVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
G DLE L+ + PD+ + L G+G+K+ CV L
Sbjct: 321 GLTGAEGKTQGQKDLEILKTDQEILSLDHIHGMVPDEAMQTLTKFPGIGVKTASCVILFC 380
Query: 669 LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT 728
LQ +F VD +V RI+ L W+P + HL E + D ++
Sbjct: 381 LQQPSFAVDTHVHRISGWLKWMPPKATRDQTFSHL--EVRIPDHLK-------------- 424
Query: 729 LYELHYQMITFGK--IFCKKRSP------NCGACPL 756
Y LH + G+ I C+ + N CPL
Sbjct: 425 -YGLHKLFVQHGRSCIRCRANTSEGSEEWNKSECPL 459
>gi|301300783|ref|ZP_07206967.1| endonuclease III [Lactobacillus salivarius ACS-116-V-Col5a]
gi|418961299|ref|ZP_13513186.1| endonuclease III [Lactobacillus salivarius SMXD51]
gi|300851633|gb|EFK79333.1| endonuclease III [Lactobacillus salivarius ACS-116-V-Col5a]
gi|380344966|gb|EIA33312.1| endonuclease III [Lactobacillus salivarius SMXD51]
Length = 213
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 69/165 (41%), Gaps = 26/165 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD E+A IK G A + E LVE S+ VP K + L+ + G+
Sbjct: 68 ADPKEVAKYIKTIGLYKNKAKYLVECSKMLVENFNSV-------VP--KTHKELMSLSGV 118
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + V AF VD +V RI+ RL VP + L+ + P D I+
Sbjct: 119 GRKTADVVLAERFGVPAFAVDTHVHRISKRLAIVPEDATVRETERILMSKVPKEDWIKS- 177
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H++MI +G+ C R+P C CPL C+
Sbjct: 178 ----------------HHRMIFWGRYQCMARAPKCETCPLLEICQ 206
>gi|169350613|ref|ZP_02867551.1| hypothetical protein CLOSPI_01384 [Clostridium spiroforme DSM 1552]
gi|169292667|gb|EDS74800.1| endonuclease III [Clostridium spiroforme DSM 1552]
Length = 214
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 53/116 (45%), Gaps = 17/116 (14%)
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
V++ L + G+G K+ VR ++ AF VD +V RI+ RLG+ ++ L +
Sbjct: 107 VQKELESLPGVGRKTANVVRSVAFDIPAFAVDTHVERISKRLGFAKKDDNVLNVEKKLCR 166
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P ++ + H+Q I FG+ FCK +PNC C L CK
Sbjct: 167 SIP-----------------KERWNKAHHQFIFFGRYFCKATNPNCKECKLFDMCK 205
>gi|57641076|ref|YP_183554.1| endonuclease III [Thermococcus kodakarensis KOD1]
gi|57159400|dbj|BAD85330.1| endonuclease III [Thermococcus kodakarensis KOD1]
Length = 246
Score = 53.1 bits (126), Expect = 0.001, Method: Composition-based stats.
Identities = 43/153 (28%), Positives = 65/153 (42%), Gaps = 34/153 (22%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD + E L+ + G+G K V A PVD +V RI+ RLG P + P
Sbjct: 125 PDDIHE-LMKLPGIGRKCANIVLAYGFGKQAIPVDTHVNRISKRLGLAPPRVAP------ 177
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+K++ YL + +K +Y +++ M+ G+ C+ +P C CPLR C
Sbjct: 178 --------EKVEEYLTALIP--KEKWIY-VNHAMVDHGRSICRPINPKCEECPLREFC-- 224
Query: 763 FASAVASARFALPGPSEKGIVTSEFGNGIGQSP 795
P KG+VT E G + P
Sbjct: 225 --------------PYAKGLVTDEDIKGNARKP 243
>gi|417960104|ref|ZP_12602758.1| Endonuclease III [Candidatus Arthromitus sp. SFB-1]
gi|380331776|gb|EIA22753.1| Endonuclease III [Candidatus Arthromitus sp. SFB-1]
Length = 209
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 73/166 (43%), Gaps = 27/166 (16%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
DE E+ D IK G + + K +N L D E VP + ++ L+ + G+G
Sbjct: 67 DEKELKDKIKSIGLYRM---KSKNIINLCRILEERFDSE----VP--RTRDELITLPGVG 117
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K+ V AF VDV+V R++ R+G + P P + L+K I L
Sbjct: 118 RKTANVVISNCFGVQAFAVDVHVFRVSNRIG-IANSPTPEKTELDLMKN------IDENL 170
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
W +C H+ +I G+ C R PNCG C + CK++
Sbjct: 171 WT-IC----------HHTIIFHGRRCCTSRKPNCGECKISEYCKYY 205
>gi|330502366|ref|YP_004379235.1| DNA-(apurinic or apyrimidinic site) lyase/endonuclease III
[Pseudomonas mendocina NK-01]
gi|328916652|gb|AEB57483.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Pseudomonas mendocina NK-01]
Length = 212
Score = 53.1 bits (126), Expect = 0.001, Method: Composition-based stats.
Identities = 46/173 (26%), Positives = 75/173 (43%), Gaps = 27/173 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS V PD +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIEACRILIEKHGS--------VVPDN-REDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R++ R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVSNRTGIAPGKNVV-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C R P CGAC + C++ A
Sbjct: 169 --------------KDYLLDAHHWLILHGRYVCTARKPRCGACRIEDLCEYKA 207
>gi|46580743|ref|YP_011551.1| HhH-GPD family DNA repair protein [Desulfovibrio vulgaris str.
Hildenborough]
gi|120601969|ref|YP_966369.1| HhH-GPD family protein [Desulfovibrio vulgaris DP4]
gi|387154013|ref|YP_005702949.1| HhH-GPD family protein [Desulfovibrio vulgaris RCH1]
gi|46450163|gb|AAS96811.1| DNA repair protein, HhH-GPD family [Desulfovibrio vulgaris str.
Hildenborough]
gi|120562198|gb|ABM27942.1| DNA-3-methyladenine glycosylase III [Desulfovibrio vulgaris DP4]
gi|311234457|gb|ADP87311.1| HhH-GPD family protein [Desulfovibrio vulgaris RCH1]
Length = 226
Score = 53.1 bits (126), Expect = 0.001, Method: Composition-based stats.
Identities = 34/114 (29%), Positives = 55/114 (48%), Gaps = 17/114 (14%)
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
D ++E LL ++G+G ++ + + L + H +F VD RI R G LP D+H
Sbjct: 120 DALRERLLSVKGIGPETADSILLYAFGHPSFVVDAYTRRILSRHGL-----LPEDVHYDE 174
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLY-ELHYQMITFGKIFCKKRSPNCGACPL 756
++++ MD + LY E H ++ K +C K P+C ACPL
Sbjct: 175 MRDF-FMDVLD----------PDPVLYNEFHALIVRVAKGWCHKSRPDCAACPL 217
>gi|301105551|ref|XP_002901859.1| endonuclease III, HhH-GPD superfamily base excision DNA repair,
putative [Phytophthora infestans T30-4]
gi|262099197|gb|EEY57249.1| endonuclease III, HhH-GPD superfamily base excision DNA repair,
putative [Phytophthora infestans T30-4]
Length = 287
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 84/204 (41%), Gaps = 35/204 (17%)
Query: 568 WDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL- 626
+ ++R++ GE + + E V+ E ++++A+K G + A ++K L
Sbjct: 95 YAAMQRLHQLGESE-----EGLTIEVVQSVSEEKLSEALKPVGFYHRKAHQLKRVAAILR 149
Query: 627 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIAV 685
HG I P + E LL + G+G K + LL+ + VD +V R+A
Sbjct: 150 TRFHGDI---------PRSLDE-LLQLPGIGPKIGRVITLLAWGQVDGIVVDTHVHRLAQ 199
Query: 686 RLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK 745
RLGW P D L D I W +L + ++ FG+ C
Sbjct: 200 RLGWSSTTT-PEDTRKEL------EDWIPKEHWGKLSLV-----------VVGFGQTVCT 241
Query: 746 KRSPNCGACPLRGECKHFASAVAS 769
+ P+C CPL +C VAS
Sbjct: 242 AKHPSCSKCPLATKCPSAFKVVAS 265
>gi|417961869|ref|ZP_12604193.1| Endonuclease III [Candidatus Arthromitus sp. SFB-2]
gi|380331928|gb|EIA22866.1| Endonuclease III [Candidatus Arthromitus sp. SFB-2]
Length = 155
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 73/166 (43%), Gaps = 27/166 (16%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
DE E+ D IK G + + K +N L D E VP + ++ L+ + G+G
Sbjct: 13 DEKELQDKIKSIGLYRM---KSKNIINLCRILEERFDSE----VP--RTRDELITLPGVG 63
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K+ V AF VDV+V R++ R+G + P P + L+K I L
Sbjct: 64 RKTANVVISNCFGVQAFAVDVHVFRVSNRIG-IANSPTPEKTELDLMK------NIDENL 116
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
W +C H+ +I G+ C R PNCG C + CK++
Sbjct: 117 WT-IC----------HHTIIFHGRRCCTSRKPNCGECKISEYCKYY 151
>gi|224541854|ref|ZP_03682393.1| hypothetical protein CATMIT_01026 [Catenibacterium mitsuokai DSM
15897]
gi|224525204|gb|EEF94309.1| endonuclease III [Catenibacterium mitsuokai DSM 15897]
Length = 211
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 17/112 (15%)
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ + G+G K+ V + A VD +V RI+ RLG+ P D L E +
Sbjct: 111 LISLPGVGRKTANVVMAEGFGYPAIAVDTHVERISKRLGFAK----PED--TVLTVEKKL 164
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
M I W + H+QMI FG+ CK SP+C CPL CK
Sbjct: 165 MKTIPKNRWIKT-----------HHQMIFFGRYHCKAMSPHCKECPLVDICK 205
>gi|85710081|ref|ZP_01041146.1| endonuclease III family protein [Erythrobacter sp. NAP1]
gi|85688791|gb|EAQ28795.1| endonuclease III family protein [Erythrobacter sp. NAP1]
Length = 234
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 72/172 (41%), Gaps = 14/172 (8%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WEAV A ++ ++ + + A R+K+ LN ++E GS+DL L N+ + +L
Sbjct: 70 WEAVAAAPVGDLEARLQRQTFPTVAAQRLKQCLNTIIEQRGSVDLRHLSNLETAEAMAWL 129
Query: 651 LDIEGLGLK-SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+ G+ K S + S A +D + RI R+G VP + K Y
Sbjct: 130 ETLPGVARKNSAGVMNASSFNRRAMVIDGHHRRIMQRMGIVPAKA-------DTAKTYDA 182
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ I W + E H + G+ C+ R+ C CP +C+
Sbjct: 183 LMPIVPEEW------SAADIDEHHLLLKKLGQTCCRPRAALCEDCPAAPDCE 228
>gi|367021722|ref|XP_003660146.1| hypothetical protein MYCTH_2298082 [Myceliophthora thermophila ATCC
42464]
gi|347007413|gb|AEO54901.1| hypothetical protein MYCTH_2298082 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/178 (23%), Positives = 71/178 (39%), Gaps = 35/178 (19%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------L 629
S+DW +R + E+A IK G + + IK+ L+ + +
Sbjct: 351 SIDWNKIRLSSHQELAQVIKVAGNGPMKSKHIKQILDIVFDENVQRAMMQDPAPEAGKKA 410
Query: 630 HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGW 689
+ L+++ + D+ + G+G+K+ CV L L+ F VD +V + LGW
Sbjct: 411 QDLLSLDYMHGMTKDEAMAKFVSYPGVGIKTAACVTLFCLRLPCFAVDTHVHKFCRWLGW 470
Query: 690 VPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKR 747
VP P + H ++ V D ++ Y LH I G++ K R
Sbjct: 471 VPENADPDNCFRH--GDFMVPDHLK---------------YGLHQLFIRHGQVCFKCR 511
>gi|390961830|ref|YP_006425664.1| hypothetical protein CL1_1673 [Thermococcus sp. CL1]
gi|390520138|gb|AFL95870.1| hypothetical protein CL1_1673 [Thermococcus sp. CL1]
Length = 238
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 18/120 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD + E L+ + G+G K V A PVD +V RI+ RLG P + P
Sbjct: 126 PDDIHE-LMKLPGIGRKCANIVLAYGFGKQAIPVDTHVNRISKRLGLAPPRVPP------ 178
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+K++ YL R +K +Y +++ M+ GK C P CG CPL+ C +
Sbjct: 179 --------EKVEEYL--RELIPREKWIY-VNHAMVDHGKNICNPIRPKCGECPLKELCPY 227
>gi|342732970|ref|YP_004771809.1| endonuclease III [Candidatus Arthromitus sp. SFB-mouse-Japan]
gi|384456337|ref|YP_005668934.1| endonuclease III [Candidatus Arthromitus sp. SFB-mouse-Yit]
gi|417967782|ref|ZP_12608847.1| Endonuclease III [Candidatus Arthromitus sp. SFB-co]
gi|418015623|ref|ZP_12655188.1| endonuclease III [Candidatus Arthromitus sp. SFB-mouse-NYU]
gi|418373221|ref|ZP_12965312.1| Endonuclease III [Candidatus Arthromitus sp. SFB-mouse-SU]
gi|342330425|dbj|BAK57067.1| endonuclease III [Candidatus Arthromitus sp. SFB-mouse-Japan]
gi|345505958|gb|EGX28252.1| endonuclease III [Candidatus Arthromitus sp. SFB-mouse-NYU]
gi|346984682|dbj|BAK80358.1| endonuclease III [Candidatus Arthromitus sp. SFB-mouse-Yit]
gi|380340820|gb|EIA29370.1| Endonuclease III [Candidatus Arthromitus sp. SFB-co]
gi|380341610|gb|EIA30084.1| Endonuclease III [Candidatus Arthromitus sp. SFB-mouse-SU]
Length = 209
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 73/166 (43%), Gaps = 27/166 (16%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
DE E+ D IK G + + K +N L D E VP + ++ L+ + G+G
Sbjct: 67 DEKELQDKIKSIGLYRM---KSKNIINLCRILEERFDSE----VP--RTRDELITLPGVG 117
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K+ V AF VDV+V R++ R+G + P P + L+K I L
Sbjct: 118 RKTANVVISNCFGVQAFAVDVHVFRVSNRIG-IANSPTPEKTELDLMKN------IDENL 170
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
W +C H+ +I G+ C R PNCG C + CK++
Sbjct: 171 WT-IC----------HHTIIFHGRRCCTSRKPNCGECKISEYCKYY 205
>gi|156937657|ref|YP_001435453.1| HhH-GPD family protein [Ignicoccus hospitalis KIN4/I]
gi|156566641|gb|ABU82046.1| HhH-GPD family protein [Ignicoccus hospitalis KIN4/I]
Length = 212
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 50/169 (29%), Positives = 72/169 (42%), Gaps = 29/169 (17%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EAV + + I+ G Q A I E + E+ +I+ K L
Sbjct: 65 EAVLSLGTERLKELIRPAGLQEQKASAIVEAARKWEEVKKAIE---------KGDKGVLT 115
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I+G+G K+ + V L+S H FPVD +V R+A RLG V G+ + KE
Sbjct: 116 RIKGIGEKTADVV-LMSFGHEEFPVDTHVKRVAKRLGLVD-----GNAY----KE----- 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ RL L + E H +I G+ +CK + P C CPL C
Sbjct: 161 -----VSSRLKELFKGRTREAHMYLILLGRKYCKAKKPLCSECPLSDLC 204
>gi|431796640|ref|YP_007223544.1| endonuclease III [Echinicola vietnamensis DSM 17526]
gi|430787405|gb|AGA77534.1| endonuclease III [Echinicola vietnamensis DSM 17526]
Length = 227
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 63/137 (45%), Gaps = 22/137 (16%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP-LPGDLHI 701
PD VKE L+ + G+G K+ + + VD +V R++ RLG VP P ++
Sbjct: 106 PDTVKE-LIKLPGVGRKTANVITSVVWNQPNMAVDTHVFRVSKRLGLVPQNAKTPLEVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P ++ ++ H+ +I G+ C RSP C AC + C+
Sbjct: 165 QLIKHIP-----------------KEHIHVAHHWLILHGRYVCLARSPKCTACEITHFCR 207
Query: 762 HF---ASAVASARFALP 775
++ S +A+ R LP
Sbjct: 208 YYEKNQSKIAAERDLLP 224
>gi|49078760|gb|AAT49817.1| PA3495, partial [synthetic construct]
Length = 213
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 44/171 (25%), Positives = 75/171 (43%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HG VP ++ E L
Sbjct: 62 EAIHALGVEGLSEYIKTIGLYNSKAKNVIETCRILIEKHGG-------QVPDNR--EDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R+A R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVANRTGIAPGKNVL-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
++ L + H+ +I G+ CK R P CG+C + C++
Sbjct: 169 --------------REYLLDAHHWLILHGRYVCKARKPQCGSCRIEDLCEY 205
>gi|288556139|ref|YP_003428074.1| endonuclease III [Bacillus pseudofirmus OF4]
gi|288547299|gb|ADC51182.1| endonuclease III [Bacillus pseudofirmus OF4]
Length = 218
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 75/161 (46%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ + I+ G A IK+ LVE + VP D+ + L+ + G+G K+
Sbjct: 72 ELEEDIRRIGLFRSKAKNIKKLSQSLVEQYNG-------EVPKDR--DELVKLAGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + +++++ E +M KI + LW
Sbjct: 123 ANVVTSVAFGVPAIAVDTHVERVSKRLGICRWKD-----NVNVV-EQTLMKKIPIELWS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------DSHHRLIFFGRYHCKAQSPKCETCPLLDRCR 206
>gi|399519845|ref|ZP_10760636.1| endonuclease III [Pseudomonas pseudoalcaligenes CECT 5344]
gi|399112242|emb|CCH37195.1| endonuclease III [Pseudomonas pseudoalcaligenes CECT 5344]
Length = 212
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 75/173 (43%), Gaps = 27/173 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP ++ E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIEACRILIEKHGS-------QVPDNR--EDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R++ R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVSNRTGIAPGKNVV-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C R P CGAC + C++ A
Sbjct: 169 --------------KDYLLDAHHWLILHGRYVCTARKPRCGACRIEDLCEYKA 207
>gi|296133481|ref|YP_003640728.1| HhH-GPD family protein [Thermincola potens JR]
gi|296032059|gb|ADG82827.1| HhH-GPD family protein [Thermincola potens JR]
Length = 232
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 76/171 (44%), Gaps = 18/171 (10%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYL 650
EA+ A EI I + A +++ F+N +++ HG DL+ ++++ L
Sbjct: 68 EAMYKAPIEEIEKHIVPTLYWRMKAKKLRAFVNHIMDNYHG--DLDKFLQKDKEELRREL 125
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
L + G+G ++ + + L + + F VD RI RLG+ ++ +K P
Sbjct: 126 LSLYGIGPETADSIILYAAEQPVFVVDAYTRRIFHRLGFFEESVSYDEMQQFFMKHIP-- 183
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P + Y + E H ++ G FC + P+CG CP++ C+
Sbjct: 184 --------PDVRYYN-----EYHALIVGIGNRFCSNKKPDCGNCPIQSVCR 221
>gi|95928500|ref|ZP_01311247.1| HhH-GPD [Desulfuromonas acetoxidans DSM 684]
gi|95135290|gb|EAT16942.1| HhH-GPD [Desulfuromonas acetoxidans DSM 684]
Length = 220
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 50/116 (43%), Gaps = 15/116 (12%)
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
V++ LLD G+G ++ +C+ L F VD RI RLG LL
Sbjct: 115 VRQRLLDQPGIGPETADCMVLYGAGLPIFVVDAYTRRIFSRLG--------------LLD 160
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
D +Q Y L D E H ++ GK+ C+ R+P C ACPL C+
Sbjct: 161 AKARYDMVQRYAMQHL-PADTSLFNEFHALLVELGKVCCRSRNPRCEACPLNQHCR 215
>gi|325107269|ref|YP_004268337.1| DNA-(apurinic or apyrimidinic site) lyase [Planctomyces
brasiliensis DSM 5305]
gi|324967537|gb|ADY58315.1| DNA-(apurinic or apyrimidinic site) lyase [Planctomyces
brasiliensis DSM 5305]
Length = 231
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 68/167 (40%), Gaps = 27/167 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
++EIA AIK G N A I F L+ E+ VP + + L+ + G+G
Sbjct: 86 QAEIAAAIKPCGLYNSKAKNIHRFCEVLLS-------EFDGRVP--RTRAELMSLPGIGR 136
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K + V+ + VD +V R+ R G L D L
Sbjct: 137 KCADIVQQFAFDIDVIAVDTHVHRVCNRTG---------------LAVGKTADATARSLE 181
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
R + TL+E H+ +I FGK C R+P C C L C+++ S
Sbjct: 182 ERA---PEWTLHEGHFWLIQFGKQICHARTPRCENCSLNHLCRYYRS 225
>gi|116748504|ref|YP_845191.1| endonuclease III [Syntrophobacter fumaroxidans MPOB]
gi|116697568|gb|ABK16756.1| endonuclease III / DNA-(apurinic or apyrimidinic site) lyase
[Syntrophobacter fumaroxidans MPOB]
Length = 227
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 64/150 (42%), Gaps = 27/150 (18%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
A IKE L E HG +PP+ + L+ + G+G K+ + +
Sbjct: 100 ARNIKEACRVLAEEHGG-------EIPPNL--DILVKLPGIGRKTANVILGNAFGIPGIV 150
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQ 735
VD +VGR++ RLG + P + E +M+ I W + C+ Q
Sbjct: 151 VDTHVGRVSERLGLTS-EKDPEKI------ERDLMEIIPREKWIKFCH-----------Q 192
Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHFAS 765
+I G+ C+ R P G CPLR C H A+
Sbjct: 193 LIGLGREICQARKPKTGVCPLRPHCDHAAA 222
>gi|388543463|ref|ZP_10146754.1| endonuclease III [Pseudomonas sp. M47T1]
gi|388278775|gb|EIK98346.1| endonuclease III [Pseudomonas sp. M47T1]
Length = 212
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 68/161 (42%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELHG VP D+ E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKARNVIEACRMLVELHGG-------EVPQDR--ESLEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G P K ++K + PR
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTGIAPG------------KNVIEVEKNLLKFVPRQ 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C+ R P CG+C + C +
Sbjct: 171 YLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCDY 205
>gi|15598691|ref|NP_252185.1| endonuclease III [Pseudomonas aeruginosa PAO1]
gi|107103025|ref|ZP_01366943.1| hypothetical protein PaerPA_01004094 [Pseudomonas aeruginosa PACS2]
gi|116051512|ref|YP_789652.1| endonuclease III [Pseudomonas aeruginosa UCBPP-PA14]
gi|218890260|ref|YP_002439124.1| endonuclease III [Pseudomonas aeruginosa LESB58]
gi|254236439|ref|ZP_04929762.1| endonuclease III [Pseudomonas aeruginosa C3719]
gi|254242175|ref|ZP_04935497.1| endonuclease III [Pseudomonas aeruginosa 2192]
gi|296387984|ref|ZP_06877459.1| endonuclease III [Pseudomonas aeruginosa PAb1]
gi|313108859|ref|ZP_07794842.1| endonuclease III [Pseudomonas aeruginosa 39016]
gi|386057540|ref|YP_005974062.1| endonuclease III [Pseudomonas aeruginosa M18]
gi|386067541|ref|YP_005982845.1| endonuclease III [Pseudomonas aeruginosa NCGM2.S1]
gi|392982768|ref|YP_006481355.1| endonuclease III [Pseudomonas aeruginosa DK2]
gi|416867830|ref|ZP_11916116.1| endonuclease III [Pseudomonas aeruginosa 138244]
gi|416875861|ref|ZP_11918934.1| endonuclease III [Pseudomonas aeruginosa 152504]
gi|418584272|ref|ZP_13148336.1| endonuclease III [Pseudomonas aeruginosa MPAO1/P1]
gi|418591842|ref|ZP_13155728.1| endonuclease III [Pseudomonas aeruginosa MPAO1/P2]
gi|419754900|ref|ZP_14281258.1| endonuclease III [Pseudomonas aeruginosa PADK2_CF510]
gi|420138609|ref|ZP_14646508.1| endonuclease III [Pseudomonas aeruginosa CIG1]
gi|421152679|ref|ZP_15612257.1| endonuclease III [Pseudomonas aeruginosa ATCC 14886]
gi|421158996|ref|ZP_15618178.1| endonuclease III [Pseudomonas aeruginosa ATCC 25324]
gi|421166302|ref|ZP_15624564.1| endonuclease III [Pseudomonas aeruginosa ATCC 700888]
gi|421173249|ref|ZP_15631001.1| endonuclease III [Pseudomonas aeruginosa CI27]
gi|421179340|ref|ZP_15636932.1| endonuclease III [Pseudomonas aeruginosa E2]
gi|421518036|ref|ZP_15964710.1| endonuclease III [Pseudomonas aeruginosa PAO579]
gi|424939679|ref|ZP_18355442.1| endonuclease III [Pseudomonas aeruginosa NCMG1179]
gi|451987559|ref|ZP_21935715.1| Endonuclease III [Pseudomonas aeruginosa 18A]
gi|9949641|gb|AAG06883.1|AE004770_8 endonuclease III [Pseudomonas aeruginosa PAO1]
gi|115586733|gb|ABJ12748.1| endonuclease III [Pseudomonas aeruginosa UCBPP-PA14]
gi|126168370|gb|EAZ53881.1| endonuclease III [Pseudomonas aeruginosa C3719]
gi|126195553|gb|EAZ59616.1| endonuclease III [Pseudomonas aeruginosa 2192]
gi|218770483|emb|CAW26248.1| endonuclease III [Pseudomonas aeruginosa LESB58]
gi|310881344|gb|EFQ39938.1| endonuclease III [Pseudomonas aeruginosa 39016]
gi|334833542|gb|EGM12621.1| endonuclease III [Pseudomonas aeruginosa 138244]
gi|334841489|gb|EGM20117.1| endonuclease III [Pseudomonas aeruginosa 152504]
gi|346056125|dbj|GAA16008.1| endonuclease III [Pseudomonas aeruginosa NCMG1179]
gi|347303846|gb|AEO73960.1| endonuclease III [Pseudomonas aeruginosa M18]
gi|348036100|dbj|BAK91460.1| endonuclease III [Pseudomonas aeruginosa NCGM2.S1]
gi|375046119|gb|EHS38687.1| endonuclease III [Pseudomonas aeruginosa MPAO1/P1]
gi|375049324|gb|EHS41825.1| endonuclease III [Pseudomonas aeruginosa MPAO1/P2]
gi|384398718|gb|EIE45123.1| endonuclease III [Pseudomonas aeruginosa PADK2_CF510]
gi|392318273|gb|AFM63653.1| endonuclease III [Pseudomonas aeruginosa DK2]
gi|403248618|gb|EJY62175.1| endonuclease III [Pseudomonas aeruginosa CIG1]
gi|404347518|gb|EJZ73867.1| endonuclease III [Pseudomonas aeruginosa PAO579]
gi|404524991|gb|EKA35279.1| endonuclease III [Pseudomonas aeruginosa ATCC 14886]
gi|404536198|gb|EKA45844.1| endonuclease III [Pseudomonas aeruginosa CI27]
gi|404538732|gb|EKA48254.1| endonuclease III [Pseudomonas aeruginosa ATCC 700888]
gi|404547208|gb|EKA56220.1| endonuclease III [Pseudomonas aeruginosa E2]
gi|404548847|gb|EKA57786.1| endonuclease III [Pseudomonas aeruginosa ATCC 25324]
gi|451754792|emb|CCQ88238.1| Endonuclease III [Pseudomonas aeruginosa 18A]
gi|453047484|gb|EME95198.1| endonuclease III [Pseudomonas aeruginosa PA21_ST175]
Length = 212
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 44/171 (25%), Positives = 75/171 (43%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HG VP ++ E L
Sbjct: 62 EAIHALGVEGLSEYIKTIGLYNSKAKNVIETCRILIEKHGG-------QVPDNR--EDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R+A R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVANRTGIAPGKNVL-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
++ L + H+ +I G+ CK R P CG+C + C++
Sbjct: 169 --------------REYLLDAHHWLILHGRYVCKARKPQCGSCRIEDLCEY 205
>gi|452847106|gb|EME49038.1| hypothetical protein DOTSEDRAFT_49374 [Dothistroma septosporum
NZE10]
Length = 497
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/195 (24%), Positives = 72/195 (36%), Gaps = 44/195 (22%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS--------------- 632
SVDW+ VR A + E+ AI+ G + + I+ L E + +
Sbjct: 238 SVDWDVVRRAPQKEVFKAIERGGLAQVKSKDIQAILQIAYEENQARKAAFTDPSDNPAGA 297
Query: 633 -------------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA 673
I L+ L + D +L + G+G K+ CV L LQ +
Sbjct: 298 ENEPEKEKQNEVTKAKQNIISLDHLHLLSTDDAINKMLSLPGIGPKTASCVALFCLQRPS 357
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
F VD +V R+ LGWVP G + Y D + + Y LH
Sbjct: 358 FAVDTHVFRLCQYLGWVPKSTRKGQSKVDRNTTYSHCD----------VRIPDELKYPLH 407
Query: 734 YQMITFGKIFCKKRS 748
+I GK+ + R+
Sbjct: 408 QLLIKHGKVCPRCRA 422
>gi|378828254|ref|YP_005190986.1| endonuclease III [Sinorhizobium fredii HH103]
gi|365181306|emb|CCE98161.1| putative endonuclease III protein [Sinorhizobium fredii HH103]
Length = 275
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 70/170 (41%), Gaps = 27/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + E ++ DAIK G A + RL+ G VP K +E L+
Sbjct: 109 EKMLALGEEKLRDAIKTIGLYRNKAKNVIALSERLIADFGG-------AVP--KTREELM 159
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V ++ VD ++ RIA R+ P + P ++ +L++ P
Sbjct: 160 TLPGVGRKTANVVLQMAFGQSTIAVDTHLFRIANRIRLAPGK-TPDEVEANLMRVIP--- 215
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
QK LY H+ +I G+ CK R P C C + CK
Sbjct: 216 --------------QKYLYHAHHWLILHGRYVCKARRPECERCIIADICK 251
>gi|355640219|ref|ZP_09051638.1| endonuclease III [Pseudomonas sp. 2_1_26]
gi|354831431|gb|EHF15446.1| endonuclease III [Pseudomonas sp. 2_1_26]
Length = 212
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 44/171 (25%), Positives = 75/171 (43%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HG VP ++ E L
Sbjct: 62 EAIHALGVEGLSEYIKTIGLYNSKAKNVIETCRILIEKHGG-------QVPDNR--EDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R+A R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVANRTGIAPGKNVL-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
++ L + H+ +I G+ CK R P CG+C + C++
Sbjct: 169 --------------REYLLDAHHWLILHGRYVCKARKPQCGSCRIEDLCEY 205
>gi|197119883|ref|YP_002140310.1| endonuclease III-like protein [Geobacter bemidjiensis Bem]
gi|197089243|gb|ACH40514.1| endonuclease III-related protein [Geobacter bemidjiensis Bem]
Length = 228
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 49/186 (26%), Positives = 79/186 (42%), Gaps = 40/186 (21%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL----EWLRNVPPDKVK 647
EA+R DE +A+ I+ G N+ + R+K F+ L E +GS+D +W+ ++
Sbjct: 70 EALREIDEGRLAELIRPSGFFNVKSARLKGFVGWLFERYGSLDAMFQGDWI------GLR 123
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI------ 701
E L + G+G ++ + + L + +F VD R+ RLG L D H
Sbjct: 124 EELSAVRGIGPETCDSILLYAGGKPSFVVDAYTRRLFSRLG---LMREEDDYHRVRALFM 180
Query: 702 -HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
HL E P+ + E H ++ K C+K+ P C CPL C
Sbjct: 181 DHLPAEVPLFN-------------------EYHALIVEQCKRHCRKK-PLCDGCPLTRFC 220
Query: 761 KHFASA 766
A+
Sbjct: 221 TFLAAG 226
>gi|299538661|ref|ZP_07051944.1| endonuclease III [Lysinibacillus fusiformis ZC1]
gi|424736041|ref|ZP_18164502.1| endonuclease III [Lysinibacillus fusiformis ZB2]
gi|298726248|gb|EFI66840.1| endonuclease III [Lysinibacillus fusiformis ZC1]
gi|422950070|gb|EKU44440.1| endonuclease III [Lysinibacillus fusiformis ZB2]
Length = 220
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A I+ RL++ +G +P +E L+ + G+G K+
Sbjct: 72 ELQQDIRSIGLYRNKAKNIQALCQRLLDEYGG-------EIPA--TREALVTLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ E +M K M W +
Sbjct: 123 ANVVLSVAFDIPALAVDTHVERVSKRLGLCRWKDSVLEV------EETIMKKTPMEKWSK 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+Q+I FG+ CK ++P C CPL +C+
Sbjct: 177 T-----------HHQLIFFGRYHCKAQNPGCRTCPLLNDCR 206
>gi|198283588|ref|YP_002219909.1| HhH-GPD family protein [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218665212|ref|YP_002426212.1| base excision repair protein, HhH-GPD family [Acidithiobacillus
ferrooxidans ATCC 23270]
gi|198248109|gb|ACH83702.1| HhH-GPD family protein [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218517425|gb|ACK78011.1| base excision repair protein, HhH-GPD family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 223
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 48/185 (25%), Positives = 82/185 (44%), Gaps = 22/185 (11%)
Query: 586 MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHG----SIDLEWLRNV 641
+D++ A+ E ++A+ ++ G I R+ L R +E G L N+
Sbjct: 57 VDALSVRALLALPEGDLAELLRPSGFYRIKTRRLLA-LCRFLEARGVGAAPEQLARQANI 115
Query: 642 PPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
P +++ LL + G+G ++ + + L +L VD RI RLG
Sbjct: 116 P--TLRKDLLAVHGIGAETADSILLYALGLPVMVVDAYTRRIGSRLG------------- 160
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
LL++ ++Q + L D +T LH +++ GK +C+ R P CG CPL C
Sbjct: 161 -LLEDDLSYGEVQAGMEAELQPGDVQTRNALHALLVSLGKDYCRPR-PRCGLCPLHACCA 218
Query: 762 HFASA 766
+ A A
Sbjct: 219 YAAGA 223
>gi|422644190|ref|ZP_16707328.1| endonuclease III [Pseudomonas syringae pv. maculicola str. ES4326]
gi|330957742|gb|EGH58002.1| endonuclease III [Pseudomonas syringae pv. maculicola str. ES4326]
Length = 212
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH S VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNTKAKNVIETCRMLVELHNS-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + +A VD ++ R++ R G P + + ++ L+K P
Sbjct: 123 NVVLNTAFRQVAMAVDTHIFRVSNRTGIAPGKNVV-EVEKQLMKFVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ L H+ +I G+ C+ R P CG+C + C +
Sbjct: 169 ----KNYLLYAHHWLILHGRYVCQARKPRCGSCRIEDLCDY 205
>gi|229588698|ref|YP_002870817.1| endonuclease III [Pseudomonas fluorescens SBW25]
gi|229360564|emb|CAY47421.1| endonuclease III [Pseudomonas fluorescens SBW25]
Length = 212
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E LVELHGS VP + +E L
Sbjct: 62 EAIYALGVDGLSEYIKTIGLYNSKAKNVIETCRLLVELHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G I K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KQLMKFVPKPYLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|312134533|ref|YP_004001871.1| hhh-gpd family protein [Caldicellulosiruptor owensensis OL]
gi|311774584|gb|ADQ04071.1| HhH-GPD family protein [Caldicellulosiruptor owensensis OL]
Length = 234
Score = 52.8 bits (125), Expect = 0.001, Method: Composition-based stats.
Identities = 46/173 (26%), Positives = 75/173 (43%), Gaps = 17/173 (9%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + + ++A+ IK G N A R+KEF N L S DLE L + +++ LL
Sbjct: 75 EGILQTPDEKLAELIKPAGYYNQKAKRLKEFCNFLKREFNS-DLEKLFALDISSLRKALL 133
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+G+G ++ + + L + F VD R+ RLG + + + Y +
Sbjct: 134 SQKGIGFETADSIILYGAEKPIFVVDSYTKRLFYRLGLIESEKI----------SYSDLQ 183
Query: 712 KIQMY-LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
I M L P+ + + E H ++ K CK + P C C LR C +
Sbjct: 184 AIIMAKLTPQTKFFN-----EFHALIVKHCKEICKSKKPICNKCCLRLICNYL 231
>gi|336476216|ref|YP_004615357.1| DNA-(apurinic or apyrimidinic site) lyase [Methanosalsum zhilinae
DSM 4017]
gi|335929597|gb|AEH60138.1| DNA-(apurinic or apyrimidinic site) lyase [Methanosalsum zhilinae
DSM 4017]
Length = 203
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 71/164 (43%), Gaps = 27/164 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A+ SEI + IK+ G + + R+KE L+ +G VP + E LL++ G+
Sbjct: 63 AEVSEIEELIKDVGFYRVKSRRVKEIAEILMYRYGG-------EVPDNC--ELLLELPGV 113
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ CV L + VD +V RI+ RLG V P + L K
Sbjct: 114 GRKTANCVLLYAFSKETIAVDTHVHRISNRLGLVK-SSTPDETEEKLKK----------- 161
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ PR + D L+ + FG+ C+ SP C C L C
Sbjct: 162 ILPRSSWKDINELF------VQFGQNICRPVSPKCDICVLCNIC 199
>gi|218883640|ref|YP_002428022.1| Endonuclease III [Desulfurococcus kamchatkensis 1221n]
gi|218765256|gb|ACL10655.1| Endonuclease III [Desulfurococcus kamchatkensis 1221n]
Length = 238
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 56/119 (47%), Gaps = 18/119 (15%)
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
++ + L+++ G+G K+ + V L + FPVD ++ RI +R+G+ G
Sbjct: 129 EEARRLLMELPGVGWKTADVVLLRYFRKPVFPVDTHITRITMRMGFT------GS----- 177
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ Y + + M YLD LH +IT G+ C+ R P C C LR CK+
Sbjct: 178 -RSYKHISRFWMDNTSPGNYLD------LHLYLITHGRRTCRARKPLCNKCVLRDMCKY 229
>gi|317148991|ref|XP_003190266.1| HhH-GPD family base excision DNA repair protein [Aspergillus oryzae
RIB40]
Length = 457
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 35/139 (25%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN---------RLVELHGS------ 632
SV+W+AVR A ++ +AIK G ++ + +K L+ R + + G
Sbjct: 229 SVNWDAVRQAPLKDVFEAIKSGGLADVKSKNLKAILDMVHKENQERREILVKGEDAGPPD 288
Query: 633 --------------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI 672
+ L L + ++V E L+ G+G K+ CV L LQ
Sbjct: 289 LMQKSEGSKQYEIACADQHFLSLNHLHTLNTEQVMEELIKYPGIGPKTAACVLLFCLQRP 348
Query: 673 AFPVDVNVGRIAVRLGWVP 691
F VD ++ RI LGWVP
Sbjct: 349 CFAVDTHIFRICKWLGWVP 367
>gi|410668427|ref|YP_006920798.1| endonuclease [Thermacetogenium phaeum DSM 12270]
gi|409106174|gb|AFV12299.1| putative endonuclease [Thermacetogenium phaeum DSM 12270]
Length = 261
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 71/170 (41%), Gaps = 16/170 (9%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + + E+A ++ N A +++ F +VE G +LE + ++++ LL
Sbjct: 65 EGLLSCSQEELAALVRPARYHNQKAKKLQSFCRVVVEEFGG-ELEDFLALEMKELRKRLL 123
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G ++ + + L + + F +D GRI RLG+ + Y M
Sbjct: 124 AVYGIGPETADAIILYAAEKPIFVIDAYTGRIFSRLGFFSEKA-----------GYEEMQ 172
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ M P D + E H + G C K P CG CPL CK
Sbjct: 173 RFFMTHLPP----DVRLYNEYHALIDALGHYTCCKSQPRCGECPLEYRCK 218
>gi|408483508|ref|ZP_11189727.1| endonuclease III [Pseudomonas sp. R81]
Length = 212
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E LVELHGS VP + +E L
Sbjct: 62 EAIYALGVDGLSEYIKTIGLYNSKAKNVIETCRLLVELHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G I K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KQLMKFVPKPYLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|146306430|ref|YP_001186895.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Pseudomonas mendocina ymp]
gi|421504740|ref|ZP_15951681.1| endonuclease III [Pseudomonas mendocina DLHK]
gi|145574631|gb|ABP84163.1| DNA-(apurinic or apyrimidinic site) lyase [Pseudomonas mendocina
ymp]
gi|400344698|gb|EJO93067.1| endonuclease III [Pseudomonas mendocina DLHK]
Length = 212
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 75/173 (43%), Gaps = 27/173 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP ++ E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIEACRILIEKHGS-------QVPDNR--EDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R++ R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVSNRTGIAPGKNVV-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C R P CGAC + C++ A
Sbjct: 169 --------------KDYLLDAHHWLILHGRYVCTARKPRCGACRIEDLCEYKA 207
>gi|357452087|ref|XP_003596320.1| DEMETER-like protein [Medicago truncatula]
gi|355485368|gb|AES66571.1| DEMETER-like protein [Medicago truncatula]
Length = 192
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/91 (43%), Positives = 53/91 (58%), Gaps = 2/91 (2%)
Query: 127 EHGALVPHQ-IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKER 185
E ALV Q + KK V LD T ++ L+++ + D T NED KWW++ER
Sbjct: 24 EQNALVLFQGSSDPVKKQHANVDLDEETNKVRKLLLSYLNPDGVDGT-NEDKAKWWEEER 82
Query: 186 EVFEGRIQSFTARMHLILGDRRFKPWKGSVV 216
++F GR +SF ARMH I G +F WKGSVV
Sbjct: 83 KLFRGRSESFIARMHPIQGYIQFSIWKGSVV 113
>gi|160902834|ref|YP_001568415.1| endonuclease III [Petrotoga mobilis SJ95]
gi|160360478|gb|ABX32092.1| endonuclease III [Petrotoga mobilis SJ95]
Length = 210
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 20/145 (13%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
AG K+ R++ + I +E PD+++E L+++ G+G K+ V +S A
Sbjct: 75 AGMYKQKSERIINI-SKILIEKYNGKVPDELEE-LIELPGVGRKTANIVLYVSFGKEALA 132
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQ 735
VD +V RI+ RLGWV + P + L K P LW L+
Sbjct: 133 VDTHVHRISNRLGWVKTK-TPEETEEQLKKIIP------SELWG-----------PLNGS 174
Query: 736 MITFGKIFCKKRSPNCGACPLRGEC 760
M+ FG+ CK SP C C L C
Sbjct: 175 MVNFGQKICKPISPKCDECFLNEVC 199
>gi|398380961|ref|ZP_10539074.1| endonuclease III [Rhizobium sp. AP16]
gi|397720025|gb|EJK80586.1| endonuclease III [Rhizobium sp. AP16]
Length = 259
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 18/117 (15%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ +E L+ + G+G K+ V ++ H VD +V RIA RL P + P ++ LL
Sbjct: 135 QTREELMTLPGVGRKTANVVMSMAFGHATLAVDTHVFRIANRLLLAPGK-TPDEVEQRLL 193
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K P DQ LY H+ +I G+ CK R P C C + C+
Sbjct: 194 KVIP----------------DQ-YLYHAHHWLILHGRYVCKARKPECERCVIADLCR 233
>gi|330837763|ref|YP_004412404.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerochaeta coccoides
DSM 17374]
gi|329749666|gb|AEC03022.1| DNA-(apurinic or apyrimidinic site) lyase [Sphaerochaeta coccoides
DSM 17374]
Length = 237
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 67/168 (39%), Gaps = 27/168 (16%)
Query: 593 AVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLD 652
A+ + +I AI G A I E +LVE HG + VPPDK E L+
Sbjct: 88 AMLALSQEQIETAIAPAGFFRTKARNILEISKKLVEEHGGL-------VPPDK--EALVS 138
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
+ G+G K+ L A VD +V IA R GWV + P L K P
Sbjct: 139 LPGVGTKTANLTLNLGFGIDAICVDCHVHTIANRTGWVSTKN-PEQTEKELEKILP---- 193
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
++ L+ ++++G+ C SP C CP+ C
Sbjct: 194 -------------RRFWIPLNELLVSYGQKICTSVSPRCSICPIASTC 228
>gi|238494280|ref|XP_002378376.1| HhH-GPD family base excision DNA repair protein [Aspergillus flavus
NRRL3357]
gi|220695026|gb|EED51369.1| HhH-GPD family base excision DNA repair protein [Aspergillus flavus
NRRL3357]
Length = 468
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 35/139 (25%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN---------RLVELHGS------ 632
SV+W+AVR A ++ +AIK G ++ + +K L+ R + + G
Sbjct: 240 SVNWDAVRQAPLKDVFEAIKSGGLADVKSKNLKAILDMVHKENQERREILVKGEDAGPSD 299
Query: 633 --------------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI 672
+ L L + ++V E L+ G+G K+ CV L LQ
Sbjct: 300 LMQKSEGSKQYEIACADQHFLSLNHLHTLNTEQVMEELIKYPGIGPKTAACVLLFCLQRP 359
Query: 673 AFPVDVNVGRIAVRLGWVP 691
F VD ++ RI LGWVP
Sbjct: 360 CFAVDTHIFRICKWLGWVP 378
>gi|397780479|ref|YP_006544952.1| endonuclease III related protein [Methanoculleus bourgensis MS2]
gi|396938981|emb|CCJ36236.1| endonuclease III related protein [Methanoculleus bourgensis MS2]
Length = 233
Score = 52.4 bits (124), Expect = 0.001, Method: Composition-based stats.
Identities = 52/180 (28%), Positives = 73/180 (40%), Gaps = 17/180 (9%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+D + + AD +IA I N A RI+EF V D + V ++E
Sbjct: 63 LDPDLLAAADTGDIARLIIPSRFYNQKAERIQEFAGIYVA-EFQADPAVMAAVETGALRE 121
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL + G G ++V+ + L + + F VD RI R G LL E
Sbjct: 122 RLLAVRGFGKETVDTILLYACRKPVFVVDAYTRRIFSRYG--------------LLPEKV 167
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
D+ Q L D K + H Q++ GK C+K SP C CP+R C A A
Sbjct: 168 SYDRTQRLFSDHLP-PDVKLFNDYHAQIVRLGKTACRK-SPLCDRCPIRRVCGSLRCAAA 225
>gi|395496886|ref|ZP_10428465.1| endonuclease III [Pseudomonas sp. PAMC 25886]
Length = 212
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E LVELHGS VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIETCRLLVELHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G I K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KQLMKFVPKPYLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|116193597|ref|XP_001222611.1| hypothetical protein CHGG_06516 [Chaetomium globosum CBS 148.51]
gi|88182429|gb|EAQ89897.1| hypothetical protein CHGG_06516 [Chaetomium globosum CBS 148.51]
Length = 549
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 64/160 (40%), Gaps = 31/160 (19%)
Query: 571 LRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-- 628
L R Y G+ + S++WE VR + +E+ IK G + IK L+ + E
Sbjct: 301 LVRYYGRGQVGT--SAGSINWEKVRLSTHAELTQVIKVAGNGPNRSQHIKRILDMVHEEN 358
Query: 629 --------------------------LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVE 662
LH + L+ +R + D+ L G+G+K+
Sbjct: 359 VQRAKMQRPETEAGQTQAVAEAGKTALH-LLSLDHMRAMSKDEAMAKFLSYPGIGIKTAA 417
Query: 663 CVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
CV L LQ F VD +V + LGWVP + P + H
Sbjct: 418 CVTLFCLQKPCFAVDTHVHKFCRWLGWVPDKADPDNCFRH 457
>gi|420236383|ref|ZP_14740869.1| endonuclease III [Parascardovia denticolens IPLA 20019]
gi|391880559|gb|EIT89050.1| endonuclease III [Parascardovia denticolens IPLA 20019]
Length = 318
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 67/172 (38%), Gaps = 22/172 (12%)
Query: 593 AVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLD 652
A+ A+ SE+A+ I G A N L+ G VP + E L
Sbjct: 166 ALSQANPSEVAEIINSIGFFRTKAQHAVMIANDLITRFGG-------EVP--RTMEELTT 216
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
+ G+G K+ + + FPVD +V R+ RL W D + + KE V
Sbjct: 217 LPGVGRKTANVILGNAFDLPGFPVDTHVMRVTKRLHWRSDWNKTKDDPVAIEKE--VTAA 274
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ W +L +++I FG+ C R P C CPLR C F
Sbjct: 275 FEPTEW-----------RDLSHRLIDFGRDTCHARKPECLICPLRDTCPSFG 315
>gi|283782700|ref|YP_003373454.1| endonuclease III [Gardnerella vaginalis 409-05]
gi|298253131|ref|ZP_06976923.1| EndoIII-like endonuclease [Gardnerella vaginalis 5-1]
gi|283442104|gb|ADB14570.1| endonuclease III [Gardnerella vaginalis 409-05]
gi|297532526|gb|EFH71412.1| EndoIII-like endonuclease [Gardnerella vaginalis 5-1]
Length = 224
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 50/113 (44%), Gaps = 15/113 (13%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L+ + G+G K+ V + FPVD +V R+ RLGW +Q P + I
Sbjct: 122 EALISLPGVGRKTANVVLGNAFGLPGFPVDTHVIRVTGRLGWRKVQNRPNPVAIE----- 176
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+I Y P K L +++I G+ C R+P C +CPL C
Sbjct: 177 ---REITAYFAP-------KEWTNLSHRLILHGRKICTARNPKCVSCPLNTTC 219
>gi|212526396|ref|XP_002143355.1| HhH-GPD family base excision DNA repair protein [Talaromyces
marneffei ATCC 18224]
gi|210072753|gb|EEA26840.1| HhH-GPD family base excision DNA repair protein [Talaromyces
marneffei ATCC 18224]
Length = 449
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 57/143 (39%), Gaps = 39/143 (27%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVEL------------------ 629
SVDW AVR A +E+ +A+K G + IK LN + E
Sbjct: 234 SVDWNAVRVAPINEVFEAMKSGGLATTKSKYIKGILNMVYEENLARKAAHLKSEEEPGKP 293
Query: 630 ------HGS---------------IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
H S + L+W+ + ++ L+ G+G K+ CV L
Sbjct: 294 GPVGAEHESKAQKEVEIALTDENVLSLDWIHALDKEEAMLELIKYPGIGPKTAACVVLFC 353
Query: 669 LQHIAFPVDVNVGRIAVRLGWVP 691
LQ F VD ++ RI LGW+P
Sbjct: 354 LQRPCFAVDTHIFRICKWLGWLP 376
>gi|126651184|ref|ZP_01723394.1| Nth [Bacillus sp. B14905]
gi|126592022|gb|EAZ86088.1| Nth [Bacillus sp. B14905]
Length = 220
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 73/161 (45%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A I+ RL++ E+ +P +E L+ + G+G K+
Sbjct: 72 ELQQEIRSIGLYRNKAKNIQALCQRLLD-------EYNGEIPA--TREALVTLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ ++K+ P MDK W +
Sbjct: 123 ANVVLSVAFDIPALAVDTHVERVSKRLGLCRWKDSVLEVEETIMKKTP-MDK-----WSK 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+Q+I FG+ CK ++P C CPL +C+
Sbjct: 177 T-----------HHQLIFFGRYHCKAQNPGCHTCPLLSDCR 206
>gi|20093633|ref|NP_613480.1| EndoIII-related endonuclease [Methanopyrus kandleri AV19]
gi|19886502|gb|AAM01410.1| Predicted EndoIII-related endonuclease [Methanopyrus kandleri AV19]
Length = 233
Score = 52.4 bits (124), Expect = 0.001, Method: Composition-based stats.
Identities = 41/162 (25%), Positives = 72/162 (44%), Gaps = 20/162 (12%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
++ + +++ G A IKE R+ L +DLE + P ++ + L+ + G+G K+
Sbjct: 87 DLVETLRDAGLYRQKAKMIKECCERI--LADGLDLEEIVQKPTEEARRELMRLPGVGPKT 144
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
+ V L + H PVD +V R++ RLG KEY + + + P
Sbjct: 145 ADVVLLFAGGHDVCPVDTHVARVSRRLGLTDS------------KEYFEVQEAVHEMVP- 191
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ + H +I FG+ C+ R P C C +R C +
Sbjct: 192 -----EGERGKAHLALIQFGREICRPRKPQCELCFVRRFCPY 228
>gi|228469748|ref|ZP_04054714.1| endonuclease III [Porphyromonas uenonis 60-3]
gi|228308683|gb|EEK17416.1| endonuclease III [Porphyromonas uenonis 60-3]
Length = 214
Score = 52.4 bits (124), Expect = 0.001, Method: Composition-based stats.
Identities = 34/121 (28%), Positives = 53/121 (43%), Gaps = 18/121 (14%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP-LPGDLHIHLLK 705
+E L + G+G KS + + A PVD +V R+A R+G + P + L +
Sbjct: 109 REELEALPGVGRKSASVMLAVCFDTPAMPVDTHVFRVAKRIGLASSRATTPLAVEQALRR 168
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
P ++ L H+Q+I G+ CK R P C C L C+H+A+
Sbjct: 169 RIP-----------------REQLIRAHHQLILLGRYICKARKPLCDECTLTACCRHYAT 211
Query: 766 A 766
A
Sbjct: 212 A 212
>gi|385840352|ref|YP_005863676.1| Endonuclease III [Lactobacillus salivarius CECT 5713]
gi|300214473|gb|ADJ78889.1| Endonuclease III [Lactobacillus salivarius CECT 5713]
Length = 213
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 70/165 (42%), Gaps = 26/165 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD E+A+ +K G A + E LV+ S+ VP K + L+ + G+
Sbjct: 68 ADPKEVAEYVKTIGLYKNKAKYLVECSKMLVDNFNSV-------VP--KTHKELMSLSGV 118
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + V AF VD +V RI+ RL VP + L+ + P D I+ Y
Sbjct: 119 GRKTADVVLAERFGVPAFAVDTHVHRISKRLAIVPEDATVRETEKILMSKVPKEDWIKSY 178
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++MI +G+ C R+P C CPL C+
Sbjct: 179 -----------------HRMIFWGRYQCMARAPKCETCPLLEICQ 206
>gi|189423447|ref|YP_001950624.1| HhH-GPD family protein [Geobacter lovleyi SZ]
gi|189419706|gb|ACD94104.1| HhH-GPD family protein [Geobacter lovleyi SZ]
Length = 225
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 73/161 (45%), Gaps = 19/161 (11%)
Query: 606 IKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECV 664
I+ G N+ A R++ F L + + GS+D L P + + LL ++G+G ++ + +
Sbjct: 76 IRPAGYFNVKAVRLQAFTTFLQQQYQGSLDR--LFAAPWQQTRTELLAVKGIGPETADSI 133
Query: 665 RLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYL 724
L + +F VD RI RLG L+ E D ++ + RL L
Sbjct: 134 LLYAGHKPSFVVDAYTRRIFSRLG--------------LVDERISYDGLRRHFMDRLT-L 178
Query: 725 DQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
D E H ++ GK C+ R P C +C L +C + A+
Sbjct: 179 DTALFNEYHALLVELGKQACRPR-PQCSSCCLAAQCSYNAT 218
>gi|302410315|ref|XP_003002991.1| base excision DNA repair protein [Verticillium albo-atrum VaMs.102]
gi|261358015|gb|EEY20443.1| base excision DNA repair protein [Verticillium albo-atrum VaMs.102]
Length = 327
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 59/139 (42%), Gaps = 17/139 (12%)
Query: 604 DAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVEC 663
+AIK G + + I L + +G+ L+ + + ++ L+ +G+G K+ C
Sbjct: 161 EAIKCGGLAAVKSKVILGILEQAKARYGAYSLDHMFDKTDEEAMRELIGFQGVGPKTASC 220
Query: 664 VRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCY 723
V L L+ +F VD +V RI LGW P + + HL P DK
Sbjct: 221 VLLFCLRRESFAVDTHVWRITGLLGWRPKTASRDETYAHLDVRIPDEDK----------- 269
Query: 724 LDQKTLYELHYQMITFGKI 742
Y LH ++ GK+
Sbjct: 270 ------YGLHILLVKHGKV 282
>gi|307288541|ref|ZP_07568525.1| endonuclease III [Enterococcus faecalis TX0109]
gi|422703006|ref|ZP_16760834.1| endonuclease III [Enterococcus faecalis TX1302]
gi|306500448|gb|EFM69781.1| endonuclease III [Enterococcus faecalis TX0109]
gi|315165565|gb|EFU09582.1| endonuclease III [Enterococcus faecalis TX1302]
Length = 215
Score = 52.4 bits (124), Expect = 0.002, Method: Composition-based stats.
Identities = 48/170 (28%), Positives = 71/170 (41%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI IK G A IK +L+E VP + ++ L+
Sbjct: 63 EALAAAPVEEIIAKIKTIGLYRNKAKNIKACAQQLLERFNG-------EVP--QTRDELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + AF VD +V R++ RL L ++ E +M
Sbjct: 114 SLPGVGRKTANVVMGDAFGEPAFAVDTHVERVSKRLRICKLNANVTEV------EQTLMR 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ MI FG+ C R+P C ACPL C+
Sbjct: 168 KVPKELWVKT-----------HHTMILFGRYHCLARAPKCEACPLLYMCQ 206
>gi|424069009|ref|ZP_17806457.1| endonuclease III [Pseudomonas syringae pv. avellanae str.
ISPaVe013]
gi|407995815|gb|EKG36323.1| endonuclease III [Pseudomonas syringae pv. avellanae str.
ISPaVe013]
Length = 212
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 27/163 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNG-------EVP--QTREALETLPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + IA VD ++ R++ R G P + + ++ L+K P
Sbjct: 123 NVVLNTAFRQIAMAVDTHIFRVSNRTGIAPGKNVV-EVERQLMKFVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C+ R P CG+C + C + A
Sbjct: 169 ----KNYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCDYKA 207
>gi|415726092|ref|ZP_11470593.1| Putative EndoIII-related endonuclease [Gardnerella vaginalis
00703Dmash]
gi|388063965|gb|EIK86533.1| Putative EndoIII-related endonuclease [Gardnerella vaginalis
00703Dmash]
Length = 223
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 55/123 (44%), Gaps = 12/123 (9%)
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
V P K+++ L + G+G K+ VR + FPVD +V R+ RLGW P
Sbjct: 106 FHGVVPSKMED-LTSLPGVGRKTANVVRGNAFGLPGFPVDTHVIRVTGRLGWRKDCNKP- 163
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
K+Y + I+ + Y D L +++I G+ C R+P C CPL+
Sbjct: 164 -------KKYANPESIEKEI---TSYFDPADWANLSHRLILHGRKICTARNPKCALCPLK 213
Query: 758 GEC 760
C
Sbjct: 214 TTC 216
>gi|325283633|ref|YP_004256174.1| iron-sulfur cluster loop [Deinococcus proteolyticus MRP]
gi|324315442|gb|ADY26557.1| iron-sulfur cluster loop [Deinococcus proteolyticus MRP]
Length = 227
Score = 52.0 bits (123), Expect = 0.002, Method: Composition-based stats.
Identities = 52/210 (24%), Positives = 82/210 (39%), Gaps = 38/210 (18%)
Query: 557 KKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERG------ 610
++N+ Q+ W+ L+ Y WEA A+ EI ++ G
Sbjct: 37 QQNTWAVAQRQWEALKAAYP-------------HWEAALLAEPEEIETVLRGAGGGLACS 83
Query: 611 QQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQ 670
+ + G L RL E G L +L +PP + + L + G+G ++ + L L
Sbjct: 84 KSRAVWG----ILRRLEEERGRPSLRFLHRLPPAEARTVLQALPGVGQRTASLLLLFHLA 139
Query: 671 HIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLY 730
A VD N+ R+ RL VP PG D+ +++L L D
Sbjct: 140 QPAAAVDSNIERLLHRLEVVP----PG----------WKADRQELWLEGVLP-ADAPLRA 184
Query: 731 ELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
H + G+ C + +P C AC LR C
Sbjct: 185 AFHRAGVRHGREICTRHAPRCPACVLREWC 214
>gi|398355789|ref|YP_006401253.1| endonuclease III [Sinorhizobium fredii USDA 257]
gi|390131115|gb|AFL54496.1| endonuclease III [Sinorhizobium fredii USDA 257]
Length = 275
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 69/163 (42%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++ DAIK G A + +L+ GS VP K +E L+ + G+G
Sbjct: 116 EEKLRDAIKTIGLYRNKATNVIALSEKLLADFGS-------EVP--KTREELMTLPGVGR 166
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD ++ RIA R+ P + P ++ +L++ P
Sbjct: 167 KTANVVLQMAFGQSTIAVDTHLFRIANRIRLAPGK-TPDEVEANLMRVIP---------- 215
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+K LY H+ +I G+ CK R P C C + CK
Sbjct: 216 -------EKYLYHAHHWLILHGRYVCKARRPECERCIIADICK 251
>gi|297584383|ref|YP_003700163.1| endonuclease III [Bacillus selenitireducens MLS10]
gi|297142840|gb|ADH99597.1| endonuclease III [Bacillus selenitireducens MLS10]
Length = 217
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 54/117 (46%), Gaps = 17/117 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K K L + G+G K+ V ++ A VD +V R++ RLG + ++ L+
Sbjct: 107 KSKTELKKLAGVGRKTANVVASVAFDEPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ P ++ H++MI FG+ CK ++PNC ACPL C+
Sbjct: 167 KKIP-----------------REEWSVTHHRMIFFGRYHCKAQNPNCTACPLSDMCR 206
>gi|159041229|ref|YP_001540481.1| HhH-GPD family protein [Caldivirga maquilingensis IC-167]
gi|157920064|gb|ABW01491.1| HhH-GPD family protein [Caldivirga maquilingensis IC-167]
Length = 233
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 20/143 (13%)
Query: 624 NRLVELHGSI----DLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVN 679
+RLV + +I L L + D+V+ LL ++G+G ++ + + L +L + P+
Sbjct: 99 SRLVRIARTITSMGGLRALSKLSDDEVRVMLLSMDGVGYETADSIMLFALNRVTIPISTY 158
Query: 680 VGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITF 739
R+ R+ + +L Y M L PR Y + L+ H ++T
Sbjct: 159 TIRVIKRI------------YGYLGGGYEDWRLTLMKLLPRGLY--EYKLF--HAGVVTT 202
Query: 740 GKIFCKKRSPNCGACPLRGECKH 762
GK +C K +P C CPLR +C+
Sbjct: 203 GKEWCLKETPRCIECPLRNQCRF 225
>gi|406967816|gb|EKD92806.1| hypothetical protein ACD_28C00327G0009 [uncultured bacterium]
Length = 224
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 71/170 (41%), Gaps = 39/170 (22%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSI--DLEWLRNVPPDKVKEYLLDIEG 655
E + I+ G N A I + +L+E HG I LE L ++P G
Sbjct: 70 EETLRAKIRSCGYFNQKAKNIIKASQKLIEKYHGEIPQSLEDLTSLP------------G 117
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+GLKS V + AFPVD +V R+A R+G V DK
Sbjct: 118 VGLKSAGVVLSQAFAIPAFPVDTHVFRVANRIGLV---------------HEKTRDKTAY 162
Query: 716 YL---WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L P+ C++D H Q+I G+ CK + P C CP++ C++
Sbjct: 163 ALEKTLPQDCWID------FHLQLIFHGRKTCKSQKPRCYECPVKDLCEY 206
>gi|14591284|ref|NP_143362.1| endonuclease III [Pyrococcus horikoshii OT3]
gi|3257923|dbj|BAA30606.1| 222aa long hypothetical endonuclease III [Pyrococcus horikoshii
OT3]
Length = 222
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 18/118 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PDK +E L+ + G+G K V A PVD +V RI+ RLG P P ++
Sbjct: 112 PDKFEE-LIKLPGIGRKCANIVLAYGFGIPAIPVDTHVYRISRRLGLAPWDASPEEVEER 170
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L L PR ++ +Y +++ M+ GK CK P C CPLRG C
Sbjct: 171 L-----------KSLIPR-----EEWIY-VNHAMVDHGKSVCKPIKPRCWECPLRGLC 211
>gi|386346167|ref|YP_006044416.1| HhH-GPD family protein [Spirochaeta thermophila DSM 6578]
gi|339411134|gb|AEJ60699.1| HhH-GPD family protein [Spirochaeta thermophila DSM 6578]
Length = 253
Score = 52.0 bits (123), Expect = 0.002, Method: Composition-based stats.
Identities = 38/124 (30%), Positives = 53/124 (42%), Gaps = 17/124 (13%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLP-GDLHIHLLK 705
+ LL + G+G ++ + + L F VD RI RLG + P ++ + K
Sbjct: 146 RNALLGVWGVGRETADSILLYGYGVPVFVVDAYTRRIFSRLGLLASDDTPYEEVRSAVEK 205
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
P LD E H ++ K FC+KR P CG CPLR EC H +S
Sbjct: 206 TLP---------------LDHVCYNEFHALLVEHAKRFCRKR-PLCGECPLRLECAHLSS 249
Query: 766 AVAS 769
A S
Sbjct: 250 AGGS 253
>gi|320100933|ref|YP_004176525.1| DNA-(apurinic or apyrimidinic site) lyase [Desulfurococcus mucosus
DSM 2162]
gi|319753285|gb|ADV65043.1| DNA-(apurinic or apyrimidinic site) lyase [Desulfurococcus mucosus
DSM 2162]
Length = 223
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 52/120 (43%), Gaps = 18/120 (15%)
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
++ + L+++ G+G K+ + V L FPVD ++ RI R+G+
Sbjct: 114 NEARRRLMELPGVGEKTADVVLLRYFGIPVFPVDTHISRITRRMGFTE------------ 161
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
Y + M YL ELH +IT G+ CK R P C C LR CKH+
Sbjct: 162 TGRYSDVSSFWMENTSPWNYL------ELHLYLITHGRRICKARKPLCDECVLRDLCKHY 215
>gi|429331422|ref|ZP_19212179.1| endonuclease III [Pseudomonas putida CSV86]
gi|428763895|gb|EKX86053.1| endonuclease III [Pseudomonas putida CSV86]
Length = 212
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 70/161 (43%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVE HGS VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRILVEQHGS-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + +A VD ++ R++ R G P K ++K + P+
Sbjct: 123 NVVLNTAFRQLAMAVDTHIFRVSNRTGIAPG------------KNVVEVEKALLKFVPKA 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 171 YLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|66047115|ref|YP_236956.1| endonuclease III/Nth [Pseudomonas syringae pv. syringae B728a]
gi|422675381|ref|ZP_16734726.1| endonuclease III [Pseudomonas syringae pv. aceris str. M302273]
gi|63257822|gb|AAY38918.1| endonuclease III / DNA-(apurinic or apyrimidinic site) lyase
[Pseudomonas syringae pv. syringae B728a]
gi|330973100|gb|EGH73166.1| endonuclease III [Pseudomonas syringae pv. aceris str. M302273]
Length = 212
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 27/163 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + IA VD ++ R++ R G P + + ++ L+K P
Sbjct: 123 NVVLNTAFRQIAMAVDTHIFRVSNRTGIAPGKNVV-EVERQLMKFVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C+ R P CG+C + C + A
Sbjct: 169 ----KNYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCDYKA 207
>gi|23015472|ref|ZP_00055247.1| COG0177: Predicted EndoIII-related endonuclease [Magnetospirillum
magnetotacticum MS-1]
Length = 211
Score = 52.0 bits (123), Expect = 0.002, Method: Composition-based stats.
Identities = 44/162 (27%), Positives = 65/162 (40%), Gaps = 27/162 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E +A +I+ G A + E RL+ LHG VP D+ L + G+G
Sbjct: 69 EEGLAQSIRTIGLYKTKAKNVIELSRRLLALHGG-------QVPHDRAA--LEALPGVGR 119
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD + R+A R G P K ++++ M
Sbjct: 120 KTANVVLNIAFGEPTIAVDTHCFRVANRTGLAPG------------KTVELVEQALMKAT 167
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P K L H+ +I G+ CK R P CGAC +R C
Sbjct: 168 P------AKWLQHAHHWLILHGRYTCKARKPECGACAVRDLC 203
>gi|333901551|ref|YP_004475424.1| endonuclease III [Pseudomonas fulva 12-X]
gi|333116816|gb|AEF23330.1| endonuclease III [Pseudomonas fulva 12-X]
Length = 212
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP ++ E L
Sbjct: 62 EAIYALGVDGLSEYIKTIGLYNSKAKNVIEACRILIEKHGS-------QVPDNR--EDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R++ R G P K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVSNRTGIAPG------------KNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K + PR LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KKLLKFVPRDYLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|82596217|ref|XP_726170.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23481466|gb|EAA17735.1| Drosophila melanogaster CG9272 gene product [Plasmodium yoelii
yoelii]
Length = 386
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 80/180 (44%), Gaps = 28/180 (15%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
K ++ E + E E+ I G N+ + +I + L E + S ++P +
Sbjct: 217 KKHGLNVENILKTSEEELKKLIYGIGFYNVKSKQIIKICQILKEKYNS-------DIPHN 269
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
E L+ + G+G K + + +L +H VD++V RI+ RL WV + D I L
Sbjct: 270 Y--EELIKLPGIGEKVSQLILQTALNKHEGIAVDIHVHRISNRLNWVYTKN-EADTQIKL 326
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
++D++ EL++ ++ FG++ CK + P C C L C+++
Sbjct: 327 K-----------------SFVDKELWSELNHLLVGFGQVICKGKKPLCEKCTLTDYCQYY 369
>gi|424073453|ref|ZP_17810869.1| endonuclease III [Pseudomonas syringae pv. avellanae str.
ISPaVe037]
gi|407996102|gb|EKG36595.1| endonuclease III [Pseudomonas syringae pv. avellanae str.
ISPaVe037]
Length = 212
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 27/163 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + IA VD ++ R++ R G P + + ++ L+K P
Sbjct: 123 NVVLNTAFRQIAMAVDTHIFRVSNRTGIAPGKNVV-EVERQLMKFVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C+ R P CG+C + C + A
Sbjct: 169 ----KNYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCDYKA 207
>gi|296004674|ref|XP_966134.2| endonuclease III homologue, putative [Plasmodium falciparum 3D7]
gi|225631743|emb|CAG25386.2| endonuclease III homologue, putative [Plasmodium falciparum 3D7]
Length = 437
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 58/118 (49%), Gaps = 21/118 (17%)
Query: 648 EYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL-LK 705
E L + G+G K + + +L +H VD++V RIA RL WV + +L + LK
Sbjct: 324 EELKKLPGIGEKIAQLILQTALNKHEGIAVDIHVHRIANRLNWVNSK---NELDTQMKLK 380
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
Y +Q LW E+++ ++ FG++ CK + P C C L +C+++
Sbjct: 381 SY-----VQKELWS-----------EINHVLVGFGQVICKGKKPLCEKCTLTNKCQYY 422
>gi|222087212|ref|YP_002545747.1| endonuclease III [Agrobacterium radiobacter K84]
gi|221724660|gb|ACM27816.1| endonuclease III [Agrobacterium radiobacter K84]
Length = 259
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 52/117 (44%), Gaps = 18/117 (15%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ +E L+ + G+G K+ V ++ H VD +V RIA RL P + P ++ L+
Sbjct: 135 QTREELMTLPGVGRKTANVVMSMAFGHATLAVDTHVFRIANRLLLAPGK-TPDEVEQRLM 193
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K P DQ LY H+ +I G+ CK R P C C + C+
Sbjct: 194 KVIP----------------DQ-YLYHAHHWLILHGRYVCKARKPECERCVIADLCR 233
>gi|229546166|ref|ZP_04434891.1| DNA-(apurinic or apyrimidinic site) lyase [Enterococcus faecalis
TX1322]
gi|256852805|ref|ZP_05558175.1| endonuclease III [Enterococcus faecalis T8]
gi|307291137|ref|ZP_07571022.1| endonuclease III [Enterococcus faecalis TX0411]
gi|422684523|ref|ZP_16742757.1| endonuclease III [Enterococcus faecalis TX4000]
gi|424759883|ref|ZP_18187541.1| endonuclease III [Enterococcus faecalis R508]
gi|229308690|gb|EEN74677.1| DNA-(apurinic or apyrimidinic site) lyase [Enterococcus faecalis
TX1322]
gi|256711264|gb|EEU26302.1| endonuclease III [Enterococcus faecalis T8]
gi|306497791|gb|EFM67323.1| endonuclease III [Enterococcus faecalis TX0411]
gi|315030680|gb|EFT42612.1| endonuclease III [Enterococcus faecalis TX4000]
gi|402404286|gb|EJV36916.1| endonuclease III [Enterococcus faecalis R508]
Length = 215
Score = 52.0 bits (123), Expect = 0.002, Method: Composition-based stats.
Identities = 48/170 (28%), Positives = 71/170 (41%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI IK G A IK +L+E VP + ++ L+
Sbjct: 63 EALAAAPVEEIIAKIKTIGLYRNKAKNIKACAQQLLERFNG-------EVP--QTRDELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + AF VD +V R++ RL L ++ E +M
Sbjct: 114 SLPGVGRKTANVVMGDAFGEPAFAVDTHVERVSKRLRICKLNANVTEV------EQTLMR 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ MI FG+ C R+P C ACPL C+
Sbjct: 168 KVPKELWVKT-----------HHTMIFFGRYHCLARAPKCEACPLLYMCQ 206
>gi|29375731|ref|NP_814885.1| endonuclease III [Enterococcus faecalis V583]
gi|227518409|ref|ZP_03948458.1| DNA-(apurinic or apyrimidinic site) lyase [Enterococcus faecalis
TX0104]
gi|227552941|ref|ZP_03982990.1| DNA-(apurinic or apyrimidinic site) lyase [Enterococcus faecalis
HH22]
gi|229550352|ref|ZP_04439077.1| DNA-(apurinic or apyrimidinic site) lyase [Enterococcus faecalis
ATCC 29200]
gi|255973123|ref|ZP_05423709.1| endonuclease III/Nth [Enterococcus faecalis T1]
gi|255976166|ref|ZP_05426752.1| endonuclease III/Nth [Enterococcus faecalis T2]
gi|256618739|ref|ZP_05475585.1| endonuclease III/Nth [Enterococcus faecalis ATCC 4200]
gi|256762163|ref|ZP_05502743.1| endonuclease III [Enterococcus faecalis T3]
gi|256958651|ref|ZP_05562822.1| endonuclease III/Nth [Enterococcus faecalis DS5]
gi|256962250|ref|ZP_05566421.1| endonuclease III/Nth [Enterococcus faecalis Merz96]
gi|256965444|ref|ZP_05569615.1| endonuclease III/Nth [Enterococcus faecalis HIP11704]
gi|257077995|ref|ZP_05572356.1| endonuclease III/Nth [Enterococcus faecalis JH1]
gi|257082883|ref|ZP_05577244.1| endonuclease III [Enterococcus faecalis E1Sol]
gi|257085585|ref|ZP_05579946.1| endonuclease III/Nth [Enterococcus faecalis Fly1]
gi|257086509|ref|ZP_05580870.1| endonuclease III/Nth [Enterococcus faecalis D6]
gi|257089566|ref|ZP_05583927.1| endonuclease III [Enterococcus faecalis CH188]
gi|257415773|ref|ZP_05592767.1| endonuclease III/Nth [Enterococcus faecalis ARO1/DG]
gi|257418983|ref|ZP_05595977.1| endonuclease III/Nth [Enterococcus faecalis T11]
gi|257422928|ref|ZP_05599918.1| endonuclease III [Enterococcus faecalis X98]
gi|293383279|ref|ZP_06629194.1| endonuclease III [Enterococcus faecalis R712]
gi|293387564|ref|ZP_06632113.1| endonuclease III [Enterococcus faecalis S613]
gi|294781025|ref|ZP_06746377.1| endonuclease III [Enterococcus faecalis PC1.1]
gi|300859914|ref|ZP_07106002.1| endonuclease III [Enterococcus faecalis TUSoD Ef11]
gi|307274543|ref|ZP_07555723.1| endonuclease III [Enterococcus faecalis TX2134]
gi|307278810|ref|ZP_07559873.1| endonuclease III [Enterococcus faecalis TX0860]
gi|312903687|ref|ZP_07762863.1| endonuclease III [Enterococcus faecalis TX0635]
gi|312905786|ref|ZP_07764806.1| endonuclease III [Enterococcus faecalis DAPTO 512]
gi|312909159|ref|ZP_07768018.1| endonuclease III [Enterococcus faecalis DAPTO 516]
gi|312951406|ref|ZP_07770304.1| endonuclease III [Enterococcus faecalis TX0102]
gi|384512896|ref|YP_005707989.1| endonuclease III [Enterococcus faecalis OG1RF]
gi|384518260|ref|YP_005705565.1| endonuclease III [Enterococcus faecalis 62]
gi|397699542|ref|YP_006537330.1| endonuclease III [Enterococcus faecalis D32]
gi|421512166|ref|ZP_15958979.1| Endonuclease III [Enterococcus faecalis ATCC 29212]
gi|422689556|ref|ZP_16747665.1| endonuclease III [Enterococcus faecalis TX0630]
gi|422691633|ref|ZP_16749666.1| endonuclease III [Enterococcus faecalis TX0031]
gi|422694151|ref|ZP_16752155.1| endonuclease III [Enterococcus faecalis TX4244]
gi|422700613|ref|ZP_16758459.1| endonuclease III [Enterococcus faecalis TX1342]
gi|422705899|ref|ZP_16763690.1| endonuclease III [Enterococcus faecalis TX0043]
gi|422711028|ref|ZP_16767962.1| endonuclease III [Enterococcus faecalis TX0027]
gi|422712347|ref|ZP_16769117.1| endonuclease III [Enterococcus faecalis TX0309A]
gi|422718091|ref|ZP_16774763.1| endonuclease III [Enterococcus faecalis TX0309B]
gi|422721186|ref|ZP_16777781.1| endonuclease III [Enterococcus faecalis TX0017]
gi|422721893|ref|ZP_16778473.1| endonuclease III [Enterococcus faecalis TX2137]
gi|422726706|ref|ZP_16783150.1| endonuclease III [Enterococcus faecalis TX0312]
gi|422729113|ref|ZP_16785518.1| endonuclease III [Enterococcus faecalis TX0012]
gi|422733017|ref|ZP_16789343.1| endonuclease III [Enterococcus faecalis TX0645]
gi|422734872|ref|ZP_16791154.1| endonuclease III [Enterococcus faecalis TX1341]
gi|422739571|ref|ZP_16794747.1| endonuclease III [Enterococcus faecalis TX2141]
gi|424671493|ref|ZP_18108492.1| endonuclease III [Enterococcus faecalis 599]
gi|424678455|ref|ZP_18115294.1| endonuclease III [Enterococcus faecalis ERV103]
gi|424681894|ref|ZP_18118678.1| endonuclease III [Enterococcus faecalis ERV116]
gi|424685054|ref|ZP_18121760.1| endonuclease III [Enterococcus faecalis ERV129]
gi|424687158|ref|ZP_18123808.1| endonuclease III [Enterococcus faecalis ERV25]
gi|424692115|ref|ZP_18128629.1| endonuclease III [Enterococcus faecalis ERV31]
gi|424693250|ref|ZP_18129696.1| endonuclease III [Enterococcus faecalis ERV37]
gi|424697506|ref|ZP_18133833.1| endonuclease III [Enterococcus faecalis ERV41]
gi|424700671|ref|ZP_18136854.1| endonuclease III [Enterococcus faecalis ERV62]
gi|424703642|ref|ZP_18139775.1| endonuclease III [Enterococcus faecalis ERV63]
gi|424712278|ref|ZP_18144470.1| endonuclease III [Enterococcus faecalis ERV65]
gi|424718347|ref|ZP_18147596.1| endonuclease III [Enterococcus faecalis ERV68]
gi|424721223|ref|ZP_18150317.1| endonuclease III [Enterococcus faecalis ERV72]
gi|424725925|ref|ZP_18154614.1| endonuclease III [Enterococcus faecalis ERV73]
gi|424727389|ref|ZP_18156021.1| endonuclease III [Enterococcus faecalis ERV81]
gi|424739690|ref|ZP_18168107.1| endonuclease III [Enterococcus faecalis ERV85]
gi|424751534|ref|ZP_18179562.1| endonuclease III [Enterococcus faecalis ERV93]
gi|428766664|ref|YP_007152775.1| endonuclease III [Enterococcus faecalis str. Symbioflor 1]
gi|430360541|ref|ZP_19426330.1| endonuclease III [Enterococcus faecalis OG1X]
gi|430369976|ref|ZP_19428832.1| endonuclease III [Enterococcus faecalis M7]
gi|29343192|gb|AAO80955.1| endonuclease III [Enterococcus faecalis V583]
gi|227074087|gb|EEI12050.1| DNA-(apurinic or apyrimidinic site) lyase [Enterococcus faecalis
TX0104]
gi|227177911|gb|EEI58883.1| DNA-(apurinic or apyrimidinic site) lyase [Enterococcus faecalis
HH22]
gi|229304474|gb|EEN70470.1| DNA-(apurinic or apyrimidinic site) lyase [Enterococcus faecalis
ATCC 29200]
gi|255964141|gb|EET96617.1| endonuclease III/Nth [Enterococcus faecalis T1]
gi|255969038|gb|EET99660.1| endonuclease III/Nth [Enterococcus faecalis T2]
gi|256598266|gb|EEU17442.1| endonuclease III/Nth [Enterococcus faecalis ATCC 4200]
gi|256683414|gb|EEU23109.1| endonuclease III [Enterococcus faecalis T3]
gi|256949147|gb|EEU65779.1| endonuclease III/Nth [Enterococcus faecalis DS5]
gi|256952746|gb|EEU69378.1| endonuclease III/Nth [Enterococcus faecalis Merz96]
gi|256955940|gb|EEU72572.1| endonuclease III/Nth [Enterococcus faecalis HIP11704]
gi|256986025|gb|EEU73327.1| endonuclease III/Nth [Enterococcus faecalis JH1]
gi|256990913|gb|EEU78215.1| endonuclease III [Enterococcus faecalis E1Sol]
gi|256993615|gb|EEU80917.1| endonuclease III/Nth [Enterococcus faecalis Fly1]
gi|256994539|gb|EEU81841.1| endonuclease III/Nth [Enterococcus faecalis D6]
gi|256998378|gb|EEU84898.1| endonuclease III [Enterococcus faecalis CH188]
gi|257157601|gb|EEU87561.1| endonuclease III/Nth [Enterococcus faecalis ARO1/DG]
gi|257160811|gb|EEU90771.1| endonuclease III/Nth [Enterococcus faecalis T11]
gi|257164752|gb|EEU94712.1| endonuclease III [Enterococcus faecalis X98]
gi|291079302|gb|EFE16666.1| endonuclease III [Enterococcus faecalis R712]
gi|291083074|gb|EFE20037.1| endonuclease III [Enterococcus faecalis S613]
gi|294451971|gb|EFG20421.1| endonuclease III [Enterococcus faecalis PC1.1]
gi|295112731|emb|CBL31368.1| DNA-(apurinic or apyrimidinic site) lyase /endonuclease III
[Enterococcus sp. 7L76]
gi|300850732|gb|EFK78481.1| endonuclease III [Enterococcus faecalis TUSoD Ef11]
gi|306504481|gb|EFM73688.1| endonuclease III [Enterococcus faecalis TX0860]
gi|306508695|gb|EFM77785.1| endonuclease III [Enterococcus faecalis TX2134]
gi|310628125|gb|EFQ11408.1| endonuclease III [Enterococcus faecalis DAPTO 512]
gi|310630666|gb|EFQ13949.1| endonuclease III [Enterococcus faecalis TX0102]
gi|310633040|gb|EFQ16323.1| endonuclease III [Enterococcus faecalis TX0635]
gi|311290583|gb|EFQ69139.1| endonuclease III [Enterococcus faecalis DAPTO 516]
gi|315028111|gb|EFT40043.1| endonuclease III [Enterococcus faecalis TX2137]
gi|315031632|gb|EFT43564.1| endonuclease III [Enterococcus faecalis TX0017]
gi|315034925|gb|EFT46857.1| endonuclease III [Enterococcus faecalis TX0027]
gi|315144668|gb|EFT88684.1| endonuclease III [Enterococcus faecalis TX2141]
gi|315148491|gb|EFT92507.1| endonuclease III [Enterococcus faecalis TX4244]
gi|315150361|gb|EFT94377.1| endonuclease III [Enterococcus faecalis TX0012]
gi|315153677|gb|EFT97693.1| endonuclease III [Enterococcus faecalis TX0031]
gi|315156505|gb|EFU00522.1| endonuclease III [Enterococcus faecalis TX0043]
gi|315158331|gb|EFU02348.1| endonuclease III [Enterococcus faecalis TX0312]
gi|315160901|gb|EFU04918.1| endonuclease III [Enterococcus faecalis TX0645]
gi|315168422|gb|EFU12439.1| endonuclease III [Enterococcus faecalis TX1341]
gi|315171005|gb|EFU15022.1| endonuclease III [Enterococcus faecalis TX1342]
gi|315573746|gb|EFU85937.1| endonuclease III [Enterococcus faecalis TX0309B]
gi|315577515|gb|EFU89706.1| endonuclease III [Enterococcus faecalis TX0630]
gi|315582639|gb|EFU94830.1| endonuclease III [Enterococcus faecalis TX0309A]
gi|323480393|gb|ADX79832.1| endonuclease III [Enterococcus faecalis 62]
gi|327534785|gb|AEA93619.1| endonuclease III [Enterococcus faecalis OG1RF]
gi|397336181|gb|AFO43853.1| endonuclease III [Enterococcus faecalis D32]
gi|401674735|gb|EJS81080.1| Endonuclease III [Enterococcus faecalis ATCC 29212]
gi|402350027|gb|EJU84940.1| endonuclease III [Enterococcus faecalis ERV116]
gi|402351433|gb|EJU86317.1| endonuclease III [Enterococcus faecalis ERV103]
gi|402358521|gb|EJU93189.1| endonuclease III [Enterococcus faecalis 599]
gi|402359907|gb|EJU94527.1| endonuclease III [Enterococcus faecalis ERV129]
gi|402360640|gb|EJU95236.1| endonuclease III [Enterococcus faecalis ERV31]
gi|402365539|gb|EJU99958.1| endonuclease III [Enterococcus faecalis ERV25]
gi|402373403|gb|EJV07480.1| endonuclease III [Enterococcus faecalis ERV62]
gi|402374735|gb|EJV08739.1| endonuclease III [Enterococcus faecalis ERV37]
gi|402376056|gb|EJV10026.1| endonuclease III [Enterococcus faecalis ERV41]
gi|402381314|gb|EJV15023.1| endonuclease III [Enterococcus faecalis ERV65]
gi|402381902|gb|EJV15595.1| endonuclease III [Enterococcus faecalis ERV68]
gi|402384517|gb|EJV18069.1| endonuclease III [Enterococcus faecalis ERV63]
gi|402390268|gb|EJV23623.1| endonuclease III [Enterococcus faecalis ERV73]
gi|402391893|gb|EJV25171.1| endonuclease III [Enterococcus faecalis ERV72]
gi|402396860|gb|EJV29904.1| endonuclease III [Enterococcus faecalis ERV81]
gi|402402961|gb|EJV35657.1| endonuclease III [Enterococcus faecalis ERV85]
gi|402405448|gb|EJV38039.1| endonuclease III [Enterococcus faecalis ERV93]
gi|427184837|emb|CCO72061.1| endonuclease III [Enterococcus faecalis str. Symbioflor 1]
gi|429512818|gb|ELA02414.1| endonuclease III [Enterococcus faecalis OG1X]
gi|429515703|gb|ELA05213.1| endonuclease III [Enterococcus faecalis M7]
Length = 215
Score = 52.0 bits (123), Expect = 0.002, Method: Composition-based stats.
Identities = 48/170 (28%), Positives = 71/170 (41%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI IK G A IK +L+E VP + ++ L+
Sbjct: 63 EALAAAPVEEIIAKIKTIGLYRNKAKNIKACAQQLLERFNG-------EVP--QTRDELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + AF VD +V R++ RL L ++ E +M
Sbjct: 114 SLPGVGRKTANVVMGDAFGEPAFAVDTHVERVSKRLRICKLNANVTEV------EQTLMR 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ MI FG+ C R+P C ACPL C+
Sbjct: 168 KVPKELWVKT-----------HHTMIFFGRYHCLARAPKCEACPLLYMCQ 206
>gi|307718171|ref|YP_003873703.1| hypothetical protein STHERM_c04580 [Spirochaeta thermophila DSM
6192]
gi|306531896|gb|ADN01430.1| hypothetical protein STHERM_c04580 [Spirochaeta thermophila DSM
6192]
Length = 253
Score = 52.0 bits (123), Expect = 0.002, Method: Composition-based stats.
Identities = 38/124 (30%), Positives = 55/124 (44%), Gaps = 17/124 (13%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+ LL + G+G ++ + + L F VD RI RLG + P +
Sbjct: 146 RNALLGVWGVGRETADSILLYGYGVPVFVVDAYTRRIFSRLGLLASDDTPYE-------- 197
Query: 707 YPVMDKIQMYLWP-RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
V ++ + P +CY E H ++ K FC+KR P CG CPLR EC H +S
Sbjct: 198 -EVRSAVEEAVPPDHVCY------NEFHALLVEHAKRFCRKR-PLCGECPLRLECAHLSS 249
Query: 766 AVAS 769
A S
Sbjct: 250 AGGS 253
>gi|289676761|ref|ZP_06497651.1| endonuclease III [Pseudomonas syringae pv. syringae FF5]
gi|302184713|ref|ZP_07261386.1| endonuclease III [Pseudomonas syringae pv. syringae 642]
gi|422637597|ref|ZP_16701029.1| endonuclease III [Pseudomonas syringae Cit 7]
gi|422669201|ref|ZP_16729050.1| endonuclease III [Pseudomonas syringae pv. aptata str. DSM 50252]
gi|440742104|ref|ZP_20921433.1| endonuclease III [Pseudomonas syringae BRIP39023]
gi|330949993|gb|EGH50253.1| endonuclease III [Pseudomonas syringae Cit 7]
gi|330981559|gb|EGH79662.1| endonuclease III [Pseudomonas syringae pv. aptata str. DSM 50252]
gi|440377927|gb|ELQ14561.1| endonuclease III [Pseudomonas syringae BRIP39023]
Length = 212
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 27/163 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + IA VD ++ R++ R G P + + ++ L+K P
Sbjct: 123 NVVLNTAFRQIAMAVDTHIFRVSNRTGIAPGKNVV-EVERQLMKFVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C+ R P CG+C + C + A
Sbjct: 169 ----KNYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCDYKA 207
>gi|227485039|ref|ZP_03915355.1| DNA-(apurinic or apyrimidinic site) lyase [Anaerococcus
lactolyticus ATCC 51172]
gi|227237036|gb|EEI87051.1| DNA-(apurinic or apyrimidinic site) lyase [Anaerococcus
lactolyticus ATCC 51172]
Length = 201
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 76/168 (45%), Gaps = 31/168 (18%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
D E+ + I+ G A IK L+E + SI VP DK KE L + G+G
Sbjct: 58 DIKELENHIRTIGIYRNKAKNIKAASKILIEDYNSI-------VPADK-KE-LQKLPGVG 108
Query: 658 LKS--VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
K+ V C + IA VD +V R+A R+G L + +DK Q
Sbjct: 109 RKTANVVCANAFGIPSIA--VDTHVFRVANRIG---------------LADANNVDKTQD 151
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
L RL D+ + H+Q+IT G++ CK R+P C C + C ++
Sbjct: 152 QLEKRL---DKSRWSKTHHQLITHGRVLCKARNPLCEECLINKLCIYY 196
>gi|259501844|ref|ZP_05744746.1| endonuclease III [Lactobacillus antri DSM 16041]
gi|259170169|gb|EEW54664.1| endonuclease III [Lactobacillus antri DSM 16041]
Length = 213
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 56/121 (46%), Gaps = 18/121 (14%)
Query: 640 NVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDL 699
V P +KE L + G+G K + V AFPVD +V R+A RL VP P L
Sbjct: 103 GVVPQTLKE-LTSLPGVGRKVADVVLAECFAIPAFPVDTHVSRVARRLAMVP--PKASLL 159
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
I E +M+ + PR +LD H+ MI +G+ C R+P C CPL
Sbjct: 160 TI----EKKLMEAV-----PRDKWLDA------HHSMIFWGRYRCMARNPRCSDCPLLPM 204
Query: 760 C 760
C
Sbjct: 205 C 205
>gi|307268732|ref|ZP_07550100.1| endonuclease III [Enterococcus faecalis TX4248]
gi|306514860|gb|EFM83407.1| endonuclease III [Enterococcus faecalis TX4248]
Length = 215
Score = 52.0 bits (123), Expect = 0.002, Method: Composition-based stats.
Identities = 48/170 (28%), Positives = 71/170 (41%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI IK G A IK +L+E VP + ++ L+
Sbjct: 63 EALAAAPVDEIIAKIKTIGLYRNKAKNIKACAQQLLERFNG-------EVP--QTRDELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + AF VD +V R++ RL L ++ E +M
Sbjct: 114 SLPGVGRKTANVVMGDAFGEPAFAVDTHVERVSKRLRICKLNANVTEV------EQTLMR 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ MI FG+ C R+P C ACPL C+
Sbjct: 168 KVPKELWVKT-----------HHTMIFFGRYHCLARAPKCEACPLLYMCQ 206
>gi|325279570|ref|YP_004252112.1| endonuclease III [Odoribacter splanchnicus DSM 20712]
gi|324311379|gb|ADY31932.1| endonuclease III [Odoribacter splanchnicus DSM 20712]
Length = 212
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 59/140 (42%), Gaps = 19/140 (13%)
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAV 685
L E+ ++ ++ VP D E L + G+G K+ V ++ A PVD +V R++
Sbjct: 90 LHEMAQKLERDFQGKVPEDM--ELLQTLPGVGRKTANVVMAVAFHKPAMPVDTHVFRVSN 147
Query: 686 RLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK 745
R+G V P + L+K P + L H+ +I G+ C
Sbjct: 148 RIGLVNNTKTPLETEKQLVKNIPA-----------------EILSTAHHWLILHGRYVCL 190
Query: 746 KRSPNCGACPLRGECKHFAS 765
R P C C +R C+ F S
Sbjct: 191 ARKPKCEECGIRQWCRFFQS 210
>gi|315122204|ref|YP_004062693.1| endonuclease III [Candidatus Liberibacter solanacearum CLso-ZC1]
gi|313495606|gb|ADR52205.1| endonuclease III [Candidatus Liberibacter solanacearum CLso-ZC1]
Length = 177
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 69/167 (41%), Gaps = 22/167 (13%)
Query: 597 ADESEIADAIKERGQQNII--AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIE 654
AD + AI E+ QN I G + ++ L ++ E+ +P K E L +
Sbjct: 27 ADTPQKMLAIGEKNLQNHIKTIGIYRRKAKNIISLSHTLINEFDSEIP--KTLEELTRLS 84
Query: 655 GLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQ 714
G+G K + ++ VD ++ RIA R+G P + P + LL+ P
Sbjct: 85 GIGRKGANVILSMAFGIPTIGVDTHIFRIANRIGLAPGKT-PNQVEQSLLRIIP------ 137
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
QK Y HY ++ G+ CK R P C +C + CK
Sbjct: 138 -----------QKHQYNAHYWLVLHGRYVCKARKPQCQSCVISNICK 173
>gi|436842028|ref|YP_007326406.1| conserved protein of unknown function [Desulfovibrio hydrothermalis
AM13 = DSM 14728]
gi|432170934|emb|CCO24305.1| conserved protein of unknown function [Desulfovibrio hydrothermalis
AM13 = DSM 14728]
Length = 217
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 76/170 (44%), Gaps = 16/170 (9%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D + + +R E ++ + I+ G + A R+ FL+ L + L V ++
Sbjct: 58 DGLTMQGLRKFSEEKLQELIRPSGFFRMKAARLNNFLDFLSRNSAECITD-LETVETYEL 116
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E LL+++G+G ++ + + L +L F VD RI R V ++ H L+E
Sbjct: 117 REKLLEVKGIGPETADSILLYALNKPVFVVDAYTRRIFNRHMLVH-----EEIDYHELQE 171
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
+ MD + D E H ++ K +CKK SP+C CPL
Sbjct: 172 F-FMDVLSH---------DVNLFNEYHALIVMTAKNWCKKTSPDCEKCPL 211
>gi|315227180|ref|ZP_07868967.1| endonuclease III [Parascardovia denticolens DSM 10105 = JCM 12538]
gi|315119630|gb|EFT82763.1| endonuclease III [Parascardovia denticolens DSM 10105 = JCM 12538]
Length = 327
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 67/172 (38%), Gaps = 22/172 (12%)
Query: 593 AVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLD 652
A+ A+ SE+A+ I G A N L+ G VP + E L
Sbjct: 175 ALSQANPSEVAEIINSIGFFRTKAQHAVMIANDLITRFGG-------EVP--RTMEELTT 225
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
+ G+G K+ + + FPVD +V R+ RL W D + + KE V
Sbjct: 226 LPGVGRKTANVILGNAFDLPGFPVDTHVMRVTKRLHWRSDWNKTKDDPVAIEKE--VTAA 283
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ W +L +++I FG+ C R P C CPLR C F
Sbjct: 284 FEPTEW-----------RDLSHRLIDFGRDTCHARKPECLICPLRDTCPSFG 324
>gi|429214593|ref|ZP_19205756.1| endonuclease III [Pseudomonas sp. M1]
gi|428154879|gb|EKX01429.1| endonuclease III [Pseudomonas sp. M1]
Length = 212
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 75/171 (43%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP ++ E L
Sbjct: 62 EAIYALGVDGLSEYIKTIGLYNSKAKNVIETCRILIEKHGS-------KVPENR--EDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R+A R P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVANRTNIAPGKNVL-EVERKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
++ L + H+ +I G+ CK R P CG+C + C++
Sbjct: 169 --------------REYLLDAHHWLILHGRYVCKARKPQCGSCRIEDLCEY 205
>gi|269129033|ref|YP_003302403.1| endonuclease III [Thermomonospora curvata DSM 43183]
gi|268313991|gb|ACZ00366.1| endonuclease III [Thermomonospora curvata DSM 43183]
Length = 246
Score = 51.6 bits (122), Expect = 0.002, Method: Composition-based stats.
Identities = 50/186 (26%), Positives = 76/186 (40%), Gaps = 33/186 (17%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD E+ I+ G A I + +L E HG PD++++ L+++ G+
Sbjct: 87 ADREELEKIIRPTGFFRAKADNIIKLGQQLCERHGG--------QVPDRMED-LVELAGV 137
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV-MDKIQM 715
G K+ V + + VD + GR+A R GW + PV +++
Sbjct: 138 GRKTANVVLGNAFEVPGITVDTHFGRLARRFGWTS-------------QTDPVKVEREVA 184
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV----ASAR 771
L PR K L ++MI G+ C R P CG CPL C F A+A+
Sbjct: 185 ELIPR------KEWTILSHRMIWHGRRICHARRPACGVCPLARLCPSFGEGPTDPEAAAK 238
Query: 772 FALPGP 777
GP
Sbjct: 239 LVRTGP 244
>gi|389817685|ref|ZP_10208278.1| endonuclease III [Planococcus antarcticus DSM 14505]
gi|388464453|gb|EIM06784.1| endonuclease III [Planococcus antarcticus DSM 14505]
Length = 226
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 19/121 (15%)
Query: 641 VPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLH 700
VP D+ + L+ + G+G K+ V ++ A VD +V R+A RLG + P +
Sbjct: 105 VPADR--DLLMTLPGVGRKTANVVVSVAFGIPALAVDTHVERVAKRLGLNRWKDNPLQVE 162
Query: 701 IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
++K+ P D W + H+Q+I FG+ CK + P C CPL C
Sbjct: 163 ETIMKKTPADD------WSKT-----------HHQIIFFGRYHCKSQKPGCHICPLFDRC 205
Query: 761 K 761
+
Sbjct: 206 R 206
>gi|367042070|ref|XP_003651415.1| hypothetical protein THITE_2111678 [Thielavia terrestris NRRL 8126]
gi|346998677|gb|AEO65079.1| hypothetical protein THITE_2111678 [Thielavia terrestris NRRL 8126]
Length = 588
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/142 (23%), Positives = 57/142 (40%), Gaps = 27/142 (19%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE------------------- 628
S++WE VR + E+ + IK G A IK+ L+ + E
Sbjct: 365 SINWEKVRLSSHQELVEVIKIAGNGPKKARHIKQILDMVYEENLQRAQGQPSLTKANPIQ 424
Query: 629 --------LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNV 680
+ L+ + ++ D+ + G+G+K+ CV L L+ F VD +V
Sbjct: 425 RGDGTCVATQNLLSLDHMHSMSKDEAMAKFVSYPGIGIKTAACVTLFCLRMPCFAVDTHV 484
Query: 681 GRIAVRLGWVPLQPLPGDLHIH 702
+ LGWVP + P + H
Sbjct: 485 HKFCRWLGWVPAKADPDNCFRH 506
>gi|223477172|ref|YP_002581604.1| Endonuclease III [Thermococcus sp. AM4]
gi|214032398|gb|EEB73228.1| Endonuclease III [Thermococcus sp. AM4]
Length = 239
Score = 51.6 bits (122), Expect = 0.002, Method: Composition-based stats.
Identities = 42/148 (28%), Positives = 63/148 (42%), Gaps = 18/148 (12%)
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL 692
I LE PD + E L+ + G+G K V A PVD +V RI+ RLG P
Sbjct: 108 IILEKYNGKVPDDINE-LMKLPGIGRKCANIVLAYGFGKQAIPVDTHVNRISKRLGLAPP 166
Query: 693 QPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
+ P +K++ YL R +K +Y +++ M+ G+ CK P C
Sbjct: 167 RVPP--------------EKVEEYL--RELIPKEKWIY-VNHAMVDHGRSICKPIRPKCE 209
Query: 753 ACPLRGECKHFASAVASARFALPGPSEK 780
+CPL+ C + V +EK
Sbjct: 210 SCPLKELCPYAKGLVGDEDIKGANKTEK 237
>gi|330995722|ref|ZP_08319620.1| endonuclease III [Paraprevotella xylaniphila YIT 11841]
gi|329574781|gb|EGG56342.1| endonuclease III [Paraprevotella xylaniphila YIT 11841]
Length = 222
Score = 51.6 bits (122), Expect = 0.002, Method: Composition-based stats.
Identities = 43/181 (23%), Positives = 76/181 (41%), Gaps = 27/181 (14%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A I + ++ N A + LVE + S VP D + L+
Sbjct: 63 EAMAAATPETIYEYVRSVSYPNNKAKHLVGMARMLVENYHS-------EVPSDL--DELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHIHLLKEYPVM 710
+ G+G K+ ++ + + A VD +V R++ R+G VP P + L++ +P
Sbjct: 114 KLPGVGRKTANVIQAVVFEKAAMAVDTHVFRVSHRIGLVPATCTTPYSVEKQLVRYFPA- 172
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASA 770
+ + H+ +I G+ C R+P C AC L+ C+H+ +A
Sbjct: 173 ----------------PIIPKAHHWLILHGRYTCTARTPKCEACGLKMICRHYLRTHGAA 216
Query: 771 R 771
R
Sbjct: 217 R 217
>gi|222528627|ref|YP_002572509.1| HhH-GPD family protein [Caldicellulosiruptor bescii DSM 6725]
gi|222455474|gb|ACM59736.1| HhH-GPD family protein [Caldicellulosiruptor bescii DSM 6725]
Length = 225
Score = 51.6 bits (122), Expect = 0.002, Method: Composition-based stats.
Identities = 47/174 (27%), Positives = 75/174 (43%), Gaps = 19/174 (10%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + + +++ IK G N A R+KEF N L S DLE L + +++ LL
Sbjct: 66 EGIFQTSDEMLSELIKPAGYYNQKAKRLKEFCNFLKREFNS-DLEKLFALDISSLRQVLL 124
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+G+G ++ + + L + F VD R+ RLG + + + Y +
Sbjct: 125 SQKGIGFETADSIILYGAEKPIFVVDSYTKRLFYRLGLIESEKI----------SYNDLQ 174
Query: 712 KIQMY-LWPRLCYLDQKTLY-ELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
I M L P Q LY E H ++ K CK + P C C L+ C ++
Sbjct: 175 AIVMTNLTP------QTQLYNEFHALIVKHCKEICKSKKPICNKCCLKLICNYY 222
>gi|293400780|ref|ZP_06644925.1| endonuclease III [Erysipelotrichaceae bacterium 5_2_54FAA]
gi|291305806|gb|EFE47050.1| endonuclease III [Erysipelotrichaceae bacterium 5_2_54FAA]
Length = 215
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 71/171 (41%), Gaps = 28/171 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYL 650
EA+ A+ E+ I+ G A I+ LVE G + P +KE L
Sbjct: 59 EALASANSKEVEAKIRRIGLYRNKARSIQALSASLVEHFDGQV---------PQSMKE-L 108
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ G+G K+ VR + A VD +V RIA RLG + GD + ++ E +
Sbjct: 109 TSLAGVGRKTANVVRSVCFDIPAIAVDTHVERIAKRLGLAKV----GD-SVEVV-EQKLK 162
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
KI+ W R H+ I FG+ C R+P C CP CK
Sbjct: 163 RKIKRERWNRA-----------HHLFIFFGRYQCTARNPQCDQCPFASFCK 202
>gi|300770189|ref|ZP_07080068.1| DNA-(apurinic or apyrimidinic site) lyase [Sphingobacterium
spiritivorum ATCC 33861]
gi|300762665|gb|EFK59482.1| DNA-(apurinic or apyrimidinic site) lyase [Sphingobacterium
spiritivorum ATCC 33861]
Length = 228
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/173 (23%), Positives = 73/173 (42%), Gaps = 28/173 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYL 650
EA+ A E+ I+ N A + N L+E +G + P+++++ L
Sbjct: 63 EALAAASVDEVFSYIRSVSYPNNKAKHLVGMANMLIEKFNGEV---------PEQIED-L 112
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ + G+G K+ + + A VD +V R++ RLG P + L+K
Sbjct: 113 IKLPGVGRKTANVISSVVYNKPAMAVDTHVFRVSNRLGLTSRATTPLAVEKQLVK----- 167
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+L ++T+ H+ +I G+ C R P C CP+ CK++
Sbjct: 168 ------------FLPEETIAVAHHWLILHGRYICLARKPKCEICPITYMCKYY 208
>gi|297243033|ref|ZP_06926971.1| EndoIII-like endonuclease [Gardnerella vaginalis AMD]
gi|296889244|gb|EFH27978.1| EndoIII-like endonuclease [Gardnerella vaginalis AMD]
Length = 224
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 50/113 (44%), Gaps = 15/113 (13%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L+ + G+G K+ V + FPVD +V R+ RLGW +Q P + I
Sbjct: 122 EDLISLPGVGRKTANVVLGNAFGLPGFPVDTHVIRVTGRLGWRKVQNRPNPVAIE----- 176
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+I Y P K L +++I G+ C R+P C +CPL C
Sbjct: 177 ---REITAYFAP-------KEWTNLSHRLILHGRKICTARNPKCVSCPLNTTC 219
>gi|121997381|ref|YP_001002168.1| HhH-GPD family protein [Halorhodospira halophila SL1]
gi|121588786|gb|ABM61366.1| HhH-GPD family protein [Halorhodospira halophila SL1]
Length = 234
Score = 51.6 bits (122), Expect = 0.002, Method: Composition-based stats.
Identities = 43/160 (26%), Positives = 72/160 (45%), Gaps = 19/160 (11%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD E+A AI+ G N+ R++ ++ G ++ L+ + ++E LL + G+
Sbjct: 78 ADPEEVATAIRPAGYFNVKTRRLRNLCITYLQ-EGCMEGMQLQRT--EALREKLLAVNGV 134
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G ++ + + L + F VD RI RLGW+ GD L+ ++
Sbjct: 135 GRETADDILLYAFHRPVFVVDAYTRRILQRLGWI-----QGDEGYERLR-----GGVEAA 184
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
L P + ELH Q++ GK C+ +P C CPL
Sbjct: 185 LGP-----NTAAFNELHAQIVALGKDTCRP-TPRCPDCPL 218
>gi|312128249|ref|YP_003993123.1| hhh-gpd family protein [Caldicellulosiruptor hydrothermalis 108]
gi|311778268|gb|ADQ07754.1| HhH-GPD family protein [Caldicellulosiruptor hydrothermalis 108]
Length = 235
Score = 51.6 bits (122), Expect = 0.003, Method: Composition-based stats.
Identities = 45/173 (26%), Positives = 72/173 (41%), Gaps = 15/173 (8%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + + ++++ IK G N A R+KEF N L S DLE L + +++ LL
Sbjct: 74 EGILQTSDEKLSELIKPAGYYNQKAKRLKEFCNFLKREFNS-DLEKLFALDISSLRQVLL 132
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+G+G ++ + + L + F VD R+ RLG + E +
Sbjct: 133 SQKGIGFETADSIILYGAEKPIFVVDSYTKRLFYRLGLIE-------------SEKAKYE 179
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
IQ + L LD E H ++ K CK + P C C L C + +
Sbjct: 180 DIQNLVMQNL-ELDTYIFNEFHALIVKHCKERCKSKKPMCKNCCLSAFCIYLS 231
>gi|307594552|ref|YP_003900869.1| HhH-GPD family protein [Vulcanisaeta distributa DSM 14429]
gi|307549753|gb|ADN49818.1| HhH-GPD family protein [Vulcanisaeta distributa DSM 14429]
Length = 230
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/198 (24%), Positives = 88/198 (44%), Gaps = 25/198 (12%)
Query: 565 QQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
Q WD++ + + + + + E++ AD + +A+ IK + A R+ +
Sbjct: 53 QTRWDVVHEVLNRLRKLGLNTI-----ESIANADPNYLAEVIKGVNYRFTKAQRLVKLAR 107
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
+ + G LE LR V+++LL+ EG+G ++ + + L +L + P+ R+
Sbjct: 108 NITVIGG---LERLR--LKGDVRDFLLNQEGVGRETADSIMLFALNILTLPISQYTKRVF 162
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
R+ L PG+ +Y + L PR D T +H +IT GK +C
Sbjct: 163 SRV----LGINPGN-------DYDSWKRTLEDLIPR----DLYTYKLVHASVITIGKKYC 207
Query: 745 KKRSPNCGACPLRGECKH 762
+P C CPLR C +
Sbjct: 208 LPDNPLCDKCPLRDLCLY 225
>gi|415709635|ref|ZP_11463260.1| endonuclease III [Gardnerella vaginalis 6420B]
gi|388056307|gb|EIK79183.1| endonuclease III [Gardnerella vaginalis 6420B]
Length = 224
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 50/113 (44%), Gaps = 15/113 (13%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L+ + G+G K+ V + FPVD +V R+ RLGW +Q P + I
Sbjct: 122 EDLISLPGVGRKTANVVLGNAFGLPGFPVDTHVIRVTGRLGWRKVQNRPNPVAIE----- 176
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+I Y P K L +++I G+ C R+P C +CPL C
Sbjct: 177 ---REITAYFAP-------KEWTNLSHRLILHGRKICTARNPKCVSCPLNTTC 219
>gi|422618203|ref|ZP_16686902.1| endonuclease III [Pseudomonas syringae pv. japonica str. M301072]
gi|422628829|ref|ZP_16694036.1| endonuclease III [Pseudomonas syringae pv. pisi str. 1704B]
gi|440722818|ref|ZP_20903192.1| endonuclease III [Pseudomonas syringae BRIP34876]
gi|440725610|ref|ZP_20905874.1| endonuclease III [Pseudomonas syringae BRIP34881]
gi|330898582|gb|EGH30001.1| endonuclease III [Pseudomonas syringae pv. japonica str. M301072]
gi|330937552|gb|EGH41493.1| endonuclease III [Pseudomonas syringae pv. pisi str. 1704B]
gi|440360739|gb|ELP97998.1| endonuclease III [Pseudomonas syringae BRIP34876]
gi|440368405|gb|ELQ05441.1| endonuclease III [Pseudomonas syringae BRIP34881]
Length = 212
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 27/163 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRLLVELHNG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + IA VD ++ R++ R G P + + ++ L+K P
Sbjct: 123 NVVLNTAFRQIAMAVDTHIFRVSNRTGIAPGKNVV-EVERQLMKFVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C+ R P CG+C + C + A
Sbjct: 169 ----KNYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCDYKA 207
>gi|398829286|ref|ZP_10587486.1| putative endoIII-related endonuclease [Phyllobacterium sp. YR531]
gi|398218144|gb|EJN04661.1| putative endoIII-related endonuclease [Phyllobacterium sp. YR531]
Length = 232
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 73/175 (41%), Gaps = 16/175 (9%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
WEAV A I D I + I E L+ +++ +GS++L+ L + ++ +
Sbjct: 66 SWEAVANASPERINDCIYKITFSQTYCVFIPEALSTIIKRYGSLNLDTLSGMTVEQALHF 125
Query: 650 LLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L + G+G K CV S H+ A VD + R A R + Q K Y
Sbjct: 126 LQQLRGVGPKISACVLNFSTLHMRALVVDTHYLRFAARFQLIKSQ-----------KNYV 174
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMIT-FGKIFCKKRSPNCGACPLRGECKH 762
++ L P +L T E H+ ++ + FC +P C CPL C +
Sbjct: 175 KNIRVIQRLVPN-SWLASDT--EEHHVLVKQLAQEFCVAGTPRCARCPLSDLCNY 226
>gi|296242620|ref|YP_003650107.1| HhH-GPD family protein [Thermosphaera aggregans DSM 11486]
gi|296095204|gb|ADG91155.1| HhH-GPD family protein [Thermosphaera aggregans DSM 11486]
Length = 217
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 68/168 (40%), Gaps = 26/168 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A E++I I E G + AG ++E VE G +P E L+ ++G+
Sbjct: 72 ASETDIEAIIGELGLRKR-AGFLRELAQHAVEEFGD-------KIP--NTLEDLMKLKGV 121
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRL-GWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
GL + VR + VD NV R+ R G +P D + + +KI
Sbjct: 122 GLYTANAVRSFAYGMCVPVVDRNVARVLRRFFGLEGEKPAYADREL-----WKFAEKI-- 174
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ E +Y +I G + C R P C CPLR EC +F
Sbjct: 175 --------MPTSACREFNYGLIDLGAMICTSREPQCSRCPLRPECAYF 214
>gi|443642650|ref|ZP_21126500.1| Endonuclease III [Pseudomonas syringae pv. syringae B64]
gi|443282667|gb|ELS41672.1| Endonuclease III [Pseudomonas syringae pv. syringae B64]
Length = 212
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 27/163 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRLLVELHNG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + IA VD ++ R++ R G P + + ++ L+K P
Sbjct: 123 NVVLNTAFRQIAMAVDTHIFRVSNRTGIAPGKNVV-EVERQLMKFVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C+ R P CG+C + C + A
Sbjct: 169 ----KNYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCDYKA 207
>gi|373461989|ref|ZP_09553722.1| endonuclease III [Prevotella maculosa OT 289]
gi|371950166|gb|EHO68024.1| endonuclease III [Prevotella maculosa OT 289]
Length = 236
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 70/171 (40%), Gaps = 29/171 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDKVKEYL 650
EA+ AD EI IK N A + E LV + +G + PD + L
Sbjct: 71 EAMAKADWEEIFQLIKSVSYPNAKAHHLSEMAKMLVAQFNGEV---------PDNTDD-L 120
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHIHLLKEYPV 709
+ G+G K+ V+ + VD +V R++ RLG VP P + + L+K P
Sbjct: 121 TRLPGVGRKTANVVQAVWFGKPTLAVDTHVFRVSHRLGLVPETANTPLKVELELMKYIPE 180
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+D + + H+ ++ G+ CK + P C +CP C
Sbjct: 181 VD-----------------VSDAHHWLLLHGRYVCKSQKPQCESCPFNAFC 214
>gi|114320065|ref|YP_741748.1| HhH-GPD family protein [Alkalilimnicola ehrlichii MLHE-1]
gi|114226459|gb|ABI56258.1| HhH-GPD family protein [Alkalilimnicola ehrlichii MLHE-1]
Length = 221
Score = 51.2 bits (121), Expect = 0.003, Method: Composition-based stats.
Identities = 44/175 (25%), Positives = 70/175 (40%), Gaps = 19/175 (10%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ + +A+ I+ G N+ A R+ + G L L +++ LL
Sbjct: 66 EAMLDCPQEALAEVIRPSGYFNVKARRLLALCRAWQDAGGEAGLAALDTA---TLRQRLL 122
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G ++ + + L + + F +D RI RLG V P P+
Sbjct: 123 AVHGVGRETADDILLYAFERPVFVIDAYTRRIFARLGLVDGDP----------GYEPLRA 172
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
+ L P D + H Q++ GK C+ R P CG CPL C SA
Sbjct: 173 AFERALGP-----DVALYNDYHAQIVALGKATCRPR-PRCGVCPLADRCPSAESA 221
>gi|448400249|ref|ZP_21571316.1| HhH-GPD family protein [Haloterrigena limicola JCM 13563]
gi|445667789|gb|ELZ20429.1| HhH-GPD family protein [Haloterrigena limicola JCM 13563]
Length = 270
Score = 51.2 bits (121), Expect = 0.003, Method: Composition-based stats.
Identities = 48/176 (27%), Positives = 77/176 (43%), Gaps = 24/176 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEY 649
E++ A++S +A+ I G N + I + ++E GS ++++ P V+E
Sbjct: 92 ESLAHAEQSTLAETISSAGLYNQKSETIIDTAEWVLEEFGSAAAFDTFVKDEAPAAVRET 151
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL++ G+G K+ +CV L + + FPVD +V RI RLG P + L ++ P
Sbjct: 152 LLEVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRLGIAPPDADHETVREILERDVP 211
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
C H I FG+ +C R P C ACP+ C
Sbjct: 212 AAK----------CGFG-------HTATIQFGREYCTARKPACLDDPEACPMADLC 250
>gi|448346319|ref|ZP_21535205.1| HhH-GPD family protein [Natrinema altunense JCM 12890]
gi|445632908|gb|ELY86114.1| HhH-GPD family protein [Natrinema altunense JCM 12890]
Length = 310
Score = 51.2 bits (121), Expect = 0.003, Method: Composition-based stats.
Identities = 48/194 (24%), Positives = 82/194 (42%), Gaps = 24/194 (12%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEY 649
E++ A++S +A+ I G N + + + ++E GS ++++ P V+E
Sbjct: 126 ESLASAEQSTLAETISGAGLYNQKSETLIDTAEWVLEEFGSAPAFDAFVKDEGPSIVRET 185
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL + G+G K+ +CV L + + FPVD +V RI R+G P + L ++ P
Sbjct: 186 LLSVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAPPDADHEAVREVLERDVP 245
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
C H I FG+ +C+ R P C ACP+ C+
Sbjct: 246 AAK----------CGFG-------HTATIQFGREYCRARKPACLEDPDACPMADVCEQVG 288
Query: 765 SAVASARFALPGPS 778
A+ P +
Sbjct: 289 VFPATGEVVDPAET 302
>gi|386874791|ref|ZP_10117017.1| putative endonuclease III [Candidatus Nitrosopumilus salaria BD31]
gi|386807414|gb|EIJ66807.1| putative endonuclease III [Candidatus Nitrosopumilus salaria BD31]
Length = 218
Score = 51.2 bits (121), Expect = 0.003, Method: Composition-based stats.
Identities = 37/132 (28%), Positives = 61/132 (46%), Gaps = 20/132 (15%)
Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
R++E+ I ++ VP D L+++ G+G K+ CV + + + A PVD++V RI+
Sbjct: 94 RIIEVAKIIHTKYKGKVPDDL--PTLIELPGVGRKTANCVLVYAFEKPAIPVDIHVHRIS 151
Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
RLG V D E +M KIQ W +++ + +G+ C
Sbjct: 152 NRLGLV-------DTRNPEETEQELMKKIQKKYW-----------IDINDTFVMYGQNIC 193
Query: 745 KKRSPNCGACPL 756
K SP C C +
Sbjct: 194 KPISPMCDVCKI 205
>gi|385805176|ref|YP_005841574.1| HhH-GPD family protein [Fervidicoccus fontis Kam940]
gi|383795039|gb|AFH42122.1| HhH-GPD family protein [Fervidicoccus fontis Kam940]
Length = 225
Score = 51.2 bits (121), Expect = 0.003, Method: Composition-based stats.
Identities = 42/157 (26%), Positives = 67/157 (42%), Gaps = 34/157 (21%)
Query: 615 IAGRIKEFLNRLVELHG--SIDLEWLRNV---PPDKVKEYLLDIEGLGLKSVECVRLLSL 669
I+G K + L G S D +L+N+ PP++ + L I+G+G K+ + +
Sbjct: 89 ISGLYKSKAETIKRLAGFLSKDELFLKNLCKKPPEEARRELTSIKGIGKKTADLFLSVYC 148
Query: 670 QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY-----LWPRLCYL 724
FPVD ++ R+ RL +DK Y W + L
Sbjct: 149 NMPLFPVDRHIKRVTERL----------------------LDKKMDYEEVSQFWREV--L 184
Query: 725 DQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
D+ L E HY++I G+ FC+ + C CPL C+
Sbjct: 185 DRNLLKEAHYKLIYVGRKFCRPKGEKCDLCPLADICR 221
>gi|302349120|ref|YP_003816758.1| endoIII-related endonuclease [Acidilobus saccharovorans 345-15]
gi|302329532|gb|ADL19727.1| predicted EndoIII-related endonuclease [Acidilobus saccharovorans
345-15]
Length = 230
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 58/126 (46%), Gaps = 17/126 (13%)
Query: 635 LEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP 694
+++L + ++V+ LL I G+G K+V+ L + F VD + RIA R G
Sbjct: 112 VDFLEHGDINEVERALLSIRGIGSKTVDVFLSLYRKVPRFAVDTHAKRIAARWG------ 165
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGAC 754
L ++ +++ L L + + E H +I FG+ +C R+P C C
Sbjct: 166 --------LTRKGASYEEVSGAL---LNFFGPERSDEAHRLLIAFGRAYCTARNPRCSEC 214
Query: 755 PLRGEC 760
PLR C
Sbjct: 215 PLRQYC 220
>gi|386714544|ref|YP_006180867.1| endonuclease III [Halobacillus halophilus DSM 2266]
gi|384074100|emb|CCG45593.1| endonuclease III [Halobacillus halophilus DSM 2266]
Length = 220
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 68/161 (42%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ + IK G A IK+ LV+ VPP KE L + G+G K+
Sbjct: 72 ELQNDIKRIGLYRNKAKNIKKLSQTLVDQFNG-------QVPP--TKEELESLAGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+++ P + W
Sbjct: 123 ANVVASVAFNEPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKEE------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H++MI FG+ CK + PNC CPL C+
Sbjct: 176 ----------VTHHRMIFFGRYHCKAKRPNCEECPLLVLCR 206
>gi|68067789|ref|XP_675828.1| endonuclease III [Plasmodium berghei strain ANKA]
gi|56495233|emb|CAH95547.1| endonuclease iii homologue, putative [Plasmodium berghei]
Length = 246
Score = 51.2 bits (121), Expect = 0.003, Method: Composition-based stats.
Identities = 31/117 (26%), Positives = 58/117 (49%), Gaps = 19/117 (16%)
Query: 648 EYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
E L+ + G+G K + + +L +H VD++V RI+ RL WV + E
Sbjct: 133 EELIKLPGIGEKVSQLILQTALNKHEGIAVDIHVHRISNRLNWV-----------YTKNE 181
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K++ ++D++ EL++ ++ FG++ CK + P CG C L C+++
Sbjct: 182 LDTQIKLK-------SFVDKELWSELNHLLVGFGQVICKGKKPLCGKCTLTDYCQYY 231
>gi|291288604|ref|YP_003505420.1| endonuclease III [Denitrovibrio acetiphilus DSM 12809]
gi|290885764|gb|ADD69464.1| endonuclease III [Denitrovibrio acetiphilus DSM 12809]
Length = 210
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 75/169 (44%), Gaps = 27/169 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD ++A IK G + + I LVE HG VP D + LL + G+
Sbjct: 68 ADIEDVAKIIKSTGMYKMKSKNIIGMAKALVENHGG-------EVPQDM--DELLALSGV 118
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ VR Q VD +V RI+ R+G L + +K++
Sbjct: 119 GRKTANVVRGNFWQKPGVVVDTHVKRISGRVG---------------LTDNTTPEKVEKD 163
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
L +L +++ + +++I FG+ C RSP CG C + CK++AS
Sbjct: 164 L-EKLIKGEKQC--DWCHRVIYFGREICTARSPKCGICGVSHVCKYYAS 209
>gi|167754535|ref|ZP_02426662.1| hypothetical protein CLORAM_00037 [Clostridium ramosum DSM 1402]
gi|237733826|ref|ZP_04564307.1| conserved hypothetical protein [Mollicutes bacterium D7]
gi|167705367|gb|EDS19946.1| endonuclease III [Clostridium ramosum DSM 1402]
gi|229383164|gb|EEO33255.1| conserved hypothetical protein [Coprobacillus sp. D7]
Length = 220
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 72/171 (42%), Gaps = 28/171 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EAV A E+ IK G A + + L+E I VP D+ + L
Sbjct: 69 EAVSQASLPELEQDIKTIGLYRNKAKNLLALSHVLIEQFDGI-------VPSDQ--KQLE 119
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ VR ++ AF VD +V RI+ RLG+ D
Sbjct: 120 SLPGVGRKTANVVRSVAFDIPAFAVDTHVERISKRLGFAK------------------RD 161
Query: 712 KIQMYLWPRLCY-LDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ + +LC + + + H+Q I FG+ FCK +P+C C L CK
Sbjct: 162 DNVLTVEKKLCRSIPRNRWNKSHHQFIFFGRYFCKATNPSCTECKLFDMCK 212
>gi|227538653|ref|ZP_03968702.1| DNA-(apurinic or apyrimidinic site) lyase [Sphingobacterium
spiritivorum ATCC 33300]
gi|227241572|gb|EEI91587.1| DNA-(apurinic or apyrimidinic site) lyase [Sphingobacterium
spiritivorum ATCC 33300]
Length = 228
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/175 (22%), Positives = 74/175 (42%), Gaps = 28/175 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYL 650
EA+ A E+ I+ N A + N L+E +G + P+++++ L
Sbjct: 63 EALAVASVDEVFSYIRSVSYPNNKAKHLVGMANMLIEKFNGEV---------PEQIED-L 112
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ + G+G K+ + + A VD +V R++ RLG P + L+K
Sbjct: 113 IKLPGVGRKTANVISSVVYNKPAMAVDTHVFRVSNRLGLTSRATTPLAVEKQLVK----- 167
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
+L ++T+ H+ +I G+ C R P C CP+ CK++ +
Sbjct: 168 ------------FLPEETIAVAHHWLILHGRYICLARKPKCEICPITYMCKYYET 210
>gi|153815003|ref|ZP_01967671.1| hypothetical protein RUMTOR_01218 [Ruminococcus torques ATCC 27756]
gi|317501555|ref|ZP_07959751.1| endonuclease III [Lachnospiraceae bacterium 8_1_57FAA]
gi|331088583|ref|ZP_08337494.1| endonuclease III [Lachnospiraceae bacterium 3_1_46FAA]
gi|336440472|ref|ZP_08620059.1| endonuclease III [Lachnospiraceae bacterium 1_1_57FAA]
gi|145847571|gb|EDK24489.1| endonuclease III [Ruminococcus torques ATCC 27756]
gi|316897066|gb|EFV19141.1| endonuclease III [Lachnospiraceae bacterium 8_1_57FAA]
gi|330407540|gb|EGG87040.1| endonuclease III [Lachnospiraceae bacterium 3_1_46FAA]
gi|336012917|gb|EGN42808.1| endonuclease III [Lachnospiraceae bacterium 1_1_57FAA]
Length = 207
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 51/113 (45%), Gaps = 18/113 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G K+ +R + VD +V RI+ RLG+ Q P + L+KE
Sbjct: 109 EDLTSLAGVGRKTANVIRGNIYHDASVVVDTHVKRISNRLGFTK-QSDPEKIEQDLMKEL 167
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P D LY + Q+ITFG+ C R+P CG C L+ C
Sbjct: 168 PK---------------DHWILYNI--QIITFGRSICTARNPKCGECFLKKYC 203
>gi|227528843|ref|ZP_03958892.1| DNA-(apurinic or apyrimidinic site) lyase [Lactobacillus vaginalis
ATCC 49540]
gi|227351236|gb|EEJ41527.1| DNA-(apurinic or apyrimidinic site) lyase [Lactobacillus vaginalis
ATCC 49540]
Length = 213
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 55/121 (45%), Gaps = 20/121 (16%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P +KE L + G+G K V AFPVD +V R+A RLG V +P L I
Sbjct: 106 PHTMKE-LTSLAGVGRKVANVVLAECFNIPAFPVDTHVSRVARRLGMV--KPNATVLQIE 162
Query: 703 -LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
LKE D+ +LD H+ MI FG+ C ++P C CPL CK
Sbjct: 163 KRLKEAVPKDE----------WLDA------HHAMIFFGRYQCTAKNPKCTKCPLLPICK 206
Query: 762 H 762
+
Sbjct: 207 Y 207
>gi|365157635|ref|ZP_09353887.1| endonuclease III [Bacillus smithii 7_3_47FAA]
gi|363623160|gb|EHL74286.1| endonuclease III [Bacillus smithii 7_3_47FAA]
Length = 229
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A I++ L+E H VP + KE L+++ G+G K+
Sbjct: 73 ELEQDIRSIGLYRTKAKNIQKLCKILIEKHQG-------QVP--ETKEELMELPGVGRKT 123
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+K+ P + W
Sbjct: 124 ANVVVSIAFGQPAIAVDTHVERVSKRLGICRWKDSVWEVEQTLMKKVPKEE------WSI 177
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 178 T-----------HHRLIFFGRYHCKAQSPKCEECPLLSLCR 207
>gi|409197998|ref|ZP_11226661.1| endonuclease III [Marinilabilia salmonicolor JCM 21150]
Length = 218
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 55/123 (44%), Gaps = 18/123 (14%)
Query: 641 VPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLH 700
V PD +KE L + G+G K+ + + Q A VD +V R++ R+G P D
Sbjct: 104 VVPDDIKE-LQKLPGVGRKTANVIASVVYQKPAMAVDTHVFRVSERIGLTVKAKNPLDAE 162
Query: 701 IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ L+K YL ++ L H+ +I G+ C R P C C L+ C
Sbjct: 163 MQLVK-----------------YLPEELLPIAHHWLILHGRYVCLARKPKCEKCGLKLWC 205
Query: 761 KHF 763
K++
Sbjct: 206 KYY 208
>gi|415728037|ref|ZP_11471640.1| Putative EndoIII-related endonuclease [Gardnerella vaginalis
6119V5]
gi|388065563|gb|EIK88045.1| Putative EndoIII-related endonuclease [Gardnerella vaginalis
6119V5]
Length = 223
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 68/165 (41%), Gaps = 22/165 (13%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKEYLLDIEG 655
A ++ IK G + A I RL HG + P ++++ L + G
Sbjct: 73 ASVEQVESIIKPLGFYRVKAQHIIALSLRLESAFHGVV---------PSRMED-LTSLPG 122
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+G K+ VR + FPVD +V R+ RLGW P K+Y + I+
Sbjct: 123 VGRKTANVVRGNAFGLPGFPVDTHVIRVTGRLGWRKDCNKP--------KKYANPESIEK 174
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ Y D L +++I G+ C R+P C CPL+ C
Sbjct: 175 EI---TSYFDPADWTNLSHRLILHGRKVCTARNPKCALCPLKTTC 216
>gi|95929994|ref|ZP_01312734.1| endonuclease III [Desulfuromonas acetoxidans DSM 684]
gi|95133963|gb|EAT15622.1| endonuclease III [Desulfuromonas acetoxidans DSM 684]
Length = 211
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 72/173 (41%), Gaps = 29/173 (16%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKE 648
D A AD E+ + I+ G + A I L + G + PD++ +
Sbjct: 64 DVHAYAQADIHEVEEIIRSIGCYRVKAKHIVAAAQLLCQKFSGQV---------PDQLDD 114
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L+ + G+G K+ V + FPVD +V R+A RLGW Q P + L +
Sbjct: 115 -LIQLPGVGRKTANVVLSNAFDKPGFPVDTHVKRVARRLGWTR-QSDPVKIESELCR--- 169
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++ LW +L +I G+ CK RSP C CP+ +CK
Sbjct: 170 ---YVEPPLWGHTSHL-----------LIYHGREICKARSPQCERCPVENQCK 208
>gi|371777097|ref|ZP_09483419.1| endonuclease III [Anaerophaga sp. HS1]
Length = 214
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 53/124 (42%), Gaps = 18/124 (14%)
Query: 641 VPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLH 700
V PD VKE L + G+G K+ + + Q A VD +V R++ RLG P +
Sbjct: 104 VVPDDVKE-LQKLPGVGRKTANVIASVVYQKPALAVDTHVFRVSARLGLTVNAKTPLETE 162
Query: 701 IHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L+K YL Q + H+ +I G+ C R P C C L+ C
Sbjct: 163 KQLVK-----------------YLPQDLIPIAHHWLILHGRYVCLARKPKCRQCGLKEWC 205
Query: 761 KHFA 764
K++
Sbjct: 206 KYYG 209
>gi|336421321|ref|ZP_08601480.1| endonuclease III [Lachnospiraceae bacterium 5_1_57FAA]
gi|336001507|gb|EGN31644.1| endonuclease III [Lachnospiraceae bacterium 5_1_57FAA]
Length = 208
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 51/116 (43%), Gaps = 18/116 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G K+ +R + VD +V RI+ RLG Q P + L+KE
Sbjct: 109 EELTSLAGVGRKTANVIRGNIYHEPSVVVDTHVKRISNRLGLTKNQD-PEKIEQDLMKEL 167
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P D LY + Q+ITFG+ C RSP C C L+ CK +
Sbjct: 168 PK---------------DHWILYNI--QIITFGRTICTARSPRCEECFLQKYCKEY 206
>gi|227890756|ref|ZP_04008561.1| DNA-(apurinic or apyrimidinic site) lyase [Lactobacillus salivarius
ATCC 11741]
gi|227867165|gb|EEJ74586.1| DNA-(apurinic or apyrimidinic site) lyase [Lactobacillus salivarius
ATCC 11741]
Length = 222
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 68/164 (41%), Gaps = 26/164 (15%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
D E+A IK G A + E LVE S+ VP K + L+ + G+G
Sbjct: 78 DPKEVAKYIKTIGLYKNKAKYLVECSKMLVENFNSV-------VP--KTHKELMSLSGVG 128
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K+ + V AF VD +V RI+ RL VP + L+ + P D I+
Sbjct: 129 RKTADVVLAERFGVPAFAVDTHVHRISKRLAIVPEDATVRETERILMSKVPKEDWIKS-- 186
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H++MI +G+ C R+P C CPL C+
Sbjct: 187 ---------------HHRMIFWGRYQCMARAPKCETCPLLEICQ 215
>gi|333378080|ref|ZP_08469812.1| endonuclease III [Dysgonomonas mossii DSM 22836]
gi|332883585|gb|EGK03866.1| endonuclease III [Dysgonomonas mossii DSM 22836]
Length = 225
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 54/123 (43%), Gaps = 18/123 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD ++E L I G+G K+ + +++ A PVD +V R++ R+G P
Sbjct: 119 PDTLEE-LESIPGVGRKTANVMLIVAFNKPAMPVDTHVFRVSNRIGLTDNSKKPAQTERE 177
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L+K Y+ + L + H+ +I G+ C R P C C L CK+
Sbjct: 178 LIK-----------------YIPSRYLSKAHHWLILHGRYVCLARKPKCEECGLTPFCKY 220
Query: 763 FAS 765
F +
Sbjct: 221 FTN 223
>gi|406938283|gb|EKD71549.1| hypothetical protein ACD_46C00163G0006 [uncultured bacterium]
Length = 210
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/216 (21%), Positives = 91/216 (42%), Gaps = 25/216 (11%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKER 609
R +K P ++ D++LL + + + + ++ + A+ E + E+
Sbjct: 12 RFRKNNPHPTTELIYHTDFELLIAVILSAQA-TDKSVNKATGPLFKIANTPEKMLKLGEK 70
Query: 610 GQQNIIAGRIKEFLNRLVELHGSIDL---EWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
G +N I I + N+ + G+ + ++ NVP D+ E L + G+G K+ +
Sbjct: 71 GLKNYIK-TIGLYTNKAKNIIGACQMLVEKFHSNVPQDR--ESLESLPGVGRKTANVILN 127
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
+ VD ++ R+A R+G + P ++ LLK P K
Sbjct: 128 TCFGQLTIAVDTHIFRVANRIGLAKGKT-PLEVEKQLLKNIPNEFK-------------- 172
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
++ H+ +I G+ C R P C C +R +C++
Sbjct: 173 ---HDAHHWLILHGRYVCTARKPKCPTCIIRDDCEY 205
>gi|397906310|ref|ZP_10507123.1| Endonuclease III [Caloramator australicus RC3]
gi|397160659|emb|CCJ34460.1| Endonuclease III [Caloramator australicus RC3]
Length = 208
Score = 51.2 bits (121), Expect = 0.004, Method: Composition-based stats.
Identities = 43/150 (28%), Positives = 66/150 (44%), Gaps = 20/150 (13%)
Query: 615 IAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAF 674
I G K ++E I ++ NVP +K+++ L+ + G+G K+ V + A
Sbjct: 76 ICGLYKTKAKNIIEASKIIVEQFNGNVP-NKMED-LIKLPGVGRKTANVVLSNAFGKDAI 133
Query: 675 PVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHY 734
VD +V R+A RLG + + L E +M+ I W R H+
Sbjct: 134 AVDTHVFRVANRLGLANAKDV-------LQTEKQLMENIPKSYWSRA-----------HH 175
Query: 735 QMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+I G+ C R P C C LR CK+FA
Sbjct: 176 WLIWHGRKICTARKPKCDLCKLRDFCKYFA 205
>gi|223986376|ref|ZP_03636383.1| hypothetical protein HOLDEFILI_03694 [Holdemania filiformis DSM
12042]
gi|223961667|gb|EEF66172.1| hypothetical protein HOLDEFILI_03694 [Holdemania filiformis DSM
12042]
Length = 226
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 73/174 (41%), Gaps = 26/174 (14%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A +I D I+ G + A I+ +VE G V P ++E L
Sbjct: 59 EALAEAPLEDIEDCIRRIGLYHNKAKSIQGLARGVVEQFG--------GVMPQTMEE-LT 109
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G KS + + A VD +V R++ RLG P D + + K+ +M
Sbjct: 110 SLPGVGRKSANVIMSVCFGMPAIAVDTHVERVSKRLGLAA----PKDTVLEVEKK--LMR 163
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
K+ W H+ I FG+ FCK ++P C CP C+ + +
Sbjct: 164 KLPKAEWSHA-----------HHLFIFFGRYFCKAKNPQCPDCPFTSFCREYKA 206
>gi|381210153|ref|ZP_09917224.1| endonuclease III [Lentibacillus sp. Grbi]
Length = 221
Score = 51.2 bits (121), Expect = 0.004, Method: Composition-based stats.
Identities = 31/117 (26%), Positives = 54/117 (46%), Gaps = 17/117 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ K+ L+ + G+G K+ V ++ + A VD +V R+A RL + ++ L+
Sbjct: 107 RTKQELMKLAGVGRKTANVVASVAFREPAIAVDTHVERVAKRLAICRWKDSVLEVEQTLM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ P + + H++MI FG+ CK R+P C CPL C+
Sbjct: 167 KKVP-----------------KDEWADTHHRMIFFGRYHCKARNPQCPECPLLDLCR 206
>gi|295099302|emb|CBK88391.1| Predicted EndoIII-related endonuclease [Eubacterium cylindroides
T2-87]
Length = 106
Score = 51.2 bits (121), Expect = 0.004, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 52/111 (46%), Gaps = 17/111 (15%)
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVM 710
+ + G+G K+ VR ++ +F VD +V RI+ RLG L K Y +
Sbjct: 1 MSLAGVGRKTANVVRSVAFDIPSFAVDTHVDRISKRLG--------------LAKPYDSV 46
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+K++ L ++ D+ H++ I FG+ C R+P C CP CK
Sbjct: 47 EKVEEKLKRKI---DRDRWNRGHHEFIFFGRYLCHSRNPECYRCPFIDICK 94
>gi|167758647|ref|ZP_02430774.1| hypothetical protein CLOSCI_00987 [Clostridium scindens ATCC 35704]
gi|167663843|gb|EDS07973.1| endonuclease III [Clostridium scindens ATCC 35704]
Length = 208
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 51/116 (43%), Gaps = 18/116 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G K+ +R + VD +V RI+ RLG Q P + L+KE
Sbjct: 109 EELTSLAGVGRKTANVIRGNIYHEPSVVVDTHVKRISNRLGLTKNQD-PEKIEQDLMKEL 167
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P D LY + Q+ITFG+ C RSP C C L+ CK +
Sbjct: 168 PK---------------DHWILYNI--QIITFGRTICTARSPRCEECFLQKYCKEY 206
>gi|28871291|ref|NP_793910.1| endonuclease III [Pseudomonas syringae pv. tomato str. DC3000]
gi|213970873|ref|ZP_03398996.1| endonuclease III [Pseudomonas syringae pv. tomato T1]
gi|301383776|ref|ZP_07232194.1| endonuclease III [Pseudomonas syringae pv. tomato Max13]
gi|302059916|ref|ZP_07251457.1| endonuclease III [Pseudomonas syringae pv. tomato K40]
gi|302130515|ref|ZP_07256505.1| endonuclease III [Pseudomonas syringae pv. tomato NCPPB 1108]
gi|422590290|ref|ZP_16664946.1| endonuclease III [Pseudomonas syringae pv. morsprunorum str.
M302280]
gi|422657112|ref|ZP_16719555.1| endonuclease III [Pseudomonas syringae pv. lachrymans str. M302278]
gi|28854541|gb|AAO57605.1| endonuclease III [Pseudomonas syringae pv. tomato str. DC3000]
gi|213924396|gb|EEB57968.1| endonuclease III [Pseudomonas syringae pv. tomato T1]
gi|330877328|gb|EGH11477.1| endonuclease III [Pseudomonas syringae pv. morsprunorum str.
M302280]
gi|331015677|gb|EGH95733.1| endonuclease III [Pseudomonas syringae pv. lachrymans str. M302278]
Length = 212
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 68/160 (42%), Gaps = 29/160 (18%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
+++ IK G N A + E LVELH G + PD +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNGEV---------PD-TREALEALPGVGRKT 121
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V + + IA VD ++ R++ R G P K ++K M P+
Sbjct: 122 ANVVLNTAFRQIAMAVDTHIFRVSNRTGIAPG------------KNVVEVEKQLMKFVPK 169
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LD H+ +I G+ C+ R P CG+C + C
Sbjct: 170 NYLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLC 203
>gi|406837577|ref|ZP_11097171.1| endonuclease III [Lactobacillus vini DSM 20605]
Length = 218
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 33/116 (28%), Positives = 49/116 (42%), Gaps = 17/116 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ + LL + G+G K+ + V Q A VD +V R+A RL VP + L+
Sbjct: 107 QTRSELLQLAGVGQKTADVVLAECFQIPAIAVDTHVSRVAKRLAIVPKNANVLQIEHTLM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
K+ P Q + H++MI +G+ C R P C CPL C
Sbjct: 167 KKLP-----------------QSSWIAAHHRMIFWGRYQCMARQPKCSTCPLLEIC 205
>gi|322418099|ref|YP_004197322.1| HhH-GPD family protein [Geobacter sp. M18]
gi|320124486|gb|ADW12046.1| HhH-GPD family protein [Geobacter sp. M18]
Length = 221
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 79/175 (45%), Gaps = 26/175 (14%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL----EWLRNVPPDKVK 647
EA++ +A+ I+ G N+ + R+K F+ L+E HGS+D +W ++
Sbjct: 66 EAIQEIHRDRLAELIRPSGFFNVKSVRLKGFVGWLLERHGSLDAMFRGDW------RALR 119
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
+ L+ + G+G ++ + + L + + +F VD R+ RLG V D H
Sbjct: 120 DELIKVRGVGPETCDSILLYAGEKPSFVVDAYTRRLFSRLGVVKED---DDYH------- 169
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
V +L P D E H ++ K C+K+ P+C CPLR C H
Sbjct: 170 RVRSLFMEHLPP-----DVPLFNEYHALIVEQCKRHCRKK-PSCDGCPLRHVCSH 218
>gi|365830798|ref|ZP_09372361.1| endonuclease III [Coprobacillus sp. 3_3_56FAA]
gi|374627265|ref|ZP_09699672.1| endonuclease III [Coprobacillus sp. 8_2_54BFAA]
gi|365262808|gb|EHM92680.1| endonuclease III [Coprobacillus sp. 3_3_56FAA]
gi|373913288|gb|EHQ45126.1| endonuclease III [Coprobacillus sp. 8_2_54BFAA]
Length = 213
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 72/171 (42%), Gaps = 28/171 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EAV A E+ IK G A + + L+E I VP D+ + L
Sbjct: 62 EAVSQASLPELEQDIKTIGLYRNKAKNLLALSHVLIEQFDGI-------VPSDQ--KQLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ VR ++ AF VD +V RI+ RLG+ D
Sbjct: 113 SLPGVGRKTANVVRSVAFDIPAFAVDTHVERISKRLGFAK------------------RD 154
Query: 712 KIQMYLWPRLCY-LDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ + +LC + + + H+Q I FG+ FCK +P+C C L CK
Sbjct: 155 DNVLTVEKKLCRSIPRNRWNKSHHQFIFFGRYFCKATNPSCTECKLVDMCK 205
>gi|438000126|ref|YP_007183859.1| endonuclease III [Candidatus Kinetoplastibacterium blastocrithidii
(ex Strigomonas culicis)]
gi|451813058|ref|YP_007449511.1| endonuclease III [Candidatus Kinetoplastibacterium blastocrithidii
TCC012E]
gi|429339360|gb|AFZ83782.1| endonuclease III [Candidatus Kinetoplastibacterium blastocrithidii
(ex Strigomonas culicis)]
gi|451779027|gb|AGF49907.1| endonuclease III [Candidatus Kinetoplastibacterium blastocrithidii
TCC012E]
Length = 210
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 30/117 (25%), Positives = 53/117 (45%), Gaps = 18/117 (15%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E L + G+G K+ V ++ VD +V R+A R G + P D+ L+K
Sbjct: 109 REELESLPGVGRKTANVVLNVAFNQPTIAVDTHVFRVANRTG-IAKGKNPLDVEKKLIKN 167
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P +K L+ H+ ++ FG+ C KR+P C C ++ C+++
Sbjct: 168 IP-----------------KKYLHNAHHSLVLFGRYICTKRNPKCDQCTIKNLCEYY 207
>gi|422868162|ref|ZP_16914710.1| endonuclease III [Enterococcus faecalis TX1467]
gi|329576491|gb|EGG58001.1| endonuclease III [Enterococcus faecalis TX1467]
Length = 215
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 48/170 (28%), Positives = 71/170 (41%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI IK G A IK +L+E VP + ++ L+
Sbjct: 63 EALAAAPVLEIIAKIKTIGLYRNKAKNIKACAQQLLERFNG-------EVP--QTRDELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + AF VD +V R++ RL L ++ E +M
Sbjct: 114 SLPGVGRKTANVVMGDAFGEPAFAVDTHVERVSKRLRICKLNANVTEV------EQTLMR 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ MI FG+ C R+P C ACPL C+
Sbjct: 168 KVPKELWVKT-----------HHTMIFFGRYHCLARAPKCEACPLLYMCQ 206
>gi|422653507|ref|ZP_16716272.1| endonuclease III [Pseudomonas syringae pv. actinidiae str. M302091]
gi|330966555|gb|EGH66815.1| endonuclease III [Pseudomonas syringae pv. actinidiae str. M302091]
Length = 212
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 68/160 (42%), Gaps = 29/160 (18%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
+++ IK G N A + E LVELH G + PD +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNGEV---------PD-TREALEALPGVGRKT 121
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V + + IA VD ++ R++ R G P K ++K M P+
Sbjct: 122 ANVVLNTAFRQIAMAVDTHIFRVSNRTGIAPG------------KNVVEVEKQLMKFVPK 169
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LD H+ +I G+ C+ R P CG+C + C
Sbjct: 170 NYLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLC 203
>gi|424814711|ref|ZP_18239889.1| putative EndoIII-related endonuclease [Candidatus Nanosalina sp.
J07AB43]
gi|339758327|gb|EGQ43584.1| putative EndoIII-related endonuclease [Candidatus Nanosalina sp.
J07AB43]
Length = 239
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/177 (23%), Positives = 69/177 (38%), Gaps = 21/177 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID---LEWLRNVPPDKV 646
+++ + +D +AD I G A R+K L L E HG + +++L ++
Sbjct: 63 NYKEIENSDVGRLADTINPAGLPQTKAQRLKRTLTALRE-HGEKEDYSVDFLGDMEDSNA 121
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E+L I+G+G K+ + PVD +V RI+ R +P H L +
Sbjct: 122 QEWLEQIKGIGPKTASVLLNFHFDADVIPVDTHVERISKRFRLIPFSASNQKAHDLLNES 181
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P H +I GK C R+P CG L+ C +
Sbjct: 182 VP-----------------HDIKNSFHKLVIEHGKNNCTARNPTCGNTKLKKYCSRY 221
>gi|188997187|ref|YP_001931438.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfurihydrogenibium sp.
YO3AOP1]
gi|188932254|gb|ACD66884.1| DNA-(apurinic or apyrimidinic site) lyase [Sulfurihydrogenibium sp.
YO3AOP1]
Length = 215
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 48/164 (29%), Positives = 67/164 (40%), Gaps = 27/164 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E EI I G A IKE LVE S VP D E LL +G+G
Sbjct: 75 EKEIVKLIYPVGFYRNKAKIIKEISKILVEKFNS-------KVPDDL--ETLLSFKGVGR 125
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V A VDV+V RI+ R+G V + P + L+K P
Sbjct: 126 KTANLVLSEGFGKPAICVDVHVHRISNRIGLVKTKN-PEETEFKLMKILP---------- 174
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+K ++++ ++ FG+ CK P C CP+ C+H
Sbjct: 175 -------KKYWKDINFVLVAFGQTICKPVKPKCKECPIVKYCEH 211
>gi|448298656|ref|ZP_21488684.1| HhH-GPD family protein [Natronorubrum tibetense GA33]
gi|445591326|gb|ELY45532.1| HhH-GPD family protein [Natronorubrum tibetense GA33]
Length = 298
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 46/171 (26%), Positives = 75/171 (43%), Gaps = 24/171 (14%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEYLLDIE 654
A++SE+A+ I G N + + + ++E GS +++++ P +V+ LL +
Sbjct: 122 AEQSELAETISGAGLYNQKSETLIDTAEWVLEEFGSATAFDDYVKDEDPSEVRATLLSVR 181
Query: 655 GLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKI 713
G+G K+ +CV L + + FPVD +V RI RLG P + L ++ P
Sbjct: 182 GVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRLGIAPADADHEAVRAVLERDIPAAK-- 239
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGEC 760
C H I FG+ +C R P C ACP+ C
Sbjct: 240 --------CGF-------AHTATIQFGREYCTARKPACLEDPDACPMGDLC 275
>gi|296282210|ref|ZP_06860208.1| endonuclease III family protein [Citromicrobium bathyomarinum
JL354]
Length = 235
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 44/171 (25%), Positives = 68/171 (39%), Gaps = 14/171 (8%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WEAV A + + + N+ R+K L LV G++DL L + D +L
Sbjct: 70 WEAVADAPLEALQAQLATQTYPNVAGERLKACLTDLVARRGAVDLSHLEPMETDAAMVWL 129
Query: 651 LDIEGLGLK-SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+ G+G K + V +L A +D + R+ R+G VP + + ++ P
Sbjct: 130 EQLPGVGRKIAAGVVNTSTLDRKALVLDSHHRRVLQRMGLVPQKADTARAYAAIMPAMPS 189
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
W Y E H M G+ FC+ S CG CP + C
Sbjct: 190 E-------WSAADY------DEHHLLMKEIGRAFCRPASMACGECPAQALC 227
>gi|239947055|ref|ZP_04698808.1| endonuclease III [Rickettsia endosymbiont of Ixodes scapularis]
gi|239921331|gb|EER21355.1| endonuclease III [Rickettsia endosymbiont of Ixodes scapularis]
Length = 212
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 58/127 (45%), Gaps = 19/127 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P+ KE L+ + G+G K+ V VD +V R+A R+G+ P +
Sbjct: 105 PNDFKE-LVKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVAKRIGFAKGNS-PEIVEKE 162
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LL+ +D+K L HY +I G+ CK R P+C CP++ C++
Sbjct: 163 LLQ-----------------IIDEKWLTHAHYWLILHGRYICKARKPDCDICPIKEYCEY 205
Query: 763 FASAVAS 769
+ +A +S
Sbjct: 206 YINAFSS 212
>gi|395795676|ref|ZP_10474979.1| endonuclease III [Pseudomonas sp. Ag1]
gi|421138346|ref|ZP_15598411.1| Endonuclease III/Nth [Pseudomonas fluorescens BBc6R8]
gi|395340136|gb|EJF71974.1| endonuclease III [Pseudomonas sp. Ag1]
gi|404510514|gb|EKA24419.1| Endonuclease III/Nth [Pseudomonas fluorescens BBc6R8]
Length = 212
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 72/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E LVELHG VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIETCRLLVELHGG-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G I K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KQLMKFVPKPYLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|423334271|ref|ZP_17312051.1| endonuclease III [Lactobacillus reuteri ATCC 53608]
gi|337728079|emb|CCC03169.1| endonuclease III [Lactobacillus reuteri ATCC 53608]
Length = 213
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 18/118 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P +KE L+ + G+G K + V AFPVD +V R+A RL V +P L I
Sbjct: 106 PHTLKE-LMTLSGVGRKVADVVLAECFNIPAFPVDTHVSRVARRLRMV--EPKASVLAI- 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+K M P +LD H+ MI +G+ C R+P C CPL C
Sbjct: 162 --------EKKLMKTIPPEHWLDA------HHSMIFWGRYVCTARNPKCQTCPLLSLC 205
>gi|346976316|gb|EGY19768.1| HhH-GPD family base excision DNA repair protein [Verticillium
dahliae VdLs.17]
Length = 543
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 50/208 (24%), Positives = 76/208 (36%), Gaps = 68/208 (32%)
Query: 588 SVDWEAVRCADESEIADA--------------------------------IKER------ 609
S+ W VR ++ESE+ DA IKER
Sbjct: 295 SIAWNKVRLSEESEVVDAVKRAGLGPTKGRDIKSILDMVHRDNQVRLAAYIKERETGEMA 354
Query: 610 ---GQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRL 666
G QN+ G+ + ++ G + L+ +R + D+ G+G+K+ C+ L
Sbjct: 355 DILGAQNLTQGQKDHQVKKMES--GILTLDHVRGLTSDEAMLEFTKYPGIGVKTSSCLIL 412
Query: 667 LSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQ 726
LQ +F VD +V R L WVP + D ++H E + D ++
Sbjct: 413 FCLQQPSFAVDTHVWRFCKWLKWVPPKASRDDTYMH--GEVRIPDTLK------------ 458
Query: 727 KTLYELHYQMITFGKIFCKKRSPNCGAC 754
Y LH I GK CG C
Sbjct: 459 ---YGLHQLFIRHGK--------ECGRC 475
>gi|325264606|ref|ZP_08131336.1| endonuclease III [Clostridium sp. D5]
gi|324030268|gb|EGB91553.1| endonuclease III [Clostridium sp. D5]
Length = 208
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 52/116 (44%), Gaps = 18/116 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G K+ +R + VD +V RI+ RLG + P + L+KE
Sbjct: 109 EDLTSLAGVGRKTANVIRGNIYYEPSVVVDTHVKRISKRLGLTKHED-PEKIEQDLMKEL 167
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P D LY + Q+ITFG+ C RSP CG C L+ CK +
Sbjct: 168 PK---------------DHWILYNI--QIITFGRSICTARSPKCGECFLQKYCKEY 206
>gi|448446559|ref|ZP_21590781.1| HhH-GPD family protein [Halorubrum saccharovorum DSM 1137]
gi|445683703|gb|ELZ36093.1| HhH-GPD family protein [Halorubrum saccharovorum DSM 1137]
Length = 232
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 46/176 (26%), Positives = 72/176 (40%), Gaps = 20/176 (11%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VELHGSIDLEWLRNVPPDKVKE 648
D+ A+ AD E+ + I+ G + A RI+ L + E G+ L +L + D K
Sbjct: 73 DFAAIEAADHEELRETIRVAGLADQKAARIQRALAAIREETGGAYSLAFLDAMATDDAKA 132
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
+L +I+G+G K+ V VD +V R++ R G VP H L P
Sbjct: 133 WLTEIKGVGPKTASVVLNFHFGKPTMAVDTHVERVSKRFGLVPESASNQAAHDALDALVP 192
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG--ACPLRGECKH 762
D+ T Y LH +I G+ C R +C C +C++
Sbjct: 193 ----------------DELT-YPLHVLLIRHGREHCSARGADCDNPVCEAYCDCEY 231
>gi|409417881|ref|ZP_11257900.1| endonuclease III [Pseudomonas sp. HYS]
Length = 212
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E H S VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIETCRLLIEQHES-------RVP--ETREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R++ R G P + + ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVSNRTGIAPGKTVLE------------VE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KALMKFVPKAYLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|153853551|ref|ZP_01994931.1| hypothetical protein DORLON_00920 [Dorea longicatena DSM 13814]
gi|149753706|gb|EDM63637.1| endonuclease III [Dorea longicatena DSM 13814]
Length = 208
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 56/117 (47%), Gaps = 20/117 (17%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
E LL + G+G K+ +R ++ H+ + VD +V RI+ RLG Q P + L+KE
Sbjct: 109 EDLLSLAGVGRKTANVIRG-NIYHVPSVVVDTHVKRISNRLGLTKNQD-PDKIEQDLMKE 166
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P D L+ +H +ITFG+ C RSP C C L+ CK +
Sbjct: 167 LPE---------------DHWILWNIH--IITFGRTICSARSPKCEDCFLQKYCKEY 206
>gi|212224145|ref|YP_002307381.1| endonuclease III [Thermococcus onnurineus NA1]
gi|212009102|gb|ACJ16484.1| endonuclease III [Thermococcus onnurineus NA1]
Length = 243
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 56/128 (43%), Gaps = 18/128 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD + E L+ + G+G K V A PVD +V RI+ RLG P + P + +
Sbjct: 126 PDDIHE-LMKLPGIGRKCANIVLAYGFGRQAIPVDTHVNRISKRLGLAPPRVQPERVEDY 184
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L + L PR +K +Y +++ M+ GK C+ P C CPLR C +
Sbjct: 185 LRE-----------LIPR-----EKWIY-VNHAMVDHGKTICRPIKPRCDECPLRELCPY 227
Query: 763 FASAVASA 770
V
Sbjct: 228 SKGLVTDG 235
>gi|410459338|ref|ZP_11313089.1| endonuclease III [Bacillus azotoformans LMG 9581]
gi|409930314|gb|EKN67315.1| endonuclease III [Bacillus azotoformans LMG 9581]
Length = 217
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 73/161 (45%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ + I+ G A I++ N L+E +G + VP D++ L+ + G+G K+
Sbjct: 72 ELQNDIRSIGLFRNKAKNIQKLCNMLLEEYGGV-------VPKDRLS--LMKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ E +M K+ LW
Sbjct: 123 ANVVMSVAYGIPAIAVDTHVERVSKRLGICRWKDSVLEV------EETLMKKVPEELWS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ H+++I FG+ CK + P C CPL C+
Sbjct: 176 ----------QTHHRLIFFGRYHCKAQRPQCEICPLLDICR 206
>gi|422696813|ref|ZP_16754764.1| endonuclease III [Enterococcus faecalis TX1346]
gi|315174673|gb|EFU18690.1| endonuclease III [Enterococcus faecalis TX1346]
Length = 215
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 48/170 (28%), Positives = 70/170 (41%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI IK G A IK +L+E VP + ++ L+
Sbjct: 63 EALAAAPVEEIIAKIKTIGLYRNKAKNIKACAQQLLERFNG-------EVP--QTRDELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V AF VD +V R++ RL L ++ E +M
Sbjct: 114 SLPGVGRKTANVVMGDVFGEPAFAVDTHVERVSKRLRICKLNANVTEV------EQTLMR 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ MI FG+ C R+P C ACPL C+
Sbjct: 168 KVPKELWVKT-----------HHTMIFFGRYHCLARAPKCEACPLLYMCQ 206
>gi|255528039|ref|ZP_05394875.1| endonuclease III [Clostridium carboxidivorans P7]
gi|296185709|ref|ZP_06854118.1| endonuclease III [Clostridium carboxidivorans P7]
gi|255508278|gb|EET84682.1| endonuclease III [Clostridium carboxidivorans P7]
gi|296049837|gb|EFG89262.1| endonuclease III [Clostridium carboxidivorans P7]
Length = 212
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 59/128 (46%), Gaps = 19/128 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD ++E LL + G+G K+ V + A VD +V R++ RLG + P +
Sbjct: 104 PDTMEE-LLKLPGVGRKTANVVLSNAFGVPAIAVDTHVFRVSNRLG-IAKGDTPEQVEKG 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L+K P +W + H+ +I G++ CK R P+C CPL C++
Sbjct: 162 LMKNVP------RDMWS-----------DTHHYLIWHGRLICKSRKPDCEKCPLAPYCEY 204
Query: 763 FASAVASA 770
F+ SA
Sbjct: 205 FSGGDTSA 212
>gi|70984374|ref|XP_747701.1| HhH-GPD family base excision DNA repair protein [Aspergillus
fumigatus Af293]
gi|66845328|gb|EAL85663.1| HhH-GPD family base excision DNA repair protein [Aspergillus
fumigatus Af293]
Length = 470
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 32/136 (23%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-------------ELHGSID 634
SV+W+AVR A ++ +AIK G I + +IK L+ + E GS D
Sbjct: 241 SVNWDAVRQAPLKDVFEAIKRGGLAEIKSRKIKAILDMVYQENQERKEFLVKGESDGSSD 300
Query: 635 L-------------------EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
L L + + L+ G+G K+ CV L LQ F
Sbjct: 301 LMPGEKEYEIACADQNFLSLNHLHTLSTEDAMTELVKYPGIGPKTAACVILFCLQRPCFA 360
Query: 676 VDVNVGRIAVRLGWVP 691
VD ++ RI LGWVP
Sbjct: 361 VDTHIFRICKWLGWVP 376
>gi|345880287|ref|ZP_08831842.1| endonuclease III [Prevotella oulorum F0390]
gi|343923641|gb|EGV34327.1| endonuclease III [Prevotella oulorum F0390]
Length = 222
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 76/203 (37%), Gaps = 42/203 (20%)
Query: 560 SDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRI 619
+D + Q L R Y T + M + WE +I + IK N A +
Sbjct: 44 TDKRINQITPELFRHYPTAK-----AMAAATWE--------DIFELIKSVSYPNAKAHHL 90
Query: 620 KEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDV 678
E LVE G + PD +E L + G+G K+ V+ + VD
Sbjct: 91 SEMARMLVEQFQGEV---------PD-TREALTLLPGVGRKTANVVQAVWFGKPTLAVDT 140
Query: 679 NVGRIAVRLGWVPLQP-LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMI 737
+V R++ RLG VP P + + LLK P + + H+ ++
Sbjct: 141 HVFRVSHRLGLVPATANTPLKVELALLKHIPTTQ-----------------VSDAHHWIL 183
Query: 738 TFGKIFCKKRSPNCGACPLRGEC 760
G+ CK + P C CP C
Sbjct: 184 LHGRYICKSQRPQCTVCPFNWFC 206
>gi|87312107|ref|ZP_01094213.1| HhH-GPD protein [Blastopirellula marina DSM 3645]
gi|87285203|gb|EAQ77131.1| HhH-GPD protein [Blastopirellula marina DSM 3645]
Length = 221
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 17/156 (10%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+A+ I+ G + A R++ + +V++H S DLE + D ++E LL + G+G ++
Sbjct: 70 ELAEIIRPAGYYRLKAKRLQNLMRYVVDVH-SGDLEAMFACSVDSLREDLLALNGIGPET 128
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
+ + L + F VD R+ R GW+ + D H + D+ L P
Sbjct: 129 ADAILLYAGNLPTFVVDTYTSRVLKRHGWIEQE---ADYH-------QIQDQFVSQL-PE 177
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
D E H ++ G C+K +P C CPL
Sbjct: 178 ----DVALFNEYHALLVRVGNGHCRK-TPKCETCPL 208
>gi|160886403|ref|ZP_02067406.1| hypothetical protein BACOVA_04414 [Bacteroides ovatus ATCC 8483]
gi|237723268|ref|ZP_04553749.1| endonuclease III [Bacteroides sp. 2_2_4]
gi|293372757|ref|ZP_06619138.1| endonuclease III [Bacteroides ovatus SD CMC 3f]
gi|298482932|ref|ZP_07001114.1| endonuclease III [Bacteroides sp. D22]
gi|299146883|ref|ZP_07039951.1| endonuclease III [Bacteroides sp. 3_1_23]
gi|383113443|ref|ZP_09934215.1| endonuclease III [Bacteroides sp. D2]
gi|423289569|ref|ZP_17268419.1| endonuclease III [Bacteroides ovatus CL02T12C04]
gi|423298066|ref|ZP_17276125.1| endonuclease III [Bacteroides ovatus CL03T12C18]
gi|156108288|gb|EDO10033.1| endonuclease III [Bacteroides ovatus ATCC 8483]
gi|229447790|gb|EEO53581.1| endonuclease III [Bacteroides sp. 2_2_4]
gi|292632266|gb|EFF50863.1| endonuclease III [Bacteroides ovatus SD CMC 3f]
gi|298270904|gb|EFI12483.1| endonuclease III [Bacteroides sp. D22]
gi|298517374|gb|EFI41255.1| endonuclease III [Bacteroides sp. 3_1_23]
gi|313695612|gb|EFS32447.1| endonuclease III [Bacteroides sp. D2]
gi|392664008|gb|EIY57551.1| endonuclease III [Bacteroides ovatus CL03T12C18]
gi|392667280|gb|EIY60790.1| endonuclease III [Bacteroides ovatus CL02T12C04]
Length = 225
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 59/124 (47%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD +++ L+ + G+G K+ ++ + + A VD +V R++ R+G VP P +
Sbjct: 106 PDNLED-LIKLPGVGRKTANVIQSVVFKKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVKNIP-----------------EKLIPIAHHWLILHGRYVCQARTPKCDTCGLQMMCK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|307595236|ref|YP_003901553.1| iron-sulfur cluster loop [Vulcanisaeta distributa DSM 14429]
gi|307550437|gb|ADN50502.1| iron-sulfur cluster loop [Vulcanisaeta distributa DSM 14429]
Length = 307
Score = 50.4 bits (119), Expect = 0.005, Method: Composition-based stats.
Identities = 45/153 (29%), Positives = 73/153 (47%), Gaps = 22/153 (14%)
Query: 619 IKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDV 678
++E ++RL + D P + K YLL I+ L L+ + V+ Q I P+D
Sbjct: 154 VEELIDRLSGMAAYED--------PVRKKAYLL-IKFLVLRGLLKVKQ---QEIKLPIDN 201
Query: 679 NVGRIAVRLGWVPLQPLPGDL---HIHLLKEYPVMDKIQMY-LWP---RLCYLDQKTLYE 731
++ RIA RLG V L+ +L +H +E V ++ + W R+ LD L +
Sbjct: 202 HLTRIAYRLGIVKLEDWVLELMRRDVHFSRELDVELRLAVRDAWDAVIRIGKLDPIALDD 261
Query: 732 LHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
++ FG+ C + P C CPL+G C +A
Sbjct: 262 FLWR---FGRTICVRDKPQCDKCPLKGVCSAYA 291
>gi|229495325|ref|ZP_04389060.1| endonuclease III [Porphyromonas endodontalis ATCC 35406]
gi|229317768|gb|EEN83666.1| endonuclease III [Porphyromonas endodontalis ATCC 35406]
Length = 217
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/171 (22%), Positives = 71/171 (41%), Gaps = 26/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A + EI IK N A + + R+V+ G ++P +E L+
Sbjct: 63 EAMAQASQEEILALIKSISYPNSKAEHLSKMAQRVVQTFGG-------SIP--ATREELM 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ + + VD +V R++ R+G P D + L++
Sbjct: 114 TLPGVGRKTANVILAVLYNQPTMAVDTHVFRVSERIGLTTRAKTPLDTELTLVR------ 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
Y+ + + + H+ +I G+ C RSP C +C + C++
Sbjct: 168 -----------YIPPELIPKAHHWLILHGRYVCLARSPKCSSCGITSWCRY 207
>gi|440737292|ref|ZP_20916864.1| endonuclease III [Pseudomonas fluorescens BRIP34879]
gi|447915541|ref|YP_007396109.1| endonuclease III [Pseudomonas poae RE*1-1-14]
gi|440382271|gb|ELQ18776.1| endonuclease III [Pseudomonas fluorescens BRIP34879]
gi|445199404|gb|AGE24613.1| endonuclease III [Pseudomonas poae RE*1-1-14]
Length = 212
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 69/161 (42%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E L+ELHG VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLMELHGG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G I K ++K M PR
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVEKQLMKFVPRQ 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 171 YLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|340358490|ref|ZP_08681007.1| endonuclease III [Actinomyces sp. oral taxon 448 str. F0400]
gi|339886340|gb|EGQ75998.1| endonuclease III [Actinomyces sp. oral taxon 448 str. F0400]
Length = 220
Score = 50.4 bits (119), Expect = 0.005, Method: Composition-based stats.
Identities = 44/163 (26%), Positives = 71/163 (43%), Gaps = 35/163 (21%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
D S++ ++ G Q A R L++L G + + VPP V+E L+ + G+G
Sbjct: 78 DRSDLEARLRPLGMQRTRAAR-------LIDLGGELIERFDGAVPP--VREMLVTLPGVG 128
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWV----PLQPLPGDLHIHLLKEYPVMDKI 713
K+ V + A VD +VGR++ RLGW P+Q +++
Sbjct: 129 RKTANVVLGDAFGMPAITVDTHVGRLSRRLGWTTSKDPVQ----------------VERD 172
Query: 714 QMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
LW + D +++I G+ C+ RSP CG C L
Sbjct: 173 LAALWEPERWTDG------CHRLIEHGRAVCRSRSPRCGQCAL 209
>gi|159122487|gb|EDP47608.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 470
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 56/136 (41%), Gaps = 32/136 (23%)
Query: 588 SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-------------ELHGSID 634
SV+W+AVR A ++ +AIK G I + +IK L+ + E GS D
Sbjct: 241 SVNWDAVRQAPLKDVFEAIKRGGLAEIKSRKIKAILDMVYQENQERKEFLVKGESDGSSD 300
Query: 635 L-------------------EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
L L + + L+ G+G K+ CV L LQ F
Sbjct: 301 LMPGEKEYEIACADQNFLSLNHLHTLSTEDAMTELVKYPGIGPKTAACVILFCLQRPCFA 360
Query: 676 VDVNVGRIAVRLGWVP 691
VD ++ RI LGWVP
Sbjct: 361 VDTHIFRICKWLGWVP 376
>gi|148543434|ref|YP_001270804.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Lactobacillus reuteri DSM 20016]
gi|184152842|ref|YP_001841183.1| endonuclease III [Lactobacillus reuteri JCM 1112]
gi|227363583|ref|ZP_03847700.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Lactobacillus reuteri MM2-3]
gi|325681777|ref|ZP_08161296.1| endonuclease III [Lactobacillus reuteri MM4-1A]
gi|148530468|gb|ABQ82467.1| DNA-(apurinic or apyrimidinic site) lyase [Lactobacillus reuteri
DSM 20016]
gi|183224186|dbj|BAG24703.1| endonuclease III [Lactobacillus reuteri JCM 1112]
gi|227071379|gb|EEI09685.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Lactobacillus reuteri MM2-3]
gi|324978868|gb|EGC15816.1| endonuclease III [Lactobacillus reuteri MM4-1A]
Length = 213
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 18/118 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P +KE L+ + G+G K + V AFPVD +V R+A RL V +P L I
Sbjct: 106 PHTLKE-LITLSGVGRKVADVVLAECFNIPAFPVDTHVSRVARRLRMV--EPKASVLAI- 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+K M P +LD H+ MI +G+ C R+P C CPL C
Sbjct: 162 --------EKKLMKTIPPEHWLDA------HHSMIFWGRYVCTARNPKCQTCPLLSLC 205
>gi|406025340|ref|YP_006705641.1| Endonuclease III [Cardinium endosymbiont cEper1 of Encarsia
pergandiella]
gi|404432939|emb|CCM10221.1| Endonuclease III [Cardinium endosymbiont cEper1 of Encarsia
pergandiella]
Length = 212
Score = 50.4 bits (119), Expect = 0.005, Method: Composition-based stats.
Identities = 44/175 (25%), Positives = 70/175 (40%), Gaps = 27/175 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + C E + I+ G N A I N L+ + S VP + L+
Sbjct: 62 EKIVCLGEERLKQYIRSIGLFNHKAKNIIALSNLLINHYNS-------RVPNHF--DALV 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V VD +V R+A R+G L + +
Sbjct: 113 QLPGVGRKTANVVLNCLFNQPTIAVDTHVFRVAKRIG---------------LAKGITPE 157
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
K++ L L +DQK H+ +I G+ CK R PNC CP++ C ++ +
Sbjct: 158 KVEQEL---LAAIDQKWRKHAHHWLILHGRYICKARKPNCTICPIQTYCTYYQKS 209
>gi|300712085|ref|YP_003737899.1| DNA-(apurinic or apyrimidinic site) lyase [Halalkalicoccus jeotgali
B3]
gi|448295774|ref|ZP_21485838.1| DNA-(apurinic or apyrimidinic site) lyase [Halalkalicoccus jeotgali
B3]
gi|299125768|gb|ADJ16107.1| DNA-(apurinic or apyrimidinic site) lyase [Halalkalicoccus jeotgali
B3]
gi|445583873|gb|ELY38202.1| DNA-(apurinic or apyrimidinic site) lyase [Halalkalicoccus jeotgali
B3]
Length = 267
Score = 50.4 bits (119), Expect = 0.005, Method: Composition-based stats.
Identities = 52/182 (28%), Positives = 75/182 (41%), Gaps = 34/182 (18%)
Query: 592 EAVRCADESEIADAIKERGQQN-------IIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
EA+ A E+A+ I G N A I+E E ++R P+
Sbjct: 90 EALAEAHRDELAETISSAGLYNQKSDVMIAAAEEIREEFGSEAEFDA-----FVRESEPE 144
Query: 645 KVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
V+ LL+I G+G K+ +CV L S + FPVD +V RI R+G P + + L
Sbjct: 145 TVRSRLLEINGVGPKTADCVLLFSGGRGGVFPVDTHVHRIYRRMGIAPPEADHEAVREVL 204
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGE 759
++ P +K + H I FG+ FC R P C ACPL
Sbjct: 205 EEQVPA----------------EKCGFG-HTASIQFGREFCSARKPACLDGPEACPLYDC 247
Query: 760 CK 761
C+
Sbjct: 248 CE 249
>gi|448419852|ref|ZP_21580696.1| endonuclease III [Halosarcina pallida JCM 14848]
gi|445674766|gb|ELZ27303.1| endonuclease III [Halosarcina pallida JCM 14848]
Length = 227
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 73/174 (41%), Gaps = 36/174 (20%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA AD+ E+AD I N A I+ +VE HG PD + E L
Sbjct: 68 EAFAAADQEELADDIGSITYYNNKAKYIRSAAADIVEKHG--------GEVPDTMSE-LT 118
Query: 652 DIEGLGLKSVECVRLLSLQH-----IAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
D+ G+G K+ V LQH VD +V R++ RLG + + P + L+
Sbjct: 119 DLAGVGRKTANVV----LQHGHDVVEGIVVDTHVQRLSRRLG-ITEEERPERIEEDLMPV 173
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
P D W +L +L I+ G+ C R+P+CGAC L C
Sbjct: 174 VPEED------WQQLTHL-----------FISHGRAVCTARNPDCGACVLEDVC 210
>gi|225850162|ref|YP_002730396.1| endonuclease III (DNA-(apurinic orapyrimidinic site) lyase)
[Persephonella marina EX-H1]
gi|225646537|gb|ACO04723.1| probable endonuclease III (DNA-(apurinic orapyrimidinic site)
lyase) [Persephonella marina EX-H1]
Length = 219
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 70/162 (43%), Gaps = 27/162 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E +IA+AI G A IKE ++V+ G PD + E LL ++G+G
Sbjct: 76 EEKIAEAIYPAGFYRNKAKVIKEISGKIVKDFG--------GKVPDSIDE-LLKLKGVGR 126
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V L A VD +V RI+ RLG+V + + + L K+ P
Sbjct: 127 KTANLVVALGYGKPAICVDTHVHRISNRLGFVKTKT-AEETEMALRKKVP---------- 175
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
++ E++ + FG+ CK SP C CP+ C
Sbjct: 176 -------REYWNEINDLFVAFGQTICKPVSPKCSECPVSSYC 210
>gi|319790163|ref|YP_004151796.1| DNA-(apurinic or apyrimidinic site) lyase [Thermovibrio
ammonificans HB-1]
gi|317114665|gb|ADU97155.1| DNA-(apurinic or apyrimidinic site) lyase [Thermovibrio
ammonificans HB-1]
Length = 219
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 68/163 (41%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E EIA I G N A +KE LVE +G VP D E LL + G+G
Sbjct: 76 EEEIASLIYPVGFYNRKAKNLKEIARILVEHYGG-------QVPSDL--EELLKLPGVGR 126
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V + VD +V RI RLG+V + P + E+ + +K+ W
Sbjct: 127 KTANLVVTQGFKKPGICVDTHVHRIMNRLGFVKTKT-PEE------TEFALREKLPKEFW 179
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
E++ ++ G+ C+ SP C CP+ CK
Sbjct: 180 -----------IEINDLLVALGQHICRPISPKCSQCPIEHLCK 211
>gi|312136836|ref|YP_004004173.1| endonuclease iii ;DNA-(apurinic or apyrimidinic site) lyase
[Methanothermus fervidus DSM 2088]
gi|311224555|gb|ADP77411.1| endonuclease III ;DNA-(apurinic or apyrimidinic site) lyase
[Methanothermus fervidus DSM 2088]
Length = 209
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 49/179 (27%), Positives = 77/179 (43%), Gaps = 31/179 (17%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPP 643
K D++ E + A E ++ IK G + A RIK+ L+ + G + P
Sbjct: 55 KYDTI--EKIANAQEKDLEKLIKCVGFYRVKAKRIKKISKILINKYDGKV---------P 103
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
+KE LL + G+G K+ CV + A PVD +V R+A R+G V + P + L
Sbjct: 104 KNLKE-LLKLPGVGRKTANCVLVYGFNEDAIPVDTHVHRVANRIGLVNTKT-PEETEKTL 161
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K P + E++ + FGK CK +P CP++ CK+
Sbjct: 162 RKIIP-----------------RDYWKEVNKLFVEFGKNICKPTNPKHEKCPIKKFCKY 203
>gi|295690296|ref|YP_003593989.1| iron-sulfur cluster loop [Caulobacter segnis ATCC 21756]
gi|295432199|gb|ADG11371.1| iron-sulfur cluster loop [Caulobacter segnis ATCC 21756]
Length = 241
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 57/132 (43%), Gaps = 16/132 (12%)
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGW 689
G + L L+ + D+ + L + G+G+K CV S L A VD +V R+A R+G
Sbjct: 106 GWLSLSHLKGLTVDQARWELQALPGVGVKVAACVLNFSDLAMRALVVDSHVDRVAKRIGL 165
Query: 690 VPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMIT-FGKIFCKKRS 748
V H M L P D L+ELH+ M G++ C
Sbjct: 166 VGAGDTTHTYHT------------LMGLAPDAWTADD--LFELHWLMKRGLGQMLCPHEG 211
Query: 749 PNCGACPLRGEC 760
P CGACP++ C
Sbjct: 212 PKCGACPVKAMC 223
>gi|126667563|ref|ZP_01738533.1| endonuclease III [Marinobacter sp. ELB17]
gi|126627989|gb|EAZ98616.1| endonuclease III [Marinobacter sp. ELB17]
Length = 212
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 92/220 (41%), Gaps = 35/220 (15%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIY------STGEERSHDKMDSVDW--EAVRCADESE 601
R ++ PK ++ +++LL + G ++ +K+ SV EA+
Sbjct: 12 RLREENPKPVTELNYSSNFELLIAVILSAQATDVGVNKATNKLYSVANTPEAIFALGVDG 71
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+ + IK G N AG + + L++ H S VP + +E L + G+G K+
Sbjct: 72 LKEYIKTIGLFNSKAGNVIKTCRALIDRHAS-------QVP--RTREELEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + IA VD ++ R++ R G P K ++K + L P
Sbjct: 123 NVVLNTAFGQIAMAVDTHIFRVSNRTGIAPG------------KNVLEVEKRLIRLVP-- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
Q+ L + H+ +I G+ C R P CGAC + C+
Sbjct: 169 ----QEFLLDAHHWLILHGRYTCIARKPRCGACLIEDLCE 204
>gi|153808497|ref|ZP_01961165.1| hypothetical protein BACCAC_02791 [Bacteroides caccae ATCC 43185]
gi|423219425|ref|ZP_17205921.1| endonuclease III [Bacteroides caccae CL03T12C61]
gi|149128819|gb|EDM20036.1| endonuclease III [Bacteroides caccae ATCC 43185]
gi|392625320|gb|EIY19389.1| endonuclease III [Bacteroides caccae CL03T12C61]
Length = 225
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD +++ L+ + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 106 PDNMED-LIKLPGVGRKTANVIQSVVFHKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELMKNIP-----------------EKLVPIAHHWLILHGRYVCQARTPKCDTCGLQMMCK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|347753384|ref|YP_004860949.1| endonuclease III [Bacillus coagulans 36D1]
gi|347585902|gb|AEP02169.1| endonuclease III [Bacillus coagulans 36D1]
Length = 219
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A I++ L+E +G R VP + +E L+ + G+G K+
Sbjct: 72 ELQQDIRSIGLYRNKAKNIQKLCRMLIEEYG-------REVP--QTREELMKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG+ + ++ L+K+ P + W
Sbjct: 123 ANVVLSVAFGIPAIAVDTHVERVSKRLGFCRYKDSVLEVEQTLMKKVPKEE------WSI 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H++MI FG+ CK + P C CPL C+
Sbjct: 177 T-----------HHRMIFFGRYHCKAQRPQCEICPLLDLCR 206
>gi|387789483|ref|YP_006254548.1| endonuclease III [Solitalea canadensis DSM 3403]
gi|379652316|gb|AFD05372.1| endonuclease III [Solitalea canadensis DSM 3403]
Length = 217
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 52/125 (41%), Gaps = 18/125 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P V+E L ++ G+G K+ + + VD +V R+A R+G P
Sbjct: 106 PATVEE-LQEMPGVGRKTANVIASVIYSQPKMAVDTHVFRVAARIGLTTNAKTPLAAEKQ 164
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L+K P DKI + H+ +I G+ C RSP C CPL CK+
Sbjct: 165 LVKNIP-QDKIHV----------------AHHWLILHGRYICLARSPKCDICPLTSFCKY 207
Query: 763 FASAV 767
F V
Sbjct: 208 FEQVV 212
>gi|330811337|ref|YP_004355799.1| DNA lyase [Pseudomonas brassicacearum subsp. brassicacearum NFM421]
gi|423698896|ref|ZP_17673386.1| endonuclease III [Pseudomonas fluorescens Q8r1-96]
gi|327379445|gb|AEA70795.1| DNA-(apurinic or apyrimidinic site) lyase [Pseudomonas
brassicacearum subsp. brassicacearum NFM421]
gi|387996602|gb|EIK57932.1| endonuclease III [Pseudomonas fluorescens Q8r1-96]
Length = 212
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 71/171 (41%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLFNSKAKNVIETCRLLIERHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G P K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTGLAPG------------KNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C R P CG+C + C++
Sbjct: 161 KKLMKFVPKEFLLDS------HHWLILHGRYVCLARKPRCGSCRIEDLCEY 205
>gi|297569523|ref|YP_003690867.1| HhH-GPD family protein [Desulfurivibrio alkaliphilus AHT2]
gi|296925438|gb|ADH86248.1| HhH-GPD family protein [Desulfurivibrio alkaliphilus AHT2]
Length = 258
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 42/164 (25%), Positives = 73/164 (44%), Gaps = 17/164 (10%)
Query: 593 AVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLD 652
A+R +A+ I+ G N+ A R+ L + + +G + E + + +++ LL
Sbjct: 107 ALRQLPPEVLAELIRPAGYYNLKARRLHNLLACIADGYGRV--ENFLALAANDLRQQLLA 164
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
++G+G ++ + + L + F VD RI R Q LP + H ++E
Sbjct: 165 VKGIGPETADSICLYAAGKPIFVVDAYTHRIFSRH-----QLLPEEADYHAIQE------ 213
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
++ D E H ++ GK FCKKR+P C CPL
Sbjct: 214 ----IFTDALPADPVLYNEYHALIVRLGKEFCKKRNPRCPECPL 253
>gi|452963265|gb|EME68342.1| EndoIII-related endonuclease [Magnetospirillum sp. SO-1]
Length = 211
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 43/169 (25%), Positives = 68/169 (40%), Gaps = 27/169 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ E +A+ I+ G A + E RL+ HG VP D+ L
Sbjct: 62 EAMVALGEEGLAECIRTIGLYKTKARNVVELSRRLLARHGG-------KVPHDRAA--LE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V ++ VD + R+A R G P + + + L++ P
Sbjct: 113 ALPGVGRKTANVVLNIAFGEPTIAVDTHCFRVANRTGLAPGRTVEA-VEEGLMRATPA-- 169
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ L H+ +I G+ CK R P+CGAC +R C
Sbjct: 170 ---------------RWLQHAHHWLILHGRYVCKARKPDCGACVVRDLC 203
>gi|237798780|ref|ZP_04587241.1| endonuclease III [Pseudomonas syringae pv. oryzae str. 1_6]
gi|331021633|gb|EGI01690.1| endonuclease III [Pseudomonas syringae pv. oryzae str. 1_6]
Length = 212
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 67/161 (41%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
++ IK G N A + E L+ELH VP + +E L + G+G K+
Sbjct: 72 LSGYIKTIGLYNSKAKNVIETCRMLIELHNG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + IA VD ++ R++ R G P K ++K M P+
Sbjct: 123 NVVLNTAFRQIAMAVDTHIFRVSNRTGIAPG------------KNVVEVEKQLMKFVPKN 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C+ R P CG+C + C +
Sbjct: 171 YLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLCDY 205
>gi|226293223|gb|EEH48643.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 519
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 57/134 (42%), Gaps = 25/134 (18%)
Query: 583 HDKMD--SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE----------LH 630
HD + SVDW+ VR A E+ DA+K G + + IK L+ + E +
Sbjct: 280 HDGVGEGSVDWDKVRQATLEEVYDAMKAGGLGRVKSRYIKRILDMVHEEGVARRNAMGMG 339
Query: 631 GSI-------------DLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVD 677
G + L L + D+ G+G+K+ CV L L+ F VD
Sbjct: 340 GDVYQEGDGNGHEHILSLNHLHTLSKDEAMLEFTKYPGIGVKTAACVILFCLRQPCFAVD 399
Query: 678 VNVGRIAVRLGWVP 691
+V R+ LGW+P
Sbjct: 400 THVVRLCKWLGWLP 413
>gi|218888239|ref|YP_002437560.1| HhH-GPD family protein [Desulfovibrio vulgaris str. 'Miyazaki F']
gi|218759193|gb|ACL10092.1| HhH-GPD family protein [Desulfovibrio vulgaris str. 'Miyazaki F']
Length = 223
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 45/170 (26%), Positives = 74/170 (43%), Gaps = 19/170 (11%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E+E+++ I+ G + A R++ L R ++ D L D ++ LL + G+G
Sbjct: 73 EAELSELIRPAGFFRLKAVRLRNLL-RFLDDACGFDFGVLAGQDLDDLRPRLLKVSGVGP 131
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
++ + V L ++ F VD RI R G VP D+ H L++ MD ++
Sbjct: 132 ETADSVLLYAVGLPTFVVDAYTRRILHRHGMVP-----DDIPYHELRDV-FMDVLEP--- 182
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVA 768
D E H ++ K +C+ R+P C CPL C VA
Sbjct: 183 ------DVPLYNEYHALIVRVAKDWCRARAPRCADCPL---CSFLDGGVA 223
>gi|448302984|ref|ZP_21492934.1| HhH-GPD family protein [Natronorubrum sulfidifaciens JCM 14089]
gi|445593991|gb|ELY48158.1| HhH-GPD family protein [Natronorubrum sulfidifaciens JCM 14089]
Length = 270
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 55/212 (25%), Positives = 92/212 (43%), Gaps = 33/212 (15%)
Query: 582 SHDKM------DSVDW-EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSID 634
+HD++ D +D E++ A++S +A+ I G N + I + ++E GS
Sbjct: 75 AHDELMSTYGGDDIDLAESLASAEQSILAETISGAGLYNQKSETIIDTAAWVLEEFGSAA 134
Query: 635 L--EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVP 691
++++ P V++ LL + G+G K+ +CV L + + FPVD +V RI R+G P
Sbjct: 135 AFDTFVKDEAPSTVRDTLLSVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAP 194
Query: 692 LQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
+ L ++ P C H I FG+ +C R+P C
Sbjct: 195 PDADHEAVRAVLERDVPAAK----------CGFG-------HTASIQFGREYCTARTPAC 237
Query: 752 ----GACPLRGECKHFASAVASARFALPGPSE 779
ACP+ C+ V A+ PSE
Sbjct: 238 LEDPDACPMGDICEQV--GVYPETGAVVDPSE 267
>gi|375084054|ref|ZP_09731064.1| endonuclease III [Thermococcus litoralis DSM 5473]
gi|374741220|gb|EHR77648.1| endonuclease III [Thermococcus litoralis DSM 5473]
Length = 237
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 18/120 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PDK++E L+ + G+G K V PVD +V RI+ RLG P + P + +
Sbjct: 125 PDKIEE-LMKLPGIGRKCANIVLAYGFGKQTIPVDTHVNRISKRLGLAPPKVSPEKVEEY 183
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L + P LW +Y +++ M+ GK CK +P C CPL+ C +
Sbjct: 184 LKQLIP------KDLW----------IY-VNHAMVDHGKRICKPINPKCTECPLQDLCPY 226
>gi|398830894|ref|ZP_10589075.1| endonuclease III [Phyllobacterium sp. YR531]
gi|398213474|gb|EJN00068.1| endonuclease III [Phyllobacterium sp. YR531]
Length = 252
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 68/170 (40%), Gaps = 27/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + E ++ D I+ G A + L+ +GS V PD ++ L+
Sbjct: 90 EKMLALGEEKVGDYIRTIGLWRNKAKNVITLSQTLINDYGS--------VVPDS-RDELV 140
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V ++ VD ++ RI RLG P + P + L+K P
Sbjct: 141 KLPGVGRKTANVVLSMAFGQATMAVDTHIFRIGNRLGLAPGK-TPEQVEDKLMKIIPA-- 197
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ LY H+ +I G+ CK R P C AC + CK
Sbjct: 198 ---------------EYLYHAHHWLILHGRYTCKARKPECPACIIADICK 232
>gi|260905496|ref|ZP_05913818.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Brevibacterium linens BL2]
Length = 246
Score = 50.4 bits (119), Expect = 0.006, Method: Composition-based stats.
Identities = 46/177 (25%), Positives = 77/177 (43%), Gaps = 31/177 (17%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKE 648
D ++ A+ E+ + I G A I + N LV+ + G + P+ + +
Sbjct: 78 DAHSLAVANLIEVEELIHSTGFYRAKARNIVKLANELVDTYDGEV---------PNSL-D 127
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L+ + G+G K+ V + VD ++GR+A RLGW ++ P
Sbjct: 128 RLVKLAGVGRKTANVVLGNAFDTPGLTVDTHMGRLARRLGWTE-------------EDDP 174
Query: 709 VMDKIQMY-LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
V + ++ L+P+ K L L +++I G+ C R P CGACPL C F
Sbjct: 175 VKAEHEIAALFPK------KDLTLLSHRVIFHGRRICHSRRPACGACPLMALCPSFG 225
>gi|336414671|ref|ZP_08595017.1| hypothetical protein HMPREF1017_02125 [Bacteroides ovatus
3_8_47FAA]
gi|335933783|gb|EGM95785.1| hypothetical protein HMPREF1017_02125 [Bacteroides ovatus
3_8_47FAA]
Length = 178
Score = 50.1 bits (118), Expect = 0.006, Method: Composition-based stats.
Identities = 32/124 (25%), Positives = 59/124 (47%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD +++ L+ + G+G K+ ++ + + A VD +V R++ R+G VP P +
Sbjct: 59 PDNLED-LIKLPGVGRKTANVIQSVVFKKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 117
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 118 ELVKNIP-----------------EKLIPIAHHWLILHGRYVCQARTPKCDTCGLQMMCK 160
Query: 762 HFAS 765
+F +
Sbjct: 161 YFCN 164
>gi|386811544|ref|ZP_10098769.1| putative glycosylase [planctomycete KSU-1]
gi|386403814|dbj|GAB61650.1| putative glycosylase [planctomycete KSU-1]
Length = 214
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 74/166 (44%), Gaps = 17/166 (10%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ + +E++ I+ G N+ A RIK F++ L +G L L ++ LL
Sbjct: 63 EAIHELNVTELSQLIRPSGFFNVKARRIKTFMDWLFSNYGG-SLSRLFAQDLQTLRSELL 121
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
++G+G ++ + + L + F VD RI R G++P E D
Sbjct: 122 AVKGIGPETADSILLYAGNLPTFVVDTYTYRIFSRHGFIP--------------EESSYD 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
+I+ L D K E H ++ GK+FCK + C CPL+
Sbjct: 168 EIKSLFEENLPK-DVKLFNEYHALLVNTGKMFCKPKK-FCEQCPLK 211
>gi|340782445|ref|YP_004749052.1| HhH-GPD family protein [Acidithiobacillus caldus SM-1]
gi|340556597|gb|AEK58351.1| HhH-GPD family protein [Acidithiobacillus caldus SM-1]
Length = 238
Score = 50.1 bits (118), Expect = 0.006, Method: Composition-based stats.
Identities = 38/134 (28%), Positives = 59/134 (44%), Gaps = 16/134 (11%)
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
++ LL + G+G ++ + + L +LQ VD R+A RLGW D +
Sbjct: 116 LRRKLLGVYGVGEETADSILLYALQRPISVVDAYTKRLAQRLGW-------ADARVSYA- 167
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
+Q + +L D + ELH ++ GK +C+ R P C CPL +C+H S
Sbjct: 168 ------ALQSRMEAQLRRNDVRGRQELHALIVVHGKTYCRSR-PVCADCPLLRDCRH-GS 219
Query: 766 AVASARFALPGPSE 779
V S G E
Sbjct: 220 VVTSGAHHRAGDRE 233
>gi|448358879|ref|ZP_21547553.1| HhH-GPD family protein [Natrialba chahannaoensis JCM 10990]
gi|445644559|gb|ELY97572.1| HhH-GPD family protein [Natrialba chahannaoensis JCM 10990]
Length = 299
Score = 50.1 bits (118), Expect = 0.006, Method: Composition-based stats.
Identities = 39/123 (31%), Positives = 53/123 (43%), Gaps = 22/123 (17%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHI 701
PD V+E LLD+ G+G K+ +CV L + + FPVD +V RI R+G P + ++
Sbjct: 174 PDTVRETLLDVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAPAEADHEEVRA 233
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLR 757
L E P C H I FG+ C R P C ACP+
Sbjct: 234 VLEAEVPAAK----------CGFG-------HTATIQFGRECCTARKPACLEDPEACPMA 276
Query: 758 GEC 760
C
Sbjct: 277 DIC 279
>gi|373952763|ref|ZP_09612723.1| endonuclease III [Mucilaginibacter paludis DSM 18603]
gi|373889363|gb|EHQ25260.1| endonuclease III [Mucilaginibacter paludis DSM 18603]
Length = 253
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 51/121 (42%), Gaps = 18/121 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P + E L + G+G K+ + + A VD +V R++ RLG P +
Sbjct: 106 PSDINE-LQKMPGVGRKTANVIASVVYDAPAIAVDTHVFRVSNRLGLTTNANTPLAVEKQ 164
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L+K YL Q TL H+ +I G+ C RSP C CPL CK+
Sbjct: 165 LVK-----------------YLPQNTLAIAHHWLILHGRYICVARSPKCDICPLTWFCKY 207
Query: 763 F 763
+
Sbjct: 208 Y 208
>gi|340354779|ref|ZP_08677477.1| endonuclease III [Sporosarcina newyorkensis 2681]
gi|339623031|gb|EGQ27540.1| endonuclease III [Sporosarcina newyorkensis 2681]
Length = 215
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 63/136 (46%), Gaps = 26/136 (19%)
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAV 685
L+E HG VP + + ++ + G+G K+ V + A VD +V R+A
Sbjct: 97 LIEEHGG-------EVPASR--DVMMTLPGVGRKTANVVVSNAFGVPALAVDTHVERVAK 147
Query: 686 RLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK 745
RLG + P D+ +++ P M+K W + H+Q+I FG+ CK
Sbjct: 148 RLGMNRWKDRPIDVEEKIMRWTP-MEK-----WT-----------QTHHQIIFFGRYHCK 190
Query: 746 KRSPNCGACPLRGECK 761
++PNC CPL C+
Sbjct: 191 AQNPNCPECPLLHLCR 206
>gi|91201716|emb|CAJ74776.1| conserved hypothetical protein [Candidatus Kuenenia
stuttgartiensis]
Length = 216
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 76/172 (44%), Gaps = 17/172 (9%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
K++ + + + E+A+ ++ G N+ A R+K F+N L + +L + +
Sbjct: 58 KVNKLTPKGIHSLSLPELAELVRPSGFFNVKAKRVKTFVNWLFSKYEG-NLTAMFHQDCR 116
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
++E LL I G+G ++ + + L + F +D RI R G VP
Sbjct: 117 TLREELLSINGIGPETADSILLYAGNMPTFVIDAYTHRIFSRHGLVP------------- 163
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
E D+++ + L D K E H ++ GK+FCK + C CPL
Sbjct: 164 -EESAYDEMKSFFEDNLPE-DTKLFNEYHALIVNIGKLFCKPKRV-CEQCPL 212
>gi|422299741|ref|ZP_16387298.1| endonuclease III [Pseudomonas avellanae BPIC 631]
gi|407988260|gb|EKG30850.1| endonuclease III [Pseudomonas avellanae BPIC 631]
Length = 212
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 69/160 (43%), Gaps = 29/160 (18%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
+++ IK G N A + E LVELH G + PD +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNGEV---------PD-TREALEALPGVGRKT 121
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V + + IA VD ++ R++ R G P + + ++ L+K P
Sbjct: 122 ANVVLNTAFRQIAMAVDTHIFRVSNRTGIAPGKNVV-EVEKQLMKFVP------------ 168
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L + H+ +I G+ C+ R P CG+C + C
Sbjct: 169 -----NNYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLC 203
>gi|255021570|ref|ZP_05293613.1| HhH-GPD family protein [Acidithiobacillus caldus ATCC 51756]
gi|254968958|gb|EET26477.1| HhH-GPD family protein [Acidithiobacillus caldus ATCC 51756]
Length = 238
Score = 50.1 bits (118), Expect = 0.007, Method: Composition-based stats.
Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 15/125 (12%)
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
++ LL + G+G ++ + + L +LQ VD R+A RLGW D +
Sbjct: 116 LRRKLLGVYGVGEETADSILLYALQRPISVVDAYTKRLAQRLGW-------ADARVSYA- 167
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
+Q + +L D + ELH ++ GK +C+ R P C CPL +C+H +
Sbjct: 168 ------ALQSRMEAQLRRNDVRGRQELHALIVVHGKTYCRSR-PVCADCPLLRDCRHGSV 220
Query: 766 AVASA 770
+ A
Sbjct: 221 VTSGA 225
>gi|373451906|ref|ZP_09543824.1| endonuclease III [Eubacterium sp. 3_1_31]
gi|371967338|gb|EHO84809.1| endonuclease III [Eubacterium sp. 3_1_31]
Length = 215
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 18/119 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P +KE L + G+G K+ VR + A VD +V RIA RLG + GD +
Sbjct: 102 PQSMKE-LTSLAGVGRKTANVVRSVCFDIPAIAVDTHVERIAKRLGLAKV----GD-SVE 155
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++ E + KI+ W R H+ I FG+ C R+P C CP CK
Sbjct: 156 VV-EQKLKRKIKRERWNRA-----------HHLFIFFGRYQCTARNPQCDQCPFASFCK 202
>gi|298675584|ref|YP_003727334.1| DNA-(apurinic or apyrimidinic site) lyase [Methanohalobium
evestigatum Z-7303]
gi|298288572|gb|ADI74538.1| DNA-(apurinic or apyrimidinic site) lyase [Methanohalobium
evestigatum Z-7303]
Length = 203
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 66/165 (40%), Gaps = 29/165 (17%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDKVKEYLLDIEG 655
DE EI IK G + RIK+ ++ E G + PD + + LL + G
Sbjct: 63 TDEDEIESLIKNVGFYRVKTQRIKQISEMILDEYDGQV---------PDNLND-LLKLPG 112
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
+G K+ CV + A VD +V RI+ RLG V + P L K P
Sbjct: 113 VGRKTANCVLTYAFSKKAIAVDTHVHRISNRLGLVETKT-PEKTEKDLKKIVP------E 165
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LW ++ L + FG+ C+ SP C C L C
Sbjct: 166 NLWNKINEL-----------FVRFGQNTCRPVSPRCDVCVLNDTC 199
>gi|227544748|ref|ZP_03974797.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Lactobacillus reuteri CF48-3A]
gi|338203728|ref|YP_004649873.1| endonuclease III [Lactobacillus reuteri SD2112]
gi|227185288|gb|EEI65359.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Lactobacillus reuteri CF48-3A]
gi|336448968|gb|AEI57583.1| endonuclease III [Lactobacillus reuteri SD2112]
Length = 213
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 18/118 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P +KE L+ + G+G K + V AFPVD +V R+A RL V +P L I
Sbjct: 106 PHTLKE-LMTLSGVGRKVADVVLAECFNIPAFPVDTHVSRVARRLRIV--EPKASVLAI- 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+K M P +LD H+ MI +G+ C R+P C CPL C
Sbjct: 162 --------EKKLMKTIPPEHWLDA------HHSMIFWGRYVCTARNPKCQTCPLLSLC 205
>gi|15614261|ref|NP_242564.1| endonuclease III [Bacillus halodurans C-125]
gi|10174315|dbj|BAB05417.1| endonuclease III (DNA repair) [Bacillus halodurans C-125]
Length = 218
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A IK+ L+E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELEQDIRSIGLYRNKAKNIKKLCQSLLEQYGG-------EVPQDR--DELVKLAGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + + L+K+ P MD+ +
Sbjct: 123 ANVVASVAFGVPAIAVDTHVERVSKRLGICRWKDNVTQVEQTLMKKIP-MDEWSIS---- 177
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK ++P C CPL C+
Sbjct: 178 ------------HHRLIFFGRYHCKAQNPQCDICPLLDMCR 206
>gi|119897726|ref|YP_932939.1| DNA-(apurinic or apyrimidinic site) lyase [Azoarcus sp. BH72]
gi|119670139|emb|CAL94052.1| DNA-(apurinic or apyrimidinic site) lyase [Azoarcus sp. BH72]
Length = 213
Score = 50.1 bits (118), Expect = 0.007, Method: Composition-based stats.
Identities = 47/169 (27%), Positives = 67/169 (39%), Gaps = 27/169 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ E +AD IK G A L+E HG VP D+ E L
Sbjct: 65 EAMLALGEEGVADYIKTIGLFRNKAKNTVALSRLLLERHGG-------EVPRDR--EALE 115
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + A VD ++ R+A R G P + + + LLK P
Sbjct: 116 ALPGVGRKTANVVLNTIFREPAMAVDTHIFRLANRTGLAPGKDVMA-VEQSLLKRVP--- 171
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ L + H+ +I G+ C R PNC AC +R C
Sbjct: 172 --------------KAFLLDAHHWLILHGRYVCTARKPNCAACIVRDLC 206
>gi|429205189|ref|ZP_19196466.1| endonuclease III [Lactobacillus saerimneri 30a]
gi|428146261|gb|EKW98500.1| endonuclease III [Lactobacillus saerimneri 30a]
Length = 213
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 52/116 (44%), Gaps = 17/116 (14%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E L+ + G+G K+ V AF VD +V RIA RL Q + D + L+ E
Sbjct: 109 REELVKLPGVGRKTANVVLSERFGVPAFAVDTHVSRIAKRL-----QIVAADASVTLI-E 162
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
MD + W + H+ +I +G+ +C R P+C CPL C +
Sbjct: 163 RKFMDSLPPEKW-----------LDAHHALIFWGRYWCTARQPHCATCPLLNICAY 207
>gi|392968671|ref|ZP_10334087.1| endonuclease III [Fibrisoma limi BUZ 3]
gi|387843033|emb|CCH56141.1| endonuclease III [Fibrisoma limi BUZ 3]
Length = 221
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 56/130 (43%), Gaps = 27/130 (20%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ---PLPGD- 698
P V E L + G+G K+ + + VD +V R++ RLG PL PL +
Sbjct: 106 PSTVDE-LQTLPGVGRKTAHVILSIVYNQPTMAVDTHVFRVSHRLGLAPLTATTPLAVEK 164
Query: 699 -LHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
L H+ KEY +Y+ H+ +I G+ C RSP C C LR
Sbjct: 165 ALVAHIPKEY---------------------VYKSHHWLILHGRYVCLARSPKCEVCDLR 203
Query: 758 GECKHFASAV 767
CK++ +V
Sbjct: 204 DFCKYYEKSV 213
>gi|254168132|ref|ZP_04874979.1| base excision DNA repair protein, HhH-GPD family [Aciduliprofundum
boonei T469]
gi|197622898|gb|EDY35466.1| base excision DNA repair protein, HhH-GPD family [Aciduliprofundum
boonei T469]
Length = 211
Score = 50.1 bits (118), Expect = 0.007, Method: Composition-based stats.
Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 19/121 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD +++ LL + G+G K+ V VD +V RI+ RLGWV + P +
Sbjct: 104 PDNLED-LLKLPGVGRKTANIVLSRCYDKDVIAVDTHVHRISNRLGWVNTKT-PEETERE 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L+K P +K +++ ++ FG+ C+ +P C CP++ CK+
Sbjct: 162 LMKVLP-----------------KKYWKDINELLVMFGRTICRPVAPKCDVCPIKKYCKY 204
Query: 763 F 763
+
Sbjct: 205 Y 205
>gi|322437279|ref|YP_004219491.1| iron-sulfur cluster loop [Granulicella tundricola MP5ACTX9]
gi|321165006|gb|ADW70711.1| iron-sulfur cluster loop [Granulicella tundricola MP5ACTX9]
Length = 241
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 83/198 (41%), Gaps = 25/198 (12%)
Query: 574 IYS-----TGEERSHDKMDSV-----DWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
IYS T E+SH M + WE +R A +EI ++ + A +K+ L
Sbjct: 42 IYSLLSSRTKTEQSHQVMRDLRARFGTWENLRDAAVAEIEQTLQIITFPEVKALWLKKDL 101
Query: 624 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLK-SVECVRLLSLQHIAFPVDVNVGR 682
+ G + LE+L +K++ +L EG+G K S V +L+ A VD + R
Sbjct: 102 MEITARCGELSLEFLAKYRTNKIRSWLEQFEGVGAKTSAAVVNFSTLRRRALCVDSHHLR 161
Query: 683 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
+ RLG P D I E +M KI W + D +L + H G+
Sbjct: 162 VTQRLGLTPR----ADAAI---TEERLMRKIPAD-WDAIKLDDHHSLIKKH------GQE 207
Query: 743 FCKKRSPNCGACPLRGEC 760
C P+C CPL C
Sbjct: 208 LCTFAEPHCRRCPLLDMC 225
>gi|163785187|ref|ZP_02179872.1| endonuclease III [Hydrogenivirga sp. 128-5-R1-1]
gi|159879544|gb|EDP73363.1| endonuclease III [Hydrogenivirga sp. 128-5-R1-1]
Length = 218
Score = 50.1 bits (118), Expect = 0.007, Method: Composition-based stats.
Identities = 47/170 (27%), Positives = 70/170 (41%), Gaps = 27/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + E +IA+ I G A IK+ I LE PD + E LL
Sbjct: 68 EEISKLSEEKIAELIYPAGFYKNKAKTIKDI--------SKIILEKYNGKVPDTL-EKLL 118
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+G+G K+ V A VD++V RI+ RLG+V + E+ +M+
Sbjct: 119 KFKGVGRKTANLVLSEGFNKPAICVDIHVHRISNRLGFVKTKTPEK-------TEFALME 171
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ W ++ L ++ FG+ CK SP C CP+ CK
Sbjct: 172 KLPEKYWNKINKL-----------LVGFGQTICKPVSPYCSKCPVENLCK 210
>gi|337747107|ref|YP_004641269.1| hypothetical protein KNP414_02839 [Paenibacillus mucilaginosus
KNP414]
gi|379720962|ref|YP_005313093.1| hypothetical protein PM3016_3078 [Paenibacillus mucilaginosus 3016]
gi|386723570|ref|YP_006189896.1| hypothetical protein B2K_15605 [Paenibacillus mucilaginosus K02]
gi|336298296|gb|AEI41399.1| Nth [Paenibacillus mucilaginosus KNP414]
gi|378569634|gb|AFC29944.1| Nth [Paenibacillus mucilaginosus 3016]
gi|384090695|gb|AFH62131.1| hypothetical protein B2K_15605 [Paenibacillus mucilaginosus K02]
Length = 231
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 54/114 (47%), Gaps = 17/114 (14%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L+++ G+G K+ V + A VD +V R++ RLG+ + ++ L+K
Sbjct: 109 EKLVELPGVGRKTANVVVSNAFGVPAIAVDTHVERVSKRLGFAAHKDSVLEVEKKLMKRV 168
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P ++ + H+++I FG+ CK ++P CG CPL C+
Sbjct: 169 P-----------------REEWTDTHHRLIFFGRYHCKAQNPKCGECPLLDLCR 205
>gi|164687678|ref|ZP_02211706.1| hypothetical protein CLOBAR_01320 [Clostridium bartlettii DSM
16795]
gi|164603452|gb|EDQ96917.1| endonuclease III [Clostridium bartlettii DSM 16795]
Length = 209
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 53/121 (43%), Gaps = 19/121 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P +KE L + G+G K+ + V + H A VD +V R+ R+G V + +
Sbjct: 103 PQTLKE-LTTLPGVGRKTADVVLSNAFNHDAIAVDTHVFRVTNRIGIVNEKNVEK----- 156
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
E+ +MD I W H+ I G+ CK R P C CP++ +C +
Sbjct: 157 --TEFALMDVIPKNRWSHS-----------HHLFIFHGRRMCKARKPECDTCPIKDDCDY 203
Query: 763 F 763
+
Sbjct: 204 Y 204
>gi|449298566|gb|EMC94581.1| hypothetical protein BAUCODRAFT_73567 [Baudoinia compniacensis UAMH
10762]
Length = 452
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 62/229 (27%), Positives = 84/229 (36%), Gaps = 54/229 (23%)
Query: 550 RTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKER 609
R T KNS Q L++R G + SVDW+AVR A + E+ AI ER
Sbjct: 162 RLSANTTNKNSSTAFQ---GLVKRF---GTLQEGIGKGSVDWDAVRRAPQKEVFKAI-ER 214
Query: 610 GQ---------QNIIAGRIKEFLNRLVELHGS---------------------------- 632
G Q I+ +E R L
Sbjct: 215 GGLADRKSKDIQAILQLAYEENQERKAALTAESANATAVGAEQEAESEKHAEVEKASQNI 274
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL 692
I L+ L + E +L G+G K+ CV L LQ +F VD +V R+ LGWVP
Sbjct: 275 ISLDHLHLLSTSAAIEKMLSYPGIGPKTASCVALFCLQRPSFAVDTHVFRLCQYLGWVPK 334
Query: 693 QPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGK 741
+ K P +D+ Y + D+ Y LH +I GK
Sbjct: 335 T---------VKKGQPKVDRNTTYSHCDVRIPDEYK-YPLHQLLIKHGK 373
>gi|262409409|ref|ZP_06085952.1| endonuclease III [Bacteroides sp. 2_1_22]
gi|294644946|ref|ZP_06722682.1| endonuclease III [Bacteroides ovatus SD CC 2a]
gi|294807476|ref|ZP_06766278.1| endonuclease III [Bacteroides xylanisolvens SD CC 1b]
gi|345509483|ref|ZP_08789081.1| endonuclease III [Bacteroides sp. D1]
gi|229446132|gb|EEO51923.1| endonuclease III [Bacteroides sp. D1]
gi|262352861|gb|EEZ01958.1| endonuclease III [Bacteroides sp. 2_1_22]
gi|292639759|gb|EFF58041.1| endonuclease III [Bacteroides ovatus SD CC 2a]
gi|294445316|gb|EFG13981.1| endonuclease III [Bacteroides xylanisolvens SD CC 1b]
Length = 225
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD +++ L+ + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 106 PDNLED-LVKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVKNIP-----------------EKLIPIAHHWLILHGRYVCQARTPKCDTCGLQMMCK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|126179382|ref|YP_001047347.1| HhH-GPD family protein [Methanoculleus marisnigri JR1]
gi|125862176|gb|ABN57365.1| DNA-3-methyladenine glycosylase III [Methanoculleus marisnigri JR1]
Length = 226
Score = 50.1 bits (118), Expect = 0.008, Method: Composition-based stats.
Identities = 46/161 (28%), Positives = 70/161 (43%), Gaps = 17/161 (10%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD ++IA I N A RI+EF +R+ D + + +++E LL + G
Sbjct: 70 ADTADIAPLIVPSRYYNQKAERIREF-SRVYVREFRADPAAMAAMETGELRERLLALRGF 128
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G ++ + + L + F VD RI R G++P + DL L E
Sbjct: 129 GKETTDTILLYVCEKPVFVVDAYTRRIFSRYGFLP-EGASYDLTQRLFAEN--------- 178
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
L P D + + H Q++ G CK RSP C CP+R
Sbjct: 179 LEP-----DAELFNDYHAQIVRLGNTICK-RSPLCDRCPIR 213
>gi|333384285|ref|ZP_08475925.1| endonuclease III [Dysgonomonas gadei ATCC BAA-286]
gi|332826778|gb|EGJ99595.1| endonuclease III [Dysgonomonas gadei ATCC BAA-286]
Length = 211
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 53/122 (43%), Gaps = 18/122 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD ++E L I G+G K+ + +++ A PVD +V R++ R+G P
Sbjct: 106 PDSMEE-LETIPGVGRKTANVMLIVAFNKPAMPVDTHVFRVSNRIGLTDNSKNPEQTERE 164
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L+K Y+ K L + H+ +I G+ C R P C C L CK
Sbjct: 165 LIK-----------------YIPTKYLSKAHHWLILHGRYICVARKPKCEECGLTPYCKF 207
Query: 763 FA 764
F+
Sbjct: 208 FS 209
>gi|348684342|gb|EGZ24157.1| hypothetical protein PHYSODRAFT_478985 [Phytophthora sojae]
Length = 278
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 80/192 (41%), Gaps = 35/192 (18%)
Query: 571 LRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL-VEL 629
++R++ GE ++ + E + E E+++ +K G + A ++K L +
Sbjct: 99 MQRLHQLGE-----NVEGLTIETLLATSEEELSEVLKPVGFYHRKAQQLKRVAAILRTQF 153
Query: 630 HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIAVRLG 688
HG I P + E L + G+G K + LL+ H+ VD +V R+A RLG
Sbjct: 154 HGDI---------PRSLDE-LQQLPGIGPKIGRVITLLAWGHVDGIVVDTHVHRLAQRLG 203
Query: 689 WVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRS 748
W P D L D I W +L ++ FG+ C +
Sbjct: 204 WASTS-TPEDTRREL------EDWIPREHWGKLSL-----------AVVGFGQTVCTAKH 245
Query: 749 PNCGACPLRGEC 760
P+C +CPL +C
Sbjct: 246 PSCSSCPLAPKC 257
>gi|423298961|ref|ZP_17276986.1| endonuclease III [Bacteroides finegoldii CL09T03C10]
gi|408474310|gb|EKJ92829.1| endonuclease III [Bacteroides finegoldii CL09T03C10]
Length = 225
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD +++ L+ + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 106 PDNLED-LIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P ++ + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELIKNIP-----------------EELIPIAHHWLILHGRYVCQARTPKCATCGLQMMCK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|152987876|ref|YP_001347018.1| endonuclease III [Pseudomonas aeruginosa PA7]
gi|452880555|ref|ZP_21957510.1| endonuclease III [Pseudomonas aeruginosa VRFPA01]
gi|150963034|gb|ABR85059.1| endonuclease III [Pseudomonas aeruginosa PA7]
gi|452183029|gb|EME10047.1| endonuclease III [Pseudomonas aeruginosa VRFPA01]
Length = 212
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 71/171 (41%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP ++ E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLFNSKAKNVIETCRILIEKHGS-------QVPDNR--EDLE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + VD ++ R+A R G P K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQPTMAVDTHIFRVANRTGIAPG------------KNVLEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K + PR LD H+ +I G+ CK R P CG+C + C++
Sbjct: 161 KKLLKFVPRDYLLDA------HHWLILHGRYVCKARKPQCGSCRIEDLCEY 205
>gi|258648124|ref|ZP_05735593.1| endonuclease III [Prevotella tannerae ATCC 51259]
gi|260852003|gb|EEX71872.1| endonuclease III [Prevotella tannerae ATCC 51259]
Length = 215
Score = 49.7 bits (117), Expect = 0.008, Method: Composition-based stats.
Identities = 45/173 (26%), Positives = 72/173 (41%), Gaps = 27/173 (15%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKE 648
D A+ A E+ + IK N A + LVE +G + P ++E
Sbjct: 61 DAAAMAQATPEELLEYIKSVSYPNSKAKHLAGLAQMLVEAFNGEV---------PTTLEE 111
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L + G+G K+ V+ ++ A VD +V R++ RLG VP K
Sbjct: 112 -LTRLPGVGRKTANVVQAVAFHKAALAVDTHVFRVSHRLGLVP-------------KTAN 157
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K++M L Y+ ++ + H+ ++ G+ C P C C LRG CK
Sbjct: 158 TPYKVEMALKK---YIPEEKVAPSHFWLLLHGRYVCTALRPKCDKCDLRGLCK 207
>gi|398850623|ref|ZP_10607324.1| endonuclease III [Pseudomonas sp. GM80]
gi|398248446|gb|EJN33860.1| endonuclease III [Pseudomonas sp. GM80]
Length = 212
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 68/161 (42%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVE HGS VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRLLVERHGS-------EVP--QTREELEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G P K ++K M P+
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTGIAPG------------KNVVEVEKKLMKFVPKE 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C R P CG+C + C++
Sbjct: 171 FLLDS------HHWLILHGRYVCLARKPRCGSCRIEDLCEY 205
>gi|332876792|ref|ZP_08444550.1| endonuclease III [Capnocytophaga sp. oral taxon 329 str. F0087]
gi|357047392|ref|ZP_09108999.1| endonuclease III [Paraprevotella clara YIT 11840]
gi|332685351|gb|EGJ58190.1| endonuclease III [Capnocytophaga sp. oral taxon 329 str. F0087]
gi|355529993|gb|EHG99418.1| endonuclease III [Paraprevotella clara YIT 11840]
Length = 222
Score = 49.7 bits (117), Expect = 0.008, Method: Composition-based stats.
Identities = 45/181 (24%), Positives = 77/181 (42%), Gaps = 27/181 (14%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A I + I+ N A + LVE + S VP D + L+
Sbjct: 63 EAMAAATPETIYEYIRSVSYPNNKAKHLVGMARMLVENYHS-------EVPSDL--DELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHIHLLKEYPVM 710
+ G+G K+ ++ + + A VD +V R++ R+G VP P + L++ +P
Sbjct: 114 KLPGVGRKTANVIQAVVFEKAAMAVDTHVFRVSHRIGLVPDTCTTPYSVEKQLVRYFP-- 171
Query: 711 DKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASA 770
D I + + H+ +I G+ C R+P C C L+ C+H+ +A
Sbjct: 172 DPI---------------IPKAHHWLILHGRYTCTARTPKCEVCGLKMICRHYLRTHGAA 216
Query: 771 R 771
R
Sbjct: 217 R 217
>gi|395648781|ref|ZP_10436631.1| endonuclease III [Pseudomonas extremaustralis 14-3 substr. 14-3b]
Length = 212
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 72/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIETCRLLIERHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G I K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KQLMKFVPKQYLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|315640880|ref|ZP_07895977.1| endonuclease III [Enterococcus italicus DSM 15952]
gi|315483358|gb|EFU73857.1| endonuclease III [Enterococcus italicus DSM 15952]
Length = 215
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 50/115 (43%), Gaps = 17/115 (14%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E L+ + G+G K+ V + A VD +V RI+ RL L ++ E
Sbjct: 109 REELMSLAGVGRKTANVVLGDAFGVPAIAVDTHVERISKRLRICKLTANVIEV------E 162
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+M KI LW R H+ MI FG+ C R PNC CPL C+
Sbjct: 163 ETLMKKIPENLWIRA-----------HHTMIFFGRYHCTARKPNCAQCPLLDMCQ 206
>gi|410090685|ref|ZP_11287273.1| endonuclease III [Pseudomonas viridiflava UASWS0038]
gi|409762058|gb|EKN47094.1| endonuclease III [Pseudomonas viridiflava UASWS0038]
Length = 212
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 70/171 (40%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E H VP + +E L
Sbjct: 62 EAIHALGVEGLSEYIKTIGLYNSKAKNVIETCRLLMERHNG-------EVP--ETREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R++ R G P K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQVAMAVDTHIFRVSNRTGIAPG------------KNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M PR LD H+ +I G+ C+ R P CG+C + C +
Sbjct: 161 KQLMKFVPRGYLLDA------HHWLILHGRYVCQARKPRCGSCRIEDLCDY 205
>gi|393784467|ref|ZP_10372630.1| endonuclease III [Bacteroides salyersiae CL02T12C01]
gi|392665448|gb|EIY58972.1| endonuclease III [Bacteroides salyersiae CL02T12C01]
Length = 215
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 58/125 (46%), Gaps = 19/125 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHI 701
PD ++E L+ + G+G K+ ++ + A VD +V R++ R+G V P +
Sbjct: 106 PDTLEE-LIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVDDTHTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K Y+ +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVK-----------------YIPKKLIPIAHHWLILHGRYICQARTPKCDNCGLQLMCK 207
Query: 762 HFASA 766
+F A
Sbjct: 208 YFQEA 212
>gi|194467585|ref|ZP_03073572.1| endonuclease III [Lactobacillus reuteri 100-23]
gi|194454621|gb|EDX43518.1| endonuclease III [Lactobacillus reuteri 100-23]
Length = 213
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 18/118 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P +KE L+ + G+G K + V AFPVD +V R+A RL V +P L I
Sbjct: 106 PHTLKE-LMTLSGVGRKVADVVLAECFNIPAFPVDTHVSRVARRLRMV--EPKASVLTI- 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+K M P +LD H+ MI +G+ C R+P C CPL C
Sbjct: 162 --------EKKLMKTIPPEHWLDA------HHSMIFWGRYVCTARNPKCQTCPLLPLC 205
>gi|448337176|ref|ZP_21526258.1| HhH-GPD family protein [Natrinema pallidum DSM 3751]
gi|445626522|gb|ELY79865.1| HhH-GPD family protein [Natrinema pallidum DSM 3751]
Length = 284
Score = 49.7 bits (117), Expect = 0.009, Method: Composition-based stats.
Identities = 47/194 (24%), Positives = 81/194 (41%), Gaps = 24/194 (12%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL--EWLRNVPPDKVKEY 649
E++ A+ + +A+ I G N + + + ++E GS ++++ P V+E
Sbjct: 100 ESLASAERATLAETISGAGLYNQKSETLIDTAEWVLEEFGSASAFDTFVKDEEPSVVRET 159
Query: 650 LLDIEGLGLKSVECVRLLS-LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL + G+G K+ +CV L + + FPVD +V RI R+G P + L ++ P
Sbjct: 160 LLSVRGVGPKTADCVLLFAGGRGGVFPVDTHVHRIYRRMGIAPPDADHEAVREVLERDVP 219
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC----GACPLRGECKHFA 764
C H I FG+ +C+ R P C ACP+ C+
Sbjct: 220 AAK----------CGFG-------HTATIQFGREYCRARKPACLEDPDACPMADVCEQVG 262
Query: 765 SAVASARFALPGPS 778
A+ P +
Sbjct: 263 VFPATGEVVDPAET 276
>gi|388470025|ref|ZP_10144234.1| endonuclease III [Pseudomonas synxantha BG33R]
gi|388006722|gb|EIK67988.1| endonuclease III [Pseudomonas synxantha BG33R]
Length = 212
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 69/161 (42%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELHG VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRLLVELHGG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G I K ++K M P+
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVEKQLMKFVPKP 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 171 YLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|254168035|ref|ZP_04874883.1| base excision DNA repair protein, HhH-GPD family [Aciduliprofundum
boonei T469]
gi|289596043|ref|YP_003482739.1| DNA-(apurinic or apyrimidinic site) lyase [Aciduliprofundum boonei
T469]
gi|197623078|gb|EDY35645.1| base excision DNA repair protein, HhH-GPD family [Aciduliprofundum
boonei T469]
gi|289533830|gb|ADD08177.1| DNA-(apurinic or apyrimidinic site) lyase [Aciduliprofundum boonei
T469]
Length = 211
Score = 49.7 bits (117), Expect = 0.009, Method: Composition-based stats.
Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 19/121 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD ++E LL + G+G K+ V VD +V RI+ RLGWV + P +
Sbjct: 104 PDNLEE-LLKLPGVGRKTANIVLSRCYDKDVIAVDTHVHRISNRLGWVNTKT-PEETERE 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L+K L +K +++ ++ FG+ C+ +P C CP++ CK+
Sbjct: 162 LMK-----------------VLLKKYWKDINELLVMFGRTICRPVAPKCDVCPIKKYCKY 204
Query: 763 F 763
+
Sbjct: 205 Y 205
>gi|345303642|ref|YP_004825544.1| endonuclease III [Rhodothermus marinus SG0.5JP17-172]
gi|345112875|gb|AEN73707.1| endonuclease III [Rhodothermus marinus SG0.5JP17-172]
Length = 280
Score = 49.7 bits (117), Expect = 0.009, Method: Composition-based stats.
Identities = 35/121 (28%), Positives = 53/121 (43%), Gaps = 18/121 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD + E L + G+G K+ V ++ A PVD +V R+A R+G V P ++
Sbjct: 106 PDSL-EALESLPGVGPKTARVVASVAFGVAALPVDTHVYRVAHRIGLVRRARTPLEVERR 164
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L ++ P D W E H+ +I G+ C R P+C C L C H
Sbjct: 165 LKRQLPARD------WG-----------EAHHLLILHGRYTCTARQPHCDRCVLTDLCDH 207
Query: 763 F 763
+
Sbjct: 208 Y 208
>gi|325969490|ref|YP_004245682.1| iron-sulfur cluster loop [Vulcanisaeta moutnovskia 768-28]
gi|323708693|gb|ADY02180.1| iron-sulfur cluster loop [Vulcanisaeta moutnovskia 768-28]
Length = 307
Score = 49.7 bits (117), Expect = 0.009, Method: Composition-based stats.
Identities = 44/153 (28%), Positives = 75/153 (49%), Gaps = 22/153 (14%)
Query: 619 IKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDV 678
I+E ++RL + G D P + K YLL I+ L L+ + ++ Q + P+D
Sbjct: 154 IEELIDRLSMMAGYED--------PVRKKAYLL-IKFLVLRGLLKIKP---QEMKLPIDN 201
Query: 679 NVGRIAVRLGWVPLQPLPGDL---HIHLLKEYPVMDKIQMY-LWP---RLCYLDQKTLYE 731
++ RIA RLG V L+ +L +HL +E + ++ + W R+ LD + +
Sbjct: 202 HLTRIAYRLGIVRLEDWVLELMRRDMHLSRELDIELRLAVRDAWDAVIRVSKLDPIAIDD 261
Query: 732 LHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+++ G+ C + P C CPLRG C +A
Sbjct: 262 FLWRL---GRTICIRDKPQCDKCPLRGLCSAYA 291
>gi|378952456|ref|YP_005209944.1| hypothetical protein PSF113_4559 [Pseudomonas fluorescens F113]
gi|359762470|gb|AEV64549.1| Nth [Pseudomonas fluorescens F113]
Length = 212
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 70/171 (40%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + +++ IK G N A + E L+E HGS VP + +E L
Sbjct: 62 ETIHALGVEGLSEYIKTIGLFNSKAKNVIETCRLLIERHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G P K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTGLAPG------------KNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C R P CG+C + C++
Sbjct: 161 KKLMKFVPKEFLLDS------HHWLILHGRYVCLARKPRCGSCRIEDLCEY 205
>gi|383787605|ref|YP_005472174.1| putative endonuclease III-like protein [Fervidobacterium
pennivorans DSM 9078]
gi|383110452|gb|AFG36055.1| putative endonuclease III-like protein [Fervidobacterium
pennivorans DSM 9078]
Length = 315
Score = 49.7 bits (117), Expect = 0.010, Method: Composition-based stats.
Identities = 44/162 (27%), Positives = 74/162 (45%), Gaps = 20/162 (12%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
EI+ IK G N+ A R+K L L + DLE ++ +++++ LL ++G+G ++
Sbjct: 169 EISFLIKPAGFFNVKAQRLKNLLTWLKSY--NFDLEKIKTKSIEEIRDELLSVKGIGKET 226
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
+ + L +L+ F VD R+ RL + LKEY + +PR
Sbjct: 227 ADSIILYALELPVFVVDAYTKRLLDRLLGIK------------LKEYDEYRLLFEKTYPR 274
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L Q E H ++ K C+ +P C CP+ CK+
Sbjct: 275 DVALYQ----EFHGLIVEHAKALCRT-NPLCATCPVE-SCKY 310
>gi|428671556|gb|EKX72474.1| endonuclease III family member protein [Babesia equi]
Length = 666
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 66/163 (40%), Gaps = 28/163 (17%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E E+ I G N A IK+ + E G PD ++E L+ + G+G
Sbjct: 520 EEELDTLISRVGFHNTKAKNIKKTATIIHEKFGG--------RVPDTMEE-LVSLPGVGP 570
Query: 659 KSVECVRLLSLQHI-AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K V L+ + I VD++V RI+ RLGWV + P + + L + P
Sbjct: 571 KMANLVIQLAFKRIDGISVDLHVHRISNRLGWVKTK-TPDETRLQLQELIP--------- 620
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
QK E+++ ++ FG+ C P C C C
Sbjct: 621 --------QKLWAEVNHLLVGFGQTICTAAGPGCATCGANKWC 655
>gi|85710083|ref|ZP_01041148.1| Endonuclease III/Nth [Erythrobacter sp. NAP1]
gi|85688793|gb|EAQ28797.1| Endonuclease III/Nth [Erythrobacter sp. NAP1]
Length = 217
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 66/164 (40%), Gaps = 27/164 (16%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
++ IA AIK G N I++F L+E H V PD +E L+ + G+G
Sbjct: 73 NDEAIAKAIKPCGLYNNKTKNIRKFCTALIEEHD--------GVVPD-TREGLMSLPGIG 123
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K + V + VD +V R+ R+G L + +K L
Sbjct: 124 RKCADIVMSFTFGKDVIAVDTHVHRVCNRIG---------------LTDAKTAEKTAQQL 168
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
R + L + H+ +I FGK C+ R P C C + C+
Sbjct: 169 EERS---PEWALADGHFWLIQFGKRVCRSRIPKCDICVVSDLCE 209
>gi|29347081|ref|NP_810584.1| endonuclease III [Bacteroides thetaiotaomicron VPI-5482]
gi|383125310|ref|ZP_09945956.1| endonuclease III [Bacteroides sp. 1_1_6]
gi|29338979|gb|AAO76778.1| endonuclease III [Bacteroides thetaiotaomicron VPI-5482]
gi|251838419|gb|EES66505.1| endonuclease III [Bacteroides sp. 1_1_6]
Length = 225
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 57/124 (45%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD + + L+ + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 106 PDNMDD-LIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVKNIP-----------------EKLIPIAHHWLILHGRYVCQARTPKCDTCGLQMMCK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|336406284|ref|ZP_08586944.1| endonuclease III [Bacteroides sp. 1_1_30]
gi|335935312|gb|EGM97270.1| endonuclease III [Bacteroides sp. 1_1_30]
Length = 225
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 57/124 (45%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD + + L+ + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 106 PDNLDD-LIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVKNIP-----------------EKLIPIAHHWLILHGRYVCQARTPKCDTCGLQMMCK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|291294869|ref|YP_003506267.1| endonuclease III [Meiothermus ruber DSM 1279]
gi|290469828|gb|ADD27247.1| endonuclease III [Meiothermus ruber DSM 1279]
Length = 235
Score = 49.7 bits (117), Expect = 0.010, Method: Composition-based stats.
Identities = 49/171 (28%), Positives = 70/171 (40%), Gaps = 27/171 (15%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D A+ A E+ IK G A I RLVE HG VP DK K
Sbjct: 86 DAFALAQATPEEVEPYIKTIGLYRSKARNIVLLARRLVEQHGG-------EVPVDKAK-- 136
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L + G+G K+ V + VD ++ R+A RLG + Q P + L + +P
Sbjct: 137 LRALPGVGWKTATVVLGAAFGVPGIAVDTHLTRLAARLG-LSAQKDPEKIGGDLERLFP- 194
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+K ++ +H+ +I FG+ C R P C CPL +C
Sbjct: 195 ---------------KEKWVF-VHHALILFGRYRCTARKPQCPGCPLYDDC 229
>gi|294787263|ref|ZP_06752516.1| endonuclease III [Parascardovia denticolens F0305]
gi|294484619|gb|EFG32254.1| endonuclease III [Parascardovia denticolens F0305]
Length = 244
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 67/172 (38%), Gaps = 22/172 (12%)
Query: 593 AVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLD 652
A+ A+ SE+A+ I G A N L+ G VP + E L
Sbjct: 92 ALSQANPSEVAEIINSIGFFRTKAQHAVMIANDLITRFGG-------EVP--RTMEELTT 142
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
+ G+G K+ + + FPVD +V R+ RL W D + + KE V
Sbjct: 143 LPGVGRKTANVILGNAFDLPGFPVDTHVMRVTKRLHWRSDWNKTKDDPVAIEKE--VTAA 200
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ W +L +++I FG+ C R P C CPLR C F
Sbjct: 201 FEPTEW-----------RDLSHRLIDFGRDTCHARKPECLICPLRDTCPSFG 241
>gi|312110487|ref|YP_003988803.1| endonuclease III [Geobacillus sp. Y4.1MC1]
gi|336234949|ref|YP_004587565.1| endonuclease III [Geobacillus thermoglucosidasius C56-YS93]
gi|423719504|ref|ZP_17693686.1| endonuclease III [Geobacillus thermoglucosidans TNO-09.020]
gi|311215588|gb|ADP74192.1| endonuclease III [Geobacillus sp. Y4.1MC1]
gi|335361804|gb|AEH47484.1| endonuclease III [Geobacillus thermoglucosidasius C56-YS93]
gi|383367596|gb|EID44873.1| endonuclease III [Geobacillus thermoglucosidans TNO-09.020]
Length = 216
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ I+ G A I++ L+E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIRSIGLYRNKAKNIQKLCAMLIEKYGG-------QVPQDR--DELMKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG+ + ++ L+K+ P + W
Sbjct: 123 ANVVVSVAFGIPAIAVDTHVERVSKRLGFCRWEDSVLEVEKTLMKKIPKEE------WSI 176
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H++MI FG+ CK +SP C CPL C+
Sbjct: 177 T-----------HHRMIFFGRYHCKAQSPQCHVCPLLDLCR 206
>gi|423216244|ref|ZP_17202769.1| endonuclease III [Bacteroides xylanisolvens CL03T12C04]
gi|295088002|emb|CBK69525.1| DNA-(apurinic or apyrimidinic site) lyase /endonuclease III
[Bacteroides xylanisolvens XB1A]
gi|392691095|gb|EIY84346.1| endonuclease III [Bacteroides xylanisolvens CL03T12C04]
Length = 225
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 57/124 (45%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD + + L+ + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 106 PDNLDD-LIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVKNIP-----------------EKLIPIAHHWLILHGRYVCQARAPKCDTCGLQMMCK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|312959249|ref|ZP_07773767.1| endonuclease III [Pseudomonas fluorescens WH6]
gi|311286509|gb|EFQ65072.1| endonuclease III [Pseudomonas fluorescens WH6]
Length = 212
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 72/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP + +E L
Sbjct: 62 EAIYALGVDGLSEYIKTIGLYNSKAKNVIETCRLLIEQHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + + VD ++ R++ R G I K ++
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVE 160
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K M P+ LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 161 KQLMKFVPKPYLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|398989592|ref|ZP_10692829.1| endonuclease III [Pseudomonas sp. GM24]
gi|399015076|ref|ZP_10717352.1| endonuclease III [Pseudomonas sp. GM16]
gi|398109087|gb|EJL99026.1| endonuclease III [Pseudomonas sp. GM16]
gi|398147214|gb|EJM35929.1| endonuclease III [Pseudomonas sp. GM24]
Length = 212
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVE HGS+ VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRLLVERHGSV-------VP--QTREELEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G P + + V +K+ M P+
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTGIAPGKNV-----------VEVENKL-MKFVPKE 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C R P CG+C + C++
Sbjct: 171 FLLDS------HHWLILHGRYVCLARKPRCGSCRIEDLCEY 205
>gi|220935177|ref|YP_002514076.1| HhH-GPD family protein [Thioalkalivibrio sulfidophilus HL-EbGr7]
gi|219996487|gb|ACL73089.1| HhH-GPD family protein [Thioalkalivibrio sulfidophilus HL-EbGr7]
Length = 217
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 76/170 (44%), Gaps = 19/170 (11%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ ++E+A I+ G N+ A R+K ++ HG LR+ P +K++ LL
Sbjct: 62 EAMLDLSDAELARLIRPSGYFNVKARRLKALCRWYLD-HGGRRR--LRHWPTEKLRASLL 118
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G ++ + + L + F +D R+ RLG H H Y D
Sbjct: 119 SVHGIGPETADDILLYAFDRPVFVIDAYTRRLLGRLG-----------HPHAQAAY---D 164
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+M L L D++ E H ++ GK C+ + P C C L +C+
Sbjct: 165 DFRMSLESTLGQ-DERLYNEYHALIVAHGKDVCRPK-PRCEQCVLSTKCE 212
>gi|14520880|ref|NP_126355.1| endonuclease III [Pyrococcus abyssi GE5]
gi|5458097|emb|CAB49586.1| nth endonuclease III [Pyrococcus abyssi GE5]
gi|380741424|tpe|CCE70058.1| TPA: endonuclease III [Pyrococcus abyssi GE5]
Length = 222
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 18/128 (14%)
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL 692
I LE + PDK +E L+ + G+G K V A PVD +V RI+ RLG P
Sbjct: 102 IILEKYKGRVPDKFEE-LIKLPGIGRKCANIVLAYGFGIPAIPVDTHVYRISRRLGLAPW 160
Query: 693 QPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
P ++ LKE L PR ++ +Y +++ M+ GK C+ P C
Sbjct: 161 DASPEEVE-ERLKE----------LIPR-----EEWIY-VNHAMVDHGKSVCRPIKPRCD 203
Query: 753 ACPLRGEC 760
CPL+ C
Sbjct: 204 ECPLKELC 211
>gi|325298553|ref|YP_004258470.1| endonuclease III [Bacteroides salanitronis DSM 18170]
gi|324318106|gb|ADY35997.1| endonuclease III [Bacteroides salanitronis DSM 18170]
Length = 214
Score = 49.7 bits (117), Expect = 0.010, Method: Composition-based stats.
Identities = 31/122 (25%), Positives = 55/122 (45%), Gaps = 23/122 (18%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP---LQPLPGDL 699
PD ++E L+ + G+G K+ ++ + A VD +V R++ R+G VP PL +
Sbjct: 106 PDTLEE-LVKLPGVGRKTANVIQSVVFHKAAMAVDTHVFRVSHRIGLVPGTCTTPLATEK 164
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
H+ Y+ + + + H+ +I G+ C R+P C C L G
Sbjct: 165 HLTR-------------------YIPEALIPKAHHWLILHGRYVCTARNPKCDKCGLNGI 205
Query: 760 CK 761
C+
Sbjct: 206 CQ 207
>gi|104780422|ref|YP_606920.1| endonuclease III [Pseudomonas entomophila L48]
gi|95109409|emb|CAK14109.1| endonuclease III [Pseudomonas entomophila L48]
Length = 212
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
+A+ ++D IK G N A + E LVE H VP + +E L
Sbjct: 62 QAIHALGVEGLSDYIKTIGLYNSKAKNVIETCRLLVEQHDG-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + +A VD ++ R++ R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQLAMAVDTHIFRVSNRTGIAPGKTVL-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ L + H+ +I G+ C+ R P CG+C + C++
Sbjct: 169 --------------KDYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|383319791|ref|YP_005380632.1| hypothetical protein Mtc_1360 [Methanocella conradii HZ254]
gi|379321161|gb|AFD00114.1| hypothetical protein Mtc_1360 [Methanocella conradii HZ254]
Length = 282
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 47/113 (41%), Gaps = 17/113 (15%)
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
LL IEG+G K+ +CV L S PVD +V R+ RLG VP + L++E P
Sbjct: 8 LLSIEGVGPKTADCVLLFSCGRDVIPVDTHVFRVTKRLGMVPEDASHEEARRILMEEIPP 67
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K +H +I G+ CK R P C L C +
Sbjct: 68 GMK-----------------GSVHVCLIKLGREICKARGPKHNICFLLDLCDY 103
>gi|347542986|ref|YP_004857623.1| endonuclease III [Candidatus Arthromitus sp. SFB-rat-Yit]
gi|346986022|dbj|BAK81697.1| endonuclease III [Candidatus Arthromitus sp. SFB-rat-Yit]
Length = 209
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 71/166 (42%), Gaps = 27/166 (16%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
DE ++ D IK G + I L E E+ VP K ++ L+ + G+G
Sbjct: 67 DEEKLKDKIKCIGLYRTKSKNIINLCKILKE-------EFNGEVP--KTRDELITLPGVG 117
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K+ V AF VDV+V R++ R+G + P + L+K I L
Sbjct: 118 RKTANVVISNCFGVQAFAVDVHVFRVSNRIG-IANSKTPEQTELELMKN------IDESL 170
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
W +C H+ +I G+ C R PNCG C ++ CK++
Sbjct: 171 WT-IC----------HHTIIFHGRRCCTSRKPNCGECKIKEYCKYY 205
>gi|332157727|ref|YP_004423006.1| endonuclease III [Pyrococcus sp. NA2]
gi|331033190|gb|AEC51002.1| endonuclease III [Pyrococcus sp. NA2]
Length = 220
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 52/118 (44%), Gaps = 18/118 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PDK E LL + G+G K V A PVD +V RI+ RLG P P ++
Sbjct: 112 PDKFDE-LLKLPGIGRKCANIVLAYGFGIPAIPVDTHVYRISRRLGLAPWDASPEEVEER 170
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L L PR ++ +Y +++ M+ GK C+ P C CPLR C
Sbjct: 171 L-----------KSLIPR-----EEWIY-VNHAMVDHGKRICRPVEPRCNECPLRDLC 211
>gi|154505234|ref|ZP_02041972.1| hypothetical protein RUMGNA_02748 [Ruminococcus gnavus ATCC 29149]
gi|336433960|ref|ZP_08613767.1| endonuclease III [Lachnospiraceae bacterium 2_1_58FAA]
gi|153794432|gb|EDN76852.1| endonuclease III [Ruminococcus gnavus ATCC 29149]
gi|336014865|gb|EGN44696.1| endonuclease III [Lachnospiraceae bacterium 2_1_58FAA]
Length = 208
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 51/116 (43%), Gaps = 18/116 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G K+ +R + VD +V RI+ RLG + P + L+KE
Sbjct: 109 EDLTSLAGVGRKTANVIRGNIYHDPSVVVDTHVKRISRRLGLTKNED-PEKIETDLMKEL 167
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P D LY + Q+ITFG+ C RSP C C L+ CK F
Sbjct: 168 PK---------------DHWILYNI--QIITFGRSICTARSPKCEQCFLQKYCKEF 206
>gi|296109162|ref|YP_003616111.1| HhH-GPD family protein [methanocaldococcus infernus ME]
gi|295433976|gb|ADG13147.1| HhH-GPD family protein [Methanocaldococcus infernus ME]
Length = 220
Score = 49.3 bits (116), Expect = 0.011, Method: Composition-based stats.
Identities = 47/178 (26%), Positives = 77/178 (43%), Gaps = 17/178 (9%)
Query: 585 KMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPD 644
K D +D E + DE E+ + I++ G A +KE +V +GS D +
Sbjct: 49 KRDLIDEEKILKIDEDELINIIRKVGYYKRKARALKELTGFIVNNYGSTDEMAKSDESLI 108
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
++E LL I+G+G ++ + + L +L F V+ R L G L+I
Sbjct: 109 SLREKLLSIKGIGKETADSILLYALDRETFVVNAYTKR------------LFGRLNIIHE 156
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K+Y + + PR D K E + ++ K C+K P C CPL+ C +
Sbjct: 157 KDYEKIKRFFESQIPR----DLKIYKEYNALIVEHCKRACRK-VPKCLTCPLKNICPY 209
>gi|222099270|ref|YP_002533838.1| Endonuclease III [Thermotoga neapolitana DSM 4359]
gi|221571660|gb|ACM22472.1| Endonuclease III [Thermotoga neapolitana DSM 4359]
Length = 208
Score = 49.3 bits (116), Expect = 0.011, Method: Composition-based stats.
Identities = 57/215 (26%), Positives = 93/215 (43%), Gaps = 36/215 (16%)
Query: 552 KKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSV--DWEAVRCADESEIADAIKER 609
K+T P + +TV L +R ER+ K+ V E + A ++ D IKE
Sbjct: 18 KETDPFRVLISTV-----LSQRTRDENTERAARKLFEVYRTPEDLAKAKPEDLYDLIKES 72
Query: 610 GQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS 668
G A RI + +VE + G + PD ++E LL + G+G K+ V +
Sbjct: 73 GMYRQKAERIVKISKIIVEKYSGKV---------PDTLEE-LLRLPGVGRKTANIVLWVG 122
Query: 669 LQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKT 728
+ A VD +V RI+ RLGWV + P + L + P +K
Sbjct: 123 FRKPALAVDTHVHRISNRLGWVKTKT-PEETEKALKRLLP-----------------EKL 164
Query: 729 LYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
++ M+ FG+ C+ +P C C L+ C+++
Sbjct: 165 WGPINGSMVEFGRNVCRPVNPKCEDCFLKKHCEYY 199
>gi|336114200|ref|YP_004568967.1| endonuclease III [Bacillus coagulans 2-6]
gi|335367630|gb|AEH53581.1| endonuclease III [Bacillus coagulans 2-6]
Length = 219
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 59/126 (46%), Gaps = 19/126 (15%)
Query: 636 EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL 695
E+ R VP + +E L+ + G+G K+ V ++ A VD +V R++ RLG+ +
Sbjct: 100 EYGREVP--QTREELMKLPGVGRKTANVVLSVAFGIPAIAVDTHVERVSKRLGFCRYKDS 157
Query: 696 PGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACP 755
++ L+K+ P + W H++MI FG+ CK + P C CP
Sbjct: 158 VLEVEQTLMKKVPKEE------WS-----------ITHHRMIFFGRYHCKAQRPQCEICP 200
Query: 756 LRGECK 761
L C+
Sbjct: 201 LLDLCR 206
>gi|312870016|ref|ZP_07730153.1| endonuclease III [Lactobacillus oris PB013-T2-3]
gi|417885775|ref|ZP_12529926.1| endonuclease III [Lactobacillus oris F0423]
gi|311094413|gb|EFQ52720.1| endonuclease III [Lactobacillus oris PB013-T2-3]
gi|341594694|gb|EGS37378.1| endonuclease III [Lactobacillus oris F0423]
Length = 213
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 52/118 (44%), Gaps = 18/118 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P +KE L + G+G K + V AFPVD +V R+A RL VP P L I
Sbjct: 106 PQTLKE-LTSLPGVGRKVADVVLAECFTIPAFPVDTHVSRVARRLAMVP--PKASLLAI- 161
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
E +M+ + W + H+ MI +G+ C R+P C CPL C
Sbjct: 162 ---EKKLMEAVPEDKW-----------LDAHHSMIFWGRYRCMARNPRCSDCPLLPMC 205
>gi|344344631|ref|ZP_08775492.1| HhH-GPD family protein [Marichromatium purpuratum 984]
gi|343803795|gb|EGV21700.1| HhH-GPD family protein [Marichromatium purpuratum 984]
Length = 221
Score = 49.3 bits (116), Expect = 0.013, Method: Composition-based stats.
Identities = 48/174 (27%), Positives = 78/174 (44%), Gaps = 21/174 (12%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
D + +A+ ++ G + R+ F R H + LE L + +++E+LL G+G
Sbjct: 69 DPARLAELLRPSGYYRVKTRRLLAFCERY---HAAGGLEPLAALATPRLREWLLATPGIG 125
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
++ + + L + H F VD R+ RLG L GD ++Y ++I+ +
Sbjct: 126 PETADDILLYAFAHPVFVVDAYTRRLFSRLGL-----LAGD------EDY---ERIRRWF 171
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASAR 771
L D L E H ++ K C+ R P C C LR C ASA +S R
Sbjct: 172 EQALPD-DVALLQEYHALIVAHAKHACRAR-PVCAGCALRACCP--ASAPSSER 221
>gi|289192102|ref|YP_003458043.1| HhH-GPD family protein [Methanocaldococcus sp. FS406-22]
gi|288938552|gb|ADC69307.1| HhH-GPD family protein [Methanocaldococcus sp. FS406-22]
Length = 222
Score = 49.3 bits (116), Expect = 0.013, Method: Composition-based stats.
Identities = 41/168 (24%), Positives = 77/168 (45%), Gaps = 16/168 (9%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
+D E + DE ++ + IK G N+ A R+K +V+ +G+ + + ++
Sbjct: 62 LDEEKILNVDEDKLKELIKPAGFYNLKAKRLKNVTKFIVDNYGNTEEMAKTDKDTLTLRA 121
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL I G+G ++ + + L +L +F VD R+ RLG ++ E
Sbjct: 122 ELLSINGVGKETADSILLYALDRESFVVDAYTKRMFSRLG--------------IINEKA 167
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
D+I+ ++ + D + E H ++ K FC+K+ P C CP+
Sbjct: 168 KYDEIKE-IFEKSLPKDLEIYKEYHALIVEHCKKFCRKK-PLCDNCPI 213
>gi|385799712|ref|YP_005836116.1| DNA-(apurinic or apyrimidinic site) lyase [Halanaerobium praevalens
DSM 2228]
gi|309389076|gb|ADO76956.1| endonuclease III; DNA-(apurinic or apyrimidinic site) lyase
[Halanaerobium praevalens DSM 2228]
Length = 218
Score = 49.3 bits (116), Expect = 0.013, Method: Composition-based stats.
Identities = 35/128 (27%), Positives = 58/128 (45%), Gaps = 20/128 (15%)
Query: 636 EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL 695
E+ VP K ++ LL + G+G K+ V + AFPVD +V R++ RLG
Sbjct: 104 EFNSQVP--KTRKELLKLSGVGRKTANVVLSSAFAKAAFPVDTHVFRVSARLG------- 154
Query: 696 PGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACP 755
+ K +K L PR ++D H+ +I G+ CK ++P+C C
Sbjct: 155 -----LSSGKNVSTTEKELTDLIPRKYWID------FHHWLIDHGRALCKAQNPDCKNCF 203
Query: 756 LRGECKHF 763
+ C ++
Sbjct: 204 AKKICNYY 211
>gi|148652689|ref|YP_001279782.1| endonuclease III [Psychrobacter sp. PRwf-1]
gi|148571773|gb|ABQ93832.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Psychrobacter sp. PRwf-1]
Length = 231
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 72/169 (42%), Gaps = 27/169 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ E + IK G N A + + LV+ H S V PD K+ L
Sbjct: 79 EAIYALGEEGLKQYIKTIGLYNSKAKNVIKACKDLVDKHNS--------VVPDNRKD-LE 129
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + VD ++ R++ R G + + L E+ +++
Sbjct: 130 ALAGVGRKTANVVLNTAFGQPTMAVDTHIFRVSNRTGLATGKTV-------LAVEHKLIE 182
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
++ P LD H+ +I G+ C+ R+P CGACP+ EC
Sbjct: 183 RV-----PDDFILDA------HHYLILHGRYTCQARTPKCGACPVYTEC 220
>gi|292671021|ref|ZP_06604447.1| endonuclease III [Selenomonas noxia ATCC 43541]
gi|422343635|ref|ZP_16424562.1| endonuclease III [Selenomonas noxia F0398]
gi|292647338|gb|EFF65310.1| endonuclease III [Selenomonas noxia ATCC 43541]
gi|355378051|gb|EHG25242.1| endonuclease III [Selenomonas noxia F0398]
Length = 210
Score = 49.3 bits (116), Expect = 0.013, Method: Composition-based stats.
Identities = 51/178 (28%), Positives = 75/178 (42%), Gaps = 35/178 (19%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ ++E+ AI + G + A I E + L++ +G VP D E L
Sbjct: 64 EAIASLGQAELEAAIHDCGFFRMKAKHILETCDILLQEYGG-------EVPADF--EALQ 114
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL----GWVPLQPLPGDLHIHLLKEY 707
+ G+G K+ V ++ + A VD +V R+A RL G PL+ G L K
Sbjct: 115 KLPGVGRKTANVVMSVAFRVPAIAVDTHVFRVANRLHLAVGKTPLEVEKG-----LQKAI 169
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
P D W + H+ +I G+ CK R P CG CPL C A+
Sbjct: 170 PRAD------WS-----------DAHHWLILHGRQICKARKPLCGDCPLSFICPSSAA 210
>gi|150398564|ref|YP_001329031.1| endonuclease III [Sinorhizobium medicae WSM419]
gi|150030079|gb|ABR62196.1| endonuclease III [Sinorhizobium medicae WSM419]
Length = 236
Score = 49.3 bits (116), Expect = 0.013, Method: Composition-based stats.
Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 18/117 (15%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ +E L+ + G+G K+ V ++ VD ++ RIA R+ P + P ++ HLL
Sbjct: 114 RTREELVTLPGVGRKTANVVLSMAFGEATIAVDTHIFRIAHRIRIAPGKT-PDEVEAHLL 172
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ P + LY H+ +I G+ CK R P C C + CK
Sbjct: 173 RVIP-----------------EHRLYHAHHWLILHGRYVCKARRPECERCVIADICK 212
>gi|392375109|ref|YP_003206942.1| hypothetical protein DAMO_2061 [Candidatus Methylomirabilis
oxyfera]
gi|258592802|emb|CBE69111.1| conserved protein of unknown function [Candidatus Methylomirabilis
oxyfera]
Length = 228
Score = 49.3 bits (116), Expect = 0.014, Method: Composition-based stats.
Identities = 37/159 (23%), Positives = 65/159 (40%), Gaps = 17/159 (10%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
+ +A I+ G N+ R+K L+ +G + + +++E LL I G+G
Sbjct: 73 QEHLATLIRPSGYYNMKTERLKHVTRFLLTRYGG-SVRRMGRTGLSELREELLGISGVGE 131
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
++ + + L + F VD R+ R G + G++ + P
Sbjct: 132 ETADSILLYAGDRPIFVVDAYTRRVLERHGLIAKNTRYGEIQRLFMTHLPT--------- 182
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
D E H ++ GK +C+ R+PNC CPLR
Sbjct: 183 ------DATLFNEYHALLVAVGKTYCR-RTPNCDKCPLR 214
>gi|424897115|ref|ZP_18320689.1| endonuclease III [Rhizobium leguminosarum bv. trifolii WSM2297]
gi|393181342|gb|EJC81381.1| endonuclease III [Rhizobium leguminosarum bv. trifolii WSM2297]
Length = 260
Score = 49.3 bits (116), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 49/115 (42%), Gaps = 18/115 (15%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E L+ + G+G K+ V ++ VD ++ RIA R+ P + P ++ L+K
Sbjct: 139 REELVKLPGVGRKTANVVLSMAFGQATMAVDTHIFRIANRIRLAPGK-TPDEVEARLMKV 197
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P Q LY H+ +I G+ CK R P C C + CK
Sbjct: 198 IP-----------------QHYLYHAHHWLILHGRYTCKARRPECERCVIADLCK 235
>gi|315230941|ref|YP_004071377.1| endonuclease III [Thermococcus barophilus MP]
gi|315183969|gb|ADT84154.1| endonuclease III [Thermococcus barophilus MP]
Length = 236
Score = 49.3 bits (116), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 55/120 (45%), Gaps = 18/120 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD ++E L+ + G+G K V A PVD +V RI+ RLG P + P + +
Sbjct: 122 PDMLEE-LVKLPGIGRKCANIVLAYGFGKQAIPVDTHVNRISKRLGLAPPKAPPEKVEEY 180
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L + P LW +Y +++ M+ GK C+ SP C CPL+ C +
Sbjct: 181 LKELIP------KELW----------IY-VNHAMVDHGKAICRPISPRCDECPLKTLCPY 223
>gi|379023259|ref|YP_005299920.1| endonuclease III [Rickettsia canadensis str. CA410]
gi|376324197|gb|AFB21438.1| endonuclease III [Rickettsia canadensis str. CA410]
Length = 209
Score = 48.9 bits (115), Expect = 0.014, Method: Composition-based stats.
Identities = 34/127 (26%), Positives = 58/127 (45%), Gaps = 19/127 (14%)
Query: 640 NVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDL 699
N P+ KE L+ + G+G K+ V VD +V R+A R+G
Sbjct: 102 NTVPNSFKE-LVKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVAKRIG----------- 149
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
L + + ++ L L L++K L H+ +I G+ CK R P+C CP++
Sbjct: 150 ----LAQGSTPEVVEKEL---LQILNKKWLMHAHHWLILHGRYICKARKPDCDICPVKKY 202
Query: 760 CKHFASA 766
C+++ +A
Sbjct: 203 CEYYLNA 209
>gi|298387599|ref|ZP_06997151.1| endonuclease III [Bacteroides sp. 1_1_14]
gi|298259806|gb|EFI02678.1| endonuclease III [Bacteroides sp. 1_1_14]
Length = 176
Score = 48.9 bits (115), Expect = 0.014, Method: Composition-based stats.
Identities = 32/124 (25%), Positives = 57/124 (45%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD + + L+ + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 57 PDNMDD-LIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 115
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P +K + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 116 ELVKNIP-----------------EKLIPIAHHWLILHGRYVCQARTPKCDTCGLQMMCK 158
Query: 762 HFAS 765
+F +
Sbjct: 159 YFCN 162
>gi|259480217|tpe|CBF71146.1| TPA: hypothetical base excision DNA repair protein (Eurofung)
[Aspergillus nidulans FGSC A4]
Length = 502
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 61/152 (40%), Gaps = 43/152 (28%)
Query: 583 HDKMD--SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN---------RLVELHG 631
HD + SV+W+AVR A ++ +AIK G + + IK L+ R + + G
Sbjct: 269 HDGIGKGSVNWDAVRRAPVKDVFEAIKSGGLADSKSKNIKAILDMVYKENQERRNILVKG 328
Query: 632 S--------------------------------IDLEWLRNVPPDKVKEYLLDIEGLGLK 659
+ L +L +P ++V L+ G+G K
Sbjct: 329 QDTNSDSGKFVQQLNDKPEGEKQYEIACADQNFLSLNYLHGLPTEEVMTELMKYPGIGPK 388
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
+ CV L LQ F VD ++ RI L WVP
Sbjct: 389 TAACVLLFCLQRPCFAVDTHIFRICKWLNWVP 420
>gi|345882993|ref|ZP_08834444.1| endonuclease III [Prevotella sp. C561]
gi|345044205|gb|EGW48249.1| endonuclease III [Prevotella sp. C561]
Length = 215
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 65/172 (37%), Gaps = 27/172 (15%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D + + A EI D +K N A + E LVE G VP D
Sbjct: 61 DAKTMATATSDEIFDYVKSVSYPNSKAKHLVEMSKMLVEKFGG-------EVPSDP--NA 111
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP-LPGDLHIHLLKEYP 708
L+ + G+G K+ ++ + VD +V R++ RLG VP P + +L+K P
Sbjct: 112 LITLPGVGRKTANVIQAVWFGKPTLAVDTHVYRVSHRLGLVPSTANTPRKVEDYLMKNIP 171
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ + W ++ G+ CK P+C CP C
Sbjct: 172 TEEVTDAHHW-----------------ILLHGRYVCKSAKPDCEHCPFDDMC 206
>gi|150391360|ref|YP_001321409.1| endonuclease III [Alkaliphilus metalliredigens QYMF]
gi|149951222|gb|ABR49750.1| endonuclease III [Alkaliphilus metalliredigens QYMF]
Length = 216
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 68/172 (39%), Gaps = 27/172 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + E E+ IK G N+ + I + L+E HG VP ++ E L+
Sbjct: 67 ERILTLTEVELGQWIKSCGFYNMKSKNILATCHLLMEKHGG-------EVPEER--EALM 117
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V A VD +V R++ RLG + E +M
Sbjct: 118 ALPGVGRKTANVVISNVFGQDAIAVDTHVFRVSNRLGLAHSDNVDDT-------EQDLMK 170
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
I +W + H+ +I G+ CK R P C CPL C H+
Sbjct: 171 SIPKSMWS-----------DAHHWIILHGRRICKARRPLCEECPLTTYCLHY 211
>gi|389852800|ref|YP_006355034.1| endonuclease III [Pyrococcus sp. ST04]
gi|388250106|gb|AFK22959.1| endonuclease III [Pyrococcus sp. ST04]
Length = 216
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 55/128 (42%), Gaps = 18/128 (14%)
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL 692
I LE V P+ ++E L+ + G+G K V A PVD +V R++ RLG VP
Sbjct: 101 IILEKYGGVVPNTLEE-LMKLPGIGRKCANIVLAYGFGIPAIPVDTHVNRVSKRLGLVPP 159
Query: 693 QPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
P + L K P + LY +++ M+ GK C+ P C
Sbjct: 160 NATPEKVEEILKKLLP----------------KDEWLY-VNHAMVDHGKAVCRPIKPKCD 202
Query: 753 ACPLRGEC 760
CPLR C
Sbjct: 203 ECPLRDIC 210
>gi|308174025|ref|YP_003920730.1| endonuclease III [Bacillus amyloliquefaciens DSM 7]
gi|384158761|ref|YP_005540834.1| endonuclease III [Bacillus amyloliquefaciens TA208]
gi|384164798|ref|YP_005546177.1| endonuclease III [Bacillus amyloliquefaciens LL3]
gi|384167825|ref|YP_005549203.1| endonuclease III [Bacillus amyloliquefaciens XH7]
gi|307606889|emb|CBI43260.1| endonuclease III [Bacillus amyloliquefaciens DSM 7]
gi|328552849|gb|AEB23341.1| endonuclease III [Bacillus amyloliquefaciens TA208]
gi|328912353|gb|AEB63949.1| endonuclease III [Bacillus amyloliquefaciens LL3]
gi|341827104|gb|AEK88355.1| endonuclease III [Bacillus amyloliquefaciens XH7]
Length = 219
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEEYGG-------EVPKDR--DELVKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+K+ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMKKVPKED------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 206
>gi|225683840|gb|EEH22124.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 451
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 57/134 (42%), Gaps = 25/134 (18%)
Query: 583 HDKMD--SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE----------LH 630
HD + SVDW+ VR A E+ +A+K G + + IK L+ + E +
Sbjct: 212 HDGVGEGSVDWDKVRQATLEEVYNAMKAGGLGRVKSRYIKRILDMVHEEGVARRNAMGMG 271
Query: 631 GSI-------------DLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVD 677
G I L L + D+ G+G+K+ CV L L+ F VD
Sbjct: 272 GDIYQEGDGNGHEHILSLNHLHTLSKDEAMLEFTKYPGIGVKTAACVILFCLRQPCFAVD 331
Query: 678 VNVGRIAVRLGWVP 691
+V R+ LGW+P
Sbjct: 332 THVVRLCKWLGWLP 345
>gi|114566493|ref|YP_753647.1| endonuclease III [Syntrophomonas wolfei subsp. wolfei str.
Goettingen]
gi|114337428|gb|ABI68276.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Syntrophomonas wolfei subsp. wolfei str. Goettingen]
Length = 207
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 71/170 (41%), Gaps = 27/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ D S + + I+ G A +K+ +VE ++ VP D + LL
Sbjct: 60 EALAAIDLSLLEEKIRGVGLYRNKARHLKKMAQIIVE-------QYQGEVPSDF--DELL 110
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G KS +R + + VD +V R+A RLG V LP L ++ P
Sbjct: 111 SLPGVGRKSANVIRSVVFKKPGLGVDTHVHRVANRLGLVN-SKLPEQTEKALKEQIP--- 166
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+K E H+ +I G+ C+ R P C C L G C+
Sbjct: 167 --------------EKCWSEAHHLLIFHGRRICQARKPQCNNCVLEGLCE 202
>gi|123505745|ref|XP_001329048.1| HhH-GPD superfamily base excision DNA repair protein [Trichomonas
vaginalis G3]
gi|121911998|gb|EAY16825.1| HhH-GPD superfamily base excision DNA repair protein [Trichomonas
vaginalis G3]
Length = 238
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 57/132 (43%), Gaps = 28/132 (21%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQH-----IAFPVDVNVGRIAVRLGWVPLQPLPG 697
P +KE L G+G+K + L++ H I VDV+V RI+ LGWV + P
Sbjct: 122 PKTLKE-LTSFNGVGVK----MGTLAMAHCWGEQIGIGVDVHVHRISNLLGWVKTKK-PD 175
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
D + L K P +W E+++ ++ FG+ C + P C CP++
Sbjct: 176 DTELALQKILP------KEIWS-----------EVNHTLVGFGQTICDAKKPKCDECPIK 218
Query: 758 GECKHFASAVAS 769
C AS
Sbjct: 219 DTCPALQRGSAS 230
>gi|387783942|ref|YP_006070025.1| putative endonuclease III (DNA-(apurinic or apyrimidinic site)
lyase) [Streptococcus salivarius JIM8777]
gi|338744824|emb|CCB95190.1| putative endonuclease III (DNA-(apurinic or apyrimidinic site)
lyase) [Streptococcus salivarius JIM8777]
Length = 214
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 17/119 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K + L + G+G K+ V + VD +V R++ RL VP ++ L+
Sbjct: 107 KTHKELESLPGVGRKTANVVLAEVYGIPSIAVDTHVSRVSKRLNIVPEDASVEEIEAELM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K+ P D I H++MI FG+ C ++P C CPL+ CK++
Sbjct: 167 KKIPKRDWIIS-----------------HHRMIFFGRYHCLAKNPKCQTCPLQSYCKYY 208
>gi|289208913|ref|YP_003460979.1| HhH-GPD family protein [Thioalkalivibrio sp. K90mix]
gi|288944544|gb|ADC72243.1| HhH-GPD family protein [Thioalkalivibrio sp. K90mix]
Length = 216
Score = 48.9 bits (115), Expect = 0.015, Method: Composition-based stats.
Identities = 44/174 (25%), Positives = 78/174 (44%), Gaps = 19/174 (10%)
Query: 587 DSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV 646
D +D E++ +A+ I+ G N+ A R++ L R +E G + E L + + +
Sbjct: 56 DVLDPESLLELPHEILAEHIRPSGYFNVKADRLRHLL-RFLEQQGGV--EALARMETEAL 112
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+ LL ++G+G ++ + + L + + F VD R+ RLG + DL + + E
Sbjct: 113 RSALLSVKGVGPETADDIVLYAFERPVFVVDAYTRRLFERLGLPHARGAYDDLRVWVESE 172
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L P D + +LH ++ GK C+ + P C CPL C
Sbjct: 173 ----------LGP-----DAQAFNDLHALIVEHGKQRCRPK-PLCQGCPLSDVC 210
>gi|153956018|ref|YP_001396783.1| hypothetical protein CKL_3410 [Clostridium kluyveri DSM 555]
gi|219856358|ref|YP_002473480.1| hypothetical protein CKR_3015 [Clostridium kluyveri NBRC 12016]
gi|146348876|gb|EDK35412.1| Hypothetical protein CKL_3410 [Clostridium kluyveri DSM 555]
gi|219570082|dbj|BAH08066.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
Length = 219
Score = 48.9 bits (115), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 69/165 (41%), Gaps = 27/165 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E E+ + IK G + I E ++E H VP K E L + G+G
Sbjct: 69 EEELGEKIKSCGFFRSKSKHILEASRVILESHKG-------EVP--KTMEELTKLSGVGR 119
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V + A VD +V R++ RLG + + P + L+K P +W
Sbjct: 120 KTANVVLSNAFGIPAIAVDTHVFRVSNRLG-IAIGNTPDKVEKELMKNIP------ESMW 172
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ H+ +I G++ CK R P+C CPL C++F
Sbjct: 173 S-----------DTHHYLIWHGRLICKSRKPDCENCPLVPWCQYF 206
>gi|228476833|ref|ZP_04061478.1| endonuclease III [Streptococcus salivarius SK126]
gi|387761465|ref|YP_006068442.1| endonuclease III [Streptococcus salivarius 57.I]
gi|228251407|gb|EEK10552.1| endonuclease III [Streptococcus salivarius SK126]
gi|339292232|gb|AEJ53579.1| endonuclease III [Streptococcus salivarius 57.I]
Length = 214
Score = 48.9 bits (115), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 17/119 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K + L + G+G K+ V + VD +V R++ RL VP ++ L+
Sbjct: 107 KTHKELESLPGVGRKTANVVLAEVYGIPSIAVDTHVSRVSKRLNIVPEDASVEEIEAELM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K+ P D I H++MI FG+ C ++P C CPL+ CK++
Sbjct: 167 KKIPKRDWIIS-----------------HHRMIFFGRYHCLAKNPKCQTCPLQSYCKYY 208
>gi|410454551|ref|ZP_11308486.1| DNA-lyase [Bacillus bataviensis LMG 21833]
gi|409931832|gb|EKN68807.1| DNA-lyase [Bacillus bataviensis LMG 21833]
Length = 213
Score = 48.9 bits (115), Expect = 0.016, Method: Composition-based stats.
Identities = 44/160 (27%), Positives = 67/160 (41%), Gaps = 19/160 (11%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
++A I+ G NI A RIK ++ L IDL ++ + ++ LL I+G+G ++
Sbjct: 68 DLAQLIRPSGFFNIKAKRIKSYMAWFKTLEYDIDL--IKKIDKHLLRNELLSIKGIGRET 125
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
+ + L + F VD RI RLG+ + + Y K PR
Sbjct: 126 GDVILLYAFDTPIFIVDAYARRIFYRLGY------------DMPESYDAFRKQIEKELPR 173
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D E H ++ K CK + P C CPL G C
Sbjct: 174 ----DLVLYNEFHALVVEHAKEHCKAK-PICEGCPLLGIC 208
>gi|404494240|ref|YP_006718346.1| endonuclease III-like protein [Pelobacter carbinolicus DSM 2380]
gi|77546248|gb|ABA89810.1| endonuclease III-related protein [Pelobacter carbinolicus DSM 2380]
Length = 232
Score = 48.9 bits (115), Expect = 0.017, Method: Composition-based stats.
Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 19/146 (13%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
A R++ F L+E H DL+ + + P +V++ LL +G+G ++ + + L + +F
Sbjct: 101 AQRLQLFATCLLEKHQG-DLDAMLSGPLSQVRQTLLTFKGIGPETADSILLYAGHRPSFV 159
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLY-ELHY 734
VD R+ R G L GD + Y + + M PR Q L+ E H
Sbjct: 160 VDAYTRRLFKRYG-----VLEGD------ETYEDIRALFMAHLPR-----QVDLFNEYHA 203
Query: 735 QMITFGKIFCKKRSPNCGACPLRGEC 760
++ K FC+KR P C CPL+ EC
Sbjct: 204 LIVEQCKTFCRKR-PLCENCPLQPEC 228
>gi|403737479|ref|ZP_10950275.1| endonuclease III [Austwickia chelonae NBRC 105200]
gi|403192427|dbj|GAB77045.1| endonuclease III [Austwickia chelonae NBRC 105200]
Length = 248
Score = 48.9 bits (115), Expect = 0.017, Method: Composition-based stats.
Identities = 54/198 (27%), Positives = 81/198 (40%), Gaps = 35/198 (17%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKE 648
D A+ AD +E+ + I+ G A + + L+ H G + P ++KE
Sbjct: 77 DATALAAADRTELEEIIRPIGFHRNKADALLKLSAELLARHDGRV---------PGRLKE 127
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L+ + G+G K+ V + VD +VGR+A R GW E P
Sbjct: 128 -LVALSGVGRKTANVVLGNAFGVPGITVDTHVGRLARRFGWTT-------------AEDP 173
Query: 709 VMDKIQM-YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC-KHFASA 766
V+ + ++ L+PR K L + +I G+ C R P CGAC + C H
Sbjct: 174 VVVETEVGALFPR------KEWTMLSHTLIFHGRRTCHARRPACGACTVASLCPSHGIGE 227
Query: 767 VASARFAL---PGPSEKG 781
AR A PGP G
Sbjct: 228 TDPARAAALVKPGPVATG 245
>gi|281423216|ref|ZP_06254129.1| endonuclease III [Prevotella oris F0302]
gi|281402552|gb|EFB33383.1| endonuclease III [Prevotella oris F0302]
Length = 229
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 71/192 (36%), Gaps = 42/192 (21%)
Query: 571 LRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-L 629
L R Y T E M DWE +I IK N A + E LVE
Sbjct: 55 LYRHYPTAEA-----MAQADWE--------DIFQLIKSVSYPNSKAHHLSEMAKILVERF 101
Query: 630 HGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGW 689
+G + PD + L + G+G K+ V+ + VD +V R++ RLG
Sbjct: 102 NGEV---------PDNTDD-LTQLPGVGRKTANVVQAVWFGKPTLAVDTHVYRVSHRLGL 151
Query: 690 VP-LQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRS 748
VP P + + L+K P D + H+ ++ G+ CK +
Sbjct: 152 VPETANTPLKVELELMKYIPKAD-----------------VGNAHHWLLLHGRYICKSQK 194
Query: 749 PNCGACPLRGEC 760
P C CP C
Sbjct: 195 PQCQDCPFNTFC 206
>gi|329961055|ref|ZP_08299334.1| endonuclease III [Bacteroides fluxus YIT 12057]
gi|328532341|gb|EGF59145.1| endonuclease III [Bacteroides fluxus YIT 12057]
Length = 224
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 57/123 (46%), Gaps = 19/123 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD ++E L+ + G+G K+ ++ + A VD +V R++ RLG VP + P +
Sbjct: 106 PDTLEE-LVKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRLGLVPDKCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P D + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVKNIPEAD-----------------IPIAHHWLILHGRYVCQARTPQCDNCGLQLMCK 207
Query: 762 HFA 764
++
Sbjct: 208 YYC 210
>gi|23010115|ref|ZP_00050917.1| COG0177: Predicted EndoIII-related endonuclease [Magnetospirillum
magnetotacticum MS-1]
Length = 271
Score = 48.9 bits (115), Expect = 0.018, Method: Composition-based stats.
Identities = 44/146 (30%), Positives = 65/146 (44%), Gaps = 14/146 (9%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLS-LQHIAF 674
A RIK L + E G++DL +L + D+ + +L I G+G K+ V S L+ A
Sbjct: 130 APRIKAVLAAVRERVGALDLAFLEEMSVDEARGWLEGIPGIGPKTSAAVLSFSTLRMPAL 189
Query: 675 PVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHY 734
PVD + R+A R G + + G H L + P W + LY+ H
Sbjct: 190 PVDSHHHRVAQRTGLIGPKVDVGPSHGMLRAQLPAD-------W------SAQKLYDNHE 236
Query: 735 QMITFGKIFCKKRSPNCGACPLRGEC 760
++ G+ C RSP C C L C
Sbjct: 237 VLMLHGQRVCFHRSPACDRCVLLDIC 262
>gi|359783795|ref|ZP_09287004.1| endonuclease III [Pseudomonas psychrotolerans L19]
gi|359368255|gb|EHK68837.1| endonuclease III [Pseudomonas psychrotolerans L19]
Length = 212
Score = 48.9 bits (115), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 72/163 (44%), Gaps = 27/163 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E L+E HG VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKARNVIETCRLLLERHGG-------EVP--QTREELEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + +A VD ++ R++ R P + + ++ + LLK P
Sbjct: 123 NVVLNTAFRQLAMAVDTHIFRVSNRTNLAPGKNVV-EVELRLLKVVP------------- 168
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ L + H+ +I G+ C+ R P CG+C + C++ A
Sbjct: 169 ----RDFLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCEYKA 207
>gi|344300617|gb|EGW30938.1| hypothetical protein SPAPADRAFT_156404 [Spathaspora passalidarum
NRRL Y-27907]
Length = 329
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 82/201 (40%), Gaps = 40/201 (19%)
Query: 567 DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 626
++D + +++ E+ + D + EAV ES+I I + G N AG IK L
Sbjct: 115 NFDAMTKLHKAMLEKGYK--DGLCIEAVLATSESDIDKLIAKVGFHNRKAGYIKRSCQML 172
Query: 627 VELH-GSI-----DLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNV 680
++ H G I ++ L V P YLL G G+ S VDV++
Sbjct: 173 IDSHNGDIPTTIEEITALPGVGPKM--GYLLLQAGWGINS------------GIGVDVHL 218
Query: 681 GRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW-PRLCYLDQKTLYELHYQMITF 739
R+A GWV + P ++++ W PR + D L M+ F
Sbjct: 219 HRLAQMWGWVSKKATT-----------PEKARLELEDWLPRQYWKDVNPL------MVGF 261
Query: 740 GKIFCKKRSPNCGACPLRGEC 760
G++ C R+ NC C L C
Sbjct: 262 GQVVCGSRAKNCDVCSLNTMC 282
>gi|317055920|ref|YP_004104387.1| DNA-(apurinic or apyrimidinic site) lyase [Ruminococcus albus 7]
gi|315448189|gb|ADU21753.1| DNA-(apurinic or apyrimidinic site) lyase [Ruminococcus albus 7]
Length = 212
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 69/164 (42%), Gaps = 27/164 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +EI + +K G A IKE +L + +G V PD + E L + G+
Sbjct: 71 ADIAEIEEIVKPCGLYKTKAKSIKEMCIQLRDEYG--------GVLPDTL-EGLTKLSGI 121
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ + A D + RI RLG V + P K++
Sbjct: 122 GRKTANLIMGDIYHKPAVVTDTHCIRITGRLGLVK-------------NKEPA--KVEAE 166
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LW L + +L ++++ FG+ +C RSP CG CPL C
Sbjct: 167 LWK---ILPPEKSSDLCHRLVLFGREYCTARSPKCGGCPLNDIC 207
>gi|167630835|ref|YP_001681334.1| endonuclease iii [Heliobacterium modesticaldum Ice1]
gi|167593575|gb|ABZ85323.1| endonuclease iii [Heliobacterium modesticaldum Ice1]
Length = 203
Score = 48.5 bits (114), Expect = 0.018, Method: Composition-based stats.
Identities = 48/178 (26%), Positives = 73/178 (41%), Gaps = 39/178 (21%)
Query: 592 EAVRCADESEIADAIKE------RGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDK 645
EA+ + E+ D IK +G+ + A RI LVE HG VP
Sbjct: 47 EAMLLLTQEELEDLIKSIGLYRNKGRNILAACRI------LVEKHGG-------QVP--G 91
Query: 646 VKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLK 705
+E L + G+G K+ V + Q A VD +V R++ RLG + + +
Sbjct: 92 YREGLEKLPGVGRKTANVVLAEAFQEPAIAVDTHVFRVSNRLGLAQAKDV-------VKT 144
Query: 706 EYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
E +M+ I LW + H+ +I G+ C R P CG C L C+ +
Sbjct: 145 EQDLMNNIPRDLWAKA-----------HHWLIFHGRQVCHARKPACGVCRLAECCREY 191
>gi|67540934|ref|XP_664241.1| hypothetical protein AN6637.2 [Aspergillus nidulans FGSC A4]
gi|40738976|gb|EAA58166.1| hypothetical protein AN6637.2 [Aspergillus nidulans FGSC A4]
Length = 1085
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 61/152 (40%), Gaps = 43/152 (28%)
Query: 583 HDKMD--SVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN---------RLVELHG 631
HD + SV+W+AVR A ++ +AIK G + + IK L+ R + + G
Sbjct: 852 HDGIGKGSVNWDAVRRAPVKDVFEAIKSGGLADSKSKNIKAILDMVYKENQERRNILVKG 911
Query: 632 S--------------------------------IDLEWLRNVPPDKVKEYLLDIEGLGLK 659
+ L +L +P ++V L+ G+G K
Sbjct: 912 QDTNSDSGKFVQQLNDKPEGEKQYEIACADQNFLSLNYLHGLPTEEVMTELMKYPGIGPK 971
Query: 660 SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
+ CV L LQ F VD ++ RI L WVP
Sbjct: 972 TAACVLLFCLQRPCFAVDTHIFRICKWLNWVP 1003
>gi|418017971|ref|ZP_12657527.1| endonuclease III [Streptococcus salivarius M18]
gi|345526820|gb|EGX30131.1| endonuclease III [Streptococcus salivarius M18]
Length = 214
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 17/119 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K + L + G+G K+ V + VD +V R++ RL VP ++ L+
Sbjct: 107 KTHKELESLPGVGRKTANVVLAEVYGIPSIAVDTHVSRVSKRLNIVPEDASVEEIEAELM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K+ P D I H++MI FG+ C ++P C CPL+ CK++
Sbjct: 167 KKIPKRDWIIS-----------------HHRMIFFGRYHCLAKNPKCQTCPLQSYCKYY 208
>gi|224026016|ref|ZP_03644382.1| hypothetical protein BACCOPRO_02769 [Bacteroides coprophilus DSM
18228]
gi|224019252|gb|EEF77250.1| hypothetical protein BACCOPRO_02769 [Bacteroides coprophilus DSM
18228]
Length = 216
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 57/122 (46%), Gaps = 23/122 (18%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP---LQPLPGDL 699
PD + E L+ + G+G K+ ++ + A VD +V R++ R+G VP PL +
Sbjct: 106 PDTL-EQLVKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPQTCTTPLATEK 164
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
+ L+K Y+ ++ + H+ +I G+ C RSP C +C L G
Sbjct: 165 Y--LIK-----------------YIPEELIPTAHHWLILHGRYVCTARSPKCESCGLNGL 205
Query: 760 CK 761
CK
Sbjct: 206 CK 207
>gi|83313269|ref|YP_423533.1| EndoIII-related endonuclease [Magnetospirillum magneticum AMB-1]
gi|82948110|dbj|BAE52974.1| Predicted EndoIII-related endonuclease [Magnetospirillum magneticum
AMB-1]
Length = 211
Score = 48.5 bits (114), Expect = 0.019, Method: Composition-based stats.
Identities = 42/162 (25%), Positives = 64/162 (39%), Gaps = 27/162 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E + +I+ G A + E RL+ LHG VP D+ L + G+G
Sbjct: 69 EEGLVQSIRTIGLYKTKAKNVIELSRRLLSLHGG-------QVPHDRAA--LEALPGVGR 119
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD + R+A R G P + + + L+K P
Sbjct: 120 KTANVVLNIAFGEPTIAVDTHCFRVANRTGLAPGKTVEA-VEQGLMKATPA--------- 169
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
K L H+ +I G+ CK R P+C AC +R C
Sbjct: 170 --------KWLQHAHHWLILHGRYTCKARKPDCAACVVRELC 203
>gi|198276951|ref|ZP_03209482.1| hypothetical protein BACPLE_03156 [Bacteroides plebeius DSM 17135]
gi|198270476|gb|EDY94746.1| endonuclease III [Bacteroides plebeius DSM 17135]
Length = 221
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 56/122 (45%), Gaps = 23/122 (18%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP---LQPLPGDL 699
PD ++E L+ + G+G K+ ++ + A VD +V R++ R+G VP PL +
Sbjct: 106 PDTLEE-LIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPRTCTTPLATEK 164
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
L+K Y+ + + H+ +I G+ C R+P C +C L G
Sbjct: 165 Q--LVK-----------------YIPESLIPTAHHWLILHGRYVCTARAPKCESCGLNGI 205
Query: 760 CK 761
CK
Sbjct: 206 CK 207
>gi|10640069|emb|CAC11921.1| endonuclease III related protein [Thermoplasma acidophilum]
Length = 197
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 19/125 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD + E L+ + G+GLK+ + V A VD +V RI+ R+GW + P +
Sbjct: 87 PDSIDE-LVSLPGVGLKTAKVVLAEGFNRPAIAVDTHVFRISHRIGWSSAR-TPEETSEE 144
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L + PV +Q+ P M+ FGK C+ P C CP+ C++
Sbjct: 145 LERIIPV--DLQVGFNP---------------MMVEFGKAICRPVRPLCDRCPVSEYCRY 187
Query: 763 FASAV 767
+ V
Sbjct: 188 YEEKV 192
>gi|357051748|ref|ZP_09112914.1| endonuclease III, partial [Enterococcus saccharolyticus 30_1]
gi|355379183|gb|EHG26349.1| endonuclease III, partial [Enterococcus saccharolyticus 30_1]
Length = 216
Score = 48.5 bits (114), Expect = 0.020, Method: Composition-based stats.
Identities = 47/170 (27%), Positives = 72/170 (42%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI + I+ G A IK ++L+E VP + +E L+
Sbjct: 63 EALAAAPVEEIIEKIRTIGLYRNKAKNIKACASQLIERFNG-------QVP--RTREELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + A VD +V R+ RL L D ++ L E +M
Sbjct: 114 SLPGVGRKTANVVLGDAFGIPAIAVDTHVERVTKRLRICRL-----DANV-LEVEQTLMK 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ +I FG+ C R+P C CPL C+
Sbjct: 168 KVPEDLWVKT-----------HHTLIFFGRYHCTARAPKCEVCPLLTMCQ 206
>gi|23099212|ref|NP_692678.1| endonuclease III [Oceanobacillus iheyensis HTE831]
gi|22777440|dbj|BAC13713.1| endonuclease III (DNA repair) [Oceanobacillus iheyensis HTE831]
Length = 216
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 17/115 (14%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
KE L+ + G+G K+ V ++ + VD +V R++ RLG + ++ L+K+
Sbjct: 109 KEELVKLAGVGRKTANVVASIAFNEPSIAVDTHVERVSKRLGICKWKDSVLEVENTLMKK 168
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P D+ ++ H++MI FG+ CK R+P C CPL C+
Sbjct: 169 VP---------------RDEWSVT--HHRMIFFGRYHCKARNPQCPECPLLELCR 206
>gi|394993581|ref|ZP_10386326.1| endonuclease III [Bacillus sp. 916]
gi|393805693|gb|EJD67067.1| endonuclease III [Bacillus sp. 916]
Length = 219
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+++ G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEEYGG-------EVPKDR--DELVNLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+K+ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMKKVPKED------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLPLCR 206
>gi|304393063|ref|ZP_07374992.1| endonuclease III [Ahrensia sp. R2A130]
gi|303294828|gb|EFL89199.1| endonuclease III [Ahrensia sp. R2A130]
Length = 226
Score = 48.5 bits (114), Expect = 0.020, Method: Composition-based stats.
Identities = 47/190 (24%), Positives = 80/190 (42%), Gaps = 37/190 (19%)
Query: 572 RRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHG 631
RR++ + + +KM ++ E VR +AI+ G A + +LV+ G
Sbjct: 47 RRLFPIAD--TPEKMLALGEEGVR--------EAIRTIGLYRNKAKNVIALSQKLVDDFG 96
Query: 632 SIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
S+ VP + +E L+ + G+G K+ V ++ VD ++ RI R+G P
Sbjct: 97 SV-------VP--QTREELVTLPGVGRKTANVVMSMAFGIPTMAVDTHILRIGNRMGIAP 147
Query: 692 LQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC 751
+ P D+ L+ P + LY H+ +I G+ CK R+P C
Sbjct: 148 GKT-PDDIEAILMARVP-----------------EDYLYHAHHWLILHGRYTCKARTPLC 189
Query: 752 GACPLRGECK 761
C + CK
Sbjct: 190 EECIIADLCK 199
>gi|157364361|ref|YP_001471128.1| endonuclease III [Thermotoga lettingae TMO]
gi|157314965|gb|ABV34064.1| endonuclease III [Thermotoga lettingae TMO]
Length = 217
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 72/177 (40%), Gaps = 29/177 (16%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKE 648
D ++ A S+I D IK G A RI + +VE +G I P + +
Sbjct: 60 DVYSLSMAKPSQIYDLIKASGMYRQKAERIIKVSQIIVEKFNGKI---------PANLHD 110
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
LL + G+G K+ V A VD +V RI+ RLG+V + P L K P
Sbjct: 111 -LLSLPGVGRKTANIVLYHCFCQPALAVDTHVHRISNRLGFVKTK-TPEQTEEGLKKIIP 168
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFAS 765
+K ++ M+ FGK C R P C CP+ C++F S
Sbjct: 169 -----------------EKFWGPINGAMVEFGKKVCLPRKPKCQECPVNKYCEYFNS 208
>gi|154686481|ref|YP_001421642.1| hypothetical protein RBAM_020490 [Bacillus amyloliquefaciens FZB42]
gi|384265830|ref|YP_005421537.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum YAU
B9601-Y2]
gi|385265206|ref|ZP_10043293.1| HhH-GPD superfamily base excision DNA repair protein [Bacillus sp.
5B6]
gi|387898837|ref|YP_006329133.1| DNA-(apurinic or apyrimidinic site) lyase [Bacillus
amyloliquefaciens Y2]
gi|429505620|ref|YP_007186804.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum
AS43.3]
gi|452855993|ref|YP_007497676.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum
UCMB5036]
gi|154352332|gb|ABS74411.1| Nth [Bacillus amyloliquefaciens FZB42]
gi|380499183|emb|CCG50221.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum YAU
B9601-Y2]
gi|385149702|gb|EIF13639.1| HhH-GPD superfamily base excision DNA repair protein [Bacillus sp.
5B6]
gi|387172947|gb|AFJ62408.1| DNA-(apurinic or apyrimidinic site) lyase [Bacillus
amyloliquefaciens Y2]
gi|429487210|gb|AFZ91134.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum
AS43.3]
gi|452080253|emb|CCP22015.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum
UCMB5036]
Length = 219
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+++ G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEEYGG-------EVPKDR--DELVNLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+K+ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMKKVPKED------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLPLCR 206
>gi|219848286|ref|YP_002462719.1| HhH-GPD family protein [Chloroflexus aggregans DSM 9485]
gi|219542545|gb|ACL24283.1| HhH-GPD family protein [Chloroflexus aggregans DSM 9485]
Length = 223
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 66/172 (38%), Gaps = 25/172 (14%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ ADESE+ + ++ G IK + + H +P D E LL
Sbjct: 67 EALALADESELREVLRPLGLSKRFPYLIKA-ARYICDNHNG-------EIPADF--ESLL 116
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G + + + + P DVNV R R + E +
Sbjct: 117 KVPGIGKYTATAILIFAYGQKLVPADVNVLRFVSRFTGL---------------EMGHVT 161
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K LW L L + + ++ F ++ C+ R P C ACPL C +F
Sbjct: 162 KGSRELWNLLPLLSEANVGLSAENLLDFTRLICRARKPKCNACPLSAHCSYF 213
>gi|374307877|ref|YP_005054308.1| endonuclease III [Filifactor alocis ATCC 35896]
gi|291166112|gb|EFE28158.1| endonuclease III [Filifactor alocis ATCC 35896]
Length = 211
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 75/181 (41%), Gaps = 37/181 (20%)
Query: 584 DKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPP 643
DKM S+ E E+ + I+ G N A I + L+E +G VP
Sbjct: 60 DKMVSLS--------EGELIELIRTIGFYNNKAKNILMTSHILLEKYGG-------EVP- 103
Query: 644 DKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHL 703
K +E L+ + G+G K+ V + AF VD +VGR+ RLG + P + I +
Sbjct: 104 -KTREELVKLPGVGRKTANVVISNAFGIPAFAVDTHVGRVTNRLGLTKSKN-PNQIEIDV 161
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLY-ELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ P K LY + H+ I G+ CK P C +CPL C +
Sbjct: 162 TSQLP------------------KKLYTQAHHLFIFHGRKCCKAIRPLCDSCPLTVNCTY 203
Query: 763 F 763
+
Sbjct: 204 Y 204
>gi|94984892|ref|YP_604256.1| HhH-GPD [Deinococcus geothermalis DSM 11300]
gi|94555173|gb|ABF45087.1| Endonuclease III, alpha helical glycosidase superfamily
[Deinococcus geothermalis DSM 11300]
Length = 269
Score = 48.5 bits (114), Expect = 0.021, Method: Composition-based stats.
Identities = 48/185 (25%), Positives = 76/185 (41%), Gaps = 31/185 (16%)
Query: 591 WEAVRCADESEIADAIKER---GQQNIIAGRIKEFLNRLVEL------------HGSIDL 635
WEA AD + +A+ G A I L+RL EL G + L
Sbjct: 89 WEAA-LADGPDGVEAVLRAAGGGLARTKADSIWNVLHRLAELGLAGELGLAGEGRGGLSL 147
Query: 636 EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL 695
LR + ++ + L + G+G+K+ + L L A P++ N+ R+A RL P +
Sbjct: 148 RVLRTMTDEEARALLESLPGVGMKTASLLLLFDLARPAIPIENNIHRVAGRLDLFPSR-- 205
Query: 696 PGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACP 755
++LK D++ W LD+ T H I G+ C+ + P C C
Sbjct: 206 -----WNVLKAERWFDEVLPRDW-----LDRAT---FHVSAIRHGRQTCRAQRPRCACCV 252
Query: 756 LRGEC 760
L+ C
Sbjct: 253 LQDLC 257
>gi|163867394|ref|YP_001608588.1| endonuclease III [Bartonella tribocorum CIP 105476]
gi|161017035|emb|CAK00593.1| endonuclease III [Bartonella tribocorum CIP 105476]
Length = 253
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 68/275 (24%), Positives = 103/275 (37%), Gaps = 51/275 (18%)
Query: 519 MPQAHDAT--TTSKKSPRGKGKSKEYKSD-----MKNDRTKKTTPKKNSDNTVQQDWDLL 571
MPQ++ T T K S S Y D + ++ TPK SD + LL
Sbjct: 1 MPQSNIKTPRNTIKLSKMRNLSSISYNKDEIAEIFRRFSVQRPTPK--SDLNYINTFTLL 58
Query: 572 RRIYSTGEERSHDKMDSVDWEAVRCAD---------ESEIADAIKERGQQNIIAGRIKEF 622
+ + + + ++ E R AD E EIA I+ G A +
Sbjct: 59 VAVVLSAQA-TDISVNKATKELFRLADQPEKMVALGEEEIAHHIRSIGLWRAKARNVYAL 117
Query: 623 LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 682
N L++ +G PD E L+ + G+G K+ V ++ VD ++ R
Sbjct: 118 SNCLIDQYGG--------QVPDTC-EALMSLPGVGRKTANVVLNVAFGQPTLAVDTHIFR 168
Query: 683 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
++ RLG P + P + LLK PV L H+ +I G+
Sbjct: 169 LSNRLGLAPGKT-PEIVEKKLLKIIPV-----------------HYLRHAHHWLILHGRY 210
Query: 743 FCKKRSPNCGACPLRGECKHFASAVASARFALPGP 777
C+ R P C C + CK AS A+P P
Sbjct: 211 ICQARKPQCTQCIIADLCK-----AASKTNAIPAP 240
>gi|403380710|ref|ZP_10922767.1| hypothetical protein PJC66_12927 [Paenibacillus sp. JC66]
Length = 225
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 17/114 (14%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L+++ G+G K+ V + A VD +V R++ RLG L+ ++ L+K+
Sbjct: 109 EQLVELPGVGRKTANVVVSNAFDVPAIAVDTHVERVSKRLGLAKLKDSVLEVEKSLMKKV 168
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P + W + H+++I FG+ CK ++P C CPL C+
Sbjct: 169 PREE------WT-----------QTHHRLIFFGRYHCKAQNPRCEICPLLDICR 205
>gi|307273846|ref|ZP_07555068.1| endonuclease III [Enterococcus faecalis TX0855]
gi|306509531|gb|EFM78579.1| endonuclease III [Enterococcus faecalis TX0855]
Length = 215
Score = 48.5 bits (114), Expect = 0.021, Method: Composition-based stats.
Identities = 47/170 (27%), Positives = 70/170 (41%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI IK G A IK +L+E VP + ++ L+
Sbjct: 63 EALAAAPVEEIIAKIKTIGLYRNKAKNIKACAQQLLERFNG-------EVP--QTRDELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + AF VD +V R++ RL L ++ E +M
Sbjct: 114 SLPGVGRKTANVVMGDAFGEPAFAVDTHVERVSKRLRICKLNANVTEV------EQTLMR 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ MI FG+ C +P C ACPL C+
Sbjct: 168 KVPKELWVKT-----------HHTMIFFGRYHCLAIAPKCEACPLLYMCQ 206
>gi|302340587|ref|YP_003805793.1| DNA-(apurinic or apyrimidinic site) lyase [Spirochaeta smaragdinae
DSM 11293]
gi|301637772|gb|ADK83199.1| DNA-(apurinic or apyrimidinic site) lyase [Spirochaeta smaragdinae
DSM 11293]
Length = 224
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 72/172 (41%), Gaps = 32/172 (18%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D ++ D +IA I G + A ++K +L E VP ++ +
Sbjct: 75 DLGSLAAMDTEQIAKLIYPAGFYRVKAAQLKTIAMKLKE----------TGVPAER--DR 122
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHIHLLKEYP 708
LL + G+G K+ V L+ A VDV+V RI+ RLG + P ++ +
Sbjct: 123 LLALPGVGRKTANLVLGLAFGIPAICVDVHVHRISNRLGLITTTTPEKSEMALE------ 176
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ PR +++ TL+ + FG+ CK SP C CPL C
Sbjct: 177 -------AILPRRYWIEINTLF------VAFGQTLCKPVSPLCSRCPLADVC 215
>gi|346225871|ref|ZP_08847013.1| endonuclease III [Anaerophaga thermohalophila DSM 12881]
Length = 216
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 55/136 (40%), Gaps = 27/136 (19%)
Query: 628 ELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 687
E HG + PD VKE L + G+G K+ + + Q VD +V R++ R+
Sbjct: 100 EFHGEV---------PDDVKE-LQRLPGVGRKTANVIASVVYQKPVLAVDTHVFRVSARI 149
Query: 688 GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKR 747
G P + L+K YL + L H+ +I G+ C R
Sbjct: 150 GLTVNAKTPIEAEKQLIK-----------------YLPEDLLPIAHHWLILHGRYVCLAR 192
Query: 748 SPNCGACPLRGECKHF 763
P C C L+ CK++
Sbjct: 193 KPKCDQCGLKMWCKYY 208
>gi|417094306|ref|ZP_11957898.1| endonuclease III protein [Rhizobium etli CNPAF512]
gi|327194662|gb|EGE61511.1| endonuclease III protein [Rhizobium etli CNPAF512]
Length = 260
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 67/163 (41%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++ D IK G A + LV+ E+ VP + +E L+ + G+G
Sbjct: 100 EEKVRDYIKTIGLYRNKAKNVIALSQMLVD-------EFAGRVP--ETREELVRLPGVGR 150
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD ++ RIA R+ P + P ++ L+K P
Sbjct: 151 KTANVVLSMAFGQATMAVDTHIFRIANRIRLAPGK-TPDEVEARLMKVVP---------- 199
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++ LY H+ +I G+ CK R P C C + CK
Sbjct: 200 -------KQYLYHAHHWLILHGRYTCKARRPECERCVIADICK 235
>gi|302530869|ref|ZP_07283211.1| endonuclease III [Streptomyces sp. AA4]
gi|302439764|gb|EFL11580.1| endonuclease III [Streptomyces sp. AA4]
Length = 253
Score = 48.5 bits (114), Expect = 0.021, Method: Composition-based stats.
Identities = 50/191 (26%), Positives = 77/191 (40%), Gaps = 32/191 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +E+ + ++ G A + LVE +G VP K + L+ + G+
Sbjct: 85 ADRAELEEYLRTTGFFRAKANSLMGLGAALVERYGG-------EVP--KKLDDLVTLPGV 135
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV-MDKIQM 715
G K+ V + VD + GR+ R GW +E PV ++
Sbjct: 136 GRKTANVVLGNAFDVPGITVDTHFGRLVRRWGWTA-------------EEDPVKVEHAVG 182
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFA-- 773
L PR K L +++I G+ C R P CGACPLR +C F + A
Sbjct: 183 ELIPR------KEWTMLSHRVIFHGRRVCHARKPACGACPLRKDCPSFGAGPTEFEVAAK 236
Query: 774 -LPGPSEKGIV 783
+ GP + I+
Sbjct: 237 LVKGPEKDHIL 247
>gi|374853927|dbj|BAL56822.1| DNA repair protein HhH-GPD [uncultured candidate division OP1
bacterium]
Length = 197
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 69/164 (42%), Gaps = 17/164 (10%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
E ++A+ I+ G A ++K F+ L H DL + P K++ LL + G+G
Sbjct: 40 SEDQLAELIRPAGYFRQKAKKLKVFITHLYA-HYDGDLPTMLAQPTSKLRAELLSLWGIG 98
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
++ + + L + F VD RI RLG L E DK+Q L
Sbjct: 99 PETADSILLYAAHKPVFVVDAYTQRICSRLG--------------LASEKISYDKLQE-L 143
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ R K E H ++ GK C+ + P C C L EC+
Sbjct: 144 FMRKLPKSVKLFNEYHALLVRHGKEICRPK-PRCARCVLAHECQ 186
>gi|410583666|ref|ZP_11320771.1| DNA-(apurinic or apyrimidinic site) lyase [Thermaerobacter
subterraneus DSM 13965]
gi|410504528|gb|EKP94038.1| DNA-(apurinic or apyrimidinic site) lyase [Thermaerobacter
subterraneus DSM 13965]
Length = 258
Score = 48.5 bits (114), Expect = 0.022, Method: Composition-based stats.
Identities = 49/190 (25%), Positives = 75/190 (39%), Gaps = 27/190 (14%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ E E+ I+ G A + LVE HG VP + +E L+
Sbjct: 79 EAMLELSEDELGAMIRTIGLWRNKARNLLAACRILVERHGG-------QVP--RTREELM 129
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + A VD +V R+A RLG + P + L++++P +
Sbjct: 130 QLPGVGRKTANVVLSNAFGIPAIAVDTHVFRVARRLG-LATGTTPERVEQELMEKFPEAE 188
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASAR 771
W R H+ +I G+ C R+P C AC LR +C + A
Sbjct: 189 ------WSRA-----------HHWLIWHGRRICHARNPRCQACALRPDCPEGRARAGGAS 231
Query: 772 FALPGPSEKG 781
E G
Sbjct: 232 LEAAAEVEPG 241
>gi|387892350|ref|YP_006322647.1| endonuclease III [Pseudomonas fluorescens A506]
gi|423690227|ref|ZP_17664747.1| endonuclease III [Pseudomonas fluorescens SS101]
gi|387163314|gb|AFJ58513.1| endonuclease III [Pseudomonas fluorescens A506]
gi|388001854|gb|EIK63183.1| endonuclease III [Pseudomonas fluorescens SS101]
Length = 212
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 69/161 (42%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E L+ELHG VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRLLLELHGG-------EVP--QTREALEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G I K ++K M P+
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTG------------IARGKNVVEVEKQLMKFVPKP 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C+ R P CG+C + C++
Sbjct: 171 YLLDS------HHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|157804115|ref|YP_001492664.1| endonuclease III [Rickettsia canadensis str. McKiel]
gi|157785378|gb|ABV73879.1| Endonuclease III [Rickettsia canadensis str. McKiel]
Length = 209
Score = 48.5 bits (114), Expect = 0.022, Method: Composition-based stats.
Identities = 34/127 (26%), Positives = 58/127 (45%), Gaps = 19/127 (14%)
Query: 640 NVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDL 699
N P+ KE L+ + G+G K+ V VD +V R+A R+G
Sbjct: 102 NSVPNSFKE-LVKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVAKRIG----------- 149
Query: 700 HIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGE 759
L + + ++ L L L++K L H+ +I G+ CK R P+C CP++
Sbjct: 150 ----LAQGSTPEVVEKEL---LQILNKKWLMHAHHWLILHGRYICKARKPDCDICPVKKY 202
Query: 760 CKHFASA 766
C+++ +A
Sbjct: 203 CEYYLNA 209
>gi|398848557|ref|ZP_10605368.1| endonuclease III [Pseudomonas sp. GM84]
gi|398248014|gb|EJN33443.1| endonuclease III [Pseudomonas sp. GM84]
Length = 212
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
+A+ +++ IK G N A + E L+E HG VP + +E L
Sbjct: 62 QAIYALGVEGLSEYIKTIGLYNSKAKNVIEACRLLIERHGG-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + A VD ++ R++ R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQPAMAVDTHIFRVSNRTGIAPGKTVL-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ L + H+ +I G+ C+ R P CG+C + C++
Sbjct: 169 --------------KDYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|227823996|ref|YP_002827969.1| endonuclease III [Sinorhizobium fredii NGR234]
gi|227342998|gb|ACP27216.1| putative endonuclease III protein [Sinorhizobium fredii NGR234]
Length = 317
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 65/163 (39%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++ D IK G A + RL+ G VP K +E L+ + G+G
Sbjct: 158 EEKLRDHIKTIGLYRNKAKNVIALSERLIADFGG-------AVP--KTREELMTLPGVGR 208
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD ++ RIA R+ P + P ++ L++ P
Sbjct: 209 KTANVVLQMAFGQSTIAVDTHLFRIANRIRLAPGK-TPDEVEAKLMRVIP---------- 257
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K LY H+ +I G+ CK R P C C + CK
Sbjct: 258 -------PKYLYHAHHWLILHGRYVCKARRPECERCIIADICK 293
>gi|424916723|ref|ZP_18340087.1| endonuclease III [Rhizobium leguminosarum bv. trifolii WSM597]
gi|392852899|gb|EJB05420.1| endonuclease III [Rhizobium leguminosarum bv. trifolii WSM597]
Length = 261
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 49/115 (42%), Gaps = 18/115 (15%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
++ L+ + G+G K+ V ++ VD ++ RIA R+ P + P ++ L+K
Sbjct: 140 RDELVKLPGVGRKTANVVLSMAFGQATMAVDTHIFRIANRIKLAPGK-TPDEVEARLMKV 198
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P Q LY H+ +I G+ CK R P C C + CK
Sbjct: 199 IP-----------------QHYLYHAHHWLILHGRYTCKARRPECERCVIADLCK 236
>gi|340398994|ref|YP_004728019.1| putative endonuclease III [Streptococcus salivarius CCHSS3]
gi|338742987|emb|CCB93495.1| putative endonuclease III (DNA-(apurinic or apyrimidinic site)
lyase) [Streptococcus salivarius CCHSS3]
Length = 214
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 52/119 (43%), Gaps = 17/119 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K ++ L + G+G K+ V + VD +V R++ RL P ++ L+
Sbjct: 107 KTRKELESLPGVGRKTANVVLAEVYGIPSIAVDTHVSRVSKRLNIAPEDASVEEIEAELM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K+ P D I H++MI FG+ C ++P C CPL+ CK++
Sbjct: 167 KKIPKKDWIIS-----------------HHRMIFFGRYHCLAKNPKCQTCPLQSYCKYY 208
>gi|312143859|ref|YP_003995305.1| endonuclease III [Halanaerobium hydrogeniformans]
gi|311904510|gb|ADQ14951.1| endonuclease III [Halanaerobium hydrogeniformans]
Length = 216
Score = 48.5 bits (114), Expect = 0.023, Method: Composition-based stats.
Identities = 36/125 (28%), Positives = 57/125 (45%), Gaps = 19/125 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P K KE LL + G+G K+ V + AFPVD +V R++ RL +
Sbjct: 109 PHKRKE-LLKLAGVGRKTANVVLANAFDKAAFPVDTHVFRVSSRLA------------LS 155
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K V +K L P+ ++D LH+ +I G+ CK ++P+C C C +
Sbjct: 156 SAKNPEVTEKELTELIPKKYWID------LHHWLIDHGRAICKAQNPDCDNCFCSHLCPY 209
Query: 763 FASAV 767
+ + V
Sbjct: 210 YKNIV 214
>gi|433773185|ref|YP_007303652.1| endonuclease III [Mesorhizobium australicum WSM2073]
gi|433665200|gb|AGB44276.1| endonuclease III [Mesorhizobium australicum WSM2073]
Length = 270
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E+++ D I+ G A + L+ HG VP D+ + L+ + G+G
Sbjct: 107 ETKVGDHIRTIGLWRNKAKNVIALSQALIRDHGG-------EVPDDR--DELVKLPGVGR 157
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD ++ RI RLG P + P + LLK P
Sbjct: 158 KTANVVLNMAFGQHTMAVDTHILRIGNRLGLAPGK-TPEQVEQGLLKIIP---------- 206
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ + H+ +I G+ CK R P+C AC + CK
Sbjct: 207 -------DEYMRHAHHWLILHGRYVCKARKPDCPACVIADICK 242
>gi|390952666|ref|YP_006416425.1| putative endonuclease III-like protein [Thiocystis violascens DSM
198]
gi|390429235|gb|AFL76300.1| putative endonuclease III-like protein [Thiocystis violascens DSM
198]
Length = 232
Score = 48.5 bits (114), Expect = 0.023, Method: Composition-based stats.
Identities = 44/168 (26%), Positives = 72/168 (42%), Gaps = 19/168 (11%)
Query: 593 AVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLD 652
A+ D +E+A+A++ G N+ A R+ F VE + LE + + ++ LL
Sbjct: 72 AILALDPAELAEALRPAGYFNLKARRLSAFC---VEYLAAGGLEGWGALDTETLRRALLK 128
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
+ G+G ++ + + L + + F VD RI RLG + GD ++
Sbjct: 129 VNGIGPETADDMLLYAFERPVFVVDAYTRRIFARLG-----AIAGDEAYEQIRA-----G 178
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ L P D E H ++ K C+ R P C CPLR C
Sbjct: 179 FEQTLGP-----DVPLFKEYHALIVQHAKQACRAR-PICPDCPLREAC 220
>gi|332798680|ref|YP_004460179.1| DNA-(apurinic or apyrimidinic site) lyase [Tepidanaerobacter
acetatoxydans Re1]
gi|332696415|gb|AEE90872.1| DNA-(apurinic or apyrimidinic site) lyase [Tepidanaerobacter
acetatoxydans Re1]
Length = 285
Score = 48.5 bits (114), Expect = 0.023, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRI 683
G +DL LR +P D+ +E LL + G+G K +C+ L S++ AFP+DV + R+
Sbjct: 184 GKVDLYGLRELPTDEAREVLLTLPGVGRKVADCIMLYSMRKFDAFPIDVWIKRV 237
>gi|438001671|ref|YP_007271414.1| 8-oxoguanine-DNA-glycosylase [Tepidanaerobacter acetatoxydans Re1]
gi|432178465|emb|CCP25438.1| 8-oxoguanine-DNA-glycosylase [Tepidanaerobacter acetatoxydans Re1]
Length = 285
Score = 48.5 bits (114), Expect = 0.023, Method: Composition-based stats.
Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRI 683
G +DL LR +P D+ +E LL + G+G K +C+ L S++ AFP+DV + R+
Sbjct: 184 GKVDLYGLRELPTDEAREVLLTLPGVGRKVADCIMLYSMRKFDAFPIDVWIKRV 237
>gi|325275201|ref|ZP_08141168.1| endonuclease III [Pseudomonas sp. TJI-51]
gi|324099688|gb|EGB97567.1| endonuclease III [Pseudomonas sp. TJI-51]
Length = 212
Score = 48.5 bits (114), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIEACRLLIERHGS-------QVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + VD ++ R++ R G P + + ++ L+K P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQPTMAVDTHIFRVSNRTGIAPGKTVL-EVEKKLVKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ L + H+ +I G+ C+ R P CG+C + C++
Sbjct: 169 --------------KDYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|384567793|ref|ZP_10014897.1| endonuclease III [Saccharomonospora glauca K62]
gi|384523647|gb|EIF00843.1| endonuclease III [Saccharomonospora glauca K62]
Length = 227
Score = 48.5 bits (114), Expect = 0.024, Method: Composition-based stats.
Identities = 46/171 (26%), Positives = 70/171 (40%), Gaps = 29/171 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD +E+ + I+ G A + LVE HG P K+ + L+ + G+
Sbjct: 59 ADRTELEELIRPTGFFRNKATSLMGLGAALVERHGG--------EVPGKLDD-LVKLPGV 109
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM- 715
G K+ V + VD + GR+ R GW E PV + ++
Sbjct: 110 GRKTANVVLGEAFGVPGITVDTHFGRLVRRWGWTD-------------SEDPVKVEHEIG 156
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
L+PR K L +++I G+ C R P CGACPL +C F +
Sbjct: 157 SLFPR------KDWTLLSHRVIFHGRRVCHARKPACGACPLAKDCPSFGAG 201
>gi|255037376|ref|YP_003087997.1| endonuclease III [Dyadobacter fermentans DSM 18053]
gi|254950132|gb|ACT94832.1| endonuclease III [Dyadobacter fermentans DSM 18053]
Length = 220
Score = 48.1 bits (113), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 77/179 (43%), Gaps = 27/179 (15%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKE 648
D E++ ++ E+ I+ N A + LVE H + P V++
Sbjct: 61 DPESLAASNTEEVFTYIRSISYPNNKAKHLVGMARMLVEQFHSEV---------PSTVED 111
Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
L + G+G K+ + + A VD +V R++ RLG VP+ + P
Sbjct: 112 -LQKMPGVGRKTANVIASVIFSMPAMAVDTHVFRVSRRLGLVPMT-----------AKTP 159
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
+ + ++ + ++ + +++ H+ +I G+ C R+P C CPL C++F V
Sbjct: 160 LAVEREL-----VTHIPKHLIHKAHHWLILHGRYVCTARNPQCFQCPLSPFCRYFEKNV 213
>gi|409095774|ref|ZP_11215798.1| endonuclease III [Thermococcus zilligii AN1]
Length = 242
Score = 48.1 bits (113), Expect = 0.024, Method: Composition-based stats.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 18/120 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P+ + E L+ + G+G K V A PVD +V RI+ RLG P P
Sbjct: 118 PEDINE-LMKLPGIGRKCANIVLAYGFGKQAIPVDTHVNRISKRLGLAPPGVAP------ 170
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+K++ YL + + K +Y +++ M+ G+ C+ P C CPLR C +
Sbjct: 171 --------EKVEEYLAGLIP--EDKWIY-VNHAMVDHGRSICRPIKPKCDECPLRELCPY 219
>gi|380696329|ref|ZP_09861188.1| endonuclease III [Bacteroides faecis MAJ27]
Length = 225
Score = 48.1 bits (113), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 57/124 (45%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD +++ L+ + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 106 PDNMED-LIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P + + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVKNIP-----------------ENLIPIAHHWLILHGRYVCQARTPKCDTCGLQMMCK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|424869508|ref|ZP_18293209.1| Putative endonuclease III [Leptospirillum sp. Group II 'C75']
gi|124514281|gb|EAY55795.1| putative endonuclease III [Leptospirillum rubarum]
gi|387220626|gb|EIJ75278.1| Putative endonuclease III [Leptospirillum sp. Group II 'C75']
Length = 210
Score = 48.1 bits (113), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 69/162 (42%), Gaps = 29/162 (17%)
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNR-LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
S+I I G A IK R L E G I PD + E LL + G+GL
Sbjct: 68 SDIESLIFPVGFYRTKAKTIKTIAERVLTEFGGKI---------PDTL-EGLLSLPGVGL 117
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V + + F VD++V RI R W +Q D H+++ PV+ + W
Sbjct: 118 KTANLVLTVGFEKEGFCVDIHVHRILNR--WGVIQTHSPDETYHIVE--PVLPR----KW 169
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
R + ++ FG+ FC+ SP C CPL +C
Sbjct: 170 KR----------RANALLVAFGQHFCRPVSPFCSVCPLLPDC 201
>gi|393216983|gb|EJD02473.1| DNA glycosylase [Fomitiporia mediterranea MF3/22]
Length = 570
Score = 48.1 bits (113), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 35/150 (23%), Positives = 61/150 (40%), Gaps = 24/150 (16%)
Query: 616 AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFP 675
A R+ ++V+ +G + PD K+ + ++ G+G S + ++ H
Sbjct: 172 AARLLSGAQKIVKEYGGLF--------PDNAKDMMANVPGIGRYSAGAISSIAYNHCESV 223
Query: 676 VDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYL--DQKTLYELH 733
+D NV R+ R+ + P K +D LW + D + ++
Sbjct: 224 LDGNVNRLLSRVLALHANP----------KSKATLD----VLWDGAAAMVKDTEEPGNIN 269
Query: 734 YQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+I G CK R P+C ACPLR C +
Sbjct: 270 QALIELGSTVCKPRDPDCAACPLRDYCNAY 299
>gi|317502642|ref|ZP_07960762.1| endonuclease III [Prevotella salivae DSM 15606]
gi|315666261|gb|EFV05808.1| endonuclease III [Prevotella salivae DSM 15606]
Length = 229
Score = 48.1 bits (113), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 68/171 (39%), Gaps = 29/171 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYL 650
EA+ A+ +I + IK N A + E LVE +G + PD +E L
Sbjct: 63 EAMAKANWEDIFELIKSVSYPNAKAHHLSEMSKILVERFNGKV---------PDNTEE-L 112
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHIHLLKEYPV 709
+ G+G K+ V+ + VD +V R++ RL VP P + + LLK P
Sbjct: 113 TQLPGVGRKTANVVQSVWFGKPTLAVDTHVYRVSHRLSLVPEAANTPLKVELELLKHIPE 172
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D + H+ ++ G+ CK + P C CP C
Sbjct: 173 AD-----------------VSNAHHWLLLHGRYVCKSQKPQCDDCPFNTIC 206
>gi|26987828|ref|NP_743253.1| endonuclease III [Pseudomonas putida KT2440]
gi|24982528|gb|AAN66717.1|AE016300_2 endonuclease III [Pseudomonas putida KT2440]
Length = 212
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E H S VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIEACRLLIERHDS-------QVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + A VD ++ R++ R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQPAMAVDTHIFRVSNRTGIAPGKTVL-EVEKKLLKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ L + H+ +I G+ C+ R P CG+C + C++
Sbjct: 169 --------------KDYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|257869416|ref|ZP_05649069.1| endonuclease III [Enterococcus gallinarum EG2]
gi|257803580|gb|EEV32402.1| endonuclease III [Enterococcus gallinarum EG2]
Length = 221
Score = 48.1 bits (113), Expect = 0.025, Method: Composition-based stats.
Identities = 47/170 (27%), Positives = 72/170 (42%), Gaps = 26/170 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A EI + I+ G A IK ++L+E VP + +E L+
Sbjct: 63 EALAAAPVEEIIEKIRTIGLYRNKAKNIKACASQLIERFNG-------QVP--RTREELV 113
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + A VD +V R+ RL L D ++ L E +M
Sbjct: 114 SLPGVGRKTANVVLGDAFGIPAIAVDTHVERVTKRLRICRL-----DANV-LEVEQTLMK 167
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ LW + H+ +I FG+ C R+P C CPL C+
Sbjct: 168 KVPEDLWVKT-----------HHTLIFFGRYHCTARAPKCEVCPLLTMCQ 206
>gi|255693170|ref|ZP_05416845.1| endonuclease III [Bacteroides finegoldii DSM 17565]
gi|260621061|gb|EEX43932.1| endonuclease III [Bacteroides finegoldii DSM 17565]
Length = 225
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 57/124 (45%), Gaps = 19/124 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD +++ L + G+G K+ ++ + A VD +V R++ R+G VP P +
Sbjct: 106 PDNLED-LTKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPDSCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P ++ + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELIKNIP-----------------EELIPIAHHWLILHGRYVCQARTPKCATCGLQMICK 207
Query: 762 HFAS 765
+F +
Sbjct: 208 YFCN 211
>gi|452976633|gb|EME76448.1| endonuclease III [Bacillus sonorensis L12]
Length = 220
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 72/175 (41%), Gaps = 26/175 (14%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ L+E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLCKMLIEEYGG-------EVPRDR--DELVKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG+ + ++ L+K+ P + W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGFCRWKDSVTEVEKTLMKKIPESE------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 775
H+++I FG+ CK + P C CPL C+ LP
Sbjct: 176 ----------ATHHRLIFFGRYHCKAQRPKCEECPLFSLCREGQKRYKKGLVKLP 220
>gi|451812331|ref|YP_007448785.1| endonuclease III [Candidatus Kinetoplastibacterium galatii TCC219]
gi|451778233|gb|AGF49181.1| endonuclease III [Candidatus Kinetoplastibacterium galatii TCC219]
Length = 210
Score = 48.1 bits (113), Expect = 0.026, Method: Composition-based stats.
Identities = 30/117 (25%), Positives = 52/117 (44%), Gaps = 18/117 (15%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E L + G+G K+ V ++ VD +V R++ R G + P D+ L+K
Sbjct: 109 REELESLPGVGRKTSNVVLNIAFNQPTIAVDTHVFRVSNRTG-IAKGKNPLDVEKELVKN 167
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P +K L+ H+ +I FG+ C KR P C C ++ C+++
Sbjct: 168 VP-----------------EKYLHNAHHLLILFGRYICIKRKPKCNQCIIKDLCEYY 207
>gi|317128625|ref|YP_004094907.1| endonuclease III [Bacillus cellulosilyticus DSM 2522]
gi|315473573|gb|ADU30176.1| endonuclease III [Bacillus cellulosilyticus DSM 2522]
Length = 221
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ + I+ G A IK+ L+E + +P +K + L+ + G+G K+
Sbjct: 72 ELENDIRSIGLFRSKAKNIKKLCQSLIEDYNG-------EIPKEKSE--LVKLAGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+K+ P
Sbjct: 123 ANVVASVAFNEPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMKKLP------------ 170
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++ H+++I FG+ CK +SP C CPL C+
Sbjct: 171 -----KEEWSVSHHRLIFFGRYHCKAQSPRCNECPLLSLCR 206
>gi|299140539|ref|ZP_07033677.1| endonuclease III [Prevotella oris C735]
gi|298577505|gb|EFI49373.1| endonuclease III [Prevotella oris C735]
Length = 229
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 67/171 (39%), Gaps = 29/171 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYL 650
EA+ AD +I IK N A + E LVE +G + PD + L
Sbjct: 63 EAMAQADWEDIFQLIKSVSYPNSKAHHLSEMAKILVERFNGEV---------PDNTDD-L 112
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHIHLLKEYPV 709
+ G+G K+ V+ + VD +V R++ RLG VP P + + L+K P
Sbjct: 113 TQLPGVGRKTANVVQAVWFGKPTLAVDTHVYRVSHRLGLVPETANTPLKVELELMKYIPK 172
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
D + H+ ++ G+ CK + P C CP C
Sbjct: 173 AD-----------------VGNAHHWLLLHGRYICKSQRPQCQDCPFSTFC 206
>gi|167035313|ref|YP_001670544.1| endonuclease III [Pseudomonas putida GB-1]
gi|166861801|gb|ABZ00209.1| endonuclease III [Pseudomonas putida GB-1]
Length = 212
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E HGS VP + +E L
Sbjct: 62 EAIYALGVEGLSEYIKTIGLYNSKAKNVIEACRLLIERHGS-------EVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + VD ++ R++ R G P + + ++ L+K P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQPTMAVDTHIYRVSNRTGIAPGKTVL-EVEKKLVKFVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ L + H+ +I G+ C+ R P CG+C + C++
Sbjct: 169 --------------KDYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|452820109|gb|EME27156.1| HhH-GPD base excision DNA repair protein-related protein [Galdieria
sulphuraria]
Length = 203
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 43/86 (50%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
DW V A ++ + I+ G A RIKE L L + G + LE LR + V++
Sbjct: 110 DWNQVIEAPLRDVEECIRVAGLAKTKAARIKELLETLQKERGELSLESLRGSCTETVEKE 169
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFP 675
L +G+GLK++ CV SL FP
Sbjct: 170 LSRFKGVGLKTMACVSAFSLGLQVFP 195
>gi|256811111|ref|YP_003128480.1| HhH-GPD family protein [Methanocaldococcus fervens AG86]
gi|256794311|gb|ACV24980.1| HhH-GPD family protein [Methanocaldococcus fervens AG86]
Length = 211
Score = 48.1 bits (113), Expect = 0.027, Method: Composition-based stats.
Identities = 41/165 (24%), Positives = 76/165 (46%), Gaps = 16/165 (9%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + DE ++ + IK G N+ A R+K +V+ +GS + + +++ LL
Sbjct: 61 EKILNIDEDKLKELIKPAGFYNLKAKRLKNTTKFIVDNYGSTEGMAKTDKDTLTLRKELL 120
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
I G+G ++ + + L +L +F VD R+ RLG ++ E D
Sbjct: 121 SINGVGKETADSILLYALDRESFVVDAYTKRMFSRLG--------------IINEKAKYD 166
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
+I+ ++ + D + E H ++ K FC+K+ P C CP+
Sbjct: 167 EIKE-IFEKSLPKDLEIYKEYHALIVEHCKKFCRKK-PLCEKCPI 209
>gi|418032607|ref|ZP_12671090.1| endonuclease III [Bacillus subtilis subsp. subtilis str. SC-8]
gi|351471470|gb|EHA31591.1| endonuclease III [Bacillus subtilis subsp. subtilis str. SC-8]
Length = 185
Score = 48.1 bits (113), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 38 ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPRDR--DELVKLPGVGRKT 88
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+++ P D W
Sbjct: 89 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 141
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 142 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 172
>gi|149185054|ref|ZP_01863371.1| endonuclease III family protein [Erythrobacter sp. SD-21]
gi|148831165|gb|EDL49599.1| endonuclease III family protein [Erythrobacter sp. SD-21]
Length = 239
Score = 48.1 bits (113), Expect = 0.027, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 71/172 (41%), Gaps = 14/172 (8%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
WE V A E+ + + NI A R+K L L+EL G +DL L + +L
Sbjct: 72 WEGVAKAPLEELQAELATQTYPNIAAERLKASLTALIELRGLVDLSHLAEMETLPAMRWL 131
Query: 651 LDIEGLGLK-SVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
+ G+G K + + +L A +D + RI R+G VP + D ++ PV
Sbjct: 132 EQLPGIGRKIAAGVMNASTLDRPAIVLDGHHTRILQRMGLVPPKA-STDRAFEVI--MPV 188
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
M K W E H M G+ C+ +P+C AC CK
Sbjct: 189 MPK----EW------HGADFDEHHLLMKKLGQTTCRPAAPDCAACLALPLCK 230
>gi|436837987|ref|YP_007323203.1| endonuclease III [Fibrella aestuarina BUZ 2]
gi|384069400|emb|CCH02610.1| endonuclease III [Fibrella aestuarina BUZ 2]
Length = 218
Score = 48.1 bits (113), Expect = 0.027, Method: Composition-based stats.
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 17/121 (14%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P V E L + G+G K+ + + A VD +V R++ RLG VP
Sbjct: 106 PATVAE-LQRLPGVGRKTANVIVSVVYNQPAMAVDTHVFRVSHRLGLVPRTAT-----TP 159
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L E +M I L PR H+ +I G+ C RSP C C L+ CK+
Sbjct: 160 LAVEKSLMAHIPTALVPRF-----------HHWLILHGRYVCIARSPKCAECALKPVCKY 208
Query: 763 F 763
+
Sbjct: 209 Y 209
>gi|337266339|ref|YP_004610394.1| endonuclease III [Mesorhizobium opportunistum WSM2075]
gi|336026649|gb|AEH86300.1| endonuclease III [Mesorhizobium opportunistum WSM2075]
Length = 266
Score = 48.1 bits (113), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E+++ D I+ G A + L+ HG VP D+ + L+ + G+G
Sbjct: 107 EAKVGDHIRTIGLWRNKAKNVIALSEALIRDHGG-------AVPDDR--DELVKLPGVGR 157
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD ++ RI RLG P + P + LLK P
Sbjct: 158 KTANVVLNMAFGQHTMAVDTHIFRIGNRLGLAPGK-TPEQVEHGLLKIIP---------- 206
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ + H+ +I G+ CK R P+C AC + CK
Sbjct: 207 -------DEYMRHAHHWLILHGRYVCKARKPDCPACVIADICK 242
>gi|415725253|ref|ZP_11470256.1| endonuclease III [Gardnerella vaginalis 00703C2mash]
gi|388062058|gb|EIK84694.1| endonuclease III [Gardnerella vaginalis 00703C2mash]
Length = 237
Score = 48.1 bits (113), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 49/111 (44%), Gaps = 13/111 (11%)
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L + G+G K+ V + + FPVD +V R+ RLGWV + +E
Sbjct: 135 LTKLPGVGRKTANVVLGNAFKIPGFPVDTHVMRVTSRLGWVSEASAKAKDAAKIEREVTA 194
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
C+ ++ L +++I G++ C R+P CG CPLR C
Sbjct: 195 ------------CFAPEE-WTNLSHRLILHGRMICIARNPRCGDCPLRFLC 232
>gi|256420205|ref|YP_003120858.1| endonuclease III [Chitinophaga pinensis DSM 2588]
gi|256035113|gb|ACU58657.1| endonuclease III [Chitinophaga pinensis DSM 2588]
Length = 215
Score = 48.1 bits (113), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 50/118 (42%), Gaps = 17/118 (14%)
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ + G+G K+ + + Q VD +V R++ R+G P LLK
Sbjct: 112 LVKLPGVGRKTANVITSVVHQQPNMAVDTHVFRVSARIGLTTNATTPLQTEKQLLK---- 167
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAV 767
Y+ + ++ H+ +I G+ C RSP C C LR CK++ S V
Sbjct: 168 -------------YIPTEKVHIAHHWLILHGRYICVARSPKCEECGLRPVCKYYHSLV 212
>gi|357023960|ref|ZP_09086126.1| endonuclease III [Mesorhizobium amorphae CCNWGS0123]
gi|355544051|gb|EHH13161.1| endonuclease III [Mesorhizobium amorphae CCNWGS0123]
Length = 273
Score = 48.1 bits (113), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 67/163 (41%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E+++ D I+ G A + L+ HG VP D+ E L+ + G+G
Sbjct: 113 EAKVGDYIRTIGLWRNKARNVIALSQALIADHGG-------EVPDDR--EQLVKLPGVGR 163
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD ++ RI RLG P + ++++ L
Sbjct: 164 KTANVVLNMAFGQHTMAVDTHILRIGNRLGLAPGK---------------TPEQVEQGL- 207
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L + + + H+ +I G+ CK R P+C AC + CK
Sbjct: 208 --LTVIPDEYMRHAHHWLILHGRYVCKARKPDCPACVIADICK 248
>gi|220931920|ref|YP_002508828.1| endonuclease III [Halothermothrix orenii H 168]
gi|219993230|gb|ACL69833.1| endonuclease III [Halothermothrix orenii H 168]
Length = 212
Score = 48.1 bits (113), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 62/138 (44%), Gaps = 20/138 (14%)
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAV 685
++E+ + E+ VP ++ LL + G+G K+ V + FPVD +V RI+
Sbjct: 90 IIEVSNILIKEYGGKVP--GTRKELLKLPGVGRKTANVVLACAFNKKTFPVDTHVFRISN 147
Query: 686 RLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK 745
RLG V K +K M + P ++D +H+ +I G+ CK
Sbjct: 148 RLGLVS------------AKRTNEAEKQLMEVIPEEKWVD------MHHWLIFHGREVCK 189
Query: 746 KRSPNCGACPLRGECKHF 763
R+P C C L+ C ++
Sbjct: 190 ARNPACHFCELKPFCNYY 207
>gi|451947518|ref|YP_007468113.1| putative endonuclease III-like protein [Desulfocapsa sulfexigens
DSM 10523]
gi|451906866|gb|AGF78460.1| putative endonuclease III-like protein [Desulfocapsa sulfexigens
DSM 10523]
Length = 218
Score = 48.1 bits (113), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 72/160 (45%), Gaps = 16/160 (10%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
++ ++A+ I+ G NI A R+K L + + + +L + ++ +E LL ++G+G
Sbjct: 71 EQEQLAEYIRPSGYYNIKARRLKNLLQMISDKYEG-ELTYFLKDSLEESRENLLKVKGVG 129
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
++ + + L + + F +D R+ R Q + D L++ MD +
Sbjct: 130 PETADAILLYAAEKPVFVIDTYTHRVFSRH-----QLVEEDTDYFTLQQ-EFMDNL---- 179
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
P D E H ++ K FCKK P C CPL+
Sbjct: 180 -PE----DPVLFNEFHALIVAVAKEFCKKAKPRCSDCPLQ 214
>gi|404494924|ref|YP_006719030.1| endonuclease III-like protein [Geobacter metallireducens GS-15]
gi|418067578|ref|ZP_12704917.1| HhH-GPD family protein [Geobacter metallireducens RCH3]
gi|78192552|gb|ABB30319.1| endonuclease III-related protein [Geobacter metallireducens GS-15]
gi|373558576|gb|EHP84912.1| HhH-GPD family protein [Geobacter metallireducens RCH3]
Length = 223
Score = 48.1 bits (113), Expect = 0.029, Method: Composition-based stats.
Identities = 45/170 (26%), Positives = 74/170 (43%), Gaps = 17/170 (10%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+R + +A+AI+ G N+ + R+K+F L E HG LE + ++E LL
Sbjct: 62 EALRDVPAASLAEAIRPAGYFNVKSLRLKDFAGYLWERHGG-SLERMFAGDWHALREELL 120
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G ++ + + L + F VD R+ LG +L D
Sbjct: 121 GVRGIGPETADSILLYAGGKPTFVVDAYTKRLFAALG--------------ILNGSAGYD 166
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+++ L D + E H ++ GK C+KR P C C L C+
Sbjct: 167 EVRDLFMANLP-PDVRLFNEYHALIVEHGKRHCRKR-PLCPGCGLHLFCR 214
>gi|386814885|ref|ZP_10102103.1| endonuclease III ;DNA-(apurinic or apyrimidinic site) lyase
[Thiothrix nivea DSM 5205]
gi|386419461|gb|EIJ33296.1| endonuclease III ;DNA-(apurinic or apyrimidinic site) lyase
[Thiothrix nivea DSM 5205]
Length = 210
Score = 48.1 bits (113), Expect = 0.030, Method: Composition-based stats.
Identities = 44/169 (26%), Positives = 70/169 (41%), Gaps = 27/169 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
+A+ E + D IK G N A I E LVE H S VP D+ L
Sbjct: 62 KAIYALGEEGLKDYIKTIGLFNSKAKNIIETCRILVEKHHS-------QVPEDRAA--LE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ + + +H VD ++ R++ R G P + + ++ LLK P
Sbjct: 113 ALPGVGRKTANVILNTAFRHPTMAVDTHIFRVSNRTGIAPGRNVL-EVEKALLKHVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
++ L + H+ +I G+ C R+P CG C + C
Sbjct: 169 --------------KQYLLDAHHWLILHGRYTCVARNPKCGQCIIADLC 203
>gi|300857069|ref|YP_003782053.1| endonuclease III [Clostridium ljungdahlii DSM 13528]
gi|300437184|gb|ADK16951.1| endonuclease III [Clostridium ljungdahlii DSM 13528]
Length = 214
Score = 48.1 bits (113), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 18/116 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L+++ G+G K+ + V + A VD +V R++ RLG + P + + L+K
Sbjct: 109 EQLIELPGVGRKTADVVLSNAFGVPAIAVDTHVFRVSNRLG-IAKGTTPHKVEMELMKNI 167
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P +W + H+ +I G+ CK R P+C CPL C++F
Sbjct: 168 P------KSMWS-----------DSHHYLIWHGRRICKSRKPDCEHCPLAPYCEYF 206
>gi|403238043|ref|ZP_10916629.1| endonuclease III [Bacillus sp. 10403023]
Length = 219
Score = 48.1 bits (113), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 72/161 (44%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ + I+ G A I++ L+E +G I +P D+ + L + G+G K+
Sbjct: 72 ELQNDIRSIGLYRNKAKNIQKLCQMLLEDYGGI-------LPQDR--DELTKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG+ + ++ L+K+ P
Sbjct: 123 ANVVVSVAFGIPAIAVDTHVERVSKRLGFCRWKDSVLEVEKTLMKKVP------------ 170
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
++ H++MI FG+ CK ++P C CPL C+
Sbjct: 171 -----KEEWSATHHRMIFFGRYHCKAQNPQCEICPLLEVCR 206
>gi|372270183|ref|ZP_09506231.1| endonuclease III [Marinobacterium stanieri S30]
Length = 211
Score = 48.1 bits (113), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 69/171 (40%), Gaps = 27/171 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ + + IK G N A + + L+ELHGS VP D+ L
Sbjct: 62 EAMLALGVDGVKEHIKTIGLFNTKAENVIKTCRMLIELHGS-------EVPQDRAA--LE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + VD ++ R++ R G P + + D+ LL+ P
Sbjct: 113 KLPGVGRKTANVVLNTAFGQPTMAVDTHIFRVSNRTGIAPGKTVL-DVEKKLLRHVP--- 168
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
+ L + H+ +I G+ C R P CG+C + C +
Sbjct: 169 --------------KPFLVDAHHWLILHGRYVCTARKPKCGSCLIEDLCDY 205
>gi|268317390|ref|YP_003291109.1| endonuclease III [Rhodothermus marinus DSM 4252]
gi|262334924|gb|ACY48721.1| endonuclease III [Rhodothermus marinus DSM 4252]
Length = 267
Score = 48.1 bits (113), Expect = 0.030, Method: Composition-based stats.
Identities = 33/116 (28%), Positives = 50/116 (43%), Gaps = 17/116 (14%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G K+ V ++ A PVD +V R+A R+G V P ++ L ++
Sbjct: 97 EALESLPGVGPKTARVVASVAFGVAALPVDTHVYRVAHRIGLVRRARTPLEVERRLKRQL 156
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P D W E H+ +I G+ C R P+C C L C H+
Sbjct: 157 PARD------WG-----------EAHHLLILHGRYTCTARRPHCERCVLTDLCDHY 195
>gi|427386757|ref|ZP_18882954.1| endonuclease III [Bacteroides oleiciplenus YIT 12058]
gi|425725966|gb|EKU88833.1| endonuclease III [Bacteroides oleiciplenus YIT 12058]
Length = 224
Score = 48.1 bits (113), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 56/123 (45%), Gaps = 19/123 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHI 701
PD ++E L+ + G+G K+ ++ + A VD +V R++ RLG VP P +
Sbjct: 106 PDTLEE-LVKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRLGLVPDTCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K P D + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVKNIPEAD-----------------IPIAHHWLILHGRYVCQARTPQCDNCGLQLMCK 207
Query: 762 HFA 764
++
Sbjct: 208 YYC 210
>gi|420145504|ref|ZP_14652968.1| Endonuclease III [Lactobacillus coryniformis subsp. coryniformis
CECT 5711]
gi|398402878|gb|EJN56169.1| Endonuclease III [Lactobacillus coryniformis subsp. coryniformis
CECT 5711]
Length = 216
Score = 48.1 bits (113), Expect = 0.030, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 17/111 (15%)
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ + G+G K+ + V + AF VD +V R+ RL VP + ++ ++K P
Sbjct: 112 LMQLPGVGRKTADVVLADAFGIPAFAVDTHVTRVTKRLRMVPQKATVLEIEQRMMKLLP- 170
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LW H++MI FG+ C R+P C CPL C
Sbjct: 171 -----EDLW-----------VTAHHRMIYFGRYHCLARAPKCEQCPLLSMC 205
>gi|393788681|ref|ZP_10376808.1| endonuclease III [Bacteroides nordii CL02T12C05]
gi|392654361|gb|EIY48009.1| endonuclease III [Bacteroides nordii CL02T12C05]
Length = 215
Score = 48.1 bits (113), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 53/120 (44%), Gaps = 18/120 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL-QPLPGDLHIHLLKE 706
E L+ + G+G K+ ++ + A VD +V R++ R+G V P + L+K
Sbjct: 110 EELVKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVGANHTTPFSVEKELMKN 169
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
P +K + H+ +I G+ C+ R+P C C L+ CK+F A
Sbjct: 170 IP-----------------EKLVPIAHHWLILHGRYVCQARTPKCDTCGLQMMCKYFQEA 212
>gi|390935585|ref|YP_006393090.1| endonuclease III [Thermoanaerobacterium saccharolyticum
JW/SL-YS485]
gi|389571086|gb|AFK87491.1| endonuclease III [Thermoanaerobacterium saccharolyticum
JW/SL-YS485]
Length = 214
Score = 48.1 bits (113), Expect = 0.031, Method: Composition-based stats.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 19/125 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD + E L+++ G+G K+ V + + A VD +V R++ R+G +
Sbjct: 107 PDDM-EKLMELPGVGRKTANVVISNAFKKDAIAVDTHVFRVSNRIGLAESDDV------- 158
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L EY +MD I LW H+ +I G+ C R P C CP+ CK
Sbjct: 159 LKTEYQLMDIIPKNLWSLS-----------HHLLIYHGRNICAARKPKCDICPINHICKF 207
Query: 763 FASAV 767
+ + +
Sbjct: 208 YKNFI 212
>gi|333367814|ref|ZP_08460048.1| endonuclease III [Psychrobacter sp. 1501(2011)]
gi|332978351|gb|EGK15076.1| endonuclease III [Psychrobacter sp. 1501(2011)]
Length = 231
Score = 48.1 bits (113), Expect = 0.031, Method: Composition-based stats.
Identities = 42/169 (24%), Positives = 71/169 (42%), Gaps = 27/169 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ E + D IK G N A + + L+E H S PD K+ L
Sbjct: 79 EAIYALGEEGLKDYIKTIGLYNSKAKNVIKACKDLIEKHNS--------QVPDNRKD-LE 129
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + VD ++ R++ R G + + L E +++
Sbjct: 130 ALAGVGRKTANVVLNTAFGQPTMAVDTHIFRVSNRTGLATGKTV-------LAVEQKLVE 182
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+I + + + H+ +I G+ C+ R+P CGACP+ EC
Sbjct: 183 RIP-----------EDYILDAHHYLILHGRYTCQARTPKCGACPVYEEC 220
>gi|402300275|ref|ZP_10819796.1| endonuclease III [Bacillus alcalophilus ATCC 27647]
gi|401724572|gb|EJS97921.1| endonuclease III [Bacillus alcalophilus ATCC 27647]
Length = 216
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 55/117 (47%), Gaps = 17/117 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K ++ L+ + G+G K+ V ++ A VD +V R++ RLG + ++ L+
Sbjct: 107 KERDELVKLAGVGRKTANVVTSVAFGVPAIAVDTHVERVSKRLGICRWKDNVREVEETLM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K+ P ++ + H+Q+I FG+ CK +SP C CPL C+
Sbjct: 167 KKIP-----------------EERWSDTHHQLIFFGRYHCKAQSPQCPECPLLSMCR 206
>gi|325570288|ref|ZP_08146154.1| endonuclease III [Enterococcus casseliflavus ATCC 12755]
gi|420263227|ref|ZP_14765865.1| endonuclease III [Enterococcus sp. C1]
gi|325156771|gb|EGC68945.1| endonuclease III [Enterococcus casseliflavus ATCC 12755]
gi|394769515|gb|EJF49360.1| endonuclease III [Enterococcus sp. C1]
Length = 218
Score = 47.8 bits (112), Expect = 0.032, Method: Composition-based stats.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 17/117 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K +E L+ + G+G K+ V + A VD +V R++ RL L D ++ L
Sbjct: 107 KTREDLVSLPGVGRKTANVVLGDAYGIPAIAVDTHVERVSKRLRICKL-----DANV-LE 160
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
E +M KI LW + H+ MI FG+ C R+P C CPL C+
Sbjct: 161 VEQTLMKKIPEPLWVKT-----------HHTMIFFGRYHCTARAPKCEICPLLAMCQ 206
>gi|302390179|ref|YP_003826000.1| 8-oxoguanine DNA glycosylase [Thermosediminibacter oceani DSM
16646]
gi|302200807|gb|ADL08377.1| 8-oxoguanine DNA glycosylase domain protein [Thermosediminibacter
oceani DSM 16646]
Length = 292
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 69/146 (47%), Gaps = 23/146 (15%)
Query: 592 EAVRCADESEIADA-IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
E + E++I D RG+ I A R+ G I L+ L +P + ++YL
Sbjct: 152 EVLASLSEAQIRDTRCGFRGKYVIAAARMVA--------GGDIVLDELEKLPTGEARDYL 203
Query: 651 LDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY-- 707
+ I G+G K +CV L SL+ AFPVDV + R+ L + + +P + L+E+
Sbjct: 204 MKIPGVGRKIADCVLLFSLRKFDAFPVDVWIKRVVEHL-YFDGREMP----VKKLQEFAE 258
Query: 708 ----PVMDKIQMYL--WPRLCYLDQK 727
P+ Q YL + R C+ D K
Sbjct: 259 NRFGPLAGFAQQYLFHYTRTCWSDIK 284
>gi|225574096|ref|ZP_03782707.1| hypothetical protein RUMHYD_02161 [Blautia hydrogenotrophica DSM
10507]
gi|225038696|gb|EEG48942.1| endonuclease III [Blautia hydrogenotrophica DSM 10507]
Length = 210
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 71/168 (42%), Gaps = 29/168 (17%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
AD E+ IK G + A I RLV +G VP D E L+ + G+
Sbjct: 68 ADLRELEQDIKPTGFYHNKAKNIIGCAQRLVNEYGG-------EVPSDL--EALVSLPGV 118
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV-MDKIQM 715
G K+ +R + VD +V RI+ RLG +E PV ++K M
Sbjct: 119 GRKTANVIRGNIFHEPSVVVDTHVKRISRRLGLTR-------------EEDPVKIEKDLM 165
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
+ PR + LY + Q+ITFG+ C RSP C C L C +
Sbjct: 166 KVLPR----EHWILYNI--QIITFGRQICFARSPKCEECFLTKYCSEY 207
>gi|333396087|ref|ZP_08477904.1| endonuclease III [Lactobacillus coryniformis subsp. coryniformis
KCTC 3167]
Length = 216
Score = 47.8 bits (112), Expect = 0.032, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 17/111 (15%)
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ + G+G K+ + V + AF VD +V R+ RL VP + ++ ++K P
Sbjct: 112 LMQLPGVGRKTADVVLADAFGIPAFAVDTHVTRVTKRLRMVPQKATVLEIEQRMMKLLP- 170
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LW H++MI FG+ C R+P C CPL C
Sbjct: 171 -----EDLW-----------VTAHHRMIYFGRYHCLARAPKCEQCPLLSMC 205
>gi|435850481|ref|YP_007312067.1| putative endoIII-related endonuclease [Methanomethylovorans
hollandica DSM 15978]
gi|433661111|gb|AGB48537.1| putative endoIII-related endonuclease [Methanomethylovorans
hollandica DSM 15978]
Length = 208
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 73/169 (43%), Gaps = 29/169 (17%)
Query: 593 AVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV-ELHGSIDLEWLRNVPPDKVKEYLL 651
A+ AD ++ IK+ G + +GRI E L+ E G + PD + + LL
Sbjct: 59 AMAAADPEDLEKLIKDVGFYRVKSGRIIEISRILLQEYEGKV---------PDNIND-LL 108
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ CV + + A VD +V RI+ RL V P + + L + P
Sbjct: 109 KLPGVGRKTANCVLTYAFRKDAIAVDTHVHRISNRLCLVTTNA-PEETEVELERVVP--- 164
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LW Y+++ ++ FG+ C+ SP C C L C
Sbjct: 165 ---RELWQ---YVNE--------LLVRFGQDVCRPISPKCDICVLEDLC 199
>gi|410478975|ref|YP_006766612.1| EndoIII-related endonuclease [Leptospirillum ferriphilum ML-04]
gi|406774227|gb|AFS53652.1| putative EndoIII-related endonuclease [Leptospirillum ferriphilum
ML-04]
Length = 225
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 69/162 (42%), Gaps = 29/162 (17%)
Query: 600 SEIADAIKERGQQNIIAGRIKEFLNR-LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
S+I I G A IK R L E G I PD + E LL + G+GL
Sbjct: 83 SDIESLIFPVGFYRTKAKTIKTIAERVLTEFGGKI---------PDTL-EGLLSLPGVGL 132
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V + + F VD++V RI R W +Q D H+++ PV+ + W
Sbjct: 133 KTANLVLTVGFEKEGFCVDIHVHRILNR--WGVIQTHSPDETYHIVE--PVLPR----KW 184
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
R + ++ FG+ FC+ SP C CPL +C
Sbjct: 185 KR----------RANALLVAFGQHFCRPVSPFCSVCPLLPDC 216
>gi|403350158|gb|EJY74527.1| hypothetical protein OXYTRI_04215 [Oxytricha trifallax]
Length = 372
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 86/197 (43%), Gaps = 40/197 (20%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++ I++ G N A +K+ +++E + I VP D K L+ + G+G
Sbjct: 100 EQDLNRLIEKVGFHNKKAIYLKKAARQIIEDYKGI-------VPSDYDK--LIALPGVGP 150
Query: 659 KSVECVRLLSLQH-----IAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHIHLLKEYPVMDK 712
K L LQ+ + VD +V RIA RL WVP Q PG+ L+E+ DK
Sbjct: 151 KMAH----LLLQNCFDKTVGISVDTHVHRIANRLKWVPKQTKTPGETA-KALQEWLPQDK 205
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC----KHFASAVA 768
W ++ ++ ++ FG++ CK P C C R C K +
Sbjct: 206 -----WEKINWM-----------LVGFGQMICKPIGPRCYECKARDLCPFEPKSKPPTKS 249
Query: 769 SARFALPGPSEKGIVTS 785
++ A+ ++ G++ S
Sbjct: 250 KSKIAIVKSNDDGVINS 266
>gi|257075651|ref|ZP_05570012.1| endonuclease III [Ferroplasma acidarmanus fer1]
Length = 217
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 58/235 (24%), Positives = 93/235 (39%), Gaps = 48/235 (20%)
Query: 545 DMKND--RTKKTTPKKNSDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRC------ 596
D KN + ++ +P+ + + T W L+ I S H D V +A R
Sbjct: 3 DFKNIYMKIREVSPEHHFEFTDSPFWILITTILS------HRTKDIVTDQAARSLYNKYH 56
Query: 597 -------ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
AD +++ IK G N+ + RI +E+ I+ E+ VP +
Sbjct: 57 DSVGLENADPADVKAIIKYVGFSNVKSLRI-------IEIARIINHEYGGKVP--DTHDE 107
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L+ + G G K+ V A VD +V R++ R+G V + P + L P+
Sbjct: 108 LVKLPGTGSKTANIVLTQGFNIPAIAVDTHVFRVSNRIGLVHTKN-PDETEEALKSIVPL 166
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
++ E + M+ FGK CK SP C CP+ C +FA
Sbjct: 167 EYQV-----------------EFNPVMVEFGKNICKPVSPRCNICPVSDCCDYFA 204
>gi|383642104|ref|ZP_09954510.1| endonuclease III [Sphingomonas elodea ATCC 31461]
Length = 221
Score = 47.8 bits (112), Expect = 0.032, Method: Composition-based stats.
Identities = 41/162 (25%), Positives = 63/162 (38%), Gaps = 27/162 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E+ + + IK G N A + L+ HG VP D+ + L + G+G
Sbjct: 69 EAGLKEHIKTIGLFNTKAKNVIALSEALIRDHGGA-------VPEDR--DALEQLPGVGR 119
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V + H F VD ++ R+ R G P + P + + L K P
Sbjct: 120 KTANVVMNTAFGHETFAVDTHIFRVGNRTGLAPGKT-PLAVELKLEKAVP---------- 168
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
Q H+ +I G+ CK R+P C CP+ C
Sbjct: 169 -------QPFRLHAHHWLILHGRYICKARTPECWRCPVADLC 203
>gi|190893518|ref|YP_001980060.1| endonuclease III protein [Rhizobium etli CIAT 652]
gi|190698797|gb|ACE92882.1| endonuclease III protein [Rhizobium etli CIAT 652]
Length = 268
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E ++ D IK G A + LV+ E+ VP + +E L+ + G+G
Sbjct: 108 EEKVRDYIKTIGLYRNKAKNVIALSQMLVD-------EFAGRVP--ETREELVRLPGVGR 158
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD ++ RIA R+ P + P ++ L+K P
Sbjct: 159 KTANVVLSMAFGQATMAVDTHIFRIANRIRLAPGK-TPDEVEARLMKVVP---------- 207
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ LY H+ +I G+ CK R P C C + CK
Sbjct: 208 -------KHYLYHAHHWLILHGRYTCKARRPECERCVIADICK 243
>gi|259046849|ref|ZP_05737250.1| endonuclease III [Granulicatella adiacens ATCC 49175]
gi|259036472|gb|EEW37727.1| endonuclease III [Granulicatella adiacens ATCC 49175]
Length = 212
Score = 47.8 bits (112), Expect = 0.033, Method: Composition-based stats.
Identities = 42/170 (24%), Positives = 74/170 (43%), Gaps = 26/170 (15%)
Query: 593 AVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLD 652
A+ ++E + + I+ G A I+ +LVE + VP + +E L+
Sbjct: 64 ALANSEEEVVIECIQSLGLYRNKAKNIRLCAQQLVEKYNG-------EVP--RTREELVS 114
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
+ G+G K+ V ++ AF VD +V R++ RL + K+ + +
Sbjct: 115 LAGVGRKTANVVLSVAFGLPAFAVDTHVERVSKRL--------------QICKQSASVLE 160
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
++ L +L + + H+ MI FG+ C RSP C CPL C +
Sbjct: 161 VEETLCKKL---PKNKWGKAHHWMIFFGRYHCTARSPKCQGCPLLDLCAY 207
>gi|357454607|ref|XP_003597584.1| DEMETER-like protein [Medicago truncatula]
gi|355486632|gb|AES67835.1| DEMETER-like protein [Medicago truncatula]
Length = 219
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 30/35 (85%), Gaps = 1/35 (2%)
Query: 215 VVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKST 249
+VDSVV + + NVSD+LSSSA+MSL ARFPLKS+
Sbjct: 1 MVDSVVE-FFSLNVSDHLSSSAFMSLVARFPLKSS 34
>gi|189462599|ref|ZP_03011384.1| hypothetical protein BACCOP_03289 [Bacteroides coprocola DSM 17136]
gi|189430760|gb|EDU99744.1| endonuclease III [Bacteroides coprocola DSM 17136]
Length = 215
Score = 47.8 bits (112), Expect = 0.033, Method: Composition-based stats.
Identities = 31/115 (26%), Positives = 50/115 (43%), Gaps = 18/115 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHIHLLKE 706
E L+ + G+G K+ ++ + A VD +V R++ R+G VP P HL+K
Sbjct: 110 EELVKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPKTCTTPLATEKHLVK- 168
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
Y+ + + H+ +I G+ C R+P C C L G CK
Sbjct: 169 ----------------YIPENLIPTAHHWLILHGRYVCTARNPKCEECGLNGICK 207
>gi|424924648|ref|ZP_18348009.1| endonuclease III [Pseudomonas fluorescens R124]
gi|404305808|gb|EJZ59770.1| endonuclease III [Pseudomonas fluorescens R124]
Length = 212
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 67/161 (41%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVELH VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRMLVELHNG-------EVP--QTREELEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G P K ++K M P+
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTGIAPG------------KNVVEVEKKLMKFVPKD 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C R P CG+C + C++
Sbjct: 171 YLLDS------HHWLILHGRYVCLARKPRCGSCRIEDLCEY 205
>gi|339442257|ref|YP_004708262.1| hypothetical protein CXIVA_11940 [Clostridium sp. SY8519]
gi|338901658|dbj|BAK47160.1| hypothetical protein CXIVA_11940 [Clostridium sp. SY8519]
Length = 214
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 51/114 (44%), Gaps = 18/114 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L+ + G+G K+ +R + + VD +V RI+ RLG+ Q P + L+K+
Sbjct: 111 EDLISLPGVGRKTANVIRGNIYKDPSIVVDTHVKRISNRLGFTKEQD-PEKIEYDLMKKL 169
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P D Y + Q+IT G+ C RSP CG C L CK
Sbjct: 170 PK---------------DHWISYNM--QIITLGRTICTARSPKCGECFLSDLCK 206
>gi|452207737|ref|YP_007487859.1| endonuclease III [Natronomonas moolapensis 8.8.11]
gi|452083837|emb|CCQ37164.1| endonuclease III [Natronomonas moolapensis 8.8.11]
Length = 227
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 74/184 (40%), Gaps = 39/184 (21%)
Query: 584 DKMDSVDWEAVRC-ADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNV 641
D D+ D A ADESE+A AI N A I +VE+H G +
Sbjct: 59 DLFDAYDGPAAYAEADESELAAAIDSITYYNNKASYITSACRDIVEIHDGEV-------- 110
Query: 642 PPDKVKEYLLDIEGLGLKSVECVRLLSLQH-----IAFPVDVNVGRIAVRLGWVPLQPLP 696
PD + E L D+ G+G K+ V LQH VD +V RI+ RLG + P
Sbjct: 111 -PDTMSE-LTDLAGVGRKTANVV----LQHGHEVVEGIVVDTHVQRISRRLGLTE-EHTP 163
Query: 697 GDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
+ L+ P D W +L +I+ G+ C R+P+C C L
Sbjct: 164 ERIESDLMGVVPEGD------WKEFTHL-----------LISHGRDTCTARNPDCSNCVL 206
Query: 757 RGEC 760
C
Sbjct: 207 EDIC 210
>gi|375362747|ref|YP_005130786.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum CAU
B946]
gi|421731269|ref|ZP_16170395.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum M27]
gi|371568741|emb|CCF05591.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum CAU
B946]
gi|407075423|gb|EKE48410.1| endonuclease III [Bacillus amyloliquefaciens subsp. plantarum M27]
Length = 219
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEEYGG-------EVPKDR--DELVKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+K+ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMKKVPKED------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLPLCR 206
>gi|310791057|gb|EFQ26586.1| HhH-GPD superfamily base excision DNA repair protein [Glomerella
graminicola M1.001]
Length = 517
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 2/94 (2%)
Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
DE E G + G+ + L ++ G + L+ +R +P D V L +G
Sbjct: 287 DEKETGKVANVPGAAEMTTGQKEHQLKKIEA--GILTLDHIRAMPSDDVMMALTKYPHIG 344
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP 691
+K+ C+ L LQ +F VD +V R+ L WVP
Sbjct: 345 VKTAACLLLFCLQMPSFAVDTHVYRMCKWLYWVP 378
>gi|409082857|gb|EKM83215.1| hypothetical protein AGABI1DRAFT_69456 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 383
Score = 47.8 bits (112), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 79/187 (42%), Gaps = 36/187 (19%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EAV ADES +++AI + G G IK L H D +VP K + L
Sbjct: 167 EAVIAADESVMSEAINKVGFWRRKTGYIKRATQML---HDDFD----SDVP--KTVDELC 217
Query: 652 DIEGLGLKSVECVRLLSLQ-----HIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+ G+G K + L+LQ + VDV+V RI RLGW H K
Sbjct: 218 SLPGVGPK----MAFLALQVAWDLNHGIGVDVHVHRITNRLGW----------HQKPTKN 263
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPL--RGECKHFA 764
P ++ + W L + E+++ ++ FG++ C P C C L +G C
Sbjct: 264 -PEETRLNLQSW-----LPTEFHREINHMLVGFGQVICLPVGPRCDMCDLSTKGLCPSAQ 317
Query: 765 SAVASAR 771
AV ++
Sbjct: 318 KAVKRSK 324
>gi|451346582|ref|YP_007445213.1| endonuclease III [Bacillus amyloliquefaciens IT-45]
gi|449850340|gb|AGF27332.1| endonuclease III [Bacillus amyloliquefaciens IT-45]
Length = 219
Score = 47.8 bits (112), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEEYGG-------EVPKDR--DELVKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+K+ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMKKVPKED------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLPLCR 206
>gi|339325233|ref|YP_004684926.1| endonuclease III [Cupriavidus necator N-1]
gi|338165390|gb|AEI76445.1| endonuclease III [Cupriavidus necator N-1]
Length = 214
Score = 47.8 bits (112), Expect = 0.035, Method: Composition-based stats.
Identities = 43/163 (26%), Positives = 68/163 (41%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E+ +++ IK G A + E + LVE HG VPPD+ E L + G+G
Sbjct: 69 EAGLSEYIKTIGLYKTKAKHVIETCHILVERHGG-------KVPPDR--EALEALPGVGR 119
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V + VD ++ R+A R G P + + + LLK P
Sbjct: 120 KTANVVLNTAFGEPTIAVDTHIFRVANRTGLAPGKTVQ-IVEQKLLKCVP---------- 168
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ L++ H+ +I G+ CK R P C C + C+
Sbjct: 169 -------HEFLHDAHHWLILHGRYVCKARKPECWHCVIEPLCE 204
>gi|385830614|ref|YP_005868427.1| endonuclease III [Lactococcus lactis subsp. lactis CV56]
gi|326406622|gb|ADZ63693.1| endonuclease III [Lactococcus lactis subsp. lactis CV56]
Length = 218
Score = 47.8 bits (112), Expect = 0.035, Method: Composition-based stats.
Identities = 44/172 (25%), Positives = 68/172 (39%), Gaps = 23/172 (13%)
Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 649
D + + A EI I+ G A I LV G + L ++P DK +
Sbjct: 61 DAQTMSQAKVEEIEKLIRTIGLYKTKAKNILRTSQMLVTDFGDL----LPDLPKDK--KV 114
Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
L + G+G K+ V + VD +V R++ RL VP + ++ L+K P
Sbjct: 115 LQTLPGVGRKTANVVLAEAYGIPGIAVDTHVERVSKRLDIVPQKATVLEVEEKLMKLIP- 173
Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
QK + H+ +I FG+ C + P C CP+ CK
Sbjct: 174 ----------------QKKWVQAHHHLIFFGRYHCTAKKPKCADCPVLDYCK 209
>gi|424898948|ref|ZP_18322496.1| putative endoIII-related endonuclease [Prevotella bivia DSM 20514]
gi|388593658|gb|EIM33895.1| putative endoIII-related endonuclease [Prevotella bivia DSM 20514]
Length = 216
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 70/164 (42%), Gaps = 25/164 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A+ ++ + +K N A + E LVE +G VP D KE L+ + G+
Sbjct: 68 AEVEDVFEYVKSVSYPNSKAKHLVEMSRMLVEAYGG-------EVPSDP-KE-LVKLPGV 118
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V+ + VD +V R++ RLG VP KE K++ Y
Sbjct: 119 GRKTANVVQAVWFGKPTLAVDTHVYRVSHRLGLVP-------------KEANTPRKVEDY 165
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L + ++ ++ + H+ ++ G+ CK P C CP C
Sbjct: 166 L---MKHIAKEEVTNAHHWILLHGRYICKSARPLCEKCPFEAFC 206
>gi|398311183|ref|ZP_10514657.1| endonuclease III [Bacillus mojavensis RO-H-1]
Length = 219
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPKDR--DELVKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+++ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 206
>gi|375292201|ref|YP_005126740.1| endonuclease III [Corynebacterium diphtheriae INCA 402]
gi|371581872|gb|AEX45538.1| endonuclease III [Corynebacterium diphtheriae INCA 402]
Length = 251
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 74/175 (42%), Gaps = 27/175 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA A+E+E+ + I+ G A + +LV ++ +P D E+L+
Sbjct: 75 EAYASANEAELQEMIRPTGFYKAKAAHLIGMGQKLVT-------DFSGEIPRDL--EHLV 125
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ VR + VD + GR+ RLG Q P + E+ + D
Sbjct: 126 SLPGVGRKTAHVVRGNAFDIPGLTVDTHFGRLVRRLGLT-TQTNPVKV------EHEIAD 178
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
I+ W T++ +++I G+ C R+ CGAC L C + A
Sbjct: 179 LIEKKEW---------TMFS--HRIIFHGRRVCHSRTAACGACFLAPRCPSYGEA 222
>gi|424886110|ref|ZP_18309721.1| endonuclease III [Rhizobium leguminosarum bv. trifolii WSM2012]
gi|393177872|gb|EJC77913.1| endonuclease III [Rhizobium leguminosarum bv. trifolii WSM2012]
Length = 260
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 49/115 (42%), Gaps = 18/115 (15%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E L+ + G+G K+ V ++ VD ++ RIA R+ P + P ++ L+K
Sbjct: 139 REELVKLPGVGRKTANVVLSMAFGQATMAVDTHIFRIANRIRLAPGK-TPDEVEARLMKV 197
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P + LY H+ +I G+ CK R P C C + CK
Sbjct: 198 IP-----------------KHYLYHAHHWLILHGRYTCKARRPECERCVIADLCK 235
>gi|51893925|ref|YP_076616.1| endonuclease III [Symbiobacterium thermophilum IAM 14863]
gi|51857614|dbj|BAD41772.1| endonuclease III [Symbiobacterium thermophilum IAM 14863]
Length = 235
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 66/161 (40%), Gaps = 27/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
EI + I++ G A I+ L+E HG VP E L+ + G+G K+
Sbjct: 73 EIGEMIRDCGLWKSKAKNIQGLSQMLLEKHGG-------EVP--STMEELIQLPGVGRKT 123
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V + A VD +V R+A RLG + P + E +M++I W
Sbjct: 124 ANVVLSNAFGIPAIAVDTHVFRVANRLGLAEAKT-PEE------TERQLMERIPREYWS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ H+ +I G+ C R+P C CPL C+
Sbjct: 176 ----------QAHHWLIYHGRQVCHARNPQCSQCPLLPHCR 206
>gi|23335629|ref|ZP_00120863.1| COG0177: Predicted EndoIII-related endonuclease [Bifidobacterium
longum DJO10A]
gi|23464990|ref|NP_695593.1| endonuclease III [Bifidobacterium longum NCC2705]
gi|189440057|ref|YP_001955138.1| EndoIII-related endonuclease [Bifidobacterium longum DJO10A]
gi|227545733|ref|ZP_03975782.1| DNA-(apurinic or apyrimidinic site) lyase [Bifidobacterium longum
subsp. longum ATCC 55813]
gi|239622592|ref|ZP_04665623.1| endonuclease III [Bifidobacterium longum subsp. infantis CCUG
52486]
gi|296453434|ref|YP_003660577.1| endonuclease III [Bifidobacterium longum subsp. longum JDM301]
gi|312133395|ref|YP_004000734.1| nth [Bifidobacterium longum subsp. longum BBMN68]
gi|322688415|ref|YP_004208149.1| endonuclease III [Bifidobacterium longum subsp. infantis 157F]
gi|322690434|ref|YP_004220004.1| endonuclease III [Bifidobacterium longum subsp. longum JCM 1217]
gi|384202230|ref|YP_005587977.1| endonuclease III [Bifidobacterium longum subsp. longum KACC 91563]
gi|419848731|ref|ZP_14371820.1| endonuclease III [Bifidobacterium longum subsp. longum 1-6B]
gi|419855237|ref|ZP_14377999.1| endonuclease III [Bifidobacterium longum subsp. longum 44B]
gi|23325590|gb|AAN24229.1| endonuclease III [Bifidobacterium longum NCC2705]
gi|189428492|gb|ACD98640.1| Putative EndoIII-related endonuclease [Bifidobacterium longum
DJO10A]
gi|227213849|gb|EEI81688.1| DNA-(apurinic or apyrimidinic site) lyase [Bifidobacterium longum
subsp. infantis ATCC 55813]
gi|239514589|gb|EEQ54456.1| endonuclease III [Bifidobacterium longum subsp. infantis CCUG
52486]
gi|291517527|emb|CBK71143.1| DNA-(apurinic or apyrimidinic site) lyase /endonuclease III
[Bifidobacterium longum subsp. longum F8]
gi|296182865|gb|ADG99746.1| endonuclease III [Bifidobacterium longum subsp. longum JDM301]
gi|311772621|gb|ADQ02109.1| Nth [Bifidobacterium longum subsp. longum BBMN68]
gi|320455290|dbj|BAJ65912.1| endonuclease III [Bifidobacterium longum subsp. longum JCM 1217]
gi|320459751|dbj|BAJ70371.1| endonuclease III [Bifidobacterium longum subsp. infantis 157F]
gi|338755237|gb|AEI98226.1| endonuclease III [Bifidobacterium longum subsp. longum KACC 91563]
gi|386406935|gb|EIJ21927.1| endonuclease III [Bifidobacterium longum subsp. longum 1-6B]
gi|386415979|gb|EIJ30498.1| endonuclease III [Bifidobacterium longum subsp. longum 44B]
Length = 228
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 59/143 (41%), Gaps = 23/143 (16%)
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAV 685
L+ L ++D + VP + + L + G+G K+ V + FPVD +V R+
Sbjct: 95 LIGLATALDERFGGVVP--RTMDELTSLPGVGRKTANVVLGNAFDIPGFPVDTHVMRVTG 152
Query: 686 RL----GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGK 741
RL W P P + + +P + W L +++I FG+
Sbjct: 153 RLRWRSDWRSAHPDPVKIEKEITSCFPPEE------W-----------TNLSHRLILFGR 195
Query: 742 IFCKKRSPNCGACPLRGECKHFA 764
C R+P+C CPL C +A
Sbjct: 196 ATCHARTPDCANCPLSDTCPSYA 218
>gi|302343836|ref|YP_003808365.1| HhH-GPD family protein [Desulfarculus baarsii DSM 2075]
gi|301640449|gb|ADK85771.1| HhH-GPD family protein [Desulfarculus baarsii DSM 2075]
Length = 214
Score = 47.8 bits (112), Expect = 0.037, Method: Composition-based stats.
Identities = 43/151 (28%), Positives = 61/151 (40%), Gaps = 16/151 (10%)
Query: 606 IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
I+ G NI A R+ L R +E H L L P D+++ LL +G+G ++ + +
Sbjct: 77 IRPAGYYNIKAARLGHLL-RTMEAHREGGLSRLLARPTDELRHKLLATKGVGPETADSIL 135
Query: 666 LLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLD 725
L + F VD RI R G L + + + VMD D
Sbjct: 136 LYAAGRPIFVVDAYTFRILGRHG------LADESMGYFDLQEAVMDATPH---------D 180
Query: 726 QKTLYELHYQMITFGKIFCKKRSPNCGACPL 756
E H ++ GK CKK P C CPL
Sbjct: 181 AAFYNEFHALLVRLGKQRCKKSKPLCQGCPL 211
>gi|116253944|ref|YP_769782.1| endonuclease III [Rhizobium leguminosarum bv. viciae 3841]
gi|115258592|emb|CAK09696.1| putative endonuclease III [Rhizobium leguminosarum bv. viciae 3841]
Length = 268
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 29/115 (25%), Positives = 49/115 (42%), Gaps = 18/115 (15%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
++ L+ + G+G K+ V ++ VD ++ RIA R+ P + P ++ + L+K
Sbjct: 147 RDELVRLPGVGRKTANVVLSMAFGRATMAVDTHIFRIANRIRLAPGK-TPDEVEVRLMKV 205
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P LY H+ +I G+ CK R P C C + CK
Sbjct: 206 IP-----------------NHYLYHAHHWLILHGRYICKARRPECERCIIADLCK 243
>gi|405972859|gb|EKC37606.1| N-glycosylase/DNA lyase [Crassostrea gigas]
Length = 364
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 49/85 (57%), Gaps = 6/85 (7%)
Query: 612 QNIIAGRIKEFLNR----LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLL 667
+N+ G +++N+ ++E G L LR +P + K+ LL + G+G K +CV L+
Sbjct: 200 RNLGFGYRAKYINKSAQQILEKGGETWLRALREIPYAEAKKELLALNGVGAKVADCVCLM 259
Query: 668 SLQHI-AFPVDVNVGRIAVRLGWVP 691
SL A PVD +V +IA R G++P
Sbjct: 260 SLDKTDALPVDTHVWQIAAR-GYMP 283
>gi|453330098|dbj|GAC87821.1| A/G-specific adenine glycosylase [Gluconobacter thailandicus NBRC
3255]
Length = 353
Score = 47.8 bits (112), Expect = 0.039, Method: Composition-based stats.
Identities = 39/146 (26%), Positives = 65/146 (44%), Gaps = 15/146 (10%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
PD V+E LL + G+G + + ++ PVD NV R+ RL + PLP +
Sbjct: 104 PDTVEE-LLKLPGIGAYTARAIASIAFGVPVVPVDGNVERVTSRLFAIE-DPLPASRPL- 160
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
L ++ ++++ P+ QK + + G C R+P+C CP R C+
Sbjct: 161 LARQAALLNQ------PKAA---QKRPSDFAQALFDLGATVCTPRNPSCLTCPWRPACQG 211
Query: 763 FASAVASARFALPGPSEKGIVTSEFG 788
A +AS LP + K + +G
Sbjct: 212 HAKGIAS---TLPRKAPKQARPTRYG 234
>gi|429192463|ref|YP_007178141.1| endoIII-related endonuclease [Natronobacterium gregoryi SP2]
gi|448325876|ref|ZP_21515254.1| HhH-GPD family protein [Natronobacterium gregoryi SP2]
gi|429136681|gb|AFZ73692.1| putative endoIII-related endonuclease [Natronobacterium gregoryi
SP2]
gi|445613968|gb|ELY67653.1| HhH-GPD family protein [Natronobacterium gregoryi SP2]
Length = 270
Score = 47.8 bits (112), Expect = 0.039, Method: Composition-based stats.
Identities = 38/130 (29%), Positives = 55/130 (42%), Gaps = 22/130 (16%)
Query: 636 EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIA-FPVDVNVGRIAVRLGWVPLQP 694
E++ P+ V+E LL++ G+G K+ +CV L + FPVD +V RI RLG
Sbjct: 138 EFVTEKAPETVRETLLEVRGVGPKTADCVLLFAGGRTGVFPVDTHVHRIYRRLGIASPDA 197
Query: 695 LPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNC--- 751
++ L + P + C H I FG+ +C R P C
Sbjct: 198 DHEEVRTVLEEVVPA----------KKCGFG-------HTATIQFGREYCTARKPACLED 240
Query: 752 -GACPLRGEC 760
ACPL C
Sbjct: 241 PDACPLGDRC 250
>gi|386867584|ref|YP_006280578.1| EndoIII-related endonuclease [Bifidobacterium animalis subsp.
animalis ATCC 25527]
gi|385701667|gb|AFI63615.1| Putative EndoIII-related endonuclease [Bifidobacterium animalis
subsp. animalis ATCC 25527]
Length = 247
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 71/169 (42%), Gaps = 22/169 (13%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ A ++ I+ G ++ A I ++VE G VP + E L
Sbjct: 69 EALADARLEDVESIIRPLGFYHVKAEHIIAVARQIVERFGG-------EVP--QTMEELT 119
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + FPVD +V R+ RL W +K +
Sbjct: 120 SLPGVGRKTANVVLGNAFHVPGFPVDTHVIRVTGRLHW----------RDDWMKANTTPE 169
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+I+ + C+ + + +L +++I FG+ C RSP C CPL C
Sbjct: 170 RIEQEITG--CFPESE-WTDLSHRLIIFGRNICTSRSPECETCPLLPTC 215
>gi|398306834|ref|ZP_10510420.1| endonuclease III [Bacillus vallismortis DV1-F-3]
Length = 219
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPEDR--DELVKLPGVGRKT 122
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+++ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 175
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 206
>gi|257064026|ref|YP_003143698.1| endonuclease III [Slackia heliotrinireducens DSM 20476]
gi|256791679|gb|ACV22349.1| endonuclease III [Slackia heliotrinireducens DSM 20476]
Length = 210
Score = 47.8 bits (112), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 54/117 (46%), Gaps = 23/117 (19%)
Query: 648 EYLLDIEGLGLKS---VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQP-LPGDLHIHL 703
E L + G+G K+ V C + Q IA VD +V RIA RLG+ P + L
Sbjct: 108 EGLQSLPGVGRKTANVVMCQAFRNAQGIA--VDTHVFRIAHRLGFATRNDDTPDKVEAKL 165
Query: 704 LKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
LK YP D W LY +++Q + FG+ FC R+P C CP+ C
Sbjct: 166 LKVYPQTD------W----------LY-INHQWVHFGREFCSARNPKCLTCPIHDLC 205
>gi|451936569|ref|YP_007460423.1| endonuclease III [Candidatus Kinetoplastibacterium oncopeltii
TCC290E]
gi|451777492|gb|AGF48467.1| endonuclease III [Candidatus Kinetoplastibacterium oncopeltii
TCC290E]
Length = 210
Score = 47.8 bits (112), Expect = 0.040, Method: Composition-based stats.
Identities = 28/117 (23%), Positives = 53/117 (45%), Gaps = 18/117 (15%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
+E L + G+G K+ V ++ VD +V R++ R G + L D+ L++
Sbjct: 109 REGLESLPGVGRKTANVVLNVAFNQPTIAVDTHVFRVSNRTGIAKGKKLI-DVENKLIRN 167
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
P + L++ H+ ++ FG+ C KR+P C CP+ C+++
Sbjct: 168 IP-----------------NRYLHKAHHMLVLFGRYICVKRNPKCSHCPINDLCEYY 207
>gi|440227974|ref|YP_007335065.1| endonuclease III [Rhizobium tropici CIAT 899]
gi|440039485|gb|AGB72519.1| endonuclease III [Rhizobium tropici CIAT 899]
Length = 259
Score = 47.8 bits (112), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 51/117 (43%), Gaps = 18/117 (15%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
+ +E L+ + G+G K+ V ++ VD ++ RIA RL P + P ++ L+
Sbjct: 135 RTREELITLPGVGRKTANVVLSMAFGQATMAVDTHIFRIANRLLLAPGK-TPDEVEHRLM 193
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
K P DQ LY H+ +I G+ CK R P C C + C+
Sbjct: 194 KIIP----------------DQ-YLYHAHHWLILHGRYVCKARKPECERCVIADLCR 233
>gi|386010752|ref|YP_005929029.1| Endonuclease III [Pseudomonas putida BIRD-1]
gi|313497458|gb|ADR58824.1| Endonuclease III [Pseudomonas putida BIRD-1]
Length = 212
Score = 47.8 bits (112), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 40/172 (23%), Positives = 72/172 (41%), Gaps = 29/172 (16%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA+ +++ IK G N A + E L+E H S VP + +E L
Sbjct: 62 EAIHALGVEGLSEYIKTIGLYNSKAKNVIETCRLLIERHDS-------QVP--QTREALE 112
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V + + A VD ++ R++ R G P
Sbjct: 113 ALPGVGRKTANVVLNTAFRQPAMAVDTHIFRVSNRTGIAP-------------------G 153
Query: 712 KIQMYLWPRLC-YLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
K + + +L ++ + L + H+ +I G+ C+ R P CG+C + C++
Sbjct: 154 KTVLEVEKKLVKFVPKDYLLDAHHWLILHGRYVCQARKPRCGSCRIEDLCEY 205
>gi|452915860|ref|ZP_21964486.1| endonuclease III [Bacillus subtilis MB73/2]
gi|452116208|gb|EME06604.1| endonuclease III [Bacillus subtilis MB73/2]
Length = 203
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 56 ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPRDR--DELVKLPGVGRKT 106
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
V ++ A VD +V R++ RLG + ++ L+++ P D W
Sbjct: 107 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 159
Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
H+++I FG+ CK +SP C CPL C+
Sbjct: 160 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 190
>gi|403251702|ref|ZP_10918031.1| putative endoIII-related endonuclease [actinobacterium SCGC
AAA027-L06]
gi|402914990|gb|EJX35984.1| putative endoIII-related endonuclease [actinobacterium SCGC
AAA027-L06]
Length = 233
Score = 47.4 bits (111), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 67/170 (39%), Gaps = 31/170 (18%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYLLDIEG 655
AD +I +K G A IK N++++ G + P+K+ + L+ + G
Sbjct: 73 ADIKDIQRLVKSTGFYRAKAKNIKGLSNQIMDDFSGKV---------PNKLDD-LIQLPG 122
Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHIHLLKEYPVMDKIQ 714
+G K+ V + VD + GR++ R GW L P+ + + L
Sbjct: 123 VGRKTANVVLGHAFGIPGITVDTHFGRLSRRFGWSKNLDPVKVEFEVGEL---------- 172
Query: 715 MYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
+ QK L +MI G+ C R P CGAC L C F
Sbjct: 173 ---------ITQKEWTNLSQRMIWHGRRVCHSRKPACGACSLAKLCPSFG 213
>gi|70732175|ref|YP_261931.1| endonuclease III [Pseudomonas protegens Pf-5]
gi|68346474|gb|AAY94080.1| endonuclease III [Pseudomonas protegens Pf-5]
Length = 212
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 67/161 (41%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVE H S VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRLLVERHNS-------EVP--QTREELEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G P K ++K M P+
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTGLAPG------------KNVVEVEKKLMKFVPKE 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C R P CG+C + C++
Sbjct: 171 FLLDS------HHWLILHGRYVCLARKPRCGSCRIEDLCEY 205
>gi|209551017|ref|YP_002282934.1| endonuclease III [Rhizobium leguminosarum bv. trifolii WSM2304]
gi|209536773|gb|ACI56708.1| endonuclease III [Rhizobium leguminosarum bv. trifolii WSM2304]
Length = 260
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 49/115 (42%), Gaps = 18/115 (15%)
Query: 647 KEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKE 706
++ L+ + G+G K+ V ++ VD ++ RIA R+ P + P ++ L+K
Sbjct: 139 RDELVKLPGVGRKTANVVLSMAFGQATMAVDTHIFRIANRIKLAPGK-TPDEVEARLMKV 197
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P Q LY H+ +I G+ CK R P C C + CK
Sbjct: 198 IP-----------------QHYLYHAHHWLILHGRHTCKARRPECERCVIADLCK 235
>gi|218960389|ref|YP_001740164.1| putative endonuclease III (nth-like) [Candidatus Cloacamonas
acidaminovorans]
gi|167729046|emb|CAO79957.1| putative endonuclease III (nth-like) [Candidatus Cloacamonas
acidaminovorans str. Evry]
Length = 222
Score = 47.4 bits (111), Expect = 0.042, Method: Composition-based stats.
Identities = 31/118 (26%), Positives = 55/118 (46%), Gaps = 19/118 (16%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
P+++ + LL++ G+G K+ VR ++ Q A VDV+V RI R G++ + P + +
Sbjct: 119 PEEIDD-LLELPGVGRKTANLVRAVAFQKPAICVDVHVHRICNRWGYIQTKT-PLETEMT 176
Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L ++ P +K + ++ FG+ C R P C CP+ C
Sbjct: 177 LRQKLP-----------------EKYWLNFNSYLVAFGQNLCTPRKPKCEICPVAEFC 217
>gi|345006727|ref|YP_004809580.1| endonuclease III [halophilic archaeon DL31]
gi|344322353|gb|AEN07207.1| endonuclease III [halophilic archaeon DL31]
Length = 227
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 92/237 (38%), Gaps = 67/237 (28%)
Query: 560 SDNTVQQDWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRI 619
+D V + + L R Y T E+ ++ ADE ++A+ I N AG +
Sbjct: 49 TDERVNKVTEELYRTYETAEDFAN-------------ADEEQLAEEIYGITFHNSKAGYL 95
Query: 620 KEFLNRLVELH-GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-----A 673
K +VE H G + PD ++E L ++ G+G K+ V LQH
Sbjct: 96 KSIGEDIVEKHNGEV---------PDTMEE-LTELSGVGRKTANVV----LQHAHDVVEG 141
Query: 674 FPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELH 733
VD +V RI+ RLG + P + LL+ P + W + +L
Sbjct: 142 IVVDTHVQRISRRLGLTE-EERPEAIEQDLLEFVPRRN------WQQFTHL--------- 185
Query: 734 YQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNG 790
I G+ C R+P+CG C L C PSEKG + +G
Sbjct: 186 --FIDHGRAVCTARNPDCGDCELADIC----------------PSEKGDSDVDLASG 224
>gi|218531841|ref|YP_002422657.1| DNA-(apurinic or apyrimidinic site) lyase [Methylobacterium
extorquens CM4]
gi|218524144|gb|ACK84729.1| DNA-(apurinic or apyrimidinic site) lyase [Methylobacterium
extorquens CM4]
Length = 238
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 41/176 (23%), Positives = 70/176 (39%), Gaps = 31/176 (17%)
Query: 591 WEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYL 650
+E +R + E+ IK N +KE +++E +G N+P ++ + L
Sbjct: 87 FEELRDLPDDELRRIIKPVAHYNRKTKNLKEMARQIIEDYGG-------NIPDNR--DDL 137
Query: 651 LDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL--PGDLHIHLLKEYP 708
+ ++G+G K V+ + + + VD +V R+ RLG V DL + + P
Sbjct: 138 MKLQGVGRKCVDILMNFTFSQDSIAVDTHVLRVLNRLGVVDTTSAKQAADL---INAQTP 194
Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFA 764
K + W +I G C R+P C CPL C +A
Sbjct: 195 ARHKRHAHEW-----------------LIQHGMKICVARTPKCADCPLTKHCDWYA 233
>gi|319902841|ref|YP_004162569.1| endonuclease III; DNA-(apurinic or apyrimidinic site) lyase
[Bacteroides helcogenes P 36-108]
gi|319417872|gb|ADV44983.1| endonuclease III; DNA-(apurinic or apyrimidinic site) lyase
[Bacteroides helcogenes P 36-108]
Length = 224
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 31/122 (25%), Positives = 58/122 (47%), Gaps = 19/122 (15%)
Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHI 701
PD ++E L+ + G+G K+ ++ ++ A VD +V R++ RLG V + P +
Sbjct: 106 PDTLEE-LIKLPGVGRKTANVIQSVAFNKAAMAVDTHVFRVSHRLGLVSDKCTTPFSVEK 164
Query: 702 HLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
L+K Y+ + + H+ +I G+ C+ R+P C C L+ CK
Sbjct: 165 ELVK-----------------YIPEADISIAHHWLILHGRYTCQARTPQCDECGLQLLCK 207
Query: 762 HF 763
++
Sbjct: 208 YY 209
>gi|320105404|ref|YP_004180994.1| iron-sulfur cluster loop [Terriglobus saanensis SP1PR4]
gi|319923925|gb|ADV81000.1| iron-sulfur cluster loop [Terriglobus saanensis SP1PR4]
Length = 258
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 76/173 (43%), Gaps = 14/173 (8%)
Query: 589 VDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKE 648
V W+ +R A +EI ++I + A ++K+ L + G + L+ L +K++
Sbjct: 82 VCWDKLRDAPVAEIEESIALATFPDRKAPQLKQTLEEITRRVGKLSLDSLATYKTEKIRA 141
Query: 649 YLLDIEGLGLKSVECVRLLSLQH-IAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
+L G+G+K+ V S H A +D + RIA+RLG L P D
Sbjct: 142 WLEQFPGVGVKTSGAVVNFSWLHRKAICIDSHHQRIAIRLG---LAPKGADAR------- 191
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+++ M + P +Q + E H + G+ C R P C C LR C
Sbjct: 192 -AVEEQLMAIAPEEWTAEQ--MDEHHSLVKLHGQERCTFREPRCSRCALREVC 241
>gi|282858599|ref|ZP_06267761.1| endonuclease III [Prevotella bivia JCVIHMP010]
gi|282588603|gb|EFB93746.1| endonuclease III [Prevotella bivia JCVIHMP010]
Length = 206
Score = 47.4 bits (111), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 70/164 (42%), Gaps = 25/164 (15%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A+ ++ + +K N A + E LVE +G VP D KE L+ + G+
Sbjct: 58 AEVEDVFEYVKSVSYPNSKAKHLVEMSRMLVEAYGG-------EVPSDP-KE-LVKLPGV 108
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
G K+ V+ + VD +V R++ RLG VP KE K++ Y
Sbjct: 109 GRKTANVVQAVWFGKPTLAVDTHVYRVSHRLGLVP-------------KEANTPRKVEDY 155
Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
L + ++ ++ + H+ ++ G+ CK P C CP C
Sbjct: 156 L---MKHIAKEEVTNAHHWILLHGRYICKSARPLCEKCPFEAFC 196
>gi|383788949|ref|YP_005473518.1| putative DNA glycosylase/AP lyase [Caldisericum exile AZM16c01]
gi|381364586|dbj|BAL81415.1| putative DNA glycosylase/AP lyase [Caldisericum exile AZM16c01]
Length = 218
Score = 47.4 bits (111), Expect = 0.043, Method: Composition-based stats.
Identities = 46/162 (28%), Positives = 66/162 (40%), Gaps = 27/162 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E +I I G A +IKE L+E + VP D E LL ++G+G
Sbjct: 75 EKDIEKLIYPVGFYKRKAKQIKEVAKTLIEKYDG-------RVPNDL--EELLKLKGVGR 125
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V + VD +V RI+ RLGWV + P + L K P
Sbjct: 126 KTANLVITEAFDDYGICVDTHVHRISNRLGWVKTKN-PEQTEMELRKILP---------- 174
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+K ++ ++TFG+ CK SP C CP+ C
Sbjct: 175 -------KKYWKTINPILVTFGQNICKPISPLCSKCPIEPLC 209
>gi|297618317|ref|YP_003703476.1| HhH-GPD family protein [Syntrophothermus lipocalidus DSM 12680]
gi|297146154|gb|ADI02911.1| HhH-GPD family protein [Syntrophothermus lipocalidus DSM 12680]
Length = 239
Score = 47.4 bits (111), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 72/162 (44%), Gaps = 18/162 (11%)
Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
++ + I+ N A R+K F LV + DLE L ++ +++ LL ++G+G ++
Sbjct: 75 KLGNLIRPARYYNQKAARLKAF-AELVNIKYEGDLEKLLSLSTPELRMELLALKGIGPET 133
Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK-IQMYLWP 719
+ + L + F VD RI RLG+ L D+ +++++ + + YL+
Sbjct: 134 ADSILLYAANRPVFVVDAYTKRIFQRLGY-----LEADVGYSVVQDFFTSNLPCETYLFN 188
Query: 720 RLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
E H ++ G C R P C CPL CK
Sbjct: 189 -----------EYHALIVALGNRICLARRPLCQKCPLLHRCK 219
>gi|419851082|ref|ZP_14374041.1| endonuclease III [Bifidobacterium longum subsp. longum 35B]
gi|419852670|ref|ZP_14375533.1| endonuclease III [Bifidobacterium longum subsp. longum 2-2B]
gi|386407322|gb|EIJ22301.1| endonuclease III [Bifidobacterium longum subsp. longum 35B]
gi|386410084|gb|EIJ24895.1| endonuclease III [Bifidobacterium longum subsp. longum 2-2B]
Length = 228
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 59/143 (41%), Gaps = 23/143 (16%)
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAV 685
L+ L ++D + VP + + L + G+G K+ V + FPVD +V R+
Sbjct: 95 LIGLATALDERFGGVVP--RTMDELTSLPGVGRKTANVVLGNAFDIPGFPVDTHVMRVTG 152
Query: 686 RL----GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGK 741
RL W P P + + +P + W L +++I FG+
Sbjct: 153 RLRWRSDWRSAHPDPVKIEKEITSCFPPEE------W-----------TNLSHRLILFGR 195
Query: 742 IFCKKRSPNCGACPLRGECKHFA 764
C R+P+C CPL C +A
Sbjct: 196 ATCHARTPDCANCPLSDTCPSYA 218
>gi|213691195|ref|YP_002321781.1| endonuclease III [Bifidobacterium longum subsp. infantis ATCC 15697
= JCM 1222]
gi|213522656|gb|ACJ51403.1| endonuclease III [Bifidobacterium longum subsp. infantis ATCC 15697
= JCM 1222]
Length = 228
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 61/139 (43%), Gaps = 15/139 (10%)
Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAV 685
L+ L ++D + VP + + L + G+G K+ V + FPVD +V R+
Sbjct: 95 LIGLATALDERFGGVVP--RTMDGLTSLPGVGRKTANVVLGNAFDIPGFPVDTHVMRVTG 152
Query: 686 RLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCK 745
RL W +P KI+ + C+ ++ L +++I FG+ C
Sbjct: 153 RLRW----------RSDWRSAHPDPVKIEKEI--TACFPPEE-WTNLSHRLILFGRATCH 199
Query: 746 KRSPNCGACPLRGECKHFA 764
R+P+C CPL C +A
Sbjct: 200 ARTPDCANCPLSDTCPSYA 218
>gi|399039507|ref|ZP_10735069.1| endonuclease III [Rhizobium sp. CF122]
gi|398062392|gb|EJL54168.1| endonuclease III [Rhizobium sp. CF122]
Length = 243
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 27/163 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E + D IK G A + LV+ GS+ VP + ++ L+ + G+G
Sbjct: 83 EERVRDYIKTIGLYRNKAKNVIALSRMLVDKFGSV-------VP--QTRDELVTLPGVGR 133
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K+ V ++ VD ++ RIA R+ P + P ++ L+K P
Sbjct: 134 KTANVVLSMAFGQATMAVDTHIFRIANRIKLAPGK-TPDEVEERLMKVVP---------- 182
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
+ LY H+ +I G+ CK R P C C + CK
Sbjct: 183 -------RHYLYHAHHWLILHGRYTCKARKPECERCVIADICK 218
>gi|376247587|ref|YP_005139531.1| endonuclease III [Corynebacterium diphtheriae HC04]
gi|376250405|ref|YP_005137286.1| endonuclease III [Corynebacterium diphtheriae HC03]
gi|376256223|ref|YP_005144114.1| endonuclease III [Corynebacterium diphtheriae VA01]
gi|372111909|gb|AEX77968.1| endonuclease III [Corynebacterium diphtheriae HC03]
gi|372114155|gb|AEX80213.1| endonuclease III [Corynebacterium diphtheriae HC04]
gi|372118740|gb|AEX82474.1| endonuclease III [Corynebacterium diphtheriae VA01]
Length = 251
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 74/175 (42%), Gaps = 27/175 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA A+E+E+ + I+ G A + +LV ++ +P D E+L+
Sbjct: 75 EAYASANEAELQEMIRPTGFYKAKAAHLIGMGQKLVT-------DFSGEIPRDL--EHLV 125
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ VR + VD + GR+ RLG Q P + E+ + D
Sbjct: 126 SLPGVGRKTAHVVRGNAFDIPGLTVDTHFGRLVRRLGLT-TQTNPVKV------EHEIAD 178
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
I+ W T++ +++I G+ C R+ CGAC L C + A
Sbjct: 179 LIEKKEW---------TMFS--HRIIFHGRRVCHSRTAACGACFLAPRCPSYGEA 222
>gi|116627954|ref|YP_820573.1| endonuclease III, DNA repair [Streptococcus thermophilus LMD-9]
gi|386086829|ref|YP_006002703.1| Endonuclease III, DNA repair [Streptococcus thermophilus ND03]
gi|386344902|ref|YP_006041066.1| endonuclease III, DNA repair [Streptococcus thermophilus JIM 8232]
gi|387909931|ref|YP_006340237.1| endonuclease III, DNA repair [Streptococcus thermophilus
MN-ZLW-002]
gi|116101231|gb|ABJ66377.1| DNA-(apurinic or apyrimidinic site) lyase / endonuclease III
[Streptococcus thermophilus LMD-9]
gi|312278542|gb|ADQ63199.1| Endonuclease III, DNA repair [Streptococcus thermophilus ND03]
gi|339278363|emb|CCC20111.1| endonuclease III, DNA repair [Streptococcus thermophilus JIM 8232]
gi|387574866|gb|AFJ83572.1| endonuclease III, DNA repair [Streptococcus thermophilus
MN-ZLW-002]
Length = 214
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 51/119 (42%), Gaps = 17/119 (14%)
Query: 645 KVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLL 704
K + L + G+G K+ V + VD +V R++ RL P ++ L+
Sbjct: 107 KTHKELESLPGVGRKTANVVLAEVYGIPSIAVDTHVSRVSKRLNIAPENASVEEIEAELM 166
Query: 705 KEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
K+ P D I H++MI FG+ C ++P C CPL+ CK++
Sbjct: 167 KKIPKKDWIIS-----------------HHRMIFFGRYHCLAKNPKCQTCPLQSYCKYY 208
>gi|408356631|ref|YP_006845162.1| endonuclease III [Amphibacillus xylanus NBRC 15112]
gi|407727402|dbj|BAM47400.1| endonuclease III [Amphibacillus xylanus NBRC 15112]
Length = 218
Score = 47.4 bits (111), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 51/114 (44%), Gaps = 17/114 (14%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEY 707
E L + G+G K+ V ++ A VD +V R+ RLG + P ++ L+++
Sbjct: 110 EELESLAGVGRKTANVVLSVAFGVPAIAVDTHVERVTKRLGICRWKDTPLEVEKTLMRKV 169
Query: 708 PVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
P + W H+++I FG+ CK R+PNC C L C+
Sbjct: 170 PKEE------WS-----------ATHHRLIFFGRYHCKARTPNCDDCELLSLCR 206
>gi|387770312|ref|ZP_10126495.1| endonuclease III [Pasteurella bettyae CCUG 2042]
gi|386904674|gb|EIJ69463.1| endonuclease III [Pasteurella bettyae CCUG 2042]
Length = 211
Score = 47.4 bits (111), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 79/187 (42%), Gaps = 29/187 (15%)
Query: 578 GEERSHDKMDSVDW--EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDL 635
G ++ DK+ SV E + + + IK G N A I + L+E H S
Sbjct: 46 GVNKATDKLFSVANTPEDILALGVDGLKEYIKTIGLYNAKAENIIKTCRTLIEKHHS--- 102
Query: 636 EWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPL 695
VP ++ E L + G+G K+ V + H VD ++ R++ R G+ P + +
Sbjct: 103 ----QVPENR--EALEALAGVGRKTANVVLNTAFGHPTIAVDTHIFRVSNRTGFAPGKDV 156
Query: 696 PGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACP 755
L + V D+ +M ++H+ +I FG+ C R P CG+C
Sbjct: 157 ---LEVEEKLNKVVPDEFKM---------------DVHHWLILFGRYTCIARKPRCGSCL 198
Query: 756 LRGECKH 762
+ C++
Sbjct: 199 IEDLCEY 205
>gi|156742725|ref|YP_001432854.1| DNA-(apurinic or apyrimidinic site) lyase [Roseiflexus castenholzii
DSM 13941]
gi|156234053|gb|ABU58836.1| DNA-(apurinic or apyrimidinic site) lyase [Roseiflexus castenholzii
DSM 13941]
Length = 219
Score = 47.4 bits (111), Expect = 0.045, Method: Composition-based stats.
Identities = 47/169 (27%), Positives = 66/169 (39%), Gaps = 27/169 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E + E EIA+ I G A I+ RL+E HG VP D + LL
Sbjct: 69 EKMLALSEEEIAELIYPVGFYRNKARTIRAICRRLIEEHGG-------KVPADL--DALL 119
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ V VD +V RI R G+V + P + + L + P
Sbjct: 120 ALPGVGRKTANLVLTAGFDLPGICVDTHVHRICNRWGYVQTRT-PEETEMKLREILP--- 175
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
W E++ ++T G+ C SP C ACPL C
Sbjct: 176 ---FEYWK-----------EINGLLVTLGQNICHPTSPRCSACPLAHLC 210
>gi|325294609|ref|YP_004281123.1| DNA-(apurinic or apyrimidinic site) lyase [Desulfurobacterium
thermolithotrophum DSM 11699]
gi|325065057|gb|ADY73064.1| DNA-(apurinic or apyrimidinic site) lyase [Desulfurobacterium
thermolithotrophum DSM 11699]
Length = 218
Score = 47.4 bits (111), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 69/163 (42%), Gaps = 29/163 (17%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVE-LHGSIDLEWLRNVPPDKVKEYLLDIEGLG 657
E EIA I G A IKE L+E +G + PD++ E LL + G+G
Sbjct: 75 EEEIASLIYPVGFYRRKAKNIKEICKVLIEKYNGKV---------PDEIDE-LLKLPGVG 124
Query: 658 LKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYL 717
K+ V L VD +V RI+ RLG+V + P + E+ + +K+
Sbjct: 125 RKTANLVVTLGYGKPGICVDTHVHRISNRLGYVNTKT-PEE------TEFALREKLPKDY 177
Query: 718 WPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
W E++ +++ G+ C SP C CP+ C
Sbjct: 178 W-----------IEINDLLVSLGQHICHPTSPKCSQCPIEKYC 209
>gi|420239451|ref|ZP_14743771.1| endonuclease III [Rhizobium sp. CF080]
gi|398080505|gb|EJL71315.1| endonuclease III [Rhizobium sp. CF080]
Length = 267
Score = 47.4 bits (111), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 71/169 (42%), Gaps = 22/169 (13%)
Query: 595 RCADESEIADAIKERGQQNII--AGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLD 652
+ AD E A+ E G I G + ++ L + E+ VP K +E L+
Sbjct: 94 KIADTPEKMLALGEEGVTQYIKTIGLYRNKAKNVIALCRMLIDEFGSEVP--KTREELIR 151
Query: 653 IEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDK 712
+ G+G K+ V ++ VD +V RIA RL P + P ++ +LLK P
Sbjct: 152 LPGVGRKTANVVMSMAFGVPTLAVDTHVFRIANRLLMAPGK-TPDEVEENLLKVIP---- 206
Query: 713 IQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
DQ L+ H+ +I G+ CK R P C C + C+
Sbjct: 207 ------------DQ-YLFHAHHWLILHGRYCCKARKPECERCVIADLCR 242
>gi|281420062|ref|ZP_06251061.1| endonuclease III [Prevotella copri DSM 18205]
gi|281405862|gb|EFB36542.1| endonuclease III [Prevotella copri DSM 18205]
Length = 215
Score = 47.4 bits (111), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 63/165 (38%), Gaps = 27/165 (16%)
Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
A E +I + I N A + E +LVE+ G VP + + L + G+
Sbjct: 68 ASEEDIYELISSVSYPNAKAKHLAEMSRQLVEMFGG-------EVP--EAADDLEKLAGV 118
Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVP-LQPLPGDLHIHLLKEYPVMDKIQM 715
G K+ +R + H VD +V R++ R+G VP P + +L+K P D
Sbjct: 119 GRKTANVIRAVWFGHATMAVDTHVYRVSHRMGLVPKTADTPRKVEDYLMKHIPAEDIPNA 178
Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+ W ++ G+ CK P C C C
Sbjct: 179 HHW-----------------ILLHGRYICKSTKPLCDKCFFNEYC 206
>gi|451941382|ref|YP_007462019.1| endonuclease III [Bartonella vinsonii subsp. berkhoffii str.
Winnie]
gi|451900769|gb|AGF75231.1| endonuclease III [Bartonella vinsonii subsp. berkhoffii str.
Winnie]
Length = 246
Score = 47.4 bits (111), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 102/265 (38%), Gaps = 45/265 (16%)
Query: 525 ATTTSKKSPRGKGKSKEYKSDMKNDRTKKTT---PKKNSDNTVQQDWDLLRRIYSTGEER 581
A TSK S K Y D + ++ + P SD + LL + + +
Sbjct: 2 AQRTSKLSKMQKSSEILYSEDEIEEIFRRFSVQRPVPKSDLIYTNVFTLLVAVILSAQA- 60
Query: 582 SHDKMDSVDWEAVRCAD---------ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGS 632
+ ++ E R AD E EIA I+ G A + N L++ +G
Sbjct: 61 TDTSVNKATKELFRLADQPEKMVALGEEEIACHIRTIGLWRAKARNVYALCNFLIDQYGG 120
Query: 633 IDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPL 692
VP + E L+ + G+G K+ V ++ H VD ++ R+ RLG P
Sbjct: 121 -------KVPEKR--EDLMSLPGVGRKTANVVLNVAFGHPTLAVDTHILRLGNRLGLAPG 171
Query: 693 QPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCG 752
+ P + LLK P+ + L+ H+ +I G+ C+ R C
Sbjct: 172 KT-PEIVEEKLLKIIPI-----------------RYLHHAHHWLILHGRYICQARKVQCT 213
Query: 753 ACPLRGECKHFASAVASARFALPGP 777
C + CK +A+ + A+P P
Sbjct: 214 QCIIADLCK---AAIKTN--AIPAP 233
>gi|425789183|ref|YP_007017103.1| endonuclease III [Helicobacter pylori Aklavik117]
gi|425627498|gb|AFX90966.1| endonuclease III [Helicobacter pylori Aklavik117]
Length = 216
Score = 47.4 bits (111), Expect = 0.047, Method: Composition-based stats.
Identities = 38/128 (29%), Positives = 56/128 (43%), Gaps = 20/128 (15%)
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
+ V P KE L+ +EG+G K+ V + VD +V R RLG
Sbjct: 106 FKGVIPSTQKE-LMSLEGVGQKTANVVLSVCFDANCIAVDTHVFRTTHRLGL-------S 157
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
D + + E + D L + L +LH+ +I FG+ CK ++P CGAC L+
Sbjct: 158 DANTPIKTEKELSD------------LFKDNLSKLHHALILFGRYTCKAKNPLCGACFLK 205
Query: 758 GECKHFAS 765
C AS
Sbjct: 206 EFCVSKAS 213
>gi|325663087|ref|ZP_08151537.1| hypothetical protein HMPREF0490_02277 [Lachnospiraceae bacterium
4_1_37FAA]
gi|325470541|gb|EGC73771.1| hypothetical protein HMPREF0490_02277 [Lachnospiraceae bacterium
4_1_37FAA]
Length = 275
Score = 47.4 bits (111), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
Query: 631 GSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGR 682
G +DL+ +R++P K K+ LL + G+G K +C+ L +L H+ AFP+D ++ +
Sbjct: 183 GEVDLDAIRHLPYAKAKKELLKLFGVGEKVADCICLFALHHLQAFPIDTHISQ 235
>gi|288959678|ref|YP_003450019.1| endonuclease III [Azospirillum sp. B510]
gi|288911986|dbj|BAI73475.1| endonuclease III [Azospirillum sp. B510]
Length = 215
Score = 47.4 bits (111), Expect = 0.047, Method: Composition-based stats.
Identities = 40/155 (25%), Positives = 60/155 (38%), Gaps = 27/155 (17%)
Query: 606 IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
IK G N A + L+E HG VP D+ E L + G+G K+ V
Sbjct: 80 IKTIGLFNTKAKNVIRLSELLIERHGG-------EVPRDR--EALEQLPGVGRKTANVVL 130
Query: 666 LLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLD 725
++ VD ++ R+ R G P + P + LLK P + + + W
Sbjct: 131 NVAFGEETIAVDTHIFRVGNRTGLAPGKT-PDAVEAKLLKTVPKLYRRHAHHW------- 182
Query: 726 QKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
+I G+ CK R P+C CP+ C
Sbjct: 183 ----------LILHGRYVCKARKPDCPVCPVADLC 207
>gi|265751141|ref|ZP_06087204.1| endonuclease III [Bacteroides sp. 3_1_33FAA]
gi|263238037|gb|EEZ23487.1| endonuclease III [Bacteroides sp. 3_1_33FAA]
Length = 214
Score = 47.4 bits (111), Expect = 0.047, Method: Composition-based stats.
Identities = 29/116 (25%), Positives = 52/116 (44%), Gaps = 18/116 (15%)
Query: 648 EYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQ-PLPGDLHIHLLKE 706
E L+ + G+G K+ ++ + A VD +V R++ R+G VP P +L+K
Sbjct: 110 EELIKLPGVGRKTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVPQTCTTPLATEKYLMK- 168
Query: 707 YPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
Y+ ++ + H+ +I G+ C R+P C C L G C++
Sbjct: 169 ----------------YIPKEIVPTAHHWLILHGRYVCMARTPKCSECGLNGLCRY 208
>gi|239623210|ref|ZP_04666241.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
gi|239522176|gb|EEQ62042.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
Length = 261
Score = 47.4 bits (111), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 66/169 (39%), Gaps = 27/169 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
E AD E+ I G ++ A I LVE G PD + E L
Sbjct: 115 EKFAAADLKELEKDIHSTGFYHMKAKNIIACCKDLVEKFG--------GQVPDTI-EDLT 165
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ +R + VD +V RI+ +LG + P + L+K V+
Sbjct: 166 SLAGVGRKTANVIRGNIYNEPSIVVDTHVKRISRKLGLTKEED-PEKIEYDLMK---VLP 221
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
K LW + +IT G+ C R P CG C LR +C
Sbjct: 222 KDHWILW--------------NIHIITLGRTICIARRPGCGQCFLREDC 256
>gi|386754297|ref|YP_006227515.1| endonuclease III [Helicobacter pylori Shi112]
gi|384560555|gb|AFI01022.1| endonuclease III [Helicobacter pylori Shi112]
Length = 216
Score = 47.4 bits (111), Expect = 0.048, Method: Composition-based stats.
Identities = 37/128 (28%), Positives = 57/128 (44%), Gaps = 20/128 (15%)
Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPG 697
+ V P KE L+ ++G+G K+ V + VD +V R RLG
Sbjct: 106 FKGVIPSTQKE-LMSLDGVGQKTANVVLSVCFNANCIAVDTHVFRTTHRLGL-------S 157
Query: 698 DLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLR 757
D + + E + D L + L +LH+ +I FG+ CK ++P+CGAC L+
Sbjct: 158 DANTPIKTEEELSD------------LFKDNLSKLHHALILFGRYTCKAKNPSCGACFLK 205
Query: 758 GECKHFAS 765
C AS
Sbjct: 206 EFCVSKAS 213
>gi|389685832|ref|ZP_10177155.1| endonuclease III [Pseudomonas chlororaphis O6]
gi|399005502|ref|ZP_10708082.1| endonuclease III [Pseudomonas sp. GM17]
gi|425901222|ref|ZP_18877813.1| endonuclease III [Pseudomonas chlororaphis subsp. aureofaciens
30-84]
gi|388550174|gb|EIM13444.1| endonuclease III [Pseudomonas chlororaphis O6]
gi|397883764|gb|EJL00251.1| endonuclease III [Pseudomonas chlororaphis subsp. aureofaciens
30-84]
gi|398125990|gb|EJM15443.1| endonuclease III [Pseudomonas sp. GM17]
Length = 212
Score = 47.4 bits (111), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 67/161 (41%), Gaps = 27/161 (16%)
Query: 602 IADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSV 661
+++ IK G N A + E LVE H S VP + +E L + G+G K+
Sbjct: 72 LSEYIKTIGLYNSKAKNVIETCRLLVERHNS-------EVP--QTREELEALPGVGRKTA 122
Query: 662 ECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL 721
V + + + VD ++ R++ R G P K ++K M P+
Sbjct: 123 NVVLNTAFRQLTMAVDTHIFRVSNRTGLAPG------------KNVVEVEKKLMKFVPKE 170
Query: 722 CYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
LD H+ +I G+ C R P CG+C + C++
Sbjct: 171 YLLDS------HHWLILHGRYVCLARKPRCGSCRIEDLCEY 205
>gi|375289987|ref|YP_005124527.1| endonuclease III [Corynebacterium diphtheriae 241]
gi|376244820|ref|YP_005135059.1| endonuclease III [Corynebacterium diphtheriae HC01]
gi|376292379|ref|YP_005164053.1| endonuclease III [Corynebacterium diphtheriae HC02]
gi|371579658|gb|AEX43325.1| endonuclease III [Corynebacterium diphtheriae 241]
gi|372107450|gb|AEX73511.1| endonuclease III [Corynebacterium diphtheriae HC01]
gi|372109702|gb|AEX75762.1| endonuclease III [Corynebacterium diphtheriae HC02]
Length = 251
Score = 47.4 bits (111), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 74/175 (42%), Gaps = 27/175 (15%)
Query: 592 EAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLL 651
EA A+E+E+ + I+ G A + +LV ++ +P D E+L+
Sbjct: 75 EAYASANEAELQEMIRPTGFYKAKAAHLIGMGQKLVT-------DFSGEIPRDL--EHLV 125
Query: 652 DIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMD 711
+ G+G K+ VR + VD + GR+ RLG Q P + E+ + D
Sbjct: 126 SLPGVGRKTAHVVRGNAFDIPGLTVDTHFGRLVRRLGLT-TQTNPVKV------EHEIAD 178
Query: 712 KIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
I+ W T++ +++I G+ C R+ CGAC L C + A
Sbjct: 179 LIEKKEW---------TMFS--HRIIFHGRRVCHSRTAACGACFLAPRCPSYGEA 222
>gi|284161459|ref|YP_003400082.1| DNA-(apurinic or apyrimidinic site) lyase [Archaeoglobus profundus
DSM 5631]
gi|284011456|gb|ADB57409.1| DNA-(apurinic or apyrimidinic site) lyase [Archaeoglobus profundus
DSM 5631]
Length = 211
Score = 47.4 bits (111), Expect = 0.048, Method: Composition-based stats.
Identities = 50/165 (30%), Positives = 68/165 (41%), Gaps = 28/165 (16%)
Query: 599 ESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGL 658
E +I + IK G A +K+ LV +G PD ++E LL + G+G
Sbjct: 69 EDDIDELIKPVGFHRTKAKNLKKLAEVLVNNYGG--------KVPDNLEE-LLKLPGVGR 119
Query: 659 KSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLW 718
K V L L A VD +V RIA RLG V + P + L K P LW
Sbjct: 120 KVANIV-LAHLGKPAIAVDTHVHRIANRLGVVRTKR-PEETEKELKKIVP------KDLW 171
Query: 719 PRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
RL + + FG+ CK P C CP + C++F
Sbjct: 172 SRL-----------NKAFVGFGQTVCKPLKPLCEECPFKSFCEYF 205
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,696,015,446
Number of Sequences: 23463169
Number of extensions: 771276057
Number of successful extensions: 1987929
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 460
Number of HSP's successfully gapped in prelim test: 2705
Number of HSP's that attempted gapping in prelim test: 1981725
Number of HSP's gapped (non-prelim): 5891
length of query: 1105
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 951
effective length of database: 8,745,867,341
effective search space: 8317319841291
effective search space used: 8317319841291
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)