BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029597
(191 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255547375|ref|XP_002514745.1| endonuclease, putative [Ricinus communis]
gi|223546349|gb|EEF47851.1| endonuclease, putative [Ricinus communis]
Length = 203
Score = 289 bits (739), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 139/182 (76%), Positives = 157/182 (86%), Gaps = 4/182 (2%)
Query: 13 TATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRIS---SADSTANPRSFP 69
TA+ +S+P S RR T SATL D E+ TEE+ TLF DL+IS + S+ PRSFP
Sbjct: 23 TASFSSTPDSSHRRS-TLAHSATLQDREVLTEEDTLTLFEDLQISEKYTKFSSTEPRSFP 81
Query: 70 YSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL 129
YSVKQQCW+KAEKV+GRDPDRWR+D LGN+VFRKLVGCPGCLCHDYDHI+PYSKGGKSTL
Sbjct: 82 YSVKQQCWDKAEKVKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKGGKSTL 141
Query: 130 ENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCR 189
ENCQVLQATVNRSKGNRTELSR+DLIQ+SSYCRV+GRDMDL+ELSAYGNVR Q+SGGCR
Sbjct: 142 ENCQVLQATVNRSKGNRTELSRADLIQRSSYCRVAGRDMDLIELSAYGNVRHAQESGGCR 201
Query: 190 IQ 191
IQ
Sbjct: 202 IQ 203
>gi|225457397|ref|XP_002284877.1| PREDICTED: uncharacterized protein LOC100262165 isoform 1 [Vitis
vinifera]
Length = 199
Score = 285 bits (728), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 135/172 (78%), Positives = 150/172 (87%), Gaps = 1/172 (0%)
Query: 20 PSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEK 79
PS SR + S +LLD TE+E +TLF +L+IS + NPRSFPYSVKQQCWEK
Sbjct: 29 PSGSRSTTRLSPSSTSLLDTPFITEDEDATLFDNLKIS-PNPNFNPRSFPYSVKQQCWEK 87
Query: 80 AEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATV 139
AEK++GRDPDRWR+D LGN+VFRKLVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQATV
Sbjct: 88 AEKIKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQATV 147
Query: 140 NRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
NRSKGNRTE+S++DLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGC IQ
Sbjct: 148 NRSKGNRTEISKADLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCTIQ 199
>gi|449477241|ref|XP_004154969.1| PREDICTED: uncharacterized protein LOC101226861 [Cucumis sativus]
Length = 200
Score = 271 bits (692), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 129/156 (82%), Positives = 142/156 (91%), Gaps = 4/156 (2%)
Query: 38 DNELATEEEISTLFTDLRISSADSTAN--PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDA 95
D +L T E S+L DL++S DS +N PRSFP+SVKQQCW+KAEK++GRDPDRWR+DA
Sbjct: 47 DRDLVTGE--SSLLGDLKLSPDDSVSNYNPRSFPHSVKQQCWDKAEKIKGRDPDRWRRDA 104
Query: 96 LGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLI 155
LGN VFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE+SR++LI
Sbjct: 105 LGNTVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEISRAELI 164
Query: 156 QKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
QKSSYCRVSGRDMDLLELSAYGNV RGQDSGGCRIQ
Sbjct: 165 QKSSYCRVSGRDMDLLELSAYGNVHRGQDSGGCRIQ 200
>gi|449440796|ref|XP_004138170.1| PREDICTED: uncharacterized protein LOC101222461 [Cucumis sativus]
Length = 200
Score = 270 bits (691), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 129/156 (82%), Positives = 141/156 (90%), Gaps = 4/156 (2%)
Query: 38 DNELATEEEISTLFTDLRISSADSTAN--PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDA 95
D +L T E S+L DL+ S DS +N PRSFP+SVKQQCW+KAEK++GRDPDRWR+DA
Sbjct: 47 DRDLVTGE--SSLLGDLKFSPDDSVSNYNPRSFPHSVKQQCWDKAEKIKGRDPDRWRRDA 104
Query: 96 LGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLI 155
LGN VFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE+SR++LI
Sbjct: 105 LGNTVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEISRAELI 164
Query: 156 QKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
QKSSYCRVSGRDMDLLELSAYGNV RGQDSGGCRIQ
Sbjct: 165 QKSSYCRVSGRDMDLLELSAYGNVHRGQDSGGCRIQ 200
>gi|388520967|gb|AFK48545.1| unknown [Lotus japonicus]
Length = 198
Score = 257 bits (656), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/184 (69%), Positives = 145/184 (78%), Gaps = 14/184 (7%)
Query: 8 KRRKSTATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRS 67
K R +T + S SP R +T SATLLD++ LF D NPRS
Sbjct: 29 KDRSATTSSKLSSSPKR----STSPSATLLDDDPVA------LFDDY----THPNPNPRS 74
Query: 68 FPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKS 127
FP+SVKQ+CWEKA+K++GRDPDRWR+D LGN +FRKLVGCPGCLCHDYDHI+PYSKGG+S
Sbjct: 75 FPHSVKQKCWEKADKIKGRDPDRWRRDPLGNTIFRKLVGCPGCLCHDYDHIVPYSKGGQS 134
Query: 128 TLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGG 187
TLENCQVLQATVNR+KGNRT +SR+DL+QKSSYCRVSGRDMDLLELSAYGNVR G DSGG
Sbjct: 135 TLENCQVLQATVNRAKGNRTGISRADLMQKSSYCRVSGRDMDLLELSAYGNVRHGPDSGG 194
Query: 188 CRIQ 191
CRIQ
Sbjct: 195 CRIQ 198
>gi|351720795|ref|NP_001237956.1| uncharacterized protein LOC100500174 [Glycine max]
gi|255629565|gb|ACU15130.1| unknown [Glycine max]
Length = 206
Score = 256 bits (655), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 121/158 (76%), Positives = 141/158 (89%), Gaps = 4/158 (2%)
Query: 34 ATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRK 93
ATLLD ++++++LF D + + NPRSFP+SVKQ+CWEKA+KV+GRDPDRWR+
Sbjct: 53 ATLLD----VDDDVTSLFHDYPSPNPNPNPNPRSFPHSVKQKCWEKADKVKGRDPDRWRR 108
Query: 94 DALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSD 153
DALGN +FRKLVGCPGCLCHDYDHI+PYSKGG+STLENCQVLQATVNRSKGNRTE+S+++
Sbjct: 109 DALGNTLFRKLVGCPGCLCHDYDHIIPYSKGGESTLENCQVLQATVNRSKGNRTEISKAE 168
Query: 154 LIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
LIQKSSYCRVS RDMDLLELSAYGNVRRG DSGGCRIQ
Sbjct: 169 LIQKSSYCRVSDRDMDLLELSAYGNVRRGPDSGGCRIQ 206
>gi|115451005|ref|NP_001049103.1| Os03g0169800 [Oryza sativa Japonica Group]
gi|19071624|gb|AAL84291.1|AC073556_8 putative endonuclease [Oryza sativa Japonica Group]
gi|108706401|gb|ABF94196.1| HNH endonuclease domain-containing protein, putative, expressed
[Oryza sativa Japonica Group]
gi|113547574|dbj|BAF11017.1| Os03g0169800 [Oryza sativa Japonica Group]
gi|215686970|dbj|BAG90840.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218192172|gb|EEC74599.1| hypothetical protein OsI_10187 [Oryza sativa Indica Group]
Length = 187
Score = 256 bits (653), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 122/148 (82%), Positives = 134/148 (90%), Gaps = 2/148 (1%)
Query: 44 EEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRK 103
EE L T R S+ S++ PRSFPY VKQ+CWEKAE+V GRDP+RWR+DALGNVVFRK
Sbjct: 42 EELDGLLLTAPRPSA--SSSEPRSFPYVVKQRCWEKAERVAGRDPERWRRDALGNVVFRK 99
Query: 104 LVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRV 163
LVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQATVNRSKGN+TELS+S+LIQKS+YCRV
Sbjct: 100 LVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQATVNRSKGNKTELSKSELIQKSAYCRV 159
Query: 164 SGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
SGRDMDLLELSAYGNVRRG DSGGCRIQ
Sbjct: 160 SGRDMDLLELSAYGNVRRGPDSGGCRIQ 187
>gi|222624269|gb|EEE58401.1| hypothetical protein OsJ_09573 [Oryza sativa Japonica Group]
Length = 187
Score = 254 bits (649), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 121/148 (81%), Positives = 134/148 (90%), Gaps = 2/148 (1%)
Query: 44 EEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRK 103
EE L T R S+ S++ PRSFPY VKQ+CWEKAE++ GRDP+RWR+DALGNVVFRK
Sbjct: 42 EELDGLLLTAPRPSA--SSSEPRSFPYVVKQRCWEKAERLPGRDPERWRRDALGNVVFRK 99
Query: 104 LVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRV 163
LVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQATVNRSKGN+TELS+S+LIQKS+YCRV
Sbjct: 100 LVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQATVNRSKGNKTELSKSELIQKSAYCRV 159
Query: 164 SGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
SGRDMDLLELSAYGNVRRG DSGGCRIQ
Sbjct: 160 SGRDMDLLELSAYGNVRRGPDSGGCRIQ 187
>gi|15232817|ref|NP_190333.1| HNH endonuclease [Arabidopsis thaliana]
gi|6522532|emb|CAB61975.1| hypothetical protein [Arabidopsis thaliana]
gi|117168237|gb|ABK32201.1| At3g47490 [Arabidopsis thaliana]
gi|332644767|gb|AEE78288.1| HNH endonuclease [Arabidopsis thaliana]
Length = 234
Score = 252 bits (643), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 131/205 (63%), Positives = 153/205 (74%), Gaps = 24/205 (11%)
Query: 1 MRPNGEPKRRKSTATRASSPS----------PSRRRGET----TVRSATLLDNELATEEE 46
M+P +P RR++ A +SSPS P+R GE SATLLD EE
Sbjct: 40 MKP--DPTRRRNRALSSSSPSKTRPELSSGSPNRNTGEARGGKVKTSATLLD-----REE 92
Query: 47 ISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG 106
+ LF D PRSFPYSVKQQCWEKAEK++GRDP+RWR+D LGN+VFRKLVG
Sbjct: 93 MG-LFPG--SGYGDPNPEPRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVG 149
Query: 107 CPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGR 166
CPGCLCHDYDHI+PYSKGGKSTLENCQVLQA VNRSKGN+T++SRS+LIQ+SSYCRV+GR
Sbjct: 150 CPGCLCHDYDHIVPYSKGGKSTLENCQVLQAKVNRSKGNKTDISRSELIQRSSYCRVAGR 209
Query: 167 DMDLLELSAYGNVRRGQDSGGCRIQ 191
DMDL+EL+AYGNV+R S GCRIQ
Sbjct: 210 DMDLIELTAYGNVQRAPTSSGCRIQ 234
>gi|242036789|ref|XP_002465789.1| hypothetical protein SORBIDRAFT_01g045830 [Sorghum bicolor]
gi|241919643|gb|EER92787.1| hypothetical protein SORBIDRAFT_01g045830 [Sorghum bicolor]
Length = 188
Score = 250 bits (638), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 113/138 (81%), Positives = 128/138 (92%)
Query: 54 LRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH 113
L + S + PRSF Y+VKQQCWEKAE+V GRDP+RWR+DALGN+VFRKLVGCPGCLCH
Sbjct: 51 LLTAPPPSASEPRSFSYAVKQQCWEKAERVPGRDPERWRRDALGNIVFRKLVGCPGCLCH 110
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLEL 173
DYDHI+PYSKGGKSTLENCQVLQATVNRSKGN+TE+S+S+LIQKS+YCRVSGRDMDL+EL
Sbjct: 111 DYDHIVPYSKGGKSTLENCQVLQATVNRSKGNKTEISKSELIQKSAYCRVSGRDMDLVEL 170
Query: 174 SAYGNVRRGQDSGGCRIQ 191
SAYGNVRRG DSGGC+IQ
Sbjct: 171 SAYGNVRRGPDSGGCKIQ 188
>gi|326534162|dbj|BAJ89431.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 180
Score = 249 bits (637), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 120/186 (64%), Positives = 142/186 (76%), Gaps = 10/186 (5%)
Query: 6 EPKRRKSTATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANP 65
+P RR+ RA P P + G T ++ + ++ L S + P
Sbjct: 5 KPMRRR----RADHPPPPQSFGATARPTSPRSSTSASAVADLDELL------HTPSASEP 54
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
RSFP++VKQQCWEKAEKV GRDP+RWR+DALGN+VFRKLVGCPGCLCHDYDHILPYSKGG
Sbjct: 55 RSFPHAVKQQCWEKAEKVPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHILPYSKGG 114
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDS 185
KSTLENCQVLQATVNRSKGN TE+S+S+L Q+S+YCRVSGRDMDL E+SA+GNVRRG DS
Sbjct: 115 KSTLENCQVLQATVNRSKGNNTEISKSELAQRSAYCRVSGRDMDLFEVSAFGNVRRGPDS 174
Query: 186 GGCRIQ 191
GGC+IQ
Sbjct: 175 GGCKIQ 180
>gi|224104735|ref|XP_002313547.1| predicted protein [Populus trichocarpa]
gi|222849955|gb|EEE87502.1| predicted protein [Populus trichocarpa]
Length = 128
Score = 248 bits (634), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 113/127 (88%), Positives = 124/127 (97%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PRSFPYSVKQQCWEKAEKV+GRDPDRWR+D LGN+VFRKLVGCPGCLCHDYDHI+PYSKG
Sbjct: 2 PRSFPYSVKQQCWEKAEKVKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 61
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
GKS+LENCQVLQATVNRSKGNRTELSR++LI++SSYCRVSGRDMD +ELSAYGNVRR Q+
Sbjct: 62 GKSSLENCQVLQATVNRSKGNRTELSRAELIRRSSYCRVSGRDMDFVELSAYGNVRREQE 121
Query: 185 SGGCRIQ 191
SGGCRIQ
Sbjct: 122 SGGCRIQ 128
>gi|293331837|ref|NP_001168386.1| uncharacterized protein LOC100382155 [Zea mays]
gi|223947943|gb|ACN28055.1| unknown [Zea mays]
gi|414865044|tpg|DAA43601.1| TPA: hypothetical protein ZEAMMB73_851898 [Zea mays]
Length = 188
Score = 248 bits (634), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 112/138 (81%), Positives = 128/138 (92%)
Query: 54 LRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH 113
L + S + PRSF Y+VKQQCWEKAE++ GRDP+RWR+DALGN+VFRKLVGCPGCLCH
Sbjct: 51 LLTAPPPSASEPRSFSYAVKQQCWEKAERLPGRDPERWRRDALGNIVFRKLVGCPGCLCH 110
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLEL 173
DYDHI+PYSKGGKSTLENCQVLQATVNRSKGN+TE+S+S+LIQKS+YCRVSGRDMDL+EL
Sbjct: 111 DYDHIVPYSKGGKSTLENCQVLQATVNRSKGNKTEISKSELIQKSAYCRVSGRDMDLVEL 170
Query: 174 SAYGNVRRGQDSGGCRIQ 191
SAYGNVRRG DSGGC+IQ
Sbjct: 171 SAYGNVRRGPDSGGCKIQ 188
>gi|297815918|ref|XP_002875842.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata
subsp. lyrata]
gi|297321680|gb|EFH52101.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata
subsp. lyrata]
Length = 225
Score = 248 bits (634), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 128/197 (64%), Positives = 153/197 (77%), Gaps = 16/197 (8%)
Query: 1 MRPNGEPKRRKSTA-TRASSPSPSR-----RRGETTVRSATLLDNELATEEEISTLFTDL 54
M+P +P RR++ A + +SSPS +R RG SATLLD EE+ LF
Sbjct: 39 MKP--DPARRRNRALSPSSSPSKTRPGSGEARGGKVKTSATLLD-----REEMG-LFPG- 89
Query: 55 RISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
D PRSFPYSVKQQCWEKAEK++GRDP+RWR+D LGN+VFRKLVGCPGCLCHD
Sbjct: 90 -SGYDDPNPEPRSFPYSVKQQCWEKAEKIKGRDPERWRRDPLGNIVFRKLVGCPGCLCHD 148
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELS 174
YDHI+PYSKGGKS LENCQVLQA VNRSKGN+T++SR++LIQ+SSYCRV+GRDMDL+EL+
Sbjct: 149 YDHIVPYSKGGKSILENCQVLQAKVNRSKGNKTDISRAELIQRSSYCRVAGRDMDLIELT 208
Query: 175 AYGNVRRGQDSGGCRIQ 191
AYGNV+R +S GCRIQ
Sbjct: 209 AYGNVQRAPESSGCRIQ 225
>gi|357113864|ref|XP_003558721.1| PREDICTED: uncharacterized protein LOC100825547 [Brachypodium
distachyon]
Length = 188
Score = 247 bits (630), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 117/172 (68%), Positives = 137/172 (79%)
Query: 20 PSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEK 79
P PS+ G T ++ + + L L + S + PRSF Y+VKQQCWEK
Sbjct: 17 PPPSQSFGATARPNSPRSPTSASAASVSADLDGLLLTAPPPSASEPRSFSYAVKQQCWEK 76
Query: 80 AEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATV 139
AEKV GRDP+RWR+DALGN+VFRKLVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQATV
Sbjct: 77 AEKVPGRDPERWRRDALGNMVFRKLVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQATV 136
Query: 140 NRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
NRSKGN+TE+S+S+L Q+S+YCRVSGRDMDL ELSA+GNVRRG DSGGC+IQ
Sbjct: 137 NRSKGNKTEISKSELAQRSAYCRVSGRDMDLFELSAFGNVRRGPDSGGCKIQ 188
>gi|334185796|ref|NP_001190027.1| HNH endonuclease [Arabidopsis thaliana]
gi|332644768|gb|AEE78289.1| HNH endonuclease [Arabidopsis thaliana]
Length = 215
Score = 239 bits (610), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 153/225 (68%), Gaps = 44/225 (19%)
Query: 1 MRPNGEPKRRKSTATRASSPS----------PSRRRGET----TVRSATLLDNELATEEE 46
M+P +P RR++ A +SSPS P+R GE SATLLD EE
Sbjct: 1 MKP--DPTRRRNRALSSSSPSKTRPELSSGSPNRNTGEARGGKVKTSATLLD-----REE 53
Query: 47 ISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG 106
+ LF D PRSFPYSVKQQCWEKAEK++GRDP+RWR+D LGN+VFRKLVG
Sbjct: 54 MG-LFPG--SGYGDPNPEPRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVG 110
Query: 107 CPGCLCHDYDHILPYS--------------------KGGKSTLENCQVLQATVNRSKGNR 146
CPGCLCHDYDHI+PYS KGGKSTLENCQVLQA VNRSKGN+
Sbjct: 111 CPGCLCHDYDHIVPYSKDMCPLEHAFDEPHKDFLDLKGGKSTLENCQVLQAKVNRSKGNK 170
Query: 147 TELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
T++SRS+LIQ+SSYCRV+GRDMDL+EL+AYGNV+R S GCRIQ
Sbjct: 171 TDISRSELIQRSSYCRVAGRDMDLIELTAYGNVQRAPTSSGCRIQ 215
>gi|116794423|gb|ABK27138.1| unknown [Picea sitchensis]
Length = 200
Score = 239 bits (609), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 116/186 (62%), Positives = 135/186 (72%), Gaps = 22/186 (11%)
Query: 6 EPKRRKSTATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANP 65
+P R+KS A A RR T + +A + + E EE S NP
Sbjct: 37 KPWRKKSAAATA-------RR--TVIETAGIDEEEQVVGEESSEY-------------NP 74
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R+FPY +KQ CWEKA+K++GRDPDRWR+DALGN VFRKLVGC GCLCHDYDHI PYSKGG
Sbjct: 75 RAFPYHIKQACWEKADKIKGRDPDRWRRDALGNTVFRKLVGCSGCLCHDYDHIHPYSKGG 134
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDS 185
KST+ENCQVLQ NR+K N T++S +DLIQKS+YCRVSGRDMD LELSAYGNV RGQ+S
Sbjct: 135 KSTVENCQVLQGAANRAKSNHTDMSTTDLIQKSAYCRVSGRDMDFLELSAYGNVHRGQES 194
Query: 186 GGCRIQ 191
GGCRIQ
Sbjct: 195 GGCRIQ 200
>gi|168048449|ref|XP_001776679.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671971|gb|EDQ58515.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 129
Score = 189 bits (479), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 86/129 (66%), Positives = 102/129 (79%), Gaps = 3/129 (2%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
RSFP+ VK+ CW KAE V+GRDP RWR+DALGNVVFRKLVGC GCLCHDYDHI+PYSKGG
Sbjct: 1 RSFPHDVKEACWAKAETVKGRDPARWRRDALGNVVFRKLVGCQGCLCHDYDHIVPYSKGG 60
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDS 185
STL+NCQVLQA NR KG R + +++ Q S+YC ++GRDMD +ELSAYGNV +D
Sbjct: 61 PSTLDNCQVLQAPANRFKGARINVPVTEMQQHSAYCSLTGRDMDFVELSAYGNVNHNKDD 120
Query: 186 G---GCRIQ 191
GCR+Q
Sbjct: 121 EGGIGCRMQ 129
>gi|359491703|ref|XP_003634310.1| PREDICTED: uncharacterized protein LOC100262165 isoform 2 [Vitis
vinifera]
Length = 167
Score = 182 bits (463), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 84/121 (69%), Positives = 98/121 (80%), Gaps = 1/121 (0%)
Query: 20 PSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEK 79
PS SR + S +LLD TE+E +TLF +L+IS + NPRSFPYSVKQQCWEK
Sbjct: 29 PSGSRSTTRLSPSSTSLLDTPFITEDEDATLFDNLKIS-PNPNFNPRSFPYSVKQQCWEK 87
Query: 80 AEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATV 139
AEK++GRDPDRWR+D LGN+VFRKLVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQ +
Sbjct: 88 AEKIKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQENM 147
Query: 140 N 140
+
Sbjct: 148 D 148
>gi|224119042|ref|XP_002317971.1| predicted protein [Populus trichocarpa]
gi|222858644|gb|EEE96191.1| predicted protein [Populus trichocarpa]
Length = 174
Score = 140 bits (352), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 61/125 (48%), Positives = 87/125 (69%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PR F VK+ CW+ AE V GR P+RWRKDA GN+V ++ C GCLC++YDHILP+SKG
Sbjct: 24 PRFFDKMVKKMCWQNAEIVPGRHPERWRKDAAGNIVCKRFCNCHGCLCYEYDHILPFSKG 83
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
G+ST++NCQ+LQ VNR K N+ EL ++ L S + +++D++E++ YG+V R +
Sbjct: 84 GESTVDNCQILQTRVNRFKSNKQELDKTQLKGYSCEINFTDKELDIIEMAVYGDVIRPGN 143
Query: 185 SGGCR 189
CR
Sbjct: 144 QCRCR 148
>gi|118481005|gb|ABK92456.1| unknown [Populus trichocarpa]
Length = 180
Score = 140 bits (352), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 61/125 (48%), Positives = 87/125 (69%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PR F VK+ CW+ AE V GR P+RWRKDA GN+V ++ C GCLC++YDHILP+SKG
Sbjct: 29 PRFFDKMVKKMCWQNAEIVPGRHPERWRKDAAGNIVCKRFCNCHGCLCYEYDHILPFSKG 88
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
G+ST++NCQ+LQ VNR K N+ EL ++ L S + +++D++E++ YG+V R +
Sbjct: 89 GESTVDNCQILQTRVNRFKSNKQELDKTQLKGYSCEINFTDKELDIIEMAVYGDVIRPGN 148
Query: 185 SGGCR 189
CR
Sbjct: 149 QCRCR 153
>gi|116783732|gb|ABK23066.1| unknown [Picea sitchensis]
Length = 168
Score = 139 bits (351), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 88/116 (75%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R F ++ CW+KA+ V GR P+RWRKD GN++ R+ C GCLC++YDHI+P+SKGG
Sbjct: 20 RFFDEKMRNLCWQKADVVAGRHPERWRKDGAGNIICRRFGHCEGCLCYEYDHIIPFSKGG 79
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
++T+ENCQ+LQ+ VNRSKGN+ +L+R+ L Q S + + +++D++E++ YGNV R
Sbjct: 80 ETTVENCQILQSRVNRSKGNKDQLNRTVLEQYSCDLKFTDKELDVIEMAVYGNVMR 135
>gi|116780247|gb|ABK21604.1| unknown [Picea sitchensis]
Length = 166
Score = 139 bits (350), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 60/124 (48%), Positives = 90/124 (72%)
Query: 58 SADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
SA + R F ++ CW+KA+ V GR P+RWRKD GN+V R+ C GCLC++YDH
Sbjct: 10 SARNQKRARFFDEKMRNLCWQKADVVPGRHPERWRKDGAGNIVCRRFWHCDGCLCYEYDH 69
Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYG 177
I+P+SKGG++T+ENCQ+LQ+ VNRSKGN+ +L ++ L Q S + + +++D++E++ YG
Sbjct: 70 IIPFSKGGETTVENCQILQSRVNRSKGNKDQLDKTVLEQYSCDLKFTDKELDVIEMAVYG 129
Query: 178 NVRR 181
NV R
Sbjct: 130 NVMR 133
>gi|42572595|ref|NP_974393.1| HNH endonuclease [Arabidopsis thaliana]
gi|332644766|gb|AEE78287.1| HNH endonuclease [Arabidopsis thaliana]
Length = 176
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/137 (56%), Positives = 90/137 (65%), Gaps = 24/137 (17%)
Query: 1 MRPNGEPKRRKSTATRASSPS----------PSRRRGET----TVRSATLLDNELATEEE 46
M+P +P RR++ A +SSPS P+R GE SATLLD EE
Sbjct: 40 MKP--DPTRRRNRALSSSSPSKTRPELSSGSPNRNTGEARGGKVKTSATLLD-----REE 92
Query: 47 ISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG 106
+ LF D PRSFPYSVKQQCWEKAEK++GRDP+RWR+D LGN+VFRKLVG
Sbjct: 93 MG-LFPG--SGYGDPNPEPRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVG 149
Query: 107 CPGCLCHDYDHILPYSK 123
CPGCLCHDYDHI+PYSK
Sbjct: 150 CPGCLCHDYDHIVPYSK 166
>gi|452824505|gb|EME31507.1| endonuclease [Galdieria sulphuraria]
Length = 146
Score = 138 bits (347), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 65/131 (49%), Positives = 85/131 (64%), Gaps = 1/131 (0%)
Query: 59 ADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHI 118
+D N R F VK CWEKA V GRDPDRWR DA N V ++L C GCLCH+YDH+
Sbjct: 5 SDHFTNDRIFSNRVKAACWEKATPVPGRDPDRWRLDAFRNPVCKRLTSCEGCLCHEYDHV 64
Query: 119 LPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGN 178
+PYS+GG ST+ENCQ+LQ +NR K +R +L+ L S S R++DL+E++ YGN
Sbjct: 65 IPYSQGGASTVENCQILQTRINRLKADR-QLTAEQLESFSCEITFSERELDLIEMAVYGN 123
Query: 179 VRRGQDSGGCR 189
V+R C+
Sbjct: 124 VQRDHFRCRCK 134
>gi|168006043|ref|XP_001755719.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693038|gb|EDQ79392.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 180
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 86/116 (74%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R F ++ QCW+KA+ V GR P+RWRKD GN+V R+L C GCLC++YDH+ P+SKGG
Sbjct: 28 RIFDGKMRNQCWQKADVVPGRHPERWRKDVAGNIVCRRLTRCEGCLCYEYDHVRPFSKGG 87
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
++T++NCQ+LQ+ VNR K NR++L+ DL Q S + + +++DL+E+ YGNV R
Sbjct: 88 QTTVDNCQILQSRVNRMKSNRSDLASDDLKQFSCDLKFTDKELDLIEMGVYGNVIR 143
>gi|242039281|ref|XP_002467035.1| hypothetical protein SORBIDRAFT_01g018630 [Sorghum bicolor]
gi|241920889|gb|EER94033.1| hypothetical protein SORBIDRAFT_01g018630 [Sorghum bicolor]
Length = 180
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 58/125 (46%), Positives = 88/125 (70%)
Query: 57 SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
+ A PRSF + CW KA + GR P+RWR DA+GNVV R+ GC GCLC++YD
Sbjct: 22 AQATEKERPRSFDEKTRTACWRKAAVLAGRHPERWRLDAVGNVVCRRFWGCHGCLCYEYD 81
Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAY 176
HI+P+SKGG+ST+ENCQ+LQ VNRSK ++T + ++++ S + + +++D++E++ Y
Sbjct: 82 HIVPFSKGGESTVENCQILQTRVNRSKSDKTWVEQAEMRGFSCDIKFTDQELDIIEMAVY 141
Query: 177 GNVRR 181
G+V R
Sbjct: 142 GDVIR 146
>gi|195651337|gb|ACG45136.1| endonuclease/ nucleic acid binding protein [Zea mays]
Length = 180
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 58/125 (46%), Positives = 88/125 (70%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PRSF + CW KA + GR P+RWR+DA+GNVV R+ GC GCLC++YDHI+P+SKG
Sbjct: 30 PRSFDEKTRTACWRKAAVLAGRHPERWRQDAVGNVVCRRFWGCHGCLCYEYDHIVPFSKG 89
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
G+ST+ENCQ+LQ VNRSK ++ ++++++ S + +++D++E++ YG+V R
Sbjct: 90 GESTVENCQILQTRVNRSKSDKAWVAQAEMQGFSCDIMFTDKELDIIEMAVYGDVIRPGK 149
Query: 185 SGGCR 189
CR
Sbjct: 150 QCRCR 154
>gi|449440818|ref|XP_004138181.1| PREDICTED: uncharacterized protein LOC101204599 [Cucumis sativus]
gi|449477195|ref|XP_004154957.1| PREDICTED: uncharacterized protein LOC101224154 [Cucumis sativus]
Length = 181
Score = 133 bits (334), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 60/125 (48%), Positives = 84/125 (67%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PR F K CW KA+ V GR P+RWRKDA GNVV ++ C GCLC +YDHI+PYSKG
Sbjct: 29 PRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKG 88
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
G+ST +NCQ+LQ VNR K N+ ++ S+L S + + +++D++E++ YG+V R +
Sbjct: 89 GESTADNCQILQTRVNRFKSNKDDVDTSELKGYSCDVKFTDKELDIIEMAVYGDVIRPGN 148
Query: 185 SGGCR 189
CR
Sbjct: 149 QCRCR 153
>gi|218184321|gb|EEC66748.1| hypothetical protein OsI_33113 [Oryza sativa Indica Group]
Length = 178
Score = 133 bits (334), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 59/133 (44%), Positives = 89/133 (66%)
Query: 57 SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
S+A PRSF + CW KA V GR P+RWR+DA GNVV R+ C GCLC++YD
Sbjct: 20 SAAREKERPRSFDEKTRSACWRKAAVVMGRHPERWRQDAAGNVVCRRFWSCHGCLCYEYD 79
Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAY 176
HI+P+SKGG+ST++NCQ+LQ VNRSK ++ + ++++ S + + +++D++E++ Y
Sbjct: 80 HIVPFSKGGESTVDNCQILQTRVNRSKSDKAWIEQAEMQGFSCDIKFTDKELDVIEMAVY 139
Query: 177 GNVRRGQDSGGCR 189
G+V R CR
Sbjct: 140 GDVIRPGKQCRCR 152
>gi|115481558|ref|NP_001064372.1| Os10g0335200 [Oryza sativa Japonica Group]
gi|21671981|gb|AAM74343.1|AC115686_10 Putative HNH endonuclease [Oryza sativa Japonica Group]
gi|23477807|gb|AAN34947.1| Putative HNH endonuclease [Oryza sativa Japonica Group]
gi|31431266|gb|AAP53074.1| HNH endonuclease family protein, expressed [Oryza sativa Japonica
Group]
gi|113638981|dbj|BAF26286.1| Os10g0335200 [Oryza sativa Japonica Group]
gi|125574417|gb|EAZ15701.1| hypothetical protein OsJ_31114 [Oryza sativa Japonica Group]
gi|215693775|dbj|BAG88974.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215707241|dbj|BAG93701.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 178
Score = 133 bits (334), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 59/133 (44%), Positives = 89/133 (66%)
Query: 57 SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
S+A PRSF + CW KA V GR P+RWR+DA GNVV R+ C GCLC++YD
Sbjct: 20 SAAREKERPRSFDEKTRSACWRKAAVVMGRHPERWRQDAAGNVVCRRFWSCHGCLCYEYD 79
Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAY 176
HI+P+SKGG+ST++NCQ+LQ VNRSK ++ + ++++ S + + +++D++E++ Y
Sbjct: 80 HIVPFSKGGESTVDNCQILQTRVNRSKSDKAWIEQAEMQGFSCDIKFTDKELDVIEMAVY 139
Query: 177 GNVRRGQDSGGCR 189
G+V R CR
Sbjct: 140 GDVIRPGKQCRCR 152
>gi|326513564|dbj|BAJ87801.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 175
Score = 132 bits (332), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 55/117 (47%), Positives = 84/117 (71%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PRSF + CW KA + GR P+RWR+DA GNVV R+ C GCLC++YDHI+P+SKG
Sbjct: 29 PRSFDEKTRTACWRKAAVLTGRHPERWRQDAAGNVVCRRFWSCHGCLCYEYDHIVPFSKG 88
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
G+ST+ENCQ+LQ VNRSK ++ + ++++ S + + +++D++E++ YG+V R
Sbjct: 89 GESTVENCQILQTRVNRSKSDKAWVEKAEMQGFSCDIKFTDKELDVIEMAVYGDVIR 145
>gi|229594535|ref|XP_001023127.3| HNH endonuclease family protein [Tetrahymena thermophila]
gi|225566791|gb|EAS02882.3| HNH endonuclease family protein [Tetrahymena thermophila SB210]
Length = 208
Score = 132 bits (331), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 58/115 (50%), Positives = 76/115 (66%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PR F S+K QCW A GR PDRWR DA+GN V R L C G LCH+YDHI+P+SKG
Sbjct: 40 PRCFTKSMKDQCWNNATIFIGRHPDRWRLDAVGNPVLRGLTSCLGPLCHEYDHIVPFSKG 99
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNV 179
GK+ + NCQ+LQ VNR K N+T S +LI SS + + ++ D +E + YG++
Sbjct: 100 GKTEISNCQILQTMVNRYKSNKTNQSHKELIAVSSTNKFTNQEFDSIERAIYGDI 154
>gi|359491684|ref|XP_003634305.1| PREDICTED: uncharacterized protein LOC100852985 [Vitis vinifera]
gi|297733935|emb|CBI15182.3| unnamed protein product [Vitis vinifera]
Length = 167
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 59/132 (44%), Positives = 84/132 (63%)
Query: 58 SADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
S D PR F K CW KA+ V GR PDRWRKDA GN+V ++ C GCLC +YDH
Sbjct: 9 SRDKEERPRFFDPKAKSLCWAKADAVPGRHPDRWRKDAAGNIVCKRFCNCQGCLCFEYDH 68
Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYG 177
I+P+SKGG+S ENCQ+LQ VNR K ++ + ++ L S + + +++D++E++ YG
Sbjct: 69 IIPFSKGGESVPENCQILQTRVNRFKSDKDNVDKTQLQGYSCDVKFTDKELDIIEMAVYG 128
Query: 178 NVRRGQDSGGCR 189
+V R + CR
Sbjct: 129 DVIRPGNQCRCR 140
>gi|302811781|ref|XP_002987579.1| hypothetical protein SELMODRAFT_126371 [Selaginella moellendorffii]
gi|300144733|gb|EFJ11415.1| hypothetical protein SELMODRAFT_126371 [Selaginella moellendorffii]
Length = 174
Score = 131 bits (329), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 83/116 (71%), Gaps = 1/116 (0%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R F +++QCWEKAE V GR P+RWRKD GNVV RK +GC GC+C+++DHI P+SKGG
Sbjct: 21 RLFDEKMRRQCWEKAEVVPGRHPERWRKDGAGNVVCRKFIGCEGCMCYEFDHIRPFSKGG 80
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
++ + NCQ+LQ+ VNR K N+ + L Q S + + R++DL+E++ YGNV R
Sbjct: 81 ETDVSNCQILQSRVNRLKSNK-DADEHLLQQYSCDLKFTARELDLVEMAVYGNVIR 135
>gi|357146087|ref|XP_003573871.1| PREDICTED: uncharacterized protein LOC100831965 [Brachypodium
distachyon]
Length = 180
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 54/127 (42%), Positives = 86/127 (67%)
Query: 55 RISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
R+ PRSF + CW KA + GR P+RWR+D GN+V R+ C GCLC++
Sbjct: 20 RVPKPKEKERPRSFDEKTRAACWRKAAVLAGRHPERWRQDTAGNIVCRRFWSCHGCLCYE 79
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELS 174
YDHI+P+SKGG+ST+ENCQ+LQ VNRSK +++ + ++++ S + + +++D++E++
Sbjct: 80 YDHIVPFSKGGESTVENCQILQTRVNRSKSDKSWVEKAEMQGASCDIKFTDKELDVIEMA 139
Query: 175 AYGNVRR 181
YG+V R
Sbjct: 140 VYGDVIR 146
>gi|302822371|ref|XP_002992844.1| hypothetical protein SELMODRAFT_136010 [Selaginella moellendorffii]
gi|300139392|gb|EFJ06134.1| hypothetical protein SELMODRAFT_136010 [Selaginella moellendorffii]
Length = 174
Score = 130 bits (326), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 83/116 (71%), Gaps = 1/116 (0%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R F +++QCW+KAE V GR P+RWRKD GNVV RK +GC GC+C+++DHI P+SKGG
Sbjct: 21 RLFDEKMRRQCWQKAEVVPGRHPERWRKDGAGNVVCRKFIGCEGCMCYEFDHIRPFSKGG 80
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
++ + NCQ+LQ+ VNR K N+ + L Q S + + R++DL+E++ YGNV R
Sbjct: 81 ETDVSNCQILQSRVNRLKSNK-DADEHLLQQYSCDLKFTARELDLVEMAVYGNVIR 135
>gi|351722082|ref|NP_001235440.1| uncharacterized protein LOC100527904 [Glycine max]
gi|255633516|gb|ACU17116.1| unknown [Glycine max]
Length = 162
Score = 129 bits (325), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 58/133 (43%), Positives = 84/133 (63%)
Query: 57 SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
S D PR F + K CW KA+ V GR P+RWRKDA GNVV ++ C GCLC++YD
Sbjct: 9 SRGDGEKRPRFFDSNAKAICWSKADTVPGRHPERWRKDAAGNVVCKRFFNCIGCLCYEYD 68
Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAY 176
HI+P+SKGG+ST +NCQ+LQ+ VNR K ++ + L S + +++D++E++ Y
Sbjct: 69 HIIPFSKGGESTADNCQILQSRVNRLKSDKYNIDSDQLKDYSCEVNFTDKELDIIEMAVY 128
Query: 177 GNVRRGQDSGGCR 189
+V R + CR
Sbjct: 129 SDVMRPGNQCRCR 141
>gi|255547311|ref|XP_002514713.1| endonuclease, putative [Ricinus communis]
gi|223546317|gb|EEF47819.1| endonuclease, putative [Ricinus communis]
Length = 173
Score = 125 bits (315), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 79/125 (63%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PR F K CW AE V GR P+RWRKDA GN+V ++L C GCLC YDHI+P+SKG
Sbjct: 22 PRFFSNKAKNICWANAETVPGRHPERWRKDAAGNIVCKRLGNCQGCLCFQYDHIIPFSKG 81
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
G+ST NCQ+LQ VN K + + +S L S + + +++D++E++ YG+V R +
Sbjct: 82 GESTAANCQILQTRVNNYKADAEAVDKSQLKGYSCDLKFTDKELDIIEMAVYGDVIRPGN 141
Query: 185 SGGCR 189
CR
Sbjct: 142 QCRCR 146
>gi|414865043|tpg|DAA43600.1| TPA: hypothetical protein ZEAMMB73_851898 [Zea mays]
Length = 141
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 52/70 (74%), Positives = 61/70 (87%)
Query: 54 LRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH 113
L + S + PRSF Y+VKQQCWEKAE++ GRDP+RWR+DALGN+VFRKLVGCPGCLCH
Sbjct: 51 LLTAPPPSASEPRSFSYAVKQQCWEKAERLPGRDPERWRRDALGNIVFRKLVGCPGCLCH 110
Query: 114 DYDHILPYSK 123
DYDHI+PYSK
Sbjct: 111 DYDHIVPYSK 120
>gi|297850254|ref|XP_002893008.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297338850|gb|EFH69267.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 179
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 77/117 (65%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PR F K +CW A+ V GR P+RWRKDA GN+V ++ C GCLC +YDHI+PYSKG
Sbjct: 33 PRFFDGKAKNKCWANADTVPGRHPERWRKDAAGNIVCKRFGNCNGCLCFEYDHIVPYSKG 92
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
G+S ENCQ+LQ VNR K + + + L S + + +++D++E++ YG+V R
Sbjct: 93 GESIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDIIEMAVYGDVLR 149
>gi|15221835|ref|NP_173303.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana]
gi|6730716|gb|AAF27111.1|AC011809_20 Unknown protein [Arabidopsis thaliana]
gi|26451201|dbj|BAC42704.1| unknown protein [Arabidopsis thaliana]
gi|28973585|gb|AAO64117.1| unknown protein [Arabidopsis thaliana]
gi|332191624|gb|AEE29745.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana]
Length = 186
Score = 123 bits (308), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 76/117 (64%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PR F K +CW A+ V GR P+RWRKD GN+V ++ C GCLC +YDHI+PYSKG
Sbjct: 40 PRFFDGKAKNKCWANADIVPGRHPERWRKDVAGNIVCKRFGNCNGCLCFEYDHIVPYSKG 99
Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
G+S ENCQ+LQ VNR K + + + L S + + +++D++E++ YG+V R
Sbjct: 100 GESIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDVIEMAVYGDVLR 156
>gi|452821821|gb|EME28847.1| endonuclease isoform 1 [Galdieria sulphuraria]
Length = 199
Score = 119 bits (297), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 58/123 (47%), Positives = 78/123 (63%), Gaps = 9/123 (7%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
+ F + K+ CW KA+ VRGRDP RWR+D GNV+ +KL GC G C D+DH++P SKGG
Sbjct: 59 KQFQHIDKEACWLKADAVRGRDPSRWRRDLAGNVICKKLRGCRGPFCMDFDHVVPLSKGG 118
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSG-------RDMDLLELSAYGN 178
+S L+NCQVLQ T NR KG+R +D+ K Y +S R +D++E YG+
Sbjct: 119 ESVLDNCQVLQTTANRHKGDRLVEPTTDI--KKYYKEISDRRVLPLQRHLDIVEFYVYGD 176
Query: 179 VRR 181
V R
Sbjct: 177 VIR 179
>gi|452821820|gb|EME28846.1| endonuclease isoform 2 [Galdieria sulphuraria]
Length = 190
Score = 116 bits (291), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 75/119 (63%), Gaps = 9/119 (7%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
+ F + K+ CW KA+ VRGRDP RWR+D GNV+ +KL GC G C D+DH++P SKGG
Sbjct: 59 KQFQHIDKEACWLKADAVRGRDPSRWRRDLAGNVICKKLRGCRGPFCMDFDHVVPLSKGG 118
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSG-------RDMDLLELSAYG 177
+S L+NCQVLQ T NR KG+R +D+ K Y +S R +D++E YG
Sbjct: 119 ESVLDNCQVLQTTANRHKGDRLVEPTTDI--KKYYKEISDRRVLPLQRHLDIVEFYVYG 175
>gi|449015785|dbj|BAM79187.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 231
Score = 115 bits (288), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 76/119 (63%), Gaps = 3/119 (2%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
RSF + ++ WE+A+ V GRDP+RWR D GN V R L GC GC C+++DH +PYS GG
Sbjct: 26 RSFTEAQRKAAWERAKPVPGRDPERWRYDVYGNPVLRPLHGCLGCFCYEFDHRVPYSLGG 85
Query: 126 KSTLENCQVLQATVNRSKGNRTE---LSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
++L+NC +LQ +NR K N E +SR + Q + + DMD +E + YG+VRR
Sbjct: 86 ATSLDNCDILQTRINRLKSNALEGVNISREQIRQYACEDVSNVIDMDAIEFAVYGDVRR 144
>gi|145521759|ref|XP_001446729.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414218|emb|CAK79332.1| unnamed protein product [Paramecium tetraurelia]
Length = 139
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 74/117 (63%)
Query: 61 STANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILP 120
S NPR F +++CW A+ + GR+PDRWR DA+GN V + L C G LC+ +DHI+P
Sbjct: 23 SPYNPRRFDKLAREECWNNAQIMEGRNPDRWRLDAVGNPVCKALNACRGPLCYQFDHIVP 82
Query: 121 YSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYG 177
YSKGG+S +NCQ+LQ+ VN K N+ +S+ +L Q S + D +E + YG
Sbjct: 83 YSKGGESISKNCQLLQSFVNIYKSNKENISKDELKQVSPQLNFTSEVFDAIEYAIYG 139
>gi|363814422|ref|NP_001242848.1| uncharacterized protein LOC100775923 [Glycine max]
gi|255641881|gb|ACU21209.1| unknown [Glycine max]
Length = 121
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 50/98 (51%), Positives = 66/98 (67%)
Query: 57 SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
S D PR F + K CW KA+ V GR P+RWRKDA GNVV ++ C GCLC++YD
Sbjct: 9 SRGDGEKRPRFFDSNAKAICWGKADTVPGRHPERWRKDAAGNVVCKRFFNCIGCLCYEYD 68
Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
HI+P+SKGG+ST +NCQ+LQ+ VNR K ++ + L
Sbjct: 69 HIIPFSKGGESTADNCQILQSRVNRFKSDKYNIDSEQL 106
>gi|297733966|emb|CBI15213.3| unnamed protein product [Vitis vinifera]
Length = 99
Score = 109 bits (272), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 51/55 (92%), Positives = 54/55 (98%)
Query: 137 ATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
ATVNRSKGNRTE+S++DLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGC IQ
Sbjct: 45 ATVNRSKGNRTEISKADLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCTIQ 99
>gi|147866359|emb|CAN79860.1| hypothetical protein VITISV_035905 [Vitis vinifera]
Length = 222
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 51/55 (92%), Positives = 54/55 (98%)
Query: 135 LQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCR 189
LQATVNRSKGNRTE+S++DLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGG R
Sbjct: 4 LQATVNRSKGNRTEISKADLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGSR 58
>gi|145512195|ref|XP_001442014.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409286|emb|CAK74617.1| unnamed protein product [Paramecium tetraurelia]
Length = 167
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%)
Query: 61 STANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILP 120
S NPR F +++C A+ + GR+PDRWR DA+GN V + L C G LC+ +DHI+P
Sbjct: 51 SPYNPRRFDKLAREECRNNAQIMEGRNPDRWRLDAVGNPVCKALNSCRGSLCYQFDHIVP 110
Query: 121 YSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYG 177
YSKGG+S NCQ+LQ+ VN K N+ +S+ +L S + + D +E + YG
Sbjct: 111 YSKGGESISINCQLLQSFVNIYKSNKENISKDELKHVSPQLKFTPVQFDAIEYAIYG 167
>gi|357113854|ref|XP_003558716.1| PREDICTED: uncharacterized protein LOC100824123 [Brachypodium
distachyon]
Length = 246
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 70/118 (59%), Gaps = 1/118 (0%)
Query: 64 NPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSK 123
N FP +V++ CW AE+V GR P RWRKD GNV+ +K G YDHI P S+
Sbjct: 67 NEEKFPRAVRKWCWINAEEVPGRHPGRWRKDPYGNVLCKKQTTTLGLSYFHYDHIHPRSR 126
Query: 124 GGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
GGK ENCQ+L A +N K N + +R +++ S + S ++DL+E + YGNV R
Sbjct: 127 GGKGVKENCQLLLAYLNIFKSNDIK-TREEMVDNSYHITYSNEELDLIECNVYGNVVR 183
>gi|340502170|gb|EGR28883.1| hypothetical protein IMG5_167370 [Ichthyophthirius multifiliis]
Length = 121
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 63/96 (65%)
Query: 85 GRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKG 144
GR P+RWR DA+GN V + L C G LC+ YDHI+PYSKGG++ +ENCQ+LQ+ VN+ K
Sbjct: 3 GRHPERWRLDAVGNPVIKSLTSCNGPLCYQYDHIVPYSKGGETQIENCQLLQSIVNKHKS 62
Query: 145 NRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVR 180
N+ + + S+L S + + D +E + YG +
Sbjct: 63 NKIDATYSELKNVSPKDDFTNFEYDCIEKAIYGTYQ 98
>gi|197306242|gb|ACH59472.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
Length = 84
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 41/79 (51%), Positives = 56/79 (70%), Gaps = 3/79 (3%)
Query: 57 SSADSTANP---RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH 113
SS D+ +N R F ++ CW+KA+ V GR P+RWRKD GN+V R+ C GCLC+
Sbjct: 6 SSEDNPSNQKKVRFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCY 65
Query: 114 DYDHILPYSKGGKSTLENC 132
+YDHI+P+SKGG +T+ENC
Sbjct: 66 EYDHIIPFSKGGGTTVENC 84
>gi|449018629|dbj|BAM82031.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 188
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 72/121 (59%), Gaps = 6/121 (4%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R F ++ + CW A+ V GR P RWR D +GN+V R L GC G LC++YDH P+ KGG
Sbjct: 27 RLFSHTERNICWRNAQVVPGRHPRRWRFDRVGNLVCRALHGCNGPLCYEYDHRAPWGKGG 86
Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGR-----DMDLLELSAYGNVR 180
+S NC VLQ + NR K ++ E ++ + + ++ RV R ++D +E YG++
Sbjct: 87 RSDSTNCDVLQTSANRWKSDQPESRVTEKLLR-AHSRVLSRSLSQEELDTIEYVIYGDIL 145
Query: 181 R 181
R
Sbjct: 146 R 146
>gi|197306216|gb|ACH59459.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306218|gb|ACH59460.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306220|gb|ACH59461.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306222|gb|ACH59462.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306224|gb|ACH59463.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306226|gb|ACH59464.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306228|gb|ACH59465.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306230|gb|ACH59466.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306232|gb|ACH59467.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306234|gb|ACH59468.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306238|gb|ACH59470.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306244|gb|ACH59473.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306246|gb|ACH59474.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306248|gb|ACH59475.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306250|gb|ACH59476.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306252|gb|ACH59477.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306254|gb|ACH59478.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306256|gb|ACH59479.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306258|gb|ACH59480.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306262|gb|ACH59482.1| HNH endonuclease domain-containing protein [Pseudotsuga macrocarpa]
Length = 84
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 37/67 (55%), Positives = 50/67 (74%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R F ++ CW+KA+ V GR P+RWRKD GN+V R+ C GCLC++YDHI+P+SKGG
Sbjct: 18 RFFDEKMRNLCWQKADVVAGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIPFSKGG 77
Query: 126 KSTLENC 132
+T+ENC
Sbjct: 78 GTTVENC 84
>gi|197306240|gb|ACH59471.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
gi|197306260|gb|ACH59481.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
Length = 84
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 37/67 (55%), Positives = 50/67 (74%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R F ++ CW+KA+ V GR P+RWRKD GN+V R+ C GCLC++YDHI+P+SKGG
Sbjct: 18 RFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIPFSKGG 77
Query: 126 KSTLENC 132
+T+ENC
Sbjct: 78 GTTVENC 84
>gi|197306236|gb|ACH59469.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
Length = 84
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 37/67 (55%), Positives = 50/67 (74%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
R F ++ CW+KA+ V GR P+RWRKD GN+V R+ C GCLC++YDHI+P+SKGG
Sbjct: 18 RFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIPFSKGG 77
Query: 126 KSTLENC 132
+T+ENC
Sbjct: 78 GTTVENC 84
>gi|226504958|ref|NP_001144283.1| uncharacterized protein LOC100277163 [Zea mays]
gi|195639630|gb|ACG39283.1| hypothetical protein [Zea mays]
Length = 139
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 34/59 (57%), Positives = 44/59 (74%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSK 123
PRSF + CW KA + GR P+RWR+DA+GNVV R+ GC GCLC++YDHI+P+SK
Sbjct: 30 PRSFDEKTRTACWRKAAVLSGRHPERWRQDAVGNVVCRRFWGCHGCLCYEYDHIVPFSK 88
>gi|38605769|emb|CAE05870.3| OSJNBa0044K18.12 [Oryza sativa Japonica Group]
Length = 204
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 61/130 (46%), Gaps = 16/130 (12%)
Query: 59 ADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHI 118
A + A PRSF V++ CW A KV GR DRWR DA GN V + L G
Sbjct: 17 ASAPAKPRSFSADVREACWNNAPKVIGRSGDRWRLDANGNPVLKALNG------------ 64
Query: 119 LPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGN 178
GG + NCQ++Q NRSK NR L + + S + +D +E++ Y +
Sbjct: 65 ---KHGGPNDQSNCQIIQTGANRSKSNRI-LRKDQMKGLSRTMNWTHPHLDTIEIAVYAS 120
Query: 179 VRRGQDSGGC 188
V R S C
Sbjct: 121 VERDDLSCTC 130
>gi|125548724|gb|EAY94546.1| hypothetical protein OsI_16320 [Oryza sativa Indica Group]
Length = 232
Score = 68.9 bits (167), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 60/130 (46%), Gaps = 16/130 (12%)
Query: 59 ADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHI 118
A + A PRSF V++ CW A KV GR DR R DA GN V + L G
Sbjct: 17 ASAPAKPRSFSADVREACWNNAPKVIGRSGDRCRLDANGNPVLKALNG------------ 64
Query: 119 LPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGN 178
GG + NCQ++Q NRSK NR L + + S + +D +E++ Y +
Sbjct: 65 ---KHGGPNDQSNCQIIQTGANRSKSNRI-LRKDQMKGLSRTMNWTHPHLDTIEIAVYAS 120
Query: 179 VRRGQDSGGC 188
V R S C
Sbjct: 121 VERDDLSCTC 130
>gi|115345574|ref|YP_771756.1| hypothetical protein RD1_A0048 [Roseobacter denitrificans OCh 114]
gi|115292895|gb|ABI93348.1| OSJNBa0044K18.12 [Roseobacter denitrificans OCh 114]
Length = 96
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 68 FPYSVK---QQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
PYS Q W KA V+G++P+ +R+DA GNV+++ + DH+ P SKG
Sbjct: 1 MPYSTNPKAQHVWNKARPVKGKNPNLYRRDAQGNVIYKPAYNRDSAMGWQVDHVWPRSKG 60
Query: 125 GKSTLENCQVLQATVNRSKGNRT 147
G +N Q LQ N+ K N+T
Sbjct: 61 GSDARKNLQALQTGANKRKSNKT 83
>gi|222629057|gb|EEE61189.1| hypothetical protein OsJ_15190 [Oryza sativa Japonica Group]
Length = 118
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 44/88 (50%), Gaps = 15/88 (17%)
Query: 59 ADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHI 118
A + A PRSF V++ CW A KV GR DRWR DA GN V + L G
Sbjct: 17 ASAPAKPRSFSADVREACWNNAPKVIGRSGDRWRLDANGNPVLKALNG------------ 64
Query: 119 LPYSKGGKSTLENCQVLQATVNRSKGNR 146
GG + NCQ++Q NRSK NR
Sbjct: 65 ---KHGGPNDQSNCQIIQTGANRSKSNR 89
>gi|413934346|gb|AFW68897.1| hypothetical protein ZEAMMB73_497763 [Zea mays]
Length = 132
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 45/68 (66%)
Query: 122 SKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
S GG+ST+ENCQ+LQ VNRSK ++ ++++++ S + +++D++E++ YG+V R
Sbjct: 39 STGGESTVENCQILQTRVNRSKSDKAWVAQAEMQGFSCDIMFTDKELDIIEMAVYGDVIR 98
Query: 182 GQDSGGCR 189
CR
Sbjct: 99 PGKQCRCR 106
>gi|403340432|gb|EJY69501.1| hypothetical protein OXYTRI_09762 [Oxytricha trifallax]
Length = 153
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 38/79 (48%)
Query: 33 SATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWR 92
S L + L T + ++R D AN R F K CW+++ + RDP RWR
Sbjct: 28 SPLLQNTSLDTAIQDVVFDKNIRKFVEDEKANNRIFTRQQKDVCWQQSTTIPIRDPKRWR 87
Query: 93 KDALGNVVFRKLVGCPGCL 111
D LGN VF L GC G L
Sbjct: 88 LDPLGNPVFYTLRGCHGAL 106
>gi|222617839|gb|EEE53971.1| hypothetical protein OsJ_00588 [Oryza sativa Japonica Group]
Length = 1000
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 7/95 (7%)
Query: 57 SSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
+ A TANPR F W++ + + G DP +R DA GNV++ L D
Sbjct: 240 AEAAPTANPRGF----ANLLWKQLDHLGNAGFDPALFRVDAYGNVLYLH-ADSASPLAWD 294
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
DH P ++GG++ N +++QA V R K N+ E
Sbjct: 295 IDHWFPCARGGRTVPSNLRIVQAQVCRKKHNKLEF 329
>gi|115434848|ref|NP_001042182.1| Os01g0176500 [Oryza sativa Japonica Group]
gi|55295944|dbj|BAD67812.1| unknown protein [Oryza sativa Japonica Group]
gi|113531713|dbj|BAF04096.1| Os01g0176500 [Oryza sativa Japonica Group]
Length = 832
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 3/93 (3%)
Query: 59 ADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
AD+ A P + P W++ + + G DP +R DA GNV++ L D D
Sbjct: 70 ADAEAAPTANPRGFANLLWKQLDHLGNAGFDPALFRVDAYGNVLYLH-ADSASPLAWDID 128
Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
H P ++GG++ N +++QA V R K N+ E
Sbjct: 129 HWFPCARGGRTVPSNLRIVQAQVCRKKHNKLEF 161
>gi|242051879|ref|XP_002455085.1| hypothetical protein SORBIDRAFT_03g004110 [Sorghum bicolor]
gi|241927060|gb|EES00205.1| hypothetical protein SORBIDRAFT_03g004110 [Sorghum bicolor]
Length = 819
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 7/95 (7%)
Query: 57 SSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
+ A TANPR F W++ + + G DP +R DA GNV++ L D
Sbjct: 70 AEAPPTANPRGF----ANLLWKQLDHLGNAGFDPALFRVDAYGNVLYLH-ADSASPLAWD 124
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
DH P ++GGK+ N +++Q V R K N+ E
Sbjct: 125 IDHWFPCARGGKTVPSNLRIVQWQVCRKKQNKLEF 159
>gi|168059079|ref|XP_001781532.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667011|gb|EDQ53651.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 189
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 56/112 (50%), Gaps = 7/112 (6%)
Query: 40 ELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALG 97
+ AT ++ LF + + D N R ++ W++ + + G DP ++R D+ G
Sbjct: 54 DFATLKKWDGLFPTVDDTERDPDLNARKH----AEELWQQLDHLGNAGFDPAKFRVDSYG 109
Query: 98 NVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
NVV+ L + DH P+S+GGK+ + N +VLQ ++ K +R E
Sbjct: 110 NVVYWN-ADPSSPLAWEIDHWFPHSRGGKTVVSNLRVLQWQASQRKKSRLEF 160
>gi|410097764|ref|ZP_11292745.1| hypothetical protein HMPREF1076_01923 [Parabacteroides goldsteinii
CL02T12C30]
gi|410101011|ref|ZP_11295965.1| hypothetical protein HMPREF1076_05143 [Parabacteroides goldsteinii
CL02T12C30]
gi|410101180|ref|ZP_11296130.1| hypothetical protein HMPREF1076_05308 [Parabacteroides goldsteinii
CL02T12C30]
gi|409213077|gb|EKN06106.1| hypothetical protein HMPREF1076_05308 [Parabacteroides goldsteinii
CL02T12C30]
gi|409213657|gb|EKN06673.1| hypothetical protein HMPREF1076_05143 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223854|gb|EKN16789.1| hypothetical protein HMPREF1076_01923 [Parabacteroides goldsteinii
CL02T12C30]
Length = 83
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 37/67 (55%)
Query: 77 WEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQ 136
W+K VRG+DP+ +RKD GN+++ G L + DH P +KGG + L N Q +
Sbjct: 13 WQKGLIVRGKDPNLYRKDCYGNLMYWYSYGKFTKLGWNVDHSKPIAKGGTNHLNNLQPMN 72
Query: 137 ATVNRSK 143
N SK
Sbjct: 73 CFANSSK 79
>gi|357127491|ref|XP_003565413.1| PREDICTED: uncharacterized protein LOC100821388 [Brachypodium
distachyon]
Length = 815
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 45/92 (48%), Gaps = 3/92 (3%)
Query: 60 DSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
D+ A P + P W++ + + G DP +R DA GNV++ L D H
Sbjct: 70 DAEALPAANPRGFASLLWKQLDHLGNAGFDPALFRVDAYGNVLYLH-ADSASPLAWDVHH 128
Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
P ++GGK+ N +V+QA V R K N+ E
Sbjct: 129 WFPCARGGKTVPSNLRVVQAQVCRKKQNKLEF 160
>gi|168027278|ref|XP_001766157.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682589|gb|EDQ69006.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 190
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 3/78 (3%)
Query: 74 QQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLEN 131
++ WE+ + + G DP ++R DA GNV++ L + DH P+S+GG++ N
Sbjct: 86 EELWEQLDHLGNAGFDPAKFRVDAYGNVLYWG-ADPSSPLSWEVDHWFPHSRGGRTVPSN 144
Query: 132 CQVLQATVNRSKGNRTEL 149
Q++Q + K NR E
Sbjct: 145 LQIVQWQARQRKKNRLEF 162
>gi|168033708|ref|XP_001769356.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679276|gb|EDQ65725.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 190
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 42/82 (51%), Gaps = 3/82 (3%)
Query: 70 YSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKS 127
+ ++ WE+ + + G DP ++R DA GNV++ L + DH P+S+GG +
Sbjct: 82 HEFAEELWEQLDHLGNAGFDPAKFRVDAYGNVLYWS-ADPSSPLSWEVDHWFPHSRGGLT 140
Query: 128 TLENCQVLQATVNRSKGNRTEL 149
N Q++Q + K NR E
Sbjct: 141 APSNLQIVQWQARQRKKNRLEF 162
>gi|260655767|ref|ZP_05861236.1| putative HNH endonuclease domain protein [Jonquetella anthropi
E3_33 E1]
gi|260629383|gb|EEX47577.1| putative HNH endonuclease domain protein [Jonquetella anthropi
E3_33 E1]
Length = 384
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 18/146 (12%)
Query: 11 KSTATRASSPSPSRRRGETTVRSATLLDNEL--ATEEEISTLFTDLRISSADSTANPRS- 67
+S + P R+ + V LL N+ +TE + L ++I D +S
Sbjct: 247 QSVINWVKTLFPVHRKEQQRVEWG-LLYNQYGNSTEFNANDLEKQIKILMQDEDVTKKSG 305
Query: 68 -FPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH------DYDHILP 120
+PY + ++ E+ +R DR ++A R+ CP C H + DHI P
Sbjct: 306 IYPYLLTKE--ERYLSIRAF-TDRMIREAYE----RQSGICPACNKHFEINEMEADHITP 358
Query: 121 YSKGGKSTLENCQVLQATVNRSKGNR 146
+SKGG++ ENCQ+L T NR K N+
Sbjct: 359 WSKGGRTIAENCQMLCKTCNRKKSNK 384
>gi|288802822|ref|ZP_06408259.1| conserved hypothetical protein [Prevotella melaninogenica D18]
gi|288334639|gb|EFC73077.1| conserved hypothetical protein [Prevotella melaninogenica D18]
Length = 119
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Query: 73 KQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGC-PGCLCHDYDHILPYSKGGKSTLEN 131
K+ CW A+ V G D D RKDA G +F+ G + DH+ P GG N
Sbjct: 7 KEYCWNHAQIVEGYDKDSIRKDACGAWIFKAHYGMRDSVFGWEVDHVFPVILGGDDFKRN 66
Query: 132 CQVLQATVNRSKGN 145
+ +Q N SKG+
Sbjct: 67 LRAMQWKNNVSKGD 80
>gi|168040184|ref|XP_001772575.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676130|gb|EDQ62617.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 212
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 3/80 (3%)
Query: 72 VKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL 129
++ W++ + + G DP ++R D GNVV+ L D DH P S+GGK+ L
Sbjct: 80 FAEELWQQLDHLGNAGFDPAKFRVDPYGNVVYWN-ADPSSPLAWDIDHWFPRSRGGKTKL 138
Query: 130 ENCQVLQATVNRSKGNRTEL 149
N +++Q K NR E
Sbjct: 139 PNLRIVQWQAYLRKRNRLEF 158
>gi|168054230|ref|XP_001779535.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162669016|gb|EDQ55611.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 190
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 3/82 (3%)
Query: 70 YSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKS 127
+ ++ WE+ + + G DP ++R DA GNV++ L + DH P+ +GG++
Sbjct: 82 HEFAEEIWEQLDHLGDAGFDPAKFRVDAYGNVLYWS-ADPSSPLSWEVDHWFPHLRGGRT 140
Query: 128 TLENCQVLQATVNRSKGNRTEL 149
N Q++Q + K NR E
Sbjct: 141 VPSNLQIVQWQARQRKKNRLEF 162
>gi|302805536|ref|XP_002984519.1| hypothetical protein SELMODRAFT_234547 [Selaginella moellendorffii]
gi|300147907|gb|EFJ14569.1| hypothetical protein SELMODRAFT_234547 [Selaginella moellendorffii]
Length = 204
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 53/112 (47%), Gaps = 6/112 (5%)
Query: 40 ELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALG 97
E+A EE+ LF I +D+ S + K W++ + + G DP +R D+ G
Sbjct: 69 EVAKLEEVQDLFP---IVLSDTRKEHPSHLHKFKDVLWQQLDHLGNAGFDPACFRVDSYG 125
Query: 98 NVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
NV++ L + H PYS+GGK+ L N ++ Q V+ K + E
Sbjct: 126 NVLYWH-ADPASPLAWEVCHWFPYSRGGKTVLNNLRLAQWQVSLKKKEKLEF 176
>gi|168018025|ref|XP_001761547.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687231|gb|EDQ73615.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 223
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
Query: 72 VKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL 129
++ W + + + G DP+++R D GNVV L D DH P S+GGK+ L
Sbjct: 81 FAEELWRQLDHLGNAGFDPEKFRVDPYGNVVHWN-ADPSSPLAWDIDHWFPQSRGGKTEL 139
Query: 130 ENCQVLQATVNRSKGNRTEL 149
N +++Q K NR E
Sbjct: 140 PNLRIVQWQAYARKRNRLEF 159
>gi|148727130|ref|YP_001285624.1| gp48 [Propionibacterium phage PA6]
gi|91982954|gb|ABE68617.1| gp48 [Propionibacterium phage PA6]
Length = 100
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR+ + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 14 RWRQRLITQAQQQGQTECPLCGVTITWDTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|302782555|ref|XP_002973051.1| hypothetical protein SELMODRAFT_98470 [Selaginella moellendorffii]
gi|300159652|gb|EFJ26272.1| hypothetical protein SELMODRAFT_98470 [Selaginella moellendorffii]
Length = 204
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 53/112 (47%), Gaps = 6/112 (5%)
Query: 40 ELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALG 97
E+A E++ LF I +DS S + K W++ + + G DP +R D+ G
Sbjct: 69 EVAKLEQVQDLFP---IVLSDSRKEHPSHLHKFKDVLWQQLDHLGNAGFDPACFRVDSYG 125
Query: 98 NVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
NV++ L + H PYS+GGK+ L N ++ Q V+ K + E
Sbjct: 126 NVLYWH-ADPASPLAWEVCHWFPYSRGGKTVLNNLRLAQWQVSLKKKEKLEF 176
>gi|410491223|ref|YP_006906446.1| hypothetical protein P100A_45 [Propionibacterium phage P100_A]
gi|407302224|gb|AFT97776.1| hypothetical protein P100A_45 [Propionibacterium phage P100_A]
Length = 100
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR+ + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 14 RWRQRLITQAKQQGQTECPLCGVTITWDTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|410491718|ref|YP_006906942.1| hypothetical protein ATCC29399BT_46 [Propionibacterium phage
ATCC29399B_T]
gi|410507509|ref|YP_006907129.1| hypothetical protein ATCC29399BC_46 [Propionibacterium phage
ATCC29399B_C]
gi|407302364|gb|AFT97913.1| hypothetical protein ATCC29399BT_46 [Propionibacterium phage
ATCC29399B_T]
gi|407302411|gb|AFT97959.1| hypothetical protein ATCC29399BC_46 [Propionibacterium phage
ATCC29399B_C]
Length = 100
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR+ + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 14 RWRQRLITQARQQGQTECPLCGVTITWDTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|410491362|ref|YP_006906587.1| hypothetical protein P101A_47 [Propionibacterium phage P101A]
gi|407302036|gb|AFT97592.1| hypothetical protein P101A_47 [Propionibacterium phage P101A]
Length = 100
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR+ + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 14 RWRQRLITQARQQGQTECPLCGAQIAWGTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|410491407|ref|YP_006906632.1| HNH endonuclease [Propionibacterium phage P1.1]
gi|407302316|gb|AFT97866.1| HNH endonuclease [Propionibacterium phage P1.1]
Length = 101
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR+ + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 15 RWRQRLITQARQQGQTECPLCGATITWNTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 74
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 75 TCNRSKGNRSE 85
>gi|257439996|ref|ZP_05615751.1| putative HNH endonuclease domain protein [Faecalibacterium
prausnitzii A2-165]
gi|257197603|gb|EEU95887.1| HNH endonuclease domain protein [Faecalibacterium prausnitzii
A2-165]
Length = 367
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 27/45 (60%), Gaps = 6/45 (13%)
Query: 107 CPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGN 145
CP C H DHI P+SKGGK+T ENCQ+L A NR K N
Sbjct: 322 CPKCGKHFEIEEMQADHITPWSKGGKTTAENCQMLCADCNRRKSN 366
>gi|410490847|ref|YP_006906071.1| HNH endonuclease [Propionibacterium phage P9.1]
gi|422385762|ref|ZP_16465888.1| putative HNH endonuclease domain protein [Propionibacterium acnes
HL096PA3]
gi|327327949|gb|EGE69721.1| putative HNH endonuclease domain protein [Propionibacterium acnes
HL096PA3]
gi|407301941|gb|AFT97499.1| HNH endonuclease [Propionibacterium phage P9.1]
Length = 100
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 14 RWRARLITQAQQQGQTECPLCGAQIAWGTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|410491317|ref|YP_006906540.1| hypothetical protein P1001_47 [Propionibacterium phage P100_1]
gi|407302272|gb|AFT97823.1| hypothetical protein P1001_47 [Propionibacterium phage P100_1]
Length = 100
Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 14 RWRARLITQAQQQGQTECPLCGATITWNTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|284097476|ref|ZP_06385569.1| HNH endonuclease [Candidatus Poribacteria sp. WGA-A3]
gi|283831012|gb|EFC35029.1| HNH endonuclease [Candidatus Poribacteria sp. WGA-A3]
Length = 171
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 61/146 (41%), Gaps = 27/146 (18%)
Query: 7 PKRRKSTATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPR 66
PK+R S + A P + A L E E++ L D D T P
Sbjct: 45 PKKRASMKSVAWGPLYDEHHERDDLDPARL-------EAEVARLHKD-----DDVTKKPG 92
Query: 67 SFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH------DYDHIL 119
+PY + + EK +R P + + + G CP C H + DHI
Sbjct: 93 IYPYVLTRD--EKYLNIRAFTP------TMRAAAYERQEGICPRCGEHFVLGAMEADHID 144
Query: 120 PYSKGGKSTLENCQVLQATVNRSKGN 145
P+S+GGK+ ENCQ+L NR KG+
Sbjct: 145 PWSEGGKTAPENCQMLCRPCNRRKGS 170
>gi|373455654|ref|ZP_09547483.1| hypothetical protein HMPREF9453_01652 [Dialister succinatiphilus
YIT 11850]
gi|371934747|gb|EHO62527.1| hypothetical protein HMPREF9453_01652 [Dialister succinatiphilus
YIT 11850]
Length = 148
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 26/35 (74%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
D DHI+P S+GG ST++N Q L NRSKGNRT+
Sbjct: 80 DVDHIVPQSRGGGSTMDNLQYLCPHCNRSKGNRTD 114
>gi|410491593|ref|YP_006906817.1| hypothetical protein P104A_45 [Propionibacterium phage P104A]
gi|407302083|gb|AFT97638.1| hypothetical protein P104A_45 [Propionibacterium phage P104A]
Length = 100
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 14 RWRARLITQARQQGQTECPLCGATITWNTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|423220008|ref|ZP_17206504.1| hypothetical protein HMPREF1061_03277 [Bacteroides caccae
CL03T12C61]
gi|392624271|gb|EIY18364.1| hypothetical protein HMPREF1061_03277 [Bacteroides caccae
CL03T12C61]
Length = 390
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 26/33 (78%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG +++ENC++L T NRSKGNR
Sbjct: 358 DADHVTAWSKGGATSMENCEMLCKTHNRSKGNR 390
>gi|153807020|ref|ZP_01959688.1| hypothetical protein BACCAC_01297 [Bacteroides caccae ATCC 43185]
gi|149130140|gb|EDM21350.1| HNH endonuclease domain protein [Bacteroides caccae ATCC 43185]
Length = 390
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 26/33 (78%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG +++ENC++L T NRSKGNR
Sbjct: 358 DADHVTAWSKGGATSMENCEMLCKTHNRSKGNR 390
>gi|359482658|ref|XP_002285224.2| PREDICTED: uncharacterized protein LOC100251511 [Vitis vinifera]
Length = 746
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 3/92 (3%)
Query: 60 DSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
D A P + P W++ + G DP +R D GNV++ L D DH
Sbjct: 68 DPKAKPTTKPKIFISLLWKQLNHLGNAGFDPAIFRVDPYGNVLYYH-ADSASPLAWDIDH 126
Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
P S+GG + N ++LQ V R K N+ E
Sbjct: 127 WFPCSRGGLTVPSNLRILQWQVCRRKHNKLEF 158
>gi|410491905|ref|YP_006907176.1| hypothetical protein P100D_47 [Propionibacterium phage P100D]
gi|407302176|gb|AFT97729.1| hypothetical protein P100D_47 [Propionibacterium phage P100D]
Length = 100
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGC------------LCHDYDHILPYSKGGKSTLENCQVLQA 137
RWR+ + + CP C + DHI P S+GG +TL+N Q++
Sbjct: 14 RWRQRLITQAQQQGQTTCPLCGVTITWDTHQLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|386390355|ref|ZP_10075145.1| HNH endonuclease domain protein [Haemophilus paraphrohaemolyticus
HK411]
gi|385693240|gb|EIG23892.1| HNH endonuclease domain protein [Haemophilus paraphrohaemolyticus
HK411]
Length = 388
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 14/54 (25%)
Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C HD DH+L +SKGG ++ ENC++L NRSKGN+
Sbjct: 335 CPLCAIHDNADKTKIWTMKEMEADHVLAWSKGGATSAENCEMLCIKHNRSKGNK 388
>gi|330858397|ref|YP_004414751.1| Gp48 [Propionibacterium phage PAS50]
gi|260066608|gb|ACX30886.1| Gp48 [Propionibacterium phage PAS50]
Length = 101
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 37/71 (52%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
RWR+ + + CP C HD DHI P S+GG +TL+N Q++
Sbjct: 15 RWRQRLITQARQQGQTECPLCGATITWDTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 74
Query: 138 TVNRSKGNRTE 148
T NRSKGN+T+
Sbjct: 75 TCNRSKGNQTQ 85
>gi|330858443|ref|YP_004414796.1| Gp48 [Propionibacterium phage PAD20]
gi|260066561|gb|ACX30840.1| Gp48 [Propionibacterium phage PAD20]
Length = 100
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 12/71 (16%)
Query: 90 RWRKDALGNVVFRKLVGCPGC------------LCHDYDHILPYSKGGKSTLENCQVLQA 137
RWR+ + + CP C + DHI P S+GG +TL+N Q++
Sbjct: 14 RWRQRLITQARQQGQTECPLCGATITWDTHQLPTSPEADHITPVSRGGLNTLDNGQIICR 73
Query: 138 TVNRSKGNRTE 148
T NRSKGNR+E
Sbjct: 74 TCNRSKGNRSE 84
>gi|429740338|ref|ZP_19274029.1| ftsk gamma domain protein [Prevotella saccharolytica F0055]
gi|429153310|gb|EKX96095.1| ftsk gamma domain protein [Prevotella saccharolytica F0055]
Length = 285
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 6/57 (10%)
Query: 91 WRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
WR+D + V C ++DHI+P+SKGG +T+ N Q+L NRSK N+
Sbjct: 234 WRRDG------GRCVKCGSQENLEFDHIIPFSKGGSNTIRNIQLLCEKCNRSKSNKI 284
>gi|302345663|ref|YP_003814016.1| HNH endonuclease domain protein [Prevotella melaninogenica ATCC
25845]
gi|302149883|gb|ADK96145.1| HNH endonuclease domain protein [Prevotella melaninogenica ATCC
25845]
Length = 373
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 37/87 (42%), Gaps = 29/87 (33%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLC-HDY-----DHIL 119
R FP +KQ WEK + CP C DY DHI
Sbjct: 310 RCFPDDIKQAVWEKQYHI-----------------------CPSCKKEFDYEFMEGDHIT 346
Query: 120 PYSKGGKSTLENCQVLQATVNRSKGNR 146
P+ +GG++ +ENCQ+L NR KG R
Sbjct: 347 PWREGGRTVIENCQMLCRECNRRKGGR 373
>gi|417162963|ref|ZP_11998293.1| PF03235 family protein [Escherichia coli 99.0741]
gi|386173454|gb|EIH45466.1| PF03235 family protein [Escherichia coli 99.0741]
Length = 390
Score = 44.7 bits (104), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 26/31 (83%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DH+ +SKGG+STL+NC++L T NRSKGN+
Sbjct: 360 DHVSAWSKGGESTLDNCEMLCITHNRSKGNK 390
>gi|410491817|ref|YP_006907042.1| hypothetical protein P105_45 [Propionibacterium phage P105]
gi|407302128|gb|AFT97682.1| hypothetical protein P105_45 [Propionibacterium phage P105]
Length = 100
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 27/35 (77%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
+ DHI P S+GG +TL+N Q++ T NRSKGNR+E
Sbjct: 50 EADHITPVSRGGLNTLDNGQIICRTCNRSKGNRSE 84
>gi|410491270|ref|YP_006906493.1| HNH endonuclease [Propionibacterium phage P14.4]
gi|407301990|gb|AFT97547.1| HNH endonuclease [Propionibacterium phage P14.4]
Length = 101
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 27/35 (77%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
+ DHI P S+GG +TL+N Q++ T NRSKGNR+E
Sbjct: 51 EADHITPVSRGGLNTLDNGQIICRTCNRSKGNRSE 85
>gi|422680858|ref|ZP_16739129.1| stress protein, partial [Pseudomonas syringae pv. tabaci str. ATCC
11528]
gi|331010203|gb|EGH90259.1| stress protein [Pseudomonas syringae pv. tabaci str. ATCC 11528]
Length = 138
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 58/126 (46%), Gaps = 21/126 (16%)
Query: 41 LATEEEISTLFTDLR-ISSADSTANPRSFPYSVKQQCWE-KAEKVR-------GRDPDRW 91
L TE E+ L +R ++ + R Y KQ E K E +R P
Sbjct: 15 LTTERELDDLVDYIRALAKGHKLPSVRVDAYEFKQVTGEAKGEPIRVNVSSMPAPKPPAA 74
Query: 92 RKDALGNVVFRKLV-----GCPGC-----LCHDYDHILPYSKGGKSTLENCQVLQATVNR 141
R++A+ V R++ C C LC +DHI+P+S+GG +T+ N Q+L N
Sbjct: 75 RREAIPRSVQREVWQRDAGKCVECGTRQLLC--FDHIVPFSRGGSNTVRNLQLLCEPCNL 132
Query: 142 SKGNRT 147
SKGNR
Sbjct: 133 SKGNRI 138
>gi|410689486|ref|YP_006963215.1| hypothetical protein [Arthrobacter sp. 31-32]
gi|389595975|gb|AFK88745.1| hypothetical protein [Arthrobacter sp. 31-32]
Length = 388
Score = 43.9 bits (102), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 25/33 (75%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGGK+ L NC++L T NR+KGNR
Sbjct: 356 DADHVAAWSKGGKTDLANCEMLCVTHNRAKGNR 388
>gi|331005592|ref|ZP_08328963.1| hypothetical protein IMCC1989_2172 [gamma proteobacterium IMCC1989]
gi|330420607|gb|EGG94902.1| hypothetical protein IMCC1989_2172 [gamma proteobacterium IMCC1989]
Length = 389
Score = 43.9 bits (102), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 25/33 (75%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG + +ENCQ+L T NR+KGN+
Sbjct: 357 DADHVTAWSKGGATNIENCQMLCKTHNRAKGNK 389
>gi|238926542|ref|ZP_04658302.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
gi|238885488|gb|EEQ49126.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
Length = 363
Score = 43.9 bits (102), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 23/30 (76%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
DHI P+SKGGK+T ENCQ+L A NR K N
Sbjct: 333 DHITPWSKGGKTTPENCQMLCAPCNRRKSN 362
>gi|421735209|ref|ZP_16174207.1| HNH endonuclease [Bifidobacterium bifidum LMG 13195]
gi|407076867|gb|EKE49775.1| HNH endonuclease [Bifidobacterium bifidum LMG 13195]
Length = 379
Score = 43.9 bits (102), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 24/31 (77%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI+P+S+GGK+TL+NCQ+L T N K N
Sbjct: 349 DHIVPWSEGGKTTLDNCQMLCTTCNIKKSNH 379
>gi|288803679|ref|ZP_06409109.1| HNH endonuclease domain-containing protein [Prevotella
melaninogenica D18]
gi|288333919|gb|EFC72364.1| HNH endonuclease domain-containing protein [Prevotella
melaninogenica D18]
Length = 373
Score = 43.9 bits (102), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 37/87 (42%), Gaps = 29/87 (33%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLC-HDY-----DHIL 119
R FP +K+ WEK + CP C DY DHI
Sbjct: 310 RCFPDDIKRAVWEKQHHI-----------------------CPSCQKEFDYEFMEGDHIT 346
Query: 120 PYSKGGKSTLENCQVLQATVNRSKGNR 146
P+ +GG++ +ENCQ+L NR KG R
Sbjct: 347 PWREGGRTVIENCQMLCRECNRRKGGR 373
>gi|257486734|ref|ZP_05640775.1| HNH endonuclease [Pseudomonas syringae pv. tabaci str. ATCC 11528]
Length = 217
Score = 43.5 bits (101), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 58/126 (46%), Gaps = 21/126 (16%)
Query: 41 LATEEEISTLFTDLR-ISSADSTANPRSFPYSVKQQCWE-KAEKVR-------GRDPDRW 91
L TE E+ L +R ++ + R Y KQ E K E +R P
Sbjct: 94 LTTERELDDLVDYIRALAKGHKLPSVRVDAYEFKQVTGEAKGEPIRVNVSSMPAPKPPAA 153
Query: 92 RKDALGNVVFRKLV-----GCPGC-----LCHDYDHILPYSKGGKSTLENCQVLQATVNR 141
R++A+ V R++ C C LC +DHI+P+S+GG +T+ N Q+L N
Sbjct: 154 RREAIPRSVQREVWQRDAGKCVECGTRQLLC--FDHIVPFSRGGSNTVRNLQLLCEPCNL 211
Query: 142 SKGNRT 147
SKGNR
Sbjct: 212 SKGNRI 217
>gi|359427003|ref|ZP_09218078.1| hypothetical protein GOAMR_71_00130 [Gordonia amarae NBRC 15530]
gi|358237616|dbj|GAB07660.1| hypothetical protein GOAMR_71_00130 [Gordonia amarae NBRC 15530]
Length = 548
Score = 43.5 bits (101), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 4/51 (7%)
Query: 100 VFRKLVG----CPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
VFR+ G C +YDHI+P+S+GG +++EN Q+L N++KG R
Sbjct: 497 VFRRCGGRCVECGSTSYLEYDHIIPWSRGGATSVENLQILCRACNQAKGAR 547
>gi|169840962|ref|ZP_02874075.1| hypothetical protein cdivTM_27683 [candidate division TM7
single-cell isolate TM7a]
Length = 66
Score = 43.5 bits (101), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 26/33 (78%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG +++ENC++L + NRSKGN+
Sbjct: 34 DADHVSAWSKGGSTSMENCEMLCKSHNRSKGNK 66
>gi|407001791|gb|EKE18705.1| hypothetical protein ACD_9C00258G0006 [uncultured bacterium]
Length = 387
Score = 43.5 bits (101), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 26/33 (78%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG ++++NCQ+L T NR+KGNR
Sbjct: 355 DADHVAAWSKGGATSVKNCQMLCKTHNRAKGNR 387
>gi|356557263|ref|XP_003546937.1| PREDICTED: uncharacterized protein LOC100776970 [Glycine max]
Length = 718
Score = 43.1 bits (100), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 8/117 (6%)
Query: 35 TLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWR 92
L ++E T +++ +F + A +T+ P+ F SV W++ + G DP R
Sbjct: 37 ALSEDEAETAKDLDEIFPIID-PKAKATSKPKIF-VSV---LWKQLRHLGNAGFDPAVIR 91
Query: 93 KDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
D GNV++ L D DH P S+GG + L N ++LQ + K N+ E
Sbjct: 92 VDGYGNVLYYH-ADSASPLAWDVDHWFPCSRGGLTVLSNLRLLQRQACKRKKNKLEF 147
>gi|336115757|ref|YP_004570523.1| hypothetical protein MLP_01060 [Microlunatus phosphovorus NM-1]
gi|334683535|dbj|BAK33120.1| hypothetical protein MLP_01060 [Microlunatus phosphovorus NM-1]
Length = 73
Score = 42.7 bits (99), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 23/33 (69%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG S L NC++L NRSKGNR
Sbjct: 41 DADHVTAWSKGGASDLANCEMLCVPHNRSKGNR 73
>gi|449981939|ref|ZP_21818046.1| HNH endonuclease [Streptococcus mutans 5SM3]
gi|450046601|ref|ZP_21839016.1| HNH endonuclease [Streptococcus mutans N34]
gi|449175287|gb|EMB77712.1| HNH endonuclease [Streptococcus mutans 5SM3]
gi|449198573|gb|EMB99680.1| HNH endonuclease [Streptococcus mutans N34]
Length = 412
Score = 42.7 bits (99), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
DHI P+S GGK+TL+NCQ+L N SKGN
Sbjct: 381 DHITPWSLGGKTTLDNCQLLCKKHNSSKGN 410
>gi|450031203|ref|ZP_21833586.1| HNH endonuclease [Streptococcus mutans G123]
gi|449191820|gb|EMB93276.1| HNH endonuclease [Streptococcus mutans G123]
Length = 412
Score = 42.7 bits (99), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
DHI P+S GGK+TL+NCQ+L N SKGN
Sbjct: 381 DHITPWSLGGKTTLDNCQLLCKKHNSSKGN 410
>gi|404212558|ref|YP_006671568.1| hypothetical protein KTR9_4973 [Gordonia sp. KTR9]
gi|403648253|gb|AFR51432.1| hypothetical protein KTR9_4973 [Gordonia sp. KTR9]
Length = 382
Score = 42.7 bits (99), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 24/33 (72%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG S L NC++L + NRSKGNR
Sbjct: 350 DADHVTAWSKGGASDLANCEMLCVSHNRSKGNR 382
>gi|356547420|ref|XP_003542110.1| PREDICTED: uncharacterized protein LOC100797659 [Glycine max]
Length = 740
Score = 42.7 bits (99), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 7/93 (7%)
Query: 59 ADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
A +T+ P+ F SV W++ + G DP R D GNV++ L D D
Sbjct: 73 AKATSKPKIF-VSV---LWKQLRHLGNAGFDPAVIRVDGYGNVLYYH-ADSASPLAWDVD 127
Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
H P S+GG + L N ++LQ + K N+ E
Sbjct: 128 HWFPCSRGGLTVLSNLRILQRQACKRKKNKLEF 160
>gi|302342138|ref|YP_003806667.1| hypothetical protein Deba_0701 [Desulfarculus baarsii DSM 2075]
gi|301638751|gb|ADK84073.1| hypothetical protein Deba_0701 [Desulfarculus baarsii DSM 2075]
Length = 103
Score = 42.4 bits (98), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 1/80 (1%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGC-PGCLCHDYDHILPYSKG 124
R P + ++ W+KA DPD R+D G ++R G D D+I+P G
Sbjct: 3 RDLPLELIERVWQKARPAPRLDPDSVRQDCFGAWIYRHAFGDRASAYGWDLDYIVPPDCG 62
Query: 125 GKSTLENCQVLQATVNRSKG 144
G+ L+N + LQ + +G
Sbjct: 63 GRGELDNLRPLQWQNSLMRG 82
>gi|429725832|ref|ZP_19260648.1| HNH endonuclease domain protein [Prevotella sp. oral taxon 473 str.
F0040]
gi|429148735|gb|EKX91736.1| HNH endonuclease domain protein [Prevotella sp. oral taxon 473 str.
F0040]
Length = 390
Score = 42.4 bits (98), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 26/33 (78%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG +TL+NC++L + NR+KGN+
Sbjct: 358 DADHVTAWSKGGATTLDNCEMLCKSHNRAKGNK 390
>gi|365895382|ref|ZP_09433497.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. STM
3843]
gi|365423858|emb|CCE06039.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. STM
3843]
Length = 185
Score = 42.4 bits (98), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C CL D +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFACQYCLSGDDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNMT 137
>gi|440742015|ref|ZP_20921345.1| TerF-like protein [Pseudomonas syringae BRIP39023]
gi|440378101|gb|ELQ14731.1| TerF-like protein [Pseudomonas syringae BRIP39023]
Length = 217
Score = 42.4 bits (98), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
LC +DHI+P+S+GG +T+ N Q+L N SKGNR
Sbjct: 183 LC--FDHIVPFSRGGSNTVRNLQLLCEPCNLSKGNRI 217
>gi|257486772|ref|ZP_05640813.1| stress protein [Pseudomonas syringae pv. tabaci str. ATCC 11528]
Length = 217
Score = 42.4 bits (98), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
LC +DHI+P+S+GG +T+ N Q+L N SKGNR
Sbjct: 183 LC--FDHIVPFSRGGSNTVRNLQLLCEPCNLSKGNRI 217
>gi|315498410|ref|YP_004087214.1| hnh endonuclease [Asticcacaulis excentricus CB 48]
gi|315416422|gb|ADU13063.1| HNH endonuclease [Asticcacaulis excentricus CB 48]
Length = 186
Score = 42.4 bits (98), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 98 NVVFRKLVGCPGCLC-HD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
NV R C C C HD +DH+LP S+GGKST N A N KG +T
Sbjct: 86 NVFLRDQFSCQYCGCRHDLTFDHVLPVSQGGKSTWTNIITACAPCNLRKGGKTPQQAQMA 145
Query: 155 IQKSSY 160
+ KS +
Sbjct: 146 LAKSPH 151
>gi|384253642|gb|EIE27116.1| hypothetical protein COCSUDRAFT_55140 [Coccomyxa subellipsoidea
C-169]
Length = 134
Score = 42.4 bits (98), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 3/61 (4%)
Query: 90 RWRKDALGNVVFRKLVGCPGCLC-HDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
R+R D GNVV + G C+C + DHI P+++GG S +EN L NR+ N +
Sbjct: 76 RFRSDPYGNVVALEARGS--CVCAFEVDHIFPWARGGLSVVENFMALFWRSNRNVKNDKD 133
Query: 149 L 149
+
Sbjct: 134 I 134
>gi|431803894|ref|YP_007230797.1| HNH endonuclease domain-containing protein [Pseudomonas putida
HB3267]
gi|430794659|gb|AGA74854.1| HNH endonuclease domain-containing protein [Pseudomonas putida
HB3267]
Length = 154
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 33/66 (50%), Gaps = 7/66 (10%)
Query: 88 PDRWRKDALGNVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKG 144
PDR R + ++R C C H DHI P GG++TLEN Q L N SKG
Sbjct: 92 PDRLRTAVMERDMYR----CVRCGSHRELCADHIFPEVLGGEATLENLQTLCRPCNTSKG 147
Query: 145 NRTELS 150
R E+S
Sbjct: 148 GRVEVS 153
>gi|227356931|ref|ZP_03841307.1| hypothetical protein HMPREF0693_2198 [Proteus mirabilis ATCC 29906]
gi|425072865|ref|ZP_18475971.1| hypothetical protein HMPREF1310_02304 [Proteus mirabilis WGLW4]
gi|227162908|gb|EEI47860.1| hypothetical protein HMPREF0693_2198 [Proteus mirabilis ATCC 29906]
gi|404596639|gb|EKA97159.1| hypothetical protein HMPREF1310_02304 [Proteus mirabilis WGLW4]
Length = 98
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELS 150
+ DHI+P KGGK T+ENCQVL +R K + ELS
Sbjct: 62 ELDHIIPLFKGGKDTIENCQVLCIECHRKK-TKEELS 97
>gi|255553017|ref|XP_002517551.1| conserved hypothetical protein [Ricinus communis]
gi|223543183|gb|EEF44715.1| conserved hypothetical protein [Ricinus communis]
Length = 263
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 50/122 (40%), Gaps = 8/122 (6%)
Query: 30 TVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRD 87
T + +E+ +++ +F + D A P + P W++ + G D
Sbjct: 44 TFIPFAMQHHEIMRAKQLEQMFPII-----DPKAKPTAKPKIFFNLLWKQLNHLGNAGFD 98
Query: 88 PDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
P R D GNV++ L D DH P +GG + N ++LQ V + K N+
Sbjct: 99 PAVIRVDPYGNVLYLH-ADKASPLAWDIDHWFPCQRGGLTVPSNLRILQWQVCKKKHNKL 157
Query: 148 EL 149
E
Sbjct: 158 EF 159
>gi|225377537|ref|ZP_03754758.1| hypothetical protein ROSEINA2194_03187 [Roseburia inulinivorans DSM
16841]
gi|225210614|gb|EEG92968.1| hypothetical protein ROSEINA2194_03187 [Roseburia inulinivorans DSM
16841]
Length = 84
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG + + NCQ+L T NR+KGN+
Sbjct: 52 DADHVTAWSKGGVTDISNCQMLCKTHNRAKGNK 84
>gi|111221190|ref|YP_711984.1| tellurium resistance protein terE [Frankia alni ACN14a]
gi|111148722|emb|CAJ60397.1| Tellurium resistance protein terE [Frankia alni ACN14a]
Length = 644
Score = 41.6 bits (96), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 25/34 (73%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
++DH++P+S+GG S++ N Q+L NRSKG R
Sbjct: 611 EFDHVIPFSRGGASSVNNLQLLCRRCNRSKGARI 644
>gi|270593505|ref|ZP_06221465.1| hypothetical protein HAINFHK1212_1873, partial [Haemophilus
influenzae HK1212]
gi|270318413|gb|EFA29540.1| hypothetical protein HAINFHK1212_1873 [Haemophilus influenzae
HK1212]
Length = 161
Score = 41.6 bits (96), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 14/54 (25%)
Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C HD DH+ +SKGG ++ ENC++L NR+KGN+
Sbjct: 108 CPLCAIHDNANKTKIWTMKEMEADHVSAWSKGGTTSAENCEMLCTMHNRAKGNK 161
>gi|346226706|ref|ZP_08847848.1| hypothetical protein AtheD1_16304 [Anaerophaga thermohalophila DSM
12881]
Length = 465
Score = 41.6 bits (96), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 15/85 (17%)
Query: 78 EKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCL--------------CHDYDHILPYS 122
E + K++ RD +R + +++R+ G C CL + DH+LP+S
Sbjct: 380 ENSIKIKERDENRAFSELERIIIYRRDKGFCQQCLRDGKPENEAKVSWSKYQADHVLPHS 439
Query: 123 KGGKSTLENCQVLQATVNRSKGNRT 147
KGGK+ +EN ++L + N+SKG +
Sbjct: 440 KGGKTIIENGELLCSYHNQSKGAKI 464
>gi|168037356|ref|XP_001771170.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677550|gb|EDQ64019.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 187
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 3/80 (3%)
Query: 72 VKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL 129
++ W++ E + G DP ++R D GNVV L D D PY +GG++ L
Sbjct: 81 FAEELWQQLEHLGNAGFDPAKFRVDPYGNVVHWN-ADPSSPLAWDVDLWFPYPRGGRTKL 139
Query: 130 ENCQVLQATVNRSKGNRTEL 149
N +++Q K NR E
Sbjct: 140 PNLRIVQWQACLRKRNRLEF 159
>gi|87311894|ref|ZP_01094006.1| restriction endonuclease [Blastopirellula marina DSM 3645]
gi|87285425|gb|EAQ77347.1| restriction endonuclease [Blastopirellula marina DSM 3645]
Length = 197
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%)
Query: 108 PGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
PG DH+LP ++GG+S+ ENC + N SK NRT L
Sbjct: 112 PGSESLTIDHVLPRAQGGESSWENCVLACVECNHSKANRTPL 153
>gi|402307168|ref|ZP_10826195.1| HNH endonuclease domain protein [Prevotella sp. MSX73]
gi|400379008|gb|EJP31857.1| HNH endonuclease domain protein [Prevotella sp. MSX73]
Length = 84
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 24/33 (72%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG + + NCQ+L + NR+KGN+
Sbjct: 52 DADHVAAWSKGGATDIHNCQMLCKSHNRAKGNK 84
>gi|423209364|ref|ZP_17195918.1| hypothetical protein HMPREF1169_01436 [Aeromonas veronii AER397]
gi|404617222|gb|EKB14158.1| hypothetical protein HMPREF1169_01436 [Aeromonas veronii AER397]
Length = 218
Score = 41.6 bits (96), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 26/36 (72%), Gaps = 2/36 (5%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
LC +DHI+P+SKGG +++ N Q+L N SKGNR
Sbjct: 184 LC--FDHIVPFSKGGSNSVRNLQLLCERCNLSKGNR 217
>gi|319897846|ref|YP_004136043.1| hnh endonuclease [Haemophilus influenzae F3031]
gi|317433352|emb|CBY81731.1| HNH endonuclease [Haemophilus influenzae F3031]
Length = 388
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 14/54 (25%)
Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C HD DH+ +SKGG ++ ENC++L NR+KGN+
Sbjct: 335 CPLCAIHDNANKTKIWTMKEMEADHVSAWSKGGTTSAENCEMLCTMHNRAKGNK 388
>gi|385809935|ref|YP_005846331.1| hypothetical protein IALB_1353 [Ignavibacterium album JCM 16511]
gi|383801983|gb|AFH49063.1| Hypothetical protein IALB_1353 [Ignavibacterium album JCM 16511]
Length = 436
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 3/48 (6%)
Query: 102 RKLVGCPGCLC-HDY--DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
RK C G + D+ DHI P+SKGG + L+N ++ N SKGNR
Sbjct: 389 RKCTKCGGLVTWTDFSADHIKPFSKGGATKLQNAAIMHKNCNSSKGNR 436
>gi|373467210|ref|ZP_09558511.1| HNH endonuclease domain protein [Haemophilus sp. oral taxon 851
str. F0397]
gi|371759169|gb|EHO47915.1| HNH endonuclease domain protein [Haemophilus sp. oral taxon 851
str. F0397]
Length = 388
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 14/54 (25%)
Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C HD DH+ +SKGG ++ ENC++L NR+KGN+
Sbjct: 335 CPLCAIHDNVNKTKIWTMKEMEADHVSAWSKGGTTSAENCEMLCTMHNRAKGNK 388
>gi|303245412|ref|ZP_07331696.1| histidine triad (HIT) protein [Desulfovibrio fructosovorans JJ]
gi|302493261|gb|EFL53123.1| histidine triad (HIT) protein [Desulfovibrio fructosovorans JJ]
Length = 309
Score = 41.2 bits (95), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSS 159
D DHI P SKGGK+ N QVL + NR+KGN+ + L+++ +
Sbjct: 139 DVDHIKPRSKGGKNEYANLQVLCSRCNRAKGNKEDTDYRPLVREEA 184
>gi|294675057|ref|YP_003575673.1| HNH endonuclease [Prevotella ruminicola 23]
gi|294473171|gb|ADE82560.1| HNH endonuclease [Prevotella ruminicola 23]
Length = 374
Score = 41.2 bits (95), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 23/84 (27%)
Query: 66 RSFPYSVKQQCWEKAEK---VRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYS 122
R+FP +K + WE E + G++ D F + G DHI P+
Sbjct: 311 RAFPEDIKLKVWENQEHKCAICGKEFD-----------FEFMEG---------DHITPWR 350
Query: 123 KGGKSTLENCQVLQATVNRSKGNR 146
GG++ +ENCQ+L NR KG++
Sbjct: 351 DGGRTVVENCQMLCRECNRRKGSK 374
>gi|83591793|ref|YP_425545.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
rubrum ATCC 11170]
gi|386348484|ref|YP_006046732.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
rubrum F11]
gi|83574707|gb|ABC21258.1| CRISPR-associated endonuclease, Csn1 family [Rhodospirillum rubrum
ATCC 11170]
gi|346716920|gb|AEO46935.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
rubrum F11]
Length = 1173
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 13/36 (36%), Positives = 27/36 (75%)
Query: 112 CHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
+D DH++P ++GG+ +L+N + Q+ N++KG++T
Sbjct: 623 AYDIDHVIPLARGGRDSLDNMVLCQSDANKTKGDKT 658
>gi|281422823|ref|ZP_06253822.1| putative HNH endonuclease domain protein [Prevotella copri DSM
18205]
gi|281403089|gb|EFB33769.1| putative HNH endonuclease domain protein [Prevotella copri DSM
18205]
Length = 389
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DH+ +SKGGK+ L NCQ+L NR+KGN+
Sbjct: 359 DHVTAWSKGGKTDLANCQMLCKMHNRTKGNK 389
>gi|406036742|ref|ZP_11044106.1| hypothetical protein AparD1_07217 [Acinetobacter parvus DSM 16617 =
CIP 108168]
Length = 386
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 25/33 (75%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG + ++NC++L T NR+KGNR
Sbjct: 354 DADHVTAWSKGGATDVKNCEMLCKTHNRAKGNR 386
>gi|407643249|ref|YP_006807008.1| hypothetical protein O3I_010365 [Nocardia brasiliensis ATCC 700358]
gi|407306133|gb|AFU00034.1| hypothetical protein O3I_010365 [Nocardia brasiliensis ATCC 700358]
Length = 618
Score = 41.2 bits (95), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 36/81 (44%), Gaps = 20/81 (24%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
RS P VK + W++ K V C ++DHI+P S+GG
Sbjct: 557 RSIPQEVKAEVWQRDGG--------------------KCVECGDSHYLEFDHIIPLSRGG 596
Query: 126 KSTLENCQVLQATVNRSKGNR 146
++ N Q+L + NR+KG R
Sbjct: 597 ATSAANLQILCRSCNRAKGAR 617
>gi|238926382|ref|ZP_04658142.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
gi|238885786|gb|EEQ49424.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
Length = 760
Score = 41.2 bits (95), Expect = 0.23, Method: Composition-based stats.
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
+ DHI+P +KGG+S LEN Q+L + N KG++ E
Sbjct: 723 NVDHIVPMNKGGRSVLENLQILCRSCNAKKGDQLE 757
>gi|297743366|emb|CBI36233.3| unnamed protein product [Vitis vinifera]
Length = 657
Score = 40.8 bits (94), Expect = 0.25, Method: Composition-based stats.
Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 3/92 (3%)
Query: 60 DSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
D A P + P W++ + G DP +R D GNV++ L D DH
Sbjct: 68 DPKAKPTTKPKIFISLLWKQLNHLGNAGFDPAIFRVDPYGNVLYYH-ADSASPLAWDIDH 126
Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
P S+GG + N ++LQ V R K N+ E
Sbjct: 127 WFPCSRGGLTVPSNLRILQWQVCRRKHNKLEF 158
>gi|7021740|gb|AAF35421.1| hypothetical protein [Arabidopsis thaliana]
Length = 188
Score = 40.8 bits (94), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 51/130 (39%), Gaps = 8/130 (6%)
Query: 22 PSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAE 81
P + T L NE E+ +F + D A P + P W++
Sbjct: 35 PYKNGPPFTFMPYILQQNESLRCREVDQMFPVI-----DPKARPTTKPKIFLSLLWKQLN 89
Query: 82 KV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATV 139
+ G DP R D GNVV+ L +DH P S+GG + N +++Q
Sbjct: 90 HLGNAGFDPAVIRIDPYGNVVYFH-ADSASPLAWSFDHWFPCSRGGLTVPSNLRIVQWQA 148
Query: 140 NRSKGNRTEL 149
++K ++ E
Sbjct: 149 RKNKKDKLEF 158
>gi|428169182|gb|EKX38118.1| hypothetical protein GUITHDRAFT_144431 [Guillardia theta CCMP2712]
Length = 320
Score = 40.8 bits (94), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 8/83 (9%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH-DYDHILPYSKG 124
R +P +V + W + + D W GN++ + G G +C+ + DH+ P S+G
Sbjct: 153 RQYPRAVMEVHWIPSWFHKLFKVDMW-----GNII--SVDGLQGAMCYAEEDHLFPSSRG 205
Query: 125 GKSTLENCQVLQATVNRSKGNRT 147
G+++L N ++Q NR K +
Sbjct: 206 GRTSLANLVLIQFVANRKKKEKI 228
>gi|359424187|ref|ZP_09215309.1| hypothetical protein GOAMR_24_00140 [Gordonia amarae NBRC 15530]
gi|358240461|dbj|GAB04891.1| hypothetical protein GOAMR_24_00140 [Gordonia amarae NBRC 15530]
Length = 379
Score = 40.8 bits (94), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 24/32 (75%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGN 145
D DHI+P+SKGGK+ EN Q+L NRSKG+
Sbjct: 347 DADHIVPWSKGGKTEAENGQMLCIGCNRSKGD 378
>gi|392944277|ref|ZP_10309919.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
sp. QA3]
gi|392287571|gb|EIV93595.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
sp. QA3]
Length = 607
Score = 40.8 bits (94), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 25/33 (75%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
++DH++P+S+GG S++ N Q+L NR+KG R
Sbjct: 574 EFDHVIPFSRGGASSVNNLQLLCRRCNRAKGAR 606
>gi|255070577|ref|XP_002507370.1| predicted protein [Micromonas sp. RCC299]
gi|226522645|gb|ACO68628.1| predicted protein [Micromonas sp. RCC299]
Length = 1022
Score = 40.8 bits (94), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 11/59 (18%)
Query: 91 WRKDALGNVVFRKLVGCPGCL------CHDYDHILPYSKGGKSTLENCQVLQATVNRSK 143
+R D GNVV + PG C +YDH+LP+S+GG S N +V+ N+ K
Sbjct: 784 FRVDMYGNVVAK-----PGAAENFSVCCTEYDHVLPWSRGGTSKSANIEVISWIANQRK 837
>gi|384536863|ref|YP_005720948.1| HNH nuclease [Sinorhizobium meliloti SM11]
gi|336033755|gb|AEH79687.1| HNH nuclease [Sinorhizobium meliloti SM11]
Length = 113
Score = 40.8 bits (94), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
+YDHI+PY+ STL NCQVL +R +G +T
Sbjct: 34 EYDHIVPYALSEDSTLSNCQVLCVPCHRGEGAKT 67
>gi|331267282|ref|YP_004326912.1| HNH endonuclease family [Streptococcus oralis Uo5]
gi|326683954|emb|CBZ01572.1| HNH endonuclease family [Streptococcus oralis Uo5]
Length = 412
Score = 40.8 bits (94), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
DHI+P+S GGK+ ENCQ+L N SKGN
Sbjct: 381 DHIIPWSLGGKTVPENCQLLCKKHNSSKGN 410
>gi|449460568|ref|XP_004148017.1| PREDICTED: uncharacterized protein LOC101221010 [Cucumis sativus]
gi|449520032|ref|XP_004167038.1| PREDICTED: uncharacterized LOC101221010 [Cucumis sativus]
Length = 711
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 50/122 (40%), Gaps = 8/122 (6%)
Query: 30 TVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRD 87
T +L E AT E+ LF + + A P + P W++ + G D
Sbjct: 43 TFTPYSLKKRETATARELDKLFPII-----NPKAKPTTKPKLFVSLLWKQLNHLGNAGFD 97
Query: 88 PDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
P R D GNV++ L D DH P S+GG + N ++LQ + K ++
Sbjct: 98 PAVIRVDHYGNVLYYH-ADSASPLAWDIDHWFPCSRGGLTVPSNLRILQWQACKRKHHKL 156
Query: 148 EL 149
E
Sbjct: 157 EF 158
>gi|303236487|ref|ZP_07323074.1| HNH endonuclease domain protein [Prevotella disiens FB035-09AN]
gi|302483338|gb|EFL46346.1| HNH endonuclease domain protein [Prevotella disiens FB035-09AN]
Length = 367
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 6/46 (13%)
Query: 107 CPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C H + DHI+P+ KGGK+ +NCQ+L NR+K +
Sbjct: 322 CPICGKHFEIEEMEADHIIPWHKGGKTEKDNCQMLCMKCNRTKSGK 367
>gi|416418648|ref|ZP_11689169.1| hypothetical protein CWATWH0003_B254 [Crocosphaera watsonii WH
0003]
gi|357259652|gb|EHJ09318.1| hypothetical protein CWATWH0003_B254 [Crocosphaera watsonii WH
0003]
Length = 66
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 22/31 (70%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI+ YSKGGK+ N Q L +T NR KGNR
Sbjct: 36 DHIIAYSKGGKTENWNGQALCSTCNRQKGNR 66
>gi|421277404|ref|ZP_15728224.1| HNH endonuclease [Streptococcus mitis SPAR10]
gi|395876685|gb|EJG87761.1| HNH endonuclease [Streptococcus mitis SPAR10]
Length = 388
Score = 40.4 bits (93), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG + NCQ+L T NR+KGNR
Sbjct: 356 DADHVTAWSKGGATDESNCQMLCVTHNRAKGNR 388
>gi|239907119|ref|YP_002953860.1| HIT family protein [Desulfovibrio magneticus RS-1]
gi|239796985|dbj|BAH75974.1| HIT family protein [Desulfovibrio magneticus RS-1]
Length = 324
Score = 40.4 bits (93), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE-----LSRSDLIQKSSYCRVSGR 166
D DHI P SKGG + N QVL + NRSKGN+ + L++ + I +C + R
Sbjct: 154 DVDHIRPRSKGGGNEYANLQVLCSKCNRSKGNKEDTDYRALAQGEAIPGCPFCYDAAR 211
>gi|348169152|ref|ZP_08876046.1| HNH nuclease [Saccharopolyspora spinosa NRRL 18395]
Length = 221
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKG 144
YDHI+P + GG ST EN Q+L A N+SKG
Sbjct: 189 YDHIIPVALGGSSTEENLQLLCAQCNQSKG 218
>gi|108758899|ref|YP_629743.1| HNH endonuclease domain-containing protein [Myxococcus xanthus DK
1622]
gi|108462779|gb|ABF87964.1| HNH endonuclease domain protein [Myxococcus xanthus DK 1622]
Length = 180
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
YDH+LP ++GGK++ EN + N+ KGNRT
Sbjct: 96 YDHVLPRAQGGKTSWENIVIACVPCNQKKGNRT 128
>gi|405355448|ref|ZP_11024674.1| HNH endonuclease [Chondromyces apiculatus DSM 436]
gi|397091790|gb|EJJ22592.1| HNH endonuclease [Myxococcus sp. (contaminant ex DSM 436)]
Length = 180
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
YDH+LP ++GGK++ EN + N+ KGNRT
Sbjct: 96 YDHVLPRAQGGKTSWENIVIACVPCNQKKGNRT 128
>gi|443674162|ref|ZP_21139202.1| HNH endonuclease [Rhodococcus sp. AW25M09]
gi|443413333|emb|CCQ17541.1| HNH endonuclease [Rhodococcus sp. AW25M09]
Length = 390
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 23/31 (74%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DH+ +S GG + L+NC++L T NR+KGNR
Sbjct: 360 DHVTAWSNGGNTDLDNCEMLCVTHNRAKGNR 390
>gi|441509730|ref|ZP_20991644.1| hypothetical protein GOACH_15_00980 [Gordonia aichiensis NBRC
108223]
gi|441446246|dbj|GAC49605.1| hypothetical protein GOACH_15_00980 [Gordonia aichiensis NBRC
108223]
Length = 334
Score = 40.4 bits (93), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
L + DHI+P S+GG ST+EN Q L NRSK N+
Sbjct: 295 LLLEVDHIVPVSRGGLSTMENLQTLCWKCNRSKSNK 330
>gi|359423170|ref|ZP_09214313.1| hypothetical protein GOAMR_06_01010 [Gordonia amarae NBRC 15530]
gi|358241617|dbj|GAB03895.1| hypothetical protein GOAMR_06_01010 [Gordonia amarae NBRC 15530]
Length = 394
Score = 40.0 bits (92), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 23/31 (74%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DH+ +S+GG STL NC++L NR+KGN+
Sbjct: 364 DHVTAWSRGGASTLANCEMLCIVHNRAKGNK 394
>gi|254295258|ref|YP_003061281.1| HNH endonuclease [Hirschia baltica ATCC 49814]
gi|254043789|gb|ACT60584.1| HNH endonuclease [Hirschia baltica ATCC 49814]
Length = 85
Score = 40.0 bits (92), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
+DHI+P SK G +EN Q+ A N++KGNR
Sbjct: 53 FDHIIPKSKHGDDKIENLQLAHAICNKTKGNR 84
>gi|114800298|ref|YP_762067.1| HNH endonuclease domain-containing protein [Hyphomonas neptunium
ATCC 15444]
gi|114740472|gb|ABI78597.1| HNH endonuclease domain protein [Hyphomonas neptunium ATCC 15444]
Length = 197
Score = 40.0 bits (92), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P SKGG++T EN + N +KG +
Sbjct: 97 NVFLRDGFKCAYCGAHDNLTFDHVIPRSKGGRTTWENIVTACSPCNLAKGGK 148
>gi|227484896|ref|ZP_03915212.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC
51172]
gi|227237051|gb|EEI87066.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC
51172]
Length = 121
Score = 40.0 bits (92), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI+P+SKGGK+ ENCQ+L N K N
Sbjct: 91 DHIVPWSKGGKTIAENCQMLCGDCNIKKSNH 121
>gi|422793041|ref|ZP_16845739.1| HNH endonuclease [Escherichia coli TA007]
gi|323970450|gb|EGB65712.1| HNH endonuclease [Escherichia coli TA007]
gi|380248683|emb|CBW53077.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Hadar]
Length = 390
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 43/97 (44%), Gaps = 29/97 (29%)
Query: 64 NPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHD--------- 114
N R F +K+Q ++ R D+ + + N CP C D
Sbjct: 309 NIRFFDEKIKKQAYK-------RQTDKAEQQGISN--------CPLCASSDNANKTKIWA 353
Query: 115 -----YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DH+ +SKGG+S +NC++L T NRSKGN+
Sbjct: 354 LKEMEADHVSAWSKGGESKPDNCEMLCVTHNRSKGNK 390
>gi|323137342|ref|ZP_08072420.1| HNH endonuclease [Methylocystis sp. ATCC 49242]
gi|322397329|gb|EFX99852.1| HNH endonuclease [Methylocystis sp. ATCC 49242]
Length = 190
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P SKGG +T EN + N KG+R
Sbjct: 90 NVFLRDRFTCQYCGAHDELTFDHVIPRSKGGATTWENVVAACSPCNLRKGDR 141
>gi|300088252|ref|YP_003758774.1| hypothetical protein Dehly_1158 [Dehalogenimonas
lykanthroporepellens BL-DC-9]
gi|299527985|gb|ADJ26453.1| protein of unknown function DUF262 [Dehalogenimonas
lykanthroporepellens BL-DC-9]
Length = 388
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGN 145
D DH+ +SKGG S+ ENC++L T NR+KGN
Sbjct: 356 DADHVSAWSKGGDSSPENCEMLCITHNRAKGN 387
>gi|367467210|ref|ZP_09467189.1| Tellurium resistance protein terE [Patulibacter sp. I11]
gi|365817693|gb|EHN12644.1| Tellurium resistance protein terE [Patulibacter sp. I11]
Length = 176
Score = 40.0 bits (92), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%)
Query: 107 CPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKG 144
C YDH++P + GG +T+EN QVL A N+ KG
Sbjct: 135 CGATFDLQYDHVIPIALGGANTVENLQVLCAPCNQRKG 172
>gi|379707873|ref|YP_005263078.1| hypothetical protein NOCYR_1649 [Nocardia cyriacigeorgica GUH-2]
gi|374845372|emb|CCF62438.1| conserved protein of unknown function; putative endonuclease domain
[Nocardia cyriacigeorgica GUH-2]
Length = 549
Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 36/81 (44%), Gaps = 20/81 (24%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
RS P VK + W+ RD R V C ++DHI+P S+GG
Sbjct: 488 RSIPQEVKAEVWQ-------RDGGR-------------CVECGDGHYLEFDHIIPLSRGG 527
Query: 126 KSTLENCQVLQATVNRSKGNR 146
++ N Q+L NR+KG R
Sbjct: 528 ATSATNLQILCRACNRAKGAR 548
>gi|402848418|ref|ZP_10896676.1| HNH endonuclease family protein [Rhodovulum sp. PH10]
gi|402501324|gb|EJW12978.1| HNH endonuclease family protein [Rhodovulum sp. PH10]
Length = 172
Score = 39.7 bits (91), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C HD +DH++P S GG++T +N + N KGN+T
Sbjct: 72 NVFLRDRFSCQYCGSHDDLTFDHLIPRSLGGQTTWDNVVTACSACNLKKGNKT 124
>gi|224143811|ref|XP_002336083.1| predicted protein [Populus trichocarpa]
gi|222871397|gb|EEF08528.1| predicted protein [Populus trichocarpa]
Length = 188
Score = 39.7 bits (91), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 10/114 (8%)
Query: 39 NELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDAL 96
+E+ E+ +F + A TA P+ F W + + G DP R D
Sbjct: 53 HEILRTRELDQMFPVIE-PKAKQTAKPKIF----ISLLWRQLNHLGNAGFDPAVIRVDPY 107
Query: 97 GNVV-FRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
GNV+ F P L D DH P +GG + N ++LQ V + K N+ E
Sbjct: 108 GNVLYFHADKASP--LAWDIDHWFPCPRGGLTVPRNLRILQRQVCKRKHNKLEF 159
>gi|224058715|ref|XP_002299617.1| predicted protein [Populus trichocarpa]
gi|222846875|gb|EEE84422.1| predicted protein [Populus trichocarpa]
Length = 188
Score = 39.7 bits (91), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 10/114 (8%)
Query: 39 NELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDAL 96
+E+ E+ +F + A TA P+ F W + + G DP R D
Sbjct: 53 HEILRTRELDQMFPVIE-PKAKQTAKPKIF----ISLLWRQLNHLGNAGFDPAVIRVDPY 107
Query: 97 GNVV-FRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
GNV+ F P L D DH P +GG + N ++LQ V + K N+ E
Sbjct: 108 GNVLYFHADKASP--LAWDIDHWFPCPRGGLTVPRNLRILQRQVCKRKHNKLEF 159
>gi|401564832|ref|ZP_10805697.1| type III restriction enzyme, res subunit [Selenomonas sp. FOBRC6]
gi|400188432|gb|EJO22596.1| type III restriction enzyme, res subunit [Selenomonas sp. FOBRC6]
Length = 754
Score = 39.7 bits (91), Expect = 0.65, Method: Composition-based stats.
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
+ DHI P + GG+S L+N QVL + N SKG++ E
Sbjct: 719 EVDHIKPMNDGGQSVLDNLQVLCKSCNASKGDKAE 753
>gi|159897861|ref|YP_001544108.1| HNH endonuclease [Herpetosiphon aurantiacus DSM 785]
gi|159890900|gb|ABX03980.1| HNH endonuclease [Herpetosiphon aurantiacus DSM 785]
Length = 321
Score = 39.7 bits (91), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 25/34 (73%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
++DHI+P+SKGG +T+ N Q+L N +KG+R
Sbjct: 288 EFDHIIPHSKGGANTVGNVQLLCRKCNLAKGDRI 321
>gi|169346887|ref|ZP_02865835.1| type III restriction enzyme, res subunit family [Clostridium
perfringens C str. JGS1495]
gi|169296946|gb|EDS79070.1| type III restriction enzyme, res subunit family [Clostridium
perfringens C str. JGS1495]
Length = 740
Score = 39.3 bits (90), Expect = 0.71, Method: Composition-based stats.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
DHI P SKGG +TL+N Q+L + N+ KG++ E
Sbjct: 708 DHIKPISKGGLTTLDNLQLLCSKCNKIKGDKLE 740
>gi|308535332|ref|YP_003933733.1| HNH endonuclease [Geobacter bemidjiensis Bem]
gi|308052677|gb|ADO00821.1| HNH endonuclease [Geobacter bemidjiensis Bem]
Length = 161
Score = 39.3 bits (90), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 7/70 (10%)
Query: 78 EKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCHD---YDHILPYSKGGKSTLENCQ 133
E+A+ R R P A+ NVV+ + G C C ++ +DHI+P+SKGG T N Q
Sbjct: 91 EQADSRRKRQPI---ARAVRNVVWNRDGGVCVECGSNENLEFDHIIPHSKGGSDTERNIQ 147
Query: 134 VLQATVNRSK 143
+L NR+K
Sbjct: 148 LLCMKCNRTK 157
>gi|297830140|ref|XP_002882952.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297328792|gb|EFH59211.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 727
Score = 39.3 bits (90), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 26/100 (26%), Positives = 41/100 (41%), Gaps = 3/100 (3%)
Query: 52 TDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPG 109
D D A P + P W++ + G DP R D GNVV+
Sbjct: 59 VDAMFPVIDPKARPTTKPKIFLSLLWKQLNHLGNAGFDPAVIRIDPYGNVVYFH-ADSAS 117
Query: 110 CLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
L +DH P S+GG + N +++Q ++K ++ E
Sbjct: 118 PLAWSFDHWFPCSRGGLTVPSNLRIVQWQARKNKKDKLEF 157
>gi|433624626|ref|YP_007258256.1| Putative uncharacterized protein [Mycoplasma cynos C142]
gi|429534652|emb|CCP24154.1| Putative uncharacterized protein [Mycoplasma cynos C142]
Length = 104
Score = 39.3 bits (90), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
Query: 92 RKDALGNVVFRKLVGCPGC-LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELS 150
++D G ++ + G P C + DHILP S GGK LEN + N K N+
Sbjct: 20 KRDFAGRIMDKSAYGDPNSEFCWNIDHILPVSCGGKDELENFIAVHCETNTEKANKFPHF 79
Query: 151 RSDLIQKSSYCRVSGRDMDLLEL 173
+++ KS R S + + E+
Sbjct: 80 KAN--NKSFEIRRSNKGFNKYEI 100
>gi|183602772|ref|ZP_02964135.1| hypothetical protein BIFLAC_01035 [Bifidobacterium animalis subsp.
lactis HN019]
gi|219683130|ref|YP_002469513.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis AD011]
gi|241191461|ref|YP_002968855.1| hypothetical protein Balac_1446 [Bifidobacterium animalis subsp.
lactis Bl-04]
gi|241196867|ref|YP_002970422.1| hypothetical protein Balat_1446 [Bifidobacterium animalis subsp.
lactis DSM 10140]
gi|384190095|ref|YP_005575843.1| hypothetical protein [Bifidobacterium animalis subsp. lactis BB-12]
gi|384192885|ref|YP_005578632.1| hypothetical protein [Bifidobacterium animalis subsp. lactis CNCM
I-2494]
gi|384194458|ref|YP_005580204.1| HNH endonuclease domain protein [Bifidobacterium animalis subsp.
lactis BLC1]
gi|384196023|ref|YP_005581768.1| hypothetical protein BalV_1401 [Bifidobacterium animalis subsp.
lactis V9]
gi|387821324|ref|YP_006301367.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis B420]
gi|387823007|ref|YP_006302956.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis Bi-07]
gi|423679993|ref|ZP_17654869.1| hypothetical protein FEM_15483 [Bifidobacterium animalis subsp.
lactis BS 01]
gi|183218010|gb|EDT88658.1| hypothetical protein BIFLAC_01035 [Bifidobacterium animalis subsp.
lactis HN019]
gi|219620780|gb|ACL28937.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis AD011]
gi|240249853|gb|ACS46793.1| hypothetical protein Balac_1446 [Bifidobacterium animalis subsp.
lactis Bl-04]
gi|240251421|gb|ACS48360.1| hypothetical protein Balat_1446 [Bifidobacterium animalis subsp.
lactis DSM 10140]
gi|289177587|gb|ADC84833.1| Hypothetical protein BIF_00872 [Bifidobacterium animalis subsp.
lactis BB-12]
gi|295794454|gb|ADG33989.1| hypothetical protein BalV_1401 [Bifidobacterium animalis subsp.
lactis V9]
gi|340365622|gb|AEK30913.1| Hypothetical protein BALAC2494_01292 [Bifidobacterium animalis
subsp. lactis CNCM I-2494]
gi|345283317|gb|AEN77171.1| HNH endonuclease domain protein [Bifidobacterium animalis subsp.
lactis BLC1]
gi|366040992|gb|EHN17505.1| hypothetical protein FEM_15483 [Bifidobacterium animalis subsp.
lactis BS 01]
gi|386654025|gb|AFJ17155.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis B420]
gi|386655615|gb|AFJ18744.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis Bi-07]
Length = 372
Score = 39.3 bits (90), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 25/46 (54%), Gaps = 6/46 (13%)
Query: 107 CPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C H DH +P+SKGGK+ LENC +L N K N+
Sbjct: 327 CPLCQQHFELSEMQADHNIPWSKGGKTILENCVMLCVKDNIKKSNK 372
>gi|442318514|ref|YP_007358535.1| HNH endonuclease domain-containing protein [Myxococcus stipitatus
DSM 14675]
gi|441486156|gb|AGC42851.1| HNH endonuclease domain-containing protein [Myxococcus stipitatus
DSM 14675]
Length = 180
Score = 39.3 bits (90), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
YDH++P ++GGK++ EN + N+ KGNRT
Sbjct: 96 YDHVIPRAQGGKTSWENVVIACVPCNQKKGNRT 128
>gi|357455079|ref|XP_003597820.1| hypothetical protein MTR_2g102680 [Medicago truncatula]
gi|355486868|gb|AES68071.1| hypothetical protein MTR_2g102680 [Medicago truncatula]
Length = 707
Score = 39.3 bits (90), Expect = 0.80, Method: Composition-based stats.
Identities = 33/117 (28%), Positives = 47/117 (40%), Gaps = 3/117 (2%)
Query: 35 TLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWR 92
T L L +E F D D A P + W++ + G DP R
Sbjct: 43 TFLPYALPEDEVERVSFLDEMFPIIDPKAKPTTKAKIFASILWKQLSHLGNAGFDPAVIR 102
Query: 93 KDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
D GNV++ L D DH P S+GG + L N ++LQ V + K ++ E
Sbjct: 103 VDGYGNVLYYH-ADSASPLAWDIDHWFPCSRGGLTVLSNLRILQRQVCKRKKHKLEF 158
>gi|240255352|ref|NP_188174.4| uncharacterized protein [Arabidopsis thaliana]
gi|11994223|dbj|BAB01345.1| unnamed protein product [Arabidopsis thaliana]
gi|332642171|gb|AEE75692.1| uncharacterized protein [Arabidopsis thaliana]
Length = 729
Score = 39.3 bits (90), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 26/100 (26%), Positives = 41/100 (41%), Gaps = 3/100 (3%)
Query: 52 TDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPG 109
D D A P + P W++ + G DP R D GNVV+
Sbjct: 60 VDQMFPVIDPKARPTTKPKIFLSLLWKQLNHLGNAGFDPAVIRIDPYGNVVYFH-ADSAS 118
Query: 110 CLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
L +DH P S+GG + N +++Q ++K ++ E
Sbjct: 119 PLAWSFDHWFPCSRGGLTVPSNLRIVQWQARKNKKDKLEF 158
>gi|154500496|ref|ZP_02038534.1| hypothetical protein BACCAP_04168 [Bacteroides capillosus ATCC
29799]
gi|150270727|gb|EDM98023.1| HNH endonuclease domain protein [Pseudoflavonifractor capillosus
ATCC 29799]
Length = 750
Score = 39.3 bits (90), Expect = 0.82, Method: Composition-based stats.
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHILP SKGGKS EN Q+L + N KG++
Sbjct: 720 DHILPMSKGGKSVPENLQILCRSCNGRKGDQ 750
>gi|326512504|dbj|BAJ99607.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 283
Score = 39.3 bits (90), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 10/91 (10%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
N++FR C C D DH++P S+GGK EN + N KGN+T L +
Sbjct: 179 NILFRDGFSCQYCSSEDDLTIDHVIPASRGGKWEWENLVTACSRCNSRKGNKTLLQANMK 238
Query: 155 IQKSSYCRVSGRDMDLLEL----SAYGNVRR 181
++K ++ D++ + SA+ +RR
Sbjct: 239 LRKIPK---GPKEFDIIAVPLTKSAFRTIRR 266
>gi|417782516|ref|ZP_12430243.1| HNH endonuclease domain protein [Leptospira interrogans str.
C10069]
gi|418669102|ref|ZP_13230498.1| HNH endonuclease domain protein [Leptospira interrogans serovar
Pyrogenes str. 2006006960]
gi|409954299|gb|EKO08791.1| HNH endonuclease domain protein [Leptospira interrogans str.
C10069]
gi|410755141|gb|EKR16775.1| HNH endonuclease domain protein [Leptospira interrogans serovar
Pyrogenes str. 2006006960]
Length = 239
Score = 39.3 bits (90), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 26/46 (56%), Gaps = 1/46 (2%)
Query: 108 PGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSD 153
PG H DHI+P+SKGG + +EN + A N KGN E S D
Sbjct: 192 PGTELH-CDHIIPWSKGGLTVIENLRTTCAKCNLGKGNLIETSVWD 236
>gi|450159591|ref|ZP_21879522.1| HNH endonuclease [Streptococcus mutans 66-2A]
gi|449240999|gb|EMC39646.1| HNH endonuclease [Streptococcus mutans 66-2A]
Length = 412
Score = 39.3 bits (90), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
DHI P+S GGK+ ENCQ+L N SKGN
Sbjct: 381 DHITPWSLGGKTVPENCQLLCKKHNSSKGN 410
>gi|254432244|ref|ZP_05045947.1| HNH nuclease [Cyanobium sp. PCC 7001]
gi|197626697|gb|EDY39256.1| HNH nuclease [Cyanobium sp. PCC 7001]
Length = 145
Score = 39.3 bits (90), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 5/73 (6%)
Query: 78 EKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQA 137
E K+R R RWR+ +L + ++ + C GC DH+LP S+GG S ENC
Sbjct: 5 ELCPKLRAR---RWRQ-SLHQLTQQRCIYC-GCRSESIDHVLPKSRGGPSVNENCVPACL 59
Query: 138 TVNRSKGNRTELS 150
N KG+ S
Sbjct: 60 ACNGRKGDSEAFS 72
>gi|67920391|ref|ZP_00513911.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
endonuclease [Crocosphaera watsonii WH 8501]
gi|67857875|gb|EAM53114.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
endonuclease [Crocosphaera watsonii WH 8501]
Length = 584
Score = 38.9 bits (89), Expect = 0.92, Method: Composition-based stats.
Identities = 29/87 (33%), Positives = 35/87 (40%), Gaps = 14/87 (16%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH-------DYD 116
P S PY W K P R K + +K G CP C H + D
Sbjct: 493 PESSPYDGNWTYWSKRRGTYTGTPARVAK------LLKKQKGICPQCKQHFTPEDLIEVD 546
Query: 117 HILPYSKGGKSTLENCQVLQATVNRSK 143
HI+P SKGGK T N Q L + +K
Sbjct: 547 HIIPKSKGGKDTYNNLQALHRHCHDAK 573
>gi|381180675|ref|ZP_09889513.1| hypothetical protein TresaDRAFT_1271 [Treponema saccharophilum DSM
2985]
gi|380767378|gb|EIC01379.1| hypothetical protein TresaDRAFT_1271 [Treponema saccharophilum DSM
2985]
Length = 274
Score = 38.9 bits (89), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 17/58 (29%), Positives = 32/58 (55%), Gaps = 4/58 (6%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL----SRSDLIQKSSYCRVSGRDM 168
++HI+P KGG ++ NCQ++ NR K N + +R+++ +K Y ++ M
Sbjct: 31 FEHIIPKDKGGTNSFSNCQIVSRKFNRQKYNMNTIFSFTNRNEIYKKCLYASIAHAVM 88
>gi|399071595|ref|ZP_10750043.1| restriction endonuclease [Caulobacter sp. AP07]
gi|398043268|gb|EJL36188.1| restriction endonuclease [Caulobacter sp. AP07]
Length = 186
Score = 38.9 bits (89), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+ R C C D +DH++P S+GG++T EN A N SKG RT
Sbjct: 86 NLFLRDGFSCQYCGAPDDLTFDHVIPRSRGGRTTWENIVTACAPCNLSKGGRT 138
>gi|257456239|ref|ZP_05621436.1| HNH endonuclease domain protein [Treponema vincentii ATCC 35580]
gi|257446325|gb|EEV21371.1| HNH endonuclease domain protein [Treponema vincentii ATCC 35580]
Length = 172
Score = 38.9 bits (89), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 22/31 (70%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DH+ +SKGG + NCQ+L T NR+KGNR
Sbjct: 142 DHVSAWSKGGATNSSNCQMLCKTHNRAKGNR 172
>gi|409098020|ref|ZP_11218044.1| hypothetical protein PagrP_06221 [Pedobacter agri PB92]
Length = 296
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
++DHI+P+SKGG +T N Q+L NR+K R
Sbjct: 262 EFDHIIPFSKGGSNTYRNIQLLCEGCNRTKSARI 295
>gi|88803149|ref|ZP_01118675.1| hypothetical protein PI23P_11192 [Polaribacter irgensii 23-P]
gi|88780715|gb|EAR11894.1| hypothetical protein PI23P_11192 [Polaribacter irgensii 23-P]
Length = 387
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 24/33 (72%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG + +NC++L T NR+KGN+
Sbjct: 355 DADHVTAWSKGGATLKDNCEMLCKTHNRAKGNK 387
>gi|367473211|ref|ZP_09472776.1| conserved hypothetical protein; putative HNH endonuclease
[Bradyrhizobium sp. ORS 285]
gi|365274500|emb|CCD85244.1| conserved hypothetical protein; putative HNH endonuclease
[Bradyrhizobium sp. ORS 285]
Length = 185
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C D +DH++P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFACQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLKKGNLT 137
>gi|365878540|ref|ZP_09418012.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. ORS 375]
gi|365293573|emb|CCD90543.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. ORS 375]
Length = 185
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C D +DH++P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFACQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLKKGNLT 137
>gi|210623659|ref|ZP_03293968.1| hypothetical protein CLOHIR_01918 [Clostridium hiranonis DSM 13275]
gi|210153424|gb|EEA84430.1| hypothetical protein CLOHIR_01918 [Clostridium hiranonis DSM 13275]
Length = 912
Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats.
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
DHI+P SKGGK+ +EN Q+L N KG+ E
Sbjct: 661 DHIVPLSKGGKTVVENLQLLTRWENAQKGDSLE 693
>gi|383825597|ref|ZP_09980744.1| pCQ3_20 [Mycobacterium xenopi RIVM700367]
gi|383334516|gb|EID12955.1| pCQ3_20 [Mycobacterium xenopi RIVM700367]
Length = 130
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 23/35 (65%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
DHILP+S GG T +N +VL N ++GNR E+
Sbjct: 92 LDHILPWSLGGPDTEDNLRVLCRPCNSARGNRIEV 126
>gi|456352674|dbj|BAM87119.1| conserved hypothetical protein [Agromonas oligotrophica S58]
Length = 174
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C D +DH++P SKGG++T EN + N KGN T
Sbjct: 74 NVFLRDRFACQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLRKGNLT 126
>gi|41408779|ref|NP_961615.1| hypothetical protein MAP2681c [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|254774117|ref|ZP_05215633.1| hypothetical protein MaviaA2_05510 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|417749729|ref|ZP_12398118.1| restriction endonuclease [Mycobacterium avium subsp.
paratuberculosis S397]
gi|41397137|gb|AAS04998.1| hypothetical protein MAP_2681c [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|336458728|gb|EGO37688.1| restriction endonuclease [Mycobacterium avium subsp.
paratuberculosis S397]
Length = 314
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 34/78 (43%), Gaps = 20/78 (25%)
Query: 90 RWRKDALG-----------NVVFRKLVGCPGC---------LCHDYDHILPYSKGGKSTL 129
RWRK A G ++ R C C L + DHI+P SKGG ST
Sbjct: 234 RWRKSAAGQRALMTSRLRNSIKVRDHYTCRYCSVSLAAEPHLLLEVDHIVPISKGGMSTP 293
Query: 130 ENCQVLQATVNRSKGNRT 147
+N Q L NR+K N+
Sbjct: 294 DNLQTLCWRCNRTKSNKV 311
>gi|422012873|ref|ZP_16359511.1| HNH endonuclease domain protein [Actinomyces georgiae F0490]
gi|394752101|gb|EJF35815.1| HNH endonuclease domain protein [Actinomyces georgiae F0490]
Length = 94
Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 8/45 (17%)
Query: 111 LCHDY--------DHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
LCH DH++P SKGG STL+N ++ N +KGNR
Sbjct: 32 LCHQLVRRRDASVDHVVPSSKGGPSTLDNLKLAHRRCNFAKGNRV 76
>gi|338529937|ref|YP_004663271.1| HNH endonuclease domain-containing protein [Myxococcus fulvus HW-1]
gi|337256033|gb|AEI62193.1| HNH endonuclease domain-containing protein [Myxococcus fulvus HW-1]
Length = 180
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
+DH+LP ++GGK++ EN + N+ KGNRT
Sbjct: 96 HDHVLPRAQGGKTSWENIVIACVPCNQKKGNRT 128
>gi|383643665|ref|ZP_09956071.1| HNH endonuclease [Sphingomonas elodea ATCC 31461]
Length = 188
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCH---DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+ R C C H +DH++P ++GG++T EN A N KG RT
Sbjct: 88 NLFLRDKFACQYCGSHRDLTFDHVIPRAQGGRTTWENVATACAPCNLKKGGRT 140
>gi|147669720|ref|YP_001214538.1| HNH endonuclease [Dehalococcoides sp. BAV1]
gi|146270668|gb|ABQ17660.1| HNH endonuclease [Dehalococcoides sp. BAV1]
Length = 228
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 24/35 (68%), Gaps = 1/35 (2%)
Query: 115 YDHILPYSKGGKSTL-ENCQVLQATVNRSKGNRTE 148
+DHI+PYSKGG S + EN Q+L A+ N SK E
Sbjct: 194 FDHIIPYSKGGSSLVAENIQLLCASCNLSKHANIE 228
>gi|307107571|gb|EFN55813.1| hypothetical protein CHLNCDRAFT_145317 [Chlorella variabilis]
Length = 276
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
DHI+P+SKGG S N Q++ N SKGN+++
Sbjct: 230 DHIIPFSKGGPSEEGNAQLVHRFCNSSKGNKSD 262
>gi|226532958|ref|NP_001141933.1| uncharacterized protein LOC100274082 [Zea mays]
gi|194706496|gb|ACF87332.1| unknown [Zea mays]
gi|414883692|tpg|DAA59706.1| TPA: hypothetical protein ZEAMMB73_991900 [Zea mays]
Length = 287
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+++R C C C D DH++P S+GGK EN A N KG +T
Sbjct: 183 NILYRDDFTCQYCSCGDDLTIDHVIPISRGGKWEWENLVTACARCNSRKGQKT 235
>gi|295690224|ref|YP_003593917.1| HNH endonuclease [Caulobacter segnis ATCC 21756]
gi|295432127|gb|ADG11299.1| HNH endonuclease [Caulobacter segnis ATCC 21756]
Length = 186
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+ R C C + +DH++P S+GG++T EN A N SKG RT
Sbjct: 86 NLFLRDAFSCQYCGSTEDLTFDHVIPRSRGGRTTWENIVTACAPCNLSKGGRT 138
>gi|357386016|ref|YP_004900740.1| HNH endonuclease family protein [Pelagibacterium halotolerans B2]
gi|351594653|gb|AEQ52990.1| HNH endonuclease family protein [Pelagibacterium halotolerans B2]
Length = 185
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLC-HD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C HD +DH+LP SKGG++T EN + N K NRT
Sbjct: 85 NVFLRDRFQCQYCGSRHDLTFDHVLPRSKGGRTTWENIVAACSPCNLRKANRT 137
>gi|57233948|ref|YP_181981.1| HNH endonuclease domain-containing protein [Dehalococcoides
ethenogenes 195]
gi|57224396|gb|AAW39453.1| HNH endonuclease domain protein [Dehalococcoides ethenogenes 195]
Length = 232
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 54/130 (41%), Gaps = 36/130 (27%)
Query: 20 PSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEK 79
PS +R+ + + + + NE+AT TL D+R + P +K+ W +
Sbjct: 138 PSDNRKVFKFKLELTSEITNEIAT----PTLDADIR-----------AIPSDIKRIVWAR 182
Query: 80 AEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL-ENCQVLQAT 138
+ K V C +DH++PYSKGG S + EN Q+L A+
Sbjct: 183 DQG--------------------KCVKCGNNSNLHFDHVIPYSKGGSSLIAENIQLLCAS 222
Query: 139 VNRSKGNRTE 148
N SK E
Sbjct: 223 CNLSKHASIE 232
>gi|392942375|ref|ZP_10308017.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
sp. QA3]
gi|392285669|gb|EIV91693.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
sp. QA3]
Length = 607
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 20/92 (21%)
Query: 55 RISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
R+ A + R+ P +VK + W++ V C +
Sbjct: 535 RLVIAPGARDSRAIPQAVKAEVWQRDGG--------------------ACVQCQATEYLE 574
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
+DH++P+S+GG +++ N Q+L N+ KG R
Sbjct: 575 FDHVIPHSRGGATSVNNLQLLCRRCNQEKGAR 606
>gi|54023566|ref|YP_117808.1| hypothetical protein nfa15980 [Nocardia farcinica IFM 10152]
gi|54015074|dbj|BAD56444.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length = 760
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
++DHI+P S+GG ++ N Q+L NR KG R
Sbjct: 727 EFDHIIPLSRGGATSAANLQILCRACNRRKGAR 759
>gi|430376075|ref|ZP_19430478.1| hypothetical protein MOMA_02985 [Moraxella macacae 0408225]
gi|429541306|gb|ELA09334.1| hypothetical protein MOMA_02985 [Moraxella macacae 0408225]
Length = 387
Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 23/31 (74%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DH+ +SKGG + ++NC++L NR+KGNR
Sbjct: 357 DHVTAWSKGGLTGIDNCEMLCKVHNRAKGNR 387
>gi|304321776|ref|YP_003855419.1| hypothetical protein PB2503_11149 [Parvularcula bermudensis
HTCC2503]
gi|303300678|gb|ADM10277.1| hypothetical protein PB2503_11149 [Parvularcula bermudensis
HTCC2503]
Length = 188
Score = 37.7 bits (86), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 7/62 (11%)
Query: 92 RKDALG--NVVFRKLVGCPGCLCHD-----YDHILPYSKGGKSTLENCQVLQATVNRSKG 144
RK A NV R + C C D +DH++P S+GGK+T +N + N KG
Sbjct: 78 RKPAFTRFNVFLRDVFTCQYCGTQDRDNLTFDHLIPRSRGGKTTWQNIVTACSPCNLKKG 137
Query: 145 NR 146
N+
Sbjct: 138 NK 139
>gi|91974822|ref|YP_567481.1| HNH endonuclease [Rhodopseudomonas palustris BisB5]
gi|91681278|gb|ABE37580.1| HNH endonuclease [Rhodopseudomonas palustris BisB5]
Length = 185
Score = 37.7 bits (86), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
NV R C C + +DHI+P SKGG++T EN + N KGN T +
Sbjct: 85 NVFLRDRFACQYCGAPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPI 139
>gi|449446780|ref|XP_004141149.1| PREDICTED: uncharacterized protein LOC101207660 [Cucumis sativus]
Length = 277
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+++R C C H+ DH+LP S+GG+ T EN N KG +T
Sbjct: 174 NILYRDNYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKT 226
>gi|406930511|gb|EKD65859.1| hypothetical protein ACD_49C00079G0023 [uncultured bacterium (gcode
4)]
Length = 360
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 6/44 (13%)
Query: 107 CPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKG 144
CP C H + DHI P+ + GK++ ENCQ+L NR K
Sbjct: 315 CPKCNDHFEINEMEADHITPWHEWGKTSAENCQMLCKNCNRVKS 358
>gi|218440993|ref|YP_002379322.1| HNH endonuclease [Cyanothece sp. PCC 7424]
gi|218173721|gb|ACK72454.1| HNH endonuclease [Cyanothece sp. PCC 7424]
Length = 169
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSS 159
DHILP SKGGK T +N + + N KG+RT S + +I K+S
Sbjct: 106 DHILPRSKGGKHTWDNVVIACESCNSRKGDRTP-SEAGMILKTS 148
>gi|29826461|ref|NP_828767.1| putative NHN endonuclease [Streptomyces avermitilis MA-4680]
gi|29611259|dbj|BAC75302.1| putative HNH endonuclease [Streptomyces avermitilis MA-4680]
Length = 677
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 6/65 (9%)
Query: 86 RDPDRWRKDALGNVVFRKLVG----CPGCLCHDY--DHILPYSKGGKSTLENCQVLQATV 139
RDP R+ N+++ G C L + DH++P+ +GG++ +EN Q L A
Sbjct: 4 RDPQRFFTQNQRNILYDLAEGRCETCSAPLLDGWEADHMVPWVQGGRTVIENGQALCAQC 63
Query: 140 NRSKG 144
N+ KG
Sbjct: 64 NKGKG 68
>gi|162148898|ref|YP_001603359.1| endonuclease [Gluconacetobacter diazotrophicus PAl 5]
gi|209545355|ref|YP_002277584.1| HNH endonuclease [Gluconacetobacter diazotrophicus PAl 5]
gi|161787475|emb|CAP57071.1| putative endonuclease protein [Gluconacetobacter diazotrophicus PAl
5]
gi|209533032|gb|ACI52969.1| HNH endonuclease [Gluconacetobacter diazotrophicus PAl 5]
Length = 186
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 6/55 (10%)
Query: 98 NVVFRKLVGCPGCL----CHD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C CL HD +DH++P KGG++T EN + N KG+R
Sbjct: 83 NVFLRDNFSCQYCLDRLPTHDLTFDHVIPRCKGGRTTWENVVTACSPCNLMKGSR 137
>gi|386773495|ref|ZP_10095873.1| HNH endonuclease [Brachybacterium paraconglomeratum LC44]
Length = 146
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 41/94 (43%), Gaps = 15/94 (15%)
Query: 69 PYSVKQ---QCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH-----------D 114
PY + Q + W + RGRDP++ R V + ++ G C
Sbjct: 43 PYPLAQIESRAWWEWHWARGRDPEKVRTKIPARV-RQAVIARDGHTCQLCGGEVEPDDVH 101
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
DHI+P+S GG T++N +V + N +G E
Sbjct: 102 LDHIVPWSHGGPDTVKNLRVTHSLCNMKRGAPVE 135
>gi|296313798|ref|ZP_06863739.1| putative HNH endonuclease domain protein [Neisseria polysaccharea
ATCC 43768]
gi|296839622|gb|EFH23560.1| putative HNH endonuclease domain protein [Neisseria polysaccharea
ATCC 43768]
Length = 376
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 29/89 (32%)
Query: 64 NPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH------DYDH 117
N R+F S+KQ+ +EK + + CP C + DH
Sbjct: 311 NIRAFSDSIKQKVYEKQDGI-----------------------CPHCKQKFELSEMEGDH 347
Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNR 146
I P+ +GGK+ ENCQ+L NR K ++
Sbjct: 348 ITPWCEGGKTIEENCQMLCRDCNRRKSDK 376
>gi|383453402|ref|YP_005367391.1| HNH endonuclease domain-containing protein [Corallococcus
coralloides DSM 2259]
gi|380728026|gb|AFE04028.1| HNH endonuclease domain-containing protein [Corallococcus
coralloides DSM 2259]
Length = 180
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 14/33 (42%), Positives = 23/33 (69%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
YDH+LP ++GG+++ +N + N+ KGNRT
Sbjct: 96 YDHVLPRAQGGRTSWDNIVIACVPCNQKKGNRT 128
>gi|86160544|ref|YP_467329.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-C]
gi|85777055|gb|ABC83892.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-C]
Length = 334
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 13/111 (11%)
Query: 37 LDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDAL 96
+D LA E+ T+ +S + +PR P V+++ WE R R +WR A
Sbjct: 219 VDLLLAKAEKRKGALTERPRASVRPSRDPRHIPAHVRREVWE-----RDRGCCQWRL-AN 272
Query: 97 GNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
G + C DH++P ++GG+ T N +VL A N R
Sbjct: 273 GEI-------CGSTHALQLDHVVPRARGGEPTAGNLRVLCAAHNLEAARRV 316
>gi|16126439|ref|NP_421003.1| HNH endonuclease [Caulobacter crescentus CB15]
gi|221235220|ref|YP_002517656.1| HNH endonuclease family protein [Caulobacter crescentus NA1000]
gi|13423703|gb|AAK24171.1| HNH endonuclease family protein [Caulobacter crescentus CB15]
gi|220964392|gb|ACL95748.1| HNH endonuclease family protein [Caulobacter crescentus NA1000]
Length = 187
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+ R C C + +DH++P S+GG++T EN A N SKG RT
Sbjct: 87 NLFLRDSFSCQYCGSPEDLTFDHVIPRSRGGRTTWENIVTACAPCNLSKGGRT 139
>gi|414165213|ref|ZP_11421460.1| hypothetical protein HMPREF9697_03361 [Afipia felis ATCC 53690]
gi|410882993|gb|EKS30833.1| hypothetical protein HMPREF9697_03361 [Afipia felis ATCC 53690]
Length = 185
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C CL + +DHI+P S+GG++T EN + N KG+ T
Sbjct: 85 NVFLRDRFSCQYCLSQEDLTFDHIIPRSRGGQTTWENVVAACSPCNLRKGSMT 137
>gi|383806641|ref|ZP_09962203.1| restriction endonuclease [Candidatus Aquiluna sp. IMCC13023]
gi|383299811|gb|EIC92424.1| restriction endonuclease [Candidatus Aquiluna sp. IMCC13023]
Length = 178
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 21/35 (60%)
Query: 113 HDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
+ DHI+P S+GGK T EN N SKGNRT
Sbjct: 101 NTIDHIMPKSRGGKDTWENLVACCLECNNSKGNRT 135
>gi|316932026|ref|YP_004107008.1| HNH endonuclease [Rhodopseudomonas palustris DX-1]
gi|315599740|gb|ADU42275.1| HNH endonuclease [Rhodopseudomonas palustris DX-1]
Length = 185
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C D +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFVCQYCGAPDDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLT 137
>gi|281420445|ref|ZP_06251444.1| putative HNH endonuclease domain protein [Prevotella copri DSM
18205]
gi|281405502|gb|EFB36182.1| putative HNH endonuclease domain protein [Prevotella copri DSM
18205]
Length = 388
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 14/54 (25%)
Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C C D DH+ + GG + NCQ+L T NRSKGNR
Sbjct: 335 CPLCACGDGPNKDKIWALKEMEADHVKAWVNGGATDESNCQMLCKTHNRSKGNR 388
>gi|196231796|ref|ZP_03130653.1| HNH endonuclease [Chthoniobacter flavus Ellin428]
gi|196224268|gb|EDY18781.1| HNH endonuclease [Chthoniobacter flavus Ellin428]
Length = 194
Score = 37.7 bits (86), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
+ DH++P S+GGK+T ENC + VN K +R
Sbjct: 121 NIDHVVPRSRGGKTTWENCVLAHREVNTRKADR 153
>gi|39933633|ref|NP_945909.1| HNH endonuclease [Rhodopseudomonas palustris CGA009]
gi|192288988|ref|YP_001989593.1| HNH endonuclease [Rhodopseudomonas palustris TIE-1]
gi|39647479|emb|CAE26000.1| HNH endonuclease:HNH nuclease [Rhodopseudomonas palustris CGA009]
gi|192282737|gb|ACE99117.1| HNH endonuclease [Rhodopseudomonas palustris TIE-1]
Length = 185
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C D +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFVCQYCGAPDDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLT 137
>gi|295105782|emb|CBL03325.1| Restriction endonuclease [Gordonibacter pamelaeae 7-10-1-b]
Length = 113
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI+P S+GGKS + N Q L N KGNR
Sbjct: 80 DHIVPVSRGGKSVMGNLQTLCEVCNCGKGNR 110
>gi|453363910|dbj|GAC80436.1| hypothetical protein GM1_017_00950 [Gordonia malaquae NBRC 108250]
Length = 141
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 25/46 (54%)
Query: 102 RKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
R L G G + +DH++P S+GG +T NC N KG+RT
Sbjct: 78 RHLCGYCGLHANTWDHVMPQSRGGGNTWFNCIAACGPCNWDKGDRT 123
>gi|260887966|ref|ZP_05899229.1| putative HNH endonuclease domain protein [Selenomonas sputigena
ATCC 35185]
gi|330838550|ref|YP_004413130.1| hypothetical protein Selsp_0703 [Selenomonas sputigena ATCC 35185]
gi|260862217|gb|EEX76717.1| putative HNH endonuclease domain protein [Selenomonas sputigena
ATCC 35185]
gi|329746314|gb|AEB99670.1| protein of unknown function DUF262 [Selenomonas sputigena ATCC
35185]
Length = 374
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%), Gaps = 6/46 (13%)
Query: 107 CPGCLCH-DYD-----HILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C H D+D HI+P+SKGG + +NCQ+L + N K ++
Sbjct: 328 CPVCKEHFDFDEMEGDHIMPWSKGGHTQPDNCQMLCKSCNGKKTDK 373
>gi|291542181|emb|CBL15291.1| Restriction endonuclease [Ruminococcus bromii L2-63]
Length = 374
Score = 37.7 bits (86), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 23/31 (74%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI P+SKGG++ L+NCQ+L + N K ++
Sbjct: 343 DHIKPWSKGGRTILDNCQMLCKSCNAKKTDK 373
>gi|146343281|ref|YP_001208329.1| HNH endonuclease:HNH nuclease [Bradyrhizobium sp. ORS 278]
gi|148252701|ref|YP_001237286.1| HNH endonuclease [Bradyrhizobium sp. BTAi1]
gi|146196087|emb|CAL80114.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. ORS 278]
gi|146404874|gb|ABQ33380.1| putative HNH endonuclease [Bradyrhizobium sp. BTAi1]
Length = 185
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 31/66 (46%), Gaps = 3/66 (4%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
NV R C C D +DH++P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFVCQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLKKGNLTPQQAKMF 144
Query: 155 IQKSSY 160
+++ Y
Sbjct: 145 PKQAPY 150
>gi|407463091|ref|YP_006774408.1| HNH endonuclease [Candidatus Nitrosopumilus koreensis AR1]
gi|407046713|gb|AFS81466.1| HNH endonuclease [Candidatus Nitrosopumilus koreensis AR1]
Length = 228
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 35/81 (43%), Gaps = 21/81 (25%)
Query: 64 NPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSK 123
N R P +KQ+ W++ E K V C +DH++P+SK
Sbjct: 163 NRRIIPTKIKQEVWKRDEG--------------------KCVKCGSKENLHFDHVIPHSK 202
Query: 124 GGKS-TLENCQVLQATVNRSK 143
GG S T EN Q+L A N K
Sbjct: 203 GGTSITAENIQILCAKHNLEK 223
>gi|223935746|ref|ZP_03627662.1| HNH endonuclease [bacterium Ellin514]
gi|223895754|gb|EEF62199.1| HNH endonuclease [bacterium Ellin514]
Length = 194
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHILP S+GGK +NC +N KGNR
Sbjct: 123 IDHILPVSRGGKDAWDNCVWACKAINSRKGNR 154
>gi|365173256|ref|ZP_09361377.1| hypothetical protein HMPREF1006_02508 [Synergistes sp. 3_1_syn1]
gi|363616949|gb|EHL68368.1| hypothetical protein HMPREF1006_02508 [Synergistes sp. 3_1_syn1]
Length = 374
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 6/46 (13%)
Query: 107 CPGCLCH-DYD-----HILPYSKGGKSTLENCQVLQATVNRSKGNR 146
CP C H +YD HI P+SKGG++T +NCQ+L + N K ++
Sbjct: 328 CPVCGEHFEYDEMEGDHIKPWSKGGQTTPDNCQMLCKSCNAKKTDK 373
>gi|27376478|ref|NP_768007.1| hypothetical protein bll1367 [Bradyrhizobium japonicum USDA 110]
gi|27349618|dbj|BAC46632.1| bll1367 [Bradyrhizobium japonicum USDA 110]
Length = 268
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
NV R C C + +DHI+P SKGG++T EN + N KGN T
Sbjct: 168 NVFLRDRFACQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPAQAKMF 227
Query: 155 IQKSSY 160
++S++
Sbjct: 228 PRQSAF 233
>gi|418175095|ref|ZP_12811693.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41437]
gi|353843655|gb|EHE23699.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41437]
Length = 178
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 23/36 (63%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
L + DHI+P SKGG ST +N Q L NRSK N+
Sbjct: 141 LLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNK 176
>gi|422850756|ref|ZP_16897426.1| HNH endonuclease domain protein [Streptococcus sanguinis SK150]
gi|422857591|ref|ZP_16904241.1| HNH endonuclease domain protein [Streptococcus sanguinis SK1057]
gi|325695504|gb|EGD37404.1| HNH endonuclease domain protein [Streptococcus sanguinis SK150]
gi|327462774|gb|EGF09096.1| HNH endonuclease domain protein [Streptococcus sanguinis SK1057]
Length = 329
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
L + DHI+P SKGG ST +N Q L NRSK N+
Sbjct: 292 LLLEIDHIIPISKGGLSTRDNLQTLCWKCNRSKSNKI 328
>gi|257790601|ref|YP_003181207.1| HNH endonuclease [Eggerthella lenta DSM 2243]
gi|257474498|gb|ACV54818.1| HNH endonuclease [Eggerthella lenta DSM 2243]
Length = 127
Score = 37.4 bits (85), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI+P S+GGKS + N Q L N KGNR
Sbjct: 94 DHIVPVSRGGKSVMSNLQTLCEDCNCGKGNR 124
>gi|357111451|ref|XP_003557526.1| PREDICTED: uncharacterized protein LOC100822155 [Brachypodium
distachyon]
Length = 287
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 46/93 (49%), Gaps = 10/93 (10%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
N+++R C C D DH++P S+GGK EN + N KG++T L +
Sbjct: 183 NILYRDGFSCQYCSSEDDLTIDHVIPTSRGGKWEWENLVAACSRCNSRKGHKTLLQANMK 242
Query: 155 IQKSSYCRVSGRDMDLLEL----SAYGNVRRGQ 183
++K + ++ D++ + SA+ +RR Q
Sbjct: 243 LRKVPK---APKEFDIIAVPLTKSAFRTIRRKQ 272
>gi|392529601|ref|ZP_10276738.1| HNH endonuclease [Carnobacterium maltaromaticum ATCC 35586]
Length = 113
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 108 PGCLCHDYDHILPYSKGGK-STLENCQVLQATVNRSKGNRTELSRSDLI 155
P L DHI+P +KGG S +EN Q+ AT NR K ++ ++++ I
Sbjct: 42 PNPLAPTVDHIIPVNKGGHPSAMENLQLAHATCNRQKSDKLFNAKTNKI 90
>gi|295106530|emb|CBL04073.1| Restriction endonuclease [Gordonibacter pamelaeae 7-10-1-b]
Length = 113
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI+P S+GGKS + N Q L N KGNR
Sbjct: 80 DHIVPVSRGGKSVMSNLQTLCEDCNCGKGNR 110
>gi|197121751|ref|YP_002133702.1| HNH nuclease [Anaeromyxobacter sp. K]
gi|196171600|gb|ACG72573.1| HNH nuclease [Anaeromyxobacter sp. K]
Length = 382
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 54/129 (41%), Gaps = 37/129 (28%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH--DYDHILPYS 122
P P V+++ WE RD G F G P H + DH++P +
Sbjct: 280 PDHIPAHVRREVWE-------RD--------CGRCTFVLPSGEPCGSTHRLELDHVVPVA 324
Query: 123 KGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRG 182
+GG ST++N ++ R +G+ E +R L D L + AY RRG
Sbjct: 325 RGGASTVDNLRI------RCRGHNLEEARRAL-------------GDAL-VDAYAPRRRG 364
Query: 183 QDSGGCRIQ 191
+ GG R++
Sbjct: 365 GEHGGARVE 373
>gi|85709598|ref|ZP_01040663.1| HNH endonuclease family protein [Erythrobacter sp. NAP1]
gi|85688308|gb|EAQ28312.1| HNH endonuclease family protein [Erythrobacter sp. NAP1]
Length = 202
Score = 37.4 bits (85), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C D +DH++P GGK+T EN A N KG RT
Sbjct: 102 NVFLRDKFSCQYCGSPDNLTFDHVMPRRLGGKTTWENIITACAPCNMKKGGRT 154
>gi|444915323|ref|ZP_21235457.1| HNH endonuclease family protein [Cystobacter fuscus DSM 2262]
gi|444713552|gb|ELW54449.1| HNH endonuclease family protein [Cystobacter fuscus DSM 2262]
Length = 179
Score = 37.4 bits (85), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 14/33 (42%), Positives = 23/33 (69%)
Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
YDH++P ++GG++T EN + N++KG RT
Sbjct: 95 YDHVVPRAQGGRTTWENIVIACVPCNQAKGGRT 127
>gi|414171531|ref|ZP_11426442.1| hypothetical protein HMPREF9695_00088 [Afipia broomeae ATCC 49717]
gi|410893206|gb|EKS40996.1| hypothetical protein HMPREF9695_00088 [Afipia broomeae ATCC 49717]
Length = 185
Score = 37.4 bits (85), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C + +DHI+P S+GG++T EN + N KGN T
Sbjct: 85 NVFLRDRFSCQYCTSQEDLTFDHIIPRSRGGQTTWENVVAACSPCNLRKGNMT 137
>gi|421597177|ref|ZP_16040843.1| hypothetical protein BCCGELA001_07611 [Bradyrhizobium sp.
CCGE-LA001]
gi|404270711|gb|EJZ34725.1| hypothetical protein BCCGELA001_07611 [Bradyrhizobium sp.
CCGE-LA001]
Length = 185
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
NV R C C + +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFACQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPAQAKMF 144
Query: 155 IQKSSY 160
++S++
Sbjct: 145 PRQSAF 150
>gi|402771107|ref|YP_006590644.1| HNH endonuclease [Methylocystis sp. SC2]
gi|401773127|emb|CCJ05993.1| HNH endonuclease [Methylocystis sp. SC2]
Length = 172
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C H+ +DH++P SKGG +T EN + N KG+R
Sbjct: 72 NVFLRDRFTCQYCGEHEELTFDHVIPRSKGGATTWENVVAACSPCNLRKGDR 123
>gi|347535197|ref|YP_004842622.1| hypothetical protein FBFL15_0247 [Flavobacterium branchiophilum
FL-15]
gi|345528355|emb|CCB68385.1| Protein of unknown function [Flavobacterium branchiophilum FL-15]
Length = 366
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 21/31 (67%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI P+ +GGK+T ENCQ+L NR K +
Sbjct: 336 DHITPWHEGGKTTAENCQMLCKLDNRIKSGK 366
>gi|365891320|ref|ZP_09429753.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. STM
3809]
gi|365332734|emb|CCE02284.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. STM
3809]
Length = 185
Score = 37.4 bits (85), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 31/66 (46%), Gaps = 3/66 (4%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
NV R C C D +DH++P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFVCQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLRKGNLTPQQAKMF 144
Query: 155 IQKSSY 160
+++ Y
Sbjct: 145 PKQAPY 150
>gi|332187299|ref|ZP_08389038.1| HNH endonuclease family protein [Sphingomonas sp. S17]
gi|332012720|gb|EGI54786.1| HNH endonuclease family protein [Sphingomonas sp. S17]
Length = 188
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCH---DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+ R C C H +DH++P ++GG++T EN A N KG RT
Sbjct: 88 NLFLRDRFSCQYCGSHRDLTFDHVIPRAQGGRTTWENVVTACAPCNLKKGGRT 140
>gi|296532418|ref|ZP_06895145.1| arsenate reductase [Roseomonas cervicalis ATCC 49957]
gi|296267261|gb|EFH13159.1| arsenate reductase [Roseomonas cervicalis ATCC 49957]
Length = 42
Score = 37.0 bits (84), Expect = 3.5, Method: Composition-based stats.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
DH++P+SKGG +T EN Q+L N K N E
Sbjct: 4 DHVIPFSKGGHTTWENYQLLCKPCNVKKSNSIE 36
>gi|425736696|ref|ZP_18854994.1| hypothetical protein C272_16183 [Brevibacterium casei S18]
gi|425477862|gb|EKU45079.1| hypothetical protein C272_16183 [Brevibacterium casei S18]
Length = 331
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 22/36 (61%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
L + DHI P SKGG S +EN Q L NRSK N+
Sbjct: 295 LLLEVDHIKPISKGGLSEIENLQTLCWRCNRSKSNK 330
>gi|395804648|ref|ZP_10483884.1| HNH endonuclease [Flavobacterium sp. F52]
gi|395433267|gb|EJF99224.1| HNH endonuclease [Flavobacterium sp. F52]
Length = 186
Score = 37.0 bits (84), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 22/34 (64%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
DHI P SKGGK+ + N ++ T N S+GNR +
Sbjct: 124 DHIQPASKGGKNNMRNGRIFCGTCNTSRGNRANV 157
>gi|390937545|ref|YP_006395105.1| putative endonuclease [Bifidobacterium bifidum BGN4]
gi|389891159|gb|AFL05226.1| putative endonuclease [Bifidobacterium bifidum BGN4]
Length = 240
Score = 37.0 bits (84), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 1/54 (1%)
Query: 94 DALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
+ GN + +L GC G D DHI+PYS GG T+ N + N S+ +R
Sbjct: 17 ETWGNACWLRLPGCTGVGEED-DHIVPYSHGGMDTVANIRRACKHCNASRQDRV 69
>gi|90422135|ref|YP_530505.1| HNH endonuclease [Rhodopseudomonas palustris BisB18]
gi|90104149|gb|ABD86186.1| HNH endonuclease [Rhodopseudomonas palustris BisB18]
Length = 185
Score = 37.0 bits (84), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C + +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFSCQYCGAPEDLTFDHIMPRSKGGQTTWENVVAACSPCNLRKGNMT 137
>gi|67924112|ref|ZP_00517558.1| HNH endonuclease [Crocosphaera watsonii WH 8501]
gi|67854037|gb|EAM49350.1| HNH endonuclease [Crocosphaera watsonii WH 8501]
Length = 276
Score = 37.0 bits (84), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 13/103 (12%)
Query: 41 LATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVV 100
T E + T+ I + P S PY W K P R + ++
Sbjct: 162 FQTREYVLYHHTETEIKR-HTLVKPESSPYDGNWTYWSKRRGTYTGTPAR-----VAKLL 215
Query: 101 FRKLVGCPGCLCH-------DYDHILPYSKGGKSTLENCQVLQ 136
++ CP C H + DHI+P SKGGK T N Q L
Sbjct: 216 KKQKGICPQCKQHFTPEDLIEVDHIIPKSKGGKDTYNNLQALH 258
>gi|449489536|ref|XP_004158341.1| PREDICTED: uncharacterized protein LOC101226520 [Cucumis sativus]
Length = 250
Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+++R C C H+ DH+LP S+GG+ T EN N KG +T
Sbjct: 174 NILYRDNYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKT 226
>gi|358455199|ref|ZP_09165427.1| Protein of unknown function DUF2510 [Frankia sp. CN3]
gi|357081452|gb|EHI90883.1| Protein of unknown function DUF2510 [Frankia sp. CN3]
Length = 545
Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 42/114 (36%), Gaps = 26/114 (22%)
Query: 36 LLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDA 95
L D E A TL R+ A + RS P VK W+
Sbjct: 454 LSDPEYAAAVLAGTLRVAKRLVLAPGQRDTRSIPPDVKAAVWQ----------------- 496
Query: 96 LGNVVFRKLVGCPGCLCHDY---DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
R C C +Y DH++P+S GG +++ N Q+L N KG R
Sbjct: 497 ------RDGGACAQCHATEYLEFDHVIPHSLGGATSVNNLQLLCRRCNLEKGAR 544
>gi|168484153|ref|ZP_02709105.1| HNH endonuclease domain protein [Streptococcus pneumoniae
CDC1873-00]
gi|168493974|ref|ZP_02718117.1| HNH endonuclease domain protein [Streptococcus pneumoniae
CDC3059-06]
gi|169834462|ref|YP_001693806.1| HNH endonuclease domain-containing protein [Streptococcus
pneumoniae Hungary19A-6]
gi|417685762|ref|ZP_12335042.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41301]
gi|417695447|ref|ZP_12344627.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47368]
gi|418079981|ref|ZP_12717197.1| HNH endonuclease family protein [Streptococcus pneumoniae 6735-05]
gi|418088689|ref|ZP_12725850.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43265]
gi|418090912|ref|ZP_12728057.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44452]
gi|418097696|ref|ZP_12734798.1| HNH endonuclease family protein [Streptococcus pneumoniae 6901-05]
gi|418106621|ref|ZP_12743667.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41410]
gi|418109254|ref|ZP_12746283.1| HNH endonuclease family protein [Streptococcus pneumoniae GA49447]
gi|418113876|ref|ZP_12750869.1| HNH endonuclease family protein [Streptococcus pneumoniae 5787-06]
gi|418134007|ref|ZP_12770867.1| HNH endonuclease family protein [Streptococcus pneumoniae GA11426]
gi|418158923|ref|ZP_12795629.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17227]
gi|418161331|ref|ZP_12798022.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17328]
gi|418168382|ref|ZP_12805028.1| HNH endonuclease family protein [Streptococcus pneumoniae GA19077]
gi|418218031|ref|ZP_12844699.1| HNH endonuclease family protein [Streptococcus pneumoniae NP127]
gi|418220228|ref|ZP_12846885.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47751]
gi|418237869|ref|ZP_12864426.1| HNH endonuclease family protein [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|419422272|ref|ZP_13962491.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43264]
gi|419488280|ref|ZP_14028033.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44386]
gi|419520261|ref|ZP_14059860.1| HNH endonuclease family protein [Streptococcus pneumoniae GA05245]
gi|419525050|ref|ZP_14064616.1| HNH endonuclease family protein [Streptococcus pneumoniae GA14373]
gi|421248645|ref|ZP_15705108.1| HNH endonuclease family protein [Streptococcus pneumoniae 2082239]
gi|421271937|ref|ZP_15722784.1| HNH endonuclease family protein [Streptococcus pneumoniae SPAR55]
gi|168996964|gb|ACA37576.1| HNH endonuclease domain protein [Streptococcus pneumoniae
Hungary19A-6]
gi|172042557|gb|EDT50603.1| HNH endonuclease domain protein [Streptococcus pneumoniae
CDC1873-00]
gi|183576210|gb|EDT96738.1| HNH endonuclease domain protein [Streptococcus pneumoniae
CDC3059-06]
gi|332077580|gb|EGI88041.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41301]
gi|332203444|gb|EGJ17511.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47368]
gi|353754300|gb|EHD34913.1| HNH endonuclease family protein [Streptococcus pneumoniae 6735-05]
gi|353764430|gb|EHD44979.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43265]
gi|353766285|gb|EHD46825.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44452]
gi|353771852|gb|EHD52359.1| HNH endonuclease family protein [Streptococcus pneumoniae 6901-05]
gi|353781535|gb|EHD61977.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41410]
gi|353784547|gb|EHD64967.1| HNH endonuclease family protein [Streptococcus pneumoniae GA49447]
gi|353788540|gb|EHD68937.1| HNH endonuclease family protein [Streptococcus pneumoniae 5787-06]
gi|353826578|gb|EHE06736.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17227]
gi|353830921|gb|EHE11050.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17328]
gi|353836301|gb|EHE16389.1| HNH endonuclease family protein [Streptococcus pneumoniae GA19077]
gi|353876228|gb|EHE56077.1| HNH endonuclease family protein [Streptococcus pneumoniae NP127]
gi|353878025|gb|EHE57864.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47751]
gi|353895002|gb|EHE74742.1| HNH endonuclease family protein [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|353903994|gb|EHE79508.1| HNH endonuclease family protein [Streptococcus pneumoniae GA11426]
gi|379541885|gb|EHZ07050.1| HNH endonuclease family protein [Streptococcus pneumoniae GA05245]
gi|379560754|gb|EHZ25776.1| HNH endonuclease family protein [Streptococcus pneumoniae GA14373]
gi|379589603|gb|EHZ54442.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43264]
gi|379590395|gb|EHZ55233.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44386]
gi|395615274|gb|EJG75290.1| HNH endonuclease family protein [Streptococcus pneumoniae 2082239]
gi|395877496|gb|EJG88565.1| HNH endonuclease family protein [Streptococcus pneumoniae SPAR55]
Length = 326
Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 23/39 (58%)
Query: 109 GCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
L + DHI+P SKGG ST +N Q L NRSK N+
Sbjct: 287 SLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNKI 325
>gi|383774651|ref|YP_005453720.1| hypothetical protein S23_64260 [Bradyrhizobium sp. S23321]
gi|398826008|ref|ZP_10584278.1| restriction endonuclease [Bradyrhizobium sp. YR681]
gi|381362778|dbj|BAL79608.1| hypothetical protein S23_64260 [Bradyrhizobium sp. S23321]
gi|398222215|gb|EJN08599.1| restriction endonuclease [Bradyrhizobium sp. YR681]
Length = 185
Score = 37.0 bits (84), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
NV R C C + +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFACQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPAQAKMF 144
Query: 155 IQKSSY 160
++S++
Sbjct: 145 PRQSAF 150
>gi|418077823|ref|ZP_12715047.1| HNH endonuclease family protein [Streptococcus pneumoniae 4027-06]
gi|418116043|ref|ZP_12753020.1| HNH endonuclease family protein [Streptococcus pneumoniae 6963-05]
gi|419439525|ref|ZP_13979582.1| HNH endonuclease family protein [Streptococcus pneumoniae GA40410]
gi|419459153|ref|ZP_13999089.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02270]
gi|419461430|ref|ZP_14001347.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02714]
gi|353748845|gb|EHD29496.1| HNH endonuclease family protein [Streptococcus pneumoniae 4027-06]
gi|353791681|gb|EHD72055.1| HNH endonuclease family protein [Streptococcus pneumoniae 6963-05]
gi|379534025|gb|EHY99237.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02270]
gi|379534955|gb|EHZ00162.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02714]
gi|379582010|gb|EHZ46893.1| HNH endonuclease family protein [Streptococcus pneumoniae GA40410]
Length = 356
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 23/39 (58%)
Query: 109 GCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
L + DHI+P SKGG ST +N Q L NRSK N+
Sbjct: 317 SLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNKI 355
>gi|167647163|ref|YP_001684826.1| HNH endonuclease [Caulobacter sp. K31]
gi|167349593|gb|ABZ72328.1| HNH endonuclease [Caulobacter sp. K31]
Length = 186
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+ R C C D +DH++P S+GG++T EN A N KG RT
Sbjct: 86 NLFLRDGFSCQYCGSPDDLTFDHVIPRSRGGRTTWENIVTACAPCNLHKGGRT 138
>gi|86747604|ref|YP_484100.1| HNH endonuclease [Rhodopseudomonas palustris HaA2]
gi|86570632|gb|ABD05189.1| HNH endonuclease [Rhodopseudomonas palustris HaA2]
Length = 185
Score = 37.0 bits (84), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C + +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFSCQYCGAPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNMT 137
>gi|416376881|ref|ZP_11683511.1| HNH endonuclease, partial [Crocosphaera watsonii WH 0003]
gi|357266329|gb|EHJ14977.1| HNH endonuclease, partial [Crocosphaera watsonii WH 0003]
Length = 447
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 33/84 (39%), Gaps = 14/84 (16%)
Query: 61 STANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH------ 113
+ P S PY W K P R K + +K G CP C H
Sbjct: 364 TLVKPESSPYDGNWTYWSKRRGTYTGTPARVAK------LLKKQKGICPQCKQHFTPEDL 417
Query: 114 -DYDHILPYSKGGKSTLENCQVLQ 136
+ DHI+P SKGGK T N Q L
Sbjct: 418 IEVDHIIPKSKGGKDTYNNLQALH 441
>gi|158318135|ref|YP_001510643.1| stress protein [Frankia sp. EAN1pec]
gi|158113540|gb|ABW15737.1| stress protein [Frankia sp. EAN1pec]
Length = 560
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 16/42 (38%), Positives = 25/42 (59%)
Query: 105 VGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
V C ++DH++P+S+GG S++ N Q+L N KG R
Sbjct: 518 VQCQATEYLEFDHVIPHSRGGASSVNNLQLLCRRCNLEKGAR 559
>gi|260881784|ref|ZP_05893557.1| HNH endonuclease domain protein [Mitsuokella multacida DSM 20544]
gi|260847880|gb|EEX67887.1| HNH endonuclease domain protein [Mitsuokella multacida DSM 20544]
Length = 90
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 26/46 (56%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSS 159
D DHILP S GG L+N Q L NRSK + + + D I+ ++
Sbjct: 34 DIDHILPQSYGGGDGLDNLQCLCKHCNRSKRDSLDHTAQDYIKNNA 79
>gi|386772121|ref|ZP_10094499.1| HNH endonuclease [Brachybacterium paraconglomeratum LC44]
Length = 526
Score = 37.0 bits (84), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 95 ALGNVVFRKLVGCPGCLCH----DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELS 150
A+ + R+ C G C+ YDHI+P S+GG ++L+N Q L A N+ + ++
Sbjct: 389 AMRRFLSRRDTTCRGPFCNAQIRQYDHIVPVSRGGPTSLDNGQGLCAHCNKKEQLAAQVE 448
Query: 151 R 151
R
Sbjct: 449 R 449
>gi|261493161|ref|ZP_05989692.1| HNH endonuclease [Mannheimia haemolytica serotype A2 str. BOVINE]
gi|261495917|ref|ZP_05992339.1| HNH endonuclease [Mannheimia haemolytica serotype A2 str. OVINE]
gi|261308438|gb|EEY09719.1| HNH endonuclease [Mannheimia haemolytica serotype A2 str. OVINE]
gi|261311164|gb|EEY12336.1| HNH endonuclease [Mannheimia haemolytica serotype A2 str. BOVINE]
Length = 387
Score = 37.0 bits (84), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 23/31 (74%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DH+ +SKGG + ENC++L + NR+KGN+
Sbjct: 357 DHVSAWSKGGLTGAENCEMLCISHNRAKGNK 387
>gi|451340018|ref|ZP_21910523.1| hypothetical protein C791_7907 [Amycolatopsis azurea DSM 43854]
gi|449417231|gb|EMD22907.1| hypothetical protein C791_7907 [Amycolatopsis azurea DSM 43854]
Length = 63
Score = 36.6 bits (83), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
DHI+P SKGG TL+N + + N S+GNR
Sbjct: 21 DHIVPRSKGGPDTLDNARPAHRSCNSSRGNRA 52
>gi|94986243|ref|YP_605607.1| HNH endonuclease [Deinococcus geothermalis DSM 11300]
gi|94556524|gb|ABF46438.1| HNH endonuclease [Deinococcus geothermalis DSM 11300]
Length = 189
Score = 36.6 bits (83), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV+ R C C DH+LP S+GG+ + EN T N+ KGNRT
Sbjct: 99 NVLRRDTFACQYCGAETDLTLDHVLPRSRGGRHSWENVVTACRTCNQRKGNRT 151
>gi|197106299|ref|YP_002131676.1| restriction endonuclease [Phenylobacterium zucineum HLK1]
gi|196479719|gb|ACG79247.1| restriction endonuclease [Phenylobacterium zucineum HLK1]
Length = 186
Score = 36.6 bits (83), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 47/109 (43%), Gaps = 9/109 (8%)
Query: 44 EEEISTLFTDLR--ISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVF 101
+E I +F D +S+ D + SF + K + R P R N+
Sbjct: 34 QEVIKAVFLDRVDVVSTYDQVIHSPSFEMKLPSVVSLKHYVAQDRPPAFTR----FNLFL 89
Query: 102 RKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
R C C + +DH++P S+GG++T EN A N +KG RT
Sbjct: 90 RDSFTCQYCTAGEELTFDHVIPRSRGGRTTWENIVTACARCNLTKGGRT 138
>gi|171914460|ref|ZP_02929930.1| HNH endonuclease [Verrucomicrobium spinosum DSM 4136]
Length = 193
Score = 36.6 bits (83), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
+ DHI+P S+GGK++ +NC + +N K +RT
Sbjct: 122 NIDHIMPRSRGGKTSWDNCVLADKRINSRKADRT 155
>gi|295840114|ref|ZP_06827047.1| HNH endonuclease domain-containing protein [Streptomyces sp. SPB74]
gi|197696993|gb|EDY43926.1| HNH endonuclease domain-containing protein [Streptomyces sp. SPB74]
Length = 211
Score = 36.6 bits (83), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%)
Query: 97 GNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
G+ V G P C + DH+ P S+GG + N QVL + N SKG R
Sbjct: 161 GDPVPAAPTGRPLARCLELDHVHPRSRGGSDDVSNLQVLCTSCNTSKGARI 211
>gi|384215175|ref|YP_005606341.1| hypothetical protein BJ6T_14660 [Bradyrhizobium japonicum USDA 6]
gi|354954074|dbj|BAL06753.1| hypothetical protein BJ6T_14660 [Bradyrhizobium japonicum USDA 6]
Length = 185
Score = 36.6 bits (83), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
NV R C C + +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFNCQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPAQAKMF 144
Query: 155 IQKSSY 160
++S++
Sbjct: 145 PRQSAF 150
>gi|67924437|ref|ZP_00517863.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
endonuclease [Crocosphaera watsonii WH 8501]
gi|67853726|gb|EAM49059.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
endonuclease [Crocosphaera watsonii WH 8501]
Length = 596
Score = 36.6 bits (83), Expect = 5.0, Method: Composition-based stats.
Identities = 26/80 (32%), Positives = 32/80 (40%), Gaps = 14/80 (17%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH-------DYD 116
P + PY W K P R K + +K G CP C + + D
Sbjct: 488 PEASPYDGNWTYWSKRRGTYSGTPTRVAK------LLKKQKGICPQCKQYFTPEDLIEVD 541
Query: 117 HILPYSKGGKSTLENCQVLQ 136
HI+P SKGGK T N Q L
Sbjct: 542 HIIPKSKGGKDTYNNLQALH 561
>gi|392383290|ref|YP_005032487.1| putative HNH endonuclease [Azospirillum brasilense Sp245]
gi|356878255|emb|CCC99127.1| putative HNH endonuclease [Azospirillum brasilense Sp245]
Length = 187
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 34/74 (45%), Gaps = 11/74 (14%)
Query: 98 NVVFRKLVGCPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE--- 148
NV R C C H +DH++P S+GG++T +N + N +KGNR
Sbjct: 83 NVFLRDRFTCQYCGHHFPTQELTFDHVIPRSRGGRTTWDNVVTSCSACNLAKGNRLPHSC 142
Query: 149 --LSRSDLIQKSSY 160
+ S Q S+Y
Sbjct: 143 GMIPLSPPFQPSAY 156
>gi|67924762|ref|ZP_00518164.1| HNH endonuclease [Crocosphaera watsonii WH 8501]
gi|67853387|gb|EAM48744.1| HNH endonuclease [Crocosphaera watsonii WH 8501]
Length = 183
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 13/103 (12%)
Query: 41 LATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVV 100
T E + T+ I + P S PY W K P R + ++
Sbjct: 69 FQTREYVLYHHTETEIKR-HTLVKPESSPYDGNWTYWSKRRGTYTGTPAR-----VAKLL 122
Query: 101 FRKLVGCPGCLCH-------DYDHILPYSKGGKSTLENCQVLQ 136
++ CP C H + DHI+P SKGGK T N Q L
Sbjct: 123 KKQKGICPQCKQHFTPEDLIEVDHIIPKSKGGKDTYNNLQALH 165
>gi|374578280|ref|ZP_09651376.1| restriction endonuclease [Bradyrhizobium sp. WSM471]
gi|386398693|ref|ZP_10083471.1| restriction endonuclease [Bradyrhizobium sp. WSM1253]
gi|374426601|gb|EHR06134.1| restriction endonuclease [Bradyrhizobium sp. WSM471]
gi|385739319|gb|EIG59515.1| restriction endonuclease [Bradyrhizobium sp. WSM1253]
Length = 185
Score = 36.6 bits (83), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
NV R C C + +DHI+P SKGG++T EN + N KGN T +
Sbjct: 85 NVFLRDRFACQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPV 139
>gi|452964360|gb|EME69402.1| HNH endonuclease family protein [Magnetospirillum sp. SO-1]
Length = 186
Score = 36.6 bits (83), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 6/57 (10%)
Query: 98 NVVFRKLVGCPGC----LCHD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
NV R C C HD +DH+LP S+GG++T +N A N KG+R
Sbjct: 82 NVFLRDRFSCQYCGEWFPTHDLTFDHVLPRSRGGRTTWDNVVTACAPCNLRKGDRLH 138
>gi|15807725|ref|NP_285380.1| TerF-like protein [Deinococcus radiodurans R1]
gi|6460551|gb|AAF12257.1|AE001862_83 TerF-related protein [Deinococcus radiodurans R1]
Length = 579
Score = 36.6 bits (83), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 20/82 (24%)
Query: 66 RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
RS P VK + W++ + + V C ++DH++P+S GG
Sbjct: 518 RSIPQKVKLEVWQRDQG--------------------QCVECGDTNYLEFDHVIPHSLGG 557
Query: 126 KSTLENCQVLQATVNRSKGNRT 147
S++ N Q+L N +K NR
Sbjct: 558 ASSVGNLQLLCRRCNLAKSNRI 579
>gi|67923004|ref|ZP_00516498.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
endonuclease [Crocosphaera watsonii WH 8501]
gi|67855152|gb|EAM50417.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
endonuclease [Crocosphaera watsonii WH 8501]
Length = 596
Score = 36.6 bits (83), Expect = 5.5, Method: Composition-based stats.
Identities = 26/80 (32%), Positives = 32/80 (40%), Gaps = 14/80 (17%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH-------DYD 116
P + PY W K P R K + +K G CP C + + D
Sbjct: 488 PEASPYDGNWTYWSKRRGTYSGTPTRVAK------LLKKQKGICPQCKQYFTPEDLIEVD 541
Query: 117 HILPYSKGGKSTLENCQVLQ 136
HI+P SKGGK T N Q L
Sbjct: 542 HIIPKSKGGKDTYNNLQALH 561
>gi|431809878|ref|YP_007236767.1| HNH endonuclease [Burkholderia phage BcepMigl]
gi|394777723|gb|AFN39090.1| HNH endonuclease [Burkholderia phage BcepMigl]
Length = 147
Score = 36.6 bits (83), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 35/75 (46%), Gaps = 11/75 (14%)
Query: 80 AEKVRGRDPDRWRKDALGNVVFRKLVGCPGC-----LCHDYDHILPYSKGGKSTLENCQV 134
A ++ R PD R+ FR C C LC DHI+P S GG +TLEN Q
Sbjct: 79 AAYLKTRIPDELRQAVFERDGFR----CVECGSNKRLC--ADHIIPESAGGPTTLENLQT 132
Query: 135 LQATVNRSKGNRTEL 149
+ N KG++ +
Sbjct: 133 MCRPCNSEKGDKAPV 147
>gi|326386057|ref|ZP_08207681.1| HNH endonuclease [Novosphingobium nitrogenifigens DSM 19370]
gi|326209282|gb|EGD60075.1| HNH endonuclease [Novosphingobium nitrogenifigens DSM 19370]
Length = 227
Score = 36.6 bits (83), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 26/53 (49%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+ R C C H+ +DH++P GG++T EN A N KG RT
Sbjct: 127 NLFLRDRFMCQYCGSHEHLTFDHVVPRRLGGRTTWENVSTACAPCNMRKGGRT 179
>gi|219847405|ref|YP_002461838.1| HNH endonuclease [Chloroflexus aggregans DSM 9485]
gi|219541664|gb|ACL23402.1| HNH endonuclease [Chloroflexus aggregans DSM 9485]
Length = 206
Score = 36.6 bits (83), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 23/35 (65%), Gaps = 1/35 (2%)
Query: 115 YDHILPYSKGGKST-LENCQVLQATVNRSKGNRTE 148
YDHI+PYSKGG S EN Q+L A N K ++ E
Sbjct: 172 YDHIIPYSKGGTSLKAENIQLLCARHNLEKRDKIE 206
>gi|67923370|ref|ZP_00516851.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
endonuclease [Crocosphaera watsonii WH 8501]
gi|67854795|gb|EAM50073.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
endonuclease [Crocosphaera watsonii WH 8501]
Length = 596
Score = 36.6 bits (83), Expect = 5.8, Method: Composition-based stats.
Identities = 26/80 (32%), Positives = 32/80 (40%), Gaps = 14/80 (17%)
Query: 65 PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH-------DYD 116
P + PY W K P R K + +K G CP C + + D
Sbjct: 488 PEASPYDGNWTYWSKRRGTYSGTPTRVAK------LLKKQKGICPQCKQYFTPEDLIEVD 541
Query: 117 HILPYSKGGKSTLENCQVLQ 136
HI+P SKGGK T N Q L
Sbjct: 542 HIIPKSKGGKDTYNNLQALH 561
>gi|452751447|ref|ZP_21951193.1| HNH endonuclease family protein [alpha proteobacterium JLT2015]
gi|451961597|gb|EMD84007.1| HNH endonuclease family protein [alpha proteobacterium JLT2015]
Length = 214
Score = 36.2 bits (82), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
N+ R C C D +DH++P + GG++T EN A N +KG RT
Sbjct: 114 NLFLRDRFSCQYCGRTDDLTFDHVIPRAYGGRTTWENVATACAPCNLAKGGRT 166
>gi|384260910|ref|YP_005416096.1| HNH endonuclease [Rhodospirillum photometricum DSM 122]
gi|378402010|emb|CCG07126.1| HNH endonuclease [Rhodospirillum photometricum DSM 122]
Length = 218
Score = 36.2 bits (82), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 6/56 (10%)
Query: 98 NVVFRKLVGCPGC----LCHD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C HD +DH++P SKGG++T +N N K NRT
Sbjct: 115 NVFLRDGFTCQYCGTRLPAHDLTFDHVVPRSKGGRTTWDNVVAACGPCNMRKANRT 170
>gi|92118950|ref|YP_578679.1| HNH endonuclease [Nitrobacter hamburgensis X14]
gi|91801844|gb|ABE64219.1| HNH endonuclease [Nitrobacter hamburgensis X14]
Length = 185
Score = 36.2 bits (82), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C + +DHI+P SKGG++T +N + N KGN T
Sbjct: 85 NVFLRDRFMCQYCTASEDLTFDHIIPRSKGGQTTWDNVVAACSPCNLRKGNLT 137
>gi|390956260|ref|YP_006420017.1| restriction endonuclease [Terriglobus roseus DSM 18391]
gi|390411178|gb|AFL86682.1| restriction endonuclease [Terriglobus roseus DSM 18391]
Length = 205
Score = 36.2 bits (82), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 33/75 (44%), Gaps = 6/75 (8%)
Query: 86 RDPDRWRKDALGNVVFRKLVGCPGCLC------HDYDHILPYSKGGKSTLENCQVLQATV 139
R P + R + N++ R C C C DH+LP S+GG ST EN
Sbjct: 95 RIPHQTRALSRKNILLRDRNTCQYCGCILTTGDLTLDHVLPRSRGGLSTWENLVACCHDC 154
Query: 140 NRSKGNRTELSRSDL 154
NR KGN+ D+
Sbjct: 155 NRRKGNQLLHELQDM 169
>gi|410867562|ref|YP_006982173.1| HNH endonuclease domain-containing protein [Propionibacterium
acidipropionici ATCC 4875]
gi|410824203|gb|AFV90818.1| HNH endonuclease domain-containing protein [Propionibacterium
acidipropionici ATCC 4875]
Length = 330
Score = 36.2 bits (82), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
L + DHI+P S+GG ST +N Q L NRSK N+
Sbjct: 291 LLLEVDHIVPVSRGGLSTEDNLQTLCWRCNRSKSNK 326
>gi|424865305|ref|ZP_18289174.1| diadenosine tetraphosphate [SAR86 cluster bacterium SAR86B]
gi|400758911|gb|EJP73111.1| diadenosine tetraphosphate [SAR86 cluster bacterium SAR86B]
Length = 325
Score = 36.2 bits (82), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 23/41 (56%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
D DHILP SKGG + + N Q L T NR K N + D+
Sbjct: 155 DVDHILPRSKGGSNDISNLQSLCYTCNRQKRNLDDTDLRDM 195
>gi|340776977|ref|ZP_08696920.1| HNH endonuclease [Acetobacter aceti NBRC 14818]
Length = 186
Score = 36.2 bits (82), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 11/74 (14%)
Query: 98 NVVFRKLVGCPGC----LCHD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE--- 148
N+ R C C HD +DH++P S+GG++T EN +T N KG+R
Sbjct: 83 NLFLRDNFSCQYCNDQLPTHDLTFDHVIPRSRGGRTTWENIVTACSTCNLLKGSRMPHEI 142
Query: 149 --LSRSDLIQKSSY 160
R +Q SS+
Sbjct: 143 RMFPRRQPVQPSSW 156
>gi|337285718|ref|YP_004625191.1| HNH endonuclease [Thermodesulfatator indicus DSM 15286]
gi|335358546|gb|AEH44227.1| HNH endonuclease [Thermodesulfatator indicus DSM 15286]
Length = 203
Score = 36.2 bits (82), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 21/32 (65%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
DHI+P S+GGK+T EN + N KGNRT
Sbjct: 127 DHIIPRSRGGKTTWENVVLCCRKCNIKKGNRT 158
>gi|434405466|ref|YP_007148351.1| restriction endonuclease [Cylindrospermum stagnale PCC 7417]
gi|428259721|gb|AFZ25671.1| restriction endonuclease [Cylindrospermum stagnale PCC 7417]
Length = 81
Score = 36.2 bits (82), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
DHI+P S+GGK+ + N Q L T N+ K N+ +
Sbjct: 40 DHIIPLSRGGKNDISNLQTLCLTCNQQKTNKID 72
>gi|86605206|ref|YP_473969.1| HNH endonuclease domain-containing protein [Synechococcus sp.
JA-3-3Ab]
gi|86553748|gb|ABC98706.1| HNH endonuclease domain protein [Synechococcus sp. JA-3-3Ab]
Length = 178
Score = 36.2 bits (82), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 98 NVVFRKLVGCPGCLCHD----YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV++R C C C+ DH++P S+GG+ T EN N KGNRT
Sbjct: 70 NVLYRDGHRCQYCGCYGEDLTLDHVIPRSRGGEDTWENVVSACVRCNVKKGNRT 123
>gi|381159335|ref|ZP_09868567.1| restriction endonuclease [Thiorhodovibrio sp. 970]
gi|380877399|gb|EIC19491.1| restriction endonuclease [Thiorhodovibrio sp. 970]
Length = 361
Score = 36.2 bits (82), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI P+ +GGK++ ENCQ+L NR K +
Sbjct: 331 DHITPWHEGGKTSAENCQMLCKEDNRRKSGK 361
>gi|379059551|ref|ZP_09850077.1| hypothetical protein SproM1_15864 [Serinicoccus profundi MCCC
1A05965]
Length = 384
Score = 36.2 bits (82), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
D DH+ +SKGG + L N +L T NR+KGN+
Sbjct: 352 DADHVTAWSKGGATDLSNLTMLCQTHNRAKGNK 384
>gi|218682266|ref|ZP_03529867.1| HNH endonuclease [Rhizobium etli CIAT 894]
Length = 228
Score = 35.8 bits (81), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P + GG++T EN + N KG++
Sbjct: 85 NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136
>gi|85714890|ref|ZP_01045876.1| HNH endonuclease [Nitrobacter sp. Nb-311A]
gi|85698376|gb|EAQ36247.1| HNH endonuclease [Nitrobacter sp. Nb-311A]
Length = 206
Score = 35.8 bits (81), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C + +DHI+P SKGG++T +N + N KGN T
Sbjct: 106 NVFLRDRFMCQYCTAVEDLTFDHIIPRSKGGQTTWDNVVAACSPCNLRKGNLT 158
>gi|115522824|ref|YP_779735.1| HNH endonuclease [Rhodopseudomonas palustris BisA53]
gi|115516771|gb|ABJ04755.1| HNH endonuclease [Rhodopseudomonas palustris BisA53]
Length = 185
Score = 35.8 bits (81), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
NV R C C + +DHI+P SKGG++T EN + N KGN T
Sbjct: 85 NVFLRDRFVCQYCGAPEDLTFDHIMPRSKGGQTTWENVVAACSPCNLRKGNLT 137
>gi|406998393|gb|EKE16327.1| hypothetical protein ACD_11C00020G0032 [uncultured bacterium]
Length = 367
Score = 35.8 bits (81), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 18/109 (16%)
Query: 44 EEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRK 103
E+EI L D ++ +PY + +Q E+ +R + ++ +++A R+
Sbjct: 271 EDEIKELMQDEEVTKPSGI-----YPYVLTRQ--ERYLNIRAFN-EKMKREAYE----RQ 318
Query: 104 LVGCPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
C C+ H + DHI P+ +GGK+ NCQ+L NR K +
Sbjct: 319 KGKCVKCMKHFEIGEMEADHIKPWHEGGKTIANNCQILCKDDNRRKSGK 367
>gi|296121467|ref|YP_003629245.1| HNH endonuclease [Planctomyces limnophilus DSM 3776]
gi|296013807|gb|ADG67046.1| HNH endonuclease [Planctomyces limnophilus DSM 3776]
Length = 198
Score = 35.8 bits (81), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%)
Query: 108 PGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
PG DH+ P S+GG ST ENC + N K NRT
Sbjct: 112 PGSEELTIDHVQPRSRGGTSTWENCVLACIACNNKKANRT 151
>gi|347449376|gb|AEO93417.1| gp157 [Bacillus phage G]
Length = 138
Score = 35.8 bits (81), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
DHI+P SKGGK+ + N Q + N KGN+
Sbjct: 106 DHIVPVSKGGKNHISNYQTMCIICNEKKGNQ 136
>gi|405382583|ref|ZP_11036364.1| restriction endonuclease [Rhizobium sp. CF142]
gi|397320989|gb|EJJ25416.1| restriction endonuclease [Rhizobium sp. CF142]
Length = 185
Score = 35.8 bits (81), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P + GG++T EN + N KG++
Sbjct: 85 NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136
>gi|330832281|ref|YP_004401106.1| HNH endonuclease [Streptococcus suis ST3]
gi|329306504|gb|AEB80920.1| HNH endonuclease [Streptococcus suis ST3]
Length = 115
Score = 35.8 bits (81), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 2/63 (3%)
Query: 97 GNVVFRKLVGCPGCLCHDYDHILPYSKGGK-STLENCQVLQATVNRSKGNRTELSRSDLI 155
G +V +KL P +C DH++P +KGG S +EN Q+ + NR K ++ S
Sbjct: 31 GQLVDKKL-KYPNPMCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSKAE 89
Query: 156 QKS 158
QK+
Sbjct: 90 QKT 92
>gi|402489973|ref|ZP_10836766.1| HNH endonuclease [Rhizobium sp. CCGE 510]
gi|401811312|gb|EJT03681.1| HNH endonuclease [Rhizobium sp. CCGE 510]
Length = 185
Score = 35.8 bits (81), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P + GG++T EN + N KG++
Sbjct: 85 NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136
>gi|356560967|ref|XP_003548757.1| PREDICTED: uncharacterized protein LOC100810482 [Glycine max]
Length = 282
Score = 35.8 bits (81), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 8/80 (10%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT-ELSRSD 153
N++FR C C H+ DH++P + GG+ T EN A N KG +T E ++
Sbjct: 180 NILFRDNYTCQYCSSHENLTIDHVMPTALGGEWTWENLVTACAKCNCKKGRKTLEEAKMK 239
Query: 154 LIQKSSYCRVSGRDMDLLEL 173
LI+ +D D+L +
Sbjct: 240 LIKPPK----VPKDYDILAI 255
>gi|21673556|ref|NP_661621.1| HNH endonuclease [Chlorobium tepidum TLS]
gi|21646667|gb|AAM71963.1| HNH endonuclease family protein [Chlorobium tepidum TLS]
Length = 170
Score = 35.8 bits (81), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 4/45 (8%)
Query: 107 CPGCLCHD----YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
C C C D DH++P S+GG+ T EN + N KGNRT
Sbjct: 87 CQYCGCKDGSLTVDHVMPKSRGGEDTWENLITACKSCNTKKGNRT 131
>gi|209883894|ref|YP_002287751.1| HNH endonuclease [Oligotropha carboxidovorans OM5]
gi|337742396|ref|YP_004634124.1| HNH endonuclease domain-containing protein [Oligotropha
carboxidovorans OM5]
gi|386031361|ref|YP_005952136.1| HNH endonuclease domain protein [Oligotropha carboxidovorans OM4]
gi|209872090|gb|ACI91886.1| HNH endonuclease [Oligotropha carboxidovorans OM5]
gi|336096427|gb|AEI04253.1| HNH endonuclease domain protein [Oligotropha carboxidovorans OM4]
gi|336100060|gb|AEI07883.1| HNH endonuclease domain protein [Oligotropha carboxidovorans OM5]
Length = 185
Score = 35.8 bits (81), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
NV R C C + +DHI+P S+GG++T EN + N KG+ T +
Sbjct: 85 NVFLRDRFSCQYCTSQEDLTFDHIIPRSRGGQTTWENVVAACSPCNLKKGSMTMAQANMF 144
Query: 155 IQKSSY 160
+++ Y
Sbjct: 145 PRQTPY 150
>gi|84685670|ref|ZP_01013567.1| hypothetical protein 1099457000253_RB2654_06854 [Maritimibacter
alkaliphilus HTCC2654]
gi|84666336|gb|EAQ12809.1| hypothetical protein RB2654_06854 [Rhodobacterales bacterium
HTCC2654]
Length = 1126
Score = 35.8 bits (81), Expect = 8.9, Method: Composition-based stats.
Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 3/45 (6%)
Query: 103 KLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
K +G GC D DHI+ GG S N Q+L AT NR G+ T
Sbjct: 312 KGLGKSGC---DIDHIVEKQMGGTSVPSNLQILDATKNRESGSNT 353
>gi|296445443|ref|ZP_06887400.1| HNH endonuclease [Methylosinus trichosporium OB3b]
gi|296257009|gb|EFH04079.1| HNH endonuclease [Methylosinus trichosporium OB3b]
Length = 190
Score = 35.8 bits (81), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 27/55 (49%), Gaps = 3/55 (5%)
Query: 98 NVVFRKLVGCPGCLC-HD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
NV R C C HD +DH++P SKGG +T EN + N KG+R
Sbjct: 90 NVFLRDRFTCQYCGAQHDLTFDHVIPRSKGGATTWENVVAACSPCNLRKGDRLPF 144
>gi|241206532|ref|YP_002977628.1| HNH endonuclease [Rhizobium leguminosarum bv. trifolii WSM1325]
gi|240860422|gb|ACS58089.1| HNH endonuclease [Rhizobium leguminosarum bv. trifolii WSM1325]
Length = 185
Score = 35.8 bits (81), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P + GG++T EN + N KG++
Sbjct: 85 NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136
>gi|209551129|ref|YP_002283046.1| HNH endonuclease [Rhizobium leguminosarum bv. trifolii WSM2304]
gi|424886574|ref|ZP_18310182.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
WSM2012]
gi|424897225|ref|ZP_18320799.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
WSM2297]
gi|424916606|ref|ZP_18339970.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
WSM597]
gi|209536885|gb|ACI56820.1| HNH endonuclease [Rhizobium leguminosarum bv. trifolii WSM2304]
gi|392852782|gb|EJB05303.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
WSM597]
gi|393175925|gb|EJC75967.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
WSM2012]
gi|393181452|gb|EJC81491.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
WSM2297]
Length = 185
Score = 35.8 bits (81), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P + GG++T EN + N KG++
Sbjct: 85 NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136
>gi|190893620|ref|YP_001980162.1| endonuclease [Rhizobium etli CIAT 652]
gi|218515988|ref|ZP_03512828.1| putative endonuclease protein [Rhizobium etli 8C-3]
gi|417097164|ref|ZP_11959076.1| putative endonuclease protein [Rhizobium etli CNPAF512]
gi|421589410|ref|ZP_16034555.1| endonuclease [Rhizobium sp. Pop5]
gi|190698899|gb|ACE92984.1| putative endonuclease protein [Rhizobium etli CIAT 652]
gi|327193381|gb|EGE60281.1| putative endonuclease protein [Rhizobium etli CNPAF512]
gi|403705646|gb|EJZ21183.1| endonuclease [Rhizobium sp. Pop5]
Length = 185
Score = 35.8 bits (81), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P + GG++T EN + N KG++
Sbjct: 85 NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136
>gi|424872552|ref|ZP_18296214.1| restriction endonuclease [Rhizobium leguminosarum bv. viciae
WSM1455]
gi|393168253|gb|EJC68300.1| restriction endonuclease [Rhizobium leguminosarum bv. viciae
WSM1455]
Length = 185
Score = 35.8 bits (81), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 98 NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
NV R C C HD +DH++P + GG++T EN + N KG++
Sbjct: 85 NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136
>gi|118443767|ref|YP_877830.1| HNH endonuclease domain-containing protein [Clostridium novyi NT]
gi|118134223|gb|ABK61267.1| HNH endonuclease domain protein [Clostridium novyi NT]
Length = 299
Score = 35.8 bits (81), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 21/34 (61%)
Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKG 144
L + DHI+P SKGG ST +N Q L NR KG
Sbjct: 262 LLLEIDHIIPISKGGMSTEKNLQTLCWKCNRKKG 295
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.130 0.388
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,915,154,870
Number of Sequences: 23463169
Number of extensions: 110196010
Number of successful extensions: 327660
Number of sequences better than 100.0: 393
Number of HSP's better than 100.0 without gapping: 243
Number of HSP's successfully gapped in prelim test: 150
Number of HSP's that attempted gapping in prelim test: 327321
Number of HSP's gapped (non-prelim): 435
length of query: 191
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 57
effective length of database: 9,215,130,721
effective search space: 525262451097
effective search space used: 525262451097
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 72 (32.3 bits)