BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029597
         (191 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255547375|ref|XP_002514745.1| endonuclease, putative [Ricinus communis]
 gi|223546349|gb|EEF47851.1| endonuclease, putative [Ricinus communis]
          Length = 203

 Score =  289 bits (739), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 139/182 (76%), Positives = 157/182 (86%), Gaps = 4/182 (2%)

Query: 13  TATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRIS---SADSTANPRSFP 69
           TA+ +S+P  S RR  T   SATL D E+ TEE+  TLF DL+IS   +  S+  PRSFP
Sbjct: 23  TASFSSTPDSSHRRS-TLAHSATLQDREVLTEEDTLTLFEDLQISEKYTKFSSTEPRSFP 81

Query: 70  YSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL 129
           YSVKQQCW+KAEKV+GRDPDRWR+D LGN+VFRKLVGCPGCLCHDYDHI+PYSKGGKSTL
Sbjct: 82  YSVKQQCWDKAEKVKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKGGKSTL 141

Query: 130 ENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCR 189
           ENCQVLQATVNRSKGNRTELSR+DLIQ+SSYCRV+GRDMDL+ELSAYGNVR  Q+SGGCR
Sbjct: 142 ENCQVLQATVNRSKGNRTELSRADLIQRSSYCRVAGRDMDLIELSAYGNVRHAQESGGCR 201

Query: 190 IQ 191
           IQ
Sbjct: 202 IQ 203


>gi|225457397|ref|XP_002284877.1| PREDICTED: uncharacterized protein LOC100262165 isoform 1 [Vitis
           vinifera]
          Length = 199

 Score =  285 bits (728), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 135/172 (78%), Positives = 150/172 (87%), Gaps = 1/172 (0%)

Query: 20  PSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEK 79
           PS SR     +  S +LLD    TE+E +TLF +L+IS  +   NPRSFPYSVKQQCWEK
Sbjct: 29  PSGSRSTTRLSPSSTSLLDTPFITEDEDATLFDNLKIS-PNPNFNPRSFPYSVKQQCWEK 87

Query: 80  AEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATV 139
           AEK++GRDPDRWR+D LGN+VFRKLVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQATV
Sbjct: 88  AEKIKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQATV 147

Query: 140 NRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           NRSKGNRTE+S++DLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGC IQ
Sbjct: 148 NRSKGNRTEISKADLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCTIQ 199


>gi|449477241|ref|XP_004154969.1| PREDICTED: uncharacterized protein LOC101226861 [Cucumis sativus]
          Length = 200

 Score =  271 bits (692), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 129/156 (82%), Positives = 142/156 (91%), Gaps = 4/156 (2%)

Query: 38  DNELATEEEISTLFTDLRISSADSTAN--PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDA 95
           D +L T E  S+L  DL++S  DS +N  PRSFP+SVKQQCW+KAEK++GRDPDRWR+DA
Sbjct: 47  DRDLVTGE--SSLLGDLKLSPDDSVSNYNPRSFPHSVKQQCWDKAEKIKGRDPDRWRRDA 104

Query: 96  LGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLI 155
           LGN VFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE+SR++LI
Sbjct: 105 LGNTVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEISRAELI 164

Query: 156 QKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           QKSSYCRVSGRDMDLLELSAYGNV RGQDSGGCRIQ
Sbjct: 165 QKSSYCRVSGRDMDLLELSAYGNVHRGQDSGGCRIQ 200


>gi|449440796|ref|XP_004138170.1| PREDICTED: uncharacterized protein LOC101222461 [Cucumis sativus]
          Length = 200

 Score =  270 bits (691), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 129/156 (82%), Positives = 141/156 (90%), Gaps = 4/156 (2%)

Query: 38  DNELATEEEISTLFTDLRISSADSTAN--PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDA 95
           D +L T E  S+L  DL+ S  DS +N  PRSFP+SVKQQCW+KAEK++GRDPDRWR+DA
Sbjct: 47  DRDLVTGE--SSLLGDLKFSPDDSVSNYNPRSFPHSVKQQCWDKAEKIKGRDPDRWRRDA 104

Query: 96  LGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLI 155
           LGN VFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE+SR++LI
Sbjct: 105 LGNTVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEISRAELI 164

Query: 156 QKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           QKSSYCRVSGRDMDLLELSAYGNV RGQDSGGCRIQ
Sbjct: 165 QKSSYCRVSGRDMDLLELSAYGNVHRGQDSGGCRIQ 200


>gi|388520967|gb|AFK48545.1| unknown [Lotus japonicus]
          Length = 198

 Score =  257 bits (656), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 127/184 (69%), Positives = 145/184 (78%), Gaps = 14/184 (7%)

Query: 8   KRRKSTATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRS 67
           K R +T +   S SP R    +T  SATLLD++         LF D          NPRS
Sbjct: 29  KDRSATTSSKLSSSPKR----STSPSATLLDDDPVA------LFDDY----THPNPNPRS 74

Query: 68  FPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKS 127
           FP+SVKQ+CWEKA+K++GRDPDRWR+D LGN +FRKLVGCPGCLCHDYDHI+PYSKGG+S
Sbjct: 75  FPHSVKQKCWEKADKIKGRDPDRWRRDPLGNTIFRKLVGCPGCLCHDYDHIVPYSKGGQS 134

Query: 128 TLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGG 187
           TLENCQVLQATVNR+KGNRT +SR+DL+QKSSYCRVSGRDMDLLELSAYGNVR G DSGG
Sbjct: 135 TLENCQVLQATVNRAKGNRTGISRADLMQKSSYCRVSGRDMDLLELSAYGNVRHGPDSGG 194

Query: 188 CRIQ 191
           CRIQ
Sbjct: 195 CRIQ 198


>gi|351720795|ref|NP_001237956.1| uncharacterized protein LOC100500174 [Glycine max]
 gi|255629565|gb|ACU15130.1| unknown [Glycine max]
          Length = 206

 Score =  256 bits (655), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 121/158 (76%), Positives = 141/158 (89%), Gaps = 4/158 (2%)

Query: 34  ATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRK 93
           ATLLD     ++++++LF D    + +   NPRSFP+SVKQ+CWEKA+KV+GRDPDRWR+
Sbjct: 53  ATLLD----VDDDVTSLFHDYPSPNPNPNPNPRSFPHSVKQKCWEKADKVKGRDPDRWRR 108

Query: 94  DALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSD 153
           DALGN +FRKLVGCPGCLCHDYDHI+PYSKGG+STLENCQVLQATVNRSKGNRTE+S+++
Sbjct: 109 DALGNTLFRKLVGCPGCLCHDYDHIIPYSKGGESTLENCQVLQATVNRSKGNRTEISKAE 168

Query: 154 LIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           LIQKSSYCRVS RDMDLLELSAYGNVRRG DSGGCRIQ
Sbjct: 169 LIQKSSYCRVSDRDMDLLELSAYGNVRRGPDSGGCRIQ 206


>gi|115451005|ref|NP_001049103.1| Os03g0169800 [Oryza sativa Japonica Group]
 gi|19071624|gb|AAL84291.1|AC073556_8 putative endonuclease [Oryza sativa Japonica Group]
 gi|108706401|gb|ABF94196.1| HNH endonuclease domain-containing protein, putative, expressed
           [Oryza sativa Japonica Group]
 gi|113547574|dbj|BAF11017.1| Os03g0169800 [Oryza sativa Japonica Group]
 gi|215686970|dbj|BAG90840.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218192172|gb|EEC74599.1| hypothetical protein OsI_10187 [Oryza sativa Indica Group]
          Length = 187

 Score =  256 bits (653), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 122/148 (82%), Positives = 134/148 (90%), Gaps = 2/148 (1%)

Query: 44  EEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRK 103
           EE    L T  R S+  S++ PRSFPY VKQ+CWEKAE+V GRDP+RWR+DALGNVVFRK
Sbjct: 42  EELDGLLLTAPRPSA--SSSEPRSFPYVVKQRCWEKAERVAGRDPERWRRDALGNVVFRK 99

Query: 104 LVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRV 163
           LVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQATVNRSKGN+TELS+S+LIQKS+YCRV
Sbjct: 100 LVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQATVNRSKGNKTELSKSELIQKSAYCRV 159

Query: 164 SGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           SGRDMDLLELSAYGNVRRG DSGGCRIQ
Sbjct: 160 SGRDMDLLELSAYGNVRRGPDSGGCRIQ 187


>gi|222624269|gb|EEE58401.1| hypothetical protein OsJ_09573 [Oryza sativa Japonica Group]
          Length = 187

 Score =  254 bits (649), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 121/148 (81%), Positives = 134/148 (90%), Gaps = 2/148 (1%)

Query: 44  EEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRK 103
           EE    L T  R S+  S++ PRSFPY VKQ+CWEKAE++ GRDP+RWR+DALGNVVFRK
Sbjct: 42  EELDGLLLTAPRPSA--SSSEPRSFPYVVKQRCWEKAERLPGRDPERWRRDALGNVVFRK 99

Query: 104 LVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRV 163
           LVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQATVNRSKGN+TELS+S+LIQKS+YCRV
Sbjct: 100 LVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQATVNRSKGNKTELSKSELIQKSAYCRV 159

Query: 164 SGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           SGRDMDLLELSAYGNVRRG DSGGCRIQ
Sbjct: 160 SGRDMDLLELSAYGNVRRGPDSGGCRIQ 187


>gi|15232817|ref|NP_190333.1| HNH endonuclease [Arabidopsis thaliana]
 gi|6522532|emb|CAB61975.1| hypothetical protein [Arabidopsis thaliana]
 gi|117168237|gb|ABK32201.1| At3g47490 [Arabidopsis thaliana]
 gi|332644767|gb|AEE78288.1| HNH endonuclease [Arabidopsis thaliana]
          Length = 234

 Score =  252 bits (643), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 131/205 (63%), Positives = 153/205 (74%), Gaps = 24/205 (11%)

Query: 1   MRPNGEPKRRKSTATRASSPS----------PSRRRGET----TVRSATLLDNELATEEE 46
           M+P  +P RR++ A  +SSPS          P+R  GE        SATLLD      EE
Sbjct: 40  MKP--DPTRRRNRALSSSSPSKTRPELSSGSPNRNTGEARGGKVKTSATLLD-----REE 92

Query: 47  ISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG 106
           +  LF        D    PRSFPYSVKQQCWEKAEK++GRDP+RWR+D LGN+VFRKLVG
Sbjct: 93  MG-LFPG--SGYGDPNPEPRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVG 149

Query: 107 CPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGR 166
           CPGCLCHDYDHI+PYSKGGKSTLENCQVLQA VNRSKGN+T++SRS+LIQ+SSYCRV+GR
Sbjct: 150 CPGCLCHDYDHIVPYSKGGKSTLENCQVLQAKVNRSKGNKTDISRSELIQRSSYCRVAGR 209

Query: 167 DMDLLELSAYGNVRRGQDSGGCRIQ 191
           DMDL+EL+AYGNV+R   S GCRIQ
Sbjct: 210 DMDLIELTAYGNVQRAPTSSGCRIQ 234


>gi|242036789|ref|XP_002465789.1| hypothetical protein SORBIDRAFT_01g045830 [Sorghum bicolor]
 gi|241919643|gb|EER92787.1| hypothetical protein SORBIDRAFT_01g045830 [Sorghum bicolor]
          Length = 188

 Score =  250 bits (638), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 113/138 (81%), Positives = 128/138 (92%)

Query: 54  LRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH 113
           L  +   S + PRSF Y+VKQQCWEKAE+V GRDP+RWR+DALGN+VFRKLVGCPGCLCH
Sbjct: 51  LLTAPPPSASEPRSFSYAVKQQCWEKAERVPGRDPERWRRDALGNIVFRKLVGCPGCLCH 110

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLEL 173
           DYDHI+PYSKGGKSTLENCQVLQATVNRSKGN+TE+S+S+LIQKS+YCRVSGRDMDL+EL
Sbjct: 111 DYDHIVPYSKGGKSTLENCQVLQATVNRSKGNKTEISKSELIQKSAYCRVSGRDMDLVEL 170

Query: 174 SAYGNVRRGQDSGGCRIQ 191
           SAYGNVRRG DSGGC+IQ
Sbjct: 171 SAYGNVRRGPDSGGCKIQ 188


>gi|326534162|dbj|BAJ89431.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 180

 Score =  249 bits (637), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 120/186 (64%), Positives = 142/186 (76%), Gaps = 10/186 (5%)

Query: 6   EPKRRKSTATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANP 65
           +P RR+    RA  P P +  G T   ++       +   ++  L          S + P
Sbjct: 5   KPMRRR----RADHPPPPQSFGATARPTSPRSSTSASAVADLDELL------HTPSASEP 54

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           RSFP++VKQQCWEKAEKV GRDP+RWR+DALGN+VFRKLVGCPGCLCHDYDHILPYSKGG
Sbjct: 55  RSFPHAVKQQCWEKAEKVPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHILPYSKGG 114

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDS 185
           KSTLENCQVLQATVNRSKGN TE+S+S+L Q+S+YCRVSGRDMDL E+SA+GNVRRG DS
Sbjct: 115 KSTLENCQVLQATVNRSKGNNTEISKSELAQRSAYCRVSGRDMDLFEVSAFGNVRRGPDS 174

Query: 186 GGCRIQ 191
           GGC+IQ
Sbjct: 175 GGCKIQ 180


>gi|224104735|ref|XP_002313547.1| predicted protein [Populus trichocarpa]
 gi|222849955|gb|EEE87502.1| predicted protein [Populus trichocarpa]
          Length = 128

 Score =  248 bits (634), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 113/127 (88%), Positives = 124/127 (97%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PRSFPYSVKQQCWEKAEKV+GRDPDRWR+D LGN+VFRKLVGCPGCLCHDYDHI+PYSKG
Sbjct: 2   PRSFPYSVKQQCWEKAEKVKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 61

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
           GKS+LENCQVLQATVNRSKGNRTELSR++LI++SSYCRVSGRDMD +ELSAYGNVRR Q+
Sbjct: 62  GKSSLENCQVLQATVNRSKGNRTELSRAELIRRSSYCRVSGRDMDFVELSAYGNVRREQE 121

Query: 185 SGGCRIQ 191
           SGGCRIQ
Sbjct: 122 SGGCRIQ 128


>gi|293331837|ref|NP_001168386.1| uncharacterized protein LOC100382155 [Zea mays]
 gi|223947943|gb|ACN28055.1| unknown [Zea mays]
 gi|414865044|tpg|DAA43601.1| TPA: hypothetical protein ZEAMMB73_851898 [Zea mays]
          Length = 188

 Score =  248 bits (634), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 112/138 (81%), Positives = 128/138 (92%)

Query: 54  LRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH 113
           L  +   S + PRSF Y+VKQQCWEKAE++ GRDP+RWR+DALGN+VFRKLVGCPGCLCH
Sbjct: 51  LLTAPPPSASEPRSFSYAVKQQCWEKAERLPGRDPERWRRDALGNIVFRKLVGCPGCLCH 110

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLEL 173
           DYDHI+PYSKGGKSTLENCQVLQATVNRSKGN+TE+S+S+LIQKS+YCRVSGRDMDL+EL
Sbjct: 111 DYDHIVPYSKGGKSTLENCQVLQATVNRSKGNKTEISKSELIQKSAYCRVSGRDMDLVEL 170

Query: 174 SAYGNVRRGQDSGGCRIQ 191
           SAYGNVRRG DSGGC+IQ
Sbjct: 171 SAYGNVRRGPDSGGCKIQ 188


>gi|297815918|ref|XP_002875842.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata
           subsp. lyrata]
 gi|297321680|gb|EFH52101.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata
           subsp. lyrata]
          Length = 225

 Score =  248 bits (634), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 128/197 (64%), Positives = 153/197 (77%), Gaps = 16/197 (8%)

Query: 1   MRPNGEPKRRKSTA-TRASSPSPSR-----RRGETTVRSATLLDNELATEEEISTLFTDL 54
           M+P  +P RR++ A + +SSPS +R      RG     SATLLD      EE+  LF   
Sbjct: 39  MKP--DPARRRNRALSPSSSPSKTRPGSGEARGGKVKTSATLLD-----REEMG-LFPG- 89

Query: 55  RISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
                D    PRSFPYSVKQQCWEKAEK++GRDP+RWR+D LGN+VFRKLVGCPGCLCHD
Sbjct: 90  -SGYDDPNPEPRSFPYSVKQQCWEKAEKIKGRDPERWRRDPLGNIVFRKLVGCPGCLCHD 148

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELS 174
           YDHI+PYSKGGKS LENCQVLQA VNRSKGN+T++SR++LIQ+SSYCRV+GRDMDL+EL+
Sbjct: 149 YDHIVPYSKGGKSILENCQVLQAKVNRSKGNKTDISRAELIQRSSYCRVAGRDMDLIELT 208

Query: 175 AYGNVRRGQDSGGCRIQ 191
           AYGNV+R  +S GCRIQ
Sbjct: 209 AYGNVQRAPESSGCRIQ 225


>gi|357113864|ref|XP_003558721.1| PREDICTED: uncharacterized protein LOC100825547 [Brachypodium
           distachyon]
          Length = 188

 Score =  247 bits (630), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 117/172 (68%), Positives = 137/172 (79%)

Query: 20  PSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEK 79
           P PS+  G T   ++       +     + L   L  +   S + PRSF Y+VKQQCWEK
Sbjct: 17  PPPSQSFGATARPNSPRSPTSASAASVSADLDGLLLTAPPPSASEPRSFSYAVKQQCWEK 76

Query: 80  AEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATV 139
           AEKV GRDP+RWR+DALGN+VFRKLVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQATV
Sbjct: 77  AEKVPGRDPERWRRDALGNMVFRKLVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQATV 136

Query: 140 NRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           NRSKGN+TE+S+S+L Q+S+YCRVSGRDMDL ELSA+GNVRRG DSGGC+IQ
Sbjct: 137 NRSKGNKTEISKSELAQRSAYCRVSGRDMDLFELSAFGNVRRGPDSGGCKIQ 188


>gi|334185796|ref|NP_001190027.1| HNH endonuclease [Arabidopsis thaliana]
 gi|332644768|gb|AEE78289.1| HNH endonuclease [Arabidopsis thaliana]
          Length = 215

 Score =  239 bits (610), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 131/225 (58%), Positives = 153/225 (68%), Gaps = 44/225 (19%)

Query: 1   MRPNGEPKRRKSTATRASSPS----------PSRRRGET----TVRSATLLDNELATEEE 46
           M+P  +P RR++ A  +SSPS          P+R  GE        SATLLD      EE
Sbjct: 1   MKP--DPTRRRNRALSSSSPSKTRPELSSGSPNRNTGEARGGKVKTSATLLD-----REE 53

Query: 47  ISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG 106
           +  LF        D    PRSFPYSVKQQCWEKAEK++GRDP+RWR+D LGN+VFRKLVG
Sbjct: 54  MG-LFPG--SGYGDPNPEPRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVG 110

Query: 107 CPGCLCHDYDHILPYS--------------------KGGKSTLENCQVLQATVNRSKGNR 146
           CPGCLCHDYDHI+PYS                    KGGKSTLENCQVLQA VNRSKGN+
Sbjct: 111 CPGCLCHDYDHIVPYSKDMCPLEHAFDEPHKDFLDLKGGKSTLENCQVLQAKVNRSKGNK 170

Query: 147 TELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           T++SRS+LIQ+SSYCRV+GRDMDL+EL+AYGNV+R   S GCRIQ
Sbjct: 171 TDISRSELIQRSSYCRVAGRDMDLIELTAYGNVQRAPTSSGCRIQ 215


>gi|116794423|gb|ABK27138.1| unknown [Picea sitchensis]
          Length = 200

 Score =  239 bits (609), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 116/186 (62%), Positives = 135/186 (72%), Gaps = 22/186 (11%)

Query: 6   EPKRRKSTATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANP 65
           +P R+KS A  A       RR  T + +A + + E    EE S               NP
Sbjct: 37  KPWRKKSAAATA-------RR--TVIETAGIDEEEQVVGEESSEY-------------NP 74

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R+FPY +KQ CWEKA+K++GRDPDRWR+DALGN VFRKLVGC GCLCHDYDHI PYSKGG
Sbjct: 75  RAFPYHIKQACWEKADKIKGRDPDRWRRDALGNTVFRKLVGCSGCLCHDYDHIHPYSKGG 134

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDS 185
           KST+ENCQVLQ   NR+K N T++S +DLIQKS+YCRVSGRDMD LELSAYGNV RGQ+S
Sbjct: 135 KSTVENCQVLQGAANRAKSNHTDMSTTDLIQKSAYCRVSGRDMDFLELSAYGNVHRGQES 194

Query: 186 GGCRIQ 191
           GGCRIQ
Sbjct: 195 GGCRIQ 200


>gi|168048449|ref|XP_001776679.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671971|gb|EDQ58515.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 129

 Score =  189 bits (479), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 86/129 (66%), Positives = 102/129 (79%), Gaps = 3/129 (2%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           RSFP+ VK+ CW KAE V+GRDP RWR+DALGNVVFRKLVGC GCLCHDYDHI+PYSKGG
Sbjct: 1   RSFPHDVKEACWAKAETVKGRDPARWRRDALGNVVFRKLVGCQGCLCHDYDHIVPYSKGG 60

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDS 185
            STL+NCQVLQA  NR KG R  +  +++ Q S+YC ++GRDMD +ELSAYGNV   +D 
Sbjct: 61  PSTLDNCQVLQAPANRFKGARINVPVTEMQQHSAYCSLTGRDMDFVELSAYGNVNHNKDD 120

Query: 186 G---GCRIQ 191
               GCR+Q
Sbjct: 121 EGGIGCRMQ 129


>gi|359491703|ref|XP_003634310.1| PREDICTED: uncharacterized protein LOC100262165 isoform 2 [Vitis
           vinifera]
          Length = 167

 Score =  182 bits (463), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 84/121 (69%), Positives = 98/121 (80%), Gaps = 1/121 (0%)

Query: 20  PSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEK 79
           PS SR     +  S +LLD    TE+E +TLF +L+IS  +   NPRSFPYSVKQQCWEK
Sbjct: 29  PSGSRSTTRLSPSSTSLLDTPFITEDEDATLFDNLKIS-PNPNFNPRSFPYSVKQQCWEK 87

Query: 80  AEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATV 139
           AEK++GRDPDRWR+D LGN+VFRKLVGCPGCLCHDYDHI+PYSKGGKSTLENCQVLQ  +
Sbjct: 88  AEKIKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKGGKSTLENCQVLQENM 147

Query: 140 N 140
           +
Sbjct: 148 D 148


>gi|224119042|ref|XP_002317971.1| predicted protein [Populus trichocarpa]
 gi|222858644|gb|EEE96191.1| predicted protein [Populus trichocarpa]
          Length = 174

 Score =  140 bits (352), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 61/125 (48%), Positives = 87/125 (69%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PR F   VK+ CW+ AE V GR P+RWRKDA GN+V ++   C GCLC++YDHILP+SKG
Sbjct: 24  PRFFDKMVKKMCWQNAEIVPGRHPERWRKDAAGNIVCKRFCNCHGCLCYEYDHILPFSKG 83

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
           G+ST++NCQ+LQ  VNR K N+ EL ++ L   S     + +++D++E++ YG+V R  +
Sbjct: 84  GESTVDNCQILQTRVNRFKSNKQELDKTQLKGYSCEINFTDKELDIIEMAVYGDVIRPGN 143

Query: 185 SGGCR 189
              CR
Sbjct: 144 QCRCR 148


>gi|118481005|gb|ABK92456.1| unknown [Populus trichocarpa]
          Length = 180

 Score =  140 bits (352), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 61/125 (48%), Positives = 87/125 (69%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PR F   VK+ CW+ AE V GR P+RWRKDA GN+V ++   C GCLC++YDHILP+SKG
Sbjct: 29  PRFFDKMVKKMCWQNAEIVPGRHPERWRKDAAGNIVCKRFCNCHGCLCYEYDHILPFSKG 88

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
           G+ST++NCQ+LQ  VNR K N+ EL ++ L   S     + +++D++E++ YG+V R  +
Sbjct: 89  GESTVDNCQILQTRVNRFKSNKQELDKTQLKGYSCEINFTDKELDIIEMAVYGDVIRPGN 148

Query: 185 SGGCR 189
              CR
Sbjct: 149 QCRCR 153


>gi|116783732|gb|ABK23066.1| unknown [Picea sitchensis]
          Length = 168

 Score =  139 bits (351), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 58/116 (50%), Positives = 88/116 (75%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R F   ++  CW+KA+ V GR P+RWRKD  GN++ R+   C GCLC++YDHI+P+SKGG
Sbjct: 20  RFFDEKMRNLCWQKADVVAGRHPERWRKDGAGNIICRRFGHCEGCLCYEYDHIIPFSKGG 79

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           ++T+ENCQ+LQ+ VNRSKGN+ +L+R+ L Q S   + + +++D++E++ YGNV R
Sbjct: 80  ETTVENCQILQSRVNRSKGNKDQLNRTVLEQYSCDLKFTDKELDVIEMAVYGNVMR 135


>gi|116780247|gb|ABK21604.1| unknown [Picea sitchensis]
          Length = 166

 Score =  139 bits (350), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 60/124 (48%), Positives = 90/124 (72%)

Query: 58  SADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
           SA +    R F   ++  CW+KA+ V GR P+RWRKD  GN+V R+   C GCLC++YDH
Sbjct: 10  SARNQKRARFFDEKMRNLCWQKADVVPGRHPERWRKDGAGNIVCRRFWHCDGCLCYEYDH 69

Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYG 177
           I+P+SKGG++T+ENCQ+LQ+ VNRSKGN+ +L ++ L Q S   + + +++D++E++ YG
Sbjct: 70  IIPFSKGGETTVENCQILQSRVNRSKGNKDQLDKTVLEQYSCDLKFTDKELDVIEMAVYG 129

Query: 178 NVRR 181
           NV R
Sbjct: 130 NVMR 133


>gi|42572595|ref|NP_974393.1| HNH endonuclease [Arabidopsis thaliana]
 gi|332644766|gb|AEE78287.1| HNH endonuclease [Arabidopsis thaliana]
          Length = 176

 Score =  139 bits (350), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 77/137 (56%), Positives = 90/137 (65%), Gaps = 24/137 (17%)

Query: 1   MRPNGEPKRRKSTATRASSPS----------PSRRRGET----TVRSATLLDNELATEEE 46
           M+P  +P RR++ A  +SSPS          P+R  GE        SATLLD      EE
Sbjct: 40  MKP--DPTRRRNRALSSSSPSKTRPELSSGSPNRNTGEARGGKVKTSATLLD-----REE 92

Query: 47  ISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG 106
           +  LF        D    PRSFPYSVKQQCWEKAEK++GRDP+RWR+D LGN+VFRKLVG
Sbjct: 93  MG-LFPG--SGYGDPNPEPRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVG 149

Query: 107 CPGCLCHDYDHILPYSK 123
           CPGCLCHDYDHI+PYSK
Sbjct: 150 CPGCLCHDYDHIVPYSK 166


>gi|452824505|gb|EME31507.1| endonuclease [Galdieria sulphuraria]
          Length = 146

 Score =  138 bits (347), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 65/131 (49%), Positives = 85/131 (64%), Gaps = 1/131 (0%)

Query: 59  ADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHI 118
           +D   N R F   VK  CWEKA  V GRDPDRWR DA  N V ++L  C GCLCH+YDH+
Sbjct: 5   SDHFTNDRIFSNRVKAACWEKATPVPGRDPDRWRLDAFRNPVCKRLTSCEGCLCHEYDHV 64

Query: 119 LPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGN 178
           +PYS+GG ST+ENCQ+LQ  +NR K +R +L+   L   S     S R++DL+E++ YGN
Sbjct: 65  IPYSQGGASTVENCQILQTRINRLKADR-QLTAEQLESFSCEITFSERELDLIEMAVYGN 123

Query: 179 VRRGQDSGGCR 189
           V+R      C+
Sbjct: 124 VQRDHFRCRCK 134


>gi|168006043|ref|XP_001755719.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693038|gb|EDQ79392.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 180

 Score =  137 bits (345), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 59/116 (50%), Positives = 86/116 (74%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R F   ++ QCW+KA+ V GR P+RWRKD  GN+V R+L  C GCLC++YDH+ P+SKGG
Sbjct: 28  RIFDGKMRNQCWQKADVVPGRHPERWRKDVAGNIVCRRLTRCEGCLCYEYDHVRPFSKGG 87

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           ++T++NCQ+LQ+ VNR K NR++L+  DL Q S   + + +++DL+E+  YGNV R
Sbjct: 88  QTTVDNCQILQSRVNRMKSNRSDLASDDLKQFSCDLKFTDKELDLIEMGVYGNVIR 143


>gi|242039281|ref|XP_002467035.1| hypothetical protein SORBIDRAFT_01g018630 [Sorghum bicolor]
 gi|241920889|gb|EER94033.1| hypothetical protein SORBIDRAFT_01g018630 [Sorghum bicolor]
          Length = 180

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 58/125 (46%), Positives = 88/125 (70%)

Query: 57  SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
           + A     PRSF    +  CW KA  + GR P+RWR DA+GNVV R+  GC GCLC++YD
Sbjct: 22  AQATEKERPRSFDEKTRTACWRKAAVLAGRHPERWRLDAVGNVVCRRFWGCHGCLCYEYD 81

Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAY 176
           HI+P+SKGG+ST+ENCQ+LQ  VNRSK ++T + ++++   S   + + +++D++E++ Y
Sbjct: 82  HIVPFSKGGESTVENCQILQTRVNRSKSDKTWVEQAEMRGFSCDIKFTDQELDIIEMAVY 141

Query: 177 GNVRR 181
           G+V R
Sbjct: 142 GDVIR 146


>gi|195651337|gb|ACG45136.1| endonuclease/ nucleic acid binding protein [Zea mays]
          Length = 180

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 58/125 (46%), Positives = 88/125 (70%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PRSF    +  CW KA  + GR P+RWR+DA+GNVV R+  GC GCLC++YDHI+P+SKG
Sbjct: 30  PRSFDEKTRTACWRKAAVLAGRHPERWRQDAVGNVVCRRFWGCHGCLCYEYDHIVPFSKG 89

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
           G+ST+ENCQ+LQ  VNRSK ++  ++++++   S     + +++D++E++ YG+V R   
Sbjct: 90  GESTVENCQILQTRVNRSKSDKAWVAQAEMQGFSCDIMFTDKELDIIEMAVYGDVIRPGK 149

Query: 185 SGGCR 189
              CR
Sbjct: 150 QCRCR 154


>gi|449440818|ref|XP_004138181.1| PREDICTED: uncharacterized protein LOC101204599 [Cucumis sativus]
 gi|449477195|ref|XP_004154957.1| PREDICTED: uncharacterized protein LOC101224154 [Cucumis sativus]
          Length = 181

 Score =  133 bits (334), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 60/125 (48%), Positives = 84/125 (67%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PR F    K  CW KA+ V GR P+RWRKDA GNVV ++   C GCLC +YDHI+PYSKG
Sbjct: 29  PRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKG 88

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
           G+ST +NCQ+LQ  VNR K N+ ++  S+L   S   + + +++D++E++ YG+V R  +
Sbjct: 89  GESTADNCQILQTRVNRFKSNKDDVDTSELKGYSCDVKFTDKELDIIEMAVYGDVIRPGN 148

Query: 185 SGGCR 189
              CR
Sbjct: 149 QCRCR 153


>gi|218184321|gb|EEC66748.1| hypothetical protein OsI_33113 [Oryza sativa Indica Group]
          Length = 178

 Score =  133 bits (334), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 59/133 (44%), Positives = 89/133 (66%)

Query: 57  SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
           S+A     PRSF    +  CW KA  V GR P+RWR+DA GNVV R+   C GCLC++YD
Sbjct: 20  SAAREKERPRSFDEKTRSACWRKAAVVMGRHPERWRQDAAGNVVCRRFWSCHGCLCYEYD 79

Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAY 176
           HI+P+SKGG+ST++NCQ+LQ  VNRSK ++  + ++++   S   + + +++D++E++ Y
Sbjct: 80  HIVPFSKGGESTVDNCQILQTRVNRSKSDKAWIEQAEMQGFSCDIKFTDKELDVIEMAVY 139

Query: 177 GNVRRGQDSGGCR 189
           G+V R      CR
Sbjct: 140 GDVIRPGKQCRCR 152


>gi|115481558|ref|NP_001064372.1| Os10g0335200 [Oryza sativa Japonica Group]
 gi|21671981|gb|AAM74343.1|AC115686_10 Putative HNH endonuclease [Oryza sativa Japonica Group]
 gi|23477807|gb|AAN34947.1| Putative HNH endonuclease [Oryza sativa Japonica Group]
 gi|31431266|gb|AAP53074.1| HNH endonuclease family protein, expressed [Oryza sativa Japonica
           Group]
 gi|113638981|dbj|BAF26286.1| Os10g0335200 [Oryza sativa Japonica Group]
 gi|125574417|gb|EAZ15701.1| hypothetical protein OsJ_31114 [Oryza sativa Japonica Group]
 gi|215693775|dbj|BAG88974.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215707241|dbj|BAG93701.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 178

 Score =  133 bits (334), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 59/133 (44%), Positives = 89/133 (66%)

Query: 57  SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
           S+A     PRSF    +  CW KA  V GR P+RWR+DA GNVV R+   C GCLC++YD
Sbjct: 20  SAAREKERPRSFDEKTRSACWRKAAVVMGRHPERWRQDAAGNVVCRRFWSCHGCLCYEYD 79

Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAY 176
           HI+P+SKGG+ST++NCQ+LQ  VNRSK ++  + ++++   S   + + +++D++E++ Y
Sbjct: 80  HIVPFSKGGESTVDNCQILQTRVNRSKSDKAWIEQAEMQGFSCDIKFTDKELDVIEMAVY 139

Query: 177 GNVRRGQDSGGCR 189
           G+V R      CR
Sbjct: 140 GDVIRPGKQCRCR 152


>gi|326513564|dbj|BAJ87801.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 175

 Score =  132 bits (332), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 55/117 (47%), Positives = 84/117 (71%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PRSF    +  CW KA  + GR P+RWR+DA GNVV R+   C GCLC++YDHI+P+SKG
Sbjct: 29  PRSFDEKTRTACWRKAAVLTGRHPERWRQDAAGNVVCRRFWSCHGCLCYEYDHIVPFSKG 88

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           G+ST+ENCQ+LQ  VNRSK ++  + ++++   S   + + +++D++E++ YG+V R
Sbjct: 89  GESTVENCQILQTRVNRSKSDKAWVEKAEMQGFSCDIKFTDKELDVIEMAVYGDVIR 145


>gi|229594535|ref|XP_001023127.3| HNH endonuclease family protein [Tetrahymena thermophila]
 gi|225566791|gb|EAS02882.3| HNH endonuclease family protein [Tetrahymena thermophila SB210]
          Length = 208

 Score =  132 bits (331), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 58/115 (50%), Positives = 76/115 (66%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PR F  S+K QCW  A    GR PDRWR DA+GN V R L  C G LCH+YDHI+P+SKG
Sbjct: 40  PRCFTKSMKDQCWNNATIFIGRHPDRWRLDAVGNPVLRGLTSCLGPLCHEYDHIVPFSKG 99

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNV 179
           GK+ + NCQ+LQ  VNR K N+T  S  +LI  SS  + + ++ D +E + YG++
Sbjct: 100 GKTEISNCQILQTMVNRYKSNKTNQSHKELIAVSSTNKFTNQEFDSIERAIYGDI 154


>gi|359491684|ref|XP_003634305.1| PREDICTED: uncharacterized protein LOC100852985 [Vitis vinifera]
 gi|297733935|emb|CBI15182.3| unnamed protein product [Vitis vinifera]
          Length = 167

 Score =  131 bits (330), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 59/132 (44%), Positives = 84/132 (63%)

Query: 58  SADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
           S D    PR F    K  CW KA+ V GR PDRWRKDA GN+V ++   C GCLC +YDH
Sbjct: 9   SRDKEERPRFFDPKAKSLCWAKADAVPGRHPDRWRKDAAGNIVCKRFCNCQGCLCFEYDH 68

Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYG 177
           I+P+SKGG+S  ENCQ+LQ  VNR K ++  + ++ L   S   + + +++D++E++ YG
Sbjct: 69  IIPFSKGGESVPENCQILQTRVNRFKSDKDNVDKTQLQGYSCDVKFTDKELDIIEMAVYG 128

Query: 178 NVRRGQDSGGCR 189
           +V R  +   CR
Sbjct: 129 DVIRPGNQCRCR 140


>gi|302811781|ref|XP_002987579.1| hypothetical protein SELMODRAFT_126371 [Selaginella moellendorffii]
 gi|300144733|gb|EFJ11415.1| hypothetical protein SELMODRAFT_126371 [Selaginella moellendorffii]
          Length = 174

 Score =  131 bits (329), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 59/116 (50%), Positives = 83/116 (71%), Gaps = 1/116 (0%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R F   +++QCWEKAE V GR P+RWRKD  GNVV RK +GC GC+C+++DHI P+SKGG
Sbjct: 21  RLFDEKMRRQCWEKAEVVPGRHPERWRKDGAGNVVCRKFIGCEGCMCYEFDHIRPFSKGG 80

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           ++ + NCQ+LQ+ VNR K N+ +     L Q S   + + R++DL+E++ YGNV R
Sbjct: 81  ETDVSNCQILQSRVNRLKSNK-DADEHLLQQYSCDLKFTARELDLVEMAVYGNVIR 135


>gi|357146087|ref|XP_003573871.1| PREDICTED: uncharacterized protein LOC100831965 [Brachypodium
           distachyon]
          Length = 180

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 54/127 (42%), Positives = 86/127 (67%)

Query: 55  RISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
           R+        PRSF    +  CW KA  + GR P+RWR+D  GN+V R+   C GCLC++
Sbjct: 20  RVPKPKEKERPRSFDEKTRAACWRKAAVLAGRHPERWRQDTAGNIVCRRFWSCHGCLCYE 79

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELS 174
           YDHI+P+SKGG+ST+ENCQ+LQ  VNRSK +++ + ++++   S   + + +++D++E++
Sbjct: 80  YDHIVPFSKGGESTVENCQILQTRVNRSKSDKSWVEKAEMQGASCDIKFTDKELDVIEMA 139

Query: 175 AYGNVRR 181
            YG+V R
Sbjct: 140 VYGDVIR 146


>gi|302822371|ref|XP_002992844.1| hypothetical protein SELMODRAFT_136010 [Selaginella moellendorffii]
 gi|300139392|gb|EFJ06134.1| hypothetical protein SELMODRAFT_136010 [Selaginella moellendorffii]
          Length = 174

 Score =  130 bits (326), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 58/116 (50%), Positives = 83/116 (71%), Gaps = 1/116 (0%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R F   +++QCW+KAE V GR P+RWRKD  GNVV RK +GC GC+C+++DHI P+SKGG
Sbjct: 21  RLFDEKMRRQCWQKAEVVPGRHPERWRKDGAGNVVCRKFIGCEGCMCYEFDHIRPFSKGG 80

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           ++ + NCQ+LQ+ VNR K N+ +     L Q S   + + R++DL+E++ YGNV R
Sbjct: 81  ETDVSNCQILQSRVNRLKSNK-DADEHLLQQYSCDLKFTARELDLVEMAVYGNVIR 135


>gi|351722082|ref|NP_001235440.1| uncharacterized protein LOC100527904 [Glycine max]
 gi|255633516|gb|ACU17116.1| unknown [Glycine max]
          Length = 162

 Score =  129 bits (325), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 58/133 (43%), Positives = 84/133 (63%)

Query: 57  SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
           S  D    PR F  + K  CW KA+ V GR P+RWRKDA GNVV ++   C GCLC++YD
Sbjct: 9   SRGDGEKRPRFFDSNAKAICWSKADTVPGRHPERWRKDAAGNVVCKRFFNCIGCLCYEYD 68

Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAY 176
           HI+P+SKGG+ST +NCQ+LQ+ VNR K ++  +    L   S     + +++D++E++ Y
Sbjct: 69  HIIPFSKGGESTADNCQILQSRVNRLKSDKYNIDSDQLKDYSCEVNFTDKELDIIEMAVY 128

Query: 177 GNVRRGQDSGGCR 189
            +V R  +   CR
Sbjct: 129 SDVMRPGNQCRCR 141


>gi|255547311|ref|XP_002514713.1| endonuclease, putative [Ricinus communis]
 gi|223546317|gb|EEF47819.1| endonuclease, putative [Ricinus communis]
          Length = 173

 Score =  125 bits (315), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 57/125 (45%), Positives = 79/125 (63%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PR F    K  CW  AE V GR P+RWRKDA GN+V ++L  C GCLC  YDHI+P+SKG
Sbjct: 22  PRFFSNKAKNICWANAETVPGRHPERWRKDAAGNIVCKRLGNCQGCLCFQYDHIIPFSKG 81

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQD 184
           G+ST  NCQ+LQ  VN  K +   + +S L   S   + + +++D++E++ YG+V R  +
Sbjct: 82  GESTAANCQILQTRVNNYKADAEAVDKSQLKGYSCDLKFTDKELDIIEMAVYGDVIRPGN 141

Query: 185 SGGCR 189
              CR
Sbjct: 142 QCRCR 146


>gi|414865043|tpg|DAA43600.1| TPA: hypothetical protein ZEAMMB73_851898 [Zea mays]
          Length = 141

 Score =  125 bits (313), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 52/70 (74%), Positives = 61/70 (87%)

Query: 54  LRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH 113
           L  +   S + PRSF Y+VKQQCWEKAE++ GRDP+RWR+DALGN+VFRKLVGCPGCLCH
Sbjct: 51  LLTAPPPSASEPRSFSYAVKQQCWEKAERLPGRDPERWRRDALGNIVFRKLVGCPGCLCH 110

Query: 114 DYDHILPYSK 123
           DYDHI+PYSK
Sbjct: 111 DYDHIVPYSK 120


>gi|297850254|ref|XP_002893008.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338850|gb|EFH69267.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 179

 Score =  124 bits (311), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 54/117 (46%), Positives = 77/117 (65%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PR F    K +CW  A+ V GR P+RWRKDA GN+V ++   C GCLC +YDHI+PYSKG
Sbjct: 33  PRFFDGKAKNKCWANADTVPGRHPERWRKDAAGNIVCKRFGNCNGCLCFEYDHIVPYSKG 92

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           G+S  ENCQ+LQ  VNR K  +  +  + L   S   + + +++D++E++ YG+V R
Sbjct: 93  GESIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDIIEMAVYGDVLR 149


>gi|15221835|ref|NP_173303.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana]
 gi|6730716|gb|AAF27111.1|AC011809_20 Unknown protein [Arabidopsis thaliana]
 gi|26451201|dbj|BAC42704.1| unknown protein [Arabidopsis thaliana]
 gi|28973585|gb|AAO64117.1| unknown protein [Arabidopsis thaliana]
 gi|332191624|gb|AEE29745.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana]
          Length = 186

 Score =  123 bits (308), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 76/117 (64%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
           PR F    K +CW  A+ V GR P+RWRKD  GN+V ++   C GCLC +YDHI+PYSKG
Sbjct: 40  PRFFDGKAKNKCWANADIVPGRHPERWRKDVAGNIVCKRFGNCNGCLCFEYDHIVPYSKG 99

Query: 125 GKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           G+S  ENCQ+LQ  VNR K  +  +  + L   S   + + +++D++E++ YG+V R
Sbjct: 100 GESIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDVIEMAVYGDVLR 156


>gi|452821821|gb|EME28847.1| endonuclease isoform 1 [Galdieria sulphuraria]
          Length = 199

 Score =  119 bits (297), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 58/123 (47%), Positives = 78/123 (63%), Gaps = 9/123 (7%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           + F +  K+ CW KA+ VRGRDP RWR+D  GNV+ +KL GC G  C D+DH++P SKGG
Sbjct: 59  KQFQHIDKEACWLKADAVRGRDPSRWRRDLAGNVICKKLRGCRGPFCMDFDHVVPLSKGG 118

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSG-------RDMDLLELSAYGN 178
           +S L+NCQVLQ T NR KG+R     +D+  K  Y  +S        R +D++E   YG+
Sbjct: 119 ESVLDNCQVLQTTANRHKGDRLVEPTTDI--KKYYKEISDRRVLPLQRHLDIVEFYVYGD 176

Query: 179 VRR 181
           V R
Sbjct: 177 VIR 179


>gi|452821820|gb|EME28846.1| endonuclease isoform 2 [Galdieria sulphuraria]
          Length = 190

 Score =  116 bits (291), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 56/119 (47%), Positives = 75/119 (63%), Gaps = 9/119 (7%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           + F +  K+ CW KA+ VRGRDP RWR+D  GNV+ +KL GC G  C D+DH++P SKGG
Sbjct: 59  KQFQHIDKEACWLKADAVRGRDPSRWRRDLAGNVICKKLRGCRGPFCMDFDHVVPLSKGG 118

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSG-------RDMDLLELSAYG 177
           +S L+NCQVLQ T NR KG+R     +D+  K  Y  +S        R +D++E   YG
Sbjct: 119 ESVLDNCQVLQTTANRHKGDRLVEPTTDI--KKYYKEISDRRVLPLQRHLDIVEFYVYG 175


>gi|449015785|dbj|BAM79187.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 231

 Score =  115 bits (288), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 54/119 (45%), Positives = 76/119 (63%), Gaps = 3/119 (2%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           RSF  + ++  WE+A+ V GRDP+RWR D  GN V R L GC GC C+++DH +PYS GG
Sbjct: 26  RSFTEAQRKAAWERAKPVPGRDPERWRYDVYGNPVLRPLHGCLGCFCYEFDHRVPYSLGG 85

Query: 126 KSTLENCQVLQATVNRSKGNRTE---LSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
            ++L+NC +LQ  +NR K N  E   +SR  + Q +     +  DMD +E + YG+VRR
Sbjct: 86  ATSLDNCDILQTRINRLKSNALEGVNISREQIRQYACEDVSNVIDMDAIEFAVYGDVRR 144


>gi|145521759|ref|XP_001446729.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414218|emb|CAK79332.1| unnamed protein product [Paramecium tetraurelia]
          Length = 139

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 52/117 (44%), Positives = 74/117 (63%)

Query: 61  STANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILP 120
           S  NPR F    +++CW  A+ + GR+PDRWR DA+GN V + L  C G LC+ +DHI+P
Sbjct: 23  SPYNPRRFDKLAREECWNNAQIMEGRNPDRWRLDAVGNPVCKALNACRGPLCYQFDHIVP 82

Query: 121 YSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYG 177
           YSKGG+S  +NCQ+LQ+ VN  K N+  +S+ +L Q S     +    D +E + YG
Sbjct: 83  YSKGGESISKNCQLLQSFVNIYKSNKENISKDELKQVSPQLNFTSEVFDAIEYAIYG 139


>gi|363814422|ref|NP_001242848.1| uncharacterized protein LOC100775923 [Glycine max]
 gi|255641881|gb|ACU21209.1| unknown [Glycine max]
          Length = 121

 Score =  114 bits (285), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 50/98 (51%), Positives = 66/98 (67%)

Query: 57  SSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
           S  D    PR F  + K  CW KA+ V GR P+RWRKDA GNVV ++   C GCLC++YD
Sbjct: 9   SRGDGEKRPRFFDSNAKAICWGKADTVPGRHPERWRKDAAGNVVCKRFFNCIGCLCYEYD 68

Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           HI+P+SKGG+ST +NCQ+LQ+ VNR K ++  +    L
Sbjct: 69  HIIPFSKGGESTADNCQILQSRVNRFKSDKYNIDSEQL 106


>gi|297733966|emb|CBI15213.3| unnamed protein product [Vitis vinifera]
          Length = 99

 Score =  109 bits (272), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 51/55 (92%), Positives = 54/55 (98%)

Query: 137 ATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCRIQ 191
           ATVNRSKGNRTE+S++DLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGC IQ
Sbjct: 45  ATVNRSKGNRTEISKADLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCTIQ 99


>gi|147866359|emb|CAN79860.1| hypothetical protein VITISV_035905 [Vitis vinifera]
          Length = 222

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 51/55 (92%), Positives = 54/55 (98%)

Query: 135 LQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGCR 189
           LQATVNRSKGNRTE+S++DLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGG R
Sbjct: 4   LQATVNRSKGNRTEISKADLIQKSSYCRVSGRDMDLLELSAYGNVRRGQDSGGSR 58


>gi|145512195|ref|XP_001442014.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409286|emb|CAK74617.1| unnamed protein product [Paramecium tetraurelia]
          Length = 167

 Score =  106 bits (265), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/117 (42%), Positives = 72/117 (61%)

Query: 61  STANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILP 120
           S  NPR F    +++C   A+ + GR+PDRWR DA+GN V + L  C G LC+ +DHI+P
Sbjct: 51  SPYNPRRFDKLAREECRNNAQIMEGRNPDRWRLDAVGNPVCKALNSCRGSLCYQFDHIVP 110

Query: 121 YSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYG 177
           YSKGG+S   NCQ+LQ+ VN  K N+  +S+ +L   S   + +    D +E + YG
Sbjct: 111 YSKGGESISINCQLLQSFVNIYKSNKENISKDELKHVSPQLKFTPVQFDAIEYAIYG 167


>gi|357113854|ref|XP_003558716.1| PREDICTED: uncharacterized protein LOC100824123 [Brachypodium
           distachyon]
          Length = 246

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 51/118 (43%), Positives = 70/118 (59%), Gaps = 1/118 (0%)

Query: 64  NPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSK 123
           N   FP +V++ CW  AE+V GR P RWRKD  GNV+ +K     G     YDHI P S+
Sbjct: 67  NEEKFPRAVRKWCWINAEEVPGRHPGRWRKDPYGNVLCKKQTTTLGLSYFHYDHIHPRSR 126

Query: 124 GGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           GGK   ENCQ+L A +N  K N  + +R +++  S +   S  ++DL+E + YGNV R
Sbjct: 127 GGKGVKENCQLLLAYLNIFKSNDIK-TREEMVDNSYHITYSNEELDLIECNVYGNVVR 183


>gi|340502170|gb|EGR28883.1| hypothetical protein IMG5_167370 [Ichthyophthirius multifiliis]
          Length = 121

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 43/96 (44%), Positives = 63/96 (65%)

Query: 85  GRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKG 144
           GR P+RWR DA+GN V + L  C G LC+ YDHI+PYSKGG++ +ENCQ+LQ+ VN+ K 
Sbjct: 3   GRHPERWRLDAVGNPVIKSLTSCNGPLCYQYDHIVPYSKGGETQIENCQLLQSIVNKHKS 62

Query: 145 NRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVR 180
           N+ + + S+L   S     +  + D +E + YG  +
Sbjct: 63  NKIDATYSELKNVSPKDDFTNFEYDCIEKAIYGTYQ 98


>gi|197306242|gb|ACH59472.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
          Length = 84

 Score = 93.6 bits (231), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 41/79 (51%), Positives = 56/79 (70%), Gaps = 3/79 (3%)

Query: 57  SSADSTANP---RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH 113
           SS D+ +N    R F   ++  CW+KA+ V GR P+RWRKD  GN+V R+   C GCLC+
Sbjct: 6   SSEDNPSNQKKVRFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCY 65

Query: 114 DYDHILPYSKGGKSTLENC 132
           +YDHI+P+SKGG +T+ENC
Sbjct: 66  EYDHIIPFSKGGGTTVENC 84


>gi|449018629|dbj|BAM82031.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 188

 Score = 92.4 bits (228), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 48/121 (39%), Positives = 72/121 (59%), Gaps = 6/121 (4%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R F ++ +  CW  A+ V GR P RWR D +GN+V R L GC G LC++YDH  P+ KGG
Sbjct: 27  RLFSHTERNICWRNAQVVPGRHPRRWRFDRVGNLVCRALHGCNGPLCYEYDHRAPWGKGG 86

Query: 126 KSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGR-----DMDLLELSAYGNVR 180
           +S   NC VLQ + NR K ++ E   ++ + + ++ RV  R     ++D +E   YG++ 
Sbjct: 87  RSDSTNCDVLQTSANRWKSDQPESRVTEKLLR-AHSRVLSRSLSQEELDTIEYVIYGDIL 145

Query: 181 R 181
           R
Sbjct: 146 R 146


>gi|197306216|gb|ACH59459.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306218|gb|ACH59460.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306220|gb|ACH59461.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306222|gb|ACH59462.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306224|gb|ACH59463.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306226|gb|ACH59464.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306228|gb|ACH59465.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306230|gb|ACH59466.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306232|gb|ACH59467.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306234|gb|ACH59468.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306238|gb|ACH59470.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306244|gb|ACH59473.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306246|gb|ACH59474.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306248|gb|ACH59475.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306250|gb|ACH59476.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306252|gb|ACH59477.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306254|gb|ACH59478.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306256|gb|ACH59479.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306258|gb|ACH59480.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306262|gb|ACH59482.1| HNH endonuclease domain-containing protein [Pseudotsuga macrocarpa]
          Length = 84

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 37/67 (55%), Positives = 50/67 (74%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R F   ++  CW+KA+ V GR P+RWRKD  GN+V R+   C GCLC++YDHI+P+SKGG
Sbjct: 18  RFFDEKMRNLCWQKADVVAGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIPFSKGG 77

Query: 126 KSTLENC 132
            +T+ENC
Sbjct: 78  GTTVENC 84


>gi|197306240|gb|ACH59471.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
 gi|197306260|gb|ACH59481.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
          Length = 84

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 37/67 (55%), Positives = 50/67 (74%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R F   ++  CW+KA+ V GR P+RWRKD  GN+V R+   C GCLC++YDHI+P+SKGG
Sbjct: 18  RFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIPFSKGG 77

Query: 126 KSTLENC 132
            +T+ENC
Sbjct: 78  GTTVENC 84


>gi|197306236|gb|ACH59469.1| HNH endonuclease domain-containing protein [Pseudotsuga menziesii]
          Length = 84

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 37/67 (55%), Positives = 50/67 (74%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           R F   ++  CW+KA+ V GR P+RWRKD  GN+V R+   C GCLC++YDHI+P+SKGG
Sbjct: 18  RFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIPFSKGG 77

Query: 126 KSTLENC 132
            +T+ENC
Sbjct: 78  GTTVENC 84


>gi|226504958|ref|NP_001144283.1| uncharacterized protein LOC100277163 [Zea mays]
 gi|195639630|gb|ACG39283.1| hypothetical protein [Zea mays]
          Length = 139

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 34/59 (57%), Positives = 44/59 (74%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSK 123
           PRSF    +  CW KA  + GR P+RWR+DA+GNVV R+  GC GCLC++YDHI+P+SK
Sbjct: 30  PRSFDEKTRTACWRKAAVLSGRHPERWRQDAVGNVVCRRFWGCHGCLCYEYDHIVPFSK 88


>gi|38605769|emb|CAE05870.3| OSJNBa0044K18.12 [Oryza sativa Japonica Group]
          Length = 204

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/130 (35%), Positives = 61/130 (46%), Gaps = 16/130 (12%)

Query: 59  ADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHI 118
           A + A PRSF   V++ CW  A KV GR  DRWR DA GN V + L G            
Sbjct: 17  ASAPAKPRSFSADVREACWNNAPKVIGRSGDRWRLDANGNPVLKALNG------------ 64

Query: 119 LPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGN 178
                GG +   NCQ++Q   NRSK NR  L +  +   S     +   +D +E++ Y +
Sbjct: 65  ---KHGGPNDQSNCQIIQTGANRSKSNRI-LRKDQMKGLSRTMNWTHPHLDTIEIAVYAS 120

Query: 179 VRRGQDSGGC 188
           V R   S  C
Sbjct: 121 VERDDLSCTC 130


>gi|125548724|gb|EAY94546.1| hypothetical protein OsI_16320 [Oryza sativa Indica Group]
          Length = 232

 Score = 68.9 bits (167), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 60/130 (46%), Gaps = 16/130 (12%)

Query: 59  ADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHI 118
           A + A PRSF   V++ CW  A KV GR  DR R DA GN V + L G            
Sbjct: 17  ASAPAKPRSFSADVREACWNNAPKVIGRSGDRCRLDANGNPVLKALNG------------ 64

Query: 119 LPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGN 178
                GG +   NCQ++Q   NRSK NR  L +  +   S     +   +D +E++ Y +
Sbjct: 65  ---KHGGPNDQSNCQIIQTGANRSKSNRI-LRKDQMKGLSRTMNWTHPHLDTIEIAVYAS 120

Query: 179 VRRGQDSGGC 188
           V R   S  C
Sbjct: 121 VERDDLSCTC 130


>gi|115345574|ref|YP_771756.1| hypothetical protein RD1_A0048 [Roseobacter denitrificans OCh 114]
 gi|115292895|gb|ABI93348.1| OSJNBa0044K18.12 [Roseobacter denitrificans OCh 114]
          Length = 96

 Score = 67.0 bits (162), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 45/83 (54%), Gaps = 3/83 (3%)

Query: 68  FPYSVK---QQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKG 124
            PYS     Q  W KA  V+G++P+ +R+DA GNV+++        +    DH+ P SKG
Sbjct: 1   MPYSTNPKAQHVWNKARPVKGKNPNLYRRDAQGNVIYKPAYNRDSAMGWQVDHVWPRSKG 60

Query: 125 GKSTLENCQVLQATVNRSKGNRT 147
           G    +N Q LQ   N+ K N+T
Sbjct: 61  GSDARKNLQALQTGANKRKSNKT 83


>gi|222629057|gb|EEE61189.1| hypothetical protein OsJ_15190 [Oryza sativa Japonica Group]
          Length = 118

 Score = 66.2 bits (160), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 44/88 (50%), Gaps = 15/88 (17%)

Query: 59  ADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHI 118
           A + A PRSF   V++ CW  A KV GR  DRWR DA GN V + L G            
Sbjct: 17  ASAPAKPRSFSADVREACWNNAPKVIGRSGDRWRLDANGNPVLKALNG------------ 64

Query: 119 LPYSKGGKSTLENCQVLQATVNRSKGNR 146
                GG +   NCQ++Q   NRSK NR
Sbjct: 65  ---KHGGPNDQSNCQIIQTGANRSKSNR 89


>gi|413934346|gb|AFW68897.1| hypothetical protein ZEAMMB73_497763 [Zea mays]
          Length = 132

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 45/68 (66%)

Query: 122 SKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRR 181
           S GG+ST+ENCQ+LQ  VNRSK ++  ++++++   S     + +++D++E++ YG+V R
Sbjct: 39  STGGESTVENCQILQTRVNRSKSDKAWVAQAEMQGFSCDIMFTDKELDIIEMAVYGDVIR 98

Query: 182 GQDSGGCR 189
                 CR
Sbjct: 99  PGKQCRCR 106


>gi|403340432|gb|EJY69501.1| hypothetical protein OXYTRI_09762 [Oxytricha trifallax]
          Length = 153

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 38/79 (48%)

Query: 33  SATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWR 92
           S  L +  L T  +      ++R    D  AN R F    K  CW+++  +  RDP RWR
Sbjct: 28  SPLLQNTSLDTAIQDVVFDKNIRKFVEDEKANNRIFTRQQKDVCWQQSTTIPIRDPKRWR 87

Query: 93  KDALGNVVFRKLVGCPGCL 111
            D LGN VF  L GC G L
Sbjct: 88  LDPLGNPVFYTLRGCHGAL 106


>gi|222617839|gb|EEE53971.1| hypothetical protein OsJ_00588 [Oryza sativa Japonica Group]
          Length = 1000

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 7/95 (7%)

Query: 57  SSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
           + A  TANPR F        W++ + +   G DP  +R DA GNV++         L  D
Sbjct: 240 AEAAPTANPRGF----ANLLWKQLDHLGNAGFDPALFRVDAYGNVLYLH-ADSASPLAWD 294

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
            DH  P ++GG++   N +++QA V R K N+ E 
Sbjct: 295 IDHWFPCARGGRTVPSNLRIVQAQVCRKKHNKLEF 329


>gi|115434848|ref|NP_001042182.1| Os01g0176500 [Oryza sativa Japonica Group]
 gi|55295944|dbj|BAD67812.1| unknown protein [Oryza sativa Japonica Group]
 gi|113531713|dbj|BAF04096.1| Os01g0176500 [Oryza sativa Japonica Group]
          Length = 832

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 3/93 (3%)

Query: 59  ADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
           AD+ A P + P       W++ + +   G DP  +R DA GNV++         L  D D
Sbjct: 70  ADAEAAPTANPRGFANLLWKQLDHLGNAGFDPALFRVDAYGNVLYLH-ADSASPLAWDID 128

Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           H  P ++GG++   N +++QA V R K N+ E 
Sbjct: 129 HWFPCARGGRTVPSNLRIVQAQVCRKKHNKLEF 161


>gi|242051879|ref|XP_002455085.1| hypothetical protein SORBIDRAFT_03g004110 [Sorghum bicolor]
 gi|241927060|gb|EES00205.1| hypothetical protein SORBIDRAFT_03g004110 [Sorghum bicolor]
          Length = 819

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 7/95 (7%)

Query: 57  SSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
           + A  TANPR F        W++ + +   G DP  +R DA GNV++         L  D
Sbjct: 70  AEAPPTANPRGF----ANLLWKQLDHLGNAGFDPALFRVDAYGNVLYLH-ADSASPLAWD 124

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
            DH  P ++GGK+   N +++Q  V R K N+ E 
Sbjct: 125 IDHWFPCARGGKTVPSNLRIVQWQVCRKKQNKLEF 159


>gi|168059079|ref|XP_001781532.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667011|gb|EDQ53651.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 189

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 56/112 (50%), Gaps = 7/112 (6%)

Query: 40  ELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALG 97
           + AT ++   LF  +  +  D   N R       ++ W++ + +   G DP ++R D+ G
Sbjct: 54  DFATLKKWDGLFPTVDDTERDPDLNARKH----AEELWQQLDHLGNAGFDPAKFRVDSYG 109

Query: 98  NVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           NVV+         L  + DH  P+S+GGK+ + N +VLQ   ++ K +R E 
Sbjct: 110 NVVYWN-ADPSSPLAWEIDHWFPHSRGGKTVVSNLRVLQWQASQRKKSRLEF 160


>gi|410097764|ref|ZP_11292745.1| hypothetical protein HMPREF1076_01923 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|410101011|ref|ZP_11295965.1| hypothetical protein HMPREF1076_05143 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|410101180|ref|ZP_11296130.1| hypothetical protein HMPREF1076_05308 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409213077|gb|EKN06106.1| hypothetical protein HMPREF1076_05308 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409213657|gb|EKN06673.1| hypothetical protein HMPREF1076_05143 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409223854|gb|EKN16789.1| hypothetical protein HMPREF1076_01923 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 83

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 37/67 (55%)

Query: 77  WEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQ 136
           W+K   VRG+DP+ +RKD  GN+++    G    L  + DH  P +KGG + L N Q + 
Sbjct: 13  WQKGLIVRGKDPNLYRKDCYGNLMYWYSYGKFTKLGWNVDHSKPIAKGGTNHLNNLQPMN 72

Query: 137 ATVNRSK 143
              N SK
Sbjct: 73  CFANSSK 79


>gi|357127491|ref|XP_003565413.1| PREDICTED: uncharacterized protein LOC100821388 [Brachypodium
           distachyon]
          Length = 815

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 45/92 (48%), Gaps = 3/92 (3%)

Query: 60  DSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
           D+ A P + P       W++ + +   G DP  +R DA GNV++         L  D  H
Sbjct: 70  DAEALPAANPRGFASLLWKQLDHLGNAGFDPALFRVDAYGNVLYLH-ADSASPLAWDVHH 128

Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
             P ++GGK+   N +V+QA V R K N+ E 
Sbjct: 129 WFPCARGGKTVPSNLRVVQAQVCRKKQNKLEF 160


>gi|168027278|ref|XP_001766157.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682589|gb|EDQ69006.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 190

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 3/78 (3%)

Query: 74  QQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLEN 131
           ++ WE+ + +   G DP ++R DA GNV++         L  + DH  P+S+GG++   N
Sbjct: 86  EELWEQLDHLGNAGFDPAKFRVDAYGNVLYWG-ADPSSPLSWEVDHWFPHSRGGRTVPSN 144

Query: 132 CQVLQATVNRSKGNRTEL 149
            Q++Q    + K NR E 
Sbjct: 145 LQIVQWQARQRKKNRLEF 162


>gi|168033708|ref|XP_001769356.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679276|gb|EDQ65725.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 190

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 42/82 (51%), Gaps = 3/82 (3%)

Query: 70  YSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKS 127
           +   ++ WE+ + +   G DP ++R DA GNV++         L  + DH  P+S+GG +
Sbjct: 82  HEFAEELWEQLDHLGNAGFDPAKFRVDAYGNVLYWS-ADPSSPLSWEVDHWFPHSRGGLT 140

Query: 128 TLENCQVLQATVNRSKGNRTEL 149
              N Q++Q    + K NR E 
Sbjct: 141 APSNLQIVQWQARQRKKNRLEF 162


>gi|260655767|ref|ZP_05861236.1| putative HNH endonuclease domain protein [Jonquetella anthropi
           E3_33 E1]
 gi|260629383|gb|EEX47577.1| putative HNH endonuclease domain protein [Jonquetella anthropi
           E3_33 E1]
          Length = 384

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 18/146 (12%)

Query: 11  KSTATRASSPSPSRRRGETTVRSATLLDNEL--ATEEEISTLFTDLRISSADSTANPRS- 67
           +S      +  P  R+ +  V    LL N+   +TE   + L   ++I   D     +S 
Sbjct: 247 QSVINWVKTLFPVHRKEQQRVEWG-LLYNQYGNSTEFNANDLEKQIKILMQDEDVTKKSG 305

Query: 68  -FPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH------DYDHILP 120
            +PY + ++  E+   +R    DR  ++A      R+   CP C  H      + DHI P
Sbjct: 306 IYPYLLTKE--ERYLSIRAF-TDRMIREAYE----RQSGICPACNKHFEINEMEADHITP 358

Query: 121 YSKGGKSTLENCQVLQATVNRSKGNR 146
           +SKGG++  ENCQ+L  T NR K N+
Sbjct: 359 WSKGGRTIAENCQMLCKTCNRKKSNK 384


>gi|288802822|ref|ZP_06408259.1| conserved hypothetical protein [Prevotella melaninogenica D18]
 gi|288334639|gb|EFC73077.1| conserved hypothetical protein [Prevotella melaninogenica D18]
          Length = 119

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 1/74 (1%)

Query: 73  KQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGC-PGCLCHDYDHILPYSKGGKSTLEN 131
           K+ CW  A+ V G D D  RKDA G  +F+   G        + DH+ P   GG     N
Sbjct: 7   KEYCWNHAQIVEGYDKDSIRKDACGAWIFKAHYGMRDSVFGWEVDHVFPVILGGDDFKRN 66

Query: 132 CQVLQATVNRSKGN 145
            + +Q   N SKG+
Sbjct: 67  LRAMQWKNNVSKGD 80


>gi|168040184|ref|XP_001772575.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676130|gb|EDQ62617.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 212

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 3/80 (3%)

Query: 72  VKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL 129
             ++ W++ + +   G DP ++R D  GNVV+         L  D DH  P S+GGK+ L
Sbjct: 80  FAEELWQQLDHLGNAGFDPAKFRVDPYGNVVYWN-ADPSSPLAWDIDHWFPRSRGGKTKL 138

Query: 130 ENCQVLQATVNRSKGNRTEL 149
            N +++Q      K NR E 
Sbjct: 139 PNLRIVQWQAYLRKRNRLEF 158


>gi|168054230|ref|XP_001779535.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162669016|gb|EDQ55611.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 190

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 3/82 (3%)

Query: 70  YSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKS 127
           +   ++ WE+ + +   G DP ++R DA GNV++         L  + DH  P+ +GG++
Sbjct: 82  HEFAEEIWEQLDHLGDAGFDPAKFRVDAYGNVLYWS-ADPSSPLSWEVDHWFPHLRGGRT 140

Query: 128 TLENCQVLQATVNRSKGNRTEL 149
              N Q++Q    + K NR E 
Sbjct: 141 VPSNLQIVQWQARQRKKNRLEF 162


>gi|302805536|ref|XP_002984519.1| hypothetical protein SELMODRAFT_234547 [Selaginella moellendorffii]
 gi|300147907|gb|EFJ14569.1| hypothetical protein SELMODRAFT_234547 [Selaginella moellendorffii]
          Length = 204

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 53/112 (47%), Gaps = 6/112 (5%)

Query: 40  ELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALG 97
           E+A  EE+  LF    I  +D+     S  +  K   W++ + +   G DP  +R D+ G
Sbjct: 69  EVAKLEEVQDLFP---IVLSDTRKEHPSHLHKFKDVLWQQLDHLGNAGFDPACFRVDSYG 125

Query: 98  NVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           NV++         L  +  H  PYS+GGK+ L N ++ Q  V+  K  + E 
Sbjct: 126 NVLYWH-ADPASPLAWEVCHWFPYSRGGKTVLNNLRLAQWQVSLKKKEKLEF 176


>gi|168018025|ref|XP_001761547.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687231|gb|EDQ73615.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 223

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 72  VKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL 129
             ++ W + + +   G DP+++R D  GNVV          L  D DH  P S+GGK+ L
Sbjct: 81  FAEELWRQLDHLGNAGFDPEKFRVDPYGNVVHWN-ADPSSPLAWDIDHWFPQSRGGKTEL 139

Query: 130 ENCQVLQATVNRSKGNRTEL 149
            N +++Q      K NR E 
Sbjct: 140 PNLRIVQWQAYARKRNRLEF 159


>gi|148727130|ref|YP_001285624.1| gp48 [Propionibacterium phage PA6]
 gi|91982954|gb|ABE68617.1| gp48 [Propionibacterium phage PA6]
          Length = 100

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR+  +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRQRLITQAQQQGQTECPLCGVTITWDTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|302782555|ref|XP_002973051.1| hypothetical protein SELMODRAFT_98470 [Selaginella moellendorffii]
 gi|300159652|gb|EFJ26272.1| hypothetical protein SELMODRAFT_98470 [Selaginella moellendorffii]
          Length = 204

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 53/112 (47%), Gaps = 6/112 (5%)

Query: 40  ELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALG 97
           E+A  E++  LF    I  +DS     S  +  K   W++ + +   G DP  +R D+ G
Sbjct: 69  EVAKLEQVQDLFP---IVLSDSRKEHPSHLHKFKDVLWQQLDHLGNAGFDPACFRVDSYG 125

Query: 98  NVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           NV++         L  +  H  PYS+GGK+ L N ++ Q  V+  K  + E 
Sbjct: 126 NVLYWH-ADPASPLAWEVCHWFPYSRGGKTVLNNLRLAQWQVSLKKKEKLEF 176


>gi|410491223|ref|YP_006906446.1| hypothetical protein P100A_45 [Propionibacterium phage P100_A]
 gi|407302224|gb|AFT97776.1| hypothetical protein P100A_45 [Propionibacterium phage P100_A]
          Length = 100

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR+  +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRQRLITQAKQQGQTECPLCGVTITWDTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|410491718|ref|YP_006906942.1| hypothetical protein ATCC29399BT_46 [Propionibacterium phage
           ATCC29399B_T]
 gi|410507509|ref|YP_006907129.1| hypothetical protein ATCC29399BC_46 [Propionibacterium phage
           ATCC29399B_C]
 gi|407302364|gb|AFT97913.1| hypothetical protein ATCC29399BT_46 [Propionibacterium phage
           ATCC29399B_T]
 gi|407302411|gb|AFT97959.1| hypothetical protein ATCC29399BC_46 [Propionibacterium phage
           ATCC29399B_C]
          Length = 100

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR+  +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRQRLITQARQQGQTECPLCGVTITWDTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|410491362|ref|YP_006906587.1| hypothetical protein P101A_47 [Propionibacterium phage P101A]
 gi|407302036|gb|AFT97592.1| hypothetical protein P101A_47 [Propionibacterium phage P101A]
          Length = 100

 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR+  +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRQRLITQARQQGQTECPLCGAQIAWGTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|410491407|ref|YP_006906632.1| HNH endonuclease [Propionibacterium phage P1.1]
 gi|407302316|gb|AFT97866.1| HNH endonuclease [Propionibacterium phage P1.1]
          Length = 101

 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 37/71 (52%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR+  +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 15  RWRQRLITQARQQGQTECPLCGATITWNTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 74

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 75  TCNRSKGNRSE 85


>gi|257439996|ref|ZP_05615751.1| putative HNH endonuclease domain protein [Faecalibacterium
           prausnitzii A2-165]
 gi|257197603|gb|EEU95887.1| HNH endonuclease domain protein [Faecalibacterium prausnitzii
           A2-165]
          Length = 367

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 27/45 (60%), Gaps = 6/45 (13%)

Query: 107 CPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGN 145
           CP C  H        DHI P+SKGGK+T ENCQ+L A  NR K N
Sbjct: 322 CPKCGKHFEIEEMQADHITPWSKGGKTTAENCQMLCADCNRRKSN 366


>gi|410490847|ref|YP_006906071.1| HNH endonuclease [Propionibacterium phage P9.1]
 gi|422385762|ref|ZP_16465888.1| putative HNH endonuclease domain protein [Propionibacterium acnes
           HL096PA3]
 gi|327327949|gb|EGE69721.1| putative HNH endonuclease domain protein [Propionibacterium acnes
           HL096PA3]
 gi|407301941|gb|AFT97499.1| HNH endonuclease [Propionibacterium phage P9.1]
          Length = 100

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR   +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRARLITQAQQQGQTECPLCGAQIAWGTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|410491317|ref|YP_006906540.1| hypothetical protein P1001_47 [Propionibacterium phage P100_1]
 gi|407302272|gb|AFT97823.1| hypothetical protein P1001_47 [Propionibacterium phage P100_1]
          Length = 100

 Score = 45.8 bits (107), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR   +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRARLITQAQQQGQTECPLCGATITWNTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|284097476|ref|ZP_06385569.1| HNH endonuclease [Candidatus Poribacteria sp. WGA-A3]
 gi|283831012|gb|EFC35029.1| HNH endonuclease [Candidatus Poribacteria sp. WGA-A3]
          Length = 171

 Score = 45.8 bits (107), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 61/146 (41%), Gaps = 27/146 (18%)

Query: 7   PKRRKSTATRASSPSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPR 66
           PK+R S  + A  P          +  A L       E E++ L  D      D T  P 
Sbjct: 45  PKKRASMKSVAWGPLYDEHHERDDLDPARL-------EAEVARLHKD-----DDVTKKPG 92

Query: 67  SFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH------DYDHIL 119
            +PY + +   EK   +R   P       +    + +  G CP C  H      + DHI 
Sbjct: 93  IYPYVLTRD--EKYLNIRAFTP------TMRAAAYERQEGICPRCGEHFVLGAMEADHID 144

Query: 120 PYSKGGKSTLENCQVLQATVNRSKGN 145
           P+S+GGK+  ENCQ+L    NR KG+
Sbjct: 145 PWSEGGKTAPENCQMLCRPCNRRKGS 170


>gi|373455654|ref|ZP_09547483.1| hypothetical protein HMPREF9453_01652 [Dialister succinatiphilus
           YIT 11850]
 gi|371934747|gb|EHO62527.1| hypothetical protein HMPREF9453_01652 [Dialister succinatiphilus
           YIT 11850]
          Length = 148

 Score = 45.8 bits (107), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 21/35 (60%), Positives = 26/35 (74%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           D DHI+P S+GG ST++N Q L    NRSKGNRT+
Sbjct: 80  DVDHIVPQSRGGGSTMDNLQYLCPHCNRSKGNRTD 114


>gi|410491593|ref|YP_006906817.1| hypothetical protein P104A_45 [Propionibacterium phage P104A]
 gi|407302083|gb|AFT97638.1| hypothetical protein P104A_45 [Propionibacterium phage P104A]
          Length = 100

 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR   +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRARLITQARQQGQTECPLCGATITWNTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|423220008|ref|ZP_17206504.1| hypothetical protein HMPREF1061_03277 [Bacteroides caccae
           CL03T12C61]
 gi|392624271|gb|EIY18364.1| hypothetical protein HMPREF1061_03277 [Bacteroides caccae
           CL03T12C61]
          Length = 390

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 26/33 (78%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG +++ENC++L  T NRSKGNR
Sbjct: 358 DADHVTAWSKGGATSMENCEMLCKTHNRSKGNR 390


>gi|153807020|ref|ZP_01959688.1| hypothetical protein BACCAC_01297 [Bacteroides caccae ATCC 43185]
 gi|149130140|gb|EDM21350.1| HNH endonuclease domain protein [Bacteroides caccae ATCC 43185]
          Length = 390

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 26/33 (78%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG +++ENC++L  T NRSKGNR
Sbjct: 358 DADHVTAWSKGGATSMENCEMLCKTHNRSKGNR 390


>gi|359482658|ref|XP_002285224.2| PREDICTED: uncharacterized protein LOC100251511 [Vitis vinifera]
          Length = 746

 Score = 45.1 bits (105), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 3/92 (3%)

Query: 60  DSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
           D  A P + P       W++   +   G DP  +R D  GNV++         L  D DH
Sbjct: 68  DPKAKPTTKPKIFISLLWKQLNHLGNAGFDPAIFRVDPYGNVLYYH-ADSASPLAWDIDH 126

Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
             P S+GG +   N ++LQ  V R K N+ E 
Sbjct: 127 WFPCSRGGLTVPSNLRILQWQVCRRKHNKLEF 158


>gi|410491905|ref|YP_006907176.1| hypothetical protein P100D_47 [Propionibacterium phage P100D]
 gi|407302176|gb|AFT97729.1| hypothetical protein P100D_47 [Propionibacterium phage P100D]
          Length = 100

 Score = 45.1 bits (105), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGC------------LCHDYDHILPYSKGGKSTLENCQVLQA 137
           RWR+  +     +    CP C               + DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRQRLITQAQQQGQTTCPLCGVTITWDTHQLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|386390355|ref|ZP_10075145.1| HNH endonuclease domain protein [Haemophilus paraphrohaemolyticus
           HK411]
 gi|385693240|gb|EIG23892.1| HNH endonuclease domain protein [Haemophilus paraphrohaemolyticus
           HK411]
          Length = 388

 Score = 45.1 bits (105), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 14/54 (25%)

Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C  HD               DH+L +SKGG ++ ENC++L    NRSKGN+
Sbjct: 335 CPLCAIHDNADKTKIWTMKEMEADHVLAWSKGGATSAENCEMLCIKHNRSKGNK 388


>gi|330858397|ref|YP_004414751.1| Gp48 [Propionibacterium phage PAS50]
 gi|260066608|gb|ACX30886.1| Gp48 [Propionibacterium phage PAS50]
          Length = 101

 Score = 45.1 bits (105), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 37/71 (52%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC------HDY------DHILPYSKGGKSTLENCQVLQA 137
           RWR+  +     +    CP C        HD       DHI P S+GG +TL+N Q++  
Sbjct: 15  RWRQRLITQARQQGQTECPLCGATITWDTHDLPTSPEADHITPVSRGGLNTLDNGQIICR 74

Query: 138 TVNRSKGNRTE 148
           T NRSKGN+T+
Sbjct: 75  TCNRSKGNQTQ 85


>gi|330858443|ref|YP_004414796.1| Gp48 [Propionibacterium phage PAD20]
 gi|260066561|gb|ACX30840.1| Gp48 [Propionibacterium phage PAD20]
          Length = 100

 Score = 45.1 bits (105), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 36/71 (50%), Gaps = 12/71 (16%)

Query: 90  RWRKDALGNVVFRKLVGCPGC------------LCHDYDHILPYSKGGKSTLENCQVLQA 137
           RWR+  +     +    CP C               + DHI P S+GG +TL+N Q++  
Sbjct: 14  RWRQRLITQARQQGQTECPLCGATITWDTHQLPTSPEADHITPVSRGGLNTLDNGQIICR 73

Query: 138 TVNRSKGNRTE 148
           T NRSKGNR+E
Sbjct: 74  TCNRSKGNRSE 84


>gi|429740338|ref|ZP_19274029.1| ftsk gamma domain protein [Prevotella saccharolytica F0055]
 gi|429153310|gb|EKX96095.1| ftsk gamma domain protein [Prevotella saccharolytica F0055]
          Length = 285

 Score = 45.1 bits (105), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 6/57 (10%)

Query: 91  WRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           WR+D        + V C      ++DHI+P+SKGG +T+ N Q+L    NRSK N+ 
Sbjct: 234 WRRDG------GRCVKCGSQENLEFDHIIPFSKGGSNTIRNIQLLCEKCNRSKSNKI 284


>gi|302345663|ref|YP_003814016.1| HNH endonuclease domain protein [Prevotella melaninogenica ATCC
           25845]
 gi|302149883|gb|ADK96145.1| HNH endonuclease domain protein [Prevotella melaninogenica ATCC
           25845]
          Length = 373

 Score = 44.7 bits (104), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 37/87 (42%), Gaps = 29/87 (33%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLC-HDY-----DHIL 119
           R FP  +KQ  WEK   +                       CP C    DY     DHI 
Sbjct: 310 RCFPDDIKQAVWEKQYHI-----------------------CPSCKKEFDYEFMEGDHIT 346

Query: 120 PYSKGGKSTLENCQVLQATVNRSKGNR 146
           P+ +GG++ +ENCQ+L    NR KG R
Sbjct: 347 PWREGGRTVIENCQMLCRECNRRKGGR 373


>gi|417162963|ref|ZP_11998293.1| PF03235 family protein [Escherichia coli 99.0741]
 gi|386173454|gb|EIH45466.1| PF03235 family protein [Escherichia coli 99.0741]
          Length = 390

 Score = 44.7 bits (104), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 26/31 (83%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DH+  +SKGG+STL+NC++L  T NRSKGN+
Sbjct: 360 DHVSAWSKGGESTLDNCEMLCITHNRSKGNK 390


>gi|410491817|ref|YP_006907042.1| hypothetical protein P105_45 [Propionibacterium phage P105]
 gi|407302128|gb|AFT97682.1| hypothetical protein P105_45 [Propionibacterium phage P105]
          Length = 100

 Score = 44.7 bits (104), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 27/35 (77%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           + DHI P S+GG +TL+N Q++  T NRSKGNR+E
Sbjct: 50  EADHITPVSRGGLNTLDNGQIICRTCNRSKGNRSE 84


>gi|410491270|ref|YP_006906493.1| HNH endonuclease [Propionibacterium phage P14.4]
 gi|407301990|gb|AFT97547.1| HNH endonuclease [Propionibacterium phage P14.4]
          Length = 101

 Score = 44.7 bits (104), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 27/35 (77%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           + DHI P S+GG +TL+N Q++  T NRSKGNR+E
Sbjct: 51  EADHITPVSRGGLNTLDNGQIICRTCNRSKGNRSE 85


>gi|422680858|ref|ZP_16739129.1| stress protein, partial [Pseudomonas syringae pv. tabaci str. ATCC
           11528]
 gi|331010203|gb|EGH90259.1| stress protein [Pseudomonas syringae pv. tabaci str. ATCC 11528]
          Length = 138

 Score = 44.3 bits (103), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 58/126 (46%), Gaps = 21/126 (16%)

Query: 41  LATEEEISTLFTDLR-ISSADSTANPRSFPYSVKQQCWE-KAEKVR-------GRDPDRW 91
           L TE E+  L   +R ++      + R   Y  KQ   E K E +R          P   
Sbjct: 15  LTTERELDDLVDYIRALAKGHKLPSVRVDAYEFKQVTGEAKGEPIRVNVSSMPAPKPPAA 74

Query: 92  RKDALGNVVFRKLV-----GCPGC-----LCHDYDHILPYSKGGKSTLENCQVLQATVNR 141
           R++A+   V R++       C  C     LC  +DHI+P+S+GG +T+ N Q+L    N 
Sbjct: 75  RREAIPRSVQREVWQRDAGKCVECGTRQLLC--FDHIVPFSRGGSNTVRNLQLLCEPCNL 132

Query: 142 SKGNRT 147
           SKGNR 
Sbjct: 133 SKGNRI 138


>gi|410689486|ref|YP_006963215.1| hypothetical protein [Arthrobacter sp. 31-32]
 gi|389595975|gb|AFK88745.1| hypothetical protein [Arthrobacter sp. 31-32]
          Length = 388

 Score = 43.9 bits (102), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 25/33 (75%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGGK+ L NC++L  T NR+KGNR
Sbjct: 356 DADHVAAWSKGGKTDLANCEMLCVTHNRAKGNR 388


>gi|331005592|ref|ZP_08328963.1| hypothetical protein IMCC1989_2172 [gamma proteobacterium IMCC1989]
 gi|330420607|gb|EGG94902.1| hypothetical protein IMCC1989_2172 [gamma proteobacterium IMCC1989]
          Length = 389

 Score = 43.9 bits (102), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 25/33 (75%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG + +ENCQ+L  T NR+KGN+
Sbjct: 357 DADHVTAWSKGGATNIENCQMLCKTHNRAKGNK 389


>gi|238926542|ref|ZP_04658302.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
 gi|238885488|gb|EEQ49126.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
          Length = 363

 Score = 43.9 bits (102), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 23/30 (76%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
           DHI P+SKGGK+T ENCQ+L A  NR K N
Sbjct: 333 DHITPWSKGGKTTPENCQMLCAPCNRRKSN 362


>gi|421735209|ref|ZP_16174207.1| HNH endonuclease [Bifidobacterium bifidum LMG 13195]
 gi|407076867|gb|EKE49775.1| HNH endonuclease [Bifidobacterium bifidum LMG 13195]
          Length = 379

 Score = 43.9 bits (102), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 24/31 (77%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI+P+S+GGK+TL+NCQ+L  T N  K N 
Sbjct: 349 DHIVPWSEGGKTTLDNCQMLCTTCNIKKSNH 379


>gi|288803679|ref|ZP_06409109.1| HNH endonuclease domain-containing protein [Prevotella
           melaninogenica D18]
 gi|288333919|gb|EFC72364.1| HNH endonuclease domain-containing protein [Prevotella
           melaninogenica D18]
          Length = 373

 Score = 43.9 bits (102), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 37/87 (42%), Gaps = 29/87 (33%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLC-HDY-----DHIL 119
           R FP  +K+  WEK   +                       CP C    DY     DHI 
Sbjct: 310 RCFPDDIKRAVWEKQHHI-----------------------CPSCQKEFDYEFMEGDHIT 346

Query: 120 PYSKGGKSTLENCQVLQATVNRSKGNR 146
           P+ +GG++ +ENCQ+L    NR KG R
Sbjct: 347 PWREGGRTVIENCQMLCRECNRRKGGR 373


>gi|257486734|ref|ZP_05640775.1| HNH endonuclease [Pseudomonas syringae pv. tabaci str. ATCC 11528]
          Length = 217

 Score = 43.5 bits (101), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 58/126 (46%), Gaps = 21/126 (16%)

Query: 41  LATEEEISTLFTDLR-ISSADSTANPRSFPYSVKQQCWE-KAEKVR-------GRDPDRW 91
           L TE E+  L   +R ++      + R   Y  KQ   E K E +R          P   
Sbjct: 94  LTTERELDDLVDYIRALAKGHKLPSVRVDAYEFKQVTGEAKGEPIRVNVSSMPAPKPPAA 153

Query: 92  RKDALGNVVFRKLV-----GCPGC-----LCHDYDHILPYSKGGKSTLENCQVLQATVNR 141
           R++A+   V R++       C  C     LC  +DHI+P+S+GG +T+ N Q+L    N 
Sbjct: 154 RREAIPRSVQREVWQRDAGKCVECGTRQLLC--FDHIVPFSRGGSNTVRNLQLLCEPCNL 211

Query: 142 SKGNRT 147
           SKGNR 
Sbjct: 212 SKGNRI 217


>gi|359427003|ref|ZP_09218078.1| hypothetical protein GOAMR_71_00130 [Gordonia amarae NBRC 15530]
 gi|358237616|dbj|GAB07660.1| hypothetical protein GOAMR_71_00130 [Gordonia amarae NBRC 15530]
          Length = 548

 Score = 43.5 bits (101), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 4/51 (7%)

Query: 100 VFRKLVG----CPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           VFR+  G    C      +YDHI+P+S+GG +++EN Q+L    N++KG R
Sbjct: 497 VFRRCGGRCVECGSTSYLEYDHIIPWSRGGATSVENLQILCRACNQAKGAR 547


>gi|169840962|ref|ZP_02874075.1| hypothetical protein cdivTM_27683 [candidate division TM7
           single-cell isolate TM7a]
          Length = 66

 Score = 43.5 bits (101), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 26/33 (78%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG +++ENC++L  + NRSKGN+
Sbjct: 34  DADHVSAWSKGGSTSMENCEMLCKSHNRSKGNK 66


>gi|407001791|gb|EKE18705.1| hypothetical protein ACD_9C00258G0006 [uncultured bacterium]
          Length = 387

 Score = 43.5 bits (101), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 26/33 (78%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG ++++NCQ+L  T NR+KGNR
Sbjct: 355 DADHVAAWSKGGATSVKNCQMLCKTHNRAKGNR 387


>gi|356557263|ref|XP_003546937.1| PREDICTED: uncharacterized protein LOC100776970 [Glycine max]
          Length = 718

 Score = 43.1 bits (100), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 8/117 (6%)

Query: 35  TLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWR 92
            L ++E  T +++  +F  +    A +T+ P+ F  SV    W++   +   G DP   R
Sbjct: 37  ALSEDEAETAKDLDEIFPIID-PKAKATSKPKIF-VSV---LWKQLRHLGNAGFDPAVIR 91

Query: 93  KDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
            D  GNV++         L  D DH  P S+GG + L N ++LQ    + K N+ E 
Sbjct: 92  VDGYGNVLYYH-ADSASPLAWDVDHWFPCSRGGLTVLSNLRLLQRQACKRKKNKLEF 147


>gi|336115757|ref|YP_004570523.1| hypothetical protein MLP_01060 [Microlunatus phosphovorus NM-1]
 gi|334683535|dbj|BAK33120.1| hypothetical protein MLP_01060 [Microlunatus phosphovorus NM-1]
          Length = 73

 Score = 42.7 bits (99), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 23/33 (69%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG S L NC++L    NRSKGNR
Sbjct: 41  DADHVTAWSKGGASDLANCEMLCVPHNRSKGNR 73


>gi|449981939|ref|ZP_21818046.1| HNH endonuclease [Streptococcus mutans 5SM3]
 gi|450046601|ref|ZP_21839016.1| HNH endonuclease [Streptococcus mutans N34]
 gi|449175287|gb|EMB77712.1| HNH endonuclease [Streptococcus mutans 5SM3]
 gi|449198573|gb|EMB99680.1| HNH endonuclease [Streptococcus mutans N34]
          Length = 412

 Score = 42.7 bits (99), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
           DHI P+S GGK+TL+NCQ+L    N SKGN
Sbjct: 381 DHITPWSLGGKTTLDNCQLLCKKHNSSKGN 410


>gi|450031203|ref|ZP_21833586.1| HNH endonuclease [Streptococcus mutans G123]
 gi|449191820|gb|EMB93276.1| HNH endonuclease [Streptococcus mutans G123]
          Length = 412

 Score = 42.7 bits (99), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
           DHI P+S GGK+TL+NCQ+L    N SKGN
Sbjct: 381 DHITPWSLGGKTTLDNCQLLCKKHNSSKGN 410


>gi|404212558|ref|YP_006671568.1| hypothetical protein KTR9_4973 [Gordonia sp. KTR9]
 gi|403648253|gb|AFR51432.1| hypothetical protein KTR9_4973 [Gordonia sp. KTR9]
          Length = 382

 Score = 42.7 bits (99), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 24/33 (72%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG S L NC++L  + NRSKGNR
Sbjct: 350 DADHVTAWSKGGASDLANCEMLCVSHNRSKGNR 382


>gi|356547420|ref|XP_003542110.1| PREDICTED: uncharacterized protein LOC100797659 [Glycine max]
          Length = 740

 Score = 42.7 bits (99), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 7/93 (7%)

Query: 59  ADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYD 116
           A +T+ P+ F  SV    W++   +   G DP   R D  GNV++         L  D D
Sbjct: 73  AKATSKPKIF-VSV---LWKQLRHLGNAGFDPAVIRVDGYGNVLYYH-ADSASPLAWDVD 127

Query: 117 HILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           H  P S+GG + L N ++LQ    + K N+ E 
Sbjct: 128 HWFPCSRGGLTVLSNLRILQRQACKRKKNKLEF 160


>gi|302342138|ref|YP_003806667.1| hypothetical protein Deba_0701 [Desulfarculus baarsii DSM 2075]
 gi|301638751|gb|ADK84073.1| hypothetical protein Deba_0701 [Desulfarculus baarsii DSM 2075]
          Length = 103

 Score = 42.4 bits (98), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 1/80 (1%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGC-PGCLCHDYDHILPYSKG 124
           R  P  + ++ W+KA      DPD  R+D  G  ++R   G        D D+I+P   G
Sbjct: 3   RDLPLELIERVWQKARPAPRLDPDSVRQDCFGAWIYRHAFGDRASAYGWDLDYIVPPDCG 62

Query: 125 GKSTLENCQVLQATVNRSKG 144
           G+  L+N + LQ   +  +G
Sbjct: 63  GRGELDNLRPLQWQNSLMRG 82


>gi|429725832|ref|ZP_19260648.1| HNH endonuclease domain protein [Prevotella sp. oral taxon 473 str.
           F0040]
 gi|429148735|gb|EKX91736.1| HNH endonuclease domain protein [Prevotella sp. oral taxon 473 str.
           F0040]
          Length = 390

 Score = 42.4 bits (98), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 26/33 (78%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG +TL+NC++L  + NR+KGN+
Sbjct: 358 DADHVTAWSKGGATTLDNCEMLCKSHNRAKGNK 390


>gi|365895382|ref|ZP_09433497.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. STM
           3843]
 gi|365423858|emb|CCE06039.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. STM
           3843]
          Length = 185

 Score = 42.4 bits (98), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  CL  D   +DHI+P SKGG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFACQYCLSGDDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNMT 137


>gi|440742015|ref|ZP_20921345.1| TerF-like protein [Pseudomonas syringae BRIP39023]
 gi|440378101|gb|ELQ14731.1| TerF-like protein [Pseudomonas syringae BRIP39023]
          Length = 217

 Score = 42.4 bits (98), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           LC  +DHI+P+S+GG +T+ N Q+L    N SKGNR 
Sbjct: 183 LC--FDHIVPFSRGGSNTVRNLQLLCEPCNLSKGNRI 217


>gi|257486772|ref|ZP_05640813.1| stress protein [Pseudomonas syringae pv. tabaci str. ATCC 11528]
          Length = 217

 Score = 42.4 bits (98), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           LC  +DHI+P+S+GG +T+ N Q+L    N SKGNR 
Sbjct: 183 LC--FDHIVPFSRGGSNTVRNLQLLCEPCNLSKGNRI 217


>gi|315498410|ref|YP_004087214.1| hnh endonuclease [Asticcacaulis excentricus CB 48]
 gi|315416422|gb|ADU13063.1| HNH endonuclease [Asticcacaulis excentricus CB 48]
          Length = 186

 Score = 42.4 bits (98), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 98  NVVFRKLVGCPGCLC-HD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           NV  R    C  C C HD  +DH+LP S+GGKST  N     A  N  KG +T       
Sbjct: 86  NVFLRDQFSCQYCGCRHDLTFDHVLPVSQGGKSTWTNIITACAPCNLRKGGKTPQQAQMA 145

Query: 155 IQKSSY 160
           + KS +
Sbjct: 146 LAKSPH 151


>gi|384253642|gb|EIE27116.1| hypothetical protein COCSUDRAFT_55140 [Coccomyxa subellipsoidea
           C-169]
          Length = 134

 Score = 42.4 bits (98), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 3/61 (4%)

Query: 90  RWRKDALGNVVFRKLVGCPGCLC-HDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           R+R D  GNVV  +  G   C+C  + DHI P+++GG S +EN   L    NR+  N  +
Sbjct: 76  RFRSDPYGNVVALEARGS--CVCAFEVDHIFPWARGGLSVVENFMALFWRSNRNVKNDKD 133

Query: 149 L 149
           +
Sbjct: 134 I 134


>gi|431803894|ref|YP_007230797.1| HNH endonuclease domain-containing protein [Pseudomonas putida
           HB3267]
 gi|430794659|gb|AGA74854.1| HNH endonuclease domain-containing protein [Pseudomonas putida
           HB3267]
          Length = 154

 Score = 42.0 bits (97), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 33/66 (50%), Gaps = 7/66 (10%)

Query: 88  PDRWRKDALGNVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKG 144
           PDR R   +   ++R    C  C  H     DHI P   GG++TLEN Q L    N SKG
Sbjct: 92  PDRLRTAVMERDMYR----CVRCGSHRELCADHIFPEVLGGEATLENLQTLCRPCNTSKG 147

Query: 145 NRTELS 150
            R E+S
Sbjct: 148 GRVEVS 153


>gi|227356931|ref|ZP_03841307.1| hypothetical protein HMPREF0693_2198 [Proteus mirabilis ATCC 29906]
 gi|425072865|ref|ZP_18475971.1| hypothetical protein HMPREF1310_02304 [Proteus mirabilis WGLW4]
 gi|227162908|gb|EEI47860.1| hypothetical protein HMPREF0693_2198 [Proteus mirabilis ATCC 29906]
 gi|404596639|gb|EKA97159.1| hypothetical protein HMPREF1310_02304 [Proteus mirabilis WGLW4]
          Length = 98

 Score = 42.0 bits (97), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 20/37 (54%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELS 150
           + DHI+P  KGGK T+ENCQVL    +R K  + ELS
Sbjct: 62  ELDHIIPLFKGGKDTIENCQVLCIECHRKK-TKEELS 97


>gi|255553017|ref|XP_002517551.1| conserved hypothetical protein [Ricinus communis]
 gi|223543183|gb|EEF44715.1| conserved hypothetical protein [Ricinus communis]
          Length = 263

 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/122 (24%), Positives = 50/122 (40%), Gaps = 8/122 (6%)

Query: 30  TVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRD 87
           T     +  +E+   +++  +F  +     D  A P + P       W++   +   G D
Sbjct: 44  TFIPFAMQHHEIMRAKQLEQMFPII-----DPKAKPTAKPKIFFNLLWKQLNHLGNAGFD 98

Query: 88  PDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           P   R D  GNV++         L  D DH  P  +GG +   N ++LQ  V + K N+ 
Sbjct: 99  PAVIRVDPYGNVLYLH-ADKASPLAWDIDHWFPCQRGGLTVPSNLRILQWQVCKKKHNKL 157

Query: 148 EL 149
           E 
Sbjct: 158 EF 159


>gi|225377537|ref|ZP_03754758.1| hypothetical protein ROSEINA2194_03187 [Roseburia inulinivorans DSM
           16841]
 gi|225210614|gb|EEG92968.1| hypothetical protein ROSEINA2194_03187 [Roseburia inulinivorans DSM
           16841]
          Length = 84

 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG + + NCQ+L  T NR+KGN+
Sbjct: 52  DADHVTAWSKGGVTDISNCQMLCKTHNRAKGNK 84


>gi|111221190|ref|YP_711984.1| tellurium resistance protein terE [Frankia alni ACN14a]
 gi|111148722|emb|CAJ60397.1| Tellurium resistance protein terE [Frankia alni ACN14a]
          Length = 644

 Score = 41.6 bits (96), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 25/34 (73%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           ++DH++P+S+GG S++ N Q+L    NRSKG R 
Sbjct: 611 EFDHVIPFSRGGASSVNNLQLLCRRCNRSKGARI 644


>gi|270593505|ref|ZP_06221465.1| hypothetical protein HAINFHK1212_1873, partial [Haemophilus
           influenzae HK1212]
 gi|270318413|gb|EFA29540.1| hypothetical protein HAINFHK1212_1873 [Haemophilus influenzae
           HK1212]
          Length = 161

 Score = 41.6 bits (96), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 14/54 (25%)

Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C  HD               DH+  +SKGG ++ ENC++L    NR+KGN+
Sbjct: 108 CPLCAIHDNANKTKIWTMKEMEADHVSAWSKGGTTSAENCEMLCTMHNRAKGNK 161


>gi|346226706|ref|ZP_08847848.1| hypothetical protein AtheD1_16304 [Anaerophaga thermohalophila DSM
           12881]
          Length = 465

 Score = 41.6 bits (96), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 15/85 (17%)

Query: 78  EKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCL--------------CHDYDHILPYS 122
           E + K++ RD +R   +    +++R+  G C  CL               +  DH+LP+S
Sbjct: 380 ENSIKIKERDENRAFSELERIIIYRRDKGFCQQCLRDGKPENEAKVSWSKYQADHVLPHS 439

Query: 123 KGGKSTLENCQVLQATVNRSKGNRT 147
           KGGK+ +EN ++L +  N+SKG + 
Sbjct: 440 KGGKTIIENGELLCSYHNQSKGAKI 464


>gi|168037356|ref|XP_001771170.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677550|gb|EDQ64019.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 187

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 3/80 (3%)

Query: 72  VKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL 129
             ++ W++ E +   G DP ++R D  GNVV          L  D D   PY +GG++ L
Sbjct: 81  FAEELWQQLEHLGNAGFDPAKFRVDPYGNVVHWN-ADPSSPLAWDVDLWFPYPRGGRTKL 139

Query: 130 ENCQVLQATVNRSKGNRTEL 149
            N +++Q      K NR E 
Sbjct: 140 PNLRIVQWQACLRKRNRLEF 159


>gi|87311894|ref|ZP_01094006.1| restriction endonuclease [Blastopirellula marina DSM 3645]
 gi|87285425|gb|EAQ77347.1| restriction endonuclease [Blastopirellula marina DSM 3645]
          Length = 197

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 25/42 (59%)

Query: 108 PGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           PG      DH+LP ++GG+S+ ENC +     N SK NRT L
Sbjct: 112 PGSESLTIDHVLPRAQGGESSWENCVLACVECNHSKANRTPL 153


>gi|402307168|ref|ZP_10826195.1| HNH endonuclease domain protein [Prevotella sp. MSX73]
 gi|400379008|gb|EJP31857.1| HNH endonuclease domain protein [Prevotella sp. MSX73]
          Length = 84

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 24/33 (72%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG + + NCQ+L  + NR+KGN+
Sbjct: 52  DADHVAAWSKGGATDIHNCQMLCKSHNRAKGNK 84


>gi|423209364|ref|ZP_17195918.1| hypothetical protein HMPREF1169_01436 [Aeromonas veronii AER397]
 gi|404617222|gb|EKB14158.1| hypothetical protein HMPREF1169_01436 [Aeromonas veronii AER397]
          Length = 218

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 26/36 (72%), Gaps = 2/36 (5%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           LC  +DHI+P+SKGG +++ N Q+L    N SKGNR
Sbjct: 184 LC--FDHIVPFSKGGSNSVRNLQLLCERCNLSKGNR 217


>gi|319897846|ref|YP_004136043.1| hnh endonuclease [Haemophilus influenzae F3031]
 gi|317433352|emb|CBY81731.1| HNH endonuclease [Haemophilus influenzae F3031]
          Length = 388

 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 14/54 (25%)

Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C  HD               DH+  +SKGG ++ ENC++L    NR+KGN+
Sbjct: 335 CPLCAIHDNANKTKIWTMKEMEADHVSAWSKGGTTSAENCEMLCTMHNRAKGNK 388


>gi|385809935|ref|YP_005846331.1| hypothetical protein IALB_1353 [Ignavibacterium album JCM 16511]
 gi|383801983|gb|AFH49063.1| Hypothetical protein IALB_1353 [Ignavibacterium album JCM 16511]
          Length = 436

 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 3/48 (6%)

Query: 102 RKLVGCPGCLC-HDY--DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           RK   C G +   D+  DHI P+SKGG + L+N  ++    N SKGNR
Sbjct: 389 RKCTKCGGLVTWTDFSADHIKPFSKGGATKLQNAAIMHKNCNSSKGNR 436


>gi|373467210|ref|ZP_09558511.1| HNH endonuclease domain protein [Haemophilus sp. oral taxon 851
           str. F0397]
 gi|371759169|gb|EHO47915.1| HNH endonuclease domain protein [Haemophilus sp. oral taxon 851
           str. F0397]
          Length = 388

 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 14/54 (25%)

Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C  HD               DH+  +SKGG ++ ENC++L    NR+KGN+
Sbjct: 335 CPLCAIHDNVNKTKIWTMKEMEADHVSAWSKGGTTSAENCEMLCTMHNRAKGNK 388


>gi|303245412|ref|ZP_07331696.1| histidine triad (HIT) protein [Desulfovibrio fructosovorans JJ]
 gi|302493261|gb|EFL53123.1| histidine triad (HIT) protein [Desulfovibrio fructosovorans JJ]
          Length = 309

 Score = 41.2 bits (95), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 29/46 (63%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSS 159
           D DHI P SKGGK+   N QVL +  NR+KGN+ +     L+++ +
Sbjct: 139 DVDHIKPRSKGGKNEYANLQVLCSRCNRAKGNKEDTDYRPLVREEA 184


>gi|294675057|ref|YP_003575673.1| HNH endonuclease [Prevotella ruminicola 23]
 gi|294473171|gb|ADE82560.1| HNH endonuclease [Prevotella ruminicola 23]
          Length = 374

 Score = 41.2 bits (95), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 23/84 (27%)

Query: 66  RSFPYSVKQQCWEKAEK---VRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYS 122
           R+FP  +K + WE  E    + G++ D           F  + G         DHI P+ 
Sbjct: 311 RAFPEDIKLKVWENQEHKCAICGKEFD-----------FEFMEG---------DHITPWR 350

Query: 123 KGGKSTLENCQVLQATVNRSKGNR 146
            GG++ +ENCQ+L    NR KG++
Sbjct: 351 DGGRTVVENCQMLCRECNRRKGSK 374


>gi|83591793|ref|YP_425545.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
           rubrum ATCC 11170]
 gi|386348484|ref|YP_006046732.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
           rubrum F11]
 gi|83574707|gb|ABC21258.1| CRISPR-associated endonuclease, Csn1 family [Rhodospirillum rubrum
           ATCC 11170]
 gi|346716920|gb|AEO46935.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
           rubrum F11]
          Length = 1173

 Score = 41.2 bits (95), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 13/36 (36%), Positives = 27/36 (75%)

Query: 112 CHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
            +D DH++P ++GG+ +L+N  + Q+  N++KG++T
Sbjct: 623 AYDIDHVIPLARGGRDSLDNMVLCQSDANKTKGDKT 658


>gi|281422823|ref|ZP_06253822.1| putative HNH endonuclease domain protein [Prevotella copri DSM
           18205]
 gi|281403089|gb|EFB33769.1| putative HNH endonuclease domain protein [Prevotella copri DSM
           18205]
          Length = 389

 Score = 41.2 bits (95), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DH+  +SKGGK+ L NCQ+L    NR+KGN+
Sbjct: 359 DHVTAWSKGGKTDLANCQMLCKMHNRTKGNK 389


>gi|406036742|ref|ZP_11044106.1| hypothetical protein AparD1_07217 [Acinetobacter parvus DSM 16617 =
           CIP 108168]
          Length = 386

 Score = 41.2 bits (95), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 25/33 (75%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG + ++NC++L  T NR+KGNR
Sbjct: 354 DADHVTAWSKGGATDVKNCEMLCKTHNRAKGNR 386


>gi|407643249|ref|YP_006807008.1| hypothetical protein O3I_010365 [Nocardia brasiliensis ATCC 700358]
 gi|407306133|gb|AFU00034.1| hypothetical protein O3I_010365 [Nocardia brasiliensis ATCC 700358]
          Length = 618

 Score = 41.2 bits (95), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 36/81 (44%), Gaps = 20/81 (24%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           RS P  VK + W++                       K V C      ++DHI+P S+GG
Sbjct: 557 RSIPQEVKAEVWQRDGG--------------------KCVECGDSHYLEFDHIIPLSRGG 596

Query: 126 KSTLENCQVLQATVNRSKGNR 146
            ++  N Q+L  + NR+KG R
Sbjct: 597 ATSAANLQILCRSCNRAKGAR 617


>gi|238926382|ref|ZP_04658142.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
 gi|238885786|gb|EEQ49424.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
          Length = 760

 Score = 41.2 bits (95), Expect = 0.23,   Method: Composition-based stats.
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           + DHI+P +KGG+S LEN Q+L  + N  KG++ E
Sbjct: 723 NVDHIVPMNKGGRSVLENLQILCRSCNAKKGDQLE 757


>gi|297743366|emb|CBI36233.3| unnamed protein product [Vitis vinifera]
          Length = 657

 Score = 40.8 bits (94), Expect = 0.25,   Method: Composition-based stats.
 Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 3/92 (3%)

Query: 60  DSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDH 117
           D  A P + P       W++   +   G DP  +R D  GNV++         L  D DH
Sbjct: 68  DPKAKPTTKPKIFISLLWKQLNHLGNAGFDPAIFRVDPYGNVLYYH-ADSASPLAWDIDH 126

Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
             P S+GG +   N ++LQ  V R K N+ E 
Sbjct: 127 WFPCSRGGLTVPSNLRILQWQVCRRKHNKLEF 158


>gi|7021740|gb|AAF35421.1| hypothetical protein [Arabidopsis thaliana]
          Length = 188

 Score = 40.8 bits (94), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 51/130 (39%), Gaps = 8/130 (6%)

Query: 22  PSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAE 81
           P +     T     L  NE     E+  +F  +     D  A P + P       W++  
Sbjct: 35  PYKNGPPFTFMPYILQQNESLRCREVDQMFPVI-----DPKARPTTKPKIFLSLLWKQLN 89

Query: 82  KV--RGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATV 139
            +   G DP   R D  GNVV+         L   +DH  P S+GG +   N +++Q   
Sbjct: 90  HLGNAGFDPAVIRIDPYGNVVYFH-ADSASPLAWSFDHWFPCSRGGLTVPSNLRIVQWQA 148

Query: 140 NRSKGNRTEL 149
            ++K ++ E 
Sbjct: 149 RKNKKDKLEF 158


>gi|428169182|gb|EKX38118.1| hypothetical protein GUITHDRAFT_144431 [Guillardia theta CCMP2712]
          Length = 320

 Score = 40.8 bits (94), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 8/83 (9%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH-DYDHILPYSKG 124
           R +P +V +  W  +   +    D W     GN++   + G  G +C+ + DH+ P S+G
Sbjct: 153 RQYPRAVMEVHWIPSWFHKLFKVDMW-----GNII--SVDGLQGAMCYAEEDHLFPSSRG 205

Query: 125 GKSTLENCQVLQATVNRSKGNRT 147
           G+++L N  ++Q   NR K  + 
Sbjct: 206 GRTSLANLVLIQFVANRKKKEKI 228


>gi|359424187|ref|ZP_09215309.1| hypothetical protein GOAMR_24_00140 [Gordonia amarae NBRC 15530]
 gi|358240461|dbj|GAB04891.1| hypothetical protein GOAMR_24_00140 [Gordonia amarae NBRC 15530]
          Length = 379

 Score = 40.8 bits (94), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 19/32 (59%), Positives = 24/32 (75%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGN 145
           D DHI+P+SKGGK+  EN Q+L    NRSKG+
Sbjct: 347 DADHIVPWSKGGKTEAENGQMLCIGCNRSKGD 378


>gi|392944277|ref|ZP_10309919.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
           sp. QA3]
 gi|392287571|gb|EIV93595.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
           sp. QA3]
          Length = 607

 Score = 40.8 bits (94), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 25/33 (75%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           ++DH++P+S+GG S++ N Q+L    NR+KG R
Sbjct: 574 EFDHVIPFSRGGASSVNNLQLLCRRCNRAKGAR 606


>gi|255070577|ref|XP_002507370.1| predicted protein [Micromonas sp. RCC299]
 gi|226522645|gb|ACO68628.1| predicted protein [Micromonas sp. RCC299]
          Length = 1022

 Score = 40.8 bits (94), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 11/59 (18%)

Query: 91  WRKDALGNVVFRKLVGCPGCL------CHDYDHILPYSKGGKSTLENCQVLQATVNRSK 143
           +R D  GNVV +     PG        C +YDH+LP+S+GG S   N +V+    N+ K
Sbjct: 784 FRVDMYGNVVAK-----PGAAENFSVCCTEYDHVLPWSRGGTSKSANIEVISWIANQRK 837


>gi|384536863|ref|YP_005720948.1| HNH nuclease [Sinorhizobium meliloti SM11]
 gi|336033755|gb|AEH79687.1| HNH nuclease [Sinorhizobium meliloti SM11]
          Length = 113

 Score = 40.8 bits (94), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 23/34 (67%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           +YDHI+PY+    STL NCQVL    +R +G +T
Sbjct: 34  EYDHIVPYALSEDSTLSNCQVLCVPCHRGEGAKT 67


>gi|331267282|ref|YP_004326912.1| HNH endonuclease family [Streptococcus oralis Uo5]
 gi|326683954|emb|CBZ01572.1| HNH endonuclease family [Streptococcus oralis Uo5]
          Length = 412

 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
           DHI+P+S GGK+  ENCQ+L    N SKGN
Sbjct: 381 DHIIPWSLGGKTVPENCQLLCKKHNSSKGN 410


>gi|449460568|ref|XP_004148017.1| PREDICTED: uncharacterized protein LOC101221010 [Cucumis sativus]
 gi|449520032|ref|XP_004167038.1| PREDICTED: uncharacterized LOC101221010 [Cucumis sativus]
          Length = 711

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 50/122 (40%), Gaps = 8/122 (6%)

Query: 30  TVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRD 87
           T    +L   E AT  E+  LF  +     +  A P + P       W++   +   G D
Sbjct: 43  TFTPYSLKKRETATARELDKLFPII-----NPKAKPTTKPKLFVSLLWKQLNHLGNAGFD 97

Query: 88  PDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           P   R D  GNV++         L  D DH  P S+GG +   N ++LQ    + K ++ 
Sbjct: 98  PAVIRVDHYGNVLYYH-ADSASPLAWDIDHWFPCSRGGLTVPSNLRILQWQACKRKHHKL 156

Query: 148 EL 149
           E 
Sbjct: 157 EF 158


>gi|303236487|ref|ZP_07323074.1| HNH endonuclease domain protein [Prevotella disiens FB035-09AN]
 gi|302483338|gb|EFL46346.1| HNH endonuclease domain protein [Prevotella disiens FB035-09AN]
          Length = 367

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 6/46 (13%)

Query: 107 CPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C  H      + DHI+P+ KGGK+  +NCQ+L    NR+K  +
Sbjct: 322 CPICGKHFEIEEMEADHIIPWHKGGKTEKDNCQMLCMKCNRTKSGK 367


>gi|416418648|ref|ZP_11689169.1| hypothetical protein CWATWH0003_B254 [Crocosphaera watsonii WH
           0003]
 gi|357259652|gb|EHJ09318.1| hypothetical protein CWATWH0003_B254 [Crocosphaera watsonii WH
           0003]
          Length = 66

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 22/31 (70%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI+ YSKGGK+   N Q L +T NR KGNR
Sbjct: 36  DHIIAYSKGGKTENWNGQALCSTCNRQKGNR 66


>gi|421277404|ref|ZP_15728224.1| HNH endonuclease [Streptococcus mitis SPAR10]
 gi|395876685|gb|EJG87761.1| HNH endonuclease [Streptococcus mitis SPAR10]
          Length = 388

 Score = 40.4 bits (93), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 23/33 (69%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG +   NCQ+L  T NR+KGNR
Sbjct: 356 DADHVTAWSKGGATDESNCQMLCVTHNRAKGNR 388


>gi|239907119|ref|YP_002953860.1| HIT family protein [Desulfovibrio magneticus RS-1]
 gi|239796985|dbj|BAH75974.1| HIT family protein [Desulfovibrio magneticus RS-1]
          Length = 324

 Score = 40.4 bits (93), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 32/58 (55%), Gaps = 5/58 (8%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE-----LSRSDLIQKSSYCRVSGR 166
           D DHI P SKGG +   N QVL +  NRSKGN+ +     L++ + I    +C  + R
Sbjct: 154 DVDHIRPRSKGGGNEYANLQVLCSKCNRSKGNKEDTDYRALAQGEAIPGCPFCYDAAR 211


>gi|348169152|ref|ZP_08876046.1| HNH nuclease [Saccharopolyspora spinosa NRRL 18395]
          Length = 221

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKG 144
           YDHI+P + GG ST EN Q+L A  N+SKG
Sbjct: 189 YDHIIPVALGGSSTEENLQLLCAQCNQSKG 218


>gi|108758899|ref|YP_629743.1| HNH endonuclease domain-containing protein [Myxococcus xanthus DK
           1622]
 gi|108462779|gb|ABF87964.1| HNH endonuclease domain protein [Myxococcus xanthus DK 1622]
          Length = 180

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           YDH+LP ++GGK++ EN  +     N+ KGNRT
Sbjct: 96  YDHVLPRAQGGKTSWENIVIACVPCNQKKGNRT 128


>gi|405355448|ref|ZP_11024674.1| HNH endonuclease [Chondromyces apiculatus DSM 436]
 gi|397091790|gb|EJJ22592.1| HNH endonuclease [Myxococcus sp. (contaminant ex DSM 436)]
          Length = 180

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           YDH+LP ++GGK++ EN  +     N+ KGNRT
Sbjct: 96  YDHVLPRAQGGKTSWENIVIACVPCNQKKGNRT 128


>gi|443674162|ref|ZP_21139202.1| HNH endonuclease [Rhodococcus sp. AW25M09]
 gi|443413333|emb|CCQ17541.1| HNH endonuclease [Rhodococcus sp. AW25M09]
          Length = 390

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 16/31 (51%), Positives = 23/31 (74%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DH+  +S GG + L+NC++L  T NR+KGNR
Sbjct: 360 DHVTAWSNGGNTDLDNCEMLCVTHNRAKGNR 390


>gi|441509730|ref|ZP_20991644.1| hypothetical protein GOACH_15_00980 [Gordonia aichiensis NBRC
           108223]
 gi|441446246|dbj|GAC49605.1| hypothetical protein GOACH_15_00980 [Gordonia aichiensis NBRC
           108223]
          Length = 334

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 24/36 (66%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           L  + DHI+P S+GG ST+EN Q L    NRSK N+
Sbjct: 295 LLLEVDHIVPVSRGGLSTMENLQTLCWKCNRSKSNK 330


>gi|359423170|ref|ZP_09214313.1| hypothetical protein GOAMR_06_01010 [Gordonia amarae NBRC 15530]
 gi|358241617|dbj|GAB03895.1| hypothetical protein GOAMR_06_01010 [Gordonia amarae NBRC 15530]
          Length = 394

 Score = 40.0 bits (92), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 16/31 (51%), Positives = 23/31 (74%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DH+  +S+GG STL NC++L    NR+KGN+
Sbjct: 364 DHVTAWSRGGASTLANCEMLCIVHNRAKGNK 394


>gi|254295258|ref|YP_003061281.1| HNH endonuclease [Hirschia baltica ATCC 49814]
 gi|254043789|gb|ACT60584.1| HNH endonuclease [Hirschia baltica ATCC 49814]
          Length = 85

 Score = 40.0 bits (92), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           +DHI+P SK G   +EN Q+  A  N++KGNR
Sbjct: 53  FDHIIPKSKHGDDKIENLQLAHAICNKTKGNR 84


>gi|114800298|ref|YP_762067.1| HNH endonuclease domain-containing protein [Hyphomonas neptunium
           ATCC 15444]
 gi|114740472|gb|ABI78597.1| HNH endonuclease domain protein [Hyphomonas neptunium ATCC 15444]
          Length = 197

 Score = 40.0 bits (92), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P SKGG++T EN     +  N +KG +
Sbjct: 97  NVFLRDGFKCAYCGAHDNLTFDHVIPRSKGGRTTWENIVTACSPCNLAKGGK 148


>gi|227484896|ref|ZP_03915212.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC
           51172]
 gi|227237051|gb|EEI87066.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC
           51172]
          Length = 121

 Score = 40.0 bits (92), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 21/31 (67%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI+P+SKGGK+  ENCQ+L    N  K N 
Sbjct: 91  DHIVPWSKGGKTIAENCQMLCGDCNIKKSNH 121


>gi|422793041|ref|ZP_16845739.1| HNH endonuclease [Escherichia coli TA007]
 gi|323970450|gb|EGB65712.1| HNH endonuclease [Escherichia coli TA007]
 gi|380248683|emb|CBW53077.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Hadar]
          Length = 390

 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 43/97 (44%), Gaps = 29/97 (29%)

Query: 64  NPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHD--------- 114
           N R F   +K+Q ++       R  D+  +  + N        CP C   D         
Sbjct: 309 NIRFFDEKIKKQAYK-------RQTDKAEQQGISN--------CPLCASSDNANKTKIWA 353

Query: 115 -----YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
                 DH+  +SKGG+S  +NC++L  T NRSKGN+
Sbjct: 354 LKEMEADHVSAWSKGGESKPDNCEMLCVTHNRSKGNK 390


>gi|323137342|ref|ZP_08072420.1| HNH endonuclease [Methylocystis sp. ATCC 49242]
 gi|322397329|gb|EFX99852.1| HNH endonuclease [Methylocystis sp. ATCC 49242]
          Length = 190

 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P SKGG +T EN     +  N  KG+R
Sbjct: 90  NVFLRDRFTCQYCGAHDELTFDHVIPRSKGGATTWENVVAACSPCNLRKGDR 141


>gi|300088252|ref|YP_003758774.1| hypothetical protein Dehly_1158 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
 gi|299527985|gb|ADJ26453.1| protein of unknown function DUF262 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
          Length = 388

 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGN 145
           D DH+  +SKGG S+ ENC++L  T NR+KGN
Sbjct: 356 DADHVSAWSKGGDSSPENCEMLCITHNRAKGN 387


>gi|367467210|ref|ZP_09467189.1| Tellurium resistance protein terE [Patulibacter sp. I11]
 gi|365817693|gb|EHN12644.1| Tellurium resistance protein terE [Patulibacter sp. I11]
          Length = 176

 Score = 40.0 bits (92), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 23/38 (60%)

Query: 107 CPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKG 144
           C       YDH++P + GG +T+EN QVL A  N+ KG
Sbjct: 135 CGATFDLQYDHVIPIALGGANTVENLQVLCAPCNQRKG 172


>gi|379707873|ref|YP_005263078.1| hypothetical protein NOCYR_1649 [Nocardia cyriacigeorgica GUH-2]
 gi|374845372|emb|CCF62438.1| conserved protein of unknown function; putative endonuclease domain
           [Nocardia cyriacigeorgica GUH-2]
          Length = 549

 Score = 40.0 bits (92), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 36/81 (44%), Gaps = 20/81 (24%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           RS P  VK + W+       RD  R              V C      ++DHI+P S+GG
Sbjct: 488 RSIPQEVKAEVWQ-------RDGGR-------------CVECGDGHYLEFDHIIPLSRGG 527

Query: 126 KSTLENCQVLQATVNRSKGNR 146
            ++  N Q+L    NR+KG R
Sbjct: 528 ATSATNLQILCRACNRAKGAR 548


>gi|402848418|ref|ZP_10896676.1| HNH endonuclease family protein [Rhodovulum sp. PH10]
 gi|402501324|gb|EJW12978.1| HNH endonuclease family protein [Rhodovulum sp. PH10]
          Length = 172

 Score = 39.7 bits (91), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C  HD   +DH++P S GG++T +N     +  N  KGN+T
Sbjct: 72  NVFLRDRFSCQYCGSHDDLTFDHLIPRSLGGQTTWDNVVTACSACNLKKGNKT 124


>gi|224143811|ref|XP_002336083.1| predicted protein [Populus trichocarpa]
 gi|222871397|gb|EEF08528.1| predicted protein [Populus trichocarpa]
          Length = 188

 Score = 39.7 bits (91), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 10/114 (8%)

Query: 39  NELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDAL 96
           +E+    E+  +F  +    A  TA P+ F        W +   +   G DP   R D  
Sbjct: 53  HEILRTRELDQMFPVIE-PKAKQTAKPKIF----ISLLWRQLNHLGNAGFDPAVIRVDPY 107

Query: 97  GNVV-FRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           GNV+ F      P  L  D DH  P  +GG +   N ++LQ  V + K N+ E 
Sbjct: 108 GNVLYFHADKASP--LAWDIDHWFPCPRGGLTVPRNLRILQRQVCKRKHNKLEF 159


>gi|224058715|ref|XP_002299617.1| predicted protein [Populus trichocarpa]
 gi|222846875|gb|EEE84422.1| predicted protein [Populus trichocarpa]
          Length = 188

 Score = 39.7 bits (91), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 10/114 (8%)

Query: 39  NELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDAL 96
           +E+    E+  +F  +    A  TA P+ F        W +   +   G DP   R D  
Sbjct: 53  HEILRTRELDQMFPVIE-PKAKQTAKPKIF----ISLLWRQLNHLGNAGFDPAVIRVDPY 107

Query: 97  GNVV-FRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           GNV+ F      P  L  D DH  P  +GG +   N ++LQ  V + K N+ E 
Sbjct: 108 GNVLYFHADKASP--LAWDIDHWFPCPRGGLTVPRNLRILQRQVCKRKHNKLEF 159


>gi|401564832|ref|ZP_10805697.1| type III restriction enzyme, res subunit [Selenomonas sp. FOBRC6]
 gi|400188432|gb|EJO22596.1| type III restriction enzyme, res subunit [Selenomonas sp. FOBRC6]
          Length = 754

 Score = 39.7 bits (91), Expect = 0.65,   Method: Composition-based stats.
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           + DHI P + GG+S L+N QVL  + N SKG++ E
Sbjct: 719 EVDHIKPMNDGGQSVLDNLQVLCKSCNASKGDKAE 753


>gi|159897861|ref|YP_001544108.1| HNH endonuclease [Herpetosiphon aurantiacus DSM 785]
 gi|159890900|gb|ABX03980.1| HNH endonuclease [Herpetosiphon aurantiacus DSM 785]
          Length = 321

 Score = 39.7 bits (91), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 25/34 (73%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           ++DHI+P+SKGG +T+ N Q+L    N +KG+R 
Sbjct: 288 EFDHIIPHSKGGANTVGNVQLLCRKCNLAKGDRI 321


>gi|169346887|ref|ZP_02865835.1| type III restriction enzyme, res subunit family [Clostridium
           perfringens C str. JGS1495]
 gi|169296946|gb|EDS79070.1| type III restriction enzyme, res subunit family [Clostridium
           perfringens C str. JGS1495]
          Length = 740

 Score = 39.3 bits (90), Expect = 0.71,   Method: Composition-based stats.
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           DHI P SKGG +TL+N Q+L +  N+ KG++ E
Sbjct: 708 DHIKPISKGGLTTLDNLQLLCSKCNKIKGDKLE 740


>gi|308535332|ref|YP_003933733.1| HNH endonuclease [Geobacter bemidjiensis Bem]
 gi|308052677|gb|ADO00821.1| HNH endonuclease [Geobacter bemidjiensis Bem]
          Length = 161

 Score = 39.3 bits (90), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 39/70 (55%), Gaps = 7/70 (10%)

Query: 78  EKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCHD---YDHILPYSKGGKSTLENCQ 133
           E+A+  R R P      A+ NVV+ +  G C  C  ++   +DHI+P+SKGG  T  N Q
Sbjct: 91  EQADSRRKRQPI---ARAVRNVVWNRDGGVCVECGSNENLEFDHIIPHSKGGSDTERNIQ 147

Query: 134 VLQATVNRSK 143
           +L    NR+K
Sbjct: 148 LLCMKCNRTK 157


>gi|297830140|ref|XP_002882952.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328792|gb|EFH59211.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 727

 Score = 39.3 bits (90), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 26/100 (26%), Positives = 41/100 (41%), Gaps = 3/100 (3%)

Query: 52  TDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPG 109
            D      D  A P + P       W++   +   G DP   R D  GNVV+        
Sbjct: 59  VDAMFPVIDPKARPTTKPKIFLSLLWKQLNHLGNAGFDPAVIRIDPYGNVVYFH-ADSAS 117

Query: 110 CLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
            L   +DH  P S+GG +   N +++Q    ++K ++ E 
Sbjct: 118 PLAWSFDHWFPCSRGGLTVPSNLRIVQWQARKNKKDKLEF 157


>gi|433624626|ref|YP_007258256.1| Putative uncharacterized protein [Mycoplasma cynos C142]
 gi|429534652|emb|CCP24154.1| Putative uncharacterized protein [Mycoplasma cynos C142]
          Length = 104

 Score = 39.3 bits (90), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 3/83 (3%)

Query: 92  RKDALGNVVFRKLVGCPGC-LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELS 150
           ++D  G ++ +   G P    C + DHILP S GGK  LEN   +    N  K N+    
Sbjct: 20  KRDFAGRIMDKSAYGDPNSEFCWNIDHILPVSCGGKDELENFIAVHCETNTEKANKFPHF 79

Query: 151 RSDLIQKSSYCRVSGRDMDLLEL 173
           +++   KS   R S +  +  E+
Sbjct: 80  KAN--NKSFEIRRSNKGFNKYEI 100


>gi|183602772|ref|ZP_02964135.1| hypothetical protein BIFLAC_01035 [Bifidobacterium animalis subsp.
           lactis HN019]
 gi|219683130|ref|YP_002469513.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis AD011]
 gi|241191461|ref|YP_002968855.1| hypothetical protein Balac_1446 [Bifidobacterium animalis subsp.
           lactis Bl-04]
 gi|241196867|ref|YP_002970422.1| hypothetical protein Balat_1446 [Bifidobacterium animalis subsp.
           lactis DSM 10140]
 gi|384190095|ref|YP_005575843.1| hypothetical protein [Bifidobacterium animalis subsp. lactis BB-12]
 gi|384192885|ref|YP_005578632.1| hypothetical protein [Bifidobacterium animalis subsp. lactis CNCM
           I-2494]
 gi|384194458|ref|YP_005580204.1| HNH endonuclease domain protein [Bifidobacterium animalis subsp.
           lactis BLC1]
 gi|384196023|ref|YP_005581768.1| hypothetical protein BalV_1401 [Bifidobacterium animalis subsp.
           lactis V9]
 gi|387821324|ref|YP_006301367.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis B420]
 gi|387823007|ref|YP_006302956.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis Bi-07]
 gi|423679993|ref|ZP_17654869.1| hypothetical protein FEM_15483 [Bifidobacterium animalis subsp.
           lactis BS 01]
 gi|183218010|gb|EDT88658.1| hypothetical protein BIFLAC_01035 [Bifidobacterium animalis subsp.
           lactis HN019]
 gi|219620780|gb|ACL28937.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis AD011]
 gi|240249853|gb|ACS46793.1| hypothetical protein Balac_1446 [Bifidobacterium animalis subsp.
           lactis Bl-04]
 gi|240251421|gb|ACS48360.1| hypothetical protein Balat_1446 [Bifidobacterium animalis subsp.
           lactis DSM 10140]
 gi|289177587|gb|ADC84833.1| Hypothetical protein BIF_00872 [Bifidobacterium animalis subsp.
           lactis BB-12]
 gi|295794454|gb|ADG33989.1| hypothetical protein BalV_1401 [Bifidobacterium animalis subsp.
           lactis V9]
 gi|340365622|gb|AEK30913.1| Hypothetical protein BALAC2494_01292 [Bifidobacterium animalis
           subsp. lactis CNCM I-2494]
 gi|345283317|gb|AEN77171.1| HNH endonuclease domain protein [Bifidobacterium animalis subsp.
           lactis BLC1]
 gi|366040992|gb|EHN17505.1| hypothetical protein FEM_15483 [Bifidobacterium animalis subsp.
           lactis BS 01]
 gi|386654025|gb|AFJ17155.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis B420]
 gi|386655615|gb|AFJ18744.1| HNH endonuclease [Bifidobacterium animalis subsp. lactis Bi-07]
          Length = 372

 Score = 39.3 bits (90), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 25/46 (54%), Gaps = 6/46 (13%)

Query: 107 CPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C  H        DH +P+SKGGK+ LENC +L    N  K N+
Sbjct: 327 CPLCQQHFELSEMQADHNIPWSKGGKTILENCVMLCVKDNIKKSNK 372


>gi|442318514|ref|YP_007358535.1| HNH endonuclease domain-containing protein [Myxococcus stipitatus
           DSM 14675]
 gi|441486156|gb|AGC42851.1| HNH endonuclease domain-containing protein [Myxococcus stipitatus
           DSM 14675]
          Length = 180

 Score = 39.3 bits (90), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           YDH++P ++GGK++ EN  +     N+ KGNRT
Sbjct: 96  YDHVIPRAQGGKTSWENVVIACVPCNQKKGNRT 128


>gi|357455079|ref|XP_003597820.1| hypothetical protein MTR_2g102680 [Medicago truncatula]
 gi|355486868|gb|AES68071.1| hypothetical protein MTR_2g102680 [Medicago truncatula]
          Length = 707

 Score = 39.3 bits (90), Expect = 0.80,   Method: Composition-based stats.
 Identities = 33/117 (28%), Positives = 47/117 (40%), Gaps = 3/117 (2%)

Query: 35  TLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWR 92
           T L   L  +E     F D      D  A P +         W++   +   G DP   R
Sbjct: 43  TFLPYALPEDEVERVSFLDEMFPIIDPKAKPTTKAKIFASILWKQLSHLGNAGFDPAVIR 102

Query: 93  KDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
            D  GNV++         L  D DH  P S+GG + L N ++LQ  V + K ++ E 
Sbjct: 103 VDGYGNVLYYH-ADSASPLAWDIDHWFPCSRGGLTVLSNLRILQRQVCKRKKHKLEF 158


>gi|240255352|ref|NP_188174.4| uncharacterized protein [Arabidopsis thaliana]
 gi|11994223|dbj|BAB01345.1| unnamed protein product [Arabidopsis thaliana]
 gi|332642171|gb|AEE75692.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 729

 Score = 39.3 bits (90), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 26/100 (26%), Positives = 41/100 (41%), Gaps = 3/100 (3%)

Query: 52  TDLRISSADSTANPRSFPYSVKQQCWEKAEKV--RGRDPDRWRKDALGNVVFRKLVGCPG 109
            D      D  A P + P       W++   +   G DP   R D  GNVV+        
Sbjct: 60  VDQMFPVIDPKARPTTKPKIFLSLLWKQLNHLGNAGFDPAVIRIDPYGNVVYFH-ADSAS 118

Query: 110 CLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
            L   +DH  P S+GG +   N +++Q    ++K ++ E 
Sbjct: 119 PLAWSFDHWFPCSRGGLTVPSNLRIVQWQARKNKKDKLEF 158


>gi|154500496|ref|ZP_02038534.1| hypothetical protein BACCAP_04168 [Bacteroides capillosus ATCC
           29799]
 gi|150270727|gb|EDM98023.1| HNH endonuclease domain protein [Pseudoflavonifractor capillosus
           ATCC 29799]
          Length = 750

 Score = 39.3 bits (90), Expect = 0.82,   Method: Composition-based stats.
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHILP SKGGKS  EN Q+L  + N  KG++
Sbjct: 720 DHILPMSKGGKSVPENLQILCRSCNGRKGDQ 750


>gi|326512504|dbj|BAJ99607.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 283

 Score = 39.3 bits (90), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 10/91 (10%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           N++FR    C  C   D    DH++P S+GGK   EN     +  N  KGN+T L  +  
Sbjct: 179 NILFRDGFSCQYCSSEDDLTIDHVIPASRGGKWEWENLVTACSRCNSRKGNKTLLQANMK 238

Query: 155 IQKSSYCRVSGRDMDLLEL----SAYGNVRR 181
           ++K        ++ D++ +    SA+  +RR
Sbjct: 239 LRKIPK---GPKEFDIIAVPLTKSAFRTIRR 266


>gi|417782516|ref|ZP_12430243.1| HNH endonuclease domain protein [Leptospira interrogans str.
           C10069]
 gi|418669102|ref|ZP_13230498.1| HNH endonuclease domain protein [Leptospira interrogans serovar
           Pyrogenes str. 2006006960]
 gi|409954299|gb|EKO08791.1| HNH endonuclease domain protein [Leptospira interrogans str.
           C10069]
 gi|410755141|gb|EKR16775.1| HNH endonuclease domain protein [Leptospira interrogans serovar
           Pyrogenes str. 2006006960]
          Length = 239

 Score = 39.3 bits (90), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 108 PGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSD 153
           PG   H  DHI+P+SKGG + +EN +   A  N  KGN  E S  D
Sbjct: 192 PGTELH-CDHIIPWSKGGLTVIENLRTTCAKCNLGKGNLIETSVWD 236


>gi|450159591|ref|ZP_21879522.1| HNH endonuclease [Streptococcus mutans 66-2A]
 gi|449240999|gb|EMC39646.1| HNH endonuclease [Streptococcus mutans 66-2A]
          Length = 412

 Score = 39.3 bits (90), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 21/30 (70%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGN 145
           DHI P+S GGK+  ENCQ+L    N SKGN
Sbjct: 381 DHITPWSLGGKTVPENCQLLCKKHNSSKGN 410


>gi|254432244|ref|ZP_05045947.1| HNH nuclease [Cyanobium sp. PCC 7001]
 gi|197626697|gb|EDY39256.1| HNH nuclease [Cyanobium sp. PCC 7001]
          Length = 145

 Score = 39.3 bits (90), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 5/73 (6%)

Query: 78  EKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQA 137
           E   K+R R   RWR+ +L  +  ++ + C GC     DH+LP S+GG S  ENC     
Sbjct: 5   ELCPKLRAR---RWRQ-SLHQLTQQRCIYC-GCRSESIDHVLPKSRGGPSVNENCVPACL 59

Query: 138 TVNRSKGNRTELS 150
             N  KG+    S
Sbjct: 60  ACNGRKGDSEAFS 72


>gi|67920391|ref|ZP_00513911.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
           endonuclease [Crocosphaera watsonii WH 8501]
 gi|67857875|gb|EAM53114.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
           endonuclease [Crocosphaera watsonii WH 8501]
          Length = 584

 Score = 38.9 bits (89), Expect = 0.92,   Method: Composition-based stats.
 Identities = 29/87 (33%), Positives = 35/87 (40%), Gaps = 14/87 (16%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH-------DYD 116
           P S PY      W K        P R  K      + +K  G CP C  H       + D
Sbjct: 493 PESSPYDGNWTYWSKRRGTYTGTPARVAK------LLKKQKGICPQCKQHFTPEDLIEVD 546

Query: 117 HILPYSKGGKSTLENCQVLQATVNRSK 143
           HI+P SKGGK T  N Q L    + +K
Sbjct: 547 HIIPKSKGGKDTYNNLQALHRHCHDAK 573


>gi|381180675|ref|ZP_09889513.1| hypothetical protein TresaDRAFT_1271 [Treponema saccharophilum DSM
           2985]
 gi|380767378|gb|EIC01379.1| hypothetical protein TresaDRAFT_1271 [Treponema saccharophilum DSM
           2985]
          Length = 274

 Score = 38.9 bits (89), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 17/58 (29%), Positives = 32/58 (55%), Gaps = 4/58 (6%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL----SRSDLIQKSSYCRVSGRDM 168
           ++HI+P  KGG ++  NCQ++    NR K N   +    +R+++ +K  Y  ++   M
Sbjct: 31  FEHIIPKDKGGTNSFSNCQIVSRKFNRQKYNMNTIFSFTNRNEIYKKCLYASIAHAVM 88


>gi|399071595|ref|ZP_10750043.1| restriction endonuclease [Caulobacter sp. AP07]
 gi|398043268|gb|EJL36188.1| restriction endonuclease [Caulobacter sp. AP07]
          Length = 186

 Score = 38.9 bits (89), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+  R    C  C   D   +DH++P S+GG++T EN     A  N SKG RT
Sbjct: 86  NLFLRDGFSCQYCGAPDDLTFDHVIPRSRGGRTTWENIVTACAPCNLSKGGRT 138


>gi|257456239|ref|ZP_05621436.1| HNH endonuclease domain protein [Treponema vincentii ATCC 35580]
 gi|257446325|gb|EEV21371.1| HNH endonuclease domain protein [Treponema vincentii ATCC 35580]
          Length = 172

 Score = 38.9 bits (89), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 22/31 (70%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DH+  +SKGG +   NCQ+L  T NR+KGNR
Sbjct: 142 DHVSAWSKGGATNSSNCQMLCKTHNRAKGNR 172


>gi|409098020|ref|ZP_11218044.1| hypothetical protein PagrP_06221 [Pedobacter agri PB92]
          Length = 296

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 23/34 (67%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           ++DHI+P+SKGG +T  N Q+L    NR+K  R 
Sbjct: 262 EFDHIIPFSKGGSNTYRNIQLLCEGCNRTKSARI 295


>gi|88803149|ref|ZP_01118675.1| hypothetical protein PI23P_11192 [Polaribacter irgensii 23-P]
 gi|88780715|gb|EAR11894.1| hypothetical protein PI23P_11192 [Polaribacter irgensii 23-P]
          Length = 387

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 24/33 (72%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG +  +NC++L  T NR+KGN+
Sbjct: 355 DADHVTAWSKGGATLKDNCEMLCKTHNRAKGNK 387


>gi|367473211|ref|ZP_09472776.1| conserved hypothetical protein; putative HNH endonuclease
           [Bradyrhizobium sp. ORS 285]
 gi|365274500|emb|CCD85244.1| conserved hypothetical protein; putative HNH endonuclease
           [Bradyrhizobium sp. ORS 285]
          Length = 185

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   D   +DH++P SKGG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFACQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLKKGNLT 137


>gi|365878540|ref|ZP_09418012.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. ORS 375]
 gi|365293573|emb|CCD90543.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. ORS 375]
          Length = 185

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   D   +DH++P SKGG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFACQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLKKGNLT 137


>gi|210623659|ref|ZP_03293968.1| hypothetical protein CLOHIR_01918 [Clostridium hiranonis DSM 13275]
 gi|210153424|gb|EEA84430.1| hypothetical protein CLOHIR_01918 [Clostridium hiranonis DSM 13275]
          Length = 912

 Score = 38.9 bits (89), Expect = 1.2,   Method: Composition-based stats.
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           DHI+P SKGGK+ +EN Q+L    N  KG+  E
Sbjct: 661 DHIVPLSKGGKTVVENLQLLTRWENAQKGDSLE 693


>gi|383825597|ref|ZP_09980744.1| pCQ3_20 [Mycobacterium xenopi RIVM700367]
 gi|383334516|gb|EID12955.1| pCQ3_20 [Mycobacterium xenopi RIVM700367]
          Length = 130

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 23/35 (65%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
            DHILP+S GG  T +N +VL    N ++GNR E+
Sbjct: 92  LDHILPWSLGGPDTEDNLRVLCRPCNSARGNRIEV 126


>gi|456352674|dbj|BAM87119.1| conserved hypothetical protein [Agromonas oligotrophica S58]
          Length = 174

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   D   +DH++P SKGG++T EN     +  N  KGN T
Sbjct: 74  NVFLRDRFACQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLRKGNLT 126


>gi|41408779|ref|NP_961615.1| hypothetical protein MAP2681c [Mycobacterium avium subsp.
           paratuberculosis K-10]
 gi|254774117|ref|ZP_05215633.1| hypothetical protein MaviaA2_05510 [Mycobacterium avium subsp.
           avium ATCC 25291]
 gi|417749729|ref|ZP_12398118.1| restriction endonuclease [Mycobacterium avium subsp.
           paratuberculosis S397]
 gi|41397137|gb|AAS04998.1| hypothetical protein MAP_2681c [Mycobacterium avium subsp.
           paratuberculosis K-10]
 gi|336458728|gb|EGO37688.1| restriction endonuclease [Mycobacterium avium subsp.
           paratuberculosis S397]
          Length = 314

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 34/78 (43%), Gaps = 20/78 (25%)

Query: 90  RWRKDALG-----------NVVFRKLVGCPGC---------LCHDYDHILPYSKGGKSTL 129
           RWRK A G           ++  R    C  C         L  + DHI+P SKGG ST 
Sbjct: 234 RWRKSAAGQRALMTSRLRNSIKVRDHYTCRYCSVSLAAEPHLLLEVDHIVPISKGGMSTP 293

Query: 130 ENCQVLQATVNRSKGNRT 147
           +N Q L    NR+K N+ 
Sbjct: 294 DNLQTLCWRCNRTKSNKV 311


>gi|422012873|ref|ZP_16359511.1| HNH endonuclease domain protein [Actinomyces georgiae F0490]
 gi|394752101|gb|EJF35815.1| HNH endonuclease domain protein [Actinomyces georgiae F0490]
          Length = 94

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 8/45 (17%)

Query: 111 LCHDY--------DHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           LCH          DH++P SKGG STL+N ++     N +KGNR 
Sbjct: 32  LCHQLVRRRDASVDHVVPSSKGGPSTLDNLKLAHRRCNFAKGNRV 76


>gi|338529937|ref|YP_004663271.1| HNH endonuclease domain-containing protein [Myxococcus fulvus HW-1]
 gi|337256033|gb|AEI62193.1| HNH endonuclease domain-containing protein [Myxococcus fulvus HW-1]
          Length = 180

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           +DH+LP ++GGK++ EN  +     N+ KGNRT
Sbjct: 96  HDHVLPRAQGGKTSWENIVIACVPCNQKKGNRT 128


>gi|383643665|ref|ZP_09956071.1| HNH endonuclease [Sphingomonas elodea ATCC 31461]
          Length = 188

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCH---DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+  R    C  C  H    +DH++P ++GG++T EN     A  N  KG RT
Sbjct: 88  NLFLRDKFACQYCGSHRDLTFDHVIPRAQGGRTTWENVATACAPCNLKKGGRT 140


>gi|147669720|ref|YP_001214538.1| HNH endonuclease [Dehalococcoides sp. BAV1]
 gi|146270668|gb|ABQ17660.1| HNH endonuclease [Dehalococcoides sp. BAV1]
          Length = 228

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 24/35 (68%), Gaps = 1/35 (2%)

Query: 115 YDHILPYSKGGKSTL-ENCQVLQATVNRSKGNRTE 148
           +DHI+PYSKGG S + EN Q+L A+ N SK    E
Sbjct: 194 FDHIIPYSKGGSSLVAENIQLLCASCNLSKHANIE 228


>gi|307107571|gb|EFN55813.1| hypothetical protein CHLNCDRAFT_145317 [Chlorella variabilis]
          Length = 276

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           DHI+P+SKGG S   N Q++    N SKGN+++
Sbjct: 230 DHIIPFSKGGPSEEGNAQLVHRFCNSSKGNKSD 262


>gi|226532958|ref|NP_001141933.1| uncharacterized protein LOC100274082 [Zea mays]
 gi|194706496|gb|ACF87332.1| unknown [Zea mays]
 gi|414883692|tpg|DAA59706.1| TPA: hypothetical protein ZEAMMB73_991900 [Zea mays]
          Length = 287

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+++R    C  C C D    DH++P S+GGK   EN     A  N  KG +T
Sbjct: 183 NILYRDDFTCQYCSCGDDLTIDHVIPISRGGKWEWENLVTACARCNSRKGQKT 235


>gi|295690224|ref|YP_003593917.1| HNH endonuclease [Caulobacter segnis ATCC 21756]
 gi|295432127|gb|ADG11299.1| HNH endonuclease [Caulobacter segnis ATCC 21756]
          Length = 186

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+  R    C  C   +   +DH++P S+GG++T EN     A  N SKG RT
Sbjct: 86  NLFLRDAFSCQYCGSTEDLTFDHVIPRSRGGRTTWENIVTACAPCNLSKGGRT 138


>gi|357386016|ref|YP_004900740.1| HNH endonuclease family protein [Pelagibacterium halotolerans B2]
 gi|351594653|gb|AEQ52990.1| HNH endonuclease family protein [Pelagibacterium halotolerans B2]
          Length = 185

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLC-HD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   HD  +DH+LP SKGG++T EN     +  N  K NRT
Sbjct: 85  NVFLRDRFQCQYCGSRHDLTFDHVLPRSKGGRTTWENIVAACSPCNLRKANRT 137


>gi|57233948|ref|YP_181981.1| HNH endonuclease domain-containing protein [Dehalococcoides
           ethenogenes 195]
 gi|57224396|gb|AAW39453.1| HNH endonuclease domain protein [Dehalococcoides ethenogenes 195]
          Length = 232

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 54/130 (41%), Gaps = 36/130 (27%)

Query: 20  PSPSRRRGETTVRSATLLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEK 79
           PS +R+  +  +   + + NE+AT     TL  D+R           + P  +K+  W +
Sbjct: 138 PSDNRKVFKFKLELTSEITNEIAT----PTLDADIR-----------AIPSDIKRIVWAR 182

Query: 80  AEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTL-ENCQVLQAT 138
            +                     K V C       +DH++PYSKGG S + EN Q+L A+
Sbjct: 183 DQG--------------------KCVKCGNNSNLHFDHVIPYSKGGSSLIAENIQLLCAS 222

Query: 139 VNRSKGNRTE 148
            N SK    E
Sbjct: 223 CNLSKHASIE 232


>gi|392942375|ref|ZP_10308017.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
           sp. QA3]
 gi|392285669|gb|EIV91693.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
           sp. QA3]
          Length = 607

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 20/92 (21%)

Query: 55  RISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHD 114
           R+  A    + R+ P +VK + W++                         V C      +
Sbjct: 535 RLVIAPGARDSRAIPQAVKAEVWQRDGG--------------------ACVQCQATEYLE 574

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           +DH++P+S+GG +++ N Q+L    N+ KG R
Sbjct: 575 FDHVIPHSRGGATSVNNLQLLCRRCNQEKGAR 606


>gi|54023566|ref|YP_117808.1| hypothetical protein nfa15980 [Nocardia farcinica IFM 10152]
 gi|54015074|dbj|BAD56444.1| hypothetical protein [Nocardia farcinica IFM 10152]
          Length = 760

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           ++DHI+P S+GG ++  N Q+L    NR KG R
Sbjct: 727 EFDHIIPLSRGGATSAANLQILCRACNRRKGAR 759


>gi|430376075|ref|ZP_19430478.1| hypothetical protein MOMA_02985 [Moraxella macacae 0408225]
 gi|429541306|gb|ELA09334.1| hypothetical protein MOMA_02985 [Moraxella macacae 0408225]
          Length = 387

 Score = 38.1 bits (87), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 23/31 (74%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DH+  +SKGG + ++NC++L    NR+KGNR
Sbjct: 357 DHVTAWSKGGLTGIDNCEMLCKVHNRAKGNR 387


>gi|304321776|ref|YP_003855419.1| hypothetical protein PB2503_11149 [Parvularcula bermudensis
           HTCC2503]
 gi|303300678|gb|ADM10277.1| hypothetical protein PB2503_11149 [Parvularcula bermudensis
           HTCC2503]
          Length = 188

 Score = 37.7 bits (86), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 7/62 (11%)

Query: 92  RKDALG--NVVFRKLVGCPGCLCHD-----YDHILPYSKGGKSTLENCQVLQATVNRSKG 144
           RK A    NV  R +  C  C   D     +DH++P S+GGK+T +N     +  N  KG
Sbjct: 78  RKPAFTRFNVFLRDVFTCQYCGTQDRDNLTFDHLIPRSRGGKTTWQNIVTACSPCNLKKG 137

Query: 145 NR 146
           N+
Sbjct: 138 NK 139


>gi|91974822|ref|YP_567481.1| HNH endonuclease [Rhodopseudomonas palustris BisB5]
 gi|91681278|gb|ABE37580.1| HNH endonuclease [Rhodopseudomonas palustris BisB5]
          Length = 185

 Score = 37.7 bits (86), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T +
Sbjct: 85  NVFLRDRFACQYCGAPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPI 139


>gi|449446780|ref|XP_004141149.1| PREDICTED: uncharacterized protein LOC101207660 [Cucumis sativus]
          Length = 277

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+++R    C  C  H+    DH+LP S+GG+ T EN        N  KG +T
Sbjct: 174 NILYRDNYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKT 226


>gi|406930511|gb|EKD65859.1| hypothetical protein ACD_49C00079G0023 [uncultured bacterium (gcode
           4)]
          Length = 360

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 6/44 (13%)

Query: 107 CPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKG 144
           CP C  H      + DHI P+ + GK++ ENCQ+L    NR K 
Sbjct: 315 CPKCNDHFEINEMEADHITPWHEWGKTSAENCQMLCKNCNRVKS 358


>gi|218440993|ref|YP_002379322.1| HNH endonuclease [Cyanothece sp. PCC 7424]
 gi|218173721|gb|ACK72454.1| HNH endonuclease [Cyanothece sp. PCC 7424]
          Length = 169

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSS 159
           DHILP SKGGK T +N  +   + N  KG+RT  S + +I K+S
Sbjct: 106 DHILPRSKGGKHTWDNVVIACESCNSRKGDRTP-SEAGMILKTS 148


>gi|29826461|ref|NP_828767.1| putative NHN endonuclease [Streptomyces avermitilis MA-4680]
 gi|29611259|dbj|BAC75302.1| putative HNH endonuclease [Streptomyces avermitilis MA-4680]
          Length = 677

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 6/65 (9%)

Query: 86  RDPDRWRKDALGNVVFRKLVG----CPGCLCHDY--DHILPYSKGGKSTLENCQVLQATV 139
           RDP R+      N+++    G    C   L   +  DH++P+ +GG++ +EN Q L A  
Sbjct: 4   RDPQRFFTQNQRNILYDLAEGRCETCSAPLLDGWEADHMVPWVQGGRTVIENGQALCAQC 63

Query: 140 NRSKG 144
           N+ KG
Sbjct: 64  NKGKG 68


>gi|162148898|ref|YP_001603359.1| endonuclease [Gluconacetobacter diazotrophicus PAl 5]
 gi|209545355|ref|YP_002277584.1| HNH endonuclease [Gluconacetobacter diazotrophicus PAl 5]
 gi|161787475|emb|CAP57071.1| putative endonuclease protein [Gluconacetobacter diazotrophicus PAl
           5]
 gi|209533032|gb|ACI52969.1| HNH endonuclease [Gluconacetobacter diazotrophicus PAl 5]
          Length = 186

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 6/55 (10%)

Query: 98  NVVFRKLVGCPGCL----CHD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  CL     HD  +DH++P  KGG++T EN     +  N  KG+R
Sbjct: 83  NVFLRDNFSCQYCLDRLPTHDLTFDHVIPRCKGGRTTWENVVTACSPCNLMKGSR 137


>gi|386773495|ref|ZP_10095873.1| HNH endonuclease [Brachybacterium paraconglomeratum LC44]
          Length = 146

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/94 (27%), Positives = 41/94 (43%), Gaps = 15/94 (15%)

Query: 69  PYSVKQ---QCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH-----------D 114
           PY + Q   + W +    RGRDP++ R      V  + ++   G  C             
Sbjct: 43  PYPLAQIESRAWWEWHWARGRDPEKVRTKIPARV-RQAVIARDGHTCQLCGGEVEPDDVH 101

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
            DHI+P+S GG  T++N +V  +  N  +G   E
Sbjct: 102 LDHIVPWSHGGPDTVKNLRVTHSLCNMKRGAPVE 135


>gi|296313798|ref|ZP_06863739.1| putative HNH endonuclease domain protein [Neisseria polysaccharea
           ATCC 43768]
 gi|296839622|gb|EFH23560.1| putative HNH endonuclease domain protein [Neisseria polysaccharea
           ATCC 43768]
          Length = 376

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 29/89 (32%)

Query: 64  NPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH------DYDH 117
           N R+F  S+KQ+ +EK + +                       CP C         + DH
Sbjct: 311 NIRAFSDSIKQKVYEKQDGI-----------------------CPHCKQKFELSEMEGDH 347

Query: 118 ILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           I P+ +GGK+  ENCQ+L    NR K ++
Sbjct: 348 ITPWCEGGKTIEENCQMLCRDCNRRKSDK 376


>gi|383453402|ref|YP_005367391.1| HNH endonuclease domain-containing protein [Corallococcus
           coralloides DSM 2259]
 gi|380728026|gb|AFE04028.1| HNH endonuclease domain-containing protein [Corallococcus
           coralloides DSM 2259]
          Length = 180

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 14/33 (42%), Positives = 23/33 (69%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           YDH+LP ++GG+++ +N  +     N+ KGNRT
Sbjct: 96  YDHVLPRAQGGRTSWDNIVIACVPCNQKKGNRT 128


>gi|86160544|ref|YP_467329.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-C]
 gi|85777055|gb|ABC83892.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-C]
          Length = 334

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 13/111 (11%)

Query: 37  LDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDAL 96
           +D  LA  E+     T+   +S   + +PR  P  V+++ WE     R R   +WR  A 
Sbjct: 219 VDLLLAKAEKRKGALTERPRASVRPSRDPRHIPAHVRREVWE-----RDRGCCQWRL-AN 272

Query: 97  GNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           G +       C        DH++P ++GG+ T  N +VL A  N     R 
Sbjct: 273 GEI-------CGSTHALQLDHVVPRARGGEPTAGNLRVLCAAHNLEAARRV 316


>gi|16126439|ref|NP_421003.1| HNH endonuclease [Caulobacter crescentus CB15]
 gi|221235220|ref|YP_002517656.1| HNH endonuclease family protein [Caulobacter crescentus NA1000]
 gi|13423703|gb|AAK24171.1| HNH endonuclease family protein [Caulobacter crescentus CB15]
 gi|220964392|gb|ACL95748.1| HNH endonuclease family protein [Caulobacter crescentus NA1000]
          Length = 187

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+  R    C  C   +   +DH++P S+GG++T EN     A  N SKG RT
Sbjct: 87  NLFLRDSFSCQYCGSPEDLTFDHVIPRSRGGRTTWENIVTACAPCNLSKGGRT 139


>gi|414165213|ref|ZP_11421460.1| hypothetical protein HMPREF9697_03361 [Afipia felis ATCC 53690]
 gi|410882993|gb|EKS30833.1| hypothetical protein HMPREF9697_03361 [Afipia felis ATCC 53690]
          Length = 185

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  CL  +   +DHI+P S+GG++T EN     +  N  KG+ T
Sbjct: 85  NVFLRDRFSCQYCLSQEDLTFDHIIPRSRGGQTTWENVVAACSPCNLRKGSMT 137


>gi|383806641|ref|ZP_09962203.1| restriction endonuclease [Candidatus Aquiluna sp. IMCC13023]
 gi|383299811|gb|EIC92424.1| restriction endonuclease [Candidatus Aquiluna sp. IMCC13023]
          Length = 178

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 21/35 (60%)

Query: 113 HDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           +  DHI+P S+GGK T EN        N SKGNRT
Sbjct: 101 NTIDHIMPKSRGGKDTWENLVACCLECNNSKGNRT 135


>gi|316932026|ref|YP_004107008.1| HNH endonuclease [Rhodopseudomonas palustris DX-1]
 gi|315599740|gb|ADU42275.1| HNH endonuclease [Rhodopseudomonas palustris DX-1]
          Length = 185

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   D   +DHI+P SKGG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFVCQYCGAPDDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLT 137


>gi|281420445|ref|ZP_06251444.1| putative HNH endonuclease domain protein [Prevotella copri DSM
           18205]
 gi|281405502|gb|EFB36182.1| putative HNH endonuclease domain protein [Prevotella copri DSM
           18205]
          Length = 388

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 14/54 (25%)

Query: 107 CPGCLCHD--------------YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C C D               DH+  +  GG +   NCQ+L  T NRSKGNR
Sbjct: 335 CPLCACGDGPNKDKIWALKEMEADHVKAWVNGGATDESNCQMLCKTHNRSKGNR 388


>gi|196231796|ref|ZP_03130653.1| HNH endonuclease [Chthoniobacter flavus Ellin428]
 gi|196224268|gb|EDY18781.1| HNH endonuclease [Chthoniobacter flavus Ellin428]
          Length = 194

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           + DH++P S+GGK+T ENC +    VN  K +R
Sbjct: 121 NIDHVVPRSRGGKTTWENCVLAHREVNTRKADR 153


>gi|39933633|ref|NP_945909.1| HNH endonuclease [Rhodopseudomonas palustris CGA009]
 gi|192288988|ref|YP_001989593.1| HNH endonuclease [Rhodopseudomonas palustris TIE-1]
 gi|39647479|emb|CAE26000.1| HNH endonuclease:HNH nuclease [Rhodopseudomonas palustris CGA009]
 gi|192282737|gb|ACE99117.1| HNH endonuclease [Rhodopseudomonas palustris TIE-1]
          Length = 185

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   D   +DHI+P SKGG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFVCQYCGAPDDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLT 137


>gi|295105782|emb|CBL03325.1| Restriction endonuclease [Gordonibacter pamelaeae 7-10-1-b]
          Length = 113

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 20/31 (64%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI+P S+GGKS + N Q L    N  KGNR
Sbjct: 80  DHIVPVSRGGKSVMGNLQTLCEVCNCGKGNR 110


>gi|453363910|dbj|GAC80436.1| hypothetical protein GM1_017_00950 [Gordonia malaquae NBRC 108250]
          Length = 141

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 25/46 (54%)

Query: 102 RKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           R L G  G   + +DH++P S+GG +T  NC       N  KG+RT
Sbjct: 78  RHLCGYCGLHANTWDHVMPQSRGGGNTWFNCIAACGPCNWDKGDRT 123


>gi|260887966|ref|ZP_05899229.1| putative HNH endonuclease domain protein [Selenomonas sputigena
           ATCC 35185]
 gi|330838550|ref|YP_004413130.1| hypothetical protein Selsp_0703 [Selenomonas sputigena ATCC 35185]
 gi|260862217|gb|EEX76717.1| putative HNH endonuclease domain protein [Selenomonas sputigena
           ATCC 35185]
 gi|329746314|gb|AEB99670.1| protein of unknown function DUF262 [Selenomonas sputigena ATCC
           35185]
          Length = 374

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%), Gaps = 6/46 (13%)

Query: 107 CPGCLCH-DYD-----HILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C  H D+D     HI+P+SKGG +  +NCQ+L  + N  K ++
Sbjct: 328 CPVCKEHFDFDEMEGDHIMPWSKGGHTQPDNCQMLCKSCNGKKTDK 373


>gi|291542181|emb|CBL15291.1| Restriction endonuclease [Ruminococcus bromii L2-63]
          Length = 374

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 23/31 (74%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI P+SKGG++ L+NCQ+L  + N  K ++
Sbjct: 343 DHIKPWSKGGRTILDNCQMLCKSCNAKKTDK 373


>gi|146343281|ref|YP_001208329.1| HNH endonuclease:HNH nuclease [Bradyrhizobium sp. ORS 278]
 gi|148252701|ref|YP_001237286.1| HNH endonuclease [Bradyrhizobium sp. BTAi1]
 gi|146196087|emb|CAL80114.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. ORS 278]
 gi|146404874|gb|ABQ33380.1| putative HNH endonuclease [Bradyrhizobium sp. BTAi1]
          Length = 185

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 31/66 (46%), Gaps = 3/66 (4%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           NV  R    C  C   D   +DH++P SKGG++T EN     +  N  KGN T       
Sbjct: 85  NVFLRDRFVCQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLKKGNLTPQQAKMF 144

Query: 155 IQKSSY 160
            +++ Y
Sbjct: 145 PKQAPY 150


>gi|407463091|ref|YP_006774408.1| HNH endonuclease [Candidatus Nitrosopumilus koreensis AR1]
 gi|407046713|gb|AFS81466.1| HNH endonuclease [Candidatus Nitrosopumilus koreensis AR1]
          Length = 228

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 35/81 (43%), Gaps = 21/81 (25%)

Query: 64  NPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSK 123
           N R  P  +KQ+ W++ E                     K V C       +DH++P+SK
Sbjct: 163 NRRIIPTKIKQEVWKRDEG--------------------KCVKCGSKENLHFDHVIPHSK 202

Query: 124 GGKS-TLENCQVLQATVNRSK 143
           GG S T EN Q+L A  N  K
Sbjct: 203 GGTSITAENIQILCAKHNLEK 223


>gi|223935746|ref|ZP_03627662.1| HNH endonuclease [bacterium Ellin514]
 gi|223895754|gb|EEF62199.1| HNH endonuclease [bacterium Ellin514]
          Length = 194

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 19/32 (59%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
            DHILP S+GGK   +NC      +N  KGNR
Sbjct: 123 IDHILPVSRGGKDAWDNCVWACKAINSRKGNR 154


>gi|365173256|ref|ZP_09361377.1| hypothetical protein HMPREF1006_02508 [Synergistes sp. 3_1_syn1]
 gi|363616949|gb|EHL68368.1| hypothetical protein HMPREF1006_02508 [Synergistes sp. 3_1_syn1]
          Length = 374

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 6/46 (13%)

Query: 107 CPGCLCH-DYD-----HILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           CP C  H +YD     HI P+SKGG++T +NCQ+L  + N  K ++
Sbjct: 328 CPVCGEHFEYDEMEGDHIKPWSKGGQTTPDNCQMLCKSCNAKKTDK 373


>gi|27376478|ref|NP_768007.1| hypothetical protein bll1367 [Bradyrhizobium japonicum USDA 110]
 gi|27349618|dbj|BAC46632.1| bll1367 [Bradyrhizobium japonicum USDA 110]
          Length = 268

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T       
Sbjct: 168 NVFLRDRFACQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPAQAKMF 227

Query: 155 IQKSSY 160
            ++S++
Sbjct: 228 PRQSAF 233


>gi|418175095|ref|ZP_12811693.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41437]
 gi|353843655|gb|EHE23699.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41437]
          Length = 178

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 23/36 (63%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           L  + DHI+P SKGG ST +N Q L    NRSK N+
Sbjct: 141 LLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNK 176


>gi|422850756|ref|ZP_16897426.1| HNH endonuclease domain protein [Streptococcus sanguinis SK150]
 gi|422857591|ref|ZP_16904241.1| HNH endonuclease domain protein [Streptococcus sanguinis SK1057]
 gi|325695504|gb|EGD37404.1| HNH endonuclease domain protein [Streptococcus sanguinis SK150]
 gi|327462774|gb|EGF09096.1| HNH endonuclease domain protein [Streptococcus sanguinis SK1057]
          Length = 329

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 23/37 (62%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           L  + DHI+P SKGG ST +N Q L    NRSK N+ 
Sbjct: 292 LLLEIDHIIPISKGGLSTRDNLQTLCWKCNRSKSNKI 328


>gi|257790601|ref|YP_003181207.1| HNH endonuclease [Eggerthella lenta DSM 2243]
 gi|257474498|gb|ACV54818.1| HNH endonuclease [Eggerthella lenta DSM 2243]
          Length = 127

 Score = 37.4 bits (85), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 20/31 (64%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI+P S+GGKS + N Q L    N  KGNR
Sbjct: 94  DHIVPVSRGGKSVMSNLQTLCEDCNCGKGNR 124


>gi|357111451|ref|XP_003557526.1| PREDICTED: uncharacterized protein LOC100822155 [Brachypodium
           distachyon]
          Length = 287

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 46/93 (49%), Gaps = 10/93 (10%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           N+++R    C  C   D    DH++P S+GGK   EN     +  N  KG++T L  +  
Sbjct: 183 NILYRDGFSCQYCSSEDDLTIDHVIPTSRGGKWEWENLVAACSRCNSRKGHKTLLQANMK 242

Query: 155 IQKSSYCRVSGRDMDLLEL----SAYGNVRRGQ 183
           ++K      + ++ D++ +    SA+  +RR Q
Sbjct: 243 LRKVPK---APKEFDIIAVPLTKSAFRTIRRKQ 272


>gi|392529601|ref|ZP_10276738.1| HNH endonuclease [Carnobacterium maltaromaticum ATCC 35586]
          Length = 113

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 1/49 (2%)

Query: 108 PGCLCHDYDHILPYSKGGK-STLENCQVLQATVNRSKGNRTELSRSDLI 155
           P  L    DHI+P +KGG  S +EN Q+  AT NR K ++   ++++ I
Sbjct: 42  PNPLAPTVDHIIPVNKGGHPSAMENLQLAHATCNRQKSDKLFNAKTNKI 90


>gi|295106530|emb|CBL04073.1| Restriction endonuclease [Gordonibacter pamelaeae 7-10-1-b]
          Length = 113

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 20/31 (64%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI+P S+GGKS + N Q L    N  KGNR
Sbjct: 80  DHIVPVSRGGKSVMSNLQTLCEDCNCGKGNR 110


>gi|197121751|ref|YP_002133702.1| HNH nuclease [Anaeromyxobacter sp. K]
 gi|196171600|gb|ACG72573.1| HNH nuclease [Anaeromyxobacter sp. K]
          Length = 382

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 54/129 (41%), Gaps = 37/129 (28%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCH--DYDHILPYS 122
           P   P  V+++ WE       RD         G   F    G P    H  + DH++P +
Sbjct: 280 PDHIPAHVRREVWE-------RD--------CGRCTFVLPSGEPCGSTHRLELDHVVPVA 324

Query: 123 KGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSSYCRVSGRDMDLLELSAYGNVRRG 182
           +GG ST++N ++      R +G+  E +R  L              D L + AY   RRG
Sbjct: 325 RGGASTVDNLRI------RCRGHNLEEARRAL-------------GDAL-VDAYAPRRRG 364

Query: 183 QDSGGCRIQ 191
            + GG R++
Sbjct: 365 GEHGGARVE 373


>gi|85709598|ref|ZP_01040663.1| HNH endonuclease family protein [Erythrobacter sp. NAP1]
 gi|85688308|gb|EAQ28312.1| HNH endonuclease family protein [Erythrobacter sp. NAP1]
          Length = 202

 Score = 37.4 bits (85), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   D   +DH++P   GGK+T EN     A  N  KG RT
Sbjct: 102 NVFLRDKFSCQYCGSPDNLTFDHVMPRRLGGKTTWENIITACAPCNMKKGGRT 154


>gi|444915323|ref|ZP_21235457.1| HNH endonuclease family protein [Cystobacter fuscus DSM 2262]
 gi|444713552|gb|ELW54449.1| HNH endonuclease family protein [Cystobacter fuscus DSM 2262]
          Length = 179

 Score = 37.4 bits (85), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 14/33 (42%), Positives = 23/33 (69%)

Query: 115 YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           YDH++P ++GG++T EN  +     N++KG RT
Sbjct: 95  YDHVVPRAQGGRTTWENIVIACVPCNQAKGGRT 127


>gi|414171531|ref|ZP_11426442.1| hypothetical protein HMPREF9695_00088 [Afipia broomeae ATCC 49717]
 gi|410893206|gb|EKS40996.1| hypothetical protein HMPREF9695_00088 [Afipia broomeae ATCC 49717]
          Length = 185

 Score = 37.4 bits (85), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   +   +DHI+P S+GG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFSCQYCTSQEDLTFDHIIPRSRGGQTTWENVVAACSPCNLRKGNMT 137


>gi|421597177|ref|ZP_16040843.1| hypothetical protein BCCGELA001_07611 [Bradyrhizobium sp.
           CCGE-LA001]
 gi|404270711|gb|EJZ34725.1| hypothetical protein BCCGELA001_07611 [Bradyrhizobium sp.
           CCGE-LA001]
          Length = 185

 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T       
Sbjct: 85  NVFLRDRFACQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPAQAKMF 144

Query: 155 IQKSSY 160
            ++S++
Sbjct: 145 PRQSAF 150


>gi|402771107|ref|YP_006590644.1| HNH endonuclease [Methylocystis sp. SC2]
 gi|401773127|emb|CCJ05993.1| HNH endonuclease [Methylocystis sp. SC2]
          Length = 172

 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  H+   +DH++P SKGG +T EN     +  N  KG+R
Sbjct: 72  NVFLRDRFTCQYCGEHEELTFDHVIPRSKGGATTWENVVAACSPCNLRKGDR 123


>gi|347535197|ref|YP_004842622.1| hypothetical protein FBFL15_0247 [Flavobacterium branchiophilum
           FL-15]
 gi|345528355|emb|CCB68385.1| Protein of unknown function [Flavobacterium branchiophilum FL-15]
          Length = 366

 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI P+ +GGK+T ENCQ+L    NR K  +
Sbjct: 336 DHITPWHEGGKTTAENCQMLCKLDNRIKSGK 366


>gi|365891320|ref|ZP_09429753.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. STM
           3809]
 gi|365332734|emb|CCE02284.1| putative HNH endonuclease:HNH nuclease [Bradyrhizobium sp. STM
           3809]
          Length = 185

 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 31/66 (46%), Gaps = 3/66 (4%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           NV  R    C  C   D   +DH++P SKGG++T EN     +  N  KGN T       
Sbjct: 85  NVFLRDRFVCQYCHSGDDLTFDHVIPRSKGGQTTWENVVAACSPCNLRKGNLTPQQAKMF 144

Query: 155 IQKSSY 160
            +++ Y
Sbjct: 145 PKQAPY 150


>gi|332187299|ref|ZP_08389038.1| HNH endonuclease family protein [Sphingomonas sp. S17]
 gi|332012720|gb|EGI54786.1| HNH endonuclease family protein [Sphingomonas sp. S17]
          Length = 188

 Score = 37.0 bits (84), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCH---DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+  R    C  C  H    +DH++P ++GG++T EN     A  N  KG RT
Sbjct: 88  NLFLRDRFSCQYCGSHRDLTFDHVIPRAQGGRTTWENVVTACAPCNLKKGGRT 140


>gi|296532418|ref|ZP_06895145.1| arsenate reductase [Roseomonas cervicalis ATCC 49957]
 gi|296267261|gb|EFH13159.1| arsenate reductase [Roseomonas cervicalis ATCC 49957]
          Length = 42

 Score = 37.0 bits (84), Expect = 3.5,   Method: Composition-based stats.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           DH++P+SKGG +T EN Q+L    N  K N  E
Sbjct: 4   DHVIPFSKGGHTTWENYQLLCKPCNVKKSNSIE 36


>gi|425736696|ref|ZP_18854994.1| hypothetical protein C272_16183 [Brevibacterium casei S18]
 gi|425477862|gb|EKU45079.1| hypothetical protein C272_16183 [Brevibacterium casei S18]
          Length = 331

 Score = 37.0 bits (84), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 22/36 (61%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           L  + DHI P SKGG S +EN Q L    NRSK N+
Sbjct: 295 LLLEVDHIKPISKGGLSEIENLQTLCWRCNRSKSNK 330


>gi|395804648|ref|ZP_10483884.1| HNH endonuclease [Flavobacterium sp. F52]
 gi|395433267|gb|EJF99224.1| HNH endonuclease [Flavobacterium sp. F52]
          Length = 186

 Score = 37.0 bits (84), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           DHI P SKGGK+ + N ++   T N S+GNR  +
Sbjct: 124 DHIQPASKGGKNNMRNGRIFCGTCNTSRGNRANV 157


>gi|390937545|ref|YP_006395105.1| putative endonuclease [Bifidobacterium bifidum BGN4]
 gi|389891159|gb|AFL05226.1| putative endonuclease [Bifidobacterium bifidum BGN4]
          Length = 240

 Score = 37.0 bits (84), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 1/54 (1%)

Query: 94  DALGNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           +  GN  + +L GC G    D DHI+PYS GG  T+ N +      N S+ +R 
Sbjct: 17  ETWGNACWLRLPGCTGVGEED-DHIVPYSHGGMDTVANIRRACKHCNASRQDRV 69


>gi|90422135|ref|YP_530505.1| HNH endonuclease [Rhodopseudomonas palustris BisB18]
 gi|90104149|gb|ABD86186.1| HNH endonuclease [Rhodopseudomonas palustris BisB18]
          Length = 185

 Score = 37.0 bits (84), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFSCQYCGAPEDLTFDHIMPRSKGGQTTWENVVAACSPCNLRKGNMT 137


>gi|67924112|ref|ZP_00517558.1| HNH endonuclease [Crocosphaera watsonii WH 8501]
 gi|67854037|gb|EAM49350.1| HNH endonuclease [Crocosphaera watsonii WH 8501]
          Length = 276

 Score = 37.0 bits (84), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 13/103 (12%)

Query: 41  LATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVV 100
             T E +    T+  I    +   P S PY      W K        P R     +  ++
Sbjct: 162 FQTREYVLYHHTETEIKR-HTLVKPESSPYDGNWTYWSKRRGTYTGTPAR-----VAKLL 215

Query: 101 FRKLVGCPGCLCH-------DYDHILPYSKGGKSTLENCQVLQ 136
            ++   CP C  H       + DHI+P SKGGK T  N Q L 
Sbjct: 216 KKQKGICPQCKQHFTPEDLIEVDHIIPKSKGGKDTYNNLQALH 258


>gi|449489536|ref|XP_004158341.1| PREDICTED: uncharacterized protein LOC101226520 [Cucumis sativus]
          Length = 250

 Score = 37.0 bits (84), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+++R    C  C  H+    DH+LP S+GG+ T EN        N  KG +T
Sbjct: 174 NILYRDNYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKT 226


>gi|358455199|ref|ZP_09165427.1| Protein of unknown function DUF2510 [Frankia sp. CN3]
 gi|357081452|gb|EHI90883.1| Protein of unknown function DUF2510 [Frankia sp. CN3]
          Length = 545

 Score = 37.0 bits (84), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 42/114 (36%), Gaps = 26/114 (22%)

Query: 36  LLDNELATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDA 95
           L D E A      TL    R+  A    + RS P  VK   W+                 
Sbjct: 454 LSDPEYAAAVLAGTLRVAKRLVLAPGQRDTRSIPPDVKAAVWQ----------------- 496

Query: 96  LGNVVFRKLVGCPGCLCHDY---DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
                 R    C  C   +Y   DH++P+S GG +++ N Q+L    N  KG R
Sbjct: 497 ------RDGGACAQCHATEYLEFDHVIPHSLGGATSVNNLQLLCRRCNLEKGAR 544


>gi|168484153|ref|ZP_02709105.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           CDC1873-00]
 gi|168493974|ref|ZP_02718117.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           CDC3059-06]
 gi|169834462|ref|YP_001693806.1| HNH endonuclease domain-containing protein [Streptococcus
           pneumoniae Hungary19A-6]
 gi|417685762|ref|ZP_12335042.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41301]
 gi|417695447|ref|ZP_12344627.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47368]
 gi|418079981|ref|ZP_12717197.1| HNH endonuclease family protein [Streptococcus pneumoniae 6735-05]
 gi|418088689|ref|ZP_12725850.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43265]
 gi|418090912|ref|ZP_12728057.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44452]
 gi|418097696|ref|ZP_12734798.1| HNH endonuclease family protein [Streptococcus pneumoniae 6901-05]
 gi|418106621|ref|ZP_12743667.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41410]
 gi|418109254|ref|ZP_12746283.1| HNH endonuclease family protein [Streptococcus pneumoniae GA49447]
 gi|418113876|ref|ZP_12750869.1| HNH endonuclease family protein [Streptococcus pneumoniae 5787-06]
 gi|418134007|ref|ZP_12770867.1| HNH endonuclease family protein [Streptococcus pneumoniae GA11426]
 gi|418158923|ref|ZP_12795629.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17227]
 gi|418161331|ref|ZP_12798022.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17328]
 gi|418168382|ref|ZP_12805028.1| HNH endonuclease family protein [Streptococcus pneumoniae GA19077]
 gi|418218031|ref|ZP_12844699.1| HNH endonuclease family protein [Streptococcus pneumoniae NP127]
 gi|418220228|ref|ZP_12846885.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47751]
 gi|418237869|ref|ZP_12864426.1| HNH endonuclease family protein [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|419422272|ref|ZP_13962491.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43264]
 gi|419488280|ref|ZP_14028033.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44386]
 gi|419520261|ref|ZP_14059860.1| HNH endonuclease family protein [Streptococcus pneumoniae GA05245]
 gi|419525050|ref|ZP_14064616.1| HNH endonuclease family protein [Streptococcus pneumoniae GA14373]
 gi|421248645|ref|ZP_15705108.1| HNH endonuclease family protein [Streptococcus pneumoniae 2082239]
 gi|421271937|ref|ZP_15722784.1| HNH endonuclease family protein [Streptococcus pneumoniae SPAR55]
 gi|168996964|gb|ACA37576.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           Hungary19A-6]
 gi|172042557|gb|EDT50603.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           CDC1873-00]
 gi|183576210|gb|EDT96738.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           CDC3059-06]
 gi|332077580|gb|EGI88041.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41301]
 gi|332203444|gb|EGJ17511.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47368]
 gi|353754300|gb|EHD34913.1| HNH endonuclease family protein [Streptococcus pneumoniae 6735-05]
 gi|353764430|gb|EHD44979.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43265]
 gi|353766285|gb|EHD46825.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44452]
 gi|353771852|gb|EHD52359.1| HNH endonuclease family protein [Streptococcus pneumoniae 6901-05]
 gi|353781535|gb|EHD61977.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41410]
 gi|353784547|gb|EHD64967.1| HNH endonuclease family protein [Streptococcus pneumoniae GA49447]
 gi|353788540|gb|EHD68937.1| HNH endonuclease family protein [Streptococcus pneumoniae 5787-06]
 gi|353826578|gb|EHE06736.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17227]
 gi|353830921|gb|EHE11050.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17328]
 gi|353836301|gb|EHE16389.1| HNH endonuclease family protein [Streptococcus pneumoniae GA19077]
 gi|353876228|gb|EHE56077.1| HNH endonuclease family protein [Streptococcus pneumoniae NP127]
 gi|353878025|gb|EHE57864.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47751]
 gi|353895002|gb|EHE74742.1| HNH endonuclease family protein [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|353903994|gb|EHE79508.1| HNH endonuclease family protein [Streptococcus pneumoniae GA11426]
 gi|379541885|gb|EHZ07050.1| HNH endonuclease family protein [Streptococcus pneumoniae GA05245]
 gi|379560754|gb|EHZ25776.1| HNH endonuclease family protein [Streptococcus pneumoniae GA14373]
 gi|379589603|gb|EHZ54442.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43264]
 gi|379590395|gb|EHZ55233.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44386]
 gi|395615274|gb|EJG75290.1| HNH endonuclease family protein [Streptococcus pneumoniae 2082239]
 gi|395877496|gb|EJG88565.1| HNH endonuclease family protein [Streptococcus pneumoniae SPAR55]
          Length = 326

 Score = 37.0 bits (84), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 23/39 (58%)

Query: 109 GCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
             L  + DHI+P SKGG ST +N Q L    NRSK N+ 
Sbjct: 287 SLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNKI 325


>gi|383774651|ref|YP_005453720.1| hypothetical protein S23_64260 [Bradyrhizobium sp. S23321]
 gi|398826008|ref|ZP_10584278.1| restriction endonuclease [Bradyrhizobium sp. YR681]
 gi|381362778|dbj|BAL79608.1| hypothetical protein S23_64260 [Bradyrhizobium sp. S23321]
 gi|398222215|gb|EJN08599.1| restriction endonuclease [Bradyrhizobium sp. YR681]
          Length = 185

 Score = 37.0 bits (84), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T       
Sbjct: 85  NVFLRDRFACQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPAQAKMF 144

Query: 155 IQKSSY 160
            ++S++
Sbjct: 145 PRQSAF 150


>gi|418077823|ref|ZP_12715047.1| HNH endonuclease family protein [Streptococcus pneumoniae 4027-06]
 gi|418116043|ref|ZP_12753020.1| HNH endonuclease family protein [Streptococcus pneumoniae 6963-05]
 gi|419439525|ref|ZP_13979582.1| HNH endonuclease family protein [Streptococcus pneumoniae GA40410]
 gi|419459153|ref|ZP_13999089.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02270]
 gi|419461430|ref|ZP_14001347.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02714]
 gi|353748845|gb|EHD29496.1| HNH endonuclease family protein [Streptococcus pneumoniae 4027-06]
 gi|353791681|gb|EHD72055.1| HNH endonuclease family protein [Streptococcus pneumoniae 6963-05]
 gi|379534025|gb|EHY99237.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02270]
 gi|379534955|gb|EHZ00162.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02714]
 gi|379582010|gb|EHZ46893.1| HNH endonuclease family protein [Streptococcus pneumoniae GA40410]
          Length = 356

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 23/39 (58%)

Query: 109 GCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
             L  + DHI+P SKGG ST +N Q L    NRSK N+ 
Sbjct: 317 SLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNKI 355


>gi|167647163|ref|YP_001684826.1| HNH endonuclease [Caulobacter sp. K31]
 gi|167349593|gb|ABZ72328.1| HNH endonuclease [Caulobacter sp. K31]
          Length = 186

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+  R    C  C   D   +DH++P S+GG++T EN     A  N  KG RT
Sbjct: 86  NLFLRDGFSCQYCGSPDDLTFDHVIPRSRGGRTTWENIVTACAPCNLHKGGRT 138


>gi|86747604|ref|YP_484100.1| HNH endonuclease [Rhodopseudomonas palustris HaA2]
 gi|86570632|gb|ABD05189.1| HNH endonuclease [Rhodopseudomonas palustris HaA2]
          Length = 185

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFSCQYCGAPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNMT 137


>gi|416376881|ref|ZP_11683511.1| HNH endonuclease, partial [Crocosphaera watsonii WH 0003]
 gi|357266329|gb|EHJ14977.1| HNH endonuclease, partial [Crocosphaera watsonii WH 0003]
          Length = 447

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 33/84 (39%), Gaps = 14/84 (16%)

Query: 61  STANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH------ 113
           +   P S PY      W K        P R  K      + +K  G CP C  H      
Sbjct: 364 TLVKPESSPYDGNWTYWSKRRGTYTGTPARVAK------LLKKQKGICPQCKQHFTPEDL 417

Query: 114 -DYDHILPYSKGGKSTLENCQVLQ 136
            + DHI+P SKGGK T  N Q L 
Sbjct: 418 IEVDHIIPKSKGGKDTYNNLQALH 441


>gi|158318135|ref|YP_001510643.1| stress protein [Frankia sp. EAN1pec]
 gi|158113540|gb|ABW15737.1| stress protein [Frankia sp. EAN1pec]
          Length = 560

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 16/42 (38%), Positives = 25/42 (59%)

Query: 105 VGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           V C      ++DH++P+S+GG S++ N Q+L    N  KG R
Sbjct: 518 VQCQATEYLEFDHVIPHSRGGASSVNNLQLLCRRCNLEKGAR 559


>gi|260881784|ref|ZP_05893557.1| HNH endonuclease domain protein [Mitsuokella multacida DSM 20544]
 gi|260847880|gb|EEX67887.1| HNH endonuclease domain protein [Mitsuokella multacida DSM 20544]
          Length = 90

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 26/46 (56%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDLIQKSS 159
           D DHILP S GG   L+N Q L    NRSK +  + +  D I+ ++
Sbjct: 34  DIDHILPQSYGGGDGLDNLQCLCKHCNRSKRDSLDHTAQDYIKNNA 79


>gi|386772121|ref|ZP_10094499.1| HNH endonuclease [Brachybacterium paraconglomeratum LC44]
          Length = 526

 Score = 37.0 bits (84), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 33/61 (54%), Gaps = 4/61 (6%)

Query: 95  ALGNVVFRKLVGCPGCLCH----DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELS 150
           A+   + R+   C G  C+     YDHI+P S+GG ++L+N Q L A  N+ +    ++ 
Sbjct: 389 AMRRFLSRRDTTCRGPFCNAQIRQYDHIVPVSRGGPTSLDNGQGLCAHCNKKEQLAAQVE 448

Query: 151 R 151
           R
Sbjct: 449 R 449


>gi|261493161|ref|ZP_05989692.1| HNH endonuclease [Mannheimia haemolytica serotype A2 str. BOVINE]
 gi|261495917|ref|ZP_05992339.1| HNH endonuclease [Mannheimia haemolytica serotype A2 str. OVINE]
 gi|261308438|gb|EEY09719.1| HNH endonuclease [Mannheimia haemolytica serotype A2 str. OVINE]
 gi|261311164|gb|EEY12336.1| HNH endonuclease [Mannheimia haemolytica serotype A2 str. BOVINE]
          Length = 387

 Score = 37.0 bits (84), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 23/31 (74%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DH+  +SKGG +  ENC++L  + NR+KGN+
Sbjct: 357 DHVSAWSKGGLTGAENCEMLCISHNRAKGNK 387


>gi|451340018|ref|ZP_21910523.1| hypothetical protein C791_7907 [Amycolatopsis azurea DSM 43854]
 gi|449417231|gb|EMD22907.1| hypothetical protein C791_7907 [Amycolatopsis azurea DSM 43854]
          Length = 63

 Score = 36.6 bits (83), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           DHI+P SKGG  TL+N +    + N S+GNR 
Sbjct: 21  DHIVPRSKGGPDTLDNARPAHRSCNSSRGNRA 52


>gi|94986243|ref|YP_605607.1| HNH endonuclease [Deinococcus geothermalis DSM 11300]
 gi|94556524|gb|ABF46438.1| HNH endonuclease [Deinococcus geothermalis DSM 11300]
          Length = 189

 Score = 36.6 bits (83), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV+ R    C  C        DH+LP S+GG+ + EN      T N+ KGNRT
Sbjct: 99  NVLRRDTFACQYCGAETDLTLDHVLPRSRGGRHSWENVVTACRTCNQRKGNRT 151


>gi|197106299|ref|YP_002131676.1| restriction endonuclease [Phenylobacterium zucineum HLK1]
 gi|196479719|gb|ACG79247.1| restriction endonuclease [Phenylobacterium zucineum HLK1]
          Length = 186

 Score = 36.6 bits (83), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 47/109 (43%), Gaps = 9/109 (8%)

Query: 44  EEEISTLFTDLR--ISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVF 101
           +E I  +F D    +S+ D   +  SF   +      K    + R P   R     N+  
Sbjct: 34  QEVIKAVFLDRVDVVSTYDQVIHSPSFEMKLPSVVSLKHYVAQDRPPAFTR----FNLFL 89

Query: 102 RKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           R    C  C   +   +DH++P S+GG++T EN     A  N +KG RT
Sbjct: 90  RDSFTCQYCTAGEELTFDHVIPRSRGGRTTWENIVTACARCNLTKGGRT 138


>gi|171914460|ref|ZP_02929930.1| HNH endonuclease [Verrucomicrobium spinosum DSM 4136]
          Length = 193

 Score = 36.6 bits (83), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           + DHI+P S+GGK++ +NC +    +N  K +RT
Sbjct: 122 NIDHIMPRSRGGKTSWDNCVLADKRINSRKADRT 155


>gi|295840114|ref|ZP_06827047.1| HNH endonuclease domain-containing protein [Streptomyces sp. SPB74]
 gi|197696993|gb|EDY43926.1| HNH endonuclease domain-containing protein [Streptomyces sp. SPB74]
          Length = 211

 Score = 36.6 bits (83), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 26/51 (50%)

Query: 97  GNVVFRKLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           G+ V     G P   C + DH+ P S+GG   + N QVL  + N SKG R 
Sbjct: 161 GDPVPAAPTGRPLARCLELDHVHPRSRGGSDDVSNLQVLCTSCNTSKGARI 211


>gi|384215175|ref|YP_005606341.1| hypothetical protein BJ6T_14660 [Bradyrhizobium japonicum USDA 6]
 gi|354954074|dbj|BAL06753.1| hypothetical protein BJ6T_14660 [Bradyrhizobium japonicum USDA 6]
          Length = 185

 Score = 36.6 bits (83), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T       
Sbjct: 85  NVFLRDRFNCQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPAQAKMF 144

Query: 155 IQKSSY 160
            ++S++
Sbjct: 145 PRQSAF 150


>gi|67924437|ref|ZP_00517863.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
           endonuclease [Crocosphaera watsonii WH 8501]
 gi|67853726|gb|EAM49059.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
           endonuclease [Crocosphaera watsonii WH 8501]
          Length = 596

 Score = 36.6 bits (83), Expect = 5.0,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 32/80 (40%), Gaps = 14/80 (17%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH-------DYD 116
           P + PY      W K        P R  K      + +K  G CP C  +       + D
Sbjct: 488 PEASPYDGNWTYWSKRRGTYSGTPTRVAK------LLKKQKGICPQCKQYFTPEDLIEVD 541

Query: 117 HILPYSKGGKSTLENCQVLQ 136
           HI+P SKGGK T  N Q L 
Sbjct: 542 HIIPKSKGGKDTYNNLQALH 561


>gi|392383290|ref|YP_005032487.1| putative HNH endonuclease [Azospirillum brasilense Sp245]
 gi|356878255|emb|CCC99127.1| putative HNH endonuclease [Azospirillum brasilense Sp245]
          Length = 187

 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 34/74 (45%), Gaps = 11/74 (14%)

Query: 98  NVVFRKLVGCPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE--- 148
           NV  R    C  C  H       +DH++P S+GG++T +N     +  N +KGNR     
Sbjct: 83  NVFLRDRFTCQYCGHHFPTQELTFDHVIPRSRGGRTTWDNVVTSCSACNLAKGNRLPHSC 142

Query: 149 --LSRSDLIQKSSY 160
             +  S   Q S+Y
Sbjct: 143 GMIPLSPPFQPSAY 156


>gi|67924762|ref|ZP_00518164.1| HNH endonuclease [Crocosphaera watsonii WH 8501]
 gi|67853387|gb|EAM48744.1| HNH endonuclease [Crocosphaera watsonii WH 8501]
          Length = 183

 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 13/103 (12%)

Query: 41  LATEEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVV 100
             T E +    T+  I    +   P S PY      W K        P R     +  ++
Sbjct: 69  FQTREYVLYHHTETEIKR-HTLVKPESSPYDGNWTYWSKRRGTYTGTPAR-----VAKLL 122

Query: 101 FRKLVGCPGCLCH-------DYDHILPYSKGGKSTLENCQVLQ 136
            ++   CP C  H       + DHI+P SKGGK T  N Q L 
Sbjct: 123 KKQKGICPQCKQHFTPEDLIEVDHIIPKSKGGKDTYNNLQALH 165


>gi|374578280|ref|ZP_09651376.1| restriction endonuclease [Bradyrhizobium sp. WSM471]
 gi|386398693|ref|ZP_10083471.1| restriction endonuclease [Bradyrhizobium sp. WSM1253]
 gi|374426601|gb|EHR06134.1| restriction endonuclease [Bradyrhizobium sp. WSM471]
 gi|385739319|gb|EIG59515.1| restriction endonuclease [Bradyrhizobium sp. WSM1253]
          Length = 185

 Score = 36.6 bits (83), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T +
Sbjct: 85  NVFLRDRFACQYCGSPEDLTFDHIIPRSKGGQTTWENVVAACSPCNLRKGNLTPV 139


>gi|452964360|gb|EME69402.1| HNH endonuclease family protein [Magnetospirillum sp. SO-1]
          Length = 186

 Score = 36.6 bits (83), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 6/57 (10%)

Query: 98  NVVFRKLVGCPGC----LCHD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           NV  R    C  C      HD  +DH+LP S+GG++T +N     A  N  KG+R  
Sbjct: 82  NVFLRDRFSCQYCGEWFPTHDLTFDHVLPRSRGGRTTWDNVVTACAPCNLRKGDRLH 138


>gi|15807725|ref|NP_285380.1| TerF-like protein [Deinococcus radiodurans R1]
 gi|6460551|gb|AAF12257.1|AE001862_83 TerF-related protein [Deinococcus radiodurans R1]
          Length = 579

 Score = 36.6 bits (83), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 20/82 (24%)

Query: 66  RSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVGCPGCLCHDYDHILPYSKGG 125
           RS P  VK + W++ +                     + V C      ++DH++P+S GG
Sbjct: 518 RSIPQKVKLEVWQRDQG--------------------QCVECGDTNYLEFDHVIPHSLGG 557

Query: 126 KSTLENCQVLQATVNRSKGNRT 147
            S++ N Q+L    N +K NR 
Sbjct: 558 ASSVGNLQLLCRRCNLAKSNRI 579


>gi|67923004|ref|ZP_00516498.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
           endonuclease [Crocosphaera watsonii WH 8501]
 gi|67855152|gb|EAM50417.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
           endonuclease [Crocosphaera watsonii WH 8501]
          Length = 596

 Score = 36.6 bits (83), Expect = 5.5,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 32/80 (40%), Gaps = 14/80 (17%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH-------DYD 116
           P + PY      W K        P R  K      + +K  G CP C  +       + D
Sbjct: 488 PEASPYDGNWTYWSKRRGTYSGTPTRVAK------LLKKQKGICPQCKQYFTPEDLIEVD 541

Query: 117 HILPYSKGGKSTLENCQVLQ 136
           HI+P SKGGK T  N Q L 
Sbjct: 542 HIIPKSKGGKDTYNNLQALH 561


>gi|431809878|ref|YP_007236767.1| HNH endonuclease [Burkholderia phage BcepMigl]
 gi|394777723|gb|AFN39090.1| HNH endonuclease [Burkholderia phage BcepMigl]
          Length = 147

 Score = 36.6 bits (83), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 35/75 (46%), Gaps = 11/75 (14%)

Query: 80  AEKVRGRDPDRWRKDALGNVVFRKLVGCPGC-----LCHDYDHILPYSKGGKSTLENCQV 134
           A  ++ R PD  R+       FR    C  C     LC   DHI+P S GG +TLEN Q 
Sbjct: 79  AAYLKTRIPDELRQAVFERDGFR----CVECGSNKRLC--ADHIIPESAGGPTTLENLQT 132

Query: 135 LQATVNRSKGNRTEL 149
           +    N  KG++  +
Sbjct: 133 MCRPCNSEKGDKAPV 147


>gi|326386057|ref|ZP_08207681.1| HNH endonuclease [Novosphingobium nitrogenifigens DSM 19370]
 gi|326209282|gb|EGD60075.1| HNH endonuclease [Novosphingobium nitrogenifigens DSM 19370]
          Length = 227

 Score = 36.6 bits (83), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 26/53 (49%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+  R    C  C  H+   +DH++P   GG++T EN     A  N  KG RT
Sbjct: 127 NLFLRDRFMCQYCGSHEHLTFDHVVPRRLGGRTTWENVSTACAPCNMRKGGRT 179


>gi|219847405|ref|YP_002461838.1| HNH endonuclease [Chloroflexus aggregans DSM 9485]
 gi|219541664|gb|ACL23402.1| HNH endonuclease [Chloroflexus aggregans DSM 9485]
          Length = 206

 Score = 36.6 bits (83), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 23/35 (65%), Gaps = 1/35 (2%)

Query: 115 YDHILPYSKGGKST-LENCQVLQATVNRSKGNRTE 148
           YDHI+PYSKGG S   EN Q+L A  N  K ++ E
Sbjct: 172 YDHIIPYSKGGTSLKAENIQLLCARHNLEKRDKIE 206


>gi|67923370|ref|ZP_00516851.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
           endonuclease [Crocosphaera watsonii WH 8501]
 gi|67854795|gb|EAM50073.1| RNA-directed DNA polymerase (Reverse transcriptase):HNH
           endonuclease [Crocosphaera watsonii WH 8501]
          Length = 596

 Score = 36.6 bits (83), Expect = 5.8,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 32/80 (40%), Gaps = 14/80 (17%)

Query: 65  PRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRKLVG-CPGCLCH-------DYD 116
           P + PY      W K        P R  K      + +K  G CP C  +       + D
Sbjct: 488 PEASPYDGNWTYWSKRRGTYSGTPTRVAK------LLKKQKGICPQCKQYFTPEDLIEVD 541

Query: 117 HILPYSKGGKSTLENCQVLQ 136
           HI+P SKGGK T  N Q L 
Sbjct: 542 HIIPKSKGGKDTYNNLQALH 561


>gi|452751447|ref|ZP_21951193.1| HNH endonuclease family protein [alpha proteobacterium JLT2015]
 gi|451961597|gb|EMD84007.1| HNH endonuclease family protein [alpha proteobacterium JLT2015]
          Length = 214

 Score = 36.2 bits (82), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           N+  R    C  C   D   +DH++P + GG++T EN     A  N +KG RT
Sbjct: 114 NLFLRDRFSCQYCGRTDDLTFDHVIPRAYGGRTTWENVATACAPCNLAKGGRT 166


>gi|384260910|ref|YP_005416096.1| HNH endonuclease [Rhodospirillum photometricum DSM 122]
 gi|378402010|emb|CCG07126.1| HNH endonuclease [Rhodospirillum photometricum DSM 122]
          Length = 218

 Score = 36.2 bits (82), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 6/56 (10%)

Query: 98  NVVFRKLVGCPGC----LCHD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C      HD  +DH++P SKGG++T +N        N  K NRT
Sbjct: 115 NVFLRDGFTCQYCGTRLPAHDLTFDHVVPRSKGGRTTWDNVVAACGPCNMRKANRT 170


>gi|92118950|ref|YP_578679.1| HNH endonuclease [Nitrobacter hamburgensis X14]
 gi|91801844|gb|ABE64219.1| HNH endonuclease [Nitrobacter hamburgensis X14]
          Length = 185

 Score = 36.2 bits (82), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   +   +DHI+P SKGG++T +N     +  N  KGN T
Sbjct: 85  NVFLRDRFMCQYCTASEDLTFDHIIPRSKGGQTTWDNVVAACSPCNLRKGNLT 137


>gi|390956260|ref|YP_006420017.1| restriction endonuclease [Terriglobus roseus DSM 18391]
 gi|390411178|gb|AFL86682.1| restriction endonuclease [Terriglobus roseus DSM 18391]
          Length = 205

 Score = 36.2 bits (82), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 33/75 (44%), Gaps = 6/75 (8%)

Query: 86  RDPDRWRKDALGNVVFRKLVGCPGCLC------HDYDHILPYSKGGKSTLENCQVLQATV 139
           R P + R  +  N++ R    C  C C         DH+LP S+GG ST EN        
Sbjct: 95  RIPHQTRALSRKNILLRDRNTCQYCGCILTTGDLTLDHVLPRSRGGLSTWENLVACCHDC 154

Query: 140 NRSKGNRTELSRSDL 154
           NR KGN+      D+
Sbjct: 155 NRRKGNQLLHELQDM 169


>gi|410867562|ref|YP_006982173.1| HNH endonuclease domain-containing protein [Propionibacterium
           acidipropionici ATCC 4875]
 gi|410824203|gb|AFV90818.1| HNH endonuclease domain-containing protein [Propionibacterium
           acidipropionici ATCC 4875]
          Length = 330

 Score = 36.2 bits (82), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 23/36 (63%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           L  + DHI+P S+GG ST +N Q L    NRSK N+
Sbjct: 291 LLLEVDHIVPVSRGGLSTEDNLQTLCWRCNRSKSNK 326


>gi|424865305|ref|ZP_18289174.1| diadenosine tetraphosphate [SAR86 cluster bacterium SAR86B]
 gi|400758911|gb|EJP73111.1| diadenosine tetraphosphate [SAR86 cluster bacterium SAR86B]
          Length = 325

 Score = 36.2 bits (82), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 23/41 (56%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           D DHILP SKGG + + N Q L  T NR K N  +    D+
Sbjct: 155 DVDHILPRSKGGSNDISNLQSLCYTCNRQKRNLDDTDLRDM 195


>gi|340776977|ref|ZP_08696920.1| HNH endonuclease [Acetobacter aceti NBRC 14818]
          Length = 186

 Score = 36.2 bits (82), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 11/74 (14%)

Query: 98  NVVFRKLVGCPGC----LCHD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTE--- 148
           N+  R    C  C      HD  +DH++P S+GG++T EN     +T N  KG+R     
Sbjct: 83  NLFLRDNFSCQYCNDQLPTHDLTFDHVIPRSRGGRTTWENIVTACSTCNLLKGSRMPHEI 142

Query: 149 --LSRSDLIQKSSY 160
               R   +Q SS+
Sbjct: 143 RMFPRRQPVQPSSW 156


>gi|337285718|ref|YP_004625191.1| HNH endonuclease [Thermodesulfatator indicus DSM 15286]
 gi|335358546|gb|AEH44227.1| HNH endonuclease [Thermodesulfatator indicus DSM 15286]
          Length = 203

 Score = 36.2 bits (82), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 21/32 (65%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           DHI+P S+GGK+T EN  +     N  KGNRT
Sbjct: 127 DHIIPRSRGGKTTWENVVLCCRKCNIKKGNRT 158


>gi|434405466|ref|YP_007148351.1| restriction endonuclease [Cylindrospermum stagnale PCC 7417]
 gi|428259721|gb|AFZ25671.1| restriction endonuclease [Cylindrospermum stagnale PCC 7417]
          Length = 81

 Score = 36.2 bits (82), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNRTE 148
           DHI+P S+GGK+ + N Q L  T N+ K N+ +
Sbjct: 40  DHIIPLSRGGKNDISNLQTLCLTCNQQKTNKID 72


>gi|86605206|ref|YP_473969.1| HNH endonuclease domain-containing protein [Synechococcus sp.
           JA-3-3Ab]
 gi|86553748|gb|ABC98706.1| HNH endonuclease domain protein [Synechococcus sp. JA-3-3Ab]
          Length = 178

 Score = 36.2 bits (82), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 4/54 (7%)

Query: 98  NVVFRKLVGCPGCLCHD----YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV++R    C  C C+      DH++P S+GG+ T EN        N  KGNRT
Sbjct: 70  NVLYRDGHRCQYCGCYGEDLTLDHVIPRSRGGEDTWENVVSACVRCNVKKGNRT 123


>gi|381159335|ref|ZP_09868567.1| restriction endonuclease [Thiorhodovibrio sp. 970]
 gi|380877399|gb|EIC19491.1| restriction endonuclease [Thiorhodovibrio sp. 970]
          Length = 361

 Score = 36.2 bits (82), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI P+ +GGK++ ENCQ+L    NR K  +
Sbjct: 331 DHITPWHEGGKTSAENCQMLCKEDNRRKSGK 361


>gi|379059551|ref|ZP_09850077.1| hypothetical protein SproM1_15864 [Serinicoccus profundi MCCC
           1A05965]
          Length = 384

 Score = 36.2 bits (82), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 114 DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           D DH+  +SKGG + L N  +L  T NR+KGN+
Sbjct: 352 DADHVTAWSKGGATDLSNLTMLCQTHNRAKGNK 384


>gi|218682266|ref|ZP_03529867.1| HNH endonuclease [Rhizobium etli CIAT 894]
          Length = 228

 Score = 35.8 bits (81), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P + GG++T EN     +  N  KG++
Sbjct: 85  NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136


>gi|85714890|ref|ZP_01045876.1| HNH endonuclease [Nitrobacter sp. Nb-311A]
 gi|85698376|gb|EAQ36247.1| HNH endonuclease [Nitrobacter sp. Nb-311A]
          Length = 206

 Score = 35.8 bits (81), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   +   +DHI+P SKGG++T +N     +  N  KGN T
Sbjct: 106 NVFLRDRFMCQYCTAVEDLTFDHIIPRSKGGQTTWDNVVAACSPCNLRKGNLT 158


>gi|115522824|ref|YP_779735.1| HNH endonuclease [Rhodopseudomonas palustris BisA53]
 gi|115516771|gb|ABJ04755.1| HNH endonuclease [Rhodopseudomonas palustris BisA53]
          Length = 185

 Score = 35.8 bits (81), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           NV  R    C  C   +   +DHI+P SKGG++T EN     +  N  KGN T
Sbjct: 85  NVFLRDRFVCQYCGAPEDLTFDHIMPRSKGGQTTWENVVAACSPCNLRKGNLT 137


>gi|406998393|gb|EKE16327.1| hypothetical protein ACD_11C00020G0032 [uncultured bacterium]
          Length = 367

 Score = 35.8 bits (81), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 18/109 (16%)

Query: 44  EEEISTLFTDLRISSADSTANPRSFPYSVKQQCWEKAEKVRGRDPDRWRKDALGNVVFRK 103
           E+EI  L  D  ++          +PY + +Q  E+   +R  + ++ +++A      R+
Sbjct: 271 EDEIKELMQDEEVTKPSGI-----YPYVLTRQ--ERYLNIRAFN-EKMKREAYE----RQ 318

Query: 104 LVGCPGCLCH------DYDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
              C  C+ H      + DHI P+ +GGK+   NCQ+L    NR K  +
Sbjct: 319 KGKCVKCMKHFEIGEMEADHIKPWHEGGKTIANNCQILCKDDNRRKSGK 367


>gi|296121467|ref|YP_003629245.1| HNH endonuclease [Planctomyces limnophilus DSM 3776]
 gi|296013807|gb|ADG67046.1| HNH endonuclease [Planctomyces limnophilus DSM 3776]
          Length = 198

 Score = 35.8 bits (81), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 21/40 (52%)

Query: 108 PGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           PG      DH+ P S+GG ST ENC +     N  K NRT
Sbjct: 112 PGSEELTIDHVQPRSRGGTSTWENCVLACIACNNKKANRT 151


>gi|347449376|gb|AEO93417.1| gp157 [Bacillus phage G]
          Length = 138

 Score = 35.8 bits (81), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 116 DHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           DHI+P SKGGK+ + N Q +    N  KGN+
Sbjct: 106 DHIVPVSKGGKNHISNYQTMCIICNEKKGNQ 136


>gi|405382583|ref|ZP_11036364.1| restriction endonuclease [Rhizobium sp. CF142]
 gi|397320989|gb|EJJ25416.1| restriction endonuclease [Rhizobium sp. CF142]
          Length = 185

 Score = 35.8 bits (81), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P + GG++T EN     +  N  KG++
Sbjct: 85  NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136


>gi|330832281|ref|YP_004401106.1| HNH endonuclease [Streptococcus suis ST3]
 gi|329306504|gb|AEB80920.1| HNH endonuclease [Streptococcus suis ST3]
          Length = 115

 Score = 35.8 bits (81), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 2/63 (3%)

Query: 97  GNVVFRKLVGCPGCLCHDYDHILPYSKGGK-STLENCQVLQATVNRSKGNRTELSRSDLI 155
           G +V +KL   P  +C   DH++P +KGG  S +EN Q+   + NR K ++     S   
Sbjct: 31  GQLVDKKL-KYPNPMCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSKAE 89

Query: 156 QKS 158
           QK+
Sbjct: 90  QKT 92


>gi|402489973|ref|ZP_10836766.1| HNH endonuclease [Rhizobium sp. CCGE 510]
 gi|401811312|gb|EJT03681.1| HNH endonuclease [Rhizobium sp. CCGE 510]
          Length = 185

 Score = 35.8 bits (81), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P + GG++T EN     +  N  KG++
Sbjct: 85  NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136


>gi|356560967|ref|XP_003548757.1| PREDICTED: uncharacterized protein LOC100810482 [Glycine max]
          Length = 282

 Score = 35.8 bits (81), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 8/80 (10%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT-ELSRSD 153
           N++FR    C  C  H+    DH++P + GG+ T EN     A  N  KG +T E ++  
Sbjct: 180 NILFRDNYTCQYCSSHENLTIDHVMPTALGGEWTWENLVTACAKCNCKKGRKTLEEAKMK 239

Query: 154 LIQKSSYCRVSGRDMDLLEL 173
           LI+         +D D+L +
Sbjct: 240 LIKPPK----VPKDYDILAI 255


>gi|21673556|ref|NP_661621.1| HNH endonuclease [Chlorobium tepidum TLS]
 gi|21646667|gb|AAM71963.1| HNH endonuclease family protein [Chlorobium tepidum TLS]
          Length = 170

 Score = 35.8 bits (81), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 24/45 (53%), Gaps = 4/45 (8%)

Query: 107 CPGCLCHD----YDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           C  C C D     DH++P S+GG+ T EN      + N  KGNRT
Sbjct: 87  CQYCGCKDGSLTVDHVMPKSRGGEDTWENLITACKSCNTKKGNRT 131


>gi|209883894|ref|YP_002287751.1| HNH endonuclease [Oligotropha carboxidovorans OM5]
 gi|337742396|ref|YP_004634124.1| HNH endonuclease domain-containing protein [Oligotropha
           carboxidovorans OM5]
 gi|386031361|ref|YP_005952136.1| HNH endonuclease domain protein [Oligotropha carboxidovorans OM4]
 gi|209872090|gb|ACI91886.1| HNH endonuclease [Oligotropha carboxidovorans OM5]
 gi|336096427|gb|AEI04253.1| HNH endonuclease domain protein [Oligotropha carboxidovorans OM4]
 gi|336100060|gb|AEI07883.1| HNH endonuclease domain protein [Oligotropha carboxidovorans OM5]
          Length = 185

 Score = 35.8 bits (81), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTELSRSDL 154
           NV  R    C  C   +   +DHI+P S+GG++T EN     +  N  KG+ T    +  
Sbjct: 85  NVFLRDRFSCQYCTSQEDLTFDHIIPRSRGGQTTWENVVAACSPCNLKKGSMTMAQANMF 144

Query: 155 IQKSSY 160
            +++ Y
Sbjct: 145 PRQTPY 150


>gi|84685670|ref|ZP_01013567.1| hypothetical protein 1099457000253_RB2654_06854 [Maritimibacter
           alkaliphilus HTCC2654]
 gi|84666336|gb|EAQ12809.1| hypothetical protein RB2654_06854 [Rhodobacterales bacterium
           HTCC2654]
          Length = 1126

 Score = 35.8 bits (81), Expect = 8.9,   Method: Composition-based stats.
 Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 3/45 (6%)

Query: 103 KLVGCPGCLCHDYDHILPYSKGGKSTLENCQVLQATVNRSKGNRT 147
           K +G  GC   D DHI+    GG S   N Q+L AT NR  G+ T
Sbjct: 312 KGLGKSGC---DIDHIVEKQMGGTSVPSNLQILDATKNRESGSNT 353


>gi|296445443|ref|ZP_06887400.1| HNH endonuclease [Methylosinus trichosporium OB3b]
 gi|296257009|gb|EFH04079.1| HNH endonuclease [Methylosinus trichosporium OB3b]
          Length = 190

 Score = 35.8 bits (81), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 27/55 (49%), Gaps = 3/55 (5%)

Query: 98  NVVFRKLVGCPGCLC-HD--YDHILPYSKGGKSTLENCQVLQATVNRSKGNRTEL 149
           NV  R    C  C   HD  +DH++P SKGG +T EN     +  N  KG+R   
Sbjct: 90  NVFLRDRFTCQYCGAQHDLTFDHVIPRSKGGATTWENVVAACSPCNLRKGDRLPF 144


>gi|241206532|ref|YP_002977628.1| HNH endonuclease [Rhizobium leguminosarum bv. trifolii WSM1325]
 gi|240860422|gb|ACS58089.1| HNH endonuclease [Rhizobium leguminosarum bv. trifolii WSM1325]
          Length = 185

 Score = 35.8 bits (81), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P + GG++T EN     +  N  KG++
Sbjct: 85  NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136


>gi|209551129|ref|YP_002283046.1| HNH endonuclease [Rhizobium leguminosarum bv. trifolii WSM2304]
 gi|424886574|ref|ZP_18310182.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
           WSM2012]
 gi|424897225|ref|ZP_18320799.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
           WSM2297]
 gi|424916606|ref|ZP_18339970.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
           WSM597]
 gi|209536885|gb|ACI56820.1| HNH endonuclease [Rhizobium leguminosarum bv. trifolii WSM2304]
 gi|392852782|gb|EJB05303.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
           WSM597]
 gi|393175925|gb|EJC75967.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
           WSM2012]
 gi|393181452|gb|EJC81491.1| restriction endonuclease [Rhizobium leguminosarum bv. trifolii
           WSM2297]
          Length = 185

 Score = 35.8 bits (81), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P + GG++T EN     +  N  KG++
Sbjct: 85  NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136


>gi|190893620|ref|YP_001980162.1| endonuclease [Rhizobium etli CIAT 652]
 gi|218515988|ref|ZP_03512828.1| putative endonuclease protein [Rhizobium etli 8C-3]
 gi|417097164|ref|ZP_11959076.1| putative endonuclease protein [Rhizobium etli CNPAF512]
 gi|421589410|ref|ZP_16034555.1| endonuclease [Rhizobium sp. Pop5]
 gi|190698899|gb|ACE92984.1| putative endonuclease protein [Rhizobium etli CIAT 652]
 gi|327193381|gb|EGE60281.1| putative endonuclease protein [Rhizobium etli CNPAF512]
 gi|403705646|gb|EJZ21183.1| endonuclease [Rhizobium sp. Pop5]
          Length = 185

 Score = 35.8 bits (81), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P + GG++T EN     +  N  KG++
Sbjct: 85  NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136


>gi|424872552|ref|ZP_18296214.1| restriction endonuclease [Rhizobium leguminosarum bv. viciae
           WSM1455]
 gi|393168253|gb|EJC68300.1| restriction endonuclease [Rhizobium leguminosarum bv. viciae
           WSM1455]
          Length = 185

 Score = 35.8 bits (81), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 98  NVVFRKLVGCPGCLCHD---YDHILPYSKGGKSTLENCQVLQATVNRSKGNR 146
           NV  R    C  C  HD   +DH++P + GG++T EN     +  N  KG++
Sbjct: 85  NVFLRDRFECQYCGAHDDLTFDHVIPRAHGGETTWENVVAACSPCNLRKGSK 136


>gi|118443767|ref|YP_877830.1| HNH endonuclease domain-containing protein [Clostridium novyi NT]
 gi|118134223|gb|ABK61267.1| HNH endonuclease domain protein [Clostridium novyi NT]
          Length = 299

 Score = 35.8 bits (81), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 21/34 (61%)

Query: 111 LCHDYDHILPYSKGGKSTLENCQVLQATVNRSKG 144
           L  + DHI+P SKGG ST +N Q L    NR KG
Sbjct: 262 LLLEIDHIIPISKGGMSTEKNLQTLCWKCNRKKG 295


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.130    0.388 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,915,154,870
Number of Sequences: 23463169
Number of extensions: 110196010
Number of successful extensions: 327660
Number of sequences better than 100.0: 393
Number of HSP's better than 100.0 without gapping: 243
Number of HSP's successfully gapped in prelim test: 150
Number of HSP's that attempted gapping in prelim test: 327321
Number of HSP's gapped (non-prelim): 435
length of query: 191
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 57
effective length of database: 9,215,130,721
effective search space: 525262451097
effective search space used: 525262451097
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 72 (32.3 bits)