BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029699
         (189 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359491684|ref|XP_003634305.1| PREDICTED: uncharacterized protein LOC100852985 [Vitis vinifera]
 gi|297733935|emb|CBI15182.3| unnamed protein product [Vitis vinifera]
          Length = 167

 Score =  281 bits (718), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 133/170 (78%), Positives = 148/170 (87%), Gaps = 8/170 (4%)

Query: 6   SSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
           SSSSP+RSR        EERPR+FD KAKS+CWAKA+ VPGRHP+RWRKDAAGNIVCKRF
Sbjct: 2   SSSSPHRSRD------KEERPRFFDPKAKSLCWAKADAVPGRHPDRWRKDAAGNIVCKRF 55

Query: 66  CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
           CNC GCLCFEYDHI+PFSKGGES  +NCQILQTRVNRFKSDK+ VD T+L+G+SCDVKFT
Sbjct: 56  CNCQGCLCFEYDHIIPFSKGGESVPENCQILQTRVNRFKSDKDNVDKTQLQGYSCDVKFT 115

Query: 126 DKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVN 175
           DKELDIIEMAVYGDVIRPGNQCRC+TVAEMLGQ+K KDR  +A C +  N
Sbjct: 116 DKELDIIEMAVYGDVIRPGNQCRCRTVAEMLGQYKPKDR--MAACKLPFN 163


>gi|449440818|ref|XP_004138181.1| PREDICTED: uncharacterized protein LOC101204599 [Cucumis sativus]
 gi|449477195|ref|XP_004154957.1| PREDICTED: uncharacterized protein LOC101224154 [Cucumis sativus]
          Length = 181

 Score =  275 bits (704), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 126/160 (78%), Positives = 144/160 (90%), Gaps = 2/160 (1%)

Query: 17  NSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEY 76
           + + S +ERPR+FD KAK+ CWAKA+ VPGRHPERWRKDAAGN+VCKRFCNC GCLCFEY
Sbjct: 20  SGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEY 79

Query: 77  DHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAV 136
           DHIVP+SKGGESTADNCQILQTRVNRFKS+K+ VD ++LKG+SCDVKFTDKELDIIEMAV
Sbjct: 80  DHIVPYSKGGESTADNCQILQTRVNRFKSNKDDVDTSELKGYSCDVKFTDKELDIIEMAV 139

Query: 137 YGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVNH 176
           YGDVIRPGNQCRC+TV+E+LGQ+KSKDR  LA C +  N 
Sbjct: 140 YGDVIRPGNQCRCRTVSEVLGQYKSKDR--LAPCKLPYNE 177


>gi|255547311|ref|XP_002514713.1| endonuclease, putative [Ricinus communis]
 gi|223546317|gb|EEF47819.1| endonuclease, putative [Ricinus communis]
          Length = 173

 Score =  271 bits (692), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 128/167 (76%), Positives = 145/167 (86%), Gaps = 3/167 (1%)

Query: 7   SSSPNRSRS-SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
           SSSP R+RS SNS +   ERPR+F  KAK++CWA AETVPGRHPERWRKDAAGNIVCKR 
Sbjct: 2   SSSPQRARSESNSGDGAGERPRFFSNKAKNICWANAETVPGRHPERWRKDAAGNIVCKRL 61

Query: 66  CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
            NC GCLCF+YDHI+PFSKGGESTA NCQILQTRVN +K+D E VD ++LKG+SCD+KFT
Sbjct: 62  GNCQGCLCFQYDHIIPFSKGGESTAANCQILQTRVNNYKADAEAVDKSQLKGYSCDLKFT 121

Query: 126 DKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           DKELDIIEMAVYGDVIRPGNQCRC+TVAE+LGQ KSKDR  +A C +
Sbjct: 122 DKELDIIEMAVYGDVIRPGNQCRCRTVAEILGQTKSKDR--VAACKL 166


>gi|224119042|ref|XP_002317971.1| predicted protein [Populus trichocarpa]
 gi|222858644|gb|EEE96191.1| predicted protein [Populus trichocarpa]
          Length = 174

 Score =  268 bits (684), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 127/170 (74%), Positives = 144/170 (84%), Gaps = 5/170 (2%)

Query: 6   SSSSPNRSRSSNSNNS---NEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
           S SSP R R  NSN      ++RPR+FD   K MCW  AE VPGRHPERWRKDAAGNIVC
Sbjct: 1   SPSSPQRYRGGNSNGGGGGEDQRPRFFDKMVKKMCWQNAEIVPGRHPERWRKDAAGNIVC 60

Query: 63  KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDV 122
           KRFCNCHGCLC+EYDHI+PFSKGGEST DNCQILQTRVNRFKS+K+++D T+LKG+SC++
Sbjct: 61  KRFCNCHGCLCYEYDHILPFSKGGESTVDNCQILQTRVNRFKSNKQELDKTQLKGYSCEI 120

Query: 123 KFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
            FTDKELDIIEMAVYGDVIRPGNQCRC+T+AEMLGQ KSKDR  +A C +
Sbjct: 121 NFTDKELDIIEMAVYGDVIRPGNQCRCRTIAEMLGQLKSKDR--VAACKL 168


>gi|118481005|gb|ABK92456.1| unknown [Populus trichocarpa]
          Length = 180

 Score =  264 bits (675), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 118/150 (78%), Positives = 135/150 (90%), Gaps = 2/150 (1%)

Query: 23  EERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPF 82
           ++RPR+FD   K MCW  AE VPGRHPERWRKDAAGNIVCKRFCNCHGCLC+EYDHI+PF
Sbjct: 26  DQRPRFFDKMVKKMCWQNAEIVPGRHPERWRKDAAGNIVCKRFCNCHGCLCYEYDHILPF 85

Query: 83  SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           SKGGEST DNCQILQTRVNRFKS+K+++D T+LKG+SC++ FTDKELDIIEMAVYGDVIR
Sbjct: 86  SKGGESTVDNCQILQTRVNRFKSNKQELDKTQLKGYSCEINFTDKELDIIEMAVYGDVIR 145

Query: 143 PGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           PGNQCRC+T+AEMLGQ KSKDR  +A C +
Sbjct: 146 PGNQCRCRTIAEMLGQLKSKDR--VAACKL 173


>gi|297850254|ref|XP_002893008.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338850|gb|EFH69267.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 179

 Score =  258 bits (658), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 118/147 (80%), Positives = 133/147 (90%)

Query: 17  NSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEY 76
           N  ++ EERPR+FDGKAK+ CWA A+TVPGRHPERWRKDAAGNIVCKRF NC+GCLCFEY
Sbjct: 24  NVKSNGEERPRFFDGKAKNKCWANADTVPGRHPERWRKDAAGNIVCKRFGNCNGCLCFEY 83

Query: 77  DHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAV 136
           DHIVP+SKGGES A+NCQILQTRVNRFKS +E VDAT LK +SC ++FTDKELDIIEMAV
Sbjct: 84  DHIVPYSKGGESIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDIIEMAV 143

Query: 137 YGDVIRPGNQCRCKTVAEMLGQFKSKD 163
           YGDV+RPG +CRCKTVAE+LGQ KSKD
Sbjct: 144 YGDVLRPGKECRCKTVAELLGQSKSKD 170


>gi|15221835|ref|NP_173303.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana]
 gi|6730716|gb|AAF27111.1|AC011809_20 Unknown protein [Arabidopsis thaliana]
 gi|26451201|dbj|BAC42704.1| unknown protein [Arabidopsis thaliana]
 gi|28973585|gb|AAO64117.1| unknown protein [Arabidopsis thaliana]
 gi|332191624|gb|AEE29745.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana]
          Length = 186

 Score =  257 bits (656), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 116/148 (78%), Positives = 132/148 (89%)

Query: 16  SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
           SN  ++ EERPR+FDGKAK+ CWA A+ VPGRHPERWRKD AGNIVCKRF NC+GCLCFE
Sbjct: 30  SNVKSNGEERPRFFDGKAKNKCWANADIVPGRHPERWRKDVAGNIVCKRFGNCNGCLCFE 89

Query: 76  YDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMA 135
           YDHIVP+SKGGES A+NCQILQTRVNRFKS +E VDAT LK +SC ++FTDKELD+IEMA
Sbjct: 90  YDHIVPYSKGGESIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDVIEMA 149

Query: 136 VYGDVIRPGNQCRCKTVAEMLGQFKSKD 163
           VYGDV+RPG +CRCKTVAE+LGQ KSKD
Sbjct: 150 VYGDVLRPGKECRCKTVAELLGQSKSKD 177


>gi|351722082|ref|NP_001235440.1| uncharacterized protein LOC100527904 [Glycine max]
 gi|255633516|gb|ACU17116.1| unknown [Glycine max]
          Length = 162

 Score =  254 bits (648), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 117/158 (74%), Positives = 136/158 (86%), Gaps = 5/158 (3%)

Query: 6   SSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
           SSSSP RSR        E+RPR+FD  AK++CW+KA+TVPGRHPERWRKDAAGN+VCKRF
Sbjct: 2   SSSSPQRSRGDG-----EKRPRFFDSNAKAICWSKADTVPGRHPERWRKDAAGNVVCKRF 56

Query: 66  CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
            NC GCLC+EYDHI+PFSKGGESTADNCQILQ+RVNR KSDK  +D+ +LK +SC+V FT
Sbjct: 57  FNCIGCLCYEYDHIIPFSKGGESTADNCQILQSRVNRLKSDKYNIDSDQLKDYSCEVNFT 116

Query: 126 DKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKD 163
           DKELDIIEMAVY DV+RPGNQCRC+T+AE LG+FK KD
Sbjct: 117 DKELDIIEMAVYSDVMRPGNQCRCRTIAEKLGKFKPKD 154


>gi|218184321|gb|EEC66748.1| hypothetical protein OsI_33113 [Oryza sativa Indica Group]
          Length = 178

 Score =  244 bits (622), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 116/179 (64%), Positives = 139/179 (77%), Gaps = 8/179 (4%)

Query: 6   SSSSPNR------SRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGN 59
           + SSP R      +   +S    +ERPR FD K +S CW KA  V GRHPERWR+DAAGN
Sbjct: 2   APSSPRRDLDGEGTGGGSSAAREKERPRSFDEKTRSACWRKAAVVMGRHPERWRQDAAGN 61

Query: 60  IVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFS 119
           +VC+RF +CHGCLC+EYDHIVPFSKGGEST DNCQILQTRVNR KSDK  ++  +++GFS
Sbjct: 62  VVCRRFWSCHGCLCYEYDHIVPFSKGGESTVDNCQILQTRVNRSKSDKAWIEQAEMQGFS 121

Query: 120 CDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVNHCS 178
           CD+KFTDKELD+IEMAVYGDVIRPG QCRC+TVAEMLG+ KSK++  +A C +  N  S
Sbjct: 122 CDIKFTDKELDVIEMAVYGDVIRPGKQCRCRTVAEMLGKVKSKNQ--MAACELPYNDAS 178


>gi|115481558|ref|NP_001064372.1| Os10g0335200 [Oryza sativa Japonica Group]
 gi|21671981|gb|AAM74343.1|AC115686_10 Putative HNH endonuclease [Oryza sativa Japonica Group]
 gi|23477807|gb|AAN34947.1| Putative HNH endonuclease [Oryza sativa Japonica Group]
 gi|31431266|gb|AAP53074.1| HNH endonuclease family protein, expressed [Oryza sativa Japonica
           Group]
 gi|113638981|dbj|BAF26286.1| Os10g0335200 [Oryza sativa Japonica Group]
 gi|125574417|gb|EAZ15701.1| hypothetical protein OsJ_31114 [Oryza sativa Japonica Group]
 gi|215693775|dbj|BAG88974.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215707241|dbj|BAG93701.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 178

 Score =  243 bits (619), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 116/179 (64%), Positives = 139/179 (77%), Gaps = 8/179 (4%)

Query: 6   SSSSPNR------SRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGN 59
           + SSP R      +   +S    +ERPR FD K +S CW KA  V GRHPERWR+DAAGN
Sbjct: 2   APSSPRRDLDGEGTGGGSSAAREKERPRSFDEKTRSACWRKAAVVMGRHPERWRQDAAGN 61

Query: 60  IVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFS 119
           +VC+RF +CHGCLC+EYDHIVPFSKGGEST DNCQILQTRVNR KSDK  ++  +++GFS
Sbjct: 62  VVCRRFWSCHGCLCYEYDHIVPFSKGGESTVDNCQILQTRVNRSKSDKAWIEQAEMQGFS 121

Query: 120 CDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVNHCS 178
           CD+KFTDKELD+IEMAVYGDVIRPG QCRC+TVAEMLG+ KSK++  +A C +  N  S
Sbjct: 122 CDIKFTDKELDVIEMAVYGDVIRPGKQCRCRTVAEMLGKVKSKNQ--MAACELPHNDAS 178


>gi|326513564|dbj|BAJ87801.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 175

 Score =  242 bits (617), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 109/150 (72%), Positives = 130/150 (86%), Gaps = 2/150 (1%)

Query: 23  EERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPF 82
           +ERPR FD K ++ CW KA  + GRHPERWR+DAAGN+VC+RF +CHGCLC+EYDHIVPF
Sbjct: 26  KERPRSFDEKTRTACWRKAAVLTGRHPERWRQDAAGNVVCRRFWSCHGCLCYEYDHIVPF 85

Query: 83  SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           SKGGEST +NCQILQTRVNR KSDK  V+  +++GFSCD+KFTDKELD+IEMAVYGDVIR
Sbjct: 86  SKGGESTVENCQILQTRVNRSKSDKAWVEKAEMQGFSCDIKFTDKELDVIEMAVYGDVIR 145

Query: 143 PGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           PG QCRCKTVAEMLGQ KSK++  +A C++
Sbjct: 146 PGKQCRCKTVAEMLGQVKSKNQ--MAACNL 173


>gi|357146087|ref|XP_003573871.1| PREDICTED: uncharacterized protein LOC100831965 [Brachypodium
           distachyon]
          Length = 180

 Score =  241 bits (614), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 116/182 (63%), Positives = 138/182 (75%), Gaps = 6/182 (3%)

Query: 1   MSSSSSSSSPNRSRSSNSNN----SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDA 56
           M  SS+S   +   +S+S        +ERPR FD K ++ CW KA  + GRHPERWR+D 
Sbjct: 1   MGPSSTSPREHNGEASSSGRVPKPKEKERPRSFDEKTRAACWRKAAVLAGRHPERWRQDT 60

Query: 57  AGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLK 116
           AGNIVC+RF +CHGCLC+EYDHIVPFSKGGEST +NCQILQTRVNR KSDK  V+  +++
Sbjct: 61  AGNIVCRRFWSCHGCLCYEYDHIVPFSKGGESTVENCQILQTRVNRSKSDKSWVEKAEMQ 120

Query: 117 GFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVNH 176
           G SCD+KFTDKELD+IEMAVYGDVIRPG QCRCKTVAEM+GQ KSKD   +A C +  N 
Sbjct: 121 GASCDIKFTDKELDVIEMAVYGDVIRPGKQCRCKTVAEMIGQVKSKDH--VAACDLPYND 178

Query: 177 CS 178
            S
Sbjct: 179 AS 180


>gi|242039281|ref|XP_002467035.1| hypothetical protein SORBIDRAFT_01g018630 [Sorghum bicolor]
 gi|241920889|gb|EER94033.1| hypothetical protein SORBIDRAFT_01g018630 [Sorghum bicolor]
          Length = 180

 Score =  240 bits (613), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 110/160 (68%), Positives = 129/160 (80%), Gaps = 2/160 (1%)

Query: 13  SRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCL 72
            R S +  + +ERPR FD K ++ CW KA  + GRHPERWR DA GN+VC+RF  CHGCL
Sbjct: 17  GRRSPAQATEKERPRSFDEKTRTACWRKAAVLAGRHPERWRLDAVGNVVCRRFWGCHGCL 76

Query: 73  CFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDII 132
           C+EYDHIVPFSKGGEST +NCQILQTRVNR KSDK  V+  +++GFSCD+KFTD+ELDII
Sbjct: 77  CYEYDHIVPFSKGGESTVENCQILQTRVNRSKSDKTWVEQAEMRGFSCDIKFTDQELDII 136

Query: 133 EMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           EMAVYGDVIRPG QCRCKTVAEMLGQ KS +R  +A C +
Sbjct: 137 EMAVYGDVIRPGKQCRCKTVAEMLGQVKSMNR--MAACEL 174


>gi|116780247|gb|ABK21604.1| unknown [Picea sitchensis]
          Length = 166

 Score =  232 bits (592), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 105/156 (67%), Positives = 131/156 (83%), Gaps = 5/156 (3%)

Query: 7   SSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFC 66
           S+SP     S  +  N++R R+FD K +++CW KA+ VPGRHPERWRKD AGNIVC+RF 
Sbjct: 3   SASP-----SEESARNQKRARFFDEKMRNLCWQKADVVPGRHPERWRKDGAGNIVCRRFW 57

Query: 67  NCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTD 126
           +C GCLC+EYDHI+PFSKGGE+T +NCQILQ+RVNR K +K+Q+D T L+ +SCD+KFTD
Sbjct: 58  HCDGCLCYEYDHIIPFSKGGETTVENCQILQSRVNRSKGNKDQLDKTVLEQYSCDLKFTD 117

Query: 127 KELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSK 162
           KELD+IEMAVYG+V+RPG QCRCK+VAEMLGQ KSK
Sbjct: 118 KELDVIEMAVYGNVMRPGLQCRCKSVAEMLGQLKSK 153


>gi|195651337|gb|ACG45136.1| endonuclease/ nucleic acid binding protein [Zea mays]
          Length = 180

 Score =  232 bits (591), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 105/150 (70%), Positives = 123/150 (82%), Gaps = 2/150 (1%)

Query: 23  EERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPF 82
           +ERPR FD K ++ CW KA  + GRHPERWR+DA GN+VC+RF  CHGCLC+EYDHIVPF
Sbjct: 27  KERPRSFDEKTRTACWRKAAVLAGRHPERWRQDAVGNVVCRRFWGCHGCLCYEYDHIVPF 86

Query: 83  SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           SKGGEST +NCQILQTRVNR KSDK  V   +++GFSCD+ FTDKELDIIEMAVYGDVIR
Sbjct: 87  SKGGESTVENCQILQTRVNRSKSDKAWVAQAEMQGFSCDIMFTDKELDIIEMAVYGDVIR 146

Query: 143 PGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           PG QCRC+TVAE+LGQ K  +R  +A C +
Sbjct: 147 PGKQCRCRTVAEVLGQVKPTNR--MAACEL 174


>gi|116783732|gb|ABK23066.1| unknown [Picea sitchensis]
          Length = 168

 Score =  224 bits (572), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 123/142 (86%)

Query: 21  SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
            N +R R+FD K +++CW KA+ V GRHPERWRKD AGNI+C+RF +C GCLC+EYDHI+
Sbjct: 14  GNHKRVRFFDEKMRNLCWQKADVVAGRHPERWRKDGAGNIICRRFGHCEGCLCYEYDHII 73

Query: 81  PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
           PFSKGGE+T +NCQILQ+RVNR K +K+Q++ T L+ +SCD+KFTDKELD+IEMAVYG+V
Sbjct: 74  PFSKGGETTVENCQILQSRVNRSKGNKDQLNRTVLEQYSCDLKFTDKELDVIEMAVYGNV 133

Query: 141 IRPGNQCRCKTVAEMLGQFKSK 162
           +RPG QCRCK+VAEMLGQ KSK
Sbjct: 134 MRPGLQCRCKSVAEMLGQLKSK 155


>gi|168006043|ref|XP_001755719.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693038|gb|EDQ79392.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 180

 Score =  211 bits (536), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 93/138 (67%), Positives = 112/138 (81%)

Query: 27  RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
           R FDGK ++ CW KA+ VPGRHPERWRKD AGNIVC+R   C GCLC+EYDH+ PFSKGG
Sbjct: 28  RIFDGKMRNQCWQKADVVPGRHPERWRKDVAGNIVCRRLTRCEGCLCYEYDHVRPFSKGG 87

Query: 87  ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGNQ 146
           ++T DNCQILQ+RVNR KS++  + +  LK FSCD+KFTDKELD+IEM VYG+VIRPG Q
Sbjct: 88  QTTVDNCQILQSRVNRMKSNRSDLASDDLKQFSCDLKFTDKELDLIEMGVYGNVIRPGLQ 147

Query: 147 CRCKTVAEMLGQFKSKDR 164
           C+CK+VAE LG   SK +
Sbjct: 148 CKCKSVAETLGTALSKQK 165


>gi|302822371|ref|XP_002992844.1| hypothetical protein SELMODRAFT_136010 [Selaginella moellendorffii]
 gi|300139392|gb|EFJ06134.1| hypothetical protein SELMODRAFT_136010 [Selaginella moellendorffii]
          Length = 174

 Score =  197 bits (502), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 94/170 (55%), Positives = 121/170 (71%), Gaps = 8/170 (4%)

Query: 3   SSSSSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
           ++SSSSSP    ++        + R FD K +  CW KAE VPGRHPERWRKD AGN+VC
Sbjct: 4   ATSSSSSPLPDNAAR-------KLRLFDEKMRRQCWQKAEVVPGRHPERWRKDGAGNVVC 56

Query: 63  KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDV 122
           ++F  C GC+C+E+DHI PFSKGGE+   NCQILQ+RVNR KS+K+  D   L+ +SCD+
Sbjct: 57  RKFIGCEGCMCYEFDHIRPFSKGGETDVSNCQILQSRVNRLKSNKD-ADEHLLQQYSCDL 115

Query: 123 KFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           KFT +ELD++EMAVYG+VIRPG QCRCK+V+E LG    K +     C +
Sbjct: 116 KFTARELDLVEMAVYGNVIRPGLQCRCKSVSETLGNGVHKSKGHEPACEL 165


>gi|302811781|ref|XP_002987579.1| hypothetical protein SELMODRAFT_126371 [Selaginella moellendorffii]
 gi|300144733|gb|EFJ11415.1| hypothetical protein SELMODRAFT_126371 [Selaginella moellendorffii]
          Length = 174

 Score =  197 bits (502), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 94/170 (55%), Positives = 121/170 (71%), Gaps = 8/170 (4%)

Query: 3   SSSSSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
           ++SSSSSP    ++        + R FD K +  CW KAE VPGRHPERWRKD AGN+VC
Sbjct: 4   ATSSSSSPLPDNAAR-------KLRLFDEKMRRQCWEKAEVVPGRHPERWRKDGAGNVVC 56

Query: 63  KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDV 122
           ++F  C GC+C+E+DHI PFSKGGE+   NCQILQ+RVNR KS+K+  D   L+ +SCD+
Sbjct: 57  RKFIGCEGCMCYEFDHIRPFSKGGETDVSNCQILQSRVNRLKSNKD-ADEHLLQQYSCDL 115

Query: 123 KFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           KFT +ELD++EMAVYG+VIRPG QCRCK+V+E LG    K +     C +
Sbjct: 116 KFTARELDLVEMAVYGNVIRPGLQCRCKSVSETLGNGVHKSKGHEPACEL 165


>gi|363814422|ref|NP_001242848.1| uncharacterized protein LOC100775923 [Glycine max]
 gi|255641881|gb|ACU21209.1| unknown [Glycine max]
          Length = 121

 Score =  186 bits (472), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 86/124 (69%), Positives = 101/124 (81%), Gaps = 5/124 (4%)

Query: 6   SSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
           SSSSP RSR        E+RPR+FD  AK++CW KA+TVPGRHPERWRKDAAGN+VCKRF
Sbjct: 2   SSSSPQRSR-----GDGEKRPRFFDSNAKAICWGKADTVPGRHPERWRKDAAGNVVCKRF 56

Query: 66  CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
            NC GCLC+EYDHI+PFSKGGESTADNCQILQ+RVNRFKSDK  +D+ +LK +SC+V   
Sbjct: 57  FNCIGCLCYEYDHIIPFSKGGESTADNCQILQSRVNRFKSDKYNIDSEQLKDYSCEVILL 116

Query: 126 DKEL 129
            + L
Sbjct: 117 IRSL 120


>gi|452824505|gb|EME31507.1| endonuclease [Galdieria sulphuraria]
          Length = 146

 Score =  166 bits (420), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 73/129 (56%), Positives = 98/129 (75%), Gaps = 1/129 (0%)

Query: 27  RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
           R F  + K+ CW KA  VPGR P+RWR DA  N VCKR  +C GCLC EYDH++P+S+GG
Sbjct: 12  RIFSNRVKAACWEKATPVPGRDPDRWRLDAFRNPVCKRLTSCEGCLCHEYDHVIPYSQGG 71

Query: 87  ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGNQ 146
            ST +NCQILQTR+NR K+D+ Q+ A +L+ FSC++ F+++ELD+IEMAVYG+V R   +
Sbjct: 72  ASTVENCQILQTRINRLKADR-QLTAEQLESFSCEITFSERELDLIEMAVYGNVQRDHFR 130

Query: 147 CRCKTVAEM 155
           CRCK+  E+
Sbjct: 131 CRCKSFFEV 139


>gi|226504958|ref|NP_001144283.1| uncharacterized protein LOC100277163 [Zea mays]
 gi|195639630|gb|ACG39283.1| hypothetical protein [Zea mays]
          Length = 139

 Score =  154 bits (388), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 78/157 (49%), Positives = 93/157 (59%), Gaps = 43/157 (27%)

Query: 16  SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
           S +  + +ERPR FD K ++ CW KA  + GRHPERWR+DA GN+VC+RF  CHGCLC+E
Sbjct: 20  SPAPATWKERPRSFDEKTRTACWRKAAVLSGRHPERWRQDAVGNVVCRRFWGCHGCLCYE 79

Query: 76  YDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMA 135
           YDHIVPFSK                                         DKELDIIEMA
Sbjct: 80  YDHIVPFSK-----------------------------------------DKELDIIEMA 98

Query: 136 VYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           VYGDVIRPG QCRC+TVAE+LGQ K  +R  +A C +
Sbjct: 99  VYGDVIRPGKQCRCRTVAEVLGQVKPTNR--MAACEL 133


>gi|242036789|ref|XP_002465789.1| hypothetical protein SORBIDRAFT_01g045830 [Sorghum bicolor]
 gi|241919643|gb|EER92787.1| hypothetical protein SORBIDRAFT_01g045830 [Sorghum bicolor]
          Length = 188

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 61/125 (48%), Positives = 85/125 (68%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE VPGR PERWR+DA GNIV ++   C GCLC +YDHIVP+SKG
Sbjct: 62  PRSFSYAVKQQCWEKAERVPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHIVPYSKG 121

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
           G+ST +NCQ+LQ  VNR K +K ++  ++L   S   + + +++D++E++ YG+V R  +
Sbjct: 122 GKSTLENCQVLQATVNRSKGNKTEISKSELIQKSAYCRVSGRDMDLVELSAYGNVRRGPD 181

Query: 146 QCRCK 150
              CK
Sbjct: 182 SGGCK 186


>gi|293331837|ref|NP_001168386.1| uncharacterized protein LOC100382155 [Zea mays]
 gi|223947943|gb|ACN28055.1| unknown [Zea mays]
 gi|414865044|tpg|DAA43601.1| TPA: hypothetical protein ZEAMMB73_851898 [Zea mays]
          Length = 188

 Score =  137 bits (345), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 60/125 (48%), Positives = 85/125 (68%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE +PGR PERWR+DA GNIV ++   C GCLC +YDHIVP+SKG
Sbjct: 62  PRSFSYAVKQQCWEKAERLPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHIVPYSKG 121

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
           G+ST +NCQ+LQ  VNR K +K ++  ++L   S   + + +++D++E++ YG+V R  +
Sbjct: 122 GKSTLENCQVLQATVNRSKGNKTEISKSELIQKSAYCRVSGRDMDLVELSAYGNVRRGPD 181

Query: 146 QCRCK 150
              CK
Sbjct: 182 SGGCK 186


>gi|357113864|ref|XP_003558721.1| PREDICTED: uncharacterized protein LOC100825547 [Brachypodium
           distachyon]
          Length = 188

 Score =  135 bits (339), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 59/125 (47%), Positives = 84/125 (67%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE VPGR PERWR+DA GN+V ++   C GCLC +YDHIVP+SKG
Sbjct: 62  PRSFSYAVKQQCWEKAEKVPGRDPERWRRDALGNMVFRKLVGCPGCLCHDYDHIVPYSKG 121

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
           G+ST +NCQ+LQ  VNR K +K ++  ++L   S   + + +++D+ E++ +G+V R  +
Sbjct: 122 GKSTLENCQVLQATVNRSKGNKTEISKSELAQRSAYCRVSGRDMDLFELSAFGNVRRGPD 181

Query: 146 QCRCK 150
              CK
Sbjct: 182 SGGCK 186


>gi|222624269|gb|EEE58401.1| hypothetical protein OsJ_09573 [Oryza sativa Japonica Group]
          Length = 187

 Score =  134 bits (336), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 57/117 (48%), Positives = 82/117 (70%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE +PGR PERWR+DA GN+V ++   C GCLC +YDHIVP+SKG
Sbjct: 61  PRSFPYVVKQRCWEKAERLPGRDPERWRRDALGNVVFRKLVGCPGCLCHDYDHIVPYSKG 120

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           G+ST +NCQ+LQ  VNR K +K ++  ++L   S   + + +++D++E++ YG+V R
Sbjct: 121 GKSTLENCQVLQATVNRSKGNKTELSKSELIQKSAYCRVSGRDMDLLELSAYGNVRR 177


>gi|229594535|ref|XP_001023127.3| HNH endonuclease family protein [Tetrahymena thermophila]
 gi|225566791|gb|EAS02882.3| HNH endonuclease family protein [Tetrahymena thermophila SB210]
          Length = 208

 Score =  134 bits (336), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 61/120 (50%), Positives = 78/120 (65%)

Query: 23  EERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPF 82
           +++PR F    K  CW  A    GRHP+RWR DA GN V +   +C G LC EYDHIVPF
Sbjct: 37  QKQPRCFTKSMKDQCWNNATIFIGRHPDRWRLDAVGNPVLRGLTSCLGPLCHEYDHIVPF 96

Query: 83  SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           SKGG++   NCQILQT VNR+KS+K      +L   S   KFT++E D IE A+YGD+++
Sbjct: 97  SKGGKTEISNCQILQTMVNRYKSNKTNQSHKELIAVSSTNKFTNQEFDSIERAIYGDILK 156


>gi|326534162|dbj|BAJ89431.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 180

 Score =  133 bits (334), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 58/125 (46%), Positives = 83/125 (66%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE VPGR PERWR+DA GNIV ++   C GCLC +YDHI+P+SKG
Sbjct: 54  PRSFPHAVKQQCWEKAEKVPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHILPYSKG 113

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
           G+ST +NCQ+LQ  VNR K +  ++  ++L   S   + + +++D+ E++ +G+V R  +
Sbjct: 114 GKSTLENCQVLQATVNRSKGNNTEISKSELAQRSAYCRVSGRDMDLFEVSAFGNVRRGPD 173

Query: 146 QCRCK 150
              CK
Sbjct: 174 SGGCK 178


>gi|15232817|ref|NP_190333.1| HNH endonuclease [Arabidopsis thaliana]
 gi|6522532|emb|CAB61975.1| hypothetical protein [Arabidopsis thaliana]
 gi|117168237|gb|ABK32201.1| At3g47490 [Arabidopsis thaliana]
 gi|332644767|gb|AEE78288.1| HNH endonuclease [Arabidopsis thaliana]
          Length = 234

 Score =  132 bits (332), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 58/125 (46%), Positives = 80/125 (64%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE + GR PERWR+D  GNIV ++   C GCLC +YDHIVP+SKG
Sbjct: 108 PRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 167

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
           G+ST +NCQ+LQ +VNR K +K  +  ++L   S   +   +++D+IE+  YG+V R   
Sbjct: 168 GKSTLENCQVLQAKVNRSKGNKTDISRSELIQRSSYCRVAGRDMDLIELTAYGNVQRAPT 227

Query: 146 QCRCK 150
              C+
Sbjct: 228 SSGCR 232


>gi|168048449|ref|XP_001776679.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671971|gb|EDQ58515.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 129

 Score =  132 bits (332), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 58/114 (50%), Positives = 76/114 (66%)

Query: 27  RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
           R F    K  CWAKAETV GR P RWR+DA GN+V ++   C GCLC +YDHIVP+SKGG
Sbjct: 1   RSFPHDVKEACWAKAETVKGRDPARWRRDALGNVVFRKLVGCQGCLCHDYDHIVPYSKGG 60

Query: 87  ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
            ST DNCQ+LQ   NRFK  +  V  T+++  S     T +++D +E++ YG+V
Sbjct: 61  PSTLDNCQVLQAPANRFKGARINVPVTEMQQHSAYCSLTGRDMDFVELSAYGNV 114


>gi|413934346|gb|AFW68897.1| hypothetical protein ZEAMMB73_497763 [Zea mays]
          Length = 132

 Score =  131 bits (330), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 63/90 (70%), Positives = 73/90 (81%), Gaps = 2/90 (2%)

Query: 83  SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           S GGEST +NCQILQTRVNR KSDK  V   +++GFSCD+ FTDKELDIIEMAVYGDVIR
Sbjct: 39  STGGESTVENCQILQTRVNRSKSDKAWVAQAEMQGFSCDIMFTDKELDIIEMAVYGDVIR 98

Query: 143 PGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
           PG QCRC+TVAE+LGQ K  +R  +A C +
Sbjct: 99  PGKQCRCRTVAEVLGQVKPTNR--MAACEL 126


>gi|115451005|ref|NP_001049103.1| Os03g0169800 [Oryza sativa Japonica Group]
 gi|19071624|gb|AAL84291.1|AC073556_8 putative endonuclease [Oryza sativa Japonica Group]
 gi|108706401|gb|ABF94196.1| HNH endonuclease domain-containing protein, putative, expressed
           [Oryza sativa Japonica Group]
 gi|113547574|dbj|BAF11017.1| Os03g0169800 [Oryza sativa Japonica Group]
 gi|215686970|dbj|BAG90840.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218192172|gb|EEC74599.1| hypothetical protein OsI_10187 [Oryza sativa Indica Group]
          Length = 187

 Score =  131 bits (329), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 57/117 (48%), Positives = 81/117 (69%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE V GR PERWR+DA GN+V ++   C GCLC +YDHIVP+SKG
Sbjct: 61  PRSFPYVVKQRCWEKAERVAGRDPERWRRDALGNVVFRKLVGCPGCLCHDYDHIVPYSKG 120

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           G+ST +NCQ+LQ  VNR K +K ++  ++L   S   + + +++D++E++ YG+V R
Sbjct: 121 GKSTLENCQVLQATVNRSKGNKTELSKSELIQKSAYCRVSGRDMDLLELSAYGNVRR 177


>gi|145521759|ref|XP_001446729.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414218|emb|CAK79332.1| unnamed protein product [Paramecium tetraurelia]
          Length = 139

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 58/113 (51%), Positives = 77/113 (68%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR FD  A+  CW  A+ + GR+P+RWR DA GN VCK    C G LC+++DHIVP+SKG
Sbjct: 27  PRRFDKLAREECWNNAQIMEGRNPDRWRLDAVGNPVCKALNACRGPLCYQFDHIVPYSKG 86

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYG 138
           GES + NCQ+LQ+ VN +KS+KE +   +LK  S  + FT +  D IE A+YG
Sbjct: 87  GESISKNCQLLQSFVNIYKSNKENISKDELKQVSPQLNFTSEVFDAIEYAIYG 139


>gi|297815918|ref|XP_002875842.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata
           subsp. lyrata]
 gi|297321680|gb|EFH52101.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata
           subsp. lyrata]
          Length = 225

 Score =  130 bits (327), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 60/135 (44%), Positives = 83/135 (61%), Gaps = 1/135 (0%)

Query: 16  SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
           S  ++ N E PR F    K  CW KAE + GR PERWR+D  GNIV ++   C GCLC +
Sbjct: 90  SGYDDPNPE-PRSFPYSVKQQCWEKAEKIKGRDPERWRRDPLGNIVFRKLVGCPGCLCHD 148

Query: 76  YDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMA 135
           YDHIVP+SKGG+S  +NCQ+LQ +VNR K +K  +   +L   S   +   +++D+IE+ 
Sbjct: 149 YDHIVPYSKGGKSILENCQVLQAKVNRSKGNKTDISRAELIQRSSYCRVAGRDMDLIELT 208

Query: 136 VYGDVIRPGNQCRCK 150
            YG+V R      C+
Sbjct: 209 AYGNVQRAPESSGCR 223


>gi|145512195|ref|XP_001442014.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409286|emb|CAK74617.1| unnamed protein product [Paramecium tetraurelia]
          Length = 167

 Score =  129 bits (323), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 58/113 (51%), Positives = 78/113 (69%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR FD  A+  C   A+ + GR+P+RWR DA GN VCK   +C G LC+++DHIVP+SKG
Sbjct: 55  PRRFDKLAREECRNNAQIMEGRNPDRWRLDAVGNPVCKALNSCRGSLCYQFDHIVPYSKG 114

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYG 138
           GES + NCQ+LQ+ VN +KS+KE +   +LK  S  +KFT  + D IE A+YG
Sbjct: 115 GESISINCQLLQSFVNIYKSNKENISKDELKHVSPQLKFTPVQFDAIEYAIYG 167


>gi|255547375|ref|XP_002514745.1| endonuclease, putative [Ricinus communis]
 gi|223546349|gb|EEF47851.1| endonuclease, putative [Ricinus communis]
          Length = 203

 Score =  127 bits (319), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 55/115 (47%), Positives = 76/115 (66%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE V GR P+RWR+D  GNIV ++   C GCLC +YDHIVP+SKG
Sbjct: 77  PRSFPYSVKQQCWDKAEKVKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 136

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
           G+ST +NCQ+LQ  VNR K ++ ++    L   S   +   +++D+IE++ YG+V
Sbjct: 137 GKSTLENCQVLQATVNRSKGNRTELSRADLIQRSSYCRVAGRDMDLIELSAYGNV 191


>gi|225457397|ref|XP_002284877.1| PREDICTED: uncharacterized protein LOC100262165 isoform 1 [Vitis
           vinifera]
          Length = 199

 Score =  127 bits (319), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 54/117 (46%), Positives = 78/117 (66%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE + GR P+RWR+D  GNIV ++   C GCLC +YDHIVP+SKG
Sbjct: 73  PRSFPYSVKQQCWEKAEKIKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 132

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           G+ST +NCQ+LQ  VNR K ++ ++    L   S   + + +++D++E++ YG+V R
Sbjct: 133 GKSTLENCQVLQATVNRSKGNRTEISKADLIQKSSYCRVSGRDMDLLELSAYGNVRR 189


>gi|449477241|ref|XP_004154969.1| PREDICTED: uncharacterized protein LOC101226861 [Cucumis sativus]
          Length = 200

 Score =  127 bits (319), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 82/125 (65%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE + GR P+RWR+DA GN V ++   C GCLC +YDHI+P+SKG
Sbjct: 74  PRSFPHSVKQQCWDKAEKIKGRDPDRWRRDALGNTVFRKLVGCPGCLCHDYDHILPYSKG 133

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
           G+ST +NCQ+LQ  VNR K ++ ++   +L   S   + + +++D++E++ YG+V R  +
Sbjct: 134 GKSTLENCQVLQATVNRSKGNRTEISRAELIQKSSYCRVSGRDMDLLELSAYGNVHRGQD 193

Query: 146 QCRCK 150
              C+
Sbjct: 194 SGGCR 198


>gi|449440796|ref|XP_004138170.1| PREDICTED: uncharacterized protein LOC101222461 [Cucumis sativus]
          Length = 200

 Score =  127 bits (318), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 82/125 (65%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE + GR P+RWR+DA GN V ++   C GCLC +YDHI+P+SKG
Sbjct: 74  PRSFPHSVKQQCWDKAEKIKGRDPDRWRRDALGNTVFRKLVGCPGCLCHDYDHILPYSKG 133

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
           G+ST +NCQ+LQ  VNR K ++ ++   +L   S   + + +++D++E++ YG+V R  +
Sbjct: 134 GKSTLENCQVLQATVNRSKGNRTEISRAELIQKSSYCRVSGRDMDLLELSAYGNVHRGQD 193

Query: 146 QCRCK 150
              C+
Sbjct: 194 SGGCR 198


>gi|351720795|ref|NP_001237956.1| uncharacterized protein LOC100500174 [Glycine max]
 gi|255629565|gb|ACU15130.1| unknown [Glycine max]
          Length = 206

 Score =  126 bits (316), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 82/124 (66%)

Query: 27  RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
           R F    K  CW KA+ V GR P+RWR+DA GN + ++   C GCLC +YDHI+P+SKGG
Sbjct: 81  RSFPHSVKQKCWEKADKVKGRDPDRWRRDALGNTLFRKLVGCPGCLCHDYDHIIPYSKGG 140

Query: 87  ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGNQ 146
           EST +NCQ+LQ  VNR K ++ ++   +L   S   + +D+++D++E++ YG+V R  + 
Sbjct: 141 ESTLENCQVLQATVNRSKGNRTEISKAELIQKSSYCRVSDRDMDLLELSAYGNVRRGPDS 200

Query: 147 CRCK 150
             C+
Sbjct: 201 GGCR 204


>gi|388520967|gb|AFK48545.1| unknown [Lotus japonicus]
          Length = 198

 Score =  124 bits (312), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 90/152 (59%), Gaps = 13/152 (8%)

Query: 2   SSSSSSSSPNRSRSSNSNNSNEE-------------RPRYFDGKAKSMCWAKAETVPGRH 48
           +SS  SSSP RS S ++   +++              PR F    K  CW KA+ + GR 
Sbjct: 35  TSSKLSSSPKRSTSPSATLLDDDPVALFDDYTHPNPNPRSFPHSVKQKCWEKADKIKGRD 94

Query: 49  PERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKE 108
           P+RWR+D  GN + ++   C GCLC +YDHIVP+SKGG+ST +NCQ+LQ  VNR K ++ 
Sbjct: 95  PDRWRRDPLGNTIFRKLVGCPGCLCHDYDHIVPYSKGGQSTLENCQVLQATVNRAKGNRT 154

Query: 109 QVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
            +    L   S   + + +++D++E++ YG+V
Sbjct: 155 GISRADLMQKSSYCRVSGRDMDLLELSAYGNV 186


>gi|224104735|ref|XP_002313547.1| predicted protein [Populus trichocarpa]
 gi|222849955|gb|EEE87502.1| predicted protein [Populus trichocarpa]
          Length = 128

 Score =  124 bits (311), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 54/117 (46%), Positives = 78/117 (66%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE V GR P+RWR+D  GNIV ++   C GCLC +YDHIVP+SKG
Sbjct: 2   PRSFPYSVKQQCWEKAEKVKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 61

Query: 86  GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
           G+S+ +NCQ+LQ  VNR K ++ ++   +L   S   + + +++D +E++ YG+V R
Sbjct: 62  GKSSLENCQVLQATVNRSKGNRTELSRAELIRRSSYCRVSGRDMDFVELSAYGNVRR 118


>gi|449018629|dbj|BAM82031.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 188

 Score =  123 bits (309), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 55/121 (45%), Positives = 76/121 (62%), Gaps = 4/121 (3%)

Query: 27  RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
           R F    +++CW  A+ VPGRHP RWR D  GN+VC+    C+G LC+EYDH  P+ KGG
Sbjct: 27  RLFSHTERNICWRNAQVVPGRHPRRWRFDRVGNLVCRALHGCNGPLCYEYDHRAPWGKGG 86

Query: 87  ESTADNCQILQTRVNRFKSDKEQVDATK--LKGFS--CDVKFTDKELDIIEMAVYGDVIR 142
            S + NC +LQT  NR+KSD+ +   T+  L+  S       + +ELD IE  +YGD++R
Sbjct: 87  RSDSTNCDVLQTSANRWKSDQPESRVTEKLLRAHSRVLSRSLSQEELDTIEYVIYGDILR 146

Query: 143 P 143
           P
Sbjct: 147 P 147


>gi|452821821|gb|EME28847.1| endonuclease isoform 1 [Galdieria sulphuraria]
          Length = 199

 Score =  123 bits (309), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 62/149 (41%), Positives = 82/149 (55%), Gaps = 5/149 (3%)

Query: 3   SSSSSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
           S  SS    + +S        +  + F    K  CW KA+ V GR P RWR+D AGN++C
Sbjct: 35  SLVSSLQGTKLKSPGIEKKEAKVLKQFQHIDKEACWLKADAVRGRDPSRWRRDLAGNVIC 94

Query: 63  KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCD- 121
           K+   C G  C ++DH+VP SKGGES  DNCQ+LQT  NR K D+     T +K +  + 
Sbjct: 95  KKLRGCRGPFCMDFDHVVPLSKGGESVLDNCQVLQTTANRHKGDRLVEPTTDIKKYYKEI 154

Query: 122 ----VKFTDKELDIIEMAVYGDVIRPGNQ 146
               V    + LDI+E  VYGDVIR G +
Sbjct: 155 SDRRVLPLQRHLDIVEFYVYGDVIRHGRR 183


>gi|116794423|gb|ABK27138.1| unknown [Picea sitchensis]
          Length = 200

 Score =  123 bits (308), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 55/122 (45%), Positives = 78/122 (63%)

Query: 21  SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
           S+E  PR F    K  CW KA+ + GR P+RWR+DA GN V ++   C GCLC +YDHI 
Sbjct: 69  SSEYNPRAFPYHIKQACWEKADKIKGRDPDRWRRDALGNTVFRKLVGCSGCLCHDYDHIH 128

Query: 81  PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
           P+SKGG+ST +NCQ+LQ   NR KS+   +  T L   S   + + +++D +E++ YG+V
Sbjct: 129 PYSKGGKSTVENCQVLQGAANRAKSNHTDMSTTDLIQKSAYCRVSGRDMDFLELSAYGNV 188

Query: 141 IR 142
            R
Sbjct: 189 HR 190


>gi|449015785|dbj|BAM79187.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 231

 Score =  123 bits (308), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 65/171 (38%), Positives = 93/171 (54%), Gaps = 14/171 (8%)

Query: 27  RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
           R F    +   W +A+ VPGR PERWR D  GN V +    C GC C+E+DH VP+S GG
Sbjct: 26  RSFTEAQRKAAWERAKPVPGRDPERWRYDVYGNPVLRPLHGCLGCFCYEFDHRVPYSLGG 85

Query: 87  ESTADNCQILQTRVNRFKSDKEQ---VDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRP 143
            ++ DNC ILQTR+NR KS+  +   +   +++ ++C+      ++D IE AVYGDV R 
Sbjct: 86  ATSLDNCDILQTRINRLKSNALEGVNISREQIRQYACEDVSNVIDMDAIEFAVYGDVRRE 145

Query: 144 GNQCRC-----KTVAEML------GQFKSKDRWLLANCHMMVNHCSTRGSC 183
           G  CR      K  +EML       +F+S    +  +  M ++  S   SC
Sbjct: 146 GLHCRAYSLFEKVASEMLVSTGRVSRFESGSPAVTEHSPMKISRQSNIPSC 196


>gi|197306242|gb|ACH59472.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
          Length = 84

 Score =  121 bits (304), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 51/79 (64%), Positives = 65/79 (82%)

Query: 15 SSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCF 74
          SS  N SN+++ R+FD K +++CW KA+ V GRHPERWRKD AGNIVC+RF +C GCLC+
Sbjct: 6  SSEDNPSNQKKVRFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCY 65

Query: 75 EYDHIVPFSKGGESTADNC 93
          EYDHI+PFSKGG +T +NC
Sbjct: 66 EYDHIIPFSKGGGTTVENC 84


>gi|334185796|ref|NP_001190027.1| HNH endonuclease [Arabidopsis thaliana]
 gi|332644768|gb|AEE78289.1| HNH endonuclease [Arabidopsis thaliana]
          Length = 215

 Score =  119 bits (299), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 84/155 (54%), Gaps = 21/155 (13%)

Query: 16  SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
           S   + N E PR F    K  CW KAE + GR PERWR+D  GNIV ++   C GCLC +
Sbjct: 60  SGYGDPNPE-PRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVGCPGCLCHD 118

Query: 76  YDHIVPFS--------------------KGGESTADNCQILQTRVNRFKSDKEQVDATKL 115
           YDHIVP+S                    KGG+ST +NCQ+LQ +VNR K +K  +  ++L
Sbjct: 119 YDHIVPYSKDMCPLEHAFDEPHKDFLDLKGGKSTLENCQVLQAKVNRSKGNKTDISRSEL 178

Query: 116 KGFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCK 150
              S   +   +++D+IE+  YG+V R      C+
Sbjct: 179 IQRSSYCRVAGRDMDLIELTAYGNVQRAPTSSGCR 213


>gi|197306240|gb|ACH59471.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306260|gb|ACH59481.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
          Length = 84

 Score =  115 bits (289), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 47/72 (65%), Positives = 61/72 (84%)

Query: 22 NEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVP 81
          N+++ R+FD K +++CW KA+ V GRHPERWRKD AGNIVC+RF +C GCLC+EYDHI+P
Sbjct: 13 NQKKVRFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIP 72

Query: 82 FSKGGESTADNC 93
          FSKGG +T +NC
Sbjct: 73 FSKGGGTTVENC 84


>gi|197306236|gb|ACH59469.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
          Length = 84

 Score =  115 bits (289), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 47/72 (65%), Positives = 61/72 (84%)

Query: 22 NEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVP 81
          N+++ R+FD K +++CW KA+ V GRHPERWRKD AGNIVC+RF +C GCLC+EYDHI+P
Sbjct: 13 NQKKVRFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIP 72

Query: 82 FSKGGESTADNC 93
          FSKGG +T +NC
Sbjct: 73 FSKGGGTTVENC 84


>gi|197306216|gb|ACH59459.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306218|gb|ACH59460.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306220|gb|ACH59461.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306222|gb|ACH59462.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306224|gb|ACH59463.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306226|gb|ACH59464.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306228|gb|ACH59465.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306230|gb|ACH59466.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306232|gb|ACH59467.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306234|gb|ACH59468.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306238|gb|ACH59470.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306244|gb|ACH59473.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306246|gb|ACH59474.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306248|gb|ACH59475.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306250|gb|ACH59476.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306252|gb|ACH59477.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306254|gb|ACH59478.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306256|gb|ACH59479.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306258|gb|ACH59480.1| HNH endonuclease domain-containing protein [Pseudotsuga
          menziesii]
 gi|197306262|gb|ACH59482.1| HNH endonuclease domain-containing protein [Pseudotsuga
          macrocarpa]
          Length = 84

 Score =  115 bits (289), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 47/72 (65%), Positives = 61/72 (84%)

Query: 22 NEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVP 81
          N+++ R+FD K +++CW KA+ V GRHPERWRKD AGNIVC+RF +C GCLC+EYDHI+P
Sbjct: 13 NQKKVRFFDEKMRNLCWQKADVVAGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIP 72

Query: 82 FSKGGESTADNC 93
          FSKGG +T +NC
Sbjct: 73 FSKGGGTTVENC 84


>gi|452821820|gb|EME28846.1| endonuclease isoform 2 [Galdieria sulphuraria]
          Length = 190

 Score =  114 bits (286), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 57/141 (40%), Positives = 76/141 (53%), Gaps = 5/141 (3%)

Query: 3   SSSSSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
           S  SS    + +S        +  + F    K  CW KA+ V GR P RWR+D AGN++C
Sbjct: 35  SLVSSLQGTKLKSPGIEKKEAKVLKQFQHIDKEACWLKADAVRGRDPSRWRRDLAGNVIC 94

Query: 63  KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCD- 121
           K+   C G  C ++DH+VP SKGGES  DNCQ+LQT  NR K D+     T +K +  + 
Sbjct: 95  KKLRGCRGPFCMDFDHVVPLSKGGESVLDNCQVLQTTANRHKGDRLVEPTTDIKKYYKEI 154

Query: 122 ----VKFTDKELDIIEMAVYG 138
               V    + LDI+E  VYG
Sbjct: 155 SDRRVLPLQRHLDIVEFYVYG 175


>gi|357113854|ref|XP_003558716.1| PREDICTED: uncharacterized protein LOC100824123 [Brachypodium
           distachyon]
          Length = 246

 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 64/169 (37%), Positives = 92/169 (54%), Gaps = 24/169 (14%)

Query: 2   SSSSSSSSPNRSRSSNSNN-----SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDA 56
           S+   +  P  S+ +N+        NEE+   F    +  CW  AE VPGRHP RWRKD 
Sbjct: 42  SADDENKDPKDSQQANARKRTVLKKNEEK---FPRAVRKWCWINAEEVPGRHPGRWRKDP 98

Query: 57  AGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSD-----KEQVD 111
            GN++CK+     G   F YDHI P S+GG+   +NCQ+L   +N FKS+     +E VD
Sbjct: 99  YGNVLCKKQTTTLGLSYFHYDHIHPRSRGGKGVKENCQLLLAYLNIFKSNDIKTREEMVD 158

Query: 112 ATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN-----QCRCKTVAEM 155
                  S  + ++++ELD+IE  VYG+V+RPG+     +CR     E+
Sbjct: 159 N------SYHITYSNEELDLIECNVYGNVVRPGHAETDPECRTYLKKEI 201


>gi|340502170|gb|EGR28883.1| hypothetical protein IMG5_167370 [Ichthyophthirius multifiliis]
          Length = 121

 Score =  112 bits (279), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 51/93 (54%), Positives = 66/93 (70%)

Query: 46  GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKS 105
           GRHPERWR DA GN V K   +C+G LC++YDHIVP+SKGGE+  +NCQ+LQ+ VN+ KS
Sbjct: 3   GRHPERWRLDAVGNPVIKSLTSCNGPLCYQYDHIVPYSKGGETQIENCQLLQSIVNKHKS 62

Query: 106 DKEQVDATKLKGFSCDVKFTDKELDIIEMAVYG 138
           +K     ++LK  S    FT+ E D IE A+YG
Sbjct: 63  NKIDATYSELKNVSPKDDFTNFEYDCIEKAIYG 95


>gi|359491703|ref|XP_003634310.1| PREDICTED: uncharacterized protein LOC100262165 isoform 2 [Vitis
           vinifera]
          Length = 167

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 43/78 (55%), Positives = 55/78 (70%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE + GR P+RWR+D  GNIV ++   C GCLC +YDHIVP+SKG
Sbjct: 73  PRSFPYSVKQQCWEKAEKIKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 132

Query: 86  GESTADNCQILQTRVNRF 103
           G+ST +NCQ+LQ  ++ F
Sbjct: 133 GKSTLENCQVLQENMDIF 150


>gi|414865043|tpg|DAA43600.1| TPA: hypothetical protein ZEAMMB73_851898 [Zea mays]
          Length = 141

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 40/71 (56%), Positives = 48/71 (67%), Gaps = 1/71 (1%)

Query: 26  PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
           PR F    K  CW KAE +PGR PERWR+DA GNIV ++   C GCLC +YDHIVP+SK 
Sbjct: 62  PRSFSYAVKQQCWEKAERLPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHIVPYSKV 121

Query: 86  GESTADNCQIL 96
            +S  DN + L
Sbjct: 122 PDSN-DNPKSL 131


>gi|42572595|ref|NP_974393.1| HNH endonuclease [Arabidopsis thaliana]
 gi|332644766|gb|AEE78287.1| HNH endonuclease [Arabidopsis thaliana]
          Length = 176

 Score = 82.8 bits (203), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 37/69 (53%), Positives = 44/69 (63%), Gaps = 1/69 (1%)

Query: 16  SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
           S   + N E PR F    K  CW KAE + GR PERWR+D  GNIV ++   C GCLC +
Sbjct: 99  SGYGDPNPE-PRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVGCPGCLCHD 157

Query: 76  YDHIVPFSK 84
           YDHIVP+SK
Sbjct: 158 YDHIVPYSK 166


>gi|414866927|tpg|DAA45484.1| TPA: hypothetical protein ZEAMMB73_528864 [Zea mays]
          Length = 400

 Score = 79.3 bits (194), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 59/106 (55%), Gaps = 25/106 (23%)

Query: 96  LQTRVNRFKSDKEQVD-------------ATKLKGFSCD--------VKFTDKELDIIEM 134
           LQ     F SD E  +             A +++GFSC+         +  DKELDIIEM
Sbjct: 41  LQADPATFTSDPEDAEDFAAWSKEALSQAAAEMQGFSCESDKAWVAQAEMQDKELDIIEM 100

Query: 135 AVYGDVIRPGNQCRCKTVAEMLGQFKSKDR---WLLAN-CHMMVNH 176
           AVYGDVIRPG QCRC+ VAE+LGQ K  +R   W L +  ++ +NH
Sbjct: 101 AVYGDVIRPGKQCRCRIVAEVLGQVKPTNRMAAWPLKDHIYLHLNH 146


>gi|38605769|emb|CAE05870.3| OSJNBa0044K18.12 [Oryza sativa Japonica Group]
          Length = 204

 Score = 79.0 bits (193), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 63/135 (46%), Gaps = 16/135 (11%)

Query: 21  SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
           S   +PR F    +  CW  A  V GR  +RWR DA GN V K     H           
Sbjct: 18  SAPAKPRSFSADVREACWNNAPKVIGRSGDRWRLDANGNPVLKALNGKH----------- 66

Query: 81  PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
               GG +   NCQI+QT  NR KS++  +   ++KG S  + +T   LD IE+AVY  V
Sbjct: 67  ----GGPNDQSNCQIIQTGANRSKSNR-ILRKDQMKGLSRTMNWTHPHLDTIEIAVYASV 121

Query: 141 IRPGNQCRCKTVAEM 155
            R    C C T+  +
Sbjct: 122 ERDDLSCTCPTIDNL 136


>gi|414866926|tpg|DAA45483.1| TPA: hypothetical protein ZEAMMB73_528864 [Zea mays]
          Length = 468

 Score = 79.0 bits (193), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 59/106 (55%), Gaps = 25/106 (23%)

Query: 96  LQTRVNRFKSDKEQVD-------------ATKLKGFSCD--------VKFTDKELDIIEM 134
           LQ     F SD E  +             A +++GFSC+         +  DKELDIIEM
Sbjct: 41  LQADPATFTSDPEDAEDFAAWSKEALSQAAAEMQGFSCESDKAWVAQAEMQDKELDIIEM 100

Query: 135 AVYGDVIRPGNQCRCKTVAEMLGQFKSKDR---WLLAN-CHMMVNH 176
           AVYGDVIRPG QCRC+ VAE+LGQ K  +R   W L +  ++ +NH
Sbjct: 101 AVYGDVIRPGKQCRCRIVAEVLGQVKPTNRMAAWPLKDHIYLHLNH 146


>gi|414866985|tpg|DAA45542.1| TPA: hypothetical protein ZEAMMB73_014165 [Zea mays]
          Length = 463

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/111 (39%), Positives = 59/111 (53%), Gaps = 25/111 (22%)

Query: 92  NCQILQTRVNRFKSDKEQVD-------------ATKLKGFSCD--------VKFTDKELD 130
              +LQ     F SD E  +             A +++GFSC+         +  DKELD
Sbjct: 152 QLHVLQADPATFTSDLEDAEDFAAWSKEALSQAAAEMQGFSCESDKAWVAQAEMQDKELD 211

Query: 131 IIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDR---WLLAN-CHMMVNHC 177
           IIEMA+YGDVIRPG QCRC+ VAE+LGQ K  +    W L +  ++ +NH 
Sbjct: 212 IIEMAMYGDVIRPGKQCRCRIVAEVLGQVKPTNHMAAWPLKDHIYLHLNHL 262


>gi|125548724|gb|EAY94546.1| hypothetical protein OsI_16320 [Oryza sativa Indica Group]
          Length = 232

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 62/135 (45%), Gaps = 16/135 (11%)

Query: 21  SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
           S   +PR F    +  CW  A  V GR  +R R DA GN V K     H           
Sbjct: 18  SAPAKPRSFSADVREACWNNAPKVIGRSGDRCRLDANGNPVLKALNGKH----------- 66

Query: 81  PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
               GG +   NCQI+QT  NR KS++  +   ++KG S  + +T   LD IE+AVY  V
Sbjct: 67  ----GGPNDQSNCQIIQTGANRSKSNR-ILRKDQMKGLSRTMNWTHPHLDTIEIAVYASV 121

Query: 141 IRPGNQCRCKTVAEM 155
            R    C C T+  +
Sbjct: 122 ERDDLSCTCPTIDNL 136


>gi|115345574|ref|YP_771756.1| hypothetical protein RD1_A0048 [Roseobacter denitrificans OCh 114]
 gi|115292895|gb|ABI93348.1| OSJNBa0044K18.12 [Roseobacter denitrificans OCh 114]
          Length = 96

 Score = 63.9 bits (154), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 41/71 (57%)

Query: 37  CWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
            W KA  V G++P  +R+DA GN++ K   N    + ++ DH+ P SKGG     N Q L
Sbjct: 12  VWNKARPVKGKNPNLYRRDAQGNVIYKPAYNRDSAMGWQVDHVWPRSKGGSDARKNLQAL 71

Query: 97  QTRVNRFKSDK 107
           QT  N+ KS+K
Sbjct: 72  QTGANKRKSNK 82


>gi|222629057|gb|EEE61189.1| hypothetical protein OsJ_15190 [Oryza sativa Japonica Group]
          Length = 118

 Score = 57.0 bits (136), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 48/105 (45%), Gaps = 16/105 (15%)

Query: 21  SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
           S   +PR F    +  CW  A  V GR  +RWR DA GN V K     H           
Sbjct: 18  SAPAKPRSFSADVREACWNNAPKVIGRSGDRWRLDANGNPVLKALNGKH----------- 66

Query: 81  PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
               GG +   NCQI+QT  NR KS++  +   ++KG S  + +T
Sbjct: 67  ----GGPNDQSNCQIIQTGANRSKSNR-ILRKDQMKGLSRTMNWT 106


>gi|41408779|ref|NP_961615.1| hypothetical protein MAP2681c [Mycobacterium avium subsp.
           paratuberculosis K-10]
 gi|254774117|ref|ZP_05215633.1| hypothetical protein MaviaA2_05510 [Mycobacterium avium subsp.
           avium ATCC 25291]
 gi|417749729|ref|ZP_12398118.1| restriction endonuclease [Mycobacterium avium subsp.
           paratuberculosis S397]
 gi|41397137|gb|AAS04998.1| hypothetical protein MAP_2681c [Mycobacterium avium subsp.
           paratuberculosis K-10]
 gi|336458728|gb|EGO37688.1| restriction endonuclease [Mycobacterium avium subsp.
           paratuberculosis S397]
          Length = 314

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 37/77 (48%), Gaps = 20/77 (25%)

Query: 51  RWRKDAAGN--IVCKRFCN---------CHGC---------LCFEYDHIVPFSKGGESTA 90
           RWRK AAG   ++  R  N         C  C         L  E DHIVP SKGG ST 
Sbjct: 234 RWRKSAAGQRALMTSRLRNSIKVRDHYTCRYCSVSLAAEPHLLLEVDHIVPISKGGMSTP 293

Query: 91  DNCQILQTRVNRFKSDK 107
           DN Q L  R NR KS+K
Sbjct: 294 DNLQTLCWRCNRTKSNK 310


>gi|403340432|gb|EJY69501.1| hypothetical protein OXYTRI_09762 [Oxytricha trifallax]
          Length = 153

 Score = 49.7 bits (117), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 26/46 (56%)

Query: 27  RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCL 72
           R F  + K +CW ++ T+P R P+RWR D  GN V      CHG L
Sbjct: 61  RIFTRQQKDVCWQQSTTIPIRDPKRWRLDPLGNPVFYTLRGCHGAL 106


>gi|297831270|ref|XP_002883517.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329357|gb|EFH59776.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 87

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 18/97 (18%)

Query: 43  TVPGRHPERWRKDAAGNIVCKRFC-NCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVN 101
            V G + +RW  D  GN+VCK       G  CF++  I    K  E TA+NCQIL     
Sbjct: 2   VVDGMNKDRWCLDVQGNLVCKPLHRGSLGAHCFQFHAI---DKDKELTAENCQILC---- 54

Query: 102 RFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYG 138
                     A   +G   D  FT ++  ++E+A+YG
Sbjct: 55  ----------AAAARGLKPDSSFTQEDFKLVEIALYG 81


>gi|429740338|ref|ZP_19274029.1| ftsk gamma domain protein [Prevotella saccharolytica F0055]
 gi|429153310|gb|EKX96095.1| ftsk gamma domain protein [Prevotella saccharolytica F0055]
          Length = 285

 Score = 45.8 bits (107), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 31/56 (55%), Gaps = 6/56 (10%)

Query: 52  WRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           WR+D        R   C      E+DHI+PFSKGG +T  N Q+L  + NR KS+K
Sbjct: 234 WRRDGG------RCVKCGSQENLEFDHIIPFSKGGSNTIRNIQLLCEKCNRSKSNK 283


>gi|313205222|ref|YP_004043879.1| hypothetical protein Palpr_2764 [Paludibacter propionicigenes WB4]
 gi|312444538|gb|ADQ80894.1| hypothetical protein Palpr_2764 [Paludibacter propionicigenes WB4]
          Length = 100

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 1/76 (1%)

Query: 33  AKSMCWAKAETVPGRHPERWRKDAAGNIV-CKRFCNCHGCLCFEYDHIVPFSKGGESTAD 91
            K + W K   +PG   E WR D  G ++      N      +E DHI P + GG    +
Sbjct: 18  TKKVVWNKGAIIPGYPSEVWRSDKCGKVMKYSEHGNRSSDYGWEIDHITPVAYGGSDDLN 77

Query: 92  NCQILQTRVNRFKSDK 107
           N Q L    N  KSDK
Sbjct: 78  NLQPLNWSNNVDKSDK 93


>gi|118465054|ref|YP_880481.1| HNH endonuclease, putative [Mycobacterium avium 104]
 gi|118166341|gb|ABK67238.1| HNH endonuclease, putative [Mycobacterium avium 104]
          Length = 69

 Score = 44.7 bits (104), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 22/36 (61%), Positives = 24/36 (66%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           L  E +HIVP SKGG ST DN Q L  R NR KS+K
Sbjct: 30  LLLEVNHIVPISKGGMSTPDNLQTLCWRCNRTKSNK 65


>gi|258517332|ref|YP_003193554.1| HNH nuclease [Desulfotomaculum acetoxidans DSM 771]
 gi|257781037|gb|ACV64931.1| HNH nuclease [Desulfotomaculum acetoxidans DSM 771]
          Length = 104

 Score = 44.3 bits (103), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 29  FDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFC---NCHGCLCFEYDHIVPFSKG 85
           F+ +     W KA  VP  +   +RKD  G ++  RF    N +    +E DH  P ++G
Sbjct: 14  FNPRTILAVWNKAAVVPECNQSEFRKDRCGALM--RFADYDNVNSIFGWEIDHEKPVARG 71

Query: 86  GESTADNCQILQTRVNRFKSD 106
           G    DN Q L  R NR KSD
Sbjct: 72  GTDDLDNLQPLHWRNNRGKSD 92


>gi|384253642|gb|EIE27116.1| hypothetical protein COCSUDRAFT_55140 [Coccomyxa subellipsoidea
           C-169]
          Length = 134

 Score = 43.9 bits (102), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 38/70 (54%), Gaps = 6/70 (8%)

Query: 42  ETVPGRHPERWRKDAAGNIVCKRFCNCHG-CLC-FEYDHIVPFSKGGESTADNCQILQTR 99
           E +P     R+R D  GN+V        G C+C FE DHI P+++GG S  +N   L  R
Sbjct: 67  ERIPPPWHMRFRSDPYGNVVA---LEARGSCVCAFEVDHIFPWARGGLSVVENFMALFWR 123

Query: 100 VNR-FKSDKE 108
            NR  K+DK+
Sbjct: 124 SNRNVKNDKD 133


>gi|410867562|ref|YP_006982173.1| HNH endonuclease domain-containing protein [Propionibacterium
           acidipropionici ATCC 4875]
 gi|410824203|gb|AFV90818.1| HNH endonuclease domain-containing protein [Propionibacterium
           acidipropionici ATCC 4875]
          Length = 330

 Score = 43.9 bits (102), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDA 112
           L  E DHIVP S+GG ST DN Q L  R NR KS+K  +DA
Sbjct: 291 LLLEVDHIVPVSRGGLSTEDNLQTLCWRCNRSKSNK-MIDA 330


>gi|418175095|ref|ZP_12811693.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41437]
 gi|353843655|gb|EHE23699.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41437]
          Length = 178

 Score = 43.9 bits (102), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 26/47 (55%)

Query: 61  VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           +C         L  E DHIVP SKGG ST DN Q L  + NR KS+K
Sbjct: 130 ICAASTAEQSLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNK 176


>gi|169346887|ref|ZP_02865835.1| type III restriction enzyme, res subunit family [Clostridium
           perfringens C str. JGS1495]
 gi|169296946|gb|EDS79070.1| type III restriction enzyme, res subunit family [Clostridium
           perfringens C str. JGS1495]
          Length = 740

 Score = 43.5 bits (101), Expect = 0.040,   Method: Composition-based stats.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           F+ DHI P SKGG +T DN Q+L ++ N+ K DK
Sbjct: 705 FQIDHIKPISKGGLTTLDNLQLLCSKCNKIKGDK 738


>gi|392944277|ref|ZP_10309919.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
           sp. QA3]
 gi|392287571|gb|EIV93595.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
           sp. QA3]
          Length = 607

 Score = 43.1 bits (100), Expect = 0.051,   Method: Composition-based stats.
 Identities = 16/34 (47%), Positives = 24/34 (70%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
            E+DH++PFS+GG S+ +N Q+L  R NR K  +
Sbjct: 573 LEFDHVIPFSRGGASSVNNLQLLCRRCNRAKGAR 606


>gi|111221190|ref|YP_711984.1| tellurium resistance protein terE [Frankia alni ACN14a]
 gi|111148722|emb|CAJ60397.1| Tellurium resistance protein terE [Frankia alni ACN14a]
          Length = 644

 Score = 43.1 bits (100), Expect = 0.053,   Method: Composition-based stats.
 Identities = 16/34 (47%), Positives = 24/34 (70%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
            E+DH++PFS+GG S+ +N Q+L  R NR K  +
Sbjct: 610 LEFDHVIPFSRGGASSVNNLQLLCRRCNRSKGAR 643


>gi|218197006|gb|EEC79433.1| hypothetical protein OsI_20409 [Oryza sativa Indica Group]
          Length = 498

 Score = 43.1 bits (100), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 9/103 (8%)

Query: 81  PFSKGGESTADNCQILQTRVN--------RFKSDKEQVDATKLKGFSCDVKFTDKELDII 132
           P   GG +T +NC I+Q R N        R     + +D  ++   +  + +T+ +L  I
Sbjct: 82  PLVTGGLNTPENCAIIQGRANRNCAIIQSRANRRDDILDEMEVAALASRINWTEYQLASI 141

Query: 133 EMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVN 175
           E AV G++IR G        ++  G  +   R LL    +M+N
Sbjct: 142 EAAVIGNIIRDGKVIEIAPNSKG-GPLQDVLRSLLEKDELMIN 183


>gi|257486772|ref|ZP_05640813.1| stress protein [Pseudomonas syringae pv. tabaci str. ATCC 11528]
          Length = 217

 Score = 42.7 bits (99), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 28/47 (59%), Gaps = 6/47 (12%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
           E W++DA    VC   C     LCF  DHIVPFS+GG +T  N Q+L
Sbjct: 165 EVWQRDAG---VCVE-CGTRQLLCF--DHIVPFSRGGSNTVRNLQLL 205


>gi|418077823|ref|ZP_12715047.1| HNH endonuclease family protein [Streptococcus pneumoniae 4027-06]
 gi|418116043|ref|ZP_12753020.1| HNH endonuclease family protein [Streptococcus pneumoniae 6963-05]
 gi|419439525|ref|ZP_13979582.1| HNH endonuclease family protein [Streptococcus pneumoniae GA40410]
 gi|419459153|ref|ZP_13999089.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02270]
 gi|419461430|ref|ZP_14001347.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02714]
 gi|353748845|gb|EHD29496.1| HNH endonuclease family protein [Streptococcus pneumoniae 4027-06]
 gi|353791681|gb|EHD72055.1| HNH endonuclease family protein [Streptococcus pneumoniae 6963-05]
 gi|379534025|gb|EHY99237.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02270]
 gi|379534955|gb|EHZ00162.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02714]
 gi|379582010|gb|EHZ46893.1| HNH endonuclease family protein [Streptococcus pneumoniae GA40410]
          Length = 356

 Score = 42.7 bits (99), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 22/39 (56%), Positives = 24/39 (61%)

Query: 69  HGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
              L  E DHIVP SKGG ST DN Q L  + NR KS+K
Sbjct: 316 QSLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNK 354


>gi|422850756|ref|ZP_16897426.1| HNH endonuclease domain protein [Streptococcus sanguinis SK150]
 gi|422857591|ref|ZP_16904241.1| HNH endonuclease domain protein [Streptococcus sanguinis SK1057]
 gi|325695504|gb|EGD37404.1| HNH endonuclease domain protein [Streptococcus sanguinis SK150]
 gi|327462774|gb|EGF09096.1| HNH endonuclease domain protein [Streptococcus sanguinis SK1057]
          Length = 329

 Score = 42.7 bits (99), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 24/39 (61%)

Query: 69  HGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
              L  E DHI+P SKGG ST DN Q L  + NR KS+K
Sbjct: 289 QSLLLLEIDHIIPISKGGLSTRDNLQTLCWKCNRSKSNK 327


>gi|168484153|ref|ZP_02709105.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           CDC1873-00]
 gi|168493974|ref|ZP_02718117.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           CDC3059-06]
 gi|169834462|ref|YP_001693806.1| HNH endonuclease domain-containing protein [Streptococcus
           pneumoniae Hungary19A-6]
 gi|417685762|ref|ZP_12335042.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41301]
 gi|417695447|ref|ZP_12344627.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47368]
 gi|418079981|ref|ZP_12717197.1| HNH endonuclease family protein [Streptococcus pneumoniae 6735-05]
 gi|418088689|ref|ZP_12725850.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43265]
 gi|418090912|ref|ZP_12728057.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44452]
 gi|418097696|ref|ZP_12734798.1| HNH endonuclease family protein [Streptococcus pneumoniae 6901-05]
 gi|418106621|ref|ZP_12743667.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41410]
 gi|418109254|ref|ZP_12746283.1| HNH endonuclease family protein [Streptococcus pneumoniae GA49447]
 gi|418113876|ref|ZP_12750869.1| HNH endonuclease family protein [Streptococcus pneumoniae 5787-06]
 gi|418134007|ref|ZP_12770867.1| HNH endonuclease family protein [Streptococcus pneumoniae GA11426]
 gi|418158923|ref|ZP_12795629.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17227]
 gi|418161331|ref|ZP_12798022.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17328]
 gi|418168382|ref|ZP_12805028.1| HNH endonuclease family protein [Streptococcus pneumoniae GA19077]
 gi|418218031|ref|ZP_12844699.1| HNH endonuclease family protein [Streptococcus pneumoniae NP127]
 gi|418220228|ref|ZP_12846885.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47751]
 gi|418237869|ref|ZP_12864426.1| HNH endonuclease family protein [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|419422272|ref|ZP_13962491.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43264]
 gi|419488280|ref|ZP_14028033.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44386]
 gi|419520261|ref|ZP_14059860.1| HNH endonuclease family protein [Streptococcus pneumoniae GA05245]
 gi|419525050|ref|ZP_14064616.1| HNH endonuclease family protein [Streptococcus pneumoniae GA14373]
 gi|421248645|ref|ZP_15705108.1| HNH endonuclease family protein [Streptococcus pneumoniae 2082239]
 gi|421271937|ref|ZP_15722784.1| HNH endonuclease family protein [Streptococcus pneumoniae SPAR55]
 gi|168996964|gb|ACA37576.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           Hungary19A-6]
 gi|172042557|gb|EDT50603.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           CDC1873-00]
 gi|183576210|gb|EDT96738.1| HNH endonuclease domain protein [Streptococcus pneumoniae
           CDC3059-06]
 gi|332077580|gb|EGI88041.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41301]
 gi|332203444|gb|EGJ17511.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47368]
 gi|353754300|gb|EHD34913.1| HNH endonuclease family protein [Streptococcus pneumoniae 6735-05]
 gi|353764430|gb|EHD44979.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43265]
 gi|353766285|gb|EHD46825.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44452]
 gi|353771852|gb|EHD52359.1| HNH endonuclease family protein [Streptococcus pneumoniae 6901-05]
 gi|353781535|gb|EHD61977.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41410]
 gi|353784547|gb|EHD64967.1| HNH endonuclease family protein [Streptococcus pneumoniae GA49447]
 gi|353788540|gb|EHD68937.1| HNH endonuclease family protein [Streptococcus pneumoniae 5787-06]
 gi|353826578|gb|EHE06736.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17227]
 gi|353830921|gb|EHE11050.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17328]
 gi|353836301|gb|EHE16389.1| HNH endonuclease family protein [Streptococcus pneumoniae GA19077]
 gi|353876228|gb|EHE56077.1| HNH endonuclease family protein [Streptococcus pneumoniae NP127]
 gi|353878025|gb|EHE57864.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47751]
 gi|353895002|gb|EHE74742.1| HNH endonuclease family protein [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|353903994|gb|EHE79508.1| HNH endonuclease family protein [Streptococcus pneumoniae GA11426]
 gi|379541885|gb|EHZ07050.1| HNH endonuclease family protein [Streptococcus pneumoniae GA05245]
 gi|379560754|gb|EHZ25776.1| HNH endonuclease family protein [Streptococcus pneumoniae GA14373]
 gi|379589603|gb|EHZ54442.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43264]
 gi|379590395|gb|EHZ55233.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44386]
 gi|395615274|gb|EJG75290.1| HNH endonuclease family protein [Streptococcus pneumoniae 2082239]
 gi|395877496|gb|EJG88565.1| HNH endonuclease family protein [Streptococcus pneumoniae SPAR55]
          Length = 326

 Score = 42.7 bits (99), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 22/39 (56%), Positives = 24/39 (61%)

Query: 69  HGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
              L  E DHIVP SKGG ST DN Q L  + NR KS+K
Sbjct: 286 QSLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNK 324


>gi|219847405|ref|YP_002461838.1| HNH endonuclease [Chloroflexus aggregans DSM 9485]
 gi|219541664|gb|ACL23402.1| HNH endonuclease [Chloroflexus aggregans DSM 9485]
          Length = 206

 Score = 42.4 bits (98), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGEST-ADNCQILQTRVNRFKSDK 107
           E W++D    ++C      H      YDHI+P+SKGG S  A+N Q+L  R N  K DK
Sbjct: 152 EVWKRDGGRCVICSATDELH------YDHIIPYSKGGTSLKAENIQLLCARHNLEKRDK 204


>gi|227356931|ref|ZP_03841307.1| hypothetical protein HMPREF0693_2198 [Proteus mirabilis ATCC 29906]
 gi|425072865|ref|ZP_18475971.1| hypothetical protein HMPREF1310_02304 [Proteus mirabilis WGLW4]
 gi|227162908|gb|EEI47860.1| hypothetical protein HMPREF0693_2198 [Proteus mirabilis ATCC 29906]
 gi|404596639|gb|EKA97159.1| hypothetical protein HMPREF1310_02304 [Proteus mirabilis WGLW4]
          Length = 98

 Score = 42.0 bits (97), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKE 108
           FE DHI+P  KGG+ T +NCQ+L    +R K+ +E
Sbjct: 61  FELDHIIPLFKGGKDTIENCQVLCIECHRKKTKEE 95


>gi|363891377|ref|ZP_09318556.1| hypothetical protein HMPREF9630_00141 [Eubacteriaceae bacterium
           CM2]
 gi|361965434|gb|EHL18416.1| hypothetical protein HMPREF9630_00141 [Eubacteriaceae bacterium
           CM2]
          Length = 759

 Score = 41.6 bits (96), Expect = 0.15,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 31/59 (52%), Gaps = 9/59 (15%)

Query: 54  KDAAGNIVCKRFCNCHGC-----LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           KDA GN  C     C G      + F+ DHI+  + GG+S ADN QIL  + N  K D+
Sbjct: 705 KDADGNYRCA----CCGISDKSRIYFQVDHIISMNNGGKSVADNLQILCRQCNGIKGDQ 759


>gi|307705062|ref|ZP_07641944.1| HNH endonuclease family protein [Streptococcus mitis SK597]
 gi|307621385|gb|EFO00440.1| HNH endonuclease family protein [Streptococcus mitis SK597]
          Length = 423

 Score = 41.2 bits (95), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 23/39 (58%)

Query: 69  HGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
              L  E DHI+P SKGG ST  N Q L  + NR KSDK
Sbjct: 383 QDLLLLEVDHIIPVSKGGMSTESNLQTLCWKCNRTKSDK 421


>gi|54023566|ref|YP_117808.1| hypothetical protein nfa15980 [Nocardia farcinica IFM 10152]
 gi|54015074|dbj|BAD56444.1| hypothetical protein [Nocardia farcinica IFM 10152]
          Length = 760

 Score = 41.2 bits (95), Expect = 0.19,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 6/58 (10%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E W++D        R   C      E+DHI+P S+GG ++A N QIL    NR K  +
Sbjct: 708 EVWQRDGG------RCVECGSGHYLEFDHIIPLSRGGATSAANLQILCRACNRRKGAR 759


>gi|422680858|ref|ZP_16739129.1| stress protein, partial [Pseudomonas syringae pv. tabaci str. ATCC
           11528]
 gi|331010203|gb|EGH90259.1| stress protein [Pseudomonas syringae pv. tabaci str. ATCC 11528]
          Length = 138

 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)

Query: 39  AKAETVP-GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
           A+ E +P     E W++DA   +     C     LCF  DHIVPFS+GG +T  N Q+L
Sbjct: 74  ARREAIPRSVQREVWQRDAGKCV----ECGTRQLLCF--DHIVPFSRGGSNTVRNLQLL 126


>gi|257486734|ref|ZP_05640775.1| HNH endonuclease [Pseudomonas syringae pv. tabaci str. ATCC 11528]
          Length = 217

 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)

Query: 39  AKAETVP-GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
           A+ E +P     E W++DA   + C         LCF  DHIVPFS+GG +T  N Q+L
Sbjct: 153 ARREAIPRSVQREVWQRDAGKCVEC----GTRQLLCF--DHIVPFSRGGSNTVRNLQLL 205


>gi|374295268|ref|YP_005045459.1| restriction endonuclease [Clostridium clariflavum DSM 19732]
 gi|359824762|gb|AEV67535.1| restriction endonuclease [Clostridium clariflavum DSM 19732]
          Length = 112

 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 24/34 (70%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           ++ DHI+P SKGG ++  N Q+L  + NR KSDK
Sbjct: 78  WQIDHIIPKSKGGTNSYSNAQVLSRKQNRLKSDK 111


>gi|441509730|ref|ZP_20991644.1| hypothetical protein GOACH_15_00980 [Gordonia aichiensis NBRC
           108223]
 gi|441446246|dbj|GAC49605.1| hypothetical protein GOACH_15_00980 [Gordonia aichiensis NBRC
           108223]
          Length = 334

 Score = 41.2 bits (95), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 31/53 (58%), Gaps = 5/53 (9%)

Query: 59  NIVCKRFCNCHGC----LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           N  C++ C+  G     L  E DHIVP S+GG ST +N Q L  + NR KS+K
Sbjct: 279 NYACRQ-CSISGAAEPHLLLEVDHIVPVSRGGLSTMENLQTLCWKCNRSKSNK 330


>gi|440742015|ref|ZP_20921345.1| TerF-like protein [Pseudomonas syringae BRIP39023]
 gi|440378101|gb|ELQ14731.1| TerF-like protein [Pseudomonas syringae BRIP39023]
          Length = 217

 Score = 41.2 bits (95), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)

Query: 39  AKAETVP-GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
           A+ E +P     E W++DA   + C         LCF  DHIVPFS+GG +T  N Q+L
Sbjct: 153 ARREAIPRSVQREVWQRDAGKCVEC----GTRQLLCF--DHIVPFSRGGSNTVRNLQLL 205


>gi|424859400|ref|ZP_18283414.1| hypothetical protein OPAG_06556 [Rhodococcus opacus PD630]
 gi|356661909|gb|EHI42220.1| hypothetical protein OPAG_06556 [Rhodococcus opacus PD630]
          Length = 325

 Score = 41.2 bits (95), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 35/77 (45%), Gaps = 20/77 (25%)

Query: 51  RWRKDAAGN--IVCKRFCNC------HGC------------LCFEYDHIVPFSKGGESTA 90
           +WRK AAG   ++  +  N       H C            L  E DHI+P SKGG +  
Sbjct: 244 KWRKSAAGQRALMTTKLRNMIKTRDNHTCRYCSISLSAEPHLLLEVDHIMPVSKGGLTAP 303

Query: 91  DNCQILQTRVNRFKSDK 107
            N Q L  R NR KS+K
Sbjct: 304 GNLQTLCWRCNRTKSNK 320


>gi|159491606|ref|XP_001703752.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158270494|gb|EDO96338.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 118

 Score = 40.8 bits (94), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 55  DAAGNIVCKRFCNCHGCLC-FEYDHIVPFSKGGESTADNCQILQTRVNRFKSD 106
           D  GN+V     N  G +C F  DHI P+S+GG +  DN   LQ R N  K +
Sbjct: 26  DPQGNVVALAAAN--GSVCGFHGDHIFPWSRGGLTVRDNLMALQWRANILKKN 76


>gi|427705667|ref|YP_007048044.1| HNH endonuclease [Nostoc sp. PCC 7107]
 gi|427358172|gb|AFY40894.1| HNH endonuclease [Nostoc sp. PCC 7107]
          Length = 99

 Score = 40.8 bits (94), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 29/67 (43%), Gaps = 1/67 (1%)

Query: 36  MCWAKAETVPGRHPERWRKDAAGNIVCK-RFCNCHGCLCFEYDHIVPFSKGGESTADNCQ 94
           + W K    P   P  WR D  G ++ +  + N      +E DHI+P S GG     N +
Sbjct: 30  IVWEKGFNNPNYDPRIWRIDQFGRLINRYEYGNRQSTYGWEIDHIIPKSWGGSDHISNLR 89

Query: 95  ILQTRVN 101
            L   +N
Sbjct: 90  PLHWYIN 96


>gi|350569051|ref|ZP_08937449.1| HNH endonuclease domain protein [Propionibacterium avidum ATCC
           25577]
 gi|348661294|gb|EGY77990.1| HNH endonuclease domain protein [Propionibacterium avidum ATCC
           25577]
          Length = 109

 Score = 40.8 bits (94), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQV-DATKLKGF 118
           L  E DHI+P S+GG S  DN Q L  R NR KS+K  V +  +L G 
Sbjct: 54  LLLEVDHILPVSEGGLSVDDNLQTLCWRCNRSKSNKLPVIEGHQLAGV 101


>gi|255070577|ref|XP_002507370.1| predicted protein [Micromonas sp. RCC299]
 gi|226522645|gb|ACO68628.1| predicted protein [Micromonas sp. RCC299]
          Length = 1022

 Score = 40.8 bits (94), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 9/60 (15%)

Query: 52  WRKDAAGNIVCK-----RFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSD 106
           +R D  GN+V K      F  C    C EYDH++P+S+GG S + N +++    N+ K D
Sbjct: 784 FRVDMYGNVVAKPGAAENFSVC----CTEYDHVLPWSRGGTSKSANIEVISWIANQRKLD 839


>gi|392942375|ref|ZP_10308017.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
           sp. QA3]
 gi|392285669|gb|EIV91693.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
           sp. QA3]
          Length = 607

 Score = 40.8 bits (94), Expect = 0.29,   Method: Composition-based stats.
 Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 6/58 (10%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E W++D    +       C      E+DH++P S+GG ++ +N Q+L  R N+ K  +
Sbjct: 555 EVWQRDGGACV------QCQATEYLEFDHVIPHSRGGATSVNNLQLLCRRCNQEKGAR 606


>gi|407643249|ref|YP_006807008.1| hypothetical protein O3I_010365 [Nocardia brasiliensis ATCC 700358]
 gi|407306133|gb|AFU00034.1| hypothetical protein O3I_010365 [Nocardia brasiliensis ATCC 700358]
          Length = 618

 Score = 40.8 bits (94), Expect = 0.30,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 6/58 (10%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E W++D    + C    + H     E+DHI+P S+GG ++A N QIL    NR K  +
Sbjct: 566 EVWQRDGGKCVECG---DSH---YLEFDHIIPLSRGGATSAANLQILCRSCNRAKGAR 617


>gi|288802822|ref|ZP_06408259.1| conserved hypothetical protein [Prevotella melaninogenica D18]
 gi|288334639|gb|EFC73077.1| conserved hypothetical protein [Prevotella melaninogenica D18]
          Length = 119

 Score = 40.4 bits (93), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 30  DGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCK-RFCNCHGCLCFEYDHIVPFSKGGES 88
           D + K  CW  A+ V G   +  RKDA G  + K  +        +E DH+ P   GG+ 
Sbjct: 3   DEEKKEYCWNHAQIVEGYDKDSIRKDACGAWIFKAHYGMRDSVFGWEVDHVFPVILGGDD 62

Query: 89  TADNCQILQTRVNRFKSD 106
              N + +Q + N  K D
Sbjct: 63  FKRNLRAMQWKNNVSKGD 80


>gi|423209364|ref|ZP_17195918.1| hypothetical protein HMPREF1169_01436 [Aeromonas veronii AER397]
 gi|404617222|gb|EKB14158.1| hypothetical protein HMPREF1169_01436 [Aeromonas veronii AER397]
          Length = 218

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 39/82 (47%), Gaps = 8/82 (9%)

Query: 34  KSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE--------YDHIVPFSKG 85
           +S+      ++P   P + R++A    V +      G  C E        +DHIVPFSKG
Sbjct: 136 ESLIRVDISSMPIPKPVKPRREAIPRAVQREVWQRDGGKCVECSSNAKLCFDHIVPFSKG 195

Query: 86  GESTADNCQILQTRVNRFKSDK 107
           G ++  N Q+L  R N  K ++
Sbjct: 196 GSNSVRNLQLLCERCNLSKGNR 217


>gi|308535332|ref|YP_003933733.1| HNH endonuclease [Geobacter bemidjiensis Bem]
 gi|308052677|gb|ADO00821.1| HNH endonuclease [Geobacter bemidjiensis Bem]
          Length = 161

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 22/34 (64%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
            E+DHI+P SKGG  T  N Q+L  + NR K DK
Sbjct: 127 LEFDHIIPHSKGGSDTERNIQLLCMKCNRTKYDK 160


>gi|260655767|ref|ZP_05861236.1| putative HNH endonuclease domain protein [Jonquetella anthropi
           E3_33 E1]
 gi|260629383|gb|EEX47577.1| putative HNH endonuclease domain protein [Jonquetella anthropi
           E3_33 E1]
          Length = 384

 Score = 40.0 bits (92), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 66  CNCHGCLC-FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           CN H  +   E DHI P+SKGG + A+NCQ+L    NR KS+K
Sbjct: 342 CNKHFEINEMEADHITPWSKGGRTIAENCQMLCKTCNRKKSNK 384


>gi|256824147|ref|YP_003148107.1| HNH endonuclease [Kytococcus sedentarius DSM 20547]
 gi|256687540|gb|ACV05342.1| HNH endonuclease [Kytococcus sedentarius DSM 20547]
          Length = 169

 Score = 40.0 bits (92), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 33/70 (47%), Gaps = 2/70 (2%)

Query: 51  RWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQV 110
           R R   A +  C  F       C E DHIVPF  GG +TADN Q L  R +  K+ K Q 
Sbjct: 82  RARDGVARDPGCGSFGATEAERC-ELDHIVPFDVGGTTTADNLQALSRRGHTRKT-KRQW 139

Query: 111 DATKLKGFSC 120
           + T   G S 
Sbjct: 140 EVTGDAGGSL 149


>gi|419855738|ref|ZP_14378488.1| HNH endonuclease domain protein [Bifidobacterium longum subsp.
           longum 44B]
 gi|386414961|gb|EIJ29502.1| HNH endonuclease domain protein [Bifidobacterium longum subsp.
           longum 44B]
          Length = 117

 Score = 40.0 bits (92), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 41/103 (39%), Gaps = 20/103 (19%)

Query: 47  RHPERWRKDAAGNIVCKRFCNCHGC-------------LCFEYDHIVPFSKGGESTA-DN 92
           RH   WR+      V   +  CH C                E D I+P SKGG      N
Sbjct: 9   RHANGWRRQQVAARVLAAYDTCHLCGRSVDKSLPPGLPGSPEVDEIIPVSKGGSPYLFSN 68

Query: 93  CQILQTRVNRFKSD------KEQVDATKLKGFSCDVKFTDKEL 129
           C++     NR +S+      +E +  T  +G++ D+K T   L
Sbjct: 69  CRLAHRWCNRIRSNHSVAWAREHIKQTFEQGYTADLKATSMPL 111


>gi|429195043|ref|ZP_19187098.1| glycosyl hydrolase, family 15 [Streptomyces ipomoeae 91-03]
 gi|428669241|gb|EKX68209.1| glycosyl hydrolase, family 15 [Streptomyces ipomoeae 91-03]
          Length = 620

 Score = 39.7 bits (91), Expect = 0.52,   Method: Composition-based stats.
 Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 17/98 (17%)

Query: 25  RPRYFDGKAKSMCW---------AKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
           +PR+F  ++K MCW         A+ + +PGRH  RWR  A   I  + F   H   CF 
Sbjct: 423 QPRHFT-QSKMMCWVALDRAADLAEHDLIPGRHLARWRT-ARDQI--RAFVETH---CFS 475

Query: 76  YDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDAT 113
            D +  + +  +S   +  +L   +  ++ D  ++DAT
Sbjct: 476 -DSLDSYVRSADSPDLDAAVLLGHLYGYRGDHRRMDAT 512


>gi|347535197|ref|YP_004842622.1| hypothetical protein FBFL15_0247 [Flavobacterium branchiophilum
           FL-15]
 gi|345528355|emb|CCB68385.1| Protein of unknown function [Flavobacterium branchiophilum FL-15]
          Length = 366

 Score = 39.7 bits (91), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%)

Query: 61  VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           +C + C        E DHI P+ +GG++TA+NCQ+L    NR KS K
Sbjct: 320 ICVKCCEQFELNEMEADHITPWHEGGKTTAENCQMLCKLDNRIKSGK 366


>gi|394775392|gb|AFN37787.1| gp99 [Mycobacteriophage MacnCheese]
          Length = 95

 Score = 39.7 bits (91), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 21/35 (60%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSD 106
           L F+ DHI P ++GG  T DNC     + NR KSD
Sbjct: 41  LSFQIDHITPLARGGTDTLDNCAAAHRKCNRDKSD 75


>gi|379707873|ref|YP_005263078.1| hypothetical protein NOCYR_1649 [Nocardia cyriacigeorgica GUH-2]
 gi|374845372|emb|CCF62438.1| conserved protein of unknown function; putative endonuclease domain
           [Nocardia cyriacigeorgica GUH-2]
          Length = 549

 Score = 39.7 bits (91), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 6/58 (10%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E W++D        R   C      E+DHI+P S+GG ++A N QIL    NR K  +
Sbjct: 497 EVWQRDGG------RCVECGDGHYLEFDHIIPLSRGGATSATNLQILCRACNRAKGAR 548


>gi|257439996|ref|ZP_05615751.1| putative HNH endonuclease domain protein [Faecalibacterium
           prausnitzii A2-165]
 gi|257197603|gb|EEU95887.1| HNH endonuclease domain protein [Faecalibacterium prausnitzii
           A2-165]
          Length = 367

 Score = 39.7 bits (91), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 24/30 (80%)

Query: 77  DHIVPFSKGGESTADNCQILQTRVNRFKSD 106
           DHI P+SKGG++TA+NCQ+L    NR KS+
Sbjct: 337 DHITPWSKGGKTTAENCQMLCADCNRRKSN 366


>gi|339784200|gb|AEK09909.1| gp100 [Mycobacterium phage Pixie]
          Length = 95

 Score = 39.7 bits (91), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 31/67 (46%), Gaps = 13/67 (19%)

Query: 53  RKDAAGNIVCKRFCNCHGC-------------LCFEYDHIVPFSKGGESTADNCQILQTR 99
           R+D    ++ +R  +CH C             L F+ DHI P ++GG  T DN      +
Sbjct: 9   RRDRFRRVIKRRGDDCHLCGLPIDYDLPHDHELSFQIDHITPLARGGTDTLDNIAAAHRK 68

Query: 100 VNRFKSD 106
            NR KSD
Sbjct: 69  CNRDKSD 75


>gi|303236487|ref|ZP_07323074.1| HNH endonuclease domain protein [Prevotella disiens FB035-09AN]
 gi|302483338|gb|EFL46346.1| HNH endonuclease domain protein [Prevotella disiens FB035-09AN]
          Length = 367

 Score = 39.7 bits (91), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 24/33 (72%)

Query: 75  EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E DHI+P+ KGG++  DNCQ+L  + NR KS K
Sbjct: 335 EADHIIPWHKGGKTEKDNCQMLCMKCNRTKSGK 367


>gi|365173256|ref|ZP_09361377.1| hypothetical protein HMPREF1006_02508 [Synergistes sp. 3_1_syn1]
 gi|363616949|gb|EHL68368.1| hypothetical protein HMPREF1006_02508 [Synergistes sp. 3_1_syn1]
          Length = 374

 Score = 39.7 bits (91), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 26/39 (66%), Gaps = 5/39 (12%)

Query: 74  FEYD-----HIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           FEYD     HI P+SKGG++T DNCQ+L    N  K+DK
Sbjct: 335 FEYDEMEGDHIKPWSKGGQTTPDNCQMLCKSCNAKKTDK 373


>gi|126657420|ref|ZP_01728579.1| hypothetical protein CY0110_00635 [Cyanothece sp. CCY0110]
 gi|126621407|gb|EAZ92119.1| hypothetical protein CY0110_00635 [Cyanothece sp. CCY0110]
          Length = 636

 Score = 39.3 bits (90), Expect = 0.68,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 9/78 (11%)

Query: 46  GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
           G+HPE   K +   ++ K+   C+ C L F      E DHI+P   GG++  DN Q L  
Sbjct: 497 GKHPEL--KSSIAKLLKKQKGKCNYCGLTFMPGDRIETDHIIPRKMGGDNLKDNLQALHI 554

Query: 99  RVNRFKSDKEQVDATKLK 116
             +  KS K+  D T  K
Sbjct: 555 HCHDIKSKKDIKDITSYK 572


>gi|168059079|ref|XP_001781532.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667011|gb|EDQ53651.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 189

 Score = 39.3 bits (90), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 46  GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKS 105
           G  P ++R D+ GN+V     +    L +E DH  P S+GG++   N ++LQ + ++ K 
Sbjct: 97  GFDPAKFRVDSYGNVVYWN-ADPSSPLAWEIDHWFPHSRGGKTVVSNLRVLQWQASQRKK 155

Query: 106 DK 107
            +
Sbjct: 156 SR 157


>gi|409098020|ref|ZP_11218044.1| hypothetical protein PagrP_06221 [Pedobacter agri PB92]
          Length = 296

 Score = 39.3 bits (90), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKS 105
            E+DHI+PFSKGG +T  N Q+L    NR KS
Sbjct: 261 LEFDHIIPFSKGGSNTYRNIQLLCEGCNRTKS 292


>gi|354464472|gb|AER26385.1| gp94 [Mycobacterium phage Fionnbharth]
          Length = 101

 Score = 39.3 bits (90), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 20/39 (51%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQV 110
           L F  DHI P + GGE T DN      + NR KSDK   
Sbjct: 48  LAFTIDHITPLALGGEDTLDNLGPAHRKCNRAKSDKPPT 86


>gi|345867540|ref|ZP_08819550.1| HNH endonuclease family protein [Bizionia argentinensis JUB59]
 gi|344048036|gb|EGV43650.1| HNH endonuclease family protein [Bizionia argentinensis JUB59]
          Length = 189

 Score = 39.3 bits (90), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 30/67 (44%)

Query: 49  PERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKE 108
           P+  R  A  N   K          +E DHIVP S+GG     N Q+L    ++ K+ +E
Sbjct: 45  PKVVRSTANKNAFLKSKGYTSTPYGYEIDHIVPLSEGGSDDPSNMQLLTVNQHKAKTARE 104

Query: 109 QVDATKL 115
           +    KL
Sbjct: 105 RTKRAKL 111


>gi|410097764|ref|ZP_11292745.1| hypothetical protein HMPREF1076_01923 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|410101011|ref|ZP_11295965.1| hypothetical protein HMPREF1076_05143 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|410101180|ref|ZP_11296130.1| hypothetical protein HMPREF1076_05308 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409213077|gb|EKN06106.1| hypothetical protein HMPREF1076_05308 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409213657|gb|EKN06673.1| hypothetical protein HMPREF1076_05143 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409223854|gb|EKN16789.1| hypothetical protein HMPREF1076_01923 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 83

 Score = 39.3 bits (90), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 35/80 (43%)

Query: 28  YFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGE 87
           YF  +     W K   V G+ P  +RKD  GN++          L +  DH  P +KGG 
Sbjct: 3   YFSLETLLRIWQKGLIVRGKDPNLYRKDCYGNLMYWYSYGKFTKLGWNVDHSKPIAKGGT 62

Query: 88  STADNCQILQTRVNRFKSDK 107
           +  +N Q +    N  K+ K
Sbjct: 63  NHLNNLQPMNCFANSSKNAK 82


>gi|441213611|ref|ZP_20975857.1| hypothetical protein D806_5030 [Mycobacterium smegmatis MKD8]
 gi|440625575|gb|ELQ87421.1| hypothetical protein D806_5030 [Mycobacterium smegmatis MKD8]
          Length = 54

 Score = 39.3 bits (90), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 7/59 (11%)

Query: 52  WRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSDKEQ 109
           W++D    + C    N H      +DHI+P+S+GG S  A+N Q++  R N  K DK Q
Sbjct: 2   WKRDQGRCVECGSTDNLH------FDHIIPYSRGGSSLVAENIQLMCARHNLAKHDKIQ 54


>gi|359427003|ref|ZP_09218078.1| hypothetical protein GOAMR_71_00130 [Gordonia amarae NBRC 15530]
 gi|358237616|dbj|GAB07660.1| hypothetical protein GOAMR_71_00130 [Gordonia amarae NBRC 15530]
          Length = 548

 Score = 38.9 bits (89), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 64  RFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
           R   C      EYDHI+P+S+GG ++ +N QIL
Sbjct: 504 RCVECGSTSYLEYDHIIPWSRGGATSVENLQIL 536


>gi|220919209|ref|YP_002494513.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-1]
 gi|219957063|gb|ACL67447.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-1]
          Length = 386

 Score = 38.9 bits (89), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 9/52 (17%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLC-----FEYDHIVPFSKGGESTADNCQIL 96
           E WR+D      C ++   +G LC      + DH+VP ++GGESTA N ++L
Sbjct: 287 EVWRRDGG----CCQWRLANGELCGSTHALQLDHVVPRARGGESTAANLRVL 334


>gi|154500496|ref|ZP_02038534.1| hypothetical protein BACCAP_04168 [Bacteroides capillosus ATCC
           29799]
 gi|150270727|gb|EDM98023.1| HNH endonuclease domain protein [Pseudoflavonifractor capillosus
           ATCC 29799]
          Length = 750

 Score = 38.9 bits (89), Expect = 0.99,   Method: Composition-based stats.
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 5/47 (10%)

Query: 66  CNCHGC-----LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           C C G      + F+ DHI+P SKGG+S  +N QIL    N  K D+
Sbjct: 704 CACCGMEAASRIPFQVDHILPMSKGGKSVPENLQILCRSCNGRKGDQ 750


>gi|256824130|ref|YP_003148090.1| HNH endonuclease [Kytococcus sedentarius DSM 20547]
 gi|256687523|gb|ACV05325.1| HNH endonuclease [Kytococcus sedentarius DSM 20547]
          Length = 198

 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 70  GCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDAT 113
           G    E DH+VPF +GG++ A+N Q L TR +  K+ K Q DA+
Sbjct: 128 GAASCEIDHVVPFDEGGQTVAENLQCLSTRGHTRKT-KGQWDAS 170


>gi|443474446|ref|ZP_21064422.1| helicase domain protein [Pseudanabaena biceps PCC 7429]
 gi|443020805|gb|ELS34721.1| helicase domain protein [Pseudanabaena biceps PCC 7429]
          Length = 865

 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 6/46 (13%)

Query: 69  HGCLC------FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKE 108
           + CLC       + DH+ PF  GG+++ DN Q L  R N FK   E
Sbjct: 712 NSCLCCGDHRRLQVDHVFPFILGGQTSIDNSQTLCMRCNGFKGKNE 757


>gi|406913193|gb|EKD52650.1| HNH endonuclease [uncultured bacterium]
          Length = 229

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSDK 107
           E W++D    ++C    N H      +DH +PFSKGG S TA N +IL  + N  KS K
Sbjct: 175 EVWKRDKGACVICGSNKNLH------FDHELPFSKGGTSLTAKNIRILCLKHNLSKSAK 227


>gi|395519910|ref|XP_003764084.1| PREDICTED: E3 ubiquitin-protein ligase ZSWIM2 [Sarcophilus
           harrisii]
          Length = 643

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/145 (26%), Positives = 54/145 (37%), Gaps = 26/145 (17%)

Query: 62  CKRFCNCHGCL--CFEYDHIVPFS-------KGGESTADNCQILQTRVNRFKSDKEQVDA 112
           C  +  CH C   CF   HI  F        K  E+  + C   +   N   +++ + + 
Sbjct: 259 CTEYHLCHECFVSCFHPPHIFTFRQKRNQRWKSLENNPELCPAEKIIKNTEFTNQTEEEM 318

Query: 113 TKL-KGFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQF------------ 159
           T L K  SC  K     L +I +  Y  ++ PGNQCR       LGQ             
Sbjct: 319 THLEKSDSCTPKHVVTSLPLILITKYSKLLAPGNQCRLCLKPFCLGQHTRLLSCNHKFHR 378

Query: 160 KSKDRWLLANCHMMVNHCSTRGSCI 184
           K  D WLL     + N C   G  +
Sbjct: 379 KCIDDWLLN----ISNSCPIDGQTV 399


>gi|386772121|ref|ZP_10094499.1| HNH endonuclease [Brachybacterium paraconglomeratum LC44]
          Length = 526

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 4/40 (10%)

Query: 61  VCKRFCNCHGCLC----FEYDHIVPFSKGGESTADNCQIL 96
           + +R   C G  C     +YDHIVP S+GG ++ DN Q L
Sbjct: 394 LSRRDTTCRGPFCNAQIRQYDHIVPVSRGGPTSLDNGQGL 433


>gi|421735209|ref|ZP_16174207.1| HNH endonuclease [Bifidobacterium bifidum LMG 13195]
 gi|407076867|gb|EKE49775.1| HNH endonuclease [Bifidobacterium bifidum LMG 13195]
          Length = 379

 Score = 38.5 bits (88), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 24/30 (80%)

Query: 77  DHIVPFSKGGESTADNCQILQTRVNRFKSD 106
           DHIVP+S+GG++T DNCQ+L T  N  KS+
Sbjct: 349 DHIVPWSEGGKTTLDNCQMLCTTCNIKKSN 378


>gi|187251265|ref|YP_001875747.1| restriction endonuclease [Elusimicrobium minutum Pei191]
 gi|186971425|gb|ACC98410.1| Restriction endonuclease [Elusimicrobium minutum Pei191]
          Length = 230

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 33/66 (50%), Gaps = 9/66 (13%)

Query: 43  TVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVN 101
           T+  RH   W +D    + C    N H      +DH +PFSKGG S TA N Q+L  + N
Sbjct: 171 TIVKRHV--WERDKGQCVQCGNKTNLH------FDHNIPFSKGGSSITAKNIQLLCAKCN 222

Query: 102 RFKSDK 107
             K DK
Sbjct: 223 LSKHDK 228


>gi|223933572|ref|ZP_03625553.1| HNH endonuclease [Streptococcus suis 89/1591]
 gi|223897768|gb|EEF64148.1| HNH endonuclease [Streptococcus suis 89/1591]
          Length = 115

 Score = 38.1 bits (87), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 72  LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATK 114
           +C   DH+VP +KGG  S  +N Q+     NR KSDK   D +K
Sbjct: 44  MCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSK 87


>gi|451937196|gb|AGF87501.1| putative HNH endonuclease [Streptococcus phage phi7917]
 gi|451937259|gb|AGF87563.1| putative HNH endonuclease [Streptococcus phage phi891591]
          Length = 147

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 72  LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATK 114
           +C   DH+VP +KGG  S  +N Q+     NR KSDK   D +K
Sbjct: 76  MCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSK 119


>gi|76788004|ref|YP_329262.1| prophage LambdaSa03, HNH endonuclease [Streptococcus agalactiae
           A909]
 gi|76563061|gb|ABA45645.1| prophage LambdaSa03, HNH endonuclease family protein [Streptococcus
           agalactiae A909]
          Length = 124

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 72  LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATKLKG 117
           L    DHIVP +KGG  S+ DN Q+   + NR KSDK  ++ T ++ 
Sbjct: 57  LSAAIDHIVPIAKGGHPSSIDNLQLTHWQCNRQKSDKLFINQTAVRA 103


>gi|126660307|ref|ZP_01731421.1| hypothetical protein CY0110_31740 [Cyanothece sp. CCY0110]
 gi|126618413|gb|EAZ89168.1| hypothetical protein CY0110_31740 [Cyanothece sp. CCY0110]
          Length = 636

 Score = 38.1 bits (87), Expect = 1.7,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 36/78 (46%), Gaps = 9/78 (11%)

Query: 46  GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
           G+HPE   K +   ++ K+   C+ C L F      E DHI P   GG +  DN Q+L  
Sbjct: 497 GKHPEL--KKSVATLLKKQKGKCNHCGLTFMLGDLIETDHITPLKAGGNNHKDNLQVLHI 554

Query: 99  RVNRFKSDKEQVDATKLK 116
             +  K+ K+  D    K
Sbjct: 555 HCHDIKTRKDLRDIKSYK 572


>gi|451937008|gb|AGF87316.1| putative HNH endonuclease [Streptococcus phage phi20c]
          Length = 115

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 72  LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATK 114
           +C   DH+VP +KGG  S  +N Q+     NR KSDK   D +K
Sbjct: 44  MCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSK 87


>gi|302038913|ref|YP_003799235.1| hypothetical protein NIDE3631 [Candidatus Nitrospira defluvii]
 gi|300606977|emb|CBK43310.1| conserved protein of unknown function, putative HNH endonuclease
           [Candidatus Nitrospira defluvii]
          Length = 249

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSDK 107
           E W++D    + C    N H      YDH +P+SKGG S +A N +IL  + N  KSDK
Sbjct: 175 EVWKRDQGKCVQCGFQKNLH------YDHDIPYSKGGSSLSAQNVRILCAKCNLEKSDK 227


>gi|330832281|ref|YP_004401106.1| HNH endonuclease [Streptococcus suis ST3]
 gi|329306504|gb|AEB80920.1| HNH endonuclease [Streptococcus suis ST3]
          Length = 115

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 72  LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATK 114
           +C   DH+VP +KGG  S  +N Q+     NR KSDK   D +K
Sbjct: 44  MCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSK 87


>gi|238926542|ref|ZP_04658302.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
 gi|238885488|gb|EEQ49126.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
          Length = 363

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 23/30 (76%)

Query: 77  DHIVPFSKGGESTADNCQILQTRVNRFKSD 106
           DHI P+SKGG++T +NCQ+L    NR KS+
Sbjct: 333 DHITPWSKGGKTTPENCQMLCAPCNRRKSN 362


>gi|401564832|ref|ZP_10805697.1| type III restriction enzyme, res subunit [Selenomonas sp. FOBRC6]
 gi|400188432|gb|EJO22596.1| type III restriction enzyme, res subunit [Selenomonas sp. FOBRC6]
          Length = 754

 Score = 38.1 bits (87), Expect = 1.9,   Method: Composition-based stats.
 Identities = 20/47 (42%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 63  KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQ 109
           K F   HG    E DHI P + GG+S  DN Q+L    N  K DK +
Sbjct: 710 KVFSGRHG---LEVDHIKPMNDGGQSVLDNLQVLCKSCNASKGDKAE 753


>gi|86159768|ref|YP_466553.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-C]
 gi|85776279|gb|ABC83116.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-C]
          Length = 368

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 24/47 (51%), Gaps = 1/47 (2%)

Query: 50  ERWRKDAAG-NIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQI 95
           E W +D      V      C      E DHIVP ++GG+STADN +I
Sbjct: 277 EVWARDGGRCTFVLASGERCESAHRLELDHIVPRARGGDSTADNLRI 323


>gi|425736696|ref|ZP_18854994.1| hypothetical protein C272_16183 [Brevibacterium casei S18]
 gi|425477862|gb|EKU45079.1| hypothetical protein C272_16183 [Brevibacterium casei S18]
          Length = 331

 Score = 37.7 bits (86), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 22/36 (61%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           L  E DHI P SKGG S  +N Q L  R NR KS+K
Sbjct: 295 LLLEVDHIKPISKGGLSEIENLQTLCWRCNRSKSNK 330


>gi|119492356|ref|ZP_01623692.1| RNA-directed DNA polymerase [Lyngbya sp. PCC 8106]
 gi|119453136|gb|EAW34304.1| RNA-directed DNA polymerase [Lyngbya sp. PCC 8106]
          Length = 511

 Score = 37.7 bits (86), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 45/88 (51%), Gaps = 12/88 (13%)

Query: 46  GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
           G+HPE  ++ A+  ++ K+   C  C L F      E DHI+P SKGG +  +N Q+L  
Sbjct: 418 GKHPEMLKRTAS--LLKKQNGKCPKCGLSFKDGDLIELDHIIPKSKGGRNEYNNYQLLHR 475

Query: 99  RVNRFKSDKEQVDATKLKGFSCDVKFTD 126
             +    +K ++D +  KG    +K  +
Sbjct: 476 HCH---DEKTRIDGSLNKGSISQIKIPE 500


>gi|347525200|ref|YP_004831948.1| HNH endonuclease [Lactobacillus ruminis ATCC 27782]
 gi|345284159|gb|AEN78012.1| HNH endonuclease [Lactobacillus ruminis ATCC 27782]
          Length = 115

 Score = 37.7 bits (86), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 77  DHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATKLK 116
           DHIVP SKGG  S+ DN Q+     NR KSDK   D  K K
Sbjct: 53  DHIVPISKGGHPSSIDNLQLAHWTCNRQKSDKLYADGFKKK 93


>gi|158318135|ref|YP_001510643.1| stress protein [Frankia sp. EAN1pec]
 gi|158113540|gb|ABW15737.1| stress protein [Frankia sp. EAN1pec]
          Length = 560

 Score = 37.7 bits (86), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 6/58 (10%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E W++D    +       C      E+DH++P S+GG S+ +N Q+L  R N  K  +
Sbjct: 508 EVWQRDGGACV------QCQATEYLEFDHVIPHSRGGASSVNNLQLLCRRCNLEKGAR 559


>gi|428779791|ref|YP_007171577.1| Retron-type reverse transcriptase [Dactylococcopsis salina PCC
           8305]
 gi|428694070|gb|AFZ50220.1| Retron-type reverse transcriptase [Dactylococcopsis salina PCC
           8305]
          Length = 594

 Score = 37.7 bits (86), Expect = 2.1,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 9/59 (15%)

Query: 46  GRHPERWRKDAAGNIVCKRFCNCHGCLCF-------EYDHIVPFSKGGESTADNCQILQ 97
           G+HPE  ++ A+  ++ K+   C  C  F       E DHIVP+SKGG+    N Q+L 
Sbjct: 503 GKHPEMPKRTAS--LLKKQKGKCAHCGLFFRDGDLLEVDHIVPYSKGGKDEYKNWQLLH 559


>gi|342239735|gb|AEL19743.1| gp98 [Mycobacterium phage Larva]
          Length = 101

 Score = 37.7 bits (86), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 27/54 (50%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
           + F+ DHI P S+GG  T DN      + NR K DK + + +    F    ++T
Sbjct: 47  MSFQIDHITPLSRGGTDTLDNVAACHRKCNRDKGDKLEAELSGAVTFITSREWT 100


>gi|15807725|ref|NP_285380.1| TerF-like protein [Deinococcus radiodurans R1]
 gi|6460551|gb|AAF12257.1|AE001862_83 TerF-related protein [Deinococcus radiodurans R1]
          Length = 579

 Score = 37.7 bits (86), Expect = 2.1,   Method: Composition-based stats.
 Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 6/58 (10%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E W++D    +       C      E+DH++P S GG S+  N Q+L  R N  KS++
Sbjct: 527 EVWQRDQGQCV------ECGDTNYLEFDHVIPHSLGGASSVGNLQLLCRRCNLAKSNR 578


>gi|159897861|ref|YP_001544108.1| HNH endonuclease [Herpetosiphon aurantiacus DSM 785]
 gi|159890900|gb|ABX03980.1| HNH endonuclease [Herpetosiphon aurantiacus DSM 785]
          Length = 321

 Score = 37.7 bits (86), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
            E+DHI+P SKGG +T  N Q+L  + N  K D+
Sbjct: 287 LEFDHIIPHSKGGANTVGNVQLLCRKCNLAKGDR 320


>gi|303245412|ref|ZP_07331696.1| histidine triad (HIT) protein [Desulfovibrio fructosovorans JJ]
 gi|302493261|gb|EFL53123.1| histidine triad (HIT) protein [Desulfovibrio fructosovorans JJ]
          Length = 309

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 25/42 (59%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKL 115
            + DHI P SKGG++   N Q+L +R NR K +KE  D   L
Sbjct: 138 LDVDHIKPRSKGGKNEYANLQVLCSRCNRAKGNKEDTDYRPL 179


>gi|407463091|ref|YP_006774408.1| HNH endonuclease [Candidatus Nitrosopumilus koreensis AR1]
 gi|407046713|gb|AFS81466.1| HNH endonuclease [Candidatus Nitrosopumilus koreensis AR1]
          Length = 228

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 31/58 (53%), Gaps = 7/58 (12%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSD 106
           E W++D    + C    N H      +DH++P SKGG S TA+N QIL  + N  K D
Sbjct: 174 EVWKRDEGKCVKCGSKENLH------FDHVIPHSKGGTSITAENIQILCAKHNLEKRD 225


>gi|171914460|ref|ZP_02929930.1| HNH endonuclease [Verrucomicrobium spinosum DSM 4136]
          Length = 193

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 16/81 (19%)

Query: 36  MCWAKAETVPGRHPER-----WRKDAAGNIVC----KRFCNCHGCLCFEYDHIVPFSKGG 86
           +  A+ + VP R P+      W +D     VC    ++     G +    DHI+P S+GG
Sbjct: 81  LVLARYDKVPKRRPKLSSRGIWDRDGG---VCQYTGRKLTRDEGNI----DHIMPRSRGG 133

Query: 87  ESTADNCQILQTRVNRFKSDK 107
           +++ DNC +   R+N  K+D+
Sbjct: 134 KTSWDNCVLADKRINSRKADR 154


>gi|402829778|ref|ZP_10878651.1| HNH endonuclease domain protein [Slackia sp. CM382]
 gi|402283445|gb|EJU31959.1| HNH endonuclease domain protein [Slackia sp. CM382]
          Length = 322

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 21/36 (58%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           L  E DHIVP SKGG +  DN Q L  + NR K  K
Sbjct: 285 LLLEVDHIVPVSKGGMTKEDNLQTLCWKCNRSKGAK 320


>gi|358455199|ref|ZP_09165427.1| Protein of unknown function DUF2510 [Frankia sp. CN3]
 gi|357081452|gb|EHI90883.1| Protein of unknown function DUF2510 [Frankia sp. CN3]
          Length = 545

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 66  CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
             CH     E+DH++P S GG ++ +N Q+L  R N  K  +
Sbjct: 503 AQCHATEYLEFDHVIPHSLGGATSVNNLQLLCRRCNLEKGAR 544


>gi|428169182|gb|EKX38118.1| hypothetical protein GUITHDRAFT_144431 [Guillardia theta CCMP2712]
          Length = 320

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 3/65 (4%)

Query: 44  VPGRHPERWRKDAAGNIVCKRFCNCHGCLCF-EYDHIVPFSKGGESTADNCQILQTRVNR 102
           +P    + ++ D  GNI+        G +C+ E DH+ P S+GG ++  N  ++Q   NR
Sbjct: 165 IPSWFHKLFKVDMWGNIIS--VDGLQGAMCYAEEDHLFPSSRGGRTSLANLVLIQFVANR 222

Query: 103 FKSDK 107
            K +K
Sbjct: 223 KKKEK 227


>gi|440783576|ref|ZP_20961189.1| putative wall-associated protein [Clostridium pasteurianum DSM 525]
 gi|440219319|gb|ELP58532.1| putative wall-associated protein [Clostridium pasteurianum DSM 525]
          Length = 356

 Score = 37.4 bits (85), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           ++ DHI+P SKGG +  DN +++   +NR K DK
Sbjct: 323 WQIDHIIPKSKGGTNGYDNARVISRHLNRLKWDK 356


>gi|434394103|ref|YP_007129050.1| HNH endonuclease [Gloeocapsa sp. PCC 7428]
 gi|428265944|gb|AFZ31890.1| HNH endonuclease [Gloeocapsa sp. PCC 7428]
          Length = 184

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 6/64 (9%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQ 109
           E  R+D   N  C+ +C  H  L    DH++P S+GG  T DN      R N+ KSD+  
Sbjct: 83  EVLRRD---NYTCQ-YCGSHKHLTL--DHVIPVSRGGLHTWDNVVTACERCNQRKSDRTP 136

Query: 110 VDAT 113
           ++A+
Sbjct: 137 IEAS 140


>gi|359424187|ref|ZP_09215309.1| hypothetical protein GOAMR_24_00140 [Gordonia amarae NBRC 15530]
 gi|358240461|dbj|GAB04891.1| hypothetical protein GOAMR_24_00140 [Gordonia amarae NBRC 15530]
          Length = 379

 Score = 37.4 bits (85), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 62  CKRFCNCHGCLCFEY-----DHIVPFSKGGESTADNCQILQTRVNRFKSD 106
           CK   N  G   F+      DHIVP+SKGG++ A+N Q+L    NR K D
Sbjct: 329 CKTVGNDDGQQVFDISQMDADHIVPWSKGGKTEAENGQMLCIGCNRSKGD 378


>gi|302669231|ref|YP_003832381.1| HNH endonuclease domain-containing protein [Butyrivibrio
           proteoclasticus B316]
 gi|302396895|gb|ADL35799.1| HNH endonuclease domain-containing protein [Butyrivibrio
           proteoclasticus B316]
          Length = 592

 Score = 37.4 bits (85), Expect = 2.8,   Method: Composition-based stats.
 Identities = 17/36 (47%), Positives = 21/36 (58%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           L  E DHI+P +KGG +  DN Q L  + NR K  K
Sbjct: 553 LLLEIDHIIPIAKGGLTKEDNLQTLCWKCNRSKGSK 588


>gi|260887966|ref|ZP_05899229.1| putative HNH endonuclease domain protein [Selenomonas sputigena
           ATCC 35185]
 gi|330838550|ref|YP_004413130.1| hypothetical protein Selsp_0703 [Selenomonas sputigena ATCC 35185]
 gi|260862217|gb|EEX76717.1| putative HNH endonuclease domain protein [Selenomonas sputigena
           ATCC 35185]
 gi|329746314|gb|AEB99670.1| protein of unknown function DUF262 [Selenomonas sputigena ATCC
           35185]
          Length = 374

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 23/33 (69%)

Query: 75  EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E DHI+P+SKGG +  DNCQ+L    N  K+DK
Sbjct: 341 EGDHIMPWSKGGHTQPDNCQMLCKSCNGKKTDK 373


>gi|153003686|ref|YP_001378011.1| HNH endonuclease [Anaeromyxobacter sp. Fw109-5]
 gi|152027259|gb|ABS25027.1| HNH endonuclease [Anaeromyxobacter sp. Fw109-5]
          Length = 355

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 12/76 (15%)

Query: 24  ERPRYFDGKAKSMCWAKAETVPGR-HPERWRKDAAGNIVCKRFCNCHGCLC---FEYDHI 79
           E+PR      KS   ++ +T+P     E WR+  AG     RF +   C C    EYDHI
Sbjct: 261 EKPR------KSRRGSRRDTIPAEVKREVWRR--AGGQCEWRFESGERCDCRRRLEYDHI 312

Query: 80  VPFSKGGESTADNCQI 95
            P + GG ST DN ++
Sbjct: 313 EPLALGGASTMDNVRL 328


>gi|404483379|ref|ZP_11018602.1| hypothetical protein HMPREF1135_01662 [Clostridiales bacterium
           OBRC5-5]
 gi|404343652|gb|EJZ70013.1| hypothetical protein HMPREF1135_01662 [Clostridiales bacterium
           OBRC5-5]
          Length = 762

 Score = 37.4 bits (85), Expect = 3.0,   Method: Composition-based stats.
 Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 9/58 (15%)

Query: 55  DAAGNIVCKRFCNCHGC-----LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           D AGN  C     C G      + F+ DHI+  + GG+S  +N QIL    N  KSDK
Sbjct: 709 DEAGNYRCA----CCGFTDRSRVPFQVDHIIAMNNGGKSVVENLQILCRSCNARKSDK 762


>gi|256375023|ref|YP_003098683.1| HNH nuclease [Actinosynnema mirum DSM 43827]
 gi|255919326|gb|ACU34837.1| HNH nuclease [Actinosynnema mirum DSM 43827]
          Length = 218

 Score = 37.4 bits (85), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 66  CNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
            +C G     YDH+VP ++GG ST  N ++L
Sbjct: 179 AHCGGATDLNYDHVVPLAEGGPSTVGNLRVL 209


>gi|172035554|ref|YP_001802055.1| putative reverse transcriptase [Cyanothece sp. ATCC 51142]
 gi|354556987|ref|ZP_08976261.1| RNA-directed DNA polymerase (Reverse transcriptase) [Cyanothece sp.
           ATCC 51472]
 gi|171697008|gb|ACB49989.1| putative reverse transcriptase [Cyanothece sp. ATCC 51142]
 gi|353551051|gb|EHC20474.1| RNA-directed DNA polymerase (Reverse transcriptase) [Cyanothece sp.
           ATCC 51472]
          Length = 607

 Score = 37.4 bits (85), Expect = 3.2,   Method: Composition-based stats.
 Identities = 40/172 (23%), Positives = 64/172 (37%), Gaps = 51/172 (29%)

Query: 15  SSNSNNSNEERPRYFDGKAKSMCWAKAETVP------------GRHPERWRKDAAGNIVC 62
           ++ S+N+      Y   K++     K ET P            G+HPE  ++ A+  ++ 
Sbjct: 459 ATRSDNNPYTLAEYGSFKSQKYAKVKGETSPYDGNLVYWSTRMGKHPEMPKRTAS--MLK 516

Query: 63  KRFCNCHGCLCF-------EYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKL 115
           K+   C  C  F       E DHI+P SKGG+    N Q+L    +  K+  +       
Sbjct: 517 KQKGKCTHCGLFFKDGDIIELDHIIPKSKGGKDEYKNWQLLHRHCHDKKTRTD------- 569

Query: 116 KGFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTV-AEMLGQFKSKDRWL 166
                                 G + R GN+  C +V  + L    S  RW+
Sbjct: 570 ----------------------GSLKRSGNKTDCNSVIPKQLSNIPSNYRWI 599


>gi|414581004|ref|ZP_11438144.1| hypothetical protein MA5S1215_2844 [Mycobacterium abscessus
           5S-1215]
 gi|420864922|ref|ZP_15328311.1| hypothetical protein MA4S0303_3285 [Mycobacterium abscessus
           4S-0303]
 gi|420869712|ref|ZP_15333094.1| hypothetical protein MA4S0726RA_3219 [Mycobacterium abscessus
           4S-0726-RA]
 gi|420874156|ref|ZP_15337532.1| hypothetical protein MA4S0726RB_2808 [Mycobacterium abscessus
           4S-0726-RB]
 gi|420881531|ref|ZP_15344898.1| hypothetical protein MA5S0304_2807 [Mycobacterium abscessus
           5S-0304]
 gi|420884715|ref|ZP_15348075.1| hypothetical protein MA5S0421_3062 [Mycobacterium abscessus
           5S-0421]
 gi|420891563|ref|ZP_15354910.1| hypothetical protein MA5S0422_3981 [Mycobacterium abscessus
           5S-0422]
 gi|420894868|ref|ZP_15358207.1| hypothetical protein MA5S0708_2734 [Mycobacterium abscessus
           5S-0708]
 gi|420901077|ref|ZP_15364408.1| hypothetical protein MA5S0817_2353 [Mycobacterium abscessus
           5S-0817]
 gi|420905963|ref|ZP_15369281.1| hypothetical protein MA5S1212_2490 [Mycobacterium abscessus
           5S-1212]
 gi|420974106|ref|ZP_15437297.1| hypothetical protein MA5S0921_3765 [Mycobacterium abscessus
           5S-0921]
 gi|421044511|ref|ZP_15507511.1| hypothetical protein MA4S0116S_2357 [Mycobacterium abscessus
           4S-0116-S]
 gi|392063638|gb|EIT89487.1| hypothetical protein MA4S0303_3285 [Mycobacterium abscessus
           4S-0303]
 gi|392065631|gb|EIT91479.1| hypothetical protein MA4S0726RB_2808 [Mycobacterium abscessus
           4S-0726-RB]
 gi|392069182|gb|EIT95029.1| hypothetical protein MA4S0726RA_3219 [Mycobacterium abscessus
           4S-0726-RA]
 gi|392078823|gb|EIU04650.1| hypothetical protein MA5S0422_3981 [Mycobacterium abscessus
           5S-0422]
 gi|392080478|gb|EIU06304.1| hypothetical protein MA5S0421_3062 [Mycobacterium abscessus
           5S-0421]
 gi|392086440|gb|EIU12265.1| hypothetical protein MA5S0304_2807 [Mycobacterium abscessus
           5S-0304]
 gi|392094180|gb|EIU19975.1| hypothetical protein MA5S0708_2734 [Mycobacterium abscessus
           5S-0708]
 gi|392098438|gb|EIU24232.1| hypothetical protein MA5S0817_2353 [Mycobacterium abscessus
           5S-0817]
 gi|392103867|gb|EIU29653.1| hypothetical protein MA5S1212_2490 [Mycobacterium abscessus
           5S-1212]
 gi|392116156|gb|EIU41924.1| hypothetical protein MA5S1215_2844 [Mycobacterium abscessus
           5S-1215]
 gi|392161989|gb|EIU87679.1| hypothetical protein MA5S0921_3765 [Mycobacterium abscessus
           5S-0921]
 gi|392233964|gb|EIV59462.1| hypothetical protein MA4S0116S_2357 [Mycobacterium abscessus
           4S-0116-S]
          Length = 92

 Score = 37.4 bits (85), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 23/40 (57%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVD 111
           L ++ DH++P SKGG  T DN      + NR KSDK   D
Sbjct: 41  LSYQVDHLIPLSKGGTDTLDNKVPSHRQCNRDKSDKLPED 80


>gi|420989738|ref|ZP_15452894.1| hypothetical protein MA4S0206_3302 [Mycobacterium abscessus
           4S-0206]
 gi|421040540|ref|ZP_15503548.1| hypothetical protein MA4S0116R_3259 [Mycobacterium abscessus
           4S-0116-R]
 gi|392184017|gb|EIV09668.1| hypothetical protein MA4S0206_3302 [Mycobacterium abscessus
           4S-0206]
 gi|392221468|gb|EIV46991.1| hypothetical protein MA4S0116R_3259 [Mycobacterium abscessus
           4S-0116-R]
          Length = 90

 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 23/40 (57%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVD 111
           L ++ DH++P SKGG  T DN      + NR KSDK   D
Sbjct: 39  LSYQVDHLIPLSKGGTDTLDNKVPSHRQCNRDKSDKLPED 78


>gi|418753307|ref|ZP_13309558.1| RHS repeat-associated core domain protein [Leptospira santarosai
           str. MOR084]
 gi|409966309|gb|EKO34155.1| RHS repeat-associated core domain protein [Leptospira santarosai
           str. MOR084]
          Length = 893

 Score = 37.0 bits (84), Expect = 3.4,   Method: Composition-based stats.
 Identities = 16/34 (47%), Positives = 24/34 (70%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           +++DHI P S GG++ + N QIL  + NR KS+K
Sbjct: 860 WQFDHIKPKSCGGDNCSSNLQILSRQENRIKSNK 893


>gi|213692526|ref|YP_002323112.1| HNH nuclease [Bifidobacterium longum subsp. infantis ATCC 15697 =
           JCM 1222]
 gi|213523987|gb|ACJ52734.1| HNH nuclease [Bifidobacterium longum subsp. infantis ATCC 15697 =
           JCM 1222]
          Length = 110

 Score = 37.0 bits (84), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 72  LCFEYDHIVPFSKGGESTA-DNCQILQTRVNRFKSDKEQVDA-TKLKGFSCDVKFTDKEL 129
           +  E D ++P S+GG+  + +NC++   R NR KSDK    A + L G +   K T    
Sbjct: 46  MSAEVDELIPVSRGGDPLSFNNCRLTHRRCNRIKSDKTDAYARSHLGGAAAQPKATSIPF 105

Query: 130 DI 131
           ++
Sbjct: 106 EL 107


>gi|210623659|ref|ZP_03293968.1| hypothetical protein CLOHIR_01918 [Clostridium hiranonis DSM 13275]
 gi|210153424|gb|EEA84430.1| hypothetical protein CLOHIR_01918 [Clostridium hiranonis DSM 13275]
          Length = 912

 Score = 37.0 bits (84), Expect = 3.7,   Method: Composition-based stats.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSD 106
           F+ DHIVP SKGG++  +N Q+L    N  K D
Sbjct: 658 FQIDHIVPLSKGGKTVVENLQLLTRWENAQKGD 690


>gi|417162963|ref|ZP_11998293.1| PF03235 family protein [Escherichia coli 99.0741]
 gi|386173454|gb|EIH45466.1| PF03235 family protein [Escherichia coli 99.0741]
          Length = 390

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 23/33 (69%)

Query: 75  EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E DH+  +SKGGEST DNC++L    NR K +K
Sbjct: 358 EADHVSAWSKGGESTLDNCEMLCITHNRSKGNK 390


>gi|227484896|ref|ZP_03915212.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC
           51172]
 gi|227237051|gb|EEI87066.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC
           51172]
          Length = 121

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 14/20 (70%), Positives = 19/20 (95%)

Query: 77  DHIVPFSKGGESTADNCQIL 96
           DHIVP+SKGG++ A+NCQ+L
Sbjct: 91  DHIVPWSKGGKTIAENCQML 110


>gi|126326723|ref|XP_001378250.1| PREDICTED: e3 ubiquitin-protein ligase ZSWIM2 [Monodelphis
           domestica]
          Length = 648

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 54/145 (37%), Gaps = 26/145 (17%)

Query: 62  CKRFCNCHGCL--CFEYDHIVPFS-------KGGESTADNCQILQTRVNRFKSDKEQVDA 112
           C  +  CH C   C+   HI  F        K  E  +  C + ++  N    ++ + + 
Sbjct: 261 CTEYHLCHECFISCYHPAHIFIFRQKRNQRWKSLEKFSGLCDLRRSIKNLELKNQTEKEM 320

Query: 113 TKL-KGFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQF------------ 159
           T   K   C  K   K L +I +  +  ++ PG+QCR    A  LGQ             
Sbjct: 321 TNFEKSIVCTPKHVVKSLPLILITKHSKLLAPGHQCRLCLKAFYLGQHTRVLSCNHKFHR 380

Query: 160 KSKDRWLLANCHMMVNHCSTRGSCI 184
           K  D WLL  C    N C   G  +
Sbjct: 381 KCIDDWLLNIC----NSCPIDGQTV 401


>gi|83591793|ref|YP_425545.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
           rubrum ATCC 11170]
 gi|386348484|ref|YP_006046732.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
           rubrum F11]
 gi|83574707|gb|ABC21258.1| CRISPR-associated endonuclease, Csn1 family [Rhodospirillum rubrum
           ATCC 11170]
 gi|346716920|gb|AEO46935.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
           rubrum F11]
          Length = 1173

 Score = 37.0 bits (84), Expect = 4.1,   Method: Composition-based stats.
 Identities = 13/39 (33%), Positives = 24/39 (61%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDA 112
           ++ DH++P ++GG  + DN  + Q+  N+ K DK   +A
Sbjct: 624 YDIDHVIPLARGGRDSLDNMVLCQSDANKTKGDKTPWEA 662


>gi|118443767|ref|YP_877830.1| HNH endonuclease domain-containing protein [Clostridium novyi NT]
 gi|118134223|gb|ABK61267.1| HNH endonuclease domain protein [Clostridium novyi NT]
          Length = 299

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 21/36 (58%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           L  E DHI+P SKGG ST  N Q L  + NR K  K
Sbjct: 262 LLLEIDHIIPISKGGMSTEKNLQTLCWKCNRKKGAK 297


>gi|291542181|emb|CBL15291.1| Restriction endonuclease [Ruminococcus bromii L2-63]
          Length = 374

 Score = 36.6 bits (83), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 5/39 (12%)

Query: 74  FEYD-----HIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           FEYD     HI P+SKGG +  DNCQ+L    N  K+DK
Sbjct: 335 FEYDEMEGDHIKPWSKGGRTILDNCQMLCKSCNAKKTDK 373


>gi|444913287|ref|ZP_21233440.1| hypothetical protein D187_05377 [Cystobacter fuscus DSM 2262]
 gi|444716046|gb|ELW56903.1| hypothetical protein D187_05377 [Cystobacter fuscus DSM 2262]
          Length = 43

 Score = 36.6 bits (83), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 75  EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E DH++P SKGGE T +N Q+L    N  KS+K
Sbjct: 9   ERDHVIPKSKGGEGTPENGQVLCRECNLEKSNK 41


>gi|381159335|ref|ZP_09868567.1| restriction endonuclease [Thiorhodovibrio sp. 970]
 gi|380877399|gb|EIC19491.1| restriction endonuclease [Thiorhodovibrio sp. 970]
          Length = 361

 Score = 36.6 bits (83), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 75  EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E DHI P+ +GG+++A+NCQ+L    NR KS K
Sbjct: 329 EADHITPWHEGGKTSAENCQMLCKEDNRRKSGK 361


>gi|320538580|ref|ZP_08038435.1| HNH endonuclease domain protein [Treponema phagedenis F0421]
 gi|320144569|gb|EFW36330.1| HNH endonuclease domain protein [Treponema phagedenis F0421]
          Length = 361

 Score = 36.6 bits (83), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 26/47 (55%)

Query: 61  VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           +CK   N       E DHI+P+  GG++  +N Q+L    NR KSDK
Sbjct: 315 ICKDCGNYFEIEEMEGDHIIPWKDGGKTIDENLQMLCKNCNRIKSDK 361


>gi|284008322|emb|CBA74691.1| conserved hypothetical protein [Arsenophonus nasoniae]
          Length = 124

 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 74  FEYDHIVPFSKGGESTADNCQIL 96
           FE DHI+P  +GG+    NCQIL
Sbjct: 89  FELDHIIPLWQGGQDVESNCQIL 111


>gi|384199724|ref|YP_005585467.1| phage protein [Bifidobacterium longum subsp. infantis ATCC 15697 =
           JCM 1222]
 gi|320458676|dbj|BAJ69297.1| hypothetical phage protein [Bifidobacterium longum subsp. infantis
           ATCC 15697 = JCM 1222]
          Length = 89

 Score = 36.6 bits (83), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 2/59 (3%)

Query: 75  EYDHIVPFSKGGESTA-DNCQILQTRVNRFKSDKEQVDA-TKLKGFSCDVKFTDKELDI 131
           E D ++P S+GG+  + +NC++   R NR KSDK    A + L G +   K T    ++
Sbjct: 28  EVDELIPVSRGGDPLSFNNCRLTHRRCNRIKSDKTDAYARSHLGGAAAQPKATSIPFEL 86


>gi|339782334|gb|AEK08058.1| gp99 [Mycobacterium phage Anaya]
          Length = 100

 Score = 36.2 bits (82), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 19/34 (55%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           F+ DHI P   GG  T DN Q    + NR KS+K
Sbjct: 43  FQVDHITPLDAGGSDTLDNTQPTHRKCNRDKSNK 76


>gi|71064769|ref|YP_263496.1| hypothetical protein Psyc_0189 [Psychrobacter arcticus 273-4]
 gi|71037754|gb|AAZ18062.1| hypothetical protein Psyc_0189 [Psychrobacter arcticus 273-4]
          Length = 585

 Score = 36.2 bits (82), Expect = 6.0,   Method: Composition-based stats.
 Identities = 19/54 (35%), Positives = 26/54 (48%)

Query: 54  KDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           +D+    +C    +    L  E DHI+P SKGG +   N Q L  + NR K  K
Sbjct: 259 RDSYACKICNLSTSDEANLLLEIDHIIPLSKGGVTCESNLQTLCWKCNRSKGSK 312


>gi|126659072|ref|ZP_01730212.1| hypothetical protein CY0110_04663 [Cyanothece sp. CCY0110]
 gi|126619600|gb|EAZ90329.1| hypothetical protein CY0110_04663 [Cyanothece sp. CCY0110]
          Length = 636

 Score = 36.2 bits (82), Expect = 6.0,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 36/78 (46%), Gaps = 9/78 (11%)

Query: 46  GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
           G+HPE   K +   ++ K+   C+ C L F      E DHI P   GG +  DN Q L  
Sbjct: 497 GKHPEM--KTSVAILLKKQKGKCNHCGLTFMPGDKIETDHITPTKAGGNNQRDNLQALHI 554

Query: 99  RVNRFKSDKEQVDATKLK 116
             +  K+ K+ +D    K
Sbjct: 555 HCHDIKTRKDLIDIKSYK 572


>gi|304360852|ref|YP_003856973.1| gp95 [Mycobacterium phage Angelica]
 gi|302858436|gb|ADL71184.1| gp95 [Mycobacterium phage Angelica]
 gi|339753826|gb|AEJ93846.1| gp95 [Mycobacterium phage JAWS]
 gi|339755861|gb|AEJ95864.1| gp95 [Mycobacterium phage Adephagia]
 gi|347450197|gb|AEO94235.1| gp96 [Mycobacterium phage BarrelRoll]
          Length = 100

 Score = 36.2 bits (82), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 19/34 (55%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           F+ DHI P   GG  T DN Q    + NR KS+K
Sbjct: 43  FQVDHITPLDAGGSDTLDNTQPTHRKCNRDKSNK 76


>gi|153005759|ref|YP_001380084.1| HNH endonuclease [Anaeromyxobacter sp. Fw109-5]
 gi|152029332|gb|ABS27100.1| HNH endonuclease [Anaeromyxobacter sp. Fw109-5]
          Length = 336

 Score = 36.2 bits (82), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 37/76 (48%), Gaps = 12/76 (15%)

Query: 24  ERPRYFDGKAKSMCWAKAETVPGR-HPERWRKDAAGNIVCKRFCNCHGCLC---FEYDHI 79
           E+PR   G ++       +T+P     E WR+  AG     RF +   C C    EYDHI
Sbjct: 242 EKPRKGRGGSRR------DTIPAEVKREVWRR--AGGQCEWRFESGERCDCRRRLEYDHI 293

Query: 80  VPFSKGGESTADNCQI 95
            P + GG ST DN ++
Sbjct: 294 EPLALGGASTMDNVRL 309


>gi|332523012|ref|ZP_08399264.1| HNH endonuclease domain protein [Streptococcus porcinus str.
           Jelinkova 176]
 gi|332314276|gb|EGJ27261.1| HNH endonuclease domain protein [Streptococcus porcinus str.
           Jelinkova 176]
          Length = 117

 Score = 36.2 bits (82), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 72  LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDA 112
           L    DHIVP +KGG  S+ DN Q+   + NR KSDK   D 
Sbjct: 51  LSPAIDHIVPIAKGGHPSSMDNLQLTHWQCNRQKSDKLYSDV 92


>gi|304360948|ref|YP_003857068.1| gp96 [Mycobacterium phage CrimD]
 gi|302858697|gb|ADL71442.1| gp96 [Mycobacterium phage CrimD]
          Length = 100

 Score = 36.2 bits (82), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 19/34 (55%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           F+ DHI P   GG  T DN Q    + NR KS+K
Sbjct: 43  FQVDHITPLDAGGSDTLDNTQPTHRKCNRDKSNK 76


>gi|434385792|ref|YP_007096403.1| Retron-type reverse transcriptase [Chamaesiphon minutus PCC 6605]
 gi|428016782|gb|AFY92876.1| Retron-type reverse transcriptase [Chamaesiphon minutus PCC 6605]
          Length = 605

 Score = 36.2 bits (82), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 9/67 (13%)

Query: 38  WAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCF-------EYDHIVPFSKGGESTA 90
           W    +  G+HPE   K   GN++  +   C  C  +       E DHI+P SKGG+   
Sbjct: 496 WVYWSSRMGKHPEVSTK--VGNLLKTQKGKCAHCKNYFKDGDSLEVDHIIPKSKGGKDKY 553

Query: 91  DNCQILQ 97
           +N Q+L 
Sbjct: 554 ENWQLLH 560


>gi|406977333|gb|EKD99509.1| HNH endonuclease [uncultured bacterium]
          Length = 233

 Score = 36.2 bits (82), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 7/59 (11%)

Query: 50  ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSDK 107
           E W++D    ++C    N H      +DH +PFSKGG S T  N ++L  + N  KS K
Sbjct: 179 EVWKRDKGKCVICGATTNLH------FDHDLPFSKGGTSLTEKNVKLLCLKHNLEKSAK 231


>gi|227549709|ref|ZP_03979758.1| conserved hypothetical protein [Corynebacterium lipophiloflavum DSM
           44291]
 gi|227078205|gb|EEI16168.1| conserved hypothetical protein [Corynebacterium lipophiloflavum DSM
           44291]
          Length = 472

 Score = 36.2 bits (82), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 75  EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           + DH +P+ +GGE+T DN   L  R + FK+DK
Sbjct: 318 QLDHRIPYGQGGETTPDNLHCLCQRHHNFKTDK 350


>gi|126658771|ref|ZP_01729916.1| hypothetical protein CY0110_32455 [Cyanothece sp. CCY0110]
 gi|126620033|gb|EAZ90757.1| hypothetical protein CY0110_32455 [Cyanothece sp. CCY0110]
          Length = 199

 Score = 36.2 bits (82), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 46  GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
           G+HPE   K +   ++ K+   C+ C L F      E DHI P   GG +  DN Q L +
Sbjct: 60  GKHPEM--KKSVATLLKKQKGKCNHCGLTFMPGDKIETDHITPIKAGGNNLKDNLQALHS 117

Query: 99  RVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
             +  K+ ++  D    K      K  DK L   E+  + ++I+
Sbjct: 118 HCHDTKTKRDLKDIKSYKS----QKIWDKTLK--EINFHFEIIK 155


>gi|453381798|dbj|GAC83531.1| hypothetical protein GP2_013_00080 [Gordonia paraffinivorans NBRC
           108238]
          Length = 91

 Score = 35.8 bits (81), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 19/34 (55%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           F  DH  PF+ GG  T DN Q    R NR KSD+
Sbjct: 43  FVVDHKKPFAHGGADTIDNKQPAHNRCNRVKSDR 76


>gi|422012873|ref|ZP_16359511.1| HNH endonuclease domain protein [Actinomyces georgiae F0490]
 gi|394752101|gb|EJF35815.1| HNH endonuclease domain protein [Actinomyces georgiae F0490]
          Length = 94

 Score = 35.8 bits (81), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 28/63 (44%), Gaps = 6/63 (9%)

Query: 51  RWRKDAAGNIVCKRFCNCHGCLCF------EYDHIVPFSKGGESTADNCQILQTRVNRFK 104
           R RK     I  +    CH C           DH+VP SKGG ST DN ++   R N  K
Sbjct: 13  RQRKRITDAIYIRDAARCHLCHQLVRRRDASVDHVVPSSKGGPSTLDNLKLAHRRCNFAK 72

Query: 105 SDK 107
            ++
Sbjct: 73  GNR 75


>gi|331003174|ref|ZP_08326682.1| hypothetical protein HMPREF0491_01544 [Lachnospiraceae oral taxon
           107 str. F0167]
 gi|330412930|gb|EGG92309.1| hypothetical protein HMPREF0491_01544 [Lachnospiraceae oral taxon
           107 str. F0167]
          Length = 762

 Score = 35.8 bits (81), Expect = 7.9,   Method: Composition-based stats.
 Identities = 17/34 (50%), Positives = 21/34 (61%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           F+ DHIV  + GG+S  +N QIL    N  KSDK
Sbjct: 729 FQVDHIVAMNNGGKSVVENLQILCRSCNGEKSDK 762


>gi|312880866|ref|ZP_07740666.1| HNH endonuclease [Aminomonas paucivorans DSM 12260]
 gi|310784157|gb|EFQ24555.1| HNH endonuclease [Aminomonas paucivorans DSM 12260]
          Length = 345

 Score = 35.8 bits (81), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 61  VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDA 112
           +C R    HG +    DHIVP SKGG    DN + L  + N  +S+ E+V +
Sbjct: 293 LCGRSAERHGVV-LHVDHIVPRSKGGTDEMDNLRTLCMKCNLGRSNLEEVQS 343


>gi|270593505|ref|ZP_06221465.1| hypothetical protein HAINFHK1212_1873, partial [Haemophilus
           influenzae HK1212]
 gi|270318413|gb|EFA29540.1| hypothetical protein HAINFHK1212_1873 [Haemophilus influenzae
           HK1212]
          Length = 161

 Score = 35.8 bits (81), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 24/34 (70%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
            E DH+  +SKGG ++A+NC++L T  NR K +K
Sbjct: 128 MEADHVSAWSKGGTTSAENCEMLCTMHNRAKGNK 161


>gi|119484407|ref|ZP_01619024.1| hypothetical protein L8106_01777 [Lyngbya sp. PCC 8106]
 gi|119457881|gb|EAW39004.1| hypothetical protein L8106_01777 [Lyngbya sp. PCC 8106]
          Length = 142

 Score = 35.8 bits (81), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 1/37 (2%)

Query: 76  YDHIVPFSKGGESTADNCQILQTRVNRFKSD-KEQVD 111
           YDHI+P SKGGE+  DN  +     N FKSD  E +D
Sbjct: 37  YDHILPRSKGGETIFDNVCLACRTCNEFKSDITEAID 73


>gi|18496975|ref|NP_569825.1| hypothetical protein TM4_gp92 [Mycobacterium phage TM4]
          Length = 92

 Score = 35.8 bits (81), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 19/36 (52%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           L F  DHI P + GG  T DN      + NR KSDK
Sbjct: 41  LSFTIDHITPLALGGTDTLDNIGAAHRKCNRDKSDK 76


>gi|411008724|ref|ZP_11385053.1| HNH nuclease [Aeromonas aquariorum AAK1]
          Length = 110

 Score = 35.8 bits (81), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 41/102 (40%), Gaps = 9/102 (8%)

Query: 29  FDGKAKSMCWAKAETVPGRHPER--WRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
           F  +     WAKA T  G   E   +RKD     + K      G   +E DHI P S GG
Sbjct: 3   FSQQTIDAVWAKA-THCGTENEGNGFRKDQCTAWIKKSSYGTTGTYGWEIDHITPVSSGG 61

Query: 87  ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKE 128
                N + L    NR +SD       +LK    +VK  D E
Sbjct: 62  SDNLSNLRPLHWANNRARSDG------RLKTEKPEVKSVDNE 97


>gi|363980955|gb|AAD17657.2| gp92 [Mycobacterium phage TM4]
          Length = 93

 Score = 35.8 bits (81), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 19/36 (52%)

Query: 72  LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           L F  DHI P + GG  T DN      + NR KSDK
Sbjct: 42  LSFTIDHITPLALGGTDTLDNIGAAHRKCNRDKSDK 77


>gi|331268551|ref|YP_004395043.1| putative wall-associated protein [Clostridium botulinum BKT015925]
 gi|329125101|gb|AEB75046.1| putative wall-associated protein [Clostridium botulinum BKT015925]
          Length = 301

 Score = 35.8 bits (81), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           ++ DHI+P SKGG ++  N +I+  + NR K DK
Sbjct: 268 WQIDHIIPKSKGGTNSFGNARIISLKWNRIKWDK 301


>gi|284097476|ref|ZP_06385569.1| HNH endonuclease [Candidatus Poribacteria sp. WGA-A3]
 gi|283831012|gb|EFC35029.1| HNH endonuclease [Candidatus Poribacteria sp. WGA-A3]
          Length = 171

 Score = 35.8 bits (81), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 2/45 (4%)

Query: 61  VCKRFCNCHGCL-CFEYDHIVPFSKGGESTADNCQILQTRVNRFK 104
           +C R C  H  L   E DHI P+S+GG++  +NCQ+L    NR K
Sbjct: 125 ICPR-CGEHFVLGAMEADHIDPWSEGGKTAPENCQMLCRPCNRRK 168


>gi|196231796|ref|ZP_03130653.1| HNH endonuclease [Chthoniobacter flavus Ellin428]
 gi|196224268|gb|EDY18781.1| HNH endonuclease [Chthoniobacter flavus Ellin428]
          Length = 194

 Score = 35.8 bits (81), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 77  DHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDA 112
           DH+VP S+GG++T +NC +    VN  K+D+   +A
Sbjct: 123 DHVVPRSRGGKTTWENCVLAHREVNTRKADRLPQEA 158


>gi|406984375|gb|EKE05414.1| HNH endonuclease [uncultured bacterium]
          Length = 90

 Score = 35.8 bits (81), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 22/34 (64%)

Query: 74  FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
            E DHI P+  GG++ A+NCQ+L    NR KS K
Sbjct: 57  MEADHIDPWHDGGKTIAENCQMLCKECNRRKSGK 90


>gi|302782555|ref|XP_002973051.1| hypothetical protein SELMODRAFT_98470 [Selaginella moellendorffii]
 gi|300159652|gb|EFJ26272.1| hypothetical protein SELMODRAFT_98470 [Selaginella moellendorffii]
          Length = 204

 Score = 35.8 bits (81), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 48/91 (52%), Gaps = 4/91 (4%)

Query: 19  NNSNEERPRYFDGKAKSMCWAKAETV--PGRHPERWRKDAAGNIVCKRFCNCHGCLCFEY 76
           ++S +E P +   K K + W + + +   G  P  +R D+ GN++     +    L +E 
Sbjct: 85  SDSRKEHPSHLH-KFKDVLWQQLDHLGNAGFDPACFRVDSYGNVLYWH-ADPASPLAWEV 142

Query: 77  DHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
            H  P+S+GG++  +N ++ Q +V+  K +K
Sbjct: 143 CHWFPYSRGGKTVLNNLRLAQWQVSLKKKEK 173


>gi|147669720|ref|YP_001214538.1| HNH endonuclease [Dehalococcoides sp. BAV1]
 gi|146270668|gb|ABQ17660.1| HNH endonuclease [Dehalococcoides sp. BAV1]
          Length = 228

 Score = 35.8 bits (81), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 7/46 (15%)

Query: 52  WRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQIL 96
           W +D    I C    N H      +DHI+P+SKGG S  A+N Q+L
Sbjct: 176 WARDQGKCIKCGISSNLH------FDHIIPYSKGGSSLVAENIQLL 215


>gi|406998393|gb|EKE16327.1| hypothetical protein ACD_11C00020G0032 [uncultured bacterium]
          Length = 367

 Score = 35.4 bits (80), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 23/33 (69%)

Query: 75  EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
           E DHI P+ +GG++ A+NCQIL    NR KS K
Sbjct: 335 EADHIKPWHEGGKTIANNCQILCKDDNRRKSGK 367


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.131    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,934,404,746
Number of Sequences: 23463169
Number of extensions: 113516347
Number of successful extensions: 627628
Number of sequences better than 100.0: 256
Number of HSP's better than 100.0 without gapping: 185
Number of HSP's successfully gapped in prelim test: 71
Number of HSP's that attempted gapping in prelim test: 627366
Number of HSP's gapped (non-prelim): 280
length of query: 189
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 55
effective length of database: 9,215,130,721
effective search space: 506832189655
effective search space used: 506832189655
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)