BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029699
(189 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359491684|ref|XP_003634305.1| PREDICTED: uncharacterized protein LOC100852985 [Vitis vinifera]
gi|297733935|emb|CBI15182.3| unnamed protein product [Vitis vinifera]
Length = 167
Score = 281 bits (718), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 133/170 (78%), Positives = 148/170 (87%), Gaps = 8/170 (4%)
Query: 6 SSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
SSSSP+RSR EERPR+FD KAKS+CWAKA+ VPGRHP+RWRKDAAGNIVCKRF
Sbjct: 2 SSSSPHRSRD------KEERPRFFDPKAKSLCWAKADAVPGRHPDRWRKDAAGNIVCKRF 55
Query: 66 CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
CNC GCLCFEYDHI+PFSKGGES +NCQILQTRVNRFKSDK+ VD T+L+G+SCDVKFT
Sbjct: 56 CNCQGCLCFEYDHIIPFSKGGESVPENCQILQTRVNRFKSDKDNVDKTQLQGYSCDVKFT 115
Query: 126 DKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVN 175
DKELDIIEMAVYGDVIRPGNQCRC+TVAEMLGQ+K KDR +A C + N
Sbjct: 116 DKELDIIEMAVYGDVIRPGNQCRCRTVAEMLGQYKPKDR--MAACKLPFN 163
>gi|449440818|ref|XP_004138181.1| PREDICTED: uncharacterized protein LOC101204599 [Cucumis sativus]
gi|449477195|ref|XP_004154957.1| PREDICTED: uncharacterized protein LOC101224154 [Cucumis sativus]
Length = 181
Score = 275 bits (704), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 126/160 (78%), Positives = 144/160 (90%), Gaps = 2/160 (1%)
Query: 17 NSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEY 76
+ + S +ERPR+FD KAK+ CWAKA+ VPGRHPERWRKDAAGN+VCKRFCNC GCLCFEY
Sbjct: 20 SGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEY 79
Query: 77 DHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAV 136
DHIVP+SKGGESTADNCQILQTRVNRFKS+K+ VD ++LKG+SCDVKFTDKELDIIEMAV
Sbjct: 80 DHIVPYSKGGESTADNCQILQTRVNRFKSNKDDVDTSELKGYSCDVKFTDKELDIIEMAV 139
Query: 137 YGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVNH 176
YGDVIRPGNQCRC+TV+E+LGQ+KSKDR LA C + N
Sbjct: 140 YGDVIRPGNQCRCRTVSEVLGQYKSKDR--LAPCKLPYNE 177
>gi|255547311|ref|XP_002514713.1| endonuclease, putative [Ricinus communis]
gi|223546317|gb|EEF47819.1| endonuclease, putative [Ricinus communis]
Length = 173
Score = 271 bits (692), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 128/167 (76%), Positives = 145/167 (86%), Gaps = 3/167 (1%)
Query: 7 SSSPNRSRS-SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
SSSP R+RS SNS + ERPR+F KAK++CWA AETVPGRHPERWRKDAAGNIVCKR
Sbjct: 2 SSSPQRARSESNSGDGAGERPRFFSNKAKNICWANAETVPGRHPERWRKDAAGNIVCKRL 61
Query: 66 CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
NC GCLCF+YDHI+PFSKGGESTA NCQILQTRVN +K+D E VD ++LKG+SCD+KFT
Sbjct: 62 GNCQGCLCFQYDHIIPFSKGGESTAANCQILQTRVNNYKADAEAVDKSQLKGYSCDLKFT 121
Query: 126 DKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
DKELDIIEMAVYGDVIRPGNQCRC+TVAE+LGQ KSKDR +A C +
Sbjct: 122 DKELDIIEMAVYGDVIRPGNQCRCRTVAEILGQTKSKDR--VAACKL 166
>gi|224119042|ref|XP_002317971.1| predicted protein [Populus trichocarpa]
gi|222858644|gb|EEE96191.1| predicted protein [Populus trichocarpa]
Length = 174
Score = 268 bits (684), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 127/170 (74%), Positives = 144/170 (84%), Gaps = 5/170 (2%)
Query: 6 SSSSPNRSRSSNSNNS---NEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
S SSP R R NSN ++RPR+FD K MCW AE VPGRHPERWRKDAAGNIVC
Sbjct: 1 SPSSPQRYRGGNSNGGGGGEDQRPRFFDKMVKKMCWQNAEIVPGRHPERWRKDAAGNIVC 60
Query: 63 KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDV 122
KRFCNCHGCLC+EYDHI+PFSKGGEST DNCQILQTRVNRFKS+K+++D T+LKG+SC++
Sbjct: 61 KRFCNCHGCLCYEYDHILPFSKGGESTVDNCQILQTRVNRFKSNKQELDKTQLKGYSCEI 120
Query: 123 KFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
FTDKELDIIEMAVYGDVIRPGNQCRC+T+AEMLGQ KSKDR +A C +
Sbjct: 121 NFTDKELDIIEMAVYGDVIRPGNQCRCRTIAEMLGQLKSKDR--VAACKL 168
>gi|118481005|gb|ABK92456.1| unknown [Populus trichocarpa]
Length = 180
Score = 264 bits (675), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 118/150 (78%), Positives = 135/150 (90%), Gaps = 2/150 (1%)
Query: 23 EERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPF 82
++RPR+FD K MCW AE VPGRHPERWRKDAAGNIVCKRFCNCHGCLC+EYDHI+PF
Sbjct: 26 DQRPRFFDKMVKKMCWQNAEIVPGRHPERWRKDAAGNIVCKRFCNCHGCLCYEYDHILPF 85
Query: 83 SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
SKGGEST DNCQILQTRVNRFKS+K+++D T+LKG+SC++ FTDKELDIIEMAVYGDVIR
Sbjct: 86 SKGGESTVDNCQILQTRVNRFKSNKQELDKTQLKGYSCEINFTDKELDIIEMAVYGDVIR 145
Query: 143 PGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
PGNQCRC+T+AEMLGQ KSKDR +A C +
Sbjct: 146 PGNQCRCRTIAEMLGQLKSKDR--VAACKL 173
>gi|297850254|ref|XP_002893008.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297338850|gb|EFH69267.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 179
Score = 258 bits (658), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 118/147 (80%), Positives = 133/147 (90%)
Query: 17 NSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEY 76
N ++ EERPR+FDGKAK+ CWA A+TVPGRHPERWRKDAAGNIVCKRF NC+GCLCFEY
Sbjct: 24 NVKSNGEERPRFFDGKAKNKCWANADTVPGRHPERWRKDAAGNIVCKRFGNCNGCLCFEY 83
Query: 77 DHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAV 136
DHIVP+SKGGES A+NCQILQTRVNRFKS +E VDAT LK +SC ++FTDKELDIIEMAV
Sbjct: 84 DHIVPYSKGGESIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDIIEMAV 143
Query: 137 YGDVIRPGNQCRCKTVAEMLGQFKSKD 163
YGDV+RPG +CRCKTVAE+LGQ KSKD
Sbjct: 144 YGDVLRPGKECRCKTVAELLGQSKSKD 170
>gi|15221835|ref|NP_173303.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana]
gi|6730716|gb|AAF27111.1|AC011809_20 Unknown protein [Arabidopsis thaliana]
gi|26451201|dbj|BAC42704.1| unknown protein [Arabidopsis thaliana]
gi|28973585|gb|AAO64117.1| unknown protein [Arabidopsis thaliana]
gi|332191624|gb|AEE29745.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana]
Length = 186
Score = 257 bits (656), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 116/148 (78%), Positives = 132/148 (89%)
Query: 16 SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
SN ++ EERPR+FDGKAK+ CWA A+ VPGRHPERWRKD AGNIVCKRF NC+GCLCFE
Sbjct: 30 SNVKSNGEERPRFFDGKAKNKCWANADIVPGRHPERWRKDVAGNIVCKRFGNCNGCLCFE 89
Query: 76 YDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMA 135
YDHIVP+SKGGES A+NCQILQTRVNRFKS +E VDAT LK +SC ++FTDKELD+IEMA
Sbjct: 90 YDHIVPYSKGGESIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDVIEMA 149
Query: 136 VYGDVIRPGNQCRCKTVAEMLGQFKSKD 163
VYGDV+RPG +CRCKTVAE+LGQ KSKD
Sbjct: 150 VYGDVLRPGKECRCKTVAELLGQSKSKD 177
>gi|351722082|ref|NP_001235440.1| uncharacterized protein LOC100527904 [Glycine max]
gi|255633516|gb|ACU17116.1| unknown [Glycine max]
Length = 162
Score = 254 bits (648), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 117/158 (74%), Positives = 136/158 (86%), Gaps = 5/158 (3%)
Query: 6 SSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
SSSSP RSR E+RPR+FD AK++CW+KA+TVPGRHPERWRKDAAGN+VCKRF
Sbjct: 2 SSSSPQRSRGDG-----EKRPRFFDSNAKAICWSKADTVPGRHPERWRKDAAGNVVCKRF 56
Query: 66 CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
NC GCLC+EYDHI+PFSKGGESTADNCQILQ+RVNR KSDK +D+ +LK +SC+V FT
Sbjct: 57 FNCIGCLCYEYDHIIPFSKGGESTADNCQILQSRVNRLKSDKYNIDSDQLKDYSCEVNFT 116
Query: 126 DKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKD 163
DKELDIIEMAVY DV+RPGNQCRC+T+AE LG+FK KD
Sbjct: 117 DKELDIIEMAVYSDVMRPGNQCRCRTIAEKLGKFKPKD 154
>gi|218184321|gb|EEC66748.1| hypothetical protein OsI_33113 [Oryza sativa Indica Group]
Length = 178
Score = 244 bits (622), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 116/179 (64%), Positives = 139/179 (77%), Gaps = 8/179 (4%)
Query: 6 SSSSPNR------SRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGN 59
+ SSP R + +S +ERPR FD K +S CW KA V GRHPERWR+DAAGN
Sbjct: 2 APSSPRRDLDGEGTGGGSSAAREKERPRSFDEKTRSACWRKAAVVMGRHPERWRQDAAGN 61
Query: 60 IVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFS 119
+VC+RF +CHGCLC+EYDHIVPFSKGGEST DNCQILQTRVNR KSDK ++ +++GFS
Sbjct: 62 VVCRRFWSCHGCLCYEYDHIVPFSKGGESTVDNCQILQTRVNRSKSDKAWIEQAEMQGFS 121
Query: 120 CDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVNHCS 178
CD+KFTDKELD+IEMAVYGDVIRPG QCRC+TVAEMLG+ KSK++ +A C + N S
Sbjct: 122 CDIKFTDKELDVIEMAVYGDVIRPGKQCRCRTVAEMLGKVKSKNQ--MAACELPYNDAS 178
>gi|115481558|ref|NP_001064372.1| Os10g0335200 [Oryza sativa Japonica Group]
gi|21671981|gb|AAM74343.1|AC115686_10 Putative HNH endonuclease [Oryza sativa Japonica Group]
gi|23477807|gb|AAN34947.1| Putative HNH endonuclease [Oryza sativa Japonica Group]
gi|31431266|gb|AAP53074.1| HNH endonuclease family protein, expressed [Oryza sativa Japonica
Group]
gi|113638981|dbj|BAF26286.1| Os10g0335200 [Oryza sativa Japonica Group]
gi|125574417|gb|EAZ15701.1| hypothetical protein OsJ_31114 [Oryza sativa Japonica Group]
gi|215693775|dbj|BAG88974.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215707241|dbj|BAG93701.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 178
Score = 243 bits (619), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 116/179 (64%), Positives = 139/179 (77%), Gaps = 8/179 (4%)
Query: 6 SSSSPNR------SRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGN 59
+ SSP R + +S +ERPR FD K +S CW KA V GRHPERWR+DAAGN
Sbjct: 2 APSSPRRDLDGEGTGGGSSAAREKERPRSFDEKTRSACWRKAAVVMGRHPERWRQDAAGN 61
Query: 60 IVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFS 119
+VC+RF +CHGCLC+EYDHIVPFSKGGEST DNCQILQTRVNR KSDK ++ +++GFS
Sbjct: 62 VVCRRFWSCHGCLCYEYDHIVPFSKGGESTVDNCQILQTRVNRSKSDKAWIEQAEMQGFS 121
Query: 120 CDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVNHCS 178
CD+KFTDKELD+IEMAVYGDVIRPG QCRC+TVAEMLG+ KSK++ +A C + N S
Sbjct: 122 CDIKFTDKELDVIEMAVYGDVIRPGKQCRCRTVAEMLGKVKSKNQ--MAACELPHNDAS 178
>gi|326513564|dbj|BAJ87801.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 175
Score = 242 bits (617), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 109/150 (72%), Positives = 130/150 (86%), Gaps = 2/150 (1%)
Query: 23 EERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPF 82
+ERPR FD K ++ CW KA + GRHPERWR+DAAGN+VC+RF +CHGCLC+EYDHIVPF
Sbjct: 26 KERPRSFDEKTRTACWRKAAVLTGRHPERWRQDAAGNVVCRRFWSCHGCLCYEYDHIVPF 85
Query: 83 SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
SKGGEST +NCQILQTRVNR KSDK V+ +++GFSCD+KFTDKELD+IEMAVYGDVIR
Sbjct: 86 SKGGESTVENCQILQTRVNRSKSDKAWVEKAEMQGFSCDIKFTDKELDVIEMAVYGDVIR 145
Query: 143 PGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
PG QCRCKTVAEMLGQ KSK++ +A C++
Sbjct: 146 PGKQCRCKTVAEMLGQVKSKNQ--MAACNL 173
>gi|357146087|ref|XP_003573871.1| PREDICTED: uncharacterized protein LOC100831965 [Brachypodium
distachyon]
Length = 180
Score = 241 bits (614), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 116/182 (63%), Positives = 138/182 (75%), Gaps = 6/182 (3%)
Query: 1 MSSSSSSSSPNRSRSSNSNN----SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDA 56
M SS+S + +S+S +ERPR FD K ++ CW KA + GRHPERWR+D
Sbjct: 1 MGPSSTSPREHNGEASSSGRVPKPKEKERPRSFDEKTRAACWRKAAVLAGRHPERWRQDT 60
Query: 57 AGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLK 116
AGNIVC+RF +CHGCLC+EYDHIVPFSKGGEST +NCQILQTRVNR KSDK V+ +++
Sbjct: 61 AGNIVCRRFWSCHGCLCYEYDHIVPFSKGGESTVENCQILQTRVNRSKSDKSWVEKAEMQ 120
Query: 117 GFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVNH 176
G SCD+KFTDKELD+IEMAVYGDVIRPG QCRCKTVAEM+GQ KSKD +A C + N
Sbjct: 121 GASCDIKFTDKELDVIEMAVYGDVIRPGKQCRCKTVAEMIGQVKSKDH--VAACDLPYND 178
Query: 177 CS 178
S
Sbjct: 179 AS 180
>gi|242039281|ref|XP_002467035.1| hypothetical protein SORBIDRAFT_01g018630 [Sorghum bicolor]
gi|241920889|gb|EER94033.1| hypothetical protein SORBIDRAFT_01g018630 [Sorghum bicolor]
Length = 180
Score = 240 bits (613), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 110/160 (68%), Positives = 129/160 (80%), Gaps = 2/160 (1%)
Query: 13 SRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCL 72
R S + + +ERPR FD K ++ CW KA + GRHPERWR DA GN+VC+RF CHGCL
Sbjct: 17 GRRSPAQATEKERPRSFDEKTRTACWRKAAVLAGRHPERWRLDAVGNVVCRRFWGCHGCL 76
Query: 73 CFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDII 132
C+EYDHIVPFSKGGEST +NCQILQTRVNR KSDK V+ +++GFSCD+KFTD+ELDII
Sbjct: 77 CYEYDHIVPFSKGGESTVENCQILQTRVNRSKSDKTWVEQAEMRGFSCDIKFTDQELDII 136
Query: 133 EMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
EMAVYGDVIRPG QCRCKTVAEMLGQ KS +R +A C +
Sbjct: 137 EMAVYGDVIRPGKQCRCKTVAEMLGQVKSMNR--MAACEL 174
>gi|116780247|gb|ABK21604.1| unknown [Picea sitchensis]
Length = 166
Score = 232 bits (592), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 105/156 (67%), Positives = 131/156 (83%), Gaps = 5/156 (3%)
Query: 7 SSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFC 66
S+SP S + N++R R+FD K +++CW KA+ VPGRHPERWRKD AGNIVC+RF
Sbjct: 3 SASP-----SEESARNQKRARFFDEKMRNLCWQKADVVPGRHPERWRKDGAGNIVCRRFW 57
Query: 67 NCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTD 126
+C GCLC+EYDHI+PFSKGGE+T +NCQILQ+RVNR K +K+Q+D T L+ +SCD+KFTD
Sbjct: 58 HCDGCLCYEYDHIIPFSKGGETTVENCQILQSRVNRSKGNKDQLDKTVLEQYSCDLKFTD 117
Query: 127 KELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSK 162
KELD+IEMAVYG+V+RPG QCRCK+VAEMLGQ KSK
Sbjct: 118 KELDVIEMAVYGNVMRPGLQCRCKSVAEMLGQLKSK 153
>gi|195651337|gb|ACG45136.1| endonuclease/ nucleic acid binding protein [Zea mays]
Length = 180
Score = 232 bits (591), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 105/150 (70%), Positives = 123/150 (82%), Gaps = 2/150 (1%)
Query: 23 EERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPF 82
+ERPR FD K ++ CW KA + GRHPERWR+DA GN+VC+RF CHGCLC+EYDHIVPF
Sbjct: 27 KERPRSFDEKTRTACWRKAAVLAGRHPERWRQDAVGNVVCRRFWGCHGCLCYEYDHIVPF 86
Query: 83 SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
SKGGEST +NCQILQTRVNR KSDK V +++GFSCD+ FTDKELDIIEMAVYGDVIR
Sbjct: 87 SKGGESTVENCQILQTRVNRSKSDKAWVAQAEMQGFSCDIMFTDKELDIIEMAVYGDVIR 146
Query: 143 PGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
PG QCRC+TVAE+LGQ K +R +A C +
Sbjct: 147 PGKQCRCRTVAEVLGQVKPTNR--MAACEL 174
>gi|116783732|gb|ABK23066.1| unknown [Picea sitchensis]
Length = 168
Score = 224 bits (572), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 98/142 (69%), Positives = 123/142 (86%)
Query: 21 SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
N +R R+FD K +++CW KA+ V GRHPERWRKD AGNI+C+RF +C GCLC+EYDHI+
Sbjct: 14 GNHKRVRFFDEKMRNLCWQKADVVAGRHPERWRKDGAGNIICRRFGHCEGCLCYEYDHII 73
Query: 81 PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
PFSKGGE+T +NCQILQ+RVNR K +K+Q++ T L+ +SCD+KFTDKELD+IEMAVYG+V
Sbjct: 74 PFSKGGETTVENCQILQSRVNRSKGNKDQLNRTVLEQYSCDLKFTDKELDVIEMAVYGNV 133
Query: 141 IRPGNQCRCKTVAEMLGQFKSK 162
+RPG QCRCK+VAEMLGQ KSK
Sbjct: 134 MRPGLQCRCKSVAEMLGQLKSK 155
>gi|168006043|ref|XP_001755719.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693038|gb|EDQ79392.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 180
Score = 211 bits (536), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 93/138 (67%), Positives = 112/138 (81%)
Query: 27 RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
R FDGK ++ CW KA+ VPGRHPERWRKD AGNIVC+R C GCLC+EYDH+ PFSKGG
Sbjct: 28 RIFDGKMRNQCWQKADVVPGRHPERWRKDVAGNIVCRRLTRCEGCLCYEYDHVRPFSKGG 87
Query: 87 ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGNQ 146
++T DNCQILQ+RVNR KS++ + + LK FSCD+KFTDKELD+IEM VYG+VIRPG Q
Sbjct: 88 QTTVDNCQILQSRVNRMKSNRSDLASDDLKQFSCDLKFTDKELDLIEMGVYGNVIRPGLQ 147
Query: 147 CRCKTVAEMLGQFKSKDR 164
C+CK+VAE LG SK +
Sbjct: 148 CKCKSVAETLGTALSKQK 165
>gi|302822371|ref|XP_002992844.1| hypothetical protein SELMODRAFT_136010 [Selaginella moellendorffii]
gi|300139392|gb|EFJ06134.1| hypothetical protein SELMODRAFT_136010 [Selaginella moellendorffii]
Length = 174
Score = 197 bits (502), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 94/170 (55%), Positives = 121/170 (71%), Gaps = 8/170 (4%)
Query: 3 SSSSSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
++SSSSSP ++ + R FD K + CW KAE VPGRHPERWRKD AGN+VC
Sbjct: 4 ATSSSSSPLPDNAAR-------KLRLFDEKMRRQCWQKAEVVPGRHPERWRKDGAGNVVC 56
Query: 63 KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDV 122
++F C GC+C+E+DHI PFSKGGE+ NCQILQ+RVNR KS+K+ D L+ +SCD+
Sbjct: 57 RKFIGCEGCMCYEFDHIRPFSKGGETDVSNCQILQSRVNRLKSNKD-ADEHLLQQYSCDL 115
Query: 123 KFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
KFT +ELD++EMAVYG+VIRPG QCRCK+V+E LG K + C +
Sbjct: 116 KFTARELDLVEMAVYGNVIRPGLQCRCKSVSETLGNGVHKSKGHEPACEL 165
>gi|302811781|ref|XP_002987579.1| hypothetical protein SELMODRAFT_126371 [Selaginella moellendorffii]
gi|300144733|gb|EFJ11415.1| hypothetical protein SELMODRAFT_126371 [Selaginella moellendorffii]
Length = 174
Score = 197 bits (502), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 94/170 (55%), Positives = 121/170 (71%), Gaps = 8/170 (4%)
Query: 3 SSSSSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
++SSSSSP ++ + R FD K + CW KAE VPGRHPERWRKD AGN+VC
Sbjct: 4 ATSSSSSPLPDNAAR-------KLRLFDEKMRRQCWEKAEVVPGRHPERWRKDGAGNVVC 56
Query: 63 KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDV 122
++F C GC+C+E+DHI PFSKGGE+ NCQILQ+RVNR KS+K+ D L+ +SCD+
Sbjct: 57 RKFIGCEGCMCYEFDHIRPFSKGGETDVSNCQILQSRVNRLKSNKD-ADEHLLQQYSCDL 115
Query: 123 KFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
KFT +ELD++EMAVYG+VIRPG QCRCK+V+E LG K + C +
Sbjct: 116 KFTARELDLVEMAVYGNVIRPGLQCRCKSVSETLGNGVHKSKGHEPACEL 165
>gi|363814422|ref|NP_001242848.1| uncharacterized protein LOC100775923 [Glycine max]
gi|255641881|gb|ACU21209.1| unknown [Glycine max]
Length = 121
Score = 186 bits (472), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 86/124 (69%), Positives = 101/124 (81%), Gaps = 5/124 (4%)
Query: 6 SSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
SSSSP RSR E+RPR+FD AK++CW KA+TVPGRHPERWRKDAAGN+VCKRF
Sbjct: 2 SSSSPQRSR-----GDGEKRPRFFDSNAKAICWGKADTVPGRHPERWRKDAAGNVVCKRF 56
Query: 66 CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
NC GCLC+EYDHI+PFSKGGESTADNCQILQ+RVNRFKSDK +D+ +LK +SC+V
Sbjct: 57 FNCIGCLCYEYDHIIPFSKGGESTADNCQILQSRVNRFKSDKYNIDSEQLKDYSCEVILL 116
Query: 126 DKEL 129
+ L
Sbjct: 117 IRSL 120
>gi|452824505|gb|EME31507.1| endonuclease [Galdieria sulphuraria]
Length = 146
Score = 166 bits (420), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 73/129 (56%), Positives = 98/129 (75%), Gaps = 1/129 (0%)
Query: 27 RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
R F + K+ CW KA VPGR P+RWR DA N VCKR +C GCLC EYDH++P+S+GG
Sbjct: 12 RIFSNRVKAACWEKATPVPGRDPDRWRLDAFRNPVCKRLTSCEGCLCHEYDHVIPYSQGG 71
Query: 87 ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGNQ 146
ST +NCQILQTR+NR K+D+ Q+ A +L+ FSC++ F+++ELD+IEMAVYG+V R +
Sbjct: 72 ASTVENCQILQTRINRLKADR-QLTAEQLESFSCEITFSERELDLIEMAVYGNVQRDHFR 130
Query: 147 CRCKTVAEM 155
CRCK+ E+
Sbjct: 131 CRCKSFFEV 139
>gi|226504958|ref|NP_001144283.1| uncharacterized protein LOC100277163 [Zea mays]
gi|195639630|gb|ACG39283.1| hypothetical protein [Zea mays]
Length = 139
Score = 154 bits (388), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 78/157 (49%), Positives = 93/157 (59%), Gaps = 43/157 (27%)
Query: 16 SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
S + + +ERPR FD K ++ CW KA + GRHPERWR+DA GN+VC+RF CHGCLC+E
Sbjct: 20 SPAPATWKERPRSFDEKTRTACWRKAAVLSGRHPERWRQDAVGNVVCRRFWGCHGCLCYE 79
Query: 76 YDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMA 135
YDHIVPFSK DKELDIIEMA
Sbjct: 80 YDHIVPFSK-----------------------------------------DKELDIIEMA 98
Query: 136 VYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
VYGDVIRPG QCRC+TVAE+LGQ K +R +A C +
Sbjct: 99 VYGDVIRPGKQCRCRTVAEVLGQVKPTNR--MAACEL 133
>gi|242036789|ref|XP_002465789.1| hypothetical protein SORBIDRAFT_01g045830 [Sorghum bicolor]
gi|241919643|gb|EER92787.1| hypothetical protein SORBIDRAFT_01g045830 [Sorghum bicolor]
Length = 188
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 61/125 (48%), Positives = 85/125 (68%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE VPGR PERWR+DA GNIV ++ C GCLC +YDHIVP+SKG
Sbjct: 62 PRSFSYAVKQQCWEKAERVPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHIVPYSKG 121
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
G+ST +NCQ+LQ VNR K +K ++ ++L S + + +++D++E++ YG+V R +
Sbjct: 122 GKSTLENCQVLQATVNRSKGNKTEISKSELIQKSAYCRVSGRDMDLVELSAYGNVRRGPD 181
Query: 146 QCRCK 150
CK
Sbjct: 182 SGGCK 186
>gi|293331837|ref|NP_001168386.1| uncharacterized protein LOC100382155 [Zea mays]
gi|223947943|gb|ACN28055.1| unknown [Zea mays]
gi|414865044|tpg|DAA43601.1| TPA: hypothetical protein ZEAMMB73_851898 [Zea mays]
Length = 188
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 60/125 (48%), Positives = 85/125 (68%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE +PGR PERWR+DA GNIV ++ C GCLC +YDHIVP+SKG
Sbjct: 62 PRSFSYAVKQQCWEKAERLPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHIVPYSKG 121
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
G+ST +NCQ+LQ VNR K +K ++ ++L S + + +++D++E++ YG+V R +
Sbjct: 122 GKSTLENCQVLQATVNRSKGNKTEISKSELIQKSAYCRVSGRDMDLVELSAYGNVRRGPD 181
Query: 146 QCRCK 150
CK
Sbjct: 182 SGGCK 186
>gi|357113864|ref|XP_003558721.1| PREDICTED: uncharacterized protein LOC100825547 [Brachypodium
distachyon]
Length = 188
Score = 135 bits (339), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 59/125 (47%), Positives = 84/125 (67%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE VPGR PERWR+DA GN+V ++ C GCLC +YDHIVP+SKG
Sbjct: 62 PRSFSYAVKQQCWEKAEKVPGRDPERWRRDALGNMVFRKLVGCPGCLCHDYDHIVPYSKG 121
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
G+ST +NCQ+LQ VNR K +K ++ ++L S + + +++D+ E++ +G+V R +
Sbjct: 122 GKSTLENCQVLQATVNRSKGNKTEISKSELAQRSAYCRVSGRDMDLFELSAFGNVRRGPD 181
Query: 146 QCRCK 150
CK
Sbjct: 182 SGGCK 186
>gi|222624269|gb|EEE58401.1| hypothetical protein OsJ_09573 [Oryza sativa Japonica Group]
Length = 187
Score = 134 bits (336), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 57/117 (48%), Positives = 82/117 (70%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE +PGR PERWR+DA GN+V ++ C GCLC +YDHIVP+SKG
Sbjct: 61 PRSFPYVVKQRCWEKAERLPGRDPERWRRDALGNVVFRKLVGCPGCLCHDYDHIVPYSKG 120
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
G+ST +NCQ+LQ VNR K +K ++ ++L S + + +++D++E++ YG+V R
Sbjct: 121 GKSTLENCQVLQATVNRSKGNKTELSKSELIQKSAYCRVSGRDMDLLELSAYGNVRR 177
>gi|229594535|ref|XP_001023127.3| HNH endonuclease family protein [Tetrahymena thermophila]
gi|225566791|gb|EAS02882.3| HNH endonuclease family protein [Tetrahymena thermophila SB210]
Length = 208
Score = 134 bits (336), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 61/120 (50%), Positives = 78/120 (65%)
Query: 23 EERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPF 82
+++PR F K CW A GRHP+RWR DA GN V + +C G LC EYDHIVPF
Sbjct: 37 QKQPRCFTKSMKDQCWNNATIFIGRHPDRWRLDAVGNPVLRGLTSCLGPLCHEYDHIVPF 96
Query: 83 SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
SKGG++ NCQILQT VNR+KS+K +L S KFT++E D IE A+YGD+++
Sbjct: 97 SKGGKTEISNCQILQTMVNRYKSNKTNQSHKELIAVSSTNKFTNQEFDSIERAIYGDILK 156
>gi|326534162|dbj|BAJ89431.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 180
Score = 133 bits (334), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 58/125 (46%), Positives = 83/125 (66%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE VPGR PERWR+DA GNIV ++ C GCLC +YDHI+P+SKG
Sbjct: 54 PRSFPHAVKQQCWEKAEKVPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHILPYSKG 113
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
G+ST +NCQ+LQ VNR K + ++ ++L S + + +++D+ E++ +G+V R +
Sbjct: 114 GKSTLENCQVLQATVNRSKGNNTEISKSELAQRSAYCRVSGRDMDLFEVSAFGNVRRGPD 173
Query: 146 QCRCK 150
CK
Sbjct: 174 SGGCK 178
>gi|15232817|ref|NP_190333.1| HNH endonuclease [Arabidopsis thaliana]
gi|6522532|emb|CAB61975.1| hypothetical protein [Arabidopsis thaliana]
gi|117168237|gb|ABK32201.1| At3g47490 [Arabidopsis thaliana]
gi|332644767|gb|AEE78288.1| HNH endonuclease [Arabidopsis thaliana]
Length = 234
Score = 132 bits (332), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 58/125 (46%), Positives = 80/125 (64%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE + GR PERWR+D GNIV ++ C GCLC +YDHIVP+SKG
Sbjct: 108 PRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 167
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
G+ST +NCQ+LQ +VNR K +K + ++L S + +++D+IE+ YG+V R
Sbjct: 168 GKSTLENCQVLQAKVNRSKGNKTDISRSELIQRSSYCRVAGRDMDLIELTAYGNVQRAPT 227
Query: 146 QCRCK 150
C+
Sbjct: 228 SSGCR 232
>gi|168048449|ref|XP_001776679.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671971|gb|EDQ58515.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 129
Score = 132 bits (332), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 58/114 (50%), Positives = 76/114 (66%)
Query: 27 RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
R F K CWAKAETV GR P RWR+DA GN+V ++ C GCLC +YDHIVP+SKGG
Sbjct: 1 RSFPHDVKEACWAKAETVKGRDPARWRRDALGNVVFRKLVGCQGCLCHDYDHIVPYSKGG 60
Query: 87 ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
ST DNCQ+LQ NRFK + V T+++ S T +++D +E++ YG+V
Sbjct: 61 PSTLDNCQVLQAPANRFKGARINVPVTEMQQHSAYCSLTGRDMDFVELSAYGNV 114
>gi|413934346|gb|AFW68897.1| hypothetical protein ZEAMMB73_497763 [Zea mays]
Length = 132
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/90 (70%), Positives = 73/90 (81%), Gaps = 2/90 (2%)
Query: 83 SKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
S GGEST +NCQILQTRVNR KSDK V +++GFSCD+ FTDKELDIIEMAVYGDVIR
Sbjct: 39 STGGESTVENCQILQTRVNRSKSDKAWVAQAEMQGFSCDIMFTDKELDIIEMAVYGDVIR 98
Query: 143 PGNQCRCKTVAEMLGQFKSKDRWLLANCHM 172
PG QCRC+TVAE+LGQ K +R +A C +
Sbjct: 99 PGKQCRCRTVAEVLGQVKPTNR--MAACEL 126
>gi|115451005|ref|NP_001049103.1| Os03g0169800 [Oryza sativa Japonica Group]
gi|19071624|gb|AAL84291.1|AC073556_8 putative endonuclease [Oryza sativa Japonica Group]
gi|108706401|gb|ABF94196.1| HNH endonuclease domain-containing protein, putative, expressed
[Oryza sativa Japonica Group]
gi|113547574|dbj|BAF11017.1| Os03g0169800 [Oryza sativa Japonica Group]
gi|215686970|dbj|BAG90840.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218192172|gb|EEC74599.1| hypothetical protein OsI_10187 [Oryza sativa Indica Group]
Length = 187
Score = 131 bits (329), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 57/117 (48%), Positives = 81/117 (69%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE V GR PERWR+DA GN+V ++ C GCLC +YDHIVP+SKG
Sbjct: 61 PRSFPYVVKQRCWEKAERVAGRDPERWRRDALGNVVFRKLVGCPGCLCHDYDHIVPYSKG 120
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
G+ST +NCQ+LQ VNR K +K ++ ++L S + + +++D++E++ YG+V R
Sbjct: 121 GKSTLENCQVLQATVNRSKGNKTELSKSELIQKSAYCRVSGRDMDLLELSAYGNVRR 177
>gi|145521759|ref|XP_001446729.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414218|emb|CAK79332.1| unnamed protein product [Paramecium tetraurelia]
Length = 139
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 58/113 (51%), Positives = 77/113 (68%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR FD A+ CW A+ + GR+P+RWR DA GN VCK C G LC+++DHIVP+SKG
Sbjct: 27 PRRFDKLAREECWNNAQIMEGRNPDRWRLDAVGNPVCKALNACRGPLCYQFDHIVPYSKG 86
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYG 138
GES + NCQ+LQ+ VN +KS+KE + +LK S + FT + D IE A+YG
Sbjct: 87 GESISKNCQLLQSFVNIYKSNKENISKDELKQVSPQLNFTSEVFDAIEYAIYG 139
>gi|297815918|ref|XP_002875842.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata
subsp. lyrata]
gi|297321680|gb|EFH52101.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata
subsp. lyrata]
Length = 225
Score = 130 bits (327), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 60/135 (44%), Positives = 83/135 (61%), Gaps = 1/135 (0%)
Query: 16 SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
S ++ N E PR F K CW KAE + GR PERWR+D GNIV ++ C GCLC +
Sbjct: 90 SGYDDPNPE-PRSFPYSVKQQCWEKAEKIKGRDPERWRRDPLGNIVFRKLVGCPGCLCHD 148
Query: 76 YDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMA 135
YDHIVP+SKGG+S +NCQ+LQ +VNR K +K + +L S + +++D+IE+
Sbjct: 149 YDHIVPYSKGGKSILENCQVLQAKVNRSKGNKTDISRAELIQRSSYCRVAGRDMDLIELT 208
Query: 136 VYGDVIRPGNQCRCK 150
YG+V R C+
Sbjct: 209 AYGNVQRAPESSGCR 223
>gi|145512195|ref|XP_001442014.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409286|emb|CAK74617.1| unnamed protein product [Paramecium tetraurelia]
Length = 167
Score = 129 bits (323), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 58/113 (51%), Positives = 78/113 (69%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR FD A+ C A+ + GR+P+RWR DA GN VCK +C G LC+++DHIVP+SKG
Sbjct: 55 PRRFDKLAREECRNNAQIMEGRNPDRWRLDAVGNPVCKALNSCRGSLCYQFDHIVPYSKG 114
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYG 138
GES + NCQ+LQ+ VN +KS+KE + +LK S +KFT + D IE A+YG
Sbjct: 115 GESISINCQLLQSFVNIYKSNKENISKDELKHVSPQLKFTPVQFDAIEYAIYG 167
>gi|255547375|ref|XP_002514745.1| endonuclease, putative [Ricinus communis]
gi|223546349|gb|EEF47851.1| endonuclease, putative [Ricinus communis]
Length = 203
Score = 127 bits (319), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 76/115 (66%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE V GR P+RWR+D GNIV ++ C GCLC +YDHIVP+SKG
Sbjct: 77 PRSFPYSVKQQCWDKAEKVKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 136
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
G+ST +NCQ+LQ VNR K ++ ++ L S + +++D+IE++ YG+V
Sbjct: 137 GKSTLENCQVLQATVNRSKGNRTELSRADLIQRSSYCRVAGRDMDLIELSAYGNV 191
>gi|225457397|ref|XP_002284877.1| PREDICTED: uncharacterized protein LOC100262165 isoform 1 [Vitis
vinifera]
Length = 199
Score = 127 bits (319), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 78/117 (66%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE + GR P+RWR+D GNIV ++ C GCLC +YDHIVP+SKG
Sbjct: 73 PRSFPYSVKQQCWEKAEKIKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 132
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
G+ST +NCQ+LQ VNR K ++ ++ L S + + +++D++E++ YG+V R
Sbjct: 133 GKSTLENCQVLQATVNRSKGNRTEISKADLIQKSSYCRVSGRDMDLLELSAYGNVRR 189
>gi|449477241|ref|XP_004154969.1| PREDICTED: uncharacterized protein LOC101226861 [Cucumis sativus]
Length = 200
Score = 127 bits (319), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 82/125 (65%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE + GR P+RWR+DA GN V ++ C GCLC +YDHI+P+SKG
Sbjct: 74 PRSFPHSVKQQCWDKAEKIKGRDPDRWRRDALGNTVFRKLVGCPGCLCHDYDHILPYSKG 133
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
G+ST +NCQ+LQ VNR K ++ ++ +L S + + +++D++E++ YG+V R +
Sbjct: 134 GKSTLENCQVLQATVNRSKGNRTEISRAELIQKSSYCRVSGRDMDLLELSAYGNVHRGQD 193
Query: 146 QCRCK 150
C+
Sbjct: 194 SGGCR 198
>gi|449440796|ref|XP_004138170.1| PREDICTED: uncharacterized protein LOC101222461 [Cucumis sativus]
Length = 200
Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 82/125 (65%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE + GR P+RWR+DA GN V ++ C GCLC +YDHI+P+SKG
Sbjct: 74 PRSFPHSVKQQCWDKAEKIKGRDPDRWRRDALGNTVFRKLVGCPGCLCHDYDHILPYSKG 133
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN 145
G+ST +NCQ+LQ VNR K ++ ++ +L S + + +++D++E++ YG+V R +
Sbjct: 134 GKSTLENCQVLQATVNRSKGNRTEISRAELIQKSSYCRVSGRDMDLLELSAYGNVHRGQD 193
Query: 146 QCRCK 150
C+
Sbjct: 194 SGGCR 198
>gi|351720795|ref|NP_001237956.1| uncharacterized protein LOC100500174 [Glycine max]
gi|255629565|gb|ACU15130.1| unknown [Glycine max]
Length = 206
Score = 126 bits (316), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 82/124 (66%)
Query: 27 RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
R F K CW KA+ V GR P+RWR+DA GN + ++ C GCLC +YDHI+P+SKGG
Sbjct: 81 RSFPHSVKQKCWEKADKVKGRDPDRWRRDALGNTLFRKLVGCPGCLCHDYDHIIPYSKGG 140
Query: 87 ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGNQ 146
EST +NCQ+LQ VNR K ++ ++ +L S + +D+++D++E++ YG+V R +
Sbjct: 141 ESTLENCQVLQATVNRSKGNRTEISKAELIQKSSYCRVSDRDMDLLELSAYGNVRRGPDS 200
Query: 147 CRCK 150
C+
Sbjct: 201 GGCR 204
>gi|388520967|gb|AFK48545.1| unknown [Lotus japonicus]
Length = 198
Score = 124 bits (312), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 90/152 (59%), Gaps = 13/152 (8%)
Query: 2 SSSSSSSSPNRSRSSNSNNSNEE-------------RPRYFDGKAKSMCWAKAETVPGRH 48
+SS SSSP RS S ++ +++ PR F K CW KA+ + GR
Sbjct: 35 TSSKLSSSPKRSTSPSATLLDDDPVALFDDYTHPNPNPRSFPHSVKQKCWEKADKIKGRD 94
Query: 49 PERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKE 108
P+RWR+D GN + ++ C GCLC +YDHIVP+SKGG+ST +NCQ+LQ VNR K ++
Sbjct: 95 PDRWRRDPLGNTIFRKLVGCPGCLCHDYDHIVPYSKGGQSTLENCQVLQATVNRAKGNRT 154
Query: 109 QVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
+ L S + + +++D++E++ YG+V
Sbjct: 155 GISRADLMQKSSYCRVSGRDMDLLELSAYGNV 186
>gi|224104735|ref|XP_002313547.1| predicted protein [Populus trichocarpa]
gi|222849955|gb|EEE87502.1| predicted protein [Populus trichocarpa]
Length = 128
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 78/117 (66%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE V GR P+RWR+D GNIV ++ C GCLC +YDHIVP+SKG
Sbjct: 2 PRSFPYSVKQQCWEKAEKVKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 61
Query: 86 GESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
G+S+ +NCQ+LQ VNR K ++ ++ +L S + + +++D +E++ YG+V R
Sbjct: 62 GKSSLENCQVLQATVNRSKGNRTELSRAELIRRSSYCRVSGRDMDFVELSAYGNVRR 118
>gi|449018629|dbj|BAM82031.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 188
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 55/121 (45%), Positives = 76/121 (62%), Gaps = 4/121 (3%)
Query: 27 RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
R F +++CW A+ VPGRHP RWR D GN+VC+ C+G LC+EYDH P+ KGG
Sbjct: 27 RLFSHTERNICWRNAQVVPGRHPRRWRFDRVGNLVCRALHGCNGPLCYEYDHRAPWGKGG 86
Query: 87 ESTADNCQILQTRVNRFKSDKEQVDATK--LKGFS--CDVKFTDKELDIIEMAVYGDVIR 142
S + NC +LQT NR+KSD+ + T+ L+ S + +ELD IE +YGD++R
Sbjct: 87 RSDSTNCDVLQTSANRWKSDQPESRVTEKLLRAHSRVLSRSLSQEELDTIEYVIYGDILR 146
Query: 143 P 143
P
Sbjct: 147 P 147
>gi|452821821|gb|EME28847.1| endonuclease isoform 1 [Galdieria sulphuraria]
Length = 199
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 82/149 (55%), Gaps = 5/149 (3%)
Query: 3 SSSSSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
S SS + +S + + F K CW KA+ V GR P RWR+D AGN++C
Sbjct: 35 SLVSSLQGTKLKSPGIEKKEAKVLKQFQHIDKEACWLKADAVRGRDPSRWRRDLAGNVIC 94
Query: 63 KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCD- 121
K+ C G C ++DH+VP SKGGES DNCQ+LQT NR K D+ T +K + +
Sbjct: 95 KKLRGCRGPFCMDFDHVVPLSKGGESVLDNCQVLQTTANRHKGDRLVEPTTDIKKYYKEI 154
Query: 122 ----VKFTDKELDIIEMAVYGDVIRPGNQ 146
V + LDI+E VYGDVIR G +
Sbjct: 155 SDRRVLPLQRHLDIVEFYVYGDVIRHGRR 183
>gi|116794423|gb|ABK27138.1| unknown [Picea sitchensis]
Length = 200
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 55/122 (45%), Positives = 78/122 (63%)
Query: 21 SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
S+E PR F K CW KA+ + GR P+RWR+DA GN V ++ C GCLC +YDHI
Sbjct: 69 SSEYNPRAFPYHIKQACWEKADKIKGRDPDRWRRDALGNTVFRKLVGCSGCLCHDYDHIH 128
Query: 81 PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
P+SKGG+ST +NCQ+LQ NR KS+ + T L S + + +++D +E++ YG+V
Sbjct: 129 PYSKGGKSTVENCQVLQGAANRAKSNHTDMSTTDLIQKSAYCRVSGRDMDFLELSAYGNV 188
Query: 141 IR 142
R
Sbjct: 189 HR 190
>gi|449015785|dbj|BAM79187.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 231
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 93/171 (54%), Gaps = 14/171 (8%)
Query: 27 RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
R F + W +A+ VPGR PERWR D GN V + C GC C+E+DH VP+S GG
Sbjct: 26 RSFTEAQRKAAWERAKPVPGRDPERWRYDVYGNPVLRPLHGCLGCFCYEFDHRVPYSLGG 85
Query: 87 ESTADNCQILQTRVNRFKSDKEQ---VDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRP 143
++ DNC ILQTR+NR KS+ + + +++ ++C+ ++D IE AVYGDV R
Sbjct: 86 ATSLDNCDILQTRINRLKSNALEGVNISREQIRQYACEDVSNVIDMDAIEFAVYGDVRRE 145
Query: 144 GNQCRC-----KTVAEML------GQFKSKDRWLLANCHMMVNHCSTRGSC 183
G CR K +EML +F+S + + M ++ S SC
Sbjct: 146 GLHCRAYSLFEKVASEMLVSTGRVSRFESGSPAVTEHSPMKISRQSNIPSC 196
>gi|197306242|gb|ACH59472.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
Length = 84
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 51/79 (64%), Positives = 65/79 (82%)
Query: 15 SSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCF 74
SS N SN+++ R+FD K +++CW KA+ V GRHPERWRKD AGNIVC+RF +C GCLC+
Sbjct: 6 SSEDNPSNQKKVRFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCY 65
Query: 75 EYDHIVPFSKGGESTADNC 93
EYDHI+PFSKGG +T +NC
Sbjct: 66 EYDHIIPFSKGGGTTVENC 84
>gi|334185796|ref|NP_001190027.1| HNH endonuclease [Arabidopsis thaliana]
gi|332644768|gb|AEE78289.1| HNH endonuclease [Arabidopsis thaliana]
Length = 215
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 84/155 (54%), Gaps = 21/155 (13%)
Query: 16 SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
S + N E PR F K CW KAE + GR PERWR+D GNIV ++ C GCLC +
Sbjct: 60 SGYGDPNPE-PRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVGCPGCLCHD 118
Query: 76 YDHIVPFS--------------------KGGESTADNCQILQTRVNRFKSDKEQVDATKL 115
YDHIVP+S KGG+ST +NCQ+LQ +VNR K +K + ++L
Sbjct: 119 YDHIVPYSKDMCPLEHAFDEPHKDFLDLKGGKSTLENCQVLQAKVNRSKGNKTDISRSEL 178
Query: 116 KGFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCK 150
S + +++D+IE+ YG+V R C+
Sbjct: 179 IQRSSYCRVAGRDMDLIELTAYGNVQRAPTSSGCR 213
>gi|197306240|gb|ACH59471.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306260|gb|ACH59481.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
Length = 84
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 47/72 (65%), Positives = 61/72 (84%)
Query: 22 NEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVP 81
N+++ R+FD K +++CW KA+ V GRHPERWRKD AGNIVC+RF +C GCLC+EYDHI+P
Sbjct: 13 NQKKVRFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIP 72
Query: 82 FSKGGESTADNC 93
FSKGG +T +NC
Sbjct: 73 FSKGGGTTVENC 84
>gi|197306236|gb|ACH59469.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
Length = 84
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 47/72 (65%), Positives = 61/72 (84%)
Query: 22 NEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVP 81
N+++ R+FD K +++CW KA+ V GRHPERWRKD AGNIVC+RF +C GCLC+EYDHI+P
Sbjct: 13 NQKKVRFFDEKMRNLCWQKADVVTGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIP 72
Query: 82 FSKGGESTADNC 93
FSKGG +T +NC
Sbjct: 73 FSKGGGTTVENC 84
>gi|197306216|gb|ACH59459.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306218|gb|ACH59460.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306220|gb|ACH59461.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306222|gb|ACH59462.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306224|gb|ACH59463.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306226|gb|ACH59464.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306228|gb|ACH59465.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306230|gb|ACH59466.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306232|gb|ACH59467.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306234|gb|ACH59468.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306238|gb|ACH59470.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306244|gb|ACH59473.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306246|gb|ACH59474.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306248|gb|ACH59475.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306250|gb|ACH59476.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306252|gb|ACH59477.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306254|gb|ACH59478.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306256|gb|ACH59479.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306258|gb|ACH59480.1| HNH endonuclease domain-containing protein [Pseudotsuga
menziesii]
gi|197306262|gb|ACH59482.1| HNH endonuclease domain-containing protein [Pseudotsuga
macrocarpa]
Length = 84
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 47/72 (65%), Positives = 61/72 (84%)
Query: 22 NEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVP 81
N+++ R+FD K +++CW KA+ V GRHPERWRKD AGNIVC+RF +C GCLC+EYDHI+P
Sbjct: 13 NQKKVRFFDEKMRNLCWQKADVVAGRHPERWRKDGAGNIVCRRFWHCEGCLCYEYDHIIP 72
Query: 82 FSKGGESTADNC 93
FSKGG +T +NC
Sbjct: 73 FSKGGGTTVENC 84
>gi|452821820|gb|EME28846.1| endonuclease isoform 2 [Galdieria sulphuraria]
Length = 190
Score = 114 bits (286), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 76/141 (53%), Gaps = 5/141 (3%)
Query: 3 SSSSSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVC 62
S SS + +S + + F K CW KA+ V GR P RWR+D AGN++C
Sbjct: 35 SLVSSLQGTKLKSPGIEKKEAKVLKQFQHIDKEACWLKADAVRGRDPSRWRRDLAGNVIC 94
Query: 63 KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCD- 121
K+ C G C ++DH+VP SKGGES DNCQ+LQT NR K D+ T +K + +
Sbjct: 95 KKLRGCRGPFCMDFDHVVPLSKGGESVLDNCQVLQTTANRHKGDRLVEPTTDIKKYYKEI 154
Query: 122 ----VKFTDKELDIIEMAVYG 138
V + LDI+E VYG
Sbjct: 155 SDRRVLPLQRHLDIVEFYVYG 175
>gi|357113854|ref|XP_003558716.1| PREDICTED: uncharacterized protein LOC100824123 [Brachypodium
distachyon]
Length = 246
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 92/169 (54%), Gaps = 24/169 (14%)
Query: 2 SSSSSSSSPNRSRSSNSNN-----SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDA 56
S+ + P S+ +N+ NEE+ F + CW AE VPGRHP RWRKD
Sbjct: 42 SADDENKDPKDSQQANARKRTVLKKNEEK---FPRAVRKWCWINAEEVPGRHPGRWRKDP 98
Query: 57 AGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSD-----KEQVD 111
GN++CK+ G F YDHI P S+GG+ +NCQ+L +N FKS+ +E VD
Sbjct: 99 YGNVLCKKQTTTLGLSYFHYDHIHPRSRGGKGVKENCQLLLAYLNIFKSNDIKTREEMVD 158
Query: 112 ATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGN-----QCRCKTVAEM 155
S + ++++ELD+IE VYG+V+RPG+ +CR E+
Sbjct: 159 N------SYHITYSNEELDLIECNVYGNVVRPGHAETDPECRTYLKKEI 201
>gi|340502170|gb|EGR28883.1| hypothetical protein IMG5_167370 [Ichthyophthirius multifiliis]
Length = 121
Score = 112 bits (279), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 51/93 (54%), Positives = 66/93 (70%)
Query: 46 GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKS 105
GRHPERWR DA GN V K +C+G LC++YDHIVP+SKGGE+ +NCQ+LQ+ VN+ KS
Sbjct: 3 GRHPERWRLDAVGNPVIKSLTSCNGPLCYQYDHIVPYSKGGETQIENCQLLQSIVNKHKS 62
Query: 106 DKEQVDATKLKGFSCDVKFTDKELDIIEMAVYG 138
+K ++LK S FT+ E D IE A+YG
Sbjct: 63 NKIDATYSELKNVSPKDDFTNFEYDCIEKAIYG 95
>gi|359491703|ref|XP_003634310.1| PREDICTED: uncharacterized protein LOC100262165 isoform 2 [Vitis
vinifera]
Length = 167
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 43/78 (55%), Positives = 55/78 (70%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE + GR P+RWR+D GNIV ++ C GCLC +YDHIVP+SKG
Sbjct: 73 PRSFPYSVKQQCWEKAEKIKGRDPDRWRRDPLGNIVFRKLVGCPGCLCHDYDHIVPYSKG 132
Query: 86 GESTADNCQILQTRVNRF 103
G+ST +NCQ+LQ ++ F
Sbjct: 133 GKSTLENCQVLQENMDIF 150
>gi|414865043|tpg|DAA43600.1| TPA: hypothetical protein ZEAMMB73_851898 [Zea mays]
Length = 141
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/71 (56%), Positives = 48/71 (67%), Gaps = 1/71 (1%)
Query: 26 PRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKG 85
PR F K CW KAE +PGR PERWR+DA GNIV ++ C GCLC +YDHIVP+SK
Sbjct: 62 PRSFSYAVKQQCWEKAERLPGRDPERWRRDALGNIVFRKLVGCPGCLCHDYDHIVPYSKV 121
Query: 86 GESTADNCQIL 96
+S DN + L
Sbjct: 122 PDSN-DNPKSL 131
>gi|42572595|ref|NP_974393.1| HNH endonuclease [Arabidopsis thaliana]
gi|332644766|gb|AEE78287.1| HNH endonuclease [Arabidopsis thaliana]
Length = 176
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/69 (53%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
Query: 16 SNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
S + N E PR F K CW KAE + GR PERWR+D GNIV ++ C GCLC +
Sbjct: 99 SGYGDPNPE-PRSFPYSVKQQCWEKAEKIKGRDPERWRRDHLGNIVFRKLVGCPGCLCHD 157
Query: 76 YDHIVPFSK 84
YDHIVP+SK
Sbjct: 158 YDHIVPYSK 166
>gi|414866927|tpg|DAA45484.1| TPA: hypothetical protein ZEAMMB73_528864 [Zea mays]
Length = 400
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 59/106 (55%), Gaps = 25/106 (23%)
Query: 96 LQTRVNRFKSDKEQVD-------------ATKLKGFSCD--------VKFTDKELDIIEM 134
LQ F SD E + A +++GFSC+ + DKELDIIEM
Sbjct: 41 LQADPATFTSDPEDAEDFAAWSKEALSQAAAEMQGFSCESDKAWVAQAEMQDKELDIIEM 100
Query: 135 AVYGDVIRPGNQCRCKTVAEMLGQFKSKDR---WLLAN-CHMMVNH 176
AVYGDVIRPG QCRC+ VAE+LGQ K +R W L + ++ +NH
Sbjct: 101 AVYGDVIRPGKQCRCRIVAEVLGQVKPTNRMAAWPLKDHIYLHLNH 146
>gi|38605769|emb|CAE05870.3| OSJNBa0044K18.12 [Oryza sativa Japonica Group]
Length = 204
Score = 79.0 bits (193), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 21 SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
S +PR F + CW A V GR +RWR DA GN V K H
Sbjct: 18 SAPAKPRSFSADVREACWNNAPKVIGRSGDRWRLDANGNPVLKALNGKH----------- 66
Query: 81 PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
GG + NCQI+QT NR KS++ + ++KG S + +T LD IE+AVY V
Sbjct: 67 ----GGPNDQSNCQIIQTGANRSKSNR-ILRKDQMKGLSRTMNWTHPHLDTIEIAVYASV 121
Query: 141 IRPGNQCRCKTVAEM 155
R C C T+ +
Sbjct: 122 ERDDLSCTCPTIDNL 136
>gi|414866926|tpg|DAA45483.1| TPA: hypothetical protein ZEAMMB73_528864 [Zea mays]
Length = 468
Score = 79.0 bits (193), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 59/106 (55%), Gaps = 25/106 (23%)
Query: 96 LQTRVNRFKSDKEQVD-------------ATKLKGFSCD--------VKFTDKELDIIEM 134
LQ F SD E + A +++GFSC+ + DKELDIIEM
Sbjct: 41 LQADPATFTSDPEDAEDFAAWSKEALSQAAAEMQGFSCESDKAWVAQAEMQDKELDIIEM 100
Query: 135 AVYGDVIRPGNQCRCKTVAEMLGQFKSKDR---WLLAN-CHMMVNH 176
AVYGDVIRPG QCRC+ VAE+LGQ K +R W L + ++ +NH
Sbjct: 101 AVYGDVIRPGKQCRCRIVAEVLGQVKPTNRMAAWPLKDHIYLHLNH 146
>gi|414866985|tpg|DAA45542.1| TPA: hypothetical protein ZEAMMB73_014165 [Zea mays]
Length = 463
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 59/111 (53%), Gaps = 25/111 (22%)
Query: 92 NCQILQTRVNRFKSDKEQVD-------------ATKLKGFSCD--------VKFTDKELD 130
+LQ F SD E + A +++GFSC+ + DKELD
Sbjct: 152 QLHVLQADPATFTSDLEDAEDFAAWSKEALSQAAAEMQGFSCESDKAWVAQAEMQDKELD 211
Query: 131 IIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDR---WLLAN-CHMMVNHC 177
IIEMA+YGDVIRPG QCRC+ VAE+LGQ K + W L + ++ +NH
Sbjct: 212 IIEMAMYGDVIRPGKQCRCRIVAEVLGQVKPTNHMAAWPLKDHIYLHLNHL 262
>gi|125548724|gb|EAY94546.1| hypothetical protein OsI_16320 [Oryza sativa Indica Group]
Length = 232
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 62/135 (45%), Gaps = 16/135 (11%)
Query: 21 SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
S +PR F + CW A V GR +R R DA GN V K H
Sbjct: 18 SAPAKPRSFSADVREACWNNAPKVIGRSGDRCRLDANGNPVLKALNGKH----------- 66
Query: 81 PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDV 140
GG + NCQI+QT NR KS++ + ++KG S + +T LD IE+AVY V
Sbjct: 67 ----GGPNDQSNCQIIQTGANRSKSNR-ILRKDQMKGLSRTMNWTHPHLDTIEIAVYASV 121
Query: 141 IRPGNQCRCKTVAEM 155
R C C T+ +
Sbjct: 122 ERDDLSCTCPTIDNL 136
>gi|115345574|ref|YP_771756.1| hypothetical protein RD1_A0048 [Roseobacter denitrificans OCh 114]
gi|115292895|gb|ABI93348.1| OSJNBa0044K18.12 [Roseobacter denitrificans OCh 114]
Length = 96
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 41/71 (57%)
Query: 37 CWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
W KA V G++P +R+DA GN++ K N + ++ DH+ P SKGG N Q L
Sbjct: 12 VWNKARPVKGKNPNLYRRDAQGNVIYKPAYNRDSAMGWQVDHVWPRSKGGSDARKNLQAL 71
Query: 97 QTRVNRFKSDK 107
QT N+ KS+K
Sbjct: 72 QTGANKRKSNK 82
>gi|222629057|gb|EEE61189.1| hypothetical protein OsJ_15190 [Oryza sativa Japonica Group]
Length = 118
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 48/105 (45%), Gaps = 16/105 (15%)
Query: 21 SNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIV 80
S +PR F + CW A V GR +RWR DA GN V K H
Sbjct: 18 SAPAKPRSFSADVREACWNNAPKVIGRSGDRWRLDANGNPVLKALNGKH----------- 66
Query: 81 PFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
GG + NCQI+QT NR KS++ + ++KG S + +T
Sbjct: 67 ----GGPNDQSNCQIIQTGANRSKSNR-ILRKDQMKGLSRTMNWT 106
>gi|41408779|ref|NP_961615.1| hypothetical protein MAP2681c [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|254774117|ref|ZP_05215633.1| hypothetical protein MaviaA2_05510 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|417749729|ref|ZP_12398118.1| restriction endonuclease [Mycobacterium avium subsp.
paratuberculosis S397]
gi|41397137|gb|AAS04998.1| hypothetical protein MAP_2681c [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|336458728|gb|EGO37688.1| restriction endonuclease [Mycobacterium avium subsp.
paratuberculosis S397]
Length = 314
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 37/77 (48%), Gaps = 20/77 (25%)
Query: 51 RWRKDAAGN--IVCKRFCN---------CHGC---------LCFEYDHIVPFSKGGESTA 90
RWRK AAG ++ R N C C L E DHIVP SKGG ST
Sbjct: 234 RWRKSAAGQRALMTSRLRNSIKVRDHYTCRYCSVSLAAEPHLLLEVDHIVPISKGGMSTP 293
Query: 91 DNCQILQTRVNRFKSDK 107
DN Q L R NR KS+K
Sbjct: 294 DNLQTLCWRCNRTKSNK 310
>gi|403340432|gb|EJY69501.1| hypothetical protein OXYTRI_09762 [Oxytricha trifallax]
Length = 153
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%)
Query: 27 RYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCL 72
R F + K +CW ++ T+P R P+RWR D GN V CHG L
Sbjct: 61 RIFTRQQKDVCWQQSTTIPIRDPKRWRLDPLGNPVFYTLRGCHGAL 106
>gi|297831270|ref|XP_002883517.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297329357|gb|EFH59776.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 87
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 45/97 (46%), Gaps = 18/97 (18%)
Query: 43 TVPGRHPERWRKDAAGNIVCKRFC-NCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVN 101
V G + +RW D GN+VCK G CF++ I K E TA+NCQIL
Sbjct: 2 VVDGMNKDRWCLDVQGNLVCKPLHRGSLGAHCFQFHAI---DKDKELTAENCQILC---- 54
Query: 102 RFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYG 138
A +G D FT ++ ++E+A+YG
Sbjct: 55 ----------AAAARGLKPDSSFTQEDFKLVEIALYG 81
>gi|429740338|ref|ZP_19274029.1| ftsk gamma domain protein [Prevotella saccharolytica F0055]
gi|429153310|gb|EKX96095.1| ftsk gamma domain protein [Prevotella saccharolytica F0055]
Length = 285
Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 31/56 (55%), Gaps = 6/56 (10%)
Query: 52 WRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
WR+D R C E+DHI+PFSKGG +T N Q+L + NR KS+K
Sbjct: 234 WRRDGG------RCVKCGSQENLEFDHIIPFSKGGSNTIRNIQLLCEKCNRSKSNK 283
>gi|313205222|ref|YP_004043879.1| hypothetical protein Palpr_2764 [Paludibacter propionicigenes WB4]
gi|312444538|gb|ADQ80894.1| hypothetical protein Palpr_2764 [Paludibacter propionicigenes WB4]
Length = 100
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 1/76 (1%)
Query: 33 AKSMCWAKAETVPGRHPERWRKDAAGNIV-CKRFCNCHGCLCFEYDHIVPFSKGGESTAD 91
K + W K +PG E WR D G ++ N +E DHI P + GG +
Sbjct: 18 TKKVVWNKGAIIPGYPSEVWRSDKCGKVMKYSEHGNRSSDYGWEIDHITPVAYGGSDDLN 77
Query: 92 NCQILQTRVNRFKSDK 107
N Q L N KSDK
Sbjct: 78 NLQPLNWSNNVDKSDK 93
>gi|118465054|ref|YP_880481.1| HNH endonuclease, putative [Mycobacterium avium 104]
gi|118166341|gb|ABK67238.1| HNH endonuclease, putative [Mycobacterium avium 104]
Length = 69
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 24/36 (66%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E +HIVP SKGG ST DN Q L R NR KS+K
Sbjct: 30 LLLEVNHIVPISKGGMSTPDNLQTLCWRCNRTKSNK 65
>gi|258517332|ref|YP_003193554.1| HNH nuclease [Desulfotomaculum acetoxidans DSM 771]
gi|257781037|gb|ACV64931.1| HNH nuclease [Desulfotomaculum acetoxidans DSM 771]
Length = 104
Score = 44.3 bits (103), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 29 FDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFC---NCHGCLCFEYDHIVPFSKG 85
F+ + W KA VP + +RKD G ++ RF N + +E DH P ++G
Sbjct: 14 FNPRTILAVWNKAAVVPECNQSEFRKDRCGALM--RFADYDNVNSIFGWEIDHEKPVARG 71
Query: 86 GESTADNCQILQTRVNRFKSD 106
G DN Q L R NR KSD
Sbjct: 72 GTDDLDNLQPLHWRNNRGKSD 92
>gi|384253642|gb|EIE27116.1| hypothetical protein COCSUDRAFT_55140 [Coccomyxa subellipsoidea
C-169]
Length = 134
Score = 43.9 bits (102), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 38/70 (54%), Gaps = 6/70 (8%)
Query: 42 ETVPGRHPERWRKDAAGNIVCKRFCNCHG-CLC-FEYDHIVPFSKGGESTADNCQILQTR 99
E +P R+R D GN+V G C+C FE DHI P+++GG S +N L R
Sbjct: 67 ERIPPPWHMRFRSDPYGNVVA---LEARGSCVCAFEVDHIFPWARGGLSVVENFMALFWR 123
Query: 100 VNR-FKSDKE 108
NR K+DK+
Sbjct: 124 SNRNVKNDKD 133
>gi|410867562|ref|YP_006982173.1| HNH endonuclease domain-containing protein [Propionibacterium
acidipropionici ATCC 4875]
gi|410824203|gb|AFV90818.1| HNH endonuclease domain-containing protein [Propionibacterium
acidipropionici ATCC 4875]
Length = 330
Score = 43.9 bits (102), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDA 112
L E DHIVP S+GG ST DN Q L R NR KS+K +DA
Sbjct: 291 LLLEVDHIVPVSRGGLSTEDNLQTLCWRCNRSKSNK-MIDA 330
>gi|418175095|ref|ZP_12811693.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41437]
gi|353843655|gb|EHE23699.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41437]
Length = 178
Score = 43.9 bits (102), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 26/47 (55%)
Query: 61 VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
+C L E DHIVP SKGG ST DN Q L + NR KS+K
Sbjct: 130 ICAASTAEQSLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNK 176
>gi|169346887|ref|ZP_02865835.1| type III restriction enzyme, res subunit family [Clostridium
perfringens C str. JGS1495]
gi|169296946|gb|EDS79070.1| type III restriction enzyme, res subunit family [Clostridium
perfringens C str. JGS1495]
Length = 740
Score = 43.5 bits (101), Expect = 0.040, Method: Composition-based stats.
Identities = 18/34 (52%), Positives = 24/34 (70%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
F+ DHI P SKGG +T DN Q+L ++ N+ K DK
Sbjct: 705 FQIDHIKPISKGGLTTLDNLQLLCSKCNKIKGDK 738
>gi|392944277|ref|ZP_10309919.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
sp. QA3]
gi|392287571|gb|EIV93595.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
sp. QA3]
Length = 607
Score = 43.1 bits (100), Expect = 0.051, Method: Composition-based stats.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E+DH++PFS+GG S+ +N Q+L R NR K +
Sbjct: 573 LEFDHVIPFSRGGASSVNNLQLLCRRCNRAKGAR 606
>gi|111221190|ref|YP_711984.1| tellurium resistance protein terE [Frankia alni ACN14a]
gi|111148722|emb|CAJ60397.1| Tellurium resistance protein terE [Frankia alni ACN14a]
Length = 644
Score = 43.1 bits (100), Expect = 0.053, Method: Composition-based stats.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E+DH++PFS+GG S+ +N Q+L R NR K +
Sbjct: 610 LEFDHVIPFSRGGASSVNNLQLLCRRCNRSKGAR 643
>gi|218197006|gb|EEC79433.1| hypothetical protein OsI_20409 [Oryza sativa Indica Group]
Length = 498
Score = 43.1 bits (100), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 9/103 (8%)
Query: 81 PFSKGGESTADNCQILQTRVN--------RFKSDKEQVDATKLKGFSCDVKFTDKELDII 132
P GG +T +NC I+Q R N R + +D ++ + + +T+ +L I
Sbjct: 82 PLVTGGLNTPENCAIIQGRANRNCAIIQSRANRRDDILDEMEVAALASRINWTEYQLASI 141
Query: 133 EMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVN 175
E AV G++IR G ++ G + R LL +M+N
Sbjct: 142 EAAVIGNIIRDGKVIEIAPNSKG-GPLQDVLRSLLEKDELMIN 183
>gi|257486772|ref|ZP_05640813.1| stress protein [Pseudomonas syringae pv. tabaci str. ATCC 11528]
Length = 217
Score = 42.7 bits (99), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 28/47 (59%), Gaps = 6/47 (12%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
E W++DA VC C LCF DHIVPFS+GG +T N Q+L
Sbjct: 165 EVWQRDAG---VCVE-CGTRQLLCF--DHIVPFSRGGSNTVRNLQLL 205
>gi|418077823|ref|ZP_12715047.1| HNH endonuclease family protein [Streptococcus pneumoniae 4027-06]
gi|418116043|ref|ZP_12753020.1| HNH endonuclease family protein [Streptococcus pneumoniae 6963-05]
gi|419439525|ref|ZP_13979582.1| HNH endonuclease family protein [Streptococcus pneumoniae GA40410]
gi|419459153|ref|ZP_13999089.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02270]
gi|419461430|ref|ZP_14001347.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02714]
gi|353748845|gb|EHD29496.1| HNH endonuclease family protein [Streptococcus pneumoniae 4027-06]
gi|353791681|gb|EHD72055.1| HNH endonuclease family protein [Streptococcus pneumoniae 6963-05]
gi|379534025|gb|EHY99237.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02270]
gi|379534955|gb|EHZ00162.1| HNH endonuclease family protein [Streptococcus pneumoniae GA02714]
gi|379582010|gb|EHZ46893.1| HNH endonuclease family protein [Streptococcus pneumoniae GA40410]
Length = 356
Score = 42.7 bits (99), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 24/39 (61%)
Query: 69 HGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E DHIVP SKGG ST DN Q L + NR KS+K
Sbjct: 316 QSLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNK 354
>gi|422850756|ref|ZP_16897426.1| HNH endonuclease domain protein [Streptococcus sanguinis SK150]
gi|422857591|ref|ZP_16904241.1| HNH endonuclease domain protein [Streptococcus sanguinis SK1057]
gi|325695504|gb|EGD37404.1| HNH endonuclease domain protein [Streptococcus sanguinis SK150]
gi|327462774|gb|EGF09096.1| HNH endonuclease domain protein [Streptococcus sanguinis SK1057]
Length = 329
Score = 42.7 bits (99), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 24/39 (61%)
Query: 69 HGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E DHI+P SKGG ST DN Q L + NR KS+K
Sbjct: 289 QSLLLLEIDHIIPISKGGLSTRDNLQTLCWKCNRSKSNK 327
>gi|168484153|ref|ZP_02709105.1| HNH endonuclease domain protein [Streptococcus pneumoniae
CDC1873-00]
gi|168493974|ref|ZP_02718117.1| HNH endonuclease domain protein [Streptococcus pneumoniae
CDC3059-06]
gi|169834462|ref|YP_001693806.1| HNH endonuclease domain-containing protein [Streptococcus
pneumoniae Hungary19A-6]
gi|417685762|ref|ZP_12335042.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41301]
gi|417695447|ref|ZP_12344627.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47368]
gi|418079981|ref|ZP_12717197.1| HNH endonuclease family protein [Streptococcus pneumoniae 6735-05]
gi|418088689|ref|ZP_12725850.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43265]
gi|418090912|ref|ZP_12728057.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44452]
gi|418097696|ref|ZP_12734798.1| HNH endonuclease family protein [Streptococcus pneumoniae 6901-05]
gi|418106621|ref|ZP_12743667.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41410]
gi|418109254|ref|ZP_12746283.1| HNH endonuclease family protein [Streptococcus pneumoniae GA49447]
gi|418113876|ref|ZP_12750869.1| HNH endonuclease family protein [Streptococcus pneumoniae 5787-06]
gi|418134007|ref|ZP_12770867.1| HNH endonuclease family protein [Streptococcus pneumoniae GA11426]
gi|418158923|ref|ZP_12795629.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17227]
gi|418161331|ref|ZP_12798022.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17328]
gi|418168382|ref|ZP_12805028.1| HNH endonuclease family protein [Streptococcus pneumoniae GA19077]
gi|418218031|ref|ZP_12844699.1| HNH endonuclease family protein [Streptococcus pneumoniae NP127]
gi|418220228|ref|ZP_12846885.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47751]
gi|418237869|ref|ZP_12864426.1| HNH endonuclease family protein [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|419422272|ref|ZP_13962491.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43264]
gi|419488280|ref|ZP_14028033.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44386]
gi|419520261|ref|ZP_14059860.1| HNH endonuclease family protein [Streptococcus pneumoniae GA05245]
gi|419525050|ref|ZP_14064616.1| HNH endonuclease family protein [Streptococcus pneumoniae GA14373]
gi|421248645|ref|ZP_15705108.1| HNH endonuclease family protein [Streptococcus pneumoniae 2082239]
gi|421271937|ref|ZP_15722784.1| HNH endonuclease family protein [Streptococcus pneumoniae SPAR55]
gi|168996964|gb|ACA37576.1| HNH endonuclease domain protein [Streptococcus pneumoniae
Hungary19A-6]
gi|172042557|gb|EDT50603.1| HNH endonuclease domain protein [Streptococcus pneumoniae
CDC1873-00]
gi|183576210|gb|EDT96738.1| HNH endonuclease domain protein [Streptococcus pneumoniae
CDC3059-06]
gi|332077580|gb|EGI88041.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41301]
gi|332203444|gb|EGJ17511.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47368]
gi|353754300|gb|EHD34913.1| HNH endonuclease family protein [Streptococcus pneumoniae 6735-05]
gi|353764430|gb|EHD44979.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43265]
gi|353766285|gb|EHD46825.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44452]
gi|353771852|gb|EHD52359.1| HNH endonuclease family protein [Streptococcus pneumoniae 6901-05]
gi|353781535|gb|EHD61977.1| HNH endonuclease family protein [Streptococcus pneumoniae GA41410]
gi|353784547|gb|EHD64967.1| HNH endonuclease family protein [Streptococcus pneumoniae GA49447]
gi|353788540|gb|EHD68937.1| HNH endonuclease family protein [Streptococcus pneumoniae 5787-06]
gi|353826578|gb|EHE06736.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17227]
gi|353830921|gb|EHE11050.1| HNH endonuclease family protein [Streptococcus pneumoniae GA17328]
gi|353836301|gb|EHE16389.1| HNH endonuclease family protein [Streptococcus pneumoniae GA19077]
gi|353876228|gb|EHE56077.1| HNH endonuclease family protein [Streptococcus pneumoniae NP127]
gi|353878025|gb|EHE57864.1| HNH endonuclease family protein [Streptococcus pneumoniae GA47751]
gi|353895002|gb|EHE74742.1| HNH endonuclease family protein [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|353903994|gb|EHE79508.1| HNH endonuclease family protein [Streptococcus pneumoniae GA11426]
gi|379541885|gb|EHZ07050.1| HNH endonuclease family protein [Streptococcus pneumoniae GA05245]
gi|379560754|gb|EHZ25776.1| HNH endonuclease family protein [Streptococcus pneumoniae GA14373]
gi|379589603|gb|EHZ54442.1| HNH endonuclease family protein [Streptococcus pneumoniae GA43264]
gi|379590395|gb|EHZ55233.1| HNH endonuclease family protein [Streptococcus pneumoniae GA44386]
gi|395615274|gb|EJG75290.1| HNH endonuclease family protein [Streptococcus pneumoniae 2082239]
gi|395877496|gb|EJG88565.1| HNH endonuclease family protein [Streptococcus pneumoniae SPAR55]
Length = 326
Score = 42.7 bits (99), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 24/39 (61%)
Query: 69 HGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E DHIVP SKGG ST DN Q L + NR KS+K
Sbjct: 286 QSLLLLEIDHIVPVSKGGLSTPDNLQTLCWKCNRSKSNK 324
>gi|219847405|ref|YP_002461838.1| HNH endonuclease [Chloroflexus aggregans DSM 9485]
gi|219541664|gb|ACL23402.1| HNH endonuclease [Chloroflexus aggregans DSM 9485]
Length = 206
Score = 42.4 bits (98), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGEST-ADNCQILQTRVNRFKSDK 107
E W++D ++C H YDHI+P+SKGG S A+N Q+L R N K DK
Sbjct: 152 EVWKRDGGRCVICSATDELH------YDHIIPYSKGGTSLKAENIQLLCARHNLEKRDK 204
>gi|227356931|ref|ZP_03841307.1| hypothetical protein HMPREF0693_2198 [Proteus mirabilis ATCC 29906]
gi|425072865|ref|ZP_18475971.1| hypothetical protein HMPREF1310_02304 [Proteus mirabilis WGLW4]
gi|227162908|gb|EEI47860.1| hypothetical protein HMPREF0693_2198 [Proteus mirabilis ATCC 29906]
gi|404596639|gb|EKA97159.1| hypothetical protein HMPREF1310_02304 [Proteus mirabilis WGLW4]
Length = 98
Score = 42.0 bits (97), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKE 108
FE DHI+P KGG+ T +NCQ+L +R K+ +E
Sbjct: 61 FELDHIIPLFKGGKDTIENCQVLCIECHRKKTKEE 95
>gi|363891377|ref|ZP_09318556.1| hypothetical protein HMPREF9630_00141 [Eubacteriaceae bacterium
CM2]
gi|361965434|gb|EHL18416.1| hypothetical protein HMPREF9630_00141 [Eubacteriaceae bacterium
CM2]
Length = 759
Score = 41.6 bits (96), Expect = 0.15, Method: Composition-based stats.
Identities = 24/59 (40%), Positives = 31/59 (52%), Gaps = 9/59 (15%)
Query: 54 KDAAGNIVCKRFCNCHGC-----LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
KDA GN C C G + F+ DHI+ + GG+S ADN QIL + N K D+
Sbjct: 705 KDADGNYRCA----CCGISDKSRIYFQVDHIISMNNGGKSVADNLQILCRQCNGIKGDQ 759
>gi|307705062|ref|ZP_07641944.1| HNH endonuclease family protein [Streptococcus mitis SK597]
gi|307621385|gb|EFO00440.1| HNH endonuclease family protein [Streptococcus mitis SK597]
Length = 423
Score = 41.2 bits (95), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 23/39 (58%)
Query: 69 HGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E DHI+P SKGG ST N Q L + NR KSDK
Sbjct: 383 QDLLLLEVDHIIPVSKGGMSTESNLQTLCWKCNRTKSDK 421
>gi|54023566|ref|YP_117808.1| hypothetical protein nfa15980 [Nocardia farcinica IFM 10152]
gi|54015074|dbj|BAD56444.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length = 760
Score = 41.2 bits (95), Expect = 0.19, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 6/58 (10%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E W++D R C E+DHI+P S+GG ++A N QIL NR K +
Sbjct: 708 EVWQRDGG------RCVECGSGHYLEFDHIIPLSRGGATSAANLQILCRACNRRKGAR 759
>gi|422680858|ref|ZP_16739129.1| stress protein, partial [Pseudomonas syringae pv. tabaci str. ATCC
11528]
gi|331010203|gb|EGH90259.1| stress protein [Pseudomonas syringae pv. tabaci str. ATCC 11528]
Length = 138
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 39 AKAETVP-GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
A+ E +P E W++DA + C LCF DHIVPFS+GG +T N Q+L
Sbjct: 74 ARREAIPRSVQREVWQRDAGKCV----ECGTRQLLCF--DHIVPFSRGGSNTVRNLQLL 126
>gi|257486734|ref|ZP_05640775.1| HNH endonuclease [Pseudomonas syringae pv. tabaci str. ATCC 11528]
Length = 217
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 39 AKAETVP-GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
A+ E +P E W++DA + C LCF DHIVPFS+GG +T N Q+L
Sbjct: 153 ARREAIPRSVQREVWQRDAGKCVEC----GTRQLLCF--DHIVPFSRGGSNTVRNLQLL 205
>gi|374295268|ref|YP_005045459.1| restriction endonuclease [Clostridium clariflavum DSM 19732]
gi|359824762|gb|AEV67535.1| restriction endonuclease [Clostridium clariflavum DSM 19732]
Length = 112
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 24/34 (70%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
++ DHI+P SKGG ++ N Q+L + NR KSDK
Sbjct: 78 WQIDHIIPKSKGGTNSYSNAQVLSRKQNRLKSDK 111
>gi|441509730|ref|ZP_20991644.1| hypothetical protein GOACH_15_00980 [Gordonia aichiensis NBRC
108223]
gi|441446246|dbj|GAC49605.1| hypothetical protein GOACH_15_00980 [Gordonia aichiensis NBRC
108223]
Length = 334
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 31/53 (58%), Gaps = 5/53 (9%)
Query: 59 NIVCKRFCNCHGC----LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
N C++ C+ G L E DHIVP S+GG ST +N Q L + NR KS+K
Sbjct: 279 NYACRQ-CSISGAAEPHLLLEVDHIVPVSRGGLSTMENLQTLCWKCNRSKSNK 330
>gi|440742015|ref|ZP_20921345.1| TerF-like protein [Pseudomonas syringae BRIP39023]
gi|440378101|gb|ELQ14731.1| TerF-like protein [Pseudomonas syringae BRIP39023]
Length = 217
Score = 41.2 bits (95), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 39 AKAETVP-GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
A+ E +P E W++DA + C LCF DHIVPFS+GG +T N Q+L
Sbjct: 153 ARREAIPRSVQREVWQRDAGKCVEC----GTRQLLCF--DHIVPFSRGGSNTVRNLQLL 205
>gi|424859400|ref|ZP_18283414.1| hypothetical protein OPAG_06556 [Rhodococcus opacus PD630]
gi|356661909|gb|EHI42220.1| hypothetical protein OPAG_06556 [Rhodococcus opacus PD630]
Length = 325
Score = 41.2 bits (95), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 35/77 (45%), Gaps = 20/77 (25%)
Query: 51 RWRKDAAGN--IVCKRFCNC------HGC------------LCFEYDHIVPFSKGGESTA 90
+WRK AAG ++ + N H C L E DHI+P SKGG +
Sbjct: 244 KWRKSAAGQRALMTTKLRNMIKTRDNHTCRYCSISLSAEPHLLLEVDHIMPVSKGGLTAP 303
Query: 91 DNCQILQTRVNRFKSDK 107
N Q L R NR KS+K
Sbjct: 304 GNLQTLCWRCNRTKSNK 320
>gi|159491606|ref|XP_001703752.1| predicted protein [Chlamydomonas reinhardtii]
gi|158270494|gb|EDO96338.1| predicted protein [Chlamydomonas reinhardtii]
Length = 118
Score = 40.8 bits (94), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 55 DAAGNIVCKRFCNCHGCLC-FEYDHIVPFSKGGESTADNCQILQTRVNRFKSD 106
D GN+V N G +C F DHI P+S+GG + DN LQ R N K +
Sbjct: 26 DPQGNVVALAAAN--GSVCGFHGDHIFPWSRGGLTVRDNLMALQWRANILKKN 76
>gi|427705667|ref|YP_007048044.1| HNH endonuclease [Nostoc sp. PCC 7107]
gi|427358172|gb|AFY40894.1| HNH endonuclease [Nostoc sp. PCC 7107]
Length = 99
Score = 40.8 bits (94), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 29/67 (43%), Gaps = 1/67 (1%)
Query: 36 MCWAKAETVPGRHPERWRKDAAGNIVCK-RFCNCHGCLCFEYDHIVPFSKGGESTADNCQ 94
+ W K P P WR D G ++ + + N +E DHI+P S GG N +
Sbjct: 30 IVWEKGFNNPNYDPRIWRIDQFGRLINRYEYGNRQSTYGWEIDHIIPKSWGGSDHISNLR 89
Query: 95 ILQTRVN 101
L +N
Sbjct: 90 PLHWYIN 96
>gi|350569051|ref|ZP_08937449.1| HNH endonuclease domain protein [Propionibacterium avidum ATCC
25577]
gi|348661294|gb|EGY77990.1| HNH endonuclease domain protein [Propionibacterium avidum ATCC
25577]
Length = 109
Score = 40.8 bits (94), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQV-DATKLKGF 118
L E DHI+P S+GG S DN Q L R NR KS+K V + +L G
Sbjct: 54 LLLEVDHILPVSEGGLSVDDNLQTLCWRCNRSKSNKLPVIEGHQLAGV 101
>gi|255070577|ref|XP_002507370.1| predicted protein [Micromonas sp. RCC299]
gi|226522645|gb|ACO68628.1| predicted protein [Micromonas sp. RCC299]
Length = 1022
Score = 40.8 bits (94), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 9/60 (15%)
Query: 52 WRKDAAGNIVCK-----RFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSD 106
+R D GN+V K F C C EYDH++P+S+GG S + N +++ N+ K D
Sbjct: 784 FRVDMYGNVVAKPGAAENFSVC----CTEYDHVLPWSRGGTSKSANIEVISWIANQRKLD 839
>gi|392942375|ref|ZP_10308017.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
sp. QA3]
gi|392285669|gb|EIV91693.1| uncharacterized stress response protein, TerZ- and CABP1 [Frankia
sp. QA3]
Length = 607
Score = 40.8 bits (94), Expect = 0.29, Method: Composition-based stats.
Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 6/58 (10%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E W++D + C E+DH++P S+GG ++ +N Q+L R N+ K +
Sbjct: 555 EVWQRDGGACV------QCQATEYLEFDHVIPHSRGGATSVNNLQLLCRRCNQEKGAR 606
>gi|407643249|ref|YP_006807008.1| hypothetical protein O3I_010365 [Nocardia brasiliensis ATCC 700358]
gi|407306133|gb|AFU00034.1| hypothetical protein O3I_010365 [Nocardia brasiliensis ATCC 700358]
Length = 618
Score = 40.8 bits (94), Expect = 0.30, Method: Composition-based stats.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 6/58 (10%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E W++D + C + H E+DHI+P S+GG ++A N QIL NR K +
Sbjct: 566 EVWQRDGGKCVECG---DSH---YLEFDHIIPLSRGGATSAANLQILCRSCNRAKGAR 617
>gi|288802822|ref|ZP_06408259.1| conserved hypothetical protein [Prevotella melaninogenica D18]
gi|288334639|gb|EFC73077.1| conserved hypothetical protein [Prevotella melaninogenica D18]
Length = 119
Score = 40.4 bits (93), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 1/78 (1%)
Query: 30 DGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCK-RFCNCHGCLCFEYDHIVPFSKGGES 88
D + K CW A+ V G + RKDA G + K + +E DH+ P GG+
Sbjct: 3 DEEKKEYCWNHAQIVEGYDKDSIRKDACGAWIFKAHYGMRDSVFGWEVDHVFPVILGGDD 62
Query: 89 TADNCQILQTRVNRFKSD 106
N + +Q + N K D
Sbjct: 63 FKRNLRAMQWKNNVSKGD 80
>gi|423209364|ref|ZP_17195918.1| hypothetical protein HMPREF1169_01436 [Aeromonas veronii AER397]
gi|404617222|gb|EKB14158.1| hypothetical protein HMPREF1169_01436 [Aeromonas veronii AER397]
Length = 218
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 39/82 (47%), Gaps = 8/82 (9%)
Query: 34 KSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE--------YDHIVPFSKG 85
+S+ ++P P + R++A V + G C E +DHIVPFSKG
Sbjct: 136 ESLIRVDISSMPIPKPVKPRREAIPRAVQREVWQRDGGKCVECSSNAKLCFDHIVPFSKG 195
Query: 86 GESTADNCQILQTRVNRFKSDK 107
G ++ N Q+L R N K ++
Sbjct: 196 GSNSVRNLQLLCERCNLSKGNR 217
>gi|308535332|ref|YP_003933733.1| HNH endonuclease [Geobacter bemidjiensis Bem]
gi|308052677|gb|ADO00821.1| HNH endonuclease [Geobacter bemidjiensis Bem]
Length = 161
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E+DHI+P SKGG T N Q+L + NR K DK
Sbjct: 127 LEFDHIIPHSKGGSDTERNIQLLCMKCNRTKYDK 160
>gi|260655767|ref|ZP_05861236.1| putative HNH endonuclease domain protein [Jonquetella anthropi
E3_33 E1]
gi|260629383|gb|EEX47577.1| putative HNH endonuclease domain protein [Jonquetella anthropi
E3_33 E1]
Length = 384
Score = 40.0 bits (92), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 66 CNCHGCLC-FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
CN H + E DHI P+SKGG + A+NCQ+L NR KS+K
Sbjct: 342 CNKHFEINEMEADHITPWSKGGRTIAENCQMLCKTCNRKKSNK 384
>gi|256824147|ref|YP_003148107.1| HNH endonuclease [Kytococcus sedentarius DSM 20547]
gi|256687540|gb|ACV05342.1| HNH endonuclease [Kytococcus sedentarius DSM 20547]
Length = 169
Score = 40.0 bits (92), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 33/70 (47%), Gaps = 2/70 (2%)
Query: 51 RWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQV 110
R R A + C F C E DHIVPF GG +TADN Q L R + K+ K Q
Sbjct: 82 RARDGVARDPGCGSFGATEAERC-ELDHIVPFDVGGTTTADNLQALSRRGHTRKT-KRQW 139
Query: 111 DATKLKGFSC 120
+ T G S
Sbjct: 140 EVTGDAGGSL 149
>gi|419855738|ref|ZP_14378488.1| HNH endonuclease domain protein [Bifidobacterium longum subsp.
longum 44B]
gi|386414961|gb|EIJ29502.1| HNH endonuclease domain protein [Bifidobacterium longum subsp.
longum 44B]
Length = 117
Score = 40.0 bits (92), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 41/103 (39%), Gaps = 20/103 (19%)
Query: 47 RHPERWRKDAAGNIVCKRFCNCHGC-------------LCFEYDHIVPFSKGGESTA-DN 92
RH WR+ V + CH C E D I+P SKGG N
Sbjct: 9 RHANGWRRQQVAARVLAAYDTCHLCGRSVDKSLPPGLPGSPEVDEIIPVSKGGSPYLFSN 68
Query: 93 CQILQTRVNRFKSD------KEQVDATKLKGFSCDVKFTDKEL 129
C++ NR +S+ +E + T +G++ D+K T L
Sbjct: 69 CRLAHRWCNRIRSNHSVAWAREHIKQTFEQGYTADLKATSMPL 111
>gi|429195043|ref|ZP_19187098.1| glycosyl hydrolase, family 15 [Streptomyces ipomoeae 91-03]
gi|428669241|gb|EKX68209.1| glycosyl hydrolase, family 15 [Streptomyces ipomoeae 91-03]
Length = 620
Score = 39.7 bits (91), Expect = 0.52, Method: Composition-based stats.
Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 17/98 (17%)
Query: 25 RPRYFDGKAKSMCW---------AKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFE 75
+PR+F ++K MCW A+ + +PGRH RWR A I + F H CF
Sbjct: 423 QPRHFT-QSKMMCWVALDRAADLAEHDLIPGRHLARWRT-ARDQI--RAFVETH---CFS 475
Query: 76 YDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDAT 113
D + + + +S + +L + ++ D ++DAT
Sbjct: 476 -DSLDSYVRSADSPDLDAAVLLGHLYGYRGDHRRMDAT 512
>gi|347535197|ref|YP_004842622.1| hypothetical protein FBFL15_0247 [Flavobacterium branchiophilum
FL-15]
gi|345528355|emb|CCB68385.1| Protein of unknown function [Flavobacterium branchiophilum FL-15]
Length = 366
Score = 39.7 bits (91), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%)
Query: 61 VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
+C + C E DHI P+ +GG++TA+NCQ+L NR KS K
Sbjct: 320 ICVKCCEQFELNEMEADHITPWHEGGKTTAENCQMLCKLDNRIKSGK 366
>gi|394775392|gb|AFN37787.1| gp99 [Mycobacteriophage MacnCheese]
Length = 95
Score = 39.7 bits (91), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 21/35 (60%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSD 106
L F+ DHI P ++GG T DNC + NR KSD
Sbjct: 41 LSFQIDHITPLARGGTDTLDNCAAAHRKCNRDKSD 75
>gi|379707873|ref|YP_005263078.1| hypothetical protein NOCYR_1649 [Nocardia cyriacigeorgica GUH-2]
gi|374845372|emb|CCF62438.1| conserved protein of unknown function; putative endonuclease domain
[Nocardia cyriacigeorgica GUH-2]
Length = 549
Score = 39.7 bits (91), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 6/58 (10%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E W++D R C E+DHI+P S+GG ++A N QIL NR K +
Sbjct: 497 EVWQRDGG------RCVECGDGHYLEFDHIIPLSRGGATSATNLQILCRACNRAKGAR 548
>gi|257439996|ref|ZP_05615751.1| putative HNH endonuclease domain protein [Faecalibacterium
prausnitzii A2-165]
gi|257197603|gb|EEU95887.1| HNH endonuclease domain protein [Faecalibacterium prausnitzii
A2-165]
Length = 367
Score = 39.7 bits (91), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 24/30 (80%)
Query: 77 DHIVPFSKGGESTADNCQILQTRVNRFKSD 106
DHI P+SKGG++TA+NCQ+L NR KS+
Sbjct: 337 DHITPWSKGGKTTAENCQMLCADCNRRKSN 366
>gi|339784200|gb|AEK09909.1| gp100 [Mycobacterium phage Pixie]
Length = 95
Score = 39.7 bits (91), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 31/67 (46%), Gaps = 13/67 (19%)
Query: 53 RKDAAGNIVCKRFCNCHGC-------------LCFEYDHIVPFSKGGESTADNCQILQTR 99
R+D ++ +R +CH C L F+ DHI P ++GG T DN +
Sbjct: 9 RRDRFRRVIKRRGDDCHLCGLPIDYDLPHDHELSFQIDHITPLARGGTDTLDNIAAAHRK 68
Query: 100 VNRFKSD 106
NR KSD
Sbjct: 69 CNRDKSD 75
>gi|303236487|ref|ZP_07323074.1| HNH endonuclease domain protein [Prevotella disiens FB035-09AN]
gi|302483338|gb|EFL46346.1| HNH endonuclease domain protein [Prevotella disiens FB035-09AN]
Length = 367
Score = 39.7 bits (91), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 24/33 (72%)
Query: 75 EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E DHI+P+ KGG++ DNCQ+L + NR KS K
Sbjct: 335 EADHIIPWHKGGKTEKDNCQMLCMKCNRTKSGK 367
>gi|365173256|ref|ZP_09361377.1| hypothetical protein HMPREF1006_02508 [Synergistes sp. 3_1_syn1]
gi|363616949|gb|EHL68368.1| hypothetical protein HMPREF1006_02508 [Synergistes sp. 3_1_syn1]
Length = 374
Score = 39.7 bits (91), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 26/39 (66%), Gaps = 5/39 (12%)
Query: 74 FEYD-----HIVPFSKGGESTADNCQILQTRVNRFKSDK 107
FEYD HI P+SKGG++T DNCQ+L N K+DK
Sbjct: 335 FEYDEMEGDHIKPWSKGGQTTPDNCQMLCKSCNAKKTDK 373
>gi|126657420|ref|ZP_01728579.1| hypothetical protein CY0110_00635 [Cyanothece sp. CCY0110]
gi|126621407|gb|EAZ92119.1| hypothetical protein CY0110_00635 [Cyanothece sp. CCY0110]
Length = 636
Score = 39.3 bits (90), Expect = 0.68, Method: Composition-based stats.
Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 46 GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
G+HPE K + ++ K+ C+ C L F E DHI+P GG++ DN Q L
Sbjct: 497 GKHPEL--KSSIAKLLKKQKGKCNYCGLTFMPGDRIETDHIIPRKMGGDNLKDNLQALHI 554
Query: 99 RVNRFKSDKEQVDATKLK 116
+ KS K+ D T K
Sbjct: 555 HCHDIKSKKDIKDITSYK 572
>gi|168059079|ref|XP_001781532.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667011|gb|EDQ53651.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 189
Score = 39.3 bits (90), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 46 GRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKS 105
G P ++R D+ GN+V + L +E DH P S+GG++ N ++LQ + ++ K
Sbjct: 97 GFDPAKFRVDSYGNVVYWN-ADPSSPLAWEIDHWFPHSRGGKTVVSNLRVLQWQASQRKK 155
Query: 106 DK 107
+
Sbjct: 156 SR 157
>gi|409098020|ref|ZP_11218044.1| hypothetical protein PagrP_06221 [Pedobacter agri PB92]
Length = 296
Score = 39.3 bits (90), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKS 105
E+DHI+PFSKGG +T N Q+L NR KS
Sbjct: 261 LEFDHIIPFSKGGSNTYRNIQLLCEGCNRTKS 292
>gi|354464472|gb|AER26385.1| gp94 [Mycobacterium phage Fionnbharth]
Length = 101
Score = 39.3 bits (90), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 20/39 (51%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQV 110
L F DHI P + GGE T DN + NR KSDK
Sbjct: 48 LAFTIDHITPLALGGEDTLDNLGPAHRKCNRAKSDKPPT 86
>gi|345867540|ref|ZP_08819550.1| HNH endonuclease family protein [Bizionia argentinensis JUB59]
gi|344048036|gb|EGV43650.1| HNH endonuclease family protein [Bizionia argentinensis JUB59]
Length = 189
Score = 39.3 bits (90), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 30/67 (44%)
Query: 49 PERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKE 108
P+ R A N K +E DHIVP S+GG N Q+L ++ K+ +E
Sbjct: 45 PKVVRSTANKNAFLKSKGYTSTPYGYEIDHIVPLSEGGSDDPSNMQLLTVNQHKAKTARE 104
Query: 109 QVDATKL 115
+ KL
Sbjct: 105 RTKRAKL 111
>gi|410097764|ref|ZP_11292745.1| hypothetical protein HMPREF1076_01923 [Parabacteroides goldsteinii
CL02T12C30]
gi|410101011|ref|ZP_11295965.1| hypothetical protein HMPREF1076_05143 [Parabacteroides goldsteinii
CL02T12C30]
gi|410101180|ref|ZP_11296130.1| hypothetical protein HMPREF1076_05308 [Parabacteroides goldsteinii
CL02T12C30]
gi|409213077|gb|EKN06106.1| hypothetical protein HMPREF1076_05308 [Parabacteroides goldsteinii
CL02T12C30]
gi|409213657|gb|EKN06673.1| hypothetical protein HMPREF1076_05143 [Parabacteroides goldsteinii
CL02T12C30]
gi|409223854|gb|EKN16789.1| hypothetical protein HMPREF1076_01923 [Parabacteroides goldsteinii
CL02T12C30]
Length = 83
Score = 39.3 bits (90), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 35/80 (43%)
Query: 28 YFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGE 87
YF + W K V G+ P +RKD GN++ L + DH P +KGG
Sbjct: 3 YFSLETLLRIWQKGLIVRGKDPNLYRKDCYGNLMYWYSYGKFTKLGWNVDHSKPIAKGGT 62
Query: 88 STADNCQILQTRVNRFKSDK 107
+ +N Q + N K+ K
Sbjct: 63 NHLNNLQPMNCFANSSKNAK 82
>gi|441213611|ref|ZP_20975857.1| hypothetical protein D806_5030 [Mycobacterium smegmatis MKD8]
gi|440625575|gb|ELQ87421.1| hypothetical protein D806_5030 [Mycobacterium smegmatis MKD8]
Length = 54
Score = 39.3 bits (90), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 52 WRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSDKEQ 109
W++D + C N H +DHI+P+S+GG S A+N Q++ R N K DK Q
Sbjct: 2 WKRDQGRCVECGSTDNLH------FDHIIPYSRGGSSLVAENIQLMCARHNLAKHDKIQ 54
>gi|359427003|ref|ZP_09218078.1| hypothetical protein GOAMR_71_00130 [Gordonia amarae NBRC 15530]
gi|358237616|dbj|GAB07660.1| hypothetical protein GOAMR_71_00130 [Gordonia amarae NBRC 15530]
Length = 548
Score = 38.9 bits (89), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 64 RFCNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
R C EYDHI+P+S+GG ++ +N QIL
Sbjct: 504 RCVECGSTSYLEYDHIIPWSRGGATSVENLQIL 536
>gi|220919209|ref|YP_002494513.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-1]
gi|219957063|gb|ACL67447.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-1]
Length = 386
Score = 38.9 bits (89), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 9/52 (17%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLC-----FEYDHIVPFSKGGESTADNCQIL 96
E WR+D C ++ +G LC + DH+VP ++GGESTA N ++L
Sbjct: 287 EVWRRDGG----CCQWRLANGELCGSTHALQLDHVVPRARGGESTAANLRVL 334
>gi|154500496|ref|ZP_02038534.1| hypothetical protein BACCAP_04168 [Bacteroides capillosus ATCC
29799]
gi|150270727|gb|EDM98023.1| HNH endonuclease domain protein [Pseudoflavonifractor capillosus
ATCC 29799]
Length = 750
Score = 38.9 bits (89), Expect = 0.99, Method: Composition-based stats.
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 5/47 (10%)
Query: 66 CNCHGC-----LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
C C G + F+ DHI+P SKGG+S +N QIL N K D+
Sbjct: 704 CACCGMEAASRIPFQVDHILPMSKGGKSVPENLQILCRSCNGRKGDQ 750
>gi|256824130|ref|YP_003148090.1| HNH endonuclease [Kytococcus sedentarius DSM 20547]
gi|256687523|gb|ACV05325.1| HNH endonuclease [Kytococcus sedentarius DSM 20547]
Length = 198
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 70 GCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDAT 113
G E DH+VPF +GG++ A+N Q L TR + K+ K Q DA+
Sbjct: 128 GAASCEIDHVVPFDEGGQTVAENLQCLSTRGHTRKT-KGQWDAS 170
>gi|443474446|ref|ZP_21064422.1| helicase domain protein [Pseudanabaena biceps PCC 7429]
gi|443020805|gb|ELS34721.1| helicase domain protein [Pseudanabaena biceps PCC 7429]
Length = 865
Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats.
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 6/46 (13%)
Query: 69 HGCLC------FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKE 108
+ CLC + DH+ PF GG+++ DN Q L R N FK E
Sbjct: 712 NSCLCCGDHRRLQVDHVFPFILGGQTSIDNSQTLCMRCNGFKGKNE 757
>gi|406913193|gb|EKD52650.1| HNH endonuclease [uncultured bacterium]
Length = 229
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSDK 107
E W++D ++C N H +DH +PFSKGG S TA N +IL + N KS K
Sbjct: 175 EVWKRDKGACVICGSNKNLH------FDHELPFSKGGTSLTAKNIRILCLKHNLSKSAK 227
>gi|395519910|ref|XP_003764084.1| PREDICTED: E3 ubiquitin-protein ligase ZSWIM2 [Sarcophilus
harrisii]
Length = 643
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 54/145 (37%), Gaps = 26/145 (17%)
Query: 62 CKRFCNCHGCL--CFEYDHIVPFS-------KGGESTADNCQILQTRVNRFKSDKEQVDA 112
C + CH C CF HI F K E+ + C + N +++ + +
Sbjct: 259 CTEYHLCHECFVSCFHPPHIFTFRQKRNQRWKSLENNPELCPAEKIIKNTEFTNQTEEEM 318
Query: 113 TKL-KGFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQF------------ 159
T L K SC K L +I + Y ++ PGNQCR LGQ
Sbjct: 319 THLEKSDSCTPKHVVTSLPLILITKYSKLLAPGNQCRLCLKPFCLGQHTRLLSCNHKFHR 378
Query: 160 KSKDRWLLANCHMMVNHCSTRGSCI 184
K D WLL + N C G +
Sbjct: 379 KCIDDWLLN----ISNSCPIDGQTV 399
>gi|386772121|ref|ZP_10094499.1| HNH endonuclease [Brachybacterium paraconglomeratum LC44]
Length = 526
Score = 38.9 bits (89), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 4/40 (10%)
Query: 61 VCKRFCNCHGCLC----FEYDHIVPFSKGGESTADNCQIL 96
+ +R C G C +YDHIVP S+GG ++ DN Q L
Sbjct: 394 LSRRDTTCRGPFCNAQIRQYDHIVPVSRGGPTSLDNGQGL 433
>gi|421735209|ref|ZP_16174207.1| HNH endonuclease [Bifidobacterium bifidum LMG 13195]
gi|407076867|gb|EKE49775.1| HNH endonuclease [Bifidobacterium bifidum LMG 13195]
Length = 379
Score = 38.5 bits (88), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 24/30 (80%)
Query: 77 DHIVPFSKGGESTADNCQILQTRVNRFKSD 106
DHIVP+S+GG++T DNCQ+L T N KS+
Sbjct: 349 DHIVPWSEGGKTTLDNCQMLCTTCNIKKSN 378
>gi|187251265|ref|YP_001875747.1| restriction endonuclease [Elusimicrobium minutum Pei191]
gi|186971425|gb|ACC98410.1| Restriction endonuclease [Elusimicrobium minutum Pei191]
Length = 230
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 33/66 (50%), Gaps = 9/66 (13%)
Query: 43 TVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVN 101
T+ RH W +D + C N H +DH +PFSKGG S TA N Q+L + N
Sbjct: 171 TIVKRHV--WERDKGQCVQCGNKTNLH------FDHNIPFSKGGSSITAKNIQLLCAKCN 222
Query: 102 RFKSDK 107
K DK
Sbjct: 223 LSKHDK 228
>gi|223933572|ref|ZP_03625553.1| HNH endonuclease [Streptococcus suis 89/1591]
gi|223897768|gb|EEF64148.1| HNH endonuclease [Streptococcus suis 89/1591]
Length = 115
Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 72 LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATK 114
+C DH+VP +KGG S +N Q+ NR KSDK D +K
Sbjct: 44 MCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSK 87
>gi|451937196|gb|AGF87501.1| putative HNH endonuclease [Streptococcus phage phi7917]
gi|451937259|gb|AGF87563.1| putative HNH endonuclease [Streptococcus phage phi891591]
Length = 147
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 72 LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATK 114
+C DH+VP +KGG S +N Q+ NR KSDK D +K
Sbjct: 76 MCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSK 119
>gi|76788004|ref|YP_329262.1| prophage LambdaSa03, HNH endonuclease [Streptococcus agalactiae
A909]
gi|76563061|gb|ABA45645.1| prophage LambdaSa03, HNH endonuclease family protein [Streptococcus
agalactiae A909]
Length = 124
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 72 LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATKLKG 117
L DHIVP +KGG S+ DN Q+ + NR KSDK ++ T ++
Sbjct: 57 LSAAIDHIVPIAKGGHPSSIDNLQLTHWQCNRQKSDKLFINQTAVRA 103
>gi|126660307|ref|ZP_01731421.1| hypothetical protein CY0110_31740 [Cyanothece sp. CCY0110]
gi|126618413|gb|EAZ89168.1| hypothetical protein CY0110_31740 [Cyanothece sp. CCY0110]
Length = 636
Score = 38.1 bits (87), Expect = 1.7, Method: Composition-based stats.
Identities = 25/78 (32%), Positives = 36/78 (46%), Gaps = 9/78 (11%)
Query: 46 GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
G+HPE K + ++ K+ C+ C L F E DHI P GG + DN Q+L
Sbjct: 497 GKHPEL--KKSVATLLKKQKGKCNHCGLTFMLGDLIETDHITPLKAGGNNHKDNLQVLHI 554
Query: 99 RVNRFKSDKEQVDATKLK 116
+ K+ K+ D K
Sbjct: 555 HCHDIKTRKDLRDIKSYK 572
>gi|451937008|gb|AGF87316.1| putative HNH endonuclease [Streptococcus phage phi20c]
Length = 115
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 72 LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATK 114
+C DH+VP +KGG S +N Q+ NR KSDK D +K
Sbjct: 44 MCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSK 87
>gi|302038913|ref|YP_003799235.1| hypothetical protein NIDE3631 [Candidatus Nitrospira defluvii]
gi|300606977|emb|CBK43310.1| conserved protein of unknown function, putative HNH endonuclease
[Candidatus Nitrospira defluvii]
Length = 249
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSDK 107
E W++D + C N H YDH +P+SKGG S +A N +IL + N KSDK
Sbjct: 175 EVWKRDQGKCVQCGFQKNLH------YDHDIPYSKGGSSLSAQNVRILCAKCNLEKSDK 227
>gi|330832281|ref|YP_004401106.1| HNH endonuclease [Streptococcus suis ST3]
gi|329306504|gb|AEB80920.1| HNH endonuclease [Streptococcus suis ST3]
Length = 115
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 72 LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATK 114
+C DH+VP +KGG S +N Q+ NR KSDK D +K
Sbjct: 44 MCAVIDHVVPLAKGGHPSLIENLQLAHMSCNRIKSDKLFADNSK 87
>gi|238926542|ref|ZP_04658302.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
gi|238885488|gb|EEQ49126.1| conserved hypothetical protein [Selenomonas flueggei ATCC 43531]
Length = 363
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 23/30 (76%)
Query: 77 DHIVPFSKGGESTADNCQILQTRVNRFKSD 106
DHI P+SKGG++T +NCQ+L NR KS+
Sbjct: 333 DHITPWSKGGKTTPENCQMLCAPCNRRKSN 362
>gi|401564832|ref|ZP_10805697.1| type III restriction enzyme, res subunit [Selenomonas sp. FOBRC6]
gi|400188432|gb|EJO22596.1| type III restriction enzyme, res subunit [Selenomonas sp. FOBRC6]
Length = 754
Score = 38.1 bits (87), Expect = 1.9, Method: Composition-based stats.
Identities = 20/47 (42%), Positives = 24/47 (51%), Gaps = 3/47 (6%)
Query: 63 KRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQ 109
K F HG E DHI P + GG+S DN Q+L N K DK +
Sbjct: 710 KVFSGRHG---LEVDHIKPMNDGGQSVLDNLQVLCKSCNASKGDKAE 753
>gi|86159768|ref|YP_466553.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-C]
gi|85776279|gb|ABC83116.1| HNH endonuclease [Anaeromyxobacter dehalogenans 2CP-C]
Length = 368
Score = 38.1 bits (87), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 24/47 (51%), Gaps = 1/47 (2%)
Query: 50 ERWRKDAAG-NIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQI 95
E W +D V C E DHIVP ++GG+STADN +I
Sbjct: 277 EVWARDGGRCTFVLASGERCESAHRLELDHIVPRARGGDSTADNLRI 323
>gi|425736696|ref|ZP_18854994.1| hypothetical protein C272_16183 [Brevibacterium casei S18]
gi|425477862|gb|EKU45079.1| hypothetical protein C272_16183 [Brevibacterium casei S18]
Length = 331
Score = 37.7 bits (86), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 22/36 (61%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E DHI P SKGG S +N Q L R NR KS+K
Sbjct: 295 LLLEVDHIKPISKGGLSEIENLQTLCWRCNRSKSNK 330
>gi|119492356|ref|ZP_01623692.1| RNA-directed DNA polymerase [Lyngbya sp. PCC 8106]
gi|119453136|gb|EAW34304.1| RNA-directed DNA polymerase [Lyngbya sp. PCC 8106]
Length = 511
Score = 37.7 bits (86), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 45/88 (51%), Gaps = 12/88 (13%)
Query: 46 GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
G+HPE ++ A+ ++ K+ C C L F E DHI+P SKGG + +N Q+L
Sbjct: 418 GKHPEMLKRTAS--LLKKQNGKCPKCGLSFKDGDLIELDHIIPKSKGGRNEYNNYQLLHR 475
Query: 99 RVNRFKSDKEQVDATKLKGFSCDVKFTD 126
+ +K ++D + KG +K +
Sbjct: 476 HCH---DEKTRIDGSLNKGSISQIKIPE 500
>gi|347525200|ref|YP_004831948.1| HNH endonuclease [Lactobacillus ruminis ATCC 27782]
gi|345284159|gb|AEN78012.1| HNH endonuclease [Lactobacillus ruminis ATCC 27782]
Length = 115
Score = 37.7 bits (86), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 77 DHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDATKLK 116
DHIVP SKGG S+ DN Q+ NR KSDK D K K
Sbjct: 53 DHIVPISKGGHPSSIDNLQLAHWTCNRQKSDKLYADGFKKK 93
>gi|158318135|ref|YP_001510643.1| stress protein [Frankia sp. EAN1pec]
gi|158113540|gb|ABW15737.1| stress protein [Frankia sp. EAN1pec]
Length = 560
Score = 37.7 bits (86), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 6/58 (10%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E W++D + C E+DH++P S+GG S+ +N Q+L R N K +
Sbjct: 508 EVWQRDGGACV------QCQATEYLEFDHVIPHSRGGASSVNNLQLLCRRCNLEKGAR 559
>gi|428779791|ref|YP_007171577.1| Retron-type reverse transcriptase [Dactylococcopsis salina PCC
8305]
gi|428694070|gb|AFZ50220.1| Retron-type reverse transcriptase [Dactylococcopsis salina PCC
8305]
Length = 594
Score = 37.7 bits (86), Expect = 2.1, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 9/59 (15%)
Query: 46 GRHPERWRKDAAGNIVCKRFCNCHGCLCF-------EYDHIVPFSKGGESTADNCQILQ 97
G+HPE ++ A+ ++ K+ C C F E DHIVP+SKGG+ N Q+L
Sbjct: 503 GKHPEMPKRTAS--LLKKQKGKCAHCGLFFRDGDLLEVDHIVPYSKGGKDEYKNWQLLH 559
>gi|342239735|gb|AEL19743.1| gp98 [Mycobacterium phage Larva]
Length = 101
Score = 37.7 bits (86), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
+ F+ DHI P S+GG T DN + NR K DK + + + F ++T
Sbjct: 47 MSFQIDHITPLSRGGTDTLDNVAACHRKCNRDKGDKLEAELSGAVTFITSREWT 100
>gi|15807725|ref|NP_285380.1| TerF-like protein [Deinococcus radiodurans R1]
gi|6460551|gb|AAF12257.1|AE001862_83 TerF-related protein [Deinococcus radiodurans R1]
Length = 579
Score = 37.7 bits (86), Expect = 2.1, Method: Composition-based stats.
Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 6/58 (10%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E W++D + C E+DH++P S GG S+ N Q+L R N KS++
Sbjct: 527 EVWQRDQGQCV------ECGDTNYLEFDHVIPHSLGGASSVGNLQLLCRRCNLAKSNR 578
>gi|159897861|ref|YP_001544108.1| HNH endonuclease [Herpetosiphon aurantiacus DSM 785]
gi|159890900|gb|ABX03980.1| HNH endonuclease [Herpetosiphon aurantiacus DSM 785]
Length = 321
Score = 37.7 bits (86), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 22/34 (64%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E+DHI+P SKGG +T N Q+L + N K D+
Sbjct: 287 LEFDHIIPHSKGGANTVGNVQLLCRKCNLAKGDR 320
>gi|303245412|ref|ZP_07331696.1| histidine triad (HIT) protein [Desulfovibrio fructosovorans JJ]
gi|302493261|gb|EFL53123.1| histidine triad (HIT) protein [Desulfovibrio fructosovorans JJ]
Length = 309
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKL 115
+ DHI P SKGG++ N Q+L +R NR K +KE D L
Sbjct: 138 LDVDHIKPRSKGGKNEYANLQVLCSRCNRAKGNKEDTDYRPL 179
>gi|407463091|ref|YP_006774408.1| HNH endonuclease [Candidatus Nitrosopumilus koreensis AR1]
gi|407046713|gb|AFS81466.1| HNH endonuclease [Candidatus Nitrosopumilus koreensis AR1]
Length = 228
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSD 106
E W++D + C N H +DH++P SKGG S TA+N QIL + N K D
Sbjct: 174 EVWKRDEGKCVKCGSKENLH------FDHVIPHSKGGTSITAENIQILCAKHNLEKRD 225
>gi|171914460|ref|ZP_02929930.1| HNH endonuclease [Verrucomicrobium spinosum DSM 4136]
Length = 193
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 16/81 (19%)
Query: 36 MCWAKAETVPGRHPER-----WRKDAAGNIVC----KRFCNCHGCLCFEYDHIVPFSKGG 86
+ A+ + VP R P+ W +D VC ++ G + DHI+P S+GG
Sbjct: 81 LVLARYDKVPKRRPKLSSRGIWDRDGG---VCQYTGRKLTRDEGNI----DHIMPRSRGG 133
Query: 87 ESTADNCQILQTRVNRFKSDK 107
+++ DNC + R+N K+D+
Sbjct: 134 KTSWDNCVLADKRINSRKADR 154
>gi|402829778|ref|ZP_10878651.1| HNH endonuclease domain protein [Slackia sp. CM382]
gi|402283445|gb|EJU31959.1| HNH endonuclease domain protein [Slackia sp. CM382]
Length = 322
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 21/36 (58%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E DHIVP SKGG + DN Q L + NR K K
Sbjct: 285 LLLEVDHIVPVSKGGMTKEDNLQTLCWKCNRSKGAK 320
>gi|358455199|ref|ZP_09165427.1| Protein of unknown function DUF2510 [Frankia sp. CN3]
gi|357081452|gb|EHI90883.1| Protein of unknown function DUF2510 [Frankia sp. CN3]
Length = 545
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 66 CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
CH E+DH++P S GG ++ +N Q+L R N K +
Sbjct: 503 AQCHATEYLEFDHVIPHSLGGATSVNNLQLLCRRCNLEKGAR 544
>gi|428169182|gb|EKX38118.1| hypothetical protein GUITHDRAFT_144431 [Guillardia theta CCMP2712]
Length = 320
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 3/65 (4%)
Query: 44 VPGRHPERWRKDAAGNIVCKRFCNCHGCLCF-EYDHIVPFSKGGESTADNCQILQTRVNR 102
+P + ++ D GNI+ G +C+ E DH+ P S+GG ++ N ++Q NR
Sbjct: 165 IPSWFHKLFKVDMWGNIIS--VDGLQGAMCYAEEDHLFPSSRGGRTSLANLVLIQFVANR 222
Query: 103 FKSDK 107
K +K
Sbjct: 223 KKKEK 227
>gi|440783576|ref|ZP_20961189.1| putative wall-associated protein [Clostridium pasteurianum DSM 525]
gi|440219319|gb|ELP58532.1| putative wall-associated protein [Clostridium pasteurianum DSM 525]
Length = 356
Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
++ DHI+P SKGG + DN +++ +NR K DK
Sbjct: 323 WQIDHIIPKSKGGTNGYDNARVISRHLNRLKWDK 356
>gi|434394103|ref|YP_007129050.1| HNH endonuclease [Gloeocapsa sp. PCC 7428]
gi|428265944|gb|AFZ31890.1| HNH endonuclease [Gloeocapsa sp. PCC 7428]
Length = 184
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 6/64 (9%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQ 109
E R+D N C+ +C H L DH++P S+GG T DN R N+ KSD+
Sbjct: 83 EVLRRD---NYTCQ-YCGSHKHLTL--DHVIPVSRGGLHTWDNVVTACERCNQRKSDRTP 136
Query: 110 VDAT 113
++A+
Sbjct: 137 IEAS 140
>gi|359424187|ref|ZP_09215309.1| hypothetical protein GOAMR_24_00140 [Gordonia amarae NBRC 15530]
gi|358240461|dbj|GAB04891.1| hypothetical protein GOAMR_24_00140 [Gordonia amarae NBRC 15530]
Length = 379
Score = 37.4 bits (85), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 62 CKRFCNCHGCLCFEY-----DHIVPFSKGGESTADNCQILQTRVNRFKSD 106
CK N G F+ DHIVP+SKGG++ A+N Q+L NR K D
Sbjct: 329 CKTVGNDDGQQVFDISQMDADHIVPWSKGGKTEAENGQMLCIGCNRSKGD 378
>gi|302669231|ref|YP_003832381.1| HNH endonuclease domain-containing protein [Butyrivibrio
proteoclasticus B316]
gi|302396895|gb|ADL35799.1| HNH endonuclease domain-containing protein [Butyrivibrio
proteoclasticus B316]
Length = 592
Score = 37.4 bits (85), Expect = 2.8, Method: Composition-based stats.
Identities = 17/36 (47%), Positives = 21/36 (58%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E DHI+P +KGG + DN Q L + NR K K
Sbjct: 553 LLLEIDHIIPIAKGGLTKEDNLQTLCWKCNRSKGSK 588
>gi|260887966|ref|ZP_05899229.1| putative HNH endonuclease domain protein [Selenomonas sputigena
ATCC 35185]
gi|330838550|ref|YP_004413130.1| hypothetical protein Selsp_0703 [Selenomonas sputigena ATCC 35185]
gi|260862217|gb|EEX76717.1| putative HNH endonuclease domain protein [Selenomonas sputigena
ATCC 35185]
gi|329746314|gb|AEB99670.1| protein of unknown function DUF262 [Selenomonas sputigena ATCC
35185]
Length = 374
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 75 EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E DHI+P+SKGG + DNCQ+L N K+DK
Sbjct: 341 EGDHIMPWSKGGHTQPDNCQMLCKSCNGKKTDK 373
>gi|153003686|ref|YP_001378011.1| HNH endonuclease [Anaeromyxobacter sp. Fw109-5]
gi|152027259|gb|ABS25027.1| HNH endonuclease [Anaeromyxobacter sp. Fw109-5]
Length = 355
Score = 37.4 bits (85), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 12/76 (15%)
Query: 24 ERPRYFDGKAKSMCWAKAETVPGR-HPERWRKDAAGNIVCKRFCNCHGCLC---FEYDHI 79
E+PR KS ++ +T+P E WR+ AG RF + C C EYDHI
Sbjct: 261 EKPR------KSRRGSRRDTIPAEVKREVWRR--AGGQCEWRFESGERCDCRRRLEYDHI 312
Query: 80 VPFSKGGESTADNCQI 95
P + GG ST DN ++
Sbjct: 313 EPLALGGASTMDNVRL 328
>gi|404483379|ref|ZP_11018602.1| hypothetical protein HMPREF1135_01662 [Clostridiales bacterium
OBRC5-5]
gi|404343652|gb|EJZ70013.1| hypothetical protein HMPREF1135_01662 [Clostridiales bacterium
OBRC5-5]
Length = 762
Score = 37.4 bits (85), Expect = 3.0, Method: Composition-based stats.
Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 9/58 (15%)
Query: 55 DAAGNIVCKRFCNCHGC-----LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
D AGN C C G + F+ DHI+ + GG+S +N QIL N KSDK
Sbjct: 709 DEAGNYRCA----CCGFTDRSRVPFQVDHIIAMNNGGKSVVENLQILCRSCNARKSDK 762
>gi|256375023|ref|YP_003098683.1| HNH nuclease [Actinosynnema mirum DSM 43827]
gi|255919326|gb|ACU34837.1| HNH nuclease [Actinosynnema mirum DSM 43827]
Length = 218
Score = 37.4 bits (85), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 66 CNCHGCLCFEYDHIVPFSKGGESTADNCQIL 96
+C G YDH+VP ++GG ST N ++L
Sbjct: 179 AHCGGATDLNYDHVVPLAEGGPSTVGNLRVL 209
>gi|172035554|ref|YP_001802055.1| putative reverse transcriptase [Cyanothece sp. ATCC 51142]
gi|354556987|ref|ZP_08976261.1| RNA-directed DNA polymerase (Reverse transcriptase) [Cyanothece sp.
ATCC 51472]
gi|171697008|gb|ACB49989.1| putative reverse transcriptase [Cyanothece sp. ATCC 51142]
gi|353551051|gb|EHC20474.1| RNA-directed DNA polymerase (Reverse transcriptase) [Cyanothece sp.
ATCC 51472]
Length = 607
Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats.
Identities = 40/172 (23%), Positives = 64/172 (37%), Gaps = 51/172 (29%)
Query: 15 SSNSNNSNEERPRYFDGKAKSMCWAKAETVP------------GRHPERWRKDAAGNIVC 62
++ S+N+ Y K++ K ET P G+HPE ++ A+ ++
Sbjct: 459 ATRSDNNPYTLAEYGSFKSQKYAKVKGETSPYDGNLVYWSTRMGKHPEMPKRTAS--MLK 516
Query: 63 KRFCNCHGCLCF-------EYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKL 115
K+ C C F E DHI+P SKGG+ N Q+L + K+ +
Sbjct: 517 KQKGKCTHCGLFFKDGDIIELDHIIPKSKGGKDEYKNWQLLHRHCHDKKTRTD------- 569
Query: 116 KGFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTV-AEMLGQFKSKDRWL 166
G + R GN+ C +V + L S RW+
Sbjct: 570 ----------------------GSLKRSGNKTDCNSVIPKQLSNIPSNYRWI 599
>gi|414581004|ref|ZP_11438144.1| hypothetical protein MA5S1215_2844 [Mycobacterium abscessus
5S-1215]
gi|420864922|ref|ZP_15328311.1| hypothetical protein MA4S0303_3285 [Mycobacterium abscessus
4S-0303]
gi|420869712|ref|ZP_15333094.1| hypothetical protein MA4S0726RA_3219 [Mycobacterium abscessus
4S-0726-RA]
gi|420874156|ref|ZP_15337532.1| hypothetical protein MA4S0726RB_2808 [Mycobacterium abscessus
4S-0726-RB]
gi|420881531|ref|ZP_15344898.1| hypothetical protein MA5S0304_2807 [Mycobacterium abscessus
5S-0304]
gi|420884715|ref|ZP_15348075.1| hypothetical protein MA5S0421_3062 [Mycobacterium abscessus
5S-0421]
gi|420891563|ref|ZP_15354910.1| hypothetical protein MA5S0422_3981 [Mycobacterium abscessus
5S-0422]
gi|420894868|ref|ZP_15358207.1| hypothetical protein MA5S0708_2734 [Mycobacterium abscessus
5S-0708]
gi|420901077|ref|ZP_15364408.1| hypothetical protein MA5S0817_2353 [Mycobacterium abscessus
5S-0817]
gi|420905963|ref|ZP_15369281.1| hypothetical protein MA5S1212_2490 [Mycobacterium abscessus
5S-1212]
gi|420974106|ref|ZP_15437297.1| hypothetical protein MA5S0921_3765 [Mycobacterium abscessus
5S-0921]
gi|421044511|ref|ZP_15507511.1| hypothetical protein MA4S0116S_2357 [Mycobacterium abscessus
4S-0116-S]
gi|392063638|gb|EIT89487.1| hypothetical protein MA4S0303_3285 [Mycobacterium abscessus
4S-0303]
gi|392065631|gb|EIT91479.1| hypothetical protein MA4S0726RB_2808 [Mycobacterium abscessus
4S-0726-RB]
gi|392069182|gb|EIT95029.1| hypothetical protein MA4S0726RA_3219 [Mycobacterium abscessus
4S-0726-RA]
gi|392078823|gb|EIU04650.1| hypothetical protein MA5S0422_3981 [Mycobacterium abscessus
5S-0422]
gi|392080478|gb|EIU06304.1| hypothetical protein MA5S0421_3062 [Mycobacterium abscessus
5S-0421]
gi|392086440|gb|EIU12265.1| hypothetical protein MA5S0304_2807 [Mycobacterium abscessus
5S-0304]
gi|392094180|gb|EIU19975.1| hypothetical protein MA5S0708_2734 [Mycobacterium abscessus
5S-0708]
gi|392098438|gb|EIU24232.1| hypothetical protein MA5S0817_2353 [Mycobacterium abscessus
5S-0817]
gi|392103867|gb|EIU29653.1| hypothetical protein MA5S1212_2490 [Mycobacterium abscessus
5S-1212]
gi|392116156|gb|EIU41924.1| hypothetical protein MA5S1215_2844 [Mycobacterium abscessus
5S-1215]
gi|392161989|gb|EIU87679.1| hypothetical protein MA5S0921_3765 [Mycobacterium abscessus
5S-0921]
gi|392233964|gb|EIV59462.1| hypothetical protein MA4S0116S_2357 [Mycobacterium abscessus
4S-0116-S]
Length = 92
Score = 37.4 bits (85), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 23/40 (57%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVD 111
L ++ DH++P SKGG T DN + NR KSDK D
Sbjct: 41 LSYQVDHLIPLSKGGTDTLDNKVPSHRQCNRDKSDKLPED 80
>gi|420989738|ref|ZP_15452894.1| hypothetical protein MA4S0206_3302 [Mycobacterium abscessus
4S-0206]
gi|421040540|ref|ZP_15503548.1| hypothetical protein MA4S0116R_3259 [Mycobacterium abscessus
4S-0116-R]
gi|392184017|gb|EIV09668.1| hypothetical protein MA4S0206_3302 [Mycobacterium abscessus
4S-0206]
gi|392221468|gb|EIV46991.1| hypothetical protein MA4S0116R_3259 [Mycobacterium abscessus
4S-0116-R]
Length = 90
Score = 37.4 bits (85), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 23/40 (57%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVD 111
L ++ DH++P SKGG T DN + NR KSDK D
Sbjct: 39 LSYQVDHLIPLSKGGTDTLDNKVPSHRQCNRDKSDKLPED 78
>gi|418753307|ref|ZP_13309558.1| RHS repeat-associated core domain protein [Leptospira santarosai
str. MOR084]
gi|409966309|gb|EKO34155.1| RHS repeat-associated core domain protein [Leptospira santarosai
str. MOR084]
Length = 893
Score = 37.0 bits (84), Expect = 3.4, Method: Composition-based stats.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
+++DHI P S GG++ + N QIL + NR KS+K
Sbjct: 860 WQFDHIKPKSCGGDNCSSNLQILSRQENRIKSNK 893
>gi|213692526|ref|YP_002323112.1| HNH nuclease [Bifidobacterium longum subsp. infantis ATCC 15697 =
JCM 1222]
gi|213523987|gb|ACJ52734.1| HNH nuclease [Bifidobacterium longum subsp. infantis ATCC 15697 =
JCM 1222]
Length = 110
Score = 37.0 bits (84), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 2/62 (3%)
Query: 72 LCFEYDHIVPFSKGGESTA-DNCQILQTRVNRFKSDKEQVDA-TKLKGFSCDVKFTDKEL 129
+ E D ++P S+GG+ + +NC++ R NR KSDK A + L G + K T
Sbjct: 46 MSAEVDELIPVSRGGDPLSFNNCRLTHRRCNRIKSDKTDAYARSHLGGAAAQPKATSIPF 105
Query: 130 DI 131
++
Sbjct: 106 EL 107
>gi|210623659|ref|ZP_03293968.1| hypothetical protein CLOHIR_01918 [Clostridium hiranonis DSM 13275]
gi|210153424|gb|EEA84430.1| hypothetical protein CLOHIR_01918 [Clostridium hiranonis DSM 13275]
Length = 912
Score = 37.0 bits (84), Expect = 3.7, Method: Composition-based stats.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSD 106
F+ DHIVP SKGG++ +N Q+L N K D
Sbjct: 658 FQIDHIVPLSKGGKTVVENLQLLTRWENAQKGD 690
>gi|417162963|ref|ZP_11998293.1| PF03235 family protein [Escherichia coli 99.0741]
gi|386173454|gb|EIH45466.1| PF03235 family protein [Escherichia coli 99.0741]
Length = 390
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 75 EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E DH+ +SKGGEST DNC++L NR K +K
Sbjct: 358 EADHVSAWSKGGESTLDNCEMLCITHNRSKGNK 390
>gi|227484896|ref|ZP_03915212.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC
51172]
gi|227237051|gb|EEI87066.1| conserved hypothetical protein [Anaerococcus lactolyticus ATCC
51172]
Length = 121
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 14/20 (70%), Positives = 19/20 (95%)
Query: 77 DHIVPFSKGGESTADNCQIL 96
DHIVP+SKGG++ A+NCQ+L
Sbjct: 91 DHIVPWSKGGKTIAENCQML 110
>gi|126326723|ref|XP_001378250.1| PREDICTED: e3 ubiquitin-protein ligase ZSWIM2 [Monodelphis
domestica]
Length = 648
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 54/145 (37%), Gaps = 26/145 (17%)
Query: 62 CKRFCNCHGCL--CFEYDHIVPFS-------KGGESTADNCQILQTRVNRFKSDKEQVDA 112
C + CH C C+ HI F K E + C + ++ N ++ + +
Sbjct: 261 CTEYHLCHECFISCYHPAHIFIFRQKRNQRWKSLEKFSGLCDLRRSIKNLELKNQTEKEM 320
Query: 113 TKL-KGFSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQF------------ 159
T K C K K L +I + + ++ PG+QCR A LGQ
Sbjct: 321 TNFEKSIVCTPKHVVKSLPLILITKHSKLLAPGHQCRLCLKAFYLGQHTRVLSCNHKFHR 380
Query: 160 KSKDRWLLANCHMMVNHCSTRGSCI 184
K D WLL C N C G +
Sbjct: 381 KCIDDWLLNIC----NSCPIDGQTV 401
>gi|83591793|ref|YP_425545.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
rubrum ATCC 11170]
gi|386348484|ref|YP_006046732.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
rubrum F11]
gi|83574707|gb|ABC21258.1| CRISPR-associated endonuclease, Csn1 family [Rhodospirillum rubrum
ATCC 11170]
gi|346716920|gb|AEO46935.1| CRISPR-associated endonuclease Csn1 family protein [Rhodospirillum
rubrum F11]
Length = 1173
Score = 37.0 bits (84), Expect = 4.1, Method: Composition-based stats.
Identities = 13/39 (33%), Positives = 24/39 (61%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDA 112
++ DH++P ++GG + DN + Q+ N+ K DK +A
Sbjct: 624 YDIDHVIPLARGGRDSLDNMVLCQSDANKTKGDKTPWEA 662
>gi|118443767|ref|YP_877830.1| HNH endonuclease domain-containing protein [Clostridium novyi NT]
gi|118134223|gb|ABK61267.1| HNH endonuclease domain protein [Clostridium novyi NT]
Length = 299
Score = 37.0 bits (84), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 21/36 (58%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L E DHI+P SKGG ST N Q L + NR K K
Sbjct: 262 LLLEIDHIIPISKGGMSTEKNLQTLCWKCNRKKGAK 297
>gi|291542181|emb|CBL15291.1| Restriction endonuclease [Ruminococcus bromii L2-63]
Length = 374
Score = 36.6 bits (83), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 5/39 (12%)
Query: 74 FEYD-----HIVPFSKGGESTADNCQILQTRVNRFKSDK 107
FEYD HI P+SKGG + DNCQ+L N K+DK
Sbjct: 335 FEYDEMEGDHIKPWSKGGRTILDNCQMLCKSCNAKKTDK 373
>gi|444913287|ref|ZP_21233440.1| hypothetical protein D187_05377 [Cystobacter fuscus DSM 2262]
gi|444716046|gb|ELW56903.1| hypothetical protein D187_05377 [Cystobacter fuscus DSM 2262]
Length = 43
Score = 36.6 bits (83), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 75 EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E DH++P SKGGE T +N Q+L N KS+K
Sbjct: 9 ERDHVIPKSKGGEGTPENGQVLCRECNLEKSNK 41
>gi|381159335|ref|ZP_09868567.1| restriction endonuclease [Thiorhodovibrio sp. 970]
gi|380877399|gb|EIC19491.1| restriction endonuclease [Thiorhodovibrio sp. 970]
Length = 361
Score = 36.6 bits (83), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 75 EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E DHI P+ +GG+++A+NCQ+L NR KS K
Sbjct: 329 EADHITPWHEGGKTSAENCQMLCKEDNRRKSGK 361
>gi|320538580|ref|ZP_08038435.1| HNH endonuclease domain protein [Treponema phagedenis F0421]
gi|320144569|gb|EFW36330.1| HNH endonuclease domain protein [Treponema phagedenis F0421]
Length = 361
Score = 36.6 bits (83), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%)
Query: 61 VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
+CK N E DHI+P+ GG++ +N Q+L NR KSDK
Sbjct: 315 ICKDCGNYFEIEEMEGDHIIPWKDGGKTIDENLQMLCKNCNRIKSDK 361
>gi|284008322|emb|CBA74691.1| conserved hypothetical protein [Arsenophonus nasoniae]
Length = 124
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 74 FEYDHIVPFSKGGESTADNCQIL 96
FE DHI+P +GG+ NCQIL
Sbjct: 89 FELDHIIPLWQGGQDVESNCQIL 111
>gi|384199724|ref|YP_005585467.1| phage protein [Bifidobacterium longum subsp. infantis ATCC 15697 =
JCM 1222]
gi|320458676|dbj|BAJ69297.1| hypothetical phage protein [Bifidobacterium longum subsp. infantis
ATCC 15697 = JCM 1222]
Length = 89
Score = 36.6 bits (83), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 32/59 (54%), Gaps = 2/59 (3%)
Query: 75 EYDHIVPFSKGGESTA-DNCQILQTRVNRFKSDKEQVDA-TKLKGFSCDVKFTDKELDI 131
E D ++P S+GG+ + +NC++ R NR KSDK A + L G + K T ++
Sbjct: 28 EVDELIPVSRGGDPLSFNNCRLTHRRCNRIKSDKTDAYARSHLGGAAAQPKATSIPFEL 86
>gi|339782334|gb|AEK08058.1| gp99 [Mycobacterium phage Anaya]
Length = 100
Score = 36.2 bits (82), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 19/34 (55%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
F+ DHI P GG T DN Q + NR KS+K
Sbjct: 43 FQVDHITPLDAGGSDTLDNTQPTHRKCNRDKSNK 76
>gi|71064769|ref|YP_263496.1| hypothetical protein Psyc_0189 [Psychrobacter arcticus 273-4]
gi|71037754|gb|AAZ18062.1| hypothetical protein Psyc_0189 [Psychrobacter arcticus 273-4]
Length = 585
Score = 36.2 bits (82), Expect = 6.0, Method: Composition-based stats.
Identities = 19/54 (35%), Positives = 26/54 (48%)
Query: 54 KDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
+D+ +C + L E DHI+P SKGG + N Q L + NR K K
Sbjct: 259 RDSYACKICNLSTSDEANLLLEIDHIIPLSKGGVTCESNLQTLCWKCNRSKGSK 312
>gi|126659072|ref|ZP_01730212.1| hypothetical protein CY0110_04663 [Cyanothece sp. CCY0110]
gi|126619600|gb|EAZ90329.1| hypothetical protein CY0110_04663 [Cyanothece sp. CCY0110]
Length = 636
Score = 36.2 bits (82), Expect = 6.0, Method: Composition-based stats.
Identities = 25/78 (32%), Positives = 36/78 (46%), Gaps = 9/78 (11%)
Query: 46 GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
G+HPE K + ++ K+ C+ C L F E DHI P GG + DN Q L
Sbjct: 497 GKHPEM--KTSVAILLKKQKGKCNHCGLTFMPGDKIETDHITPTKAGGNNQRDNLQALHI 554
Query: 99 RVNRFKSDKEQVDATKLK 116
+ K+ K+ +D K
Sbjct: 555 HCHDIKTRKDLIDIKSYK 572
>gi|304360852|ref|YP_003856973.1| gp95 [Mycobacterium phage Angelica]
gi|302858436|gb|ADL71184.1| gp95 [Mycobacterium phage Angelica]
gi|339753826|gb|AEJ93846.1| gp95 [Mycobacterium phage JAWS]
gi|339755861|gb|AEJ95864.1| gp95 [Mycobacterium phage Adephagia]
gi|347450197|gb|AEO94235.1| gp96 [Mycobacterium phage BarrelRoll]
Length = 100
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 19/34 (55%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
F+ DHI P GG T DN Q + NR KS+K
Sbjct: 43 FQVDHITPLDAGGSDTLDNTQPTHRKCNRDKSNK 76
>gi|153005759|ref|YP_001380084.1| HNH endonuclease [Anaeromyxobacter sp. Fw109-5]
gi|152029332|gb|ABS27100.1| HNH endonuclease [Anaeromyxobacter sp. Fw109-5]
Length = 336
Score = 36.2 bits (82), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 37/76 (48%), Gaps = 12/76 (15%)
Query: 24 ERPRYFDGKAKSMCWAKAETVPGR-HPERWRKDAAGNIVCKRFCNCHGCLC---FEYDHI 79
E+PR G ++ +T+P E WR+ AG RF + C C EYDHI
Sbjct: 242 EKPRKGRGGSRR------DTIPAEVKREVWRR--AGGQCEWRFESGERCDCRRRLEYDHI 293
Query: 80 VPFSKGGESTADNCQI 95
P + GG ST DN ++
Sbjct: 294 EPLALGGASTMDNVRL 309
>gi|332523012|ref|ZP_08399264.1| HNH endonuclease domain protein [Streptococcus porcinus str.
Jelinkova 176]
gi|332314276|gb|EGJ27261.1| HNH endonuclease domain protein [Streptococcus porcinus str.
Jelinkova 176]
Length = 117
Score = 36.2 bits (82), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 72 LCFEYDHIVPFSKGGE-STADNCQILQTRVNRFKSDKEQVDA 112
L DHIVP +KGG S+ DN Q+ + NR KSDK D
Sbjct: 51 LSPAIDHIVPIAKGGHPSSMDNLQLTHWQCNRQKSDKLYSDV 92
>gi|304360948|ref|YP_003857068.1| gp96 [Mycobacterium phage CrimD]
gi|302858697|gb|ADL71442.1| gp96 [Mycobacterium phage CrimD]
Length = 100
Score = 36.2 bits (82), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 19/34 (55%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
F+ DHI P GG T DN Q + NR KS+K
Sbjct: 43 FQVDHITPLDAGGSDTLDNTQPTHRKCNRDKSNK 76
>gi|434385792|ref|YP_007096403.1| Retron-type reverse transcriptase [Chamaesiphon minutus PCC 6605]
gi|428016782|gb|AFY92876.1| Retron-type reverse transcriptase [Chamaesiphon minutus PCC 6605]
Length = 605
Score = 36.2 bits (82), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 9/67 (13%)
Query: 38 WAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCF-------EYDHIVPFSKGGESTA 90
W + G+HPE K GN++ + C C + E DHI+P SKGG+
Sbjct: 496 WVYWSSRMGKHPEVSTK--VGNLLKTQKGKCAHCKNYFKDGDSLEVDHIIPKSKGGKDKY 553
Query: 91 DNCQILQ 97
+N Q+L
Sbjct: 554 ENWQLLH 560
>gi|406977333|gb|EKD99509.1| HNH endonuclease [uncultured bacterium]
Length = 233
Score = 36.2 bits (82), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 7/59 (11%)
Query: 50 ERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQILQTRVNRFKSDK 107
E W++D ++C N H +DH +PFSKGG S T N ++L + N KS K
Sbjct: 179 EVWKRDKGKCVICGATTNLH------FDHDLPFSKGGTSLTEKNVKLLCLKHNLEKSAK 231
>gi|227549709|ref|ZP_03979758.1| conserved hypothetical protein [Corynebacterium lipophiloflavum DSM
44291]
gi|227078205|gb|EEI16168.1| conserved hypothetical protein [Corynebacterium lipophiloflavum DSM
44291]
Length = 472
Score = 36.2 bits (82), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 75 EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
+ DH +P+ +GGE+T DN L R + FK+DK
Sbjct: 318 QLDHRIPYGQGGETTPDNLHCLCQRHHNFKTDK 350
>gi|126658771|ref|ZP_01729916.1| hypothetical protein CY0110_32455 [Cyanothece sp. CCY0110]
gi|126620033|gb|EAZ90757.1| hypothetical protein CY0110_32455 [Cyanothece sp. CCY0110]
Length = 199
Score = 36.2 bits (82), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 46 GRHPERWRKDAAGNIVCKRFCNCHGC-LCF------EYDHIVPFSKGGESTADNCQILQT 98
G+HPE K + ++ K+ C+ C L F E DHI P GG + DN Q L +
Sbjct: 60 GKHPEM--KKSVATLLKKQKGKCNHCGLTFMPGDKIETDHITPIKAGGNNLKDNLQALHS 117
Query: 99 RVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIR 142
+ K+ ++ D K K DK L E+ + ++I+
Sbjct: 118 HCHDTKTKRDLKDIKSYKS----QKIWDKTLK--EINFHFEIIK 155
>gi|453381798|dbj|GAC83531.1| hypothetical protein GP2_013_00080 [Gordonia paraffinivorans NBRC
108238]
Length = 91
Score = 35.8 bits (81), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 19/34 (55%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
F DH PF+ GG T DN Q R NR KSD+
Sbjct: 43 FVVDHKKPFAHGGADTIDNKQPAHNRCNRVKSDR 76
>gi|422012873|ref|ZP_16359511.1| HNH endonuclease domain protein [Actinomyces georgiae F0490]
gi|394752101|gb|EJF35815.1| HNH endonuclease domain protein [Actinomyces georgiae F0490]
Length = 94
Score = 35.8 bits (81), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 28/63 (44%), Gaps = 6/63 (9%)
Query: 51 RWRKDAAGNIVCKRFCNCHGCLCF------EYDHIVPFSKGGESTADNCQILQTRVNRFK 104
R RK I + CH C DH+VP SKGG ST DN ++ R N K
Sbjct: 13 RQRKRITDAIYIRDAARCHLCHQLVRRRDASVDHVVPSSKGGPSTLDNLKLAHRRCNFAK 72
Query: 105 SDK 107
++
Sbjct: 73 GNR 75
>gi|331003174|ref|ZP_08326682.1| hypothetical protein HMPREF0491_01544 [Lachnospiraceae oral taxon
107 str. F0167]
gi|330412930|gb|EGG92309.1| hypothetical protein HMPREF0491_01544 [Lachnospiraceae oral taxon
107 str. F0167]
Length = 762
Score = 35.8 bits (81), Expect = 7.9, Method: Composition-based stats.
Identities = 17/34 (50%), Positives = 21/34 (61%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
F+ DHIV + GG+S +N QIL N KSDK
Sbjct: 729 FQVDHIVAMNNGGKSVVENLQILCRSCNGEKSDK 762
>gi|312880866|ref|ZP_07740666.1| HNH endonuclease [Aminomonas paucivorans DSM 12260]
gi|310784157|gb|EFQ24555.1| HNH endonuclease [Aminomonas paucivorans DSM 12260]
Length = 345
Score = 35.8 bits (81), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 61 VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDA 112
+C R HG + DHIVP SKGG DN + L + N +S+ E+V +
Sbjct: 293 LCGRSAERHGVV-LHVDHIVPRSKGGTDEMDNLRTLCMKCNLGRSNLEEVQS 343
>gi|270593505|ref|ZP_06221465.1| hypothetical protein HAINFHK1212_1873, partial [Haemophilus
influenzae HK1212]
gi|270318413|gb|EFA29540.1| hypothetical protein HAINFHK1212_1873 [Haemophilus influenzae
HK1212]
Length = 161
Score = 35.8 bits (81), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E DH+ +SKGG ++A+NC++L T NR K +K
Sbjct: 128 MEADHVSAWSKGGTTSAENCEMLCTMHNRAKGNK 161
>gi|119484407|ref|ZP_01619024.1| hypothetical protein L8106_01777 [Lyngbya sp. PCC 8106]
gi|119457881|gb|EAW39004.1| hypothetical protein L8106_01777 [Lyngbya sp. PCC 8106]
Length = 142
Score = 35.8 bits (81), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 76 YDHIVPFSKGGESTADNCQILQTRVNRFKSD-KEQVD 111
YDHI+P SKGGE+ DN + N FKSD E +D
Sbjct: 37 YDHILPRSKGGETIFDNVCLACRTCNEFKSDITEAID 73
>gi|18496975|ref|NP_569825.1| hypothetical protein TM4_gp92 [Mycobacterium phage TM4]
Length = 92
Score = 35.8 bits (81), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 19/36 (52%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L F DHI P + GG T DN + NR KSDK
Sbjct: 41 LSFTIDHITPLALGGTDTLDNIGAAHRKCNRDKSDK 76
>gi|411008724|ref|ZP_11385053.1| HNH nuclease [Aeromonas aquariorum AAK1]
Length = 110
Score = 35.8 bits (81), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 41/102 (40%), Gaps = 9/102 (8%)
Query: 29 FDGKAKSMCWAKAETVPGRHPER--WRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGG 86
F + WAKA T G E +RKD + K G +E DHI P S GG
Sbjct: 3 FSQQTIDAVWAKA-THCGTENEGNGFRKDQCTAWIKKSSYGTTGTYGWEIDHITPVSSGG 61
Query: 87 ESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKE 128
N + L NR +SD +LK +VK D E
Sbjct: 62 SDNLSNLRPLHWANNRARSDG------RLKTEKPEVKSVDNE 97
>gi|363980955|gb|AAD17657.2| gp92 [Mycobacterium phage TM4]
Length = 93
Score = 35.8 bits (81), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 19/36 (52%)
Query: 72 LCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
L F DHI P + GG T DN + NR KSDK
Sbjct: 42 LSFTIDHITPLALGGTDTLDNIGAAHRKCNRDKSDK 77
>gi|331268551|ref|YP_004395043.1| putative wall-associated protein [Clostridium botulinum BKT015925]
gi|329125101|gb|AEB75046.1| putative wall-associated protein [Clostridium botulinum BKT015925]
Length = 301
Score = 35.8 bits (81), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
++ DHI+P SKGG ++ N +I+ + NR K DK
Sbjct: 268 WQIDHIIPKSKGGTNSFGNARIISLKWNRIKWDK 301
>gi|284097476|ref|ZP_06385569.1| HNH endonuclease [Candidatus Poribacteria sp. WGA-A3]
gi|283831012|gb|EFC35029.1| HNH endonuclease [Candidatus Poribacteria sp. WGA-A3]
Length = 171
Score = 35.8 bits (81), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 61 VCKRFCNCHGCL-CFEYDHIVPFSKGGESTADNCQILQTRVNRFK 104
+C R C H L E DHI P+S+GG++ +NCQ+L NR K
Sbjct: 125 ICPR-CGEHFVLGAMEADHIDPWSEGGKTAPENCQMLCRPCNRRK 168
>gi|196231796|ref|ZP_03130653.1| HNH endonuclease [Chthoniobacter flavus Ellin428]
gi|196224268|gb|EDY18781.1| HNH endonuclease [Chthoniobacter flavus Ellin428]
Length = 194
Score = 35.8 bits (81), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 77 DHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDA 112
DH+VP S+GG++T +NC + VN K+D+ +A
Sbjct: 123 DHVVPRSRGGKTTWENCVLAHREVNTRKADRLPQEA 158
>gi|406984375|gb|EKE05414.1| HNH endonuclease [uncultured bacterium]
Length = 90
Score = 35.8 bits (81), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%)
Query: 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E DHI P+ GG++ A+NCQ+L NR KS K
Sbjct: 57 MEADHIDPWHDGGKTIAENCQMLCKECNRRKSGK 90
>gi|302782555|ref|XP_002973051.1| hypothetical protein SELMODRAFT_98470 [Selaginella moellendorffii]
gi|300159652|gb|EFJ26272.1| hypothetical protein SELMODRAFT_98470 [Selaginella moellendorffii]
Length = 204
Score = 35.8 bits (81), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 48/91 (52%), Gaps = 4/91 (4%)
Query: 19 NNSNEERPRYFDGKAKSMCWAKAETV--PGRHPERWRKDAAGNIVCKRFCNCHGCLCFEY 76
++S +E P + K K + W + + + G P +R D+ GN++ + L +E
Sbjct: 85 SDSRKEHPSHLH-KFKDVLWQQLDHLGNAGFDPACFRVDSYGNVLYWH-ADPASPLAWEV 142
Query: 77 DHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
H P+S+GG++ +N ++ Q +V+ K +K
Sbjct: 143 CHWFPYSRGGKTVLNNLRLAQWQVSLKKKEK 173
>gi|147669720|ref|YP_001214538.1| HNH endonuclease [Dehalococcoides sp. BAV1]
gi|146270668|gb|ABQ17660.1| HNH endonuclease [Dehalococcoides sp. BAV1]
Length = 228
Score = 35.8 bits (81), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 7/46 (15%)
Query: 52 WRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGES-TADNCQIL 96
W +D I C N H +DHI+P+SKGG S A+N Q+L
Sbjct: 176 WARDQGKCIKCGISSNLH------FDHIIPYSKGGSSLVAENIQLL 215
>gi|406998393|gb|EKE16327.1| hypothetical protein ACD_11C00020G0032 [uncultured bacterium]
Length = 367
Score = 35.4 bits (80), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 75 EYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
E DHI P+ +GG++ A+NCQIL NR KS K
Sbjct: 335 EADHIKPWHEGGKTIANNCQILCKDDNRRKSGK 367
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.131 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,934,404,746
Number of Sequences: 23463169
Number of extensions: 113516347
Number of successful extensions: 627628
Number of sequences better than 100.0: 256
Number of HSP's better than 100.0 without gapping: 185
Number of HSP's successfully gapped in prelim test: 71
Number of HSP's that attempted gapping in prelim test: 627366
Number of HSP's gapped (non-prelim): 280
length of query: 189
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 55
effective length of database: 9,215,130,721
effective search space: 506832189655
effective search space used: 506832189655
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 72 (32.3 bits)