BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033363
         (121 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255546672|ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
 gi|223546492|gb|EEF47991.1| conserved hypothetical protein [Ricinus communis]
          Length = 608

 Score =  174 bits (440), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 77/94 (81%), Positives = 82/94 (87%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           VISD FTLCPDAK ATE   EEIEKII  LGLQKKRA MI+R SQEYL + WTHVTQLHG
Sbjct: 508 VISDFFTLCPDAKAATEAKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHG 567

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           VGKYAADA+AIFCTGKWD+VRP DHMLNYYW+FL
Sbjct: 568 VGKYAADAYAIFCTGKWDQVRPKDHMLNYYWDFL 601


>gi|297829352|ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328398|gb|EFH58817.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 435

 Score =  171 bits (433), Expect = 5e-41,   Method: Composition-based stats.
 Identities = 75/94 (79%), Positives = 81/94 (86%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           VI DLF LCPDAKTATEV+  EIE +I  LGLQKKRA MI+RFS EYL ESWTHVTQLHG
Sbjct: 336 VIEDLFALCPDAKTATEVEEREIESLIKPLGLQKKRARMIQRFSLEYLQESWTHVTQLHG 395

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKYAADA+AIFC G WDRV+P DHMLNYYWEFL
Sbjct: 396 IGKYAADAYAIFCNGNWDRVKPDDHMLNYYWEFL 429


>gi|356551311|ref|XP_003544020.1| PREDICTED: uncharacterized protein LOC100793802 [Glycine max]
          Length = 1505

 Score =  167 bits (423), Expect = 7e-40,   Method: Composition-based stats.
 Identities = 74/102 (72%), Positives = 83/102 (81%)

Query: 20   RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
            +V+S+ F LCPDAK+ T+V  EEIEK I TLG Q KRA M++R S+EYL ESWTHVTQLH
Sbjct: 1404 KVVSNFFKLCPDAKSCTQVTREEIEKTIKTLGFQHKRAEMLQRLSEEYLDESWTHVTQLH 1463

Query: 80   GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
            GVGKYAADA+AIF TG WDRV PTDHMLNYYWEFL S K  L
Sbjct: 1464 GVGKYAADAYAIFVTGMWDRVTPTDHMLNYYWEFLHSIKHEL 1505


>gi|296085641|emb|CBI29440.3| unnamed protein product [Vitis vinifera]
          Length = 599

 Score =  166 bits (421), Expect = 1e-39,   Method: Composition-based stats.
 Identities = 74/99 (74%), Positives = 85/99 (85%)

Query: 16  LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHV 75
           L+A RVISDLFTLCPDAKTAT+V  E IEK+I TLGLQKKRA MI+RFS+EYL +SWTHV
Sbjct: 489 LQASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHV 548

Query: 76  TQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           TQLHG+GKYAADA+AIFC+G W  V P DHML  YW++L
Sbjct: 549 TQLHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYL 587


>gi|224118080|ref|XP_002317727.1| predicted protein [Populus trichocarpa]
 gi|222858400|gb|EEE95947.1| predicted protein [Populus trichocarpa]
          Length = 107

 Score =  166 bits (421), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 74/106 (69%), Positives = 89/106 (83%)

Query: 16  LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHV 75
           ++A RV++DLFTLCPDAK AT V  EEIE+ I +LGLQK+RA M++R S++YL E WTHV
Sbjct: 1   MQAERVVADLFTLCPDAKAATGVATEEIERAIKSLGLQKRRAKMVQRLSEDYLEEDWTHV 60

Query: 76  TQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
           TQL GVGKYAADA+AIFCTGKW++VRP DHMLN YWE+L STK  L
Sbjct: 61  TQLPGVGKYAADAYAIFCTGKWEQVRPNDHMLNRYWEYLCSTKNAL 106


>gi|147837937|emb|CAN67143.1| hypothetical protein VITISV_044254 [Vitis vinifera]
          Length = 635

 Score =  166 bits (419), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 76/103 (73%), Positives = 87/103 (84%), Gaps = 1/103 (0%)

Query: 12  IAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGES 71
           + ILL A RVISDLFTLCPDAKTAT+V  E IEK+I TLGLQKKRA MI+RFS+EYL +S
Sbjct: 522 VGILL-ASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDS 580

Query: 72  WTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           WTHVTQLHG+GKYAADA+AIFC+G W  V P DHML  YW++L
Sbjct: 581 WTHVTQLHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYL 623


>gi|225469167|ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244192 [Vitis vinifera]
          Length = 536

 Score =  166 bits (419), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 74/99 (74%), Positives = 85/99 (85%)

Query: 16  LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHV 75
           L+A RVISDLFTLCPDAKTAT+V  E IEK+I TLGLQKKRA MI+RFS+EYL +SWTHV
Sbjct: 426 LQASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHV 485

Query: 76  TQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           TQLHG+GKYAADA+AIFC+G W  V P DHML  YW++L
Sbjct: 486 TQLHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYL 524


>gi|42572315|ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
 gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis thaliana]
 gi|332641100|gb|AEE74621.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
          Length = 445

 Score =  165 bits (418), Expect = 3e-39,   Method: Composition-based stats.
 Identities = 73/94 (77%), Positives = 79/94 (84%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           VISDLF LC DAKTATEV  EEIE +I  LGLQKKR  MI+R S EYL ESWTHVTQLHG
Sbjct: 346 VISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHG 405

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           VGKYAADA+AIFC G WDRV+P DHMLNYYW++L
Sbjct: 406 VGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 439


>gi|6648205|gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  165 bits (418), Expect = 3e-39,   Method: Composition-based stats.
 Identities = 73/94 (77%), Positives = 79/94 (84%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           VISDLF LC DAKTATEV  EEIE +I  LGLQKKR  MI+R S EYL ESWTHVTQLHG
Sbjct: 320 VISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHG 379

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           VGKYAADA+AIFC G WDRV+P DHMLNYYW++L
Sbjct: 380 VGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 413


>gi|27754344|gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  165 bits (417), Expect = 4e-39,   Method: Composition-based stats.
 Identities = 73/94 (77%), Positives = 79/94 (84%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           VISDLF LC DAKTATEV  EEIE +I  LGLQKKR  MI+R S EYL ESWTHVTQLHG
Sbjct: 308 VISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHG 367

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           VGKYAADA+AIFC G WDRV+P DHMLNYYW++L
Sbjct: 368 VGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 401


>gi|224129470|ref|XP_002320594.1| predicted protein [Populus trichocarpa]
 gi|222861367|gb|EEE98909.1| predicted protein [Populus trichocarpa]
          Length = 209

 Score =  136 bits (342), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 61/94 (64%), Positives = 71/94 (75%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           +++D FTLCPDAKT T VD  EI ++  +LG +  RA  IKR S+ YL E WTHVT L G
Sbjct: 108 ILNDFFTLCPDAKTTTNVDQNEIAQLTRSLGFKNTRAEKIKRLSEIYLQEDWTHVTFLPG 167

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           VGKYAADA+AIFCTG+WDRV P DHML  YWEFL
Sbjct: 168 VGKYAADAYAIFCTGRWDRVVPEDHMLTRYWEFL 201


>gi|224116426|ref|XP_002331937.1| predicted protein [Populus trichocarpa]
 gi|222875026|gb|EEF12157.1| predicted protein [Populus trichocarpa]
          Length = 249

 Score =  135 bits (340), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 61/94 (64%), Positives = 71/94 (75%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           +++D FTLCPDAKT T VD  EI ++  +LG +  RA  IKR S+ YL E WTHVT L G
Sbjct: 148 ILNDFFTLCPDAKTTTNVDQNEIAQLTRSLGFKNTRAEKIKRLSEIYLQEDWTHVTFLPG 207

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           VGKYAADA+AIFCTG+WDRV P DHML  YWEFL
Sbjct: 208 VGKYAADAYAIFCTGRWDRVVPEDHMLTRYWEFL 241


>gi|224114415|ref|XP_002332373.1| predicted protein [Populus trichocarpa]
 gi|222832177|gb|EEE70654.1| predicted protein [Populus trichocarpa]
          Length = 231

 Score =  135 bits (340), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 61/94 (64%), Positives = 71/94 (75%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           +++D FTLCPDAKT T VD  EI ++  +LG +  RA  IKR S+ YL E WTHVT L G
Sbjct: 130 ILNDFFTLCPDAKTTTNVDQNEIAQLTRSLGFKNTRAEKIKRLSEIYLQEDWTHVTFLPG 189

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           VGKYAADA+AIFCTG+WDRV P DHML  YWEFL
Sbjct: 190 VGKYAADAYAIFCTGRWDRVVPEDHMLTRYWEFL 223


>gi|449449218|ref|XP_004142362.1| PREDICTED: uncharacterized protein LOC101211755 [Cucumis sativus]
          Length = 488

 Score =  134 bits (338), Expect = 6e-30,   Method: Composition-based stats.
 Identities = 61/102 (59%), Positives = 71/102 (69%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
           +A  VI  LF+LCP+ K   EV  E+IE II  LG  +KR+  + R S+ YL ESW+HVT
Sbjct: 384 QAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVT 443

Query: 77  QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTK 118
           QL GVGKY ADA AIFCTG W  V P DHMLNYYW+FL S K
Sbjct: 444 QLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIK 485


>gi|168027579|ref|XP_001766307.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682521|gb|EDQ68939.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 622

 Score =  128 bits (321), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 55/99 (55%), Positives = 75/99 (75%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
           +VI D F +CP+A+T    + E+I  +I +LGLQ KRA  ++RFSQEYLG +WT+VTQLH
Sbjct: 461 KVIWDFFEMCPNAETCVSTETEKIGTVIHSLGLQNKRAKTLQRFSQEYLGTTWTNVTQLH 520

Query: 80  GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTK 118
           GVG+YA DA+AIFC G+W  V+P D +LN YW++L  T+
Sbjct: 521 GVGEYARDAYAIFCEGRWKEVQPKDIVLNDYWQWLNETQ 559


>gi|384251053|gb|EIE24531.1| DNA glycosylase [Coccomyxa subellipsoidea C-169]
          Length = 470

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 48/100 (48%), Positives = 63/100 (63%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
           +VI  LF L P  + A   D E + ++I  LGL  KRAPM+ RFS+EY+ + WT+  +LH
Sbjct: 215 KVIWKLFELMPTPEDAMHADVEAVRELIEPLGLAPKRAPMLIRFSKEYVEKQWTNPVELH 274

Query: 80  GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKG 119
           GVG+YAADA+ IFC G W  V P D  L  Y ++L  T G
Sbjct: 275 GVGQYAADAYNIFCRGDWRSVSPKDKDLLRYHQWLTETGG 314


>gi|115477908|ref|NP_001062549.1| Os09g0101100 [Oryza sativa Japonica Group]
 gi|113630782|dbj|BAF24463.1| Os09g0101100 [Oryza sativa Japonica Group]
          Length = 441

 Score =  102 bits (254), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 45/95 (47%), Positives = 64/95 (67%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
           R +   F   PDA+TA   D E++ K ++ LGLQ+ +   I+RFS+ Y+ E WT++T+L 
Sbjct: 323 RKVKGFFKRYPDAQTAFSADPEKMAKYLAPLGLQRVKVNRIQRFSKAYVEEEWTYITELC 382

Query: 80  GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           GVGKYAADA+AIFC G+   V P DH L  YW+++
Sbjct: 383 GVGKYAADAYAIFCAGRATEVVPADHKLVDYWKYV 417


>gi|222641036|gb|EEE69168.1| hypothetical protein OsJ_28331 [Oryza sativa Japonica Group]
          Length = 452

 Score =  102 bits (254), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 45/95 (47%), Positives = 64/95 (67%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
           R +   F   PDA+TA   D E++ K ++ LGLQ+ +   I+RFS+ Y+ E WT++T+L 
Sbjct: 323 RKVKGFFKRYPDAQTAFSADPEKMAKYLAPLGLQRVKVNRIQRFSKAYVEEEWTYITELC 382

Query: 80  GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           GVGKYAADA+AIFC G+   V P DH L  YW+++
Sbjct: 383 GVGKYAADAYAIFCAGRATEVVPADHKLVDYWKYV 417


>gi|413941982|gb|AFW74631.1| hypothetical protein ZEAMMB73_619521 [Zea mays]
          Length = 858

 Score =  101 bits (252), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 43/93 (46%), Positives = 64/93 (68%)

Query: 22  ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGV 81
           +   F   PDA+TA   D E++ + ++ LGLQ+ +   I++FS+ Y+GE WT++T+L GV
Sbjct: 734 VKGFFKRYPDAQTAYTADPEKMAEYLAPLGLQRVKTNRIQKFSKAYVGEEWTYITELCGV 793

Query: 82  GKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           GKYAADA+AIFC G+ + V P DH L  YW ++
Sbjct: 794 GKYAADAYAIFCAGRANEVVPKDHKLVDYWNYV 826


>gi|302812133|ref|XP_002987754.1| hypothetical protein SELMODRAFT_6975 [Selaginella moellendorffii]
 gi|302821198|ref|XP_002992263.1| hypothetical protein SELMODRAFT_4607 [Selaginella moellendorffii]
 gi|300139913|gb|EFJ06644.1| hypothetical protein SELMODRAFT_4607 [Selaginella moellendorffii]
 gi|300144373|gb|EFJ11057.1| hypothetical protein SELMODRAFT_6975 [Selaginella moellendorffii]
          Length = 139

 Score =  100 bits (248), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 60/98 (61%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
           +  +++ D F L P  +   +  AE +E +I  LGL KKR  +IKRFS EYL   WT ++
Sbjct: 36  QTKKMLWDFFKLFPGPQHLIDAPAEMVEPLIKPLGLHKKRLELIKRFSHEYLYTDWTDIS 95

Query: 77  QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           QLHGVG YA DA+ IF  G+W  V P D ML  YW +L
Sbjct: 96  QLHGVGTYAEDAYMIFVEGRWKEVIPDDAMLEQYWNWL 133


>gi|218201632|gb|EEC84059.1| hypothetical protein OsI_30334 [Oryza sativa Indica Group]
          Length = 455

 Score = 99.4 bits (246), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 44/95 (46%), Positives = 62/95 (65%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
           R +   F   PDA+ A   D E++ K ++ LGLQ  +   I+RFS+ Y+ E WT++T+L 
Sbjct: 326 RKVKGFFKRYPDAQAAFSADPEKMAKYLAPLGLQHVKVNRIQRFSKAYVEEEWTYITELC 385

Query: 80  GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           GVGKYAADA+AIFC G+   V P DH L  YW+++
Sbjct: 386 GVGKYAADAYAIFCAGRAPEVVPADHKLVDYWKYV 420


>gi|242039843|ref|XP_002467316.1| hypothetical protein SORBIDRAFT_01g024773 [Sorghum bicolor]
 gi|241921170|gb|EER94314.1| hypothetical protein SORBIDRAFT_01g024773 [Sorghum bicolor]
          Length = 809

 Score = 97.4 bits (241), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 40/93 (43%), Positives = 63/93 (67%)

Query: 22  ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGV 81
           ++  F   PD +TA   D +++ + ++ LG Q+ +   I++FS+ Y+GE WT++T+L GV
Sbjct: 691 VNGFFERYPDPQTAYRADPKKMAEYLAPLGFQRVKTKRIQKFSKAYVGEEWTYITELCGV 750

Query: 82  GKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           GKYAADA+AIFC G+ + V P DH L  YW ++
Sbjct: 751 GKYAADAYAIFCAGRANEVVPKDHKLVDYWNYV 783


>gi|326533618|dbj|BAK05340.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1137

 Score = 97.4 bits (241), Expect = 9e-19,   Method: Composition-based stats.
 Identities = 43/99 (43%), Positives = 62/99 (62%)

Query: 16   LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHV 75
            ++  R++   F   PD  +A   D +++ + ++ LGL   R   IK+ S++Y+G  WTHV
Sbjct: 1029 IQVKRMLEGFFERYPDPLSAINADPDKMAEYLAPLGLNNVRTRNIKKLSKQYVGNEWTHV 1088

Query: 76   TQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
            TQL GVGKYAADA+AIFC G+   V P DH L  YW ++
Sbjct: 1089 TQLCGVGKYAADAYAIFCAGRAREVVPDDHKLVDYWNYV 1127


>gi|242039839|ref|XP_002467314.1| hypothetical protein SORBIDRAFT_01g024706 [Sorghum bicolor]
 gi|241921168|gb|EER94312.1| hypothetical protein SORBIDRAFT_01g024706 [Sorghum bicolor]
          Length = 346

 Score = 96.3 bits (238), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 40/94 (42%), Positives = 63/94 (67%)

Query: 22  ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGV 81
           ++  F   PD +TA   D +++ + ++ LG Q+ +   I++FS+ Y+GE WT++T+L GV
Sbjct: 206 VNGFFERYPDPQTAYRADPKKMAEYLAPLGFQRVKTKRIQKFSKAYVGEEWTYITELCGV 265

Query: 82  GKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLV 115
           GKYAADA+AIFC G+ + V P DH L  YW ++ 
Sbjct: 266 GKYAADAYAIFCAGRANEVVPKDHKLVDYWNYVC 299


>gi|340375489|ref|XP_003386267.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Amphimedon
           queenslandica]
          Length = 251

 Score = 94.4 bits (233), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 58/94 (61%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           ++   F      + AT  + ++I  +I  LGL  KRA MI RFS EYL   W +  +LHG
Sbjct: 145 ILWKFFDCYSTPEEATAANWKDIASMIEPLGLHIKRAKMIIRFSGEYLSNDWVYPIELHG 204

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY  D++ IFCT +W  VRP DHMLN Y+E+L
Sbjct: 205 IGKYGNDSYRIFCTEEWKEVRPNDHMLNKYYEWL 238


>gi|307111443|gb|EFN59677.1| hypothetical protein CHLNCDRAFT_14045, partial [Chlorella
           variabilis]
          Length = 133

 Score = 93.6 bits (231), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 61/95 (64%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
           +VI  LF LCP    A   D ++++ +I  LGL +KRA  I++ SQ+YL + W   T+L+
Sbjct: 39  KVIWQLFALCPTPAAAIAADVQQVQALIQPLGLFRKRAAAIQQLSQDYLYKQWRDPTELY 98

Query: 80  GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           G+GKYAADA+ +FC G+W  V P D  L  Y ++L
Sbjct: 99  GIGKYAADAYHMFCRGRWREVAPEDKDLRRYRDWL 133


>gi|291236887|ref|XP_002738372.1| PREDICTED: methyl-CpG binding domain protein 4-like [Saccoglossus
           kowalevskii]
          Length = 862

 Score = 92.8 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 40/94 (42%), Positives = 60/94 (63%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           ++   FT  P A      D +EI  +I  LGL  KRA M+ RFS+EY+ ++W +  +L+G
Sbjct: 760 ILWQFFTKWPSAVETINADWKEIANLIHPLGLYDKRAKMLIRFSEEYIHKNWKYPIELYG 819

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY  D++ IFC  +W +V+P DHMLN Y ++L
Sbjct: 820 IGKYGNDSYRIFCVNEWKQVKPEDHMLNKYHDWL 853


>gi|443723536|gb|ELU11913.1| hypothetical protein CAPTEDRAFT_102919, partial [Capitella teleta]
          Length = 145

 Score = 91.7 bits (226), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 57/90 (63%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+   F L P A+     + EEI +++  LGL   RA  + RFS+EYL  +WT+  +LHG
Sbjct: 42  VLWTFFDLWPSAEATKTANWEEISELLQPLGLHVSRAHTLIRFSEEYLNVNWTYPIELHG 101

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
           +GKY  D++ IFC G+W +V P+DH LN Y
Sbjct: 102 IGKYGNDSYRIFCLGEWKQVAPSDHKLNDY 131


>gi|357166912|ref|XP_003580913.1| PREDICTED: uncharacterized protein LOC100825070 [Brachypodium
           distachyon]
          Length = 606

 Score = 89.7 bits (221), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 38/95 (40%), Positives = 61/95 (64%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
           +++   F   PD  +A   D +++E  ++ LGLQ  +   I++ S++Y+ E W +VT+L 
Sbjct: 471 KIVDGFFERYPDPVSAFNADPKKMEAYLAPLGLQSVKTRNIQKLSKQYVEEDWKYVTELC 530

Query: 80  GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           GVGKYAADA++IFC G+   V P DH L  YW+++
Sbjct: 531 GVGKYAADAYSIFCAGRATEVVPEDHKLVDYWKYV 565


>gi|405960128|gb|EKC26075.1| Methyl-CpG-binding domain protein 4 [Crassostrea gigas]
          Length = 826

 Score = 89.0 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 37/85 (43%), Positives = 58/85 (68%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P+  TA + D + + ++++ LGL +KRA +I RFS E+L + W +  +LHG+GKY  D++
Sbjct: 732 PNPDTARKGDWQAMARLLAPLGLHEKRAKIIIRFSDEFLTKDWKYPIELHGIGKYGNDSY 791

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V PTDH LN Y ++L
Sbjct: 792 RIFCVNEWRQVTPTDHKLNDYHKWL 816


>gi|302843300|ref|XP_002953192.1| hypothetical protein VOLCADRAFT_35340 [Volvox carteri f.
           nagariensis]
 gi|300261579|gb|EFJ45791.1| hypothetical protein VOLCADRAFT_35340 [Volvox carteri f.
           nagariensis]
          Length = 133

 Score = 88.2 bits (217), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 58/95 (61%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
           +V+  L+   P  +     D   +E I+  LGLQ +RA  + RFS+E+L   WT  TQL+
Sbjct: 39  QVLGPLWRSYPTPEAMAAADPRVVEDILRPLGLQVRRAERLVRFSEEFLSRQWTCPTQLY 98

Query: 80  GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           GVG+YAADA+ IFC G+W  V+P D  L  Y ++L
Sbjct: 99  GVGRYAADAYLIFCKGRWREVQPADKDLRRYRDWL 133


>gi|113205660|ref|NP_001037916.1| methyl-CpG binding domain protein 4 [Xenopus (Silurana) tropicalis]
 gi|89272695|emb|CAJ81414.1| methyl-CpG binding domain protein 4 [Xenopus (Silurana) tropicalis]
          Length = 472

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 55/85 (64%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P+ + A   D +E+ +++  LGL + RA  I RFS EYL + W +  +LHG+GKY  D++
Sbjct: 378 PNPEVARAADWKEMAELLQPLGLYELRAKAIVRFSDEYLTKKWRYPIELHGIGKYGNDSY 437

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V+P DH LN+Y  +L
Sbjct: 438 RIFCVNEWKQVQPQDHKLNFYHTWL 462


>gi|432866392|ref|XP_004070828.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Oryzias
           latipes]
          Length = 440

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 55/94 (58%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+   F   P A    + D + + +++  LGL + RA  + RFS EYL + W +  +LHG
Sbjct: 338 VLWQFFERYPSADVTRDADWKPMSELMKPLGLYELRAKTLIRFSDEYLTKQWRYPIELHG 397

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY  D++ IFC G+W  V P DHMLN Y  +L
Sbjct: 398 IGKYGNDSYRIFCVGEWKEVTPEDHMLNKYHAWL 431


>gi|149633502|ref|XP_001505332.1| PREDICTED: methyl-CpG-binding domain protein 4-like
           [Ornithorhynchus anatinus]
          Length = 108

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 57/101 (56%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ +     P A+ A   D +E+ +++  LGL   RA  I +FS EYL + W +  +LHG
Sbjct: 5   VLWEFLEKYPSAEVARTADWKEVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHG 64

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
           +GKY  D++ IFC  +W +V P DH LN Y ++L      L
Sbjct: 65  IGKYGNDSYRIFCINEWKQVHPEDHKLNKYHDWLWENHEKL 105


>gi|298706383|emb|CBJ29392.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 419

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 38/77 (49%), Positives = 56/77 (72%)

Query: 42  EIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTGKWDRVR 101
           E+ ++++ LGLQ++R   I RFSQEYL ++WT+V +L+ VGKYAADA  IF   KW  V+
Sbjct: 308 EMTRVVAPLGLQERRPIAIIRFSQEYLSKAWTNVKELYWVGKYAADAHKIFIERKWREVQ 367

Query: 102 PTDHMLNYYWEFLVSTK 118
           P DH LN++ E++  T+
Sbjct: 368 PDDHALNWWVEWMRGTQ 384


>gi|440899997|gb|ELR51228.1| Methyl-CpG-binding domain protein 4, partial [Bos grunniens mutus]
          Length = 552

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 458 PSAEAARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 517

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 518 RIFCVNEWKQVHPEDHKLNKYHDWL 542


>gi|291393500|ref|XP_002713197.1| PREDICTED: methyl-CpG binding domain protein 4 [Oryctolagus
           cuniculus]
          Length = 561

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 467 PSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHGIGKYGNDSY 526

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 527 RIFCVNEWKQVHPEDHKLNKYHDWL 551


>gi|354490514|ref|XP_003507402.1| PREDICTED: methyl-CpG-binding domain protein 4 [Cricetulus griseus]
 gi|344256524|gb|EGW12628.1| Methyl-CpG-binding domain protein 4 [Cricetulus griseus]
          Length = 560

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 466 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 525

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 526 RIFCVNEWKQVHPEDHKLNKYHDWL 550


>gi|50401208|sp|Q9Z2D7.1|MBD4_MOUSE RecName: Full=Methyl-CpG-binding domain protein 4; AltName:
           Full=Methyl-CpG-binding protein MBD4; AltName:
           Full=Mismatch-specific DNA N-glycosylase
 gi|5929746|gb|AAD56595.1|AF120996_1 methyl-CpG binding protein 4 [Mus musculus domesticus]
 gi|3800807|gb|AAC68878.1| methyl-CpG binding protein MBD4 [Mus musculus]
          Length = 554

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 460 PSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 519

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544


>gi|187960081|ref|NP_034904.2| methyl-CpG-binding domain protein 4 [Mus musculus]
 gi|19353797|gb|AAH24812.1| Methyl-CpG binding domain protein 4 [Mus musculus]
          Length = 554

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 460 PSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 519

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544


>gi|77735765|ref|NP_001029577.1| methyl-CpG-binding domain protein 4 [Bos taurus]
 gi|73586836|gb|AAI03442.1| Methyl-CpG binding domain protein 4 [Bos taurus]
          Length = 554

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 51/85 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ + +  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 460 PSAEAARTADWRDVSEFLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHGIGKYGNDSY 519

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544


>gi|395516662|ref|XP_003762506.1| PREDICTED: methyl-CpG-binding domain protein 4 [Sarcophilus
           harrisii]
          Length = 359

 Score = 83.2 bits (204), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 56/94 (59%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ +     P A+ A   D +E+ +++  LGL   RA  I +FS EYL + W +  +LHG
Sbjct: 256 VLWEFLEKYPSAEVARAADWKEVSELLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHG 315

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY  D++ IFC  +W +V P DH LN Y ++L
Sbjct: 316 IGKYGNDSYRIFCVNEWKQVHPQDHKLNKYHDWL 349


>gi|148667124|gb|EDK99540.1| methyl-CpG binding domain protein 4 [Mus musculus]
          Length = 554

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 460 PSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 519

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544


>gi|296474714|tpg|DAA16829.1| TPA: methyl-CpG binding domain protein 4 [Bos taurus]
          Length = 554

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 51/85 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ + +  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 460 PSAEAARTADWRDVSEFLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHGIGKYGNDSY 519

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544


>gi|449274057|gb|EMC83362.1| Methyl-CpG-binding domain protein 4 [Columba livia]
          Length = 188

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 57/101 (56%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ + F   P  +     D +E+ +++  LGL   RA  I +FS EYL + W +  +LHG
Sbjct: 85  VLWEFFRKYPSPEVTRAADWKEMSELLKPLGLYALRAKTIIKFSDEYLSKRWRYPIELHG 144

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
           +GKY  D++ IFC  +W  V+P DH LN Y  +L   + +L
Sbjct: 145 IGKYGNDSYRIFCVNEWKEVQPQDHKLNVYHTWLRENQESL 185


>gi|344276433|ref|XP_003410013.1| PREDICTED: methyl-CpG-binding domain protein 4 [Loxodonta africana]
          Length = 566

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 472 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 531

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 532 RIFCIDEWKQVHPEDHKLNKYHDWL 556


>gi|334338449|ref|XP_001376329.2| PREDICTED: methyl-CpG-binding domain protein 4-like [Monodelphis
           domestica]
          Length = 486

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 53/85 (62%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D +E+ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 392 PSAEVARTADWKEVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 451

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 452 RIFCVNEWKQVHPEDHKLNKYHDWL 476


>gi|417411695|gb|JAA52275.1| Putative methyl-cpg binding transcription regulator, partial
           [Desmodus rotundus]
          Length = 570

 Score = 82.8 bits (203), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 34/85 (40%), Positives = 53/85 (62%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  +GL + RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 476 PSAEAARTADWRDVSELLKPIGLYELRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 535

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 536 RIFCVNEWKQVHPEDHKLNKYHDWL 560


>gi|348502661|ref|XP_003438886.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Oreochromis
           niloticus]
          Length = 385

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 36/94 (38%), Positives = 56/94 (59%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+   F   P A+   + D + + +++  LGL + RA  + RFS EYL + W +  +LHG
Sbjct: 283 VLWQFFERYPSAEVTRQADWKPMSELMKPLGLYELRAKTLIRFSDEYLTKQWRYPIELHG 342

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY  D++ IFC  +W +V+P DH LN Y  +L
Sbjct: 343 IGKYGNDSYRIFCINEWRQVKPDDHKLNKYHAWL 376


>gi|156398265|ref|XP_001638109.1| predicted protein [Nematostella vectensis]
 gi|156398269|ref|XP_001638111.1| predicted protein [Nematostella vectensis]
 gi|156225227|gb|EDO46046.1| predicted protein [Nematostella vectensis]
 gi|156225229|gb|EDO46048.1| predicted protein [Nematostella vectensis]
          Length = 143

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 57/94 (60%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           ++ + F   PDA  A++ D   I  ++  LGL +KRA  + +FS E+L   W +   L+G
Sbjct: 41  IMWEFFRRYPDAAEASKADPGPISDLLRPLGLHEKRAKALVQFSAEFLSRDWIYPDTLYG 100

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY +D++ IF  G+W  V+P DH LN Y ++L
Sbjct: 101 IGKYGSDSYRIFFLGEWKDVKPEDHKLNLYHDWL 134


>gi|115725067|ref|XP_783908.2| PREDICTED: uncharacterized protein LOC578658 [Strongylocentrotus
           purpuratus]
          Length = 550

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 50/77 (64%)

Query: 38  VDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTGKW 97
            D + I  +I  LGL  KRA M+ +FS E+L + WT+  +L G+GKY  D++ IFC  +W
Sbjct: 465 ADWQSIADLIQPLGLHTKRAKMLIQFSDEFLTKDWTYPIELSGIGKYGNDSYRIFCVNEW 524

Query: 98  DRVRPTDHMLNYYWEFL 114
             V+P DHMLN Y ++L
Sbjct: 525 KEVKPQDHMLNKYHDWL 541


>gi|149728428|ref|XP_001490372.1| PREDICTED: methyl-CpG-binding domain protein 4 [Equus caballus]
          Length = 592

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I RFS EYL + W +  +LHG+GKY  D++
Sbjct: 498 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIRFSDEYLTKQWRYPIELHGIGKYGNDSY 557

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 558 RIFCINEWKQVHPEDHKLNKYHDWL 582


>gi|410951752|ref|XP_003982557.1| PREDICTED: methyl-CpG-binding domain protein 4 [Felis catus]
          Length = 566

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 472 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 531

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 532 RIFCVNEWKQVHPEDHKLNKYHDWL 556


>gi|74190664|dbj|BAE28134.1| unnamed protein product [Mus musculus]
          Length = 108

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 56/101 (55%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ +     P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG
Sbjct: 5   VLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHG 64

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
           +GKY  D++ IFC  +W +V P DH LN Y ++L      L
Sbjct: 65  IGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLWENHEKL 105


>gi|45382709|ref|NP_990024.1| methyl-CpG binding domain protein 4 [Gallus gallus]
 gi|7739720|gb|AAF68981.1|AF257107_1 5-methylcytosine G/T mismatch-specific DNA glycosylase [Gallus
           gallus]
          Length = 416

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 51/85 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P  + A   D +E+ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 322 PSPEVARTADWKEMSELLRPLGLYALRAKTIIKFSDEYLNKQWKYPIELHGIGKYGNDSY 381

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W  V+P DH LN Y  +L
Sbjct: 382 RIFCVNEWKEVQPQDHKLNIYHAWL 406


>gi|390475366|ref|XP_002758735.2| PREDICTED: methyl-CpG-binding domain protein 4 [Callithrix jacchus]
          Length = 566

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 472 PSAEVARNADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 531

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 532 RIFCVNEWKQVHPEDHKLNKYHDWL 556


>gi|403268287|ref|XP_003926209.1| PREDICTED: methyl-CpG-binding domain protein 4 [Saimiri boliviensis
           boliviensis]
          Length = 564

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 470 PSAEVARNADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 529

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 530 RIFCVNEWKQVHPEDHKLNKYHDWL 554


>gi|345786313|ref|XP_003432809.1| PREDICTED: methyl-CpG-binding domain protein 4 [Canis lupus
           familiaris]
          Length = 570

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 476 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 535

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 536 RIFCINEWKQVHPEDHKLNKYHDWL 560


>gi|456754090|gb|JAA74218.1| methyl-CpG binding domain protein 4 [Sus scrofa]
          Length = 567

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 473 PSAEVARTADWRDVSELLKPLGLYDHRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 532

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 533 RIFCVNEWKQVHPEDHKLNKYHDWL 557


>gi|410899959|ref|XP_003963464.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Takifugu
           rubripes]
          Length = 257

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 55/97 (56%)

Query: 25  LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKY 84
            F   P A+     + + + +++  LGL + RA  + RFS+EYL + W +  +LHG+GKY
Sbjct: 159 FFERYPSAEVTRGAEWKPMSELMKPLGLYELRAKALIRFSEEYLTKQWRYPVELHGIGKY 218

Query: 85  AADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
             D++ IFC G+W  V P DHMLN Y  +L      L
Sbjct: 219 GNDSYRIFCLGEWREVTPEDHMLNKYHAWLWENHATL 255


>gi|355701349|gb|AES01654.1| methyl-CpG binding domain protein 4 [Mustela putorius furo]
          Length = 473

 Score = 82.0 bits (201), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 379 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 438

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 439 RIFCVNEWKQVHPEDHKLNKYHDWL 463


>gi|348554822|ref|XP_003463224.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Cavia
           porcellus]
          Length = 670

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 53/85 (62%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL + RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 576 PSAEVARAADWRDVSELLKPLGLYELRAKTIVKFSDEYLTKHWKYPIELHGIGKYGNDSY 635

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 636 RIFCVNEWKQVHPQDHKLNKYHDWL 660


>gi|29726680|pdb|1NGN|A Chain A, Mismatch Repair In Methylated Dna. Structure Of The
           Mismatch-Specific Thymine Glycosylase Domain Of
           Methyl-Cpg- Binding Protein Mbd4
          Length = 155

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 55/94 (58%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ +     P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG
Sbjct: 52  VLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHG 111

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY  D++ IFC  +W +V P DH LN Y ++L
Sbjct: 112 IGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWL 145


>gi|260781241|ref|XP_002585728.1| hypothetical protein BRAFLDRAFT_119982 [Branchiostoma floridae]
 gi|229270764|gb|EEN41739.1| hypothetical protein BRAFLDRAFT_119982 [Branchiostoma floridae]
          Length = 858

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 37/105 (35%), Positives = 61/105 (58%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
           KA  ++ + F   P  +   + D + I +++  LGL +KRA  + RFS E+L + W +  
Sbjct: 752 KAIPLMWEFFRRWPSPEVTRDADWKPICELLQPLGLYEKRAKTLIRFSDEFLTKDWAYPD 811

Query: 77  QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
           +L+G+GKY  D++ IFC  +W +V P+DH LN Y  +L+     L
Sbjct: 812 ELYGIGKYGNDSYRIFCVNEWRQVMPSDHKLNDYHNWLLENHKEL 856


>gi|394986373|pdb|4EW4|A Chain A, Mouse Mbd4 Glycosylase Domain In Complex With Dna
           Containing A Ribose Sugar
          Length = 146

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 56/101 (55%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ +     P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG
Sbjct: 43  VLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHG 102

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
           +GKY  D++ IFC  +W +V P DH LN Y ++L      L
Sbjct: 103 IGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLWENHEKL 143


>gi|67971738|dbj|BAE02211.1| unnamed protein product [Macaca fascicularis]
          Length = 498

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 404 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 463

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 464 RIFCVNEWKQVHPEDHKLNKYHDWL 488


>gi|355786370|gb|EHH66553.1| hypothetical protein EGM_03568 [Macaca fascicularis]
          Length = 578

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 484 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 543

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 544 RIFCVNEWKQVHPEDHKLNKYHDWL 568


>gi|431899948|gb|ELK07895.1| Methyl-CpG-binding domain protein 4 [Pteropus alecto]
          Length = 664

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 53/85 (62%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL + RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 570 PSAEVARTADWRDVSELLKPLGLYELRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 629

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 630 RIFCINEWKQVHPEDHKLNKYHDWL 654


>gi|197098838|ref|NP_001127160.1| methyl-CpG-binding domain protein 4 [Pongo abelii]
 gi|55725306|emb|CAH89518.1| hypothetical protein [Pongo abelii]
          Length = 573

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 479 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 538

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 539 RIFCVNEWKQVHPEDHKLNKYHDWL 563


>gi|442796456|ref|NP_001263202.1| methyl-CpG-binding domain protein 4 isoform 5 [Homo sapiens]
 gi|85813718|emb|CAJ55826.1| putative methyl-CpG binding protein [Homo sapiens]
          Length = 262

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 168 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 227

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 228 RIFCVNEWKQVHPEDHKLNKYHDWL 252


>gi|397518616|ref|XP_003829480.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 2 [Pan
           paniscus]
          Length = 580

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 546 RIFCVNEWKQVHPEDHKLNKYHDWL 570


>gi|332817878|ref|XP_003310052.1| PREDICTED: methyl-CpG-binding domain protein 4 [Pan troglodytes]
 gi|410334151|gb|JAA36022.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
          Length = 574

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 480 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 539

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 540 RIFCVNEWKQVHPEDHKLNKYHDWL 564


>gi|114589101|ref|XP_001144095.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 7 [Pan
           troglodytes]
 gi|410211778|gb|JAA03108.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
 gi|410262686|gb|JAA19309.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
 gi|410298898|gb|JAA28049.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
 gi|410334153|gb|JAA36023.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
          Length = 580

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 546 RIFCVNEWKQVHPEDHKLNKYHDWL 570


>gi|397518614|ref|XP_003829479.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 1 [Pan
           paniscus]
          Length = 574

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 480 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 539

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 540 RIFCVNEWKQVHPEDHKLNKYHDWL 564


>gi|332231793|ref|XP_003265079.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 1 [Nomascus
           leucogenys]
          Length = 572

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 478 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 537

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 538 RIFCVNEWKQVHPEDHKLNKYHDWL 562


>gi|332231795|ref|XP_003265080.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 2 [Nomascus
           leucogenys]
          Length = 578

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 484 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 543

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 544 RIFCVNEWKQVHPEDHKLNKYHDWL 568


>gi|4505121|ref|NP_003916.1| methyl-CpG-binding domain protein 4 isoform 1 [Homo sapiens]
 gi|50400819|sp|O95243.1|MBD4_HUMAN RecName: Full=Methyl-CpG-binding domain protein 4; AltName:
           Full=Methyl-CpG-binding endonuclease 1; AltName:
           Full=Methyl-CpG-binding protein MBD4; AltName:
           Full=Mismatch-specific DNA N-glycosylase
 gi|4539759|gb|AAD22195.1|AF114784_1 methyl-CpG binding endonuclease [Homo sapiens]
 gi|19852107|gb|AAM00008.1|AF494057_1 methyl-CpG binding domain protein 4 [Homo sapiens]
 gi|3800809|gb|AAC68879.1| methyl-CpG binding protein MBD4 [Homo sapiens]
 gi|5739194|gb|AAD50374.1| methyl-CpG binding protein 4 [Homo sapiens]
 gi|119599659|gb|EAW79253.1| methyl-CpG binding domain protein 4, isoform CRA_c [Homo sapiens]
 gi|119599660|gb|EAW79254.1| methyl-CpG binding domain protein 4, isoform CRA_c [Homo sapiens]
          Length = 580

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 546 RIFCVNEWKQVHPEDHKLNKYHDWL 570


>gi|442796450|ref|NP_001263199.1| methyl-CpG-binding domain protein 4 isoform 2 [Homo sapiens]
 gi|15079922|gb|AAH11752.1| MBD4 protein [Homo sapiens]
 gi|47496557|emb|CAG29301.1| MBD4 [Homo sapiens]
 gi|119599657|gb|EAW79251.1| methyl-CpG binding domain protein 4, isoform CRA_a [Homo sapiens]
 gi|119599661|gb|EAW79255.1| methyl-CpG binding domain protein 4, isoform CRA_a [Homo sapiens]
 gi|325463337|gb|ADZ15439.1| methyl-CpG binding domain protein 4 [synthetic construct]
          Length = 574

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 480 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 539

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 540 RIFCVNEWKQVHPEDHKLNKYHDWL 564


>gi|426342067|ref|XP_004036337.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 2 [Gorilla
           gorilla gorilla]
          Length = 580

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 546 RIFCVNEWKQVHPEDHKLNKYHDWL 570


>gi|426342065|ref|XP_004036336.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 1 [Gorilla
           gorilla gorilla]
          Length = 574

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 480 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 539

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 540 RIFCVNEWKQVHPEDHKLNKYHDWL 564


>gi|402887074|ref|XP_003906930.1| PREDICTED: methyl-CpG-binding domain protein 4 [Papio anubis]
          Length = 261

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 167 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 226

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 227 RIFCVNEWKQVHPEDHKLNKYHDWL 251


>gi|355564527|gb|EHH21027.1| hypothetical protein EGK_04002 [Macaca mulatta]
          Length = 578

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 484 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 543

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 544 RIFCVNEWKQVHPEDHKLNKYHDWL 568


>gi|343962517|dbj|BAK62846.1| methyl-CpG-binding domain protein 4 [Pan troglodytes]
          Length = 573

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 479 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 538

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 539 RIFCVNEWKQVHPEDHKLNKYHDWL 563


>gi|380817674|gb|AFE80711.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
 gi|380817676|gb|AFE80712.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
 gi|383422567|gb|AFH34497.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
 gi|383422569|gb|AFH34498.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
 gi|384950154|gb|AFI38682.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
          Length = 572

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 478 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 537

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 538 RIFCVNEWKQVHPEDHKLNKYHDWL 562


>gi|395847220|ref|XP_003796279.1| PREDICTED: methyl-CpG-binding domain protein 4 [Otolemur garnettii]
          Length = 574

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  +I +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 480 PSAEIARTADWRDISELLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHGIGKYGNDSY 539

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 540 RIFCINEWKQVHPEDHKLNKYHDWL 564


>gi|400260847|pdb|4E9E|A Chain A, Structure Of The Glycosylase Domain Of Mbd4
 gi|400260848|pdb|4E9F|A Chain A, Stucture Of The Glycosylase Domain Of Mbd4 Bound To Ap
           Site Containing Dna
          Length = 161

 Score = 81.3 bits (199), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 61  PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 120

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 121 RIFCVNEWKQVHPEDHKLNKYHDWL 145


>gi|385252064|pdb|4DK9|A Chain A, Crystal Structure Of Mbd4 Catalytic Domain Bound To Abasic
           Dna
          Length = 157

 Score = 81.3 bits (199), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 63  PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 122

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 123 RIFCVNEWKQVHPEDHKLNKYHDWL 147


>gi|256599915|pdb|3IHO|A Chain A, The C-Terminal Glycosylase Domain Of Human Mbd4
          Length = 138

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 50  PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 109

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 110 RIFCVNEWKQVHPEDHKLNKYHDWL 134


>gi|449474335|ref|XP_002187591.2| PREDICTED: uncharacterized protein LOC100231901 [Taeniopygia
           guttata]
          Length = 420

 Score = 81.3 bits (199), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 53/85 (62%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P  + A   D +E+ +++  LGL + RA  I RFS EYL ++W +  +LHG+GKY  D++
Sbjct: 326 PSPEVARAADWKEMSELLKPLGLYELRAKTIIRFSGEYLSKAWRYPIELHGIGKYGNDSY 385

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W  V+P DH LN Y  +L
Sbjct: 386 RIFCVNEWKEVQPQDHKLNVYHTWL 410


>gi|149049684|gb|EDM02138.1| rCG29635 [Rattus norvegicus]
          Length = 566

 Score = 80.9 bits (198), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 472 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKPWRYPIELHGIGKYGNDSY 531

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 532 RIFCVNEWKQVHPEDHKLNKYHDWL 556


>gi|293346999|ref|XP_002726494.1| PREDICTED: methyl-CpG-binding domain protein 4 [Rattus norvegicus]
 gi|392347546|ref|XP_001059437.3| PREDICTED: methyl-CpG-binding domain protein 4 [Rattus norvegicus]
          Length = 566

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 472 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKPWRYPIELHGIGKYGNDSY 531

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 532 RIFCVNEWKQVHPEDHKLNKYHDWL 556


>gi|426249701|ref|XP_004018588.1| PREDICTED: methyl-CpG-binding domain protein 4 [Ovis aries]
          Length = 554

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 52/85 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 460 PSAEAARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 519

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544


>gi|351711383|gb|EHB14302.1| Methyl-CpG-binding domain protein 4 [Heterocephalus glaber]
          Length = 525

 Score = 80.9 bits (198), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 53/85 (62%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   +  ++ +++  LGL + RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 431 PSAEAARAANWRDVSELLKPLGLYELRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 490

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P DH LN Y ++L
Sbjct: 491 RIFCVNEWKQVHPEDHKLNKYHDWL 515


>gi|326928198|ref|XP_003210268.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Meleagris
           gallopavo]
          Length = 193

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 52/90 (57%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ +     P  + A   D +E+ +++  LGL   RA  I +FS EYL + W +  +LHG
Sbjct: 90  VLWEFLRKYPSPEIARTADWKEMSELLRPLGLYALRAKTIIKFSDEYLNKQWKYPIELHG 149

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
           +GKY  D++ IFC  +W  V+P DH LN Y
Sbjct: 150 IGKYGNDSYRIFCVNEWKEVQPQDHKLNIY 179


>gi|394986367|pdb|4EVV|A Chain A, Mouse Mbd4 Glycosylase Domain In Complex With A G:t
           Mismatch
 gi|394986370|pdb|4EW0|A Chain A, Mouse Mbd4 Glycosylase Domain In Complex With A G:5hmu (5-
           Hydroxymethyluracil) Mismatch
          Length = 146

 Score = 80.1 bits (196), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 56/101 (55%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ +     P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG
Sbjct: 43  VLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHG 102

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
           +GKY  D++ IFC  +W +V P +H LN Y ++L      L
Sbjct: 103 IGKYGNDSYRIFCVNEWKQVHPENHKLNKYHDWLWENHEKL 143


>gi|357497507|ref|XP_003619042.1| Methyl-CpG-binding domain protein [Medicago truncatula]
 gi|355494057|gb|AES75260.1| Methyl-CpG-binding domain protein [Medicago truncatula]
          Length = 96

 Score = 79.7 bits (195), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 34/64 (53%), Positives = 48/64 (75%)

Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
          +V+ + F LCPD +T  +V  EEIE++I TLGLQ KR+  ++  S+EYL E+WT+VT+LH
Sbjct: 5  QVLDNFFKLCPDVETCMQVTMEEIEEVIRTLGLQVKRSGSLQHLSREYLTETWTYVTELH 64

Query: 80 GVGK 83
          GVG 
Sbjct: 65 GVGN 68


>gi|47223201|emb|CAG11336.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 294

 Score = 79.3 bits (194), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 55/94 (58%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+   F   P A+     + + + +++  LGL + RA  + RFS+EYL + W +  +LHG
Sbjct: 192 VLWQFFERYPSAEATRGAEWKPMSELMKPLGLNELRAKALIRFSEEYLTKQWRYPIELHG 251

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY  D++ IFC  +W  V P DHMLN Y  +L
Sbjct: 252 IGKYGNDSYRIFCLEEWREVTPEDHMLNKYHAWL 285


>gi|242008567|ref|XP_002425074.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212508739|gb|EEB12336.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 168

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/72 (47%), Positives = 48/72 (66%)

Query: 50  LGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNY 109
           LGLQKKRA M+K+FS+EYL ++W +  +LHG+GKY  D++ IFC  +W  V+P D  L  
Sbjct: 93  LGLQKKRAFMLKKFSKEYLYKAWKYPIELHGIGKYGNDSYRIFCVNEWKHVQPNDTKLTL 152

Query: 110 YWEFLVSTKGNL 121
           Y  +L     +L
Sbjct: 153 YHNWLQENHNSL 164


>gi|400260851|pdb|4E9G|A Chain A, Structure Of The Glycosylase Domain Of Mbd4 Bound To
           Thymine Containing Dna
 gi|400260854|pdb|4E9H|A Chain A, Structure Of Glycosylase Domain Of Mbd4 Bound To 5hmu
           Containing Dna
          Length = 161

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 51/85 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 61  PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 120

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P  H LN Y ++L
Sbjct: 121 RIFCVNEWKQVHPEAHKLNKYHDWL 145


>gi|449529453|ref|XP_004171714.1| PREDICTED: uncharacterized LOC101211755 [Cucumis sativus]
          Length = 619

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 36/68 (52%), Positives = 46/68 (67%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
           +A  VI  LF+LCP+ K   EV  E+IE II  LG  +KR+  + R S+ YL ESW+HVT
Sbjct: 384 QAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVT 443

Query: 77  QLHGVGKY 84
           QL GVGK+
Sbjct: 444 QLPGVGKF 451


>gi|321459033|gb|EFX70091.1| hypothetical protein DAPPUDRAFT_35863 [Daphnia pulex]
          Length = 144

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 54/86 (62%)

Query: 32  AKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAI 91
           A+ A + D +EI  +++ LGL + RA  I R S++YL   W   + L+G+ KY +D++ I
Sbjct: 52  AEVARQADWKEIADLMNPLGLHELRAKRIVRMSEDYLKGDWVKPSDLYGISKYGSDSYRI 111

Query: 92  FCTGKWDRVRPTDHMLNYYWEFLVST 117
           FC G+W  VRPTD ML  Y ++L + 
Sbjct: 112 FCLGEWKEVRPTDIMLKIYQDWLFTN 137


>gi|327280386|ref|XP_003224933.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Anolis
           carolinensis]
          Length = 530

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 54/94 (57%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           V+ + F   P    A   D + + +++  LGL + RA  I RFS E+L + W +  +LHG
Sbjct: 424 VLWEFFKKYPSPSVARTADWKGMSELLKPLGLYELRAKTIIRFSDEFLVKRWKYPIELHG 483

Query: 81  VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           +GKY  D++ IFC  +W  V+P DH LN Y  +L
Sbjct: 484 IGKYGNDSYRIFCVNEWKEVQPKDHKLNIYHAWL 517


>gi|400260857|pdb|4EA4|A Chain A, Structure Of The Glycosylase Domain Of Mbd4 Bound To
           5hmu-Containing Dna
          Length = 155

 Score = 78.2 bits (191), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 51/85 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 61  PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 120

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P  H LN Y ++L
Sbjct: 121 RIFCVNEWKQVHPEAHKLNKYHDWL 145


>gi|400260860|pdb|4EA5|A Chain A, Structure Of The Glycoslyase Domain Of Mbd4 Bound To A
           5hmu Containing Dna
          Length = 160

 Score = 78.2 bits (191), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 51/85 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 61  PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 120

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFL 114
            IFC  +W +V P  H LN Y ++L
Sbjct: 121 RIFCVNEWKQVHPEAHKLNKYHDWL 145


>gi|221104829|ref|XP_002167512.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Hydra
           magnipapillata]
          Length = 160

 Score = 77.0 bits (188), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 50/81 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P      E + EEI  ++  LGL  +RA  I +FS E+L + W +  +L+G+GKY  D++
Sbjct: 67  PTPNDVNENNFEEISALMKPLGLNYRRAKNIIQFSFEFLHKDWKYPRELYGIGKYGDDSY 126

Query: 90  AIFCTGKWDRVRPTDHMLNYY 110
            +FC G++D V PTD+ LN Y
Sbjct: 127 RMFCLGQFDNVHPTDNKLNLY 147


>gi|241631815|ref|XP_002410284.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215503366|gb|EEC12860.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 686

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 30/79 (37%), Positives = 50/79 (63%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
           +I     L P A++  +   ++I +++  LGL  KRA +I+RFS+EYL   W + ++LHG
Sbjct: 575 LIWKFLGLYPTAESVLQAPIQDIAELLQPLGLHNKRAAIIRRFSEEYLTRDWRYPSELHG 634

Query: 81  VGKYAADAFAIFCTGKWDR 99
           +GKY  D++ IFC  +W +
Sbjct: 635 IGKYGNDSYRIFCVKEWAK 653


>gi|297263977|ref|XP_001116090.2| PREDICTED: methyl-CpG-binding domain protein 4-like, partial
           [Macaca mulatta]
          Length = 544

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 46/74 (62%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 458 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 517

Query: 90  AIFCTGKWDRVRPT 103
            IFC  +W +VRPT
Sbjct: 518 RIFCVNEWKQVRPT 531


>gi|328712829|ref|XP_003244914.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Acyrthosiphon
           pisum]
          Length = 206

 Score = 72.8 bits (177), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 56/92 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P  +  T+ D +++   +  LGL+  R   IKRF+ ++L ++W +  +L+G+GKY  D++
Sbjct: 113 PTPEDVTKADPQQLLSSVENLGLENTRVKTIKRFTADFLLKNWKYPIELYGIGKYGNDSY 172

Query: 90  AIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
            +FC  +W +V+P D +L+ Y ++L   +  L
Sbjct: 173 RLFCVNEWRKVQPDDILLSKYKDWLTLNEKQL 204


>gi|196003048|ref|XP_002111391.1| hypothetical protein TRIADDRAFT_5883 [Trichoplax adhaerens]
 gi|190585290|gb|EDV25358.1| hypothetical protein TRIADDRAFT_5883, partial [Trichoplax
           adhaerens]
          Length = 146

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 56/100 (56%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
           KA  ++   F   P  + A+  + +E+ +  S L +Q  RA +I +FS EY+ + W +  
Sbjct: 44  KAIPLLWKFFGHYPTPEIASNANWKEMAENFSALDMQHDRAKIIIKFSDEYIKKDWCYPC 103

Query: 77  QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVS 116
           +L G+GKY  D++ IFC  +W +V P D+ L  Y ++L  
Sbjct: 104 ELFGIGKYGNDSYRIFCVNEWKKVFPRDYNLVRYHKWLCQ 143


>gi|194381152|dbj|BAG64144.1| unnamed protein product [Homo sapiens]
          Length = 572

 Score = 68.2 bits (165), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 45/74 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545

Query: 90  AIFCTGKWDRVRPT 103
            IFC  +W +VR T
Sbjct: 546 RIFCVNEWKQVRLT 559


>gi|442796452|ref|NP_001263200.1| methyl-CpG-binding domain protein 4 isoform 3 [Homo sapiens]
          Length = 572

 Score = 68.2 bits (165), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 45/74 (60%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545

Query: 90  AIFCTGKWDRVRPT 103
            IFC  +W +VR T
Sbjct: 546 RIFCVNEWKQVRLT 559


>gi|390350844|ref|XP_001202024.2| PREDICTED: methyl-CpG-binding domain protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 188

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 45/84 (53%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
           KA  V+   F      +     D + I  +I  LGL  KRA M+ +FS E+L + WT+  
Sbjct: 105 KAIPVLWQFFQTWDTPEKTRSADWQSIADLIQPLGLHTKRAKMLIQFSDEFLTKDWTYPI 164

Query: 77  QLHGVGKYAADAFAIFCTGKWDRV 100
           +L G+GKY  D++ IFC  +W  V
Sbjct: 165 ELSGIGKYGNDSYRIFCVNEWKEV 188


>gi|432110910|gb|ELK34384.1| Methyl-CpG-binding domain protein 4 [Myotis davidii]
          Length = 1112

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 42/68 (61%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P A+ A   D  ++ +++  +GL + RA  I +FS EYL + W +  +LHG+GKY  D++
Sbjct: 489 PSAEVARAADWRDVSELLKPIGLYELRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 548

Query: 90  AIFCTGKW 97
            IFC  +W
Sbjct: 549 RIFCINEW 556


>gi|301779644|ref|XP_002925241.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-binding domain protein
           4-like [Ailuropoda melanoleuca]
          Length = 642

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 1/86 (1%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P AK A   D  ++ + +  LGL    A    +FS EYL + W +  ++ G+G+Y  D+ 
Sbjct: 528 PSAKVARTPDRRDVSEPLRPLGLYDLGAKTTVKFSDEYLTKQWRYPIEVXGMGRYGNDSX 587

Query: 90  AI-FCTGKWDRVRPTDHMLNYYWEFL 114
            + FC  +W +V P DH LN + ++L
Sbjct: 588 RVFFCINEWKQVHPEDHKLNKHQDWL 613


>gi|397572920|gb|EJK48465.1| hypothetical protein THAOC_32731, partial [Thalassiosira oceanica]
          Length = 284

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 4/83 (4%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWT--HVTQ 77
           RVISD F   P        D  ++  ++  LGL ++R   +KRF+ ++L   WT   V++
Sbjct: 148 RVISDFFEAYPSPTAVINGDTNDMATMLHPLGLNRERT--MKRFAIDFLSPEWTDGRVSR 205

Query: 78  LHGVGKYAADAFAIFCTGKWDRV 100
           LHG G +A +++ +FC G++ +V
Sbjct: 206 LHGCGNFAQNSWDVFCHGRYQQV 228


>gi|303284263|ref|XP_003061422.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456752|gb|EEH54052.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 164

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/130 (30%), Positives = 58/130 (44%), Gaps = 28/130 (21%)

Query: 16  LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL------G 69
           L+  R +  LF L P  + A     + IE +++ LGL +KRA    R S++YL      G
Sbjct: 35  LQVDRALWRLFLLAPTPEEARRTGLDAIEDVLAPLGLHRKRARAFVRLSEDYLAATRPRG 94

Query: 70  ESW---------------------THVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLN 108
           E+                           LHGVG YA+DA A+FC G    V P DH L 
Sbjct: 95  EALEANGGGGGGDGGLFSTRTKLAAPAASLHGVGAYASDAHAMFCDGLMG-VAPRDHALR 153

Query: 109 YYWEFLVSTK 118
           +++ + V  +
Sbjct: 154 WWYAWAVERR 163


>gi|357631679|gb|EHJ79148.1| putative methyl-CpG binding domain protein 4-like protein [Danaus
           plexippus]
          Length = 196

 Score = 59.3 bits (142), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 46/97 (47%), Gaps = 2/97 (2%)

Query: 18  AGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQ 77
           A   I   FT  P      +     +E+    LGL KKR  MI + S +++   W   + 
Sbjct: 67  ARPYIKSFFTDYPTPYQVLDETPSSLERFFENLGL-KKRGNMIWKLSYQFVSGKWRRASD 125

Query: 78  LHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
           L G+GKY  DA+ IFC G  D V P D  L  Y ++L
Sbjct: 126 LCGIGKYGEDAYRIFCLGHTD-VNPDDRYLKLYLDWL 161


>gi|308809063|ref|XP_003081841.1| unnamed protein product [Ostreococcus tauri]
 gi|116060308|emb|CAL55644.1| unnamed protein product [Ostreococcus tauri]
          Length = 757

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 39/115 (33%), Positives = 56/115 (48%), Gaps = 12/115 (10%)

Query: 16  LKAGRVISDLFTLCPDAKTATEVDA-----EEIEKIISTLGLQKKRAPMIKRFSQEYLGE 70
           L+  RV+  LF L P A  A  + +     E +E++I+ LGL +KR     R S++   +
Sbjct: 639 LQVDRVLWRLFLLAPTAADAIALGSTPEGLEALERVIAPLGLHRKRTDAFVRLSKDVEKQ 698

Query: 71  SWTH------VTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKG 119
              H      V+  HGVG YAADA A+F  G      P DH L +Y  +    +G
Sbjct: 699 RAAHGGRVKNVSACHGVGVYAADAHALFVDGVLAG-PPRDHALRWYHAWATERRG 752


>gi|145351789|ref|XP_001420245.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580479|gb|ABO98538.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 994

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 12/106 (11%)

Query: 16  LKAGRVISDLFTLCPDAKTATEVD-----AEEIEKIISTLGLQKKRAPMIKRFSQEY--- 67
           L+  RV+  LF L P A  A E+       E +E+II+ LGL +KR     + S++    
Sbjct: 871 LQVDRVLWRLFLLAPTAAAAVELGDTHEGLEALERIIAPLGLHRKRTNAFVKLSRDVEAQ 930

Query: 68  ---LGESWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
               G    +V+  HGVG YAADA A+F  G      P DH L +Y
Sbjct: 931 RAKYGGRIKNVSACHGVGVYAADAHALFVDGVLAG-PPRDHALRWY 975


>gi|299470461|emb|CBN78453.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 312

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
            +   F  CP   +    D  ++ K +  LGL ++   +IKRF++ +L  +W   ++LHG
Sbjct: 192 AVKAFFEACPTPTSVLAADLGDLRKTLLPLGLNREL--IIKRFAEGFLASAWHDPSELHG 249

Query: 81  VGKYAADAFAIFCTGK 96
            G++A+D++ IFC G 
Sbjct: 250 CGRFASDSWQIFCLGN 265


>gi|384247134|gb|EIE20622.1| hypothetical protein COCSUDRAFT_83540 [Coccomyxa subellipsoidea
          C-169]
          Length = 116

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 34/59 (57%)

Query: 37 EVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTG 95
          E   EE+E  I  LGLQ+ R   +K+ SQ +  + W    + +G GK+ +D++ IFC G
Sbjct: 18 EAPDEELESFIDPLGLQETRIKAVKQMSQAFFEKEWEDPVEFYGCGKFTSDSWRIFCRG 76


>gi|440804280|gb|ELR25157.1| base excision DNA repair protein, HhHGPD subfamily protein
           [Acanthamoeba castellanii str. Neff]
          Length = 229

 Score = 51.6 bits (122), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 53/113 (46%), Gaps = 14/113 (12%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL-GESWTHVTQLH 79
           V+ DLF   P  +   E D EE+  +I  LGL +KRA  +  FS+ YL    W    +L 
Sbjct: 27  VVGDLFDRYPSPEAMAEADEEELAGMIRRLGLYRKRAKALVDFSRTYLDSPPWRTPAELK 86

Query: 80  GVGKYAADAFAIFCTGK-------------WDRVRPTDHMLNYYWEFLVSTKG 119
             GKYA D++ +FC G                RVRP D  L  Y ++L +  G
Sbjct: 87  YFGKYADDSYRLFCLGGTSLSLWLTPLLLCSSRVRPRDKKLKQYRDWLFAHYG 139


>gi|390600027|gb|EIN09422.1| DNA glycosylase [Punctularia strigosozonata HHB-11173 SS5]
          Length = 173

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 52/122 (42%), Gaps = 20/122 (16%)

Query: 18  AGRVISDLF----TLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGES-- 71
           AGR    +F       P  +  +  D  E+  +I TLGLQ  RA  + + S+ YL     
Sbjct: 27  AGRTAIPIFWKILDRWPSPQALSNADVTELTHMIKTLGLQSMRAKRLIQLSEMYLAAPPD 86

Query: 72  ------------WTHVTQLHGVGKYAADAFAIFCTG--KWDRVRPTDHMLNYYWEFLVST 117
                        T V+ L G G+YA D++ IFC     W  V PTD  L  Y  +  +T
Sbjct: 87  PSILHKSRNYPVATPVSHLPGSGRYALDSYRIFCNPGIDWMNVMPTDKELIKYIRWKWAT 146

Query: 118 KG 119
            G
Sbjct: 147 VG 148


>gi|358056969|dbj|GAA97128.1| hypothetical protein E5Q_03803 [Mixia osmundae IAM 14324]
          Length = 257

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 22/121 (18%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTH------ 74
           V+  +    P A      D +E+ ++I  +GL  +RA M+ RFSQ Y+ E  T       
Sbjct: 101 VLYQILERWPTAHALAAADLQELTQVIYPIGLYNRRARMLIRFSQTYVDEPPTEGILSCN 160

Query: 75  ----------VTQLHGVGKYAADAFAIFCTG------KWDRVRPTDHMLNYYWEFLVSTK 118
                     ++   GVG+YA D++ IF          W  V P D  L  Y E+  + +
Sbjct: 161 KSDRAYVPTMISHYPGVGRYALDSYRIFIENANSSEPSWRSVLPLDKELKKYLEWRWALE 220

Query: 119 G 119
           G
Sbjct: 221 G 221


>gi|397614200|gb|EJK62655.1| hypothetical protein THAOC_16723 [Thalassiosira oceanica]
          Length = 547

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 9/103 (8%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEY---------LGE 70
           R +   F   PDA  A + D EE+  I+S LGL  KR+  + RFS+E+            
Sbjct: 407 RSLFHFFQQWPDAVAAAKADWEEVLPIVSQLGLGNKRSQALVRFSREFKELADENDEFSL 466

Query: 71  SWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEF 113
           S   V  L+ +G+Y   A+ IF           DH L+ Y E+
Sbjct: 467 SERQVKSLYFIGEYGWSAYRIFILKDLPLTGVKDHALSLYVEY 509


>gi|255086739|ref|XP_002509336.1| predicted protein [Micromonas sp. RCC299]
 gi|226524614|gb|ACO70594.1| predicted protein [Micromonas sp. RCC299]
          Length = 175

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 58/151 (38%), Gaps = 54/151 (35%)

Query: 16  LKAGRVISDLFTLCPDAKTATEVDA---------------EEIEKIISTLGLQKKRAPMI 60
           L+  RV+  LF L P  + A  + A               + IE+I+  LGL +KRA  +
Sbjct: 9   LQVDRVLWRLFLLVPTPEDAVALGALRDGGNDGFDGKSGLDRIEEILRPLGLHRKRARAL 68

Query: 61  KRFSQEYLGESWT--------------------------------------HVTQLHGVG 82
            + S++Y+                                            V  LHGVG
Sbjct: 69  VKLSEDYVAARGAAPLPGPPGATGSDDDDDDGGGGRGSLGGFFGSSRRLSAPVASLHGVG 128

Query: 83  KYAADAFAIFCTGKWDRVRPTDHMLNYYWEF 113
            YA+DA A+FC G    V P DH L +++ +
Sbjct: 129 AYASDAHALFCEGTLG-VAPRDHALRWWYAW 158


>gi|444705535|gb|ELW46957.1| hypothetical protein TREES_T100019830 [Tupaia chinensis]
          Length = 467

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 29/45 (64%), Gaps = 2/45 (4%)

Query: 57  APMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTGKWDRVR 101
           AP++   + EYL + W +  +LHG+GKY  D++ IFC  +W + R
Sbjct: 17  APLVT--ANEYLTKQWRYPIELHGIGKYGNDSYRIFCVNEWKQKR 59


>gi|325263222|ref|ZP_08129957.1| endonuclease III [Clostridium sp. D5]
 gi|324031615|gb|EGB92895.1| endonuclease III [Clostridium sp. D5]
          Length = 212

 Score = 46.2 bits (108), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     E D EEIE+I+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEDLYAKYPDVDALAEADVEEIERIVRPCGLGKSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   +EY G   +++  + +L GVG+ +A+
Sbjct: 94  MKILKEEYEGGIPKTFNELMKLPGVGRKSAN 124


>gi|332287845|ref|YP_004422746.1| putative DNA repair protein [Chlamydophila psittaci 6BC]
 gi|384451012|ref|YP_005663612.1| endonuclease III [Chlamydophila psittaci 6BC]
 gi|325507289|gb|ADZ18927.1| putative DNA repair protein [Chlamydophila psittaci 6BC]
 gi|328915106|gb|AEB55939.1| endonuclease III [Chlamydophila psittaci 6BC]
          Length = 227

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 14/92 (15%)

Query: 12  IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +AILL           V   LF+L PDA+T  ++  E +  +IS  GL +++A  +   +
Sbjct: 37  VAILLSGNSTDKAVNAVTPRLFSLAPDAQTLVQLPLENLYSLISPCGLGRRKAAYLHNLA 96

Query: 65  ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
               ++Y GE   S   +TQL GVG+  A  F
Sbjct: 97  KILLEKYTGEPPASLELLTQLPGVGRKTASVF 128


>gi|329943266|ref|ZP_08292040.1| hhH-GPD superbase excision DNA repair family protein [Chlamydophila
           psittaci Cal10]
 gi|384454931|ref|YP_005667526.1| putative DNA repair protein [Chlamydophila psittaci 02DC15]
 gi|392377074|ref|YP_004064852.1| putative DNA repair protein [Chlamydophila psittaci RD1]
 gi|407454501|ref|YP_006733609.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci 84/55]
 gi|407455769|ref|YP_006734660.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci GR9]
 gi|407458507|ref|YP_006736812.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci WS/RT/E30]
 gi|313848417|emb|CBY17421.1| putative DNA repair protein [Chlamydophila psittaci RD1]
 gi|328814813|gb|EGF84803.1| hhH-GPD superbase excision DNA repair family protein [Chlamydophila
           psittaci Cal10]
 gi|334694688|gb|AEG87905.1| putative DNA repair protein [Chlamydophila psittaci 02DC15]
 gi|405781260|gb|AFS20010.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci 84/55]
 gi|405782312|gb|AFS21061.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci GR9]
 gi|405785443|gb|AFS24189.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci WS/RT/E30]
          Length = 219

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 14/92 (15%)

Query: 12  IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +AILL           V   LF+L PDA+T  ++  E +  +IS  GL +++A  +   +
Sbjct: 37  VAILLSGNSTDKAVNAVTPRLFSLAPDAQTLVQLPLENLYSLISPCGLGRRKAAYLHNLA 96

Query: 65  ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
               ++Y GE   S   +TQL GVG+  A  F
Sbjct: 97  KILLEKYTGEPPASLELLTQLPGVGRKTASVF 128


>gi|384451999|ref|YP_005664597.1| putative DNA repair protein [Chlamydophila psittaci 01DC11]
 gi|384452973|ref|YP_005665570.1| putative DNA repair protein [Chlamydophila psittaci 08DC60]
 gi|384453952|ref|YP_005666548.1| putative DNA repair protein [Chlamydophila psittaci C19/98]
 gi|406592825|ref|YP_006740005.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci CP3]
 gi|406594366|ref|YP_006742094.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci MN]
 gi|407457187|ref|YP_006735760.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci VS225]
 gi|407461123|ref|YP_006738898.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci WC]
 gi|410858857|ref|YP_006974797.1| putative DNA repair protein [Chlamydia psittaci 01DC12]
 gi|449071573|ref|YP_007438653.1| endonuclease III [Chlamydophila psittaci Mat116]
 gi|334692733|gb|AEG85952.1| putative DNA repair protein [Chlamydophila psittaci C19/98]
 gi|334693709|gb|AEG86927.1| putative DNA repair protein [Chlamydophila psittaci 01DC11]
 gi|334695662|gb|AEG88878.1| putative DNA repair protein [Chlamydophila psittaci 08DC60]
 gi|405782798|gb|AFS21546.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci MN]
 gi|405784448|gb|AFS23195.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci VS225]
 gi|405787270|gb|AFS26014.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci WC]
 gi|405788697|gb|AFS27440.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci CP3]
 gi|410811752|emb|CCO02407.1| putative DNA repair protein [Chlamydia psittaci 01DC12]
 gi|449040081|gb|AGE75505.1| endonuclease III [Chlamydophila psittaci Mat116]
          Length = 221

 Score = 45.1 bits (105), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 14/92 (15%)

Query: 12  IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +AILL           V   LF+L PDA+T  ++  E +  +IS  GL +++A  +   +
Sbjct: 37  VAILLSGNSTDKAVNAVTPRLFSLAPDAQTLVQLPLENLYSLISPCGLGRRKAAYLHNLA 96

Query: 65  ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
               ++Y GE   S   +TQL GVG+  A  F
Sbjct: 97  KILLEKYTGEPPASLELLTQLPGVGRKTASVF 128


>gi|406593881|ref|YP_006741060.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci NJ1]
 gi|407459750|ref|YP_006737853.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci M56]
 gi|405786532|gb|AFS25277.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci M56]
 gi|405789753|gb|AFS28495.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
           psittaci NJ1]
          Length = 221

 Score = 45.1 bits (105), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 14/92 (15%)

Query: 12  IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +AILL           V   LF+L PDA+T  ++  E +  +IS  GL +++A  +   +
Sbjct: 37  VAILLSGNSTDKAVNAVTPRLFSLAPDAQTLAQLPLENLYSLISPCGLGRRKAAYLHNLA 96

Query: 65  ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
               ++Y GE   S   +TQL GVG+  A  F
Sbjct: 97  KILLEKYTGEPPASLELLTQLPGVGRKTASVF 128


>gi|412989966|emb|CCO20608.1| predicted protein [Bathycoccus prasinos]
          Length = 952

 Score = 44.7 bits (104), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 75  VTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLV 115
           V++ HGVG YAADA A+FC G      P DH L +Y+ F +
Sbjct: 884 VSEFHGVGAYAADAHAMFCDGII-AAPPRDHALRWYYAFAL 923



 Score = 38.9 bits (89), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 5/58 (8%)

Query: 16  LKAGRVISDLFTLCPDAKTA-----TEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL 68
           L+  RV+ +LFTL P A+ A     TE   E IEK ++T+GL +KRA    R S++ +
Sbjct: 748 LQVDRVLWELFTLAPTAEDAVKLGETERGIELIEKCLATIGLHRKRARAFVRLSKDVV 805


>gi|332231797|ref|XP_003265081.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 3 [Nomascus
           leucogenys]
          Length = 538

 Score = 44.3 bits (103), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+G
Sbjct: 484 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 536


>gi|426342069|ref|XP_004036338.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 3 [Gorilla
           gorilla gorilla]
          Length = 540

 Score = 43.9 bits (102), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+G
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 538


>gi|114589107|ref|XP_001143803.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 3 [Pan
           troglodytes]
          Length = 540

 Score = 43.9 bits (102), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+G
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 538


>gi|397518618|ref|XP_003829481.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 3 [Pan
           paniscus]
          Length = 540

 Score = 43.9 bits (102), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+G
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 538


>gi|442796454|ref|NP_001263201.1| methyl-CpG-binding domain protein 4 isoform 4 [Homo sapiens]
 gi|33150919|gb|AAP97338.1| methyl-CpG binding domain protein 4 variant [Homo sapiens]
 gi|119599658|gb|EAW79252.1| methyl-CpG binding domain protein 4, isoform CRA_b [Homo sapiens]
          Length = 540

 Score = 43.9 bits (102), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
           P A+ A   D  ++ +++  LGL   RA  I +FS EYL + W +  +LHG+G
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 538


>gi|62185498|ref|YP_220283.1| DNA repair protein [Chlamydophila abortus S26/3]
 gi|62148565|emb|CAH64337.1| putative DNA repair protein [Chlamydophila abortus S26/3]
          Length = 219

 Score = 43.9 bits (102), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 7/76 (9%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ----EYLGE---SWT 73
           V   LF+L PDA+   ++  E++  IIS  GL +++A  +   +Q    +Y GE   S  
Sbjct: 53  VTPRLFSLAPDAQALVQLPLEDLYFIISPCGLGRRKAAYLHHLAQILLEKYHGEPPASLE 112

Query: 74  HVTQLHGVGKYAADAF 89
            +TQL GVG+  A  F
Sbjct: 113 LLTQLPGVGRKTASVF 128


>gi|302674673|ref|XP_003027021.1| hypothetical protein SCHCODRAFT_61605 [Schizophyllum commune H4-8]
 gi|300100706|gb|EFI92118.1| hypothetical protein SCHCODRAFT_61605 [Schizophyllum commune H4-8]
          Length = 223

 Score = 43.5 bits (101), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 3/38 (7%)

Query: 73  THVTQLHGVGKYAADAFAIFCTG---KWDRVRPTDHML 107
           T ++ L GVG+YA D+F  FCTG   +W  VRPTD  L
Sbjct: 136 TPISHLPGVGRYALDSFRTFCTGGSDEWKSVRPTDKEL 173


>gi|392584695|gb|EIW74039.1| DNA glycosylase [Coniophora puteana RWD-64-598 SS2]
          Length = 219

 Score = 43.5 bits (101), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 50/140 (35%), Gaps = 59/140 (42%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL--------------------- 68
           P  K   E D +E+  +I  LGLQ  RA  +K  S+ Y                      
Sbjct: 40  PTPKDLMEADTDELVAMIQKLGLQSTRAIRLKELSRAYHEHPPRTGTLQLEHGKFARAPA 99

Query: 69  -------------------------GESW----THVTQLHGVGKYAADAFAIFCT----- 94
                                    GE+W    T ++ L GVG+Y  D++ IFCT     
Sbjct: 100 SLRERYPPELPSTCDSEADVSLALGGEAWPYPKTPISHLPGVGRYGWDSYRIFCTDNVTG 159

Query: 95  ----GKWDRVRPTDHMLNYY 110
                +W RV+P+D  L  Y
Sbjct: 160 TGSSDEWKRVQPSDKELCRY 179


>gi|159486950|ref|XP_001701499.1| DNA repair glycosylase [Chlamydomonas reinhardtii]
 gi|158271560|gb|EDO97376.1| DNA repair glycosylase [Chlamydomonas reinhardtii]
          Length = 292

 Score = 43.5 bits (101), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRF 63
           + ++ L   +  +K   V  +LF   PDA+   +++A EIE II  LGL   +A  ++  
Sbjct: 107 LVAVMLSAQSTDVKVNTVTPELFRRGPDAEAMAKLEASEIEGIIRVLGLAPTKAKNVRAM 166

Query: 64  SQ----EYLGE---SWTHVTQLHGVGKYAAD 87
           SQ    +Y G+   SW  +  L GVG   A 
Sbjct: 167 SQILVEQYDGQVPGSWEGLEALPGVGHKTAS 197


>gi|260587725|ref|ZP_05853638.1| endonuclease III [Blautia hansenii DSM 20583]
 gi|260541990|gb|EEX22559.1| endonuclease III [Blautia hansenii DSM 20583]
          Length = 211

 Score = 43.1 bits (100), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     E   EE+EKI+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEDLYAKYPDVNALAEAPVEEVEKIVRPCGLGKSKARDICAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   +EY G   + +  + +L GVG+ +A+
Sbjct: 94  MKILKEEYQGKVPDDFQALLKLPGVGRKSAN 124


>gi|331083986|ref|ZP_08333093.1| hypothetical protein HMPREF0992_02017 [Lachnospiraceae bacterium
           6_1_63FAA]
 gi|330402348|gb|EGG81918.1| hypothetical protein HMPREF0992_02017 [Lachnospiraceae bacterium
           6_1_63FAA]
          Length = 211

 Score = 43.1 bits (100), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     E   EE+EKI+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEDLYVKYPDVNALAEAPVEEVEKIVRPCGLGKSKARDICAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   +EY G   + +  + +L GVG+ +A+
Sbjct: 94  MKILKEEYQGKVPDDFQALLKLPGVGRKSAN 124


>gi|302850094|ref|XP_002956575.1| hypothetical protein VOLCADRAFT_33476 [Volvox carteri f.
           nagariensis]
 gi|300258102|gb|EFJ42342.1| hypothetical protein VOLCADRAFT_33476 [Volvox carteri f.
           nagariensis]
          Length = 198

 Score = 42.7 bits (99), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 45/105 (42%), Gaps = 12/105 (11%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ---EYLG---- 69
           K   V  +LF   PDA     +   EIE+II  LGL   +A  ++R SQ   E  G    
Sbjct: 44  KVNTVTPELFARGPDAMAMAALKVSEIERIIRVLGLAPTKARNVQRLSQMLVELYGGQVP 103

Query: 70  ESWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTD---HMLNYYW 111
           +S++ + +L GVG   A    + C        P D   H L   W
Sbjct: 104 DSFSALEELPGVGHKTAS--VVMCQAFSHPAFPVDTHIHRLAQRW 146


>gi|395323248|gb|EJF55730.1| hypothetical protein DICSQDRAFT_73043 [Dichomitus squalens LYAD-421
           SS1]
          Length = 283

 Score = 42.7 bits (99), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 73  THVTQLHGVGKYAADAFAIFCTG--KWDRVRPTDHMLNYYWEF 113
           T ++ + G G YA D++ IFCTG  +W  VRPTD  L  Y ++
Sbjct: 202 TPISHIPGCGPYALDSYRIFCTGGDEWRSVRPTDKELVKYLQW 244


>gi|347533057|ref|YP_004839820.1| DNA-(apurinic or apyrimidinic site) lyase [Roseburia hominis
           A2-183]
 gi|345503205|gb|AEN97888.1| DNA-(apurinic or apyrimidinic site) lyase [Roseburia hominis
           A2-183]
          Length = 211

 Score = 42.7 bits (99), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA------ 57
           + S+RL       +   V+ DL+   PD     + D EEIE+I+   GL   +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVQDLYREFPDVDALADADVEEIERIVKPCGLGHSKARDISAC 93

Query: 58  -PMIKRFSQEYLGESWTHVTQLHGVGKYAAD 87
             M+K   Q  + E +  + +L GVG+ +A+
Sbjct: 94  MKMLKEKYQGKVPEDFDALLELPGVGRKSAN 124


>gi|424825542|ref|ZP_18250529.1| putative DNA repair protein [Chlamydophila abortus LLG]
 gi|333410641|gb|EGK69628.1| putative DNA repair protein [Chlamydophila abortus LLG]
          Length = 219

 Score = 42.4 bits (98), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 7/76 (9%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ----EYLGE---SWT 73
           V   LF+L PDA+   ++  E++  IIS  GL +++   +   +Q    +Y GE   S  
Sbjct: 53  VTPRLFSLAPDAQALVQLPLEDLYFIISPCGLGRRKVAYLHHLAQILLEKYHGEPPASLE 112

Query: 74  HVTQLHGVGKYAADAF 89
            +TQL GVG+  A  F
Sbjct: 113 LLTQLPGVGRKTASVF 128


>gi|255280265|ref|ZP_05344820.1| endonuclease III [Bryantella formatexigens DSM 14469]
 gi|255269356|gb|EET62561.1| putative endonuclease III [Marvinbryantia formatexigens DSM 14469]
          Length = 239

 Score = 42.4 bits (98), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V++DLF   P  K   E D E+IE+I+   GL   +A      
Sbjct: 61  LVSVRLAAQCTDARVNIVVADLFVKYPGVKELAEADVEDIERIVKPCGLGHSKARDISAC 120

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   E +  + +L GVG+ +A+
Sbjct: 121 MKMLQEQYDGKIPEDFDALLKLPGVGRKSAN 151


>gi|291562481|emb|CBL41297.1| Predicted EndoIII-related endonuclease [butyrate-producing
           bacterium SS3/4]
          Length = 217

 Score = 42.0 bits (97), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD K   E D +EIE+I+   GL + +A      
Sbjct: 39  LVSVRLAAQCTDARVNVVVQDLYKKFPDVKALAEADVDEIEEIVRPCGLGRSKARDISAC 98

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   + +  + +L GVG+ +A+
Sbjct: 99  MKILYEQYHGNVPDDFDALLKLPGVGRKSAN 129


>gi|374673082|dbj|BAL50973.1| endonuclease III [Lactococcus lactis subsp. lactis IO-1]
          Length = 218

 Score = 42.0 bits (97), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 28/41 (68%)

Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
          LF   PDA+T ++ + EEIEK+I T+GL K +A  I R SQ
Sbjct: 55 LFAAFPDAQTMSQANVEEIEKLIRTIGLYKTKAKNILRTSQ 95


>gi|167761558|ref|ZP_02433685.1| hypothetical protein CLOSCI_03969 [Clostridium scindens ATCC 35704]
 gi|336422654|ref|ZP_08602797.1| hypothetical protein HMPREF0993_02174 [Lachnospiraceae bacterium
           5_1_57FAA]
 gi|167661224|gb|EDS05354.1| putative endonuclease III [Clostridium scindens ATCC 35704]
 gi|336007827|gb|EGN37848.1| hypothetical protein HMPREF0993_02174 [Lachnospiraceae bacterium
           5_1_57FAA]
          Length = 215

 Score = 41.6 bits (96), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD +     D E+IE+I+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEDLYARYPDVEALAAADVEDIERIVRPCGLGKSKARDISAC 93

Query: 60  IKRFSQEYLGE---SWTHVTQLHGVGKYAAD 87
           +K   +EY G+    +  + +L GVG+ +A+
Sbjct: 94  MKILKEEYDGKIPRDFNALLKLPGVGRKSAN 124


>gi|145357325|ref|XP_001422870.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144583114|gb|ABP01229.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 156

 Score = 41.6 bits (96), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 7/95 (7%)

Query: 22  ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT----- 76
           IS+ F   P    A   D++++ +II  LGL   R   I   S ++L E+          
Sbjct: 47  ISNFFAKFPTPSAALAADSQDVFEIIKRLGLFPGRMRTIVEVSTKFLLETGEFEVGLEPE 106

Query: 77  -QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
            +++G+G++  D+F IF  G   R +P D  L  +
Sbjct: 107 LKIYGIGEFGVDSFEIFARGDVRR-KPKDCNLQSF 140


>gi|15673053|ref|NP_267227.1| endonuclease III [Lactococcus lactis subsp. lactis Il1403]
 gi|418037871|ref|ZP_12676231.1| DNA-(apurinic or apyrimidinic site) lyase [Lactococcus lactis
          subsp. cremoris CNCM I-1631]
 gi|12724026|gb|AAK05169.1|AE006340_2 endonuclease III [Lactococcus lactis subsp. lactis Il1403]
 gi|354693995|gb|EHE93702.1| DNA-(apurinic or apyrimidinic site) lyase [Lactococcus lactis
          subsp. cremoris CNCM I-1631]
          Length = 218

 Score = 40.8 bits (94), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 27/41 (65%)

Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
          LF   PDA+T ++   EEIEK+I T+GL K +A  I R SQ
Sbjct: 55 LFATFPDAQTMSQAKVEEIEKLIRTIGLYKTKAKNILRTSQ 95


>gi|385830614|ref|YP_005868427.1| endonuclease III [Lactococcus lactis subsp. lactis CV56]
 gi|326406622|gb|ADZ63693.1| endonuclease III [Lactococcus lactis subsp. lactis CV56]
          Length = 218

 Score = 40.8 bits (94), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 27/41 (65%)

Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
          LF   PDA+T ++   EEIEK+I T+GL K +A  I R SQ
Sbjct: 55 LFATFPDAQTMSQAKVEEIEKLIRTIGLYKTKAKNILRTSQ 95


>gi|254468543|ref|ZP_05081949.1| endonuclease III [beta proteobacterium KB13]
 gi|207087353|gb|EDZ64636.1| endonuclease III [beta proteobacterium KB13]
          Length = 212

 Score = 40.8 bits (94), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 14/92 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        RV + LF + PD    +++  ++IE +I+++GL K +A  I++ S
Sbjct: 34  IAVILSAQTTDVQVNRVTAKLFKIAPDPLKLSKLSLDKIESLINSIGLYKNKAKNIQQTS 93

Query: 65  Q----EYLGE---SWTHVTQLHGVGKYAADAF 89
                +Y GE   S   +  L GVG+  A+  
Sbjct: 94  SMLITKYNGEVPQSRKELENLPGVGRKTANVI 125


>gi|281491571|ref|YP_003353551.1| endonuclease III [Lactococcus lactis subsp. lactis KF147]
 gi|281375289|gb|ADA64802.1| Endonuclease III [Lactococcus lactis subsp. lactis KF147]
          Length = 218

 Score = 40.8 bits (94), Expect = 0.100,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 27/41 (65%)

Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
          LF   PDA+T ++   EEIEK+I T+GL K +A  I R SQ
Sbjct: 55 LFAAFPDAQTMSQAKVEEIEKLIRTIGLYKTKAKNILRTSQ 95


>gi|420151539|ref|ZP_14658642.1| endonuclease III [Actinomyces massiliensis F0489]
 gi|394766615|gb|EJF47646.1| endonuclease III [Actinomyces massiliensis F0489]
          Length = 247

 Score = 40.8 bits (94), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 9/75 (12%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAP--------MIKRFSQEYLGESW 72
           V  +LF LCPDA      D +++E ++  LG+Q+ RA         +++RF    + +  
Sbjct: 87  VTPELFALCPDAGALAAADPDDLEALLRPLGMQRTRAERLIGLASGLVERFDG-VVPQGR 145

Query: 73  THVTQLHGVGKYAAD 87
             +  L GVG+  A+
Sbjct: 146 EALMTLPGVGRKTAN 160


>gi|392562227|gb|EIW55408.1| hypothetical protein TRAVEDRAFT_153786 [Trametes versicolor
           FP-101664 SS1]
          Length = 291

 Score = 40.8 bits (94), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 75  VTQLHGVGKYAADAFAIFCTGK--WDRVRPTDHMLNYYWEF 113
           V+ + G G YA D++ IFC+GK  W  VRP D  L  Y E+
Sbjct: 212 VSHIPGSGPYALDSYRIFCSGKDEWKSVRPRDKELVKYLEW 252


>gi|323693535|ref|ZP_08107739.1| endonuclease III [Clostridium symbiosum WAL-14673]
 gi|355629427|ref|ZP_09050395.1| hypothetical protein HMPREF1020_04474 [Clostridium sp. 7_3_54FAA]
 gi|323502390|gb|EGB18248.1| endonuclease III [Clostridium symbiosum WAL-14673]
 gi|354819113|gb|EHF03565.1| hypothetical protein HMPREF1020_04474 [Clostridium sp. 7_3_54FAA]
          Length = 211

 Score = 40.8 bits (94), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD K   E D ++IE+I+   GL + +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVQDLYDKFPDVKALAEADVDKIEEIVRPCGLGRSKARDINAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   E +  + +L GVG+ +A+
Sbjct: 94  MKILWEQYGGKVPEDFDALLKLPGVGRKSAN 124


>gi|308807385|ref|XP_003081003.1| putative metalloendopeptidase (ISS) [Ostreococcus tauri]
 gi|116059465|emb|CAL55172.1| putative metalloendopeptidase (ISS) [Ostreococcus tauri]
          Length = 305

 Score = 40.4 bits (93), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 44/96 (45%), Gaps = 7/96 (7%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT---- 76
           VIS+ F   P    A   DAEE+ ++I  LGL   R   I   S   L  +         
Sbjct: 182 VISNFFEKFPTPTAALHADAEEVFEVIKRLGLFPGRHRTIVEVSTAILTNTGAFEVGLEP 241

Query: 77  --QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
             +++G+G++  D+F IF  G  +R +P D  L  +
Sbjct: 242 EKKIYGIGEFGIDSFEIFARGDINR-KPKDCNLAAF 276


>gi|29840684|ref|NP_829790.1| endonuclease III [Chlamydophila caviae GPIC]
 gi|29835034|gb|AAP05668.1| endonuclease III [Chlamydophila caviae GPIC]
          Length = 214

 Score = 40.4 bits (93), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 14/92 (15%)

Query: 12  IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI---- 60
           +AILL           V  +LF+  PDA+   ++  +++  IIS  GL K++A  +    
Sbjct: 37  VAILLSGNSTDKAVNSVTPELFSAAPDAQALAKLPLDKLYFIISPCGLGKRKAAYLHDLS 96

Query: 61  KRFSQEYLGE---SWTHVTQLHGVGKYAADAF 89
           K  S++Y GE   S   +T+L GVG+  A  F
Sbjct: 97  KIISEKYRGEPPASLELLTKLPGVGRKTASVF 128


>gi|336375826|gb|EGO04161.1| hypothetical protein SERLA73DRAFT_45279 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 218

 Score = 40.4 bits (93), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 46/134 (34%), Gaps = 53/134 (39%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEY---------------------- 67
           P A    +   E+I  +I +LGLQ KRA    R SQ Y                      
Sbjct: 44  PTADALRQAKVEDIVSVIQSLGLQNKRARHFIRLSQAYVEDPPQPNVLRKSNKPHYLGYI 103

Query: 68  ----LGES-----------------------WTHVTQLHGVGKYAADAFAIFCT----GK 96
               LG S                        T V+ + G+G YA D++ IFC      +
Sbjct: 104 YIGALGSSPQGLPSHQHRRYQHYRRCRLQYPMTPVSHMPGMGPYALDSYRIFCVKSEYNE 163

Query: 97  WDRVRPTDHMLNYY 110
           W RV P D  L  Y
Sbjct: 164 WKRVVPEDKELIRY 177


>gi|323485089|ref|ZP_08090442.1| hypothetical protein HMPREF9474_02193 [Clostridium symbiosum
           WAL-14163]
 gi|323401645|gb|EGA93990.1| hypothetical protein HMPREF9474_02193 [Clostridium symbiosum
           WAL-14163]
          Length = 175

 Score = 40.4 bits (93), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD K   E D ++IE+I+   GL + +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVQDLYDKFPDVKALAEADVDKIEEIVRPCGLGRSKARDINAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   E +  + +L GVG+ +A+
Sbjct: 94  MKILWEQYGGKVPEDFDALLKLPGVGRKSAN 124


>gi|357056045|ref|ZP_09117101.1| hypothetical protein HMPREF9467_04073 [Clostridium clostridioforme
           2_1_49FAA]
 gi|355381537|gb|EHG28660.1| hypothetical protein HMPREF9467_04073 [Clostridium clostridioforme
           2_1_49FAA]
          Length = 212

 Score = 40.4 bits (93), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     E D E+IE+I+   GL   +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEDLYAKYPDVDALAEADVEDIERIVKPCGLGHSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   + +  + +L GVG+ +A+
Sbjct: 94  MKILKEQYEGKVPDDFDALLKLPGVGRKSAN 124


>gi|353244084|emb|CCA75538.1| hypothetical protein PIIN_09528 [Piriformospora indica DSM 11827]
          Length = 199

 Score = 40.4 bits (93), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/125 (24%), Positives = 57/125 (45%), Gaps = 27/125 (21%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESW-------- 72
           V++++ T  P  +  ++  A E+E +++ LGL +KR   + +F+  +L +          
Sbjct: 42  VLNEVRTRWPTMEALSQAHAPELE-LLTPLGLGRKRTQRLIQFATMWLEDPPDQMSLRPS 100

Query: 73  -------------THVTQLHGVGKYAADAFAIFC-----TGKWDRVRPTDHMLNYYWEFL 114
                        T V+ L G+G+YA D++ IFC       +W  V P D  L  +  + 
Sbjct: 101 RVQVAKGVAKYPPTAVSHLPGMGRYALDSYRIFCGQLVGGNEWKTVTPLDKELRVHMRWR 160

Query: 115 VSTKG 119
            + +G
Sbjct: 161 WAVEG 165


>gi|346307710|ref|ZP_08849841.1| hypothetical protein HMPREF9457_01550 [Dorea formicigenerans
           4_6_53AFAA]
 gi|345904878|gb|EGX74620.1| hypothetical protein HMPREF9457_01550 [Dorea formicigenerans
           4_6_53AFAA]
          Length = 212

 Score = 40.4 bits (93), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     +   EEIE+I+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEDLYAKFPDVNALADAPVEEIEEIVRPCGLGKSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K    EY G   + +  + +L GVG+ +A+
Sbjct: 94  MKILRDEYNGKVPDDFDKLLKLPGVGRKSAN 124


>gi|160935307|ref|ZP_02082689.1| hypothetical protein CLOBOL_00202 [Clostridium bolteae ATCC
           BAA-613]
 gi|158441665|gb|EDP19365.1| hypothetical protein CLOBOL_00202 [Clostridium bolteae ATCC
           BAA-613]
          Length = 212

 Score = 40.4 bits (93), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     E D E+IE+I+   GL   +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEDLYAKYPDVDALAEADVEDIERIVKPCGLGHSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   + +  + +L GVG+ +A+
Sbjct: 94  MKILKEQYGGKVPDDFDALLKLPGVGRKSAN 124


>gi|166031547|ref|ZP_02234376.1| hypothetical protein DORFOR_01247 [Dorea formicigenerans ATCC
           27755]
 gi|166028524|gb|EDR47281.1| putative endonuclease III [Dorea formicigenerans ATCC 27755]
          Length = 249

 Score = 40.4 bits (93), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     +   EEIE+I+   GL K +A      
Sbjct: 71  LVSVRLAAQCTDARVNVVVEDLYAKFPDVNALADAPVEEIEEIVRPCGLGKSKARDISAC 130

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K    EY G   + +  + +L GVG+ +A+
Sbjct: 131 MKILRDEYNGKVPDDFDKLLKLPGVGRKSAN 161


>gi|154503500|ref|ZP_02040560.1| hypothetical protein RUMGNA_01324 [Ruminococcus gnavus ATCC 29149]
 gi|336431486|ref|ZP_08611335.1| hypothetical protein HMPREF0991_00454 [Lachnospiraceae bacterium
           2_1_58FAA]
 gi|153795600|gb|EDN78020.1| putative endonuclease III [Ruminococcus gnavus ATCC 29149]
 gi|336013788|gb|EGN43660.1| hypothetical protein HMPREF0991_00454 [Lachnospiraceae bacterium
           2_1_58FAA]
          Length = 213

 Score = 40.0 bits (92), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 7/74 (9%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PMIKRFSQEYLGE---SWT 73
           V+ DL+   PD     E D +EIE I+   GL K +A      ++    EY GE    + 
Sbjct: 53  VVKDLYAKFPDVNALAEADPQEIEAIVRPCGLGKSKARDISACMRILRDEYHGEVPDDFD 112

Query: 74  HVTQLHGVGKYAAD 87
            + +L GVG+ +A+
Sbjct: 113 ALLKLPGVGRKSAN 126


>gi|358066043|ref|ZP_09152577.1| hypothetical protein HMPREF9473_04640 [Clostridium hathewayi
           WAL-18680]
 gi|356695906|gb|EHI57531.1| hypothetical protein HMPREF9473_04640 [Clostridium hathewayi
           WAL-18680]
          Length = 211

 Score = 40.0 bits (92), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
           + S+RL       +   V+ DL+   PD ++  E D ++IE+I+   GL   +A  I   
Sbjct: 34  LVSVRLAAQCTDARVNVVVQDLYAKYPDVESLAEADVDDIERIVKPCGLGHSKARDISGC 93

Query: 61  -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
            K    E+ G   + +  + +L GVG+ +A+
Sbjct: 94  MKMLRDEFGGKVPDDFDALMKLPGVGRKSAN 124


>gi|145350560|ref|XP_001419671.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579903|gb|ABO97964.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 331

 Score = 40.0 bits (92), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 7/95 (7%)

Query: 22  ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT----- 76
           IS+ F   P    A   D++++ +II  LGL   R   I   S ++L E+          
Sbjct: 222 ISNFFAKFPTPSAALAADSQDVFEIIKRLGLFPGRMRTIVEVSTKFLLETGEFEVGLEPE 281

Query: 77  -QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
            +++G+G++  D+F IF  G   R +P D  L  +
Sbjct: 282 LKIYGIGEFGVDSFEIFARGDV-RRKPKDCNLQSF 315


>gi|389750655|gb|EIM91728.1| hypothetical protein STEHIDRAFT_70147 [Stereum hirsutum FP-91666
           SS1]
          Length = 439

 Score = 40.0 bits (92), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 5/52 (9%)

Query: 73  THVTQLHGVGKYAADAFAIFCTG-----KWDRVRPTDHMLNYYWEFLVSTKG 119
           + ++ L G GKYA D++ I+C G     +W  VRP D  L  Y ++  +T+G
Sbjct: 349 SPISHLPGSGKYAIDSYRIYCGGIAGDDEWRHVRPEDKELVRYLKWRWATEG 400



 Score = 36.2 bits (82), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 31/54 (57%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE 70
           KA  +  +L    P AK  T+   +E+ +II TLGLQ+ R+  + + SQ Y+ E
Sbjct: 169 KAIPIFWELMERWPSAKAMTKAHEDELCEIIRTLGLQRTRSARLIKLSQAYIDE 222


>gi|336388964|gb|EGO30107.1| putative 5-Methylcytosine G/T mismatch-specific DNA glycosylase
           [Serpula lacrymans var. lacrymans S7.9]
          Length = 179

 Score = 40.0 bits (92), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 46/134 (34%), Gaps = 53/134 (39%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEY---------------------- 67
           P A    +   E+I  +I +LGLQ KRA    R SQ Y                      
Sbjct: 5   PTADALRQAKVEDIVSVIQSLGLQNKRARHFIRLSQAYVEDPPQPNVLRKSNKPHYLGYI 64

Query: 68  ----LGES-----------------------WTHVTQLHGVGKYAADAFAIFCT----GK 96
               LG S                        T V+ + G+G YA D++ IFC      +
Sbjct: 65  YIGALGSSPQGLPSHQHRRYQHYRRCRLQYPMTPVSHMPGMGPYALDSYRIFCVKSEYNE 124

Query: 97  WDRVRPTDHMLNYY 110
           W RV P D  L  Y
Sbjct: 125 WKRVVPEDKELIRY 138


>gi|317473465|ref|ZP_07932759.1| endonuclease III [Anaerostipes sp. 3_2_56FAA]
 gi|316899115|gb|EFV21135.1| endonuclease III [Anaerostipes sp. 3_2_56FAA]
          Length = 211

 Score = 39.3 bits (90), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     E D  +IE I+   GL + +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVQDLYEKYPDVNALAEADPADIEAIVRPCGLGRSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K    EY G   +++  + +L GVG+ +A+
Sbjct: 94  MKILRDEYGGGIPDNFKDLMKLPGVGRKSAN 124


>gi|414074236|ref|YP_006999453.1| Endonuclease III [Lactococcus lactis subsp. cremoris UC509.9]
 gi|413974156|gb|AFW91620.1| Endonuclease III [Lactococcus lactis subsp. cremoris UC509.9]
          Length = 218

 Score = 39.3 bits (90), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 19 GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
           +    LF   PDA+T ++   EEIE +I T+GL K +A  I R SQ
Sbjct: 49 NKATPGLFAAFPDAQTMSQAKVEEIEALIRTIGLYKTKAKNILRTSQ 95


>gi|374586841|ref|ZP_09659933.1| DNA-(apurinic or apyrimidinic site) lyase [Leptonema illini DSM
           21528]
 gi|373875702|gb|EHQ07696.1| DNA-(apurinic or apyrimidinic site) lyase [Leptonema illini DSM
           21528]
          Length = 229

 Score = 39.3 bits (90), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +A++L A        RV  +LF   PD ++  E    E+E++I + G  + +A  I+ F 
Sbjct: 37  VAVILSAQCTDEMVNRVTPELFKAFPDPRSLAEAPVAEVERLIHSTGFYRNKARNIQGFC 96

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           +  + E    V Q       + GVG+  A+
Sbjct: 97  RALVNEHGGKVPQTIEELVKMPGVGRKTAN 126


>gi|224010940|ref|XP_002294427.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969922|gb|EED88261.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 472

 Score = 38.9 bits (89), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 25  LFTLC---PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL 68
           LF+L    P   +    D EEI +IIS LGL  KR+  I RFS+EY+
Sbjct: 315 LFSLLQRWPRVHSMARADWEEIREIISPLGLMNKRSKSIIRFSKEYI 361


>gi|116511912|ref|YP_809128.1| endonuclease III / DNA-(apurinic or apyrimidinic site) lyase
          [Lactococcus lactis subsp. cremoris SK11]
 gi|385838143|ref|YP_005875773.1| Endonuclease III [Lactococcus lactis subsp. cremoris A76]
 gi|116107566|gb|ABJ72706.1| DNA-(apurinic or apyrimidinic site) lyase [Lactococcus lactis
          subsp. cremoris SK11]
 gi|358749371|gb|AEU40350.1| Endonuclease III [Lactococcus lactis subsp. cremoris A76]
          Length = 218

 Score = 38.9 bits (89), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 26/41 (63%)

Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
          LF   PDA+T ++   EEIE +I T+GL K +A  I R SQ
Sbjct: 55 LFAAFPDAQTMSQAKVEEIEALIRTIGLYKTKAKNILRTSQ 95


>gi|402218777|gb|EJT98852.1| DNA glycosylase [Dacryopinax sp. DJM-731 SS1]
          Length = 445

 Score = 38.9 bits (89), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 49/127 (38%), Gaps = 42/127 (33%)

Query: 31  DAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE---------------SWTHV 75
           DA + + V   E+ +++  LGLQ  RA  I  FS++Y                    T V
Sbjct: 266 DAASLSHVPEVELVEVLRPLGLQNTRACRILTFSKQYGSAPPSTQLYPLRVDKTYPLTAV 325

Query: 76  TQLHGVGKYAADAFAIFCTG-------------------------KWDRVRPTDHMLNYY 110
               GVG+YAAD++ I+                            +W +VRP D  L +Y
Sbjct: 326 ANYAGVGRYAADSWRIYSPTLEGGGAPRGAKETLEGFFEPENSEQEWKKVRPLDKELRWY 385

Query: 111 --WEFLV 115
             W + V
Sbjct: 386 LIWRWAV 392


>gi|393795369|ref|ZP_10378733.1| HhH-GPD family protein [Candidatus Nitrosoarchaeum limnia BG20]
          Length = 235

 Score = 38.9 bits (89), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 23  SDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE-------SWTHV 75
           S++ + C D      +D  +++K++  LGL+  R+  +K  S++ L +       +   +
Sbjct: 75  SNITSRCNDFNDLQNIDLRKLKKVLKPLGLEHTRSKHLKDLSKKILADFNGKVPGTKKEL 134

Query: 76  TQLHGVGKYAADAFAIFCTG 95
             L GVG Y A+A   F  G
Sbjct: 135 LSLPGVGPYTANAVLCFAFG 154


>gi|224009405|ref|XP_002293661.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971061|gb|EED89397.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 658

 Score = 38.9 bits (89), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 25  LFTLC---PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL 68
           LF+L    P   +    D EEI +IIS LGL  KR+  I RFS+EY+
Sbjct: 501 LFSLLQRWPRVHSMARADWEEIREIISPLGLMNKRSKSIIRFSKEYI 547


>gi|292656948|ref|YP_003536845.1| A/G-specific adenine glycosylase [Haloferax volcanii DS2]
 gi|448293546|ref|ZP_21483651.1| A/G-specific adenine glycosylase [Haloferax volcanii DS2]
 gi|291372790|gb|ADE05017.1| A/G-specific adenine glycosylase [Haloferax volcanii DS2]
 gi|445570407|gb|ELY24970.1| A/G-specific adenine glycosylase [Haloferax volcanii DS2]
          Length = 212

 Score = 38.9 bits (89), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 34/69 (49%), Gaps = 8/69 (11%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLG-------ESWTHVTQLHGVG 82
           P  +T      E IE+ I+ LGL  KRA  I+R SQ+ +          +  + +LHGVG
Sbjct: 63  PSPETVVAASPEAIERRIAPLGL-AKRAEFIRRTSQQLIARHSGDVPRRYADLLELHGVG 121

Query: 83  KYAADAFAI 91
            Y A +  I
Sbjct: 122 DYTARSVLI 130


>gi|15618746|ref|NP_225032.1| endonuclease III [Chlamydophila pneumoniae CWL029]
 gi|15836370|ref|NP_300894.1| endonuclease III [Chlamydophila pneumoniae J138]
 gi|16752201|ref|NP_445569.1| endonuclease III [Chlamydophila pneumoniae AR39]
 gi|33242197|ref|NP_877138.1| endonuclease III [Chlamydophila pneumoniae TW-183]
 gi|4377151|gb|AAD18975.1| Enodnuclease III [Chlamydophila pneumoniae CWL029]
 gi|7189945|gb|AAF38807.1| endonuclease III [Chlamydophila pneumoniae AR39]
 gi|8979211|dbj|BAA99045.1| enodnuclease III [Chlamydophila pneumoniae J138]
 gi|33236708|gb|AAP98795.1| endonuclease III [Chlamydophila pneumoniae TW-183]
          Length = 209

 Score = 38.9 bits (89), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 12  IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IAILL           V   LF   PDA++  ++   ++ ++I+  GL ++++  I + S
Sbjct: 31  IAILLSGNSTDKAVNSVTPQLFAKAPDAQSILDLPPGKLYQLIAPCGLGERKSAYIYQLS 90

Query: 65  Q----EYLGESWTH---VTQLHGVGKYAADAFAIFCTGK 96
           Q    ++ GE       +TQL GVG+  A  F     GK
Sbjct: 91  QILVRDFHGEPPNDMALLTQLPGVGRKTASVFLGIAYGK 129


>gi|384449995|ref|YP_005662597.1| putative enodnuclease III [Chlamydophila pneumoniae LPCoLN]
 gi|269302624|gb|ACZ32724.1| putative enodnuclease III [Chlamydophila pneumoniae LPCoLN]
          Length = 209

 Score = 38.9 bits (89), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 12  IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IAILL           V   LF   PDA++  ++   ++ ++I+  GL ++++  I + S
Sbjct: 31  IAILLSGNSTDKAVNSVTPQLFAKAPDAQSILDLPPGKLYQLIAPCGLGERKSAYIYQLS 90

Query: 65  Q----EYLGESWTH---VTQLHGVGKYAADAFAIFCTGK 96
           Q    ++ GE       +TQL GVG+  A  F     GK
Sbjct: 91  QILVRDFHGEPPNDMALLTQLPGVGRKTASVFLGIAYGK 129


>gi|302339269|ref|YP_003804475.1| endonuclease III [Spirochaeta smaragdinae DSM 11293]
 gi|301636454|gb|ADK81881.1| endonuclease III [Spirochaeta smaragdinae DSM 11293]
          Length = 217

 Score = 38.9 bits (89), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 9/84 (10%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA--------PMIKRFSQEYLGESW 72
           V   LF   PDAK+  E D EE+E +I ++G  + +A         ++++F    + ES 
Sbjct: 51  VTEKLFRRFPDAKSLAEADGEEVEDLIHSVGFYRVKARHIVAAAAALLEKFDGS-VPESM 109

Query: 73  THVTQLHGVGKYAADAFAIFCTGK 96
             +  + G+G+  A+     C GK
Sbjct: 110 EELLMIPGLGRKGANVVLGDCFGK 133


>gi|291536153|emb|CBL09265.1| Predicted EndoIII-related endonuclease [Roseburia intestinalis
           M50/1]
          Length = 212

 Score = 38.5 bits (88), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+  L+   PD K   E   EEIE+I+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNIVVEKLYEKFPDVKALAEAPVEEIEEIVRPCGLGKSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   E +  + +L GVG+ +A+
Sbjct: 94  MKILWEQYGGNVPEDFDSLLKLPGVGRKSAN 124


>gi|167747754|ref|ZP_02419881.1| hypothetical protein ANACAC_02475 [Anaerostipes caccae DSM 14662]
 gi|167653116|gb|EDR97245.1| putative endonuclease III [Anaerostipes caccae DSM 14662]
          Length = 231

 Score = 38.5 bits (88), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD     E D  +IE I+   GL + +A      
Sbjct: 54  LVSVRLAAQCTDARVNVVVQDLYEKYPDVNALAEADPADIEAIVRPCGLGRSKARDISAC 113

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K    EY G   +++  + +L GVG+ +A+
Sbjct: 114 MKILRDEYGGGIPDNFKALMKLPGVGRKSAN 144


>gi|240146275|ref|ZP_04744876.1| endonuclease III [Roseburia intestinalis L1-82]
 gi|257201577|gb|EEU99861.1| endonuclease III [Roseburia intestinalis L1-82]
 gi|291538985|emb|CBL12096.1| Predicted EndoIII-related endonuclease [Roseburia intestinalis
           XB6B4]
          Length = 212

 Score = 38.5 bits (88), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+  L+   PD K   E   EEIE+I+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNIVVEKLYEKFPDVKALAEAPVEEIEEIVRPCGLGKSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   E +  + +L GVG+ +A+
Sbjct: 94  MKILWEQYGGNVPEDFDSLLKLPGVGRKSAN 124


>gi|225571496|ref|ZP_03780492.1| hypothetical protein CLOHYLEM_07594 [Clostridium hylemonae DSM
           15053]
 gi|225159573|gb|EEG72192.1| hypothetical protein CLOHYLEM_07594 [Clostridium hylemonae DSM
           15053]
          Length = 211

 Score = 38.5 bits (88), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD +       EEIE+I+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEDLYEKYPDVEALASAPPEEIEEIVRPCGLGKSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K    EY G   + +  + +L GVG+ +A+
Sbjct: 94  MKILRDEYGGNVPDDFDALLKLPGVGRKSAN 124


>gi|449547781|gb|EMD38748.1| hypothetical protein CERSUDRAFT_133253 [Ceriporiopsis subvermispora
           B]
          Length = 282

 Score = 38.5 bits (88), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 42/110 (38%), Gaps = 39/110 (35%)

Query: 43  IEKIISTLGLQKKRAPMIKRFSQEYLGESW------------------------------ 72
           IE +I  LGL K+R+  I   S+ YL +                                
Sbjct: 140 IEDLIKHLGLGKQRSRRIIELSRAYLDDPPIPGTLRISRCYITARTVACENGSIGLTKMR 199

Query: 73  ---THVTQLHGVGKYAADAFAIFCTG--KWDRVRPTD----HMLNYYWEF 113
              T ++ L G G YA D++ IFC G   W+ V P+D      L + W F
Sbjct: 200 YPPTPISHLPGCGPYALDSYRIFCEGAADWEAVLPSDKELIRFLRWKWAF 249


>gi|153852865|ref|ZP_01994302.1| hypothetical protein DORLON_00284 [Dorea longicatena DSM 13814]
 gi|149754507|gb|EDM64438.1| putative endonuclease III [Dorea longicatena DSM 13814]
          Length = 210

 Score = 38.5 bits (88), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+  L+   PD     E   EEIE+I+   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVESLYAKYPDVNALAEATPEEIEEIVRPCGLGKSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           ++    EY G   + +  + +L GVG+ +A+
Sbjct: 94  MRMLRDEYDGKVPDDFNKLLKLPGVGRKSAN 124


>gi|223939321|ref|ZP_03631201.1| HhH-GPD family protein [bacterium Ellin514]
 gi|223892034|gb|EEF58515.1| HhH-GPD family protein [bacterium Ellin514]
          Length = 239

 Score = 38.5 bits (88), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 6/68 (8%)

Query: 31  DAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE------SWTHVTQLHGVGKY 84
           D  T      ++I KI+  LGL ++RAP IK  + E   E      +   +  L GVG+Y
Sbjct: 79  DWDTLANTSEKKICKILQPLGLSRQRAPRIKALAVEICTEKGRFPQTQEDILLLPGVGQY 138

Query: 85  AADAFAIF 92
            A+A  +F
Sbjct: 139 IANAILLF 146


>gi|296242620|ref|YP_003650107.1| HhH-GPD family protein [Thermosphaera aggregans DSM 11486]
 gi|296095204|gb|ADG91155.1| HhH-GPD family protein [Thermosphaera aggregans DSM 11486]
          Length = 217

 Score = 38.1 bits (87), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 8/83 (9%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE-------SW 72
           +V S  F+  P+  T  +    +IE II  LGL +KRA  ++  +Q  + E       + 
Sbjct: 54  KVFSSFFSKFPNIGTLAKASETDIEAIIGELGL-RKRAGFLRELAQHAVEEFGDKIPNTL 112

Query: 73  THVTQLHGVGKYAADAFAIFCTG 95
             + +L GVG Y A+A   F  G
Sbjct: 113 EDLMKLKGVGLYTANAVRSFAYG 135


>gi|312127482|ref|YP_003992356.1| endonuclease iii [Caldicellulosiruptor hydrothermalis 108]
 gi|311777501|gb|ADQ06987.1| endonuclease III [Caldicellulosiruptor hydrothermalis 108]
          Length = 202

 Score = 38.1 bits (87), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 7/80 (8%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ----EYLGESW 72
           +  ++ ++LF   P  K+  E D EE+EK I  +G  K +A  IK  S+    +Y G   
Sbjct: 38  RVNKITAELFKKYPTLKSFAEADLEELEKDIKPVGFYKNKAKSIKETSRILIDKYNGTLP 97

Query: 73  THV---TQLHGVGKYAADAF 89
           T +    +L GVG+  A+  
Sbjct: 98  TTIEELVKLKGVGRKTANVI 117


>gi|383763496|ref|YP_005442478.1| DNA glycosylase/AP lyase [Caldilinea aerophila DSM 14535 = NBRC
           104270]
 gi|381383764|dbj|BAM00581.1| DNA glycosylase/AP lyase [Caldilinea aerophila DSM 14535 = NBRC
           104270]
          Length = 220

 Score = 37.4 bits (85), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI----KRFSQEYLGESW 72
           +  +V   LFT  PDA+   E D  E+E++I + G  + +A  I    +R  +EY GE  
Sbjct: 57  RVNKVTPALFTKYPDARAFAEADLSELEEMIRSTGFYRNKAKNIQQACRRIVEEYDGEVP 116

Query: 73  THVTQ---LHGVGKYAAD 87
             + +   L GV +  A+
Sbjct: 117 ADMEKLLTLAGVARKTAN 134


>gi|153815700|ref|ZP_01968368.1| hypothetical protein RUMTOR_01937 [Ruminococcus torques ATCC 27756]
 gi|145846941|gb|EDK23859.1| putative endonuclease III [Ruminococcus torques ATCC 27756]
          Length = 222

 Score = 37.4 bits (85), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
           + S+RL       +   V+  L+   P  +   E D +EIEKI+   GL + +A  I   
Sbjct: 44  LVSVRLAAQCTDARVNVVVEGLYEKYPTVEALAEADVDEIEKIVRPCGLGRSKARDISGC 103

Query: 61  -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
            K    EY G   + +  + +L GVG+ +A+
Sbjct: 104 MKMIRDEYEGKVPDDFDALMKLPGVGRKSAN 134


>gi|296118073|ref|ZP_06836655.1| endonuclease III [Corynebacterium ammoniagenes DSM 20306]
 gi|295968959|gb|EFG82202.1| endonuclease III [Corynebacterium ammoniagenes DSM 20306]
          Length = 218

 Score = 37.4 bits (85), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 7/78 (8%)

Query: 16  LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE----- 70
           ++  +V  DLF   P A    + D  +IE II  LGL   +A  +    Q+ +G+     
Sbjct: 46  VRVNQVTPDLFKAYPRAIDYAQADVTDIEDIIRPLGLAPSKAKRLVGLGQKLVGDFDGEV 105

Query: 71  --SWTHVTQLHGVGKYAA 86
             S   +T L GVG+  A
Sbjct: 106 PTSIADLTSLPGVGRKTA 123


>gi|317501913|ref|ZP_07960097.1| endonuclease III [Lachnospiraceae bacterium 8_1_57FAA]
 gi|331088272|ref|ZP_08337191.1| hypothetical protein HMPREF1025_00774 [Lachnospiraceae bacterium
           3_1_46FAA]
 gi|336440109|ref|ZP_08619708.1| hypothetical protein HMPREF0990_02102 [Lachnospiraceae bacterium
           1_1_57FAA]
 gi|316896593|gb|EFV18680.1| endonuclease III [Lachnospiraceae bacterium 8_1_57FAA]
 gi|330408516|gb|EGG87982.1| hypothetical protein HMPREF1025_00774 [Lachnospiraceae bacterium
           3_1_46FAA]
 gi|336014458|gb|EGN44307.1| hypothetical protein HMPREF0990_02102 [Lachnospiraceae bacterium
           1_1_57FAA]
          Length = 212

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
           + S+RL       +   V+  L+   P  +   E D +EIEKI+   GL + +A  I   
Sbjct: 34  LVSVRLAAQCTDARVNVVVEGLYEKYPTVEALAEADVDEIEKIVRPCGLGRSKARDISGC 93

Query: 61  -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
            K    EY G   + +  + +L GVG+ +A+
Sbjct: 94  MKMIRDEYEGKVPDDFDALMKLPGVGRKSAN 124


>gi|89897892|ref|YP_515002.1| enodnuclease III [Chlamydophila felis Fe/C-56]
 gi|89331264|dbj|BAE80857.1| enodnuclease III [Chlamydophila felis Fe/C-56]
          Length = 212

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 14/92 (15%)

Query: 12  IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA+LL           V   LF + PDA+   ++  +++  IIS  GL ++++  +   S
Sbjct: 37  IAVLLSGNSTDKAVNSVTPKLFAIAPDAQALAQLPLKKLYSIISPCGLGERKSEYLHNLS 96

Query: 65  ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
               + Y GE   S   +T+L G+G+  A  F
Sbjct: 97  KILLERYHGEPPASLDLLTELPGIGRKTASVF 128


>gi|375335673|ref|ZP_09777017.1| endonuclease III [Succinivibrionaceae bacterium WG-1]
          Length = 209

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 7/75 (9%)

Query: 19  GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ----EYLGE---S 71
            +V   LF   PDA++  ++  EEIE II  +GL + +A  +K   +    ++ GE   +
Sbjct: 49  NQVTPILFAKAPDAQSMLKLTNEEIENIIKKIGLYRNKAKNLKALCESLVNDFNGEIPST 108

Query: 72  WTHVTQLHGVGKYAA 86
              + +L GVGK  A
Sbjct: 109 EEELIKLPGVGKKTA 123


>gi|194333300|ref|YP_002015160.1| endonuclease III [Prosthecochloris aestuarii DSM 271]
 gi|194311118|gb|ACF45513.1| endonuclease III [Prosthecochloris aestuarii DSM 271]
          Length = 211

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 41/95 (43%), Gaps = 9/95 (9%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS---QEYLG---- 69
           K   +   LF +CPDA++ +  D EEI  ++ ++     +A  I   S    E  G    
Sbjct: 47  KVNMITETLFKVCPDAESMSRTDPEEIRSMVRSINYNNTKAKNILAASCLLMENFGGQVP 106

Query: 70  ESWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTD 104
           +S   +  L GVG+  A+       GK   V P D
Sbjct: 107 DSREELETLPGVGRKTANVVLSNAFGKP--VMPVD 139


>gi|295090057|emb|CBK76164.1| Predicted EndoIII-related endonuclease [Clostridium cf.
           saccharolyticum K10]
          Length = 211

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
           + S+RL       +   V+ DL+   PD     E   EEIE I+   GL   +A  I   
Sbjct: 34  LVSVRLAAQCTDARVNVVVKDLYEKFPDVNALAEAPVEEIEAIVRPCGLGHSKAKDISDC 93

Query: 61  -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
            K    ++ G   +S+  +  L GVG+ +A+
Sbjct: 94  MKMLRDQFDGRVPDSFDALLSLPGVGRKSAN 124


>gi|266623597|ref|ZP_06116532.1| endonuclease III, partial [Clostridium hathewayi DSM 13479]
 gi|288864609|gb|EFC96907.1| endonuclease III [Clostridium hathewayi DSM 13479]
          Length = 191

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ DL+   PD +     + E+IEKI+   GL   +A      
Sbjct: 14  LVSVRLAAQCTDARVNVVVQDLYAEYPDVEALAGAEVEDIEKIVKPCGLGHSKARDISAC 73

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   ++Y G   + +  + +L GVG+ +A+
Sbjct: 74  MKILKEQYDGRVPDDFDALLKLPGVGRKSAN 104


>gi|225848427|ref|YP_002728590.1| endonuclease III [Sulfurihydrogenibium azorense Az-Fu1]
 gi|225644019|gb|ACN99069.1| probable endonuclease III (DNA-(apurinic orapyrimidinic site)
           lyase) [Sulfurihydrogenibium azorense Az-Fu1]
          Length = 216

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 11/94 (11%)

Query: 1   MAQIYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI 60
           +A I S+R K+    L + R    LF +    +    + AEEIEK+I  +G  K +A  I
Sbjct: 40  IATILSLRTKDQITALASDR----LFKVADTPEKMVNLPAEEIEKLIYPVGFYKNKAKTI 95

Query: 61  KRFS----QEYLG---ESWTHVTQLHGVGKYAAD 87
           K  S    ++Y G   ++   +  L GVG+  A+
Sbjct: 96  KEISKIILEKYAGKVPDNLEDLLSLKGVGRKTAN 129


>gi|283797428|ref|ZP_06346581.1| endonuclease III [Clostridium sp. M62/1]
 gi|291074786|gb|EFE12150.1| putative endonuclease III [Clostridium sp. M62/1]
          Length = 211

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
           + S+RL       +   V+ DL+   PD     E   EEIE I+   GL   +A  I   
Sbjct: 34  LVSVRLAAQCTDARVNVVVKDLYEKFPDVNALAEAPVEEIEAIVRPCGLGHSKAKDISAC 93

Query: 61  -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
            K    ++ G   +S+  +  L GVG+ +A+
Sbjct: 94  MKMLRDQFDGRVPDSFDALLSLPGVGRKSAN 124


>gi|125624307|ref|YP_001032790.1| endonuclease III [Lactococcus lactis subsp. cremoris MG1363]
 gi|389854667|ref|YP_006356911.1| endonuclease III [Lactococcus lactis subsp. cremoris NZ9000]
 gi|124493115|emb|CAL98079.1| putative endonuclease III (DNA repair) [Lactococcus lactis subsp.
          cremoris MG1363]
 gi|300071089|gb|ADJ60489.1| putative endonuclease III (DNA repair) [Lactococcus lactis subsp.
          cremoris NZ9000]
          Length = 218

 Score = 37.4 bits (85), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
          LF   PDA+  ++   EEIE +I T+GL K +A  I R SQ
Sbjct: 55 LFAAFPDAQAMSQAKVEEIEALIRTIGLYKTKAKNILRTSQ 95


>gi|226324116|ref|ZP_03799634.1| hypothetical protein COPCOM_01894 [Coprococcus comes ATCC 27758]
 gi|225207665|gb|EEG90019.1| putative endonuclease III [Coprococcus comes ATCC 27758]
          Length = 213

 Score = 37.0 bits (84), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+  LF   P  +   E D   IE+II   GL K +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVEGLFAKYPTVEALAEADVNNIEEIIRPCGLGKSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K    EY G   + +  + +L GVG+ +A+
Sbjct: 94  MKMLRDEYGGKIPKDFNAILKLPGVGRKSAN 124


>gi|303274809|ref|XP_003056719.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461071|gb|EEH58364.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 348

 Score = 37.0 bits (84), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 25/96 (26%), Positives = 40/96 (41%), Gaps = 7/96 (7%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWT------H 74
            IS  F   P    A      ++ ++I  LGL   R   +   S + L E+         
Sbjct: 240 TISAFFAKYPTPTAALAAKPNDVLEVIRPLGLFPNRFRSVVEISAKILTEAGPLDVGPDQ 299

Query: 75  VTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
             +++GVG++  D+F +FC G    V P D  L  +
Sbjct: 300 ERKIYGVGEFGIDSFNVFCRGDLS-VNPGDKTLQSF 334


>gi|417345281|ref|ZP_12125440.1| hypothetical protein LTSEBAI_5486 [Salmonella enterica subsp.
          enterica serovar Baildon str. R6-199]
 gi|357953150|gb|EHJ79805.1| hypothetical protein LTSEBAI_5486 [Salmonella enterica subsp.
          enterica serovar Baildon str. R6-199]
          Length = 114

 Score = 37.0 bits (84), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%), Gaps = 1/31 (3%)

Query: 52 LQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
          LQKK  PM+K+F+ EY G S T +TQ+ GVG
Sbjct: 2  LQKKSNPMVKKFNFEYCGFS-TFITQVDGVG 31


>gi|397669156|ref|YP_006510691.1| endonuclease III [Propionibacterium propionicum F0230a]
 gi|395142567|gb|AFN46674.1| endonuclease III [Propionibacterium propionicum F0230a]
          Length = 205

 Score = 37.0 bits (84), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 12/87 (13%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLG------- 69
           +  +V  +LF   PDA+     + +++E+IIS+LGL + +A  +   ++  +        
Sbjct: 43  RVNQVTPELFARYPDAEHLAGAEQDDVEEIISSLGLYRSKARSLTGLARALVADHDGRVP 102

Query: 70  ESWTHVTQLHGVGKYAA-----DAFAI 91
           +S   + +L GVG+  A     DAF I
Sbjct: 103 DSLDELVKLPGVGRKTANVVLGDAFGI 129


>gi|170085275|ref|XP_001873861.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164651413|gb|EDR15653.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 242

 Score = 37.0 bits (84), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 22/42 (52%), Gaps = 7/42 (16%)

Query: 73  THVTQLHGVGKYAADAFAIFCT-------GKWDRVRPTDHML 107
           T ++ L G G YA D++ IFCT        +W  V PTD  L
Sbjct: 159 TPISHLPGTGPYALDSYRIFCTLHHDPSSNEWQNVTPTDKEL 200


>gi|23099212|ref|NP_692678.1| endonuclease III [Oceanobacillus iheyensis HTE831]
 gi|22777440|dbj|BAC13713.1| endonuclease III (DNA repair) [Oceanobacillus iheyensis HTE831]
          Length = 216

 Score = 37.0 bits (84), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 45/93 (48%), Gaps = 14/93 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA+LL A        +V +DLF      +    V+  E+E  I ++GL + +A  I++  
Sbjct: 35  IAVLLSAQCTDKLVNKVTADLFQKYKTPEDYLSVELSELENDIRSIGLYRSKAKNIQKLC 94

Query: 65  Q----EYLGE---SWTHVTQLHGVGKYAADAFA 90
           Q    EY GE   S   + +L GVG+  A+  A
Sbjct: 95  QMLLDEYNGEIPSSKEELVKLAGVGRKTANVVA 127


>gi|225410189|ref|ZP_03761378.1| hypothetical protein CLOSTASPAR_05411 [Clostridium asparagiforme
           DSM 15981]
 gi|225042293|gb|EEG52539.1| hypothetical protein CLOSTASPAR_05411 [Clostridium asparagiforme
           DSM 15981]
          Length = 261

 Score = 36.6 bits (83), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
           + S+RL       +   V+ DL+   PD     +   EEIE+I+   GL   +A  I   
Sbjct: 77  LVSVRLAAQCTDARVNVVVQDLYAKYPDVNALADAPVEEIERIVKPCGLGHSKAKDISGC 136

Query: 61  -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
            K    +Y G   + +  + +L GVG+ +A+
Sbjct: 137 MKILRDQYGGKVPDDFDALLKLPGVGRKSAN 167


>gi|428769953|ref|YP_007161743.1| DNA-(apurinic or apyrimidinic site) lyase [Cyanobacterium aponinum
           PCC 10605]
 gi|428684232|gb|AFZ53699.1| endonuclease III [Cyanobacterium aponinum PCC 10605]
          Length = 212

 Score = 36.6 bits (83), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK----RFSQEYLG--- 69
           +  +V   LF   PDA++  + D EEIE +I + G  + +A  I+    R   ++ G   
Sbjct: 49  RVNKVTPALFKRFPDAQSFAQADREEIENLIRSTGFYRNKAKNIQNACIRIVNDFDGKVP 108

Query: 70  ESWTHVTQLHGVGKYAAD 87
           ++ T +  L GV +  A+
Sbjct: 109 QTMTELLTLAGVARKTAN 126


>gi|297584383|ref|YP_003700163.1| endonuclease III [Bacillus selenitireducens MLS10]
 gi|297142840|gb|ADH99597.1| endonuclease III [Bacillus selenitireducens MLS10]
          Length = 217

 Score = 36.6 bits (83), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 14/93 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF      +    V  EE+E+ I ++GL + +A  IK+ +
Sbjct: 35  IAVVLSAQATDSLVNKVTPRLFAKYKTPEDYANVPLEELEQDIRSIGLYRSKAKNIKKLA 94

Query: 65  Q----EYLGE---SWTHVTQLHGVGKYAADAFA 90
           Q    +Y GE   S T + +L GVG+  A+  A
Sbjct: 95  QSVIDDYQGEIPKSKTELKKLAGVGRKTANVVA 127


>gi|295398565|ref|ZP_06808597.1| endonuclease III [Aerococcus viridans ATCC 11563]
 gi|294973166|gb|EFG48961.1| endonuclease III [Aerococcus viridans ATCC 11563]
          Length = 223

 Score = 36.6 bits (83), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGL-------QKKRA 57
           IA+L+ A        +V  +LF   PD     E + E++E  I T+GL        KK A
Sbjct: 47  IAVLMSAQTTDVAVNKVTPNLFAKYPDPDHMAEAELEDLESYIKTIGLYHNKAKNMKKTA 106

Query: 58  PMIKRFSQEYLGESWTHVTQLHGVGKYAAD 87
            MI+      + ++   + QL GVG+  A+
Sbjct: 107 IMIRDEFNGQVPKTREELIQLPGVGRKTAN 136


>gi|389861522|ref|YP_006363762.1| A/G-specific adenine glycosylase [Thermogladius cellulolyticus
           1633]
 gi|388526426|gb|AFK51624.1| A/G-specific adenine glycosylase [Thermogladius cellulolyticus
           1633]
          Length = 134

 Score = 36.6 bits (83), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 7/72 (9%)

Query: 19  GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS----QEYLG---ES 71
            R    L    PD ++      EE+E+ +  LGL + RA  +K  +    +EY G   +S
Sbjct: 57  ARAFQALVERFPDVESLHSAPLEEVEEALKPLGLYRVRARQLKELAAVIVEEYGGRIPDS 116

Query: 72  WTHVTQLHGVGK 83
           W  + +L GVG+
Sbjct: 117 WEELVRLPGVGR 128


>gi|320528079|ref|ZP_08029244.1| endonuclease III [Solobacterium moorei F0204]
 gi|320131427|gb|EFW23992.1| endonuclease III [Solobacterium moorei F0204]
          Length = 217

 Score = 36.6 bits (83), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 26/96 (27%), Positives = 45/96 (46%), Gaps = 14/96 (14%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +A++L A        RV   LF   P      +   +++EK I++LGL + +A  I  F+
Sbjct: 34  VAVILSAQTTDVSVNRVTPALFKAYPSPYDLAKAPTKDVEKYIASLGLYRNKAKQIVGFA 93

Query: 65  Q----EYLGE---SWTHVTQLHGVGKYAADAFAIFC 93
           Q    ++ GE   +   +T L G+G+  A+     C
Sbjct: 94  QGVVEQFHGEVPHTMEELTTLPGIGRKCANVIMAEC 129


>gi|417377292|ref|ZP_12146247.1| hypothetical protein LTSEINV_5874 [Salmonella enterica subsp.
          enterica serovar Inverness str. R8-3668]
 gi|353590393|gb|EHC48930.1| hypothetical protein LTSEINV_5874 [Salmonella enterica subsp.
          enterica serovar Inverness str. R8-3668]
          Length = 114

 Score = 36.6 bits (83), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 22/31 (70%), Gaps = 1/31 (3%)

Query: 52 LQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
          LQKK  PM+K F+ EY G S T +TQ+ GVG
Sbjct: 2  LQKKSNPMVKNFNFEYCGFS-TFITQVDGVG 31


>gi|291523946|emb|CBK89533.1| Predicted EndoIII-related endonuclease [Eubacterium rectale DSM
           17629]
          Length = 212

 Score = 36.6 bits (83), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK-- 61
           + S+RL       +   V+ DLF   P  +   + D  +IE+I+   GL K +A  I   
Sbjct: 34  LVSVRLAAQCTDARVNVVVVDLFKKYPSIEALADADVSDIEEIVRPCGLGKSKARDISAC 93

Query: 62  -RFSQEYLG----ESWTHVTQLHGVGKYAAD 87
            R  ++  G    ++ T + +L GVG+ +A+
Sbjct: 94  MRMLRDDFGGLVPDNMTDLLKLPGVGRKSAN 124


>gi|239625177|ref|ZP_04668208.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
 gi|239519407|gb|EEQ59273.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
          Length = 211

 Score = 36.6 bits (83), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ +L+   PD     E   E+IE+I+   GL   +A      
Sbjct: 34  LVSVRLAAQCTDARVNVVVKELYERYPDVGALAEAKVEDIERIVKPCGLGHSKARDISAC 93

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K   +EY G   + +  + +L GVG+ +A+
Sbjct: 94  MKILQEEYGGRVPDDFDALLKLPGVGRKSAN 124


>gi|18313897|ref|NP_560564.1| U/G & T/G mismatch specific glycosylase (PA-MIG) [Pyrobaculum
           aerophilum str. IM2]
 gi|7141252|gb|AAF37270.1|AF222335_1 U/G and T/G mismatch-specific DNA glycosylase [Pyrobaculum
           aerophilum]
 gi|18161465|gb|AAL64746.1| U/G & T/G mismatch specific glycosylase (Pa-MIG) [Pyrobaculum
           aerophilum str. IM2]
          Length = 230

 Score = 36.2 bits (82), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 33/74 (44%), Gaps = 7/74 (9%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT-------QLHGVG 82
           P      +   EEI+ II  LG++  RA ++K+ S+E +      +         L GVG
Sbjct: 69  PSPARLADASVEEIKAIIQPLGMEHVRATLLKKLSEELVRRFNGQIPCDRDALKSLPGVG 128

Query: 83  KYAADAFAIFCTGK 96
            YAA    +   GK
Sbjct: 129 DYAASEVLLTACGK 142


>gi|409076139|gb|EKM76513.1| hypothetical protein AGABI1DRAFT_78639 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 284

 Score = 36.2 bits (82), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 34/78 (43%), Gaps = 10/78 (12%)

Query: 37  EVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTG- 95
           E D      ++S     +K     +R +++Y     T ++ L G G YA D++ IFC   
Sbjct: 158 EYDLRPSRPVVSPTKRSRKGGTRSRRLAEKYPP---TAISHLPGAGTYALDSYRIFCLSH 214

Query: 96  ------KWDRVRPTDHML 107
                 +W  V PTD  L
Sbjct: 215 QDPLCEEWTTVLPTDKEL 232


>gi|238923093|ref|YP_002936606.1| endonuclease III [Eubacterium rectale ATCC 33656]
 gi|238874765|gb|ACR74472.1| endonuclease III [Eubacterium rectale ATCC 33656]
          Length = 226

 Score = 36.2 bits (82), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK-- 61
           + S+RL       +   V+ DLF   P  +   + D  +IE+I+   GL K +A  I   
Sbjct: 48  LVSVRLAAQCTDARVNVVVVDLFKKYPSIEALADADVSDIEEIVRPCGLGKSKARDISAC 107

Query: 62  -RFSQEYLG----ESWTHVTQLHGVGKYAAD 87
            R  ++  G    ++ T + +L GVG+ +A+
Sbjct: 108 MRMLRDDFGGLVPDNMTDLLKLPGVGRKSAN 138


>gi|426194481|gb|EKV44412.1| hypothetical protein AGABI2DRAFT_74704, partial [Agaricus bisporus
           var. bisporus H97]
          Length = 209

 Score = 36.2 bits (82), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 34/78 (43%), Gaps = 10/78 (12%)

Query: 37  EVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTG- 95
           E D      ++S     +K     +R +++Y     T ++ L G G YA D++ IFC   
Sbjct: 83  EYDLRPSRPVVSPTKRSRKGGTRSRRLAEKYPP---TAISHLPGAGTYALDSYRIFCLSH 139

Query: 96  ------KWDRVRPTDHML 107
                 +W  V PTD  L
Sbjct: 140 QDPLCEEWTTVLPTDKEL 157


>gi|450005195|ref|ZP_21826553.1| endonuclease III [Streptococcus mutans NMT4863]
 gi|449188837|gb|EMB90527.1| endonuclease III [Streptococcus mutans NMT4863]
          Length = 207

 Score = 36.2 bits (82), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNCQVPQTRKELESLSGVGRKTAN 125


>gi|291528591|emb|CBK94177.1| Predicted EndoIII-related endonuclease [Eubacterium rectale M104/1]
          Length = 212

 Score = 36.2 bits (82), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK-- 61
           + S+RL       +   V+ DLF   P  +   + D  +IE+I+   GL K +A  I   
Sbjct: 34  LVSVRLAAQCTDARVNVVVVDLFKEYPSIEALADADVSDIEEIVRPCGLGKSKARDISAC 93

Query: 62  -RFSQEYLG----ESWTHVTQLHGVGKYAAD 87
            R  ++  G    ++ T + +L GVG+ +A+
Sbjct: 94  MRMLRDDFGGLVPDNMTDLLKLPGVGRKSAN 124


>gi|310825899|ref|YP_003958256.1| hypothetical protein [Eubacterium limosum KIST612]
 gi|308737633|gb|ADO35293.1| hypothetical protein ELI_0274 [Eubacterium limosum KIST612]
          Length = 213

 Score = 36.2 bits (82), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
           + S+RL       +   V+ +L+   PD     + + EEIE I+   GL K +A      
Sbjct: 36  LVSVRLAAQCTDARVNVVVKELYAKFPDVAALAQAEPEEIEAIVRPCGLGKSKARDISAC 95

Query: 60  IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
           +K    +Y G   + +  + +L GVG+ +A+
Sbjct: 96  MKILRDQYDGMVPDDFDALLKLPGVGRKSAN 126


>gi|386319282|ref|YP_006015445.1| endonuclease III [Staphylococcus pseudintermedius ED99]
 gi|323464453|gb|ADX76606.1| endonuclease III [Staphylococcus pseudintermedius ED99]
          Length = 224

 Score = 36.2 bits (82), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA+LL A        RV  DLF      +    VD EE+++ I ++GL + +A  I++ S
Sbjct: 35  IAVLLSAQCTDNTVNRVTKDLFQKYHTPEDYLAVDLEELQQDIRSIGLYRNKAKNIQKLS 94

Query: 65  QEYLGE-----SWTH--VTQLHGVGKYAAD 87
           Q  L +       TH  +  L GVG+  A+
Sbjct: 95  QSLLDQYDGIVPHTHEQLEGLAGVGRKTAN 124


>gi|307110180|gb|EFN58416.1| hypothetical protein CHLNCDRAFT_50183 [Chlorella variabilis]
          Length = 295

 Score = 36.2 bits (82), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 15/66 (22%), Positives = 29/66 (43%)

Query: 30  PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
           P      +   + + +++  LGLQ  R   +   + ++L   W   +     G + AD++
Sbjct: 128 PTPSAVIDASDDSMLEVLQPLGLQAVRLQAVWGLAHDFLAADWQSPSDFKHCGPFVADSW 187

Query: 90  AIFCTG 95
            IFC G
Sbjct: 188 RIFCRG 193


>gi|319892444|ref|YP_004149319.1| endonuclease III [Staphylococcus pseudintermedius HKU10-03]
 gi|317162140|gb|ADV05683.1| Endonuclease III [Staphylococcus pseudintermedius HKU10-03]
          Length = 224

 Score = 36.2 bits (82), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA+LL A        RV  DLF      +    VD EE+++ I ++GL + +A  I++ S
Sbjct: 35  IAVLLSAQCTDNTVNRVTKDLFQKYHTPEDYLAVDLEELQQDIRSIGLYRNKAKNIQKLS 94

Query: 65  QEYLGE-----SWTH--VTQLHGVGKYAAD 87
           Q  L +       TH  +  L GVG+  A+
Sbjct: 95  QSLLDQYDGIVPHTHEQLEGLAGVGRKTAN 124


>gi|259416408|ref|ZP_05740328.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B]
 gi|259347847|gb|EEW59624.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B]
          Length = 791

 Score = 36.2 bits (82), Expect = 2.7,   Method: Composition-based stats.
 Identities = 30/95 (31%), Positives = 47/95 (49%), Gaps = 10/95 (10%)

Query: 10  KEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLG 69
           + + IL +   V  ++       K+AT+ + E +EKIIS   L++++A    R    YLG
Sbjct: 539 ERLEILRETDPVQQEMIRFRETMKSATDAEREALEKIISKR-LEEQKAIEQTRELGSYLG 597

Query: 70  ESWTHVT-QLHGVGKYAADAFAIFCTGKWDRVRPT 103
           E+   V+  L   G  AADA        WDRV+ +
Sbjct: 598 ETMQDVSAALVSSGDAAADA--------WDRVKAS 624


>gi|238917625|ref|YP_002931142.1| DNA-(apurinic or apyrimidinic site) lyase [Eubacterium eligens ATCC
           27750]
 gi|238872985|gb|ACR72695.1| DNA-(apurinic or apyrimidinic site) lyase [Eubacterium eligens ATCC
           27750]
          Length = 213

 Score = 36.2 bits (82), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
           + S+RL       +   V+  L+   PD  +      EEIEKII   GL K +A  I   
Sbjct: 34  LVSVRLAAQCTDARVNVVVKGLYEKYPDIASLAAASPEEIEKIIRPCGLGKSKAKDICAC 93

Query: 61  -KRFSQEY---LGESWTHVTQLHGVGKYAAD 87
            +   ++Y   + +S   + +L GVG+ +A+
Sbjct: 94  MRMLHEQYNDKVPDSMEELLKLPGVGRKSAN 124


>gi|160893761|ref|ZP_02074545.1| hypothetical protein CLOL250_01315 [Clostridium sp. L2-50]
 gi|156864746|gb|EDO58177.1| endonuclease III [Clostridium sp. L2-50]
          Length = 214

 Score = 36.2 bits (82), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 37/76 (48%), Gaps = 7/76 (9%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI----KRFSQEYLGESWTHV- 75
           V  DLF   P  +   + D +E+EK I + G  K +A  I    K+   EY GE  + + 
Sbjct: 54  VTKDLFVKYPTLQAFADADIKELEKDIYSTGFYKNKAKNIIGCAKKLISEYGGEVPSDIE 113

Query: 76  --TQLHGVGKYAADAF 89
             T+L GVG+  A+  
Sbjct: 114 SLTKLDGVGRKTANVI 129


>gi|448664779|ref|ZP_21684417.1| HhH-GPD family protein [Haloarcula amylolytica JCM 13557]
 gi|445774366|gb|EMA25387.1| HhH-GPD family protein [Haloarcula amylolytica JCM 13557]
          Length = 214

 Score = 35.8 bits (81), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 25/98 (25%), Positives = 43/98 (43%), Gaps = 6/98 (6%)

Query: 4   IYSIRLKEIAILLKAGRVISDLFT----LCPDAKTATEVDAEEIEKIISTLGLQKKRAPM 59
           +Y + + E  +     R +++++       PD +       EEI  +I  LG    R+  
Sbjct: 40  LYEVFVAEFFLTQTPARNVAEVYPEFLERFPDLRAIDSATKEEIVAVIEPLGFYNMRSEA 99

Query: 60  IKRFSQEY--LGESWTHVTQLHGVGKYAADAFAIFCTG 95
           +K  + E   L E+   +T+L  VG Y A+A   F  G
Sbjct: 100 LKSIAAEMSELSEAVEELTELPRVGHYIANATLCFSRG 137


>gi|448626677|ref|ZP_21671456.1| HhH-GPD family protein [Haloarcula vallismortis ATCC 29715]
 gi|445760289|gb|EMA11553.1| HhH-GPD family protein [Haloarcula vallismortis ATCC 29715]
          Length = 264

 Score = 35.8 bits (81), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 22/84 (26%), Positives = 39/84 (46%), Gaps = 7/84 (8%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKR----FSQEYLGESWTHV 75
           ++ SD+    P  +   +   EEI +++ +LG    R   ++     F+QE+ GE    +
Sbjct: 93  KIYSDVIQQFPTPEALNQASEEEIREVVHSLGFVNHRQRTLEEVGRIFTQEFNGEIPNSI 152

Query: 76  TQLHG---VGKYAADAFAIFCTGK 96
             L     VG Y+A A  +F  G+
Sbjct: 153 EDLKKPWRVGDYSARATQLFARGE 176


>gi|387785664|ref|YP_006250760.1| putative endonuclease III [Streptococcus mutans LJ23]
 gi|449893396|ref|ZP_21788688.1| endonuclease III [Streptococcus mutans SF12]
 gi|450046095|ref|ZP_21838755.1| endonuclease III [Streptococcus mutans N34]
 gi|379132065|dbj|BAL68817.1| putative endonuclease III [Streptococcus mutans LJ23]
 gi|449199269|gb|EMC00343.1| endonuclease III [Streptococcus mutans N34]
 gi|449255827|gb|EMC53667.1| endonuclease III [Streptococcus mutans SF12]
          Length = 207

 Score = 35.8 bits (81), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|449910114|ref|ZP_21794551.1| endonuclease III [Streptococcus mutans OMZ175]
 gi|449260261|gb|EMC57765.1| endonuclease III [Streptococcus mutans OMZ175]
          Length = 207

 Score = 35.8 bits (81), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|41615185|ref|NP_963683.1| hypothetical protein NEQ398 [Nanoarchaeum equitans Kin4-M]
 gi|40068909|gb|AAR39244.1| NEQ398 [Nanoarchaeum equitans Kin4-M]
          Length = 212

 Score = 35.8 bits (81), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 11/101 (10%)

Query: 1   MAQIYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI 60
           +A + SIR +E   +    R   +L+    D K   +   EEIE +I  +GL K++A  I
Sbjct: 40  IATVLSIRTREEQTI----RASLNLYNKYKDYKNLAKAPIEEIEDLIKNVGLYKQKAKWI 95

Query: 61  KRFSQEY-----LGESWTHVTQLHGVGKYAADAFAIFCTGK 96
           K  +Q +       ES+  +  L GVG+   + +      K
Sbjct: 96  KTIAQRWDYNKKCDESF--IRNLPGVGRKVGNVYLNLVCNK 134


>gi|393228457|gb|EJD36103.1| hypothetical protein AURDEDRAFT_74606, partial [Auricularia
           delicata TFB-10046 SS5]
          Length = 201

 Score = 35.8 bits (81), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 33/132 (25%), Positives = 51/132 (38%), Gaps = 33/132 (25%)

Query: 21  VISDLFTLCPDAKTATEVDA--EEIEKIISTLGLQKKRAPMIKRFSQEYLG--------- 69
           +  DL    P A     V A  E++  ++S LGLQ  RA  +   S+ Y+          
Sbjct: 28  IFWDLMERWPTAHEMQHVAAPLEQLVTMLSGLGLQNIRAKRLIDLSRAYVACPPDPRVLY 87

Query: 70  ---------------------ESWTHVTQLHGVGKYAADAFAIFCT-GKWDRVRPTDHML 107
                                   T ++ L G G+YA D++ IFC   +W +  P D  L
Sbjct: 88  ASKSVSKLTEIGLASQFYDHPYPPTAISHLPGAGRYALDSYRIFCVDDEWRQCYPRDKEL 147

Query: 108 NYYWEFLVSTKG 119
             Y E+  + +G
Sbjct: 148 VRYLEWKWACEG 159


>gi|242223699|ref|XP_002477437.1| predicted protein [Postia placenta Mad-698-R]
 gi|220722987|gb|EED77364.1| predicted protein [Postia placenta Mad-698-R]
          Length = 834

 Score = 35.8 bits (81), Expect = 3.6,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 31/70 (44%), Gaps = 17/70 (24%)

Query: 50  LGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTG--KWDRVRPTDHML 107
           +G Q++R P              T  + L G G YA D++ IFC G  +W  V P D  L
Sbjct: 102 VGAQRQRYPP-------------TEASHLPGSGPYALDSYRIFCAGEDEWKAVMPRDKEL 148

Query: 108 NYY--WEFLV 115
             Y  W + V
Sbjct: 149 VRYLRWNWAV 158


>gi|449944366|ref|ZP_21806723.1| endonuclease III [Streptococcus mutans 11A1]
 gi|449148593|gb|EMB52450.1| endonuclease III [Streptococcus mutans 11A1]
          Length = 207

 Score = 35.8 bits (81), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|428781526|ref|YP_007173312.1| endonuclease III [Dactylococcopsis salina PCC 8305]
 gi|428695805|gb|AFZ51955.1| endonuclease III [Dactylococcopsis salina PCC 8305]
          Length = 211

 Score = 35.8 bits (81), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAP-------MIKRFSQEYLG 69
           +  +V  DLF   PDA+T  + + EEIE +I + G  + +A        MI    Q  + 
Sbjct: 49  RVNKVTPDLFARFPDAETMAKANREEIETLIRSTGFYRNKAKNIQGACEMIVSDFQGKVP 108

Query: 70  ESWTHVTQLHGVGKYAAD 87
           ++   + +L GV +  A+
Sbjct: 109 DTMEDLLKLPGVARKTAN 126


>gi|397650248|ref|YP_006490775.1| endonuclease III [Streptococcus mutans GS-5]
 gi|449874957|ref|ZP_21781920.1| endonuclease III [Streptococcus mutans S1B]
 gi|449902335|ref|ZP_21791522.1| endonuclease III [Streptococcus mutans M230]
 gi|449918471|ref|ZP_21797359.1| endonuclease III [Streptococcus mutans 1SM1]
 gi|449974219|ref|ZP_21815157.1| endonuclease III [Streptococcus mutans 11VS1]
 gi|449990607|ref|ZP_21821644.1| endonuclease III [Streptococcus mutans NVAB]
 gi|450074616|ref|ZP_21849400.1| endonuclease III [Streptococcus mutans M2A]
 gi|450098380|ref|ZP_21857976.1| endonuclease III [Streptococcus mutans SF1]
 gi|450104976|ref|ZP_21859580.1| endonuclease III [Streptococcus mutans SF14]
 gi|450114998|ref|ZP_21863657.1| endonuclease III [Streptococcus mutans ST1]
 gi|450171669|ref|ZP_21884123.1| endonuclease III [Streptococcus mutans SM4]
 gi|392603817|gb|AFM81981.1| endonuclease III [Streptococcus mutans GS-5]
 gi|449160315|gb|EMB63588.1| endonuclease III [Streptococcus mutans 1SM1]
 gi|449178595|gb|EMB80848.1| endonuclease III [Streptococcus mutans 11VS1]
 gi|449181545|gb|EMB83626.1| endonuclease III [Streptococcus mutans NVAB]
 gi|449208905|gb|EMC09461.1| endonuclease III [Streptococcus mutans M2A]
 gi|449221577|gb|EMC21345.1| endonuclease III [Streptococcus mutans SF1]
 gi|449225196|gb|EMC24807.1| endonuclease III [Streptococcus mutans SF14]
 gi|449228467|gb|EMC27832.1| endonuclease III [Streptococcus mutans ST1]
 gi|449243721|gb|EMC42130.1| endonuclease III [Streptococcus mutans SM4]
 gi|449254335|gb|EMC52244.1| endonuclease III [Streptococcus mutans S1B]
 gi|449262861|gb|EMC60300.1| endonuclease III [Streptococcus mutans M230]
          Length = 207

 Score = 35.8 bits (81), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|189499467|ref|YP_001958937.1| endonuclease III [Chlorobium phaeobacteroides BS1]
 gi|189494908|gb|ACE03456.1| endonuclease III [Chlorobium phaeobacteroides BS1]
          Length = 220

 Score = 35.8 bits (81), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 19/74 (25%), Positives = 40/74 (54%), Gaps = 7/74 (9%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI----KRFSQEYLGE---SWT 73
           + ++LF+ CPDA++ + ++ +E++ II ++     +A  I    +   + Y GE   +  
Sbjct: 60  ITAELFSRCPDAESMSRIELDELKTIIRSINYYNNKAKNILAASRMLVESYKGEVPSTRE 119

Query: 74  HVTQLHGVGKYAAD 87
            +  L GVG+  A+
Sbjct: 120 KLESLPGVGRKTAN 133


>gi|450087653|ref|ZP_21854396.1| endonuclease III [Streptococcus mutans NV1996]
 gi|449217561|gb|EMC17603.1| endonuclease III [Streptococcus mutans NV1996]
          Length = 207

 Score = 35.8 bits (81), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKDLESLSGVGRKTAN 125


>gi|450049677|ref|ZP_21839798.1| endonuclease III [Streptococcus mutans NFSM1]
 gi|449203428|gb|EMC04287.1| endonuclease III [Streptococcus mutans NFSM1]
          Length = 207

 Score = 35.8 bits (81), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|449999353|ref|ZP_21824493.1| endonuclease III [Streptococcus mutans N29]
 gi|449187089|gb|EMB88890.1| endonuclease III [Streptococcus mutans N29]
          Length = 207

 Score = 35.8 bits (81), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLPGVGRKTAN 125


>gi|450127488|ref|ZP_21868617.1| endonuclease III [Streptococcus mutans U2A]
 gi|449230689|gb|EMC29938.1| endonuclease III [Streptococcus mutans U2A]
          Length = 207

 Score = 35.4 bits (80), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      + Q       L GVG+  A+
Sbjct: 96  QQLVDHYNGQIPQTRKELESLSGVGRKTAN 125


>gi|290579993|ref|YP_003484385.1| endonuclease III [Streptococcus mutans NN2025]
 gi|449965328|ref|ZP_21811829.1| endonuclease III [Streptococcus mutans 15VF2]
 gi|449969031|ref|ZP_21812970.1| endonuclease III [Streptococcus mutans 2VS1]
 gi|449980966|ref|ZP_21817506.1| endonuclease III [Streptococcus mutans 5SM3]
 gi|450031273|ref|ZP_21833606.1| endonuclease III [Streptococcus mutans G123]
 gi|450035307|ref|ZP_21834948.1| endonuclease III [Streptococcus mutans M21]
 gi|450056223|ref|ZP_21841930.1| endonuclease III [Streptococcus mutans NLML4]
 gi|450078846|ref|ZP_21851158.1| endonuclease III [Streptococcus mutans N3209]
 gi|450094137|ref|ZP_21856911.1| endonuclease III [Streptococcus mutans W6]
 gi|450112047|ref|ZP_21863022.1| endonuclease III [Streptococcus mutans SM6]
 gi|450148335|ref|ZP_21875573.1| endonuclease III [Streptococcus mutans 14D]
 gi|450166037|ref|ZP_21882131.1| endonuclease III [Streptococcus mutans B]
 gi|450181072|ref|ZP_21887605.1| endonuclease III [Streptococcus mutans 24]
 gi|254996892|dbj|BAH87493.1| putative endonuclease III [Streptococcus mutans NN2025]
 gi|449171325|gb|EMB73990.1| endonuclease III [Streptococcus mutans 15VF2]
 gi|449174470|gb|EMB76956.1| endonuclease III [Streptococcus mutans 2VS1]
 gi|449176154|gb|EMB78513.1| endonuclease III [Streptococcus mutans 5SM3]
 gi|449191781|gb|EMB93241.1| endonuclease III [Streptococcus mutans G123]
 gi|449195672|gb|EMB96985.1| endonuclease III [Streptococcus mutans M21]
 gi|449207001|gb|EMC07685.1| endonuclease III [Streptococcus mutans NLML4]
 gi|449209707|gb|EMC10215.1| endonuclease III [Streptococcus mutans N3209]
 gi|449216619|gb|EMC16721.1| endonuclease III [Streptococcus mutans W6]
 gi|449222801|gb|EMC22516.1| endonuclease III [Streptococcus mutans SM6]
 gi|449236015|gb|EMC34951.1| endonuclease III [Streptococcus mutans 14D]
 gi|449240041|gb|EMC38736.1| endonuclease III [Streptococcus mutans B]
 gi|449247360|gb|EMC45643.1| endonuclease III [Streptococcus mutans 24]
          Length = 207

 Score = 35.4 bits (80), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|449886259|ref|ZP_21786083.1| endonuclease III [Streptococcus mutans SA41]
 gi|449899436|ref|ZP_21791015.1| endonuclease III [Streptococcus mutans R221]
 gi|449914623|ref|ZP_21795691.1| endonuclease III [Streptococcus mutans 15JP3]
 gi|449924022|ref|ZP_21799362.1| endonuclease III [Streptococcus mutans 4SM1]
 gi|449932132|ref|ZP_21802695.1| endonuclease III [Streptococcus mutans 3SN1]
 gi|449936391|ref|ZP_21803955.1| endonuclease III [Streptococcus mutans 2ST1]
 gi|449947418|ref|ZP_21807363.1| endonuclease III [Streptococcus mutans 11SSST2]
 gi|449995317|ref|ZP_21822937.1| endonuclease III [Streptococcus mutans A9]
 gi|450009910|ref|ZP_21828414.1| endonuclease III [Streptococcus mutans A19]
 gi|450024675|ref|ZP_21831355.1| endonuclease III [Streptococcus mutans U138]
 gi|450039060|ref|ZP_21836029.1| endonuclease III [Streptococcus mutans T4]
 gi|450062699|ref|ZP_21844476.1| endonuclease III [Streptococcus mutans NLML5]
 gi|450121163|ref|ZP_21866166.1| endonuclease III [Streptococcus mutans ST6]
 gi|450155949|ref|ZP_21878556.1| endonuclease III [Streptococcus mutans 21]
 gi|450175809|ref|ZP_21885408.1| endonuclease III [Streptococcus mutans SM1]
 gi|449157726|gb|EMB61161.1| endonuclease III [Streptococcus mutans 15JP3]
 gi|449161745|gb|EMB64919.1| endonuclease III [Streptococcus mutans 3SN1]
 gi|449163319|gb|EMB66427.1| endonuclease III [Streptococcus mutans 4SM1]
 gi|449165606|gb|EMB68596.1| endonuclease III [Streptococcus mutans 2ST1]
 gi|449168603|gb|EMB71411.1| endonuclease III [Streptococcus mutans 11SSST2]
 gi|449184612|gb|EMB86545.1| endonuclease III [Streptococcus mutans A9]
 gi|449190539|gb|EMB92101.1| endonuclease III [Streptococcus mutans A19]
 gi|449191724|gb|EMB93190.1| endonuclease III [Streptococcus mutans U138]
 gi|449200716|gb|EMC01737.1| endonuclease III [Streptococcus mutans T4]
 gi|449205431|gb|EMC06178.1| endonuclease III [Streptococcus mutans NLML5]
 gi|449229433|gb|EMC28747.1| endonuclease III [Streptococcus mutans ST6]
 gi|449236511|gb|EMC35426.1| endonuclease III [Streptococcus mutans 21]
 gi|449246187|gb|EMC44499.1| endonuclease III [Streptococcus mutans SM1]
 gi|449254438|gb|EMC52346.1| endonuclease III [Streptococcus mutans SA41]
 gi|449258367|gb|EMC55949.1| endonuclease III [Streptococcus mutans R221]
          Length = 207

 Score = 35.4 bits (80), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|450134132|ref|ZP_21870967.1| endonuclease III [Streptococcus mutans NLML8]
 gi|449149928|gb|EMB53709.1| endonuclease III [Streptococcus mutans NLML8]
          Length = 207

 Score = 35.4 bits (80), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLPGVGRKTAN 125


>gi|450159908|ref|ZP_21879709.1| endonuclease III [Streptococcus mutans 66-2A]
 gi|449240672|gb|EMC39335.1| endonuclease III [Streptococcus mutans 66-2A]
          Length = 207

 Score = 35.4 bits (80), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|150015469|ref|YP_001307723.1| endonuclease III [Clostridium beijerinckii NCIMB 8052]
 gi|149901934|gb|ABR32767.1| endonuclease III [Clostridium beijerinckii NCIMB 8052]
          Length = 210

 Score = 35.4 bits (80), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 7/78 (8%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAP----MIKRFSQEYLGE-- 70
           K   V  DLF   PD  +   +  EE+EK I  +GL + ++     M  +  +++ GE  
Sbjct: 45  KVNEVTKDLFKDYPDLDSLLTLTNEELEKRIKQIGLYRNKSKNLILMFNQLKEKFNGEVP 104

Query: 71  -SWTHVTQLHGVGKYAAD 87
            +   +T L G G+  A+
Sbjct: 105 KTMEEITSLAGAGRKTAN 122


>gi|400289761|ref|ZP_10791788.1| endonuclease III [Streptococcus ratti FA-1 = DSM 20564]
 gi|399920552|gb|EJN93369.1| endonuclease III [Streptococcus ratti FA-1 = DSM 20564]
          Length = 209

 Score = 35.4 bits (80), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 43/90 (47%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + + ++IE  IS +GL + +A  +K  S
Sbjct: 36  IAVVLSAQTTDAAVNKVTPALFAAYPQPKDLAKANLKDIESYISKIGLYRNKAKFLKECS 95

Query: 65  QE----YLGE---SWTHVTQLHGVGKYAAD 87
           Q+    Y GE   +   +  L GVG+  A+
Sbjct: 96  QQLMDRYGGEVPHTRKELESLSGVGRKTAN 125


>gi|55980081|ref|YP_143378.1| endonuclease III [Thermus thermophilus HB8]
 gi|55771494|dbj|BAD69935.1| endonuclease III [Thermus thermophilus HB8]
          Length = 220

 Score = 35.4 bits (80), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 32/65 (49%), Gaps = 7/65 (10%)

Query: 25  LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PMIKRFSQEYLGESWTH---VTQ 77
           LF   PDAK   E   EE+E  I  +GL + +A     + +R  +EY GE       + +
Sbjct: 66  LFARFPDAKALAEATPEEVEPYIRRIGLYRTKAKNLVALARRLVEEYGGEVPKEKEALMR 125

Query: 78  LHGVG 82
           L GVG
Sbjct: 126 LPGVG 130


>gi|309775704|ref|ZP_07670702.1| endonuclease III [Erysipelotrichaceae bacterium 3_1_53]
 gi|308916543|gb|EFP62285.1| endonuclease III [Erysipelotrichaceae bacterium 3_1_53]
          Length = 215

 Score = 35.4 bits (80), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 26/96 (27%), Positives = 41/96 (42%), Gaps = 14/96 (14%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +A++L A        +V   LF   P  +   E   ++IE  I  +GL + +A  I+  S
Sbjct: 31  VAVVLSAQTTDAAVNKVTPALFEAFPTPQAMAEAQLQDIEDKIRRIGLYRNKAHSIQNLS 90

Query: 65  QEYLG-------ESWTHVTQLHGVGKYAADAFAIFC 93
           +  L        ES   +T L GVG+  A+     C
Sbjct: 91  RSLLDSFDGVVPESMKDLTSLAGVGRKTANVVRSVC 126


>gi|46200194|ref|YP_005861.1| endonuclease III [Thermus thermophilus HB27]
 gi|46197822|gb|AAS82234.1| endonuclease III [Thermus thermophilus HB27]
          Length = 220

 Score = 35.4 bits (80), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 32/65 (49%), Gaps = 7/65 (10%)

Query: 25  LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PMIKRFSQEYLGESWTH---VTQ 77
           LF   PDAK   E   EE+E  I  +GL + +A     + +R  +EY GE       + +
Sbjct: 66  LFARFPDAKALAEATPEEVEPYIRRIGLYRTKAKNLVALARRLVEEYGGEVPKEKEALMR 125

Query: 78  LHGVG 82
           L GVG
Sbjct: 126 LPGVG 130


>gi|218247153|ref|YP_002372524.1| endonuclease III [Cyanothece sp. PCC 8801]
 gi|257060225|ref|YP_003138113.1| endonuclease III [Cyanothece sp. PCC 8802]
 gi|218167631|gb|ACK66368.1| endonuclease III [Cyanothece sp. PCC 8801]
 gi|256590391|gb|ACV01278.1| endonuclease III [Cyanothece sp. PCC 8802]
          Length = 220

 Score = 35.4 bits (80), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 17  KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI----KRFSQEYLGESW 72
           +  +V   LF+  PDA +  + D EE+E+II + G  + +A  I    ++  +E+ G+  
Sbjct: 54  RVNKVTPKLFSRFPDALSLAKADREELEEIIRSTGFYRNKAKNIQGACQKIVKEFEGQVP 113

Query: 73  THVTQ---LHGVGKYAAD 87
             + Q   L GV +  A+
Sbjct: 114 KQMEQLLSLPGVARKTAN 131


>gi|449956391|ref|ZP_21809487.1| endonuclease III [Streptococcus mutans 4VF1]
 gi|450138345|ref|ZP_21872114.1| endonuclease III [Streptococcus mutans NLML1]
 gi|450143105|ref|ZP_21873238.1| endonuclease III [Streptococcus mutans 1ID3]
 gi|449152716|gb|EMB56416.1| endonuclease III [Streptococcus mutans 1ID3]
 gi|449170673|gb|EMB73369.1| endonuclease III [Streptococcus mutans 4VF1]
 gi|449234128|gb|EMC33152.1| endonuclease III [Streptococcus mutans NLML1]
          Length = 207

 Score = 35.4 bits (80), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPVLFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      V Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125


>gi|333995081|ref|YP_004527694.1| endonuclease III [Treponema azotonutricium ZAS-9]
 gi|333735400|gb|AEF81349.1| endonuclease III [Treponema azotonutricium ZAS-9]
          Length = 229

 Score = 35.4 bits (80), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 36/82 (43%), Gaps = 7/82 (8%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL-------GESWT 73
           V  +LF+  PD     E    E+EKI+ + G  + +A  IK  S++ L        ++  
Sbjct: 71  VTRELFSRFPDPAALAEAPLPELEKIVHSTGFFRAKAHNIKALSRQLLEKHSGEVPDTME 130

Query: 74  HVTQLHGVGKYAADAFAIFCTG 95
            +T L G G+  A      C G
Sbjct: 131 GLTALPGAGRKTASVVLSTCFG 152


>gi|407795573|ref|ZP_11142531.1| endonuclease III [Salimicrobium sp. MJ3]
 gi|407019914|gb|EKE32628.1| endonuclease III [Salimicrobium sp. MJ3]
          Length = 216

 Score = 35.0 bits (79), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 14/99 (14%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +A++L A        +V  DLF      +    V+ EE+EK I ++GL + +A  +K+ +
Sbjct: 35  VAVILSAQATDNLVNKVTPDLFRKYKSPEDIAAVETEELEKDIKSIGLYRSKAKNLKKMA 94

Query: 65  QEYL-------GESWTHVTQLHGVGKYAADAFAIFCTGK 96
              L        ES   +  L GVG+  A+  A    G+
Sbjct: 95  TTLLERFDGEVPESKESLESLAGVGRKTANVVASVAFGE 133


>gi|119719909|ref|YP_920404.1| HhH-GPD family protein [Thermofilum pendens Hrk 5]
 gi|119525029|gb|ABL78401.1| HhH-GPD family protein [Thermofilum pendens Hrk 5]
          Length = 253

 Score = 35.0 bits (79), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 8/80 (10%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQE-------YLGESW 72
           +V  + F+  P  ++    D EE+ K  S LGL  +RA  ++  ++E        +  S 
Sbjct: 85  KVFEEFFSRYPSPESLANADPEELRKFFSRLGLV-RRADALREAAREIVERHGGSVPRSK 143

Query: 73  THVTQLHGVGKYAADAFAIF 92
             + +L GVG Y A A   F
Sbjct: 144 EELLKLKGVGPYIASAVLCF 163


>gi|449881938|ref|ZP_21784703.1| endonuclease III [Streptococcus mutans SA38]
 gi|449250747|gb|EMC48796.1| endonuclease III [Streptococcus mutans SA38]
          Length = 207

 Score = 35.0 bits (79), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95

Query: 65  QE----YLG---ESWTHVTQLHGVGKYAAD 87
           Q+    Y G   ++   +  L GVG+  A+
Sbjct: 96  QQLVEYYNGQVPQTRKELESLSGVGRKTAN 125


>gi|409387026|ref|ZP_11239323.1| Endonuclease III [Lactococcus raffinolactis 4877]
 gi|399205796|emb|CCK20238.1| Endonuclease III [Lactococcus raffinolactis 4877]
          Length = 214

 Score = 34.7 bits (78), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 19/99 (19%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +    LF   PD+++    D  ++E +I T+GL + +A  I R +
Sbjct: 35  IAVILSAQATDKGVNKATPALFAKYPDSQSLASADVSDVENLIKTIGLYRSKAKNIVRTA 94

Query: 65  Q----EYLGE---SWTHVTQLHGVGKYAA-----DAFAI 91
           +    E+ GE       +  L GVG+  A     DAF I
Sbjct: 95  EMIETEFGGEIPRDKDLLQTLPGVGRKTANVVLGDAFGI 133


>gi|325663082|ref|ZP_08151532.1| hypothetical protein HMPREF0490_02272 [Lachnospiraceae bacterium
           4_1_37FAA]
 gi|325470536|gb|EGC73766.1| hypothetical protein HMPREF0490_02272 [Lachnospiraceae bacterium
           4_1_37FAA]
          Length = 213

 Score = 34.7 bits (78), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 7/74 (9%)

Query: 21  VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PMIKRFSQEYLGE---SWT 73
           V+  L+   PD     + D  EIE+I+   GL K +A      +K   ++Y G+   ++ 
Sbjct: 54  VVEKLYEKFPDVDALADADVAEIEEIVRPCGLGKSKARDISACMKILKEQYGGQIPKTFD 113

Query: 74  HVTQLHGVGKYAAD 87
            + +L GVG+ +A+
Sbjct: 114 ELLKLPGVGRKSAN 127


>gi|408403428|ref|YP_006861411.1| endonuclease III [Candidatus Nitrososphaera gargensis Ga9.2]
 gi|408364024|gb|AFU57754.1| putative endonuclease III [Candidatus Nitrososphaera gargensis
           Ga9.2]
          Length = 223

 Score = 34.7 bits (78), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 7/84 (8%)

Query: 20  RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT--- 76
           RV++ LF      +     D +EI+KII ++G    +A  IK+ SQ  + +    V    
Sbjct: 54  RVVNKLFARFKTPQDLASADIDEIKKIIHSIGFYNIKAERIKQVSQMLISKFGGRVPSDI 113

Query: 77  ----QLHGVGKYAADAFAIFCTGK 96
               +L GVG+  A+   ++   K
Sbjct: 114 DSLLKLPGVGRKTANCVLVYAFDK 137


>gi|373494784|ref|ZP_09585383.1| endonuclease III [Eubacterium infirmum F0142]
 gi|371967828|gb|EHO85296.1| endonuclease III [Eubacterium infirmum F0142]
          Length = 207

 Score = 34.7 bits (78), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 20/76 (26%), Positives = 39/76 (51%), Gaps = 7/76 (9%)

Query: 19  GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAP----MIKRFSQEYLG---ES 71
            +V  +LF   PDA    + +  ++E+ I  +G+ + ++     M K    EY G   ++
Sbjct: 48  NKVTPELFKAYPDANALAKAEQSDVEEYIKKIGMYRTKSKNIINMAKSLVAEYDGLVPQN 107

Query: 72  WTHVTQLHGVGKYAAD 87
           +  +T+L GVG+  A+
Sbjct: 108 FDALTKLSGVGRKTAN 123


>gi|422855015|ref|ZP_16901673.1| endonuclease III [Streptococcus sanguinis SK1]
 gi|422875881|ref|ZP_16922351.1| endonuclease III [Streptococcus sanguinis SK1056]
 gi|327462992|gb|EGF09313.1| endonuclease III [Streptococcus sanguinis SK1]
 gi|332362317|gb|EGJ40117.1| endonuclease III [Streptococcus sanguinis SK1056]
          Length = 209

 Score = 34.7 bits (78), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           +A++L A        +   DLF   P  +        +I K IS LGL + +A  +K+ +
Sbjct: 36  VAVMLSAQTTDAAVNKATPDLFEAYPSPQDMAMAGEADIAKYISRLGLYRNKAKFLKKCA 95

Query: 65  QEYLG-------ESWTHVTQLHGVGKYAAD 87
           Q+ L        ++ T +  L GVG+  A+
Sbjct: 96  QQLLDDFDGQVPQTRTELESLAGVGRKTAN 125


>gi|418054692|ref|ZP_12692748.1| HhH-GPD family protein [Hyphomicrobium denitrificans 1NES1]
 gi|353212317|gb|EHB77717.1| HhH-GPD family protein [Hyphomicrobium denitrificans 1NES1]
          Length = 271

 Score = 34.3 bits (77), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 14/66 (21%)

Query: 47  ISTLGLQKKRAPMIKRFSQEY------------LGESWTH--VTQLHGVGKYAADAFAIF 92
           ++ LGL   +   +K  +QE             LG    H  +T LHGVG + AD + +F
Sbjct: 92  LARLGLSAAKIKTLKHIAQEIAEERLNLDALADLGADEAHRTLTSLHGVGPWTADIYLLF 151

Query: 93  CTGKWD 98
           C G  D
Sbjct: 152 CLGHGD 157


>gi|325291271|ref|YP_004267452.1| A/G-specific adenine glycosylase [Syntrophobotulus glycolicus DSM
           8271]
 gi|324966672|gb|ADY57451.1| A/G-specific adenine glycosylase [Syntrophobotulus glycolicus DSM
           8271]
          Length = 361

 Score = 34.3 bits (77), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 45/102 (44%), Gaps = 10/102 (9%)

Query: 5   YSIRLKEIAILLKAGRVISDLFTL----CPDAKTATEVDAEEIEKIISTLGLQKKRAPMI 60
           Y I L EI +      V+   +T      P  +   + D +++ K+   LG   +   + 
Sbjct: 47  YHIWLSEIMLQQTRVEVVKTYYTRFLEEIPTVEELAQTDEQKLLKLWEGLGYYSRARNLQ 106

Query: 61  K---RFSQEYLG---ESWTHVTQLHGVGKYAADAFAIFCTGK 96
           K   R   EY+G   E++  + +L GVG Y A A A  C G+
Sbjct: 107 KTARRIVDEYVGHFPETYEQLLKLPGVGPYTAGAIASICFGQ 148


>gi|24380028|ref|NP_721983.1| endonuclease III (DNA repair) [Streptococcus mutans UA159]
 gi|449865177|ref|ZP_21778838.1| endonuclease III [Streptococcus mutans U2B]
 gi|449872317|ref|ZP_21781456.1| endonuclease III [Streptococcus mutans 8ID3]
 gi|449984184|ref|ZP_21818870.1| endonuclease III [Streptococcus mutans NFSM2]
 gi|450081459|ref|ZP_21851742.1| endonuclease III [Streptococcus mutans N66]
 gi|24378018|gb|AAN59289.1|AE014995_6 putative endonuclease III (DNA repair) [Streptococcus mutans UA159]
 gi|449154570|gb|EMB58143.1| endonuclease III [Streptococcus mutans 8ID3]
 gi|449180435|gb|EMB82592.1| endonuclease III [Streptococcus mutans NFSM2]
 gi|449215264|gb|EMC15471.1| endonuclease III [Streptococcus mutans N66]
 gi|449264440|gb|EMC61782.1| endonuclease III [Streptococcus mutans U2B]
          Length = 207

 Score = 34.3 bits (77), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 14/90 (15%)

Query: 12  IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
           IA++L A        +V   LF   P  K   + D +++E  IS +GL + +A  +K  S
Sbjct: 36  IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKGCS 95

Query: 65  QEYLGESWTHVTQ-------LHGVGKYAAD 87
           Q+ +      + Q       L GVG+  A+
Sbjct: 96  QQLVEHYNGQIPQTRKELESLSGVGRKTAN 125


>gi|331092230|ref|ZP_08341058.1| hypothetical protein HMPREF9477_01701 [Lachnospiraceae bacterium
           2_1_46FAA]
 gi|330401662|gb|EGG81241.1| hypothetical protein HMPREF9477_01701 [Lachnospiraceae bacterium
           2_1_46FAA]
          Length = 145

 Score = 34.3 bits (77), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 15/46 (32%), Positives = 25/46 (54%)

Query: 37  EVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
           E++ +E EK+I  + L K   P+ K   Q +    W + T+L G+G
Sbjct: 83  EIEDQEFEKMIRVIALDKMNIPVWKLVEQTHNEAPWKYTTELFGMG 128


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.136    0.412 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,799,820,828
Number of Sequences: 23463169
Number of extensions: 64725951
Number of successful extensions: 160939
Number of sequences better than 100.0: 285
Number of HSP's better than 100.0 without gapping: 166
Number of HSP's successfully gapped in prelim test: 119
Number of HSP's that attempted gapping in prelim test: 160744
Number of HSP's gapped (non-prelim): 297
length of query: 121
length of database: 8,064,228,071
effective HSP length: 88
effective length of query: 33
effective length of database: 5,999,469,199
effective search space: 197982483567
effective search space used: 197982483567
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)