BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 033363
(121 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255546672|ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
gi|223546492|gb|EEF47991.1| conserved hypothetical protein [Ricinus communis]
Length = 608
Score = 174 bits (440), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 77/94 (81%), Positives = 82/94 (87%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
VISD FTLCPDAK ATE EEIEKII LGLQKKRA MI+R SQEYL + WTHVTQLHG
Sbjct: 508 VISDFFTLCPDAKAATEAKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHG 567
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
VGKYAADA+AIFCTGKWD+VRP DHMLNYYW+FL
Sbjct: 568 VGKYAADAYAIFCTGKWDQVRPKDHMLNYYWDFL 601
>gi|297829352|ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
lyrata]
gi|297328398|gb|EFH58817.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
lyrata]
Length = 435
Score = 171 bits (433), Expect = 5e-41, Method: Composition-based stats.
Identities = 75/94 (79%), Positives = 81/94 (86%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
VI DLF LCPDAKTATEV+ EIE +I LGLQKKRA MI+RFS EYL ESWTHVTQLHG
Sbjct: 336 VIEDLFALCPDAKTATEVEEREIESLIKPLGLQKKRARMIQRFSLEYLQESWTHVTQLHG 395
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKYAADA+AIFC G WDRV+P DHMLNYYWEFL
Sbjct: 396 IGKYAADAYAIFCNGNWDRVKPDDHMLNYYWEFL 429
>gi|356551311|ref|XP_003544020.1| PREDICTED: uncharacterized protein LOC100793802 [Glycine max]
Length = 1505
Score = 167 bits (423), Expect = 7e-40, Method: Composition-based stats.
Identities = 74/102 (72%), Positives = 83/102 (81%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
+V+S+ F LCPDAK+ T+V EEIEK I TLG Q KRA M++R S+EYL ESWTHVTQLH
Sbjct: 1404 KVVSNFFKLCPDAKSCTQVTREEIEKTIKTLGFQHKRAEMLQRLSEEYLDESWTHVTQLH 1463
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
GVGKYAADA+AIF TG WDRV PTDHMLNYYWEFL S K L
Sbjct: 1464 GVGKYAADAYAIFVTGMWDRVTPTDHMLNYYWEFLHSIKHEL 1505
>gi|296085641|emb|CBI29440.3| unnamed protein product [Vitis vinifera]
Length = 599
Score = 166 bits (421), Expect = 1e-39, Method: Composition-based stats.
Identities = 74/99 (74%), Positives = 85/99 (85%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHV 75
L+A RVISDLFTLCPDAKTAT+V E IEK+I TLGLQKKRA MI+RFS+EYL +SWTHV
Sbjct: 489 LQASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHV 548
Query: 76 TQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
TQLHG+GKYAADA+AIFC+G W V P DHML YW++L
Sbjct: 549 TQLHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYL 587
>gi|224118080|ref|XP_002317727.1| predicted protein [Populus trichocarpa]
gi|222858400|gb|EEE95947.1| predicted protein [Populus trichocarpa]
Length = 107
Score = 166 bits (421), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 74/106 (69%), Positives = 89/106 (83%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHV 75
++A RV++DLFTLCPDAK AT V EEIE+ I +LGLQK+RA M++R S++YL E WTHV
Sbjct: 1 MQAERVVADLFTLCPDAKAATGVATEEIERAIKSLGLQKRRAKMVQRLSEDYLEEDWTHV 60
Query: 76 TQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
TQL GVGKYAADA+AIFCTGKW++VRP DHMLN YWE+L STK L
Sbjct: 61 TQLPGVGKYAADAYAIFCTGKWEQVRPNDHMLNRYWEYLCSTKNAL 106
>gi|147837937|emb|CAN67143.1| hypothetical protein VITISV_044254 [Vitis vinifera]
Length = 635
Score = 166 bits (419), Expect = 2e-39, Method: Composition-based stats.
Identities = 76/103 (73%), Positives = 87/103 (84%), Gaps = 1/103 (0%)
Query: 12 IAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGES 71
+ ILL A RVISDLFTLCPDAKTAT+V E IEK+I TLGLQKKRA MI+RFS+EYL +S
Sbjct: 522 VGILL-ASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDS 580
Query: 72 WTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
WTHVTQLHG+GKYAADA+AIFC+G W V P DHML YW++L
Sbjct: 581 WTHVTQLHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYL 623
>gi|225469167|ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244192 [Vitis vinifera]
Length = 536
Score = 166 bits (419), Expect = 2e-39, Method: Composition-based stats.
Identities = 74/99 (74%), Positives = 85/99 (85%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHV 75
L+A RVISDLFTLCPDAKTAT+V E IEK+I TLGLQKKRA MI+RFS+EYL +SWTHV
Sbjct: 426 LQASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHV 485
Query: 76 TQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
TQLHG+GKYAADA+AIFC+G W V P DHML YW++L
Sbjct: 486 TQLHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYL 524
>gi|42572315|ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis thaliana]
gi|332641100|gb|AEE74621.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
Length = 445
Score = 165 bits (418), Expect = 3e-39, Method: Composition-based stats.
Identities = 73/94 (77%), Positives = 79/94 (84%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
VISDLF LC DAKTATEV EEIE +I LGLQKKR MI+R S EYL ESWTHVTQLHG
Sbjct: 346 VISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHG 405
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
VGKYAADA+AIFC G WDRV+P DHMLNYYW++L
Sbjct: 406 VGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 439
>gi|6648205|gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
Length = 419
Score = 165 bits (418), Expect = 3e-39, Method: Composition-based stats.
Identities = 73/94 (77%), Positives = 79/94 (84%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
VISDLF LC DAKTATEV EEIE +I LGLQKKR MI+R S EYL ESWTHVTQLHG
Sbjct: 320 VISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHG 379
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
VGKYAADA+AIFC G WDRV+P DHMLNYYW++L
Sbjct: 380 VGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 413
>gi|27754344|gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
Length = 407
Score = 165 bits (417), Expect = 4e-39, Method: Composition-based stats.
Identities = 73/94 (77%), Positives = 79/94 (84%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
VISDLF LC DAKTATEV EEIE +I LGLQKKR MI+R S EYL ESWTHVTQLHG
Sbjct: 308 VISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHG 367
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
VGKYAADA+AIFC G WDRV+P DHMLNYYW++L
Sbjct: 368 VGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 401
>gi|224129470|ref|XP_002320594.1| predicted protein [Populus trichocarpa]
gi|222861367|gb|EEE98909.1| predicted protein [Populus trichocarpa]
Length = 209
Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 61/94 (64%), Positives = 71/94 (75%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
+++D FTLCPDAKT T VD EI ++ +LG + RA IKR S+ YL E WTHVT L G
Sbjct: 108 ILNDFFTLCPDAKTTTNVDQNEIAQLTRSLGFKNTRAEKIKRLSEIYLQEDWTHVTFLPG 167
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
VGKYAADA+AIFCTG+WDRV P DHML YWEFL
Sbjct: 168 VGKYAADAYAIFCTGRWDRVVPEDHMLTRYWEFL 201
>gi|224116426|ref|XP_002331937.1| predicted protein [Populus trichocarpa]
gi|222875026|gb|EEF12157.1| predicted protein [Populus trichocarpa]
Length = 249
Score = 135 bits (340), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 61/94 (64%), Positives = 71/94 (75%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
+++D FTLCPDAKT T VD EI ++ +LG + RA IKR S+ YL E WTHVT L G
Sbjct: 148 ILNDFFTLCPDAKTTTNVDQNEIAQLTRSLGFKNTRAEKIKRLSEIYLQEDWTHVTFLPG 207
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
VGKYAADA+AIFCTG+WDRV P DHML YWEFL
Sbjct: 208 VGKYAADAYAIFCTGRWDRVVPEDHMLTRYWEFL 241
>gi|224114415|ref|XP_002332373.1| predicted protein [Populus trichocarpa]
gi|222832177|gb|EEE70654.1| predicted protein [Populus trichocarpa]
Length = 231
Score = 135 bits (340), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 61/94 (64%), Positives = 71/94 (75%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
+++D FTLCPDAKT T VD EI ++ +LG + RA IKR S+ YL E WTHVT L G
Sbjct: 130 ILNDFFTLCPDAKTTTNVDQNEIAQLTRSLGFKNTRAEKIKRLSEIYLQEDWTHVTFLPG 189
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
VGKYAADA+AIFCTG+WDRV P DHML YWEFL
Sbjct: 190 VGKYAADAYAIFCTGRWDRVVPEDHMLTRYWEFL 223
>gi|449449218|ref|XP_004142362.1| PREDICTED: uncharacterized protein LOC101211755 [Cucumis sativus]
Length = 488
Score = 134 bits (338), Expect = 6e-30, Method: Composition-based stats.
Identities = 61/102 (59%), Positives = 71/102 (69%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
+A VI LF+LCP+ K EV E+IE II LG +KR+ + R S+ YL ESW+HVT
Sbjct: 384 QAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVT 443
Query: 77 QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTK 118
QL GVGKY ADA AIFCTG W V P DHMLNYYW+FL S K
Sbjct: 444 QLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIK 485
>gi|168027579|ref|XP_001766307.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682521|gb|EDQ68939.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 622
Score = 128 bits (321), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 55/99 (55%), Positives = 75/99 (75%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
+VI D F +CP+A+T + E+I +I +LGLQ KRA ++RFSQEYLG +WT+VTQLH
Sbjct: 461 KVIWDFFEMCPNAETCVSTETEKIGTVIHSLGLQNKRAKTLQRFSQEYLGTTWTNVTQLH 520
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTK 118
GVG+YA DA+AIFC G+W V+P D +LN YW++L T+
Sbjct: 521 GVGEYARDAYAIFCEGRWKEVQPKDIVLNDYWQWLNETQ 559
>gi|384251053|gb|EIE24531.1| DNA glycosylase [Coccomyxa subellipsoidea C-169]
Length = 470
Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats.
Identities = 48/100 (48%), Positives = 63/100 (63%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
+VI LF L P + A D E + ++I LGL KRAPM+ RFS+EY+ + WT+ +LH
Sbjct: 215 KVIWKLFELMPTPEDAMHADVEAVRELIEPLGLAPKRAPMLIRFSKEYVEKQWTNPVELH 274
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKG 119
GVG+YAADA+ IFC G W V P D L Y ++L T G
Sbjct: 275 GVGQYAADAYNIFCRGDWRSVSPKDKDLLRYHQWLTETGG 314
>gi|115477908|ref|NP_001062549.1| Os09g0101100 [Oryza sativa Japonica Group]
gi|113630782|dbj|BAF24463.1| Os09g0101100 [Oryza sativa Japonica Group]
Length = 441
Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats.
Identities = 45/95 (47%), Positives = 64/95 (67%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
R + F PDA+TA D E++ K ++ LGLQ+ + I+RFS+ Y+ E WT++T+L
Sbjct: 323 RKVKGFFKRYPDAQTAFSADPEKMAKYLAPLGLQRVKVNRIQRFSKAYVEEEWTYITELC 382
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
GVGKYAADA+AIFC G+ V P DH L YW+++
Sbjct: 383 GVGKYAADAYAIFCAGRATEVVPADHKLVDYWKYV 417
>gi|222641036|gb|EEE69168.1| hypothetical protein OsJ_28331 [Oryza sativa Japonica Group]
Length = 452
Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats.
Identities = 45/95 (47%), Positives = 64/95 (67%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
R + F PDA+TA D E++ K ++ LGLQ+ + I+RFS+ Y+ E WT++T+L
Sbjct: 323 RKVKGFFKRYPDAQTAFSADPEKMAKYLAPLGLQRVKVNRIQRFSKAYVEEEWTYITELC 382
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
GVGKYAADA+AIFC G+ V P DH L YW+++
Sbjct: 383 GVGKYAADAYAIFCAGRATEVVPADHKLVDYWKYV 417
>gi|413941982|gb|AFW74631.1| hypothetical protein ZEAMMB73_619521 [Zea mays]
Length = 858
Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats.
Identities = 43/93 (46%), Positives = 64/93 (68%)
Query: 22 ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGV 81
+ F PDA+TA D E++ + ++ LGLQ+ + I++FS+ Y+GE WT++T+L GV
Sbjct: 734 VKGFFKRYPDAQTAYTADPEKMAEYLAPLGLQRVKTNRIQKFSKAYVGEEWTYITELCGV 793
Query: 82 GKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
GKYAADA+AIFC G+ + V P DH L YW ++
Sbjct: 794 GKYAADAYAIFCAGRANEVVPKDHKLVDYWNYV 826
>gi|302812133|ref|XP_002987754.1| hypothetical protein SELMODRAFT_6975 [Selaginella moellendorffii]
gi|302821198|ref|XP_002992263.1| hypothetical protein SELMODRAFT_4607 [Selaginella moellendorffii]
gi|300139913|gb|EFJ06644.1| hypothetical protein SELMODRAFT_4607 [Selaginella moellendorffii]
gi|300144373|gb|EFJ11057.1| hypothetical protein SELMODRAFT_6975 [Selaginella moellendorffii]
Length = 139
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 60/98 (61%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
+ +++ D F L P + + AE +E +I LGL KKR +IKRFS EYL WT ++
Sbjct: 36 QTKKMLWDFFKLFPGPQHLIDAPAEMVEPLIKPLGLHKKRLELIKRFSHEYLYTDWTDIS 95
Query: 77 QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
QLHGVG YA DA+ IF G+W V P D ML YW +L
Sbjct: 96 QLHGVGTYAEDAYMIFVEGRWKEVIPDDAMLEQYWNWL 133
>gi|218201632|gb|EEC84059.1| hypothetical protein OsI_30334 [Oryza sativa Indica Group]
Length = 455
Score = 99.4 bits (246), Expect = 3e-19, Method: Composition-based stats.
Identities = 44/95 (46%), Positives = 62/95 (65%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
R + F PDA+ A D E++ K ++ LGLQ + I+RFS+ Y+ E WT++T+L
Sbjct: 326 RKVKGFFKRYPDAQAAFSADPEKMAKYLAPLGLQHVKVNRIQRFSKAYVEEEWTYITELC 385
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
GVGKYAADA+AIFC G+ V P DH L YW+++
Sbjct: 386 GVGKYAADAYAIFCAGRAPEVVPADHKLVDYWKYV 420
>gi|242039843|ref|XP_002467316.1| hypothetical protein SORBIDRAFT_01g024773 [Sorghum bicolor]
gi|241921170|gb|EER94314.1| hypothetical protein SORBIDRAFT_01g024773 [Sorghum bicolor]
Length = 809
Score = 97.4 bits (241), Expect = 8e-19, Method: Composition-based stats.
Identities = 40/93 (43%), Positives = 63/93 (67%)
Query: 22 ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGV 81
++ F PD +TA D +++ + ++ LG Q+ + I++FS+ Y+GE WT++T+L GV
Sbjct: 691 VNGFFERYPDPQTAYRADPKKMAEYLAPLGFQRVKTKRIQKFSKAYVGEEWTYITELCGV 750
Query: 82 GKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
GKYAADA+AIFC G+ + V P DH L YW ++
Sbjct: 751 GKYAADAYAIFCAGRANEVVPKDHKLVDYWNYV 783
>gi|326533618|dbj|BAK05340.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1137
Score = 97.4 bits (241), Expect = 9e-19, Method: Composition-based stats.
Identities = 43/99 (43%), Positives = 62/99 (62%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHV 75
++ R++ F PD +A D +++ + ++ LGL R IK+ S++Y+G WTHV
Sbjct: 1029 IQVKRMLEGFFERYPDPLSAINADPDKMAEYLAPLGLNNVRTRNIKKLSKQYVGNEWTHV 1088
Query: 76 TQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
TQL GVGKYAADA+AIFC G+ V P DH L YW ++
Sbjct: 1089 TQLCGVGKYAADAYAIFCAGRAREVVPDDHKLVDYWNYV 1127
>gi|242039839|ref|XP_002467314.1| hypothetical protein SORBIDRAFT_01g024706 [Sorghum bicolor]
gi|241921168|gb|EER94312.1| hypothetical protein SORBIDRAFT_01g024706 [Sorghum bicolor]
Length = 346
Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 63/94 (67%)
Query: 22 ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGV 81
++ F PD +TA D +++ + ++ LG Q+ + I++FS+ Y+GE WT++T+L GV
Sbjct: 206 VNGFFERYPDPQTAYRADPKKMAEYLAPLGFQRVKTKRIQKFSKAYVGEEWTYITELCGV 265
Query: 82 GKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLV 115
GKYAADA+AIFC G+ + V P DH L YW ++
Sbjct: 266 GKYAADAYAIFCAGRANEVVPKDHKLVDYWNYVC 299
>gi|340375489|ref|XP_003386267.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Amphimedon
queenslandica]
Length = 251
Score = 94.4 bits (233), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 58/94 (61%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
++ F + AT + ++I +I LGL KRA MI RFS EYL W + +LHG
Sbjct: 145 ILWKFFDCYSTPEEATAANWKDIASMIEPLGLHIKRAKMIIRFSGEYLSNDWVYPIELHG 204
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY D++ IFCT +W VRP DHMLN Y+E+L
Sbjct: 205 IGKYGNDSYRIFCTEEWKEVRPNDHMLNKYYEWL 238
>gi|307111443|gb|EFN59677.1| hypothetical protein CHLNCDRAFT_14045, partial [Chlorella
variabilis]
Length = 133
Score = 93.6 bits (231), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 61/95 (64%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
+VI LF LCP A D ++++ +I LGL +KRA I++ SQ+YL + W T+L+
Sbjct: 39 KVIWQLFALCPTPAAAIAADVQQVQALIQPLGLFRKRAAAIQQLSQDYLYKQWRDPTELY 98
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
G+GKYAADA+ +FC G+W V P D L Y ++L
Sbjct: 99 GIGKYAADAYHMFCRGRWREVAPEDKDLRRYRDWL 133
>gi|291236887|ref|XP_002738372.1| PREDICTED: methyl-CpG binding domain protein 4-like [Saccoglossus
kowalevskii]
Length = 862
Score = 92.8 bits (229), Expect = 2e-17, Method: Composition-based stats.
Identities = 40/94 (42%), Positives = 60/94 (63%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
++ FT P A D +EI +I LGL KRA M+ RFS+EY+ ++W + +L+G
Sbjct: 760 ILWQFFTKWPSAVETINADWKEIANLIHPLGLYDKRAKMLIRFSEEYIHKNWKYPIELYG 819
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY D++ IFC +W +V+P DHMLN Y ++L
Sbjct: 820 IGKYGNDSYRIFCVNEWKQVKPEDHMLNKYHDWL 853
>gi|443723536|gb|ELU11913.1| hypothetical protein CAPTEDRAFT_102919, partial [Capitella teleta]
Length = 145
Score = 91.7 bits (226), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 57/90 (63%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ F L P A+ + EEI +++ LGL RA + RFS+EYL +WT+ +LHG
Sbjct: 42 VLWTFFDLWPSAEATKTANWEEISELLQPLGLHVSRAHTLIRFSEEYLNVNWTYPIELHG 101
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
+GKY D++ IFC G+W +V P+DH LN Y
Sbjct: 102 IGKYGNDSYRIFCLGEWKQVAPSDHKLNDY 131
>gi|357166912|ref|XP_003580913.1| PREDICTED: uncharacterized protein LOC100825070 [Brachypodium
distachyon]
Length = 606
Score = 89.7 bits (221), Expect = 2e-16, Method: Composition-based stats.
Identities = 38/95 (40%), Positives = 61/95 (64%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
+++ F PD +A D +++E ++ LGLQ + I++ S++Y+ E W +VT+L
Sbjct: 471 KIVDGFFERYPDPVSAFNADPKKMEAYLAPLGLQSVKTRNIQKLSKQYVEEDWKYVTELC 530
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
GVGKYAADA++IFC G+ V P DH L YW+++
Sbjct: 531 GVGKYAADAYSIFCAGRATEVVPEDHKLVDYWKYV 565
>gi|405960128|gb|EKC26075.1| Methyl-CpG-binding domain protein 4 [Crassostrea gigas]
Length = 826
Score = 89.0 bits (219), Expect = 3e-16, Method: Composition-based stats.
Identities = 37/85 (43%), Positives = 58/85 (68%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P+ TA + D + + ++++ LGL +KRA +I RFS E+L + W + +LHG+GKY D++
Sbjct: 732 PNPDTARKGDWQAMARLLAPLGLHEKRAKIIIRFSDEFLTKDWKYPIELHGIGKYGNDSY 791
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V PTDH LN Y ++L
Sbjct: 792 RIFCVNEWRQVTPTDHKLNDYHKWL 816
>gi|302843300|ref|XP_002953192.1| hypothetical protein VOLCADRAFT_35340 [Volvox carteri f.
nagariensis]
gi|300261579|gb|EFJ45791.1| hypothetical protein VOLCADRAFT_35340 [Volvox carteri f.
nagariensis]
Length = 133
Score = 88.2 bits (217), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 58/95 (61%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
+V+ L+ P + D +E I+ LGLQ +RA + RFS+E+L WT TQL+
Sbjct: 39 QVLGPLWRSYPTPEAMAAADPRVVEDILRPLGLQVRRAERLVRFSEEFLSRQWTCPTQLY 98
Query: 80 GVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
GVG+YAADA+ IFC G+W V+P D L Y ++L
Sbjct: 99 GVGRYAADAYLIFCKGRWREVQPADKDLRRYRDWL 133
>gi|113205660|ref|NP_001037916.1| methyl-CpG binding domain protein 4 [Xenopus (Silurana) tropicalis]
gi|89272695|emb|CAJ81414.1| methyl-CpG binding domain protein 4 [Xenopus (Silurana) tropicalis]
Length = 472
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 55/85 (64%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P+ + A D +E+ +++ LGL + RA I RFS EYL + W + +LHG+GKY D++
Sbjct: 378 PNPEVARAADWKEMAELLQPLGLYELRAKAIVRFSDEYLTKKWRYPIELHGIGKYGNDSY 437
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V+P DH LN+Y +L
Sbjct: 438 RIFCVNEWKQVQPQDHKLNFYHTWL 462
>gi|432866392|ref|XP_004070828.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Oryzias
latipes]
Length = 440
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 55/94 (58%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ F P A + D + + +++ LGL + RA + RFS EYL + W + +LHG
Sbjct: 338 VLWQFFERYPSADVTRDADWKPMSELMKPLGLYELRAKTLIRFSDEYLTKQWRYPIELHG 397
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY D++ IFC G+W V P DHMLN Y +L
Sbjct: 398 IGKYGNDSYRIFCVGEWKEVTPEDHMLNKYHAWL 431
>gi|149633502|ref|XP_001505332.1| PREDICTED: methyl-CpG-binding domain protein 4-like
[Ornithorhynchus anatinus]
Length = 108
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 57/101 (56%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + P A+ A D +E+ +++ LGL RA I +FS EYL + W + +LHG
Sbjct: 5 VLWEFLEKYPSAEVARTADWKEVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHG 64
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
+GKY D++ IFC +W +V P DH LN Y ++L L
Sbjct: 65 IGKYGNDSYRIFCINEWKQVHPEDHKLNKYHDWLWENHEKL 105
>gi|298706383|emb|CBJ29392.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 419
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/77 (49%), Positives = 56/77 (72%)
Query: 42 EIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTGKWDRVR 101
E+ ++++ LGLQ++R I RFSQEYL ++WT+V +L+ VGKYAADA IF KW V+
Sbjct: 308 EMTRVVAPLGLQERRPIAIIRFSQEYLSKAWTNVKELYWVGKYAADAHKIFIERKWREVQ 367
Query: 102 PTDHMLNYYWEFLVSTK 118
P DH LN++ E++ T+
Sbjct: 368 PDDHALNWWVEWMRGTQ 384
>gi|440899997|gb|ELR51228.1| Methyl-CpG-binding domain protein 4, partial [Bos grunniens mutus]
Length = 552
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 458 PSAEAARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 517
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 518 RIFCVNEWKQVHPEDHKLNKYHDWL 542
>gi|291393500|ref|XP_002713197.1| PREDICTED: methyl-CpG binding domain protein 4 [Oryctolagus
cuniculus]
Length = 561
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 467 PSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHGIGKYGNDSY 526
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 527 RIFCVNEWKQVHPEDHKLNKYHDWL 551
>gi|354490514|ref|XP_003507402.1| PREDICTED: methyl-CpG-binding domain protein 4 [Cricetulus griseus]
gi|344256524|gb|EGW12628.1| Methyl-CpG-binding domain protein 4 [Cricetulus griseus]
Length = 560
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 466 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 525
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 526 RIFCVNEWKQVHPEDHKLNKYHDWL 550
>gi|50401208|sp|Q9Z2D7.1|MBD4_MOUSE RecName: Full=Methyl-CpG-binding domain protein 4; AltName:
Full=Methyl-CpG-binding protein MBD4; AltName:
Full=Mismatch-specific DNA N-glycosylase
gi|5929746|gb|AAD56595.1|AF120996_1 methyl-CpG binding protein 4 [Mus musculus domesticus]
gi|3800807|gb|AAC68878.1| methyl-CpG binding protein MBD4 [Mus musculus]
Length = 554
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 460 PSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 519
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544
>gi|187960081|ref|NP_034904.2| methyl-CpG-binding domain protein 4 [Mus musculus]
gi|19353797|gb|AAH24812.1| Methyl-CpG binding domain protein 4 [Mus musculus]
Length = 554
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 460 PSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 519
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544
>gi|77735765|ref|NP_001029577.1| methyl-CpG-binding domain protein 4 [Bos taurus]
gi|73586836|gb|AAI03442.1| Methyl-CpG binding domain protein 4 [Bos taurus]
Length = 554
Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 51/85 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ + + LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 460 PSAEAARTADWRDVSEFLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHGIGKYGNDSY 519
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544
>gi|395516662|ref|XP_003762506.1| PREDICTED: methyl-CpG-binding domain protein 4 [Sarcophilus
harrisii]
Length = 359
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 56/94 (59%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + P A+ A D +E+ +++ LGL RA I +FS EYL + W + +LHG
Sbjct: 256 VLWEFLEKYPSAEVARAADWKEVSELLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHG 315
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY D++ IFC +W +V P DH LN Y ++L
Sbjct: 316 IGKYGNDSYRIFCVNEWKQVHPQDHKLNKYHDWL 349
>gi|148667124|gb|EDK99540.1| methyl-CpG binding domain protein 4 [Mus musculus]
Length = 554
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 460 PSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 519
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544
>gi|296474714|tpg|DAA16829.1| TPA: methyl-CpG binding domain protein 4 [Bos taurus]
Length = 554
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 51/85 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ + + LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 460 PSAEAARTADWRDVSEFLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHGIGKYGNDSY 519
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544
>gi|449274057|gb|EMC83362.1| Methyl-CpG-binding domain protein 4 [Columba livia]
Length = 188
Score = 82.8 bits (203), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 57/101 (56%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + F P + D +E+ +++ LGL RA I +FS EYL + W + +LHG
Sbjct: 85 VLWEFFRKYPSPEVTRAADWKEMSELLKPLGLYALRAKTIIKFSDEYLSKRWRYPIELHG 144
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
+GKY D++ IFC +W V+P DH LN Y +L + +L
Sbjct: 145 IGKYGNDSYRIFCVNEWKEVQPQDHKLNVYHTWLRENQESL 185
>gi|344276433|ref|XP_003410013.1| PREDICTED: methyl-CpG-binding domain protein 4 [Loxodonta africana]
Length = 566
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 472 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 531
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 532 RIFCIDEWKQVHPEDHKLNKYHDWL 556
>gi|334338449|ref|XP_001376329.2| PREDICTED: methyl-CpG-binding domain protein 4-like [Monodelphis
domestica]
Length = 486
Score = 82.8 bits (203), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 53/85 (62%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D +E+ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 392 PSAEVARTADWKEVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 451
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 452 RIFCVNEWKQVHPEDHKLNKYHDWL 476
>gi|417411695|gb|JAA52275.1| Putative methyl-cpg binding transcription regulator, partial
[Desmodus rotundus]
Length = 570
Score = 82.8 bits (203), Expect = 3e-14, Method: Composition-based stats.
Identities = 34/85 (40%), Positives = 53/85 (62%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ +GL + RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 476 PSAEAARTADWRDVSELLKPIGLYELRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 535
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 536 RIFCVNEWKQVHPEDHKLNKYHDWL 560
>gi|348502661|ref|XP_003438886.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Oreochromis
niloticus]
Length = 385
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 36/94 (38%), Positives = 56/94 (59%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ F P A+ + D + + +++ LGL + RA + RFS EYL + W + +LHG
Sbjct: 283 VLWQFFERYPSAEVTRQADWKPMSELMKPLGLYELRAKTLIRFSDEYLTKQWRYPIELHG 342
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY D++ IFC +W +V+P DH LN Y +L
Sbjct: 343 IGKYGNDSYRIFCINEWRQVKPDDHKLNKYHAWL 376
>gi|156398265|ref|XP_001638109.1| predicted protein [Nematostella vectensis]
gi|156398269|ref|XP_001638111.1| predicted protein [Nematostella vectensis]
gi|156225227|gb|EDO46046.1| predicted protein [Nematostella vectensis]
gi|156225229|gb|EDO46048.1| predicted protein [Nematostella vectensis]
Length = 143
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 57/94 (60%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
++ + F PDA A++ D I ++ LGL +KRA + +FS E+L W + L+G
Sbjct: 41 IMWEFFRRYPDAAEASKADPGPISDLLRPLGLHEKRAKALVQFSAEFLSRDWIYPDTLYG 100
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY +D++ IF G+W V+P DH LN Y ++L
Sbjct: 101 IGKYGSDSYRIFFLGEWKDVKPEDHKLNLYHDWL 134
>gi|115725067|ref|XP_783908.2| PREDICTED: uncharacterized protein LOC578658 [Strongylocentrotus
purpuratus]
Length = 550
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 50/77 (64%)
Query: 38 VDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTGKW 97
D + I +I LGL KRA M+ +FS E+L + WT+ +L G+GKY D++ IFC +W
Sbjct: 465 ADWQSIADLIQPLGLHTKRAKMLIQFSDEFLTKDWTYPIELSGIGKYGNDSYRIFCVNEW 524
Query: 98 DRVRPTDHMLNYYWEFL 114
V+P DHMLN Y ++L
Sbjct: 525 KEVKPQDHMLNKYHDWL 541
>gi|149728428|ref|XP_001490372.1| PREDICTED: methyl-CpG-binding domain protein 4 [Equus caballus]
Length = 592
Score = 82.4 bits (202), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I RFS EYL + W + +LHG+GKY D++
Sbjct: 498 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIRFSDEYLTKQWRYPIELHGIGKYGNDSY 557
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 558 RIFCINEWKQVHPEDHKLNKYHDWL 582
>gi|410951752|ref|XP_003982557.1| PREDICTED: methyl-CpG-binding domain protein 4 [Felis catus]
Length = 566
Score = 82.4 bits (202), Expect = 3e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 472 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 531
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 532 RIFCVNEWKQVHPEDHKLNKYHDWL 556
>gi|74190664|dbj|BAE28134.1| unnamed protein product [Mus musculus]
Length = 108
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 56/101 (55%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG
Sbjct: 5 VLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHG 64
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
+GKY D++ IFC +W +V P DH LN Y ++L L
Sbjct: 65 IGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLWENHEKL 105
>gi|45382709|ref|NP_990024.1| methyl-CpG binding domain protein 4 [Gallus gallus]
gi|7739720|gb|AAF68981.1|AF257107_1 5-methylcytosine G/T mismatch-specific DNA glycosylase [Gallus
gallus]
Length = 416
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 51/85 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P + A D +E+ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 322 PSPEVARTADWKEMSELLRPLGLYALRAKTIIKFSDEYLNKQWKYPIELHGIGKYGNDSY 381
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W V+P DH LN Y +L
Sbjct: 382 RIFCVNEWKEVQPQDHKLNIYHAWL 406
>gi|390475366|ref|XP_002758735.2| PREDICTED: methyl-CpG-binding domain protein 4 [Callithrix jacchus]
Length = 566
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 472 PSAEVARNADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 531
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 532 RIFCVNEWKQVHPEDHKLNKYHDWL 556
>gi|403268287|ref|XP_003926209.1| PREDICTED: methyl-CpG-binding domain protein 4 [Saimiri boliviensis
boliviensis]
Length = 564
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 470 PSAEVARNADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 529
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 530 RIFCVNEWKQVHPEDHKLNKYHDWL 554
>gi|345786313|ref|XP_003432809.1| PREDICTED: methyl-CpG-binding domain protein 4 [Canis lupus
familiaris]
Length = 570
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 476 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 535
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 536 RIFCINEWKQVHPEDHKLNKYHDWL 560
>gi|456754090|gb|JAA74218.1| methyl-CpG binding domain protein 4 [Sus scrofa]
Length = 567
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 473 PSAEVARTADWRDVSELLKPLGLYDHRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 532
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 533 RIFCVNEWKQVHPEDHKLNKYHDWL 557
>gi|410899959|ref|XP_003963464.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Takifugu
rubripes]
Length = 257
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 55/97 (56%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKY 84
F P A+ + + + +++ LGL + RA + RFS+EYL + W + +LHG+GKY
Sbjct: 159 FFERYPSAEVTRGAEWKPMSELMKPLGLYELRAKALIRFSEEYLTKQWRYPVELHGIGKY 218
Query: 85 AADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
D++ IFC G+W V P DHMLN Y +L L
Sbjct: 219 GNDSYRIFCLGEWREVTPEDHMLNKYHAWLWENHATL 255
>gi|355701349|gb|AES01654.1| methyl-CpG binding domain protein 4 [Mustela putorius furo]
Length = 473
Score = 82.0 bits (201), Expect = 5e-14, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 379 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 438
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 439 RIFCVNEWKQVHPEDHKLNKYHDWL 463
>gi|348554822|ref|XP_003463224.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Cavia
porcellus]
Length = 670
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 53/85 (62%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL + RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 576 PSAEVARAADWRDVSELLKPLGLYELRAKTIVKFSDEYLTKHWKYPIELHGIGKYGNDSY 635
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 636 RIFCVNEWKQVHPQDHKLNKYHDWL 660
>gi|29726680|pdb|1NGN|A Chain A, Mismatch Repair In Methylated Dna. Structure Of The
Mismatch-Specific Thymine Glycosylase Domain Of
Methyl-Cpg- Binding Protein Mbd4
Length = 155
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 55/94 (58%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG
Sbjct: 52 VLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHG 111
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY D++ IFC +W +V P DH LN Y ++L
Sbjct: 112 IGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWL 145
>gi|260781241|ref|XP_002585728.1| hypothetical protein BRAFLDRAFT_119982 [Branchiostoma floridae]
gi|229270764|gb|EEN41739.1| hypothetical protein BRAFLDRAFT_119982 [Branchiostoma floridae]
Length = 858
Score = 81.6 bits (200), Expect = 5e-14, Method: Composition-based stats.
Identities = 37/105 (35%), Positives = 61/105 (58%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
KA ++ + F P + + D + I +++ LGL +KRA + RFS E+L + W +
Sbjct: 752 KAIPLMWEFFRRWPSPEVTRDADWKPICELLQPLGLYEKRAKTLIRFSDEFLTKDWAYPD 811
Query: 77 QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
+L+G+GKY D++ IFC +W +V P+DH LN Y +L+ L
Sbjct: 812 ELYGIGKYGNDSYRIFCVNEWRQVMPSDHKLNDYHNWLLENHKEL 856
>gi|394986373|pdb|4EW4|A Chain A, Mouse Mbd4 Glycosylase Domain In Complex With Dna
Containing A Ribose Sugar
Length = 146
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 56/101 (55%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG
Sbjct: 43 VLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHG 102
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
+GKY D++ IFC +W +V P DH LN Y ++L L
Sbjct: 103 IGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLWENHEKL 143
>gi|67971738|dbj|BAE02211.1| unnamed protein product [Macaca fascicularis]
Length = 498
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 404 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 463
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 464 RIFCVNEWKQVHPEDHKLNKYHDWL 488
>gi|355786370|gb|EHH66553.1| hypothetical protein EGM_03568 [Macaca fascicularis]
Length = 578
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 484 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 543
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 544 RIFCVNEWKQVHPEDHKLNKYHDWL 568
>gi|431899948|gb|ELK07895.1| Methyl-CpG-binding domain protein 4 [Pteropus alecto]
Length = 664
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 53/85 (62%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL + RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 570 PSAEVARTADWRDVSELLKPLGLYELRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 629
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 630 RIFCINEWKQVHPEDHKLNKYHDWL 654
>gi|197098838|ref|NP_001127160.1| methyl-CpG-binding domain protein 4 [Pongo abelii]
gi|55725306|emb|CAH89518.1| hypothetical protein [Pongo abelii]
Length = 573
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 479 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 538
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 539 RIFCVNEWKQVHPEDHKLNKYHDWL 563
>gi|442796456|ref|NP_001263202.1| methyl-CpG-binding domain protein 4 isoform 5 [Homo sapiens]
gi|85813718|emb|CAJ55826.1| putative methyl-CpG binding protein [Homo sapiens]
Length = 262
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 168 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 227
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 228 RIFCVNEWKQVHPEDHKLNKYHDWL 252
>gi|397518616|ref|XP_003829480.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 2 [Pan
paniscus]
Length = 580
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 546 RIFCVNEWKQVHPEDHKLNKYHDWL 570
>gi|332817878|ref|XP_003310052.1| PREDICTED: methyl-CpG-binding domain protein 4 [Pan troglodytes]
gi|410334151|gb|JAA36022.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
Length = 574
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 480 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 539
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 540 RIFCVNEWKQVHPEDHKLNKYHDWL 564
>gi|114589101|ref|XP_001144095.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 7 [Pan
troglodytes]
gi|410211778|gb|JAA03108.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
gi|410262686|gb|JAA19309.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
gi|410298898|gb|JAA28049.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
gi|410334153|gb|JAA36023.1| methyl-CpG binding domain protein 4 [Pan troglodytes]
Length = 580
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 546 RIFCVNEWKQVHPEDHKLNKYHDWL 570
>gi|397518614|ref|XP_003829479.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 1 [Pan
paniscus]
Length = 574
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 480 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 539
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 540 RIFCVNEWKQVHPEDHKLNKYHDWL 564
>gi|332231793|ref|XP_003265079.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 1 [Nomascus
leucogenys]
Length = 572
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 478 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 537
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 538 RIFCVNEWKQVHPEDHKLNKYHDWL 562
>gi|332231795|ref|XP_003265080.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 2 [Nomascus
leucogenys]
Length = 578
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 484 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 543
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 544 RIFCVNEWKQVHPEDHKLNKYHDWL 568
>gi|4505121|ref|NP_003916.1| methyl-CpG-binding domain protein 4 isoform 1 [Homo sapiens]
gi|50400819|sp|O95243.1|MBD4_HUMAN RecName: Full=Methyl-CpG-binding domain protein 4; AltName:
Full=Methyl-CpG-binding endonuclease 1; AltName:
Full=Methyl-CpG-binding protein MBD4; AltName:
Full=Mismatch-specific DNA N-glycosylase
gi|4539759|gb|AAD22195.1|AF114784_1 methyl-CpG binding endonuclease [Homo sapiens]
gi|19852107|gb|AAM00008.1|AF494057_1 methyl-CpG binding domain protein 4 [Homo sapiens]
gi|3800809|gb|AAC68879.1| methyl-CpG binding protein MBD4 [Homo sapiens]
gi|5739194|gb|AAD50374.1| methyl-CpG binding protein 4 [Homo sapiens]
gi|119599659|gb|EAW79253.1| methyl-CpG binding domain protein 4, isoform CRA_c [Homo sapiens]
gi|119599660|gb|EAW79254.1| methyl-CpG binding domain protein 4, isoform CRA_c [Homo sapiens]
Length = 580
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 546 RIFCVNEWKQVHPEDHKLNKYHDWL 570
>gi|442796450|ref|NP_001263199.1| methyl-CpG-binding domain protein 4 isoform 2 [Homo sapiens]
gi|15079922|gb|AAH11752.1| MBD4 protein [Homo sapiens]
gi|47496557|emb|CAG29301.1| MBD4 [Homo sapiens]
gi|119599657|gb|EAW79251.1| methyl-CpG binding domain protein 4, isoform CRA_a [Homo sapiens]
gi|119599661|gb|EAW79255.1| methyl-CpG binding domain protein 4, isoform CRA_a [Homo sapiens]
gi|325463337|gb|ADZ15439.1| methyl-CpG binding domain protein 4 [synthetic construct]
Length = 574
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 480 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 539
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 540 RIFCVNEWKQVHPEDHKLNKYHDWL 564
>gi|426342067|ref|XP_004036337.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 2 [Gorilla
gorilla gorilla]
Length = 580
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 546 RIFCVNEWKQVHPEDHKLNKYHDWL 570
>gi|426342065|ref|XP_004036336.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 1 [Gorilla
gorilla gorilla]
Length = 574
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 480 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 539
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 540 RIFCVNEWKQVHPEDHKLNKYHDWL 564
>gi|402887074|ref|XP_003906930.1| PREDICTED: methyl-CpG-binding domain protein 4 [Papio anubis]
Length = 261
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 167 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 226
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 227 RIFCVNEWKQVHPEDHKLNKYHDWL 251
>gi|355564527|gb|EHH21027.1| hypothetical protein EGK_04002 [Macaca mulatta]
Length = 578
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 484 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 543
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 544 RIFCVNEWKQVHPEDHKLNKYHDWL 568
>gi|343962517|dbj|BAK62846.1| methyl-CpG-binding domain protein 4 [Pan troglodytes]
Length = 573
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 479 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 538
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 539 RIFCVNEWKQVHPEDHKLNKYHDWL 563
>gi|380817674|gb|AFE80711.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
gi|380817676|gb|AFE80712.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
gi|383422567|gb|AFH34497.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
gi|383422569|gb|AFH34498.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
gi|384950154|gb|AFI38682.1| methyl-CpG-binding domain protein 4 [Macaca mulatta]
Length = 572
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 478 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 537
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 538 RIFCVNEWKQVHPEDHKLNKYHDWL 562
>gi|395847220|ref|XP_003796279.1| PREDICTED: methyl-CpG-binding domain protein 4 [Otolemur garnettii]
Length = 574
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D +I +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 480 PSAEIARTADWRDISELLKPLGLYDLRAKTIIKFSDEYLTKQWKYPIELHGIGKYGNDSY 539
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 540 RIFCINEWKQVHPEDHKLNKYHDWL 564
>gi|400260847|pdb|4E9E|A Chain A, Structure Of The Glycosylase Domain Of Mbd4
gi|400260848|pdb|4E9F|A Chain A, Stucture Of The Glycosylase Domain Of Mbd4 Bound To Ap
Site Containing Dna
Length = 161
Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 61 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 120
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 121 RIFCVNEWKQVHPEDHKLNKYHDWL 145
>gi|385252064|pdb|4DK9|A Chain A, Crystal Structure Of Mbd4 Catalytic Domain Bound To Abasic
Dna
Length = 157
Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 63 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 122
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 123 RIFCVNEWKQVHPEDHKLNKYHDWL 147
>gi|256599915|pdb|3IHO|A Chain A, The C-Terminal Glycosylase Domain Of Human Mbd4
Length = 138
Score = 81.3 bits (199), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 50 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 109
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 110 RIFCVNEWKQVHPEDHKLNKYHDWL 134
>gi|449474335|ref|XP_002187591.2| PREDICTED: uncharacterized protein LOC100231901 [Taeniopygia
guttata]
Length = 420
Score = 81.3 bits (199), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 53/85 (62%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P + A D +E+ +++ LGL + RA I RFS EYL ++W + +LHG+GKY D++
Sbjct: 326 PSPEVARAADWKEMSELLKPLGLYELRAKTIIRFSGEYLSKAWRYPIELHGIGKYGNDSY 385
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W V+P DH LN Y +L
Sbjct: 386 RIFCVNEWKEVQPQDHKLNVYHTWL 410
>gi|149049684|gb|EDM02138.1| rCG29635 [Rattus norvegicus]
Length = 566
Score = 80.9 bits (198), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 472 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKPWRYPIELHGIGKYGNDSY 531
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 532 RIFCVNEWKQVHPEDHKLNKYHDWL 556
>gi|293346999|ref|XP_002726494.1| PREDICTED: methyl-CpG-binding domain protein 4 [Rattus norvegicus]
gi|392347546|ref|XP_001059437.3| PREDICTED: methyl-CpG-binding domain protein 4 [Rattus norvegicus]
Length = 566
Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 472 PSAEVARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKPWRYPIELHGIGKYGNDSY 531
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 532 RIFCVNEWKQVHPEDHKLNKYHDWL 556
>gi|426249701|ref|XP_004018588.1| PREDICTED: methyl-CpG-binding domain protein 4 [Ovis aries]
Length = 554
Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 52/85 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 460 PSAEAARTADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 519
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 520 RIFCVNEWKQVHPEDHKLNKYHDWL 544
>gi|351711383|gb|EHB14302.1| Methyl-CpG-binding domain protein 4 [Heterocephalus glaber]
Length = 525
Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 53/85 (62%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A + ++ +++ LGL + RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 431 PSAEAARAANWRDVSELLKPLGLYELRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 490
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P DH LN Y ++L
Sbjct: 491 RIFCVNEWKQVHPEDHKLNKYHDWL 515
>gi|326928198|ref|XP_003210268.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Meleagris
gallopavo]
Length = 193
Score = 80.9 bits (198), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 52/90 (57%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + P + A D +E+ +++ LGL RA I +FS EYL + W + +LHG
Sbjct: 90 VLWEFLRKYPSPEIARTADWKEMSELLRPLGLYALRAKTIIKFSDEYLNKQWKYPIELHG 149
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
+GKY D++ IFC +W V+P DH LN Y
Sbjct: 150 IGKYGNDSYRIFCVNEWKEVQPQDHKLNIY 179
>gi|394986367|pdb|4EVV|A Chain A, Mouse Mbd4 Glycosylase Domain In Complex With A G:t
Mismatch
gi|394986370|pdb|4EW0|A Chain A, Mouse Mbd4 Glycosylase Domain In Complex With A G:5hmu (5-
Hydroxymethyluracil) Mismatch
Length = 146
Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 56/101 (55%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG
Sbjct: 43 VLWEFLEKYPSAEVARAADWRDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHG 102
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
+GKY D++ IFC +W +V P +H LN Y ++L L
Sbjct: 103 IGKYGNDSYRIFCVNEWKQVHPENHKLNKYHDWLWENHEKL 143
>gi|357497507|ref|XP_003619042.1| Methyl-CpG-binding domain protein [Medicago truncatula]
gi|355494057|gb|AES75260.1| Methyl-CpG-binding domain protein [Medicago truncatula]
Length = 96
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/64 (53%), Positives = 48/64 (75%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLH 79
+V+ + F LCPD +T +V EEIE++I TLGLQ KR+ ++ S+EYL E+WT+VT+LH
Sbjct: 5 QVLDNFFKLCPDVETCMQVTMEEIEEVIRTLGLQVKRSGSLQHLSREYLTETWTYVTELH 64
Query: 80 GVGK 83
GVG
Sbjct: 65 GVGN 68
>gi|47223201|emb|CAG11336.1| unnamed protein product [Tetraodon nigroviridis]
Length = 294
Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 55/94 (58%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ F P A+ + + + +++ LGL + RA + RFS+EYL + W + +LHG
Sbjct: 192 VLWQFFERYPSAEATRGAEWKPMSELMKPLGLNELRAKALIRFSEEYLTKQWRYPIELHG 251
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY D++ IFC +W V P DHMLN Y +L
Sbjct: 252 IGKYGNDSYRIFCLEEWREVTPEDHMLNKYHAWL 285
>gi|242008567|ref|XP_002425074.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212508739|gb|EEB12336.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 168
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 48/72 (66%)
Query: 50 LGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNY 109
LGLQKKRA M+K+FS+EYL ++W + +LHG+GKY D++ IFC +W V+P D L
Sbjct: 93 LGLQKKRAFMLKKFSKEYLYKAWKYPIELHGIGKYGNDSYRIFCVNEWKHVQPNDTKLTL 152
Query: 110 YWEFLVSTKGNL 121
Y +L +L
Sbjct: 153 YHNWLQENHNSL 164
>gi|400260851|pdb|4E9G|A Chain A, Structure Of The Glycosylase Domain Of Mbd4 Bound To
Thymine Containing Dna
gi|400260854|pdb|4E9H|A Chain A, Structure Of Glycosylase Domain Of Mbd4 Bound To 5hmu
Containing Dna
Length = 161
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 51/85 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 61 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 120
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P H LN Y ++L
Sbjct: 121 RIFCVNEWKQVHPEAHKLNKYHDWL 145
>gi|449529453|ref|XP_004171714.1| PREDICTED: uncharacterized LOC101211755 [Cucumis sativus]
Length = 619
Score = 78.2 bits (191), Expect = 6e-13, Method: Composition-based stats.
Identities = 36/68 (52%), Positives = 46/68 (67%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
+A VI LF+LCP+ K EV E+IE II LG +KR+ + R S+ YL ESW+HVT
Sbjct: 384 QAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVT 443
Query: 77 QLHGVGKY 84
QL GVGK+
Sbjct: 444 QLPGVGKF 451
>gi|321459033|gb|EFX70091.1| hypothetical protein DAPPUDRAFT_35863 [Daphnia pulex]
Length = 144
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 54/86 (62%)
Query: 32 AKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAI 91
A+ A + D +EI +++ LGL + RA I R S++YL W + L+G+ KY +D++ I
Sbjct: 52 AEVARQADWKEIADLMNPLGLHELRAKRIVRMSEDYLKGDWVKPSDLYGISKYGSDSYRI 111
Query: 92 FCTGKWDRVRPTDHMLNYYWEFLVST 117
FC G+W VRPTD ML Y ++L +
Sbjct: 112 FCLGEWKEVRPTDIMLKIYQDWLFTN 137
>gi|327280386|ref|XP_003224933.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Anolis
carolinensis]
Length = 530
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 54/94 (57%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
V+ + F P A D + + +++ LGL + RA I RFS E+L + W + +LHG
Sbjct: 424 VLWEFFKKYPSPSVARTADWKGMSELLKPLGLYELRAKTIIRFSDEFLVKRWKYPIELHG 483
Query: 81 VGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
+GKY D++ IFC +W V+P DH LN Y +L
Sbjct: 484 IGKYGNDSYRIFCVNEWKEVQPKDHKLNIYHAWL 517
>gi|400260857|pdb|4EA4|A Chain A, Structure Of The Glycosylase Domain Of Mbd4 Bound To
5hmu-Containing Dna
Length = 155
Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 51/85 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 61 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 120
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P H LN Y ++L
Sbjct: 121 RIFCVNEWKQVHPEAHKLNKYHDWL 145
>gi|400260860|pdb|4EA5|A Chain A, Structure Of The Glycoslyase Domain Of Mbd4 Bound To A
5hmu Containing Dna
Length = 160
Score = 78.2 bits (191), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 51/85 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 61 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 120
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFL 114
IFC +W +V P H LN Y ++L
Sbjct: 121 RIFCVNEWKQVHPEAHKLNKYHDWL 145
>gi|221104829|ref|XP_002167512.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Hydra
magnipapillata]
Length = 160
Score = 77.0 bits (188), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 50/81 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P E + EEI ++ LGL +RA I +FS E+L + W + +L+G+GKY D++
Sbjct: 67 PTPNDVNENNFEEISALMKPLGLNYRRAKNIIQFSFEFLHKDWKYPRELYGIGKYGDDSY 126
Query: 90 AIFCTGKWDRVRPTDHMLNYY 110
+FC G++D V PTD+ LN Y
Sbjct: 127 RMFCLGQFDNVHPTDNKLNLY 147
>gi|241631815|ref|XP_002410284.1| conserved hypothetical protein [Ixodes scapularis]
gi|215503366|gb|EEC12860.1| conserved hypothetical protein [Ixodes scapularis]
Length = 686
Score = 74.7 bits (182), Expect = 6e-12, Method: Composition-based stats.
Identities = 30/79 (37%), Positives = 50/79 (63%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
+I L P A++ + ++I +++ LGL KRA +I+RFS+EYL W + ++LHG
Sbjct: 575 LIWKFLGLYPTAESVLQAPIQDIAELLQPLGLHNKRAAIIRRFSEEYLTRDWRYPSELHG 634
Query: 81 VGKYAADAFAIFCTGKWDR 99
+GKY D++ IFC +W +
Sbjct: 635 IGKYGNDSYRIFCVKEWAK 653
>gi|297263977|ref|XP_001116090.2| PREDICTED: methyl-CpG-binding domain protein 4-like, partial
[Macaca mulatta]
Length = 544
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 46/74 (62%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 458 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 517
Query: 90 AIFCTGKWDRVRPT 103
IFC +W +VRPT
Sbjct: 518 RIFCVNEWKQVRPT 531
>gi|328712829|ref|XP_003244914.1| PREDICTED: methyl-CpG-binding domain protein 4-like [Acyrthosiphon
pisum]
Length = 206
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 56/92 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P + T+ D +++ + LGL+ R IKRF+ ++L ++W + +L+G+GKY D++
Sbjct: 113 PTPEDVTKADPQQLLSSVENLGLENTRVKTIKRFTADFLLKNWKYPIELYGIGKYGNDSY 172
Query: 90 AIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 121
+FC +W +V+P D +L+ Y ++L + L
Sbjct: 173 RLFCVNEWRKVQPDDILLSKYKDWLTLNEKQL 204
>gi|196003048|ref|XP_002111391.1| hypothetical protein TRIADDRAFT_5883 [Trichoplax adhaerens]
gi|190585290|gb|EDV25358.1| hypothetical protein TRIADDRAFT_5883, partial [Trichoplax
adhaerens]
Length = 146
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 56/100 (56%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
KA ++ F P + A+ + +E+ + S L +Q RA +I +FS EY+ + W +
Sbjct: 44 KAIPLLWKFFGHYPTPEIASNANWKEMAENFSALDMQHDRAKIIIKFSDEYIKKDWCYPC 103
Query: 77 QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVS 116
+L G+GKY D++ IFC +W +V P D+ L Y ++L
Sbjct: 104 ELFGIGKYGNDSYRIFCVNEWKKVFPRDYNLVRYHKWLCQ 143
>gi|194381152|dbj|BAG64144.1| unnamed protein product [Homo sapiens]
Length = 572
Score = 68.2 bits (165), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 45/74 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545
Query: 90 AIFCTGKWDRVRPT 103
IFC +W +VR T
Sbjct: 546 RIFCVNEWKQVRLT 559
>gi|442796452|ref|NP_001263200.1| methyl-CpG-binding domain protein 4 isoform 3 [Homo sapiens]
Length = 572
Score = 68.2 bits (165), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 45/74 (60%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIGKYGNDSY 545
Query: 90 AIFCTGKWDRVRPT 103
IFC +W +VR T
Sbjct: 546 RIFCVNEWKQVRLT 559
>gi|390350844|ref|XP_001202024.2| PREDICTED: methyl-CpG-binding domain protein 4-like
[Strongylocentrotus purpuratus]
Length = 188
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 45/84 (53%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT 76
KA V+ F + D + I +I LGL KRA M+ +FS E+L + WT+
Sbjct: 105 KAIPVLWQFFQTWDTPEKTRSADWQSIADLIQPLGLHTKRAKMLIQFSDEFLTKDWTYPI 164
Query: 77 QLHGVGKYAADAFAIFCTGKWDRV 100
+L G+GKY D++ IFC +W V
Sbjct: 165 ELSGIGKYGNDSYRIFCVNEWKEV 188
>gi|432110910|gb|ELK34384.1| Methyl-CpG-binding domain protein 4 [Myotis davidii]
Length = 1112
Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 42/68 (61%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P A+ A D ++ +++ +GL + RA I +FS EYL + W + +LHG+GKY D++
Sbjct: 489 PSAEVARAADWRDVSELLKPIGLYELRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSY 548
Query: 90 AIFCTGKW 97
IFC +W
Sbjct: 549 RIFCINEW 556
>gi|301779644|ref|XP_002925241.1| PREDICTED: LOW QUALITY PROTEIN: methyl-CpG-binding domain protein
4-like [Ailuropoda melanoleuca]
Length = 642
Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 1/86 (1%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P AK A D ++ + + LGL A +FS EYL + W + ++ G+G+Y D+
Sbjct: 528 PSAKVARTPDRRDVSEPLRPLGLYDLGAKTTVKFSDEYLTKQWRYPIEVXGMGRYGNDSX 587
Query: 90 AI-FCTGKWDRVRPTDHMLNYYWEFL 114
+ FC +W +V P DH LN + ++L
Sbjct: 588 RVFFCINEWKQVHPEDHKLNKHQDWL 613
>gi|397572920|gb|EJK48465.1| hypothetical protein THAOC_32731, partial [Thalassiosira oceanica]
Length = 284
Score = 60.5 bits (145), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWT--HVTQ 77
RVISD F P D ++ ++ LGL ++R +KRF+ ++L WT V++
Sbjct: 148 RVISDFFEAYPSPTAVINGDTNDMATMLHPLGLNRERT--MKRFAIDFLSPEWTDGRVSR 205
Query: 78 LHGVGKYAADAFAIFCTGKWDRV 100
LHG G +A +++ +FC G++ +V
Sbjct: 206 LHGCGNFAQNSWDVFCHGRYQQV 228
>gi|303284263|ref|XP_003061422.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456752|gb|EEH54052.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 164
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 58/130 (44%), Gaps = 28/130 (21%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL------G 69
L+ R + LF L P + A + IE +++ LGL +KRA R S++YL G
Sbjct: 35 LQVDRALWRLFLLAPTPEEARRTGLDAIEDVLAPLGLHRKRARAFVRLSEDYLAATRPRG 94
Query: 70 ESW---------------------THVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLN 108
E+ LHGVG YA+DA A+FC G V P DH L
Sbjct: 95 EALEANGGGGGGDGGLFSTRTKLAAPAASLHGVGAYASDAHAMFCDGLMG-VAPRDHALR 153
Query: 109 YYWEFLVSTK 118
+++ + V +
Sbjct: 154 WWYAWAVERR 163
>gi|357631679|gb|EHJ79148.1| putative methyl-CpG binding domain protein 4-like protein [Danaus
plexippus]
Length = 196
Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 46/97 (47%), Gaps = 2/97 (2%)
Query: 18 AGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQ 77
A I FT P + +E+ LGL KKR MI + S +++ W +
Sbjct: 67 ARPYIKSFFTDYPTPYQVLDETPSSLERFFENLGL-KKRGNMIWKLSYQFVSGKWRRASD 125
Query: 78 LHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFL 114
L G+GKY DA+ IFC G D V P D L Y ++L
Sbjct: 126 LCGIGKYGEDAYRIFCLGHTD-VNPDDRYLKLYLDWL 161
>gi|308809063|ref|XP_003081841.1| unnamed protein product [Ostreococcus tauri]
gi|116060308|emb|CAL55644.1| unnamed protein product [Ostreococcus tauri]
Length = 757
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 39/115 (33%), Positives = 56/115 (48%), Gaps = 12/115 (10%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVDA-----EEIEKIISTLGLQKKRAPMIKRFSQEYLGE 70
L+ RV+ LF L P A A + + E +E++I+ LGL +KR R S++ +
Sbjct: 639 LQVDRVLWRLFLLAPTAADAIALGSTPEGLEALERVIAPLGLHRKRTDAFVRLSKDVEKQ 698
Query: 71 SWTH------VTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLVSTKG 119
H V+ HGVG YAADA A+F G P DH L +Y + +G
Sbjct: 699 RAAHGGRVKNVSACHGVGVYAADAHALFVDGVLAG-PPRDHALRWYHAWATERRG 752
>gi|145351789|ref|XP_001420245.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580479|gb|ABO98538.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 994
Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats.
Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 12/106 (11%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVD-----AEEIEKIISTLGLQKKRAPMIKRFSQEY--- 67
L+ RV+ LF L P A A E+ E +E+II+ LGL +KR + S++
Sbjct: 871 LQVDRVLWRLFLLAPTAAAAVELGDTHEGLEALERIIAPLGLHRKRTNAFVKLSRDVEAQ 930
Query: 68 ---LGESWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
G +V+ HGVG YAADA A+F G P DH L +Y
Sbjct: 931 RAKYGGRIKNVSACHGVGVYAADAHALFVDGVLAG-PPRDHALRWY 975
>gi|299470461|emb|CBN78453.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 312
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHG 80
+ F CP + D ++ K + LGL ++ +IKRF++ +L +W ++LHG
Sbjct: 192 AVKAFFEACPTPTSVLAADLGDLRKTLLPLGLNREL--IIKRFAEGFLASAWHDPSELHG 249
Query: 81 VGKYAADAFAIFCTGK 96
G++A+D++ IFC G
Sbjct: 250 CGRFASDSWQIFCLGN 265
>gi|384247134|gb|EIE20622.1| hypothetical protein COCSUDRAFT_83540 [Coccomyxa subellipsoidea
C-169]
Length = 116
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%)
Query: 37 EVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTG 95
E EE+E I LGLQ+ R +K+ SQ + + W + +G GK+ +D++ IFC G
Sbjct: 18 EAPDEELESFIDPLGLQETRIKAVKQMSQAFFEKEWEDPVEFYGCGKFTSDSWRIFCRG 76
>gi|440804280|gb|ELR25157.1| base excision DNA repair protein, HhHGPD subfamily protein
[Acanthamoeba castellanii str. Neff]
Length = 229
Score = 51.6 bits (122), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 53/113 (46%), Gaps = 14/113 (12%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL-GESWTHVTQLH 79
V+ DLF P + E D EE+ +I LGL +KRA + FS+ YL W +L
Sbjct: 27 VVGDLFDRYPSPEAMAEADEEELAGMIRRLGLYRKRAKALVDFSRTYLDSPPWRTPAELK 86
Query: 80 GVGKYAADAFAIFCTGK-------------WDRVRPTDHMLNYYWEFLVSTKG 119
GKYA D++ +FC G RVRP D L Y ++L + G
Sbjct: 87 YFGKYADDSYRLFCLGGTSLSLWLTPLLLCSSRVRPRDKKLKQYRDWLFAHYG 139
>gi|390600027|gb|EIN09422.1| DNA glycosylase [Punctularia strigosozonata HHB-11173 SS5]
Length = 173
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 52/122 (42%), Gaps = 20/122 (16%)
Query: 18 AGRVISDLF----TLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGES-- 71
AGR +F P + + D E+ +I TLGLQ RA + + S+ YL
Sbjct: 27 AGRTAIPIFWKILDRWPSPQALSNADVTELTHMIKTLGLQSMRAKRLIQLSEMYLAAPPD 86
Query: 72 ------------WTHVTQLHGVGKYAADAFAIFCTG--KWDRVRPTDHMLNYYWEFLVST 117
T V+ L G G+YA D++ IFC W V PTD L Y + +T
Sbjct: 87 PSILHKSRNYPVATPVSHLPGSGRYALDSYRIFCNPGIDWMNVMPTDKELIKYIRWKWAT 146
Query: 118 KG 119
G
Sbjct: 147 VG 148
>gi|358056969|dbj|GAA97128.1| hypothetical protein E5Q_03803 [Mixia osmundae IAM 14324]
Length = 257
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 22/121 (18%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTH------ 74
V+ + P A D +E+ ++I +GL +RA M+ RFSQ Y+ E T
Sbjct: 101 VLYQILERWPTAHALAAADLQELTQVIYPIGLYNRRARMLIRFSQTYVDEPPTEGILSCN 160
Query: 75 ----------VTQLHGVGKYAADAFAIFCTG------KWDRVRPTDHMLNYYWEFLVSTK 118
++ GVG+YA D++ IF W V P D L Y E+ + +
Sbjct: 161 KSDRAYVPTMISHYPGVGRYALDSYRIFIENANSSEPSWRSVLPLDKELKKYLEWRWALE 220
Query: 119 G 119
G
Sbjct: 221 G 221
>gi|397614200|gb|EJK62655.1| hypothetical protein THAOC_16723 [Thalassiosira oceanica]
Length = 547
Score = 49.3 bits (116), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 9/103 (8%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEY---------LGE 70
R + F PDA A + D EE+ I+S LGL KR+ + RFS+E+
Sbjct: 407 RSLFHFFQQWPDAVAAAKADWEEVLPIVSQLGLGNKRSQALVRFSREFKELADENDEFSL 466
Query: 71 SWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEF 113
S V L+ +G+Y A+ IF DH L+ Y E+
Sbjct: 467 SERQVKSLYFIGEYGWSAYRIFILKDLPLTGVKDHALSLYVEY 509
>gi|255086739|ref|XP_002509336.1| predicted protein [Micromonas sp. RCC299]
gi|226524614|gb|ACO70594.1| predicted protein [Micromonas sp. RCC299]
Length = 175
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 58/151 (38%), Gaps = 54/151 (35%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVDA---------------EEIEKIISTLGLQKKRAPMI 60
L+ RV+ LF L P + A + A + IE+I+ LGL +KRA +
Sbjct: 9 LQVDRVLWRLFLLVPTPEDAVALGALRDGGNDGFDGKSGLDRIEEILRPLGLHRKRARAL 68
Query: 61 KRFSQEYLGESWT--------------------------------------HVTQLHGVG 82
+ S++Y+ V LHGVG
Sbjct: 69 VKLSEDYVAARGAAPLPGPPGATGSDDDDDDGGGGRGSLGGFFGSSRRLSAPVASLHGVG 128
Query: 83 KYAADAFAIFCTGKWDRVRPTDHMLNYYWEF 113
YA+DA A+FC G V P DH L +++ +
Sbjct: 129 AYASDAHALFCEGTLG-VAPRDHALRWWYAW 158
>gi|444705535|gb|ELW46957.1| hypothetical protein TREES_T100019830 [Tupaia chinensis]
Length = 467
Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 29/45 (64%), Gaps = 2/45 (4%)
Query: 57 APMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTGKWDRVR 101
AP++ + EYL + W + +LHG+GKY D++ IFC +W + R
Sbjct: 17 APLVT--ANEYLTKQWRYPIELHGIGKYGNDSYRIFCVNEWKQKR 59
>gi|325263222|ref|ZP_08129957.1| endonuclease III [Clostridium sp. D5]
gi|324031615|gb|EGB92895.1| endonuclease III [Clostridium sp. D5]
Length = 212
Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD E D EEIE+I+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEDLYAKYPDVDALAEADVEEIERIVRPCGLGKSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K +EY G +++ + +L GVG+ +A+
Sbjct: 94 MKILKEEYEGGIPKTFNELMKLPGVGRKSAN 124
>gi|332287845|ref|YP_004422746.1| putative DNA repair protein [Chlamydophila psittaci 6BC]
gi|384451012|ref|YP_005663612.1| endonuclease III [Chlamydophila psittaci 6BC]
gi|325507289|gb|ADZ18927.1| putative DNA repair protein [Chlamydophila psittaci 6BC]
gi|328915106|gb|AEB55939.1| endonuclease III [Chlamydophila psittaci 6BC]
Length = 227
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 14/92 (15%)
Query: 12 IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+AILL V LF+L PDA+T ++ E + +IS GL +++A + +
Sbjct: 37 VAILLSGNSTDKAVNAVTPRLFSLAPDAQTLVQLPLENLYSLISPCGLGRRKAAYLHNLA 96
Query: 65 ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
++Y GE S +TQL GVG+ A F
Sbjct: 97 KILLEKYTGEPPASLELLTQLPGVGRKTASVF 128
>gi|329943266|ref|ZP_08292040.1| hhH-GPD superbase excision DNA repair family protein [Chlamydophila
psittaci Cal10]
gi|384454931|ref|YP_005667526.1| putative DNA repair protein [Chlamydophila psittaci 02DC15]
gi|392377074|ref|YP_004064852.1| putative DNA repair protein [Chlamydophila psittaci RD1]
gi|407454501|ref|YP_006733609.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci 84/55]
gi|407455769|ref|YP_006734660.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci GR9]
gi|407458507|ref|YP_006736812.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci WS/RT/E30]
gi|313848417|emb|CBY17421.1| putative DNA repair protein [Chlamydophila psittaci RD1]
gi|328814813|gb|EGF84803.1| hhH-GPD superbase excision DNA repair family protein [Chlamydophila
psittaci Cal10]
gi|334694688|gb|AEG87905.1| putative DNA repair protein [Chlamydophila psittaci 02DC15]
gi|405781260|gb|AFS20010.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci 84/55]
gi|405782312|gb|AFS21061.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci GR9]
gi|405785443|gb|AFS24189.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci WS/RT/E30]
Length = 219
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 14/92 (15%)
Query: 12 IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+AILL V LF+L PDA+T ++ E + +IS GL +++A + +
Sbjct: 37 VAILLSGNSTDKAVNAVTPRLFSLAPDAQTLVQLPLENLYSLISPCGLGRRKAAYLHNLA 96
Query: 65 ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
++Y GE S +TQL GVG+ A F
Sbjct: 97 KILLEKYTGEPPASLELLTQLPGVGRKTASVF 128
>gi|384451999|ref|YP_005664597.1| putative DNA repair protein [Chlamydophila psittaci 01DC11]
gi|384452973|ref|YP_005665570.1| putative DNA repair protein [Chlamydophila psittaci 08DC60]
gi|384453952|ref|YP_005666548.1| putative DNA repair protein [Chlamydophila psittaci C19/98]
gi|406592825|ref|YP_006740005.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci CP3]
gi|406594366|ref|YP_006742094.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci MN]
gi|407457187|ref|YP_006735760.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci VS225]
gi|407461123|ref|YP_006738898.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci WC]
gi|410858857|ref|YP_006974797.1| putative DNA repair protein [Chlamydia psittaci 01DC12]
gi|449071573|ref|YP_007438653.1| endonuclease III [Chlamydophila psittaci Mat116]
gi|334692733|gb|AEG85952.1| putative DNA repair protein [Chlamydophila psittaci C19/98]
gi|334693709|gb|AEG86927.1| putative DNA repair protein [Chlamydophila psittaci 01DC11]
gi|334695662|gb|AEG88878.1| putative DNA repair protein [Chlamydophila psittaci 08DC60]
gi|405782798|gb|AFS21546.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci MN]
gi|405784448|gb|AFS23195.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci VS225]
gi|405787270|gb|AFS26014.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci WC]
gi|405788697|gb|AFS27440.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci CP3]
gi|410811752|emb|CCO02407.1| putative DNA repair protein [Chlamydia psittaci 01DC12]
gi|449040081|gb|AGE75505.1| endonuclease III [Chlamydophila psittaci Mat116]
Length = 221
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 14/92 (15%)
Query: 12 IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+AILL V LF+L PDA+T ++ E + +IS GL +++A + +
Sbjct: 37 VAILLSGNSTDKAVNAVTPRLFSLAPDAQTLVQLPLENLYSLISPCGLGRRKAAYLHNLA 96
Query: 65 ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
++Y GE S +TQL GVG+ A F
Sbjct: 97 KILLEKYTGEPPASLELLTQLPGVGRKTASVF 128
>gi|406593881|ref|YP_006741060.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci NJ1]
gi|407459750|ref|YP_006737853.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci M56]
gi|405786532|gb|AFS25277.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci M56]
gi|405789753|gb|AFS28495.1| hhH-GPD superbase excision DNA repair family protein [Chlamydia
psittaci NJ1]
Length = 221
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 14/92 (15%)
Query: 12 IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+AILL V LF+L PDA+T ++ E + +IS GL +++A + +
Sbjct: 37 VAILLSGNSTDKAVNAVTPRLFSLAPDAQTLAQLPLENLYSLISPCGLGRRKAAYLHNLA 96
Query: 65 ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
++Y GE S +TQL GVG+ A F
Sbjct: 97 KILLEKYTGEPPASLELLTQLPGVGRKTASVF 128
>gi|412989966|emb|CCO20608.1| predicted protein [Bathycoccus prasinos]
Length = 952
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 75 VTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYYWEFLV 115
V++ HGVG YAADA A+FC G P DH L +Y+ F +
Sbjct: 884 VSEFHGVGAYAADAHAMFCDGII-AAPPRDHALRWYYAFAL 923
Score = 38.9 bits (89), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 5/58 (8%)
Query: 16 LKAGRVISDLFTLCPDAKTA-----TEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL 68
L+ RV+ +LFTL P A+ A TE E IEK ++T+GL +KRA R S++ +
Sbjct: 748 LQVDRVLWELFTLAPTAEDAVKLGETERGIELIEKCLATIGLHRKRARAFVRLSKDVV 805
>gi|332231797|ref|XP_003265081.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 3 [Nomascus
leucogenys]
Length = 538
Score = 44.3 bits (103), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+G
Sbjct: 484 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 536
>gi|426342069|ref|XP_004036338.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 3 [Gorilla
gorilla gorilla]
Length = 540
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+G
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 538
>gi|114589107|ref|XP_001143803.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 3 [Pan
troglodytes]
Length = 540
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+G
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 538
>gi|397518618|ref|XP_003829481.1| PREDICTED: methyl-CpG-binding domain protein 4 isoform 3 [Pan
paniscus]
Length = 540
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+G
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 538
>gi|442796454|ref|NP_001263201.1| methyl-CpG-binding domain protein 4 isoform 4 [Homo sapiens]
gi|33150919|gb|AAP97338.1| methyl-CpG binding domain protein 4 variant [Homo sapiens]
gi|119599658|gb|EAW79252.1| methyl-CpG binding domain protein 4, isoform CRA_b [Homo sapiens]
Length = 540
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 31/53 (58%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
P A+ A D ++ +++ LGL RA I +FS EYL + W + +LHG+G
Sbjct: 486 PSAEVARTADWRDVSELLKPLGLYDLRAKTIVKFSDEYLTKQWKYPIELHGIG 538
>gi|62185498|ref|YP_220283.1| DNA repair protein [Chlamydophila abortus S26/3]
gi|62148565|emb|CAH64337.1| putative DNA repair protein [Chlamydophila abortus S26/3]
Length = 219
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 7/76 (9%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ----EYLGE---SWT 73
V LF+L PDA+ ++ E++ IIS GL +++A + +Q +Y GE S
Sbjct: 53 VTPRLFSLAPDAQALVQLPLEDLYFIISPCGLGRRKAAYLHHLAQILLEKYHGEPPASLE 112
Query: 74 HVTQLHGVGKYAADAF 89
+TQL GVG+ A F
Sbjct: 113 LLTQLPGVGRKTASVF 128
>gi|302674673|ref|XP_003027021.1| hypothetical protein SCHCODRAFT_61605 [Schizophyllum commune H4-8]
gi|300100706|gb|EFI92118.1| hypothetical protein SCHCODRAFT_61605 [Schizophyllum commune H4-8]
Length = 223
Score = 43.5 bits (101), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 3/38 (7%)
Query: 73 THVTQLHGVGKYAADAFAIFCTG---KWDRVRPTDHML 107
T ++ L GVG+YA D+F FCTG +W VRPTD L
Sbjct: 136 TPISHLPGVGRYALDSFRTFCTGGSDEWKSVRPTDKEL 173
>gi|392584695|gb|EIW74039.1| DNA glycosylase [Coniophora puteana RWD-64-598 SS2]
Length = 219
Score = 43.5 bits (101), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 50/140 (35%), Gaps = 59/140 (42%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL--------------------- 68
P K E D +E+ +I LGLQ RA +K S+ Y
Sbjct: 40 PTPKDLMEADTDELVAMIQKLGLQSTRAIRLKELSRAYHEHPPRTGTLQLEHGKFARAPA 99
Query: 69 -------------------------GESW----THVTQLHGVGKYAADAFAIFCT----- 94
GE+W T ++ L GVG+Y D++ IFCT
Sbjct: 100 SLRERYPPELPSTCDSEADVSLALGGEAWPYPKTPISHLPGVGRYGWDSYRIFCTDNVTG 159
Query: 95 ----GKWDRVRPTDHMLNYY 110
+W RV+P+D L Y
Sbjct: 160 TGSSDEWKRVQPSDKELCRY 179
>gi|159486950|ref|XP_001701499.1| DNA repair glycosylase [Chlamydomonas reinhardtii]
gi|158271560|gb|EDO97376.1| DNA repair glycosylase [Chlamydomonas reinhardtii]
Length = 292
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 45/91 (49%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRF 63
+ ++ L + +K V +LF PDA+ +++A EIE II LGL +A ++
Sbjct: 107 LVAVMLSAQSTDVKVNTVTPELFRRGPDAEAMAKLEASEIEGIIRVLGLAPTKAKNVRAM 166
Query: 64 SQ----EYLGE---SWTHVTQLHGVGKYAAD 87
SQ +Y G+ SW + L GVG A
Sbjct: 167 SQILVEQYDGQVPGSWEGLEALPGVGHKTAS 197
>gi|260587725|ref|ZP_05853638.1| endonuclease III [Blautia hansenii DSM 20583]
gi|260541990|gb|EEX22559.1| endonuclease III [Blautia hansenii DSM 20583]
Length = 211
Score = 43.1 bits (100), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD E EE+EKI+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEDLYAKYPDVNALAEAPVEEVEKIVRPCGLGKSKARDICAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K +EY G + + + +L GVG+ +A+
Sbjct: 94 MKILKEEYQGKVPDDFQALLKLPGVGRKSAN 124
>gi|331083986|ref|ZP_08333093.1| hypothetical protein HMPREF0992_02017 [Lachnospiraceae bacterium
6_1_63FAA]
gi|330402348|gb|EGG81918.1| hypothetical protein HMPREF0992_02017 [Lachnospiraceae bacterium
6_1_63FAA]
Length = 211
Score = 43.1 bits (100), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD E EE+EKI+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEDLYVKYPDVNALAEAPVEEVEKIVRPCGLGKSKARDICAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K +EY G + + + +L GVG+ +A+
Sbjct: 94 MKILKEEYQGKVPDDFQALLKLPGVGRKSAN 124
>gi|302850094|ref|XP_002956575.1| hypothetical protein VOLCADRAFT_33476 [Volvox carteri f.
nagariensis]
gi|300258102|gb|EFJ42342.1| hypothetical protein VOLCADRAFT_33476 [Volvox carteri f.
nagariensis]
Length = 198
Score = 42.7 bits (99), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 45/105 (42%), Gaps = 12/105 (11%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ---EYLG---- 69
K V +LF PDA + EIE+II LGL +A ++R SQ E G
Sbjct: 44 KVNTVTPELFARGPDAMAMAALKVSEIERIIRVLGLAPTKARNVQRLSQMLVELYGGQVP 103
Query: 70 ESWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTD---HMLNYYW 111
+S++ + +L GVG A + C P D H L W
Sbjct: 104 DSFSALEELPGVGHKTAS--VVMCQAFSHPAFPVDTHIHRLAQRW 146
>gi|395323248|gb|EJF55730.1| hypothetical protein DICSQDRAFT_73043 [Dichomitus squalens LYAD-421
SS1]
Length = 283
Score = 42.7 bits (99), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 73 THVTQLHGVGKYAADAFAIFCTG--KWDRVRPTDHMLNYYWEF 113
T ++ + G G YA D++ IFCTG +W VRPTD L Y ++
Sbjct: 202 TPISHIPGCGPYALDSYRIFCTGGDEWRSVRPTDKELVKYLQW 244
>gi|347533057|ref|YP_004839820.1| DNA-(apurinic or apyrimidinic site) lyase [Roseburia hominis
A2-183]
gi|345503205|gb|AEN97888.1| DNA-(apurinic or apyrimidinic site) lyase [Roseburia hominis
A2-183]
Length = 211
Score = 42.7 bits (99), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA------ 57
+ S+RL + V+ DL+ PD + D EEIE+I+ GL +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVQDLYREFPDVDALADADVEEIERIVKPCGLGHSKARDISAC 93
Query: 58 -PMIKRFSQEYLGESWTHVTQLHGVGKYAAD 87
M+K Q + E + + +L GVG+ +A+
Sbjct: 94 MKMLKEKYQGKVPEDFDALLELPGVGRKSAN 124
>gi|424825542|ref|ZP_18250529.1| putative DNA repair protein [Chlamydophila abortus LLG]
gi|333410641|gb|EGK69628.1| putative DNA repair protein [Chlamydophila abortus LLG]
Length = 219
Score = 42.4 bits (98), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 7/76 (9%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ----EYLGE---SWT 73
V LF+L PDA+ ++ E++ IIS GL +++ + +Q +Y GE S
Sbjct: 53 VTPRLFSLAPDAQALVQLPLEDLYFIISPCGLGRRKVAYLHHLAQILLEKYHGEPPASLE 112
Query: 74 HVTQLHGVGKYAADAF 89
+TQL GVG+ A F
Sbjct: 113 LLTQLPGVGRKTASVF 128
>gi|255280265|ref|ZP_05344820.1| endonuclease III [Bryantella formatexigens DSM 14469]
gi|255269356|gb|EET62561.1| putative endonuclease III [Marvinbryantia formatexigens DSM 14469]
Length = 239
Score = 42.4 bits (98), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V++DLF P K E D E+IE+I+ GL +A
Sbjct: 61 LVSVRLAAQCTDARVNIVVADLFVKYPGVKELAEADVEDIERIVKPCGLGHSKARDISAC 120
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G E + + +L GVG+ +A+
Sbjct: 121 MKMLQEQYDGKIPEDFDALLKLPGVGRKSAN 151
>gi|291562481|emb|CBL41297.1| Predicted EndoIII-related endonuclease [butyrate-producing
bacterium SS3/4]
Length = 217
Score = 42.0 bits (97), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD K E D +EIE+I+ GL + +A
Sbjct: 39 LVSVRLAAQCTDARVNVVVQDLYKKFPDVKALAEADVDEIEEIVRPCGLGRSKARDISAC 98
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G + + + +L GVG+ +A+
Sbjct: 99 MKILYEQYHGNVPDDFDALLKLPGVGRKSAN 129
>gi|374673082|dbj|BAL50973.1| endonuclease III [Lactococcus lactis subsp. lactis IO-1]
Length = 218
Score = 42.0 bits (97), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 28/41 (68%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
LF PDA+T ++ + EEIEK+I T+GL K +A I R SQ
Sbjct: 55 LFAAFPDAQTMSQANVEEIEKLIRTIGLYKTKAKNILRTSQ 95
>gi|167761558|ref|ZP_02433685.1| hypothetical protein CLOSCI_03969 [Clostridium scindens ATCC 35704]
gi|336422654|ref|ZP_08602797.1| hypothetical protein HMPREF0993_02174 [Lachnospiraceae bacterium
5_1_57FAA]
gi|167661224|gb|EDS05354.1| putative endonuclease III [Clostridium scindens ATCC 35704]
gi|336007827|gb|EGN37848.1| hypothetical protein HMPREF0993_02174 [Lachnospiraceae bacterium
5_1_57FAA]
Length = 215
Score = 41.6 bits (96), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD + D E+IE+I+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEDLYARYPDVEALAAADVEDIERIVRPCGLGKSKARDISAC 93
Query: 60 IKRFSQEYLGE---SWTHVTQLHGVGKYAAD 87
+K +EY G+ + + +L GVG+ +A+
Sbjct: 94 MKILKEEYDGKIPRDFNALLKLPGVGRKSAN 124
>gi|145357325|ref|XP_001422870.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144583114|gb|ABP01229.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 156
Score = 41.6 bits (96), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 7/95 (7%)
Query: 22 ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT----- 76
IS+ F P A D++++ +II LGL R I S ++L E+
Sbjct: 47 ISNFFAKFPTPSAALAADSQDVFEIIKRLGLFPGRMRTIVEVSTKFLLETGEFEVGLEPE 106
Query: 77 -QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
+++G+G++ D+F IF G R +P D L +
Sbjct: 107 LKIYGIGEFGVDSFEIFARGDVRR-KPKDCNLQSF 140
>gi|15673053|ref|NP_267227.1| endonuclease III [Lactococcus lactis subsp. lactis Il1403]
gi|418037871|ref|ZP_12676231.1| DNA-(apurinic or apyrimidinic site) lyase [Lactococcus lactis
subsp. cremoris CNCM I-1631]
gi|12724026|gb|AAK05169.1|AE006340_2 endonuclease III [Lactococcus lactis subsp. lactis Il1403]
gi|354693995|gb|EHE93702.1| DNA-(apurinic or apyrimidinic site) lyase [Lactococcus lactis
subsp. cremoris CNCM I-1631]
Length = 218
Score = 40.8 bits (94), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 27/41 (65%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
LF PDA+T ++ EEIEK+I T+GL K +A I R SQ
Sbjct: 55 LFATFPDAQTMSQAKVEEIEKLIRTIGLYKTKAKNILRTSQ 95
>gi|385830614|ref|YP_005868427.1| endonuclease III [Lactococcus lactis subsp. lactis CV56]
gi|326406622|gb|ADZ63693.1| endonuclease III [Lactococcus lactis subsp. lactis CV56]
Length = 218
Score = 40.8 bits (94), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 27/41 (65%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
LF PDA+T ++ EEIEK+I T+GL K +A I R SQ
Sbjct: 55 LFATFPDAQTMSQAKVEEIEKLIRTIGLYKTKAKNILRTSQ 95
>gi|254468543|ref|ZP_05081949.1| endonuclease III [beta proteobacterium KB13]
gi|207087353|gb|EDZ64636.1| endonuclease III [beta proteobacterium KB13]
Length = 212
Score = 40.8 bits (94), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 14/92 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A RV + LF + PD +++ ++IE +I+++GL K +A I++ S
Sbjct: 34 IAVILSAQTTDVQVNRVTAKLFKIAPDPLKLSKLSLDKIESLINSIGLYKNKAKNIQQTS 93
Query: 65 Q----EYLGE---SWTHVTQLHGVGKYAADAF 89
+Y GE S + L GVG+ A+
Sbjct: 94 SMLITKYNGEVPQSRKELENLPGVGRKTANVI 125
>gi|281491571|ref|YP_003353551.1| endonuclease III [Lactococcus lactis subsp. lactis KF147]
gi|281375289|gb|ADA64802.1| Endonuclease III [Lactococcus lactis subsp. lactis KF147]
Length = 218
Score = 40.8 bits (94), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 27/41 (65%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
LF PDA+T ++ EEIEK+I T+GL K +A I R SQ
Sbjct: 55 LFAAFPDAQTMSQAKVEEIEKLIRTIGLYKTKAKNILRTSQ 95
>gi|420151539|ref|ZP_14658642.1| endonuclease III [Actinomyces massiliensis F0489]
gi|394766615|gb|EJF47646.1| endonuclease III [Actinomyces massiliensis F0489]
Length = 247
Score = 40.8 bits (94), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 9/75 (12%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAP--------MIKRFSQEYLGESW 72
V +LF LCPDA D +++E ++ LG+Q+ RA +++RF + +
Sbjct: 87 VTPELFALCPDAGALAAADPDDLEALLRPLGMQRTRAERLIGLASGLVERFDG-VVPQGR 145
Query: 73 THVTQLHGVGKYAAD 87
+ L GVG+ A+
Sbjct: 146 EALMTLPGVGRKTAN 160
>gi|392562227|gb|EIW55408.1| hypothetical protein TRAVEDRAFT_153786 [Trametes versicolor
FP-101664 SS1]
Length = 291
Score = 40.8 bits (94), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 75 VTQLHGVGKYAADAFAIFCTGK--WDRVRPTDHMLNYYWEF 113
V+ + G G YA D++ IFC+GK W VRP D L Y E+
Sbjct: 212 VSHIPGSGPYALDSYRIFCSGKDEWKSVRPRDKELVKYLEW 252
>gi|323693535|ref|ZP_08107739.1| endonuclease III [Clostridium symbiosum WAL-14673]
gi|355629427|ref|ZP_09050395.1| hypothetical protein HMPREF1020_04474 [Clostridium sp. 7_3_54FAA]
gi|323502390|gb|EGB18248.1| endonuclease III [Clostridium symbiosum WAL-14673]
gi|354819113|gb|EHF03565.1| hypothetical protein HMPREF1020_04474 [Clostridium sp. 7_3_54FAA]
Length = 211
Score = 40.8 bits (94), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD K E D ++IE+I+ GL + +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVQDLYDKFPDVKALAEADVDKIEEIVRPCGLGRSKARDINAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G E + + +L GVG+ +A+
Sbjct: 94 MKILWEQYGGKVPEDFDALLKLPGVGRKSAN 124
>gi|308807385|ref|XP_003081003.1| putative metalloendopeptidase (ISS) [Ostreococcus tauri]
gi|116059465|emb|CAL55172.1| putative metalloendopeptidase (ISS) [Ostreococcus tauri]
Length = 305
Score = 40.4 bits (93), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 44/96 (45%), Gaps = 7/96 (7%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT---- 76
VIS+ F P A DAEE+ ++I LGL R I S L +
Sbjct: 182 VISNFFEKFPTPTAALHADAEEVFEVIKRLGLFPGRHRTIVEVSTAILTNTGAFEVGLEP 241
Query: 77 --QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
+++G+G++ D+F IF G +R +P D L +
Sbjct: 242 EKKIYGIGEFGIDSFEIFARGDINR-KPKDCNLAAF 276
>gi|29840684|ref|NP_829790.1| endonuclease III [Chlamydophila caviae GPIC]
gi|29835034|gb|AAP05668.1| endonuclease III [Chlamydophila caviae GPIC]
Length = 214
Score = 40.4 bits (93), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 14/92 (15%)
Query: 12 IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI---- 60
+AILL V +LF+ PDA+ ++ +++ IIS GL K++A +
Sbjct: 37 VAILLSGNSTDKAVNSVTPELFSAAPDAQALAKLPLDKLYFIISPCGLGKRKAAYLHDLS 96
Query: 61 KRFSQEYLGE---SWTHVTQLHGVGKYAADAF 89
K S++Y GE S +T+L GVG+ A F
Sbjct: 97 KIISEKYRGEPPASLELLTKLPGVGRKTASVF 128
>gi|336375826|gb|EGO04161.1| hypothetical protein SERLA73DRAFT_45279 [Serpula lacrymans var.
lacrymans S7.3]
Length = 218
Score = 40.4 bits (93), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 46/134 (34%), Gaps = 53/134 (39%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEY---------------------- 67
P A + E+I +I +LGLQ KRA R SQ Y
Sbjct: 44 PTADALRQAKVEDIVSVIQSLGLQNKRARHFIRLSQAYVEDPPQPNVLRKSNKPHYLGYI 103
Query: 68 ----LGES-----------------------WTHVTQLHGVGKYAADAFAIFCT----GK 96
LG S T V+ + G+G YA D++ IFC +
Sbjct: 104 YIGALGSSPQGLPSHQHRRYQHYRRCRLQYPMTPVSHMPGMGPYALDSYRIFCVKSEYNE 163
Query: 97 WDRVRPTDHMLNYY 110
W RV P D L Y
Sbjct: 164 WKRVVPEDKELIRY 177
>gi|323485089|ref|ZP_08090442.1| hypothetical protein HMPREF9474_02193 [Clostridium symbiosum
WAL-14163]
gi|323401645|gb|EGA93990.1| hypothetical protein HMPREF9474_02193 [Clostridium symbiosum
WAL-14163]
Length = 175
Score = 40.4 bits (93), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD K E D ++IE+I+ GL + +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVQDLYDKFPDVKALAEADVDKIEEIVRPCGLGRSKARDINAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G E + + +L GVG+ +A+
Sbjct: 94 MKILWEQYGGKVPEDFDALLKLPGVGRKSAN 124
>gi|357056045|ref|ZP_09117101.1| hypothetical protein HMPREF9467_04073 [Clostridium clostridioforme
2_1_49FAA]
gi|355381537|gb|EHG28660.1| hypothetical protein HMPREF9467_04073 [Clostridium clostridioforme
2_1_49FAA]
Length = 212
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD E D E+IE+I+ GL +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEDLYAKYPDVDALAEADVEDIERIVKPCGLGHSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G + + + +L GVG+ +A+
Sbjct: 94 MKILKEQYEGKVPDDFDALLKLPGVGRKSAN 124
>gi|353244084|emb|CCA75538.1| hypothetical protein PIIN_09528 [Piriformospora indica DSM 11827]
Length = 199
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/125 (24%), Positives = 57/125 (45%), Gaps = 27/125 (21%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESW-------- 72
V++++ T P + ++ A E+E +++ LGL +KR + +F+ +L +
Sbjct: 42 VLNEVRTRWPTMEALSQAHAPELE-LLTPLGLGRKRTQRLIQFATMWLEDPPDQMSLRPS 100
Query: 73 -------------THVTQLHGVGKYAADAFAIFC-----TGKWDRVRPTDHMLNYYWEFL 114
T V+ L G+G+YA D++ IFC +W V P D L + +
Sbjct: 101 RVQVAKGVAKYPPTAVSHLPGMGRYALDSYRIFCGQLVGGNEWKTVTPLDKELRVHMRWR 160
Query: 115 VSTKG 119
+ +G
Sbjct: 161 WAVEG 165
>gi|346307710|ref|ZP_08849841.1| hypothetical protein HMPREF9457_01550 [Dorea formicigenerans
4_6_53AFAA]
gi|345904878|gb|EGX74620.1| hypothetical protein HMPREF9457_01550 [Dorea formicigenerans
4_6_53AFAA]
Length = 212
Score = 40.4 bits (93), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD + EEIE+I+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEDLYAKFPDVNALADAPVEEIEEIVRPCGLGKSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K EY G + + + +L GVG+ +A+
Sbjct: 94 MKILRDEYNGKVPDDFDKLLKLPGVGRKSAN 124
>gi|160935307|ref|ZP_02082689.1| hypothetical protein CLOBOL_00202 [Clostridium bolteae ATCC
BAA-613]
gi|158441665|gb|EDP19365.1| hypothetical protein CLOBOL_00202 [Clostridium bolteae ATCC
BAA-613]
Length = 212
Score = 40.4 bits (93), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD E D E+IE+I+ GL +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEDLYAKYPDVDALAEADVEDIERIVKPCGLGHSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G + + + +L GVG+ +A+
Sbjct: 94 MKILKEQYGGKVPDDFDALLKLPGVGRKSAN 124
>gi|166031547|ref|ZP_02234376.1| hypothetical protein DORFOR_01247 [Dorea formicigenerans ATCC
27755]
gi|166028524|gb|EDR47281.1| putative endonuclease III [Dorea formicigenerans ATCC 27755]
Length = 249
Score = 40.4 bits (93), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD + EEIE+I+ GL K +A
Sbjct: 71 LVSVRLAAQCTDARVNVVVEDLYAKFPDVNALADAPVEEIEEIVRPCGLGKSKARDISAC 130
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K EY G + + + +L GVG+ +A+
Sbjct: 131 MKILRDEYNGKVPDDFDKLLKLPGVGRKSAN 161
>gi|154503500|ref|ZP_02040560.1| hypothetical protein RUMGNA_01324 [Ruminococcus gnavus ATCC 29149]
gi|336431486|ref|ZP_08611335.1| hypothetical protein HMPREF0991_00454 [Lachnospiraceae bacterium
2_1_58FAA]
gi|153795600|gb|EDN78020.1| putative endonuclease III [Ruminococcus gnavus ATCC 29149]
gi|336013788|gb|EGN43660.1| hypothetical protein HMPREF0991_00454 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 213
Score = 40.0 bits (92), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 7/74 (9%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PMIKRFSQEYLGE---SWT 73
V+ DL+ PD E D +EIE I+ GL K +A ++ EY GE +
Sbjct: 53 VVKDLYAKFPDVNALAEADPQEIEAIVRPCGLGKSKARDISACMRILRDEYHGEVPDDFD 112
Query: 74 HVTQLHGVGKYAAD 87
+ +L GVG+ +A+
Sbjct: 113 ALLKLPGVGRKSAN 126
>gi|358066043|ref|ZP_09152577.1| hypothetical protein HMPREF9473_04640 [Clostridium hathewayi
WAL-18680]
gi|356695906|gb|EHI57531.1| hypothetical protein HMPREF9473_04640 [Clostridium hathewayi
WAL-18680]
Length = 211
Score = 40.0 bits (92), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 45/91 (49%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
+ S+RL + V+ DL+ PD ++ E D ++IE+I+ GL +A I
Sbjct: 34 LVSVRLAAQCTDARVNVVVQDLYAKYPDVESLAEADVDDIERIVKPCGLGHSKARDISGC 93
Query: 61 -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
K E+ G + + + +L GVG+ +A+
Sbjct: 94 MKMLRDEFGGKVPDDFDALMKLPGVGRKSAN 124
>gi|145350560|ref|XP_001419671.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579903|gb|ABO97964.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 331
Score = 40.0 bits (92), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 7/95 (7%)
Query: 22 ISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT----- 76
IS+ F P A D++++ +II LGL R I S ++L E+
Sbjct: 222 ISNFFAKFPTPSAALAADSQDVFEIIKRLGLFPGRMRTIVEVSTKFLLETGEFEVGLEPE 281
Query: 77 -QLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
+++G+G++ D+F IF G R +P D L +
Sbjct: 282 LKIYGIGEFGVDSFEIFARGDV-RRKPKDCNLQSF 315
>gi|389750655|gb|EIM91728.1| hypothetical protein STEHIDRAFT_70147 [Stereum hirsutum FP-91666
SS1]
Length = 439
Score = 40.0 bits (92), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 73 THVTQLHGVGKYAADAFAIFCTG-----KWDRVRPTDHMLNYYWEFLVSTKG 119
+ ++ L G GKYA D++ I+C G +W VRP D L Y ++ +T+G
Sbjct: 349 SPISHLPGSGKYAIDSYRIYCGGIAGDDEWRHVRPEDKELVRYLKWRWATEG 400
Score = 36.2 bits (82), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 31/54 (57%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE 70
KA + +L P AK T+ +E+ +II TLGLQ+ R+ + + SQ Y+ E
Sbjct: 169 KAIPIFWELMERWPSAKAMTKAHEDELCEIIRTLGLQRTRSARLIKLSQAYIDE 222
>gi|336388964|gb|EGO30107.1| putative 5-Methylcytosine G/T mismatch-specific DNA glycosylase
[Serpula lacrymans var. lacrymans S7.9]
Length = 179
Score = 40.0 bits (92), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 46/134 (34%), Gaps = 53/134 (39%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEY---------------------- 67
P A + E+I +I +LGLQ KRA R SQ Y
Sbjct: 5 PTADALRQAKVEDIVSVIQSLGLQNKRARHFIRLSQAYVEDPPQPNVLRKSNKPHYLGYI 64
Query: 68 ----LGES-----------------------WTHVTQLHGVGKYAADAFAIFCT----GK 96
LG S T V+ + G+G YA D++ IFC +
Sbjct: 65 YIGALGSSPQGLPSHQHRRYQHYRRCRLQYPMTPVSHMPGMGPYALDSYRIFCVKSEYNE 124
Query: 97 WDRVRPTDHMLNYY 110
W RV P D L Y
Sbjct: 125 WKRVVPEDKELIRY 138
>gi|317473465|ref|ZP_07932759.1| endonuclease III [Anaerostipes sp. 3_2_56FAA]
gi|316899115|gb|EFV21135.1| endonuclease III [Anaerostipes sp. 3_2_56FAA]
Length = 211
Score = 39.3 bits (90), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD E D +IE I+ GL + +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVQDLYEKYPDVNALAEADPADIEAIVRPCGLGRSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K EY G +++ + +L GVG+ +A+
Sbjct: 94 MKILRDEYGGGIPDNFKDLMKLPGVGRKSAN 124
>gi|414074236|ref|YP_006999453.1| Endonuclease III [Lactococcus lactis subsp. cremoris UC509.9]
gi|413974156|gb|AFW91620.1| Endonuclease III [Lactococcus lactis subsp. cremoris UC509.9]
Length = 218
Score = 39.3 bits (90), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 19 GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
+ LF PDA+T ++ EEIE +I T+GL K +A I R SQ
Sbjct: 49 NKATPGLFAAFPDAQTMSQAKVEEIEALIRTIGLYKTKAKNILRTSQ 95
>gi|374586841|ref|ZP_09659933.1| DNA-(apurinic or apyrimidinic site) lyase [Leptonema illini DSM
21528]
gi|373875702|gb|EHQ07696.1| DNA-(apurinic or apyrimidinic site) lyase [Leptonema illini DSM
21528]
Length = 229
Score = 39.3 bits (90), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+A++L A RV +LF PD ++ E E+E++I + G + +A I+ F
Sbjct: 37 VAVILSAQCTDEMVNRVTPELFKAFPDPRSLAEAPVAEVERLIHSTGFYRNKARNIQGFC 96
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
+ + E V Q + GVG+ A+
Sbjct: 97 RALVNEHGGKVPQTIEELVKMPGVGRKTAN 126
>gi|224010940|ref|XP_002294427.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969922|gb|EED88261.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 472
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 25 LFTLC---PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL 68
LF+L P + D EEI +IIS LGL KR+ I RFS+EY+
Sbjct: 315 LFSLLQRWPRVHSMARADWEEIREIISPLGLMNKRSKSIIRFSKEYI 361
>gi|116511912|ref|YP_809128.1| endonuclease III / DNA-(apurinic or apyrimidinic site) lyase
[Lactococcus lactis subsp. cremoris SK11]
gi|385838143|ref|YP_005875773.1| Endonuclease III [Lactococcus lactis subsp. cremoris A76]
gi|116107566|gb|ABJ72706.1| DNA-(apurinic or apyrimidinic site) lyase [Lactococcus lactis
subsp. cremoris SK11]
gi|358749371|gb|AEU40350.1| Endonuclease III [Lactococcus lactis subsp. cremoris A76]
Length = 218
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 26/41 (63%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
LF PDA+T ++ EEIE +I T+GL K +A I R SQ
Sbjct: 55 LFAAFPDAQTMSQAKVEEIEALIRTIGLYKTKAKNILRTSQ 95
>gi|402218777|gb|EJT98852.1| DNA glycosylase [Dacryopinax sp. DJM-731 SS1]
Length = 445
Score = 38.9 bits (89), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 49/127 (38%), Gaps = 42/127 (33%)
Query: 31 DAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE---------------SWTHV 75
DA + + V E+ +++ LGLQ RA I FS++Y T V
Sbjct: 266 DAASLSHVPEVELVEVLRPLGLQNTRACRILTFSKQYGSAPPSTQLYPLRVDKTYPLTAV 325
Query: 76 TQLHGVGKYAADAFAIFCTG-------------------------KWDRVRPTDHMLNYY 110
GVG+YAAD++ I+ +W +VRP D L +Y
Sbjct: 326 ANYAGVGRYAADSWRIYSPTLEGGGAPRGAKETLEGFFEPENSEQEWKKVRPLDKELRWY 385
Query: 111 --WEFLV 115
W + V
Sbjct: 386 LIWRWAV 392
>gi|393795369|ref|ZP_10378733.1| HhH-GPD family protein [Candidatus Nitrosoarchaeum limnia BG20]
Length = 235
Score = 38.9 bits (89), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 23 SDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE-------SWTHV 75
S++ + C D +D +++K++ LGL+ R+ +K S++ L + + +
Sbjct: 75 SNITSRCNDFNDLQNIDLRKLKKVLKPLGLEHTRSKHLKDLSKKILADFNGKVPGTKKEL 134
Query: 76 TQLHGVGKYAADAFAIFCTG 95
L GVG Y A+A F G
Sbjct: 135 LSLPGVGPYTANAVLCFAFG 154
>gi|224009405|ref|XP_002293661.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971061|gb|EED89397.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 658
Score = 38.9 bits (89), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 25 LFTLC---PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL 68
LF+L P + D EEI +IIS LGL KR+ I RFS+EY+
Sbjct: 501 LFSLLQRWPRVHSMARADWEEIREIISPLGLMNKRSKSIIRFSKEYI 547
>gi|292656948|ref|YP_003536845.1| A/G-specific adenine glycosylase [Haloferax volcanii DS2]
gi|448293546|ref|ZP_21483651.1| A/G-specific adenine glycosylase [Haloferax volcanii DS2]
gi|291372790|gb|ADE05017.1| A/G-specific adenine glycosylase [Haloferax volcanii DS2]
gi|445570407|gb|ELY24970.1| A/G-specific adenine glycosylase [Haloferax volcanii DS2]
Length = 212
Score = 38.9 bits (89), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 34/69 (49%), Gaps = 8/69 (11%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLG-------ESWTHVTQLHGVG 82
P +T E IE+ I+ LGL KRA I+R SQ+ + + + +LHGVG
Sbjct: 63 PSPETVVAASPEAIERRIAPLGL-AKRAEFIRRTSQQLIARHSGDVPRRYADLLELHGVG 121
Query: 83 KYAADAFAI 91
Y A + I
Sbjct: 122 DYTARSVLI 130
>gi|15618746|ref|NP_225032.1| endonuclease III [Chlamydophila pneumoniae CWL029]
gi|15836370|ref|NP_300894.1| endonuclease III [Chlamydophila pneumoniae J138]
gi|16752201|ref|NP_445569.1| endonuclease III [Chlamydophila pneumoniae AR39]
gi|33242197|ref|NP_877138.1| endonuclease III [Chlamydophila pneumoniae TW-183]
gi|4377151|gb|AAD18975.1| Enodnuclease III [Chlamydophila pneumoniae CWL029]
gi|7189945|gb|AAF38807.1| endonuclease III [Chlamydophila pneumoniae AR39]
gi|8979211|dbj|BAA99045.1| enodnuclease III [Chlamydophila pneumoniae J138]
gi|33236708|gb|AAP98795.1| endonuclease III [Chlamydophila pneumoniae TW-183]
Length = 209
Score = 38.9 bits (89), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 12 IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IAILL V LF PDA++ ++ ++ ++I+ GL ++++ I + S
Sbjct: 31 IAILLSGNSTDKAVNSVTPQLFAKAPDAQSILDLPPGKLYQLIAPCGLGERKSAYIYQLS 90
Query: 65 Q----EYLGESWTH---VTQLHGVGKYAADAFAIFCTGK 96
Q ++ GE +TQL GVG+ A F GK
Sbjct: 91 QILVRDFHGEPPNDMALLTQLPGVGRKTASVFLGIAYGK 129
>gi|384449995|ref|YP_005662597.1| putative enodnuclease III [Chlamydophila pneumoniae LPCoLN]
gi|269302624|gb|ACZ32724.1| putative enodnuclease III [Chlamydophila pneumoniae LPCoLN]
Length = 209
Score = 38.9 bits (89), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 12 IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IAILL V LF PDA++ ++ ++ ++I+ GL ++++ I + S
Sbjct: 31 IAILLSGNSTDKAVNSVTPQLFAKAPDAQSILDLPPGKLYQLIAPCGLGERKSAYIYQLS 90
Query: 65 Q----EYLGESWTH---VTQLHGVGKYAADAFAIFCTGK 96
Q ++ GE +TQL GVG+ A F GK
Sbjct: 91 QILVRDFHGEPPNDMALLTQLPGVGRKTASVFLGIAYGK 129
>gi|302339269|ref|YP_003804475.1| endonuclease III [Spirochaeta smaragdinae DSM 11293]
gi|301636454|gb|ADK81881.1| endonuclease III [Spirochaeta smaragdinae DSM 11293]
Length = 217
Score = 38.9 bits (89), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 9/84 (10%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA--------PMIKRFSQEYLGESW 72
V LF PDAK+ E D EE+E +I ++G + +A ++++F + ES
Sbjct: 51 VTEKLFRRFPDAKSLAEADGEEVEDLIHSVGFYRVKARHIVAAAAALLEKFDGS-VPESM 109
Query: 73 THVTQLHGVGKYAADAFAIFCTGK 96
+ + G+G+ A+ C GK
Sbjct: 110 EELLMIPGLGRKGANVVLGDCFGK 133
>gi|291536153|emb|CBL09265.1| Predicted EndoIII-related endonuclease [Roseburia intestinalis
M50/1]
Length = 212
Score = 38.5 bits (88), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ L+ PD K E EEIE+I+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNIVVEKLYEKFPDVKALAEAPVEEIEEIVRPCGLGKSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G E + + +L GVG+ +A+
Sbjct: 94 MKILWEQYGGNVPEDFDSLLKLPGVGRKSAN 124
>gi|167747754|ref|ZP_02419881.1| hypothetical protein ANACAC_02475 [Anaerostipes caccae DSM 14662]
gi|167653116|gb|EDR97245.1| putative endonuclease III [Anaerostipes caccae DSM 14662]
Length = 231
Score = 38.5 bits (88), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD E D +IE I+ GL + +A
Sbjct: 54 LVSVRLAAQCTDARVNVVVQDLYEKYPDVNALAEADPADIEAIVRPCGLGRSKARDISAC 113
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K EY G +++ + +L GVG+ +A+
Sbjct: 114 MKILRDEYGGGIPDNFKALMKLPGVGRKSAN 144
>gi|240146275|ref|ZP_04744876.1| endonuclease III [Roseburia intestinalis L1-82]
gi|257201577|gb|EEU99861.1| endonuclease III [Roseburia intestinalis L1-82]
gi|291538985|emb|CBL12096.1| Predicted EndoIII-related endonuclease [Roseburia intestinalis
XB6B4]
Length = 212
Score = 38.5 bits (88), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ L+ PD K E EEIE+I+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNIVVEKLYEKFPDVKALAEAPVEEIEEIVRPCGLGKSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G E + + +L GVG+ +A+
Sbjct: 94 MKILWEQYGGNVPEDFDSLLKLPGVGRKSAN 124
>gi|225571496|ref|ZP_03780492.1| hypothetical protein CLOHYLEM_07594 [Clostridium hylemonae DSM
15053]
gi|225159573|gb|EEG72192.1| hypothetical protein CLOHYLEM_07594 [Clostridium hylemonae DSM
15053]
Length = 211
Score = 38.5 bits (88), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD + EEIE+I+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEDLYEKYPDVEALASAPPEEIEEIVRPCGLGKSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K EY G + + + +L GVG+ +A+
Sbjct: 94 MKILRDEYGGNVPDDFDALLKLPGVGRKSAN 124
>gi|449547781|gb|EMD38748.1| hypothetical protein CERSUDRAFT_133253 [Ceriporiopsis subvermispora
B]
Length = 282
Score = 38.5 bits (88), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 42/110 (38%), Gaps = 39/110 (35%)
Query: 43 IEKIISTLGLQKKRAPMIKRFSQEYLGESW------------------------------ 72
IE +I LGL K+R+ I S+ YL +
Sbjct: 140 IEDLIKHLGLGKQRSRRIIELSRAYLDDPPIPGTLRISRCYITARTVACENGSIGLTKMR 199
Query: 73 ---THVTQLHGVGKYAADAFAIFCTG--KWDRVRPTD----HMLNYYWEF 113
T ++ L G G YA D++ IFC G W+ V P+D L + W F
Sbjct: 200 YPPTPISHLPGCGPYALDSYRIFCEGAADWEAVLPSDKELIRFLRWKWAF 249
>gi|153852865|ref|ZP_01994302.1| hypothetical protein DORLON_00284 [Dorea longicatena DSM 13814]
gi|149754507|gb|EDM64438.1| putative endonuclease III [Dorea longicatena DSM 13814]
Length = 210
Score = 38.5 bits (88), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ L+ PD E EEIE+I+ GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVESLYAKYPDVNALAEATPEEIEEIVRPCGLGKSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
++ EY G + + + +L GVG+ +A+
Sbjct: 94 MRMLRDEYDGKVPDDFNKLLKLPGVGRKSAN 124
>gi|223939321|ref|ZP_03631201.1| HhH-GPD family protein [bacterium Ellin514]
gi|223892034|gb|EEF58515.1| HhH-GPD family protein [bacterium Ellin514]
Length = 239
Score = 38.5 bits (88), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 6/68 (8%)
Query: 31 DAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE------SWTHVTQLHGVGKY 84
D T ++I KI+ LGL ++RAP IK + E E + + L GVG+Y
Sbjct: 79 DWDTLANTSEKKICKILQPLGLSRQRAPRIKALAVEICTEKGRFPQTQEDILLLPGVGQY 138
Query: 85 AADAFAIF 92
A+A +F
Sbjct: 139 IANAILLF 146
>gi|296242620|ref|YP_003650107.1| HhH-GPD family protein [Thermosphaera aggregans DSM 11486]
gi|296095204|gb|ADG91155.1| HhH-GPD family protein [Thermosphaera aggregans DSM 11486]
Length = 217
Score = 38.1 bits (87), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 8/83 (9%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE-------SW 72
+V S F+ P+ T + +IE II LGL +KRA ++ +Q + E +
Sbjct: 54 KVFSSFFSKFPNIGTLAKASETDIEAIIGELGL-RKRAGFLRELAQHAVEEFGDKIPNTL 112
Query: 73 THVTQLHGVGKYAADAFAIFCTG 95
+ +L GVG Y A+A F G
Sbjct: 113 EDLMKLKGVGLYTANAVRSFAYG 135
>gi|312127482|ref|YP_003992356.1| endonuclease iii [Caldicellulosiruptor hydrothermalis 108]
gi|311777501|gb|ADQ06987.1| endonuclease III [Caldicellulosiruptor hydrothermalis 108]
Length = 202
Score = 38.1 bits (87), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 7/80 (8%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ----EYLGESW 72
+ ++ ++LF P K+ E D EE+EK I +G K +A IK S+ +Y G
Sbjct: 38 RVNKITAELFKKYPTLKSFAEADLEELEKDIKPVGFYKNKAKSIKETSRILIDKYNGTLP 97
Query: 73 THV---TQLHGVGKYAADAF 89
T + +L GVG+ A+
Sbjct: 98 TTIEELVKLKGVGRKTANVI 117
>gi|383763496|ref|YP_005442478.1| DNA glycosylase/AP lyase [Caldilinea aerophila DSM 14535 = NBRC
104270]
gi|381383764|dbj|BAM00581.1| DNA glycosylase/AP lyase [Caldilinea aerophila DSM 14535 = NBRC
104270]
Length = 220
Score = 37.4 bits (85), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI----KRFSQEYLGESW 72
+ +V LFT PDA+ E D E+E++I + G + +A I +R +EY GE
Sbjct: 57 RVNKVTPALFTKYPDARAFAEADLSELEEMIRSTGFYRNKAKNIQQACRRIVEEYDGEVP 116
Query: 73 THVTQ---LHGVGKYAAD 87
+ + L GV + A+
Sbjct: 117 ADMEKLLTLAGVARKTAN 134
>gi|153815700|ref|ZP_01968368.1| hypothetical protein RUMTOR_01937 [Ruminococcus torques ATCC 27756]
gi|145846941|gb|EDK23859.1| putative endonuclease III [Ruminococcus torques ATCC 27756]
Length = 222
Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
+ S+RL + V+ L+ P + E D +EIEKI+ GL + +A I
Sbjct: 44 LVSVRLAAQCTDARVNVVVEGLYEKYPTVEALAEADVDEIEKIVRPCGLGRSKARDISGC 103
Query: 61 -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
K EY G + + + +L GVG+ +A+
Sbjct: 104 MKMIRDEYEGKVPDDFDALMKLPGVGRKSAN 134
>gi|296118073|ref|ZP_06836655.1| endonuclease III [Corynebacterium ammoniagenes DSM 20306]
gi|295968959|gb|EFG82202.1| endonuclease III [Corynebacterium ammoniagenes DSM 20306]
Length = 218
Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 7/78 (8%)
Query: 16 LKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGE----- 70
++ +V DLF P A + D +IE II LGL +A + Q+ +G+
Sbjct: 46 VRVNQVTPDLFKAYPRAIDYAQADVTDIEDIIRPLGLAPSKAKRLVGLGQKLVGDFDGEV 105
Query: 71 --SWTHVTQLHGVGKYAA 86
S +T L GVG+ A
Sbjct: 106 PTSIADLTSLPGVGRKTA 123
>gi|317501913|ref|ZP_07960097.1| endonuclease III [Lachnospiraceae bacterium 8_1_57FAA]
gi|331088272|ref|ZP_08337191.1| hypothetical protein HMPREF1025_00774 [Lachnospiraceae bacterium
3_1_46FAA]
gi|336440109|ref|ZP_08619708.1| hypothetical protein HMPREF0990_02102 [Lachnospiraceae bacterium
1_1_57FAA]
gi|316896593|gb|EFV18680.1| endonuclease III [Lachnospiraceae bacterium 8_1_57FAA]
gi|330408516|gb|EGG87982.1| hypothetical protein HMPREF1025_00774 [Lachnospiraceae bacterium
3_1_46FAA]
gi|336014458|gb|EGN44307.1| hypothetical protein HMPREF0990_02102 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 212
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
+ S+RL + V+ L+ P + E D +EIEKI+ GL + +A I
Sbjct: 34 LVSVRLAAQCTDARVNVVVEGLYEKYPTVEALAEADVDEIEKIVRPCGLGRSKARDISGC 93
Query: 61 -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
K EY G + + + +L GVG+ +A+
Sbjct: 94 MKMIRDEYEGKVPDDFDALMKLPGVGRKSAN 124
>gi|89897892|ref|YP_515002.1| enodnuclease III [Chlamydophila felis Fe/C-56]
gi|89331264|dbj|BAE80857.1| enodnuclease III [Chlamydophila felis Fe/C-56]
Length = 212
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 14/92 (15%)
Query: 12 IAILLKAGR-------VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA+LL V LF + PDA+ ++ +++ IIS GL ++++ + S
Sbjct: 37 IAVLLSGNSTDKAVNSVTPKLFAIAPDAQALAQLPLKKLYSIISPCGLGERKSEYLHNLS 96
Query: 65 ----QEYLGE---SWTHVTQLHGVGKYAADAF 89
+ Y GE S +T+L G+G+ A F
Sbjct: 97 KILLERYHGEPPASLDLLTELPGIGRKTASVF 128
>gi|375335673|ref|ZP_09777017.1| endonuclease III [Succinivibrionaceae bacterium WG-1]
Length = 209
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 7/75 (9%)
Query: 19 GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ----EYLGE---S 71
+V LF PDA++ ++ EEIE II +GL + +A +K + ++ GE +
Sbjct: 49 NQVTPILFAKAPDAQSMLKLTNEEIENIIKKIGLYRNKAKNLKALCESLVNDFNGEIPST 108
Query: 72 WTHVTQLHGVGKYAA 86
+ +L GVGK A
Sbjct: 109 EEELIKLPGVGKKTA 123
>gi|194333300|ref|YP_002015160.1| endonuclease III [Prosthecochloris aestuarii DSM 271]
gi|194311118|gb|ACF45513.1| endonuclease III [Prosthecochloris aestuarii DSM 271]
Length = 211
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 41/95 (43%), Gaps = 9/95 (9%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS---QEYLG---- 69
K + LF +CPDA++ + D EEI ++ ++ +A I S E G
Sbjct: 47 KVNMITETLFKVCPDAESMSRTDPEEIRSMVRSINYNNTKAKNILAASCLLMENFGGQVP 106
Query: 70 ESWTHVTQLHGVGKYAADAFAIFCTGKWDRVRPTD 104
+S + L GVG+ A+ GK V P D
Sbjct: 107 DSREELETLPGVGRKTANVVLSNAFGKP--VMPVD 139
>gi|295090057|emb|CBK76164.1| Predicted EndoIII-related endonuclease [Clostridium cf.
saccharolyticum K10]
Length = 211
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
+ S+RL + V+ DL+ PD E EEIE I+ GL +A I
Sbjct: 34 LVSVRLAAQCTDARVNVVVKDLYEKFPDVNALAEAPVEEIEAIVRPCGLGHSKAKDISDC 93
Query: 61 -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
K ++ G +S+ + L GVG+ +A+
Sbjct: 94 MKMLRDQFDGRVPDSFDALLSLPGVGRKSAN 124
>gi|266623597|ref|ZP_06116532.1| endonuclease III, partial [Clostridium hathewayi DSM 13479]
gi|288864609|gb|EFC96907.1| endonuclease III [Clostridium hathewayi DSM 13479]
Length = 191
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ DL+ PD + + E+IEKI+ GL +A
Sbjct: 14 LVSVRLAAQCTDARVNVVVQDLYAEYPDVEALAGAEVEDIEKIVKPCGLGHSKARDISAC 73
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K ++Y G + + + +L GVG+ +A+
Sbjct: 74 MKILKEQYDGRVPDDFDALLKLPGVGRKSAN 104
>gi|225848427|ref|YP_002728590.1| endonuclease III [Sulfurihydrogenibium azorense Az-Fu1]
gi|225644019|gb|ACN99069.1| probable endonuclease III (DNA-(apurinic orapyrimidinic site)
lyase) [Sulfurihydrogenibium azorense Az-Fu1]
Length = 216
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 11/94 (11%)
Query: 1 MAQIYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI 60
+A I S+R K+ L + R LF + + + AEEIEK+I +G K +A I
Sbjct: 40 IATILSLRTKDQITALASDR----LFKVADTPEKMVNLPAEEIEKLIYPVGFYKNKAKTI 95
Query: 61 KRFS----QEYLG---ESWTHVTQLHGVGKYAAD 87
K S ++Y G ++ + L GVG+ A+
Sbjct: 96 KEISKIILEKYAGKVPDNLEDLLSLKGVGRKTAN 129
>gi|283797428|ref|ZP_06346581.1| endonuclease III [Clostridium sp. M62/1]
gi|291074786|gb|EFE12150.1| putative endonuclease III [Clostridium sp. M62/1]
Length = 211
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
+ S+RL + V+ DL+ PD E EEIE I+ GL +A I
Sbjct: 34 LVSVRLAAQCTDARVNVVVKDLYEKFPDVNALAEAPVEEIEAIVRPCGLGHSKAKDISAC 93
Query: 61 -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
K ++ G +S+ + L GVG+ +A+
Sbjct: 94 MKMLRDQFDGRVPDSFDALLSLPGVGRKSAN 124
>gi|125624307|ref|YP_001032790.1| endonuclease III [Lactococcus lactis subsp. cremoris MG1363]
gi|389854667|ref|YP_006356911.1| endonuclease III [Lactococcus lactis subsp. cremoris NZ9000]
gi|124493115|emb|CAL98079.1| putative endonuclease III (DNA repair) [Lactococcus lactis subsp.
cremoris MG1363]
gi|300071089|gb|ADJ60489.1| putative endonuclease III (DNA repair) [Lactococcus lactis subsp.
cremoris NZ9000]
Length = 218
Score = 37.4 bits (85), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQ 65
LF PDA+ ++ EEIE +I T+GL K +A I R SQ
Sbjct: 55 LFAAFPDAQAMSQAKVEEIEALIRTIGLYKTKAKNILRTSQ 95
>gi|226324116|ref|ZP_03799634.1| hypothetical protein COPCOM_01894 [Coprococcus comes ATCC 27758]
gi|225207665|gb|EEG90019.1| putative endonuclease III [Coprococcus comes ATCC 27758]
Length = 213
Score = 37.0 bits (84), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ LF P + E D IE+II GL K +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVEGLFAKYPTVEALAEADVNNIEEIIRPCGLGKSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K EY G + + + +L GVG+ +A+
Sbjct: 94 MKMLRDEYGGKIPKDFNAILKLPGVGRKSAN 124
>gi|303274809|ref|XP_003056719.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461071|gb|EEH58364.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 348
Score = 37.0 bits (84), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/96 (26%), Positives = 40/96 (41%), Gaps = 7/96 (7%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWT------H 74
IS F P A ++ ++I LGL R + S + L E+
Sbjct: 240 TISAFFAKYPTPTAALAAKPNDVLEVIRPLGLFPNRFRSVVEISAKILTEAGPLDVGPDQ 299
Query: 75 VTQLHGVGKYAADAFAIFCTGKWDRVRPTDHMLNYY 110
+++GVG++ D+F +FC G V P D L +
Sbjct: 300 ERKIYGVGEFGIDSFNVFCRGDLS-VNPGDKTLQSF 334
>gi|417345281|ref|ZP_12125440.1| hypothetical protein LTSEBAI_5486 [Salmonella enterica subsp.
enterica serovar Baildon str. R6-199]
gi|357953150|gb|EHJ79805.1| hypothetical protein LTSEBAI_5486 [Salmonella enterica subsp.
enterica serovar Baildon str. R6-199]
Length = 114
Score = 37.0 bits (84), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%), Gaps = 1/31 (3%)
Query: 52 LQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
LQKK PM+K+F+ EY G S T +TQ+ GVG
Sbjct: 2 LQKKSNPMVKKFNFEYCGFS-TFITQVDGVG 31
>gi|397669156|ref|YP_006510691.1| endonuclease III [Propionibacterium propionicum F0230a]
gi|395142567|gb|AFN46674.1| endonuclease III [Propionibacterium propionicum F0230a]
Length = 205
Score = 37.0 bits (84), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 12/87 (13%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLG------- 69
+ +V +LF PDA+ + +++E+IIS+LGL + +A + ++ +
Sbjct: 43 RVNQVTPELFARYPDAEHLAGAEQDDVEEIISSLGLYRSKARSLTGLARALVADHDGRVP 102
Query: 70 ESWTHVTQLHGVGKYAA-----DAFAI 91
+S + +L GVG+ A DAF I
Sbjct: 103 DSLDELVKLPGVGRKTANVVLGDAFGI 129
>gi|170085275|ref|XP_001873861.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651413|gb|EDR15653.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 242
Score = 37.0 bits (84), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 22/42 (52%), Gaps = 7/42 (16%)
Query: 73 THVTQLHGVGKYAADAFAIFCT-------GKWDRVRPTDHML 107
T ++ L G G YA D++ IFCT +W V PTD L
Sbjct: 159 TPISHLPGTGPYALDSYRIFCTLHHDPSSNEWQNVTPTDKEL 200
>gi|23099212|ref|NP_692678.1| endonuclease III [Oceanobacillus iheyensis HTE831]
gi|22777440|dbj|BAC13713.1| endonuclease III (DNA repair) [Oceanobacillus iheyensis HTE831]
Length = 216
Score = 37.0 bits (84), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 45/93 (48%), Gaps = 14/93 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA+LL A +V +DLF + V+ E+E I ++GL + +A I++
Sbjct: 35 IAVLLSAQCTDKLVNKVTADLFQKYKTPEDYLSVELSELENDIRSIGLYRSKAKNIQKLC 94
Query: 65 Q----EYLGE---SWTHVTQLHGVGKYAADAFA 90
Q EY GE S + +L GVG+ A+ A
Sbjct: 95 QMLLDEYNGEIPSSKEELVKLAGVGRKTANVVA 127
>gi|225410189|ref|ZP_03761378.1| hypothetical protein CLOSTASPAR_05411 [Clostridium asparagiforme
DSM 15981]
gi|225042293|gb|EEG52539.1| hypothetical protein CLOSTASPAR_05411 [Clostridium asparagiforme
DSM 15981]
Length = 261
Score = 36.6 bits (83), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
+ S+RL + V+ DL+ PD + EEIE+I+ GL +A I
Sbjct: 77 LVSVRLAAQCTDARVNVVVQDLYAKYPDVNALADAPVEEIERIVKPCGLGHSKAKDISGC 136
Query: 61 -KRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
K +Y G + + + +L GVG+ +A+
Sbjct: 137 MKILRDQYGGKVPDDFDALLKLPGVGRKSAN 167
>gi|428769953|ref|YP_007161743.1| DNA-(apurinic or apyrimidinic site) lyase [Cyanobacterium aponinum
PCC 10605]
gi|428684232|gb|AFZ53699.1| endonuclease III [Cyanobacterium aponinum PCC 10605]
Length = 212
Score = 36.6 bits (83), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK----RFSQEYLG--- 69
+ +V LF PDA++ + D EEIE +I + G + +A I+ R ++ G
Sbjct: 49 RVNKVTPALFKRFPDAQSFAQADREEIENLIRSTGFYRNKAKNIQNACIRIVNDFDGKVP 108
Query: 70 ESWTHVTQLHGVGKYAAD 87
++ T + L GV + A+
Sbjct: 109 QTMTELLTLAGVARKTAN 126
>gi|297584383|ref|YP_003700163.1| endonuclease III [Bacillus selenitireducens MLS10]
gi|297142840|gb|ADH99597.1| endonuclease III [Bacillus selenitireducens MLS10]
Length = 217
Score = 36.6 bits (83), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 14/93 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF + V EE+E+ I ++GL + +A IK+ +
Sbjct: 35 IAVVLSAQATDSLVNKVTPRLFAKYKTPEDYANVPLEELEQDIRSIGLYRSKAKNIKKLA 94
Query: 65 Q----EYLGE---SWTHVTQLHGVGKYAADAFA 90
Q +Y GE S T + +L GVG+ A+ A
Sbjct: 95 QSVIDDYQGEIPKSKTELKKLAGVGRKTANVVA 127
>gi|295398565|ref|ZP_06808597.1| endonuclease III [Aerococcus viridans ATCC 11563]
gi|294973166|gb|EFG48961.1| endonuclease III [Aerococcus viridans ATCC 11563]
Length = 223
Score = 36.6 bits (83), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGL-------QKKRA 57
IA+L+ A +V +LF PD E + E++E I T+GL KK A
Sbjct: 47 IAVLMSAQTTDVAVNKVTPNLFAKYPDPDHMAEAELEDLESYIKTIGLYHNKAKNMKKTA 106
Query: 58 PMIKRFSQEYLGESWTHVTQLHGVGKYAAD 87
MI+ + ++ + QL GVG+ A+
Sbjct: 107 IMIRDEFNGQVPKTREELIQLPGVGRKTAN 136
>gi|389861522|ref|YP_006363762.1| A/G-specific adenine glycosylase [Thermogladius cellulolyticus
1633]
gi|388526426|gb|AFK51624.1| A/G-specific adenine glycosylase [Thermogladius cellulolyticus
1633]
Length = 134
Score = 36.6 bits (83), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 7/72 (9%)
Query: 19 GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS----QEYLG---ES 71
R L PD ++ EE+E+ + LGL + RA +K + +EY G +S
Sbjct: 57 ARAFQALVERFPDVESLHSAPLEEVEEALKPLGLYRVRARQLKELAAVIVEEYGGRIPDS 116
Query: 72 WTHVTQLHGVGK 83
W + +L GVG+
Sbjct: 117 WEELVRLPGVGR 128
>gi|320528079|ref|ZP_08029244.1| endonuclease III [Solobacterium moorei F0204]
gi|320131427|gb|EFW23992.1| endonuclease III [Solobacterium moorei F0204]
Length = 217
Score = 36.6 bits (83), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 45/96 (46%), Gaps = 14/96 (14%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+A++L A RV LF P + +++EK I++LGL + +A I F+
Sbjct: 34 VAVILSAQTTDVSVNRVTPALFKAYPSPYDLAKAPTKDVEKYIASLGLYRNKAKQIVGFA 93
Query: 65 Q----EYLGE---SWTHVTQLHGVGKYAADAFAIFC 93
Q ++ GE + +T L G+G+ A+ C
Sbjct: 94 QGVVEQFHGEVPHTMEELTTLPGIGRKCANVIMAEC 129
>gi|417377292|ref|ZP_12146247.1| hypothetical protein LTSEINV_5874 [Salmonella enterica subsp.
enterica serovar Inverness str. R8-3668]
gi|353590393|gb|EHC48930.1| hypothetical protein LTSEINV_5874 [Salmonella enterica subsp.
enterica serovar Inverness str. R8-3668]
Length = 114
Score = 36.6 bits (83), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 22/31 (70%), Gaps = 1/31 (3%)
Query: 52 LQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
LQKK PM+K F+ EY G S T +TQ+ GVG
Sbjct: 2 LQKKSNPMVKNFNFEYCGFS-TFITQVDGVG 31
>gi|291523946|emb|CBK89533.1| Predicted EndoIII-related endonuclease [Eubacterium rectale DSM
17629]
Length = 212
Score = 36.6 bits (83), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK-- 61
+ S+RL + V+ DLF P + + D +IE+I+ GL K +A I
Sbjct: 34 LVSVRLAAQCTDARVNVVVVDLFKKYPSIEALADADVSDIEEIVRPCGLGKSKARDISAC 93
Query: 62 -RFSQEYLG----ESWTHVTQLHGVGKYAAD 87
R ++ G ++ T + +L GVG+ +A+
Sbjct: 94 MRMLRDDFGGLVPDNMTDLLKLPGVGRKSAN 124
>gi|239625177|ref|ZP_04668208.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
gi|239519407|gb|EEQ59273.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
Length = 211
Score = 36.6 bits (83), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ +L+ PD E E+IE+I+ GL +A
Sbjct: 34 LVSVRLAAQCTDARVNVVVKELYERYPDVGALAEAKVEDIERIVKPCGLGHSKARDISAC 93
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K +EY G + + + +L GVG+ +A+
Sbjct: 94 MKILQEEYGGRVPDDFDALLKLPGVGRKSAN 124
>gi|18313897|ref|NP_560564.1| U/G & T/G mismatch specific glycosylase (PA-MIG) [Pyrobaculum
aerophilum str. IM2]
gi|7141252|gb|AAF37270.1|AF222335_1 U/G and T/G mismatch-specific DNA glycosylase [Pyrobaculum
aerophilum]
gi|18161465|gb|AAL64746.1| U/G & T/G mismatch specific glycosylase (Pa-MIG) [Pyrobaculum
aerophilum str. IM2]
Length = 230
Score = 36.2 bits (82), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 33/74 (44%), Gaps = 7/74 (9%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT-------QLHGVG 82
P + EEI+ II LG++ RA ++K+ S+E + + L GVG
Sbjct: 69 PSPARLADASVEEIKAIIQPLGMEHVRATLLKKLSEELVRRFNGQIPCDRDALKSLPGVG 128
Query: 83 KYAADAFAIFCTGK 96
YAA + GK
Sbjct: 129 DYAASEVLLTACGK 142
>gi|409076139|gb|EKM76513.1| hypothetical protein AGABI1DRAFT_78639 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 284
Score = 36.2 bits (82), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 34/78 (43%), Gaps = 10/78 (12%)
Query: 37 EVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTG- 95
E D ++S +K +R +++Y T ++ L G G YA D++ IFC
Sbjct: 158 EYDLRPSRPVVSPTKRSRKGGTRSRRLAEKYPP---TAISHLPGAGTYALDSYRIFCLSH 214
Query: 96 ------KWDRVRPTDHML 107
+W V PTD L
Sbjct: 215 QDPLCEEWTTVLPTDKEL 232
>gi|238923093|ref|YP_002936606.1| endonuclease III [Eubacterium rectale ATCC 33656]
gi|238874765|gb|ACR74472.1| endonuclease III [Eubacterium rectale ATCC 33656]
Length = 226
Score = 36.2 bits (82), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK-- 61
+ S+RL + V+ DLF P + + D +IE+I+ GL K +A I
Sbjct: 48 LVSVRLAAQCTDARVNVVVVDLFKKYPSIEALADADVSDIEEIVRPCGLGKSKARDISAC 107
Query: 62 -RFSQEYLG----ESWTHVTQLHGVGKYAAD 87
R ++ G ++ T + +L GVG+ +A+
Sbjct: 108 MRMLRDDFGGLVPDNMTDLLKLPGVGRKSAN 138
>gi|426194481|gb|EKV44412.1| hypothetical protein AGABI2DRAFT_74704, partial [Agaricus bisporus
var. bisporus H97]
Length = 209
Score = 36.2 bits (82), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 34/78 (43%), Gaps = 10/78 (12%)
Query: 37 EVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTG- 95
E D ++S +K +R +++Y T ++ L G G YA D++ IFC
Sbjct: 83 EYDLRPSRPVVSPTKRSRKGGTRSRRLAEKYPP---TAISHLPGAGTYALDSYRIFCLSH 139
Query: 96 ------KWDRVRPTDHML 107
+W V PTD L
Sbjct: 140 QDPLCEEWTTVLPTDKEL 157
>gi|450005195|ref|ZP_21826553.1| endonuclease III [Streptococcus mutans NMT4863]
gi|449188837|gb|EMB90527.1| endonuclease III [Streptococcus mutans NMT4863]
Length = 207
Score = 36.2 bits (82), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNCQVPQTRKELESLSGVGRKTAN 125
>gi|291528591|emb|CBK94177.1| Predicted EndoIII-related endonuclease [Eubacterium rectale M104/1]
Length = 212
Score = 36.2 bits (82), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK-- 61
+ S+RL + V+ DLF P + + D +IE+I+ GL K +A I
Sbjct: 34 LVSVRLAAQCTDARVNVVVVDLFKEYPSIEALADADVSDIEEIVRPCGLGKSKARDISAC 93
Query: 62 -RFSQEYLG----ESWTHVTQLHGVGKYAAD 87
R ++ G ++ T + +L GVG+ +A+
Sbjct: 94 MRMLRDDFGGLVPDNMTDLLKLPGVGRKSAN 124
>gi|310825899|ref|YP_003958256.1| hypothetical protein [Eubacterium limosum KIST612]
gi|308737633|gb|ADO35293.1| hypothetical protein ELI_0274 [Eubacterium limosum KIST612]
Length = 213
Score = 36.2 bits (82), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PM 59
+ S+RL + V+ +L+ PD + + EEIE I+ GL K +A
Sbjct: 36 LVSVRLAAQCTDARVNVVVKELYAKFPDVAALAQAEPEEIEAIVRPCGLGKSKARDISAC 95
Query: 60 IKRFSQEYLG---ESWTHVTQLHGVGKYAAD 87
+K +Y G + + + +L GVG+ +A+
Sbjct: 96 MKILRDQYDGMVPDDFDALLKLPGVGRKSAN 126
>gi|386319282|ref|YP_006015445.1| endonuclease III [Staphylococcus pseudintermedius ED99]
gi|323464453|gb|ADX76606.1| endonuclease III [Staphylococcus pseudintermedius ED99]
Length = 224
Score = 36.2 bits (82), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA+LL A RV DLF + VD EE+++ I ++GL + +A I++ S
Sbjct: 35 IAVLLSAQCTDNTVNRVTKDLFQKYHTPEDYLAVDLEELQQDIRSIGLYRNKAKNIQKLS 94
Query: 65 QEYLGE-----SWTH--VTQLHGVGKYAAD 87
Q L + TH + L GVG+ A+
Sbjct: 95 QSLLDQYDGIVPHTHEQLEGLAGVGRKTAN 124
>gi|307110180|gb|EFN58416.1| hypothetical protein CHLNCDRAFT_50183 [Chlorella variabilis]
Length = 295
Score = 36.2 bits (82), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/66 (22%), Positives = 29/66 (43%)
Query: 30 PDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAF 89
P + + + +++ LGLQ R + + ++L W + G + AD++
Sbjct: 128 PTPSAVIDASDDSMLEVLQPLGLQAVRLQAVWGLAHDFLAADWQSPSDFKHCGPFVADSW 187
Query: 90 AIFCTG 95
IFC G
Sbjct: 188 RIFCRG 193
>gi|319892444|ref|YP_004149319.1| endonuclease III [Staphylococcus pseudintermedius HKU10-03]
gi|317162140|gb|ADV05683.1| Endonuclease III [Staphylococcus pseudintermedius HKU10-03]
Length = 224
Score = 36.2 bits (82), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA+LL A RV DLF + VD EE+++ I ++GL + +A I++ S
Sbjct: 35 IAVLLSAQCTDNTVNRVTKDLFQKYHTPEDYLAVDLEELQQDIRSIGLYRNKAKNIQKLS 94
Query: 65 QEYLGE-----SWTH--VTQLHGVGKYAAD 87
Q L + TH + L GVG+ A+
Sbjct: 95 QSLLDQYDGIVPHTHEQLEGLAGVGRKTAN 124
>gi|259416408|ref|ZP_05740328.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B]
gi|259347847|gb|EEW59624.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B]
Length = 791
Score = 36.2 bits (82), Expect = 2.7, Method: Composition-based stats.
Identities = 30/95 (31%), Positives = 47/95 (49%), Gaps = 10/95 (10%)
Query: 10 KEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLG 69
+ + IL + V ++ K+AT+ + E +EKIIS L++++A R YLG
Sbjct: 539 ERLEILRETDPVQQEMIRFRETMKSATDAEREALEKIISKR-LEEQKAIEQTRELGSYLG 597
Query: 70 ESWTHVT-QLHGVGKYAADAFAIFCTGKWDRVRPT 103
E+ V+ L G AADA WDRV+ +
Sbjct: 598 ETMQDVSAALVSSGDAAADA--------WDRVKAS 624
>gi|238917625|ref|YP_002931142.1| DNA-(apurinic or apyrimidinic site) lyase [Eubacterium eligens ATCC
27750]
gi|238872985|gb|ACR72695.1| DNA-(apurinic or apyrimidinic site) lyase [Eubacterium eligens ATCC
27750]
Length = 213
Score = 36.2 bits (82), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 7/91 (7%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI--- 60
+ S+RL + V+ L+ PD + EEIEKII GL K +A I
Sbjct: 34 LVSVRLAAQCTDARVNVVVKGLYEKYPDIASLAAASPEEIEKIIRPCGLGKSKAKDICAC 93
Query: 61 -KRFSQEY---LGESWTHVTQLHGVGKYAAD 87
+ ++Y + +S + +L GVG+ +A+
Sbjct: 94 MRMLHEQYNDKVPDSMEELLKLPGVGRKSAN 124
>gi|160893761|ref|ZP_02074545.1| hypothetical protein CLOL250_01315 [Clostridium sp. L2-50]
gi|156864746|gb|EDO58177.1| endonuclease III [Clostridium sp. L2-50]
Length = 214
Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 37/76 (48%), Gaps = 7/76 (9%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI----KRFSQEYLGESWTHV- 75
V DLF P + + D +E+EK I + G K +A I K+ EY GE + +
Sbjct: 54 VTKDLFVKYPTLQAFADADIKELEKDIYSTGFYKNKAKNIIGCAKKLISEYGGEVPSDIE 113
Query: 76 --TQLHGVGKYAADAF 89
T+L GVG+ A+
Sbjct: 114 SLTKLDGVGRKTANVI 129
>gi|448664779|ref|ZP_21684417.1| HhH-GPD family protein [Haloarcula amylolytica JCM 13557]
gi|445774366|gb|EMA25387.1| HhH-GPD family protein [Haloarcula amylolytica JCM 13557]
Length = 214
Score = 35.8 bits (81), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 43/98 (43%), Gaps = 6/98 (6%)
Query: 4 IYSIRLKEIAILLKAGRVISDLFT----LCPDAKTATEVDAEEIEKIISTLGLQKKRAPM 59
+Y + + E + R +++++ PD + EEI +I LG R+
Sbjct: 40 LYEVFVAEFFLTQTPARNVAEVYPEFLERFPDLRAIDSATKEEIVAVIEPLGFYNMRSEA 99
Query: 60 IKRFSQEY--LGESWTHVTQLHGVGKYAADAFAIFCTG 95
+K + E L E+ +T+L VG Y A+A F G
Sbjct: 100 LKSIAAEMSELSEAVEELTELPRVGHYIANATLCFSRG 137
>gi|448626677|ref|ZP_21671456.1| HhH-GPD family protein [Haloarcula vallismortis ATCC 29715]
gi|445760289|gb|EMA11553.1| HhH-GPD family protein [Haloarcula vallismortis ATCC 29715]
Length = 264
Score = 35.8 bits (81), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/84 (26%), Positives = 39/84 (46%), Gaps = 7/84 (8%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKR----FSQEYLGESWTHV 75
++ SD+ P + + EEI +++ +LG R ++ F+QE+ GE +
Sbjct: 93 KIYSDVIQQFPTPEALNQASEEEIREVVHSLGFVNHRQRTLEEVGRIFTQEFNGEIPNSI 152
Query: 76 TQLHG---VGKYAADAFAIFCTGK 96
L VG Y+A A +F G+
Sbjct: 153 EDLKKPWRVGDYSARATQLFARGE 176
>gi|387785664|ref|YP_006250760.1| putative endonuclease III [Streptococcus mutans LJ23]
gi|449893396|ref|ZP_21788688.1| endonuclease III [Streptococcus mutans SF12]
gi|450046095|ref|ZP_21838755.1| endonuclease III [Streptococcus mutans N34]
gi|379132065|dbj|BAL68817.1| putative endonuclease III [Streptococcus mutans LJ23]
gi|449199269|gb|EMC00343.1| endonuclease III [Streptococcus mutans N34]
gi|449255827|gb|EMC53667.1| endonuclease III [Streptococcus mutans SF12]
Length = 207
Score = 35.8 bits (81), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|449910114|ref|ZP_21794551.1| endonuclease III [Streptococcus mutans OMZ175]
gi|449260261|gb|EMC57765.1| endonuclease III [Streptococcus mutans OMZ175]
Length = 207
Score = 35.8 bits (81), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|41615185|ref|NP_963683.1| hypothetical protein NEQ398 [Nanoarchaeum equitans Kin4-M]
gi|40068909|gb|AAR39244.1| NEQ398 [Nanoarchaeum equitans Kin4-M]
Length = 212
Score = 35.8 bits (81), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 11/101 (10%)
Query: 1 MAQIYSIRLKEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI 60
+A + SIR +E + R +L+ D K + EEIE +I +GL K++A I
Sbjct: 40 IATVLSIRTREEQTI----RASLNLYNKYKDYKNLAKAPIEEIEDLIKNVGLYKQKAKWI 95
Query: 61 KRFSQEY-----LGESWTHVTQLHGVGKYAADAFAIFCTGK 96
K +Q + ES+ + L GVG+ + + K
Sbjct: 96 KTIAQRWDYNKKCDESF--IRNLPGVGRKVGNVYLNLVCNK 134
>gi|393228457|gb|EJD36103.1| hypothetical protein AURDEDRAFT_74606, partial [Auricularia
delicata TFB-10046 SS5]
Length = 201
Score = 35.8 bits (81), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 33/132 (25%), Positives = 51/132 (38%), Gaps = 33/132 (25%)
Query: 21 VISDLFTLCPDAKTATEVDA--EEIEKIISTLGLQKKRAPMIKRFSQEYLG--------- 69
+ DL P A V A E++ ++S LGLQ RA + S+ Y+
Sbjct: 28 IFWDLMERWPTAHEMQHVAAPLEQLVTMLSGLGLQNIRAKRLIDLSRAYVACPPDPRVLY 87
Query: 70 ---------------------ESWTHVTQLHGVGKYAADAFAIFCT-GKWDRVRPTDHML 107
T ++ L G G+YA D++ IFC +W + P D L
Sbjct: 88 ASKSVSKLTEIGLASQFYDHPYPPTAISHLPGAGRYALDSYRIFCVDDEWRQCYPRDKEL 147
Query: 108 NYYWEFLVSTKG 119
Y E+ + +G
Sbjct: 148 VRYLEWKWACEG 159
>gi|242223699|ref|XP_002477437.1| predicted protein [Postia placenta Mad-698-R]
gi|220722987|gb|EED77364.1| predicted protein [Postia placenta Mad-698-R]
Length = 834
Score = 35.8 bits (81), Expect = 3.6, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 31/70 (44%), Gaps = 17/70 (24%)
Query: 50 LGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAFAIFCTG--KWDRVRPTDHML 107
+G Q++R P T + L G G YA D++ IFC G +W V P D L
Sbjct: 102 VGAQRQRYPP-------------TEASHLPGSGPYALDSYRIFCAGEDEWKAVMPRDKEL 148
Query: 108 NYY--WEFLV 115
Y W + V
Sbjct: 149 VRYLRWNWAV 158
>gi|449944366|ref|ZP_21806723.1| endonuclease III [Streptococcus mutans 11A1]
gi|449148593|gb|EMB52450.1| endonuclease III [Streptococcus mutans 11A1]
Length = 207
Score = 35.8 bits (81), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|428781526|ref|YP_007173312.1| endonuclease III [Dactylococcopsis salina PCC 8305]
gi|428695805|gb|AFZ51955.1| endonuclease III [Dactylococcopsis salina PCC 8305]
Length = 211
Score = 35.8 bits (81), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAP-------MIKRFSQEYLG 69
+ +V DLF PDA+T + + EEIE +I + G + +A MI Q +
Sbjct: 49 RVNKVTPDLFARFPDAETMAKANREEIETLIRSTGFYRNKAKNIQGACEMIVSDFQGKVP 108
Query: 70 ESWTHVTQLHGVGKYAAD 87
++ + +L GV + A+
Sbjct: 109 DTMEDLLKLPGVARKTAN 126
>gi|397650248|ref|YP_006490775.1| endonuclease III [Streptococcus mutans GS-5]
gi|449874957|ref|ZP_21781920.1| endonuclease III [Streptococcus mutans S1B]
gi|449902335|ref|ZP_21791522.1| endonuclease III [Streptococcus mutans M230]
gi|449918471|ref|ZP_21797359.1| endonuclease III [Streptococcus mutans 1SM1]
gi|449974219|ref|ZP_21815157.1| endonuclease III [Streptococcus mutans 11VS1]
gi|449990607|ref|ZP_21821644.1| endonuclease III [Streptococcus mutans NVAB]
gi|450074616|ref|ZP_21849400.1| endonuclease III [Streptococcus mutans M2A]
gi|450098380|ref|ZP_21857976.1| endonuclease III [Streptococcus mutans SF1]
gi|450104976|ref|ZP_21859580.1| endonuclease III [Streptococcus mutans SF14]
gi|450114998|ref|ZP_21863657.1| endonuclease III [Streptococcus mutans ST1]
gi|450171669|ref|ZP_21884123.1| endonuclease III [Streptococcus mutans SM4]
gi|392603817|gb|AFM81981.1| endonuclease III [Streptococcus mutans GS-5]
gi|449160315|gb|EMB63588.1| endonuclease III [Streptococcus mutans 1SM1]
gi|449178595|gb|EMB80848.1| endonuclease III [Streptococcus mutans 11VS1]
gi|449181545|gb|EMB83626.1| endonuclease III [Streptococcus mutans NVAB]
gi|449208905|gb|EMC09461.1| endonuclease III [Streptococcus mutans M2A]
gi|449221577|gb|EMC21345.1| endonuclease III [Streptococcus mutans SF1]
gi|449225196|gb|EMC24807.1| endonuclease III [Streptococcus mutans SF14]
gi|449228467|gb|EMC27832.1| endonuclease III [Streptococcus mutans ST1]
gi|449243721|gb|EMC42130.1| endonuclease III [Streptococcus mutans SM4]
gi|449254335|gb|EMC52244.1| endonuclease III [Streptococcus mutans S1B]
gi|449262861|gb|EMC60300.1| endonuclease III [Streptococcus mutans M230]
Length = 207
Score = 35.8 bits (81), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|189499467|ref|YP_001958937.1| endonuclease III [Chlorobium phaeobacteroides BS1]
gi|189494908|gb|ACE03456.1| endonuclease III [Chlorobium phaeobacteroides BS1]
Length = 220
Score = 35.8 bits (81), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 40/74 (54%), Gaps = 7/74 (9%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI----KRFSQEYLGE---SWT 73
+ ++LF+ CPDA++ + ++ +E++ II ++ +A I + + Y GE +
Sbjct: 60 ITAELFSRCPDAESMSRIELDELKTIIRSINYYNNKAKNILAASRMLVESYKGEVPSTRE 119
Query: 74 HVTQLHGVGKYAAD 87
+ L GVG+ A+
Sbjct: 120 KLESLPGVGRKTAN 133
>gi|450087653|ref|ZP_21854396.1| endonuclease III [Streptococcus mutans NV1996]
gi|449217561|gb|EMC17603.1| endonuclease III [Streptococcus mutans NV1996]
Length = 207
Score = 35.8 bits (81), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKDLESLSGVGRKTAN 125
>gi|450049677|ref|ZP_21839798.1| endonuclease III [Streptococcus mutans NFSM1]
gi|449203428|gb|EMC04287.1| endonuclease III [Streptococcus mutans NFSM1]
Length = 207
Score = 35.8 bits (81), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|449999353|ref|ZP_21824493.1| endonuclease III [Streptococcus mutans N29]
gi|449187089|gb|EMB88890.1| endonuclease III [Streptococcus mutans N29]
Length = 207
Score = 35.8 bits (81), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLPGVGRKTAN 125
>gi|450127488|ref|ZP_21868617.1| endonuclease III [Streptococcus mutans U2A]
gi|449230689|gb|EMC29938.1| endonuclease III [Streptococcus mutans U2A]
Length = 207
Score = 35.4 bits (80), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + + Q L GVG+ A+
Sbjct: 96 QQLVDHYNGQIPQTRKELESLSGVGRKTAN 125
>gi|290579993|ref|YP_003484385.1| endonuclease III [Streptococcus mutans NN2025]
gi|449965328|ref|ZP_21811829.1| endonuclease III [Streptococcus mutans 15VF2]
gi|449969031|ref|ZP_21812970.1| endonuclease III [Streptococcus mutans 2VS1]
gi|449980966|ref|ZP_21817506.1| endonuclease III [Streptococcus mutans 5SM3]
gi|450031273|ref|ZP_21833606.1| endonuclease III [Streptococcus mutans G123]
gi|450035307|ref|ZP_21834948.1| endonuclease III [Streptococcus mutans M21]
gi|450056223|ref|ZP_21841930.1| endonuclease III [Streptococcus mutans NLML4]
gi|450078846|ref|ZP_21851158.1| endonuclease III [Streptococcus mutans N3209]
gi|450094137|ref|ZP_21856911.1| endonuclease III [Streptococcus mutans W6]
gi|450112047|ref|ZP_21863022.1| endonuclease III [Streptococcus mutans SM6]
gi|450148335|ref|ZP_21875573.1| endonuclease III [Streptococcus mutans 14D]
gi|450166037|ref|ZP_21882131.1| endonuclease III [Streptococcus mutans B]
gi|450181072|ref|ZP_21887605.1| endonuclease III [Streptococcus mutans 24]
gi|254996892|dbj|BAH87493.1| putative endonuclease III [Streptococcus mutans NN2025]
gi|449171325|gb|EMB73990.1| endonuclease III [Streptococcus mutans 15VF2]
gi|449174470|gb|EMB76956.1| endonuclease III [Streptococcus mutans 2VS1]
gi|449176154|gb|EMB78513.1| endonuclease III [Streptococcus mutans 5SM3]
gi|449191781|gb|EMB93241.1| endonuclease III [Streptococcus mutans G123]
gi|449195672|gb|EMB96985.1| endonuclease III [Streptococcus mutans M21]
gi|449207001|gb|EMC07685.1| endonuclease III [Streptococcus mutans NLML4]
gi|449209707|gb|EMC10215.1| endonuclease III [Streptococcus mutans N3209]
gi|449216619|gb|EMC16721.1| endonuclease III [Streptococcus mutans W6]
gi|449222801|gb|EMC22516.1| endonuclease III [Streptococcus mutans SM6]
gi|449236015|gb|EMC34951.1| endonuclease III [Streptococcus mutans 14D]
gi|449240041|gb|EMC38736.1| endonuclease III [Streptococcus mutans B]
gi|449247360|gb|EMC45643.1| endonuclease III [Streptococcus mutans 24]
Length = 207
Score = 35.4 bits (80), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|449886259|ref|ZP_21786083.1| endonuclease III [Streptococcus mutans SA41]
gi|449899436|ref|ZP_21791015.1| endonuclease III [Streptococcus mutans R221]
gi|449914623|ref|ZP_21795691.1| endonuclease III [Streptococcus mutans 15JP3]
gi|449924022|ref|ZP_21799362.1| endonuclease III [Streptococcus mutans 4SM1]
gi|449932132|ref|ZP_21802695.1| endonuclease III [Streptococcus mutans 3SN1]
gi|449936391|ref|ZP_21803955.1| endonuclease III [Streptococcus mutans 2ST1]
gi|449947418|ref|ZP_21807363.1| endonuclease III [Streptococcus mutans 11SSST2]
gi|449995317|ref|ZP_21822937.1| endonuclease III [Streptococcus mutans A9]
gi|450009910|ref|ZP_21828414.1| endonuclease III [Streptococcus mutans A19]
gi|450024675|ref|ZP_21831355.1| endonuclease III [Streptococcus mutans U138]
gi|450039060|ref|ZP_21836029.1| endonuclease III [Streptococcus mutans T4]
gi|450062699|ref|ZP_21844476.1| endonuclease III [Streptococcus mutans NLML5]
gi|450121163|ref|ZP_21866166.1| endonuclease III [Streptococcus mutans ST6]
gi|450155949|ref|ZP_21878556.1| endonuclease III [Streptococcus mutans 21]
gi|450175809|ref|ZP_21885408.1| endonuclease III [Streptococcus mutans SM1]
gi|449157726|gb|EMB61161.1| endonuclease III [Streptococcus mutans 15JP3]
gi|449161745|gb|EMB64919.1| endonuclease III [Streptococcus mutans 3SN1]
gi|449163319|gb|EMB66427.1| endonuclease III [Streptococcus mutans 4SM1]
gi|449165606|gb|EMB68596.1| endonuclease III [Streptococcus mutans 2ST1]
gi|449168603|gb|EMB71411.1| endonuclease III [Streptococcus mutans 11SSST2]
gi|449184612|gb|EMB86545.1| endonuclease III [Streptococcus mutans A9]
gi|449190539|gb|EMB92101.1| endonuclease III [Streptococcus mutans A19]
gi|449191724|gb|EMB93190.1| endonuclease III [Streptococcus mutans U138]
gi|449200716|gb|EMC01737.1| endonuclease III [Streptococcus mutans T4]
gi|449205431|gb|EMC06178.1| endonuclease III [Streptococcus mutans NLML5]
gi|449229433|gb|EMC28747.1| endonuclease III [Streptococcus mutans ST6]
gi|449236511|gb|EMC35426.1| endonuclease III [Streptococcus mutans 21]
gi|449246187|gb|EMC44499.1| endonuclease III [Streptococcus mutans SM1]
gi|449254438|gb|EMC52346.1| endonuclease III [Streptococcus mutans SA41]
gi|449258367|gb|EMC55949.1| endonuclease III [Streptococcus mutans R221]
Length = 207
Score = 35.4 bits (80), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|450134132|ref|ZP_21870967.1| endonuclease III [Streptococcus mutans NLML8]
gi|449149928|gb|EMB53709.1| endonuclease III [Streptococcus mutans NLML8]
Length = 207
Score = 35.4 bits (80), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLPGVGRKTAN 125
>gi|450159908|ref|ZP_21879709.1| endonuclease III [Streptococcus mutans 66-2A]
gi|449240672|gb|EMC39335.1| endonuclease III [Streptococcus mutans 66-2A]
Length = 207
Score = 35.4 bits (80), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|150015469|ref|YP_001307723.1| endonuclease III [Clostridium beijerinckii NCIMB 8052]
gi|149901934|gb|ABR32767.1| endonuclease III [Clostridium beijerinckii NCIMB 8052]
Length = 210
Score = 35.4 bits (80), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 7/78 (8%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAP----MIKRFSQEYLGE-- 70
K V DLF PD + + EE+EK I +GL + ++ M + +++ GE
Sbjct: 45 KVNEVTKDLFKDYPDLDSLLTLTNEELEKRIKQIGLYRNKSKNLILMFNQLKEKFNGEVP 104
Query: 71 -SWTHVTQLHGVGKYAAD 87
+ +T L G G+ A+
Sbjct: 105 KTMEEITSLAGAGRKTAN 122
>gi|400289761|ref|ZP_10791788.1| endonuclease III [Streptococcus ratti FA-1 = DSM 20564]
gi|399920552|gb|EJN93369.1| endonuclease III [Streptococcus ratti FA-1 = DSM 20564]
Length = 209
Score = 35.4 bits (80), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 43/90 (47%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + + ++IE IS +GL + +A +K S
Sbjct: 36 IAVVLSAQTTDAAVNKVTPALFAAYPQPKDLAKANLKDIESYISKIGLYRNKAKFLKECS 95
Query: 65 QE----YLGE---SWTHVTQLHGVGKYAAD 87
Q+ Y GE + + L GVG+ A+
Sbjct: 96 QQLMDRYGGEVPHTRKELESLSGVGRKTAN 125
>gi|55980081|ref|YP_143378.1| endonuclease III [Thermus thermophilus HB8]
gi|55771494|dbj|BAD69935.1| endonuclease III [Thermus thermophilus HB8]
Length = 220
Score = 35.4 bits (80), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 32/65 (49%), Gaps = 7/65 (10%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PMIKRFSQEYLGESWTH---VTQ 77
LF PDAK E EE+E I +GL + +A + +R +EY GE + +
Sbjct: 66 LFARFPDAKALAEATPEEVEPYIRRIGLYRTKAKNLVALARRLVEEYGGEVPKEKEALMR 125
Query: 78 LHGVG 82
L GVG
Sbjct: 126 LPGVG 130
>gi|309775704|ref|ZP_07670702.1| endonuclease III [Erysipelotrichaceae bacterium 3_1_53]
gi|308916543|gb|EFP62285.1| endonuclease III [Erysipelotrichaceae bacterium 3_1_53]
Length = 215
Score = 35.4 bits (80), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 41/96 (42%), Gaps = 14/96 (14%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+A++L A +V LF P + E ++IE I +GL + +A I+ S
Sbjct: 31 VAVVLSAQTTDAAVNKVTPALFEAFPTPQAMAEAQLQDIEDKIRRIGLYRNKAHSIQNLS 90
Query: 65 QEYLG-------ESWTHVTQLHGVGKYAADAFAIFC 93
+ L ES +T L GVG+ A+ C
Sbjct: 91 RSLLDSFDGVVPESMKDLTSLAGVGRKTANVVRSVC 126
>gi|46200194|ref|YP_005861.1| endonuclease III [Thermus thermophilus HB27]
gi|46197822|gb|AAS82234.1| endonuclease III [Thermus thermophilus HB27]
Length = 220
Score = 35.4 bits (80), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 32/65 (49%), Gaps = 7/65 (10%)
Query: 25 LFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PMIKRFSQEYLGESWTH---VTQ 77
LF PDAK E EE+E I +GL + +A + +R +EY GE + +
Sbjct: 66 LFARFPDAKALAEATPEEVEPYIRRIGLYRTKAKNLVALARRLVEEYGGEVPKEKEALMR 125
Query: 78 LHGVG 82
L GVG
Sbjct: 126 LPGVG 130
>gi|218247153|ref|YP_002372524.1| endonuclease III [Cyanothece sp. PCC 8801]
gi|257060225|ref|YP_003138113.1| endonuclease III [Cyanothece sp. PCC 8802]
gi|218167631|gb|ACK66368.1| endonuclease III [Cyanothece sp. PCC 8801]
gi|256590391|gb|ACV01278.1| endonuclease III [Cyanothece sp. PCC 8802]
Length = 220
Score = 35.4 bits (80), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 17 KAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI----KRFSQEYLGESW 72
+ +V LF+ PDA + + D EE+E+II + G + +A I ++ +E+ G+
Sbjct: 54 RVNKVTPKLFSRFPDALSLAKADREELEEIIRSTGFYRNKAKNIQGACQKIVKEFEGQVP 113
Query: 73 THVTQ---LHGVGKYAAD 87
+ Q L GV + A+
Sbjct: 114 KQMEQLLSLPGVARKTAN 131
>gi|449956391|ref|ZP_21809487.1| endonuclease III [Streptococcus mutans 4VF1]
gi|450138345|ref|ZP_21872114.1| endonuclease III [Streptococcus mutans NLML1]
gi|450143105|ref|ZP_21873238.1| endonuclease III [Streptococcus mutans 1ID3]
gi|449152716|gb|EMB56416.1| endonuclease III [Streptococcus mutans 1ID3]
gi|449170673|gb|EMB73369.1| endonuclease III [Streptococcus mutans 4VF1]
gi|449234128|gb|EMC33152.1| endonuclease III [Streptococcus mutans NLML1]
Length = 207
Score = 35.4 bits (80), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPVLFAAYPGPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + V Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQVPQTRKELESLSGVGRKTAN 125
>gi|333995081|ref|YP_004527694.1| endonuclease III [Treponema azotonutricium ZAS-9]
gi|333735400|gb|AEF81349.1| endonuclease III [Treponema azotonutricium ZAS-9]
Length = 229
Score = 35.4 bits (80), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 36/82 (43%), Gaps = 7/82 (8%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL-------GESWT 73
V +LF+ PD E E+EKI+ + G + +A IK S++ L ++
Sbjct: 71 VTRELFSRFPDPAALAEAPLPELEKIVHSTGFFRAKAHNIKALSRQLLEKHSGEVPDTME 130
Query: 74 HVTQLHGVGKYAADAFAIFCTG 95
+T L G G+ A C G
Sbjct: 131 GLTALPGAGRKTASVVLSTCFG 152
>gi|407795573|ref|ZP_11142531.1| endonuclease III [Salimicrobium sp. MJ3]
gi|407019914|gb|EKE32628.1| endonuclease III [Salimicrobium sp. MJ3]
Length = 216
Score = 35.0 bits (79), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 14/99 (14%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+A++L A +V DLF + V+ EE+EK I ++GL + +A +K+ +
Sbjct: 35 VAVILSAQATDNLVNKVTPDLFRKYKSPEDIAAVETEELEKDIKSIGLYRSKAKNLKKMA 94
Query: 65 QEYL-------GESWTHVTQLHGVGKYAADAFAIFCTGK 96
L ES + L GVG+ A+ A G+
Sbjct: 95 TTLLERFDGEVPESKESLESLAGVGRKTANVVASVAFGE 133
>gi|119719909|ref|YP_920404.1| HhH-GPD family protein [Thermofilum pendens Hrk 5]
gi|119525029|gb|ABL78401.1| HhH-GPD family protein [Thermofilum pendens Hrk 5]
Length = 253
Score = 35.0 bits (79), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 8/80 (10%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQE-------YLGESW 72
+V + F+ P ++ D EE+ K S LGL +RA ++ ++E + S
Sbjct: 85 KVFEEFFSRYPSPESLANADPEELRKFFSRLGLV-RRADALREAAREIVERHGGSVPRSK 143
Query: 73 THVTQLHGVGKYAADAFAIF 92
+ +L GVG Y A A F
Sbjct: 144 EELLKLKGVGPYIASAVLCF 163
>gi|449881938|ref|ZP_21784703.1| endonuclease III [Streptococcus mutans SA38]
gi|449250747|gb|EMC48796.1| endonuclease III [Streptococcus mutans SA38]
Length = 207
Score = 35.0 bits (79), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKECS 95
Query: 65 QE----YLG---ESWTHVTQLHGVGKYAAD 87
Q+ Y G ++ + L GVG+ A+
Sbjct: 96 QQLVEYYNGQVPQTRKELESLSGVGRKTAN 125
>gi|409387026|ref|ZP_11239323.1| Endonuclease III [Lactococcus raffinolactis 4877]
gi|399205796|emb|CCK20238.1| Endonuclease III [Lactococcus raffinolactis 4877]
Length = 214
Score = 34.7 bits (78), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 19/99 (19%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A + LF PD+++ D ++E +I T+GL + +A I R +
Sbjct: 35 IAVILSAQATDKGVNKATPALFAKYPDSQSLASADVSDVENLIKTIGLYRSKAKNIVRTA 94
Query: 65 Q----EYLGE---SWTHVTQLHGVGKYAA-----DAFAI 91
+ E+ GE + L GVG+ A DAF I
Sbjct: 95 EMIETEFGGEIPRDKDLLQTLPGVGRKTANVVLGDAFGI 133
>gi|325663082|ref|ZP_08151532.1| hypothetical protein HMPREF0490_02272 [Lachnospiraceae bacterium
4_1_37FAA]
gi|325470536|gb|EGC73766.1| hypothetical protein HMPREF0490_02272 [Lachnospiraceae bacterium
4_1_37FAA]
Length = 213
Score = 34.7 bits (78), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 7/74 (9%)
Query: 21 VISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRA----PMIKRFSQEYLGE---SWT 73
V+ L+ PD + D EIE+I+ GL K +A +K ++Y G+ ++
Sbjct: 54 VVEKLYEKFPDVDALADADVAEIEEIVRPCGLGKSKARDISACMKILKEQYGGQIPKTFD 113
Query: 74 HVTQLHGVGKYAAD 87
+ +L GVG+ +A+
Sbjct: 114 ELLKLPGVGRKSAN 127
>gi|408403428|ref|YP_006861411.1| endonuclease III [Candidatus Nitrososphaera gargensis Ga9.2]
gi|408364024|gb|AFU57754.1| putative endonuclease III [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 223
Score = 34.7 bits (78), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 7/84 (8%)
Query: 20 RVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVT--- 76
RV++ LF + D +EI+KII ++G +A IK+ SQ + + V
Sbjct: 54 RVVNKLFARFKTPQDLASADIDEIKKIIHSIGFYNIKAERIKQVSQMLISKFGGRVPSDI 113
Query: 77 ----QLHGVGKYAADAFAIFCTGK 96
+L GVG+ A+ ++ K
Sbjct: 114 DSLLKLPGVGRKTANCVLVYAFDK 137
>gi|373494784|ref|ZP_09585383.1| endonuclease III [Eubacterium infirmum F0142]
gi|371967828|gb|EHO85296.1| endonuclease III [Eubacterium infirmum F0142]
Length = 207
Score = 34.7 bits (78), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 39/76 (51%), Gaps = 7/76 (9%)
Query: 19 GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAP----MIKRFSQEYLG---ES 71
+V +LF PDA + + ++E+ I +G+ + ++ M K EY G ++
Sbjct: 48 NKVTPELFKAYPDANALAKAEQSDVEEYIKKIGMYRTKSKNIINMAKSLVAEYDGLVPQN 107
Query: 72 WTHVTQLHGVGKYAAD 87
+ +T+L GVG+ A+
Sbjct: 108 FDALTKLSGVGRKTAN 123
>gi|422855015|ref|ZP_16901673.1| endonuclease III [Streptococcus sanguinis SK1]
gi|422875881|ref|ZP_16922351.1| endonuclease III [Streptococcus sanguinis SK1056]
gi|327462992|gb|EGF09313.1| endonuclease III [Streptococcus sanguinis SK1]
gi|332362317|gb|EGJ40117.1| endonuclease III [Streptococcus sanguinis SK1056]
Length = 209
Score = 34.7 bits (78), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
+A++L A + DLF P + +I K IS LGL + +A +K+ +
Sbjct: 36 VAVMLSAQTTDAAVNKATPDLFEAYPSPQDMAMAGEADIAKYISRLGLYRNKAKFLKKCA 95
Query: 65 QEYLG-------ESWTHVTQLHGVGKYAAD 87
Q+ L ++ T + L GVG+ A+
Sbjct: 96 QQLLDDFDGQVPQTRTELESLAGVGRKTAN 125
>gi|418054692|ref|ZP_12692748.1| HhH-GPD family protein [Hyphomicrobium denitrificans 1NES1]
gi|353212317|gb|EHB77717.1| HhH-GPD family protein [Hyphomicrobium denitrificans 1NES1]
Length = 271
Score = 34.3 bits (77), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 14/66 (21%)
Query: 47 ISTLGLQKKRAPMIKRFSQEY------------LGESWTH--VTQLHGVGKYAADAFAIF 92
++ LGL + +K +QE LG H +T LHGVG + AD + +F
Sbjct: 92 LARLGLSAAKIKTLKHIAQEIAEERLNLDALADLGADEAHRTLTSLHGVGPWTADIYLLF 151
Query: 93 CTGKWD 98
C G D
Sbjct: 152 CLGHGD 157
>gi|325291271|ref|YP_004267452.1| A/G-specific adenine glycosylase [Syntrophobotulus glycolicus DSM
8271]
gi|324966672|gb|ADY57451.1| A/G-specific adenine glycosylase [Syntrophobotulus glycolicus DSM
8271]
Length = 361
Score = 34.3 bits (77), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 45/102 (44%), Gaps = 10/102 (9%)
Query: 5 YSIRLKEIAILLKAGRVISDLFTL----CPDAKTATEVDAEEIEKIISTLGLQKKRAPMI 60
Y I L EI + V+ +T P + + D +++ K+ LG + +
Sbjct: 47 YHIWLSEIMLQQTRVEVVKTYYTRFLEEIPTVEELAQTDEQKLLKLWEGLGYYSRARNLQ 106
Query: 61 K---RFSQEYLG---ESWTHVTQLHGVGKYAADAFAIFCTGK 96
K R EY+G E++ + +L GVG Y A A A C G+
Sbjct: 107 KTARRIVDEYVGHFPETYEQLLKLPGVGPYTAGAIASICFGQ 148
>gi|24380028|ref|NP_721983.1| endonuclease III (DNA repair) [Streptococcus mutans UA159]
gi|449865177|ref|ZP_21778838.1| endonuclease III [Streptococcus mutans U2B]
gi|449872317|ref|ZP_21781456.1| endonuclease III [Streptococcus mutans 8ID3]
gi|449984184|ref|ZP_21818870.1| endonuclease III [Streptococcus mutans NFSM2]
gi|450081459|ref|ZP_21851742.1| endonuclease III [Streptococcus mutans N66]
gi|24378018|gb|AAN59289.1|AE014995_6 putative endonuclease III (DNA repair) [Streptococcus mutans UA159]
gi|449154570|gb|EMB58143.1| endonuclease III [Streptococcus mutans 8ID3]
gi|449180435|gb|EMB82592.1| endonuclease III [Streptococcus mutans NFSM2]
gi|449215264|gb|EMC15471.1| endonuclease III [Streptococcus mutans N66]
gi|449264440|gb|EMC61782.1| endonuclease III [Streptococcus mutans U2B]
Length = 207
Score = 34.3 bits (77), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 14/90 (15%)
Query: 12 IAILLKA-------GRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFS 64
IA++L A +V LF P K + D +++E IS +GL + +A +K S
Sbjct: 36 IAVILSAQTTDAAVNKVTPALFAAYPRPKDLAKADLKDLESYISQIGLYRNKAKFLKGCS 95
Query: 65 QEYLGESWTHVTQ-------LHGVGKYAAD 87
Q+ + + Q L GVG+ A+
Sbjct: 96 QQLVEHYNGQIPQTRKELESLSGVGRKTAN 125
>gi|331092230|ref|ZP_08341058.1| hypothetical protein HMPREF9477_01701 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401662|gb|EGG81241.1| hypothetical protein HMPREF9477_01701 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 145
Score = 34.3 bits (77), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 15/46 (32%), Positives = 25/46 (54%)
Query: 37 EVDAEEIEKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVG 82
E++ +E EK+I + L K P+ K Q + W + T+L G+G
Sbjct: 83 EIEDQEFEKMIRVIALDKMNIPVWKLVEQTHNEAPWKYTTELFGMG 128
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.136 0.412
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,799,820,828
Number of Sequences: 23463169
Number of extensions: 64725951
Number of successful extensions: 160939
Number of sequences better than 100.0: 285
Number of HSP's better than 100.0 without gapping: 166
Number of HSP's successfully gapped in prelim test: 119
Number of HSP's that attempted gapping in prelim test: 160744
Number of HSP's gapped (non-prelim): 297
length of query: 121
length of database: 8,064,228,071
effective HSP length: 88
effective length of query: 33
effective length of database: 5,999,469,199
effective search space: 197982483567
effective search space used: 197982483567
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)