BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001299
         (1105 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1
            SV=2
          Length = 1987

 Score =  628 bits (1620), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)

Query: 568  WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 624
            WD LR+     E   ER+ + MDS+D+EA+R A  SEI++AIKERG  N++A RIK+FL 
Sbjct: 1450 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1509

Query: 625  RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
            R+V+ HG IDLEWLR  PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1510 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1569

Query: 685  VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 744
            VR+GWVPLQPLP  L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1570 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1629

Query: 745  KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 804
             K  PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++         PP+ +  I +
Sbjct: 1630 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIEL 1689

Query: 805  -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 863
             + +E    S +     NCEPIIEEP SPG +C+E  ES+I+D  ++  D +EIPT++L 
Sbjct: 1690 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1748

Query: 864  EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 920
              +F       ME N + LQ+   S ALVAL     S+PT KLK  + LRTEH VY LPD
Sbjct: 1749 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1807

Query: 921  AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 979
            +H LL G D+R+PDDPSPYLLAIW+PGE+ NS   P+ +C  K S K+C  E C  CNS+
Sbjct: 1808 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1867

Query: 980  CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1039
             E N+  VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S  PI+VPR  I +L R 
Sbjct: 1868 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1927

Query: 1040 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
             V FG+S T++F  L   QIQ  FW+GFVCVRGF+++ R PRPL+ RLH   SK
Sbjct: 1928 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1981



 Score =  137 bits (344), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 88/194 (45%), Positives = 122/194 (62%), Gaps = 20/194 (10%)

Query: 76   LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 135
            L Q + A+II + ++L + D   +               NA+V Y+       GALVP++
Sbjct: 904  LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 946

Query: 136  IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 195
               K++K +P+V +D  T R+WNL+M   D    D+  ++  EKWW++ER VF GR  SF
Sbjct: 947  --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 1004

Query: 196  TARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS 255
             ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP K ++     
Sbjct: 1005 IARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSRE-D 1063

Query: 256  DENLRTTASLEPIG 269
            + N+R+    +P G
Sbjct: 1064 ERNVRSVVVEDPEG 1077


>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2
          Length = 1393

 Score =  622 bits (1605), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 312/538 (57%), Positives = 374/538 (69%), Gaps = 22/538 (4%)

Query: 567  DWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 623
            DWD LRR     +   E++   MD+VDW+A+R AD  E+A+ IK RG  + +A RI+ FL
Sbjct: 858  DWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFL 917

Query: 624  NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 683
            +RLV  HGSIDLEWLR+VPPDK KEYLL   GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 918  DRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRI 977

Query: 684  AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 743
            AVRLGWVPLQPLP  L +HLL+ YP+++ IQ YLWPRLC LDQKTLYELHYQMITFGK+F
Sbjct: 978  AVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVF 1037

Query: 744  CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN-PI 802
            C K  PNC ACP++GEC+HFASA ASAR ALP        ++E G G     PL ++ P 
Sbjct: 1038 CTKSKPNCNACPMKGECRHFASAFASARLALP--------STEKGMGTPDKNPLPLHLPE 1089

Query: 803  PVIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPT 859
            P  R +   V   SE   ++  CEPIIEEP SP P+ +E   ++I++  F   D EEIPT
Sbjct: 1090 PFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSIADIEEAFFE--DPEEIPT 1147

Query: 860  LRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVY 916
            +RL    F  N    ME NK  LQD   SSALVALTAE AS+P  KLK  + LRTEH VY
Sbjct: 1148 IRLNMDAFTSNLKKIMEHNK-ELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVY 1206

Query: 917  VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC-NSKDSKLCNSEICYS 975
             LPD H LL   ++R+PDDP  YLLAIW+PGE+ +S+      C    +  LC+ E C+S
Sbjct: 1207 ELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFS 1266

Query: 976  CNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIAN 1035
            CNS+ E  + IVRGTILIPCRTA RG FPLNGTYFQVNEVFADH +S +PINVPR  I  
Sbjct: 1267 CNSIKETRSQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWE 1326

Query: 1036 LRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1093
            L R  V FG+S  T+F  L   +IQ  FW+G+VCVRGFDR+ R P+PL+ RLH   SK
Sbjct: 1327 LPRRTVYFGTSVPTIFKGLSTEKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASK 1384



 Score =  114 bits (284), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 72/136 (52%), Positives = 93/136 (68%), Gaps = 7/136 (5%)

Query: 116 AVVPYQVGP-----SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKD 170
           A+VPY +           GA+VP     K+ + +P+V LD  T R+W L++   +    D
Sbjct: 494 ALVPYTMNSQIVLFGGGAGAIVP-VTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVD 552

Query: 171 QTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSD 230
             S+E   KWW++ER VF GR  SF ARMHL+ GDRRF PWKGSVVDSVVGV+LTQNVSD
Sbjct: 553 -GSDEQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSD 611

Query: 231 NLSSSAYMSLAARFPL 246
           +LSSSA+MSLA++FP+
Sbjct: 612 HLSSSAFMSLASQFPV 627


>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2
          Length = 1332

 Score =  580 bits (1496), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 323/717 (45%), Positives = 428/717 (59%), Gaps = 39/717 (5%)

Query: 404  ITQSSVTQFWP-TGNSTADVASPSKT---------------CIKESSIAASTEIPQLENT 447
            +TQ ++    P TG ST  + +P++                C +ESS  + + I   + +
Sbjct: 613  VTQETILNLDPRTGVSTPRIRNPTRVIIEEIDDDENDIDAVCSQESSKTSDSSITSADQS 672

Query: 448  -ALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEIS 506
              +L D  + +L  ++      K      G+   T   NDL      +SS+  +    ++
Sbjct: 673  KTMLLDPFNTVLMNEQVDSQMVK----GKGHIPYTDDLNDLSQGISMVSSASTHCELNLN 728

Query: 507  VLPSNRESFRTGM-PQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQ 565
             +P   E       P++   T   ++S R +   K  K    +   KK+     S     
Sbjct: 729  EVPPEVELCSHQQDPESTIQTQDQQESTRTEDVKKNRKKPTTSKPKKKSKESAKSTQKKS 788

Query: 566  QDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEF 622
             DWD LR+   +G    ER+   MD+VDW+A+RC D  +IA+ I +RG  N++A RIK F
Sbjct: 789  VDWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAF 848

Query: 623  LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 682
            LNRLV+ HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLLSL  IAFPVD NVGR
Sbjct: 849  LNRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGR 908

Query: 683  IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 742
            IAVRLGWVPLQPLP +L +HLL+ YPV++ +Q YLWPRLC LDQKTLYELHY MITFGK+
Sbjct: 909  IAVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKV 968

Query: 743  FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPI 802
            FC K  PNC ACP++ EC+H++SA ASAR ALP P E    +        +  P+VVN  
Sbjct: 969  FCTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFR 1028

Query: 803  PVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEE------ 856
            P + +  +   E+  +  NCEPIIEEP SP P+  E    +   D  ++G  E+      
Sbjct: 1029 PSLFLYQEKEQEA-QRSQNCEPIIEEPASPEPEYIEHDIEDYPRDKNNVGTSEDPWENKD 1087

Query: 857  -IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHV 915
             IPT+ L +   +    + + +NK     S  LV L+  AA++P RKLK    LRTEHHV
Sbjct: 1088 VIPTIILNK---EAGTSHDLVVNKEA-GTSHDLVVLSTYAAAIPRRKLKIKEKLRTEHHV 1143

Query: 916  YVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN--SKDSKLCNSEIC 973
            + LPD H +L GF+RR+ +D  PYLLAIW+PGE+ NS+  P+ RC     ++ LCN   C
Sbjct: 1144 FELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNENKC 1203

Query: 974  YSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTI 1033
            + CN   E  +  VRGTILIPCRTA RG FPLNGTYFQ NEVFADH++S +PI+VP   I
Sbjct: 1204 FQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTELI 1263

Query: 1034 ANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCS 1090
             +L+R +   GSS +++   L +  I+  F  G+VCVRGFDR  R P+ LV RLHCS
Sbjct: 1264 WDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLHCS 1320



 Score = 91.3 bits (225), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 80/171 (46%), Positives = 105/171 (61%), Gaps = 17/171 (9%)

Query: 78  QSTTADIIRK-FKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQI 136
           +S T D I K F++L I   G  LP          N   A++ Y+     E  A+V +  
Sbjct: 443 RSLTIDAITKLFEELDINKEGLCLP---------HNRETALILYKKS-YEEQKAIVKYS- 491

Query: 137 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 196
               KK KP+V LDP T R+W L+M+  D    D  S+E+  KWW++ER +F GR  SF 
Sbjct: 492 ----KKQKPKVQLDPETSRVWKLLMSSIDCDGVD-GSDEEKRKWWEEERNMFHGRANSFI 546

Query: 197 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 247
           ARM ++ G+R F PWKGSVVDSVVGV+LTQNV+D+ SSSAYM LAA FP++
Sbjct: 547 ARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVADHSSSSAYMDLAAEFPVE 597


>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2
          Length = 1044

 Score =  509 bits (1312), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 268/538 (49%), Positives = 342/538 (63%), Gaps = 14/538 (2%)

Query: 567  DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 626
            DW+ LRR+Y+    R    MDSV+W  VR + ++ +   IK+RGQ  I++ RI +FLN  
Sbjct: 501  DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRILSERILKFLNDE 560

Query: 627  VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 686
            V  +G+IDLEWLRN P   VK YLL+IEG+GLKS ECVRLL L+H AFPVD NVGRIAVR
Sbjct: 561  VNQNGNIDLEWLRNAPSHLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVR 620

Query: 687  LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 746
            LG VPL+PLP  + +H L EYP MD IQ YLWPRLC L Q+TLYELHYQMITFGK+FC K
Sbjct: 621  LGLVPLEPLPNGVQMHQLFEYPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTK 680

Query: 747  RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 806
              PNC ACP++ ECK+FASA  S++  L  P EK    + F N   Q   + V+    I 
Sbjct: 681  TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 738

Query: 807  IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 863
            +  + VS        C +P++E P SP  +  ES  ++I+D  F         +P +   
Sbjct: 739  LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 796

Query: 864  EREFKENFPNFMEMNKVMLQD----SSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 916
                K++  + + ++  M       S ALV  T E A +P    RK+K    LRTEH VY
Sbjct: 797  LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 856

Query: 917  VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSC 976
            VLPD HELL  F+RR  DDPSPYLLAIW PGE+ +S   P+ +C+S  SKLC  + C  C
Sbjct: 857  VLPDNHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSSDGSKLCKIKNCSYC 916

Query: 977  NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1036
             ++ E+N+ I RGTILIPCRTA RG FPLNGTYFQ NEVFADHETS +PI   R     L
Sbjct: 917  WTIREQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGL 976

Query: 1037 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKK 1094
             +  +  GS+ T++F  LD R+I+  FW GF+C+R FDR+ R P+ LV RLH  P ++
Sbjct: 977  EKRALYCGSTVTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTPPDER 1034



 Score = 94.0 bits (232), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 57/115 (49%), Positives = 82/115 (71%), Gaps = 2/115 (1%)

Query: 132 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 191
           V   IK  +K +  +V LDP T++ W+++M ++D  ++     E  E  W+KERE+F+ R
Sbjct: 334 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSYDDKE-TEAKWKKEREIFQTR 391

Query: 192 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 246
           I  F  RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 392 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 446


>sp|Q9WYK0|END3_THEMA Endonuclease III OS=Thermotoga maritima (strain ATCC 43589 / MSB8 /
           DSM 3109 / JCM 10099) GN=nth PE=3 SV=1
          Length = 213

 Score = 57.0 bits (136), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 27/167 (16%)

Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
           A   ++ D IKE G     A RI E    LVE +G           PD ++E LL + G+
Sbjct: 60  AKPEDLYDLIKESGMYRQKAERIVEISRILVEKYGG--------RVPDSLEE-LLKLPGV 110

Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
           G K+   V  +  +  A  VD +V RI+ RLGWV  +  P +    L K  P        
Sbjct: 111 GRKTANIVLWVGFKKPALAVDTHVHRISNRLGWVKTRT-PEETEEALKKLLP------ED 163

Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
           LW             ++  M+ FG+  CK ++P C  C L+  C+ +
Sbjct: 164 LWG-----------PINGSMVEFGRRICKPQNPLCEECFLKNHCEFY 199


>sp|O05956|END3_RICPR Endonuclease III OS=Rickettsia prowazekii (strain Madrid E) GN=nth
           PE=3 SV=1
          Length = 212

 Score = 47.0 bits (110), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)

Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
           P+  KE L+ + G+G K+   V           VD +V R++ R+G        G+    
Sbjct: 105 PNNFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTAAI 158

Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
           + KE              L  +D+K L   H+ +I  G+  CK R P C  CP++  C++
Sbjct: 159 VEKEL-------------LQIIDEKWLTYAHHWLILHGRYICKARKPGCNICPIKEYCEY 205

Query: 763 FASAVAS 769
           + +  +S
Sbjct: 206 YINTFSS 212


>sp|P39788|END3_BACSU Probable endonuclease III OS=Bacillus subtilis (strain 168) GN=nth
           PE=3 SV=1
          Length = 219

 Score = 47.0 bits (110), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)

Query: 601 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 660
           E+   IK  G     A  I++    ++E +G         VP D+  + L+ + G+G K+
Sbjct: 72  ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPRDR--DELVKLPGVGRKT 122

Query: 661 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 720
              V  ++    A  VD +V R++ RLG    +    ++   L+++ P  D      W  
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 175

Query: 721 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 761
                       H+++I FG+  CK +SP C  CPL   C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 206


>sp|Q58829|Y1434_METJA Putative endonuclease MJ1434 OS=Methanocaldococcus jannaschii
           (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
           100440) GN=MJ1434 PE=3 SV=1
          Length = 220

 Score = 46.6 bits (109), Expect = 0.001,   Method: Composition-based stats.
 Identities = 42/165 (25%), Positives = 74/165 (44%), Gaps = 20/165 (12%)

Query: 598 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV--KEYLLDIEG 655
           DE ++ + I+  G  N+ A R+K     +VE +G  + E +     D +  +  LL I G
Sbjct: 71  DEDKLKELIRPAGFYNLKAKRLKNVTKFIVENYG--NTEEMAKTDKDTLILRAELLSING 128

Query: 656 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 715
           +G ++ + + L +L   +F VD    R+  RLG              ++ E    D+I+ 
Sbjct: 129 VGKETADSILLYALDRESFVVDAYTKRMFSRLG--------------VINEKAKYDEIKE 174

Query: 716 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
                L   D +   E H  ++   K FC+K++  C  CP++  C
Sbjct: 175 IFEKNLPK-DLEIYKEYHALIVEHCKKFCRKKAL-CDNCPIKEFC 217


>sp|Q68W04|END3_RICTY Endonuclease III OS=Rickettsia typhi (strain ATCC VR-144 /
           Wilmington) GN=nth PE=3 SV=1
          Length = 212

 Score = 45.4 bits (106), Expect = 0.003,   Method: Composition-based stats.
 Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 19/127 (14%)

Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
           P+  KE L+ + G+G K+   V           VD +V R++ R+G        G+  + 
Sbjct: 105 PNDFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTTVI 158

Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
           + KE              L  +D+K L   H+ ++  G+  CK R P+C  C ++  C++
Sbjct: 159 VEKEL-------------LQIIDEKWLTHAHHWLVLHGRYICKARKPSCRICHIKEYCEY 205

Query: 763 FASAVAS 769
           + +  AS
Sbjct: 206 YINTFAS 212


>sp|Q4UK93|END3_RICFE Endonuclease III OS=Rickettsia felis (strain ATCC VR-1525 /
           URRWXCal2) GN=nth PE=3 SV=1
          Length = 213

 Score = 45.1 bits (105), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)

Query: 643 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 702
           P+  KE L+ + G+G K+   V           VD +V R+A R+G       P  +   
Sbjct: 105 PNDFKE-LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVAKRIGLARGNS-PEIVEKE 162

Query: 703 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
           LL+                  +++K L   H+ +I  G+  CK R P+C  CP++  C++
Sbjct: 163 LLQ-----------------IINEKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCEY 205

Query: 763 FASAVAS 769
           + S + S
Sbjct: 206 YNSPIIS 212


>sp|Q92GH4|END3_RICCN Endonuclease III OS=Rickettsia conorii (strain ATCC VR-613 / Malish
           7) GN=nth PE=3 SV=1
          Length = 210

 Score = 42.7 bits (99), Expect = 0.016,   Method: Composition-based stats.
 Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 18/114 (15%)

Query: 650 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 709
           L+ + G+G K+   V           VD +V R+A R+G        GD         P 
Sbjct: 111 LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVANRIGLAK-----GDT--------PE 157

Query: 710 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 763
           + + ++     L  +D K L   H+ +I  G+  CK R P+C  CP++  C ++
Sbjct: 158 IVENEL-----LQIIDTKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCDYY 206


>sp|Q9V3I8|OGG1_DROME N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2
           SV=2
          Length = 343

 Score = 41.6 bits (96), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 34/48 (70%), Gaps = 1/48 (2%)

Query: 638 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIA 684
           L+++P +K +E L  + G+G K  +C+ L+S+ H+ + PVD+++ RIA
Sbjct: 229 LKSMPFEKAREELTLLPGIGYKVADCICLMSMGHLESVPVDIHIYRIA 276


>sp|A9M2W6|PYRC_NEIM0 Dihydroorotase OS=Neisseria meningitidis serogroup C (strain
           053442) GN=pyrC PE=3 SV=1
          Length = 344

 Score = 40.4 bits (93), Expect = 0.074,   Method: Composition-based stats.
 Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)

Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
           +GS  + ++ +YLT N +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDNATSELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|Q9K0D1|PYRC_NEIMB Dihydroorotase OS=Neisseria meningitidis serogroup B (strain MC58)
           GN=pyrC PE=3 SV=1
          Length = 344

 Score = 40.0 bits (92), Expect = 0.12,   Method: Composition-based stats.
 Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)

Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
           +GS  + ++ +YLT N +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDNATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|O08760|OGG1_MOUSE N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2
          Length = 345

 Score = 39.7 bits (91), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 684
           L E  G   L+ LR  P ++  + L  + G+G K  +C+ L++L +  A PVDV+V +IA
Sbjct: 216 LEEQGGPAWLQQLRVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIA 275

Query: 685 VR-LGWVP 691
            R  GW P
Sbjct: 276 HRDYGWHP 283


>sp|P29588|GTMR_METTF G/T mismatches repair enzyme OS=Methanobacterium thermoformicicum
           PE=1 SV=1
          Length = 221

 Score = 38.9 bits (89), Expect = 0.22,   Method: Composition-based stats.
 Identities = 45/197 (22%), Positives = 78/197 (39%), Gaps = 27/197 (13%)

Query: 570 LLRRIYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV 627
           LLRR  +   ++ +DK  +    +E +    +SEIA  IKE G  N  A ++KE    ++
Sbjct: 44  LLRRTTAGHVKKIYDKFFVKYKCFEDILKTPKSEIAKDIKEIGLSNQRAEQLKELARVVI 103

Query: 628 ELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 687
             +G         VP  + ++ +LD+ G+G  +   V  L+    A  VD N  R+  R 
Sbjct: 104 NDYGG-------RVP--RNRKAILDLPGVGKYTCAAVMCLAFGKKAAMVDANFVRVINR- 153

Query: 688 GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL-CYLDQKTLYELHYQMITFGKIFCKK 746
                         +    Y  ++     LW      +      + +  ++ F  I C  
Sbjct: 154 --------------YFGGSYENLNYNHKALWELAETLVPGGKCRDFNLGLMDFSAIICAP 199

Query: 747 RSPNCGACPLRGECKHF 763
           R P C  C +   C ++
Sbjct: 200 RKPKCEKCGMSKLCSYY 216


>sp|P46303|UVEN_MICLC Ultraviolet N-glycosylase/AP lyase OS=Micrococcus luteus (strain
           ATCC 4698 / DSM 20030 / JCM 1464 / NBRC 3333 / NCIMB
           9278 / NCTC 2665 / VKM Ac-2230) GN=pdg PE=1 SV=2
          Length = 279

 Score = 37.7 bits (86), Expect = 0.54,   Method: Composition-based stats.
 Identities = 42/178 (23%), Positives = 69/178 (38%), Gaps = 28/178 (15%)

Query: 590 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELH-GSIDLEWLRNVPPDKVKE 648
           D  A+  A E E+ + ++  G     A  I      LV  H G +         P ++++
Sbjct: 75  DAHAMAAATEPELQELVRSTGFYRNKASAILRLSQELVGRHDGEV---------PARLED 125

Query: 649 YLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
            L+ + G+G K+   V   +       VD + GR+A RLG+   +  PG       +  P
Sbjct: 126 -LVALPGVGRKTAFVVLGNAFGQPGITVDTHFGRLARRLGFTD-ETDPGKGRARRGRPVP 183

Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 766
                 M                L +++I  G+  C  R P CG CP+   C  +A+ 
Sbjct: 184 PARDWTM----------------LSHRLIFHGRRVCHARRPACGRCPIARWCPSYAAG 225


>sp|Q5F9Y1|PYRC_NEIG1 Dihydroorotase OS=Neisseria gonorrhoeae (strain ATCC 700825 / FA
           1090) GN=pyrC PE=3 SV=1
          Length = 344

 Score = 37.7 bits (86), Expect = 0.60,   Method: Composition-based stats.
 Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)

Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
           +GS  + ++ +YLT   +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|B4RJT2|PYRC_NEIG2 Dihydroorotase OS=Neisseria gonorrhoeae (strain NCCP11945) GN=pyrC
           PE=3 SV=1
          Length = 344

 Score = 37.4 bits (85), Expect = 0.60,   Method: Composition-based stats.
 Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)

Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
           +GS  + ++ +YLT   +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|A0R6D9|MAK_MYCS2 Maltokinase OS=Mycobacterium smegmatis (strain ATCC 700084 /
           mc(2)155) GN=mak PE=3 SV=1
          Length = 441

 Score = 37.4 bits (85), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 10/95 (10%)

Query: 74  QQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVP 133
           ++L+ +TTA  +R      +RDG   + LQ N ADGP   +  +V    GP  E+  +  
Sbjct: 20  RELVSATTAMAVR------LRDGLELVLLQANYADGPDERYQVIVATGSGPIDEYSVVAT 73

Query: 134 HQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTT 168
             I + +     + + DP   R   L+  ID+  T
Sbjct: 74  IGIADGQTAY--DALYDPDATRY--LLSLIDESAT 104


>sp|Q9JVD6|PYRC_NEIMA Dihydroorotase OS=Neisseria meningitidis serogroup A / serotype 4A
           (strain Z2491) GN=pyrC PE=3 SV=1
          Length = 344

 Score = 37.0 bits (84), Expect = 0.83,   Method: Composition-based stats.
 Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)

Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
           +GS  + ++ +YLT   +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDQATPALVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|Q9CB92|END3_MYCLE Endonuclease III OS=Mycobacterium leprae (strain TN) GN=nth PE=3
           SV=2
          Length = 245

 Score = 36.6 bits (83), Expect = 1.2,   Method: Composition-based stats.
 Identities = 47/179 (26%), Positives = 68/179 (37%), Gaps = 27/179 (15%)

Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
           AD +E+ + I+  G     A  +      LVE     D E    VP   V   L  + G+
Sbjct: 78  ADRAELENFIRPTGFFRNKAASLIRLGQALVE---RFDGE----VPSTMVD--LFTLPGV 128

Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
           G K+   +   +       VD + GR+  R  W               +E PV  K++  
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTA-------------EEDPV--KVEHA 173

Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 775
           +   L   DQ TL  L +++I  G+  C  R P CG C L  +C  F         A P
Sbjct: 174 V-GELIERDQWTL--LSHRVIFHGRRVCHARKPACGVCVLAKDCPSFGLGPTEPLLAAP 229


>sp|P53397|OGG1_YEAST N-glycosylase/DNA lyase OS=Saccharomyces cerevisiae (strain ATCC
           204508 / S288c) GN=OGG1 PE=1 SV=1
          Length = 376

 Score = 36.6 bits (83), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 620 KEFLNRLVELHGSIDLEWLRNVPPDK----VKEYLLDIEGLGLKSVECVRLLSLQHIAF- 674
           ++ +N   E + + D  +L+++  D     V+E+L+   G+G K  +CV L+ L      
Sbjct: 198 RKLVNDKAEANITSDTTYLQSICKDAQYEDVREHLMSYNGVGPKVADCVCLMGLHMDGIV 257

Query: 675 PVDVNVGRIAVR 686
           PVDV+V RIA R
Sbjct: 258 PVDVHVSRIAKR 269


>sp|Q08214|NTG2_YEAST DNA base excision repair N-glycosylase 2 OS=Saccharomyces
           cerevisiae (strain ATCC 204508 / S288c) GN=NTG2 PE=1
           SV=1
          Length = 380

 Score = 36.6 bits (83), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/119 (24%), Positives = 53/119 (44%), Gaps = 30/119 (25%)

Query: 680 VGRIAVRLGWVPLQPLPGDLHIH-LLKEYPVMDKI----------QMYLWPRLCYLDQKT 728
           +G + ++ GW  +  +  D+H+H L K +  +D I          ++ +W     L    
Sbjct: 249 MGYLTLQKGWGLIAGICVDVHVHRLCKMWNWVDPIKCKTAEHTRKELQVW-----LPHSL 303

Query: 729 LYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEF 787
            YE++  ++ FG++ C  R   C  C         A+ V +AR      +EK I +S+F
Sbjct: 304 WYEINTVLVGFGQLICMARGKRCDLC--------LANDVCNAR------NEKLIESSKF 348


>sp|O70249|OGG1_RAT N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1
          Length = 345

 Score = 35.8 bits (81), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 626 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 684
           L E  G   L+ LR    ++  + L  + G+G K  +C+ L++L +  A PVD++V +IA
Sbjct: 216 LEEQGGPAWLQQLRVASYEEAHKALCTLPGVGTKVADCICLMALDKPQAVPVDIHVWQIA 275

Query: 685 VR-LGWVP 691
            R  GW P
Sbjct: 276 HRDYGWQP 283


>sp|O27397|OGG1_METTH Probable N-glycosylase/DNA lyase OS=Methanothermobacter
           thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM
           10044 / NBRC 100330 / Delta H) GN=MTH_1342 PE=3 SV=1
          Length = 312

 Score = 35.8 bits (81), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 629 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRI 683
           L   +D+  +  +  D  +E LL++ G+G K  +C+ L   +   AFPVDV + RI
Sbjct: 205 LAEEMDIRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPVDVWIRRI 260


>sp|Q9FNY7|OGG1_ARATH N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1
           SV=1
          Length = 365

 Score = 35.4 bits (80), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 636 EWL---RNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 684
           EWL   R V   +    L  + G+G K   C+ L SL QH A PVD +V +IA
Sbjct: 235 EWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIA 287


>sp|P63540|END3_MYCTU Endonuclease III OS=Mycobacterium tuberculosis GN=nth PE=3 SV=1
          Length = 245

 Score = 35.0 bits (79), Expect = 3.1,   Method: Composition-based stats.
 Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)

Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
           AD +E+   I+  G     A  +      LVE  G         VP    K  L+ + G+
Sbjct: 78  ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128

Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
           G K+   +   +       VD + GR+  R  W                E PV  K++  
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173

Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 775
           +      +++K    L +++I  G+  C  R P CG C L  +C  F         A P
Sbjct: 174 VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229


>sp|P63541|END3_MYCBO Endonuclease III OS=Mycobacterium bovis (strain ATCC BAA-935 /
           AF2122/97) GN=nth PE=3 SV=1
          Length = 245

 Score = 35.0 bits (79), Expect = 3.1,   Method: Composition-based stats.
 Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)

Query: 597 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 656
           AD +E+   I+  G     A  +      LVE  G         VP    K  L+ + G+
Sbjct: 78  ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128

Query: 657 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 716
           G K+   +   +       VD + GR+  R  W                E PV  K++  
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173

Query: 717 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 775
           +      +++K    L +++I  G+  C  R P CG C L  +C  F         A P
Sbjct: 174 VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229


>sp|P34648|YOT2_CAEEL Uncharacterized protein ZK632.2 OS=Caenorhabditis elegans
           GN=ZK632.2 PE=4 SV=1
          Length = 710

 Score = 35.0 bits (79), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 42/199 (21%), Positives = 81/199 (40%), Gaps = 29/199 (14%)

Query: 4   LDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPR 63
           LD   +Q+        +    +++  W  K +  T +S+  +K++++ +         P 
Sbjct: 454 LDDYIRQLEKSGGAGDDAKTKMEKSKWRQKLMAATHESQKLEKLVKIAK---------PA 504

Query: 64  YSRGLQVDMEQQLMQSTTAD---IIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPY 120
             +GL+     QL  +   D    ++K   +  R    Q P Q     GP    +A +P 
Sbjct: 505 VVKGLE-----QLETTAANDRQAFLKKLMGVRARKEIDQTPSQ-----GPGPSTSATLPA 554

Query: 121 QVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDD-GTTKDQTSNEDMEK 179
            V P+S     V H+ K    K++ E+     +  + N +  +D+  + KD+ S E  +K
Sbjct: 555 TVAPTSTKAVEVEHEKKMTPLKVEKEIAASLDSSEIKNSLPAVDEPSSVKDEVSEETPQK 614

Query: 180 WWQKEREVFEGRIQSFTAR 198
                 E F  ++Q   A+
Sbjct: 615 ------EAFGSKVQKRVAQ 627


>sp|A1KSU5|PYRC_NEIMF Dihydroorotase OS=Neisseria meningitidis serogroup C / serotype 2a
           (strain ATCC 700532 / FAM18) GN=pyrC PE=3 SV=1
          Length = 344

 Score = 34.7 bits (78), Expect = 4.7,   Method: Composition-based stats.
 Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 7/119 (5%)

Query: 212 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 271
           +GS  + ++ +YLT   +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 272 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 330
                 ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQDILFLVHGE---VTDPEID----IFDREAAFIERVMKPVLAQVPNLKVVFEHITT 176


>sp|Q89A45|MUTY_BUCBP A/G-specific adenine glycosylase OS=Buchnera aphidicola subsp.
           Baizongia pistaciae (strain Bp) GN=mutY PE=3 SV=1
          Length = 351

 Score = 34.3 bits (77), Expect = 5.2,   Method: Composition-based stats.
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 736 MITFGKIFCKKRSPNCGACPLRGECKHF 763
           +I  G + C K +P C  CPL+  CK +
Sbjct: 182 LIDIGALICLKSNPKCNICPLKSTCKSY 209


>sp|A0R567|MUTY_MYCS2 A/G-specific adenine glycosylase OS=Mycobacterium smegmatis (strain
           ATCC 700084 / mc(2)155) GN=mutY PE=3 SV=1
          Length = 293

 Score = 34.3 bits (77), Expect = 5.4,   Method: Composition-based stats.
 Identities = 36/136 (26%), Positives = 49/136 (36%), Gaps = 25/136 (18%)

Query: 625 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 684
           RL E    I  E+   VP D   + LL + G+G  +   V   + Q     VD NV R+ 
Sbjct: 86  RLHECAVVIASEYDDVVPRDV--DTLLTLPGIGAYTARAVACFAYQASVPVVDTNVRRVV 143

Query: 685 VRLGWVPLQPLPGDLHIHLLKEYPV----MDKIQMYLWPRLCYLDQKTLYELHYQMITFG 740
            R              +H   + P     +D +   L P        T       ++  G
Sbjct: 144 TRA-------------VHGAADAPASTRDLDMVAALLPP------DTTAPTFSAALMELG 184

Query: 741 KIFCKKRSPNCGACPL 756
              C  RSP CG CPL
Sbjct: 185 ATVCTARSPRCGICPL 200


>sp|P44319|END3_HAEIN Endonuclease III OS=Haemophilus influenzae (strain ATCC 51907 / DSM
           11121 / KW20 / Rd) GN=nth PE=3 SV=1
          Length = 211

 Score = 34.3 bits (77), Expect = 6.3,   Method: Composition-based stats.
 Identities = 37/157 (23%), Positives = 61/157 (38%), Gaps = 27/157 (17%)

Query: 606 IKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVR 665
           IK  G  N  A  I +    L+E H          VP ++  E L  + G+G K+   V 
Sbjct: 76  IKTIGLFNSKAENIIKTCRDLIEKHNG-------EVPENR--EALEALAGVGRKTANVVL 126

Query: 666 LLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLD 725
             +  H    VD ++ R+  R  +   + +   +   LLK  P   K+ ++ W       
Sbjct: 127 NTAFGHPTIAVDTHIFRVCNRTNFAAGKDVV-KVEEKLLKVVPNEFKVDVHHW------- 178

Query: 726 QKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 762
                     +I  G+  C  R P CG+C +   C++
Sbjct: 179 ----------LILHGRYTCIARKPRCGSCIIEDLCEY 205


>sp|P54137|NTH1_CAEEL Probable endonuclease III homolog OS=Caenorhabditis elegans
           GN=nth-1 PE=1 SV=2
          Length = 298

 Score = 33.9 bits (76), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 27/112 (24%), Positives = 47/112 (41%), Gaps = 19/112 (16%)

Query: 650 LLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYP 708
           L  + G+G K    V  ++  + +   VD +V RI+ RLGW+               +  
Sbjct: 149 LCALPGVGPKMANLVMQIAWGECVGIAVDTHVHRISNRLGWI---------------KTS 193

Query: 709 VMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 760
             +K Q  L      L +     +++ ++ FG++ C+   P CG C  R  C
Sbjct: 194 TPEKTQKALE---ILLPKSEWQPINHLLVGFGQMQCQPVRPKCGTCLCRFTC 242


>sp|P31378|NTG1_YEAST Mitochondrial DNA base excision repair N-glycosylase 1
           OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
           GN=NTG1 PE=1 SV=1
          Length = 399

 Score = 33.9 bits (76), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 50/106 (47%), Gaps = 5/106 (4%)

Query: 3   RLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLW--FTAKSRGSQKIIQVERN-GLPTQK 59
           RL +L K ++    +  +Q R  Q ++W PKGLW        G  +II   RN G   Q 
Sbjct: 267 RLTKLWKWVDAQKCKTPDQTRT-QLQNWLPKGLWTEINGLLVGFGQIITKSRNLGDMLQF 325

Query: 60  MPPRYSR-GLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQC 104
           +PP   R  L  D++ QL +    +I+   K +   +G  +L ++ 
Sbjct: 326 LPPDDPRSSLDWDLQSQLYKEIQQNIMSYPKWVKYLEGKRELNVEA 371


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.316    0.132    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 420,498,180
Number of Sequences: 539616
Number of extensions: 18434944
Number of successful extensions: 47585
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 124
Number of HSP's that attempted gapping in prelim test: 47289
Number of HSP's gapped (non-prelim): 340
length of query: 1105
length of database: 191,569,459
effective HSP length: 128
effective length of query: 977
effective length of database: 122,498,611
effective search space: 119681142947
effective search space used: 119681142947
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (30.4 bits)