BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000431
         (1515 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1
            SV=2
          Length = 1987

 Score =  629 bits (1622), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)

Query: 978  WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 1034
            WD LR+     E   ER+ + MDS+D+EA+R A  SEI++AIKERG  N++A RIK+FL 
Sbjct: 1450 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1509

Query: 1035 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 1094
            R+V+ HG IDLEWLR  PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1510 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1569

Query: 1095 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 1154
            VR+GWVPLQPLP  L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1570 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1629

Query: 1155 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 1214
             K  PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++         PP+ +  I +
Sbjct: 1630 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIEL 1689

Query: 1215 -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 1273
             + +E    S +     NCEPIIEEP SPG +C+E  ES+I+D  ++  D +EIPT++L 
Sbjct: 1690 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1748

Query: 1274 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 1330
              +F       ME N + LQ+   S ALVAL     S+PT KLK  + LRTEH VY LPD
Sbjct: 1749 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1807

Query: 1331 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 1389
            +H LL G D+R+PDDPSPYLLAIW+PGE+ NS   P+ +C  K S K+C  E C  CNS+
Sbjct: 1808 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1867

Query: 1390 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1449
             E N+  VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S  PI+VPR  I +L R 
Sbjct: 1868 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1927

Query: 1450 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1503
             V FG+S T++F  L   QIQ  FW+GFVCVRGF+++ R PRPL+ RLH   SK
Sbjct: 1928 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1981



 Score =  137 bits (344), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 88/194 (45%), Positives = 122/194 (62%), Gaps = 20/194 (10%)

Query: 486  LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 545
            L Q + A+II + ++L + D   +               NA+V Y+       GALVP++
Sbjct: 904  LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 946

Query: 546  IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 605
               K++K +P+V +D  T R+WNL+M   D    D+  ++  EKWW++ER VF GR  SF
Sbjct: 947  --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 1004

Query: 606  TARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS 665
             ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP K ++     
Sbjct: 1005 IARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSRE-D 1063

Query: 666  DENLRTTASLEPIG 679
            + N+R+    +P G
Sbjct: 1064 ERNVRSVVVEDPEG 1077



 Score = 43.9 bits (102), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 9/98 (9%)

Query: 6   PRVLVEGMPKKEP-KPKKAPQQPKTPKPVTPNRAHETTRKRKYVRKNSVSGNVVMDESVS 64
           P+V+VEG PK++P KP + P+     KP    R   T  K K     S     + + +  
Sbjct: 306 PKVVVEGKPKRKPRKPAELPKVVVEGKPKRKPRKAATQEKVKSKETGSAKKKNLKESATK 365

Query: 65  KRRKCGRGNRQSNSDAIILLPPPPIKTCRQALNFDLEN 102
           K    G  + +S        P   +K+CR+ALNFDLEN
Sbjct: 366 KPANVGDMSNKS--------PEVTLKSCRKALNFDLEN 395


>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2
          Length = 1393

 Score =  623 bits (1606), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 312/538 (57%), Positives = 374/538 (69%), Gaps = 22/538 (4%)

Query: 977  DWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 1033
            DWD LRR     +   E++   MD+VDW+A+R AD  E+A+ IK RG  + +A RI+ FL
Sbjct: 858  DWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFL 917

Query: 1034 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 1093
            +RLV  HGSIDLEWLR+VPPDK KEYLL   GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 918  DRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRI 977

Query: 1094 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 1153
            AVRLGWVPLQPLP  L +HLL+ YP+++ IQ YLWPRLC LDQKTLYELHYQMITFGK+F
Sbjct: 978  AVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVF 1037

Query: 1154 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN-PI 1212
            C K  PNC ACP++GEC+HFASA ASAR ALP        ++E G G     PL ++ P 
Sbjct: 1038 CTKSKPNCNACPMKGECRHFASAFASARLALP--------STEKGMGTPDKNPLPLHLPE 1089

Query: 1213 PVIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPT 1269
            P  R +   V   SE   ++  CEPIIEEP SP P+ +E   ++I++  F   D EEIPT
Sbjct: 1090 PFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSIADIEEAFFE--DPEEIPT 1147

Query: 1270 LRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVY 1326
            +RL    F  N    ME NK  LQD   SSALVALTAE AS+P  KLK  + LRTEH VY
Sbjct: 1148 IRLNMDAFTSNLKKIMEHNK-ELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVY 1206

Query: 1327 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC-NSKDSKLCNSEICYS 1385
             LPD H LL   ++R+PDDP  YLLAIW+PGE+ +S+      C    +  LC+ E C+S
Sbjct: 1207 ELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFS 1266

Query: 1386 CNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIAN 1445
            CNS+ E  + IVRGTILIPCRTA RG FPLNGTYFQVNEVFADH +S +PINVPR  I  
Sbjct: 1267 CNSIKETRSQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWE 1326

Query: 1446 LRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1503
            L R  V FG+S  T+F  L   +IQ  FW+G+VCVRGFDR+ R P+PL+ RLH   SK
Sbjct: 1327 LPRRTVYFGTSVPTIFKGLSTEKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASK 1384



 Score =  114 bits (285), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 72/136 (52%), Positives = 93/136 (68%), Gaps = 7/136 (5%)

Query: 526 AVVPYQVGP-----SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKD 580
           A+VPY +           GA+VP     K+ + +P+V LD  T R+W L++   +    D
Sbjct: 494 ALVPYTMNSQIVLFGGGAGAIVP-VTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVD 552

Query: 581 QTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSD 640
             S+E   KWW++ER VF GR  SF ARMHL+ GDRRF PWKGSVVDSVVGV+LTQNVSD
Sbjct: 553 -GSDEQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSD 611

Query: 641 NLSSSAYMSLAARFPL 656
           +LSSSA+MSLA++FP+
Sbjct: 612 HLSSSAFMSLASQFPV 627


>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2
          Length = 1332

 Score =  581 bits (1498), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 323/717 (45%), Positives = 428/717 (59%), Gaps = 39/717 (5%)

Query: 814  ITQSSVTQFWP-TGNSTADVASPSKT---------------CIKESSIAASTEIPQLENT 857
            +TQ ++    P TG ST  + +P++                C +ESS  + + I   + +
Sbjct: 613  VTQETILNLDPRTGVSTPRIRNPTRVIIEEIDDDENDIDAVCSQESSKTSDSSITSADQS 672

Query: 858  -ALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEIS 916
              +L D  + +L  ++      K      G+   T   NDL      +SS+  +    ++
Sbjct: 673  KTMLLDPFNTVLMNEQVDSQMVK----GKGHIPYTDDLNDLSQGISMVSSASTHCELNLN 728

Query: 917  VLPSNRESFRTGM-PQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQ 975
             +P   E       P++   T   ++S R +   K  K    +   KK+     S     
Sbjct: 729  EVPPEVELCSHQQDPESTIQTQDQQESTRTEDVKKNRKKPTTSKPKKKSKESAKSTQKKS 788

Query: 976  QDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEF 1032
             DWD LR+   +G    ER+   MD+VDW+A+RC D  +IA+ I +RG  N++A RIK F
Sbjct: 789  VDWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAF 848

Query: 1033 LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 1092
            LNRLV+ HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLLSL  IAFPVD NVGR
Sbjct: 849  LNRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGR 908

Query: 1093 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 1152
            IAVRLGWVPLQPLP +L +HLL+ YPV++ +Q YLWPRLC LDQKTLYELHY MITFGK+
Sbjct: 909  IAVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKV 968

Query: 1153 FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPI 1212
            FC K  PNC ACP++ EC+H++SA ASAR ALP P E    +        +  P+VVN  
Sbjct: 969  FCTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFR 1028

Query: 1213 PVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEE------ 1266
            P + +  +   E+  +  NCEPIIEEP SP P+  E    +   D  ++G  E+      
Sbjct: 1029 PSLFLYQEKEQEA-QRSQNCEPIIEEPASPEPEYIEHDIEDYPRDKNNVGTSEDPWENKD 1087

Query: 1267 -IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHV 1325
             IPT+ L +   +    + + +NK     S  LV L+  AA++P RKLK    LRTEHHV
Sbjct: 1088 VIPTIILNK---EAGTSHDLVVNKEA-GTSHDLVVLSTYAAAIPRRKLKIKEKLRTEHHV 1143

Query: 1326 YVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN--SKDSKLCNSEIC 1383
            + LPD H +L GF+RR+ +D  PYLLAIW+PGE+ NS+  P+ RC     ++ LCN   C
Sbjct: 1144 FELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNENKC 1203

Query: 1384 YSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTI 1443
            + CN   E  +  VRGTILIPCRTA RG FPLNGTYFQ NEVFADH++S +PI+VP   I
Sbjct: 1204 FQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTELI 1263

Query: 1444 ANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCS 1500
             +L+R +   GSS +++   L +  I+  F  G+VCVRGFDR  R P+ LV RLHCS
Sbjct: 1264 WDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLHCS 1320



 Score = 91.7 bits (226), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 80/171 (46%), Positives = 105/171 (61%), Gaps = 17/171 (9%)

Query: 488 QSTTADIIRK-FKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQI 546
           +S T D I K F++L I   G  LP          N   A++ Y+     E  A+V +  
Sbjct: 443 RSLTIDAITKLFEELDINKEGLCLP---------HNRETALILYKKS-YEEQKAIVKYS- 491

Query: 547 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 606
               KK KP+V LDP T R+W L+M+  D    D  S+E+  KWW++ER +F GR  SF 
Sbjct: 492 ----KKQKPKVQLDPETSRVWKLLMSSIDCDGVD-GSDEEKRKWWEEERNMFHGRANSFI 546

Query: 607 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 657
           ARM ++ G+R F PWKGSVVDSVVGV+LTQNV+D+ SSSAYM LAA FP++
Sbjct: 547 ARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVADHSSSSAYMDLAAEFPVE 597


>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2
          Length = 1044

 Score =  511 bits (1316), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 268/538 (49%), Positives = 342/538 (63%), Gaps = 14/538 (2%)

Query: 977  DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 1036
            DW+ LRR+Y+    R    MDSV+W  VR + ++ +   IK+RGQ  I++ RI +FLN  
Sbjct: 501  DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRILSERILKFLNDE 560

Query: 1037 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 1096
            V  +G+IDLEWLRN P   VK YLL+IEG+GLKS ECVRLL L+H AFPVD NVGRIAVR
Sbjct: 561  VNQNGNIDLEWLRNAPSHLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVR 620

Query: 1097 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 1156
            LG VPL+PLP  + +H L EYP MD IQ YLWPRLC L Q+TLYELHYQMITFGK+FC K
Sbjct: 621  LGLVPLEPLPNGVQMHQLFEYPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTK 680

Query: 1157 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 1216
              PNC ACP++ ECK+FASA  S++  L  P EK    + F N   Q   + V+    I 
Sbjct: 681  TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 738

Query: 1217 IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 1273
            +  + VS        C +P++E P SP  +  ES  ++I+D  F         +P +   
Sbjct: 739  LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 796

Query: 1274 EREFKENFPNFMEMNKVMLQD----SSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 1326
                K++  + + ++  M       S ALV  T E A +P    RK+K    LRTEH VY
Sbjct: 797  LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 856

Query: 1327 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSC 1386
            VLPD HELL  F+RR  DDPSPYLLAIW PGE+ +S   P+ +C+S  SKLC  + C  C
Sbjct: 857  VLPDNHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSSDGSKLCKIKNCSYC 916

Query: 1387 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1446
             ++ E+N+ I RGTILIPCRTA RG FPLNGTYFQ NEVFADHETS +PI   R     L
Sbjct: 917  WTIREQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGL 976

Query: 1447 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKK 1504
             +  +  GS+ T++F  LD R+I+  FW GF+C+R FDR+ R P+ LV RLH  P ++
Sbjct: 977  EKRALYCGSTVTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTPPDER 1034



 Score = 94.4 bits (233), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 57/115 (49%), Positives = 82/115 (71%), Gaps = 2/115 (1%)

Query: 542 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 601
           V   IK  +K +  +V LDP T++ W+++M ++D  ++     E   KW +KERE+F+ R
Sbjct: 334 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSYDDKETEAKW-KKEREIFQTR 391

Query: 602 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 656
           I  F  RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 392 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 446


>sp|Q9WYK0|END3_THEMA Endonuclease III OS=Thermotoga maritima (strain ATCC 43589 / MSB8 /
            DSM 3109 / JCM 10099) GN=nth PE=3 SV=1
          Length = 213

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 27/167 (16%)

Query: 1007 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1066
            A   ++ D IKE G     A RI E    LVE +G           PD ++E LL + G+
Sbjct: 60   AKPEDLYDLIKESGMYRQKAERIVEISRILVEKYGG--------RVPDSLEE-LLKLPGV 110

Query: 1067 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1126
            G K+   V  +  +  A  VD +V RI+ RLGWV  +  P +    L K  P        
Sbjct: 111  GRKTANIVLWVGFKKPALAVDTHVHRISNRLGWVKTRT-PEETEEALKKLLP------ED 163

Query: 1127 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 1173
            LW             ++  M+ FG+  CK ++P C  C L+  C+ +
Sbjct: 164  LWG-----------PINGSMVEFGRRICKPQNPLCEECFLKNHCEFY 199


>sp|O05956|END3_RICPR Endonuclease III OS=Rickettsia prowazekii (strain Madrid E) GN=nth
            PE=3 SV=1
          Length = 212

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)

Query: 1053 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1112
            P+  KE L+ + G+G K+   V           VD +V R++ R+G        G+    
Sbjct: 105  PNNFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTAAI 158

Query: 1113 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1172
            + KE              L  +D+K L   H+ +I  G+  CK R P C  CP++  C++
Sbjct: 159  VEKEL-------------LQIIDEKWLTYAHHWLILHGRYICKARKPGCNICPIKEYCEY 205

Query: 1173 FASAVAS 1179
            + +  +S
Sbjct: 206  YINTFSS 212


>sp|P39788|END3_BACSU Probable endonuclease III OS=Bacillus subtilis (strain 168) GN=nth
            PE=3 SV=1
          Length = 219

 Score = 46.6 bits (109), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)

Query: 1011 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 1070
            E+   IK  G     A  I++    ++E +G         VP D+  + L+ + G+G K+
Sbjct: 72   ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPRDR--DELVKLPGVGRKT 122

Query: 1071 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 1130
               V  ++    A  VD +V R++ RLG    +    ++   L+++ P  D      W  
Sbjct: 123  ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 175

Query: 1131 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 1171
                        H+++I FG+  CK +SP C  CPL   C+
Sbjct: 176  ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 206


>sp|Q58829|Y1434_METJA Putative endonuclease MJ1434 OS=Methanocaldococcus jannaschii (strain
            ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440)
            GN=MJ1434 PE=3 SV=1
          Length = 220

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 42/165 (25%), Positives = 74/165 (44%), Gaps = 20/165 (12%)

Query: 1008 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV--KEYLLDIEG 1065
            DE ++ + I+  G  N+ A R+K     +VE +G  + E +     D +  +  LL I G
Sbjct: 71   DEDKLKELIRPAGFYNLKAKRLKNVTKFIVENYG--NTEEMAKTDKDTLILRAELLSING 128

Query: 1066 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 1125
            +G ++ + + L +L   +F VD    R+  RLG              ++ E    D+I+ 
Sbjct: 129  VGKETADSILLYALDRESFVVDAYTKRMFSRLG--------------VINEKAKYDEIKE 174

Query: 1126 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 1170
                 L   D +   E H  ++   K FC+K++  C  CP++  C
Sbjct: 175  IFEKNLPK-DLEIYKEYHALIVEHCKKFCRKKAL-CDNCPIKEFC 217


>sp|Q68W04|END3_RICTY Endonuclease III OS=Rickettsia typhi (strain ATCC VR-144 /
            Wilmington) GN=nth PE=3 SV=1
          Length = 212

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 19/127 (14%)

Query: 1053 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1112
            P+  KE L+ + G+G K+   V           VD +V R++ R+G        G+  + 
Sbjct: 105  PNDFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTTVI 158

Query: 1113 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1172
            + KE              L  +D+K L   H+ ++  G+  CK R P+C  C ++  C++
Sbjct: 159  VEKEL-------------LQIIDEKWLTHAHHWLVLHGRYICKARKPSCRICHIKEYCEY 205

Query: 1173 FASAVAS 1179
            + +  AS
Sbjct: 206  YINTFAS 212


>sp|Q4UK93|END3_RICFE Endonuclease III OS=Rickettsia felis (strain ATCC VR-1525 /
            URRWXCal2) GN=nth PE=3 SV=1
          Length = 213

 Score = 44.7 bits (104), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)

Query: 1053 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1112
            P+  KE L+ + G+G K+   V           VD +V R+A R+G       P  +   
Sbjct: 105  PNDFKE-LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVAKRIGLARGNS-PEIVEKE 162

Query: 1113 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1172
            LL+                  +++K L   H+ +I  G+  CK R P+C  CP++  C++
Sbjct: 163  LLQ-----------------IINEKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCEY 205

Query: 1173 FASAVAS 1179
            + S + S
Sbjct: 206  YNSPIIS 212


>sp|Q92GH4|END3_RICCN Endonuclease III OS=Rickettsia conorii (strain ATCC VR-613 / Malish
            7) GN=nth PE=3 SV=1
          Length = 210

 Score = 42.7 bits (99), Expect = 0.022,   Method: Composition-based stats.
 Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 18/114 (15%)

Query: 1060 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1119
            L+ + G+G K+   V           VD +V R+A R+G        GD         P 
Sbjct: 111  LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVANRIGLAK-----GDT--------PE 157

Query: 1120 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 1173
            + + ++     L  +D K L   H+ +I  G+  CK R P+C  CP++  C ++
Sbjct: 158  IVENEL-----LQIIDTKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCDYY 206


>sp|Q9V3I8|OGG1_DROME N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2 SV=2
          Length = 343

 Score = 41.6 bits (96), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 34/48 (70%), Gaps = 1/48 (2%)

Query: 1048 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIA 1094
            L+++P +K +E L  + G+G K  +C+ L+S+ H+ + PVD+++ RIA
Sbjct: 229  LKSMPFEKAREELTLLPGIGYKVADCICLMSMGHLESVPVDIHIYRIA 276


>sp|A9M2W6|PYRC_NEIM0 Dihydroorotase OS=Neisseria meningitidis serogroup C (strain
           053442) GN=pyrC PE=3 SV=1
          Length = 344

 Score = 40.0 bits (92), Expect = 0.14,   Method: Composition-based stats.
 Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)

Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
           +GS  + ++ +YLT N +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDNATSELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|Q9K0D1|PYRC_NEIMB Dihydroorotase OS=Neisseria meningitidis serogroup B (strain MC58)
           GN=pyrC PE=3 SV=1
          Length = 344

 Score = 39.3 bits (90), Expect = 0.22,   Method: Composition-based stats.
 Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)

Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
           +GS  + ++ +YLT N +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDNATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|O08760|OGG1_MOUSE N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2
          Length = 345

 Score = 39.3 bits (90), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 1036 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQH-IAFPVDVNVGRIA 1094
            L E  G   L+ LR  P ++  + L  + G+G K  +C+ L++L    A PVDV+V +IA
Sbjct: 216  LEEQGGPAWLQQLRVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIA 275

Query: 1095 VR-LGWVP 1101
             R  GW P
Sbjct: 276  HRDYGWHP 283


>sp|P29588|GTMR_METTF G/T mismatches repair enzyme OS=Methanobacterium thermoformicicum
            PE=1 SV=1
          Length = 221

 Score = 38.9 bits (89), Expect = 0.33,   Method: Composition-based stats.
 Identities = 45/197 (22%), Positives = 78/197 (39%), Gaps = 27/197 (13%)

Query: 980  LLRRIYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV 1037
            LLRR  +   ++ +DK  +    +E +    +SEIA  IKE G  N  A ++KE    ++
Sbjct: 44   LLRRTTAGHVKKIYDKFFVKYKCFEDILKTPKSEIAKDIKEIGLSNQRAEQLKELARVVI 103

Query: 1038 ELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 1097
              +G         VP  + ++ +LD+ G+G  +   V  L+    A  VD N  R+  R 
Sbjct: 104  NDYGG-------RVP--RNRKAILDLPGVGKYTCAAVMCLAFGKKAAMVDANFVRVINR- 153

Query: 1098 GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL-CYLDQKTLYELHYQMITFGKIFCKK 1156
                          +    Y  ++     LW      +      + +  ++ F  I C  
Sbjct: 154  --------------YFGGSYENLNYNHKALWELAETLVPGGKCRDFNLGLMDFSAIICAP 199

Query: 1157 RSPNCGACPLRGECKHF 1173
            R P C  C +   C ++
Sbjct: 200  RKPKCEKCGMSKLCSYY 216


>sp|Q08214|NTG2_YEAST DNA base excision repair N-glycosylase 2 OS=Saccharomyces cerevisiae
            (strain ATCC 204508 / S288c) GN=NTG2 PE=1 SV=1
          Length = 380

 Score = 37.4 bits (85), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 29/119 (24%), Positives = 53/119 (44%), Gaps = 30/119 (25%)

Query: 1090 VGRIAVRLGWVPLQPLPGDLHIH-LLKEYPVMDKI----------QMYLWPRLCYLDQKT 1138
            +G + ++ GW  +  +  D+H+H L K +  +D I          ++ +W     L    
Sbjct: 249  MGYLTLQKGWGLIAGICVDVHVHRLCKMWNWVDPIKCKTAEHTRKELQVW-----LPHSL 303

Query: 1139 LYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEF 1197
             YE++  ++ FG++ C  R   C  C         A+ V +AR      +EK I +S+F
Sbjct: 304  WYEINTVLVGFGQLICMARGKRCDLC--------LANDVCNAR------NEKLIESSKF 348


>sp|Q5F9Y1|PYRC_NEIG1 Dihydroorotase OS=Neisseria gonorrhoeae (strain ATCC 700825 / FA
           1090) GN=pyrC PE=3 SV=1
          Length = 344

 Score = 37.0 bits (84), Expect = 1.1,   Method: Composition-based stats.
 Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)

Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
           +GS  + ++ +YLT   +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|B4RJT2|PYRC_NEIG2 Dihydroorotase OS=Neisseria gonorrhoeae (strain NCCP11945) GN=pyrC
           PE=3 SV=1
          Length = 344

 Score = 37.0 bits (84), Expect = 1.1,   Method: Composition-based stats.
 Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)

Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
           +GS  + ++ +YLT   +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|Q9JVD6|PYRC_NEIMA Dihydroorotase OS=Neisseria meningitidis serogroup A / serotype 4A
           (strain Z2491) GN=pyrC PE=3 SV=1
          Length = 344

 Score = 36.6 bits (83), Expect = 1.6,   Method: Composition-based stats.
 Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)

Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
           +GS  + ++ +YLT   +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDQATPALVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
               G ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176


>sp|P53397|OGG1_YEAST N-glycosylase/DNA lyase OS=Saccharomyces cerevisiae (strain ATCC
            204508 / S288c) GN=OGG1 PE=1 SV=1
          Length = 376

 Score = 36.6 bits (83), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 1030 KEFLNRLVELHGSIDLEWLRNVPPDK----VKEYLLDIEGLGLKSVECVRLLSLQHIAF- 1084
            ++ +N   E + + D  +L+++  D     V+E+L+   G+G K  +CV L+ L      
Sbjct: 198  RKLVNDKAEANITSDTTYLQSICKDAQYEDVREHLMSYNGVGPKVADCVCLMGLHMDGIV 257

Query: 1085 PVDVNVGRIAVR 1096
            PVDV+V RIA R
Sbjct: 258  PVDVHVSRIAKR 269


>sp|P46303|UVEN_MICLC Ultraviolet N-glycosylase/AP lyase OS=Micrococcus luteus (strain ATCC
            4698 / DSM 20030 / JCM 1464 / NBRC 3333 / NCIMB 9278 /
            NCTC 2665 / VKM Ac-2230) GN=pdg PE=1 SV=2
          Length = 279

 Score = 36.6 bits (83), Expect = 1.8,   Method: Composition-based stats.
 Identities = 43/177 (24%), Positives = 65/177 (36%), Gaps = 26/177 (14%)

Query: 1000 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 1059
            D  A+  A E E+ + ++  G     A  I      LV  H          VP     E 
Sbjct: 75   DAHAMAAATEPELQELVRSTGFYRNKASAILRLSQELVGRHDG-------EVPARL--ED 125

Query: 1060 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1119
            L+ + G+G K+   V   +       VD + GR+A RLG+   +  PG       +  P 
Sbjct: 126  LVALPGVGRKTAFVVLGNAFGQPGITVDTHFGRLARRLGFTD-ETDPGKGRARRGRPVPP 184

Query: 1120 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 1176
                 M                L +++I  G+  C  R P CG CP+   C  +A+ 
Sbjct: 185  ARDWTM----------------LSHRLIFHGRRVCHARRPACGRCPIARWCPSYAAG 225


>sp|Q9CB92|END3_MYCLE Endonuclease III OS=Mycobacterium leprae (strain TN) GN=nth PE=3 SV=2
          Length = 245

 Score = 35.8 bits (81), Expect = 2.5,   Method: Composition-based stats.
 Identities = 33/126 (26%), Positives = 49/126 (38%), Gaps = 18/126 (14%)

Query: 1060 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1119
            L  + G+G K+   +   +       VD + GR+  R  W               +E PV
Sbjct: 122  LFTLPGVGRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTA-------------EEDPV 168

Query: 1120 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 1179
              K++  +   L   DQ TL  L +++I  G+  C  R P CG C L  +C  F      
Sbjct: 169  --KVEHAV-GELIERDQWTL--LSHRVIFHGRRVCHARKPACGVCVLAKDCPSFGLGPTE 223

Query: 1180 ARFALP 1185
               A P
Sbjct: 224  PLLAAP 229


>sp|Q9FNY7|OGG1_ARATH N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1
            SV=1
          Length = 365

 Score = 35.8 bits (81), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 1046 EWL---RNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 1094
            EWL   R V   +    L  + G+G K   C+ L SL QH A PVD +V +IA
Sbjct: 235  EWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIA 287


>sp|O70249|OGG1_RAT N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1
          Length = 345

 Score = 35.8 bits (81), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 1036 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQH-IAFPVDVNVGRIA 1094
            L E  G   L+ LR    ++  + L  + G+G K  +C+ L++L    A PVD++V +IA
Sbjct: 216  LEEQGGPAWLQQLRVASYEEAHKALCTLPGVGTKVADCICLMALDKPQAVPVDIHVWQIA 275

Query: 1095 VR-LGWVP 1101
             R  GW P
Sbjct: 276  HRDYGWQP 283


>sp|O27397|OGG1_METTH Probable N-glycosylase/DNA lyase OS=Methanothermobacter
            thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM
            10044 / NBRC 100330 / Delta H) GN=MTH_1342 PE=3 SV=1
          Length = 312

 Score = 35.8 bits (81), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 1039 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRI 1093
            L   +D+  +  +  D  +E LL++ G+G K  +C+ L   +   AFPVDV + RI
Sbjct: 205  LAEEMDIRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPVDVWIRRI 260


>sp|A6QQS3|SPERT_BOVIN Spermatid-associated protein OS=Bos taurus GN=SPERT PE=2 SV=2
          Length = 455

 Score = 35.4 bits (80), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 63/134 (47%), Gaps = 15/134 (11%)

Query: 333 LQEPGLGLQEGFQEPRIGESP-------HFQEQKIDSLAPFPGQSSELSIGAVEPFLDQS 385
           L +PG GL   F+EP+   +P           + + +L+  PG+     +G   P L   
Sbjct: 294 LPDPGPGLPSPFEEPKGLPAPPDDSKTLRALREMVSTLSAQPGE----EVGKGGPGLPDG 349

Query: 386 NSSLEFFPRQAQAINS-QAEFLKEQVMYRLDQLRKQINGPSSQLQEQNRVIQRKSWNPKG 444
           + SLE      QA+ + + E    QV+   ++L ++ N     L+E++R+ Q ++   K 
Sbjct: 350 SQSLELLREMNQALQALREENQSLQVLRDENRLLQEENRALHALREEHRLFQEEN---KA 406

Query: 445 LWFTAKSRGSQKII 458
           LW   K +  QK++
Sbjct: 407 LWENNKLKLQQKLV 420


>sp|P34648|YOT2_CAEEL Uncharacterized protein ZK632.2 OS=Caenorhabditis elegans
           GN=ZK632.2 PE=4 SV=1
          Length = 710

 Score = 34.7 bits (78), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 42/199 (21%), Positives = 81/199 (40%), Gaps = 29/199 (14%)

Query: 414 LDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPR 473
           LD   +Q+        +    +++  W  K +  T +S+  +K++++ +         P 
Sbjct: 454 LDDYIRQLEKSGGAGDDAKTKMEKSKWRQKLMAATHESQKLEKLVKIAK---------PA 504

Query: 474 YSRGLQVDMEQQLMQSTTAD---IIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPY 530
             +GL+     QL  +   D    ++K   +  R    Q P Q     GP    +A +P 
Sbjct: 505 VVKGLE-----QLETTAANDRQAFLKKLMGVRARKEIDQTPSQ-----GPGPSTSATLPA 554

Query: 531 QVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDD-GTTKDQTSNEDMEK 589
            V P+S     V H+ K    K++ E+     +  + N +  +D+  + KD+ S E  +K
Sbjct: 555 TVAPTSTKAVEVEHEKKMTPLKVEKEIAASLDSSEIKNSLPAVDEPSSVKDEVSEETPQK 614

Query: 590 WWQKEREVFEGRIQSFTAR 608
                 E F  ++Q   A+
Sbjct: 615 ------EAFGSKVQKRVAQ 627


>sp|A0R6D9|MAK_MYCS2 Maltokinase OS=Mycobacterium smegmatis (strain ATCC 700084 /
           mc(2)155) GN=mak PE=3 SV=1
          Length = 441

 Score = 34.7 bits (78), Expect = 6.6,   Method: Composition-based stats.
 Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 10/95 (10%)

Query: 484 QQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVP 543
           ++L+ +TTA  +R      +RDG   + LQ N ADGP   +  +V    GP  E+  +  
Sbjct: 20  RELVSATTAMAVR------LRDGLELVLLQANYADGPDERYQVIVATGSGPIDEYSVVAT 73

Query: 544 HQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTT 578
             I + +     + + DP   R   L+  ID+  T
Sbjct: 74  IGIADGQTAY--DALYDPDATRY--LLSLIDESAT 104


>sp|P63540|END3_MYCTU Endonuclease III OS=Mycobacterium tuberculosis GN=nth PE=3 SV=1
          Length = 245

 Score = 34.7 bits (78), Expect = 6.7,   Method: Composition-based stats.
 Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)

Query: 1007 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1066
            AD +E+   I+  G     A  +      LVE  G         VP    K  L+ + G+
Sbjct: 78   ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128

Query: 1067 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1126
            G K+   +   +       VD + GR+  R  W                E PV  K++  
Sbjct: 129  GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173

Query: 1127 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 1185
            +      +++K    L +++I  G+  C  R P CG C L  +C  F         A P
Sbjct: 174  VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229


>sp|P63541|END3_MYCBO Endonuclease III OS=Mycobacterium bovis (strain ATCC BAA-935 /
            AF2122/97) GN=nth PE=3 SV=1
          Length = 245

 Score = 34.7 bits (78), Expect = 6.7,   Method: Composition-based stats.
 Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)

Query: 1007 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1066
            AD +E+   I+  G     A  +      LVE  G         VP    K  L+ + G+
Sbjct: 78   ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128

Query: 1067 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1126
            G K+   +   +       VD + GR+  R  W                E PV  K++  
Sbjct: 129  GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173

Query: 1127 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 1185
            +      +++K    L +++I  G+  C  R P CG C L  +C  F         A P
Sbjct: 174  VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229


>sp|P31378|NTG1_YEAST Mitochondrial DNA base excision repair N-glycosylase 1
           OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
           GN=NTG1 PE=1 SV=1
          Length = 399

 Score = 34.3 bits (77), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 5/102 (4%)

Query: 413 RLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLW--FTAKSRGSQKIIQVERN-GLPTQK 469
           RL +L K ++    +  +Q R  Q ++W PKGLW        G  +II   RN G   Q 
Sbjct: 267 RLTKLWKWVDAQKCKTPDQTRT-QLQNWLPKGLWTEINGLLVGFGQIITKSRNLGDMLQF 325

Query: 470 MPPRYSR-GLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQL 510
           +PP   R  L  D++ QL +    +I+   K +   +G  +L
Sbjct: 326 LPPDDPRSSLDWDLQSQLYKEIQQNIMSYPKWVKYLEGKREL 367


>sp|A1KSU5|PYRC_NEIMF Dihydroorotase OS=Neisseria meningitidis serogroup C / serotype 2a
           (strain ATCC 700532 / FAM18) GN=pyrC PE=3 SV=1
          Length = 344

 Score = 34.3 bits (77), Expect = 9.2,   Method: Composition-based stats.
 Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 7/119 (5%)

Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
           +GS  + ++ +YLT   +  L   A  +    F L      T SD  +     L P+   
Sbjct: 65  EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124

Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
                 ++   G    VT+PE D    + DR+ AF  R+ +  L +  ++KV+ + + +
Sbjct: 125 MAKQDILFLVHGE---VTDPEID----IFDREAAFIERVMKPVLAQVPNLKVVFEHITT 176


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.315    0.132    0.391 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 575,831,686
Number of Sequences: 539616
Number of extensions: 25397629
Number of successful extensions: 75141
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 117
Number of HSP's that attempted gapping in prelim test: 74690
Number of HSP's gapped (non-prelim): 435
length of query: 1515
length of database: 191,569,459
effective HSP length: 131
effective length of query: 1384
effective length of database: 120,879,763
effective search space: 167297591992
effective search space used: 167297591992
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 68 (30.8 bits)