BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000431
(1515 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1
SV=2
Length = 1987
Score = 629 bits (1622), Expect = e-179, Method: Compositional matrix adjust.
Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)
Query: 978 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 1034
WD LR+ E ER+ + MDS+D+EA+R A SEI++AIKERG N++A RIK+FL
Sbjct: 1450 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1509
Query: 1035 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 1094
R+V+ HG IDLEWLR PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1510 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1569
Query: 1095 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 1154
VR+GWVPLQPLP L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1570 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1629
Query: 1155 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 1214
K PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++ PP+ + I +
Sbjct: 1630 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIEL 1689
Query: 1215 -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 1273
+ +E S + NCEPIIEEP SPG +C+E ES+I+D ++ D +EIPT++L
Sbjct: 1690 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1748
Query: 1274 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 1330
+F ME N + LQ+ S ALVAL S+PT KLK + LRTEH VY LPD
Sbjct: 1749 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1807
Query: 1331 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 1389
+H LL G D+R+PDDPSPYLLAIW+PGE+ NS P+ +C K S K+C E C CNS+
Sbjct: 1808 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1867
Query: 1390 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1449
E N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR I +L R
Sbjct: 1868 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1927
Query: 1450 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1503
V FG+S T++F L QIQ FW+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 1928 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1981
Score = 137 bits (344), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 122/194 (62%), Gaps = 20/194 (10%)
Query: 486 LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 545
L Q + A+II + ++L + D + NA+V Y+ GALVP++
Sbjct: 904 LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 946
Query: 546 IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 605
K++K +P+V +D T R+WNL+M D D+ ++ EKWW++ER VF GR SF
Sbjct: 947 --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 1004
Query: 606 TARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS 665
ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP K ++
Sbjct: 1005 IARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSRE-D 1063
Query: 666 DENLRTTASLEPIG 679
+ N+R+ +P G
Sbjct: 1064 ERNVRSVVVEDPEG 1077
Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 9/98 (9%)
Query: 6 PRVLVEGMPKKEP-KPKKAPQQPKTPKPVTPNRAHETTRKRKYVRKNSVSGNVVMDESVS 64
P+V+VEG PK++P KP + P+ KP R T K K S + + +
Sbjct: 306 PKVVVEGKPKRKPRKPAELPKVVVEGKPKRKPRKAATQEKVKSKETGSAKKKNLKESATK 365
Query: 65 KRRKCGRGNRQSNSDAIILLPPPPIKTCRQALNFDLEN 102
K G + +S P +K+CR+ALNFDLEN
Sbjct: 366 KPANVGDMSNKS--------PEVTLKSCRKALNFDLEN 395
>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2
Length = 1393
Score = 623 bits (1606), Expect = e-177, Method: Compositional matrix adjust.
Identities = 312/538 (57%), Positives = 374/538 (69%), Gaps = 22/538 (4%)
Query: 977 DWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 1033
DWD LRR + E++ MD+VDW+A+R AD E+A+ IK RG + +A RI+ FL
Sbjct: 858 DWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFL 917
Query: 1034 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 1093
+RLV HGSIDLEWLR+VPPDK KEYLL GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 918 DRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRI 977
Query: 1094 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 1153
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQKTLYELHYQMITFGK+F
Sbjct: 978 AVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVF 1037
Query: 1154 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN-PI 1212
C K PNC ACP++GEC+HFASA ASAR ALP ++E G G PL ++ P
Sbjct: 1038 CTKSKPNCNACPMKGECRHFASAFASARLALP--------STEKGMGTPDKNPLPLHLPE 1089
Query: 1213 PVIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPT 1269
P R + V SE ++ CEPIIEEP SP P+ +E ++I++ F D EEIPT
Sbjct: 1090 PFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSIADIEEAFFE--DPEEIPT 1147
Query: 1270 LRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVY 1326
+RL F N ME NK LQD SSALVALTAE AS+P KLK + LRTEH VY
Sbjct: 1148 IRLNMDAFTSNLKKIMEHNK-ELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVY 1206
Query: 1327 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC-NSKDSKLCNSEICYS 1385
LPD H LL ++R+PDDP YLLAIW+PGE+ +S+ C + LC+ E C+S
Sbjct: 1207 ELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFS 1266
Query: 1386 CNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIAN 1445
CNS+ E + IVRGTILIPCRTA RG FPLNGTYFQVNEVFADH +S +PINVPR I
Sbjct: 1267 CNSIKETRSQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWE 1326
Query: 1446 LRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1503
L R V FG+S T+F L +IQ FW+G+VCVRGFDR+ R P+PL+ RLH SK
Sbjct: 1327 LPRRTVYFGTSVPTIFKGLSTEKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASK 1384
Score = 114 bits (285), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 72/136 (52%), Positives = 93/136 (68%), Gaps = 7/136 (5%)
Query: 526 AVVPYQVGP-----SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKD 580
A+VPY + GA+VP K+ + +P+V LD T R+W L++ + D
Sbjct: 494 ALVPYTMNSQIVLFGGGAGAIVP-VTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVD 552
Query: 581 QTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSD 640
S+E KWW++ER VF GR SF ARMHL+ GDRRF PWKGSVVDSVVGV+LTQNVSD
Sbjct: 553 -GSDEQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSD 611
Query: 641 NLSSSAYMSLAARFPL 656
+LSSSA+MSLA++FP+
Sbjct: 612 HLSSSAFMSLASQFPV 627
>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2
Length = 1332
Score = 581 bits (1498), Expect = e-164, Method: Compositional matrix adjust.
Identities = 323/717 (45%), Positives = 428/717 (59%), Gaps = 39/717 (5%)
Query: 814 ITQSSVTQFWP-TGNSTADVASPSKT---------------CIKESSIAASTEIPQLENT 857
+TQ ++ P TG ST + +P++ C +ESS + + I + +
Sbjct: 613 VTQETILNLDPRTGVSTPRIRNPTRVIIEEIDDDENDIDAVCSQESSKTSDSSITSADQS 672
Query: 858 -ALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEIS 916
+L D + +L ++ K G+ T NDL +SS+ + ++
Sbjct: 673 KTMLLDPFNTVLMNEQVDSQMVK----GKGHIPYTDDLNDLSQGISMVSSASTHCELNLN 728
Query: 917 VLPSNRESFRTGM-PQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQ 975
+P E P++ T ++S R + K K + KK+ S
Sbjct: 729 EVPPEVELCSHQQDPESTIQTQDQQESTRTEDVKKNRKKPTTSKPKKKSKESAKSTQKKS 788
Query: 976 QDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEF 1032
DWD LR+ +G ER+ MD+VDW+A+RC D +IA+ I +RG N++A RIK F
Sbjct: 789 VDWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAF 848
Query: 1033 LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 1092
LNRLV+ HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLLSL IAFPVD NVGR
Sbjct: 849 LNRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGR 908
Query: 1093 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 1152
IAVRLGWVPLQPLP +L +HLL+ YPV++ +Q YLWPRLC LDQKTLYELHY MITFGK+
Sbjct: 909 IAVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKV 968
Query: 1153 FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPI 1212
FC K PNC ACP++ EC+H++SA ASAR ALP P E + + P+VVN
Sbjct: 969 FCTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFR 1028
Query: 1213 PVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEE------ 1266
P + + + E+ + NCEPIIEEP SP P+ E + D ++G E+
Sbjct: 1029 PSLFLYQEKEQEA-QRSQNCEPIIEEPASPEPEYIEHDIEDYPRDKNNVGTSEDPWENKD 1087
Query: 1267 -IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHV 1325
IPT+ L + + + + +NK S LV L+ AA++P RKLK LRTEHHV
Sbjct: 1088 VIPTIILNK---EAGTSHDLVVNKEA-GTSHDLVVLSTYAAAIPRRKLKIKEKLRTEHHV 1143
Query: 1326 YVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN--SKDSKLCNSEIC 1383
+ LPD H +L GF+RR+ +D PYLLAIW+PGE+ NS+ P+ RC ++ LCN C
Sbjct: 1144 FELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNENKC 1203
Query: 1384 YSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTI 1443
+ CN E + VRGTILIPCRTA RG FPLNGTYFQ NEVFADH++S +PI+VP I
Sbjct: 1204 FQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTELI 1263
Query: 1444 ANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCS 1500
+L+R + GSS +++ L + I+ F G+VCVRGFDR R P+ LV RLHCS
Sbjct: 1264 WDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLHCS 1320
Score = 91.7 bits (226), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 80/171 (46%), Positives = 105/171 (61%), Gaps = 17/171 (9%)
Query: 488 QSTTADIIRK-FKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQI 546
+S T D I K F++L I G LP N A++ Y+ E A+V +
Sbjct: 443 RSLTIDAITKLFEELDINKEGLCLP---------HNRETALILYKKS-YEEQKAIVKYS- 491
Query: 547 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 606
KK KP+V LDP T R+W L+M+ D D S+E+ KWW++ER +F GR SF
Sbjct: 492 ----KKQKPKVQLDPETSRVWKLLMSSIDCDGVD-GSDEEKRKWWEEERNMFHGRANSFI 546
Query: 607 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 657
ARM ++ G+R F PWKGSVVDSVVGV+LTQNV+D+ SSSAYM LAA FP++
Sbjct: 547 ARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVADHSSSSAYMDLAAEFPVE 597
>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2
Length = 1044
Score = 511 bits (1316), Expect = e-143, Method: Compositional matrix adjust.
Identities = 268/538 (49%), Positives = 342/538 (63%), Gaps = 14/538 (2%)
Query: 977 DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 1036
DW+ LRR+Y+ R MDSV+W VR + ++ + IK+RGQ I++ RI +FLN
Sbjct: 501 DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRILSERILKFLNDE 560
Query: 1037 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 1096
V +G+IDLEWLRN P VK YLL+IEG+GLKS ECVRLL L+H AFPVD NVGRIAVR
Sbjct: 561 VNQNGNIDLEWLRNAPSHLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVR 620
Query: 1097 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 1156
LG VPL+PLP + +H L EYP MD IQ YLWPRLC L Q+TLYELHYQMITFGK+FC K
Sbjct: 621 LGLVPLEPLPNGVQMHQLFEYPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTK 680
Query: 1157 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 1216
PNC ACP++ ECK+FASA S++ L P EK + F N Q + V+ I
Sbjct: 681 TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 738
Query: 1217 IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 1273
+ + VS C +P++E P SP + ES ++I+D F +P +
Sbjct: 739 LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 796
Query: 1274 EREFKENFPNFMEMNKVMLQD----SSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 1326
K++ + + ++ M S ALV T E A +P RK+K LRTEH VY
Sbjct: 797 LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 856
Query: 1327 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSC 1386
VLPD HELL F+RR DDPSPYLLAIW PGE+ +S P+ +C+S SKLC + C C
Sbjct: 857 VLPDNHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSSDGSKLCKIKNCSYC 916
Query: 1387 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1446
++ E+N+ I RGTILIPCRTA RG FPLNGTYFQ NEVFADHETS +PI R L
Sbjct: 917 WTIREQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGL 976
Query: 1447 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKK 1504
+ + GS+ T++F LD R+I+ FW GF+C+R FDR+ R P+ LV RLH P ++
Sbjct: 977 EKRALYCGSTVTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTPPDER 1034
Score = 94.4 bits (233), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 82/115 (71%), Gaps = 2/115 (1%)
Query: 542 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 601
V IK +K + +V LDP T++ W+++M ++D ++ E KW +KERE+F+ R
Sbjct: 334 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSYDDKETEAKW-KKEREIFQTR 391
Query: 602 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 656
I F RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 392 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 446
>sp|Q9WYK0|END3_THEMA Endonuclease III OS=Thermotoga maritima (strain ATCC 43589 / MSB8 /
DSM 3109 / JCM 10099) GN=nth PE=3 SV=1
Length = 213
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 27/167 (16%)
Query: 1007 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1066
A ++ D IKE G A RI E LVE +G PD ++E LL + G+
Sbjct: 60 AKPEDLYDLIKESGMYRQKAERIVEISRILVEKYGG--------RVPDSLEE-LLKLPGV 110
Query: 1067 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1126
G K+ V + + A VD +V RI+ RLGWV + P + L K P
Sbjct: 111 GRKTANIVLWVGFKKPALAVDTHVHRISNRLGWVKTRT-PEETEEALKKLLP------ED 163
Query: 1127 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 1173
LW ++ M+ FG+ CK ++P C C L+ C+ +
Sbjct: 164 LWG-----------PINGSMVEFGRRICKPQNPLCEECFLKNHCEFY 199
>sp|O05956|END3_RICPR Endonuclease III OS=Rickettsia prowazekii (strain Madrid E) GN=nth
PE=3 SV=1
Length = 212
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)
Query: 1053 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1112
P+ KE L+ + G+G K+ V VD +V R++ R+G G+
Sbjct: 105 PNNFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTAAI 158
Query: 1113 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1172
+ KE L +D+K L H+ +I G+ CK R P C CP++ C++
Sbjct: 159 VEKEL-------------LQIIDEKWLTYAHHWLILHGRYICKARKPGCNICPIKEYCEY 205
Query: 1173 FASAVAS 1179
+ + +S
Sbjct: 206 YINTFSS 212
>sp|P39788|END3_BACSU Probable endonuclease III OS=Bacillus subtilis (strain 168) GN=nth
PE=3 SV=1
Length = 219
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 1011 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 1070
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPRDR--DELVKLPGVGRKT 122
Query: 1071 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 1130
V ++ A VD +V R++ RLG + ++ L+++ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 175
Query: 1131 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 1171
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 206
>sp|Q58829|Y1434_METJA Putative endonuclease MJ1434 OS=Methanocaldococcus jannaschii (strain
ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440)
GN=MJ1434 PE=3 SV=1
Length = 220
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 42/165 (25%), Positives = 74/165 (44%), Gaps = 20/165 (12%)
Query: 1008 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV--KEYLLDIEG 1065
DE ++ + I+ G N+ A R+K +VE +G + E + D + + LL I G
Sbjct: 71 DEDKLKELIRPAGFYNLKAKRLKNVTKFIVENYG--NTEEMAKTDKDTLILRAELLSING 128
Query: 1066 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 1125
+G ++ + + L +L +F VD R+ RLG ++ E D+I+
Sbjct: 129 VGKETADSILLYALDRESFVVDAYTKRMFSRLG--------------VINEKAKYDEIKE 174
Query: 1126 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 1170
L D + E H ++ K FC+K++ C CP++ C
Sbjct: 175 IFEKNLPK-DLEIYKEYHALIVEHCKKFCRKKAL-CDNCPIKEFC 217
>sp|Q68W04|END3_RICTY Endonuclease III OS=Rickettsia typhi (strain ATCC VR-144 /
Wilmington) GN=nth PE=3 SV=1
Length = 212
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 19/127 (14%)
Query: 1053 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1112
P+ KE L+ + G+G K+ V VD +V R++ R+G G+ +
Sbjct: 105 PNDFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTTVI 158
Query: 1113 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1172
+ KE L +D+K L H+ ++ G+ CK R P+C C ++ C++
Sbjct: 159 VEKEL-------------LQIIDEKWLTHAHHWLVLHGRYICKARKPSCRICHIKEYCEY 205
Query: 1173 FASAVAS 1179
+ + AS
Sbjct: 206 YINTFAS 212
>sp|Q4UK93|END3_RICFE Endonuclease III OS=Rickettsia felis (strain ATCC VR-1525 /
URRWXCal2) GN=nth PE=3 SV=1
Length = 213
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)
Query: 1053 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1112
P+ KE L+ + G+G K+ V VD +V R+A R+G P +
Sbjct: 105 PNDFKE-LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVAKRIGLARGNS-PEIVEKE 162
Query: 1113 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1172
LL+ +++K L H+ +I G+ CK R P+C CP++ C++
Sbjct: 163 LLQ-----------------IINEKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCEY 205
Query: 1173 FASAVAS 1179
+ S + S
Sbjct: 206 YNSPIIS 212
>sp|Q92GH4|END3_RICCN Endonuclease III OS=Rickettsia conorii (strain ATCC VR-613 / Malish
7) GN=nth PE=3 SV=1
Length = 210
Score = 42.7 bits (99), Expect = 0.022, Method: Composition-based stats.
Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 18/114 (15%)
Query: 1060 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1119
L+ + G+G K+ V VD +V R+A R+G GD P
Sbjct: 111 LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVANRIGLAK-----GDT--------PE 157
Query: 1120 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 1173
+ + ++ L +D K L H+ +I G+ CK R P+C CP++ C ++
Sbjct: 158 IVENEL-----LQIIDTKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCDYY 206
>sp|Q9V3I8|OGG1_DROME N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2 SV=2
Length = 343
Score = 41.6 bits (96), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 34/48 (70%), Gaps = 1/48 (2%)
Query: 1048 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIA 1094
L+++P +K +E L + G+G K +C+ L+S+ H+ + PVD+++ RIA
Sbjct: 229 LKSMPFEKAREELTLLPGIGYKVADCICLMSMGHLESVPVDIHIYRIA 276
>sp|A9M2W6|PYRC_NEIM0 Dihydroorotase OS=Neisseria meningitidis serogroup C (strain
053442) GN=pyrC PE=3 SV=1
Length = 344
Score = 40.0 bits (92), Expect = 0.14, Method: Composition-based stats.
Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)
Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
+GS + ++ +YLT N + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDNATSELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|Q9K0D1|PYRC_NEIMB Dihydroorotase OS=Neisseria meningitidis serogroup B (strain MC58)
GN=pyrC PE=3 SV=1
Length = 344
Score = 39.3 bits (90), Expect = 0.22, Method: Composition-based stats.
Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)
Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
+GS + ++ +YLT N + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDNATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|O08760|OGG1_MOUSE N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2
Length = 345
Score = 39.3 bits (90), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 1036 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQH-IAFPVDVNVGRIA 1094
L E G L+ LR P ++ + L + G+G K +C+ L++L A PVDV+V +IA
Sbjct: 216 LEEQGGPAWLQQLRVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIA 275
Query: 1095 VR-LGWVP 1101
R GW P
Sbjct: 276 HRDYGWHP 283
>sp|P29588|GTMR_METTF G/T mismatches repair enzyme OS=Methanobacterium thermoformicicum
PE=1 SV=1
Length = 221
Score = 38.9 bits (89), Expect = 0.33, Method: Composition-based stats.
Identities = 45/197 (22%), Positives = 78/197 (39%), Gaps = 27/197 (13%)
Query: 980 LLRRIYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV 1037
LLRR + ++ +DK + +E + +SEIA IKE G N A ++KE ++
Sbjct: 44 LLRRTTAGHVKKIYDKFFVKYKCFEDILKTPKSEIAKDIKEIGLSNQRAEQLKELARVVI 103
Query: 1038 ELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 1097
+G VP + ++ +LD+ G+G + V L+ A VD N R+ R
Sbjct: 104 NDYGG-------RVP--RNRKAILDLPGVGKYTCAAVMCLAFGKKAAMVDANFVRVINR- 153
Query: 1098 GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL-CYLDQKTLYELHYQMITFGKIFCKK 1156
+ Y ++ LW + + + ++ F I C
Sbjct: 154 --------------YFGGSYENLNYNHKALWELAETLVPGGKCRDFNLGLMDFSAIICAP 199
Query: 1157 RSPNCGACPLRGECKHF 1173
R P C C + C ++
Sbjct: 200 RKPKCEKCGMSKLCSYY 216
>sp|Q08214|NTG2_YEAST DNA base excision repair N-glycosylase 2 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=NTG2 PE=1 SV=1
Length = 380
Score = 37.4 bits (85), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 53/119 (44%), Gaps = 30/119 (25%)
Query: 1090 VGRIAVRLGWVPLQPLPGDLHIH-LLKEYPVMDKI----------QMYLWPRLCYLDQKT 1138
+G + ++ GW + + D+H+H L K + +D I ++ +W L
Sbjct: 249 MGYLTLQKGWGLIAGICVDVHVHRLCKMWNWVDPIKCKTAEHTRKELQVW-----LPHSL 303
Query: 1139 LYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEF 1197
YE++ ++ FG++ C R C C A+ V +AR +EK I +S+F
Sbjct: 304 WYEINTVLVGFGQLICMARGKRCDLC--------LANDVCNAR------NEKLIESSKF 348
>sp|Q5F9Y1|PYRC_NEIG1 Dihydroorotase OS=Neisseria gonorrhoeae (strain ATCC 700825 / FA
1090) GN=pyrC PE=3 SV=1
Length = 344
Score = 37.0 bits (84), Expect = 1.1, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|B4RJT2|PYRC_NEIG2 Dihydroorotase OS=Neisseria gonorrhoeae (strain NCCP11945) GN=pyrC
PE=3 SV=1
Length = 344
Score = 37.0 bits (84), Expect = 1.1, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|Q9JVD6|PYRC_NEIMA Dihydroorotase OS=Neisseria meningitidis serogroup A / serotype 4A
(strain Z2491) GN=pyrC PE=3 SV=1
Length = 344
Score = 36.6 bits (83), Expect = 1.6, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPALVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|P53397|OGG1_YEAST N-glycosylase/DNA lyase OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=OGG1 PE=1 SV=1
Length = 376
Score = 36.6 bits (83), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 1030 KEFLNRLVELHGSIDLEWLRNVPPDK----VKEYLLDIEGLGLKSVECVRLLSLQHIAF- 1084
++ +N E + + D +L+++ D V+E+L+ G+G K +CV L+ L
Sbjct: 198 RKLVNDKAEANITSDTTYLQSICKDAQYEDVREHLMSYNGVGPKVADCVCLMGLHMDGIV 257
Query: 1085 PVDVNVGRIAVR 1096
PVDV+V RIA R
Sbjct: 258 PVDVHVSRIAKR 269
>sp|P46303|UVEN_MICLC Ultraviolet N-glycosylase/AP lyase OS=Micrococcus luteus (strain ATCC
4698 / DSM 20030 / JCM 1464 / NBRC 3333 / NCIMB 9278 /
NCTC 2665 / VKM Ac-2230) GN=pdg PE=1 SV=2
Length = 279
Score = 36.6 bits (83), Expect = 1.8, Method: Composition-based stats.
Identities = 43/177 (24%), Positives = 65/177 (36%), Gaps = 26/177 (14%)
Query: 1000 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 1059
D A+ A E E+ + ++ G A I LV H VP E
Sbjct: 75 DAHAMAAATEPELQELVRSTGFYRNKASAILRLSQELVGRHDG-------EVPARL--ED 125
Query: 1060 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1119
L+ + G+G K+ V + VD + GR+A RLG+ + PG + P
Sbjct: 126 LVALPGVGRKTAFVVLGNAFGQPGITVDTHFGRLARRLGFTD-ETDPGKGRARRGRPVPP 184
Query: 1120 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 1176
M L +++I G+ C R P CG CP+ C +A+
Sbjct: 185 ARDWTM----------------LSHRLIFHGRRVCHARRPACGRCPIARWCPSYAAG 225
>sp|Q9CB92|END3_MYCLE Endonuclease III OS=Mycobacterium leprae (strain TN) GN=nth PE=3 SV=2
Length = 245
Score = 35.8 bits (81), Expect = 2.5, Method: Composition-based stats.
Identities = 33/126 (26%), Positives = 49/126 (38%), Gaps = 18/126 (14%)
Query: 1060 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1119
L + G+G K+ + + VD + GR+ R W +E PV
Sbjct: 122 LFTLPGVGRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTA-------------EEDPV 168
Query: 1120 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 1179
K++ + L DQ TL L +++I G+ C R P CG C L +C F
Sbjct: 169 --KVEHAV-GELIERDQWTL--LSHRVIFHGRRVCHARKPACGVCVLAKDCPSFGLGPTE 223
Query: 1180 ARFALP 1185
A P
Sbjct: 224 PLLAAP 229
>sp|Q9FNY7|OGG1_ARATH N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1
SV=1
Length = 365
Score = 35.8 bits (81), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 1046 EWL---RNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 1094
EWL R V + L + G+G K C+ L SL QH A PVD +V +IA
Sbjct: 235 EWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIA 287
>sp|O70249|OGG1_RAT N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1
Length = 345
Score = 35.8 bits (81), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 1036 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQH-IAFPVDVNVGRIA 1094
L E G L+ LR ++ + L + G+G K +C+ L++L A PVD++V +IA
Sbjct: 216 LEEQGGPAWLQQLRVASYEEAHKALCTLPGVGTKVADCICLMALDKPQAVPVDIHVWQIA 275
Query: 1095 VR-LGWVP 1101
R GW P
Sbjct: 276 HRDYGWQP 283
>sp|O27397|OGG1_METTH Probable N-glycosylase/DNA lyase OS=Methanothermobacter
thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM
10044 / NBRC 100330 / Delta H) GN=MTH_1342 PE=3 SV=1
Length = 312
Score = 35.8 bits (81), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 1039 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRI 1093
L +D+ + + D +E LL++ G+G K +C+ L + AFPVDV + RI
Sbjct: 205 LAEEMDIRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPVDVWIRRI 260
>sp|A6QQS3|SPERT_BOVIN Spermatid-associated protein OS=Bos taurus GN=SPERT PE=2 SV=2
Length = 455
Score = 35.4 bits (80), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 63/134 (47%), Gaps = 15/134 (11%)
Query: 333 LQEPGLGLQEGFQEPRIGESP-------HFQEQKIDSLAPFPGQSSELSIGAVEPFLDQS 385
L +PG GL F+EP+ +P + + +L+ PG+ +G P L
Sbjct: 294 LPDPGPGLPSPFEEPKGLPAPPDDSKTLRALREMVSTLSAQPGE----EVGKGGPGLPDG 349
Query: 386 NSSLEFFPRQAQAINS-QAEFLKEQVMYRLDQLRKQINGPSSQLQEQNRVIQRKSWNPKG 444
+ SLE QA+ + + E QV+ ++L ++ N L+E++R+ Q ++ K
Sbjct: 350 SQSLELLREMNQALQALREENQSLQVLRDENRLLQEENRALHALREEHRLFQEEN---KA 406
Query: 445 LWFTAKSRGSQKII 458
LW K + QK++
Sbjct: 407 LWENNKLKLQQKLV 420
>sp|P34648|YOT2_CAEEL Uncharacterized protein ZK632.2 OS=Caenorhabditis elegans
GN=ZK632.2 PE=4 SV=1
Length = 710
Score = 34.7 bits (78), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 42/199 (21%), Positives = 81/199 (40%), Gaps = 29/199 (14%)
Query: 414 LDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPR 473
LD +Q+ + +++ W K + T +S+ +K++++ + P
Sbjct: 454 LDDYIRQLEKSGGAGDDAKTKMEKSKWRQKLMAATHESQKLEKLVKIAK---------PA 504
Query: 474 YSRGLQVDMEQQLMQSTTAD---IIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPY 530
+GL+ QL + D ++K + R Q P Q GP +A +P
Sbjct: 505 VVKGLE-----QLETTAANDRQAFLKKLMGVRARKEIDQTPSQ-----GPGPSTSATLPA 554
Query: 531 QVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDD-GTTKDQTSNEDMEK 589
V P+S V H+ K K++ E+ + + N + +D+ + KD+ S E +K
Sbjct: 555 TVAPTSTKAVEVEHEKKMTPLKVEKEIAASLDSSEIKNSLPAVDEPSSVKDEVSEETPQK 614
Query: 590 WWQKEREVFEGRIQSFTAR 608
E F ++Q A+
Sbjct: 615 ------EAFGSKVQKRVAQ 627
>sp|A0R6D9|MAK_MYCS2 Maltokinase OS=Mycobacterium smegmatis (strain ATCC 700084 /
mc(2)155) GN=mak PE=3 SV=1
Length = 441
Score = 34.7 bits (78), Expect = 6.6, Method: Composition-based stats.
Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 10/95 (10%)
Query: 484 QQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVP 543
++L+ +TTA +R +RDG + LQ N ADGP + +V GP E+ +
Sbjct: 20 RELVSATTAMAVR------LRDGLELVLLQANYADGPDERYQVIVATGSGPIDEYSVVAT 73
Query: 544 HQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTT 578
I + + + + DP R L+ ID+ T
Sbjct: 74 IGIADGQTAY--DALYDPDATRY--LLSLIDESAT 104
>sp|P63540|END3_MYCTU Endonuclease III OS=Mycobacterium tuberculosis GN=nth PE=3 SV=1
Length = 245
Score = 34.7 bits (78), Expect = 6.7, Method: Composition-based stats.
Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)
Query: 1007 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1066
AD +E+ I+ G A + LVE G VP K L+ + G+
Sbjct: 78 ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128
Query: 1067 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1126
G K+ + + VD + GR+ R W E PV K++
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173
Query: 1127 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 1185
+ +++K L +++I G+ C R P CG C L +C F A P
Sbjct: 174 VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229
>sp|P63541|END3_MYCBO Endonuclease III OS=Mycobacterium bovis (strain ATCC BAA-935 /
AF2122/97) GN=nth PE=3 SV=1
Length = 245
Score = 34.7 bits (78), Expect = 6.7, Method: Composition-based stats.
Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)
Query: 1007 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1066
AD +E+ I+ G A + LVE G VP K L+ + G+
Sbjct: 78 ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128
Query: 1067 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1126
G K+ + + VD + GR+ R W E PV K++
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173
Query: 1127 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 1185
+ +++K L +++I G+ C R P CG C L +C F A P
Sbjct: 174 VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229
>sp|P31378|NTG1_YEAST Mitochondrial DNA base excision repair N-glycosylase 1
OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=NTG1 PE=1 SV=1
Length = 399
Score = 34.3 bits (77), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 5/102 (4%)
Query: 413 RLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLW--FTAKSRGSQKIIQVERN-GLPTQK 469
RL +L K ++ + +Q R Q ++W PKGLW G +II RN G Q
Sbjct: 267 RLTKLWKWVDAQKCKTPDQTRT-QLQNWLPKGLWTEINGLLVGFGQIITKSRNLGDMLQF 325
Query: 470 MPPRYSR-GLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQL 510
+PP R L D++ QL + +I+ K + +G +L
Sbjct: 326 LPPDDPRSSLDWDLQSQLYKEIQQNIMSYPKWVKYLEGKREL 367
>sp|A1KSU5|PYRC_NEIMF Dihydroorotase OS=Neisseria meningitidis serogroup C / serotype 2a
(strain ATCC 700532 / FAM18) GN=pyrC PE=3 SV=1
Length = 344
Score = 34.3 bits (77), Expect = 9.2, Method: Composition-based stats.
Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 7/119 (5%)
Query: 622 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 681
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 682 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 740
++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQDILFLVHGE---VTDPEID----IFDREAAFIERVMKPVLAQVPNLKVVFEHITT 176
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.132 0.391
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 575,831,686
Number of Sequences: 539616
Number of extensions: 25397629
Number of successful extensions: 75141
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 117
Number of HSP's that attempted gapping in prelim test: 74690
Number of HSP's gapped (non-prelim): 435
length of query: 1515
length of database: 191,569,459
effective HSP length: 131
effective length of query: 1384
effective length of database: 120,879,763
effective search space: 167297591992
effective search space used: 167297591992
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 68 (30.8 bits)