BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000414
(1541 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1
SV=2
Length = 1987
Score = 629 bits (1621), Expect = e-179, Method: Compositional matrix adjust.
Identities = 307/534 (57%), Positives = 382/534 (71%), Gaps = 10/534 (1%)
Query: 1004 WDLLRRIYSTGE---ERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLN 1060
WD LR+ E ER+ + MDS+D+EA+R A SEI++AIKERG N++A RIK+FL
Sbjct: 1450 WDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLE 1509
Query: 1061 RLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIA 1120
R+V+ HG IDLEWLR PPDK K+YLL I GLGLKSVECVRLL+L ++AFPVD NVGRIA
Sbjct: 1510 RIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIA 1569
Query: 1121 VRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFC 1180
VR+GWVPLQPLP L +HLL+ YPV++ IQ +LWPRLC LDQ+TLYELHYQ+ITFGK+FC
Sbjct: 1570 VRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFC 1629
Query: 1181 KKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPV 1240
K PNC ACP+RGEC+HFASA ASAR ALP P E+ + ++ PP+ + I +
Sbjct: 1630 TKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIEL 1689
Query: 1241 -IRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPTLRLQ 1299
+ +E S + NCEPIIEEP SPG +C+E ES+I+D ++ D +EIPT++L
Sbjct: 1690 PLPLEKSLASGAPSNRENCEPIIEEPASPGQECTEITESDIEDAYYN-EDPDEIPTIKLN 1748
Query: 1300 EREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVYVLPD 1356
+F ME N + LQ+ S ALVAL S+PT KLK + LRTEH VY LPD
Sbjct: 1749 IEQFGMTLREHMERN-MELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPD 1807
Query: 1357 AHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDS-KLCNSEICYSCNSV 1415
+H LL G D+R+PDDPSPYLLAIW+PGE+ NS P+ +C K S K+C E C CNS+
Sbjct: 1808 SHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSL 1867
Query: 1416 CERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANLRRE 1475
E N+ VRGT+LIPCRTA RG FPLNGTYFQVNE+FADHE+S PI+VPR I +L R
Sbjct: 1868 REANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRR 1927
Query: 1476 IVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1529
V FG+S T++F L QIQ FW+GFVCVRGF+++ R PRPL+ RLH SK
Sbjct: 1928 TVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASK 1981
Score = 137 bits (345), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 122/194 (62%), Gaps = 20/194 (10%)
Query: 512 LMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQ 571
L Q + A+II + ++L + D + NA+V Y+ GALVP++
Sbjct: 904 LCQDSIAEIIYRMQNLYLGDKERE------------QEQNAMVLYK-----GDGALVPYE 946
Query: 572 IKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSF 631
K++K +P+V +D T R+WNL+M D D+ ++ EKWW++ER VF GR SF
Sbjct: 947 --SKKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSF 1004
Query: 632 TARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPS 691
ARMHL+ GDRRF PWKGSVVDSV+GV+LTQNVSD+LSSSA+MSLAARFP K ++
Sbjct: 1005 IARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSRE-D 1063
Query: 692 DENLRTTASLEPIG 705
+ N+R+ +P G
Sbjct: 1064 ERNVRSVVVEDPEG 1077
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 47/98 (47%), Gaps = 9/98 (9%)
Query: 6 PRVLVEGMPKKEP-KPKKAPQQPKTPKPVTPNRAHETTRKRKYVRKNSVSGNVVMDESVS 64
P+V+VEG PK++P KP + P+ KP R T K K S + + +
Sbjct: 306 PKVVVEGKPKRKPRKPAELPKVVVEGKPKRKPRKAATQEKVKSKETGSAKKKNLKESATK 365
Query: 65 KRRKCGRGNRQSNSDAIILLPPPPIKTCRQALNFDLEN 102
K G + +S P +K+CR+ALNFDLEN
Sbjct: 366 KPANVGDMSNKS--------PEVTLKSCRKALNFDLEN 395
>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2
Length = 1393
Score = 622 bits (1604), Expect = e-177, Method: Compositional matrix adjust.
Identities = 312/538 (57%), Positives = 374/538 (69%), Gaps = 22/538 (4%)
Query: 1003 DWDLLRR---IYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFL 1059
DWD LRR + E++ MD+VDW+A+R AD E+A+ IK RG + +A RI+ FL
Sbjct: 858 DWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFL 917
Query: 1060 NRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRI 1119
+RLV HGSIDLEWLR+VPPDK KEYLL GLGLKSVECVRLL+L H+AFPVD NVGRI
Sbjct: 918 DRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRI 977
Query: 1120 AVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIF 1179
AVRLGWVPLQPLP L +HLL+ YP+++ IQ YLWPRLC LDQKTLYELHYQMITFGK+F
Sbjct: 978 AVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVF 1037
Query: 1180 CKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVN-PI 1238
C K PNC ACP++GEC+HFASA ASAR ALP ++E G G PL ++ P
Sbjct: 1038 CTKSKPNCNACPMKGECRHFASAFASARLALP--------STEKGMGTPDKNPLPLHLPE 1089
Query: 1239 PVIRIEADPV---SESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEEIPT 1295
P R + V SE ++ CEPIIEEP SP P+ +E ++I++ F D EEIPT
Sbjct: 1090 PFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSIADIEEAFFE--DPEEIPT 1147
Query: 1296 LRLQEREFKENFPNFMEMNKVMLQD---SSALVALTAEAASVPTRKLKRCAHLRTEHHVY 1352
+RL F N ME NK LQD SSALVALTAE AS+P KLK + LRTEH VY
Sbjct: 1148 IRLNMDAFTSNLKKIMEHNK-ELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVY 1206
Query: 1353 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRC-NSKDSKLCNSEICYS 1411
LPD H LL ++R+PDDP YLLAIW+PGE+ +S+ C + LC+ E C+S
Sbjct: 1207 ELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFS 1266
Query: 1412 CNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIAN 1471
CNS+ E + IVRGTILIPCRTA RG FPLNGTYFQVNEVFADH +S +PINVPR I
Sbjct: 1267 CNSIKETRSQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWE 1326
Query: 1472 LRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSK 1529
L R V FG+S T+F L +IQ FW+G+VCVRGFDR+ R P+PL+ RLH SK
Sbjct: 1327 LPRRTVYFGTSVPTIFKGLSTEKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASK 1384
Score = 114 bits (285), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 72/136 (52%), Positives = 93/136 (68%), Gaps = 7/136 (5%)
Query: 552 AVVPYQVGP-----SSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKD 606
A+VPY + GA+VP K+ + +P+V LD T R+W L++ + D
Sbjct: 494 ALVPYTMNSQIVLFGGGAGAIVP-VTPVKKPRPRPKVDLDDETDRVWKLLLENINSEGVD 552
Query: 607 QTSNEDMEKWWQKEREVFEGRIQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSD 666
S+E KWW++ER VF GR SF ARMHL+ GDRRF PWKGSVVDSVVGV+LTQNVSD
Sbjct: 553 -GSDEQKAKWWEEERNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSD 611
Query: 667 NLSSSAYMSLAARFPL 682
+LSSSA+MSLA++FP+
Sbjct: 612 HLSSSAFMSLASQFPV 627
>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2
Length = 1332
Score = 581 bits (1497), Expect = e-164, Method: Compositional matrix adjust.
Identities = 323/717 (45%), Positives = 428/717 (59%), Gaps = 39/717 (5%)
Query: 840 ITQSSVTQFWP-TGNSTADVASPSKT---------------CIKESSIAASTEIPQLENT 883
+TQ ++ P TG ST + +P++ C +ESS + + I + +
Sbjct: 613 VTQETILNLDPRTGVSTPRIRNPTRVIIEEIDDDENDIDAVCSQESSKTSDSSITSADQS 672
Query: 884 -ALLQDKVDGILFCDECLDGYTKPTRIDNGNQASTSGRNDLKSDFRSISSSDFNDPFEIS 942
+L D + +L ++ K G+ T NDL +SS+ + ++
Sbjct: 673 KTMLLDPFNTVLMNEQVDSQMVK----GKGHIPYTDDLNDLSQGISMVSSASTHCELNLN 728
Query: 943 VLPSNRE-SFRTGMPQAHDATTTSKKSPRGKGKSKEYKSDMKNDRTKKTTPKKNSDNTVQ 1001
+P E P++ T ++S R + K K + KK+ S
Sbjct: 729 EVPPEVELCSHQQDPESTIQTQDQQESTRTEDVKKNRKKPTTSKPKKKSKESAKSTQKKS 788
Query: 1002 QDWDLLRRIYSTG---EERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEF 1058
DWD LR+ +G ER+ MD+VDW+A+RC D +IA+ I +RG N++A RIK F
Sbjct: 789 VDWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAF 848
Query: 1059 LNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGR 1118
LNRLV+ HGSIDLEWLR+VPPDK KEYLL I GLGLKSVECVRLLSL IAFPVD NVGR
Sbjct: 849 LNRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGR 908
Query: 1119 IAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKI 1178
IAVRLGWVPLQPLP +L +HLL+ YPV++ +Q YLWPRLC LDQKTLYELHY MITFGK+
Sbjct: 909 IAVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKV 968
Query: 1179 FCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPI 1238
FC K PNC ACP++ EC+H++SA ASAR ALP P E + + P+VVN
Sbjct: 969 FCTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFR 1028
Query: 1239 PVIRIEADPVSESGYQINNCEPIIEEPRSPGPQCSESPESEIDDDDFSIGDIEE------ 1292
P + + + E+ + NCEPIIEEP SP P+ E + D ++G E+
Sbjct: 1029 PSLFLYQEKEQEA-QRSQNCEPIIEEPASPEPEYIEHDIEDYPRDKNNVGTSEDPWENKD 1087
Query: 1293 -IPTLRLQEREFKENFPNFMEMNKVMLQDSSALVALTAEAASVPTRKLKRCAHLRTEHHV 1351
IPT+ L + + + + +NK S LV L+ AA++P RKLK LRTEHHV
Sbjct: 1088 VIPTIILNK---EAGTSHDLVVNKEA-GTSHDLVVLSTYAAAIPRRKLKIKEKLRTEHHV 1143
Query: 1352 YVLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCN--SKDSKLCNSEIC 1409
+ LPD H +L GF+RR+ +D PYLLAIW+PGE+ NS+ P+ RC ++ LCN C
Sbjct: 1144 FELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNENKC 1203
Query: 1410 YSCNSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTI 1469
+ CN E + VRGTILIPCRTA RG FPLNGTYFQ NEVFADH++S +PI+VP I
Sbjct: 1204 FQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTELI 1263
Query: 1470 ANLRREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCS 1526
+L+R + GSS +++ L + I+ F G+VCVRGFDR R P+ LV RLHCS
Sbjct: 1264 WDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLHCS 1320
Score = 91.7 bits (226), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 80/171 (46%), Positives = 105/171 (61%), Gaps = 17/171 (9%)
Query: 514 QSTTADIIRK-FKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVPHQI 572
+S T D I K F++L I G LP N A++ Y+ E A+V +
Sbjct: 443 RSLTIDAITKLFEELDINKEGLCLP---------HNRETALILYKKS-YEEQKAIVKYS- 491
Query: 573 KEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGRIQSFT 632
KK KP+V LDP T R+W L+M+ D D S+E+ KWW++ER +F GR SF
Sbjct: 492 ----KKQKPKVQLDPETSRVWKLLMSSIDCDGVD-GSDEEKRKWWEEERNMFHGRANSFI 546
Query: 633 ARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLK 683
ARM ++ G+R F PWKGSVVDSVVGV+LTQNV+D+ SSSAYM LAA FP++
Sbjct: 547 ARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVADHSSSSAYMDLAAEFPVE 597
>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2
Length = 1044
Score = 510 bits (1314), Expect = e-143, Method: Compositional matrix adjust.
Identities = 268/538 (49%), Positives = 342/538 (63%), Gaps = 14/538 (2%)
Query: 1003 DWDLLRRIYSTGEERSHDKMDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRL 1062
DW+ LRR+Y+ R MDSV+W VR + ++ + IK+RGQ I++ RI +FLN
Sbjct: 501 DWNNLRRMYTKEGSRPEMHMDSVNWSDVRLSGQNVLETTIKKRGQFRILSERILKFLNDE 560
Query: 1063 VELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVR 1122
V +G+IDLEWLRN P VK YLL+IEG+GLKS ECVRLL L+H AFPVD NVGRIAVR
Sbjct: 561 VNQNGNIDLEWLRNAPSHLVKRYLLEIEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVR 620
Query: 1123 LGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKK 1182
LG VPL+PLP + +H L EYP MD IQ YLWPRLC L Q+TLYELHYQMITFGK+FC K
Sbjct: 621 LGLVPLEPLPNGVQMHQLFEYPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTK 680
Query: 1183 RSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEFGNGIGQSPPLVVNPIPVIR 1242
PNC ACP++ ECK+FASA S++ L P EK + F N Q + V+ I
Sbjct: 681 TIPNCNACPMKSECKYFASAYVSSKVLLESPEEKMHEPNTFMNAHSQD--VAVDMTSNIN 738
Query: 1243 IEADPVSESGYQINNC-EPIIEEPRSPGPQCSESPESEIDDDDFS--IGDIEEIPTLRLQ 1299
+ + VS C +P++E P SP + ES ++I+D F +P +
Sbjct: 739 LVEECVSSGCSDQAICYKPLVEFPSSPRAEIPES--TDIEDVPFMNLYQSYASVPKIDFD 796
Query: 1300 EREFKENFPNFMEMNKVMLQD----SSALVALTAEAASVPT---RKLKRCAHLRTEHHVY 1352
K++ + + ++ M S ALV T E A +P RK+K LRTEH VY
Sbjct: 797 LDALKKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVY 856
Query: 1353 VLPDAHELLRGFDRRDPDDPSPYLLAIWSPGESPNSVGSPQYRCNSKDSKLCNSEICYSC 1412
VLPD HELL F+RR DDPSPYLLAIW PGE+ +S P+ +C+S SKLC + C C
Sbjct: 857 VLPDNHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSSDGSKLCKIKNCSYC 916
Query: 1413 NSVCERNAGIVRGTILIPCRTANRGRFPLNGTYFQVNEVFADHETSHHPINVPRHTIANL 1472
++ E+N+ I RGTILIPCRTA RG FPLNGTYFQ NEVFADHETS +PI R L
Sbjct: 917 WTIREQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGL 976
Query: 1473 RREIVCFGSSATTLFNALDMRQIQEYFWRGFVCVRGFDRRYRCPRPLVNRLHCSPSKK 1530
+ + GS+ T++F LD R+I+ FW GF+C+R FDR+ R P+ LV RLH P ++
Sbjct: 977 EKRALYCGSTVTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTPPDER 1034
Score = 94.4 bits (233), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 82/115 (71%), Gaps = 2/115 (1%)
Query: 568 VPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTTKDQTSNEDMEKWWQKEREVFEGR 627
V IK +K + +V LDP T++ W+++M ++D ++ E E W+KERE+F+ R
Sbjct: 334 VTTMIKADKKLVTAKVNLDPETIKEWDVLM-VNDSPSRSYDDKE-TEAKWKKEREIFQTR 391
Query: 628 IQSFTARMHLILGDRRFKPWKGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPL 682
I F RMH + G+R+FK WKGSVVDSVVGV+LTQN +D LSS+A+MS+AA+FP+
Sbjct: 392 IDLFINRMHRLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPV 446
>sp|Q9WYK0|END3_THEMA Endonuclease III OS=Thermotoga maritima (strain ATCC 43589 / MSB8 /
DSM 3109 / JCM 10099) GN=nth PE=3 SV=1
Length = 213
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 73/167 (43%), Gaps = 27/167 (16%)
Query: 1033 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1092
A ++ D IKE G A RI E LVE +G PD ++E LL + G+
Sbjct: 60 AKPEDLYDLIKESGMYRQKAERIVEISRILVEKYGG--------RVPDSLEE-LLKLPGV 110
Query: 1093 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1152
G K+ V + + A VD +V RI+ RLGWV + P + L K P
Sbjct: 111 GRKTANIVLWVGFKKPALAVDTHVHRISNRLGWVKTRT-PEETEEALKKLLP------ED 163
Query: 1153 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 1199
LW ++ M+ FG+ CK ++P C C L+ C+ +
Sbjct: 164 LWG-----------PINGSMVEFGRRICKPQNPLCEECFLKNHCEFY 199
>sp|P39788|END3_BACSU Probable endonuclease III OS=Bacillus subtilis (strain 168) GN=nth
PE=3 SV=1
Length = 219
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 26/161 (16%)
Query: 1037 EIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKS 1096
E+ IK G A I++ ++E +G VP D+ + L+ + G+G K+
Sbjct: 72 ELQQDIKSIGLYRNKAKNIQKLSKMIIEDYGG-------EVPRDR--DELVKLPGVGRKT 122
Query: 1097 VECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPR 1156
V ++ A VD +V R++ RLG + ++ L+++ P D W
Sbjct: 123 ANVVVSVAFGVPAIAVDTHVERVSKRLGICRWKDSVLEVEKTLMRKVPKED------WS- 175
Query: 1157 LCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECK 1197
H+++I FG+ CK +SP C CPL C+
Sbjct: 176 ----------VTHHRLIFFGRYHCKAQSPRCAECPLLSLCR 206
>sp|O05956|END3_RICPR Endonuclease III OS=Rickettsia prowazekii (strain Madrid E) GN=nth
PE=3 SV=1
Length = 212
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)
Query: 1079 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1138
P+ KE L+ + G+G K+ V VD +V R++ R+G G+
Sbjct: 105 PNNFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTAAI 158
Query: 1139 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1198
+ KE L +D+K L H+ +I G+ CK R P C CP++ C++
Sbjct: 159 VEKEL-------------LQIIDEKWLTYAHHWLILHGRYICKARKPGCNICPIKEYCEY 205
Query: 1199 FASAVAS 1205
+ + +S
Sbjct: 206 YINTFSS 212
>sp|Q58829|Y1434_METJA Putative endonuclease MJ1434 OS=Methanocaldococcus jannaschii (strain
ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440)
GN=MJ1434 PE=3 SV=1
Length = 220
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 42/165 (25%), Positives = 74/165 (44%), Gaps = 20/165 (12%)
Query: 1034 DESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKV--KEYLLDIEG 1091
DE ++ + I+ G N+ A R+K +VE +G + E + D + + LL I G
Sbjct: 71 DEDKLKELIRPAGFYNLKAKRLKNVTKFIVENYG--NTEEMAKTDKDTLILRAELLSING 128
Query: 1092 LGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQM 1151
+G ++ + + L +L +F VD R+ RLG ++ E D+I+
Sbjct: 129 VGKETADSILLYALDRESFVVDAYTKRMFSRLG--------------VINEKAKYDEIKE 174
Query: 1152 YLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGEC 1196
L D + E H ++ K FC+K++ C CP++ C
Sbjct: 175 IFEKNLPK-DLEIYKEYHALIVEHCKKFCRKKAL-CDNCPIKEFC 217
>sp|Q68W04|END3_RICTY Endonuclease III OS=Rickettsia typhi (strain ATCC VR-144 /
Wilmington) GN=nth PE=3 SV=1
Length = 212
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 19/127 (14%)
Query: 1079 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1138
P+ KE L+ + G+G K+ V VD +V R++ R+G G+ +
Sbjct: 105 PNDFKE-LVKLPGVGRKTANVVLNCLFAMPTMAVDTHVFRVSKRIGLAK-----GNTTVI 158
Query: 1139 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1198
+ KE L +D+K L H+ ++ G+ CK R P+C C ++ C++
Sbjct: 159 VEKEL-------------LQIIDEKWLTHAHHWLVLHGRYICKARKPSCRICHIKEYCEY 205
Query: 1199 FASAVAS 1205
+ + AS
Sbjct: 206 YINTFAS 212
>sp|Q4UK93|END3_RICFE Endonuclease III OS=Rickettsia felis (strain ATCC VR-1525 /
URRWXCal2) GN=nth PE=3 SV=1
Length = 213
Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 56/127 (44%), Gaps = 19/127 (14%)
Query: 1079 PDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIH 1138
P+ KE L+ + G+G K+ V VD +V R+A R+G P +
Sbjct: 105 PNDFKE-LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVAKRIGLARGNS-PEIVEKE 162
Query: 1139 LLKEYPVMDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKH 1198
LL+ +++K L H+ +I G+ CK R P+C CP++ C++
Sbjct: 163 LLQ-----------------IINEKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCEY 205
Query: 1199 FASAVAS 1205
+ S + S
Sbjct: 206 YNSPIIS 212
>sp|Q92GH4|END3_RICCN Endonuclease III OS=Rickettsia conorii (strain ATCC VR-613 / Malish
7) GN=nth PE=3 SV=1
Length = 210
Score = 42.7 bits (99), Expect = 0.023, Method: Composition-based stats.
Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 18/114 (15%)
Query: 1086 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1145
L+ + G+G K+ V VD +V R+A R+G GD P
Sbjct: 111 LIKLPGVGRKTANVVLNCLFGMPTMAVDTHVFRVANRIGLAK-----GDT--------PE 157
Query: 1146 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHF 1199
+ + ++ L +D K L H+ +I G+ CK R P+C CP++ C ++
Sbjct: 158 IVENEL-----LQIIDTKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCDYY 206
>sp|Q9V3I8|OGG1_DROME N-glycosylase/DNA lyase OS=Drosophila melanogaster GN=Ogg1 PE=2 SV=2
Length = 343
Score = 41.6 bits (96), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 34/48 (70%), Gaps = 1/48 (2%)
Query: 1074 LRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRIA 1120
L+++P +K +E L + G+G K +C+ L+S+ H+ + PVD+++ RIA
Sbjct: 229 LKSMPFEKAREELTLLPGIGYKVADCICLMSMGHLESVPVDIHIYRIA 276
>sp|A9M2W6|PYRC_NEIM0 Dihydroorotase OS=Neisseria meningitidis serogroup C (strain
053442) GN=pyrC PE=3 SV=1
Length = 344
Score = 40.0 bits (92), Expect = 0.15, Method: Composition-based stats.
Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)
Query: 648 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 707
+GS + ++ +YLT N + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDNATSELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 708 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 766
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|Q9K0D1|PYRC_NEIMB Dihydroorotase OS=Neisseria meningitidis serogroup B (strain MC58)
GN=pyrC PE=3 SV=1
Length = 344
Score = 39.3 bits (90), Expect = 0.23, Method: Composition-based stats.
Identities = 30/119 (25%), Positives = 53/119 (44%), Gaps = 7/119 (5%)
Query: 648 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 707
+GS + ++ +YLT N + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDNATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 708 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 766
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|O08760|OGG1_MOUSE N-glycosylase/DNA lyase OS=Mus musculus GN=Ogg1 PE=2 SV=2
Length = 345
Score = 39.3 bits (90), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 1062 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQH-IAFPVDVNVGRIA 1120
L E G L+ LR P ++ + L + G+G K +C+ L++L A PVDV+V +IA
Sbjct: 216 LEEQGGPAWLQQLRVAPYEEAHKALCTLPGVGAKVADCICLMALDKPQAVPVDVHVWQIA 275
Query: 1121 VR-LGWVP 1127
R GW P
Sbjct: 276 HRDYGWHP 283
>sp|P29588|GTMR_METTF G/T mismatches repair enzyme OS=Methanobacterium thermoformicicum
PE=1 SV=1
Length = 221
Score = 38.9 bits (89), Expect = 0.31, Method: Composition-based stats.
Identities = 45/197 (22%), Positives = 78/197 (39%), Gaps = 27/197 (13%)
Query: 1006 LLRRIYSTGEERSHDK--MDSVDWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLV 1063
LLRR + ++ +DK + +E + +SEIA IKE G N A ++KE ++
Sbjct: 44 LLRRTTAGHVKKIYDKFFVKYKCFEDILKTPKSEIAKDIKEIGLSNQRAEQLKELARVVI 103
Query: 1064 ELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRL 1123
+G VP + ++ +LD+ G+G + V L+ A VD N R+ R
Sbjct: 104 NDYGG-------RVP--RNRKAILDLPGVGKYTCAAVMCLAFGKKAAMVDANFVRVINR- 153
Query: 1124 GWVPLQPLPGDLHIHLLKEYPVMDKIQMYLWPRL-CYLDQKTLYELHYQMITFGKIFCKK 1182
+ Y ++ LW + + + ++ F I C
Sbjct: 154 --------------YFGGSYENLNYNHKALWELAETLVPGGKCRDFNLGLMDFSAIICAP 199
Query: 1183 RSPNCGACPLRGECKHF 1199
R P C C + C ++
Sbjct: 200 RKPKCEKCGMSKLCSYY 216
>sp|Q08214|NTG2_YEAST DNA base excision repair N-glycosylase 2 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=NTG2 PE=1 SV=1
Length = 380
Score = 37.4 bits (85), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 53/119 (44%), Gaps = 30/119 (25%)
Query: 1116 VGRIAVRLGWVPLQPLPGDLHIH-LLKEYPVMDKI----------QMYLWPRLCYLDQKT 1164
+G + ++ GW + + D+H+H L K + +D I ++ +W L
Sbjct: 249 MGYLTLQKGWGLIAGICVDVHVHRLCKMWNWVDPIKCKTAEHTRKELQVW-----LPHSL 303
Query: 1165 LYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALPGPSEKGIVTSEF 1223
YE++ ++ FG++ C R C C A+ V +AR +EK I +S+F
Sbjct: 304 WYEINTVLVGFGQLICMARGKRCDLC--------LANDVCNAR------NEKLIESSKF 348
>sp|B4RJT2|PYRC_NEIG2 Dihydroorotase OS=Neisseria gonorrhoeae (strain NCCP11945) GN=pyrC
PE=3 SV=1
Length = 344
Score = 37.0 bits (84), Expect = 1.2, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 648 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 707
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 708 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 766
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|Q5F9Y1|PYRC_NEIG1 Dihydroorotase OS=Neisseria gonorrhoeae (strain ATCC 700825 / FA
1090) GN=pyrC PE=3 SV=1
Length = 344
Score = 37.0 bits (84), Expect = 1.2, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 648 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 707
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 708 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 766
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|Q9JVD6|PYRC_NEIMA Dihydroorotase OS=Neisseria meningitidis serogroup A / serotype 4A
(strain Z2491) GN=pyrC PE=3 SV=1
Length = 344
Score = 36.6 bits (83), Expect = 1.6, Method: Composition-based stats.
Identities = 29/119 (24%), Positives = 52/119 (43%), Gaps = 7/119 (5%)
Query: 648 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 707
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPALVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 708 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 766
G ++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQGILFLVHGE---VTDPEID----IFDREAAFIGRVMKPVLAQVPNLKVVFEHITT 176
>sp|P53397|OGG1_YEAST N-glycosylase/DNA lyase OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=OGG1 PE=1 SV=1
Length = 376
Score = 36.6 bits (83), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 1056 KEFLNRLVELHGSIDLEWLRNVPPDK----VKEYLLDIEGLGLKSVECVRLLSLQHIAF- 1110
++ +N E + + D +L+++ D V+E+L+ G+G K +CV L+ L
Sbjct: 198 RKLVNDKAEANITSDTTYLQSICKDAQYEDVREHLMSYNGVGPKVADCVCLMGLHMDGIV 257
Query: 1111 PVDVNVGRIAVR 1122
PVDV+V RIA R
Sbjct: 258 PVDVHVSRIAKR 269
>sp|P46303|UVEN_MICLC Ultraviolet N-glycosylase/AP lyase OS=Micrococcus luteus (strain ATCC
4698 / DSM 20030 / JCM 1464 / NBRC 3333 / NCIMB 9278 /
NCTC 2665 / VKM Ac-2230) GN=pdg PE=1 SV=2
Length = 279
Score = 36.6 bits (83), Expect = 1.8, Method: Composition-based stats.
Identities = 43/177 (24%), Positives = 65/177 (36%), Gaps = 26/177 (14%)
Query: 1026 DWEAVRCADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEY 1085
D A+ A E E+ + ++ G A I LV H VP E
Sbjct: 75 DAHAMAAATEPELQELVRSTGFYRNKASAILRLSQELVGRHDG-------EVPARL--ED 125
Query: 1086 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1145
L+ + G+G K+ V + VD + GR+A RLG+ + PG + P
Sbjct: 126 LVALPGVGRKTAFVVLGNAFGQPGITVDTHFGRLARRLGFTD-ETDPGKGRARRGRPVPP 184
Query: 1146 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASA 1202
M L +++I G+ C R P CG CP+ C +A+
Sbjct: 185 ARDWTM----------------LSHRLIFHGRRVCHARRPACGRCPIARWCPSYAAG 225
>sp|Q9CB92|END3_MYCLE Endonuclease III OS=Mycobacterium leprae (strain TN) GN=nth PE=3 SV=2
Length = 245
Score = 35.8 bits (81), Expect = 2.6, Method: Composition-based stats.
Identities = 33/126 (26%), Positives = 49/126 (38%), Gaps = 18/126 (14%)
Query: 1086 LLDIEGLGLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPV 1145
L + G+G K+ + + VD + GR+ R W +E PV
Sbjct: 122 LFTLPGVGRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTA-------------EEDPV 168
Query: 1146 MDKIQMYLWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVAS 1205
K++ + L DQ TL L +++I G+ C R P CG C L +C F
Sbjct: 169 --KVEHAV-GELIERDQWTL--LSHRVIFHGRRVCHARKPACGVCVLAKDCPSFGLGPTE 223
Query: 1206 ARFALP 1211
A P
Sbjct: 224 PLLAAP 229
>sp|Q9FNY7|OGG1_ARATH N-glycosylase/DNA lyase OGG1 OS=Arabidopsis thaliana GN=OGG1 PE=1
SV=1
Length = 365
Score = 35.8 bits (81), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 1072 EWL---RNVPPDKVKEYLLDIEGLGLKSVECVRLLSL-QHIAFPVDVNVGRIA 1120
EWL R V + L + G+G K C+ L SL QH A PVD +V +IA
Sbjct: 235 EWLLSLRKVELQEAVAALCTLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIA 287
>sp|O70249|OGG1_RAT N-glycosylase/DNA lyase OS=Rattus norvegicus GN=Ogg1 PE=2 SV=1
Length = 345
Score = 35.8 bits (81), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 1062 LVELHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQH-IAFPVDVNVGRIA 1120
L E G L+ LR ++ + L + G+G K +C+ L++L A PVD++V +IA
Sbjct: 216 LEEQGGPAWLQQLRVASYEEAHKALCTLPGVGTKVADCICLMALDKPQAVPVDIHVWQIA 275
Query: 1121 VR-LGWVP 1127
R GW P
Sbjct: 276 HRDYGWQP 283
>sp|O27397|OGG1_METTH Probable N-glycosylase/DNA lyase OS=Methanothermobacter
thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM
10044 / NBRC 100330 / Delta H) GN=MTH_1342 PE=3 SV=1
Length = 312
Score = 35.8 bits (81), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 1065 LHGSIDLEWLRNVPPDKVKEYLLDIEGLGLKSVECVRLLSLQHI-AFPVDVNVGRI 1119
L +D+ + + D +E LL++ G+G K +C+ L + AFPVDV + RI
Sbjct: 205 LAEEMDIRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPVDVWIRRI 260
>sp|P34648|YOT2_CAEEL Uncharacterized protein ZK632.2 OS=Caenorhabditis elegans
GN=ZK632.2 PE=4 SV=1
Length = 710
Score = 34.7 bits (78), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 42/199 (21%), Positives = 81/199 (40%), Gaps = 29/199 (14%)
Query: 440 LDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLWFTAKSRGSQKIIQVERNGLPTQKMPPR 499
LD +Q+ + +++ W K + T +S+ +K++++ + P
Sbjct: 454 LDDYIRQLEKSGGAGDDAKTKMEKSKWRQKLMAATHESQKLEKLVKIAK---------PA 504
Query: 500 YSRGLQVDMEQQLMQSTTAD---IIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPY 556
+GL+ QL + D ++K + R Q P Q GP +A +P
Sbjct: 505 VVKGLE-----QLETTAANDRQAFLKKLMGVRARKEIDQTPSQ-----GPGPSTSATLPA 554
Query: 557 QVGPSSEHGALVPHQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDD-GTTKDQTSNEDMEK 615
V P+S V H+ K K++ E+ + + N + +D+ + KD+ S E +K
Sbjct: 555 TVAPTSTKAVEVEHEKKMTPLKVEKEIAASLDSSEIKNSLPAVDEPSSVKDEVSEETPQK 614
Query: 616 WWQKEREVFEGRIQSFTAR 634
E F ++Q A+
Sbjct: 615 ------EAFGSKVQKRVAQ 627
>sp|A0R6D9|MAK_MYCS2 Maltokinase OS=Mycobacterium smegmatis (strain ATCC 700084 /
mc(2)155) GN=mak PE=3 SV=1
Length = 441
Score = 34.7 bits (78), Expect = 6.7, Method: Composition-based stats.
Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 10/95 (10%)
Query: 510 QQLMQSTTADIIRKFKDLTIRDGGSQLPLQCNKADGPSNVHNAVVPYQVGPSSEHGALVP 569
++L+ +TTA +R +RDG + LQ N ADGP + +V GP E+ +
Sbjct: 20 RELVSATTAMAVR------LRDGLELVLLQANYADGPDERYQVIVATGSGPIDEYSVVAT 73
Query: 570 HQIKEKRKKLKPEVVLDPGTLRMWNLIMNIDDGTT 604
I + + + + DP R L+ ID+ T
Sbjct: 74 IGIADGQTAY--DALYDPDATRY--LLSLIDESAT 104
>sp|P63540|END3_MYCTU Endonuclease III OS=Mycobacterium tuberculosis GN=nth PE=3 SV=1
Length = 245
Score = 34.7 bits (78), Expect = 6.9, Method: Composition-based stats.
Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)
Query: 1033 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1092
AD +E+ I+ G A + LVE G VP K L+ + G+
Sbjct: 78 ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128
Query: 1093 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1152
G K+ + + VD + GR+ R W E PV K++
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173
Query: 1153 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 1211
+ +++K L +++I G+ C R P CG C L +C F A P
Sbjct: 174 VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229
>sp|P63541|END3_MYCBO Endonuclease III OS=Mycobacterium bovis (strain ATCC BAA-935 /
AF2122/97) GN=nth PE=3 SV=1
Length = 245
Score = 34.7 bits (78), Expect = 6.9, Method: Composition-based stats.
Identities = 42/179 (23%), Positives = 65/179 (36%), Gaps = 27/179 (15%)
Query: 1033 ADESEIADAIKERGQQNIIAGRIKEFLNRLVELHGSIDLEWLRNVPPDKVKEYLLDIEGL 1092
AD +E+ I+ G A + LVE G VP K L+ + G+
Sbjct: 78 ADRTELESLIRPTGFYRNKAASLIGLGQALVERFGG-------EVPATMDK--LVTLPGV 128
Query: 1093 GLKSVECVRLLSLQHIAFPVDVNVGRIAVRLGWVPLQPLPGDLHIHLLKEYPVMDKIQMY 1152
G K+ + + VD + GR+ R W E PV K++
Sbjct: 129 GRKTANVILGNAFGIPGITVDTHFGRLVRRWRWTT-------------AEDPV--KVEQA 173
Query: 1153 LWPRLCYLDQKTLYELHYQMITFGKIFCKKRSPNCGACPLRGECKHFASAVASARFALP 1211
+ +++K L +++I G+ C R P CG C L +C F A P
Sbjct: 174 VGE---LIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAP 229
>sp|P31378|NTG1_YEAST Mitochondrial DNA base excision repair N-glycosylase 1
OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=NTG1 PE=1 SV=1
Length = 399
Score = 34.3 bits (77), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 50/106 (47%), Gaps = 5/106 (4%)
Query: 439 RLDQLRKQINGPSSQLQEQNRVIQRKSWNPKGLW--FTAKSRGSQKIIQVERN-GLPTQK 495
RL +L K ++ + +Q R Q ++W PKGLW G +II RN G Q
Sbjct: 267 RLTKLWKWVDAQKCKTPDQTRT-QLQNWLPKGLWTEINGLLVGFGQIITKSRNLGDMLQF 325
Query: 496 MPPRYSR-GLQVDMEQQLMQSTTADIIRKFKDLTIRDGGSQLPLQC 540
+PP R L D++ QL + +I+ K + +G +L ++
Sbjct: 326 LPPDDPRSSLDWDLQSQLYKEIQQNIMSYPKWVKYLEGKRELNVEA 371
>sp|A1KSU5|PYRC_NEIMF Dihydroorotase OS=Neisseria meningitidis serogroup C / serotype 2a
(strain ATCC 700532 / FAM18) GN=pyrC PE=3 SV=1
Length = 344
Score = 33.9 bits (76), Expect = 9.5, Method: Composition-based stats.
Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 7/119 (5%)
Query: 648 KGSVVDSVVGVYLTQNVSDNLSSSAYMSLAARFPLKSTNDHTPSDENLRTTASLEPIGSN 707
+GS + ++ +YLT + L A + F L T SD + L P+
Sbjct: 65 EGSAFEPLMTLYLTDQATPELVREAKAAGIVAFKLYPAGATTNSDSGVTDLFKLIPVLEE 124
Query: 708 STSNGAVYDSEGNMYFVTEPEPDRCCELKDRDDAFDSRIQRKALQENGDIKVMTDAVPS 766
++ G VT+PE D + DR+ AF R+ + L + ++KV+ + + +
Sbjct: 125 MAKQDILFLVHGE---VTDPEID----IFDREAAFIERVMKPVLAQVPNLKVVFEHITT 176
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.132 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 590,461,196
Number of Sequences: 539616
Number of extensions: 26269978
Number of successful extensions: 76342
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 114
Number of HSP's that attempted gapping in prelim test: 75892
Number of HSP's gapped (non-prelim): 431
length of query: 1541
length of database: 191,569,459
effective HSP length: 131
effective length of query: 1410
effective length of database: 120,879,763
effective search space: 170440465830
effective search space used: 170440465830
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 68 (30.8 bits)