Your job contains 1 sequence.
>002141
MKLEERQIENLLLHSFDCFSSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAEYVLDPED
TIEWKEKMSHQPVCDQGSMTLHGSESSEEREVVSSNNSLESSTSVVSSINESKCKLMNSS
EIYPETYNDVLSSPNSLDSSFAPFADGTISSSNSNSDAGDSSNVPTLNSFNGSNSFVELL
QMVGSTMLHGNYNHRNGHMSSDENSKDEHSQFQTLESNTQRVKVKDIDDPKVLSRVSSIP
PSSFHPCLTQDLSVEVESYEMRREETRSSGISDVTDKIALMPEFASQTTDATKLIVAGPE
APRHGNKQSRNSMQANKNSIAQHESELFGDSRFAMEPPAHAQKNDLNLPKISSGSIDAIE
SHNALYNRENTQLKSSVSDQNKYDHSFSKELNGIDDATSKSKSTRVSKEKQNDFDWDSLR
RQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRD
HGSVDLEWLRDVPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGW
VPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKP
NCNACPMRGECRHFASAFASSRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHA
TDLPVGKLEIAVNNCEPIIEEPATPEPERVQVSENDIEDTFCEDPEEIPTIKLNMKEFTQ
TLQNYMQENLELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRG
MEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRESEFQI
VRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTS
IPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASKLNKVPGKADADHK
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 002141
(960 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                       Smallest
                                                                         Sum
                                                               High  Probability
Sequences producing High-scoring Segment Pairs:               Score  P(N)       N
TAIR|locus:2184432 - symbol:DME "DEMETER" species:3702 "A...   1906  1.9e-203   2
TAIR|locus:2044923 - symbol:DML1 "demeter-like 1" species...   1897  3.2e-199   2
TAIR|locus:2100138 - symbol:DML2 "demeter-like 2" species...   1502  6.0e-159   2
TAIR|locus:2124301 - symbol:DML3 "demeter-like protein 3"...   1168  1.7e-122   2
TAIR|locus:4515103342 - symbol:AT4G04957 "AT4G04957" spec...    170  1.4e-11    1
TAIR|locus:2100382 - symbol:AT3G47830 species:3702 "Arabi...    185  1.4e-11    1
TIGR_CMR|CHY_1121 - symbol:CHY_1121 "endonuclease III" sp...     85  7.1e-05    2
ASPGD|ASPL0000003678 - symbol:AN10840 species:162425 "Eme...    102  0.00089    2
>TAIR|locus:2184432 [details] [associations]
symbol:DME "DEMETER" species:3702 "Arabidopsis thaliana"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0004519
"endonuclease activity" evidence=IEA] [GO:0005634 "nucleus"
evidence=ISM;ISS] [GO:0006281 "DNA repair" evidence=IEA]
[GO:0006284 "base-excision repair" evidence=IEA] [GO:0051539 "4
iron, 4 sulfur cluster binding" evidence=IEA] [GO:0009793 "embryo
development ending in seed dormancy" evidence=IMP] [GO:0019104 "DNA
N-glycosylase activity" evidence=ISS;IDA] [GO:0043078 "polar
nucleus" evidence=IDA] [GO:0006349 "regulation of gene expression
by genetic imprinting" evidence=IMP] [GO:0003906 "DNA-(apurinic or
apyrimidinic site) lyase activity" evidence=IDA] [GO:0006306 "DNA
methylation" evidence=RCA;IDA] InterPro:IPR003265
InterPro:IPR003651 InterPro:IPR011257 Pfam:PF00730 PROSITE:PS00764
SMART:SM00525 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006355
GO:GO:0046872 GO:GO:0006284 Gene3D:1.10.340.30 SUPFAM:SSF48150
Gene3D:1.10.1670.10 InterPro:IPR023170 GO:GO:0003677 GO:GO:0006351
GO:GO:0051539 GO:GO:0009793 GO:GO:0043078 GO:GO:0006306
GO:GO:0004519 GO:GO:0003906 GO:GO:0006349 EMBL:AF521596
EMBL:DQ335243 EMBL:AL162875 EMBL:AK117994 EMBL:BT005357
IPI:IPI00516439 IPI:IPI00534354 IPI:IPI00760328 PIR:T48452
PIR:T48453 PIR:T48454 RefSeq:NP_001078527.1 RefSeq:NP_196076.2
UniGene:At.33104 ProteinModelPortal:Q8LK56 SMR:Q8LK56 STRING:Q8LK56
PaxDb:Q8LK56 PRIDE:Q8LK56 EnsemblPlants:AT5G04560.2 GeneID:830335
KEGG:ath:AT5G04560 TAIR:At5g04560 eggNOG:COG0177
HOGENOM:HOG000112227 InParanoid:Q8LK56 OMA:CTEITES PhylomeDB:Q8LK56
ProtClustDB:CLSN2690787 Genevestigator:Q8LK56 GO:GO:0019104
Uniprot:Q8LK56
Length = 1987
Score = 1906 (676.0 bits), Expect = 1.9e-203, Sum P(2) = 1.9e-203
Identities = 378/652 (57%), Positives = 461/652 (70%)
Query: 304 HGNKQSRNSM-QANKNSIAQHES-ELFGDSRFAMEPPAHAQKN--DLNLPKISSGS--ID 357
H + N + NK S Q +L S + + ++N D LP+ + +D
Sbjct: 1339 HQDDTQHNQQDEMNKASHLQKTFLDLLNSSEECLTRQSSTKQNITDGCLPRDRTAEDVVD 1398
Query: 358 AIESHNALYNRENTQLKSSVSDQNKYDHSFSKELNGIDDATSKSKSTRVSKEKQNDFDWD 417
+ ++++L N + SS +Q ++ KE N + K T +K WD
Sbjct: 1399 PLSNNSSLQNIL-VESNSSNKEQTAVEY---KETNAT--ILREMKGTLADGKKPTS-QWD 1451
Query: 418 SLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRL 477
SLR+ VE N G++ER ++ DS+D+EA+R A +++I+ IKERGMNNMLA RIKDFL R+
Sbjct: 1452 SLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLERI 1511
Query: 478 VRDHGSVDLEWLRDVPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVR 537
V+DHG +DLEWLR+ PPDKAK+YLLS RGLGLKSVECVRLLTLH+LAFPVDTNVGRIAVR
Sbjct: 1512 VKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIAVR 1571
Query: 538 LGWVXXXXXXXXXXXXXXXXXXVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTK 597
+GWV VLESIQK+LWPRLCKLDQRTLYELHYQ+ITFGKVFCTK
Sbjct: 1572 MGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFCTK 1631
Query: 598 SKPNCNACPMRGECRHXXXXXXXXRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPL 657
S+PNCNACPMRGECRH RLALP PEE+++ SA P + I + LPL
Sbjct: 1632 SRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIELPL 1691
Query: 658 THATDLPVGKLEIAVNNCXXXXXXXXXXXXXRVQVSENDIEDTFC-EDPEEIPTIKLNMK 716
L G NC +++E+DIED + EDP+EIPTIKLN++
Sbjct: 1692 PLEKSLASGAPSNR-ENCEPIIEEPASPGQECTEITESDIEDAYYNEDPDEIPTIKLNIE 1750
Query: 717 EFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHP 776
+F TL+ +M+ N+ELQEGDMSKALVAL SIP PKLKN+SRLRTEHQVYELPDSH
Sbjct: 1751 QFGMTLREHMERNMELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPDSHR 1810
Query: 777 LLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRES 836
LL GM+KREPDDP YLLAIWTPGETANS QPPE +C + GKMC ++TC CNS+RE+
Sbjct: 1811 LLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSLREA 1870
Query: 837 EFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVY 896
Q VRGT+LIPCRTAMRGSFPLNGTYFQVNE+FADH+SSLKPI+VPR+W+W+LPRRTVY
Sbjct: 1871 NSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWDLPRRTVY 1930
Query: 897 FGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASKL 948
FGTS+ SIF+GL+TE IQ CFW+G+VCVRGF+QK+RAPRPLMARLHFPASKL
Sbjct: 1931 FGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASKL 1982
Score = 85 (35.0 bits), Expect = 1.9e-203, Sum P(2) = 1.9e-203
Identities = 20/52 (38%), Positives = 28/52 (53%)
Query: 20 SSAFMSLAANFPLNSKQKPCHGEEITSVI-EEPAEYVLDPEDTIEWKEKMSH 70
SSAFMSLAA FP + SV+ E+P +L+ + W+EK+ H
Sbjct: 1043 SSAFMSLAARFPPKLSSSREDERNVRSVVVEDPEGCILNLNEIPSWQEKVQH 1094
Score = 46 (21.3 bits), Expect = 6.2e-202, Sum P(3) = 6.2e-202
Identities = 11/29 (37%), Positives = 14/29 (48%)
Query: 27 AANFPLNSKQKPCHGEEITSVIEEPAEYV 55
A N PL + P E + SV E +YV
Sbjct: 195 ACNKPLYNLNSPIRREAVGSVCESSFQYV 223
Score = 41 (19.5 bits), Expect = 6.2e-202, Sum P(3) = 6.2e-202
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 168 NSFNGSNSFVELLQMVGST 186
NS +G+NSF E+ +G T
Sbjct: 411 NS-SGANSFSEIRDAIGGT 428
>TAIR|locus:2044923 [details] [associations]
symbol:DML1 "demeter-like 1" species:3702 "Arabidopsis
thaliana" [GO:0005634 "nucleus" evidence=ISM;IDA] [GO:0006281 "DNA
repair" evidence=IEA;IMP] [GO:0006284 "base-excision repair"
evidence=IEA] [GO:0003906 "DNA-(apurinic or apyrimidinic site)
lyase activity" evidence=ISS;IDA] [GO:0006342 "chromatin silencing"
evidence=IMP] [GO:0019104 "DNA N-glycosylase activity"
evidence=ISS;IDA;TAS] [GO:0005515 "protein binding" evidence=IPI]
[GO:0006306 "DNA methylation" evidence=RCA;IDA] [GO:0031936
"negative regulation of chromatin silencing" evidence=IMP]
[GO:0080111 "DNA demethylation" evidence=IMP] InterPro:IPR003265
InterPro:IPR003651 InterPro:IPR011257 Pfam:PF00730 PROSITE:PS00764
SMART:SM00478 SMART:SM00525 GO:GO:0005634 EMBL:CP002685
GenomeReviews:CT485783_GR GO:GO:0046872 GO:GO:0006284
Gene3D:1.10.340.30 SUPFAM:SSF48150 Gene3D:1.10.1670.10
InterPro:IPR023170 GO:GO:0003677 GO:GO:0006281 GO:GO:0006351
GO:GO:0051539 EMBL:AC006919 GO:GO:0080111 GO:GO:0031936
GO:GO:0006342 GO:GO:0006306 GO:GO:0004519 GO:GO:0003906
eggNOG:COG0177 GO:GO:0019104 EMBL:AY286009 IPI:IPI00517101
PIR:D84781 RefSeq:NP_181190.3 UniGene:At.14091
ProteinModelPortal:Q9SJQ6 SMR:Q9SJQ6 STRING:Q9SJQ6 PaxDb:Q9SJQ6
PRIDE:Q9SJQ6 EnsemblPlants:AT2G36490.1 GeneID:818224
KEGG:ath:AT2G36490 TAIR:At2g36490 HOGENOM:HOG000154178
InParanoid:Q9SJQ6 OMA:ANTASLI PhylomeDB:Q9SJQ6
ProtClustDB:CLSN2915131 Genevestigator:Q9SJQ6 GermOnline:AT2G36490
Uniprot:Q9SJQ6
Length = 1393
Score = 1897 (672.8 bits), Expect = 3.2e-199, Sum P(2) = 3.2e-199
Identities = 406/758 (53%), Positives = 495/758 (65%)
Query: 198 HMSSDE--NSKDEHSQFQTLESNTQRVKVKDIDDPKVLSRVSSIPPSSFHPCLTQDLSVE 255
++ S+E +S +H+ NTQ + KD SR SS S H + D + +
Sbjct: 649 YLDSEETMSSPPDHNHSSVTLKNTQPDEEKDYVPSNETSRSSSEIAISAHESV--DKTTD 706
Query: 256 VESYEMRREETRSSGISDVTD-KIALMPEFASQTTDATKLIVAGPEAPRHGNKQSRNSMQ 314
+ Y + + SS D TD K ++ F S+ + T +AP+ N+ +
Sbjct: 707 SKEY-VDSDRKGSSVEVDKTDEKCRVLNLFPSEDSALTCQHSMVSDAPQ-------NTER 758
Query: 315 ANKNSIAQHESELFGDSRFAMEPPAHAQKNDLNL--PKISSGSIDA-IESHNALYNRENT 371
A +S E E + S + D N P +S G + I+ ++ +E T
Sbjct: 759 AGSSSEIDLEGE-YRTSFMKLLQGVQVSLEDSNQVSPNMSPGDCSSEIKGFQSM--KEPT 815
Query: 372 QLKSSV-SDQNKYDHSFSKELNGIDDATSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKK 430
KSSV S + ++ T K K +V KE++ FDWD LRR+ +A G +
Sbjct: 816 --KSSVDSSEPGCCSQQDGDVLSCQKPTLKEKGKKVLKEEKKAFDWDCLRREAQARAGIR 873
Query: 431 ERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLR 490
E+ T D++DW+A+R ADV ++A TIK RGMN+ LA RI+ FL+RLV DHGS+DLEWLR
Sbjct: 874 EKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFLDRLVNDHGSIDLEWLR 933
Query: 491 DVPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXX 550
DVPPDKAKEYLLSF GLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWV
Sbjct: 934 DVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESL 993
Query: 551 XXXXXXXXXVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGE 610
+LESIQKYLWPRLCKLDQ+TLYELHYQMITFGKVFCTKSKPNCNACPM+GE
Sbjct: 994 QLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVFCTKSKPNCNACPMKGE 1053
Query: 611 CRHXXXXXXXXRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHATDLPVGKLEI 670
CRH RLALP E K + + ++N P + + ++ P K
Sbjct: 1054 CRHFASAFASARLALPSTE-KGMGTPDKNPLPLHLPEPFQREQGSEVVQHSE-PAKK--- 1108
Query: 671 AVNNCXXXXXXXXXXXXXRVQVSENDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENL 730
V C +VS DIE+ F EDPEEIPTI+LNM FT L+ M+ N
Sbjct: 1109 -VTCCEPIIEEPASPEPETAEVSIADIEEAFFEDPEEIPTIRLNMDAFTSNLKKIMEHNK 1167
Query: 731 ELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPG 790
ELQ+G+MS ALVALTA AS+P PKLKN+S+LRTEH+VYELPD HPLL +EKREPDDP
Sbjct: 1168 ELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVYELPDEHPLLAQLEKREPDDPC 1227
Query: 791 RYLLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCR 850
YLLAIWTPGETA+SIQP S C Q +G +CDE+TCFSCNS++E+ QIVRGTILIPCR
Sbjct: 1228 SYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFSCNSIKETRSQIVRGTILIPCR 1287
Query: 851 TAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTT 910
TAMRGSFPLNGTYFQVNEVFADH SSL PINVPRE +W LPRRTVYFGTS+P+IFKGL+T
Sbjct: 1288 TAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWELPRRTVYFGTSVPTIFKGLST 1347
Query: 911 EGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASKL 948
E IQ CFW+GYVCVRGFD+K+R P+PL+ARLHFPASKL
Sbjct: 1348 EKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASKL 1385
Score = 54 (24.1 bits), Expect = 3.2e-199, Sum P(2) = 3.2e-199
Identities = 18/54 (33%), Positives = 25/54 (46%)
Query: 20 SSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAEYVLDPEDTIEWKEKMSHQPV 73
SSAFMSLA+ FP+ +S+ Y LD E+T+ +H V
Sbjct: 615 SSAFMSLASQFPVPFVPSSNFDAGTSSMPSIQITY-LDSEETMSSPPDHNHSSV 667
>TAIR|locus:2100138 [details] [associations]
symbol:DML2 "demeter-like 2" species:3702 "Arabidopsis
thaliana" [GO:0003824 "catalytic activity" evidence=IEA]
[GO:0004519 "endonuclease activity" evidence=IEA] [GO:0005634
"nucleus" evidence=ISM] [GO:0006281 "DNA repair" evidence=IEA]
[GO:0006284 "base-excision repair" evidence=IEA;ISS] [GO:0051539 "4
iron, 4 sulfur cluster binding" evidence=IEA] [GO:0006306 "DNA
methylation" evidence=RCA] InterPro:IPR003265 InterPro:IPR003651
InterPro:IPR011257 Pfam:PF00730 PROSITE:PS00764 SMART:SM00525
GO:GO:0005634 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0006355
GO:GO:0046872 GO:GO:0006284 Gene3D:1.10.340.30 SUPFAM:SSF48150
Gene3D:1.10.1670.10 InterPro:IPR023170 GO:GO:0003677 GO:GO:0006351
GO:GO:0051539 GO:GO:0004519 eggNOG:COG0177 EMBL:AC010927
IPI:IPI00533587 RefSeq:NP_187612.5 UniGene:At.40005
ProteinModelPortal:Q9SR66 STRING:Q9SR66 PaxDb:Q9SR66 PRIDE:Q9SR66
GeneID:820162 KEGG:ath:AT3G10010 TAIR:At3g10010 InParanoid:Q9SR66
OMA:STHCELN Genevestigator:Q9SR66 GermOnline:AT3G10010
Uniprot:Q9SR66
Length = 1332
Score = 1502 (533.8 bits), Expect = 6.0e-159, Sum P(2) = 6.0e-159
Identities = 315/588 (53%), Positives = 385/588 (65%)
Query: 374 KSSVSDQNKYDHSFSKEL--NGIDDATSK--SKSTRVSKEKQN-DFDWDSLRRQVEANGG 428
+S++ Q++ + + ++++ N TSK KS +K Q DWDSLR++ E+ G
Sbjct: 744 ESTIQTQDQQESTRTEDVKKNRKKPTTSKPKKKSKESAKSTQKKSVDWDSLRKEAESGGR 803
Query: 429 KKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEW 488
K+ER E T D++DW+A+RC DV+KIAN I +RGMNNMLA RIK FLNRLV+ HGS+DLEW
Sbjct: 804 KRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAFLNRLVKKHGSIDLEW 863
Query: 489 LRDVPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXX 548
LRDVPPDKAKEYLLS GLGLKSVECVRLL+LH +AFPVDTNVGRIAVRLGWV
Sbjct: 864 LRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGRIAVRLGWVPLQPLPD 923
Query: 549 XXXXXXXXXXXVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMR 608
VLES+QKYLWPRLCKLDQ+TLYELHY MITFGKVFCTK KPNCNACPM+
Sbjct: 924 ELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKVFCTKVKPNCNACPMK 983
Query: 609 GECRHXXXXXXXXRLALPGPEEKAIVSANENRTNTQNPAMMINQLP-LPLTHATDLPVGK 667
ECRH RLALP PEE S + ++ +++N P L L + +
Sbjct: 984 AECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFRPSLFLYQEKEQEAQR 1043
Query: 668 LEIAVNNCXXXXXXXXXXXXXRVQVSENDIEDT--------FCEDPEE----IPTIKLNM 715
+ NC + E+DIED EDP E IPTI LN
Sbjct: 1044 SQ----NCEPIIEEPASPEPEYI---EHDIEDYPRDKNNVGTSEDPWENKDVIPTIILNK 1096
Query: 716 KEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVYELPDSH 775
+ T + + +E S LV L+ AA+IP KLK +LRTEH V+ELPD H
Sbjct: 1097 EAGTS------HDLVVNKEAGTSHDLVVLSTYAAAIPRRKLKIKEKLRTEHHVFELPDHH 1150
Query: 776 PLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQE-HGKMCDEKTCFSCNSVR 834
+L G E+RE +D YLLAIWTPGET NSIQPP+ RC+ E + +C+E CF CN R
Sbjct: 1151 SILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNENKCFQCNKTR 1210
Query: 835 ESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRT 894
E E Q VRGTILIPCRTAMRG FPLNGTYFQ NEVFADHDSS+ PI+VP E +W+L RR
Sbjct: 1211 EEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTELIWDLKRRV 1270
Query: 895 VYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLH 942
Y G+S+ SI KGL+ E I++ F GYVCVRGFD+++R P+ L+ RLH
Sbjct: 1271 AYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLH 1318
Score = 68 (29.0 bits), Expect = 6.0e-159, Sum P(2) = 6.0e-159
Identities = 22/52 (42%), Positives = 29/52 (55%)
Query: 9 ENLLLHSFDCFSSAFMSLAANFPL--NSKQKPCHGEEITSVIEEPAEYVLDP 58
+N+ HS SSA+M LAA FP+ N + CH E +SV +E LDP
Sbjct: 576 QNVADHSS---SSAYMDLAAEFPVEWNFNKGSCHEEWGSSVTQETI-LNLDP 623
Score = 40 (19.1 bits), Expect = 5.5e-156, Sum P(2) = 5.5e-156
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 44 ITSVIEEPAEYVLDPEDTIEWKEKMSHQPVCDQG 77
ITS ++ +LDP +T+ E++ Q V +G
Sbjct: 666 ITSA-DQSKTMLLDPFNTVLMNEQVDSQMVKGKG 698
Score = 40 (19.1 bits), Expect = 5.5e-156, Sum P(2) = 5.5e-156
Identities = 17/65 (26%), Positives = 29/65 (44%)
Query: 207 DEHSQFQTLESNTQRVKVKDIDDPKV--LSRVSSIPPSSFHPCLTQDLSVEVESYEMRRE 264
DE SQ +T ++++ K + + K S + I SS T+ + E +Y +
Sbjct: 193 DEKSQLETPTLKRKKIRPKVVREGKTKKASSKAGIKKSSIAATATK--TSEESNYVRPKR 250
Query: 265 ETRSS 269
TR S
Sbjct: 251 LTRRS 255
Score = 38 (18.4 bits), Expect = 9.0e-156, Sum P(2) = 9.0e-156
Identities = 10/20 (50%), Positives = 11/20 (55%)
Query: 56 LDPEDTIEWKEKMSHQPVCD 75
LDPE + WK MS CD
Sbjct: 500 LDPETSRVWKLLMSSID-CD 518
>TAIR|locus:2124301 [details] [associations]
symbol:DML3 "demeter-like protein 3" species:3702
"Arabidopsis thaliana" [GO:0003824 "catalytic activity"
evidence=IEA] [GO:0004519 "endonuclease activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=ISM] [GO:0006281 "DNA repair"
evidence=IEA] [GO:0006284 "base-excision repair" evidence=IEA]
[GO:0051539 "4 iron, 4 sulfur cluster binding" evidence=IEA]
[GO:0010216 "maintenance of DNA methylation" evidence=IMP]
[GO:0019104 "DNA N-glycosylase activity" evidence=IDA]
InterPro:IPR003265 InterPro:IPR003651 InterPro:IPR011257
PROSITE:PS00764 SMART:SM00478 SMART:SM00525 GO:GO:0005634
EMBL:CP002687 GenomeReviews:CT486007_GR GO:GO:0006355 GO:GO:0046872
GO:GO:0006284 Gene3D:1.10.340.30 SUPFAM:SSF48150
Gene3D:1.10.1670.10 InterPro:IPR023170 GO:GO:0003677 GO:GO:0006351
GO:GO:0051539 GO:GO:0004519 EMBL:AL021961 EMBL:AL161584
GO:GO:0010216 eggNOG:COG0177 GO:GO:0019104 EMBL:AY735663
IPI:IPI00525973 PIR:T05430 RefSeq:NP_195132.3 UniGene:At.51042
ProteinModelPortal:O49498 SMR:O49498 STRING:O49498 PRIDE:O49498
EnsemblPlants:AT4G34060.1 GeneID:829552 KEGG:ath:AT4G34060
TAIR:At4g34060 HOGENOM:HOG000064650 OMA:WARIASS PhylomeDB:O49498
ProtClustDB:CLSN2681626 Genevestigator:O49498 Uniprot:O49498
Length = 1044
Score = 1168 (416.2 bits), Expect = 1.7e-122, Sum P(2) = 1.7e-122
Identities = 259/572 (45%), Positives = 342/572 (59%)
Query: 384 DHSFSKELNGIDDATSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWE 443
D S SK + + A K++ T + +++ DW++LRR G RPE DS++W
Sbjct: 472 DESISKVEDHENTAKRKNEKTGIIEDEI--VDWNNLRRMYTKEGS---RPEMHMDSVNWS 526
Query: 444 AVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKEYLLS 503
VR + N + TIK+RG +L+ RI FLN V +G++DLEWLR+ P K YLL
Sbjct: 527 DVRLSGQNVLETTIKKRGQFRILSERILKFLNDEVNQNGNIDLEWLRNAPSHLVKRYLLE 586
Query: 504 FRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXXXXXXVLES 563
G+GLKS ECVRLL L H AFPVDTNVGRIAVRLG V ++S
Sbjct: 587 IEGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVRLGLVPLEPLPNGVQMHQLFEYPSMDS 646
Query: 564 IQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHXXXXXXXXRL 623
IQKYLWPRLCKL Q TLYELHYQMITFGKVFCTK+ PNCNACPM+ EC++ ++
Sbjct: 647 IQKYLWPRLCKLPQETLYELHYQMITFGKVFCTKTIPNCNACPMKSECKYFASAYVSSKV 706
Query: 624 ALPGPEEKAIVSANENRTNTQNPAM-MINQLPLPLTHATDLPVGKLEIAVNNCXXXXXXX 682
L PEEK ++Q+ A+ M + + L + G + A+ C
Sbjct: 707 LLESPEEKMHEPNTFMNAHSQDVAVDMTSNINLVEECVSS---GCSDQAI--CYKPLVEF 761
Query: 683 XXXXXXRVQVSEN-DIEDT-FC---EDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGD- 736
R ++ E+ DIED F + +P I ++ +++++ + + + D
Sbjct: 762 PSSP--RAEIPESTDIEDVPFMNLYQSYASVPKIDFDLDALKKSVEDALVISGRMSSSDE 819
Query: 737 -MSKALVALTAGAASIPA-P--KLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRY 792
+SKALV T A IP P K+K +RLRTEH VY LPD+H LL E+R+ DDP Y
Sbjct: 820 EISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVYVLPDNHELLHDFERRKLDDPSPY 879
Query: 793 LLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTA 852
LLAIW PGET++S PP+ +CSS + K+C K C C ++RE I RGTILIPCRTA
Sbjct: 880 LLAIWQPGETSSSFVPPKKKCSS-DGSKLCKIKNCSYCWTIREQNSNIFRGTILIPCRTA 938
Query: 853 MRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEG 912
MRG+FPLNGTYFQ NEVFADH++SL PI RE L +R +Y G+++ SIFK L T
Sbjct: 939 MRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGLEKRALYCGSTVTSIFKLLDTRR 998
Query: 913 IQHCFWRGYVCVRGFDQKSRAPRPLMARLHFP 944
I+ CFW G++C+R FD+K R P+ L+ RLH P
Sbjct: 999 IELCFWTGFLCLRAFDRKQRDPKELVRRLHTP 1030
Score = 57 (25.1 bits), Expect = 1.7e-122, Sum P(2) = 1.7e-122
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 20 SSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAE 53
S+AFMS+AA FP++++ E ++ IEEP +
Sbjct: 434 SNAFMSVAAKFPVDAR------EGLSYYIEEPQD 461
Score = 51 (23.0 bits), Expect = 7.4e-122, Sum P(2) = 7.4e-122
Identities = 14/61 (22%), Positives = 29/61 (47%)
Query: 333 FAMEPPAHAQKNDLNLPKISSGSIDAIESHNALYNRENTQ---LKSSVSDQNKYDHSFSK 389
+ +E P A+ ++ + +S SI +E H R+N + ++ + D N ++K
Sbjct: 454 YYIEEPQDAKSSECII--LSDESISKVEDHENTAKRKNEKTGIIEDEIVDWNNLRRMYTK 511
Query: 390 E 390
E
Sbjct: 512 E 512
Score = 37 (18.1 bits), Expect = 2.2e-120, Sum P(2) = 2.2e-120
Identities = 10/64 (15%), Positives = 24/64 (37%)
Query: 377 VSDQNKYDHSFSKELNGIDDATSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHT 436
++D +++ + + N + +S + Q + E +G K EH
Sbjct: 2 LTDGSQHTYQNGETKNSKEHERKCDESAHLQDNSQTTHKKKEKKNSKEKHGIKHSESEHL 61
Query: 437 KDSL 440
+D +
Sbjct: 62 QDDI 65
>TAIR|locus:4515103342 [details] [associations]
symbol:AT4G04957 "AT4G04957" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL161502
IPI:IPI00891322 RefSeq:NP_001118934.1 UniGene:At.69420
EnsemblPlants:AT4G04957.1 GeneID:6240683 KEGG:ath:AT4G04957
TAIR:At4g04957 eggNOG:euNOG04714 PhylomeDB:B3H5Q5
ProtClustDB:CLSN2925558 Genevestigator:B3H5Q5 InterPro:IPR015410
Pfam:PF09331 Uniprot:B3H5Q5
Length = 78
Score = 170 (64.9 bits), Expect = 1.4e-11, P = 1.4e-11
Identities = 32/46 (69%), Positives = 38/46 (82%)
Query: 869 VFADHDSSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQ 914
+FADH SSL PI+VPRE +W+LPRRTV+FGTSIP+IFK GIQ
Sbjct: 30 MFADHASSLNPIDVPRELIWDLPRRTVFFGTSIPTIFKDSKEAGIQ 75
>TAIR|locus:2100382 [details] [associations]
symbol:AT3G47830 species:3702 "Arabidopsis thaliana"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0005634
"nucleus" evidence=ISM] [GO:0006281 "DNA repair" evidence=IEA]
[GO:0006284 "base-excision repair" evidence=IEA] InterPro:IPR003265
InterPro:IPR011257 Pfam:PF00730 SMART:SM00478 EMBL:CP002686
GO:GO:0003824 GO:GO:0006284 Gene3D:1.10.340.30 SUPFAM:SSF48150
Gene3D:1.10.1670.10 InterPro:IPR023170 KO:K10773 IPI:IPI00545586
RefSeq:NP_566893.1 UniGene:At.53825 ProteinModelPortal:F4JCQ3
SMR:F4JCQ3 PRIDE:F4JCQ3 EnsemblPlants:AT3G47830.1 GeneID:823937
KEGG:ath:AT3G47830 OMA:FGREYCS Uniprot:F4JCQ3
Length = 293
Score = 185 (70.2 bits), Expect = 1.4e-11, P = 1.4e-11
Identities = 42/100 (42%), Positives = 58/100 (58%)
Query: 442 WEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKEYL 501
W+ V A+ I N I+ G+ A IK+ LNRL + G + LE+LR + ++ K L
Sbjct: 128 WDDVLNAESKSIENAIRCGGLAPKKAVCIKNILNRLQNERGRLCLEYLRGLSVEEVKTEL 187
Query: 502 LSFRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWV 541
F+G+G K+V CV + L H FPVDT+V IA LGWV
Sbjct: 188 SHFKGVGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWV 227
>TIGR_CMR|CHY_1121 [details] [associations]
symbol:CHY_1121 "endonuclease III" species:246194
"Carboxydothermus hydrogenoformans Z-2901" [GO:0003906
"DNA-(apurinic or apyrimidinic site) lyase activity" evidence=ISS]
[GO:0006281 "DNA repair" evidence=ISS] InterPro:IPR000445
InterPro:IPR003265 InterPro:IPR003583 InterPro:IPR003651
InterPro:IPR004036 InterPro:IPR005759 InterPro:IPR011257
Pfam:PF00633 Pfam:PF00730 PIRSF:PIRSF001435 PROSITE:PS01155
SMART:SM00278 SMART:SM00478 SMART:SM00525 GO:GO:0006284
Gene3D:1.10.340.30 SUPFAM:SSF48150 Gene3D:1.10.1670.10
InterPro:IPR023170 GO:GO:0003677 EMBL:CP000141
GenomeReviews:CP000141_GR GO:GO:0051539 GO:GO:0005622 GO:GO:0004519
GO:GO:0003906 eggNOG:COG0177 HOGENOM:HOG000252206 KO:K10773
OMA:NNKSKHL TIGRFAMs:TIGR01083 RefSeq:YP_359967.1
ProteinModelPortal:Q3AD17 STRING:Q3AD17 GeneID:3726382
KEGG:chy:CHY_1121 PATRIC:21275382
BioCyc:CHYD246194:GJCN-1120-MONOMER Uniprot:Q3AD17
Length = 210
Score = 85 (35.0 bits), Expect = 7.1e-05, Sum P(2) = 7.1e-05
Identities = 13/31 (41%), Positives = 20/31 (64%)
Query: 582 ELHYQMITFGKVFCTKSKPNCNACPMRGECR 612
+LH+++I FG+ C KP+CN CP C+
Sbjct: 174 DLHHRLIFFGRRICKAQKPSCNICPFPEFCQ 204
Score = 83 (34.3 bits), Expect = 7.1e-05, Sum P(2) = 7.1e-05
Identities = 24/72 (33%), Positives = 39/72 (54%)
Query: 468 GRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPV 527
G ++ L++ +D E+ VP A+ LL G+G K+ E + + + +FPV
Sbjct: 80 GLYRNKARNLIKIAEILDREYHGQVPDSFAE--LLKLPGVGPKTAEVIVGVGFNKPSFPV 137
Query: 528 DTNVGRIAVRLG 539
DT+V R+A RLG
Sbjct: 138 DTHVFRVARRLG 149
>ASPGD|ASPL0000003678 [details] [associations]
symbol:AN10840 species:162425 "Emericella nidulans"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0003677 "DNA
binding" evidence=IEA] [GO:0006284 "base-excision repair"
evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR003265 InterPro:IPR003583 InterPro:IPR011257
Pfam:PF00730 SMART:SM00278 SMART:SM00478 GO:GO:0003824
GO:GO:0006284 Gene3D:1.10.340.30 SUPFAM:SSF48150
Gene3D:1.10.1670.10 InterPro:IPR023170 GO:GO:0003677 EMBL:BN001301
EnsemblFungi:CADANIAT00007418 HOGENOM:HOG000201727 OMA:CGEVPSV
Uniprot:C8V1C2
Length = 502
Score = 102 (41.0 bits), Expect = 0.00089, Sum P(2) = 0.00089
Identities = 21/62 (33%), Positives = 32/62 (51%)
Query: 480 DHGSVDLEWLRDVPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLG 539
D + L +L +P ++ L+ + G+G K+ CV L L F VDT++ RI L
Sbjct: 358 DQNFLSLNYLHGLPTEEVMTELMKYPGIGPKTAACVLLFCLQRPCFAVDTHIFRICKWLN 417
Query: 540 WV 541
WV
Sbjct: 418 WV 419
Score = 66 (28.3 bits), Expect = 0.00089, Sum P(2) = 0.00089
Identities = 14/45 (31%), Positives = 26/45 (57%)
Query: 437 KDSLDWEAVRCADVNKIANTIKERGMNNMLAGRIKDFLNRLVRDH 481
K S++W+AVR A V + IK G+ + + IK L+ + +++
Sbjct: 274 KGSVNWDAVRRAPVKDVFEAIKSGGLADSKSKNIKAILDMVYKEN 318
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query                      -----  As Used  -----  ----- Computed ----
Frame  MatID  Matrix name  Lambda  K       H      Lambda  K     H
+0     0      BLOSUM62     0.315   0.131   0.391  same    same  same
              Q=9,R=2      0.244   0.0300  0.180  n/a     n/a   n/a
Query
Frame  MatID  Length  Eff.Length        E    S   W   T   X    E2  S2
   +0      0     960         878  0.00085  122   3  11  22  0.43  34
                                                        37  0.45  37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 8
No. of states in DFA: 624 (66 KB)
Total size of DFA: 455 KB (2215 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 84.58u 0.09s 84.67t Elapsed: 00:00:08
Total cpu time: 84.58u 0.09s 84.67t Elapsed: 00:00:08
Start: Fri May 10 18:13:51 2013 End: Fri May 10 18:13:59 2013