BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002607
(901 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SJQ6|ROS1_ARATH Protein ROS1 OS=Arabidopsis thaliana GN=ROS1 PE=1 SV=2
Length = 1393
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/554 (60%), Positives = 392/554 (70%), Gaps = 71/554 (12%)
Query: 398 TSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTI 457
T K K +V KE++ FDWD LRR+ +A G +E+ T D++DW+A+R ADV ++A TI
Sbjct: 841 TLKEKGKKVLKEEKKAFDWDCLRREAQARAGIREKTRSTMDTVDWKAIRAADVKEVAETI 900
Query: 458 KERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAK------------------- 498
K RGMN+ LA RI+ FL+RLV DHGS+DLEWLRDVPPDKAK
Sbjct: 901 KSRGMNHKLAERIQGFLDRLVNDHGSIDLEWLRDVPPDKAKEYLLSFNGLGLKSVECVRL 960
Query: 499 --------------------------YPVLESIQ--------------KYLWPRLCKLDQ 518
P+ ES+Q KYLWPRLCKLDQ
Sbjct: 961 LTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLEMYPMLESIQKYLWPRLCKLDQ 1020
Query: 519 RTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKAIVSAN 578
+TLYELHYQMITFGKVFCTKSKPNCNACPM+GECRHFASAFAS+RLALP E+
Sbjct: 1021 KTLYELHYQMITFGKVFCTKSKPNCNACPMKGECRHFASAFASARLALPSTEKGM----- 1075
Query: 579 ENRTNTQNPAMMINQLPLPLTHATDLPV---GKLEIAVNNCEPIIEEPATPEPERVQVSE 635
T +NP + LP P V + V CEPIIEEPA+PEPE +VS
Sbjct: 1076 --GTPDKNPLPL--HLPEPFQREQGSEVVQHSEPAKKVTCCEPIIEEPASPEPETAEVSI 1131
Query: 636 NDIEDTFCEDPEEIPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAP 695
DIE+ F EDPEEIPTI+LNM FT L+ M+ N ELQ+G+MS ALVALTA AS+P P
Sbjct: 1132 ADIEEAFFEDPEEIPTIRLNMDAFTSNLKKIMEHNKELQDGNMSSALVALTAETASLPMP 1191
Query: 696 KLKNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCS 755
KLKN+S+LRTEH+VYELPD HPLL +EKREPDDP YLLAIWTPGETA+SIQP S C
Sbjct: 1192 KLKNISQLRTEHRVYELPDEHPLLAQLEKREPDDPCSYLLAIWTPGETADSIQPSVSTCI 1251
Query: 756 SQEHGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHD 815
Q +G +CDE+TCFSCNS++E+ QIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADH
Sbjct: 1252 FQANGMLCDEETCFSCNSIKETRSQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHA 1311
Query: 816 SSLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAP 875
SSL PINVPRE +W LPRRTVYFGTS+P+IFKGL+TE IQ CFW+GYVCVRGFD+K+R P
Sbjct: 1312 SSLNPINVPRELIWELPRRTVYFGTSVPTIFKGLSTEKIQACFWKGYVCVRGFDRKTRGP 1371
Query: 876 RPLMARLHFPASKL 889
+PL+ARLHFPASKL
Sbjct: 1372 KPLIARLHFPASKL 1385
Score = 39.3 bits (90), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 7/102 (6%)
Query: 20 SSAFMSLAANFPLNSKQKPCHGEEITSVIEEPAEYVLDPEDTIEWKEKMSHQPVCDQGSM 79
SSAFMSLA+ FP+ +S+ Y LD E+T MS P + S+
Sbjct: 615 SSAFMSLASQFPVPFVPSSNFDAGTSSMPSIQITY-LDSEET------MSSPPDHNHSSV 667
Query: 80 TLHGSESSEEREVVSSNNSLESSTSVVSSINESKCKLMNSSE 121
TL ++ EE++ V SN + SS+ + S +ES K +S E
Sbjct: 668 TLKNTQPDEEKDYVPSNETSRSSSEIAISAHESVDKTTDSKE 709
>sp|Q8LK56|DME_ARATH Transcriptional activator DEMETER OS=Arabidopsis thaliana GN=DME PE=1
SV=2
Length = 1987
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/959 (42%), Positives = 548/959 (57%), Gaps = 108/959 (11%)
Query: 20 SSAFMSLAANFPLNSKQKPCHGEEITSVI-EEPAEYVLDPEDTIEWKEKMSHQPVCDQGS 78
SSAFMSLAA FP + SV+ E+P +L+ + W+EK+ H
Sbjct: 1043 SSAFMSLAARFPPKLSSSREDERNVRSVVVEDPEGCILNLNEIPSWQEKVQHP-----SD 1097
Query: 79 MTLHGSESSEEREVVSSNNSLESSTSVVSSINESKCKLMNSSEIYPETYNDVLSSPNSLD 138
M + G +S + ++ +NS + N + + N E +VLSS +S D
Sbjct: 1098 MEVSGVDSGSKEQLRDCSNS------GIERFNFLEKSIQNLEE-------EVLSSQDSFD 1144
Query: 139 SSFAPFADGTISSSNSNSDAGDSSNVPTLNSFNGSNSFVELL--QMVGSTMLHGNYNHRN 196
+ S S S SDA + + +G++ V+ + L GN
Sbjct: 1145 PAIFQSCGRVGSCSCSKSDAEFPTTRCETKTVSGTSQSVQTGSPNLSDEICLQGNERPHL 1204
Query: 197 GHMSSDENSKDEHSQFQTLESNTQRVKVKD---IDDPKVLSRVSSIPPSSFHPCLTQDLS 253
S D ++ + Q + + KD P+ + + P SS+ C T+
Sbjct: 1205 YEGSGDVQKQETTNVAQKKPDLEKTMNWKDSVCFGQPRNDTNWQTTPSSSYEQCATRQPH 1264
Query: 254 V-EVESYEMRREETRSS--GISDVTDKIAL--MPEFASQTTDATKLIVAGPEAPRHGNKQ 308
V ++E + M+ E S IS D++ +P + + G P ++
Sbjct: 1265 VLDIEDFGMQGEGLGYSWMSISPRVDRVKNKNVPRRFFRQGGSVPREFTGQIIPSTPHEL 1324
Query: 309 SRNSMQANKNSIAQHESELFGDSRFAMEPPAHAQKNDLNLPKISSGSIDAIESH-----N 363
+ + +++ +H+ + + + M +H QK L+L S + S +
Sbjct: 1325 PGMGLSGSSSAVQEHQDDTQHNQQDEMNKASHLQKTFLDLLNSSEECLTRQSSTKQNITD 1384
Query: 364 ALYNRENTQ------LKSSVSDQNKY--DHSFSKELNGIDDATSKSKSTR-----VSKEK 410
R+ T L ++ S QN +S +KE ++ + + R ++ K
Sbjct: 1385 GCLPRDRTAEDVVDPLSNNSSLQNILVESNSSNKEQTAVEYKETNATILREMKGTLADGK 1444
Query: 411 QNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGRI 470
+ WDSLR+ VE N G++ER ++ DS+D+EA+R A +++I+ IKERGMNNMLA RI
Sbjct: 1445 KPTSQWDSLRKDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRI 1504
Query: 471 K---------------------------DFL-------------NRLVRDHG-------- 482
K D+L RL+ H
Sbjct: 1505 KDFLERIVKDHGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTN 1564
Query: 483 ----SVDLEWLRDVP-PDKAK------YPVLESIQKYLWPRLCKLDQRTLYELHYQMITF 531
+V + W+ P P+ + YPVLESIQK+LWPRLCKLDQRTLYELHYQ+ITF
Sbjct: 1565 VGRIAVRMGWVPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITF 1624
Query: 532 GKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKAIVSANENRTNTQNPAMMI 591
GKVFCTKS+PNCNACPMRGECRHFASA+AS+RLALP PEE+++ SA P + I
Sbjct: 1625 GKVFCTKSRPNCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAI 1684
Query: 592 NQLPLPLTHATDLPVGKLEIAVNNCEPIIEEPATPEPERVQVSENDIEDTFC-EDPEEIP 650
+ LPL L G NCEPIIEEPA+P E +++E+DIED + EDP+EIP
Sbjct: 1685 PMIELPLPLEKSLASGAPS-NRENCEPIIEEPASPGQECTEITESDIEDAYYNEDPDEIP 1743
Query: 651 TIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAPKLKNVSRLRTEHQVY 710
TIKLN+++F TL+ +M+ N+ELQEGDMSKALVAL SIP PKLKN+SRLRTEHQVY
Sbjct: 1744 TIKLNIEQFGMTLREHMERNMELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVY 1803
Query: 711 ELPDSHPLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFS 770
ELPDSH LL GM+KREPDDP YLLAIWTPGETANS QPPE +C + GKMC ++TC
Sbjct: 1804 ELPDSHRLLDGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSE 1863
Query: 771 CNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWN 830
CNS+RE+ Q VRGT+LIPCRTAMRGSFPLNGTYFQVNE+FADH+SSLKPI+VPR+W+W+
Sbjct: 1864 CNSLREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNELFADHESSLKPIDVPRDWIWD 1923
Query: 831 LPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFPASKL 889
LPRRTVYFGTS+ SIF+GL+TE IQ CFW+G+VCVRGF+QK+RAPRPLMARLHFPASKL
Sbjct: 1924 LPRRTVYFGTSVTSIFRGLSTEQIQFCFWKGFVCVRGFEQKTRAPRPLMARLHFPASKL 1982
>sp|Q9SR66|DML2_ARATH DEMETER-like protein 2 OS=Arabidopsis thaliana GN=DML2 PE=3 SV=2
Length = 1332
Score = 519 bits (1336), Expect = e-146, Method: Compositional matrix adjust.
Identities = 277/547 (50%), Positives = 341/547 (62%), Gaps = 86/547 (15%)
Query: 410 KQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTIKERGMNNMLAGR 469
++ DWDSLR++ E+ G K+ER E T D++DW+A+RC DV+KIAN I +RGMNNMLA R
Sbjct: 785 QKKSVDWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAER 844
Query: 470 IKDFLNRLVRDHGS---------------------------------------------- 483
IK FLNRLV+ HGS
Sbjct: 845 IKAFLNRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDT 904
Query: 484 ------VDLEWLRDVP-PDKAK------YPVLESIQKYLWPRLCKLDQRTLYELHYQMIT 530
V L W+ P PD+ + YPVLES+QKYLWPRLCKLDQ+TLYELHY MIT
Sbjct: 905 NVGRIAVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMIT 964
Query: 531 FGKVFCTKSKPNCNACPMRGECRHFASAFASSRLALPGPEEKAIVSANENRTNTQNPAMM 590
FGKVFCTK KPNCNACPM+ ECRH++SA AS+RLALP PEE S + ++ ++
Sbjct: 965 FGKVFCTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVV 1024
Query: 591 INQLP-LPLTHATDLPVGKLEIAVNNCEPIIEEPATPEPERVQVSENDIED--------T 641
+N P L L + + + NCEPIIEEPA+PEPE + E+DIED
Sbjct: 1025 VNFRPSLFLYQEKEQEAQRSQ----NCEPIIEEPASPEPEYI---EHDIEDYPRDKNNVG 1077
Query: 642 FCEDPEE----IPTIKLNMKEFTQTLQNYMQENLELQEGDMSKALVALTAGAASIPAPKL 697
EDP E IPTI LN + T + + +E S LV L+ AA+IP KL
Sbjct: 1078 TSEDPWENKDVIPTIILNKEAGTS------HDLVVNKEAGTSHDLVVLSTYAAAIPRRKL 1131
Query: 698 KNVSRLRTEHQVYELPDSHPLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQ 757
K +LRTEH V+ELPD H +L G E+RE +D YLLAIWTPGET NSIQPP+ RC+
Sbjct: 1132 KIKEKLRTEHHVFELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALF 1191
Query: 758 E-HGKMCDEKTCFSCNSVRESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDS 816
E + +C+E CF CN RE E Q VRGTILIPCRTAMRG FPLNGTYFQ NEVFADHDS
Sbjct: 1192 ESNNTLCNENKCFQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDS 1251
Query: 817 SLKPINVPREWLWNLPRRTVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPR 876
S+ PI+VP E +W+L RR Y G+S+ SI KGL+ E I++ F GYVCVRGFD+++R P+
Sbjct: 1252 SINPIDVPTELIWDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPK 1311
Query: 877 PLMARLH 883
L+ RLH
Sbjct: 1312 SLVKRLH 1318
>sp|O49498|DML3_ARATH DEMETER-like protein 3 OS=Arabidopsis thaliana GN=DML3 PE=2 SV=2
Length = 1044
Score = 366 bits (940), Expect = e-100, Method: Compositional matrix adjust.
Identities = 193/411 (46%), Positives = 255/411 (62%), Gaps = 45/411 (10%)
Query: 499 YPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASA 558
YP ++SIQKYLWPRLCKL Q TLYELHYQMITFGKVFCTK+ PNCNACPM+ EC++FASA
Sbjct: 641 YPSMDSIQKYLWPRLCKLPQETLYELHYQMITFGKVFCTKTIPNCNACPMKSECKYFASA 700
Query: 559 FASSRLALPGPEEKAIVSANENRTNTQNPAMMINQLPLPLTHATDLPVGKL-------EI 611
+ SS++ L PEEK P +N H+ D+ V E
Sbjct: 701 YVSSKVLLESPEEKM-----------HEPNTFMN------AHSQDVAVDMTSNINLVEEC 743
Query: 612 AVNNC-------EPIIEEPATPEPERVQVSEN-DIEDT----FCEDPEEIPTIKLNMKEF 659
+ C +P++E P++P R ++ E+ DIED + +P I ++
Sbjct: 744 VSSGCSDQAICYKPLVEFPSSP---RAEIPESTDIEDVPFMNLYQSYASVPKIDFDLDAL 800
Query: 660 TQTLQNYMQENLELQEGD--MSKALVALTAGAASIPAP---KLKNVSRLRTEHQVYELPD 714
+++++ + + + D +SKALV T A IP K+K +RLRTEH VY LPD
Sbjct: 801 KKSVEDALVISGRMSSSDEEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVYVLPD 860
Query: 715 SHPLLRGMEKREPDDPGRYLLAIWTPGETANSIQPPESRCSSQEHGKMCDEKTCFSCNSV 774
+H LL E+R+ DDP YLLAIW PGET++S PP+ +CSS + K+C K C C ++
Sbjct: 861 NHELLHDFERRKLDDPSPYLLAIWQPGETSSSFVPPKKKCSS-DGSKLCKIKNCSYCWTI 919
Query: 775 RESEFQIVRGTILIPCRTAMRGSFPLNGTYFQVNEVFADHDSSLKPINVPREWLWNLPRR 834
RE I RGTILIPCRTAMRG+FPLNGTYFQ NEVFADH++SL PI RE L +R
Sbjct: 920 REQNSNIFRGTILIPCRTAMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGLEKR 979
Query: 835 TVYFGTSIPSIFKGLTTEGIQHCFWRGYVCVRGFDQKSRAPRPLMARLHFP 885
+Y G+++ SIFK L T I+ CFW G++C+R FD+K R P+ L+ RLH P
Sbjct: 980 ALYCGSTVTSIFKLLDTRRIELCFWTGFLCLRAFDRKQRDPKELVRRLHTP 1030
Score = 73.9 bits (180), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 59/106 (55%), Gaps = 3/106 (2%)
Query: 398 TSKSKSTRVSKEKQNDFDWDSLRRQVEANGGKKERPEHTKDSLDWEAVRCADVNKIANTI 457
T+K K+ + + DW++LRR G RPE DS++W VR + N + TI
Sbjct: 484 TAKRKNEKTGIIEDEIVDWNNLRRMYTKEGS---RPEMHMDSVNWSDVRLSGQNVLETTI 540
Query: 458 KERGMNNMLAGRIKDFLNRLVRDHGSVDLEWLRDVPPDKAKYPVLE 503
K+RG +L+ RI FLN V +G++DLEWLR+ P K +LE
Sbjct: 541 KKRGQFRILSERILKFLNDEVNQNGNIDLEWLRNAPSHLVKRYLLE 586
>sp|O05956|END3_RICPR Endonuclease III OS=Rickettsia prowazekii (strain Madrid E) GN=nth
PE=3 SV=1
Length = 212
Score = 43.9 bits (102), Expect = 0.006, Method: Composition-based stats.
Identities = 15/46 (32%), Positives = 27/46 (58%)
Query: 516 LDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFAS 561
+D++ L H+ +I G+ C KP CN CP++ C ++ + F+S
Sbjct: 167 IDEKWLTYAHHWLILHGRYICKARKPGCNICPIKEYCEYYINTFSS 212
>sp|Q68W04|END3_RICTY Endonuclease III OS=Rickettsia typhi (strain ATCC VR-144 /
Wilmington) GN=nth PE=3 SV=1
Length = 212
Score = 39.3 bits (90), Expect = 0.14, Method: Composition-based stats.
Identities = 13/46 (28%), Positives = 26/46 (56%)
Query: 516 LDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFAS 561
+D++ L H+ ++ G+ C KP+C C ++ C ++ + FAS
Sbjct: 167 IDEKWLTHAHHWLVLHGRYICKARKPSCRICHIKEYCEYYINTFAS 212
>sp|Q4UK93|END3_RICFE Endonuclease III OS=Rickettsia felis (strain ATCC VR-1525 /
URRWXCal2) GN=nth PE=3 SV=1
Length = 213
Score = 38.5 bits (88), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 13/46 (28%), Positives = 26/46 (56%)
Query: 516 LDQRTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFASAFAS 561
++++ L H+ +I G+ C KP+C+ CP++ C ++ S S
Sbjct: 167 INEKWLTHAHHWLILHGRYICKARKPDCDICPIKEYCEYYNSPIIS 212
>sp|Q89A45|MUTY_BUCBP A/G-specific adenine glycosylase OS=Buchnera aphidicola subsp.
Baizongia pistaciae (strain Bp) GN=mutY PE=3 SV=1
Length = 351
Score = 36.2 bits (82), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 528 MITFGKVFCTKSKPNCNACPMRGECRHFAS 557
+I G + C KS P CN CP++ C+ + +
Sbjct: 182 LIDIGALICLKSNPKCNICPLKSTCKSYLN 211
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.313 0.128 0.376
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 334,636,577
Number of Sequences: 539616
Number of extensions: 14128769
Number of successful extensions: 36779
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 40
Number of HSP's successfully gapped in prelim test: 273
Number of HSP's that attempted gapping in prelim test: 34533
Number of HSP's gapped (non-prelim): 1715
length of query: 901
length of database: 191,569,459
effective HSP length: 127
effective length of query: 774
effective length of database: 123,038,227
effective search space: 95231587698
effective search space used: 95231587698
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 66 (30.0 bits)