BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006651
(637 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225433251|ref|XP_002282069.1| PREDICTED: uncharacterized protein LOC100247804 [Vitis vinifera]
gi|296083723|emb|CBI23712.3| unnamed protein product [Vitis vinifera]
Length = 642
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/636 (61%), Positives = 474/636 (74%), Gaps = 4/636 (0%)
Query: 1 MENKLGNTRNDGQGKEDSLDSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNVKQ 60
MENK N DGQ KED L+SLG++ E + N LD SE KS+RL ++ S+Q +ILNVK+
Sbjct: 10 MENKFVNGPKDGQVKEDMLNSLGEDRESAQGNDLDFSESKSKRLARKGRSEQAVILNVKK 69
Query: 61 LFSSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLGS 120
L SR S+K THGL+ ++PKH+++LDERYLRRCLELIH SASKAA CNIS++L S
Sbjct: 70 L-QSRGSVK--VESFTHGLNNRMPKHIMSLDERYLRRCLELIHISASKAAPCNISLNLSS 126
Query: 121 AKMGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISPAGQWIVGSIMGSKSMIN 180
KMG D +N+ K R N CD ++ + ECP+ TG+ VV+SPAGQW+VG++MGSKSM+
Sbjct: 127 LKMGIFSDTLNLPKFRGRNTCDMSKLVIECPVATGTGDVVVSPAGQWVVGTLMGSKSMMK 186
Query: 181 ILKSPLLSKLGKLDGDGNFRRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQTWMLGNH 240
ILKSPL + G L+ D F TS+ D KG++ DF+ SPGG S SPQ + K GNH
Sbjct: 187 ILKSPLFRQFGALEDDVKFGGTSLIDAKGAMCTDFISSPGGLSIRSPQNLPKGNISQGNH 246
Query: 241 KHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKEVF 300
++G E HKRL S SSTNSTCSDQS SS + S G+LQC W GG PHFVFS+DD++EV+
Sbjct: 247 RYGPEAVHKRLISTSSTNSTCSDQSPSSGFATVSQGMLQCTWKGGIPHFVFSVDDQREVY 306
Query: 301 VANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDNSKIM 360
VAN +E + K++D MY FH R GGQKEH+I DN++ LVGKMKVS SF LCP++SKIM
Sbjct: 307 VANQWKAELPDDKALDCMYLFHSRTGGQKEHKIPDNKADLVGKMKVSSSFTLCPNDSKIM 366
Query: 361 ETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILE 420
ET+F FG E++ MQTS RKNKGLSKKVV+ FRT+HS K+RT SKFGG++AILE
Sbjct: 367 ETEFVFFGGRENYASKMQTSSSIIRKNKGLSKKVVEVFRTSHSSKQRTTSKFGGTTAILE 426
Query: 421 NSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQE-EVGGWGMKFLK 479
N SWE CQ+ N +LG LLEN LPPNLELAAI+VK+HLP RQE EVGGWG+KFLK
Sbjct: 427 NDSWEQCQDTCNNPSALGRTNLLENDLPPNLELAAIVVKDHLPGKRQEAEVGGWGLKFLK 486
Query: 480 KVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSG 539
KVG+K+ +L+ S PS R TG CSTSMDIL+PAG HGGPRTRNGGPS +IERWRSG
Sbjct: 487 KVGLKQNSASLEASLPSECCLRSTGDCSTSMDILVPAGIHGGPRTRNGGPSSLIERWRSG 546
Query: 540 GCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRD 599
G CDCGGWD+GCPLTVL TR +K+E + D+ +E SFDLF GS+ AP+LR+VNV D
Sbjct: 547 GHCDCGGWDIGCPLTVLKTRATKEEFSHHTDIQEEYKSFDLFAQGSELSAPILRVVNVHD 606
Query: 600 DLYFIHFHSTLSALQSFSIAVASIHSQSPTLRPKNV 635
LYFIHF S LSALQSFSIAVA IH++SP LRPK V
Sbjct: 607 GLYFIHFRSMLSALQSFSIAVAIIHTRSPILRPKTV 642
>gi|255574909|ref|XP_002528361.1| conserved hypothetical protein [Ricinus communis]
gi|223532229|gb|EEF34033.1| conserved hypothetical protein [Ricinus communis]
Length = 637
Score = 742 bits (1915), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/637 (60%), Positives = 460/637 (72%), Gaps = 2/637 (0%)
Query: 1 MENKLGNTRNDGQGKEDSLD-SLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNVK 59
ME+ L + G +EDS S + V GD+ L SE +S+RLV+RVNSQ TL+ K
Sbjct: 1 MEDTLAGAKRGGHSREDSSSYSFNHDEGVLGDDELHCSESRSKRLVRRVNSQNTLVSKFK 60
Query: 60 QLFSSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLG 119
QL SRS IK F RS LD IPKHMVT+DE+YLRRCLE+IH S SK ++S++LG
Sbjct: 61 QLQRSRSYIKESFLRSIRVLDNWIPKHMVTVDEKYLRRCLEIIHVSTSKEVPRDLSLNLG 120
Query: 120 SAKMGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISPAGQWIVGSIMGSKSMI 179
+MG + D ++ AK +EN CD RF F+CP+ +G SVVISPAG WIVGSIMGSKSM+
Sbjct: 121 WGEMGVLSDGLSPAKIGSENTCDFTRFHFDCPLASGDGSVVISPAGHWIVGSIMGSKSMV 180
Query: 180 NILKSPLLSKLGKLDGDGNFRRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQTWMLGN 239
NIL SPL K G DGD NF + S +DV GSI Y+FM SP YS K+ K+T + G
Sbjct: 181 NILNSPLFRKYGAYDGDANFEKVSFSDVDGSICYNFMNSPSVLGSYSSNKLYKETPIPGK 240
Query: 240 HKHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKEV 299
++ SET ++R S SSTNSTCSDQSSSS S + S G+LQC W+ GNPHFVFS+DD+K V
Sbjct: 241 QRYRSETLNERPVSMSSTNSTCSDQSSSSLSGTVSQGMLQCTWEDGNPHFVFSLDDQKVV 300
Query: 300 FVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDNSKI 359
+VANL NS+ + +DY Y FH RKG QKE EI D ES LVGKMKVS SF+LC +NSKI
Sbjct: 301 YVANLWNSQLADDIVVDYTYLFHLRKGDQKESEIHDTESHLVGKMKVSTSFSLCQNNSKI 360
Query: 360 METQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAIL 419
ME +F LFGSNE+ MQTS RKNKGLSKKV + FRT+HS K+RT+S+ GSSAIL
Sbjct: 361 MEREFILFGSNENV-KRMQTSSHDLRKNKGLSKKVAEVFRTSHSMKKRTISRLTGSSAIL 419
Query: 420 ENSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLK 479
E+ SWEP Q+ N D+ LLE HLPPNLE+AAI+VKEH+PE R+E+ GGWG+ FLK
Sbjct: 420 ESCSWEPLQDTENNIDAHSGVNLLEKHLPPNLEMAAIVVKEHIPEKRREKGGGWGLNFLK 479
Query: 480 KVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSG 539
K K+T + +++S S AR+TG CSTS+DILIP HGGPRTRNGGPSG+ ERW+SG
Sbjct: 480 KSSAKQTTDLMESSQHYGSCARNTGDCSTSIDILIPVSLHGGPRTRNGGPSGLTERWKSG 539
Query: 540 GCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRD 599
GCCDCGGWDLGCPLTVL +RLS KE QADM EC DL GS+ AP +RMVNV D
Sbjct: 540 GCCDCGGWDLGCPLTVLKSRLSNKELSPQADMQGECKLIDLITQGSENSAPQMRMVNVHD 599
Query: 600 DLYFIHFHSTLSALQSFSIAVASIHSQSPTLRPKNVQ 636
LYF+HF STLSALQSFSIAVA IHSQSPTLRP NVQ
Sbjct: 600 GLYFVHFQSTLSALQSFSIAVAFIHSQSPTLRPDNVQ 636
>gi|224110636|ref|XP_002315586.1| predicted protein [Populus trichocarpa]
gi|222864626|gb|EEF01757.1| predicted protein [Populus trichocarpa]
Length = 615
Score = 726 bits (1874), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/626 (59%), Positives = 446/626 (71%), Gaps = 23/626 (3%)
Query: 1 MENKLGNTRNDGQGKEDSLDSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNVKQ 60
M++ N + DG GKEDSL SL + E+SGD+ LD KQ
Sbjct: 1 MDDTFANAQKDGLGKEDSLHSLSHDRELSGDHELD----------------------FKQ 38
Query: 61 LFSSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLGS 120
L S+S IK+ F RS H LD IPKH++T+DE+YLRRCLE IH S S+AA CN S+ L
Sbjct: 39 LQRSKSCIKDSFLRSIHVLDNWIPKHIITIDEKYLRRCLEFIHASTSQAAPCNASMYLDW 98
Query: 121 AKMGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISPAGQWIVGSIMGSKSMIN 180
MGF D +N+AK +N C SA F F+CP+ G+ SVVISPA QWI GSIMGSKSM+N
Sbjct: 99 GNMGFFSDGLNIAKIGNQNTCSSAGFDFDCPL-AGTGSVVISPAEQWIAGSIMGSKSMVN 157
Query: 181 ILKSPLLSKLGKLDGDGNFRRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQTWMLGNH 240
ILKSPLL + G DGD NF + + +DVKGSI YDFM SPGG S YSP K++ G+H
Sbjct: 158 ILKSPLLRRYGAYDGDANFEKVTSSDVKGSICYDFMNSPGGLSSYSPHKLDNGAQTPGSH 217
Query: 241 KHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKEVF 300
K+ SE+ HKR S SSTNSTCSDQSSSS + + G LQC W GGNPHF+FS+DD+K V+
Sbjct: 218 KYESESLHKRFVSMSSTNSTCSDQSSSSLPAAVTQGTLQCTWKGGNPHFIFSLDDQKVVY 277
Query: 301 VANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDNSKIM 360
VAN N S + K+++Y+Y FH RKGGQKEH I D ES LVG+M VS SF+LCP+N +IM
Sbjct: 278 VANFYNVNSADDKALEYVYLFHSRKGGQKEHMIHDKESRLVGQMNVSTSFSLCPNNCRIM 337
Query: 361 ETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILE 420
E +F LFG E+ G +Q+S RKNK LS +VV+ FRT +SFK+RT S+F GS AILE
Sbjct: 338 EREFVLFGCYENLVGELQSSSHDLRKNKRLSSRVVEVFRTNNSFKKRTNSRFDGSGAILE 397
Query: 421 NSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKK 480
NSSWEP Q N D+LG A LLENHLPPNLELAAI+VK HLPE R E+ GGWG+ FLK
Sbjct: 398 NSSWEPFQEREYNMDALGGANLLENHLPPNLELAAILVKNHLPEKRPEKTGGWGLNFLKN 457
Query: 481 VGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGG 540
+ + +T+K+S SAS ARDTG CSTS+DILIPAG HGGPRTRNGGPS +I+RWRSGG
Sbjct: 458 TAVTQAKDTMKSSVLSASCARDTGDCSTSIDILIPAGLHGGPRTRNGGPSSLIDRWRSGG 517
Query: 541 CCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDD 600
CDCGGWDLGCPLTVL +R + KE ADM EC +LF+ GS+ GAP LRMVNV D
Sbjct: 518 YCDCGGWDLGCPLTVLKSRSANKEFSSSADMQGECKLINLFIQGSENGAPPLRMVNVHDG 577
Query: 601 LYFIHFHSTLSALQSFSIAVASIHSQ 626
+Y +HF STLSALQ+FSIA+A IHSQ
Sbjct: 578 MYSVHFQSTLSALQTFSIALAFIHSQ 603
>gi|147864336|emb|CAN83005.1| hypothetical protein VITISV_003699 [Vitis vinifera]
Length = 597
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/636 (58%), Positives = 448/636 (70%), Gaps = 40/636 (6%)
Query: 1 MENKLGNTRNDGQGKEDSLDSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNVKQ 60
MENK N DGQ KED L+SLG++ E + N LD SE KS+R+
Sbjct: 1 MENKFVNGLKDGQVKEDMLNSLGEDRESAQGNDLDFSESKSKRM---------------- 44
Query: 61 LFSSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLGS 120
PKH+++LDERYLRRCLELIH SASKAA CNIS++L S
Sbjct: 45 -----------------------PKHIMSLDERYLRRCLELIHISASKAAPCNISLNLSS 81
Query: 121 AKMGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISPAGQWIVGSIMGSKSMIN 180
KMG D +N+ K R N CD ++ + ECP+ TG+ VV+SPAGQW+VG++MGSKSM+
Sbjct: 82 LKMGIFSDTLNLPKFRGRNTCDMSKLVIECPVATGTGDVVVSPAGQWVVGTLMGSKSMMK 141
Query: 181 ILKSPLLSKLGKLDGDGNFRRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQTWMLGNH 240
ILKSPL + G L+ D F TS+ D KG++ DF+ SPGG S SPQ + K GNH
Sbjct: 142 ILKSPLFRQFGALEDDVKFGGTSLIDAKGAMCTDFISSPGGLSIRSPQNLPKGNISQGNH 201
Query: 241 KHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKEVF 300
+G E HKRL S SSTNSTCSDQS SS + S G+LQC W GG PHFVFS+DD++EV+
Sbjct: 202 IYGPEAVHKRLISTSSTNSTCSDQSPSSGFATVSQGMLQCTWKGGIPHFVFSVDDQREVY 261
Query: 301 VANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDNSKIM 360
VAN +E + K++D MY FH R GGQKEH+I DN++ LVGKMKVS SF LCP++SKIM
Sbjct: 262 VANQWKAELPDDKALDCMYLFHSRTGGQKEHKIPDNKADLVGKMKVSSSFTLCPNDSKIM 321
Query: 361 ETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILE 420
ET+F FG E++ MQTS RKNKGLSKKVV+ FRT+HS K+RT SKFGG++AILE
Sbjct: 322 ETEFVFFGGRENYASKMQTSSSIIRKNKGLSKKVVEVFRTSHSSKQRTTSKFGGTTAILE 381
Query: 421 NSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQE-EVGGWGMKFLK 479
N SWE CQ+ N +LG LLEN LPPNLELAAI+VK+HLP RQE EVGGWG+KFLK
Sbjct: 382 NDSWEQCQDTCNNPSALGRTNLLENDLPPNLELAAIVVKDHLPGKRQEAEVGGWGLKFLK 441
Query: 480 KVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSG 539
KVG+K+ +L+ S PS R TG CSTSMDIL+PAG HGGPRTRNGGPS +IERWRSG
Sbjct: 442 KVGLKQNSASLEASLPSECCLRSTGDCSTSMDILVPAGIHGGPRTRNGGPSSLIERWRSG 501
Query: 540 GCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRD 599
G CDCGGWD+GCPLTVL TR +K+E + D+ +E SFDLF GS+ AP+LR+VNV D
Sbjct: 502 GHCDCGGWDIGCPLTVLKTRATKEEFSHHTDIQEEYKSFDLFAQGSELSAPILRVVNVHD 561
Query: 600 DLYFIHFHSTLSALQSFSIAVASIHSQSPTLRPKNV 635
LYFIHF S LSALQSFSIAVA IH++SP LRPK V
Sbjct: 562 GLYFIHFRSMLSALQSFSIAVAIIHTRSPILRPKTV 597
>gi|224102229|ref|XP_002312600.1| predicted protein [Populus trichocarpa]
gi|222852420|gb|EEE89967.1| predicted protein [Populus trichocarpa]
Length = 607
Score = 687 bits (1772), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/635 (57%), Positives = 445/635 (70%), Gaps = 31/635 (4%)
Query: 1 MENKLGNTRNDGQGKEDSLDSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNVKQ 60
M++ N + DGQGKEDSL SL E+S VKQ
Sbjct: 1 MDDTSANAQKDGQGKEDSLHSLNHGRELS----------------------------VKQ 32
Query: 61 LFSSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLGS 120
L S+S IK+ F R H LD IPKH+ T+DE+YLRRCLE IH S S++A CN S+ L
Sbjct: 33 LQRSKSCIKDSFLRGIHVLDNWIPKHITTIDEKYLRRCLEFIHASTSQSAPCNASMHLDW 92
Query: 121 AKMGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISPAGQWIVGSIMGSKSMIN 180
MGF D +N+AK +EN ARF F+CP+ G+ +VVISPA QWIVGSIMGSKSM+N
Sbjct: 93 GNMGFFSDGLNIAKTGSENTYGLARFDFDCPL-AGTGNVVISPAEQWIVGSIMGSKSMVN 151
Query: 181 ILKSPLLSKLGKLDGDGNFRRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQTWMLGNH 240
ILKSPLL + G DGD NF + S ++VKG I ++ M SP G YSP K++K+ + G+H
Sbjct: 152 ILKSPLLLRYGSYDGDANFEKVSSSEVKGPICFNLMNSPSGLRSYSPHKLDKEAQIPGSH 211
Query: 241 KHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKEVF 300
K GSET HKRL S SS+NSTCSDQSSSS S S + G LQC W GGNPHFVFSID++K V+
Sbjct: 212 KFGSETLHKRLISMSSSNSTCSDQSSSSPSASVTQGTLQCTWKGGNPHFVFSIDNQKLVY 271
Query: 301 VANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDNSKIM 360
VANL N +S + K+++Y+Y FH RKG QKEH D ES LVGKMKVS SF LCP NS+I
Sbjct: 272 VANLYNVDSADDKALEYVYLFHSRKGSQKEHMNHDKESHLVGKMKVSTSFALCPKNSRIT 331
Query: 361 ETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILE 420
+ +F LFG E++ G Q+S RKNKGLS +V + FRT++S KRRT S+FGGS AILE
Sbjct: 332 QREFVLFGY-ENYVGEWQSSSHDLRKNKGLSSRVAEVFRTSNSSKRRTNSRFGGSGAILE 390
Query: 421 NSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKK 480
NSSWEP Q N D+LG LLENHLPPNLELAAI+VK+HLPE RQE+ GGWG+ FLKK
Sbjct: 391 NSSWEPFQERDNNIDALGGTNLLENHLPPNLELAAILVKDHLPEKRQEKAGGWGLNFLKK 450
Query: 481 VGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGG 540
+ + + E +K+S A A+D+G CSTS+DILIPAG HGGPRTRNGGPS +I+RW SGG
Sbjct: 451 MAVTQ-AEDVKSSVLPACCAQDSGDCSTSIDILIPAGLHGGPRTRNGGPSSLIDRWISGG 509
Query: 541 CCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDD 600
CDCGGWDLGCPLTVL +R + KE QAD EC DLF+ GS+ GAP + MVNV D
Sbjct: 510 HCDCGGWDLGCPLTVLKSRSAHKEFSSQADTQGECKLVDLFIEGSENGAPPMSMVNVHDG 569
Query: 601 LYFIHFHSTLSALQSFSIAVASIHSQSPTLRPKNV 635
LYF+HF STLSALQSFSIAVA IH+Q + +N+
Sbjct: 570 LYFVHFQSTLSALQSFSIAVAFIHTQKVPVSNQNM 604
>gi|356574736|ref|XP_003555501.1| PREDICTED: uncharacterized protein LOC100814338 [Glycine max]
Length = 624
Score = 610 bits (1573), Expect = e-172, Method: Compositional matrix adjust.
Identities = 325/634 (51%), Positives = 420/634 (66%), Gaps = 19/634 (2%)
Query: 1 MENKLGNTRNDGQGKEDSLDSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNVKQ 60
MEN + +G G++DSL L + E+S + DSS+ + ++L+KR NS + +++N Q
Sbjct: 1 MENTGLWIQKEGHGRKDSLRGLLNDREISEEE--DSSKSQLKKLIKRSNSHRAIVVNAIQ 58
Query: 61 LFSSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLGS 120
S+ +K+ F + HGLD +IPKH+V++DE+Y+ RCLE IH +A KAAQ NI L +
Sbjct: 59 SQISKVFLKDLFVQRVHGLDNRIPKHVVSVDEKYIHRCLEFIHNTALKAAQFNIPESLRA 118
Query: 121 AKMGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISP-AG-QWIVGSIMGSKSM 178
M + + +N AK NAC S F+FECP++T + VVIS AG QW +G+IMGSKSM
Sbjct: 119 TNMETLSESLNTAKFFGGNACGSGHFVFECPVVTETGRVVISANAGDQWTLGTIMGSKSM 178
Query: 179 INILKSPLLSKLGKLDGDGNFRRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQTWMLG 238
INIL S LL + G N R + +D +G + YDFM SP S K+EK+ ++
Sbjct: 179 INILNSTLLQQFGASHRTDNLNRMNFSDPEGLVCYDFMDSPSDLCISSSYKLEKEKQLVR 238
Query: 239 NHKHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKE 298
+HK+G + HKRL S SST STCSD SS++S + S G++QC W G PHFVFS DD+KE
Sbjct: 239 SHKYGPISVHKRLVSTSSTTSTCSDWLSSTSS-NLSQGMIQCTWKQGIPHFVFSADDQKE 297
Query: 299 VFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDNSK 358
V+V L +S ++K++DY+Y FH KGGQK EI D + VGKM VS SF LCP+N +
Sbjct: 298 VYVTKLRKVDSTDNKALDYVYEFHLNKGGQKGREIPDGDLPFVGKMHVSTSFTLCPNNCR 357
Query: 359 IMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAI 418
I ET+FTLF + E M S A RK+KGLSKKV F+T+ S KRRTLSKFGGSSAI
Sbjct: 358 IRETEFTLFHNIEISDKEMSASEHAHRKSKGLSKKVSKVFKTSPSSKRRTLSKFGGSSAI 417
Query: 419 LENSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFL 478
E+ WEP +LG LL+ +PPN+E+AA++VK HLP + E+VGGWG+KFL
Sbjct: 418 SESCPWEPY--------ALGGTNLLDTDVPPNIEMAAVVVKNHLPCKKPEKVGGWGLKFL 469
Query: 479 KKVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRS 538
K G + + PS S ++ G CSTS ILIPAG HGGPRTRNGGPS +I+RWRS
Sbjct: 470 NKSGENQV------TLPSESCNQNNGNCSTSTSILIPAGLHGGPRTRNGGPSSLIDRWRS 523
Query: 539 GGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVR 598
GG CDCGGWD GCPLTVL R +K + + + D DEC S DL GS +P LRMVNV
Sbjct: 524 GGHCDCGGWDEGCPLTVLERRSNKADVMSKIDTQDECKSVDLVTQGSSDYSPTLRMVNVH 583
Query: 599 DDLYFIHFHSTLSALQSFSIAVASIHSQSPTLRP 632
D LY+IHFH LSALQSFSI VA IH+QSPTL+P
Sbjct: 584 DGLYYIHFHPPLSALQSFSIVVAIIHAQSPTLQP 617
>gi|449433227|ref|XP_004134399.1| PREDICTED: uncharacterized protein LOC101209831 [Cucumis sativus]
gi|449486758|ref|XP_004157393.1| PREDICTED: uncharacterized LOC101209831 [Cucumis sativus]
Length = 634
Score = 533 bits (1374), Expect = e-148, Method: Compositional matrix adjust.
Identities = 287/610 (47%), Positives = 390/610 (63%), Gaps = 11/610 (1%)
Query: 29 SGDNGLDSSELKSRRLVKRVNSQQTLILNVKQLFSSRSSIKNFFCRSTHGLDTKIPKHMV 88
+GD GL + SR L K++ L L +KQ SRS + + F GLD K+PK+MV
Sbjct: 31 NGDLGLSKYSI-SRGLNKKLMLPHALYLKLKQNRISRSYVHDSFFNCNIGLDYKVPKYMV 89
Query: 89 TLDERYLRRCLELIHKSASKAAQCNISVDLGSAKMGFMWDDMNVAKCRTENACDSARFIF 148
T+DE+YLRRCLELI SA KAA+CN S+ L S K G + + ++V K +T RFI
Sbjct: 90 TIDEKYLRRCLELIQTSALKAARCNESISLSSVKTGALTESLSVDKLQTRGMAHMERFII 149
Query: 149 ECPMITGSESVVISPAGQWIVGSIMGSKSMINILKSPLLSKLGKLDGDGNFRRTSVNDVK 208
CP + V+S W VGSIMGSKSMINILKSPLL +LG + N R +ND+K
Sbjct: 150 TCPSAGEDSNTVLSSNKMWFVGSIMGSKSMINILKSPLLHQLGITEETSNLIRMDLNDIK 209
Query: 209 GSISYDFMKSPGGFSGYSPQKIEKQTWMLGNHKHGSETAHKRLASASSTNSTCSDQSSSS 268
G +FM SPG S + ++ +H+ GS+ A++R S S NS CSDQSSS
Sbjct: 210 GFTGSNFMDSPGDVDISSLKNLDNTK--PESHQDGSDAANERFFSTPSRNSLCSDQSSSG 267
Query: 269 ASTST-SLGVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGG 327
+++++ G+LQ W G+P+F+FS+DD KEV+VA+ S S ++ ++DY+Y F K G
Sbjct: 268 SASTSLCQGMLQFTWKDGSPYFIFSVDDEKEVYVASSSKVTSADNNALDYVYLFCSAKSG 327
Query: 328 QKEHEICDNESLLVGKMKVSISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTSRQAARKN 387
K+HE+ ++ +VGKM VS S+ +C +NSKI +T+F LFG E+ + S +KN
Sbjct: 328 LKDHEVRNSRPCIVGKMTVSTSYGVCSNNSKIADTEFVLFGGIENSDLEISPSNTVLKKN 387
Query: 388 KGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILENSSWEPCQN-LGGNDDSLGAACLLENH 446
K +KV + FRT++S K R + SS + + WEP + L +DD + A
Sbjct: 388 KVFPRKVAEVFRTSNSSKHRNIPNLNRSSVMKDFCPWEPYSDKLNSSDDLICA-----RD 442
Query: 447 LPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGC 506
LPPNLELAAI+V++HLPE+ VGGWG+KFLK+ K+T +L TS + R++G C
Sbjct: 443 LPPNLELAAIVVRDHLPEDHGSRVGGWGLKFLKQAKAKQTNNSLDTSVQADCCVRNSGKC 502
Query: 507 STSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAI 566
STSMDILIPAG HGGPRTRNGGPS + ERW+SGG CDCGGWD+GCPLT+L + + +
Sbjct: 503 STSMDILIPAGLHGGPRTRNGGPSTLKERWKSGGVCDCGGWDIGCPLTILEGQSVNDDTL 562
Query: 567 YQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQ 626
QAD EC +F++ G + P LRMVN+RD LYF+HF LS+LQ FSIAVA +HS+
Sbjct: 563 RQADTQ-ECRAFNIHAKGYENSPPTLRMVNIRDGLYFVHFQPKLSSLQCFSIAVAIVHSR 621
Query: 627 SPTLRPKNVQ 636
SP+L+P+NVQ
Sbjct: 622 SPSLKPRNVQ 631
>gi|15226101|ref|NP_180894.1| uncharacterized protein [Arabidopsis thaliana]
gi|2459419|gb|AAB80654.1| hypothetical protein [Arabidopsis thaliana]
gi|219291116|gb|ACL13991.1| At2g33360 [Arabidopsis thaliana]
gi|330253728|gb|AEC08822.1| uncharacterized protein [Arabidopsis thaliana]
Length = 603
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 266/649 (40%), Positives = 370/649 (57%), Gaps = 76/649 (11%)
Query: 5 LGNTRNDGQGKEDSL-DSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNV-KQLF 62
+G+ D Q + L DSL + +SG NGL LKS L+ + Q +L+ N K L
Sbjct: 1 MGSVSPDAQKDDLELEDSLNIDENLSGGNGL----LKSGNLLSQKRQQCSLVSNSGKWLQ 56
Query: 63 SSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLGSAK 122
++ K F R+T D KIP+ +V++DE+YLRRCL+LI+ SA K+A C++S++L +K
Sbjct: 57 IVKNGPKGSFSRATSFFDNKIPRQLVSVDEKYLRRCLDLINISAFKSASCSLSLNLVGSK 116
Query: 123 MGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISPAGQWIVGSIMGSKSMINIL 182
M ++ EN AR +F+ P++ S S VIS A I+G K + +L
Sbjct: 117 MSLSSRRFDLPVIPKENV---ARLVFDLPLVDDSGSPVISRA-------IIGCKRVTYML 166
Query: 183 KSPLLSKLGKLDGDGNF--RRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQ----TWM 236
P L L LDGDGN R N +SYD ++ KIE++ +
Sbjct: 167 DKPTLHSLEALDGDGNNIGTRRDHNGESELVSYDVVR-----------KIEEKNTPDSQF 215
Query: 237 LGNH--KHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQ-CMWDGGNPHFVFSI 293
L H + ++ S++S+ SS S S+S S G LQ M D PHFVFS+
Sbjct: 216 LSYHSGNMPRKAVSLSSTNSLSSSSSEQSSSSWSPSSSVSQGTLQFTMKDNKTPHFVFSL 275
Query: 294 DDRKEVFVANLSNSESG---NSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISF 350
DD+KE++VA+LS + G + S+DY Y H +KG E + LVGK+KVS F
Sbjct: 276 DDQKEIYVASLSTTSVGSGFDRSSLDYSYLIHLKKGRGSEPQ------HLVGKLKVSTLF 329
Query: 351 NLCPDNSKIMETQFTLFGSNEDFGGGMQTS-RQAARKNKGLSKKVVDAFRTT-HSFKRRT 408
++ N K +E QF LF S GG Q + RKN+GL KKVVDA ++T + ++R+
Sbjct: 330 SVSSTNEKTVERQFVLFSS----GGNPQLPCHKDIRKNRGLPKKVVDALKSTKRTSRQRS 385
Query: 409 LSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQE 468
+S+F +S+I + SWEP Q +D L LL+N LPPNLE +A++V+E P +E
Sbjct: 386 ISRFSRTSSIPDFCSWEPFQE---HDSDLEPISLLDNDLPPNLETSAVVVREQFPVEEEE 442
Query: 469 -----EVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPR 523
+VGGWGMKFLKK+ + +T + K S STS+D++IP G HGGPR
Sbjct: 443 EEEVEKVGGWGMKFLKKIPLARTKDASKCSK-----------HSTSIDVVIPLGIHGGPR 491
Query: 524 TRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVN 583
RNGGPS +I+RW+SGGCCDC GWDLGCPLTVL + K ++ +CN F+LF
Sbjct: 492 NRNGGPSSLIQRWKSGGCCDCSGWDLGCPLTVLKGQARKDQS------EGQCNLFELFTE 545
Query: 584 GSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSPTLRP 632
G QG+P LR++NVRD LYF+ + +S LQSFSIA+A IHSQS LRP
Sbjct: 546 GLMQGSPGLRIMNVRDGLYFVQSQAKISVLQSFSIALAYIHSQSQRLRP 594
>gi|297823129|ref|XP_002879447.1| hypothetical protein ARALYDRAFT_902408 [Arabidopsis lyrata subsp.
lyrata]
gi|297325286|gb|EFH55706.1| hypothetical protein ARALYDRAFT_902408 [Arabidopsis lyrata subsp.
lyrata]
Length = 601
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 260/641 (40%), Positives = 366/641 (57%), Gaps = 62/641 (9%)
Query: 5 LGNTRNDGQGKEDSL-DSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNV-KQLF 62
+G+ D Q + L DSL + +SG+NGL LKS L+ Q +L+ N K L
Sbjct: 1 MGSVTPDAQKDDPELEDSLNIDENLSGENGL----LKSVNLLSPKRQQCSLVSNSGKWLQ 56
Query: 63 SSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLGSAK 122
++ K F R+T D KIP+ +V++DE+YLRRCL+LI+ SA K+A C++S++L +K
Sbjct: 57 IVKNGPKGSFSRATSFFDNKIPRQLVSIDEKYLRRCLDLINISAFKSASCSLSLNLVGSK 116
Query: 123 MGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISPAGQWIVGSIMGSKSMINIL 182
M + EN AR +F+ P++ S S VIS A I+G K + +L
Sbjct: 117 MSLSSRHFDSPVIPKENV---ARLVFDLPLVDDSGSPVISRA-------IIGCKRVTYML 166
Query: 183 KSPLLSKLGKLDGDG--NFRRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQTWMLGNH 240
PLL L LDGDG + R N +S D ++ ++ Q++ GN
Sbjct: 167 DKPLLHDLEGLDGDGMNSGTRRDHNGENELVSCDVVRK---IEEKDTPDLQFQSYQSGNM 223
Query: 241 KHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQ-CMWDGGNPHFVFSIDDRKEV 299
+ ++ S++S+ S S S+S S G LQ M D PHFVFS+DD+KE+
Sbjct: 224 PQ--KAVSLSSTNSMSSSSSEQSSSGWSPSSSVSQGTLQFIMKDNKTPHFVFSLDDQKEI 281
Query: 300 FVANLSNSESG---NSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDN 356
+VA+L+ + G + S+DY Y H +KG E + LVGK+KVS F++ N
Sbjct: 282 YVASLTTTSVGSGFDRSSLDYSYLIHLKKGRGSELQ------HLVGKLKVSTLFSVSSTN 335
Query: 357 SKIMETQFTLFGSNEDFGGGMQTS-RQAARKNKGLSKKVVDAFRTT-HSFKRRTLSKFGG 414
K +E QF LF S GG Q RKN+GL KKVVDA ++T + ++R++S+F
Sbjct: 336 EKTVERQFVLFSS----GGNPQLQCHNDIRKNRGLPKKVVDALKSTKRTSRQRSISRFSR 391
Query: 415 SSAILENSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQE---EVG 471
+S+I E SWEP Q +++ L LL+N LPPNLE +A++V+E P +E +VG
Sbjct: 392 TSSIPEFCSWEPFQE---HENDLEPVSLLDNDLPPNLETSAVVVREQFPVEEEEEVEKVG 448
Query: 472 GWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSG 531
GWGMKFLKK+ + + + K S STS+D++IP+G HGGPR RNGGPS
Sbjct: 449 GWGMKFLKKMPLARAKDASKNSK-----------LSTSIDVVIPSGIHGGPRNRNGGPSS 497
Query: 532 IIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPM 591
+I+RW+SGGCCDC GWDLGCPLTVL R K + +CN F+LF G QG+P
Sbjct: 498 LIQRWKSGGCCDCSGWDLGCPLTVLKGRARKDQN------EGQCNLFELFTEGMMQGSPG 551
Query: 592 LRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSPTLRP 632
LR++NVRD L+F+ + +S LQSFSIA+A IHSQS LRP
Sbjct: 552 LRIMNVRDGLFFVQSQAKISVLQSFSIALAYIHSQSQRLRP 592
>gi|356533808|ref|XP_003535450.1| PREDICTED: uncharacterized protein LOC100802721 [Glycine max]
Length = 409
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 193/414 (46%), Positives = 262/414 (63%), Gaps = 7/414 (1%)
Query: 1 MENKLGNTRNDGQGKEDSLDSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNVKQ 60
MEN + +G+ ++DSL L L + +SS+ + ++L+KR NS + +++N Q
Sbjct: 1 MENTCLWIQKEGRRRKDSLHGL---LNDREISEEESSKSQLKKLIKRSNSHRAIVVNAIQ 57
Query: 61 LFSSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIHKSASKAAQCNISVDLGS 120
S+ K+ F R HGLD +IPKH+V++DE+Y+RRCLE IH +A KAAQ NI L +
Sbjct: 58 SQISKVFFKDLFFRRVHGLDNRIPKHVVSVDEKYIRRCLEFIHNTALKAAQFNIPESLRA 117
Query: 121 AKMGFMWDDMNVAKCRTENACDSARFIFECPMITGSESVVISP-AG-QWIVGSIMGSKSM 178
M + + +N K NAC F+FECP+ T + VVIS AG QW +GSIMGSKSM
Sbjct: 118 TNMETLSESLNTVKFFGGNACGLGHFVFECPVATENGRVVISANAGDQWTLGSIMGSKSM 177
Query: 179 INILKSPLLSKLGKLDGDGNFRRTSVNDVKGSISYDFMKSPGGFSGYSPQKIEKQTWMLG 238
INIL S LL + G N R + +D +G + YDFM SP S K+ K+ ++
Sbjct: 178 INILNSTLLQQFGASHRTDNLNRMNFSDPEGLVCYDFMDSPSDLCISSSHKLGKEKQLVR 237
Query: 239 NHKHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKE 298
+HK+GS + HKRL S SST ST SD SS++ST + G++QC W G PHFVFS DD+KE
Sbjct: 238 SHKYGSISVHKRLVSTSSTTSTSSDWLSSTSSTLSQ-GMIQCSWKQGIPHFVFSADDQKE 296
Query: 299 VFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDNSK 358
V+VA L +S ++K++DY+Y FH KGGQK EI D + VGKM VS SF LCP+N +
Sbjct: 297 VYVAKLKKVDSTDNKALDYVYEFHLNKGGQKGREIPDGDFPFVGKMHVSTSFTLCPNNCR 356
Query: 359 IMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKF 412
I ET+FTLF + E M TS + RK+KGLS KV F+T+ S KRRTLS++
Sbjct: 357 IRETEFTLFHNMEISDKEMSTSEHSHRKSKGLS-KVSKVFKTSPSSKRRTLSRY 409
>gi|145330350|ref|NP_001078000.1| uncharacterized protein [Arabidopsis thaliana]
gi|330253729|gb|AEC08823.1| uncharacterized protein [Arabidopsis thaliana]
Length = 448
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 175/370 (47%), Positives = 232/370 (62%), Gaps = 41/370 (11%)
Query: 274 SLGVLQ-CMWDGGNPHFVFSIDDRKEVFVANLSNSESG---NSKSMDYMYSFHWRKGGQK 329
S G LQ M D PHFVFS+DD+KE++VA+LS + G + S+DY Y H +KG
Sbjct: 100 SQGTLQFTMKDNKTPHFVFSLDDQKEIYVASLSTTSVGSGFDRSSLDYSYLIHLKKGRGS 159
Query: 330 EHEICDNESLLVGKMKVSISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTS-RQAARKNK 388
E + LVGK+KVS F++ N K +E QF LF S GG Q + RKN+
Sbjct: 160 EPQ------HLVGKLKVSTLFSVSSTNEKTVERQFVLFSS----GGNPQLPCHKDIRKNR 209
Query: 389 GLSKKVVDAFRTT-HSFKRRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLLENHL 447
GL KKVVDA ++T + ++R++S+F +S+I + SWEP Q +D L LL+N L
Sbjct: 210 GLPKKVVDALKSTKRTSRQRSISRFSRTSSIPDFCSWEPFQE---HDSDLEPISLLDNDL 266
Query: 448 PPNLELAAIIVKEHLPENRQE-----EVGGWGMKFLKKVGIKKTVETLKTSAPSASFARD 502
PPNLE +A++V+E P +E +VGGWGMKFLKK+ + +T + K S S
Sbjct: 267 PPNLETSAVVVREQFPVEEEEEEEVEKVGGWGMKFLKKIPLARTKDASKCSKHS------ 320
Query: 503 TGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSK 562
TS+D++IP G HGGPR RNGGPS +I+RW+SGGCCDC GWDLGCPLTVL + K
Sbjct: 321 -----TSIDVVIPLGIHGGPRNRNGGPSSLIQRWKSGGCCDCSGWDLGCPLTVLKGQARK 375
Query: 563 KEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVAS 622
++ +CN F+LF G QG+P LR++NVRD LYF+ + +S LQSFSIA+A
Sbjct: 376 DQS------EGQCNLFELFTEGLMQGSPGLRIMNVRDGLYFVQSQAKISVLQSFSIALAY 429
Query: 623 IHSQSPTLRP 632
IHSQS LRP
Sbjct: 430 IHSQSQRLRP 439
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 59/101 (58%), Gaps = 6/101 (5%)
Query: 5 LGNTRNDGQGKEDSL-DSLGKELEVSGDNGLDSSELKSRRLVKRVNSQQTLILNV-KQLF 62
+G+ D Q + L DSL + +SG NGL LKS L+ + Q +L+ N K L
Sbjct: 1 MGSVSPDAQKDDLELEDSLNIDENLSGGNGL----LKSGNLLSQKRQQCSLVSNSGKWLQ 56
Query: 63 SSRSSIKNFFCRSTHGLDTKIPKHMVTLDERYLRRCLELIH 103
++ K F R+T D KIP+ +V++DE+YLRRCL+LI+
Sbjct: 57 IVKNGPKGSFSRATSFFDNKIPRQLVSVDEKYLRRCLDLIN 97
>gi|110743347|dbj|BAE99561.1| hypothetical protein [Arabidopsis thaliana]
Length = 341
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 228/362 (62%), Gaps = 40/362 (11%)
Query: 281 MWDGGNPHFVFSIDDRKEVFVANLSNSESG---NSKSMDYMYSFHWRKGGQKEHEICDNE 337
M D PHFVFS+DD+KE++VA+LS + G + S+DY Y H +KG E +
Sbjct: 1 MKDNKTPHFVFSLDDQKEIYVASLSTTSVGSGFDRSSLDYSYLIHLKKGRGSEPQ----- 55
Query: 338 SLLVGKMKVSISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTS-RQAARKNKGLSKKVVD 396
LVGK+KVS F++ N K +E QF LF S GG Q + RKN+GL KKVVD
Sbjct: 56 -HLVGKLKVSTLFSVSSTNEKTVERQFVLFSS----GGNPQLPCHKDIRKNRGLPKKVVD 110
Query: 397 AFRTT-HSFKRRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAA 455
A ++T + ++R++S+F +S+I + SWEP Q +D L LL+N LPPNLE +A
Sbjct: 111 ALKSTKRTSRQRSISRFSRTSSIPDFCSWEPFQE---HDSDLEPISLLDNDLPPNLETSA 167
Query: 456 IIVKEHLPENRQE-----EVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSM 510
++V+E P +E +VGGWGMKFLKK+ + +T + K S S TS+
Sbjct: 168 VVVREQFPVEEEEEEEVEKVGGWGMKFLKKIPLARTKDASKCSKHS-----------TSI 216
Query: 511 DILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQAD 570
D++IP G HGGPR RNGGPS +I+RW+SGGCCDC GWDLGCPLTVL + K ++
Sbjct: 217 DVVIPLGIHGGPRNRNGGPSSLIQRWKSGGCCDCSGWDLGCPLTVLKGQARKDQS----- 271
Query: 571 MHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSPTL 630
+CN F+LF G QG+P LR++NVRD LYF+ + +S LQSFSIA+A IHSQS L
Sbjct: 272 -EGQCNLFELFTEGLMQGSPGLRIMNVRDGLYFVQSQAKISVLQSFSIALAYIHSQSQRL 330
Query: 631 RP 632
RP
Sbjct: 331 RP 332
>gi|115450625|ref|NP_001048913.1| Os03g0139200 [Oryza sativa Japonica Group]
gi|108706091|gb|ABF93886.1| expressed protein [Oryza sativa Japonica Group]
gi|113547384|dbj|BAF10827.1| Os03g0139200 [Oryza sativa Japonica Group]
gi|215695076|dbj|BAG90267.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215697573|dbj|BAG91567.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222624164|gb|EEE58296.1| hypothetical protein OsJ_09342 [Oryza sativa Japonica Group]
Length = 582
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 153/394 (38%), Positives = 228/394 (57%), Gaps = 19/394 (4%)
Query: 246 TAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKEVFVANLS 305
+ H+++ S+ + + + + SS +S S G+LQC+W G PHF S+D+ V+ AN +
Sbjct: 201 SMHEKMVSSDTPSMSSNQPVLSSCPSSLSAGLLQCIWKDGLPHFELSLDNPMAVYTANPT 260
Query: 306 NSESGNSKSMDYMYSFHWRKGGQKEHEICDNESL--LVGKMKVSISFNLCPDNSKIMETQ 363
+ N K +DY+Y FH + G+K+ + ++ S+ LVGKMKVS S L PD S MET+
Sbjct: 261 KAHD-NDKPLDYVYLFHSGEQGRKDW-LGNSSSVSRLVGKMKVSSSLVLNPDKSTSMETE 318
Query: 364 FTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTH-SFKRRTLSKFGGSSA--ILE 420
F L+GS +D+ MQ+S K KGL+K+V D + ++ + + + KFG SS+ I E
Sbjct: 319 FVLYGSPDDYLRQMQSS-YGVTKGKGLAKRVADIMKPSNLNSSPKHVWKFGKSSSQQIDE 377
Query: 421 NSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEV-GGWGMKFLK 479
+ + + L L+ + LP N E+AAI+V++ E R+ V GGWG+KFL+
Sbjct: 378 MTEIPEGEQCSAKESVLKN--LVADDLPTNQEIAAIVVRKQRRERRKSPVLGGWGLKFLE 435
Query: 480 KVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSG 539
K G T ++ G S+ + P G+HGG ++NG P+ +I RWRSG
Sbjct: 436 KAG---ATHPGSTEDGDVQNKKNNAG---SVSAIFPRGYHGGAASKNGSPASLIGRWRSG 489
Query: 540 GCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRD 599
G CDCGGWD+GCP+ VL ++ D S +L V G+K+G PM R+VN+ D
Sbjct: 490 GRCDCGGWDIGCPIRVLQNDGCGTSPQAESQSQDR-KSVELSVKGAKKG-PMFRLVNITD 547
Query: 600 DLYFIHFHSTLSALQSFSIAVASIHSQSPTLRPK 633
DL+ I+F S+LS LQ FS +A IHSQ+P L PK
Sbjct: 548 DLHIIYFDSSLSPLQCFSAGIAIIHSQAPHLYPK 581
>gi|218192057|gb|EEC74484.1| hypothetical protein OsI_09942 [Oryza sativa Indica Group]
Length = 582
Score = 249 bits (637), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 153/394 (38%), Positives = 228/394 (57%), Gaps = 19/394 (4%)
Query: 246 TAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQCMWDGGNPHFVFSIDDRKEVFVANLS 305
+ H+++ S+ + + + + SS +S S G+LQC+W G PHF S+D+ V+ AN +
Sbjct: 201 SMHEKMVSSDTPSMSSNQPVLSSCPSSLSAGLLQCIWKDGLPHFELSLDNPMAVYTANPT 260
Query: 306 NSESGNSKSMDYMYSFHWRKGGQKEHEICDNESL--LVGKMKVSISFNLCPDNSKIMETQ 363
+ N K +DY+Y FH + G+K+ + ++ S+ LVGKMKVS S L PD S MET+
Sbjct: 261 KAHD-NDKPLDYVYLFHSGEQGRKDG-LGNSSSVSRLVGKMKVSSSLVLNPDKSTSMETE 318
Query: 364 FTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTH-SFKRRTLSKFGGSSA--ILE 420
F L+GS +D+ MQ+S K KGL+K+V D + ++ + + + KFG SS+ I E
Sbjct: 319 FVLYGSPDDYLRQMQSS-YGVTKGKGLAKRVADIMKPSNLNSSPKHVWKFGKSSSQQIDE 377
Query: 421 NSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEV-GGWGMKFLK 479
+ + + L L+ + LP N E+AAI+V++ E R+ V GGWG+KFL+
Sbjct: 378 MTEIPEGEQCSAKESVLKN--LVADDLPTNQEIAAIVVQKQRRERRKSPVLGGWGLKFLE 435
Query: 480 KVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSG 539
K G T ++ G S+ + P G+HGG ++NG P+ +I RWRSG
Sbjct: 436 KAG---ATHPGSTEDGDVQNKKNNAG---SVSAIFPRGYHGGAASKNGSPASLIGRWRSG 489
Query: 540 GCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRD 599
G CDCGGWD+GCP+ VL ++ D S +L V G+K+G PM R+VN+ D
Sbjct: 490 GRCDCGGWDIGCPIRVLQNDGCGTSPQAESQSQDR-KSVELSVKGAKKG-PMFRLVNITD 547
Query: 600 DLYFIHFHSTLSALQSFSIAVASIHSQSPTLRPK 633
DL+ I+F S+LS LQ FS +A IHSQ+P L PK
Sbjct: 548 DLHIIYFDSSLSPLQCFSAGIAIIHSQAPHLYPK 581
>gi|242036993|ref|XP_002465891.1| hypothetical protein SORBIDRAFT_01g047670 [Sorghum bicolor]
gi|241919745|gb|EER92889.1| hypothetical protein SORBIDRAFT_01g047670 [Sorghum bicolor]
Length = 551
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 157/429 (36%), Positives = 236/429 (55%), Gaps = 53/429 (12%)
Query: 230 IEKQTWMLGNHKHGSETAHKRLASAS---STNSTCSDQSS-----SSASTSTSLGVLQCM 281
I +TW +H G+ +K A S + + CS S S +S S G+LQC+
Sbjct: 150 ISPETWETPSH--GASPMNKGTAEVSIHENKSPLCSSMGSNQPVLSQCPSSLSAGLLQCI 207
Query: 282 WDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKE---HEICDNES 338
W G PHF S+DD V+ AN + + K++DY+Y H + G+K+ H N S
Sbjct: 208 WKNGLPHFELSLDDPIAVYAANPIRVQEKD-KALDYVYMIHSGEQGRKDWMGHS--SNVS 264
Query: 339 LLVGKMKVSISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAF 398
LVGKMKVS S L D S ++ET+F L+GS +D+ M S + K KGL K+V +
Sbjct: 265 RLVGKMKVSSSLVLNSDKSSLLETEFVLYGSPDDYLKQMH-SPYSVPKGKGLVKRVTEIM 323
Query: 399 RT-------THSFK--RRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLLENHLPP 449
RT HS+K + + +F + ILE + + + L L + LP
Sbjct: 324 RTGNVSSSPKHSWKFGKSSSHQFDDLTEILEG------EMISARESDL--MNLNTDDLPT 375
Query: 450 NLELAAIIVKEHLPENRQEEV-GGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCST 508
N ELAAI+V+E + ++E V GGWG+KFL+K + T +T +++ G +
Sbjct: 376 NQELAAIVVREQRHKRQKEPVLGGWGLKFLEKAELNHPEGTKETDV------QNSNGATQ 429
Query: 509 SMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLG----TRLSKKE 564
+ +++P G+HGG +++ PSG+I+RWRSGGCCDCGGWDLGCP+ VL L + E
Sbjct: 430 CITVVVPRGYHGGAVSKSSRPSGLIKRWRSGGCCDCGGWDLGCPIRVLNGDGCGSLPEGE 489
Query: 565 AIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
+ + S +L + G+K ML++VN+ +DLY ++F+S++S LQ FS +A +H
Sbjct: 490 S-------QDSRSVELSIKGAKNET-MLKLVNITEDLYILYFNSSISPLQCFSTGIAIVH 541
Query: 625 SQSPTLRPK 633
SQ+P L PK
Sbjct: 542 SQAPQLCPK 550
>gi|356535450|ref|XP_003536258.1| PREDICTED: uncharacterized protein LOC100811792 [Glycine max]
Length = 181
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 109/184 (59%), Positives = 130/184 (70%), Gaps = 6/184 (3%)
Query: 453 LAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDI 512
+A I+VK HLP + E+VGGWG+KFL K G + + PS S ++ G CS+S I
Sbjct: 1 MATIVVKNHLPCKKPEKVGGWGLKFLNKSGENQI------TLPSESSNQNNGDCSSSTSI 54
Query: 513 LIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMH 572
LIPAGFHGGPRTRNGGPS +I+RW+SGG CDCGGWD GCPLTVL R +K E + + D
Sbjct: 55 LIPAGFHGGPRTRNGGPSSLIDRWKSGGHCDCGGWDEGCPLTVLERRSNKAEVMSKIDTG 114
Query: 573 DECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSPTLRP 632
D C S DL + GS P LRMVNV D LY+IHFH LSALQSFSIAVA IH+QSPTL+P
Sbjct: 115 DGCKSVDLVIQGSSDYGPTLRMVNVHDGLYYIHFHPPLSALQSFSIAVAIIHAQSPTLQP 174
Query: 633 KNVQ 636
+
Sbjct: 175 NGAK 178
>gi|357114119|ref|XP_003558848.1| PREDICTED: uncharacterized protein LOC100838518 [Brachypodium
distachyon]
Length = 587
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 149/368 (40%), Positives = 212/368 (57%), Gaps = 35/368 (9%)
Query: 279 QCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKE-HEICDNE 337
QC+W G PHF S+D V+ AN + N K++DY+Y FH + G+K+ N
Sbjct: 241 QCIWKDGLPHFELSLDSPMAVYAANPIKAHD-NDKALDYVYLFHSGEPGKKDWMGNSSNV 299
Query: 338 SLLVGKMKVSISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDA 397
S LVGKMKVS S L M+T+F L+GS +D+ MQ++ + K KGL ++V +
Sbjct: 300 SKLVGKMKVSSSLVLSSGQPSSMQTEFVLYGSPDDYLRQMQST-YSVTKGKGLVQRVAEI 358
Query: 398 FRTTHSFKRRTLS-KFGGSSAILENSSWEPCQNLGGNDDSLGAACLLE---NHLPPNLEL 453
++ + + + KF S + E LGG ++ + L+ + LP N E+
Sbjct: 359 MKSPNVYSSPKHAWKFSKPSPQQFDDMRE---MLGGEVTNIAESSLVSLTADDLPTNQEI 415
Query: 454 AAIIVKEHLPENRQEE--VGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMD 511
AAI+V+E E RQEE VGGWG+KFL+K G + E+ +D+ G + S+
Sbjct: 416 AAIVVREQRHE-RQEEPAVGGWGLKFLEKAGAVHSEES-----------KDSKGAARSIT 463
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVL----GTRLSKKEAIY 567
++P G+HGG ++NG PSG+IERWR+GG C+CGGWDLGCP+ VL G LS + A Y
Sbjct: 464 AIVPRGYHGGVASKNG-PSGLIERWRTGGRCNCGGWDLGCPIRVLNNDGGGSLSPQSAAY 522
Query: 568 QADMHDECNSFDLFVNGSKQG-APMLRMVNVRDDLYFIHFHST-LSALQSFSIAVASIHS 625
E S +L + G ++ P+LR+VNV +DLY ++F ST LS LQ FS VA IHS
Sbjct: 523 S----QESKSAELSMMGVRRNHEPVLRLVNVAEDLYILYFESTLLSPLQCFSTGVAMIHS 578
Query: 626 QSPTLRPK 633
Q+P L PK
Sbjct: 579 QAPQLYPK 586
>gi|226504758|ref|NP_001145256.1| uncharacterized protein LOC100278544 [Zea mays]
gi|195653775|gb|ACG46355.1| hypothetical protein [Zea mays]
Length = 580
Score = 226 bits (575), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 149/426 (34%), Positives = 228/426 (53%), Gaps = 39/426 (9%)
Query: 226 SPQKIEKQTWMLGNHKHGSETAHKRLASASSTNSTCSDQSS-----SSASTSTSLGVLQC 280
SP+ E ++ + G TA + CS SS S +S S +LQC
Sbjct: 175 SPETWEASSFPGASPMIGGGTAEDGVHGNKPPPPCCSSVSSNQHVLSRCPSSLSACLLQC 234
Query: 281 MWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKE---HEICDNE 337
+W G PHF S+DD A ++ DY+Y H + G+K+ H N
Sbjct: 235 IWKDGLPHFELSLDDHPIAVYAASPVRVLEKDRAFDYVYMIHSGEQGRKDWMGHS--SNV 292
Query: 338 SLLVGKMKVSISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDA 397
S LVGK+KVS S L PD S ++ET+F L+GS +D+ M+ S + K KGL ++V +
Sbjct: 293 SRLVGKVKVSSSLVLNPDKSGLLETEFVLYGSPDDYLRQMR-SPYSVPKGKGLVRRVTEI 351
Query: 398 FRT-------THSFK--RRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLLENHLP 448
R H +K + + +F + +LE + GG+D L + LP
Sbjct: 352 MRAGNVSASPRHPWKLGKSSSHQFDDLAGVLEGEMI----SAGGSD----LVSLKADDLP 403
Query: 449 PNLELAAIIVKEHLPENRQEEV-GGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCS 507
N ELAAI+V+EH R+E V GGWG+KFL+K G+ + T +T +++ G +
Sbjct: 404 SNQELAAIVVREHRRRRRKEPVSGGWGLKFLEKAGLNRPQGTEETDV------QNSDGAT 457
Query: 508 TSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIY 567
+ +++P G+HGG +++ PSG++ERWRSGG CDCGGWDLGCP+ VL ++
Sbjct: 458 HCVTVVVPRGYHGGVVSKSSRPSGLVERWRSGGRCDCGGWDLGCPIRVLNG--DGCGSLP 515
Query: 568 QADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQS 627
+ + D S ++ + G+K ML++VN+ +DLY ++F S++S LQ FS +A +HS++
Sbjct: 516 EGESRDR-RSVEISIKGAKNET-MLKLVNITEDLYILYFDSSISPLQCFSTGIAIVHSRA 573
Query: 628 PTLRPK 633
P L PK
Sbjct: 574 PELCPK 579
>gi|223943833|gb|ACN26000.1| unknown [Zea mays]
gi|414864730|tpg|DAA43287.1| TPA: hypothetical protein ZEAMMB73_073241 [Zea mays]
Length = 580
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 148/426 (34%), Positives = 227/426 (53%), Gaps = 39/426 (9%)
Query: 226 SPQKIEKQTWMLGNHKHGSETAHKRLASASSTNSTCSDQSS-----SSASTSTSLGVLQC 280
SP+ E ++ + G TA + CS SS S +S S +LQC
Sbjct: 175 SPETWEASSFPGASPMIGGGTAEDGVHGNKPPPPCCSSVSSNQHVLSRCPSSLSACLLQC 234
Query: 281 MWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKE---HEICDNE 337
+W G PHF S+DD A ++ DY+Y H + G+K+ H N
Sbjct: 235 IWKDGLPHFELSLDDHPIAVYAASPVRVLEKDRAFDYVYMIHSGEQGRKDWMGHS--SNV 292
Query: 338 SLLVGKMKVSISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDA 397
S LVGK+KVS S L PD S ++ET+F L+GS +D+ M+ S + K KGL ++V +
Sbjct: 293 SRLVGKVKVSSSLVLNPDKSGLLETEFVLYGSPDDYLRQMR-SPYSVPKGKGLVRRVTEI 351
Query: 398 FRT-------THSFK--RRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLLENHLP 448
R H +K + + +F + +LE + GG+D L + LP
Sbjct: 352 MRAGNVSASPRHPWKLGKSSSHQFDDLAGVLEGEMI----SAGGSD----LVSLKADDLP 403
Query: 449 PNLELAAIIVKEHLPENRQEEV-GGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCS 507
N ELAAI+V+E R+E V GGWG+KFL+K G+ + T +T +++ G +
Sbjct: 404 SNQELAAIVVREQRRRRRKEPVSGGWGLKFLEKAGLNRPQGTEETDV------QNSDGAT 457
Query: 508 TSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIY 567
+ +++P G+HGG +++ PSG++ERWRSGG CDCGGWDLGCP+ VL ++
Sbjct: 458 HCVTVVVPRGYHGGVVSKSSRPSGLVERWRSGGRCDCGGWDLGCPIRVLNG--DGCGSLP 515
Query: 568 QADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQS 627
+ + D S ++ + G+K ML++VN+ +DLY ++F S++S LQ FS +A +HS++
Sbjct: 516 EGESRDR-RSVEISIKGAKNET-MLKLVNITEDLYILYFDSSISPLQCFSTGIAIVHSRA 573
Query: 628 PTLRPK 633
P L PK
Sbjct: 574 PELCPK 579
>gi|2341027|gb|AAB70425.1| EST gb|T21788 comes from this gene [Arabidopsis thaliana]
Length = 412
Score = 216 bits (549), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 153/399 (38%), Positives = 219/399 (54%), Gaps = 82/399 (20%)
Query: 239 NHKHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQ-CMWDGGNPHFVFSIDDRK 297
N+ SE HK++ S SS + S G LQ M G PHFVF ++++K
Sbjct: 74 NYVAESEKMHKKVISFSSK--------------TFSQGTLQFTMRANGTPHFVFKLENQK 119
Query: 298 EVFVANLS-NSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDN 356
+V+VA+LS N + NS Y H ++G E + S LVG++ VS F+
Sbjct: 120 DVYVASLSSNVQDQNS------YMIHLQRG-----ESSASSSHLVGRINVSTLFS----- 163
Query: 357 SKIMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSS 416
K++E +F LF SN G ++ R RKN+GLSKKVV A + +RRT
Sbjct: 164 EKVLEREFVLFSSN---GENLKIPR--TRKNRGLSKKVVHAVKN----ERRT-------- 206
Query: 417 AILENSSWEPCQNLGGNDDSLGAA---CLLENHLPPNLELAAIIVKEHLPENRQEEVGGW 473
A L +S+ P +LG D+ A CLL+N LP NLE A++VK+ E ++E+GGW
Sbjct: 207 ARLSRTSFIP--DLGSWDEQFQAQNYDCLLKNKLPTNLETLAVVVKQ---ETIEDEIGGW 261
Query: 474 GMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGII 533
G+KFLK+ + + +++ S + +T S SM+++IP+G HGGP GPS +I
Sbjct: 262 GLKFLKRSPMFQ-----RSNDASET---ETSTSSISMNVVIPSGIHGGPE---DGPSSLI 310
Query: 534 ERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLR 593
ERW+S G CDCGGWDL C LT+L + K + F+LF+ GSK L+
Sbjct: 311 ERWKSQGNCDCGGWDLCCSLTLLKGQPRKDQY------------FELFIEGSKHETTGLK 358
Query: 594 MVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSPTLRP 632
+VNV LY + F + L++LQSF+IA+A IHS+ LRP
Sbjct: 359 IVNVSGGLYLVQFEAKLTSLQSFAIALAFIHSEK--LRP 395
>gi|42561689|ref|NP_171943.2| uncharacterized protein [Arabidopsis thaliana]
gi|145323734|ref|NP_001077456.1| uncharacterized protein [Arabidopsis thaliana]
gi|51971817|dbj|BAD44573.1| unknown protein [Arabidopsis thaliana]
gi|111074328|gb|ABH04537.1| At1g04490 [Arabidopsis thaliana]
gi|332189582|gb|AEE27703.1| uncharacterized protein [Arabidopsis thaliana]
gi|332189583|gb|AEE27704.1| uncharacterized protein [Arabidopsis thaliana]
Length = 401
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 153/400 (38%), Positives = 219/400 (54%), Gaps = 84/400 (21%)
Query: 239 NHKHGSETAHKRLASASSTNSTCSDQSSSSASTSTSLGVLQ-CMWDGGNPHFVFSIDDRK 297
N+ SE HK++ S SS + S G LQ M G PHFVF ++++K
Sbjct: 74 NYVAESEKMHKKVISFSSK--------------TFSQGTLQFTMRANGTPHFVFKLENQK 119
Query: 298 EVFVANLS-NSESGNSKSMDYMYSFHWRKGGQKEHEICDNESLLVGKMKVSISFNLCPDN 356
+V+VA+LS N + NS Y H ++G E + S LVG++ VS F+
Sbjct: 120 DVYVASLSSNVQDQNS------YMIHLQRG-----ESSASSSHLVGRINVSTLFS----- 163
Query: 357 SKIMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSS 416
K++E +F LF SN G ++ R RKN+GLSKKVV A + +RRT
Sbjct: 164 EKVLEREFVLFSSN---GENLKIPR--TRKNRGLSKKVVHAVKN----ERRT-------- 206
Query: 417 AILENSSWEPCQNLGGNDDSLGAA---CLLENHLPPNLELAAIIVKEHLPENRQEEVGGW 473
A L +S+ P +LG D+ A CLL+N LP NLE A++VK+ E ++E+GGW
Sbjct: 207 ARLSRTSFIP--DLGSWDEQFQAQNYDCLLKNKLPTNLETLAVVVKQ---ETIEDEIGGW 261
Query: 474 GMKFLKKVGI-KKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGI 532
G+KFLK+ + +++ + +T +T S SM+++IP+G HGGP GPS +
Sbjct: 262 GLKFLKRSPMFQRSNDASET---------ETSTSSISMNVVIPSGIHGGPE---DGPSSL 309
Query: 533 IERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPML 592
IERW+S G CDCGGWDL C LT+L + K + F+LF+ GSK L
Sbjct: 310 IERWKSQGNCDCGGWDLCCSLTLLKGQPRKDQY------------FELFIEGSKHETTGL 357
Query: 593 RMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSPTLRP 632
++VNV LY + F + L++LQSF+IA+A IHS+ LRP
Sbjct: 358 KIVNVSGGLYLVQFEAKLTSLQSFAIALAFIHSEK--LRP 395
>gi|297848678|ref|XP_002892220.1| hypothetical protein ARALYDRAFT_470427 [Arabidopsis lyrata subsp.
lyrata]
gi|297338062|gb|EFH68479.1| hypothetical protein ARALYDRAFT_470427 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 135/364 (37%), Positives = 195/364 (53%), Gaps = 56/364 (15%)
Query: 265 SSSSASTSTSLGVLQ-CMWDGGNPHFVFSIDDRKEVFVANLS-NSESGNSKSMDYMYSFH 322
SS + S+ S G+LQ + G PHFVF ++++K+V+VA+LS N + NS Y H
Sbjct: 91 SSKTLSSDYSQGILQFTIRANGTPHFVFKLENQKDVYVASLSSNVQDQNS------YMIH 144
Query: 323 WRKGGQKEHEICDNESLLVGKMKVSISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTSRQ 382
++G E + S VG++ +S F+ KI+E QF LF SN G ++ R
Sbjct: 145 LQRG-----ESLSSSSYPVGRINISTLFS-----EKILERQFVLFSSN---GEHLKIPR- 190
Query: 383 AARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACL 442
+ KN+GLSK VV + + +S+ +S I SW+ Q + L L
Sbjct: 191 -SMKNRGLSKNVVHEIKND-----KRISRLSRTSFIPGLGSWD--QQFQEPNYDLEKVNL 242
Query: 443 LENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARD 502
LEN+LP NLE A++VK+ E +EE+GGWG+KFLK+ + + A+
Sbjct: 243 LENNLPTNLETLAVVVKQ---ETLEEEIGGWGLKFLKRSPMVQRTN-------DATETET 292
Query: 503 TGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSK 562
T SM+++I +G HGGP GP +IERW+S G CDCGGWDLGC LT+L + K
Sbjct: 293 TTSSMISMNVVISSGVHGGPE---DGPLSLIERWKSQGKCDCGGWDLGCSLTLLKGQPQK 349
Query: 563 KEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVAS 622
+ F+L GSK L++VNV +Y + F L++LQSF+IA+A
Sbjct: 350 DQ-------------FELLKEGSKHETSGLKIVNVSGGVYLVQFEGKLTSLQSFAIALAF 396
Query: 623 IHSQ 626
IHSQ
Sbjct: 397 IHSQ 400
>gi|297840145|ref|XP_002887954.1| hypothetical protein ARALYDRAFT_475003 [Arabidopsis lyrata subsp.
lyrata]
gi|297333795|gb|EFH64213.1| hypothetical protein ARALYDRAFT_475003 [Arabidopsis lyrata subsp.
lyrata]
Length = 517
Score = 129 bits (323), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 171/361 (47%), Gaps = 83/361 (22%)
Query: 278 LQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNE 337
L ++ G P F+FS+D +V++A+ + + +S+ ++YSF + GG+ + E
Sbjct: 186 LSMEYELGMPVFIFSLDRPDDVYMAS-TRMDDNDSR---FVYSFRY-IGGRSNKNVSGQE 240
Query: 338 SLLVGKMKVS--ISFNLCPDNSKIMET---QFTLFGSNEDFGGGMQTSRQAARKNKGLSK 392
S L+G+M+VS I P ++E+ +F LF + +R++ K++ LS+
Sbjct: 241 SSLIGQMQVSTQICLEQEPYEEDLVESTVSEFVLFD--------IARARRSGFKHENLSR 292
Query: 393 KVVDAFRTTHSFKRRTLSKFGGSSAILENSSWEPCQN---------LGGNDDSLGAACLL 443
+ +SF+R ++ +SA + P QN L ++ +
Sbjct: 293 Q--------NSFRRGSIFSETENSASDQIQEKLPRQNSFNRGLTRSLSKRSENTASDPWP 344
Query: 444 ENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDT 503
+ L P LE+AAI++++ + + PS
Sbjct: 345 VSDLHPGLEIAAIVIQD---------------------------SSTNSKLPS------- 370
Query: 504 GGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKK 563
+M +++P G HG P T N P+ I++RWRSGG CDC GWD+GC L VL +
Sbjct: 371 ----RAMKVIVPTGNHGFPDTENSCPTPILQRWRSGGGCDCSGWDMGCHLFVLES----- 421
Query: 564 EAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASI 623
++ ++ +LF+ G K+ P + MV +R+ Y ++FH+ LSALQ+FS+ VA
Sbjct: 422 -----PELTNDHLGLELFIEGGKEITPAITMVFIREGHYEVNFHAKLSALQAFSVCVAEF 476
Query: 624 H 624
H
Sbjct: 477 H 477
>gi|361070025|gb|AEW09324.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163949|gb|AFG64725.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163950|gb|AFG64726.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163951|gb|AFG64727.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163954|gb|AFG64729.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163956|gb|AFG64731.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163959|gb|AFG64733.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163960|gb|AFG64734.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163961|gb|AFG64735.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163963|gb|AFG64736.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163964|gb|AFG64737.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163965|gb|AFG64738.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163966|gb|AFG64739.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
Length = 137
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 57/127 (44%), Positives = 82/127 (64%), Gaps = 1/127 (0%)
Query: 508 TSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIY 567
TS+ +LIPAG HG P T + PS +IERW++GG CDCGGWDLGC LT++ K +
Sbjct: 11 TSVTVLIPAGIHGSPITESDRPSPLIERWQTGGRCDCGGWDLGCSLTIIDNSNPNKSGLP 70
Query: 568 QADMHDE-CNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQ 626
++++ E + DL++ G+K+ P L + V D LY + F + LS LQ+FSI VA +HS+
Sbjct: 71 ESNIEMEGFHPSDLYIQGAKESTPALSLAIVNDGLYLLSFQAQLSPLQAFSIGVAMLHSR 130
Query: 627 SPTLRPK 633
P + K
Sbjct: 131 EPLMSAK 137
>gi|383163952|gb|AFG64728.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163955|gb|AFG64730.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
gi|383163958|gb|AFG64732.1| Pinus taeda anonymous locus UMN_4541_01 genomic sequence
Length = 137
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 56/127 (44%), Positives = 82/127 (64%), Gaps = 1/127 (0%)
Query: 508 TSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIY 567
TS+ +LIPAG HG P T + PS +IERW++GG CDCGGWDLGC LT++ K +
Sbjct: 11 TSVTVLIPAGIHGSPITESDRPSPLIERWQTGGRCDCGGWDLGCSLTIIDNSNPNKSGLP 70
Query: 568 QADMHDE-CNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQ 626
++++ E + +L++ G+K+ P L + V D LY + F + LS LQ+FSI VA +HS+
Sbjct: 71 ESNIEMEGFHPSNLYIQGAKESTPALSLAIVNDGLYLLSFQAQLSPLQAFSIGVAMLHSR 130
Query: 627 SPTLRPK 633
P + K
Sbjct: 131 EPLMSAK 137
>gi|15222448|ref|NP_176540.1| uncharacterized protein [Arabidopsis thaliana]
gi|6633852|gb|AAF19711.1|AC008047_18 F2K11.12 [Arabidopsis thaliana]
gi|63147390|gb|AAY34168.1| At1g63520 [Arabidopsis thaliana]
gi|91806017|gb|ABE65737.1| unknown [Arabidopsis thaliana]
gi|332195986|gb|AEE34107.1| uncharacterized protein [Arabidopsis thaliana]
Length = 528
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 165/357 (46%), Gaps = 75/357 (21%)
Query: 278 LQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNE 337
L+ ++ G P F+FS+D +V++A S + K ++YSF + GG+ + E
Sbjct: 187 LRMEYELGMPVFIFSLDLPDDVYMA----STRMDDKESRFVYSFSY-IGGRSNKNVSGKE 241
Query: 338 SLLVGKMKVSISFNLCPDNSKIME-------TQFTLFGSNEDFGGGMQTSRQAARKNKGL 390
S L+G+M+VS +C + E ++F LF + +R++ K++ L
Sbjct: 242 SSLIGQMQVST--QICLEQEPYEEDLVASTVSEFVLFD--------IARARRSGFKHENL 291
Query: 391 SKKVVDAFRTTHSFKR--RTLSKFGGSSAILENS-SWEPCQNLGGNDDSLGAACLLENHL 447
S++ ++FR F ++S +NS + ++L + ++ + + L
Sbjct: 292 SRQ--NSFRRGLIFSETENSVSDLLQEKLPRQNSFNRGLTRSLSKHSENTASGPWPVSDL 349
Query: 448 PPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCS 507
P LE+AAI++++ + ++ S
Sbjct: 350 HPGLEIAAIVIQDS--------------------------------------SSNSKLSS 371
Query: 508 TSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIY 567
M +++P G HG P T N P+ I++RWRSGG CDC GWD+GC L VL +
Sbjct: 372 REMKVIVPTGNHGLPDTENSCPTPILQRWRSGGGCDCSGWDMGCHLFVLES--------- 422
Query: 568 QADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
++ + + +LF+ K+ P + M +R+ Y + FH+ LSALQ+FSI VA +
Sbjct: 423 -PELINNHHGLELFIESGKEITPAMTMTCIREGHYEVKFHAKLSALQAFSICVAELQ 478
>gi|225440548|ref|XP_002273340.1| PREDICTED: uncharacterized protein LOC100245981 [Vitis vinifera]
Length = 897
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 104/195 (53%), Gaps = 31/195 (15%)
Query: 449 PNLELAAIIVK---EHL----PENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFAR 501
PN ELAAI++K E+L N+ +++ G G K E L S
Sbjct: 673 PNRELAAIVIKIPIENLNHGGDSNKNKDLMGKGFK-----------ECLPEDRCSCKLGE 721
Query: 502 DTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLS 561
+ CST+ +++P+G HG P G PS +I+RW+S G CDCGGWD+GC L +L S
Sbjct: 722 NGDPCSTT--VILPSGVHGLPS--RGAPSPLIDRWKSSGSCDCGGWDIGCKLQIL---TS 774
Query: 562 KKEAIYQADMHDECNS---FDLFVNGS--KQGAPMLRMVNVRDDLYFIHFHSTLSALQSF 616
+ + + + + CN+ FDLFV G ++ + MV ++ +Y + F++T+S +Q+F
Sbjct: 775 QDHCCWTSRLPNHCNATNRFDLFVQGGGYQEKKLIFSMVPFKEGIYSVEFNATISLIQAF 834
Query: 617 SIAVASIHSQ-SPTL 630
SI A Q SP L
Sbjct: 835 SICAAVTSQQKSPVL 849
>gi|297813617|ref|XP_002874692.1| hypothetical protein ARALYDRAFT_489973 [Arabidopsis lyrata subsp.
lyrata]
gi|297320529|gb|EFH50951.1| hypothetical protein ARALYDRAFT_489973 [Arabidopsis lyrata subsp.
lyrata]
Length = 698
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 176/373 (47%), Gaps = 66/373 (17%)
Query: 284 GGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEI-------CDN 336
G P F F D +EV+ A S++G++ ++Y+F G +K +
Sbjct: 288 NGLPVFQFVSDSPEEVYAAKTWKSDNGST----WVYTFS-SAGSRKRSSASVRGLNDVNK 342
Query: 337 ESLLVGKMKVSISFNLCP-------DNSKIMETQFTLF---------GSNEDFGGGM--- 377
ESLLV +M+VS LC D +M +F LF + ED +
Sbjct: 343 ESLLVAQMQVSC--KLCTEVRKKGQDPETLMVNEFVLFDIAQARRSISTKEDQSLPLDMV 400
Query: 378 -QTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILENSSWEPCQNLGGNDDS 436
TS+ +A+ + + + + + K+R + S LE S+
Sbjct: 401 NNTSKNSAKPDSEIRNNSMSG-DASDTLKQRCQPRRTSQSYDLEASN------------- 446
Query: 437 LGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVE--TLKTSA 494
G L P+LE+AAII+++ + E R+ +K+ + G K+ +E L +
Sbjct: 447 -GTNPWSATDLHPDLEIAAIIIQDTI-EKRE------SLKYRR--GDKRLMEKTNLLGLS 496
Query: 495 PSASFARDTGGCSTS--MDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCP 552
P ++ G +S + ++IP G HG P T N PS +I+RWRSGG CDCGGWD+ CP
Sbjct: 497 PIEEEKKELYGSRSSEKLKVVIPRGNHGLPTTENSCPSPLIQRWRSGGGCDCGGWDMACP 556
Query: 553 LTVLGT-RLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLS 611
L V+G R+S + + D + LFV G+K+ P L M V + Y +HFH+ LS
Sbjct: 557 LMVIGNPRIS---CSHDQPLVDNQHPLQLFVQGAKEHIPALYMSFVEEGQYDVHFHAQLS 613
Query: 612 ALQSFSIAVASIH 624
LQ+FSI VA +H
Sbjct: 614 TLQAFSICVAILH 626
>gi|115452131|ref|NP_001049666.1| Os03g0267600 [Oryza sativa Japonica Group]
gi|108707365|gb|ABF95160.1| expressed protein [Oryza sativa Japonica Group]
gi|113548137|dbj|BAF11580.1| Os03g0267600 [Oryza sativa Japonica Group]
gi|215704306|dbj|BAG93146.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 899
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 81/133 (60%), Gaps = 6/133 (4%)
Query: 501 RDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLG--T 558
RD S+S+ ++P GFHG + +G P +IERW+SGG CDCGGWD GC L+VL
Sbjct: 721 RDDSCASSSVLAVLPNGFHGT--STSGQPLPLIERWKSGGACDCGGWDEGCMLSVLSDDA 778
Query: 559 RLSKKEAIYQADM-HDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLSALQSF 616
R +K + QA+ D FDLFV G S++ MV+ R+ LY + F S+++ LQ+F
Sbjct: 779 RENKGDKSTQANQTTDGSQRFDLFVQGRSREDKHAFSMVSFREGLYTVEFRSSIALLQAF 838
Query: 617 SIAVASIHSQSPT 629
++ + +H + PT
Sbjct: 839 AMCIVMLHGRRPT 851
>gi|218192504|gb|EEC74931.1| hypothetical protein OsI_10891 [Oryza sativa Indica Group]
Length = 330
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 81/133 (60%), Gaps = 6/133 (4%)
Query: 501 RDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLG--T 558
RD S+S+ ++P GFHG + +G P +IERW+SGG CDCGGWD GC L+VL
Sbjct: 152 RDDSCASSSVLAVLPNGFHG--TSTSGQPLPLIERWKSGGACDCGGWDEGCMLSVLSDDA 209
Query: 559 RLSKKEAIYQADM-HDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLSALQSF 616
R +K + QA+ D FDLFV G S++ MV+ R+ LY + F S+++ LQ+F
Sbjct: 210 RENKGDKSTQANQTTDGSQRFDLFVQGRSREDKHAFSMVSFREGLYTVEFRSSIALLQAF 269
Query: 617 SIAVASIHSQSPT 629
++ + +H + PT
Sbjct: 270 AMCIVMLHGRRPT 282
>gi|15233384|ref|NP_192884.1| uncharacterized protein [Arabidopsis thaliana]
gi|7267845|emb|CAB78188.1| putative protein [Arabidopsis thaliana]
gi|7321042|emb|CAB82150.1| putative protein [Arabidopsis thaliana]
gi|332657613|gb|AEE83013.1| uncharacterized protein [Arabidopsis thaliana]
Length = 694
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 112/377 (29%), Positives = 172/377 (45%), Gaps = 74/377 (19%)
Query: 284 GGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEI-------CDN 336
G P F F D +EV+ A S++G++ ++Y+F G +K
Sbjct: 286 NGLPVFQFVSDSPEEVYAAKTWKSDNGST----WVYTFS-SAGSRKRSSASVRGLTDVSK 340
Query: 337 ESLLVGKMKVSISFNLC-------PDNSKIMETQFTLF---------GSNEDFGGGMQTS 380
ESLLV +M+V+ +C D +M +F L+ + ED + T
Sbjct: 341 ESLLVAQMQVTC--KMCSEVRKKGQDPETLMVNEFVLYDIAQARRSVSTKEDQSLPLDTV 398
Query: 381 RQAAR---------KNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILENSSWEPCQNLG 431
A++ +N +S D + K+R+ K S LE S+ G
Sbjct: 399 NNASKNSVKSDSEIRNNSMSGDASD------TMKQRSQPKRTSQSYDLEASN-------G 445
Query: 432 GNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKK---VGIKKTVE 488
N S +L P+LE+AAII+++ + + + + ++K +G+ E
Sbjct: 446 TNPWSAA-------NLHPDLEIAAIIIQDTIEKRESLKYRRGDKRLMEKTNILGLSPIEE 498
Query: 489 TLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWD 548
K S S + + ++IP G HG P T N PS +I+RWRSGG CDCGGWD
Sbjct: 499 EKKELFGSRSLEK--------LKVVIPRGNHGLPTTENSCPSPLIQRWRSGGGCDCGGWD 550
Query: 549 LGCPLTVLGT-RLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFH 607
+ CPL VLG R+S + + + + LFV G+K+ P L M V + Y +HFH
Sbjct: 551 MACPLMVLGNPRIS---CSHDQPLVENQHPLQLFVQGAKEHIPALYMSFVEEGQYDVHFH 607
Query: 608 STLSALQSFSIAVASIH 624
+ LS LQ+FSI VA +H
Sbjct: 608 AQLSTLQAFSICVAILH 624
>gi|449450920|ref|XP_004143210.1| PREDICTED: uncharacterized protein LOC101204783 [Cucumis sativus]
gi|449522207|ref|XP_004168119.1| PREDICTED: uncharacterized protein LOC101226098 [Cucumis sativus]
Length = 904
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/183 (39%), Positives = 96/183 (52%), Gaps = 16/183 (8%)
Query: 449 PNLELAAIIVKEHLPENRQ----EEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTG 504
PN ELAAIIVK P+ +Q +EV K L K G ++ K S P A
Sbjct: 673 PNEELAAIIVKIP-PKIKQGTATDEVKINTNKNLTKGGSRECFPHSKVSEPVQHPAGSES 731
Query: 505 GCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTR---LS 561
ST+ +L+P+G H P GGPS +IERW SGG CDCGGWDLGC L V + +
Sbjct: 732 FISTT--VLLPSGIHSLPS--KGGPSSLIERWTSGGSCDCGGWDLGCKLRVFANQNQIIE 787
Query: 562 KKEAIYQADMHDECNSFDLFVN-GSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAV 620
K + + D+ F LF G ++ +L + +D +Y I F S+L LQ+FSI +
Sbjct: 788 KSSSSQPVPLTDQ---FKLFPQEGVQENHCVLSLAAFKDMIYSIEFDSSLPLLQAFSICL 844
Query: 621 ASI 623
A I
Sbjct: 845 AMI 847
>gi|222624629|gb|EEE58761.1| hypothetical protein OsJ_10267 [Oryza sativa Japonica Group]
Length = 330
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 81/133 (60%), Gaps = 6/133 (4%)
Query: 501 RDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLG--T 558
RD S+S+ ++P GFHG + +G P +IERW+SGG CDCGGWD GC L+VL
Sbjct: 152 RDDSCASSSVLAVLPNGFHG--TSTSGQPLPLIERWKSGGACDCGGWDEGCMLSVLSDDA 209
Query: 559 RLSKKEAIYQADM-HDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLSALQSF 616
R +K + QA+ D FDLFV G S++ MV+ R+ LY + F S+++ LQ+F
Sbjct: 210 RENKGDKSTQANQTTDGSQRFDLFVQGRSREDKHAFSMVSFREGLYTVEFRSSIALLQAF 269
Query: 617 SIAVASIHSQSPT 629
++ + +H + PT
Sbjct: 270 AMCIVMLHGRRPT 282
>gi|297740279|emb|CBI30461.3| unnamed protein product [Vitis vinifera]
Length = 855
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 100/191 (52%), Gaps = 31/191 (16%)
Query: 453 LAAIIVK---EHL----PENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGG 505
L AI++K E+L N+ +++ G G K E L S +
Sbjct: 635 LVAIVIKIPIENLNHGGDSNKNKDLMGKGFK-----------ECLPEDRCSCKLGENGDP 683
Query: 506 CSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEA 565
CST+ +++P+G HG P G PS +I+RW+S G CDCGGWD+GC L +L S+
Sbjct: 684 CSTT--VILPSGVHGLPS--RGAPSPLIDRWKSSGSCDCGGWDIGCKLQIL---TSQDHC 736
Query: 566 IYQADMHDECNS---FDLFVNGS--KQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAV 620
+ + + + CN+ FDLFV G ++ + MV ++ +Y + F++T+S +Q+FSI
Sbjct: 737 CWTSRLPNHCNATNRFDLFVQGGGYQEKKLIFSMVPFKEGIYSVEFNATISLIQAFSICA 796
Query: 621 ASIHSQ-SPTL 630
A Q SP L
Sbjct: 797 AVTSQQKSPVL 807
>gi|242037265|ref|XP_002466027.1| hypothetical protein SORBIDRAFT_01g050295 [Sorghum bicolor]
gi|241919881|gb|EER93025.1| hypothetical protein SORBIDRAFT_01g050295 [Sorghum bicolor]
Length = 736
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/198 (27%), Positives = 91/198 (45%), Gaps = 38/198 (19%)
Query: 430 LGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVE 488
+G N+ + +C E L P+ E+AA +++ +++ +E+
Sbjct: 505 IGRNNSDVSTSCPWSEEDLYPHFEIAATVIEIPFSKDKSKEM------------------ 546
Query: 489 TLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWD 548
G ++ ++ P+G HG P PS +++RWR G CDCGGWD
Sbjct: 547 -------------KNGSSPCTVKVVTPSGLHGSPSDDEASPSPLLDRWRYSGGCDCGGWD 593
Query: 549 LGCPLTVLGTRLSKK--EAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHF 606
+ CP+ +LG E++ H +LFV GSK+ P+L M + + F
Sbjct: 594 MACPIEILGNVYDNNWAESVTTNAKH----PMELFVQGSKEELPVLSMKENGRGQFLVDF 649
Query: 607 HSTLSALQSFSIAVASIH 624
H LSALQ+FS+ ++ +H
Sbjct: 650 HGRLSALQAFSVCISLLH 667
>gi|158828221|gb|ABW81099.1| unknown [Cleome spinosa]
Length = 900
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 68/119 (57%), Gaps = 3/119 (2%)
Query: 504 GGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKK 563
G S +++P+G H P GGPS +I+RW+SGG CDCGGWD+GC L +L ++
Sbjct: 682 GETQRSTTVILPSGIHSLPH--RGGPSSLIQRWKSGGACDCGGWDMGCNLRILTSQNQPS 739
Query: 564 EAIYQADMHDECNSFDLFVNGS-KQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVA 621
+ ++ + F LF G P LR +D +Y + ++S+LS LQ+FSI +A
Sbjct: 740 KPSCSSEPPSTSDGFKLFFQGGVSDNRPFLRFTTYKDGVYAVEYNSSLSLLQAFSICIA 798
>gi|302757011|ref|XP_002961929.1| hypothetical protein SELMODRAFT_76246 [Selaginella moellendorffii]
gi|300170588|gb|EFJ37189.1| hypothetical protein SELMODRAFT_76246 [Selaginella moellendorffii]
Length = 190
Score = 98.6 bits (244), Expect = 9e-18, Method: Composition-based stats.
Identities = 55/146 (37%), Positives = 83/146 (56%), Gaps = 18/146 (12%)
Query: 497 ASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVL 556
A F R+ ST + +++PAG HG P +GGP +++RWRSGG CDCG WDLGC LT+L
Sbjct: 5 ADFGREEAPAST-VTVVLPAGNHGLPVRGSGGPQPLMDRWRSGGRCDCGSWDLGCGLTIL 63
Query: 557 -------------GTRLSKKEAIYQADMHDECNSFDLFVNG-SKQGAPMLRMVNVRDDLY 602
L +E I+ + + D SF F G +K +P LR+ + D L+
Sbjct: 64 EGQQELQRSKELINISLQVREVIFYS-LADR--SFKFFFQGKNKSQSPSLRLGAISDGLF 120
Query: 603 FIHFHSTLSALQSFSIAVASIHSQSP 628
I F + +S LQ+F+ A+A ++++ P
Sbjct: 121 PISFQAPVSYLQAFASAIAVVYTRGP 146
>gi|302775478|ref|XP_002971156.1| hypothetical protein SELMODRAFT_441460 [Selaginella moellendorffii]
gi|300161138|gb|EFJ27754.1| hypothetical protein SELMODRAFT_441460 [Selaginella moellendorffii]
Length = 869
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 76/137 (55%), Gaps = 19/137 (13%)
Query: 497 ASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVL 556
A F R+ ST + +++PAG HG P +GGP +I+RWRSGG CDCG WDLGC LT+L
Sbjct: 703 ADFGREEAPAST-VTVILPAGNHGLPVRGSGGPQPLIDRWRSGGRCDCGSWDLGCGLTIL 761
Query: 557 GTRLSKKEAIYQADMHDECNSFDLFVNGSKQG-----APMLRMVNVRDDLYFIHFHSTLS 611
+ E +N S QG +P LR+ + D L+ I F + +S
Sbjct: 762 -------------EGQQELQRSKELINISLQGKNKSRSPSLRLGAISDGLFPISFQAPVS 808
Query: 612 ALQSFSIAVASIHSQSP 628
LQ+F+ A+A ++++ P
Sbjct: 809 YLQAFASAIAVVYTRGP 825
>gi|224106353|ref|XP_002314139.1| predicted protein [Populus trichocarpa]
gi|222850547|gb|EEE88094.1| predicted protein [Populus trichocarpa]
Length = 928
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 72/114 (63%), Gaps = 6/114 (5%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVL--GTRLSKKEAIYQA 569
+++P+G H P GGPS +++RWRSGG CDCGGWDLGC L +L +++KK + +A
Sbjct: 762 VILPSGIHSLPN--KGGPSSLLQRWRSGGSCDCGGWDLGCKLRILVNQNQINKKSSPSKA 819
Query: 570 DMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASI 623
+ + F+L ++ P+ M +D +Y + F+++LS LQ+FS+ +A +
Sbjct: 820 CL--AIDKFELVSQCEEENQPVFIMTPFKDGIYSVEFNTSLSTLQAFSLCIAVL 871
>gi|242036211|ref|XP_002465500.1| hypothetical protein SORBIDRAFT_01g039960 [Sorghum bicolor]
gi|241919354|gb|EER92498.1| hypothetical protein SORBIDRAFT_01g039960 [Sorghum bicolor]
Length = 894
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 74/130 (56%), Gaps = 3/130 (2%)
Query: 501 RDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRL 560
R+ S ++ ++P GFHG T G P +IERW+SGG CDCGGWD GC L++L
Sbjct: 720 RNANSASANVIAVLPDGFHGTSTT--GQPLPLIERWKSGGACDCGGWDEGCTLSILTDAA 777
Query: 561 SKKEAIYQADMHDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIA 619
+ +AI D F+L V G S++ M + ++ LY + F S+++ LQ+F++
Sbjct: 778 QENDAIQANLAKDGSQRFELLVQGRSREDRHAFSMTSFKEGLYTVEFRSSIALLQAFAMC 837
Query: 620 VASIHSQSPT 629
+ +H + P+
Sbjct: 838 IVMLHGRYPS 847
>gi|449440293|ref|XP_004137919.1| PREDICTED: uncharacterized protein LOC101221609 [Cucumis sativus]
Length = 997
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 69/117 (58%), Gaps = 6/117 (5%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADM 571
+++P HG P + G PS +I RWRSGG CDCGGWD GC L +L + K+ +A
Sbjct: 838 VILPGAAHGSPSS--GEPSPLINRWRSGGVCDCGGWDEGCKLRILS--IPNKQITSKACP 893
Query: 572 HDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSP 628
+C +LFV G +Q P+ M ++ + + F S++S LQ+F I VA ++ Q+P
Sbjct: 894 ISKC--LELFVQGDQQDKPVFSMAPLKGGFFEVRFDSSISMLQAFFICVAVLNGQNP 948
>gi|356523227|ref|XP_003530243.1| PREDICTED: uncharacterized protein LOC100789408 [Glycine max]
Length = 940
Score = 96.3 bits (238), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 67/118 (56%), Gaps = 5/118 (4%)
Query: 510 MDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQA 569
M +++P G H P T G PS +I RW+ GG CDCGGWD+GC L VL + +
Sbjct: 780 MTVILPGGVHSSPNT--GQPSPLIRRWKLGGTCDCGGWDVGCKLLVLSNQNLSSNIPRSS 837
Query: 570 DMHDECNSFDLFVN-GSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQ 626
+ E F LFV G+ Q P+ +V ++D Y + F S ++ LQ+F I+VA + SQ
Sbjct: 838 KSYLE--RFHLFVQEGADQNTPLFTLVPLKDGFYSVEFSSEINHLQAFFISVAVLSSQ 893
>gi|414867653|tpg|DAA46210.1| TPA: hypothetical protein ZEAMMB73_731165 [Zea mays]
Length = 699
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 95/200 (47%), Gaps = 38/200 (19%)
Query: 428 QNLGGNDDSLGAACLLENH-LPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
QN D L +CL L P+LE+AAI+++ L + + +E+ K G
Sbjct: 468 QNTTRIDSDLSTSCLWSREDLHPHLEVAAIVIQAPLHKTKSKEL---------KAG---- 514
Query: 487 VETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
S+P A GG HG PR PS +++R +SGG CDCGG
Sbjct: 515 ------SSPGTIKAVTAGGA------------HGLPRDDEASPSPLLDRLKSGGACDCGG 556
Query: 547 WDLGCPLTVLGTRLSKKEAIYQAD--MHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFI 604
WD+ CP+ VL Y AD M++ +LF G+K+ P L M + + +
Sbjct: 557 WDMSCPIVVLDNAYDG----YWADSVMNESKAPMELFAQGNKEVLPALSMKADGNGHFSV 612
Query: 605 HFHSTLSALQSFSIAVASIH 624
FH+ LSALQ+FSI V+ +H
Sbjct: 613 DFHAGLSALQAFSICVSLLH 632
>gi|449452468|ref|XP_004143981.1| PREDICTED: uncharacterized protein LOC101216494 [Cucumis sativus]
Length = 762
Score = 95.5 bits (236), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 174/373 (46%), Gaps = 54/373 (14%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEH---- 331
G L+ G P F FS ++V+VA ++GN+ ++Y+FH + +K +
Sbjct: 354 GSLKLEKKHGMPFFEFSQSSPEDVYVAK--TWKTGNA--FKWVYTFHSQDHHKKSNAGSF 409
Query: 332 ---EICDNESLLVGKMKVS--ISFNLCP---DNSKIMETQFTLFGSNEDFGGGMQTSRQA 383
C N SL+VG+M+VS +S L DNS M T+F L+ + +RQ+
Sbjct: 410 GLNHSCKN-SLMVGQMQVSCYLSSELRDGGFDNS--MVTEFVLYDAAR--------ARQS 458
Query: 384 ARKNKGLSKKVVDAFRTTHS----FKRRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGA 439
++G + DA + S S G+ LE ++ D
Sbjct: 459 T-ASQGSCDSIHDAVKPPKSSDTGLVGEPFSVNDGTP--LEKFKFQRKHASENCDHGSID 515
Query: 440 ACLLEN-HLPPNLELAAIIV------KEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKT 492
+C ++ L P+LE AAI++ +E L R ++ G K + +E K
Sbjct: 516 SCPWDSADLHPDLESAAIVMQIPFSKRESLKYKRGDKTSG---KLNSAIQNLSKIEQRKE 572
Query: 493 SAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCP 552
P + ++ ++IP G HG P + GPS +++RWR GG CDCGGWD+GCP
Sbjct: 573 EPPHHT-------TQETLKVVIPTGNHGLPTVESQGPSTLLDRWRLGGGCDCGGWDMGCP 625
Query: 553 LTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSA 612
L +LGT S + H +F LF G+K P L M V+D Y + FH+ LS
Sbjct: 626 LLILGTHSSH---CAENQAHKGKQTFHLFHQGAKDTTPALTMNIVKDGQYSVDFHARLST 682
Query: 613 LQSFSIAVASIHS 625
LQ+FSI VA +H+
Sbjct: 683 LQAFSICVAILHA 695
>gi|255537477|ref|XP_002509805.1| conserved hypothetical protein [Ricinus communis]
gi|223549704|gb|EEF51192.1| conserved hypothetical protein [Ricinus communis]
Length = 736
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 175/375 (46%), Gaps = 61/375 (16%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHE--- 332
G L+ G PHF FS++ +EV VA + + +++Y+FH +K +
Sbjct: 327 GCLKLDNKHGIPHFEFSLECPEEVLVAKTWKANNA----FNWVYTFHSISSRKKSNASGW 382
Query: 333 -ICD--NESLLVGKMKVSISFNLCP--------DNSKIMETQFTLFGSNEDFGGGMQTSR 381
+ D +SL+VG+M+VS LC DNS M T+F L+ + +R
Sbjct: 383 GLIDGNKDSLVVGQMEVSCY--LCSELKDVGEFDNS--MVTEFVLYD--------IAHAR 430
Query: 382 QAARKNKGLSKKVVDAFRTTHSFKR----RTLSKFGGSSAI-LENSSWEPCQNLGGNDDS 436
Q+ ++ D FK T GS I L++ S GN DS
Sbjct: 431 QSVS-----TQDPADIIEPPSDFKPGSFCETQKLNNGSEPIKLKHHSKRAFDT--GNIDS 483
Query: 437 LGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPS 496
L +LE+AAI+V+ LP ++E +K+ K G K + + + +
Sbjct: 484 SNPYPWPSAVLHADLEIAAIVVQ--LPFAKRE-----SLKY--KRGDKSSDKAHRNLLNN 534
Query: 497 ASFARDTGGCSTS------MDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLG 550
+ G TS + ++IP G H P N GPS ++ RWRSGG CDCGGWD+
Sbjct: 535 SMNEHGRKGGFTSRESPEKLKVVIPTGNHSLPIDENQGPSSLLNRWRSGGGCDCGGWDMA 594
Query: 551 CPLTVLGT-RLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHST 609
CPLTV G+ ++ E + + D +LFV G+KQ P L M V + Y + FH+
Sbjct: 595 CPLTVFGSPSINCAEDVL---LIDNQQPLELFVQGTKQKIPALTMRIVEEGKYAVDFHAQ 651
Query: 610 LSALQSFSIAVASIH 624
LS LQ+FSI VA +H
Sbjct: 652 LSTLQAFSICVAILH 666
>gi|357147325|ref|XP_003574302.1| PREDICTED: uncharacterized protein LOC100838015 [Brachypodium
distachyon]
Length = 728
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 145/585 (24%), Positives = 239/585 (40%), Gaps = 115/585 (19%)
Query: 86 HMVTLDERYLRRCLELIHKSASKAAQCNISVDLGSAKM------GFMWD---DMNVAKCR 136
H V D+R E I+ S+ K N+ LG +M G + D ++++ +
Sbjct: 145 HEVNQDDR------EFINLSSLKVPDENLK--LGRPRMDCNLLKGDVRDSLLELSLEEDT 196
Query: 137 TENAC-DSARFIFECPMITGSESVVISPAGQWIVGSIMGSKSMINILKSPLLSKLGKLDG 195
T++ C + A + E S+ P G + GS G + +++ L +K+G D
Sbjct: 197 TKSPCKNVAPHLLEGSGKKDKSSIFQHPIGVYPDGSNSGERDLVSNLPKSFSAKVGVFDA 256
Query: 196 DGNFRRTSVNDVKGS----------ISYDFMKSPGGFSGYSPQKIEKQTWMLGNHK-HGS 244
+ S++ V G I FMKS S +P +EK N G
Sbjct: 257 --TYPLESIHAVDGKKKARSSTFRKIMDPFMKSK---SLRNPSLLEKGDPKSSNAPARGK 311
Query: 245 ETA-HKRLAS-ASSTNSTCSDQSSSSA--------STSTSL-GVLQCMWDGGNPHFVFSI 293
A HK L S S T T + + +S S+ T L VL+ D F F
Sbjct: 312 NNALHKPLLSDISRTEQTLTPKCQTSVEARPMTVTSSPTHLHAVLKLDPDNCAFGFEFRT 371
Query: 294 DDRKEVFVANLSNSESGNSKSMDYMYSFHW-----RKGGQKEHEICDNESLLVGKMKVSI 348
+E AN +SGN ++++Y+FH G+ + + +VG+M VS
Sbjct: 372 KGPEESIYAN--TWKSGNE--LNWIYTFHSVGKRSSTVGRTSKDRHGSLPPIVGQMHVSS 427
Query: 349 SFNLCPDNSKIME----TQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVVDAF--RTTH 402
I+ ++F L+ D AR++ + + F T+
Sbjct: 428 YLYSEVGEDGILNNSATSEFVLY----DIA--------HARRSSAVERTQCTDFIQPTSR 475
Query: 403 SFKRRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEH 461
+ R ++S+ LE ++ QN N+ A CL + L P+LE+AA++++
Sbjct: 476 NVVRNSISRES-----LERNNMMERQNTARNNSDGSAFCLWSQEDLHPHLEVAAVVIQVP 530
Query: 462 LPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGG 521
+ R +E+ +G ++ I G HG
Sbjct: 531 FHKTRSKEL-------------------------------KSGSSPGTIKIATAGGAHGL 559
Query: 522 PRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQAD--MHDECNSFD 579
PR PS ++ R +SGG CDCGGWD+ CP+ VL Y D M++ + +
Sbjct: 560 PRDDESSPSPLLNRLKSGGRCDCGGWDMSCPIVVLDNAYDS----YWVDSVMNESKHPME 615
Query: 580 LFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
LFV G+++ P L M + + + FH+ LSALQ+FSI ++ +H
Sbjct: 616 LFVQGNQEVLPALLMKADGNGQFSVDFHARLSALQAFSICISLLH 660
>gi|255558870|ref|XP_002520458.1| hypothetical protein RCOM_0731430 [Ricinus communis]
gi|223540300|gb|EEF41871.1| hypothetical protein RCOM_0731430 [Ricinus communis]
Length = 912
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 72/122 (59%), Gaps = 9/122 (7%)
Query: 509 SMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQ 568
S +++P+G H P GGPS +I+RWRSGG CDCGGWDLGC L + + + I +
Sbjct: 741 STTVILPSGVHSLPN--KGGPSSLIQRWRSGGSCDCGGWDLGCKLKIFA---NDSQHIKK 795
Query: 569 ADMHDEC---NSFDLFVNGS-KQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
+ C + F+L GS ++ P+ + +D +Y + F S+LS LQ+FS+ +A +
Sbjct: 796 SCSSKPCAISDKFELISQGSEEENRPVFSLAPFKDGIYSVEFTSSLSILQAFSLCIAVLD 855
Query: 625 SQ 626
S+
Sbjct: 856 SK 857
>gi|242040269|ref|XP_002467529.1| hypothetical protein SORBIDRAFT_01g029670 [Sorghum bicolor]
gi|241921383|gb|EER94527.1| hypothetical protein SORBIDRAFT_01g029670 [Sorghum bicolor]
Length = 704
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 93/198 (46%), Gaps = 34/198 (17%)
Query: 428 QNLGGNDDSLGAACLLENH-LPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
QN D L +CL L P+LE+AAI+++ + + +E +K GI K
Sbjct: 472 QNNTRTDSDLSTSCLWSREDLHPHLEVAAIVIQVPFHKTKSKE-----LKAGSSPGIIKA 526
Query: 487 VETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
V T G G HG PR PS +++R +SGG CDCGG
Sbjct: 527 V---------------TAG-----------GAHGLPRDDEASPSPLLDRLKSGGACDCGG 560
Query: 547 WDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHF 606
WD+ CP+ VL A + M++ +LF G+K+ P L M + + + F
Sbjct: 561 WDMSCPIVVLDNAYDSHWA--DSVMNESKAPMELFAQGNKEVLPALSMKADGNGHFSVDF 618
Query: 607 HSTLSALQSFSIAVASIH 624
H+ LSALQ+FSI ++ +H
Sbjct: 619 HARLSALQAFSICISLLH 636
>gi|297822665|ref|XP_002879215.1| hypothetical protein ARALYDRAFT_481867 [Arabidopsis lyrata subsp.
lyrata]
gi|297325054|gb|EFH55474.1| hypothetical protein ARALYDRAFT_481867 [Arabidopsis lyrata subsp.
lyrata]
Length = 864
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 71/121 (58%), Gaps = 9/121 (7%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADM 571
+++P+G H P GGPS +I+RW+SGG CDCGGWD+GC L +L + +K ++
Sbjct: 694 VVLPSGIHSLPH--KGGPSSLIQRWKSGGSCDCGGWDMGCNLRILTNQHNK----HRNPS 747
Query: 572 HDECNSFDLFVNGSKQ---GAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSP 628
++F LF G Q P L R+ +Y + ++++LS LQ+FSI +A ++P
Sbjct: 748 PSTSDAFKLFFQGGLQDNNNQPYLSFTTYREGVYAVEYNTSLSLLQAFSICIAVNEGRNP 807
Query: 629 T 629
+
Sbjct: 808 S 808
>gi|414866017|tpg|DAA44574.1| TPA: hypothetical protein ZEAMMB73_315261 [Zea mays]
gi|414866018|tpg|DAA44575.1| TPA: hypothetical protein ZEAMMB73_315261 [Zea mays]
gi|414866019|tpg|DAA44576.1| TPA: hypothetical protein ZEAMMB73_315261 [Zea mays]
Length = 890
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 72/124 (58%), Gaps = 3/124 (2%)
Query: 506 CSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEA 565
S ++ ++P GFHG ++ +G P +IERW+SGG CDCGGWD GC L+VL + +A
Sbjct: 721 ASANVIAVLPDGFHG--KSTSGQPLPLIERWKSGGACDCGGWDEGCMLSVLTDAAEENDA 778
Query: 566 IYQADMHDECNSFDLFVN-GSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
+ D F+L GS++ M + ++ LY + F S+++ LQ+F++ + +H
Sbjct: 779 VQANQAKDGSQRFELLAQGGSREDRHAFSMTSFKEGLYTVEFRSSIALLQAFAMCIVMLH 838
Query: 625 SQSP 628
+ P
Sbjct: 839 GRYP 842
>gi|357454267|ref|XP_003597414.1| hypothetical protein MTR_2g097850 [Medicago truncatula]
gi|355486462|gb|AES67665.1| hypothetical protein MTR_2g097850 [Medicago truncatula]
Length = 915
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 71/127 (55%), Gaps = 16/127 (12%)
Query: 510 MDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGT-----RLSKKE 564
+ +++P+G H P +GGPS +IERW++GG CDCGGWDL C L +L R S+
Sbjct: 755 VTVVLPSGVHSLPS--DGGPSSLIERWKTGGSCDCGGWDLACKLKILANNNQTCRKSRTS 812
Query: 565 AIYQADMHDECNSFDLFVNGSKQ---GAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVA 621
Y AD ++LF G++Q P+ + +Y + F S+LS LQ+FSI +A
Sbjct: 813 EAYFADQ------YELFFQGTEQCQDNRPVFNFSHFEPGVYSVAFGSSLSPLQAFSICIA 866
Query: 622 SIHSQSP 628
+ + P
Sbjct: 867 MVDGKLP 873
>gi|357439755|ref|XP_003590155.1| hypothetical protein MTR_1g045110 [Medicago truncatula]
gi|355479203|gb|AES60406.1| hypothetical protein MTR_1g045110 [Medicago truncatula]
Length = 124
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 64/102 (62%), Gaps = 15/102 (14%)
Query: 442 LLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFAR 501
LL ++P N E+ AI+VKEH+P N E+VGGWG KFL K+ PS +
Sbjct: 23 LLGKNVPTNFEVGAIVVKEHVPFNCLEKVGGWGSKFLNKL-------------PSECYGG 69
Query: 502 DTGGCSTSMDILIPAGFHGGPRTRNG--GPSGIIERWRSGGC 541
S SM+I+IP HGGP++R+G GPSG+I+RW+SGGC
Sbjct: 70 KNEDYSNSMNIVIPGEIHGGPKSRDGGPGPSGLIDRWKSGGC 111
>gi|326500392|dbj|BAK06285.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 720
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 97/201 (48%), Gaps = 40/201 (19%)
Query: 428 QNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
QN N+ +CL + L P+LE+AA++V+ + R +E+
Sbjct: 490 QNTARNNSDASVSCLWSQEDLHPHLEVAAVVVQVPFHKTRSQEL---------------- 533
Query: 487 VETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
TG ++ ++ G HG PR PS ++ R +SGG CDCGG
Sbjct: 534 ---------------KTGSSPGTVKVVTAGGAHGLPRDDETSPSPLLNRLKSGGRCDCGG 578
Query: 547 WDLGCPLTVLGTRLSKKEAIYQAD--MHDECNSFDLFVNGSKQGAPMLRM-VNVRDDLYF 603
WD+ CP+ VL Y D M++ + +LF+ G+++ P L M V+ + D +
Sbjct: 579 WDMSCPIVVLENAYDS----YWVDSVMNEGKHPMELFIKGNQEVLPALSMKVDGKGD-FS 633
Query: 604 IHFHSTLSALQSFSIAVASIH 624
+ FH+ LSALQ+FS+ ++ +H
Sbjct: 634 VDFHARLSALQAFSVCISLLH 654
>gi|255587203|ref|XP_002534178.1| hypothetical protein RCOM_0303160 [Ricinus communis]
gi|223525738|gb|EEF28202.1| hypothetical protein RCOM_0303160 [Ricinus communis]
Length = 937
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 88/181 (48%), Gaps = 17/181 (9%)
Query: 447 LPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGC 506
L PN ELAA+++K+ G G K +E + PS + D+ C
Sbjct: 720 LMPNTELAAVVIKK--------PSGNLGYDGSGSDKEKNLMEKDFSWCPSDNEHSDS--C 769
Query: 507 STSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAI 566
+ +++P G H P T G PS +I RWRSGG CDCGGWD+GC L +L ++
Sbjct: 770 T----VILPGGVHSLPST--GVPSSLIHRWRSGGSCDCGGWDVGCKLRILSNENLNQKLQ 823
Query: 567 YQADMHDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHS 625
+ + F+LFV G +Q PM M + Y + F ++S LQ+ I+V+ I
Sbjct: 824 RASGACPMSHDFELFVQGEQQQDKPMFSMAPIEKGKYSVEFRPSISPLQALFISVSVISC 883
Query: 626 Q 626
Q
Sbjct: 884 Q 884
>gi|413956209|gb|AFW88858.1| hypothetical protein ZEAMMB73_942218 [Zea mays]
Length = 1068
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 71/129 (55%), Gaps = 3/129 (2%)
Query: 501 RDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRL 560
R S ++ ++P GFHG +G P +IERWRSGG CDCGGWD GC L+VL
Sbjct: 893 RSANSASANVVAVLPDGFHGT--ITSGQPLPLIERWRSGGACDCGGWDEGCMLSVLTDAS 950
Query: 561 SKKEAIYQADMHDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIA 619
+ +AI D F+L G S++ M + ++ LY + F S+++ LQ+F++
Sbjct: 951 EENDAIQANQAKDGSQRFELLDQGRSREERHAFSMTSFKEGLYTVEFRSSIALLQAFAMC 1010
Query: 620 VASIHSQSP 628
+ +H + P
Sbjct: 1011 IVMLHGRYP 1019
>gi|326498347|dbj|BAJ98601.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 894
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 76/133 (57%), Gaps = 6/133 (4%)
Query: 509 SMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKE---A 565
S+ ++P GFHG + +G P ++ERWRS G CDCGGWD GC L+VL K+ +
Sbjct: 724 SVIAILPNGFHGA--STSGQPLPLMERWRSAGSCDCGGWDEGCTLSVLTDNTHGKQGCKS 781
Query: 566 IYQADMHDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
I D + FDL G S++ MV+ ++ LY + F S+++ LQ+F++ + +H
Sbjct: 782 IQVNQTQDGSHRFDLLAQGRSQEDRHAFSMVSFKEGLYAVEFRSSIALLQAFAMCIVMLH 841
Query: 625 SQSPTLRPKNVQS 637
+SP ++Q+
Sbjct: 842 GRSPVRVQTDLQA 854
>gi|224058507|ref|XP_002299532.1| predicted protein [Populus trichocarpa]
gi|222846790|gb|EEE84337.1| predicted protein [Populus trichocarpa]
Length = 737
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 175/384 (45%), Gaps = 71/384 (18%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEIC- 334
G L+ G P F FS+D +EV VA + + + +++Y+FH +K +
Sbjct: 332 GRLKLENKHGVPFFEFSLDFPEEVLVAKTWKA----NNAFNWVYTFHSISSRKKSNATGW 387
Query: 335 -----DNESLLVGKMKVSISFNLCP--------DNSKIMETQFTLFG--------SNEDF 373
+ ESL+VG+M+VS LC DNS M T+F ++ S ED
Sbjct: 388 GLTDGNKESLIVGQMQVSCY--LCSKLKDGGNFDNS--MVTEFVMYDIAHARHRVSTED- 442
Query: 374 GGGMQTSRQAARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILENSSWEPCQNLGGN 433
R + N GL + + + K + K+ ++S+ P
Sbjct: 443 ---SPDVRPDSSANPGLVGGSHEMGGNSDAVKLKHQPKYAFDRGHFDSSNPYP------- 492
Query: 434 DDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEV----GGWGMKFLKKVGIKKTV-E 488
A+ +L P+LE+AA++++ LP ++E + G G + + +V E
Sbjct: 493 ----SASAVLH----PDLEIAAVVIQ--LPFAKRESLKYKRGDKGSDEMHSNLLNLSVGE 542
Query: 489 TLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWD 548
+ + P + + ++IP G H P + GPS +++RWRSGG CDCGGWD
Sbjct: 543 QRRKTIPDKENPEN-------VKVVIPTGNHSLPSGDSRGPSSLLDRWRSGGGCDCGGWD 595
Query: 549 LGCPLTVLGT---RLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIH 605
+ CPLTV G + ++ E + D +LF+ G+K+ P L M + + Y +
Sbjct: 596 MACPLTVFGNPGIQCAEDEPLL-----DNQRPLELFLMGTKENIPALTMTVLEEGQYAVD 650
Query: 606 FHSTLSALQSFSIAVASIHSQSPT 629
FH+ LS LQ+FSI VA +H T
Sbjct: 651 FHAQLSTLQAFSICVAILHGTEAT 674
>gi|147811098|emb|CAN70168.1| hypothetical protein VITISV_006870 [Vitis vinifera]
Length = 922
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVL--GTRLSKKEAIYQA 569
+++P+G H P G PS ++ERW+SGG CDCGGWD+GC L VL + KK +
Sbjct: 769 VILPSGVHSLPN--KGEPSRLLERWKSGGSCDCGGWDMGCKLRVLVNQNQHRKKPSPPTT 826
Query: 570 DMHDECNSFDLF-VNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQS 627
D F+LF + G + P+ M + +D +Y F S LS LQ+FSI +A ++S++
Sbjct: 827 DR------FELFSLEGVEADEPIFSMSSFKDGIYSAEFSSPLSLLQAFSICIAVLNSRT 879
>gi|356527630|ref|XP_003532411.1| PREDICTED: uncharacterized protein LOC100811734 [Glycine max]
Length = 731
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 111/372 (29%), Positives = 173/372 (46%), Gaps = 49/372 (13%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHW---RK----GGQ 328
G L+ G P F F + K+VFVA +GN+ +++Y+FH RK G
Sbjct: 325 GNLKLENKNGVPFFEFKVKCPKDVFVAK--TWRTGNA--FNWVYTFHSIDNRKKSNVNGL 380
Query: 329 KEHEICDNESLLVGKMKVSISFNLCP-------DNSKIMETQFTLFGSNEDFGGGMQTSR 381
H+ D + +V +M VS NLC DNS M T+F L+ DF Q+ +
Sbjct: 381 GSHDF-DKDPSIVAQMLVSC--NLCSELEDKEFDNS--MVTEFVLY----DF---TQSRQ 428
Query: 382 QAARKNKGLSKKVVDAFRTTHSF----KRRTLSKFGGSSAILENSSWEPCQNLGGNDDSL 437
+ + K LS+ +D +T +F K TL + A+ +P + DD L
Sbjct: 429 SVSCEKKSLSE--LDGSKTPKAFHVGSKEETLG-LHENRAVKNKLQDKPLSSNVEFDD-L 484
Query: 438 GAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLK--KVGIKKTVETLKTSAP 495
L NLE A I+++ +P +++E +K+ + ++ K+ SA
Sbjct: 485 NFYPYLSTECYSNLENAVIVLQ--IPFSKRE-----SLKYKRGDRISAKEYSNRSDLSAV 537
Query: 496 SASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTV 555
+ + ++IP G HG P + GPS +++R R GG CDCGGWD+ CPL +
Sbjct: 538 KDRKSLHRHQIQEQVKVVIPTGNHGLPNAESHGPSSLLDRLRHGGGCDCGGWDMACPLIL 597
Query: 556 LGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQS 615
LG + + +E +LFV G+K +P M + + Y + FH+ LS LQ+
Sbjct: 598 LGN--PTIQFAEDCPLMEEHQPLELFVQGAKGSSPTFSMTRIEEGHYAVDFHAQLSTLQT 655
Query: 616 FSIAVASIHSQS 627
FSI VA +H S
Sbjct: 656 FSICVAILHGTS 667
>gi|15237799|ref|NP_200710.1| uncharacterized protein [Arabidopsis thaliana]
gi|10177630|dbj|BAB10778.1| unnamed protein product [Arabidopsis thaliana]
gi|51969718|dbj|BAD43551.1| putative protein [Arabidopsis thaliana]
gi|332009748|gb|AED97131.1| uncharacterized protein [Arabidopsis thaliana]
Length = 780
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 73/116 (62%), Gaps = 6/116 (5%)
Query: 509 SMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTR--LSKKEAI 566
S +++ +G H P+ GGPS +I+RWR+GG CDCGGWD+GC L +L + LS K++
Sbjct: 614 SATVILQSGVHSMPQ--KGGPSSLIQRWRTGGSCDCGGWDMGCNLRILTNQHNLSYKKSA 671
Query: 567 YQADMHDECNSFDLFVNGSK-QGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVA 621
++ N F+LF G + + P L +++ +Y + ++S+LS LQ+FSI +A
Sbjct: 672 T-SNSPPSSNRFELFFLGEQAEEHPFLSFKPIKEGIYSVAYNSSLSQLQAFSICMA 726
>gi|357520435|ref|XP_003630506.1| hypothetical protein MTR_8g098260 [Medicago truncatula]
gi|355524528|gb|AET04982.1| hypothetical protein MTR_8g098260 [Medicago truncatula]
Length = 1286
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 114/378 (30%), Positives = 170/378 (44%), Gaps = 52/378 (13%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQK------ 329
G L+ G P F F + ++VFVA +G +++Y+FH +K
Sbjct: 885 GNLKFENKNGMPFFEFKVKCPEDVFVAKTWRVGNG----FNWVYTFHSIDNKKKSNATGL 940
Query: 330 EHEICDNESLLVGKMKVSISFNLCP-------DNSKIMETQFTLFGSNEDFGGGMQTSRQ 382
E D +S +V KM VS NLC DNS M T+F L+ + SR
Sbjct: 941 EPHDFDKDSSIVAKMLVSS--NLCSEIKDNVFDNS--MMTEFVLYD--------LSHSRH 988
Query: 383 AARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILE--NSSWEPCQNLGGNDDSLGAA 440
+ K K V + +FK R S+ G E S L + DS
Sbjct: 989 SISSEK---KSVCEQ----GAFKTRKASRVGSMEETFEMVEESLAIKNKLQESGDS-KFF 1040
Query: 441 CLLENHLPPNLELAAIIVKEHLPENRQEEVG-GWGMKFLKKVGIKKTVETLKTSAPSASF 499
LL NLE+AAII++ +P +++E + G + K K+ L +
Sbjct: 1041 PLLPTESYSNLEVAAIILE--IPFSKRESLKYKRGDRLCDKEYSNKS--NLCAVVDQSRK 1096
Query: 500 ARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTR 559
+ + ++IP G+HG P GPS +++R R GG CDCGGWD+ CPL +LG
Sbjct: 1097 SLHDRKVQEQLKVVIPTGYHGLPNAECKGPSSLLDRLRHGGGCDCGGWDMACPLVLLGN- 1155
Query: 560 LSKKEAIYQADMH---DECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSF 616
+I A+ H ++ +LFV G+K+ +P M V Y + FH+ LS LQ+F
Sbjct: 1156 ----PSIQFAEDHSLIEDYQPLELFVQGAKESSPTFSMKMVEAGHYAVDFHAKLSTLQAF 1211
Query: 617 SIAVASIHSQSPTLRPKN 634
SI VA +H S + P++
Sbjct: 1212 SICVAILHGTSVSSGPEH 1229
>gi|297745975|emb|CBI16031.3| unnamed protein product [Vitis vinifera]
Length = 705
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 70/119 (58%), Gaps = 11/119 (9%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVL--GTRLSKKEAIYQA 569
+++P+G H P G PS ++ERW+SGG CDCGGWD+GC L VL + KK +
Sbjct: 552 VILPSGVHSLPN--KGEPSRLLERWKSGGSCDCGGWDMGCKLRVLVNQNQHRKKPSPPTT 609
Query: 570 DMHDECNSFDLF-VNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQS 627
D F+LF + G + P+ M + +D +Y F S LS LQ+FSI +A ++S++
Sbjct: 610 DR------FELFSLEGVEADEPIFSMSSFKDGIYSAEFSSPLSLLQAFSICIAVLNSRT 662
>gi|357112954|ref|XP_003558270.1| PREDICTED: uncharacterized protein LOC100841371 [Brachypodium
distachyon]
Length = 892
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 69/120 (57%), Gaps = 6/120 (5%)
Query: 513 LIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEA---IYQA 569
++P GFHG + +G P ++ERWRSGG CDCGGWD GC L+VL + +A +
Sbjct: 727 VLPNGFHGA--STSGQPLPLMERWRSGGSCDCGGWDEGCILSVLTADTQENKAYKSVPAN 784
Query: 570 DMHDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSP 628
D + FDL G S + MV+ ++ LY + F S+++ LQ+F++ + +H + P
Sbjct: 785 QTPDGSHRFDLLAQGRSGEDRQAFSMVSFKEGLYAVEFRSSIALLQAFAMCIVMLHDRYP 844
>gi|147815115|emb|CAN65660.1| hypothetical protein VITISV_000955 [Vitis vinifera]
Length = 937
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 169/374 (45%), Gaps = 54/374 (14%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEH---- 331
G L+ G P F FS+ ++V VA + + +++Y+FH + +K +
Sbjct: 527 GHLKLETKHGVPFFEFSLKHPEDVLVAKTWKV----NNAFNWVYTFHSIRNRKKSNASGW 582
Query: 332 --EICDNESLLVGKMKVSISFNLC---PDNSKI---METQFTLFGSNEDFGGGMQTSRQA 383
+ + E+ +VG+M+VS LC D K M T+F L+ D ++
Sbjct: 583 GLKDSNKETSMVGQMQVSCY--LCSELQDGGKFGNSMVTEFVLY----DIAHARKSI--T 634
Query: 384 ARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILENSSWEP----CQ-NLGGNDDSLG 438
AR+N S + + + ++ G S L++ S + CQ N N D
Sbjct: 635 ARENSCHSPQAAEPAKGSNE-------GLAGQSLELDDVSDDAVKLKCQTNHASNHDDFD 687
Query: 439 AACLLE---NHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIK----KTVETLK 491
AA +L P+LE+AAI+++ E R+ G KF KV VE K
Sbjct: 688 AAVPYPWAPANLHPDLEIAAIVIQVPF-EKRESLKYRRGEKFSNKVHSDFLDLSMVEQKK 746
Query: 492 TSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGC 551
S A+ + ++ +G HG P + GPS +++RWR GG CDCGGWD+ C
Sbjct: 747 KDVLDMSPAK--------VKVVTASGNHGLPSGDSRGPSSLLDRWRLGGGCDCGGWDMAC 798
Query: 552 PLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLS 611
PL V + + + +LFV G+K+ P L + V + Y + FH+ LS
Sbjct: 799 PLIVFDN--PSILXVEDCPLMEXPQPLELFVQGAKERVPALTITRVEEGQYAVDFHAQLS 856
Query: 612 ALQSFSIAVASIHS 625
LQ+FSI VA +HS
Sbjct: 857 TLQAFSICVAMLHS 870
>gi|225426422|ref|XP_002270125.1| PREDICTED: uncharacterized protein LOC100246220 [Vitis vinifera]
Length = 758
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 170/374 (45%), Gaps = 54/374 (14%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEH---- 331
G L+ G P F FS+ ++V VA + + +++Y+FH + +K +
Sbjct: 348 GHLKLETKHGVPFFEFSLKHPEDVLVAKTWKV----NNAFNWVYTFHSIRNRKKSNASGW 403
Query: 332 --EICDNESLLVGKMKVSISFNLC---PDNSKI---METQFTLFGSNEDFGGGMQTSRQA 383
+ + E+ +VG+M+VS LC D K M T+F L+ D ++
Sbjct: 404 GLKDSNKETSMVGQMQVSCY--LCSELQDGGKFGNSMVTEFVLY----DIAHARKSI--T 455
Query: 384 ARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAILENSSWEP----CQ-NLGGNDDSLG 438
AR+N S + + + ++ G S L++ S + CQ N N D
Sbjct: 456 ARENSCHSPQAAEPAKGSNE-------GLAGQSLELDDVSDDAVKLKCQTNHASNHDDFD 508
Query: 439 AACLLE---NHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIK----KTVETLK 491
AA +L P+LE+AAI+++ E R+ G KF KV VE K
Sbjct: 509 AAVPYPWAPANLHPDLEIAAIVIQVPF-EKRESLKYRRGEKFSNKVHSDFLDLSMVEQKK 567
Query: 492 TSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGC 551
S A+ + ++ +G HG P + GPS +++RWR GG CDCGGWD+ C
Sbjct: 568 KDVLDMSPAK--------VKVVTASGNHGLPSGDSRGPSSLLDRWRLGGGCDCGGWDMAC 619
Query: 552 PLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLS 611
PL V + + + + +LFV G+K+ P L + V + Y + FH+ LS
Sbjct: 620 PLIVFDNPSILR--VEDCPLMESPQPLELFVQGAKERVPALTITRVEEGQYAVDFHAQLS 677
Query: 612 ALQSFSIAVASIHS 625
LQ+FSI VA +HS
Sbjct: 678 TLQAFSICVAMLHS 691
>gi|356547281|ref|XP_003542044.1| PREDICTED: uncharacterized protein LOC100798889 [Glycine max]
Length = 874
Score = 89.0 bits (219), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 85/180 (47%), Gaps = 36/180 (20%)
Query: 449 PNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCST 508
PN ELAAI+VK K V + + S+ R
Sbjct: 670 PNDELAAIVVKS-----------------------AKAVNFINYAHQSS---RQNDSQDL 703
Query: 509 SMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQ 568
+ +++P G H P NGGPS +IERWR+GG CDCGGWD+ C L +L ++ +A +
Sbjct: 704 HVTVVLPTGVHSFPS--NGGPSSLIERWRTGGSCDCGGWDMACKLKILA---NESQACRK 758
Query: 569 ADMHDEC--NSFDLFVNGSKQ---GAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASI 623
+ + C + F+LF+ + Q P + +Y + F S+ S LQ+FSI +A +
Sbjct: 759 SRISKACFPHPFELFLQVNDQDLENQPAFSFSPFKPGVYSVAFDSSFSLLQAFSICIALV 818
>gi|30687315|ref|NP_181329.2| uncharacterized protein [Arabidopsis thaliana]
gi|26451005|dbj|BAC42609.1| unknown protein [Arabidopsis thaliana]
gi|28951047|gb|AAO63447.1| At2g37930 [Arabidopsis thaliana]
gi|330254373|gb|AEC09467.1| uncharacterized protein [Arabidopsis thaliana]
Length = 467
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 68/116 (58%), Gaps = 11/116 (9%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADM 571
I++P+G H P+ N P +I RW++GG CDCGGWD+GC L VL +K + +
Sbjct: 325 IILPSGVHTLPKDGNNSPLPLINRWKTGGECDCGGWDIGCKLRVLSYNHTKTQTL----- 379
Query: 572 HDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQS 627
+SF LF ++ P +MV+ D+L+ + F S++S L++F I++A QS
Sbjct: 380 ----SSFQLF--DQERDEPAFKMVSHGDELHSVEFGSSISLLEAFFISLAVTSHQS 429
>gi|15227553|ref|NP_180512.1| uncharacterized protein [Arabidopsis thaliana]
gi|3980384|gb|AAC95187.1| unknown protein [Arabidopsis thaliana]
gi|20260412|gb|AAM13104.1| unknown protein [Arabidopsis thaliana]
gi|31711766|gb|AAP68239.1| At2g29510 [Arabidopsis thaliana]
gi|330253168|gb|AEC08262.1| uncharacterized protein [Arabidopsis thaliana]
Length = 839
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 44/123 (35%), Positives = 70/123 (56%), Gaps = 9/123 (7%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADM 571
+++P+G H P GGPS +I+RW+S G CDCGGWD GC L +L + +K I +
Sbjct: 668 VVLPSGVHSLPH--KGGPSSLIQRWKSDGSCDCGGWDTGCNLRILTNQHNK--PINPSPT 723
Query: 572 HDECNSFDLFVNGSKQ---GAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSP 628
++F LF G Q P L R+ +Y + ++++LS LQ+FSI +A ++P
Sbjct: 724 --TSDAFKLFFQGGVQENNNQPYLSFTTYREGVYAVEYNTSLSLLQAFSICIAVNEGRNP 781
Query: 629 TLR 631
++
Sbjct: 782 LIK 784
>gi|356511494|ref|XP_003524461.1| PREDICTED: uncharacterized protein LOC100778683 [Glycine max]
Length = 744
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 115/385 (29%), Positives = 180/385 (46%), Gaps = 50/385 (12%)
Query: 264 QSSSSASTSTSL-GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFH 322
Q S AS+ L G L+ G F F + ++VFVA +GN+ +++Y+FH
Sbjct: 325 QYSGIASSPVHLHGNLKMENKHGVLFFEFKVKCPEDVFVAK--TWRTGNA--FNWVYTFH 380
Query: 323 W---RK----GGQKEHEICDNESLLVGKMKVSISF-----NLCPDNSKIMETQFTLFGSN 370
RK G H+ D +S +V +M VS + + DN M T+F L+
Sbjct: 381 SIDNRKKSNANGLGSHDF-DKDSSIVAQMLVSCNLGSELEDKVFDN--FMVTEFVLY--- 434
Query: 371 EDFGGGMQTSRQAARKNKGLSKKVVDAFRTTHS----FKRRTLSKFGGSSAILENSSWEP 426
DF Q+ +R+ K + +D +T + K TL G+ + +P
Sbjct: 435 -DFTHSRQS---VSREKKSFCE--LDGSKTPKASHVGLKEETLG-LDGNHVVKNKLQDKP 487
Query: 427 CQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
+ DD L + L PNLE+AAI+++ +P +++E +K+ K G +K+
Sbjct: 488 LSSSVEFDD-LNSYPYLSTECYPNLEIAAIVLQ--IPFSKRE-----SLKY--KRGERKS 537
Query: 487 V-ETLKTSAPSASFARDT---GGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCC 542
E S SA R + + ++IP G HG P + PS +++R R GG C
Sbjct: 538 AKECSNRSDLSAVKDRKSLHYDQIQEQVKVVIPTGNHGLPNAESQAPSSLLDRLRHGGGC 597
Query: 543 DCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLY 602
DCGGWD+ CPL +LG + + +E +LFV G+K +P M + + Y
Sbjct: 598 DCGGWDMACPLILLGN--PTIQFAEDCSLMEEHQPLELFVQGAKGSSPTFSMTRIEEGHY 655
Query: 603 FIHFHSTLSALQSFSIAVASIHSQS 627
+ FH+ LS LQ+FSI VA +H S
Sbjct: 656 AVDFHAQLSTLQAFSICVAILHGTS 680
>gi|356557400|ref|XP_003547004.1| PREDICTED: uncharacterized protein LOC100818834 [Glycine max]
Length = 872
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 68/119 (57%), Gaps = 10/119 (8%)
Query: 510 MDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQA 569
+ +++P G H P NGGPS +IERWR+GG CDCGGWD+ C L +L + +A ++
Sbjct: 703 VTVVLPTGVHSLPS--NGGPSSLIERWRTGGSCDCGGWDMACKLKILA---DESQACRKS 757
Query: 570 DMHDEC--NSFDLFV---NGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASI 623
+ C + F+LF+ + ++ P + +Y + F S+ S LQ+FSI +A +
Sbjct: 758 RISKACFPHPFELFLQVNDQDQENQPAFSFSPFKPGVYSVAFDSSFSLLQAFSICIALV 816
>gi|224138926|ref|XP_002322936.1| predicted protein [Populus trichocarpa]
gi|222867566|gb|EEF04697.1| predicted protein [Populus trichocarpa]
Length = 1005
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 69/120 (57%), Gaps = 7/120 (5%)
Query: 507 STSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTR--LSKKE 564
S+S +++P G H P G PS +I RWRSGG CDCGGWD+GC L +L R SK
Sbjct: 834 SSSATVILPGGNHSMPN--EGVPSPLIHRWRSGGSCDCGGWDVGCKLRILTNRSHCSKIT 891
Query: 565 AIYQADMHDECNSFDLFVNGS-KQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASI 623
++ + +C F+LF G+ +Q P+ + V +Y F S++S+LQ+F I V I
Sbjct: 892 RTSKSCLMSDC--FELFTQGAIQQDHPIFSLAQVEKGMYSTEFSSSISSLQAFFICVDVI 949
>gi|62319768|dbj|BAD93757.1| hypothetical protein [Arabidopsis thaliana]
Length = 200
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 68/123 (55%), Gaps = 9/123 (7%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADM 571
+++P+G H P GGPS +I+RW+S G CDCGGWD GC L +L + +K
Sbjct: 29 VVLPSGVHSLPH--KGGPSSLIQRWKSDGSCDCGGWDTGCNLRILTNQHNKP----INPS 82
Query: 572 HDECNSFDLFVNGSKQ---GAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQSP 628
++F LF G Q P L R+ +Y + ++++LS LQ+FSI +A ++P
Sbjct: 83 PTTSDAFKLFFQGGVQENNNQPYLSFTTYREGVYAVEYNTSLSLLQAFSICIAVNEGRNP 142
Query: 629 TLR 631
++
Sbjct: 143 LIK 145
>gi|15240877|ref|NP_195723.1| uncharacterized protein [Arabidopsis thaliana]
gi|30679093|ref|NP_850748.1| uncharacterized protein [Arabidopsis thaliana]
gi|6759426|emb|CAB69831.1| putative protein [Arabidopsis thaliana]
gi|20465615|gb|AAM20139.1| unknown protein [Arabidopsis thaliana]
gi|26983886|gb|AAN86195.1| unknown protein [Arabidopsis thaliana]
gi|332002906|gb|AED90289.1| uncharacterized protein [Arabidopsis thaliana]
gi|332002907|gb|AED90290.1| uncharacterized protein [Arabidopsis thaliana]
Length = 744
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 59/114 (51%), Gaps = 13/114 (11%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADM 571
++IP G H P G PS +I RWRSGG CDCGGWD+GC L VL + +
Sbjct: 616 VIIPGGVHSFPE--KGAPSPLISRWRSGGLCDCGGWDVGCKLHVLSNKTV---------L 664
Query: 572 HDECNSFDLFVN--GSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASI 623
H SF LF + +P L M ++ +Y + F S +S LQ+F + V +
Sbjct: 665 HKFNQSFTLFDQEVSEQDSSPALAMTELKTGIYRVEFGSFVSPLQAFFVCVTVL 718
>gi|297823665|ref|XP_002879715.1| hypothetical protein ARALYDRAFT_345557 [Arabidopsis lyrata subsp.
lyrata]
gi|297325554|gb|EFH55974.1| hypothetical protein ARALYDRAFT_345557 [Arabidopsis lyrata subsp.
lyrata]
Length = 574
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 11/116 (9%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADM 571
I++P+G H P+ N P +I RW+SGG CDCGGWD+GC L VL K +
Sbjct: 433 IILPSGVHTLPKDGNNAPLPLINRWKSGGECDCGGWDIGCKLRVLSNNHIKSQTF----- 487
Query: 572 HDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHSQS 627
+SF LF + A +MV+ D+L+ + F S++S L++F I++A QS
Sbjct: 488 ----SSFQLFDQERDEAA--FKMVSHGDELHSVEFGSSISLLEAFFISLAVTSHQS 537
>gi|224087591|ref|XP_002308193.1| predicted protein [Populus trichocarpa]
gi|222854169|gb|EEE91716.1| predicted protein [Populus trichocarpa]
Length = 979
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 99/186 (53%), Gaps = 17/186 (9%)
Query: 450 NLELAAIIVKEHLPENRQEEVGGWGMKFLKK---VGIKKTVETLKTSAPSASFARDTGGC 506
N ELAA++VK E+ G ++ + + K + + L S S + + T
Sbjct: 754 NRELAAVVVK------MLNEISGLDLRQTDQNDNLMHKGSSQCLPESQCSGNLGK-TEHS 806
Query: 507 STSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVL--GTRLSKKE 564
+++ +++P G H P G PS +I RWRSGG CDCGGWD+GC L +L G++ S+
Sbjct: 807 NSATTVILPGGNHSLPN--EGVPSPLIHRWRSGGSCDCGGWDVGCKLRILSNGSQCSEIP 864
Query: 565 AIYQADMHDECNSFDLFVNGS-KQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASI 623
++ + +C F+LF G+ +Q P+ + V +Y I F S++S+LQ+F I V I
Sbjct: 865 RTSKSCLMPDC--FELFSEGANQQDQPIFSLAQVEKGMYSIEFSSSISSLQAFFIGVTVI 922
Query: 624 HSQSPT 629
Q T
Sbjct: 923 SCQKST 928
>gi|297810299|ref|XP_002873033.1| hypothetical protein ARALYDRAFT_486976 [Arabidopsis lyrata subsp.
lyrata]
gi|297318870|gb|EFH49292.1| hypothetical protein ARALYDRAFT_486976 [Arabidopsis lyrata subsp.
lyrata]
Length = 760
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 63/114 (55%), Gaps = 13/114 (11%)
Query: 512 ILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADM 571
++IP H P G PS +I RWRS G CDCGGWD+GC L VL + +
Sbjct: 632 VIIPGDVHSFPE--KGAPSPLISRWRSEGLCDCGGWDVGCKLRVLSNKTV---------L 680
Query: 572 HDECNSFDLF-VNGSKQGA-PMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASI 623
H+ SF LF GS+Q + P+L M ++ +Y + F S++S LQ+F + V +
Sbjct: 681 HEFNQSFKLFDQEGSEQDSGPVLTMTELKTGIYRVEFGSSVSPLQAFFVCVTVL 734
>gi|224071896|ref|XP_002303590.1| predicted protein [Populus trichocarpa]
gi|222841022|gb|EEE78569.1| predicted protein [Populus trichocarpa]
Length = 734
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 164/376 (43%), Gaps = 54/376 (14%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEIC- 334
G L+ G P F FS++ +EV VA + + + +++Y+FH +K +
Sbjct: 328 GCLKMEIKHGVPFFEFSLNRPEEVLVAKTWKANNAS----NWVYTFHSISNRKKSNATGR 383
Query: 335 -----DNESLLVGKMKVSISFNLCP--------DNSKIMETQFTLFGSNEDFGGGMQTSR 381
+ ES +VG+M+VS LC DNS + T+F L+ +
Sbjct: 384 GLSDGNKESSVVGQMQVSCY--LCSELKDGGNFDNSLV--TEFVLYDNVHARQRVSTEES 439
Query: 382 QAARKNKGLSKKVV------DAFRTTHSFKRRTLSKFGGSSAILENSSWEPCQNLGGNDD 435
R + G +V D FK + F L++S+ P
Sbjct: 440 PGVRPDIGAKPGLVGGSHEMDGNSDAAKFKHQPQHAFDRGD--LDSSNPYPW-------- 489
Query: 436 SLGAACLLENHLPPNLELAAIIVKEHLPENRQEEVG-GWGMKFLKKVGIKKTVETLKTSA 494
AA +L P+LE+AA+++K LP ++E + G K K + + L
Sbjct: 490 ---AAAVLH----PDLEIAAVVIK--LPFAKRESLKYKRGDKGSDK--MHSNLLNLSVGE 538
Query: 495 PSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLT 554
RD + ++IP G H PR GPS +++RWRSGG CDCGGWD+ CPLT
Sbjct: 539 QRMKTIRDEEN-QEKVKVVIPTGKHSLPRGDGRGPSSLLDRWRSGGGCDCGGWDMACPLT 597
Query: 555 VLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQ-GAPMLRMVNVRDDLYFIHFHSTLSAL 613
V G + + D +LF+ G K+ P L M + + Y + FH+ LS L
Sbjct: 598 VFGN--PGIQCAEDEPLLDNQRPLELFLQGMKENNIPALTMTVLEEGQYAVDFHAQLSTL 655
Query: 614 QSFSIAVASIHSQSPT 629
Q+FSI VA +H T
Sbjct: 656 QAFSICVAVLHGTKAT 671
>gi|297796849|ref|XP_002866309.1| hypothetical protein ARALYDRAFT_496030 [Arabidopsis lyrata subsp.
lyrata]
gi|297312144|gb|EFH42568.1| hypothetical protein ARALYDRAFT_496030 [Arabidopsis lyrata subsp.
lyrata]
Length = 782
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 72/121 (59%), Gaps = 6/121 (4%)
Query: 509 SMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTR--LSKKEAI 566
S +++ +G H P+ GGPS +I+RWR+GG CDCGGWD+GC L +L + LS K +
Sbjct: 616 SATVILQSGVHSMPQ--KGGPSSLIQRWRTGGSCDCGGWDMGCNLRILTNQHNLSYKNST 673
Query: 567 YQADMHDECNSFDLFVNGSK-QGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIHS 625
N F+LF G + + P + +++ +Y + ++S+LS LQ+FSI +A S
Sbjct: 674 TSNSPPSS-NRFELFFLGEQPEELPFVSFKPIKEGIYSVVYNSSLSQLQAFSICMALAES 732
Query: 626 Q 626
+
Sbjct: 733 R 733
>gi|356495686|ref|XP_003516705.1| PREDICTED: uncharacterized protein LOC100776815 [Glycine max]
Length = 728
Score = 82.4 bits (202), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 166/375 (44%), Gaps = 56/375 (14%)
Query: 276 GVLQCMWDGGNPHFVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKE----- 330
G L+ G P F F + ++VFV + +SGN+ ++ Y+FH +K
Sbjct: 322 GNLKLKNKQGLPFFEFKVKCPEDVFV--VKTWKSGNA--FNWAYTFHSMDNRKKSTATDL 377
Query: 331 -HEICDNESLLVGKMKVSISFNLCP-------DNSKIMETQFTLFGSNEDFGGGMQTSRQ 382
CD +S +V +M VS N C DNS M ++F L+ +S+
Sbjct: 378 GSHCCDKDSSMVAQMLVSS--NSCSKLEGGMFDNS--MVSEFVLYDLT-------HSSKS 426
Query: 383 AARKNKGLSKKVVDAFRTTHSFKRRTLSKFGGSSAIL---ENSSWEPCQNLGGNDD--SL 437
+ + K S + H K S+ G E + + + L GN D
Sbjct: 427 VSPEKKRYSDQ--------HCSKTLKASRVGMKGETFRPDEETLFTKNKLLSGNADFDKS 478
Query: 438 GAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIK----KTVETLKTS 493
+ L L N E+AAI+++ +P ++E +K+ ++ I + L +
Sbjct: 479 NSYPLSSTELHSNPEMAAIVLQ--IPFRKRE-----SLKYKRRDRINAEEHSKLSDLSLA 531
Query: 494 APSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPL 553
+ + + +++P G HG P + GPS +++RW+ GG CDCGGWD+ CPL
Sbjct: 532 VDQSRKSLHDRKVLEQVKVVLPTGSHGLPSAESQGPSSLLDRWKHGGGCDCGGWDMACPL 591
Query: 554 TVLGTRLSKKEAIYQADMHDE-CNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSA 612
+LG + + H E + +LF G+K+ P M V + Y + FH+ LS
Sbjct: 592 ILLGNPTIQ---FAEGRTHMEGYQTLELFTQGAKERTPTFGMTMVEEGQYAVDFHANLSP 648
Query: 613 LQSFSIAVASIHSQS 627
LQ+FSI VA +H S
Sbjct: 649 LQAFSICVAILHGNS 663
>gi|297742521|emb|CBI34670.3| unnamed protein product [Vitis vinifera]
Length = 1085
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/263 (30%), Positives = 120/263 (45%), Gaps = 45/263 (17%)
Query: 375 GGMQTSRQAARKNKGLSKKVVDAFRTT--HSFKRRTLSKFG--GSSAILENSSWEP---- 426
GG+ + R NK K ++ F T H+ + S G G S L++ S +
Sbjct: 789 GGVTKGPASGRNNKTFRKSLLHDFANTTQHAAEPAKGSNEGLAGQSLELDDVSDDAVKLK 848
Query: 427 CQ-NLGGNDDSLGAACLLE---NHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVG 482
CQ N N D AA +L P+LE+AAI+++ + F K+
Sbjct: 849 CQTNHASNHDDFDAAVPYPWAPANLHPDLEIAAIVIQ---------------VPFEKRES 893
Query: 483 IKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCC 542
+K + R G + ++ +G HG P + GPS +++RWR GG C
Sbjct: 894 LK--------------YRR--GENPAKVKVVTASGNHGLPSGDSRGPSSLLDRWRLGGGC 937
Query: 543 DCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLY 602
DCGGWD+ CPL V + + + + +LFV G+K+ P L + V + Y
Sbjct: 938 DCGGWDMACPLIVFDNPSILR--VEDCPLMESPQPLELFVQGAKERVPALTITRVEEGQY 995
Query: 603 FIHFHSTLSALQSFSIAVASIHS 625
+ FH+ LS LQ+FSI VA +HS
Sbjct: 996 AVDFHAQLSTLQAFSICVAMLHS 1018
>gi|414864267|tpg|DAA42824.1| TPA: hypothetical protein ZEAMMB73_588556 [Zea mays]
Length = 769
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 101/223 (45%), Gaps = 39/223 (17%)
Query: 406 RRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPE 464
RR + + G+ + +S G N+ + +C E L P LE+AA +++
Sbjct: 513 RRIVEEKTGALQQPQRTSLTEENKTGRNNSDVSTSCPWSEEDLHPYLEIAATVIEVPFSR 572
Query: 465 NRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPR- 523
+ EEV G ++ ++ P+G HG P
Sbjct: 573 YKSEEV-------------------------------KNGSSPCTVKVVAPSGLHGSPSD 601
Query: 524 TRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRL--SKKEAIYQADMHDECNSFDLF 581
GGPS +++RWR GG CDCGGWD+ CP+ V+G S E+I H +LF
Sbjct: 602 DEAGGPSPLLDRWRYGGGCDCGGWDMACPIDVIGNTYDNSWAESIATNVKH----PMELF 657
Query: 582 VNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
V GSK+ P+L M + + FH LSALQ+FS+ ++ +H
Sbjct: 658 VQGSKEELPVLSMKENGRGQFVVDFHGRLSALQAFSVCISLLH 700
>gi|222624033|gb|EEE58165.1| hypothetical protein OsJ_09086 [Oryza sativa Japonica Group]
Length = 443
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 92/198 (46%), Gaps = 34/198 (17%)
Query: 428 QNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
QN N+ + +C E L P+LE+AA +++ +P NR
Sbjct: 211 QNSARNNSDVSTSCPWSEEDLYPHLEIAATVIQ--VPFNR-------------------- 248
Query: 487 VETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
S S +G ++ ++ P+G HG P PS +++RWR GG CDCGG
Sbjct: 249 ---------SKSKELKSGLSPGTVKVITPSGLHGLPNDDETSPSPLLDRWRYGGGCDCGG 299
Query: 547 WDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHF 606
WD+ CP+ VLG A + + N +LFV G K+ P L + +HF
Sbjct: 300 WDMACPIAVLGNAYDNNWA--NSVTKEGENPMELFVLGGKEELPALSIKGNEKGQLLVHF 357
Query: 607 HSTLSALQSFSIAVASIH 624
H+ LSA+Q+FS ++ +H
Sbjct: 358 HARLSAMQAFSTCISLLH 375
>gi|218191919|gb|EEC74346.1| hypothetical protein OsI_09644 [Oryza sativa Indica Group]
Length = 443
Score = 72.0 bits (175), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 92/198 (46%), Gaps = 34/198 (17%)
Query: 428 QNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
QN N+ + +C E L P+LE+AA +++ +P NR
Sbjct: 211 QNSARNNSDVSTSCPWSEEDLYPHLEIAATVIQ--VPFNR-------------------- 248
Query: 487 VETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
S S +G ++ ++ P+G HG P PS +++RWR GG CDCGG
Sbjct: 249 ---------SKSKELKSGLSPGTVKVITPSGLHGLPNDDETSPSPLLDRWRYGGGCDCGG 299
Query: 547 WDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHF 606
WD+ CP+ VLG A + + N +LFV G K+ P L + +HF
Sbjct: 300 WDMACPIAVLGNAYDNNWA--NSVTKEGENPMELFVLGGKEELPALSIKGNEKGQLLVHF 357
Query: 607 HSTLSALQSFSIAVASIH 624
H+ LSA+Q+FS ++ +H
Sbjct: 358 HARLSAMQAFSTCISLLH 375
>gi|24418033|gb|AAN60483.1| Hypothetical protein [Oryza sativa Japonica Group]
Length = 928
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 94/198 (47%), Gaps = 34/198 (17%)
Query: 428 QNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
QN N+ + +C E L P+LE+AA +++ +P NR + K LK
Sbjct: 505 QNSARNNSDVSTSCPWSEEDLYPHLEIAATVIQ--VPFNRSKS------KELK------- 549
Query: 487 VETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
+G ++ ++ P+G HG P PS +++RWR GG CDCGG
Sbjct: 550 ----------------SGLSPGTVKVITPSGLHGLPNDDETSPSPLLDRWRYGGGCDCGG 593
Query: 547 WDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHF 606
WD+ CP+ VLG A + + N +LFV G K+ P L + +HF
Sbjct: 594 WDMACPIAVLGNAYDNNWA--NSVTKEGENPMELFVLGGKEELPALSIKGNEKGQLLVHF 651
Query: 607 HSTLSALQSFSIAVASIH 624
H+ LSA+Q+FS ++ +H
Sbjct: 652 HARLSAMQAFSTCISLLH 669
>gi|357114468|ref|XP_003559022.1| PREDICTED: uncharacterized protein LOC100839140 [Brachypodium
distachyon]
Length = 734
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 71/127 (55%), Gaps = 6/127 (4%)
Query: 500 ARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTR 559
+++ G S ++ ++ P+G HG PS +++RWR GG CDCGGWD+ CPL VLG+
Sbjct: 541 SKENGPSSGTIKVVTPSGLHGLSNDNEASPSPLLDRWRYGGGCDCGGWDMACPLVVLGSE 600
Query: 560 LSKK--EAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFS 617
+++ + H +L V GSK+ P L M + FH+ LSALQ+FS
Sbjct: 601 YDNNWVDSVTKESKH----PMELLVQGSKEELPALSMKANGKGQLLVDFHARLSALQAFS 656
Query: 618 IAVASIH 624
+ ++ +H
Sbjct: 657 VCISLLH 663
>gi|115450161|ref|NP_001048681.1| Os03g0105800 [Oryza sativa Japonica Group]
gi|113547152|dbj|BAF10595.1| Os03g0105800 [Oryza sativa Japonica Group]
Length = 749
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 92/210 (43%), Gaps = 46/210 (21%)
Query: 428 QNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
QN N+ + +C E L P+LE+AA +++ +P NR
Sbjct: 505 QNSARNNSDVSTSCPWSEEDLYPHLEIAATVIQ--VPFNR-------------------- 542
Query: 487 VETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
S S +G ++ ++ P+G HG P PS +++RWR GG CDCGG
Sbjct: 543 ---------SKSKELKSGLSPGTVKVITPSGLHGLPNDDETSPSPLLDRWRYGGGCDCGG 593
Query: 547 WDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFV------------NGSKQGAPMLRM 594
WD+ CP+ VLG A + + N +LFV G K+ P L +
Sbjct: 594 WDMACPIAVLGNAYDNNWA--NSVTKEGENPMELFVLSKNEQVVIFFEQGGKEELPALSI 651
Query: 595 VNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
+HFH+ LSA+Q+FS ++ +H
Sbjct: 652 KGNEKGQLLVHFHARLSAMQAFSTCISLLH 681
>gi|218184981|gb|EEC67408.1| hypothetical protein OsI_34576 [Oryza sativa Indica Group]
Length = 870
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 147/352 (41%), Gaps = 74/352 (21%)
Query: 289 FVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNE-------SLLV 341
F F +E AN+ ++GN ++++Y+FH G++ + +V
Sbjct: 507 FEFCTKGPEESIYANIW--KAGNE--LNWIYTFH--STGKRTSTVGKTPKDRRGCLPPIV 560
Query: 342 GKMKVS------ISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVV 395
G+M VS + N +NS I E F L+ D ++S V
Sbjct: 561 GQMHVSSYLYSEVGQNGVLNNSAISE--FVLY----DIAHARRSS-------------AV 601
Query: 396 DAFRTTHSFKRRTLSKFGGS---SAILENSSWEPCQNLGGNDDSLGAACLLENHLPPNLE 452
+ + T S K + S S ++ N+ E N N D+ ++ L P+LE
Sbjct: 602 ERIQCTDSSKPKFCSAVNNSISRGSLERNNLMERQNNTRNNSDASTSSLWSREDLHPHLE 661
Query: 453 LAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDI 512
+AAI+++ + + +E+ G S ++ +
Sbjct: 662 VAAIVIQVPFHKTQSKEL-------------------------------KDGSSSGTIKV 690
Query: 513 LIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMH 572
G HG PR PS +++R ++GG CDCGGWD+ CP+ VL + + M
Sbjct: 691 AAAGGAHGLPRDDESSPSPLLDRLKTGGGCDCGGWDMSCPIVVLDN--AYDSHWVDSVMT 748
Query: 573 DECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
+ + +L G+K+ P + M V + + + FH+ LSALQ+FS+ + +H
Sbjct: 749 ESKHPMELPFQGNKEALPAISMKAVGNGHFSVDFHARLSALQAFSVCICLLH 800
>gi|110289538|gb|ABG66240.1| hypothetical protein LOC_Os10g40324 [Oryza sativa Japonica Group]
Length = 1369
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 147/352 (41%), Gaps = 74/352 (21%)
Query: 289 FVFSIDDRKEVFVANLSNSESGNSKSMDYMYSFHWRKGGQKEHEICDNE-------SLLV 341
F F +E AN+ ++GN ++++Y+FH G++ + +V
Sbjct: 701 FEFCTKGPEESIYANIW--KAGNE--LNWIYTFH--STGKRTSTVGKTPKDRRGCLPPIV 754
Query: 342 GKMKVS------ISFNLCPDNSKIMETQFTLFGSNEDFGGGMQTSRQAARKNKGLSKKVV 395
G+M VS + N +NS I E F L+ D ++S V
Sbjct: 755 GQMHVSSYLYSEVGQNGVLNNSAISE--FVLY----DIAHARRSS-------------AV 795
Query: 396 DAFRTTHSFKRRTLSKFGGS---SAILENSSWEPCQNLGGNDDSLGAACLLENHLPPNLE 452
+ + T S K + S S ++ N+ E N N D+ ++ L P+LE
Sbjct: 796 ERIQCTDSSKPKFCSAVNNSISRGSLERNNLMERQNNTRNNSDASTSSLWSREDLHPHLE 855
Query: 453 LAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDI 512
+AAI+++ + + +E+ G S ++ +
Sbjct: 856 VAAIVIQVPFHKTQSKEL-------------------------------KDGSSSGTIKV 884
Query: 513 LIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKKEAIYQADMH 572
G HG PR PS +++R ++GG CDCGGWD+ CP+ VL + + M
Sbjct: 885 AAAGGAHGLPRDDESSPSPLLDRLKTGGGCDCGGWDMSCPIVVLDN--AYDSHWVDSVMT 942
Query: 573 DECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVASIH 624
+ + +L G+K+ P + M V + + + FH+ LSALQ+FS+ + +H
Sbjct: 943 ESKHPMELPFQGNKEALPAISMKAVGNGHFSVDFHARLSALQAFSVCICLLH 994
>gi|222613233|gb|EEE51365.1| hypothetical protein OsJ_32383 [Oryza sativa Japonica Group]
Length = 730
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 92/204 (45%), Gaps = 33/204 (16%)
Query: 421 NSSWEPCQNLGGNDDSLGAACLLENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKK 480
N+ E N N D+ ++ L P+LE+AAI+++ + + +E+
Sbjct: 490 NNLMERQNNTRNNSDASTSSLWSREDLHPHLEVAAIVIQVPFHKTQSKELKD-------- 541
Query: 481 VGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGG 540
G S ++ + G HG PR PS +++R ++GG
Sbjct: 542 -----------------------GSSSGTIKVAAAGGAHGLPRDDESSPSPLLDRLKTGG 578
Query: 541 CCDCGGWDLGCPLTVLGTRLSKKEAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDD 600
CDCGGWD+ CP+ VL + + M + + +L G+K+ P + M V +
Sbjct: 579 GCDCGGWDMSCPIVVLDN--AYDSHWVDSVMTESKHPMELPFQGNKEALPAISMKAVGNG 636
Query: 601 LYFIHFHSTLSALQSFSIAVASIH 624
+ + FH+ LSALQ+FS+ + +H
Sbjct: 637 HFSVDFHARLSALQAFSVCICLLH 660
>gi|224059406|ref|XP_002299841.1| predicted protein [Populus trichocarpa]
gi|222847099|gb|EEE84646.1| predicted protein [Populus trichocarpa]
Length = 799
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 27/109 (24%)
Query: 449 PNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCST 508
PN ELAAI+VK +PE + + I+ +T K + S + T G
Sbjct: 706 PNDELAAIVVK--IPE------------VINRSTIRDGNQTNKCNNYSEARCNSTSGNVQ 751
Query: 509 SMDIL-----------IPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
+ +L +P+G H P GGPS +++RWRSGG CDCGG
Sbjct: 752 NQPVLGSQSLINTTVILPSGIHSLPN--KGGPSSLLQRWRSGGSCDCGG 798
>gi|326517368|dbj|BAK00051.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 764
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 67/123 (54%), Gaps = 8/123 (6%)
Query: 506 CST--SMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLGTRLSKK 563
CS+ ++ ++ P+G HG PS +++RWR GG CDCGGWD+ CPL VL
Sbjct: 574 CSSAGTIKVVTPSGLHGLSSDNESSPSSLLDRWRYGGGCDCGGWDMACPLVVLENAYDDN 633
Query: 564 --EAIYQADMHDECNSFDLFVNGSKQGAPMLRMVNVRDDLYFIHFHSTLSALQSFSIAVA 621
+++ + H +L V GSK+ P + M + FH+ LSALQ+FS+ ++
Sbjct: 634 WVDSVTKESKH----PMELLVQGSKEELPAVSMKANGKGQLLVDFHARLSALQAFSVCIS 689
Query: 622 SIH 624
+H
Sbjct: 690 LLH 692
>gi|108705734|gb|ABF93529.1| expressed protein [Oryza sativa Japonica Group]
gi|215694971|dbj|BAG90162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 667
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 63/131 (48%), Gaps = 32/131 (24%)
Query: 428 QNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPENRQEEVGGWGMKFLKKVGIKKT 486
QN N+ + +C E L P+LE+AA +++ +P NR + K LK
Sbjct: 505 QNSARNNSDVSTSCPWSEEDLYPHLEIAATVIQ--VPFNRSKS------KELK------- 549
Query: 487 VETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPRTRNGGPSGIIERWRSGGCCDCGG 546
+G ++ ++ P+G HG P PS +++RWR GG CDCGG
Sbjct: 550 ----------------SGLSPGTVKVITPSGLHGLPNDDETSPSPLLDRWRYGGGCDCGG 593
Query: 547 WDLGCPLTVLG 557
WD+ CP+ VLG
Sbjct: 594 WDMACPIAVLG 604
>gi|414864266|tpg|DAA42823.1| TPA: hypothetical protein ZEAMMB73_588556, partial [Zea mays]
Length = 640
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 67/154 (43%), Gaps = 33/154 (21%)
Query: 406 RRTLSKFGGSSAILENSSWEPCQNLGGNDDSLGAACLL-ENHLPPNLELAAIIVKEHLPE 464
RR + + G+ + +S G N+ + +C E L P LE+AA +++
Sbjct: 458 RRIVEEKTGALQQPQRTSLTEENKTGRNNSDVSTSCPWSEEDLHPYLEIAATVIEVPFSR 517
Query: 465 NRQEEVGGWGMKFLKKVGIKKTVETLKTSAPSASFARDTGGCSTSMDILIPAGFHGGPR- 523
+ EEV G ++ ++ P+G HG P
Sbjct: 518 YKSEEV-------------------------------KNGSSPCTVKVVAPSGLHGSPSD 546
Query: 524 TRNGGPSGIIERWRSGGCCDCGGWDLGCPLTVLG 557
GGPS +++RWR GG CDCGGWD+ CP+ V+G
Sbjct: 547 DEAGGPSPLLDRWRYGGGCDCGGWDMACPIDVIG 580
>gi|212723752|ref|NP_001132902.1| uncharacterized protein LOC100194401 [Zea mays]
gi|194695712|gb|ACF81940.1| unknown [Zea mays]
Length = 126
Score = 39.7 bits (91), Expect = 5.6, Method: Composition-based stats.
Identities = 19/77 (24%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
Query: 553 LTVLGTRLSKKEAIYQADMHDECNSFDLFVNG-SKQGAPMLRMVNVRDDLYFIHFHSTLS 611
L+VL + +A+ D F+L G S++ M + ++ LY + F S+++
Sbjct: 2 LSVLTDAAEENDAVQANQAKDGSQRFELLAQGGSREDRHAFSMTSFKEGLYTVEFRSSIA 61
Query: 612 ALQSFSIAVASIHSQSP 628
LQ+F++ + +H + P
Sbjct: 62 LLQAFAMCIVMLHGRYP 78
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.131 0.391
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,878,520,686
Number of Sequences: 23463169
Number of extensions: 415341408
Number of successful extensions: 1096054
Number of sequences better than 100.0: 107
Number of HSP's better than 100.0 without gapping: 95
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 1095635
Number of HSP's gapped (non-prelim): 199
length of query: 637
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 488
effective length of database: 8,863,183,186
effective search space: 4325233394768
effective search space used: 4325233394768
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)