BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 012850
(455 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224110284|ref|XP_002315471.1| predicted protein [Populus trichocarpa]
gi|222864511|gb|EEF01642.1| predicted protein [Populus trichocarpa]
Length = 466
Score = 639 bits (1649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/467 (68%), Positives = 370/467 (79%), Gaps = 13/467 (2%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M KLRWAM+G FW+LD+STP++LEG RAVPG P PLG+SRGTRLSRPKQIDF QRFM
Sbjct: 1 MNKLRWAMDGGFWDLDRSTPRTLEGEGRAVPGEPLPLGVSRGTRLSRPKQIDFFQRFMFA 60
Query: 61 PFLPSFSAGT--LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ-GELSKSRCLHA-- 115
PF+PS+SA + L LQRVL F+ DWFATLL QFN+ KFVSS K+ G L SR +
Sbjct: 61 PFIPSYSASSHGLSLQRVLALPFTQDWFATLLAQFNLQKFVSSFKKNGALQSSRLENIRK 120
Query: 116 -LRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTS--RKKAVFCHK--FPNHNFTLEA 170
L DKSLYALGFCSELL + DTLLLS D YG +N RKKA+F HK FPNHN +EA
Sbjct: 121 HLEDKSLYALGFCSELLLSPCDTLLLSLDFYGDDNNKKPRKKAIFHHKACFPNHNLNVEA 180
Query: 171 LWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPA 230
+WPGL++DK GNYWDVPFSMA DLAS+ASDSG SYH MHH++G P Q GDE + VPA
Sbjct: 181 VWPGLYIDKAGNYWDVPFSMAIDLASLASDSGASYHFCMHHSAGQPMQLGGDE-TVEVPA 239
Query: 231 SLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDN 290
+LLPG SLK+A + KKNV+IWRS AQKLKMVQP+DIFLS+PH+SASG+IGAA+ A GDN
Sbjct: 240 TLLPGISLKSAFSLKKNVEIWRSNAQKLKMVQPFDIFLSNPHISASGVIGAAVMACFGDN 299
Query: 291 SVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDF 350
SVR V ++S GF LHA +KS+ D F SV+FTAQHGNFQRL DLTRFHARLDF
Sbjct: 300 SVRPQVVDESQQFEGFCLHAPAVKSTLLVDTFSSVSFTAQHGNFQRLLLDLTRFHARLDF 359
Query: 351 PSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLK 410
PS SKFLSGA RLAQD+ NSQRP++ETVQA+CPK S QQQIAGPFSFRVDSGV ID K
Sbjct: 360 PSASKFLSGAIRLAQDFCNSQRPTMETVQAICPKATVSFQQQIAGPFSFRVDSGVEIDWK 419
Query: 411 NRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
N+D +C D VFAIEYAL VLGSAKAVAWY+PKH+EFM+ELRFFE+
Sbjct: 420 NKDWHMCVDDPVFAIEYALHVLGSAKAVAWYSPKHREFMVELRFFET 466
>gi|359477333|ref|XP_003631965.1| PREDICTED: uncharacterized protein LOC100853208 [Vitis vinifera]
gi|297736817|emb|CBI26018.3| unnamed protein product [Vitis vinifera]
Length = 465
Score = 632 bits (1629), Expect = e-178, Method: Compositional matrix adjust.
Identities = 314/465 (67%), Positives = 366/465 (78%), Gaps = 12/465 (2%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
MKKLRWAM+G FWELD ST +L+G ARAVP +P PLG+SRGTRLSRP QIDF QRFM+
Sbjct: 1 MKKLRWAMDGGFWELDMSTATTLDGVARAVPDDPLPLGLSRGTRLSRPMQIDFFQRFMSM 60
Query: 61 PFLPSFSAGT--LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGEL---SKSRCLHA 115
PF+PS S T LQR F F+ +WFA+LLGQFN KFVSS+K+G L S+S L
Sbjct: 61 PFVPSSSISTHGFSLQRGFTFPFTENWFASLLGQFNFQKFVSSVKEGRLLQPSESSWLQG 120
Query: 116 L----RDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEAL 171
+ DKSLYALG SELL T DDTLL+S ++YG RKKAVF HKFPNHN +EA+
Sbjct: 121 IGRRFSDKSLYALGLSSELLITPDDTLLVSLEAYGDKKVPRKKAVFLHKFPNHNLMVEAV 180
Query: 172 WPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPAS 231
WPGLFVDK G YWDVP SMA DLASVASDSG SYHL++HHN+G P QF+G++ + VPA+
Sbjct: 181 WPGLFVDKFGTYWDVPLSMAIDLASVASDSGASYHLSVHHNTGTPKQFDGNQ-THEVPAT 239
Query: 232 LLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNS 291
LLPG K A A KKN+D+WRSKAQKLKMVQP+DIFLS+PH+S SGIIGAA TA +GDNS
Sbjct: 240 LLPGLCAKGAFALKKNIDLWRSKAQKLKMVQPFDIFLSNPHISFSGIIGAAGTACLGDNS 299
Query: 292 VRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFP 351
VR VE++S G GF LH +KS+ ADIF SVAFTAQHGNFQRLF DLTRF+ARLDFP
Sbjct: 300 VRVQVEDESHGFKGFKLHLPRVKSALVADIFASVAFTAQHGNFQRLFLDLTRFYARLDFP 359
Query: 352 SGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKN 411
SGSKFL+G TRL QD +NSQ+PSLE QA+CP S+QQQI GPFSFR+DSGV ++LKN
Sbjct: 360 SGSKFLAGTTRLTQDLYNSQQPSLEAFQAICPTATLSLQQQIVGPFSFRIDSGVAVNLKN 419
Query: 412 RD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
R+ I + VFAIEYALQVLGSAKA+AWY+PKH+EFM+ELRFFE
Sbjct: 420 REWHIDVDEPVFAIEYALQVLGSAKAIAWYSPKHEEFMVELRFFE 464
>gi|147865781|emb|CAN81147.1| hypothetical protein VITISV_020812 [Vitis vinifera]
Length = 1050
Score = 632 bits (1629), Expect = e-178, Method: Compositional matrix adjust.
Identities = 313/466 (67%), Positives = 366/466 (78%), Gaps = 12/466 (2%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
MKKLRWAM+G FWELD ST +L+G ARAVP +P PLG+SRGTRLSRP QIDF QRFM+
Sbjct: 586 MKKLRWAMDGGFWELDISTATTLDGVARAVPDDPLPLGLSRGTRLSRPMQIDFFQRFMSM 645
Query: 61 PFLPSFSAGT--LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGEL---SKSRCLHA 115
PF+PS S T LQR F F+ +WFA+LLGQFN KFVSS+K+G L S+S L
Sbjct: 646 PFVPSSSISTHGFSLQRGFTFPFTENWFASLLGQFNFQKFVSSVKEGRLLQPSESSWLQG 705
Query: 116 L----RDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEAL 171
+ DKSLYALG SELL T DDTLL+S ++YG RKKAVF HKFPNHN +EA+
Sbjct: 706 IGRRFSDKSLYALGLSSELLITPDDTLLVSLEAYGDKKVPRKKAVFLHKFPNHNLMVEAV 765
Query: 172 WPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPAS 231
WPGLFVDK G YWDVP SMA DLASVASDSG SYHL++HHN+G P QF+G++ + VPA+
Sbjct: 766 WPGLFVDKFGTYWDVPLSMAIDLASVASDSGASYHLSVHHNTGTPKQFDGNQ-THEVPAT 824
Query: 232 LLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNS 291
LLPG K A A KKN+D+WRSKAQKLKMVQP+DIFLS+PH+S SGIIGAA TA +GDNS
Sbjct: 825 LLPGLCAKGAFALKKNIDLWRSKAQKLKMVQPFDIFLSNPHISFSGIIGAAGTACLGDNS 884
Query: 292 VRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFP 351
VR VE++S G G LH +KS+ ADIF SVAFTAQHGNFQRLF DLTRF+ARLDFP
Sbjct: 885 VRVQVEDESHGFKGLKLHLPRVKSALLADIFASVAFTAQHGNFQRLFLDLTRFYARLDFP 944
Query: 352 SGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKN 411
SGSKFL+G TRL QD +NSQ+PSLE QA+CP S+QQQI GPFSFR+DSGV ++LKN
Sbjct: 945 SGSKFLAGTTRLTQDLYNSQQPSLEAFQAICPTATLSLQQQIVGPFSFRIDSGVAVNLKN 1004
Query: 412 RD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
R+ I + VFAIEYALQVLGSAKA+AWY+PKH+EFM+ELRFFE+
Sbjct: 1005 REWHIDVDEPVFAIEYALQVLGSAKAIAWYSPKHEEFMVELRFFET 1050
>gi|255556596|ref|XP_002519332.1| conserved hypothetical protein [Ricinus communis]
gi|223541647|gb|EEF43196.1| conserved hypothetical protein [Ricinus communis]
Length = 466
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 301/467 (64%), Positives = 353/467 (75%), Gaps = 13/467 (2%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
MKKL WAM+G FW++D STP +LEG AR VPG+P PLGISRGT+LSRPKQ+ F QRFM++
Sbjct: 1 MKKLGWAMDGGFWDVDVSTPVTLEGVARPVPGDPLPLGISRGTKLSRPKQLHFFQRFMSS 60
Query: 61 PFLPSFS----AGTLQLQRVLPF-QFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHA 115
PF+PSFS LQ VL F FS +WF TLLGQFN KFVS S S L+
Sbjct: 61 PFIPSFSSHRHGHGFSLQSVLAFPTFSQNWFGTLLGQFNFQKFVSESGASLRSSSSSLNT 120
Query: 116 ----LRDKSLYALGFCSELLFTDDDTLLLSYDSYGH-NNTSRKKAVFCHKFPNHNFTLEA 170
L D SLYALGF SEL FT DDTLL +++++G SRKKAV HKFPNHN TLEA
Sbjct: 121 IGRQLCDASLYALGFSSELSFTPDDTLLFNFNTHGDATRNSRKKAVLHHKFPNHNLTLEA 180
Query: 171 LWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPA 230
+ P LF+D NYWDVPFSMA DLAS+ SDSGPSYH MHHN+G P F G ++ AVPA
Sbjct: 181 VSPALFIDSSDNYWDVPFSMAIDLASLPSDSGPSYHFCMHHNAGDPKLFGGG-HTLAVPA 239
Query: 231 SLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDN 290
LLPGFS K+A A+KKN+DIWRSKA KLKMVQP+D+F+S+PH+SASGI+GA MTA IGDN
Sbjct: 240 PLLPGFSFKSAFAFKKNIDIWRSKAPKLKMVQPFDLFISNPHISASGIVGATMTAYIGDN 299
Query: 291 SVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDF 350
SVR + TG L A KS+ AD+F SV+ TAQHGNFQRLF DLTR H RLDF
Sbjct: 300 SVRCQEVDRFQSFTGLSLRAIAPKSALLADMFSSVSLTAQHGNFQRLFLDLTRCHLRLDF 359
Query: 351 PSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLK 410
PSGSKFLSGA +LAQD+FNSQ+PS+ET++A+CP S+QQQIAGPFSFRVDSGV ID +
Sbjct: 360 PSGSKFLSGAAKLAQDFFNSQQPSMETIKAICPNATISLQQQIAGPFSFRVDSGVAIDRR 419
Query: 411 NR--DICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
+ D+ HD VFA+EYALQVLGSAKA+AWY PK +EFM+ELRFFE+
Sbjct: 420 KKDWDMHMHDPVFAVEYALQVLGSAKAIAWYCPKQKEFMVELRFFET 466
>gi|357449221|ref|XP_003594887.1| hypothetical protein MTR_2g035830 [Medicago truncatula]
gi|355483935|gb|AES65138.1| hypothetical protein MTR_2g035830 [Medicago truncatula]
Length = 468
Score = 560 bits (1443), Expect = e-157, Method: Compositional matrix adjust.
Identities = 284/468 (60%), Positives = 346/468 (73%), Gaps = 13/468 (2%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
M+KLRW M+G FW+LD STPK+L+GSA VP +P LG+SRGTRLSRP+Q+ FMQ FM
Sbjct: 1 MRKLRWVMDGGGFWDLDISTPKTLDGSACPVPEDPLTLGLSRGTRLSRPRQLQFMQLFMN 60
Query: 60 TPFLPSFSAGT-LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRD 118
P P+FS LQRVL FS +WF LLGQFN+ KFVSS+K + +R L++
Sbjct: 61 APLHPTFSQPQGFSLQRVLSLPFSDNWFVFLLGQFNLQKFVSSVKSSDEKPARFSSWLKN 120
Query: 119 -------KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEAL 171
KSLYALG CSE T DDTLL DSY + + R KAVF HKFP+H+ T+EA+
Sbjct: 121 FGRHLQQKSLYALGLCSEFQLTPDDTLLFGLDSYDYTDKPRGKAVFHHKFPHHDLTVEAV 180
Query: 172 WPGLFVDKCGNYWDVPFSMAADLASVA-SDSGPSYHLTMHHNSGLPTQFEGDE-NSSAVP 229
+PGLFVDK GNYWDVP SMA DLAS+ SDS YHL+ H+NSG P QFE + + VP
Sbjct: 181 YPGLFVDKIGNYWDVPLSMAVDLASLTTSDSSAGYHLSAHYNSGSPKQFESIQYENDRVP 240
Query: 230 ASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGD 289
+LLPG + K+ +Y+KN+DIWRS+ KLK++QPYDIFLS+P +SASG+IGAA TA G+
Sbjct: 241 QTLLPGLAFKSVFSYRKNLDIWRSETPKLKLIQPYDIFLSNPLLSASGMIGAAATAYFGE 300
Query: 290 NSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLD 349
NS R+ +++D S+G L A IKSSF ADIFGSV+FTAQHGNFQRLF DL+RF ARLD
Sbjct: 301 NSTRAQIDDDGQQSSGLFLQAPGIKSSFLADIFGSVSFTAQHGNFQRLFLDLSRFQARLD 360
Query: 350 FPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDL 409
FPSGSKFL GAT LAQD NSQ+P++E VQA+CP S+QQQI GP SFRVDSG+ IDL
Sbjct: 361 FPSGSKFLLGATSLAQDLLNSQKPNMEAVQAICPNATLSLQQQIVGPISFRVDSGIAIDL 420
Query: 410 KNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
KN I A D ++A+E+ALQVLGSAKAVAW PK QEFM+ELRF+E+
Sbjct: 421 KNHKWPIQAQDPIYAVEWALQVLGSAKAVAWCCPKRQEFMVELRFYET 468
>gi|297833468|ref|XP_002884616.1| PDE320 [Arabidopsis lyrata subsp. lyrata]
gi|297330456|gb|EFH60875.1| PDE320 [Arabidopsis lyrata subsp. lyrata]
Length = 479
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 274/480 (57%), Positives = 339/480 (70%), Gaps = 26/480 (5%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M +LRW EG+ W+LD STP +LEG ARAVP +P PLG+SRGTRLSRPKQ++F RFM +
Sbjct: 1 MNRLRWVEEGDIWDLDMSTPATLEGIARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMTS 60
Query: 61 PFLPSFS----------AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ----GE 106
P +PSFS G LQRVL FS +W +LLGQF+V +FVS +++ G
Sbjct: 61 PLIPSFSPIRPNTGDGGCGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVSEIEKTKTFGR 120
Query: 107 LSKSRCL-------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHN--NTSRKKAVF 157
S S L+DKSLYALGFCSE L + DDTLLLSYD+Y + R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDTYKGDLEKNPRAKAIF 180
Query: 158 CHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPT 217
H+FP HN T EA WPGLFVDK G YWDVP SMA DLAS+ ++SGPSYH+ +HHNSG P
Sbjct: 181 NHEFPLHNLTAEAAWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHVCLHHNSGAPK 240
Query: 218 QFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASG 277
+ D P SLLPG SLK+AV+Y+ N+D+WR KL+ +PYD+FLSSPHV+ SG
Sbjct: 241 KLNSD-TMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCKPYDVFLSSPHVAVSG 299
Query: 278 IIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRL 337
IIG+ MTA+ G+NS+RS ENDS+G GF LH + S F AD G + TAQ+GNFQ+
Sbjct: 300 IIGSVMTAAFGENSIRSKSENDSEGVGGFSLHFPSVNSGFMADALGRASLTAQYGNFQKP 359
Query: 338 FSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
F DLTRFHARLD+P G +FL+GAT +A D NS++PSLE Q +CP+V S+QQQI GPF
Sbjct: 360 FFDLTRFHARLDYPHGLRFLTGATSVAHDLLNSRQPSLEAFQKICPEVLVSLQQQIVGPF 419
Query: 398 SFRVDSGVVIDLKN--RDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
SF+V+SG+ IDLKN + +VFAIEYALQVL SAKAVAWY+PK +EFM+EL FFE+
Sbjct: 420 SFKVESGIQIDLKNGANPVTVDKTVFAIEYALQVLLSAKAVAWYSPKQKEFMVELSFFET 479
>gi|18397783|ref|NP_566296.1| protein pigment defective 320 [Arabidopsis thaliana]
gi|6729003|gb|AAF27000.1|AC016827_11 hypothetical protein [Arabidopsis thaliana]
gi|14194139|gb|AAK56264.1|AF367275_1 AT3g06960/F17A9_11 [Arabidopsis thaliana]
gi|16323342|gb|AAL15384.1| AT3g06960/F17A9_11 [Arabidopsis thaliana]
gi|332640959|gb|AEE74480.1| protein pigment defective 320 [Arabidopsis thaliana]
Length = 479
Score = 553 bits (1424), Expect = e-155, Method: Compositional matrix adjust.
Identities = 274/480 (57%), Positives = 340/480 (70%), Gaps = 26/480 (5%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M ++RW EG+ W+LD STP +LEG+ARAVP +P PLG+SRGTRLSRPKQ++F RFMA+
Sbjct: 1 MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60
Query: 61 PFLPSFS----------AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ----GE 106
P +PSFS G LQRVL FS +W +LLGQF+V +FV+ + + G
Sbjct: 61 PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120
Query: 107 LSKSRCL-------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHN--NTSRKKAVF 157
S S L+DKSLYALGFCSE L + DDTLLLSYD+Y + R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180
Query: 158 CHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPT 217
H+FP HN T EA+WPGLFVDK G YWDVP SMA DLAS+ ++SGPSYHL +HHNSG P
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240
Query: 218 QFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASG 277
+ D P SLLPG SLK+AV+Y+ N+D+WR KL+ +PYD+FLSSPHV+ SG
Sbjct: 241 KLHSD-TMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCKPYDVFLSSPHVAVSG 299
Query: 278 IIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRL 337
IIG+ MTA+ G+NS+RS ENDS+G GF LH + S F AD G + TAQ+GNFQ+
Sbjct: 300 IIGSVMTAAFGENSIRSKFENDSEGVGGFSLHFPSVNSGFMADALGRASLTAQYGNFQKF 359
Query: 338 FSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
F DLTRFHARLDFP G +FL+GAT +AQD NS++PSLE Q +CP+V S+QQQI GPF
Sbjct: 360 FFDLTRFHARLDFPHGLRFLTGATSVAQDLLNSRQPSLEAFQKICPEVLVSLQQQIVGPF 419
Query: 398 SFRVDSGVVIDLKN--RDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
SF+V+SG+ IDL+N + +VFAIEYALQVL SAKAV Y+PK EFM+ELRFFE+
Sbjct: 420 SFKVESGIEIDLRNGANPVTVDKTVFAIEYALQVLLSAKAVVSYSPKQNEFMVELRFFET 479
>gi|356556155|ref|XP_003546392.1| PREDICTED: uncharacterized protein LOC100793455 [Glycine max]
Length = 468
Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust.
Identities = 288/469 (61%), Positives = 342/469 (72%), Gaps = 19/469 (4%)
Query: 2 KKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
+KLRW M+G FW+LD STP++L+G A VPG+P PL +SRGTRLSRP+Q+ FM RFM
Sbjct: 4 RKLRWVMDGGGFWDLDISTPQTLDGLACPVPGDPLPLSLSRGTRLSRPRQLQFMHRFMHA 63
Query: 61 PFLPSFSAG-TLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG-----ELS---KSR 111
P +PS + L L R+L FS +WF LLGQF++ +F SS+K ELS K+
Sbjct: 64 PLIPSCAKPHGLSLHRLLTLPFSDNWFVFLLGQFHLQRFFSSVKSSKEKPKELSSWLKTF 123
Query: 112 CLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEAL 171
H L+DKSLYALG SE DDTLL D+Y R KAVF HKFP+H+ T+EA+
Sbjct: 124 GRH-LQDKSLYALGISSEFQLGMDDTLLFGLDAYEDTEKPRGKAVFHHKFPDHDLTVEAV 182
Query: 172 WPGLFVDKCGNYWDVPFSMAADLASVA-SDSGPSYHLTMHHNSGLPTQFEG--DENSSAV 228
+PGLFVD GNYWD+PFSMA DLASV SDS +YHL + SG P QFE ++N
Sbjct: 183 YPGLFVDTTGNYWDLPFSMAVDLASVTTSDSSTAYHLCARYTSGSPKQFENVQNQNDRVP 242
Query: 229 PASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIG 288
P +LLPG + K+A +Y+K VDIWRS+A+KLK+VQPYDIFLS+PHVSASGIIGAA T + G
Sbjct: 243 PPTLLPGLAFKSAFSYRKKVDIWRSEAKKLKLVQPYDIFLSNPHVSASGIIGAAATTAFG 302
Query: 289 DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARL 348
DNS R+ VE DGS GF L A+ IKSSF ADIF SV+FTAQHGNFQRLF DLTRF ARL
Sbjct: 303 DNSARAQVE---DGSPGFFLQASGIKSSFLADIFASVSFTAQHGNFQRLFLDLTRFQARL 359
Query: 349 DFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVID 408
DFPSG KFLS AT L QD NSQ+PS++ VQA+ P S+QQQI GP SFRVDSG+ +D
Sbjct: 360 DFPSGFKFLSAATGLTQDLLNSQKPSMDAVQAIIPNATLSLQQQIVGPVSFRVDSGITVD 419
Query: 409 LKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
LKN D I A + VFA+EYALQVLGSAKAVAWY PK QEFM ELRFFE+
Sbjct: 420 LKNPDWPIHALEPVFALEYALQVLGSAKAVAWYCPKRQEFMAELRFFET 468
>gi|449518839|ref|XP_004166443.1| PREDICTED: uncharacterized protein LOC101225023 [Cucumis sativus]
Length = 470
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 256/471 (54%), Positives = 332/471 (70%), Gaps = 17/471 (3%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPN--PLGISRGTRLSRPKQIDFMQRF 57
MKKLRWAM+G+ FW+LD ST ++L+GSA VP + PLG+SRG RLSR KQIDFMQRF
Sbjct: 1 MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRF 60
Query: 58 MATPFLPSFS-AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ---GELSKSRCL 113
MA PF+PS+S + LQRV FS TLLGQFN+ KF+SSL + GE+ +S
Sbjct: 61 MAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQKFMSSLMKTGSGEMCQSYTS 120
Query: 114 ------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFT 167
L +SLYA+G +++L DD+L++S+D YG ++ R KAVF KF +H+ T
Sbjct: 121 LLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT 180
Query: 168 LEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSA 227
+EAL PGLF++KCG YWDVP S+ DL SVASDSG SYHL+MH N+G P+Q G E + +
Sbjct: 181 VEALSPGLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQL-GSEPTHS 239
Query: 228 VPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASI 287
P LLPG S K A A+KKN +IWRS A+KLKMVQPYDIFLS+PHVS S IIGA T+
Sbjct: 240 APFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYF 299
Query: 288 GDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHAR 347
GD+ RS ++ + GF++ ++ I+S+ AD+F S++F+AQ+G FQ+ + DLTRF A
Sbjct: 300 GDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSAC 359
Query: 348 LDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVI 407
+DF SGSKFLSG+ L D NS+ P E+V+A P FS+QQQIAGP SFR D+GV I
Sbjct: 360 MDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFSIQQQIAGPVSFRADTGVAI 419
Query: 408 DLKNRD---ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
DL + + FA+EYAL VLGSAKA+AWY+PKH+EFM+ELRF+E+
Sbjct: 420 DLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET 470
>gi|449441494|ref|XP_004138517.1| PREDICTED: uncharacterized protein LOC101205565 [Cucumis sativus]
Length = 470
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 256/471 (54%), Positives = 331/471 (70%), Gaps = 17/471 (3%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPN--PLGISRGTRLSRPKQIDFMQRF 57
MKKLRWAM+G+ FW+LD ST ++L+GSA VP + PLG+SRG RLSR KQIDFMQRF
Sbjct: 1 MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRF 60
Query: 58 MATPFLPSFS-AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ---GELSKSRCL 113
MA PF PS+S + LQRV FS TLLGQFN+ KF+SSL + GE+ +S
Sbjct: 61 MAAPFFPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQKFMSSLMKTGSGEMCQSYSS 120
Query: 114 ------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFT 167
L +SLYA+G +++L DD+L++S+D YG ++ R KAVF KF +H+ T
Sbjct: 121 LLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT 180
Query: 168 LEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSA 227
+EAL PGLF++KCG YWDVP S+ DL SVASDSG SYHL+MH N+G P+Q G E + +
Sbjct: 181 VEALSPGLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQL-GSEPTHS 239
Query: 228 VPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASI 287
P LLPG S K A A+KKN +IWRS A+KLKMVQPYDIFLS+PHVS S IIGA T+
Sbjct: 240 APFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYF 299
Query: 288 GDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHAR 347
GD+ RS ++ + GF++ ++ I+S+ AD+F S++F+AQ+G FQ+ + DLTRF A
Sbjct: 300 GDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSAC 359
Query: 348 LDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVI 407
+DF SGSKFLSG+ L D NS+ P E+V+A P FS+QQQIAGP SFR D+GV I
Sbjct: 360 MDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFSIQQQIAGPVSFRADTGVAI 419
Query: 408 DLKNRD---ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
DL + + FA+EYAL VLGSAKA+AWY+PKH+EFM+ELRF+E+
Sbjct: 420 DLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET 470
>gi|326527073|dbj|BAK04478.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 246/464 (53%), Positives = 308/464 (66%), Gaps = 13/464 (2%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
++++RW + E WELD +P ++EG+AR VPG+P PLG+SRG R++R KQ+DF RFMA
Sbjct: 2 LRQMRWLTDREGRWELDVESPATMEGTARPVPGDPLPLGLSRGPRVTRTKQLDFFHRFMA 61
Query: 60 TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALR 117
+P +PSFSA L + +W T+L Q +V K V+++K+ ++ +
Sbjct: 62 SPLVPSFSASRAGLSFHHAHLLHLAHNWSCTILEQLHVQKLVAAVKEKLSNRQEGIPWSN 121
Query: 118 D-----KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALW 172
D + +LG +ELL T D LLL Y N R KA+ HK P N TL A W
Sbjct: 122 DLKRHLHDVISLGVGTELLITPDTALLLEL--YNINKGDRGKAIIHHKLPQQNLTLAASW 179
Query: 173 PGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASL 232
PGLFVDK G YWDVP S++ADLASV S SG SYHL + NSG P F GDE S VP +L
Sbjct: 180 PGLFVDKQGVYWDVPLSLSADLASVGSSSGLSYHLLLQQNSGEPKCFGGDETSD-VPLAL 238
Query: 233 LPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSV 292
LPG K A++ KK++D WR K KLKMVQPYD FLS PHVS +GI+GA + S+GD S
Sbjct: 239 LPGLCAKAAISIKKSIDAWRKKEDKLKMVQPYDAFLSDPHVSFTGIVGAVASGSLGDCSK 298
Query: 293 RSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPS 352
R V +++ S F + K + AD+F SV FTAQHGNFQRLF DLTR RLD S
Sbjct: 299 RILVPDETRKSNAFRVFHERNKFAAFADLFASVTFTAQHGNFQRLFLDLTRVSTRLDISS 358
Query: 353 GSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNR 412
GS FL GA+RLAQD+F S+RP LET VCP V S+QQQI GPFSFRV+S V IDL+N+
Sbjct: 359 GSLFLRGASRLAQDFFFSRRPDLETFCDVCPDVIVSLQQQIVGPFSFRVESSVAIDLRNQ 418
Query: 413 D--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
D + DSVFAI++AL+VLGSAKA AWY+PKHQE M+ELRFFE
Sbjct: 419 DHFVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVELRFFE 462
>gi|218187895|gb|EEC70322.1| hypothetical protein OsI_01191 [Oryza sativa Indica Group]
Length = 463
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 249/469 (53%), Positives = 314/469 (66%), Gaps = 21/469 (4%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
++++RW +G+ WELD TP ++EG+AR VPG+P PLG+SRG R++R KQ+DF+ RFMA
Sbjct: 2 LRRMRWMADGDGRWELDVETPATMEGAARPVPGDPLPLGLSRGPRVTRTKQLDFLHRFMA 61
Query: 60 TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG-----ELSKS-- 110
+P +PSFSA L + + + T+L Q +V K V+ +K+ E+S S
Sbjct: 62 SPLVPSFSASGAGLSVHHAHLVHLAHNLSFTILEQLHVQKLVAVVKEKLSNRQEVSWSND 121
Query: 111 --RCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTL 168
R LH + +LG +ELL T D TLLL Y R KA+F HK P+ N TL
Sbjct: 122 IKRHLH-----DVMSLGVGTELLITPDTTLLLEL--YNLKKGDRGKAIFRHKLPHQNITL 174
Query: 169 EALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV 228
EA WPGLF+DK G YWDVP S++ADLASV S+SG SYH+ + NSG P F G E +S V
Sbjct: 175 EASWPGLFIDKNGVYWDVPLSLSADLASVGSESGLSYHVLLQQNSGEPKCFGGGEETSDV 234
Query: 229 PASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIG 288
P +LLPG K AV+ KK++D+WR K KLK VQPYD+FLS PHVS +GIIGA + S G
Sbjct: 235 PVALLPGLCAKAAVSIKKSIDVWRKKEDKLKNVQPYDVFLSEPHVSFTGIIGAVASGSFG 294
Query: 289 DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARL 348
D S R ++N+ S F K + AD+F SV FTAQHGNFQRL DLTR ARL
Sbjct: 295 DCSKRMSMQNEIQKSNAFKYFDERNKFAAFADLFASVNFTAQHGNFQRLLLDLTRVSARL 354
Query: 349 DFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVID 408
D SGS FL GA++LAQD+F S+RP LET VCP V S+QQQI GPFSFRV+S V ID
Sbjct: 355 DISSGSLFLRGASQLAQDFFFSRRPDLETFCDVCPDVIVSLQQQIVGPFSFRVESTVTID 414
Query: 409 LKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
K +D + DSVFAI++AL+VLGSAKA AWY+PKHQE M+ELRFFE+
Sbjct: 415 PKKQDHFVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVELRFFET 463
>gi|115435680|ref|NP_001042598.1| Os01g0251000 [Oryza sativa Japonica Group]
gi|113532129|dbj|BAF04512.1| Os01g0251000 [Oryza sativa Japonica Group]
gi|215712321|dbj|BAG94448.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 249/469 (53%), Positives = 314/469 (66%), Gaps = 21/469 (4%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
++++RW +G+ WELD TP ++EG+AR VPG+P PLG+SRG R++R KQ+DF+ RFMA
Sbjct: 2 LRRMRWMADGDGRWELDVETPATMEGAARPVPGDPLPLGLSRGPRVTRTKQLDFLHRFMA 61
Query: 60 TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG-----ELSKS-- 110
+P +PSFSA L + + + T+L Q +V K V+ +K+ E+S S
Sbjct: 62 SPLVPSFSASGAGLSVHHAHLVHLAHNLSFTILEQLHVQKLVAVVKEKLSNRQEVSWSND 121
Query: 111 --RCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTL 168
R LH + +LG +ELL T D TLLL Y R KA+F HK P+ N TL
Sbjct: 122 IKRHLH-----DVMSLGVGTELLITPDTTLLLEL--YNLKKGDRGKAIFRHKLPHQNITL 174
Query: 169 EALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV 228
EA WPGLF+DK G YWDVP S++ADLASV S+SG SYH+ + NSG P F G E +S V
Sbjct: 175 EASWPGLFIDKNGVYWDVPLSLSADLASVGSESGLSYHVLLQQNSGEPKCFGGGEETSDV 234
Query: 229 PASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIG 288
P +LLPG K AV+ KK++D+WR K KLK VQPYD+FLS PHVS +GIIGA + S G
Sbjct: 235 PVALLPGLCAKAAVSIKKSIDVWRKKEDKLKNVQPYDVFLSEPHVSFTGIIGAVASGSFG 294
Query: 289 DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARL 348
D S R ++N+ S F K + AD+F SV FTAQHGNFQRL DLTR ARL
Sbjct: 295 DCSKRMSMQNEILKSNAFKYFDERNKFAAFADLFASVNFTAQHGNFQRLLLDLTRVSARL 354
Query: 349 DFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVID 408
D SGS FL GA++LAQD+F S+RP LET VCP V S+QQQI GPFSFRV+S V ID
Sbjct: 355 DISSGSLFLRGASQLAQDFFFSRRPDLETFCDVCPDVIVSLQQQIVGPFSFRVESTVTID 414
Query: 409 LKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
K +D + DSVFAI++AL+VLGSAKA AWY+PKHQE M+ELRFFE+
Sbjct: 415 PKKQDHFVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVELRFFET 463
>gi|344313274|gb|AEN14334.1| hypothetical protein rf1-C1-g5 [Zea mays]
Length = 464
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 234/465 (50%), Positives = 309/465 (66%), Gaps = 13/465 (2%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
++++RW EG+ WELD TP ++EG+AR VPG+P PLG+SRG R++RPKQ+DF RFMA
Sbjct: 3 LRRMRWMSEGDGRWELDAETPVTMEGTARPVPGDPLPLGLSRGYRITRPKQLDFFHRFMA 62
Query: 60 TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALR 117
+P +P+FSA L + F + +W +T+L + NV K VS +K+ ++ +
Sbjct: 63 SPLVPTFSAMRDGLSVNHAHIFYMTDNWSSTILEKINVKKLVSVVKEKLANRQEEASWTK 122
Query: 118 D-----KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALW 172
D + +LG +E+L T D TLLL N R KA+F H+ P+HN T +A W
Sbjct: 123 DLKNHLHDVMSLGVGTEILITPDTTLLLELFDIKKGN--RGKAIFHHELPHHNITFQASW 180
Query: 173 PGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASL 232
PGLFVDK G YWDVP S++ADLASV S SG SYHL + NSG P F GDE VP +L
Sbjct: 181 PGLFVDKKGTYWDVPLSLSADLASVGSSSGLSYHLLLQQNSGEPKCFGGDETDD-VPTAL 239
Query: 233 LPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSV 292
LPG K A++ KK++D WR K KLK VQPYD+FLS HVS +GI+G + +GD S
Sbjct: 240 LPGLCAKVAISMKKSIDAWRKKEDKLKKVQPYDVFLSDSHVSLTGIVGGVASGYLGDCSR 299
Query: 293 RSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPS 352
R + +++ S F + K + AD+F SV FTAQ+GNFQRLF DLT+ AR D S
Sbjct: 300 RVAIRDETHKSNAFIMFDERNKRAAFADLFASVTFTAQYGNFQRLFLDLTKASARFDITS 359
Query: 353 GSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNR 412
GS FL GA+RLAQD+F S+RP +ET +CP V S QQQI GPFSFRV+S V ID +++
Sbjct: 360 GSLFLCGASRLAQDFFFSRRPDVETFCDICPDVTVSFQQQIVGPFSFRVESSVAIDPRSQ 419
Query: 413 D--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
D + D +FAI++AL+VLGSAKA AWY+PKHQE M+ELRF+E+
Sbjct: 420 DHFVRVDDPIFAIDWALKVLGSAKATAWYSPKHQEAMMELRFYET 464
>gi|222618108|gb|EEE54240.1| hypothetical protein OsJ_01114 [Oryza sativa Japonica Group]
Length = 479
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 249/485 (51%), Positives = 314/485 (64%), Gaps = 37/485 (7%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
++++RW +G+ WELD TP ++EG+AR VPG+P PLG+SRG R++R KQ+DF+ RFMA
Sbjct: 2 LRRMRWMADGDGRWELDVETPATMEGAARPVPGDPLPLGLSRGPRVTRTKQLDFLHRFMA 61
Query: 60 TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG-----ELSKS-- 110
+P +PSFSA L + + + T+L Q +V K V+ +K+ E+S S
Sbjct: 62 SPLVPSFSASGAGLSVHHAHLVHLAHNLSFTILEQLHVQKLVAVVKEKLSNRQEVSWSND 121
Query: 111 --RCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHK-------- 160
R LH + +LG +ELL T D TLLL Y R KA+F HK
Sbjct: 122 IKRHLH-----DVMSLGVGTELLITPDTTLLLEL--YNLKKGDRGKAIFRHKANYLDTFY 174
Query: 161 --------FPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHN 212
P+ N TLEA WPGLF+DK G YWDVP S++ADLASV S+SG SYH+ + N
Sbjct: 175 AIWYLVMKLPHQNITLEASWPGLFIDKNGVYWDVPLSLSADLASVGSESGLSYHVLLQQN 234
Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPH 272
SG P F G E +S VP +LLPG K AV+ KK++D+WR K KLK VQPYD+FLS PH
Sbjct: 235 SGEPKCFGGGEETSDVPVALLPGLCAKAAVSIKKSIDVWRKKEDKLKNVQPYDVFLSEPH 294
Query: 273 VSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHG 332
VS +GIIGA + S GD S R ++N+ S F K + AD+F SV FTAQHG
Sbjct: 295 VSFTGIIGAVASGSFGDCSKRMSMQNEILKSNAFKYFDERNKFAAFADLFASVNFTAQHG 354
Query: 333 NFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQ 392
NFQRL DLTR ARLD SGS FL GA++LAQD+F S+RP LET VCP V S+QQQ
Sbjct: 355 NFQRLLLDLTRVSARLDISSGSLFLRGASQLAQDFFFSRRPDLETFCDVCPDVIVSLQQQ 414
Query: 393 IAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIEL 450
I GPFSFRV+S V ID K +D + DSVFAI++AL+VLGSAKA AWY+PKHQE M+EL
Sbjct: 415 IVGPFSFRVESTVTIDPKKQDHFVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVEL 474
Query: 451 RFFES 455
RFFE+
Sbjct: 475 RFFET 479
>gi|357129435|ref|XP_003566367.1| PREDICTED: uncharacterized protein LOC100827567 [Brachypodium
distachyon]
Length = 461
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 240/465 (51%), Positives = 306/465 (65%), Gaps = 15/465 (3%)
Query: 1 MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
++++RW + E WELD +P ++EG+AR VPG+P PLG+SRG R++R KQ+DF RFMA
Sbjct: 2 LRQMRWMTDREGRWELDVESPATMEGTARPVPGDPLPLGLSRGPRVTRTKQLDFFHRFMA 61
Query: 60 TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALR 117
+P +P+FS L L F + +W T+L Q +V K V +K+ ++ +
Sbjct: 62 SPLVPAFSPYRSGLSLDHAHLFHLAENWSFTILEQLHVKKLVEVVKEKLSNRQEGVPWTN 121
Query: 118 D-----KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALW 172
D + +LG +ELL T D TLLL Y R KA+ HK P N LEA W
Sbjct: 122 DLKRQLHDVMSLGVGTELLITPDTTLLLEL--YNIKKGERGKAIINHKLPQQNLKLEASW 179
Query: 173 PGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASL 232
PGLFVDK G YWDVP S++AD ASV S SG SY L + NSG P F GDE + VP +L
Sbjct: 180 PGLFVDKQGVYWDVPLSLSADFASVGSSSGLSYRLLVQQNSGEPKCFGGDETND-VPLAL 238
Query: 233 LPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSV 292
LPG +K A + KK++D+WR K KLKMVQPYD+FLS PHVS + I+GA + S+GD S
Sbjct: 239 LPGLCVKAAFSIKKSIDVWRKKEDKLKMVQPYDVFLSDPHVSFTRIVGAVASGSLGDCSK 298
Query: 293 RSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPS 352
R V+++ S F + K + AD+F SV FTAQHGNFQRLF DLTR ARLD S
Sbjct: 299 RISVQDER--SNAFRVFNDRNKFAAFADLFASVTFTAQHGNFQRLFLDLTRVSARLDISS 356
Query: 353 GSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNR 412
GS FL GA+RLAQD+F S+RP LET VCP S+QQQI GPFSFRV+S V ID +++
Sbjct: 357 GSLFLRGASRLAQDFFFSRRPDLETFCDVCPNTIVSLQQQIVGPFSFRVESSVAIDPRSQ 416
Query: 413 D--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
D + DSVFAI++AL+VLGSAKA AWY+PKHQE M+ELRFFE+
Sbjct: 417 DRLVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVELRFFEA 461
>gi|56783809|dbj|BAD81221.1| unknown protein [Oryza sativa Japonica Group]
Length = 439
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 240/446 (53%), Positives = 298/446 (66%), Gaps = 20/446 (4%)
Query: 23 LEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATPFLPSFSA--GTLQLQRVLPFQ 80
+EG+AR VPG+P PLG+SRG R++R KQ+DF+ RFMA+P +PSFSA L +
Sbjct: 1 MEGAARPVPGDPLPLGLSRGPRVTRTKQLDFLHRFMASPLVPSFSASGAGLSVHHAHLVH 60
Query: 81 FSGDWFATLLGQFNVLKFVSSLKQG-----ELSKS----RCLHALRDKSLYALGFCSELL 131
+ + T+L Q +V K V+ +K+ E+S S R LH + +LG +ELL
Sbjct: 61 LAHNLSFTILEQLHVQKLVAVVKEKLSNRQEVSWSNDIKRHLH-----DVMSLGVGTELL 115
Query: 132 FTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMA 191
T D TLLL Y R KA+F HK P+ N TLEA WPGLF+DK G YWDVP S++
Sbjct: 116 ITPDTTLLLEL--YNLKKGDRGKAIFRHKLPHQNITLEASWPGLFIDKNGVYWDVPLSLS 173
Query: 192 ADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIW 251
ADLASV S+SG SYH+ + NSG P F G E +S VP +LLPG K AV+ KK++D+W
Sbjct: 174 ADLASVGSESGLSYHVLLQQNSGEPKCFGGGEETSDVPVALLPGLCAKAAVSIKKSIDVW 233
Query: 252 RSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAA 311
R K KLK VQPYD+FLS PHVS +GIIGA + S GD S R ++N+ S F
Sbjct: 234 RKKEDKLKNVQPYDVFLSEPHVSFTGIIGAVASGSFGDCSKRMSMQNEILKSNAFKYFDE 293
Query: 312 DIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQ 371
K + AD+F SV FTAQHGNFQRL DLTR ARLD SGS FL GA++LAQD+F S+
Sbjct: 294 RNKFAAFADLFASVNFTAQHGNFQRLLLDLTRVSARLDISSGSLFLRGASQLAQDFFFSR 353
Query: 372 RPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQV 429
RP LET VCP V S+QQQI GPFSFRV+S V ID K +D + DSVFAI++AL+V
Sbjct: 354 RPDLETFCDVCPDVIVSLQQQIVGPFSFRVESTVTIDPKKQDHFVRVDDSVFAIDWALKV 413
Query: 430 LGSAKAVAWYAPKHQEFMIELRFFES 455
LGSAKA AWY+PKHQE M+ELRFFE+
Sbjct: 414 LGSAKATAWYSPKHQEAMVELRFFET 439
>gi|242052461|ref|XP_002455376.1| hypothetical protein SORBIDRAFT_03g009540 [Sorghum bicolor]
gi|241927351|gb|EES00496.1| hypothetical protein SORBIDRAFT_03g009540 [Sorghum bicolor]
Length = 464
Score = 430 bits (1106), Expect = e-118, Method: Compositional matrix adjust.
Identities = 232/467 (49%), Positives = 311/467 (66%), Gaps = 17/467 (3%)
Query: 1 MKKLRW-AMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
++++RW + E WELD TP ++EG+ R VPG+P PLG+SRG R++RPKQ+DF RFMA
Sbjct: 3 LRRMRWMSDEDGRWELDAETPVTMEGTVRPVPGDPLPLGLSRGYRVTRPKQLDFFHRFMA 62
Query: 60 TPFLPSFSAGT--LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK------QGELSKSR 111
+P +P+FSA L + + +W + +L + NV K +S +K Q E S ++
Sbjct: 63 SPLVPTFSATRDGLSVNHAHILYITDNWSSIILEKINVNKLMSVVKEKFANRQEEASWTK 122
Query: 112 CLHA-LRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEA 170
L L D + +LG +E+L T D TLLL N R KA+F H+ P+HN T++A
Sbjct: 123 DLKKHLHD--VMSLGVGTEILITPDTTLLLELFDIKKGN--RGKAIFHHELPHHNITVQA 178
Query: 171 LWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPA 230
WPGLFVDK G YWDVP S++ADL+SV S SG SYHL + NSG P F GDE VP
Sbjct: 179 SWPGLFVDKRGAYWDVPLSLSADLSSVGSSSGLSYHLLLQQNSGEPKCFGGDETDD-VPI 237
Query: 231 SLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDN 290
+LLPG K A++ KK++D+WR K KLK VQPYDIFLS HVS +GI+GA + +G+
Sbjct: 238 ALLPGLCAKAAISIKKSIDVWRKKEDKLKSVQPYDIFLSDSHVSLTGIVGAVASGYLGNC 297
Query: 291 SVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDF 350
S R + +++ S F + K + AD+F SV FTA++GNFQRLF DLT+ AR D
Sbjct: 298 SRRVAIRDETQKSNAFRMFDERNKCAAFADLFASVTFTARYGNFQRLFLDLTKASARFDI 357
Query: 351 PSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLK 410
SGS FL GA+RLAQD+F S+RP +ET +CP V S+QQQI GPFSFRV+S V D +
Sbjct: 358 TSGSLFLCGASRLAQDFFFSRRPDVETFCDICPDVTVSLQQQIVGPFSFRVESSVAFDPR 417
Query: 411 NRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
++D + D +FAI++AL+VLGSAKA AWY+PKHQE M+ELRF+E+
Sbjct: 418 SQDHFVRVDDPIFAIDWALKVLGSAKATAWYSPKHQEAMVELRFYET 464
>gi|356532618|ref|XP_003534868.1| PREDICTED: uncharacterized protein LOC100804638 [Glycine max]
Length = 379
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 229/384 (59%), Positives = 272/384 (70%), Gaps = 21/384 (5%)
Query: 85 WFATLLGQFNVLKFVSSLKQG-----ELS---KSRCLHALRDKSLYALGFCSELLFTDDD 136
W LLGQF++ +FVSS+K ELS K+ H L+ KSLYALG SE DD
Sbjct: 4 WLVFLLGQFHLQRFVSSVKSSKEKPKELSSWLKTFGRH-LQQKSLYALGISSEFQLGVDD 62
Query: 137 TLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLAS 196
TL D+Y R KAVF HKFP+H+ +EA++PG FVD NYWDVPFS+A DLAS
Sbjct: 63 TLHFGLDAYEDTEKPRGKAVFHHKFPDHDLKVEAVYPGHFVDTTSNYWDVPFSVAVDLAS 122
Query: 197 VA-SDSGPSYHLTMHHNSGLPTQFEG--DENSSAVPASLLPGFSLKNAVAYKKNVDIWRS 253
V SDS +YHL+ H+ SG P QFE ++N P +LLPG + K+ +Y+K VDIWRS
Sbjct: 123 VTTSDSSTAYHLSAHYTSGSPKQFENIQNQNDRVPPPTLLPGLAFKSVFSYRKKVDIWRS 182
Query: 254 KAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADI 313
+A+KLK+VQPYDIFLS+PHVSASGIIGAA T + GDNS R+ VE DGS GF L A+ I
Sbjct: 183 EAKKLKLVQPYDIFLSNPHVSASGIIGAAATTAFGDNSARAQVE---DGSPGFFLQASGI 239
Query: 314 KSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRP 373
KSSF ADIF SV+FTAQHGNFQRLF DLTRF ARLDFPSG KFLS AT NSQ P
Sbjct: 240 KSSFLADIFASVSFTAQHGNFQRLFLDLTRFQARLDFPSGFKFLSAATGRTH-LLNSQNP 298
Query: 374 SLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKN--RDICAHDSVFAIEYALQVLG 431
+++T + P S+QQQI GP SFRVDSG+ +DLKN R I A + VFA+EYAL+VLG
Sbjct: 299 AIQT---ILPNATLSLQQQIVGPVSFRVDSGITVDLKNPDRPIHAPEPVFALEYALKVLG 355
Query: 432 SAKAVAWYAPKHQEFMIELRFFES 455
SAKAVAWY PK QEF+ ELRFFE+
Sbjct: 356 SAKAVAWYFPKRQEFVAELRFFET 379
>gi|414876816|tpg|DAA53947.1| TPA: hypothetical protein ZEAMMB73_402660 [Zea mays]
Length = 459
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 224/462 (48%), Positives = 293/462 (63%), Gaps = 32/462 (6%)
Query: 23 LEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATPFLPSFSA--GTLQLQRVLPFQ 80
+EG+AR VPG+P PLG+SRG R++RPKQ+DF RFMA+P +P+FSA L + F
Sbjct: 1 MEGTARPVPGDPLPLGLSRGYRITRPKQLDFFHRFMASPLVPTFSAMRDGLSVNHAHIFY 60
Query: 81 FSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRD-----KSLYALGFCSELLFTDD 135
+ +W +T+L + NV K VS +K+ ++ +D + +LG +E+L T D
Sbjct: 61 MTDNWSSTILEKINVKKLVSVVKEKLANRQEEASWTKDLKNHLHDVMSLGVGTEILITPD 120
Query: 136 DTLLLSYDSYGHNNTSRKKAVFCH--------------------KFPNHNFTLEALWPGL 175
TLLL N R KA+F H + P+HN T +A WPGL
Sbjct: 121 TTLLLELFDIKKGN--RGKAIFHHEAKNLDALNLFGIQTPLYLLQLPHHNITFQASWPGL 178
Query: 176 FVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASLLPG 235
FVDK G YWDVP S++ADLASV S SG SYHL + NSG P F GDE VP +LLPG
Sbjct: 179 FVDKKGTYWDVPLSLSADLASVGSSSGLSYHLLLQQNSGEPKCFGGDETDD-VPTALLPG 237
Query: 236 FSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSVRSP 295
K A++ KK++D WR K KLK VQPYD+FLS HVS +GI+G + +GD S R
Sbjct: 238 LCAKVAISMKKSIDAWRKKEDKLKKVQPYDVFLSDSHVSLTGIVGGVASGYLGDCSRRVA 297
Query: 296 VENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSK 355
+ +++ S F + K + AD+F SV FTAQ+GNFQRLF DLT+ AR D SGS
Sbjct: 298 IRDETHKSNAFIMFDERNKRAAFADLFASVTFTAQYGNFQRLFLDLTKASARFDITSGSL 357
Query: 356 FLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNRD-- 413
FL GA+RLAQD+F S+RP +ET +CP V S QQQI GPFSFRV+S V ID +++D
Sbjct: 358 FLCGASRLAQDFFFSRRPDVETFCDICPDVTVSFQQQIVGPFSFRVESSVAIDPRSQDHF 417
Query: 414 ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
+ D +FAI++AL+VLGSAKA AWY+PKHQE M+ELRF+E+
Sbjct: 418 VRVDDPIFAIDWALKVLGSAKATAWYSPKHQEAMMELRFYET 459
>gi|89953447|gb|ABD83319.1| Fgenesh protein 89 [Beta vulgaris]
Length = 400
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 211/374 (56%), Positives = 268/374 (71%), Gaps = 22/374 (5%)
Query: 86 FATLLGQFNVLKFVSSLKQG---ELSKSRCLHAL----RDKSLYALGFCSELLFTDDDTL 138
F TLLGQFN+ KFVS++++ ELS S + L +KS YA GFCSE L T +DTL
Sbjct: 7 FTTLLGQFNLQKFVSAVRKAKSEELSDSSWMQVLGRHLSEKSFYAFGFCSEFLVTSEDTL 66
Query: 139 LLSYDSYGHNNTSRKKAVFCHK------------FPNHNFTLEALWPGLFVDKCGNYWDV 186
L SY++ G T+RKKAVF HK FP+ N T+E +WPGLF+DK GNYWDV
Sbjct: 67 LFSYEASGDTKTARKKAVFHHKASTKRYEGTLYQFPHLNLTVERVWPGLFIDKHGNYWDV 126
Query: 187 PFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKK 246
P ++AADLASV SDSG YHL++HH SG Q EG++ SS VPA+LLPG +K+A ++KK
Sbjct: 127 PATLAADLASVTSDSGARYHLSLHHISGAAKQVEGNK-SSDVPATLLPGLYVKSAFSFKK 185
Query: 247 NVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGF 306
N+D WRS+A+KLK VQP+D+FLSSPHVS +G+IG TA +GDNS+RSP+++
Sbjct: 186 NIDFWRSQAKKLKFVQPFDLFLSSPHVSGAGLIGVVATALVGDNSLRSPLDDQPKDYRRL 245
Query: 307 HLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQD 366
L+A+ KS+F AD+F +++F+AQ+GNFQR F DLTR H RLD PSGSKF S A LA+D
Sbjct: 246 DLYASRGKSAFLADLFATLSFSAQYGNFQRKFFDLTRIHGRLDVPSGSKFCSAAACLAKD 305
Query: 367 YFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNR--DICAHDSVFAIE 424
+ SQ+ SLE VQAVCP S QQQI GPFSFRVDS + ++L+N+ + + VFAIE
Sbjct: 306 IYKSQQLSLEAVQAVCPNPMVSFQQQIVGPFSFRVDSKIAVNLENQGCKVSLDNPVFAIE 365
Query: 425 YALQVLGSAKAVAW 438
YALQVL SAKAVAW
Sbjct: 366 YALQVLFSAKAVAW 379
>gi|297789715|ref|XP_002862795.1| hypothetical protein ARALYDRAFT_920263 [Arabidopsis lyrata subsp.
lyrata]
gi|297308522|gb|EFH39053.1| hypothetical protein ARALYDRAFT_920263 [Arabidopsis lyrata subsp.
lyrata]
Length = 435
Score = 352 bits (904), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 175/310 (56%), Positives = 216/310 (69%), Gaps = 15/310 (4%)
Query: 160 KFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQF 219
+FP HN T EA+W GLFVD G YWDVP SMA DLAS+ ++SGPSYHL +HHNSG P +
Sbjct: 127 QFPLHNLTAEAVWLGLFVDNHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPKKL 186
Query: 220 EGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGII 279
D P SLLP SL +A++Y N+D+WR KL+ +PYD+FLSSPHV+ SGII
Sbjct: 187 NSD-TMEVPPPSLLPCLSLTSAISYWTNMDLWRGTTPKLETYKPYDVFLSSPHVAVSGII 245
Query: 280 GAAM----------TASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTA 329
G SIG+NS+RS ENDS+G GF LH + S F AD G + TA
Sbjct: 246 GMISFFDLFQKLCDDRSIGENSIRSKFENDSEGVGGFSLHFPFVNSGFMADTLGLASLTA 305
Query: 330 QHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSV 389
Q+ NFQ+LF DLTRFHARLDFP G +FL+GAT ++QD NS++PSLE Q +CP+V S+
Sbjct: 306 QYRNFQKLFFDLTRFHARLDFPYGLRFLTGATSVSQDLLNSRQPSLEAFQKICPEVLVSL 365
Query: 390 QQQIAGPFSFRVDSGVVIDLKN----RDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQE 445
QQQI GP SF+V+SG+ IDLKN + +VFAIEYALQVL SAKAVAWY+PK +E
Sbjct: 366 QQQIVGPSSFKVESGIQIDLKNGVGANPVTVDKTVFAIEYALQVLLSAKAVAWYSPKQKE 425
Query: 446 FMIELRFFES 455
FM+EL FFE+
Sbjct: 426 FMVELSFFET 435
>gi|42572297|ref|NP_974244.1| protein pigment defective 320 [Arabidopsis thaliana]
gi|332640960|gb|AEE74481.1| protein pigment defective 320 [Arabidopsis thaliana]
Length = 341
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 168/303 (55%), Positives = 208/303 (68%), Gaps = 24/303 (7%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M ++RW EG+ W+LD STP +LEG+ARAVP +P PLG+SRGTRLSRPKQ++F RFMA+
Sbjct: 1 MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60
Query: 61 PFLPSFS----------AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ----GE 106
P +PSFS G LQRVL FS +W +LLGQF+V +FV+ + + G
Sbjct: 61 PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120
Query: 107 LSKSRCL-------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHN--NTSRKKAVF 157
S S L+DKSLYALGFCSE L + DDTLLLSYD+Y + R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180
Query: 158 CHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPT 217
H+FP HN T EA+WPGLFVDK G YWDVP SMA DLAS+ ++SGPSYHL +HHNSG P
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240
Query: 218 QFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASG 277
+ D P SLLPG SLK+AV+Y+ N+D+WR KL+ +PYD+FLSSPHV+ SG
Sbjct: 241 KLHSD-TMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCKPYDVFLSSPHVAVSG 299
Query: 278 IIG 280
IIG
Sbjct: 300 IIG 302
>gi|255571101|ref|XP_002526501.1| conserved hypothetical protein [Ricinus communis]
gi|223534176|gb|EEF35892.1| conserved hypothetical protein [Ricinus communis]
Length = 465
Score = 300 bits (769), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 180/480 (37%), Positives = 263/480 (54%), Gaps = 42/480 (8%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M LR A++ FW+ S+P++LEG AR++PG+P PL +R +R R +Q+ +
Sbjct: 1 MANLRTALDSAFWDQPLSSPQTLEGCARSIPGDPFPLDATRASRALRIQQLSLLANGFPL 60
Query: 61 PFLPSFSAGT------LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK-------QGEL 107
+PS+S+ + L +Q +L S + + L+GQF K +SS+K + EL
Sbjct: 61 GLVPSYSSASPKHPPSLSVQSLLLKLASSNCWLGLIGQFRPKKLISSIKAEFSNAEELEL 120
Query: 108 SKSR-CLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNF 166
S R + DKSLY++G CS+ T +LLLS + +GH+ T R K + H+ P+H+
Sbjct: 121 SVFRDAAKHIVDKSLYSIGICSQFSPTPSTSLLLSTERHGHSATPRYKFMLFHQLPSHDI 180
Query: 167 TLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSS 226
TLEA WP LF+D G YWDVP S++ D+AS+ SD+G Y +H N+G P +
Sbjct: 181 TLEAAWPELFLDHRGGYWDVPQSISLDMASIGSDTGFRYRFGIHKNNGHPNTINAINDQP 240
Query: 227 AVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMV----------QPYDIFLSSPHVSAS 276
P +L+PG K + +Y+K+ D+WR K K V + YD+ LS PH + S
Sbjct: 241 --PFALMPGLCGKASFSYEKSKDLWRKKQSKKDSVIKTDRGSILPRSYDVRLSQPHSAIS 298
Query: 277 GIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQR 336
GI+G A A G + + + ST +S +AD+FGSV +T QHGNF +
Sbjct: 299 GIVGGACAAWFGGRDISVSADGHNSSST-------RKRSPLNADLFGSVCYTFQHGNFTK 351
Query: 337 LFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGP 396
L+ DLTR ARLD S A LA+ F S P++ ++QQQ+AGP
Sbjct: 352 LYGDLTRIDARLDICS-------ALTLAKRAFRWSSVSDADNALSSPRLNLTLQQQVAGP 404
Query: 397 FSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
FRVDS ID + H D V+++ Y+L++L S K VAWY+PK +E M+ELR FE
Sbjct: 405 IVFRVDSRFSIDSSSDQEGPHVEDLVYSLSYSLRLLRSGKVVAWYSPKRKEGMVELRLFE 464
>gi|449432352|ref|XP_004133963.1| PREDICTED: uncharacterized protein LOC101205636 [Cucumis sativus]
gi|449487568|ref|XP_004157691.1| PREDICTED: uncharacterized protein LOC101227878 [Cucumis sativus]
Length = 470
Score = 287 bits (735), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 174/483 (36%), Positives = 264/483 (54%), Gaps = 43/483 (8%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M LR AM+ FW+L+ S+P++L G+A+AVPG P PL +R +R R +Q+ F+
Sbjct: 1 MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRTLRIQQLSFLGNGFPL 60
Query: 61 PFLPSF------SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK-------QGEL 107
+PS+ G+ LQ +L S W+A L+GQF K +SS+K Q EL
Sbjct: 61 GIIPSYCPTAHKELGSFSLQSLLFMMPSVKWWAGLVGQFRPKKLISSIKAQISAVEQLEL 120
Query: 108 SKSRCLHAL-RDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNF 166
S + + +L DKSLY G CS+ ++ +S + G R KA+F H+ P H+
Sbjct: 121 SDLKDIASLFLDKSLYTYGICSQFSTGPFSSVYVSTEKLGERKGHRHKAMFYHRLPEHDI 180
Query: 167 TLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSS 226
++A WP LF+D G YWDVP S++ DL+S+ S+SG Y + +H N G+P NS
Sbjct: 181 NVDAAWPELFIDHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGVPRALN-STNSD 239
Query: 227 AVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMV----------QP-YDIFLSSPHVSA 275
P +LLPG K A + +KN D+WR + +M +P YD+ L PH +
Sbjct: 240 DPPLTLLPGLCAKAAFSIEKNRDLWRDNLSEEEMTINYIRTGLKKEPAYDVRLDEPHAAI 299
Query: 276 SGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQ 335
SGIIG +++ G + V ++ DG+ +S +AD+FGS+ +T QHG F
Sbjct: 300 SGIIGGTVSSWFGGSDT---VGSNGDGNLTM---GHKKRSPLNADLFGSICYTYQHGKFL 353
Query: 336 RLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQ--AVCPKVFFSVQQQI 393
F+DLTR ARL S S F A+ F+ + S++ ++ P++ QQQ+
Sbjct: 354 NDFNDLTRIDARLSISSASGF-------AKRVFHVFKKSVDDLERSKSSPRLNLIFQQQV 406
Query: 394 AGPFSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELR 451
AGP FR++S +++D + I H D++ ++ Y+ L SAKAV WY+PK +E M+ELR
Sbjct: 407 AGPIVFRLESKLLLDSASGKIGPHVEDTICSLTYSFLDLESAKAVFWYSPKRKEGMVELR 466
Query: 452 FFE 454
+E
Sbjct: 467 LYE 469
>gi|225453254|ref|XP_002265990.1| PREDICTED: uncharacterized protein LOC100256535 [Vitis vinifera]
gi|297734677|emb|CBI16728.3| unnamed protein product [Vitis vinifera]
Length = 464
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 173/481 (35%), Positives = 254/481 (52%), Gaps = 45/481 (9%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M LR AM+ FW+LD S+P++L G+ARAVPG+P PL +R +R R +Q+ F+
Sbjct: 1 MANLRTAMDAAFWDLDISSPQTLHGAARAVPGDPFPLEGARASRALRVQQLSFLGNGFPL 60
Query: 61 PFLPSFSA------GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK-------QGEL 107
+PSFS G+ LQ + + +W+ L GQF K +SS+K + EL
Sbjct: 61 GIIPSFSPTSQKDLGSFSLQSLFLRPSTSNWWLGLTGQFRPKKLISSIKADLSAVDEWEL 120
Query: 108 S--KSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHN 165
S K H + DKSL++ G CS+L T +L++S + +G R + + H+ P H+
Sbjct: 121 STFKEVAKHFI-DKSLFSFGLCSQLSLTSASSLMVSTEQHGEKKGRRNRVMLFHQLPFHD 179
Query: 166 FTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENS 225
TLEA WP LF+D G YW++P S++ L+S+ S+SG Y +H N G P +
Sbjct: 180 ITLEAAWPELFIDHKGRYWELPESISLGLSSLVSESGLRYRFGIHKNGGHPQSVNAINDE 239
Query: 226 SAVPASLLPGFSLKNAVAYKKNVDIWRSKAQK----------LKMVQPYDIFLSSPHVSA 275
+ P++L+PG K A +Y+K+ D+WR + ++ L YDI L PH +
Sbjct: 240 A--PSALMPGLCAKAAFSYEKSRDLWRQREKQEDGIVKTERGLVWRPSYDIRLREPHAAI 297
Query: 276 SGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQ 335
SGIIG A G + DGS+ A +S F AD+F S T QHG F+
Sbjct: 298 SGIIGGTCEAWFGGS------REHGDGSSA----DAKKRSPFGADLFASGCCTFQHGQFR 347
Query: 336 RLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAG 395
+ + DLTR ARL+ S S + L N + L + P++ QQQ+AG
Sbjct: 348 KRYGDLTRVDARLNICSASALAKRVSNLFSSSVNGAKDPLSS-----PRLNLIFQQQVAG 402
Query: 396 PFSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFF 453
P FRVDS +++D D +++ Y+L++L S K VAWY+PK +E MIELR F
Sbjct: 403 PIVFRVDSKLLLDSSGGRAGPQLEDFTYSLNYSLRLLRSGKVVAWYSPKRKEGMIELRLF 462
Query: 454 E 454
E
Sbjct: 463 E 463
>gi|356504509|ref|XP_003521038.1| PREDICTED: uncharacterized protein LOC100775974 [Glycine max]
Length = 464
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 170/479 (35%), Positives = 260/479 (54%), Gaps = 41/479 (8%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M KL+ ++ FW+L+ ++P+ +G A++VPG+P PL S +R+ RP+Q+ + +
Sbjct: 1 MAKLKTGIDSAFWDLNVASPQLHDGWAKSVPGDPFPLDGSVASRVLRPRQLSVIGNGLPL 60
Query: 61 P-FLPSFSA------GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRC- 112
P +PS S G+ LQ +L + W+ T+ GQF K ++ +K E+S +
Sbjct: 61 PVIIPSLSPTSPKDLGSFSLQSLLLKLANPRWWLTMTGQFRPRKLIADVKN-EISNAEEF 119
Query: 113 -LHALRD-------KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNH 164
L ++D KSLY+ G S+ F +LLL+ + +G R+K + HK +H
Sbjct: 120 DLSTVKDVVKHFINKSLYSFGLTSQFAFPPSTSLLLAIEGHGEKERLRRKMMVFHKLHDH 179
Query: 165 NFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDEN 224
+ TLEA WP LFVD G YWDVP S++ DL+S+ S+SG YH +H N G P +
Sbjct: 180 DLTLEAAWPQLFVDHKGKYWDVPESLSVDLSSLVSESGLRYHFGIHKNGGNPQAMNATDG 239
Query: 225 SSAVPASLLPGFSLKNAVAYKKNVDIWRSKA---QKLKMVQPYDIFLSSPHVSASGIIGA 281
+ P SLLPG K AV+Y+K WR K Q+ + PYD+ L PH + SGIIG+
Sbjct: 240 NP--PLSLLPGLCAKVAVSYEKIKYFWRDKGAAEQENEEALPYDVRLKEPHAAVSGIIGS 297
Query: 282 AMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDL 341
+ I + S V++ D + +S +AD+FGSV ++ QHG F + + DL
Sbjct: 298 TFASWIWNGRSLSSVDSREDQEV-----STSKRSRHNADLFGSVCYSFQHGKFTKKYGDL 352
Query: 342 TRFHARLDFPSGS----KFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
TR ARLD S S K L+G++ D S++PS P++ QQQ+AGP
Sbjct: 353 TRVDARLDISSASAFAKKILNGSSSSTADV--SKQPS------ASPRLNLIFQQQVAGPV 404
Query: 398 SFRVDSGVVID--LKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
FR DS + ++ + + D + ++ Y+L+ L S K VAWY+PK +E M+E R +E
Sbjct: 405 VFRADSRIALESFARKNGVSVEDFICSLSYSLKDLQSGKIVAWYSPKRKEGMVEFRMYE 463
>gi|356520726|ref|XP_003529011.1| PREDICTED: uncharacterized protein LOC100792226 [Glycine max]
Length = 464
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 169/479 (35%), Positives = 260/479 (54%), Gaps = 41/479 (8%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M KL+ +++ FW+ + ++P+ EG ++VPG+P PL S +R+ RP+Q+ + +
Sbjct: 1 MAKLKTSIDSSFWDFNVASPQLHEGWVKSVPGDPFPLDGSVASRVLRPRQLSVIGNGLPL 60
Query: 61 PFL-PSFSA------GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRC- 112
P + PS S G+ LQ +L + W+ T+ GQF K ++ +K E+S +
Sbjct: 61 PVIVPSLSPTSPKDLGSFCLQSLLLKLANPRWWLTMTGQFRPRKLIADVKN-EISNAEEF 119
Query: 113 -LHALRD-------KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNH 164
L ++D KSLY+ G S+ F +LLL+ + +G R K + HK P+H
Sbjct: 120 DLSTVKDVAKHFINKSLYSFGLTSQFAFPPSTSLLLAIEGHGEKERLRSKVMVFHKLPDH 179
Query: 165 NFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDEN 224
+ TLEA WP LFVD G YWDVP S++ DL+S+ S+SG YH+ MH NS P
Sbjct: 180 DLTLEAAWPQLFVDHKGKYWDVPESLSVDLSSLVSESGLRYHIGMHKNSVNPQAMNATNG 239
Query: 225 SSAVPASLLPGFSLKNAVAYKKNVDIWRSKA---QKLKMVQPYDIFLSSPHVSASGIIGA 281
+ P SLLPG K AV+Y+K WR K Q+ + PYD+ L PH + SGIIG+
Sbjct: 240 NP--PLSLLPGLCAKVAVSYEKIKYFWRDKGAAEQENEEALPYDVRLKEPHAAVSGIIGS 297
Query: 282 AMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDL 341
+ I + S +++ D + +S +AD+FGSV ++ QHG F + + DL
Sbjct: 298 TFASWIWNGRSLSSIDSREDPEV-----STSKRSRHNADLFGSVCYSFQHGKFTKKYGDL 352
Query: 342 TRFHARLDFPSGS----KFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
TR ARLD S S K L+G++ + + S++PS P++ QQQ+AGP
Sbjct: 353 TRVDARLDISSASAFAKKILNGSS--SSTAYVSEQPS------ASPRLNLIFQQQVAGPV 404
Query: 398 SFRVDSGVVID--LKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
FR DS + ++ + + D + ++ Y+L+ L S K VAWY+PK +E M+E R +E
Sbjct: 405 VFRADSRIALESFARKNGVSVEDFICSLSYSLKDLESGKIVAWYSPKRKEGMVEFRMYE 463
>gi|297824489|ref|XP_002880127.1| hypothetical protein ARALYDRAFT_483595 [Arabidopsis lyrata subsp.
lyrata]
gi|297325966|gb|EFH56386.1| hypothetical protein ARALYDRAFT_483595 [Arabidopsis lyrata subsp.
lyrata]
Length = 455
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 162/477 (33%), Positives = 246/477 (51%), Gaps = 46/477 (9%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M L A++ FW+ + S+P++LEG+AR+VPG P PL +R +R R +Q+ ++
Sbjct: 1 MANLNSAVDSVFWDQNVSSPQTLEGTARSVPGEPFPLDGARASRSHRIQQLSLLREGFPL 60
Query: 61 PFLPSFSAGT------LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG--------- 105
+PSF+ + L +L S +W+ L+GQF K + +K
Sbjct: 61 GIIPSFAPASDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIKADISNAEEWDL 120
Query: 106 ELSKSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHN 165
++ K H + DKSLY++G +++ +LLLS + G N R K +F H H+
Sbjct: 121 QVVKDTAKHIV-DKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMFVHPLEKHD 179
Query: 166 FTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSG--LPTQFEGDE 223
T+EA WP LF+D G +WDVP S+ D++S+ +SG Y +H + G P G E
Sbjct: 180 LTVEAAWPDLFLDNKGRFWDVPESLNVDVSSLVPESGLRYRFGLHKSRGNPQPVNAAGAE 239
Query: 224 NSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQ------PYDIFLSSPHVSASG 277
+ S P SL+PG K AV+YK N D+WR + ++ + PYDI L PH + SG
Sbjct: 240 SGSDAPTSLMPGLCAKAAVSYKANRDLWRPQEKEDNTEEGTPEFLPYDIRLKEPHAAISG 299
Query: 278 IIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRL 337
I+G+++ A I + L +S SAD+FGS +T Q G F +L
Sbjct: 300 IVGSSLAAWITGRGM---------------LVNGKKRSPISADVFGSACYTFQKGRFSKL 344
Query: 338 FSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
+ DLTR AR+D PS S A R+ + + P++ QQQ+AGP
Sbjct: 345 YGDLTRVDARVDLPSASAL---AKRIFHAFRRLSGSNNSDDTLWSPRLNLIFQQQVAGPI 401
Query: 398 SFRVDSGVVIDLKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
F+VDS + + D ++++ Y+L++L S K VAWY+PK +E MIELR FE
Sbjct: 402 VFKVDSQFQVGAARME----DLIYSLNYSLRLLESGKVVAWYSPKRKEGMIELRIFE 454
>gi|224137192|ref|XP_002327064.1| predicted protein [Populus trichocarpa]
gi|222835379|gb|EEE73814.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 172/486 (35%), Positives = 254/486 (52%), Gaps = 48/486 (9%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M LR A++ +FW+ S+ ++LEG A ++PG+P PL ++R ++ R +Q+ +
Sbjct: 1 MANLRTAVDSQFWDQPISSSQTLEGCAYSIPGDPFPLEVTRASKALRVQQLSVLGNGFPL 60
Query: 61 PFLPSFSA------GTLQLQRV-LPFQFSGDWFATLLGQFNVLKFVSSLKQGELS----- 108
+PSFS G+ LQ + L S W L+GQF K +SS+K GE +
Sbjct: 61 GTIPSFSPTSTKDLGSFSLQSLFLKLATSNSWLG-LIGQFRPKKLISSIK-GEFTNADEF 118
Query: 109 -----KSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPN 163
K H DKS+Y+LG S++ + LL + + +G R K + H+ P+
Sbjct: 119 EWPAFKDVAKHVF-DKSIYSLGLFSQISLSSSSVLLST-ERHGDKRRPRYKMMLWHELPD 176
Query: 164 HNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDE 223
H+ TLEA WPGLF+D G YWDVP S++ D++S+ S+SG Y + +H N G P +
Sbjct: 177 HDITLEAAWPGLFLDHKGKYWDVPESISLDMSSLPSESGFQYRIGVHKNGGHPQPV--NT 234
Query: 224 NSSAVPASLLPGFSLKNAVAYKKNVDIWRS---------KAQKLKMVQP-YDIFLSSPHV 273
+ VP +L+PG K A +Y+K D WR K K K+ P +D+ L PH
Sbjct: 235 LNGEVPCALMPGLCAKAAFSYEKRKDFWRQKDKVDDTAVKTDKGKVWHPSFDMRLREPHS 294
Query: 274 SASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGN 333
+ SGIIG A G + E+ D T +S +A++FGSV +T QHG
Sbjct: 295 AISGIIGGTSVAWFGGSESSPSTESHVDMDTSI---GTKKRSPLNANLFGSVCYTFQHGR 351
Query: 334 FQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNS-QRPSLETVQ--AVCPKVFFSVQ 390
F +L+ DLTR ARLD S A+ +A+ FN +R S PK+ +Q
Sbjct: 352 FTKLYGDLTRVDARLDICS-------ASAVAKRVFNIFRRSSFSNADNPLSSPKLSLILQ 404
Query: 391 QQIAGPFSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQVLGSAKAVAWYAPKHQEFMI 448
QQ+AGP RVDS + + H D + ++ Y+L++L S K VAWY+PK +E M+
Sbjct: 405 QQVAGPIMVRVDSKFSLGSSSGKQGPHVEDLICSLSYSLRLLRSGKVVAWYSPKRKEGMV 464
Query: 449 ELRFFE 454
ELR FE
Sbjct: 465 ELRLFE 470
>gi|18406590|ref|NP_566021.1| uncharacterized protein [Arabidopsis thaliana]
gi|20197029|gb|AAC27466.2| expressed protein [Arabidopsis thaliana]
gi|330255356|gb|AEC10450.1| uncharacterized protein [Arabidopsis thaliana]
Length = 451
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 163/479 (34%), Positives = 250/479 (52%), Gaps = 54/479 (11%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M L A++ FW+ + S+P++LEG+AR+VPG P PL +R +R R +Q+ ++
Sbjct: 1 MANLNSAIDSVFWDQNVSSPQTLEGTARSVPGEPFPLDGARASRSHRIQQLSLLREGFPL 60
Query: 61 PFLPSFSAGT------LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG--------- 105
+PS + + L +L S +W+ L+GQF K + +K
Sbjct: 61 GIIPSLAPASDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIKADISNAEEWDL 120
Query: 106 ELSKSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHN 165
++ K H + DKSLY++G +++ +LLLS + G N R K + H H+
Sbjct: 121 QVVKDTAKHIV-DKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMLVHPLEKHD 179
Query: 166 FTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSG--LPTQFEGDE 223
T+EA WP LF+D G +WDVP S+ D++S+ +SG Y +H + G P G E
Sbjct: 180 LTVEAAWPDLFLDNKGRFWDVPESLNVDVSSLVPESGVRYRFGLHKSRGNPQPVNAAGVE 239
Query: 224 NSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQ-------PYDIFLSSPHVSAS 276
+ S P SL+PG K AV+YK N D+WR + ++ + PYD+ L PH + S
Sbjct: 240 SGSDAPTSLMPGLCAKAAVSYKVNRDLWRPQEKEGNTEEEDKPVFLPYDLRLKEPHAAIS 299
Query: 277 GIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQR 336
GI+G+++ A I + L +S SAD+FGS +T Q G F +
Sbjct: 300 GIVGSSLAAWITGRGM---------------LVNGKKRSPISADVFGSACYTFQKGRFSK 344
Query: 337 LFSDLTRFHARLDFPSGSKFLSGATRLAQDYFN-SQRPSLETVQAVCPKVFFSVQQQIAG 395
L+ DLTR AR+D PS A LA+ F+ S S +T+ + P++ QQQ+AG
Sbjct: 345 LYGDLTRVDARVDLPS-------AFALAKKLFHASSNNSDDTLWS--PRLNLIFQQQVAG 395
Query: 396 PFSFRVDSGVVIDLKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
P F+VDS + + D ++++ Y+L++L S K VAWY+PK +E MIELR FE
Sbjct: 396 PIVFKVDSQFQVGAARME----DVIYSLNYSLRLLESGKIVAWYSPKRKEGMIELRVFE 450
>gi|15146246|gb|AAK83606.1| At2g44640/F16B22.13 [Arabidopsis thaliana]
gi|19699152|gb|AAL90942.1| At2g44640/F16B22.13 [Arabidopsis thaliana]
Length = 451
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 163/479 (34%), Positives = 250/479 (52%), Gaps = 54/479 (11%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M L A++ FW+ + S+P++LEG+AR+VPG P PL +R +R R +Q+ ++
Sbjct: 1 MANLNSAIDSVFWDQNVSSPQTLEGTARSVPGEPFPLDGARASRSHRIQQLSLLREGFPL 60
Query: 61 PFLPSFSAGT------LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG--------- 105
+PS + + L +L S +W+ L+GQF K + +K
Sbjct: 61 GIIPSLAPASDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIKADISNAEEWDL 120
Query: 106 ELSKSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHN 165
++ K H + DKSLY++G +++ +LLLS + G N R K + H H+
Sbjct: 121 QVVKDTAKHIV-DKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMLVHPLEKHD 179
Query: 166 FTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSG--LPTQFEGDE 223
T+EA WP LF+D G +WDVP S+ D++S+ +SG Y +H + G P G E
Sbjct: 180 LTVEAAWPDLFLDNKGRFWDVPESLNVDVSSLVPESGVRYRFGLHKSRGNPQPVNAAGVE 239
Query: 224 NSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQ-------PYDIFLSSPHVSAS 276
+ S P SL+PG K AV+YK N D+WR + ++ + PYD+ L PH + S
Sbjct: 240 SGSDAPTSLMPGLCAKAAVSYKVNRDLWRPQEKEGNTEEEDKPVFLPYDLRLKEPHAAIS 299
Query: 277 GIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQR 336
GI+G+++ A I + L +S SAD+FGS +T Q G F +
Sbjct: 300 GIVGSSLAAWITGRGM---------------LVNGKKRSPISADVFGSACYTFQKGRFSK 344
Query: 337 LFSDLTRFHARLDFPSGSKFLSGATRLAQDYFN-SQRPSLETVQAVCPKVFFSVQQQIAG 395
L+ DLTR AR+D PS A LA+ F+ S S +T+ + P++ QQQ+AG
Sbjct: 345 LYGDLTRVDARVDLPS-------AFALAKKLFHASSNNSDDTMWS--PRLNLIFQQQVAG 395
Query: 396 PFSFRVDSGVVIDLKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
P F+VDS + + D ++++ Y+L++L S K VAWY+PK +E MIELR FE
Sbjct: 396 PIVFKVDSQFQVGAARME----DVIYSLNYSLRLLESGKIVAWYSPKRKEGMIELRVFE 450
>gi|357513035|ref|XP_003626806.1| hypothetical protein MTR_8g009290 [Medicago truncatula]
gi|355520828|gb|AET01282.1| hypothetical protein MTR_8g009290 [Medicago truncatula]
Length = 465
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 165/478 (34%), Positives = 243/478 (50%), Gaps = 38/478 (7%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M KLR A++ FW+L+ S+P++L+G A+ VPG+P PL S +RL R +Q+ +
Sbjct: 1 MAKLRTAIDSSFWDLNISSPQNLDGWAKFVPGDPIPLDASVSSRLYRHQQLPHITPHFPF 60
Query: 61 PFLPSF------SAGTLQLQRVLPFQFSG-DWFATLLGQFNVLKFVSSLKQG-------- 105
+PS G+ LQ +L F+G W+ GQF K + +K
Sbjct: 61 GIIPSSVPSPKKEHGSFSLQSLL-LDFTGPRWWLAATGQFRPRKMIVDIKNEICNAAEFN 119
Query: 106 -ELSKSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNH 164
+KS H + DKSL++ G S+ + ++L + +G RKK VF + +H
Sbjct: 120 LSTAKSVAKHFI-DKSLFSYGLNSQFALSPSTSVLFGLEGHGEKEKHRKKVVFFQELSDH 178
Query: 165 NFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDEN 224
+ T+EA WP LFVD G YWDVP S++ DLAS+ + G Y MH N+G P +
Sbjct: 179 DLTVEAAWPQLFVDHKGRYWDVPESISVDLASLVARRGLRYRFGMHKNNGSPQATNATD- 237
Query: 225 SSAVPASLLPGFSLKNAVAYKKNVDIWRS--KAQKLKMVQPYDIFLSSPHVSASGIIGAA 282
S P SLLPG K +V Y K +WR+ ++ K + P+D L PH + SGIIG++
Sbjct: 238 -SDTPLSLLPGLCAKASVTYDKIKYLWRNTETEEEYKDLFPHDQGLMVPHEAVSGIIGSS 296
Query: 283 MTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLT 342
+ I + +N D + +S F+A+IFGSV +T Q G F + F DLT
Sbjct: 297 CASWIWNG------KNLIDTGEDPAVSKRRKRSRFNAEIFGSVCYTFQQGRFSKKFGDLT 350
Query: 343 RFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVD 402
R ARLD S SG + + F S + + P++ QQQ+AGP FR D
Sbjct: 351 RVDARLDISSA----SGLAKKILNGFKSSSADIIEQPSASPRLNLIFQQQVAGPVVFRAD 406
Query: 403 SGVVIDLKNRD--ICAHDSVFAIEYALQV----LGSAKAVAWYAPKHQEFMIELRFFE 454
S + ID +R I D + ++ + + + S K VAWY+PK E M+ELR +E
Sbjct: 407 SRISIDSLSRKHGISIEDFICSLNFYMDPNTSDMESGKIVAWYSPKRNEGMVELRLYE 464
>gi|302818490|ref|XP_002990918.1| hypothetical protein SELMODRAFT_161311 [Selaginella moellendorffii]
gi|300141249|gb|EFJ07962.1| hypothetical protein SELMODRAFT_161311 [Selaginella moellendorffii]
Length = 474
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 154/486 (31%), Positives = 235/486 (48%), Gaps = 45/486 (9%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M L ++ +FW + + +SLE R++P P PL + RG+++SR +Q+ +QR +
Sbjct: 1 MGPLGSSLASQFWNEEVESTRSLEACVRSLPREPPPLAMVRGSKISRSQQLLLLQRLLRL 60
Query: 61 PFLPSFSAGT-----LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELS------- 108
PF PS + + L RV+ +W+ATL G+ V + + + +L
Sbjct: 61 PFAPSLAPDCDCPSGIVLDRVVASTGGTNWWATLTGRLRVQRLFAKRPEAKLELNGFSEA 120
Query: 109 --------KSRCLHALR--DKSLYALGFCSELLFTDDDTLLL-----SYDSYGHNNTS-R 152
+ HA R DKSLYAL ++LL +L + S + +G + R
Sbjct: 121 AAEKQQQRDGKWTHAKRLLDKSLYALCMRTKLLLGPSTSLSIKAEMDSLEDWGSSEVPMR 180
Query: 153 KKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHN 212
+ HK H+ +LEA W ++D+ YWDVP +++ D+ASV SG Y L +HH+
Sbjct: 181 VHSKLLHKLKKHDLSLEAAWHERYMDREARYWDVPQAVSLDIASVGLSSGLYYRLGVHHS 240
Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKL--KMVQPYDIFLSS 270
SG G +P +PG + AV+Y K +DIW+ K+ +PY++ +
Sbjct: 241 SGQSQNCCGGAGKE-IPWGAMPGTCAQAAVSYGKEIDIWKEDDAHFGRKLHKPYNLLAAR 299
Query: 271 PHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQ 330
PH++ SG++G + AS+G N S L ++ S SAD FGS+ TAQ
Sbjct: 300 PHITLSGVVGGVVNASLGQRKQPLNAANPS-------LVKSNGDRSVSADAFGSLGVTAQ 352
Query: 331 HGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQ 390
G FQR D T+F AR+D S +S T A+ S PSL Q ++Q
Sbjct: 353 FGLFQRFLLDHTKFAARVDIGGASSTVSARTLSAK----SSSPSLADGQNGI-TFALTMQ 407
Query: 391 QQIAGPFSFRVDSGVVIDLK--NRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMI 448
QQ+ GP RVDS DL N + + ++ +L+ LG+AK V WY P E M
Sbjct: 408 QQVLGPLRARVDSRFSPDLSTWNARPALQEVTYGLDCSLESLGAAKLVVWYCPTRSEGMA 467
Query: 449 ELRFFE 454
E+R E
Sbjct: 468 EIRLLE 473
>gi|302802025|ref|XP_002982768.1| hypothetical protein SELMODRAFT_117300 [Selaginella moellendorffii]
gi|300149358|gb|EFJ16013.1| hypothetical protein SELMODRAFT_117300 [Selaginella moellendorffii]
Length = 474
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 152/486 (31%), Positives = 230/486 (47%), Gaps = 45/486 (9%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M L ++ +FW + + +SLE R++P P PL + RG+++SR +Q+ +QR +
Sbjct: 1 MGPLGSSLASQFWNEEVESTRSLEACVRSLPREPPPLAMVRGSKISRSQQLLLLQRLLRL 60
Query: 61 PFLPSFSAGT-----LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELS------- 108
PF PS + + L RV+ +W+ATL G+ V + + +L
Sbjct: 61 PFAPSLAPDCDCPSGIVLDRVVASTGGTNWWATLTGRLRVQRLFAKRSGAKLELNGFSEA 120
Query: 109 ----------KSRCLHALRDKSLYALGFCSELLFTDDDTLLL-----SYDSYGHNNTS-R 152
K L DKSLYAL ++LL +L + S + +G + R
Sbjct: 121 AAEKQQQRDGKWTYAKRLLDKSLYALCMRTKLLLGPSTSLSIKAEMDSLEDWGSSEVPMR 180
Query: 153 KKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHN 212
+ HK H+ +LEA W F+D+ YWDVP +++ D+ASV SG Y L +HH+
Sbjct: 181 VHSKLLHKLKKHDLSLEAAWHERFMDREARYWDVPQAVSLDIASVGLSSGLYYRLGVHHS 240
Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKL--KMVQPYDIFLSS 270
SG G +P +PG + AV+Y K +DIW+ K+ +PY++ +
Sbjct: 241 SGQSQNCCGGAGKE-IPWGAMPGTCAQAAVSYGKEIDIWKEDDAHFGRKLHKPYNLLAAR 299
Query: 271 PHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQ 330
PH++ SG++G + AS+G N S L ++ S SAD FGS+ TAQ
Sbjct: 300 PHITLSGVVGGVVNASLGQRKQPLNAANPS-------LVKSNGDRSVSADAFGSLGVTAQ 352
Query: 331 HGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQ 390
G FQR D T+F AR+D S + T A+ S PSL Q ++Q
Sbjct: 353 FGLFQRFLLDHTKFAARVDIGGASSTVCARTLSAK----SSSPSLADGQNGI-TFALTMQ 407
Query: 391 QQIAGPFSFRVDSGVVIDLK--NRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMI 448
QQ+ GP R DS DL N + + ++ +L+ LG+AK V WY P E M
Sbjct: 408 QQVLGPLRARADSRFSPDLSTWNARPALQEVTYGLDCSLESLGAAKLVVWYCPTRSEGMA 467
Query: 449 ELRFFE 454
E+R E
Sbjct: 468 EIRLLE 473
>gi|388493962|gb|AFK35047.1| unknown [Medicago truncatula]
Length = 307
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 114/310 (36%), Positives = 161/310 (51%), Gaps = 20/310 (6%)
Query: 153 KKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHN 212
KK F + +H+ T+EA WP LFVD G YWDVP S++ DLAS+ + G Y MH N
Sbjct: 9 KKRCFFQELSDHDLTVEAAWPQLFVDHKGRYWDVPESISVDLASLVARRGLRYRFGMHKN 68
Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRS--KAQKLKMVQPYDIFLSS 270
+G P + S P SLLPG K +V Y K +WR+ ++ K + P+D L
Sbjct: 69 NGSPQATNATD--SDTPLSLLPGLCAKASVTYDKIKYLWRNTETEEEYKDLFPHDQGLMV 126
Query: 271 PHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQ 330
PH + SGIIG++ + I + +N D + +S F+A+IFGSV +T Q
Sbjct: 127 PHEAVSGIIGSSCASWIWNG------KNLIDTGEDPAVSKRRKRSRFNAEIFGSVCYTFQ 180
Query: 331 HGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQ 390
G F + F DLTR ARLD S SG + + F S + + P++ Q
Sbjct: 181 QGRFSKKFGDLTRADARLDISSA----SGLAKKILNGFKSSSADIIEQPSASPRLNLIFQ 236
Query: 391 QQIAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQV----LGSAKAVAWYAPKHQ 444
QQ+AGP FR DS + ID +R I D + ++ + + + S K VAWY+PK
Sbjct: 237 QQVAGPVVFRADSRISIDSLSRKHGISIEDFICSLNFYMDPNTSDMESGKIVAWYSPKRN 296
Query: 445 EFMIELRFFE 454
E M+ELR +E
Sbjct: 297 EGMVELRLYE 306
>gi|168006921|ref|XP_001756157.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692667|gb|EDQ79023.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 560
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 124/449 (27%), Positives = 198/449 (44%), Gaps = 57/449 (12%)
Query: 4 LRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATPFL 63
++ + +FW S+ +S++G + +P P PLGI GTRLSR +Q+ +QRF+ P
Sbjct: 1 MKPVLSSQFWNESVSSSRSIDGISHGLPAEPPPLGIVTGTRLSRSQQLLMLQRFLGIPCT 60
Query: 64 PSF-----SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKF----------------VSSL 102
PS+ + G L L RVL +W+ATL G+ + +
Sbjct: 61 PSYVVDADNKGGLVLDRVLGSAGGENWWATLTGRVRPQRLFRRKCSTGESSSSSSSRTGS 120
Query: 103 KQGELSKSRCL----HALRDKSLYALGFCSELLFTDDDTL-------LLSYDSYGHNNTS 151
SR L H D SLYAL F S L TL LS ++ ++
Sbjct: 121 DSAAQHTSRNLVESAHRFLDNSLYALCFRSRLQLGSRTTLSTTSELDSLSCQAFADQQST 180
Query: 152 --------RKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGP 203
R + HK EA + +VD+ YWDVP +++ D AS+ + G
Sbjct: 181 GEQTRPPWRGRVSLRHKLDQQKIVAEAAYHERYVDRESRYWDVPHTVSFDFASLGAPGGF 240
Query: 204 SYHLTMHHNSGLPTQFEG---DENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKM 260
Y L +HH++G P + G + N+ P LPG L+ A + +K++++W++ + +
Sbjct: 241 RYRLGVHHSAGSPLECGGTRPEVNAGEPPVKALPGLRLQAAASVEKSLNLWKADLESIIS 300
Query: 261 VQPYDIFLSSPHVSASGIIGAAMTASI----GDNSVRSPVENDSDGSTGFHLHAADIKSS 316
+ Y +F + P +S SG++G +++ + G P D ++ I
Sbjct: 301 QKSYSLFDARPCISLSGVLGGLLSSRLDPLYGITQSLDPTGAD---------NSPPIGRR 351
Query: 317 FSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLE 376
SAD+F SV AQ G F+R F D T+ RLD + S + N + +
Sbjct: 352 HSADLFASVGLNAQFGRFKRNFWDFTKMGIRLDIGAVSALKLHKMQDVSPSSNGESGDIG 411
Query: 377 TVQAVCPKVFFSVQQQIAGPFSFRVDSGV 405
+ P + +QQQ+ GP RV S V
Sbjct: 412 DERGH-PTLALELQQQVIGPLRARVHSRV 439
>gi|147785063|emb|CAN64366.1| hypothetical protein VITISV_031303 [Vitis vinifera]
Length = 701
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/339 (32%), Positives = 167/339 (49%), Gaps = 62/339 (18%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M LR AM+ FW+LD S+P++L G+ARAVPG+P PL +R +R R +Q+ F+
Sbjct: 1 MANLRTAMDAAFWDLDISSPQTLHGAARAVPGDPFPLEGARASRALRVQQLSFLGNGFPL 60
Query: 61 PFLPSFSA------GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK-------QGEL 107
+PSFS G+ LQ + + +W+ L GQF K +SS+K + EL
Sbjct: 61 GIIPSFSPTSQKDLGSFSLQSLFLRPSTSNWWLGLTGQFRPKKLISSIKADLSAVDEWEL 120
Query: 108 S--KSRCLHALRDKSLYALGFCSELLFTDDDTLLLS-YDSYG------------------ 146
S K H + DKSL++ G CS+L T +L++S +++ G
Sbjct: 121 STFKEVAKHFI-DKSLFSFGLCSQLSLTSASSLMISIWETLGFVLKWLHYILPSNKILEL 179
Query: 147 ------------HN---NTSRKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMA 191
H + K + P H+ TLEA WP LF+D G YW++P S++
Sbjct: 180 ATKWQIKCTFGQHGCVFEANAKWVALLNLLPFHDITLEAAWPELFIDHKGRYWELPESIS 239
Query: 192 ADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIW 251
L+S+ S+SG Y +H N G P + + P++L+PG K A +Y+K+ D+W
Sbjct: 240 LGLSSLVSESGLRYRFGIHKNGGHPQSVNAINDEA--PSALMPGLCAKAAFSYEKSRDLW 297
Query: 252 RSKAQK----------LKMVQPYDIFLSSPHVSASGIIG 280
R + ++ L YDI L PH + SGIIG
Sbjct: 298 RQREKQEDGIVKTERGLVWRPSYDIRLREPHAAISGIIG 336
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 80/146 (54%), Gaps = 7/146 (4%)
Query: 311 ADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNS 370
A +S F AD+F S T QHG F++ + DLTR ARL+ S S + L N
Sbjct: 560 AKKRSPFGADLFASGCCTFQHGQFRKRYGDLTRVDARLNICSASALAKRVSNLFSSSVNG 619
Query: 371 QRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQ 428
+ L + P++ QQQ+AGP FRVDS +++D D +++ Y+L+
Sbjct: 620 AKDPLSS-----PRLNLIFQQQVAGPIVFRVDSKLLLDSSGGRAGPQLEDFTYSLNYSLR 674
Query: 429 VLGSAKAVAWYAPKHQEFMIELRFFE 454
+L S K VAWY+PK +E MIELR FE
Sbjct: 675 LLRSGKVVAWYSPKRKEGMIELRLFE 700
>gi|217070234|gb|ACJ83477.1| unknown [Medicago truncatula]
Length = 183
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 96/190 (50%), Gaps = 16/190 (8%)
Query: 271 PHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQ 330
PH + SGIIG++ + I + +N D + +S F+A+IFGSV +T Q
Sbjct: 3 PHEAVSGIIGSSCASWIWNG------KNLIDTGEDPAVSKRRKRSRFNAEIFGSVCYTFQ 56
Query: 331 HGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQ 390
G F + F DLTR ARLD S SG + + F S + + P++ Q
Sbjct: 57 QGRFSKKFGDLTRADARLDISSA----SGLAKKILNGFKSSSADIIEQPSASPRLNLIFQ 112
Query: 391 QQIAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQV----LGSAKAVAWYAPKHQ 444
QQ+AGP FR DS + ID +R I D + ++ + + + S K VAWY+PK
Sbjct: 113 QQVAGPVVFRADSRISIDSLSRKHGISIEDFICSLNFYMDPNTSDMESGKIVAWYSPKRN 172
Query: 445 EFMIELRFFE 454
E M+ELR +E
Sbjct: 173 EGMVELRLYE 182
>gi|145348672|ref|XP_001418769.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578999|gb|ABO97062.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 495
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 120/516 (23%), Positives = 205/516 (39%), Gaps = 97/516 (18%)
Query: 2 KKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATP 61
K ++ AM+G F + + ++EG+ RA + + +++ + L+R +Q+ Q+F+ P
Sbjct: 13 KPMKRAMDGAFGDREHGAAANVEGTTRAHASDGSSFVMTQPSGLARSQQVFLAQKFLNLP 72
Query: 62 FLPSF--SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRDK 119
+PS+ S G + L RV+ G+W+ TL + + + ++ + + AL D
Sbjct: 73 AMPSYVESEGGMVLDRVVLNAGRGNWWTTLTARARAQR---AWNTEQVQEGKAHRALLDP 129
Query: 120 SLYALG--FCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKF---------------P 162
S+YALG F + L ++ + + G + F KF P
Sbjct: 130 SMYALGGRFRANAL---ENVAVKAIGELGSVKGVTR--AFAKKFKLSDEKDDDAEAALNP 184
Query: 163 NHNFTLEALWPG--LFVDKCGN--------YWDVPFSMAADLASVASDSGPSYHLTMHHN 212
+++A P L VD N Y D P S + ++S + +Y ++
Sbjct: 185 RGRVSVQAKLPSHLLSVDLSHNERYQTDDRYVDGPTSASVSVSSRGRRA-INYRVSARKW 243
Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKL----KMVQPYDIFL 268
G F+ P L+ V+Y++ +WR + ++ V Y
Sbjct: 244 LGDLVPFDRKHGGGM---RTPPRKELQAGVSYEQQAVLWRGRRRRKPKSSSAVSGYTALP 300
Query: 269 SSPHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADI--FGSVA 326
P ++ GI GA + S+ D D K ++ FGS++
Sbjct: 301 QVPTITVGGIYGAVVRKSLDD----------------------DYKDGEQTELQNFGSLS 338
Query: 327 FTAQHGNFQRLFSDLTRFHARLD---FPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCP 383
AQ G+F R D T + RLD + +K L A + + ++A
Sbjct: 339 VNAQVGSFARPLLDFTSINLRLDAGGMGTPNKPLDAAAQTPLPLADRLITMGGRLKANPV 398
Query: 384 KVFFSVQQQIAGPFSFRVD---SGV---------VIDLKNR------------DICAHDS 419
V S+ QQ+ GP FR + SG + LK R I +
Sbjct: 399 SVTLSLGQQLLGPLRFRAEVRASGAEALSATRAGLSALKERKTMTEVRESIQKQIVKPEV 458
Query: 420 VFAIEYALQ-VLGSAKAVAWYAPKHQEFMIELRFFE 454
V+ ++ AL +GSA+ VAWY QE M ELR F+
Sbjct: 459 VYGLDCALPPTIGSARVVAWYNATRQEAMAELRLFD 494
>gi|410810516|emb|CCI61380.1| hypothetical protein (GL-M16 marker), partial [Lens orientalis]
Length = 55
Score = 77.8 bits (190), Expect = 1e-11, Method: Composition-based stats.
Identities = 38/55 (69%), Positives = 43/55 (78%), Gaps = 2/55 (3%)
Query: 393 IAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQE 445
I GP SFRVDSG+ +DLKN + I AH+ VFA+EYAL VLGSAKAVAW PK QE
Sbjct: 1 IVGPISFRVDSGIAVDLKNLEFPIQAHEPVFAVEYALHVLGSAKAVAWCCPKPQE 55
>gi|410810518|emb|CCI61381.1| hypothetical protein (GL-M16 marker), partial [Lens culinaris
subsp. culinaris]
Length = 55
Score = 76.3 bits (186), Expect = 3e-11, Method: Composition-based stats.
Identities = 37/55 (67%), Positives = 43/55 (78%), Gaps = 2/55 (3%)
Query: 393 IAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQE 445
I GP SFRVDSG+ +DLKN + + AH+ VFA+EYAL VLGSAKAVAW PK QE
Sbjct: 1 IVGPISFRVDSGIAVDLKNLEFPLQAHEPVFAVEYALHVLGSAKAVAWCCPKPQE 55
>gi|308806363|ref|XP_003080493.1| COG1452: Organic solvent tolerance protein OstA (ISS) [Ostreococcus
tauri]
gi|116058953|emb|CAL54660.1| COG1452: Organic solvent tolerance protein OstA (ISS) [Ostreococcus
tauri]
Length = 525
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 110/502 (21%), Positives = 186/502 (37%), Gaps = 96/502 (19%)
Query: 22 SLEGSARAVPG---NPNPLG---ISRGTRLSRPKQIDFMQRFMATPFLPSFS--AGTLQL 73
+++GSARA+ G + + +G + + LSR +Q+ Q+ P +PSF+ G + L
Sbjct: 50 AIDGSARAMGGMETSGDGVGHFSMRVPSGLSRSQQVFLAQKLFNAPAVPSFARDEGGMVL 109
Query: 74 QRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRDKSLYALG-------- 125
RV+ SG W+ T+ + + + ++ + + AL D S+YALG
Sbjct: 110 DRVIANVGSGRWWTTVTARVRAQR---AWNTEQVQEGKAHRALLDASMYALGGRFRMNLL 166
Query: 126 ------FCSELLFTDDDTLLLSYDSYGHNNT---------------SRKKAVFCHKFPNH 164
EL T ++ N T +R + + K P H
Sbjct: 167 ENVALKAVGELGSARAITRAITKKFVKKNETKSDAIEVNAGEAREDARARVLLQAKIPRH 226
Query: 165 NFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDEN 224
E F K ++ D P S + LAS + + L T+ + E+
Sbjct: 227 VINAELSHNERFETKK-SHVDGPTSASVSLASRGRRM---INYRVMARKWLGTELKPFES 282
Query: 225 SSAVPASLLPGFSLKNAVAYKKNVDIW----RSKAQKLKMVQPYDIFLSSPHVSASGIIG 280
+ P + V+ ++ V +W R K V Y P ++ GI G
Sbjct: 283 APDGEMQKPPRNEVLGGVSVEQQVILWHGRRRRKRASTSAVNGYTALPQVPTIAVGGIFG 342
Query: 281 AAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSD 340
S+ D D + G + + F S+A Q G+F R D
Sbjct: 343 TVARKSL-----------DEDYTDGDEVRLQN---------FASIALHGQIGSFSRPLFD 382
Query: 341 LTRFHARLDFPSGS---KFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
T + RLD S K L + + + + ++A V S QQ+ GP
Sbjct: 383 FTSVNLRLDAGSVGNPYKPLDASEQKPMSLGDRLITADGRLKASPTSVTISFAQQLLGPL 442
Query: 398 SFRVD---SGV---------------------VIDLKNRDICAHDSVFAIEYALQ-VLGS 432
FR + SG V D+ +++ + + ++ ++ L LGS
Sbjct: 443 RFRTEVRTSGAEALAATCAGLSAIKQRKPMAQVRDVFKKEVASPEVLYGLDCPLPPTLGS 502
Query: 433 AKAVAWYAPKHQEFMIELRFFE 454
A+AV WY Q+ E+R F+
Sbjct: 503 ARAVVWYNATRQDAFAEIRLFD 524
>gi|217071854|gb|ACJ84287.1| unknown [Medicago truncatula]
Length = 133
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 8/111 (7%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M KLR A++ FW+L+ S+P++L+G A+ VPG+P PL S +RL R +Q+ +
Sbjct: 1 MAKLRTAIDSSFWDLNISSPQNLDGWAKFVPGDPIPLDASVSSRLYRHQQLPHITPHFPF 60
Query: 61 PFLPSF------SAGTLQLQRVLPFQFSGD-WFATLLGQFNVLKFVSSLKQ 104
+PS G+ LQ +L F+G W+ GQF K + +K
Sbjct: 61 GIIPSSVPSPKKEHGSFSLQSLL-LDFTGPRWWLAATGQFRPRKMIVDIKN 110
>gi|307106828|gb|EFN55073.1| hypothetical protein CHLNCDRAFT_134955 [Chlorella variabilis]
Length = 847
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 64/159 (40%), Gaps = 34/159 (21%)
Query: 321 IFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQA 380
+F S A + Q G F R + D T AR+D S + G + S+RP A
Sbjct: 697 VFASAAVSGQLGRFTRPWLDFTAASARVDMALTSPHVVGDS--------SRRPDKHRAFA 748
Query: 381 VCPK-----VFFSVQQQIAGPFSFRVDSGVVID----------------LKNRDICAHDS 419
+ + + S+ QQ+ GP RVD+ +D + + S
Sbjct: 749 LEGRGAWHALSLSLAQQVYGPVRARVDARFALDPTSVPASEGERSTLKGVLQTALSVRPS 808
Query: 420 VFAIEYALQVL-----GSAKAVAWYAPKHQEFMIELRFF 453
+ Y + G+A+ W++PK +E M ELRFF
Sbjct: 809 LLETVYGCDCVLPSTEGAARLAIWFSPKRRECMAELRFF 847
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 66/141 (46%), Gaps = 8/141 (5%)
Query: 2 KKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATP 61
+ ++ ++ +FWE + KSL+G ++P P L ++ + LSR +Q+ + + P
Sbjct: 12 QDVKASIASQFWEQPVHSTKSLDGQLWSLPCEPPALNLTGSSGLSRTQQL-LAAQAVRVP 70
Query: 62 FLPSF--SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRDK 119
PS+ S+ L L RVL W+ L G+ + S ++ K + A+RD
Sbjct: 71 LFPSYDGSSDGLVLDRVLAKAGGRRWWTALSGRIKAQRAYSLAASEQVGK---VDAVRDL 127
Query: 120 SLYALGFCSELLFTDDDTLLL 140
YAL CS F L+L
Sbjct: 128 GNYAL--CSRSRFQLGRRLVL 146
>gi|255071821|ref|XP_002499585.1| predicted protein [Micromonas sp. RCC299]
gi|226514847|gb|ACO60843.1| predicted protein [Micromonas sp. RCC299]
Length = 520
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 59/127 (46%), Gaps = 5/127 (3%)
Query: 1 MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
M+ + A FW+ + + +L+G R + P ++ + L+R +Q+ MQRF+
Sbjct: 9 MEPTKAAFGKAFWDQEAESAAALDGHVRVHQSHAPPFSLAMPSGLARSQQLYMMQRFLNI 68
Query: 61 PFLPSF--SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRD 118
P PS+ + + L RV+ W+ TL + + +S ++ R L D
Sbjct: 69 PATPSYVKAEDGMVLDRVIANVGGRTWWHTLTLRLRAQRAWNSQS---FNEGRPHRILGD 125
Query: 119 KSLYALG 125
SLYALG
Sbjct: 126 PSLYALG 132
>gi|412991412|emb|CCO16257.1| predicted protein [Bathycoccus prasinos]
Length = 714
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 4 LRWAMEGEFWELDQSTPKSLEGSARAVPGNPN-PLGISRGTRLSRPKQIDFMQRFMATPF 62
L+ ++ F+ D S+ ++EG+ R P + P + L+R +Q+ +QRF+ P
Sbjct: 82 LKTSLHKSFFSQDVSSSTTIEGTVRQHPSQTSTPFSFQMPSGLARSQQVHMLQRFLNLPA 141
Query: 63 LPSF--SAGTLQLQRVLPFQFSG-DWFAT 88
LPS+ + G L RV+ QF G +W++T
Sbjct: 142 LPSYIKNEGGFVLDRVVT-QFGGNEWWST 169
>gi|384253755|gb|EIE27229.1| hypothetical protein COCSUDRAFT_38956 [Coccomyxa subellipsoidea
C-169]
Length = 1676
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 26/106 (24%), Positives = 44/106 (41%), Gaps = 6/106 (5%)
Query: 152 RKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHH 211
R + H+ H+ W ++D Y VP S + D+AS G Y + +HH
Sbjct: 1062 RGRTWLKHQVAGHDMLASLAWNEEYLDSAAAYCRVPLSSSLDVASQDRQDGIQYRVGVHH 1121
Query: 212 NSGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQK 257
+ Q E D++ +SL + AVA + +W+ Q+
Sbjct: 1122 VAT--PQTESDDSGGPWRSSL----HFQGAVAVEGEALVWKPAVQQ 1161
>gi|424877962|ref|ZP_18301602.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium leguminosarum bv. trifolii WU95]
gi|392520454|gb|EIW45183.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium leguminosarum bv. trifolii WU95]
Length = 560
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 37/162 (22%), Positives = 66/162 (40%), Gaps = 25/162 (15%)
Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV-PASLLPGFSLKNA----- 241
FS+A D AS+ +G Y + H +GLP +FE N A GF + A
Sbjct: 307 FSLAMDRASMVDIAGFGYPVVNEHATGLPPRFETWRNKGAEGDKDAFMGFDTEKANKILD 366
Query: 242 -VAYKKNVDIWRSKAQKLKMVQPYDI------FLSSPHVSASGIIGAAMTASIG------ 288
YKK D +R+ + P + ++ + ++ G+ A + AS+
Sbjct: 367 DAGYKKGADGFRTTPSGKPIAFPIIVPNGWTDWIDAVQIAVEGLRAAGINASVATPEYEQ 426
Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGS 324
D S + + +DG+T F + + +++ I G+
Sbjct: 427 WRKQIIDGSFEVVMNSRADGATPFRGYYQSLSTTYGGRITGA 468
>gi|424891530|ref|ZP_18315113.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium leguminosarum bv. trifolii WSM2297]
gi|393185525|gb|EJC85561.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium leguminosarum bv. trifolii WSM2297]
Length = 563
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 37/162 (22%), Positives = 66/162 (40%), Gaps = 25/162 (15%)
Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV-PASLLPGFSLKNA----- 241
FS+A D S+ +G Y + H +GLP +FE N SA GF + A
Sbjct: 310 FSLAMDRKSMVDIAGFGYPVVNEHATGLPPRFETWRNKSAEGDKDAFMGFDTEKANKVLD 369
Query: 242 -VAYKKNVDIWRSKAQKLKMVQPYDI------FLSSPHVSASGIIGAAMTASIG------ 288
YKK D +R+ + P + ++ + ++ G+ A + AS+
Sbjct: 370 DAGYKKGADGFRTTPSGKPIAFPIIVPNGWTDWIDAVQIAVEGLRAAGINASVATPEYEQ 429
Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGS 324
D S + + +DG+T F + + +++ I G+
Sbjct: 430 WRKQIIDGSFEVVMNSRADGATPFRGYYQSLSTAYGGRITGA 471
>gi|405381537|ref|ZP_11035364.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium sp. CF142]
gi|397322033|gb|EJJ26444.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium sp. CF142]
Length = 564
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 68/162 (41%), Gaps = 25/162 (15%)
Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV-PASLLPGFSLKNA----- 241
FS+A D S+ +G Y + H SGLP +FEG N +A G A
Sbjct: 311 FSLAMDRKSMVDIAGFGYPVVNEHASGLPPRFEGWRNKAAEGDKDAFMGLDTDKANKILD 370
Query: 242 -VAYKKNVDIWRS----KAQKLKMVQP--YDIFLSSPHVSASGIIGAAMTASIG------ 288
YKK D +R+ K ++ P + ++ + ++ G+ A + AS+
Sbjct: 371 DAGYKKGSDGFRTTPSGKPIAFPIIVPNGWTDWIDAVQIAVEGLRAAGINASVATPEYEQ 430
Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGS 324
D S + + +DG+T F + ++ +++ I G+
Sbjct: 431 WRKQILDGSFEVVMNSRADGATPFRGYYQNLSTAYGGRITGA 472
>gi|399040132|ref|ZP_10735586.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium sp. CF122]
gi|398062017|gb|EJL53803.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium sp. CF122]
Length = 565
Score = 38.5 bits (88), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 66/159 (41%), Gaps = 25/159 (15%)
Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPAS-------LLPGFSLKN 240
FS+A D AS+ +G Y + H SGLP +FE N +A A+ + + +
Sbjct: 312 FSLAMDRASMVDVAGFGYPVASQHASGLPPRFEAWRNKAAEGAADEWMAYDVDKANKVLD 371
Query: 241 AVAYKKNVDIWRSK------AQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIG------ 288
YKK D +R+ A + + + ++ ++A G+ A S+
Sbjct: 372 DAGYKKGADNFRTTPGGQPIAFTIIIPNGWTDWIDDAQIAAEGLRAAGFNVSVATPEYEQ 431
Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADI 321
D S + + +DGST F + + ++F+ I
Sbjct: 432 WQKQLLDGSFDVALNSRADGSTPFRGYYQSMSTAFAGRI 470
>gi|424919956|ref|ZP_18343319.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium leguminosarum bv. trifolii WSM597]
gi|392848971|gb|EJB01493.1| ABC-type dipeptide transport system, periplasmic component
[Rhizobium leguminosarum bv. trifolii WSM597]
Length = 557
Score = 38.1 bits (87), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 36/162 (22%), Positives = 65/162 (40%), Gaps = 25/162 (15%)
Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPA-SLLPGFSLKNA----- 241
FS+A D S+ +G Y + H +GLP +FE N A GF + A
Sbjct: 304 FSLAMDRKSMVDIAGFGYPVVNEHATGLPPRFESWRNKDAEGGKDAFMGFDTEKASKILD 363
Query: 242 -VAYKKNVDIWRSKAQKLKMVQPYDI------FLSSPHVSASGIIGAAMTASIG------ 288
YKK D +R+ + P + ++ + ++ G+ A + AS+
Sbjct: 364 DAGYKKGADGFRTTPSGKPIAFPIIVPNGWTDWIDAVQIAVEGLRAAGINASVATPEYEQ 423
Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGS 324
D S + + +DG+T F + + +++ I G+
Sbjct: 424 WRKEIIDGSFEVVMNSRADGATPFRGYYQSLSTAYGGRITGA 465
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.134 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,110,838,465
Number of Sequences: 23463169
Number of extensions: 288922477
Number of successful extensions: 631045
Number of sequences better than 100.0: 58
Number of HSP's better than 100.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 630768
Number of HSP's gapped (non-prelim): 69
length of query: 455
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 309
effective length of database: 8,933,572,693
effective search space: 2760473962137
effective search space used: 2760473962137
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)