BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 012850
         (455 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224110284|ref|XP_002315471.1| predicted protein [Populus trichocarpa]
 gi|222864511|gb|EEF01642.1| predicted protein [Populus trichocarpa]
          Length = 466

 Score =  639 bits (1649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/467 (68%), Positives = 370/467 (79%), Gaps = 13/467 (2%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M KLRWAM+G FW+LD+STP++LEG  RAVPG P PLG+SRGTRLSRPKQIDF QRFM  
Sbjct: 1   MNKLRWAMDGGFWDLDRSTPRTLEGEGRAVPGEPLPLGVSRGTRLSRPKQIDFFQRFMFA 60

Query: 61  PFLPSFSAGT--LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ-GELSKSRCLHA-- 115
           PF+PS+SA +  L LQRVL   F+ DWFATLL QFN+ KFVSS K+ G L  SR  +   
Sbjct: 61  PFIPSYSASSHGLSLQRVLALPFTQDWFATLLAQFNLQKFVSSFKKNGALQSSRLENIRK 120

Query: 116 -LRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTS--RKKAVFCHK--FPNHNFTLEA 170
            L DKSLYALGFCSELL +  DTLLLS D YG +N    RKKA+F HK  FPNHN  +EA
Sbjct: 121 HLEDKSLYALGFCSELLLSPCDTLLLSLDFYGDDNNKKPRKKAIFHHKACFPNHNLNVEA 180

Query: 171 LWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPA 230
           +WPGL++DK GNYWDVPFSMA DLAS+ASDSG SYH  MHH++G P Q  GDE +  VPA
Sbjct: 181 VWPGLYIDKAGNYWDVPFSMAIDLASLASDSGASYHFCMHHSAGQPMQLGGDE-TVEVPA 239

Query: 231 SLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDN 290
           +LLPG SLK+A + KKNV+IWRS AQKLKMVQP+DIFLS+PH+SASG+IGAA+ A  GDN
Sbjct: 240 TLLPGISLKSAFSLKKNVEIWRSNAQKLKMVQPFDIFLSNPHISASGVIGAAVMACFGDN 299

Query: 291 SVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDF 350
           SVR  V ++S    GF LHA  +KS+   D F SV+FTAQHGNFQRL  DLTRFHARLDF
Sbjct: 300 SVRPQVVDESQQFEGFCLHAPAVKSTLLVDTFSSVSFTAQHGNFQRLLLDLTRFHARLDF 359

Query: 351 PSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLK 410
           PS SKFLSGA RLAQD+ NSQRP++ETVQA+CPK   S QQQIAGPFSFRVDSGV ID K
Sbjct: 360 PSASKFLSGAIRLAQDFCNSQRPTMETVQAICPKATVSFQQQIAGPFSFRVDSGVEIDWK 419

Query: 411 NRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           N+D  +C  D VFAIEYAL VLGSAKAVAWY+PKH+EFM+ELRFFE+
Sbjct: 420 NKDWHMCVDDPVFAIEYALHVLGSAKAVAWYSPKHREFMVELRFFET 466


>gi|359477333|ref|XP_003631965.1| PREDICTED: uncharacterized protein LOC100853208 [Vitis vinifera]
 gi|297736817|emb|CBI26018.3| unnamed protein product [Vitis vinifera]
          Length = 465

 Score =  632 bits (1629), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 314/465 (67%), Positives = 366/465 (78%), Gaps = 12/465 (2%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           MKKLRWAM+G FWELD ST  +L+G ARAVP +P PLG+SRGTRLSRP QIDF QRFM+ 
Sbjct: 1   MKKLRWAMDGGFWELDMSTATTLDGVARAVPDDPLPLGLSRGTRLSRPMQIDFFQRFMSM 60

Query: 61  PFLPSFSAGT--LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGEL---SKSRCLHA 115
           PF+PS S  T    LQR   F F+ +WFA+LLGQFN  KFVSS+K+G L   S+S  L  
Sbjct: 61  PFVPSSSISTHGFSLQRGFTFPFTENWFASLLGQFNFQKFVSSVKEGRLLQPSESSWLQG 120

Query: 116 L----RDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEAL 171
           +     DKSLYALG  SELL T DDTLL+S ++YG     RKKAVF HKFPNHN  +EA+
Sbjct: 121 IGRRFSDKSLYALGLSSELLITPDDTLLVSLEAYGDKKVPRKKAVFLHKFPNHNLMVEAV 180

Query: 172 WPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPAS 231
           WPGLFVDK G YWDVP SMA DLASVASDSG SYHL++HHN+G P QF+G++ +  VPA+
Sbjct: 181 WPGLFVDKFGTYWDVPLSMAIDLASVASDSGASYHLSVHHNTGTPKQFDGNQ-THEVPAT 239

Query: 232 LLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNS 291
           LLPG   K A A KKN+D+WRSKAQKLKMVQP+DIFLS+PH+S SGIIGAA TA +GDNS
Sbjct: 240 LLPGLCAKGAFALKKNIDLWRSKAQKLKMVQPFDIFLSNPHISFSGIIGAAGTACLGDNS 299

Query: 292 VRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFP 351
           VR  VE++S G  GF LH   +KS+  ADIF SVAFTAQHGNFQRLF DLTRF+ARLDFP
Sbjct: 300 VRVQVEDESHGFKGFKLHLPRVKSALVADIFASVAFTAQHGNFQRLFLDLTRFYARLDFP 359

Query: 352 SGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKN 411
           SGSKFL+G TRL QD +NSQ+PSLE  QA+CP    S+QQQI GPFSFR+DSGV ++LKN
Sbjct: 360 SGSKFLAGTTRLTQDLYNSQQPSLEAFQAICPTATLSLQQQIVGPFSFRIDSGVAVNLKN 419

Query: 412 RD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
           R+  I   + VFAIEYALQVLGSAKA+AWY+PKH+EFM+ELRFFE
Sbjct: 420 REWHIDVDEPVFAIEYALQVLGSAKAIAWYSPKHEEFMVELRFFE 464


>gi|147865781|emb|CAN81147.1| hypothetical protein VITISV_020812 [Vitis vinifera]
          Length = 1050

 Score =  632 bits (1629), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 313/466 (67%), Positives = 366/466 (78%), Gaps = 12/466 (2%)

Query: 1    MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
            MKKLRWAM+G FWELD ST  +L+G ARAVP +P PLG+SRGTRLSRP QIDF QRFM+ 
Sbjct: 586  MKKLRWAMDGGFWELDISTATTLDGVARAVPDDPLPLGLSRGTRLSRPMQIDFFQRFMSM 645

Query: 61   PFLPSFSAGT--LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGEL---SKSRCLHA 115
            PF+PS S  T    LQR   F F+ +WFA+LLGQFN  KFVSS+K+G L   S+S  L  
Sbjct: 646  PFVPSSSISTHGFSLQRGFTFPFTENWFASLLGQFNFQKFVSSVKEGRLLQPSESSWLQG 705

Query: 116  L----RDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEAL 171
            +     DKSLYALG  SELL T DDTLL+S ++YG     RKKAVF HKFPNHN  +EA+
Sbjct: 706  IGRRFSDKSLYALGLSSELLITPDDTLLVSLEAYGDKKVPRKKAVFLHKFPNHNLMVEAV 765

Query: 172  WPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPAS 231
            WPGLFVDK G YWDVP SMA DLASVASDSG SYHL++HHN+G P QF+G++ +  VPA+
Sbjct: 766  WPGLFVDKFGTYWDVPLSMAIDLASVASDSGASYHLSVHHNTGTPKQFDGNQ-THEVPAT 824

Query: 232  LLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNS 291
            LLPG   K A A KKN+D+WRSKAQKLKMVQP+DIFLS+PH+S SGIIGAA TA +GDNS
Sbjct: 825  LLPGLCAKGAFALKKNIDLWRSKAQKLKMVQPFDIFLSNPHISFSGIIGAAGTACLGDNS 884

Query: 292  VRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFP 351
            VR  VE++S G  G  LH   +KS+  ADIF SVAFTAQHGNFQRLF DLTRF+ARLDFP
Sbjct: 885  VRVQVEDESHGFKGLKLHLPRVKSALLADIFASVAFTAQHGNFQRLFLDLTRFYARLDFP 944

Query: 352  SGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKN 411
            SGSKFL+G TRL QD +NSQ+PSLE  QA+CP    S+QQQI GPFSFR+DSGV ++LKN
Sbjct: 945  SGSKFLAGTTRLTQDLYNSQQPSLEAFQAICPTATLSLQQQIVGPFSFRIDSGVAVNLKN 1004

Query: 412  RD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
            R+  I   + VFAIEYALQVLGSAKA+AWY+PKH+EFM+ELRFFE+
Sbjct: 1005 REWHIDVDEPVFAIEYALQVLGSAKAIAWYSPKHEEFMVELRFFET 1050


>gi|255556596|ref|XP_002519332.1| conserved hypothetical protein [Ricinus communis]
 gi|223541647|gb|EEF43196.1| conserved hypothetical protein [Ricinus communis]
          Length = 466

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 301/467 (64%), Positives = 353/467 (75%), Gaps = 13/467 (2%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           MKKL WAM+G FW++D STP +LEG AR VPG+P PLGISRGT+LSRPKQ+ F QRFM++
Sbjct: 1   MKKLGWAMDGGFWDVDVSTPVTLEGVARPVPGDPLPLGISRGTKLSRPKQLHFFQRFMSS 60

Query: 61  PFLPSFS----AGTLQLQRVLPF-QFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHA 115
           PF+PSFS         LQ VL F  FS +WF TLLGQFN  KFVS       S S  L+ 
Sbjct: 61  PFIPSFSSHRHGHGFSLQSVLAFPTFSQNWFGTLLGQFNFQKFVSESGASLRSSSSSLNT 120

Query: 116 ----LRDKSLYALGFCSELLFTDDDTLLLSYDSYGH-NNTSRKKAVFCHKFPNHNFTLEA 170
               L D SLYALGF SEL FT DDTLL +++++G     SRKKAV  HKFPNHN TLEA
Sbjct: 121 IGRQLCDASLYALGFSSELSFTPDDTLLFNFNTHGDATRNSRKKAVLHHKFPNHNLTLEA 180

Query: 171 LWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPA 230
           + P LF+D   NYWDVPFSMA DLAS+ SDSGPSYH  MHHN+G P  F G  ++ AVPA
Sbjct: 181 VSPALFIDSSDNYWDVPFSMAIDLASLPSDSGPSYHFCMHHNAGDPKLFGGG-HTLAVPA 239

Query: 231 SLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDN 290
            LLPGFS K+A A+KKN+DIWRSKA KLKMVQP+D+F+S+PH+SASGI+GA MTA IGDN
Sbjct: 240 PLLPGFSFKSAFAFKKNIDIWRSKAPKLKMVQPFDLFISNPHISASGIVGATMTAYIGDN 299

Query: 291 SVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDF 350
           SVR    +     TG  L A   KS+  AD+F SV+ TAQHGNFQRLF DLTR H RLDF
Sbjct: 300 SVRCQEVDRFQSFTGLSLRAIAPKSALLADMFSSVSLTAQHGNFQRLFLDLTRCHLRLDF 359

Query: 351 PSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLK 410
           PSGSKFLSGA +LAQD+FNSQ+PS+ET++A+CP    S+QQQIAGPFSFRVDSGV ID +
Sbjct: 360 PSGSKFLSGAAKLAQDFFNSQQPSMETIKAICPNATISLQQQIAGPFSFRVDSGVAIDRR 419

Query: 411 NR--DICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
            +  D+  HD VFA+EYALQVLGSAKA+AWY PK +EFM+ELRFFE+
Sbjct: 420 KKDWDMHMHDPVFAVEYALQVLGSAKAIAWYCPKQKEFMVELRFFET 466


>gi|357449221|ref|XP_003594887.1| hypothetical protein MTR_2g035830 [Medicago truncatula]
 gi|355483935|gb|AES65138.1| hypothetical protein MTR_2g035830 [Medicago truncatula]
          Length = 468

 Score =  560 bits (1443), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 284/468 (60%), Positives = 346/468 (73%), Gaps = 13/468 (2%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
           M+KLRW M+G  FW+LD STPK+L+GSA  VP +P  LG+SRGTRLSRP+Q+ FMQ FM 
Sbjct: 1   MRKLRWVMDGGGFWDLDISTPKTLDGSACPVPEDPLTLGLSRGTRLSRPRQLQFMQLFMN 60

Query: 60  TPFLPSFSAGT-LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRD 118
            P  P+FS      LQRVL   FS +WF  LLGQFN+ KFVSS+K  +   +R    L++
Sbjct: 61  APLHPTFSQPQGFSLQRVLSLPFSDNWFVFLLGQFNLQKFVSSVKSSDEKPARFSSWLKN 120

Query: 119 -------KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEAL 171
                  KSLYALG CSE   T DDTLL   DSY + +  R KAVF HKFP+H+ T+EA+
Sbjct: 121 FGRHLQQKSLYALGLCSEFQLTPDDTLLFGLDSYDYTDKPRGKAVFHHKFPHHDLTVEAV 180

Query: 172 WPGLFVDKCGNYWDVPFSMAADLASVA-SDSGPSYHLTMHHNSGLPTQFEGDE-NSSAVP 229
           +PGLFVDK GNYWDVP SMA DLAS+  SDS   YHL+ H+NSG P QFE  +  +  VP
Sbjct: 181 YPGLFVDKIGNYWDVPLSMAVDLASLTTSDSSAGYHLSAHYNSGSPKQFESIQYENDRVP 240

Query: 230 ASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGD 289
            +LLPG + K+  +Y+KN+DIWRS+  KLK++QPYDIFLS+P +SASG+IGAA TA  G+
Sbjct: 241 QTLLPGLAFKSVFSYRKNLDIWRSETPKLKLIQPYDIFLSNPLLSASGMIGAAATAYFGE 300

Query: 290 NSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLD 349
           NS R+ +++D   S+G  L A  IKSSF ADIFGSV+FTAQHGNFQRLF DL+RF ARLD
Sbjct: 301 NSTRAQIDDDGQQSSGLFLQAPGIKSSFLADIFGSVSFTAQHGNFQRLFLDLSRFQARLD 360

Query: 350 FPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDL 409
           FPSGSKFL GAT LAQD  NSQ+P++E VQA+CP    S+QQQI GP SFRVDSG+ IDL
Sbjct: 361 FPSGSKFLLGATSLAQDLLNSQKPNMEAVQAICPNATLSLQQQIVGPISFRVDSGIAIDL 420

Query: 410 KNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           KN    I A D ++A+E+ALQVLGSAKAVAW  PK QEFM+ELRF+E+
Sbjct: 421 KNHKWPIQAQDPIYAVEWALQVLGSAKAVAWCCPKRQEFMVELRFYET 468


>gi|297833468|ref|XP_002884616.1| PDE320 [Arabidopsis lyrata subsp. lyrata]
 gi|297330456|gb|EFH60875.1| PDE320 [Arabidopsis lyrata subsp. lyrata]
          Length = 479

 Score =  555 bits (1429), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 274/480 (57%), Positives = 339/480 (70%), Gaps = 26/480 (5%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M +LRW  EG+ W+LD STP +LEG ARAVP +P PLG+SRGTRLSRPKQ++F  RFM +
Sbjct: 1   MNRLRWVEEGDIWDLDMSTPATLEGIARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMTS 60

Query: 61  PFLPSFS----------AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ----GE 106
           P +PSFS           G   LQRVL   FS +W  +LLGQF+V +FVS +++    G 
Sbjct: 61  PLIPSFSPIRPNTGDGGCGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVSEIEKTKTFGR 120

Query: 107 LSKSRCL-------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHN--NTSRKKAVF 157
            S S            L+DKSLYALGFCSE L + DDTLLLSYD+Y  +     R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDTYKGDLEKNPRAKAIF 180

Query: 158 CHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPT 217
            H+FP HN T EA WPGLFVDK G YWDVP SMA DLAS+ ++SGPSYH+ +HHNSG P 
Sbjct: 181 NHEFPLHNLTAEAAWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHVCLHHNSGAPK 240

Query: 218 QFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASG 277
           +   D      P SLLPG SLK+AV+Y+ N+D+WR    KL+  +PYD+FLSSPHV+ SG
Sbjct: 241 KLNSD-TMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCKPYDVFLSSPHVAVSG 299

Query: 278 IIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRL 337
           IIG+ MTA+ G+NS+RS  ENDS+G  GF LH   + S F AD  G  + TAQ+GNFQ+ 
Sbjct: 300 IIGSVMTAAFGENSIRSKSENDSEGVGGFSLHFPSVNSGFMADALGRASLTAQYGNFQKP 359

Query: 338 FSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
           F DLTRFHARLD+P G +FL+GAT +A D  NS++PSLE  Q +CP+V  S+QQQI GPF
Sbjct: 360 FFDLTRFHARLDYPHGLRFLTGATSVAHDLLNSRQPSLEAFQKICPEVLVSLQQQIVGPF 419

Query: 398 SFRVDSGVVIDLKN--RDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           SF+V+SG+ IDLKN    +    +VFAIEYALQVL SAKAVAWY+PK +EFM+EL FFE+
Sbjct: 420 SFKVESGIQIDLKNGANPVTVDKTVFAIEYALQVLLSAKAVAWYSPKQKEFMVELSFFET 479


>gi|18397783|ref|NP_566296.1| protein pigment defective 320 [Arabidopsis thaliana]
 gi|6729003|gb|AAF27000.1|AC016827_11 hypothetical protein [Arabidopsis thaliana]
 gi|14194139|gb|AAK56264.1|AF367275_1 AT3g06960/F17A9_11 [Arabidopsis thaliana]
 gi|16323342|gb|AAL15384.1| AT3g06960/F17A9_11 [Arabidopsis thaliana]
 gi|332640959|gb|AEE74480.1| protein pigment defective 320 [Arabidopsis thaliana]
          Length = 479

 Score =  553 bits (1424), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 274/480 (57%), Positives = 340/480 (70%), Gaps = 26/480 (5%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M ++RW  EG+ W+LD STP +LEG+ARAVP +P PLG+SRGTRLSRPKQ++F  RFMA+
Sbjct: 1   MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60

Query: 61  PFLPSFS----------AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ----GE 106
           P +PSFS           G   LQRVL   FS +W  +LLGQF+V +FV+ + +    G 
Sbjct: 61  PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120

Query: 107 LSKSRCL-------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHN--NTSRKKAVF 157
            S S            L+DKSLYALGFCSE L + DDTLLLSYD+Y  +     R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180

Query: 158 CHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPT 217
            H+FP HN T EA+WPGLFVDK G YWDVP SMA DLAS+ ++SGPSYHL +HHNSG P 
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240

Query: 218 QFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASG 277
           +   D      P SLLPG SLK+AV+Y+ N+D+WR    KL+  +PYD+FLSSPHV+ SG
Sbjct: 241 KLHSD-TMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCKPYDVFLSSPHVAVSG 299

Query: 278 IIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRL 337
           IIG+ MTA+ G+NS+RS  ENDS+G  GF LH   + S F AD  G  + TAQ+GNFQ+ 
Sbjct: 300 IIGSVMTAAFGENSIRSKFENDSEGVGGFSLHFPSVNSGFMADALGRASLTAQYGNFQKF 359

Query: 338 FSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
           F DLTRFHARLDFP G +FL+GAT +AQD  NS++PSLE  Q +CP+V  S+QQQI GPF
Sbjct: 360 FFDLTRFHARLDFPHGLRFLTGATSVAQDLLNSRQPSLEAFQKICPEVLVSLQQQIVGPF 419

Query: 398 SFRVDSGVVIDLKN--RDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           SF+V+SG+ IDL+N    +    +VFAIEYALQVL SAKAV  Y+PK  EFM+ELRFFE+
Sbjct: 420 SFKVESGIEIDLRNGANPVTVDKTVFAIEYALQVLLSAKAVVSYSPKQNEFMVELRFFET 479


>gi|356556155|ref|XP_003546392.1| PREDICTED: uncharacterized protein LOC100793455 [Glycine max]
          Length = 468

 Score =  545 bits (1403), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 288/469 (61%), Positives = 342/469 (72%), Gaps = 19/469 (4%)

Query: 2   KKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           +KLRW M+G  FW+LD STP++L+G A  VPG+P PL +SRGTRLSRP+Q+ FM RFM  
Sbjct: 4   RKLRWVMDGGGFWDLDISTPQTLDGLACPVPGDPLPLSLSRGTRLSRPRQLQFMHRFMHA 63

Query: 61  PFLPSFSAG-TLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG-----ELS---KSR 111
           P +PS +    L L R+L   FS +WF  LLGQF++ +F SS+K       ELS   K+ 
Sbjct: 64  PLIPSCAKPHGLSLHRLLTLPFSDNWFVFLLGQFHLQRFFSSVKSSKEKPKELSSWLKTF 123

Query: 112 CLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEAL 171
             H L+DKSLYALG  SE     DDTLL   D+Y      R KAVF HKFP+H+ T+EA+
Sbjct: 124 GRH-LQDKSLYALGISSEFQLGMDDTLLFGLDAYEDTEKPRGKAVFHHKFPDHDLTVEAV 182

Query: 172 WPGLFVDKCGNYWDVPFSMAADLASVA-SDSGPSYHLTMHHNSGLPTQFEG--DENSSAV 228
           +PGLFVD  GNYWD+PFSMA DLASV  SDS  +YHL   + SG P QFE   ++N    
Sbjct: 183 YPGLFVDTTGNYWDLPFSMAVDLASVTTSDSSTAYHLCARYTSGSPKQFENVQNQNDRVP 242

Query: 229 PASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIG 288
           P +LLPG + K+A +Y+K VDIWRS+A+KLK+VQPYDIFLS+PHVSASGIIGAA T + G
Sbjct: 243 PPTLLPGLAFKSAFSYRKKVDIWRSEAKKLKLVQPYDIFLSNPHVSASGIIGAAATTAFG 302

Query: 289 DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARL 348
           DNS R+ VE   DGS GF L A+ IKSSF ADIF SV+FTAQHGNFQRLF DLTRF ARL
Sbjct: 303 DNSARAQVE---DGSPGFFLQASGIKSSFLADIFASVSFTAQHGNFQRLFLDLTRFQARL 359

Query: 349 DFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVID 408
           DFPSG KFLS AT L QD  NSQ+PS++ VQA+ P    S+QQQI GP SFRVDSG+ +D
Sbjct: 360 DFPSGFKFLSAATGLTQDLLNSQKPSMDAVQAIIPNATLSLQQQIVGPVSFRVDSGITVD 419

Query: 409 LKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           LKN D  I A + VFA+EYALQVLGSAKAVAWY PK QEFM ELRFFE+
Sbjct: 420 LKNPDWPIHALEPVFALEYALQVLGSAKAVAWYCPKRQEFMAELRFFET 468


>gi|449518839|ref|XP_004166443.1| PREDICTED: uncharacterized protein LOC101225023 [Cucumis sativus]
          Length = 470

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 256/471 (54%), Positives = 332/471 (70%), Gaps = 17/471 (3%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPN--PLGISRGTRLSRPKQIDFMQRF 57
           MKKLRWAM+G+ FW+LD ST ++L+GSA  VP   +  PLG+SRG RLSR KQIDFMQRF
Sbjct: 1   MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRF 60

Query: 58  MATPFLPSFS-AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ---GELSKSRCL 113
           MA PF+PS+S +    LQRV    FS     TLLGQFN+ KF+SSL +   GE+ +S   
Sbjct: 61  MAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQKFMSSLMKTGSGEMCQSYTS 120

Query: 114 ------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFT 167
                   L  +SLYA+G  +++L   DD+L++S+D YG ++  R KAVF  KF +H+ T
Sbjct: 121 LLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT 180

Query: 168 LEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSA 227
           +EAL PGLF++KCG YWDVP S+  DL SVASDSG SYHL+MH N+G P+Q  G E + +
Sbjct: 181 VEALSPGLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQL-GSEPTHS 239

Query: 228 VPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASI 287
            P  LLPG S K A A+KKN +IWRS A+KLKMVQPYDIFLS+PHVS S IIGA  T+  
Sbjct: 240 APFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYF 299

Query: 288 GDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHAR 347
           GD+  RS  ++  +   GF++ ++ I+S+  AD+F S++F+AQ+G FQ+ + DLTRF A 
Sbjct: 300 GDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSAC 359

Query: 348 LDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVI 407
           +DF SGSKFLSG+  L  D  NS+ P  E+V+A  P   FS+QQQIAGP SFR D+GV I
Sbjct: 360 MDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFSIQQQIAGPVSFRADTGVAI 419

Query: 408 DLKNRD---ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           DL       +   +  FA+EYAL VLGSAKA+AWY+PKH+EFM+ELRF+E+
Sbjct: 420 DLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET 470


>gi|449441494|ref|XP_004138517.1| PREDICTED: uncharacterized protein LOC101205565 [Cucumis sativus]
          Length = 470

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 256/471 (54%), Positives = 331/471 (70%), Gaps = 17/471 (3%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPN--PLGISRGTRLSRPKQIDFMQRF 57
           MKKLRWAM+G+ FW+LD ST ++L+GSA  VP   +  PLG+SRG RLSR KQIDFMQRF
Sbjct: 1   MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRF 60

Query: 58  MATPFLPSFS-AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ---GELSKSRCL 113
           MA PF PS+S +    LQRV    FS     TLLGQFN+ KF+SSL +   GE+ +S   
Sbjct: 61  MAAPFFPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQKFMSSLMKTGSGEMCQSYSS 120

Query: 114 ------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFT 167
                   L  +SLYA+G  +++L   DD+L++S+D YG ++  R KAVF  KF +H+ T
Sbjct: 121 LLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT 180

Query: 168 LEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSA 227
           +EAL PGLF++KCG YWDVP S+  DL SVASDSG SYHL+MH N+G P+Q  G E + +
Sbjct: 181 VEALSPGLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQL-GSEPTHS 239

Query: 228 VPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASI 287
            P  LLPG S K A A+KKN +IWRS A+KLKMVQPYDIFLS+PHVS S IIGA  T+  
Sbjct: 240 APFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYF 299

Query: 288 GDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHAR 347
           GD+  RS  ++  +   GF++ ++ I+S+  AD+F S++F+AQ+G FQ+ + DLTRF A 
Sbjct: 300 GDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSAC 359

Query: 348 LDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVI 407
           +DF SGSKFLSG+  L  D  NS+ P  E+V+A  P   FS+QQQIAGP SFR D+GV I
Sbjct: 360 MDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFSIQQQIAGPVSFRADTGVAI 419

Query: 408 DLKNRD---ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           DL       +   +  FA+EYAL VLGSAKA+AWY+PKH+EFM+ELRF+E+
Sbjct: 420 DLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET 470


>gi|326527073|dbj|BAK04478.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 246/464 (53%), Positives = 308/464 (66%), Gaps = 13/464 (2%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
           ++++RW  + E  WELD  +P ++EG+AR VPG+P PLG+SRG R++R KQ+DF  RFMA
Sbjct: 2   LRQMRWLTDREGRWELDVESPATMEGTARPVPGDPLPLGLSRGPRVTRTKQLDFFHRFMA 61

Query: 60  TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALR 117
           +P +PSFSA    L          + +W  T+L Q +V K V+++K+   ++   +    
Sbjct: 62  SPLVPSFSASRAGLSFHHAHLLHLAHNWSCTILEQLHVQKLVAAVKEKLSNRQEGIPWSN 121

Query: 118 D-----KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALW 172
           D       + +LG  +ELL T D  LLL    Y  N   R KA+  HK P  N TL A W
Sbjct: 122 DLKRHLHDVISLGVGTELLITPDTALLLEL--YNINKGDRGKAIIHHKLPQQNLTLAASW 179

Query: 173 PGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASL 232
           PGLFVDK G YWDVP S++ADLASV S SG SYHL +  NSG P  F GDE S  VP +L
Sbjct: 180 PGLFVDKQGVYWDVPLSLSADLASVGSSSGLSYHLLLQQNSGEPKCFGGDETSD-VPLAL 238

Query: 233 LPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSV 292
           LPG   K A++ KK++D WR K  KLKMVQPYD FLS PHVS +GI+GA  + S+GD S 
Sbjct: 239 LPGLCAKAAISIKKSIDAWRKKEDKLKMVQPYDAFLSDPHVSFTGIVGAVASGSLGDCSK 298

Query: 293 RSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPS 352
           R  V +++  S  F +     K +  AD+F SV FTAQHGNFQRLF DLTR   RLD  S
Sbjct: 299 RILVPDETRKSNAFRVFHERNKFAAFADLFASVTFTAQHGNFQRLFLDLTRVSTRLDISS 358

Query: 353 GSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNR 412
           GS FL GA+RLAQD+F S+RP LET   VCP V  S+QQQI GPFSFRV+S V IDL+N+
Sbjct: 359 GSLFLRGASRLAQDFFFSRRPDLETFCDVCPDVIVSLQQQIVGPFSFRVESSVAIDLRNQ 418

Query: 413 D--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
           D  +   DSVFAI++AL+VLGSAKA AWY+PKHQE M+ELRFFE
Sbjct: 419 DHFVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVELRFFE 462


>gi|218187895|gb|EEC70322.1| hypothetical protein OsI_01191 [Oryza sativa Indica Group]
          Length = 463

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 249/469 (53%), Positives = 314/469 (66%), Gaps = 21/469 (4%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
           ++++RW  +G+  WELD  TP ++EG+AR VPG+P PLG+SRG R++R KQ+DF+ RFMA
Sbjct: 2   LRRMRWMADGDGRWELDVETPATMEGAARPVPGDPLPLGLSRGPRVTRTKQLDFLHRFMA 61

Query: 60  TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG-----ELSKS-- 110
           +P +PSFSA    L +        + +   T+L Q +V K V+ +K+      E+S S  
Sbjct: 62  SPLVPSFSASGAGLSVHHAHLVHLAHNLSFTILEQLHVQKLVAVVKEKLSNRQEVSWSND 121

Query: 111 --RCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTL 168
             R LH      + +LG  +ELL T D TLLL    Y      R KA+F HK P+ N TL
Sbjct: 122 IKRHLH-----DVMSLGVGTELLITPDTTLLLEL--YNLKKGDRGKAIFRHKLPHQNITL 174

Query: 169 EALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV 228
           EA WPGLF+DK G YWDVP S++ADLASV S+SG SYH+ +  NSG P  F G E +S V
Sbjct: 175 EASWPGLFIDKNGVYWDVPLSLSADLASVGSESGLSYHVLLQQNSGEPKCFGGGEETSDV 234

Query: 229 PASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIG 288
           P +LLPG   K AV+ KK++D+WR K  KLK VQPYD+FLS PHVS +GIIGA  + S G
Sbjct: 235 PVALLPGLCAKAAVSIKKSIDVWRKKEDKLKNVQPYDVFLSEPHVSFTGIIGAVASGSFG 294

Query: 289 DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARL 348
           D S R  ++N+   S  F       K +  AD+F SV FTAQHGNFQRL  DLTR  ARL
Sbjct: 295 DCSKRMSMQNEIQKSNAFKYFDERNKFAAFADLFASVNFTAQHGNFQRLLLDLTRVSARL 354

Query: 349 DFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVID 408
           D  SGS FL GA++LAQD+F S+RP LET   VCP V  S+QQQI GPFSFRV+S V ID
Sbjct: 355 DISSGSLFLRGASQLAQDFFFSRRPDLETFCDVCPDVIVSLQQQIVGPFSFRVESTVTID 414

Query: 409 LKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
            K +D  +   DSVFAI++AL+VLGSAKA AWY+PKHQE M+ELRFFE+
Sbjct: 415 PKKQDHFVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVELRFFET 463


>gi|115435680|ref|NP_001042598.1| Os01g0251000 [Oryza sativa Japonica Group]
 gi|113532129|dbj|BAF04512.1| Os01g0251000 [Oryza sativa Japonica Group]
 gi|215712321|dbj|BAG94448.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  464 bits (1193), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 249/469 (53%), Positives = 314/469 (66%), Gaps = 21/469 (4%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
           ++++RW  +G+  WELD  TP ++EG+AR VPG+P PLG+SRG R++R KQ+DF+ RFMA
Sbjct: 2   LRRMRWMADGDGRWELDVETPATMEGAARPVPGDPLPLGLSRGPRVTRTKQLDFLHRFMA 61

Query: 60  TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG-----ELSKS-- 110
           +P +PSFSA    L +        + +   T+L Q +V K V+ +K+      E+S S  
Sbjct: 62  SPLVPSFSASGAGLSVHHAHLVHLAHNLSFTILEQLHVQKLVAVVKEKLSNRQEVSWSND 121

Query: 111 --RCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTL 168
             R LH      + +LG  +ELL T D TLLL    Y      R KA+F HK P+ N TL
Sbjct: 122 IKRHLH-----DVMSLGVGTELLITPDTTLLLEL--YNLKKGDRGKAIFRHKLPHQNITL 174

Query: 169 EALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV 228
           EA WPGLF+DK G YWDVP S++ADLASV S+SG SYH+ +  NSG P  F G E +S V
Sbjct: 175 EASWPGLFIDKNGVYWDVPLSLSADLASVGSESGLSYHVLLQQNSGEPKCFGGGEETSDV 234

Query: 229 PASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIG 288
           P +LLPG   K AV+ KK++D+WR K  KLK VQPYD+FLS PHVS +GIIGA  + S G
Sbjct: 235 PVALLPGLCAKAAVSIKKSIDVWRKKEDKLKNVQPYDVFLSEPHVSFTGIIGAVASGSFG 294

Query: 289 DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARL 348
           D S R  ++N+   S  F       K +  AD+F SV FTAQHGNFQRL  DLTR  ARL
Sbjct: 295 DCSKRMSMQNEILKSNAFKYFDERNKFAAFADLFASVNFTAQHGNFQRLLLDLTRVSARL 354

Query: 349 DFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVID 408
           D  SGS FL GA++LAQD+F S+RP LET   VCP V  S+QQQI GPFSFRV+S V ID
Sbjct: 355 DISSGSLFLRGASQLAQDFFFSRRPDLETFCDVCPDVIVSLQQQIVGPFSFRVESTVTID 414

Query: 409 LKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
            K +D  +   DSVFAI++AL+VLGSAKA AWY+PKHQE M+ELRFFE+
Sbjct: 415 PKKQDHFVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVELRFFET 463


>gi|344313274|gb|AEN14334.1| hypothetical protein rf1-C1-g5 [Zea mays]
          Length = 464

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 234/465 (50%), Positives = 309/465 (66%), Gaps = 13/465 (2%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
           ++++RW  EG+  WELD  TP ++EG+AR VPG+P PLG+SRG R++RPKQ+DF  RFMA
Sbjct: 3   LRRMRWMSEGDGRWELDAETPVTMEGTARPVPGDPLPLGLSRGYRITRPKQLDFFHRFMA 62

Query: 60  TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALR 117
           +P +P+FSA    L +     F  + +W +T+L + NV K VS +K+   ++       +
Sbjct: 63  SPLVPTFSAMRDGLSVNHAHIFYMTDNWSSTILEKINVKKLVSVVKEKLANRQEEASWTK 122

Query: 118 D-----KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALW 172
           D       + +LG  +E+L T D TLLL        N  R KA+F H+ P+HN T +A W
Sbjct: 123 DLKNHLHDVMSLGVGTEILITPDTTLLLELFDIKKGN--RGKAIFHHELPHHNITFQASW 180

Query: 173 PGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASL 232
           PGLFVDK G YWDVP S++ADLASV S SG SYHL +  NSG P  F GDE    VP +L
Sbjct: 181 PGLFVDKKGTYWDVPLSLSADLASVGSSSGLSYHLLLQQNSGEPKCFGGDETDD-VPTAL 239

Query: 233 LPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSV 292
           LPG   K A++ KK++D WR K  KLK VQPYD+FLS  HVS +GI+G   +  +GD S 
Sbjct: 240 LPGLCAKVAISMKKSIDAWRKKEDKLKKVQPYDVFLSDSHVSLTGIVGGVASGYLGDCSR 299

Query: 293 RSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPS 352
           R  + +++  S  F +     K +  AD+F SV FTAQ+GNFQRLF DLT+  AR D  S
Sbjct: 300 RVAIRDETHKSNAFIMFDERNKRAAFADLFASVTFTAQYGNFQRLFLDLTKASARFDITS 359

Query: 353 GSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNR 412
           GS FL GA+RLAQD+F S+RP +ET   +CP V  S QQQI GPFSFRV+S V ID +++
Sbjct: 360 GSLFLCGASRLAQDFFFSRRPDVETFCDICPDVTVSFQQQIVGPFSFRVESSVAIDPRSQ 419

Query: 413 D--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           D  +   D +FAI++AL+VLGSAKA AWY+PKHQE M+ELRF+E+
Sbjct: 420 DHFVRVDDPIFAIDWALKVLGSAKATAWYSPKHQEAMMELRFYET 464


>gi|222618108|gb|EEE54240.1| hypothetical protein OsJ_01114 [Oryza sativa Japonica Group]
          Length = 479

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 249/485 (51%), Positives = 314/485 (64%), Gaps = 37/485 (7%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
           ++++RW  +G+  WELD  TP ++EG+AR VPG+P PLG+SRG R++R KQ+DF+ RFMA
Sbjct: 2   LRRMRWMADGDGRWELDVETPATMEGAARPVPGDPLPLGLSRGPRVTRTKQLDFLHRFMA 61

Query: 60  TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG-----ELSKS-- 110
           +P +PSFSA    L +        + +   T+L Q +V K V+ +K+      E+S S  
Sbjct: 62  SPLVPSFSASGAGLSVHHAHLVHLAHNLSFTILEQLHVQKLVAVVKEKLSNRQEVSWSND 121

Query: 111 --RCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHK-------- 160
             R LH      + +LG  +ELL T D TLLL    Y      R KA+F HK        
Sbjct: 122 IKRHLH-----DVMSLGVGTELLITPDTTLLLEL--YNLKKGDRGKAIFRHKANYLDTFY 174

Query: 161 --------FPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHN 212
                    P+ N TLEA WPGLF+DK G YWDVP S++ADLASV S+SG SYH+ +  N
Sbjct: 175 AIWYLVMKLPHQNITLEASWPGLFIDKNGVYWDVPLSLSADLASVGSESGLSYHVLLQQN 234

Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPH 272
           SG P  F G E +S VP +LLPG   K AV+ KK++D+WR K  KLK VQPYD+FLS PH
Sbjct: 235 SGEPKCFGGGEETSDVPVALLPGLCAKAAVSIKKSIDVWRKKEDKLKNVQPYDVFLSEPH 294

Query: 273 VSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHG 332
           VS +GIIGA  + S GD S R  ++N+   S  F       K +  AD+F SV FTAQHG
Sbjct: 295 VSFTGIIGAVASGSFGDCSKRMSMQNEILKSNAFKYFDERNKFAAFADLFASVNFTAQHG 354

Query: 333 NFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQ 392
           NFQRL  DLTR  ARLD  SGS FL GA++LAQD+F S+RP LET   VCP V  S+QQQ
Sbjct: 355 NFQRLLLDLTRVSARLDISSGSLFLRGASQLAQDFFFSRRPDLETFCDVCPDVIVSLQQQ 414

Query: 393 IAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIEL 450
           I GPFSFRV+S V ID K +D  +   DSVFAI++AL+VLGSAKA AWY+PKHQE M+EL
Sbjct: 415 IVGPFSFRVESTVTIDPKKQDHFVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVEL 474

Query: 451 RFFES 455
           RFFE+
Sbjct: 475 RFFET 479


>gi|357129435|ref|XP_003566367.1| PREDICTED: uncharacterized protein LOC100827567 [Brachypodium
           distachyon]
          Length = 461

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 240/465 (51%), Positives = 306/465 (65%), Gaps = 15/465 (3%)

Query: 1   MKKLRWAMEGE-FWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
           ++++RW  + E  WELD  +P ++EG+AR VPG+P PLG+SRG R++R KQ+DF  RFMA
Sbjct: 2   LRQMRWMTDREGRWELDVESPATMEGTARPVPGDPLPLGLSRGPRVTRTKQLDFFHRFMA 61

Query: 60  TPFLPSFSA--GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALR 117
           +P +P+FS     L L     F  + +W  T+L Q +V K V  +K+   ++   +    
Sbjct: 62  SPLVPAFSPYRSGLSLDHAHLFHLAENWSFTILEQLHVKKLVEVVKEKLSNRQEGVPWTN 121

Query: 118 D-----KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALW 172
           D       + +LG  +ELL T D TLLL    Y      R KA+  HK P  N  LEA W
Sbjct: 122 DLKRQLHDVMSLGVGTELLITPDTTLLLEL--YNIKKGERGKAIINHKLPQQNLKLEASW 179

Query: 173 PGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASL 232
           PGLFVDK G YWDVP S++AD ASV S SG SY L +  NSG P  F GDE +  VP +L
Sbjct: 180 PGLFVDKQGVYWDVPLSLSADFASVGSSSGLSYRLLVQQNSGEPKCFGGDETND-VPLAL 238

Query: 233 LPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSV 292
           LPG  +K A + KK++D+WR K  KLKMVQPYD+FLS PHVS + I+GA  + S+GD S 
Sbjct: 239 LPGLCVKAAFSIKKSIDVWRKKEDKLKMVQPYDVFLSDPHVSFTRIVGAVASGSLGDCSK 298

Query: 293 RSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPS 352
           R  V+++   S  F +     K +  AD+F SV FTAQHGNFQRLF DLTR  ARLD  S
Sbjct: 299 RISVQDER--SNAFRVFNDRNKFAAFADLFASVTFTAQHGNFQRLFLDLTRVSARLDISS 356

Query: 353 GSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNR 412
           GS FL GA+RLAQD+F S+RP LET   VCP    S+QQQI GPFSFRV+S V ID +++
Sbjct: 357 GSLFLRGASRLAQDFFFSRRPDLETFCDVCPNTIVSLQQQIVGPFSFRVESSVAIDPRSQ 416

Query: 413 D--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           D  +   DSVFAI++AL+VLGSAKA AWY+PKHQE M+ELRFFE+
Sbjct: 417 DRLVRVDDSVFAIDWALKVLGSAKATAWYSPKHQEAMVELRFFEA 461


>gi|56783809|dbj|BAD81221.1| unknown protein [Oryza sativa Japonica Group]
          Length = 439

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 240/446 (53%), Positives = 298/446 (66%), Gaps = 20/446 (4%)

Query: 23  LEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATPFLPSFSA--GTLQLQRVLPFQ 80
           +EG+AR VPG+P PLG+SRG R++R KQ+DF+ RFMA+P +PSFSA    L +       
Sbjct: 1   MEGAARPVPGDPLPLGLSRGPRVTRTKQLDFLHRFMASPLVPSFSASGAGLSVHHAHLVH 60

Query: 81  FSGDWFATLLGQFNVLKFVSSLKQG-----ELSKS----RCLHALRDKSLYALGFCSELL 131
            + +   T+L Q +V K V+ +K+      E+S S    R LH      + +LG  +ELL
Sbjct: 61  LAHNLSFTILEQLHVQKLVAVVKEKLSNRQEVSWSNDIKRHLH-----DVMSLGVGTELL 115

Query: 132 FTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMA 191
            T D TLLL    Y      R KA+F HK P+ N TLEA WPGLF+DK G YWDVP S++
Sbjct: 116 ITPDTTLLLEL--YNLKKGDRGKAIFRHKLPHQNITLEASWPGLFIDKNGVYWDVPLSLS 173

Query: 192 ADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIW 251
           ADLASV S+SG SYH+ +  NSG P  F G E +S VP +LLPG   K AV+ KK++D+W
Sbjct: 174 ADLASVGSESGLSYHVLLQQNSGEPKCFGGGEETSDVPVALLPGLCAKAAVSIKKSIDVW 233

Query: 252 RSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAA 311
           R K  KLK VQPYD+FLS PHVS +GIIGA  + S GD S R  ++N+   S  F     
Sbjct: 234 RKKEDKLKNVQPYDVFLSEPHVSFTGIIGAVASGSFGDCSKRMSMQNEILKSNAFKYFDE 293

Query: 312 DIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQ 371
             K +  AD+F SV FTAQHGNFQRL  DLTR  ARLD  SGS FL GA++LAQD+F S+
Sbjct: 294 RNKFAAFADLFASVNFTAQHGNFQRLLLDLTRVSARLDISSGSLFLRGASQLAQDFFFSR 353

Query: 372 RPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQV 429
           RP LET   VCP V  S+QQQI GPFSFRV+S V ID K +D  +   DSVFAI++AL+V
Sbjct: 354 RPDLETFCDVCPDVIVSLQQQIVGPFSFRVESTVTIDPKKQDHFVRVDDSVFAIDWALKV 413

Query: 430 LGSAKAVAWYAPKHQEFMIELRFFES 455
           LGSAKA AWY+PKHQE M+ELRFFE+
Sbjct: 414 LGSAKATAWYSPKHQEAMVELRFFET 439


>gi|242052461|ref|XP_002455376.1| hypothetical protein SORBIDRAFT_03g009540 [Sorghum bicolor]
 gi|241927351|gb|EES00496.1| hypothetical protein SORBIDRAFT_03g009540 [Sorghum bicolor]
          Length = 464

 Score =  430 bits (1106), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 232/467 (49%), Positives = 311/467 (66%), Gaps = 17/467 (3%)

Query: 1   MKKLRW-AMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMA 59
           ++++RW + E   WELD  TP ++EG+ R VPG+P PLG+SRG R++RPKQ+DF  RFMA
Sbjct: 3   LRRMRWMSDEDGRWELDAETPVTMEGTVRPVPGDPLPLGLSRGYRVTRPKQLDFFHRFMA 62

Query: 60  TPFLPSFSAGT--LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK------QGELSKSR 111
           +P +P+FSA    L +        + +W + +L + NV K +S +K      Q E S ++
Sbjct: 63  SPLVPTFSATRDGLSVNHAHILYITDNWSSIILEKINVNKLMSVVKEKFANRQEEASWTK 122

Query: 112 CLHA-LRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEA 170
            L   L D  + +LG  +E+L T D TLLL        N  R KA+F H+ P+HN T++A
Sbjct: 123 DLKKHLHD--VMSLGVGTEILITPDTTLLLELFDIKKGN--RGKAIFHHELPHHNITVQA 178

Query: 171 LWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPA 230
            WPGLFVDK G YWDVP S++ADL+SV S SG SYHL +  NSG P  F GDE    VP 
Sbjct: 179 SWPGLFVDKRGAYWDVPLSLSADLSSVGSSSGLSYHLLLQQNSGEPKCFGGDETDD-VPI 237

Query: 231 SLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDN 290
           +LLPG   K A++ KK++D+WR K  KLK VQPYDIFLS  HVS +GI+GA  +  +G+ 
Sbjct: 238 ALLPGLCAKAAISIKKSIDVWRKKEDKLKSVQPYDIFLSDSHVSLTGIVGAVASGYLGNC 297

Query: 291 SVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDF 350
           S R  + +++  S  F +     K +  AD+F SV FTA++GNFQRLF DLT+  AR D 
Sbjct: 298 SRRVAIRDETQKSNAFRMFDERNKCAAFADLFASVTFTARYGNFQRLFLDLTKASARFDI 357

Query: 351 PSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLK 410
            SGS FL GA+RLAQD+F S+RP +ET   +CP V  S+QQQI GPFSFRV+S V  D +
Sbjct: 358 TSGSLFLCGASRLAQDFFFSRRPDVETFCDICPDVTVSLQQQIVGPFSFRVESSVAFDPR 417

Query: 411 NRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           ++D  +   D +FAI++AL+VLGSAKA AWY+PKHQE M+ELRF+E+
Sbjct: 418 SQDHFVRVDDPIFAIDWALKVLGSAKATAWYSPKHQEAMVELRFYET 464


>gi|356532618|ref|XP_003534868.1| PREDICTED: uncharacterized protein LOC100804638 [Glycine max]
          Length = 379

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 229/384 (59%), Positives = 272/384 (70%), Gaps = 21/384 (5%)

Query: 85  WFATLLGQFNVLKFVSSLKQG-----ELS---KSRCLHALRDKSLYALGFCSELLFTDDD 136
           W   LLGQF++ +FVSS+K       ELS   K+   H L+ KSLYALG  SE     DD
Sbjct: 4   WLVFLLGQFHLQRFVSSVKSSKEKPKELSSWLKTFGRH-LQQKSLYALGISSEFQLGVDD 62

Query: 137 TLLLSYDSYGHNNTSRKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLAS 196
           TL    D+Y      R KAVF HKFP+H+  +EA++PG FVD   NYWDVPFS+A DLAS
Sbjct: 63  TLHFGLDAYEDTEKPRGKAVFHHKFPDHDLKVEAVYPGHFVDTTSNYWDVPFSVAVDLAS 122

Query: 197 VA-SDSGPSYHLTMHHNSGLPTQFEG--DENSSAVPASLLPGFSLKNAVAYKKNVDIWRS 253
           V  SDS  +YHL+ H+ SG P QFE   ++N    P +LLPG + K+  +Y+K VDIWRS
Sbjct: 123 VTTSDSSTAYHLSAHYTSGSPKQFENIQNQNDRVPPPTLLPGLAFKSVFSYRKKVDIWRS 182

Query: 254 KAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADI 313
           +A+KLK+VQPYDIFLS+PHVSASGIIGAA T + GDNS R+ VE   DGS GF L A+ I
Sbjct: 183 EAKKLKLVQPYDIFLSNPHVSASGIIGAAATTAFGDNSARAQVE---DGSPGFFLQASGI 239

Query: 314 KSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRP 373
           KSSF ADIF SV+FTAQHGNFQRLF DLTRF ARLDFPSG KFLS AT       NSQ P
Sbjct: 240 KSSFLADIFASVSFTAQHGNFQRLFLDLTRFQARLDFPSGFKFLSAATGRTH-LLNSQNP 298

Query: 374 SLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKN--RDICAHDSVFAIEYALQVLG 431
           +++T   + P    S+QQQI GP SFRVDSG+ +DLKN  R I A + VFA+EYAL+VLG
Sbjct: 299 AIQT---ILPNATLSLQQQIVGPVSFRVDSGITVDLKNPDRPIHAPEPVFALEYALKVLG 355

Query: 432 SAKAVAWYAPKHQEFMIELRFFES 455
           SAKAVAWY PK QEF+ ELRFFE+
Sbjct: 356 SAKAVAWYFPKRQEFVAELRFFET 379


>gi|414876816|tpg|DAA53947.1| TPA: hypothetical protein ZEAMMB73_402660 [Zea mays]
          Length = 459

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 224/462 (48%), Positives = 293/462 (63%), Gaps = 32/462 (6%)

Query: 23  LEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATPFLPSFSA--GTLQLQRVLPFQ 80
           +EG+AR VPG+P PLG+SRG R++RPKQ+DF  RFMA+P +P+FSA    L +     F 
Sbjct: 1   MEGTARPVPGDPLPLGLSRGYRITRPKQLDFFHRFMASPLVPTFSAMRDGLSVNHAHIFY 60

Query: 81  FSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRD-----KSLYALGFCSELLFTDD 135
            + +W +T+L + NV K VS +K+   ++       +D       + +LG  +E+L T D
Sbjct: 61  MTDNWSSTILEKINVKKLVSVVKEKLANRQEEASWTKDLKNHLHDVMSLGVGTEILITPD 120

Query: 136 DTLLLSYDSYGHNNTSRKKAVFCH--------------------KFPNHNFTLEALWPGL 175
            TLLL        N  R KA+F H                    + P+HN T +A WPGL
Sbjct: 121 TTLLLELFDIKKGN--RGKAIFHHEAKNLDALNLFGIQTPLYLLQLPHHNITFQASWPGL 178

Query: 176 FVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASLLPG 235
           FVDK G YWDVP S++ADLASV S SG SYHL +  NSG P  F GDE    VP +LLPG
Sbjct: 179 FVDKKGTYWDVPLSLSADLASVGSSSGLSYHLLLQQNSGEPKCFGGDETDD-VPTALLPG 237

Query: 236 FSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSVRSP 295
              K A++ KK++D WR K  KLK VQPYD+FLS  HVS +GI+G   +  +GD S R  
Sbjct: 238 LCAKVAISMKKSIDAWRKKEDKLKKVQPYDVFLSDSHVSLTGIVGGVASGYLGDCSRRVA 297

Query: 296 VENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSK 355
           + +++  S  F +     K +  AD+F SV FTAQ+GNFQRLF DLT+  AR D  SGS 
Sbjct: 298 IRDETHKSNAFIMFDERNKRAAFADLFASVTFTAQYGNFQRLFLDLTKASARFDITSGSL 357

Query: 356 FLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNRD-- 413
           FL GA+RLAQD+F S+RP +ET   +CP V  S QQQI GPFSFRV+S V ID +++D  
Sbjct: 358 FLCGASRLAQDFFFSRRPDVETFCDICPDVTVSFQQQIVGPFSFRVESSVAIDPRSQDHF 417

Query: 414 ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFES 455
           +   D +FAI++AL+VLGSAKA AWY+PKHQE M+ELRF+E+
Sbjct: 418 VRVDDPIFAIDWALKVLGSAKATAWYSPKHQEAMMELRFYET 459


>gi|89953447|gb|ABD83319.1| Fgenesh protein 89 [Beta vulgaris]
          Length = 400

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 211/374 (56%), Positives = 268/374 (71%), Gaps = 22/374 (5%)

Query: 86  FATLLGQFNVLKFVSSLKQG---ELSKSRCLHAL----RDKSLYALGFCSELLFTDDDTL 138
           F TLLGQFN+ KFVS++++    ELS S  +  L     +KS YA GFCSE L T +DTL
Sbjct: 7   FTTLLGQFNLQKFVSAVRKAKSEELSDSSWMQVLGRHLSEKSFYAFGFCSEFLVTSEDTL 66

Query: 139 LLSYDSYGHNNTSRKKAVFCHK------------FPNHNFTLEALWPGLFVDKCGNYWDV 186
           L SY++ G   T+RKKAVF HK            FP+ N T+E +WPGLF+DK GNYWDV
Sbjct: 67  LFSYEASGDTKTARKKAVFHHKASTKRYEGTLYQFPHLNLTVERVWPGLFIDKHGNYWDV 126

Query: 187 PFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKK 246
           P ++AADLASV SDSG  YHL++HH SG   Q EG++ SS VPA+LLPG  +K+A ++KK
Sbjct: 127 PATLAADLASVTSDSGARYHLSLHHISGAAKQVEGNK-SSDVPATLLPGLYVKSAFSFKK 185

Query: 247 NVDIWRSKAQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGF 306
           N+D WRS+A+KLK VQP+D+FLSSPHVS +G+IG   TA +GDNS+RSP+++        
Sbjct: 186 NIDFWRSQAKKLKFVQPFDLFLSSPHVSGAGLIGVVATALVGDNSLRSPLDDQPKDYRRL 245

Query: 307 HLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQD 366
            L+A+  KS+F AD+F +++F+AQ+GNFQR F DLTR H RLD PSGSKF S A  LA+D
Sbjct: 246 DLYASRGKSAFLADLFATLSFSAQYGNFQRKFFDLTRIHGRLDVPSGSKFCSAAACLAKD 305

Query: 367 YFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNR--DICAHDSVFAIE 424
            + SQ+ SLE VQAVCP    S QQQI GPFSFRVDS + ++L+N+   +   + VFAIE
Sbjct: 306 IYKSQQLSLEAVQAVCPNPMVSFQQQIVGPFSFRVDSKIAVNLENQGCKVSLDNPVFAIE 365

Query: 425 YALQVLGSAKAVAW 438
           YALQVL SAKAVAW
Sbjct: 366 YALQVLFSAKAVAW 379


>gi|297789715|ref|XP_002862795.1| hypothetical protein ARALYDRAFT_920263 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297308522|gb|EFH39053.1| hypothetical protein ARALYDRAFT_920263 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 435

 Score =  352 bits (904), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 175/310 (56%), Positives = 216/310 (69%), Gaps = 15/310 (4%)

Query: 160 KFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQF 219
           +FP HN T EA+W GLFVD  G YWDVP SMA DLAS+ ++SGPSYHL +HHNSG P + 
Sbjct: 127 QFPLHNLTAEAVWLGLFVDNHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPKKL 186

Query: 220 EGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASGII 279
             D      P SLLP  SL +A++Y  N+D+WR    KL+  +PYD+FLSSPHV+ SGII
Sbjct: 187 NSD-TMEVPPPSLLPCLSLTSAISYWTNMDLWRGTTPKLETYKPYDVFLSSPHVAVSGII 245

Query: 280 GAAM----------TASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTA 329
           G               SIG+NS+RS  ENDS+G  GF LH   + S F AD  G  + TA
Sbjct: 246 GMISFFDLFQKLCDDRSIGENSIRSKFENDSEGVGGFSLHFPFVNSGFMADTLGLASLTA 305

Query: 330 QHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSV 389
           Q+ NFQ+LF DLTRFHARLDFP G +FL+GAT ++QD  NS++PSLE  Q +CP+V  S+
Sbjct: 306 QYRNFQKLFFDLTRFHARLDFPYGLRFLTGATSVSQDLLNSRQPSLEAFQKICPEVLVSL 365

Query: 390 QQQIAGPFSFRVDSGVVIDLKN----RDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQE 445
           QQQI GP SF+V+SG+ IDLKN      +    +VFAIEYALQVL SAKAVAWY+PK +E
Sbjct: 366 QQQIVGPSSFKVESGIQIDLKNGVGANPVTVDKTVFAIEYALQVLLSAKAVAWYSPKQKE 425

Query: 446 FMIELRFFES 455
           FM+EL FFE+
Sbjct: 426 FMVELSFFET 435


>gi|42572297|ref|NP_974244.1| protein pigment defective 320 [Arabidopsis thaliana]
 gi|332640960|gb|AEE74481.1| protein pigment defective 320 [Arabidopsis thaliana]
          Length = 341

 Score =  335 bits (858), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 168/303 (55%), Positives = 208/303 (68%), Gaps = 24/303 (7%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M ++RW  EG+ W+LD STP +LEG+ARAVP +P PLG+SRGTRLSRPKQ++F  RFMA+
Sbjct: 1   MNRMRWVGEGDIWDLDMSTPVTLEGTARAVPDDPLPLGLSRGTRLSRPKQVEFFHRFMAS 60

Query: 61  PFLPSFS----------AGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQ----GE 106
           P +PSFS           G   LQRVL   FS +W  +LLGQF+V +FV+ + +    G 
Sbjct: 61  PLIPSFSPIRPNTGDGGGGGFSLQRVLTLPFSNNWLVSLLGQFDVQRFVTEIDKTKAFGR 120

Query: 107 LSKSRCL-------HALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHN--NTSRKKAVF 157
            S S            L+DKSLYALGFCSE L + DDTLLLSYD+Y  +     R KA+F
Sbjct: 121 GSSSTVASRLNTIGKHLKDKSLYALGFCSEFLLSPDDTLLLSYDAYKGDLDKNPRAKAIF 180

Query: 158 CHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPT 217
            H+FP HN T EA+WPGLFVDK G YWDVP SMA DLAS+ ++SGPSYHL +HHNSG P 
Sbjct: 181 NHEFPLHNLTAEAVWPGLFVDKHGEYWDVPLSMAIDLASLPAESGPSYHLCLHHNSGSPK 240

Query: 218 QFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQPYDIFLSSPHVSASG 277
           +   D      P SLLPG SLK+AV+Y+ N+D+WR    KL+  +PYD+FLSSPHV+ SG
Sbjct: 241 KLHSD-TMEVPPPSLLPGLSLKSAVSYRTNMDLWRGTTPKLETCKPYDVFLSSPHVAVSG 299

Query: 278 IIG 280
           IIG
Sbjct: 300 IIG 302


>gi|255571101|ref|XP_002526501.1| conserved hypothetical protein [Ricinus communis]
 gi|223534176|gb|EEF35892.1| conserved hypothetical protein [Ricinus communis]
          Length = 465

 Score =  300 bits (769), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 180/480 (37%), Positives = 263/480 (54%), Gaps = 42/480 (8%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  LR A++  FW+   S+P++LEG AR++PG+P PL  +R +R  R +Q+  +      
Sbjct: 1   MANLRTALDSAFWDQPLSSPQTLEGCARSIPGDPFPLDATRASRALRIQQLSLLANGFPL 60

Query: 61  PFLPSFSAGT------LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK-------QGEL 107
             +PS+S+ +      L +Q +L    S + +  L+GQF   K +SS+K       + EL
Sbjct: 61  GLVPSYSSASPKHPPSLSVQSLLLKLASSNCWLGLIGQFRPKKLISSIKAEFSNAEELEL 120

Query: 108 SKSR-CLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNF 166
           S  R     + DKSLY++G CS+   T   +LLLS + +GH+ T R K +  H+ P+H+ 
Sbjct: 121 SVFRDAAKHIVDKSLYSIGICSQFSPTPSTSLLLSTERHGHSATPRYKFMLFHQLPSHDI 180

Query: 167 TLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSS 226
           TLEA WP LF+D  G YWDVP S++ D+AS+ SD+G  Y   +H N+G P       +  
Sbjct: 181 TLEAAWPELFLDHRGGYWDVPQSISLDMASIGSDTGFRYRFGIHKNNGHPNTINAINDQP 240

Query: 227 AVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMV----------QPYDIFLSSPHVSAS 276
             P +L+PG   K + +Y+K+ D+WR K  K   V          + YD+ LS PH + S
Sbjct: 241 --PFALMPGLCGKASFSYEKSKDLWRKKQSKKDSVIKTDRGSILPRSYDVRLSQPHSAIS 298

Query: 277 GIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQR 336
           GI+G A  A  G   +    +  +  ST         +S  +AD+FGSV +T QHGNF +
Sbjct: 299 GIVGGACAAWFGGRDISVSADGHNSSST-------RKRSPLNADLFGSVCYTFQHGNFTK 351

Query: 337 LFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGP 396
           L+ DLTR  ARLD  S       A  LA+  F     S        P++  ++QQQ+AGP
Sbjct: 352 LYGDLTRIDARLDICS-------ALTLAKRAFRWSSVSDADNALSSPRLNLTLQQQVAGP 404

Query: 397 FSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
             FRVDS   ID  +     H  D V+++ Y+L++L S K VAWY+PK +E M+ELR FE
Sbjct: 405 IVFRVDSRFSIDSSSDQEGPHVEDLVYSLSYSLRLLRSGKVVAWYSPKRKEGMVELRLFE 464


>gi|449432352|ref|XP_004133963.1| PREDICTED: uncharacterized protein LOC101205636 [Cucumis sativus]
 gi|449487568|ref|XP_004157691.1| PREDICTED: uncharacterized protein LOC101227878 [Cucumis sativus]
          Length = 470

 Score =  287 bits (735), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 174/483 (36%), Positives = 264/483 (54%), Gaps = 43/483 (8%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  LR AM+  FW+L+ S+P++L G+A+AVPG P PL  +R +R  R +Q+ F+      
Sbjct: 1   MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRTLRIQQLSFLGNGFPL 60

Query: 61  PFLPSF------SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK-------QGEL 107
             +PS+        G+  LQ +L    S  W+A L+GQF   K +SS+K       Q EL
Sbjct: 61  GIIPSYCPTAHKELGSFSLQSLLFMMPSVKWWAGLVGQFRPKKLISSIKAQISAVEQLEL 120

Query: 108 SKSRCLHAL-RDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHNF 166
           S  + + +L  DKSLY  G CS+       ++ +S +  G     R KA+F H+ P H+ 
Sbjct: 121 SDLKDIASLFLDKSLYTYGICSQFSTGPFSSVYVSTEKLGERKGHRHKAMFYHRLPEHDI 180

Query: 167 TLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSS 226
            ++A WP LF+D  G YWDVP S++ DL+S+ S+SG  Y + +H N G+P       NS 
Sbjct: 181 NVDAAWPELFIDHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGVPRALN-STNSD 239

Query: 227 AVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMV----------QP-YDIFLSSPHVSA 275
             P +LLPG   K A + +KN D+WR    + +M           +P YD+ L  PH + 
Sbjct: 240 DPPLTLLPGLCAKAAFSIEKNRDLWRDNLSEEEMTINYIRTGLKKEPAYDVRLDEPHAAI 299

Query: 276 SGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQ 335
           SGIIG  +++  G +     V ++ DG+          +S  +AD+FGS+ +T QHG F 
Sbjct: 300 SGIIGGTVSSWFGGSDT---VGSNGDGNLTM---GHKKRSPLNADLFGSICYTYQHGKFL 353

Query: 336 RLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQ--AVCPKVFFSVQQQI 393
             F+DLTR  ARL   S S F       A+  F+  + S++ ++     P++    QQQ+
Sbjct: 354 NDFNDLTRIDARLSISSASGF-------AKRVFHVFKKSVDDLERSKSSPRLNLIFQQQV 406

Query: 394 AGPFSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELR 451
           AGP  FR++S +++D  +  I  H  D++ ++ Y+   L SAKAV WY+PK +E M+ELR
Sbjct: 407 AGPIVFRLESKLLLDSASGKIGPHVEDTICSLTYSFLDLESAKAVFWYSPKRKEGMVELR 466

Query: 452 FFE 454
            +E
Sbjct: 467 LYE 469


>gi|225453254|ref|XP_002265990.1| PREDICTED: uncharacterized protein LOC100256535 [Vitis vinifera]
 gi|297734677|emb|CBI16728.3| unnamed protein product [Vitis vinifera]
          Length = 464

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 173/481 (35%), Positives = 254/481 (52%), Gaps = 45/481 (9%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  LR AM+  FW+LD S+P++L G+ARAVPG+P PL  +R +R  R +Q+ F+      
Sbjct: 1   MANLRTAMDAAFWDLDISSPQTLHGAARAVPGDPFPLEGARASRALRVQQLSFLGNGFPL 60

Query: 61  PFLPSFSA------GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK-------QGEL 107
             +PSFS       G+  LQ +     + +W+  L GQF   K +SS+K       + EL
Sbjct: 61  GIIPSFSPTSQKDLGSFSLQSLFLRPSTSNWWLGLTGQFRPKKLISSIKADLSAVDEWEL 120

Query: 108 S--KSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHN 165
           S  K    H + DKSL++ G CS+L  T   +L++S + +G     R + +  H+ P H+
Sbjct: 121 STFKEVAKHFI-DKSLFSFGLCSQLSLTSASSLMVSTEQHGEKKGRRNRVMLFHQLPFHD 179

Query: 166 FTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENS 225
            TLEA WP LF+D  G YW++P S++  L+S+ S+SG  Y   +H N G P       + 
Sbjct: 180 ITLEAAWPELFIDHKGRYWELPESISLGLSSLVSESGLRYRFGIHKNGGHPQSVNAINDE 239

Query: 226 SAVPASLLPGFSLKNAVAYKKNVDIWRSKAQK----------LKMVQPYDIFLSSPHVSA 275
           +  P++L+PG   K A +Y+K+ D+WR + ++          L     YDI L  PH + 
Sbjct: 240 A--PSALMPGLCAKAAFSYEKSRDLWRQREKQEDGIVKTERGLVWRPSYDIRLREPHAAI 297

Query: 276 SGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQ 335
           SGIIG    A  G +          DGS+      A  +S F AD+F S   T QHG F+
Sbjct: 298 SGIIGGTCEAWFGGS------REHGDGSSA----DAKKRSPFGADLFASGCCTFQHGQFR 347

Query: 336 RLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAG 395
           + + DLTR  ARL+  S S      + L     N  +  L +     P++    QQQ+AG
Sbjct: 348 KRYGDLTRVDARLNICSASALAKRVSNLFSSSVNGAKDPLSS-----PRLNLIFQQQVAG 402

Query: 396 PFSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFF 453
           P  FRVDS +++D           D  +++ Y+L++L S K VAWY+PK +E MIELR F
Sbjct: 403 PIVFRVDSKLLLDSSGGRAGPQLEDFTYSLNYSLRLLRSGKVVAWYSPKRKEGMIELRLF 462

Query: 454 E 454
           E
Sbjct: 463 E 463


>gi|356504509|ref|XP_003521038.1| PREDICTED: uncharacterized protein LOC100775974 [Glycine max]
          Length = 464

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 170/479 (35%), Positives = 260/479 (54%), Gaps = 41/479 (8%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M KL+  ++  FW+L+ ++P+  +G A++VPG+P PL  S  +R+ RP+Q+  +   +  
Sbjct: 1   MAKLKTGIDSAFWDLNVASPQLHDGWAKSVPGDPFPLDGSVASRVLRPRQLSVIGNGLPL 60

Query: 61  P-FLPSFSA------GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRC- 112
           P  +PS S       G+  LQ +L    +  W+ T+ GQF   K ++ +K  E+S +   
Sbjct: 61  PVIIPSLSPTSPKDLGSFSLQSLLLKLANPRWWLTMTGQFRPRKLIADVKN-EISNAEEF 119

Query: 113 -LHALRD-------KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNH 164
            L  ++D       KSLY+ G  S+  F    +LLL+ + +G     R+K +  HK  +H
Sbjct: 120 DLSTVKDVVKHFINKSLYSFGLTSQFAFPPSTSLLLAIEGHGEKERLRRKMMVFHKLHDH 179

Query: 165 NFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDEN 224
           + TLEA WP LFVD  G YWDVP S++ DL+S+ S+SG  YH  +H N G P      + 
Sbjct: 180 DLTLEAAWPQLFVDHKGKYWDVPESLSVDLSSLVSESGLRYHFGIHKNGGNPQAMNATDG 239

Query: 225 SSAVPASLLPGFSLKNAVAYKKNVDIWRSKA---QKLKMVQPYDIFLSSPHVSASGIIGA 281
           +   P SLLPG   K AV+Y+K    WR K    Q+ +   PYD+ L  PH + SGIIG+
Sbjct: 240 NP--PLSLLPGLCAKVAVSYEKIKYFWRDKGAAEQENEEALPYDVRLKEPHAAVSGIIGS 297

Query: 282 AMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDL 341
              + I +    S V++  D        +   +S  +AD+FGSV ++ QHG F + + DL
Sbjct: 298 TFASWIWNGRSLSSVDSREDQEV-----STSKRSRHNADLFGSVCYSFQHGKFTKKYGDL 352

Query: 342 TRFHARLDFPSGS----KFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
           TR  ARLD  S S    K L+G++    D   S++PS        P++    QQQ+AGP 
Sbjct: 353 TRVDARLDISSASAFAKKILNGSSSSTADV--SKQPS------ASPRLNLIFQQQVAGPV 404

Query: 398 SFRVDSGVVID--LKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
            FR DS + ++   +   +   D + ++ Y+L+ L S K VAWY+PK +E M+E R +E
Sbjct: 405 VFRADSRIALESFARKNGVSVEDFICSLSYSLKDLQSGKIVAWYSPKRKEGMVEFRMYE 463


>gi|356520726|ref|XP_003529011.1| PREDICTED: uncharacterized protein LOC100792226 [Glycine max]
          Length = 464

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 169/479 (35%), Positives = 260/479 (54%), Gaps = 41/479 (8%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M KL+ +++  FW+ + ++P+  EG  ++VPG+P PL  S  +R+ RP+Q+  +   +  
Sbjct: 1   MAKLKTSIDSSFWDFNVASPQLHEGWVKSVPGDPFPLDGSVASRVLRPRQLSVIGNGLPL 60

Query: 61  PFL-PSFSA------GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRC- 112
           P + PS S       G+  LQ +L    +  W+ T+ GQF   K ++ +K  E+S +   
Sbjct: 61  PVIVPSLSPTSPKDLGSFCLQSLLLKLANPRWWLTMTGQFRPRKLIADVKN-EISNAEEF 119

Query: 113 -LHALRD-------KSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNH 164
            L  ++D       KSLY+ G  S+  F    +LLL+ + +G     R K +  HK P+H
Sbjct: 120 DLSTVKDVAKHFINKSLYSFGLTSQFAFPPSTSLLLAIEGHGEKERLRSKVMVFHKLPDH 179

Query: 165 NFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDEN 224
           + TLEA WP LFVD  G YWDVP S++ DL+S+ S+SG  YH+ MH NS  P        
Sbjct: 180 DLTLEAAWPQLFVDHKGKYWDVPESLSVDLSSLVSESGLRYHIGMHKNSVNPQAMNATNG 239

Query: 225 SSAVPASLLPGFSLKNAVAYKKNVDIWRSKA---QKLKMVQPYDIFLSSPHVSASGIIGA 281
           +   P SLLPG   K AV+Y+K    WR K    Q+ +   PYD+ L  PH + SGIIG+
Sbjct: 240 NP--PLSLLPGLCAKVAVSYEKIKYFWRDKGAAEQENEEALPYDVRLKEPHAAVSGIIGS 297

Query: 282 AMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDL 341
              + I +    S +++  D        +   +S  +AD+FGSV ++ QHG F + + DL
Sbjct: 298 TFASWIWNGRSLSSIDSREDPEV-----STSKRSRHNADLFGSVCYSFQHGKFTKKYGDL 352

Query: 342 TRFHARLDFPSGS----KFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
           TR  ARLD  S S    K L+G++  +   + S++PS        P++    QQQ+AGP 
Sbjct: 353 TRVDARLDISSASAFAKKILNGSS--SSTAYVSEQPS------ASPRLNLIFQQQVAGPV 404

Query: 398 SFRVDSGVVID--LKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
            FR DS + ++   +   +   D + ++ Y+L+ L S K VAWY+PK +E M+E R +E
Sbjct: 405 VFRADSRIALESFARKNGVSVEDFICSLSYSLKDLESGKIVAWYSPKRKEGMVEFRMYE 463


>gi|297824489|ref|XP_002880127.1| hypothetical protein ARALYDRAFT_483595 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325966|gb|EFH56386.1| hypothetical protein ARALYDRAFT_483595 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 455

 Score =  258 bits (659), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 162/477 (33%), Positives = 246/477 (51%), Gaps = 46/477 (9%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  L  A++  FW+ + S+P++LEG+AR+VPG P PL  +R +R  R +Q+  ++     
Sbjct: 1   MANLNSAVDSVFWDQNVSSPQTLEGTARSVPGEPFPLDGARASRSHRIQQLSLLREGFPL 60

Query: 61  PFLPSFSAGT------LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG--------- 105
             +PSF+  +        L  +L    S +W+  L+GQF   K  + +K           
Sbjct: 61  GIIPSFAPASDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIKADISNAEEWDL 120

Query: 106 ELSKSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHN 165
           ++ K    H + DKSLY++G  +++      +LLLS +  G  N  R K +F H    H+
Sbjct: 121 QVVKDTAKHIV-DKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMFVHPLEKHD 179

Query: 166 FTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSG--LPTQFEGDE 223
            T+EA WP LF+D  G +WDVP S+  D++S+  +SG  Y   +H + G   P    G E
Sbjct: 180 LTVEAAWPDLFLDNKGRFWDVPESLNVDVSSLVPESGLRYRFGLHKSRGNPQPVNAAGAE 239

Query: 224 NSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQ------PYDIFLSSPHVSASG 277
           + S  P SL+PG   K AV+YK N D+WR + ++    +      PYDI L  PH + SG
Sbjct: 240 SGSDAPTSLMPGLCAKAAVSYKANRDLWRPQEKEDNTEEGTPEFLPYDIRLKEPHAAISG 299

Query: 278 IIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRL 337
           I+G+++ A I    +               L     +S  SAD+FGS  +T Q G F +L
Sbjct: 300 IVGSSLAAWITGRGM---------------LVNGKKRSPISADVFGSACYTFQKGRFSKL 344

Query: 338 FSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
           + DLTR  AR+D PS S     A R+   +      +        P++    QQQ+AGP 
Sbjct: 345 YGDLTRVDARVDLPSASAL---AKRIFHAFRRLSGSNNSDDTLWSPRLNLIFQQQVAGPI 401

Query: 398 SFRVDSGVVIDLKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
            F+VDS   +     +    D ++++ Y+L++L S K VAWY+PK +E MIELR FE
Sbjct: 402 VFKVDSQFQVGAARME----DLIYSLNYSLRLLESGKVVAWYSPKRKEGMIELRIFE 454


>gi|224137192|ref|XP_002327064.1| predicted protein [Populus trichocarpa]
 gi|222835379|gb|EEE73814.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  254 bits (649), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 172/486 (35%), Positives = 254/486 (52%), Gaps = 48/486 (9%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  LR A++ +FW+   S+ ++LEG A ++PG+P PL ++R ++  R +Q+  +      
Sbjct: 1   MANLRTAVDSQFWDQPISSSQTLEGCAYSIPGDPFPLEVTRASKALRVQQLSVLGNGFPL 60

Query: 61  PFLPSFSA------GTLQLQRV-LPFQFSGDWFATLLGQFNVLKFVSSLKQGELS----- 108
             +PSFS       G+  LQ + L    S  W   L+GQF   K +SS+K GE +     
Sbjct: 61  GTIPSFSPTSTKDLGSFSLQSLFLKLATSNSWLG-LIGQFRPKKLISSIK-GEFTNADEF 118

Query: 109 -----KSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPN 163
                K    H   DKS+Y+LG  S++  +    LL + + +G     R K +  H+ P+
Sbjct: 119 EWPAFKDVAKHVF-DKSIYSLGLFSQISLSSSSVLLST-ERHGDKRRPRYKMMLWHELPD 176

Query: 164 HNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDE 223
           H+ TLEA WPGLF+D  G YWDVP S++ D++S+ S+SG  Y + +H N G P     + 
Sbjct: 177 HDITLEAAWPGLFLDHKGKYWDVPESISLDMSSLPSESGFQYRIGVHKNGGHPQPV--NT 234

Query: 224 NSSAVPASLLPGFSLKNAVAYKKNVDIWRS---------KAQKLKMVQP-YDIFLSSPHV 273
            +  VP +L+PG   K A +Y+K  D WR          K  K K+  P +D+ L  PH 
Sbjct: 235 LNGEVPCALMPGLCAKAAFSYEKRKDFWRQKDKVDDTAVKTDKGKVWHPSFDMRLREPHS 294

Query: 274 SASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGN 333
           + SGIIG    A  G +      E+  D  T         +S  +A++FGSV +T QHG 
Sbjct: 295 AISGIIGGTSVAWFGGSESSPSTESHVDMDTSI---GTKKRSPLNANLFGSVCYTFQHGR 351

Query: 334 FQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNS-QRPSLETVQ--AVCPKVFFSVQ 390
           F +L+ DLTR  ARLD  S       A+ +A+  FN  +R S          PK+   +Q
Sbjct: 352 FTKLYGDLTRVDARLDICS-------ASAVAKRVFNIFRRSSFSNADNPLSSPKLSLILQ 404

Query: 391 QQIAGPFSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQVLGSAKAVAWYAPKHQEFMI 448
           QQ+AGP   RVDS   +   +     H  D + ++ Y+L++L S K VAWY+PK +E M+
Sbjct: 405 QQVAGPIMVRVDSKFSLGSSSGKQGPHVEDLICSLSYSLRLLRSGKVVAWYSPKRKEGMV 464

Query: 449 ELRFFE 454
           ELR FE
Sbjct: 465 ELRLFE 470


>gi|18406590|ref|NP_566021.1| uncharacterized protein [Arabidopsis thaliana]
 gi|20197029|gb|AAC27466.2| expressed protein [Arabidopsis thaliana]
 gi|330255356|gb|AEC10450.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 451

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 163/479 (34%), Positives = 250/479 (52%), Gaps = 54/479 (11%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  L  A++  FW+ + S+P++LEG+AR+VPG P PL  +R +R  R +Q+  ++     
Sbjct: 1   MANLNSAIDSVFWDQNVSSPQTLEGTARSVPGEPFPLDGARASRSHRIQQLSLLREGFPL 60

Query: 61  PFLPSFSAGT------LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG--------- 105
             +PS +  +        L  +L    S +W+  L+GQF   K  + +K           
Sbjct: 61  GIIPSLAPASDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIKADISNAEEWDL 120

Query: 106 ELSKSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHN 165
           ++ K    H + DKSLY++G  +++      +LLLS +  G  N  R K +  H    H+
Sbjct: 121 QVVKDTAKHIV-DKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMLVHPLEKHD 179

Query: 166 FTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSG--LPTQFEGDE 223
            T+EA WP LF+D  G +WDVP S+  D++S+  +SG  Y   +H + G   P    G E
Sbjct: 180 LTVEAAWPDLFLDNKGRFWDVPESLNVDVSSLVPESGVRYRFGLHKSRGNPQPVNAAGVE 239

Query: 224 NSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQ-------PYDIFLSSPHVSAS 276
           + S  P SL+PG   K AV+YK N D+WR + ++    +       PYD+ L  PH + S
Sbjct: 240 SGSDAPTSLMPGLCAKAAVSYKVNRDLWRPQEKEGNTEEEDKPVFLPYDLRLKEPHAAIS 299

Query: 277 GIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQR 336
           GI+G+++ A I    +               L     +S  SAD+FGS  +T Q G F +
Sbjct: 300 GIVGSSLAAWITGRGM---------------LVNGKKRSPISADVFGSACYTFQKGRFSK 344

Query: 337 LFSDLTRFHARLDFPSGSKFLSGATRLAQDYFN-SQRPSLETVQAVCPKVFFSVQQQIAG 395
           L+ DLTR  AR+D PS       A  LA+  F+ S   S +T+ +  P++    QQQ+AG
Sbjct: 345 LYGDLTRVDARVDLPS-------AFALAKKLFHASSNNSDDTLWS--PRLNLIFQQQVAG 395

Query: 396 PFSFRVDSGVVIDLKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
           P  F+VDS   +     +    D ++++ Y+L++L S K VAWY+PK +E MIELR FE
Sbjct: 396 PIVFKVDSQFQVGAARME----DVIYSLNYSLRLLESGKIVAWYSPKRKEGMIELRVFE 450


>gi|15146246|gb|AAK83606.1| At2g44640/F16B22.13 [Arabidopsis thaliana]
 gi|19699152|gb|AAL90942.1| At2g44640/F16B22.13 [Arabidopsis thaliana]
          Length = 451

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 163/479 (34%), Positives = 250/479 (52%), Gaps = 54/479 (11%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  L  A++  FW+ + S+P++LEG+AR+VPG P PL  +R +R  R +Q+  ++     
Sbjct: 1   MANLNSAIDSVFWDQNVSSPQTLEGTARSVPGEPFPLDGARASRSHRIQQLSLLREGFPL 60

Query: 61  PFLPSFSAGT------LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQG--------- 105
             +PS +  +        L  +L    S +W+  L+GQF   K  + +K           
Sbjct: 61  GIIPSLAPASDKRLGSFSLNSLLLSPSSNNWWLGLVGQFKPKKLFADIKADISNAEEWDL 120

Query: 106 ELSKSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNHN 165
           ++ K    H + DKSLY++G  +++      +LLLS +  G  N  R K +  H    H+
Sbjct: 121 QVVKDTAKHIV-DKSLYSIGLWTQIALGTSSSLLLSTERLGDKNGLRNKLMLVHPLEKHD 179

Query: 166 FTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSG--LPTQFEGDE 223
            T+EA WP LF+D  G +WDVP S+  D++S+  +SG  Y   +H + G   P    G E
Sbjct: 180 LTVEAAWPDLFLDNKGRFWDVPESLNVDVSSLVPESGVRYRFGLHKSRGNPQPVNAAGVE 239

Query: 224 NSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKMVQ-------PYDIFLSSPHVSAS 276
           + S  P SL+PG   K AV+YK N D+WR + ++    +       PYD+ L  PH + S
Sbjct: 240 SGSDAPTSLMPGLCAKAAVSYKVNRDLWRPQEKEGNTEEEDKPVFLPYDLRLKEPHAAIS 299

Query: 277 GIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQR 336
           GI+G+++ A I    +               L     +S  SAD+FGS  +T Q G F +
Sbjct: 300 GIVGSSLAAWITGRGM---------------LVNGKKRSPISADVFGSACYTFQKGRFSK 344

Query: 337 LFSDLTRFHARLDFPSGSKFLSGATRLAQDYFN-SQRPSLETVQAVCPKVFFSVQQQIAG 395
           L+ DLTR  AR+D PS       A  LA+  F+ S   S +T+ +  P++    QQQ+AG
Sbjct: 345 LYGDLTRVDARVDLPS-------AFALAKKLFHASSNNSDDTMWS--PRLNLIFQQQVAG 395

Query: 396 PFSFRVDSGVVIDLKNRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMIELRFFE 454
           P  F+VDS   +     +    D ++++ Y+L++L S K VAWY+PK +E MIELR FE
Sbjct: 396 PIVFKVDSQFQVGAARME----DVIYSLNYSLRLLESGKIVAWYSPKRKEGMIELRVFE 450


>gi|357513035|ref|XP_003626806.1| hypothetical protein MTR_8g009290 [Medicago truncatula]
 gi|355520828|gb|AET01282.1| hypothetical protein MTR_8g009290 [Medicago truncatula]
          Length = 465

 Score =  251 bits (640), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 165/478 (34%), Positives = 243/478 (50%), Gaps = 38/478 (7%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M KLR A++  FW+L+ S+P++L+G A+ VPG+P PL  S  +RL R +Q+  +      
Sbjct: 1   MAKLRTAIDSSFWDLNISSPQNLDGWAKFVPGDPIPLDASVSSRLYRHQQLPHITPHFPF 60

Query: 61  PFLPSF------SAGTLQLQRVLPFQFSG-DWFATLLGQFNVLKFVSSLKQG-------- 105
             +PS         G+  LQ +L   F+G  W+    GQF   K +  +K          
Sbjct: 61  GIIPSSVPSPKKEHGSFSLQSLL-LDFTGPRWWLAATGQFRPRKMIVDIKNEICNAAEFN 119

Query: 106 -ELSKSRCLHALRDKSLYALGFCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKFPNH 164
              +KS   H + DKSL++ G  S+   +   ++L   + +G     RKK VF  +  +H
Sbjct: 120 LSTAKSVAKHFI-DKSLFSYGLNSQFALSPSTSVLFGLEGHGEKEKHRKKVVFFQELSDH 178

Query: 165 NFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDEN 224
           + T+EA WP LFVD  G YWDVP S++ DLAS+ +  G  Y   MH N+G P      + 
Sbjct: 179 DLTVEAAWPQLFVDHKGRYWDVPESISVDLASLVARRGLRYRFGMHKNNGSPQATNATD- 237

Query: 225 SSAVPASLLPGFSLKNAVAYKKNVDIWRS--KAQKLKMVQPYDIFLSSPHVSASGIIGAA 282
            S  P SLLPG   K +V Y K   +WR+    ++ K + P+D  L  PH + SGIIG++
Sbjct: 238 -SDTPLSLLPGLCAKASVTYDKIKYLWRNTETEEEYKDLFPHDQGLMVPHEAVSGIIGSS 296

Query: 283 MTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSDLT 342
             + I +       +N  D      +     +S F+A+IFGSV +T Q G F + F DLT
Sbjct: 297 CASWIWNG------KNLIDTGEDPAVSKRRKRSRFNAEIFGSVCYTFQQGRFSKKFGDLT 350

Query: 343 RFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPFSFRVD 402
           R  ARLD  S     SG  +   + F S    +    +  P++    QQQ+AGP  FR D
Sbjct: 351 RVDARLDISSA----SGLAKKILNGFKSSSADIIEQPSASPRLNLIFQQQVAGPVVFRAD 406

Query: 403 SGVVIDLKNRD--ICAHDSVFAIEYALQV----LGSAKAVAWYAPKHQEFMIELRFFE 454
           S + ID  +R   I   D + ++ + +      + S K VAWY+PK  E M+ELR +E
Sbjct: 407 SRISIDSLSRKHGISIEDFICSLNFYMDPNTSDMESGKIVAWYSPKRNEGMVELRLYE 464


>gi|302818490|ref|XP_002990918.1| hypothetical protein SELMODRAFT_161311 [Selaginella moellendorffii]
 gi|300141249|gb|EFJ07962.1| hypothetical protein SELMODRAFT_161311 [Selaginella moellendorffii]
          Length = 474

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 154/486 (31%), Positives = 235/486 (48%), Gaps = 45/486 (9%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  L  ++  +FW  +  + +SLE   R++P  P PL + RG+++SR +Q+  +QR +  
Sbjct: 1   MGPLGSSLASQFWNEEVESTRSLEACVRSLPREPPPLAMVRGSKISRSQQLLLLQRLLRL 60

Query: 61  PFLPSFSAGT-----LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELS------- 108
           PF PS +        + L RV+      +W+ATL G+  V +  +   + +L        
Sbjct: 61  PFAPSLAPDCDCPSGIVLDRVVASTGGTNWWATLTGRLRVQRLFAKRPEAKLELNGFSEA 120

Query: 109 --------KSRCLHALR--DKSLYALGFCSELLFTDDDTLLL-----SYDSYGHNNTS-R 152
                     +  HA R  DKSLYAL   ++LL     +L +     S + +G +    R
Sbjct: 121 AAEKQQQRDGKWTHAKRLLDKSLYALCMRTKLLLGPSTSLSIKAEMDSLEDWGSSEVPMR 180

Query: 153 KKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHN 212
             +   HK   H+ +LEA W   ++D+   YWDVP +++ D+ASV   SG  Y L +HH+
Sbjct: 181 VHSKLLHKLKKHDLSLEAAWHERYMDREARYWDVPQAVSLDIASVGLSSGLYYRLGVHHS 240

Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKL--KMVQPYDIFLSS 270
           SG      G      +P   +PG   + AV+Y K +DIW+        K+ +PY++  + 
Sbjct: 241 SGQSQNCCGGAGKE-IPWGAMPGTCAQAAVSYGKEIDIWKEDDAHFGRKLHKPYNLLAAR 299

Query: 271 PHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQ 330
           PH++ SG++G  + AS+G         N S       L  ++   S SAD FGS+  TAQ
Sbjct: 300 PHITLSGVVGGVVNASLGQRKQPLNAANPS-------LVKSNGDRSVSADAFGSLGVTAQ 352

Query: 331 HGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQ 390
            G FQR   D T+F AR+D    S  +S  T  A+    S  PSL   Q        ++Q
Sbjct: 353 FGLFQRFLLDHTKFAARVDIGGASSTVSARTLSAK----SSSPSLADGQNGI-TFALTMQ 407

Query: 391 QQIAGPFSFRVDSGVVIDLK--NRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMI 448
           QQ+ GP   RVDS    DL   N      +  + ++ +L+ LG+AK V WY P   E M 
Sbjct: 408 QQVLGPLRARVDSRFSPDLSTWNARPALQEVTYGLDCSLESLGAAKLVVWYCPTRSEGMA 467

Query: 449 ELRFFE 454
           E+R  E
Sbjct: 468 EIRLLE 473


>gi|302802025|ref|XP_002982768.1| hypothetical protein SELMODRAFT_117300 [Selaginella moellendorffii]
 gi|300149358|gb|EFJ16013.1| hypothetical protein SELMODRAFT_117300 [Selaginella moellendorffii]
          Length = 474

 Score =  211 bits (537), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 152/486 (31%), Positives = 230/486 (47%), Gaps = 45/486 (9%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  L  ++  +FW  +  + +SLE   R++P  P PL + RG+++SR +Q+  +QR +  
Sbjct: 1   MGPLGSSLASQFWNEEVESTRSLEACVRSLPREPPPLAMVRGSKISRSQQLLLLQRLLRL 60

Query: 61  PFLPSFSAGT-----LQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELS------- 108
           PF PS +        + L RV+      +W+ATL G+  V +  +     +L        
Sbjct: 61  PFAPSLAPDCDCPSGIVLDRVVASTGGTNWWATLTGRLRVQRLFAKRSGAKLELNGFSEA 120

Query: 109 ----------KSRCLHALRDKSLYALGFCSELLFTDDDTLLL-----SYDSYGHNNTS-R 152
                     K      L DKSLYAL   ++LL     +L +     S + +G +    R
Sbjct: 121 AAEKQQQRDGKWTYAKRLLDKSLYALCMRTKLLLGPSTSLSIKAEMDSLEDWGSSEVPMR 180

Query: 153 KKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHN 212
             +   HK   H+ +LEA W   F+D+   YWDVP +++ D+ASV   SG  Y L +HH+
Sbjct: 181 VHSKLLHKLKKHDLSLEAAWHERFMDREARYWDVPQAVSLDIASVGLSSGLYYRLGVHHS 240

Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKL--KMVQPYDIFLSS 270
           SG      G      +P   +PG   + AV+Y K +DIW+        K+ +PY++  + 
Sbjct: 241 SGQSQNCCGGAGKE-IPWGAMPGTCAQAAVSYGKEIDIWKEDDAHFGRKLHKPYNLLAAR 299

Query: 271 PHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQ 330
           PH++ SG++G  + AS+G         N S       L  ++   S SAD FGS+  TAQ
Sbjct: 300 PHITLSGVVGGVVNASLGQRKQPLNAANPS-------LVKSNGDRSVSADAFGSLGVTAQ 352

Query: 331 HGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQ 390
            G FQR   D T+F AR+D    S  +   T  A+    S  PSL   Q        ++Q
Sbjct: 353 FGLFQRFLLDHTKFAARVDIGGASSTVCARTLSAK----SSSPSLADGQNGI-TFALTMQ 407

Query: 391 QQIAGPFSFRVDSGVVIDLK--NRDICAHDSVFAIEYALQVLGSAKAVAWYAPKHQEFMI 448
           QQ+ GP   R DS    DL   N      +  + ++ +L+ LG+AK V WY P   E M 
Sbjct: 408 QQVLGPLRARADSRFSPDLSTWNARPALQEVTYGLDCSLESLGAAKLVVWYCPTRSEGMA 467

Query: 449 ELRFFE 454
           E+R  E
Sbjct: 468 EIRLLE 473


>gi|388493962|gb|AFK35047.1| unknown [Medicago truncatula]
          Length = 307

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 114/310 (36%), Positives = 161/310 (51%), Gaps = 20/310 (6%)

Query: 153 KKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHN 212
           KK  F  +  +H+ T+EA WP LFVD  G YWDVP S++ DLAS+ +  G  Y   MH N
Sbjct: 9   KKRCFFQELSDHDLTVEAAWPQLFVDHKGRYWDVPESISVDLASLVARRGLRYRFGMHKN 68

Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRS--KAQKLKMVQPYDIFLSS 270
           +G P      +  S  P SLLPG   K +V Y K   +WR+    ++ K + P+D  L  
Sbjct: 69  NGSPQATNATD--SDTPLSLLPGLCAKASVTYDKIKYLWRNTETEEEYKDLFPHDQGLMV 126

Query: 271 PHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQ 330
           PH + SGIIG++  + I +       +N  D      +     +S F+A+IFGSV +T Q
Sbjct: 127 PHEAVSGIIGSSCASWIWNG------KNLIDTGEDPAVSKRRKRSRFNAEIFGSVCYTFQ 180

Query: 331 HGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQ 390
            G F + F DLTR  ARLD  S     SG  +   + F S    +    +  P++    Q
Sbjct: 181 QGRFSKKFGDLTRADARLDISSA----SGLAKKILNGFKSSSADIIEQPSASPRLNLIFQ 236

Query: 391 QQIAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQV----LGSAKAVAWYAPKHQ 444
           QQ+AGP  FR DS + ID  +R   I   D + ++ + +      + S K VAWY+PK  
Sbjct: 237 QQVAGPVVFRADSRISIDSLSRKHGISIEDFICSLNFYMDPNTSDMESGKIVAWYSPKRN 296

Query: 445 EFMIELRFFE 454
           E M+ELR +E
Sbjct: 297 EGMVELRLYE 306


>gi|168006921|ref|XP_001756157.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692667|gb|EDQ79023.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 560

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 124/449 (27%), Positives = 198/449 (44%), Gaps = 57/449 (12%)

Query: 4   LRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATPFL 63
           ++  +  +FW    S+ +S++G +  +P  P PLGI  GTRLSR +Q+  +QRF+  P  
Sbjct: 1   MKPVLSSQFWNESVSSSRSIDGISHGLPAEPPPLGIVTGTRLSRSQQLLMLQRFLGIPCT 60

Query: 64  PSF-----SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKF----------------VSSL 102
           PS+     + G L L RVL      +W+ATL G+    +                  +  
Sbjct: 61  PSYVVDADNKGGLVLDRVLGSAGGENWWATLTGRVRPQRLFRRKCSTGESSSSSSSRTGS 120

Query: 103 KQGELSKSRCL----HALRDKSLYALGFCSELLFTDDDTL-------LLSYDSYGHNNTS 151
                  SR L    H   D SLYAL F S L      TL        LS  ++    ++
Sbjct: 121 DSAAQHTSRNLVESAHRFLDNSLYALCFRSRLQLGSRTTLSTTSELDSLSCQAFADQQST 180

Query: 152 --------RKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGP 203
                   R +    HK        EA +   +VD+   YWDVP +++ D AS+ +  G 
Sbjct: 181 GEQTRPPWRGRVSLRHKLDQQKIVAEAAYHERYVDRESRYWDVPHTVSFDFASLGAPGGF 240

Query: 204 SYHLTMHHNSGLPTQFEG---DENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKLKM 260
            Y L +HH++G P +  G   + N+   P   LPG  L+ A + +K++++W++  + +  
Sbjct: 241 RYRLGVHHSAGSPLECGGTRPEVNAGEPPVKALPGLRLQAAASVEKSLNLWKADLESIIS 300

Query: 261 VQPYDIFLSSPHVSASGIIGAAMTASI----GDNSVRSPVENDSDGSTGFHLHAADIKSS 316
            + Y +F + P +S SG++G  +++ +    G      P   D         ++  I   
Sbjct: 301 QKSYSLFDARPCISLSGVLGGLLSSRLDPLYGITQSLDPTGAD---------NSPPIGRR 351

Query: 317 FSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLE 376
            SAD+F SV   AQ G F+R F D T+   RLD  + S       +      N +   + 
Sbjct: 352 HSADLFASVGLNAQFGRFKRNFWDFTKMGIRLDIGAVSALKLHKMQDVSPSSNGESGDIG 411

Query: 377 TVQAVCPKVFFSVQQQIAGPFSFRVDSGV 405
             +   P +   +QQQ+ GP   RV S V
Sbjct: 412 DERGH-PTLALELQQQVIGPLRARVHSRV 439


>gi|147785063|emb|CAN64366.1| hypothetical protein VITISV_031303 [Vitis vinifera]
          Length = 701

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/339 (32%), Positives = 167/339 (49%), Gaps = 62/339 (18%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M  LR AM+  FW+LD S+P++L G+ARAVPG+P PL  +R +R  R +Q+ F+      
Sbjct: 1   MANLRTAMDAAFWDLDISSPQTLHGAARAVPGDPFPLEGARASRALRVQQLSFLGNGFPL 60

Query: 61  PFLPSFSA------GTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLK-------QGEL 107
             +PSFS       G+  LQ +     + +W+  L GQF   K +SS+K       + EL
Sbjct: 61  GIIPSFSPTSQKDLGSFSLQSLFLRPSTSNWWLGLTGQFRPKKLISSIKADLSAVDEWEL 120

Query: 108 S--KSRCLHALRDKSLYALGFCSELLFTDDDTLLLS-YDSYG------------------ 146
           S  K    H + DKSL++ G CS+L  T   +L++S +++ G                  
Sbjct: 121 STFKEVAKHFI-DKSLFSFGLCSQLSLTSASSLMISIWETLGFVLKWLHYILPSNKILEL 179

Query: 147 ------------HN---NTSRKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMA 191
                       H      + K     +  P H+ TLEA WP LF+D  G YW++P S++
Sbjct: 180 ATKWQIKCTFGQHGCVFEANAKWVALLNLLPFHDITLEAAWPELFIDHKGRYWELPESIS 239

Query: 192 ADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIW 251
             L+S+ S+SG  Y   +H N G P       + +  P++L+PG   K A +Y+K+ D+W
Sbjct: 240 LGLSSLVSESGLRYRFGIHKNGGHPQSVNAINDEA--PSALMPGLCAKAAFSYEKSRDLW 297

Query: 252 RSKAQK----------LKMVQPYDIFLSSPHVSASGIIG 280
           R + ++          L     YDI L  PH + SGIIG
Sbjct: 298 RQREKQEDGIVKTERGLVWRPSYDIRLREPHAAISGIIG 336



 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 57/146 (39%), Positives = 80/146 (54%), Gaps = 7/146 (4%)

Query: 311 ADIKSSFSADIFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNS 370
           A  +S F AD+F S   T QHG F++ + DLTR  ARL+  S S      + L     N 
Sbjct: 560 AKKRSPFGADLFASGCCTFQHGQFRKRYGDLTRVDARLNICSASALAKRVSNLFSSSVNG 619

Query: 371 QRPSLETVQAVCPKVFFSVQQQIAGPFSFRVDSGVVIDLKNRDICAH--DSVFAIEYALQ 428
            +  L +     P++    QQQ+AGP  FRVDS +++D           D  +++ Y+L+
Sbjct: 620 AKDPLSS-----PRLNLIFQQQVAGPIVFRVDSKLLLDSSGGRAGPQLEDFTYSLNYSLR 674

Query: 429 VLGSAKAVAWYAPKHQEFMIELRFFE 454
           +L S K VAWY+PK +E MIELR FE
Sbjct: 675 LLRSGKVVAWYSPKRKEGMIELRLFE 700


>gi|217070234|gb|ACJ83477.1| unknown [Medicago truncatula]
          Length = 183

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 66/190 (34%), Positives = 96/190 (50%), Gaps = 16/190 (8%)

Query: 271 PHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQ 330
           PH + SGIIG++  + I +       +N  D      +     +S F+A+IFGSV +T Q
Sbjct: 3   PHEAVSGIIGSSCASWIWNG------KNLIDTGEDPAVSKRRKRSRFNAEIFGSVCYTFQ 56

Query: 331 HGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQ 390
            G F + F DLTR  ARLD  S     SG  +   + F S    +    +  P++    Q
Sbjct: 57  QGRFSKKFGDLTRADARLDISSA----SGLAKKILNGFKSSSADIIEQPSASPRLNLIFQ 112

Query: 391 QQIAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQV----LGSAKAVAWYAPKHQ 444
           QQ+AGP  FR DS + ID  +R   I   D + ++ + +      + S K VAWY+PK  
Sbjct: 113 QQVAGPVVFRADSRISIDSLSRKHGISIEDFICSLNFYMDPNTSDMESGKIVAWYSPKRN 172

Query: 445 EFMIELRFFE 454
           E M+ELR +E
Sbjct: 173 EGMVELRLYE 182


>gi|145348672|ref|XP_001418769.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578999|gb|ABO97062.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 495

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 120/516 (23%), Positives = 205/516 (39%), Gaps = 97/516 (18%)

Query: 2   KKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATP 61
           K ++ AM+G F + +     ++EG+ RA   + +   +++ + L+R +Q+   Q+F+  P
Sbjct: 13  KPMKRAMDGAFGDREHGAAANVEGTTRAHASDGSSFVMTQPSGLARSQQVFLAQKFLNLP 72

Query: 62  FLPSF--SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRDK 119
            +PS+  S G + L RV+     G+W+ TL  +    +   +    ++ + +   AL D 
Sbjct: 73  AMPSYVESEGGMVLDRVVLNAGRGNWWTTLTARARAQR---AWNTEQVQEGKAHRALLDP 129

Query: 120 SLYALG--FCSELLFTDDDTLLLSYDSYGHNNTSRKKAVFCHKF---------------P 162
           S+YALG  F +  L   ++  + +    G      +   F  KF               P
Sbjct: 130 SMYALGGRFRANAL---ENVAVKAIGELGSVKGVTR--AFAKKFKLSDEKDDDAEAALNP 184

Query: 163 NHNFTLEALWPG--LFVDKCGN--------YWDVPFSMAADLASVASDSGPSYHLTMHHN 212
               +++A  P   L VD   N        Y D P S +  ++S    +  +Y ++    
Sbjct: 185 RGRVSVQAKLPSHLLSVDLSHNERYQTDDRYVDGPTSASVSVSSRGRRA-INYRVSARKW 243

Query: 213 SGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQKL----KMVQPYDIFL 268
            G    F+             P   L+  V+Y++   +WR + ++       V  Y    
Sbjct: 244 LGDLVPFDRKHGGGM---RTPPRKELQAGVSYEQQAVLWRGRRRRKPKSSSAVSGYTALP 300

Query: 269 SSPHVSASGIIGAAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADI--FGSVA 326
             P ++  GI GA +  S+ D                      D K     ++  FGS++
Sbjct: 301 QVPTITVGGIYGAVVRKSLDD----------------------DYKDGEQTELQNFGSLS 338

Query: 327 FTAQHGNFQRLFSDLTRFHARLD---FPSGSKFLSGATRLAQDYFNSQRPSLETVQAVCP 383
             AQ G+F R   D T  + RLD     + +K L  A +      +        ++A   
Sbjct: 339 VNAQVGSFARPLLDFTSINLRLDAGGMGTPNKPLDAAAQTPLPLADRLITMGGRLKANPV 398

Query: 384 KVFFSVQQQIAGPFSFRVD---SGV---------VIDLKNR------------DICAHDS 419
            V  S+ QQ+ GP  FR +   SG          +  LK R             I   + 
Sbjct: 399 SVTLSLGQQLLGPLRFRAEVRASGAEALSATRAGLSALKERKTMTEVRESIQKQIVKPEV 458

Query: 420 VFAIEYALQ-VLGSAKAVAWYAPKHQEFMIELRFFE 454
           V+ ++ AL   +GSA+ VAWY    QE M ELR F+
Sbjct: 459 VYGLDCALPPTIGSARVVAWYNATRQEAMAELRLFD 494


>gi|410810516|emb|CCI61380.1| hypothetical protein (GL-M16 marker), partial [Lens orientalis]
          Length = 55

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 38/55 (69%), Positives = 43/55 (78%), Gaps = 2/55 (3%)

Query: 393 IAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQE 445
           I GP SFRVDSG+ +DLKN +  I AH+ VFA+EYAL VLGSAKAVAW  PK QE
Sbjct: 1   IVGPISFRVDSGIAVDLKNLEFPIQAHEPVFAVEYALHVLGSAKAVAWCCPKPQE 55


>gi|410810518|emb|CCI61381.1| hypothetical protein (GL-M16 marker), partial [Lens culinaris
           subsp. culinaris]
          Length = 55

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 37/55 (67%), Positives = 43/55 (78%), Gaps = 2/55 (3%)

Query: 393 IAGPFSFRVDSGVVIDLKNRD--ICAHDSVFAIEYALQVLGSAKAVAWYAPKHQE 445
           I GP SFRVDSG+ +DLKN +  + AH+ VFA+EYAL VLGSAKAVAW  PK QE
Sbjct: 1   IVGPISFRVDSGIAVDLKNLEFPLQAHEPVFAVEYALHVLGSAKAVAWCCPKPQE 55


>gi|308806363|ref|XP_003080493.1| COG1452: Organic solvent tolerance protein OstA (ISS) [Ostreococcus
           tauri]
 gi|116058953|emb|CAL54660.1| COG1452: Organic solvent tolerance protein OstA (ISS) [Ostreococcus
           tauri]
          Length = 525

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 110/502 (21%), Positives = 186/502 (37%), Gaps = 96/502 (19%)

Query: 22  SLEGSARAVPG---NPNPLG---ISRGTRLSRPKQIDFMQRFMATPFLPSFS--AGTLQL 73
           +++GSARA+ G   + + +G   +   + LSR +Q+   Q+    P +PSF+   G + L
Sbjct: 50  AIDGSARAMGGMETSGDGVGHFSMRVPSGLSRSQQVFLAQKLFNAPAVPSFARDEGGMVL 109

Query: 74  QRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRDKSLYALG-------- 125
            RV+    SG W+ T+  +    +   +    ++ + +   AL D S+YALG        
Sbjct: 110 DRVIANVGSGRWWTTVTARVRAQR---AWNTEQVQEGKAHRALLDASMYALGGRFRMNLL 166

Query: 126 ------FCSELLFTDDDTLLLSYDSYGHNNT---------------SRKKAVFCHKFPNH 164
                    EL      T  ++      N T               +R + +   K P H
Sbjct: 167 ENVALKAVGELGSARAITRAITKKFVKKNETKSDAIEVNAGEAREDARARVLLQAKIPRH 226

Query: 165 NFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDEN 224
               E      F  K  ++ D P S +  LAS         +  +     L T+ +  E+
Sbjct: 227 VINAELSHNERFETKK-SHVDGPTSASVSLASRGRRM---INYRVMARKWLGTELKPFES 282

Query: 225 SSAVPASLLPGFSLKNAVAYKKNVDIW----RSKAQKLKMVQPYDIFLSSPHVSASGIIG 280
           +        P   +   V+ ++ V +W    R K      V  Y      P ++  GI G
Sbjct: 283 APDGEMQKPPRNEVLGGVSVEQQVILWHGRRRRKRASTSAVNGYTALPQVPTIAVGGIFG 342

Query: 281 AAMTASIGDNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGSVAFTAQHGNFQRLFSD 340
                S+           D D + G  +   +         F S+A   Q G+F R   D
Sbjct: 343 TVARKSL-----------DEDYTDGDEVRLQN---------FASIALHGQIGSFSRPLFD 382

Query: 341 LTRFHARLDFPSGS---KFLSGATRLAQDYFNSQRPSLETVQAVCPKVFFSVQQQIAGPF 397
            T  + RLD  S     K L  + +      +    +   ++A    V  S  QQ+ GP 
Sbjct: 383 FTSVNLRLDAGSVGNPYKPLDASEQKPMSLGDRLITADGRLKASPTSVTISFAQQLLGPL 442

Query: 398 SFRVD---SGV---------------------VIDLKNRDICAHDSVFAIEYALQ-VLGS 432
            FR +   SG                      V D+  +++ + + ++ ++  L   LGS
Sbjct: 443 RFRTEVRTSGAEALAATCAGLSAIKQRKPMAQVRDVFKKEVASPEVLYGLDCPLPPTLGS 502

Query: 433 AKAVAWYAPKHQEFMIELRFFE 454
           A+AV WY    Q+   E+R F+
Sbjct: 503 ARAVVWYNATRQDAFAEIRLFD 524


>gi|217071854|gb|ACJ84287.1| unknown [Medicago truncatula]
          Length = 133

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 59/111 (53%), Gaps = 8/111 (7%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M KLR A++  FW+L+ S+P++L+G A+ VPG+P PL  S  +RL R +Q+  +      
Sbjct: 1   MAKLRTAIDSSFWDLNISSPQNLDGWAKFVPGDPIPLDASVSSRLYRHQQLPHITPHFPF 60

Query: 61  PFLPSF------SAGTLQLQRVLPFQFSGD-WFATLLGQFNVLKFVSSLKQ 104
             +PS         G+  LQ +L   F+G  W+    GQF   K +  +K 
Sbjct: 61  GIIPSSVPSPKKEHGSFSLQSLL-LDFTGPRWWLAATGQFRPRKMIVDIKN 110


>gi|307106828|gb|EFN55073.1| hypothetical protein CHLNCDRAFT_134955 [Chlorella variabilis]
          Length = 847

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 41/159 (25%), Positives = 64/159 (40%), Gaps = 34/159 (21%)

Query: 321 IFGSVAFTAQHGNFQRLFSDLTRFHARLDFPSGSKFLSGATRLAQDYFNSQRPSLETVQA 380
           +F S A + Q G F R + D T   AR+D    S  + G +        S+RP      A
Sbjct: 697 VFASAAVSGQLGRFTRPWLDFTAASARVDMALTSPHVVGDS--------SRRPDKHRAFA 748

Query: 381 VCPK-----VFFSVQQQIAGPFSFRVDSGVVID----------------LKNRDICAHDS 419
           +  +     +  S+ QQ+ GP   RVD+   +D                +    +    S
Sbjct: 749 LEGRGAWHALSLSLAQQVYGPVRARVDARFALDPTSVPASEGERSTLKGVLQTALSVRPS 808

Query: 420 VFAIEYALQVL-----GSAKAVAWYAPKHQEFMIELRFF 453
           +    Y    +     G+A+   W++PK +E M ELRFF
Sbjct: 809 LLETVYGCDCVLPSTEGAARLAIWFSPKRRECMAELRFF 847



 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 66/141 (46%), Gaps = 8/141 (5%)

Query: 2   KKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMATP 61
           + ++ ++  +FWE    + KSL+G   ++P  P  L ++  + LSR +Q+    + +  P
Sbjct: 12  QDVKASIASQFWEQPVHSTKSLDGQLWSLPCEPPALNLTGSSGLSRTQQL-LAAQAVRVP 70

Query: 62  FLPSF--SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRDK 119
             PS+  S+  L L RVL       W+  L G+    +  S     ++ K   + A+RD 
Sbjct: 71  LFPSYDGSSDGLVLDRVLAKAGGRRWWTALSGRIKAQRAYSLAASEQVGK---VDAVRDL 127

Query: 120 SLYALGFCSELLFTDDDTLLL 140
             YAL  CS   F     L+L
Sbjct: 128 GNYAL--CSRSRFQLGRRLVL 146


>gi|255071821|ref|XP_002499585.1| predicted protein [Micromonas sp. RCC299]
 gi|226514847|gb|ACO60843.1| predicted protein [Micromonas sp. RCC299]
          Length = 520

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 59/127 (46%), Gaps = 5/127 (3%)

Query: 1   MKKLRWAMEGEFWELDQSTPKSLEGSARAVPGNPNPLGISRGTRLSRPKQIDFMQRFMAT 60
           M+  + A    FW+ +  +  +L+G  R    +  P  ++  + L+R +Q+  MQRF+  
Sbjct: 9   MEPTKAAFGKAFWDQEAESAAALDGHVRVHQSHAPPFSLAMPSGLARSQQLYMMQRFLNI 68

Query: 61  PFLPSF--SAGTLQLQRVLPFQFSGDWFATLLGQFNVLKFVSSLKQGELSKSRCLHALRD 118
           P  PS+  +   + L RV+       W+ TL  +    +  +S      ++ R    L D
Sbjct: 69  PATPSYVKAEDGMVLDRVIANVGGRTWWHTLTLRLRAQRAWNSQS---FNEGRPHRILGD 125

Query: 119 KSLYALG 125
            SLYALG
Sbjct: 126 PSLYALG 132


>gi|412991412|emb|CCO16257.1| predicted protein [Bathycoccus prasinos]
          Length = 714

 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 49/89 (55%), Gaps = 5/89 (5%)

Query: 4   LRWAMEGEFWELDQSTPKSLEGSARAVPGNPN-PLGISRGTRLSRPKQIDFMQRFMATPF 62
           L+ ++   F+  D S+  ++EG+ R  P   + P      + L+R +Q+  +QRF+  P 
Sbjct: 82  LKTSLHKSFFSQDVSSSTTIEGTVRQHPSQTSTPFSFQMPSGLARSQQVHMLQRFLNLPA 141

Query: 63  LPSF--SAGTLQLQRVLPFQFSG-DWFAT 88
           LPS+  + G   L RV+  QF G +W++T
Sbjct: 142 LPSYIKNEGGFVLDRVVT-QFGGNEWWST 169


>gi|384253755|gb|EIE27229.1| hypothetical protein COCSUDRAFT_38956 [Coccomyxa subellipsoidea
            C-169]
          Length = 1676

 Score = 39.7 bits (91), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 26/106 (24%), Positives = 44/106 (41%), Gaps = 6/106 (5%)

Query: 152  RKKAVFCHKFPNHNFTLEALWPGLFVDKCGNYWDVPFSMAADLASVASDSGPSYHLTMHH 211
            R +    H+   H+      W   ++D    Y  VP S + D+AS     G  Y + +HH
Sbjct: 1062 RGRTWLKHQVAGHDMLASLAWNEEYLDSAAAYCRVPLSSSLDVASQDRQDGIQYRVGVHH 1121

Query: 212  NSGLPTQFEGDENSSAVPASLLPGFSLKNAVAYKKNVDIWRSKAQK 257
             +    Q E D++     +SL      + AVA +    +W+   Q+
Sbjct: 1122 VAT--PQTESDDSGGPWRSSL----HFQGAVAVEGEALVWKPAVQQ 1161


>gi|424877962|ref|ZP_18301602.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium leguminosarum bv. trifolii WU95]
 gi|392520454|gb|EIW45183.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium leguminosarum bv. trifolii WU95]
          Length = 560

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 37/162 (22%), Positives = 66/162 (40%), Gaps = 25/162 (15%)

Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV-PASLLPGFSLKNA----- 241
           FS+A D AS+   +G  Y +   H +GLP +FE   N  A        GF  + A     
Sbjct: 307 FSLAMDRASMVDIAGFGYPVVNEHATGLPPRFETWRNKGAEGDKDAFMGFDTEKANKILD 366

Query: 242 -VAYKKNVDIWRSKAQKLKMVQPYDI------FLSSPHVSASGIIGAAMTASIG------ 288
              YKK  D +R+      +  P  +      ++ +  ++  G+  A + AS+       
Sbjct: 367 DAGYKKGADGFRTTPSGKPIAFPIIVPNGWTDWIDAVQIAVEGLRAAGINASVATPEYEQ 426

Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGS 324
                 D S    + + +DG+T F  +   + +++   I G+
Sbjct: 427 WRKQIIDGSFEVVMNSRADGATPFRGYYQSLSTTYGGRITGA 468


>gi|424891530|ref|ZP_18315113.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium leguminosarum bv. trifolii WSM2297]
 gi|393185525|gb|EJC85561.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium leguminosarum bv. trifolii WSM2297]
          Length = 563

 Score = 39.3 bits (90), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 37/162 (22%), Positives = 66/162 (40%), Gaps = 25/162 (15%)

Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV-PASLLPGFSLKNA----- 241
           FS+A D  S+   +G  Y +   H +GLP +FE   N SA        GF  + A     
Sbjct: 310 FSLAMDRKSMVDIAGFGYPVVNEHATGLPPRFETWRNKSAEGDKDAFMGFDTEKANKVLD 369

Query: 242 -VAYKKNVDIWRSKAQKLKMVQPYDI------FLSSPHVSASGIIGAAMTASIG------ 288
              YKK  D +R+      +  P  +      ++ +  ++  G+  A + AS+       
Sbjct: 370 DAGYKKGADGFRTTPSGKPIAFPIIVPNGWTDWIDAVQIAVEGLRAAGINASVATPEYEQ 429

Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGS 324
                 D S    + + +DG+T F  +   + +++   I G+
Sbjct: 430 WRKQIIDGSFEVVMNSRADGATPFRGYYQSLSTAYGGRITGA 471


>gi|405381537|ref|ZP_11035364.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium sp. CF142]
 gi|397322033|gb|EJJ26444.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium sp. CF142]
          Length = 564

 Score = 39.3 bits (90), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 38/162 (23%), Positives = 68/162 (41%), Gaps = 25/162 (15%)

Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAV-PASLLPGFSLKNA----- 241
           FS+A D  S+   +G  Y +   H SGLP +FEG  N +A        G     A     
Sbjct: 311 FSLAMDRKSMVDIAGFGYPVVNEHASGLPPRFEGWRNKAAEGDKDAFMGLDTDKANKILD 370

Query: 242 -VAYKKNVDIWRS----KAQKLKMVQP--YDIFLSSPHVSASGIIGAAMTASIG------ 288
              YKK  D +R+    K     ++ P  +  ++ +  ++  G+  A + AS+       
Sbjct: 371 DAGYKKGSDGFRTTPSGKPIAFPIIVPNGWTDWIDAVQIAVEGLRAAGINASVATPEYEQ 430

Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGS 324
                 D S    + + +DG+T F  +  ++ +++   I G+
Sbjct: 431 WRKQILDGSFEVVMNSRADGATPFRGYYQNLSTAYGGRITGA 472


>gi|399040132|ref|ZP_10735586.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium sp. CF122]
 gi|398062017|gb|EJL53803.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium sp. CF122]
          Length = 565

 Score = 38.5 bits (88), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 37/159 (23%), Positives = 66/159 (41%), Gaps = 25/159 (15%)

Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPAS-------LLPGFSLKN 240
           FS+A D AS+   +G  Y +   H SGLP +FE   N +A  A+       +     + +
Sbjct: 312 FSLAMDRASMVDVAGFGYPVASQHASGLPPRFEAWRNKAAEGAADEWMAYDVDKANKVLD 371

Query: 241 AVAYKKNVDIWRSK------AQKLKMVQPYDIFLSSPHVSASGIIGAAMTASIG------ 288
              YKK  D +R+       A  + +   +  ++    ++A G+  A    S+       
Sbjct: 372 DAGYKKGADNFRTTPGGQPIAFTIIIPNGWTDWIDDAQIAAEGLRAAGFNVSVATPEYEQ 431

Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADI 321
                 D S    + + +DGST F  +   + ++F+  I
Sbjct: 432 WQKQLLDGSFDVALNSRADGSTPFRGYYQSMSTAFAGRI 470


>gi|424919956|ref|ZP_18343319.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium leguminosarum bv. trifolii WSM597]
 gi|392848971|gb|EJB01493.1| ABC-type dipeptide transport system, periplasmic component
           [Rhizobium leguminosarum bv. trifolii WSM597]
          Length = 557

 Score = 38.1 bits (87), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 36/162 (22%), Positives = 65/162 (40%), Gaps = 25/162 (15%)

Query: 188 FSMAADLASVASDSGPSYHLTMHHNSGLPTQFEGDENSSAVPA-SLLPGFSLKNA----- 241
           FS+A D  S+   +G  Y +   H +GLP +FE   N  A        GF  + A     
Sbjct: 304 FSLAMDRKSMVDIAGFGYPVVNEHATGLPPRFESWRNKDAEGGKDAFMGFDTEKASKILD 363

Query: 242 -VAYKKNVDIWRSKAQKLKMVQPYDI------FLSSPHVSASGIIGAAMTASIG------ 288
              YKK  D +R+      +  P  +      ++ +  ++  G+  A + AS+       
Sbjct: 364 DAGYKKGADGFRTTPSGKPIAFPIIVPNGWTDWIDAVQIAVEGLRAAGINASVATPEYEQ 423

Query: 289 ------DNSVRSPVENDSDGSTGFHLHAADIKSSFSADIFGS 324
                 D S    + + +DG+T F  +   + +++   I G+
Sbjct: 424 WRKEIIDGSFEVVMNSRADGATPFRGYYQSLSTAYGGRITGA 465


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.134    0.408 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,110,838,465
Number of Sequences: 23463169
Number of extensions: 288922477
Number of successful extensions: 631045
Number of sequences better than 100.0: 58
Number of HSP's better than 100.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 630768
Number of HSP's gapped (non-prelim): 69
length of query: 455
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 309
effective length of database: 8,933,572,693
effective search space: 2760473962137
effective search space used: 2760473962137
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)