BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 047389
         (89 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|296082551|emb|CBI21556.3| unnamed protein product [Vitis vinifera]
          Length = 135

 Score = 90.5 bits (223), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 58/86 (67%), Gaps = 6/86 (6%)

Query: 3  LKHVFFILTFTCLIMANIANATTRNDRLNNSMKPG-YDFATRLDVRGGLTECWNTLMKLK 61
          +K+VF ++  + LI     NA+   + L  SMKP   D   RL+  GGL ECWN L++LK
Sbjct: 4  IKNVFLLVALSWLI----TNASASRE-LPMSMKPAELDLVARLETSGGLVECWNALVELK 58

Query: 62 SCSNEIVIFFLNSQADIGPDCCRVID 87
          SC+NEI++FFLN QADIGPDCC  ID
Sbjct: 59 SCTNEIILFFLNGQADIGPDCCGAID 84


>gi|359482498|ref|XP_002274398.2| PREDICTED: uncharacterized protein LOC100260034 [Vitis vinifera]
 gi|147772490|emb|CAN65099.1| hypothetical protein VITISV_039724 [Vitis vinifera]
 gi|297743029|emb|CBI35896.3| unnamed protein product [Vitis vinifera]
          Length = 128

 Score = 88.2 bits (217), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 41/87 (47%), Positives = 55/87 (63%), Gaps = 7/87 (8%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MALK +F ++  T L+    ANA    D   N   PG D  TRL+  GGL ECWN LM++
Sbjct: 1  MALKTLFLLVALTWLM----ANAAATRDLPTN---PGLDLTTRLETSGGLVECWNALMEI 53

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          + C+NEI++FFLN Q  +GP+CC+ I 
Sbjct: 54 RQCTNEIILFFLNGQTVLGPECCQAIS 80


>gi|224060465|ref|XP_002300213.1| predicted protein [Populus trichocarpa]
 gi|222847471|gb|EEE85018.1| predicted protein [Populus trichocarpa]
          Length = 146

 Score = 78.2 bits (191), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 32/53 (60%), Positives = 42/53 (79%), Gaps = 1/53 (1%)

Query: 35 KPGYD-FATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          KPG++  + RL+  G L ECWN L+++KSC+NEIV+FF+  QADIGPDCCR I
Sbjct: 32 KPGFNSLSARLEDEGSLVECWNALVEIKSCTNEIVLFFMTGQADIGPDCCRAI 84


>gi|116830473|gb|ABK28194.1| unknown [Arabidopsis thaliana]
          Length = 126

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 7/87 (8%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA    F   T   L++ NI+  T          +   + A RL+  GGL ECWN L +L
Sbjct: 1  MASNTSFLFATIAILLVLNISGRTLP------ETEDSTNIAARLN-GGGLMECWNALYEL 53

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G DCC+ ++
Sbjct: 54 KSCTNEIVLFFLNGETKLGVDCCQAVE 80


>gi|15227092|ref|NP_179766.1| uncharacterized protein [Arabidopsis thaliana]
 gi|75206217|sp|Q9SJ24.1|EC12_ARATH RecName: Full=Egg cell-secreted protein 1.2; Flags: Precursor
 gi|4417269|gb|AAD20394.1| hypothetical protein [Arabidopsis thaliana]
 gi|91805455|gb|ABE65456.1| hypothetical protein At2g21740 [Arabidopsis thaliana]
 gi|330252124|gb|AEC07218.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 125

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/87 (40%), Positives = 49/87 (56%), Gaps = 7/87 (8%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA    F   T   L++ NI+  T          +   + A RL+  GGL ECWN L +L
Sbjct: 1  MASNTSFLFATIAILLVLNISGRTLP------ETEDSTNIAARLN-GGGLMECWNALYEL 53

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G DCC+ ++
Sbjct: 54 KSCTNEIVLFFLNGETKLGVDCCQAVE 80


>gi|15235039|ref|NP_195644.1| uncharacterized protein [Arabidopsis thaliana]
 gi|75213705|sp|Q9T039.1|EC14_ARATH RecName: Full=Egg cell-secreted protein 1.4; Flags: Precursor
 gi|4914446|emb|CAB43649.1| hypothetical protein [Arabidopsis thaliana]
 gi|7270918|emb|CAB80597.1| hypothetical protein [Arabidopsis thaliana]
 gi|91805627|gb|ABE65542.1| hypothetical protein At4g39340 [Arabidopsis thaliana]
 gi|332661658|gb|AEE87058.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 127

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA    F   T T LI+  + N T     L    +   + A RL   GGL ECWN L +L
Sbjct: 1  MASNTTFLFSTVTLLII--LLNTTVSGRDL--PAESSTNIAARLQ-SGGLMECWNALYEL 55

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G  CC  +D
Sbjct: 56 KSCTNEIVLFFLNGETKLGVSCCESVD 82


>gi|116830619|gb|ABK28267.1| unknown [Arabidopsis thaliana]
          Length = 128

 Score = 64.3 bits (155), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA    F   T T LI+  + N T     L    +   + A RL   GGL ECWN L +L
Sbjct: 1  MASNTTFLFSTVTLLII--LLNTTVSGRDL--PAESSTNIAARLQ-SGGLMECWNALYEL 55

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G  CC  +D
Sbjct: 56 KSCTNEIVLFFLNGETKLGVSCCESVD 82


>gi|297825029|ref|XP_002880397.1| hypothetical protein ARALYDRAFT_900604 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297326236|gb|EFH56656.1| hypothetical protein ARALYDRAFT_900604 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 125

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 7/87 (8%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA    F +     L++ NI+  T              + A RL+  GGL ECWN L +L
Sbjct: 1  MASYTSFLVAIVALLLVLNISGRTLPET------ADSTNIAARLN-GGGLMECWNALYEL 53

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G DCC+ ++
Sbjct: 54 KSCTNEIVLFFLNGETKLGVDCCQAVE 80


>gi|297802032|ref|XP_002868900.1| hypothetical protein ARALYDRAFT_490711 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297314736|gb|EFH45159.1| hypothetical protein ARALYDRAFT_490711 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 127

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/87 (41%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          M+    F   T T LI+    N T     L    +   + A RL   GGL ECWN L +L
Sbjct: 1  MSSHTTFLFSTVTLLIL--FLNTTVSGRDL--PAESSTNIAARLQ-SGGLMECWNVLYEL 55

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G  CC  +D
Sbjct: 56 KSCTNEIVLFFLNGETKLGVSCCEAVD 82


>gi|255578398|ref|XP_002530064.1| conserved hypothetical protein [Ricinus communis]
 gi|223530417|gb|EEF32304.1| conserved hypothetical protein [Ricinus communis]
          Length = 135

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/64 (48%), Positives = 37/64 (57%), Gaps = 10/64 (15%)

Query: 34 MKPGYD--------FATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQAD--IGPDCC 83
          +K  YD          T     GGL +CWN LM++KSCSNEI++FFLN Q D  IG DCC
Sbjct: 27 IKQAYDNTETRVLLLETSSSSGGGLVDCWNALMEIKSCSNEIILFFLNGQTDITIGADCC 86

Query: 84 RVID 87
            I 
Sbjct: 87 SAIS 90


>gi|255578404|ref|XP_002530067.1| conserved hypothetical protein [Ricinus communis]
 gi|223530420|gb|EEF32307.1| conserved hypothetical protein [Ricinus communis]
          Length = 136

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 48/88 (54%), Gaps = 9/88 (10%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLD--VRGGLTECWNTLM 58
          MA K +  +L  T +I    A+AT   D        G     R++      L +CWN L+
Sbjct: 1  MAFKIMTLLLGLTLVI----ASATAARDV---PFISGNSLEARIEGSSSSSLVDCWNALI 53

Query: 59 KLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          ++KSCSNEI++FFLN   DIG DCCR I
Sbjct: 54 EIKSCSNEIILFFLNGHTDIGADCCRSI 81


>gi|116830475|gb|ABK28195.1| unknown [Arabidopsis thaliana]
          Length = 126

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 7/87 (8%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA    F  +T T L++ N+++      R    +    + A RL   GGL +CW+ L +L
Sbjct: 1  MASNTSFLFVTVTLLLVLNVSS------RALPPVADSTNIAARL-TGGGLMQCWDALYEL 53

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G  CC  +D
Sbjct: 54 KSCTNEIVLFFLNGETKLGYGCCNAVD 80


>gi|15227093|ref|NP_179767.1| uncharacterized protein [Arabidopsis thaliana]
 gi|75206216|sp|Q9SJ23.1|EC13_ARATH RecName: Full=Egg cell-secreted protein 1.3; Flags: Precursor
 gi|4417270|gb|AAD20395.1| hypothetical protein [Arabidopsis thaliana]
 gi|91805457|gb|ABE65457.1| hypothetical protein At2g21750 [Arabidopsis thaliana]
 gi|330252125|gb|AEC07219.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 125

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 7/87 (8%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA    F  +T T L++ N+++      R    +    + A RL   GGL +CW+ L +L
Sbjct: 1  MASNTSFLFVTVTLLLVLNVSS------RALPPVADSTNIAARL-TGGGLMQCWDALYEL 53

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G  CC  +D
Sbjct: 54 KSCTNEIVLFFLNGETKLGYGCCNAVD 80


>gi|297825031|ref|XP_002880398.1| hypothetical protein ARALYDRAFT_900605 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297326237|gb|EFH56657.1| hypothetical protein ARALYDRAFT_900605 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 125

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 47/87 (54%), Gaps = 7/87 (8%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA    F  +T T L+  N++       R    +    + A RL   GGL +CW+ L +L
Sbjct: 1  MASNTSFLFVTVTLLLALNVSG------RTLPVVADSTNIAARL-TGGGLMQCWDALYEL 53

Query: 61 KSCSNEIVIFFLNSQADIGPDCCRVID 87
          KSC+NEIV+FFLN +  +G  CC  +D
Sbjct: 54 KSCTNEIVLFFLNGETKLGSGCCNAVD 80


>gi|357120013|ref|XP_003561726.1| PREDICTED: uncharacterized protein LOC100836522 [Brachypodium
          distachyon]
          Length = 166

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 25/50 (50%), Positives = 35/50 (70%), Gaps = 1/50 (2%)

Query: 37 GYDFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          G   A RL +  G  +CW +LM++KSC+ EI++FFLN +A +GP CCR I
Sbjct: 44 GPGLAERL-IGEGPQQCWESLMEIKSCTGEIILFFLNGEAYLGPGCCRAI 92


>gi|6683763|gb|AAF23356.1|AF109193_1 ECA1 protein [Hordeum vulgare subsp. vulgare]
 gi|326489569|dbj|BAK01765.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 174

 Score = 57.4 bits (137), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/49 (51%), Positives = 36/49 (73%), Gaps = 3/49 (6%)

Query: 39 DFATRLDVRGGLTE-CWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          D A RL+  G +++ CW TL+ +KSC+ EI++FFLN +A +GP CCR I
Sbjct: 43 DLADRLE--GAVSQQCWETLLHIKSCTGEIILFFLNGEAYLGPGCCRAI 89


>gi|357459299|ref|XP_003599930.1| hypothetical protein MTR_3g049180 [Medicago truncatula]
 gi|357459309|ref|XP_003599935.1| hypothetical protein MTR_3g049270 [Medicago truncatula]
 gi|355488978|gb|AES70181.1| hypothetical protein MTR_3g049180 [Medicago truncatula]
 gi|355488983|gb|AES70186.1| hypothetical protein MTR_3g049270 [Medicago truncatula]
          Length = 145

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 6/94 (6%)

Query: 1  MALKHVFFILTFTCLIMANIANATT--RNDRLNNSMKPGYDFATRLDVRGG-LTECWNTL 57
          MA  H  F+L    L+   + ++T+   + +L+N          RL V G   + CW +L
Sbjct: 1  MASSHKIFVLVALALVFIALNSSTSIVESRKLSNPNSNLMSLEARLKVSGDEPSNCWESL 60

Query: 58 MKLKSCSNEIVIFFLNSQADIGPDCC---RVIDH 88
           KL++CS EI+ FFLN +  +G  CC   RVI H
Sbjct: 61 FKLQACSGEIITFFLNGETYLGYGCCKAIRVIGH 94


>gi|147774833|emb|CAN73446.1| hypothetical protein VITISV_016791 [Vitis vinifera]
          Length = 117

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 34/48 (70%)

Query: 39 DFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          +   RL + G    CW++L +L+SC++E+++FFLN +  +GP+CCR I
Sbjct: 12 NLQARLRLEGESPNCWDSLFELQSCTSEVIMFFLNGETHLGPNCCRAI 59


>gi|449443584|ref|XP_004139557.1| PREDICTED: uncharacterized protein LOC101223174 [Cucumis sativus]
          Length = 142

 Score = 55.8 bits (133), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 37/57 (64%)

Query: 30 LNNSMKPGYDFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          + +S  P    ATR+ + G  ++CW +L +L++C+ E++ FFL+ +A +G  CC+ I
Sbjct: 22 VESSPNPKSSLATRIKLEGETSDCWGSLYELQACTGEVITFFLSGEAYLGVKCCQAI 78


>gi|255557431|ref|XP_002519746.1| conserved hypothetical protein [Ricinus communis]
 gi|223541163|gb|EEF42719.1| conserved hypothetical protein [Ricinus communis]
          Length = 135

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 37/56 (66%)

Query: 31 NNSMKPGYDFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          N +M  G + A RL +    + CW++L++L++C+ EI++FFLN +  +G  CC+ I
Sbjct: 20 NEAMASGSNLAARLKLDEESSNCWDSLIQLEACTTEIILFFLNGETHLGHGCCQAI 75


>gi|125543485|gb|EAY89624.1| hypothetical protein OsI_11152 [Oryza sativa Indica Group]
          Length = 151

 Score = 55.1 bits (131), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 28/35 (80%)

Query: 52 ECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          +CW  LM++KSC+ EI++FF+N +A +GP CCR I
Sbjct: 46 QCWEVLMEIKSCTGEILLFFINGEAYLGPGCCRAI 80


>gi|115452461|ref|NP_001049831.1| Os03g0296600 [Oryza sativa Japonica Group]
 gi|108707650|gb|ABF95445.1| ECA1 protein, putative, expressed [Oryza sativa Japonica Group]
 gi|113548302|dbj|BAF11745.1| Os03g0296600 [Oryza sativa Japonica Group]
 gi|125543494|gb|EAY89633.1| hypothetical protein OsI_11163 [Oryza sativa Indica Group]
 gi|125585927|gb|EAZ26591.1| hypothetical protein OsJ_10489 [Oryza sativa Japonica Group]
 gi|215766396|dbj|BAG98624.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 151

 Score = 55.1 bits (131), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 28/36 (77%)

Query: 51 TECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
           +CW  LM++KSC+ EI++FF+N +A +GP CCR I
Sbjct: 45 QQCWEVLMEIKSCTGEILLFFINGEAYLGPGCCRAI 80


>gi|15223149|ref|NP_177801.1| uncharacterized protein [Arabidopsis thaliana]
 gi|75207334|sp|Q9SRD8.1|EC11_ARATH RecName: Full=Egg cell-secreted protein 1.1; Flags: Precursor
 gi|6143892|gb|AAF04438.1|AC010718_7 hypothetical protein; 47879-48355 [Arabidopsis thaliana]
 gi|52354237|gb|AAU44439.1| hypothetical protein AT1G76750 [Arabidopsis thaliana]
 gi|55740537|gb|AAV63861.1| hypothetical protein At1g76750 [Arabidopsis thaliana]
 gi|332197764|gb|AEE35885.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 158

 Score = 54.3 bits (129), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 40/81 (49%)

Query: 6  VFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKLKSCSN 65
           F I+T   ++ ++   A         +  P      RL +      CW++LM+L+ CS 
Sbjct: 10 TFNIVTLMLMVASSTVTARPLMKPSMGTSSPTTSLVYRLKLDEDTGYCWDSLMQLQHCSG 69

Query: 66 EIVIFFLNSQADIGPDCCRVI 86
          E+++FFLN +  IGP CC  I
Sbjct: 70 ELILFFLNGETYIGPGCCSAI 90


>gi|297839547|ref|XP_002887655.1| hypothetical protein ARALYDRAFT_476835 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297333496|gb|EFH63914.1| hypothetical protein ARALYDRAFT_476835 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 158

 Score = 52.4 bits (124), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 40 FATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          +  RLD   G   CW++LM+L+ CS E+++FFLN +  IGP CC  I
Sbjct: 47 YRLRLDEDTGY--CWDSLMQLQHCSGELILFFLNGETYIGPGCCSAI 91


>gi|356574547|ref|XP_003555407.1| PREDICTED: uncharacterized protein LOC100801164 [Glycine max]
          Length = 141

 Score = 52.4 bits (124), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 31/47 (65%)

Query: 40 FATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
             RL + G  + CW++L +L++C+ E++ FFLN +  +GP CC+ I
Sbjct: 33 LVARLKLDGESSNCWDSLFELQACTGEVITFFLNGETYLGPSCCQAI 79


>gi|115484361|ref|NP_001065842.1| Os11g0168000 [Oryza sativa Japonica Group]
 gi|62701925|gb|AAX92998.1| hypothetical protein LOC_Os11g06730 [Oryza sativa Japonica Group]
 gi|62734370|gb|AAX96479.1| ECA1 protein [Oryza sativa Japonica Group]
 gi|77548909|gb|ABA91706.1| expressed protein [Oryza sativa Japonica Group]
 gi|113644546|dbj|BAF27687.1| Os11g0168000 [Oryza sativa Japonica Group]
 gi|125533539|gb|EAY80087.1| hypothetical protein OsI_35256 [Oryza sativa Indica Group]
 gi|215692958|dbj|BAG88378.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 144

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 27/34 (79%)

Query: 53 CWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          CW+ + KL SC+NEIV+FF+N ++ +GPDCC  I
Sbjct: 56 CWSAVTKLGSCTNEIVLFFVNGESYLGPDCCVAI 89


>gi|242041253|ref|XP_002468021.1| hypothetical protein SORBIDRAFT_01g038210 [Sorghum bicolor]
 gi|241921875|gb|EER95019.1| hypothetical protein SORBIDRAFT_01g038210 [Sorghum bicolor]
          Length = 171

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 26/35 (74%)

Query: 52 ECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          +CW  L+++KSC+ EI+I F+  +A +GP CCR I
Sbjct: 59 QCWEALVEIKSCTGEIIILFIKGEAFLGPGCCRAI 93


>gi|413956023|gb|AFW88672.1| hypothetical protein ZEAMMB73_483293 [Zea mays]
          Length = 160

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 26/35 (74%)

Query: 52 ECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          +CW  L+++KSC+ EI+I F+  +A +GP CCR I
Sbjct: 52 QCWEALVEIKSCTGEIIILFIRGEAFLGPGCCRAI 86


>gi|242084902|ref|XP_002442876.1| hypothetical protein SORBIDRAFT_08g004200 [Sorghum bicolor]
 gi|241943569|gb|EES16714.1| hypothetical protein SORBIDRAFT_08g004200 [Sorghum bicolor]
          Length = 177

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%)

Query: 50  LTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
             ECW  +M L SC +EI++FF+N ++ IGP+CC  I
Sbjct: 82  FAECWAAVMGLSSCYSEILLFFVNGESYIGPECCVAI 118


>gi|226532650|ref|NP_001140898.1| hypothetical protein precursor [Zea mays]
 gi|194701656|gb|ACF84912.1| unknown [Zea mays]
 gi|414878452|tpg|DAA55583.1| TPA: hypothetical protein ZEAMMB73_509480 [Zea mays]
          Length = 164

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 53  CWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
           CW  +M L SC  EI++FF+N ++ IGPDCC  I
Sbjct: 69  CWGAVMGLSSCYGEILLFFVNGESYIGPDCCVAI 102


>gi|357464487|ref|XP_003602525.1| ECA1 protein [Medicago truncatula]
 gi|355491573|gb|AES72776.1| ECA1 protein [Medicago truncatula]
          Length = 133

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 11/84 (13%)

Query: 6  VFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRL---DVRGGLTECWNTLMKLKS 62
           FF+  F  + ++ I  AT        S+      A+RL   D  G   +CW T+++L+ 
Sbjct: 2  AFFLKLFIIISLSTIVTAT--------SLSSTKTLASRLELFDGSGPNNKCWETMLELQH 53

Query: 63 CSNEIVIFFLNSQADIGPDCCRVI 86
          C+ +IV FFLN Q  +G  CC  +
Sbjct: 54 CTGDIVTFFLNGQTHLGSGCCNAL 77


>gi|255566628|ref|XP_002524298.1| conserved hypothetical protein [Ricinus communis]
 gi|223536389|gb|EEF38038.1| conserved hypothetical protein [Ricinus communis]
          Length = 141

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 32/54 (59%)

Query: 33 SMKPGYDFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          S+ P      +LD       CW++L++L++C+ EI++FFLN +  +G  CC  I
Sbjct: 29 SLSPTLLARLKLDEEASSISCWDSLVQLQACTGEIILFFLNGETYLGHSCCEAI 82


>gi|224060463|ref|XP_002300212.1| predicted protein [Populus trichocarpa]
 gi|222847470|gb|EEE85017.1| predicted protein [Populus trichocarpa]
          Length = 130

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 3/51 (5%)

Query: 41 ATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCC---RVIDH 88
          ATRL +    T CW +L+ L+SC + +++FFLN +  + P CC   R+I H
Sbjct: 36 ATRLRLDNEETTCWGSLLHLQSCISNVLLFFLNGETYLRPSCCHAIRIIGH 86


>gi|357152868|ref|XP_003576261.1| PREDICTED: uncharacterized protein LOC100824523 [Brachypodium
           distachyon]
          Length = 160

 Score = 47.4 bits (111), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 27/37 (72%)

Query: 50  LTECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
           L ECW  + +L++C++E+V+FFLN Q+ +G  CC  I
Sbjct: 67  LAECWGAVAELRACTDEMVLFFLNGQSYLGRPCCLAI 103


>gi|224096932|ref|XP_002334656.1| predicted protein [Populus trichocarpa]
 gi|222874068|gb|EEF11199.1| predicted protein [Populus trichocarpa]
          Length = 76

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 27/34 (79%)

Query: 53 CWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          CW++L++L++C+ EIV+FFLN +  +G  CC+ +
Sbjct: 1  CWDSLLQLQACTGEIVLFFLNGETQLGHSCCQAL 34


>gi|224116762|ref|XP_002331871.1| predicted protein [Populus trichocarpa]
 gi|222875389|gb|EEF12520.1| predicted protein [Populus trichocarpa]
          Length = 87

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 27/34 (79%)

Query: 53 CWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          CW++L++L++CS EI++FFLN +  +G  CC+ +
Sbjct: 10 CWDSLVQLQACSGEIILFFLNGETQLGRSCCQAL 43


>gi|77553109|gb|ABA95905.1| hypothetical protein LOC_Os12g06970 [Oryza sativa Japonica Group]
 gi|125578623|gb|EAZ19769.1| hypothetical protein OsJ_35349 [Oryza sativa Japonica Group]
          Length = 139

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 27/36 (75%), Gaps = 1/36 (2%)

Query: 52 ECWNTLMKLKSCSNEIVIFFLNSQ-ADIGPDCCRVI 86
          ECW+ + +L+SC++EIV+FFLN +   +G  CCR +
Sbjct: 56 ECWSAVAELRSCTDEIVLFFLNGETTQLGAGCCRAV 91


>gi|125525177|gb|EAY73291.1| hypothetical protein OsI_01165 [Oryza sativa Indica Group]
          Length = 142

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 27/36 (75%), Gaps = 1/36 (2%)

Query: 52 ECWNTLMKLKSCSNEIVIFFLNSQ-ADIGPDCCRVI 86
          ECW+ + +L+SC++EIV+FFLN +   +G  CCR +
Sbjct: 56 ECWSAVAELRSCTDEIVLFFLNGETTQLGAGCCRAV 91


>gi|224103865|ref|XP_002313224.1| predicted protein [Populus trichocarpa]
 gi|222849632|gb|EEE87179.1| predicted protein [Populus trichocarpa]
          Length = 76

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 14/34 (41%), Positives = 27/34 (79%)

Query: 53 CWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVI 86
          CW++L++L++C+ EI++FFLN +  +G  CC+ +
Sbjct: 1  CWDSLIQLQACTGEIILFFLNGETQLGHSCCQAL 34


>gi|297797495|ref|XP_002866632.1| hypothetical protein ARALYDRAFT_919796 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297312467|gb|EFH42891.1| hypothetical protein ARALYDRAFT_919796 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 154

 Score = 41.6 bits (96), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 8/47 (17%)

Query: 48 GGLTECWNTLMKLKSCSNEIVIFFLNSQADIGP--------DCCRVI 86
          G L +CWN  ++LKSC++EIV FFL+      P        DCC  I
Sbjct: 48 GNLMDCWNAGLELKSCTDEIVKFFLSQTGTTEPAVKGGIDKDCCGAI 94


>gi|186532748|ref|NP_201277.3| uncharacterized protein [Arabidopsis thaliana]
 gi|75170418|sp|Q9FGG1.1|EC15_ARATH RecName: Full=Egg cell-secreted protein 1.5; Flags: Precursor
 gi|10177203|dbj|BAB10305.1| unnamed protein product [Arabidopsis thaliana]
 gi|117168117|gb|ABK32141.1| At5g64720 [Arabidopsis thaliana]
 gi|332010560|gb|AED97943.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 155

 Score = 41.6 bits (96), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 28/54 (51%), Gaps = 8/54 (14%)

Query: 41 ATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQADIGP--------DCCRVI 86
          AT     G L +CWN  ++LKSC++EIV FFL+      P        DCC  I
Sbjct: 41 ATDHSGAGNLMDCWNAGLELKSCTDEIVKFFLSQTGTSEPPVKGGIDKDCCGAI 94


>gi|343887288|dbj|BAK61834.1| hypothetical protein [Citrus unshiu]
          Length = 202

 Score = 37.4 bits (85), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/70 (27%), Positives = 34/70 (48%), Gaps = 8/70 (11%)

Query: 18  ANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQAD 77
           A + +A ++    N  + PG D A        +TECW+++   + C+ E+       Q +
Sbjct: 109 AGVLSAASKVSSPNLLLTPGIDVAE-------VTECWSSIASTEGCALEVYKSLTTGQIN 161

Query: 78  -IGPDCCRVI 86
            +GP CC+ I
Sbjct: 162 GVGPACCKAI 171


>gi|357152772|ref|XP_003576231.1| PREDICTED: uncharacterized protein LOC100833200 [Brachypodium
           distachyon]
          Length = 148

 Score = 37.4 bits (85), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 14/40 (35%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 48  GGLTECWNTLMKLKS-CSNEIVIFFLNSQADIGPDCCRVI 86
           G L +CW  + +++S C+ E  +FFL+ +A +G  CC  +
Sbjct: 62  GSLVDCWGAVDEVRSQCAEEAAVFFLDGEAYLGRACCLAV 101


>gi|343887297|dbj|BAK61843.1| hypothetical protein [Citrus unshiu]
          Length = 214

 Score = 37.0 bits (84), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 52 ECWNTLMKLKSCSNEIVIFFLNSQ-ADIGPDCCRVIDHL 89
          ECW++L  + +C  EI   FL  Q   IGP CC  I+ +
Sbjct: 51 ECWSSLTNIPTCLTEIYGSFLGGQIGQIGPACCDAINRI 89


>gi|145332735|ref|NP_001078233.1| ECA1 gametogenesis related family protein [Arabidopsis thaliana]
 gi|332644218|gb|AEE77739.1| ECA1 gametogenesis related family protein [Arabidopsis thaliana]
 gi|371782142|emb|CCE46137.1| cysteine rich peptide [Arabidopsis thaliana]
          Length = 119

 Score = 36.6 bits (83), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 12/88 (13%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGL---TECWNTL 57
          M++K+VF +L   C+I++  A       +L         F     +  GL   T+CW+++
Sbjct: 1  MSIKNVFSLLYVLCIIVSVNAQLPQFPAQL--------PFPFPFQLISGLPDITKCWSSV 52

Query: 58 MKLKSCSNEIV-IFFLNSQADIGPDCCR 84
          M +  C  EI    F     ++GP CC+
Sbjct: 53 MDIPGCIAEISQSIFTGKFGNLGPACCK 80


>gi|449461573|ref|XP_004148516.1| PREDICTED: uncharacterized protein LOC101207529 [Cucumis sativus]
 gi|449531633|ref|XP_004172790.1| PREDICTED: uncharacterized protein LOC101232079 [Cucumis sativus]
          Length = 123

 Score = 36.2 bits (82), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 13/38 (34%), Positives = 24/38 (63%)

Query: 51 TECWNTLMKLKSCSNEIVIFFLNSQADIGPDCCRVIDH 88
          ++CW+ +  +K C NEI     +++ ++  DCC+VI H
Sbjct: 37 SDCWDAINAVKGCQNEIDTAMKSNEIEVSYDCCKVILH 74


>gi|297821391|ref|XP_002878578.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324417|gb|EFH54837.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 122

 Score = 36.2 bits (82), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 3/85 (3%)

Query: 1  MALKHVFFILTFTCLIMANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKL 60
          MA+K+V   L   C++++   NA              +     +     +T+CW+++M +
Sbjct: 1  MAIKNVILFLAVICIVVS--VNAQLPQFPAPFPFPFPFQPIPNMSGLPDITKCWSSVMDI 58

Query: 61 KSCSNEIV-IFFLNSQADIGPDCCR 84
            C  EI    F+    +IGP CC+
Sbjct: 59 PGCIAEISQSIFIGKFGNIGPACCK 83


>gi|26450771|dbj|BAC42494.1| unknown protein [Arabidopsis thaliana]
 gi|28416875|gb|AAO42968.1| At5g51105 [Arabidopsis thaliana]
          Length = 123

 Score = 35.8 bits (81), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 5/59 (8%)

Query: 29 RLNNSMKPGYDFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQAD-IGPDCCRVI 86
             N  KP    A   D    L++CW+T+M L  C  EI    ++ + + +GP CC+  
Sbjct: 32 PFPNPFKPSPGMAGSPD----LSKCWSTVMDLPGCFEEIQQAVMSGKFEGVGPACCKAF 86


>gi|22327712|ref|NP_680421.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332008652|gb|AED96035.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 123

 Score = 35.8 bits (81), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 5/59 (8%)

Query: 29 RLNNSMKPGYDFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQAD-IGPDCCRVI 86
             N  KP    A   D    L++CW+T+M L  C  EI    ++ + + +GP CC+  
Sbjct: 32 PFPNPFKPSPGMAGSPD----LSKCWSTVMDLPGCFEEIQQAVMSGKFEGVGPACCKAF 86


>gi|297849394|ref|XP_002892578.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338420|gb|EFH68837.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 119

 Score = 35.4 bits (80), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 24/37 (64%), Gaps = 1/37 (2%)

Query: 50 LTECWNTLMKLKSCSNEIVIFFLNSQ-ADIGPDCCRV 85
          +T+CW+++M +  C  EI    LN + ++IGP CC+ 
Sbjct: 45 ITKCWSSVMNIPGCITEISQAILNGRFSNIGPACCKA 81


>gi|343887287|dbj|BAK61833.1| hypothetical protein [Citrus unshiu]
          Length = 202

 Score = 35.4 bits (80), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 18/70 (25%), Positives = 34/70 (48%), Gaps = 8/70 (11%)

Query: 18  ANIANATTRNDRLNNSMKPGYDFATRLDVRGGLTECWNTLMKLKSCSNEIVIFFLNSQAD 77
           A + +A ++    N  + PG + A        +TECW+++   + C+ E+       Q +
Sbjct: 109 AGVLSAASKVSSPNLLLTPGINVAE-------VTECWSSIASTEGCALEVYKSLTTGQIN 161

Query: 78  -IGPDCCRVI 86
            +GP CC+ I
Sbjct: 162 GVGPACCKAI 171


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.329    0.140    0.446 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,229,445,297
Number of Sequences: 23463169
Number of extensions: 37153205
Number of successful extensions: 87367
Number of sequences better than 100.0: 52
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 87322
Number of HSP's gapped (non-prelim): 55
length of query: 89
length of database: 8,064,228,071
effective HSP length: 59
effective length of query: 30
effective length of database: 6,679,901,100
effective search space: 200397033000
effective search space used: 200397033000
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 69 (31.2 bits)