BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 034827
         (82 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449433457|ref|XP_004134514.1| PREDICTED: uncharacterized protein LOC101220487 isoform 2
          [Cucumis sativus]
 gi|449531986|ref|XP_004172966.1| PREDICTED: uncharacterized LOC101220487 isoform 2 [Cucumis
          sativus]
          Length = 81

 Score =  152 bits (385), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 72/76 (94%), Positives = 76/76 (100%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHL+GMGLGAVFVNQLVKWDA+LQ+DL
Sbjct: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLVGMGLGAVFVNQLVKWDAKLQEDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DKML+KAKAANERRYF
Sbjct: 61 DKMLDKAKAANERRYF 76


>gi|255578511|ref|XP_002530119.1| conserved hypothetical protein [Ricinus communis]
 gi|223530373|gb|EEF32263.1| conserved hypothetical protein [Ricinus communis]
          Length = 81

 Score =  145 bits (366), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 69/76 (90%), Positives = 72/76 (94%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          MA  AT IGAL GLGTQMYSNALRKLPYMRHPWEH++GMGLGAVFVNQLVKWDAQLQQDL
Sbjct: 1  MALGATAIGALFGLGTQMYSNALRKLPYMRHPWEHVIGMGLGAVFVNQLVKWDAQLQQDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DKML+KAKAANERRYF
Sbjct: 61 DKMLDKAKAANERRYF 76


>gi|224053651|ref|XP_002297912.1| predicted protein [Populus trichocarpa]
 gi|118484384|gb|ABK94069.1| unknown [Populus trichocarpa]
 gi|222845170|gb|EEE82717.1| predicted protein [Populus trichocarpa]
          Length = 81

 Score =  144 bits (364), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 66/78 (84%), Positives = 74/78 (94%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M +SAT++GALLGLGTQMYSNALRKLPYMRHPWEH++GMGLG VFVNQLVKWDAQL+QDL
Sbjct: 1  MVFSATVVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGVVFVNQLVKWDAQLEQDL 60

Query: 61 DKMLEKAKAANERRYFGR 78
          DKML+KAK ANERRYF +
Sbjct: 61 DKMLQKAKEANERRYFDQ 78


>gi|224075385|ref|XP_002304611.1| predicted protein [Populus trichocarpa]
 gi|118485985|gb|ABK94837.1| unknown [Populus trichocarpa]
 gi|222842043|gb|EEE79590.1| predicted protein [Populus trichocarpa]
          Length = 81

 Score =  142 bits (357), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 67/76 (88%), Positives = 71/76 (93%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  SAT+IGALLGLGTQMYSNALRKLPYMRHPWEH++GMGLG VFVNQLVKWDAQL+QDL
Sbjct: 1  MVLSATVIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGVVFVNQLVKWDAQLEQDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DKML KAK ANERRYF
Sbjct: 61 DKMLLKAKEANERRYF 76


>gi|356512239|ref|XP_003524828.1| PREDICTED: uncharacterized protein LOC100807917 [Glycine max]
 gi|356525004|ref|XP_003531117.1| PREDICTED: uncharacterized protein LOC100778929 [Glycine max]
          Length = 80

 Score =  139 bits (351), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 65/73 (89%), Positives = 71/73 (97%)

Query: 4  SATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKM 63
          SAT IGALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVFVNQL+KW+AQ++QDLDKM
Sbjct: 3  SATAIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFVNQLLKWEAQVEQDLDKM 62

Query: 64 LEKAKAANERRYF 76
          LEKAKAANERRY 
Sbjct: 63 LEKAKAANERRYI 75


>gi|2982432|emb|CAA18240.1| putative protein [Arabidopsis thaliana]
 gi|7268810|emb|CAB79015.1| putative protein [Arabidopsis thaliana]
          Length = 105

 Score =  139 bits (350), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 64/77 (83%), Positives = 71/77 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  SATM+GALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVF NQLVKWD +L++DL
Sbjct: 1  MPISATMVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFANQLVKWDVKLKEDL 60

Query: 61 DKMLEKAKAANERRYFG 77
          D ML KA+AANERRYFG
Sbjct: 61 DVMLAKARAANERRYFG 77


>gi|115472221|ref|NP_001059709.1| Os07g0498300 [Oryza sativa Japonica Group]
 gi|33146542|dbj|BAC79719.1| unknown protein [Oryza sativa Japonica Group]
 gi|50509990|dbj|BAD30453.1| unknown protein [Oryza sativa Japonica Group]
 gi|113611245|dbj|BAF21623.1| Os07g0498300 [Oryza sativa Japonica Group]
 gi|218199665|gb|EEC82092.1| hypothetical protein OsI_26096 [Oryza sativa Indica Group]
 gi|222637090|gb|EEE67222.1| hypothetical protein OsJ_24342 [Oryza sativa Japonica Group]
          Length = 81

 Score =  137 bits (346), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 63/76 (82%), Positives = 71/76 (93%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVFVNQLVKWD +L++DL
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFVNQLVKWDEKLKEDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DKMLE+AK ANERRYF
Sbjct: 61 DKMLERAKQANERRYF 76


>gi|18415447|ref|NP_567596.1| uncharacterized protein [Arabidopsis thaliana]
 gi|15028287|gb|AAK76620.1| unknown protein [Arabidopsis thaliana]
 gi|19310709|gb|AAL85085.1| unknown protein [Arabidopsis thaliana]
 gi|21555789|gb|AAM63934.1| unknown [Arabidopsis thaliana]
 gi|332658880|gb|AEE84280.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 81

 Score =  136 bits (343), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 63/76 (82%), Positives = 70/76 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  SATM+GALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVF NQLVKWD +L++DL
Sbjct: 1  MPISATMVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFANQLVKWDVKLKEDL 60

Query: 61 DKMLEKAKAANERRYF 76
          D ML KA+AANERRYF
Sbjct: 61 DVMLAKARAANERRYF 76


>gi|297800010|ref|XP_002867889.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313725|gb|EFH44148.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 81

 Score =  136 bits (343), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 63/76 (82%), Positives = 70/76 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  SATM+GALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVF NQLVKWD +L++DL
Sbjct: 1  MPLSATMVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFANQLVKWDVKLKEDL 60

Query: 61 DKMLEKAKAANERRYF 76
          D ML KA+AANERRYF
Sbjct: 61 DVMLAKARAANERRYF 76


>gi|357122731|ref|XP_003563068.1| PREDICTED: uncharacterized protein LOC100836427 [Brachypodium
          distachyon]
          Length = 81

 Score =  135 bits (341), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 63/76 (82%), Positives = 70/76 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLG VFV+QLVKWD +L+QDL
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGVVFVHQLVKWDEKLKQDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DKMLE+AK ANERRYF
Sbjct: 61 DKMLERAKQANERRYF 76


>gi|388517345|gb|AFK46734.1| unknown [Lotus japonicus]
          Length = 81

 Score =  135 bits (339), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 62/76 (81%), Positives = 70/76 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  SAT IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMG+G VFVNQL+KW+AQ+++DL
Sbjct: 1  MVLSATAIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGIGVVFVNQLLKWEAQVERDL 60

Query: 61 DKMLEKAKAANERRYF 76
          D MLE+AKAANERRY 
Sbjct: 61 DVMLERAKAANERRYI 76


>gi|212274813|ref|NP_001130978.1| uncharacterized protein LOC100192083 [Zea mays]
 gi|194690614|gb|ACF79391.1| unknown [Zea mays]
 gi|414886759|tpg|DAA62773.1| TPA: hypothetical protein ZEAMMB73_824701 [Zea mays]
          Length = 100

 Score =  134 bits (338), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 62/77 (80%), Positives = 71/77 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKFDEKLKEDL 60

Query: 61 DKMLEKAKAANERRYFG 77
          DKMLE+A+ ANE+RY G
Sbjct: 61 DKMLERARLANEQRYIG 77


>gi|414590318|tpg|DAA40889.1| TPA: hypothetical protein ZEAMMB73_782543, partial [Zea mays]
          Length = 121

 Score =  133 bits (335), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 60/76 (78%), Positives = 70/76 (92%)

Query: 1   MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
           M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLGAVFVNQLVK+D +L++DL
Sbjct: 42  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGAVFVNQLVKFDEKLKEDL 101

Query: 61  DKMLEKAKAANERRYF 76
           DKMLE+A+ ANE+RY 
Sbjct: 102 DKMLERARLANEQRYI 117


>gi|449433455|ref|XP_004134513.1| PREDICTED: uncharacterized protein LOC101220487 isoform 1 [Cucumis
           sativus]
 gi|449531984|ref|XP_004172965.1| PREDICTED: uncharacterized LOC101220487 isoform 1 [Cucumis sativus]
          Length = 116

 Score =  132 bits (333), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 71/111 (63%), Positives = 75/111 (67%), Gaps = 35/111 (31%)

Query: 1   MAWSATMIGALLGLGTQMYSNALRKLPYMRH----------------------------- 31
           MAWSATMIGALLGLGTQMYSNALRKLPYMR                              
Sbjct: 1   MAWSATMIGALLGLGTQMYSNALRKLPYMRRTFQTLVFSFPHHNPSLLFRFRFRFLSSIP 60

Query: 32  ------PWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKMLEKAKAANERRYF 76
                 PWEHL+GMGLGAVFVNQLVKWDA+LQ+DLDKML+KAKAANERRYF
Sbjct: 61  SHNFADPWEHLVGMGLGAVFVNQLVKWDAKLQEDLDKMLDKAKAANERRYF 111


>gi|414590315|tpg|DAA40886.1| TPA: hypothetical protein ZEAMMB73_782543 [Zea mays]
          Length = 163

 Score =  132 bits (332), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 61/77 (79%), Positives = 71/77 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGAVFVNQLVKFDEKLKEDL 60

Query: 61 DKMLEKAKAANERRYFG 77
          DKMLE+A+ ANE+RY G
Sbjct: 61 DKMLERARLANEQRYIG 77


>gi|414590316|tpg|DAA40887.1| TPA: hypothetical protein ZEAMMB73_782543 [Zea mays]
          Length = 93

 Score =  132 bits (332), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 60/77 (77%), Positives = 70/77 (90%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGAVFVNQLVKFDEKLKEDL 60

Query: 61 DKMLEKAKAANERRYFG 77
          DKMLE+A+ ANE+RY  
Sbjct: 61 DKMLERARLANEQRYIA 77


>gi|195621014|gb|ACG32337.1| hypothetical protein [Zea mays]
 gi|195655033|gb|ACG46984.1| hypothetical protein [Zea mays]
 gi|414886760|tpg|DAA62774.1| TPA: hypothetical protein ZEAMMB73_824701 [Zea mays]
          Length = 80

 Score =  132 bits (331), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 61/76 (80%), Positives = 70/76 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKFDEKLKEDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DKMLE+A+ ANE+RY 
Sbjct: 61 DKMLERARLANEQRYI 76


>gi|242050286|ref|XP_002462887.1| hypothetical protein SORBIDRAFT_02g033820 [Sorghum bicolor]
 gi|195621360|gb|ACG32510.1| hypothetical protein [Zea mays]
 gi|223974885|gb|ACN31630.1| unknown [Zea mays]
 gi|241926264|gb|EER99408.1| hypothetical protein SORBIDRAFT_02g033820 [Sorghum bicolor]
          Length = 80

 Score =  131 bits (329), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 60/76 (78%), Positives = 70/76 (92%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGAVFVNQLVKFDEKLKEDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DKMLE+A+ ANE+RY 
Sbjct: 61 DKMLERARLANEQRYI 76


>gi|326517918|dbj|BAK07211.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519622|dbj|BAK00184.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 81

 Score =  130 bits (327), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 60/76 (78%), Positives = 69/76 (90%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLG VFVNQLVK+D +++ DL
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGVVFVNQLVKFDEKVKVDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DKMLE+AK ANE+RYF
Sbjct: 61 DKMLERAKHANEQRYF 76


>gi|297738473|emb|CBI27674.3| unnamed protein product [Vitis vinifera]
          Length = 139

 Score =  122 bits (307), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 54/76 (71%), Positives = 68/76 (89%)

Query: 1   MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
           M  SAT++GA+LGL  Q++SNALRKLP MRHPWEH+L +G+GAVF NQLVKW+A++++DL
Sbjct: 59  MPLSATVVGAVLGLSVQLHSNALRKLPLMRHPWEHVLAIGIGAVFGNQLVKWEAKVEEDL 118

Query: 61  DKMLEKAKAANERRYF 76
           DK+LEKAKAANERRY 
Sbjct: 119 DKLLEKAKAANERRYI 134


>gi|225425382|ref|XP_002276698.1| PREDICTED: uncharacterized protein LOC100257379 [Vitis vinifera]
          Length = 81

 Score =  122 bits (306), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 54/76 (71%), Positives = 68/76 (89%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  SAT++GA+LGL  Q++SNALRKLP MRHPWEH+L +G+GAVF NQLVKW+A++++DL
Sbjct: 1  MPLSATVVGAVLGLSVQLHSNALRKLPLMRHPWEHVLAIGIGAVFGNQLVKWEAKVEEDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DK+LEKAKAANERRY 
Sbjct: 61 DKLLEKAKAANERRYI 76


>gi|116779889|gb|ABK21463.1| unknown [Picea sitchensis]
          Length = 93

 Score =  107 bits (268), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 49/77 (63%), Positives = 61/77 (79%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M    T+ GA LGL TQ+ SN +RKLP MRHPWEH++GMGLGA+F NQL++W+A+LQ+DL
Sbjct: 1  MVTGVTLAGACLGLTTQLLSNGMRKLPLMRHPWEHVIGMGLGALFTNQLIQWEAKLQEDL 60

Query: 61 DKMLEKAKAANERRYFG 77
          DK L +AKAAN RR  G
Sbjct: 61 DKKLGEAKAANRRRLSG 77


>gi|168012964|ref|XP_001759171.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689484|gb|EDQ75855.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 84

 Score =  102 bits (255), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 44/77 (57%), Positives = 58/77 (75%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  +ATM+GA++GL  Q YSNA+RKLP MRHPWEH+L +GLGA F N +  W+ +LQ+DL
Sbjct: 1  MVATATMVGAIMGLSCQFYSNAVRKLPLMRHPWEHVLAIGLGAGFGNAVASWEVRLQEDL 60

Query: 61 DKMLEKAKAANERRYFG 77
          DK L + + AN +RY G
Sbjct: 61 DKKLSEVQTANNKRYIG 77


>gi|168021117|ref|XP_001763088.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685571|gb|EDQ71965.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 76

 Score =  101 bits (251), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 44/76 (57%), Positives = 58/76 (76%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  +ATM+GAL+GL  Q YSN +RKLP MRHPWEHLL +GLGA F N +  W+ +LQ+DL
Sbjct: 1  MVATATMVGALMGLSCQFYSNGVRKLPLMRHPWEHLLAIGLGAGFGNAVASWEVRLQEDL 60

Query: 61 DKMLEKAKAANERRYF 76
          DK L++ ++AN +RY 
Sbjct: 61 DKKLQEVQSANNKRYI 76


>gi|330318746|gb|AEC11033.1| hypothetical protein [Camellia sinensis]
          Length = 45

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 38/45 (84%), Positives = 42/45 (93%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVF 45
          M  ++TMIGALLGLGTQMYSNALRKLPYMRHPWEH++GMGLG VF
Sbjct: 1  MPATSTMIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGVVF 45


>gi|302781698|ref|XP_002972623.1| hypothetical protein SELMODRAFT_97190 [Selaginella
          moellendorffii]
 gi|300160090|gb|EFJ26709.1| hypothetical protein SELMODRAFT_97190 [Selaginella
          moellendorffii]
          Length = 83

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 32/62 (51%), Positives = 45/62 (72%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  + T+ GA+LGL  Q+Y N +RKLP MRHPWEH+L MG+GA  +N+LV W+ + +  L
Sbjct: 1  MVSTPTVAGAVLGLAIQLYCNGMRKLPLMRHPWEHVLFMGIGAYAINELVAWEYRTEAAL 60

Query: 61 DK 62
          +K
Sbjct: 61 EK 62


>gi|297738183|emb|CBI27384.3| unnamed protein product [Vitis vinifera]
          Length = 58

 Score = 75.1 bits (183), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 34/51 (66%), Positives = 43/51 (84%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVK 51
          M+  AT++GA+LGL  Q++SNALRKL  MRHPWEH+L +G+GAVF NQLVK
Sbjct: 8  MSLGATVVGAVLGLSVQLHSNALRKLLLMRHPWEHVLAIGIGAVFGNQLVK 58


>gi|255088009|ref|XP_002505927.1| hypothetical protein MICPUN_109457 [Micromonas sp. RCC299]
 gi|226521198|gb|ACO67185.1| hypothetical protein MICPUN_109457 [Micromonas sp. RCC299]
          Length = 80

 Score = 72.8 bits (177), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 50/68 (73%)

Query: 4  SATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKM 63
          SAT+IGA LG G Q+Y NALRKLP +R+PWEH+   G+GA   N   +W+A+L++++ ++
Sbjct: 2  SATLIGASLGFGMQLYINALRKLPLLRNPWEHVFWTGVGAAAANMNAEWEAKLRKEVAEL 61

Query: 64 LEKAKAAN 71
           ++ +A N
Sbjct: 62 NKERQATN 69


>gi|307103562|gb|EFN51821.1| hypothetical protein CHLNCDRAFT_139659 [Chlorella variabilis]
          Length = 76

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 44/71 (61%)

Query: 2  AWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLD 61
          A S    GA LG+G Q+Y NA+RKLP MR PW H+L  G G  F   LV ++ + ++DL 
Sbjct: 3  ALSIPAFGAALGVGVQLYVNAVRKLPLMRDPWLHVLWGGAGYSFGTWLVGFERRTEKDLA 62

Query: 62 KMLEKAKAANE 72
          +ML K   AN+
Sbjct: 63 EMLRKRDEANK 73


>gi|300123543|emb|CBK24815.2| unnamed protein product [Blastocystis hominis]
          Length = 81

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 41/64 (64%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M++ A + G  +GL T++YSN+LRKLPYMR PWEHL+ MG+G        K+  + +  L
Sbjct: 1  MSFIAGLTGCCIGLLTKLYSNSLRKLPYMREPWEHLIYMGVGGFSFYYAKKFIDREKNSL 60

Query: 61 DKML 64
            ML
Sbjct: 61 RTML 64


>gi|303275860|ref|XP_003057224.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461576|gb|EEH58869.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 85

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 40/60 (66%)

Query: 14 LGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKMLEKAKAANER 73
           GTQ++ NALRKLP +R+PWEH+L  G G    N + +W+ +L +++  M  + +AAN R
Sbjct: 16 FGTQLFINALRKLPLLRNPWEHVLWTGGGVWAANAMAEWEVKLTEEVKAMQAERQAANAR 75


>gi|348680338|gb|EGZ20154.1| hypothetical protein PHYSODRAFT_285374 [Phytophthora sojae]
          Length = 77

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 47/76 (61%), Gaps = 3/76 (3%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGA---VFVNQLVKWDAQLQ 57
          M ++   IG L+GL T++ SN L+K+PYMRHPWEH+L MG+GA    ++      D +  
Sbjct: 1  MGFTGIAIGTLMGLTTKIGSNILQKVPYMRHPWEHVLFMGVGAGLGGYLQNKYHRDLEEV 60

Query: 58 QDLDKMLEKAKAANER 73
          ++L   LE+    N++
Sbjct: 61 EELRLYLERRPDVNKK 76


>gi|414590317|tpg|DAA40888.1| TPA: hypothetical protein ZEAMMB73_782543, partial [Zea mays]
          Length = 75

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 26/31 (83%), Positives = 28/31 (90%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRH 31
          M  + T+IGALLGLGTQMYSNALRKLPYMRH
Sbjct: 42 MPATPTIIGALLGLGTQMYSNALRKLPYMRH 72


>gi|449017105|dbj|BAM80507.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
          10D]
          Length = 70

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 9/73 (12%)

Query: 2  AWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLD 61
          AWS    GAL GL  Q+ SNAL+KLP MR PWEH+L    GA   ++L   + +L Q ++
Sbjct: 4  AWS----GALFGLAVQLMSNALQKLPLMRRPWEHVLFAAGGAFVGHKLGDLEVELTQRIE 59

Query: 62 KMLEKAKAANERR 74
               A  AN++R
Sbjct: 60 -----AHRANQKR 67


>gi|414590314|tpg|DAA40885.1| TPA: hypothetical protein ZEAMMB73_782543 [Zea mays]
          Length = 92

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/30 (83%), Positives = 27/30 (90%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMR 30
          M  + T+IGALLGLGTQMYSNALRKLPYMR
Sbjct: 1  MPATPTIIGALLGLGTQMYSNALRKLPYMR 30


>gi|66270195|gb|AAY43427.1| unknown protein [Phytophthora infestans]
          Length = 66

 Score = 52.8 bits (125), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 42/65 (64%), Gaps = 3/65 (4%)

Query: 12 LGLGTQMYSNALRKLPYMRHPWEHLLGMGLGA---VFVNQLVKWDAQLQQDLDKMLEKAK 68
          +GL T++ SN L+K+PYMRHPWEH+L MG+GA    ++      D +  ++L   LE+ +
Sbjct: 1  MGLSTKLGSNVLQKVPYMRHPWEHVLFMGIGAGLGGYLQNKYHRDLEEVEELRLYLERRQ 60

Query: 69 AANER 73
            N++
Sbjct: 61 DVNKK 65


>gi|428172889|gb|EKX41795.1| hypothetical protein GUITHDRAFT_153752 [Guillardia theta
          CCMP2712]
          Length = 75

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 40/70 (57%)

Query: 7  MIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKMLEK 66
          ++GA  G+  ++YSN+LR++  +R PWEH L  G+GA     +  W  Q     ++ L++
Sbjct: 6  LMGATFGVLVKLYSNSLRRVSLLRQPWEHFLLAGIGAYSFQCIAVWTEQNIARYEEKLKE 65

Query: 67 AKAANERRYF 76
           K   + +Y+
Sbjct: 66 FKEGKQVKYY 75


>gi|299469849|emb|CBN76703.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 88

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 37/63 (58%)

Query: 1  MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
          M  +A  +G   G+  +   N L K+   R PWE++LG G+GA   N++ +W+A L  D+
Sbjct: 1  MGITAMTVGGAFGIVAKGTVNRLMKVSLRRAPWEYVLGAGIGAYVGNKIPQWEATLISDI 60

Query: 61 DKM 63
          ++M
Sbjct: 61 NEM 63


>gi|452821647|gb|EME28675.1| hypothetical protein Gasu_38830 [Galdieria sulphuraria]
          Length = 91

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 24/38 (63%)

Query: 8  IGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVF 45
          +GA +GL  Q+ +N  R LP  R PWEHL  MGLGA  
Sbjct: 6  VGAAIGLAAQIVTNGARLLPLSRRPWEHLAAMGLGAYI 43


>gi|397566851|gb|EJK45250.1| hypothetical protein THAOC_36140 [Thalassiosira oceanica]
          Length = 221

 Score = 43.5 bits (101), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 35/63 (55%)

Query: 1   MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
           M  +A  +GA  G G  M SNA++K+P  R PW H+    LG     + V+ + +L  D+
Sbjct: 136 MGLTAFFVGAFSGTGMHMMSNAIQKVPLSRRPWMHVGCFFLGGYVGQRWVRLEKELVLDI 195

Query: 61  DKM 63
           +++
Sbjct: 196 NEI 198


>gi|325189051|emb|CCA23579.1| unknown protein putative [Albugo laibachii Nc14]
          Length = 121

 Score = 42.0 bits (97), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 3/76 (3%)

Query: 1   MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAV---FVNQLVKWDAQLQ 57
           M     + G++ G  T+  SN LRK+P M+  WE++L +G GA    ++ +    D +  
Sbjct: 44  MGLGGIVTGSIFGFSTKFGSNVLRKVPLMQESWEYVLLIGAGAFAGSYLEKKYYRDVEEV 103

Query: 58  QDLDKMLEKAKAANER 73
           +++ + LE+    + +
Sbjct: 104 EEIRQFLEQKPLTDRK 119


>gi|145350242|ref|XP_001419522.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579754|gb|ABO97815.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 84

 Score = 37.7 bits (86), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 18/57 (31%), Positives = 34/57 (59%)

Query: 7  MIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKM 63
          + GA+ G+G   ++N LRKLP    PW+H +  G+GA  +      +  L++D++++
Sbjct: 9  VFGAVSGVGCAAFANGLRKLPPFSQPWKHAVAAGVGAAALMYNADLETSLKEDVERL 65


>gi|313228857|emb|CBY18008.1| unnamed protein product [Oikopleura dioica]
          Length = 1093

 Score = 35.8 bits (81), Expect = 3.0,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 31/59 (52%), Gaps = 4/59 (6%)

Query: 5   ATMIGALLGLGTQMYSN---ALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
           AT+ GA+LGLG + Y N    L  L YM  P+ H++ M   AVF+        Q  QDL
Sbjct: 354 ATLSGAILGLGLRGYLNNEPNLMYLSYMSGPFSHMIFMA-SAVFLGYCCNPKMQGTQDL 411


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.324    0.136    0.438 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,278,395,894
Number of Sequences: 23463169
Number of extensions: 41949846
Number of successful extensions: 138200
Number of sequences better than 100.0: 43
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 138157
Number of HSP's gapped (non-prelim): 44
length of query: 82
length of database: 8,064,228,071
effective HSP length: 53
effective length of query: 29
effective length of database: 6,820,680,114
effective search space: 197799723306
effective search space used: 197799723306
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 69 (31.2 bits)