BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 034827
(82 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449433457|ref|XP_004134514.1| PREDICTED: uncharacterized protein LOC101220487 isoform 2
[Cucumis sativus]
gi|449531986|ref|XP_004172966.1| PREDICTED: uncharacterized LOC101220487 isoform 2 [Cucumis
sativus]
Length = 81
Score = 152 bits (385), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 72/76 (94%), Positives = 76/76 (100%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHL+GMGLGAVFVNQLVKWDA+LQ+DL
Sbjct: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLVGMGLGAVFVNQLVKWDAKLQEDL 60
Query: 61 DKMLEKAKAANERRYF 76
DKML+KAKAANERRYF
Sbjct: 61 DKMLDKAKAANERRYF 76
>gi|255578511|ref|XP_002530119.1| conserved hypothetical protein [Ricinus communis]
gi|223530373|gb|EEF32263.1| conserved hypothetical protein [Ricinus communis]
Length = 81
Score = 145 bits (366), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 69/76 (90%), Positives = 72/76 (94%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
MA AT IGAL GLGTQMYSNALRKLPYMRHPWEH++GMGLGAVFVNQLVKWDAQLQQDL
Sbjct: 1 MALGATAIGALFGLGTQMYSNALRKLPYMRHPWEHVIGMGLGAVFVNQLVKWDAQLQQDL 60
Query: 61 DKMLEKAKAANERRYF 76
DKML+KAKAANERRYF
Sbjct: 61 DKMLDKAKAANERRYF 76
>gi|224053651|ref|XP_002297912.1| predicted protein [Populus trichocarpa]
gi|118484384|gb|ABK94069.1| unknown [Populus trichocarpa]
gi|222845170|gb|EEE82717.1| predicted protein [Populus trichocarpa]
Length = 81
Score = 144 bits (364), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 66/78 (84%), Positives = 74/78 (94%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M +SAT++GALLGLGTQMYSNALRKLPYMRHPWEH++GMGLG VFVNQLVKWDAQL+QDL
Sbjct: 1 MVFSATVVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGVVFVNQLVKWDAQLEQDL 60
Query: 61 DKMLEKAKAANERRYFGR 78
DKML+KAK ANERRYF +
Sbjct: 61 DKMLQKAKEANERRYFDQ 78
>gi|224075385|ref|XP_002304611.1| predicted protein [Populus trichocarpa]
gi|118485985|gb|ABK94837.1| unknown [Populus trichocarpa]
gi|222842043|gb|EEE79590.1| predicted protein [Populus trichocarpa]
Length = 81
Score = 142 bits (357), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 67/76 (88%), Positives = 71/76 (93%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M SAT+IGALLGLGTQMYSNALRKLPYMRHPWEH++GMGLG VFVNQLVKWDAQL+QDL
Sbjct: 1 MVLSATVIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGVVFVNQLVKWDAQLEQDL 60
Query: 61 DKMLEKAKAANERRYF 76
DKML KAK ANERRYF
Sbjct: 61 DKMLLKAKEANERRYF 76
>gi|356512239|ref|XP_003524828.1| PREDICTED: uncharacterized protein LOC100807917 [Glycine max]
gi|356525004|ref|XP_003531117.1| PREDICTED: uncharacterized protein LOC100778929 [Glycine max]
Length = 80
Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 65/73 (89%), Positives = 71/73 (97%)
Query: 4 SATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKM 63
SAT IGALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVFVNQL+KW+AQ++QDLDKM
Sbjct: 3 SATAIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFVNQLLKWEAQVEQDLDKM 62
Query: 64 LEKAKAANERRYF 76
LEKAKAANERRY
Sbjct: 63 LEKAKAANERRYI 75
>gi|2982432|emb|CAA18240.1| putative protein [Arabidopsis thaliana]
gi|7268810|emb|CAB79015.1| putative protein [Arabidopsis thaliana]
Length = 105
Score = 139 bits (350), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 64/77 (83%), Positives = 71/77 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M SATM+GALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVF NQLVKWD +L++DL
Sbjct: 1 MPISATMVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFANQLVKWDVKLKEDL 60
Query: 61 DKMLEKAKAANERRYFG 77
D ML KA+AANERRYFG
Sbjct: 61 DVMLAKARAANERRYFG 77
>gi|115472221|ref|NP_001059709.1| Os07g0498300 [Oryza sativa Japonica Group]
gi|33146542|dbj|BAC79719.1| unknown protein [Oryza sativa Japonica Group]
gi|50509990|dbj|BAD30453.1| unknown protein [Oryza sativa Japonica Group]
gi|113611245|dbj|BAF21623.1| Os07g0498300 [Oryza sativa Japonica Group]
gi|218199665|gb|EEC82092.1| hypothetical protein OsI_26096 [Oryza sativa Indica Group]
gi|222637090|gb|EEE67222.1| hypothetical protein OsJ_24342 [Oryza sativa Japonica Group]
Length = 81
Score = 137 bits (346), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 63/76 (82%), Positives = 71/76 (93%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVFVNQLVKWD +L++DL
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFVNQLVKWDEKLKEDL 60
Query: 61 DKMLEKAKAANERRYF 76
DKMLE+AK ANERRYF
Sbjct: 61 DKMLERAKQANERRYF 76
>gi|18415447|ref|NP_567596.1| uncharacterized protein [Arabidopsis thaliana]
gi|15028287|gb|AAK76620.1| unknown protein [Arabidopsis thaliana]
gi|19310709|gb|AAL85085.1| unknown protein [Arabidopsis thaliana]
gi|21555789|gb|AAM63934.1| unknown [Arabidopsis thaliana]
gi|332658880|gb|AEE84280.1| uncharacterized protein [Arabidopsis thaliana]
Length = 81
Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 63/76 (82%), Positives = 70/76 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M SATM+GALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVF NQLVKWD +L++DL
Sbjct: 1 MPISATMVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFANQLVKWDVKLKEDL 60
Query: 61 DKMLEKAKAANERRYF 76
D ML KA+AANERRYF
Sbjct: 61 DVMLAKARAANERRYF 76
>gi|297800010|ref|XP_002867889.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297313725|gb|EFH44148.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 81
Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 63/76 (82%), Positives = 70/76 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M SATM+GALLGLGTQMYSNALRKLPYMRHPWEH++GMGLGAVF NQLVKWD +L++DL
Sbjct: 1 MPLSATMVGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGAVFANQLVKWDVKLKEDL 60
Query: 61 DKMLEKAKAANERRYF 76
D ML KA+AANERRYF
Sbjct: 61 DVMLAKARAANERRYF 76
>gi|357122731|ref|XP_003563068.1| PREDICTED: uncharacterized protein LOC100836427 [Brachypodium
distachyon]
Length = 81
Score = 135 bits (341), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 63/76 (82%), Positives = 70/76 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLG VFV+QLVKWD +L+QDL
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGVVFVHQLVKWDEKLKQDL 60
Query: 61 DKMLEKAKAANERRYF 76
DKMLE+AK ANERRYF
Sbjct: 61 DKMLERAKQANERRYF 76
>gi|388517345|gb|AFK46734.1| unknown [Lotus japonicus]
Length = 81
Score = 135 bits (339), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 62/76 (81%), Positives = 70/76 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M SAT IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMG+G VFVNQL+KW+AQ+++DL
Sbjct: 1 MVLSATAIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGIGVVFVNQLLKWEAQVERDL 60
Query: 61 DKMLEKAKAANERRYF 76
D MLE+AKAANERRY
Sbjct: 61 DVMLERAKAANERRYI 76
>gi|212274813|ref|NP_001130978.1| uncharacterized protein LOC100192083 [Zea mays]
gi|194690614|gb|ACF79391.1| unknown [Zea mays]
gi|414886759|tpg|DAA62773.1| TPA: hypothetical protein ZEAMMB73_824701 [Zea mays]
Length = 100
Score = 134 bits (338), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 62/77 (80%), Positives = 71/77 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKFDEKLKEDL 60
Query: 61 DKMLEKAKAANERRYFG 77
DKMLE+A+ ANE+RY G
Sbjct: 61 DKMLERARLANEQRYIG 77
>gi|414590318|tpg|DAA40889.1| TPA: hypothetical protein ZEAMMB73_782543, partial [Zea mays]
Length = 121
Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 60/76 (78%), Positives = 70/76 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLGAVFVNQLVK+D +L++DL
Sbjct: 42 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGAVFVNQLVKFDEKLKEDL 101
Query: 61 DKMLEKAKAANERRYF 76
DKMLE+A+ ANE+RY
Sbjct: 102 DKMLERARLANEQRYI 117
>gi|449433455|ref|XP_004134513.1| PREDICTED: uncharacterized protein LOC101220487 isoform 1 [Cucumis
sativus]
gi|449531984|ref|XP_004172965.1| PREDICTED: uncharacterized LOC101220487 isoform 1 [Cucumis sativus]
Length = 116
Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/111 (63%), Positives = 75/111 (67%), Gaps = 35/111 (31%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRH----------------------------- 31
MAWSATMIGALLGLGTQMYSNALRKLPYMR
Sbjct: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRRTFQTLVFSFPHHNPSLLFRFRFRFLSSIP 60
Query: 32 ------PWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKMLEKAKAANERRYF 76
PWEHL+GMGLGAVFVNQLVKWDA+LQ+DLDKML+KAKAANERRYF
Sbjct: 61 SHNFADPWEHLVGMGLGAVFVNQLVKWDAKLQEDLDKMLDKAKAANERRYF 111
>gi|414590315|tpg|DAA40886.1| TPA: hypothetical protein ZEAMMB73_782543 [Zea mays]
Length = 163
Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 61/77 (79%), Positives = 71/77 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGAVFVNQLVKFDEKLKEDL 60
Query: 61 DKMLEKAKAANERRYFG 77
DKMLE+A+ ANE+RY G
Sbjct: 61 DKMLERARLANEQRYIG 77
>gi|414590316|tpg|DAA40887.1| TPA: hypothetical protein ZEAMMB73_782543 [Zea mays]
Length = 93
Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 60/77 (77%), Positives = 70/77 (90%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGAVFVNQLVKFDEKLKEDL 60
Query: 61 DKMLEKAKAANERRYFG 77
DKMLE+A+ ANE+RY
Sbjct: 61 DKMLERARLANEQRYIA 77
>gi|195621014|gb|ACG32337.1| hypothetical protein [Zea mays]
gi|195655033|gb|ACG46984.1| hypothetical protein [Zea mays]
gi|414886760|tpg|DAA62774.1| TPA: hypothetical protein ZEAMMB73_824701 [Zea mays]
Length = 80
Score = 132 bits (331), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 61/76 (80%), Positives = 70/76 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKFDEKLKEDL 60
Query: 61 DKMLEKAKAANERRYF 76
DKMLE+A+ ANE+RY
Sbjct: 61 DKMLERARLANEQRYI 76
>gi|242050286|ref|XP_002462887.1| hypothetical protein SORBIDRAFT_02g033820 [Sorghum bicolor]
gi|195621360|gb|ACG32510.1| hypothetical protein [Zea mays]
gi|223974885|gb|ACN31630.1| unknown [Zea mays]
gi|241926264|gb|EER99408.1| hypothetical protein SORBIDRAFT_02g033820 [Sorghum bicolor]
Length = 80
Score = 131 bits (329), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 60/76 (78%), Positives = 70/76 (92%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLGAVFVNQLVK+D +L++DL
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGAVFVNQLVKFDEKLKEDL 60
Query: 61 DKMLEKAKAANERRYF 76
DKMLE+A+ ANE+RY
Sbjct: 61 DKMLERARLANEQRYI 76
>gi|326517918|dbj|BAK07211.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519622|dbj|BAK00184.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 81
Score = 130 bits (327), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 60/76 (78%), Positives = 69/76 (90%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+IGALLGLGTQMYSNALRKLPYMRHPWEH+LGMGLG VFVNQLVK+D +++ DL
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMRHPWEHVLGMGLGVVFVNQLVKFDEKVKVDL 60
Query: 61 DKMLEKAKAANERRYF 76
DKMLE+AK ANE+RYF
Sbjct: 61 DKMLERAKHANEQRYF 76
>gi|297738473|emb|CBI27674.3| unnamed protein product [Vitis vinifera]
Length = 139
Score = 122 bits (307), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 54/76 (71%), Positives = 68/76 (89%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M SAT++GA+LGL Q++SNALRKLP MRHPWEH+L +G+GAVF NQLVKW+A++++DL
Sbjct: 59 MPLSATVVGAVLGLSVQLHSNALRKLPLMRHPWEHVLAIGIGAVFGNQLVKWEAKVEEDL 118
Query: 61 DKMLEKAKAANERRYF 76
DK+LEKAKAANERRY
Sbjct: 119 DKLLEKAKAANERRYI 134
>gi|225425382|ref|XP_002276698.1| PREDICTED: uncharacterized protein LOC100257379 [Vitis vinifera]
Length = 81
Score = 122 bits (306), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 54/76 (71%), Positives = 68/76 (89%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M SAT++GA+LGL Q++SNALRKLP MRHPWEH+L +G+GAVF NQLVKW+A++++DL
Sbjct: 1 MPLSATVVGAVLGLSVQLHSNALRKLPLMRHPWEHVLAIGIGAVFGNQLVKWEAKVEEDL 60
Query: 61 DKMLEKAKAANERRYF 76
DK+LEKAKAANERRY
Sbjct: 61 DKLLEKAKAANERRYI 76
>gi|116779889|gb|ABK21463.1| unknown [Picea sitchensis]
Length = 93
Score = 107 bits (268), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 49/77 (63%), Positives = 61/77 (79%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M T+ GA LGL TQ+ SN +RKLP MRHPWEH++GMGLGA+F NQL++W+A+LQ+DL
Sbjct: 1 MVTGVTLAGACLGLTTQLLSNGMRKLPLMRHPWEHVIGMGLGALFTNQLIQWEAKLQEDL 60
Query: 61 DKMLEKAKAANERRYFG 77
DK L +AKAAN RR G
Sbjct: 61 DKKLGEAKAANRRRLSG 77
>gi|168012964|ref|XP_001759171.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689484|gb|EDQ75855.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 84
Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 44/77 (57%), Positives = 58/77 (75%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M +ATM+GA++GL Q YSNA+RKLP MRHPWEH+L +GLGA F N + W+ +LQ+DL
Sbjct: 1 MVATATMVGAIMGLSCQFYSNAVRKLPLMRHPWEHVLAIGLGAGFGNAVASWEVRLQEDL 60
Query: 61 DKMLEKAKAANERRYFG 77
DK L + + AN +RY G
Sbjct: 61 DKKLSEVQTANNKRYIG 77
>gi|168021117|ref|XP_001763088.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685571|gb|EDQ71965.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 76
Score = 101 bits (251), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 44/76 (57%), Positives = 58/76 (76%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M +ATM+GAL+GL Q YSN +RKLP MRHPWEHLL +GLGA F N + W+ +LQ+DL
Sbjct: 1 MVATATMVGALMGLSCQFYSNGVRKLPLMRHPWEHLLAIGLGAGFGNAVASWEVRLQEDL 60
Query: 61 DKMLEKAKAANERRYF 76
DK L++ ++AN +RY
Sbjct: 61 DKKLQEVQSANNKRYI 76
>gi|330318746|gb|AEC11033.1| hypothetical protein [Camellia sinensis]
Length = 45
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/45 (84%), Positives = 42/45 (93%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVF 45
M ++TMIGALLGLGTQMYSNALRKLPYMRHPWEH++GMGLG VF
Sbjct: 1 MPATSTMIGALLGLGTQMYSNALRKLPYMRHPWEHVVGMGLGVVF 45
>gi|302781698|ref|XP_002972623.1| hypothetical protein SELMODRAFT_97190 [Selaginella
moellendorffii]
gi|300160090|gb|EFJ26709.1| hypothetical protein SELMODRAFT_97190 [Selaginella
moellendorffii]
Length = 83
Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 32/62 (51%), Positives = 45/62 (72%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M + T+ GA+LGL Q+Y N +RKLP MRHPWEH+L MG+GA +N+LV W+ + + L
Sbjct: 1 MVSTPTVAGAVLGLAIQLYCNGMRKLPLMRHPWEHVLFMGIGAYAINELVAWEYRTEAAL 60
Query: 61 DK 62
+K
Sbjct: 61 EK 62
>gi|297738183|emb|CBI27384.3| unnamed protein product [Vitis vinifera]
Length = 58
Score = 75.1 bits (183), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 34/51 (66%), Positives = 43/51 (84%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVK 51
M+ AT++GA+LGL Q++SNALRKL MRHPWEH+L +G+GAVF NQLVK
Sbjct: 8 MSLGATVVGAVLGLSVQLHSNALRKLLLMRHPWEHVLAIGIGAVFGNQLVK 58
>gi|255088009|ref|XP_002505927.1| hypothetical protein MICPUN_109457 [Micromonas sp. RCC299]
gi|226521198|gb|ACO67185.1| hypothetical protein MICPUN_109457 [Micromonas sp. RCC299]
Length = 80
Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 50/68 (73%)
Query: 4 SATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKM 63
SAT+IGA LG G Q+Y NALRKLP +R+PWEH+ G+GA N +W+A+L++++ ++
Sbjct: 2 SATLIGASLGFGMQLYINALRKLPLLRNPWEHVFWTGVGAAAANMNAEWEAKLRKEVAEL 61
Query: 64 LEKAKAAN 71
++ +A N
Sbjct: 62 NKERQATN 69
>gi|307103562|gb|EFN51821.1| hypothetical protein CHLNCDRAFT_139659 [Chlorella variabilis]
Length = 76
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 44/71 (61%)
Query: 2 AWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLD 61
A S GA LG+G Q+Y NA+RKLP MR PW H+L G G F LV ++ + ++DL
Sbjct: 3 ALSIPAFGAALGVGVQLYVNAVRKLPLMRDPWLHVLWGGAGYSFGTWLVGFERRTEKDLA 62
Query: 62 KMLEKAKAANE 72
+ML K AN+
Sbjct: 63 EMLRKRDEANK 73
>gi|300123543|emb|CBK24815.2| unnamed protein product [Blastocystis hominis]
Length = 81
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 41/64 (64%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M++ A + G +GL T++YSN+LRKLPYMR PWEHL+ MG+G K+ + + L
Sbjct: 1 MSFIAGLTGCCIGLLTKLYSNSLRKLPYMREPWEHLIYMGVGGFSFYYAKKFIDREKNSL 60
Query: 61 DKML 64
ML
Sbjct: 61 RTML 64
>gi|303275860|ref|XP_003057224.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461576|gb|EEH58869.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 85
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 40/60 (66%)
Query: 14 LGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKMLEKAKAANER 73
GTQ++ NALRKLP +R+PWEH+L G G N + +W+ +L +++ M + +AAN R
Sbjct: 16 FGTQLFINALRKLPLLRNPWEHVLWTGGGVWAANAMAEWEVKLTEEVKAMQAERQAANAR 75
>gi|348680338|gb|EGZ20154.1| hypothetical protein PHYSODRAFT_285374 [Phytophthora sojae]
Length = 77
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGA---VFVNQLVKWDAQLQ 57
M ++ IG L+GL T++ SN L+K+PYMRHPWEH+L MG+GA ++ D +
Sbjct: 1 MGFTGIAIGTLMGLTTKIGSNILQKVPYMRHPWEHVLFMGVGAGLGGYLQNKYHRDLEEV 60
Query: 58 QDLDKMLEKAKAANER 73
++L LE+ N++
Sbjct: 61 EELRLYLERRPDVNKK 76
>gi|414590317|tpg|DAA40888.1| TPA: hypothetical protein ZEAMMB73_782543, partial [Zea mays]
Length = 75
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/31 (83%), Positives = 28/31 (90%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRH 31
M + T+IGALLGLGTQMYSNALRKLPYMRH
Sbjct: 42 MPATPTIIGALLGLGTQMYSNALRKLPYMRH 72
>gi|449017105|dbj|BAM80507.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 70
Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 9/73 (12%)
Query: 2 AWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLD 61
AWS GAL GL Q+ SNAL+KLP MR PWEH+L GA ++L + +L Q ++
Sbjct: 4 AWS----GALFGLAVQLMSNALQKLPLMRRPWEHVLFAAGGAFVGHKLGDLEVELTQRIE 59
Query: 62 KMLEKAKAANERR 74
A AN++R
Sbjct: 60 -----AHRANQKR 67
>gi|414590314|tpg|DAA40885.1| TPA: hypothetical protein ZEAMMB73_782543 [Zea mays]
Length = 92
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/30 (83%), Positives = 27/30 (90%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMR 30
M + T+IGALLGLGTQMYSNALRKLPYMR
Sbjct: 1 MPATPTIIGALLGLGTQMYSNALRKLPYMR 30
>gi|66270195|gb|AAY43427.1| unknown protein [Phytophthora infestans]
Length = 66
Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 42/65 (64%), Gaps = 3/65 (4%)
Query: 12 LGLGTQMYSNALRKLPYMRHPWEHLLGMGLGA---VFVNQLVKWDAQLQQDLDKMLEKAK 68
+GL T++ SN L+K+PYMRHPWEH+L MG+GA ++ D + ++L LE+ +
Sbjct: 1 MGLSTKLGSNVLQKVPYMRHPWEHVLFMGIGAGLGGYLQNKYHRDLEEVEELRLYLERRQ 60
Query: 69 AANER 73
N++
Sbjct: 61 DVNKK 65
>gi|428172889|gb|EKX41795.1| hypothetical protein GUITHDRAFT_153752 [Guillardia theta
CCMP2712]
Length = 75
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 40/70 (57%)
Query: 7 MIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKMLEK 66
++GA G+ ++YSN+LR++ +R PWEH L G+GA + W Q ++ L++
Sbjct: 6 LMGATFGVLVKLYSNSLRRVSLLRQPWEHFLLAGIGAYSFQCIAVWTEQNIARYEEKLKE 65
Query: 67 AKAANERRYF 76
K + +Y+
Sbjct: 66 FKEGKQVKYY 75
>gi|299469849|emb|CBN76703.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 88
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M +A +G G+ + N L K+ R PWE++LG G+GA N++ +W+A L D+
Sbjct: 1 MGITAMTVGGAFGIVAKGTVNRLMKVSLRRAPWEYVLGAGIGAYVGNKIPQWEATLISDI 60
Query: 61 DKM 63
++M
Sbjct: 61 NEM 63
>gi|452821647|gb|EME28675.1| hypothetical protein Gasu_38830 [Galdieria sulphuraria]
Length = 91
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 24/38 (63%)
Query: 8 IGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVF 45
+GA +GL Q+ +N R LP R PWEHL MGLGA
Sbjct: 6 VGAAIGLAAQIVTNGARLLPLSRRPWEHLAAMGLGAYI 43
>gi|397566851|gb|EJK45250.1| hypothetical protein THAOC_36140 [Thalassiosira oceanica]
Length = 221
Score = 43.5 bits (101), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 35/63 (55%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
M +A +GA G G M SNA++K+P R PW H+ LG + V+ + +L D+
Sbjct: 136 MGLTAFFVGAFSGTGMHMMSNAIQKVPLSRRPWMHVGCFFLGGYVGQRWVRLEKELVLDI 195
Query: 61 DKM 63
+++
Sbjct: 196 NEI 198
>gi|325189051|emb|CCA23579.1| unknown protein putative [Albugo laibachii Nc14]
Length = 121
Score = 42.0 bits (97), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 1 MAWSATMIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAV---FVNQLVKWDAQLQ 57
M + G++ G T+ SN LRK+P M+ WE++L +G GA ++ + D +
Sbjct: 44 MGLGGIVTGSIFGFSTKFGSNVLRKVPLMQESWEYVLLIGAGAFAGSYLEKKYYRDVEEV 103
Query: 58 QDLDKMLEKAKAANER 73
+++ + LE+ + +
Sbjct: 104 EEIRQFLEQKPLTDRK 119
>gi|145350242|ref|XP_001419522.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579754|gb|ABO97815.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 84
Score = 37.7 bits (86), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 34/57 (59%)
Query: 7 MIGALLGLGTQMYSNALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDLDKM 63
+ GA+ G+G ++N LRKLP PW+H + G+GA + + L++D++++
Sbjct: 9 VFGAVSGVGCAAFANGLRKLPPFSQPWKHAVAAGVGAAALMYNADLETSLKEDVERL 65
>gi|313228857|emb|CBY18008.1| unnamed protein product [Oikopleura dioica]
Length = 1093
Score = 35.8 bits (81), Expect = 3.0, Method: Composition-based stats.
Identities = 24/59 (40%), Positives = 31/59 (52%), Gaps = 4/59 (6%)
Query: 5 ATMIGALLGLGTQMYSN---ALRKLPYMRHPWEHLLGMGLGAVFVNQLVKWDAQLQQDL 60
AT+ GA+LGLG + Y N L L YM P+ H++ M AVF+ Q QDL
Sbjct: 354 ATLSGAILGLGLRGYLNNEPNLMYLSYMSGPFSHMIFMA-SAVFLGYCCNPKMQGTQDL 411
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.324 0.136 0.438
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,278,395,894
Number of Sequences: 23463169
Number of extensions: 41949846
Number of successful extensions: 138200
Number of sequences better than 100.0: 43
Number of HSP's better than 100.0 without gapping: 42
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 138157
Number of HSP's gapped (non-prelim): 44
length of query: 82
length of database: 8,064,228,071
effective HSP length: 53
effective length of query: 29
effective length of database: 6,820,680,114
effective search space: 197799723306
effective search space used: 197799723306
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 69 (31.2 bits)