BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 034231
(100 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225452332|ref|XP_002273651.1| PREDICTED: uncharacterized protein LOC100262706 [Vitis vinifera]
gi|296087605|emb|CBI34861.3| unnamed protein product [Vitis vinifera]
Length = 110
Score = 174 bits (441), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 83/99 (83%), Positives = 91/99 (91%), Gaps = 4/99 (4%)
Query: 2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
L++ CRKK+R+RDH HPYKV+EITPPPK LG+RCFP NLQCGESVTIEGQAYTISAV
Sbjct: 16 LQIFCRKKDRERDH----HPYKVIEITPPPKNLGVRCFPSNLQCGESVTIEGQAYTISAV 71
Query: 62 THRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
THRYQLRKGKYEPSEKRLDVLS+ RYILNLYLENLL+QS
Sbjct: 72 THRYQLRKGKYEPSEKRLDVLSTGRYILNLYLENLLQQS 110
>gi|242076274|ref|XP_002448073.1| hypothetical protein SORBIDRAFT_06g020530 [Sorghum bicolor]
gi|241939256|gb|EES12401.1| hypothetical protein SORBIDRAFT_06g020530 [Sorghum bicolor]
Length = 120
Score = 169 bits (428), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/105 (76%), Positives = 92/105 (87%), Gaps = 6/105 (5%)
Query: 2 LKVVCRK------KERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQA 55
L VVCRK +ER+RD K + HP+KVVEITPPP+CLG+RCFP N+ CGESVTIEGQA
Sbjct: 16 LAVVCRKNKDGRDRERERDGKEHKHPFKVVEITPPPRCLGVRCFPTNIHCGESVTIEGQA 75
Query: 56 YTISAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
YT+SAVTHRYQLRKG+YEPSEKRLDVLS+ RYILNLYL+NLL+QS
Sbjct: 76 YTVSAVTHRYQLRKGRYEPSEKRLDVLSTGRYILNLYLQNLLDQS 120
>gi|259490454|ref|NP_001158949.1| uncharacterized protein LOC100303883 [Zea mays]
gi|195610298|gb|ACG26979.1| hypothetical protein [Zea mays]
gi|413918697|gb|AFW58629.1| hypothetical protein ZEAMMB73_224163 [Zea mays]
Length = 118
Score = 168 bits (425), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 78/103 (75%), Positives = 91/103 (88%), Gaps = 4/103 (3%)
Query: 2 LKVVCRK----KERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYT 57
VVCRK ++R+RD K + HP+KVVEITPPP+CLG+RCFP N+ CGESVTIEGQAYT
Sbjct: 16 FAVVCRKNRDGRDRERDGKEHKHPFKVVEITPPPRCLGVRCFPTNIHCGESVTIEGQAYT 75
Query: 58 ISAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
+SAVTHRYQLRKG+YEPSEKRLDVLS+ RYILNLYL+NLL+QS
Sbjct: 76 VSAVTHRYQLRKGRYEPSEKRLDVLSTGRYILNLYLQNLLDQS 118
>gi|326518965|dbj|BAJ92643.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 173
Score = 166 bits (420), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 77/102 (75%), Positives = 92/102 (90%), Gaps = 3/102 (2%)
Query: 2 LKVVCRKKERDRD---HKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTI 58
L V+CRKK+RDR+ K + HP+KVVEITPPP+CLG+RCFP N+ CGESVTIEGQAYT+
Sbjct: 72 LSVLCRKKDRDRELEREKEHKHPFKVVEITPPPRCLGVRCFPTNIHCGESVTIEGQAYTV 131
Query: 59 SAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
SAVTHRYQLRKG+YEPSEKRLDVLS+ RYILNLYL++LL++S
Sbjct: 132 SAVTHRYQLRKGRYEPSEKRLDVLSTGRYILNLYLDSLLDKS 173
>gi|326508454|dbj|BAJ99494.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 157
Score = 166 bits (420), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 77/102 (75%), Positives = 92/102 (90%), Gaps = 3/102 (2%)
Query: 2 LKVVCRKKERDRD---HKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTI 58
L V+CRKK+RDR+ K + HP+KVVEITPPP+CLG+RCFP N+ CGESVTIEGQAYT+
Sbjct: 56 LSVLCRKKDRDRELEREKEHKHPFKVVEITPPPRCLGVRCFPTNIHCGESVTIEGQAYTV 115
Query: 59 SAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
SAVTHRYQLRKG+YEPSEKRLDVLS+ RYILNLYL++LL++S
Sbjct: 116 SAVTHRYQLRKGRYEPSEKRLDVLSTGRYILNLYLDSLLDKS 157
>gi|356558849|ref|XP_003547715.1| PREDICTED: uncharacterized protein LOC100812628 [Glycine max]
Length = 115
Score = 166 bits (419), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 81/100 (81%), Positives = 91/100 (91%), Gaps = 2/100 (2%)
Query: 2 LKVVC-RKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISA 60
+V C RK++RDR+ + +I PYKVVEITPPPK LG+RC PPNLQCGESVTIEGQAYTISA
Sbjct: 17 FEVSCSRKRDRDRE-RGSILPYKVVEITPPPKSLGVRCLPPNLQCGESVTIEGQAYTISA 75
Query: 61 VTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
VTHRYQLRKGKYEPSEKRLDVLS+ RY++NLYLENLLEQS
Sbjct: 76 VTHRYQLRKGKYEPSEKRLDVLSTGRYLVNLYLENLLEQS 115
>gi|351723319|ref|NP_001235995.1| uncharacterized protein LOC100305736 [Glycine max]
gi|255626475|gb|ACU13582.1| unknown [Glycine max]
Length = 115
Score = 164 bits (416), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 80/100 (80%), Positives = 91/100 (91%), Gaps = 2/100 (2%)
Query: 2 LKVVC-RKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISA 60
+V C RK++RDR+ + +I PYKVVEITPPPK LG+RC PPNLQCGE+VTIEGQAYTISA
Sbjct: 17 FEVSCNRKRDRDRE-RGSILPYKVVEITPPPKSLGVRCLPPNLQCGENVTIEGQAYTISA 75
Query: 61 VTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
VTHRYQLRKGKYEPSEKRLDVLS+ RY++NLYLENLLEQS
Sbjct: 76 VTHRYQLRKGKYEPSEKRLDVLSTGRYLVNLYLENLLEQS 115
>gi|194698056|gb|ACF83112.1| unknown [Zea mays]
Length = 261
Score = 162 bits (409), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 72/88 (81%), Positives = 82/88 (93%)
Query: 13 RDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKY 72
RD K + HP+KVVEITPPP+CLG+RCFP N+ CGESVTIEGQAYT+SAVTHRYQLRKG+Y
Sbjct: 174 RDGKEHKHPFKVVEITPPPRCLGVRCFPTNIHCGESVTIEGQAYTVSAVTHRYQLRKGRY 233
Query: 73 EPSEKRLDVLSSSRYILNLYLENLLEQS 100
EPSEKRLDVLS+ RYILNLYL+NLL+QS
Sbjct: 234 EPSEKRLDVLSTGRYILNLYLQNLLDQS 261
>gi|22327475|ref|NP_198866.2| uncharacterized protein [Arabidopsis thaliana]
gi|17644175|gb|AAL38785.1| unknown protein [Arabidopsis thaliana]
gi|21436195|gb|AAM51385.1| unknown protein [Arabidopsis thaliana]
gi|332007171|gb|AED94554.1| uncharacterized protein [Arabidopsis thaliana]
Length = 123
Score = 157 bits (396), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 73/99 (73%), Positives = 86/99 (86%), Gaps = 2/99 (2%)
Query: 2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
L++ CRKKE+ RD + +PYKV+EITPPPK LGIRC P NLQCGE+V IEGQ YTISAV
Sbjct: 27 LQINCRKKEKGRDQSH--YPYKVIEITPPPKSLGIRCLPHNLQCGENVMIEGQTYTISAV 84
Query: 62 THRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
THRYQLRKGKYEPSE+RLDVLS++RY+LNLY +NLL+ S
Sbjct: 85 THRYQLRKGKYEPSERRLDVLSAARYVLNLYFDNLLQNS 123
>gi|145334677|ref|NP_001078684.1| uncharacterized protein [Arabidopsis thaliana]
gi|332007172|gb|AED94555.1| uncharacterized protein [Arabidopsis thaliana]
Length = 114
Score = 155 bits (393), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 73/99 (73%), Positives = 86/99 (86%), Gaps = 2/99 (2%)
Query: 2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
L++ CRKKE+ RD + +PYKV+EITPPPK LGIRC P NLQCGE+V IEGQ YTISAV
Sbjct: 18 LQINCRKKEKGRDQSH--YPYKVIEITPPPKSLGIRCLPHNLQCGENVMIEGQTYTISAV 75
Query: 62 THRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
THRYQLRKGKYEPSE+RLDVLS++RY+LNLY +NLL+ S
Sbjct: 76 THRYQLRKGKYEPSERRLDVLSAARYVLNLYFDNLLQNS 114
>gi|297805628|ref|XP_002870698.1| hypothetical protein ARALYDRAFT_355933 [Arabidopsis lyrata subsp.
lyrata]
gi|297316534|gb|EFH46957.1| hypothetical protein ARALYDRAFT_355933 [Arabidopsis lyrata subsp.
lyrata]
Length = 115
Score = 155 bits (393), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 74/99 (74%), Positives = 87/99 (87%), Gaps = 1/99 (1%)
Query: 2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
L++ CRKKE+ RD ++N PYKV+EITPPPK LGIRC P NLQCGE+V IEGQ YTISAV
Sbjct: 18 LEINCRKKEKGRD-QSNYPPYKVIEITPPPKSLGIRCLPHNLQCGENVMIEGQTYTISAV 76
Query: 62 THRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
THRYQLRKGKYEPSE+RLDVLS++RY+LNLY +NLL+ S
Sbjct: 77 THRYQLRKGKYEPSERRLDVLSAARYVLNLYFDNLLQNS 115
>gi|224140871|ref|XP_002323802.1| predicted protein [Populus trichocarpa]
gi|222866804|gb|EEF03935.1| predicted protein [Populus trichocarpa]
Length = 112
Score = 151 bits (382), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 72/87 (82%), Positives = 78/87 (89%), Gaps = 3/87 (3%)
Query: 2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
L V CRKKERDRD +PYKV+E+TPPPK LGIRCFPPNLQCGESVTIEGQAYTIS+V
Sbjct: 17 LDVTCRKKERDRDRS---YPYKVIEVTPPPKNLGIRCFPPNLQCGESVTIEGQAYTISSV 73
Query: 62 THRYQLRKGKYEPSEKRLDVLSSSRYI 88
THRYQLR+GKYEPSEKRLDVLS+ RYI
Sbjct: 74 THRYQLRRGKYEPSEKRLDVLSTGRYI 100
>gi|224055565|ref|XP_002298542.1| predicted protein [Populus trichocarpa]
gi|222845800|gb|EEE83347.1| predicted protein [Populus trichocarpa]
Length = 96
Score = 151 bits (382), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 74/87 (85%), Positives = 78/87 (89%), Gaps = 3/87 (3%)
Query: 2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
+ V CRKKERDRDH +PYKV+EITPPPK LGIRCFP NLQCGESVTIEGQAYTISAV
Sbjct: 1 VHVTCRKKERDRDHS---YPYKVIEITPPPKNLGIRCFPTNLQCGESVTIEGQAYTISAV 57
Query: 62 THRYQLRKGKYEPSEKRLDVLSSSRYI 88
THRYQLRKGKYEPSEKRLDVLS+ RYI
Sbjct: 58 THRYQLRKGKYEPSEKRLDVLSTGRYI 84
>gi|449450072|ref|XP_004142788.1| PREDICTED: uncharacterized protein LOC101221627 [Cucumis sativus]
gi|449522638|ref|XP_004168333.1| PREDICTED: uncharacterized LOC101221627 [Cucumis sativus]
Length = 115
Score = 144 bits (363), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 69/87 (79%), Positives = 76/87 (87%), Gaps = 3/87 (3%)
Query: 2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
++ CR+KE++R NN PYKV+EITPPPK LGIRCFPPNLQCGESVTIEGQ YTISAV
Sbjct: 20 FQISCRRKEKER---NNFDPYKVIEITPPPKNLGIRCFPPNLQCGESVTIEGQTYTISAV 76
Query: 62 THRYQLRKGKYEPSEKRLDVLSSSRYI 88
T RYQLRKGKYEPSEKRLDVLS+ RYI
Sbjct: 77 TLRYQLRKGKYEPSEKRLDVLSTGRYI 103
>gi|116782519|gb|ABK22538.1| unknown [Picea sitchensis]
Length = 147
Score = 139 bits (350), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 66/100 (66%), Positives = 84/100 (84%), Gaps = 4/100 (4%)
Query: 1 MLKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISA 60
+++V+CRK+ER+ D PYKV++ITPPP+ LGIRC P N+QCGESVTIE +AY +SA
Sbjct: 52 VVRVLCRKRERNWDRD----PYKVIDITPPPRNLGIRCLPSNMQCGESVTIEDKAYIVSA 107
Query: 61 VTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
VT+RY+LRKGKYEPSEKRLDV S+ RY+LN YLE LL++S
Sbjct: 108 VTYRYELRKGKYEPSEKRLDVQSTGRYLLNNYLEMLLKES 147
>gi|148907154|gb|ABR16720.1| unknown [Picea sitchensis]
Length = 147
Score = 138 bits (347), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 65/100 (65%), Positives = 84/100 (84%), Gaps = 4/100 (4%)
Query: 1 MLKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISA 60
+++V+CRK+ER+ D PYKV++ITPPP+ LGIRC P N+QCGESVTIE +AY +SA
Sbjct: 52 VVRVLCRKRERNWDRD----PYKVIDITPPPRNLGIRCLPSNMQCGESVTIEDKAYIVSA 107
Query: 61 VTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
VT+RY+LRKGKYEPSEKRLD+ S+ RY+LN YLE LL++S
Sbjct: 108 VTYRYELRKGKYEPSEKRLDLQSTGRYLLNNYLEMLLKES 147
>gi|115459040|ref|NP_001053120.1| Os04g0482900 [Oryza sativa Japonica Group]
gi|32488468|emb|CAE03139.1| OSJNBa0081L15.1 [Oryza sativa Japonica Group]
gi|38345260|emb|CAD41104.2| OSJNBb0011N17.21 [Oryza sativa Japonica Group]
gi|113564691|dbj|BAF15034.1| Os04g0482900 [Oryza sativa Japonica Group]
gi|116309940|emb|CAH66972.1| H0525D09.12 [Oryza sativa Indica Group]
gi|125590779|gb|EAZ31129.1| hypothetical protein OsJ_15226 [Oryza sativa Japonica Group]
gi|215712338|dbj|BAG94465.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 125
Score = 128 bits (321), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 56/67 (83%), Positives = 64/67 (95%)
Query: 20 HPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRL 79
+P+KVVEITPPP+CLG+RCFP N+ CGESVTIEGQAYT+SAVTHRYQLRKG+YEPSEKRL
Sbjct: 45 YPFKVVEITPPPRCLGVRCFPTNIHCGESVTIEGQAYTVSAVTHRYQLRKGRYEPSEKRL 104
Query: 80 DVLSSSR 86
DVLS+ R
Sbjct: 105 DVLSTGR 111
>gi|218195073|gb|EEC77500.1| hypothetical protein OsI_16358 [Oryza sativa Indica Group]
Length = 114
Score = 127 bits (319), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 56/67 (83%), Positives = 64/67 (95%)
Query: 20 HPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRL 79
+P+KVVEITPPP+CLG+RCFP N+ CGESVTIEGQAYT+SAVTHRYQLRKG+YEPSEKRL
Sbjct: 34 YPFKVVEITPPPRCLGVRCFPTNIHCGESVTIEGQAYTVSAVTHRYQLRKGRYEPSEKRL 93
Query: 80 DVLSSSR 86
DVLS+ R
Sbjct: 94 DVLSTGR 100
>gi|168016928|ref|XP_001761000.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687686|gb|EDQ74067.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 95
Score = 117 bits (292), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/95 (56%), Positives = 70/95 (73%), Gaps = 1/95 (1%)
Query: 7 RKKER-DRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRY 65
+K+ER +R + PY V ITPPP+ LGI PPN QCGE+V ++G+ Y +S VT+RY
Sbjct: 1 KKRERGERGDRETFAPYYVTVITPPPRNLGIHSLPPNTQCGETVEVKGEPYVVSGVTYRY 60
Query: 66 QLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
QLR+GKY+PSEKRLDV S RY +NLY +NLL+ S
Sbjct: 61 QLRRGKYQPSEKRLDVQSLGRYFVNLYFDNLLDTS 95
>gi|302753714|ref|XP_002960281.1| hypothetical protein SELMODRAFT_74528 [Selaginella moellendorffii]
gi|302768022|ref|XP_002967431.1| hypothetical protein SELMODRAFT_87007 [Selaginella moellendorffii]
gi|300165422|gb|EFJ32030.1| hypothetical protein SELMODRAFT_87007 [Selaginella moellendorffii]
gi|300171220|gb|EFJ37820.1| hypothetical protein SELMODRAFT_74528 [Selaginella moellendorffii]
Length = 92
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/92 (54%), Positives = 68/92 (73%), Gaps = 1/92 (1%)
Query: 9 KERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLR 68
K RDR N+ H Y+V +TPPPK LGI C P N QCGE+VT+ G++Y +S V ++YQL+
Sbjct: 2 KRRDRADTNS-HHYRVTLVTPPPKNLGIHCLPSNTQCGETVTVSGESYIVSGVVYQYQLK 60
Query: 69 KGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
KG+Y PS K+L+V + RYILN+YL++LL S
Sbjct: 61 KGRYAPSAKKLEVQPTGRYILNMYLDSLLPDS 92
>gi|159488709|ref|XP_001702345.1| hypothetical protein CHLREDRAFT_154041 [Chlamydomonas reinhardtii]
gi|158271139|gb|EDO96965.1| predicted protein [Chlamydomonas reinhardtii]
Length = 152
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 52/77 (67%)
Query: 22 YKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDV 81
Y+V +TPPP+ LGI PPN QCGE + +EGQ Y ++ V +Y+LR GKY RLDV
Sbjct: 65 YEVQVVTPPPRSLGIYALPPNTQCGEEIDVEGQGYVVTTVVLQYKLRGGKYVRHHNRLDV 124
Query: 82 LSSSRYILNLYLENLLE 98
+ R+++N LENL++
Sbjct: 125 QPTGRWLVNQMLENLIK 141
>gi|302832942|ref|XP_002948035.1| hypothetical protein VOLCADRAFT_57739 [Volvox carteri f.
nagariensis]
gi|300266837|gb|EFJ51023.1| hypothetical protein VOLCADRAFT_57739 [Volvox carteri f.
nagariensis]
Length = 142
Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 52/77 (67%)
Query: 22 YKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDV 81
Y+V +TPPP+ LGI PPN QCGE + +EG +Y ++ V +Y+LR GKY RLDV
Sbjct: 55 YEVRVVTPPPRSLGIYALPPNTQCGEEIDVEGASYVVTTVVLQYKLRGGKYVRDHNRLDV 114
Query: 82 LSSSRYILNLYLENLLE 98
+ R+++N LE+L++
Sbjct: 115 QPTGRWLVNQMLEDLIK 131
>gi|307108635|gb|EFN56875.1| hypothetical protein CHLNCDRAFT_51665 [Chlorella variabilis]
Length = 170
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 49/77 (63%)
Query: 22 YKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDV 81
++V +++PP + LGI PPN G+ V I G +Y + +V +++L +GKY RL+V
Sbjct: 70 FEVEDMSPPSRSLGIHALPPNTHNGDQVEIAGSSYVVQSVVLQFKLVRGKYRRDHARLEV 129
Query: 82 LSSSRYILNLYLENLLE 98
+ R++ NLYLEN+ +
Sbjct: 130 QGTGRWMANLYLENVFQ 146
>gi|303280407|ref|XP_003059496.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459332|gb|EEH56628.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 79
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 50/79 (63%)
Query: 22 YKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDV 81
Y+V TPP + LG FP N CG+++ + + + + V + Y+L +GKY + RL V
Sbjct: 1 YEVKVTTPPERSLGTHRFPANTHCGDTIELRNRYFVVDKVAYHYKLERGKYRKDDSRLYV 60
Query: 82 LSSSRYILNLYLENLLEQS 100
++R++LN +L++LLE+S
Sbjct: 61 QEATRFLLNKHLDSLLEKS 79
>gi|384254141|gb|EIE27615.1| hypothetical protein COCSUDRAFT_52220 [Coccomyxa subellipsoidea
C-169]
Length = 135
Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 52/79 (65%)
Query: 22 YKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDV 81
++V +I PPP+ LGI P + G+ +T+E + + ++ + +Y+L G+Y+ RL+V
Sbjct: 56 FEVEKINPPPQSLGIHQLPVDTHNGDQITVEDEDFVVTGLVLKYKLVGGRYQRDHNRLEV 115
Query: 82 LSSSRYILNLYLENLLEQS 100
+SRY +NLY ++LLE+S
Sbjct: 116 QQTSRYFVNLYYDSLLEKS 134
>gi|308809083|ref|XP_003081851.1| unnamed protein product [Ostreococcus tauri]
gi|116060318|emb|CAL55654.1| unnamed protein product [Ostreococcus tauri]
Length = 429
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 29/80 (36%), Positives = 46/80 (57%)
Query: 16 KNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPS 75
K + P+ V TPPP+ LG+ P N CGE++ + + ++ VT Y L +GKY
Sbjct: 23 KTKLVPFAVRVATPPPRELGVHKLPKNTTCGETIEVRDGWFIVNRVTTMYSLERGKYRRD 82
Query: 76 EKRLDVLSSSRYILNLYLEN 95
+R++V S+ RY +N LE+
Sbjct: 83 GQRVEVESAERYFVNASLES 102
>gi|413918696|gb|AFW58628.1| hypothetical protein ZEAMMB73_224163 [Zea mays]
Length = 131
Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 34/43 (79%), Gaps = 4/43 (9%)
Query: 2 LKVVCRK----KERDRDHKNNIHPYKVVEITPPPKCLGIRCFP 40
VVCRK ++R+RD K + HP+KVVEITPPP+CLG+RCFP
Sbjct: 16 FAVVCRKNRDGRDRERDGKEHKHPFKVVEITPPPRCLGVRCFP 58
>gi|428178376|gb|EKX47251.1| hypothetical protein GUITHDRAFT_106703 [Guillardia theta CCMP2712]
Length = 438
Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats.
Identities = 32/95 (33%), Positives = 56/95 (58%), Gaps = 6/95 (6%)
Query: 8 KKERDRDHKNNIHPYKVVEITPPPKCL--GIRCFPPNLQCGESVTIEGQAYTISAVTHRY 65
K+++++D K P++V I+PPP+ + G+ F P G+ + +EG YT+S V Y
Sbjct: 348 KRQKEQDDK----PFRVKIISPPPERMLEGVHFFHPRTHNGDKIAVEGADYTVSKVQFVY 403
Query: 66 QLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
G+Y+ EK L+V S+R+ +N L +LL ++
Sbjct: 404 VYSGGRYQLEEKVLEVQKSTRFDVNEKLRSLLPEA 438
>gi|145352143|ref|XP_001420416.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580650|gb|ABO98709.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 109
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 48/98 (48%), Gaps = 9/98 (9%)
Query: 8 KKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQA---------YTI 58
K + R+ + PY V TPPP LG+ P N+ CGE++ + A + +
Sbjct: 9 KSQGARESSPKLVPYNVRVATPPPVDLGLHKLPKNIACGETIEVPTNADGDESANGWFIV 68
Query: 59 SAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENL 96
+ VT Y L +G+Y RLDV + RY +N LE L
Sbjct: 69 NRVTTMYNLVRGRYRKDGSRLDVQKAERYFINASLERL 106
>gi|255084489|ref|XP_002508819.1| predicted protein [Micromonas sp. RCC299]
gi|226524096|gb|ACO70077.1| predicted protein [Micromonas sp. RCC299]
Length = 148
Score = 54.7 bits (130), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 22 YKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAY-TISAVTHRYQLRKGKYEPSEKRLD 80
Y+V +TPP + LG P N CG+++ ++ Y + V Y+L GKY L
Sbjct: 58 YEVKVVTPPERSLGPHMLPKNTHCGDTIELKNNKYYVVDRVCTHYRLEYGKYRRDHTGLY 117
Query: 81 VLSSSRYILNLYLENLLE 98
V +SRYILN +L+ LL+
Sbjct: 118 VQEASRYILNKHLDALLD 135
>gi|449015766|dbj|BAM79168.1| hypothetical protein CYME_CMD094C [Cyanidioschyzon merolae strain
10D]
Length = 175
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 50/99 (50%), Gaps = 2/99 (2%)
Query: 2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
+ V+ RKK RD+++ + P+ V TPPP LG P + G+ + + AY I
Sbjct: 73 MAVISRKKW-PRDNRDKL-PFDVYVTTPPPYYLGQFKLDPRVHNGDILEFQNHAYVIKRT 130
Query: 62 THRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
Y R G++ K +DV + +R +N +L+ + +S
Sbjct: 131 RLHYYYRNGEFRVGRKSVDVNTVARSHINSFLDRIYSRS 169
>gi|412989995|emb|CCO20637.1| unknown protein [Bathycoccus prasinos]
Length = 149
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 2/79 (2%)
Query: 24 VVEITPPPKCLGIRCFPPN-LQCGESVTIEGQAYTISAVTHRYQLRKGKY-EPSEKRLDV 81
V + TPP K LG F + + CG V + + IS VT Y L+ G+Y E ++R++V
Sbjct: 71 VTKRTPPRKDLGTFRFADDQMACGVRVDLNESTFVISRVTKTYTLKYGRYVEDEKRRVEV 130
Query: 82 LSSSRYILNLYLENLLEQS 100
S R +LN L ++ ++S
Sbjct: 131 ESLERSLLNRQLGDVFKRS 149
>gi|409992484|ref|ZP_11275670.1| hypothetical protein APPUASWS_15417 [Arthrospira platensis str.
Paraca]
gi|291571804|dbj|BAI94076.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409936651|gb|EKN78129.1| hypothetical protein APPUASWS_15417 [Arthrospira platensis str.
Paraca]
Length = 123
Score = 42.0 bits (97), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 21 PYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLD 80
P +V+ +T P K LG CF Q G + +EGQ Y + HRY L+ G+Y S+ L
Sbjct: 7 PTEVI-LTEPRKSLGNVCFDWTPQPGHYLVLEGQTYAVLERRHRYHLKSGRYRLSKIALY 65
Query: 81 VLSS 84
V S+
Sbjct: 66 VQSA 69
>gi|376007379|ref|ZP_09784577.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423063467|ref|ZP_17052257.1| hypothetical protein SPLC1_S100870 [Arthrospira platensis C1]
gi|375324339|emb|CCE20330.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406714899|gb|EKD10057.1| hypothetical protein SPLC1_S100870 [Arthrospira platensis C1]
Length = 123
Score = 42.0 bits (97), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 21 PYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLD 80
P +V+ +T P K LG CF Q G + +EGQ Y + HRY L+ G+Y S+ L
Sbjct: 7 PTEVI-LTEPRKSLGNVCFDWTPQPGHYLVLEGQTYAVLERRHRYHLKSGRYRLSKIALY 65
Query: 81 VLSS 84
V S+
Sbjct: 66 VQSA 69
>gi|209524360|ref|ZP_03272909.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|209495151|gb|EDZ95457.1| conserved hypothetical protein [Arthrospira maxima CS-328]
Length = 123
Score = 41.6 bits (96), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 32/60 (53%)
Query: 25 VEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSS 84
V +T P K LG CF Q G + +EGQ Y + HRY L+ G+Y S+ L V S+
Sbjct: 10 VILTEPRKSLGNVCFDWTPQPGHYLVLEGQTYAVLERRHRYHLKSGRYRLSKIALYVQSA 69
>gi|119485357|ref|ZP_01619685.1| hypothetical protein L8106_09456 [Lyngbya sp. PCC 8106]
gi|119457113|gb|EAW38239.1| hypothetical protein L8106_09456 [Lyngbya sp. PCC 8106]
Length = 126
Score = 38.1 bits (87), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 21 PYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLD 80
P +V+ +T P + LG F Q G + ++GQ Y + HRYQL+ G+Y + L
Sbjct: 10 PTEVI-LTHPRRSLGNIFFDWTPQPGHYLDLQGQTYAVLERRHRYQLKSGRYRLYQISLY 68
Query: 81 VLSSSR 86
V S+ R
Sbjct: 69 VQSAER 74
>gi|428215559|ref|YP_007088703.1| hypothetical protein Oscil6304_5294 [Oscillatoria acuminata PCC
6304]
gi|428003940|gb|AFY84783.1| hypothetical protein Oscil6304_5294 [Oscillatoria acuminata PCC
6304]
Length = 130
Score = 37.0 bits (84), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 35/68 (51%), Gaps = 1/68 (1%)
Query: 21 PYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLD 80
P +V+ +T P + LG N Q G + +EG+ Y + HRYQL+ G+Y L
Sbjct: 7 PTEVI-LTKPSQSLGKVHLDWNPQPGHYLDLEGKTYAVLERRHRYQLKSGRYRLHNIALY 65
Query: 81 VLSSSRYI 88
V S+ R I
Sbjct: 66 VQSAQRPI 73
>gi|428217488|ref|YP_007101953.1| hypothetical protein Pse7367_1230 [Pseudanabaena sp. PCC 7367]
gi|427989270|gb|AFY69525.1| hypothetical protein Pse7367_1230 [Pseudanabaena sp. PCC 7367]
Length = 126
Score = 36.6 bits (83), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 29/52 (55%)
Query: 44 QCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLEN 95
Q G + I+GQAYT+ H YQ R+G+Y S+ + V ++ + YLE
Sbjct: 31 QPGNYLEIDGQAYTVLERRHNYQFRRGRYRLSKIVISVQTAQKDPERTYLEG 82
>gi|428209575|ref|YP_007093928.1| hypothetical protein Chro_4673 [Chroococcidiopsis thermalis PCC
7203]
gi|428011496|gb|AFY90059.1| hypothetical protein Chro_4673 [Chroococcidiopsis thermalis PCC
7203]
Length = 117
Score = 36.6 bits (83), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 33/62 (53%)
Query: 25 VEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSS 84
V +T P + LG Q G + +EG+ Y + HRYQL+ G+Y ++ L V S+
Sbjct: 9 VILTHPRQSLGKVMLDWTPQPGNYLDLEGKTYAVLERRHRYQLKAGRYRLNKIALYVQSA 68
Query: 85 SR 86
+R
Sbjct: 69 AR 70
>gi|300867030|ref|ZP_07111699.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300334968|emb|CBN56865.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 117
Score = 35.8 bits (81), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 21 PYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLD 80
P +V+ +T P + LG Q G + ++GQ Y + HRYQLR G+Y + L
Sbjct: 7 PTEVI-LTHPRQSLGSVQLDWTPQPGNYLDLKGQTYAVLERRHRYQLRSGRYRLQKIDLY 65
Query: 81 VLSSSRYILNLYLEN 95
V S+ R ++E
Sbjct: 66 VQSAQRPTEKSFVEG 80
>gi|428300434|ref|YP_007138740.1| hypothetical protein Cal6303_3851 [Calothrix sp. PCC 6303]
gi|428236978|gb|AFZ02768.1| hypothetical protein Cal6303_3851 [Calothrix sp. PCC 6303]
Length = 120
Score = 35.8 bits (81), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 1/69 (1%)
Query: 18 NIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEK 77
NI P +V+ +T P + LG Q G + +EG+ Y + HRYQL+ G+Y +
Sbjct: 4 NILPTEVI-LTHPRQSLGSVQLDWTPQPGNYLDLEGKTYAVLERRHRYQLKLGRYHLHKI 62
Query: 78 RLDVLSSSR 86
+ V S+ R
Sbjct: 63 AIYVQSAQR 71
>gi|428305242|ref|YP_007142067.1| hypothetical protein Cri9333_1669 [Crinalium epipsammum PCC 9333]
gi|428246777|gb|AFZ12557.1| hypothetical protein Cri9333_1669 [Crinalium epipsammum PCC 9333]
Length = 117
Score = 35.4 bits (80), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
Query: 16 KNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPS 75
++N P +V+ +T P + +G Q G + IEG+ Y I H+YQLR G+Y+
Sbjct: 2 ESNSLPTEVI-LTHPRQTIGNIKLDWTPQPGNYLDIEGKTYAILERRHQYQLRSGRYQLQ 60
Query: 76 EKRLDVLSSSR 86
+ L V S+ R
Sbjct: 61 KIALYVQSAPR 71
>gi|298492311|ref|YP_003722488.1| hypothetical protein Aazo_3852 ['Nostoc azollae' 0708]
gi|298234229|gb|ADI65365.1| conserved hypothetical protein ['Nostoc azollae' 0708]
Length = 123
Score = 35.4 bits (80), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%)
Query: 25 VEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKY 72
V +T PP+ LG Q G + +EG+ Y + H+YQL+ G+Y
Sbjct: 11 VILTNPPQSLGKLLLDWTPQPGNYLDLEGKTYAVLERRHKYQLKSGRY 58
>gi|440680685|ref|YP_007155480.1| hypothetical protein Anacy_0995 [Anabaena cylindrica PCC 7122]
gi|428677804|gb|AFZ56570.1| hypothetical protein Anacy_0995 [Anabaena cylindrica PCC 7122]
Length = 121
Score = 34.3 bits (77), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 31/62 (50%)
Query: 25 VEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSS 84
V +T P + LGI Q G + +EG+ Y + H+YQL+ G+Y + V S+
Sbjct: 10 VILTNPRQSLGILKLDWTPQPGNYLDLEGKTYAVLERRHKYQLKSGRYRLHNITIYVQSA 69
Query: 85 SR 86
R
Sbjct: 70 KR 71
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.137 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,634,960,068
Number of Sequences: 23463169
Number of extensions: 58927917
Number of successful extensions: 111390
Number of sequences better than 100.0: 49
Number of HSP's better than 100.0 without gapping: 47
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 111329
Number of HSP's gapped (non-prelim): 50
length of query: 100
length of database: 8,064,228,071
effective HSP length: 69
effective length of query: 31
effective length of database: 6,445,269,410
effective search space: 199803351710
effective search space used: 199803351710
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 69 (31.2 bits)