BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027603
(221 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|118488995|gb|ABK96305.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 221
Score = 363 bits (931), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 166/221 (75%), Positives = 195/221 (88%)
Query: 1 MTPRRRTIMTSLLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAV 60
M R++ + S+ L+SLPS+S AYRPGDIVPMSKM QYH+SRTVWHD+IGKHCPIFAV
Sbjct: 1 MEANRKSRIASVFFFLISLPSVSFAYRPGDIVPMSKMGQYHSSRTVWHDMIGKHCPIFAV 60
Query: 61 NREALIPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDL 120
NRE LIPIAKPTG+TG+DP K+SFQVG+EKF +PWLFVI+RKSS+VPMIDVHLRYSG+DL
Sbjct: 61 NREVLIPIAKPTGYTGSDPYKLSFQVGKEKFLIPWLFVIHRKSSEVPMIDVHLRYSGSDL 120
Query: 121 LGVTAKIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASG 180
GVTAK++DMP YVEIHPD+ + FW E WPKH+L+RYTW+EQSEIDV+SGFYVLF SG
Sbjct: 121 HGVTAKVIDMPHHYVEIHPDIRQQFWDAERWPKHILVRYTWKEQSEIDVSSGFYVLFGSG 180
Query: 181 LMLSFILSIYILESSREKFTRFLQETVAESSMPGDGVAKVE 221
LMLSFILSIYIL+SSR+K RF++ETVAESS+P GVAKVE
Sbjct: 181 LMLSFILSIYILQSSRDKLARFVRETVAESSIPAGGVAKVE 221
>gi|224139342|ref|XP_002323065.1| predicted protein [Populus trichocarpa]
gi|222867695|gb|EEF04826.1| predicted protein [Populus trichocarpa]
Length = 221
Score = 360 bits (923), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 165/221 (74%), Positives = 194/221 (87%)
Query: 1 MTPRRRTIMTSLLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAV 60
M R++ + S+ L+SLPS+S AYRPGDIVPMSKM QYH+SRTVWHD+IGKHCPIFAV
Sbjct: 1 METNRKSRIASVFFFLISLPSVSFAYRPGDIVPMSKMGQYHSSRTVWHDMIGKHCPIFAV 60
Query: 61 NREALIPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDL 120
NRE LIPIAKPTG+TG+DP K+SFQVG+EKF +PWLFVI+RKSS+VPMIDVHLRYSG+DL
Sbjct: 61 NREVLIPIAKPTGYTGSDPYKLSFQVGKEKFLIPWLFVIHRKSSEVPMIDVHLRYSGSDL 120
Query: 121 LGVTAKIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASG 180
GV AK++DMP YVEIHPD+ + FW E WPKH+L+RYTW+EQSEIDV+SGFYVLF SG
Sbjct: 121 HGVMAKVIDMPHHYVEIHPDIHQKFWDAELWPKHILVRYTWKEQSEIDVSSGFYVLFGSG 180
Query: 181 LMLSFILSIYILESSREKFTRFLQETVAESSMPGDGVAKVE 221
LMLSFILSIYIL+SSR+K RF++ETVAESS+P GVAKVE
Sbjct: 181 LMLSFILSIYILQSSRDKLARFVRETVAESSIPAGGVAKVE 221
>gi|255584249|ref|XP_002532862.1| conserved hypothetical protein [Ricinus communis]
gi|223527374|gb|EEF29516.1| conserved hypothetical protein [Ricinus communis]
Length = 226
Score = 352 bits (902), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 165/223 (73%), Positives = 193/223 (86%), Gaps = 3/223 (1%)
Query: 2 TPRRRTIMTSLLLL---LLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIF 58
T R + +L+LL LLSLPS S AYRPGDIVPMS+M QYH+SRT+W D+IGKHCP+F
Sbjct: 4 TSNRSSKKNALILLMSFLLSLPSTSTAYRPGDIVPMSEMGQYHSSRTLWQDMIGKHCPVF 63
Query: 59 AVNREALIPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGN 118
VNRE LIPI KPTG+TGADP KISFQVGREKF VPWLF+INR+SS+VPMIDVHLRYSG+
Sbjct: 64 GVNREVLIPIPKPTGYTGADPYKISFQVGREKFLVPWLFLINRRSSEVPMIDVHLRYSGS 123
Query: 119 DLLGVTAKIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFA 178
DL GVTAK++DMP YVE+HP++ K FW P+ WPKHVL+RYTWEEQS+IDVASGFYVLF
Sbjct: 124 DLHGVTAKVIDMPHHYVELHPNIRKQFWDPQVWPKHVLVRYTWEEQSDIDVASGFYVLFG 183
Query: 179 SGLMLSFILSIYILESSREKFTRFLQETVAESSMPGDGVAKVE 221
SGLMLSFILSIYIL+SS++K RF++ETVAES++P GVAKVE
Sbjct: 184 SGLMLSFILSIYILQSSKDKLARFVRETVAESNIPAGGVAKVE 226
>gi|225439667|ref|XP_002270471.1| PREDICTED: uncharacterized protein LOC100249981 [Vitis vinifera]
gi|297735555|emb|CBI18049.3| unnamed protein product [Vitis vinifera]
Length = 219
Score = 342 bits (876), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 157/197 (79%), Positives = 174/197 (88%)
Query: 25 AYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISF 84
AYRPGDIVPMSKM QYH+SRTVWHDV+GKHCPIFAVNRE LIPI KPTG+TGADP KISF
Sbjct: 23 AYRPGDIVPMSKMGQYHSSRTVWHDVVGKHCPIFAVNREVLIPIQKPTGYTGADPYKISF 82
Query: 85 QVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKY 144
QVGREKF +PWL +INRKSS+VPMIDVHLRYSGNDL GV AK+VDMP YVEIHP++
Sbjct: 83 QVGREKFLIPWLLLINRKSSEVPMIDVHLRYSGNDLHGVVAKVVDMPHHYVEIHPNIRTQ 142
Query: 145 FWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTRFLQ 204
FW PE WPKHVL+RYTWEEQSEIDV SGFYVLF SGL++S ILSIY+L+SSR+K RF++
Sbjct: 143 FWNPEHWPKHVLVRYTWEEQSEIDVTSGFYVLFGSGLVMSVILSIYVLQSSRDKLARFVR 202
Query: 205 ETVAESSMPGDGVAKVE 221
ETVAE SM G GVAKVE
Sbjct: 203 ETVAEGSMSGGGVAKVE 219
>gi|226501618|ref|NP_001145033.1| uncharacterized protein LOC100278212 precursor [Zea mays]
gi|195605980|gb|ACG24820.1| hypothetical protein [Zea mays]
gi|195628092|gb|ACG35876.1| hypothetical protein [Zea mays]
gi|413944509|gb|AFW77158.1| hypothetical protein ZEAMMB73_638350 [Zea mays]
Length = 227
Score = 339 bits (870), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 155/221 (70%), Positives = 191/221 (86%), Gaps = 2/221 (0%)
Query: 2 TPRRRTIMTSLLLLLLS-LPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAV 60
+P R ++ +LLL+L S LPS++ AYRPGDIVPM + QYH SR+VW DV+G+HCP FAV
Sbjct: 8 SPARHGVLPALLLILCSSLPSLAAAYRPGDIVPMLRSGQYHGSRSVWFDVVGRHCPAFAV 67
Query: 61 NREALIPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDL 120
NRE L+PI KPTGFTGADP KI+FQ+G EKF VPWL+VINRK+S+VP+ID HL+YSGND+
Sbjct: 68 NREVLMPIPKPTGFTGADPYKITFQIGHEKFHVPWLYVINRKTSEVPLIDFHLKYSGNDI 127
Query: 121 LGVTAKIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASG 180
LGVTAK+VDMP YVE+HPD+ K FW P++WPK+VL+RYTWEEQSEIDV GFYVLF SG
Sbjct: 128 LGVTAKVVDMPHHYVEVHPDIKKNFWDPQNWPKYVLVRYTWEEQSEIDVTGGFYVLFGSG 187
Query: 181 LMLSFILSIYILESSREKFTRFLQETVAESSMPGDGVAKVE 221
L+LSFIL+IY+L+SS+EK TRF++E VA+SS+P DGVAKVE
Sbjct: 188 LVLSFILAIYVLQSSQEKLTRFVREAVADSSLP-DGVAKVE 227
>gi|242087037|ref|XP_002439351.1| hypothetical protein SORBIDRAFT_09g005000 [Sorghum bicolor]
gi|241944636|gb|EES17781.1| hypothetical protein SORBIDRAFT_09g005000 [Sorghum bicolor]
Length = 228
Score = 328 bits (842), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 153/218 (70%), Positives = 183/218 (83%)
Query: 4 RRRTIMTSLLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNRE 63
RR + LL+L SLP + AYRPGDIVPM + QYH SRTVW DV+G+HCP FAVNRE
Sbjct: 11 RRGLLPALLLILCSSLPHFAAAYRPGDIVPMLRSGQYHGSRTVWFDVVGRHCPAFAVNRE 70
Query: 64 ALIPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGV 123
L+PI KPTGFTGADP KI+FQ+G EKF VPWL+VINRK+S+VP+ID HL+YSGND+LGV
Sbjct: 71 VLMPIPKPTGFTGADPYKITFQIGHEKFHVPWLYVINRKTSEVPLIDFHLKYSGNDILGV 130
Query: 124 TAKIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLML 183
TAK+VDMP YVE+HPD+ K FW + WPK+VL+RYTWEEQSEIDVA GFYVLF SGL+L
Sbjct: 131 TAKVVDMPHHYVEVHPDIKKNFWDLQKWPKYVLVRYTWEEQSEIDVAGGFYVLFGSGLVL 190
Query: 184 SFILSIYILESSREKFTRFLQETVAESSMPGDGVAKVE 221
SFIL+IY+L+SS+EK TRF++E VA+SS+P GVAKVE
Sbjct: 191 SFILAIYVLQSSQEKLTRFVREAVADSSLPEGGVAKVE 228
>gi|357134484|ref|XP_003568847.1| PREDICTED: uncharacterized protein LOC100825817 [Brachypodium
distachyon]
Length = 230
Score = 325 bits (832), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 148/204 (72%), Positives = 177/204 (86%)
Query: 18 SLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGA 77
SL ++ AYRPGDIVPM + QYH SR+VW+DVIG+HCP FAVNRE L+PI KPTGFTGA
Sbjct: 27 SLLPLADAYRPGDIVPMLRSGQYHGSRSVWYDVIGRHCPAFAVNREVLMPIPKPTGFTGA 86
Query: 78 DPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEI 137
DP KI+FQ+G EKF VPWL+VINRKSSQVPMID HL+YSGNDLLGVTAK+VDMP +VE+
Sbjct: 87 DPYKITFQIGHEKFHVPWLYVINRKSSQVPMIDFHLKYSGNDLLGVTAKVVDMPHHFVEL 146
Query: 138 HPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSRE 197
HPD+ K FW ++WPK+VL+ YTWEEQSEIDVA GFYVLF SGL+LSFIL+IY+L+SS+E
Sbjct: 147 HPDIKKTFWDQQNWPKNVLVSYTWEEQSEIDVAGGFYVLFGSGLVLSFILAIYVLQSSQE 206
Query: 198 KFTRFLQETVAESSMPGDGVAKVE 221
K TRF++E V++SS+P GVAKVE
Sbjct: 207 KLTRFVREAVSDSSLPEGGVAKVE 230
>gi|449439815|ref|XP_004137681.1| PREDICTED: uncharacterized protein LOC101203220 [Cucumis sativus]
gi|449523151|ref|XP_004168588.1| PREDICTED: uncharacterized LOC101203220 [Cucumis sativus]
Length = 218
Score = 324 bits (831), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 147/197 (74%), Positives = 171/197 (86%)
Query: 25 AYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISF 84
AYRPGDIVPMSKM QYH+SRTVWHD+IG+HCPI+ VNRE L+PI KP G+TGADP KISF
Sbjct: 22 AYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIYGVNREVLVPIPKPVGYTGADPYKISF 81
Query: 85 QVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKY 144
QVG+EKF VPWL VINRKS++VPMIDVHLRYSG+DL GVTAK+VDMP Y++ HP + K
Sbjct: 82 QVGKEKFLVPWLLVINRKSAEVPMIDVHLRYSGSDLHGVTAKVVDMPHIYIDTHPHISKQ 141
Query: 145 FWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTRFLQ 204
FW + WPKH+L+RYTWEEQSEIDV SG YVLF SGL LSFILS+YIL+SS++K RF++
Sbjct: 142 FWDQQHWPKHILVRYTWEEQSEIDVTSGLYVLFGSGLTLSFILSVYILQSSKDKLARFVR 201
Query: 205 ETVAESSMPGDGVAKVE 221
ETV ESS+PG GVAKVE
Sbjct: 202 ETVVESSIPGVGVAKVE 218
>gi|359806290|ref|NP_001241475.1| uncharacterized protein LOC100810722 precursor [Glycine max]
gi|255639449|gb|ACU20019.1| unknown [Glycine max]
Length = 224
Score = 323 bits (828), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 146/200 (73%), Positives = 176/200 (88%)
Query: 22 ISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVK 81
I++AYRPGDIVPMS+M QYH+SRTVW D+IG+HCPIFAVNRE L+PI KPTG+TGAD K
Sbjct: 25 IAVAYRPGDIVPMSRMGQYHSSRTVWQDLIGRHCPIFAVNREVLMPIPKPTGYTGADAYK 84
Query: 82 ISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDL 141
ISFQVGREKF +PWL V+NRKS++VPMI+V LRYSG+DL GVTAK+VDMP YVE+HP++
Sbjct: 85 ISFQVGREKFLIPWLLVVNRKSTEVPMIEVDLRYSGSDLHGVTAKVVDMPHHYVEVHPEI 144
Query: 142 PKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTR 201
K FW + WPKH+L+RYTW+E SEIDV SGF+VLF SGLMLSFILSIY+L+SSR+K R
Sbjct: 145 RKQFWDSQHWPKHILVRYTWKEHSEIDVTSGFFVLFGSGLMLSFILSIYVLQSSRDKLER 204
Query: 202 FLQETVAESSMPGDGVAKVE 221
F++ETV ESS+PG+ VAKVE
Sbjct: 205 FVRETVVESSVPGEVVAKVE 224
>gi|357506569|ref|XP_003623573.1| hypothetical protein MTR_7g072680 [Medicago truncatula]
gi|217075636|gb|ACJ86178.1| unknown [Medicago truncatula]
gi|355498588|gb|AES79791.1| hypothetical protein MTR_7g072680 [Medicago truncatula]
gi|388505934|gb|AFK41033.1| unknown [Medicago truncatula]
Length = 228
Score = 322 bits (824), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 145/197 (73%), Positives = 173/197 (87%)
Query: 25 AYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISF 84
AY+PGDIVPMS+M QYH+SRTVW D+IG+HCPIFAVNRE LIPI KPTG+TGADP KISF
Sbjct: 32 AYKPGDIVPMSRMGQYHSSRTVWQDLIGRHCPIFAVNREVLIPIPKPTGYTGADPYKISF 91
Query: 85 QVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKY 144
QVGREKF +PWL V+NRKS++VPMID+ L+YSG+DLLGVTAK++DMP YVEIHP++ K+
Sbjct: 92 QVGREKFYIPWLLVVNRKSTEVPMIDIELKYSGSDLLGVTAKVLDMPHHYVEIHPEIGKH 151
Query: 145 FWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTRFLQ 204
FW + WPKH+L RYTW+E SEIDV SGFYVLF SGL+LSFILSIY L+SSR+K RF++
Sbjct: 152 FWDAQHWPKHILARYTWKEHSEIDVTSGFYVLFGSGLLLSFILSIYTLQSSRDKLERFVR 211
Query: 205 ETVAESSMPGDGVAKVE 221
ETVAESS+P +AKVE
Sbjct: 212 ETVAESSIPVGEIAKVE 228
>gi|297819862|ref|XP_002877814.1| hypothetical protein ARALYDRAFT_906509 [Arabidopsis lyrata subsp.
lyrata]
gi|297323652|gb|EFH54073.1| hypothetical protein ARALYDRAFT_906509 [Arabidopsis lyrata subsp.
lyrata]
Length = 230
Score = 319 bits (817), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 147/211 (69%), Positives = 172/211 (81%)
Query: 6 RTIMTSLLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREAL 65
+ +L+L+ LS P++S AYRPGDIV MSKM QYH+SRT WHDVIGKHCPIFAVNRE L
Sbjct: 14 NAFIQALVLISLSFPALSSAYRPGDIVRMSKMGQYHSSRTTWHDVIGKHCPIFAVNREVL 73
Query: 66 IPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTA 125
IPIAKP G+TG DP KI FQVG EK+ + WL VINRKSS+VPMIDV+LRYSG DLLGVTA
Sbjct: 74 IPIAKPIGYTGTDPYKIKFQVGSEKYLIHWLLVINRKSSEVPMIDVNLRYSGGDLLGVTA 133
Query: 126 KIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSF 185
++VDMP Y+ HP++ K FW PE WPKHVL+RYTW+EQSEIDV+SGFYVLF S L SF
Sbjct: 134 EVVDMPHSYLNTHPEIRKQFWDPEHWPKHVLVRYTWKEQSEIDVSSGFYVLFGSALTFSF 193
Query: 186 ILSIYILESSREKFTRFLQETVAESSMPGDG 216
+LSIY+L+SSREK RF++ETV ESS G
Sbjct: 194 VLSIYVLQSSREKLARFVRETVVESSSTNVG 224
>gi|18409385|ref|NP_566953.1| uncharacterized protein [Arabidopsis thaliana]
gi|22531176|gb|AAM97092.1| putative protein [Arabidopsis thaliana]
gi|30023682|gb|AAP13374.1| At3g51610 [Arabidopsis thaliana]
gi|110742543|dbj|BAE99187.1| hypothetical protein [Arabidopsis thaliana]
gi|332645291|gb|AEE78812.1| uncharacterized protein [Arabidopsis thaliana]
Length = 230
Score = 316 bits (810), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 145/206 (70%), Positives = 170/206 (82%)
Query: 6 RTIMTSLLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREAL 65
+ +L+L+ LS P +S AYRPGDIV MSKM QYH+SRT WHDVIGKHCPIFAVNRE L
Sbjct: 14 NAFIQALVLISLSFPFLSSAYRPGDIVRMSKMGQYHSSRTTWHDVIGKHCPIFAVNREVL 73
Query: 66 IPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTA 125
IPIAKP G+TG DP KI FQVG EKF + WL VINRKSS+VPMIDV+LRYSG DLLGVTA
Sbjct: 74 IPIAKPIGYTGTDPYKIKFQVGSEKFLIHWLLVINRKSSEVPMIDVNLRYSGGDLLGVTA 133
Query: 126 KIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSF 185
+++DMP Y+ HP++ K FW P+ WPKHVL+RYTW+EQSEIDV+SGFYVLF S L SF
Sbjct: 134 QVIDMPHSYLNTHPEIRKQFWDPQHWPKHVLVRYTWKEQSEIDVSSGFYVLFGSALTFSF 193
Query: 186 ILSIYILESSREKFTRFLQETVAESS 211
+LSIY+L+SSREK RF++ETV ESS
Sbjct: 194 VLSIYVLQSSREKLARFVRETVVESS 219
>gi|21617957|gb|AAM67007.1| unknown [Arabidopsis thaliana]
Length = 227
Score = 316 bits (809), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 145/206 (70%), Positives = 170/206 (82%)
Query: 6 RTIMTSLLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREAL 65
+ +L+L+ LS P +S AYRPGDIV MSKM QYH+SRT WHDVIGKHCPIFAVNRE L
Sbjct: 11 NAFIQALVLISLSFPFLSSAYRPGDIVRMSKMGQYHSSRTTWHDVIGKHCPIFAVNREVL 70
Query: 66 IPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTA 125
IPIAKP G+TG DP KI FQVG EKF + WL VINRKSS+VPMIDV+LRYSG DLLGVTA
Sbjct: 71 IPIAKPIGYTGTDPYKIKFQVGSEKFLIHWLLVINRKSSEVPMIDVNLRYSGGDLLGVTA 130
Query: 126 KIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSF 185
+++DMP Y+ HP++ K FW P+ WPKHVL+RYTW+EQSEIDV+SGFYVLF S L SF
Sbjct: 131 QVIDMPHSYLNTHPEIRKQFWDPQHWPKHVLVRYTWKEQSEIDVSSGFYVLFGSALTFSF 190
Query: 186 ILSIYILESSREKFTRFLQETVAESS 211
+LSIY+L+SSREK RF++ETV ESS
Sbjct: 191 VLSIYVLQSSREKLARFVRETVVESS 216
>gi|218196165|gb|EEC78592.1| hypothetical protein OsI_18610 [Oryza sativa Indica Group]
Length = 228
Score = 310 bits (795), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 141/200 (70%), Positives = 170/200 (85%)
Query: 22 ISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVK 81
++ AYRPGDIVPM + QYH SR+VW DV+G+HCP FAVN E ++PI KPTGFTGADP K
Sbjct: 29 LASAYRPGDIVPMLRSGQYHGSRSVWFDVVGRHCPSFAVNHEVMMPIPKPTGFTGADPYK 88
Query: 82 ISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDL 141
I+FQ+G EKF +PWL+VINRKSS+VPMID HL+YSGNDLLGVTAK+VDMP YVE HPD+
Sbjct: 89 ITFQIGHEKFHLPWLYVINRKSSEVPMIDFHLKYSGNDLLGVTAKVVDMPHIYVEHHPDI 148
Query: 142 PKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTR 201
K FW ++WPK+VL+RYTWEEQSEIDV GFYVLF SGL+LSFIL+IY+L+SS+EK TR
Sbjct: 149 RKNFWDQQNWPKYVLVRYTWEEQSEIDVPGGFYVLFGSGLVLSFILAIYVLQSSQEKLTR 208
Query: 202 FLQETVAESSMPGDGVAKVE 221
F++E V +SS+P G AKVE
Sbjct: 209 FVREAVNDSSLPEGGFAKVE 228
>gi|115462309|ref|NP_001054754.1| Os05g0168400 [Oryza sativa Japonica Group]
gi|53982146|gb|AAV25242.1| unknown protein [Oryza sativa Japonica Group]
gi|113578305|dbj|BAF16668.1| Os05g0168400 [Oryza sativa Japonica Group]
gi|215697616|dbj|BAG91610.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741138|dbj|BAG97633.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222630339|gb|EEE62471.1| hypothetical protein OsJ_17268 [Oryza sativa Japonica Group]
Length = 227
Score = 310 bits (795), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 141/197 (71%), Positives = 168/197 (85%)
Query: 25 AYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISF 84
AYRPGDIVPM + QYH SR+VW DV+G+HCP FAVN E ++PI KPTGFTGADP KI+F
Sbjct: 31 AYRPGDIVPMLRSGQYHGSRSVWFDVVGRHCPSFAVNHEVMMPIPKPTGFTGADPYKITF 90
Query: 85 QVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKY 144
Q+G EKF +PWL+VINRKSS+VPMID HL+YSGNDLLGVTAK+VDMP YVE HPD+ K
Sbjct: 91 QIGHEKFHLPWLYVINRKSSEVPMIDFHLKYSGNDLLGVTAKVVDMPHIYVEHHPDIRKN 150
Query: 145 FWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTRFLQ 204
FW ++WPK+VL+RYTWEEQSEIDV GFYVLF SGL+LSFIL+IY+L+SS+EK TRF++
Sbjct: 151 FWDQQNWPKYVLVRYTWEEQSEIDVPGGFYVLFGSGLVLSFILAIYVLQSSQEKLTRFVR 210
Query: 205 ETVAESSMPGDGVAKVE 221
E V +SS+P G AKVE
Sbjct: 211 EAVNDSSLPEGGFAKVE 227
>gi|4902482|emb|CAB43523.1| hypothetical protein [Arabidopsis thaliana]
gi|6572082|emb|CAB63025.1| putative protein [Arabidopsis thaliana]
Length = 189
Score = 290 bits (742), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 131/178 (73%), Positives = 151/178 (84%)
Query: 34 MSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISFQVGREKFRV 93
MSKM QYH+SRT WHDVIGKHCPIFAVNRE LIPIAKP G+TG DP KI FQVG EKF +
Sbjct: 1 MSKMGQYHSSRTTWHDVIGKHCPIFAVNREVLIPIAKPIGYTGTDPYKIKFQVGSEKFLI 60
Query: 94 PWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKYFWQPESWPK 153
WL VINRKSS+VPMIDV+LRYSG DLLGVTA+++DMP Y+ HP++ K FW P+ WPK
Sbjct: 61 HWLLVINRKSSEVPMIDVNLRYSGGDLLGVTAQVIDMPHSYLNTHPEIRKQFWDPQHWPK 120
Query: 154 HVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTRFLQETVAESS 211
HVL+RYTW+EQSEIDV+SGFYVLF S L SF+LSIY+L+SSREK RF++ETV ESS
Sbjct: 121 HVLVRYTWKEQSEIDVSSGFYVLFGSALTFSFVLSIYVLQSSREKLARFVRETVVESS 178
>gi|116792279|gb|ABK26301.1| unknown [Picea sitchensis]
Length = 218
Score = 288 bits (738), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 131/215 (60%), Positives = 170/215 (79%), Gaps = 1/215 (0%)
Query: 7 TIMTSLLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALI 66
T++ L++++ + S +YR GD+VP S+M QYH RT WHD++G HCPIF VNRE L+
Sbjct: 5 TMLLQLVVVIGMMGMASASYRVGDLVPTSRMGQYHAMRTNWHDILGHHCPIFGVNREVLL 64
Query: 67 PIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAK 126
PI KPTG+TGAD KISFQVGREKF +PWL VINRKS ++PMID+HLR+SG D+ GVTAK
Sbjct: 65 PIPKPTGYTGADAYKISFQVGREKFFIPWLLVINRKSPEIPMIDIHLRFSGGDIHGVTAK 124
Query: 127 IVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFI 186
+V+MP++YV+ H DL K FW PE WPK +L+RY WEE SEIDV+ GFYVLF +G +L+ +
Sbjct: 125 VVNMPRKYVDSHEDLLKEFWDPEHWPKRILVRYFWEETSEIDVSGGFYVLFGAGFLLTIV 184
Query: 187 LSIYILESSREKFTRFLQETVAESSMPGDGVAKVE 221
+SIYIL+SS+EK RF++E V ESS+P + AKVE
Sbjct: 185 MSIYILQSSQEKLVRFVRENVVESSLPMEE-AKVE 218
>gi|356530183|ref|XP_003533663.1| PREDICTED: uncharacterized protein LOC100817258 [Glycine max]
Length = 185
Score = 261 bits (668), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 117/163 (71%), Positives = 138/163 (84%)
Query: 18 SLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGA 77
S I++AYRPGDIVPMS M QYH+SRTVW D+IG+HCPIFAVNRE L+PI KPTG+TGA
Sbjct: 21 SFLRIAVAYRPGDIVPMSCMGQYHSSRTVWQDLIGRHCPIFAVNREVLMPIPKPTGYTGA 80
Query: 78 DPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEI 137
D KI FQVGREKF +PWL V+NRKS++VPMI+V LRYSGNDL GVTAK+VDMP YVE+
Sbjct: 81 DAYKILFQVGREKFLIPWLLVVNRKSTEVPMIEVDLRYSGNDLHGVTAKVVDMPHHYVEV 140
Query: 138 HPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASG 180
HP++ K FW + WPKH+L+RYTW+E SEIDV SGF+VLF SG
Sbjct: 141 HPEISKQFWDSQHWPKHILVRYTWKEHSEIDVTSGFFVLFGSG 183
>gi|219887697|gb|ACL54223.1| unknown [Zea mays]
gi|413944508|gb|AFW77157.1| hypothetical protein ZEAMMB73_638350 [Zea mays]
Length = 155
Score = 254 bits (648), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 115/156 (73%), Positives = 140/156 (89%), Gaps = 1/156 (0%)
Query: 66 IPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTA 125
+PI KPTGFTGADP KI+FQ+G EKF VPWL+VINRK+S+VP+ID HL+YSGND+LGVTA
Sbjct: 1 MPIPKPTGFTGADPYKITFQIGHEKFHVPWLYVINRKTSEVPLIDFHLKYSGNDILGVTA 60
Query: 126 KIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSF 185
K+VDMP YVE+HPD+ K FW P++WPK+VL+RYTWEEQSEIDV GFYVLF SGL+LSF
Sbjct: 61 KVVDMPHHYVEVHPDIKKNFWDPQNWPKYVLVRYTWEEQSEIDVTGGFYVLFGSGLVLSF 120
Query: 186 ILSIYILESSREKFTRFLQETVAESSMPGDGVAKVE 221
IL+IY+L+SS+EK TRF++E VA+SS+P DGVAKVE
Sbjct: 121 ILAIYVLQSSQEKLTRFVREAVADSSLP-DGVAKVE 155
>gi|302762418|ref|XP_002964631.1| hypothetical protein SELMODRAFT_81108 [Selaginella moellendorffii]
gi|300168360|gb|EFJ34964.1| hypothetical protein SELMODRAFT_81108 [Selaginella moellendorffii]
Length = 196
Score = 235 bits (600), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 102/190 (53%), Positives = 138/190 (72%)
Query: 12 LLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKP 71
LLL+++ Y G+ VP+++ QYHN RT WHD +G+HCP F +NRE ++PI KP
Sbjct: 7 FLLLVIAAIGACDGYFLGEYVPLARRGQYHNMRTPWHDYLGRHCPRFGINREVVVPIPKP 66
Query: 72 TGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMP 131
G++GADP KIS Q G E+ PWLF++ R+SS++PMIDV LRY+G DL GV+AK+V+MP
Sbjct: 67 VGYSGADPYKISLQFGHERITTPWLFIVGRQSSKMPMIDVTLRYTGGDLEGVSAKVVEMP 126
Query: 132 QRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYI 191
+ YV H K FW P WPKHVL+RY W SEIDVA+GFY+LF S +L+ ++SIYI
Sbjct: 127 EDYVAEHEGTFKEFWDPSQWPKHVLVRYNWYAYSEIDVATGFYILFGSAFLLTLVMSIYI 186
Query: 192 LESSREKFTR 201
L+SS++K +R
Sbjct: 187 LQSSKDKLSR 196
>gi|302815661|ref|XP_002989511.1| hypothetical protein SELMODRAFT_130036 [Selaginella moellendorffii]
gi|300142689|gb|EFJ09387.1| hypothetical protein SELMODRAFT_130036 [Selaginella moellendorffii]
Length = 196
Score = 232 bits (591), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 100/190 (52%), Positives = 137/190 (72%)
Query: 12 LLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKP 71
LLL+++ Y G+ VP+++ QYHN RT WHD +G+HCP F ++RE ++PI KP
Sbjct: 7 FLLLVIAAIGACDGYFLGEYVPLARRGQYHNMRTPWHDYLGRHCPRFGIDREVVVPIPKP 66
Query: 72 TGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMP 131
G++GADP KIS Q G E+ PWLF++ R+SS++PMIDV LRY+G DL GV+AK+V+MP
Sbjct: 67 VGYSGADPYKISLQFGHERITTPWLFIVGRQSSKMPMIDVTLRYTGGDLEGVSAKVVEMP 126
Query: 132 QRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYI 191
+ Y H K FW P WPKHVL+RY W SEIDVA+GFY+LF S +L+ ++SIYI
Sbjct: 127 EDYAAEHEGTFKEFWDPSQWPKHVLVRYNWYAYSEIDVATGFYILFGSAFLLTLVMSIYI 186
Query: 192 LESSREKFTR 201
L+SS++K +R
Sbjct: 187 LQSSKDKLSR 196
>gi|326497509|dbj|BAK05844.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 156
Score = 220 bits (561), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 97/139 (69%), Positives = 123/139 (88%)
Query: 83 SFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLP 142
+FQ+G EKF VPWL+VINRKSS+VP+ID HL+Y+GNDLLGVTAK+VDMP +VE+HPD+
Sbjct: 18 TFQIGHEKFHVPWLYVINRKSSEVPLIDFHLKYTGNDLLGVTAKVVDMPHHFVELHPDIK 77
Query: 143 KYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTRF 202
K FW ++WPK+VL+ YTWEEQSEIDV +GFYVLF SGL+LSFIL+IY+L+SS+EK TRF
Sbjct: 78 KNFWDQQNWPKYVLVSYTWEEQSEIDVTAGFYVLFGSGLVLSFILAIYVLQSSQEKLTRF 137
Query: 203 LQETVAESSMPGDGVAKVE 221
++E V++SS+P GVAKVE
Sbjct: 138 VREAVSDSSLPEGGVAKVE 156
>gi|255633058|gb|ACU16884.1| unknown [Glycine max]
Length = 176
Score = 218 bits (555), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 96/134 (71%), Positives = 116/134 (86%)
Query: 22 ISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVK 81
I++AYRPGDIVPMS+M QYH+SRTVW D+IG+HCPIFAVNRE L+PI KPTG+TGAD K
Sbjct: 25 IAVAYRPGDIVPMSRMGQYHSSRTVWQDLIGRHCPIFAVNREVLMPIPKPTGYTGADAYK 84
Query: 82 ISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDL 141
IS QVGREKF +PWL V+NRKS++VPMI+V LRYSG+DL GVTAK+VDMP YVE+HP++
Sbjct: 85 ISSQVGREKFLIPWLLVVNRKSTEVPMIEVDLRYSGSDLHGVTAKVVDMPHHYVEVHPEI 144
Query: 142 PKYFWQPESWPKHV 155
K FW + WPKH+
Sbjct: 145 RKQFWDSQHWPKHI 158
>gi|168065793|ref|XP_001784831.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663585|gb|EDQ50341.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 230
Score = 218 bits (555), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 100/201 (49%), Positives = 140/201 (69%), Gaps = 2/201 (0%)
Query: 21 SISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPV 80
+LAY+ GD VP+++ Q+H T W D +G+HCP F V+ E ++P+ KP GFT ++
Sbjct: 32 DFALAYQTGDPVPVARRGQFHGQTTSWMDQLGRHCPHFGVDTEVVMPLPKPVGFTESENY 91
Query: 81 KISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPD 140
KISFQ GREK+ PWL +I RK QVPM++V LR+SG DL GVT K+V MP++Y+ H
Sbjct: 92 KISFQFGREKYLTPWLLMIGRK--QVPMLEVTLRHSGGDLEGVTTKVVPMPEKYLTEHAS 149
Query: 141 LPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFT 200
+ + F P WPKH+L++YTW E+SEIDV G +VLF SG +L I ++YIL++S++K
Sbjct: 150 IRQKFEDPADWPKHILVQYTWVEKSEIDVVGGLFVLFLSGNILFVITALYILQTSKDKLA 209
Query: 201 RFLQETVAESSMPGDGVAKVE 221
RFL+E V E+SM G V K +
Sbjct: 210 RFLKENVVETSMTGGDVPKAD 230
>gi|159464010|ref|XP_001690235.1| predicted protein [Chlamydomonas reinhardtii]
gi|158284223|gb|EDP09973.1| predicted protein [Chlamydomonas reinhardtii]
Length = 226
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 110/192 (57%), Gaps = 1/192 (0%)
Query: 21 SISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPT-GFTGADP 79
+++ +R GD + S+ +Q+H SRT W D++G+HCP F ++R +PI KP F D
Sbjct: 20 ALATTFRDGDYIHTSRKAQFHQSRTNWQDLLGQHCPRFGIDRLVAVPIPKPQLAFGKGDT 79
Query: 80 VKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHP 139
K+ F ++ PWL ++ + VP+I V LR SG +LLG TA ++D P+ Y + HP
Sbjct: 80 YKLQFSFDGDRHLTPWLPLLGEGAPAVPLIIVTLRRSGEELLGATAAVLDAPEEYKQRHP 139
Query: 140 DLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKF 199
L WPKHVL+ Y ++ ++++D+ G YVLF GL+ +L + L + K
Sbjct: 140 VLVSELHNVTHWPKHVLVHYRFDTRNDVDLDRGLYVLFPIGLIAVLVLCLSALRGVQPKL 199
Query: 200 TRFLQETVAESS 211
+FL + AE +
Sbjct: 200 AQFLADVTAEGT 211
>gi|195650167|gb|ACG44551.1| hypothetical protein [Zea mays]
gi|413944507|gb|AFW77156.1| hypothetical protein ZEAMMB73_638350 [Zea mays]
Length = 91
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 66/92 (71%), Positives = 81/92 (88%), Gaps = 1/92 (1%)
Query: 130 MPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSI 189
MP YVE+HPD+ K FW P++WPK+VL+RYTWEEQSEIDV GFYVLF SGL+LSFIL+I
Sbjct: 1 MPHHYVEVHPDIKKNFWDPQNWPKYVLVRYTWEEQSEIDVTGGFYVLFGSGLVLSFILAI 60
Query: 190 YILESSREKFTRFLQETVAESSMPGDGVAKVE 221
Y+L+SS+EK TRF++E VA+SS+P DGVAKVE
Sbjct: 61 YVLQSSQEKLTRFVREAVADSSLP-DGVAKVE 91
>gi|302833932|ref|XP_002948529.1| hypothetical protein VOLCADRAFT_80300 [Volvox carteri f.
nagariensis]
gi|300266216|gb|EFJ50404.1| hypothetical protein VOLCADRAFT_80300 [Volvox carteri f.
nagariensis]
Length = 226
Score = 139 bits (350), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 106/182 (58%), Gaps = 1/182 (0%)
Query: 23 SLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPT-GFTGADPVK 81
+ ++R GD + S+ +Q+H +RT W D++G+HCP F ++R +PI KP F D K
Sbjct: 44 ATSFRDGDYIHTSRKAQFHQARTNWQDLLGQHCPRFGIDRLVAVPIPKPQLPFGEKDTYK 103
Query: 82 ISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDL 141
+ F ++ PW+ ++ + + VP++ V LR SG +LLG +A++VD P Y + HP L
Sbjct: 104 LQFSFDGDRHLTPWVPLLGQGAPSVPLVMVTLRRSGTELLGASAEVVDAPAEYQQHHPVL 163
Query: 142 PKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYILESSREKFTR 201
WPKHVL+ Y ++ ++++D+ G Y+LF +GL+ L + L + K +
Sbjct: 164 VAEMRNVSHWPKHVLVHYRFDTRNDVDLDRGLYLLFPAGLLAVLALCLSALRGVQPKLAQ 223
Query: 202 FL 203
FL
Sbjct: 224 FL 225
>gi|307105928|gb|EFN54175.1| hypothetical protein CHLNCDRAFT_17489, partial [Chlorella
variabilis]
Length = 170
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 89/155 (57%)
Query: 26 YRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISFQ 85
++ GD VP ++ +Q+H RT WHD++G+HCP F V R +P+ +P G+ AD K+ F
Sbjct: 16 FQEGDFVPSARRAQFHGQRTHWHDILGQHCPKFGVKRLVAVPLPQPLGYKAADDYKVQFS 75
Query: 86 VGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKYF 145
++ PWL VI ++++ P ++V L SG+ ++ V+A + + + H L + F
Sbjct: 76 FDGDRHLTPWLPVIGKRAASPPYVEVELTRSGDSIVAVSASVYQLDEEDQAQHAPLVREF 135
Query: 146 WQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASG 180
WPKH+L+ YTW + E D +G VLF+
Sbjct: 136 LNATHWPKHLLVHYTWHTRHEEDEEAGLLVLFSGA 170
>gi|255075015|ref|XP_002501182.1| predicted protein [Micromonas sp. RCC299]
gi|226516445|gb|ACO62440.1| predicted protein, partial [Micromonas sp. RCC299]
Length = 173
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 81/153 (52%)
Query: 26 YRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISFQ 85
Y GD+VP ++ +Q+H RT WHD+ +HCP F + +PI KPT + D KIS
Sbjct: 21 YHHGDLVPTARRAQFHGQRTHWHDLTARHCPKFGEDHAVAVPIPKPTSWREDDEYKISLS 80
Query: 86 VGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKYF 145
++ WL + VPM+D+ + + ++ V A V + ++Y++ H L + F
Sbjct: 81 FEGDRHLTGWLLRGDEDPGVVPMLDIEITHGRGEIRAVKADTVAVGRKYLKTHRSLVEEF 140
Query: 146 WQPESWPKHVLIRYTWEEQSEIDVASGFYVLFA 178
WPKH+L+RY W E++++D VL
Sbjct: 141 HNHTVWPKHLLVRYRWTEKTDVDAKFSSTVLLG 173
>gi|297794977|ref|XP_002865373.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297849718|ref|XP_002892740.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311208|gb|EFH41632.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297338582|gb|EFH68999.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 83
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/70 (70%), Positives = 54/70 (77%)
Query: 6 RTIMTSLLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREAL 65
+ +L L+ LS P S AYRPGDIV MSKM QYH+SR WHDVIGKHCPIFAVNRE L
Sbjct: 14 NAFIQALFLISLSFPIPSSAYRPGDIVRMSKMGQYHSSRITWHDVIGKHCPIFAVNREVL 73
Query: 66 IPIAKPTGFT 75
IPIAKP G+T
Sbjct: 74 IPIAKPIGYT 83
>gi|303290658|ref|XP_003064616.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454214|gb|EEH51521.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 139
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 74/139 (53%), Gaps = 1/139 (0%)
Query: 35 SKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISFQVGREKFRVP 94
S+ +Q+H RT WHD++ HCP F + +P+ +PT + D KI ++
Sbjct: 1 SRRAQFHGERTAWHDLLATHCPTFDRDGVVAVPLPRPTNLSDDDAYKIRLSFDSDRHHTD 60
Query: 95 WLFVI-NRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKYFWQPESWPK 153
W+ VI +K VPMIDV ++ V A++V +P+ Y+ H L +WPK
Sbjct: 61 WMTVIPEKKEPPVPMIDVAFYVDDGRVVAVAAQVVPLPRAYLREHRALVAEHHDRGAWPK 120
Query: 154 HVLIRYTWEEQSEIDVASG 172
HV++RY W+++ +DV G
Sbjct: 121 HVIVRYRWKKRFRVDVNGG 139
>gi|428168436|gb|EKX37381.1| hypothetical protein GUITHDRAFT_145078 [Guillardia theta CCMP2712]
Length = 259
Score = 76.3 bits (186), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 87/185 (47%), Gaps = 17/185 (9%)
Query: 12 LLLLLLSLPSISLAYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKP 71
LL+ LL++ + + GD V K Q+ RT+W DV+ P++AV+R + + P
Sbjct: 8 LLVGLLAMVGCAHGFFRGDPVQTFKRMQFQGKRTMWSDVLVGQGPLYAVDRS--VELNYP 65
Query: 72 -TGFTGADPVKISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDM 130
+D +K+ EKF W+ V + + + ++ V L YSGN++ + K
Sbjct: 66 YNDIAASDTLKMQLAYDHEKFSSEWITVSDGNGNHLDVLTVELVYSGNEIHEIRHKY--- 122
Query: 131 PQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIY 190
RY ++ P P P + + Y W +++ D G LFA+G ++ +L ++
Sbjct: 123 --RYKQVEPHEPH--------PDRITVEYHWISEADTDTRMGLDFLFAAGCCVT-VLVVF 171
Query: 191 ILESS 195
+ S
Sbjct: 172 VTSSD 176
>gi|384250249|gb|EIE23729.1| hypothetical protein COCSUDRAFT_66074 [Coccomyxa subellipsoidea
C-169]
Length = 114
Score = 70.9 bits (172), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 54/102 (52%)
Query: 108 MIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEI 167
M+ VHL + + L V+AK++ +PQ Y+ H L F WPKH+L+ Y+W Q +
Sbjct: 1 MLHVHLTRTSDALTSVSAKVLPVPQGYLYAHHQLFDEFQNATVWPKHLLVEYSWTTQHSV 60
Query: 168 DVASGFYVLFASGLMLSFILSIYILESSREKFTRFLQETVAE 209
D S YV+F L++ L++ ++ + K F+ E E
Sbjct: 61 DALSALYVIFFICLLVFMALALNVVVAYETKLKAFMAEIAGE 102
>gi|424512876|emb|CCO66460.1| unknown protein [Bathycoccus prasinos]
Length = 318
Score = 67.0 bits (162), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 85/211 (40%), Gaps = 49/211 (23%)
Query: 29 GDIVPMSKMSQYHNS--RTVWHDVIGKHCPIFAVNREALIPIAKPTGFT----------- 75
GDIVP S +S RT W DV+ KHCP F N+ + KPT +
Sbjct: 67 GDIVPTSSRMHLKSSKQRTQWMDVLEKHCPSFGRNKMVAYRVEKPTSHSLMTTTEGEEEG 126
Query: 76 ---GADPVKISFQVGREKFRVPWLFVINRKSSQV---------------PMIDVHLRYSG 117
++ +KI F E+ W+ + R SS V PMI +
Sbjct: 127 KEQSSNEIKIQFAFDNERHFTVWMPL--RFSSSVSPPRTTTTKGKEKKIPMITFTFTHHA 184
Query: 118 NDLLGVT----------AKIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEI 167
++ V AK Q Y + + + WPKHVL++Y + E+ +
Sbjct: 185 GYIVNVKSESSFIERKGAKKQLHQQHYEALAEEWSSTLARRSEWPKHVLLKYEFIEKESV 244
Query: 168 DVASGFYVLFASGLMLSFILSIYILESSREK 198
DV G G+++SF+L +++ SR K
Sbjct: 245 DVNKGL------GVLMSFMLLVFVASVSRVK 269
>gi|66816681|ref|XP_642350.1| hypothetical protein DDB_G0278411 [Dictyostelium discoideum AX4]
gi|60470398|gb|EAL68378.1| hypothetical protein DDB_G0278411 [Dictyostelium discoideum AX4]
Length = 334
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 78/175 (44%), Gaps = 19/175 (10%)
Query: 25 AYRPGDIVPMSKMSQYHNSRTV-WHDVIGKHCPIFAVN-REALIPIAKPTGFTGADPV-- 80
AY+ +IV + S TV HD+ K P F + ++L+ + D +
Sbjct: 22 AYKDREIVRTHLQTIIGTSGTVDNHDISTKLAPRFKRDLSKSLLAVDSKINGKKVDILPH 81
Query: 81 ---KISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEI 137
+I F +KF+ W+ + + + + ID YS + KI+D+ +
Sbjct: 82 TLYRILFSFDNDKFKTSWITICDGNGTYLNQIDFSFTYSND-------KILDL-----KW 129
Query: 138 HPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYIL 192
H D + P + I Y W+E + D++SG ++LF G+++S + IYI+
Sbjct: 130 HLDYNTEEVHHKKKPNNFYINYKWKEIKDKDISSGLFILFLFGILISSVSVIYII 184
>gi|328867371|gb|EGG15754.1| hypothetical protein DFA_10597 [Dictyostelium fasciculatum]
Length = 259
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 63/150 (42%), Gaps = 13/150 (8%)
Query: 25 AYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISF 84
A R GD++ SK S + T W D+ K P F +++ + T K+SF
Sbjct: 32 ALRTGDLIRFSKKSIHDMKSTEWTDIKAKFSPRFKIDKTITLSSLTSVDLTKDSLYKMSF 91
Query: 85 QVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKY 144
+KF W+ V + + + I+ +L Y+ +DL+ V + EIH +
Sbjct: 92 SFF-DKFTTTWITVADGNGTFLNHIEFNLYYANDDLIDVK---FTLDYNDEEIHRGMK-- 145
Query: 145 FWQPESWPKHVLIRYTWEEQSEIDVASGFY 174
P H + Y W + +E D+ + +
Sbjct: 146 -------PDHFYLIYKWHQMNEHDIPTSLF 168
>gi|413944510|gb|AFW77159.1| hypothetical protein ZEAMMB73_638350, partial [Zea mays]
Length = 78
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 31/42 (73%), Gaps = 1/42 (2%)
Query: 2 TPRRRTIMTSLLLLLLS-LPSISLAYRPGDIVPMSKMSQYHN 42
+P R ++ +LLL+L S LPS++ AYRPGDIVPM + QYH
Sbjct: 35 SPARHGVLPALLLILCSSLPSLAAAYRPGDIVPMLRSGQYHG 76
>gi|330803600|ref|XP_003289792.1| hypothetical protein DICPUDRAFT_80560 [Dictyostelium purpureum]
gi|325080103|gb|EGC33673.1| hypothetical protein DICPUDRAFT_80560 [Dictyostelium purpureum]
Length = 312
Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 78/172 (45%), Gaps = 18/172 (10%)
Query: 25 AYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPV---- 80
AY+P +IV ++ ++ + HD+ + P F V+ + + A+ +
Sbjct: 18 AYKPREIVRVNLQTKT-GANIEHHDLSPRIAPKFKVDLSKSLLNLDSSNNKKAELLPNIL 76
Query: 81 -KISFQVGREKFRVPWLFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHP 139
KI F ++F+ W+ V + + ID YS + KI+D+ +Y +
Sbjct: 77 YKILFSFDNDRFKTTWITVSDGNGKFLNHIDFTFIYSND-------KIIDL--KYNLDYN 127
Query: 140 DLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGFYVLFASGLMLSFILSIYI 191
D + + +S I Y WEE + D++SG +VLF GL++S I +I
Sbjct: 128 DEAVHHGKKQS---DFYINYRWEEIRDKDLSSGLFVLFTVGLIVSGISFTHI 176
>gi|290977722|ref|XP_002671586.1| predicted protein [Naegleria gruberi]
gi|284085156|gb|EFC38842.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 45.4 bits (106), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 39/196 (19%), Positives = 76/196 (38%), Gaps = 39/196 (19%)
Query: 25 AYRPGDIVPMSKMSQYHNSRTVWHDVIGKHCPIFAVNR-----------EALIPIAK-PT 72
++ GD +PM K Q++ RT W +V P + + + EA + K P
Sbjct: 30 GHKIGDAIPMLKKIQFNEKRTEWSEVPYGDAPRYGIPKKITITGVAAILEAFLETPKSPA 89
Query: 73 GFT---------GADPVKISFQVGREKFRVPWLFVINRKSSQ----------VPMIDVHL 113
G D +K+SF F PW+ + ++ + + ID
Sbjct: 90 GTDKYYEELIAQSKDDLKLSFAFHHPNFETPWITLFSKGTPDEVTSKNSYHFLKQIDFMF 149
Query: 114 RYSGNDLLGVTAKIVDMPQRYVEIHPDLPKYFWQPESWPKHVLIRYTWEEQSEIDVASGF 173
Y G+ +L + + +E + + + +P + +++ Y W+E E D ++G
Sbjct: 150 LYHGDTILEI--------KHSLEYYDEEDEASNRPSTINNDIVVNYYWKEVVEKDTSAGL 201
Query: 174 YVLFASGLMLSFILSI 189
L+ + I+ I
Sbjct: 202 TFLYVISFIFGSIIMI 217
>gi|261380902|ref|ZP_05985475.1| transcriptional regulator, LysR family [Neisseria subflava NJ9703]
gi|284796151|gb|EFC51498.1| transcriptional regulator, LysR family [Neisseria subflava NJ9703]
Length = 301
Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 54/113 (47%), Gaps = 18/113 (15%)
Query: 36 KMSQYHNSRTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISFQVGREKFRVPW 95
++ QY T WHD + + I E + I+ PTG++ +P+K + Q R ++ P
Sbjct: 64 QLQQYARQLTDWHDAVEREMGILQTEPEGEVRISLPTGYSAVEPMKRTVQTLRHRY--PK 121
Query: 96 LFVINRKSSQVPMIDVHLRYSGNDLLGVTAKIVDMPQRYVEIHPDLPKYFWQP 148
+ +I ++++ ++D+ ND D+ R V +HPD P +P
Sbjct: 122 IRLILNENNR--LVDLQ-----ND--------TDIAIRVV-LHPDDPDSIARP 158
>gi|332638977|ref|ZP_08417840.1| L-ribulose-5-phosphate 4-epimerase [Weissella cibaria KACC 11862]
Length = 237
Score = 37.0 bits (84), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 152 PKHVLIRYTWEEQSEIDVASGFYVLFASGL 181
PKH LI++TW SEID SG +V+ SG+
Sbjct: 17 PKHDLIKFTWGNVSEIDRESGLFVIKPSGV 46
>gi|440791471|gb|ELR12709.1| hypothetical protein ACA1_092190 [Acanthamoeba castellanii str.
Neff]
Length = 337
Score = 36.6 bits (83), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 2/81 (2%)
Query: 44 RTVWHDVIGKHCPIFAVNREALIPIAKPTGFTGADPVKISFQVGREKFRVPWLFVINRKS 103
T W + K+ P F + +++P + D K+SF KF PW+ V + K
Sbjct: 28 ETPWTGLTAKYSPRFRYEKASMLPNPQ-QDIKAHDFFKLSFSFVDGKFTTPWITVWDAKQ 86
Query: 104 S-QVPMIDVHLRYSGNDLLGV 123
+ + I+ YSGN+++GV
Sbjct: 87 NIYLNYIEFVFLYSGNEIIGV 107
>gi|341820777|emb|CCC57081.1| L-ribulose-5-phosphate 4-epimerase [Weissella thailandensis fsh4-2]
Length = 234
Score = 36.6 bits (83), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 152 PKHVLIRYTWEEQSEIDVASGFYVLFASGL 181
PKH LI++TW SEID SG +V+ SG+
Sbjct: 17 PKHGLIKFTWGNVSEIDRESGLFVIKPSGV 46
>gi|403236994|ref|ZP_10915580.1| L-ribulose-5-phosphate 4-epimerase [Bacillus sp. 10403023]
Length = 231
Score = 36.2 bits (82), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 21/30 (70%)
Query: 152 PKHVLIRYTWEEQSEIDVASGFYVLFASGL 181
PKH L+++TW S ID ASG +V+ SG+
Sbjct: 17 PKHGLVKFTWGNASAIDHASGLFVIKPSGV 46
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.138 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,447,248,314
Number of Sequences: 23463169
Number of extensions: 141324081
Number of successful extensions: 318553
Number of sequences better than 100.0: 45
Number of HSP's better than 100.0 without gapping: 38
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 318502
Number of HSP's gapped (non-prelim): 45
length of query: 221
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 84
effective length of database: 9,144,741,214
effective search space: 768158261976
effective search space used: 768158261976
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 74 (33.1 bits)