BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 021746
(308 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225443239|ref|XP_002271931.1| PREDICTED: uncharacterized protein LOC100263834 [Vitis vinifera]
Length = 301
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 227/284 (79%), Positives = 245/284 (86%), Gaps = 6/284 (2%)
Query: 31 AKPTPVSAFAM------ASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD 84
+KPTP F +S TQVAPA IVGGGRVG AL+ MG G DLLVKRG+ VPLD
Sbjct: 18 SKPTPPIKFRFRFRVRASSAMATQVAPAVIVGGGRVGRALQGMGSGDDLLVKRGDSVPLD 77
Query: 85 FEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSK 144
F GPI VCTRNDDL+AVLE+ P+SRW+DLVFFQNGM+EPWL+SKGL DA+QVLAYFAVSK
Sbjct: 78 FAGPILVCTRNDDLDAVLESTPKSRWSDLVFFQNGMLEPWLQSKGLSDADQVLAYFAVSK 137
Query: 145 LGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISA 204
LGE PIDGKTDTNPEGLTAAYGKWAS VA RL GGLSCKVLDKEAFQKQMLEKLIWISA
Sbjct: 138 LGEPPIDGKTDTNPEGLTAAYGKWASAVAGRLHAGGLSCKVLDKEAFQKQMLEKLIWISA 197
Query: 205 FMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVAN 264
FMLVGARH GATVGVVEKE+RSEVS+LI ELA AAA EKGI FD MEDRLCAYSRAV++
Sbjct: 198 FMLVGARHPGATVGVVEKEFRSEVSSLIGELASAAAVEKGIVFDEGMEDRLCAYSRAVSH 257
Query: 265 FPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
FPTA KEFKWRNGWFYSLS+KA AEGKPDPCPLHT WLKE+K+V
Sbjct: 258 FPTAAKEFKWRNGWFYSLSQKAIAEGKPDPCPLHTEWLKELKIV 301
>gi|356526300|ref|XP_003531756.1| PREDICTED: uncharacterized protein LOC100810662 [Glycine max]
Length = 294
Score = 459 bits (1180), Expect = e-127, Method: Compositional matrix adjust.
Identities = 215/286 (75%), Positives = 250/286 (87%), Gaps = 3/286 (1%)
Query: 23 PTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVP 82
PT S RF + A + T+VAPA IVGGGRVG AL++MG GQDLLV+RGE VP
Sbjct: 12 PTTSAFRFKASAMATTIAAPA---TKVAPAVIVGGGRVGRALQDMGTGQDLLVRRGESVP 68
Query: 83 LDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAV 142
L+FEGPIFVCTRNDDLE+VL++ P SRW DLVFFQNGM+EPWLESKGL+DANQVLAYFAV
Sbjct: 69 LNFEGPIFVCTRNDDLESVLQSTPSSRWGDLVFFQNGMLEPWLESKGLEDANQVLAYFAV 128
Query: 143 SKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWI 202
SK+GE PIDG+TD NPEGLTAAYGKWAS+VA RL+ GGLSCKVLDKE FQKQMLEKLIWI
Sbjct: 129 SKIGETPIDGRTDINPEGLTAAYGKWASIVAARLNAGGLSCKVLDKEVFQKQMLEKLIWI 188
Query: 203 SAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAV 262
+ MLVGARH G +VGVV+KE+R+E+S+LI ELA AA++EKG+TF+ AME+RLCAYSRAV
Sbjct: 189 CSVMLVGARHGGVSVGVVDKEFRTELSSLITELASAASSEKGLTFEEAMEERLCAYSRAV 248
Query: 263 ANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
A+FPTAVKEFKWRNGWFYSLSEKA+A+GKPDPCPLH+ WLKE+++V
Sbjct: 249 AHFPTAVKEFKWRNGWFYSLSEKATAQGKPDPCPLHSQWLKELRIV 294
>gi|224115808|ref|XP_002332062.1| predicted protein [Populus trichocarpa]
gi|222831948|gb|EEE70425.1| predicted protein [Populus trichocarpa]
Length = 269
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 217/261 (83%), Positives = 238/261 (91%)
Query: 48 QVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPR 107
+V PA IVG GRVG AL+EMG GQDLLVKRGE VPLDFEGPI VCTRNDDL+AVLEA P+
Sbjct: 9 KVTPAVIVGAGRVGRALQEMGSGQDLLVKRGEPVPLDFEGPILVCTRNDDLDAVLEATPK 68
Query: 108 SRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGK 167
+RW+DLVFFQNGM+EPW +SKGL DA+ VLAYFAVSKLGE P DGKTDTNPEGLTAAYGK
Sbjct: 69 ARWSDLVFFQNGMLEPWFQSKGLGDADHVLAYFAVSKLGEPPTDGKTDTNPEGLTAAYGK 128
Query: 168 WASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSE 227
WAS VA RL+ GGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARH GATVGV+EKE+ SE
Sbjct: 129 WASAVAARLNAGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHPGATVGVMEKEFHSE 188
Query: 228 VSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKAS 287
VS+LI ELA AAAAEKGI F+ A+E+RLCAY+RAVA+FPTAVKEFKWRNGWFYSLSEKA
Sbjct: 189 VSSLITELASAAAAEKGIVFEEAIEERLCAYARAVAHFPTAVKEFKWRNGWFYSLSEKAV 248
Query: 288 AEGKPDPCPLHTAWLKEIKVV 308
+EGKPDPCPLHT+WLKE+KVV
Sbjct: 249 SEGKPDPCPLHTSWLKELKVV 269
>gi|118484329|gb|ABK94042.1| unknown [Populus trichocarpa]
Length = 309
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 232/309 (75%), Positives = 257/309 (83%), Gaps = 1/309 (0%)
Query: 1 MAANALTRQLSSTSTLWPPLKKPTFSKPRFAKPTPVSAFAMASF-TTTQVAPAAIVGGGR 59
MA + LT L ++S + K F + AMA+ T +VAPA IVGGGR
Sbjct: 1 MATHTLTATLCNSSLIKSNSKFTFFGTRIITRTQKTQTLAMATAPTAAKVAPAVIVGGGR 60
Query: 60 VGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNG 119
VG AL+EMG GQDLLVKRG+ VPLDFEGPI VCTRNDDL+AVLE P+SRW+DLVFFQNG
Sbjct: 61 VGRALQEMGSGQDLLVKRGDPVPLDFEGPILVCTRNDDLDAVLEVTPKSRWSDLVFFQNG 120
Query: 120 MIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVG 179
M+EPW +SKGL DA+QVLAYFAVSKLGE PIDGKTDT+PEGLTAAYGKWAS VA RL G
Sbjct: 121 MLEPWFQSKGLGDADQVLAYFAVSKLGEPPIDGKTDTDPEGLTAAYGKWASAVAARLHAG 180
Query: 180 GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAA 239
GLSCKVLDKE FQKQMLEKLIWISAFML GARH GATVG VEKE+RSEVS+LIAELA AA
Sbjct: 181 GLSCKVLDKETFQKQMLEKLIWISAFMLAGARHPGATVGAVEKEFRSEVSSLIAELASAA 240
Query: 240 AAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHT 299
AAEKGI F+ A+E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSLS+KA AEGKPDPCPLHT
Sbjct: 241 AAEKGIVFEEAIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSDKAVAEGKPDPCPLHT 300
Query: 300 AWLKEIKVV 308
AW+KE+KVV
Sbjct: 301 AWIKELKVV 309
>gi|356523119|ref|XP_003530189.1| PREDICTED: uncharacterized protein LOC100795565 [Glycine max]
Length = 298
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 211/264 (79%), Positives = 242/264 (91%), Gaps = 1/264 (0%)
Query: 46 TTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAA 105
+T+VAPA IVGGGRVG AL++MG GQDLLV+RGE VP +FEGPIFVCTRNDDLE+VL++
Sbjct: 35 STKVAPAVIVGGGRVGRALQDMGTGQDLLVRRGESVPPNFEGPIFVCTRNDDLESVLQST 94
Query: 106 PRSRWN-DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
P SRW DLVFFQNGM+EPWLESKGL+DANQVLAYFAVSK+GE PIDG+TDTNPEGLTAA
Sbjct: 95 PSSRWRADLVFFQNGMMEPWLESKGLEDANQVLAYFAVSKIGESPIDGRTDTNPEGLTAA 154
Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
YGKWASVVA RL+ GGLSCKVLDKEAFQKQMLEKLIWI + MLVGARH G +VGVVEKE+
Sbjct: 155 YGKWASVVAARLNAGGLSCKVLDKEAFQKQMLEKLIWICSVMLVGARHGGVSVGVVEKEF 214
Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
R+E+S+LI ELA AAA+EKG+ F+ AME+RLCAYSRAVA+FPTAVKEFKWRNGWF SLSE
Sbjct: 215 RTELSSLITELASAAASEKGLKFEEAMEERLCAYSRAVAHFPTAVKEFKWRNGWFCSLSE 274
Query: 285 KASAEGKPDPCPLHTAWLKEIKVV 308
KA+A+GKPDPCPLH+ WLKE+++V
Sbjct: 275 KANAQGKPDPCPLHSQWLKELRIV 298
>gi|224108611|ref|XP_002333371.1| predicted protein [Populus trichocarpa]
gi|222836353|gb|EEE74760.1| predicted protein [Populus trichocarpa]
Length = 269
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 224/264 (84%), Positives = 241/264 (91%)
Query: 45 TTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
T +VAPA IVGGGRVG AL+EMG GQDLLVKRG+ VPLDFEGPI VCTRNDDL+AVLE
Sbjct: 6 TAAKVAPAVIVGGGRVGRALQEMGSGQDLLVKRGDPVPLDFEGPILVCTRNDDLDAVLEV 65
Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
P+SRW+DLVFFQNGM+EPW +SKGL DA+QVLAYFAVSKLGE PIDGKTDTNPEGLTAA
Sbjct: 66 TPKSRWSDLVFFQNGMLEPWFQSKGLGDADQVLAYFAVSKLGEPPIDGKTDTNPEGLTAA 125
Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
YGKWAS VA RL GGLSCKVLDKE FQKQMLEKLIWISAFMLVGARH GATVG VEKE+
Sbjct: 126 YGKWASAVAARLHAGGLSCKVLDKETFQKQMLEKLIWISAFMLVGARHPGATVGAVEKEF 185
Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
RSEVS+LIAELA AAAAEKGI F+ A+E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSLS+
Sbjct: 186 RSEVSSLIAELASAAAAEKGIVFEEAIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSD 245
Query: 285 KASAEGKPDPCPLHTAWLKEIKVV 308
KA AEGKPDPCPLHTAWLKE+KVV
Sbjct: 246 KAVAEGKPDPCPLHTAWLKELKVV 269
>gi|224072767|ref|XP_002303871.1| predicted protein [Populus trichocarpa]
gi|222841303|gb|EEE78850.1| predicted protein [Populus trichocarpa]
Length = 269
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 222/264 (84%), Positives = 240/264 (90%)
Query: 45 TTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
T +V PA IVGGGRVG AL+EMG GQDLLVKRG+ VPLDFEGP+ VCTRNDDL+AVLE
Sbjct: 6 TAAKVVPAVIVGGGRVGRALQEMGSGQDLLVKRGDPVPLDFEGPVLVCTRNDDLDAVLEV 65
Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
P+SRW+DLVFFQNGM+EPW +SKGL DA+QVLAYFAVSKLGE PIDGKTDTNPEGLTAA
Sbjct: 66 TPKSRWSDLVFFQNGMLEPWFQSKGLGDADQVLAYFAVSKLGEPPIDGKTDTNPEGLTAA 125
Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
YGKWAS VA RL GGLSCKVLDKE FQKQMLEKLIWISAFMLVGARH GATVG VEKE+
Sbjct: 126 YGKWASAVAARLHAGGLSCKVLDKETFQKQMLEKLIWISAFMLVGARHPGATVGAVEKEF 185
Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
RSEVS+LIAELA AAAAEKGI F+ A+E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSLS+
Sbjct: 186 RSEVSSLIAELASAAAAEKGIVFEEAIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSD 245
Query: 285 KASAEGKPDPCPLHTAWLKEIKVV 308
KA AEGKPDPCPLHTAWLKE+KVV
Sbjct: 246 KAVAEGKPDPCPLHTAWLKELKVV 269
>gi|449436211|ref|XP_004135887.1| PREDICTED: uncharacterized protein LOC101216543 [Cucumis sativus]
gi|449509154|ref|XP_004163511.1| PREDICTED: uncharacterized protein LOC101231909 [Cucumis sativus]
Length = 300
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 227/293 (77%), Positives = 246/293 (83%), Gaps = 8/293 (2%)
Query: 24 TFSKPRFAK--PTPVSAFA------MASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLV 75
T S P PVS F MA+ T T+V PA IVG GRVG AL +MG G+D+LV
Sbjct: 8 TISNPSLCSQLSKPVSRFGARRLRIMAAATETKVVPAVIVGSGRVGRALLDMGNGEDVLV 67
Query: 76 KRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQ 135
KRGE VPLDF GPI VCTRNDDLEAVLEA PRSRWNDLVFFQNGM++PW ESKGLKDANQ
Sbjct: 68 KRGESVPLDFSGPILVCTRNDDLEAVLEATPRSRWNDLVFFQNGMLDPWYESKGLKDANQ 127
Query: 136 VLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQM 195
VLAYFA+SKLGE P+DG TDTNPEGLTAAYGKWAS VA RLS GLSCKVL KEAF+KQM
Sbjct: 128 VLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLSAAGLSCKVLGKEAFEKQM 187
Query: 196 LEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRL 255
LEKLIWISAFMLVGARH GATVG VEK+YRSEVS+LIAELA AAAAE+ + F+ +E+RL
Sbjct: 188 LEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELASAAAAERQLVFEEGIEERL 247
Query: 256 CAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
CAYSRAVA+FPTAVKEFKWRNGWFYSLSEKA A GKPDPCPLHTAWLKE+KVV
Sbjct: 248 CAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLKELKVV 300
>gi|297844530|ref|XP_002890146.1| hypothetical protein ARALYDRAFT_471810 [Arabidopsis lyrata subsp.
lyrata]
gi|297335988|gb|EFH66405.1| hypothetical protein ARALYDRAFT_471810 [Arabidopsis lyrata subsp.
lyrata]
Length = 313
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 214/258 (82%), Positives = 231/258 (89%)
Query: 51 PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRW 110
PA IVGGGRVG AL+EMG G+DLLVKRGE VP+DFEGPI VCTRNDDL+AVLEA P+SRW
Sbjct: 56 PAVIVGGGRVGRALQEMGNGEDLLVKRGEAVPVDFEGPILVCTRNDDLDAVLEATPQSRW 115
Query: 111 NDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWAS 170
DLVFFQNGM+EPW ESKGL D +QVLAYFAVSKLGE P+DGKTDTNPEGLTAAYGKWAS
Sbjct: 116 KDLVFFQNGMMEPWFESKGLGDTDQVLAYFAVSKLGEPPVDGKTDTNPEGLTAAYGKWAS 175
Query: 171 VVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSA 230
+A RL GGLSCKVLDKEAFQKQMLEKLIWI AFMLVGARH GA+VG VEKEYR EVS
Sbjct: 176 EIAARLQSGGLSCKVLDKEAFQKQMLEKLIWICAFMLVGARHPGASVGTVEKEYRDEVSI 235
Query: 231 LIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEG 290
LI ELA AAAAEKG+TF+ M +RLCAYSRAV++FPTAVKEFKWRNGWFYSLSEKA AEG
Sbjct: 236 LIQELAAAAAAEKGLTFEENMVERLCAYSRAVSHFPTAVKEFKWRNGWFYSLSEKAIAEG 295
Query: 291 KPDPCPLHTAWLKEIKVV 308
KPDPCPLHT WLKE+KV+
Sbjct: 296 KPDPCPLHTEWLKELKVI 313
>gi|18394322|ref|NP_563991.1| uncharacterized protein [Arabidopsis thaliana]
gi|6587814|gb|AAF18505.1|AC010924_18 ESTs gb|AI992412, gb|AI994629 come from this gene [Arabidopsis
thaliana]
gi|12083328|gb|AAG48823.1|AF332460_1 unknown protein [Arabidopsis thaliana]
gi|18176355|gb|AAL60029.1| unknown protein [Arabidopsis thaliana]
gi|332191287|gb|AEE29408.1| uncharacterized protein [Arabidopsis thaliana]
Length = 313
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 223/287 (77%), Positives = 248/287 (86%), Gaps = 2/287 (0%)
Query: 22 KPTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELV 81
KP+ S+ RFA P + A ++ T ++APA IVGGGRVG AL+EMG G+DLLVKRGE V
Sbjct: 29 KPSVSRARFAVPM--AMAAASAATAKKLAPAVIVGGGRVGRALQEMGNGEDLLVKRGEAV 86
Query: 82 PLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFA 141
P+DFEGPI VCTRNDDL+AVLEA P+SRW DLVFFQNGM+EPW ESKGL D +QVLAYFA
Sbjct: 87 PVDFEGPILVCTRNDDLDAVLEATPQSRWKDLVFFQNGMMEPWFESKGLGDTDQVLAYFA 146
Query: 142 VSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIW 201
VSKLGE P+DGKTDTNPEGLTAAYGKWAS +A RL GGLSCKVLDKEAFQKQMLEKLIW
Sbjct: 147 VSKLGEPPVDGKTDTNPEGLTAAYGKWASEIAARLQSGGLSCKVLDKEAFQKQMLEKLIW 206
Query: 202 ISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRA 261
I AFMLVGARH GA+VG VEKEYR EVS LI ELA AAAAEKG+TF+ M +RLCAYSRA
Sbjct: 207 ICAFMLVGARHPGASVGTVEKEYRDEVSRLIQELAAAAAAEKGLTFEENMVERLCAYSRA 266
Query: 262 VANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
V++FPTAVKEFKWRNGWFYSLSEKA AEG+PDPCPLHT WLKE+KV+
Sbjct: 267 VSHFPTAVKEFKWRNGWFYSLSEKAIAEGQPDPCPLHTEWLKELKVI 313
>gi|224072759|ref|XP_002303867.1| predicted protein [Populus trichocarpa]
gi|222841299|gb|EEE78846.1| predicted protein [Populus trichocarpa]
Length = 276
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 224/271 (82%), Positives = 241/271 (88%), Gaps = 7/271 (2%)
Query: 45 TTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
T +VAPA IVGGGRVG AL+EMG GQDLLVKRG+ VPLDFEGPI VCTRNDDL+AVLE
Sbjct: 6 TAAKVAPAVIVGGGRVGRALQEMGSGQDLLVKRGDPVPLDFEGPILVCTRNDDLDAVLEV 65
Query: 105 APRSRWN-------DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTN 157
P+SRW+ DLVFFQNGM+EPW +SKGL DA+QVLAYFAVSKLGE PIDGKTDTN
Sbjct: 66 TPKSRWSVDVDNLADLVFFQNGMLEPWFQSKGLGDADQVLAYFAVSKLGEPPIDGKTDTN 125
Query: 158 PEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATV 217
PEGLTAAYGKWAS VA RL GGLSCKVLDKE FQKQMLEKLIWISAFMLVGARH GATV
Sbjct: 126 PEGLTAAYGKWASAVAARLHAGGLSCKVLDKETFQKQMLEKLIWISAFMLVGARHPGATV 185
Query: 218 GVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNG 277
G VEKE+RSEVS+LIAELA AAAAEKGI F+ A+E+RLCAYSRAVA+FPTAVKEFKWRNG
Sbjct: 186 GAVEKEFRSEVSSLIAELASAAAAEKGIVFEEAIEERLCAYSRAVAHFPTAVKEFKWRNG 245
Query: 278 WFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
WFYSLS+KA AEGKPDPCPLHTAWLKE+KVV
Sbjct: 246 WFYSLSDKAVAEGKPDPCPLHTAWLKELKVV 276
>gi|298204760|emb|CBI25258.3| unnamed protein product [Vitis vinifera]
Length = 242
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 205/242 (84%), Positives = 220/242 (90%)
Query: 67 MGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLE 126
MG G DLLVKRG+ VPLDF GPI VCTRNDDL+AVLE+ P+SRW+DLVFFQNGM+EPWL+
Sbjct: 1 MGSGDDLLVKRGDSVPLDFAGPILVCTRNDDLDAVLESTPKSRWSDLVFFQNGMLEPWLQ 60
Query: 127 SKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVL 186
SKGL DA+QVLAYFAVSKLGE PIDGKTDTNPEGLTAAYGKWAS VA RL GGLSCKVL
Sbjct: 61 SKGLSDADQVLAYFAVSKLGEPPIDGKTDTNPEGLTAAYGKWASAVAGRLHAGGLSCKVL 120
Query: 187 DKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGIT 246
DKEAFQKQMLEKLIWISAFMLVGARH GATVGVVEKE+RSEVS+LI ELA AAA EKGI
Sbjct: 121 DKEAFQKQMLEKLIWISAFMLVGARHPGATVGVVEKEFRSEVSSLIGELASAAAVEKGIV 180
Query: 247 FDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIK 306
FD MEDRLCAYSRAV++FPTA KEFKWRNGWFYSLS+KA AEGKPDPCPLHT WLKE+K
Sbjct: 181 FDEGMEDRLCAYSRAVSHFPTAAKEFKWRNGWFYSLSQKAIAEGKPDPCPLHTEWLKELK 240
Query: 307 VV 308
+V
Sbjct: 241 IV 242
>gi|148908977|gb|ABR17592.1| unknown [Picea sitchensis]
Length = 311
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 200/290 (68%), Positives = 237/290 (81%), Gaps = 5/290 (1%)
Query: 22 KPTFSKPRFAKPTP---VSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRG 78
P F+K + P VSA AMA+ + V P+ I+G GRVG AL+ MG G+D++V+RG
Sbjct: 24 NPKFNKSLCSANYPRIRVSA-AMAAVQES-VIPSVIIGAGRVGQALQNMGTGKDIIVRRG 81
Query: 79 ELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLA 138
E VP +F GPIFVCTRNDDL+AVLE PR RW DLVFFQNGM+EPWL+SKGL +A+QVLA
Sbjct: 82 EKVPENFPGPIFVCTRNDDLDAVLEITPRPRWEDLVFFQNGMLEPWLQSKGLVNASQVLA 141
Query: 139 YFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEK 198
YFAV+KLGE PIDGKT+TNPEGLTA GKWAS +A RL + GLSCKVLDK F+K MLEK
Sbjct: 142 YFAVAKLGEAPIDGKTETNPEGLTACSGKWASAIAARLHLAGLSCKVLDKGNFEKSMLEK 201
Query: 199 LIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAY 258
LIWISAFMLVGARH G+TVG VEKEYR+EV +LI ELA AAAAEK I FD +E+RLCAY
Sbjct: 202 LIWISAFMLVGARHQGSTVGTVEKEYRNEVVSLIKELASAAAAEKNIAFDQGLEERLCAY 261
Query: 259 SRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
SR VA+FPTA+KEFKWRNGWFY LS+KA+A+G+PDPCP+HT+WLKE+ +
Sbjct: 262 SRTVAHFPTALKEFKWRNGWFYLLSQKATADGRPDPCPIHTSWLKELGAI 311
>gi|294463883|gb|ADE77464.1| unknown [Picea sitchensis]
Length = 267
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 190/260 (73%), Positives = 223/260 (85%)
Query: 49 VAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRS 108
V P+ I+G GRVG AL+ MG G+D++V+RGE VP +F GPIFVCTRNDDL+AVLE PR
Sbjct: 8 VIPSVIIGAGRVGQALQNMGTGKDIIVRRGEKVPENFPGPIFVCTRNDDLDAVLEITPRP 67
Query: 109 RWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKW 168
RW DLVFFQNGM+EPWL+SKGL +A+QVLAYFAV+KLGE PIDGKT+TNPEGLTA GKW
Sbjct: 68 RWEDLVFFQNGMLEPWLQSKGLVNASQVLAYFAVAKLGEAPIDGKTETNPEGLTACSGKW 127
Query: 169 ASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEV 228
AS +A RL + GLSCKVLDK F+K MLEKLIWISAFMLVGARH G+TVG VEKEYR+EV
Sbjct: 128 ASAIAARLHLAGLSCKVLDKGNFEKSMLEKLIWISAFMLVGARHQGSTVGTVEKEYRNEV 187
Query: 229 SALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASA 288
+LI ELA AAAAEK I FD +E+RLCAYSR VA+FPTA+KEFKWRNGWFY LS+KA+A
Sbjct: 188 VSLIKELASAAAAEKNIAFDQGLEERLCAYSRTVAHFPTALKEFKWRNGWFYLLSQKATA 247
Query: 289 EGKPDPCPLHTAWLKEIKVV 308
+G+PDPCP+HT+WLKE+ +
Sbjct: 248 DGRPDPCPIHTSWLKELGAI 267
>gi|255543391|ref|XP_002512758.1| conserved hypothetical protein [Ricinus communis]
gi|223547769|gb|EEF49261.1| conserved hypothetical protein [Ricinus communis]
Length = 294
Score = 410 bits (1053), Expect = e-112, Method: Compositional matrix adjust.
Identities = 218/303 (71%), Positives = 239/303 (78%), Gaps = 12/303 (3%)
Query: 6 LTRQLSSTSTLWPPLKKPTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALK 65
+T LSS + P K S KP +S AMA+ V+PA IVGGGRVG AL+
Sbjct: 4 ITLSLSSCFIIKPTSKFTFLSTKIITKPQKISILAMATTPAATVSPAVIVGGGRVGRALQ 63
Query: 66 EMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWL 125
EMG GQDLLVKRGE VPLDFEGPI VCT L L+ NGM+EPWL
Sbjct: 64 EMGNGQDLLVKRGEPVPLDFEGPILVCTXXLKLLLSLDGT------------NGMLEPWL 111
Query: 126 ESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKV 185
ESKGL+DA+QVLAYFAV+KLGE PIDGKTDTNPEGLTAAYGKWA+ VA RL GGLSCKV
Sbjct: 112 ESKGLEDADQVLAYFAVAKLGEPPIDGKTDTNPEGLTAAYGKWAAAVASRLHAGGLSCKV 171
Query: 186 LDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGI 245
LDK+AFQKQMLEKLIWI+AFMLVGARH GATVG VEKEYRSEVS+LIAELA AAAAEKGI
Sbjct: 172 LDKDAFQKQMLEKLIWIAAFMLVGARHPGATVGAVEKEYRSEVSSLIAELASAAAAEKGI 231
Query: 246 TFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEI 305
F+ +E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSLSEKA AEGKPDPCPLHTAWLKE+
Sbjct: 232 VFENGVEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAEGKPDPCPLHTAWLKEL 291
Query: 306 KVV 308
KV+
Sbjct: 292 KVI 294
>gi|413922060|gb|AFW61992.1| hypothetical protein ZEAMMB73_116806 [Zea mays]
Length = 307
Score = 392 bits (1008), Expect = e-107, Method: Compositional matrix adjust.
Identities = 216/298 (72%), Positives = 241/298 (80%), Gaps = 13/298 (4%)
Query: 18 PPLKKPTFSKPRFAKPTPVSAFAM------ASFTTTQVAPAAIVGGGRVGTALKEMGK-G 70
PPL++ S P+P SA A A+ TT+V PA +VGGGRVG AL MG G
Sbjct: 16 PPLRRLLLS------PSPSSAAARRIARVSAAMATTEVHPAVVVGGGRVGQALVSMGPPG 69
Query: 71 QDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGL 130
D+LV RGE VP D GPI VCTRNDDL+AVLEA P+SRW DLVFFQNGM++PW + KGL
Sbjct: 70 GDVLVGRGEKVPDDAPGPILVCTRNDDLDAVLEATPKSRWRDLVFFQNGMLDPWFKIKGL 129
Query: 131 KDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEA 190
ANQVLAYFAVSK+GE P+DG TDTNPEGLTAA+G WA VA RL GGL+CKVL+KEA
Sbjct: 130 VGANQVLAYFAVSKVGEPPVDGITDTNPEGLTAAFGSWAPAVAARLQNGGLTCKVLEKEA 189
Query: 191 FQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPA 250
FQKQMLEKLIWISAFMLVGARH GATVGVVEK YRSEV++LIAELA AAAAE+ +TF
Sbjct: 190 FQKQMLEKLIWISAFMLVGARHPGATVGVVEKNYRSEVASLIAELASAAAAEEQLTFAEG 249
Query: 251 MEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
MEDRLCAYSRAVA+FPTAVKEFKWRNGWFYSL+EKA A GKPDPCPLHTAWLKEI+V+
Sbjct: 250 MEDRLCAYSRAVAHFPTAVKEFKWRNGWFYSLTEKALAAGKPDPCPLHTAWLKEIRVI 307
>gi|242078737|ref|XP_002444137.1| hypothetical protein SORBIDRAFT_07g009470 [Sorghum bicolor]
gi|241940487|gb|EES13632.1| hypothetical protein SORBIDRAFT_07g009470 [Sorghum bicolor]
Length = 309
Score = 392 bits (1008), Expect = e-107, Method: Compositional matrix adjust.
Identities = 210/269 (78%), Positives = 231/269 (85%), Gaps = 2/269 (0%)
Query: 42 ASFTTTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDFEGPIFVCTRNDDLEA 100
A+ TT V PA +VGGGRVG AL MG G D+LV RGE VP D GPI VCTRNDDL+
Sbjct: 41 AAMATTAVQPAVVVGGGRVGQALLSMGPPGGDVLVGRGEKVPDDAPGPILVCTRNDDLDG 100
Query: 101 VLEAAPRSRWN-DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
VLEA P+SRW DLVFFQNGM++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPE
Sbjct: 101 VLEATPKSRWRADLVFFQNGMLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPE 160
Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
GLTAA+G WA VA RL GGL+CKVL KEAFQKQMLEKLIWISAFMLVGARH GATVGV
Sbjct: 161 GLTAAFGSWAPAVAARLQNGGLTCKVLQKEAFQKQMLEKLIWISAFMLVGARHPGATVGV 220
Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWF 279
VEK+YRSEV++LIAELA AAAAE+G+TFD +E+RLCAYSRAVA+FPTAVKEFKWRNGWF
Sbjct: 221 VEKDYRSEVASLIAELASAAAAERGLTFDEGIEERLCAYSRAVAHFPTAVKEFKWRNGWF 280
Query: 280 YSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
YSL+EKA A G+PDPCPLHTAWLKEIKV+
Sbjct: 281 YSLTEKALAAGRPDPCPLHTAWLKEIKVI 309
>gi|218200803|gb|EEC83230.1| hypothetical protein OsI_28519 [Oryza sativa Indica Group]
Length = 271
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 207/271 (76%), Positives = 231/271 (85%), Gaps = 3/271 (1%)
Query: 41 MASFTTTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDFEGPIFVCTRNDDLE 99
MA+ T V PA +VGGGRVG AL MG G D+LV RGE VP D GPI VCTRNDDL+
Sbjct: 1 MAAAAKTAVQPAVVVGGGRVGQALLSMGPPGADVLVGRGEKVPEDAPGPILVCTRNDDLD 60
Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
AVL+A PRSRW DLVFFQNGM++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPE
Sbjct: 61 AVLDATPRSRWRDLVFFQNGMLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPE 120
Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
GLTAA+G WA VA RL GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVG
Sbjct: 121 GLTAAFGNWAPAVATRLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGA 180
Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAV--KEFKWRNG 277
VEKEYRSEV++LIAELA AAA E+G+TFD +E+RLCAYSRAVA+FPTA+ ++FKWRNG
Sbjct: 181 VEKEYRSEVASLIAELASAAAEERGLTFDAGIEERLCAYSRAVAHFPTALSMEQFKWRNG 240
Query: 278 WFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
WFYSL+EKA A+GKPDPCP HTAWLKEIKV+
Sbjct: 241 WFYSLTEKALAQGKPDPCPFHTAWLKEIKVI 271
>gi|388506366|gb|AFK41249.1| unknown [Medicago truncatula]
Length = 270
Score = 389 bits (999), Expect = e-106, Method: Compositional matrix adjust.
Identities = 197/254 (77%), Positives = 219/254 (86%), Gaps = 5/254 (1%)
Query: 31 AKPTP-----VSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDF 85
A P P VS+ ++ TTT+VAPA IVGGGRVG AL+ MG GQDLLVKRG+ +PLDF
Sbjct: 17 ATPLPKFRFRVSSVMASTVTTTKVAPAVIVGGGRVGKALEGMGDGQDLLVKRGDSIPLDF 76
Query: 86 EGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKL 145
EGPI VCTRNDDL++VL+ P SRW DLVFFQNGM+EPWLESKGLKD NQVLAYFAVSKL
Sbjct: 77 EGPILVCTRNDDLDSVLQTTPLSRWKDLVFFQNGMLEPWLESKGLKDGNQVLAYFAVSKL 136
Query: 146 GERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAF 205
GE PIDGKTDTNPEGLTAAYGKW+S V+ERL GGLSCKVLDKEAFQKQMLEKLIWI +
Sbjct: 137 GESPIDGKTDTNPEGLTAAYGKWSSAVSERLQAGGLSCKVLDKEAFQKQMLEKLIWICSV 196
Query: 206 MLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANF 265
MLVGARH G +VGVVEKE+ E+ +LIAELA AAA+EKG+TF+ AMEDRLCAYSRAVA+F
Sbjct: 197 MLVGARHGGVSVGVVEKEFHLELCSLIAELASAAASEKGLTFEQAMEDRLCAYSRAVAHF 256
Query: 266 PTAVKEFKWRNGWF 279
PTAVKEFKWRNGWF
Sbjct: 257 PTAVKEFKWRNGWF 270
>gi|168042101|ref|XP_001773528.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675230|gb|EDQ61728.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 269
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/269 (69%), Positives = 212/269 (78%)
Query: 40 AMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLE 99
AMA+ T +VA IVG GRVG A ++MG G+D++V+RGE VP D GPIFVCTRND L
Sbjct: 1 AMATATAGEVAAGVIVGAGRVGQAFEKMGGGRDIVVRRGETVPEDSAGPIFVCTRNDVLG 60
Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
+++A P SR DLVF QNGM+EPWLESKGL DA QVL YFAV+KLG+ P+DGKTDTNPE
Sbjct: 61 TIVDATPASRRGDLVFIQNGMLEPWLESKGLGDATQVLVYFAVAKLGDPPLDGKTDTNPE 120
Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
GLTAA GKWA A RL GLSCKVLD E F+K MLEKLIWI AFMLVGARH GATVGV
Sbjct: 121 GLTAATGKWAKATAARLHSAGLSCKVLDAEEFKKPMLEKLIWICAFMLVGARHPGATVGV 180
Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWF 279
VEKE+R EV LI ELA AA AEK + FD + DRLCAY+R+VA+FPTAVKEF+WRNGWF
Sbjct: 181 VEKEHRKEVEDLINELAGAAEAEKQLKFDAGIVDRLCAYARSVAHFPTAVKEFEWRNGWF 240
Query: 280 YSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
Y +S +A GK DPCPLHTAWLKEI VV
Sbjct: 241 YEISRRALESGKSDPCPLHTAWLKEIGVV 269
>gi|147765908|emb|CAN73372.1| hypothetical protein VITISV_037533 [Vitis vinifera]
Length = 306
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 189/239 (79%), Positives = 202/239 (84%), Gaps = 12/239 (5%)
Query: 47 TQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAP 106
TQV PA IVGGGRVG AL+ MG G DLLVKRG+ VPLDF GPI VCTRNDDL+AVLE+ P
Sbjct: 3 TQVVPAVIVGGGRVGRALQGMGSGDDLLVKRGDSVPLDFAGPILVCTRNDDLDAVLESTP 62
Query: 107 RSRWN------------DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKT 154
+SRW+ DLVFFQNGM+EPWL+SKGL DA+QVLAYFAVSKLGE PIDGKT
Sbjct: 63 KSRWSVHDFVYDLFNLADLVFFQNGMLEPWLQSKGLSDADQVLAYFAVSKLGEPPIDGKT 122
Query: 155 DTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTG 214
DTNPEGLTAAYGKWAS VA RL GGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARH G
Sbjct: 123 DTNPEGLTAAYGKWASAVAGRLHAGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHPG 182
Query: 215 ATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFK 273
ATVGVVEKE+RSEVS+LI ELA AAA EKGI FD MEDRLCAYSRAV++FPTA KE K
Sbjct: 183 ATVGVVEKEFRSEVSSLIGELASAAAVEKGIVFDEGMEDRLCAYSRAVSHFPTAAKEVK 241
>gi|357145388|ref|XP_003573626.1| PREDICTED: uncharacterized protein LOC100834027 [Brachypodium
distachyon]
Length = 310
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 207/264 (78%), Positives = 226/264 (85%), Gaps = 2/264 (0%)
Query: 47 TQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDF-EGPIFVCTRNDDLEAVLEA 104
T V PA +VGGGRVG AL MG G D+LV RGE VP GPI VCTRNDDL+ VLEA
Sbjct: 47 TAVQPAVVVGGGRVGQALLGMGPLGGDVLVGRGEKVPDGAPAGPILVCTRNDDLDGVLEA 106
Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
P+SRW DLVFFQNGM++PW ESKGL ANQVLAYFAVSK+GE PIDG TDTNPEGLTAA
Sbjct: 107 TPKSRWRDLVFFQNGMLDPWFESKGLVGANQVLAYFAVSKVGEPPIDGITDTNPEGLTAA 166
Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
+G WA VA RL GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVGVVEKEY
Sbjct: 167 FGSWAPAVAARLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGVVEKEY 226
Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
RSEVS+LIAELA AAAAE+G+ FD +E+RLCAYSRAVA+FPTAVKEFKWRNGWFYSL+E
Sbjct: 227 RSEVSSLIAELASAAAAERGLMFDEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLTE 286
Query: 285 KASAEGKPDPCPLHTAWLKEIKVV 308
KA A+GK DPCPLHTAWLKEIKV+
Sbjct: 287 KALAQGKTDPCPLHTAWLKEIKVI 310
>gi|326497007|dbj|BAK02088.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 322
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 203/270 (75%), Positives = 226/270 (83%), Gaps = 3/270 (1%)
Query: 42 ASFTTTQVAPAAIVGGGRVGTALKEMG--KGQDLLVKRGELVPLDF-EGPIFVCTRNDDL 98
A+ T V PA +VGGGRVG AL MG G DLL++RGE +P GPI VCTRNDDL
Sbjct: 53 AAVMATAVQPAVVVGGGRVGQALLSMGPSSGGDLLLRRGEALPPSAPAGPILVCTRNDDL 112
Query: 99 EAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNP 158
+ VLEA P+SRW DLVFFQNGM++PWLESKGL ANQVLAYFAVSKLGE P+DG TD NP
Sbjct: 113 DGVLEATPKSRWQDLVFFQNGMLDPWLESKGLAGANQVLAYFAVSKLGEPPVDGITDANP 172
Query: 159 EGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG 218
EGLTAA+G WA VA RL GGL+CKVL K AF+KQMLEKLIWISAFMLVGARH GATVG
Sbjct: 173 EGLTAAFGNWAPAVAARLHNGGLTCKVLHKGAFEKQMLEKLIWISAFMLVGARHPGATVG 232
Query: 219 VVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGW 278
VE+EYRSEVS+LIAELA AAAAE+G++FD ME+RLCAYS AVA+FPTAVKEFKWRNGW
Sbjct: 233 AVEEEYRSEVSSLIAELASAAAAERGLSFDEGMEERLCAYSTAVAHFPTAVKEFKWRNGW 292
Query: 279 FYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
FYSL+EKA A+ KPDPCPLHTAWLKEIKV+
Sbjct: 293 FYSLTEKALAQDKPDPCPLHTAWLKEIKVI 322
>gi|326511878|dbj|BAJ92083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 203/270 (75%), Positives = 226/270 (83%), Gaps = 3/270 (1%)
Query: 42 ASFTTTQVAPAAIVGGGRVGTALKEMG--KGQDLLVKRGELVPLDF-EGPIFVCTRNDDL 98
A+ T V PA +VGGGRVG AL MG G DLL++RGE +P GPI VCTRNDDL
Sbjct: 37 AAVMATAVQPAVVVGGGRVGQALLSMGPSSGGDLLLRRGEALPPSAPAGPILVCTRNDDL 96
Query: 99 EAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNP 158
+ VLEA P+SRW DLVFFQNGM++PWLESKGL ANQVLAYFAVSKLGE P+DG TD NP
Sbjct: 97 DGVLEATPKSRWQDLVFFQNGMLDPWLESKGLAGANQVLAYFAVSKLGEPPVDGITDANP 156
Query: 159 EGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG 218
EGLTAA+G WA VA RL GGL+CKVL K AF+KQMLEKLIWISAFMLVGARH GATVG
Sbjct: 157 EGLTAAFGNWAPAVAARLHNGGLTCKVLHKGAFEKQMLEKLIWISAFMLVGARHPGATVG 216
Query: 219 VVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGW 278
VE+EYRSEVS+LIAELA AAAAE+G++FD ME+RLCAYS AVA+FPTAVKEFKWRNGW
Sbjct: 217 AVEEEYRSEVSSLIAELASAAAAERGLSFDEGMEERLCAYSTAVAHFPTAVKEFKWRNGW 276
Query: 279 FYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
FYSL+EKA A+ KPDPCPLHTAWLKEIKV+
Sbjct: 277 FYSLTEKALAQDKPDPCPLHTAWLKEIKVI 306
>gi|326489475|dbj|BAK01718.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 202/270 (74%), Positives = 226/270 (83%), Gaps = 3/270 (1%)
Query: 42 ASFTTTQVAPAAIVGGGRVGTALKEMG--KGQDLLVKRGELVPLDF-EGPIFVCTRNDDL 98
A+ T V PA +VGGGRVG AL MG G DLL++RGE +P GPI VCTRNDDL
Sbjct: 37 AAVMATAVQPAVVVGGGRVGQALLSMGPSSGGDLLLRRGEALPPSAPAGPILVCTRNDDL 96
Query: 99 EAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNP 158
+ VLEA P+SRW DLVFFQNGM++PWLESKGL ANQVLAYFAVSKLGE P+DG TD NP
Sbjct: 97 DGVLEATPKSRWQDLVFFQNGMLDPWLESKGLAGANQVLAYFAVSKLGEPPVDGITDANP 156
Query: 159 EGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG 218
EGLTAA+G WA VA RL GGL+CKVL K AF+KQMLEKLIWISAFMLVGARH GATVG
Sbjct: 157 EGLTAAFGNWAPAVAARLHNGGLTCKVLHKGAFEKQMLEKLIWISAFMLVGARHPGATVG 216
Query: 219 VVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGW 278
VE+EYRSEVS+LIAELA AAAAE+G++FD ME+RLCAYS AVA+FPTAVKEFKWR+GW
Sbjct: 217 AVEEEYRSEVSSLIAELASAAAAERGLSFDEGMEERLCAYSTAVAHFPTAVKEFKWRSGW 276
Query: 279 FYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
FYSL+EKA A+ KPDPCPLHTAWLKEIKV+
Sbjct: 277 FYSLTEKALAQDKPDPCPLHTAWLKEIKVI 306
>gi|226504428|ref|NP_001145052.1| uncharacterized protein LOC100278239 [Zea mays]
gi|195650351|gb|ACG44643.1| hypothetical protein [Zea mays]
Length = 261
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 202/259 (77%), Positives = 222/259 (85%), Gaps = 1/259 (0%)
Query: 46 TTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
TT+V PA +VGGGRVG AL MG G D+LV RGE VP D GPI VCTRNDDL+AVLEA
Sbjct: 3 TTEVHPAVVVGGGRVGQALVSMGPPGGDVLVGRGEKVPDDAPGPILVCTRNDDLDAVLEA 62
Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
P+SRW DLVFFQNGM++PW + KGL ANQVLAYFAVSK+GE P+DG TDTNPEGLTAA
Sbjct: 63 TPKSRWRDLVFFQNGMLDPWFKIKGLVGANQVLAYFAVSKVGEPPVDGITDTNPEGLTAA 122
Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEY 224
+G WA VA RL GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVGVVEK Y
Sbjct: 123 FGSWAPAVAARLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGVVEKNY 182
Query: 225 RSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSE 284
RSEV++LIAELA AAAAE+ +TF MEDRLCAYSRAVA+FPTAVKEFKWRNGWFYSL+E
Sbjct: 183 RSEVASLIAELASAAAAEEQLTFAEGMEDRLCAYSRAVAHFPTAVKEFKWRNGWFYSLTE 242
Query: 285 KASAEGKPDPCPLHTAWLK 303
KA A GKPDPCPLHTAWL+
Sbjct: 243 KALAAGKPDPCPLHTAWLE 261
>gi|227204173|dbj|BAH56938.1| AT1G16080 [Arabidopsis thaliana]
Length = 309
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 192/250 (76%), Positives = 214/250 (85%), Gaps = 2/250 (0%)
Query: 22 KPTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELV 81
KP+ S+ RFA P ++A + A T ++APA IVGGGRVG AL+EMG G+DLLVKRGE V
Sbjct: 29 KPSVSRARFAVPMAMAAASAA--TAKKLAPAVIVGGGRVGRALQEMGNGEDLLVKRGEAV 86
Query: 82 PLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFA 141
P+DFEGPI VCTRNDDL+AVLEA P+SRW DLVFFQNGM+EPW ESKGL D +QVLAYFA
Sbjct: 87 PVDFEGPILVCTRNDDLDAVLEATPQSRWKDLVFFQNGMMEPWFESKGLGDTDQVLAYFA 146
Query: 142 VSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIW 201
VSKLGE P+DGKTDTNPEGLTAAYGKWAS +A RL GGLSCKVLDKEAFQKQMLEKLIW
Sbjct: 147 VSKLGEPPVDGKTDTNPEGLTAAYGKWASEIAARLQSGGLSCKVLDKEAFQKQMLEKLIW 206
Query: 202 ISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRA 261
I AFMLVGARH GA+VG VEKEYR EVS LI ELA AAAAEKG+TF+ M +RLCAYSRA
Sbjct: 207 ICAFMLVGARHPGASVGTVEKEYRDEVSRLIQELAAAAAAEKGLTFEENMVERLCAYSRA 266
Query: 262 VANFPTAVKE 271
V++FPTAVKE
Sbjct: 267 VSHFPTAVKE 276
>gi|302769095|ref|XP_002967967.1| hypothetical protein SELMODRAFT_169446 [Selaginella moellendorffii]
gi|300164705|gb|EFJ31314.1| hypothetical protein SELMODRAFT_169446 [Selaginella moellendorffii]
Length = 269
Score = 363 bits (932), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 173/259 (66%), Positives = 209/259 (80%), Gaps = 1/259 (0%)
Query: 51 PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE-GPIFVCTRNDDLEAVLEAAPRSR 109
PA IVG GR+G+AL+++G G+D+++ RG+ P GPIFVCTRN+DL+AV++A P R
Sbjct: 5 PAVIVGRGRIGSALEKLGHGRDVVLLRGQPFPSSLSSGPIFVCTRNNDLDAVVDATPAHR 64
Query: 110 WNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWA 169
DLVFFQNGM+EPWLE +GL +A QVL YFAV+KLG+ P+DGKTD NPEGLTAA GKWA
Sbjct: 65 REDLVFFQNGMLEPWLEKRGLAEATQVLVYFAVAKLGDEPVDGKTDANPEGLTAACGKWA 124
Query: 170 SVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVS 229
+ VA RL GGLSCKVLD F K MLEKLIWISAFMLVGARH GATVG VEK YR EVS
Sbjct: 125 AAVAARLHAGGLSCKVLDAREFAKPMLEKLIWISAFMLVGARHPGATVGDVEKRYRDEVS 184
Query: 230 ALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAE 289
LI ELA AA++EK + FDP +E+RLC+Y+R+VA FPTAVKEF+WRNGWFY +SE+A A
Sbjct: 185 ELITELAAAASSEKSVAFDPGLEERLCSYARSVAYFPTAVKEFEWRNGWFYGISERAIAS 244
Query: 290 GKPDPCPLHTAWLKEIKVV 308
G DPCPLH+ WL+E++ V
Sbjct: 245 GTTDPCPLHSRWLQEVEAV 263
>gi|302761062|ref|XP_002963953.1| hypothetical protein SELMODRAFT_166437 [Selaginella moellendorffii]
gi|300167682|gb|EFJ34286.1| hypothetical protein SELMODRAFT_166437 [Selaginella moellendorffii]
Length = 269
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 173/259 (66%), Positives = 208/259 (80%), Gaps = 1/259 (0%)
Query: 51 PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE-GPIFVCTRNDDLEAVLEAAPRSR 109
PA IVG GR+G+AL+++G G+D+++ RG+ P GPIFVCTRN+DL+AV++A P R
Sbjct: 5 PAVIVGRGRIGSALEKLGHGRDVVLLRGQPFPSSLSSGPIFVCTRNNDLDAVVDATPAHR 64
Query: 110 WNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWA 169
DLVFFQNGM+EPWLE +GL DA QVL YFAV+KLG+ P+DGKTD NPEGLTAA GKWA
Sbjct: 65 REDLVFFQNGMLEPWLEKRGLADATQVLVYFAVAKLGDEPVDGKTDANPEGLTAACGKWA 124
Query: 170 SVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVS 229
+ VA RL GGLSCKVLD F K MLEKLIWISAFMLVGARH GATVG VEK YR EVS
Sbjct: 125 AAVAARLHAGGLSCKVLDAREFAKPMLEKLIWISAFMLVGARHPGATVGDVEKRYRDEVS 184
Query: 230 ALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAE 289
LI ELA AA++EK F P +E+RLC+Y+R+VA+FPTAVKEF+WRNGWFY +SE+A A
Sbjct: 185 ELITELAAAASSEKSAAFGPGLEERLCSYARSVAHFPTAVKEFEWRNGWFYGISERAIAS 244
Query: 290 GKPDPCPLHTAWLKEIKVV 308
G DPCPLH+ WL+E++ V
Sbjct: 245 GTTDPCPLHSRWLQEVEAV 263
>gi|224053056|ref|XP_002297685.1| predicted protein [Populus trichocarpa]
gi|222844943|gb|EEE82490.1| predicted protein [Populus trichocarpa]
Length = 207
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 165/197 (83%), Positives = 179/197 (90%), Gaps = 3/197 (1%)
Query: 112 DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASV 171
DLVFFQNGM+EPW +SKGL DA+ VLAYFAVSKLGE P DGKTDTNPEGLTAAYGKWAS
Sbjct: 14 DLVFFQNGMLEPWFQSKGLGDADHVLAYFAVSKLGEPPTDGKTDTNPEGLTAAYGKWASA 73
Query: 172 VAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSAL 231
V L GGLSCK+LDKEAFQKQMLEKLIWISAFMLVGARH GATVGVVEKE+R EVS+L
Sbjct: 74 V---LHAGGLSCKLLDKEAFQKQMLEKLIWISAFMLVGARHPGATVGVVEKEFRYEVSSL 130
Query: 232 IAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGK 291
I ELA AAAAEKGI F+ A+E+RLCAY+RAVA+FPTAVKEFKWRNGWFYSLSEKA +EGK
Sbjct: 131 ITELASAAAAEKGIVFEEAIEERLCAYARAVAHFPTAVKEFKWRNGWFYSLSEKAVSEGK 190
Query: 292 PDPCPLHTAWLKEIKVV 308
PDPCPLHT+WLKE+KVV
Sbjct: 191 PDPCPLHTSWLKELKVV 207
>gi|224143703|ref|XP_002336073.1| predicted protein [Populus trichocarpa]
gi|222869914|gb|EEF07045.1| predicted protein [Populus trichocarpa]
Length = 189
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 160/189 (84%), Positives = 175/189 (92%)
Query: 120 MIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVG 179
M+EPW +SKGL DA+QVLAYFAVSKLGE P DGKTDTNPEGLTAAYGKWAS VA RL+ G
Sbjct: 1 MLEPWFQSKGLGDADQVLAYFAVSKLGEPPTDGKTDTNPEGLTAAYGKWASAVAARLNAG 60
Query: 180 GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAA 239
GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARH GATVGV+EKE+R EVS+LI ELA AA
Sbjct: 61 GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHPGATVGVMEKEFRYEVSSLITELASAA 120
Query: 240 AAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHT 299
AAEKGI F+ A+E+RLCAY+RAVA+FPTAVKEFKWRNGWFYSLSEKA +EGKPDPCPLHT
Sbjct: 121 AAEKGIVFEEAIEERLCAYARAVAHFPTAVKEFKWRNGWFYSLSEKAVSEGKPDPCPLHT 180
Query: 300 AWLKEIKVV 308
+WLKE+KVV
Sbjct: 181 SWLKELKVV 189
>gi|222640230|gb|EEE68362.1| hypothetical protein OsJ_26666 [Oryza sativa Japonica Group]
Length = 242
Score = 323 bits (829), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 177/232 (76%), Positives = 196/232 (84%), Gaps = 1/232 (0%)
Query: 41 MASFTTTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKRGELVPLDFEGPIFVCTRNDDLE 99
MA+ T V PA +VGGGRVG AL MG G D+LV RGE VP D GPI VCTRNDDL+
Sbjct: 1 MAAAAKTAVQPAVVVGGGRVGQALLSMGPPGADVLVGRGEKVPEDAPGPILVCTRNDDLD 60
Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
AVL+A PRSRW DLVFFQNGM++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPE
Sbjct: 61 AVLDATPRSRWRDLVFFQNGMLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPE 120
Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
GLTAA+G WA VA RL GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVG
Sbjct: 121 GLTAAFGNWAPAVATRLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGA 180
Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKE 271
VEKEY+SEV++LIAELA AAA E+G+TFD +E+RLCAYSRAVA+FPTAVKE
Sbjct: 181 VEKEYQSEVASLIAELASAAAEERGLTFDAGIEERLCAYSRAVAHFPTAVKE 232
>gi|297608302|ref|NP_001061403.2| Os08g0266300 [Oryza sativa Japonica Group]
gi|255678306|dbj|BAF23317.2| Os08g0266300 [Oryza sativa Japonica Group]
Length = 339
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 197/323 (60%), Positives = 221/323 (68%), Gaps = 50/323 (15%)
Query: 19 PLKKPTFSKPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGK-GQDLLVKR 77
PL++ + S A VSA AMA+ T V PA +VGGGRVG AL MG G D+LV R
Sbjct: 15 PLRRLSSSTTTRAFSFRVSAAAMAAAAKTAVQPAVVVGGGRVGQALLSMGPPGADVLVGR 74
Query: 78 GELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVF---------------------- 115
GE VP D GPI VCTRNDDL+AVL+A PRSRW
Sbjct: 75 GEKVPEDAPGPILVCTRNDDLDAVLDATPRSRWRGTFVPLPALLSSPLRLPPGSRAVSCS 134
Query: 116 ----------FQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAY 165
QNGM++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPEGLTAA+
Sbjct: 135 ALLLLVSCSAIQNGMLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPEGLTAAF 194
Query: 166 GKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYR 225
G WA VA RL GGL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVG VEKEY+
Sbjct: 195 GNWAPAVATRLQNGGLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGAVEKEYQ 254
Query: 226 SEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEK 285
SEV++LIAELA AAA E+G+TFD +E+RLCAYSRAVA+FPTA
Sbjct: 255 SEVASLIAELASAAAEERGLTFDAGIEERLCAYSRAVAHFPTA----------------- 297
Query: 286 ASAEGKPDPCPLHTAWLKEIKVV 308
A A+GKPDPCPLHTAWLKEIKV+
Sbjct: 298 ALAQGKPDPCPLHTAWLKEIKVI 320
>gi|307105110|gb|EFN53361.1| hypothetical protein CHLNCDRAFT_137108 [Chlorella variabilis]
Length = 270
Score = 320 bits (819), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 163/267 (61%), Positives = 194/267 (72%), Gaps = 10/267 (3%)
Query: 51 PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRW 110
P IVGGGRVG AL +MG GQD+LV+RGE V GPI VCTRNDDL+AV++A P R
Sbjct: 5 PFVIVGGGRVGQALADMGAGQDVLVRRGEAVS-GPPGPIVVCTRNDDLQAVVDATPAERR 63
Query: 111 N---------DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGL 161
LV+ QNGM++PWL+ +GL + QVL YFAV+K G+ P DGKTD NPEGL
Sbjct: 64 QGKQAAREAGHLVWIQNGMLQPWLDERGLGETTQVLVYFAVAKKGDPPTDGKTDVNPEGL 123
Query: 162 TAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVE 221
TAAYG A+ VA RL GLSCKVLDK F K MLEKL+WI AFMLVGARH G TVG VE
Sbjct: 124 TAAYGPHAAAVAARLHSAGLSCKVLDKGEFAKCMLEKLVWICAFMLVGARHGGCTVGEVE 183
Query: 222 KEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYS 281
++ EVS LI +LA A +A GIT D + +RL AY+R+VA+FPTAVKEF+WRNG+FY
Sbjct: 184 SQHTGEVSQLIGQLAAAGSAALGITLDSGVPERLLAYARSVAHFPTAVKEFQWRNGYFYG 243
Query: 282 LSEKASAEGKPDPCPLHTAWLKEIKVV 308
LS++A A G+PDPCP HTAWLKE +
Sbjct: 244 LSQQAQAAGQPDPCPTHTAWLKEAGAI 270
>gi|384245300|gb|EIE18795.1| hypothetical protein COCSUDRAFT_26256 [Coccomyxa subellipsoidea
C-169]
Length = 266
Score = 317 bits (811), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 156/254 (61%), Positives = 188/254 (74%), Gaps = 2/254 (0%)
Query: 54 IVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDL 113
IVG GRVG AL EMGK D+LV RG +P +GP+ VCTRND L+ V+ P R DL
Sbjct: 14 IVGRGRVGAALAEMGK-DDVLVGRGGRIPEGPQGPVIVCTRNDALDDVVSMTPPDRKKDL 72
Query: 114 VFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVA 173
VF QNGM++PWL+ +GL D QVL YFAV+K GE P+DGKTD NPEGLTAA+G A VA
Sbjct: 73 VFIQNGMLQPWLDGQGLGDNTQVLVYFAVAKKGEAPLDGKTDLNPEGLTAAWGIHAEAVA 132
Query: 174 ERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIA 233
ERL GLSC D+ F K MLEKLIWISA+MLVGARH+ A +G VE+++RSEV +LI
Sbjct: 133 ERLKANGLSCSTPDRPEFDKAMLEKLIWISAYMLVGARHS-ANIGDVERQHRSEVGSLIE 191
Query: 234 ELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPD 293
EL AAE G+T D +RL AYSR+VA+FPTAVKEF+WRNGWF+ +S+KA AEG+ D
Sbjct: 192 ELGAGGAAELGVTLDSNFAERLHAYSRSVAHFPTAVKEFQWRNGWFHDISKKALAEGRAD 251
Query: 294 PCPLHTAWLKEIKV 307
P P HTAWLK++ V
Sbjct: 252 PFPTHTAWLKDLGV 265
>gi|159464353|ref|XP_001690406.1| hypothetical protein CHLREDRAFT_182934 [Chlamydomonas reinhardtii]
gi|158279906|gb|EDP05665.1| predicted protein [Chlamydomonas reinhardtii]
Length = 290
Score = 312 bits (799), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 158/278 (56%), Positives = 193/278 (69%), Gaps = 2/278 (0%)
Query: 31 AKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIF 90
A PT + A M + Q A IVGGGRVG AL +MG G D++V RG+ + PI
Sbjct: 15 ASPTQLRAARMTCNASKQQA-WTIVGGGRVGLALADMGPGGDVVVGRGQKIEGPAGSPIV 73
Query: 91 VCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPI 150
VCTRNDDL+AV++ R DLVF QNGM++PWL+++GL QVL YFAV+K G++P
Sbjct: 74 VCTRNDDLQAVVDLTAPERRKDLVFIQNGMLQPWLDARGLGGNTQVLVYFAVAKKGDKPT 133
Query: 151 DGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGA 210
DGKTD NPEGLTAA+G A VA+RL GGLSCKVL FQ+ MLEKL+WISAFM VGA
Sbjct: 134 DGKTDVNPEGLTAAFGPHAQAVADRLHAGGLSCKVLGAADFQRAMLEKLVWISAFMAVGA 193
Query: 211 RHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVK 270
+H A VG VE + +EVS +I EL A AAE G+ RL AY+R+VA+FPTAVK
Sbjct: 194 KHK-ANVGQVESAHTAEVSDIIQELCAAGAAELGVGLAGGEVARLLAYARSVAHFPTAVK 252
Query: 271 EFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
EF WRNGWFY LS+KA A G+PDPCP HTA L+E+ +
Sbjct: 253 EFPWRNGWFYGLSQKALAAGQPDPCPKHTALLREVGAI 290
>gi|302828238|ref|XP_002945686.1| hypothetical protein VOLCADRAFT_54899 [Volvox carteri f.
nagariensis]
gi|300268501|gb|EFJ52681.1| hypothetical protein VOLCADRAFT_54899 [Volvox carteri f.
nagariensis]
Length = 268
Score = 308 bits (790), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 156/269 (57%), Positives = 191/269 (71%), Gaps = 5/269 (1%)
Query: 40 AMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLE 99
++ TT+QV IVGGGRVG A+ +MG G D++V RG+ V GPI VCTRNDDL+
Sbjct: 5 CLSGLTTSQVF--TIVGGGRVGQAIADMGPGGDVVVTRGQRVE-GPPGPIVVCTRNDDLQ 61
Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
V PR LVF QNGM++PWL+ +GL + QVL YFAV+K G+RP DGKTD NPE
Sbjct: 62 -VSRRTPRVTPPHLVFIQNGMLQPWLDERGLGENTQVLVYFAVAKKGDRPTDGKTDVNPE 120
Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
GLTAAYG+ A VA+RL GGLSCKVL K+ F + MLEKL+WISA+MLVGA+H +VG
Sbjct: 121 GLTAAYGRHAQAVADRLHSGGLSCKVLGKKDFTRAMLEKLVWISAYMLVGAKHK-VSVGQ 179
Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWF 279
VE ++ EVS LI EL A AAE G+ +RL AY+R+VA+FPTAVKEF WRNGWF
Sbjct: 180 VEAQHTGEVSELIRELCTAGAAELGVELAGGEVERLNAYARSVAHFPTAVKEFSWRNGWF 239
Query: 280 YSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
Y L++KA G+PDPCPLHT L E+ V
Sbjct: 240 YGLTQKALTAGRPDPCPLHTQLLTEVGAV 268
>gi|255081100|ref|XP_002504116.1| predicted protein [Micromonas sp. RCC299]
gi|226519383|gb|ACO65374.1| predicted protein [Micromonas sp. RCC299]
Length = 265
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 152/262 (58%), Positives = 193/262 (73%), Gaps = 2/262 (0%)
Query: 49 VAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD-FEGPIFVCTRNDDLEAVLEAAPR 107
VAP+ IVGGGRVG AL +MG D+++KRG+ P + GPI+VCTRND L+ V++ P
Sbjct: 4 VAPSIIVGGGRVGQALADMGVPGDVIMKRGDAFPAEPSTGPIYVCTRNDALQGVIDQTPP 63
Query: 108 SRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGK 167
R DLVF QNGM++PWL+++GL D Q L YFAV+K GE+P DG TD NPEGLTAA GK
Sbjct: 64 ERRADLVFLQNGMLQPWLDARGLGDNTQALIYFAVAKFGEKPTDGITDVNPEGLTAANGK 123
Query: 168 WASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSE 227
A+ + RL+ GG+ C VLD +AF+ M EKLIWI AFM VGA H GATVG VE +R+E
Sbjct: 124 HAAALQARLAAGGMRCHVLDDDAFKASMFEKLIWICAFMAVGAAHPGATVGDVESTHRAE 183
Query: 228 VSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKAS 287
V LI EL +AAEKG+ F+P +RLCAY+RAVA+FPTAVKEF+WRNG+FY +S++A
Sbjct: 184 VVELIEELLEGSAAEKGVKFEPGSVERLCAYARAVAHFPTAVKEFEWRNGFFYGISQEAI 243
Query: 288 AEG-KPDPCPLHTAWLKEIKVV 308
G +PDP P HTA LK++ +
Sbjct: 244 TAGDRPDPFPKHTALLKQVGAI 265
>gi|303280695|ref|XP_003059640.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459476|gb|EEH56772.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 273
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 145/258 (56%), Positives = 183/258 (70%), Gaps = 1/258 (0%)
Query: 49 VAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD-FEGPIFVCTRNDDLEAVLEAAPR 107
V P+ I+GGGRVG AL +MG D++VKRG+ P GPI+VCTRND L+ V++A P
Sbjct: 12 VMPSIIIGGGRVGQALFDMGVDGDVIVKRGDAFPSSPATGPIYVCTRNDALQGVVDATPE 71
Query: 108 SRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGK 167
+R DLVF QNGM++PWL+++GL Q L YFAV+K+ E+P DG TD NPEGLTAA G+
Sbjct: 72 ARRKDLVFLQNGMLQPWLDARGLGGNTQALVYFAVAKMNEKPTDGITDANPEGLTAANGE 131
Query: 168 WASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSE 227
WA A RL GGL C VLD +AF+ M EKLIWI AFM VGA H GATVG VE ++ E
Sbjct: 132 WADAFAARLRSGGLRCHVLDDDAFRASMFEKLIWICAFMGVGAAHPGATVGDVESAHKEE 191
Query: 228 VSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKAS 287
V ALIAEL + + KG+ F RL AY+RAV++FPTAVKEF+WRNG+F +SE+
Sbjct: 192 VEALIAELVAGSESAKGVRFAEGTIPRLAAYARAVSHFPTAVKEFEWRNGFFCGISEREL 251
Query: 288 AEGKPDPCPLHTAWLKEI 305
A G+ DP P HTA +K++
Sbjct: 252 AAGRLDPFPTHTALMKQV 269
>gi|356562062|ref|XP_003549294.1| PREDICTED: uncharacterized protein LOC100803380 [Glycine max]
Length = 275
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 138/189 (73%), Positives = 162/189 (85%)
Query: 102 LEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGL 161
+++ P SRW DLVFFQNGM+EPWLESKGL+DANQVLAYF VSK+GE PIDG+TDTNPE L
Sbjct: 41 VQSTPFSRWGDLVFFQNGMMEPWLESKGLEDANQVLAYFVVSKIGETPIDGRTDTNPERL 100
Query: 162 TAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVE 221
AAYGKWAS+VA RL+VGGLSCKVLDKE FQKQMLEK IWI + MLVGA H G +VG V+
Sbjct: 101 IAAYGKWASIVAARLNVGGLSCKVLDKEVFQKQMLEKRIWICSVMLVGATHGGVSVGAVD 160
Query: 222 KEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYS 281
E+ +E+S LI ELA A++EKG+TF+ AME+RLCAYSRAVA+FPTAV+EFKWRNGWFYS
Sbjct: 161 IEFHTELSNLITELASTASSEKGLTFEEAMEERLCAYSRAVAHFPTAVREFKWRNGWFYS 220
Query: 282 LSEKASAEG 290
L ++ S G
Sbjct: 221 LFDEISDHG 229
>gi|299116682|emb|CBN74827.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 315
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 143/259 (55%), Positives = 179/259 (69%), Gaps = 2/259 (0%)
Query: 47 TQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAP 106
T AAIVGGGR+G AL +MG G D LV RG+++P D +GPI+VCTRNDDLE V+ P
Sbjct: 51 TATTMAAIVGGGRIGCALYDMGGGGDTLVGRGDIIP-DGDGPIYVCTRNDDLEDVISRTP 109
Query: 107 RSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYG 166
+R DLVF QNG++ P LE GL Q L YFAVSK GE PIDG TD NPEGLT+ G
Sbjct: 110 PARREDLVFLQNGVLGPLLEKHGLAKNTQALIYFAVSKKGEPPIDGITDANPEGLTSTCG 169
Query: 167 KWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRS 226
KWA + ERL+ GGL+C L ++ + +M EK IWI AFM+ GA H +G V+ ++
Sbjct: 170 KWAGELKERLAKGGLTCHELAEDRYTARMYEKHIWICAFMVAGAAHK-CNMGQVDADHAD 228
Query: 227 EVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKA 286
EV LI EL A E G+ F+ ++DRLCAYSR+VAN+P AVKEF WRNG+F+ +SEKA
Sbjct: 229 EVRTLIGELQKAIEVEYGVAFEDGVKDRLCAYSRSVANYPAAVKEFSWRNGFFWDISEKA 288
Query: 287 SAEGKPDPCPLHTAWLKEI 305
EG DPCPLHT L+++
Sbjct: 289 RKEGTEDPCPLHTKLLRQV 307
>gi|412991433|emb|CCO16278.1| predicted protein [Bathycoccus prasinos]
Length = 320
Score = 293 bits (751), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 149/281 (53%), Positives = 188/281 (66%), Gaps = 2/281 (0%)
Query: 29 RFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD-FEG 87
R ++ V+A MAS ++ +V PA I+GGGRVG AL +MG D++VKR + P G
Sbjct: 41 RTSRGRKVTATVMAS-SSNKVEPAIIIGGGRVGQALFDMGVEGDVIVKRDDEFPSSPSSG 99
Query: 88 PIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGE 147
PIFVCTRND LE V+E P +R DL+F QNG ++ +L+SKGL+ Q L YFAV+K GE
Sbjct: 100 PIFVCTRNDVLETVIEKTPANRREDLIFMQNGYLDSFLQSKGLEKNTQALIYFAVAKFGE 159
Query: 148 RPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFML 207
+P DG T NPEGLTA GKW+ VA R G LSC V D ++ MLEKLIWISAFML
Sbjct: 160 KPTDGVTSVNPEGLTAVTGKWSDQVAARFHNGDLSCHVKDAAKYRASMLEKLIWISAFML 219
Query: 208 VGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPT 267
VGA+H GATVG V ++SEV LI EL A +TF+ +RL AY+ +VA+FPT
Sbjct: 220 VGAQHPGATVGDVATTHKSEVEGLIKELVQGVADINDVTFEAGTLERLLAYADSVAHFPT 279
Query: 268 AVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
AVKEF+WRNG+FY+LS + + K DP P HTA LK I +
Sbjct: 280 AVKEFEWRNGFFYNLSTQTMQQSKDDPFPKHTAMLKAINAI 320
>gi|145349172|ref|XP_001419014.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579244|gb|ABO97307.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 269
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 146/269 (54%), Positives = 180/269 (66%), Gaps = 1/269 (0%)
Query: 41 MASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE-GPIFVCTRNDDLE 99
M + T +V P+ I+GGGRVG AL++MG D++V+RGE P + + GPI+V TRND LE
Sbjct: 1 MTNATKGEVMPSIIIGGGRVGQALRDMGVEGDVVVRRGEAFPSEPKTGPIYVATRNDVLE 60
Query: 100 AVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPE 159
V++ P R DLVF QNGM++ +L+++GL Q L YFAV+ LG++P DG T+ NPE
Sbjct: 61 DVIKNCPADRREDLVFMQNGMLQQFLDAQGLGANTQALIYFAVAALGDKPTDGVTELNPE 120
Query: 160 GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGV 219
GLTAA GKWA VA R G LSC VLD ++ M EKLIWI A MLVGA+H GA VG
Sbjct: 121 GLTAATGKWADAVAARFHNGDLSCHVLDAPTYEGSMYEKLIWICAMMLVGAQHEGAKVGD 180
Query: 220 VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWF 279
V Y+ EV ALI EL A G F P DRL AY++ VA+FP VKEF WRNG+F
Sbjct: 181 VATTYKDEVVALIEELLSGTTACTGAKFAPGAVDRLIAYAKTVAHFPCGVKEFPWRNGYF 240
Query: 280 YSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
Y LSEKA A+GK DP P HTA LK I +
Sbjct: 241 YGLSEKAIADGKSDPFPTHTALLKNIGAI 269
>gi|308806395|ref|XP_003080509.1| unnamed protein product [Ostreococcus tauri]
gi|116058969|emb|CAL54676.1| unnamed protein product [Ostreococcus tauri]
Length = 267
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 144/263 (54%), Positives = 174/263 (66%), Gaps = 1/263 (0%)
Query: 47 TQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLD-FEGPIFVCTRNDDLEAVLEAA 105
T VAPA I+GGGRVG AL+EMG D++V+RGE P + GPIFV TRND LE V+
Sbjct: 5 TSVAPAIIIGGGRVGQALREMGPEGDVVVRRGEAFPSEPSSGPIFVATRNDVLEDVIANT 64
Query: 106 PRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAY 165
P +R DLVF QNGM++ +L+++GL Q L YFAV+K G+ P DG T+ NPEGLTA
Sbjct: 65 PANRREDLVFMQNGMLQTFLDAQGLGANTQALIYFAVAKAGDAPTDGVTELNPEGLTAVT 124
Query: 166 GKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYR 225
GKWA VA R G LSC VLD +Q M EKLIWI A MLVGA+H GA VG V ++
Sbjct: 125 GKWADAVAARFHAGNLSCHVLDASTYQGSMYEKLIWICAMMLVGAQHEGAKVGDVATTHK 184
Query: 226 SEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEK 285
EV ALI EL G F P DRL AY++ VA+FP VKEF WRNG+F+ LS +
Sbjct: 185 DEVVALIEELLSGTTTCTGAQFAPGAVDRLIAYAKTVAHFPCGVKEFPWRNGYFHGLSAE 244
Query: 286 ASAEGKPDPCPLHTAWLKEIKVV 308
A AEGKPDP P+HT L+ I +
Sbjct: 245 AIAEGKPDPFPIHTKLLQNIGAI 267
>gi|323455643|gb|EGB11511.1| hypothetical protein AURANDRAFT_21633, partial [Aureococcus
anophagefferens]
Length = 281
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 139/264 (52%), Positives = 172/264 (65%), Gaps = 6/264 (2%)
Query: 46 TTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDF--EGPIFVCTRNDDLEAVLE 103
T PA IVG GR+G+ LK G D ++ P D GPI+VCTRND L +V+
Sbjct: 13 TEHDVPAVIVGSGRIGSMLKSDG---DAVMTSSSGWPADAPESGPIYVCTRNDALASVVA 69
Query: 104 AAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTA 163
A P R DL F QNGM+ P+LE +GL DA QVL Y AV+KLGE P DG TD NPEGLTA
Sbjct: 70 AVPEDRRADLCFLQNGMLGPFLEGQGLGDATQVLVYLAVAKLGETPTDGVTDANPEGLTA 129
Query: 164 AYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKE 223
A GKWA A RL++ L+C V + +AF MLEK +WI AFMLVGA H G +VG VE++
Sbjct: 130 ATGKWADAFASRLALKELTCHVKEGDAFAVAMLEKHVWICAFMLVGATH-GCSVGDVERK 188
Query: 224 YRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLS 283
+ +E AL AEL A A G+ +RL AY+R VA+FPTAVKEF+WRNGWFY ++
Sbjct: 189 HAAEFDALAAELLDAGAEALGVAVPAGAPERLKAYARTVAHFPTAVKEFEWRNGWFYKIT 248
Query: 284 EKASAEGKPDPCPLHTAWLKEIKV 307
A A GK DP PLHT L ++++
Sbjct: 249 NDAVAAGKEDPMPLHTQKLYDLEL 272
>gi|428179462|gb|EKX48333.1| hypothetical protein GUITHDRAFT_157525 [Guillardia theta CCMP2712]
Length = 274
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 129/261 (49%), Positives = 174/261 (66%), Gaps = 5/261 (1%)
Query: 52 AAIVGGGRVGTALKEMGKG---QDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRS 108
I+G GR+GTALKEMGK D++V+RGE +P + GPI+V TRNDDL+ V+ A P S
Sbjct: 9 CTIIGAGRIGTALKEMGKASGFDDVVVRRGEKIPAEGSGPIYVTTRNDDLKGVISACPPS 68
Query: 109 RWNDLVFF-QNGMIEPWL-ESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYG 166
R DLVF+ QNG IE +L E K D+ +VL Y AV+KLG++P DG TD NPEGLTAA G
Sbjct: 69 RKGDLVFYGQNGYIEDFLKEEKVFDDSTKVLVYMAVAKLGDKPTDGITDLNPEGLTAATG 128
Query: 167 KWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRS 226
KWA VA RL GL+C VL+++ F+ M EK +WI FML+G H G TVG VE +++
Sbjct: 129 KWAEQVAARLRGSGLTCNVLNEDEFRACMFEKHMWICTFMLLGVHHGGITVGEVESKHKE 188
Query: 227 EVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKA 286
+V ++ E+ E+GI F + +RLCAY+R+VA+FPTA+KEF WRN ++ ++
Sbjct: 189 DVFKMVEEMKKCLEVERGIKFKDGVPERLCAYARSVAHFPTALKEFSWRNKFWLDITRAR 248
Query: 287 SAEGKPDPCPLHTAWLKEIKV 307
+G D P HT L ++ V
Sbjct: 249 QGKGAEDLTPFHTKLLLKLPV 269
>gi|224006582|ref|XP_002292251.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971893|gb|EED90226.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 303
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 121/300 (40%), Positives = 166/300 (55%), Gaps = 21/300 (7%)
Query: 14 STLWPPL---KKPTFSKPRFAKPTPVSAFAMASFTTTQVA--PAAIVGGGRVGTALKEMG 68
ST +P L T S F+ P S + TTQ++ ++GGGR+G+ +
Sbjct: 4 STAFPALLAFSMYTTSAQGFSSSAPSSCTTSSCRQTTQLSATDTIVIGGGRIGSLISNDA 63
Query: 69 KGQDLLVKRGELVPLDF----EGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPW 124
K L+ R + V EGPIF+ TRND LE+++E P SR DLVF QNG ++ +
Sbjct: 64 K----LLGRNDSVASSIDPNGEGPIFIATRNDVLESIVEDCPESRRKDLVFLQNGYLDNF 119
Query: 125 LESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCK 184
L SK L Q L Y +V+ G P+DG T NPEGLTAA G A A+RL+ GL C
Sbjct: 120 LASKDLLSNTQALLYLSVTAKGVDPVDGITTVNPEGLTAATGVHAQAFADRLATLGLKCN 179
Query: 185 VLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKG 244
V+ E ++ M EKLIWIS +MLVG +VG +++ V +I+EL A +A++G
Sbjct: 180 VVSTEDYRPAMFEKLIWISTYMLVGTAKNCKSVGQAGSDHKHLVENIISELVAAVSAKEG 239
Query: 245 ITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKE 304
I F +RL AY+ V +FP VKEF+WRN +FY L ++A CPLH L+E
Sbjct: 240 IAFQEGTIERLAAYTDVVTDFPCGVKEFEWRNKYFYDLGDEA--------CPLHNGLLRE 291
>gi|219116430|ref|XP_002179010.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409777|gb|EEC49708.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 231
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 101/233 (43%), Positives = 143/233 (61%), Gaps = 2/233 (0%)
Query: 54 IVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRWNDL 113
I+G GR+G L E GK + + R + + + +G I + TRND LE+++E P +R DL
Sbjct: 1 IIGSGRIGALLAEAGKCK--VSGREDSIDSEGKGSILIATRNDSLESIVEKCPENRRKDL 58
Query: 114 VFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVA 173
VF QNG ++ +L+SKGL QVL Y + + + + +DG T NPEGLTAA G A A
Sbjct: 59 VFMQNGYLDKFLDSKGLLGNTQVLLYLSKASMEAKAVDGVTSVNPEGLTAATGIHAQAFA 118
Query: 174 ERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIA 233
+RL+ L C V+ ++++ M EKLIWIS +MLVG +VG +EYR V A++
Sbjct: 119 DRLASLDLKCNVVSPKSYRPAMFEKLIWISTYMLVGTAKECKSVGQASEEYRDIVEAVVN 178
Query: 234 ELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKA 286
EL A A++GI F RL AY+ VA+FP AVKEF+WRN +F+ L + A
Sbjct: 179 ELTAAVCAKEGIEFPDGTMARLAAYTDVVADFPCAVKEFQWRNQYFFDLGDDA 231
>gi|388504328|gb|AFK40230.1| unknown [Lotus japonicus]
Length = 114
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 87/114 (76%), Positives = 102/114 (89%)
Query: 195 MLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDR 254
MLEKLIWI + MLVGARH G +VG VEKE+ E+ +LI+ELALAAA+EKG+TF+ MEDR
Sbjct: 1 MLEKLIWICSVMLVGARHGGVSVGNVEKEFHLELCSLISELALAAASEKGLTFEEGMEDR 60
Query: 255 LCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKVV 308
+CAYSRAVA+FPTAVKEFKWRNGWFYSLSEKA+A+GK DPCPLHT WLKE+K+V
Sbjct: 61 MCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKATAQGKQDPCPLHTQWLKELKIV 114
>gi|428171530|gb|EKX40446.1| hypothetical protein GUITHDRAFT_75611 [Guillardia theta CCMP2712]
Length = 276
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/266 (36%), Positives = 155/266 (58%), Gaps = 15/266 (5%)
Query: 48 QVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE---------GPIFVCTRNDDL 98
+++PA IVG GRVG +K G D +V+ + + GPI++C N L
Sbjct: 2 ELSPAVIVGDGRVGHMMKNRGDPGDAMVRSTSELRFQLKEAPGGGPKSGPIYLCVPNSAL 61
Query: 99 EAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNP 158
+ ++EA P R DLVF +G + P+L+ GL+ Q L YF + KLG P+DGKT +P
Sbjct: 62 KELVEACPPERRQDLVFMGSGNLGPFLKQFGLEGNTQALIYFVIDKLGHDPVDGKTSLDP 121
Query: 159 EGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG 218
+GLTA GKW+ +A+RL + LSC+++D F+ LE+L+W A LV + VG
Sbjct: 122 DGLTAVTGKWSGDLAKRLKMVNLSCRIMDSNKFEISRLERLVWTCATHLVAKANKFTKVG 181
Query: 219 VVEKEYRSEVSALIAELALAAAAEKGITFDPAME-----DRLCAYSRAVANFPTAVKEFK 273
V + + SEV ++ EL + E+ ++ +E +R+ Y+ ++ +F T VK+ +
Sbjct: 182 QVVESHWSEVEEVMHEL-VRVLNEQAVSGSVQLEASQFYERMKEYTLSIGHFNTVVKDLE 240
Query: 274 WRNGWFYSLSEKASAEGKPDPCPLHT 299
+RNG+FY +S++ A+G PDPCP+HT
Sbjct: 241 FRNGYFYKISQERLAKGLPDPCPMHT 266
>gi|397642896|gb|EJK75525.1| hypothetical protein THAOC_02750 [Thalassiosira oceanica]
Length = 298
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 156/282 (55%), Gaps = 28/282 (9%)
Query: 27 KPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE 86
+ R T +SA A A+ T+ V ++GGGR+G+ L E L+ R + + +
Sbjct: 29 QSRTTATTSLSA-ANANVATSDV----VIGGGRIGSILAEDAT----LLGREDSISSSID 79
Query: 87 ----GPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAV 142
GPI+V TRND LE +++ P R DLVF QNG ++ +LE KGL Q L Y +V
Sbjct: 80 PAGTGPIYVATRNDVLEGIVDDCPAGRRKDLVFLQNGYLDSFLERKGLLQNTQALLYLSV 139
Query: 143 SKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWI 202
G P+DG T NPEGLTAA G+WA A++L+ L V+ E ++ M EKLIWI
Sbjct: 140 PAKGADPVDGITSVNPEGLTAATGEWAQAFADKLATLDLKMNVVTPEQYRPAMFEKLIWI 199
Query: 203 SAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAV 262
S +MLVG ++VG E++ V +IAEL A +A++GI+F +RL AY+ V
Sbjct: 200 STYMLVGTVKDCSSVGQAGNEHKQLVEDVIAELVAAVSAKEGISFTSGTIERLAAYTDVV 259
Query: 263 ANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKE 304
A+FP VK +FY L ++A CP+H L+E
Sbjct: 260 ADFPCGVK-------YFYDLGDEA--------CPIHNRLLRE 286
>gi|37806393|dbj|BAC99931.1| unknown protein [Oryza sativa Japonica Group]
Length = 142
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 90/113 (79%), Positives = 99/113 (87%)
Query: 120 MIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVG 179
M++PW ESKGL DANQVLAYFAVSKLGE P+DG TDTNPEGLTAA+G WA VA RL G
Sbjct: 1 MLDPWFESKGLVDANQVLAYFAVSKLGEPPVDGITDTNPEGLTAAFGNWAPAVATRLQNG 60
Query: 180 GLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSALI 232
GL+CKVL+KEAFQKQMLEKLIWISAFMLVGARH GATVG VEKEY+SE + +I
Sbjct: 61 GLTCKVLEKEAFQKQMLEKLIWISAFMLVGARHPGATVGAVEKEYQSEFTLVI 113
>gi|452823237|gb|EME30249.1| hypothetical protein Gasu_24020 [Galdieria sulphuraria]
Length = 305
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 98/262 (37%), Positives = 140/262 (53%), Gaps = 20/262 (7%)
Query: 46 TTQVAPAAIVGGGRVGTAL-KEMGKGQDLLVKRGELVPLDFEGPIFVCTRNDDLEAVLEA 104
T Q ++G GR+G+ L + G + ++V RGE +P D GPI+VCTRNDDL++++
Sbjct: 58 TGQSFQEVVIGNGRIGSLLARAPGMKEPVVVTRGESIPADGYGPIYVCTRNDDLDSIVAR 117
Query: 105 APRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAA 164
P R DLVF QNGMI+ +LE + +++ Q L YFAV+ G+ P+DG G T
Sbjct: 118 TPSDRRPDLVFIQNGMIQTFLEERNIQNNTQALIYFAVASKGDSPVDG-------GGTVV 170
Query: 165 YGKWASVVAERLSVGGLSCKVL-DKEAFQKQMLEKLIWISAFM-LVGARHTGATVGVVEK 222
YGKWA A+R+ G C V+ DK F K+M+EKL+WI L H V +
Sbjct: 171 YGKWAEAFAQRVCSLGCKCSVVEDKTEFIKKMVEKLLWICVMGPLCATSHFTCGQVVANE 230
Query: 223 EYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSL 282
++ L+ EL A I D + R+ YS V+++ VKE WRNGWF L
Sbjct: 231 TCLRDLEDLVNELKPIAEEVLRIKLDDGVVSRIVEYSNKVSDYRAGVKEIAWRNGWF--L 288
Query: 283 SEKASAEGKPDPCPLHTAWLKE 304
K + PLH +++K
Sbjct: 289 ERKKT--------PLHVSYMKN 302
>gi|428181809|gb|EKX50672.1| hypothetical protein GUITHDRAFT_151218 [Guillardia theta CCMP2712]
Length = 357
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 98/274 (35%), Positives = 151/274 (55%), Gaps = 21/274 (7%)
Query: 50 APAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFEG----PIFVCTRNDDLEAVLEAA 105
P+ IVGGGRVG L+ G G D++V RG +P EG P++VC +LE V+ +
Sbjct: 72 TPSVIVGGGRVGEMLRGAGGGDDVVVGRGGQIPEVHEGLESFPVYVCVPTSELENVIRSC 131
Query: 106 PRSRWNDLVFFQNGMIEPWLESKGL--KDANQVLAYFAVSKLGERPIDGKTDTNPE---- 159
P+S+ +DLVF Q+GMIEP L+ L + Q+L YF + G RP D D +
Sbjct: 132 PQSKLDDLVFVQDGMIEPILKKYALCGNENTQMLPYFTLFSKGGRPQDSLVDLGTDAYGE 191
Query: 160 ----GLTAAYGKWASVVAERLSVGGLSCKVLDKEAF----QKQMLEKLIWISAFMLVGAR 211
G T+ GKWA + +RL C + F ++ MLEK+I+ S F LVGA
Sbjct: 192 PKYAGETSICGKWAGALKQRLDRLNF-CHRHSRTMFYRDWRRAMLEKVIFDSVFPLVGAL 250
Query: 212 HTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRA--VANFPTAV 269
+ G ++G V + + E +I E+A + +T +E+RL AYS+ V P +
Sbjct: 251 NKGISIGEVGQSHTEECQEMITEIARSLPGTLAVTLMYGVEERLFAYSQHDNVEFRPCKI 310
Query: 270 KEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLK 303
+++ +RNG++Y+ S++A G PDP P+H+ +L+
Sbjct: 311 EQYPFRNGFWYTFSQQAKQIGFPDPTPMHSEYLE 344
>gi|449016404|dbj|BAM79806.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 312
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/238 (38%), Positives = 130/238 (54%), Gaps = 18/238 (7%)
Query: 52 AAIVGGGRVGTALKEMGKGQDL----LVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPR 107
A +VG GR+GT L KG +L L++R + + +GPI+VCTRNDDL V+ P
Sbjct: 59 ATVVGAGRIGTLLAS-AKGDELAPVRLLRRQDSIEGGRKGPIYVCTRNDDLAEVVAKCPP 117
Query: 108 SRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGK 167
R DLVF QNGM+ +L L+ A Q L +FAV ++G+ P DG G T +G
Sbjct: 118 VRRGDLVFLQNGMLRSFLLEHNLEHATQALVFFAVPRVGDPPADG-------GGTVVWGP 170
Query: 168 WASVVAERLSVGGLSCKVL-DKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRS 226
WA+ A R+ G C VL DK+ F +M+EKL+WI F ++ G VG + ++
Sbjct: 171 WAAAFAARVRSCGCQCTVLEDKQEFDCRMIEKLLWICIFGVLSESQNGMPVGEIAEKQPD 230
Query: 227 EVSALIAELALAAAAEKGITFDPAME-----DRLCAYSRAVANFPTAVKEFKWRNGWF 279
V L+ ELA A E G+ A++ L YS+ + +F KE++WRNGWF
Sbjct: 231 RVRQLVDELAPIAEREVGLEGKHALDRGQLLQSLLDYSKKIPDFVAGFKEYRWRNGWF 288
>gi|299117349|emb|CBN75305.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 322
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 96/284 (33%), Positives = 144/284 (50%), Gaps = 31/284 (10%)
Query: 51 PAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDF--------EG--------PIFVCTR 94
P+ I+G G++GTAL MG G+D+++ RGE +P +G PI+VC
Sbjct: 29 PSVIIGRGKLGTALASMGMGEDVVLGRGEAIPDSLPCSSGKGEDGEPAMLQNFPIYVCVP 88
Query: 95 NDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLA--YFAVSKLGERPI-- 150
+++ AV+++ P + +DLVF +G +EP L+S+G+ Q A YF+V K+G R +
Sbjct: 89 EEEVAAVIDSCPDDKKDDLVFMSSGCLEPLLKSRGMCRGEQTQATLYFSVDKVGGRAVDA 148
Query: 151 ------DGKTDTNPEGLTAAYGKWASVVAERL--SVGGLSCKVLDKEAFQKQMLEKLIWI 202
DG+ + G TA GKW A RL S GL C +++ M EKLI+
Sbjct: 149 MTVIGTDGRGEPKLAGETAVCGKWKGAFAMRLQQSPFGLHCSTEYYLEWRRVMFEKLIFD 208
Query: 203 SAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAV 262
A LVGA H TVG V + + E+ ++ EL + ++ +RL AY V
Sbjct: 209 CAVNLVGALHKFPTVGYVVQYHAEELVDMLFELKNTLRSTLAVSLMNGYPERLIAYGDQV 268
Query: 263 ANFPTA---VKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLK 303
+ T + FKW N +FY +S A DPCP+HT +L
Sbjct: 269 GDRQTTRMDARTFKWYNEFFYDISNTALKAKFGDPCPMHTEYLN 312
>gi|149918918|ref|ZP_01907404.1| hypothetical protein PPSIR1_16645 [Plesiocystis pacifica SIR-1]
gi|149820292|gb|EDM79709.1| hypothetical protein PPSIR1_16645 [Plesiocystis pacifica SIR-1]
Length = 260
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/264 (38%), Positives = 138/264 (52%), Gaps = 25/264 (9%)
Query: 54 IVGGGRVGTALKEMGKGQDLLVKR-------GELVPLDFEGPIFVCTRNDDLEAVLEAAP 106
+VG GRVGTAL + + LV R G P PI + RND L AVLE P
Sbjct: 3 VVGAGRVGTALAALDPERVTLVDRVANWAFVGAEAPGRPGEPIVLAVRNDALPAVLERIP 62
Query: 107 RSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYG 166
++R DLVF QNGM+ P+L+++GL +A + L +FAV G P G + G
Sbjct: 63 KARHPDLVFVQNGMLRPYLDARGLGEATRGLLFFAVPTRG-------AALEPGGESPFCG 115
Query: 167 KWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRS 226
A V E L G+ V+D F LEKLIW SAF L+ VG V +E+
Sbjct: 116 PKAEAVVEDLLAMGVPAAVVDAPKFAAVELEKLIWNSAFGLL-CEARACDVGTVVREHGE 174
Query: 227 EVSALIAELALAAAAEKGITFDPAME-----DRLCAYSRAVANFPTAVKEFKWRNGWFYS 281
+ AL+AE+ E G+ + A+E +RL +YS ++A++ AVKE+ WRNGWF
Sbjct: 175 ALRALVAEMLAVGGPELGLVGEAALELEALVERLASYSLSIASYRGAVKEWTWRNGWFV- 233
Query: 282 LSEKASAEGKPDPCPLHTAWLKEI 305
E+A A G+ CP+H L E+
Sbjct: 234 --ERARARGR--ACPVHDGLLAEV 253
>gi|302848804|ref|XP_002955933.1| hypothetical protein VOLCADRAFT_96870 [Volvox carteri f.
nagariensis]
gi|300258659|gb|EFJ42893.1| hypothetical protein VOLCADRAFT_96870 [Volvox carteri f.
nagariensis]
Length = 353
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/277 (35%), Positives = 137/277 (49%), Gaps = 43/277 (15%)
Query: 54 IVGGGRVGTALKEMGKGQDL---LVKRGELVPLDFEGPIFVCTRNDDLEAVLEAAPRSRW 110
+VG GRVG A +M G DL + +R + LD +GPI+VCT ND L+ VL+ P +R
Sbjct: 90 LVGYGRVGKAFAKM-LGDDLAATVSRRNGSISLDGKGPIYVCTTNDALDDVLQRTPPARR 148
Query: 111 NDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWAS 170
DLV QNG + PWL+ GL+D V Y A S G DG+ T A G+W
Sbjct: 149 RDLVLLQNGWLLPWLDRNGLEDVTLVALYMAASP-GGTATDGRR-------TVASGRWGD 200
Query: 171 VVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEVSA 230
VA L+ GG+SC V+ ++A K ++EK +W S F ++ GA VG + +R +V A
Sbjct: 201 HVASTLNRGGVSCAVVSRDALYKALVEKALWASIFWMLSCALGGAKVGDIAVHHRLDVKA 260
Query: 231 LIAEL------ALAAAAEK--------------GITFDPAMEDRLCAYSRAVANFPTAVK 270
L+ EL +L AAA K G D A+ D + +AV + A+
Sbjct: 261 LVDELLPMMRQSLEAAAAKRPGADAAAAALRDHGTVLD-ALMDYSMSIGQAVPSREMALA 319
Query: 271 EFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKV 307
EF WRNG + PLH +WL+ V
Sbjct: 320 EFSWRNGALLEMGLT----------PLHCSWLQRAGV 346
>gi|95113962|gb|ABF55515.1| chloroplast At1g16080 protein [Solanum lycopersicum]
Length = 75
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/73 (84%), Positives = 65/73 (89%)
Query: 112 DLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASV 171
+LVFFQNGM+EPW +SKGL DANQVLAYFAVSKLGE P DGKTDTNPEGLTAAYGKWAS
Sbjct: 3 NLVFFQNGMLEPWFQSKGLGDANQVLAYFAVSKLGEPPTDGKTDTNPEGLTAAYGKWASA 62
Query: 172 VAERLSVGGLSCK 184
VA RL GGLSCK
Sbjct: 63 VAARLQNGGLSCK 75
>gi|95113968|gb|ABF55518.1| chloroplast At1g16080 protein [Lactuca sativa]
Length = 72
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 59/72 (81%), Positives = 64/72 (88%)
Query: 113 LVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVV 172
LVFFQNGM+EPW ESKGL DA+QVLAYFAV+KLGE PIDG TDTNPEGLTAA+GKW S V
Sbjct: 1 LVFFQNGMLEPWFESKGLGDADQVLAYFAVAKLGEPPIDGITDTNPEGLTAAFGKWGSAV 60
Query: 173 AERLSVGGLSCK 184
A+RL GGLSCK
Sbjct: 61 ADRLRAGGLSCK 72
>gi|95113973|gb|ABF55520.1| chloroplast At1g16080 protein [Coffea canephora]
Length = 69
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 56/69 (81%), Positives = 59/69 (85%)
Query: 116 FQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAER 175
FQNGM+E W +SKGL D +QVLAYFAVSKLGE P DGKTDTNPEGLTAAYGKWAS VA R
Sbjct: 1 FQNGMLETWXQSKGLSDPDQVLAYFAVSKLGESPTDGKTDTNPEGLTAAYGKWASAVAGR 60
Query: 176 LSVGGLSCK 184
L GGLSCK
Sbjct: 61 LRAGGLSCK 69
>gi|255080954|ref|XP_002504043.1| predicted protein [Micromonas sp. RCC299]
gi|226519310|gb|ACO65301.1| predicted protein [Micromonas sp. RCC299]
Length = 326
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 138/305 (45%), Gaps = 43/305 (14%)
Query: 29 RFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKG---QDLLVKRGELVPLDF 85
R ++ +P A AS + IVG GRVGT + +M + + V RG +
Sbjct: 20 RASRMSPARCRADASSSGADTV--TIVGRGRVGTTIGKMCESLGVRHAFVTRG-MASFPP 76
Query: 86 EGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMI-EPWLESKGLKD-ANQVLAYFAVS 143
GPI+V T DL+ VL P R NDLV Q G++ + +L+ +GL A QV Y + S
Sbjct: 77 SGPIYVATHASDLDDVLALVPTDRANDLVLLQGGLLRDDFLQRRGLAGVATQVALYMSAS 136
Query: 144 KLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGG-LSCKVLDKEAFQKQMLEKLIWI 202
G T + G T A G A+ V E L+ GG + C V++++ F+ + KL W
Sbjct: 137 GDG-------TARDGGGGTCACGPRAADVRELLTRGGNVRCAVVNEDDFRVASVGKLCWT 189
Query: 203 SAFMLVGARHTGA--------TVG-VVEKEYRSEVSALIAELALAAAAEKGITFDP---- 249
SAF L+ R A TVG VV+ E+ + +A L A G D
Sbjct: 190 SAFWLL-CRSVAAERGLPRALTVGEVVDSEHGAAAVRTLARELLDCAEASGELDDTAGDE 248
Query: 250 ---AMEDRLCAYSR----AVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWL 302
AM + YSR AV + A+KE +RNGWF + +S P H L
Sbjct: 249 AREAMTRAMFEYSRSIPSAVPSLEMALKEGGFRNGWFLARRTVSS------PQRTHETHL 302
Query: 303 KEIKV 307
+ I V
Sbjct: 303 RRIGV 307
>gi|159491118|ref|XP_001703520.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280444|gb|EDP06202.1| predicted protein [Chlamydomonas reinhardtii]
Length = 349
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 111/274 (40%), Gaps = 71/274 (25%)
Query: 86 EGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLK-------------- 131
+GPI+VCT ND L+ VLE PR+R DLVFFQNG + PWL L+
Sbjct: 86 QGPIYVCTTNDALDWVLEQTPRNRRCDLVFFQNGWLAPWLAHNSLRLGPGSAAGAAAEGG 145
Query: 132 ----------------------DANQVLAYFAVSKLGERPIDGKTDTNPEGL-TAAYGKW 168
+A+ + V+ + G T +GL T A G+W
Sbjct: 146 AAAAAGREAEAARGDGGGAGGAEASGPGSAVEVTLVALYMAAGPDGTAKDGLRTVASGRW 205
Query: 169 ASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVGVVEKEYRSEV 228
A V+ LS G + C+ + ++EKL+W F ++ A G VG + +R +V
Sbjct: 206 AQHVSRTLSRGAVRCRAVGGADMYGAVVEKLLWACVFWMMSAALGGMNVGDIVARHRGDV 265
Query: 229 SALIAELALAAAAE-KGITFD-----------------------PAMEDRLCAYSRAVAN 264
+AL AEL + +GI A+ + + S AV +
Sbjct: 266 AALTAELLPPLCRQLRGIAAAGAGPEAAEAAAALEAPAGAERVVAALVEYSVSISSAVPS 325
Query: 265 FPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLH 298
A+ EF+WRNG + P PLH
Sbjct: 326 RDMALAEFRWRNGALLDMG----------PTPLH 349
>gi|303271083|ref|XP_003054903.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462877|gb|EEH60155.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 376
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 129/311 (41%), Gaps = 69/311 (22%)
Query: 54 IVGGGRVGTAL-KEMGKGQDLLVKRGELVPLD-FEGPIFVCTRNDDLEAVLEAAPRSRWN 111
+ G GRV + L + +G + +LV R P D GPI+V T DL AVL P SR
Sbjct: 63 VFGRGRVASVLARALGISRCVLVGRYNSFPADDIGGPIYVATHASDLAAVLAKVPASRRR 122
Query: 112 DLVFFQNGMI-EPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWAS 170
D+V Q G++ WLE++G DA QV + + + G+ +DG G T A G A+
Sbjct: 123 DVVLMQGGLLRRAWLEARGAADATQVALFLSARRDGD-VVDGG------GRTVACGPHAA 175
Query: 171 VVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGA-----TVG--VVEKE 223
V E L+ G+ C +D F EKL+W S F L+ A GA TVG V + +
Sbjct: 176 HVVETLARVGVRCAEVDATEFGTAATEKLLWASIFWLMCATRGGAGGDAVTVGEIVNDVD 235
Query: 224 YRSEVSALIAEL-----------ALAAAAEKGIT-------------------------- 246
+ AL EL A AAE G +
Sbjct: 236 ASARAEALAVELMRVVVAAGELDGGADAAEDGTSTVAGSPPAPALPLLDAEALEYEGTGF 295
Query: 247 -FDPAMEDRLCAYS--RAVANFP-------TAVKEFKWRNGWFYSLSEKASAEGKPDPCP 296
D E L R + P A +E +RNGWF + +E+ + E P
Sbjct: 296 GTDAEFEQGLIVIGLFRYCDDIPDAAPSAEMATREGAFRNGWFLAAAERYALE-----TP 350
Query: 297 LHTAWLKEIKV 307
LH L++ V
Sbjct: 351 LHERLLRDAGV 361
>gi|224053048|ref|XP_002297681.1| predicted protein [Populus trichocarpa]
gi|222844939|gb|EEE82486.1| predicted protein [Populus trichocarpa]
Length = 60
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/74 (63%), Positives = 54/74 (72%), Gaps = 14/74 (18%)
Query: 235 LALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDP 294
LA AAAAEKGI F A+E+RLCAY+ AVA+FPTAVKE A +EGKPDP
Sbjct: 1 LASAAAAEKGIVFKEAIEERLCAYAGAVAHFPTAVKE--------------AVSEGKPDP 46
Query: 295 CPLHTAWLKEIKVV 308
CPLHT+WLKE+KVV
Sbjct: 47 CPLHTSWLKELKVV 60
>gi|95113971|gb|ABF55519.1| chloroplast At1g16080 protein [Solanum tuberosum]
Length = 81
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/75 (60%), Positives = 50/75 (66%)
Query: 110 WNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWA 169
W+DLVFFQNGM EP +SK L++A LAY AVSKLG+ P DGKTDT PEG AYGKW
Sbjct: 7 WSDLVFFQNGMQEPGFQSKALEEAEPSLAYLAVSKLGKPPTDGKTDTIPEGWMPAYGKWP 66
Query: 170 SVVAERLSVGGLSCK 184
S GG S K
Sbjct: 67 SPGVPGCRNGGFSSK 81
>gi|422293651|gb|EKU20951.1| hypothetical protein NGA_0116101 [Nannochloropsis gaditana CCMP526]
gi|422295684|gb|EKU22983.1| hypothetical protein NGA_0116102 [Nannochloropsis gaditana CCMP526]
Length = 222
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 100/214 (46%), Gaps = 28/214 (13%)
Query: 116 FQNGMIEPWLESKGL--KDANQVLAYFAVSKLGERPIDG-------KTDTNPE--GLTAA 164
Q+ +E L+ GL K Q L YF++S G P++G KTD P+ G T
Sbjct: 1 MQSACLETLLKRYGLANKKTTQCLPYFSISPFG-LPVEGRVQYDVSKTDGEPKFAGQTVV 59
Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGA---TVGVVE 221
G W VAERL L +V+D ++K MLE+L++ S F L+ H + T G +
Sbjct: 60 CGTWRGAVAERLRKQDLHVEVMDYTYWRKHMLERLLFDSLFGLLACVHADSGEKTYGELA 119
Query: 222 KEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPT---------AVKEF 272
+ + E+ ++ EL+ A A + +E R+ R +A P +E+
Sbjct: 120 QNHWEEIDDMVWELSRAIANIMAVVLLTHVERRV----RGLAEGPHWRDRKPQMPTCREY 175
Query: 273 KWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIK 306
+RNG SLS ++ G P P P H +L ++
Sbjct: 176 PFRNGMLRSLSRSFTSRGFPSPMPTHDEYLDFVR 209
>gi|218200804|gb|EEC83231.1| hypothetical protein OsI_28521 [Oryza sativa Indica Group]
Length = 162
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 33/39 (84%), Positives = 38/39 (97%)
Query: 269 VKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEIKV 307
+++FKWRNGWFYSL+EKA A+GKPDPCPLHTAWLKEIKV
Sbjct: 1 MEQFKWRNGWFYSLTEKALAQGKPDPCPLHTAWLKEIKV 39
>gi|95113966|gb|ABF55517.1| chloroplast At1g16080 protein [Helianthus annuus]
Length = 48
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 38/48 (79%), Positives = 40/48 (83%)
Query: 137 LAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCK 184
YFAVS LGE P+DG TD NPEGLTAAYGKWASVVA RL+ GGLSCK
Sbjct: 1 FGYFAVSXLGEPPVDGITDANPEGLTAAYGKWASVVATRLNAGGLSCK 48
>gi|255543393|ref|XP_002512759.1| hypothetical protein RCOM_1442650 [Ricinus communis]
gi|223547770|gb|EEF49262.1| hypothetical protein RCOM_1442650 [Ricinus communis]
Length = 76
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 35/48 (72%), Positives = 41/48 (85%), Gaps = 2/48 (4%)
Query: 259 SRAVANFPTAVKE--FKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKE 304
SRAV + P AV+E FKWRNGW YSLS+KA AEGKPDPCPL+T+WLK+
Sbjct: 4 SRAVGHLPAAVREAKFKWRNGWSYSLSQKAIAEGKPDPCPLYTSWLKD 51
>gi|397645963|gb|EJK77058.1| hypothetical protein THAOC_01133, partial [Thalassiosira oceanica]
Length = 147
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 68/128 (53%), Gaps = 13/128 (10%)
Query: 27 KPRFAKPTPVSAFAMASFTTTQVAPAAIVGGGRVGTALKEMGKGQDLLVKRGELVPLDFE 86
+ R T +SA A A+ T+ V ++GGGR+G+ L E L+ R + + +
Sbjct: 29 QSRTTATTSLSA-ANANVATSDV----VIGGGRIGSILAEDAT----LLGREDSISSSID 79
Query: 87 ----GPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAV 142
GPI+V TRND LE +++ P R DLVF QNG ++ +LE KGL Q L Y +V
Sbjct: 80 PAGTGPIYVATRNDVLEGIVDDWPAGRRKDLVFLQNGYLDSFLERKGLLQNTQALLYLSV 139
Query: 143 SKLGERPI 150
G P+
Sbjct: 140 PAKGADPV 147
>gi|95113964|gb|ABF55516.1| chloroplast At1g16080 protein [Helianthus annuus]
Length = 49
Score = 67.8 bits (164), Expect = 7e-09, Method: Composition-based stats.
Identities = 35/49 (71%), Positives = 37/49 (75%)
Query: 136 VLAYFAVSKLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCK 184
VLAYFAVSKLGE +DG TD NPEGLTAA ASV RL+ GGLSCK
Sbjct: 1 VLAYFAVSKLGEPHVDGITDANPEGLTAADDTCASVAYSRLNAGGLSCK 49
>gi|219113737|ref|XP_002186452.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583302|gb|ACI65922.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 326
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 100/234 (42%), Gaps = 30/234 (12%)
Query: 84 DFEGPIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVS 143
D PIFV T V + +R DLV+ NG+ P E + V+ +F V
Sbjct: 101 DDGNPIFVATPAHSWRDVYTYSKPARRTDLVWVGNGLPLPAFERSTV-----VVPHFGVL 155
Query: 144 KLGERPIDGKTDTNPEGLTAAYGKWASVVAERLSVGGLSCKVLDK-EAFQKQMLEKLIWI 202
++GE P+ + ++P T YGK V G+ KV+D K++W
Sbjct: 156 QVGESPV--TSISSPP--TYVYGKHTDAVCTISQKEGVFVKVIDSWHDLYAITARKVLWA 211
Query: 203 SAFMLVGARHTGA---TVGVVEKEYRSEVSALIAELALAAAAEKGIT-FDPAMED---RL 255
S LV RH+ TV K+ R + L++EL A G++ + ED L
Sbjct: 212 SVMWLV--RHSETPPLTVAAAHKQKRQLLIDLVSELFAPLQAIVGLSAYTNEYEDTFIYL 269
Query: 256 CAYS----RAVANFPTAVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEI 305
YS A+ N A+ E K RNG + SLS + P LH A ++E+
Sbjct: 270 ELYSMSMPYAIPNKQLAIDEIKERNGIWLSLSAQF-------PQELHQALVREV 316
>gi|223992611|ref|XP_002285989.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977304|gb|EED95630.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 409
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 97/222 (43%), Gaps = 29/222 (13%)
Query: 63 ALKEMGKGQDLLVKRGELVPLDF----EGPIFVCTRNDDLEAVLEAAP---RSRWNDLVF 115
A+ +GKG DLL+ E +P PI++ +L ++ ++R D VF
Sbjct: 110 AIVGLGKGNDLLLTPNEPLPPAIGETTSYPIYITLPPYELREFIQQLDEEWKARAEDFVF 169
Query: 116 FQNG----MIEPWLESKGL-KDA-NQVLAYFAVSKLG-------ERPIDGKT----DTNP 158
F G ++EP L G+ +DA QV F++ K G +P D D N
Sbjct: 170 FSGGKICGVVEPTLREFGMCRDAMTQVTVGFSLPKPGTGLGGTTRKPEDMSCNIGADVNG 229
Query: 159 E----GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGA-RHT 213
E G + A GKW +A RLS + CK + +++ M E+ ++ + + +VGA R
Sbjct: 230 ENKWAGESVACGKWNGAIAGRLSENAIRCKTVFYREWRRAMWERALYDAVWNVVGAVRDE 289
Query: 214 GATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRL 255
V + E S ++ E+A +T EDRL
Sbjct: 290 PTDHSDVAMFFELEASDMMWEIANHLRGGLAVTLIYGFEDRL 331
>gi|428178288|gb|EKX47164.1| hypothetical protein GUITHDRAFT_152174 [Guillardia theta CCMP2712]
Length = 123
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 59/125 (47%), Gaps = 21/125 (16%)
Query: 54 IVGG-GRVGTALKEMGKGQDLLVKRGELVPLDFEG-------PIFVCTRNDDLEAVLEAA 105
IVGG GR+G+A++ G DL R LD G PIFVCT ++LE V+
Sbjct: 5 IVGGNGRIGSAIRRFA-GDDLYCTR----KLDKIGVNSPTNTPIFVCTGVENLEEVVSKT 59
Query: 106 PRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGERPIDGKTDTNPEGLTAAY 165
SR DLVF QNG++ L +D + YF+V + +G G +
Sbjct: 60 EPSRRRDLVFMQNGLVRSLLVDLE-EDQTIAVLYFSVLERNGPAKEG-------GCSYVQ 111
Query: 166 GKWAS 170
G+WA
Sbjct: 112 GRWAQ 116
>gi|397641392|gb|EJK74625.1| hypothetical protein THAOC_03687 [Thalassiosira oceanica]
Length = 381
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 101/242 (41%), Gaps = 37/242 (15%)
Query: 63 ALKEMGKGQDLLVKRGELVPLDFEG----PIFVCT---RNDDLEAVLEAAPRSRWNDLVF 115
A+ +GKG+D++ + +P G PI++ D LE ++R D VF
Sbjct: 85 AIVALGKGEDIVRSPRDPLPDTMGGRDKYPIYIALPPYELRDFLQNLEGDWKNRTEDFVF 144
Query: 116 FQNG----MIEPWLESKGL-KDA-NQVLAYFAVSKLGERPIDGKT--------------D 155
G ++EP L G+ +D+ Q+L F++ RP+ G D
Sbjct: 145 ISGGRSCGVVEPILREFGMCRDSMTQLLVGFSLP----RPLTGSVNARKPEDLSVRLGED 200
Query: 156 TNPE----GLTAAYGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGAR 211
N E G + A GKW VA RL + CK +++ M E+ ++ + +VG+
Sbjct: 201 ANGEAKWAGESQACGKWNGAVAHRLKENAIRCKQCFYREWRRAMWEQAMYDCVWNVVGSV 260
Query: 212 HTGATVGV-VEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVA-NFPTAV 269
T + V Y E S ++ ELA +T EDRL +++ + P +
Sbjct: 261 REEKTDHLDVALYYELESSDMMWELANGLRGGLAVTLIYGFEDRLFSFAEQRGKDMPCEI 320
Query: 270 KE 271
E
Sbjct: 321 DE 322
>gi|317051822|ref|YP_004112938.1| 2-dehydropantoate 2-reductase [Desulfurispirillum indicum S5]
gi|316946906|gb|ADU66382.1| 2-dehydropantoate 2-reductase [Desulfurispirillum indicum S5]
Length = 318
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 79/167 (47%), Gaps = 17/167 (10%)
Query: 113 LVFFQNGM-IE-PWLESKGLKDANQVLAYFAVSKLGERPIDGKT------DTNPEGLTAA 164
+V QNG+ IE P+ ++ + VLA+ VS+L I + PEG T
Sbjct: 107 IVLIQNGIDIEVPYRQAYPEHEIISVLAFVCVSRLSPGKILHQDYGKLALGNYPEGKTP- 165
Query: 165 YGKWASVVAERLSVGGLSCKVLDKEAFQKQMLEKLIWISAFMLVGARHTGATVG--VVEK 222
+ + E L +GG++C+V D+ Q+ KL+W ++F + G + ++
Sbjct: 166 ---YLEALVELLRLGGVTCEVTDR--VQEARWRKLVWNASFNTLSVLGGGVDTAQLLADE 220
Query: 223 EYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAYSRAVANFPTAV 269
E V ++ E+ L AAAE G P++ D ++R + + T++
Sbjct: 221 ESCQLVRQVMEEVCLVAAAE-GHRLVPSIVDEQIEHTRVMTPYKTSM 266
>gi|223995543|ref|XP_002287445.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976561|gb|EED94888.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 800
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 65/154 (42%), Gaps = 6/154 (3%)
Query: 88 PIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQVLAYFAVSKLGE 147
PI+ C + +E+V A R DLVF N + L +D + + +F V + +
Sbjct: 92 PIYACIPSSSIESVYYATIPQRRKDLVFLCNCIPSRHLAFASGEDVSIAILHFGVPTMKD 151
Query: 148 RPIDGKTDTNPEG-LTAAYGKWASVVAERLSVGGLSCKVLDKEA-FQKQMLEKLIWISAF 205
+ +T+PE T YG+ A+ + L G++ V A Q +K+ W S
Sbjct: 152 ADSYPQANTSPESPPTVVYGRHATALVRLLHNDGIASVVASTPAEVQAAAAKKIAWSSLM 211
Query: 206 ML----VGARHTGATVGVVEKEYRSEVSALIAEL 235
L VG TV V K ++ L+ E+
Sbjct: 212 WLLCHDVGNEEDPLTVTEVHKYKSEQLQCLVEEI 245
>gi|397623999|gb|EJK67231.1| hypothetical protein THAOC_11766 [Thalassiosira oceanica]
Length = 800
Score = 44.3 bits (103), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 100/240 (41%), Gaps = 30/240 (12%)
Query: 88 PIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGLKDANQV-LAYFAVS--- 143
PI+ C + L +V A R D+V+ N + L D V + +F +S
Sbjct: 124 PIYACVPSTALHSVWTATAPYRRRDIVYLCNCIPSRHLAFVNSSDDVTVGILHFGISFDD 183
Query: 144 KLGERPIDGKTDTNPEG-LTAAYGKWASVVAERLSVGGLSCKVLDK-EAFQKQMLEKLIW 201
+ ERP + K +T+P T +G+ A VA L G+ KV + Q +K+ W
Sbjct: 184 SVNERP-NPKLNTSPGSPETILHGRHALAVARLLEDDGIPVKVASEWNEVQSAAAKKIAW 242
Query: 202 ISAFMLVGARH--------TGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAME- 252
S L+ H AT + K+ E+ + LA + +G + D E
Sbjct: 243 SSLMWLLCHEHADPIPVKEVHATKAGLLKQLVEEIHPALTILANESGTVEGSSMDTVAEV 302
Query: 253 -DRLCAYSRAVAN---FPT---AVKEFKWRNGWFYSLSEKASAEGKPDPCPLHTAWLKEI 305
D L +YS ++ P+ A+ E RNG SL S EG+ H++ L+++
Sbjct: 303 MDYLESYSMSINGGNVVPSKRLALDELHERNGLLLSL---VSDEGQGR----HSSLLRQV 355
>gi|422295724|gb|EKU23023.1| hypothetical protein NGA_0718300 [Nannochloropsis gaditana CCMP526]
Length = 124
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 2/61 (3%)
Query: 88 PIFVCTRNDDLEAVLEAAPRSRWNDLVFFQNGMIEPWLESKGL--KDANQVLAYFAVSKL 145
PI VC ++E+V+ P+ + NDLVF Q+ +E L+ G K Q L F++S
Sbjct: 30 PIDVCVPEWEVESVILGCPKEKPNDLVFMQSACLETLLKRYGWANKKTTQCLPDFSISPF 89
Query: 146 G 146
G
Sbjct: 90 G 90
>gi|408532212|emb|CCK30386.1| 2-dehydropantoate 2-reductase [Streptomyces davawensis JCM 4913]
Length = 298
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 86/198 (43%), Gaps = 19/198 (9%)
Query: 88 PIFVC---TRNDDLEAVLEAAPRSRWND--LVFFQNGMIEPWLESKGLKDANQVLAYFAV 142
P+ VC ++ L++ LE P D LV F NG+ P + +D A V
Sbjct: 69 PVDVCLVGVKHTSLDSALERIPAEALGDALLVPFLNGVEHPAVLRAHHRDDRVAPAVIRV 128
Query: 143 --SKLGERPIDGKTDTNPEGLTAAYGKWASV--VAERLSVGGLSCKVLDKEAFQKQMLEK 198
+++G I+ + LTA V + E L G +VLD E+ + K
Sbjct: 129 ESTRVGPGVIEHGSPFAEIDLTADSVPRERVEGLGEVLGAAGPVTRVLDGES--GVLWAK 186
Query: 199 LIWISAFMLVGARHTGATVGVVEKEYRSEVSALIAELALAAAAEKGITFDPAMEDRLCAY 258
+ +++ F L+ R+ G +G V +YR E+ AL+ E+A + A G DP A
Sbjct: 187 MSFLAPFALLTTRY-GLPLGEVRTQYRDELVALVEEVAAVSQACGG-PGDPGQ-----AL 239
Query: 259 SRAVANFPTAVKEFKWRN 276
R A FP A K R+
Sbjct: 240 GRYDA-FPAATKSSMQRD 256
>gi|422010884|ref|ZP_16357793.1| peptidase, S8/S53 family [Actinomyces georgiae F0490]
gi|394767135|gb|EJF47951.1| peptidase, S8/S53 family [Actinomyces georgiae F0490]
Length = 766
Score = 37.4 bits (85), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 238 AAAAEKGITFDPAMEDRLCAYSRAVANFPTAVKEFKWRN---GWFYSLSE 284
A KG DPA +D + ++AV F TAVK +W N GWFY S+
Sbjct: 554 ATVTAKGANNDPAADDDI---TQAVGFFATAVKTGRWMNTGKGWFYQYSD 600
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.132 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,861,057,079
Number of Sequences: 23463169
Number of extensions: 196665070
Number of successful extensions: 504682
Number of sequences better than 100.0: 87
Number of HSP's better than 100.0 without gapping: 74
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 504517
Number of HSP's gapped (non-prelim): 91
length of query: 308
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 166
effective length of database: 9,027,425,369
effective search space: 1498552611254
effective search space used: 1498552611254
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)