BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017641
(368 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255544061|ref|XP_002513093.1| conserved hypothetical protein [Ricinus communis]
gi|223548104|gb|EEF49596.1| conserved hypothetical protein [Ricinus communis]
Length = 452
Score = 436 bits (1122), Expect = e-120, Method: Compositional matrix adjust.
Identities = 233/369 (63%), Positives = 276/369 (74%), Gaps = 35/369 (9%)
Query: 25 QKPKLSSL--------KLSFFLNATSNNDEASEQPISENSPEPEASLDPVKLAFEKAKAY 76
QKP +SL K+SF LN + N E+S+ P E++PEP+ DPVKLAFEKAKAY
Sbjct: 15 QKPTTTSLLPSKTSFFKVSFSLNPS--NAESSQNP-PESTPEPDP--DPVKLAFEKAKAY 69
Query: 77 RKLKESNSDSKYEQNP----------DKDV---ARAAVEKAKEYKKNKDTVSFQNGT--- 120
+K E +K EQNP DK+V + A+EKA+EYKKN+ V + T
Sbjct: 70 KKTIEETKKAKLEQNPVEGSAGNNGKDKEVPVSVKVAMEKAREYKKNQ-VVGTKGATESE 128
Query: 121 -NSGFKGEGRGNLPK----EVGGKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPF 175
+SG K GNL + + KK+ L ISSIDFMGLNFADKK G+GLPAGL PV DPF
Sbjct: 129 MSSGLKARSEGNLQRGSIEKTATKKDKLSISSIDFMGLNFADKKTGRGLPAGLAPVIDPF 188
Query: 176 AEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGG 235
EG+ PEVEIIVGDT F + T P P Q+E+ + YKPKVSTWGVFPRPGNISKTFGGG
Sbjct: 189 PEGNLPEVEIIVGDTDNFDDPTTSMPKPSQDESSDVYKPKVSTWGVFPRPGNISKTFGGG 248
Query: 236 RTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGK 295
RTIRPGDVLETAE RAAK++RT+QLLAAY+K +GL++DPK+K ECEKALKDGDSLMDSGK
Sbjct: 249 RTIRPGDVLETAEERAAKDDRTKQLLAAYRKKMGLSIDPKIKFECEKALKDGDSLMDSGK 308
Query: 296 LKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKR 355
LKEA+P+Y+KV++K+ F+SELHGLAALQWSICQDSL R EA+ MYEKLQSHPNA VSK+
Sbjct: 309 LKEAVPYYQKVIDKLPFQSELHGLAALQWSICQDSLSRQNEAQAMYEKLQSHPNAKVSKK 368
Query: 356 ARQFMFSFQ 364
ARQFMFSFQ
Sbjct: 369 ARQFMFSFQ 377
>gi|356538737|ref|XP_003537857.1| PREDICTED: uncharacterized protein LOC100784194 [Glycine max]
Length = 442
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 209/374 (55%), Positives = 260/374 (69%), Gaps = 14/374 (3%)
Query: 1 MASLQTSWLSSVSTIKNKNDSSAKQKPKLSSLKLSFFLNATSNNDEASEQPISENSPEPE 60
MA LQ W SS+ + + +A P + A ++ S +P S ++PEP
Sbjct: 1 MACLQPPWFSSLRVV-SPAKLAAGPPPSTTYKPAKLLFWAVGPDNAESSEPASPDAPEPA 59
Query: 61 ASLDPVKLAFEKAKAYRKLKESNSDSKYEQNPDK-------DVARAAVEKAKEYKKNKDT 113
A +DPV+LAF KA AY+ ++ +S E+N D + A+EKAK+YK+NK
Sbjct: 60 APVDPVELAFSKANAYKADYNNDENSAEERNVGDETPKDLPDSVKIAIEKAKKYKQNKAV 119
Query: 114 VSFQNGTNSGFKGEGRGNLPKEVGGKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTD 173
+ T R + +VG K E L +S +DF GL+FADKK +GLP GLVP+++
Sbjct: 120 AAVTETTQGSLGVSERSSGKNKVGKKGE-LSVSRMDFAGLDFADKKMTRGLPPGLVPISE 178
Query: 174 PFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQ---EENLEFYKPKVSTWGVFPRPGNISK 230
P+ +GD PEVE+I+GDTSKF ++T P P+Q E+ E YKPKVSTWGVFPRPGNISK
Sbjct: 179 PYFDGDLPEVELIIGDTSKFDDATT--PEPEQTNKEDEAELYKPKVSTWGVFPRPGNISK 236
Query: 231 TFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSL 290
TFGGGR IRPG+VLET E +A KE RT+QLLAAYKK GLNVDPKLKSECE+ALKDGD L
Sbjct: 237 TFGGGRVIRPGEVLETKEEKAVKEARTKQLLAAYKKKTGLNVDPKLKSECEEALKDGDLL 296
Query: 291 MDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNA 350
M+ GKLKEALP+YEKVM+K+ F+SELHGLAALQWSICQDSL R EAR MYEKL+SHPN
Sbjct: 297 MNVGKLKEALPYYEKVMDKLTFQSELHGLAALQWSICQDSLSRSNEARGMYEKLKSHPNP 356
Query: 351 LVSKRARQFMFSFQ 364
VSK+ARQFM+SFQ
Sbjct: 357 KVSKKARQFMYSFQ 370
>gi|296081287|emb|CBI17731.3| unnamed protein product [Vitis vinifera]
Length = 454
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 212/386 (54%), Positives = 263/386 (68%), Gaps = 37/386 (9%)
Query: 1 MASLQTSWLSSVSTIKNKNDSSAKQKPKLSSLKLSFFLNATSNNDEASEQPISENSPEP- 59
++S + S +S + I N P++ LK+SF + + +N E+S + P
Sbjct: 17 ISSTKPSLFTSTTPILN---------PQIRPLKVSF--SQSQSNAESSPSNSPNSGEGPL 65
Query: 60 ---EASLDPVKLAFEKAKAYRKLKESNSDSKYEQNPDKDV----------------ARAA 100
S DP KL+F +AK Y+K + D NP + A
Sbjct: 66 EVKPGSSDPDKLSFARAKEYKKTTKQEMDG----NPGLGSEGNGNGGGEAQEMPVSVKIA 121
Query: 101 VEKAKEYKKNKDTV--SFQNGTNSGFKGEGRGNLPKEVGGKKEGLKISSIDFMGLNFADK 158
+EKAKEY KNK V S + G G + ++ KKE L ISSIDF+GLNF+DK
Sbjct: 122 LEKAKEYTKNKGVVGDSRSEALSGGNAGNLGNGIVEKGDDKKEQLAISSIDFVGLNFSDK 181
Query: 159 KEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVST 218
K G+GLPAGLVPV+DPF EGD P VE+IVGDTSKF ++T P P Q++N + YKPKVST
Sbjct: 182 KTGRGLPAGLVPVSDPFPEGDLPGVELIVGDTSKFEDATSSNPKPTQDDNSDLYKPKVST 241
Query: 219 WGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKS 278
WGVFPRP NISKTFGGGR IRPG+VLETA+ +AAKE RTRQLLAAYK +GLN+DPKLKS
Sbjct: 242 WGVFPRPNNISKTFGGGRVIRPGEVLETADDKAAKEARTRQLLAAYKTKIGLNIDPKLKS 301
Query: 279 ECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEAR 338
EC++ALK+GDSLM+ GKLKEALP+YEKVM+K+ F+SELHGLAALQWSIC DSL RP EAR
Sbjct: 302 ECDQALKNGDSLMNIGKLKEALPYYEKVMDKLTFQSELHGLAALQWSICLDSLSRPDEAR 361
Query: 339 IMYEKLQSHPNALVSKRARQFMFSFQ 364
+MYE+LQ+HPNALVSK+ARQ +F FQ
Sbjct: 362 VMYERLQTHPNALVSKKARQLIFGFQ 387
>gi|225449870|ref|XP_002265393.1| PREDICTED: uncharacterized protein LOC100248180 [Vitis vinifera]
Length = 457
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 212/400 (53%), Positives = 261/400 (65%), Gaps = 62/400 (15%)
Query: 1 MASLQTSWLSSVSTIKNKNDSSAKQKPKLSSLKLSFFLNATSNNDEASEQPISENSPEP- 59
++S + S +S + I N P++ LK+SF + + +N E+S + P
Sbjct: 17 ISSTKPSLFTSTTPILN---------PQIRPLKVSF--SQSQSNAESSPSNSPNSGEGPL 65
Query: 60 ---EASLDPVKLAFEKAKAYRKLKESNSDSKYEQNPDKDV----------------ARAA 100
S DP KL+F +AK Y+K + D NP + A
Sbjct: 66 EVKPGSSDPDKLSFARAKEYKKTTKQEMDG----NPGLGSEGNGNGGGEAQEMPVSVKIA 121
Query: 101 VEKAKEYKKNKDTVSFQNGTNSGFKGEGRGNLPKEV----------------GGKKEGLK 144
+EKAKEY KNK G G+ R E+ KKE L
Sbjct: 122 LEKAKEYTKNK-----------GVVGDSRSEALSELTGGNAGNLGNGIVEKGDDKKEQLA 170
Query: 145 ISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPK 204
ISSIDF+GLNF+DKK G+GLPAGLVPV+DPF EGD P VE+IVGDTSKF ++T P P
Sbjct: 171 ISSIDFVGLNFSDKKTGRGLPAGLVPVSDPFPEGDLPGVELIVGDTSKFEDATSSNPKPT 230
Query: 205 QEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAY 264
Q++N + YKPKVSTWGVFPRP NISKTFGGGR IRPG+VLETA+ +AAKE RTRQLLAAY
Sbjct: 231 QDDNSDLYKPKVSTWGVFPRPNNISKTFGGGRVIRPGEVLETADDKAAKEARTRQLLAAY 290
Query: 265 KKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQW 324
K +GLN+DPKLKSEC++ALK+GDSLM+ GKLKEALP+YEKVM+K+ F+SELHGLAALQW
Sbjct: 291 KTKIGLNIDPKLKSECDQALKNGDSLMNIGKLKEALPYYEKVMDKLTFQSELHGLAALQW 350
Query: 325 SICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
SIC DSL RP EAR+MYE+LQ+HPNALVSK+ARQ +F FQ
Sbjct: 351 SICLDSLSRPDEARVMYERLQTHPNALVSKKARQLIFGFQ 390
>gi|449452134|ref|XP_004143815.1| PREDICTED: uncharacterized protein LOC101215292 [Cucumis sativus]
Length = 475
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 205/346 (59%), Positives = 240/346 (69%), Gaps = 36/346 (10%)
Query: 52 ISENSPEPEASLDPVKLAFEKAKAYRKLKESNS--------------------------- 84
+S+ +P P LDPVKLAFE+AKAY+KL +S S
Sbjct: 61 VSDAAPPP---LDPVKLAFERAKAYKKLSKSGSNLNVELKPGVGSEGNSVQTGKSGVLSF 117
Query: 85 DSKYEQNPDKDVARAAVEKAKEYKKNKDTVSFQN-----GTNSGFKGEGRGNLP-KEVGG 138
D EQ + R AVE A E K V+ TN G GNL K+ G
Sbjct: 118 DGADEQRKMQGGVRVAVESANEVKGEAKVVTDGTKGGVINTNEGLNDRDGGNLGNKQKGD 177
Query: 139 KKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTV 198
KK L ISSIDF+GL FADKK+ +GLPAGLVP++DPF+ D PEVEIIVGD+SKF ++TV
Sbjct: 178 KKGELSISSIDFIGLGFADKKKSRGLPAGLVPISDPFSVEDLPEVEIIVGDSSKFDDATV 237
Query: 199 LRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTR 258
P QE++ +FYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLET E +A KE RT+
Sbjct: 238 SEIKPTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAVKEARTK 297
Query: 259 QLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHG 318
+L+AAYKK GL +D KLKSECE AL++GDSLM+ GKLKEALP+YE +M K+ F+SELHG
Sbjct: 298 ELIAAYKKKFGLTIDAKLKSECEMALEEGDSLMNDGKLKEALPYYETIMEKVNFQSELHG 357
Query: 319 LAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
LAALQWSICQDSL RP AR MYEKL+SHPN VSK+ARQFMFSFQ
Sbjct: 358 LAALQWSICQDSLSRPDVAREMYEKLKSHPNPRVSKKARQFMFSFQ 403
>gi|224053314|ref|XP_002297759.1| predicted protein [Populus trichocarpa]
gi|222845017|gb|EEE82564.1| predicted protein [Populus trichocarpa]
Length = 315
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 176/226 (77%), Positives = 201/226 (88%)
Query: 139 KKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTV 198
K++ L ISSIDF+GL FADKK+G+GLPAGLVP+TDPF+EG+ P+VEIIVGDTSKF + +
Sbjct: 17 KEQKLSISSIDFVGLEFADKKKGRGLPAGLVPITDPFSEGNLPDVEIIVGDTSKFEDPST 76
Query: 199 LRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTR 258
L P QE+N + YKPKVSTWGVFPRPGNISKTFGGG+TIRPGD LETAEARAAK+ERT+
Sbjct: 77 LTSKPTQEDNPDLYKPKVSTWGVFPRPGNISKTFGGGKTIRPGDELETAEARAAKDERTK 136
Query: 259 QLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHG 318
QL+AAY+KS+GLNVDP +K ECEKALKDGDSLMDSGKL +ALP+Y+ VM+K+ FKSELHG
Sbjct: 137 QLIAAYRKSIGLNVDPNVKLECEKALKDGDSLMDSGKLNDALPYYQMVMDKLPFKSELHG 196
Query: 319 LAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
LAALQWSICQDSL RP EAR MYEKLQSHPN VSK ARQFMFSFQ
Sbjct: 197 LAALQWSICQDSLSRPNEARAMYEKLQSHPNVKVSKIARQFMFSFQ 242
>gi|388502572|gb|AFK39352.1| unknown [Lotus japonicus]
Length = 478
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 212/391 (54%), Positives = 259/391 (66%), Gaps = 34/391 (8%)
Query: 1 MASLQTSWLSSVSTIKN------KNDSSAKQKPKLSSLKLSFFLNATSNNDEASEQPISE 54
MASL++ WL + + KN N S P S+ +SNN E+SE
Sbjct: 1 MASLRSPWLRCIVSPKNGAIFPSTNVSVCMCLPS-STTTFRLCCALSSNNAESSEP---- 55
Query: 55 NSPEPEAS-LDPVKLAFEKAKAYRKLK-ESNSDSKYEQNP----------------DKDV 96
SPE + +DPVKLAF KAK Y++ +SN S EQ+ KD+
Sbjct: 56 TSPEAQTGPVDPVKLAFNKAKTYKEESGKSNPGSGIEQSDANGENSVNQGDAVDGRQKDL 115
Query: 97 ---ARAAVEKAKEYKKNKDTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLKISSIDFMGL 153
+ A+EKAK+YK+NK + T G +G L VG K+ L +S IDF+GL
Sbjct: 116 PVSLKIAMEKAKKYKQNKGVAIAE--TTQGLQGGRERTLENSVGEKEGKLSVSKIDFVGL 173
Query: 154 NFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYK 213
+F DKK+ +GLP GLVPVTD F++GD EV IVGD +KF T P +EE E YK
Sbjct: 174 DFGDKKKARGLPPGLVPVTDSFSDGDLTEVGFIVGDATKFDAVTDPEPEQTKEEESELYK 233
Query: 214 PKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVD 273
P VSTWGVFPRPGNISKTFGGGR IRPG+VLET E +A +E RT+QLLAAYKK GL++D
Sbjct: 234 PTVSTWGVFPRPGNISKTFGGGRVIRPGEVLETDEEKAMREVRTKQLLAAYKKRTGLDID 293
Query: 274 PKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHR 333
PKLKSECE+ALKDGD LM++GKLKEALP+YEKVM+K+ FKSELHGLAALQWSIC DSL+R
Sbjct: 294 PKLKSECEEALKDGDLLMNAGKLKEALPYYEKVMDKITFKSELHGLAALQWSICLDSLNR 353
Query: 334 PKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
EA+ MYEKL+SHPNA VSK+ARQF +SFQ
Sbjct: 354 SNEAQSMYEKLKSHPNAKVSKKARQFSYSFQ 384
>gi|33945880|emb|CAE45591.1| hypothetical protein [Lotus japonicus]
Length = 1217
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 215/397 (54%), Positives = 265/397 (66%), Gaps = 38/397 (9%)
Query: 1 MASLQTSWLSSVSTIKN------KNDSSAKQKPK-LSSLKLSFFLNATSNNDEASEQPIS 53
MASL++ WL + + KN N S P ++ KL L +SNN E+SE
Sbjct: 1 MASLRSPWLRCIVSPKNGAIFPSTNVSVCMCLPSSTTTFKLCCAL--SSNNAESSEP--- 55
Query: 54 ENSPEPEAS-LDPVKLAFEKAKAYRKLK-ESNSDSKYEQNP----------------DKD 95
SPE + +DPVKLAF KAK Y++ +SN S EQ+ KD
Sbjct: 56 -TSPEAQTGPVDPVKLAFNKAKTYKEESGKSNPGSGIEQSDANGENSVNQGDAVDGRQKD 114
Query: 96 V---ARAAVEKAKEYKKNKDTVSFQN--GTNSGFKGEG--RGNLPKEVGGKKEGLKISSI 148
+ + A+EKAK+YK+NK + G +G + +G L VG K+ L +S I
Sbjct: 115 LPVSLKIAMEKAKKYKQNKGVAIAETTQGLPTGNQTQGGRERTLENSVGEKEGKLSVSKI 174
Query: 149 DFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEEN 208
DF+GL+F DKK+ +GLP GLVPVTD F++GD EVE IVGD +KF T P +EE
Sbjct: 175 DFVGLDFGDKKKTRGLPPGLVPVTDSFSDGDLTEVEFIVGDATKFDAVTDPEPEQTKEEE 234
Query: 209 LEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSV 268
E YKP VSTWGVFPRPGNISKTFGGGR IRPG+VLET E +A +E RT+QLLAAYKK
Sbjct: 235 SELYKPTVSTWGVFPRPGNISKTFGGGRVIRPGEVLETDEEKAMREVRTKQLLAAYKKRT 294
Query: 269 GLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQ 328
GL++DPKLKSECE+ALKDGD LM++GKLKEALP+YEKVM+K+ FKSELHGLAALQWSIC
Sbjct: 295 GLDIDPKLKSECEEALKDGDLLMNAGKLKEALPYYEKVMDKITFKSELHGLAALQWSICL 354
Query: 329 DSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQV 365
DSL+R EA+ MYEKL+SHPNA VSK+ARQF +SFQ
Sbjct: 355 DSLNRSNEAQSMYEKLKSHPNAKVSKKARQFSYSFQA 391
>gi|164605522|dbj|BAF98588.1| CM0216.540.nc [Lotus japonicus]
Length = 484
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 215/397 (54%), Positives = 265/397 (66%), Gaps = 38/397 (9%)
Query: 1 MASLQTSWLSSVSTIKN------KNDSSAKQKPK-LSSLKLSFFLNATSNNDEASEQPIS 53
MASL++ WL + + KN N S P ++ KL L +SNN E+SE
Sbjct: 1 MASLRSPWLRCIVSPKNGAIFPSTNVSVCMCLPSSTTTFKLCCAL--SSNNAESSE---- 54
Query: 54 ENSPEPEAS-LDPVKLAFEKAKAYRKLK-ESNSDSKYEQNP----------------DKD 95
SPE + +DPVKLAF KAK Y++ +SN S EQ+ KD
Sbjct: 55 PTSPEAQTGPVDPVKLAFNKAKTYKEESGKSNPGSGIEQSDANGENSVNQGDAVDGRQKD 114
Query: 96 V---ARAAVEKAKEYKKNKDTVSFQN--GTNSGFKGEG--RGNLPKEVGGKKEGLKISSI 148
+ + A+EKAK+YK+NK + G +G + +G L VG K+ L +S I
Sbjct: 115 LPVSLKIAMEKAKKYKQNKGVAIAETTQGLPTGNQTQGGRERTLENSVGEKEGKLSVSKI 174
Query: 149 DFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEEN 208
DF+GL+F DKK+ +GLP GLVPVTD F++GD EVE IVGD +KF T P +EE
Sbjct: 175 DFVGLDFGDKKKTRGLPPGLVPVTDSFSDGDLTEVEFIVGDATKFDAVTDPEPEQTKEEE 234
Query: 209 LEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSV 268
E YKP VSTWGVFPRPGNISKTFGGGR IRPG+VLET E +A +E RT+QLLAAYKK
Sbjct: 235 SELYKPTVSTWGVFPRPGNISKTFGGGRVIRPGEVLETDEEKAMREVRTKQLLAAYKKRT 294
Query: 269 GLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQ 328
GL++DPKLKSECE+ALKDGD LM++GKLKEALP+YEKVM+K+ FKSELHGLAALQWSIC
Sbjct: 295 GLDIDPKLKSECEEALKDGDLLMNAGKLKEALPYYEKVMDKITFKSELHGLAALQWSICL 354
Query: 329 DSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQV 365
DSL+R EA+ MYEKL+SHPNA VSK+ARQF +SFQ
Sbjct: 355 DSLNRSNEAQSMYEKLKSHPNAKVSKKARQFSYSFQA 391
>gi|297823763|ref|XP_002879764.1| binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297325603|gb|EFH56023.1| binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 458
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 210/374 (56%), Positives = 262/374 (70%), Gaps = 24/374 (6%)
Query: 1 MASLQTSWLSSVSTIKNKNDSSAKQKPKLSSLKLSFFLNATSNNDEASEQPISENSPEPE 60
MAS SWLSS T Q+PK SLKLS F ++++ + S E E
Sbjct: 25 MASFNHSWLSSPLTETPTCFLYPSQQPK--SLKLSLFRTRSNSSSDRSS--------EVE 74
Query: 61 ASLDPVKLAFEKAKAYRKLKESNSDSKYEQNP-DKDV---ARAAVEKAKEYKKNK----D 112
++DPVKLA +KA+AY+K K + EQN D+++ +AA++KA +YKK K D
Sbjct: 75 LAVDPVKLALKKAEAYKKSKSEGKER--EQNAGDEELPLSVKAAMQKANDYKKRKGLGTD 132
Query: 113 TV--SFQNGTNSGFKGEGRGNLPKEVGGKKEGLKISSIDFMGLNFADKKEGKGLPAGLVP 170
V + + T F + ++ KK+ LK+SSIDFMGL FADKK +GLPAGLVP
Sbjct: 133 AVEEAKPSSTEQSFV-RSSNKVVEDNDVKKKELKVSSIDFMGLGFADKKSTRGLPAGLVP 191
Query: 171 VTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISK 230
+ D EGD PEVE IVGD ++F E V + + N + YKPKVSTWGVFPRP NISK
Sbjct: 192 IVDYLPEGDLPEVEFIVGDKTRFAEK-VKEVEQEGDGNSDVYKPKVSTWGVFPRPSNISK 250
Query: 231 TFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSL 290
TFGGGRT+RPGD +ETAE R +EE+T++LL AYK+S+GLN+DPKLK ECEKA+++G+SL
Sbjct: 251 TFGGGRTLRPGDSVETAEERIVREEQTKKLLIAYKESIGLNIDPKLKLECEKAIEEGNSL 310
Query: 291 MDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNA 350
MDSGKLKEALP+YEKVM K+VFKSELHGLAALQWSICQDSL + +AR MYEKL SHPN+
Sbjct: 311 MDSGKLKEALPYYEKVMEKIVFKSELHGLAALQWSICQDSLRKTDKARRMYEKLISHPNS 370
Query: 351 LVSKRARQFMFSFQ 364
VSK+ARQFMFSFQ
Sbjct: 371 GVSKKARQFMFSFQ 384
>gi|79570570|ref|NP_181412.2| uncharacterized protein [Arabidopsis thaliana]
gi|330254489|gb|AEC09583.1| uncharacterized protein [Arabidopsis thaliana]
Length = 465
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 211/373 (56%), Positives = 259/373 (69%), Gaps = 21/373 (5%)
Query: 1 MASLQTSWLSSVSTIKNKNDSSAKQKPKLSSLKLSFFLNATSNNDEASEQPISENSPEPE 60
MAS SWLSS T S Q PK SLKLS F ++++ E E
Sbjct: 31 MASFNHSWLSSPLTETPAFFFSPSQHPK--SLKLSLFRTRSNSSSPDRSS-------EVE 81
Query: 61 ASLDPVKLAFEKAKAYRKLKESNSDSKYEQNP-DKDV---ARAAVEKAKEYKKNK----D 112
+DPVKLA +KA+AY+K K + K E+N D+++ +AA++KA +YKK K D
Sbjct: 82 LDVDPVKLALKKAEAYKKSK--SEQKKPEKNAGDEELPLSVKAAMQKANDYKKRKGLGTD 139
Query: 113 TVSFQNGTNSGFKGEGRGN-LPKEVGGKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPV 171
V+ +N+ N + ++ KK+ LK+SSIDFMGL FADKK +GLPAGLVPV
Sbjct: 140 AVAKAKPSNTDQSFVRLANKVVEDNDVKKKELKVSSIDFMGLGFADKKSTRGLPAGLVPV 199
Query: 172 TDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKT 231
D EGD PEVE IVGD ++F E V + + N + YKPKVSTWGVFPRP NISKT
Sbjct: 200 VDYLPEGDLPEVEFIVGDKTRFAEK-VKEVEQEGDGNSDVYKPKVSTWGVFPRPSNISKT 258
Query: 232 FGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLM 291
FGGGRT+RPGD +ETAE R +EE+T++LL AYK+S+GLN+DPKLK ECEKA+ +G+SLM
Sbjct: 259 FGGGRTLRPGDSVETAEERIVREEKTKKLLIAYKESLGLNIDPKLKLECEKAIDEGNSLM 318
Query: 292 DSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNAL 351
DSGKLKEALP+YEKVM K+VFKSELHGLAALQWSICQDSL + +AR MYEKL SHPN
Sbjct: 319 DSGKLKEALPYYEKVMEKIVFKSELHGLAALQWSICQDSLRKTDKARRMYEKLISHPNPG 378
Query: 352 VSKRARQFMFSFQ 364
VSK+ARQFMFSFQ
Sbjct: 379 VSKKARQFMFSFQ 391
>gi|20197392|gb|AAM15057.1| hypothetical protein [Arabidopsis thaliana]
gi|51968582|dbj|BAD42983.1| hypothetical protein [Arabidopsis thaliana]
gi|109134175|gb|ABG25085.1| At2g38780 [Arabidopsis thaliana]
Length = 435
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 211/373 (56%), Positives = 259/373 (69%), Gaps = 21/373 (5%)
Query: 1 MASLQTSWLSSVSTIKNKNDSSAKQKPKLSSLKLSFFLNATSNNDEASEQPISENSPEPE 60
MAS SWLSS T S Q PK SLKLS F ++++ E E
Sbjct: 1 MASFNHSWLSSPLTETPAFFFSPSQHPK--SLKLSLFRTRSNSSSPDRSS-------EVE 51
Query: 61 ASLDPVKLAFEKAKAYRKLKESNSDSKYEQNP-DKDV---ARAAVEKAKEYKKNK----D 112
+DPVKLA +KA+AY+K K + K E+N D+++ +AA++KA +YKK K D
Sbjct: 52 LDVDPVKLALKKAEAYKKSK--SEQKKPEKNAGDEELPLSVKAAMQKANDYKKRKGLGTD 109
Query: 113 TVSFQNGTNSGFKGEGRGN-LPKEVGGKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPV 171
V+ +N+ N + ++ KK+ LK+SSIDFMGL FADKK +GLPAGLVPV
Sbjct: 110 AVAKAKPSNTDQSFVRLANKVVEDNDVKKKELKVSSIDFMGLGFADKKSTRGLPAGLVPV 169
Query: 172 TDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKT 231
D EGD PEVE IVGD ++F E V + + N + YKPKVSTWGVFPRP NISKT
Sbjct: 170 VDYLPEGDLPEVEFIVGDKTRFAEK-VKEVEQEGDGNSDVYKPKVSTWGVFPRPSNISKT 228
Query: 232 FGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLM 291
FGGGRT+RPGD +ETAE R +EE+T++LL AYK+S+GLN+DPKLK ECEKA+ +G+SLM
Sbjct: 229 FGGGRTLRPGDSVETAEERIVREEKTKKLLIAYKESLGLNIDPKLKLECEKAIDEGNSLM 288
Query: 292 DSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNAL 351
DSGKLKEALP+YEKVM K+VFKSELHGLAALQWSICQDSL + +AR MYEKL SHPN
Sbjct: 289 DSGKLKEALPYYEKVMEKIVFKSELHGLAALQWSICQDSLRKTDKARRMYEKLISHPNPG 348
Query: 352 VSKRARQFMFSFQ 364
VSK+ARQFMFSFQ
Sbjct: 349 VSKKARQFMFSFQ 361
>gi|357473415|ref|XP_003606992.1| hypothetical protein MTR_4g071010 [Medicago truncatula]
gi|355508047|gb|AES89189.1| hypothetical protein MTR_4g071010 [Medicago truncatula]
Length = 442
Score = 359 bits (922), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 202/349 (57%), Positives = 252/349 (72%), Gaps = 18/349 (5%)
Query: 29 LSSLKLSFFLNATSNNDEASEQPISENSPEPEASLDPVKLAFEKAKAYRKLKESNSDSKY 88
L + KL +N NN E+SE ++ E AS+DP+KLAF KAKAY++ +SN+D
Sbjct: 27 LKTTKLCCVMN-KENNKESSE--VNSIEAEKTASVDPIKLAFNKAKAYKESIKSNTDLGI 83
Query: 89 EQNP----DKDVA---RAAVEKAKEYKKNKDTVSFQNGTNSGFKGEGRGNLPKEVG---- 137
EQN KDV+ + A+EKAK+YK+NK + T+ G +G + V
Sbjct: 84 EQNSADGGKKDVSVSVKIAMEKAKKYKQNKGVAVSE--TDQGLQGGSESTWGENVNDNSV 141
Query: 138 GKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGEST 197
KK L +S IDF+GL FAD K+ +GLP GLVP +DPF++ D PEVE+IVGDT+ F +T
Sbjct: 142 SKKGELSVSKIDFVGLGFADNKKTRGLPPGLVPFSDPFSDDDLPEVELIVGDTNNFNATT 201
Query: 198 VLRPGPKQ--EENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEE 255
P P+Q ++ E YKPKVSTWGVFPRP NISKT+GGGRTIRPG+VLE E +AAKE
Sbjct: 202 TTAPQPEQTKDDESELYKPKVSTWGVFPRPNNISKTYGGGRTIRPGEVLENEEEKAAKEA 261
Query: 256 RTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSE 315
RT+Q+LAAYKK GL++DPKLK ECE+ALKDGD LMD+GKLK+ALP+YEKV++K+ FKSE
Sbjct: 262 RTKQMLAAYKKKYGLSIDPKLKIECEEALKDGDFLMDAGKLKDALPYYEKVIDKLPFKSE 321
Query: 316 LHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
LHGLAALQWSICQDSL R EAR MYEKLQSHP+ V K+AR FM+SFQ
Sbjct: 322 LHGLAALQWSICQDSLSRHNEARSMYEKLQSHPSVKVGKKARHFMYSFQ 370
>gi|388491228|gb|AFK33680.1| unknown [Medicago truncatula]
Length = 442
Score = 359 bits (922), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 202/349 (57%), Positives = 252/349 (72%), Gaps = 18/349 (5%)
Query: 29 LSSLKLSFFLNATSNNDEASEQPISENSPEPEASLDPVKLAFEKAKAYRKLKESNSDSKY 88
L + KL +N NN E+SE ++ E AS+DP+KLAF KAKAY++ +SN+D
Sbjct: 27 LKTTKLCCVMNK-ENNKESSE--VNSIEAEKTASVDPIKLAFNKAKAYKESIKSNTDLGI 83
Query: 89 EQNP----DKDVA---RAAVEKAKEYKKNKDTVSFQNGTNSGFKGEGRGNLPKEVG---- 137
EQN KDV+ + A+EKAK+YK+NK + T+ G +G + V
Sbjct: 84 EQNSADGGKKDVSVSVKIAMEKAKKYKQNKGVAVSE--TDQGLQGGSESTWGENVNDNSV 141
Query: 138 GKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGEST 197
KK L +S IDF+GL FAD K+ +GLP GLVP +DPF++ D PEVE+IVGDT+ F +T
Sbjct: 142 SKKGELSVSKIDFVGLGFADNKKTRGLPPGLVPFSDPFSDDDLPEVELIVGDTNNFNATT 201
Query: 198 VLRPGPKQ--EENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEE 255
P P+Q ++ E YKPKVSTWGVFPRP NISKT+GGGRTIRPG+VLE E +AAKE
Sbjct: 202 TTAPQPEQTKDDESELYKPKVSTWGVFPRPNNISKTYGGGRTIRPGEVLENEEEKAAKEA 261
Query: 256 RTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSE 315
RT+Q+LAAYKK GL++DPKLK ECE+ALKDGD LMD+GKLK+ALP+YEKV++K+ FKSE
Sbjct: 262 RTKQMLAAYKKKYGLSIDPKLKIECEEALKDGDFLMDAGKLKDALPYYEKVIDKLPFKSE 321
Query: 316 LHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
LHGLAALQWSICQDSL R EAR MYEKLQSHP+ V K+AR FM+SFQ
Sbjct: 322 LHGLAALQWSICQDSLSRHNEARSMYEKLQSHPSVKVGKKARHFMYSFQ 370
>gi|449518929|ref|XP_004166488.1| PREDICTED: uncharacterized LOC101215292 [Cucumis sativus]
Length = 326
Score = 353 bits (905), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 172/243 (70%), Positives = 199/243 (81%), Gaps = 1/243 (0%)
Query: 123 GFKGEGRGNLP-KEVGGKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTP 181
G GNL K+ G KK L ISSIDF+GL FADKK+ +GLPAGLVP++DPF+ D P
Sbjct: 12 GLNDRDGGNLGNKQKGDKKGELSISSIDFIGLGFADKKKSRGLPAGLVPISDPFSVEDLP 71
Query: 182 EVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPG 241
EVEIIVGD+SKF ++TV P QE++ +FYKPKVSTWGVFPRPGNISKTFGGGRTIRPG
Sbjct: 72 EVEIIVGDSSKFDDATVSEIKPTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPG 131
Query: 242 DVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALP 301
DVLET E +A KE RT++L+AAYKK GL +D KLKSECE AL++GDSLM+ GKLKEALP
Sbjct: 132 DVLETDEEKAVKEARTKELIAAYKKKFGLTIDAKLKSECEMALEEGDSLMNDGKLKEALP 191
Query: 302 FYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMF 361
+YE +M K+ F+SELHGLAALQWSICQDSL RP AR MYEKL+SHPN VSK+ARQFMF
Sbjct: 192 YYETIMEKVNFQSELHGLAALQWSICQDSLSRPDVAREMYEKLKSHPNPRVSKKARQFMF 251
Query: 362 SFQ 364
SFQ
Sbjct: 252 SFQ 254
>gi|62320013|dbj|BAD94145.1| hypothetical protein [Arabidopsis thaliana]
Length = 347
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 172/327 (52%), Positives = 215/327 (65%), Gaps = 32/327 (9%)
Query: 1 MASLQTSWLSSVSTIKNKNDSSAKQKPKLSSLKLSFFLNATSNNDEASEQPISENSPEPE 60
MAS SWLSS T S Q PK SLKLS F ++++ + S E E
Sbjct: 31 MASFNHSWLSSPLTETPAFFFSPSQHPK--SLKLSLFRTRSNSS--------PDRSSEVE 80
Query: 61 ASLDPVKLAFEKAKAYRKLKESNSDSKYEQN-PDKDV--------ARAAVEKAKEYKKNK 111
+DPVKLA +KA+AY+K SK EQ P+K+ +AA++KA +YKK K
Sbjct: 81 LDVDPVKLALKKAEAYKK-------SKSEQKKPEKNAGDEELPLSVKAAMQKANDYKKRK 133
Query: 112 ----DTVSFQNGTNSGFKGEGRGN-LPKEVGGKKEGLKISSIDFMGLNFADKKEGKGLPA 166
D V+ +N+ N + ++ KK+ LK+SSIDFMGL FADKK +GLPA
Sbjct: 134 GLGTDAVAKAKPSNTDQSFVRLANKVVEDNDVKKKELKVSSIDFMGLGFADKKSTRGLPA 193
Query: 167 GLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPG 226
GLVPV D EGD PEVE IVGD ++F E V + + N + YKPKVSTWGVFPRP
Sbjct: 194 GLVPVVDYLPEGDLPEVEFIVGDKTRFAEK-VKEVEQEGDGNSDVYKPKVSTWGVFPRPS 252
Query: 227 NISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKD 286
NISKTFGGGRT+RPGD +ETAE R +EE+T++LL AYK+S+GLN+DPKLK ECEKA+ +
Sbjct: 253 NISKTFGGGRTLRPGDSVETAEERIVREEKTKKLLIAYKESLGLNIDPKLKLECEKAIDE 312
Query: 287 GDSLMDSGKLKEALPFYEKVMNKMVFK 313
G+SLMDSGKLKEALP+YEKVM K+VFK
Sbjct: 313 GNSLMDSGKLKEALPYYEKVMEKIVFK 339
>gi|148907087|gb|ABR16687.1| unknown [Picea sitchensis]
Length = 382
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 156/324 (48%), Positives = 208/324 (64%), Gaps = 29/324 (8%)
Query: 65 PVKLAFEKAKAYRKLKESNSDSKYEQNPDKDVARA-----------AVEKAKEYKKNK-- 111
PVK A EKAK Y+K ++ N + P+ + + A+E+ + YKK +
Sbjct: 65 PVKRALEKAKDYKKKQQQNQNDCAPNVPESEFVSSEPKGTGGAAFDALERERAYKKQQGE 124
Query: 112 -------DTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLK---ISSIDFMGLNFADKKEG 161
+TV + T G R P++ + K ISSIDF+GL F++KK
Sbjct: 125 KSYPKEDNTVEVEIITKDGVIRR-RVTPPEQAFSNVKSYKNKGISSIDFIGLEFSEKKPK 183
Query: 162 KGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGV 221
+GLPAGL + + G PEVEII D +K V P P + + YKPKV+TWG+
Sbjct: 184 RGLPAGLNIGLELPSSGTLPEVEIITRDAAK----NVKTP-PGNDNSSGLYKPKVATWGI 238
Query: 222 FPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECE 281
FPRP NISKT+GGG+ IRPGDVLET E +A +E +T++LL YK+ +GLNVDPK+K+ CE
Sbjct: 239 FPRPENISKTYGGGKVIRPGDVLETKEEKAMREAKTKKLLDDYKEKMGLNVDPKVKANCE 298
Query: 282 KALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMY 341
K LK+G+ LM+ GKL+EAL YEK+M +MVF+SELHGLAALQWS+C DS+ R EARIMY
Sbjct: 299 KVLKEGNELMEIGKLREALVLYEKIMEEMVFQSELHGLAALQWSVCLDSISRSDEARIMY 358
Query: 342 EKLQSHPNALVSKRARQFMFSFQV 365
EKL+SHPN + K+A+Q +FSFQV
Sbjct: 359 EKLRSHPNGSIRKKAKQIIFSFQV 382
>gi|224284621|gb|ACN40043.1| unknown [Picea sitchensis]
Length = 455
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 155/323 (47%), Positives = 208/323 (64%), Gaps = 29/323 (8%)
Query: 65 PVKLAFEKAKAYRKLKESNSDSKYEQNPDKDVARA-----------AVEKAKEYKKNK-- 111
PVK A EKAK Y+K ++ N + P+ + + A+E+A+ YKK +
Sbjct: 65 PVKRALEKAKDYKKKQQQNQNDCAPNVPESEFVSSEPKGTGGAAFDALERARAYKKQQGE 124
Query: 112 -------DTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLK---ISSIDFMGLNFADKKEG 161
+TV + T G R P++ + K ISSIDF+GL F++KK
Sbjct: 125 KSYPKEDNTVEVEIITKDGVIRR-RVTPPEQAFSNVKSYKNKGISSIDFIGLEFSEKKPK 183
Query: 162 KGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGV 221
+GLPAGL + + G PEVEII D +K V P P + + YKPKV+TWG+
Sbjct: 184 RGLPAGLNIGLELPSSGTLPEVEIITRDAAK----NVKTP-PGNDNSSGLYKPKVATWGI 238
Query: 222 FPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECE 281
FPRP NISKT+GGG+ IRPGDVLET E +A +E +T++LL YK+ +GLNVDPK+K+ CE
Sbjct: 239 FPRPENISKTYGGGKVIRPGDVLETKEEKAMREAKTKKLLDDYKEKMGLNVDPKVKANCE 298
Query: 282 KALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMY 341
K LK+G+ LM+ GKL+EA+ YEK+M +MVF+SELHGLAALQWS+C DS+ R EARIMY
Sbjct: 299 KVLKEGNELMEIGKLREAIVLYEKIMEEMVFQSELHGLAALQWSVCLDSISRSDEARIMY 358
Query: 342 EKLQSHPNALVSKRARQFMFSFQ 364
EKL+SHPN + K+A+Q +FSFQ
Sbjct: 359 EKLRSHPNGSIRKKAKQIIFSFQ 381
>gi|148906442|gb|ABR16374.1| unknown [Picea sitchensis]
Length = 480
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 155/348 (44%), Positives = 207/348 (59%), Gaps = 54/348 (15%)
Query: 65 PVKLAFEKAKAYRKLKESNSDSKYEQNPDKDVARA-----------AVEKAKEYKKNK-- 111
PVK A EKAK Y+K ++ N + P+ + + A+E+ + YKK +
Sbjct: 65 PVKRALEKAKDYKKKQQQNQNDCAPNVPESEFVSSEPKGTGGAAFDALERERAYKKQQGE 124
Query: 112 -------DTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLK---ISSIDFMGLNFADKKEG 161
+TV + T G R P++ + K ISSIDF+GL F++KK
Sbjct: 125 KSYPKEDNTVEVEIITKDGVIRR-RVTPPEQAFSNVKSYKNKGISSIDFIGLEFSEKKPK 183
Query: 162 KGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGV 221
+GLPAGL + + G PEVEII D +K V P P + + YKPKV+TWG+
Sbjct: 184 RGLPAGLNIGLELPSSGTLPEVEIITRDAAK----NVKTP-PGNDNSSGLYKPKVATWGI 238
Query: 222 FPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECE 281
FPRP NISKT+GGG+ IRPGDVLET E +A +E +T++LL YK+ +GLNVDPK+K+ CE
Sbjct: 239 FPRPENISKTYGGGKVIRPGDVLETKEEKAMREAKTKKLLDDYKEKMGLNVDPKVKANCE 298
Query: 282 KALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHR-------- 333
K LK+G+ LM+ GKL+EAL YEK+M +MVF+SELHGLAALQWS+C DS+ R
Sbjct: 299 KVLKEGNELMEIGKLREALVLYEKIMEEMVFQSELHGLAALQWSVCLDSISRLVTDLKGY 358
Query: 334 -----------------PKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
EARIMYEKL+SHPN + K+A+Q +FSFQ
Sbjct: 359 AAQKHIGGFPTHFYLVGSDEARIMYEKLRSHPNGSIRKKAKQIIFSFQ 406
>gi|357133904|ref|XP_003568561.1| PREDICTED: uncharacterized protein LOC100843051 [Brachypodium
distachyon]
Length = 483
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 151/353 (42%), Positives = 203/353 (57%), Gaps = 58/353 (16%)
Query: 64 DPVKLAFEKAKAYRKLKESNSDSKYEQNPD--KDVARAAVEKAKEYKKN----------- 110
DPVKLAF +A AY+K K + + K+ ++ A EKA EY+
Sbjct: 69 DPVKLAFARAAAYKKEKANPTPPPPPPPQTPAKESSKGAFEKALEYRNGDGGGLGGGSAF 128
Query: 111 -KDTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLKISSIDFMGLNFADKKEGKGLPAGLV 169
K + +F G N+ +G + KK DF+GL+F +KK +G P GL
Sbjct: 129 LKASPTF--GQNTFASKDGAFG---KAANKKGEYVYDETDFLGLDFFEKKRYQGPPPGLS 183
Query: 170 PVTDPFAEGDTPEVEIIVGDTSKFGES--------------------------------- 196
P DP D PEVEI++GD SKFG+S
Sbjct: 184 PAVDPSPNEDFPEVEIVIGDPSKFGKSRRSTENQPVDDNESDETSRSRIIEQNEDGKVEE 243
Query: 197 ----TVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAA 252
+V+ P ++++N EFYKP+V++WG+FPRP NISK +GGGR I G ++AE +AA
Sbjct: 244 TTPSSVIEP--EEDKNSEFYKPRVTSWGMFPRPQNISKAYGGGRNISLGGEKQSAEEKAA 301
Query: 253 KEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVF 312
K++RTR+LLAAY +D K K+EC +ALK GD LM++G+LK+ALP+YEKVM F
Sbjct: 302 KDKRTRELLAAYMSGQNKTMDAKTKAECTEALKKGDELMNAGRLKQALPYYEKVMQAADF 361
Query: 313 KSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQV 365
K+ELHG+AALQWSIC DSL R KEA MY KL+ HPN L++K+A+ FMFSFQ
Sbjct: 362 KTELHGMAALQWSICLDSLCRSKEAMSMYSKLKYHPNDLINKKAKMFMFSFQA 414
>gi|222631415|gb|EEE63547.1| hypothetical protein OsJ_18363 [Oryza sativa Japonica Group]
Length = 495
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 132/260 (50%), Positives = 173/260 (66%), Gaps = 33/260 (12%)
Query: 139 KKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGE--- 195
KKE + DF+GL+F +KK KG P GL P DPF + D PEVEII+GD SKFG+
Sbjct: 167 KKEEYEYDETDFLGLDFFEKKSYKGPPPGLAPAADPFPDKDFPEVEIIIGDPSKFGKTHR 226
Query: 196 STVLRPG----PKQ--------------------------EENLEFYKPKVSTWGVFPRP 225
ST ++P P++ EE+ + YKP V +WG+FPRP
Sbjct: 227 STEVQPADDSEPEETSRSTTEEKKEENKPDETPPSTVTEPEEDEDEYKPTVRSWGMFPRP 286
Query: 226 GNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALK 285
NISK +GGGR IR G +AE +AAK++RT++L+AAY+ S + VD K K+EC +ALK
Sbjct: 287 QNISKAYGGGRNIRLGGETRSAEEKAAKDKRTKELIAAYRNSQNMIVDAKTKAECTEALK 346
Query: 286 DGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQ 345
+GD LM++G+LK+ALP+YEKVM + FK+ELHG+AALQWSIC DSL R KEA MY KL+
Sbjct: 347 EGDELMNTGRLKQALPYYEKVMQAVDFKTELHGMAALQWSICLDSLCRSKEAMSMYSKLK 406
Query: 346 SHPNALVSKRARQFMFSFQV 365
+HPN+ +SK+A FMFSFQ
Sbjct: 407 NHPNSEISKKANMFMFSFQA 426
>gi|326489284|dbj|BAK01625.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528243|dbj|BAJ93303.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 497
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 153/359 (42%), Positives = 198/359 (55%), Gaps = 59/359 (16%)
Query: 64 DPVKLAFEKAKAYRKLKESNSDSKYEQNPDKDVARAAVEKAKEYKKNKDT----VSFQNG 119
DPVKLAF +A AY+K + + + P AKE +K+ + ++NG
Sbjct: 72 DPVKLAFARAAAYKKDRANPTPKPPPPPPPPSPPPPPQPSAKESVGSKEAFERALEYRNG 131
Query: 120 TNSGFKGE----------GRGNLPKEVGG------KKEGLKISSIDFMGLNFADKKEGKG 163
G G+ + G KK G DF+GL F +KK +G
Sbjct: 132 NGGGLDAGSALLNPSPPFGQSTFTNKEGAFGKMAKKKGGYAYDETDFLGLGFFEKKRYQG 191
Query: 164 LPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGES--------------------------- 196
P GL DPF+ D PEVEI++GD SKFG+S
Sbjct: 192 PPPGLSQGIDPFSNEDFPEVEIVIGDPSKFGKSRRSTENQPVDDSESEETSRSKSSKQNE 251
Query: 197 ----------TVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLET 246
TV+ P + +EN E YKP+V+TWG+FPRP NISK +GGGR I G ++
Sbjct: 252 DGKVEETPLSTVIEP--EDDENSESYKPRVTTWGMFPRPQNISKAYGGGRNISLGGETQS 309
Query: 247 AEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKV 306
AE +AAK++RTR+LLAAY +D K K+EC KALK+GD LM++G+LK+ALP+YEKV
Sbjct: 310 AEEKAAKDKRTRELLAAYIGGKNKTLDAKTKAECTKALKEGDELMNAGRLKQALPYYEKV 369
Query: 307 MNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQV 365
M FK+ELHG+AALQWSIC DSL R KEA MY KL+ HPN VSK+A+ FMFSFQ
Sbjct: 370 MQAADFKTELHGMAALQWSICLDSLCRSKEAMGMYSKLKYHPNDRVSKKAKMFMFSFQA 428
>gi|326495948|dbj|BAJ90596.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 497
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 153/359 (42%), Positives = 198/359 (55%), Gaps = 59/359 (16%)
Query: 64 DPVKLAFEKAKAYRKLKESNSDSKYEQNPDKDVARAAVEKAKEYKKNKDT----VSFQNG 119
DPVKLAF +A AY+K + + + P AKE +K+ + ++NG
Sbjct: 72 DPVKLAFARAAAYKKDRANPTPKPPPPPPPPSPPPPPQPSAKESVGSKEAFERALEYRNG 131
Query: 120 TNSGFKGE----------GRGNLPKEVGG------KKEGLKISSIDFMGLNFADKKEGKG 163
G G+ + G KK G DF+GL F +KK +G
Sbjct: 132 NGGGLDAGSALLNPSPPFGQSTFTNKEGAFGKMAKKKGGYAYDETDFLGLGFFEKKRYQG 191
Query: 164 LPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGES--------------------------- 196
P GL DPF+ D PEVEI++GD SKFG+S
Sbjct: 192 PPPGLSQGIDPFSNEDFPEVEIVIGDPSKFGKSRRSTENQPVDDSESEETSRSKSSKQNE 251
Query: 197 ----------TVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLET 246
TV+ P + +EN E YKP+V+TWG+FPRP NISK +GGGR I G ++
Sbjct: 252 DGKVEETPLSTVIEP--EDDENSESYKPRVTTWGMFPRPQNISKAYGGGRNISLGGETQS 309
Query: 247 AEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKV 306
AE +AAK++RTR+LLAAY +D K K+EC KALK+GD LM++G+LK+ALP+YEKV
Sbjct: 310 AEEKAAKDKRTRELLAAYIGGKNKTLDAKTKAECTKALKEGDELMNAGRLKQALPYYEKV 369
Query: 307 MNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQV 365
M FK+ELHG+AALQWSIC DSL R KEA MY KL+ HPN VSK+A+ FMFSFQ
Sbjct: 370 MQAADFKTELHGMAALQWSICLDSLCRSKEAMGMYSKLKYHPNDRVSKKAKMFMFSFQA 428
>gi|218196697|gb|EEC79124.1| hypothetical protein OsI_19769 [Oryza sativa Indica Group]
Length = 353
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 132/260 (50%), Positives = 173/260 (66%), Gaps = 33/260 (12%)
Query: 139 KKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGE--- 195
KKE + DF+GL+F +KK KG P GL P DPF + D PEVEII+GD SKFG+
Sbjct: 25 KKEEYEYDETDFLGLDFFEKKSYKGPPPGLAPAADPFPDKDFPEVEIIIGDPSKFGKTHR 84
Query: 196 STVLRPG----PKQ--------------------------EENLEFYKPKVSTWGVFPRP 225
ST ++P P++ EE+ + YKP V +WG+FPRP
Sbjct: 85 STEVQPADDSEPEETSRSTTEEKKEENKPDETPPSTVTEPEEDEDEYKPTVRSWGMFPRP 144
Query: 226 GNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALK 285
NISK +GGGR IR G +AE +AAK++RT++L+AAY+ S + VD K K+EC +ALK
Sbjct: 145 QNISKAYGGGRNIRLGGETRSAEEKAAKDKRTKELIAAYRNSQNMIVDAKTKAECTEALK 204
Query: 286 DGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQ 345
+GD LM++G+LK+ALP+YEKVM + FK+ELHG+AALQWSIC DSL R KEA MY KL+
Sbjct: 205 EGDELMNTGRLKQALPYYEKVMQAVDFKTELHGMAALQWSICLDSLCRSKEAMSMYSKLK 264
Query: 346 SHPNALVSKRARQFMFSFQV 365
+HPN+ +SK+A FMFSFQ
Sbjct: 265 NHPNSEISKKANMFMFSFQA 284
>gi|25956270|dbj|BAC41325.1| hypothetical protein [Lotus japonicus]
Length = 366
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 168/360 (46%), Positives = 210/360 (58%), Gaps = 60/360 (16%)
Query: 1 MASLQTSWLSSVSTIKN------KNDSSAKQKPK-LSSLKLSFFLNATSNNDEASEQPIS 53
MASL++ WL + + KN N S P ++ KL L +SNN E+SE
Sbjct: 1 MASLRSPWLRCIVSPKNGAIFPSTNVSVCMCLPSSTTTFKLCCAL--SSNNAESSEP--- 55
Query: 54 ENSPEPEAS-LDPVKLAFEKAKAYRKLK-ESNSDSKYEQNPDKDVARAAVEKAKEYKKNK 111
SPE + +DPVKLAF KAK Y++ +SN S EQ
Sbjct: 56 -TSPEAQTGPVDPVKLAFNKAKTYKEESGKSNPGSGIEQ--------------------- 93
Query: 112 DTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLKISSIDFMGLNFADK-KEGKGLPAGLVP 170
S NG NS +G+ V G+++ L +S + + A K K+ KG+ A
Sbjct: 94 ---SDANGENSVNQGDA-------VDGRQKDLPVSLK--IAMEKAKKYKQNKGV-AIAET 140
Query: 171 VTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISK 230
D F++GD EVE IVGD +KF T P +EE E YKP VSTWGVFPRPGNISK
Sbjct: 141 TQDSFSDGDLTEVEFIVGDATKFDAVTDPEPEQTKEEESELYKPTVSTWGVFPRPGNISK 200
Query: 231 TFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSL 290
TFGGGR IRPG+VLET E +A +E RT+QLLAAYKK GL++DPKLKSECE+ALKDGD L
Sbjct: 201 TFGGGRVIRPGEVLETDEEKAMREVRTKQLLAAYKKRTGLDIDPKLKSECEEALKDGDLL 260
Query: 291 MDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNA 350
M++GKLKEALP+YEKVM+K+ FK H L + R EA+ MYEKL+SHPNA
Sbjct: 261 MNAGKLKEALPYYEKVMDKITFKISKHILM----------IDRSNEAQSMYEKLKSHPNA 310
>gi|413948948|gb|AFW81597.1| hypothetical protein ZEAMMB73_414807 [Zea mays]
Length = 472
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 124/260 (47%), Positives = 171/260 (65%), Gaps = 32/260 (12%)
Query: 138 GKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGES- 196
GK+ G + DF+GL+F +KK KGLP GL P +P + D PEV+IIVGD SKF +S
Sbjct: 169 GKRGGYEFDEADFLGLDFFEKKRSKGLPPGLAPAFEPLRDDDFPEVDIIVGDPSKFDKSP 228
Query: 197 --TVLRP------------------------GP-----KQEENLEFYKPKVSTWGVFPRP 225
T ++P P + EE+ + Y+P V +WG+FPRP
Sbjct: 229 RRTEIQPVDNGESGEASQSTTKPNEADKAGIAPPSTVIETEEDEDVYRPTVRSWGMFPRP 288
Query: 226 GNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALK 285
NISK +GGGR IR G ++AE +AAK++RT++L+AAY+ + +D K K+EC KAL+
Sbjct: 289 QNISKAYGGGRNIRLGGETQSAEEKAAKDKRTKELIAAYRNRQNMVIDAKTKAECIKALR 348
Query: 286 DGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQ 345
+GD +M++G+LK+ALP+YEKVM+ + FK+E HG AALQWSIC DSL R KEA MY KL+
Sbjct: 349 EGDEMMNTGRLKQALPYYEKVMDAVDFKTEFHGRAALQWSICLDSLCRSKEAMSMYSKLK 408
Query: 346 SHPNALVSKRARQFMFSFQV 365
+HPN+ +SK+A F+FSFQ
Sbjct: 409 NHPNSEISKKANMFVFSFQA 428
>gi|413948947|gb|AFW81596.1| hypothetical protein ZEAMMB73_414807 [Zea mays]
Length = 497
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 124/259 (47%), Positives = 171/259 (66%), Gaps = 32/259 (12%)
Query: 138 GKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGES- 196
GK+ G + DF+GL+F +KK KGLP GL P +P + D PEV+IIVGD SKF +S
Sbjct: 169 GKRGGYEFDEADFLGLDFFEKKRSKGLPPGLAPAFEPLRDDDFPEVDIIVGDPSKFDKSP 228
Query: 197 --TVLRP------------------------GP-----KQEENLEFYKPKVSTWGVFPRP 225
T ++P P + EE+ + Y+P V +WG+FPRP
Sbjct: 229 RRTEIQPVDNGESGEASQSTTKPNEADKAGIAPPSTVIETEEDEDVYRPTVRSWGMFPRP 288
Query: 226 GNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALK 285
NISK +GGGR IR G ++AE +AAK++RT++L+AAY+ + +D K K+EC KAL+
Sbjct: 289 QNISKAYGGGRNIRLGGETQSAEEKAAKDKRTKELIAAYRNRQNMVIDAKTKAECIKALR 348
Query: 286 DGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQ 345
+GD +M++G+LK+ALP+YEKVM+ + FK+E HG AALQWSIC DSL R KEA MY KL+
Sbjct: 349 EGDEMMNTGRLKQALPYYEKVMDAVDFKTEFHGRAALQWSICLDSLCRSKEAMSMYSKLK 408
Query: 346 SHPNALVSKRARQFMFSFQ 364
+HPN+ +SK+A F+FSFQ
Sbjct: 409 NHPNSEISKKANMFVFSFQ 427
>gi|242087761|ref|XP_002439713.1| hypothetical protein SORBIDRAFT_09g018870 [Sorghum bicolor]
gi|241944998|gb|EES18143.1| hypothetical protein SORBIDRAFT_09g018870 [Sorghum bicolor]
Length = 488
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 148/359 (41%), Positives = 205/359 (57%), Gaps = 64/359 (17%)
Query: 64 DPVKLAFEKAKAYRKLKESNSDSKYEQNPDKDV-------------ARAAVEKAKEYKKN 110
DPVKLAF +A AY+K ++S S + P + ++ A ++A EY+
Sbjct: 66 DPVKLAFARAAAYKKERDSPSPAPPPAPPPQSPPPSQPQASVVEPGSKEAFKRALEYRNG 125
Query: 111 KDTVSFQN---------GTNSGFKGEGRGNLPKEVG-GKKEGLKISSIDFMGLNFADKKE 160
+ G + F G+ L +EV GKK + DF+GL+F +KK
Sbjct: 126 NGAGAGAAGGGGDSPLLGGSPDF-GQN-ALLSEEVSFGKKGAYEFDETDFLGLDFFEKKR 183
Query: 161 GKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGES------------------------ 196
+GLP GL P +P + D PEVEIIVGD SKF +S
Sbjct: 184 SRGLPPGLAPAFEPLRDDDFPEVEIIVGDPSKFEKSPRPTEIQPVDDTESEETSQSSTTK 243
Query: 197 -----------TVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLE 245
TVL P EE+ + Y+P V +WG+FPRP NISK +GGGR IR G
Sbjct: 244 PNESDKVAPPSTVLEP----EEDEDVYRPTVRSWGMFPRPQNISKAYGGGRNIRLGGETL 299
Query: 246 TAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEK 305
+AE +AAK++RT++L+AAY+ + +D K K+EC +AL++GD +M++G+LK+ALP+YEK
Sbjct: 300 SAEEKAAKDKRTKELVAAYRNRQNIVIDAKTKAECIEALREGDEMMNTGRLKQALPYYEK 359
Query: 306 VMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
VM+ + FK+ELHG AALQWSIC DSL R KEA MY KL++HPN + K+A F+FSFQ
Sbjct: 360 VMDAVDFKTELHGKAALQWSICLDSLCRSKEAMSMYSKLKNHPNIEIRKKANMFVFSFQ 418
>gi|302787923|ref|XP_002975731.1| hypothetical protein SELMODRAFT_103783 [Selaginella moellendorffii]
gi|300156732|gb|EFJ23360.1| hypothetical protein SELMODRAFT_103783 [Selaginella moellendorffii]
Length = 337
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 109/220 (49%), Positives = 157/220 (71%), Gaps = 18/220 (8%)
Query: 148 IDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEII---VGDTSKFGESTVLRPGPK 204
++F+GL+F++KK+ G PAGL+ + G PEVEII V D SK G+
Sbjct: 53 MEFVGLDFSEKKK-PGRPAGLIAEFEAPPPGQLPEVEIITRDVEDASKAGD--------- 102
Query: 205 QEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAY 264
+ YKPKVSTWGVFPRP NISKTFGGGRTI+PG+ LET E + A+E +T++LLA Y
Sbjct: 103 -----DLYKPKVSTWGVFPRPPNISKTFGGGRTIKPGEELETDEEKKARETKTKELLAEY 157
Query: 265 KKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQW 324
++ +GL++DP++ SEC++ + + L+ SGKL++A+ +E VM KMVF+SELHG AAL
Sbjct: 158 RRKMGLDIDPRVSSECQEIMSKAEGLLKSGKLRDAVQNFEIVMEKMVFQSELHGQAALSK 217
Query: 325 SICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
++C +S++R +EAR++Y+KL SH + V KRA+Q +FSF+
Sbjct: 218 ALCLESMNRSREARVIYDKLVSHSSQSVKKRAKQLIFSFE 257
>gi|302783807|ref|XP_002973676.1| hypothetical protein SELMODRAFT_54354 [Selaginella moellendorffii]
gi|300158714|gb|EFJ25336.1| hypothetical protein SELMODRAFT_54354 [Selaginella moellendorffii]
Length = 284
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 109/220 (49%), Positives = 157/220 (71%), Gaps = 18/220 (8%)
Query: 148 IDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEII---VGDTSKFGESTVLRPGPK 204
++F+GL+F++KK+ G PAGL+ + G PEVEII V D SK G+
Sbjct: 7 MEFVGLDFSEKKK-PGRPAGLIAEFEAPPPGQLPEVEIITRDVEDASKSGD--------- 56
Query: 205 QEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAY 264
+ YKPKVSTWGVFPRP NISKTFGGGRTI+PG+ LET E + A+E +T++LLA Y
Sbjct: 57 -----DLYKPKVSTWGVFPRPPNISKTFGGGRTIKPGEELETDEEKKAREAKTKELLAEY 111
Query: 265 KKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQW 324
++ +GL++DP++ SEC++ + + L+ SGKL++A+ +E VM KMVF+SELHG AAL
Sbjct: 112 RRKMGLDIDPRVSSECQEIMSKAEGLLKSGKLRDAVQNFEIVMEKMVFQSELHGQAALSK 171
Query: 325 SICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
++C +S++R +EAR++Y+KL SH + V KRA+Q +FSF+
Sbjct: 172 ALCLESMNRSREARVIYDKLVSHSSQSVKKRAKQLIFSFE 211
>gi|297742692|emb|CBI35145.3| unnamed protein product [Vitis vinifera]
Length = 205
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 108/170 (63%), Positives = 129/170 (75%), Gaps = 1/170 (0%)
Query: 163 GLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVF 222
GL AGLVPV+DPF EGD PEVE+IV DTSKF ++T P P Q++N + YK KVSTWGVF
Sbjct: 17 GLLAGLVPVSDPFPEGDLPEVELIVEDTSKFEDTTSSNPKPTQDDNSDLYKLKVSTWGVF 76
Query: 223 PRPGNISKTFGGGRTIRPGDVLETAEARAA-KEERTRQLLAAYKKSVGLNVDPKLKSECE 281
PRP NISKT G + E + K ++ +LLA YK +GLN+DPKLKSEC+
Sbjct: 77 PRPSNISKTQAQVSANTLGKKIGARERKEKNKVKKHLKLLATYKTKIGLNIDPKLKSECD 136
Query: 282 KALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSL 331
+ALKDGDSLM+ GKLKEALP+YEKVM+K+ F+SELHGLAALQWSIC DSL
Sbjct: 137 QALKDGDSLMNIGKLKEALPYYEKVMDKLAFQSELHGLAALQWSICLDSL 186
>gi|168066316|ref|XP_001785086.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663346|gb|EDQ50115.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 465
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 144/324 (44%), Positives = 192/324 (59%), Gaps = 24/324 (7%)
Query: 51 PISENSPEPEASLDPVKLAFEKAKAYRKLKESNSDSKYEQNPDKDVARAAVEKAKEYKKN 110
P SEN+ + E SL +L+FEKAKAY KL++S + N K A ++ K+
Sbjct: 92 PKSENN-QMEGSL-AAQLSFEKAKAY-KLQQSEMIAAL--NEKKPEPAAPPPIVEDEDKD 146
Query: 111 KDTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLK---ISSIDFMGLNFADKKEGKGLPAG 167
K+ V + T G + R P+ + K +S++DF+GL FADKK PAG
Sbjct: 147 KNVVEVEIYTRDGIV-KRRVLKPETAFANVKNYKAKGVSTMDFVGLGFADKK-STNRPAG 204
Query: 168 LVPVTDPFA--EGDTPEVEIIVGDTSKFGESTVLRP-GPKQEENLEFYKPKVSTWGVFPR 224
L ++ F G PEVE++ D V P EN YKP+V+TWG+FPR
Sbjct: 205 L---SESFEAPSGPLPEVEMLTRDAG------VADPEATADSENA--YKPRVATWGMFPR 253
Query: 225 PGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKAL 284
P +ISK +GGGR I+PG LET E R A+E +T++LL YKK VGLNV PK+K+ CE +
Sbjct: 254 PADISKAYGGGRNIKPGQKLETEEERIARETKTKKLLDDYKKKVGLNVSPKVKAMCEMNM 313
Query: 285 KDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKL 344
K G +LMD G+L EA + VM +M F SELHG+AALQ ++C DS +R EA++ YEKL
Sbjct: 314 KQGQNLMDRGRLGEAQAAFLSVMEEMPFPSELHGMAALQEAVCLDSSNRSDEAKVKYEKL 373
Query: 345 QSHPNALVSKRARQFMFSFQVYVK 368
SHPN V K+ARQ +F FQ VK
Sbjct: 374 VSHPNGAVRKKARQLLFGFQAAVK 397
>gi|297825963|ref|XP_002880864.1| hypothetical protein ARALYDRAFT_344420 [Arabidopsis lyrata subsp.
lyrata]
gi|297326703|gb|EFH57123.1| hypothetical protein ARALYDRAFT_344420 [Arabidopsis lyrata subsp.
lyrata]
Length = 199
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 89/134 (66%), Positives = 106/134 (79%), Gaps = 13/134 (9%)
Query: 231 TFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSL 290
FGGGRT+RPGD +ETAE R +E++T++ L AYK+S+GLN+DPKLK ECEKA+ +G+SL
Sbjct: 10 AFGGGRTLRPGDSVETAEERIVREKQTKKFLIAYKESIGLNIDPKLKLECEKAIDEGNSL 69
Query: 291 MDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNA 350
MDSG+LKEALP+YE KSELHGLAALQWSICQDSL + MYEKL SHPN
Sbjct: 70 MDSGRLKEALPYYE--------KSELHGLAALQWSICQDSLRKR-----MYEKLLSHPNP 116
Query: 351 LVSKRARQFMFSFQ 364
VSK+ARQ MFSFQ
Sbjct: 117 RVSKKARQLMFSFQ 130
>gi|307107904|gb|EFN56145.1| hypothetical protein CHLNCDRAFT_144810 [Chlorella variabilis]
Length = 1259
Score = 142 bits (359), Expect = 2e-31, Method: Composition-based stats.
Identities = 67/161 (41%), Positives = 108/161 (67%)
Query: 204 KQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAA 263
++E+ + +KPKV+TWGVFPRP NIS+ +GGGRT+RPG LE E +EER ++ +
Sbjct: 318 EEEKGEDLHKPKVATWGVFPRPRNISEAYGGGRTLRPGQALEPQEQAVEREERIKRSMQE 377
Query: 264 YKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQ 323
Y+KS+GL+VDP+++++ + +G +L D G + ALP + M +S L G A LQ
Sbjct: 378 YRKSLGLDVDPEVEAQAQALCDEGTALFDQGLISLALPKFAAASELMALRSRLGGEARLQ 437
Query: 324 WSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
+IC DSL + +EA +Y+ +Q+HP V+K+A++ +F ++
Sbjct: 438 QAICLDSLGQNQEAYDIYKNIQTHPAPGVAKKAKRMLFGWK 478
>gi|384248837|gb|EIE22320.1| hypothetical protein COCSUDRAFT_42658 [Coccomyxa subellipsoidea
C-169]
Length = 338
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 100/152 (65%)
Query: 212 YKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLN 271
Y+PKV+TWGVFPRP NIS+ +GGGRTIRPG+ LE+ A+++R + L++YKK+ GL
Sbjct: 113 YRPKVATWGVFPRPSNISREYGGGRTIRPGEALESEAETLARKKRVAEALSSYKKNAGLE 172
Query: 272 VDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSL 331
VDP ++ ++A G LM G+L AL +++V+ + ++ + G LQ +IC DSL
Sbjct: 173 VDPAAEASAKEAYDRGVQLMQRGQLAAALVPFDEVLTFVSMRTRVGGETTLQKAICLDSL 232
Query: 332 HRPKEARIMYEKLQSHPNALVSKRARQFMFSF 363
R EA +Y+++ H VSK+A+Q +F F
Sbjct: 233 GRQDEAMALYKQISRHSAPGVSKKAKQMLFGF 264
>gi|159482192|ref|XP_001699157.1| hypothetical protein CHLREDRAFT_152235 [Chlamydomonas reinhardtii]
gi|158273220|gb|EDO99012.1| predicted protein [Chlamydomonas reinhardtii]
Length = 361
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/180 (39%), Positives = 106/180 (58%), Gaps = 13/180 (7%)
Query: 185 IIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVL 244
++V D++ GE V YKPKVSTWG+FPRP +IS+ +GGGR ++PG L
Sbjct: 123 VLVEDSASGGEEDV-------------YKPKVSTWGMFPRPKDISEAYGGGRNLKPGQEL 169
Query: 245 ETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYE 304
ET + +A +E LAAYK GL VDP+ + + K ++G GKLK A +E
Sbjct: 170 ETPKQKAERERSYAAALAAYKFKAGLEVDPEDEEKAAKVYEEGMEFFADGKLKAAYDKFE 229
Query: 305 KVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
K + + K++ GLA LQ +I DS+ + A+ +Y+ + +H A VSK+A+Q +FSF+
Sbjct: 230 KAIALVPVKTKYGGLATLQKAIVLDSVGNHEAAQKLYKSIATHGVAQVSKKAKQMLFSFE 289
>gi|302838913|ref|XP_002951014.1| hypothetical protein VOLCADRAFT_117762 [Volvox carteri f.
nagariensis]
gi|300263709|gb|EFJ47908.1| hypothetical protein VOLCADRAFT_117762 [Volvox carteri f.
nagariensis]
Length = 529
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 101/156 (64%), Gaps = 1/156 (0%)
Query: 210 EFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVG 269
+ Y PKV+TWG+FPRP +ISK +GGGR I+PG LETAE RA +E LAAYK G
Sbjct: 302 DMYTPKVATWGIFPRPRDISKAYGGGRNIKPGQELETAEQRAERERNYAAALAAYKARAG 361
Query: 270 LNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELH-GLAALQWSICQ 328
L VDP+ + + ++G L +G+LK + +EKV+ + K++ + GLA LQ +I
Sbjct: 362 LEVDPEEEQRAAQLYEEGMQLFSAGQLKASYDKFEKVLAAVPVKTKQYGGLATLQKAIVL 421
Query: 329 DSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
DS+ + + A+ +Y+ + +H A VSK+A+Q +F F+
Sbjct: 422 DSVGQSEAAQKLYKSIANHGVAQVSKKAKQMLFGFE 457
>gi|297813419|ref|XP_002874593.1| hypothetical protein ARALYDRAFT_911258 [Arabidopsis lyrata subsp.
lyrata]
gi|297320430|gb|EFH50852.1| hypothetical protein ARALYDRAFT_911258 [Arabidopsis lyrata subsp.
lyrata]
Length = 130
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/90 (62%), Positives = 65/90 (72%), Gaps = 5/90 (5%)
Query: 145 ISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFG--ESTVLRPG 202
+SSIDF+GL FADKK +GLPAGLVPV D +GD PEVE+IVGD ++F E V + G
Sbjct: 39 LSSIDFIGLGFADKKSTRGLPAGLVPVVDYLPQGDLPEVELIVGDQTRFAKKEKEVEQEG 98
Query: 203 PKQEENLEFYKPKVSTWGVFPRPGNISKTF 232
+ N YKPKVSTWGVFPRP NISKT
Sbjct: 99 ---DGNSHVYKPKVSTWGVFPRPSNISKTL 125
>gi|303284803|ref|XP_003061692.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457022|gb|EEH54322.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 472
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 88/160 (55%), Gaps = 3/160 (1%)
Query: 212 YKPKVSTWGVFPRPGNISKTFGGGRTIRPGD-VLETAEARAAKEERTRQLLAAYKKSVGL 270
YKP VSTWGVFPRP NIS+ +GGG+TI+ G+ V ET E + A+ +R R L ++ + G+
Sbjct: 244 YKPSVSTWGVFPRPDNISEAYGGGKTIKAGEFVPETEEEKEARRKRVRDKLKQFRSNAGI 303
Query: 271 NVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKV--MNKMVFKSELHGLAALQWSICQ 328
V + AL + SLM GKL+ + E + + + K E+ G +++C
Sbjct: 304 EVSNGTIVRWQAALTEAKSLMSQGKLQASYELLEPIVKVENISPKLEIGGDMIFNYAVCL 363
Query: 329 DSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQVYVK 368
DS R +EA MY++ P+ VSK A + ++ K
Sbjct: 364 DSCGRREEALEMYKRCVGCPHGKVSKMADRMIWGMTTASK 403
>gi|449016632|dbj|BAM80034.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 327
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 86/154 (55%), Gaps = 1/154 (0%)
Query: 212 YKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLN 271
YKP+VSTWGVFPRP ++S+TFGGGR+I G V +A +E+R R LL Y+ N
Sbjct: 98 YKPRVSTWGVFPRPLDVSQTFGGGRSIPIGGVSVPDDAAQKREQRLRVLLERYRMKRNEN 157
Query: 272 VDP-KLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDS 330
+ + +S AL+ + ++ +GK + AL V + EL LQ ++CQD+
Sbjct: 158 AETEEERSALSAALESAERMLQAGKYESALEQLTAVRPLARVRGELGSRVLLQVALCQDA 217
Query: 331 LHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
L ++AR +Y +LQ V A+Q +SFQ
Sbjct: 218 LGSCEDARKLYGELQHSSEPNVRDAAKQLFYSFQ 251
>gi|115463599|ref|NP_001055399.1| Os05g0381500 [Oryza sativa Japonica Group]
gi|47777423|gb|AAT38057.1| unknown protein [Oryza sativa Japonica Group]
gi|113578950|dbj|BAF17313.1| Os05g0381500 [Oryza sativa Japonica Group]
Length = 301
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 52/126 (41%), Positives = 67/126 (53%), Gaps = 33/126 (26%)
Query: 138 GKKEGLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGE-- 195
KKE + DF+GL+F +KK KG P GL P DPF + D PEVEII+GD SKFG+
Sbjct: 166 NKKEEYEYDETDFLGLDFFEKKSYKGPPPGLAPAADPFPDKDFPEVEIIIGDPSKFGKTH 225
Query: 196 -STVLRPG----PKQ--------------------------EENLEFYKPKVSTWGVFPR 224
ST ++P P++ EE+ + YKP V +WG+FPR
Sbjct: 226 RSTEVQPADDSEPEETSRSTTEEKKEENKPDETPPSTVTEPEEDEDEYKPTVRSWGMFPR 285
Query: 225 PGNISK 230
P NISK
Sbjct: 286 PQNISK 291
>gi|297825943|ref|XP_002880854.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326693|gb|EFH57113.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 141
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 40/55 (72%), Positives = 47/55 (85%)
Query: 310 MVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
+V +SELHGLAALQWSICQDSL + ++AR +YEKL SHPN VSK+ARQ MFSFQ
Sbjct: 1 VVVQSELHGLAALQWSICQDSLRKAEKARRIYEKLLSHPNPRVSKKARQLMFSFQ 55
>gi|412987966|emb|CCO19362.1| predicted protein [Bathycoccus prasinos]
Length = 430
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 99/185 (53%), Gaps = 7/185 (3%)
Query: 183 VEIIVGDTSK--FGESTVLRPGPKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRP 240
V+II DTS F E + G E YKPKVSTWGVFPRP +ISK +GGGRT++
Sbjct: 173 VQIISKDTSYNPFDEDESV-VGMDDIETGVNYKPKVSTWGVFPRPKDISKEYGGGRTLKK 231
Query: 241 G----DVLETAEARAAKEERTRQLLAAYKKSVGLNVDPKLKSECEKALKDGDSLMDSGKL 296
+++E+ E A++ R + L Y+ + LN+ + + + +AL + + + G +
Sbjct: 232 DEKGQNIIESKEETEARKIRVAKKLEKYRTTNALNMTKEEEEKAREALAEANEFLRVGSV 291
Query: 297 KEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRA 356
+ + E + +SEL G +++C D+ R EA Y+++ + LVSK+A
Sbjct: 292 TKGIAVLEPFEKDINARSELGGQVIFCYAMCLDNAQRRDEALKQYKRVLGNQYGLVSKQA 351
Query: 357 RQFMF 361
+ ++
Sbjct: 352 ERMLW 356
>gi|145352223|ref|XP_001420453.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580687|gb|ABO98746.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 283
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 86/153 (56%), Gaps = 2/153 (1%)
Query: 212 YKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLN 271
YKP VSTWGVF RP +ISKT+GGGRT++ + LE EA AA+ R + LA Y++ G+
Sbjct: 58 YKPSVSTWGVFERPPDISKTYGGGRTVKREE-LEDEEAIAARRARVAKKLAKYREDQGMT 116
Query: 272 VDPKLKSECEKALKDGDSLMDSGKLKEALPFYE-KVMNKMVFKSELHGLAALQWSICQDS 330
+ +E + L+ M G +++AL E ++++ K+E+ G ++IC D+
Sbjct: 117 LTNAEYAEVREVLERSGEAMKRGYMQDALDLLEPWALDRIGQKTEMGGRIIFNYAICLDN 176
Query: 331 LHRPKEARIMYEKLQSHPNALVSKRARQFMFSF 363
L R EA Y + + VSK+A + ++
Sbjct: 177 LQRRDEALKQYRRCIGNQYGNVSKQADRMIWGM 209
>gi|219118404|ref|XP_002179976.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408233|gb|EEC48167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 777
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 81/157 (51%), Gaps = 4/157 (2%)
Query: 210 EFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVG 269
+ Y+PK+ +WG FPRP +ISK +GGGR I PG +AR TR+ L Y++ VG
Sbjct: 331 DVYQPKMGSWGAFPRPRDISKAYGGGRRIGPG--FSNEQARVESTAATRERLQRYREKVG 388
Query: 270 LNVDP-KLKS-ECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSIC 327
+ V+ KL + E +AL+ G M+ G A+ EKV S++ L+ ++
Sbjct: 389 IEVESEKLHADEIAEALRIGSLAMERGIYSTAVSALEKVTRYCSSNSKVGSKVFLELAMA 448
Query: 328 QDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
+++ R +EA +Y L + A++ ++ +
Sbjct: 449 YEAVGRTEEAIAVYTTLSKCRMEDIKHNAKRLLYGLE 485
>gi|255086213|ref|XP_002509073.1| predicted protein [Micromonas sp. RCC299]
gi|226524351|gb|ACO70331.1| predicted protein [Micromonas sp. RCC299]
Length = 319
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 56/96 (58%)
Query: 212 YKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLN 271
Y+PKVSTWGVF RP NISK FGGGR I G ET E A + +R + L Y+K G++
Sbjct: 217 YQPKVSTWGVFERPDNISKEFGGGRKISMGGEEETEEQIAERRKRVQDKLNKYRKEQGID 276
Query: 272 VDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVM 307
VD KS ALKD + + G +E + + ++
Sbjct: 277 VDEGTKSRWYVALKDAKASLSKGNFQEGIELLQDIV 312
>gi|255544069|ref|XP_002513097.1| hypothetical protein RCOM_1453380 [Ricinus communis]
gi|223548108|gb|EEF49600.1| hypothetical protein RCOM_1453380 [Ricinus communis]
Length = 88
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/75 (52%), Positives = 51/75 (68%), Gaps = 4/75 (5%)
Query: 291 MDSGKLKEALPFYEKVMNKMVFKSELHGLAALQWSICQDSLHRPKEARIMYEKLQSHPNA 350
MD GKLKEALP Y+KVM+K+ ++ + L ++ + R EA+ MYEKLQSHP+A
Sbjct: 1 MDYGKLKEALPSYQKVMDKLPLQTAVLTLLPKIVTM----IDRRNEAQAMYEKLQSHPDA 56
Query: 351 LVSKRARQFMFSFQV 365
V K+ARQ FSFQV
Sbjct: 57 EVCKKARQIKFSFQV 71
>gi|397573944|gb|EJK48943.1| hypothetical protein THAOC_32220 [Thalassiosira oceanica]
Length = 781
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 78/164 (47%), Gaps = 3/164 (1%)
Query: 203 PKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLA 262
P EE Y+PKV TWG FPRP NIS +GGG+ + DV + R E T++ L
Sbjct: 325 PPAEEKDADYQPKVGTWGAFPRPRNISTAYGGGKRV-GADVRTDEKKRQESIESTKERLR 383
Query: 263 AYKKSVGLNVDPKL--KSECEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLA 320
AY++ +G+ V ++ K ++AL M G + EKV KS+L G
Sbjct: 384 AYRERMGIVVQSEIDNKEIIDEALALAGRAMLRGSYSVGVSALEKVTQYCSTKSKLGGKV 443
Query: 321 ALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
L+ ++ ++ + +A +Y L P + A + ++ +
Sbjct: 444 FLELAMAYEAEGKTDQAIGLYAALTKSPIEQIKMNADKLLYGLE 487
>gi|116786983|gb|ABK24330.1| unknown [Picea sitchensis]
Length = 253
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 87/181 (48%), Gaps = 29/181 (16%)
Query: 65 PVKLAFEKAKAYRKLKESNSDSKYEQNPDKDVARA-----------AVEKAKEYKKNK-- 111
PVK A EKAK Y+K ++ N + P+ + + A+E+A+ YKK +
Sbjct: 65 PVKRALEKAKDYKKKQQQNQNDCAPNVPESEFVSSEPKGTGGAAFDALERARAYKKQQGE 124
Query: 112 -------DTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLK---ISSIDFMGLNFADKKEG 161
+TV + T G R P + + K ISSIDF+GL F++KK
Sbjct: 125 KSYPKEDNTVEVEIITKDGVIRR-RVTPPGQAFSNVKSYKNKGISSIDFIGLEFSEKKPK 183
Query: 162 KGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPGPKQEENLEFYKPKVSTWGV 221
+GLPAGL + + G PEVEII D +K V P P + + YKPKV+TW
Sbjct: 184 RGLPAGLNIGLELPSSGTLPEVEIITRDAAK----NVKTP-PGNDNSSGLYKPKVATWES 238
Query: 222 F 222
F
Sbjct: 239 F 239
>gi|298708542|emb|CBJ49175.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 648
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 8/161 (4%)
Query: 203 PKQEENLEFYKPKVSTWGVFPRPGNISKTFGGGR--TIRPGDVLETAEARAAKEERTRQL 260
PK+ +++ +KP VSTWGVF RP ++SK FGGGR T++ D E + ++E T+ +
Sbjct: 185 PKEGADVDTHKPVVSTWGVFERPKDMSKAFGGGRDPTLKRLD----PEEKRRRDEETKAI 240
Query: 261 LAAYKKSVGLNVDPKLKSECE--KALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHG 318
L +Y+ S G ++ + +SE E AL M G A+ E V + L
Sbjct: 241 LNSYRGSTGEDLKREKESETEIKAALAQSKRAMKFGDTFGAVSALEGVKQECSIHGPLGA 300
Query: 319 LAALQWSICQDSLHRPKEARIMYEKLQSHPNALVSKRARQF 359
+ L+ ++ ++ R +A+ +Y+ L+ N + +A++
Sbjct: 301 VVFLELAMALEATGRSYQAQDIYKVLRRSKNREIKSQAKRL 341
>gi|224015169|ref|XP_002297244.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968098|gb|EED86451.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 703
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 75/160 (46%), Gaps = 6/160 (3%)
Query: 209 LEFYKP-KVSTWGVFPRPGNISKTFGGGRTIRPGDVLETAEA-RAAKEERTRQLLAAYKK 266
+E Y P K TWG FPRP NIS +GGG+ I G + T EA R E T++ L AY++
Sbjct: 250 VENYNPSKSGTWGAFPRPRNISVAYGGGKRIGAG--VRTDEAKRQQSIEDTKEKLRAYRE 307
Query: 267 SVGLNVDPKLKSE--CEKALKDGDSLMDSGKLKEALPFYEKVMNKMVFKSELHGLAALQW 324
+G+ V + ++AL M G + EKV KS++ G L+
Sbjct: 308 RMGIEVQSEKDHADVIDEALAIAGRAMQRGSYSTGVSALEKVTQYCSTKSKVGGKVFLEL 367
Query: 325 SICQDSLHRPKEARIMYEKLQSHPNALVSKRARQFMFSFQ 364
++ ++ + +A +Y L P + A + ++ +
Sbjct: 368 AMAYEAEGKTDQAISLYAALTKSPIEQIKMNAEKLLYGLE 407
>gi|422292728|gb|EKU20030.1| hypothetical protein NGA_2010400, partial [Nannochloropsis gaditana
CCMP526]
Length = 492
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 76/154 (49%), Gaps = 13/154 (8%)
Query: 204 KQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDV-LETAEARAAKEERTRQLLA 262
K+ +N+E YKP+ ++WG F RP NI FGGG I G V LE + AK + +
Sbjct: 93 KRFQNVETYKPQSTSWGEFARPKNIPDAFGGGVHIPEGGVTLEGDKEWLAKYDTLMAQVR 152
Query: 263 AYKKSV-GLNVDPKLKSECEK---ALKDGDSLMDSGKLKEA----LPFYEKVMNKMVFKS 314
YKK V G + P K++ E+ A M G+ + A LPF + +S
Sbjct: 153 EYKKGVQGTVLTPLEKADEERIKVARTQARKYMLRGEYEAAKAVLLPF----RSICSVRS 208
Query: 315 ELHGLAALQWSICQDSLHRPKEARIMYEKLQSHP 348
L + ++ ++++ KEA+ +Y +++S P
Sbjct: 209 SLGASTLMDLAMAHEAVYEFKEAQDIYTQVRSSP 242
>gi|387219189|gb|AFJ69303.1| hypothetical protein NGATSA_2010400 [Nannochloropsis gaditana
CCMP526]
Length = 460
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 76/154 (49%), Gaps = 13/154 (8%)
Query: 204 KQEENLEFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDV-LETAEARAAKEERTRQLLA 262
K+ +N+E YKP+ ++WG F RP NI FGGG I G V LE + AK + +
Sbjct: 61 KRFQNVETYKPQSTSWGEFARPKNIPDAFGGGVHIPEGGVTLEGDKEWLAKYDTLMAQVR 120
Query: 263 AYKKSV-GLNVDPKLKSECEK---ALKDGDSLMDSGKLKEA----LPFYEKVMNKMVFKS 314
YKK V G + P K++ E+ A M G+ + A LPF + +S
Sbjct: 121 EYKKGVQGTVLTPLEKADEERIKVARTQARKYMLRGEYEAAKAVLLPF----RSICSVRS 176
Query: 315 ELHGLAALQWSICQDSLHRPKEARIMYEKLQSHP 348
L + ++ ++++ KEA+ +Y +++S P
Sbjct: 177 SLGASTLMDLAMAHEAVYEFKEAQDIYTQVRSSP 210
>gi|296081029|emb|CBI18310.3| unnamed protein product [Vitis vinifera]
Length = 43
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/25 (80%), Positives = 23/25 (92%)
Query: 340 MYEKLQSHPNALVSKRARQFMFSFQ 364
MYE+LQ+HPNALVSK+ARQ MF FQ
Sbjct: 1 MYERLQTHPNALVSKKARQLMFGFQ 25
>gi|319653144|ref|ZP_08007246.1| hypothetical protein HMPREF1013_03861 [Bacillus sp. 2_A_57_CT2]
gi|317395065|gb|EFV75801.1| hypothetical protein HMPREF1013_03861 [Bacillus sp. 2_A_57_CT2]
Length = 286
Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 86/196 (43%), Gaps = 33/196 (16%)
Query: 35 SFFLNATSNNDEASEQPISENSPEPEASLDPVKLAFEKAKAYRKLKESNSDSKYEQNPDK 94
S+FL A + N A+E +S N +P++ A+ Y +LK + S N +
Sbjct: 108 SYFLFADTANHTAAEFALS-NGFKPQS-----------AQTYMELKGNPS------NELR 149
Query: 95 DVARAAVEKAKEYKKNKDTVSFQNGTNSGFKGEGRGNLPKEV--GGKKEGL--------- 143
D++ E+ DT+ F + SG + GR + +V G +GL
Sbjct: 150 DISLLPPHSHSEFIHLHDTL-FPHTYYSGKEIIGRLSDEHKVYTSGDADGLNGYLYAEYN 208
Query: 144 ---KISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLR 200
K SI+F+G+ F +K+G G + V D F +++ VG T++ S +
Sbjct: 209 RDEKEGSIEFIGVKFCKRKKGIGRKLLDMAVHDLFVNAGAASIKLCVGTTNQKALSLYKK 268
Query: 201 PGPKQEENLEFYKPKV 216
G K E +L FYK +
Sbjct: 269 AGFKVERSLSFYKLYI 284
>gi|449518927|ref|XP_004166487.1| PREDICTED: uncharacterized protein LOC101232417 [Cucumis sativus]
Length = 146
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 29/41 (70%), Gaps = 3/41 (7%)
Query: 52 ISENSPEPEASLDPVKLAFEKAKAYRKLKESNSDSKYEQNP 92
+S+ +P P LDPVKLAFE+AKAY+KL +S S+ E P
Sbjct: 61 VSDAAPPP---LDPVKLAFERAKAYKKLSKSGSNLNVELKP 98
>gi|326319531|ref|YP_004237203.1| SMC domain-containing protein [Acidovorax avenae subsp. avenae ATCC
19860]
gi|323376367|gb|ADX48636.1| SMC domain protein [Acidovorax avenae subsp. avenae ATCC 19860]
Length = 944
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 10/132 (7%)
Query: 87 KYEQNPDKDVARAAVEKAKEYK--KNKDTVSFQNGTNSGFKGEGRGNLPKEVGGKKEGLK 144
+Y +N + A A VE A + +N DTV Q G F +G+G++ G G +
Sbjct: 783 RYRKNIQELGALAGVEVAADLPLLENDDTVLAQAGLKVHFAFDGKGSIGMNDGEASGGQQ 842
Query: 145 I--SSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRPG 202
+ S I +GL D++ G G G V + +PFA D ++ +VG + ++ +
Sbjct: 843 VIKSLILLVGL-LKDEESGSG---GFVFIDEPFAHLDVRNIQ-LVGHFLRSTQAQYVLTT 897
Query: 203 PKQEENLEFYKP 214
P NLE ++P
Sbjct: 898 PI-THNLEVFEP 908
>gi|120613396|ref|YP_973074.1| SMC domain-containing protein [Acidovorax citrulli AAC00-1]
gi|120591860|gb|ABM35300.1| SMC domain protein [Acidovorax citrulli AAC00-1]
Length = 944
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 65/133 (48%), Gaps = 12/133 (9%)
Query: 87 KYEQNPDKDVARAAVEKAKEYK--KNKDTVSFQNGTNSGFKGEGRGNL---PKEVGGKKE 141
+Y +N + A A VE A + +N DTV Q G F +G+G++ E G ++
Sbjct: 783 RYRKNIQELGALAGVEVAADLPLLENDDTVLAQAGLKVHFAFDGKGSIGLNDGEASGGQQ 842
Query: 142 GLKISSIDFMGLNFADKKEGKGLPAGLVPVTDPFAEGDTPEVEIIVGDTSKFGESTVLRP 201
+K S I +GL D++ G G G V + +PFA D ++ +VG + ++ +
Sbjct: 843 VIK-SLILLVGL-LKDEESGSG---GFVFIDEPFAHLDVRNIQ-LVGHFLRSTQAQYVLT 896
Query: 202 GPKQEENLEFYKP 214
P NLE ++P
Sbjct: 897 TPI-THNLEVFEP 908
>gi|392397224|ref|YP_006433825.1| SH3 domain-containing protein [Flexibacter litoralis DSM 6794]
gi|390528302|gb|AFM04032.1| SH3 domain-containing protein [Flexibacter litoralis DSM 6794]
Length = 256
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 27/40 (67%)
Query: 270 LNVDPKLKSECEKALKDGDSLMDSGKLKEALPFYEKVMNK 309
+++ + +++ EK LK GDSL SG+ EAL YE ++NK
Sbjct: 30 ISISFRSQAQNEKLLKQGDSLFSSGRYVEALSLYEHLLNK 69
>gi|224069196|ref|XP_002326298.1| AP2/ERF domain-containing transcription factor [Populus
trichocarpa]
gi|148372128|gb|ABQ62995.1| RAP2-like protein [Populus trichocarpa]
gi|222833491|gb|EEE71968.1| AP2/ERF domain-containing transcription factor [Populus
trichocarpa]
Length = 209
Score = 37.7 bits (86), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 45/89 (50%), Gaps = 7/89 (7%)
Query: 222 FPR-PGNISKTFGGGRTIRPGDVLETAEARAAKEERTRQLLAAYKKSVGLNV-DPKLKSE 279
FP P NI+ GGR++ P ++ E A AR A E R ++ ++V + + +S
Sbjct: 73 FPENPPNIA----GGRSLSPAEIQEAA-ARFANSEPQRSQPDRFETDQSVSVSESRAESP 127
Query: 280 CEKALKDGDSLMDSGKLKEALPFYEKVMN 308
C + DG M+ G+L PF + +MN
Sbjct: 128 CPSVVSDGTVQMEGGELMWDGPFLDMLMN 156
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.310 0.128 0.358
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,952,142,568
Number of Sequences: 23463169
Number of extensions: 269083575
Number of successful extensions: 671384
Number of sequences better than 100.0: 317
Number of HSP's better than 100.0 without gapping: 68
Number of HSP's successfully gapped in prelim test: 249
Number of HSP's that attempted gapping in prelim test: 670804
Number of HSP's gapped (non-prelim): 665
length of query: 368
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 224
effective length of database: 8,980,499,031
effective search space: 2011631782944
effective search space used: 2011631782944
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 77 (34.3 bits)