BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 035921
(305 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225444981|ref|XP_002282754.1| PREDICTED: uncharacterized protein LOC100249262 [Vitis vinifera]
Length = 304
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 207/298 (69%), Positives = 243/298 (81%), Gaps = 5/298 (1%)
Query: 7 AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
++V V IL++ST +S+C+KKPV AR++D+PYI CQVCEKLA+QLYHQV+KKQAQI
Sbjct: 8 TIIVLVFSILVISTWLQVSHCSKKPVGVARRDDIPYIKCQVCEKLAAQLYHQVEKKQAQI 67
Query: 67 SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
SPKKISEFEII I+ENVCNLKKEEADWIL+IDIVEQGDKL LVEQ++EGQCNSECKTIER
Sbjct: 68 SPKKISEFEIIEISENVCNLKKEEADWILRIDIVEQGDKLKLVEQDSEGQCNSECKTIER 127
Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
ACQE+MGYSDTD AEY+Y +KP I SLVNYLCKDL+KACS PPPV K+R PGEPFVPK
Sbjct: 128 ACQEVMGYSDTDAAEYVYKNKPQIDSLVNYLCKDLTKACSASPPPVPKDRTPGEPFVPKS 187
Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLG 246
KEAEME+IM+SMEGMPGAPGM+MYS+EDLMNM+NF GD++ D+DDDDE FP LG
Sbjct: 188 LKEAEMEKIMKSMEGMPGAPGMKMYSREDLMNMQNF----GDEDADDDDDDEPHFPSKLG 243
Query: 247 KVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTE 304
K R KES DWK +T GI TG LK+ A++VSN +++WW G K SKK K
Sbjct: 244 KNLRAKESTKPDWKHTVTKGIKETGQALKRHASRVSNQIRQWWRG-KITSSKKNSKAH 300
>gi|297738696|emb|CBI27941.3| unnamed protein product [Vitis vinifera]
Length = 301
Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust.
Identities = 207/298 (69%), Positives = 243/298 (81%), Gaps = 5/298 (1%)
Query: 7 AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
++V V IL++ST +S+C+KKPV AR++D+PYI CQVCEKLA+QLYHQV+KKQAQI
Sbjct: 5 TIIVLVFSILVISTWLQVSHCSKKPVGVARRDDIPYIKCQVCEKLAAQLYHQVEKKQAQI 64
Query: 67 SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
SPKKISEFEII I+ENVCNLKKEEADWIL+IDIVEQGDKL LVEQ++EGQCNSECKTIER
Sbjct: 65 SPKKISEFEIIEISENVCNLKKEEADWILRIDIVEQGDKLKLVEQDSEGQCNSECKTIER 124
Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
ACQE+MGYSDTD AEY+Y +KP I SLVNYLCKDL+KACS PPPV K+R PGEPFVPK
Sbjct: 125 ACQEVMGYSDTDAAEYVYKNKPQIDSLVNYLCKDLTKACSASPPPVPKDRTPGEPFVPKS 184
Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLG 246
KEAEME+IM+SMEGMPGAPGM+MYS+EDLMNM+NF GD++ D+DDDDE FP LG
Sbjct: 185 LKEAEMEKIMKSMEGMPGAPGMKMYSREDLMNMQNF----GDEDADDDDDDEPHFPSKLG 240
Query: 247 KVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTE 304
K R KES DWK +T GI TG LK+ A++VSN +++WW G K SKK K
Sbjct: 241 KNLRAKESTKPDWKHTVTKGIKETGQALKRHASRVSNQIRQWWRG-KITSSKKNSKAH 297
>gi|356498081|ref|XP_003517882.1| PREDICTED: uncharacterized protein LOC100778421 [Glycine max]
Length = 303
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 205/283 (72%), Positives = 241/283 (85%)
Query: 23 PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
P+S+C+KKPV ARKED+PYI CQVCEKLA +LY QVQ KQA+I+PKKISE++II IAEN
Sbjct: 21 PLSHCSKKPVGVARKEDIPYIKCQVCEKLAKELYQQVQNKQAEIAPKKISEYQIIEIAEN 80
Query: 83 VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
VCNLKK EADWIL+IDIVE+ D+L LVEQ++EGQCNSECKTIERACQE++GYSDTDVAEY
Sbjct: 81 VCNLKKVEADWILRIDIVEKEDRLELVEQDSEGQCNSECKTIERACQEVIGYSDTDVAEY 140
Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
LY SKPDI SL NYLCKDLSKACS+KPPPV K+R PGEPFV K +KEAEME++++SMEGM
Sbjct: 141 LYNSKPDIDSLRNYLCKDLSKACSSKPPPVPKDRAPGEPFVAKSSKEAEMEKLLKSMEGM 200
Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
PGAPGM+MYS++DLMNMKNFGGEDGD+ D++DDDDE+ FP LG V R KES DWKQ
Sbjct: 201 PGAPGMKMYSRDDLMNMKNFGGEDGDEEDEDDDDDEASFPSKLGNVLRAKESGKRDWKQM 260
Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
+ GI + LKK ANKVSNH+++WW G + SKK GKTEL
Sbjct: 261 VKKGIEDASMTLKKHANKVSNHIRQWWRGKRRTNSKKGGKTEL 303
>gi|356520856|ref|XP_003529076.1| PREDICTED: uncharacterized protein LOC100782313 [Glycine max]
Length = 300
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 204/283 (72%), Positives = 240/283 (84%), Gaps = 1/283 (0%)
Query: 23 PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
P S+C+KKPV ARKED+ YI CQVCEKLA +LY QVQ KQA+I+PKKISE++II IAEN
Sbjct: 19 PFSHCSKKPVGVARKEDIQYIKCQVCEKLAKELYQQVQNKQAEIAPKKISEYQIIEIAEN 78
Query: 83 VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
VCNLKK EADWIL+IDIVE+ D+L LVEQ++EGQCNSECKTIERACQE++GYSDTDVAEY
Sbjct: 79 VCNLKKAEADWILRIDIVEKEDRLELVEQDSEGQCNSECKTIERACQEVIGYSDTDVAEY 138
Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
LY+SKPDI SL NYLCKDL+ ACSTKPPPV K+R PGEPFV K +KEAEME++++SMEGM
Sbjct: 139 LYSSKPDIDSLRNYLCKDLTNACSTKPPPVPKDRAPGEPFVAKSSKEAEMEKLLKSMEGM 198
Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
PGAPGM+MYS++DLMNMKNFG EDGDD D++D+++ S FP LGKV R KES+ DWKQ
Sbjct: 199 PGAPGMKMYSRDDLMNMKNFGDEDGDDEDEDDNEEAS-FPSKLGKVLRAKESEKRDWKQV 257
Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
I GI +T LKK ANKVSNH+++WW G K SKK GKTEL
Sbjct: 258 IKKGIEDTSMTLKKHANKVSNHIRQWWRGKKITNSKKGGKTEL 300
>gi|449528651|ref|XP_004171317.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101206912
[Cucumis sativus]
Length = 300
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/286 (70%), Positives = 235/286 (82%), Gaps = 6/286 (2%)
Query: 23 PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISE--FEIIGIA 80
P C+KKP A ARKED+PYI CQVCEKLA++LYHQV+KKQA+I+PKK S II IA
Sbjct: 18 PTFECSKKPSAAARKEDIPYIKCQVCEKLAAELYHQVEKKQAEIAPKKYSSVVIXIIEIA 77
Query: 81 ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
ENVCNLKK EADWIL+IDIVEQGDKL LV+Q EGQCNSECKTIERACQE+MGYSDTDVA
Sbjct: 78 ENVCNLKKAEADWILQIDIVEQGDKLELVDQNTEGQCNSECKTIERACQEVMGYSDTDVA 137
Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
EYLY+SKP I SLVNYLCKDL+K+C+TKPPPV K+R PGEPFV K +KEAEME++MRSME
Sbjct: 138 EYLYSSKPKIDSLVNYLCKDLTKSCTTKPPPVPKDRTPGEPFVAKSSKEAEMEKMMRSME 197
Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
GMPGAPGM+MYS++DLMNMKNFGGE D+D+E++++E +FP NLGKV REKE K+ DWK
Sbjct: 198 GMPGAPGMKMYSRDDLMNMKNFGGE---DDDEEEEEEEENFPSNLGKVLREKERKNNDWK 254
Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSG-IKAARSKKPGKTEL 305
RIT G+ G+ LK+ A KVSN V+ WW +S KP K EL
Sbjct: 255 NRITKGVSKAGEALKQHAYKVSNKVRHWWRAKTGGLKSSKPTKQEL 300
>gi|357501185|ref|XP_003620881.1| hypothetical protein MTR_6g091780 [Medicago truncatula]
gi|355495896|gb|AES77099.1| hypothetical protein MTR_6g091780 [Medicago truncatula]
Length = 301
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 202/300 (67%), Positives = 249/300 (83%), Gaps = 2/300 (0%)
Query: 7 AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
V ++ V +++++ P+S+CAKKPV ARKEDVPYI CQVCE LA QLY QVQ K+A+I
Sbjct: 3 GVFWFLVVSVVIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEI 62
Query: 67 SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
SPKKISE++II IAENVCNLKK EADWIL+IDIVE+ D+L L E ++EGQCNSECKT+ER
Sbjct: 63 SPKKISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLELEEHDSEGQCNSECKTVER 122
Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
ACQE+MGYSDTDVAEYLY+SKPDI SL NYLCKDLSKAC+TKPPPV K+R PGEPFV K
Sbjct: 123 ACQEVMGYSDTDVAEYLYSSKPDIDSLTNYLCKDLSKACNTKPPPVPKDRTPGEPFVAKS 182
Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGED-GDDNDDEDDDDESDFPKNL 245
+KEAEME++++SMEGMPGAPGM+MYS++DLM KNFG E+ DD D+++++DE+DFP L
Sbjct: 183 SKEAEMEKLLKSMEGMPGAPGMKMYSRDDLMK-KNFGAENEDDDEDEDEEEDEADFPSKL 241
Query: 246 GKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
GK+ + KE++ GDWKQ+I GI++T LKK A KVSNH+QRWW G K SKK K+EL
Sbjct: 242 GKILKSKENEKGDWKQKIRKGIVDTSTTLKKHATKVSNHIQRWWKGKKTTSSKKNSKSEL 301
>gi|449435830|ref|XP_004135697.1| PREDICTED: uncharacterized protein LOC101206912 [Cucumis sativus]
Length = 294
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 199/284 (70%), Positives = 232/284 (81%), Gaps = 8/284 (2%)
Query: 23 PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
P C+KKP A ARKED+PYI CQVCEKLA++LYHQV+KKQA+I+PKKISE++II IAEN
Sbjct: 18 PTFECSKKPSAAARKEDIPYIKCQVCEKLAAELYHQVEKKQAEIAPKKISEYQIIEIAEN 77
Query: 83 VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
VCNLKK EADWIL+IDIVEQGDKL LV+Q EGQCNSECKTIERACQE+MGYSDTDVAEY
Sbjct: 78 VCNLKKAEADWILQIDIVEQGDKLELVDQNTEGQCNSECKTIERACQEVMGYSDTDVAEY 137
Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
LY+SKP I SLVNYLCKDL+K+C+TKPPPV K+R PGEPFV K +KEAEME++MRSMEGM
Sbjct: 138 LYSSKPKIDSLVNYLCKDLTKSCTTKPPPVPKDRTPGEPFVAKSSKEAEMEKMMRSMEGM 197
Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
PGAPGM+MYS++DLMNMKNFGGED D+ ++E+ LGKV REKE K+ DWK R
Sbjct: 198 PGAPGMKMYSRDDLMNMKNFGGEDDDEEEEENIF-------YLGKVLREKERKNNDWKNR 250
Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSG-IKAARSKKPGKTEL 305
IT G+ G+ LK+ A KVSN V+ WW +S KP K EL
Sbjct: 251 ITKGVSKAGEALKQHAYKVSNKVRHWWRAKTGGLKSSKPTKQEL 294
>gi|217071942|gb|ACJ84331.1| unknown [Medicago truncatula]
Length = 301
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 201/300 (67%), Positives = 248/300 (82%), Gaps = 2/300 (0%)
Query: 7 AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
V ++ V +++++ P+S+CAKKPV ARKEDVPYI CQVCE LA QLY QVQ K+A+I
Sbjct: 3 GVFWFLVVSVVIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEI 62
Query: 67 SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
SPKKISE++II IAENVCNLKK EADWIL+IDIVE+ D+L L E ++EGQCNSE KT+ER
Sbjct: 63 SPKKISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLELEEHDSEGQCNSEYKTVER 122
Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
ACQE+MGYSDTDVAEYLY+SKPDI SL NYLCKDLSKAC+TKPPPV K+R PGEPFV K
Sbjct: 123 ACQEVMGYSDTDVAEYLYSSKPDIDSLTNYLCKDLSKACNTKPPPVPKDRTPGEPFVAKS 182
Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGED-GDDNDDEDDDDESDFPKNL 245
+KEAEME++++SMEGMPGAPGM+MYS++DLM KNFG E+ DD D+++++DE+DFP L
Sbjct: 183 SKEAEMEKLLKSMEGMPGAPGMKMYSRDDLMK-KNFGAENEDDDEDEDEEEDEADFPSKL 241
Query: 246 GKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
GK+ + KE++ GDWKQ+I GI++T LKK A KVSNH+QRWW G K SKK K+EL
Sbjct: 242 GKILKSKENEKGDWKQKIRKGIVDTSTTLKKHATKVSNHIQRWWKGKKTTSSKKNSKSEL 301
>gi|224122586|ref|XP_002330518.1| predicted protein [Populus trichocarpa]
gi|222872452|gb|EEF09583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 184/283 (65%), Positives = 231/283 (81%), Gaps = 6/283 (2%)
Query: 23 PISYCAK--KPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIA 80
P+S AK KP+ ARK+D+PYI CQVCEK+ASQLY Q QKKQ QISP+KISE+++I IA
Sbjct: 22 PVSESAKTDKPLIVARKDDIPYIKCQVCEKVASQLYRQAQKKQDQISPQKISEYQVIEIA 81
Query: 81 ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
ENVCNLKKEEADWI+K+DI+EQGDKL L+EQ+ EGQCNSECKTIE+ CQ++M Y DTDVA
Sbjct: 82 ENVCNLKKEEADWIMKLDIMEQGDKLELIEQDTEGQCNSECKTIEQTCQQVMDYVDTDVA 141
Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
EY+YT+KP I LV +LCKDL+ +C+ KPPPV KNR+PGEPFVPKP+KE EME+IMRSME
Sbjct: 142 EYIYTTKPQIDLLVKHLCKDLTNSCTKKPPPVPKNRMPGEPFVPKPSKEVEMEKIMRSME 201
Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
GMPGAPGM+MYS+EDLM M NF G+++ +++DD++ FP N+GKV REK+ DWK
Sbjct: 202 GMPGAPGMKMYSREDLMGMNNF----GNEDAEDEDDEDEQFPSNMGKVLREKQDMKDDWK 257
Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKT 303
Q+I +GI +T + LK+ N VS +Q+WW GI+AA +KK KT
Sbjct: 258 QKIFEGIKSTAEALKRHTNIVSKQIQKWWKGIRAAPAKKNSKT 300
>gi|357501049|ref|XP_003620813.1| hypothetical protein MTR_6g090810 [Medicago truncatula]
gi|355495828|gb|AES77031.1| hypothetical protein MTR_6g090810 [Medicago truncatula]
Length = 286
Score = 366 bits (939), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 192/300 (64%), Positives = 237/300 (79%), Gaps = 17/300 (5%)
Query: 7 AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
V + V +++++ P+S+CAKKPV ARKEDVPYI CQVCE LA QLY QVQ K+A+I
Sbjct: 3 GVFWLLVVSVVIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEI 62
Query: 67 SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIER 126
SPKKISE++II IAENVCNLKK EADWIL+IDIVE+ D+L +T+ER
Sbjct: 63 SPKKISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRL---------------ETVER 107
Query: 127 ACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKP 186
ACQE++GYSDTDVAEYLY+SKPDI SL NYLCKDLSKAC+TKPPPV K+R PGEPFV K
Sbjct: 108 ACQEVLGYSDTDVAEYLYSSKPDIDSLTNYLCKDLSKACNTKPPPVPKDRTPGEPFVAKS 167
Query: 187 AKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGED-GDDNDDEDDDDESDFPKNL 245
+KEAEME++++SMEGMPGAPGM+MYS++DLM KNFG E+ DD D+++++DE+DFP L
Sbjct: 168 SKEAEMEKLLKSMEGMPGAPGMKMYSRDDLMK-KNFGAENEDDDEDEDEEEDEADFPSKL 226
Query: 246 GKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
GKV + KE++ GDWKQ+I GI++TG LKK A KVSNH+QRWW G K SKK K+EL
Sbjct: 227 GKVLKSKENEKGDWKQKIRKGIVDTGTTLKKHATKVSNHIQRWWKGKKTTSSKKNSKSEL 286
>gi|15233608|ref|NP_194681.1| uncharacterized protein [Arabidopsis thaliana]
gi|5123546|emb|CAB45312.1| putative protein [Arabidopsis thaliana]
gi|7269851|emb|CAB79710.1| putative protein [Arabidopsis thaliana]
gi|332660239|gb|AEE85639.1| uncharacterized protein [Arabidopsis thaliana]
Length = 306
Score = 361 bits (926), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 181/270 (67%), Positives = 225/270 (83%), Gaps = 1/270 (0%)
Query: 23 PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
P+S AKKP + RKEDVPYI CQVCEKL+S+L+ V++KQ QISPKKISE+EII IAEN
Sbjct: 23 PVSDAAKKPSSTPRKEDVPYIKCQVCEKLSSRLHQLVKEKQQQISPKKISEYEIIEIAEN 82
Query: 83 VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
VCNLKKEEADW+LKIDIVE+GD LVLVEQ+ EG CNS+CKTIE ACQ+++GYSDTDVAEY
Sbjct: 83 VCNLKKEEADWMLKIDIVEKGDNLVLVEQQEEGMCNSKCKTIENACQKVIGYSDTDVAEY 142
Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
+Y SKPD+ SLVN+LCKDL+ ACS KPPPV K+R+PGEPFV KP+K+AEM++I+RSM+GM
Sbjct: 143 IYKSKPDLVSLVNHLCKDLTDACSKKPPPVPKDRVPGEPFVAKPSKDAEMDKILRSMQGM 202
Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
PGAPGM++YS+ED+ N G ED D +DDED++++ FPKNLGKV +EKESK+ + K+
Sbjct: 203 PGAPGMKVYSREDI-EKGNIGNEDDDGDDDEDEEEDDKFPKNLGKVLKEKESKTEELKKT 261
Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGI 292
IT G+ LK+ A KVSN V+RWW G+
Sbjct: 262 ITKEFKKKGEALKRHAQKVSNRVRRWWKGL 291
>gi|21553422|gb|AAM62515.1| unknown [Arabidopsis thaliana]
Length = 303
Score = 360 bits (925), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 181/270 (67%), Positives = 225/270 (83%), Gaps = 1/270 (0%)
Query: 23 PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
P+S AKKP + RKEDVPYI CQVCEKL+S+L+ V++KQ QISPKKISE+EII IAEN
Sbjct: 20 PVSDAAKKPSSTPRKEDVPYIKCQVCEKLSSRLHQLVKEKQQQISPKKISEYEIIEIAEN 79
Query: 83 VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
VCNLKKEEADW+LKIDIVE+GD LVLVEQ+ EG CNS+CKTIE ACQ+++GYSDTDVAEY
Sbjct: 80 VCNLKKEEADWMLKIDIVEKGDNLVLVEQQEEGMCNSKCKTIENACQKVIGYSDTDVAEY 139
Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
+Y SKPD+ SLVN+LCKDL+ ACS KPPPV K+R+PGEPFV KP+K+AEM++I+RSM+GM
Sbjct: 140 IYKSKPDLVSLVNHLCKDLTDACSKKPPPVPKDRVPGEPFVAKPSKDAEMDKILRSMQGM 199
Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
PGAPGM++YS+ED+ N G ED D +DDED++++ FPKNLGKV +EKESK+ + K+
Sbjct: 200 PGAPGMKVYSREDI-EKGNIGNEDDDGDDDEDEEEDDKFPKNLGKVLKEKESKTEELKKT 258
Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGI 292
IT G+ LK+ A KVSN V+RWW G+
Sbjct: 259 ITKEFKKKGEALKRHAQKVSNRVRRWWKGL 288
>gi|297799038|ref|XP_002867403.1| hypothetical protein ARALYDRAFT_491806 [Arabidopsis lyrata subsp.
lyrata]
gi|297313239|gb|EFH43662.1| hypothetical protein ARALYDRAFT_491806 [Arabidopsis lyrata subsp.
lyrata]
Length = 307
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 181/272 (66%), Positives = 224/272 (82%), Gaps = 1/272 (0%)
Query: 23 PISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAEN 82
P+S AKKP + RKEDVPYI CQVCEKLAS+L+ V++KQ QISPKKISE+EII IAEN
Sbjct: 23 PVSDGAKKPSSAPRKEDVPYIKCQVCEKLASRLHQLVKEKQQQISPKKISEYEIIEIAEN 82
Query: 83 VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY 142
VCNLKKEEADW+LKIDIVE+GDKLVLVEQ EG CNS+CKTIE ACQ+++GYSDTDVAE+
Sbjct: 83 VCNLKKEEADWMLKIDIVEKGDKLVLVEQAEEGMCNSKCKTIENACQKVIGYSDTDVAEF 142
Query: 143 LYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGM 202
+Y SKPD+ SLVN+LC DL+ ACS KPPPV K+R+PGEPFV KP+K+AEM++I+RSM+GM
Sbjct: 143 IYKSKPDLVSLVNHLCNDLTDACSKKPPPVPKDRVPGEPFVAKPSKDAEMDKILRSMQGM 202
Query: 203 PGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQR 262
PGAPGM++YS+ED+ N G ED D +DDED++++ FPKNLGKV +EKESK+ + K+
Sbjct: 203 PGAPGMKVYSREDI-EKGNIGNEDEDGDDDEDEEEDEKFPKNLGKVLKEKESKTEELKKT 261
Query: 263 ITDGILNTGDVLKKQANKVSNHVQRWWSGIKA 294
I G+VLK A KVSN V+RWW G++
Sbjct: 262 IAKEFKKKGEVLKIHAQKVSNRVRRWWKGLRT 293
>gi|357501201|ref|XP_003620889.1| hypothetical protein MTR_6g091890 [Medicago truncatula]
gi|357501239|ref|XP_003620908.1| hypothetical protein MTR_6g092180 [Medicago truncatula]
gi|355495904|gb|AES77107.1| hypothetical protein MTR_6g091890 [Medicago truncatula]
gi|355495923|gb|AES77126.1| hypothetical protein MTR_6g092180 [Medicago truncatula]
Length = 304
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 186/303 (61%), Positives = 239/303 (78%), Gaps = 5/303 (1%)
Query: 7 AVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI 66
V + V +++++ P+S+CAKKPV ARKEDVPYI CQVCE LA QLY QVQ K+A+I
Sbjct: 3 GVFWLLVVSVVIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEI 62
Query: 67 SPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKL-VLVEQEAEGQCNSECKTIE 125
SPKKISE++II IAENVCNLKK EADWIL+IDIVE+ D+L ++ +++ + + +
Sbjct: 63 SPKKISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLENMILKDSAILNDRQLSVLV 122
Query: 126 RA--CQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFV 183
R+ +++GYSDTDVAEYLY+SKPDI SL NYLCKDLSKAC+TKPPPV K+R PGEPFV
Sbjct: 123 RSNLVLQVLGYSDTDVAEYLYSSKPDIDSLTNYLCKDLSKACNTKPPPVPKDRTPGEPFV 182
Query: 184 PKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGED-GDDNDDEDDDDESDFP 242
K +KEAEME++++SMEGMPGAPGM+MYS++DLM KNFG E+ DD D+++++DE+DFP
Sbjct: 183 AKSSKEAEMEKLLKSMEGMPGAPGMKMYSRDDLMK-KNFGAENEDDDEDEDEEEDEADFP 241
Query: 243 KNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGK 302
LGKV + KE++ GDWKQ+I GI++TG LKK A KVSNH+QRWW G K SKK K
Sbjct: 242 SKLGKVLKSKENEKGDWKQKIRKGIVDTGTTLKKHATKVSNHIQRWWKGKKTTSSKKNSK 301
Query: 303 TEL 305
+EL
Sbjct: 302 SEL 304
>gi|255546197|ref|XP_002514158.1| conserved hypothetical protein [Ricinus communis]
gi|223546614|gb|EEF48112.1| conserved hypothetical protein [Ricinus communis]
Length = 243
Score = 333 bits (853), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 152/205 (74%), Positives = 180/205 (87%)
Query: 20 TCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGI 79
C P+S AKKPV AR ED+PYI CQVCEKLA +LY QVQKKQA+I PKKI+E++II I
Sbjct: 19 ACVPVSLSAKKPVLSARTEDIPYIKCQVCEKLAKELYQQVQKKQAEILPKKITEYQIIEI 78
Query: 80 AENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDV 139
AENVCNLKK+EADWI+KIDIVE+GDKL LVEQ+ EGQCNSECKT+E+AC+E+MGY+DTD+
Sbjct: 79 AENVCNLKKQEADWIMKIDIVEKGDKLELVEQDVEGQCNSECKTVEKACEEVMGYADTDI 138
Query: 140 AEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSM 199
AEY+++SKP+I SLVNY CKDL+ AC KPPPV K+R PGEPF PKP+K+AEME+IMRSM
Sbjct: 139 AEYIFSSKPNIDSLVNYFCKDLTNACVKKPPPVPKDRSPGEPFEPKPSKDAEMEKIMRSM 198
Query: 200 EGMPGAPGMQMYSKEDLMNMKNFGG 224
+ MPGAP MQMYS+EDLMNMKNFG
Sbjct: 199 QDMPGAPNMQMYSREDLMNMKNFGN 223
>gi|212274637|ref|NP_001130151.1| uncharacterized protein LOC100191245 precursor [Zea mays]
gi|194688408|gb|ACF78288.1| unknown [Zea mays]
gi|413953188|gb|AFW85837.1| hypothetical protein ZEAMMB73_915175 [Zea mays]
Length = 303
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 162/285 (56%), Positives = 213/285 (74%), Gaps = 4/285 (1%)
Query: 22 APISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIA 80
A ++ KKP AR+ED+PYI CQVCE++A ++ QV KK + P KK+ E EII +A
Sbjct: 22 AEVAAGQKKPATAARREDIPYIRCQVCERIAREISAQVAKKHQALPPSKKVPEIEIIEVA 81
Query: 81 ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
ENVCNLKK+EADW+L+IDIVE+GDKL LVEQ+ EG CN+ECKTIERACQE++GY+DTDVA
Sbjct: 82 ENVCNLKKQEADWMLRIDIVEKGDKLELVEQDEEGHCNAECKTIERACQEVIGYADTDVA 141
Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
E++Y + P + L+ +LCKDLSKAC+ PPPV K+R+PGEPF KP+K+AEME+I+RSME
Sbjct: 142 EFVYKNNPSVDQLMKFLCKDLSKACAKDPPPVPKDRVPGEPFARKPSKDAEMEKILRSME 201
Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
GMPGAP M+MYS++DLM NFG E DD++++D+D++ +FPKNLGK + K S+ D K
Sbjct: 202 GMPGAPSMKMYSRDDLMK-NNFGTE--DDDEEDDEDEDDNFPKNLGKALKNKGSQKKDLK 258
Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
Q++ +T LK NKVS V++WW G K KTEL
Sbjct: 259 QQVVQQFKDTSKKLKGHVNKVSGMVKKWWKGAKKPAKSSKSKTEL 303
>gi|326521176|dbj|BAJ96791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 156/262 (59%), Positives = 205/262 (78%), Gaps = 2/262 (0%)
Query: 29 KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIAENVCNLK 87
KKP AR+EDVPYI CQVCE++A ++ QV KQ + P KK+ E EII IAENVCNLK
Sbjct: 32 KKPATAARREDVPYIRCQVCERIAREISAQVAAKQQALPPSKKVPEIEIIDIAENVCNLK 91
Query: 88 KEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK 147
K+EADW+L+IDIVE+GD+L LV+Q+ EG CNSECKTIERACQE+MGY+DTDVAE++YT+K
Sbjct: 92 KQEADWMLRIDIVEKGDRLELVDQDEEGHCNSECKTIERACQEVMGYADTDVAEFVYTNK 151
Query: 148 PDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPG 207
P + L+ ++CKD++KAC+ PPPV K+R+PGEPF KP+K+AEME+IMRSMEGMPGAP
Sbjct: 152 PSVDQLIKFVCKDITKACAANPPPVPKDRVPGEPFATKPSKDAEMEKIMRSMEGMPGAPN 211
Query: 208 MQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDGI 267
M+MYS++DLM NFG E +D++DED+D++ + PKNLG + + K + D KQ++ I
Sbjct: 212 MKMYSRDDLMK-NNFGVEGDEDDEDEDEDEDYNLPKNLGNILKNKGPQKKDLKQQVVKQI 270
Query: 268 LNTGDVLKKQANKVSNHVQRWW 289
+TG LK +KVS V+ WW
Sbjct: 271 KDTGKKLKGHVSKVSKVVKNWW 292
>gi|195627550|gb|ACG35605.1| hypothetical protein [Zea mays]
Length = 303
Score = 317 bits (813), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 157/271 (57%), Positives = 208/271 (76%), Gaps = 4/271 (1%)
Query: 22 APISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIA 80
A ++ KKP AR+ED+PYI CQVCE++A ++ QV KK + P KK+ E EII +A
Sbjct: 22 AEVAAGQKKPATAARREDIPYIRCQVCERIAREISAQVAKKHQALPPSKKVPEIEIIEVA 81
Query: 81 ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
ENVCNLKK+EADW+L+IDIVE+GDKL LVEQ+ EG CN+ECKTIERACQE++GY+DTDVA
Sbjct: 82 ENVCNLKKQEADWMLRIDIVEKGDKLELVEQDEEGHCNAECKTIERACQEVIGYADTDVA 141
Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
E++Y + P + L+ +LCKDLSKAC+ PPPV K+R+PGEPF KP+K+AEME+I+RSME
Sbjct: 142 EFVYKNNPSVDQLMKFLCKDLSKACAKDPPPVPKDRVPGEPFARKPSKDAEMEKILRSME 201
Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
GMPGAP M+MYS++DLM NFG E DD++++D+D++ +FPKNLGK + K S+ D K
Sbjct: 202 GMPGAPSMKMYSRDDLMK-NNFGTE--DDDEEDDEDEDDNFPKNLGKALKNKGSQKKDLK 258
Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSG 291
Q++ +T LK NKVS V++WW G
Sbjct: 259 QQVVQQFKDTSKKLKGHVNKVSGMVKKWWKG 289
>gi|116794067|gb|ABK26994.1| unknown [Picea sitchensis]
Length = 314
Score = 300 bits (767), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 158/301 (52%), Positives = 210/301 (69%), Gaps = 18/301 (5%)
Query: 1 MARIKAAVLVYV-AVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQV 59
MAR ++V AVI + AP KKP+A AR+ED+PYI C VCE++ QL QV
Sbjct: 3 MARFLGLGFLFVFAVIAGQAMAAP-----KKPIAVARREDIPYIKCGVCEQIVKQLDRQV 57
Query: 60 QKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNS 119
+ QI+PK++SE++II +AEN+CNLKKEE DW+L +DIVE GDKL LVEQE EG CN+
Sbjct: 58 TSIRNQIAPKRLSEYQIIEVAENICNLKKEEGDWLLHLDIVEDGDKLKLVEQEQEGDCNA 117
Query: 120 ECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPG 179
ECKTIER CQE++GY DTDVAE+L+ S+P SL+ +LC DLS+ACS KPPPV K+R PG
Sbjct: 118 ECKTIERTCQEVIGYHDTDVAEFLFKSEPQSSSLLKFLCNDLSRACSVKPPPVPKDRTPG 177
Query: 180 EPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDES 239
EPFVPK +KEAEME+IMRSM GMPGA M+MYSK+DLMNM NFG +G+D DDE+D+D+
Sbjct: 178 EPFVPKSSKEAEMEKIMRSMSGMPGASNMKMYSKDDLMNMPNFGS-NGEDEDDEEDEDDD 236
Query: 240 DFPKNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQA-----------NKVSNHVQRW 288
+ R S DWK+ + D + +T +K + +++S+ +++W
Sbjct: 237 YTSSRMRNTKRPLVSVESDWKKSLLDRVQSTFSNVKDRTQHAVLGAKWSLDRISDKIRKW 296
Query: 289 W 289
+
Sbjct: 297 Y 297
>gi|357501233|ref|XP_003620905.1| hypothetical protein MTR_6g092150 [Medicago truncatula]
gi|355495920|gb|AES77123.1| hypothetical protein MTR_6g092150 [Medicago truncatula]
Length = 298
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 164/295 (55%), Positives = 207/295 (70%), Gaps = 38/295 (12%)
Query: 12 VAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKI 71
+ V + +++ P+S+CAKKPV ARKEDVPYI CQVCE LA QLY QVQ K+A+ISPKKI
Sbjct: 41 LVVPVAIASWIPLSHCAKKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEISPKKI 100
Query: 72 SEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQ-EAEGQCNSECKTIERACQE 130
SE++II IAENVCNLKK EADWIL+IDIVE+ D+L L E+ ++EGQCNSECKT+ERACQE
Sbjct: 101 SEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLELEEEHDSEGQCNSECKTVERACQE 160
Query: 131 IMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEA 190
+MGYSDTDVAEYLY+ KPDI SL NYLCKDLSK+C+T K R PGEPFV K + EA
Sbjct: 161 VMGYSDTDVAEYLYSCKPDIDSLTNYLCKDLSKSCNT------KTRTPGEPFVAKSSNEA 214
Query: 191 EMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFR 250
EME++++SME NFG E+ DD++DED+D++
Sbjct: 215 EMEKLLKSME--------------------NFGAENEDDDEDEDEDEDE----------- 243
Query: 251 EKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
+++ D+ ++ ++T LKK A KVSNH+Q+WW G KA SKK K EL
Sbjct: 244 DEDEDEADFPSKLELWSVDTSTALKKHATKVSNHIQQWWKGKKAISSKKNPKPEL 298
>gi|357110756|ref|XP_003557182.1| PREDICTED: uncharacterized protein LOC100845828 [Brachypodium
distachyon]
Length = 304
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 160/274 (58%), Positives = 212/274 (77%), Gaps = 6/274 (2%)
Query: 33 AGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIAENVCNLKKEEA 91
AR+EDVPYI CQVCE++A ++ QV +KQ +SP KK+ E EII IAENVCNLKK+EA
Sbjct: 36 TAARREDVPYIRCQVCERIAREISAQVARKQQALSPSKKVPEIEIIEIAENVCNLKKQEA 95
Query: 92 DWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIK 151
DW+L+IDIVE+GD+L LV+Q+ EG CNSECKTIERACQE+MGY+DTDVAE++YT+KP +
Sbjct: 96 DWMLRIDIVEKGDRLELVDQDEEGHCNSECKTIERACQEVMGYADTDVAEFVYTNKPSLD 155
Query: 152 SLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMY 211
LV ++CKD++KAC+ PP V K+R+PGEPF KP+K+AEME+IMRSM +PGAP M+MY
Sbjct: 156 QLVKFVCKDITKACAVDPPSVPKDRVPGEPFAAKPSKDAEMEKIMRSMGDIPGAPSMKMY 215
Query: 212 SKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDGILNTG 271
S++DLM NFG GDD++D+D+DDE +FPKNLGK ++K + D K+++ I +TG
Sbjct: 216 SRDDLMQ-NNFGA-GGDDDEDDDEDDEDNFPKNLGKALKDKPPQKNDIKEQVVKQIKDTG 273
Query: 272 DVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
LK +KVS V++WW K +S GK+EL
Sbjct: 274 KKLKGHVSKVSKVVKKWWQAKKPTKS---GKSEL 304
>gi|115466448|ref|NP_001056823.1| Os06g0150400 [Oryza sativa Japonica Group]
gi|55296609|dbj|BAD69311.1| unknown protein [Oryza sativa Japonica Group]
gi|55297262|dbj|BAD69047.1| unknown protein [Oryza sativa Japonica Group]
gi|113594863|dbj|BAF18737.1| Os06g0150400 [Oryza sativa Japonica Group]
Length = 306
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 163/264 (61%), Positives = 206/264 (78%), Gaps = 4/264 (1%)
Query: 29 KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI-SPKKISEFEIIGIAENVCNLK 87
KK ARKED+PYI CQVCE++A ++ QV KKQ + + KK+ E EII IAENVCNLK
Sbjct: 34 KKVATAARKEDIPYIRCQVCERIAREISAQVAKKQQALPATKKVPEIEIIEIAENVCNLK 93
Query: 88 KEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK 147
K+EADW+LKIDIVE+GDKL LVEQ+ EG CN+ECKTIERACQE+MGY+DTDVAE++Y K
Sbjct: 94 KQEADWMLKIDIVEKGDKLELVEQDEEGHCNAECKTIERACQEVMGYADTDVAEFVYKKK 153
Query: 148 PDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPG 207
P LV +LCKDLS+AC PPPV K+R+PGEPF KP+K+AEM+RI++SMEG+PGAP
Sbjct: 154 PSADQLVKFLCKDLSEACVVDPPPVPKDRVPGEPFAAKPSKDAEMDRILKSMEGIPGAPS 213
Query: 208 MQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDGI 267
M+MYS++DLM NFG + D+DD+D+D++ DFPKNLGKVF++K S D KQ++ I
Sbjct: 214 MKMYSRDDLMK-NNFGVD--GDDDDDDEDEDDDFPKNLGKVFKDKGSPKKDLKQQVVKQI 270
Query: 268 LNTGDVLKKQANKVSNHVQRWWSG 291
+TG LK NKVS V++WW G
Sbjct: 271 KDTGKKLKGHVNKVSKVVKKWWQG 294
>gi|218197595|gb|EEC80022.1| hypothetical protein OsI_21695 [Oryza sativa Indica Group]
Length = 306
Score = 287 bits (735), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 162/264 (61%), Positives = 204/264 (77%), Gaps = 4/264 (1%)
Query: 29 KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI-SPKKISEFEIIGIAENVCNLK 87
KK ARKED+PYI CQVCE+ A ++ QV KKQ + + KK+ E EII IAENVCNLK
Sbjct: 34 KKVATAARKEDIPYIRCQVCERTAREISAQVAKKQQALPATKKVPEIEIIEIAENVCNLK 93
Query: 88 KEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK 147
K+EADW+LKIDIVE+GDKL LVEQ+ EG CN+ECKTIERACQE+MGY+DTDVAE++Y K
Sbjct: 94 KQEADWMLKIDIVEKGDKLELVEQDEEGHCNAECKTIERACQEVMGYADTDVAEFVYKKK 153
Query: 148 PDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPG 207
P LV +LCKDLS+AC PPPV K+R+PGEPF KP+K+AEM+RI++SMEG+PGAP
Sbjct: 154 PSADQLVKFLCKDLSEACVVDPPPVPKDRVPGEPFAAKPSKDAEMDRILKSMEGIPGAPS 213
Query: 208 MQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDGI 267
M+MYS++DLM NFG + +DD+D+D++ DFPKNLGKVF++K S D KQ++ I
Sbjct: 214 MKMYSRDDLMK-NNFGVDGD--DDDDDEDEDDDFPKNLGKVFKDKGSPKKDLKQQVVKQI 270
Query: 268 LNTGDVLKKQANKVSNHVQRWWSG 291
+TG LK NKVS V++WW G
Sbjct: 271 KDTGKKLKGHVNKVSKVVKKWWQG 294
>gi|357501043|ref|XP_003620810.1| hypothetical protein MTR_6g090780 [Medicago truncatula]
gi|355495825|gb|AES77028.1| hypothetical protein MTR_6g090780 [Medicago truncatula]
Length = 293
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 166/297 (55%), Positives = 198/297 (66%), Gaps = 57/297 (19%)
Query: 17 LLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEI 76
LS AP KKPV ARKEDVPYI CQVCE LA QLY QVQ K+A+ISPKKISE++I
Sbjct: 46 FLSRTAP-----KKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSKKAEISPKKISEYQI 100
Query: 77 IGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQ-EAEGQCNSECKTIERACQEIMGYS 135
I IAENVCNLKK EADWIL+IDIVE+ D+L L E+ ++EGQCNSECKT+ERACQE+MGYS
Sbjct: 101 IEIAENVCNLKKVEADWILRIDIVEKADRLELEEEHDSEGQCNSECKTVERACQEVMGYS 160
Query: 136 DTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERI 195
DTDVAEYLY+ KPDI SL NYLCKDLSK+C+T K R PGEPFV K + EAEME++
Sbjct: 161 DTDVAEYLYSCKPDIDSLTNYLCKDLSKSCNT------KTRTPGEPFVAKSSNEAEMEKL 214
Query: 196 MRSMEGMPGAPGMQMYSKEDLMNMKNFGG-------EDGDDNDDEDDDDESDFPKNLGKV 248
++SME NFG ++ +D D+++D+DE+DFP
Sbjct: 215 LKSME--------------------NFGAENEDDDEDEDEDEDEDEDEDEADFP------ 248
Query: 249 FREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
SK W ++T LKK A KVSNH+Q+WW G KA SKK K EL
Sbjct: 249 -----SKLELWS-------VDTSTALKKHATKVSNHIQQWWKGKKAISSKKNPKPEL 293
>gi|168003107|ref|XP_001754254.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694356|gb|EDQ80704.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 306
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 139/274 (50%), Positives = 179/274 (65%), Gaps = 5/274 (1%)
Query: 29 KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKK 88
KK VA ARKED+P I C VCE +A QL QV+ K+ + +P+KI+E EII +AEN+CN+KK
Sbjct: 22 KKVVATARKEDIPLIKCSVCEVIAKQLVRQVKDKREKAAPRKITELEIIDVAENICNMKK 81
Query: 89 EEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKP 148
EE+DWILK+DIVE+GDKL LVEQ+ EG CN+ECKTIERAC+E+MG DTD+AEYLY +
Sbjct: 82 EESDWILKLDIVEKGDKLQLVEQQEEGVCNTECKTIERACREVMGDHDTDLAEYLYKAGA 141
Query: 149 DIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGM 208
L LCKDLSKAC K P V K+R+PGE F PK KEAEME++MRS+ +PGAP M
Sbjct: 142 TRAGLTKLLCKDLSKACVGKTPRVPKDRVPGEAFTPKGGKEAEMEKLMRSLNDVPGAPSM 201
Query: 209 QMYSKEDLM-NMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKE-SKSGDWKQRITDG 266
+M+S+E+L N GG G D+ D +D++E + + SK + T
Sbjct: 202 KMFSREELQKNPGLMGGGRGSDDPDAEDEEEDEDEDEGETDAKTNSVSKLSNEVGARTQK 261
Query: 267 ILNTGDVLKKQANKVSNHVQRWWSGIKAARSKKP 300
+ T K NK + WW+G K AR P
Sbjct: 262 AIRTAS---KSVNKAMRTISSWWNGKKPARKAAP 292
>gi|357452911|ref|XP_003596732.1| hypothetical protein MTR_2g084730 [Medicago truncatula]
gi|355485780|gb|AES66983.1| hypothetical protein MTR_2g084730 [Medicago truncatula]
Length = 223
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 139/253 (54%), Positives = 173/253 (68%), Gaps = 36/253 (14%)
Query: 54 QLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQ-E 112
+ Y Q K+ + ISE++II IAENVCNLKK EADWIL+IDIVE+ D+L L E+ +
Sbjct: 6 KFYFQ---KRTSVIVGDISEYQIIEIAENVCNLKKVEADWILRIDIVEKADRLELEEEHD 62
Query: 113 AEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPV 172
+EGQCNSECKT+ERACQE+MGYSDTDVAEYLY+ KPDI SL NYLCKDLSK+C+T
Sbjct: 63 SEGQCNSECKTVERACQEVMGYSDTDVAEYLYSCKPDIDSLTNYLCKDLSKSCNT----- 117
Query: 173 LKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDD 232
K R PGEPFV K + EAEME++++SMEGMPG P M+MYS N ++ +D D+
Sbjct: 118 -KTRTPGEPFVAKSSNEAEMEKLLKSMEGMPGVPDMKMYS--------NDDEDEDEDEDE 168
Query: 233 EDDDDESDFPKNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGI 292
++D+DE+DFP SK W ++T LKK A KVSNH+Q+WW G
Sbjct: 169 DEDEDEADFP-----------SKLELWS-------VDTSTALKKHATKVSNHIQQWWKGK 210
Query: 293 KAARSKKPGKTEL 305
KA SKK K EL
Sbjct: 211 KAISSKKNPKPEL 223
>gi|222634964|gb|EEE65096.1| hypothetical protein OsJ_20142 [Oryza sativa Japonica Group]
Length = 290
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 139/271 (51%), Positives = 180/271 (66%), Gaps = 34/271 (12%)
Query: 29 KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI-SPKKISEFEI-------IGIA 80
KK ARKED+PYI CQVCE++A ++ QV KKQ + + KK+ F I
Sbjct: 34 KKVATAARKEDIPYIRCQVCERIAREISAQVAKKQQALPATKKVLNFRFGSRKSGFTEIP 93
Query: 81 ENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVA 140
EN CNLK +EADW+LKIDIVE+GDKL E+MGY+DTDVA
Sbjct: 94 ENGCNLKNQEADWMLKIDIVEKGDKL-----------------------EVMGYADTDVA 130
Query: 141 EYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
E++Y KP LV +LCKDLS+AC PPPV K+R+PGEPF KP+K+AEM+RI++SME
Sbjct: 131 EFVYKKKPSADQLVKFLCKDLSEACVVDPPPVPKDRVPGEPFAAKPSKDAEMDRILKSME 190
Query: 201 GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWK 260
G+PGAP M+MYS++DLM NFG + D+DD+D+D++ DFPKNLGKVF++K S D K
Sbjct: 191 GIPGAPSMKMYSRDDLMK-NNFGVD--GDDDDDDEDEDDDFPKNLGKVFKDKGSPKKDLK 247
Query: 261 QRITDGILNTGDVLKKQANKVSNHVQRWWSG 291
Q++ I +TG LK NKVS V++WW G
Sbjct: 248 QQVVKQIKDTGKKLKGHVNKVSKVVKKWWQG 278
>gi|302759156|ref|XP_002963001.1| hypothetical protein SELMODRAFT_404499 [Selaginella moellendorffii]
gi|300169862|gb|EFJ36464.1| hypothetical protein SELMODRAFT_404499 [Selaginella moellendorffii]
Length = 312
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 125/289 (43%), Positives = 193/289 (66%), Gaps = 20/289 (6%)
Query: 35 ARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWI 94
ARKE++ Y+ C VCE+++ QL+ +V +K++ KK+SEFEII +AEN+CN+KK E++W+
Sbjct: 26 ARKENIKYLGCAVCEQISKQLFEKVSEKKS--IKKKLSEFEIIELAENICNVKKRESEWM 83
Query: 95 LKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDI--KS 152
+DIV +G+KL LVEQ EG+CN++C+TIER CQE++G DTD+AE+++T D+ ++
Sbjct: 84 FFLDIVREGNKLKLVEQPEEGECNTKCRTIERTCQEVIGDHDTDIAEFIHTHLRDLSEEA 143
Query: 153 LVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYS 212
+ LCK+++K+CS+K P + K GEPF PKP K+A+M R+MRSM MPG GM+MYS
Sbjct: 144 IFKSLCKEVTKSCSSKLPALPKTLDLGEPFTPKPTKDADMARLMRSMGDMPGGQGMKMYS 203
Query: 213 KEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFRE------------KESKSGDWK 260
K+DLMNM N+ D DD+DD+DD DE ++L K+ + K
Sbjct: 204 KDDLMNMNNYPAGDPDDDDDDDDTDEGSKLQSLLKISKTPQKQNLKNPANPKNQPKRSML 263
Query: 261 QRITDGILNTG-DVL---KKQANKVSNHVQRWWSGIKAARSKKPGKTEL 305
++ D + + G D L +K +KV+ +V+RWW K+++ ++ K +L
Sbjct: 264 TQVLDHVQSIGRDTLSSARKTKDKVTKYVKRWWRRAKSSKQRQAQKVDL 312
>gi|302797140|ref|XP_002980331.1| hypothetical protein SELMODRAFT_420060 [Selaginella moellendorffii]
gi|300151947|gb|EFJ18591.1| hypothetical protein SELMODRAFT_420060 [Selaginella moellendorffii]
Length = 197
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 86/175 (49%), Positives = 131/175 (74%), Gaps = 5/175 (2%)
Query: 35 ARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWI 94
ARKE++ Y+ C VCE+++ QL+ +V +K++ KK+SEFEII +AEN+CN+KK E++W+
Sbjct: 26 ARKENIKYLRCAVCEQISKQLFEKVSEKKS--IKKKLSEFEIIELAENICNVKKRESEWM 83
Query: 95 LKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDI--KS 152
+DIV +G+KL LVEQ EG+CN++C+TIER CQE++G DTD+AE+++T D+ ++
Sbjct: 84 FFLDIVREGNKLKLVEQPEEGECNTKCRTIERTCQEVIGDHDTDIAEFIHTHLRDLSEEA 143
Query: 153 LVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPG 207
+ LCK+++K+CS+K P + K GEPF PKP K+A+M R+MRSM G+ P
Sbjct: 144 IFKSLCKEVTKSCSSKLPALPKTLDLGEPFTPKPTKDADMARLMRSM-GVSFRPS 197
>gi|156382000|ref|XP_001632343.1| predicted protein [Nematostella vectensis]
gi|156219397|gb|EDO40280.1| predicted protein [Nematostella vectensis]
Length = 191
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 71/178 (39%), Positives = 109/178 (61%)
Query: 39 DVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKID 98
+VP+I C+ C+K LY + + + + KK+ E +II + E CN KEE +WI K D
Sbjct: 12 EVPHIRCETCQKAVKHLYRRTSEMREKSPRKKVEEDKIIELVEASCNPDKEEGEWISKFD 71
Query: 99 IVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLC 158
IV++GD L L E+E G+C EC+TI +AC+E + DTD+AE ++ +K + L N +C
Sbjct: 72 IVKKGDSLQLSEKEVFGRCKQECRTIAKACEETISEVDTDLAEMIWQNKLSLSKLTNKIC 131
Query: 159 KDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDL 216
+SKAC+ K + ++ E F E ++IM+ M G+PG PGM+MYS+ED+
Sbjct: 132 HSMSKACTKKAQNLKIDKRFDEEFEAMSEDERNADKIMKQMRGVPGMPGMEMYSREDI 189
>gi|384249540|gb|EIE23021.1| hypothetical protein COCSUDRAFT_6458, partial [Coccomyxa
subellipsoidea C-169]
Length = 188
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 73/171 (42%), Positives = 105/171 (61%), Gaps = 1/171 (0%)
Query: 30 KPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP-KKISEFEIIGIAENVCNLKK 88
+P A K D+PYI C VCE + Y QV+ + + P KK+ E I+ + E + + K
Sbjct: 17 EPDALPVKSDLPYIRCGVCEAVGKNAYRQVKSARDALKPGKKLEELVILEMVEKMGDPAK 76
Query: 89 EEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKP 148
+E +WI KID+VE G KL LVE E G+C ECKT++RA QE++ DTD+AE L+ K
Sbjct: 77 DEGEWIAKIDLVEDGKKLKLVEHEELGKCREECKTVQRAAQEVLSDHDTDIAEKLWQGKM 136
Query: 149 DIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSM 199
N+LC D++ AC KPPP+ K+R PG+ F +EA+M+++M M
Sbjct: 137 SRSQFSNWLCYDVTGACKQKPPPLPKDRAPGQAFQVIDPEEAQMQKMMAGM 187
>gi|308799811|ref|XP_003074687.1| unnamed protein product [Ostreococcus tauri]
gi|116000857|emb|CAL50537.1| unnamed protein product [Ostreococcus tauri]
Length = 312
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 86/225 (38%), Positives = 134/225 (59%), Gaps = 11/225 (4%)
Query: 4 IKAAVLVYVAVILLLSTCAPISYC-------AKKPVAGARKEDVPYITCQVCEKLASQLY 56
IKA V VA++ + + A A K + A DV YI C+ CE L +
Sbjct: 3 IKATTKVAVALLAISAVSANGVEAKKGGKKDASKTIPRAVASDVKYIECETCELLVNAAV 62
Query: 57 HQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAE-- 114
+ + + A++ KK+ E ++I E +C++ +E DWI+ D+ E+G KL LV+ AE
Sbjct: 63 KEYKTQYAEVGSKKLRETDVIEKMEKMCDVASKEGDWIITKDLQEKGSKLRLVDMGAENY 122
Query: 115 GQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDI--KSLVNYLCKDLSKACSTKPPPV 172
G+C +ECKTI +AC++I+G DTDV EYL+ S D+ K+ +LC++ + +C+ PPP+
Sbjct: 123 GECRTECKTIVKACEKILGPRDTDVGEYLFVSDGDVDAKAFRKWLCEEETTSCAKAPPPL 182
Query: 173 LKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLM 217
K+R GE F + K+ EMER+M M+GMPG G QM+S++ +M
Sbjct: 183 PKDRPKGEAFKKRDKKDVEMERLMAGMKGMPGMGGAQMFSRDQMM 227
>gi|449671666|ref|XP_002170798.2| PREDICTED: uncharacterized protein LOC100207705 [Hydra
magnipapillata]
Length = 270
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 81/215 (37%), Positives = 130/215 (60%), Gaps = 8/215 (3%)
Query: 38 EDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKI 97
ED+P+I C VC+K A L+ ++ Q S KK+ E +++ I E C+ DWI ++
Sbjct: 33 EDLPFIKCDVCQKAAKVLFKTIEN---QRSEKKLDEDDVLSIVEKSCDTDVLYGDWISRL 89
Query: 98 DIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYL 157
DIVE+ + L +VE EG+C+ ECKTI R+C++I+G DTD+ E L+ ++ + + +N++
Sbjct: 90 DIVEKENDLKVVEHPQEGKCDRECKTISRSCEDIIGDIDTDIGELLWKNEMKLATFINHV 149
Query: 158 CKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLM 217
C L+ +C K V K FV KE + +MR+M+ +PG PGM+MYS++DL
Sbjct: 150 CYKLTNSCKAKKKYV-KGSHKDYKFVEMSEKEKQARDLMRNMKSVPGMPGMEMYSQDDLQ 208
Query: 218 NMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREK 252
NM+ + G N+DE+ D S+ +N +V+R K
Sbjct: 209 NMRE---QMGVRNEDENQQD-SEQSENYDEVYRTK 239
>gi|145341176|ref|XP_001415689.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575912|gb|ABO93981.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 233
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/214 (38%), Positives = 127/214 (59%), Gaps = 4/214 (1%)
Query: 5 KAAVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQA 64
+ AV A I + A KP+A A K DV +I C+VCE LA + + +
Sbjct: 1 RRAVTTIAASIAIAGLVACADAKPSKPIARAVKSDVKFIKCEVCEHLADAVTQEYAALRE 60
Query: 65 QISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVE--QEAEGQCNSECK 122
+ KK++E +I E +C+ +E DWI+ D+ E+G +L +V+ +E G+C ECK
Sbjct: 61 ETPEKKLTETMVIEKIEKMCDPSAKEGDWIITKDLQEKGSRLRVVDMGKENYGECGKECK 120
Query: 123 TIERACQEIMGYSDTDVAEYLYTSKP-DIK-SLVNYLCKDLSKACSTKPPPVLKNRIPGE 180
TI +AC++I+G DTDV E+LYTS D++ +L +LC++ + +C K PP+ +R GE
Sbjct: 121 TIVKACEDILGPRDTDVGEFLYTSNAGDLRGALREWLCEEETNSCGGKIPPLPSDRPKGE 180
Query: 181 PFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKE 214
F + K+ EMER+M M+GMPG G QM+S++
Sbjct: 181 KFKKRDKKDVEMERLMAGMKGMPGMGGAQMFSRD 214
>gi|125554107|gb|EAY99712.1| hypothetical protein OsI_21693 [Oryza sativa Indica Group]
Length = 163
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 75/138 (54%), Positives = 92/138 (66%), Gaps = 24/138 (17%)
Query: 29 KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQI-SPKKISEFEIIGIAENVCNLK 87
KK ARKED+PYI CQVCE++A ++ QV KKQ + + KK+ E EII IAENVCNLK
Sbjct: 34 KKVATAARKEDIPYIRCQVCERIAREISAQVAKKQQALPATKKVPEIEIIEIAENVCNLK 93
Query: 88 KEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK 147
K+EADW+LKIDIVE+GDKL E+MGY+DTDVAE++Y K
Sbjct: 94 KQEADWMLKIDIVEKGDKL-----------------------EVMGYADTDVAEFVYKKK 130
Query: 148 PDIKSLVNYLCKDLSKAC 165
P LV +LCKDLS+AC
Sbjct: 131 PSADQLVKFLCKDLSEAC 148
>gi|159475098|ref|XP_001695660.1| hypothetical protein CHLREDRAFT_205897 [Chlamydomonas reinhardtii]
gi|158275671|gb|EDP01447.1| predicted protein [Chlamydomonas reinhardtii]
Length = 314
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 82/235 (34%), Positives = 120/235 (51%), Gaps = 8/235 (3%)
Query: 1 MARIKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQ 60
MAR+ L + + L +++ KP A K D+ YI CQVCE +A Q +
Sbjct: 1 MARLNLLQLCVLGLCLAVASAQFGGKRPPKPKAPVAKGDIKYIKCQVCEAIAKQSIKATK 60
Query: 61 KKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSE 120
Q PKK+ E +I+ E +C+ ++ +WI K DIVE+ LVL + A G+C SE
Sbjct: 61 DLIEQAGPKKVPEADILERVEKMCDPDTDDGEWITKYDIVEEDSALVLKDTGAVGRCKSE 120
Query: 121 CKTIERACQEIMGYSD-TDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPG 179
C+TI RAC+ I D TD++ L+ K ++ N++C D S ACS K P + K
Sbjct: 121 CRTIARACEMISDDVDLTDLSAMLFKGKKRA-AISNWMCHDASDACSKKAPALTKREGAD 179
Query: 180 EPFVPKPAKEAEMERIMRSME--GMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDD 232
E P E ER+MRSM+ G+ G MY+KE + + +DN+D
Sbjct: 180 EAHAPMDEDELRTERMMRSMKAAGLSGT----MYNKETMQEELAEMADQYEDNED 230
>gi|388507226|gb|AFK41679.1| unknown [Lotus japonicus]
Length = 116
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/98 (64%), Positives = 82/98 (83%), Gaps = 1/98 (1%)
Query: 192 MERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFRE 251
ME++++SMEGMPGAPGM+MYS++DLMNMKNFGG++ D+++DEDDDD++ FP LGKV R+
Sbjct: 1 MEKLLKSMEGMPGAPGMKMYSRDDLMNMKNFGGDE-DEDEDEDDDDQASFPSKLGKVLRD 59
Query: 252 KESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWW 289
KES DWKQ I GI +T LKK AN++SNHVQ+WW
Sbjct: 60 KESGKSDWKQLIKKGITDTSMTLKKHANRISNHVQKWW 97
>gi|255084251|ref|XP_002508700.1| predicted protein [Micromonas sp. RCC299]
gi|226523977|gb|ACO69958.1| predicted protein [Micromonas sp. RCC299]
Length = 343
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 69/184 (37%), Positives = 110/184 (59%), Gaps = 7/184 (3%)
Query: 35 ARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWI 94
A K DV YI CQVC+++A L + + K+ E +++ E VC+ + E +W+
Sbjct: 74 AVKSDVKYIRCQVCQEVAKVLSREASTLR-DTKGAKLKESDVLEKVEKVCDPEAVEGEWL 132
Query: 95 LKIDIVEQGDKLVLV-EQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSL 153
+ D+ E+G +L +V ++ G C SECKT++RAC++I+ DTDVAE L+T P +L
Sbjct: 133 ITHDLQEKGSELKMVLMKDQYGACGSECKTMQRACEKIVADRDTDVAEILFTGSPQRAAL 192
Query: 154 VNYLC----KDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQ 209
V ++C +D AC K PP+ K+R G F P K+ +M+R+M++M+ M G GM
Sbjct: 193 VQHMCHNDDEDGFGACVKKAPPLPKSRPKGPAFDPVDKKDLDMQRMMKNMKDM-GMGGMS 251
Query: 210 MYSK 213
MY++
Sbjct: 252 MYNR 255
>gi|307105084|gb|EFN53335.1| hypothetical protein CHLNCDRAFT_137061 [Chlorella variabilis]
Length = 293
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 33/199 (16%)
Query: 31 PVAGARKEDVPYITCQ--VCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKK 88
P R D+PYI CQ VCE L + QV E +II E + +
Sbjct: 29 PKEKPRLADIPYIQCQARVCELLTKNAWRQV------------DEMKIIERMEKITTAWR 76
Query: 89 EEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYT--- 145
E +WI +D+VE+GDKLV+ G+C ECKTIE A + IMG DTD+AE L+T
Sbjct: 77 AEGEWIASLDLVEKGDKLVV------GKCGVECKTIEAAAERIMGEHDTDIAEMLFTVGL 130
Query: 146 ----SKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSME- 200
K N+LC + ++ C KPPP+ K R PG PF P + +ER+M ME
Sbjct: 131 VLGLGKKTRAQFNNWLCYEETQVCKYKPPPLPKERPPGPPFEPMAQGDQNLERMMGEMED 190
Query: 201 -GMPGAPGMQMYSKEDLMN 218
G+ G+ +YS++DL++
Sbjct: 191 KGLKGS----LYSRDDLLS 205
>gi|412992360|emb|CCO20073.1| predicted protein [Bathycoccus prasinos]
Length = 356
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 107/183 (58%), Gaps = 5/183 (2%)
Query: 35 ARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWI 94
A K D I C+VC+K+ S++ + + + +K++E ++I E +C+ ++ W
Sbjct: 53 AVKSDSKLIHCEVCQKV-SKIAFETGVRLREEKGEKLTEVDVIEEMELMCDESDDKTMWA 111
Query: 95 LKIDI-VEQGDKLVL--VEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSK-PDI 150
++ D+ V +KLVL + ++A C + CKT+ +AC+++M DTD+AE L+ ++
Sbjct: 112 VQNDLKVNFRNKLVLKFMGEDAYSSCETTCKTMLKACEDVMEGRDTDLAEILFKNRIKTA 171
Query: 151 KSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQM 210
K + +LC + +++CS K V K R P F K K+ + ER+M ++ MPG PGMQM
Sbjct: 172 KGIEEWLCTEETESCSKKAKSVSKTRKPLPEFKKKEKKDIDTERMMGKLKNMPGMPGMQM 231
Query: 211 YSK 213
Y++
Sbjct: 232 YNR 234
>gi|303287402|ref|XP_003062990.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455626|gb|EEH52929.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 336
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 92/166 (55%), Gaps = 8/166 (4%)
Query: 37 KEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILK 96
K DV Y+ C VC+++A L H+V K + KK++E +++ E VC+ EE +W++
Sbjct: 51 KSDVKYVKCAVCQEVAKTL-HRVAAKVREDKGKKLTEADVLEKVEKVCDPDAEEGEWLIT 109
Query: 97 IDIVEQGDKLVLVEQEAE-GQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDI--KSL 153
D+VE+G +L + + G+C +ECKT++ AC+ I+ DTDVAE L+T + +
Sbjct: 110 HDLVERGKRLDMKHMGGKFGECGAECKTMQMACERIVSDRDTDVAEMLFTDGASLARAGI 169
Query: 154 VNYLCKDLSK----ACSTKPPPVLKNRIPGEPFVPKPAKEAEMERI 195
LC D ++ AC P + K R G F K K+ EM R+
Sbjct: 170 QAKLCNDANEKAYSACRKPAPALPKTRPKGPAFKEKDKKDVEMARL 215
>gi|156343882|ref|XP_001621149.1| hypothetical protein NEMVEDRAFT_v1g222317 [Nematostella vectensis]
gi|156206818|gb|EDO29049.1| predicted protein [Nematostella vectensis]
Length = 376
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/133 (38%), Positives = 77/133 (57%)
Query: 71 ISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQE 130
+ E +II + E CN KEE +WI K DIV++GD L L E+E G+C EC+TI +AC+E
Sbjct: 1 VEEDKIIELVEASCNPDKEEGEWISKFDIVKKGDSLQLSEKEVFGRCKQECRTIAKACEE 60
Query: 131 IMGYSDTDVAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEA 190
+ DTD+AE ++ +K + L N +C +SKAC+ K + ++ E F E
Sbjct: 61 TISEVDTDLAEMIWQNKLSLSKLTNKICHSMSKACTKKAQNLKIDKRFDEEFEAMSEDER 120
Query: 191 EMERIMRSMEGMP 203
++IM+ M P
Sbjct: 121 NADKIMKQMSYGP 133
>gi|307105085|gb|EFN53336.1| hypothetical protein CHLNCDRAFT_137062 [Chlorella variabilis]
Length = 167
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 82/141 (58%), Gaps = 4/141 (2%)
Query: 94 ILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSL 153
+ K+D+VE+GDKLV+ E G C SECKT++ A + IMG DTD+AE L+T K +
Sbjct: 1 MAKLDLVEEGDKLVVKEMGQVGNCGSECKTVQLAAERIMGEHDTDIAEILFTGKMNRAQF 60
Query: 154 VNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSK 213
+LC D S+AC PP+ R PG PF P+ + ++ ++ +E A ++ +
Sbjct: 61 NRWLCWDESEACKKDAPPLPNGRKPGPPFDPRQEGDQSIDAMLGGLEDKGLANNKKL--R 118
Query: 214 EDLMNMKNFGGEDGDDNDDED 234
E+L M+ G ++ D DD+D
Sbjct: 119 EEL--MEKLGVKEADRVDDDD 137
>gi|302836977|ref|XP_002950048.1| hypothetical protein VOLCADRAFT_104574 [Volvox carteri f.
nagariensis]
gi|300264521|gb|EFJ48716.1| hypothetical protein VOLCADRAFT_104574 [Volvox carteri f.
nagariensis]
Length = 154
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/151 (39%), Positives = 81/151 (53%), Gaps = 8/151 (5%)
Query: 83 VCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSD-TDVAE 141
+CN + DWI + DIVEQG L L + + G+C SEC+TI RAC+ I D TD++
Sbjct: 1 MCNPDQNAGDWITQYDIVEQGTSLELKDTDMIGRCKSECRTIARACELITEDIDLTDLSA 60
Query: 142 YLYTSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEG 201
LY K ++ N+LC D + ACS KPPP + E P E R+MR ME
Sbjct: 61 MLYKGKK-RAAITNWLCYDATDACSRKPPPFSAGQRVDEVHEPLDEDEVRNTRMMRDMEA 119
Query: 202 MPGAPGMQMYS----KEDLMNMKNFGGEDGD 228
M G G +Y+ E+L M++ G+D D
Sbjct: 120 M-GLSG-SLYNTDTLSEELEEMQDVYGDDPD 148
>gi|357501197|ref|XP_003620887.1| hypothetical protein MTR_6g091870 [Medicago truncatula]
gi|355495902|gb|AES77105.1| hypothetical protein MTR_6g091870 [Medicago truncatula]
Length = 125
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/123 (43%), Positives = 75/123 (60%), Gaps = 27/123 (21%)
Query: 174 KNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDE 233
+ R PGEPFV K + EAEME++++SME NFG E+ DD++DE
Sbjct: 17 RTRTPGEPFVAKSSNEAEMEKLLKSME--------------------NFGAENEDDDEDE 56
Query: 234 DDDDE-------SDFPKNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQ 286
D+D++ +DFP LGKV R KE++ GDWKQ+I GI++T LKK A KVSNH+
Sbjct: 57 DEDEDEDEDEDEADFPSKLGKVLRSKENEKGDWKQKIRKGIVDTSTALKKHATKVSNHIH 116
Query: 287 RWW 289
++
Sbjct: 117 LFF 119
>gi|299472707|emb|CBN80275.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 246
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 84/176 (47%), Gaps = 8/176 (4%)
Query: 30 KPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKE 89
KP + K D+ I C VC+ LA LY +V + + K+ E +I + E C+ +
Sbjct: 25 KPKSDPVKGDLAAIKCGVCQGLAEHLYEKVAEMREAAPFGKVHEDDISTVIEGACDPATK 84
Query: 90 EADWILKIDIVEQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEY---LYTS 146
E WI + DIVE+G L L +Q++ C EC+TI ++C ++ + DV E L+ +
Sbjct: 85 EGRWITEYDIVEEGGYLELKKQDSFSHCKEECETIAKSCT-VLVDDELDVEEVVVALWKN 143
Query: 147 KPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAE----MERIMRS 198
K + L C + S CS K F PK AK+ E MER+ S
Sbjct: 144 KLGLSELQKAACGEWSDRCSGKKRVKAPKDRQDYEFEPKTAKDFEVAEAMERLQNS 199
>gi|357501193|ref|XP_003620885.1| hypothetical protein MTR_6g091840 [Medicago truncatula]
gi|355495900|gb|AES77103.1| hypothetical protein MTR_6g091840 [Medicago truncatula]
Length = 117
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/133 (42%), Positives = 72/133 (54%), Gaps = 33/133 (24%)
Query: 174 KNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDE 233
+ R PGEPFV K + EAEME++++SME NFG E+ DDN+DE
Sbjct: 17 RTRTPGEPFVAKSSNEAEMEKLVKSME--------------------NFGAENEDDNEDE 56
Query: 234 DDDDESD-FPKNLGKVFREKESKSGDWKQRITDGILNTGDVLKKQANKVSNHVQRWWSGI 292
D+D++ FP LGKV R KE++ GDWKQ+I GI+ G H WW G
Sbjct: 57 DEDEDEADFPSKLGKVLRSKENEKGDWKQKIRKGIVECG------------HKHGWWKGK 104
Query: 293 KAARSKKPGKTEL 305
KA SKK K EL
Sbjct: 105 KAISSKKNPKPEL 117
>gi|357499651|ref|XP_003620114.1| hypothetical protein MTR_6g077350 [Medicago truncatula]
gi|355495129|gb|AES76332.1| hypothetical protein MTR_6g077350 [Medicago truncatula]
Length = 158
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/69 (59%), Positives = 46/69 (66%), Gaps = 6/69 (8%)
Query: 4 IKAAVLVYVAVIL-LLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK 62
IK + Y + L LS AP KKPV ARKEDVPYI CQVCE LA QLY QVQ K
Sbjct: 84 IKERIFFYWRLHLGFLSRTAP-----KKPVGIARKEDVPYIKCQVCEILAKQLYQQVQSK 138
Query: 63 QAQISPKKI 71
+A+ISPKK+
Sbjct: 139 KAEISPKKV 147
>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
Length = 424
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/45 (62%), Positives = 36/45 (80%)
Query: 109 VEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSL 153
V + EG CNSECKT+E ACQ+++GYSDTDVAEY++ SK + SL
Sbjct: 375 VLEYEEGHCNSECKTVETACQKVIGYSDTDVAEYIHISKSYLTSL 419
>gi|407410087|gb|EKF32656.1| hypothetical protein MOQ_003488 [Trypanosoma cruzi marinkellei]
Length = 323
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 102/230 (44%), Gaps = 26/230 (11%)
Query: 10 VYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK-QAQISP 68
V+VA +LL S A + K P A K+ V I+C VC + + Y VQ A +
Sbjct: 19 VFVAAMLLFSE-AGFAKKEKPPHAPFPKDVVDAISCGVCNFMVRKAYSDVQLLFNASVET 77
Query: 69 K-KISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDK----LVLVEQEAEGQCNSECKT 123
+ +++E +++ E+VCN E WI +I I + D L + +C C T
Sbjct: 78 RVRVNEDDVLTAIEDVCNPFSEVGHWIRQITITHESDTAPFLLGVTRSSVYAKCKGICST 137
Query: 124 IERACQEIMGYSDTD--------VAEYLYTSKPDIKSLVNYLCKDLS-----KACSTKPP 170
+ AC+ +M + + D + EY D ++ N LC+ S + S
Sbjct: 138 VVEACEAVMDHENMDQLSPRLLRLHEY-----ADAEAFANALCQSSSICTERQGLSASRY 192
Query: 171 PVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMNMK 220
++N I + KE E+ER+M ME ++S++++ +M+
Sbjct: 193 EEVQNMIDADTVEEIDPKEMEIERMMDHMERKENR-RQTIFSRDEITSMQ 241
>gi|323453478|gb|EGB09349.1| hypothetical protein AURANDRAFT_62913 [Aureococcus anophagefferens]
Length = 1856
Score = 63.9 bits (154), Expect = 8e-08, Method: Composition-based stats.
Identities = 63/233 (27%), Positives = 100/233 (42%), Gaps = 51/233 (21%)
Query: 29 KKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP------------KKISEFEI 76
K+P+A K D+P I C VCE +A++L+ + KQ + +P +K+S F
Sbjct: 158 KEPLA--VKGDLPCIRCDVCEIVATELWREA-TKQREAAPLTAASGKPGEKKRKVSTFSE 214
Query: 77 IGIAE---NVCNLKKEEADWILKIDIVE-QG-------------------DKLVLVEQEA 113
+ E VCN +KE +W+ +D+VE QG + + V +
Sbjct: 215 ADVNELVVGVCNRRKEMGEWLWYVDLVESQGPPGEGSAWRGLTKKEKASNENFLQVYRAQ 274
Query: 114 EG----QCNSECKTIERACQEIMGYSDTDVAEY---LYTSKPDIKSLVNYLCKDLSKACS 166
G + + E T +R+C + DV E L+ D K+ CK+LS C
Sbjct: 275 RGGPIRKWDRESATAKRSCDLLFDDDIGDVEELVVPLWRGDVDEKAFKKLACKELSGRCG 334
Query: 167 TKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSM--EGMPGAPGMQMYSKEDLM 217
PV + + + ER+M +M +GMP M M S+ED+M
Sbjct: 335 KDRKPVNAQGREDFEYDAQDKSLLDTERMMENMAAQGMP----MVMQSREDMM 383
>gi|428168650|gb|EKX37592.1| hypothetical protein GUITHDRAFT_116234 [Guillardia theta CCMP2712]
Length = 318
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 74/147 (50%), Gaps = 7/147 (4%)
Query: 42 YITCQVCEKLASQLY-HQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIV 100
YI C C+ +A LY H + + + +K+ E I+ + E +C+ + + +W+ D+V
Sbjct: 44 YIKCDGCKIMADNLYRHAMDLLKEKGGREKMGEEPILELLEQICDQGEAKGEWLNAYDVV 103
Query: 101 EQGDKLVLVEQEAEGQCNSECKTIERACQEIMGYS---DTDVAEYLYTSK--PDIKSLVN 155
+ K+ L +Q+ C EC + C+++ S ++D+AE +Y ++
Sbjct: 104 GKKGKISLQKQDGFQSCGPECHALVSVCEQVKVRSEAGESDIAEKIYRGHHIASAEAFSE 163
Query: 156 YLCKDLSKACSTKPPPVLKNRIPGEPF 182
+CK ++ +C +P P LK+ EPF
Sbjct: 164 AICKKMTTSCK-EPLPKLKDPRIDEPF 189
>gi|71408335|ref|XP_806579.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870363|gb|EAN84728.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 330
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 92/210 (43%), Gaps = 25/210 (11%)
Query: 10 VYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK-QAQISP 68
++ AV LL S A ++ K P A K+ V I+C VC + + Y VQ A +
Sbjct: 26 IFFAVTLLFSE-AGVAKKEKPPHAPLPKDVVDAISCGVCNFMVGKAYGDVQLLFNASVQT 84
Query: 69 K-KISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDK----LVLVEQEAEGQCNSECKT 123
+ +++E +++ E+VCN + WI +I I + D L + A +C C T
Sbjct: 85 RVRVNEDDVLTAIEDVCNPFSDVGQWIRQITITHESDTAPFLLGVSRLPAYAKCKRVCST 144
Query: 124 IERACQEIMGYSDTD--------VAEYLYTSKPDIKSLVNYLCKDLS-----KACSTKPP 170
+ AC+ +M + + D + EY ++ N LC+ S + S
Sbjct: 145 VVEACEAVMDHKNMDQLSPRLLRLNEY-----AGAEAFANALCQSSSICTERQGLSAARY 199
Query: 171 PVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
++N I + KE E+ER+M ME
Sbjct: 200 EEVRNMIDEDTVEEIDPKEMEIERMMDHME 229
>gi|71405983|ref|XP_805566.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70869024|gb|EAN83715.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 330
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 91/210 (43%), Gaps = 25/210 (11%)
Query: 10 VYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK-QAQISP 68
++ AV LL S A ++ K P A K+ V I+C VC + + Y VQ A +
Sbjct: 26 IFFAVTLLFSA-AGVAKKEKPPHAPFPKDVVNAISCGVCNFMVGKAYGDVQLLVNASVQT 84
Query: 69 K-KISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDK----LVLVEQEAEGQCNSECKT 123
+ +++E +++ E++CN + WI +I I + D L + A +C C T
Sbjct: 85 RVRVNEDDVLTAIEDLCNPFSDVGQWIRQITITHESDTAPFLLGVSRLPAYAKCKRVCST 144
Query: 124 IERACQEIMGYSDTD--------VAEYLYTSKPDIKSLVNYLCKDLS-----KACSTKPP 170
+ AC+ +M + + D + EY ++ N LC S + S
Sbjct: 145 VVEACEAVMDHENMDQLSPRLLRLNEY-----AGAEAFANALCLPSSICTERQGLSAARY 199
Query: 171 PVLKNRIPGEPFVPKPAKEAEMERIMRSME 200
++N I + KE E+ER+M ME
Sbjct: 200 EEVRNMIDEDTVEEIDPKEMEIERMMDHME 229
>gi|387198247|gb|AFJ68839.1| hypothetical protein NGATSA_3049300 [Nannochloropsis gaditana
CCMP526]
Length = 281
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 81/184 (44%), Gaps = 14/184 (7%)
Query: 35 ARKEDVPYITCQVCEKLASQLYHQVQK---KQAQISPKKISEFEIIGIAENVCNLKKEEA 91
A+K D+PYI C VC QL + + ++ + K + ++ + VC+ E
Sbjct: 73 AQKTDLPYIACDVCMLTMEQLSNVTRNLALREENLHKGKPKKSLVLKATKKVCDYASYEG 132
Query: 92 DWILKIDIVEQ----GDKLVLVEQEAEGQCNSECKTIERACQEIM---GYSDTDVAEYLY 144
WI DI E+ G L LV Q G E TI ++C ++ G +VA+ L
Sbjct: 133 RWITYYDIQEKDRVGGTLLKLVAQPESGMPEVEADTIAKSCTKLFKSEGIDRDEVADDLL 192
Query: 145 TSKPDIKSLVNYLCKDLSKACSTKPPPVLKNRIPGE-PFVPKPAKEAEMERIMRSM--EG 201
+ + +L C D+S CS +++ + + F PK + +RIM + G
Sbjct: 193 EAL-SVSALQKKYCTDISSRCSKTAQSLMEKGLRRDYAFKPKTKEAKSYDRIMDAFAEAG 251
Query: 202 MPGA 205
+ GA
Sbjct: 252 VLGA 255
>gi|388496004|gb|AFK36068.1| unknown [Lotus japonicus]
Length = 62
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 20/23 (86%), Positives = 23/23 (100%)
Query: 202 MPGAPGMQMYSKEDLMNMKNFGG 224
MPGAPGM+MYS++DLMNMKNFGG
Sbjct: 1 MPGAPGMKMYSRDDLMNMKNFGG 23
>gi|407849528|gb|EKG04244.1| hypothetical protein TCSYLVIO_004702 [Trypanosoma cruzi]
Length = 330
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 29/212 (13%)
Query: 10 VYVAVILLLSTCAPISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKK-QAQISP 68
++ AV LL S A ++ K A K+ V I+C VC + + Y VQ A +
Sbjct: 26 IFFAVTLLFSE-AGVAKKEKPSHAPFPKDVVDAISCGVCNFMVGKAYGDVQLLFNASVQT 84
Query: 69 K-KISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDK----LVLVEQEAEGQCNSECKT 123
+ +++E +++ E++CN + WI +I I + D L + A +C C T
Sbjct: 85 RVRVNEDDVLTAIEDLCNPFSDVGQWIRQITITHESDTAPFLLGVSRLPAYAKCKRVCST 144
Query: 124 IERACQEIMGYSDTD--------VAEYLYTSKPDIKSLVNYLCKDLSKACSTKPPPV--- 172
+ AC+ +M + D + EY ++ N LC LS + T+ +
Sbjct: 145 VVEACEAVMDHESMDQLSPRLLRLNEY-----AGAEAFANALC--LSSSICTERQGLSAA 197
Query: 173 ----LKNRIPGEPFVPKPAKEAEMERIMRSME 200
++N I + KE E+ER+M ME
Sbjct: 198 KYEEVRNMIDEDTVEEIDPKEMEIERMMDHME 229
>gi|340057508|emb|CCC51854.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 323
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 104/249 (41%), Gaps = 45/249 (18%)
Query: 4 IKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKE---DVPY-------ITCQVCEKLAS 53
+KA LV +A + +L+ P A ARKE ++P+ I C VC +
Sbjct: 23 VKAVSLVVLAALTVLTL---------PPTAVARKEKPLNIPFPQDVIDNIRCDVCNIMTR 73
Query: 54 QLYHQVQKKQAQISPKKI--SEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQ 111
+ Y V ++ +E +++ E+VCN E WI I + L
Sbjct: 74 KAYDLVHALFVLSDSTRVPVNEEDVMVAIEDVCNPFAESGQWIRLFSIGKTPSALTAAPA 133
Query: 112 EA---------EGQCNSECKTIERACQEIMGYSDTDVA--EYLYTSK-PDIKSLVNYLCK 159
+C CKTI AC+ ++ + DV + L K + + +C
Sbjct: 134 PLHLTLLPLPNHTKCKRTCKTIWEACEAVIDHDSMDVLSPQLLRLRKYANADAFARSVC- 192
Query: 160 DLSKACSTK---PP---PVLKNRIPGEPFVPKPAKEAEM--ERIMRSMEGMPGAPGMQMY 211
D++ C+ + PP VL I + VP+ ++ EM ER+M ME ++Y
Sbjct: 193 DVASFCTERGAIPPQRYEVLGGMIAED--VPEEIEQKEMDIERMMDFMER-KNNQRHEIY 249
Query: 212 SKEDLMNMK 220
S+E++ +M+
Sbjct: 250 SREEVTSMQ 258
>gi|261333030|emb|CBH16025.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 300
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 102/247 (41%), Gaps = 42/247 (17%)
Query: 2 ARIKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKE---DVPY-------ITCQVCEKL 51
A +K LV VA++ + P A+KE +VP+ + C VC +
Sbjct: 4 APVKGVFLVLVALVPMF------------PAVDAKKEKPMNVPFPQDAIDDVRCDVCRLM 51
Query: 52 ASQLYHQVQKKQAQISPKKI--SEFEIIGIAENVCNLKKEEADWILKIDI-------VEQ 102
A + Y V + +I +E ++ E++CN WI ++ + ++
Sbjct: 52 AKRAYGYVYELFVTSEHTRIPVNEENVLVAIEDICNPLAASGQWIRQVAVDTSSPAAAKE 111
Query: 103 GDKLVLVEQEAEGQCNSECKTIERACQEIMGYSDTD--VAEYLYTSK-PDIKSLVNYLCK 159
+ L E +C CKT+ AC+ ++ D D ++ L K D +L + C
Sbjct: 112 PFHITLTELSHHSKCKRTCKTLSDACEAVVDDDDMDQFSSKLLILKKYKDASALADATC- 170
Query: 160 DLSKAC------STKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSK 213
+S C S+K L + + KE ++ER M ME G ++YS+
Sbjct: 171 GVSAFCTSRTGFSSKRYKELLTLVASDVVEVIEQKEMDIERFMDEME-RKGNRRQEIYSR 229
Query: 214 EDLMNMK 220
E+++ M+
Sbjct: 230 EEVIKMQ 236
>gi|71748168|ref|XP_823139.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832807|gb|EAN78311.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 300
Score = 44.3 bits (103), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 100/247 (40%), Gaps = 42/247 (17%)
Query: 2 ARIKAAVLVYVAVILLLSTCAPISYCAKKPVAGARKE---DVPY-------ITCQVCEKL 51
A +K LV VA++ + P A+KE +VP+ + C VC +
Sbjct: 4 APVKGVFLVLVALVPMF------------PAVDAKKEKPMNVPFPQDAIDDVRCDVCRLM 51
Query: 52 ASQLYHQVQKKQAQISPKKI--SEFEIIGIAENVCNLKKEEADWILKIDIVEQGD----- 104
A + Y V + +I +E ++ E++CN WI ++ +
Sbjct: 52 AKRAYGYVYELFVTSEHTRIPVNEENVLVAIEDICNPLAASGQWIRQVAVDTSSPAAAKG 111
Query: 105 --KLVLVEQEAEGQCNSECKTIERACQEIMGYSDTD--VAEYLYTSK-PDIKSLVNYLCK 159
+ L E +C CKT+ AC+ ++ D D ++ L K D +L + C
Sbjct: 112 PFHITLTELPHHSKCKRTCKTLSDACEAVVDDDDMDQFSSKLLILKKYKDASALADATC- 170
Query: 160 DLSKAC------STKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSK 213
+S C S+K L + + KE ++ER M ME G ++YS+
Sbjct: 171 GVSAFCTSRTGFSSKRYKELLTLVASDVVEVIEQKEMDIERFMDEME-RKGNRRQEIYSR 229
Query: 214 EDLMNMK 220
E+++ M+
Sbjct: 230 EEVIKMQ 236
>gi|342184535|emb|CCC94017.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 307
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 60/241 (24%), Positives = 97/241 (40%), Gaps = 45/241 (18%)
Query: 16 LLLSTCAPISYCAKKPVAGARKE---DVPY-------ITCQVCEKLASQLYHQVQKKQAQ 65
L L A ++ + A+KE +VPY + C VC A Q Y +V +
Sbjct: 3 LFLVKYAAVTLACMALLTSAKKEKPMNVPYPQDVVDDMRCDVCHTAAKQAYARVL--ELH 60
Query: 66 ISPKK----ISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQ----- 116
++ K+ ++E +++ E++CN E WI K + + L A Q
Sbjct: 61 VASKRTRVPVNEEDVLVAIEDLCNPLASEGQWIRKFGV----NTSYLTSAGAPAQPLFIS 116
Query: 117 ---------CNSECKTIERACQEIMGYSDTD-VAEYL--YTSKPDIKSL------VNYLC 158
C CKT AC+ +M + D + YL T+ D ++ V C
Sbjct: 117 VYTSRNYLKCKRTCKTASDACELVMDDEEMDQFSSYLLRLTTFKDAEAFAEKTCGVASFC 176
Query: 159 KDLSKACSTKPPPVLKNRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGMQMYSKEDLMN 218
K S + + VL+ I E KE +MER+M ME +YS+E+++
Sbjct: 177 KKRSGFSAKRYKEVLEV-IALETPEEIDQKEMDMERLMDQMERRNNQR-QDIYSREEIIK 234
Query: 219 M 219
M
Sbjct: 235 M 235
>gi|195487538|ref|XP_002091949.1| GE13920 [Drosophila yakuba]
gi|194178050|gb|EDW91661.1| GE13920 [Drosophila yakuba]
Length = 416
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 47/100 (47%), Gaps = 14/100 (14%)
Query: 205 APGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFREK-ESKSGDWKQR- 262
AP + YS + + E DDN+ E+D ES F +NL ++ E+ +S + +R
Sbjct: 294 APPVASYSSS---YKRTYANELSDDNEAEEDQYESTFARNLARLRAERIGDRSSPYSRRT 350
Query: 263 ITDGILNTGDV---------LKKQANKVSNHVQRWWSGIK 293
+ G + +G + L+ N VS RWW+ ++
Sbjct: 351 LASGNVGSGSLGYEPLARRSLRPDDNSVSVAFNRWWNSLE 390
>gi|392959126|ref|ZP_10324611.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
DSM 17108]
gi|421053010|ref|ZP_15515993.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
B4]
gi|421062083|ref|ZP_15524295.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
B3]
gi|421068445|ref|ZP_15529750.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
A12]
gi|421070001|ref|ZP_15531139.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
A11]
gi|392442484|gb|EIW20067.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
B4]
gi|392442614|gb|EIW20193.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
A12]
gi|392444763|gb|EIW22165.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
B3]
gi|392448906|gb|EIW26078.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
A11]
gi|392456776|gb|EIW33513.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
DSM 17108]
Length = 346
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 13/84 (15%)
Query: 63 QAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECK 122
QA ++P +I+ E+ GI EN+ E D+ + GDK+++V Q G+C+ +
Sbjct: 49 QANVTPPRITGHELTGIIENI---GSEVEDY-------QVGDKVLVVPQMTCGKCHYCLQ 98
Query: 123 TIERACQEIM--GYS-DTDVAEYL 143
C+E++ GYS D AEY+
Sbjct: 99 GKSNLCEEVLLFGYSLDGGFAEYI 122
>gi|421078291|ref|ZP_15539248.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
JBW45]
gi|392523608|gb|EIW46777.1| Alcohol dehydrogenase GroES domain protein [Pelosinus fermentans
JBW45]
Length = 346
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 13/84 (15%)
Query: 63 QAQISPKKISEFEIIGIAENVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSECK 122
QA ++P +I+ E+ GI EN+ E D+ + GDK+++V Q G+C+ +
Sbjct: 49 QANVTPPRITGHELTGIIENI---GSEVEDY-------QVGDKVLVVPQMTCGKCHYCLQ 98
Query: 123 TIERACQEIM--GYS-DTDVAEYL 143
C+E++ GYS D AEY+
Sbjct: 99 GKSNLCEEVLLFGYSLDGGFAEYI 122
>gi|413942884|gb|AFW75533.1| hypothetical protein ZEAMMB73_462245 [Zea mays]
Length = 76
Score = 40.0 bits (92), Expect = 1.2, Method: Composition-based stats.
Identities = 29/64 (45%), Positives = 45/64 (70%), Gaps = 4/64 (6%)
Query: 192 MERIMRSMEGMPGAPGMQMYSKEDLMNMKNFGGEDGDDNDDEDDDDESDFPKNLGKVFRE 251
ME+++RSME M G GM YS++DLM KN G + DD+++ D D++ +F KN+GK ++
Sbjct: 1 MEKLLRSMEVMAGYAGM--YSRDDLM--KNNFGTEDDDDEEGDGDEDDNFAKNMGKALKD 56
Query: 252 KESK 255
K S+
Sbjct: 57 KGSQ 60
>gi|423720329|ref|ZP_17694511.1| hypothetical protein, DUF2515 family [Geobacillus thermoglucosidans
TNO-09.020]
gi|383366714|gb|EID44000.1| hypothetical protein, DUF2515 family [Geobacillus thermoglucosidans
TNO-09.020]
Length = 383
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 50/110 (45%), Gaps = 15/110 (13%)
Query: 24 ISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP--KKISEFEIIGIAE 81
+ + P G RK+ P++ VCE Q Y + + K QI P +I +++ +
Sbjct: 267 VRWAFLHPHTGTRKDYWPHLFNDVCESAPRQPYRR-RIKDCQIQPGVPRIYSPQLLYAWK 325
Query: 82 NVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQCNSE-----CKTIER 126
NV + + E DW ++VE E+ GQ N E C+T+E+
Sbjct: 326 NVVHEEAERGDWFEDWNVVE-------YLAESNGQINGEITNAYCETLEK 368
>gi|341881244|gb|EGT37179.1| hypothetical protein CAEBREN_31119 [Caenorhabditis brenneri]
Length = 380
Score = 38.9 bits (89), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 13/107 (12%)
Query: 46 QVCEKL------ASQLYHQVQKKQAQISPKKISEFEIIGIAENVCNLKKEEADWIL-KID 98
QVCE++ A++L H+ + S ++ F I+G AEN+ ++K+ ++I+ K+
Sbjct: 237 QVCEQMCRLNRNAAKLLHKYEAH----SSTDVTGFGILGHAENLARVQKQPVEFIIEKLP 292
Query: 99 IVEQGDKLV--LVEQEAEGQCNSECKTIERACQEIMGYSDTDVAEYL 143
I+E D++V ++ EG E ++ E + ++ S+ + EY+
Sbjct: 293 IIEYMDEIVDKMIATGGEGFRLFEGRSAETSGGLLIAMSEKNAKEYI 339
>gi|336235792|ref|YP_004588408.1| hypothetical protein Geoth_2406 [Geobacillus thermoglucosidasius
C56-YS93]
gi|335362647|gb|AEH48327.1| Protein of unknown function DUF2515 [Geobacillus
thermoglucosidasius C56-YS93]
Length = 383
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 7/106 (6%)
Query: 24 ISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP--KKISEFEIIGIAE 81
+ + P G RK+ P++ VCE Q Y + + K QI P +I +++ +
Sbjct: 267 VRWAFLHPHTGTRKDYWPHLFNDVCESAPRQPYRR-RIKDCQIQPGVPRIYSPQLLYAWK 325
Query: 82 NVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQ-CNSECKTIER 126
NV + + E DW ++VE L ++ G+ N+ C+T+E+
Sbjct: 326 NVVHEEAERGDWFEDWNVVEY---LTESNEQINGEITNAYCETLEK 368
>gi|393239155|gb|EJD46688.1| hypothetical protein AURDEDRAFT_164143 [Auricularia delicata
TFB-10046 SS5]
Length = 1365
Score = 37.7 bits (86), Expect = 6.4, Method: Composition-based stats.
Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 5/75 (6%)
Query: 223 GGEDGDDNDDEDDDDESDFPKNLGKVFREKESKSGDWKQRITDG---ILNTGDVLKKQAN 279
G ED D NDD + D+ DF G E E + G+W + DG ++ G + K+Q
Sbjct: 380 GVEDDDTNDDALNPDKPDFAD--GAYIDEDEDEDGNWVTMLPDGKPYVIKKGPLTKEQRK 437
Query: 280 KVSNHVQRWWSGIKA 294
KV+ V+++ + A
Sbjct: 438 KVAAVVKKFEDSLHA 452
>gi|312111344|ref|YP_003989660.1| hypothetical protein GY4MC1_2322 [Geobacillus sp. Y4.1MC1]
gi|311216445|gb|ADP75049.1| Protein of unknown function DUF2515 [Geobacillus sp. Y4.1MC1]
Length = 383
Score = 37.7 bits (86), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 7/106 (6%)
Query: 24 ISYCAKKPVAGARKEDVPYITCQVCEKLASQLYHQVQKKQAQISP--KKISEFEIIGIAE 81
+ + P G RK+ P++ VCE Q Y + + K QI P +I +++ +
Sbjct: 267 VRWAFLHPHTGTRKDYWPHLFNDVCESAPRQPYRR-RIKDCQIQPGVPRIYSPQLLYAWK 325
Query: 82 NVCNLKKEEADWILKIDIVEQGDKLVLVEQEAEGQ-CNSECKTIER 126
NV + + E DW ++VE L ++ G+ N+ C+T+E+
Sbjct: 326 NVVHEEAERGDWFEDWNVVEY---LAESNEQINGEITNAYCETLEK 368
>gi|261863831|gb|ACY01390.1| AT-less polyketide synthase [Streptomyces platensis subsp. rosaceus]
Length = 3192
Score = 37.4 bits (85), Expect = 7.7, Method: Composition-based stats.
Identities = 36/139 (25%), Positives = 60/139 (43%), Gaps = 13/139 (9%)
Query: 120 ECKTIERACQEIMGYSDTDVAEYLYTSKPDIKSLVNYLCK----DLSKACSTK-PPPVLK 174
E + + E+ G TD+ ++ DI+SL YL +LS S P PV
Sbjct: 2470 ESIALAKLASELNGVLGTDLTPAVFFECSDIRSLAEYLLDRYGPELSLPTSADAPAPVAA 2529
Query: 175 NRIPGEPFVPKPAKEAEMERIMRSMEGMPGAPGM-----QMYSKEDLMNMKNFGGEDGDD 229
R P +P P + + I+ + PGA + Q+ + EDL + ++ G+ D
Sbjct: 2530 TR-PSPVPMPAPGPDDDAVAIVGAAGRFPGADDLDTFWQQLRAGEDL--IADYPGDRFDG 2586
Query: 230 NDDEDDDDESDFPKNLGKV 248
+ +DFPK G++
Sbjct: 2587 GPYAEVVARADFPKFAGRI 2605
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.132 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,923,564,249
Number of Sequences: 23463169
Number of extensions: 216491904
Number of successful extensions: 1371195
Number of sequences better than 100.0: 616
Number of HSP's better than 100.0 without gapping: 125
Number of HSP's successfully gapped in prelim test: 491
Number of HSP's that attempted gapping in prelim test: 1363822
Number of HSP's gapped (non-prelim): 4396
length of query: 305
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 163
effective length of database: 9,027,425,369
effective search space: 1471470335147
effective search space used: 1471470335147
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 76 (33.9 bits)