BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028400
(209 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224071369|ref|XP_002303426.1| predicted protein [Populus trichocarpa]
gi|222840858|gb|EEE78405.1| predicted protein [Populus trichocarpa]
Length = 207
Score = 278 bits (710), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 148/210 (70%), Positives = 163/210 (77%), Gaps = 5/210 (2%)
Query: 2 VSESDNPEQNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQI 61
+SE NP SGL IS + IRSFLLSAS+D +PREL++ ALNLSS N YKS R I
Sbjct: 1 MSEPRNPP---SGLRISMTEPIRSFLLSASKDGDVPRELQDLALNLSSQNNLHYKSFRSI 57
Query: 62 WVGSLPSIRP-DLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDIL 120
W+ S PS RP DL RLFSGS F+ +SPKPREKSEELKARLR+L E AER Y ELVKDI
Sbjct: 58 WIRSPPSTRPPDLIRLFSGSNFILSSPKPREKSEELKARLRKLEEMAERKAYAELVKDIT 117
Query: 121 PKSSATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGM 180
P+ + EPFSSYKDQ+GFGLHVVLIMFTGYLVGY AFRALF HS AMSAAGGILGLV GM
Sbjct: 118 PRKNTDEPFSSYKDQMGFGLHVVLIMFTGYLVGYAAFRALFGHSPAMSAAGGILGLVFGM 177
Query: 181 LVETLLFIIRSSNHDNKSSA-TASKQKKNQ 209
LVET LFIIR+SN D +SS SK KKNQ
Sbjct: 178 LVETFLFIIRTSNPDPRSSTPNTSKLKKNQ 207
>gi|255573453|ref|XP_002527652.1| conserved hypothetical protein [Ricinus communis]
gi|223532957|gb|EEF34723.1| conserved hypothetical protein [Ricinus communis]
Length = 213
Score = 261 bits (668), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 134/197 (68%), Positives = 151/197 (76%), Gaps = 2/197 (1%)
Query: 1 MVSESDNPEQNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQ 60
M S +D P N GL I + IRSFLLSAS++ L ++LR+ A L S + YKS+R
Sbjct: 1 MSSATDPP--NPPGLTILLTEPIRSFLLSASKEPHLSQQLRDLASFLVSQHNVQYKSLRS 58
Query: 61 IWVGSLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDIL 120
IW+ S S RPDL RLFSGSE V TSPKPREKSEELKARLR+L E AER+ Y+ELVKDI
Sbjct: 59 IWIASPHSTRPDLIRLFSGSELVLTSPKPREKSEELKARLRKLEELAERNAYQELVKDIT 118
Query: 121 PKSSATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGM 180
P+ EPFSSYKDQLGFGLHV L MFTGYLVGY AFRALF HS AMSAAGGILGLVCGM
Sbjct: 119 PRKDEIEPFSSYKDQLGFGLHVALTMFTGYLVGYAAFRALFGHSPAMSAAGGILGLVCGM 178
Query: 181 LVETLLFIIRSSNHDNK 197
LVETLLFIIR+S+ D +
Sbjct: 179 LVETLLFIIRTSSQDPR 195
>gi|225427171|ref|XP_002279767.1| PREDICTED: uncharacterized protein LOC100259119 [Vitis vinifera]
gi|297742081|emb|CBI33868.3| unnamed protein product [Vitis vinifera]
Length = 206
Score = 253 bits (646), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 130/197 (65%), Positives = 154/197 (78%)
Query: 13 SGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRPD 72
+GLII N D+IRSFL SAS D L +L+ TA L+S PYK +R +WV S RP
Sbjct: 10 NGLIIGNTDTIRSFLRSASTDPHLSADLQLTASTLASQTTLPYKPLRALWVASPSQTRPK 69
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSSY 132
L L GS+F+FTSPKPREKSEELK RLR+LA+ AER+ Y ELVKDI PK + EPF+SY
Sbjct: 70 LSGLLYGSQFIFTSPKPREKSEELKLRLRKLADMAERNAYEELVKDITPKKTTDEPFASY 129
Query: 133 KDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSS 192
KDQLGFGLHVV+ MF+GYLVGY AFRALFSH+ M+AAGGILGLVC MLVETLLFIIR+S
Sbjct: 130 KDQLGFGLHVVVTMFSGYLVGYAAFRALFSHNPVMNAAGGILGLVCAMLVETLLFIIRTS 189
Query: 193 NHDNKSSATASKQKKNQ 209
+HD +SS++ S+ KKNQ
Sbjct: 190 SHDLRSSSSTSRMKKNQ 206
>gi|449441468|ref|XP_004138504.1| PREDICTED: uncharacterized protein LOC101222938 [Cucumis sativus]
gi|449522720|ref|XP_004168374.1| PREDICTED: uncharacterized protein LOC101228437 [Cucumis sativus]
Length = 206
Score = 251 bits (642), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 128/184 (69%), Positives = 142/184 (77%)
Query: 7 NPEQNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSL 66
N SGL++S+ ++IRSFL SAS D QL ELR+ A +L S PYK +R IW +
Sbjct: 3 NDHALPSGLVLSHTEAIRSFLTSASIDSQLSDELRQIASDLVSQRNIPYKLLRAIWFATE 62
Query: 67 PSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSAT 126
S RPDL RL +GSEFVFTSPKPREKSEELKARL++LA+ AER Y+ELVKDI PK
Sbjct: 63 SSTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLADVAERKAYQELVKDIAPKKPID 122
Query: 127 EPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLL 186
EPFSSYKDQLGFGLHVVLIMFTGYLVGY FRALF H MSAAGGILGLV GMLVETLL
Sbjct: 123 EPFSSYKDQLGFGLHVVLIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLL 182
Query: 187 FIIR 190
FIIR
Sbjct: 183 FIIR 186
>gi|388492418|gb|AFK34275.1| unknown [Lotus japonicus]
Length = 227
Score = 249 bits (637), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 122/195 (62%), Positives = 149/195 (76%)
Query: 14 GLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRPDL 73
GLIIS DSIR+FL AS + L ELR+++ +L + PY +R +W+ S PS RP+L
Sbjct: 22 GLIISITDSIRTFLTGASNNPTLSLELRQSSSDLLRQSDVPYAPLRAVWIASDPSTRPEL 81
Query: 74 FRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSSYK 133
RLFSG+ FVF+SPKPREKSEELK RLR+L + AER Y+ELVKDI PK EPFSSYK
Sbjct: 82 TRLFSGTRFVFSSPKPREKSEELKERLRKLEDLAERKAYQELVKDIAPKQDVVEPFSSYK 141
Query: 134 DQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSSN 193
DQLGFGLHV++ MFTGYLVGY AFRALF+H AM+AAGGILGLV MLVET LFIIRSSN
Sbjct: 142 DQLGFGLHVLVTMFTGYLVGYAAFRALFNHGPAMNAAGGILGLVGAMLVETFLFIIRSSN 201
Query: 194 HDNKSSATASKQKKN 208
D+ + ++K+ ++
Sbjct: 202 LDDSKTRKSNKKPRS 216
>gi|15237323|ref|NP_200110.1| uncharacterized protein [Arabidopsis thaliana]
gi|8809587|dbj|BAA97138.1| unnamed protein product [Arabidopsis thaliana]
gi|46518365|gb|AAS99664.1| At5g52980 [Arabidopsis thaliana]
gi|48310362|gb|AAT41806.1| At5g52980 [Arabidopsis thaliana]
gi|110743071|dbj|BAE99428.1| hypothetical protein [Arabidopsis thaliana]
gi|332008903|gb|AED96286.1| uncharacterized protein [Arabidopsis thaliana]
Length = 222
Score = 249 bits (636), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 131/201 (65%), Positives = 148/201 (73%), Gaps = 1/201 (0%)
Query: 6 DNPE-QNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVG 64
D P+ +N SGLI+S + +RSFL AS D +L ELR A +L S N PYK +R IW G
Sbjct: 2 DQPDPKNGSGLILSATEPLRSFLALASGDRRLSDELRGIASDLRSKNTIPYKLLRAIWTG 61
Query: 65 SLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSS 124
S PS RPDL LFSGS FVFTSPKPREKSEELK RL +L E AER EY ELVKDI PK
Sbjct: 62 SDPSTRPDLLGLFSGSNFVFTSPKPREKSEELKLRLVKLREIAERKEYAELVKDITPKKQ 121
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
EPFSSYKDQLGFGLHV L MFTGYLVGY +FRALF+ + A+SAAGGILGLV MLVET
Sbjct: 122 VEEPFSSYKDQLGFGLHVGLTMFTGYLVGYASFRALFNRNPALSAAGGILGLVLAMLVET 181
Query: 185 LLFIIRSSNHDNKSSATASKQ 205
LLFII++S D S+ + Q
Sbjct: 182 LLFIIKTSKDDQIQSSKSFTQ 202
>gi|351721100|ref|NP_001238222.1| uncharacterized protein LOC100499678 [Glycine max]
gi|255625721|gb|ACU13205.1| unknown [Glycine max]
Length = 215
Score = 249 bits (635), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 127/211 (60%), Positives = 151/211 (71%), Gaps = 10/211 (4%)
Query: 9 EQNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPS 68
+ N GL+IS D IR FL AS+D L EL+ T+ +L ++ PY R +W+ S PS
Sbjct: 5 KSNRVGLVISRTDPIRDFLSDASKDATLSDELQRTSSDLLLQSEIPYVPFRTVWMASDPS 64
Query: 69 IRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEP 128
RPDL RLFSG+ F+F+SPKPREKSEELKARL++L + AER Y+ELVKDI P EP
Sbjct: 65 TRPDLTRLFSGTRFIFSSPKPREKSEELKARLKKLEDIAERKAYQELVKDITPPKDVQEP 124
Query: 129 FSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFI 188
FSSYKDQLGFGLHVV+ MFTGYL+GY AFRALF+HS AM+AAGGILGLV M VET LFI
Sbjct: 125 FSSYKDQLGFGLHVVVTMFTGYLLGYAAFRALFNHSPAMNAAGGILGLVGAMFVETFLFI 184
Query: 189 IRSSNHDN----------KSSATASKQKKNQ 209
IRSSN D +SS + S+ KKNQ
Sbjct: 185 IRSSNADADKTRESSQKPRSSFSTSRIKKNQ 215
>gi|297796123|ref|XP_002865946.1| hypothetical protein ARALYDRAFT_918360 [Arabidopsis lyrata subsp.
lyrata]
gi|297311781|gb|EFH42205.1| hypothetical protein ARALYDRAFT_918360 [Arabidopsis lyrata subsp.
lyrata]
Length = 212
Score = 245 bits (625), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 131/211 (62%), Positives = 151/211 (71%), Gaps = 7/211 (3%)
Query: 6 DNPE-QNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVG 64
D P+ +N +GLI+S + +RSFL AS D++L ELR A +LSS N YK +R IW G
Sbjct: 2 DQPDPKNGAGLILSVTEPMRSFLAIASGDLRLSEELRGIASDLSSKNTVSYKLLRAIWTG 61
Query: 65 SLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSS 124
S P RP L LFSGS+FVFTSPKPREKSEELK RL +L E AER EY ELVKDI PK
Sbjct: 62 SDPLTRPGLLGLFSGSDFVFTSPKPREKSEELKMRLLKLKEIAERKEYAELVKDITPKKQ 121
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
EPFSSYKDQLGFGLHV L MFTGYLVGY +FRALF+ + A+SAAGGILGLV MLVET
Sbjct: 122 VEEPFSSYKDQLGFGLHVGLTMFTGYLVGYASFRALFNRNPALSAAGGILGLVLAMLVET 181
Query: 185 LLFIIRSSNHDNKSSATA------SKQKKNQ 209
LLFII++S D S+ + KKNQ
Sbjct: 182 LLFIIKTSKDDQIQSSKSFTPSFTPTTKKNQ 212
>gi|297826025|ref|XP_002880895.1| hypothetical protein ARALYDRAFT_481612 [Arabidopsis lyrata subsp.
lyrata]
gi|297326734|gb|EFH57154.1| hypothetical protein ARALYDRAFT_481612 [Arabidopsis lyrata subsp.
lyrata]
Length = 212
Score = 244 bits (623), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/211 (62%), Positives = 150/211 (71%), Gaps = 7/211 (3%)
Query: 6 DNPE-QNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVG 64
D P+ +N SGLI+S + +RSFL AS D++L ELR A +LSS N YK +R IW G
Sbjct: 2 DQPDPKNGSGLILSVTEPMRSFLAIASGDLRLSEELRGIASDLSSKNSVSYKLLRAIWTG 61
Query: 65 SLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSS 124
S P RP L LFSG +FVFTSPKPREKSEELK RL +L E AER EY ELVKDI PK
Sbjct: 62 SDPLTRPGLLGLFSGCDFVFTSPKPREKSEELKMRLLKLKEIAERKEYAELVKDITPKKQ 121
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
EPFSSYKDQLGFGLHV L MFTGYLVGY +FRALF+ + A+SAAGGILGLV MLVET
Sbjct: 122 VEEPFSSYKDQLGFGLHVGLTMFTGYLVGYASFRALFNRNPALSAAGGILGLVLAMLVET 181
Query: 185 LLFIIRSSNHDNKSSATA------SKQKKNQ 209
LLFII++S D S+ + KKNQ
Sbjct: 182 LLFIIKTSKDDQIQSSKSFTPSFTPTTKKNQ 212
>gi|356576193|ref|XP_003556218.1| PREDICTED: uncharacterized protein LOC100808257 [Glycine max]
Length = 215
Score = 243 bits (620), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 128/215 (59%), Positives = 149/215 (69%), Gaps = 12/215 (5%)
Query: 5 SDNPEQNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVG 64
+D P N G I D IR FL AS+D L ELR T+ +L + PY +R +W+
Sbjct: 3 TDKP--NRVGSFIYITDPIRVFLSGASKDATLSDELRRTSSDLLLQSDVPYAQLRAVWMA 60
Query: 65 SLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSS 124
S PS RPDL RLFSG+ F+F+SPKPREKSEELKARL++L + AER Y+ELVKDI P
Sbjct: 61 SDPSTRPDLTRLFSGTRFIFSSPKPREKSEELKARLKKLEDIAERKAYQELVKDITPPKD 120
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
EPFSSYKDQLGFGLHVV+ MFTGYL+GY AFRALF+HS AM+AAGGILGLV M VET
Sbjct: 121 VPEPFSSYKDQLGFGLHVVVTMFTGYLLGYAAFRALFNHSPAMNAAGGILGLVGAMFVET 180
Query: 185 LLFIIRSSNHD----------NKSSATASKQKKNQ 209
LFIIRSSN D +SS + S KKNQ
Sbjct: 181 FLFIIRSSNTDANKTRKSSQKARSSFSTSGIKKNQ 215
>gi|255647472|gb|ACU24200.1| unknown [Glycine max]
Length = 215
Score = 243 bits (620), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 128/215 (59%), Positives = 148/215 (68%), Gaps = 12/215 (5%)
Query: 5 SDNPEQNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVG 64
+D P N G I D IR FL AS+D L ELR T+ +L + PY +R +W+
Sbjct: 3 TDKP--NWVGSFIYITDPIRVFLSGASKDATLSDELRRTSSDLLHQSDVPYAQLRAVWMA 60
Query: 65 SLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSS 124
S PS RPDL RLFSG+ F+F+SPKPREKSEELKARL++L + AER Y+ELVKDI P
Sbjct: 61 SDPSTRPDLTRLFSGTRFIFSSPKPREKSEELKARLKKLEDIAERKAYQELVKDITPPKD 120
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
EPFSSYKDQLGFGLHVV+ MFTGYL GY AFRALF+HS AM+AAGGILGLV M VET
Sbjct: 121 VPEPFSSYKDQLGFGLHVVVTMFTGYLFGYAAFRALFNHSPAMNAAGGILGLVGAMFVET 180
Query: 185 LLFIIRSSNHD----------NKSSATASKQKKNQ 209
LFIIRSSN D +SS + S KKNQ
Sbjct: 181 FLFIIRSSNTDANKTRKSSQKARSSFSTSGIKKNQ 215
>gi|388505564|gb|AFK40848.1| unknown [Medicago truncatula]
Length = 216
Score = 242 bits (617), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 124/208 (59%), Positives = 150/208 (72%), Gaps = 10/208 (4%)
Query: 11 NSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIR 70
N GLIIS+ IR+FL AS D L ELR+ +L + + PY+ +R IW+ S PS R
Sbjct: 8 NRVGLIISSTHFIRAFLSEASTDTTLSAELRQKCSDLIAQSNVPYEPLRAIWIASDPSTR 67
Query: 71 PDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFS 130
P+L +L G+ F+F+SPKPREKSEELKARL++L + AER Y+ELVKDI PK TEPFS
Sbjct: 68 PELTQLLWGTSFIFSSPKPREKSEELKARLKKLEDIAERKAYQELVKDIAPKEDVTEPFS 127
Query: 131 SYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIR 190
SYKDQLGFGLHV++ MFTGYLVGY AFRALF HS AM+AAGGILGLV MLVET LFIIR
Sbjct: 128 SYKDQLGFGLHVLVTMFTGYLVGYAAFRALFDHSPAMNAAGGILGLVGAMLVETFLFIIR 187
Query: 191 SSNHDN----------KSSATASKQKKN 208
+S+ D+ +SS + S KKN
Sbjct: 188 NSDLDSSKTKRSNQKPRSSFSTSSLKKN 215
>gi|357443357|ref|XP_003591956.1| hypothetical protein MTR_1g095620 [Medicago truncatula]
gi|358349054|ref|XP_003638555.1| hypothetical protein MTR_136s0027 [Medicago truncatula]
gi|355481004|gb|AES62207.1| hypothetical protein MTR_1g095620 [Medicago truncatula]
gi|355504490|gb|AES85693.1| hypothetical protein MTR_136s0027 [Medicago truncatula]
gi|388490618|gb|AFK33375.1| unknown [Medicago truncatula]
Length = 216
Score = 242 bits (617), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 124/208 (59%), Positives = 150/208 (72%), Gaps = 10/208 (4%)
Query: 11 NSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIR 70
N GLIIS+ IR+FL AS D L ELR+ +L + + PY+ +R IW+ S PS R
Sbjct: 8 NRVGLIISSTHFIRAFLSEASTDTTLSAELRQKCSDLIAQSNVPYEPLRAIWIASDPSTR 67
Query: 71 PDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFS 130
P+L +L G+ F+F+SPKPREKSEELKARL++L + AER Y+ELVKDI PK TEPFS
Sbjct: 68 PELTQLLWGTSFIFSSPKPREKSEELKARLKKLEDIAERKAYQELVKDIAPKEDVTEPFS 127
Query: 131 SYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIR 190
SYKDQLGFGLHV++ MFTGYLVGY AFRALF HS AM+AAGGILGLV MLVET LFIIR
Sbjct: 128 SYKDQLGFGLHVLVTMFTGYLVGYAAFRALFDHSPAMNAAGGILGLVGAMLVETFLFIIR 187
Query: 191 SSNHDN----------KSSATASKQKKN 208
+S+ D+ +SS + S KKN
Sbjct: 188 NSDLDSNKTKRSNQKPRSSFSTSSLKKN 215
>gi|297796117|ref|XP_002865943.1| hypothetical protein ARALYDRAFT_495363 [Arabidopsis lyrata subsp.
lyrata]
gi|297311778|gb|EFH42202.1| hypothetical protein ARALYDRAFT_495363 [Arabidopsis lyrata subsp.
lyrata]
Length = 215
Score = 237 bits (604), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/214 (63%), Positives = 153/214 (71%), Gaps = 10/214 (4%)
Query: 6 DNPE-QNSSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVG 64
D P+ +N +GLI+S + IRSFL AS D L ELRE A +L S N PYK +R +W G
Sbjct: 2 DQPDPKNGAGLILSATEPIRSFLALASGDRHLSEELREIATDLRSKNTVPYKLLRALWTG 61
Query: 65 SLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSS 124
S PS RPDL LFSGS+FVFTSPKPREKSEELK RL +L E AER EY ELVKDI PK
Sbjct: 62 SDPSTRPDLLGLFSGSDFVFTSPKPREKSEELKLRLLKLREIAERKEYAELVKDITPKKQ 121
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
EPFSSYKDQLGFGLHV L MFTGYLVGY +FRALF+ + A+SAAGGILGLV MLVET
Sbjct: 122 VEEPFSSYKDQLGFGLHVGLTMFTGYLVGYASFRALFNRNPALSAAGGILGLVLAMLVET 181
Query: 185 LLFIIRSSNHDNKSSATASKQ---------KKNQ 209
LLFII++S D S+ + Q KKNQ
Sbjct: 182 LLFIIKTSKDDQIQSSKSFTQSSASFTPTTKKNQ 215
>gi|125538675|gb|EAY85070.1| hypothetical protein OsI_06427 [Oryza sativa Indica Group]
Length = 199
Score = 224 bits (572), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 109/196 (55%), Positives = 147/196 (75%), Gaps = 2/196 (1%)
Query: 12 SSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRP 71
+SGL+++ D +R+FL SA+ LP +LR+ A +L++ + PY+S+R IW + P RP
Sbjct: 2 ASGLVVAATDPLRAFLASAAASHDLPADLRDLASSLAARSAVPYRSLRDIWCAASPGARP 61
Query: 72 DLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEP--F 129
L RL G++F+ +SPKPR+KS+ELKARL +L E ER EY ELV+D+ P + P F
Sbjct: 62 PLRRLLHGADFLLSSPKPRDKSDELKARLDKLREMQERKEYAELVRDVAPPAKEDRPELF 121
Query: 130 SSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFII 189
SSYKDQ+GFGLHVVLIMFTGYLVG++AFRALF++S M+AAGGILGLV GML+ET+LFII
Sbjct: 122 SSYKDQIGFGLHVVLIMFTGYLVGFVAFRALFNNSPVMNAAGGILGLVGGMLMETVLFII 181
Query: 190 RSSNHDNKSSATASKQ 205
RSS+ + +S K+
Sbjct: 182 RSSSKELATSVPRPKK 197
>gi|226498204|ref|NP_001143957.1| uncharacterized protein LOC100276771 [Zea mays]
gi|195632544|gb|ACG36708.1| hypothetical protein [Zea mays]
gi|238005752|gb|ACR33911.1| unknown [Zea mays]
gi|414867905|tpg|DAA46462.1| TPA: hypothetical protein ZEAMMB73_551061 [Zea mays]
Length = 203
Score = 221 bits (562), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 113/199 (56%), Positives = 143/199 (71%), Gaps = 5/199 (2%)
Query: 13 SGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRPD 72
+GL ++ IRSFL SA+ V LP +LR+ A +L+S Y+S+R IW + P RP
Sbjct: 8 TGLAVAATAPIRSFLSSAAASVSLPADLRDLASDLASDPAVSYRSLRAIWCATSPDTRPP 67
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILP--KSSATEPFS 130
L L G++FV SPKPREKS+ELKARL +L E ER EY ELV+D+ P K A EPFS
Sbjct: 68 LRDLLQGADFVLPSPKPREKSDELKARLEKLRETQERKEYAELVRDVAPPSKDDAPEPFS 127
Query: 131 SYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIR 190
SYKDQ+GFGLHVV+IMFTGYLVG+ F+ALF+++ ++AAGGILGLV GMLVET+LFIIR
Sbjct: 128 SYKDQIGFGLHVVVIMFTGYLVGFAMFKALFNNNAVLNAAGGILGLVGGMLVETILFIIR 187
Query: 191 SSNHDNKSSATASKQKKNQ 209
SS +K A+ + KK Q
Sbjct: 188 SS---SKELASVPRSKKAQ 203
>gi|125568702|gb|EAZ10217.1| hypothetical protein OsJ_00047 [Oryza sativa Japonica Group]
Length = 199
Score = 212 bits (540), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 108/194 (55%), Positives = 145/194 (74%), Gaps = 2/194 (1%)
Query: 14 GLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRPDL 73
GL+++ D +R+F SA+ LP +LR+ A +L++ + PY+S+R IW + P RP L
Sbjct: 4 GLVVAATDPLRAFFASAAASHDLPADLRDLASSLAARSAVPYRSLRDIWCAASPGARPPL 63
Query: 74 FRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILP--KSSATEPFSS 131
RL G++F+ +SPKPR+KS+ELKARL +L E ER EY ELV+D+ P K + E FSS
Sbjct: 64 RRLLHGADFLLSSPKPRDKSDELKARLDKLREMQERKEYAELVRDVAPPAKEDSPELFSS 123
Query: 132 YKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRS 191
YKDQ+GFGLHVVLIMFTGYLVG++AFRALF++S M+AAGGILGLV GML+ET+LFIIRS
Sbjct: 124 YKDQIGFGLHVVLIMFTGYLVGFVAFRALFNNSPVMNAAGGILGLVGGMLMETVLFIIRS 183
Query: 192 SNHDNKSSATASKQ 205
S+ + +S K+
Sbjct: 184 SSKELATSVPRPKK 197
>gi|242034961|ref|XP_002464875.1| hypothetical protein SORBIDRAFT_01g028050 [Sorghum bicolor]
gi|241918729|gb|EER91873.1| hypothetical protein SORBIDRAFT_01g028050 [Sorghum bicolor]
Length = 204
Score = 206 bits (523), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 112/198 (56%), Positives = 141/198 (71%), Gaps = 2/198 (1%)
Query: 12 SSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRP 71
+ GL ++ D I SFL SA+ V L +LR+ A +L+S Y+S+R IW + RP
Sbjct: 7 APGLAVAATDPISSFLSSAAASVDLDTDLRDLAADLTSDLAVSYRSLRAIWCATSSDTRP 66
Query: 72 DLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILP--KSSATEPF 129
L L G++FV SPKPREKS+ELKARL +L E ER EY ELV+D+ P K A EPF
Sbjct: 67 PLRDLLQGADFVLPSPKPREKSDELKARLEKLREIQERKEYAELVRDVAPPSKDEAPEPF 126
Query: 130 SSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFII 189
SSYKDQ+GFGLHVV+IMFTGYLVG+ F+ALF++S +SAAGGILGLV GMLVET+LFII
Sbjct: 127 SSYKDQIGFGLHVVVIMFTGYLVGFSMFKALFNNSAVLSAAGGILGLVGGMLVETVLFII 186
Query: 190 RSSNHDNKSSATASKQKK 207
RSS+ + SS SK+ +
Sbjct: 187 RSSSKELASSVPRSKKDQ 204
>gi|357147547|ref|XP_003574388.1| PREDICTED: uncharacterized protein LOC100838611 [Brachypodium
distachyon]
Length = 203
Score = 202 bits (513), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 106/178 (59%), Positives = 131/178 (73%)
Query: 12 SSGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRP 71
+ GL+++ D IRSFL SA+ L +LR+ A L+S PY+SIR IW RP
Sbjct: 7 TPGLVLAATDPIRSFLSSAAASADLAADLRDLASALASEPSVPYRSIRAIWCADSSPDRP 66
Query: 72 DLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSS 131
L +L ++FV SPKPREKS+ELKARL +L E ER EY +LV+D+ PK +EPFSS
Sbjct: 67 PLRQLLRDAQFVLLSPKPREKSDELKARLEKLREMQERKEYADLVRDVAPKEDNSEPFSS 126
Query: 132 YKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFII 189
YKDQLGFGLHVV+IMFTGYLVG+ AFRALFS+S M+AAGGILG+V GML+ET+LFII
Sbjct: 127 YKDQLGFGLHVVVIMFTGYLVGFAAFRALFSNSPVMNAAGGILGVVGGMLMETVLFII 184
>gi|294463425|gb|ADE77243.1| unknown [Picea sitchensis]
Length = 199
Score = 188 bits (477), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 98/189 (51%), Positives = 129/189 (68%), Gaps = 1/189 (0%)
Query: 14 GLIISNNDSIRSFL-LSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRPD 72
GL++ +N +IR FL + S E+ +A S ++ PY +IR IW PS RP
Sbjct: 9 GLVLKSNLAIREFLEMVVSFANTDSSEILSSAGKYVSSDEVPYTAIRDIWFQIEPSRRPK 68
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSSY 132
RL +G+ FV SPKPREKS+ELK RL +L E A+R EY LV+D+ K + FSS+
Sbjct: 69 FSRLLAGASFVLPSPKPREKSKELKDRLERLQEMADRKEYDALVRDVSLKKEDRQYFSSF 128
Query: 133 KDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSS 192
KDQLG+GLHV++IMF GYLVGYL+FRALF H+ M+AAGGI GLV ML+ET+LFIIRSS
Sbjct: 129 KDQLGYGLHVIVIMFVGYLVGYLSFRALFGHNPIMNAAGGICGLVSAMLLETVLFIIRSS 188
Query: 193 NHDNKSSAT 201
++ ++T
Sbjct: 189 SYQMSPAST 197
>gi|302785249|ref|XP_002974396.1| hypothetical protein SELMODRAFT_271067 [Selaginella moellendorffii]
gi|300157994|gb|EFJ24618.1| hypothetical protein SELMODRAFT_271067 [Selaginella moellendorffii]
Length = 215
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 81/187 (43%), Positives = 118/187 (63%), Gaps = 1/187 (0%)
Query: 13 SGLIISNNDSIRSFLLSASEDVQL-PRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRP 71
+GL + I+ FL +AS L P + E AL + PY+ IR +W P RP
Sbjct: 7 AGLRFHSTAGIQEFLDAASSSASLLPDRVSELALRYKDASVVPYQVIRDVWSNFAPEARP 66
Query: 72 DLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSS 131
RL G+ F F SPKPREK +EL+ RL++L E A++ Y++LVKD++ ++ P S+
Sbjct: 67 SFRRLLEGATFAFHSPKPREKPKELQERLQKLQELADQKAYKDLVKDVVKENEEMVPLST 126
Query: 132 YKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRS 191
YKDQ+GFGLHV++ MFTG+L+G+ FR+ F S + AGG++GL GM++ET+LFI+R
Sbjct: 127 YKDQIGFGLHVIVTMFTGFLLGFALFRSQFEESVVLHVAGGLIGLFAGMILETVLFIVRD 186
Query: 192 SNHDNKS 198
+ N S
Sbjct: 187 YQYQNSS 193
>gi|115445121|ref|NP_001046340.1| Os02g0225300 [Oryza sativa Japonica Group]
gi|113535871|dbj|BAF08254.1| Os02g0225300, partial [Oryza sativa Japonica Group]
Length = 160
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 85/133 (63%), Positives = 107/133 (80%), Gaps = 2/133 (1%)
Query: 75 RLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILP--KSSATEPFSSY 132
RL G++F+ +SPKPR+KS+ELKARL +L E ER EY ELV+D+ P K + E FSSY
Sbjct: 26 RLLHGADFLLSSPKPRDKSDELKARLDKLREMQERKEYAELVRDVAPPAKEDSPELFSSY 85
Query: 133 KDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSS 192
KDQ+GFGLHVVLIMFTGYLVG++AFRALF++S M+AAGGILGLV GML+ET+LFIIRSS
Sbjct: 86 KDQIGFGLHVVLIMFTGYLVGFVAFRALFNNSPVMNAAGGILGLVGGMLMETVLFIIRSS 145
Query: 193 NHDNKSSATASKQ 205
+ + +S K+
Sbjct: 146 SKELATSVPRPKK 158
>gi|302807995|ref|XP_002985692.1| hypothetical protein SELMODRAFT_181844 [Selaginella moellendorffii]
gi|300146601|gb|EFJ13270.1| hypothetical protein SELMODRAFT_181844 [Selaginella moellendorffii]
Length = 214
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 81/187 (43%), Positives = 117/187 (62%), Gaps = 1/187 (0%)
Query: 13 SGLIISNNDSIRSFLLSASEDVQL-PRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRP 71
+GL + I+ FL +AS L P + E AL + PY+ IR +W P RP
Sbjct: 7 AGLRFHSTAGIQEFLDAASSSASLLPDRVSELALRYKDASVVPYQVIRDVWSNFSPEARP 66
Query: 72 DLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSS 131
RL G+ F F SPKPREK EL+ RL++L E A++ Y++LVKD++ ++ P S+
Sbjct: 67 SFRRLLEGATFAFQSPKPREKPRELQERLQKLQELADQKAYKDLVKDVVKENEEMVPLST 126
Query: 132 YKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRS 191
YKDQ+GFGLHV++ MFTG+L+G+ FR+ F S + AGG++GL GM++ET+LFI+R
Sbjct: 127 YKDQIGFGLHVIVTMFTGFLLGFALFRSQFEESVVLHVAGGLIGLFAGMILETVLFIVRD 186
Query: 192 SNHDNKS 198
+ N S
Sbjct: 187 YQYQNSS 193
>gi|167999143|ref|XP_001752277.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696672|gb|EDQ83010.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 245
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 80/196 (40%), Positives = 117/196 (59%), Gaps = 1/196 (0%)
Query: 13 SGLIISNNDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRPD 72
+GL++ + +I+S L +A+ ++ +LR+ AL PY IR W P
Sbjct: 1 TGLVLHSCPNIKSLLSNAASSEKISVDLRDAALLYLDAPSVPYPIIRDTWAQLTPGYGVP 60
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSAT-EPFSS 131
++ +GS FV SPKPREKS+EL RL +L + A+++ Y+ELVKD+ + E FSS
Sbjct: 61 FHQVLAGSSFVLESPKPREKSKELVDRLAKLQDLADKNAYKELVKDVTKSADDDREYFSS 120
Query: 132 YKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRS 191
YK+ LGFG+HV+ IMFTGY+ GY R+ F S M A GG+ GL+ GM VET+LFI R+
Sbjct: 121 YKNSLGFGIHVLFIMFTGYMFGYAIVRSQFPESPPMHAVGGVFGLIAGMFVETILFITRA 180
Query: 192 SNHDNKSSATASKQKK 207
+ + A+ + K
Sbjct: 181 QMAEKRQQLRANVKAK 196
>gi|215693950|dbj|BAG89159.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 104
Score = 132 bits (333), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 66/100 (66%), Positives = 81/100 (81%), Gaps = 2/100 (2%)
Query: 108 ERDEYRELVKDILP--KSSATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHST 165
ER EY ELV+D+ P K + E FSSYKDQ+GFGLHVVLIMFTGYLVG++AFRALF++S
Sbjct: 3 ERKEYAELVRDVAPPAKEDSPELFSSYKDQIGFGLHVVLIMFTGYLVGFVAFRALFNNSP 62
Query: 166 AMSAAGGILGLVCGMLVETLLFIIRSSNHDNKSSATASKQ 205
M+AAGGILGLV GML+ET+LFIIRSS+ + +S K+
Sbjct: 63 VMNAAGGILGLVGGMLMETVLFIIRSSSKELATSVPRPKK 102
>gi|159464663|ref|XP_001690561.1| hypothetical protein CHLREDRAFT_188413 [Chlamydomonas reinhardtii]
gi|158280061|gb|EDP05820.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 85/180 (47%), Gaps = 7/180 (3%)
Query: 20 NDSIRSFLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQI---WVGSLPSIRPDLFRL 76
+ + + L A+ LP ++RE A L + + S++ + + + P L ++
Sbjct: 7 TEGVATLLEEAASCSSLPNDVREDAQKLHGAEQIDWSSLKAVADAYRAANPGKPVYLHKI 66
Query: 77 FSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEP----FSSY 132
S + + SP +EKS EL ARL +L + Y +V D+ + +
Sbjct: 67 CSQRDVILPSPPVKEKSPELVARLAKLQAELDNKRYDAMVADVTSAERKADEMRGVLPNA 126
Query: 133 KDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSS 192
+ QL FG HV++ MFT + + + F S G +GL CG+L+ET+L IIR++
Sbjct: 127 RLQLSFGAHVLVTMFTFWATAFYGTKIWFGWDPLWSGMAGAVGLTCGLLIETVLLIIRTN 186
>gi|414867904|tpg|DAA46461.1| TPA: hypothetical protein ZEAMMB73_551061 [Zea mays]
Length = 61
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/64 (60%), Positives = 50/64 (78%), Gaps = 3/64 (4%)
Query: 146 MFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSSNHDNKSSATASKQ 205
MFTGYLVG+ F+ALF+++ ++AAGGILGLV GMLVET+LFIIRSS +K A+ +
Sbjct: 1 MFTGYLVGFAMFKALFNNNAVLNAAGGILGLVGGMLVETILFIIRSS---SKELASVPRS 57
Query: 206 KKNQ 209
KK Q
Sbjct: 58 KKAQ 61
>gi|428182752|gb|EKX51612.1| hypothetical protein GUITHDRAFT_42190, partial [Guillardia theta
CCMP2712]
Length = 128
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 66/128 (51%), Gaps = 6/128 (4%)
Query: 70 RPDLFRLFSGSEFVFTS-PKPREKSEELKARLRQLAERAERDEYRELVKDI----LPKSS 124
RP L L GS P PR+KS EL+ARL ++ + E EY + +D+ L +
Sbjct: 1 RPCLHELLEGSRIPLPDKPAPRKKSPELEARLAKIKAQIEEQEYDMMTRDVRRAELDQGD 60
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
++ F S +G GL+V++ T + GY A A + + G++GL+ G +ET
Sbjct: 61 PSD-FKSASGAIGEGLNVLVTKGTAFATGYYASVAAWGTDPFWNTIAGLVGLIIGFFIET 119
Query: 185 LLFIIRSS 192
LF+ RSS
Sbjct: 120 TLFVARSS 127
>gi|307104917|gb|EFN53168.1| hypothetical protein CHLNCDRAFT_136965 [Chlorella variabilis]
Length = 261
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 72/128 (56%), Gaps = 7/128 (5%)
Query: 71 PDLFRLFSGSEFVFTSPKPRE-KSEELKARLRQLAERAERDEYRELVKDI-----LPKSS 124
P L +L GS F P+ RE K EL+ RL +L R E+ +Y +V D+ ++
Sbjct: 64 PPLDQLCRGSAIAFPQPRRREGKPPELQRRLEELRARLEQAQYDAMVADVTQGERAAAAA 123
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
++Y+ Q+ FG+HV+ +M Y+ G++ A S+ A+ G++ ++ ML+ET
Sbjct: 124 REGGLATYRQQISFGVHVLAMMAAFYIFGHVLGMAATSNR-ALHPMVGLVLMMIAMLLET 182
Query: 185 LLFIIRSS 192
+L+IIR++
Sbjct: 183 VLYIIRTT 190
>gi|452822527|gb|EME29545.1| ABC transporter, subfamily B, ATP-binding & transmembrane domain
isoform 2 [Galdieria sulphuraria]
Length = 883
Score = 63.9 bits (154), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 65/124 (52%), Gaps = 8/124 (6%)
Query: 91 EKSEELKARLRQLAERAERDEYRELVKDILPKSSATEP-----FSSYKDQLGFGLHVVLI 145
++S L+ RL+ L +R E+ YREL++D LP + E F S+ +QL GL V+
Sbjct: 763 QRSPLLEERLKVLRQRHEQKRYRELIQD-LPSFTPNEASMAVNFQSFWEQLTLGLQVLTS 821
Query: 146 MFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSSNHDNKSSATASKQ 205
M TG++ GY L + T ++ G++ +C M V+ L I D + T +K
Sbjct: 822 MVTGFIFGYYVAWKLTKNQT-IALVWGLVAFICAMFVDVFLIITGLLAKDMMTLRT-TKS 879
Query: 206 KKNQ 209
KKN+
Sbjct: 880 KKNK 883
>gi|290978840|ref|XP_002672143.1| predicted protein [Naegleria gruberi]
gi|284085717|gb|EFC39399.1| predicted protein [Naegleria gruberi]
Length = 239
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 68/128 (53%), Gaps = 5/128 (3%)
Query: 73 LFRLFSGSEFVFT--SPKPREKSEELKARLRQLAERAERDEYRELVKDILPK-SSATEPF 129
L +L GS+ T +P +K + QL ++ +EY ++VK+++ + E
Sbjct: 72 LHKLVKGSKIRLTKQTPTITKKDPKWLKLTEQLKKKMADEEYSKMVKNVVGRPRDEREAI 131
Query: 130 SSYKDQLGFGLHVVLIMFTGYLVGYLAFRALF--SHSTAMSAAGGILGLVCGMLVETLLF 187
+SYK QLG GL +V+ M T +++ + + +F + T G+ GL+ G+L++ +L
Sbjct: 132 NSYKGQLGVGLDLVVTMATMFILFFTISQYIFEGENKTQNQMLFGLFGLIVGLLIDAILV 191
Query: 188 IIRSSNHD 195
I+R +D
Sbjct: 192 IVRGEKYD 199
>gi|424513669|emb|CCO66291.1| hypothetical protein CHLNCDRAFT_136965 [Bathycoccus prasinos]
Length = 217
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 65/126 (51%), Gaps = 12/126 (9%)
Query: 76 LFSGSEFVFTSPKPRE-KSEELKARLRQLAERAERDEYRELVKDI-----LPKSSATEPF 129
L GSE +F +PK R K+ E +L ++ ER E ++Y +V D+ + + SA E
Sbjct: 90 LLRGSEVIFKAPKKRAPKTPEFARKLDRIRERLEEEKYEAMVADVTKATGISEKSARERR 149
Query: 130 SS-----YKDQLGFGLHVVLIMFTGYLVGYLAFRALF-SHSTAMSAAGGILGLVCGMLVE 183
+S ++ LG +HV+ +M ++ G++A +F + G +G +C + +E
Sbjct: 150 TSSIGYLIRNDLGMAVHVMTLMAACFVGGFIAGAHVFPEFGWEIHFVCGTVGAICCLFME 209
Query: 184 TLLFII 189
LFI+
Sbjct: 210 AALFIM 215
>gi|403357183|gb|EJY78212.1| hypothetical protein OXYTRI_24636 [Oxytricha trifallax]
Length = 309
Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 58/117 (49%), Gaps = 13/117 (11%)
Query: 87 PKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSSYKDQ-----LGFGLH 141
K RE+S ++ L ++ R E+ +YRE+ K + A + + ++ Q L FG
Sbjct: 189 KKAREESHQI--VLEEIKNRVEKKKYREMTKSLRGGDQAEQEYDIHQKQEARFTLAFGFG 246
Query: 142 VVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGM---LVETLLFIIRSSNHD 195
+ +MF ++ GY +F S S IL LV G+ ++ET+LF+IR D
Sbjct: 247 FITLMFLSFVCGYFLGTHIFQLSQTNSL---ILSLVVGISTIIMETILFVIRMEKMD 300
>gi|358332087|dbj|GAA50803.1| hypothetical protein CLF_105062 [Clonorchis sinensis]
Length = 211
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 85/193 (44%), Gaps = 20/193 (10%)
Query: 11 NSSGLIISNNDSIRSFLLSASEDVQLPRELRETALN-LSSLNKAPYKSIRQI--WVGSLP 67
N + + ND I SFL E LP +LR+ L S + P+ ++ + W+
Sbjct: 5 NQIAVDLHGNDVIYSFLKRCLECDDLPGKLRDLVQPILDSKSTIPFVTLYECHSWLRCRQ 64
Query: 68 SIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPK----- 122
L+RLF+ + V +PK ++ +L R++ L E+ Y ++V+ I PK
Sbjct: 65 ENPGPLWRLFASTTVVLPAPKEPIRNADLDKRVKALREKFSNQAYDKMVEGIGPKLALSN 124
Query: 123 --------SSATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGIL 174
SS Q+ L+ ++++ G++ GY + S + +S A
Sbjct: 125 SMKGSETNSSLCSELKCLNKQMVMVLNFIVVVSAGFVFGYF-IPDMLSSNLDISMAT--- 180
Query: 175 GLVCGMLVETLLF 187
L+CG+ V L+F
Sbjct: 181 RLMCGLSVSLLIF 193
>gi|66803398|ref|XP_635542.1| hypothetical protein DDB_G0290801 [Dictyostelium discoideum AX4]
gi|60463862|gb|EAL62034.1| hypothetical protein DDB_G0290801 [Dictyostelium discoideum AX4]
Length = 251
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 60/123 (48%), Gaps = 7/123 (5%)
Query: 85 TSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSSYKDQLGFGLHVVL 144
+P P E AR + L + + +Y +LVK++ K + SS++ QL +++++
Sbjct: 99 VTPDP-----EFIARKQHLEKLNKEYQYSKLVKNVAKKETDLNELSSFRSQLSIAVNILV 153
Query: 145 IMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSS-NHDNKSSATAS 203
FT + G + G +LG++ G+ VE LF+IRSS N K A +
Sbjct: 154 TFFTLLVGGIFVGHNWLNSQLYGLVCGLVLGII-GVAVEIWLFVIRSSQNQMEKDKAEKT 212
Query: 204 KQK 206
++K
Sbjct: 213 EEK 215
>gi|299117632|emb|CBN75474.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 163
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 53/90 (58%), Gaps = 2/90 (2%)
Query: 108 ERDEYRELVKDI--LPKSSATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHST 165
E+ EY+ ++ +I P+ + + +S+K Q+G G ++V+ +FT +++ Y A + + +
Sbjct: 62 EQKEYQRMIDNIQRAPRETVRDIQASFKFQVGMGANMVVAVFTTFIISYWASKFIVGDNK 121
Query: 166 AMSAAGGILGLVCGMLVETLLFIIRSSNHD 195
+ G++G + MLVE L+F+ RS D
Sbjct: 122 SHRLVIGLVGAMFIMLVELLVFVTRSLKAD 151
>gi|196012983|ref|XP_002116353.1| hypothetical protein TRIADDRAFT_60361 [Trichoplax adhaerens]
gi|190580944|gb|EDV21023.1| hypothetical protein TRIADDRAFT_60361 [Trichoplax adhaerens]
Length = 193
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 12/104 (11%)
Query: 36 LPRELRETALNLSSLNKAPY---------KSIRQIWV---GSLPSIRPDLFRLFSGSEFV 83
LP +L NL+ +++ + +SI Q +V G + + L++L +GSE
Sbjct: 22 LPHDLHARLHNLTQKDQSQWPEAIPFQLVQSIHQYFVKTKGDDGTQKTYLYQLLAGSELY 81
Query: 84 FTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATE 127
+ +P +++ EL ARL +L + EY+E+VKD+ PK SA E
Sbjct: 82 RQAYQPPKRNPELLARLERLKAQQANREYQEMVKDVSPKQSAKE 125
>gi|330793105|ref|XP_003284626.1| hypothetical protein DICPUDRAFT_75603 [Dictyostelium purpureum]
gi|325085425|gb|EGC38832.1| hypothetical protein DICPUDRAFT_75603 [Dictyostelium purpureum]
Length = 259
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 60/121 (49%), Gaps = 3/121 (2%)
Query: 76 LFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSSYKDQ 135
L GS + + + E A+ + L + ++ EY +++K+I K +S+K Q
Sbjct: 96 LLKGSSIYYPPKIVQPPTPEQIAKKQYLEKISKEYEYSKMIKNISKKDEDKNELASFKTQ 155
Query: 136 LGFGLHVVLIMFTGYLVG-YLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSSNH 194
L +++++ FT +VG YL + L S M G + G + G+ E LF++R + H
Sbjct: 156 LSIAVNILVTFFTLLVVGMYLGNKWLNSLVKGM-IVGLVFG-IGGVAAEVWLFVLRGTEH 213
Query: 195 D 195
+
Sbjct: 214 E 214
>gi|307212362|gb|EFN88157.1| Transmembrane protein 199 [Harpegnathos saltator]
Length = 224
Score = 45.8 bits (107), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/146 (23%), Positives = 64/146 (43%), Gaps = 11/146 (7%)
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI------LPKSSAT 126
L LF S+ + SP+ ++ EL+AR+++L + + EYR + K I LP+ S
Sbjct: 80 LHELFDNSDVILPSPQITPRNPELEARIKKLKAQQDAREYRAMTKSIDSVRKFLPEDSIA 139
Query: 127 EPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLV---E 183
QL + + G+ G+ L + +LG++C ++V E
Sbjct: 140 YQMKQINKQLIAVAQFIFSVIAGFAFGFTGIELLVENLD--FGFRLLLGIICALIVALAE 197
Query: 184 TLLFIIRSSNHDNKSSATASKQKKNQ 209
++ + +D AT+ +K +Q
Sbjct: 198 IYFLALKLNEYDFDVLATSKTKKLHQ 223
>gi|255070725|ref|XP_002507444.1| hypothetical protein MICPUN_54861 [Micromonas sp. RCC299]
gi|226522719|gb|ACO68702.1| hypothetical protein MICPUN_54861 [Micromonas sp. RCC299]
Length = 219
Score = 44.7 bits (104), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 56/139 (40%), Gaps = 25/139 (17%)
Query: 79 GSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI----------LP----KSS 124
G +P P ++ L A + +L A EY + +D+ +P K
Sbjct: 79 GGGVQLVAPVPPARARALDAEISRLRRDAHDREYAAMTRDVSHSNWGLGFRIPNENEKDD 138
Query: 125 ATEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSH-----------STAMSAAGGI 173
S+ LGFG HV+ +MF L G+ A A+ +T + A+
Sbjct: 139 GRLRMSALTRDLGFGAHVLTVMFACALGGWFAGDAIVRGFGDRFDDDSVTNTIVKASLAG 198
Query: 174 LGLVCGMLVETLLFIIRSS 192
+G + M VE LLF++R
Sbjct: 199 IGAIGAMTVEALLFMLRDG 217
>gi|307189390|gb|EFN73800.1| Transmembrane protein 199 [Camponotus floridanus]
Length = 144
Score = 44.7 bits (104), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 31/142 (21%), Positives = 62/142 (43%), Gaps = 11/142 (7%)
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI------LPKSSAT 126
L LF+ + + PK ++ EL+AR+++L + EY+ + K+I LP+ S
Sbjct: 4 LHELFNDCDVILPIPKESPRNPELEARIQKLKAQQNAREYQTMTKNIDSMRKFLPEDSIA 63
Query: 127 EPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLV---E 183
QL ++ + TG++ G++ + + +LG++C +++ E
Sbjct: 64 YQMKQINKQLIAVAQFIISVITGFVFGFIGIELMVGNLD--FGFRLLLGIICALIIALAE 121
Query: 184 TLLFIIRSSNHDNKSSATASKQ 205
++ S D S S Q
Sbjct: 122 IYFLAVKLSETDTTSKPIKSHQ 143
>gi|145509473|ref|XP_001440675.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407903|emb|CAK73278.1| unnamed protein product [Paramecium tetraurelia]
Length = 208
Score = 44.7 bits (104), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 63/136 (46%), Gaps = 22/136 (16%)
Query: 88 KPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEPFSSYKDQLGFGLHVVLIMF 147
+P+E +E++AR + +RA + +L K + + E Y+ + FGL + +F
Sbjct: 78 RPKEVLDEIRAREMRCKDRAFISKSSQLTKGVF-QEKKQESIKEYEKSVNFGLSFIFSIF 136
Query: 148 T----GYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFII-------------R 190
GY +G F+ + S A++A L+ ++VET L+I+ R
Sbjct: 137 ASGLLGYYLGIYFFQLTYDQSLALAA----FFLIGALIVETGLYIVKIMKEDRIRKINQR 192
Query: 191 SSNHDNKSSATASKQK 206
+N NK+ T K++
Sbjct: 193 KNNKQNKTVPTKFKKE 208
>gi|308802369|ref|XP_003078498.1| unnamed protein product [Ostreococcus tauri]
gi|116056950|emb|CAL53239.1| unnamed protein product [Ostreococcus tauri]
Length = 227
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 12/140 (8%)
Query: 63 VGSLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI--- 119
+G + +R F L + E T + E+ E +A + +L + E +Y ++ +D+
Sbjct: 83 MGVIDPVRDIRFEL-TRRERTLTKEEENER-EAHRALMDKLRDEVEARKYADITRDVDRG 140
Query: 120 -LPKSSATEP-FSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHS--TAMSAAGGI-- 173
+S+A P F S G G HV+ IMF GY+ RAL ++ A G+
Sbjct: 141 RASRSNAPSPSFKSDVHAYGLGAHVLTIMFACATAGYVGGRALEQGGFFVGVAWARGLCT 200
Query: 174 -LGLVCGMLVETLLFIIRSS 192
LG G+ E L I+R S
Sbjct: 201 ALGAALGLFTEVGLLILRES 220
>gi|424852971|ref|ZP_18277348.1| hypothetical protein OPAG_09164 [Rhodococcus opacus PD630]
gi|356664894|gb|EHI44976.1| hypothetical protein OPAG_09164 [Rhodococcus opacus PD630]
Length = 200
Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 3/81 (3%)
Query: 98 ARLRQLAERAERDEYRELVKDILPKSSATEPFSSYKDQLGFGLHVVLIMFT-GYLVGYLA 156
AR QL E E L++DI P S +P S G G+ V+ T G L+ ++
Sbjct: 33 ARGGQLFEVQHFGESHSLIRDIFP--SRLQPLSISLANTGVGIVTVVAPLTSGLLIDHIG 90
Query: 157 FRALFSHSTAMSAAGGILGLV 177
A+F S A+ AAGGIL L+
Sbjct: 91 VGAVFWFSFALCAAGGILALL 111
>gi|407034119|gb|EKE37070.1| hypothetical protein ENU1_210410 [Entamoeba nuttalli P19]
Length = 157
Score = 41.2 bits (95), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 111 EYRELVKDILPKSSATEPFSSYKDQLGFGLHVVL-IMFTGYLVGY-LAFRALFSHSTAMS 168
EY + KD+ K + F + K+ G+ L IM T + G L F +
Sbjct: 46 EYNHMTKDLQEKYGTSLKFKNNKNDNEIGISEALNIMVTMIVCGCALGFIGYGYGGPVYA 105
Query: 169 AAGGILGLVCGMLVETLLFIIRSSNHDNKSSATASKQKKN 208
GG +G+ GM+V+TLL II S+ S + +Q +N
Sbjct: 106 GLGGAIGIFIGMIVDTLLVIIYSNKLKQPKSHSKHQQIQN 145
>gi|390355366|ref|XP_003728532.1| PREDICTED: transmembrane protein 199-like [Strongylocentrotus
purpuratus]
Length = 145
Score = 41.2 bits (95), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 64/137 (46%), Gaps = 16/137 (11%)
Query: 63 VGSLPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPK 122
+GS P+ L L G++ + E++ EL ARL +L E EYR + K++ K
Sbjct: 13 IGSKPAY---LHELLEGADVYHPPLREPERNPELVARLERLQAEQEDREYRRMTKNVKIK 69
Query: 123 SSATEPFS-------SYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAM--SAAGGI 173
+ P + +++ Q+ +VVL + G++ GY+A AL+ + AG I
Sbjct: 70 TYGESPLAQVGQEVRTFQVQMMGIFNVVLTIGGGFVFGYMA--ALYQDQPIIWRVTAGLI 127
Query: 174 LGLVCGMLVETLLFIIR 190
L + V L F I+
Sbjct: 128 LAFIVA--VADLYFFIK 142
>gi|427795259|gb|JAA63081.1| Putative protein arginine n-methyltransferase prmt1, partial
[Rhipicephalus pulchellus]
Length = 506
Score = 40.8 bits (94), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 56/128 (43%), Gaps = 19/128 (14%)
Query: 66 LPSIRPDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSSA 125
+P ++ DL + E + PKP ++ E+L++RL Q + ER RE+ + L +
Sbjct: 129 VPVVQDDLLLTYDVEELLSAGPKPEDECEQLRSRLLQAEQTIER--MREVAQSWLQSNEQ 186
Query: 126 TEP-------FSSYKDQLGFGLHVVLIMFTGYLVGY-----LAFRALFSHSTAMS--AAG 171
EP F SY +G+H+ ++ + Y L A + T +
Sbjct: 187 AEPVEHDEPYFESYGH---YGIHLEMLSDRPRMDSYRKAIELNADACIRNRTVLDVGCGT 243
Query: 172 GILGLVCG 179
GIL ++C
Sbjct: 244 GILSMLCA 251
>gi|332026248|gb|EGI66387.1| Transmembrane protein 199 [Acromyrmex echinatior]
Length = 224
Score = 40.0 bits (92), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 25/116 (21%), Positives = 50/116 (43%), Gaps = 8/116 (6%)
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI------LPKSSAT 126
L LF S+ + +PK ++ EL+AR+++L + EY+ + K I LP+ S
Sbjct: 82 LHELFDDSDVILPTPKETPRNPELEARIQKLTAQQNNREYQAMTKSIDSSRKFLPEDSVA 141
Query: 127 EPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLV 182
QL + + G+ G++ + +LG++C +++
Sbjct: 142 YQMKQINKQLIAVAQFIFSVIAGFAFGFIGVELIVGDLDF--GFRLLLGIICALII 195
>gi|116053428|ref|YP_793753.1| hypothetical protein PA14_69700 [Pseudomonas aeruginosa UCBPP-PA14]
gi|421177542|ref|ZP_15635191.1| hypothetical protein PACI27_5757 [Pseudomonas aeruginosa CI27]
gi|115588649|gb|ABJ14664.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14]
gi|404529231|gb|EKA39279.1| hypothetical protein PACI27_5757 [Pseudomonas aeruginosa CI27]
Length = 233
Score = 39.7 bits (91), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 56/122 (45%), Gaps = 9/122 (7%)
Query: 90 REKSEELKARLRQLAERAER-----DEYRELVKDILPKSSATEPFSSYKDQLGFGLHVVL 144
RE++ E++ RL QL + A D+ R LV D+L +S + S+ +D L HV
Sbjct: 59 RERNIEMRHRLSQLMDVARENDRLFDKTRRLVLDLLDATSLEDVVSTVEDSLRHEFHVPY 118
Query: 145 IMFTGYLVGYLAFRALFSHSTAMSAAGGILG---LVCGML-VETLLFIIRSSNHDNKSSA 200
+ + ++ S + A A GG+L VCG+L L F+ S+ D SA
Sbjct: 119 VSLILFSDSSVSVGRSVSSAEAHQAIGGLLSGGKTVCGVLRPHELAFLFGESDRDEIGSA 178
Query: 201 TA 202
Sbjct: 179 AV 180
>gi|322788000|gb|EFZ13841.1| hypothetical protein SINV_09869 [Solenopsis invicta]
Length = 223
Score = 39.3 bits (90), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 25/116 (21%), Positives = 50/116 (43%), Gaps = 8/116 (6%)
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI------LPKSSAT 126
L LF S+ + +PK ++ EL+AR+++L + EY+ + K I LP+ +
Sbjct: 81 LHELFDDSDVILPTPKETPRNPELEARIQKLTAQQNTREYQAMTKSIDSVRKFLPEDTLA 140
Query: 127 EPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLV 182
QL V + G+ G++ + +LG++C +++
Sbjct: 141 YQMKQMNKQLIAVAQFVFSVIAGFAFGFIGVELIIGDLDF--GFRLLLGIICALII 194
>gi|194695802|gb|ACF81985.1| unknown [Zea mays]
gi|413945279|gb|AFW77928.1| 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imi
dazole-4-carboxamide isomerase isoform 1 [Zea mays]
gi|413945280|gb|AFW77929.1| 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imi
dazole-4-carboxamide isomerase isoform 2 [Zea mays]
Length = 333
Score = 39.3 bits (90), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 60/151 (39%), Gaps = 8/151 (5%)
Query: 26 FLLSASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLPSIRPDLFRLFSGSEFVFT 85
LL+ ++ + R A + SL AP + WVGS S+RP R G V
Sbjct: 14 ILLAVNDTCGVTRMTSTCAARVPSLRWAPQRG----WVGSWVSVRPAKCRTSGGHAVVCA 69
Query: 86 SPKPREKSEELKARLRQLAERAERD---EYRELVKDILPKSSATEPFSSYKDQLGFGLHV 142
+ R + K +++Q+ RD + ELV + S E YK G HV
Sbjct: 70 AVSFRPCIDIHKGKVKQIVGSTLRDLANDSMELVTNFESDKSPAEFAKFYKADELLGGHV 129
Query: 143 VLIMFTGYLVGYLAFRALFSHSTAMSAAGGI 173
+++ A AL ++ + GGI
Sbjct: 130 IMLGANPSSQA-AALEALRAYPGGLQVGGGI 159
>gi|326435904|gb|EGD81474.1| hypothetical protein PTSG_02192 [Salpingoeca sp. ATCC 50818]
Length = 315
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 62/145 (42%), Gaps = 11/145 (7%)
Query: 71 PDLFRLFSGSEFV---FTSPKPREKSEELKARLRQLAERAERDEYRELVKDILPKSS-AT 126
P + RL S+ V P+P S EL R ++L EY ++VKD+ PK T
Sbjct: 132 PRMHRLMRRSQIVRQPLIEPEP---SPELVERRKKLKAHIANLEYNKMVKDLEPKRDFDT 188
Query: 127 EPFSSYKDQLGF--GLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVET 184
P + Q F ++V+ + + G+ A F + + G +V G V
Sbjct: 189 SPMEVREVQRQFTVAINVLATLVACFFFGFYASTYAFEKTGSRVLFGIACAVVAG--VAE 246
Query: 185 LLFIIRSSNHDNKSSATASKQKKNQ 209
L F++R + + ++Q+K Q
Sbjct: 247 LWFLLRIAMQKEAEAERKAQQEKEQ 271
>gi|443699171|gb|ELT98781.1| hypothetical protein CAPTEDRAFT_191829 [Capitella teleta]
Length = 201
Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 43/182 (23%), Positives = 83/182 (45%), Gaps = 28/182 (15%)
Query: 23 IRSFLLSASEDVQLPRELRETALNLSSLNKA---------PYKSIRQIWVGSLPSIRPDL 73
IR+ LL+A+ LP ++RE + +S ++ P+ ++ + L + DL
Sbjct: 19 IRTILLTAN----LPHDVREELVKYTSGRRSTDDRKRRSIPFDLVKTVH-KHLSEEQQDL 73
Query: 74 F--RLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI------LPKSSA 125
+ L GSE + R + EL+AR+++L + + EY ++ K++ P+ S
Sbjct: 74 YLHELLEGSEVYLHETESRTRDPELEARVQKLKDELAQREYEQMTKNVGFQQKRNPEESI 133
Query: 126 TEPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETL 185
+ S QL +H+ + T +VG F A + ++ + + GM+V T
Sbjct: 134 GKQVRSLNKQL---IHIFNFLLT--VVGSFMF-AYKATEYSLEKPNIAVQICSGMVVGTA 187
Query: 186 LF 187
+F
Sbjct: 188 VF 189
>gi|67481417|ref|XP_656058.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56473235|gb|EAL50674.1| hypothetical protein EHI_117980 [Entamoeba histolytica HM-1:IMSS]
gi|449702001|gb|EMD42716.1| Hypothetical protein EHI5A_002580 [Entamoeba histolytica KU27]
Length = 157
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 2/84 (2%)
Query: 111 EYRELVKDILPKSSATEPFSSYKDQLGFGLHVVL-IMFTGYLVGY-LAFRALFSHSTAMS 168
EY + KD+ K + F + ++ G+ L IM T + G L F +
Sbjct: 46 EYNHMTKDLQEKYGTSLKFKNNQNDNEIGISEALNIMVTMIVCGCALGFIGYGYGGPVYA 105
Query: 169 AAGGILGLVCGMLVETLLFIIRSS 192
GG +G+ GM+V+TLL II S+
Sbjct: 106 GLGGAIGIFIGMIVDTLLVIIYSN 129
>gi|344999797|ref|YP_004802651.1| Lanthionine synthetase C family protein [Streptomyces sp.
SirexAA-E]
gi|344315423|gb|AEN10111.1| Lanthionine synthetase C family protein [Streptomyces sp.
SirexAA-E]
Length = 1045
Score = 38.5 bits (88), Expect = 1.6, Method: Composition-based stats.
Identities = 28/101 (27%), Positives = 52/101 (51%), Gaps = 4/101 (3%)
Query: 30 ASEDVQLPRELRETALNLSSLNKAPYKSIRQIWVGSLP--SIRPDLFRLFSGSEFVFTSP 87
A+E ++ LR AL S+L + I ++W G +P + RPD L+SG+ P
Sbjct: 550 ATERQRVFALLRTGALGPSALPGLEDEEIAELWRGDVPVFTTRPDTTDLWSGTGRAVPGP 609
Query: 88 KPREKSEELKARLRQLAERAERDEYRELVKDILPKSSATEP 128
E ++A++R L + +R + +++ + S++TEP
Sbjct: 610 AGATGLERVEAKVRAL-DTVDRQDQERIIRTAM-ASTSTEP 648
>gi|281203106|gb|EFA77307.1| hypothetical protein PPL_12518 [Polysphondylium pallidum PN500]
Length = 897
Score = 37.4 bits (85), Expect = 4.1, Method: Composition-based stats.
Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 2/111 (1%)
Query: 83 VFTSPKPREKSEELKARLRQLAERAERDE-YRELVKDILPKSSATEPFSSYKDQLGFGLH 141
V+ PKP E + K RQ + ++ Y+++VK++ + S Q GL+
Sbjct: 109 VYIEPKPVEIEDPEKIAKRQYLQGLYNEQRYKKMVKNVAGEREDAMELGSISSQASIGLN 168
Query: 142 VVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLVETLLFIIRSS 192
++ FT G ++ A G G++ G+ E LF+I+SS
Sbjct: 169 ILATFFTLLACGLFLGGKMWDSYVLGLALGLAFGII-GVAAEVWLFVIKSS 218
>gi|325183005|emb|CCA17459.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 189
Score = 36.6 bits (83), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 51/101 (50%), Gaps = 9/101 (8%)
Query: 76 LFSGSEFVFTSPKPREK--SEELKARLRQLAERAERDE----YRELVKDILPKSSATE-- 127
L + S+ + T P ++ SEE KA+L + ++ +R E Y + ++I ++ E
Sbjct: 74 LINRSKVIVTKASPIKEVVSEEEKAQLEKRKKKLQRIEEEARYHRMTRNIKHLTATAEIQ 133
Query: 128 -PFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAM 167
S + L G+++++ T ++VGY+ AL H T +
Sbjct: 134 QSIKSVRQHLTTGINMIVARITMFVVGYMISAALTDHETTV 174
>gi|242025578|ref|XP_002433201.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212518742|gb|EEB20463.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 438
Score = 36.6 bits (83), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 29/146 (19%), Positives = 66/146 (45%), Gaps = 12/146 (8%)
Query: 71 PDLFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI-LPKSSATEPF 129
P L L GS+ + + E++ +L R+ +L + E EY+ + +++ L K E
Sbjct: 279 PYLHELMKGSDVILPKNEVVERNPDLLKRIEKLKKEQEEREYKAMTQNVNLDKKHFPEDS 338
Query: 130 SSYKDQLGFGLHVVLI------MFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGML-- 181
SY+ + H++ + + G+ G+L + + +LG+ C ++
Sbjct: 339 VSYQMNM-MNRHLIAVGQLLVSVGAGFAFGFLGVELIVGNLNF--GIRLLLGIACALIIA 395
Query: 182 VETLLFIIRSSNHDNKSSATASKQKK 207
V F+ + N ++K T +++++
Sbjct: 396 VAEFYFLAKHLNEEDKKEKTTTRRER 421
>gi|167389302|ref|XP_001738906.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897659|gb|EDR24744.1| hypothetical protein EDI_206920 [Entamoeba dispar SAW760]
Length = 157
Score = 36.6 bits (83), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 2/84 (2%)
Query: 111 EYRELVKDILPKSSATEPFSSYKDQLGFGLHVVL-IMFTGYLVGY-LAFRALFSHSTAMS 168
EY + KD+ K F + + G+ L IM T + G L F +
Sbjct: 46 EYNHMTKDLQEKYGTGIKFKNNQSDNEIGISEALNIMVTMIVCGCALGFIGYGYGGPVYA 105
Query: 169 AAGGILGLVCGMLVETLLFIIRSS 192
GG +G+ GM+V+TLL II S+
Sbjct: 106 GLGGAIGIFIGMIVDTLLVIIYSN 129
>gi|383849063|ref|XP_003700166.1| PREDICTED: transmembrane protein 199-like [Megachile rotundata]
Length = 222
Score = 36.2 bits (82), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 29/143 (20%), Positives = 60/143 (41%), Gaps = 10/143 (6%)
Query: 73 LFRLFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI------LPKSSAT 126
L L ++ +PK ++ EL+AR+++L + + EY+ + K+I P S
Sbjct: 78 LHELLEEADIKLPTPKHIPRNPELEARIQKLKAQQDAREYQAMTKNIDSSRRKFPDDSIA 137
Query: 127 EPFSSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGGILGLVCGMLV--ET 184
QL + + G++ G+ + + +LG++C +++
Sbjct: 138 YQMKQINRQLIAVAQFIFSVLAGFVFGFKGVELIVGNLD--FGFRLLLGIICALIIALAE 195
Query: 185 LLFIIRSSNHDNKSSATASKQKK 207
+ F+ N D +A+A KK
Sbjct: 196 IYFLAIKLNEDGYEAASAPMPKK 218
>gi|430811677|emb|CCJ30874.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 939
Score = 36.2 bits (82), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 59/130 (45%), Gaps = 14/130 (10%)
Query: 54 PYKSIRQIWVGSLPSIRPDLFRLFSGSEFV-FTSPKPREKSEELKARLRQLAERAERDEY 112
PY S +++ SL SI +L RL+SGS + + R++ + L+ RL +E
Sbjct: 129 PYFSTKELSKMSLRSILENLKRLYSGSYGIEYIHIPDRQQCDWLRERLEIPIPYQYNNEE 188
Query: 113 RELVKDILPKSSATEPF--SSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAA 170
++ + D L +S E F + Y + FGL + G +AL HS +
Sbjct: 189 KQRILDRLIRSDLFEKFLATKYPNDKRFGLEGCESLIPG-------MKALIDHSVEL--- 238
Query: 171 GGILGLVCGM 180
GI +V GM
Sbjct: 239 -GIKSIVIGM 247
>gi|430811116|emb|CCJ31407.1| unnamed protein product [Pneumocystis jirovecii]
Length = 939
Score = 36.2 bits (82), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 59/130 (45%), Gaps = 14/130 (10%)
Query: 54 PYKSIRQIWVGSLPSIRPDLFRLFSGSEFV-FTSPKPREKSEELKARLRQLAERAERDEY 112
PY S +++ SL SI +L RL+SGS + + R++ + L+ RL +E
Sbjct: 107 PYFSTKELSKMSLRSILENLKRLYSGSYGIEYIHIPDRQQCDWLRERLEIPIPYQYNNEE 166
Query: 113 RELVKDILPKSSATEPF--SSYKDQLGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAA 170
++ + D L +S E F + Y + FGL + G +AL HS +
Sbjct: 167 KQRILDRLIRSDLFEKFLATKYPNDKRFGLEGCESLIPG-------MKALIDHSVEL--- 216
Query: 171 GGILGLVCGM 180
GI +V GM
Sbjct: 217 -GIKSIVIGM 225
>gi|397592421|gb|EJK55660.1| hypothetical protein THAOC_24581 [Thalassiosira oceanica]
Length = 286
Score = 36.2 bits (82), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 60/144 (41%), Gaps = 15/144 (10%)
Query: 79 GSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVKDI---LPKSSATEPFSSYKDQ 135
GS VF +P+ + + RL +L ++E Y +L ++ + T +Y
Sbjct: 121 GSRLVFETPQKKAADPTYQKRLERLRLKSEERMYSKLTTNLQQTVRDDDVTAKSMTYATS 180
Query: 136 LGFGLHVVLIMFTGYLVGYLAFRALFSHSTAMSAAGG-------ILGLVCG---MLVETL 185
+G + V I F G + + A GG I G+V G + VE +
Sbjct: 181 IGLNMVVAPISF-GVFMYFFAGSIFSRFFDEQDDVGGNIDIRRVIAGVVSGVFMLFVEMI 239
Query: 186 LFIIRSSNHDNKSSATASKQKKNQ 209
LF+IRS D S SK+K+ Q
Sbjct: 240 LFVIRSHEID-ASVRKKSKRKEYQ 262
>gi|339242967|ref|XP_003377409.1| putative EH domain-containing protein 1 [Trichinella spiralis]
gi|316973791|gb|EFV57346.1| putative EH domain-containing protein 1 [Trichinella spiralis]
Length = 809
Score = 35.8 bits (81), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 67/144 (46%), Gaps = 14/144 (9%)
Query: 76 LFSGSEFVFTSPKPREKSEELKARLRQLAERAERDEYRELVK--DILPKSSATEPFSSYK 133
L SG + T PK RE++ EL+AR+++L + E+ EY + D++ F S++
Sbjct: 66 LQSGDIILPTLPK-RERNPELEARIQKLKKEQEQREYDRMTSNVDVVQLHRQRNKFGSFQ 124
Query: 134 D-------QLGFGLHVVLIMFTGYLVGYLAFRALF-SHSTAMSAAGGILGLVCGMLVETL 185
QL L+ +L + +L G+ + + + S + A G+L V L + L
Sbjct: 125 QDLAILNRQLIIILNTLLTVICSFLFGFYSVQFVSPSSDYTLRVAVGLLFAVVVFLSDIL 184
Query: 186 LFIIRSSNHDNKSSATASKQKKNQ 209
++ +H +TA +Q Q
Sbjct: 185 ---VQIHSHLKLVMSTAKEQNAKQ 205
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.131 0.360
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,950,108,930
Number of Sequences: 23463169
Number of extensions: 111719028
Number of successful extensions: 428101
Number of sequences better than 100.0: 84
Number of HSP's better than 100.0 without gapping: 36
Number of HSP's successfully gapped in prelim test: 48
Number of HSP's that attempted gapping in prelim test: 428041
Number of HSP's gapped (non-prelim): 87
length of query: 209
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 73
effective length of database: 9,168,204,383
effective search space: 669278919959
effective search space used: 669278919959
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 73 (32.7 bits)