BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026147
(242 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297831700|ref|XP_002883732.1| hypothetical protein ARALYDRAFT_480213 [Arabidopsis lyrata subsp.
lyrata]
gi|297329572|gb|EFH59991.1| hypothetical protein ARALYDRAFT_480213 [Arabidopsis lyrata subsp.
lyrata]
Length = 288
Score = 318 bits (816), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 157/239 (65%), Positives = 187/239 (78%), Gaps = 6/239 (2%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKKEDLSI-VSIPQ-- 57
MSTI + R +L+ + + ++F R W + +DS+SS T+KKE+LS+ +SIP
Sbjct: 1 MSTISIHRTELLRITHNRSRISRQRFRRTIPLWKLTIDSRSSDTSKKEELSVKISIPPQV 60
Query: 58 ---RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNE 114
R LRFDRLQP ++E DR EFGKFVAREA++DEEYWTAAWLRAESHWE R+NE
Sbjct: 61 DQLRPEGLRFDRLQPPEQEFGHEDRLEFGKFVAREAMVDEEYWTAAWLRAESHWEDRSNE 120
Query: 115 RYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIR 174
RYVDN+KRKFAEQEFNAIKRRC+G+ GQ+ CIVAVKK+E ++KR+V+KSVVGTLDLSIR
Sbjct: 121 RYVDNYKRKFAEQEFNAIKRRCKGMQGQKCSCIVAVKKEEKHIKRSVIKSVVGTLDLSIR 180
Query: 175 YLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
Y LQGE FPGE+V LF INR G NRYGYIANLCVAKSARRQGIA NML FAVESA+
Sbjct: 181 YFLQGETFPGEKVKSQLFCSINREGSNRYGYIANLCVAKSARRQGIACNMLRFAVESAR 239
>gi|30678254|ref|NP_671784.2| acyl-CoA N-acyltransferases-like protein [Arabidopsis thaliana]
gi|27413497|gb|AAO11666.1| hypothetical protein [Arabidopsis thaliana]
gi|61742576|gb|AAX55109.1| hypothetical protein At2g06025 [Arabidopsis thaliana]
gi|62318765|dbj|BAD93799.1| hypothetical protein [Arabidopsis thaliana]
gi|330250902|gb|AEC05996.1| acyl-CoA N-acyltransferases-like protein [Arabidopsis thaliana]
Length = 288
Score = 313 bits (803), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 156/239 (65%), Positives = 184/239 (76%), Gaps = 6/239 (2%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKKEDLSI-VSIP--- 56
MSTI + R L+ +++ R W + ++S+SS T+KKE+LS+ +SIP
Sbjct: 1 MSTISIHRTETLRITHARSRIFRQRYRRTIPLWKLTINSRSSDTSKKEELSVQISIPPQV 60
Query: 57 --QRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNE 114
R LRFDRLQP + E DRFEFGKFVAREA+LDEEYWTAAWLRAESHWE R+NE
Sbjct: 61 DQSRPEGLRFDRLQPPEPEFGHEDRFEFGKFVAREAMLDEEYWTAAWLRAESHWEDRSNE 120
Query: 115 RYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIR 174
RYVDN+KRKFAEQEFNAIKRRC+G+ GQ+ CIVAVKK+E ++KR+V+KSVVGTLDLSIR
Sbjct: 121 RYVDNYKRKFAEQEFNAIKRRCKGMQGQKCSCIVAVKKEEKHIKRSVIKSVVGTLDLSIR 180
Query: 175 YLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
Y LQGE FPGE+V LF IN+ G NRYGYIANLCVAKSARRQGIA NML FAVESA+
Sbjct: 181 YFLQGETFPGEKVKSQLFCSINQEGSNRYGYIANLCVAKSARRQGIACNMLRFAVESAR 239
>gi|388491720|gb|AFK33926.1| unknown [Lotus japonicus]
Length = 302
Score = 305 bits (781), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/253 (60%), Positives = 186/253 (73%), Gaps = 19/253 (7%)
Query: 2 STILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKK---------EDLSI 52
STI + RP +G H KF RI+A+WT +DSK S+T E+L +
Sbjct: 5 STIPIPRPEFHTLFFNGSPIPH-KFPRIAASWTTTLDSKFSQTMNSNSMKKMKKKEELPV 63
Query: 53 --VSIPQ-------RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLR 103
+S P S++L+FDRLQPSD EL++ +RFEFG+FVAREA+LDEEYWTAAWLR
Sbjct: 64 QQLSTPPVSKVETLSSDNLQFDRLQPSDHELVRENRFEFGQFVAREAVLDEEYWTAAWLR 123
Query: 104 AESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLK 163
AESHWE RT +RY+DN+KRKFA+QEFNA+KRRC+ G CI+ VKK++ N+K +V+K
Sbjct: 124 AESHWENRTYDRYIDNYKRKFADQEFNAVKRRCKVQTGDNCMCIITVKKEQKNLKHSVIK 183
Query: 164 SVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASN 223
SVVGTLDL+ RYLLQGE FPGERV P+F INR PN+YGYIANLCVAKSARR GIASN
Sbjct: 184 SVVGTLDLNFRYLLQGETFPGERVKAPIFCTINRTAPNKYGYIANLCVAKSARRHGIASN 243
Query: 224 MLYFAVESAKSNA 236
M+YFAVE+AKSN
Sbjct: 244 MMYFAVETAKSNG 256
>gi|356556386|ref|XP_003546507.1| PREDICTED: uncharacterized protein LOC100808675 [Glycine max]
Length = 302
Score = 305 bits (780), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 160/256 (62%), Positives = 186/256 (72%), Gaps = 22/256 (8%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTK-----------KED 49
MSTI + RP +G + K RI+A+WTM MDSK S TTK KE+
Sbjct: 4 MSTIPIHRPEFRTLFFNGS-PSPCKCLRITASWTMTMDSKFSPTTKNSNDNNYNLKKKEE 62
Query: 50 LSIVSIPQ---------RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAA 100
LS V IP SNDL+FDRLQPSD+EL + RFEFG+FVAREA+LDEEYWTAA
Sbjct: 63 LS-VQIPTPPISKVETLNSNDLQFDRLQPSDQELGRVKRFEFGQFVAREAVLDEEYWTAA 121
Query: 101 WLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRT 160
WLRAESHWE R +RYVDN+KRKFAEQEFNA+KRRC+ NG CI+ V+K++ N K +
Sbjct: 122 WLRAESHWEDRPYDRYVDNYKRKFAEQEFNALKRRCKVQNGDSCACIITVRKEQKNAKHS 181
Query: 161 VLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGI 220
VLKSVVGTLDL+IRYLLQGE +PGERV PLF INR P+RYGYIANLCV KS RRQGI
Sbjct: 182 VLKSVVGTLDLNIRYLLQGETYPGERVKAPLFCSINRTPPSRYGYIANLCVIKSVRRQGI 241
Query: 221 ASNMLYFAVESAKSNA 236
ASNM+ FA+E+AKSN
Sbjct: 242 ASNMMSFAIEAAKSNG 257
>gi|356530427|ref|XP_003533783.1| PREDICTED: uncharacterized protein LOC100795426 [Glycine max]
Length = 304
Score = 302 bits (773), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 156/257 (60%), Positives = 184/257 (71%), Gaps = 22/257 (8%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKK------------- 47
MSTI + RP +G + K RI+A+W M MDSK S T K
Sbjct: 4 MSTIPIHRPEFCTLFFNGSPSPY-KCPRITASWIMAMDSKFSSTIKNNSDNDYNLKKKKD 62
Query: 48 EDLSI-VSIPQ-------RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTA 99
E+LS+ +S P SNDL+FDRLQPSD+EL + RFEFG+FVARE +LDEEYWTA
Sbjct: 63 EELSVQISTPPISKVETLNSNDLQFDRLQPSDQELGRVKRFEFGQFVARETVLDEEYWTA 122
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAESHWE R ERYVDN+KRKFAEQEFNA+KRRC+ NG CI+ V+K++ N K
Sbjct: 123 AWLRAESHWEDRPFERYVDNYKRKFAEQEFNALKRRCKVQNGDSCACIITVRKEQKNAKH 182
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
++LKSVVGTLDL+IRYLLQGE +PGERV PLF INR P+RYGYIANLCV KSARRQG
Sbjct: 183 SILKSVVGTLDLNIRYLLQGETYPGERVKAPLFCSINRTPPSRYGYIANLCVIKSARRQG 242
Query: 220 IASNMLYFAVESAKSNA 236
IASNM+ FA+E+AKSN
Sbjct: 243 IASNMMSFAIEAAKSNG 259
>gi|297738568|emb|CBI27813.3| unnamed protein product [Vitis vinifera]
Length = 523
Score = 289 bits (740), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 145/206 (70%), Positives = 167/206 (81%), Gaps = 9/206 (4%)
Query: 37 MDSKSSRTTKKEDLSI----VSIPQ----RSNDLRFDRLQPSDRELLQHDRFEFGKFVAR 88
MDS SS T KKE+ SI S+PQ R ++L+FD+LQPSD E + R EFG F+AR
Sbjct: 271 MDSNSS-TRKKEEFSIEIQAPSVPQFRTERPSNLQFDQLQPSDEEFNEGSRAEFGPFMAR 329
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIV 148
A+LDEEYWTAAWLRAESHWEGR NERY +N+KRKFAEQEFNA+KRRCRG N Q++ CIV
Sbjct: 330 AAVLDEEYWTAAWLRAESHWEGRPNERYAENYKRKFAEQEFNALKRRCRGQNDQKYTCIV 389
Query: 149 AVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIAN 208
AVKK+E VK+TVLKSVVGTLDLSIRYLL G+ FPGE+V PLF INR ++YGY+AN
Sbjct: 390 AVKKEERGVKQTVLKSVVGTLDLSIRYLLNGDTFPGEQVKAPLFCSINRTRSSKYGYVAN 449
Query: 209 LCVAKSARRQGIASNMLYFAVESAKS 234
LCVAKS RRQGIASNML+FAVESAKS
Sbjct: 450 LCVAKSVRRQGIASNMLHFAVESAKS 475
>gi|225444742|ref|XP_002278105.1| PREDICTED: uncharacterized protein LOC100254310 [Vitis vinifera]
Length = 254
Score = 288 bits (736), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 145/206 (70%), Positives = 167/206 (81%), Gaps = 9/206 (4%)
Query: 37 MDSKSSRTTKKEDLSI----VSIPQ----RSNDLRFDRLQPSDRELLQHDRFEFGKFVAR 88
MDS SS T KKE+ SI S+PQ R ++L+FD+LQPSD E + R EFG F+AR
Sbjct: 1 MDSNSS-TRKKEEFSIEIQAPSVPQFRTERPSNLQFDQLQPSDEEFNEGSRAEFGPFMAR 59
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIV 148
A+LDEEYWTAAWLRAESHWEGR NERY +N+KRKFAEQEFNA+KRRCRG N Q++ CIV
Sbjct: 60 AAVLDEEYWTAAWLRAESHWEGRPNERYAENYKRKFAEQEFNALKRRCRGQNDQKYTCIV 119
Query: 149 AVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIAN 208
AVKK+E VK+TVLKSVVGTLDLSIRYLL G+ FPGE+V PLF INR ++YGY+AN
Sbjct: 120 AVKKEERGVKQTVLKSVVGTLDLSIRYLLNGDTFPGEQVKAPLFCSINRTRSSKYGYVAN 179
Query: 209 LCVAKSARRQGIASNMLYFAVESAKS 234
LCVAKS RRQGIASNML+FAVESAKS
Sbjct: 180 LCVAKSVRRQGIASNMLHFAVESAKS 205
>gi|147801241|emb|CAN72326.1| hypothetical protein VITISV_041246 [Vitis vinifera]
Length = 928
Score = 276 bits (707), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 143/209 (68%), Positives = 164/209 (78%), Gaps = 12/209 (5%)
Query: 37 MDSKSSRTTKKEDLSI----VSIPQ----RSNDLRFDRLQPSDRELLQHDRFEFGKFVAR 88
MDS SS T KKE+ SI S+PQ R ++L+FD+LQPSD E + R EFG F+AR
Sbjct: 671 MDSNSS-TRKKEEFSIEIQAPSVPQFRTERPSNLQFDQLQPSDEEFNEGSRAEFGPFMAR 729
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIV 148
A+LDEEYWTAAWLRAE HWEGR NERY +N+KRKFAEQEFNA+KRRCRG N Q++ CIV
Sbjct: 730 AAVLDEEYWTAAWLRAEXHWEGRPNERYAENYKRKFAEQEFNALKRRCRGQNDQKYTCIV 789
Query: 149 A---VKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGY 205
A VKK+E VK+TVLKSVVGTLDLSIRYLL G FPGE+V PLF IN ++YGY
Sbjct: 790 ANYQVKKEERGVKQTVLKSVVGTLDLSIRYLLNGXTFPGEQVKAPLFCSINXTXSSKYGY 849
Query: 206 IANLCVAKSARRQGIASNMLYFAVESAKS 234
+ANLCVAKS RRQGIASNML+FAVESAKS
Sbjct: 850 VANLCVAKSVRRQGIASNMLHFAVESAKS 878
>gi|224069136|ref|XP_002326283.1| predicted protein [Populus trichocarpa]
gi|222833476|gb|EEE71953.1| predicted protein [Populus trichocarpa]
Length = 257
Score = 275 bits (704), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 139/209 (66%), Positives = 164/209 (78%), Gaps = 13/209 (6%)
Query: 39 SKSSRTTKKEDLSIVSIPQRS-----------NDLRFDRLQPSDRELLQHDRFEFGKFVA 87
S SS T KKE+LSI +P S +DLRFDRLQ ++EL+ +FEFG+FVA
Sbjct: 5 SGSSETRKKEELSI-QVPATSISSSETQRFAGSDLRFDRLQVPEKELMHDRKFEFGQFVA 63
Query: 88 REALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCI 147
REA++DEEYWTAAWLRAESHWEGR N+RYVDN KRKFAEQEF+AIKRR L+GQ+ CI
Sbjct: 64 REAVIDEEYWTAAWLRAESHWEGR-NDRYVDNHKRKFAEQEFHAIKRRRTSLHGQKCRCI 122
Query: 148 VAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIA 207
+ V+K++ +VKRTVLK VVGTLDLS+R LL GE FPGERV PLF I+ GPNRYGY+A
Sbjct: 123 IMVRKEDKHVKRTVLKGVVGTLDLSVRCLLHGETFPGERVKAPLFCSIHGTGPNRYGYVA 182
Query: 208 NLCVAKSARRQGIASNMLYFAVESAKSNA 236
NLCVAKSARRQGIA+NML+FA+E KSN
Sbjct: 183 NLCVAKSARRQGIATNMLHFAIELVKSNG 211
>gi|449452562|ref|XP_004144028.1| PREDICTED: uncharacterized protein LOC101207838 [Cucumis sativus]
gi|449500482|ref|XP_004161109.1| PREDICTED: uncharacterized protein LOC101229607 [Cucumis sativus]
Length = 299
Score = 259 bits (662), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 138/244 (56%), Positives = 172/244 (70%), Gaps = 13/244 (5%)
Query: 2 STILVQRPAVLKFHRDGGFRNHRKFSRISA--TWTMRMDSKSSRTTKKEDLSI-VSIPQR 58
+ I + RP L +D RNH KF R SA +W M MD KS +T KKE++SI +S P
Sbjct: 3 AAISIYRPEFLGSVQDRC-RNHLKFHRTSAFASWNMTMDYKSHQTMKKEEVSIQISTPLL 61
Query: 59 S--------NDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEG 110
+ L+FDR P D +L+ + EFG+FVAREA++DEE WTAAWLRAESHWE
Sbjct: 62 LPKLKPLAWSGLQFDRPPPDDEDLIHLRKLEFGQFVAREAVIDEELWTAAWLRAESHWEN 121
Query: 111 RTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLD 170
R N+RYVD+FKRKFAEQEFNAIK+RC G GQ CIV V+K++ ++KRTV+KSVV TLD
Sbjct: 122 RQNDRYVDSFKRKFAEQEFNAIKKRCGGQYGQTCTCIVTVRKEQKHIKRTVIKSVVATLD 181
Query: 171 LSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVE 230
L +R+L+ GE FPGER + IN+ PN+Y YI+NLCV K+ARRQGIA NML FAV
Sbjct: 182 LCLRHLMHGETFPGEREKSHVCS-INKEIPNKYAYISNLCVLKAARRQGIAGNMLKFAVL 240
Query: 231 SAKS 234
+AKS
Sbjct: 241 TAKS 244
>gi|356545766|ref|XP_003541306.1| PREDICTED: uncharacterized protein LOC100784591 [Glycine max]
Length = 282
Score = 239 bits (610), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 128/219 (58%), Positives = 155/219 (70%), Gaps = 13/219 (5%)
Query: 26 FSRISATWTMRMDSKSSRTTKKEDLSIVSIPQRSN--------DLRFDRLQPSDRELLQH 77
F RIS ++ M + TKK+ V +P ++ DL F RLQPSD EL H
Sbjct: 17 FFRIS---SLAMTIDTDFLTKKKKQVSVQLPSLTSPVSTVRFCDLNFGRLQPSDDELDPH 73
Query: 78 DRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCR 137
+RFEFG FVAR+ALL EEYWTAAWLRAE W RT++ YV N KR FA+QEF+AIK+RC+
Sbjct: 74 NRFEFGNFVARQALLHEEYWTAAWLRAE-EWASRTHKLYVVNHKRNFADQEFDAIKKRCK 132
Query: 138 -GLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCIN 196
+GQ CI+ V+K + NVKR+++ SVVGTLDL+IRYL GE FPGE V+ P F I
Sbjct: 133 EQQDGQSSTCIITVRKQDKNVKRSIISSVVGTLDLNIRYLRLGETFPGEPVDAPRFCKIE 192
Query: 197 RRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSN 235
R +RYGYIANLCVAKS RR+GIASNMLYFAVESAKS+
Sbjct: 193 RTPSSRYGYIANLCVAKSVRRKGIASNMLYFAVESAKSS 231
>gi|356564758|ref|XP_003550615.1| PREDICTED: uncharacterized protein LOC100800028 [Glycine max]
Length = 281
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 121/210 (57%), Positives = 147/210 (70%), Gaps = 9/210 (4%)
Query: 35 MRMDSKSSRTTKKE---DLSIVSIPQRSN----DLRFDRLQPSDRELLQHDRFEFGKFVA 87
M +D+ T KK+ LS + P S DL F RLQPSD EL H RFEFG FVA
Sbjct: 27 MTIDTDFLNTKKKQVSFQLSSLITPPVSTVRFCDLNFGRLQPSDEELGPHKRFEFGNFVA 86
Query: 88 REALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGL-NGQRHYC 146
REALL+EEYW+AA LRAE W RT++ YV + +R FAEQEFNAIK+RC+ L +G C
Sbjct: 87 REALLEEEYWSAACLRAE-EWANRTDKLYVVDRQRNFAEQEFNAIKKRCKELQDGHSSTC 145
Query: 147 IVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYI 206
I+ V+K + NVK +++SVVGTLDL+I YL +GE FPGE V+ P F I+R +RYGYI
Sbjct: 146 IITVRKPQKNVKLPIIESVVGTLDLNIIYLRRGETFPGELVDSPPFCKIDRTASSRYGYI 205
Query: 207 ANLCVAKSARRQGIASNMLYFAVESAKSNA 236
ANLCVAKS R+G+AS MLYFAVESAKS
Sbjct: 206 ANLCVAKSLYRKGVASKMLYFAVESAKSTG 235
>gi|326496873|dbj|BAJ98463.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326521206|dbj|BAJ96806.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 309
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 94/179 (52%), Positives = 134/179 (74%), Gaps = 4/179 (2%)
Query: 56 PQRSNDLRFDRLQP-SDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNE 114
P +DLRF+RL+P +++ ++ R+ FG++VAREA++DEEYW AAWLRAE H+E ++ +
Sbjct: 84 PAEPSDLRFNRLRPPTEKSDCKYTRY-FGRYVAREAIVDEEYWIAAWLRAEDHYEDQSGD 142
Query: 115 RYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIR 174
RYV++FKRKFA QEF+A+K+RC G+++ C VAVK D+ V RTVL SVVGT+DL +R
Sbjct: 143 RYVESFKRKFASQEFHALKKRCSRQVGEKYTCFVAVKNDD--VTRTVLNSVVGTVDLCVR 200
Query: 175 YLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
+ L GE +P E N P + I + ++GY+ N+CVAK ARRQGIASNML A+++A+
Sbjct: 201 HPLYGETYPAEPGNMPFYSRIYQPDQPKFGYLTNVCVAKYARRQGIASNMLLLAIDAAR 259
>gi|115451085|ref|NP_001049143.1| Os03g0177000 [Oryza sativa Japonica Group]
gi|108706472|gb|ABF94267.1| GCN5-related N-acetyltransferase, putative, expressed [Oryza sativa
Japonica Group]
gi|113547614|dbj|BAF11057.1| Os03g0177000 [Oryza sativa Japonica Group]
gi|215701031|dbj|BAG92455.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218192192|gb|EEC74619.1| hypothetical protein OsI_10235 [Oryza sativa Indica Group]
gi|222624295|gb|EEE58427.1| hypothetical protein OsJ_09628 [Oryza sativa Japonica Group]
Length = 306
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 101/198 (51%), Positives = 139/198 (70%), Gaps = 4/198 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPSDREL-LQHDRFEFGKFVAREALLDEEYWTA 99
SS + +++ P +DLRF+RL+PS E ++ RF FG +VAREA++DEEYW A
Sbjct: 68 SSLSNPMSEVTTPFHPAAPSDLRFNRLRPSVEESDCKYKRF-FGCYVAREAIIDEEYWIA 126
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAE+ +E ++++RYV++FKRKFA QEF+A+KRRC L G+++ C VAVK D ++KR
Sbjct: 127 AWLRAENRYEDQSSDRYVESFKRKFASQEFHALKRRCSKLQGEKYICFVAVKND--DLKR 184
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
TVL SVVGTLD+ IR+ L GE FP E I + ++GY+ N+CVAK ARRQG
Sbjct: 185 TVLNSVVGTLDVCIRHPLHGETFPAEPGKSSFHCRIYQPDQPKFGYLTNVCVAKYARRQG 244
Query: 220 IASNMLYFAVESAKSNAG 237
IASNML A+++A+ N
Sbjct: 245 IASNMLLLAIDAARLNGA 262
>gi|225435967|ref|XP_002270467.1| PREDICTED: uncharacterized protein LOC100244484 [Vitis vinifera]
Length = 289
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 102/184 (55%), Positives = 132/184 (71%), Gaps = 5/184 (2%)
Query: 58 RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYV 117
R +L F+RLQ +D E R FG+FVAREA+LDEEYWTAAWLRAE+HWE + R+V
Sbjct: 63 RLPELSFNRLQLTDEEYCGLQRRNFGRFVAREAVLDEEYWTAAWLRAEAHWESLSYMRHV 122
Query: 118 DNFKRKFAEQEFNAIKRRCRGLNGQ--RHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRY 175
DN+KRK+AE+EF A+KRRC G +G + +C VAV K+E N++RTVL SVVGTLDLSIR
Sbjct: 123 DNYKRKYAEEEFYALKRRCAGQDGNLLKCFCFVAV-KEEKNIRRTVLNSVVGTLDLSIRQ 181
Query: 176 LLQGENFPGE-RVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
++GE +PGE + + ++Y YIAN+CV+K ARRQGIA NML A + A S
Sbjct: 182 YIRGETYPGEIKRLSSVLASPEPFDAHKYAYIANVCVSKFARRQGIALNMLSLANDVA-S 240
Query: 235 NAGI 238
+AG+
Sbjct: 241 SAGM 244
>gi|413956845|gb|AFW89494.1| hypothetical protein ZEAMMB73_408635 [Zea mays]
Length = 266
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 98/197 (49%), Positives = 136/197 (69%), Gaps = 4/197 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW A
Sbjct: 66 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGCYVAREAVIDEEYWIA 124
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAE +E + RYV++FKRKFA +EF+A+K+RC + +++ C+VAVK D+ ++R
Sbjct: 125 AWLRAEDQYENESGNRYVESFKRKFASKEFHALKKRCNTQHREKYICLVAVKNDD--IRR 182
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
TVL SVVGTLD+ +R+ L GE FP E P L I + ++GY+ N+CVAK ARRQG
Sbjct: 183 TVLNSVVGTLDVCVRHPLYGEKFPEEPGKPSLHCRIYQPDQPKFGYVTNVCVAKYARRQG 242
Query: 220 IASNMLYFAVESAKSNA 236
IASNML A+++AK N
Sbjct: 243 IASNMLLLAIDAAKLNG 259
>gi|296083942|emb|CBI24330.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 102/184 (55%), Positives = 132/184 (71%), Gaps = 5/184 (2%)
Query: 58 RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYV 117
R +L F+RLQ +D E R FG+FVAREA+LDEEYWTAAWLRAE+HWE + R+V
Sbjct: 62 RLPELSFNRLQLTDEEYCGLQRRNFGRFVAREAVLDEEYWTAAWLRAEAHWESLSYMRHV 121
Query: 118 DNFKRKFAEQEFNAIKRRCRGLNGQ--RHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRY 175
DN+KRK+AE+EF A+KRRC G +G + +C VAV K+E N++RTVL SVVGTLDLSIR
Sbjct: 122 DNYKRKYAEEEFYALKRRCAGQDGNLLKCFCFVAV-KEEKNIRRTVLNSVVGTLDLSIRQ 180
Query: 176 LLQGENFPGE-RVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
++GE +PGE + + ++Y YIAN+CV+K ARRQGIA NML A + A S
Sbjct: 181 YIRGETYPGEIKRLSSVLASPEPFDAHKYAYIANVCVSKFARRQGIALNMLSLANDVA-S 239
Query: 235 NAGI 238
+AG+
Sbjct: 240 SAGM 243
>gi|413956846|gb|AFW89495.1| hypothetical protein ZEAMMB73_408635 [Zea mays]
Length = 304
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/198 (49%), Positives = 136/198 (68%), Gaps = 4/198 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW A
Sbjct: 66 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGCYVAREAVIDEEYWIA 124
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAE +E + RYV++FKRKFA +EF+A+K+RC + +++ C+VAVK D+ ++R
Sbjct: 125 AWLRAEDQYENESGNRYVESFKRKFASKEFHALKKRCNTQHREKYICLVAVKNDD--IRR 182
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
TVL SVVGTLD+ +R+ L GE FP E P L I + ++GY+ N+CVAK ARRQG
Sbjct: 183 TVLNSVVGTLDVCVRHPLYGEKFPEEPGKPSLHCRIYQPDQPKFGYVTNVCVAKYARRQG 242
Query: 220 IASNMLYFAVESAKSNAG 237
IASNML A+++AK N
Sbjct: 243 IASNMLLLAIDAAKLNGA 260
>gi|223975831|gb|ACN32103.1| unknown [Zea mays]
Length = 285
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/198 (49%), Positives = 136/198 (68%), Gaps = 4/198 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW A
Sbjct: 66 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGCYVAREAVIDEEYWIA 124
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAE +E + RYV++FKRKFA +EF+A+K+RC + +++ C+VAVK D+ ++R
Sbjct: 125 AWLRAEDQYENESGNRYVESFKRKFASKEFHALKKRCNTQHREKYICLVAVKNDD--IRR 182
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
TVL SVVGTLD+ +R+ L GE FP E P L I + ++GY+ N+CVAK ARRQG
Sbjct: 183 TVLNSVVGTLDVCVRHPLYGEKFPEEPGKPSLHCRIYQPDQPKFGYVTNVCVAKYARRQG 242
Query: 220 IASNMLYFAVESAKSNAG 237
IASNML A+++AK N
Sbjct: 243 IASNMLLLAIDAAKLNGA 260
>gi|357113782|ref|XP_003558680.1| PREDICTED: uncharacterized protein LOC100839034 [Brachypodium
distachyon]
Length = 308
Score = 190 bits (483), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 92/182 (50%), Positives = 129/182 (70%), Gaps = 2/182 (1%)
Query: 56 PQRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNER 115
P +DLRF+RL+PS E + FG +VAREA++DEEYW AAWLRAE +E ++ +R
Sbjct: 83 PAEPSDLRFNRLRPSVEESDCKYKRLFGCYVAREAIMDEEYWIAAWLRAEHRYEDQSGDR 142
Query: 116 YVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRY 175
YV++FKRKFA QEF+A+K+RC G+++ C VAVK D+ + RTV+ SVVGTLD+ +R+
Sbjct: 143 YVESFKRKFASQEFHALKKRCSNQVGEKYTCFVAVKNDD--LTRTVVNSVVGTLDVCVRH 200
Query: 176 LLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSN 235
L GE +P E N P + I + ++GY+ N+CVAK ARRQGIA+NML A+++A+ N
Sbjct: 201 PLHGEKYPEEPGNSPFYARIYQPHQPKFGYLTNVCVAKYARRQGIATNMLLLAIDAARFN 260
Query: 236 AG 237
Sbjct: 261 GA 262
>gi|302796302|ref|XP_002979913.1| hypothetical protein SELMODRAFT_444359 [Selaginella moellendorffii]
gi|300152140|gb|EFJ18783.1| hypothetical protein SELMODRAFT_444359 [Selaginella moellendorffii]
Length = 346
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 106/173 (61%), Gaps = 25/173 (14%)
Query: 69 PSDRELLQHDRFEFGK----FVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKF 124
P++ LL R F + ++ REA DE++WTA+WLR+E ++E + ++R+V ++K+KF
Sbjct: 39 PANARLLTGCRDSFNQQPNFYLVREAQSDEDFWTASWLRSECYFEDQGSDRFVQSYKKKF 98
Query: 125 AEQEFNAIKRRCRGLNGQ--RHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGEN- 181
+EQEF +KRRC G G R C++AV D ++L V+GT DLS+R L G++
Sbjct: 99 SEQEFITLKRRCAGRYGNCLRCTCLLAVLNDGS----SLLNRVIGTTDLSLRQALPGDDS 154
Query: 182 -FPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
G + P YGYIAN+CV++S RR+G+AS++L +V+ AK
Sbjct: 155 RTLGIQTQP-------------YGYIANVCVSRSHRRRGVASSLLESSVQVAK 194
>gi|302811410|ref|XP_002987394.1| hypothetical protein SELMODRAFT_446972 [Selaginella moellendorffii]
gi|300144800|gb|EFJ11481.1| hypothetical protein SELMODRAFT_446972 [Selaginella moellendorffii]
Length = 530
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 104/171 (60%), Gaps = 17/171 (9%)
Query: 69 PSDRELLQHDRFEFGK----FVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKF 124
P++ LL R F + ++ REA DE++WTA+WLR+E ++E + ++R+V ++K+KF
Sbjct: 39 PANARLLTGCRDSFNQQPNFYLVREARSDEDFWTASWLRSECYFEDQGSDRFVQSYKKKF 98
Query: 125 AEQEFNAIKRRCRGLNGQ--RHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENF 182
+EQEF +KRRC G G R C++AV D ++L V+GT DLS+R L
Sbjct: 99 SEQEFITLKRRCAGRYGNCLRCTCLLAVLND----GSSLLNRVIGTTDLSLRQALPTAQ- 153
Query: 183 PGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
R + G + YGYIAN+CV++S RR+G+AS++L AV+ AK
Sbjct: 154 --PRDDSRTLGIQTQ----PYGYIANVCVSRSHRRRGVASSLLESAVQVAK 198
>gi|168029391|ref|XP_001767209.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681464|gb|EDQ67890.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 331
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 84/152 (55%), Gaps = 19/152 (12%)
Query: 82 FGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNG 141
+ F EA L+ E AAWLRA+++ E + RYVD+FK+KFAEQEF A+KRR G G
Sbjct: 112 YNHFDIHEARLEYELKDAAWLRADAYSEQQPFTRYVDSFKKKFAEQEFAALKRRLSGQPG 171
Query: 142 QRHYCIVAVKKDEGNVKRTVL---KSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRR 198
Y + D V + V+GTLD S+ GE+F G R+
Sbjct: 172 S--YVSNTLPTDSIEVDWQDCEKDRQVLGTLDFSLHRPSPGESFGGHRL----------- 218
Query: 199 GPNRYGYIANLCVAKSARRQGIASNMLYFAVE 230
YGYIAN+CV KSAR+QGIAS +L AV+
Sbjct: 219 ---PYGYIANVCVEKSARKQGIASALLERAVQ 247
>gi|356525559|ref|XP_003531392.1| PREDICTED: uncharacterized protein LOC100790837 [Glycine max]
Length = 92
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/78 (56%), Positives = 52/78 (66%), Gaps = 9/78 (11%)
Query: 45 TKKEDLSIVSIPQRSN--------DLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEY 96
TKK+ V +P ++ DL F RLQPSD EL H+RFEFG FVAR+ALL EEY
Sbjct: 9 TKKKKQVSVQLPSLTSPVSTVRFCDLNFGRLQPSDDELDPHNRFEFGNFVARQALLHEEY 68
Query: 97 WTAAWLRAESHWEGRTNE 114
WTAAWLRAE W RT++
Sbjct: 69 WTAAWLRAE-EWASRTDK 85
>gi|226509785|ref|NP_001141650.1| uncharacterized protein LOC100273775 [Zea mays]
gi|194705416|gb|ACF86792.1| unknown [Zea mays]
gi|413956844|gb|AFW89493.1| hypothetical protein ZEAMMB73_408635 [Zea mays]
Length = 140
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW A
Sbjct: 66 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGCYVAREAVIDEEYWIA 124
Query: 100 AWLRAESHWEGRTNER 115
AWLRAE +E + R
Sbjct: 125 AWLRAEDQYENESGNR 140
>gi|413916376|gb|AFW56308.1| hypothetical protein ZEAMMB73_275268 [Zea mays]
Length = 195
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW
Sbjct: 121 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGYYVAREAVIDEEYWIV 179
Query: 100 AWLRAESHWEGRTNER 115
AWLRAE +E + R
Sbjct: 180 AWLRAEDQYENESGNR 195
>gi|294463954|gb|ADE77498.1| unknown [Picea sitchensis]
Length = 55
Score = 55.8 bits (133), Expect = 2e-05, Method: Composition-based stats.
Identities = 26/36 (72%), Positives = 31/36 (86%)
Query: 201 NRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNA 236
+RYGY+AN+CV+K ARRQGIASNML AVE AKS+
Sbjct: 10 HRYGYVANVCVSKFARRQGIASNMLQLAVEVAKSSG 45
>gi|255636981|gb|ACU18823.1| unknown [Glycine max]
Length = 106
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 53/104 (50%), Gaps = 22/104 (21%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKK------------- 47
MSTI + RP +G + K RI+A+W M MDSK S T K
Sbjct: 4 MSTIPIHRPEFCTLFFNGSPSPY-KCPRITASWIMAMDSKFSSTIKNNSDNDYNLKKKKD 62
Query: 48 EDLSI-VSIPQ-------RSNDLRFDRLQPSDRELLQHDRFEFG 83
E+LS+ +S P SNDL+FDRLQPSD+EL + R + G
Sbjct: 63 EELSVQISTPPISKVETLNSNDLQFDRLQPSDQELGRVKRQQHG 106
>gi|255644778|gb|ACU22891.1| unknown [Glycine max]
Length = 240
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 72/154 (46%), Gaps = 27/154 (17%)
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG--LNGQRHYC 146
E+ ++E W AA LR S + R + + + R AE+EF A+K R G + +R C
Sbjct: 33 ESFYEDELWAAACLRVRSFNQFRPDAFGILDHTRYLAEREFEALKERVSGKRMGFRRVSC 92
Query: 147 I--------VAVKKDE--GNVKRTV---LKSVVGTLDLSIRYLLQGENFPGERVN--PPL 191
I +A D+ + K + + VVGTLDL+ Q + P E V P +
Sbjct: 93 INASLPLSHIATLSDDLCSSCKFSTNGEDRIVVGTLDLN-----QCLSLPDEIVGAKPEV 147
Query: 192 FGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
G R Y++N+CVAK R G+A +L
Sbjct: 148 IGADITR-----AYLSNVCVAKELHRNGLAYALL 176
>gi|357479907|ref|XP_003610239.1| hypothetical protein MTR_4g129490 [Medicago truncatula]
gi|355511294|gb|AES92436.1| hypothetical protein MTR_4g129490 [Medicago truncatula]
Length = 317
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 29/155 (18%)
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNG--QRHYC 146
E+ D+E+W A+ LR S + R + + + + AE+EF A+K R G N +R C
Sbjct: 110 ESFYDDEFWAASSLRVRSFNQFRPDTFGLQDHAKYLAEREFEALKERVSGKNKAFRRVSC 169
Query: 147 IVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGEN--------------FPGERVN--PP 190
I A ++ + L S+ L S +Y GE+ P E V P
Sbjct: 170 INA------SLPMSHLSSLYDDLCSSCKYSASGEDRVVVGSLDLNQCLSLPDEIVGMKPE 223
Query: 191 LFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+ G R Y++N+CVA+ R G+A +L
Sbjct: 224 VSGADTTR-----AYLSNVCVARELHRNGLAYELL 253
>gi|217069962|gb|ACJ83341.1| unknown [Medicago truncatula]
Length = 223
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 29/155 (18%)
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNG--QRHYC 146
E+ D+E+W A+ LR S + R + + + + AE+EF A+K R G N +R C
Sbjct: 58 ESFYDDEFWAASSLRVRSFNQFRPDTFGLQDHAKYLAEREFEALKERVSGKNKAFRRVSC 117
Query: 147 IVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGEN--------------FPGERVN--PP 190
I A ++ + L S+ L S +Y GE+ P E V P
Sbjct: 118 INA------SLPMSHLSSLYDDLCSSCKYSASGEDRVVVGSLDLNQCLSLPDEIVGMKPE 171
Query: 191 LFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+ G R Y++N+CVA+ R G+A +L
Sbjct: 172 VSGADTTR-----AYLSNVCVARELHRNGLAYELL 201
>gi|388514139|gb|AFK45131.1| unknown [Medicago truncatula]
Length = 265
Score = 50.4 bits (119), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 29/155 (18%)
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNG--QRHYC 146
E+ D+E+W A+ LR S + R + + + + AE+EF A+K R G N +R C
Sbjct: 58 ESFYDDEFWAASSLRVRSFNQFRPDTFGLQDHAKYLAEREFEALKERVSGKNKAFRRVSC 117
Query: 147 IVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGEN--------------FPGERVN--PP 190
I A ++ + L S+ L S +Y GE+ P E V P
Sbjct: 118 INA------SLPMSHLSSLYDDLCSSCKYSASGEDRVVVGSLDLNQCLSLPDEIVGMKPE 171
Query: 191 LFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+ G R Y++N+CVA+ R G+A +L
Sbjct: 172 VSGADTTR-----AYLSNVCVARELHRNGLAYELL 201
>gi|384245237|gb|EIE18732.1| hypothetical protein COCSUDRAFT_60039 [Coccomyxa subellipsoidea
C-169]
Length = 394
Score = 50.1 bits (118), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 65/134 (48%), Gaps = 27/134 (20%)
Query: 115 RYVDNFKRKFAEQEFNAIKRRCR---GLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDL 171
RYV +FKR+F EQE ++K R R G + C+VAV ++EG+ +V+GTLD
Sbjct: 241 RYVGSFKRQFKEQEARSLKERIRCRPGSDTPECVCLVAV-EEEGDS-----GTVMGTLD- 293
Query: 172 SIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVES 231
E PP +R P Y+ N+ V++ R++GI ++ A +
Sbjct: 294 ------------AEPPGPPT--PKHRHIPEGATYLLNVVVSQKHRKRGIGRALMAAAGKL 339
Query: 232 AKSNAG---ICTCA 242
AK G +CT A
Sbjct: 340 AKEQWGSGSMCTHA 353
>gi|255070783|ref|XP_002507473.1| gcn5-related n-acetyltransferase family protein [Micromonas sp.
RCC299]
gi|226522748|gb|ACO68731.1| gcn5-related n-acetyltransferase family protein [Micromonas sp.
RCC299]
Length = 359
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 75/175 (42%), Gaps = 29/175 (16%)
Query: 68 QPSDRELLQHDR-FEFGKFVAREALLDEEYWTAAWLRAESHW---EGRTN----ERYVDN 119
P EL R FE + A E+ AA LRA + + E R R++
Sbjct: 107 HPYTEELATKIRNFETSQISVITAESFSEFSVAAELRATAFYDDLEARQALPFPSRFIAT 166
Query: 120 FKRKFAEQEFNAIKRRCRGLNG--QRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLL 177
F+R+FA++E A+K R G +R C++A D G +VG LD+S+R
Sbjct: 167 FRREFAQRECLALKERTAHFTGTLRRCLCLMARLPDLG---------LVGCLDISVRVGP 217
Query: 178 QGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESA 232
G G CI Y YI N+ V + ARR G AS ML + E A
Sbjct: 218 CGSQVNGL--------CIPDN--EEYVYIDNVAVDERARRLGSASAMLEASSEIA 262
>gi|384245461|gb|EIE18955.1| hypothetical protein COCSUDRAFT_49126 [Coccomyxa subellipsoidea
C-169]
Length = 171
Score = 46.6 bits (109), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 57/121 (47%), Gaps = 28/121 (23%)
Query: 125 AEQEFNAIKRRCRGLNGQRH-----YCIVAV--------KKDEGNVKRTVLKSVVGTLDL 171
++EF A+K + R L R C+VAV ++ ++VVGTLDL
Sbjct: 3 GDEEFEALKEQ-RDLRNARQPNENSACLVAVWPSESLEESPSSLGLQAIGEEAVVGTLDL 61
Query: 172 SIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVES 231
+ GE G+ NP Y+AN+CVAK+A+RQG+ +++ AV+
Sbjct: 62 YAIRAMPGEVLIGDCSNP--------------AYLANVCVAKAAQRQGVGRELIHNAVDL 107
Query: 232 A 232
A
Sbjct: 108 A 108
>gi|18417112|ref|NP_567795.1| GCN5-related N-acetyltransferase (GNAT) family protein [Arabidopsis
thaliana]
gi|15081771|gb|AAK82540.1| AT4g28030/T13J8_140 [Arabidopsis thaliana]
gi|22137098|gb|AAM91394.1| At4g28030/T13J8_140 [Arabidopsis thaliana]
gi|332660024|gb|AEE85424.1| GCN5-related N-acetyltransferase (GNAT) family protein [Arabidopsis
thaliana]
Length = 274
Score = 45.1 bits (105), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 78/196 (39%), Gaps = 39/196 (19%)
Query: 51 SIVSIPQRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEG 110
S +SIP S LRF + S + + FV E++ ++E W AA LR + E
Sbjct: 28 SYLSIP--SLKLRFRPVAASSH--ICAPAIDKSTFVISESVSEDELWAAACLRVRTFNEL 83
Query: 111 RTNERYVDNFKRKFAEQEFNAIKRRCRG-------------------LNGQRHYCIVAVK 151
+ + + +R AE+EF A+K R G L+ A K
Sbjct: 84 NPSAYNIQDHRRYLAEREFEALKERTSGKREGFTRVACINATLPLSQLSSSFEDLCSACK 143
Query: 152 KDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGE--RVNPPLFGCINRRGPNRYGYIANL 209
+G R VVG+LDL+ Q P E P G R Y++N+
Sbjct: 144 FSDGIEDRV----VVGSLDLN-----QCRWLPDEIAGTKPEGIGVDFAR-----AYLSNV 189
Query: 210 CVAKSARRQGIASNML 225
CVAK R G+ ++
Sbjct: 190 CVAKELHRNGVGYKLI 205
>gi|21536955|gb|AAM61296.1| unknown [Arabidopsis thaliana]
Length = 274
Score = 44.7 bits (104), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 79/196 (40%), Gaps = 39/196 (19%)
Query: 51 SIVSIPQRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEG 110
S +SIP S LRF + S + + FV E++ ++E W AA LR + E
Sbjct: 28 SYLSIP--SLKLRFRPVAASSH--ICAPAIDKSTFVISESVSEDELWAAACLRVRTFNEL 83
Query: 111 RTNERYVDNFKRKFAEQEFNAIKRRCRGL--NGQRHYCI-----------------VAVK 151
+ + + +R AE+EF A+K R G R CI A K
Sbjct: 84 NPSAYNIQDHRRYLAEREFEALKERTSGKREGFTRVACINATLPLSQLSSSSEDLCSACK 143
Query: 152 KDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGE--RVNPPLFGCINRRGPNRYGYIANL 209
+G R VVG+LDL+ Q P E P G R Y++N+
Sbjct: 144 FSDGIEDRV----VVGSLDLN-----QCRWLPDEIAGTKPEGIGVDFAR-----AYLSNV 189
Query: 210 CVAKSARRQGIASNML 225
CVAK R G+ ++
Sbjct: 190 CVAKELHRNGVGYKLI 205
>gi|79325293|ref|NP_001031734.1| GCN5-related N-acetyltransferase (GNAT) family protein [Arabidopsis
thaliana]
gi|222423521|dbj|BAH19730.1| AT4G28030 [Arabidopsis thaliana]
gi|332660025|gb|AEE85425.1| GCN5-related N-acetyltransferase (GNAT) family protein [Arabidopsis
thaliana]
Length = 263
Score = 44.7 bits (104), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 78/196 (39%), Gaps = 39/196 (19%)
Query: 51 SIVSIPQRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEG 110
S +SIP S LRF + S + + FV E++ ++E W AA LR + E
Sbjct: 28 SYLSIP--SLKLRFRPVAASSH--ICAPAIDKSTFVISESVSEDELWAAACLRVRTFNEL 83
Query: 111 RTNERYVDNFKRKFAEQEFNAIKRRCRG-------------------LNGQRHYCIVAVK 151
+ + + +R AE+EF A+K R G L+ A K
Sbjct: 84 NPSAYNIQDHRRYLAEREFEALKERTSGKREGFTRVACINATLPLSQLSSSFEDLCSACK 143
Query: 152 KDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGE--RVNPPLFGCINRRGPNRYGYIANL 209
+G R VVG+LDL+ Q P E P G R Y++N+
Sbjct: 144 FSDGIEDRV----VVGSLDLN-----QCRWLPDEIAGTKPEGIGVDFAR-----AYLSNV 189
Query: 210 CVAKSARRQGIASNML 225
CVAK R G+ ++
Sbjct: 190 CVAKELHRNGVGYKLI 205
>gi|255585013|ref|XP_002533217.1| N-acetyltransferase, putative [Ricinus communis]
gi|223526974|gb|EEF29170.1| N-acetyltransferase, putative [Ricinus communis]
Length = 266
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 63/161 (39%), Gaps = 34/161 (21%)
Query: 85 FVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG--LNGQ 142
V E ++E W A+ LR S + + + K+ AE+EF A+K R G +
Sbjct: 53 LVIAETEAEDELWAASCLRVRSFHNFHDSSFSIQDHKKYLAEREFEAVKERISGKRTGFR 112
Query: 143 RHYCIVAVKKDEGNVKRTVL------------KSVVGTLDLSIRYLLQGENFPGERVNPP 190
R CI A L + VVGTLDL+ Q P E
Sbjct: 113 RVSCINATLPSSQLSDYDDLCTECKYTNNGEDRVVVGTLDLN-----QCLRLPDE----- 162
Query: 191 LFGCINRRGPNRYG------YIANLCVAKSARRQGIASNML 225
I + P G Y++N+CVAK RQG+ ++
Sbjct: 163 ----ITGKKPEGIGADFLRAYLSNVCVAKELHRQGLGYELI 199
>gi|302844083|ref|XP_002953582.1| hypothetical protein VOLCADRAFT_106017 [Volvox carteri f.
nagariensis]
gi|300260991|gb|EFJ45206.1| hypothetical protein VOLCADRAFT_106017 [Volvox carteri f.
nagariensis]
Length = 382
Score = 43.9 bits (102), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 69/170 (40%), Gaps = 43/170 (25%)
Query: 102 LRAESHWEGRTNERYVDNFKRKFAEQ-------------------EFNAIKRRCRGLNGQ 142
++A + WE T + VD+ + Q E A + C G N Q
Sbjct: 135 MKANAEWESVTAKVEVDDVDPGYGRQATAPTTITTTVTASEGDGAELPAASQHC-GSNDQ 193
Query: 143 RHYCIVA-----------VKKDEGNVKRTVLKS---VVGTLDLSIRYLLQGENFPGERVN 188
++A + + G+ + VVG+LDL++ + L E G
Sbjct: 194 ESGALLAALRSCLDACAQLPAEPGSSASASPRPRRLVVGSLDLNVGHTLPSEELIGRH-- 251
Query: 189 PPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNAGI 238
P F +RR Y++N+CVA +ARR G+A ++L VE AG+
Sbjct: 252 -PQFDPRHRRA-----YLSNVCVAPAARRLGVARSLLRH-VEEVARGAGV 294
>gi|159487811|ref|XP_001701916.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281135|gb|EDP06891.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1037
Score = 43.5 bits (101), Expect = 0.071, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 40/71 (56%), Gaps = 10/71 (14%)
Query: 165 VVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGP-NRYGYIANLCVAKSARRQGIASN 223
VVG+LDL++ G P E L G + P +R Y++N+CVA +ARR G+A
Sbjct: 182 VVGSLDLNV-----GHTLPSEE----LIGRQPKEDPRHRRAYLSNVCVAPAARRMGLARA 232
Query: 224 MLYFAVESAKS 234
+L A E A+S
Sbjct: 233 LLRVAEEEARS 243
>gi|224088384|ref|XP_002308434.1| predicted protein [Populus trichocarpa]
gi|222854410|gb|EEE91957.1| predicted protein [Populus trichocarpa]
Length = 249
Score = 43.5 bits (101), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 62/149 (41%), Gaps = 24/149 (16%)
Query: 93 DEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG----------LNGQ 142
+++ W AA LR S E + + + + KR AE+EF A+K R G LN
Sbjct: 47 EDQLWAAACLRVRSFHEFKPSTFGIQDHKRYLAEREFEALKERIAGKRTGFNRVSCLNAS 106
Query: 143 RHYCIVAVKKDEGNVKRTVL------KSVVGTLDLSIRYLLQGENFPGERVNPPLFGCIN 196
+ D+ + + VVGTLD++ Q + P E G
Sbjct: 107 LPLSQLLSLPDDDLCAQCKFSENGEDRVVVGTLDVN-----QSMSLPDEITGMKPEGI-- 159
Query: 197 RRGPNRYGYIANLCVAKSARRQGIASNML 225
G GY++N+CVA R G+ +++
Sbjct: 160 -EGQFARGYLSNVCVANELHRNGLGYDLV 187
>gi|297799176|ref|XP_002867472.1| hypothetical protein ARALYDRAFT_328890 [Arabidopsis lyrata subsp.
lyrata]
gi|297313308|gb|EFH43731.1| hypothetical protein ARALYDRAFT_328890 [Arabidopsis lyrata subsp.
lyrata]
Length = 275
Score = 43.5 bits (101), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 66/162 (40%), Gaps = 35/162 (21%)
Query: 85 FVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGL--NGQ 142
FV E++ ++E W AA LR + E + + + +R AE+EF A+K R G
Sbjct: 58 FVISESVSEDELWAAACLRVRTFNELNPSAYNIQDHRRYLAEREFEALKERTSGKREGFT 117
Query: 143 RHYCIVAV-----------------KKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGE 185
R CI A K +G R VVG+LDL+ Q P E
Sbjct: 118 RVACINATLPLSQLSSSSEDLCSSCKFSDGIEDRV----VVGSLDLN-----QCRWLPDE 168
Query: 186 --RVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
P G R Y++N+CVAK R G+ ++
Sbjct: 169 IAGTKPEGIGVDFAR-----AYLSNVCVAKELHRNGVGYKLI 205
>gi|302828192|ref|XP_002945663.1| hypothetical protein VOLCADRAFT_120146 [Volvox carteri f.
nagariensis]
gi|300268478|gb|EFJ52658.1| hypothetical protein VOLCADRAFT_120146 [Volvox carteri f.
nagariensis]
Length = 459
Score = 43.1 bits (100), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 66/139 (47%), Gaps = 23/139 (16%)
Query: 94 EEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQ-RHY--CIVAV 150
+E A LRA++++ R+V + K+KF EQE ++++R L+ Q + Y C+VA+
Sbjct: 260 DELRQVATLRADAYY-AENQSRFVGSLKKKFVEQEVESLQQRTTILSRQGKPYSECLVAI 318
Query: 151 KKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLC 210
G +V+ +D+ + L G + G + P GC Y+ N+
Sbjct: 319 DSASG--------AVLACIDMRLPAALNGTHPHGVPQDDP-SGC----------YLLNVV 359
Query: 211 VAKSARRQGIASNMLYFAV 229
V + R QG+ ++ A+
Sbjct: 360 VREDVRGQGLGRGIMQAAM 378
>gi|225445718|ref|XP_002270456.1| PREDICTED: uncharacterized protein LOC100246646 [Vitis vinifera]
gi|297743735|emb|CBI36618.3| unnamed protein product [Vitis vinifera]
Length = 279
Score = 42.0 bits (97), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 73/167 (43%), Gaps = 28/167 (16%)
Query: 76 QHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRR 135
Q + + V E + +++ W AA LR S ++ + +D+ KR AE+EF A+K R
Sbjct: 57 QTFKIDKSSLVVAETVSEDQLWAAACLRIRSFYQFGPSYG-IDDHKRYLAEREFEALKER 115
Query: 136 CRGL--NGQRHYCIVA-VKKDEGNVKRTVL------------KSVVGTLDLSIRYLLQGE 180
G +R CI A + E + L + V+GTLDL+ Q
Sbjct: 116 VAGKREGFRRVSCINATIPLSEISSFSDDLCAACKFTHNGEDRVVIGTLDLN-----QCV 170
Query: 181 NFPGE--RVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+ P E + P G R Y++N+CVAK R G+ ++
Sbjct: 171 SLPDEITGMKPQGIGADFLR-----AYLSNVCVAKELHRNGLGYALV 212
>gi|209524724|ref|ZP_03273271.1| GCN5-related N-acetyltransferase [Arthrospira maxima CS-328]
gi|376004001|ref|ZP_09781787.1| GCN5-related N-acetyltransferase [Arthrospira sp. PCC 8005]
gi|423067739|ref|ZP_17056529.1| GCN5-related N-acetyltransferase [Arthrospira platensis C1]
gi|209494868|gb|EDZ95176.1| GCN5-related N-acetyltransferase [Arthrospira maxima CS-328]
gi|375327624|emb|CCE17540.1| GCN5-related N-acetyltransferase [Arthrospira sp. PCC 8005]
gi|406710844|gb|EKD06047.1| GCN5-related N-acetyltransferase [Arthrospira platensis C1]
Length = 205
Score = 42.0 bits (97), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 22/84 (26%)
Query: 146 CIVAVKKDE----GNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPN 201
C+VA D GN+ V + +VGT+++++R NP G +
Sbjct: 76 CLVAAIADSPHKLGNIPLNVPEHLVGTVEMALR-----------SHNPWSIG-------D 117
Query: 202 RYGYIANLCVAKSARRQGIASNML 225
RY YI+NL V +ARRQG+A +L
Sbjct: 118 RYPYISNLAVHPTARRQGVARELL 141
>gi|307107002|gb|EFN55246.1| hypothetical protein CHLNCDRAFT_134568 [Chlorella variabilis]
Length = 237
Score = 41.6 bits (96), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 29/40 (72%)
Query: 99 AAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG 138
AAWLRAE+++E R+ R+ +F ++FA QE A+ RR +G
Sbjct: 37 AAWLRAEAYYEKRSVGRFKASFIKQFAGQELYALLRRTKG 76
>gi|168016797|ref|XP_001760935.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687944|gb|EDQ74324.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 176
Score = 41.6 bits (96), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 53/121 (43%), Gaps = 31/121 (25%)
Query: 125 AEQEFNAIKRRCRGLNG--QRHYCIVAV------------------KKDEGNVKRTVLKS 164
AE EF A+ + GL +R CI+A+ K N + VL
Sbjct: 6 AEDEFTALTSKVAGLEQGYKRIVCILALYPLSCLPDRSLVDLHPALKVTLANGEEHVL-- 63
Query: 165 VVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNM 224
VG+LDLS QG+ PGE V G R GY++N+CVA R++GI +
Sbjct: 64 -VGSLDLS-----QGKVLPGEIVGSKPQGSSAER---ERGYLSNVCVAPLMRQRGIGVAL 114
Query: 225 L 225
L
Sbjct: 115 L 115
>gi|28812153|dbj|BAC65034.1| GCN5-related N-acetyltransferase-like [Oryza sativa Japonica Group]
gi|37806163|dbj|BAC99667.1| GCN5-related N-acetyltransferase-like [Oryza sativa Japonica Group]
gi|125560809|gb|EAZ06257.1| hypothetical protein OsI_28494 [Oryza sativa Indica Group]
gi|125602757|gb|EAZ42082.1| hypothetical protein OsJ_26642 [Oryza sativa Japonica Group]
Length = 291
Score = 41.2 bits (95), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 73/158 (46%), Gaps = 23/158 (14%)
Query: 83 GKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG--LN 140
G RE + +E A LR + +E T+ ++ ++ A++E++A++ R G +N
Sbjct: 76 GAVAVREFVTLDELRAAVRLRIRTFYEYATDSYGAEDLRKSLADREYDALQDRISGKMIN 135
Query: 141 GQRHYCI--------VAVKKDE-----GNVKRTVLKSVVGTLDLSIRYLLQGENFPGERV 187
QR CI V +E V+ + VVG+LDL+ + L + G+R
Sbjct: 136 FQRVSCINGTVPLLPSLVSAEELCSTCKFVEDGEERVVVGSLDLN-QCLWLPDELTGKRP 194
Query: 188 NPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+N R Y++N+CVAK +R G+ ++
Sbjct: 195 G------VNESSHTR-AYLSNVCVAKELQRNGLGYALV 225
>gi|384244775|gb|EIE18273.1| hypothetical protein COCSUDRAFT_60465 [Coccomyxa subellipsoidea
C-169]
Length = 163
Score = 40.8 bits (94), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 42/75 (56%), Gaps = 10/75 (13%)
Query: 165 VVGTLDLSIRYLLQGENFPGER-VNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASN 223
VVGTLD++ QG N PG + G + R + GY++N+C K+ RR G+A
Sbjct: 47 VVGTLDIN-----QGVNLPGGHLIGLEPEGAVAR---IQRGYLSNICTQKAVRRTGVAMR 98
Query: 224 MLYFAVESAKSNAGI 238
++ A++ A S AGI
Sbjct: 99 LIRAAIDEAAS-AGI 112
>gi|428769044|ref|YP_007160834.1| N-acetyltransferase GCN5 [Cyanobacterium aponinum PCC 10605]
gi|428683323|gb|AFZ52790.1| GCN5-related N-acetyltransferase [Cyanobacterium aponinum PCC
10605]
Length = 185
Score = 40.8 bits (94), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 48/98 (48%), Gaps = 24/98 (24%)
Query: 140 NGQRHYCIVAV--KKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINR 197
N + + C++AV EG V + +L GT++LS R EN+ +
Sbjct: 59 NEKENICLIAVFINNTEGKVNQEIL----GTVELSFR-----ENYRWAK----------- 98
Query: 198 RGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSN 235
+Y YIANL V ++ RRQGIAS +L + A+ N
Sbjct: 99 --KEKYAYIANLAVRENCRRQGIASQLLSRCEQIAQEN 134
>gi|308808185|ref|XP_003081403.1| GCN5-related N-acetyltransferase (ISS) [Ostreococcus tauri]
gi|116059865|emb|CAL55572.1| GCN5-related N-acetyltransferase (ISS) [Ostreococcus tauri]
Length = 269
Score = 40.8 bits (94), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 12/80 (15%)
Query: 161 VLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGI 220
V+ V G+LDL++ GE P E L G ++ P + Y++N+ V RR+GI
Sbjct: 152 VMSIVAGSLDLNV-----GERLPAEE----LAGALS---PGKRAYLSNVSVLPPVRRRGI 199
Query: 221 ASNMLYFAVESAKSNAGICT 240
A M+ A++ A+ G+ T
Sbjct: 200 AFGMINRALDVARDEFGVET 219
>gi|434389140|ref|YP_007099751.1| acetyltransferase [Chamaesiphon minutus PCC 6605]
gi|428020130|gb|AFY96224.1| acetyltransferase [Chamaesiphon minutus PCC 6605]
Length = 202
Score = 40.8 bits (94), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 23/93 (24%)
Query: 142 QRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPN 201
Q+ C +AV D G V+GT+++ IR N+ + P+
Sbjct: 79 QQQVCSIAVYADRGQ------SQVIGTIEVGIR----TTNY-------------RQPKPH 115
Query: 202 RYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
RY YI+NL V++ RR+G+A +L E KS
Sbjct: 116 RYPYISNLAVSRDFRRRGVAQQLLIGCEELTKS 148
>gi|449508815|ref|XP_004163419.1| PREDICTED: uncharacterized protein LOC101228360 [Cucumis sativus]
Length = 288
Score = 40.0 bits (92), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 71/170 (41%), Gaps = 31/170 (18%)
Query: 84 KFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG--LNG 141
KF E +E W AA LR + + + + + K+ AE EF A+K R G +
Sbjct: 74 KFRVSEGTSHDELWAAASLRVRTFNQLPPDSFGIHDHKKYLAEHEFEAMKERIAGKRVGF 133
Query: 142 QRHYCIVA-VKKDEGNVKRTVLKS------------VVGTLDLSIRYLLQGENFPGE--R 186
+R CI A + E + L S VVG+LD++ Q P E
Sbjct: 134 KRVSCINATLPLSEISTLAEDLCSTCKFSDNGEDRVVVGSLDIN-----QCVRLPDEITG 188
Query: 187 VNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNA 236
+ P G R Y++N+CVAK +R G+ Y + AK+ A
Sbjct: 189 MKPEGIGADFAR-----AYLSNVCVAKELQRNGLG----YALIAKAKTIA 229
>gi|291568630|dbj|BAI90902.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 210
Score = 40.0 bits (92), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 22/84 (26%)
Query: 146 CIVAVKKDE----GNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPN 201
C+VA D GN+ V + +VGT+++ +R +P G +
Sbjct: 81 CLVAAIADSPHKLGNIPLNVPEHLVGTVEMGLR-----------SPSPWSIG-------D 122
Query: 202 RYGYIANLCVAKSARRQGIASNML 225
RY YI+NL V +ARRQG+A +L
Sbjct: 123 RYPYISNLAVHPTARRQGVARELL 146
>gi|409993391|ref|ZP_11276533.1| acetyltransferase [Arthrospira platensis str. Paraca]
gi|409935717|gb|EKN77239.1| acetyltransferase [Arthrospira platensis str. Paraca]
Length = 205
Score = 40.0 bits (92), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 22/84 (26%)
Query: 146 CIVAVKKDE----GNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPN 201
C+VA D GN+ V + +VGT+++ +R +P G +
Sbjct: 76 CLVAAIADSPHKLGNIPLNVPEHLVGTVEMGLR-----------SPSPWSIG-------D 117
Query: 202 RYGYIANLCVAKSARRQGIASNML 225
RY YI+NL V +ARRQG+A +L
Sbjct: 118 RYPYISNLAVHPTARRQGVARELL 141
>gi|428207047|ref|YP_007091400.1| N-acetyltransferase GCN5 [Chroococcidiopsis thermalis PCC 7203]
gi|428008968|gb|AFY87531.1| GCN5-related N-acetyltransferase [Chroococcidiopsis thermalis PCC
7203]
Length = 228
Score = 40.0 bits (92), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 48/111 (43%), Gaps = 16/111 (14%)
Query: 129 FNAIKRRCRGLNGQRHYCIVAVKK-----DEGNVKRTVLKSVVGTLDLSIRYLLQGENFP 183
+ I+ R H C+VAV+K + G + GT++L++R + G
Sbjct: 71 YEDIRHRIHA-TAPHHVCLVAVEKTAQNENTGMYSHDSTLGIAGTVELALRTIPLGTTC- 128
Query: 184 GERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
+ R +Y Y++NL V + RRQGIA +L + AKS
Sbjct: 129 ---------SFTSYRQGYQYPYLSNLAVHTTRRRQGIAGKLLLSCEQVAKS 170
>gi|225872570|ref|YP_002754025.1| GNAT family acetyltransferase [Acidobacterium capsulatum ATCC
51196]
gi|225792052|gb|ACO32142.1| acetyltransferase, GNAT family [Acidobacterium capsulatum ATCC
51196]
Length = 152
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 197 RRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNAGICTC 241
RRG YGY+ + VA+SARRQG+ +L FAVE G+ +
Sbjct: 65 RRGELHYGYVVTIDVAESARRQGVGRALL-FAVEKHLREEGLASI 108
>gi|172038596|ref|YP_001805097.1| hypothetical protein cce_3683 [Cyanothece sp. ATCC 51142]
gi|354554064|ref|ZP_08973369.1| GCN5-related N-acetyltransferase [Cyanothece sp. ATCC 51472]
gi|171700050|gb|ACB53031.1| hypothetical protein cce_3683 [Cyanothece sp. ATCC 51142]
gi|353553743|gb|EHC23134.1| GCN5-related N-acetyltransferase [Cyanothece sp. ATCC 51472]
Length = 218
Score = 39.3 bits (90), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 25/97 (25%)
Query: 129 FNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVN 188
+ ++ R RG + C VAV+K K T + V+GT++LS++ G
Sbjct: 82 YEDLRSRLRG-TIPYYCCFVAVEKTSTLTKTT--EKVIGTIELSLK---SG--------- 126
Query: 189 PPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
F C Y YI+NL V +S RRQGIA +L
Sbjct: 127 ---FNC-------HYLYISNLAVIQSHRRQGIAKQLL 153
>gi|159481971|ref|XP_001699048.1| hypothetical protein CHLREDRAFT_152106 [Chlamydomonas reinhardtii]
gi|158273311|gb|EDO99102.1| predicted protein [Chlamydomonas reinhardtii]
Length = 213
Score = 38.9 bits (89), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 60/135 (44%), Gaps = 23/135 (17%)
Query: 94 EEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRH---YCIVAV 150
+E A LRA++++ R+V + K+KF EQE ++++R + Q C+VAV
Sbjct: 23 DELRQVATLRADAYY-AENFSRFVGSLKKKFVEQEVESLQQRTYATSKQGTPFCECLVAV 81
Query: 151 KKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLC 210
+ +V+G +DL + L G + P GC Y+ N+
Sbjct: 82 EAGS--------STVLGCIDLRLPAALNGAH-PAGVPAGDTAGC----------YLLNVV 122
Query: 211 VAKSARRQGIASNML 225
V + R QG+ ++
Sbjct: 123 VREDVRGQGLGRAIM 137
>gi|428773700|ref|YP_007165488.1| N-acetyltransferase GCN5 [Cyanobacterium stanieri PCC 7202]
gi|428687979|gb|AFZ47839.1| GCN5-related N-acetyltransferase [Cyanobacterium stanieri PCC 7202]
Length = 212
Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 47/101 (46%), Gaps = 23/101 (22%)
Query: 135 RCRGLNGQRHYC-IVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFG 193
R R + HYC ++A+ T + +VGT+++S+R NP +G
Sbjct: 77 RDRFYSNNPHYCCLIAIIP----ASSTQPEQIVGTIEISLR-------------NPYGWG 119
Query: 194 CINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
+Y YI+NL V K RRQGI S +L E A+S
Sbjct: 120 -----SKKKYPYISNLAVKKEFRRQGIGSQLLQKCEEIAQS 155
>gi|384252882|gb|EIE26357.1| acyl-CoA N-acyltransferase [Coccomyxa subellipsoidea C-169]
Length = 306
Score = 38.5 bits (88), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 177 LQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
+QG++ P ER+ F R P YI+NL V+ +ARR G+ +L A + AK
Sbjct: 198 MQGQHMPAERIEKGPFVAYQR--PEDVAYISNLAVSPAARRLGVGEELLTAAEKVAK 252
>gi|159476828|ref|XP_001696513.1| hypothetical protein CHLREDRAFT_142152 [Chlamydomonas reinhardtii]
gi|158282738|gb|EDP08490.1| predicted protein [Chlamydomonas reinhardtii]
Length = 283
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/43 (37%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 191 LFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
+ GC+ G Y Y++++CVA +ARR+G+A ++ A E A+
Sbjct: 189 VLGCLPP-GTREYAYVSSMCVAPTARRRGVAQALMSAAEEQAR 230
>gi|428318351|ref|YP_007116233.1| GCN5-related N-acetyltransferase [Oscillatoria nigro-viridis PCC
7112]
gi|428242031|gb|AFZ07817.1| GCN5-related N-acetyltransferase [Oscillatoria nigro-viridis PCC
7112]
Length = 207
Score = 37.7 bits (86), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 52/109 (47%), Gaps = 22/109 (20%)
Query: 129 FNAIKRRCRGLNGQRHYCIVA--VKKDEG--NVKRTVLKSVVGTLDLSIRYLLQGENFPG 184
+ +K R R + + C+VA V ++EG N + + + GT+++++R FP
Sbjct: 61 YEDLKNRLRS-KAEHYICLVAELVSREEGTQNYRSHRAQRITGTVEMALR-----SRFPW 114
Query: 185 ERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
+ N + Y Y++NL V RRQG+A +L E+A+
Sbjct: 115 QIPN------------SDYPYLSNLAVHPEYRRQGVAQQLLSNCEETAR 151
>gi|412990748|emb|CCO18120.1| predicted protein [Bathycoccus prasinos]
Length = 266
Score = 37.4 bits (85), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 37/72 (51%), Gaps = 11/72 (15%)
Query: 165 VVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYG--YIANLCVAKSARRQGIAS 222
V GTLDL++ G N P E L G G R Y++N+CV +S R GIA
Sbjct: 152 VCGTLDLNV-----GVNLPAEE----LVGKFPIEGNARKKRCYMSNVCVLESRRNLGIAR 202
Query: 223 NMLYFAVESAKS 234
++ A+E AK+
Sbjct: 203 QLIERAIEDAKN 214
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.136 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,653,114,024
Number of Sequences: 23463169
Number of extensions: 143285261
Number of successful extensions: 270855
Number of sequences better than 100.0: 66
Number of HSP's better than 100.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 270761
Number of HSP's gapped (non-prelim): 70
length of query: 242
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 104
effective length of database: 9,121,278,045
effective search space: 948612916680
effective search space used: 948612916680
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 75 (33.5 bits)