BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 025429
(253 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297831700|ref|XP_002883732.1| hypothetical protein ARALYDRAFT_480213 [Arabidopsis lyrata subsp.
lyrata]
gi|297329572|gb|EFH59991.1| hypothetical protein ARALYDRAFT_480213 [Arabidopsis lyrata subsp.
lyrata]
Length = 288
Score = 322 bits (824), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 158/242 (65%), Positives = 189/242 (78%), Gaps = 6/242 (2%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKKEDLSI-VSIPQ-- 57
MSTI + R +L+ + + ++F R W + +DS+SS T+KKE+LS+ +SIP
Sbjct: 1 MSTISIHRTELLRITHNRSRISRQRFRRTIPLWKLTIDSRSSDTSKKEELSVKISIPPQV 60
Query: 58 ---RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNE 114
R LRFDRLQP ++E DR EFGKFVAREA++DEEYWTAAWLRAESHWE R+NE
Sbjct: 61 DQLRPEGLRFDRLQPPEQEFGHEDRLEFGKFVAREAMVDEEYWTAAWLRAESHWEDRSNE 120
Query: 115 RYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIR 174
RYVDN+KRKFAEQEFNAIKRRC+G+ GQ+ CIVAVKK+E ++KR+V+KSVVGTLDLSIR
Sbjct: 121 RYVDNYKRKFAEQEFNAIKRRCKGMQGQKCSCIVAVKKEEKHIKRSVIKSVVGTLDLSIR 180
Query: 175 YLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
Y LQGE FPGE+V LF INR G NRYGYIANLCVAKSARRQGIA NML FAVESA+
Sbjct: 181 YFLQGETFPGEKVKSQLFCSINREGSNRYGYIANLCVAKSARRQGIACNMLRFAVESARL 240
Query: 235 NG 236
+G
Sbjct: 241 SG 242
>gi|30678254|ref|NP_671784.2| acyl-CoA N-acyltransferases-like protein [Arabidopsis thaliana]
gi|27413497|gb|AAO11666.1| hypothetical protein [Arabidopsis thaliana]
gi|61742576|gb|AAX55109.1| hypothetical protein At2g06025 [Arabidopsis thaliana]
gi|62318765|dbj|BAD93799.1| hypothetical protein [Arabidopsis thaliana]
gi|330250902|gb|AEC05996.1| acyl-CoA N-acyltransferases-like protein [Arabidopsis thaliana]
Length = 288
Score = 317 bits (811), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 157/242 (64%), Positives = 186/242 (76%), Gaps = 6/242 (2%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKKEDLSI-VSIP--- 56
MSTI + R L+ +++ R W + ++S+SS T+KKE+LS+ +SIP
Sbjct: 1 MSTISIHRTETLRITHARSRIFRQRYRRTIPLWKLTINSRSSDTSKKEELSVQISIPPQV 60
Query: 57 --QRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNE 114
R LRFDRLQP + E DRFEFGKFVAREA+LDEEYWTAAWLRAESHWE R+NE
Sbjct: 61 DQSRPEGLRFDRLQPPEPEFGHEDRFEFGKFVAREAMLDEEYWTAAWLRAESHWEDRSNE 120
Query: 115 RYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIR 174
RYVDN+KRKFAEQEFNAIKRRC+G+ GQ+ CIVAVKK+E ++KR+V+KSVVGTLDLSIR
Sbjct: 121 RYVDNYKRKFAEQEFNAIKRRCKGMQGQKCSCIVAVKKEEKHIKRSVIKSVVGTLDLSIR 180
Query: 175 YLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
Y LQGE FPGE+V LF IN+ G NRYGYIANLCVAKSARRQGIA NML FAVESA+
Sbjct: 181 YFLQGETFPGEKVKSQLFCSINQEGSNRYGYIANLCVAKSARRQGIACNMLRFAVESARL 240
Query: 235 NG 236
+G
Sbjct: 241 SG 242
>gi|356556386|ref|XP_003546507.1| PREDICTED: uncharacterized protein LOC100808675 [Glycine max]
Length = 302
Score = 309 bits (792), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 161/256 (62%), Positives = 187/256 (73%), Gaps = 22/256 (8%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTK-----------KED 49
MSTI + RP +G + K RI+A+WTM MDSK S TTK KE+
Sbjct: 4 MSTIPIHRPEFRTLFFNGS-PSPCKCLRITASWTMTMDSKFSPTTKNSNDNNYNLKKKEE 62
Query: 50 LSIVSIPQ---------RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAA 100
LS V IP SNDL+FDRLQPSD+EL + RFEFG+FVAREA+LDEEYWTAA
Sbjct: 63 LS-VQIPTPPISKVETLNSNDLQFDRLQPSDQELGRVKRFEFGQFVAREAVLDEEYWTAA 121
Query: 101 WLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRT 160
WLRAESHWE R +RYVDN+KRKFAEQEFNA+KRRC+ NG CI+ V+K++ N K +
Sbjct: 122 WLRAESHWEDRPYDRYVDNYKRKFAEQEFNALKRRCKVQNGDSCACIITVRKEQKNAKHS 181
Query: 161 VLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGI 220
VLKSVVGTLDL+IRYLLQGE +PGERV PLF INR P+RYGYIANLCV KS RRQGI
Sbjct: 182 VLKSVVGTLDLNIRYLLQGETYPGERVKAPLFCSINRTPPSRYGYIANLCVIKSVRRQGI 241
Query: 221 ASNMLYFAVESAKSNG 236
ASNM+ FA+E+AKSNG
Sbjct: 242 ASNMMSFAIEAAKSNG 257
>gi|388491720|gb|AFK33926.1| unknown [Lotus japonicus]
Length = 302
Score = 309 bits (791), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 156/266 (58%), Positives = 192/266 (72%), Gaps = 20/266 (7%)
Query: 2 STILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKK---------EDLSI 52
STI + RP +G H KF RI+A+WT +DSK S+T E+L +
Sbjct: 5 STIPIPRPEFHTLFFNGSPIPH-KFPRIAASWTTTLDSKFSQTMNSNSMKKMKKKEELPV 63
Query: 53 --VSIPQ-------RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLR 103
+S P S++L+FDRLQPSD EL++ +RFEFG+FVAREA+LDEEYWTAAWLR
Sbjct: 64 QQLSTPPVSKVETLSSDNLQFDRLQPSDHELVRENRFEFGQFVAREAVLDEEYWTAAWLR 123
Query: 104 AESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLK 163
AESHWE RT +RY+DN+KRKFA+QEFNA+KRRC+ G CI+ VKK++ N+K +V+K
Sbjct: 124 AESHWENRTYDRYIDNYKRKFADQEFNAVKRRCKVQTGDNCMCIITVKKEQKNLKHSVIK 183
Query: 164 SVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASN 223
SVVGTLDL+ RYLLQGE FPGERV P+F INR PN+YGYIANLCVAKSARR GIASN
Sbjct: 184 SVVGTLDLNFRYLLQGETFPGERVKAPIFCTINRTAPNKYGYIANLCVAKSARRHGIASN 243
Query: 224 MLYFAVESAKSNGKINQASVNKKHEN 249
M+YFAVE+AKSNG + Q V+ N
Sbjct: 244 MMYFAVETAKSNG-VTQVYVHVDRNN 268
>gi|356530427|ref|XP_003533783.1| PREDICTED: uncharacterized protein LOC100795426 [Glycine max]
Length = 304
Score = 306 bits (785), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 157/257 (61%), Positives = 185/257 (71%), Gaps = 22/257 (8%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKK------------- 47
MSTI + RP +G + K RI+A+W M MDSK S T K
Sbjct: 4 MSTIPIHRPEFCTLFFNGSPSPY-KCPRITASWIMAMDSKFSSTIKNNSDNDYNLKKKKD 62
Query: 48 EDLSI-VSIPQ-------RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTA 99
E+LS+ +S P SNDL+FDRLQPSD+EL + RFEFG+FVARE +LDEEYWTA
Sbjct: 63 EELSVQISTPPISKVETLNSNDLQFDRLQPSDQELGRVKRFEFGQFVARETVLDEEYWTA 122
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAESHWE R ERYVDN+KRKFAEQEFNA+KRRC+ NG CI+ V+K++ N K
Sbjct: 123 AWLRAESHWEDRPFERYVDNYKRKFAEQEFNALKRRCKVQNGDSCACIITVRKEQKNAKH 182
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
++LKSVVGTLDL+IRYLLQGE +PGERV PLF INR P+RYGYIANLCV KSARRQG
Sbjct: 183 SILKSVVGTLDLNIRYLLQGETYPGERVKAPLFCSINRTPPSRYGYIANLCVIKSARRQG 242
Query: 220 IASNMLYFAVESAKSNG 236
IASNM+ FA+E+AKSNG
Sbjct: 243 IASNMMSFAIEAAKSNG 259
>gi|297738568|emb|CBI27813.3| unnamed protein product [Vitis vinifera]
Length = 523
Score = 291 bits (746), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 146/208 (70%), Positives = 168/208 (80%), Gaps = 9/208 (4%)
Query: 37 MDSKSSRTTKKEDLSI----VSIPQ----RSNDLRFDRLQPSDRELLQHDRFEFGKFVAR 88
MDS SS T KKE+ SI S+PQ R ++L+FD+LQPSD E + R EFG F+AR
Sbjct: 271 MDSNSS-TRKKEEFSIEIQAPSVPQFRTERPSNLQFDQLQPSDEEFNEGSRAEFGPFMAR 329
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIV 148
A+LDEEYWTAAWLRAESHWEGR NERY +N+KRKFAEQEFNA+KRRCRG N Q++ CIV
Sbjct: 330 AAVLDEEYWTAAWLRAESHWEGRPNERYAENYKRKFAEQEFNALKRRCRGQNDQKYTCIV 389
Query: 149 AVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIAN 208
AVKK+E VK+TVLKSVVGTLDLSIRYLL G+ FPGE+V PLF INR ++YGY+AN
Sbjct: 390 AVKKEERGVKQTVLKSVVGTLDLSIRYLLNGDTFPGEQVKAPLFCSINRTRSSKYGYVAN 449
Query: 209 LCVAKSARRQGIASNMLYFAVESAKSNG 236
LCVAKS RRQGIASNML+FAVESAKS G
Sbjct: 450 LCVAKSVRRQGIASNMLHFAVESAKSIG 477
>gi|225444742|ref|XP_002278105.1| PREDICTED: uncharacterized protein LOC100254310 [Vitis vinifera]
Length = 254
Score = 290 bits (741), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 146/208 (70%), Positives = 168/208 (80%), Gaps = 9/208 (4%)
Query: 37 MDSKSSRTTKKEDLSI----VSIPQ----RSNDLRFDRLQPSDRELLQHDRFEFGKFVAR 88
MDS SS T KKE+ SI S+PQ R ++L+FD+LQPSD E + R EFG F+AR
Sbjct: 1 MDSNSS-TRKKEEFSIEIQAPSVPQFRTERPSNLQFDQLQPSDEEFNEGSRAEFGPFMAR 59
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIV 148
A+LDEEYWTAAWLRAESHWEGR NERY +N+KRKFAEQEFNA+KRRCRG N Q++ CIV
Sbjct: 60 AAVLDEEYWTAAWLRAESHWEGRPNERYAENYKRKFAEQEFNALKRRCRGQNDQKYTCIV 119
Query: 149 AVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIAN 208
AVKK+E VK+TVLKSVVGTLDLSIRYLL G+ FPGE+V PLF INR ++YGY+AN
Sbjct: 120 AVKKEERGVKQTVLKSVVGTLDLSIRYLLNGDTFPGEQVKAPLFCSINRTRSSKYGYVAN 179
Query: 209 LCVAKSARRQGIASNMLYFAVESAKSNG 236
LCVAKS RRQGIASNML+FAVESAKS G
Sbjct: 180 LCVAKSVRRQGIASNMLHFAVESAKSIG 207
>gi|224069136|ref|XP_002326283.1| predicted protein [Populus trichocarpa]
gi|222833476|gb|EEE71953.1| predicted protein [Populus trichocarpa]
Length = 257
Score = 279 bits (714), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 143/217 (65%), Positives = 169/217 (77%), Gaps = 14/217 (6%)
Query: 39 SKSSRTTKKEDLSIVSIPQRS-----------NDLRFDRLQPSDRELLQHDRFEFGKFVA 87
S SS T KKE+LSI +P S +DLRFDRLQ ++EL+ +FEFG+FVA
Sbjct: 5 SGSSETRKKEELSI-QVPATSISSSETQRFAGSDLRFDRLQVPEKELMHDRKFEFGQFVA 63
Query: 88 REALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCI 147
REA++DEEYWTAAWLRAESHWEGR N+RYVDN KRKFAEQEF+AIKRR L+GQ+ CI
Sbjct: 64 REAVIDEEYWTAAWLRAESHWEGR-NDRYVDNHKRKFAEQEFHAIKRRRTSLHGQKCRCI 122
Query: 148 VAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIA 207
+ V+K++ +VKRTVLK VVGTLDLS+R LL GE FPGERV PLF I+ GPNRYGY+A
Sbjct: 123 IMVRKEDKHVKRTVLKGVVGTLDLSVRCLLHGETFPGERVKAPLFCSIHGTGPNRYGYVA 182
Query: 208 NLCVAKSARRQGIASNMLYFAVESAKSNGKINQASVN 244
NLCVAKSARRQGIA+NML+FA+E KSNG I A V+
Sbjct: 183 NLCVAKSARRQGIATNMLHFAIELVKSNG-IEHAYVH 218
>gi|147801241|emb|CAN72326.1| hypothetical protein VITISV_041246 [Vitis vinifera]
Length = 928
Score = 279 bits (713), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 144/211 (68%), Positives = 165/211 (78%), Gaps = 12/211 (5%)
Query: 37 MDSKSSRTTKKEDLSI----VSIPQ----RSNDLRFDRLQPSDRELLQHDRFEFGKFVAR 88
MDS SS T KKE+ SI S+PQ R ++L+FD+LQPSD E + R EFG F+AR
Sbjct: 671 MDSNSS-TRKKEEFSIEIQAPSVPQFRTERPSNLQFDQLQPSDEEFNEGSRAEFGPFMAR 729
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIV 148
A+LDEEYWTAAWLRAE HWEGR NERY +N+KRKFAEQEFNA+KRRCRG N Q++ CIV
Sbjct: 730 AAVLDEEYWTAAWLRAEXHWEGRPNERYAENYKRKFAEQEFNALKRRCRGQNDQKYTCIV 789
Query: 149 A---VKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGY 205
A VKK+E VK+TVLKSVVGTLDLSIRYLL G FPGE+V PLF IN ++YGY
Sbjct: 790 ANYQVKKEERGVKQTVLKSVVGTLDLSIRYLLNGXTFPGEQVKAPLFCSINXTXSSKYGY 849
Query: 206 IANLCVAKSARRQGIASNMLYFAVESAKSNG 236
+ANLCVAKS RRQGIASNML+FAVESAKS G
Sbjct: 850 VANLCVAKSVRRQGIASNMLHFAVESAKSIG 880
>gi|449452562|ref|XP_004144028.1| PREDICTED: uncharacterized protein LOC101207838 [Cucumis sativus]
gi|449500482|ref|XP_004161109.1| PREDICTED: uncharacterized protein LOC101229607 [Cucumis sativus]
Length = 299
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 143/259 (55%), Positives = 179/259 (69%), Gaps = 14/259 (5%)
Query: 2 STILVQRPAVLKFHRDGGFRNHRKFSRISA--TWTMRMDSKSSRTTKKEDLSI-VSIPQR 58
+ I + RP L +D RNH KF R SA +W M MD KS +T KKE++SI +S P
Sbjct: 3 AAISIYRPEFLGSVQDRC-RNHLKFHRTSAFASWNMTMDYKSHQTMKKEEVSIQISTPLL 61
Query: 59 S--------NDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEG 110
+ L+FDR P D +L+ + EFG+FVAREA++DEE WTAAWLRAESHWE
Sbjct: 62 LPKLKPLAWSGLQFDRPPPDDEDLIHLRKLEFGQFVAREAVIDEELWTAAWLRAESHWEN 121
Query: 111 RTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLD 170
R N+RYVD+FKRKFAEQEFNAIK+RC G GQ CIV V+K++ ++KRTV+KSVV TLD
Sbjct: 122 RQNDRYVDSFKRKFAEQEFNAIKKRCGGQYGQTCTCIVTVRKEQKHIKRTVIKSVVATLD 181
Query: 171 LSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVE 230
L +R+L+ GE FPGER + IN+ PN+Y YI+NLCV K+ARRQGIA NML FAV
Sbjct: 182 LCLRHLMHGETFPGEREKSHVCS-INKEIPNKYAYISNLCVLKAARRQGIAGNMLKFAVL 240
Query: 231 SAKSNGKINQASVNKKHEN 249
+AKS G I Q V+ + N
Sbjct: 241 TAKSRG-IEQVYVHVRRNN 258
>gi|356545766|ref|XP_003541306.1| PREDICTED: uncharacterized protein LOC100784591 [Glycine max]
Length = 282
Score = 242 bits (618), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 129/220 (58%), Positives = 156/220 (70%), Gaps = 13/220 (5%)
Query: 26 FSRISATWTMRMDSKSSRTTKKEDLSIVSIPQRSN--------DLRFDRLQPSDRELLQH 77
F RIS ++ M + TKK+ V +P ++ DL F RLQPSD EL H
Sbjct: 17 FFRIS---SLAMTIDTDFLTKKKKQVSVQLPSLTSPVSTVRFCDLNFGRLQPSDDELDPH 73
Query: 78 DRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCR 137
+RFEFG FVAR+ALL EEYWTAAWLRAE W RT++ YV N KR FA+QEF+AIK+RC+
Sbjct: 74 NRFEFGNFVARQALLHEEYWTAAWLRAE-EWASRTHKLYVVNHKRNFADQEFDAIKKRCK 132
Query: 138 -GLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCIN 196
+GQ CI+ V+K + NVKR+++ SVVGTLDL+IRYL GE FPGE V+ P F I
Sbjct: 133 EQQDGQSSTCIITVRKQDKNVKRSIISSVVGTLDLNIRYLRLGETFPGEPVDAPRFCKIE 192
Query: 197 RRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
R +RYGYIANLCVAKS RR+GIASNMLYFAVESAKS+G
Sbjct: 193 RTPSSRYGYIANLCVAKSVRRKGIASNMLYFAVESAKSSG 232
>gi|356564758|ref|XP_003550615.1| PREDICTED: uncharacterized protein LOC100800028 [Glycine max]
Length = 281
Score = 228 bits (580), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/223 (56%), Positives = 151/223 (67%), Gaps = 10/223 (4%)
Query: 23 HRKFSR-ISATWTMRMDSKSSRTTKKE---DLSIVSIPQRSN----DLRFDRLQPSDREL 74
H F R M +D+ T KK+ LS + P S DL F RLQPSD EL
Sbjct: 14 HSSFFRPYPKCLAMTIDTDFLNTKKKQVSFQLSSLITPPVSTVRFCDLNFGRLQPSDEEL 73
Query: 75 LQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKR 134
H RFEFG FVAREALL+EEYW+AA LRAE W RT++ YV + +R FAEQEFNAIK+
Sbjct: 74 GPHKRFEFGNFVAREALLEEEYWSAACLRAE-EWANRTDKLYVVDRQRNFAEQEFNAIKK 132
Query: 135 RCRGL-NGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFG 193
RC+ L +G CI+ V+K + NVK +++SVVGTLDL+I YL +GE FPGE V+ P F
Sbjct: 133 RCKELQDGHSSTCIITVRKPQKNVKLPIIESVVGTLDLNIIYLRRGETFPGELVDSPPFC 192
Query: 194 CINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
I+R +RYGYIANLCVAKS R+G+AS MLYFAVESAKS G
Sbjct: 193 KIDRTASSRYGYIANLCVAKSLYRKGVASKMLYFAVESAKSTG 235
>gi|326496873|dbj|BAJ98463.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326521206|dbj|BAJ96806.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 309
Score = 200 bits (509), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 136/182 (74%), Gaps = 4/182 (2%)
Query: 56 PQRSNDLRFDRLQP-SDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNE 114
P +DLRF+RL+P +++ ++ R+ FG++VAREA++DEEYW AAWLRAE H+E ++ +
Sbjct: 84 PAEPSDLRFNRLRPPTEKSDCKYTRY-FGRYVAREAIVDEEYWIAAWLRAEDHYEDQSGD 142
Query: 115 RYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIR 174
RYV++FKRKFA QEF+A+K+RC G+++ C VAVK D+ V RTVL SVVGT+DL +R
Sbjct: 143 RYVESFKRKFASQEFHALKKRCSRQVGEKYTCFVAVKNDD--VTRTVLNSVVGTVDLCVR 200
Query: 175 YLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
+ L GE +P E N P + I + ++GY+ N+CVAK ARRQGIASNML A+++A+
Sbjct: 201 HPLYGETYPAEPGNMPFYSRIYQPDQPKFGYLTNVCVAKYARRQGIASNMLLLAIDAARL 260
Query: 235 NG 236
+G
Sbjct: 261 DG 262
>gi|115451085|ref|NP_001049143.1| Os03g0177000 [Oryza sativa Japonica Group]
gi|108706472|gb|ABF94267.1| GCN5-related N-acetyltransferase, putative, expressed [Oryza sativa
Japonica Group]
gi|113547614|dbj|BAF11057.1| Os03g0177000 [Oryza sativa Japonica Group]
gi|215701031|dbj|BAG92455.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218192192|gb|EEC74619.1| hypothetical protein OsI_10235 [Oryza sativa Indica Group]
gi|222624295|gb|EEE58427.1| hypothetical protein OsJ_09628 [Oryza sativa Japonica Group]
Length = 306
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 102/197 (51%), Positives = 140/197 (71%), Gaps = 4/197 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPSDREL-LQHDRFEFGKFVAREALLDEEYWTA 99
SS + +++ P +DLRF+RL+PS E ++ RF FG +VAREA++DEEYW A
Sbjct: 68 SSLSNPMSEVTTPFHPAAPSDLRFNRLRPSVEESDCKYKRF-FGCYVAREAIIDEEYWIA 126
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAE+ +E ++++RYV++FKRKFA QEF+A+KRRC L G+++ C VAVK D ++KR
Sbjct: 127 AWLRAENRYEDQSSDRYVESFKRKFASQEFHALKRRCSKLQGEKYICFVAVKND--DLKR 184
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
TVL SVVGTLD+ IR+ L GE FP E I + ++GY+ N+CVAK ARRQG
Sbjct: 185 TVLNSVVGTLDVCIRHPLHGETFPAEPGKSSFHCRIYQPDQPKFGYLTNVCVAKYARRQG 244
Query: 220 IASNMLYFAVESAKSNG 236
IASNML A+++A+ NG
Sbjct: 245 IASNMLLLAIDAARLNG 261
>gi|223975831|gb|ACN32103.1| unknown [Zea mays]
Length = 285
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 100/200 (50%), Positives = 138/200 (69%), Gaps = 4/200 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW A
Sbjct: 66 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGCYVAREAVIDEEYWIA 124
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAE +E + RYV++FKRKFA +EF+A+K+RC + +++ C+VAVK D+ ++R
Sbjct: 125 AWLRAEDQYENESGNRYVESFKRKFASKEFHALKKRCNTQHREKYICLVAVKNDD--IRR 182
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
TVL SVVGTLD+ +R+ L GE FP E P L I + ++GY+ N+CVAK ARRQG
Sbjct: 183 TVLNSVVGTLDVCVRHPLYGEKFPEEPGKPSLHCRIYQPDQPKFGYVTNVCVAKYARRQG 242
Query: 220 IASNMLYFAVESAKSNGKIN 239
IASNML A+++AK NG N
Sbjct: 243 IASNMLLLAIDAAKLNGAEN 262
>gi|413956846|gb|AFW89495.1| hypothetical protein ZEAMMB73_408635 [Zea mays]
Length = 304
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 100/200 (50%), Positives = 138/200 (69%), Gaps = 4/200 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW A
Sbjct: 66 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGCYVAREAVIDEEYWIA 124
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAE +E + RYV++FKRKFA +EF+A+K+RC + +++ C+VAVK D+ ++R
Sbjct: 125 AWLRAEDQYENESGNRYVESFKRKFASKEFHALKKRCNTQHREKYICLVAVKNDD--IRR 182
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
TVL SVVGTLD+ +R+ L GE FP E P L I + ++GY+ N+CVAK ARRQG
Sbjct: 183 TVLNSVVGTLDVCVRHPLYGEKFPEEPGKPSLHCRIYQPDQPKFGYVTNVCVAKYARRQG 242
Query: 220 IASNMLYFAVESAKSNGKIN 239
IASNML A+++AK NG N
Sbjct: 243 IASNMLLLAIDAAKLNGAEN 262
>gi|413956845|gb|AFW89494.1| hypothetical protein ZEAMMB73_408635 [Zea mays]
Length = 266
Score = 194 bits (493), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 99/197 (50%), Positives = 137/197 (69%), Gaps = 4/197 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW A
Sbjct: 66 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGCYVAREAVIDEEYWIA 124
Query: 100 AWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKR 159
AWLRAE +E + RYV++FKRKFA +EF+A+K+RC + +++ C+VAVK D+ ++R
Sbjct: 125 AWLRAEDQYENESGNRYVESFKRKFASKEFHALKKRCNTQHREKYICLVAVKNDD--IRR 182
Query: 160 TVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQG 219
TVL SVVGTLD+ +R+ L GE FP E P L I + ++GY+ N+CVAK ARRQG
Sbjct: 183 TVLNSVVGTLDVCVRHPLYGEKFPEEPGKPSLHCRIYQPDQPKFGYVTNVCVAKYARRQG 242
Query: 220 IASNMLYFAVESAKSNG 236
IASNML A+++AK NG
Sbjct: 243 IASNMLLLAIDAAKLNG 259
>gi|357113782|ref|XP_003558680.1| PREDICTED: uncharacterized protein LOC100839034 [Brachypodium
distachyon]
Length = 308
Score = 193 bits (491), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 93/181 (51%), Positives = 130/181 (71%), Gaps = 2/181 (1%)
Query: 56 PQRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNER 115
P +DLRF+RL+PS E + FG +VAREA++DEEYW AAWLRAE +E ++ +R
Sbjct: 83 PAEPSDLRFNRLRPSVEESDCKYKRLFGCYVAREAIMDEEYWIAAWLRAEHRYEDQSGDR 142
Query: 116 YVDNFKRKFAEQEFNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRY 175
YV++FKRKFA QEF+A+K+RC G+++ C VAVK D+ + RTV+ SVVGTLD+ +R+
Sbjct: 143 YVESFKRKFASQEFHALKKRCSNQVGEKYTCFVAVKNDD--LTRTVVNSVVGTLDVCVRH 200
Query: 176 LLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSN 235
L GE +P E N P + I + ++GY+ N+CVAK ARRQGIA+NML A+++A+ N
Sbjct: 201 PLHGEKYPEEPGNSPFYARIYQPHQPKFGYLTNVCVAKYARRQGIATNMLLLAIDAARFN 260
Query: 236 G 236
G
Sbjct: 261 G 261
>gi|225435967|ref|XP_002270467.1| PREDICTED: uncharacterized protein LOC100244484 [Vitis vinifera]
Length = 289
Score = 193 bits (490), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 103/183 (56%), Positives = 131/183 (71%), Gaps = 6/183 (3%)
Query: 58 RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYV 117
R +L F+RLQ +D E R FG+FVAREA+LDEEYWTAAWLRAE+HWE + R+V
Sbjct: 63 RLPELSFNRLQLTDEEYCGLQRRNFGRFVAREAVLDEEYWTAAWLRAEAHWESLSYMRHV 122
Query: 118 DNFKRKFAEQEFNAIKRRCRGLNGQ--RHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRY 175
DN+KRK+AE+EF A+KRRC G +G + +C VAV K+E N++RTVL SVVGTLDLSIR
Sbjct: 123 DNYKRKYAEEEFYALKRRCAGQDGNLLKCFCFVAV-KEEKNIRRTVLNSVVGTLDLSIRQ 181
Query: 176 LLQGENFPGE--RVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
++GE +PGE R++ L ++Y YIAN+CV+K ARRQGIA NML A + A
Sbjct: 182 YIRGETYPGEIKRLSSVL-ASPEPFDAHKYAYIANVCVSKFARRQGIALNMLSLANDVAS 240
Query: 234 SNG 236
S G
Sbjct: 241 SAG 243
>gi|296083942|emb|CBI24330.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 193 bits (490), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 101/182 (55%), Positives = 129/182 (70%), Gaps = 4/182 (2%)
Query: 58 RSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYV 117
R +L F+RLQ +D E R FG+FVAREA+LDEEYWTAAWLRAE+HWE + R+V
Sbjct: 62 RLPELSFNRLQLTDEEYCGLQRRNFGRFVAREAVLDEEYWTAAWLRAEAHWESLSYMRHV 121
Query: 118 DNFKRKFAEQEFNAIKRRCRGLNGQ--RHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRY 175
DN+KRK+AE+EF A+KRRC G +G + +C VAV K+E N++RTVL SVVGTLDLSIR
Sbjct: 122 DNYKRKYAEEEFYALKRRCAGQDGNLLKCFCFVAV-KEEKNIRRTVLNSVVGTLDLSIRQ 180
Query: 176 LLQGENFPGE-RVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKS 234
++GE +PGE + + ++Y YIAN+CV+K ARRQGIA NML A + A S
Sbjct: 181 YIRGETYPGEIKRLSSVLASPEPFDAHKYAYIANVCVSKFARRQGIALNMLSLANDVASS 240
Query: 235 NG 236
G
Sbjct: 241 AG 242
>gi|302796302|ref|XP_002979913.1| hypothetical protein SELMODRAFT_444359 [Selaginella moellendorffii]
gi|300152140|gb|EFJ18783.1| hypothetical protein SELMODRAFT_444359 [Selaginella moellendorffii]
Length = 346
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 106/173 (61%), Gaps = 25/173 (14%)
Query: 69 PSDRELLQHDRFEFGK----FVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKF 124
P++ LL R F + ++ REA DE++WTA+WLR+E ++E + ++R+V ++K+KF
Sbjct: 39 PANARLLTGCRDSFNQQPNFYLVREAQSDEDFWTASWLRSECYFEDQGSDRFVQSYKKKF 98
Query: 125 AEQEFNAIKRRCRGLNGQ--RHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGEN- 181
+EQEF +KRRC G G R C++AV D ++L V+GT DLS+R L G++
Sbjct: 99 SEQEFITLKRRCAGRYGNCLRCTCLLAVLNDGS----SLLNRVIGTTDLSLRQALPGDDS 154
Query: 182 -FPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
G + P YGYIAN+CV++S RR+G+AS++L +V+ AK
Sbjct: 155 RTLGIQTQP-------------YGYIANVCVSRSHRRRGVASSLLESSVQVAK 194
>gi|302811410|ref|XP_002987394.1| hypothetical protein SELMODRAFT_446972 [Selaginella moellendorffii]
gi|300144800|gb|EFJ11481.1| hypothetical protein SELMODRAFT_446972 [Selaginella moellendorffii]
Length = 530
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 104/171 (60%), Gaps = 17/171 (9%)
Query: 69 PSDRELLQHDRFEFGK----FVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKF 124
P++ LL R F + ++ REA DE++WTA+WLR+E ++E + ++R+V ++K+KF
Sbjct: 39 PANARLLTGCRDSFNQQPNFYLVREARSDEDFWTASWLRSECYFEDQGSDRFVQSYKKKF 98
Query: 125 AEQEFNAIKRRCRGLNGQ--RHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENF 182
+EQEF +KRRC G G R C++AV D ++L V+GT DLS+R L
Sbjct: 99 SEQEFITLKRRCAGRYGNCLRCTCLLAVLND----GSSLLNRVIGTTDLSLRQALPTAQ- 153
Query: 183 PGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAK 233
R + G + YGYIAN+CV++S RR+G+AS++L AV+ AK
Sbjct: 154 --PRDDSRTLGIQTQ----PYGYIANVCVSRSHRRRGVASSLLESAVQVAK 198
>gi|168029391|ref|XP_001767209.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681464|gb|EDQ67890.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 331
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 84/152 (55%), Gaps = 19/152 (12%)
Query: 82 FGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNG 141
+ F EA L+ E AAWLRA+++ E + RYVD+FK+KFAEQEF A+KRR G G
Sbjct: 112 YNHFDIHEARLEYELKDAAWLRADAYSEQQPFTRYVDSFKKKFAEQEFAALKRRLSGQPG 171
Query: 142 QRHYCIVAVKKDEGNVKRTVL---KSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRR 198
Y + D V + V+GTLD S+ GE+F G R+
Sbjct: 172 S--YVSNTLPTDSIEVDWQDCEKDRQVLGTLDFSLHRPSPGESFGGHRL----------- 218
Query: 199 GPNRYGYIANLCVAKSARRQGIASNMLYFAVE 230
YGYIAN+CV KSAR+QGIAS +L AV+
Sbjct: 219 ---PYGYIANVCVEKSARKQGIASALLERAVQ 247
>gi|356525559|ref|XP_003531392.1| PREDICTED: uncharacterized protein LOC100790837 [Glycine max]
Length = 92
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/77 (57%), Positives = 51/77 (66%), Gaps = 9/77 (11%)
Query: 45 TKKEDLSIVSIPQRSN--------DLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEY 96
TKK+ V +P ++ DL F RLQPSD EL H+RFEFG FVAR+ALL EEY
Sbjct: 9 TKKKKQVSVQLPSLTSPVSTVRFCDLNFGRLQPSDDELDPHNRFEFGNFVARQALLHEEY 68
Query: 97 WTAAWLRAESHWEGRTN 113
WTAAWLRAE W RT+
Sbjct: 69 WTAAWLRAE-EWASRTD 84
>gi|226509785|ref|NP_001141650.1| uncharacterized protein LOC100273775 [Zea mays]
gi|194705416|gb|ACF86792.1| unknown [Zea mays]
gi|413956844|gb|AFW89493.1| hypothetical protein ZEAMMB73_408635 [Zea mays]
Length = 140
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW A
Sbjct: 66 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGCYVAREAVIDEEYWIA 124
Query: 100 AWLRAESHWEGRTNER 115
AWLRAE +E + R
Sbjct: 125 AWLRAEDQYENESGNR 140
>gi|413916376|gb|AFW56308.1| hypothetical protein ZEAMMB73_275268 [Zea mays]
Length = 195
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 2/76 (2%)
Query: 41 SSRTTKKEDLSIVSIPQRSNDLRFDRLQPS-DRELLQHDRFEFGKFVAREALLDEEYWTA 99
SS + K + +P +DLRF+RL+PS D ++ R FG +VAREA++DEEYW
Sbjct: 121 SSLSNKASVVPTPLLPSGQSDLRFNRLRPSIDESDCKYKRL-FGYYVAREAVIDEEYWIV 179
Query: 100 AWLRAESHWEGRTNER 115
AWLRAE +E + R
Sbjct: 180 AWLRAEDQYENESGNR 195
>gi|294463954|gb|ADE77498.1| unknown [Picea sitchensis]
Length = 55
Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats.
Identities = 27/36 (75%), Positives = 32/36 (88%)
Query: 201 NRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
+RYGY+AN+CV+K ARRQGIASNML AVE AKS+G
Sbjct: 10 HRYGYVANVCVSKFARRQGIASNMLQLAVEVAKSSG 45
>gi|255636981|gb|ACU18823.1| unknown [Glycine max]
Length = 106
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 53/104 (50%), Gaps = 22/104 (21%)
Query: 1 MSTILVQRPAVLKFHRDGGFRNHRKFSRISATWTMRMDSKSSRTTKK------------- 47
MSTI + RP +G + K RI+A+W M MDSK S T K
Sbjct: 4 MSTIPIHRPEFCTLFFNGSPSPY-KCPRITASWIMAMDSKFSSTIKNNSDNDYNLKKKKD 62
Query: 48 EDLSI-VSIPQ-------RSNDLRFDRLQPSDRELLQHDRFEFG 83
E+LS+ +S P SNDL+FDRLQPSD+EL + R + G
Sbjct: 63 EELSVQISTPPISKVETLNSNDLQFDRLQPSDQELGRVKRQQHG 106
>gi|255644778|gb|ACU22891.1| unknown [Glycine max]
Length = 240
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 72/154 (46%), Gaps = 27/154 (17%)
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG--LNGQRHYC 146
E+ ++E W AA LR S + R + + + R AE+EF A+K R G + +R C
Sbjct: 33 ESFYEDELWAAACLRVRSFNQFRPDAFGILDHTRYLAEREFEALKERVSGKRMGFRRVSC 92
Query: 147 I--------VAVKKDE--GNVKRTV---LKSVVGTLDLSIRYLLQGENFPGERVN--PPL 191
I +A D+ + K + + VVGTLDL+ Q + P E V P +
Sbjct: 93 INASLPLSHIATLSDDLCSSCKFSTNGEDRIVVGTLDLN-----QCLSLPDEIVGAKPEV 147
Query: 192 FGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
G R Y++N+CVAK R G+A +L
Sbjct: 148 IGADITR-----AYLSNVCVAKELHRNGLAYALL 176
>gi|357479907|ref|XP_003610239.1| hypothetical protein MTR_4g129490 [Medicago truncatula]
gi|355511294|gb|AES92436.1| hypothetical protein MTR_4g129490 [Medicago truncatula]
Length = 317
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 29/155 (18%)
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNG--QRHYC 146
E+ D+E+W A+ LR S + R + + + + AE+EF A+K R G N +R C
Sbjct: 110 ESFYDDEFWAASSLRVRSFNQFRPDTFGLQDHAKYLAEREFEALKERVSGKNKAFRRVSC 169
Query: 147 IVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGEN--------------FPGERVN--PP 190
I A ++ + L S+ L S +Y GE+ P E V P
Sbjct: 170 INA------SLPMSHLSSLYDDLCSSCKYSASGEDRVVVGSLDLNQCLSLPDEIVGMKPE 223
Query: 191 LFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+ G R Y++N+CVA+ R G+A +L
Sbjct: 224 VSGADTTR-----AYLSNVCVARELHRNGLAYELL 253
>gi|388514139|gb|AFK45131.1| unknown [Medicago truncatula]
Length = 265
Score = 50.4 bits (119), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 29/155 (18%)
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNG--QRHYC 146
E+ D+E+W A+ LR S + R + + + + AE+EF A+K R G N +R C
Sbjct: 58 ESFYDDEFWAASSLRVRSFNQFRPDTFGLQDHAKYLAEREFEALKERVSGKNKAFRRVSC 117
Query: 147 IVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGEN--------------FPGERVN--PP 190
I A ++ + L S+ L S +Y GE+ P E V P
Sbjct: 118 INA------SLPMSHLSSLYDDLCSSCKYSASGEDRVVVGSLDLNQCLSLPDEIVGMKPE 171
Query: 191 LFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+ G R Y++N+CVA+ R G+A +L
Sbjct: 172 VSGADTTR-----AYLSNVCVARELHRNGLAYELL 201
>gi|217069962|gb|ACJ83341.1| unknown [Medicago truncatula]
Length = 223
Score = 50.4 bits (119), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 29/155 (18%)
Query: 89 EALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNG--QRHYC 146
E+ D+E+W A+ LR S + R + + + + AE+EF A+K R G N +R C
Sbjct: 58 ESFYDDEFWAASSLRVRSFNQFRPDTFGLQDHAKYLAEREFEALKERVSGKNKAFRRVSC 117
Query: 147 IVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGEN--------------FPGERVN--PP 190
I A ++ + L S+ L S +Y GE+ P E V P
Sbjct: 118 INA------SLPMSHLSSLYDDLCSSCKYSASGEDRVVVGSLDLNQCLSLPDEIVGMKPE 171
Query: 191 LFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+ G R Y++N+CVA+ R G+A +L
Sbjct: 172 VSGADTTR-----AYLSNVCVARELHRNGLAYELL 201
>gi|255070783|ref|XP_002507473.1| gcn5-related n-acetyltransferase family protein [Micromonas sp.
RCC299]
gi|226522748|gb|ACO68731.1| gcn5-related n-acetyltransferase family protein [Micromonas sp.
RCC299]
Length = 359
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 75/175 (42%), Gaps = 29/175 (16%)
Query: 68 QPSDRELLQHDR-FEFGKFVAREALLDEEYWTAAWLRAESHW---EGRTN----ERYVDN 119
P EL R FE + A E+ AA LRA + + E R R++
Sbjct: 107 HPYTEELATKIRNFETSQISVITAESFSEFSVAAELRATAFYDDLEARQALPFPSRFIAT 166
Query: 120 FKRKFAEQEFNAIKRRCRGLNG--QRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLL 177
F+R+FA++E A+K R G +R C++A D G +VG LD+S+R
Sbjct: 167 FRREFAQRECLALKERTAHFTGTLRRCLCLMARLPDLG---------LVGCLDISVRVGP 217
Query: 178 QGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESA 232
G G CI Y YI N+ V + ARR G AS ML + E A
Sbjct: 218 CGSQVNGL--------CIPDN--EEYVYIDNVAVDERARRLGSASAMLEASSEIA 262
>gi|384245237|gb|EIE18732.1| hypothetical protein COCSUDRAFT_60039 [Coccomyxa subellipsoidea
C-169]
Length = 394
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 61/122 (50%), Gaps = 24/122 (19%)
Query: 115 RYVDNFKRKFAEQEFNAIKR--RCR-GLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDL 171
RYV +FKR+F EQE ++K RCR G + C+VAV ++EG+ +V+GTLD
Sbjct: 241 RYVGSFKRQFKEQEARSLKERIRCRPGSDTPECVCLVAV-EEEGDS-----GTVMGTLD- 293
Query: 172 SIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVES 231
E PP +R P Y+ N+ V++ R++GI ++ A +
Sbjct: 294 ------------AEPPGPPT--PKHRHIPEGATYLLNVVVSQKHRKRGIGRALMAAAGKL 339
Query: 232 AK 233
AK
Sbjct: 340 AK 341
>gi|384245461|gb|EIE18955.1| hypothetical protein COCSUDRAFT_49126 [Coccomyxa subellipsoidea
C-169]
Length = 171
Score = 47.0 bits (110), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 57/121 (47%), Gaps = 28/121 (23%)
Query: 125 AEQEFNAIKRRCRGLNGQRH-----YCIVAV--------KKDEGNVKRTVLKSVVGTLDL 171
++EF A+K + R L R C+VAV ++ ++VVGTLDL
Sbjct: 3 GDEEFEALKEQ-RDLRNARQPNENSACLVAVWPSESLEESPSSLGLQAIGEEAVVGTLDL 61
Query: 172 SIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVES 231
+ GE G+ NP Y+AN+CVAK+A+RQG+ +++ AV+
Sbjct: 62 YAIRAMPGEVLIGDCSNP--------------AYLANVCVAKAAQRQGVGRELIHNAVDL 107
Query: 232 A 232
A
Sbjct: 108 A 108
>gi|159487811|ref|XP_001701916.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281135|gb|EDP06891.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1037
Score = 46.6 bits (109), Expect = 0.009, Method: Composition-based stats.
Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 10/73 (13%)
Query: 165 VVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGP-NRYGYIANLCVAKSARRQGIASN 223
VVG+LDL++ G P E L G + P +R Y++N+CVA +ARR G+A
Sbjct: 182 VVGSLDLNV-----GHTLPSEE----LIGRQPKEDPRHRRAYLSNVCVAPAARRMGLARA 232
Query: 224 MLYFAVESAKSNG 236
+L A E A+S G
Sbjct: 233 LLRVAEEEARSKG 245
>gi|18417112|ref|NP_567795.1| GCN5-related N-acetyltransferase (GNAT) family protein [Arabidopsis
thaliana]
gi|15081771|gb|AAK82540.1| AT4g28030/T13J8_140 [Arabidopsis thaliana]
gi|22137098|gb|AAM91394.1| At4g28030/T13J8_140 [Arabidopsis thaliana]
gi|332660024|gb|AEE85424.1| GCN5-related N-acetyltransferase (GNAT) family protein [Arabidopsis
thaliana]
Length = 274
Score = 45.1 bits (105), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 79/196 (40%), Gaps = 39/196 (19%)
Query: 51 SIVSIPQRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEG 110
S +SIP S LRF + S + + FV E++ ++E W AA LR + E
Sbjct: 28 SYLSIP--SLKLRFRPVAASSH--ICAPAIDKSTFVISESVSEDELWAAACLRVRTFNEL 83
Query: 111 RTNERYVDNFKRKFAEQEFNAIKRRCRGL--NGQRHYCI-----------------VAVK 151
+ + + +R AE+EF A+K R G R CI A K
Sbjct: 84 NPSAYNIQDHRRYLAEREFEALKERTSGKREGFTRVACINATLPLSQLSSSFEDLCSACK 143
Query: 152 KDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGE--RVNPPLFGCINRRGPNRYGYIANL 209
+G R VVG+LDL+ Q P E P G R Y++N+
Sbjct: 144 FSDGIEDRV----VVGSLDLN-----QCRWLPDEIAGTKPEGIGVDFAR-----AYLSNV 189
Query: 210 CVAKSARRQGIASNML 225
CVAK R G+ ++
Sbjct: 190 CVAKELHRNGVGYKLI 205
>gi|21536955|gb|AAM61296.1| unknown [Arabidopsis thaliana]
Length = 274
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 79/196 (40%), Gaps = 39/196 (19%)
Query: 51 SIVSIPQRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEG 110
S +SIP S LRF + S + + FV E++ ++E W AA LR + E
Sbjct: 28 SYLSIP--SLKLRFRPVAASSH--ICAPAIDKSTFVISESVSEDELWAAACLRVRTFNEL 83
Query: 111 RTNERYVDNFKRKFAEQEFNAIKRRCRGL--NGQRHYCI-----------------VAVK 151
+ + + +R AE+EF A+K R G R CI A K
Sbjct: 84 NPSAYNIQDHRRYLAEREFEALKERTSGKREGFTRVACINATLPLSQLSSSSEDLCSACK 143
Query: 152 KDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGE--RVNPPLFGCINRRGPNRYGYIANL 209
+G R VVG+LDL+ Q P E P G R Y++N+
Sbjct: 144 FSDGIEDRV----VVGSLDLN-----QCRWLPDEIAGTKPEGIGVDFAR-----AYLSNV 189
Query: 210 CVAKSARRQGIASNML 225
CVAK R G+ ++
Sbjct: 190 CVAKELHRNGVGYKLI 205
>gi|79325293|ref|NP_001031734.1| GCN5-related N-acetyltransferase (GNAT) family protein [Arabidopsis
thaliana]
gi|222423521|dbj|BAH19730.1| AT4G28030 [Arabidopsis thaliana]
gi|332660025|gb|AEE85425.1| GCN5-related N-acetyltransferase (GNAT) family protein [Arabidopsis
thaliana]
Length = 263
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 79/196 (40%), Gaps = 39/196 (19%)
Query: 51 SIVSIPQRSNDLRFDRLQPSDRELLQHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEG 110
S +SIP S LRF + S + + FV E++ ++E W AA LR + E
Sbjct: 28 SYLSIP--SLKLRFRPVAASSH--ICAPAIDKSTFVISESVSEDELWAAACLRVRTFNEL 83
Query: 111 RTNERYVDNFKRKFAEQEFNAIKRRCRGL--NGQRHYCI-----------------VAVK 151
+ + + +R AE+EF A+K R G R CI A K
Sbjct: 84 NPSAYNIQDHRRYLAEREFEALKERTSGKREGFTRVACINATLPLSQLSSSFEDLCSACK 143
Query: 152 KDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGE--RVNPPLFGCINRRGPNRYGYIANL 209
+G R VVG+LDL+ Q P E P G R Y++N+
Sbjct: 144 FSDGIEDRV----VVGSLDLN-----QCRWLPDEIAGTKPEGIGVDFAR-----AYLSNV 189
Query: 210 CVAKSARRQGIASNML 225
CVAK R G+ ++
Sbjct: 190 CVAKELHRNGVGYKLI 205
>gi|302844083|ref|XP_002953582.1| hypothetical protein VOLCADRAFT_106017 [Volvox carteri f.
nagariensis]
gi|300260991|gb|EFJ45206.1| hypothetical protein VOLCADRAFT_106017 [Volvox carteri f.
nagariensis]
Length = 382
Score = 44.3 bits (103), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 42/168 (25%), Positives = 68/168 (40%), Gaps = 42/168 (25%)
Query: 102 LRAESHWEGRTNERYVDNFKRKFAEQ-------------------EFNAIKRRCRGLNGQ 142
++A + WE T + VD+ + Q E A + C G N Q
Sbjct: 135 MKANAEWESVTAKVEVDDVDPGYGRQATAPTTITTTVTASEGDGAELPAASQHC-GSNDQ 193
Query: 143 RHYCIVA-----------VKKDEGNVKRTVLKS---VVGTLDLSIRYLLQGENFPGERVN 188
++A + + G+ + VVG+LDL++ + L E G
Sbjct: 194 ESGALLAALRSCLDACAQLPAEPGSSASASPRPRRLVVGSLDLNVGHTLPSEELIGRH-- 251
Query: 189 PPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
P F +RR Y++N+CVA +ARR G+A ++L E A+ G
Sbjct: 252 -PQFDPRHRRA-----YLSNVCVAPAARRLGVARSLLRHVEEVARGAG 293
>gi|255585013|ref|XP_002533217.1| N-acetyltransferase, putative [Ricinus communis]
gi|223526974|gb|EEF29170.1| N-acetyltransferase, putative [Ricinus communis]
Length = 266
Score = 44.3 bits (103), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 63/161 (39%), Gaps = 34/161 (21%)
Query: 85 FVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG--LNGQ 142
V E ++E W A+ LR S + + + K+ AE+EF A+K R G +
Sbjct: 53 LVIAETEAEDELWAASCLRVRSFHNFHDSSFSIQDHKKYLAEREFEAVKERISGKRTGFR 112
Query: 143 RHYCIVAVKKDEGNVKRTVL------------KSVVGTLDLSIRYLLQGENFPGERVNPP 190
R CI A L + VVGTLDL+ Q P E
Sbjct: 113 RVSCINATLPSSQLSDYDDLCTECKYTNNGEDRVVVGTLDLN-----QCLRLPDE----- 162
Query: 191 LFGCINRRGPNRYG------YIANLCVAKSARRQGIASNML 225
I + P G Y++N+CVAK RQG+ ++
Sbjct: 163 ----ITGKKPEGIGADFLRAYLSNVCVAKELHRQGLGYELI 199
>gi|224088384|ref|XP_002308434.1| predicted protein [Populus trichocarpa]
gi|222854410|gb|EEE91957.1| predicted protein [Populus trichocarpa]
Length = 249
Score = 43.5 bits (101), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 62/149 (41%), Gaps = 24/149 (16%)
Query: 93 DEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG----------LNGQ 142
+++ W AA LR S E + + + + KR AE+EF A+K R G LN
Sbjct: 47 EDQLWAAACLRVRSFHEFKPSTFGIQDHKRYLAEREFEALKERIAGKRTGFNRVSCLNAS 106
Query: 143 RHYCIVAVKKDEGNVKRTVL------KSVVGTLDLSIRYLLQGENFPGERVNPPLFGCIN 196
+ D+ + + VVGTLD++ Q + P E G
Sbjct: 107 LPLSQLLSLPDDDLCAQCKFSENGEDRVVVGTLDVN-----QSMSLPDEITGMKPEGI-- 159
Query: 197 RRGPNRYGYIANLCVAKSARRQGIASNML 225
G GY++N+CVA R G+ +++
Sbjct: 160 -EGQFARGYLSNVCVANELHRNGLGYDLV 187
>gi|297799176|ref|XP_002867472.1| hypothetical protein ARALYDRAFT_328890 [Arabidopsis lyrata subsp.
lyrata]
gi|297313308|gb|EFH43731.1| hypothetical protein ARALYDRAFT_328890 [Arabidopsis lyrata subsp.
lyrata]
Length = 275
Score = 43.1 bits (100), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 66/162 (40%), Gaps = 35/162 (21%)
Query: 85 FVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGL--NGQ 142
FV E++ ++E W AA LR + E + + + +R AE+EF A+K R G
Sbjct: 58 FVISESVSEDELWAAACLRVRTFNELNPSAYNIQDHRRYLAEREFEALKERTSGKREGFT 117
Query: 143 RHYCIVAV-----------------KKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGE 185
R CI A K +G R VVG+LDL+ Q P E
Sbjct: 118 RVACINATLPLSQLSSSSEDLCSSCKFSDGIEDRV----VVGSLDLN-----QCRWLPDE 168
Query: 186 --RVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
P G R Y++N+CVAK R G+ ++
Sbjct: 169 IAGTKPEGIGVDFAR-----AYLSNVCVAKELHRNGVGYKLI 205
>gi|302828192|ref|XP_002945663.1| hypothetical protein VOLCADRAFT_120146 [Volvox carteri f.
nagariensis]
gi|300268478|gb|EFJ52658.1| hypothetical protein VOLCADRAFT_120146 [Volvox carteri f.
nagariensis]
Length = 459
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 66/139 (47%), Gaps = 23/139 (16%)
Query: 94 EEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQ-RHY--CIVAV 150
+E A LRA++++ R+V + K+KF EQE ++++R L+ Q + Y C+VA+
Sbjct: 260 DELRQVATLRADAYY-AENQSRFVGSLKKKFVEQEVESLQQRTTILSRQGKPYSECLVAI 318
Query: 151 KKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLC 210
G +V+ +D+ + L G + G + P GC Y+ N+
Sbjct: 319 DSASG--------AVLACIDMRLPAALNGTHPHGVPQDDP-SGC----------YLLNVV 359
Query: 211 VAKSARRQGIASNMLYFAV 229
V + R QG+ ++ A+
Sbjct: 360 VREDVRGQGLGRGIMQAAM 378
>gi|434389140|ref|YP_007099751.1| acetyltransferase [Chamaesiphon minutus PCC 6605]
gi|428020130|gb|AFY96224.1| acetyltransferase [Chamaesiphon minutus PCC 6605]
Length = 202
Score = 42.0 bits (97), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 23/95 (24%)
Query: 142 QRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPN 201
Q+ C +AV D G V+GT+++ IR N+ + P+
Sbjct: 79 QQQVCSIAVYADRGQ------SQVIGTIEVGIR----TTNY-------------RQPKPH 115
Query: 202 RYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
RY YI+NL V++ RR+G+A +L E KS G
Sbjct: 116 RYPYISNLAVSRDFRRRGVAQQLLIGCEELTKSWG 150
>gi|209524724|ref|ZP_03273271.1| GCN5-related N-acetyltransferase [Arthrospira maxima CS-328]
gi|376004001|ref|ZP_09781787.1| GCN5-related N-acetyltransferase [Arthrospira sp. PCC 8005]
gi|423067739|ref|ZP_17056529.1| GCN5-related N-acetyltransferase [Arthrospira platensis C1]
gi|209494868|gb|EDZ95176.1| GCN5-related N-acetyltransferase [Arthrospira maxima CS-328]
gi|375327624|emb|CCE17540.1| GCN5-related N-acetyltransferase [Arthrospira sp. PCC 8005]
gi|406710844|gb|EKD06047.1| GCN5-related N-acetyltransferase [Arthrospira platensis C1]
Length = 205
Score = 42.0 bits (97), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 22/84 (26%)
Query: 146 CIVAVKKDE----GNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPN 201
C+VA D GN+ V + +VGT+++++R NP G +
Sbjct: 76 CLVAAIADSPHKLGNIPLNVPEHLVGTVEMALR-----------SHNPWSIG-------D 117
Query: 202 RYGYIANLCVAKSARRQGIASNML 225
RY YI+NL V +ARRQG+A +L
Sbjct: 118 RYPYISNLAVHPTARRQGVARELL 141
>gi|225445718|ref|XP_002270456.1| PREDICTED: uncharacterized protein LOC100246646 [Vitis vinifera]
gi|297743735|emb|CBI36618.3| unnamed protein product [Vitis vinifera]
Length = 279
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 71/174 (40%), Gaps = 42/174 (24%)
Query: 76 QHDRFEFGKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRR 135
Q + + V E + +++ W AA LR S ++ + +D+ KR AE+EF A+K R
Sbjct: 57 QTFKIDKSSLVVAETVSEDQLWAAACLRIRSFYQFGPSYG-IDDHKRYLAEREFEALKER 115
Query: 136 CRGL--NGQRHYCI----------------VAVKKDEGNVKRTVLKSVVGTLDLSIRYLL 177
G +R CI A K N + V V+GTLDL+
Sbjct: 116 VAGKREGFRRVSCINATIPLSEISSFSDDLCAACKFTHNGEDRV---VIGTLDLN----- 167
Query: 178 QGENFPGERVNPPLFGCINRRGPNRYG------YIANLCVAKSARRQGIASNML 225
Q + P E I P G Y++N+CVAK R G+ ++
Sbjct: 168 QCVSLPDE---------ITGMKPQGIGADFLRAYLSNVCVAKELHRNGLGYALV 212
>gi|168016797|ref|XP_001760935.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687944|gb|EDQ74324.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 176
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 53/118 (44%), Gaps = 25/118 (21%)
Query: 125 AEQEFNAIKRRCRGLNG--QRHYCIVAVKKDEGNVKRTVL---------------KSVVG 167
AE EF A+ + GL +R CI+A+ R+++ +VG
Sbjct: 6 AEDEFTALTSKVAGLEQGYKRIVCILALYPLSCLPDRSLVDLHPALKVTLANGEEHVLVG 65
Query: 168 TLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+LDLS QG+ PGE V G R GY++N+CVA R++GI +L
Sbjct: 66 SLDLS-----QGKVLPGEIVGSKPQGSSAER---ERGYLSNVCVAPLMRQRGIGVALL 115
>gi|384244775|gb|EIE18273.1| hypothetical protein COCSUDRAFT_60465 [Coccomyxa subellipsoidea
C-169]
Length = 163
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 40/73 (54%), Gaps = 9/73 (12%)
Query: 165 VVGTLDLSIRYLLQGENFPGER-VNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASN 223
VVGTLD++ QG N PG + G + R + GY++N+C K+ RR G+A
Sbjct: 47 VVGTLDIN-----QGVNLPGGHLIGLEPEGAVAR---IQRGYLSNICTQKAVRRTGVAMR 98
Query: 224 MLYFAVESAKSNG 236
++ A++ A S G
Sbjct: 99 LIRAAIDEAASAG 111
>gi|428207047|ref|YP_007091400.1| N-acetyltransferase GCN5 [Chroococcidiopsis thermalis PCC 7203]
gi|428008968|gb|AFY87531.1| GCN5-related N-acetyltransferase [Chroococcidiopsis thermalis PCC
7203]
Length = 228
Score = 41.6 bits (96), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 49/113 (43%), Gaps = 16/113 (14%)
Query: 129 FNAIKRRCRGLNGQRHYCIVAVKK-----DEGNVKRTVLKSVVGTLDLSIRYLLQGENFP 183
+ I+ R H C+VAV+K + G + GT++L++R + G
Sbjct: 71 YEDIRHRIHA-TAPHHVCLVAVEKTAQNENTGMYSHDSTLGIAGTVELALRTIPLGTTC- 128
Query: 184 GERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
+ R +Y Y++NL V + RRQGIA +L + AKS G
Sbjct: 129 ---------SFTSYRQGYQYPYLSNLAVHTTRRRQGIAGKLLLSCEQVAKSWG 172
>gi|307107002|gb|EFN55246.1| hypothetical protein CHLNCDRAFT_134568 [Chlorella variabilis]
Length = 237
Score = 41.6 bits (96), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 29/40 (72%)
Query: 99 AAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG 138
AAWLRAE+++E R+ R+ +F ++FA QE A+ RR +G
Sbjct: 37 AAWLRAEAYYEKRSVGRFKASFIKQFAGQELYALLRRTKG 76
>gi|145350958|ref|XP_001419858.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580090|gb|ABO98151.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 174
Score = 41.6 bits (96), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 45/85 (52%), Gaps = 12/85 (14%)
Query: 165 VVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNM 224
+VG+LD++I G+ P E L G R R Y++N+ VA +ARR G+AS M
Sbjct: 61 IVGSLDINI-----GDRLPAEE----LAGT---RSLGRRAYLSNVGVAPAARRTGVASRM 108
Query: 225 LYFAVESAKSNGKINQASVNKKHEN 249
+ A A+ ++ V+ +H+N
Sbjct: 109 IERASIIARDEYAVHTLYVHVQHDN 133
>gi|28812153|dbj|BAC65034.1| GCN5-related N-acetyltransferase-like [Oryza sativa Japonica Group]
gi|37806163|dbj|BAC99667.1| GCN5-related N-acetyltransferase-like [Oryza sativa Japonica Group]
gi|125560809|gb|EAZ06257.1| hypothetical protein OsI_28494 [Oryza sativa Indica Group]
gi|125602757|gb|EAZ42082.1| hypothetical protein OsJ_26642 [Oryza sativa Japonica Group]
Length = 291
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 73/158 (46%), Gaps = 23/158 (14%)
Query: 83 GKFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG--LN 140
G RE + +E A LR + +E T+ ++ ++ A++E++A++ R G +N
Sbjct: 76 GAVAVREFVTLDELRAAVRLRIRTFYEYATDSYGAEDLRKSLADREYDALQDRISGKMIN 135
Query: 141 GQRHYCI--------VAVKKDEGN-----VKRTVLKSVVGTLDLSIRYLLQGENFPGERV 187
QR CI V +E V+ + VVG+LDL+ + L + G+R
Sbjct: 136 FQRVSCINGTVPLLPSLVSAEELCSTCKFVEDGEERVVVGSLDLN-QCLWLPDELTGKRP 194
Query: 188 NPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+N R Y++N+CVAK +R G+ ++
Sbjct: 195 G------VNESSHTR-AYLSNVCVAKELQRNGLGYALV 225
>gi|428769044|ref|YP_007160834.1| N-acetyltransferase GCN5 [Cyanobacterium aponinum PCC 10605]
gi|428683323|gb|AFZ52790.1| GCN5-related N-acetyltransferase [Cyanobacterium aponinum PCC
10605]
Length = 185
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 24/99 (24%)
Query: 140 NGQRHYCIVAV--KKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINR 197
N + + C++AV EG V + +L GT++LS R EN+ +
Sbjct: 59 NEKENICLIAVFINNTEGKVNQEIL----GTVELSFR-----ENYRWAK----------- 98
Query: 198 RGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
+Y YIANL V ++ RRQGIAS +L + A+ N
Sbjct: 99 --KEKYAYIANLAVRENCRRQGIASQLLSRCEQIAQENN 135
>gi|332523534|ref|ZP_08399786.1| acetyltransferase, GNAT family [Streptococcus porcinus str.
Jelinkova 176]
gi|332314798|gb|EGJ27783.1| acetyltransferase, GNAT family [Streptococcus porcinus str.
Jelinkova 176]
Length = 168
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 198 RGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNGKINQASVNKKHEN 249
R P + YI +LC+A+ AR QGI +L FA + + +G N ++N H N
Sbjct: 95 RYPKKSLYIEDLCLAQEARHQGIGQKLLQFAEKYGRDHGAYN-VTLNVWHAN 145
>gi|359395787|ref|ZP_09188839.1| Acetyltransferase [Halomonas boliviensis LC1]
gi|357970052|gb|EHJ92499.1| Acetyltransferase [Halomonas boliviensis LC1]
Length = 143
Score = 40.0 bits (92), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 174 RYLLQ--GENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVES 231
+YLL+ G +F E V L G I + GY+ +L VA S RR GIA+ ++ +E+
Sbjct: 37 KYLLRNPGLSFVAE-VEGKLVGTIMAGHDGKRGYVQHLSVADSHRRMGIATELVNLCLEA 95
Query: 232 AKSNG 236
KS G
Sbjct: 96 LKSEG 100
>gi|291568630|dbj|BAI90902.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 210
Score = 40.0 bits (92), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 22/84 (26%)
Query: 146 CIVAVKKDE----GNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPN 201
C+VA D GN+ V + +VGT+++ +R +P G +
Sbjct: 81 CLVAAIADSPHKLGNIPLNVPEHLVGTVEMGLR-----------SPSPWSIG-------D 122
Query: 202 RYGYIANLCVAKSARRQGIASNML 225
RY YI+NL V +ARRQG+A +L
Sbjct: 123 RYPYISNLAVHPTARRQGVARELL 146
>gi|409993391|ref|ZP_11276533.1| acetyltransferase [Arthrospira platensis str. Paraca]
gi|409935717|gb|EKN77239.1| acetyltransferase [Arthrospira platensis str. Paraca]
Length = 205
Score = 40.0 bits (92), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 22/84 (26%)
Query: 146 CIVAVKKDE----GNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPN 201
C+VA D GN+ V + +VGT+++ +R +P G +
Sbjct: 76 CLVAAIADSPHKLGNIPLNVPEHLVGTVEMGLR-----------SPSPWSIG-------D 117
Query: 202 RYGYIANLCVAKSARRQGIASNML 225
RY YI+NL V +ARRQG+A +L
Sbjct: 118 RYPYISNLAVHPTARRQGVARELL 141
>gi|428773700|ref|YP_007165488.1| N-acetyltransferase GCN5 [Cyanobacterium stanieri PCC 7202]
gi|428687979|gb|AFZ47839.1| GCN5-related N-acetyltransferase [Cyanobacterium stanieri PCC 7202]
Length = 212
Score = 40.0 bits (92), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 22/97 (22%)
Query: 140 NGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRG 199
N + C++A+ T + +VGT+++S+R NP +G
Sbjct: 83 NNPHYCCLIAIIP----ASSTQPEQIVGTIEISLR-------------NPYGWG-----S 120
Query: 200 PNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
+Y YI+NL V K RRQGI S +L E A+S G
Sbjct: 121 KKKYPYISNLAVKKEFRRQGIGSQLLQKCEEIAQSWG 157
>gi|449508815|ref|XP_004163419.1| PREDICTED: uncharacterized protein LOC101228360 [Cucumis sativus]
Length = 288
Score = 40.0 bits (92), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 67/159 (42%), Gaps = 27/159 (16%)
Query: 84 KFVAREALLDEEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRG--LNG 141
KF E +E W AA LR + + + + + K+ AE EF A+K R G +
Sbjct: 74 KFRVSEGTSHDELWAAASLRVRTFNQLPPDSFGIHDHKKYLAEHEFEAMKERIAGKRVGF 133
Query: 142 QRHYCIVA-VKKDEGNVKRTVLKS------------VVGTLDLSIRYLLQGENFPGE--R 186
+R CI A + E + L S VVG+LD++ Q P E
Sbjct: 134 KRVSCINATLPLSEISTLAEDLCSTCKFSDNGEDRVVVGSLDIN-----QCVRLPDEITG 188
Query: 187 VNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNML 225
+ P G R Y++N+CVAK +R G+ ++
Sbjct: 189 MKPEGIGADFAR-----AYLSNVCVAKELQRNGLGYALI 222
>gi|172038596|ref|YP_001805097.1| hypothetical protein cce_3683 [Cyanothece sp. ATCC 51142]
gi|354554064|ref|ZP_08973369.1| GCN5-related N-acetyltransferase [Cyanothece sp. ATCC 51472]
gi|171700050|gb|ACB53031.1| hypothetical protein cce_3683 [Cyanothece sp. ATCC 51142]
gi|353553743|gb|EHC23134.1| GCN5-related N-acetyltransferase [Cyanothece sp. ATCC 51472]
Length = 218
Score = 39.7 bits (91), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 49/108 (45%), Gaps = 25/108 (23%)
Query: 129 FNAIKRRCRGLNGQRHYCIVAVKKDEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVN 188
+ ++ R RG + C VAV+K K T + V+GT++LS++ G
Sbjct: 82 YEDLRSRLRG-TIPYYCCFVAVEKTSTLTKTT--EKVIGTIELSLK---SG--------- 126
Query: 189 PPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
F C Y YI+NL V +S RRQGIA +L E A G
Sbjct: 127 ---FNC-------HYLYISNLAVIQSHRRQGIAKQLLQQCEEIASKWG 164
>gi|159476828|ref|XP_001696513.1| hypothetical protein CHLREDRAFT_142152 [Chlamydomonas reinhardtii]
gi|158282738|gb|EDP08490.1| predicted protein [Chlamydomonas reinhardtii]
Length = 283
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 191 LFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNGK 237
+ GC+ G Y Y++++CVA +ARR+G+A ++ A E A+ G+
Sbjct: 189 VLGCLPP-GTREYAYVSSMCVAPTARRRGVAQALMSAAEEQARLWGQ 234
>gi|308808185|ref|XP_003081403.1| GCN5-related N-acetyltransferase (ISS) [Ostreococcus tauri]
gi|116059865|emb|CAL55572.1| GCN5-related N-acetyltransferase (ISS) [Ostreococcus tauri]
Length = 269
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 12/89 (13%)
Query: 161 VLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGI 220
V+ V G+LDL++ GE P E L G ++ P + Y++N+ V RR+GI
Sbjct: 152 VMSIVAGSLDLNV-----GERLPAEE----LAGALS---PGKRAYLSNVSVLPPVRRRGI 199
Query: 221 ASNMLYFAVESAKSNGKINQASVNKKHEN 249
A M+ A++ A+ + V+ + N
Sbjct: 200 AFGMINRALDVARDEFGVETVYVHAEASN 228
>gi|389743003|gb|EIM84188.1| acyl-CoA N-acyltransferase [Stereum hirsutum FP-91666 SS1]
Length = 235
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 38/56 (67%), Gaps = 2/56 (3%)
Query: 194 CINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNGKINQASVNKKHEN 249
++R NR GYIA L V+KS R++GIAS ++ ++E+ KS+G + + + +H+N
Sbjct: 69 SMHRERTNR-GYIAMLSVSKSWRKRGIASALVRHSMEAMKSSG-VEEIVLETEHDN 122
>gi|428318351|ref|YP_007116233.1| GCN5-related N-acetyltransferase [Oscillatoria nigro-viridis PCC
7112]
gi|428242031|gb|AFZ07817.1| GCN5-related N-acetyltransferase [Oscillatoria nigro-viridis PCC
7112]
Length = 207
Score = 38.9 bits (89), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 53/112 (47%), Gaps = 22/112 (19%)
Query: 129 FNAIKRRCRGLNGQRHYCIVA--VKKDEG--NVKRTVLKSVVGTLDLSIRYLLQGENFPG 184
+ +K R R + + C+VA V ++EG N + + + GT+++++R FP
Sbjct: 61 YEDLKNRLRS-KAEHYICLVAELVSREEGTQNYRSHRAQRITGTVEMALR-----SRFPW 114
Query: 185 ERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
+ N + Y Y++NL V RRQG+A +L E+A+ G
Sbjct: 115 QIPN------------SDYPYLSNLAVHPEYRRQGVAQQLLSNCEETAREWG 154
>gi|384252882|gb|EIE26357.1| acyl-CoA N-acyltransferase [Coccomyxa subellipsoidea C-169]
Length = 306
Score = 38.5 bits (88), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 2/60 (3%)
Query: 177 LQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNG 236
+QG++ P ER+ F R P YI+NL V+ +ARR G+ +L A + AK G
Sbjct: 198 MQGQHMPAERIEKGPFVAYQR--PEDVAYISNLAVSPAARRLGVGEELLTAAEKVAKDWG 255
>gi|159481971|ref|XP_001699048.1| hypothetical protein CHLREDRAFT_152106 [Chlamydomonas reinhardtii]
gi|158273311|gb|EDO99102.1| predicted protein [Chlamydomonas reinhardtii]
Length = 213
Score = 38.5 bits (88), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 94 EEYWTAAWLRAESHWEGRTNERYVDNFKRKFAEQEFNAIKRRCRGLNGQ-RHYCIVAVKK 152
+E A LRA++++ R+V + K+KF EQE ++++R + Q +C V
Sbjct: 23 DELRQVATLRADAYY-AENFSRFVGSLKKKFVEQEVESLQQRTYATSKQGTPFCECLVAV 81
Query: 153 DEGNVKRTVLKSVVGTLDLSIRYLLQGENFPGERVNPPLFGCINRRGPNRYGYIANLCVA 212
+ G+ +V+G +DL + L G + P GC Y+ N+ V
Sbjct: 82 EAGS------STVLGCIDLRLPAALNGAH-PAGVPAGDTAGC----------YLLNVVVR 124
Query: 213 KSARRQGIASNML 225
+ R QG+ ++
Sbjct: 125 EDVRGQGLGRAIM 137
>gi|321258346|ref|XP_003193896.1| hypothetical protein CGB_D8720W [Cryptococcus gattii WM276]
gi|317460366|gb|ADV22109.1| hypothetical protein CNJ00870 [Cryptococcus gattii WM276]
Length = 244
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 198 RGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNGKINQASVNKKHEN 249
RG NR GYI L VAK RR+GI ++ AVE G Q + +H+N
Sbjct: 124 RGTNR-GYIGMLSVAKDYRRRGIGRKLVEIAVEEMAKRGA-KQVMLETEHDN 173
>gi|399909570|ref|ZP_10778122.1| hypothetical protein HKM-1_08860 [Halomonas sp. KM-1]
Length = 145
Score = 38.1 bits (87), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 169 LDLSIRYLLQGENFPG-ERVNPPLFGCINRRGPNRYGYIANLCVAKSARRQGIASNMLYF 227
+D IR L+ +F G R N L G + + +++ LCV K+ RR+G+ + +L
Sbjct: 39 VDTFIREHLEAGHFFGCARFNDRLLGAVAVIADEQAWWLSELCVRKATRRRGVGTRLLAL 98
Query: 228 AVESAKSNGK 237
++A+S G+
Sbjct: 99 VSDAARSEGR 108
>gi|225872570|ref|YP_002754025.1| GNAT family acetyltransferase [Acidobacterium capsulatum ATCC
51196]
gi|225792052|gb|ACO32142.1| acetyltransferase, GNAT family [Acidobacterium capsulatum ATCC
51196]
Length = 152
Score = 38.1 bits (87), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 24/34 (70%), Gaps = 1/34 (2%)
Query: 197 RRGPNRYGYIANLCVAKSARRQGIASNMLYFAVE 230
RRG YGY+ + VA+SARRQG+ +L FAVE
Sbjct: 65 RRGELHYGYVVTIDVAESARRQGVGRALL-FAVE 97
>gi|411119818|ref|ZP_11392194.1| sortase-like acyltransferase [Oscillatoriales cyanobacterium
JSC-12]
gi|410709974|gb|EKQ67485.1| sortase-like acyltransferase [Oscillatoriales cyanobacterium
JSC-12]
Length = 146
Score = 38.1 bits (87), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 34/71 (47%), Gaps = 8/71 (11%)
Query: 180 ENFPGERVNPPLFGCINRR--------GPNRYGYIANLCVAKSARRQGIASNMLYFAVES 231
E F +R N L G + R NR Y+ L + + RRQGIAS +L A +
Sbjct: 44 ETFVHDRGNGKLGGFLETRIRDYAEGCDGNRVAYVEGLYIDEDLRRQGIASQLLRAAEQW 103
Query: 232 AKSNGKINQAS 242
A+S G + AS
Sbjct: 104 ARSQGIVEMAS 114
>gi|412990748|emb|CCO18120.1| predicted protein [Bathycoccus prasinos]
Length = 266
Score = 37.7 bits (86), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 12/88 (13%)
Query: 165 VVGTLDLSIRYLLQGENFPGERV--NPPLFGCINRRGPNRYGYIANLCVAKSARRQGIAS 222
V GTLDL++ G N P E + P+ G N R + Y++N+CV +S R GIA
Sbjct: 152 VCGTLDLNV-----GVNLPAEELVGKFPIEG--NAR--KKRCYMSNVCVLESRRNLGIAR 202
Query: 223 NMLYFAVESAKSNGKINQASVNKKHENF 250
++ A+E AK N + + V+ EN
Sbjct: 203 QLIERAIEDAK-NINVEEIYVHVVSENI 229
>gi|352102711|ref|ZP_08959348.1| hypothetical protein HAL1_08609 [Halomonas sp. HAL1]
gi|350599884|gb|EHA15966.1| hypothetical protein HAL1_08609 [Halomonas sp. HAL1]
Length = 160
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 191 LFGCIN-RRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNGKI 238
L G + + G +R +++ LCV K+ RR+G+ S ++ E AK G++
Sbjct: 63 LLGAVAVQEGEDRAWWLSELCVRKTTRRRGVGSRLMALLGEQAKQEGRV 111
>gi|448748360|ref|ZP_21729997.1| Acyl-CoA N-acyltransferase [Halomonas titanicae BH1]
gi|445564062|gb|ELY20192.1| Acyl-CoA N-acyltransferase [Halomonas titanicae BH1]
Length = 160
Score = 37.4 bits (85), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 191 LFGCIN-RRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNGKI 238
L G + + G +R +++ LCV K+ RR+G+ S ++ E AK G++
Sbjct: 63 LLGAVAVQEGADRAWWLSELCVRKTTRRRGVGSRLMALLGEQAKQEGRV 111
>gi|359396308|ref|ZP_09189360.1| hypothetical protein KUC_2982 [Halomonas boliviensis LC1]
gi|357970573|gb|EHJ93020.1| hypothetical protein KUC_2982 [Halomonas boliviensis LC1]
Length = 160
Score = 37.0 bits (84), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 191 LFGCIN-RRGPNRYGYIANLCVAKSARRQGIASNMLYFAVESAKSNGKI 238
L G + + G +R +++ LCV K+ RR+G+ S ++ E AK G++
Sbjct: 63 LLGAVAVQEGEDRAWWLSELCVRKTTRRRGVGSRLMALLGEQAKQEGRV 111
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.135 0.405
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,808,446,382
Number of Sequences: 23463169
Number of extensions: 148107188
Number of successful extensions: 289176
Number of sequences better than 100.0: 76
Number of HSP's better than 100.0 without gapping: 39
Number of HSP's successfully gapped in prelim test: 37
Number of HSP's that attempted gapping in prelim test: 289077
Number of HSP's gapped (non-prelim): 80
length of query: 253
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 114
effective length of database: 9,097,814,876
effective search space: 1037150895864
effective search space used: 1037150895864
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)